Query psy8499
Match_columns 591
No_of_seqs 750 out of 4285
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 20:29:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8499.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8499hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 99.9 3.8E-28 8.3E-33 243.6 3.8 269 322-590 353-932 (958)
2 KOG3608|consensus 99.9 1.6E-26 3.5E-31 209.3 12.9 213 377-590 135-377 (467)
3 KOG2462|consensus 99.9 1.6E-26 3.4E-31 205.0 5.6 128 431-559 131-264 (279)
4 KOG1074|consensus 99.9 3.8E-27 8.3E-32 236.5 1.7 214 351-564 607-933 (958)
5 KOG3608|consensus 99.9 3.6E-25 7.9E-30 200.6 13.2 178 382-560 185-374 (467)
6 KOG2462|consensus 99.9 2.2E-26 4.9E-31 204.0 4.9 130 456-586 129-264 (279)
7 KOG3623|consensus 99.8 2E-21 4.4E-26 192.0 6.6 102 350-451 211-330 (1007)
8 KOG3623|consensus 99.8 2.6E-21 5.6E-26 191.3 5.9 102 269-370 211-330 (1007)
9 KOG3576|consensus 99.6 1E-16 2.2E-21 134.7 1.6 106 457-562 117-236 (267)
10 KOG3576|consensus 99.6 4E-16 8.8E-21 131.1 2.3 107 404-510 118-238 (267)
11 PLN03086 PRLI-interacting fact 99.2 3.6E-11 7.8E-16 121.8 7.4 144 432-590 409-565 (567)
12 PLN03086 PRLI-interacting fact 99.1 2.5E-10 5.5E-15 115.8 7.7 59 432-494 455-514 (567)
13 PHA00733 hypothetical protein 98.9 4.8E-10 1E-14 92.4 3.3 78 513-591 42-125 (128)
14 PHA02768 hypothetical protein; 98.9 1E-09 2.2E-14 73.2 2.1 43 539-582 6-48 (55)
15 PHA00733 hypothetical protein 98.9 3.1E-09 6.7E-14 87.6 5.4 50 431-481 74-123 (128)
16 KOG3993|consensus 98.7 2E-09 4.3E-14 101.6 -0.3 185 376-560 267-480 (500)
17 PHA02768 hypothetical protein; 98.7 1.2E-08 2.6E-13 68.1 2.1 40 458-498 6-45 (55)
18 KOG3993|consensus 98.6 4.9E-09 1.1E-13 99.0 -1.4 184 404-587 268-480 (500)
19 PF13465 zf-H2C2_2: Zinc-finge 98.3 2.9E-07 6.2E-12 52.4 1.9 25 553-577 1-26 (26)
20 PHA00732 hypothetical protein 98.2 1.2E-06 2.6E-11 65.0 3.2 45 513-562 3-48 (79)
21 PF13465 zf-H2C2_2: Zinc-finge 98.2 1.2E-06 2.7E-11 49.8 2.1 24 445-468 1-25 (26)
22 PHA00732 hypothetical protein 98.0 3E-06 6.5E-11 62.9 2.6 45 457-506 1-46 (79)
23 PF01352 KRAB: KRAB box; Inte 98.0 6.4E-07 1.4E-11 56.8 -1.5 33 42-74 8-41 (41)
24 PF05605 zf-Di19: Drought indu 97.9 1.4E-05 3.1E-10 55.0 3.6 50 538-590 2-54 (54)
25 PHA00616 hypothetical protein 97.9 4.4E-06 9.6E-11 53.0 0.7 26 565-590 1-26 (44)
26 PHA00616 hypothetical protein 97.8 8.2E-06 1.8E-10 51.8 1.6 28 457-484 1-28 (44)
27 PF00096 zf-C2H2: Zinc finger, 97.8 6.1E-06 1.3E-10 45.7 0.9 23 566-588 1-23 (23)
28 PF12756 zf-C2H2_2: C2H2 type 97.7 1.3E-05 2.8E-10 64.1 1.6 73 486-561 1-73 (100)
29 PF00096 zf-C2H2: Zinc finger, 97.7 2.5E-05 5.4E-10 43.2 1.8 22 458-479 1-22 (23)
30 PF05605 zf-Di19: Drought indu 97.7 8.4E-05 1.8E-09 51.2 4.6 20 458-478 3-22 (54)
31 PF12756 zf-C2H2_2: C2H2 type 97.6 2.5E-05 5.4E-10 62.5 1.5 73 459-534 1-73 (100)
32 PF13894 zf-C2H2_4: C2H2-type 97.6 2.7E-05 5.7E-10 43.6 1.1 24 566-589 1-24 (24)
33 PF13912 zf-C2H2_6: C2H2-type 97.5 5.8E-05 1.3E-09 43.6 1.3 26 565-590 1-26 (27)
34 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00018 3.8E-09 40.1 2.0 23 458-480 1-23 (24)
35 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00024 5.2E-09 40.9 1.4 25 457-481 1-25 (27)
36 PF13909 zf-H2C2_5: C2H2-type 96.8 0.0005 1.1E-08 38.3 0.9 24 566-590 1-24 (24)
37 smart00355 ZnF_C2H2 zinc finge 96.8 0.00079 1.7E-08 38.2 1.6 24 566-589 1-24 (26)
38 smart00355 ZnF_C2H2 zinc finge 96.2 0.0037 8E-08 35.3 2.1 19 487-505 3-21 (26)
39 KOG2231|consensus 96.2 0.0079 1.7E-07 62.9 5.7 116 404-535 100-236 (669)
40 PF12874 zf-met: Zinc-finger o 96.2 0.0022 4.8E-08 36.0 1.0 23 566-588 1-23 (25)
41 PF09237 GAGA: GAGA factor; I 96.0 0.0032 7E-08 41.0 1.3 26 565-590 24-49 (54)
42 COG5189 SFP1 Putative transcri 96.0 0.0022 4.7E-08 59.2 0.7 24 455-478 347-372 (423)
43 PF13909 zf-H2C2_5: C2H2-type 95.9 0.0044 9.5E-08 34.4 1.5 23 458-481 1-23 (24)
44 COG5189 SFP1 Putative transcri 95.6 0.005 1.1E-07 56.9 1.4 63 513-586 351-419 (423)
45 PF12874 zf-met: Zinc-finger o 95.6 0.0067 1.4E-07 34.0 1.3 20 459-478 2-21 (25)
46 PF12171 zf-C2H2_jaz: Zinc-fin 95.6 0.0055 1.2E-07 35.1 0.9 22 566-587 2-23 (27)
47 PF09237 GAGA: GAGA factor; I 95.5 0.0091 2E-07 38.9 1.8 18 514-531 27-44 (54)
48 KOG2231|consensus 95.4 0.031 6.8E-07 58.6 6.4 50 351-400 184-236 (669)
49 COG5236 Uncharacterized conser 95.0 0.018 3.9E-07 53.9 2.9 123 431-560 152-303 (493)
50 COG5236 Uncharacterized conser 94.4 0.045 9.8E-07 51.4 3.9 79 106-185 151-245 (493)
51 KOG2785|consensus 94.2 0.11 2.5E-06 49.9 6.2 22 404-425 4-25 (390)
52 KOG2482|consensus 94.1 0.11 2.4E-06 48.9 5.7 64 416-479 128-217 (423)
53 KOG2482|consensus 93.9 0.19 4.2E-06 47.3 7.0 74 243-317 281-356 (423)
54 KOG2785|consensus 93.8 0.2 4.4E-06 48.3 7.1 76 242-317 4-90 (390)
55 PF12171 zf-C2H2_jaz: Zinc-fin 93.7 0.022 4.7E-07 32.6 0.4 21 296-316 2-22 (27)
56 PF13913 zf-C2HC_2: zinc-finge 92.9 0.061 1.3E-06 30.0 1.2 19 567-586 4-22 (25)
57 PRK04860 hypothetical protein; 92.8 0.051 1.1E-06 46.8 1.3 36 457-496 119-155 (160)
58 PRK04860 hypothetical protein; 92.5 0.057 1.2E-06 46.5 1.2 38 539-580 120-158 (160)
59 PF13913 zf-C2HC_2: zinc-finge 92.0 0.12 2.6E-06 28.9 1.8 19 486-505 4-22 (25)
60 smart00451 ZnF_U1 U1-like zinc 91.3 0.11 2.4E-06 31.8 1.2 23 565-587 3-25 (35)
61 PF01352 KRAB: KRAB box; Inte 90.9 0.046 1E-06 34.8 -0.8 14 2-15 28-41 (41)
62 KOG4173|consensus 90.1 0.43 9.4E-06 41.6 4.1 47 107-155 80-128 (253)
63 KOG1146|consensus 88.8 0.13 2.8E-06 57.6 0.1 26 374-399 463-488 (1406)
64 COG5048 FOG: Zn-finger [Genera 88.7 0.14 2.9E-06 53.4 0.1 152 431-582 290-462 (467)
65 smart00451 ZnF_U1 U1-like zinc 88.6 0.27 5.7E-06 30.1 1.4 22 457-478 3-24 (35)
66 KOG2893|consensus 87.6 0.21 4.5E-06 44.4 0.6 47 431-480 11-58 (341)
67 KOG2893|consensus 87.1 0.19 4.2E-06 44.6 0.1 42 243-287 12-53 (341)
68 KOG1146|consensus 86.5 0.19 4.1E-06 56.3 -0.3 24 564-587 517-540 (1406)
69 COG5048 FOG: Zn-finger [Genera 85.1 0.2 4.4E-06 52.1 -1.0 55 104-158 31-88 (467)
70 TIGR00622 ssl1 transcription f 82.6 1.6 3.4E-05 34.7 3.3 34 106-152 1-34 (112)
71 KOG4173|consensus 80.7 0.89 1.9E-05 39.7 1.4 45 458-504 80-126 (253)
72 cd00350 rubredoxin_like Rubred 79.1 1.1 2.5E-05 26.9 1.2 24 107-141 2-25 (33)
73 PF09538 FYDLN_acid: Protein o 77.1 1.3 2.9E-05 35.2 1.4 13 457-469 26-38 (108)
74 PF12013 DUF3505: Protein of u 75.5 4 8.7E-05 32.8 3.8 20 405-425 13-32 (109)
75 PF09538 FYDLN_acid: Protein o 75.2 1.8 3.9E-05 34.5 1.6 33 536-578 7-39 (108)
76 TIGR00622 ssl1 transcription f 73.8 4.4 9.6E-05 32.3 3.4 49 432-481 57-105 (112)
77 PF10571 UPF0547: Uncharacteri 72.1 2 4.3E-05 24.2 0.9 8 460-467 17-24 (26)
78 KOG2186|consensus 71.2 2.5 5.3E-05 38.5 1.7 45 431-477 4-48 (276)
79 COG2888 Predicted Zn-ribbon RN 69.8 2.8 6.1E-05 28.7 1.3 31 431-465 28-58 (61)
80 COG4049 Uncharacterized protei 68.3 1.8 4E-05 28.9 0.2 21 566-586 18-38 (65)
81 KOG2186|consensus 67.3 3.2 7E-05 37.8 1.6 46 377-424 4-49 (276)
82 COG4049 Uncharacterized protei 66.0 2.8 6E-05 28.1 0.7 26 105-130 16-41 (65)
83 TIGR02300 FYDLN_acid conserved 62.0 4.5 9.7E-05 32.8 1.3 14 428-441 7-20 (129)
84 cd00729 rubredoxin_SM Rubredox 62.0 4.9 0.00011 24.4 1.2 9 458-466 3-11 (34)
85 TIGR02098 MJ0042_CXXC MJ0042 f 61.8 2.7 5.8E-05 26.2 0.1 8 460-467 5-12 (38)
86 smart00834 CxxC_CXXC_SSSS Puta 61.6 2.8 6.1E-05 26.5 0.1 11 458-468 6-16 (41)
87 PF12013 DUF3505: Protein of u 60.6 4.7 0.0001 32.4 1.3 25 566-590 81-109 (109)
88 PF09986 DUF2225: Uncharacteri 56.9 2.6 5.6E-05 38.6 -0.9 18 457-474 5-22 (214)
89 TIGR02605 CxxC_CxxC_SSSS putat 55.1 4.3 9.2E-05 27.4 0.2 11 458-468 6-16 (52)
90 PF13717 zinc_ribbon_4: zinc-r 54.6 4.5 9.8E-05 24.9 0.2 8 433-440 5-12 (36)
91 smart00614 ZnF_BED BED zinc fi 54.5 8.1 0.00018 25.8 1.4 11 540-550 20-30 (50)
92 PF05443 ROS_MUCR: ROS/MUCR tr 53.2 6.7 0.00015 32.5 1.0 21 567-590 74-94 (132)
93 smart00659 RPOLCX RNA polymera 52.6 7.6 0.00016 25.2 1.0 11 458-468 3-13 (44)
94 smart00734 ZnF_Rad18 Rad18-lik 52.0 11 0.00024 21.2 1.5 19 540-559 3-21 (26)
95 PF09986 DUF2225: Uncharacteri 51.2 3.8 8.2E-05 37.6 -0.8 20 485-504 6-25 (214)
96 TIGR02300 FYDLN_acid conserved 50.6 9.9 0.00021 30.9 1.5 33 347-389 7-39 (129)
97 PF09723 Zn-ribbon_8: Zinc rib 50.3 5 0.00011 25.7 -0.1 13 458-470 6-18 (42)
98 TIGR00373 conserved hypothetic 50.2 9.1 0.0002 33.1 1.5 17 106-122 109-125 (158)
99 PF02892 zf-BED: BED zinc fing 49.7 9 0.0002 24.8 1.1 20 566-585 17-40 (45)
100 TIGR00373 conserved hypothetic 49.2 9.2 0.0002 33.1 1.3 16 431-446 110-125 (158)
101 PF08274 PhnA_Zn_Ribbon: PhnA 49.0 7.3 0.00016 22.8 0.4 26 539-574 3-28 (30)
102 smart00531 TFIIE Transcription 48.4 6.4 0.00014 33.6 0.2 33 539-575 100-133 (147)
103 smart00531 TFIIE Transcription 48.3 9.7 0.00021 32.5 1.3 35 105-142 98-132 (147)
104 PF13719 zinc_ribbon_5: zinc-r 47.4 11 0.00023 23.4 1.0 10 188-197 26-35 (37)
105 PF15269 zf-C2H2_7: Zinc-finge 47.3 14 0.0003 23.6 1.5 23 187-209 20-42 (54)
106 PRK00398 rpoP DNA-directed RNA 46.2 6 0.00013 25.9 -0.2 10 458-467 4-13 (46)
107 PRK14890 putative Zn-ribbon RN 46.0 12 0.00026 25.8 1.2 32 538-574 25-57 (59)
108 COG1997 RPL43A Ribosomal prote 45.9 8.5 0.00018 28.8 0.5 11 458-468 54-64 (89)
109 PRK00464 nrdR transcriptional 45.3 3.8 8.1E-05 35.1 -1.6 15 458-472 29-43 (154)
110 PRK06266 transcription initiat 44.6 11 0.00023 33.4 1.0 16 106-121 117-132 (178)
111 PRK06266 transcription initiat 44.4 12 0.00025 33.2 1.2 14 431-444 118-131 (178)
112 PF14353 CpXC: CpXC protein 44.1 17 0.00038 30.1 2.2 15 457-471 38-52 (128)
113 PF03604 DNA_RNApol_7kD: DNA d 43.7 17 0.00037 21.7 1.4 11 107-117 1-11 (32)
114 PRK09678 DNA-binding transcrip 39.6 6 0.00013 28.8 -1.1 13 566-578 28-42 (72)
115 COG1996 RPC10 DNA-directed RNA 38.7 14 0.0003 24.5 0.6 10 458-467 7-16 (49)
116 PF06524 NOA36: NOA36 protein; 38.6 11 0.00024 34.5 0.2 38 157-195 139-179 (314)
117 PRK04023 DNA polymerase II lar 38.2 24 0.00051 39.5 2.6 9 377-385 627-635 (1121)
118 KOG2807|consensus 38.0 51 0.0011 31.6 4.3 34 105-151 275-308 (378)
119 COG0068 HypF Hydrogenase matur 36.3 6.7 0.00014 41.8 -1.8 56 486-546 125-181 (750)
120 PHA00626 hypothetical protein 35.2 17 0.00037 24.6 0.6 15 564-578 22-36 (59)
121 KOG4167|consensus 34.2 12 0.00026 39.7 -0.3 24 458-481 793-816 (907)
122 PF05443 ROS_MUCR: ROS/MUCR tr 34.0 20 0.00044 29.7 1.0 23 486-511 74-96 (132)
123 PF07754 DUF1610: Domain of un 33.4 22 0.00047 19.6 0.7 10 564-573 15-24 (24)
124 COG5151 SSL1 RNA polymerase II 32.7 28 0.00061 32.9 1.8 24 187-210 388-411 (421)
125 KOG4167|consensus 32.3 10 0.00022 40.3 -1.2 24 566-589 793-816 (907)
126 COG1592 Rubrerythrin [Energy p 31.1 25 0.00055 30.5 1.2 24 106-141 134-157 (166)
127 PRK00432 30S ribosomal protein 29.9 26 0.00056 23.5 0.8 12 564-575 36-47 (50)
128 PRK04023 DNA polymerase II lar 29.7 42 0.0009 37.7 2.7 9 296-304 627-635 (1121)
129 PF14446 Prok-RING_1: Prokaryo 29.3 25 0.00054 23.9 0.6 27 106-143 5-31 (54)
130 COG4957 Predicted transcriptio 28.5 27 0.00058 28.6 0.8 20 568-590 79-98 (148)
131 COG1198 PriA Primosomal protei 28.4 30 0.00065 38.0 1.5 44 487-572 438-482 (730)
132 KOG2593|consensus 28.0 44 0.00095 33.6 2.3 34 539-574 129-162 (436)
133 PF06524 NOA36: NOA36 protein; 26.6 17 0.00036 33.5 -0.7 78 134-222 98-179 (314)
134 PF09845 DUF2072: Zn-ribbon co 26.2 32 0.00069 28.3 0.9 14 376-389 1-14 (131)
135 TIGR01206 lysW lysine biosynth 25.0 32 0.0007 23.5 0.6 30 539-575 3-32 (54)
136 PF04959 ARS2: Arsenite-resist 24.7 23 0.00049 32.3 -0.2 25 565-589 77-101 (214)
137 PF02176 zf-TRAF: TRAF-type zi 23.4 27 0.00059 24.2 0.0 27 551-577 23-54 (60)
138 KOG2593|consensus 22.7 44 0.00095 33.6 1.3 34 133-168 128-161 (436)
139 smart00349 KRAB krueppel assoc 22.4 37 0.00081 21.4 0.6 18 5-22 31-48 (61)
140 PF01363 FYVE: FYVE zinc finge 22.3 41 0.00088 24.1 0.8 9 513-521 11-19 (69)
141 PF04959 ARS2: Arsenite-resist 21.3 67 0.0014 29.3 2.1 29 374-402 75-103 (214)
142 PF13878 zf-C2H2_3: zinc-finge 20.8 62 0.0013 20.6 1.3 23 566-588 14-38 (41)
143 PF08790 zf-LYAR: LYAR-type C2 20.5 35 0.00076 19.7 0.1 19 566-585 1-19 (28)
144 COG3364 Zn-ribbon containing p 20.1 63 0.0014 25.1 1.4 13 376-388 2-14 (112)
145 COG4530 Uncharacterized protei 20.1 81 0.0018 24.8 2.0 31 347-387 7-37 (129)
146 PF05290 Baculo_IE-1: Baculovi 20.0 71 0.0015 26.3 1.7 12 486-497 123-134 (140)
No 1
>KOG1074|consensus
Probab=99.94 E-value=3.8e-28 Score=243.62 Aligned_cols=269 Identities=28% Similarity=0.588 Sum_probs=181.3
Q ss_pred CCcccccccccCChhHHHHHHhhhcCCC-CCCCccccccCChHHHHHHHHhcCCcc-------------ccccccccccC
Q psy8499 322 KNKCVLCSKEFSSKSYLTEHMKLHQGIK-NKCDLCSKEFSCKRYVTEHMKLHQEIT-------------YKCDLCLKEYS 387 (591)
Q Consensus 322 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~~-------------~~C~~C~~~f~ 387 (591)
+.+|.+|.+.|.+.+.|..|++.|+|++ |+|.+||..|.++.+|..|...|.++- +.|.+|.-.+.
T Consensus 353 khkCr~CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~e~~p~~~m~p~~~~e~l~~~i~st~~p 432 (958)
T KOG1074|consen 353 KHKCRFCAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHREKYPHVQMNPHPVQEHLQYVITSTGLP 432 (958)
T ss_pred cchhhhhHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeeccccCCccccCCCCchhhhcceeeccccC
Confidence 4589999999999999999999999998 799999999999999999998887542 45666666666
Q ss_pred CHHHHHHHHHHhC---------------CC--------------------------------------------------
Q psy8499 388 NKIYLTEHLKLHQ---------------GI-------------------------------------------------- 402 (591)
Q Consensus 388 ~~~~l~~H~~~h~---------------~~-------------------------------------------------- 402 (591)
.-.....+....+ +.
T Consensus 433 ~g~~vpp~k~~~~~~~~e~~~~~~sts~g~~~~~~~~~sv~~~~ts~~~~~~~s~~~~~~~~~i~~~s~e~e~~vs~g~~ 512 (958)
T KOG1074|consen 433 YGPSVPPEKAEEEAATVEPKLLVRSTSVGSATESLTPSSVSFGETSAPPLPAFSKFVLMKTVEIKSKSEEPEPAVSEGSA 512 (958)
T ss_pred CCCCCCCCCCcchhccccccccccccccCCCCCcccccccccccccCCCCCccccccccCCcccccccCCCCcccccccc
Confidence 5433332220000 00
Q ss_pred --------------------------------------------------------------------------------
Q psy8499 403 -------------------------------------------------------------------------------- 402 (591)
Q Consensus 403 -------------------------------------------------------------------------------- 402 (591)
T Consensus 513 ~~~~~~gs~l~~s~~ks~~s~~~~~~~~~~~asa~m~~~~~~~~p~g~s~~~~aq~~~l~d~~~~~~~~~~tsseS~kl~ 592 (958)
T KOG1074|consen 513 ISGVLEGSPLRMSSGKSVESLPVEADLLNHAASAGMFPPSYVSRPLGPSEDTTAQALQLVDKIPEALIEISTSSESPKLT 592 (958)
T ss_pred ccccccCCccccccccCccccchhccccchhhccccCCchhhcCCCCcchhhHHHhhhhhccChhhcceeecccCCcccc
Confidence
Q ss_pred ------------CCccccchhccCCchhHHHHHHHhCCCC-CCccccccccCChhhHHHHHHhhCCc-----ccccc---
Q psy8499 403 ------------KNKCVLCSKEFSRKRRLTEHMKLHLGIK-NKCNLCSKEFSCKSHLKEHMQVHQGI-----TYKCN--- 461 (591)
Q Consensus 403 ------------~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~-----~~~C~--- 461 (591)
.++|.+|-+...-+++|+.|.++|+|++ |+|.+||+.|+++.+|+.|+-+|..+ .|.|+
T Consensus 593 slv~~~~~~~TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ 672 (958)
T KOG1074|consen 593 SLVENSENKRTDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTF 672 (958)
T ss_pred ccccccccccCCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchh
Confidence 0006666666666666666666666666 56666666666666666666666542 36666
Q ss_pred cccccccchhhHHHHHHhhCCCC--------------CCCccCccccCChhhHHHHHhhhCC----------------CC
Q psy8499 462 LCSKEFYRKQNLTKHMKLHQGIK--------------KKCDLCSKEFSTKCSLTEHMKLHQG----------------IK 511 (591)
Q Consensus 462 ~C~~~f~~~~~L~~H~~~h~~~~--------------~~C~~C~~~f~~~~~L~~H~~~h~~----------------~~ 511 (591)
+|.+.|...-.|..|+++|.+.. -+|..|.+.|.....+..++..|.+ +.
T Consensus 673 ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~ 752 (958)
T KOG1074|consen 673 ICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEEL 752 (958)
T ss_pred hhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCccccccccccccc
Confidence 66666666666666666665211 1466666666666666666544411 00
Q ss_pred -----CCCCcchhcccCchhHHHHHHhh-----------------------cC---------------------------
Q psy8499 512 -----KKCDLCSKEFSCKRSVTEHMKLH-----------------------LG--------------------------- 536 (591)
Q Consensus 512 -----~~C~~C~~~f~~~~~l~~H~~~h-----------------------~~--------------------------- 536 (591)
+.+..|+..+.....+..+-..+ ++
T Consensus 753 ~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~ 832 (958)
T KOG1074|consen 753 DVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQ 832 (958)
T ss_pred ccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcc
Confidence 13444554444433332221110 00
Q ss_pred ---------------------------------------------CCCCCCcchhhcCChhhHHHHhhhcCCc-ccccCc
Q psy8499 537 ---------------------------------------------IKNKCDLCSKEFSTKHTLTEHMKLHQGI-RYKCDL 570 (591)
Q Consensus 537 ---------------------------------------------~~~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~C~~ 570 (591)
.+..|.+||+.|.+..+|..|||+|++. ||.|.+
T Consensus 833 ~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~f 912 (958)
T KOG1074|consen 833 ETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHF 912 (958)
T ss_pred cccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchh
Confidence 0124999999999999999999999985 999999
Q ss_pred CccccccchhHHhHhhhccC
Q psy8499 571 CTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 571 C~~~f~~~~~l~~H~~~H~~ 590 (591)
|+++|.++.+|+.||.+|+.
T Consensus 913 C~~aFttrgnLKvHMgtH~w 932 (958)
T KOG1074|consen 913 CEEAFTTRGNLKVHMGTHMW 932 (958)
T ss_pred hhhhhhhhhhhhhhhccccc
Confidence 99999999999999999985
No 2
>KOG3608|consensus
Probab=99.94 E-value=1.6e-26 Score=209.29 Aligned_cols=213 Identities=27% Similarity=0.562 Sum_probs=184.0
Q ss_pred cccc--ccccccCCHHHHHHHHHHhCCC-----------C---Cccc--cchhccCCchhHHHHHHHhCCCC-CCccccc
Q psy8499 377 YKCD--LCLKEYSNKIYLTEHLKLHQGI-----------K---NKCV--LCSKEFSRKRRLTEHMKLHLGIK-NKCNLCS 437 (591)
Q Consensus 377 ~~C~--~C~~~f~~~~~l~~H~~~h~~~-----------~---~~C~--~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~ 437 (591)
|.|. .|+..|.+...+..|+..|.-. + ..|. .|.+.|.++..|++|++.|++++ ..|+.||
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 5665 4899999999999998776421 1 2353 58899999999999999999988 6899999
Q ss_pred cccCChhhHHHHHHhhCC---cccccccccccccchhhHHHHHHhhCCCCCCCccCccccCChhhHHHHHhh-hCCCCC-
Q psy8499 438 KEFSCKSHLKEHMQVHQG---ITYKCNLCSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKCSLTEHMKL-HQGIKK- 512 (591)
Q Consensus 438 ~~f~~~~~l~~H~~~h~~---~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~-h~~~~~- 512 (591)
.-|+++..|..|.+.-+. .+|.|..|.|.|.+...|..|+..|-. -|+|+.|+.+....++|.+||+. |...+|
T Consensus 215 ~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-~ykCplCdmtc~~~ssL~~H~r~rHs~dkpf 293 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-CYKCPLCDMTCSSASSLTTHIRYRHSKDKPF 293 (467)
T ss_pred HHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh-cccccccccCCCChHHHHHHHHhhhccCCCc
Confidence 999999999999887665 689999999999999999999998865 38999999999999999999886 555566
Q ss_pred CCCcchhcccCchhHHHHHHhhcCCCCCCCc--chhhcCChhhHHHHhhhcC-Cc---ccccCcCccccccchhHHhHhh
Q psy8499 513 KCDLCSKEFSCKRSVTEHMKLHLGIKNKCDL--CSKEFSTKHTLTEHMKLHQ-GI---RYKCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 513 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~~---~~~C~~C~~~f~~~~~l~~H~~ 586 (591)
+|+.|++.|.+.++|.+|..+|+...|.|+. |.++|.+...|+.|++.++ |. +|.|..|++.|++-.+|..|+.
T Consensus 294 KCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~ 373 (467)
T KOG3608|consen 294 KCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLM 373 (467)
T ss_pred cccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHH
Confidence 9999999999999999999999977789988 9999999999999998766 43 6999999999999999999977
Q ss_pred hccC
Q psy8499 587 LHLG 590 (591)
Q Consensus 587 ~H~~ 590 (591)
.-||
T Consensus 374 kkH~ 377 (467)
T KOG3608|consen 374 KKHG 377 (467)
T ss_pred Hhhc
Confidence 6554
No 3
>KOG2462|consensus
Probab=99.93 E-value=1.6e-26 Score=205.00 Aligned_cols=128 Identities=32% Similarity=0.694 Sum_probs=77.3
Q ss_pred CCccccccccCChhhHHHHHHhhCC----cccccccccccccchhhHHHHHHhhCCCCCCCccCccccCChhhHHHHHhh
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQVHQG----ITYKCNLCSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKCSLTEHMKL 506 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~~h~~----~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~ 506 (591)
++|+.||+.+.+.++|-+|.+.|.. +.+.|++|||.|.+...|..|+++|+ ...+|.+||+.|.....|+-|+|+
T Consensus 131 ~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQGHiRT 209 (279)
T KOG2462|consen 131 YKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQGHIRT 209 (279)
T ss_pred eeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhccccc
Confidence 4566666666666666666655543 34566666666666666666666665 335566666666666666666666
Q ss_pred hCCCCC-CCCcchhcccCchhHHHHHHhhcCCC-CCCCcchhhcCChhhHHHHhh
Q psy8499 507 HQGIKK-KCDLCSKEFSCKRSVTEHMKLHLGIK-NKCDLCSKEFSTKHTLTEHMK 559 (591)
Q Consensus 507 h~~~~~-~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~ 559 (591)
|+|++| .|+.|++.|+++++|+.||++|.+.+ |+|..|+|+|..++.|..|..
T Consensus 210 HTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 210 HTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred ccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 666666 66666666666666666666666555 566666666666666666643
No 4
>KOG1074|consensus
Probab=99.93 E-value=3.8e-27 Score=236.47 Aligned_cols=214 Identities=24% Similarity=0.536 Sum_probs=134.3
Q ss_pred CCCccccccCChHHHHHHHHhcC-CccccccccccccCCHHHHHHHHHHhCCCC-----Cccc---cchhccCCchhHHH
Q psy8499 351 KCDLCSKEFSCKRYVTEHMKLHQ-EITYKCDLCLKEYSNKIYLTEHLKLHQGIK-----NKCV---LCSKEFSRKRRLTE 421 (591)
Q Consensus 351 ~C~~C~~~f~~~~~l~~H~~~h~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-----~~C~---~C~~~f~~~~~l~~ 421 (591)
+|-+|.+....++.|+.|.++|. |+||+|.+||+.|.++.+|+.|+-+|.... +.|+ +|-+.|.....|..
T Consensus 607 qCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lpQ 686 (958)
T KOG1074|consen 607 QCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLPQ 686 (958)
T ss_pred ceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccccc
Confidence 47777777777777777777775 677777777777777777777776665432 4666 67777777777777
Q ss_pred HHHHhCCCC--------------CCccccccccCChhhHHHHHHhhCC-----------------c----cccccccccc
Q psy8499 422 HMKLHLGIK--------------NKCNLCSKEFSCKSHLKEHMQVHQG-----------------I----TYKCNLCSKE 466 (591)
Q Consensus 422 H~~~h~~~~--------------~~C~~C~~~f~~~~~l~~H~~~h~~-----------------~----~~~C~~C~~~ 466 (591)
|+++|.+.. -+|..|.+.|.....+..++..|.+ . +..+..|+..
T Consensus 687 hIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~~ 766 (958)
T KOG1074|consen 687 HIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCGRE 766 (958)
T ss_pred eEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccccc
Confidence 777765311 1567777777666666655544411 1 3456666666
Q ss_pred ccchhhHHHHHHhh-----------------------CCCC-C-CCccCccccCChhhHH----HHH-------------
Q psy8499 467 FYRKQNLTKHMKLH-----------------------QGIK-K-KCDLCSKEFSTKCSLT----EHM------------- 504 (591)
Q Consensus 467 f~~~~~L~~H~~~h-----------------------~~~~-~-~C~~C~~~f~~~~~L~----~H~------------- 504 (591)
+.....+..+--.+ ++.+ . .+..++..-...-... .=.
T Consensus 767 ~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n 846 (958)
T KOG1074|consen 767 LEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKTN 846 (958)
T ss_pred cCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccccccc
Confidence 66555544432111 1111 1 2434432221111000 000
Q ss_pred -hhhCC-------------------------CCCCCCcchhcccCchhHHHHHHhhcCCC-CCCCcchhhcCChhhHHHH
Q psy8499 505 -KLHQG-------------------------IKKKCDLCSKEFSCKRSVTEHMKLHLGIK-NKCDLCSKEFSTKHTLTEH 557 (591)
Q Consensus 505 -~~h~~-------------------------~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H 557 (591)
.++.+ .+..|.+|++.|.+..+|+.|+|+|+|++ |.|.+|++.|.++.+|+.|
T Consensus 847 ~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvH 926 (958)
T KOG1074|consen 847 EITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVH 926 (958)
T ss_pred cccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhh
Confidence 00000 01159999999999999999999999999 6999999999999999999
Q ss_pred hhhcCCc
Q psy8499 558 MKLHQGI 564 (591)
Q Consensus 558 ~~~h~~~ 564 (591)
|.+|.+.
T Consensus 927 MgtH~w~ 933 (958)
T KOG1074|consen 927 MGTHMWV 933 (958)
T ss_pred hcccccc
Confidence 9999875
No 5
>KOG3608|consensus
Probab=99.93 E-value=3.6e-25 Score=200.57 Aligned_cols=178 Identities=27% Similarity=0.541 Sum_probs=97.5
Q ss_pred cccccCCHHHHHHHHHHhCCCC-CccccchhccCCchhHHHHHHHhCCC---CCCccccccccCChhhHHHHHHhhCCcc
Q psy8499 382 CLKEYSNKIYLTEHLKLHQGIK-NKCVLCSKEFSRKRRLTEHMKLHLGI---KNKCNLCSKEFSCKSHLKEHMQVHQGIT 457 (591)
Q Consensus 382 C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~---~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 457 (591)
|-+.+.+++.|++|++.|++++ ..|+.||..|.++..|..|++..+.. .|.|..|.|.|.+...|+.|+..|-. -
T Consensus 185 Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-~ 263 (467)
T KOG3608|consen 185 CTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-C 263 (467)
T ss_pred hhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh-c
Confidence 5555555555555555555555 35555555555555555555544322 24555555555555555555555543 3
Q ss_pred cccccccccccchhhHHHHHHhhC-CCC-CCCccCccccCChhhHHHHHhhhCCCCCCCCc--chhcccCchhHHHHHHh
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKLHQ-GIK-KKCDLCSKEFSTKCSLTEHMKLHQGIKKKCDL--CSKEFSCKRSVTEHMKL 533 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~h~-~~~-~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~--C~~~f~~~~~l~~H~~~ 533 (591)
|+|+.|+.+....+.|..|++.-+ ..+ |+|+.|++.|.+.+.|.+|..+|+....+|+. |..+|.+...|++|++.
T Consensus 264 ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS~~~y~C~h~~C~~s~r~~~q~~~H~~e 343 (467)
T KOG3608|consen 264 YKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHSKTVYQCEHPDCHYSVRTYTQMRRHFLE 343 (467)
T ss_pred ccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhccccceecCCCCCcHHHHHHHHHHHHHHH
Confidence 555555555555555555555422 222 45555555555555555555555544445555 55555555555555554
Q ss_pred hc-CC---CCCCCcchhhcCChhhHHHHhhh
Q psy8499 534 HL-GI---KNKCDLCSKEFSTKHTLTEHMKL 560 (591)
Q Consensus 534 h~-~~---~~~C~~C~~~f~~~~~l~~H~~~ 560 (591)
++ |. .|.|..|++.|++-.+|..|++.
T Consensus 344 vhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~k 374 (467)
T KOG3608|consen 344 VHEGNNPILYACHCCDRFFTSGKSLSAHLMK 374 (467)
T ss_pred hccCCCCCceeeecchhhhccchhHHHHHHH
Confidence 43 22 24566666666666666666543
No 6
>KOG2462|consensus
Probab=99.92 E-value=2.2e-26 Score=204.00 Aligned_cols=130 Identities=30% Similarity=0.666 Sum_probs=121.5
Q ss_pred cccccccccccccchhhHHHHHHhhCCCC----CCCccCccccCChhhHHHHHhhhCCCCCCCCcchhcccCchhHHHHH
Q psy8499 456 ITYKCNLCSKEFYRKQNLTKHMKLHQGIK----KKCDLCSKEFSTKCSLTEHMKLHQGIKKKCDLCSKEFSCKRSVTEHM 531 (591)
Q Consensus 456 ~~~~C~~C~~~f~~~~~L~~H~~~h~~~~----~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~ 531 (591)
..|+|+.||+.+.+.++|.+|.++|-... +.|++||+.|.+...|+.|+++|+ ..-+|.+|||.|...-.|+-|+
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~-l~c~C~iCGKaFSRPWLLQGHi 207 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT-LPCECGICGKAFSRPWLLQGHI 207 (279)
T ss_pred CceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC-CCcccccccccccchHHhhccc
Confidence 35999999999999999999999997533 689999999999999999999998 3348999999999999999999
Q ss_pred HhhcCCC-CCCCcchhhcCChhhHHHHhhhcCCc-ccccCcCccccccchhHHhHhh
Q psy8499 532 KLHLGIK-NKCDLCSKEFSTKHTLTEHMKLHQGI-RYKCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 532 ~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~C~~C~~~f~~~~~l~~H~~ 586 (591)
|+|+|++ |.|..|+++|..+++|+.||++|.+. +|+|..|+|+|..++.|.+|..
T Consensus 208 RTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 208 RTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred ccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhh
Confidence 9999999 79999999999999999999999985 8999999999999999999965
No 7
>KOG3623|consensus
Probab=99.84 E-value=2e-21 Score=192.05 Aligned_cols=102 Identities=32% Similarity=0.659 Sum_probs=92.1
Q ss_pred CCCCccccccCChHHHHHHHHhcC---CccccccccccccCCHHHHHHHHHHhCC--------------CCCccccchhc
Q psy8499 350 NKCDLCSKEFSCKRYVTEHMKLHQ---EITYKCDLCLKEYSNKIYLTEHLKLHQG--------------IKNKCVLCSKE 412 (591)
Q Consensus 350 ~~C~~C~~~f~~~~~l~~H~~~h~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~--------------~~~~C~~C~~~ 412 (591)
..|++|.+.+.....|..|++.-+ +..|.|..|.++|..+..|.+|+.+|.. .+++|..|||.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 379999999999999999987643 5679999999999999999999998853 24799999999
Q ss_pred cCCchhHHHHHHHhCCCC-CCccccccccCChhhHHHHHH
Q psy8499 413 FSRKRRLTEHMKLHLGIK-NKCNLCSKEFSCKSHLKEHMQ 451 (591)
Q Consensus 413 f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~ 451 (591)
|..+..|++|+|+|.|++ |.|+.|+|+|.....+-.||.
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 999999999999999998 899999999999888888874
No 8
>KOG3623|consensus
Probab=99.83 E-value=2.6e-21 Score=191.30 Aligned_cols=102 Identities=34% Similarity=0.679 Sum_probs=92.3
Q ss_pred CCccccccccCChHHHHHHHhhhCCc---eeecccCccccCCHHHHHHHHHHhcC--------------CCCcccccccc
Q psy8499 269 NKCNLCSKEFSCKSHLKEHMQVHQGI---TYKCDLCSKEFYRKQNLTKHMKVHQG--------------MKNKCVLCSKE 331 (591)
Q Consensus 269 ~~C~~C~~~f~~~~~L~~H~~~h~~~---~~~C~~C~~~f~~~~~L~~H~~~h~~--------------~~~~C~~C~~~ 331 (591)
..|++|.+.+.....|+.|++.-+.+ .|.|..|..+|..+..|.+|+.+|.. .+|+|..||+.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 37999999999999999998755433 69999999999999999999998853 25799999999
Q ss_pred cCChhHHHHHHhhhcCCC-CCCCccccccCChHHHHHHHH
Q psy8499 332 FSSKSYLTEHMKLHQGIK-NKCDLCSKEFSCKRYVTEHMK 370 (591)
Q Consensus 332 f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~ 370 (591)
|..+..|+.|+++|.|++ |.|+.|++.|.....+..||.
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 999999999999999998 899999999999999999973
No 9
>KOG3576|consensus
Probab=99.62 E-value=1e-16 Score=134.68 Aligned_cols=106 Identities=31% Similarity=0.710 Sum_probs=67.5
Q ss_pred ccccccccccccchhhHHHHHHhhCCCC-CCCccCccccCChhhHHHHHhhhCCCCC-CCCcchhcccCchhHHHHHHhh
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMKLHQGIK-KKCDLCSKEFSTKCSLTEHMKLHQGIKK-KCDLCSKEFSCKRSVTEHMKLH 534 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~-~~C~~C~~~f~~~~~L~~H~~~h~~~~~-~C~~C~~~f~~~~~l~~H~~~h 534 (591)
.|.|.+|+|.|.....|.+|++-|...+ +-|..||+.|...-.|++|+|+|+|.+| +|..|++.|..+-+|..|++.-
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kv 196 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKV 196 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHH
Confidence 4777777777777777777777777655 5677777777777777777777777666 6777777777766666666544
Q ss_pred cCCC------------CCCCcchhhcCChhhHHHHhhhcC
Q psy8499 535 LGIK------------NKCDLCSKEFSTKHTLTEHMKLHQ 562 (591)
Q Consensus 535 ~~~~------------~~C~~C~~~f~~~~~l~~H~~~h~ 562 (591)
+|.. |.|+.||.+-.....+..|++.|+
T Consensus 197 hgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 197 HGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHH 236 (267)
T ss_pred cCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcC
Confidence 4321 234444444444444444444443
No 10
>KOG3576|consensus
Probab=99.59 E-value=4e-16 Score=131.10 Aligned_cols=107 Identities=34% Similarity=0.672 Sum_probs=74.3
Q ss_pred CccccchhccCCchhHHHHHHHhCCCC-CCccccccccCChhhHHHHHHhhCC-cccccccccccccchhhHHHHHHhhC
Q psy8499 404 NKCVLCSKEFSRKRRLTEHMKLHLGIK-NKCNLCSKEFSCKSHLKEHMQVHQG-ITYKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 404 ~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
+.|.+|++.|.-...|.+|++-|...+ +-|..||+.|.+.-.|++|+++|+| +||+|..|+++|+.+-.|..|++.-+
T Consensus 118 ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kvh 197 (267)
T KOG3576|consen 118 FTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKVH 197 (267)
T ss_pred eeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHHc
Confidence 456666666666666677776666654 5677777777777777777777777 67777777777777777777766554
Q ss_pred CCC------------CCCccCccccCChhhHHHHHhhhCCC
Q psy8499 482 GIK------------KKCDLCSKEFSTKCSLTEHMKLHQGI 510 (591)
Q Consensus 482 ~~~------------~~C~~C~~~f~~~~~L~~H~~~h~~~ 510 (591)
|.. |.|+.||.+-.....+..|+..|+..
T Consensus 198 gv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 198 GVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred CchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 432 45777777777777777777666553
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.18 E-value=3.6e-11 Score=121.79 Aligned_cols=144 Identities=18% Similarity=0.401 Sum_probs=88.7
Q ss_pred CccccccccCChhhHHHHHHhhCCccccccc--ccccccchhhHHHHHHhhCCCCCCCccCccccCChhhHHHHHhhhCC
Q psy8499 432 KCNLCSKEFSCKSHLKEHMQVHQGITYKCNL--CSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKCSLTEHMKLHQG 509 (591)
Q Consensus 432 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~ 509 (591)
.|+.|...... ..|..|.....-..-.|+. ||..| .+..|..| +.|+.|++.|. ...|..|+++|+
T Consensus 409 ~C~NC~~~i~l-~~l~lHe~~C~r~~V~Cp~~~Cg~v~-~r~el~~H--------~~C~~Cgk~f~-~s~LekH~~~~H- 476 (567)
T PLN03086 409 ECRNCKHYIPS-RSIALHEAYCSRHNVVCPHDGCGIVL-RVEEAKNH--------VHCEKCGQAFQ-QGEMEKHMKVFH- 476 (567)
T ss_pred ECCCCCCccch-hHHHHHHhhCCCcceeCCccccccee-eccccccC--------ccCCCCCCccc-hHHHHHHHHhcC-
Confidence 46666555442 3344555443333345653 66666 33444444 25777777774 466777777764
Q ss_pred CCCCCCcchhcccCchhHHHHHHhhcCCC-CCCCcchhhcC----------ChhhHHHHhhhcCCcccccCcCccccccc
Q psy8499 510 IKKKCDLCSKEFSCKRSVTEHMKLHLGIK-NKCDLCSKEFS----------TKHTLTEHMKLHQGIRYKCDLCTKEFSRK 578 (591)
Q Consensus 510 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~C~~C~~~f~~~ 578 (591)
....|+ |++.+ .+..|..|+.+|.+.+ +.|.+|++.|. ....|..|.......++.|..||+.|..+
T Consensus 477 kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~CG~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 477 EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESICGSRTAPCDSCGRSVMLK 554 (567)
T ss_pred CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhcCCcceEccccCCeeeeh
Confidence 233677 77544 5567777777777666 47777777774 23578888888766688888888887655
Q ss_pred hhHHhHhhhccC
Q psy8499 579 GRLTEHMKLHLG 590 (591)
Q Consensus 579 ~~l~~H~~~H~~ 590 (591)
. |..|+..-|.
T Consensus 555 d-m~~H~~~~h~ 565 (567)
T PLN03086 555 E-MDIHQIAVHQ 565 (567)
T ss_pred h-HHHHHHHhhc
Confidence 4 7777766543
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.07 E-value=2.5e-10 Score=115.76 Aligned_cols=59 Identities=20% Similarity=0.626 Sum_probs=28.1
Q ss_pred CccccccccCChhhHHHHHHhhCCcccccccccccccchhhHHHHHHhhCCCC-CCCccCcccc
Q psy8499 432 KCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSKEFYRKQNLTKHMKLHQGIK-KKCDLCSKEF 494 (591)
Q Consensus 432 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~-~~C~~C~~~f 494 (591)
.|+.|++.|. ...|..|+.+++ .+|.|+ ||+.+ .+..|..|+.+|.+.+ +.|+.|++.|
T Consensus 455 ~C~~Cgk~f~-~s~LekH~~~~H-kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v 514 (567)
T PLN03086 455 HCEKCGQAFQ-QGEMEKHMKVFH-EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMV 514 (567)
T ss_pred cCCCCCCccc-hHHHHHHHHhcC-CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCcc
Confidence 4555555553 344555555543 445555 55432 3345555554444433 3455555544
No 13
>PHA00733 hypothetical protein
Probab=98.94 E-value=4.8e-10 Score=92.35 Aligned_cols=78 Identities=19% Similarity=0.311 Sum_probs=59.8
Q ss_pred CCCcchhcccCchhHHHH------HHhhcCCCCCCCcchhhcCChhhHHHHhhhcCCcccccCcCccccccchhHHhHhh
Q psy8499 513 KCDLCSKEFSCKRSVTEH------MKLHLGIKNKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 513 ~C~~C~~~f~~~~~l~~H------~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~ 586 (591)
.|.+|.+.|.....|..+ +..+...++.|+.||+.|.+...|..|++.| ..+|.|++|+++|.....|..|+.
T Consensus 42 ~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h-~~~~~C~~CgK~F~~~~sL~~H~~ 120 (128)
T PHA00733 42 IRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT-EHSKVCPVCGKEFRNTDSTLDHVC 120 (128)
T ss_pred HHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC-CcCccCCCCCCccCCHHHHHHHHH
Confidence 566666666655554444 3444444578999999999999999999876 347999999999999999999999
Q ss_pred hccCC
Q psy8499 587 LHLGI 591 (591)
Q Consensus 587 ~H~~~ 591 (591)
..|+|
T Consensus 121 ~~h~~ 125 (128)
T PHA00733 121 KKHNI 125 (128)
T ss_pred HhcCc
Confidence 88876
No 14
>PHA02768 hypothetical protein; Provisional
Probab=98.86 E-value=1e-09 Score=73.16 Aligned_cols=43 Identities=19% Similarity=0.520 Sum_probs=37.6
Q ss_pred CCCCcchhhcCChhhHHHHhhhcCCcccccCcCccccccchhHH
Q psy8499 539 NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKEFSRKGRLT 582 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~ 582 (591)
|.|+.||+.|....+|..||++|+ ++|+|..|++.|.+.+.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~-k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN-TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC-CcccCCcccceecccceeE
Confidence 689999999999999999999988 6889999999998887764
No 15
>PHA00733 hypothetical protein
Probab=98.86 E-value=3.1e-09 Score=87.59 Aligned_cols=50 Identities=28% Similarity=0.536 Sum_probs=22.7
Q ss_pred CCccccccccCChhhHHHHHHhhCCcccccccccccccchhhHHHHHHhhC
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
+.|+.||+.|.+...|..|++.| ..+|.|+.|++.|.....|..|+...+
T Consensus 74 y~C~~Cgk~Fss~s~L~~H~r~h-~~~~~C~~CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 74 YVCPLCLMPFSSSVSLKQHIRYT-EHSKVCPVCGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred ccCCCCCCcCCCHHHHHHHHhcC-CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence 44444444444444444444433 123444444444444444444444333
No 16
>KOG3993|consensus
Probab=98.72 E-value=2e-09 Score=101.63 Aligned_cols=185 Identities=21% Similarity=0.305 Sum_probs=102.4
Q ss_pred cccccccccccCCHHHHHHHHHHh-CCCCCccccchhccCCchhHHHHHHHhCCCCCCcc--cc--ccccCChhhHHHHH
Q psy8499 376 TYKCDLCLKEYSNKIYLTEHLKLH-QGIKNKCVLCSKEFSRKRRLTEHMKLHLGIKNKCN--LC--SKEFSCKSHLKEHM 450 (591)
Q Consensus 376 ~~~C~~C~~~f~~~~~l~~H~~~h-~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~--~C--~~~f~~~~~l~~H~ 450 (591)
.|.|..|...|.+...|.+|.-.- --..|+|+.|+|.|+-..+|..|.|.|....-.=. .= .+.-.+....+.--
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~ 346 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAE 346 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhcc
Confidence 489999999999999999985211 11238899999999999999999888854321000 00 00000011111101
Q ss_pred Hhh---CCcccccccccccccchhhHHHHHHhhCCCC------------------CCCccCccccCChhhHHHHHhhhCC
Q psy8499 451 QVH---QGITYKCNLCSKEFYRKQNLTKHMKLHQGIK------------------KKCDLCSKEFSTKCSLTEHMKLHQG 509 (591)
Q Consensus 451 ~~h---~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~------------------~~C~~C~~~f~~~~~L~~H~~~h~~ 509 (591)
+.- ....|.|.+|+|+|.+..-|++|+.+|+... +-|+.|+-.+.....--.+...+.+
T Consensus 347 rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~ 426 (500)
T KOG3993|consen 347 RSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAG 426 (500)
T ss_pred ccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeec
Confidence 100 1135889999999999999999988876421 1244444333333222222222221
Q ss_pred --CCCCCCcchhcccCchhHHHHHHhhcCC-CCCCCcchhhcCChhhHHHHhhh
Q psy8499 510 --IKKKCDLCSKEFSCKRSVTEHMKLHLGI-KNKCDLCSKEFSTKHTLTEHMKL 560 (591)
Q Consensus 510 --~~~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~C~~C~~~f~~~~~l~~H~~~ 560 (591)
+...|++|+..+.++..--.+.+.-... -|.|.+|..+|.+...|.+|+..
T Consensus 427 sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 427 SAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred cccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhh
Confidence 1125666666666555443333322212 25666777777777777776543
No 17
>PHA02768 hypothetical protein; Provisional
Probab=98.66 E-value=1.2e-08 Score=68.09 Aligned_cols=40 Identities=18% Similarity=0.545 Sum_probs=18.1
Q ss_pred cccccccccccchhhHHHHHHhhCCCCCCCccCccccCChh
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKC 498 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~ 498 (591)
|.|+.||+.|...++|..|+++|+ ..++|..|++.|...+
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~-k~~kc~~C~k~f~~~s 45 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN-TNLKLSNCKRISLRTG 45 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC-CcccCCcccceecccc
Confidence 444444444444444444444444 2334444444444433
No 18
>KOG3993|consensus
Probab=98.60 E-value=4.9e-09 Score=99.01 Aligned_cols=184 Identities=18% Similarity=0.273 Sum_probs=117.2
Q ss_pred CccccchhccCCchhHHHHHHHh-CCCCCCccccccccCChhhHHHHHHhhCCcccccc--c-ccc-cccchhhHHHHHH
Q psy8499 404 NKCVLCSKEFSRKRRLTEHMKLH-LGIKNKCNLCSKEFSCKSHLKEHMQVHQGITYKCN--L-CSK-EFYRKQNLTKHMK 478 (591)
Q Consensus 404 ~~C~~C~~~f~~~~~l~~H~~~h-~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~--~-C~~-~f~~~~~L~~H~~ 478 (591)
|.|..|...|.....|.+|.-.- --..|+|+.|+|.|.-..+|..|.|.|..++=.=. . =.+ .-.+....+.=.|
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~r 347 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAER 347 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccc
Confidence 68999999999999999996211 11348999999999999999999999975321100 0 000 0001111110001
Q ss_pred h---hCCCCCCCccCccccCChhhHHHHHhhhCCCCC------------------CCCcchhcccCchhHHHHHHhhcCC
Q psy8499 479 L---HQGIKKKCDLCSKEFSTKCSLTEHMKLHQGIKK------------------KCDLCSKEFSCKRSVTEHMKLHLGI 537 (591)
Q Consensus 479 ~---h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~------------------~C~~C~~~f~~~~~l~~H~~~h~~~ 537 (591)
. -+..-|.|.+|++.|.....|+.|..+|+.... -|+.|.-.+.....--.+...+.+.
T Consensus 348 sg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~s 427 (500)
T KOG3993|consen 348 SGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGS 427 (500)
T ss_pred cCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeecc
Confidence 0 011237899999999999999999988864211 2444444444333322222222222
Q ss_pred --CCCCCcchhhcCChhhHHHHhhhcCC-cccccCcCccccccchhHHhHhhh
Q psy8499 538 --KNKCDLCSKEFSTKHTLTEHMKLHQG-IRYKCDLCTKEFSRKGRLTEHMKL 587 (591)
Q Consensus 538 --~~~C~~C~~~f~~~~~l~~H~~~h~~-~~~~C~~C~~~f~~~~~l~~H~~~ 587 (591)
...|+.|+..+.++..--.+.|.-.. .-|.|.+|..+|...-.|.+|+..
T Consensus 428 ael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 428 AELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred ccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhh
Confidence 23688899888887765555543222 359999999999999999999765
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.33 E-value=2.9e-07 Score=52.40 Aligned_cols=25 Identities=48% Similarity=0.939 Sum_probs=19.7
Q ss_pred hHHHHhhhcCCc-ccccCcCcccccc
Q psy8499 553 TLTEHMKLHQGI-RYKCDLCTKEFSR 577 (591)
Q Consensus 553 ~l~~H~~~h~~~-~~~C~~C~~~f~~ 577 (591)
+|..||++|+|+ ||+|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 477888888875 7888888888863
No 20
>PHA00732 hypothetical protein
Probab=98.19 E-value=1.2e-06 Score=65.02 Aligned_cols=45 Identities=24% Similarity=0.541 Sum_probs=26.0
Q ss_pred CCCcchhcccCchhHHHHHHh-hcCCCCCCCcchhhcCChhhHHHHhhhcC
Q psy8499 513 KCDLCSKEFSCKRSVTEHMKL-HLGIKNKCDLCSKEFSTKHTLTEHMKLHQ 562 (591)
Q Consensus 513 ~C~~C~~~f~~~~~l~~H~~~-h~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 562 (591)
+|+.|++.|.+..+|+.|++. |.+ +.|+.||++|. .|..|+++..
T Consensus 3 ~C~~Cgk~F~s~s~Lk~H~r~~H~~--~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 3 KCPICGFTTVTLFALKQHARRNHTL--TKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred cCCCCCCccCCHHHHHHHhhcccCC--CccCCCCCEeC---ChhhhhcccC
Confidence 466666666666666666653 433 35666666665 3556655443
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.16 E-value=1.2e-06 Score=49.77 Aligned_cols=24 Identities=46% Similarity=1.009 Sum_probs=16.0
Q ss_pred hHHHHHHhhCC-ccccccccccccc
Q psy8499 445 HLKEHMQVHQG-ITYKCNLCSKEFY 468 (591)
Q Consensus 445 ~l~~H~~~h~~-~~~~C~~C~~~f~ 468 (591)
+|..|+++|++ +||.|+.|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 35666666666 6677777777665
No 22
>PHA00732 hypothetical protein
Probab=98.04 E-value=3e-06 Score=62.91 Aligned_cols=45 Identities=27% Similarity=0.594 Sum_probs=23.1
Q ss_pred ccccccccccccchhhHHHHHHh-hCCCCCCCccCccccCChhhHHHHHhh
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMKL-HQGIKKKCDLCSKEFSTKCSLTEHMKL 506 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~~-h~~~~~~C~~C~~~f~~~~~L~~H~~~ 506 (591)
||.|+.||+.|.+..+|..|++. |.+ +.|+.||++|. .|..|+.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~--~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL--TKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC--CccCCCCCEeC---Chhhhhcc
Confidence 35555555555555555555553 332 34555555554 34555543
No 23
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=97.99 E-value=6.4e-07 Score=56.79 Aligned_cols=33 Identities=12% Similarity=-0.066 Sum_probs=23.7
Q ss_pred ccccccccccccccCcchhh-hhhhccccCCCcc
Q psy8499 42 SNESDRINIEENDWNDAVTR-DCLEFFKSCPKVE 74 (591)
Q Consensus 42 ~~~~~~~~~~~~~~q~~l~~-vmle~~~~~~s~~ 74 (591)
..++.++|..++++|++||+ +|+|||.++.|+|
T Consensus 8 v~fs~eEW~~L~~~Qk~ly~dvm~Eny~~l~slG 41 (41)
T PF01352_consen 8 VYFSQEEWELLDPAQKNLYRDVMLENYRNLVSLG 41 (41)
T ss_dssp T---HHHHHTS-HHHHHHHHHHHHHTTTS---S-
T ss_pred EEcChhhcccccceecccchhHHHHhhcccEecC
Confidence 35788899999999999999 9999999999875
No 24
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.88 E-value=1.4e-05 Score=55.01 Aligned_cols=50 Identities=22% Similarity=0.474 Sum_probs=35.4
Q ss_pred CCCCCcchhhcCChhhHHHHhhh-cCC--cccccCcCccccccchhHHhHhhhccC
Q psy8499 538 KNKCDLCSKEFSTKHTLTEHMKL-HQG--IRYKCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 538 ~~~C~~C~~~f~~~~~l~~H~~~-h~~--~~~~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
.|.|++|++ ..+..+|..|... |.+ +.+.|++|...+. .+|..|+..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 367888888 4566788888654 433 2588888887655 378888887764
No 25
>PHA00616 hypothetical protein
Probab=97.87 E-value=4.4e-06 Score=53.04 Aligned_cols=26 Identities=27% Similarity=0.461 Sum_probs=13.8
Q ss_pred ccccCcCccccccchhHHhHhhhccC
Q psy8499 565 RYKCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 565 ~~~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
||+|+.||+.|..+++|..|++.|||
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg 26 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHK 26 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcC
Confidence 34555555555555555555555554
No 26
>PHA00616 hypothetical protein
Probab=97.83 E-value=8.2e-06 Score=51.85 Aligned_cols=28 Identities=21% Similarity=0.484 Sum_probs=15.8
Q ss_pred ccccccccccccchhhHHHHHHhhCCCC
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMKLHQGIK 484 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 484 (591)
||+|+.||+.|..+++|..|++.|+|.+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~ 28 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQN 28 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCC
Confidence 4555555555555555555555555543
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.83 E-value=6.1e-06 Score=45.65 Aligned_cols=23 Identities=52% Similarity=1.051 Sum_probs=17.4
Q ss_pred cccCcCccccccchhHHhHhhhc
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLH 588 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H 588 (591)
|+|+.|++.|.+++.|..|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 56777777777777777777764
No 28
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.73 E-value=1.3e-05 Score=64.15 Aligned_cols=73 Identities=25% Similarity=0.488 Sum_probs=18.9
Q ss_pred CCccCccccCChhhHHHHHhhhCCCCCCCCcchhcccCchhHHHHHHhhcCCCCCCCcchhhcCChhhHHHHhhhc
Q psy8499 486 KCDLCSKEFSTKCSLTEHMKLHQGIKKKCDLCSKEFSCKRSVTEHMKLHLGIKNKCDLCSKEFSTKHTLTEHMKLH 561 (591)
Q Consensus 486 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h 561 (591)
+|..|+..|.+...|..||...++.... ....+.....+..+++.-.+..+.|.+|++.|.+..+|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP---DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ---------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc---cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 4788888888888888888766654321 112223444455555444444567888888888888888888754
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.68 E-value=2.5e-05 Score=43.17 Aligned_cols=22 Identities=50% Similarity=1.050 Sum_probs=11.7
Q ss_pred cccccccccccchhhHHHHHHh
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKL 479 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~ 479 (591)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 4455555555555555555544
No 30
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.66 E-value=8.4e-05 Score=51.16 Aligned_cols=20 Identities=25% Similarity=0.740 Sum_probs=8.9
Q ss_pred cccccccccccchhhHHHHHH
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMK 478 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~ 478 (591)
|.||+|++ ..+...|..|..
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~ 22 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCE 22 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHH
Confidence 44555555 233344555533
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.60 E-value=2.5e-05 Score=62.45 Aligned_cols=73 Identities=23% Similarity=0.477 Sum_probs=15.7
Q ss_pred ccccccccccchhhHHHHHHhhCCCCCCCccCccccCChhhHHHHHhhhCCCCCCCCcchhcccCchhHHHHHHhh
Q psy8499 459 KCNLCSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKCSLTEHMKLHQGIKKKCDLCSKEFSCKRSVTEHMKLH 534 (591)
Q Consensus 459 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h 534 (591)
+|..|+..|.+...|..|+...++.... ....+.....+..+++.-......|.+|++.|.+...|..||+.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP---DQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ---------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Cccccccccccccccccccccccccccc---cccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 3667777777777777776655554321 112222334444444433333346777777777777777777653
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.59 E-value=2.7e-05 Score=43.60 Aligned_cols=24 Identities=38% Similarity=0.915 Sum_probs=16.3
Q ss_pred cccCcCccccccchhHHhHhhhcc
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLHL 589 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H~ 589 (591)
|.|++|++.|.+...|+.|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 567777777777777777777764
No 33
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.45 E-value=5.8e-05 Score=43.58 Aligned_cols=26 Identities=38% Similarity=0.724 Sum_probs=22.1
Q ss_pred ccccCcCccccccchhHHhHhhhccC
Q psy8499 565 RYKCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 565 ~~~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
||.|+.|++.|.+...|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 57888899999888888888888865
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.28 E-value=0.00018 Score=40.13 Aligned_cols=23 Identities=39% Similarity=0.979 Sum_probs=10.1
Q ss_pred cccccccccccchhhHHHHHHhh
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKLH 480 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~h 480 (591)
|.|++|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34445555555555555554443
No 35
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.11 E-value=0.00024 Score=40.93 Aligned_cols=25 Identities=28% Similarity=0.651 Sum_probs=15.6
Q ss_pred ccccccccccccchhhHHHHHHhhC
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
||.|+.|++.|.+..+|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4566666666666666666665553
No 36
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.81 E-value=0.0005 Score=38.28 Aligned_cols=24 Identities=42% Similarity=0.718 Sum_probs=17.4
Q ss_pred cccCcCccccccchhHHhHhhhccC
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
|+|+.|+++.. ...|.+|+++|+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 67888888887 7788888888764
No 37
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.76 E-value=0.00079 Score=38.20 Aligned_cols=24 Identities=46% Similarity=0.904 Sum_probs=13.9
Q ss_pred cccCcCccccccchhHHhHhhhcc
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLHL 589 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H~ 589 (591)
|.|+.|+++|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 455666666666666666665543
No 38
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.21 E-value=0.0037 Score=35.31 Aligned_cols=19 Identities=47% Similarity=0.962 Sum_probs=7.6
Q ss_pred CccCccccCChhhHHHHHh
Q psy8499 487 CDLCSKEFSTKCSLTEHMK 505 (591)
Q Consensus 487 C~~C~~~f~~~~~L~~H~~ 505 (591)
|+.|++.|.+...|..|++
T Consensus 3 C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 3 CPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCcchhCCHHHHHHHHH
Confidence 3334444444444444433
No 39
>KOG2231|consensus
Probab=96.18 E-value=0.0079 Score=62.88 Aligned_cols=116 Identities=23% Similarity=0.599 Sum_probs=67.6
Q ss_pred CccccchhccCCchhHHHHHHHhCCCCCCccccccccCChhhHHHHHHhhCCccccccccc---------ccccchhhHH
Q psy8499 404 NKCVLCSKEFSRKRRLTEHMKLHLGIKNKCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCS---------KEFYRKQNLT 474 (591)
Q Consensus 404 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~---------~~f~~~~~L~ 474 (591)
+.|.+|++.|.-.. .+-.|..| -.|.+...|+.|+..-++. +.|.+|- ....+...|.
T Consensus 100 ~~C~~C~~~~~~~~-----------~~~~~~~c-~~~~s~~~Lk~H~~~~H~~-~~c~lC~~~~kif~~e~k~Yt~~el~ 166 (669)
T KOG2231|consen 100 HSCHICDRRFRALY-----------NKKECLHC-TEFKSVENLKNHMRDQHKL-HLCSLCLQNLKIFINERKLYTRAELN 166 (669)
T ss_pred hhcCccccchhhhc-----------ccCCCccc-cchhHHHHHHHHHHHhhhh-hccccccccceeeeeeeehehHHHHH
Confidence 46888877664311 12368888 7777888888888554332 3444432 2223355666
Q ss_pred HHHHhhCC-C-----CCCCccCccccCChhhHHHHHhhhCCCCCCCCcc------hhcccCchhHHHHHHhhc
Q psy8499 475 KHMKLHQG-I-----KKKCDLCSKEFSTKCSLTEHMKLHQGIKKKCDLC------SKEFSCKRSVTEHMKLHL 535 (591)
Q Consensus 475 ~H~~~h~~-~-----~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C------~~~f~~~~~l~~H~~~h~ 535 (591)
.|+..--. . ...|..|...|-....|.+|++.++.. |..| +.-|..-..|..|.+.++
T Consensus 167 ~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h~~---chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 167 LHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDHEF---CHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred HHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccceeh---eeecCcccccchhcccchHHHHHhhhcC
Confidence 67653211 1 135777777777777777777765543 4444 344556666777766554
No 40
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.18 E-value=0.0022 Score=36.03 Aligned_cols=23 Identities=35% Similarity=0.917 Sum_probs=17.5
Q ss_pred cccCcCccccccchhHHhHhhhc
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLH 588 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H 588 (591)
|.|++|++.|.+...|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 56788888888888888887654
No 41
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.03 E-value=0.0032 Score=40.96 Aligned_cols=26 Identities=15% Similarity=0.397 Sum_probs=12.5
Q ss_pred ccccCcCccccccchhHHhHhhhccC
Q psy8499 565 RYKCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 565 ~~~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
|-.|++|+..+++..+|++|+.++|+
T Consensus 24 PatCP~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 24 PATCPICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp -EE-TTT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCcchhhccchhhHHHHHHHHhc
Confidence 45555555555555555555555443
No 42
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.02 E-value=0.0022 Score=59.19 Aligned_cols=24 Identities=38% Similarity=0.845 Sum_probs=19.3
Q ss_pred Cccccccc--ccccccchhhHHHHHH
Q psy8499 455 GITYKCNL--CSKEFYRKQNLTKHMK 478 (591)
Q Consensus 455 ~~~~~C~~--C~~~f~~~~~L~~H~~ 478 (591)
++||+|++ |.|+++....|+.|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~l 372 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHML 372 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhh
Confidence 57888876 8888888888888865
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.95 E-value=0.0044 Score=34.39 Aligned_cols=23 Identities=48% Similarity=0.846 Sum_probs=11.1
Q ss_pred cccccccccccchhhHHHHHHhhC
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
|.|+.|++... ...|..|++.|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45555555554 555555555543
No 44
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.65 E-value=0.005 Score=56.88 Aligned_cols=63 Identities=25% Similarity=0.487 Sum_probs=38.8
Q ss_pred CCCc--chhcccCchhHHHHHHhhc-CCC---CCCCcchhhcCChhhHHHHhhhcCCcccccCcCccccccchhHHhHhh
Q psy8499 513 KCDL--CSKEFSCKRSVTEHMKLHL-GIK---NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 513 ~C~~--C~~~f~~~~~l~~H~~~h~-~~~---~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~ 586 (591)
+|++ |+|.++....|+.|+.--+ ..+ ..=+.=-..| .-..+||.|++|+|.++....|+-|+.
T Consensus 351 kCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F-----------~~~~KPYrCevC~KRYKNlNGLKYHr~ 419 (423)
T COG5189 351 KCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF-----------SAKDKPYRCEVCDKRYKNLNGLKYHRK 419 (423)
T ss_pred cCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc-----------cccCCceeccccchhhccCccceeccc
Confidence 6766 7777777777777765311 111 0000001111 122468999999999999999998875
No 45
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.60 E-value=0.0067 Score=34.03 Aligned_cols=20 Identities=30% Similarity=0.911 Sum_probs=9.0
Q ss_pred ccccccccccchhhHHHHHH
Q psy8499 459 KCNLCSKEFYRKQNLTKHMK 478 (591)
Q Consensus 459 ~C~~C~~~f~~~~~L~~H~~ 478 (591)
.|++|++.|.+...|..|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 34444444444444444443
No 46
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.57 E-value=0.0055 Score=35.12 Aligned_cols=22 Identities=45% Similarity=0.898 Sum_probs=17.8
Q ss_pred cccCcCccccccchhHHhHhhh
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKL 587 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~ 587 (591)
|-|.+|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 6788888888888888888775
No 47
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.53 E-value=0.0091 Score=38.93 Aligned_cols=18 Identities=22% Similarity=0.706 Sum_probs=5.6
Q ss_pred CCcchhcccCchhHHHHH
Q psy8499 514 CDLCSKEFSCKRSVTEHM 531 (591)
Q Consensus 514 C~~C~~~f~~~~~l~~H~ 531 (591)
|++|+..+.+..+|++|+
T Consensus 27 CP~C~a~~~~srnLrRHl 44 (54)
T PF09237_consen 27 CPICGAVIRQSRNLRRHL 44 (54)
T ss_dssp -TTT--EESSHHHHHHHH
T ss_pred CCcchhhccchhhHHHHH
Confidence 333333333333333333
No 48
>KOG2231|consensus
Probab=95.43 E-value=0.031 Score=58.59 Aligned_cols=50 Identities=20% Similarity=0.377 Sum_probs=21.9
Q ss_pred CCCccccccCChHHHHHHHHhcCCccccc---cccccccCCHHHHHHHHHHhC
Q psy8499 351 KCDLCSKEFSCKRYVTEHMKLHQEITYKC---DLCLKEYSNKIYLTEHLKLHQ 400 (591)
Q Consensus 351 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C---~~C~~~f~~~~~l~~H~~~h~ 400 (591)
.|..|...|.....|..|++.+++.-.-| +.++..|.+...|..|.+.+|
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~h~~chfC~~~~~~neyy~~~~dLe~HfR~~H 236 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFDHEFCHFCDYKTGQNEYYNDYDDLEEHFRKGH 236 (669)
T ss_pred cchhhhhhhccHHHHHHhhccceeheeecCcccccchhcccchHHHHHhhhcC
Confidence 45555555555555555554443221112 112233444455555554444
No 49
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.00 E-value=0.018 Score=53.90 Aligned_cols=123 Identities=26% Similarity=0.553 Sum_probs=68.1
Q ss_pred CCccc--cccccCChhhHHHHHHhhCCccccccccc---ccc------cchhhHHHHHHhhC---CCC--CCCccCcccc
Q psy8499 431 NKCNL--CSKEFSCKSHLKEHMQVHQGITYKCNLCS---KEF------YRKQNLTKHMKLHQ---GIK--KKCDLCSKEF 494 (591)
Q Consensus 431 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~C~~C~---~~f------~~~~~L~~H~~~h~---~~~--~~C~~C~~~f 494 (591)
|.|+. |..+......|+.|.++.+++ +-|.+|- +.| .++..|+.|...-. |.+ +.|..|...|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~-~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~F 230 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF-VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYF 230 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc-EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhcccee
Confidence 45654 566666677788887776653 4566653 223 23445555543221 212 3677777777
Q ss_pred CChhhHHHHHhhhCCCCCCCCcchhc-------ccCchhHHHHHHhhcCCCCCCCc--c----hhhcCChhhHHHHhhh
Q psy8499 495 STKCSLTEHMKLHQGIKKKCDLCSKE-------FSCKRSVTEHMKLHLGIKNKCDL--C----SKEFSTKHTLTEHMKL 560 (591)
Q Consensus 495 ~~~~~L~~H~~~h~~~~~~C~~C~~~-------f~~~~~l~~H~~~h~~~~~~C~~--C----~~~f~~~~~l~~H~~~ 560 (591)
-+-..|..|+|.-+. .|-+|++. |.+-.+|..|.+.-+ |.|.+ | -..|.....|..|+..
T Consensus 231 YdDDEL~~HcR~~HE---~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h---y~ct~qtc~~~k~~vf~~~~el~~h~~~ 303 (493)
T COG5236 231 YDDDELRRHCRLRHE---ACHICDMVGPIRYQYFKSYEDLEAHFRNAH---YCCTFQTCRVGKCYVFPYHTELLEHLTR 303 (493)
T ss_pred cChHHHHHHHHhhhh---hhhhhhccCccchhhhhCHHHHHHHhhcCc---eEEEEEEEecCcEEEeccHHHHHHHHHH
Confidence 777777777776444 46666542 555566666654321 22211 1 1356666667777644
No 50
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.41 E-value=0.045 Score=51.37 Aligned_cols=79 Identities=23% Similarity=0.444 Sum_probs=40.9
Q ss_pred cccccc--cccccCChhhHHHHHHhhCCCccccCcccc---ccC------ChhHHHHHHHHhCCC---C--Ccccccccc
Q psy8499 106 GYKCDL--CSKEYSYKHTLTEHMKLHQGIRYKCDLCSK---EFS------CKRYLTEHMKVHQGM---K--NKCVLCSKE 169 (591)
Q Consensus 106 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~---~f~------~~~~l~~H~~~h~~~---~--~~C~~C~~~ 169 (591)
.|.|+. |..+......|..|.+..++ .+-|.+|-. .|. +...|..|...-... + ..|..|...
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~-~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG-FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC-cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 356653 55555556677777765444 355666532 222 233445554321111 1 356666666
Q ss_pred cCChhHHHHHHhhhcC
Q psy8499 170 FSSKSYLTEHMKRHQG 185 (591)
Q Consensus 170 f~~~~~l~~H~~~h~~ 185 (591)
|-+-..|..|++..+.
T Consensus 230 FYdDDEL~~HcR~~HE 245 (493)
T COG5236 230 FYDDDELRRHCRLRHE 245 (493)
T ss_pred ecChHHHHHHHHhhhh
Confidence 6666666666665444
No 51
>KOG2785|consensus
Probab=94.20 E-value=0.11 Score=49.95 Aligned_cols=22 Identities=27% Similarity=0.475 Sum_probs=17.0
Q ss_pred CccccchhccCCchhHHHHHHH
Q psy8499 404 NKCVLCSKEFSRKRRLTEHMKL 425 (591)
Q Consensus 404 ~~C~~C~~~f~~~~~l~~H~~~ 425 (591)
+.|.-|...|.+...-+.|+++
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKS 25 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKS 25 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhh
Confidence 5788888888888777888754
No 52
>KOG2482|consensus
Probab=94.08 E-value=0.11 Score=48.93 Aligned_cols=64 Identities=30% Similarity=0.511 Sum_probs=39.2
Q ss_pred chhHHHHHHHhCCCCC--CccccccccC-ChhhHHHHHHh-hC---Cc-------------------ccccccccccccc
Q psy8499 416 KRRLTEHMKLHLGIKN--KCNLCSKEFS-CKSHLKEHMQV-HQ---GI-------------------TYKCNLCSKEFYR 469 (591)
Q Consensus 416 ~~~l~~H~~~h~~~~~--~C~~C~~~f~-~~~~l~~H~~~-h~---~~-------------------~~~C~~C~~~f~~ 469 (591)
+..|..+++.-.+..+ +|-.|...+. .++....|+-. |. |. .+.|-.|.+.|..
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrd 207 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRD 207 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCC
Confidence 4566777666555443 7888987765 45666777643 32 12 2556666666666
Q ss_pred hhhHHHHHHh
Q psy8499 470 KQNLTKHMKL 479 (591)
Q Consensus 470 ~~~L~~H~~~ 479 (591)
+..|+.||+.
T Consensus 208 kntLkeHMrk 217 (423)
T KOG2482|consen 208 KNTLKEHMRK 217 (423)
T ss_pred cHHHHHHHHh
Confidence 6666666654
No 53
>KOG2482|consensus
Probab=93.92 E-value=0.19 Score=47.32 Aligned_cols=74 Identities=20% Similarity=0.361 Sum_probs=36.7
Q ss_pred ccccccccccChHHHHHHHHHhCCCCCC--ccccccccCChHHHHHHHhhhCCceeecccCccccCCHHHHHHHHHH
Q psy8499 243 KCVLCSKEFSRKRRLTEHMKLHLGIKNK--CNLCSKEFSCKSHLKEHMQVHQGITYKCDLCSKEFYRKQNLTKHMKV 317 (591)
Q Consensus 243 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~--C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~ 317 (591)
.|-.|.....+...|..||+.-+.-... =..=+-.|...-.+..-+|... ..-.|-.|.-.|.....|..||.-
T Consensus 281 ~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~-~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 281 VCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQK-KKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred EEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHh-hccccccccccccCcchhhhhccc
Confidence 6888888888888888888765432110 0000111222222222222111 123455555666666666666643
No 54
>KOG2785|consensus
Probab=93.83 E-value=0.2 Score=48.26 Aligned_cols=76 Identities=21% Similarity=0.336 Sum_probs=42.2
Q ss_pred CccccccccccChHHHHHHHHHh-CCC--C----CCccccccccCChHHHHHHHh----hhCCceeecccCccccCCHHH
Q psy8499 242 NKCVLCSKEFSRKRRLTEHMKLH-LGI--K----NKCNLCSKEFSCKSHLKEHMQ----VHQGITYKCDLCSKEFYRKQN 310 (591)
Q Consensus 242 ~~C~~C~~~f~~~~~l~~H~~~h-~~~--~----~~C~~C~~~f~~~~~L~~H~~----~h~~~~~~C~~C~~~f~~~~~ 310 (591)
++|..|...|.+...-+.|+++- +.. + .-=++=-..|..+..-..=.. .-.+.++.|.+|.+.|.+...
T Consensus 4 ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~a 83 (390)
T KOG2785|consen 4 FTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPKA 83 (390)
T ss_pred ceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChhh
Confidence 57888888888888888887642 100 0 001111122221111111100 112237889999999999888
Q ss_pred HHHHHHH
Q psy8499 311 LTKHMKV 317 (591)
Q Consensus 311 L~~H~~~ 317 (591)
...|+..
T Consensus 84 ~~~hl~S 90 (390)
T KOG2785|consen 84 HENHLKS 90 (390)
T ss_pred HHHHHHH
Confidence 8888754
No 55
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.75 E-value=0.022 Score=32.61 Aligned_cols=21 Identities=43% Similarity=0.906 Sum_probs=12.6
Q ss_pred eecccCccccCCHHHHHHHHH
Q psy8499 296 YKCDLCSKEFYRKQNLTKHMK 316 (591)
Q Consensus 296 ~~C~~C~~~f~~~~~L~~H~~ 316 (591)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 456666666666666666653
No 56
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.86 E-value=0.061 Score=30.04 Aligned_cols=19 Identities=37% Similarity=0.896 Sum_probs=11.2
Q ss_pred ccCcCccccccchhHHhHhh
Q psy8499 567 KCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 567 ~C~~C~~~f~~~~~l~~H~~ 586 (591)
.|++||+.| ..+.|..|++
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 466666666 4555666654
No 57
>PRK04860 hypothetical protein; Provisional
Probab=92.79 E-value=0.051 Score=46.75 Aligned_cols=36 Identities=19% Similarity=0.683 Sum_probs=23.6
Q ss_pred ccccccccccccchhhHHHHHHhhCCCC-CCCccCccccCC
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMKLHQGIK-KKCDLCSKEFST 496 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~-~~C~~C~~~f~~ 496 (591)
+|.|. |+. ....+.+|.++|.+.+ |+|..|+..|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence 56776 665 5566677777776655 677777766654
No 58
>PRK04860 hypothetical protein; Provisional
Probab=92.47 E-value=0.057 Score=46.48 Aligned_cols=38 Identities=26% Similarity=0.691 Sum_probs=29.0
Q ss_pred CCCCcchhhcCChhhHHHHhhhcCCc-ccccCcCccccccchh
Q psy8499 539 NKCDLCSKEFSTKHTLTEHMKLHQGI-RYKCDLCTKEFSRKGR 580 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~C~~C~~~f~~~~~ 580 (591)
|.|. |+. ....+++|.++|++. +|.|..|+..|.....
T Consensus 120 Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 120 YRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEecc
Confidence 5787 877 677788888888875 6888888888775543
No 59
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.05 E-value=0.12 Score=28.87 Aligned_cols=19 Identities=32% Similarity=0.782 Sum_probs=9.9
Q ss_pred CCccCccccCChhhHHHHHh
Q psy8499 486 KCDLCSKEFSTKCSLTEHMK 505 (591)
Q Consensus 486 ~C~~C~~~f~~~~~L~~H~~ 505 (591)
.|+.||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 355555555 4445555543
No 60
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.35 E-value=0.11 Score=31.82 Aligned_cols=23 Identities=26% Similarity=0.612 Sum_probs=18.2
Q ss_pred ccccCcCccccccchhHHhHhhh
Q psy8499 565 RYKCDLCTKEFSRKGRLTEHMKL 587 (591)
Q Consensus 565 ~~~C~~C~~~f~~~~~l~~H~~~ 587 (591)
+|.|++|+..|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 57788888888888888888764
No 61
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=90.89 E-value=0.046 Score=34.78 Aligned_cols=14 Identities=21% Similarity=0.427 Sum_probs=9.6
Q ss_pred CccccccccccCCC
Q psy8499 2 ELNVEESKDHVGLS 15 (591)
Q Consensus 2 ~~~~e~~~~~~~~~ 15 (591)
|||+|||.+++++|
T Consensus 28 dvm~Eny~~l~slG 41 (41)
T PF01352_consen 28 DVMLENYRNLVSLG 41 (41)
T ss_dssp HHHHHTTTS---S-
T ss_pred hHHHHhhcccEecC
Confidence 89999999999986
No 62
>KOG4173|consensus
Probab=90.09 E-value=0.43 Score=41.57 Aligned_cols=47 Identities=19% Similarity=0.603 Sum_probs=33.2
Q ss_pred ccccc--cccccCChhhHHHHHHhhCCCccccCccccccCChhHHHHHHHH
Q psy8499 107 YKCDL--CSKEYSYKHTLTEHMKLHQGIRYKCDLCSKEFSCKRYLTEHMKV 155 (591)
Q Consensus 107 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~ 155 (591)
+.|++ |...|.+...+..|...-++ -.|..|.+.|++...|..|+..
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~--~sCs~C~r~~Pt~hLLd~HI~E 128 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHG--NSCSFCKRAFPTGHLLDAHILE 128 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhccc--chhHHHHHhCCchhhhhHHHHH
Confidence 55664 67788877777777643333 3688888888888888888653
No 63
>KOG1146|consensus
Probab=88.77 E-value=0.13 Score=57.57 Aligned_cols=26 Identities=27% Similarity=0.328 Sum_probs=21.5
Q ss_pred CccccccccccccCCHHHHHHHHHHh
Q psy8499 374 EITYKCDLCLKEYSNKIYLTEHLKLH 399 (591)
Q Consensus 374 ~~~~~C~~C~~~f~~~~~l~~H~~~h 399 (591)
.+.|+|+.|+..|+....|..|+|..
T Consensus 463 ~kt~~cpkc~~~yk~a~~L~vhmRsk 488 (1406)
T KOG1146|consen 463 FKTLKCPKCNWHYKLAQTLGVHMRSK 488 (1406)
T ss_pred cccccCCccchhhhhHHHhhhccccc
Confidence 46788888888888888888888873
No 64
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=88.70 E-value=0.14 Score=53.42 Aligned_cols=152 Identities=30% Similarity=0.458 Sum_probs=84.4
Q ss_pred CCccccccccCChhhHHHHHH--hhCC---cccccc--cccccccchhhHHHHHHhhCCCCC-CCc--cCccccCChhh-
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQ--VHQG---ITYKCN--LCSKEFYRKQNLTKHMKLHQGIKK-KCD--LCSKEFSTKCS- 499 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~--~h~~---~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~-~C~--~C~~~f~~~~~- 499 (591)
+.|..|...|.....|..|.+ .|.+ +++.|+ .|++.|.....+..|...|.+... .+. .+...+.....
T Consensus 290 ~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (467)
T COG5048 290 IKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLLNN 369 (467)
T ss_pred CCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCccccccccCC
Confidence 455555555555555555555 4443 366666 577777777777777766666552 222 23333332222
Q ss_pred -----HHHHHhhhCCCCCCC--CcchhcccCchhHHHHHHhhcCCC---CCCCcchhhcCChhhHHHHhhhcCCcccccC
Q psy8499 500 -----LTEHMKLHQGIKKKC--DLCSKEFSCKRSVTEHMKLHLGIK---NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCD 569 (591)
Q Consensus 500 -----L~~H~~~h~~~~~~C--~~C~~~f~~~~~l~~H~~~h~~~~---~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~ 569 (591)
+......+......+ ..|...+.....+..|...|.... +.+..|.+.|.....+..|++.|......+.
T Consensus 370 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (467)
T COG5048 370 EPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLC 449 (467)
T ss_pred CCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCceee
Confidence 111111111211122 226667777777777777776544 3678888888888888888888876643343
Q ss_pred cCccccccchhHH
Q psy8499 570 LCTKEFSRKGRLT 582 (591)
Q Consensus 570 ~C~~~f~~~~~l~ 582 (591)
.+-+.|.....+.
T Consensus 450 ~~~~~~~~~~~~~ 462 (467)
T COG5048 450 SILKSFRRDLDLS 462 (467)
T ss_pred ccccccchhhhhh
Confidence 4444444433333
No 65
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.56 E-value=0.27 Score=30.06 Aligned_cols=22 Identities=27% Similarity=0.714 Sum_probs=13.3
Q ss_pred ccccccccccccchhhHHHHHH
Q psy8499 457 TYKCNLCSKEFYRKQNLTKHMK 478 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~~H~~ 478 (591)
+|.|++|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3556666666666666666653
No 66
>KOG2893|consensus
Probab=87.59 E-value=0.21 Score=44.44 Aligned_cols=47 Identities=30% Similarity=0.723 Sum_probs=28.6
Q ss_pred CCccccccccCChhhHHHHHHhhCCcccccccccccccchhhHHHH-HHhh
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSKEFYRKQNLTKH-MKLH 480 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H-~~~h 480 (591)
.-|.+|++.|.....|..|++. +-|+|.+|.|...+--.|..| |.+|
T Consensus 11 pwcwycnrefddekiliqhqka---khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 11 PWCWYCNREFDDEKILIQHQKA---KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred ceeeecccccchhhhhhhhhhh---ccceeeeehhhhccCCCceeehhhhh
Confidence 3466666666666666666554 336677777766666666665 3343
No 67
>KOG2893|consensus
Probab=87.14 E-value=0.19 Score=44.64 Aligned_cols=42 Identities=31% Similarity=0.631 Sum_probs=23.9
Q ss_pred ccccccccccChHHHHHHHHHhCCCCCCccccccccCChHHHHHH
Q psy8499 243 KCVLCSKEFSRKRRLTEHMKLHLGIKNKCNLCSKEFSCKSHLKEH 287 (591)
Q Consensus 243 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H 287 (591)
-|.+|++.|....-|..|++..+ ++|.+|.+...+--.|..|
T Consensus 12 wcwycnrefddekiliqhqkakh---fkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH---FKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc---ceeeeehhhhccCCCceee
Confidence 46666666666666666654322 5566666655555555555
No 68
>KOG1146|consensus
Probab=86.54 E-value=0.19 Score=56.34 Aligned_cols=24 Identities=25% Similarity=0.585 Sum_probs=17.7
Q ss_pred cccccCcCccccccchhHHhHhhh
Q psy8499 564 IRYKCDLCTKEFSRKGRLTEHMKL 587 (591)
Q Consensus 564 ~~~~C~~C~~~f~~~~~l~~H~~~ 587 (591)
++|.|..|+.+++++.+|..|+..
T Consensus 517 ~p~~C~~C~~stttng~LsihlqS 540 (1406)
T KOG1146|consen 517 KPYPCRACNYSTTTNGNLSIHLQS 540 (1406)
T ss_pred CcccceeeeeeeecchHHHHHHHH
Confidence 467777777777777777777653
No 69
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=85.05 E-value=0.2 Score=52.10 Aligned_cols=55 Identities=29% Similarity=0.583 Sum_probs=37.3
Q ss_pred CccccccccccccCChhhHHHHHHhhCCC-ccccCc--cccccCChhHHHHHHHHhCC
Q psy8499 104 DIGYKCDLCSKEYSYKHTLTEHMKLHQGI-RYKCDL--CSKEFSCKRYLTEHMKVHQG 158 (591)
Q Consensus 104 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~C~~--C~~~f~~~~~l~~H~~~h~~ 158 (591)
...+.|+.|...|........|...+.+. ++.|.. |...+.....+..|...+..
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (467)
T COG5048 31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88 (467)
T ss_pred CchhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhcccccc
Confidence 34567888888888888888887777664 566654 44566666666666665543
No 70
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=82.62 E-value=1.6 Score=34.74 Aligned_cols=34 Identities=21% Similarity=0.560 Sum_probs=21.6
Q ss_pred cccccccccccCChhhHHHHHHhhCCCccccCccccccCChhHHHHH
Q psy8499 106 GYKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSKEFSCKRYLTEH 152 (591)
Q Consensus 106 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H 152 (591)
+|.|+.|+...- ..|..|++|+-...+...|.+-
T Consensus 1 GY~CPrC~skvC-------------~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 1 GYFCPQCRAKVC-------------ELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CccCCCCCCCcc-------------CCCCcCCcCCCEEeccchHHHh
Confidence 367777764432 2367788888777666666543
No 71
>KOG4173|consensus
Probab=80.66 E-value=0.89 Score=39.72 Aligned_cols=45 Identities=24% Similarity=0.663 Sum_probs=21.7
Q ss_pred ccccc--ccccccchhhHHHHHHhhCCCCCCCccCccccCChhhHHHHH
Q psy8499 458 YKCNL--CSKEFYRKQNLTKHMKLHQGIKKKCDLCSKEFSTKCSLTEHM 504 (591)
Q Consensus 458 ~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~ 504 (591)
|.|++ |...|........|-.+-++. .|..|.+.|.+...|..|+
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~--sCs~C~r~~Pt~hLLd~HI 126 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHGN--SCSFCKRAFPTGHLLDAHI 126 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhcccc--hhHHHHHhCCchhhhhHHH
Confidence 45544 445555554444443322221 3555555555555555554
No 72
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=79.11 E-value=1.1 Score=26.90 Aligned_cols=24 Identities=25% Similarity=0.785 Sum_probs=14.5
Q ss_pred ccccccccccCChhhHHHHHHhhCCCccccCcccc
Q psy8499 107 YKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSK 141 (591)
Q Consensus 107 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~ 141 (591)
|.|..||..+.... .+..|++|+.
T Consensus 2 ~~C~~CGy~y~~~~-----------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE-----------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc-----------CCCcCcCCCC
Confidence 56777776664321 3567777764
No 73
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=77.13 E-value=1.3 Score=35.20 Aligned_cols=13 Identities=31% Similarity=0.636 Sum_probs=6.6
Q ss_pred ccccccccccccc
Q psy8499 457 TYKCNLCSKEFYR 469 (591)
Q Consensus 457 ~~~C~~C~~~f~~ 469 (591)
|-.|+.||..|.-
T Consensus 26 PivCP~CG~~~~~ 38 (108)
T PF09538_consen 26 PIVCPKCGTEFPP 38 (108)
T ss_pred CccCCCCCCccCc
Confidence 4455555555543
No 74
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=75.53 E-value=4 Score=32.81 Aligned_cols=20 Identities=15% Similarity=0.439 Sum_probs=13.4
Q ss_pred ccccchhccCCchhHHHHHHH
Q psy8499 405 KCVLCSKEFSRKRRLTEHMKL 425 (591)
Q Consensus 405 ~C~~C~~~f~~~~~l~~H~~~ 425 (591)
.|..|+..... +.+..|++.
T Consensus 13 IC~~C~~av~~-~~v~~HL~~ 32 (109)
T PF12013_consen 13 ICRQCQYAVQP-SEVESHLRK 32 (109)
T ss_pred EeCCCCcccCc-hHHHHHHHH
Confidence 57777766655 677777764
No 75
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=75.19 E-value=1.8 Score=34.49 Aligned_cols=33 Identities=27% Similarity=0.581 Sum_probs=17.4
Q ss_pred CCCCCCCcchhhcCChhhHHHHhhhcCCcccccCcCccccccc
Q psy8499 536 GIKNKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKEFSRK 578 (591)
Q Consensus 536 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~ 578 (591)
|.|..|+.||++|.... ..|-.|+.||..|.-.
T Consensus 7 GtKR~Cp~CG~kFYDLn----------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 7 GTKRTCPSCGAKFYDLN----------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred CCcccCCCCcchhccCC----------CCCccCCCCCCccCcc
Confidence 44555666666664332 1255566666665433
No 76
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=73.82 E-value=4.4 Score=32.27 Aligned_cols=49 Identities=20% Similarity=0.369 Sum_probs=26.6
Q ss_pred CccccccccCChhhHHHHHHhhCCcccccccccccccchhhHHHHHHhhC
Q psy8499 432 KCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 432 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
.|--|+..|........= ..-....|.|+.|...|=..-++..|...|.
T Consensus 57 ~C~~C~~~f~~~~~~~~~-~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPFD-ELKDSHRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred cccCcCCCCCCccccccc-ccccccceeCCCCCCccccccchhhhhhccC
Confidence 366666666543211100 0111235777777777777777777766664
No 77
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=72.12 E-value=2 Score=24.21 Aligned_cols=8 Identities=38% Similarity=1.173 Sum_probs=3.4
Q ss_pred cccccccc
Q psy8499 460 CNLCSKEF 467 (591)
Q Consensus 460 C~~C~~~f 467 (591)
|+.||+.|
T Consensus 17 Cp~CG~~F 24 (26)
T PF10571_consen 17 CPHCGYDF 24 (26)
T ss_pred CCCCCCCC
Confidence 44444443
No 78
>KOG2186|consensus
Probab=71.16 E-value=2.5 Score=38.55 Aligned_cols=45 Identities=24% Similarity=0.561 Sum_probs=24.6
Q ss_pred CCccccccccCChhhHHHHHHhhCCcccccccccccccchhhHHHHH
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSKEFYRKQNLTKHM 477 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~L~~H~ 477 (591)
|.|..||.....+ .+.+|+..-++.-|.|-.|++.|.. .....|.
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~ 48 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHT 48 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCCeeEEeeccccccc-chhhhhh
Confidence 3466666555433 3444665555555666666666655 4455554
No 79
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=69.78 E-value=2.8 Score=28.68 Aligned_cols=31 Identities=19% Similarity=0.384 Sum_probs=14.1
Q ss_pred CCccccccccCChhhHHHHHHhhCCcccccccccc
Q psy8499 431 NKCNLCSKEFSCKSHLKEHMQVHQGITYKCNLCSK 465 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~ 465 (591)
|.|+.||..-..+. +.-.-.+.+|.|+.||.
T Consensus 28 F~CPnCGe~~I~Rc----~~CRk~g~~Y~Cp~CGF 58 (61)
T COG2888 28 FPCPNCGEVEIYRC----AKCRKLGNPYRCPKCGF 58 (61)
T ss_pred eeCCCCCceeeehh----hhHHHcCCceECCCcCc
Confidence 45666654433222 11222344566666653
No 80
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=68.25 E-value=1.8 Score=28.89 Aligned_cols=21 Identities=24% Similarity=0.626 Sum_probs=10.7
Q ss_pred cccCcCccccccchhHHhHhh
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMK 586 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~ 586 (591)
+.|+-||.-|....+..+|+.
T Consensus 18 lrCPRC~~~FR~~K~Y~RHVN 38 (65)
T COG4049 18 LRCPRCGMVFRRRKDYIRHVN 38 (65)
T ss_pred eeCCchhHHHHHhHHHHHHhh
Confidence 455555555555555555543
No 81
>KOG2186|consensus
Probab=67.32 E-value=3.2 Score=37.82 Aligned_cols=46 Identities=22% Similarity=0.536 Sum_probs=30.1
Q ss_pred ccccccccccCCHHHHHHHHHHhCCCCCccccchhccCCchhHHHHHH
Q psy8499 377 YKCDLCLKEYSNKIYLTEHLKLHQGIKNKCVLCSKEFSRKRRLTEHMK 424 (591)
Q Consensus 377 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~ 424 (591)
|.|..||..... ..+.+|+..-++..+.|..|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRNAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccCCeeEEeeccccccc-chhhhhhh
Confidence 667777766543 34555777766666777777777776 66666654
No 82
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=66.04 E-value=2.8 Score=28.06 Aligned_cols=26 Identities=12% Similarity=0.321 Sum_probs=19.1
Q ss_pred ccccccccccccCChhhHHHHHHhhC
Q psy8499 105 IGYKCDLCSKEYSYKHTLTEHMKLHQ 130 (591)
Q Consensus 105 ~~~~C~~C~~~f~~~~~l~~H~~~h~ 130 (591)
.-+.|+.||..|....++.+|+...+
T Consensus 16 ~~lrCPRC~~~FR~~K~Y~RHVNKaH 41 (65)
T COG4049 16 EFLRCPRCGMVFRRRKDYIRHVNKAH 41 (65)
T ss_pred eeeeCCchhHHHHHhHHHHHHhhHHh
Confidence 34678888888888888888876433
No 83
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=62.02 E-value=4.5 Score=32.78 Aligned_cols=14 Identities=29% Similarity=0.508 Sum_probs=7.0
Q ss_pred CCCCCccccccccC
Q psy8499 428 GIKNKCNLCSKEFS 441 (591)
Q Consensus 428 ~~~~~C~~C~~~f~ 441 (591)
|.|..|+.||++|-
T Consensus 7 GtKr~Cp~cg~kFY 20 (129)
T TIGR02300 7 GTKRICPNTGSKFY 20 (129)
T ss_pred CccccCCCcCcccc
Confidence 33445555555554
No 84
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=62.02 E-value=4.9 Score=24.38 Aligned_cols=9 Identities=22% Similarity=0.811 Sum_probs=3.6
Q ss_pred ccccccccc
Q psy8499 458 YKCNLCSKE 466 (591)
Q Consensus 458 ~~C~~C~~~ 466 (591)
|.|..||..
T Consensus 3 ~~C~~CG~i 11 (34)
T cd00729 3 WVCPVCGYI 11 (34)
T ss_pred EECCCCCCE
Confidence 334444433
No 85
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=61.78 E-value=2.7 Score=26.18 Aligned_cols=8 Identities=38% Similarity=1.103 Sum_probs=3.2
Q ss_pred cccccccc
Q psy8499 460 CNLCSKEF 467 (591)
Q Consensus 460 C~~C~~~f 467 (591)
|+.|+..|
T Consensus 5 CP~C~~~~ 12 (38)
T TIGR02098 5 CPNCKTSF 12 (38)
T ss_pred CCCCCCEE
Confidence 34444333
No 86
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=61.56 E-value=2.8 Score=26.53 Aligned_cols=11 Identities=36% Similarity=1.078 Sum_probs=5.7
Q ss_pred ccccccccccc
Q psy8499 458 YKCNLCSKEFY 468 (591)
Q Consensus 458 ~~C~~C~~~f~ 468 (591)
|.|+.||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 45555555554
No 87
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=60.61 E-value=4.7 Score=32.39 Aligned_cols=25 Identities=24% Similarity=0.518 Sum_probs=16.5
Q ss_pred ccc----CcCccccccchhHHhHhhhccC
Q psy8499 566 YKC----DLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 566 ~~C----~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
|.| ..|++.+.+...+++|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 566 6666666666666666666654
No 88
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=56.90 E-value=2.6 Score=38.64 Aligned_cols=18 Identities=33% Similarity=0.687 Sum_probs=9.9
Q ss_pred ccccccccccccchhhHH
Q psy8499 457 TYKCNLCSKEFYRKQNLT 474 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~~L~ 474 (591)
.+.||+|++.|.++.-+.
T Consensus 5 ~~~CPvC~~~F~~~~vrs 22 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRS 22 (214)
T ss_pred ceECCCCCCeeeeeEEEc
Confidence 345666666666554333
No 89
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=55.14 E-value=4.3 Score=27.41 Aligned_cols=11 Identities=36% Similarity=1.165 Sum_probs=5.5
Q ss_pred ccccccccccc
Q psy8499 458 YKCNLCSKEFY 468 (591)
Q Consensus 458 ~~C~~C~~~f~ 468 (591)
|.|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44555555444
No 90
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=54.62 E-value=4.5 Score=24.89 Aligned_cols=8 Identities=25% Similarity=1.049 Sum_probs=3.1
Q ss_pred cccccccc
Q psy8499 433 CNLCSKEF 440 (591)
Q Consensus 433 C~~C~~~f 440 (591)
|+.|+..|
T Consensus 5 Cp~C~~~y 12 (36)
T PF13717_consen 5 CPNCQAKY 12 (36)
T ss_pred CCCCCCEE
Confidence 33333333
No 91
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=54.54 E-value=8.1 Score=25.82 Aligned_cols=11 Identities=45% Similarity=1.075 Sum_probs=5.4
Q ss_pred CCCcchhhcCC
Q psy8499 540 KCDLCSKEFST 550 (591)
Q Consensus 540 ~C~~C~~~f~~ 550 (591)
.|..|++.+..
T Consensus 20 ~C~~C~~~l~~ 30 (50)
T smart00614 20 KCKYCGKKLSR 30 (50)
T ss_pred EecCCCCEeee
Confidence 45555555443
No 92
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=53.22 E-value=6.7 Score=32.51 Aligned_cols=21 Identities=38% Similarity=0.789 Sum_probs=8.3
Q ss_pred ccCcCccccccchhHHhHhhhccC
Q psy8499 567 KCDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 567 ~C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
.|-+||+.|.. |++|+++|+|
T Consensus 74 ~clecGk~~k~---LkrHL~~~~g 94 (132)
T PF05443_consen 74 ICLECGKKFKT---LKRHLRTHHG 94 (132)
T ss_dssp E-TBT--EESB---HHHHHHHTT-
T ss_pred EEccCCcccch---HHHHHHHccC
Confidence 35555555443 3555555544
No 93
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=52.58 E-value=7.6 Score=25.20 Aligned_cols=11 Identities=36% Similarity=0.824 Sum_probs=5.2
Q ss_pred ccccccccccc
Q psy8499 458 YKCNLCSKEFY 468 (591)
Q Consensus 458 ~~C~~C~~~f~ 468 (591)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 44444444444
No 94
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=52.02 E-value=11 Score=21.22 Aligned_cols=19 Identities=21% Similarity=0.758 Sum_probs=10.0
Q ss_pred CCCcchhhcCChhhHHHHhh
Q psy8499 540 KCDLCSKEFSTKHTLTEHMK 559 (591)
Q Consensus 540 ~C~~C~~~f~~~~~l~~H~~ 559 (591)
.|++|++.+ ....++.|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 355565555 4455555543
No 95
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=51.21 E-value=3.8 Score=37.59 Aligned_cols=20 Identities=35% Similarity=0.665 Sum_probs=11.4
Q ss_pred CCCccCccccCChhhHHHHH
Q psy8499 485 KKCDLCSKEFSTKCSLTEHM 504 (591)
Q Consensus 485 ~~C~~C~~~f~~~~~L~~H~ 504 (591)
..|++|+..|.++.-+....
T Consensus 6 ~~CPvC~~~F~~~~vrs~~~ 25 (214)
T PF09986_consen 6 ITCPVCGKEFKTKKVRSGKI 25 (214)
T ss_pred eECCCCCCeeeeeEEEcCCc
Confidence 45677777666654444333
No 96
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=50.61 E-value=9.9 Score=30.89 Aligned_cols=33 Identities=12% Similarity=0.142 Sum_probs=19.2
Q ss_pred CCCCCCCccccccCChHHHHHHHHhcCCccccccccccccCCH
Q psy8499 347 GIKNKCDLCSKEFSCKRYVTEHMKLHQEITYKCDLCLKEYSNK 389 (591)
Q Consensus 347 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~ 389 (591)
|.+..|+.||+.|-... ..|..|+.||..|...
T Consensus 7 GtKr~Cp~cg~kFYDLn----------k~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 7 GTKRICPNTGSKFYDLN----------RRPAVSPYTGEQFPPE 39 (129)
T ss_pred CccccCCCcCccccccC----------CCCccCCCcCCccCcc
Confidence 44456666666664321 3466777777776554
No 97
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=50.35 E-value=5 Score=25.69 Aligned_cols=13 Identities=38% Similarity=0.933 Sum_probs=6.7
Q ss_pred cccccccccccch
Q psy8499 458 YKCNLCSKEFYRK 470 (591)
Q Consensus 458 ~~C~~C~~~f~~~ 470 (591)
|.|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 6 YRCEECGHEFEVL 18 (42)
T ss_pred EEeCCCCCEEEEE
Confidence 4555555555443
No 98
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=50.21 E-value=9.1 Score=33.12 Aligned_cols=17 Identities=6% Similarity=0.389 Sum_probs=7.9
Q ss_pred cccccccccccCChhhH
Q psy8499 106 GYKCDLCSKEYSYKHTL 122 (591)
Q Consensus 106 ~~~C~~C~~~f~~~~~l 122 (591)
.|.|+.|+..|+...++
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 34455555444444444
No 99
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=49.69 E-value=9 Score=24.79 Aligned_cols=20 Identities=30% Similarity=0.635 Sum_probs=9.3
Q ss_pred cccCcCccccccc----hhHHhHh
Q psy8499 566 YKCDLCTKEFSRK----GRLTEHM 585 (591)
Q Consensus 566 ~~C~~C~~~f~~~----~~l~~H~ 585 (591)
..|.+|++.+... +.|.+|+
T Consensus 17 a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 17 AKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp EEETTTTEE-----SSTHHHHHHH
T ss_pred EEeCCCCeEEeeCCCcHHHHHHhh
Confidence 4566666555442 5566665
No 100
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=49.18 E-value=9.2 Score=33.11 Aligned_cols=16 Identities=13% Similarity=0.231 Sum_probs=7.6
Q ss_pred CCccccccccCChhhH
Q psy8499 431 NKCNLCSKEFSCKSHL 446 (591)
Q Consensus 431 ~~C~~C~~~f~~~~~l 446 (591)
|.|+.|+..|+...++
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 3455555554444444
No 101
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=49.02 E-value=7.3 Score=22.84 Aligned_cols=26 Identities=35% Similarity=0.786 Sum_probs=10.5
Q ss_pred CCCCcchhhcCChhhHHHHhhhcCCcccccCcCccc
Q psy8499 539 NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKE 574 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~ 574 (591)
++|+.|+..+.. +.+..|.|+.|+..
T Consensus 3 p~Cp~C~se~~y----------~D~~~~vCp~C~~e 28 (30)
T PF08274_consen 3 PKCPLCGSEYTY----------EDGELLVCPECGHE 28 (30)
T ss_dssp ---TTT-----E----------E-SSSEEETTTTEE
T ss_pred CCCCCCCCccee----------ccCCEEeCCccccc
Confidence 356666665544 44456777777643
No 102
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=48.42 E-value=6.4 Score=33.64 Aligned_cols=33 Identities=18% Similarity=0.465 Sum_probs=21.0
Q ss_pred CCCCcchhhcCChhhHHH-HhhhcCCcccccCcCcccc
Q psy8499 539 NKCDLCSKEFSTKHTLTE-HMKLHQGIRYKCDLCTKEF 575 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~-H~~~h~~~~~~C~~C~~~f 575 (591)
|.|+.|+..|....++.. . ....|.|+.||...
T Consensus 100 Y~Cp~C~~~y~~~ea~~~~d----~~~~f~Cp~Cg~~l 133 (147)
T smart00531 100 YKCPNCQSKYTFLEANQLLD----MDGTFTCPRCGEEL 133 (147)
T ss_pred EECcCCCCEeeHHHHHHhcC----CCCcEECCCCCCEE
Confidence 578888888776555433 2 12238888888765
No 103
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=48.31 E-value=9.7 Score=32.53 Aligned_cols=35 Identities=26% Similarity=0.608 Sum_probs=16.2
Q ss_pred ccccccccccccCChhhHHHHHHhhCCCccccCccccc
Q psy8499 105 IGYKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSKE 142 (591)
Q Consensus 105 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~ 142 (591)
..|.|+.|+..|+....+.. ......|.|+.||..
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~---~d~~~~f~Cp~Cg~~ 132 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQL---LDMDGTFTCPRCGEE 132 (147)
T ss_pred cEEECcCCCCEeeHHHHHHh---cCCCCcEECCCCCCE
Confidence 34666666666654333221 001123666666554
No 104
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=47.36 E-value=11 Score=23.42 Aligned_cols=10 Identities=30% Similarity=0.923 Sum_probs=4.1
Q ss_pred cccCcccccc
Q psy8499 188 YKCNLCSKEF 197 (591)
Q Consensus 188 ~~C~~C~~~f 197 (591)
.+|+.|+..|
T Consensus 26 vrC~~C~~~f 35 (37)
T PF13719_consen 26 VRCPKCGHVF 35 (37)
T ss_pred EECCCCCcEe
Confidence 3444444333
No 105
>PF15269 zf-C2H2_7: Zinc-finger
Probab=47.29 E-value=14 Score=23.58 Aligned_cols=23 Identities=48% Similarity=0.713 Sum_probs=18.9
Q ss_pred ccccCccccccCChhHHHHHHHH
Q psy8499 187 KYKCNLCSKEFSSKSYLTEHMKL 209 (591)
Q Consensus 187 ~~~C~~C~~~f~~~~~l~~H~~~ 209 (591)
.|+|-.|+.+...++.|..||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 47888888888888888888863
No 106
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=46.25 E-value=6 Score=25.90 Aligned_cols=10 Identities=50% Similarity=1.275 Sum_probs=4.4
Q ss_pred cccccccccc
Q psy8499 458 YKCNLCSKEF 467 (591)
Q Consensus 458 ~~C~~C~~~f 467 (591)
|.|+.||..|
T Consensus 4 y~C~~CG~~~ 13 (46)
T PRK00398 4 YKCARCGREV 13 (46)
T ss_pred EECCCCCCEE
Confidence 4444444443
No 107
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=45.96 E-value=12 Score=25.79 Aligned_cols=32 Identities=25% Similarity=0.429 Sum_probs=17.4
Q ss_pred CCCCCcchhhcCCh-hhHHHHhhhcCCcccccCcCccc
Q psy8499 538 KNKCDLCSKEFSTK-HTLTEHMKLHQGIRYKCDLCTKE 574 (591)
Q Consensus 538 ~~~C~~C~~~f~~~-~~l~~H~~~h~~~~~~C~~C~~~ 574 (591)
+|.|+.||+.-..+ ..- +-.+.+|.|+.||..
T Consensus 25 ~F~CPnCG~~~I~RC~~C-----Rk~~~~Y~CP~CGF~ 57 (59)
T PRK14890 25 KFLCPNCGEVIIYRCEKC-----RKQSNPYTCPKCGFE 57 (59)
T ss_pred EeeCCCCCCeeEeechhH-----HhcCCceECCCCCCc
Confidence 46677777652221 111 112457888888863
No 108
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=45.92 E-value=8.5 Score=28.84 Aligned_cols=11 Identities=36% Similarity=1.044 Sum_probs=6.1
Q ss_pred ccccccccccc
Q psy8499 458 YKCNLCSKEFY 468 (591)
Q Consensus 458 ~~C~~C~~~f~ 468 (591)
|.|..||..|.
T Consensus 54 W~C~kCg~~fA 64 (89)
T COG1997 54 WKCRKCGAKFA 64 (89)
T ss_pred EEcCCCCCeec
Confidence 55555555554
No 109
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=45.31 E-value=3.8 Score=35.11 Aligned_cols=15 Identities=27% Similarity=0.574 Sum_probs=8.5
Q ss_pred cccccccccccchhh
Q psy8499 458 YKCNLCSKEFYRKQN 472 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~ 472 (591)
|+|+.||+.|.+...
T Consensus 29 ~~c~~c~~~f~~~e~ 43 (154)
T PRK00464 29 RECLACGKRFTTFER 43 (154)
T ss_pred eeccccCCcceEeEe
Confidence 556666666655443
No 110
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=44.63 E-value=11 Score=33.43 Aligned_cols=16 Identities=13% Similarity=0.567 Sum_probs=7.8
Q ss_pred cccccccccccCChhh
Q psy8499 106 GYKCDLCSKEYSYKHT 121 (591)
Q Consensus 106 ~~~C~~C~~~f~~~~~ 121 (591)
.|.|+.|+..|+...+
T Consensus 117 ~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPNCHIRFTFDEA 132 (178)
T ss_pred EEECCCCCcEEeHHHH
Confidence 4555555555544443
No 111
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=44.42 E-value=12 Score=33.19 Aligned_cols=14 Identities=21% Similarity=0.458 Sum_probs=6.0
Q ss_pred CCccccccccCChh
Q psy8499 431 NKCNLCSKEFSCKS 444 (591)
Q Consensus 431 ~~C~~C~~~f~~~~ 444 (591)
|.|+.|+..|+...
T Consensus 118 Y~Cp~C~~rytf~e 131 (178)
T PRK06266 118 FFCPNCHIRFTFDE 131 (178)
T ss_pred EECCCCCcEEeHHH
Confidence 34444444444333
No 112
>PF14353 CpXC: CpXC protein
Probab=44.14 E-value=17 Score=30.08 Aligned_cols=15 Identities=20% Similarity=0.698 Sum_probs=6.9
Q ss_pred ccccccccccccchh
Q psy8499 457 TYKCNLCSKEFYRKQ 471 (591)
Q Consensus 457 ~~~C~~C~~~f~~~~ 471 (591)
.|.|+.||..|.-..
T Consensus 38 ~~~CP~Cg~~~~~~~ 52 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEY 52 (128)
T ss_pred EEECCCCCCceecCC
Confidence 344555554444333
No 113
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=43.74 E-value=17 Score=21.71 Aligned_cols=11 Identities=36% Similarity=0.869 Sum_probs=5.5
Q ss_pred ccccccccccC
Q psy8499 107 YKCDLCSKEYS 117 (591)
Q Consensus 107 ~~C~~C~~~f~ 117 (591)
|.|..|+..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 44555655553
No 114
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=39.61 E-value=6 Score=28.81 Aligned_cols=13 Identities=23% Similarity=0.618 Sum_probs=7.4
Q ss_pred cccC--cCccccccc
Q psy8499 566 YKCD--LCTKEFSRK 578 (591)
Q Consensus 566 ~~C~--~C~~~f~~~ 578 (591)
++|. .||.+|+..
T Consensus 28 ~qC~N~eCg~tF~t~ 42 (72)
T PRK09678 28 HQCQNVNCSATFITY 42 (72)
T ss_pred eecCCCCCCCEEEEE
Confidence 5565 566666544
No 115
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=38.69 E-value=14 Score=24.52 Aligned_cols=10 Identities=50% Similarity=1.275 Sum_probs=5.6
Q ss_pred cccccccccc
Q psy8499 458 YKCNLCSKEF 467 (591)
Q Consensus 458 ~~C~~C~~~f 467 (591)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 5555555555
No 116
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=38.61 E-value=11 Score=34.53 Aligned_cols=38 Identities=26% Similarity=0.502 Sum_probs=18.2
Q ss_pred CCCCCcccccccccCChhHHHHHHhhhc---CcccccCcccc
Q psy8499 157 QGMKNKCVLCSKEFSSKSYLTEHMKRHQ---GIKYKCNLCSK 195 (591)
Q Consensus 157 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~---~~~~~C~~C~~ 195 (591)
.+.-|+|.+|+.- .-...--.|+.+-. ...|+|..|++
T Consensus 139 GGrif~CsfC~~f-lCEDDQFEHQAsCQvLe~E~~KC~SCNr 179 (314)
T PF06524_consen 139 GGRIFKCSFCDNF-LCEDDQFEHQASCQVLESETFKCQSCNR 179 (314)
T ss_pred CCeEEEeecCCCe-eeccchhhhhhhhhhhhccccccccccc
Confidence 3334677776542 22233334443322 23577766665
No 117
>PRK04023 DNA polymerase II large subunit; Validated
Probab=38.19 E-value=24 Score=39.51 Aligned_cols=9 Identities=56% Similarity=0.888 Sum_probs=4.2
Q ss_pred ccccccccc
Q psy8499 377 YKCDLCLKE 385 (591)
Q Consensus 377 ~~C~~C~~~ 385 (591)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 345555543
No 118
>KOG2807|consensus
Probab=37.97 E-value=51 Score=31.63 Aligned_cols=34 Identities=24% Similarity=0.547 Sum_probs=20.5
Q ss_pred ccccccccccccCChhhHHHHHHhhCCCccccCccccccCChhHHHH
Q psy8499 105 IGYKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSKEFSCKRYLTE 151 (591)
Q Consensus 105 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~ 151 (591)
.+|.|+.|.... -..|-.|++|+-+..+...|.+
T Consensus 275 ~Gy~CP~Ckakv-------------CsLP~eCpiC~ltLVss~hLAR 308 (378)
T KOG2807|consen 275 GGYFCPQCKAKV-------------CSLPIECPICSLTLVSSPHLAR 308 (378)
T ss_pred CceeCCcccCee-------------ecCCccCCccceeEecchHHHH
Confidence 467777776432 1236677777776666655544
No 119
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=36.35 E-value=6.7 Score=41.82 Aligned_cols=56 Identities=20% Similarity=0.360 Sum_probs=28.9
Q ss_pred CCccCccccCChhhHHHHHhhhC-CCCCCCCcchhcccCchhHHHHHHhhcCCCCCCCcchh
Q psy8499 486 KCDLCSKEFSTKCSLTEHMKLHQ-GIKKKCDLCSKEFSCKRSVTEHMKLHLGIKNKCDLCSK 546 (591)
Q Consensus 486 ~C~~C~~~f~~~~~L~~H~~~h~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~ 546 (591)
.|..||-+|+-..+|=.=...-+ ..-+-|+.|.+.|.+..+-+-| .+...|+.||-
T Consensus 125 ~CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP~nRRfH-----AQp~aCp~CGP 181 (750)
T COG0068 125 NCTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDPLNRRFH-----AQPIACPKCGP 181 (750)
T ss_pred ccCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCccccccc-----cccccCcccCC
Confidence 46666666665544433221111 1112577777777666553322 34445777764
No 120
>PHA00626 hypothetical protein
Probab=35.20 E-value=17 Score=24.61 Aligned_cols=15 Identities=27% Similarity=0.366 Sum_probs=12.1
Q ss_pred cccccCcCccccccc
Q psy8499 564 IRYKCDLCTKEFSRK 578 (591)
Q Consensus 564 ~~~~C~~C~~~f~~~ 578 (591)
.+|+|+.||+.|+..
T Consensus 22 nrYkCkdCGY~ft~~ 36 (59)
T PHA00626 22 DDYVCCDCGYNDSKD 36 (59)
T ss_pred cceEcCCCCCeechh
Confidence 479999999998754
No 121
>KOG4167|consensus
Probab=34.16 E-value=12 Score=39.74 Aligned_cols=24 Identities=33% Similarity=0.825 Sum_probs=22.8
Q ss_pred cccccccccccchhhHHHHHHhhC
Q psy8499 458 YKCNLCSKEFYRKQNLTKHMKLHQ 481 (591)
Q Consensus 458 ~~C~~C~~~f~~~~~L~~H~~~h~ 481 (591)
|.|..|++.|.....+..||++|.
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHH
Confidence 999999999999999999999995
No 122
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=33.96 E-value=20 Score=29.74 Aligned_cols=23 Identities=39% Similarity=0.868 Sum_probs=12.5
Q ss_pred CCccCccccCChhhHHHHHhhhCCCC
Q psy8499 486 KCDLCSKEFSTKCSLTEHMKLHQGIK 511 (591)
Q Consensus 486 ~C~~C~~~f~~~~~L~~H~~~h~~~~ 511 (591)
.|-+||+.|.+ |++|++.|+|..
T Consensus 74 ~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 74 ICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp E-TBT--EESB---HHHHHHHTT-S-
T ss_pred EEccCCcccch---HHHHHHHccCCC
Confidence 57777777764 477777776654
No 123
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=33.41 E-value=22 Score=19.65 Aligned_cols=10 Identities=20% Similarity=0.720 Sum_probs=6.3
Q ss_pred cccccCcCcc
Q psy8499 564 IRYKCDLCTK 573 (591)
Q Consensus 564 ~~~~C~~C~~ 573 (591)
.+|.|+-||+
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 3577777763
No 124
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=32.72 E-value=28 Score=32.88 Aligned_cols=24 Identities=38% Similarity=0.807 Sum_probs=14.8
Q ss_pred ccccCccccccCChhHHHHHHHHc
Q psy8499 187 KYKCNLCSKEFSSKSYLTEHMKLH 210 (591)
Q Consensus 187 ~~~C~~C~~~f~~~~~l~~H~~~h 210 (591)
.|.|+.|...|-..-+...|...|
T Consensus 388 rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ceechhhhhhhhhhhHHHHHHHHh
Confidence 466666666666666655565544
No 125
>KOG4167|consensus
Probab=32.33 E-value=10 Score=40.34 Aligned_cols=24 Identities=33% Similarity=0.589 Sum_probs=20.8
Q ss_pred cccCcCccccccchhHHhHhhhcc
Q psy8499 566 YKCDLCTKEFSRKGRLTEHMKLHL 589 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~~~H~ 589 (591)
|.|..||+.|..-.++..||++|-
T Consensus 793 FpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 793 FPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eehHHHHHHHHHHhhhhHHHHHHH
Confidence 889999999988888888988884
No 126
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=31.07 E-value=25 Score=30.46 Aligned_cols=24 Identities=17% Similarity=0.468 Sum_probs=16.5
Q ss_pred cccccccccccCChhhHHHHHHhhCCCccccCcccc
Q psy8499 106 GYKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSK 141 (591)
Q Consensus 106 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~ 141 (591)
.|.|++||.++.. ..|-.||+||.
T Consensus 134 ~~vC~vCGy~~~g------------e~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTHEG------------EAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcccC------------CCCCcCCCCCC
Confidence 5888888876532 23678888873
No 127
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=29.94 E-value=26 Score=23.50 Aligned_cols=12 Identities=25% Similarity=0.540 Sum_probs=8.0
Q ss_pred cccccCcCcccc
Q psy8499 564 IRYKCDLCTKEF 575 (591)
Q Consensus 564 ~~~~C~~C~~~f 575 (591)
.++.|.-||..+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 467777777654
No 128
>PRK04023 DNA polymerase II large subunit; Validated
Probab=29.72 E-value=42 Score=37.73 Aligned_cols=9 Identities=56% Similarity=1.065 Sum_probs=4.9
Q ss_pred eecccCccc
Q psy8499 296 YKCDLCSKE 304 (591)
Q Consensus 296 ~~C~~C~~~ 304 (591)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 455555554
No 129
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=29.35 E-value=25 Score=23.92 Aligned_cols=27 Identities=26% Similarity=0.676 Sum_probs=18.4
Q ss_pred cccccccccccCChhhHHHHHHhhCCCccccCcccccc
Q psy8499 106 GYKCDLCSKEYSYKHTLTEHMKLHQGIRYKCDLCSKEF 143 (591)
Q Consensus 106 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f 143 (591)
..+|+.|++.|... +....|+.|+..+
T Consensus 5 ~~~C~~Cg~~~~~~-----------dDiVvCp~Cgapy 31 (54)
T PF14446_consen 5 GCKCPVCGKKFKDG-----------DDIVVCPECGAPY 31 (54)
T ss_pred CccChhhCCcccCC-----------CCEEECCCCCCcc
Confidence 45788888888643 2246788888654
No 130
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=28.52 E-value=27 Score=28.65 Aligned_cols=20 Identities=35% Similarity=0.607 Sum_probs=9.1
Q ss_pred cCcCccccccchhHHhHhhhccC
Q psy8499 568 CDLCTKEFSRKGRLTEHMKLHLG 590 (591)
Q Consensus 568 C~~C~~~f~~~~~l~~H~~~H~~ 590 (591)
|-.+|+.|. +|++|+.+|.|
T Consensus 79 cLEDGkkfK---SLKRHL~t~~g 98 (148)
T COG4957 79 CLEDGKKFK---SLKRHLTTHYG 98 (148)
T ss_pred EeccCcchH---HHHHHHhcccC
Confidence 444444443 24444444444
No 131
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=28.41 E-value=30 Score=37.99 Aligned_cols=44 Identities=25% Similarity=0.643 Sum_probs=0.0
Q ss_pred CccCccccCChhhHHHHHhhhCCCCCCCCcchhcccCchhHHHHHHhhcCCC-CCCCcchhhcCChhhHHHHhhhcCCcc
Q psy8499 487 CDLCSKEFSTKCSLTEHMKLHQGIKKKCDLCSKEFSCKRSVTEHMKLHLGIK-NKCDLCSKEFSTKHTLTEHMKLHQGIR 565 (591)
Q Consensus 487 C~~C~~~f~~~~~L~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~~ 565 (591)
|..||..+ +|+.|+.. +..|.... ..|..||. +...|
T Consensus 438 C~~Cg~v~------------------~Cp~Cd~~----------lt~H~~~~~L~CH~Cg~--------------~~~~p 475 (730)
T COG1198 438 CRDCGYIA------------------ECPNCDSP----------LTLHKATGQLRCHYCGY--------------QEPIP 475 (730)
T ss_pred cccCCCcc------------------cCCCCCcc----------eEEecCCCeeEeCCCCC--------------CCCCC
Q ss_pred cccCcCc
Q psy8499 566 YKCDLCT 572 (591)
Q Consensus 566 ~~C~~C~ 572 (591)
..|+.||
T Consensus 476 ~~Cp~Cg 482 (730)
T COG1198 476 QSCPECG 482 (730)
T ss_pred CCCCCCC
No 132
>KOG2593|consensus
Probab=27.98 E-value=44 Score=33.56 Aligned_cols=34 Identities=21% Similarity=0.497 Sum_probs=22.7
Q ss_pred CCCCcchhhcCChhhHHHHhhhcCCcccccCcCccc
Q psy8499 539 NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKE 574 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~ 574 (591)
|.|+.|.+.|+...+++.= .-....|.|..|+-.
T Consensus 129 Y~Cp~C~kkyt~Lea~~L~--~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 129 YVCPNCQKKYTSLEALQLL--DNETGEFHCENCGGE 162 (436)
T ss_pred ccCCccccchhhhHHHHhh--cccCceEEEecCCCc
Confidence 7888888888887776532 111236888888754
No 133
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=26.60 E-value=17 Score=33.48 Aligned_cols=78 Identities=23% Similarity=0.514 Sum_probs=41.6
Q ss_pred cccCcccc-ccCChhHHHHHHHHhCCCCCcccccccccCChhHHHHHHhhhcCcccccCccccccCChhHHHHHHHHc--
Q psy8499 134 YKCDLCSK-EFSCKRYLTEHMKVHQGMKNKCVLCSKEFSSKSYLTEHMKRHQGIKYKCNLCSKEFSSKSYLTEHMKLH-- 210 (591)
Q Consensus 134 ~~C~~C~~-~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~~~~~l~~H~~~h-- 210 (591)
-.|+.|.. ....+.-|..|--.-.-..-.|.+|.+. +-.|-|+.|+|..|..-. -...-..|+..-
T Consensus 98 aiCDfCEawvCHgrkCl~~HaC~Cpl~da~C~EC~R~----------vw~hGGrif~CsfC~~fl-CEDDQFEHQAsCQv 166 (314)
T PF06524_consen 98 AICDFCEAWVCHGRKCLSTHACTCPLQDAVCIECERG----------VWDHGGRIFKCSFCDNFL-CEDDQFEHQASCQV 166 (314)
T ss_pred hhhccchhheeccccccccccccCcCCCcEeeeeecc----------cccCCCeEEEeecCCCee-eccchhhhhhhhhh
Confidence 34666642 2233333444422211122356777654 334667789999997543 333344565432
Q ss_pred -CCCccccCCCcc
Q psy8499 211 -QEITYKCDLCLK 222 (591)
Q Consensus 211 -~~~~~~C~~C~~ 222 (591)
....|+|.-|++
T Consensus 167 Le~E~~KC~SCNr 179 (314)
T PF06524_consen 167 LESETFKCQSCNR 179 (314)
T ss_pred hhccccccccccc
Confidence 235788888875
No 134
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=26.16 E-value=32 Score=28.31 Aligned_cols=14 Identities=14% Similarity=0.551 Sum_probs=7.8
Q ss_pred cccccccccccCCH
Q psy8499 376 TYKCDLCLKEYSNK 389 (591)
Q Consensus 376 ~~~C~~C~~~f~~~ 389 (591)
|++|..||+.|.+.
T Consensus 1 PH~Ct~Cg~~f~dg 14 (131)
T PF09845_consen 1 PHQCTKCGRVFEDG 14 (131)
T ss_pred CcccCcCCCCcCCC
Confidence 34566666666543
No 135
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=24.98 E-value=32 Score=23.47 Aligned_cols=30 Identities=27% Similarity=0.506 Sum_probs=15.4
Q ss_pred CCCCcchhhcCChhhHHHHhhhcCCcccccCcCcccc
Q psy8499 539 NKCDLCSKEFSTKHTLTEHMKLHQGIRYKCDLCTKEF 575 (591)
Q Consensus 539 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f 575 (591)
++|+.||..+.-.... .|....|+.||..+
T Consensus 3 ~~CP~CG~~iev~~~~-------~GeiV~Cp~CGael 32 (54)
T TIGR01206 3 FECPDCGAEIELENPE-------LGELVICDECGAEL 32 (54)
T ss_pred cCCCCCCCEEecCCCc-------cCCEEeCCCCCCEE
Confidence 4566666655332211 13345677776655
No 136
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=24.74 E-value=23 Score=32.30 Aligned_cols=25 Identities=24% Similarity=0.538 Sum_probs=15.8
Q ss_pred ccccCcCccccccchhHHhHhhhcc
Q psy8499 565 RYKCDLCTKEFSRKGRLTEHMKLHL 589 (591)
Q Consensus 565 ~~~C~~C~~~f~~~~~l~~H~~~H~ 589 (591)
.|.|++|+|.|.-..-.+.|+..-|
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred EECCCCCCcccCChHHHHHHHhhcC
Confidence 5777777777777777777766543
No 137
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=23.43 E-value=27 Score=24.15 Aligned_cols=27 Identities=19% Similarity=0.497 Sum_probs=12.3
Q ss_pred hhhHHHHhhhc-CCcccccCc----Ccccccc
Q psy8499 551 KHTLTEHMKLH-QGIRYKCDL----CTKEFSR 577 (591)
Q Consensus 551 ~~~l~~H~~~h-~~~~~~C~~----C~~~f~~ 577 (591)
+..|..|+... ...+..|+. |+..+..
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EEEH
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCccch
Confidence 45566666532 223455666 6665543
No 138
>KOG2593|consensus
Probab=22.66 E-value=44 Score=33.57 Aligned_cols=34 Identities=21% Similarity=0.419 Sum_probs=16.3
Q ss_pred ccccCccccccCChhHHHHHHHHhCCCCCccccccc
Q psy8499 133 RYKCDLCSKEFSCKRYLTEHMKVHQGMKNKCVLCSK 168 (591)
Q Consensus 133 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~ 168 (591)
.|.|+.|.+.|.+..+++.= .-....|.|..|+-
T Consensus 128 ~Y~Cp~C~kkyt~Lea~~L~--~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEALQLL--DNETGEFHCENCGG 161 (436)
T ss_pred cccCCccccchhhhHHHHhh--cccCceEEEecCCC
Confidence 36666666666555443321 01112356666654
No 139
>smart00349 KRAB krueppel associated box.
Probab=22.45 E-value=37 Score=21.37 Aligned_cols=18 Identities=22% Similarity=0.272 Sum_probs=6.9
Q ss_pred ccccccccCCCCCCCCCc
Q psy8499 5 VEESKDHVGLSPPFYDPS 22 (591)
Q Consensus 5 ~e~~~~~~~~~~~~~~p~ 22 (591)
++++..+...|.....|+
T Consensus 31 ~~~~~~~~~~~~~~~~~~ 48 (61)
T smart00349 31 LENYSNLVSLGFQVPKPD 48 (61)
T ss_pred HHhhccccccCCCcCCcc
Confidence 333444433343333333
No 140
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=22.31 E-value=41 Score=24.15 Aligned_cols=9 Identities=44% Similarity=1.309 Sum_probs=2.7
Q ss_pred CCCcchhcc
Q psy8499 513 KCDLCSKEF 521 (591)
Q Consensus 513 ~C~~C~~~f 521 (591)
.|..|++.|
T Consensus 11 ~C~~C~~~F 19 (69)
T PF01363_consen 11 NCMICGKKF 19 (69)
T ss_dssp B-TTT--B-
T ss_pred cCcCcCCcC
Confidence 455555555
No 141
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=21.32 E-value=67 Score=29.33 Aligned_cols=29 Identities=21% Similarity=0.452 Sum_probs=22.0
Q ss_pred CccccccccccccCCHHHHHHHHHHhCCC
Q psy8499 374 EITYKCDLCLKEYSNKIYLTEHLKLHQGI 402 (591)
Q Consensus 374 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 402 (591)
+..|.|..|+|.|.-....+.|+..-|.+
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 45699999999999999999999876654
No 142
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=20.75 E-value=62 Score=20.57 Aligned_cols=23 Identities=26% Similarity=0.391 Sum_probs=11.6
Q ss_pred cccCcCcccccc--chhHHhHhhhc
Q psy8499 566 YKCDLCTKEFSR--KGRLTEHMKLH 588 (591)
Q Consensus 566 ~~C~~C~~~f~~--~~~l~~H~~~H 588 (591)
-.|+.||..|.. ..+-+.|.+-|
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH 38 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYH 38 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHH
Confidence 356666655532 34445555544
No 143
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=20.49 E-value=35 Score=19.65 Aligned_cols=19 Identities=26% Similarity=0.684 Sum_probs=11.5
Q ss_pred cccCcCccccccchhHHhHh
Q psy8499 566 YKCDLCTKEFSRKGRLTEHM 585 (591)
Q Consensus 566 ~~C~~C~~~f~~~~~l~~H~ 585 (591)
|.|-.|++.| .....+.|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 5688888888 444455553
No 144
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=20.12 E-value=63 Score=25.06 Aligned_cols=13 Identities=15% Similarity=0.588 Sum_probs=7.7
Q ss_pred cccccccccccCC
Q psy8499 376 TYKCDLCLKEYSN 388 (591)
Q Consensus 376 ~~~C~~C~~~f~~ 388 (591)
|.+|..||..|.+
T Consensus 2 pH~CtrCG~vf~~ 14 (112)
T COG3364 2 PHQCTRCGEVFDD 14 (112)
T ss_pred Cceeccccccccc
Confidence 4556666666655
No 145
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.08 E-value=81 Score=24.80 Aligned_cols=31 Identities=19% Similarity=0.066 Sum_probs=19.4
Q ss_pred CCCCCCCccccccCChHHHHHHHHhcCCccccccccccccC
Q psy8499 347 GIKNKCDLCSKEFSCKRYVTEHMKLHQEITYKCDLCLKEYS 387 (591)
Q Consensus 347 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~C~~C~~~f~ 387 (591)
|.+..|+.|++.|-.. ...|..|++||++|+
T Consensus 7 GtKridPetg~KFYDL----------NrdPiVsPytG~s~P 37 (129)
T COG4530 7 GTKRIDPETGKKFYDL----------NRDPIVSPYTGKSYP 37 (129)
T ss_pred cccccCccccchhhcc----------CCCccccCcccccch
Confidence 3345566777666432 145777888888884
No 146
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=20.03 E-value=71 Score=26.32 Aligned_cols=12 Identities=25% Similarity=0.875 Sum_probs=6.7
Q ss_pred CCccCccccCCh
Q psy8499 486 KCDLCSKEFSTK 497 (591)
Q Consensus 486 ~C~~C~~~f~~~ 497 (591)
+|+.|.-+|.+.
T Consensus 123 vCPvCkTSFKss 134 (140)
T PF05290_consen 123 VCPVCKTSFKSS 134 (140)
T ss_pred CCCccccccccc
Confidence 566665555544
Done!