Query psy8286
Match_columns 858
No_of_seqs 640 out of 3369
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 23:36:22 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8286.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8286hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.1E-28 2.4E-33 282.0 9.2 328 531-858 353-940 (958)
2 KOG2462|consensus 99.9 6E-26 1.3E-30 234.1 6.5 138 708-847 127-265 (279)
3 KOG1074|consensus 99.9 9.8E-26 2.1E-30 258.1 7.9 256 532-826 606-936 (958)
4 KOG3608|consensus 99.9 3.8E-24 8.2E-29 224.4 8.8 217 565-852 142-379 (467)
5 KOG2462|consensus 99.9 7.6E-24 1.6E-28 218.6 5.2 137 644-819 129-265 (279)
6 KOG3623|consensus 99.9 2.8E-23 6.1E-28 233.7 7.8 79 768-846 893-971 (1007)
7 KOG3608|consensus 99.8 1.8E-21 4E-26 204.4 4.0 231 506-809 134-398 (467)
8 KOG3623|consensus 99.8 5.8E-21 1.3E-25 215.2 7.7 81 738-818 891-971 (1007)
9 KOG3576|consensus 99.6 2.7E-16 5.8E-21 154.8 1.9 114 740-853 116-240 (267)
10 KOG3576|consensus 99.5 1.5E-14 3.2E-19 142.6 3.0 116 709-826 115-241 (267)
11 PLN03086 PRLI-interacting fact 99.3 1.3E-11 2.9E-16 142.5 10.4 148 684-851 407-566 (567)
12 PLN03086 PRLI-interacting fact 99.1 2.7E-10 5.9E-15 131.8 8.1 143 646-819 408-562 (567)
13 PHA00733 hypothetical protein 98.9 1.2E-09 2.5E-14 104.7 4.7 82 767-851 38-125 (128)
14 PHA00733 hypothetical protein 98.7 1.4E-08 3E-13 97.3 3.9 56 765-822 69-124 (128)
15 KOG3993|consensus 98.5 1.5E-08 3.3E-13 110.4 -0.2 171 682-852 265-485 (500)
16 PHA02768 hypothetical protein; 98.5 4.1E-08 8.9E-13 78.7 2.1 43 798-842 6-48 (55)
17 KOG3993|consensus 98.5 4.1E-08 9E-13 107.0 0.9 50 506-555 267-319 (500)
18 PHA02768 hypothetical protein; 98.4 1.4E-07 3E-12 75.8 1.7 42 770-813 6-47 (55)
19 PF13465 zf-H2C2_2: Zinc-finge 98.1 1.3E-06 2.8E-11 60.0 2.1 24 813-836 2-25 (26)
20 PF13465 zf-H2C2_2: Zinc-finge 98.0 4.2E-06 9.2E-11 57.4 2.2 24 785-808 2-25 (26)
21 KOG1146|consensus 97.9 4.2E-05 9.2E-10 94.4 11.7 114 414-530 463-613 (1406)
22 PHA00732 hypothetical protein 97.8 1.1E-05 2.5E-10 70.6 3.0 47 769-821 1-48 (79)
23 PHA00616 hypothetical protein 97.8 7.7E-06 1.7E-10 62.7 1.3 34 797-830 1-34 (44)
24 PHA00616 hypothetical protein 97.8 1E-05 2.3E-10 62.0 1.4 33 825-857 1-33 (44)
25 PHA00732 hypothetical protein 97.6 4.2E-05 9.1E-10 67.0 2.7 46 741-792 1-47 (79)
26 PF05605 zf-Di19: Drought indu 97.2 0.00036 7.9E-09 56.6 4.1 50 798-850 3-54 (54)
27 PF12756 zf-C2H2_2: C2H2 type 97.0 0.00021 4.5E-09 64.8 0.5 70 509-580 2-71 (100)
28 PF05605 zf-Di19: Drought indu 96.9 0.00063 1.4E-08 55.3 2.9 49 742-793 3-53 (54)
29 PF00096 zf-C2H2: Zinc finger, 96.9 0.00058 1.3E-08 45.1 2.1 22 798-819 1-22 (23)
30 PF12756 zf-C2H2_2: C2H2 type 96.9 0.0006 1.3E-08 61.7 2.4 71 744-819 2-72 (100)
31 PF00096 zf-C2H2: Zinc finger, 96.8 0.00075 1.6E-08 44.6 1.7 23 826-848 1-23 (23)
32 PF13894 zf-C2H2_4: C2H2-type 96.8 0.00088 1.9E-08 44.2 2.0 24 826-849 1-24 (24)
33 KOG2231|consensus 96.7 0.0013 2.9E-08 77.8 4.0 74 508-582 117-205 (669)
34 PF13894 zf-C2H2_4: C2H2-type 96.5 0.0014 3E-08 43.2 1.7 24 417-440 1-24 (24)
35 COG5189 SFP1 Putative transcri 96.5 0.001 2.2E-08 70.9 1.0 70 767-846 347-419 (423)
36 PF13912 zf-C2H2_6: C2H2-type 96.4 0.002 4.2E-08 44.4 1.7 25 825-849 1-25 (27)
37 KOG2231|consensus 96.4 0.0044 9.5E-08 73.6 5.6 100 507-611 183-301 (669)
38 PF13912 zf-C2H2_6: C2H2-type 96.1 0.0037 8E-08 43.0 1.9 25 797-821 1-25 (27)
39 COG5189 SFP1 Putative transcri 95.9 0.01 2.2E-07 63.5 5.3 30 411-440 344-375 (423)
40 COG5236 Uncharacterized conser 95.3 0.01 2.2E-07 64.1 2.5 78 645-735 151-244 (493)
41 PF09237 GAGA: GAGA factor; I 95.1 0.013 2.8E-07 46.2 1.8 31 823-853 22-52 (54)
42 PF13909 zf-H2C2_5: C2H2-type 95.0 0.012 2.7E-07 39.2 1.4 24 826-850 1-24 (24)
43 COG5048 FOG: Zn-finger [Genera 95.0 0.0094 2E-07 67.6 1.2 63 711-773 289-355 (467)
44 PRK04860 hypothetical protein; 94.9 0.014 3.1E-07 58.2 2.1 39 796-838 118-156 (160)
45 smart00355 ZnF_C2H2 zinc finge 94.8 0.021 4.5E-07 38.0 2.1 22 799-820 2-23 (26)
46 PF09237 GAGA: GAGA factor; I 94.4 0.029 6.3E-07 44.3 2.3 33 793-825 20-52 (54)
47 smart00355 ZnF_C2H2 zinc finge 94.4 0.033 7.2E-07 37.0 2.3 24 826-849 1-24 (26)
48 COG5236 Uncharacterized conser 94.3 0.021 4.6E-07 61.7 1.9 127 684-820 151-304 (493)
49 KOG2482|consensus 94.1 0.025 5.5E-07 61.2 2.0 137 506-642 144-358 (423)
50 PF13909 zf-H2C2_5: C2H2-type 93.8 0.047 1E-06 36.4 2.2 24 798-822 1-24 (24)
51 COG5048 FOG: Zn-finger [Genera 93.6 0.045 9.7E-07 62.1 2.9 148 684-833 289-454 (467)
52 PRK04860 hypothetical protein; 93.5 0.035 7.6E-07 55.4 1.6 35 770-808 120-154 (160)
53 PF12874 zf-met: Zinc-finger o 93.0 0.066 1.4E-06 35.9 1.9 21 798-818 1-21 (25)
54 PF12874 zf-met: Zinc-finger o 92.9 0.058 1.3E-06 36.2 1.4 23 826-848 1-23 (25)
55 KOG2785|consensus 92.7 0.098 2.1E-06 57.9 3.7 78 416-493 3-90 (390)
56 KOG2482|consensus 92.2 0.13 2.8E-06 55.9 3.7 74 507-582 280-357 (423)
57 KOG2785|consensus 91.7 0.18 3.9E-06 55.9 4.2 51 797-847 166-242 (390)
58 KOG1146|consensus 91.4 0.045 9.8E-07 68.7 -0.9 130 470-615 1227-1356(1406)
59 PF12171 zf-C2H2_jaz: Zinc-fin 90.2 0.12 2.7E-06 35.5 0.8 22 417-438 2-23 (27)
60 PF12171 zf-C2H2_jaz: Zinc-fin 89.6 0.15 3.2E-06 35.1 0.8 21 798-818 2-22 (27)
61 KOG4173|consensus 88.4 0.12 2.7E-06 52.3 -0.4 76 505-580 78-167 (253)
62 KOG4173|consensus 85.6 0.32 6.8E-06 49.5 0.8 77 741-820 79-170 (253)
63 KOG2893|consensus 83.8 0.45 9.8E-06 49.3 0.9 42 508-550 12-53 (341)
64 KOG2893|consensus 82.8 0.37 8E-06 50.0 -0.1 46 800-849 13-58 (341)
65 PF13913 zf-C2HC_2: zinc-finge 82.6 0.89 1.9E-05 30.9 1.7 21 417-438 3-23 (25)
66 TIGR00622 ssl1 transcription f 81.9 2.3 4.9E-05 39.9 4.6 49 771-821 57-105 (112)
67 smart00451 ZnF_U1 U1-like zinc 80.4 1.1 2.5E-05 32.3 1.8 22 825-846 3-24 (35)
68 smart00451 ZnF_U1 U1-like zinc 79.1 1.3 2.8E-05 32.0 1.7 23 416-438 3-25 (35)
69 PF12013 DUF3505: Protein of u 78.7 2.6 5.7E-05 39.2 4.1 25 826-850 81-109 (109)
70 PF13913 zf-C2HC_2: zinc-finge 78.1 1.7 3.7E-05 29.5 1.9 19 799-818 4-22 (25)
71 cd00350 rubredoxin_like Rubred 71.0 2.8 6E-05 30.4 1.7 10 824-833 16-25 (33)
72 COG4049 Uncharacterized protei 70.2 1.8 3.9E-05 35.0 0.6 29 822-850 14-42 (65)
73 PF12013 DUF3505: Protein of u 66.4 7.5 0.00016 36.1 4.0 21 563-583 88-108 (109)
74 TIGR00622 ssl1 transcription f 66.1 10 0.00023 35.5 4.7 18 507-524 16-33 (112)
75 COG4049 Uncharacterized protei 62.9 3.1 6.7E-05 33.7 0.6 34 411-444 12-45 (65)
76 cd00350 rubredoxin_like Rubred 61.9 5.3 0.00012 29.0 1.6 24 560-595 2-25 (33)
77 PF09986 DUF2225: Uncharacteri 60.8 2.4 5.2E-05 44.5 -0.5 27 414-440 3-29 (214)
78 PF09986 DUF2225: Uncharacteri 57.4 3.9 8.5E-05 42.9 0.4 43 768-810 4-61 (214)
79 PF02892 zf-BED: BED zinc fing 51.5 8.3 0.00018 29.6 1.3 28 413-440 13-44 (45)
80 cd00729 rubredoxin_SM Rubredox 51.0 9.8 0.00021 27.9 1.5 24 798-833 3-26 (34)
81 PF02892 zf-BED: BED zinc fing 50.2 13 0.00027 28.6 2.1 22 826-847 17-42 (45)
82 PF09538 FYDLN_acid: Protein o 48.7 11 0.00023 35.4 1.7 30 742-782 10-39 (108)
83 PF06524 NOA36: NOA36 protein; 48.5 7.3 0.00016 41.4 0.7 25 796-820 208-232 (314)
84 PRK00464 nrdR transcriptional 46.1 7.5 0.00016 38.7 0.3 12 770-781 29-40 (154)
85 KOG2186|consensus 45.1 8.8 0.00019 40.7 0.7 44 685-730 4-47 (276)
86 PF05443 ROS_MUCR: ROS/MUCR tr 43.6 11 0.00023 36.6 1.0 24 826-852 73-96 (132)
87 PRK00464 nrdR transcriptional 42.4 13 0.00027 37.1 1.3 18 588-605 29-46 (154)
88 COG1198 PriA Primosomal protei 40.7 19 0.00041 44.5 2.7 12 795-806 473-484 (730)
89 KOG2186|consensus 40.5 18 0.00039 38.5 2.1 46 770-818 4-49 (276)
90 COG1198 PriA Primosomal protei 40.4 16 0.00034 45.2 1.9 23 741-778 462-484 (730)
91 PF06524 NOA36: NOA36 protein; 40.2 21 0.00045 38.1 2.5 89 503-610 139-232 (314)
92 smart00614 ZnF_BED BED zinc fi 39.6 22 0.00047 28.2 2.0 19 828-846 21-44 (50)
93 TIGR02098 MJ0042_CXXC MJ0042 f 39.2 21 0.00045 26.5 1.7 6 799-804 27-32 (38)
94 PF09538 FYDLN_acid: Protein o 39.1 19 0.00042 33.7 1.9 29 508-544 11-39 (108)
95 COG1592 Rubrerythrin [Energy p 38.7 19 0.00041 36.3 1.8 23 797-832 134-156 (166)
96 PF04959 ARS2: Arsenite-resist 38.5 12 0.00026 39.3 0.4 32 413-444 74-105 (214)
97 COG5151 SSL1 RNA polymerase II 36.8 30 0.00066 37.7 3.1 78 741-821 322-412 (421)
98 KOG4377|consensus 36.4 21 0.00046 40.4 2.0 24 826-849 402-427 (480)
99 PRK14890 putative Zn-ribbon RN 36.2 36 0.00078 28.2 2.8 10 506-515 25-34 (59)
100 smart00531 TFIIE Transcription 35.5 28 0.0006 34.3 2.5 15 768-782 98-112 (147)
101 KOG2807|consensus 34.9 51 0.0011 36.4 4.4 25 796-820 344-368 (378)
102 TIGR00373 conserved hypothetic 34.4 29 0.00064 34.6 2.5 17 769-785 109-125 (158)
103 smart00531 TFIIE Transcription 34.4 31 0.00067 34.0 2.6 14 741-754 99-112 (147)
104 KOG2071|consensus 33.8 53 0.0011 39.2 4.7 29 413-441 415-443 (579)
105 smart00659 RPOLCX RNA polymera 33.7 29 0.00062 27.1 1.8 11 798-808 3-13 (44)
106 PRK04023 DNA polymerase II lar 33.1 39 0.00086 42.7 3.7 11 798-808 664-674 (1121)
107 PF13717 zinc_ribbon_4: zinc-r 32.4 28 0.0006 25.8 1.4 8 714-721 5-12 (36)
108 TIGR00373 conserved hypothetic 32.3 39 0.00085 33.8 3.0 39 789-836 101-139 (158)
109 PRK04023 DNA polymerase II lar 32.2 47 0.001 42.0 4.2 54 503-600 623-676 (1121)
110 smart00614 ZnF_BED BED zinc fi 31.4 30 0.00066 27.4 1.6 26 416-441 18-48 (50)
111 PRK09678 DNA-binding transcrip 31.2 23 0.0005 30.7 0.9 7 802-808 34-40 (72)
112 TIGR02300 FYDLN_acid conserved 31.1 33 0.00071 32.9 2.0 15 768-782 25-39 (129)
113 TIGR02605 CxxC_CxxC_SSSS putat 30.9 28 0.00061 27.6 1.4 30 559-595 5-34 (52)
114 PF04959 ARS2: Arsenite-resist 30.0 20 0.00043 37.7 0.4 30 794-823 74-103 (214)
115 COG4957 Predicted transcriptio 29.8 30 0.00066 33.3 1.6 24 827-853 78-101 (148)
116 PRK06266 transcription initiat 29.7 36 0.00078 34.7 2.3 16 769-784 117-132 (178)
117 COG5151 SSL1 RNA polymerase II 29.7 51 0.0011 36.1 3.4 24 741-764 388-411 (421)
118 smart00834 CxxC_CXXC_SSSS Puta 29.3 33 0.00072 25.5 1.5 30 559-595 5-34 (41)
119 PRK00398 rpoP DNA-directed RNA 29.0 22 0.00047 27.7 0.4 11 798-808 4-14 (46)
120 PF09723 Zn-ribbon_8: Zinc rib 29.0 13 0.00027 28.6 -0.9 10 472-481 6-15 (42)
121 PF12760 Zn_Tnp_IS1595: Transp 27.9 84 0.0018 24.5 3.5 11 795-805 35-45 (46)
122 TIGR02300 FYDLN_acid conserved 27.5 40 0.00086 32.4 1.9 30 770-810 10-39 (129)
123 KOG2071|consensus 27.3 41 0.00088 40.1 2.4 27 739-765 416-442 (579)
124 PRK06266 transcription initiat 27.0 47 0.001 33.9 2.5 35 793-836 113-147 (178)
125 COG1996 RPC10 DNA-directed RNA 26.4 37 0.00079 27.2 1.2 11 797-807 6-16 (49)
126 PRK09678 DNA-binding transcrip 26.1 20 0.00043 31.1 -0.3 42 798-841 2-45 (72)
127 smart00734 ZnF_Rad18 Rad18-lik 24.8 51 0.0011 22.6 1.6 20 417-437 2-21 (26)
128 PF02176 zf-TRAF: TRAF-type zi 24.6 49 0.0011 26.8 1.8 20 811-830 24-43 (60)
129 COG1592 Rubrerythrin [Energy p 24.4 48 0.001 33.4 2.0 24 559-595 134-157 (166)
130 KOG1280|consensus 24.4 50 0.0011 36.8 2.3 35 504-538 77-116 (381)
131 PF05443 ROS_MUCR: ROS/MUCR tr 23.6 40 0.00087 32.7 1.2 23 798-823 73-95 (132)
132 COG2888 Predicted Zn-ribbon RN 23.5 58 0.0013 27.1 1.9 8 797-804 50-57 (61)
133 PF15269 zf-C2H2_7: Zinc-finge 23.5 50 0.0011 25.7 1.4 22 797-818 20-41 (54)
134 KOG1044|consensus 23.4 21 0.00045 42.2 -0.9 42 685-727 193-234 (670)
135 KOG2593|consensus 23.1 72 0.0016 36.7 3.2 38 765-805 124-161 (436)
136 COG1997 RPL43A Ribosomal prote 22.9 59 0.0013 29.2 2.0 13 797-809 53-65 (89)
137 KOG2593|consensus 22.3 39 0.00085 38.8 1.0 34 505-538 127-160 (436)
138 PHA00626 hypothetical protein 22.3 43 0.00093 27.5 0.9 12 797-808 23-34 (59)
139 PF10571 UPF0547: Uncharacteri 22.2 57 0.0012 22.5 1.4 10 508-517 16-25 (26)
140 TIGR00595 priA primosomal prot 21.7 48 0.001 39.5 1.6 47 712-804 214-260 (505)
141 KOG1701|consensus 21.7 30 0.00065 39.5 -0.0 14 418-431 276-289 (468)
142 PF13389 DUF4107: Protein of u 21.1 38 0.00082 32.5 0.5 28 108-140 130-157 (158)
143 PRK14873 primosome assembly pr 21.0 53 0.0012 40.5 1.9 44 744-832 386-429 (665)
144 TIGR01384 TFS_arch transcripti 20.7 26 0.00057 32.2 -0.7 10 588-597 91-100 (104)
145 PF15269 zf-C2H2_7: Zinc-finge 20.7 58 0.0013 25.4 1.3 22 826-847 21-42 (54)
146 KOG4124|consensus 20.6 51 0.0011 36.7 1.4 29 412-441 209-237 (442)
147 KOG2807|consensus 20.3 1.3E+02 0.0029 33.3 4.4 31 558-594 344-374 (378)
148 KOG4167|consensus 20.1 24 0.00053 42.7 -1.2 25 825-849 792-816 (907)
No 1
>KOG1074|consensus
Probab=99.95 E-value=1.1e-28 Score=282.04 Aligned_cols=328 Identities=25% Similarity=0.455 Sum_probs=226.1
Q ss_pred CccCCCCCCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhcccccCCCCCC------------ccCCCCCcccc
Q psy8286 531 PLPCPVCSKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSDHKNTHTGNAP------------FPCNMCDRHFK 598 (858)
Q Consensus 531 ~~~C~~C~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~~H~~ekp------------y~C~~C~k~F~ 598 (858)
..+|.+|.|.|.+.+.|..|+|.|++++||+|.+|+..|.++.+|+.|...|+..-| +.|.+|.-.|.
T Consensus 353 khkCr~CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~e~~p~~~m~p~~~~e~l~~~i~st~~p 432 (958)
T KOG1074|consen 353 KHKCRFCAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHREKYPHVQMNPHPVQEHLQYVITSTGLP 432 (958)
T ss_pred cchhhhhHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeeccccCCccccCCCCchhhhcceeeccccC
Confidence 478999999999999999999999999999999999999999999999998876542 46777776666
Q ss_pred Cccchhhhhhhhcccccc-----------------cccccc-------cc------------------------------
Q psy8286 599 RYSNLAQHKKVIHLKKKK-----------------IKDFYC-------HC------------------------------ 624 (858)
Q Consensus 599 s~~~L~~H~k~~H~~~k~-----------------~k~~~C-------~C------------------------------ 624 (858)
.-....-|....+...-. .....| .|
T Consensus 433 ~g~~vpp~k~~~~~~~~e~~~~~~sts~g~~~~~~~~~sv~~~~ts~~~~~~~s~~~~~~~~~i~~~s~e~e~~vs~g~~ 512 (958)
T KOG1074|consen 433 YGPSVPPEKAEEEAATVEPKLLVRSTSVGSATESLTPSSVSFGETSAPPLPAFSKFVLMKTVEIKSKSEEPEPAVSEGSA 512 (958)
T ss_pred CCCCCCCCCCcchhccccccccccccccCCCCCcccccccccccccCCCCCccccccccCCcccccccCCCCcccccccc
Confidence 655555554222110000 000000 00
Q ss_pred --Cccccc---------------ccchhhhhhhcc--CCCcccCcCCCcccCchhhhhhhhc--------cCCCC-----
Q psy8286 625 --GEIFQS---------------RGKLLWHQETHA--TVPKQCTYCTEKFTHTSSLTRHIRK--------NHDPR----- 672 (858)
Q Consensus 625 --~k~F~s---------------~~~L~~H~~~h~--~~~~~C~~C~k~F~s~~~L~~H~~~--------~H~~~----- 672 (858)
+..+.. ...+..|...-. ...|.|.-=+-.-.....+..-+.. .....
T Consensus 513 ~~~~~~gs~l~~s~~ks~~s~~~~~~~~~~~asa~m~~~~~~~~p~g~s~~~~aq~~~l~d~~~~~~~~~~tsseS~kl~ 592 (958)
T KOG1074|consen 513 ISGVLEGSPLRMSSGKSVESLPVEADLLNHAASAGMFPPSYVSRPLGPSEDTTAQALQLVDKIPEALIEISTSSESPKLT 592 (958)
T ss_pred ccccccCCccccccccCccccchhccccchhhccccCCchhhcCCCCcchhhHHHhhhhhccChhhcceeecccCCcccc
Confidence 000000 000000100000 1112232222111111111110000 00000
Q ss_pred C-CCCCCCCCCCcccccccccccc-hhhhhhhhcccCCCCccccCccccccccchhhcchhhhcccC--CCcccccc---
Q psy8286 673 Y-LPVDRKKTDENVNCDICKIVVL-KSTLKSHMRIHDNAKPFSCTVCKKSFYTKWNLGLHQWTHVSR--MHMPFKCK--- 745 (858)
Q Consensus 673 ~-~~~~~~~~~~~~~C~~C~~~~~-~s~L~~H~~~H~~~k~y~C~~C~ksF~s~~~L~~H~~~H~~~--~~kpykC~--- 745 (858)
. .........+..+|-+|.++.. ++.|+.|.++|+|++||+|.+||+.|.++.+|+.|+..|... ..-+|.|+
T Consensus 593 slv~~~~~~~TdPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ 672 (958)
T KOG1074|consen 593 SLVENSENKRTDPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTF 672 (958)
T ss_pred ccccccccccCCccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchh
Confidence 0 0011233445678999999887 899999999999999999999999999999999999988653 12458899
Q ss_pred CCCCcccChHHHHHHHHhhcCC-C------------CccccccCccccCchhhhhHHhhhc-------------------
Q psy8286 746 SCNRGFYSSSEYQAHLRSHRNI-R------------PFTCNYCGRKFIRKFNCIRHVREHE------------------- 793 (858)
Q Consensus 746 ~C~k~F~s~s~L~~H~r~H~ge-k------------py~C~~Cgk~F~~~~~L~~H~r~Ht------------------- 793 (858)
+|.+.|.....|..|+++|.+. . .-+|..|.+.|.....+..++..|.
T Consensus 673 ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~ 752 (958)
T KOG1074|consen 673 ICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEEL 752 (958)
T ss_pred hhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCccccccccccccc
Confidence 9999999999999999999832 1 2468888888877666666654330
Q ss_pred ----------------------------------------------------CC--------------------------
Q psy8286 794 ----------------------------------------------------LK-------------------------- 795 (858)
Q Consensus 794 ----------------------------------------------------ge-------------------------- 795 (858)
.+
T Consensus 753 ~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~ 832 (958)
T KOG1074|consen 753 DVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQ 832 (958)
T ss_pred ccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcc
Confidence 00
Q ss_pred ---------------------------------------------cceecCCCCcccCChhhHHHHHhhhcCCCccccCc
Q psy8286 796 ---------------------------------------------KKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHI 830 (858)
Q Consensus 796 ---------------------------------------------kpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~ 830 (858)
....|.+||+.|.+.+.|..|||+|+|+|||.|.+
T Consensus 833 ~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~f 912 (958)
T KOG1074|consen 833 ETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHF 912 (958)
T ss_pred cccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchh
Confidence 02679999999999999999999999999999999
Q ss_pred cCcccCChhhHHHHHhhhCCCCCCCCCC
Q psy8286 831 CGKTSTTKSNHNKHVKTHHARETINTEG 858 (858)
Q Consensus 831 Cgk~F~~k~~L~~H~r~HH~~ep~~~e~ 858 (858)
|++.|..+.+|+.||.+|+...+.+.+|
T Consensus 913 C~~aFttrgnLKvHMgtH~w~q~~srrG 940 (958)
T KOG1074|consen 913 CEEAFTTRGNLKVHMGTHMWVQPPSRRG 940 (958)
T ss_pred hhhhhhhhhhhhhhhccccccCCCccCC
Confidence 9999999999999999999988887765
No 2
>KOG2462|consensus
Probab=99.92 E-value=6e-26 Score=234.11 Aligned_cols=138 Identities=35% Similarity=0.657 Sum_probs=130.5
Q ss_pred CCCccccCccccccccchhhcchhhhcccCC-CccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhh
Q psy8286 708 NAKPFSCTVCKKSFYTKWNLGLHQWTHVSRM-HMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCI 786 (858)
Q Consensus 708 ~~k~y~C~~C~ksF~s~~~L~~H~~~H~~~~-~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~ 786 (858)
....|+|..|++.+.+..+|.+|.++|..-. .+.+.|+.|++.|.+...|+.|+|+|+ -+++|.+|||.|.+...|+
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhh
Confidence 4457999999999999999999999997643 456899999999999999999999998 6899999999999999999
Q ss_pred hHHhhhcCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHHHHHhh
Q psy8286 787 RHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHNKHVKT 847 (858)
Q Consensus 787 ~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~~H~r~ 847 (858)
-|+|+|+|||||.|..|+|+|..+++|+.||++|.+.|+|+|..|+|.|..++.|.+|...
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999999999999999999999999999864
No 3
>KOG1074|consensus
Probab=99.92 E-value=9.8e-26 Score=258.09 Aligned_cols=256 Identities=21% Similarity=0.408 Sum_probs=146.2
Q ss_pred ccCCCCCCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhcccccCCCCC----CccCC---CCCccccCccchh
Q psy8286 532 LPCPVCSKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSDHKNTHTGNA----PFPCN---MCDRHFKRYSNLA 604 (858)
Q Consensus 532 ~~C~~C~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~~H~~ek----py~C~---~C~k~F~s~~~L~ 604 (858)
.+|-+|.+...-++.|+.|.|+|+|++||+|++|++.|.++.+|+.||..|.... .|.|+ +|-+.|...-.|.
T Consensus 606 NqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~lp 685 (958)
T KOG1074|consen 606 NQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTLP 685 (958)
T ss_pred cceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhccccccccccc
Confidence 4555666666666666666666666666666666666666666666655554332 24555 5555565555555
Q ss_pred hhhhhhccccccccccccccCcccccccchhhhhhhccCCCcccCcCCCcccCchhhhhhhhccCCCCCCCCCCCCCCCc
Q psy8286 605 QHKKVIHLKKKKIKDFYCHCGEIFQSRGKLLWHQETHATVPKQCTYCTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDEN 684 (858)
Q Consensus 605 ~H~k~~H~~~k~~k~~~C~C~k~F~s~~~L~~H~~~h~~~~~~C~~C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~ 684 (858)
.|++.|.+......... . ......-+|..|.+.|.....+..++..+-+.+..+..
T Consensus 686 QhIriH~~~~~s~g~~a-------~----------e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~------- 741 (958)
T KOG1074|consen 686 QHIRIHLGGQISNGGTA-------A----------EGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDE------- 741 (958)
T ss_pred ceEEeecCCCCCCCccc-------c----------cccchhcccchhhhcccccccchhhhhccCCcccCCcc-------
Confidence 55555442211000000 0 00011246999999999999999998877544332211
Q ss_pred ccccccccccchhhhhhhhcccCCCC----ccccCccccccccchhhcchhhhc---------------------ccCCC
Q psy8286 685 VNCDICKIVVLKSTLKSHMRIHDNAK----PFSCTVCKKSFYTKWNLGLHQWTH---------------------VSRMH 739 (858)
Q Consensus 685 ~~C~~C~~~~~~s~L~~H~~~H~~~k----~y~C~~C~ksF~s~~~L~~H~~~H---------------------~~~~~ 739 (858)
..+.+.+..+. +..+..|+..+.....+..+-..+ .....
T Consensus 742 ---------------~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~ 806 (958)
T KOG1074|consen 742 ---------------QMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSE 806 (958)
T ss_pred ---------------cccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCC
Confidence 11112222222 344555555555444444332111 00112
Q ss_pred ccc-cccCCCCcccChHHHH----HHH--------------HhhcC------------------------CCCccccccC
Q psy8286 740 MPF-KCKSCNRGFYSSSEYQ----AHL--------------RSHRN------------------------IRPFTCNYCG 776 (858)
Q Consensus 740 kpy-kC~~C~k~F~s~s~L~----~H~--------------r~H~g------------------------ekpy~C~~Cg 776 (858)
++. .+..++..-...-... .-. .++.+ .....|..|+
T Consensus 807 k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCg 886 (958)
T KOG1074|consen 807 KPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCG 886 (958)
T ss_pred CCcccccCCCcCCccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccch
Confidence 444 5666654332211100 000 00000 0127899999
Q ss_pred ccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhhhcCCCcc
Q psy8286 777 RKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPF 826 (858)
Q Consensus 777 k~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy 826 (858)
+.|.....|..|+|+|+++|||.|.+|++.|..+.+|+.||.+|....++
T Consensus 887 k~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 887 KQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTHMWVQPP 936 (958)
T ss_pred hcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhccccccCCC
Confidence 99999999999999999999999999999999999999999999876654
No 4
>KOG3608|consensus
Probab=99.90 E-value=3.8e-24 Score=224.42 Aligned_cols=217 Identities=27% Similarity=0.544 Sum_probs=165.7
Q ss_pred CCcccccchhhhcccccCC------------CCC-CccCC--CCCccccCccchhhhhhhhccccccccccccccCcccc
Q psy8286 565 CDKSFVTAQKLSDHKNTHT------------GNA-PFPCN--MCDRHFKRYSNLAQHKKVIHLKKKKIKDFYCHCGEIFQ 629 (858)
Q Consensus 565 C~k~F~s~~~L~~H~~~H~------------~ek-py~C~--~C~k~F~s~~~L~~H~k~~H~~~k~~k~~~C~C~k~F~ 629 (858)
|+..|.+...|.+|...|. .++ .+.|. .|-+.|.++..|++|++.|..+
T Consensus 142 Ce~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~e---------------- 205 (467)
T KOG3608|consen 142 CEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNE---------------- 205 (467)
T ss_pred cCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCC----------------
Confidence 5555555555555544432 111 24453 4677777777777776654422
Q ss_pred cccchhhhhhhccCCCcccCcCCCcccCchhhhhhhhccCCCCCCCCCCCCCCCcccccccccccchhhhhhhhcccCCC
Q psy8286 630 SRGKLLWHQETHATVPKQCTYCTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDENVNCDICKIVVLKSTLKSHMRIHDNA 709 (858)
Q Consensus 630 s~~~L~~H~~~h~~~~~~C~~C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~~~C~~C~~~~~~s~L~~H~~~H~~~ 709 (858)
+...|+.|+..|+++..|-.|++.... ...
T Consensus 206 --------------KvvACp~Cg~~F~~~tkl~DH~rRqt~------------------------------------l~~ 235 (467)
T KOG3608|consen 206 --------------KVVACPHCGELFRTKTKLFDHLRRQTE------------------------------------LNT 235 (467)
T ss_pred --------------eEEecchHHHHhccccHHHHHHHhhhh------------------------------------hcC
Confidence 234577777777777777777664321 123
Q ss_pred CccccCccccccccchhhcchhhhcccCCCccccccCCCCcccChHHHHHHHHh-hcCCCCccccccCccccCchhhhhH
Q psy8286 710 KPFSCTVCKKSFYTKWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHLRS-HRNIRPFTCNYCGRKFIRKFNCIRH 788 (858)
Q Consensus 710 k~y~C~~C~ksF~s~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~r~-H~gekpy~C~~Cgk~F~~~~~L~~H 788 (858)
.+|.|..|.|.|.++..|..|+..|.. -|+|+.|+.+....++|..|++. |...|||+|..|++.|.+.+.|.+|
T Consensus 236 n~fqC~~C~KrFaTeklL~~Hv~rHvn----~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH 311 (467)
T KOG3608|consen 236 NSFQCAQCFKRFATEKLLKSHVVRHVN----CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKH 311 (467)
T ss_pred CchHHHHHHHHHhHHHHHHHHHHHhhh----cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHH
Confidence 368899999999999999999998875 68999999999999999999885 7788999999999999999999999
Q ss_pred HhhhcCCcceecCC--CCcccCChhhHHHHHhhhc-C--CCccccCccCcccCChhhHHHHHhhhCCCC
Q psy8286 789 VREHELKKKYSCTI--CGKTFHRSYYLTAHVRVHN-G--ERPFSCHICGKTSTTKSNHNKHVKTHHARE 852 (858)
Q Consensus 789 ~r~Htgekpy~C~~--Cgk~F~~~s~L~~H~r~Ht-g--ekpy~C~~Cgk~F~~k~~L~~H~r~HH~~e 852 (858)
...|+ +-.|+|+. |.++|++...|++|++.|+ | +-+|.|..|++.|++..+|.+|++..|+-+
T Consensus 312 ~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 312 VQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred HHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 99888 67799998 9999999999999987655 4 456999999999999999999998888754
No 5
>KOG2462|consensus
Probab=99.89 E-value=7.6e-24 Score=218.61 Aligned_cols=137 Identities=32% Similarity=0.664 Sum_probs=124.7
Q ss_pred CCcccCcCCCcccCchhhhhhhhccCCCCCCCCCCCCCCCcccccccccccchhhhhhhhcccCCCCccccCcccccccc
Q psy8286 644 VPKQCTYCTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDENVNCDICKIVVLKSTLKSHMRIHDNAKPFSCTVCKKSFYT 723 (858)
Q Consensus 644 ~~~~C~~C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~~~C~~C~~~~~~s~L~~H~~~H~~~k~y~C~~C~ksF~s 723 (858)
..|+|+.|++.+.+...|.+|.+.|-.. ...+.+.|..|+|.|.+
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~-----------------------------------~s~ka~~C~~C~K~YvS 173 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSL-----------------------------------DSKKAFSCKYCGKVYVS 173 (279)
T ss_pred Cceeccccccccccccccchhhcccccc-----------------------------------cccccccCCCCCceeee
Confidence 3578999999999999999988876432 12466899999999999
Q ss_pred chhhcchhhhcccCCCccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCC
Q psy8286 724 KWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTIC 803 (858)
Q Consensus 724 ~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~C 803 (858)
...|+.|+++|. .+++|.+||+.|.....|+-|+|+|+|||||.|+.|+|.|..+++|+.|+++|.+.|+|+|..|
T Consensus 174 mpALkMHirTH~----l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C 249 (279)
T KOG2462|consen 174 MPALKMHIRTHT----LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRC 249 (279)
T ss_pred hHHHhhHhhccC----CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcch
Confidence 999999999997 4899999999999999999999999999999999999999999999999999999999999999
Q ss_pred CcccCChhhHHHHHhh
Q psy8286 804 GKTFHRSYYLTAHVRV 819 (858)
Q Consensus 804 gk~F~~~s~L~~H~r~ 819 (858)
+|.|..++.|.+|...
T Consensus 250 ~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 250 GKSFALKSYLNKHSES 265 (279)
T ss_pred hhHHHHHHHHHHhhhh
Confidence 9999999999999753
No 6
>KOG3623|consensus
Probab=99.88 E-value=2.8e-23 Score=233.70 Aligned_cols=79 Identities=33% Similarity=0.720 Sum_probs=68.5
Q ss_pred CCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHHHHHh
Q psy8286 768 RPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHNKHVK 846 (858)
Q Consensus 768 kpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~~H~r 846 (858)
-.|.|..|+|.|...+.|.+|...|+|.+||+|.+|.|+|..+-.|..|+|.|.|||||+|+.|+|.|+....+.+||.
T Consensus 893 gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 893 GMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred ccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhhc
Confidence 4688888888888888888888888888888888888888888888888888888888888888888888888888874
No 7
>KOG3608|consensus
Probab=99.83 E-value=1.8e-21 Score=204.38 Aligned_cols=231 Identities=26% Similarity=0.508 Sum_probs=154.2
Q ss_pred cccc--CCcccccCChHHHHHHHhhcCC-----------------ccCC--CCCCccCChhHHHHHHHHhCCCCccccCC
Q psy8286 506 QRRC--DTCSEVFDKLKDYQEHIKNVHP-----------------LPCP--VCSKTFPSKPSLTLHMRTHLKIKPYKCDV 564 (858)
Q Consensus 506 ~~~C--~~C~k~F~~~~~L~~H~k~~H~-----------------~~C~--~C~k~F~s~~~L~~H~r~H~~ek~ykC~~ 564 (858)
.|.| ..|+..|.+...|.+|+. .|. +.|. .|.+.|.+++.|++|++.|.++|...|+.
T Consensus 134 ~f~C~WedCe~~F~s~~ef~dHV~-~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~ 212 (467)
T KOG3608|consen 134 NFRCGWEDCEREFVSIVEFQDHVV-KHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPH 212 (467)
T ss_pred hhccChhhcCCcccCHHHHHHHHH-HhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecch
Confidence 3566 469999999999999986 442 2332 36667777777777777777777777777
Q ss_pred CCcccccchhhhcccccC--CCCCCccCCCCCccccCccchhhhhhhhccccccccccccccCcccccccchhhhhhhcc
Q psy8286 565 CDKSFVTAQKLSDHKNTH--TGNAPFPCNMCDRHFKRYSNLAQHKKVIHLKKKKIKDFYCHCGEIFQSRGKLLWHQETHA 642 (858)
Q Consensus 565 C~k~F~s~~~L~~H~~~H--~~ekpy~C~~C~k~F~s~~~L~~H~k~~H~~~k~~k~~~C~C~k~F~s~~~L~~H~~~h~ 642 (858)
|+.-|.++..|..|.+.. ....+|.|..|.+.|.+...|..|+..|-
T Consensus 213 Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv------------------------------- 261 (467)
T KOG3608|consen 213 CGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV------------------------------- 261 (467)
T ss_pred HHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh-------------------------------
Confidence 777777777777775433 23456777777777777776666665432
Q ss_pred CCCcccCcCCCcccCchhhhhhhhccCCCCCCCCCCCCCCCcccccccccccchhhhhhhhcccCCCCccccCccccccc
Q psy8286 643 TVPKQCTYCTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDENVNCDICKIVVLKSTLKSHMRIHDNAKPFSCTVCKKSFY 722 (858)
Q Consensus 643 ~~~~~C~~C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~~~C~~C~~~~~~s~L~~H~~~H~~~k~y~C~~C~ksF~ 722 (858)
.-|+|+.|+......++|.+||+..| ...+||+|+.|++.|.
T Consensus 262 -n~ykCplCdmtc~~~ssL~~H~r~rH-------------------------------------s~dkpfKCd~Cd~~c~ 303 (467)
T KOG3608|consen 262 -NCYKCPLCDMTCSSASSLTTHIRYRH-------------------------------------SKDKPFKCDECDTRCV 303 (467)
T ss_pred -hcccccccccCCCChHHHHHHHHhhh-------------------------------------ccCCCccccchhhhhc
Confidence 13677777777777777777776554 3456777777777777
Q ss_pred cchhhcchhhhcccCCCccccccC--CCCcccChHHHHHHHHhhc-C--CCCccccccCccccCchhhhhHHhh-h----
Q psy8286 723 TKWNLGLHQWTHVSRMHMPFKCKS--CNRGFYSSSEYQAHLRSHR-N--IRPFTCNYCGRKFIRKFNCIRHVRE-H---- 792 (858)
Q Consensus 723 s~~~L~~H~~~H~~~~~kpykC~~--C~k~F~s~s~L~~H~r~H~-g--ekpy~C~~Cgk~F~~~~~L~~H~r~-H---- 792 (858)
+.+.|.+|...|.. ..|.|+. |..+|.+...+++|++.++ | +.+|.|..|++.|.+-.+|..|++. |
T Consensus 304 ~esdL~kH~~~HS~---~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~ 380 (467)
T KOG3608|consen 304 RESDLAKHVQVHSK---TVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFRL 380 (467)
T ss_pred cHHHHHHHHHhccc---cceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhcccC
Confidence 77777777776653 3577766 7777777777777777655 3 3457777777777777777777653 3
Q ss_pred -cCCcceecCCCCcccCC
Q psy8286 793 -ELKKKYSCTICGKTFHR 809 (858)
Q Consensus 793 -tgekpy~C~~Cgk~F~~ 809 (858)
.|-+.|.=.+|.-.|.+
T Consensus 381 PsGh~RFtYk~~edG~mR 398 (467)
T KOG3608|consen 381 PSGHKRFTYKVDEDGFMR 398 (467)
T ss_pred CCCCCceeeeeccCceee
Confidence 23344555555555543
No 8
>KOG3623|consensus
Probab=99.83 E-value=5.8e-21 Score=215.23 Aligned_cols=81 Identities=27% Similarity=0.559 Sum_probs=78.2
Q ss_pred CCccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHH
Q psy8286 738 MHMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHV 817 (858)
Q Consensus 738 ~~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~ 817 (858)
....|.|+.|++.|...+.|.+|...|+|.+||+|.+|.|.|+.+..|..|.|.|.|+|||+|+.|+|+|+....+-.||
T Consensus 891 e~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQHM 970 (1007)
T KOG3623|consen 891 EDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQHM 970 (1007)
T ss_pred ccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhhh
Confidence 34589999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred h
Q psy8286 818 R 818 (858)
Q Consensus 818 r 818 (858)
.
T Consensus 971 N 971 (1007)
T KOG3623|consen 971 N 971 (1007)
T ss_pred c
Confidence 6
No 9
>KOG3576|consensus
Probab=99.59 E-value=2.7e-16 Score=154.83 Aligned_cols=114 Identities=29% Similarity=0.659 Sum_probs=89.2
Q ss_pred ccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhh
Q psy8286 740 MPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRV 819 (858)
Q Consensus 740 kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~ 819 (858)
..|.|.+|++.|.-..-|.+|++-|...+.|-|..||+.|.....|++|+|+|+|.+||+|..|+|+|.++-.|..|++.
T Consensus 116 d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~k 195 (267)
T KOG3576|consen 116 DSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKK 195 (267)
T ss_pred CeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHH
Confidence 35778888888888888888888888888888888888888888888888888888888888888888888888888654
Q ss_pred hcC-----------CCccccCccCcccCChhhHHHHHhhhCCCCC
Q psy8286 820 HNG-----------ERPFSCHICGKTSTTKSNHNKHVKTHHARET 853 (858)
Q Consensus 820 Htg-----------ekpy~C~~Cgk~F~~k~~L~~H~r~HH~~ep 853 (858)
-+| ++.|.|..||+.-.....+..|++.||..-|
T Consensus 196 vhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 196 VHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 333 3568888888888888888888888886544
No 10
>KOG3576|consensus
Probab=99.47 E-value=1.5e-14 Score=142.62 Aligned_cols=116 Identities=28% Similarity=0.577 Sum_probs=104.3
Q ss_pred CCccccCccccccccchhhcchhhhcccCCCccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhH
Q psy8286 709 AKPFSCTVCKKSFYTKWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRH 788 (858)
Q Consensus 709 ~k~y~C~~C~ksF~s~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H 788 (858)
...|.|.+|+|.|.-...|.+|++-|.. .+.|-|..||+.|...-+|++|+|+|+|.+||+|..|++.|..+-.|..|
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~--vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSD--VKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccH--HHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 4568999999999999999999999875 47899999999999999999999999999999999999999999999999
Q ss_pred HhhhcC-----------CcceecCCCCcccCChhhHHHHHhhhcCCCcc
Q psy8286 789 VREHEL-----------KKKYSCTICGKTFHRSYYLTAHVRVHNGERPF 826 (858)
Q Consensus 789 ~r~Htg-----------ekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy 826 (858)
++.-+| .|.|.|+.||+.-.....+..|++.|+...|+
T Consensus 193 l~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Spa 241 (267)
T KOG3576|consen 193 LKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSPA 241 (267)
T ss_pred HHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCHH
Confidence 876544 46799999999999999999999998865543
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.26 E-value=1.3e-11 Score=142.49 Aligned_cols=148 Identities=20% Similarity=0.472 Sum_probs=118.4
Q ss_pred cccccccccccchhhhhhhhcccCCCCccccCc--cccccccchhhcchhhhcccCCCccccccCCCCcccChHHHHHHH
Q psy8286 684 NVNCDICKIVVLKSTLKSHMRIHDNAKPFSCTV--CKKSFYTKWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHL 761 (858)
Q Consensus 684 ~~~C~~C~~~~~~s~L~~H~~~H~~~k~y~C~~--C~ksF~s~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~ 761 (858)
...|..|........|..|...-. -..-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|+
T Consensus 407 ~V~C~NC~~~i~l~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H-----------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 407 TVECRNCKHYIPSRSIALHEAYCS-RHNVVCPHDGCGIVLR-VEEAKNH-----------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred eEECCCCCCccchhHHHHHHhhCC-CcceeCCcccccceee-ccccccC-----------ccCCCCCCccc-hHHHHHHH
Confidence 457888988888888888875432 23456874 999883 3444444 47999999996 68899999
Q ss_pred HhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCC----------hhhHHHHHhhhcCCCccccCcc
Q psy8286 762 RSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHR----------SYYLTAHVRVHNGERPFSCHIC 831 (858)
Q Consensus 762 r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~----------~s~L~~H~r~Htgekpy~C~~C 831 (858)
+.|+ +++.|+ |++.+ .+..|..|++.|.+++++.|..|++.|.. .+.|..|..++ |.+++.|..|
T Consensus 473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~C 547 (567)
T PLN03086 473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSC 547 (567)
T ss_pred HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEcccc
Confidence 9986 889999 99755 67899999999999999999999999952 45899999886 8999999999
Q ss_pred CcccCChhhHHHHHhhhCCC
Q psy8286 832 GKTSTTKSNHNKHVKTHHAR 851 (858)
Q Consensus 832 gk~F~~k~~L~~H~r~HH~~ 851 (858)
|+.|.. ..|..|+...|..
T Consensus 548 gk~Vrl-rdm~~H~~~~h~~ 566 (567)
T PLN03086 548 GRSVML-KEMDIHQIAVHQK 566 (567)
T ss_pred CCeeee-hhHHHHHHHhhcC
Confidence 998865 5678898877753
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.05 E-value=2.7e-10 Score=131.81 Aligned_cols=143 Identities=22% Similarity=0.409 Sum_probs=112.7
Q ss_pred cccCcCCCcccCchhhhhhhhccCCCCCCCCCCCCCCCccccc--ccccccchhhhhhhhcccCCCCccccCcccccccc
Q psy8286 646 KQCTYCTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDENVNCD--ICKIVVLKSTLKSHMRIHDNAKPFSCTVCKKSFYT 723 (858)
Q Consensus 646 ~~C~~C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~~~C~--~C~~~~~~s~L~~H~~~H~~~k~y~C~~C~ksF~s 723 (858)
..|+.|....... .|..|...... ....|+ .|+..+.+..+..| +.|+.|++.|.
T Consensus 408 V~C~NC~~~i~l~-~l~lHe~~C~r------------~~V~Cp~~~Cg~v~~r~el~~H---------~~C~~Cgk~f~- 464 (567)
T PLN03086 408 VECRNCKHYIPSR-SIALHEAYCSR------------HNVVCPHDGCGIVLRVEEAKNH---------VHCEKCGQAFQ- 464 (567)
T ss_pred EECCCCCCccchh-HHHHHHhhCCC------------cceeCCcccccceeeccccccC---------ccCCCCCCccc-
Confidence 4688887766544 45577654321 135687 48888877777666 48999999996
Q ss_pred chhhcchhhhcccCCCccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccC----------chhhhhHHhhhc
Q psy8286 724 KWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIR----------KFNCIRHVREHE 793 (858)
Q Consensus 724 ~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~----------~~~L~~H~r~Ht 793 (858)
...|..|+..|+ +++.|+ |++.+ .+..|..|+++|.+.+++.|.+|++.|.. ...|..|...+
T Consensus 465 ~s~LekH~~~~H----kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C- 537 (567)
T PLN03086 465 QGEMEKHMKVFH----EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC- 537 (567)
T ss_pred hHHHHHHHHhcC----CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-
Confidence 678999999985 589999 99755 67999999999999999999999999952 35899999886
Q ss_pred CCcceecCCCCcccCChhhHHHHHhh
Q psy8286 794 LKKKYSCTICGKTFHRSYYLTAHVRV 819 (858)
Q Consensus 794 gekpy~C~~Cgk~F~~~s~L~~H~r~ 819 (858)
|.+++.|..||+.|..+ .|..|+..
T Consensus 538 G~rt~~C~~Cgk~Vrlr-dm~~H~~~ 562 (567)
T PLN03086 538 GSRTAPCDSCGRSVMLK-EMDIHQIA 562 (567)
T ss_pred CCcceEccccCCeeeeh-hHHHHHHH
Confidence 89999999999998765 57788643
No 13
>PHA00733 hypothetical protein
Probab=98.90 E-value=1.2e-09 Score=104.69 Aligned_cols=82 Identities=20% Similarity=0.301 Sum_probs=58.5
Q ss_pred CCCccccccCccccCchhhhhH------HhhhcCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhh
Q psy8286 767 IRPFTCNYCGRKFIRKFNCIRH------VREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSN 840 (858)
Q Consensus 767 ekpy~C~~Cgk~F~~~~~L~~H------~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~ 840 (858)
.+++.|.+|.+.|.....|..| +. +.+.+||.|..||+.|.+...|..|++.| +.+|.|..|++.|.....
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~-~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~s 114 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLT-SKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDS 114 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcc-cCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHH
Confidence 3445555555555554444443 32 33477888888888888888888888876 356888888888888888
Q ss_pred HHHHHhhhCCC
Q psy8286 841 HNKHVKTHHAR 851 (858)
Q Consensus 841 L~~H~r~HH~~ 851 (858)
|.+|+...|+.
T Consensus 115 L~~H~~~~h~~ 125 (128)
T PHA00733 115 TLDHVCKKHNI 125 (128)
T ss_pred HHHHHHHhcCc
Confidence 88888888864
No 14
>PHA00733 hypothetical protein
Probab=98.67 E-value=1.4e-08 Score=97.32 Aligned_cols=56 Identities=23% Similarity=0.551 Sum_probs=41.5
Q ss_pred cCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhhhcC
Q psy8286 765 RNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHNG 822 (858)
Q Consensus 765 ~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htg 822 (858)
++.++|.|+.|++.|.+...|..|++.| +.+|.|..|++.|.....|..|++..++
T Consensus 69 ~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 69 KAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 3467778888888888877777777765 3467888888888888888888766554
No 15
>KOG3993|consensus
Probab=98.53 E-value=1.5e-08 Score=110.39 Aligned_cols=171 Identities=19% Similarity=0.195 Sum_probs=104.2
Q ss_pred CCcccccccccccc-hhhhhhhhcccCCCCccccCccccccccchhhcchhhhcccCCC---------------------
Q psy8286 682 DENVNCDICKIVVL-KSTLKSHMRIHDNAKPFSCTVCKKSFYTKWNLGLHQWTHVSRMH--------------------- 739 (858)
Q Consensus 682 ~~~~~C~~C~~~~~-~s~L~~H~~~H~~~k~y~C~~C~ksF~s~~~L~~H~~~H~~~~~--------------------- 739 (858)
...|.|.+|+..+. .-.|.+|.-..--.--|+|+.|+|.|.-..+|..|.|.|..+.+
T Consensus 265 iGdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 265 IGDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 34578888877765 45677775322222237888888888888888888887754311
Q ss_pred ----------ccccccCCCCcccChHHHHHHHHhhcCCC-----------------CccccccCccccCchhhhhHHhhh
Q psy8286 740 ----------MPFKCKSCNRGFYSSSEYQAHLRSHRNIR-----------------PFTCNYCGRKFIRKFNCIRHVREH 792 (858)
Q Consensus 740 ----------kpykC~~C~k~F~s~s~L~~H~r~H~gek-----------------py~C~~Cgk~F~~~~~L~~H~r~H 792 (858)
.-|.|.+|++.|....-|+.|+.+|+... -+-|..|.-.+.....--.|...+
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~ 424 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV 424 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee
Confidence 12556666666666666666655554210 022333333332222211121111
Q ss_pred cC-CcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHHHHHhhhCCCC
Q psy8286 793 EL-KKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHNKHVKTHHARE 852 (858)
Q Consensus 793 tg-ekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~~H~r~HH~~e 852 (858)
.+ .....|.+|+-.+.++..--.|.+.-..+.-|.|.+|.-.|.+...|.+|+...|..+
T Consensus 425 a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 425 AGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred eccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 11 1224578888888777766666665556677999999999999999999999888654
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.52 E-value=4.1e-08 Score=78.74 Aligned_cols=43 Identities=26% Similarity=0.568 Sum_probs=32.0
Q ss_pred eecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHH
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHN 842 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~ 842 (858)
|.|+.||+.|.+.+.|..||++|+ ++|+|..|++.|.+.+.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 677777777777777777777777 5777777777777766653
No 17
>KOG3993|consensus
Probab=98.46 E-value=4.1e-08 Score=107.03 Aligned_cols=50 Identities=30% Similarity=0.602 Sum_probs=40.0
Q ss_pred ccccCCcccccCChHHHHHHH--hhcC-CccCCCCCCccCChhHHHHHHHHhC
Q psy8286 506 QRRCDTCSEVFDKLKDYQEHI--KNVH-PLPCPVCSKTFPSKPSLTLHMRTHL 555 (858)
Q Consensus 506 ~~~C~~C~k~F~~~~~L~~H~--k~~H-~~~C~~C~k~F~s~~~L~~H~r~H~ 555 (858)
.|.|..|...|.+...|.+|. +.+| .|+|+.|+|.|+-..+|..|.|+|.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK 319 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK 319 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence 478888888888888888885 2233 6888888888888888888888885
No 18
>PHA02768 hypothetical protein; Provisional
Probab=98.37 E-value=1.4e-07 Score=75.77 Aligned_cols=42 Identities=24% Similarity=0.575 Sum_probs=21.5
Q ss_pred ccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhH
Q psy8286 770 FTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYL 813 (858)
Q Consensus 770 y~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L 813 (858)
|.|+.||+.|.+.+.|..|+++|+ ++|+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 455555555555555555555554 445555555555544433
No 19
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.13 E-value=1.3e-06 Score=59.96 Aligned_cols=24 Identities=46% Similarity=1.095 Sum_probs=15.7
Q ss_pred HHHHHhhhcCCCccccCccCcccC
Q psy8286 813 LTAHVRVHNGERPFSCHICGKTST 836 (858)
Q Consensus 813 L~~H~r~Htgekpy~C~~Cgk~F~ 836 (858)
|.+||++|+|++||.|++|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 566666666666666666666664
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=97.97 E-value=4.2e-06 Score=57.40 Aligned_cols=24 Identities=46% Similarity=0.926 Sum_probs=15.4
Q ss_pred hhhHHhhhcCCcceecCCCCcccC
Q psy8286 785 CIRHVREHELKKKYSCTICGKTFH 808 (858)
Q Consensus 785 L~~H~r~Htgekpy~C~~Cgk~F~ 808 (858)
|.+|+++|++++||.|++|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 556666666666666666666664
No 21
>KOG1146|consensus
Probab=97.95 E-value=4.2e-05 Score=94.40 Aligned_cols=114 Identities=15% Similarity=0.199 Sum_probs=77.8
Q ss_pred CcccccccccccCCChhHHHHHHhhcCCCCCCchhhhhhccccc-cccccccCCCCCCcccccCCCccccCCchhhhhhh
Q psy8286 414 KKKHACIYCKKTFTGWKLRKYHMKAEHPELFNEPETIKLLSKTQ-ASTSTVISEPSTSFVSQAVCPIKFENSNSFGTQIQ 492 (858)
Q Consensus 414 ~k~f~C~~C~k~F~~~~~L~~H~~~~H~~~~~~~~~~~~~~k~~-~~~~~~i~~~~~~~~~C~~C~~~F~~~~~L~~H~~ 492 (858)
.+.|+|+.|+..|+....|-.|||..|++-.. ....+...+ .......-....++|.|..|.+.+.....|-+|++
T Consensus 463 ~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~---~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 463 FKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS---AYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred cccccCCccchhhhhHHHhhhcccccccccch---hHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHHHH
Confidence 48899999999999999999999998866322 111111111 11111122334678999999999999999999987
Q ss_pred hccCC-------------------C----------------cc-ccCCcccccCCcccccCChHHHHHHHhhcC
Q psy8286 493 EFLPP-------------------S----------------LT-SIKNKQRRCDTCSEVFDKLKDYQEHIKNVH 530 (858)
Q Consensus 493 ~~h~~-------------------~----------------~~-~~~~k~~~C~~C~k~F~~~~~L~~H~k~~H 530 (858)
..... . .. ....-.++|..|+....-...|+.||...+
T Consensus 540 S~~h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~ 613 (1406)
T KOG1146|consen 540 SDLHRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASP 613 (1406)
T ss_pred HHhhHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCC
Confidence 43200 0 00 011234799999999999999999987543
No 22
>PHA00732 hypothetical protein
Probab=97.83 E-value=1.1e-05 Score=70.56 Aligned_cols=47 Identities=30% Similarity=0.672 Sum_probs=27.6
Q ss_pred CccccccCccccCchhhhhHHhh-hcCCcceecCCCCcccCChhhHHHHHhhhc
Q psy8286 769 PFTCNYCGRKFIRKFNCIRHVRE-HELKKKYSCTICGKTFHRSYYLTAHVRVHN 821 (858)
Q Consensus 769 py~C~~Cgk~F~~~~~L~~H~r~-Htgekpy~C~~Cgk~F~~~s~L~~H~r~Ht 821 (858)
||.|..|++.|.+...|+.|++. |.+ +.|+.|++.|. .|..|++++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccC
Confidence 45666666666666666666653 432 45666666665 3556655443
No 23
>PHA00616 hypothetical protein
Probab=97.80 E-value=7.7e-06 Score=62.70 Aligned_cols=34 Identities=18% Similarity=0.426 Sum_probs=20.4
Q ss_pred ceecCCCCcccCChhhHHHHHhhhcCCCccccCc
Q psy8286 797 KYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHI 830 (858)
Q Consensus 797 py~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~ 830 (858)
||+|..||+.|.+++.|..|++.|+|++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4566666666666666666666666666665543
No 24
>PHA00616 hypothetical protein
Probab=97.76 E-value=1e-05 Score=61.99 Aligned_cols=33 Identities=21% Similarity=0.436 Sum_probs=30.9
Q ss_pred ccccCccCcccCChhhHHHHHhhhCCCCCCCCC
Q psy8286 825 PFSCHICGKTSTTKSNHNKHVKTHHARETINTE 857 (858)
Q Consensus 825 py~C~~Cgk~F~~k~~L~~H~r~HH~~ep~~~e 857 (858)
||+|..||+.|..++.|.+|+++||+++++..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 699999999999999999999999999998754
No 25
>PHA00732 hypothetical protein
Probab=97.58 E-value=4.2e-05 Score=67.04 Aligned_cols=46 Identities=22% Similarity=0.435 Sum_probs=24.8
Q ss_pred cccccCCCCcccChHHHHHHHHh-hcCCCCccccccCccccCchhhhhHHhhh
Q psy8286 741 PFKCKSCNRGFYSSSEYQAHLRS-HRNIRPFTCNYCGRKFIRKFNCIRHVREH 792 (858)
Q Consensus 741 pykC~~C~k~F~s~s~L~~H~r~-H~gekpy~C~~Cgk~F~~~~~L~~H~r~H 792 (858)
||.|+.|++.|.+...|+.|++. |. ++.|+.|++.|. .|..|++++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 35566666666666666666553 33 235666666555 355555443
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.23 E-value=0.00036 Score=56.65 Aligned_cols=50 Identities=30% Similarity=0.668 Sum_probs=28.9
Q ss_pred eecCCCCcccCChhhHHHHHh-hhcCC-CccccCccCcccCChhhHHHHHhhhCC
Q psy8286 798 YSCTICGKTFHRSYYLTAHVR-VHNGE-RPFSCHICGKTSTTKSNHNKHVKTHHA 850 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r-~Htge-kpy~C~~Cgk~F~~k~~L~~H~r~HH~ 850 (858)
|.|++|++. .+...|..|.. .|..+ +.+.|++|...+. .+|..|+..+|+
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 566666663 34456666643 34332 3466777766544 377777776664
No 27
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.99 E-value=0.00021 Score=64.77 Aligned_cols=70 Identities=27% Similarity=0.582 Sum_probs=10.7
Q ss_pred cCCcccccCChHHHHHHHhhcCCccCCCCCCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhcccc
Q psy8286 509 CDTCSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSDHKN 580 (858)
Q Consensus 509 C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~ 580 (858)
|.+|+..|.+...|..||...|.+..+ ....+.....+..+++.... ..+.|..|++.|.+...|..||+
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~-~~~~l~~~~~~~~~~~~~~~-~~~~C~~C~~~f~s~~~l~~Hm~ 71 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP-DQKYLVDPNRLLNYLRKKVK-ESFRCPYCNKTFRSREALQEHMR 71 (100)
T ss_dssp -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccc-cccccccccccccccccccC-CCCCCCccCCCCcCHHHHHHHHc
Confidence 555555555555555555555544332 11111222333333222111 13445555555555555555544
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.95 E-value=0.00063 Score=55.26 Aligned_cols=49 Identities=29% Similarity=0.615 Sum_probs=25.8
Q ss_pred ccccCCCCcccChHHHHHHHHh-hcCC-CCccccccCccccCchhhhhHHhhhc
Q psy8286 742 FKCKSCNRGFYSSSEYQAHLRS-HRNI-RPFTCNYCGRKFIRKFNCIRHVREHE 793 (858)
Q Consensus 742 ykC~~C~k~F~s~s~L~~H~r~-H~ge-kpy~C~~Cgk~F~~~~~L~~H~r~Ht 793 (858)
|.|+.|++ -.+...|..|... |..+ +.+.|++|...+. .+|..|+..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 55666666 3344556666443 3322 3466666665443 25666665543
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.93 E-value=0.00058 Score=45.14 Aligned_cols=22 Identities=45% Similarity=0.961 Sum_probs=10.1
Q ss_pred eecCCCCcccCChhhHHHHHhh
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRV 819 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~ 819 (858)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3444444444444444444443
No 30
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.85 E-value=0.0006 Score=61.71 Aligned_cols=71 Identities=21% Similarity=0.449 Sum_probs=13.1
Q ss_pred ccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhh
Q psy8286 744 CKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRV 819 (858)
Q Consensus 744 C~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~ 819 (858)
|..|+..|.+...|..|+...++...- ....+.....+..+++.-. ...+.|.+|++.|.+...|..||+.
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp ----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccc----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcC
Confidence 566666666666666666554432110 1111122233333332211 1245566666666666666666554
No 31
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.77 E-value=0.00075 Score=44.60 Aligned_cols=23 Identities=43% Similarity=1.006 Sum_probs=21.7
Q ss_pred cccCccCcccCChhhHHHHHhhh
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTH 848 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~H 848 (858)
|+|+.|++.|.++..|.+|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 78999999999999999999984
No 32
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.76 E-value=0.00088 Score=44.21 Aligned_cols=24 Identities=50% Similarity=1.025 Sum_probs=13.1
Q ss_pred cccCccCcccCChhhHHHHHhhhC
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~HH 849 (858)
|.|++|++.|.++..|..|+++||
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666554
No 33
>KOG2231|consensus
Probab=96.68 E-value=0.0013 Score=77.84 Aligned_cols=74 Identities=20% Similarity=0.510 Sum_probs=34.2
Q ss_pred ccCCcccccCChHHHHHHHhhcC-CccCCCCC---------CccCChhHHHHHHHHhCC-CCc----cccCCCCcccccc
Q psy8286 508 RCDTCSEVFDKLKDYQEHIKNVH-PLPCPVCS---------KTFPSKPSLTLHMRTHLK-IKP----YKCDVCDKSFVTA 572 (858)
Q Consensus 508 ~C~~C~k~F~~~~~L~~H~k~~H-~~~C~~C~---------k~F~s~~~L~~H~r~H~~-ek~----ykC~~C~k~F~s~ 572 (858)
.|..| -.|.+...|+.|+...| .+.|.+|- ....++..|..|++.--. ++. -.|..|...|...
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~ 195 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDD 195 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccH
Confidence 45555 55556666666665444 34444442 122334555555543211 111 2355555555555
Q ss_pred hhhhcccccC
Q psy8286 573 QKLSDHKNTH 582 (858)
Q Consensus 573 ~~L~~H~~~H 582 (858)
..|.+|++.+
T Consensus 196 ~el~rH~~~~ 205 (669)
T KOG2231|consen 196 DELYRHLRFD 205 (669)
T ss_pred HHHHHhhccc
Confidence 5555554443
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.52 E-value=0.0014 Score=43.25 Aligned_cols=24 Identities=29% Similarity=0.595 Sum_probs=20.7
Q ss_pred cccccccccCCChhHHHHHHhhcC
Q psy8286 417 HACIYCKKTFTGWKLRKYHMKAEH 440 (858)
Q Consensus 417 f~C~~C~k~F~~~~~L~~H~~~~H 440 (858)
|.|++|++.|.+...|+.|++.+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 789999999999999999999876
No 35
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.46 E-value=0.001 Score=70.94 Aligned_cols=70 Identities=19% Similarity=0.390 Sum_probs=40.3
Q ss_pred CCCccccc--cCccccCchhhhhHHhh-hcCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHHH
Q psy8286 767 IRPFTCNY--CGRKFIRKFNCIRHVRE-HELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHNK 843 (858)
Q Consensus 767 ekpy~C~~--Cgk~F~~~~~L~~H~r~-Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~~ 843 (858)
+|||+|++ |.|+++....|+.|+.- |...+...=+ .-..|.---...|||+|++|+|.+.....|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence 47777776 77777777777777642 2111100000 00001001234578888888888888888888
Q ss_pred HHh
Q psy8286 844 HVK 846 (858)
Q Consensus 844 H~r 846 (858)
|..
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 865
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.35 E-value=0.002 Score=44.36 Aligned_cols=25 Identities=40% Similarity=0.808 Sum_probs=14.7
Q ss_pred ccccCccCcccCChhhHHHHHhhhC
Q psy8286 825 PFSCHICGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 825 py~C~~Cgk~F~~k~~L~~H~r~HH 849 (858)
||.|..|++.|.+...|..|++.|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 3556666666666666666665554
No 37
>KOG2231|consensus
Probab=96.35 E-value=0.0044 Score=73.58 Aligned_cols=100 Identities=22% Similarity=0.415 Sum_probs=60.8
Q ss_pred cccCCcccccCChHHHHHHHhhcCCccCCCC------CCccCChhHHHHHHHHhCCCCccccC--CCC-ccc----ccch
Q psy8286 507 RRCDTCSEVFDKLKDYQEHIKNVHPLPCPVC------SKTFPSKPSLTLHMRTHLKIKPYKCD--VCD-KSF----VTAQ 573 (858)
Q Consensus 507 ~~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C------~k~F~s~~~L~~H~r~H~~ek~ykC~--~C~-k~F----~s~~ 573 (858)
-.|..|...|-....|.+|++ .+.|.|.+| +..|.....|..|.+.+| |.|. .|. +.| ....
T Consensus 183 p~C~~C~~~fld~~el~rH~~-~~h~~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei 257 (669)
T KOG2231|consen 183 PLCKFCHERFLDDDELYRHLR-FDHEFCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEI 257 (669)
T ss_pred ccchhhhhhhccHHHHHHhhc-cceeheeecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHH
Confidence 468899999999999999988 444667777 356778888888988876 4454 443 222 3334
Q ss_pred hhhcccccCCCCCCccCC--CC----CccccCccchhhhhhhhc
Q psy8286 574 KLSDHKNTHTGNAPFPCN--MC----DRHFKRYSNLAQHKKVIH 611 (858)
Q Consensus 574 ~L~~H~~~H~~ekpy~C~--~C----~k~F~s~~~L~~H~k~~H 611 (858)
.|+.|.+.+.-++.|.|. .= ...+.....+..|.+..+
T Consensus 258 ~lk~~~~~~~~e~~~~~~~~r~Gr~s~~~r~~~~~~~~~~~~~~ 301 (669)
T KOG2231|consen 258 ELKAHNRFIQHEKCYICRPSRPGRPSSRYRGPYRRLESHFRVSD 301 (669)
T ss_pred HHHhhccccchheeccCCcccCCCCcccccCCcccccccccccc
Confidence 455444433334445553 11 223444555566665544
No 38
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.08 E-value=0.0037 Score=42.96 Aligned_cols=25 Identities=40% Similarity=0.716 Sum_probs=19.9
Q ss_pred ceecCCCCcccCChhhHHHHHhhhc
Q psy8286 797 KYSCTICGKTFHRSYYLTAHVRVHN 821 (858)
Q Consensus 797 py~C~~Cgk~F~~~s~L~~H~r~Ht 821 (858)
+|.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 5788888888888888888887765
No 39
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.92 E-value=0.01 Score=63.45 Aligned_cols=30 Identities=30% Similarity=0.514 Sum_probs=25.7
Q ss_pred cccCccccccc--ccccCCChhHHHHHHhhcC
Q psy8286 411 IAKKKKHACIY--CKKTFTGWKLRKYHMKAEH 440 (858)
Q Consensus 411 ~~~~k~f~C~~--C~k~F~~~~~L~~H~~~~H 440 (858)
...+|+|+|++ |+|.+++...|+.|+.--|
T Consensus 344 v~d~KpykCpV~gC~K~YknqnGLKYH~lhGH 375 (423)
T COG5189 344 VKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGH 375 (423)
T ss_pred eecCceecCCCCCchhhhccccchhhhhhccc
Confidence 34469999988 9999999999999998544
No 40
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.30 E-value=0.01 Score=64.05 Aligned_cols=78 Identities=27% Similarity=0.425 Sum_probs=43.0
Q ss_pred CcccCc--CCCcccCchhhhhhhhccCCCCCCCCCCCCCCCccccccc---cccc-------chhhhhhhhcccCCCCcc
Q psy8286 645 PKQCTY--CTEKFTHTSSLTRHIRKNHDPRYLPVDRKKTDENVNCDIC---KIVV-------LKSTLKSHMRIHDNAKPF 712 (858)
Q Consensus 645 ~~~C~~--C~k~F~s~~~L~~H~~~~H~~~~~~~~~~~~~~~~~C~~C---~~~~-------~~s~L~~H~~~H~~~k~y 712 (858)
.|.|+. |.........|..|.+..|+. +-|.+| ++.| .+..|..|...-..+..|
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~-------------~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GF 217 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF-------------VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGF 217 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc-------------EEhHhhhcCcccCccceeeeecccccccccCCccccCc
Confidence 466654 666666677888888877753 567777 2222 234555555443333222
Q ss_pred ----ccCccccccccchhhcchhhhcc
Q psy8286 713 ----SCTVCKKSFYTKWNLGLHQWTHV 735 (858)
Q Consensus 713 ----~C~~C~ksF~s~~~L~~H~~~H~ 735 (858)
.|..|...|.+...|.+|++..|
T Consensus 218 KGHP~C~FC~~~FYdDDEL~~HcR~~H 244 (493)
T COG5236 218 KGHPLCIFCKIYFYDDDELRRHCRLRH 244 (493)
T ss_pred CCCchhhhccceecChHHHHHHHHhhh
Confidence 25555555555555555555433
No 41
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.07 E-value=0.013 Score=46.23 Aligned_cols=31 Identities=29% Similarity=0.683 Sum_probs=16.0
Q ss_pred CCccccCccCcccCChhhHHHHHhhhCCCCC
Q psy8286 823 ERPFSCHICGKTSTTKSNHNKHVKTHHARET 853 (858)
Q Consensus 823 ekpy~C~~Cgk~F~~k~~L~~H~r~HH~~ep 853 (858)
+.|-.|++|+..+.+..+|++|+..+|+.+|
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 4556666666666666666666666666554
No 42
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.02 E-value=0.012 Score=39.23 Aligned_cols=24 Identities=38% Similarity=0.855 Sum_probs=12.9
Q ss_pred cccCccCcccCChhhHHHHHhhhCC
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTHHA 850 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~HH~ 850 (858)
|+|..|++... +..|.+|++.||+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 45666666555 5566666666553
No 43
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.97 E-value=0.0094 Score=67.63 Aligned_cols=63 Identities=25% Similarity=0.450 Sum_probs=30.6
Q ss_pred ccccCccccccccchhhcchhh--hcccCCCcccccc--CCCCcccChHHHHHHHHhhcCCCCcccc
Q psy8286 711 PFSCTVCKKSFYTKWNLGLHQW--THVSRMHMPFKCK--SCNRGFYSSSEYQAHLRSHRNIRPFTCN 773 (858)
Q Consensus 711 ~y~C~~C~ksF~s~~~L~~H~~--~H~~~~~kpykC~--~C~k~F~s~s~L~~H~r~H~gekpy~C~ 773 (858)
++.|..|...|.....|..|.+ .|.+...+++.|+ .|++.|.....+..|...|.+.+++.|.
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEK 355 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccc
Confidence 3455555555555555555555 3333111445555 4555555555555555555544444443
No 44
>PRK04860 hypothetical protein; Provisional
Probab=94.88 E-value=0.014 Score=58.17 Aligned_cols=39 Identities=33% Similarity=0.686 Sum_probs=29.1
Q ss_pred cceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCCh
Q psy8286 796 KKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTK 838 (858)
Q Consensus 796 kpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k 838 (858)
-+|.|. |++ ....+.+|.++|+++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 367887 776 666777888888888888888888777644
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=94.81 E-value=0.021 Score=37.98 Aligned_cols=22 Identities=41% Similarity=0.845 Sum_probs=10.1
Q ss_pred ecCCCCcccCChhhHHHHHhhh
Q psy8286 799 SCTICGKTFHRSYYLTAHVRVH 820 (858)
Q Consensus 799 ~C~~Cgk~F~~~s~L~~H~r~H 820 (858)
.|..|++.|.....|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 3444444444444444444433
No 46
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.39 E-value=0.029 Score=44.32 Aligned_cols=33 Identities=24% Similarity=0.596 Sum_probs=21.7
Q ss_pred cCCcceecCCCCcccCChhhHHHHHhhhcCCCc
Q psy8286 793 ELKKKYSCTICGKTFHRSYYLTAHVRVHNGERP 825 (858)
Q Consensus 793 tgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekp 825 (858)
..+.|-.|++|+..+++..+|++|+.++++.+|
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 346778888888888888888888887777665
No 47
>smart00355 ZnF_C2H2 zinc finger.
Probab=94.36 E-value=0.033 Score=36.96 Aligned_cols=24 Identities=38% Similarity=0.776 Sum_probs=21.9
Q ss_pred cccCccCcccCChhhHHHHHhhhC
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~HH 849 (858)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 689999999999999999999654
No 48
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=94.33 E-value=0.021 Score=61.68 Aligned_cols=127 Identities=24% Similarity=0.413 Sum_probs=74.7
Q ss_pred cccccc--cccccc-hhhhhhhhcccCCCCccccCccc---cccc------cchhhcchhhhcccC-CCc-cccccCCCC
Q psy8286 684 NVNCDI--CKIVVL-KSTLKSHMRIHDNAKPFSCTVCK---KSFY------TKWNLGLHQWTHVSR-MHM-PFKCKSCNR 749 (858)
Q Consensus 684 ~~~C~~--C~~~~~-~s~L~~H~~~H~~~k~y~C~~C~---ksF~------s~~~L~~H~~~H~~~-~~k-pykC~~C~k 749 (858)
.|.|+. |..... ...|+.|.+..++. +.|.+|- +.|. +...|+.|...-... +-+ --.|..|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 366763 322221 34677787765543 6788874 3443 344566664332111 111 125888888
Q ss_pred cccChHHHHHHHHhhcCCCCccccccCcc-------ccCchhhhhHHhhhcCCcceecCC--CC----cccCChhhHHHH
Q psy8286 750 GFYSSSEYQAHLRSHRNIRPFTCNYCGRK-------FIRKFNCIRHVREHELKKKYSCTI--CG----KTFHRSYYLTAH 816 (858)
Q Consensus 750 ~F~s~s~L~~H~r~H~gekpy~C~~Cgk~-------F~~~~~L~~H~r~Htgekpy~C~~--Cg----k~F~~~s~L~~H 816 (858)
.|.+-..|.+|+|..+. .|-+|++. |+....|..|.+.-+ |.|.+ |- ..|.....|..|
T Consensus 229 ~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h 300 (493)
T COG5236 229 YFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEH 300 (493)
T ss_pred eecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHH
Confidence 88888888888887552 46677653 667777777765432 55533 31 467777778888
Q ss_pred Hhhh
Q psy8286 817 VRVH 820 (858)
Q Consensus 817 ~r~H 820 (858)
+..-
T Consensus 301 ~~~~ 304 (493)
T COG5236 301 LTRF 304 (493)
T ss_pred HHHH
Confidence 6443
No 49
>KOG2482|consensus
Probab=94.12 E-value=0.025 Score=61.18 Aligned_cols=137 Identities=28% Similarity=0.453 Sum_probs=83.8
Q ss_pred ccccCCcccccC-ChHHHHHHHhhcC-------------------------CccCCCCCCccCChhHHHHHHHH--hCCC
Q psy8286 506 QRRCDTCSEVFD-KLKDYQEHIKNVH-------------------------PLPCPVCSKTFPSKPSLTLHMRT--HLKI 557 (858)
Q Consensus 506 ~~~C~~C~k~F~-~~~~L~~H~k~~H-------------------------~~~C~~C~k~F~s~~~L~~H~r~--H~~e 557 (858)
..+|-+|...+. +++....|+-.+| .+.|-.|.+.|+.+..|+.||+. |...
T Consensus 144 slqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrri 223 (423)
T KOG2482|consen 144 SLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRRI 223 (423)
T ss_pred eeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCccc
Confidence 457999988764 5677777776555 47899999999999999999985 4332
Q ss_pred Cc----c----ccCC--CCcccc-c-----chhhhcccccC--------CCCCC--ccCCCCCccccCccchhhhhhhhc
Q psy8286 558 KP----Y----KCDV--CDKSFV-T-----AQKLSDHKNTH--------TGNAP--FPCNMCDRHFKRYSNLAQHKKVIH 611 (858)
Q Consensus 558 k~----y----kC~~--C~k~F~-s-----~~~L~~H~~~H--------~~ekp--y~C~~C~k~F~s~~~L~~H~k~~H 611 (858)
.| | .=.+ =+++-. . +..+..+...- .+..+ ..|-.|.....+...|..||+..|
T Consensus 224 nPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHmk~vH 303 (423)
T KOG2482|consen 224 NPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHMKIVH 303 (423)
T ss_pred CCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHHHHHH
Confidence 22 1 0000 011100 0 01111111000 01112 579999998888999999999887
Q ss_pred ccc-----------------------cccccccc-ccCcccccccchhhhhhhcc
Q psy8286 612 LKK-----------------------KKIKDFYC-HCGEIFQSRGKLLWHQETHA 642 (858)
Q Consensus 612 ~~~-----------------------k~~k~~~C-~C~k~F~s~~~L~~H~~~h~ 642 (858)
.-. +......| .|.-.|.....|..|+..+.
T Consensus 304 e~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~k 358 (423)
T KOG2482|consen 304 EFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVEDK 358 (423)
T ss_pred HhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccccc
Confidence 311 01122345 58889999999999986543
No 50
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=93.84 E-value=0.047 Score=36.42 Aligned_cols=24 Identities=25% Similarity=0.564 Sum_probs=16.8
Q ss_pred eecCCCCcccCChhhHHHHHhhhcC
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRVHNG 822 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~Htg 822 (858)
|+|..|++... +..|.+|++.|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 67888888777 7788888877653
No 51
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.60 E-value=0.045 Score=62.09 Aligned_cols=148 Identities=28% Similarity=0.443 Sum_probs=100.9
Q ss_pred cccccccccccc-hhhhhhhhc--ccCCC--CccccC--ccccccccchhhcchhhhcccCCCcccccc--CCCCcccCh
Q psy8286 684 NVNCDICKIVVL-KSTLKSHMR--IHDNA--KPFSCT--VCKKSFYTKWNLGLHQWTHVSRMHMPFKCK--SCNRGFYSS 754 (858)
Q Consensus 684 ~~~C~~C~~~~~-~s~L~~H~~--~H~~~--k~y~C~--~C~ksF~s~~~L~~H~~~H~~~~~kpykC~--~C~k~F~s~ 754 (858)
.+.|..|...+. ...|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+. .++.|. .|...+...
T Consensus 289 ~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 366 (467)
T COG5048 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSI--SPAKEKLLNSSSKFSPL 366 (467)
T ss_pred CCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCC--CccccccccCccccccc
Confidence 456666765554 456777777 67777 888898 7999999999999998888764 233443 344444333
Q ss_pred HHH-----HHHHHhhcCCCCccccc--cCccccCchhhhhHHhhhcCCc--ceecCCCCcccCChhhHHHHHhhhcCCCc
Q psy8286 755 SEY-----QAHLRSHRNIRPFTCNY--CGRKFIRKFNCIRHVREHELKK--KYSCTICGKTFHRSYYLTAHVRVHNGERP 825 (858)
Q Consensus 755 s~L-----~~H~r~H~gekpy~C~~--Cgk~F~~~~~L~~H~r~Htgek--py~C~~Cgk~F~~~s~L~~H~r~Htgekp 825 (858)
..- ......-...+.+.|.. |-..+.+...+..|...|.... .+.|..|.+.|.....+..|++.|....+
T Consensus 367 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467)
T COG5048 367 LNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAP 446 (467)
T ss_pred cCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCc
Confidence 221 11111222334455544 7778888888888888887665 56778899999999999999999988887
Q ss_pred cccCccCc
Q psy8286 826 FSCHICGK 833 (858)
Q Consensus 826 y~C~~Cgk 833 (858)
+.|..+..
T Consensus 447 ~~~~~~~~ 454 (467)
T COG5048 447 LLCSILKS 454 (467)
T ss_pred eeeccccc
Confidence 77655443
No 52
>PRK04860 hypothetical protein; Provisional
Probab=93.50 E-value=0.035 Score=55.42 Aligned_cols=35 Identities=29% Similarity=0.695 Sum_probs=17.3
Q ss_pred ccccccCccccCchhhhhHHhhhcCCcceecCCCCcccC
Q psy8286 770 FTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFH 808 (858)
Q Consensus 770 y~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~ 808 (858)
|.|. |++ ....+++|.++|+++++|.|..|+..|.
T Consensus 120 Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 120 YRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 4554 444 3344455555555555555555555443
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=93.03 E-value=0.066 Score=35.94 Aligned_cols=21 Identities=38% Similarity=0.871 Sum_probs=10.6
Q ss_pred eecCCCCcccCChhhHHHHHh
Q psy8286 798 YSCTICGKTFHRSYYLTAHVR 818 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r 818 (858)
|.|.+|++.|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 345555555555555555544
No 54
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=92.88 E-value=0.058 Score=36.20 Aligned_cols=23 Identities=26% Similarity=0.772 Sum_probs=21.2
Q ss_pred cccCccCcccCChhhHHHHHhhh
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTH 848 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~H 848 (858)
|.|.+|++.|.+...|..|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 68999999999999999999864
No 55
>KOG2785|consensus
Probab=92.72 E-value=0.098 Score=57.95 Aligned_cols=78 Identities=18% Similarity=0.283 Sum_probs=47.4
Q ss_pred ccccccccccCCChhHHHHHHhhc-CCCCC-----Cc-hhhhh-hccccc-ccccccc-CCCCCCcccccCCCccccCCc
Q psy8286 416 KHACIYCKKTFTGWKLRKYHMKAE-HPELF-----NE-PETIK-LLSKTQ-ASTSTVI-SEPSTSFVSQAVCPIKFENSN 485 (858)
Q Consensus 416 ~f~C~~C~k~F~~~~~L~~H~~~~-H~~~~-----~~-~~~~~-~~~k~~-~~~~~~i-~~~~~~~~~C~~C~~~F~~~~ 485 (858)
-|.|.-|...|.+...-+.|+++. |.-.. .- +.... |-.+.. ....... .+....++.|.+|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 388999999999999999999873 21000 00 00000 111110 1111111 133466788999999999999
Q ss_pred hhhhhhhh
Q psy8286 486 SFGTQIQE 493 (858)
Q Consensus 486 ~L~~H~~~ 493 (858)
....|+..
T Consensus 83 a~~~hl~S 90 (390)
T KOG2785|consen 83 AHENHLKS 90 (390)
T ss_pred hHHHHHHH
Confidence 88888864
No 56
>KOG2482|consensus
Probab=92.18 E-value=0.13 Score=55.89 Aligned_cols=74 Identities=24% Similarity=0.315 Sum_probs=46.1
Q ss_pred cccCCcccccCChHHHHHHHhhcCCccCCCC----CCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhcccccC
Q psy8286 507 RRCDTCSEVFDKLKDYQEHIKNVHPLPCPVC----SKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSDHKNTH 582 (858)
Q Consensus 507 ~~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C----~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~~H 582 (858)
..|-+|.....+...|..||+.+|.|-=..= +--|..+-.+..=.|.. .+.-.|-.|...|.....|..||..+
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq--~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQ--KKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHH--hhccccccccccccCcchhhhhcccc
Confidence 5899999999999999999998884321100 01111111121112211 12345888889999999999998644
No 57
>KOG2785|consensus
Probab=91.70 E-value=0.18 Score=55.90 Aligned_cols=51 Identities=18% Similarity=0.323 Sum_probs=43.2
Q ss_pred ceecCCCCcccCChhhHHHHHhhhcCC-----------------------CccccCccC---cccCChhhHHHHHhh
Q psy8286 797 KYSCTICGKTFHRSYYLTAHVRVHNGE-----------------------RPFSCHICG---KTSTTKSNHNKHVKT 847 (858)
Q Consensus 797 py~C~~Cgk~F~~~s~L~~H~r~Htge-----------------------kpy~C~~Cg---k~F~~k~~L~~H~r~ 847 (858)
|-.|-.|++.|.+-..-..||..|+|- .-|.|-.|. +.|.+....+.||..
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 567999999999998889999888772 237899999 999999999999963
No 58
>KOG1146|consensus
Probab=91.38 E-value=0.045 Score=68.73 Aligned_cols=130 Identities=12% Similarity=0.078 Sum_probs=101.9
Q ss_pred CcccccCCCccccCCchhhhhhhhccCCCccccCCcccccCCcccccCChHHHHHHHhhcCCccCCCCCCccCChhHHHH
Q psy8286 470 SFVSQAVCPIKFENSNSFGTQIQEFLPPSLTSIKNKQRRCDTCSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSKPSLTL 549 (858)
Q Consensus 470 ~~~~C~~C~~~F~~~~~L~~H~~~~h~~~~~~~~~k~~~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~~~L~~ 549 (858)
..+.|..|...|........|-...- ........+.|..|++.|.-...+. |+-..|.|.|..|...|.....|..
T Consensus 1227 ~gl~~~~~e~~f~~~~~~~~~a~~~~---~~~~~sGe~~c~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~l~~ 1302 (1406)
T KOG1146|consen 1227 IGLLPNALEQPFPQEPEPTATAPPKP---PELPASGEGECGAVDELLTPSFGIS-TLDVTHRYLCRQCKMAFDGEAPLTA 1302 (1406)
T ss_pred hcccHHhhhcCccCcccccccCCCCC---CcCcCCCcchhhhccccccCcccee-ecccchhHHHHHHHhhhcchhHHHH
Confidence 34567777777877766655543211 1112334578999999999988888 9888899999999999999999999
Q ss_pred HHHHhCCCCccccCCCCcccccchhhhcccccCCCCCCccCCCCCccccCccchhhhhhhhccccc
Q psy8286 550 HMRTHLKIKPYKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCDRHFKRYSNLAQHKKVIHLKKK 615 (858)
Q Consensus 550 H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~k~F~s~~~L~~H~k~~H~~~k 615 (858)
|. ..|-+.|.....+.-|+..+.....| |..|...|.....|..||+..+.+.+
T Consensus 1303 ~~-----------~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~~k 1356 (1406)
T KOG1146|consen 1303 HQ-----------RKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHRRK 1356 (1406)
T ss_pred HH-----------HHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhccc
Confidence 87 23556777888888898888888888 99999999999999999998776543
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=90.22 E-value=0.12 Score=35.52 Aligned_cols=22 Identities=36% Similarity=0.519 Sum_probs=20.7
Q ss_pred cccccccccCCChhHHHHHHhh
Q psy8286 417 HACIYCKKTFTGWKLRKYHMKA 438 (858)
Q Consensus 417 f~C~~C~k~F~~~~~L~~H~~~ 438 (858)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999975
No 60
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=89.56 E-value=0.15 Score=35.12 Aligned_cols=21 Identities=29% Similarity=0.710 Sum_probs=10.2
Q ss_pred eecCCCCcccCChhhHHHHHh
Q psy8286 798 YSCTICGKTFHRSYYLTAHVR 818 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r 818 (858)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 344555555555555544443
No 61
>KOG4173|consensus
Probab=88.42 E-value=0.12 Score=52.35 Aligned_cols=76 Identities=30% Similarity=0.671 Sum_probs=58.6
Q ss_pred cccccCC--cccccCChHHHHHHHhhcCCccCCCCCCccCChhHHHHHHHHh----------CCCCcccc--CCCCcccc
Q psy8286 505 KQRRCDT--CSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSKPSLTLHMRTH----------LKIKPYKC--DVCDKSFV 570 (858)
Q Consensus 505 k~~~C~~--C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~~~L~~H~r~H----------~~ek~ykC--~~C~k~F~ 570 (858)
..|.|+. |...|.+..++..|....|...|.+|.+.|.+...|..|+..- .|...|+| ..|+..|.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KFk 157 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGNSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKFK 157 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccchhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhhh
Confidence 3456654 8888999999999988888889999999999999999997542 24456777 34777777
Q ss_pred cchhhhcccc
Q psy8286 571 TAQKLSDHKN 580 (858)
Q Consensus 571 s~~~L~~H~~ 580 (858)
+...-++|+.
T Consensus 158 T~r~RkdH~I 167 (253)
T KOG4173|consen 158 TSRDRKDHMI 167 (253)
T ss_pred hhhhhhhHHH
Confidence 7777777754
No 62
>KOG4173|consensus
Probab=85.63 E-value=0.32 Score=49.50 Aligned_cols=77 Identities=27% Similarity=0.618 Sum_probs=49.3
Q ss_pred cccccC--CCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhh-h---------cCCcceecCC--CCcc
Q psy8286 741 PFKCKS--CNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVRE-H---------ELKKKYSCTI--CGKT 806 (858)
Q Consensus 741 pykC~~--C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~-H---------tgekpy~C~~--Cgk~ 806 (858)
.|.|.+ |.+.|..-..+..|...-++. .|.+|.+.|.+...|..|+.. | .|...|+|-+ |+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~~---sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHGN---SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhcccc---hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 466654 666777777777776554433 577777777777777777643 2 2445577743 7777
Q ss_pred cCChhhHHHHH-hhh
Q psy8286 807 FHRSYYLTAHV-RVH 820 (858)
Q Consensus 807 F~~~s~L~~H~-r~H 820 (858)
|.+...-+.|| ++|
T Consensus 156 FkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhHHHHhc
Confidence 77777777774 344
No 63
>KOG2893|consensus
Probab=83.76 E-value=0.45 Score=49.31 Aligned_cols=42 Identities=33% Similarity=0.616 Sum_probs=26.9
Q ss_pred ccCCcccccCChHHHHHHHhhcCCccCCCCCCccCChhHHHHH
Q psy8286 508 RCDTCSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSKPSLTLH 550 (858)
Q Consensus 508 ~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~~~L~~H 550 (858)
.|=+|++.|.+...|..|++.. .|+|.+|.|...+--.|..|
T Consensus 12 wcwycnrefddekiliqhqkak-hfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAK-HFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhhc-cceeeeehhhhccCCCceee
Confidence 4777777777777777777633 36666666665555555555
No 64
>KOG2893|consensus
Probab=82.82 E-value=0.37 Score=49.96 Aligned_cols=46 Identities=28% Similarity=0.600 Sum_probs=23.9
Q ss_pred cCCCCcccCChhhHHHHHhhhcCCCccccCccCcccCChhhHHHHHhhhC
Q psy8286 800 CTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 800 C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~~k~~L~~H~r~HH 849 (858)
|-+|++.|....-|..|++. |-|+|.+|.|...+--.|..|....|
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhh
Confidence 55555555555555555432 23555555555555555555544333
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=82.56 E-value=0.89 Score=30.90 Aligned_cols=21 Identities=24% Similarity=0.377 Sum_probs=18.2
Q ss_pred cccccccccCCChhHHHHHHhh
Q psy8286 417 HACIYCKKTFTGWKLRKYHMKA 438 (858)
Q Consensus 417 f~C~~C~k~F~~~~~L~~H~~~ 438 (858)
..|+.|++.| ....|.+|++.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4699999999 78889999864
No 66
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=81.93 E-value=2.3 Score=39.88 Aligned_cols=49 Identities=20% Similarity=0.352 Sum_probs=28.2
Q ss_pred cccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhhhc
Q psy8286 771 TCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHN 821 (858)
Q Consensus 771 ~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Ht 821 (858)
.|--|...|........ ..-.....|.|..|...|-..-+.-.|...|.
T Consensus 57 ~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPF--DELKDSHRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred cccCcCCCCCCcccccc--cccccccceeCCCCCCccccccchhhhhhccC
Confidence 37777777765421110 00122345777777777777777777766664
No 67
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=80.44 E-value=1.1 Score=32.35 Aligned_cols=22 Identities=32% Similarity=0.616 Sum_probs=14.0
Q ss_pred ccccCccCcccCChhhHHHHHh
Q psy8286 825 PFSCHICGKTSTTKSNHNKHVK 846 (858)
Q Consensus 825 py~C~~Cgk~F~~k~~L~~H~r 846 (858)
+|.|.+|++.|.....+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3556666666666666666664
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=79.09 E-value=1.3 Score=32.05 Aligned_cols=23 Identities=30% Similarity=0.432 Sum_probs=21.2
Q ss_pred ccccccccccCCChhHHHHHHhh
Q psy8286 416 KHACIYCKKTFTGWKLRKYHMKA 438 (858)
Q Consensus 416 ~f~C~~C~k~F~~~~~L~~H~~~ 438 (858)
+|.|..|++.|.+...+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999975
No 69
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=78.74 E-value=2.6 Score=39.22 Aligned_cols=25 Identities=28% Similarity=0.672 Sum_probs=23.5
Q ss_pred ccc----CccCcccCChhhHHHHHhhhCC
Q psy8286 826 FSC----HICGKTSTTKSNHNKHVKTHHA 850 (858)
Q Consensus 826 y~C----~~Cgk~F~~k~~L~~H~r~HH~ 850 (858)
|.| ..|++.+.+...+.+|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 889 9999999999999999999986
No 70
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=78.07 E-value=1.7 Score=29.51 Aligned_cols=19 Identities=37% Similarity=0.882 Sum_probs=9.5
Q ss_pred ecCCCCcccCChhhHHHHHh
Q psy8286 799 SCTICGKTFHRSYYLTAHVR 818 (858)
Q Consensus 799 ~C~~Cgk~F~~~s~L~~H~r 818 (858)
.|..||+.| ....|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4444555543
No 71
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=71.02 E-value=2.8 Score=30.44 Aligned_cols=10 Identities=40% Similarity=1.358 Sum_probs=5.9
Q ss_pred CccccCccCc
Q psy8286 824 RPFSCHICGK 833 (858)
Q Consensus 824 kpy~C~~Cgk 833 (858)
.++.|++||.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 4556666654
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=70.23 E-value=1.8 Score=34.97 Aligned_cols=29 Identities=28% Similarity=0.561 Sum_probs=16.1
Q ss_pred CCCccccCccCcccCChhhHHHHHhhhCC
Q psy8286 822 GERPFSCHICGKTSTTKSNHNKHVKTHHA 850 (858)
Q Consensus 822 gekpy~C~~Cgk~F~~k~~L~~H~r~HH~ 850 (858)
||--+.|+.||+.|.....+.+|+...|+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 44445566666666555555555555443
No 73
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=66.42 E-value=7.5 Score=36.15 Aligned_cols=21 Identities=10% Similarity=0.154 Sum_probs=11.3
Q ss_pred CCCCcccccchhhhcccccCC
Q psy8286 563 DVCDKSFVTAQKLSDHKNTHT 583 (858)
Q Consensus 563 ~~C~k~F~s~~~L~~H~~~H~ 583 (858)
..|++.+.+...+.+|++.+|
T Consensus 88 ~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 88 PHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CCCCcEeccHHHHHHHHHHhc
Confidence 555555555555555555443
No 74
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=66.09 E-value=10 Score=35.54 Aligned_cols=18 Identities=11% Similarity=0.231 Sum_probs=8.4
Q ss_pred cccCCcccccCChHHHHH
Q psy8286 507 RRCDTCSEVFDKLKDYQE 524 (858)
Q Consensus 507 ~~C~~C~k~F~~~~~L~~ 524 (858)
..|+.|+-..-+..+|.+
T Consensus 16 ~~CpiCgLtLVss~HLAR 33 (112)
T TIGR00622 16 VECPICGLTLILSTHLAR 33 (112)
T ss_pred CcCCcCCCEEeccchHHH
Confidence 345555554444444443
No 75
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=62.90 E-value=3.1 Score=33.68 Aligned_cols=34 Identities=24% Similarity=0.254 Sum_probs=29.3
Q ss_pred cccCcccccccccccCCChhHHHHHHhhcCCCCC
Q psy8286 411 IAKKKKHACIYCKKTFTGWKLRKYHMKAEHPELF 444 (858)
Q Consensus 411 ~~~~k~f~C~~C~k~F~~~~~L~~H~~~~H~~~~ 444 (858)
..++.-++|+-|+..|.......+|+...|+-++
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~~~ 45 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHGWLF 45 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhhhhh
Confidence 3467778999999999999999999999887654
No 76
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=61.93 E-value=5.3 Score=28.96 Aligned_cols=24 Identities=33% Similarity=0.913 Sum_probs=13.3
Q ss_pred cccCCCCcccccchhhhcccccCCCCCCccCCCCCc
Q psy8286 560 YKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCDR 595 (858)
Q Consensus 560 ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~k 595 (858)
|+|..||..+.... .++.|++|+.
T Consensus 2 ~~C~~CGy~y~~~~------------~~~~CP~Cg~ 25 (33)
T cd00350 2 YVCPVCGYIYDGEE------------APWVCPVCGA 25 (33)
T ss_pred EECCCCCCEECCCc------------CCCcCcCCCC
Confidence 55666665543322 4566666654
No 77
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=60.75 E-value=2.4 Score=44.54 Aligned_cols=27 Identities=30% Similarity=0.479 Sum_probs=22.3
Q ss_pred CcccccccccccCCChhHHHHHHhhcC
Q psy8286 414 KKKHACIYCKKTFTGWKLRKYHMKAEH 440 (858)
Q Consensus 414 ~k~f~C~~C~k~F~~~~~L~~H~~~~H 440 (858)
+|.+.||+|++.|.+...+....+...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~ 29 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIR 29 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEee
Confidence 467899999999999988887776543
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=57.41 E-value=3.9 Score=42.95 Aligned_cols=43 Identities=23% Similarity=0.497 Sum_probs=24.3
Q ss_pred CCccccccCccccCchhhhhHHhh---h-------cCCcc-----eecCCCCcccCCh
Q psy8286 768 RPFTCNYCGRKFIRKFNCIRHVRE---H-------ELKKK-----YSCTICGKTFHRS 810 (858)
Q Consensus 768 kpy~C~~Cgk~F~~~~~L~~H~r~---H-------tgekp-----y~C~~Cgk~F~~~ 810 (858)
+.+.|++|++.|.++.-+....+. . .+..| ..|+.||++|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 446677777777666544444432 1 11222 4688888877654
No 79
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=51.46 E-value=8.3 Score=29.64 Aligned_cols=28 Identities=39% Similarity=0.579 Sum_probs=19.2
Q ss_pred cCcccccccccccCCCh----hHHHHHHhhcC
Q psy8286 413 KKKKHACIYCKKTFTGW----KLRKYHMKAEH 440 (858)
Q Consensus 413 ~~k~f~C~~C~k~F~~~----~~L~~H~~~~H 440 (858)
++...+|.+|++.+... +.|.+|++..|
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 45668999999998774 88999997655
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=50.99 E-value=9.8 Score=27.88 Aligned_cols=24 Identities=38% Similarity=0.994 Sum_probs=13.3
Q ss_pred eecCCCCcccCChhhHHHHHhhhcCCCccccCccCc
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGK 833 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk 833 (858)
|+|.+||..+... +.|..|++||.
T Consensus 3 ~~C~~CG~i~~g~------------~~p~~CP~Cg~ 26 (34)
T cd00729 3 WVCPVCGYIHEGE------------EAPEKCPICGA 26 (34)
T ss_pred EECCCCCCEeECC------------cCCCcCcCCCC
Confidence 5666666654322 13556666664
No 81
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=50.24 E-value=13 Score=28.61 Aligned_cols=22 Identities=36% Similarity=0.693 Sum_probs=8.6
Q ss_pred cccCccCcccCCh----hhHHHHHhh
Q psy8286 826 FSCHICGKTSTTK----SNHNKHVKT 847 (858)
Q Consensus 826 y~C~~Cgk~F~~k----~~L~~H~r~ 847 (858)
.+|.+|++.+... +.|.+|++.
T Consensus 17 a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 17 AKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp EEETTTTEE-----SSTHHHHHHHHH
T ss_pred EEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 3455555444432 455555533
No 82
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=48.69 E-value=11 Score=35.36 Aligned_cols=30 Identities=37% Similarity=0.841 Sum_probs=15.4
Q ss_pred ccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCc
Q psy8286 742 FKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRK 782 (858)
Q Consensus 742 ykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~ 782 (858)
..|+.||++|..- +..|..|++||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 3455566555442 22455566666555443
No 83
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=48.50 E-value=7.3 Score=41.36 Aligned_cols=25 Identities=28% Similarity=0.507 Sum_probs=14.8
Q ss_pred cceecCCCCcccCChhhHHHHHhhh
Q psy8286 796 KKYSCTICGKTFHRSYYLTAHVRVH 820 (858)
Q Consensus 796 kpy~C~~Cgk~F~~~s~L~~H~r~H 820 (858)
+++.|+.||+.......|..-.|+|
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCcccccccceeeeecc
Confidence 5666777766666555555444444
No 84
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=46.08 E-value=7.5 Score=38.70 Aligned_cols=12 Identities=33% Similarity=0.908 Sum_probs=5.3
Q ss_pred ccccccCccccC
Q psy8286 770 FTCNYCGRKFIR 781 (858)
Q Consensus 770 y~C~~Cgk~F~~ 781 (858)
++|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 444444444443
No 85
>KOG2186|consensus
Probab=45.13 E-value=8.8 Score=40.70 Aligned_cols=44 Identities=32% Similarity=0.569 Sum_probs=26.1
Q ss_pred ccccccccccchhhhhhhhcccCCCCccccCccccccccchhhcch
Q psy8286 685 VNCDICKIVVLKSTLKSHMRIHDNAKPFSCTVCKKSFYTKWNLGLH 730 (858)
Q Consensus 685 ~~C~~C~~~~~~s~L~~H~~~H~~~k~y~C~~C~ksF~s~~~L~~H 730 (858)
|.|..|+....+..+..|+..-.+ .-|.|--|++.|.. .....|
T Consensus 4 FtCnvCgEsvKKp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH 47 (276)
T KOG2186|consen 4 FTCNVCGESVKKPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNH 47 (276)
T ss_pred EehhhhhhhccccchHHHHHhccC-CeeEEeeccccccc-chhhhh
Confidence 456666666665566666655444 45666666666665 445555
No 86
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=43.55 E-value=11 Score=36.60 Aligned_cols=24 Identities=33% Similarity=0.696 Sum_probs=10.8
Q ss_pred cccCccCcccCChhhHHHHHhhhCCCC
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKTHHARE 852 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~HH~~e 852 (858)
..|-.||+.|. .|++|+++|||..
T Consensus 73 i~clecGk~~k---~LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFK---TLKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EES---BHHHHHHHTT-S-
T ss_pred eEEccCCcccc---hHHHHHHHccCCC
Confidence 35666666664 3366666666543
No 87
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=42.40 E-value=13 Score=37.11 Aligned_cols=18 Identities=17% Similarity=0.587 Sum_probs=12.1
Q ss_pred ccCCCCCccccCccchhh
Q psy8286 588 FPCNMCDRHFKRYSNLAQ 605 (858)
Q Consensus 588 y~C~~C~k~F~s~~~L~~ 605 (858)
++|+.|+++|.+...+..
T Consensus 29 ~~c~~c~~~f~~~e~~~~ 46 (154)
T PRK00464 29 RECLACGKRFTTFERVEL 46 (154)
T ss_pred eeccccCCcceEeEeccC
Confidence 677777777776655443
No 88
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=40.73 E-value=19 Score=44.54 Aligned_cols=12 Identities=33% Similarity=0.672 Sum_probs=7.2
Q ss_pred CcceecCCCCcc
Q psy8286 795 KKKYSCTICGKT 806 (858)
Q Consensus 795 ekpy~C~~Cgk~ 806 (858)
..|..|+.||..
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 356666666653
No 89
>KOG2186|consensus
Probab=40.51 E-value=18 Score=38.51 Aligned_cols=46 Identities=35% Similarity=0.727 Sum_probs=28.0
Q ss_pred ccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHh
Q psy8286 770 FTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVR 818 (858)
Q Consensus 770 y~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r 818 (858)
|.|..||...... .+.+|+..-++ .-|.|--|++.|-+ .....|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 5677777665543 35556666555 45677777777766 45555644
No 90
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=40.36 E-value=16 Score=45.23 Aligned_cols=23 Identities=26% Similarity=0.771 Sum_probs=14.7
Q ss_pred cccccCCCCcccChHHHHHHHHhhcCCCCccccccCcc
Q psy8286 741 PFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRK 778 (858)
Q Consensus 741 pykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~ 778 (858)
...|..||.. ...|..|+.||-.
T Consensus 462 ~L~CH~Cg~~---------------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 462 QLRCHYCGYQ---------------EPIPQSCPECGSE 484 (730)
T ss_pred eeEeCCCCCC---------------CCCCCCCCCCCCC
Confidence 4567777753 2356778888754
No 91
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=40.25 E-value=21 Score=38.07 Aligned_cols=89 Identities=17% Similarity=0.438 Sum_probs=47.1
Q ss_pred CCcccccCCcccccCChHHHHHHHh-----hcCCccCCCCCCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhc
Q psy8286 503 KNKQRRCDTCSEVFDKLKDYQEHIK-----NVHPLPCPVCSKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSD 577 (858)
Q Consensus 503 ~~k~~~C~~C~k~F~~~~~L~~H~k-----~~H~~~C~~C~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~ 577 (858)
+.+.|+|.+|..-.---..+ .|+- ....|+|.-|++. ..|.|-.|...|-...-.+.
T Consensus 139 GGrif~CsfC~~flCEDDQF-EHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfCddHvrrK 200 (314)
T PF06524_consen 139 GGRIFKCSFCDNFLCEDDQF-EHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFCDDHVRRK 200 (314)
T ss_pred CCeEEEeecCCCeeeccchh-hhhhhhhhhhcccccccccccc-----------------cchhhhheeeeehhhhhhhc
Confidence 55677787776544322222 2321 1235777777653 33445555444432211111
Q ss_pred ccccCCCCCCccCCCCCccccCccchhhhhhhh
Q psy8286 578 HKNTHTGNAPFPCNMCDRHFKRYSNLAQHKKVI 610 (858)
Q Consensus 578 H~~~H~~ekpy~C~~C~k~F~s~~~L~~H~k~~ 610 (858)
-.+. ...+++.|+.|++.......|..-.+.|
T Consensus 201 g~ky-~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 201 GFKY-EKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred cccc-ccCCCCCCCCCCCcccccccceeeeecc
Confidence 1112 2347889999998887777776655544
No 92
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=39.64 E-value=22 Score=28.22 Aligned_cols=19 Identities=37% Similarity=0.871 Sum_probs=9.0
Q ss_pred cCccCcccCCh-----hhHHHHHh
Q psy8286 828 CHICGKTSTTK-----SNHNKHVK 846 (858)
Q Consensus 828 C~~Cgk~F~~k-----~~L~~H~r 846 (858)
|..|++.+... +.|.+|++
T Consensus 21 C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 21 CKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred ecCCCCEeeeCCCCCcHHHHHHHH
Confidence 44444444333 35555555
No 93
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=39.22 E-value=21 Score=26.48 Aligned_cols=6 Identities=50% Similarity=1.453 Sum_probs=2.3
Q ss_pred ecCCCC
Q psy8286 799 SCTICG 804 (858)
Q Consensus 799 ~C~~Cg 804 (858)
.|+.|+
T Consensus 27 ~C~~C~ 32 (38)
T TIGR02098 27 RCGKCG 32 (38)
T ss_pred ECCCCC
Confidence 333333
No 94
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=39.13 E-value=19 Score=33.67 Aligned_cols=29 Identities=34% Similarity=0.972 Sum_probs=16.0
Q ss_pred ccCCcccccCChHHHHHHHhhcCCccCCCCCCccCCh
Q psy8286 508 RCDTCSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSK 544 (858)
Q Consensus 508 ~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~ 544 (858)
.|+.||++|-.+. .-|..|+.||..|.-.
T Consensus 11 ~Cp~CG~kFYDLn--------k~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 11 TCPSCGAKFYDLN--------KDPIVCPKCGTEFPPE 39 (108)
T ss_pred cCCCCcchhccCC--------CCCccCCCCCCccCcc
Confidence 5666666665531 2355566666655443
No 95
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=38.74 E-value=19 Score=36.31 Aligned_cols=23 Identities=52% Similarity=1.249 Sum_probs=15.8
Q ss_pred ceecCCCCcccCChhhHHHHHhhhcCCCccccCccC
Q psy8286 797 KYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICG 832 (858)
Q Consensus 797 py~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cg 832 (858)
-|.|.+||+. |-|+-|-+|++||
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCC
Confidence 5777777764 3456677777777
No 96
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=38.53 E-value=12 Score=39.34 Aligned_cols=32 Identities=31% Similarity=0.594 Sum_probs=25.7
Q ss_pred cCcccccccccccCCChhHHHHHHhhcCCCCC
Q psy8286 413 KKKKHACIYCKKTFTGWKLRKYHMKAEHPELF 444 (858)
Q Consensus 413 ~~k~f~C~~C~k~F~~~~~L~~H~~~~H~~~~ 444 (858)
.+.+|.|+.|+|.|+-....++|+..-|+++.
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~v 105 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPEKV 105 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HHHH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHHHH
Confidence 46689999999999999999999999997654
No 97
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=36.79 E-value=30 Score=37.72 Aligned_cols=78 Identities=21% Similarity=0.389 Sum_probs=45.4
Q ss_pred cccccCCCCcccChHHHHHHHHh----------hc--CCCCccccccCccccCchhhhhHHh-hhcCCcceecCCCCccc
Q psy8286 741 PFKCKSCNRGFYSSSEYQAHLRS----------HR--NIRPFTCNYCGRKFIRKFNCIRHVR-EHELKKKYSCTICGKTF 807 (858)
Q Consensus 741 pykC~~C~k~F~s~s~L~~H~r~----------H~--gekpy~C~~Cgk~F~~~~~L~~H~r-~Htgekpy~C~~Cgk~F 807 (858)
|..|+.|.........|.+-..- -. +.+.-.|-.|.-.|...- .|-. .-+....|.|+.|...|
T Consensus 322 Pi~CP~Csl~LilsthLarSyhhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~---~~~~~~~~ss~rY~Ce~CK~~F 398 (421)
T COG5151 322 PISCPICSLQLILSTHLARSYHHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPP---VSPFDESTSSGRYQCELCKSTF 398 (421)
T ss_pred CccCcchhHHHHHHHHHHHHHHhhccCcccccccCCCCCCCccceeccCCCCCCC---CCcccccccccceechhhhhhh
Confidence 67788887766555444432111 01 112234777877776432 1111 11223468999998888
Q ss_pred CChhhHHHHHhhhc
Q psy8286 808 HRSYYLTAHVRVHN 821 (858)
Q Consensus 808 ~~~s~L~~H~r~Ht 821 (858)
-.--..-.|...|.
T Consensus 399 C~dCdvfiHe~Lh~ 412 (421)
T COG5151 399 CSDCDVFIHETLHF 412 (421)
T ss_pred hhhhHHHHHHHHhh
Confidence 88888888877663
No 98
>KOG4377|consensus
Probab=36.35 E-value=21 Score=40.37 Aligned_cols=24 Identities=29% Similarity=0.553 Sum_probs=19.3
Q ss_pred cccCc--cCcccCChhhHHHHHhhhC
Q psy8286 826 FSCHI--CGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 826 y~C~~--Cgk~F~~k~~L~~H~r~HH 849 (858)
|.|.. |+..|..-+.+..|.+.|-
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKHE 427 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhhh
Confidence 45654 8999999999999988774
No 99
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=36.19 E-value=36 Score=28.23 Aligned_cols=10 Identities=40% Similarity=0.876 Sum_probs=6.6
Q ss_pred ccccCCcccc
Q psy8286 506 QRRCDTCSEV 515 (858)
Q Consensus 506 ~~~C~~C~k~ 515 (858)
.|.|+.||+.
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 4667777765
No 100
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=35.46 E-value=28 Score=34.30 Aligned_cols=15 Identities=20% Similarity=0.521 Sum_probs=8.4
Q ss_pred CCccccccCccccCc
Q psy8286 768 RPFTCNYCGRKFIRK 782 (858)
Q Consensus 768 kpy~C~~Cgk~F~~~ 782 (858)
.-|.|+.|+..|...
T Consensus 98 ~~Y~Cp~C~~~y~~~ 112 (147)
T smart00531 98 AYYKCPNCQSKYTFL 112 (147)
T ss_pred cEEECcCCCCEeeHH
Confidence 346666666665543
No 101
>KOG2807|consensus
Probab=34.91 E-value=51 Score=36.44 Aligned_cols=25 Identities=24% Similarity=0.550 Sum_probs=18.3
Q ss_pred cceecCCCCcccCChhhHHHHHhhh
Q psy8286 796 KKYSCTICGKTFHRSYYLTAHVRVH 820 (858)
Q Consensus 796 kpy~C~~Cgk~F~~~s~L~~H~r~H 820 (858)
..|.|..|.-.|-.--+.-.|...|
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhh
Confidence 4588888888887777777776665
No 102
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=34.43 E-value=29 Score=34.64 Aligned_cols=17 Identities=18% Similarity=0.168 Sum_probs=8.0
Q ss_pred CccccccCccccCchhh
Q psy8286 769 PFTCNYCGRKFIRKFNC 785 (858)
Q Consensus 769 py~C~~Cgk~F~~~~~L 785 (858)
-|.|+.|+.+|+....+
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 34455555444444444
No 103
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=34.38 E-value=31 Score=33.95 Aligned_cols=14 Identities=21% Similarity=0.624 Sum_probs=8.0
Q ss_pred cccccCCCCcccCh
Q psy8286 741 PFKCKSCNRGFYSS 754 (858)
Q Consensus 741 pykC~~C~k~F~s~ 754 (858)
-|.|+.|+..|...
T Consensus 99 ~Y~Cp~C~~~y~~~ 112 (147)
T smart00531 99 YYKCPNCQSKYTFL 112 (147)
T ss_pred EEECcCCCCEeeHH
Confidence 45666666666543
No 104
>KOG2071|consensus
Probab=33.81 E-value=53 Score=39.22 Aligned_cols=29 Identities=24% Similarity=0.401 Sum_probs=25.2
Q ss_pred cCcccccccccccCCChhHHHHHHhhcCC
Q psy8286 413 KKKKHACIYCKKTFTGWKLRKYHMKAEHP 441 (858)
Q Consensus 413 ~~k~f~C~~C~k~F~~~~~L~~H~~~~H~ 441 (858)
...+.+|..||.+|........||..|-.
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~d 443 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHDD 443 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhhh
Confidence 46678999999999999999999987643
No 105
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=33.69 E-value=29 Score=27.07 Aligned_cols=11 Identities=36% Similarity=0.960 Sum_probs=5.6
Q ss_pred eecCCCCcccC
Q psy8286 798 YSCTICGKTFH 808 (858)
Q Consensus 798 y~C~~Cgk~F~ 808 (858)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 45555555444
No 106
>PRK04023 DNA polymerase II large subunit; Validated
Probab=33.05 E-value=39 Score=42.70 Aligned_cols=11 Identities=27% Similarity=0.706 Sum_probs=6.4
Q ss_pred eecCCCCcccC
Q psy8286 798 YSCTICGKTFH 808 (858)
Q Consensus 798 y~C~~Cgk~F~ 808 (858)
+.|+.||..-.
T Consensus 664 y~CPKCG~El~ 674 (1121)
T PRK04023 664 DECEKCGREPT 674 (1121)
T ss_pred CcCCCCCCCCC
Confidence 55777765443
No 107
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=32.41 E-value=28 Score=25.84 Aligned_cols=8 Identities=25% Similarity=1.132 Sum_probs=3.0
Q ss_pred cCcccccc
Q psy8286 714 CTVCKKSF 721 (858)
Q Consensus 714 C~~C~ksF 721 (858)
|+.|+..|
T Consensus 5 Cp~C~~~y 12 (36)
T PF13717_consen 5 CPNCQAKY 12 (36)
T ss_pred CCCCCCEE
Confidence 33333333
No 108
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=32.31 E-value=39 Score=33.75 Aligned_cols=39 Identities=15% Similarity=0.255 Sum_probs=30.1
Q ss_pred HhhhcCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccC
Q psy8286 789 VREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTST 836 (858)
Q Consensus 789 ~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~ 836 (858)
+.......-|.|+.|+..|+.-.++. .-|.|+.||....
T Consensus 101 l~~e~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L~ 139 (158)
T TIGR00373 101 LEFETNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAMLD 139 (158)
T ss_pred HhhccCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEee
Confidence 33445566799999999999888885 2599999997543
No 109
>PRK04023 DNA polymerase II large subunit; Validated
Probab=32.24 E-value=47 Score=42.03 Aligned_cols=54 Identities=24% Similarity=0.627 Sum_probs=36.8
Q ss_pred CCcccccCCcccccCChHHHHHHHhhcCCccCCCCCCccCChhHHHHHHHHhCCCCccccCCCCcccccchhhhcccccC
Q psy8286 503 KNKQRRCDTCSEVFDKLKDYQEHIKNVHPLPCPVCSKTFPSKPSLTLHMRTHLKIKPYKCDVCDKSFVTAQKLSDHKNTH 582 (858)
Q Consensus 503 ~~k~~~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~k~F~s~~~L~~H~r~H~~ek~ykC~~C~k~F~s~~~L~~H~~~H 582 (858)
......|+.||... ..+.|+.||.. ....+.|+.|+...
T Consensus 623 EVg~RfCpsCG~~t-------------~~frCP~CG~~---------------Te~i~fCP~CG~~~------------- 661 (1121)
T PRK04023 623 EIGRRKCPSCGKET-------------FYRRCPFCGTH---------------TEPVYRCPRCGIEV------------- 661 (1121)
T ss_pred cccCccCCCCCCcC-------------CcccCCCCCCC---------------CCcceeCccccCcC-------------
Confidence 44567899999884 23779999877 23457799885432
Q ss_pred CCCCCccCCCCCccccCc
Q psy8286 583 TGNAPFPCNMCDRHFKRY 600 (858)
Q Consensus 583 ~~ekpy~C~~C~k~F~s~ 600 (858)
.++.|+.|+..-...
T Consensus 662 ---~~y~CPKCG~El~~~ 676 (1121)
T PRK04023 662 ---EEDECEKCGREPTPY 676 (1121)
T ss_pred ---CCCcCCCCCCCCCcc
Confidence 236799998765443
No 110
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=31.36 E-value=30 Score=27.39 Aligned_cols=26 Identities=27% Similarity=0.681 Sum_probs=20.8
Q ss_pred ccccccccccCCCh-----hHHHHHHhhcCC
Q psy8286 416 KHACIYCKKTFTGW-----KLRKYHMKAEHP 441 (858)
Q Consensus 416 ~f~C~~C~k~F~~~-----~~L~~H~~~~H~ 441 (858)
.-.|.+|++.+... +.|.+|++..|+
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~ 48 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKHP 48 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHCc
Confidence 45799999998765 699999996553
No 111
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=31.17 E-value=23 Score=30.69 Aligned_cols=7 Identities=43% Similarity=0.762 Sum_probs=2.9
Q ss_pred CCCcccC
Q psy8286 802 ICGKTFH 808 (858)
Q Consensus 802 ~Cgk~F~ 808 (858)
.||.+|.
T Consensus 34 eCg~tF~ 40 (72)
T PRK09678 34 NCSATFI 40 (72)
T ss_pred CCCCEEE
Confidence 4444443
No 112
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=31.15 E-value=33 Score=32.93 Aligned_cols=15 Identities=33% Similarity=0.660 Sum_probs=7.5
Q ss_pred CCccccccCccccCc
Q psy8286 768 RPFTCNYCGRKFIRK 782 (858)
Q Consensus 768 kpy~C~~Cgk~F~~~ 782 (858)
.|..|++||..|.-.
T Consensus 25 ~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 25 RPAVSPYTGEQFPPE 39 (129)
T ss_pred CCccCCCcCCccCcc
Confidence 455555555554433
No 113
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=30.89 E-value=28 Score=27.63 Aligned_cols=30 Identities=33% Similarity=0.781 Sum_probs=17.5
Q ss_pred ccccCCCCcccccchhhhcccccCCCCCCccCCCCCc
Q psy8286 559 PYKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCDR 595 (858)
Q Consensus 559 ~ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~k 595 (858)
.|+|..|+..|.....+.. ...-.|+.|+.
T Consensus 5 ey~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (52)
T TIGR02605 5 EYRCTACGHRFEVLQKMSD-------DPLATCPECGG 34 (52)
T ss_pred EEEeCCCCCEeEEEEecCC-------CCCCCCCCCCC
Confidence 4778888877764322211 23456888875
No 114
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=30.03 E-value=20 Score=37.70 Aligned_cols=30 Identities=27% Similarity=0.637 Sum_probs=19.8
Q ss_pred CCcceecCCCCcccCChhhHHHHHhhhcCC
Q psy8286 794 LKKKYSCTICGKTFHRSYYLTAHVRVHNGE 823 (858)
Q Consensus 794 gekpy~C~~Cgk~F~~~s~L~~H~r~Htge 823 (858)
.+..|.|..|+|.|.-......|+..-|.+
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 344688888888888888888887665554
No 115
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=29.81 E-value=30 Score=33.33 Aligned_cols=24 Identities=25% Similarity=0.336 Sum_probs=14.2
Q ss_pred ccCccCcccCChhhHHHHHhhhCCCCC
Q psy8286 827 SCHICGKTSTTKSNHNKHVKTHHARET 853 (858)
Q Consensus 827 ~C~~Cgk~F~~k~~L~~H~r~HH~~ep 853 (858)
.|-.+|+.| ..|++|+.+|++.-|
T Consensus 78 icLEDGkkf---KSLKRHL~t~~gmTP 101 (148)
T COG4957 78 ICLEDGKKF---KSLKRHLTTHYGLTP 101 (148)
T ss_pred EEeccCcch---HHHHHHHhcccCCCH
Confidence 466666666 356666666665433
No 116
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=29.74 E-value=36 Score=34.74 Aligned_cols=16 Identities=25% Similarity=0.420 Sum_probs=7.7
Q ss_pred CccccccCccccCchh
Q psy8286 769 PFTCNYCGRKFIRKFN 784 (858)
Q Consensus 769 py~C~~Cgk~F~~~~~ 784 (858)
-|.|+.|+.+|.....
T Consensus 117 ~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPNCHIRFTFDEA 132 (178)
T ss_pred EEECCCCCcEEeHHHH
Confidence 3555555555544443
No 117
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=29.68 E-value=51 Score=36.05 Aligned_cols=24 Identities=25% Similarity=0.580 Sum_probs=14.3
Q ss_pred cccccCCCCcccChHHHHHHHHhh
Q psy8286 741 PFKCKSCNRGFYSSSEYQAHLRSH 764 (858)
Q Consensus 741 pykC~~C~k~F~s~s~L~~H~r~H 764 (858)
.|+|+.|...|-.--+.-.|...|
T Consensus 388 rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 388 RYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ceechhhhhhhhhhhHHHHHHHHh
Confidence 466666666666655555555554
No 118
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=29.27 E-value=33 Score=25.55 Aligned_cols=30 Identities=33% Similarity=0.796 Sum_probs=16.8
Q ss_pred ccccCCCCcccccchhhhcccccCCCCCCccCCCCCc
Q psy8286 559 PYKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCDR 595 (858)
Q Consensus 559 ~ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~k 595 (858)
.|+|..|+..|........ ...-.|+.|+.
T Consensus 5 ~y~C~~Cg~~fe~~~~~~~-------~~~~~CP~Cg~ 34 (41)
T smart00834 5 EYRCEDCGHTFEVLQKISD-------DPLATCPECGG 34 (41)
T ss_pred EEEcCCCCCEEEEEEecCC-------CCCCCCCCCCC
Confidence 4677777777654332211 23456777775
No 119
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=29.02 E-value=22 Score=27.70 Aligned_cols=11 Identities=36% Similarity=1.174 Sum_probs=5.3
Q ss_pred eecCCCCcccC
Q psy8286 798 YSCTICGKTFH 808 (858)
Q Consensus 798 y~C~~Cgk~F~ 808 (858)
|.|..||..|.
T Consensus 4 y~C~~CG~~~~ 14 (46)
T PRK00398 4 YKCARCGREVE 14 (46)
T ss_pred EECCCCCCEEE
Confidence 45555554443
No 120
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=28.98 E-value=13 Score=28.58 Aligned_cols=10 Identities=20% Similarity=0.345 Sum_probs=4.2
Q ss_pred ccccCCCccc
Q psy8286 472 VSQAVCPIKF 481 (858)
Q Consensus 472 ~~C~~C~~~F 481 (858)
|.|..|+..|
T Consensus 6 y~C~~Cg~~f 15 (42)
T PF09723_consen 6 YRCEECGHEF 15 (42)
T ss_pred EEeCCCCCEE
Confidence 3444444444
No 121
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=27.92 E-value=84 Score=24.46 Aligned_cols=11 Identities=36% Similarity=1.102 Sum_probs=6.5
Q ss_pred CcceecCCCCc
Q psy8286 795 KKKYSCTICGK 805 (858)
Q Consensus 795 ekpy~C~~Cgk 805 (858)
...|+|..|++
T Consensus 35 ~~~~~C~~C~~ 45 (46)
T PF12760_consen 35 RGRYRCKACRK 45 (46)
T ss_pred CCeEECCCCCC
Confidence 45566666654
No 122
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=27.52 E-value=40 Score=32.37 Aligned_cols=30 Identities=20% Similarity=0.324 Sum_probs=17.3
Q ss_pred ccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCCh
Q psy8286 770 FTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRS 810 (858)
Q Consensus 770 y~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~ 810 (858)
..|+.||++|... +..|-.|+.||..|.-.
T Consensus 10 r~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 4566666666532 22456666666666544
No 123
>KOG2071|consensus
Probab=27.32 E-value=41 Score=40.14 Aligned_cols=27 Identities=37% Similarity=0.639 Sum_probs=22.2
Q ss_pred CccccccCCCCcccChHHHHHHHHhhc
Q psy8286 739 HMPFKCKSCNRGFYSSSEYQAHLRSHR 765 (858)
Q Consensus 739 ~kpykC~~C~k~F~s~s~L~~H~r~H~ 765 (858)
.++-.|..||.+|........||..|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 456789999999999988888887764
No 124
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=26.95 E-value=47 Score=33.89 Aligned_cols=35 Identities=23% Similarity=0.437 Sum_probs=27.4
Q ss_pred cCCcceecCCCCcccCChhhHHHHHhhhcCCCccccCccCcccC
Q psy8286 793 ELKKKYSCTICGKTFHRSYYLTAHVRVHNGERPFSCHICGKTST 836 (858)
Q Consensus 793 tgekpy~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~~Cgk~F~ 836 (858)
....-|.|+.|+..|+.-..+. .-|.|+.||....
T Consensus 113 ~~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 113 ENNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred cCCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 3445699999999999887763 3699999997554
No 125
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=26.35 E-value=37 Score=27.19 Aligned_cols=11 Identities=36% Similarity=1.198 Sum_probs=6.3
Q ss_pred ceecCCCCccc
Q psy8286 797 KYSCTICGKTF 807 (858)
Q Consensus 797 py~C~~Cgk~F 807 (858)
.|.|..||+.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 35566666555
No 126
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=26.13 E-value=20 Score=31.08 Aligned_cols=42 Identities=21% Similarity=0.370 Sum_probs=27.2
Q ss_pred eecCCCCcccCChhhHHHHHhhhcCCCccccC--ccCcccCChhhH
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRVHNGERPFSCH--ICGKTSTTKSNH 841 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~Htgekpy~C~--~Cgk~F~~k~~L 841 (858)
+.|+.||....-...-..+.. ..++-++|. .||..|.....+
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEEE
Confidence 578888876643333222222 456778998 899999875543
No 127
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=24.78 E-value=51 Score=22.63 Aligned_cols=20 Identities=20% Similarity=0.355 Sum_probs=16.8
Q ss_pred cccccccccCCChhHHHHHHh
Q psy8286 417 HACIYCKKTFTGWKLRKYHMK 437 (858)
Q Consensus 417 f~C~~C~k~F~~~~~L~~H~~ 437 (858)
..|+.|++.+ ....+..|+.
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3699999999 6688889986
No 128
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=24.63 E-value=49 Score=26.83 Aligned_cols=20 Identities=25% Similarity=0.428 Sum_probs=9.1
Q ss_pred hhHHHHHhhhcCCCccccCc
Q psy8286 811 YYLTAHVRVHNGERPFSCHI 830 (858)
Q Consensus 811 s~L~~H~r~Htgekpy~C~~ 830 (858)
..|..|+...-..++..|.+
T Consensus 24 ~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 24 KELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHccCCCCcEECCC
Confidence 34555555444444445555
No 129
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=24.43 E-value=48 Score=33.44 Aligned_cols=24 Identities=33% Similarity=0.854 Sum_probs=18.9
Q ss_pred ccccCCCCcccccchhhhcccccCCCCCCccCCCCCc
Q psy8286 559 PYKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCDR 595 (858)
Q Consensus 559 ~ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~k 595 (858)
.|.|.+||+.+ -++.|-.|++|+.
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCCC
Confidence 68999998754 4567889999983
No 130
>KOG1280|consensus
Probab=24.42 E-value=50 Score=36.76 Aligned_cols=35 Identities=23% Similarity=0.619 Sum_probs=23.2
Q ss_pred CcccccCCcccccCChHHHHHHHhhcCC-----ccCCCCC
Q psy8286 504 NKQRRCDTCSEVFDKLKDYQEHIKNVHP-----LPCPVCS 538 (858)
Q Consensus 504 ~k~~~C~~C~k~F~~~~~L~~H~k~~H~-----~~C~~C~ 538 (858)
...|.|++|++.--+...|..|+...|+ ..|++|.
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 3467777777777777777777776663 3455553
No 131
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=23.58 E-value=40 Score=32.72 Aligned_cols=23 Identities=43% Similarity=0.881 Sum_probs=12.0
Q ss_pred eecCCCCcccCChhhHHHHHhhhcCC
Q psy8286 798 YSCTICGKTFHRSYYLTAHVRVHNGE 823 (858)
Q Consensus 798 y~C~~Cgk~F~~~s~L~~H~r~Htge 823 (858)
-.|-+||+.|.. |++|++.|+|-
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S
T ss_pred eEEccCCcccch---HHHHHHHccCC
Confidence 567777776653 36777777663
No 132
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=23.53 E-value=58 Score=27.06 Aligned_cols=8 Identities=50% Similarity=1.439 Sum_probs=3.5
Q ss_pred ceecCCCC
Q psy8286 797 KYSCTICG 804 (858)
Q Consensus 797 py~C~~Cg 804 (858)
+|.|+.||
T Consensus 50 ~Y~Cp~CG 57 (61)
T COG2888 50 PYRCPKCG 57 (61)
T ss_pred ceECCCcC
Confidence 34444444
No 133
>PF15269 zf-C2H2_7: Zinc-finger
Probab=23.48 E-value=50 Score=25.75 Aligned_cols=22 Identities=32% Similarity=0.549 Sum_probs=19.1
Q ss_pred ceecCCCCcccCChhhHHHHHh
Q psy8286 797 KYSCTICGKTFHRSYYLTAHVR 818 (858)
Q Consensus 797 py~C~~Cgk~F~~~s~L~~H~r 818 (858)
.|+|-.|.+...-++.|..||+
T Consensus 20 ~ykcfqcpftc~~kshl~nhmk 41 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMK 41 (54)
T ss_pred cceeecCCcccchHHHHHHHHH
Confidence 3789999999999999999986
No 134
>KOG1044|consensus
Probab=23.35 E-value=21 Score=42.17 Aligned_cols=42 Identities=19% Similarity=0.372 Sum_probs=20.5
Q ss_pred ccccccccccchhhhhhhhcccCCCCccccCccccccccchhh
Q psy8286 685 VNCDICKIVVLKSTLKSHMRIHDNAKPFSCTVCKKSFYTKWNL 727 (858)
Q Consensus 685 ~~C~~C~~~~~~s~L~~H~~~H~~~k~y~C~~C~ksF~s~~~L 727 (858)
.+|..|.+++...-|..-- .|--..--.|..|+..|..-..+
T Consensus 193 vkc~~c~~fisgkvLqag~-kh~HPtCARCsRCgqmF~eGEEM 234 (670)
T KOG1044|consen 193 VKCEECEKFISGKVLQAGD-KHFHPTCARCSRCGQMFGEGEEM 234 (670)
T ss_pred eehHHhhhhhhhhhhhccC-cccCcchhhhhhhccccccchhe
Confidence 4677776665533333211 11111122477777777665544
No 135
>KOG2593|consensus
Probab=23.12 E-value=72 Score=36.72 Aligned_cols=38 Identities=18% Similarity=0.475 Sum_probs=21.0
Q ss_pred cCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCc
Q psy8286 765 RNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGK 805 (858)
Q Consensus 765 ~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk 805 (858)
+....|.|+.|.++|.....++- .-.....|.|..|+-
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGG 161 (436)
T ss_pred cccccccCCccccchhhhHHHHh---hcccCceEEEecCCC
Confidence 34455777777777765544432 122234577777754
No 136
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=22.87 E-value=59 Score=29.15 Aligned_cols=13 Identities=31% Similarity=0.879 Sum_probs=7.9
Q ss_pred ceecCCCCcccCC
Q psy8286 797 KYSCTICGKTFHR 809 (858)
Q Consensus 797 py~C~~Cgk~F~~ 809 (858)
-+.|..||..|.-
T Consensus 53 IW~C~kCg~~fAG 65 (89)
T COG1997 53 IWKCRKCGAKFAG 65 (89)
T ss_pred eEEcCCCCCeecc
Confidence 4666666666653
No 137
>KOG2593|consensus
Probab=22.30 E-value=39 Score=38.75 Aligned_cols=34 Identities=18% Similarity=0.423 Sum_probs=18.3
Q ss_pred cccccCCcccccCChHHHHHHHhhcCCccCCCCC
Q psy8286 505 KQRRCDTCSEVFDKLKDYQEHIKNVHPLPCPVCS 538 (858)
Q Consensus 505 k~~~C~~C~k~F~~~~~L~~H~k~~H~~~C~~C~ 538 (858)
..|.|+.|.+.|.....++-=--..-.|.|..|+
T Consensus 127 ~~Y~Cp~C~kkyt~Lea~~L~~~~~~~F~C~~C~ 160 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSLEALQLLDNETGEFHCENCG 160 (436)
T ss_pred ccccCCccccchhhhHHHHhhcccCceEEEecCC
Confidence 4466666666666665554322112346666665
No 138
>PHA00626 hypothetical protein
Probab=22.26 E-value=43 Score=27.46 Aligned_cols=12 Identities=33% Similarity=0.731 Sum_probs=6.0
Q ss_pred ceecCCCCcccC
Q psy8286 797 KYSCTICGKTFH 808 (858)
Q Consensus 797 py~C~~Cgk~F~ 808 (858)
.|+|..||+.|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 355555555544
No 139
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=22.20 E-value=57 Score=22.50 Aligned_cols=10 Identities=30% Similarity=0.776 Sum_probs=5.4
Q ss_pred ccCCcccccC
Q psy8286 508 RCDTCSEVFD 517 (858)
Q Consensus 508 ~C~~C~k~F~ 517 (858)
.|+.||..|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4555555553
No 140
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=21.75 E-value=48 Score=39.55 Aligned_cols=47 Identities=28% Similarity=0.739 Sum_probs=0.0
Q ss_pred cccCccccccccchhhcchhhhcccCCCccccccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhh
Q psy8286 712 FSCTVCKKSFYTKWNLGLHQWTHVSRMHMPFKCKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVRE 791 (858)
Q Consensus 712 y~C~~C~ksF~s~~~L~~H~~~H~~~~~kpykC~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~ 791 (858)
..|..||. ..+|+.|+-. +..|.......|.+||....
T Consensus 214 ~~C~~Cg~---------------------~~~C~~C~~~----------l~~h~~~~~l~Ch~Cg~~~~----------- 251 (505)
T TIGR00595 214 LLCRSCGY---------------------ILCCPNCDVS----------LTYHKKEGKLRCHYCGYQEP----------- 251 (505)
T ss_pred eEhhhCcC---------------------ccCCCCCCCc----------eEEecCCCeEEcCCCcCcCC-----------
Q ss_pred hcCCcceecCCCC
Q psy8286 792 HELKKKYSCTICG 804 (858)
Q Consensus 792 Htgekpy~C~~Cg 804 (858)
-|..|+.||
T Consensus 252 ----~~~~Cp~C~ 260 (505)
T TIGR00595 252 ----IPKTCPQCG 260 (505)
T ss_pred ----CCCCCCCCC
No 141
>KOG1701|consensus
Probab=21.72 E-value=30 Score=39.50 Aligned_cols=14 Identities=43% Similarity=0.980 Sum_probs=10.0
Q ss_pred ccccccccCCChhH
Q psy8286 418 ACIYCKKTFTGWKL 431 (858)
Q Consensus 418 ~C~~C~k~F~~~~~ 431 (858)
.|-.|+|.-.-...
T Consensus 276 iC~~C~K~V~g~~~ 289 (468)
T KOG1701|consen 276 ICAFCHKTVSGQGL 289 (468)
T ss_pred hhhhcCCcccCcch
Confidence 79999988654443
No 142
>PF13389 DUF4107: Protein of unknown function (DUF4107)
Probab=21.06 E-value=38 Score=32.47 Aligned_cols=28 Identities=43% Similarity=0.602 Sum_probs=24.0
Q ss_pred CCcccccchhhhhhccccchhhhhhcccccccc
Q psy8286 108 TSTSNFLDYNAAQRGKQRKTVKILKIKNAKTRE 140 (858)
Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 140 (858)
|.-||+|-| |+||+|++|-+.+..+|+.
T Consensus 130 tGGSNWyHY-----Grr~k~aK~e~~~K~atKk 157 (158)
T PF13389_consen 130 TGGSNWYHY-----GRRRKLAKIEKDKKKATKK 157 (158)
T ss_pred cCCccceec-----chhhhhHHHHHHHHHhhcc
Confidence 778999999 9999999998877666664
No 143
>PRK14873 primosome assembly protein PriA; Provisional
Probab=21.02 E-value=53 Score=40.50 Aligned_cols=44 Identities=23% Similarity=0.648 Sum_probs=0.0
Q ss_pred ccCCCCcccChHHHHHHHHhhcCCCCccccccCccccCchhhhhHHhhhcCCcceecCCCCcccCChhhHHHHHhhhcCC
Q psy8286 744 CKSCNRGFYSSSEYQAHLRSHRNIRPFTCNYCGRKFIRKFNCIRHVREHELKKKYSCTICGKTFHRSYYLTAHVRVHNGE 823 (858)
Q Consensus 744 C~~C~k~F~s~s~L~~H~r~H~gekpy~C~~Cgk~F~~~~~L~~H~r~Htgekpy~C~~Cgk~F~~~s~L~~H~r~Htge 823 (858)
|..|+..+ .|+.|+ ..+..|.......|..||+ ..
T Consensus 386 C~~Cg~~~-------------------~C~~C~----------~~L~~h~~~~~l~Ch~CG~----------------~~ 420 (665)
T PRK14873 386 CARCRTPA-------------------RCRHCT----------GPLGLPSAGGTPRCRWCGR----------------AA 420 (665)
T ss_pred hhhCcCee-------------------ECCCCC----------CceeEecCCCeeECCCCcC----------------CC
Q ss_pred CccccCccC
Q psy8286 824 RPFSCHICG 832 (858)
Q Consensus 824 kpy~C~~Cg 832 (858)
.|..|+.||
T Consensus 421 ~p~~Cp~Cg 429 (665)
T PRK14873 421 PDWRCPRCG 429 (665)
T ss_pred cCccCCCCc
No 144
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=20.67 E-value=26 Score=32.17 Aligned_cols=10 Identities=20% Similarity=0.883 Sum_probs=4.5
Q ss_pred ccCCCCCccc
Q psy8286 588 FPCNMCDRHF 597 (858)
Q Consensus 588 y~C~~C~k~F 597 (858)
|.|..|+..+
T Consensus 91 y~C~~C~~~w 100 (104)
T TIGR01384 91 YKCTKCGYVW 100 (104)
T ss_pred EEeCCCCCee
Confidence 4444444433
No 145
>PF15269 zf-C2H2_7: Zinc-finger
Probab=20.66 E-value=58 Score=25.39 Aligned_cols=22 Identities=32% Similarity=0.540 Sum_probs=20.2
Q ss_pred cccCccCcccCChhhHHHHHhh
Q psy8286 826 FSCHICGKTSTTKSNHNKHVKT 847 (858)
Q Consensus 826 y~C~~Cgk~F~~k~~L~~H~r~ 847 (858)
|+|-.|.....-++.|-.||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7899999999999999999974
No 146
>KOG4124|consensus
Probab=20.59 E-value=51 Score=36.68 Aligned_cols=29 Identities=3% Similarity=-0.236 Sum_probs=22.9
Q ss_pred ccCcccccccccccCCChhHHHHHHhhcCC
Q psy8286 412 AKKKKHACIYCKKTFTGWKLRKYHMKAEHP 441 (858)
Q Consensus 412 ~~~k~f~C~~C~k~F~~~~~L~~H~~~~H~ 441 (858)
+-.++|+| .|++...+...|+.|-..-|.
T Consensus 209 t~~~p~k~-~~~~~~~T~~~l~~HS~N~~~ 237 (442)
T KOG4124|consen 209 TTGTPKKM-PESLVMDTSSPLSDHSMNIDV 237 (442)
T ss_pred cccCCccC-cccccccccchhhhccccCCC
Confidence 34678988 489999999999999765554
No 147
>KOG2807|consensus
Probab=20.34 E-value=1.3e+02 Score=33.32 Aligned_cols=31 Identities=26% Similarity=0.703 Sum_probs=23.8
Q ss_pred CccccCCCCcccccchhhhcccccCCCCCCccCCCCC
Q psy8286 558 KPYKCDVCDKSFVTAQKLSDHKNTHTGNAPFPCNMCD 594 (858)
Q Consensus 558 k~ykC~~C~k~F~s~~~L~~H~~~H~~ekpy~C~~C~ 594 (858)
..|+|..|...|-..-..-.|...| .|+.|.
T Consensus 344 ~~y~C~~Ck~~FCldCDv~iHesLh------~CpgCe 374 (378)
T KOG2807|consen 344 GRYRCESCKNVFCLDCDVFIHESLH------NCPGCE 374 (378)
T ss_pred CcEEchhccceeeccchHHHHhhhh------cCCCcC
Confidence 5688999999988887777787665 477675
No 148
>KOG4167|consensus
Probab=20.09 E-value=24 Score=42.68 Aligned_cols=25 Identities=40% Similarity=0.712 Sum_probs=20.4
Q ss_pred ccccCccCcccCChhhHHHHHhhhC
Q psy8286 825 PFSCHICGKTSTTKSNHNKHVKTHH 849 (858)
Q Consensus 825 py~C~~Cgk~F~~k~~L~~H~r~HH 849 (858)
-|.|..|+|.|..-..+..||++|-
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 3788888888888888888888874
Done!