Query psy13406
Match_columns 1043
No_of_seqs 652 out of 4372
Neff 10.5
Searched_HMMs 46136
Date Fri Aug 16 18:40:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13406.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13406hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 2.4E-34 5.1E-39 310.8 9.8 237 713-952 604-939 (958)
2 KOG1074|consensus 100.0 1.3E-31 2.8E-36 289.8 7.7 56 713-768 878-933 (958)
3 KOG2462|consensus 99.9 1.1E-28 2.3E-33 235.7 6.1 136 776-941 129-264 (279)
4 KOG2462|consensus 99.9 1.5E-28 3.2E-33 234.7 6.8 139 738-913 126-264 (279)
5 KOG3608|consensus 99.9 7E-27 1.5E-31 227.3 13.6 281 62-345 36-379 (467)
6 KOG3608|consensus 99.9 1.5E-26 3.3E-31 225.0 13.3 226 713-949 133-381 (467)
7 KOG3623|consensus 99.9 7E-25 1.5E-29 232.9 22.0 85 707-798 887-971 (1007)
8 KOG3623|consensus 99.9 3.5E-25 7.5E-30 235.2 11.8 114 713-829 209-330 (1007)
9 KOG3576|consensus 99.7 1.7E-18 3.8E-23 155.5 2.7 121 808-945 117-237 (267)
10 KOG3576|consensus 99.7 5E-18 1.1E-22 152.6 3.3 112 120-231 115-237 (267)
11 PLN03086 PRLI-interacting fact 99.3 4E-12 8.6E-17 139.8 8.8 145 779-945 409-565 (567)
12 PLN03086 PRLI-interacting fact 99.2 3.9E-11 8.5E-16 132.1 9.3 141 95-254 408-560 (567)
13 PHA00733 hypothetical protein 99.0 1.3E-10 2.9E-15 103.6 3.9 55 889-945 70-124 (128)
14 PHA00733 hypothetical protein 99.0 2.9E-10 6.4E-15 101.5 4.8 54 862-917 71-124 (128)
15 KOG1146|consensus 99.0 2.7E-10 5.7E-15 132.8 5.4 122 809-946 1229-1354(1406)
16 PHA02768 hypothetical protein; 98.8 1.4E-09 3E-14 77.8 1.2 43 892-936 5-47 (55)
17 KOG3993|consensus 98.8 1.6E-09 3.4E-14 110.2 1.0 49 264-312 432-480 (500)
18 KOG3993|consensus 98.8 1.4E-09 3E-14 110.7 0.3 190 713-916 266-482 (500)
19 PHA02768 hypothetical protein; 98.7 4.8E-09 1E-13 75.1 1.6 42 865-908 6-47 (55)
20 PF13465 zf-H2C2_2: Zinc-finge 98.6 2.9E-08 6.3E-13 60.6 1.9 25 907-931 1-25 (26)
21 KOG1146|consensus 98.5 4.7E-08 1E-12 114.6 4.5 125 778-918 1229-1354(1406)
22 PF13465 zf-H2C2_2: Zinc-finge 98.4 3E-07 6.5E-12 56.2 2.5 26 729-754 1-26 (26)
23 PHA00616 hypothetical protein 98.3 3.2E-07 6.9E-12 62.3 2.4 34 892-925 1-34 (44)
24 PHA00616 hypothetical protein 98.2 6.5E-07 1.4E-11 60.8 2.0 32 865-896 2-33 (44)
25 PHA00732 hypothetical protein 98.0 2.3E-06 5E-11 68.6 1.8 45 864-914 1-46 (79)
26 PHA00732 hypothetical protein 97.9 8.2E-06 1.8E-10 65.5 3.4 45 151-201 2-47 (79)
27 PF05605 zf-Di19: Drought indu 97.8 3.9E-05 8.4E-10 57.2 4.8 49 865-916 3-53 (54)
28 PF05605 zf-Di19: Drought indu 97.7 4.9E-05 1.1E-09 56.6 3.9 51 892-945 2-54 (54)
29 COG5189 SFP1 Putative transcri 97.6 3.4E-05 7.3E-10 76.0 2.3 25 54-78 347-373 (423)
30 KOG2231|consensus 97.5 0.00061 1.3E-08 77.0 11.3 139 743-914 100-261 (669)
31 COG5189 SFP1 Putative transcri 97.4 3.5E-05 7.6E-10 75.9 0.3 24 862-885 347-372 (423)
32 PF00096 zf-C2H2: Zinc finger, 97.4 0.00014 3.1E-09 43.2 2.5 22 57-78 1-22 (23)
33 PF00096 zf-C2H2: Zinc finger, 97.4 0.00011 2.3E-09 43.7 2.0 23 715-737 1-23 (23)
34 PF13894 zf-C2H2_4: C2H2-type 97.2 0.00027 5.9E-09 42.6 2.1 24 921-944 1-24 (24)
35 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00031 6.7E-09 61.4 2.8 75 866-945 1-75 (100)
36 KOG2231|consensus 97.1 0.0017 3.8E-08 73.4 8.7 143 715-888 100-263 (669)
37 PF12756 zf-C2H2_2: C2H2 type 97.0 0.00055 1.2E-08 59.8 2.9 72 97-173 2-73 (100)
38 PF13894 zf-C2H2_4: C2H2-type 96.9 0.00079 1.7E-08 40.5 2.6 23 57-79 1-23 (24)
39 PF13912 zf-C2H2_6: C2H2-type 96.8 0.00063 1.4E-08 42.3 1.6 25 920-944 1-25 (27)
40 PF13912 zf-C2H2_6: C2H2-type 96.8 0.00089 1.9E-08 41.6 2.0 23 151-173 2-24 (27)
41 COG5236 Uncharacterized conser 96.7 0.0019 4.1E-08 64.8 4.9 124 814-947 159-308 (493)
42 COG5236 Uncharacterized conser 96.4 0.0021 4.5E-08 64.5 2.7 69 180-256 222-303 (493)
43 PF09237 GAGA: GAGA factor; I 96.3 0.0023 5E-08 44.6 1.8 24 893-916 25-48 (54)
44 PF09237 GAGA: GAGA factor; I 96.2 0.0063 1.4E-07 42.5 3.3 32 711-742 21-52 (54)
45 smart00355 ZnF_C2H2 zinc finge 95.8 0.0097 2.1E-07 36.4 2.8 19 867-885 3-21 (26)
46 smart00355 ZnF_C2H2 zinc finge 95.8 0.0081 1.8E-07 36.8 2.4 24 837-860 1-24 (26)
47 KOG2785|consensus 95.8 0.018 3.8E-07 59.8 6.1 75 417-510 165-242 (390)
48 PF13909 zf-H2C2_5: C2H2-type 95.6 0.0081 1.8E-07 36.0 1.8 23 921-944 1-23 (24)
49 PF13909 zf-H2C2_5: C2H2-type 95.6 0.0096 2.1E-07 35.6 2.0 23 319-342 1-23 (24)
50 KOG2785|consensus 95.2 0.06 1.3E-06 56.0 7.7 68 714-798 3-89 (390)
51 PRK04860 hypothetical protein; 95.2 0.005 1.1E-07 57.5 -0.0 37 864-904 119-155 (160)
52 PF12874 zf-met: Zinc-finger o 95.2 0.014 3E-07 35.4 1.8 20 152-171 2-21 (25)
53 PF12874 zf-met: Zinc-finger o 95.2 0.015 3.3E-07 35.2 2.0 22 57-78 1-22 (25)
54 KOG2482|consensus 95.2 0.029 6.2E-07 56.7 4.9 133 807-942 143-356 (423)
55 COG5048 FOG: Zn-finger [Genera 95.1 0.0076 1.6E-07 70.0 1.1 149 177-325 288-453 (467)
56 COG5048 FOG: Zn-finger [Genera 95.1 0.0086 1.9E-07 69.6 1.3 149 149-297 288-453 (467)
57 PRK04860 hypothetical protein; 94.7 0.021 4.5E-07 53.4 2.5 35 179-217 120-154 (160)
58 KOG2482|consensus 94.4 0.085 1.8E-06 53.5 6.0 73 727-799 128-217 (423)
59 PF12171 zf-C2H2_jaz: Zinc-fin 93.8 0.034 7.4E-07 34.4 1.3 22 95-116 2-23 (27)
60 PF12171 zf-C2H2_jaz: Zinc-fin 93.4 0.051 1.1E-06 33.6 1.6 21 207-227 2-22 (27)
61 KOG4173|consensus 92.1 0.075 1.6E-06 49.6 1.6 81 51-144 74-168 (253)
62 KOG4173|consensus 91.4 0.2 4.4E-06 46.9 3.5 77 834-913 77-167 (253)
63 KOG2893|consensus 90.9 0.083 1.8E-06 50.4 0.6 44 152-199 12-55 (341)
64 PF13913 zf-C2HC_2: zinc-finge 90.6 0.24 5.2E-06 29.8 2.2 21 715-736 3-23 (25)
65 PF13913 zf-C2HC_2: zinc-finge 90.4 0.22 4.7E-06 30.0 1.8 21 921-942 3-23 (25)
66 KOG2893|consensus 90.3 0.08 1.7E-06 50.5 -0.1 50 716-769 12-62 (341)
67 TIGR00622 ssl1 transcription f 89.8 0.62 1.3E-05 40.0 4.8 24 178-201 81-104 (112)
68 smart00451 ZnF_U1 U1-like zinc 87.0 0.42 9E-06 31.7 1.7 23 920-942 3-25 (35)
69 smart00451 ZnF_U1 U1-like zinc 86.4 0.53 1.2E-05 31.2 1.9 22 94-115 3-24 (35)
70 COG4049 Uncharacterized protei 85.6 0.45 9.7E-06 34.0 1.2 30 916-945 13-42 (65)
71 TIGR00622 ssl1 transcription f 83.0 1.2 2.6E-05 38.3 3.0 20 775-794 13-32 (112)
72 cd00350 rubredoxin_like Rubred 82.2 1.1 2.4E-05 29.2 1.9 8 180-187 3-10 (33)
73 cd00350 rubredoxin_like Rubred 81.9 0.84 1.8E-05 29.8 1.3 12 207-218 2-13 (33)
74 COG4049 Uncharacterized protei 81.0 0.94 2E-05 32.4 1.3 30 888-917 13-42 (65)
75 KOG4167|consensus 80.4 0.44 9.5E-06 53.9 -0.6 27 92-118 790-816 (907)
76 PF12013 DUF3505: Protein of u 79.4 2.1 4.6E-05 37.6 3.5 25 921-945 81-109 (109)
77 PF12013 DUF3505: Protein of u 79.3 1.8 4E-05 38.0 3.0 25 777-802 11-35 (109)
78 KOG0717|consensus 77.6 1.4 3E-05 47.5 2.1 22 57-78 293-314 (508)
79 PF09986 DUF2225: Uncharacteri 72.5 0.99 2.1E-05 45.2 -0.5 24 54-77 3-26 (214)
80 PF09538 FYDLN_acid: Protein o 70.3 3.5 7.5E-05 35.7 2.3 31 56-107 9-39 (108)
81 cd00729 rubredoxin_SM Rubredox 65.3 5.3 0.00011 26.3 1.9 8 180-187 4-11 (34)
82 PF09986 DUF2225: Uncharacteri 62.5 2.1 4.6E-05 42.9 -0.5 20 808-827 5-24 (214)
83 KOG2186|consensus 62.4 3.7 8.1E-05 40.5 1.2 46 57-115 4-49 (276)
84 KOG2186|consensus 61.9 3.9 8.4E-05 40.4 1.1 46 151-199 4-49 (276)
85 TIGR02098 MJ0042_CXXC MJ0042 f 60.8 4.3 9.4E-05 27.4 0.9 34 57-105 3-36 (38)
86 PF09538 FYDLN_acid: Protein o 55.3 8.1 0.00018 33.5 1.9 9 236-244 28-36 (108)
87 TIGR02300 FYDLN_acid conserved 55.1 8.5 0.00018 33.8 2.0 16 92-107 24-39 (129)
88 KOG2636|consensus 54.9 7.5 0.00016 41.9 1.9 20 56-75 401-421 (497)
89 PF03604 DNA_RNApol_7kD: DNA d 53.4 8.5 0.00019 24.9 1.3 11 57-67 1-11 (32)
90 smart00659 RPOLCX RNA polymera 52.7 10 0.00022 26.7 1.7 10 151-160 3-12 (44)
91 PF02892 zf-BED: BED zinc fing 52.5 10 0.00022 26.8 1.7 23 920-942 16-42 (45)
92 KOG1280|consensus 52.5 60 0.0013 34.1 7.7 25 834-858 77-101 (381)
93 PF11931 DUF3449: Domain of un 52.1 5.5 0.00012 38.5 0.4 22 55-76 100-122 (196)
94 COG5151 SSL1 RNA polymerase II 50.4 9.3 0.0002 38.8 1.7 23 179-201 389-411 (421)
95 KOG4377|consensus 49.2 6.8 0.00015 41.6 0.6 21 926-946 409-429 (480)
96 COG1592 Rubrerythrin [Energy p 49.1 10 0.00022 35.7 1.6 10 204-213 147-156 (166)
97 PF15269 zf-C2H2_7: Zinc-finge 48.8 13 0.00029 25.4 1.7 23 262-284 20-42 (54)
98 PF15269 zf-C2H2_7: Zinc-finge 48.7 11 0.00024 25.8 1.3 22 319-340 21-42 (54)
99 TIGR02605 CxxC_CxxC_SSSS putat 47.7 9.9 0.00022 27.9 1.1 11 179-189 6-16 (52)
100 KOG2807|consensus 47.4 27 0.00058 36.2 4.3 24 206-229 345-368 (378)
101 PF05443 ROS_MUCR: ROS/MUCR tr 47.3 13 0.00028 33.6 2.0 27 713-742 71-97 (132)
102 smart00734 ZnF_Rad18 Rad18-lik 47.2 17 0.00037 22.2 1.8 19 152-171 3-21 (26)
103 PF05443 ROS_MUCR: ROS/MUCR tr 47.2 8.8 0.00019 34.6 0.9 25 893-920 73-97 (132)
104 PF10571 UPF0547: Uncharacteri 47.1 14 0.00031 22.5 1.5 8 237-244 17-24 (26)
105 PF13717 zinc_ribbon_4: zinc-r 47.1 16 0.00034 24.4 1.9 6 152-157 27-32 (36)
106 PRK00464 nrdR transcriptional 47.0 8.1 0.00018 36.0 0.7 12 263-274 29-40 (154)
107 KOG2807|consensus 46.4 17 0.00037 37.5 2.8 24 836-859 345-368 (378)
108 TIGR00373 conserved hypothetic 46.0 22 0.00049 33.6 3.5 20 148-167 107-126 (158)
109 TIGR00373 conserved hypothetic 46.0 18 0.0004 34.2 2.9 18 121-138 108-125 (158)
110 COG1592 Rubrerythrin [Energy p 45.6 11 0.00024 35.4 1.3 11 833-843 146-156 (166)
111 PRK00464 nrdR transcriptional 44.4 7.5 0.00016 36.2 0.1 11 95-105 29-39 (154)
112 KOG2071|consensus 44.2 17 0.00036 41.3 2.6 25 807-831 417-441 (579)
113 smart00531 TFIIE Transcription 44.0 29 0.00062 32.4 3.9 13 150-162 99-111 (147)
114 KOG2272|consensus 43.9 9.8 0.00021 37.4 0.7 12 207-218 222-233 (332)
115 PRK06266 transcription initiat 43.6 14 0.0003 35.8 1.7 15 837-851 118-132 (178)
116 COG4957 Predicted transcriptio 42.2 20 0.00042 31.7 2.2 29 711-742 73-101 (148)
117 COG1198 PriA Primosomal protei 41.9 22 0.00049 42.5 3.4 10 261-270 461-470 (730)
118 COG1996 RPC10 DNA-directed RNA 41.5 16 0.00036 26.2 1.3 10 151-160 7-16 (49)
119 PRK09678 DNA-binding transcrip 41.3 9.3 0.0002 30.2 0.1 6 97-102 4-9 (72)
120 TIGR02300 FYDLN_acid conserved 41.0 17 0.00036 32.1 1.6 8 153-160 12-19 (129)
121 PHA00626 hypothetical protein 40.6 14 0.0003 27.0 0.9 14 261-274 22-35 (59)
122 smart00614 ZnF_BED BED zinc fi 40.5 17 0.00038 26.4 1.5 21 837-857 19-44 (50)
123 smart00531 TFIIE Transcription 40.2 41 0.00089 31.4 4.3 13 122-134 99-111 (147)
124 COG1198 PriA Primosomal protei 39.4 23 0.00049 42.5 2.9 49 95-187 436-484 (730)
125 KOG4377|consensus 39.4 16 0.00034 39.0 1.4 21 239-259 408-428 (480)
126 PRK06266 transcription initiat 39.3 26 0.00056 33.9 2.8 18 149-166 116-133 (178)
127 PF12907 zf-met2: Zinc-binding 38.9 26 0.00056 24.0 1.9 21 893-913 2-25 (40)
128 PF06524 NOA36: NOA36 protein; 38.8 21 0.00045 35.5 2.1 95 173-286 137-233 (314)
129 PF08274 PhnA_Zn_Ribbon: PhnA 38.7 15 0.00032 23.4 0.7 7 151-157 20-26 (30)
130 PF02176 zf-TRAF: TRAF-type zi 38.6 33 0.00072 26.0 2.8 39 178-217 9-53 (60)
131 COG2888 Predicted Zn-ribbon RN 37.1 28 0.00062 25.9 2.0 10 714-723 27-36 (61)
132 PRK09678 DNA-binding transcrip 36.0 14 0.0003 29.3 0.3 7 211-217 34-40 (72)
133 KOG2071|consensus 35.7 22 0.00048 40.4 2.0 27 834-860 416-442 (579)
134 PRK04023 DNA polymerase II lar 35.5 33 0.00071 41.7 3.4 9 837-845 664-672 (1121)
135 PF09723 Zn-ribbon_8: Zinc rib 34.5 25 0.00054 24.5 1.4 29 354-382 6-34 (42)
136 COG5151 SSL1 RNA polymerase II 34.3 29 0.00062 35.4 2.3 81 775-859 320-411 (421)
137 PF04959 ARS2: Arsenite-resist 33.5 25 0.00054 34.9 1.7 30 918-947 75-104 (214)
138 PF02176 zf-TRAF: TRAF-type zi 32.8 25 0.00054 26.7 1.3 39 94-133 9-53 (60)
139 COG1997 RPL43A Ribosomal prote 32.6 17 0.00037 29.5 0.4 32 891-932 34-65 (89)
140 KOG3408|consensus 32.4 47 0.001 28.9 2.9 28 51-78 52-79 (129)
141 KOG2593|consensus 30.7 54 0.0012 35.8 3.7 15 150-164 128-142 (436)
142 KOG4124|consensus 29.6 9.6 0.00021 39.5 -1.8 23 918-940 396-418 (442)
143 PRK04023 DNA polymerase II lar 29.5 49 0.0011 40.3 3.5 13 177-189 650-662 (1121)
144 smart00834 CxxC_CXXC_SSSS Puta 27.9 39 0.00084 23.1 1.5 29 354-382 6-34 (41)
145 TIGR01384 TFS_arch transcripti 26.9 23 0.00049 30.8 0.2 13 178-190 16-28 (104)
146 PRK14890 putative Zn-ribbon RN 26.5 46 0.001 25.0 1.6 10 714-723 25-34 (59)
147 PRK00398 rpoP DNA-directed RNA 25.8 53 0.0011 23.3 1.9 30 354-386 4-33 (46)
148 KOG2593|consensus 25.6 50 0.0011 36.0 2.5 9 234-242 153-161 (436)
149 PF04959 ARS2: Arsenite-resist 25.3 44 0.00095 33.2 1.9 29 316-344 75-103 (214)
150 KOG1280|consensus 25.1 73 0.0016 33.5 3.4 24 778-801 80-103 (381)
151 PF14353 CpXC: CpXC protein 24.9 9.1 0.0002 34.8 -2.8 19 837-855 39-57 (128)
152 COG4530 Uncharacterized protei 24.6 57 0.0012 27.7 2.1 13 92-104 24-36 (129)
153 KOG2272|consensus 23.9 65 0.0014 31.9 2.6 20 713-732 98-117 (332)
154 KOG4167|consensus 23.2 33 0.00072 39.7 0.6 82 837-918 289-374 (907)
155 PF05290 Baculo_IE-1: Baculovi 22.9 35 0.00077 30.3 0.6 17 55-71 79-95 (140)
156 KOG3408|consensus 22.8 53 0.0012 28.6 1.6 27 90-116 53-79 (129)
157 KOG4124|consensus 22.2 28 0.00061 36.2 -0.1 24 204-227 347-372 (442)
158 PF05191 ADK_lid: Adenylate ki 21.9 41 0.00089 22.5 0.6 11 836-846 21-31 (36)
159 PRK14873 primosome assembly pr 21.8 66 0.0014 38.6 2.8 11 260-270 408-418 (665)
160 PRK14714 DNA polymerase II lar 20.9 81 0.0017 39.8 3.2 10 808-817 692-701 (1337)
161 TIGR00595 priA primosomal prot 20.3 81 0.0018 36.6 3.1 40 235-299 223-262 (505)
162 COG3357 Predicted transcriptio 20.2 71 0.0015 26.3 1.7 13 776-788 57-69 (97)
No 1
>KOG1074|consensus
Probab=100.00 E-value=2.4e-34 Score=310.81 Aligned_cols=237 Identities=27% Similarity=0.538 Sum_probs=184.4
Q ss_pred CccccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHHHHhccCCCCCCCCCCCcccC---cCCcccCC
Q psy13406 713 QEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACS---ICDRKFTI 789 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~---~C~~~f~~ 789 (1043)
.+-.|-+|-++..-.+.|+.|.|+|+|++||+|.+|+++|+++.+|+.||-+|..... ....|.|+ +|-+.|.+
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~---~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPP---ARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred CccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCcc---ccccccCCchhhhcccccc
Confidence 4678999999999999999999999999999999999999999999999999964221 13568999 99999999
Q ss_pred HHHHHHHHHHhcCCC-CCC---------CcccccccccccChHHHhhhhccc----------------CCCC----CCCC
Q psy13406 790 KGQLSRHLRAHENEK-PEK---------PQTCEVCQKTFDKMAKFKRHLKVH----------------DGVR----DHQC 839 (1043)
Q Consensus 790 ~~~l~~H~~~h~~~~-~~~---------~~~C~~C~~~f~~~~~l~~H~~~h----------------~~~~----~~~C 839 (1043)
.-.|..|+++|.+.. +.. .-.|..|.+.|.....+..++-.| +++. +..+
T Consensus 681 ~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e 760 (958)
T KOG1074|consen 681 AVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPE 760 (958)
T ss_pred cccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccc
Confidence 999999999998532 111 257999999998888887777655 2223 5789
Q ss_pred cccccccCChhHHHHHHh-----------------------hhcCCCcc-CCCcccccCCCHHHHHH-HHHhhc------
Q psy13406 840 QVCGNRFKQKSNLNIHMK-----------------------IHEGIKAH-QCNVCMMKFTNKSDLNR-HMRKHD------ 888 (1043)
Q Consensus 840 ~~C~~~f~~~~~l~~H~~-----------------------~h~~~~~~-~C~~C~~~f~~~~~l~~-H~~~h~------ 888 (1043)
..|+..+.....+..+-- .++++++. .+.+++..-...-...- =+..-.
T Consensus 761 ~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg 840 (958)
T KOG1074|consen 761 NSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG 840 (958)
T ss_pred cccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence 999999987777665532 23456666 67777655433321110 000000
Q ss_pred -----------C------------------------CCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCH
Q psy13406 889 -----------G------------------------VKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRK 933 (1043)
Q Consensus 889 -----------~------------------------~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 933 (1043)
+ .....|.+|++.|....+|..||++|+|+|||.|.+|++.|.++
T Consensus 841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttr 920 (958)
T KOG1074|consen 841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTR 920 (958)
T ss_pred cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhh
Confidence 0 01267999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhhcCCCCCCCC
Q psy13406 934 DHLKKHMRKCYGPPTGEYR 952 (1043)
Q Consensus 934 ~~l~~H~~~~~~~~~~~~~ 952 (1043)
.+|+.||.+|++..+...+
T Consensus 921 gnLKvHMgtH~w~q~~srr 939 (958)
T KOG1074|consen 921 GNLKVHMGTHMWVQPPSRR 939 (958)
T ss_pred hhhhhhhccccccCCCccC
Confidence 9999999999876654443
No 2
>KOG1074|consensus
Probab=99.97 E-value=1.3e-31 Score=289.82 Aligned_cols=56 Identities=36% Similarity=0.733 Sum_probs=53.0
Q ss_pred CccccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHHHHhccCC
Q psy13406 713 QEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRHLRKHSGV 768 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~ 768 (1043)
....|..|++.|...+.|..|+|+|+++|||.|.+|++.|.++.+|+.||..|+..
T Consensus 878 n~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMgtH~w~ 933 (958)
T KOG1074|consen 878 NAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMGTHMWV 933 (958)
T ss_pred chhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhcccccc
Confidence 34789999999999999999999999999999999999999999999999999874
No 3
>KOG2462|consensus
Probab=99.95 E-value=1.1e-28 Score=235.67 Aligned_cols=136 Identities=31% Similarity=0.717 Sum_probs=79.3
Q ss_pred CCcccCcCCcccCCHHHHHHHHHHhcCCCCCCCcccccccccccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHH
Q psy13406 776 APYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIH 855 (1043)
Q Consensus 776 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 855 (1043)
..|+|..||+.+.+..+|.+|.++|-.-...+. +.|++|||.|.+...|+.|
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka----------------------------~~C~~C~K~YvSmpALkMH 180 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKA----------------------------FSCKYCGKVYVSMPALKMH 180 (279)
T ss_pred Cceeccccccccccccccchhhccccccccccc----------------------------ccCCCCCceeeehHHHhhH
Confidence 446777777777777777777766654333334 5555555555555555555
Q ss_pred HhhhcCCCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHH
Q psy13406 856 MKIHEGIKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDH 935 (1043)
Q Consensus 856 ~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~ 935 (1043)
+|+|+ -++.|.+|||.|...+.|+-|+|+|+|||||.|+.|+|+|..+++|+.||++|.+.|+|+|..|+|.|..++.
T Consensus 181 irTH~--l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~Sy 258 (279)
T KOG2462|consen 181 IRTHT--LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSY 258 (279)
T ss_pred hhccC--CCcccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHH
Confidence 55554 3455555555555555555555555555555555555555555555555555555555555555555555555
Q ss_pred HHHHHh
Q psy13406 936 LKKHMR 941 (1043)
Q Consensus 936 l~~H~~ 941 (1043)
|.+|..
T Consensus 259 LnKH~E 264 (279)
T KOG2462|consen 259 LNKHSE 264 (279)
T ss_pred HHHhhh
Confidence 555554
No 4
>KOG2462|consensus
Probab=99.95 E-value=1.5e-28 Score=234.72 Aligned_cols=139 Identities=30% Similarity=0.656 Sum_probs=98.4
Q ss_pred cCCCceeccccccccCCHHHHHHHHHhccCCCCCCCCCCCcccCcCCcccCCHHHHHHHHHHhcCCCCCCCccccccccc
Q psy13406 738 TGIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKT 817 (1043)
Q Consensus 738 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~ 817 (1043)
.....|+|..|++.+.+.++|-+|.+.|.... ..+.+.|++|++.|.+...|..|+|+|.- +
T Consensus 126 ~~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~----s~ka~~C~~C~K~YvSmpALkMHirTH~l-----~--------- 187 (279)
T KOG2462|consen 126 AKHPRYKCPECGKSYSTSSNLSRHKQTHRSLD----SKKAFSCKYCGKVYVSMPALKMHIRTHTL-----P--------- 187 (279)
T ss_pred ccCCceeccccccccccccccchhhccccccc----ccccccCCCCCceeeehHHHhhHhhccCC-----C---------
Confidence 34456999999999999999999999997643 36779999999999999999999888862 3
Q ss_pred ccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHHHhhhcCCCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccc
Q psy13406 818 FDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSI 897 (1043)
Q Consensus 818 f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 897 (1043)
++|.+|||.|...+.|+.|+|+|||||||.|+.|++.|.++++|+.||++|.+.|+|+|..
T Consensus 188 -------------------c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~ 248 (279)
T KOG2462|consen 188 -------------------CECGICGKAFSRPWLLQGHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPR 248 (279)
T ss_pred -------------------cccccccccccchHHhhcccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcc
Confidence 4555555555555555555555555555666666666666666666666666666666666
Q ss_pred hhHHhCChHHHHHHHH
Q psy13406 898 CAKNFSRKDDLNRHMR 913 (1043)
Q Consensus 898 C~~~f~~~~~l~~H~~ 913 (1043)
|+|+|+.++.|.+|..
T Consensus 249 C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 249 CGKSFALKSYLNKHSE 264 (279)
T ss_pred hhhHHHHHHHHHHhhh
Confidence 6666666666666654
No 5
>KOG3608|consensus
Probab=99.94 E-value=7e-27 Score=227.27 Aligned_cols=281 Identities=25% Similarity=0.524 Sum_probs=230.9
Q ss_pred CCcccCCHHHHHHHHHHhccCCCCCCCCCCC-CcccCC--CCCcccCc-hHHHHHHhhhccC------------------
Q psy13406 62 CPKRYARKNRLTNHLRTHEAGSGDEKSEGGE-GSFTCS--QCPKTFVD-KWHLNRHLKSHSE------------------ 119 (1043)
Q Consensus 62 C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~-~~~~C~--~C~~~f~~-~~~l~~H~~~h~~------------------ 119 (1043)
|--.-+....+..|+.-|..+.++....... ..+.|. .|++...+ ...|.+|...|.-
T Consensus 36 C~~~~s~~~h~~dHL~~h~~~~~e~~~~~eh~de~qC~w~~C~f~~~~~s~~l~RHvy~H~y~~~l~q~G~~al~~~~di 115 (467)
T KOG3608|consen 36 CTGEWSLNGHIGDHLEHHAKAQDERGAHAEHTDEHQCTWNSCDFRTENSSADLIRHVYFHCYHTKLKQQGKLALDLHPDI 115 (467)
T ss_pred hcchhhhhhHHHHHHHhhhccccccccccccccceeEEeccCCccccchHHHHHhhhhhhhhHHHHHHHHHHHHhcCCCc
Confidence 4444444555556665555444333322222 257775 48877766 5899999876620
Q ss_pred ----------------CCcccc--ccCCccccCHHHHHHHHHhccC------------C-cceec--ccCCcccCCHHHH
Q psy13406 120 ----------------NKVFRC--EQCRFDFYVKREYNRHMKIHDG------------V-KVFLC--SVCSKTFTDKVKF 166 (1043)
Q Consensus 120 ----------------~~~~~C--~~C~~~f~~~~~l~~H~~~h~~------------~-~~~~C--~~C~~~f~~~~~l 166 (1043)
-..|.| +.|+..|.+...|..|+..|.- + -.+.| ..|-+.|.++..|
T Consensus 116 g~c~~~f~~~~~ip~~g~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~L 195 (467)
T KOG3608|consen 116 GACTAPFRLMEKIPALGQNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRL 195 (467)
T ss_pred CcccCCcchhhccccchhhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHH
Confidence 012566 5799999999999999877641 1 23567 4699999999999
Q ss_pred HHHHHhcCCCCCeecCCCcccccCcchHHHHHhhcc--CCCCccCCCCCccccChhHHHhhhhhccCCCcccCCcccccc
Q psy13406 167 NRHMRAHEGIKPFQCSVCSESFTQRSNLNIHLRIHD--GIRPFQCNVCYICFTNKSDLNRHSRIHNGIKPFLCSMCAKCF 244 (1043)
Q Consensus 167 ~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 244 (1043)
++|++.|+++|...|+.||..|.++..|..|++..+ ...+|+|..|.+.|.++..|..|++.|- .-|+|+.|+.+.
T Consensus 196 reH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHv--n~ykCplCdmtc 273 (467)
T KOG3608|consen 196 REHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHV--NCYKCPLCDMTC 273 (467)
T ss_pred HHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhh--hcccccccccCC
Confidence 999999999999999999999999999999997543 4569999999999999999999999985 569999999999
Q ss_pred CChhhHHhhhc-cccCCcceeCCCCCCcCCCHHHHHHHHHHhCCCCCcccCc--CccccCCHHHHHHHHHHhc-C--CCC
Q psy13406 245 SRKDDLNRHMK-IHEGSKRYLCTMCPKCFTRKDDLNRHMRNHDGLKPFHCSV--CFESFTQKALLNIHLRIHT-N--SKP 318 (1043)
Q Consensus 245 ~~~~~l~~H~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~--~~~ 318 (1043)
...++|..||+ .|..++||+|+.|.+.|.+.+.|.+|+.+|. +..|.|.. |..+|.+...|++|++.++ | .-+
T Consensus 274 ~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~ 352 (467)
T KOG3608|consen 274 SSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPIL 352 (467)
T ss_pred CChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCc
Confidence 99999999987 4778999999999999999999999999998 77899988 9999999999999998876 3 457
Q ss_pred ccCccchhhcCCHHHHHHHHHHhcCCC
Q psy13406 319 HACSICTESFSQKSNLYIHLKLQHGVN 345 (1043)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~~~~~ 345 (1043)
|.|-.|++.|.+..+|..|+...|+-+
T Consensus 353 Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 353 YACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred eeeecchhhhccchhHHHHHHHhhccc
Confidence 999999999999999999999988743
No 6
>KOG3608|consensus
Probab=99.94 E-value=1.5e-26 Score=224.96 Aligned_cols=226 Identities=27% Similarity=0.512 Sum_probs=202.1
Q ss_pred Ccccc--ccccccccCHHHHHHHHHHhcC------------CCc-eeccc--cccccCCHHHHHHHHHhccCCCCCCCCC
Q psy13406 713 QEFPC--SECGKVFTRKTTLKRHVRIHTG------------IKE-FQCWI--CSKCFMEKSHLNRHLRKHSGVVDNMEAE 775 (1043)
Q Consensus 713 ~~~~C--~~C~k~F~~~~~L~~H~~~H~~------------~~~-~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~~~~ 775 (1043)
..|.| ..|+..|.+...|..|+-.|.- ++| +.|.- |.+.|.+++.|+.|++.|++ +
T Consensus 133 ~~f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~-------e 205 (467)
T KOG3608|consen 133 QNFRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSN-------E 205 (467)
T ss_pred hhhccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCC-------C
Confidence 56888 5799999999999999987752 222 66754 99999999999999999998 8
Q ss_pred CCcccCcCCcccCCHHHHHHHHHHhcCCCCCCCcccccccccccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHH
Q psy13406 776 APYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIH 855 (1043)
Q Consensus 776 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 855 (1043)
+...|+.||.-|.++..|..|++..+.-.. .+|.|..|.|.|.+...|+.|+..| ..-|+|+.|+.+....+.|..|
T Consensus 206 KvvACp~Cg~~F~~~tkl~DH~rRqt~l~~-n~fqC~~C~KrFaTeklL~~Hv~rH--vn~ykCplCdmtc~~~ssL~~H 282 (467)
T KOG3608|consen 206 KVVACPHCGELFRTKTKLFDHLRRQTELNT-NSFQCAQCFKRFATEKLLKSHVVRH--VNCYKCPLCDMTCSSASSLTTH 282 (467)
T ss_pred eEEecchHHHHhccccHHHHHHHhhhhhcC-CchHHHHHHHHHhHHHHHHHHHHHh--hhcccccccccCCCChHHHHHH
Confidence 899999999999999999999998775433 3899999999999999999999988 4579999999999999999999
Q ss_pred Hh-hhcCCCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccc--hhHHhCChHHHHHHHHhcc-CCC--cccCccchhh
Q psy13406 856 MK-IHEGIKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSI--CAKNFSRKDDLNRHMRKHD-GIK--PFLCSMCAEA 929 (1043)
Q Consensus 856 ~~-~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~~~--~~~C~~C~~~ 929 (1043)
|+ .|...|||+|+.|++.|.+.++|.+|...|. +--|.|.. |..+|.+...|++|++.++ |.. +|+|-.|++.
T Consensus 283 ~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ 361 (467)
T KOG3608|consen 283 IRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRF 361 (467)
T ss_pred HHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhh
Confidence 97 5778999999999999999999999999998 67799999 9999999999999999776 544 5999999999
Q ss_pred cCCHHHHHHHHhhhcCCCCC
Q psy13406 930 FSRKDHLKKHMRKCYGPPTG 949 (1043)
Q Consensus 930 f~~~~~l~~H~~~~~~~~~~ 949 (1043)
|.+..+|..|+.+.|+-..+
T Consensus 362 ft~G~~L~~HL~kkH~f~~P 381 (467)
T KOG3608|consen 362 FTSGKSLSAHLMKKHGFRLP 381 (467)
T ss_pred hccchhHHHHHHHhhcccCC
Confidence 99999999999998886543
No 7
>KOG3623|consensus
Probab=99.93 E-value=7e-25 Score=232.89 Aligned_cols=85 Identities=36% Similarity=0.817 Sum_probs=80.2
Q ss_pred CCCCCCCccccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHHHHhccCCCCCCCCCCCcccCcCCcc
Q psy13406 707 KKGDTPQEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACSICDRK 786 (1043)
Q Consensus 707 ~~~~~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~~~ 786 (1043)
..+....+|.|+.|+|+|...++|.+|...|+|.+||+|.+|.|+|.-+-+|..|+|.|.| ++||.|..|+|+
T Consensus 887 ~~kte~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSG-------EKPfQCdKClKR 959 (1007)
T KOG3623|consen 887 HAKTEDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSG-------EKPFQCDKCLKR 959 (1007)
T ss_pred cccCccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccC-------CCcchhhhhhhh
Confidence 4455568899999999999999999999999999999999999999999999999999999 999999999999
Q ss_pred cCCHHHHHHHHH
Q psy13406 787 FTIKGQLSRHLR 798 (1043)
Q Consensus 787 f~~~~~l~~H~~ 798 (1043)
|...+.+..||.
T Consensus 960 FSHSGSYSQHMN 971 (1007)
T KOG3623|consen 960 FSHSGSYSQHMN 971 (1007)
T ss_pred cccccchHhhhc
Confidence 999999999985
No 8
>KOG3623|consensus
Probab=99.92 E-value=3.5e-25 Score=235.20 Aligned_cols=114 Identities=31% Similarity=0.669 Sum_probs=93.6
Q ss_pred CccccccccccccCHHHHHHHHHHhc--CCCceeccccccccCCHHHHHHHHHhccCCCCC------CCCCCCcccCcCC
Q psy13406 713 QEFPCSECGKVFTRKTTLKRHVRIHT--GIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDN------MEAEAPYACSICD 784 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~H~~~H~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~------~~~~~~~~C~~C~ 784 (1043)
+...|++|++.+.+...|+.|+..-+ .+..|.|.+|.++|..+..|.+||..|....+. ....+.|+|..|+
T Consensus 209 qlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECg 288 (1007)
T KOG3623|consen 209 QLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECG 288 (1007)
T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccc
Confidence 45689999999999999999986532 345689999999999999999999888654332 1234679999999
Q ss_pred cccCCHHHHHHHHHHhcCCCCCCCcccccccccccChHHHhhhhc
Q psy13406 785 RKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKMAKFKRHLK 829 (1043)
Q Consensus 785 ~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 829 (1043)
|.|..+..|+.|+|+|.|++ ||.|+.|.|.|.....+-.||-
T Consensus 289 KAFKfKHHLKEHlRIHSGEK---PfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 289 KAFKFKHHLKEHLRIHSGEK---PFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhhHHHHHhhheeecCCC---CcCCcccccccccCCccccccc
Confidence 99999999999999999886 8999999999998888888873
No 9
>KOG3576|consensus
Probab=99.72 E-value=1.7e-18 Score=155.55 Aligned_cols=121 Identities=29% Similarity=0.641 Sum_probs=67.4
Q ss_pred CcccccccccccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHHHhhhcCCCccCCCcccccCCCHHHHHHHHHhh
Q psy13406 808 PQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMKFTNKSDLNRHMRKH 887 (1043)
Q Consensus 808 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 887 (1043)
.|.|.+|+|.|.-...|.+|++.|...+.|-|..||+.|....+|++|+|+|+|.+||+|..|++.|+.+-.|..|+++-
T Consensus 117 ~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~kv 196 (267)
T KOG3576|consen 117 SFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLKKV 196 (267)
T ss_pred eeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHHHH
Confidence 45555555555555555555555555555555555555555555555555555555555555555555555555555443
Q ss_pred cCCCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHHHHHHHhhhcC
Q psy13406 888 DGVKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 888 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
+|... .|. .+ -...|.|.|..||..-.+...+..|++.+|.
T Consensus 197 hgv~~--------~ya----yk-----err~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 197 HGVQH--------QYA----YK-----ERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred cCchH--------HHH----HH-----HhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 33210 000 00 0124557787788777777777777777664
No 10
>KOG3576|consensus
Probab=99.70 E-value=5e-18 Score=152.61 Aligned_cols=112 Identities=35% Similarity=0.755 Sum_probs=78.5
Q ss_pred CCccccccCCccccCHHHHHHHHHhccCCcceecccCCcccCCHHHHHHHHHhcCCCCCeecCCCcccccCcchHHHHHh
Q psy13406 120 NKVFRCEQCRFDFYVKREYNRHMKIHDGVKVFLCSVCSKTFTDKVKFNRHMRAHEGIKPFQCSVCSESFTQRSNLNIHLR 199 (1043)
Q Consensus 120 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 199 (1043)
...|.|.+|++.|...+.|.+||+-|...+.|-|..|||.|....+|.+|+|+|+|.+||+|..|++.|.++-.|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 45577777777777777777777777777777788888888877788888888888888888888888888777877776
Q ss_pred hccCC-----------CCccCCCCCccccChhHHHhhhhhccC
Q psy13406 200 IHDGI-----------RPFQCNVCYICFTNKSDLNRHSRIHNG 231 (1043)
Q Consensus 200 ~h~~~-----------~~~~C~~C~~~f~~~~~l~~H~~~h~~ 231 (1043)
.-+|. +.|.|+.||.+-.....+..|++.|+.
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp 237 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHP 237 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCC
Confidence 54442 345555555555555555555555443
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.31 E-value=4e-12 Score=139.81 Aligned_cols=145 Identities=19% Similarity=0.449 Sum_probs=107.1
Q ss_pred ccCcCCcccCCHHHHHHHHHHhcCCCCCCCccccc--ccccccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHHH
Q psy13406 779 ACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEV--CQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIHM 856 (1043)
Q Consensus 779 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 856 (1043)
.|+.|..... ...|..|.....-. .-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|+
T Consensus 409 ~C~NC~~~i~-l~~l~lHe~~C~r~----~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 409 ECRNCKHYIP-SRSIALHEAYCSRH----NVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred ECCCCCCccc-hhHHHHHHhhCCCc----ceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHH
Confidence 5666666554 33444665433221 345774 888873 3344444 48999999995 67899999
Q ss_pred hhhcCCCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhC----------ChHHHHHHHHhccCCCcccCccc
Q psy13406 857 KIHEGIKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFS----------RKDDLNRHMRKHDGIKPFLCSMC 926 (1043)
Q Consensus 857 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C 926 (1043)
++|+ +++.|+ |++.+ .+..|..|+++|.+.+++.|+.|++.|. ....|..|+..+ |.+++.|..|
T Consensus 473 ~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~C 547 (567)
T PLN03086 473 KVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSC 547 (567)
T ss_pred HhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEcccc
Confidence 9886 789999 99655 6789999999999999999999999885 245899998885 8999999999
Q ss_pred hhhcCCHHHHHHHHhhhcC
Q psy13406 927 AEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 927 ~~~f~~~~~l~~H~~~~~~ 945 (1043)
|+.|..+ .|..|+-..|.
T Consensus 548 gk~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 548 GRSVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred CCeeeeh-hHHHHHHHhhc
Confidence 9888765 56778776664
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.20 E-value=3.9e-11 Score=132.11 Aligned_cols=141 Identities=20% Similarity=0.477 Sum_probs=73.6
Q ss_pred ccCCCCCcccCchHHHHHHhhhccCCCcccccc--CCccccCHHHHHHHHHhccCCcceecccCCcccCCHHHHHHHHHh
Q psy13406 95 FTCSQCPKTFVDKWHLNRHLKSHSENKVFRCEQ--CRFDFYVKREYNRHMKIHDGVKVFLCSVCSKTFTDKVKFNRHMRA 172 (1043)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 172 (1043)
-.|+.|.+.... ..|..|...-. ...-.|+. |+..|. +..+..| +.|+.|++.|. ...|..|++.
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C~-r~~V~Cp~~~Cg~v~~-r~el~~H---------~~C~~Cgk~f~-~s~LekH~~~ 474 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYCS-RHNVVCPHDGCGIVLR-VEEAKNH---------VHCEKCGQAFQ-QGEMEKHMKV 474 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhCC-CcceeCCcccccceee-ccccccC---------ccCCCCCCccc-hHHHHHHHHh
Confidence 346666555432 23335543221 12234552 666552 2222222 35666666664 4556666666
Q ss_pred cCCCCCeecCCCcccccCcchHHHHHhhccCCCCccCCCCCcccc----------ChhHHHhhhhhccCCCcccCCcccc
Q psy13406 173 HEGIKPFQCSVCSESFTQRSNLNIHLRIHDGIRPFQCNVCYICFT----------NKSDLNRHSRIHNGIKPFLCSMCAK 242 (1043)
Q Consensus 173 h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~----------~~~~l~~H~~~h~~~~~~~C~~C~~ 242 (1043)
|+ +++.|+ |++.+ .+..|..|+.+|.+.+++.|.+|++.|. ....|..|.... +.+++.|..|++
T Consensus 475 ~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk 549 (567)
T PLN03086 475 FH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGR 549 (567)
T ss_pred cC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCC
Confidence 54 456666 66543 4456666666666666666666666653 123566666553 566666666666
Q ss_pred ccCChhhHHhhh
Q psy13406 243 CFSRKDDLNRHM 254 (1043)
Q Consensus 243 ~f~~~~~l~~H~ 254 (1043)
.|..+ .|..|+
T Consensus 550 ~Vrlr-dm~~H~ 560 (567)
T PLN03086 550 SVMLK-EMDIHQ 560 (567)
T ss_pred eeeeh-hHHHHH
Confidence 65443 344444
No 13
>PHA00733 hypothetical protein
Probab=99.04 E-value=1.3e-10 Score=103.61 Aligned_cols=55 Identities=27% Similarity=0.614 Sum_probs=35.7
Q ss_pred CCCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHHHHHHHhhhcC
Q psy13406 889 GVKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 889 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
+.+||.|+.|++.|.+...|..|++.| ..+|.|..|++.|.....|..|+...|+
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 356666666666666666666666654 3456666777777776676666666665
No 14
>PHA00733 hypothetical protein
Probab=99.01 E-value=2.9e-10 Score=101.45 Aligned_cols=54 Identities=26% Similarity=0.635 Sum_probs=31.3
Q ss_pred CCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHHHHHHHhccC
Q psy13406 862 IKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMRKHDG 917 (1043)
Q Consensus 862 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 917 (1043)
.++|.|+.|++.|.+...|..|++.| +.+|.|.+|++.|.....|.+|+...++
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 55566666666666666666665554 2345666666666666666666655544
No 15
>KOG1146|consensus
Probab=99.00 E-value=2.7e-10 Score=132.76 Aligned_cols=122 Identities=12% Similarity=0.181 Sum_probs=83.3
Q ss_pred cccccccccccChHHHhhhhcccC----CCCCCCCcccccccCChhHHHHHHhhhcCCCccCCCcccccCCCHHHHHHHH
Q psy13406 809 QTCEVCQKTFDKMAKFKRHLKVHD----GVRDHQCQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMKFTNKSDLNRHM 884 (1043)
Q Consensus 809 ~~C~~C~~~f~~~~~l~~H~~~h~----~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 884 (1043)
..|..|...|.....+..|-+--- ....+.|..|++.|.....+. |+-. ..+|.|..|...|.....|..|.
T Consensus 1229 l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~~---~~~~~~~~~~~~~~~~~~l~~~~ 1304 (1406)
T KOG1146|consen 1229 LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLDV---THRYLCRQCKMAFDGEAPLTAHQ 1304 (1406)
T ss_pred ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-eccc---chhHHHHHHHhhhcchhHHHHHH
Confidence 345666666666665555533111 112367888888887777766 5532 34678888888888888888776
Q ss_pred HhhcCCCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHHHHHHHhhhcCC
Q psy13406 885 RKHDGVKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHLKKHMRKCYGP 946 (1043)
Q Consensus 885 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 946 (1043)
..|-+.|.....+.-|+..+.....| |.+|...|.....|..||+..++.
T Consensus 1305 -----------~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1305 -----------RKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred -----------HHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhc
Confidence 23555566666666677777777777 999999999999999999976653
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.80 E-value=1.4e-09 Score=77.76 Aligned_cols=43 Identities=23% Similarity=0.544 Sum_probs=28.9
Q ss_pred CcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHHH
Q psy13406 892 PFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHL 936 (1043)
Q Consensus 892 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 936 (1043)
.|.|+.||+.|+...+|..||++|+ ++|+|..|++.|.+.+.|
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 3667777777777777777777766 566777777776665554
No 17
>KOG3993|consensus
Probab=98.77 E-value=1.6e-09 Score=110.21 Aligned_cols=49 Identities=16% Similarity=0.095 Sum_probs=27.8
Q ss_pred eCCCCCCcCCCHHHHHHHHHHhCCCCCcccCcCccccCCHHHHHHHHHH
Q psy13406 264 LCTMCPKCFTRKDDLNRHMRNHDGLKPFHCSVCFESFTQKALLNIHLRI 312 (1043)
Q Consensus 264 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 312 (1043)
.|++|+-.+.++..--.+.+.-..+..|.|.+|.-.|.+...|.+|+..
T Consensus 432 ~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~ 480 (500)
T KOG3993|consen 432 LPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINK 480 (500)
T ss_pred CCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhh
Confidence 4555655555544434444433444556666666666666666666654
No 18
>KOG3993|consensus
Probab=98.77 E-value=1.4e-09 Score=110.65 Aligned_cols=190 Identities=18% Similarity=0.242 Sum_probs=106.0
Q ss_pred CccccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHHHHhccCCCCCC--------------------
Q psy13406 713 QEFPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDNM-------------------- 772 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~-------------------- 772 (1043)
..|.|.+|...|.....|.+|.-.-...-.|+|+.|+|.|....+|..|.|+|.......
T Consensus 266 GdyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 266 GDYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 569999999999999999999732222234999999999999999999999996432111
Q ss_pred ------CCCCCcccCcCCcccCCHHHHHHHHHHhcCCCCCCCcccccccccccChHHHhhhhcccCCCCCCCCccccccc
Q psy13406 773 ------EAEAPYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRF 846 (1043)
Q Consensus 773 ------~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f 846 (1043)
..+..|.|.+|++.|.....|+.|+.+|+.....+.- .-.|..... -.--+.|+.|+-.+
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~-----a~~f~~s~~---------~~l~~~~~~~a~h~ 411 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEK-----APKFLLSRV---------IPLMHFNQAVATHS 411 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhc-----ccCcchhhc---------cccccccccccccc
Confidence 0112345555555555555555554444432100000 000000000 00013455555544
Q ss_pred CChhHHHHHHhhhcC-CCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHHHHHHHhcc
Q psy13406 847 KQKSNLNIHMKIHEG-IKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMRKHD 916 (1043)
Q Consensus 847 ~~~~~l~~H~~~h~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 916 (1043)
.....--.|...+.+ .....|++|+..+.++..--.|.+.-..+..|.|.+|.-+|.+...|.+|+.+-|
T Consensus 412 ~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 412 SASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCH 482 (500)
T ss_pred ccccccccceeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcC
Confidence 433322222222221 1223566676666666555555554444556777777777777777777776533
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.71 E-value=4.8e-09 Score=75.06 Aligned_cols=42 Identities=21% Similarity=0.486 Sum_probs=23.2
Q ss_pred cCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHH
Q psy13406 865 HQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDL 908 (1043)
Q Consensus 865 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 908 (1043)
|.|+.||+.|.....|..||++|+ ++|+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 555555555555555555555555 455555555555544433
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.57 E-value=2.9e-08 Score=60.62 Aligned_cols=25 Identities=52% Similarity=1.135 Sum_probs=18.2
Q ss_pred HHHHHHHhccCCCcccCccchhhcC
Q psy13406 907 DLNRHMRKHDGIKPFLCSMCAEAFS 931 (1043)
Q Consensus 907 ~l~~H~~~h~~~~~~~C~~C~~~f~ 931 (1043)
+|.+||++|+|++||.|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4677777777777777777777775
No 21
>KOG1146|consensus
Probab=98.55 E-value=4.7e-08 Score=114.55 Aligned_cols=125 Identities=13% Similarity=0.168 Sum_probs=88.2
Q ss_pred cccCcCCcccCCHHHHHHHHHHhcCC-CCCCCcccccccccccChHHHhhhhcccCCCCCCCCcccccccCChhHHHHHH
Q psy13406 778 YACSICDRKFTIKGQLSRHLRAHENE-KPEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCGNRFKQKSNLNIHM 856 (1043)
Q Consensus 778 ~~C~~C~~~f~~~~~l~~H~~~h~~~-~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 856 (1043)
..|..|...|.....+..|-+--..+ ...+.+.|..|++.|.....+. |+- ...+|.|..|...|.....|..|.
T Consensus 1229 l~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~---~~~~~~~~~~~~~~~~~~~l~~~~ 1304 (1406)
T KOG1146|consen 1229 LLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLD---VTHRYLCRQCKMAFDGEAPLTAHQ 1304 (1406)
T ss_pred ccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-ecc---cchhHHHHHHHhhhcchhHHHHHH
Confidence 45677777777777666654322111 1234578999999888887777 653 345689999999999999999888
Q ss_pred hhhcCCCccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHHHHHHHhccCC
Q psy13406 857 KIHEGIKAHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMRKHDGI 918 (1043)
Q Consensus 857 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 918 (1043)
+ .|-+.|.....+.-|+..+..-..| |.+|...|.....|..||+.-.++
T Consensus 1305 ~-----------k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~ 1354 (1406)
T KOG1146|consen 1305 R-----------KFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHR 1354 (1406)
T ss_pred H-----------HHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhc
Confidence 2 2344555666666677777666777 999999999999999999854443
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.35 E-value=3e-07 Score=56.16 Aligned_cols=26 Identities=46% Similarity=0.933 Sum_probs=23.0
Q ss_pred HHHHHHHHhcCCCceeccccccccCC
Q psy13406 729 TLKRHVRIHTGIKEFQCWICSKCFME 754 (1043)
Q Consensus 729 ~L~~H~~~H~~~~~~~C~~C~~~f~~ 754 (1043)
+|.+|+++|++++||.|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 58899999999999999999999863
No 23
>PHA00616 hypothetical protein
Probab=98.33 E-value=3.2e-07 Score=62.30 Aligned_cols=34 Identities=15% Similarity=0.325 Sum_probs=23.5
Q ss_pred CcCccchhHHhCChHHHHHHHHhccCCCcccCcc
Q psy13406 892 PFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSM 925 (1043)
Q Consensus 892 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 925 (1043)
||+|+.||+.|..++.|.+|++.|+|+++|.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4667777777777777777777777777766653
No 24
>PHA00616 hypothetical protein
Probab=98.22 E-value=6.5e-07 Score=60.82 Aligned_cols=32 Identities=19% Similarity=0.257 Sum_probs=16.7
Q ss_pred cCCCcccccCCCHHHHHHHHHhhcCCCCcCcc
Q psy13406 865 HQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCS 896 (1043)
Q Consensus 865 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~ 896 (1043)
|+|+.||+.|..++.|.+|++.|||+++|.|+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 45555555555555555555555555555443
No 25
>PHA00732 hypothetical protein
Probab=98.02 E-value=2.3e-06 Score=68.58 Aligned_cols=45 Identities=27% Similarity=0.635 Sum_probs=26.3
Q ss_pred ccCCCcccccCCCHHHHHHHHHh-hcCCCCcCccchhHHhCChHHHHHHHHh
Q psy13406 864 AHQCNVCMMKFTNKSDLNRHMRK-HDGVKPFLCSICAKNFSRKDDLNRHMRK 914 (1043)
Q Consensus 864 ~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~ 914 (1043)
||.|+.|++.|.+...|+.|++. |+ ++.|+.|++.|. .|..|+++
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhcc
Confidence 45666666666666666666653 33 245666666665 35556543
No 26
>PHA00732 hypothetical protein
Probab=97.93 E-value=8.2e-06 Score=65.45 Aligned_cols=45 Identities=31% Similarity=0.673 Sum_probs=24.0
Q ss_pred eecccCCcccCCHHHHHHHHHh-cCCCCCeecCCCcccccCcchHHHHHhhc
Q psy13406 151 FLCSVCSKTFTDKVKFNRHMRA-HEGIKPFQCSVCSESFTQRSNLNIHLRIH 201 (1043)
Q Consensus 151 ~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 201 (1043)
|.|..|++.|.+...|..|++. |. ++.|+.|++.|. .|..|++++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 5555555555555555555553 32 235555665554 355555433
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.79 E-value=3.9e-05 Score=57.15 Aligned_cols=49 Identities=29% Similarity=0.573 Sum_probs=22.9
Q ss_pred cCCCcccccCCCHHHHHHHHHh-hcCC-CCcCccchhHHhCChHHHHHHHHhcc
Q psy13406 865 HQCNVCMMKFTNKSDLNRHMRK-HDGV-KPFLCSICAKNFSRKDDLNRHMRKHD 916 (1043)
Q Consensus 865 ~~C~~C~~~f~~~~~l~~H~~~-h~~~-~~~~C~~C~~~f~~~~~l~~H~~~h~ 916 (1043)
|.||+|++ ..+...|..|... |..+ +.+.|++|...+. .+|.+||..++
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 45555555 3334455555443 2222 3455555555433 25555555443
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.68 E-value=4.9e-05 Score=56.62 Aligned_cols=51 Identities=24% Similarity=0.512 Sum_probs=41.4
Q ss_pred CcCccchhHHhCChHHHHHHHHhccC-C-CcccCccchhhcCCHHHHHHHHhhhcC
Q psy13406 892 PFLCSICAKNFSRKDDLNRHMRKHDG-I-KPFLCSMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 892 ~~~C~~C~~~f~~~~~l~~H~~~h~~-~-~~~~C~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
.|.|++|++ ..+...|..|....+. + +.+.|++|...+. .+|..||..+|+
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 489999999 5667899999875443 3 5699999999765 499999998774
No 29
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.58 E-value=3.4e-05 Score=76.00 Aligned_cols=25 Identities=36% Similarity=0.784 Sum_probs=21.6
Q ss_pred ccceecCC--CCcccCCHHHHHHHHHH
Q psy13406 54 VKKFKCNI--CPKRYARKNRLTNHLRT 78 (1043)
Q Consensus 54 ~~~~~C~~--C~~~f~~~~~l~~H~~~ 78 (1043)
.|+|+|++ |+|.|.+...|+.|+.-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lh 373 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLH 373 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhc
Confidence 48999997 99999999999999854
No 30
>KOG2231|consensus
Probab=97.51 E-value=0.00061 Score=76.98 Aligned_cols=139 Identities=26% Similarity=0.582 Sum_probs=87.1
Q ss_pred eeccccccccCCHHHHHHHHHhccCCCCCCCCCCCcccCcCCcccCCHHHHHHHHHHhcCCCCCCCccccccccc-----
Q psy13406 743 FQCWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACSICDRKFTIKGQLSRHLRAHENEKPEKPQTCEVCQKT----- 817 (1043)
Q Consensus 743 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~C~~C~~~----- 817 (1043)
+.|.+|++.|.-.. ..-.|..| ..|.+...|+.|++..|. .+.|.+|-..
T Consensus 100 ~~C~~C~~~~~~~~-------------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~-----~~~c~lC~~~~kif~ 154 (669)
T KOG2231|consen 100 HSCHICDRRFRALY-------------------NKKECLHC-TEFKSVENLKNHMRDQHK-----LHLCSLCLQNLKIFI 154 (669)
T ss_pred hhcCccccchhhhc-------------------ccCCCccc-cchhHHHHHHHHHHHhhh-----hhccccccccceeee
Confidence 66888887763221 11257777 777788888888865553 3455555322
Q ss_pred ----ccChHHHhhhhcccCC-CCC----CCCcccccccCChhHHHHHHhhhcCCCccCCCccc------ccCCCHHHHHH
Q psy13406 818 ----FDKMAKFKRHLKVHDG-VRD----HQCQVCGNRFKQKSNLNIHMKIHEGIKAHQCNVCM------MKFTNKSDLNR 882 (1043)
Q Consensus 818 ----f~~~~~l~~H~~~h~~-~~~----~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~------~~f~~~~~l~~ 882 (1043)
..+...|..|++.-.. ++. -.|..|...|-....|.+|++.++ |.|.+|+ ..|.....|..
T Consensus 155 ~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~ 230 (669)
T KOG2231|consen 155 NERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEE 230 (669)
T ss_pred eeeehehHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHH
Confidence 2345667777653221 221 358888888888888888888665 5566663 45677788888
Q ss_pred HHHhhcCCCCcCcc--chh-HHhCChHHHHHHHHh
Q psy13406 883 HMRKHDGVKPFLCS--ICA-KNFSRKDDLNRHMRK 914 (1043)
Q Consensus 883 H~~~h~~~~~~~C~--~C~-~~f~~~~~l~~H~~~ 914 (1043)
|.|.+| |.|. .|. +.|.....+..|++.
T Consensus 231 HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk~ 261 (669)
T KOG2231|consen 231 HFRKGH----FLCEEEFCRTKKFYVAFELEIELKA 261 (669)
T ss_pred HhhhcC----ccccccccccceeeehhHHHHHHHh
Confidence 888776 7777 564 344444344555543
No 31
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.43 E-value=3.5e-05 Score=75.88 Aligned_cols=24 Identities=38% Similarity=0.639 Sum_probs=19.4
Q ss_pred CCccCCCc--ccccCCCHHHHHHHHH
Q psy13406 862 IKAHQCNV--CMMKFTNKSDLNRHMR 885 (1043)
Q Consensus 862 ~~~~~C~~--C~~~f~~~~~l~~H~~ 885 (1043)
+|||+|++ |++.+.+...|+.||.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~l 372 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHML 372 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhh
Confidence 58888887 8888888888888876
No 32
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.39 E-value=0.00014 Score=43.21 Aligned_cols=22 Identities=41% Similarity=0.967 Sum_probs=14.0
Q ss_pred eecCCCCcccCCHHHHHHHHHH
Q psy13406 57 FKCNICPKRYARKNRLTNHLRT 78 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~ 78 (1043)
|+|++|++.|.+...|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 5566666666666666666654
No 33
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.39 E-value=0.00011 Score=43.75 Aligned_cols=23 Identities=52% Similarity=1.090 Sum_probs=14.0
Q ss_pred cccccccccccCHHHHHHHHHHh
Q psy13406 715 FPCSECGKVFTRKTTLKRHVRIH 737 (1043)
Q Consensus 715 ~~C~~C~k~F~~~~~L~~H~~~H 737 (1043)
|.|+.|++.|.+...|++|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45666666666666666666553
No 34
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.17 E-value=0.00027 Score=42.58 Aligned_cols=24 Identities=38% Similarity=0.897 Sum_probs=14.1
Q ss_pred ccCccchhhcCCHHHHHHHHhhhc
Q psy13406 921 FLCSMCAEAFSRKDHLKKHMRKCY 944 (1043)
Q Consensus 921 ~~C~~C~~~f~~~~~l~~H~~~~~ 944 (1043)
|.|++|++.|.+...|..|++.+|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666554
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.12 E-value=0.00031 Score=61.36 Aligned_cols=75 Identities=29% Similarity=0.528 Sum_probs=20.9
Q ss_pred CCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCCHHHHHHHHhhhcC
Q psy13406 866 QCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 866 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
+|.+|+..|.+...|..||...|+... + ....+.....|..+++.- -...+.|.+|++.|.+...|..||+.++.
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~-~~~~~~C~~C~~~f~s~~~l~~Hm~~~~H 75 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKK-VKESFRCPYCNKTFRSREALQEHMRSKHH 75 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTTT
T ss_pred Ccccccccccccccccccccccccccc---c-cccccccccccccccccc-cCCCCCCCccCCCCcCHHHHHHHHcCccC
Confidence 477888888888888888866554321 1 222222444444454432 22258899999999999999999986543
No 36
>KOG2231|consensus
Probab=97.07 E-value=0.0017 Score=73.44 Aligned_cols=143 Identities=22% Similarity=0.546 Sum_probs=90.8
Q ss_pred cccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHHHHhccCCCCCCCCCCCcccCcC----------C
Q psy13406 715 FPCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRHLRKHSGVVDNMEAEAPYACSIC----------D 784 (1043)
Q Consensus 715 ~~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~~~~~~~~C~~C----------~ 784 (1043)
+.|.+|++.|.-.. ..-.|..| -.|.+...|+.|++.-|. .+.|.+| .
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~---------~~~c~lC~~~~kif~~e~ 157 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHK---------LHLCSLCLQNLKIFINER 157 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhh---------hhccccccccceeeeeee
Confidence 67888887763211 11357777 777777778888744332 2445554 2
Q ss_pred cccCCHHHHHHHHHHhcC-CCCCCC-cccccccccccChHHHhhhhcccCCCCCCCCccc------ccccCChhHHHHHH
Q psy13406 785 RKFTIKGQLSRHLRAHEN-EKPEKP-QTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVC------GNRFKQKSNLNIHM 856 (1043)
Q Consensus 785 ~~f~~~~~l~~H~~~h~~-~~~~~~-~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~l~~H~ 856 (1043)
+.| +..+|..|+..-.. +...+. -.|..|...|-....|.+|++.++ |.|..| +..|.....|..|.
T Consensus 158 k~Y-t~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~Hf 232 (669)
T KOG2231|consen 158 KLY-TRAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHF 232 (669)
T ss_pred ehe-hHHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHh
Confidence 223 35667777654322 222222 469999999999999999998765 567777 45678888999999
Q ss_pred hhhcCCCccCCC--cccc-cCCCHHHHHHHHHhhc
Q psy13406 857 KIHEGIKAHQCN--VCMM-KFTNKSDLNRHMRKHD 888 (1043)
Q Consensus 857 ~~h~~~~~~~C~--~C~~-~f~~~~~l~~H~~~h~ 888 (1043)
|.+| |.|. .|.- .|.....+..|++.|.
T Consensus 233 R~~H----flCE~~~C~~~~f~~~~~~ei~lk~~~ 263 (669)
T KOG2231|consen 233 RKGH----FLCEEEFCRTKKFYVAFELEIELKAHN 263 (669)
T ss_pred hhcC----ccccccccccceeeehhHHHHHHHhhc
Confidence 9876 6777 4543 3333435555555443
No 37
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.96 E-value=0.00055 Score=59.76 Aligned_cols=72 Identities=21% Similarity=0.375 Sum_probs=14.1
Q ss_pred CCCCCcccCchHHHHHHhhhccCCCccccccCCccccCHHHHHHHHHhccCCcceecccCCcccCCHHHHHHHHHhc
Q psy13406 97 CSQCPKTFVDKWHLNRHLKSHSENKVFRCEQCRFDFYVKREYNRHMKIHDGVKVFLCSVCSKTFTDKVKFNRHMRAH 173 (1043)
Q Consensus 97 C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h 173 (1043)
|.+|+..|.+...|..||...++-.. . ....+.....+..+++... ...+.|.+|++.|.+...|..||+.+
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~---~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDI---P-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp ----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred cccccccccccccccccccccccccc---c-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 55566666666666666544333210 0 1111112222222322111 11356666666666666666666643
No 38
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.93 E-value=0.00079 Score=40.49 Aligned_cols=23 Identities=48% Similarity=1.032 Sum_probs=13.0
Q ss_pred eecCCCCcccCCHHHHHHHHHHh
Q psy13406 57 FKCNICPKRYARKNRLTNHLRTH 79 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~h 79 (1043)
|.|++|++.|.+...|..|+++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 45666666666666666666554
No 39
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.83 E-value=0.00063 Score=42.30 Aligned_cols=25 Identities=36% Similarity=0.769 Sum_probs=17.6
Q ss_pred cccCccchhhcCCHHHHHHHHhhhc
Q psy13406 920 PFLCSMCAEAFSRKDHLKKHMRKCY 944 (1043)
Q Consensus 920 ~~~C~~C~~~f~~~~~l~~H~~~~~ 944 (1043)
||.|..|++.|.+...|..|++.|+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4677777777777777777776654
No 40
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.77 E-value=0.00089 Score=41.61 Aligned_cols=23 Identities=39% Similarity=0.818 Sum_probs=10.2
Q ss_pred eecccCCcccCCHHHHHHHHHhc
Q psy13406 151 FLCSVCSKTFTDKVKFNRHMRAH 173 (1043)
Q Consensus 151 ~~C~~C~~~f~~~~~l~~H~~~h 173 (1043)
|.|..|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 34444444444444444444433
No 41
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.73 E-value=0.0019 Score=64.76 Aligned_cols=124 Identities=25% Similarity=0.518 Sum_probs=75.7
Q ss_pred ccccccChHHHhhhhcccCCCCCCCCccccc---cc------CChhHHHHHHhhhcCCCcc----CCCcccccCCCHHHH
Q psy13406 814 CQKTFDKMAKFKRHLKVHDGVRDHQCQVCGN---RF------KQKSNLNIHMKIHEGIKAH----QCNVCMMKFTNKSDL 880 (1043)
Q Consensus 814 C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~---~f------~~~~~l~~H~~~h~~~~~~----~C~~C~~~f~~~~~l 880 (1043)
|..+......|+.|.+..++ .+.|.+|-. .| -++..|+.|...-..+..| .|..|...|..-..|
T Consensus 159 c~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL 236 (493)
T COG5236 159 CHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDEL 236 (493)
T ss_pred hhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccceecChHHH
Confidence 44444445555555554333 234444432 22 2455666666443323223 488888888888888
Q ss_pred HHHHHhhcCCCCcCccchhH-------HhCChHHHHHHHHhccCCCcccCcc--ch----hhcCCHHHHHHHHhhhcCCC
Q psy13406 881 NRHMRKHDGVKPFLCSICAK-------NFSRKDDLNRHMRKHDGIKPFLCSM--CA----EAFSRKDHLKKHMRKCYGPP 947 (1043)
Q Consensus 881 ~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~~~~~ 947 (1043)
.+|+|.-| + .|.||++ -|..-.+|.+|.+.-+ |.|.+ |- ..|.....|..|+..-|+..
T Consensus 237 ~~HcR~~H-E---~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~ 308 (493)
T COG5236 237 RRHCRLRH-E---ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVN 308 (493)
T ss_pred HHHHHhhh-h---hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHHHHHhhcc
Confidence 88887644 2 4666654 3777888888877433 66653 32 46888999999998877754
No 42
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.39 E-value=0.0021 Score=64.46 Aligned_cols=69 Identities=26% Similarity=0.614 Sum_probs=37.6
Q ss_pred ecCCCcccccCcchHHHHHhhccCCCCccCCCCCc-------cccChhHHHhhhhhccCCCcccCCc--cc----cccCC
Q psy13406 180 QCSVCSESFTQRSNLNIHLRIHDGIRPFQCNVCYI-------CFTNKSDLNRHSRIHNGIKPFLCSM--CA----KCFSR 246 (1043)
Q Consensus 180 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~-------~f~~~~~l~~H~~~h~~~~~~~C~~--C~----~~f~~ 246 (1043)
.|..|...|-.-..|.+|++..+. .|-+|++ -|.+...|..|.+.-+ |.|.+ |- .+|..
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~ 293 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPY 293 (493)
T ss_pred hhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEecc
Confidence 355566556555556666554331 2444443 3666667777765433 44432 31 45777
Q ss_pred hhhHHhhhcc
Q psy13406 247 KDDLNRHMKI 256 (1043)
Q Consensus 247 ~~~l~~H~~~ 256 (1043)
...|..|+..
T Consensus 294 ~~el~~h~~~ 303 (493)
T COG5236 294 HTELLEHLTR 303 (493)
T ss_pred HHHHHHHHHH
Confidence 7777777643
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.32 E-value=0.0023 Score=44.56 Aligned_cols=24 Identities=25% Similarity=0.599 Sum_probs=8.1
Q ss_pred cCccchhHHhCChHHHHHHHHhcc
Q psy13406 893 FLCSICAKNFSRKDDLNRHMRKHD 916 (1043)
Q Consensus 893 ~~C~~C~~~f~~~~~l~~H~~~h~ 916 (1043)
-.|++|+..+....+|++|+.++|
T Consensus 25 atCP~C~a~~~~srnLrRHle~~H 48 (54)
T PF09237_consen 25 ATCPICGAVIRQSRNLRRHLEIRH 48 (54)
T ss_dssp EE-TTT--EESSHHHHHHHHHHHT
T ss_pred CCCCcchhhccchhhHHHHHHHHh
Confidence 333333333333333333333333
No 44
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.17 E-value=0.0063 Score=42.50 Aligned_cols=32 Identities=31% Similarity=0.556 Sum_probs=20.9
Q ss_pred CCCccccccccccccCHHHHHHHHHHhcCCCc
Q psy13406 711 TPQEFPCSECGKVFTRKTTLKRHVRIHTGIKE 742 (1043)
Q Consensus 711 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~ 742 (1043)
+..+-.|++|+.++....+|++|+.++++.+|
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 34677888888888888888888888777665
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.80 E-value=0.0097 Score=36.38 Aligned_cols=19 Identities=47% Similarity=0.822 Sum_probs=7.1
Q ss_pred CCcccccCCCHHHHHHHHH
Q psy13406 867 CNVCMMKFTNKSDLNRHMR 885 (1043)
Q Consensus 867 C~~C~~~f~~~~~l~~H~~ 885 (1043)
|+.|++.|.+...|..|++
T Consensus 3 C~~C~~~f~~~~~l~~H~~ 21 (26)
T smart00355 3 CPECGKVFKSKSALKEHMR 21 (26)
T ss_pred CCCCcchhCCHHHHHHHHH
Confidence 3333333333333333333
No 46
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.79 E-value=0.0081 Score=36.76 Aligned_cols=24 Identities=46% Similarity=0.862 Sum_probs=16.8
Q ss_pred CCCcccccccCChhHHHHHHhhhc
Q psy13406 837 HQCQVCGNRFKQKSNLNIHMKIHE 860 (1043)
Q Consensus 837 ~~C~~C~~~f~~~~~l~~H~~~h~ 860 (1043)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 467777777777777777777554
No 47
>KOG2785|consensus
Probab=95.79 E-value=0.018 Score=59.83 Aligned_cols=75 Identities=21% Similarity=0.284 Sum_probs=51.8
Q ss_pred cceecCcccccccccccHHHHHHHhhccccccccchhhhhhhhhhHhhhhhccccccccccccccCCCCCCcccccccc-
Q psy13406 417 TVYTCEMCEKTFKYKLPLVKHCKKQHQVDLEMSESEDENKASVNTAKVKKESVEGVSMEGKSLSEFSGPVGNFKCALCE- 495 (1043)
Q Consensus 417 ~~~~C~~C~~~f~~~~~L~~H~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~C~~C~- 495 (1043)
.|-.|-+|++.+.+....+.||...|+.-.+-..-.......+.-...+. ...|.|..|+
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV-------------------~~~~~CL~CN~ 225 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKV-------------------GIGFICLFCNE 225 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHh-------------------ccCceEEEecc
Confidence 45789999999999999999999999875543221111111111111111 2358999999
Q ss_pred --cccceeccccccccc
Q psy13406 496 --KTFVRKQTLDRHVTV 510 (1043)
Q Consensus 496 --k~f~~~~~L~~H~~~ 510 (1043)
+.|.+..+.+.||..
T Consensus 226 ~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 226 LGRPFSSLEAVRAHMRD 242 (390)
T ss_pred ccCcccccHHHHHHHhh
Confidence 999999999999864
No 48
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.60 E-value=0.0081 Score=35.97 Aligned_cols=23 Identities=30% Similarity=0.726 Sum_probs=13.3
Q ss_pred ccCccchhhcCCHHHHHHHHhhhc
Q psy13406 921 FLCSMCAEAFSRKDHLKKHMRKCY 944 (1043)
Q Consensus 921 ~~C~~C~~~f~~~~~l~~H~~~~~ 944 (1043)
|+|+.|+.... +..|..|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 56666666665 666666666654
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.56 E-value=0.0096 Score=35.64 Aligned_cols=23 Identities=52% Similarity=0.674 Sum_probs=12.7
Q ss_pred ccCccchhhcCCHHHHHHHHHHhc
Q psy13406 319 HACSICTESFSQKSNLYIHLKLQH 342 (1043)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~~~ 342 (1043)
|+|+.|++... ...|..|++.+|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666665 556666666655
No 50
>KOG2785|consensus
Probab=95.25 E-value=0.06 Score=56.03 Aligned_cols=68 Identities=24% Similarity=0.500 Sum_probs=44.2
Q ss_pred ccccccccccccCHHHHHHHHHH--hcCCCceeccccccccCCHHHHHHHHHhccCCC----------------C-CCCC
Q psy13406 714 EFPCSECGKVFTRKTTLKRHVRI--HTGIKEFQCWICSKCFMEKSHLNRHLRKHSGVV----------------D-NMEA 774 (1043)
Q Consensus 714 ~~~C~~C~k~F~~~~~L~~H~~~--H~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~----------------~-~~~~ 774 (1043)
.|.|.-|...|.+...-+.|+++ |. .+|++-+..-.++. . ....
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHR-----------------YNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~ 65 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHR-----------------YNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEA 65 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHH-----------------hhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhc
Confidence 48999999999999999999875 32 23333332211110 0 0123
Q ss_pred CCCcccCcCCcccCCHHHHHHHHH
Q psy13406 775 EAPYACSICDRKFTIKGQLSRHLR 798 (1043)
Q Consensus 775 ~~~~~C~~C~~~f~~~~~l~~H~~ 798 (1043)
..++.|.+|.+.|.+......|+.
T Consensus 66 ~~~~~c~~c~k~~~s~~a~~~hl~ 89 (390)
T KOG2785|consen 66 ESVVYCEACNKSFASPKAHENHLK 89 (390)
T ss_pred ccceehHHhhccccChhhHHHHHH
Confidence 456788888888888877777764
No 51
>PRK04860 hypothetical protein; Provisional
Probab=95.24 E-value=0.005 Score=57.45 Aligned_cols=37 Identities=22% Similarity=0.524 Sum_probs=22.0
Q ss_pred ccCCCcccccCCCHHHHHHHHHhhcCCCCcCccchhHHhCC
Q psy13406 864 AHQCNVCMMKFTNKSDLNRHMRKHDGVKPFLCSICAKNFSR 904 (1043)
Q Consensus 864 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 904 (1043)
+|.|. |+. ....+++|.++|+++++|.|..|+..|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEE
Confidence 46665 655 45555666666666666666666665543
No 52
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.17 E-value=0.014 Score=35.42 Aligned_cols=20 Identities=30% Similarity=0.959 Sum_probs=9.9
Q ss_pred ecccCCcccCCHHHHHHHHH
Q psy13406 152 LCSVCSKTFTDKVKFNRHMR 171 (1043)
Q Consensus 152 ~C~~C~~~f~~~~~l~~H~~ 171 (1043)
.|.+|++.|.+...|..|++
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCcCCHHHHHHHHC
Confidence 44455555555555555544
No 53
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.17 E-value=0.015 Score=35.18 Aligned_cols=22 Identities=45% Similarity=1.025 Sum_probs=13.5
Q ss_pred eecCCCCcccCCHHHHHHHHHH
Q psy13406 57 FKCNICPKRYARKNRLTNHLRT 78 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~ 78 (1043)
|.|++|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666543
No 54
>KOG2482|consensus
Probab=95.16 E-value=0.029 Score=56.75 Aligned_cols=133 Identities=23% Similarity=0.384 Sum_probs=82.3
Q ss_pred CCccccccccccc-ChHHHhhhhc-ccC---C------------------CCCCCCcccccccCChhHHHHHHhh--hcC
Q psy13406 807 KPQTCEVCQKTFD-KMAKFKRHLK-VHD---G------------------VRDHQCQVCGNRFKQKSNLNIHMKI--HEG 861 (1043)
Q Consensus 807 ~~~~C~~C~~~f~-~~~~l~~H~~-~h~---~------------------~~~~~C~~C~~~f~~~~~l~~H~~~--h~~ 861 (1043)
....|-.|...+. +++....|+- +|. | --.+.|-+|.+.|+.+..|+.|||. |-.
T Consensus 143 fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrdkntLkeHMrkK~Hrr 222 (423)
T KOG2482|consen 143 FSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRDKNTLKEHMRKKRHRR 222 (423)
T ss_pred eeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCCcHHHHHHHHhccCcc
Confidence 3567999998775 4666777764 332 1 1137899999999999999999964 322
Q ss_pred CCc--------cCCCc--ccccCCCHHHHHHHH--Hhh---------------cCCCC--cCccchhHHhCChHHHHHHH
Q psy13406 862 IKA--------HQCNV--CMMKFTNKSDLNRHM--RKH---------------DGVKP--FLCSICAKNFSRKDDLNRHM 912 (1043)
Q Consensus 862 ~~~--------~~C~~--C~~~f~~~~~l~~H~--~~h---------------~~~~~--~~C~~C~~~f~~~~~l~~H~ 912 (1043)
..| |.=.+ =|++.. ....|. .+- .+..+ ..|-.|...+-+...|..||
T Consensus 223 inPknreYDkfyiINY~ev~ks~t---~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CLfC~~~~en~~~l~eHm 299 (423)
T KOG2482|consen 223 INPKNREYDKFYIINYLEVGKSWT---IVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCLFCTNFYENPVFLFEHM 299 (423)
T ss_pred cCCCccccceEEEEeHhhcCCccc---hhhhhhhhhhhccccccccchhhhhcCCCCccceEEEeeccchhhHHHHHHHH
Confidence 211 11111 111111 111111 111 01122 57888988888888899998
Q ss_pred HhccC---------------------------CCcccCccchhhcCCHHHHHHHHhh
Q psy13406 913 RKHDG---------------------------IKPFLCSMCAEAFSRKDHLKKHMRK 942 (1043)
Q Consensus 913 ~~h~~---------------------------~~~~~C~~C~~~f~~~~~l~~H~~~ 942 (1043)
.+-|. .+.-.|-.|...|.....|..||.-
T Consensus 300 k~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 300 KIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred HHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 75432 1124588999999999999999974
No 55
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.13 E-value=0.0076 Score=70.00 Aligned_cols=149 Identities=30% Similarity=0.485 Sum_probs=88.0
Q ss_pred CCeecCCCcccccCcchHHHHHh--hccCC--CCccCC--CCCccccChhHHHhhhhhccCCCcccCCc--cccccCChh
Q psy13406 177 KPFQCSVCSESFTQRSNLNIHLR--IHDGI--RPFQCN--VCYICFTNKSDLNRHSRIHNGIKPFLCSM--CAKCFSRKD 248 (1043)
Q Consensus 177 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~ 248 (1043)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+..++.+.. |...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 35677777777777777777777 67777 777777 57777777777777777777666666643 333333322
Q ss_pred hHHhhh-----ccccCCcceeCC--CCCCcCCCHHHHHHHHHHhCCCC--CcccCcCccccCCHHHHHHHHHHhcCCCCc
Q psy13406 249 DLNRHM-----KIHEGSKRYLCT--MCPKCFTRKDDLNRHMRNHDGLK--PFHCSVCFESFTQKALLNIHLRIHTNSKPH 319 (1043)
Q Consensus 249 ~l~~H~-----~~h~~~~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~ 319 (1043)
.-..+. ..-.....+.+. .|...+.....+..|...|.... .+.+..|.+.|.....+..|++.|....++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 211111 111122233333 25556666666666666665544 455566777777777777777777666555
Q ss_pred cCccch
Q psy13406 320 ACSICT 325 (1043)
Q Consensus 320 ~C~~C~ 325 (1043)
.|..+.
T Consensus 448 ~~~~~~ 453 (467)
T COG5048 448 LCSILK 453 (467)
T ss_pred eecccc
Confidence 554443
No 56
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.08 E-value=0.0086 Score=69.56 Aligned_cols=149 Identities=24% Similarity=0.433 Sum_probs=96.5
Q ss_pred cceecccCCcccCCHHHHHHHHH--hcCCC--CCeecC--CCcccccCcchHHHHHhhccCCCCccCCC--CCccccChh
Q psy13406 149 KVFLCSVCSKTFTDKVKFNRHMR--AHEGI--KPFQCS--VCSESFTQRSNLNIHLRIHDGIRPFQCNV--CYICFTNKS 220 (1043)
Q Consensus 149 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~--C~~~f~~~~ 220 (1043)
.++.|..|...|.....|.+|.+ .|.++ +++.|+ .|++.|.....+..|...|.+..++.|.. |...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 35777888888888888888877 67777 788887 68888888888888888887777766655 333333322
Q ss_pred HHHhh-----hhhccCCCcccCC--ccccccCChhhHHhhhccccCCc--ceeCCCCCCcCCCHHHHHHHHHHhCCCCCc
Q psy13406 221 DLNRH-----SRIHNGIKPFLCS--MCAKCFSRKDDLNRHMKIHEGSK--RYLCTMCPKCFTRKDDLNRHMRNHDGLKPF 291 (1043)
Q Consensus 221 ~l~~H-----~~~h~~~~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~ 291 (1043)
.-..+ ...-.....+.+. .|...+.....+..|...|.... .+.+..|.+.|.....+..|++.|....++
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPL 447 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccccccccCCce
Confidence 21111 1112223344443 36666677777777776665544 456677888888888888888777766666
Q ss_pred ccCcCc
Q psy13406 292 HCSVCF 297 (1043)
Q Consensus 292 ~C~~C~ 297 (1043)
.|..+.
T Consensus 448 ~~~~~~ 453 (467)
T COG5048 448 LCSILK 453 (467)
T ss_pred eecccc
Confidence 554443
No 57
>PRK04860 hypothetical protein; Provisional
Probab=94.70 E-value=0.021 Score=53.40 Aligned_cols=35 Identities=23% Similarity=0.638 Sum_probs=17.6
Q ss_pred eecCCCcccccCcchHHHHHhhccCCCCccCCCCCcccc
Q psy13406 179 FQCSVCSESFTQRSNLNIHLRIHDGIRPFQCNVCYICFT 217 (1043)
Q Consensus 179 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~ 217 (1043)
|.|. |+. ....+.+|.++|.++++|.|..|+..|.
T Consensus 120 Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~ 154 (160)
T PRK04860 120 YRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLV 154 (160)
T ss_pred EEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeE
Confidence 5554 544 3444455555555555555555555443
No 58
>KOG2482|consensus
Probab=94.39 E-value=0.085 Score=53.52 Aligned_cols=73 Identities=23% Similarity=0.409 Sum_probs=41.7
Q ss_pred HHHHHHHHHHhcC-CCceecccccccc-CCHHHHHHHHHhccCCCCCCCC---------------CCCcccCcCCcccCC
Q psy13406 727 KTTLKRHVRIHTG-IKEFQCWICSKCF-MEKSHLNRHLRKHSGVVDNMEA---------------EAPYACSICDRKFTI 789 (1043)
Q Consensus 727 ~~~L~~H~~~H~~-~~~~~C~~C~~~f-~~~~~L~~H~~~h~~~~~~~~~---------------~~~~~C~~C~~~f~~ 789 (1043)
+..|.+|++.-.+ ....+|-+|...+ .+++....|+-.-|+..-.... -..+.|-.|.+.|..
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrd 207 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRD 207 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCC
Confidence 3456666654422 2346799998776 4677788887655543221111 123556666666666
Q ss_pred HHHHHHHHHH
Q psy13406 790 KGQLSRHLRA 799 (1043)
Q Consensus 790 ~~~l~~H~~~ 799 (1043)
+..|+.|||.
T Consensus 208 kntLkeHMrk 217 (423)
T KOG2482|consen 208 KNTLKEHMRK 217 (423)
T ss_pred cHHHHHHHHh
Confidence 6666666653
No 59
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.78 E-value=0.034 Score=34.39 Aligned_cols=22 Identities=41% Similarity=0.830 Sum_probs=13.8
Q ss_pred ccCCCCCcccCchHHHHHHhhh
Q psy13406 95 FTCSQCPKTFVDKWHLNRHLKS 116 (1043)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~ 116 (1043)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666653
No 60
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.37 E-value=0.051 Score=33.60 Aligned_cols=21 Identities=29% Similarity=0.812 Sum_probs=11.3
Q ss_pred ccCCCCCccccChhHHHhhhh
Q psy13406 207 FQCNVCYICFTNKSDLNRHSR 227 (1043)
Q Consensus 207 ~~C~~C~~~f~~~~~l~~H~~ 227 (1043)
|.|.+|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 445555555555555555554
No 61
>KOG4173|consensus
Probab=92.13 E-value=0.075 Score=49.61 Aligned_cols=81 Identities=26% Similarity=0.565 Sum_probs=59.3
Q ss_pred cccccceecCC--CCcccCCHHHHHHHHHHhccCCCCCCCCCCCCcccCCCCCcccCchHHHHHHhhh-c---------c
Q psy13406 51 KKMVKKFKCNI--CPKRYARKNRLTNHLRTHEAGSGDEKSEGGEGSFTCSQCPKTFVDKWHLNRHLKS-H---------S 118 (1043)
Q Consensus 51 ~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~-h---------~ 118 (1043)
+.....|.|.+ |...|.....+..|..+-+. ..|..|.+.|.+...|..|+.. | .
T Consensus 74 ~~~~~~~~cqvagc~~~~d~lD~~E~hY~~~h~-------------~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veR 140 (253)
T KOG4173|consen 74 KPRVPAFACQVAGCCQVFDALDDYEHHYHTLHG-------------NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVER 140 (253)
T ss_pred ccccccccccccchHHHHhhhhhHHHhhhhccc-------------chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHc
Confidence 33456788987 88889888888888754332 3699999999999999999753 2 2
Q ss_pred CCCcccc--ccCCccccCHHHHHHHHHh
Q psy13406 119 ENKVFRC--EQCRFDFYVKREYNRHMKI 144 (1043)
Q Consensus 119 ~~~~~~C--~~C~~~f~~~~~l~~H~~~ 144 (1043)
|...|.| +.|+..|.+...-+.||..
T Consensus 141 G~dMy~ClvEgCt~KFkT~r~RkdH~I~ 168 (253)
T KOG4173|consen 141 GQDMYQCLVEGCTEKFKTSRDRKDHMIR 168 (253)
T ss_pred CccHHHHHHHhhhhhhhhhhhhhhHHHH
Confidence 4556777 5677777777777777653
No 62
>KOG4173|consensus
Probab=91.42 E-value=0.2 Score=46.88 Aligned_cols=77 Identities=29% Similarity=0.624 Sum_probs=50.7
Q ss_pred CCCCCCcc--cccccCChhHHHHHHhhhcCCCccCCCcccccCCCHHHHHHHHHh-hc---------CCCCcCccc--hh
Q psy13406 834 VRDHQCQV--CGNRFKQKSNLNIHMKIHEGIKAHQCNVCMMKFTNKSDLNRHMRK-HD---------GVKPFLCSI--CA 899 (1043)
Q Consensus 834 ~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~---------~~~~~~C~~--C~ 899 (1043)
.+.|.|++ |.+.|........|..+-++ -.|..|.+.|.+...|..|+.. |. |..-|.|-+ |+
T Consensus 77 ~~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 77 VPAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred cccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 46688987 88888888888888755443 3588888888888888888753 32 222244543 55
Q ss_pred HHhCChHHHHHHHH
Q psy13406 900 KNFSRKDDLNRHMR 913 (1043)
Q Consensus 900 ~~f~~~~~l~~H~~ 913 (1043)
..|.+..+-+.||-
T Consensus 154 ~KFkT~r~RkdH~I 167 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMI 167 (253)
T ss_pred hhhhhhhhhhhHHH
Confidence 55555555555543
No 63
>KOG2893|consensus
Probab=90.90 E-value=0.083 Score=50.39 Aligned_cols=44 Identities=27% Similarity=0.665 Sum_probs=31.6
Q ss_pred ecccCCcccCCHHHHHHHHHhcCCCCCeecCCCcccccCcchHHHHHh
Q psy13406 152 LCSVCSKTFTDKVKFNRHMRAHEGIKPFQCSVCSESFTQRSNLNIHLR 199 (1043)
Q Consensus 152 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 199 (1043)
.|.+|++.|....-|..|++.- -|+|.+|.+...+--.|..|-.
T Consensus 12 wcwycnrefddekiliqhqkak----hfkchichkkl~sgpglsihcm 55 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAK----HFKCHICHKKLFSGPGLSIHCM 55 (341)
T ss_pred eeeecccccchhhhhhhhhhhc----cceeeeehhhhccCCCceeehh
Confidence 4788888888887777777653 3778888777777777766643
No 64
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.63 E-value=0.24 Score=29.84 Aligned_cols=21 Identities=38% Similarity=0.971 Sum_probs=15.0
Q ss_pred cccccccccccCHHHHHHHHHH
Q psy13406 715 FPCSECGKVFTRKTTLKRHVRI 736 (1043)
Q Consensus 715 ~~C~~C~k~F~~~~~L~~H~~~ 736 (1043)
.+|+.|++.| ....|.+|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4677888887 67777777653
No 65
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.41 E-value=0.22 Score=30.04 Aligned_cols=21 Identities=33% Similarity=0.781 Sum_probs=13.6
Q ss_pred ccCccchhhcCCHHHHHHHHhh
Q psy13406 921 FLCSMCAEAFSRKDHLKKHMRK 942 (1043)
Q Consensus 921 ~~C~~C~~~f~~~~~l~~H~~~ 942 (1043)
..|++||+.| ....|..|++.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 3577777777 56666666654
No 66
>KOG2893|consensus
Probab=90.26 E-value=0.08 Score=50.50 Aligned_cols=50 Identities=26% Similarity=0.579 Sum_probs=41.0
Q ss_pred ccccccccccCHHHHHHHHHHhcCCCceeccccccccCCHHHHHHH-HHhccCCC
Q psy13406 716 PCSECGKVFTRKTTLKRHVRIHTGIKEFQCWICSKCFMEKSHLNRH-LRKHSGVV 769 (1043)
Q Consensus 716 ~C~~C~k~F~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H-~~~h~~~~ 769 (1043)
-|.+|++.|.....|..|++..+ |+|.+|.+...+-..|..| |.+|...+
T Consensus 12 wcwycnrefddekiliqhqkakh----fkchichkkl~sgpglsihcmqvhketi 62 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAKH----FKCHICHKKLFSGPGLSIHCMQVHKETI 62 (341)
T ss_pred eeeecccccchhhhhhhhhhhcc----ceeeeehhhhccCCCceeehhhhhhhhh
Confidence 59999999999999999987543 9999999998888888888 66665443
No 67
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.78 E-value=0.62 Score=40.00 Aligned_cols=24 Identities=25% Similarity=0.706 Sum_probs=13.7
Q ss_pred CeecCCCcccccCcchHHHHHhhc
Q psy13406 178 PFQCSVCSESFTQRSNLNIHLRIH 201 (1043)
Q Consensus 178 ~~~C~~C~~~f~~~~~l~~H~~~h 201 (1043)
.|+|+.|...|-..-++-.|...|
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred ceeCCCCCCccccccchhhhhhcc
Confidence 456666666665555555555554
No 68
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=87.01 E-value=0.42 Score=31.72 Aligned_cols=23 Identities=22% Similarity=0.697 Sum_probs=19.9
Q ss_pred cccCccchhhcCCHHHHHHHHhh
Q psy13406 920 PFLCSMCAEAFSRKDHLKKHMRK 942 (1043)
Q Consensus 920 ~~~C~~C~~~f~~~~~l~~H~~~ 942 (1043)
+|.|.+|++.|.....+..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 58899999999999999999864
No 69
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.39 E-value=0.53 Score=31.18 Aligned_cols=22 Identities=41% Similarity=0.804 Sum_probs=14.1
Q ss_pred cccCCCCCcccCchHHHHHHhh
Q psy13406 94 SFTCSQCPKTFVDKWHLNRHLK 115 (1043)
Q Consensus 94 ~~~C~~C~~~f~~~~~l~~H~~ 115 (1043)
+|.|.+|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3566666666666666666664
No 70
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=85.57 E-value=0.45 Score=33.99 Aligned_cols=30 Identities=30% Similarity=0.643 Sum_probs=22.1
Q ss_pred cCCCcccCccchhhcCCHHHHHHHHhhhcC
Q psy13406 916 DGIKPFLCSMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 916 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
.|+.-+.|+.||..|........|+.+.|+
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 456667777777777777777777777665
No 71
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=83.03 E-value=1.2 Score=38.27 Aligned_cols=20 Identities=30% Similarity=0.730 Sum_probs=10.4
Q ss_pred CCCcccCcCCcccCCHHHHH
Q psy13406 775 EAPYACSICDRKFTIKGQLS 794 (1043)
Q Consensus 775 ~~~~~C~~C~~~f~~~~~l~ 794 (1043)
+-|..|++|+....+...|.
T Consensus 13 ~LP~~CpiCgLtLVss~HLA 32 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLA 32 (112)
T ss_pred CCCCcCCcCCCEEeccchHH
Confidence 34555566655555444443
No 72
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=82.19 E-value=1.1 Score=29.22 Aligned_cols=8 Identities=38% Similarity=0.883 Sum_probs=3.0
Q ss_pred ecCCCccc
Q psy13406 180 QCSVCSES 187 (1043)
Q Consensus 180 ~C~~C~~~ 187 (1043)
+|.+||..
T Consensus 3 ~C~~CGy~ 10 (33)
T cd00350 3 VCPVCGYI 10 (33)
T ss_pred ECCCCCCE
Confidence 33333333
No 73
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=81.87 E-value=0.84 Score=29.76 Aligned_cols=12 Identities=25% Similarity=0.800 Sum_probs=7.2
Q ss_pred ccCCCCCccccC
Q psy13406 207 FQCNVCYICFTN 218 (1043)
Q Consensus 207 ~~C~~C~~~f~~ 218 (1043)
|+|.+||..+..
T Consensus 2 ~~C~~CGy~y~~ 13 (33)
T cd00350 2 YVCPVCGYIYDG 13 (33)
T ss_pred EECCCCCCEECC
Confidence 566666666543
No 74
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=80.98 E-value=0.94 Score=32.44 Aligned_cols=30 Identities=37% Similarity=0.701 Sum_probs=17.8
Q ss_pred cCCCCcCccchhHHhCChHHHHHHHHhccC
Q psy13406 888 DGVKPFLCSICAKNFSRKDDLNRHMRKHDG 917 (1043)
Q Consensus 888 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 917 (1043)
.||--+.|+.|+..|....+..+|+..-++
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 345556666666666666666666654443
No 75
>KOG4167|consensus
Probab=80.36 E-value=0.44 Score=53.88 Aligned_cols=27 Identities=37% Similarity=0.741 Sum_probs=22.9
Q ss_pred CCcccCCCCCcccCchHHHHHHhhhcc
Q psy13406 92 EGSFTCSQCPKTFVDKWHLNRHLKSHS 118 (1043)
Q Consensus 92 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 118 (1043)
.+-|.|.+|+++|....+++.||++|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 456889999999999889999998885
No 76
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=79.39 E-value=2.1 Score=37.57 Aligned_cols=25 Identities=32% Similarity=0.611 Sum_probs=20.0
Q ss_pred ccC----ccchhhcCCHHHHHHHHhhhcC
Q psy13406 921 FLC----SMCAEAFSRKDHLKKHMRKCYG 945 (1043)
Q Consensus 921 ~~C----~~C~~~f~~~~~l~~H~~~~~~ 945 (1043)
|.| ..|++.+.+...++.|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 778 7888888888888888887775
No 77
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=79.27 E-value=1.8 Score=38.01 Aligned_cols=25 Identities=20% Similarity=0.466 Sum_probs=18.0
Q ss_pred CcccCcCCcccCCHHHHHHHHHHhcC
Q psy13406 777 PYACSICDRKFTIKGQLSRHLRAHEN 802 (1043)
Q Consensus 777 ~~~C~~C~~~f~~~~~l~~H~~~h~~ 802 (1043)
...|..|+....- ..+..|++..|.
T Consensus 11 vlIC~~C~~av~~-~~v~~HL~~~H~ 35 (109)
T PF12013_consen 11 VLICRQCQYAVQP-SEVESHLRKRHH 35 (109)
T ss_pred EEEeCCCCcccCc-hHHHHHHHHhcc
Confidence 3568888877765 788888885543
No 78
>KOG0717|consensus
Probab=77.58 E-value=1.4 Score=47.53 Aligned_cols=22 Identities=27% Similarity=0.709 Sum_probs=18.9
Q ss_pred eecCCCCcccCCHHHHHHHHHH
Q psy13406 57 FKCNICPKRYARKNRLTNHLRT 78 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~ 78 (1043)
+-|.+|+++|.+...|.+|..+
T Consensus 293 lyC~vCnKsFKseKq~kNHEnS 314 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENS 314 (508)
T ss_pred eEEeeccccccchHHHHhhHHH
Confidence 7799999999999999999753
No 79
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=72.48 E-value=0.99 Score=45.18 Aligned_cols=24 Identities=29% Similarity=0.641 Sum_probs=16.0
Q ss_pred ccceecCCCCcccCCHHHHHHHHH
Q psy13406 54 VKKFKCNICPKRYARKNRLTNHLR 77 (1043)
Q Consensus 54 ~~~~~C~~C~~~f~~~~~l~~H~~ 77 (1043)
.|.+.|++|++.|.++.-.....+
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCce
Confidence 356788888888887654444443
No 80
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=70.33 E-value=3.5 Score=35.73 Aligned_cols=31 Identities=19% Similarity=0.578 Sum_probs=20.2
Q ss_pred ceecCCCCcccCCHHHHHHHHHHhccCCCCCCCCCCCCcccCCCCCcccCch
Q psy13406 56 KFKCNICPKRYARKNRLTNHLRTHEAGSGDEKSEGGEGSFTCSQCPKTFVDK 107 (1043)
Q Consensus 56 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~ 107 (1043)
+..|+.||+.|-.. +..|..|+.||..|.-.
T Consensus 9 KR~Cp~CG~kFYDL---------------------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDL---------------------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccC---------------------CCCCccCCCCCCccCcc
Confidence 45677777777543 34566777777777644
No 81
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=65.35 E-value=5.3 Score=26.27 Aligned_cols=8 Identities=38% Similarity=0.883 Sum_probs=3.1
Q ss_pred ecCCCccc
Q psy13406 180 QCSVCSES 187 (1043)
Q Consensus 180 ~C~~C~~~ 187 (1043)
+|.+||..
T Consensus 4 ~C~~CG~i 11 (34)
T cd00729 4 VCPVCGYI 11 (34)
T ss_pred ECCCCCCE
Confidence 34444433
No 82
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=62.49 E-value=2.1 Score=42.86 Aligned_cols=20 Identities=30% Similarity=0.477 Sum_probs=10.6
Q ss_pred CcccccccccccChHHHhhh
Q psy13406 808 PQTCEVCQKTFDKMAKFKRH 827 (1043)
Q Consensus 808 ~~~C~~C~~~f~~~~~l~~H 827 (1043)
.+.|++|++.|.+..-+...
T Consensus 5 ~~~CPvC~~~F~~~~vrs~~ 24 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRSGK 24 (214)
T ss_pred ceECCCCCCeeeeeEEEcCC
Confidence 45566666666555443333
No 83
>KOG2186|consensus
Probab=62.43 E-value=3.7 Score=40.46 Aligned_cols=46 Identities=28% Similarity=0.679 Sum_probs=26.2
Q ss_pred eecCCCCcccCCHHHHHHHHHHhccCCCCCCCCCCCCcccCCCCCcccCchHHHHHHhh
Q psy13406 57 FKCNICPKRYARKNRLTNHLRTHEAGSGDEKSEGGEGSFTCSQCPKTFVDKWHLNRHLK 115 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 115 (1043)
|.|.+||.+.... .+.+|+...+ ...|.|-.|+++|.. .+...|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCr-----------n~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCR-----------NAYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhcc-----------CCeeEEeeccccccc-chhhhhhh
Confidence 6677777665543 4445665554 134666666666655 55555544
No 84
>KOG2186|consensus
Probab=61.86 E-value=3.9 Score=40.37 Aligned_cols=46 Identities=26% Similarity=0.657 Sum_probs=20.8
Q ss_pred eecccCCcccCCHHHHHHHHHhcCCCCCeecCCCcccccCcchHHHHHh
Q psy13406 151 FLCSVCSKTFTDKVKFNRHMRAHEGIKPFQCSVCSESFTQRSNLNIHLR 199 (1043)
Q Consensus 151 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 199 (1043)
|.|.+||.+.. +..+.+|+..-++ .-|.|-.|+++|.. ..+..|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 44555554443 2344445444444 23555555555544 34444443
No 85
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=60.75 E-value=4.3 Score=27.45 Aligned_cols=34 Identities=15% Similarity=0.535 Sum_probs=20.0
Q ss_pred eecCCCCcccCCHHHHHHHHHHhccCCCCCCCCCCCCcccCCCCCcccC
Q psy13406 57 FKCNICPKRYARKNRLTNHLRTHEAGSGDEKSEGGEGSFTCSQCPKTFV 105 (1043)
Q Consensus 57 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~~~C~~C~~~f~ 105 (1043)
+.|+.|+..|.-...... .....+.|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~~~~~~---------------~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLG---------------ANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeCHHHcC---------------CCCCEEECCCCCCEEE
Confidence 568888887765443211 0122467888877764
No 86
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.26 E-value=8.1 Score=33.52 Aligned_cols=9 Identities=33% Similarity=1.051 Sum_probs=3.3
Q ss_pred cCCcccccc
Q psy13406 236 LCSMCAKCF 244 (1043)
Q Consensus 236 ~C~~C~~~f 244 (1043)
.|+.||..|
T Consensus 28 vCP~CG~~~ 36 (108)
T PF09538_consen 28 VCPKCGTEF 36 (108)
T ss_pred cCCCCCCcc
Confidence 333333333
No 87
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.11 E-value=8.5 Score=33.83 Aligned_cols=16 Identities=6% Similarity=0.025 Sum_probs=9.8
Q ss_pred CCcccCCCCCcccCch
Q psy13406 92 EGSFTCSQCPKTFVDK 107 (1043)
Q Consensus 92 ~~~~~C~~C~~~f~~~ 107 (1043)
..|..|+.||..|.-.
T Consensus 24 k~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 24 RRPAVSPYTGEQFPPE 39 (129)
T ss_pred CCCccCCCcCCccCcc
Confidence 3456677777666544
No 88
>KOG2636|consensus
Probab=54.92 E-value=7.5 Score=41.90 Aligned_cols=20 Identities=25% Similarity=0.781 Sum_probs=10.5
Q ss_pred ceecCCCC-cccCCHHHHHHH
Q psy13406 56 KFKCNICP-KRYARKNRLTNH 75 (1043)
Q Consensus 56 ~~~C~~C~-~~f~~~~~l~~H 75 (1043)
.|.|.||| .+|.-+..+.+|
T Consensus 401 ey~CEICGNy~Y~GrkaF~RH 421 (497)
T KOG2636|consen 401 EYNCEICGNYVYKGRKAFDRH 421 (497)
T ss_pred ccceeeccCccccCcHHHHHH
Confidence 45555555 555555555555
No 89
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=53.44 E-value=8.5 Score=24.86 Aligned_cols=11 Identities=18% Similarity=0.637 Sum_probs=7.1
Q ss_pred eecCCCCcccC
Q psy13406 57 FKCNICPKRYA 67 (1043)
Q Consensus 57 ~~C~~C~~~f~ 67 (1043)
|.|..||..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 56777777664
No 90
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=52.65 E-value=10 Score=26.70 Aligned_cols=10 Identities=20% Similarity=0.976 Sum_probs=4.2
Q ss_pred eecccCCccc
Q psy13406 151 FLCSVCSKTF 160 (1043)
Q Consensus 151 ~~C~~C~~~f 160 (1043)
|.|..|+..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 3444444443
No 91
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=52.48 E-value=10 Score=26.81 Aligned_cols=23 Identities=26% Similarity=0.658 Sum_probs=10.5
Q ss_pred cccCccchhhcCCH----HHHHHHHhh
Q psy13406 920 PFLCSMCAEAFSRK----DHLKKHMRK 942 (1043)
Q Consensus 920 ~~~C~~C~~~f~~~----~~l~~H~~~ 942 (1043)
..+|.+|++.+... ..|..|++.
T Consensus 16 ~a~C~~C~~~~~~~~~~ts~l~~HL~~ 42 (45)
T PF02892_consen 16 KAKCKYCGKVIKYSSGGTSNLKRHLKK 42 (45)
T ss_dssp -EEETTTTEE-----SSTHHHHHHHHH
T ss_pred eEEeCCCCeEEeeCCCcHHHHHHhhhh
Confidence 34566666665553 566666643
No 92
>KOG1280|consensus
Probab=52.48 E-value=60 Score=34.12 Aligned_cols=25 Identities=16% Similarity=0.303 Sum_probs=13.8
Q ss_pred CCCCCCcccccccCChhHHHHHHhh
Q psy13406 834 VRDHQCQVCGNRFKQKSNLNIHMKI 858 (1043)
Q Consensus 834 ~~~~~C~~C~~~f~~~~~l~~H~~~ 858 (1043)
...|.|++|++.=.+...|..|+..
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s 101 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLS 101 (381)
T ss_pred cccccCCcccccccchhHHHHHhhh
Confidence 3345666666555555555555533
No 93
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=52.09 E-value=5.5 Score=38.52 Aligned_cols=22 Identities=27% Similarity=0.839 Sum_probs=0.0
Q ss_pred cceecCCCC-cccCCHHHHHHHH
Q psy13406 55 KKFKCNICP-KRYARKNRLTNHL 76 (1043)
Q Consensus 55 ~~~~C~~C~-~~f~~~~~l~~H~ 76 (1043)
+.|.|.+|| .+|.-+..+.+|.
T Consensus 100 ~ey~CEICGN~~Y~GrkaFekHF 122 (196)
T PF11931_consen 100 VEYKCEICGNQSYKGRKAFEKHF 122 (196)
T ss_dssp -----------------------
T ss_pred CeeeeEeCCCcceecHHHHHHhc
Confidence 457777775 4455666666664
No 94
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=50.42 E-value=9.3 Score=38.77 Aligned_cols=23 Identities=30% Similarity=0.837 Sum_probs=12.6
Q ss_pred eecCCCcccccCcchHHHHHhhc
Q psy13406 179 FQCSVCSESFTQRSNLNIHLRIH 201 (1043)
Q Consensus 179 ~~C~~C~~~f~~~~~l~~H~~~h 201 (1043)
|+|+.|...|-.--+.-.|...|
T Consensus 389 Y~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 389 YQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred eechhhhhhhhhhhHHHHHHHHh
Confidence 55555555555555555555444
No 95
>KOG4377|consensus
Probab=49.18 E-value=6.8 Score=41.60 Aligned_cols=21 Identities=19% Similarity=0.347 Sum_probs=17.9
Q ss_pred chhhcCCHHHHHHHHhhhcCC
Q psy13406 926 CAEAFSRKDHLKKHMRKCYGP 946 (1043)
Q Consensus 926 C~~~f~~~~~l~~H~~~~~~~ 946 (1043)
|+..|...+.+..|.|+|-..
T Consensus 409 c~~tl~s~sqm~shkrkheRq 429 (480)
T KOG4377|consen 409 CEATLYSVSQMASHKRKHERQ 429 (480)
T ss_pred CceEEEehhhhhhhhhhhhhh
Confidence 889999999999999887554
No 96
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=49.12 E-value=10 Score=35.69 Aligned_cols=10 Identities=30% Similarity=0.670 Sum_probs=4.5
Q ss_pred CCCccCCCCC
Q psy13406 204 IRPFQCNVCY 213 (1043)
Q Consensus 204 ~~~~~C~~C~ 213 (1043)
+.|-+|++|+
T Consensus 147 e~P~~CPiCg 156 (166)
T COG1592 147 EAPEVCPICG 156 (166)
T ss_pred CCCCcCCCCC
Confidence 4444444444
No 97
>PF15269 zf-C2H2_7: Zinc-finger
Probab=48.78 E-value=13 Score=25.38 Aligned_cols=23 Identities=35% Similarity=0.710 Sum_probs=19.3
Q ss_pred ceeCCCCCCcCCCHHHHHHHHHH
Q psy13406 262 RYLCTMCPKCFTRKDDLNRHMRN 284 (1043)
Q Consensus 262 ~~~C~~C~~~f~~~~~l~~H~~~ 284 (1043)
.|+|-.|+.+...++.|-.||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 37899999988888999888874
No 98
>PF15269 zf-C2H2_7: Zinc-finger
Probab=48.70 E-value=11 Score=25.78 Aligned_cols=22 Identities=32% Similarity=0.608 Sum_probs=19.4
Q ss_pred ccCccchhhcCCHHHHHHHHHH
Q psy13406 319 HACSICTESFSQKSNLYIHLKL 340 (1043)
Q Consensus 319 ~~C~~C~~~f~~~~~l~~H~~~ 340 (1043)
|+|-.|..+...++.|..||+-
T Consensus 21 ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCCcccchHHHHHHHHHH
Confidence 7798999999999999999975
No 99
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=47.69 E-value=9.9 Score=27.91 Aligned_cols=11 Identities=27% Similarity=1.171 Sum_probs=4.9
Q ss_pred eecCCCccccc
Q psy13406 179 FQCSVCSESFT 189 (1043)
Q Consensus 179 ~~C~~C~~~f~ 189 (1043)
|+|..|+..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 44444444444
No 100
>KOG2807|consensus
Probab=47.37 E-value=27 Score=36.17 Aligned_cols=24 Identities=25% Similarity=0.606 Sum_probs=14.2
Q ss_pred CccCCCCCccccChhHHHhhhhhc
Q psy13406 206 PFQCNVCYICFTNKSDLNRHSRIH 229 (1043)
Q Consensus 206 ~~~C~~C~~~f~~~~~l~~H~~~h 229 (1043)
.|+|..|...|-.--+.-.|...|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred cEEchhccceeeccchHHHHhhhh
Confidence 466666666666555555565544
No 101
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=47.30 E-value=13 Score=33.55 Aligned_cols=27 Identities=48% Similarity=0.920 Sum_probs=16.8
Q ss_pred CccccccccccccCHHHHHHHHHHhcCCCc
Q psy13406 713 QEFPCSECGKVFTRKTTLKRHVRIHTGIKE 742 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~ 742 (1043)
....|-+||+.|.. |++|++.|+|..|
T Consensus 71 d~i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 71 DYIICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp S-EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred CeeEEccCCcccch---HHHHHHHccCCCH
Confidence 45789999999865 5999999987654
No 102
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=47.22 E-value=17 Score=22.21 Aligned_cols=19 Identities=26% Similarity=0.604 Sum_probs=11.7
Q ss_pred ecccCCcccCCHHHHHHHHH
Q psy13406 152 LCSVCSKTFTDKVKFNRHMR 171 (1043)
Q Consensus 152 ~C~~C~~~f~~~~~l~~H~~ 171 (1043)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 466776666 4556666654
No 103
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=47.16 E-value=8.8 Score=34.60 Aligned_cols=25 Identities=44% Similarity=1.066 Sum_probs=15.6
Q ss_pred cCccchhHHhCChHHHHHHHHhccCCCc
Q psy13406 893 FLCSICAKNFSRKDDLNRHMRKHDGIKP 920 (1043)
Q Consensus 893 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~ 920 (1043)
..|-+||+.|.. |++|++.|+|..|
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred eEEccCCcccch---HHHHHHHccCCCH
Confidence 568889988876 5889988888643
No 104
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=47.14 E-value=14 Score=22.54 Aligned_cols=8 Identities=38% Similarity=1.011 Sum_probs=3.2
Q ss_pred CCcccccc
Q psy13406 237 CSMCAKCF 244 (1043)
Q Consensus 237 C~~C~~~f 244 (1043)
|+.||..|
T Consensus 17 Cp~CG~~F 24 (26)
T PF10571_consen 17 CPHCGYDF 24 (26)
T ss_pred CCCCCCCC
Confidence 44444433
No 105
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=47.09 E-value=16 Score=24.43 Aligned_cols=6 Identities=50% Similarity=1.204 Sum_probs=2.2
Q ss_pred ecccCC
Q psy13406 152 LCSVCS 157 (1043)
Q Consensus 152 ~C~~C~ 157 (1043)
+|+.|+
T Consensus 27 ~C~~C~ 32 (36)
T PF13717_consen 27 RCSKCG 32 (36)
T ss_pred ECCCCC
Confidence 333333
No 106
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=47.04 E-value=8.1 Score=36.02 Aligned_cols=12 Identities=42% Similarity=0.697 Sum_probs=5.7
Q ss_pred eeCCCCCCcCCC
Q psy13406 263 YLCTMCPKCFTR 274 (1043)
Q Consensus 263 ~~C~~C~~~f~~ 274 (1043)
++|+.||++|..
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 444444444443
No 107
>KOG2807|consensus
Probab=46.44 E-value=17 Score=37.52 Aligned_cols=24 Identities=29% Similarity=0.617 Sum_probs=16.0
Q ss_pred CCCCcccccccCChhHHHHHHhhh
Q psy13406 836 DHQCQVCGNRFKQKSNLNIHMKIH 859 (1043)
Q Consensus 836 ~~~C~~C~~~f~~~~~l~~H~~~h 859 (1043)
.|.|..|...|-..-+.-.|-..|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred cEEchhccceeeccchHHHHhhhh
Confidence 467777777776666666666554
No 108
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=46.01 E-value=22 Score=33.57 Aligned_cols=20 Identities=20% Similarity=0.323 Sum_probs=10.6
Q ss_pred CcceecccCCcccCCHHHHH
Q psy13406 148 VKVFLCSVCSKTFTDKVKFN 167 (1043)
Q Consensus 148 ~~~~~C~~C~~~f~~~~~l~ 167 (1043)
..-|.|+.|+..|+...++.
T Consensus 107 ~~~Y~Cp~c~~r~tf~eA~~ 126 (158)
T TIGR00373 107 NMFFICPNMCVRFTFNEAME 126 (158)
T ss_pred CCeEECCCCCcEeeHHHHHH
Confidence 33455555555555555553
No 109
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=46.00 E-value=18 Score=34.16 Aligned_cols=18 Identities=17% Similarity=0.333 Sum_probs=8.5
Q ss_pred CccccccCCccccCHHHH
Q psy13406 121 KVFRCEQCRFDFYVKREY 138 (1043)
Q Consensus 121 ~~~~C~~C~~~f~~~~~l 138 (1043)
.-|.|+.|+..|.....+
T Consensus 108 ~~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 108 MFFICPNMCVRFTFNEAM 125 (158)
T ss_pred CeEECCCCCcEeeHHHHH
Confidence 334455555554444444
No 110
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=45.59 E-value=11 Score=35.45 Aligned_cols=11 Identities=36% Similarity=1.054 Sum_probs=6.3
Q ss_pred CCCCCCCcccc
Q psy13406 833 GVRDHQCQVCG 843 (1043)
Q Consensus 833 ~~~~~~C~~C~ 843 (1043)
|+.|-+|++||
T Consensus 146 ge~P~~CPiCg 156 (166)
T COG1592 146 GEAPEVCPICG 156 (166)
T ss_pred CCCCCcCCCCC
Confidence 45555666665
No 111
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=44.41 E-value=7.5 Score=36.24 Aligned_cols=11 Identities=36% Similarity=0.782 Sum_probs=4.5
Q ss_pred ccCCCCCcccC
Q psy13406 95 FTCSQCPKTFV 105 (1043)
Q Consensus 95 ~~C~~C~~~f~ 105 (1043)
++|+.|+++|.
T Consensus 29 ~~c~~c~~~f~ 39 (154)
T PRK00464 29 RECLACGKRFT 39 (154)
T ss_pred eeccccCCcce
Confidence 34444444443
No 112
>KOG2071|consensus
Probab=44.23 E-value=17 Score=41.35 Aligned_cols=25 Identities=28% Similarity=0.777 Sum_probs=17.0
Q ss_pred CCcccccccccccChHHHhhhhccc
Q psy13406 807 KPQTCEVCQKTFDKMAKFKRHLKVH 831 (1043)
Q Consensus 807 ~~~~C~~C~~~f~~~~~l~~H~~~h 831 (1043)
.+..|..||..|........||.+|
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIH 441 (579)
T ss_pred CcchhcccccccccchhhhhHhhhh
Confidence 4566777777777776666666555
No 113
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=44.00 E-value=29 Score=32.43 Aligned_cols=13 Identities=23% Similarity=0.682 Sum_probs=7.2
Q ss_pred ceecccCCcccCC
Q psy13406 150 VFLCSVCSKTFTD 162 (1043)
Q Consensus 150 ~~~C~~C~~~f~~ 162 (1043)
.|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 4555555555553
No 114
>KOG2272|consensus
Probab=43.91 E-value=9.8 Score=37.35 Aligned_cols=12 Identities=33% Similarity=0.686 Sum_probs=8.0
Q ss_pred ccCCCCCccccC
Q psy13406 207 FQCNVCYICFTN 218 (1043)
Q Consensus 207 ~~C~~C~~~f~~ 218 (1043)
|.|..|.+-|.-
T Consensus 222 FvCa~CekPFlG 233 (332)
T KOG2272|consen 222 FVCAKCEKPFLG 233 (332)
T ss_pred eeehhcCCcccc
Confidence 667777776654
No 115
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=43.65 E-value=14 Score=35.77 Aligned_cols=15 Identities=27% Similarity=0.519 Sum_probs=6.5
Q ss_pred CCCcccccccCChhH
Q psy13406 837 HQCQVCGNRFKQKSN 851 (1043)
Q Consensus 837 ~~C~~C~~~f~~~~~ 851 (1043)
|.|+.|+..|.....
T Consensus 118 Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 118 FFCPNCHIRFTFDEA 132 (178)
T ss_pred EECCCCCcEEeHHHH
Confidence 444444444443333
No 116
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=42.22 E-value=20 Score=31.75 Aligned_cols=29 Identities=45% Similarity=0.749 Sum_probs=22.7
Q ss_pred CCCccccccccccccCHHHHHHHHHHhcCCCc
Q psy13406 711 TPQEFPCSECGKVFTRKTTLKRHVRIHTGIKE 742 (1043)
Q Consensus 711 ~~~~~~C~~C~k~F~~~~~L~~H~~~H~~~~~ 742 (1043)
.+.-..|-.+||.| .+|++|+.+|.|..|
T Consensus 73 tpD~IicLEDGkkf---KSLKRHL~t~~gmTP 101 (148)
T COG4957 73 TPDYIICLEDGKKF---KSLKRHLTTHYGLTP 101 (148)
T ss_pred CCCeEEEeccCcch---HHHHHHHhcccCCCH
Confidence 33457899999998 568999999987654
No 117
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=41.88 E-value=22 Score=42.50 Aligned_cols=10 Identities=20% Similarity=0.567 Sum_probs=5.2
Q ss_pred cceeCCCCCC
Q psy13406 261 KRYLCTMCPK 270 (1043)
Q Consensus 261 ~~~~C~~C~~ 270 (1043)
+...|..||+
T Consensus 461 ~~L~CH~Cg~ 470 (730)
T COG1198 461 GQLRCHYCGY 470 (730)
T ss_pred CeeEeCCCCC
Confidence 3455555553
No 118
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=41.48 E-value=16 Score=26.19 Aligned_cols=10 Identities=20% Similarity=0.939 Sum_probs=4.5
Q ss_pred eecccCCccc
Q psy13406 151 FLCSVCSKTF 160 (1043)
Q Consensus 151 ~~C~~C~~~f 160 (1043)
|.|..|++.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 4444444444
No 119
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=41.28 E-value=9.3 Score=30.21 Aligned_cols=6 Identities=33% Similarity=1.232 Sum_probs=2.5
Q ss_pred CCCCCc
Q psy13406 97 CSQCPK 102 (1043)
Q Consensus 97 C~~C~~ 102 (1043)
|+.|+.
T Consensus 4 CP~Cg~ 9 (72)
T PRK09678 4 CPLCQH 9 (72)
T ss_pred CCCCCC
Confidence 444443
No 120
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=40.96 E-value=17 Score=32.10 Aligned_cols=8 Identities=25% Similarity=0.700 Sum_probs=3.2
Q ss_pred cccCCccc
Q psy13406 153 CSVCSKTF 160 (1043)
Q Consensus 153 C~~C~~~f 160 (1043)
|+.|++.|
T Consensus 12 Cp~cg~kF 19 (129)
T TIGR02300 12 CPNTGSKF 19 (129)
T ss_pred CCCcCccc
Confidence 44444433
No 121
>PHA00626 hypothetical protein
Probab=40.57 E-value=14 Score=27.00 Aligned_cols=14 Identities=21% Similarity=0.560 Sum_probs=8.3
Q ss_pred cceeCCCCCCcCCC
Q psy13406 261 KRYLCTMCPKCFTR 274 (1043)
Q Consensus 261 ~~~~C~~C~~~f~~ 274 (1043)
..|.|+.||+.|+.
T Consensus 22 nrYkCkdCGY~ft~ 35 (59)
T PHA00626 22 DDYVCCDCGYNDSK 35 (59)
T ss_pred cceEcCCCCCeech
Confidence 35666666666654
No 122
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=40.54 E-value=17 Score=26.38 Aligned_cols=21 Identities=33% Similarity=0.909 Sum_probs=12.1
Q ss_pred CCCcccccccCCh-----hHHHHHHh
Q psy13406 837 HQCQVCGNRFKQK-----SNLNIHMK 857 (1043)
Q Consensus 837 ~~C~~C~~~f~~~-----~~l~~H~~ 857 (1043)
-.|.+|++.+... +.|.+|++
T Consensus 19 a~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecCCCCEeeeCCCCCcHHHHHHHH
Confidence 3566666666443 46666665
No 123
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=40.22 E-value=41 Score=31.41 Aligned_cols=13 Identities=15% Similarity=0.830 Sum_probs=6.7
Q ss_pred ccccccCCccccC
Q psy13406 122 VFRCEQCRFDFYV 134 (1043)
Q Consensus 122 ~~~C~~C~~~f~~ 134 (1043)
.|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 4555555555544
No 124
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=39.40 E-value=23 Score=42.46 Aligned_cols=49 Identities=20% Similarity=0.607 Sum_probs=33.9
Q ss_pred ccCCCCCcccCchHHHHHHhhhccCCCccccccCCccccCHHHHHHHHHhccCCcceecccCCcccCCHHHHHHHHHhcC
Q psy13406 95 FTCSQCPKTFVDKWHLNRHLKSHSENKVFRCEQCRFDFYVKREYNRHMKIHDGVKVFLCSVCSKTFTDKVKFNRHMRAHE 174 (1043)
Q Consensus 95 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 174 (1043)
..|..||..+ .|+.|+..+ . .|.......|..||..
T Consensus 436 l~C~~Cg~v~-------------------~Cp~Cd~~l------t----~H~~~~~L~CH~Cg~~--------------- 471 (730)
T COG1198 436 LLCRDCGYIA-------------------ECPNCDSPL------T----LHKATGQLRCHYCGYQ--------------- 471 (730)
T ss_pred eecccCCCcc-------------------cCCCCCcce------E----EecCCCeeEeCCCCCC---------------
Confidence 4688888864 688887652 2 3444456888888864
Q ss_pred CCCCeecCCCccc
Q psy13406 175 GIKPFQCSVCSES 187 (1043)
Q Consensus 175 ~~~~~~C~~C~~~ 187 (1043)
...|..|+.||..
T Consensus 472 ~~~p~~Cp~Cgs~ 484 (730)
T COG1198 472 EPIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCCC
Confidence 3457888888865
No 125
>KOG4377|consensus
Probab=39.38 E-value=16 Score=39.02 Aligned_cols=21 Identities=19% Similarity=0.363 Sum_probs=15.8
Q ss_pred ccccccCChhhHHhhhccccC
Q psy13406 239 MCAKCFSRKDDLNRHMKIHEG 259 (1043)
Q Consensus 239 ~C~~~f~~~~~l~~H~~~h~~ 259 (1043)
-|+.+|.+.+.+..|.+.|..
T Consensus 408 Gc~~tl~s~sqm~shkrkheR 428 (480)
T KOG4377|consen 408 GCEATLYSVSQMASHKRKHER 428 (480)
T ss_pred CCceEEEehhhhhhhhhhhhh
Confidence 377888888888888877753
No 126
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=39.30 E-value=26 Score=33.90 Aligned_cols=18 Identities=28% Similarity=0.578 Sum_probs=9.9
Q ss_pred cceecccCCcccCCHHHH
Q psy13406 149 KVFLCSVCSKTFTDKVKF 166 (1043)
Q Consensus 149 ~~~~C~~C~~~f~~~~~l 166 (1043)
.-|.|+.|+..|+...++
T Consensus 116 ~~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAM 133 (178)
T ss_pred CEEECCCCCcEEeHHHHh
Confidence 345566666655555444
No 127
>PF12907 zf-met2: Zinc-binding
Probab=38.88 E-value=26 Score=24.04 Aligned_cols=21 Identities=29% Similarity=0.801 Sum_probs=11.0
Q ss_pred cCccchhHHhC---ChHHHHHHHH
Q psy13406 893 FLCSICAKNFS---RKDDLNRHMR 913 (1043)
Q Consensus 893 ~~C~~C~~~f~---~~~~l~~H~~ 913 (1043)
+.|.+|..+|. +...|+.|..
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~e 25 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAE 25 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHH
Confidence 45666664443 3445555654
No 128
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=38.78 E-value=21 Score=35.50 Aligned_cols=95 Identities=23% Similarity=0.449 Sum_probs=48.9
Q ss_pred cCCCCCeecCCCcccccCcchHHHHHhhc--cCCCCccCCCCCccccChhHHHhhhhhccCCCcccCCccccccCChhhH
Q psy13406 173 HEGIKPFQCSVCSESFTQRSNLNIHLRIH--DGIRPFQCNVCYICFTNKSDLNRHSRIHNGIKPFLCSMCAKCFSRKDDL 250 (1043)
Q Consensus 173 h~~~~~~~C~~C~~~f~~~~~l~~H~~~h--~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 250 (1043)
.+|.+.|+|.+|..-.-.-.. -.|+.+- .....|+|.-|++. -.|.|--|.-.|-.-...
T Consensus 137 ~hGGrif~CsfC~~flCEDDQ-FEHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfCddHvr 198 (314)
T PF06524_consen 137 DHGGRIFKCSFCDNFLCEDDQ-FEHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFCDDHVR 198 (314)
T ss_pred cCCCeEEEeecCCCeeeccch-hhhhhhhhhhhcccccccccccc-----------------cchhhhheeeeehhhhhh
Confidence 456677888888654322222 2444321 12345777777652 123444444434322111
Q ss_pred HhhhccccCCcceeCCCCCCcCCCHHHHHHHHHHhC
Q psy13406 251 NRHMKIHEGSKRYLCTMCPKCFTRKDDLNRHMRNHD 286 (1043)
Q Consensus 251 ~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~ 286 (1043)
+.=++. ...+++.|+.|+........|..-.|+|.
T Consensus 199 rKg~ky-~k~k~~PCPKCg~et~eTkdLSmStR~hk 233 (314)
T PF06524_consen 199 RKGFKY-EKGKPIPCPKCGYETQETKDLSMSTRSHK 233 (314)
T ss_pred hccccc-ccCCCCCCCCCCCcccccccceeeeecch
Confidence 111111 23467888888887777777766666653
No 129
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=38.68 E-value=15 Score=23.36 Aligned_cols=7 Identities=29% Similarity=1.222 Sum_probs=3.1
Q ss_pred eecccCC
Q psy13406 151 FLCSVCS 157 (1043)
Q Consensus 151 ~~C~~C~ 157 (1043)
|.|+.|+
T Consensus 20 ~vCp~C~ 26 (30)
T PF08274_consen 20 LVCPECG 26 (30)
T ss_dssp EEETTTT
T ss_pred EeCCccc
Confidence 4444443
No 130
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=38.60 E-value=33 Score=25.96 Aligned_cols=39 Identities=31% Similarity=0.586 Sum_probs=16.9
Q ss_pred CeecCC--CcccccCcchHHHHHhhccCCCCccCCC----CCcccc
Q psy13406 178 PFQCSV--CSESFTQRSNLNIHLRIHDGIRPFQCNV----CYICFT 217 (1043)
Q Consensus 178 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 217 (1043)
+..|+. |...+. +..|..|+...-..++..|.+ |+..+.
T Consensus 9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 345555 333333 445666665444445555555 555444
No 131
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=37.14 E-value=28 Score=25.94 Aligned_cols=10 Identities=60% Similarity=1.707 Sum_probs=5.7
Q ss_pred cccccccccc
Q psy13406 714 EFPCSECGKV 723 (1043)
Q Consensus 714 ~~~C~~C~k~ 723 (1043)
.|.|+.||+.
T Consensus 27 ~F~CPnCGe~ 36 (61)
T COG2888 27 KFPCPNCGEV 36 (61)
T ss_pred EeeCCCCCce
Confidence 3666666644
No 132
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=36.03 E-value=14 Score=29.25 Aligned_cols=7 Identities=29% Similarity=0.595 Sum_probs=2.9
Q ss_pred CCCcccc
Q psy13406 211 VCYICFT 217 (1043)
Q Consensus 211 ~C~~~f~ 217 (1043)
.||.+|.
T Consensus 34 eCg~tF~ 40 (72)
T PRK09678 34 NCSATFI 40 (72)
T ss_pred CCCCEEE
Confidence 3444443
No 133
>KOG2071|consensus
Probab=35.72 E-value=22 Score=40.36 Aligned_cols=27 Identities=37% Similarity=0.760 Sum_probs=22.5
Q ss_pred CCCCCCcccccccCChhHHHHHHhhhc
Q psy13406 834 VRDHQCQVCGNRFKQKSNLNIHMKIHE 860 (1043)
Q Consensus 834 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 860 (1043)
.++..|..||.+|........||-.|.
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhh
Confidence 567889999999999888888887774
No 134
>PRK04023 DNA polymerase II large subunit; Validated
Probab=35.53 E-value=33 Score=41.74 Aligned_cols=9 Identities=33% Similarity=1.198 Sum_probs=4.6
Q ss_pred CCCcccccc
Q psy13406 837 HQCQVCGNR 845 (1043)
Q Consensus 837 ~~C~~C~~~ 845 (1043)
+.|+.||..
T Consensus 664 y~CPKCG~E 672 (1121)
T PRK04023 664 DECEKCGRE 672 (1121)
T ss_pred CcCCCCCCC
Confidence 445555544
No 135
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=34.45 E-value=25 Score=24.47 Aligned_cols=29 Identities=24% Similarity=0.654 Sum_probs=20.2
Q ss_pred ccccccccccccccccccccccccCcCcc
Q psy13406 354 FNCSKCLNIVRIPVEIKVEVPFVCKACAS 382 (1043)
Q Consensus 354 ~~c~~c~~~~~~~~~~~~~~~~~c~~c~~ 382 (1043)
|.|..|+..|.....+.......|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~~~~~~CP~Cg~ 34 (42)
T PF09723_consen 6 YRCEECGHEFEVLQSISEDDPVPCPECGS 34 (42)
T ss_pred EEeCCCCCEEEEEEEcCCCCCCcCCCCCC
Confidence 56777777776666655556667888876
No 136
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=34.26 E-value=29 Score=35.42 Aligned_cols=81 Identities=28% Similarity=0.534 Sum_probs=42.9
Q ss_pred CCCcccCcCCcccCCHHHHHHHHHHhcC-------CC----CCCCcccccccccccChHHHhhhhcccCCCCCCCCcccc
Q psy13406 775 EAPYACSICDRKFTIKGQLSRHLRAHEN-------EK----PEKPQTCEVCQKTFDKMAKFKRHLKVHDGVRDHQCQVCG 843 (1043)
Q Consensus 775 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~-------~~----~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 843 (1043)
.-|..|+.|+........|.+-. |+- +. ..+.-.|-.|+-.|.....-.. ..-+....|.|+.|.
T Consensus 320 sLPi~CP~Csl~LilsthLarSy--hhL~PLk~f~E~p~~~~~ks~~Cf~CQ~~fp~~~~~~~--~~~~ss~rY~Ce~CK 395 (421)
T COG5151 320 SLPISCPICSLQLILSTHLARSY--HHLYPLKPFVEKPEGTNPKSTHCFVCQGPFPKPPVSPF--DESTSSGRYQCELCK 395 (421)
T ss_pred cCCccCcchhHHHHHHHHHHHHH--HhhccCcccccccCCCCCCCccceeccCCCCCCCCCcc--cccccccceechhhh
Confidence 45778888876554433333211 110 11 0123447777777764321110 011223457777777
Q ss_pred cccCChhHHHHHHhhh
Q psy13406 844 NRFKQKSNLNIHMKIH 859 (1043)
Q Consensus 844 ~~f~~~~~l~~H~~~h 859 (1043)
..|-...+.-.|-..|
T Consensus 396 ~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 396 STFCSDCDVFIHETLH 411 (421)
T ss_pred hhhhhhhHHHHHHHHh
Confidence 7777777777776665
No 137
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=33.53 E-value=25 Score=34.91 Aligned_cols=30 Identities=17% Similarity=0.440 Sum_probs=22.4
Q ss_pred CCcccCccchhhcCCHHHHHHHHhhhcCCC
Q psy13406 918 IKPFLCSMCAEAFSRKDHLKKHMRKCYGPP 947 (1043)
Q Consensus 918 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~ 947 (1043)
+..|.|..|+|.|.-...+++|+.+.|...
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 345999999999999999999998887644
No 138
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=32.77 E-value=25 Score=26.65 Aligned_cols=39 Identities=18% Similarity=0.500 Sum_probs=16.0
Q ss_pred cccCCC--CCcccCchHHHHHHhhhccCCCcccccc----CCcccc
Q psy13406 94 SFTCSQ--CPKTFVDKWHLNRHLKSHSENKVFRCEQ----CRFDFY 133 (1043)
Q Consensus 94 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~----C~~~f~ 133 (1043)
+..|+. |...+. +..|..|+...-..++..|.. |+..+.
T Consensus 9 ~v~C~~~cc~~~i~-r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~~ 53 (60)
T PF02176_consen 9 PVPCPNGCCNEMIP-RKELDDHLENECPKRPVPCPYSPYGCKERVP 53 (60)
T ss_dssp EEE-TT--S-BEEE-CCCHHHHHHTTSTTSEEE-SS----S--EEE
T ss_pred EeeCCCCCccccee-HHHHHHHHHccCCCCcEECCCCCCCCCCccc
Confidence 345555 333332 445555555444444455555 555443
No 139
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=32.63 E-value=17 Score=29.49 Aligned_cols=32 Identities=19% Similarity=0.564 Sum_probs=19.3
Q ss_pred CCcCccchhHHhCChHHHHHHHHhccCCCcccCccchhhcCC
Q psy13406 891 KPFLCSICAKNFSRKDDLNRHMRKHDGIKPFLCSMCAEAFSR 932 (1043)
Q Consensus 891 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 932 (1043)
.+|.|+.|++. .+.|. +...+.|..||..|.-
T Consensus 34 ~~~~Cp~C~~~--------~VkR~--a~GIW~C~kCg~~fAG 65 (89)
T COG1997 34 AKHVCPFCGRT--------TVKRI--ATGIWKCRKCGAKFAG 65 (89)
T ss_pred cCCcCCCCCCc--------ceeee--ccCeEEcCCCCCeecc
Confidence 45778888764 22233 2334778888877754
No 140
>KOG3408|consensus
Probab=32.44 E-value=47 Score=28.91 Aligned_cols=28 Identities=21% Similarity=0.431 Sum_probs=20.6
Q ss_pred cccccceecCCCCcccCCHHHHHHHHHH
Q psy13406 51 KKMVKKFKCNICPKRYARKNRLTNHLRT 78 (1043)
Q Consensus 51 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 78 (1043)
..+...|-|-.|.+-|.+...|..|.++
T Consensus 52 lPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 52 LPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 4445667788888888888888888754
No 141
>KOG2593|consensus
Probab=30.66 E-value=54 Score=35.80 Aligned_cols=15 Identities=27% Similarity=0.720 Sum_probs=7.2
Q ss_pred ceecccCCcccCCHH
Q psy13406 150 VFLCSVCSKTFTDKV 164 (1043)
Q Consensus 150 ~~~C~~C~~~f~~~~ 164 (1043)
-|.|+.|.+.|....
T Consensus 128 ~Y~Cp~C~kkyt~Le 142 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLE 142 (436)
T ss_pred cccCCccccchhhhH
Confidence 355555555554333
No 142
>KOG4124|consensus
Probab=29.62 E-value=9.6 Score=39.47 Aligned_cols=23 Identities=30% Similarity=0.739 Sum_probs=15.5
Q ss_pred CCcccCccchhhcCCHHHHHHHH
Q psy13406 918 IKPFLCSMCAEAFSRKDHLKKHM 940 (1043)
Q Consensus 918 ~~~~~C~~C~~~f~~~~~l~~H~ 940 (1043)
.|+|+|++|.+++.....|+-|.
T Consensus 396 nk~~r~~i~~~~~k~~~~l~~~~ 418 (442)
T KOG4124|consen 396 NKPYRCEVCSKRYKNLNGLKYHR 418 (442)
T ss_pred cCcccChhhhhhhccCCCCCcee
Confidence 56788888887776665555443
No 143
>PRK04023 DNA polymerase II large subunit; Validated
Probab=29.54 E-value=49 Score=40.35 Aligned_cols=13 Identities=15% Similarity=0.603 Sum_probs=7.1
Q ss_pred CCeecCCCccccc
Q psy13406 177 KPFQCSVCSESFT 189 (1043)
Q Consensus 177 ~~~~C~~C~~~f~ 189 (1043)
..+.|+.|+....
T Consensus 650 ~i~fCP~CG~~~~ 662 (1121)
T PRK04023 650 PVYRCPRCGIEVE 662 (1121)
T ss_pred cceeCccccCcCC
Confidence 3466666655443
No 144
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=27.89 E-value=39 Score=23.12 Aligned_cols=29 Identities=17% Similarity=0.542 Sum_probs=16.2
Q ss_pred ccccccccccccccccccccccccCcCcc
Q psy13406 354 FNCSKCLNIVRIPVEIKVEVPFVCKACAS 382 (1043)
Q Consensus 354 ~~c~~c~~~~~~~~~~~~~~~~~c~~c~~ 382 (1043)
|.|..|+..|.............|+.|+.
T Consensus 6 y~C~~Cg~~fe~~~~~~~~~~~~CP~Cg~ 34 (41)
T smart00834 6 YRCEDCGHTFEVLQKISDDPLATCPECGG 34 (41)
T ss_pred EEcCCCCCEEEEEEecCCCCCCCCCCCCC
Confidence 45666666665444443344455777765
No 145
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=26.89 E-value=23 Score=30.77 Aligned_cols=13 Identities=15% Similarity=0.519 Sum_probs=8.8
Q ss_pred CeecCCCcccccC
Q psy13406 178 PFQCSVCSESFTQ 190 (1043)
Q Consensus 178 ~~~C~~C~~~f~~ 190 (1043)
.+.|+.|+..+..
T Consensus 16 ~~~C~~C~~~~~~ 28 (104)
T TIGR01384 16 VYVCPSCGYEKEK 28 (104)
T ss_pred eEECcCCCCcccc
Confidence 4777777766554
No 146
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=26.53 E-value=46 Score=24.99 Aligned_cols=10 Identities=50% Similarity=1.378 Sum_probs=5.5
Q ss_pred cccccccccc
Q psy13406 714 EFPCSECGKV 723 (1043)
Q Consensus 714 ~~~C~~C~k~ 723 (1043)
.|.|+.||+.
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 3566666544
No 147
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=25.76 E-value=53 Score=23.35 Aligned_cols=30 Identities=17% Similarity=0.348 Sum_probs=17.1
Q ss_pred ccccccccccccccccccccccccCcCcccccc
Q psy13406 354 FNCSKCLNIVRIPVEIKVEVPFVCKACASEETK 386 (1043)
Q Consensus 354 ~~c~~c~~~~~~~~~~~~~~~~~c~~c~~~~~~ 386 (1043)
|.|..|+..+....... ...|+.|+.....
T Consensus 4 y~C~~CG~~~~~~~~~~---~~~Cp~CG~~~~~ 33 (46)
T PRK00398 4 YKCARCGREVELDEYGT---GVRCPYCGYRILF 33 (46)
T ss_pred EECCCCCCEEEECCCCC---ceECCCCCCeEEE
Confidence 55666666555443332 5667777765543
No 148
>KOG2593|consensus
Probab=25.56 E-value=50 Score=36.01 Aligned_cols=9 Identities=33% Similarity=0.925 Sum_probs=4.9
Q ss_pred cccCCcccc
Q psy13406 234 PFLCSMCAK 242 (1043)
Q Consensus 234 ~~~C~~C~~ 242 (1043)
.|.|..|+-
T Consensus 153 ~F~C~~C~g 161 (436)
T KOG2593|consen 153 EFHCENCGG 161 (436)
T ss_pred eEEEecCCC
Confidence 455666654
No 149
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=25.34 E-value=44 Score=33.22 Aligned_cols=29 Identities=17% Similarity=0.345 Sum_probs=22.4
Q ss_pred CCCccCccchhhcCCHHHHHHHHHHhcCC
Q psy13406 316 SKPHACSICTESFSQKSNLYIHLKLQHGV 344 (1043)
Q Consensus 316 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~ 344 (1043)
+..|.|.+|+|.|.-..-..+|+...|.+
T Consensus 75 ~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 75 EDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred CCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 34589999999999999999999998874
No 150
>KOG1280|consensus
Probab=25.07 E-value=73 Score=33.53 Aligned_cols=24 Identities=17% Similarity=0.419 Sum_probs=10.8
Q ss_pred cccCcCCcccCCHHHHHHHHHHhc
Q psy13406 778 YACSICDRKFTIKGQLSRHLRAHE 801 (1043)
Q Consensus 778 ~~C~~C~~~f~~~~~l~~H~~~h~ 801 (1043)
|.|++|+..=.+...|..|+...|
T Consensus 80 ftCPyC~~~Gfte~~f~~Hv~s~H 103 (381)
T KOG1280|consen 80 FTCPYCGIMGFTERQFGTHVLSQH 103 (381)
T ss_pred ccCCcccccccchhHHHHHhhhcC
Confidence 444444444444444444444433
No 151
>PF14353 CpXC: CpXC protein
Probab=24.93 E-value=9.1 Score=34.81 Aligned_cols=19 Identities=32% Similarity=0.789 Sum_probs=9.7
Q ss_pred CCCcccccccCChhHHHHH
Q psy13406 837 HQCQVCGNRFKQKSNLNIH 855 (1043)
Q Consensus 837 ~~C~~C~~~f~~~~~l~~H 855 (1043)
|.|+.||..|.-...+..|
T Consensus 39 ~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 39 FTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EECCCCCCceecCCCEEEE
Confidence 4555555555544444443
No 152
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.60 E-value=57 Score=27.65 Aligned_cols=13 Identities=8% Similarity=0.217 Sum_probs=7.6
Q ss_pred CCcccCCCCCccc
Q psy13406 92 EGSFTCSQCPKTF 104 (1043)
Q Consensus 92 ~~~~~C~~C~~~f 104 (1043)
..|..|++||++|
T Consensus 24 rdPiVsPytG~s~ 36 (129)
T COG4530 24 RDPIVSPYTGKSY 36 (129)
T ss_pred CCccccCcccccc
Confidence 3455666666666
No 153
>KOG2272|consensus
Probab=23.93 E-value=65 Score=31.93 Aligned_cols=20 Identities=20% Similarity=0.479 Sum_probs=15.5
Q ss_pred CccccccccccccCHHHHHH
Q psy13406 713 QEFPCSECGKVFTRKTTLKR 732 (1043)
Q Consensus 713 ~~~~C~~C~k~F~~~~~L~~ 732 (1043)
.-|.|++|++...+...++.
T Consensus 98 ~CF~Cd~Cn~~Lad~gf~rn 117 (332)
T KOG2272|consen 98 ACFRCDLCNKHLADQGFYRN 117 (332)
T ss_pred ccchhHHHHHHHhhhhhHhh
Confidence 46999999988877766654
No 154
>KOG4167|consensus
Probab=23.22 E-value=33 Score=39.70 Aligned_cols=82 Identities=12% Similarity=0.052 Sum_probs=51.9
Q ss_pred CCCcccccccCChhHHHHHHhhhcCCCc--cCCCcccccCCCHHHHHHHHHhhcC--CCCcCccchhHHhCChHHHHHHH
Q psy13406 837 HQCQVCGNRFKQKSNLNIHMKIHEGIKA--HQCNVCMMKFTNKSDLNRHMRKHDG--VKPFLCSICAKNFSRKDDLNRHM 912 (1043)
Q Consensus 837 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~--~~C~~C~~~f~~~~~l~~H~~~h~~--~~~~~C~~C~~~f~~~~~l~~H~ 912 (1043)
+.|.-|...|.....+-.|+.+++..-+ +.|..|+..|.....--.|+..+-- .+.-.|.+|.+.|..-..+..++
T Consensus 289 ~p~k~~St~p~~e~G~~s~~~~psa~wpqhlP~g~~~~~~p~s~g~s~~a~~~Qv~~P~g~~~si~~~~~~llP~a~~pI 368 (907)
T KOG4167|consen 289 RPLKSPSTIPLNEMGPQSQQLSPSAMWPQHLPDGRAQPGSPESSGQSKGAFGEQVDAPNGLTCSICLKEFKLLPAANGPI 368 (907)
T ss_pred ccccccccchhhhcCCccccccchhhccccCcccccCCCCCCcccccccchHhhhcCCCCCccccccCCCCcCcCCCCce
Confidence 6777888888877777777777765544 6777777766554444444443321 12245777777777666666666
Q ss_pred HhccCC
Q psy13406 913 RKHDGI 918 (1043)
Q Consensus 913 ~~h~~~ 918 (1043)
+.+.+.
T Consensus 369 ~aP~~~ 374 (907)
T KOG4167|consen 369 RAPGGM 374 (907)
T ss_pred ecCCCC
Confidence 665554
No 155
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=22.94 E-value=35 Score=30.33 Aligned_cols=17 Identities=35% Similarity=0.741 Sum_probs=13.4
Q ss_pred cceecCCCCcccCCHHH
Q psy13406 55 KKFKCNICPKRYARKNR 71 (1043)
Q Consensus 55 ~~~~C~~C~~~f~~~~~ 71 (1043)
+.|.|++|..+...+.-
T Consensus 79 ~lYeCnIC~etS~ee~F 95 (140)
T PF05290_consen 79 KLYECNICKETSAEERF 95 (140)
T ss_pred CceeccCcccccchhhc
Confidence 78999999988766543
No 156
>KOG3408|consensus
Probab=22.85 E-value=53 Score=28.62 Aligned_cols=27 Identities=37% Similarity=0.833 Sum_probs=23.9
Q ss_pred CCCCcccCCCCCcccCchHHHHHHhhh
Q psy13406 90 GGEGSFTCSQCPKTFVDKWHLNRHLKS 116 (1043)
Q Consensus 90 ~~~~~~~C~~C~~~f~~~~~l~~H~~~ 116 (1043)
.|.+.|-|-.|.+-|.+...|..|.+.
T Consensus 53 PG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 53 PGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCCceeehhhhhhhhcchHHHHHHHhc
Confidence 467789999999999999999999864
No 157
>KOG4124|consensus
Probab=22.25 E-value=28 Score=36.23 Aligned_cols=24 Identities=29% Similarity=0.730 Sum_probs=16.7
Q ss_pred CCCccCCC--CCccccChhHHHhhhh
Q psy13406 204 IRPFQCNV--CYICFTNKSDLNRHSR 227 (1043)
Q Consensus 204 ~~~~~C~~--C~~~f~~~~~l~~H~~ 227 (1043)
.++|.|.+ |.+.+.+...|+.|..
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~ 372 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKL 372 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccc
Confidence 35677766 7777777777777653
No 158
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=21.95 E-value=41 Score=22.51 Aligned_cols=11 Identities=27% Similarity=0.936 Sum_probs=5.1
Q ss_pred CCCCccccccc
Q psy13406 836 DHQCQVCGNRF 846 (1043)
Q Consensus 836 ~~~C~~C~~~f 846 (1043)
+-.|..||-.+
T Consensus 21 ~~~Cd~cg~~L 31 (36)
T PF05191_consen 21 EGVCDNCGGEL 31 (36)
T ss_dssp TTBCTTTTEBE
T ss_pred CCccCCCCCee
Confidence 34455555433
No 159
>PRK14873 primosome assembly protein PriA; Provisional
Probab=21.82 E-value=66 Score=38.63 Aligned_cols=11 Identities=18% Similarity=0.507 Sum_probs=6.0
Q ss_pred CcceeCCCCCC
Q psy13406 260 SKRYLCTMCPK 270 (1043)
Q Consensus 260 ~~~~~C~~C~~ 270 (1043)
.....|..||+
T Consensus 408 ~~~l~Ch~CG~ 418 (665)
T PRK14873 408 GGTPRCRWCGR 418 (665)
T ss_pred CCeeECCCCcC
Confidence 34455666654
No 160
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=20.95 E-value=81 Score=39.81 Aligned_cols=10 Identities=20% Similarity=0.487 Sum_probs=4.9
Q ss_pred Cccccccccc
Q psy13406 808 PQTCEVCQKT 817 (1043)
Q Consensus 808 ~~~C~~C~~~ 817 (1043)
.|.|+.|+..
T Consensus 692 vy~CPsCGae 701 (1337)
T PRK14714 692 VYVCPDCGAE 701 (1337)
T ss_pred ceeCccCCCc
Confidence 3445555553
No 161
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=20.33 E-value=81 Score=36.62 Aligned_cols=40 Identities=20% Similarity=0.475 Sum_probs=23.8
Q ss_pred ccCCccccccCChhhHHhhhccccCCcceeCCCCCCcCCCHHHHHHHHHHhCCCCCcccCcCccc
Q psy13406 235 FLCSMCAKCFSRKDDLNRHMKIHEGSKRYLCTMCPKCFTRKDDLNRHMRNHDGLKPFHCSVCFES 299 (1043)
Q Consensus 235 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 299 (1043)
..|+.|+-. +..|.......|..||+.. ..|..|+.|+..
T Consensus 223 ~~C~~C~~~----------l~~h~~~~~l~Ch~Cg~~~---------------~~~~~Cp~C~s~ 262 (505)
T TIGR00595 223 LCCPNCDVS----------LTYHKKEGKLRCHYCGYQE---------------PIPKTCPQCGSE 262 (505)
T ss_pred cCCCCCCCc----------eEEecCCCeEEcCCCcCcC---------------CCCCCCCCCCCC
Confidence 447777542 3334445567788887643 236678888653
No 162
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=20.20 E-value=71 Score=26.26 Aligned_cols=13 Identities=31% Similarity=0.772 Sum_probs=9.0
Q ss_pred CCcccCcCCcccC
Q psy13406 776 APYACSICDRKFT 788 (1043)
Q Consensus 776 ~~~~C~~C~~~f~ 788 (1043)
.|-.|..||..|.
T Consensus 57 ~Pa~CkkCGfef~ 69 (97)
T COG3357 57 RPARCKKCGFEFR 69 (97)
T ss_pred cChhhcccCcccc
Confidence 4566777777775
Done!