Query psy16259
Match_columns 896
No_of_seqs 1007 out of 4918
Neff 9.2
Searched_HMMs 46136
Date Fri Aug 16 16:49:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16259.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16259hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 2.8E-32 6.1E-37 300.5 5.9 248 577-832 605-932 (958)
2 KOG1074|consensus 100.0 5.3E-32 1.2E-36 298.3 5.5 208 633-841 605-912 (958)
3 KOG3608|consensus 99.9 1.8E-27 3.9E-32 237.6 8.8 187 639-834 185-379 (467)
4 KOG2462|consensus 99.9 2.9E-27 6.3E-32 232.3 5.3 133 689-829 130-265 (279)
5 KOG2462|consensus 99.9 4.4E-27 9.5E-32 231.1 5.9 134 660-801 129-265 (279)
6 KOG3608|consensus 99.9 1.8E-26 4E-31 230.5 7.9 207 606-818 178-397 (467)
7 KOG3623|consensus 99.9 4.6E-23 1E-27 223.5 11.2 106 325-430 210-330 (1007)
8 KOG3623|consensus 99.9 3.4E-23 7.3E-28 224.6 8.9 119 466-599 211-331 (1007)
9 KOG3576|consensus 99.7 5.3E-18 1.1E-22 157.5 1.9 119 687-805 115-238 (267)
10 KOG3576|consensus 99.7 1.2E-17 2.7E-22 155.0 2.2 111 631-741 115-236 (267)
11 PLN03086 PRLI-interacting fact 99.3 4.6E-12 1E-16 141.7 7.8 101 719-832 455-565 (567)
12 PLN03086 PRLI-interacting fact 99.2 3.9E-11 8.4E-16 134.3 8.7 97 663-766 455-561 (567)
13 KOG1146|consensus 99.2 9.9E-10 2.1E-14 129.8 20.4 109 717-836 1228-1357(1406)
14 PF01352 KRAB: KRAB box; Inte 99.1 1.1E-11 2.3E-16 87.7 -0.5 41 199-244 1-41 (41)
15 PHA00733 hypothetical protein 99.1 1.2E-10 2.7E-15 106.6 4.5 86 743-833 38-125 (128)
16 PHA00733 hypothetical protein 99.0 4.2E-10 9.2E-15 103.1 5.1 80 687-768 38-122 (128)
17 KOG3993|consensus 98.9 7.8E-11 1.7E-15 122.3 -1.7 199 633-838 267-489 (500)
18 KOG3993|consensus 98.8 7.2E-10 1.6E-14 115.3 0.0 53 325-377 267-319 (500)
19 PHA02768 hypothetical protein; 98.8 1.7E-09 3.7E-14 80.5 1.8 43 780-824 6-48 (55)
20 PHA02768 hypothetical protein; 98.6 1.4E-08 3E-13 75.8 1.9 23 746-768 6-28 (55)
21 PF13465 zf-H2C2_2: Zinc-finge 98.5 3.8E-08 8.2E-13 62.8 1.8 26 794-819 1-26 (26)
22 KOG1146|consensus 98.4 7E-07 1.5E-11 106.4 9.0 95 355-458 438-539 (1406)
23 PF13465 zf-H2C2_2: Zinc-finge 98.2 5E-07 1.1E-11 57.6 1.5 22 678-699 3-24 (26)
24 KOG3671|consensus 98.2 3.4E-05 7.5E-10 82.9 15.6 10 6-15 299-308 (569)
25 KOG3671|consensus 98.2 1.9E-05 4E-10 84.9 13.1 6 155-160 494-499 (569)
26 PHA00616 hypothetical protein 98.1 7.6E-07 1.6E-11 63.2 1.2 39 779-817 1-39 (44)
27 PHA00732 hypothetical protein 98.0 3.4E-06 7.3E-11 69.9 3.3 21 746-766 2-22 (79)
28 PHA00732 hypothetical protein 98.0 3.1E-06 6.8E-11 70.0 3.1 45 779-829 1-46 (79)
29 PHA00616 hypothetical protein 98.0 3.6E-06 7.7E-11 59.8 1.7 33 662-694 2-34 (44)
30 COG5048 FOG: Zn-finger [Genera 97.7 6.9E-05 1.5E-09 86.3 8.3 55 380-434 32-88 (467)
31 PF05605 zf-Di19: Drought indu 97.6 4.7E-05 1E-09 58.5 3.3 49 780-831 3-53 (54)
32 COG5048 FOG: Zn-finger [Genera 97.6 0.00012 2.7E-09 84.2 8.2 57 463-519 31-89 (467)
33 PF05605 zf-Di19: Drought indu 97.6 9.9E-05 2.2E-09 56.7 4.4 51 746-803 3-53 (54)
34 PF00096 zf-C2H2: Zinc finger, 97.5 4.5E-05 9.7E-10 47.2 1.3 22 780-801 1-22 (23)
35 KOG2231|consensus 97.4 0.00024 5.2E-09 81.3 7.0 143 326-514 100-260 (669)
36 KOG1830|consensus 97.4 0.0041 8.8E-08 65.8 15.3 7 7-13 246-252 (518)
37 KOG2231|consensus 97.3 0.0012 2.5E-08 75.9 11.0 46 606-667 100-146 (669)
38 COG5189 SFP1 Putative transcri 97.3 9.1E-05 2E-09 74.9 1.4 58 742-799 346-418 (423)
39 PF00096 zf-C2H2: Zinc finger, 97.2 9.8E-05 2.1E-09 45.7 0.7 23 746-768 1-23 (23)
40 KOG1923|consensus 97.2 0.0012 2.5E-08 75.3 9.1 6 223-228 405-410 (830)
41 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00022 4.7E-09 63.1 1.9 72 748-830 2-73 (100)
42 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00024 5.1E-09 62.9 1.9 72 720-802 2-73 (100)
43 COG5189 SFP1 Putative transcri 97.1 0.00021 4.5E-09 72.4 1.4 53 777-829 347-420 (423)
44 PF13894 zf-C2H2_4: C2H2-type 97.0 0.00033 7.2E-09 43.7 1.6 21 781-801 2-22 (24)
45 PF13894 zf-C2H2_4: C2H2-type 97.0 0.00029 6.2E-09 44.0 1.0 24 808-831 1-24 (24)
46 PF13912 zf-C2H2_6: C2H2-type 96.9 0.0005 1.1E-08 44.4 1.5 23 780-802 2-24 (27)
47 COG5236 Uncharacterized conser 96.7 0.0014 3E-08 67.2 3.8 137 634-780 152-316 (493)
48 PF13912 zf-C2H2_6: C2H2-type 96.6 0.001 2.2E-08 42.9 1.3 26 807-832 1-26 (27)
49 KOG4672|consensus 96.4 0.029 6.3E-07 59.4 11.4 9 223-231 475-483 (487)
50 COG5236 Uncharacterized conser 96.4 0.0028 6E-08 65.1 3.4 91 729-833 200-307 (493)
51 PF09237 GAGA: GAGA factor; I 96.3 0.0022 4.7E-08 46.6 1.8 30 777-806 22-51 (54)
52 smart00355 ZnF_C2H2 zinc finge 96.3 0.0032 6.8E-08 39.9 2.3 22 781-802 2-23 (26)
53 PRK04860 hypothetical protein; 95.9 0.0035 7.6E-08 59.8 1.7 40 778-821 118-157 (160)
54 PF13909 zf-H2C2_5: C2H2-type 95.8 0.0033 7.1E-08 39.2 0.7 23 808-831 1-23 (24)
55 KOG2785|consensus 95.8 0.014 3E-07 61.8 5.7 51 779-829 166-242 (390)
56 smart00355 ZnF_C2H2 zinc finge 95.7 0.0074 1.6E-07 38.1 2.1 24 746-769 1-24 (26)
57 PF12874 zf-met: Zinc-finger o 95.6 0.0062 1.3E-07 38.4 1.3 23 780-802 1-23 (25)
58 KOG2482|consensus 95.3 0.036 7.7E-07 57.4 6.2 77 746-829 280-356 (423)
59 PF09237 GAGA: GAGA factor; I 95.2 0.016 3.6E-07 42.2 2.5 30 631-660 22-51 (54)
60 PF13909 zf-H2C2_5: C2H2-type 95.0 0.012 2.6E-07 36.6 1.2 22 780-802 1-22 (24)
61 KOG2482|consensus 94.9 0.035 7.5E-07 57.4 4.9 65 674-738 128-216 (423)
62 KOG2785|consensus 94.8 0.077 1.7E-06 56.3 7.2 79 325-403 3-90 (390)
63 PF12874 zf-met: Zinc-finger o 94.7 0.022 4.7E-07 35.8 1.9 22 382-403 1-22 (25)
64 PRK04860 hypothetical protein; 94.7 0.018 3.8E-07 55.1 2.1 37 745-791 119-155 (160)
65 PF12171 zf-C2H2_jaz: Zinc-fin 94.6 0.016 3.5E-07 37.3 1.0 22 780-801 2-23 (27)
66 PF12171 zf-C2H2_jaz: Zinc-fin 93.7 0.045 9.7E-07 35.2 1.8 22 808-829 2-23 (27)
67 KOG2893|consensus 92.4 0.11 2.4E-06 50.9 3.2 45 748-802 13-58 (341)
68 KOG2893|consensus 92.3 0.042 9E-07 53.8 0.2 46 782-831 13-58 (341)
69 KOG4849|consensus 91.4 3.4 7.4E-05 43.3 12.6 13 222-234 360-372 (498)
70 cd07765 KRAB_A-box KRAB (Krupp 90.0 0.13 2.8E-06 34.4 0.8 35 200-234 2-36 (40)
71 KOG4173|consensus 89.2 0.095 2.1E-06 50.3 -0.5 84 717-803 79-171 (253)
72 TIGR00622 ssl1 transcription f 89.1 0.94 2E-05 40.0 5.6 54 747-814 57-110 (112)
73 PF13913 zf-C2HC_2: zinc-finge 87.4 0.49 1.1E-05 29.7 2.0 19 635-654 4-22 (25)
74 PF13913 zf-C2HC_2: zinc-finge 87.0 0.46 9.9E-06 29.9 1.7 18 747-765 4-21 (25)
75 smart00451 ZnF_U1 U1-like zinc 86.9 0.42 9E-06 32.7 1.7 22 779-800 3-24 (35)
76 KOG0132|consensus 86.0 20 0.00044 42.2 15.1 20 207-226 772-791 (894)
77 TIGR00622 ssl1 transcription f 85.3 2.5 5.3E-05 37.5 6.0 48 719-768 57-104 (112)
78 smart00451 ZnF_U1 U1-like zinc 85.3 0.67 1.5E-05 31.7 2.1 22 297-318 3-24 (35)
79 KOG4173|consensus 84.6 0.43 9.4E-06 46.0 1.1 75 296-373 78-166 (253)
80 COG4049 Uncharacterized protei 84.0 0.32 6.9E-06 36.2 -0.0 30 803-832 13-42 (65)
81 KOG1925|consensus 82.9 2.4 5.2E-05 46.6 5.9 16 148-163 301-316 (817)
82 KOG2675|consensus 82.3 0.84 1.8E-05 49.5 2.3 21 148-168 283-304 (480)
83 cd00350 rubredoxin_like Rubred 81.2 1 2.2E-05 30.5 1.5 10 806-815 16-25 (33)
84 KOG1984|consensus 81.2 1.1E+02 0.0024 37.1 18.6 46 294-345 335-380 (1007)
85 PF12013 DUF3505: Protein of u 79.9 1.8 4E-05 38.7 3.3 24 780-803 81-108 (109)
86 PF12013 DUF3505: Protein of u 77.5 2.8 6.1E-05 37.5 3.8 26 717-742 80-109 (109)
87 cd00350 rubredoxin_like Rubred 76.8 1.4 3E-05 29.8 1.2 10 718-727 2-11 (33)
88 PF02892 zf-BED: BED zinc fing 74.4 1.3 2.8E-05 32.3 0.6 12 780-791 17-28 (45)
89 PF09986 DUF2225: Uncharacteri 72.8 1.2 2.6E-05 45.3 0.1 44 777-820 3-61 (214)
90 KOG2461|consensus 69.6 6.7 0.00015 43.6 5.0 206 583-815 188-395 (396)
91 PF01213 CAP_N: Adenylate cycl 67.6 1.7 3.8E-05 46.5 0.0 10 151-160 284-294 (312)
92 cd00729 rubredoxin_SM Rubredox 64.9 4 8.7E-05 27.8 1.4 10 780-789 3-12 (34)
93 COG4049 Uncharacterized protei 64.4 3 6.6E-05 31.2 0.7 27 629-655 13-39 (65)
94 PF09538 FYDLN_acid: Protein o 64.3 5.2 0.00011 35.5 2.3 10 719-728 11-20 (108)
95 PF03276 Gag_spuma: Spumavirus 64.1 42 0.00092 38.1 9.7 11 206-216 371-381 (582)
96 KOG1701|consensus 62.6 3.7 8E-05 44.6 1.3 40 439-478 276-315 (468)
97 KOG2186|consensus 61.8 3.2 7E-05 41.8 0.7 50 662-714 4-53 (276)
98 PF04959 ARS2: Arsenite-resist 60.5 2.4 5.2E-05 42.6 -0.5 30 804-833 74-103 (214)
99 smart00614 ZnF_BED BED zinc fi 57.2 5.7 0.00012 29.8 1.1 20 781-800 20-44 (50)
100 PF02892 zf-BED: BED zinc fing 57.1 6.8 0.00015 28.5 1.5 22 744-765 15-40 (45)
101 KOG1701|consensus 56.5 3.1 6.7E-05 45.2 -0.5 38 299-338 276-315 (468)
102 PF13717 zinc_ribbon_4: zinc-r 56.1 9.2 0.0002 26.5 1.9 11 328-338 5-15 (36)
103 PF04625 DEC-1_N: DEC-1 protei 56.1 62 0.0014 33.9 8.6 12 2-13 17-28 (407)
104 PF09986 DUF2225: Uncharacteri 55.4 7.8 0.00017 39.4 2.2 20 605-624 5-24 (214)
105 smart00614 ZnF_BED BED zinc fi 54.3 8.6 0.00019 28.8 1.7 24 745-768 18-47 (50)
106 KOG2186|consensus 54.1 6.4 0.00014 39.7 1.3 46 354-402 4-49 (276)
107 PRK09678 DNA-binding transcrip 53.9 4.9 0.00011 32.7 0.3 41 780-822 2-44 (72)
108 PRK14890 putative Zn-ribbon RN 53.1 13 0.00028 28.7 2.4 8 746-753 26-33 (59)
109 PF09538 FYDLN_acid: Protein o 52.7 9.5 0.00021 33.9 2.0 8 608-615 12-19 (108)
110 KOG1984|consensus 52.7 4.5E+02 0.0098 32.2 15.7 28 346-373 353-380 (1007)
111 PF15449 Retinal: Retinal prot 52.7 4.3E+02 0.0094 33.1 15.8 20 153-172 1197-1216(1287)
112 KOG4377|consensus 52.2 6.1 0.00013 42.7 0.8 19 785-803 409-427 (480)
113 TIGR02098 MJ0042_CXXC MJ0042 f 52.1 8.3 0.00018 26.9 1.2 11 299-309 4-14 (38)
114 TIGR00373 conserved hypothetic 52.0 12 0.00026 35.9 2.8 33 713-754 105-137 (158)
115 KOG2461|consensus 51.7 23 0.0005 39.4 5.2 59 525-583 186-244 (396)
116 PF01690 PLRV_ORF5: Potato lea 51.6 12 0.00025 41.8 2.9 6 241-246 106-111 (465)
117 COG5188 PRP9 Splicing factor 3 51.6 10 0.00022 40.0 2.3 27 404-430 369-396 (470)
118 smart00531 TFIIE Transcription 50.5 14 0.00031 35.0 2.9 36 715-755 97-133 (147)
119 KOG0162|consensus 50.5 2.2E+02 0.0047 33.8 12.5 127 1-127 923-1052(1106)
120 PF06524 NOA36: NOA36 protein; 50.4 14 0.0003 37.4 2.9 25 604-628 208-232 (314)
121 KOG4377|consensus 50.0 12 0.00025 40.6 2.4 23 808-830 402-426 (480)
122 smart00659 RPOLCX RNA polymera 49.3 11 0.00024 27.5 1.5 27 780-817 3-29 (44)
123 TIGR02300 FYDLN_acid conserved 48.9 14 0.0003 33.5 2.3 13 779-791 26-38 (129)
124 PF05290 Baculo_IE-1: Baculovi 48.5 9.9 0.00022 34.5 1.4 12 809-820 123-134 (140)
125 PF13719 zinc_ribbon_5: zinc-r 48.3 12 0.00026 26.1 1.5 9 328-336 5-13 (37)
126 TIGR02605 CxxC_CxxC_SSSS putat 48.0 11 0.00023 28.5 1.4 11 578-588 6-16 (52)
127 smart00349 KRAB krueppel assoc 48.0 14 0.0003 26.3 2.1 35 200-234 2-36 (61)
128 TIGR00373 conserved hypothetic 47.9 16 0.00034 35.2 2.8 34 321-363 105-138 (158)
129 PF10571 UPF0547: Uncharacteri 47.7 15 0.00033 23.3 1.7 9 747-755 16-24 (26)
130 PRK00398 rpoP DNA-directed RNA 47.0 6.7 0.00015 28.8 0.1 28 780-817 4-31 (46)
131 smart00834 CxxC_CXXC_SSSS Puta 46.9 12 0.00027 26.4 1.5 11 606-616 6-16 (41)
132 COG5151 SSL1 RNA polymerase II 46.4 20 0.00042 37.3 3.3 23 746-768 389-411 (421)
133 PRK00464 nrdR transcriptional 45.1 8.5 0.00018 36.6 0.5 16 606-621 29-44 (154)
134 COG1996 RPC10 DNA-directed RNA 44.9 11 0.00023 28.1 0.9 29 779-817 6-34 (49)
135 PRK06266 transcription initiat 44.6 17 0.00036 35.7 2.5 33 715-756 115-147 (178)
136 PF05443 ROS_MUCR: ROS/MUCR tr 44.5 9.8 0.00021 35.1 0.8 24 690-716 73-96 (132)
137 COG2331 Uncharacterized protei 43.6 9.5 0.0002 30.8 0.5 32 779-818 12-44 (82)
138 PRK06266 transcription initiat 43.3 20 0.00043 35.2 2.8 33 322-363 114-146 (178)
139 PRK12270 kgd alpha-ketoglutara 42.8 86 0.0019 38.8 8.2 16 212-227 233-249 (1228)
140 smart00531 TFIIE Transcription 42.6 23 0.0005 33.6 3.1 37 323-363 97-133 (147)
141 PRK00464 nrdR transcriptional 42.5 13 0.00028 35.5 1.3 14 578-591 29-42 (154)
142 COG1198 PriA Primosomal protei 42.5 18 0.00039 43.6 2.8 12 407-418 473-484 (730)
143 PRK12270 kgd alpha-ketoglutara 41.2 1E+02 0.0022 38.1 8.5 6 240-245 270-275 (1228)
144 COG1592 Rubrerythrin [Energy p 41.0 17 0.00036 35.0 1.8 23 779-814 134-156 (166)
145 TIGR02300 FYDLN_acid conserved 40.9 20 0.00042 32.5 2.1 35 779-824 9-43 (129)
146 PF05290 Baculo_IE-1: Baculovi 40.6 13 0.00028 33.8 0.9 17 714-730 77-93 (140)
147 PF06524 NOA36: NOA36 protein; 39.7 17 0.00036 37.0 1.6 94 684-802 137-232 (314)
148 PRK09678 DNA-binding transcrip 38.9 13 0.00029 30.2 0.7 19 603-621 25-45 (72)
149 PF09723 Zn-ribbon_8: Zinc rib 38.5 19 0.00041 25.9 1.3 11 606-616 6-16 (42)
150 PHA00626 hypothetical protein 38.2 13 0.00029 28.1 0.5 14 806-819 22-35 (59)
151 PF05443 ROS_MUCR: ROS/MUCR tr 37.8 17 0.00036 33.6 1.2 27 777-806 70-96 (132)
152 COG1592 Rubrerythrin [Energy p 37.7 21 0.00046 34.3 1.9 23 717-752 134-156 (166)
153 PF07800 DUF1644: Protein of u 37.5 82 0.0018 29.8 5.6 56 779-836 80-137 (162)
154 KOG4307|consensus 36.5 1.9E+02 0.0042 34.0 9.3 6 201-206 325-330 (944)
155 KOG2071|consensus 36.2 26 0.00056 40.3 2.6 41 687-727 416-523 (579)
156 smart00734 ZnF_Rad18 Rad18-lik 35.2 34 0.00075 21.7 2.0 19 663-682 3-21 (26)
157 KOG3408|consensus 34.8 25 0.00055 31.3 1.8 35 285-319 45-79 (129)
158 KOG4124|consensus 34.4 12 0.00026 39.5 -0.3 27 805-831 396-422 (442)
159 COG1997 RPL43A Ribosomal prote 34.0 24 0.00051 29.6 1.4 11 746-756 54-64 (89)
160 PF15269 zf-C2H2_7: Zinc-finge 33.8 23 0.0005 25.3 1.1 23 779-801 20-42 (54)
161 KOG0119|consensus 33.5 6.8E+02 0.015 28.6 12.6 122 18-139 397-540 (554)
162 COG2888 Predicted Zn-ribbon RN 32.3 34 0.00074 26.4 1.9 8 746-753 28-35 (61)
163 PF15269 zf-C2H2_7: Zinc-finge 32.2 31 0.00067 24.7 1.5 23 465-487 20-42 (54)
164 PF03604 DNA_RNApol_7kD: DNA d 32.0 27 0.00059 23.5 1.2 8 719-726 2-9 (32)
165 KOG2593|consensus 32.0 36 0.00077 37.6 2.7 38 713-753 124-161 (436)
166 PRK04023 DNA polymerase II lar 31.4 46 0.001 40.9 3.7 9 662-670 627-635 (1121)
167 KOG1044|consensus 30.1 19 0.00041 41.0 0.3 10 550-559 161-170 (670)
168 TIGR01384 TFS_arch transcripti 29.9 25 0.00053 31.1 1.0 10 382-391 17-26 (104)
169 PRK14873 primosome assembly pr 29.8 36 0.00078 41.0 2.6 7 746-752 411-417 (665)
170 COG4957 Predicted transcriptio 29.6 28 0.0006 31.7 1.2 25 690-717 77-101 (148)
171 KOG4167|consensus 29.3 11 0.00024 43.8 -1.6 27 778-804 791-817 (907)
172 KOG3408|consensus 28.9 19 0.0004 32.1 0.0 24 744-767 56-79 (129)
173 PRK04023 DNA polymerase II lar 28.5 56 0.0012 40.2 3.8 10 325-334 626-635 (1121)
174 KOG2272|consensus 28.1 53 0.0011 33.2 2.9 17 382-398 100-116 (332)
175 TIGR00595 priA primosomal prot 27.7 47 0.001 38.7 3.0 10 744-753 239-248 (505)
176 PF12907 zf-met2: Zinc-binding 26.5 17 0.00038 25.7 -0.5 26 808-833 2-30 (40)
177 KOG2272|consensus 26.3 36 0.00078 34.4 1.4 17 437-453 221-237 (332)
178 cd00730 rubredoxin Rubredoxin; 26.0 51 0.0011 24.8 1.9 11 606-616 2-12 (50)
179 KOG2593|consensus 26.0 61 0.0013 35.9 3.2 16 493-508 128-143 (436)
180 PF01363 FYVE: FYVE zinc finge 25.6 32 0.00069 27.6 0.8 27 780-818 10-36 (69)
181 KOG2807|consensus 25.2 1.1E+02 0.0023 32.6 4.6 18 661-678 290-307 (378)
182 PF13878 zf-C2H2_3: zinc-finge 24.9 77 0.0017 22.6 2.6 24 298-321 14-39 (41)
183 COG1198 PriA Primosomal protei 24.7 35 0.00077 41.2 1.3 49 718-816 436-484 (730)
184 PF08274 PhnA_Zn_Ribbon: PhnA 24.1 31 0.00068 22.8 0.4 24 781-815 4-27 (30)
185 KOG4124|consensus 23.7 51 0.0011 35.1 2.1 26 292-317 344-371 (442)
186 PF15135 UPF0515: Uncharacteri 23.7 62 0.0013 32.9 2.5 13 465-477 155-167 (278)
187 COG4957 Predicted transcriptio 23.4 44 0.00096 30.4 1.3 24 780-806 77-100 (148)
188 PF12760 Zn_Tnp_IS1595: Transp 23.2 81 0.0018 23.1 2.5 8 718-725 19-26 (46)
189 KOG0696|consensus 23.2 30 0.00065 38.0 0.3 35 634-668 93-128 (683)
190 PF09416 UPF1_Zn_bind: RNA hel 22.5 61 0.0013 30.7 2.1 15 657-671 10-24 (152)
191 PF14353 CpXC: CpXC protein 22.5 26 0.00057 32.2 -0.3 22 605-626 38-59 (128)
192 PRK03564 formate dehydrogenase 22.3 57 0.0012 35.0 2.1 78 716-820 186-265 (309)
193 KOG4167|consensus 21.4 24 0.00052 41.3 -0.9 26 380-405 791-816 (907)
194 PF00301 Rubredoxin: Rubredoxi 21.4 62 0.0013 24.0 1.5 7 466-472 35-41 (47)
195 PF08790 zf-LYAR: LYAR-type C2 21.4 30 0.00066 22.4 -0.1 17 327-344 2-18 (28)
196 KOG2807|consensus 20.7 1.1E+02 0.0023 32.5 3.6 91 293-416 272-374 (378)
197 PTZ00303 phosphatidylinositol 20.5 29 0.00063 40.7 -0.4 13 298-310 461-473 (1374)
198 COG5151 SSL1 RNA polymerase II 20.5 43 0.00093 34.9 0.8 55 746-814 363-417 (421)
199 PF07975 C1_4: TFIIH C1-like d 20.2 44 0.00096 25.2 0.6 26 716-741 20-45 (51)
200 KOG1044|consensus 20.0 88 0.0019 35.9 3.0 43 545-589 12-54 (670)
No 1
>KOG1074|consensus
Probab=99.97 E-value=2.8e-32 Score=300.49 Aligned_cols=248 Identities=23% Similarity=0.453 Sum_probs=173.8
Q ss_pred CCcCccccCcccchhhhhcccccccCCCcccCCCCCCCcccccccccccccccCC----CCccCC---ccCccCCChhHH
Q psy16259 577 PHKCNYCAKSFTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHVRQHTGE----SPHKCS---YCKQSFTRKDHL 649 (896)
Q Consensus 577 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~----~~~~C~---~C~~~F~~~~~L 649 (896)
+-.|-+|-+...-++.|+.|+|+|+||+||+|++||++|.++.+|+.|+-+|... ..+.|+ +|-+.|.+.-.|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 5689999999999999999999999999999999999999999999999988754 357899 999999999999
Q ss_pred HHHHHhhcCC-C------------CcccccCCcccCChHHHHHHHhhhcCC----------------C----CCCccccc
Q psy16259 650 NNHVRLHTGE-S------------PHKCHFCSKSFTRKEHLNNHIRLHTGE----------------S----PHKCDFCL 696 (896)
Q Consensus 650 ~~H~~~h~~~-~------------~~~C~~C~k~f~~~~~L~~H~~~H~~~----------------~----~~~C~~C~ 696 (896)
..|+++|.+. . .-+|..|.+.|.....+..++..|.+. . +..+..|+
T Consensus 685 pQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~ 764 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCG 764 (958)
T ss_pred cceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccc
Confidence 9999999842 1 146999999999988888888776332 2 34566666
Q ss_pred ccccChhHHHHHhhhh---cCCCccccccCCCCCCChhhHhhhhhcccCCCCc-cCCCCCCCCCCchhHHH-HH------
Q psy16259 697 KTFTRKEHLTNHLKQH---AGETEHVCNICSKPFLKKEHLVNHSRSHTGERPF-ACTDCGKSFPLKGNLLF-HQ------ 765 (896)
Q Consensus 697 k~f~~~~~L~~H~~~h---~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~-~C~~C~~~F~~~~~l~~-H~------ 765 (896)
..+.....+..+-..+ ... +.++.....=..+...++++++. .+.+++..-...-+..- =+
T Consensus 765 ~~~~~e~~i~~~g~te~asa~~--------~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~ 836 (958)
T KOG1074|consen 765 RELEGEMAISVRGSTEEASANL--------DEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSM 836 (958)
T ss_pred cccCcccccccccchhhhhcCh--------hhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccc
Confidence 6665554444332111 000 00111111111222334555555 55555543332211110 00
Q ss_pred -----------HhhcCCC----------------Cc--cccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCcc
Q psy16259 766 -----------RSHRKQN----------------GE--VLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKS 816 (896)
Q Consensus 766 -----------~~H~~~~----------------~~--~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~ 816 (896)
.++-+.. +. .......|.+||+.|...++|..|||+|+|+|+|.|.+|++.
T Consensus 837 l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~a 916 (958)
T KOG1074|consen 837 LNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEA 916 (958)
T ss_pred cccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhh
Confidence 0000000 00 001126699999999999999999999999999999999999
Q ss_pred ccchhhHHhhhccccC
Q psy16259 817 FVEHTCMIKHIRKNHT 832 (896)
Q Consensus 817 f~~~~~l~~H~~~~H~ 832 (896)
|.++.+|+.||.+|+-
T Consensus 917 FttrgnLKvHMgtH~w 932 (958)
T KOG1074|consen 917 FTTRGNLKVHMGTHMW 932 (958)
T ss_pred hhhhhhhhhhhccccc
Confidence 9999999999998443
No 2
>KOG1074|consensus
Probab=99.97 E-value=5.3e-32 Score=298.30 Aligned_cols=208 Identities=24% Similarity=0.478 Sum_probs=146.9
Q ss_pred CccCCccCccCCChhHHHHHHHhhcCCCCcccccCCcccCChHHHHHHHhhhcCC----CCCCcc---cccccccChhHH
Q psy16259 633 PHKCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIRLHTGE----SPHKCD---FCLKTFTRKEHL 705 (896)
Q Consensus 633 ~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~----~~~~C~---~C~k~f~~~~~L 705 (896)
+-+|-+|-+....++.|+.|.|+|+||+||+|.+||++|+++.+|+.||-+|... ..|.|. +|-+.|.....|
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~~~ic~~kftn~V~l 684 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHKAKPPARVQFSCPSTFICQKKFTNAVTL 684 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcccccccCccccccccCCchhhhcccccccccc
Confidence 4678888888888888888888888888888888888888888888888887653 347888 888888888888
Q ss_pred HHHhhhhcCCC-------------ccccccCCCCCCChhhHhhhhhcc----------------cCCC----CccCCCCC
Q psy16259 706 TNHLKQHAGET-------------EHVCNICSKPFLKKEHLVNHSRSH----------------TGER----PFACTDCG 752 (896)
Q Consensus 706 ~~H~~~h~~~~-------------~~~C~~C~~~f~~~~~L~~H~~~H----------------~~~k----~~~C~~C~ 752 (896)
..|+++|.+.. .-+|.+|.+.|.....+..++--| +++. +..+..|+
T Consensus 685 pQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~ 764 (958)
T KOG1074|consen 685 PQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSCG 764 (958)
T ss_pred cceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccccccc
Confidence 88888887321 246888888887777776665444 3333 57788999
Q ss_pred CCCCCchhHHHHHHhhcCC-----------------CCccccCCc-cCCCCCcccCCchhHH----H-------------
Q psy16259 753 KSFPLKGNLLFHQRSHRKQ-----------------NGEVLERPF-KCDVCPKDFTCKGHLV----S------------- 797 (896)
Q Consensus 753 ~~F~~~~~l~~H~~~H~~~-----------------~~~~~~k~~-~C~~C~k~f~~~~~L~----~------------- 797 (896)
..+.....+..+--.+... .....+++. .+.+++..-...-... .
T Consensus 765 ~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~ 844 (958)
T KOG1074|consen 765 RELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATK 844 (958)
T ss_pred cccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccccccc
Confidence 9998887777664433100 001224566 6777765443322111 0
Q ss_pred -HhhhhcC------------------------CCCccCCccCccccchhhHHhhhccccCCCcCCCCCC
Q psy16259 798 -HLRSHLN------------------------ERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNVSPM 841 (896)
Q Consensus 798 -H~~~H~~------------------------~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~ 841 (896)
=..++-+ .....|..|++.|...+.|+.|||+ |++++...+.+
T Consensus 845 ~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rT-Htg~KPF~C~f 912 (958)
T KOG1074|consen 845 TNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRT-HTGPKPFFCHF 912 (958)
T ss_pred cccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhc-CCCCCCccchh
Confidence 0001100 0237899999999999999999999 99988755444
No 3
>KOG3608|consensus
Probab=99.94 E-value=1.8e-27 Score=237.62 Aligned_cols=187 Identities=24% Similarity=0.532 Sum_probs=113.8
Q ss_pred cCccCCChhHHHHHHHhhcCCCCcccccCCcccCChHHHHHHHhhhc--CCCCCCcccccccccChhHHHHHhhhhcCCC
Q psy16259 639 CKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIRLHT--GESPHKCDFCLKTFTRKEHLTNHLKQHAGET 716 (896)
Q Consensus 639 C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~--~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~ 716 (896)
|.+.|.++..|++|+|.|+++|...|+.||.-|+++..|..|++..+ ...+|.|..|.|.|.+...|+.|+..|-+
T Consensus 185 Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn-- 262 (467)
T KOG3608|consen 185 CTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRHVN-- 262 (467)
T ss_pred hhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHhhh--
Confidence 66666666666666666666666666666666666666666665433 23456666666666666666666666543
Q ss_pred ccccccCCCCCCChhhHhhhhhc-ccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCC--CCcccCCch
Q psy16259 717 EHVCNICSKPFLKKEHLVNHSRS-HTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDV--CPKDFTCKG 793 (896)
Q Consensus 717 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~--C~k~f~~~~ 793 (896)
-|+|..|+.+....+.|..|++. |..+|||+|+.|++.|.+.++|.+|+.+|.. --|.|+. |...|++..
T Consensus 263 ~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS~-------~~y~C~h~~C~~s~r~~~ 335 (467)
T KOG3608|consen 263 CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHSK-------TVYQCEHPDCHYSVRTYT 335 (467)
T ss_pred cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhccc-------cceecCCCCCcHHHHHHH
Confidence 36666666666666666666653 5556666666666666666666666666542 3466665 666666666
Q ss_pred hHHHHhhhhc-C--CCCccCCccCccccchhhHHhhhccccCCC
Q psy16259 794 HLVSHLRSHL-N--ERPHVCPHCSKSFVEHTCMIKHIRKNHTLA 834 (896)
Q Consensus 794 ~L~~H~~~H~-~--~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 834 (896)
.|++|++.|+ | +-+|.|-.|++.|.+..+|..|++++|...
T Consensus 336 q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 336 QMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred HHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 6666665433 3 234666666666666666666666666543
No 4
>KOG2462|consensus
Probab=99.93 E-value=2.9e-27 Score=232.34 Aligned_cols=133 Identities=33% Similarity=0.660 Sum_probs=104.4
Q ss_pred CCCcccccccccChhHHHHHhhhhcC---CCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHH
Q psy16259 689 PHKCDFCLKTFTRKEHLTNHLKQHAG---ETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQ 765 (896)
Q Consensus 689 ~~~C~~C~k~f~~~~~L~~H~~~h~~---~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~ 765 (896)
.|+|..|||.+.+.++|.+|.+.|-. .+.+.|++|||.|.+..+|..|+|+|+ -+++|.+|||.|.....|.-|+
T Consensus 130 r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHi 207 (279)
T KOG2462|consen 130 RYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGHI 207 (279)
T ss_pred ceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhccc
Confidence 46666666666666666666666643 456778888888888888888888876 5678888888888888888888
Q ss_pred HhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHHhhhcc
Q psy16259 766 RSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRK 829 (896)
Q Consensus 766 ~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 829 (896)
|+|+| ||||.|..|+|+|..+++|+.||++|.+.|.|+|..|+|.|..++.|.+|...
T Consensus 208 RTHTG------EKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 208 RTHTG------EKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred ccccC------CCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 88887 78888888888888888888888888888888888888888888888888765
No 5
>KOG2462|consensus
Probab=99.93 E-value=4.4e-27 Score=231.10 Aligned_cols=134 Identities=34% Similarity=0.683 Sum_probs=106.6
Q ss_pred CCcccccCCcccCChHHHHHHHhhhcC---CCCCCcccccccccChhHHHHHhhhhcCCCccccccCCCCCCChhhHhhh
Q psy16259 660 SPHKCHFCSKSFTRKEHLNNHIRLHTG---ESPHKCDFCLKTFTRKEHLTNHLKQHAGETEHVCNICSKPFLKKEHLVNH 736 (896)
Q Consensus 660 ~~~~C~~C~k~f~~~~~L~~H~~~H~~---~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H 736 (896)
..|+|..|||.+.+.++|.+|..+|-. .+.+.|.+|||.|.+-..|+.|+++|+ -+++|.+|||.|.+...|+-|
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQGH 206 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQGH 206 (279)
T ss_pred CceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhhcc
Confidence 347777777777777777777777642 456788888888888888888888887 568888888888888888888
Q ss_pred hhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhh
Q psy16259 737 SRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRS 801 (896)
Q Consensus 737 ~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~ 801 (896)
+|+|||||||.|..|+|.|..+++|+.||++|.+ .|+|+|..|+|+|..++.|.+|...
T Consensus 207 iRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~------~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 207 IRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSD------VKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccccCCCCccCCcccchhcchHHHHHHHHhhcC------CccccCcchhhHHHHHHHHHHhhhh
Confidence 8888888888888888888888888888888877 6788888888888888888888653
No 6
>KOG3608|consensus
Probab=99.93 E-value=1.8e-26 Score=230.49 Aligned_cols=207 Identities=25% Similarity=0.500 Sum_probs=182.5
Q ss_pred ccCC--CCCCCcccccccccccccccCCCCccCCccCccCCChhHHHHHHHhhc--CCCCcccccCCcccCChHHHHHHH
Q psy16259 606 YQCQ--YCPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKDHLNNHVRLHT--GESPHKCHFCSKSFTRKEHLNNHI 681 (896)
Q Consensus 606 ~~C~--~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~F~~~~~L~~H~~~h~--~~~~~~C~~C~k~f~~~~~L~~H~ 681 (896)
+.|. -|-+.|.++..|++|++.|+++|-..|+.||.-|.++..|-.|++.-+ ...+|.|..|.|.|.+...|..|+
T Consensus 178 ~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv 257 (467)
T KOG3608|consen 178 TMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV 257 (467)
T ss_pred eeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH
Confidence 4565 488999999999999999999999999999999999999999987543 456899999999999999999999
Q ss_pred hhhcCCCCCCcccccccccChhHHHHHhh-hhcCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCC--CCCCCCCc
Q psy16259 682 RLHTGESPHKCDFCLKTFTRKEHLTNHLK-QHAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTD--CGKSFPLK 758 (896)
Q Consensus 682 ~~H~~~~~~~C~~C~k~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~--C~~~F~~~ 758 (896)
+.|-+ -|+|+.|+.+....+.|.+||+ .|..+|+|+|+.|++.|.+.++|.+|..+|. +-.|.|+. |.++|++.
T Consensus 258 ~rHvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~ 334 (467)
T KOG3608|consen 258 VRHVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTY 334 (467)
T ss_pred HHhhh--cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHH
Confidence 99864 5999999999999999999987 5788999999999999999999999999998 77899998 99999999
Q ss_pred hhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhh-hh-----cCCCCccCCccCcccc
Q psy16259 759 GNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLR-SH-----LNERPHVCPHCSKSFV 818 (896)
Q Consensus 759 ~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~-~H-----~~~~~~~C~~C~~~f~ 818 (896)
..|++|++.++..+. +-+|+|..|++.|++-.+|.+|++ .| .|-++|.-..|.-+|.
T Consensus 335 ~q~~~H~~evhEg~n---p~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~~PsGh~RFtYk~~edG~m 397 (467)
T KOG3608|consen 335 TQMRRHFLEVHEGNN---PILYACHCCDRFFTSGKSLSAHLMKKHGFRLPSGHKRFTYKVDEDGFM 397 (467)
T ss_pred HHHHHHHHHhccCCC---CCceeeecchhhhccchhHHHHHHHhhcccCCCCCCceeeeeccCcee
Confidence 999999998774322 568999999999999999999975 55 3566777778877774
No 7
>KOG3623|consensus
Probab=99.88 E-value=4.6e-23 Score=223.52 Aligned_cols=106 Identities=32% Similarity=0.656 Sum_probs=89.3
Q ss_pred CcccCccccccCChHHHhhhhcccc--CCCccccCccCcccCChHHHHHHHhhccC-------------CCCeecCcccc
Q psy16259 325 PFTCHVCGKAFDFQAELMSHGKCHL--GGSMYTCAVCFHVFANEESLQRHTKRHSI-------------DRPYCCSVCMK 389 (896)
Q Consensus 325 ~~~C~~C~~~f~~~~~L~~H~~~h~--~~~~~~C~~C~~~f~~~~~L~~H~~~h~~-------------~~~~~C~~C~~ 389 (896)
...|.+|++.+.....|..|++..+ .+..|.|..|..+|..+..|.+||.+|.. .+.|+|.+|+|
T Consensus 210 lltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgK 289 (1007)
T KOG3623|consen 210 LLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGK 289 (1007)
T ss_pred hhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccch
Confidence 3679999999999999999986543 45568899999999999999999988753 26688999999
Q ss_pred ccCChHHHHHHHhhccCCCceecCCCCCcccchhhhhhhhh
Q psy16259 390 TFARKEHLENHTRCHTGETPYKCQYCGKNFSRKEHMVNHVR 430 (896)
Q Consensus 390 ~F~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 430 (896)
.|+.+-.|+.|+|+|.|+|+|.|+.|+|.|.....+..||.
T Consensus 290 AFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 290 AFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 99989999999999999999999999999988888888874
No 8
>KOG3623|consensus
Probab=99.88 E-value=3.4e-23 Score=224.59 Aligned_cols=119 Identities=27% Similarity=0.612 Sum_probs=94.0
Q ss_pred cccCCCCCccCCHHHHHHHHhhc--CCCCCcccCccCCccCChHHHHHHHHhhcCCCCcccCccCCcccchhhhhhhhhc
Q psy16259 466 HVCQVCGKKYTRKEHLANHMRSH--TNDTPFKCEICTKSFTRKEHFTNHIMWHTGETPHRCDFCSKTFTRKEHLLNHVRQ 543 (896)
Q Consensus 466 ~~C~~C~~~f~~~~~L~~H~~~h--~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~ 543 (896)
..|+.|++.+.....|+.|++.- ..+..|.|..|.++|..+..|.+|+.+|.....- ..++..
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dq---------------a~sltq 275 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQ---------------AISLTQ 275 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcc---------------cccccc
Confidence 56888888888888888887642 3455688888888888888888888887542211 022222
Q ss_pred cCCCCCccCCccCcccCChhhhhcccccccCCCCCcCccccCcccchhhhhccccc
Q psy16259 544 HTGESPHKCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTRKEHLVNHVRQ 599 (896)
Q Consensus 544 h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~ 599 (896)
-...+.|+|.+|||.|+.+..|++|+|+|.|++||.|..|+|.|....++..||..
T Consensus 276 sa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 276 SALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 33456799999999999999999999999999999999999999999999999853
No 9
>KOG3576|consensus
Probab=99.69 E-value=5.3e-18 Score=157.48 Aligned_cols=119 Identities=29% Similarity=0.517 Sum_probs=89.5
Q ss_pred CCCCCcccccccccChhHHHHHhhhhcCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHH
Q psy16259 687 ESPHKCDFCLKTFTRKEHLTNHLKQHAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQR 766 (896)
Q Consensus 687 ~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~ 766 (896)
...|.|.+|+|.|.-...|.+||+-|...+.|-|..|||.|....+|++|+|+|+|.+||+|..|++.|..+-.|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 34577888888888888888888888888888888888888888888888888888888888888888888888888877
Q ss_pred hhcCCCC-----ccccCCccCCCCCcccCCchhHHHHhhhhcCC
Q psy16259 767 SHRKQNG-----EVLERPFKCDVCPKDFTCKGHLVSHLRSHLNE 805 (896)
Q Consensus 767 ~H~~~~~-----~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~ 805 (896)
.-++... +..+|-|.|+.||.+-.....+..|++.|+..
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 6555321 12345677777777777777777777766543
No 10
>KOG3576|consensus
Probab=99.67 E-value=1.2e-17 Score=155.04 Aligned_cols=111 Identities=33% Similarity=0.622 Sum_probs=82.2
Q ss_pred CCCccCCccCccCCChhHHHHHHHhhcCCCCcccccCCcccCChHHHHHHHhhhcCCCCCCcccccccccChhHHHHHhh
Q psy16259 631 ESPHKCSYCKQSFTRKDHLNNHVRLHTGESPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFCLKTFTRKEHLTNHLK 710 (896)
Q Consensus 631 ~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~ 710 (896)
...|.|.+|+|.|.-...|++|++.|...+.|-|..|||.|...-+|++|+|+|+|.+||+|..|+|.|+.+-.|..|++
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl~ 194 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHLK 194 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHHH
Confidence 44578888888888888888888888888888888888888888888888888888888888888888888888888877
Q ss_pred hhcCC-----------CccccccCCCCCCChhhHhhhhhccc
Q psy16259 711 QHAGE-----------TEHVCNICSKPFLKKEHLVNHSRSHT 741 (896)
Q Consensus 711 ~h~~~-----------~~~~C~~C~~~f~~~~~L~~H~~~H~ 741 (896)
..+|. |-|+|+.||.+-...+.+..|++.|+
T Consensus 195 kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 195 KVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHH 236 (267)
T ss_pred HHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcC
Confidence 65553 33445555554444444444444443
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.28 E-value=4.6e-12 Score=141.66 Aligned_cols=101 Identities=23% Similarity=0.426 Sum_probs=53.9
Q ss_pred ccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccC--------
Q psy16259 719 VCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFT-------- 790 (896)
Q Consensus 719 ~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~-------- 790 (896)
.|++|++.|. ...|..|+++|+ ++|.|. ||+.| .+..|..|+++|.. ++++.|.+|++.|.
T Consensus 455 ~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp------~Kpi~C~fC~~~v~~g~~~~d~ 523 (567)
T PLN03086 455 HCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCP------LRLITCRFCGDMVQAGGSAMDV 523 (567)
T ss_pred cCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCC------CCceeCCCCCCccccCccccch
Confidence 4555555553 344555555543 445555 55433 34555555555544 45555555555552
Q ss_pred --CchhHHHHhhhhcCCCCccCCccCccccchhhHHhhhccccC
Q psy16259 791 --CKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRKNHT 832 (896)
Q Consensus 791 --~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 832 (896)
....|..|+.++ |.+++.|..||+.|.. ..|..|+...|.
T Consensus 524 ~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrl-rdm~~H~~~~h~ 565 (567)
T PLN03086 524 RDRLRGMSEHESIC-GSRTAPCDSCGRSVML-KEMDIHQIAVHQ 565 (567)
T ss_pred hhhhhhHHHHHHhc-CCcceEccccCCeeee-hhHHHHHHHhhc
Confidence 123566666654 6666666666665543 345666655553
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.18 E-value=3.9e-11 Score=134.35 Aligned_cols=97 Identities=23% Similarity=0.550 Sum_probs=43.0
Q ss_pred ccccCCcccCChHHHHHHHhhhcCCCCCCcccccccccChhHHHHHhhhhcCCCccccccCCCCCC----------Chhh
Q psy16259 663 KCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFCLKTFTRKEHLTNHLKQHAGETEHVCNICSKPFL----------KKEH 732 (896)
Q Consensus 663 ~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~----------~~~~ 732 (896)
.|..|++.|. ...|..|+++|+ ++|.|. ||+.+ .+..|..|+.+|.+++++.|..|++.|. ....
T Consensus 455 ~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~ 529 (567)
T PLN03086 455 HCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRG 529 (567)
T ss_pred cCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhh
Confidence 3444444442 333444444432 344444 44322 3344444444444444444444444442 1234
Q ss_pred HhhhhhcccCCCCccCCCCCCCCCCchhHHHHHH
Q psy16259 733 LVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQR 766 (896)
Q Consensus 733 L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~ 766 (896)
|..|+.+. |.+++.|..||+.|.. ..|..|+.
T Consensus 530 Lt~HE~~C-G~rt~~C~~Cgk~Vrl-rdm~~H~~ 561 (567)
T PLN03086 530 MSEHESIC-GSRTAPCDSCGRSVML-KEMDIHQI 561 (567)
T ss_pred HHHHHHhc-CCcceEccccCCeeee-hhHHHHHH
Confidence 55555543 5555555555554442 23445544
No 13
>KOG1146|consensus
Probab=99.18 E-value=9.9e-10 Score=129.81 Aligned_cols=109 Identities=14% Similarity=0.230 Sum_probs=82.4
Q ss_pred ccccccCCCCCCChhhHhhhhhc--cc--CCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCc
Q psy16259 717 EHVCNICSKPFLKKEHLVNHSRS--HT--GERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCK 792 (896)
Q Consensus 717 ~~~C~~C~~~f~~~~~L~~H~~~--H~--~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~ 792 (896)
...|..|...|.....+..|-+- +- ....+.|..|++.|.....+. |+-. +.+|.|.+|...|...
T Consensus 1228 gl~~~~~e~~f~~~~~~~~~a~~~~~~~~~sGe~~c~~~~~~~~~~~~~~-~l~~---------~~~~~~~~~~~~~~~~ 1297 (1406)
T KOG1146|consen 1228 GLLPNALEQPFPQEPEPTATAPPKPPELPASGEGECGAVDELLTPSFGIS-TLDV---------THRYLCRQCKMAFDGE 1297 (1406)
T ss_pred cccHHhhhcCccCcccccccCCCCCCcCcCCCcchhhhccccccCcccee-eccc---------chhHHHHHHHhhhcch
Confidence 46788888888888888777542 11 112388999999988888777 6643 4568899999999999
Q ss_pred hhHHHHhhhh-----------------cCCCCccCCccCccccchhhHHhhhccccCCCcC
Q psy16259 793 GHLVSHLRSH-----------------LNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANV 836 (896)
Q Consensus 793 ~~L~~H~~~H-----------------~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~ 836 (896)
..|..|.+.- -..+.| |..|...|.....|+.|||..|...+.
T Consensus 1298 ~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c~~~~~~~~alqihm~~~~~~~kt 1357 (1406)
T KOG1146|consen 1298 APLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLACEVLLSGREALQIHMRSSAHRRKT 1357 (1406)
T ss_pred hHHHHHHHHHHhccCccccCCCCcccCcccccc-chHHHhhcchhHHHHHHHHHhhhcccC
Confidence 9998888532 223456 999999999999999999987665443
No 14
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=99.10 E-value=1.1e-11 Score=87.67 Aligned_cols=41 Identities=15% Similarity=0.021 Sum_probs=26.5
Q ss_pred eeeeeeeecccccccccCChhhhhHhhhhccccccccccccccccc
Q psy16259 199 QQQSDLIKTNGTGDVNPDEFNQLLAYHEVFGKLQSSVVTHTSSTLG 244 (896)
Q Consensus 199 ~~~~dv~~~~~~~e~~~l~~~q~~~y~~v~~e~~~~~~~~~~sslg 244 (896)
|+|+||+++|+++||..|+..||.+|+|+|+||+.+ +.++|
T Consensus 1 Vtf~Dvav~fs~eEW~~L~~~Qk~ly~dvm~Eny~~-----l~slG 41 (41)
T PF01352_consen 1 VTFEDVAVYFSQEEWELLDPAQKNLYRDVMLENYRN-----LVSLG 41 (41)
T ss_dssp ------TT---HHHHHTS-HHHHHHHHHHHHHTTTS--------S-
T ss_pred CeEEEEEEEcChhhcccccceecccchhHHHHhhcc-----cEecC
Confidence 689999999999999999999999999999999999 66665
No 15
>PHA00733 hypothetical protein
Probab=99.06 E-value=1.2e-10 Score=106.62 Aligned_cols=86 Identities=21% Similarity=0.362 Sum_probs=55.0
Q ss_pred CCCccCCCCCCCCCCchhHHHH--HHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccch
Q psy16259 743 ERPFACTDCGKSFPLKGNLLFH--QRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEH 820 (896)
Q Consensus 743 ~k~~~C~~C~~~F~~~~~l~~H--~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~ 820 (896)
.+.+.|.+|.+.|.....|..| ++.|.... .+++|.|+.||+.|.+...|..|++.| +.+|.|..|++.|...
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~---~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~ 112 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSK---AVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNT 112 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccC---CCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCH
Confidence 3445566666555555555554 22221110 146777777777777777777777766 3467788888888888
Q ss_pred hhHHhhhccccCC
Q psy16259 821 TCMIKHIRKNHTL 833 (896)
Q Consensus 821 ~~l~~H~~~~H~~ 833 (896)
..|.+|+++.|..
T Consensus 113 ~sL~~H~~~~h~~ 125 (128)
T PHA00733 113 DSTLDHVCKKHNI 125 (128)
T ss_pred HHHHHHHHHhcCc
Confidence 8888887777753
No 16
>PHA00733 hypothetical protein
Probab=98.99 E-value=4.2e-10 Score=103.09 Aligned_cols=80 Identities=21% Similarity=0.290 Sum_probs=34.1
Q ss_pred CCCCCcccccccccChhHHHHHh--h---hhcCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhH
Q psy16259 687 ESPHKCDFCLKTFTRKEHLTNHL--K---QHAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNL 761 (896)
Q Consensus 687 ~~~~~C~~C~k~f~~~~~L~~H~--~---~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l 761 (896)
.+++.|.+|.+.|.....|..|. + .++++++|.|.+|++.|.+...|..|+++| +.+|.|.+|++.|.....|
T Consensus 38 ~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL 115 (128)
T PHA00733 38 QKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDST 115 (128)
T ss_pred hhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHH
Confidence 34455555555555544443331 1 112233444444444444444444444433 1234444444444444444
Q ss_pred HHHHHhh
Q psy16259 762 LFHQRSH 768 (896)
Q Consensus 762 ~~H~~~H 768 (896)
..|++..
T Consensus 116 ~~H~~~~ 122 (128)
T PHA00733 116 LDHVCKK 122 (128)
T ss_pred HHHHHHh
Confidence 4444433
No 17
>KOG3993|consensus
Probab=98.94 E-value=7.8e-11 Score=122.30 Aligned_cols=199 Identities=16% Similarity=0.191 Sum_probs=134.7
Q ss_pred CccCCccCccCCChhHHHHHH--HhhcCCCCcccccCCcccCChHHHHHHHhhhcCCCCCCc-c--cccccccChhHHHH
Q psy16259 633 PHKCSYCKQSFTRKDHLNNHV--RLHTGESPHKCHFCSKSFTRKEHLNNHIRLHTGESPHKC-D--FCLKTFTRKEHLTN 707 (896)
Q Consensus 633 ~~~C~~C~~~F~~~~~L~~H~--~~h~~~~~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C-~--~C~k~f~~~~~L~~ 707 (896)
.|.|.+|...|...-.|.+|. ||-+. .|+|.+|+|.|+...+|..|.|+|.-..--.= . -=.+.-.+....+.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~v--EYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHV--EYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEe--eecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 499999999999999999996 44433 49999999999999999999999963210000 0 00000000111110
Q ss_pred Hhhh--hcCCCccccccCCCCCCChhhHhhhhhcccCCC-----------------CccCCCCCCCCCCchhHHHHHHhh
Q psy16259 708 HLKQ--HAGETEHVCNICSKPFLKKEHLVNHSRSHTGER-----------------PFACTDCGKSFPLKGNLLFHQRSH 768 (896)
Q Consensus 708 H~~~--h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k-----------------~~~C~~C~~~F~~~~~l~~H~~~H 768 (896)
=.|. -..+.-|.|.+|+|.|++...|++|..+|+... -+.|..|+-.|.....--.|...+
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~ 424 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV 424 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee
Confidence 0010 012346899999999999999999988776321 134555655555443333333333
Q ss_pred cCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHHhhhccccCCCcCCC
Q psy16259 769 RKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRKNHTLANVNV 838 (896)
Q Consensus 769 ~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~ 838 (896)
.+. +..-.|..||..+.++..--.|.|.-.-+.-|.|.+|--.|-+..+|.+|+.+.|..+.-++
T Consensus 425 a~s-----ael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse~rqv 489 (500)
T KOG3993|consen 425 AGS-----AELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSELRQV 489 (500)
T ss_pred ecc-----ccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHHhhhh
Confidence 221 22345899999999998888888777778889999999999999999999999998776544
No 18
>KOG3993|consensus
Probab=98.81 E-value=7.2e-10 Score=115.27 Aligned_cols=53 Identities=23% Similarity=0.506 Sum_probs=35.9
Q ss_pred CcccCccccccCChHHHhhhhccccCCCccccCccCcccCChHHHHHHHhhcc
Q psy16259 325 PFTCHVCGKAFDFQAELMSHGKCHLGGSMYTCAVCFHVFANEESLQRHTKRHS 377 (896)
Q Consensus 325 ~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 377 (896)
.|.|..|...|..--.|.+|.--..-...|+|.+|+++|.-..+|..|.|-|.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK 319 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK 319 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence 37788888777777777777533333345777777777777777777766553
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.80 E-value=1.7e-09 Score=80.55 Aligned_cols=43 Identities=19% Similarity=0.485 Sum_probs=34.7
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHH
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMI 824 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~ 824 (896)
|+|+.||+.|....+|..||++|+ ++|+|..|++.|.+.+.|.
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 778888888888888888888887 6888888888887776664
No 20
>PHA02768 hypothetical protein; Provisional
Probab=98.62 E-value=1.4e-08 Score=75.81 Aligned_cols=23 Identities=26% Similarity=0.752 Sum_probs=10.5
Q ss_pred ccCCCCCCCCCCchhHHHHHHhh
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSH 768 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H 768 (896)
|.|+.||+.|...++|..|+|+|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H 28 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKH 28 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhc
Confidence 44444444444444444444444
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.54 E-value=3.8e-08 Score=62.75 Aligned_cols=26 Identities=46% Similarity=0.950 Sum_probs=21.0
Q ss_pred hHHHHhhhhcCCCCccCCccCccccc
Q psy16259 794 HLVSHLRSHLNERPHVCPHCSKSFVE 819 (896)
Q Consensus 794 ~L~~H~~~H~~~~~~~C~~C~~~f~~ 819 (896)
+|++||++|+|+++|.|++|+++|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47888888888888888888888863
No 22
>KOG1146|consensus
Probab=98.39 E-value=7e-07 Score=106.38 Aligned_cols=95 Identities=22% Similarity=0.334 Sum_probs=67.7
Q ss_pred ccCccCcccCChHHHHHHHh-hccCCCCeecCccccccCChHHHHHHHhhccCCCceecCCCCCcccchhhhhhhhhc--
Q psy16259 355 TCAVCFHVFANEESLQRHTK-RHSIDRPYCCSVCMKTFARKEHLENHTRCHTGETPYKCQYCGKNFSRKEHMVNHVRK-- 431 (896)
Q Consensus 355 ~C~~C~~~f~~~~~L~~H~~-~h~~~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~-- 431 (896)
.|..|+..+.....+.-|+. .+...+.|.|..|+..|+....|..|||..+-+..- .+|.. .+.|.+.
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~--~~c~~-------gq~~~~~ar 508 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS--AYCKA-------GQNHPRLAR 508 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch--hHhHh-------ccccccccc
Confidence 47778888888887777765 455668899999999999999999999974433221 33321 1223222
Q ss_pred ----ccCCCccccccccccccChhhhhhhcc
Q psy16259 432 ----HTGETPHRCEICKKAFTRKEHYVNHVM 458 (896)
Q Consensus 432 ----h~~~~~~~C~~C~~~f~~~~~L~~H~~ 458 (896)
-.+.++|.|..|...+..+..|.+|++
T Consensus 509 g~~~~~~~~p~~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 509 GEVYRCPGKPYPCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred cccccCCCCcccceeeeeeeecchHHHHHHH
Confidence 234467899999999999999998875
No 23
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.23 E-value=5e-07 Score=57.58 Aligned_cols=22 Identities=55% Similarity=1.129 Sum_probs=8.6
Q ss_pred HHHHhhhcCCCCCCcccccccc
Q psy16259 678 NNHIRLHTGESPHKCDFCLKTF 699 (896)
Q Consensus 678 ~~H~~~H~~~~~~~C~~C~k~f 699 (896)
.+|+++|+|+++|.|++|+++|
T Consensus 3 ~~H~~~H~~~k~~~C~~C~k~F 24 (26)
T PF13465_consen 3 RRHMRTHTGEKPYKCPYCGKSF 24 (26)
T ss_dssp HHHHHHHSSSSSEEESSSSEEE
T ss_pred HHHhhhcCCCCCCCCCCCcCee
Confidence 3333333333333333333333
No 24
>KOG3671|consensus
Probab=98.21 E-value=3.4e-05 Score=82.87 Aligned_cols=10 Identities=10% Similarity=0.278 Sum_probs=4.1
Q ss_pred ccccCCCCCC
Q psy16259 6 ELSQFPASHK 15 (896)
Q Consensus 6 ~~~~~~~~~~ 15 (896)
.+..++.+..
T Consensus 299 ~~~~~~Sv~~ 308 (569)
T KOG3671|consen 299 NPNGLPSVGQ 308 (569)
T ss_pred CCCCCccccc
Confidence 3344444433
No 25
>KOG3671|consensus
Probab=98.18 E-value=1.9e-05 Score=84.85 Aligned_cols=6 Identities=17% Similarity=0.285 Sum_probs=3.7
Q ss_pred eeeeee
Q psy16259 155 NIGAVL 160 (896)
Q Consensus 155 ~~~~~~ 160 (896)
++++||
T Consensus 494 gR~~Lm 499 (569)
T KOG3671|consen 494 GRDALM 499 (569)
T ss_pred cHHHHH
Confidence 566666
No 26
>PHA00616 hypothetical protein
Probab=98.15 E-value=7.6e-07 Score=63.16 Aligned_cols=39 Identities=21% Similarity=0.248 Sum_probs=31.3
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccc
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSF 817 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 817 (896)
+|+|..||+.|..++.|+.|++.|+|+++|.|++=-..|
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~~y~~f 39 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEYFYIYF 39 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeEEEEEE
Confidence 578888888888888888888888888888887644444
No 27
>PHA00732 hypothetical protein
Probab=98.04 E-value=3.4e-06 Score=69.85 Aligned_cols=21 Identities=33% Similarity=0.439 Sum_probs=9.6
Q ss_pred ccCCCCCCCCCCchhHHHHHH
Q psy16259 746 FACTDCGKSFPLKGNLLFHQR 766 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~ 766 (896)
|.|.+||+.|.+..+|+.|++
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r 22 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHAR 22 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhh
Confidence 444444444444444444444
No 28
>PHA00732 hypothetical protein
Probab=98.04 E-value=3.1e-06 Score=70.02 Aligned_cols=45 Identities=31% Similarity=0.531 Sum_probs=39.9
Q ss_pred CccCCCCCcccCCchhHHHHhhh-hcCCCCccCCccCccccchhhHHhhhcc
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRS-HLNERPHVCPHCSKSFVEHTCMIKHIRK 829 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 829 (896)
+|.|..||+.|.+...|+.|++. |++ +.|+.|++.|. .|..|++.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~ 46 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYS 46 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---Chhhhhcc
Confidence 58999999999999999999984 654 68999999998 58889866
No 29
>PHA00616 hypothetical protein
Probab=97.96 E-value=3.6e-06 Score=59.79 Aligned_cols=33 Identities=15% Similarity=0.370 Sum_probs=18.0
Q ss_pred cccccCCcccCChHHHHHHHhhhcCCCCCCccc
Q psy16259 662 HKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDF 694 (896)
Q Consensus 662 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~ 694 (896)
|+|..||+.|.++++|..|++.|+|++++.|+.
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 455555555555555555555555555555544
No 30
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=97.72 E-value=6.9e-05 Score=86.30 Aligned_cols=55 Identities=47% Similarity=0.968 Sum_probs=28.9
Q ss_pred CCeecCccccccCChHHHHHHHhhccCCCceecCC--CCCcccchhhhhhhhhcccC
Q psy16259 380 RPYCCSVCMKTFARKEHLENHTRCHTGETPYKCQY--CGKNFSRKEHMVNHVRKHTG 434 (896)
Q Consensus 380 ~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~ 434 (896)
+.+.|..|...|.....+..|.+.+.+++++.|.. |...|.....+.+|.+.++.
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (467)
T COG5048 32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHN 88 (467)
T ss_pred chhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhcccccc
Confidence 34455555555555555555555555555555543 33444445555555554443
No 31
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.63 E-value=4.7e-05 Score=58.52 Aligned_cols=49 Identities=35% Similarity=0.733 Sum_probs=23.1
Q ss_pred ccCCCCCcccCCchhHHHHhh-hhcCC-CCccCCccCccccchhhHHhhhcccc
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLR-SHLNE-RPHVCPHCSKSFVEHTCMIKHIRKNH 831 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~-~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~H 831 (896)
|.|++|++. .+...|..|.. .|.++ +.+.|++|...+. .+|.+||+..|
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 455555553 23345555543 33332 3355555555433 25555555544
No 32
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=97.61 E-value=0.00012 Score=84.18 Aligned_cols=57 Identities=37% Similarity=0.795 Sum_probs=45.6
Q ss_pred CCccccCCCCCccCCHHHHHHHHhhcCCCCCcccCc--cCCccCChHHHHHHHHhhcCC
Q psy16259 463 ETPHVCQVCGKKYTRKEHLANHMRSHTNDTPFKCEI--CTKSFTRKEHFTNHIMWHTGE 519 (896)
Q Consensus 463 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~--C~~~f~~~~~L~~H~~~h~~~ 519 (896)
...+.|..|...|.....+..|.+.+.+++++.|.. |...|.....+.+|.+.+...
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (467)
T COG5048 31 PRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNN 89 (467)
T ss_pred CchhhcccccccccccchhhhhcccccccCCccccccccccccCCcchhhhhccccccc
Confidence 356788999999999888999999999988888866 566777778888887776543
No 33
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.57 E-value=9.9e-05 Score=56.73 Aligned_cols=51 Identities=27% Similarity=0.618 Sum_probs=30.9
Q ss_pred ccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhc
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHL 803 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~ 803 (896)
|.|++|++. .+...|..|....+... .+.+.|++|...+. .+|..||+.++
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~----~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSE----SKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCC----CCCccCCCchhhhh--hHHHHHHHHhc
Confidence 677777773 44566777766544321 23567777776544 37777776554
No 34
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.50 E-value=4.5e-05 Score=47.21 Aligned_cols=22 Identities=41% Similarity=0.978 Sum_probs=11.5
Q ss_pred ccCCCCCcccCCchhHHHHhhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRS 801 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~ 801 (896)
|+|..|++.|.++..|++||+.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555544
No 35
>KOG2231|consensus
Probab=97.43 E-value=0.00024 Score=81.34 Aligned_cols=143 Identities=22% Similarity=0.540 Sum_probs=80.6
Q ss_pred cccCccccccCChHHHhhhhccccCCCccccCccCcccCChHHHHHHHhhccCCCCeecCccccc---------cCChHH
Q psy16259 326 FTCHVCGKAFDFQAELMSHGKCHLGGSMYTCAVCFHVFANEESLQRHTKRHSIDRPYCCSVCMKT---------FARKEH 396 (896)
Q Consensus 326 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~---------F~~~~~ 396 (896)
+.|.+|+++|.... ..-.|..| -.|.+...|+.|+...| +.+.|.+|-.. ..++..
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H--~~~~c~lC~~~~kif~~e~k~Yt~~e 164 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQH--KLHLCSLCLQNLKIFINERKLYTRAE 164 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhh--hhhccccccccceeeeeeeehehHHH
Confidence 56888888774321 11257788 77888888888885433 24666666321 123444
Q ss_pred HHHHHhhccCCCceecCCCCCcccchhhhhhhhhcccCCCccccccccccccChhhhhhhcccccCCCccccCCC-----
Q psy16259 397 LENHTRCHTGETPYKCQYCGKNFSRKEHMVNHVRKHTGETPHRCEICKKAFTRKEHYVNHVMWHTGETPHVCQVC----- 471 (896)
Q Consensus 397 L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C----- 471 (896)
|..|++.-. . +++.|.. --.|..|...|.....|.+|++.++ |.|..|
T Consensus 165 l~~h~~~gd-------~-d~~s~rG---------------hp~C~~C~~~fld~~el~rH~~~~h----~~chfC~~~~~ 217 (669)
T KOG2231|consen 165 LNLHLMFGD-------P-DDESCRG---------------HPLCKFCHERFLDDDELYRHLRFDH----EFCHFCDYKTG 217 (669)
T ss_pred HHHHHhcCC-------C-ccccccC---------------CccchhhhhhhccHHHHHHhhccce----eheeecCcccc
Confidence 555543211 0 1121111 1247777777777777777776654 445555
Q ss_pred -CCccCCHHHHHHHHhhcCCCCCcccC--ccC-CccCChHHHHHHHH
Q psy16259 472 -GKKYTRKEHLANHMRSHTNDTPFKCE--ICT-KSFTRKEHFTNHIM 514 (896)
Q Consensus 472 -~~~f~~~~~L~~H~~~h~~~~~~~C~--~C~-~~f~~~~~L~~H~~ 514 (896)
+..|....+|..|.+.++ |.|+ .|. +.|.....+..|++
T Consensus 218 ~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 218 QNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEIELK 260 (669)
T ss_pred cchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHHHHH
Confidence 345667777888877655 6666 563 33443334444444
No 36
>KOG1830|consensus
Probab=97.43 E-value=0.0041 Score=65.84 Aligned_cols=7 Identities=29% Similarity=0.629 Sum_probs=3.0
Q ss_pred cccCCCC
Q psy16259 7 LSQFPAS 13 (896)
Q Consensus 7 ~~~~~~~ 13 (896)
++++|+.
T Consensus 246 ~~~lp~~ 252 (518)
T KOG1830|consen 246 YSSLPPN 252 (518)
T ss_pred hhccCCC
Confidence 3344444
No 37
>KOG2231|consensus
Probab=97.34 E-value=0.0012 Score=75.89 Aligned_cols=46 Identities=30% Similarity=0.769 Sum_probs=26.1
Q ss_pred ccCCCCCCCcccccccccccccccCCCCccCCccCccCCChhHHHHHHH-hhcCCCCcccccC
Q psy16259 606 YQCQYCPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKDHLNNHVR-LHTGESPHKCHFC 667 (896)
Q Consensus 606 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~F~~~~~L~~H~~-~h~~~~~~~C~~C 667 (896)
+.|.+|++.|.-.. ..-.|..| -.|.+...|+.|++ .|. .+.|.+|
T Consensus 100 ~~C~~C~~~~~~~~------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~---~~~c~lC 146 (669)
T KOG2231|consen 100 HSCHICDRRFRALY------------NKKECLHC-TEFKSVENLKNHMRDQHK---LHLCSLC 146 (669)
T ss_pred hhcCccccchhhhc------------ccCCCccc-cchhHHHHHHHHHHHhhh---hhccccc
Confidence 56777776653221 11256777 66777777777774 342 2445544
No 38
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.29 E-value=9.1e-05 Score=74.91 Aligned_cols=58 Identities=24% Similarity=0.603 Sum_probs=37.1
Q ss_pred CCCCccCCC--CCCCCCCchhHHHHHHhhc-CC------C------CccccCCccCCCCCcccCCchhHHHHh
Q psy16259 742 GERPFACTD--CGKSFPLKGNLLFHQRSHR-KQ------N------GEVLERPFKCDVCPKDFTCKGHLVSHL 799 (896)
Q Consensus 742 ~~k~~~C~~--C~~~F~~~~~l~~H~~~H~-~~------~------~~~~~k~~~C~~C~k~f~~~~~L~~H~ 799 (896)
++|||+|++ |.|.+++...|+.|+.-=| .. . -.+..|||+|++|+|.++....|+-|.
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecc
Confidence 469999987 9999999999999986321 10 0 011235666666666666666666553
No 39
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.25 E-value=9.8e-05 Score=45.67 Aligned_cols=23 Identities=52% Similarity=0.911 Sum_probs=13.4
Q ss_pred ccCCCCCCCCCCchhHHHHHHhh
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSH 768 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H 768 (896)
|.|++|++.|.++.+|+.|+++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45666666666666666665543
No 40
>KOG1923|consensus
Probab=97.21 E-value=0.0012 Score=75.30 Aligned_cols=6 Identities=33% Similarity=0.778 Sum_probs=2.2
Q ss_pred Hhhhhc
Q psy16259 223 AYHEVF 228 (896)
Q Consensus 223 ~y~~v~ 228 (896)
.++++|
T Consensus 405 ~~ee~F 410 (830)
T KOG1923|consen 405 RFEEQF 410 (830)
T ss_pred HHHHHH
Confidence 333333
No 41
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.12 E-value=0.00022 Score=63.14 Aligned_cols=72 Identities=25% Similarity=0.491 Sum_probs=15.2
Q ss_pred CCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHHhhh
Q psy16259 748 CTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHI 827 (896)
Q Consensus 748 C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 827 (896)
|..|+..|.+...|..||...|+.. .. ....+.....|..+++.-. ...|.|.+|++.|.+...|+.||
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~---------~~-~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm 70 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFD---------IP-DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHM 70 (100)
T ss_dssp ----------------------------------------------------------SSEEBSSSS-EESSHHHHHHHH
T ss_pred ccccccccccccccccccccccccc---------cc-ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHH
Confidence 5566666666666666665444311 00 1112223333333333211 12466666666666666666666
Q ss_pred ccc
Q psy16259 828 RKN 830 (896)
Q Consensus 828 ~~~ 830 (896)
+.+
T Consensus 71 ~~~ 73 (100)
T PF12756_consen 71 RSK 73 (100)
T ss_dssp HHT
T ss_pred cCc
Confidence 653
No 42
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.10 E-value=0.00024 Score=62.94 Aligned_cols=72 Identities=26% Similarity=0.463 Sum_probs=16.2
Q ss_pred cccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHh
Q psy16259 720 CNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHL 799 (896)
Q Consensus 720 C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~ 799 (896)
|.+|+..|.+...|..||...++...- ....+.....+..+++.-. ...+.|.+|++.|.+...|+.||
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~~-------~~~~~C~~C~~~f~s~~~l~~Hm 70 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKKV-------KESFRCPYCNKTFRSREALQEHM 70 (100)
T ss_dssp ----------------------------------------------------------SSEEBSSSS-EESSHHHHHHHH
T ss_pred ccccccccccccccccccccccccccc----ccccccccccccccccccc-------CCCCCCCccCCCCcCHHHHHHHH
Confidence 566666666666666666544432100 1111223333333333211 22466777777777777777777
Q ss_pred hhh
Q psy16259 800 RSH 802 (896)
Q Consensus 800 ~~H 802 (896)
+.+
T Consensus 71 ~~~ 73 (100)
T PF12756_consen 71 RSK 73 (100)
T ss_dssp HHT
T ss_pred cCc
Confidence 643
No 43
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.08 E-value=0.00021 Score=72.42 Aligned_cols=53 Identities=26% Similarity=0.642 Sum_probs=34.0
Q ss_pred cCCccCCC--CCcccCCchhHHHHhh-hhc------------------CCCCccCCccCccccchhhHHhhhcc
Q psy16259 777 ERPFKCDV--CPKDFTCKGHLVSHLR-SHL------------------NERPHVCPHCSKSFVEHTCMIKHIRK 829 (896)
Q Consensus 777 ~k~~~C~~--C~k~f~~~~~L~~H~~-~H~------------------~~~~~~C~~C~~~f~~~~~l~~H~~~ 829 (896)
+|||+|++ |+|+++....|+-||. -|- .+|||+|++|+|.++....|+-|.+-
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 46777765 7777777777777764 331 13667777777777777777766543
No 44
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.04 E-value=0.00033 Score=43.65 Aligned_cols=21 Identities=38% Similarity=0.906 Sum_probs=8.6
Q ss_pred cCCCCCcccCCchhHHHHhhh
Q psy16259 781 KCDVCPKDFTCKGHLVSHLRS 801 (896)
Q Consensus 781 ~C~~C~k~f~~~~~L~~H~~~ 801 (896)
.|++|++.|.+...|+.|+++
T Consensus 2 ~C~~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 2 QCPICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp E-SSTS-EESSHHHHHHHHHH
T ss_pred CCcCCCCcCCcHHHHHHHHHh
Confidence 344444444444444444443
No 45
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.98 E-value=0.00029 Score=43.96 Aligned_cols=24 Identities=38% Similarity=0.823 Sum_probs=20.6
Q ss_pred ccCCccCccccchhhHHhhhcccc
Q psy16259 808 HVCPHCSKSFVEHTCMIKHIRKNH 831 (896)
Q Consensus 808 ~~C~~C~~~f~~~~~l~~H~~~~H 831 (896)
|.|++|++.|.+...|++|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 789999999999999999999866
No 46
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.89 E-value=0.0005 Score=44.39 Aligned_cols=23 Identities=48% Similarity=0.879 Sum_probs=12.7
Q ss_pred ccCCCCCcccCCchhHHHHhhhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSH 802 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H 802 (896)
|+|.+|++.|.+...|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 45555555555555555555544
No 47
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.70 E-value=0.0014 Score=67.23 Aligned_cols=137 Identities=29% Similarity=0.499 Sum_probs=72.3
Q ss_pred ccCCc--cCccCCChhHHHHHHHhhcCCCCcccccCC---cccC------ChHHHHHHHhhhcCCCCC----Cccccccc
Q psy16259 634 HKCSY--CKQSFTRKDHLNNHVRLHTGESPHKCHFCS---KSFT------RKEHLNNHIRLHTGESPH----KCDFCLKT 698 (896)
Q Consensus 634 ~~C~~--C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~---k~f~------~~~~L~~H~~~H~~~~~~----~C~~C~k~ 698 (896)
|.|+. |......-..|+.|.+..++ .+.|.+|- +.|. ++..|+.|...-..+.-| .|.+|...
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~ 229 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIY 229 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccce
Confidence 44443 44444444455555554333 24444442 2232 344555554432222222 47777777
Q ss_pred ccChhHHHHHhhhhcCCCccccccCCCC-------CCChhhHhhhhhcccCCCCccCCC--CC----CCCCCchhHHHHH
Q psy16259 699 FTRKEHLTNHLKQHAGETEHVCNICSKP-------FLKKEHLVNHSRSHTGERPFACTD--CG----KSFPLKGNLLFHQ 765 (896)
Q Consensus 699 f~~~~~L~~H~~~h~~~~~~~C~~C~~~-------f~~~~~L~~H~~~H~~~k~~~C~~--C~----~~F~~~~~l~~H~ 765 (896)
|.+...|..|+|.-+ -.|-+|++. |++..+|..|.+.-+ |.|.. |. ..|.....|..|+
T Consensus 230 FYdDDEL~~HcR~~H----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h~ 301 (493)
T COG5236 230 FYDDDELRRHCRLRH----EACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEHL 301 (493)
T ss_pred ecChHHHHHHHHhhh----hhhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHHH
Confidence 777777777776543 246666653 666677777765422 44432 21 3577777777777
Q ss_pred HhhcCCCCccccCCc
Q psy16259 766 RSHRKQNGEVLERPF 780 (896)
Q Consensus 766 ~~H~~~~~~~~~k~~ 780 (896)
..-|+.+....++++
T Consensus 302 ~~~h~~~~~~~~~~~ 316 (493)
T COG5236 302 TRFHKVNARLSEIPR 316 (493)
T ss_pred HHHhhcccccCcCCC
Confidence 665555544444433
No 48
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.60 E-value=0.001 Score=42.92 Aligned_cols=26 Identities=27% Similarity=0.659 Sum_probs=23.7
Q ss_pred CccCCccCccccchhhHHhhhccccC
Q psy16259 807 PHVCPHCSKSFVEHTCMIKHIRKNHT 832 (896)
Q Consensus 807 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 832 (896)
+|.|..|++.|.+...|..|++.++.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 58999999999999999999988654
No 49
>KOG4672|consensus
Probab=96.43 E-value=0.029 Score=59.39 Aligned_cols=9 Identities=22% Similarity=0.593 Sum_probs=4.3
Q ss_pred Hhhhhcccc
Q psy16259 223 AYHEVFGKL 231 (896)
Q Consensus 223 ~y~~v~~e~ 231 (896)
+|...|-|.
T Consensus 475 aY~~FMkEM 483 (487)
T KOG4672|consen 475 AYNAFMKEM 483 (487)
T ss_pred HHHHHHHHH
Confidence 455555443
No 50
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=96.35 E-value=0.0028 Score=65.10 Aligned_cols=91 Identities=25% Similarity=0.474 Sum_probs=59.5
Q ss_pred ChhhHhhhhhcccCCCCc----cCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCc-------ccCCchhHHH
Q psy16259 729 KKEHLVNHSRSHTGERPF----ACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPK-------DFTCKGHLVS 797 (896)
Q Consensus 729 ~~~~L~~H~~~H~~~k~~----~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k-------~f~~~~~L~~ 797 (896)
++..|+.|...-..+.-| .|..|...|..-..|.+|+|..|. +|.+|++ -|++-..|..
T Consensus 200 ~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDEL~~HcR~~HE----------~ChICD~v~p~~~QYFK~Y~~Le~ 269 (493)
T COG5236 200 RSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHE----------ACHICDMVGPIRYQYFKSYEDLEA 269 (493)
T ss_pred ecccccccccCCccccCcCCCchhhhccceecChHHHHHHHHhhhh----------hhhhhhccCccchhhhhCHHHHHH
Confidence 445566665543222222 588888888888888888886442 3666654 4677777888
Q ss_pred HhhhhcCCCCccCCc--cC----ccccchhhHHhhhccccCC
Q psy16259 798 HLRSHLNERPHVCPH--CS----KSFVEHTCMIKHIRKNHTL 833 (896)
Q Consensus 798 H~~~H~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~H~~ 833 (896)
|.+. .-|.|.+ |- +.|...-.|+.|+.+.|..
T Consensus 270 HF~~----~hy~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~ 307 (493)
T COG5236 270 HFRN----AHYCCTFQTCRVGKCYVFPYHTELLEHLTRFHKV 307 (493)
T ss_pred Hhhc----CceEEEEEEEecCcEEEeccHHHHHHHHHHHhhc
Confidence 8763 2266643 43 5678888899998887753
No 51
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.34 E-value=0.0022 Score=46.64 Aligned_cols=30 Identities=23% Similarity=0.441 Sum_probs=13.9
Q ss_pred cCCccCCCCCcccCCchhHHHHhhhhcCCC
Q psy16259 777 ERPFKCDVCPKDFTCKGHLVSHLRSHLNER 806 (896)
Q Consensus 777 ~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~ 806 (896)
+.|-.|++|+..+.+..+|++||.++++.+
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 445555555555555555555555554443
No 52
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.28 E-value=0.0032 Score=39.88 Aligned_cols=22 Identities=41% Similarity=0.838 Sum_probs=10.8
Q ss_pred cCCCCCcccCCchhHHHHhhhh
Q psy16259 781 KCDVCPKDFTCKGHLVSHLRSH 802 (896)
Q Consensus 781 ~C~~C~k~f~~~~~L~~H~~~H 802 (896)
+|..|++.|.+...|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 4445555555555555554444
No 53
>PRK04860 hypothetical protein; Provisional
Probab=95.93 E-value=0.0035 Score=59.78 Aligned_cols=40 Identities=23% Similarity=0.488 Sum_probs=30.0
Q ss_pred CCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchh
Q psy16259 778 RPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHT 821 (896)
Q Consensus 778 k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 821 (896)
-+|.|. |++ ....+++|.++|+++++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 357887 877 6667788888888888888888888776543
No 54
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.83 E-value=0.0033 Score=39.21 Aligned_cols=23 Identities=35% Similarity=0.824 Sum_probs=12.3
Q ss_pred ccCCccCccccchhhHHhhhcccc
Q psy16259 808 HVCPHCSKSFVEHTCMIKHIRKNH 831 (896)
Q Consensus 808 ~~C~~C~~~f~~~~~l~~H~~~~H 831 (896)
|+|+.|++... ...|.+|++++|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhC
Confidence 45666666655 556666666544
No 55
>KOG2785|consensus
Probab=95.82 E-value=0.014 Score=61.76 Aligned_cols=51 Identities=20% Similarity=0.415 Sum_probs=38.5
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCC-----------------------CCccCCccC---ccccchhhHHhhhcc
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNE-----------------------RPHVCPHCS---KSFVEHTCMIKHIRK 829 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~-----------------------~~~~C~~C~---~~f~~~~~l~~H~~~ 829 (896)
|-.|-+|++.|.+-..-..||..|+|- .-|.|-.|+ +.|.+....++||+.
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~ 242 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRD 242 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhh
Confidence 456777777777777777777766652 347888888 888888888888874
No 56
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.72 E-value=0.0074 Score=38.10 Aligned_cols=24 Identities=42% Similarity=0.855 Sum_probs=19.3
Q ss_pred ccCCCCCCCCCCchhHHHHHHhhc
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSHR 769 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H~ 769 (896)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 578888888888888888888664
No 57
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.59 E-value=0.0062 Score=38.37 Aligned_cols=23 Identities=48% Similarity=0.987 Sum_probs=14.5
Q ss_pred ccCCCCCcccCCchhHHHHhhhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSH 802 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H 802 (896)
|.|++|++.|.+...|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 45666666666666666666543
No 58
>KOG2482|consensus
Probab=95.28 E-value=0.036 Score=57.37 Aligned_cols=77 Identities=14% Similarity=0.315 Sum_probs=41.6
Q ss_pred ccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHHh
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIK 825 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~ 825 (896)
..|-.|.....+...|..||++-|.-.-.-..+-| +.-|-++-.+..=+|.- .+.-.|-.|+-+|.....|..
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~-----~Ln~YqrvrviNyiRkq--~~~~~c~~cd~~F~~e~~l~~ 352 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDY-----SLNFYQRVRVINYIRKQ--KKKSRCAECDLSFWKEPGLLI 352 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhcccc-----ccchhhhhhHHHHHHHH--hhccccccccccccCcchhhh
Confidence 57888888888888888888764331100000111 22333333333333321 122346667777777777777
Q ss_pred hhcc
Q psy16259 826 HIRK 829 (896)
Q Consensus 826 H~~~ 829 (896)
||..
T Consensus 353 hm~e 356 (423)
T KOG2482|consen 353 HMVE 356 (423)
T ss_pred hccc
Confidence 7754
No 59
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.20 E-value=0.016 Score=42.24 Aligned_cols=30 Identities=20% Similarity=0.516 Sum_probs=15.0
Q ss_pred CCCccCCccCccCCChhHHHHHHHhhcCCC
Q psy16259 631 ESPHKCSYCKQSFTRKDHLNNHVRLHTGES 660 (896)
Q Consensus 631 ~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~ 660 (896)
+.|-.|++|+..+....+|++|+.++++.+
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 445566666666666666666666555544
No 60
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.00 E-value=0.012 Score=36.61 Aligned_cols=22 Identities=36% Similarity=0.789 Sum_probs=10.7
Q ss_pred ccCCCCCcccCCchhHHHHhhhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSH 802 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H 802 (896)
|+|..|+.... +..|++|+++|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 44555555554 55555555544
No 61
>KOG2482|consensus
Probab=94.90 E-value=0.035 Score=57.44 Aligned_cols=65 Identities=26% Similarity=0.405 Sum_probs=37.6
Q ss_pred hHHHHHHHhhhcCC-CCCCccccccccc-ChhHHHHHhhhhcCC----------------------CccccccCCCCCCC
Q psy16259 674 KEHLNNHIRLHTGE-SPHKCDFCLKTFT-RKEHLTNHLKQHAGE----------------------TEHVCNICSKPFLK 729 (896)
Q Consensus 674 ~~~L~~H~~~H~~~-~~~~C~~C~k~f~-~~~~L~~H~~~h~~~----------------------~~~~C~~C~~~f~~ 729 (896)
+..|..|++.-.+. ..-+|-+|...+. .++....|+-..|+- ..+.|-+|.+.|+.
T Consensus 128 ~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifrd 207 (423)
T KOG2482|consen 128 KEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFRD 207 (423)
T ss_pred HHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccCC
Confidence 45566666554321 2357999987764 566677777433321 13556666666666
Q ss_pred hhhHhhhhh
Q psy16259 730 KEHLVNHSR 738 (896)
Q Consensus 730 ~~~L~~H~~ 738 (896)
+..|+.|||
T Consensus 208 kntLkeHMr 216 (423)
T KOG2482|consen 208 KNTLKEHMR 216 (423)
T ss_pred cHHHHHHHH
Confidence 666666655
No 62
>KOG2785|consensus
Probab=94.80 E-value=0.077 Score=56.34 Aligned_cols=79 Identities=24% Similarity=0.379 Sum_probs=39.8
Q ss_pred CcccCccccccCChHHHhhhhccc--c---CCCcccc-CccCcccCChHHHHHHHh---hccCCCCeecCccccccCChH
Q psy16259 325 PFTCHVCGKAFDFQAELMSHGKCH--L---GGSMYTC-AVCFHVFANEESLQRHTK---RHSIDRPYCCSVCMKTFARKE 395 (896)
Q Consensus 325 ~~~C~~C~~~f~~~~~L~~H~~~h--~---~~~~~~C-~~C~~~f~~~~~L~~H~~---~h~~~~~~~C~~C~~~F~~~~ 395 (896)
.|+|..|...|.....-+.|++.- . ..+.+.= ++=-..|..+..-..=.+ .-...-++.|.+|.+.|....
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 377888888888887777787641 1 0000000 000011111100000000 112344677888888888888
Q ss_pred HHHHHHhh
Q psy16259 396 HLENHTRC 403 (896)
Q Consensus 396 ~L~~H~~~ 403 (896)
....|+..
T Consensus 83 a~~~hl~S 90 (390)
T KOG2785|consen 83 AHENHLKS 90 (390)
T ss_pred hHHHHHHH
Confidence 88777754
No 63
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.73 E-value=0.022 Score=35.82 Aligned_cols=22 Identities=32% Similarity=0.880 Sum_probs=12.2
Q ss_pred eecCccccccCChHHHHHHHhh
Q psy16259 382 YCCSVCMKTFARKEHLENHTRC 403 (896)
Q Consensus 382 ~~C~~C~~~F~~~~~L~~H~~~ 403 (896)
|.|++|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4455555555555555555543
No 64
>PRK04860 hypothetical protein; Provisional
Probab=94.72 E-value=0.018 Score=55.08 Aligned_cols=37 Identities=24% Similarity=0.631 Sum_probs=22.3
Q ss_pred CccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCC
Q psy16259 745 PFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTC 791 (896)
Q Consensus 745 ~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~ 791 (896)
+|.|. |++ ....+++|.++|++ +++|+|..|+..|..
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g------~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRG------EAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcC------CccEECCCCCceeEE
Confidence 46665 665 55566666666665 556666666666553
No 65
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.56 E-value=0.016 Score=37.26 Aligned_cols=22 Identities=41% Similarity=0.908 Sum_probs=13.6
Q ss_pred ccCCCCCcccCCchhHHHHhhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRS 801 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~ 801 (896)
|.|.+|++.|.+...|+.|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666653
No 66
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.70 E-value=0.045 Score=35.18 Aligned_cols=22 Identities=23% Similarity=0.611 Sum_probs=20.9
Q ss_pred ccCCccCccccchhhHHhhhcc
Q psy16259 808 HVCPHCSKSFVEHTCMIKHIRK 829 (896)
Q Consensus 808 ~~C~~C~~~f~~~~~l~~H~~~ 829 (896)
|.|.+|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 7899999999999999999985
No 67
>KOG2893|consensus
Probab=92.42 E-value=0.11 Score=50.88 Aligned_cols=45 Identities=31% Similarity=0.663 Sum_probs=28.5
Q ss_pred CCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHH-hhhh
Q psy16259 748 CTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSH-LRSH 802 (896)
Q Consensus 748 C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H-~~~H 802 (896)
|.+|++.|....-|..|++. |-|+|.+|.|..-+--.|..| |.+|
T Consensus 13 cwycnrefddekiliqhqka----------khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----------KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhh----------ccceeeeehhhhccCCCceeehhhhh
Confidence 66777777777777777663 346677776666555566655 3444
No 68
>KOG2893|consensus
Probab=92.34 E-value=0.042 Score=53.76 Aligned_cols=46 Identities=22% Similarity=0.489 Sum_probs=26.3
Q ss_pred CCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHHhhhcccc
Q psy16259 782 CDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMIKHIRKNH 831 (896)
Q Consensus 782 C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H 831 (896)
|-.|++.|....-|.+|++. |-|+|.+|.|..-+--.|..|--.+|
T Consensus 13 cwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 13 CWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhh
Confidence 56666666666666666542 33666666666655555555544444
No 69
>KOG4849|consensus
Probab=91.41 E-value=3.4 Score=43.28 Aligned_cols=13 Identities=0% Similarity=0.286 Sum_probs=8.8
Q ss_pred hHhhhhccccccc
Q psy16259 222 LAYHEVFGKLQSS 234 (896)
Q Consensus 222 ~~y~~v~~e~~~~ 234 (896)
..|+|||..|...
T Consensus 360 AEFEdiM~RNrai 372 (498)
T KOG4849|consen 360 AEFEDIMTRNRAI 372 (498)
T ss_pred HHHHHHHhhcchh
Confidence 3467888877664
No 70
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through
Probab=90.03 E-value=0.13 Score=34.40 Aligned_cols=35 Identities=11% Similarity=-0.053 Sum_probs=31.3
Q ss_pred eeeeeeecccccccccCChhhhhHhhhhccccccc
Q psy16259 200 QQSDLIKTNGTGDVNPDEFNQLLAYHEVFGKLQSS 234 (896)
Q Consensus 200 ~~~dv~~~~~~~e~~~l~~~q~~~y~~v~~e~~~~ 234 (896)
.++|+++.|..++|..++..++..|.+.+.+++..
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 36 (40)
T cd07765 2 TFEDVAVYFSQEEWELLDPAQRDLYRDVMLENYEN 36 (40)
T ss_pred cceeeeeecCHHHHhcCCHHHHHHHHHHHHHhhcc
Confidence 57899999999999999999999999998887755
No 71
>KOG4173|consensus
Probab=89.23 E-value=0.095 Score=50.32 Aligned_cols=84 Identities=20% Similarity=0.403 Sum_probs=45.2
Q ss_pred cccccc--CCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCC----CCccccCCccC--CCCCcc
Q psy16259 717 EHVCNI--CSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQ----NGEVLERPFKC--DVCPKD 788 (896)
Q Consensus 717 ~~~C~~--C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~----~~~~~~k~~~C--~~C~k~ 788 (896)
.|.|++ |-..|.....+..|...-++ ..|..|.+.|.+...|..|+..-|.. ..+.+..-|.| +-|+..
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 466665 55666666666666544333 35777777777777777776421110 00001223566 346666
Q ss_pred cCCchhHHHHhh-hhc
Q psy16259 789 FTCKGHLVSHLR-SHL 803 (896)
Q Consensus 789 f~~~~~L~~H~~-~H~ 803 (896)
|.+...-+.||. +|.
T Consensus 156 FkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhHHHHhcc
Confidence 666666666653 443
No 72
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=89.13 E-value=0.94 Score=40.04 Aligned_cols=54 Identities=24% Similarity=0.460 Sum_probs=32.6
Q ss_pred cCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccC
Q psy16259 747 ACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCS 814 (896)
Q Consensus 747 ~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~ 814 (896)
.|--|...|....... .+ .......|+|..|...|-..-+.-.|...|. |+-|.
T Consensus 57 ~C~~C~~~f~~~~~~~------~~--~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~------CPGC~ 110 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSP------FD--ELKDSHRYVCAVCKNVFCVDCDVFVHESLHC------CPGCI 110 (112)
T ss_pred cccCcCCCCCCccccc------cc--ccccccceeCCCCCCccccccchhhhhhccC------CcCCC
Confidence 4777777776543111 00 0111346788888888877777777777664 66665
No 73
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=87.40 E-value=0.49 Score=29.74 Aligned_cols=19 Identities=32% Similarity=0.826 Sum_probs=10.8
Q ss_pred cCCccCccCCChhHHHHHHH
Q psy16259 635 KCSYCKQSFTRKDHLNNHVR 654 (896)
Q Consensus 635 ~C~~C~~~F~~~~~L~~H~~ 654 (896)
.|+.||+.| ..+.|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 466666666 4555555554
No 74
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=87.02 E-value=0.46 Score=29.87 Aligned_cols=18 Identities=33% Similarity=0.711 Sum_probs=8.2
Q ss_pred cCCCCCCCCCCchhHHHHH
Q psy16259 747 ACTDCGKSFPLKGNLLFHQ 765 (896)
Q Consensus 747 ~C~~C~~~F~~~~~l~~H~ 765 (896)
.|..||+.| ....|.+|+
T Consensus 4 ~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred cCCCCCCEE-CHHHHHHHH
Confidence 344444444 344444444
No 75
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.91 E-value=0.42 Score=32.72 Aligned_cols=22 Identities=32% Similarity=0.644 Sum_probs=15.9
Q ss_pred CccCCCCCcccCCchhHHHHhh
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLR 800 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~ 800 (896)
+|.|++|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4667777777777777777765
No 76
>KOG0132|consensus
Probab=85.96 E-value=20 Score=42.18 Aligned_cols=20 Identities=10% Similarity=-0.006 Sum_probs=10.1
Q ss_pred cccccccccCChhhhhHhhh
Q psy16259 207 TNGTGDVNPDEFNQLLAYHE 226 (896)
Q Consensus 207 ~~~~~e~~~l~~~q~~~y~~ 226 (896)
.|+..-|+-.....|.-|.+
T Consensus 772 ~~g~~~~g~~g~~~Rer~G~ 791 (894)
T KOG0132|consen 772 RFGRRRQGDRGARDRERYGN 791 (894)
T ss_pred cccccccccccchhhhhhcC
Confidence 44445555555555555533
No 77
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=85.32 E-value=2.5 Score=37.47 Aligned_cols=48 Identities=21% Similarity=0.453 Sum_probs=25.7
Q ss_pred ccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhh
Q psy16259 719 VCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSH 768 (896)
Q Consensus 719 ~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H 768 (896)
.|--|.+.|........= .-.....|.|+.|...|-..-++-.|...|
T Consensus 57 ~C~~C~~~f~~~~~~~~~--~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 57 FCFGCQGPFPKPPVSPFD--ELKDSHRYVCAVCKNVFCVDCDVFVHESLH 104 (112)
T ss_pred cccCcCCCCCCccccccc--ccccccceeCCCCCCccccccchhhhhhcc
Confidence 366666666543211100 011223477777777777777777776555
No 78
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=85.30 E-value=0.67 Score=31.66 Aligned_cols=22 Identities=23% Similarity=0.487 Sum_probs=13.5
Q ss_pred ccccCCCCCCCCChHHHHHHhh
Q psy16259 297 NHTCDLCGKFFPYRYQMIVHRR 318 (896)
Q Consensus 297 ~~~C~~C~~~f~~~~~L~~H~~ 318 (896)
.|.|++|++.|.+...+..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 3556666666666666666654
No 79
>KOG4173|consensus
Probab=84.58 E-value=0.43 Score=45.97 Aligned_cols=75 Identities=23% Similarity=0.379 Sum_probs=43.6
Q ss_pred cccccCC--CCCCCCChHHHHHHhhhcCCCCCcccCccccccCChHHHhhhhccc----------cCCCccccC--ccCc
Q psy16259 296 GNHTCDL--CGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSHGKCH----------LGGSMYTCA--VCFH 361 (896)
Q Consensus 296 ~~~~C~~--C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h----------~~~~~~~C~--~C~~ 361 (896)
..+.|.+ |-..|.....+..|...-++ -.|.+|.+.|.+...|..|+..- .|..+|.|- .|+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 3355654 66677776666666544333 24788888888777777776422 234455553 3555
Q ss_pred ccCChHHHHHHH
Q psy16259 362 VFANEESLQRHT 373 (896)
Q Consensus 362 ~f~~~~~L~~H~ 373 (896)
.|.+...-+.|+
T Consensus 155 KFkT~r~RkdH~ 166 (253)
T KOG4173|consen 155 KFKTSRDRKDHM 166 (253)
T ss_pred hhhhhhhhhhHH
Confidence 555555555554
No 80
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=84.04 E-value=0.32 Score=36.16 Aligned_cols=30 Identities=30% Similarity=0.540 Sum_probs=17.5
Q ss_pred cCCCCccCCccCccccchhhHHhhhccccC
Q psy16259 803 LNERPHVCPHCSKSFVEHTCMIKHIRKNHT 832 (896)
Q Consensus 803 ~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 832 (896)
.||--+.|+-||+-|....+..+|+.+.|.
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 455555666666666666666666655553
No 81
>KOG1925|consensus
Probab=82.94 E-value=2.4 Score=46.58 Aligned_cols=16 Identities=19% Similarity=0.164 Sum_probs=8.7
Q ss_pred CccccceeeeeeeccC
Q psy16259 148 QTHLHIQNIGAVLSPS 163 (896)
Q Consensus 148 ~~~~~~~~~~~~~~~~ 163 (896)
+...+-.++|+++-++
T Consensus 301 ~~~r~~~t~W~s~D~~ 316 (817)
T KOG1925|consen 301 SASRPCATLWASLDPV 316 (817)
T ss_pred ccccccchhhhccCcc
Confidence 3344455677766444
No 82
>KOG2675|consensus
Probab=82.31 E-value=0.84 Score=49.47 Aligned_cols=21 Identities=10% Similarity=0.108 Sum_probs=11.3
Q ss_pred Ccccccee-eeeeeccCCCccc
Q psy16259 148 QTHLHIQN-IGAVLSPSYGQHC 168 (896)
Q Consensus 148 ~~~~~~~~-~~~~~~~~~~~~~ 168 (896)
.....+++ +...|-.++++..
T Consensus 283 kVt~dmkThKNP~LR~~~~~~~ 304 (480)
T KOG2675|consen 283 KVTDDMKTHKNPNLRATSGVPA 304 (480)
T ss_pred hCChhhhcccChhhhccCCCCC
Confidence 34445565 7777755544433
No 83
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=81.22 E-value=1 Score=30.53 Aligned_cols=10 Identities=50% Similarity=1.271 Sum_probs=5.9
Q ss_pred CCccCCccCc
Q psy16259 806 RPHVCPHCSK 815 (896)
Q Consensus 806 ~~~~C~~C~~ 815 (896)
.++.|++|+.
T Consensus 16 ~~~~CP~Cg~ 25 (33)
T cd00350 16 APWVCPVCGA 25 (33)
T ss_pred CCCcCcCCCC
Confidence 5556666654
No 84
>KOG1984|consensus
Probab=81.18 E-value=1.1e+02 Score=37.05 Aligned_cols=46 Identities=17% Similarity=0.499 Sum_probs=28.4
Q ss_pred CccccccCCCCCCCCChHHHHHHhhhcCCCCCcccCccccccCChHHHhhhh
Q psy16259 294 VMGNHTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSHG 345 (896)
Q Consensus 294 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 345 (896)
..++.+|..|..- ..--|+.-.+-+.|.|.+|+.+-........|+
T Consensus 335 ~sgPvRC~RCkaY------inPFmqF~~~gr~f~Cn~C~~~n~vp~~yf~~L 380 (1007)
T KOG1984|consen 335 ESGPVRCNRCKAY------INPFMQFIDGGRKFICNFCGSKNQVPDDYFNHL 380 (1007)
T ss_pred CCCCcchhhhhhh------cCcceEEecCCceEEecCCCccccCChhhcccC
Confidence 3456778888431 122244444556688888888777776666665
No 85
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=79.88 E-value=1.8 Score=38.70 Aligned_cols=24 Identities=21% Similarity=0.412 Sum_probs=14.9
Q ss_pred ccC----CCCCcccCCchhHHHHhhhhc
Q psy16259 780 FKC----DVCPKDFTCKGHLVSHLRSHL 803 (896)
Q Consensus 780 ~~C----~~C~k~f~~~~~L~~H~~~H~ 803 (896)
|.| ..|+..+.+...+++|++.++
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 556 666666666666666666554
No 86
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=77.51 E-value=2.8 Score=37.50 Aligned_cols=26 Identities=23% Similarity=0.337 Sum_probs=18.6
Q ss_pred cccc----ccCCCCCCChhhHhhhhhcccC
Q psy16259 717 EHVC----NICSKPFLKKEHLVNHSRSHTG 742 (896)
Q Consensus 717 ~~~C----~~C~~~f~~~~~L~~H~~~H~~ 742 (896)
.|.| ..|++.+.+...|.+|++.++|
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 3677 7777777777777777776654
No 87
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.82 E-value=1.4 Score=29.83 Aligned_cols=10 Identities=30% Similarity=0.906 Sum_probs=4.6
Q ss_pred cccccCCCCC
Q psy16259 718 HVCNICSKPF 727 (896)
Q Consensus 718 ~~C~~C~~~f 727 (896)
|+|.+||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 3444454443
No 88
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=74.37 E-value=1.3 Score=32.32 Aligned_cols=12 Identities=33% Similarity=0.628 Sum_probs=4.4
Q ss_pred ccCCCCCcccCC
Q psy16259 780 FKCDVCPKDFTC 791 (896)
Q Consensus 780 ~~C~~C~k~f~~ 791 (896)
.+|..|++.+..
T Consensus 17 a~C~~C~~~~~~ 28 (45)
T PF02892_consen 17 AKCKYCGKVIKY 28 (45)
T ss_dssp EEETTTTEE---
T ss_pred EEeCCCCeEEee
Confidence 445555554443
No 89
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=72.84 E-value=1.2 Score=45.28 Aligned_cols=44 Identities=32% Similarity=0.699 Sum_probs=29.5
Q ss_pred cCCccCCCCCcccCCchhHHHHhhh-h--c-------CCCC-----ccCCccCccccch
Q psy16259 777 ERPFKCDVCPKDFTCKGHLVSHLRS-H--L-------NERP-----HVCPHCSKSFVEH 820 (896)
Q Consensus 777 ~k~~~C~~C~k~f~~~~~L~~H~~~-H--~-------~~~~-----~~C~~C~~~f~~~ 820 (896)
+|.++|++|++.|.++.-+....|. . + +..| ..|+.||++|...
T Consensus 3 ~k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 3 DKKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CCceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 4667888888888877666655543 1 1 2233 4699999998654
No 90
>KOG2461|consensus
Probab=69.56 E-value=6.7 Score=43.62 Aligned_cols=206 Identities=10% Similarity=-0.079 Sum_probs=98.4
Q ss_pred ccCcccchhhhhcccccccCCCcccCCCCCCCcccccccccccccccCCCCccCCccCccCCChhHHHHHHHhhcCCCCc
Q psy16259 583 CAKSFTRKEHLVNHVRQHTGETPYQCQYCPKAFSRKDHLVNHVRQHTGESPHKCSYCKQSFTRKDHLNNHVRLHTGESPH 662 (896)
Q Consensus 583 C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~ 662 (896)
|...+........+...+.......++.++..+.....+..+...+.+++++.+..+.-.-... +.
T Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~--------------~~ 253 (396)
T KOG2461|consen 188 CHISILTSAMSSNSAEVLDSDSHNLADEESVSLKEQGQLLEPVEVASGPQPVKAGTGHPAVSSS--------------PE 253 (396)
T ss_pred cccccccccCCCcchhhcccccCCccccccccccccccccccccccCcccccccCCCCcccccC--------------cc
Confidence 3334444444445555555556666666666666666666666666666665555443222111 12
Q ss_pred ccccCCcccCChHHHHHHHhhhcCCCCCCcc--cccccccChhHHHHHhhhhcCCCccccccCCCCCCChhhHhhhhhcc
Q psy16259 663 KCHFCSKSFTRKEHLNNHIRLHTGESPHKCD--FCLKTFTRKEHLTNHLKQHAGETEHVCNICSKPFLKKEHLVNHSRSH 740 (896)
Q Consensus 663 ~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~--~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~H 740 (896)
.+..+..++........+......+....|. .|...-........|...-... ++..+.....+..|...+
T Consensus 254 ~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~ 326 (396)
T KOG2461|consen 254 SFVSPSAAGCDSNSPGEQFHTLAREWETLKDDASPSDAPDHGSVTAAPDAPISRR-------ARLSLPKQLVLDQSEVPA 326 (396)
T ss_pred ccCchhhhcCCCCCcccccccccccccccccCCCccccccccccccccCCccccc-------cCcccccccccccccccc
Confidence 2222333333332222222222222222221 1222222222222222211111 233344444555666666
Q ss_pred cCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCc
Q psy16259 741 TGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSK 815 (896)
Q Consensus 741 ~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 815 (896)
++...+.+.++.+.+.....+..|...+.. +..+.+..+...+.....+..++.+|+....+.+..|.+
T Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 395 (396)
T KOG2461|consen 327 TVSVWTGETIPVRTPAGQLIYTQSHSMEVA------EPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMSFVCK 395 (396)
T ss_pred cccccCcCcccccccccccchhhhhhcccC------CCCcccccccccceeccccceeeeeccccccccccccCC
Confidence 666666666666666666666666655543 444444444444444445555556666666666655544
No 91
>PF01213 CAP_N: Adenylate cyclase associated (CAP) N terminal; InterPro: IPR013992 Cyclase-associated proteins (CAPs) are highly conserved actin-binding proteins present in a wide range of organisms including yeast, fly, plants, and mammals. CAPs are multifunctional proteins that contain several structural domains. CAP is involved in species-specific signalling pathways [, , , ]. In Drosophila, CAP functions in Hedgehog-mediated eye development and in establishing oocyte polarity. In Dictyostelium (slim mold), CAP is involved in microfilament reorganisation near the plasma membrane in a PIP2-regulated manner and is required to perpetuate the cAMP relay signal to organise fruitbody formation. In plants, CAP is involved in plant signalling pathways required for co-ordinated organ expansion. In yeast, CAP is involved in adenylate cyclase activation, as well as in vesicle trafficking and endocytosis. In both yeast and mammals, CAPs appear to be involved in recycling G-actin monomers from ADF/cofilins for subsequent rounds of filament assembly [, ]. In mammals, there are two different CAPs (CAP1 and CAP2) that share 64% amino acid identity. All CAPs appear to contain a C-terminal actin-binding domain that regulates actin remodelling in response to cellular signals and is required for normal cellular morphology, cell division, growth and locomotion in eukaryotes. CAP directly regulates actin filament dynamics and has been implicated in a number of complex developmental and morphological processes, including mRNA localisation and the establishment of cell polarity. Actin exists both as globular (G) (monomeric) actin subunits and assembled into filamentous (F) actin. In cells, actin cycles between these two forms. Proteins that bind F-actin often regulate F-actin assembly and its interaction with other proteins, while proteins that interact with G-actin often control the availability of unpolymerised actin. CAPs bind G-actin. In addition to actin-binding, CAPs can have additional roles, and may act as bifunctional proteins. In Saccharomyces cerevisiae (Baker's yeast), CAP is a component of the adenylyl cyclase complex (Cyr1p) that serves as an effector of Ras during normal cell signalling. S. cerevisiae CAP functions to expose adenylate cyclase binding sites to Ras, thereby enabling adenylate cyclase to be activated by Ras regulatory signals. In Schizosaccharomyces pombe (Fission yeast), CAP is also required for adenylate cyclase activity, but not through the Ras pathway. In both organisms, the N-terminal domain is responsible for adenylate cyclase activation, but the S cerevisiae and S. pombe N-termini cannot complement one another. Yeast CAPs are unique among the CAP family of proteins, because they are the only ones to directly interact with and activate adenylate cyclase []. S. cerevisiae CAP has four major domains. In addition to the N-terminal adenylate cyclase-interacting domain, and the C-terminal actin-binding domain, it possesses two other domains: a proline-rich domain that interacts with Src homology 3 (SH3) domains of specific proteins, and a domain that is responsible for CAP oligomerisation to form multimeric complexes (although oligomerisation appears to involve the N- and C-terminal domains as well). The proline-rich domain interacts with profilin, a protein that catalyses nucleotide exchange on G-actin monomers and promotes addition to barbed ends of filamentous F-actin []. Since CAP can bind profilin via a proline-rich domain, and G-actin via a C-terminal domain, it has been suggested that a ternary G-actin/CAP/profilin complex could be formed. This entry represents the N-terminal domain of CAP proteins. This domain has an all-alpha structure consisting of six helices in a bundle with a left-handed twist and an up-and-down topology [].; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1TJF_B 1S0P_A.
Probab=67.62 E-value=1.7 Score=46.47 Aligned_cols=10 Identities=10% Similarity=0.102 Sum_probs=0.0
Q ss_pred cccee-eeeee
Q psy16259 151 LHIQN-IGAVL 160 (896)
Q Consensus 151 ~~~~~-~~~~~ 160 (896)
.+.++ +...|
T Consensus 284 ~~~~THKNP~L 294 (312)
T PF01213_consen 284 KDMMTHKNPAL 294 (312)
T ss_dssp -----------
T ss_pred cchhcccCccc
Confidence 34444 55555
No 92
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=64.94 E-value=4 Score=27.81 Aligned_cols=10 Identities=30% Similarity=0.667 Sum_probs=5.6
Q ss_pred ccCCCCCccc
Q psy16259 780 FKCDVCPKDF 789 (896)
Q Consensus 780 ~~C~~C~k~f 789 (896)
|+|.+||..+
T Consensus 3 ~~C~~CG~i~ 12 (34)
T cd00729 3 WVCPVCGYIH 12 (34)
T ss_pred EECCCCCCEe
Confidence 5566666544
No 93
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=64.43 E-value=3 Score=31.18 Aligned_cols=27 Identities=30% Similarity=0.570 Sum_probs=15.0
Q ss_pred cCCCCccCCccCccCCChhHHHHHHHh
Q psy16259 629 TGESPHKCSYCKQSFTRKDHLNNHVRL 655 (896)
Q Consensus 629 ~~~~~~~C~~C~~~F~~~~~L~~H~~~ 655 (896)
.||.-+.|+-||..|....++.+|+..
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhH
Confidence 344455566666666555555555543
No 94
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=64.27 E-value=5.2 Score=35.53 Aligned_cols=10 Identities=30% Similarity=0.889 Sum_probs=4.6
Q ss_pred ccccCCCCCC
Q psy16259 719 VCNICSKPFL 728 (896)
Q Consensus 719 ~C~~C~~~f~ 728 (896)
.|..||+.|.
T Consensus 11 ~Cp~CG~kFY 20 (108)
T PF09538_consen 11 TCPSCGAKFY 20 (108)
T ss_pred cCCCCcchhc
Confidence 3444444443
No 95
>PF03276 Gag_spuma: Spumavirus gag protein; InterPro: IPR004957 The Spumavirus gag protein is a core viral polyprotein that undergoes specific enzymatic cleavages in vivo to yield the mature protein.; GO: 0019028 viral capsid
Probab=64.10 E-value=42 Score=38.07 Aligned_cols=11 Identities=0% Similarity=-0.221 Sum_probs=6.4
Q ss_pred ecccccccccC
Q psy16259 206 KTNGTGDVNPD 216 (896)
Q Consensus 206 ~~~~~~e~~~l 216 (896)
|.|+..+|++.
T Consensus 371 mmft~~n~~Lv 381 (582)
T PF03276_consen 371 MMFTNQNFDLV 381 (582)
T ss_pred ceeecCCcchh
Confidence 44566666654
No 96
>KOG1701|consensus
Probab=62.60 E-value=3.7 Score=44.60 Aligned_cols=40 Identities=15% Similarity=0.341 Sum_probs=19.7
Q ss_pred ccccccccccChhhhhhhcccccCCCccccCCCCCccCCH
Q psy16259 439 RCEICKKAFTRKEHYVNHVMWHTGETPHVCQVCGKKYTRK 478 (896)
Q Consensus 439 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 478 (896)
.|..|+|...-...-.+=|..--...=|.|..|++...-+
T Consensus 276 iC~~C~K~V~g~~~ac~Am~~~fHv~CFtC~~C~r~L~Gq 315 (468)
T KOG1701|consen 276 ICAFCHKTVSGQGLAVEAMDQLFHVQCFTCRTCRRQLAGQ 315 (468)
T ss_pred hhhhcCCcccCcchHHHHhhhhhcccceehHhhhhhhccc
Confidence 6777877665443322222111112346777776654433
No 97
>KOG2186|consensus
Probab=61.83 E-value=3.2 Score=41.77 Aligned_cols=50 Identities=26% Similarity=0.472 Sum_probs=29.4
Q ss_pred cccccCCcccCChHHHHHHHhhhcCCCCCCcccccccccChhHHHHHhhhhcC
Q psy16259 662 HKCHFCSKSFTRKEHLNNHIRLHTGESPHKCDFCLKTFTRKEHLTNHLKQHAG 714 (896)
Q Consensus 662 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~k~f~~~~~L~~H~~~h~~ 714 (896)
|.|..||.+... -.+.+|+-..++ .-|.|-.|++.|.. .+...|.+--+.
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~kCITE 53 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTKCITE 53 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhhhcch
Confidence 566777666543 345556666555 44667677777665 556666554443
No 98
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=60.49 E-value=2.4 Score=42.63 Aligned_cols=30 Identities=33% Similarity=0.544 Sum_probs=19.1
Q ss_pred CCCCccCCccCccccchhhHHhhhccccCC
Q psy16259 804 NERPHVCPHCSKSFVEHTCMIKHIRKNHTL 833 (896)
Q Consensus 804 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 833 (896)
++..|.|.+|+|.|+-..-+++||..+|..
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 344577777777777777777777777764
No 99
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=57.24 E-value=5.7 Score=29.80 Aligned_cols=20 Identities=40% Similarity=0.786 Sum_probs=11.2
Q ss_pred cCCCCCcccCCc-----hhHHHHhh
Q psy16259 781 KCDVCPKDFTCK-----GHLVSHLR 800 (896)
Q Consensus 781 ~C~~C~k~f~~~-----~~L~~H~~ 800 (896)
.|..|++.+... +.|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 466666666543 35555554
No 100
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=57.09 E-value=6.8 Score=28.46 Aligned_cols=22 Identities=32% Similarity=0.551 Sum_probs=8.6
Q ss_pred CCccCCCCCCCCCCc----hhHHHHH
Q psy16259 744 RPFACTDCGKSFPLK----GNLLFHQ 765 (896)
Q Consensus 744 k~~~C~~C~~~F~~~----~~l~~H~ 765 (896)
.-..|.+|++.+... ++|.+|+
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL 40 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHL 40 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHH
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhh
Confidence 344555555555442 4444444
No 101
>KOG1701|consensus
Probab=56.49 E-value=3.1 Score=45.19 Aligned_cols=38 Identities=24% Similarity=0.502 Sum_probs=23.4
Q ss_pred ccCCCCCCCCChHHHHHHhh--hcCCCCCcccCccccccCCh
Q psy16259 299 TCDLCGKFFPYRYQMIVHRR--YHTERKPFTCHVCGKAFDFQ 338 (896)
Q Consensus 299 ~C~~C~~~f~~~~~L~~H~~--~h~~~~~~~C~~C~~~f~~~ 338 (896)
.|-.|+|...-...=-+=|. .|. .-|+|..|+++..-+
T Consensus 276 iC~~C~K~V~g~~~ac~Am~~~fHv--~CFtC~~C~r~L~Gq 315 (468)
T KOG1701|consen 276 ICAFCHKTVSGQGLAVEAMDQLFHV--QCFTCRTCRRQLAGQ 315 (468)
T ss_pred hhhhcCCcccCcchHHHHhhhhhcc--cceehHhhhhhhccc
Confidence 69999987765443333332 333 348899998765443
No 102
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=56.09 E-value=9.2 Score=26.46 Aligned_cols=11 Identities=18% Similarity=0.836 Sum_probs=4.9
Q ss_pred cCccccccCCh
Q psy16259 328 CHVCGKAFDFQ 338 (896)
Q Consensus 328 C~~C~~~f~~~ 338 (896)
|+.|+..|.-.
T Consensus 5 Cp~C~~~y~i~ 15 (36)
T PF13717_consen 5 CPNCQAKYEID 15 (36)
T ss_pred CCCCCCEEeCC
Confidence 44444444433
No 103
>PF04625 DEC-1_N: DEC-1 protein, N-terminal region; InterPro: IPR006719 The defective chorion-1 gene (dec-1) in Drosophila encodes follicle cell proteins necessary for proper eggshell assembly. Multiple products of the dec-1 gene are formed by alternative RNA splicing and proteolytic processing []. Cleavage products include S80 (80 kDa) which is incorporated into the eggshell, and further proteolysis of S80 gives S60 (60 kDa). This domain is present at the N-terminal of these proteins.; GO: 0005213 structural constituent of chorion, 0007304 chorion-containing eggshell formation, 0005576 extracellular region, 0042600 chorion
Probab=56.07 E-value=62 Score=33.94 Aligned_cols=12 Identities=42% Similarity=0.576 Sum_probs=6.3
Q ss_pred CcccccccCCCC
Q psy16259 2 PIQCELSQFPAS 13 (896)
Q Consensus 2 ~~~~~~~~~~~~ 13 (896)
|.++.|--+|++
T Consensus 17 Pa~s~LGavPD~ 28 (407)
T PF04625_consen 17 PAQSILGAVPDS 28 (407)
T ss_pred chhhhhccCCcc
Confidence 344555555554
No 104
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=55.44 E-value=7.8 Score=39.39 Aligned_cols=20 Identities=25% Similarity=0.474 Sum_probs=9.6
Q ss_pred cccCCCCCCCcccccccccc
Q psy16259 605 PYQCQYCPKAFSRKDHLVNH 624 (896)
Q Consensus 605 ~~~C~~C~k~f~~~~~L~~H 624 (896)
.+.|++|++.|..+.-+...
T Consensus 5 ~~~CPvC~~~F~~~~vrs~~ 24 (214)
T PF09986_consen 5 KITCPVCGKEFKTKKVRSGK 24 (214)
T ss_pred ceECCCCCCeeeeeEEEcCC
Confidence 34555555555544433333
No 105
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=54.31 E-value=8.6 Score=28.81 Aligned_cols=24 Identities=38% Similarity=0.624 Sum_probs=18.3
Q ss_pred CccCCCCCCCCCCc-----hhHHHHHH-hh
Q psy16259 745 PFACTDCGKSFPLK-----GNLLFHQR-SH 768 (896)
Q Consensus 745 ~~~C~~C~~~F~~~-----~~l~~H~~-~H 768 (896)
.-.|.+|++.+... ++|.+|++ +|
T Consensus 18 ~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 18 RAKCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EEEecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 46799999888655 58888887 44
No 106
>KOG2186|consensus
Probab=54.14 E-value=6.4 Score=39.73 Aligned_cols=46 Identities=33% Similarity=0.638 Sum_probs=20.1
Q ss_pred cccCccCcccCChHHHHHHHhhccCCCCeecCccccccCChHHHHHHHh
Q psy16259 354 YTCAVCFHVFANEESLQRHTKRHSIDRPYCCSVCMKTFARKEHLENHTR 402 (896)
Q Consensus 354 ~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~F~~~~~L~~H~~ 402 (896)
|.|.+||...... .|.+|+-+.++ .-|.|-.|++.|.. .++..|..
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 4444454443322 23334444443 33455555555544 34444433
No 107
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=53.91 E-value=4.9 Score=32.68 Aligned_cols=41 Identities=24% Similarity=0.536 Sum_probs=25.7
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCCCccCC--ccCccccchhh
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNERPHVCP--HCSKSFVEHTC 822 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~--~C~~~f~~~~~ 822 (896)
+.|+.||........-..+-. ..++.++|. .||..|.+...
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYES 44 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEEE
Confidence 468888876644443333322 556778887 88888876543
No 108
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=53.07 E-value=13 Score=28.69 Aligned_cols=8 Identities=50% Similarity=1.618 Sum_probs=4.5
Q ss_pred ccCCCCCC
Q psy16259 746 FACTDCGK 753 (896)
Q Consensus 746 ~~C~~C~~ 753 (896)
|.|+.||.
T Consensus 26 F~CPnCG~ 33 (59)
T PRK14890 26 FLCPNCGE 33 (59)
T ss_pred eeCCCCCC
Confidence 55555554
No 109
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=52.72 E-value=9.5 Score=33.88 Aligned_cols=8 Identities=38% Similarity=1.061 Sum_probs=3.2
Q ss_pred CCCCCCCc
Q psy16259 608 CQYCPKAF 615 (896)
Q Consensus 608 C~~C~k~f 615 (896)
|..||+.|
T Consensus 12 Cp~CG~kF 19 (108)
T PF09538_consen 12 CPSCGAKF 19 (108)
T ss_pred CCCCcchh
Confidence 44444443
No 110
>KOG1984|consensus
Probab=52.71 E-value=4.5e+02 Score=32.19 Aligned_cols=28 Identities=18% Similarity=0.310 Sum_probs=14.6
Q ss_pred ccccCCCccccCccCcccCChHHHHHHH
Q psy16259 346 KCHLGGSMYTCAVCFHVFANEESLQRHT 373 (896)
Q Consensus 346 ~~h~~~~~~~C~~C~~~f~~~~~L~~H~ 373 (896)
+.-.+.+.|.|.+|+..-........|+
T Consensus 353 qF~~~gr~f~Cn~C~~~n~vp~~yf~~L 380 (1007)
T KOG1984|consen 353 QFIDGGRKFICNFCGSKNQVPDDYFNHL 380 (1007)
T ss_pred EEecCCceEEecCCCccccCChhhcccC
Confidence 3334445566666665555555544443
No 111
>PF15449 Retinal: Retinal protein
Probab=52.67 E-value=4.3e+02 Score=33.11 Aligned_cols=20 Identities=10% Similarity=0.013 Sum_probs=10.1
Q ss_pred ceeeeeeeccCCCccccccC
Q psy16259 153 IQNIGAVLSPSYGQHCIQQD 172 (896)
Q Consensus 153 ~~~~~~~~~~~~~~~~~q~~ 172 (896)
-+.+-..|.++-.|.=|++.
T Consensus 1197 dsqRr~al~AlnPqPFVRRt 1216 (1287)
T PF15449_consen 1197 DSQRRTALCALNPQPFVRRT 1216 (1287)
T ss_pred cccccccccccCCCcccccc
Confidence 34444455555555555543
No 112
>KOG4377|consensus
Probab=52.15 E-value=6.1 Score=42.67 Aligned_cols=19 Identities=26% Similarity=0.487 Sum_probs=13.5
Q ss_pred CCcccCCchhHHHHhhhhc
Q psy16259 785 CPKDFTCKGHLVSHLRSHL 803 (896)
Q Consensus 785 C~k~f~~~~~L~~H~~~H~ 803 (896)
|+..|.+.+.+..|.|.|.
T Consensus 409 c~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 409 CEATLYSVSQMASHKRKHE 427 (480)
T ss_pred CceEEEehhhhhhhhhhhh
Confidence 6677777777777777765
No 113
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=52.08 E-value=8.3 Score=26.89 Aligned_cols=11 Identities=27% Similarity=0.622 Sum_probs=5.1
Q ss_pred ccCCCCCCCCC
Q psy16259 299 TCDLCGKFFPY 309 (896)
Q Consensus 299 ~C~~C~~~f~~ 309 (896)
.|+.|+..|.-
T Consensus 4 ~CP~C~~~~~v 14 (38)
T TIGR02098 4 QCPNCKTSFRV 14 (38)
T ss_pred ECCCCCCEEEe
Confidence 34555544433
No 114
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=52.03 E-value=12 Score=35.93 Aligned_cols=33 Identities=21% Similarity=0.435 Sum_probs=18.6
Q ss_pred cCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCC
Q psy16259 713 AGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKS 754 (896)
Q Consensus 713 ~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~ 754 (896)
.+..-|.|+.|+..|....++.. -|.|+.||..
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~~---------~F~Cp~Cg~~ 137 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAMEL---------NFTCPRCGAM 137 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHHc---------CCcCCCCCCE
Confidence 34445666666666666555531 3666666654
No 115
>KOG2461|consensus
Probab=51.69 E-value=23 Score=39.45 Aligned_cols=59 Identities=15% Similarity=0.193 Sum_probs=33.7
Q ss_pred CccCCcccchhhhhhhhhccCCCCCccCCccCcccCChhhhhcccccccCCCCCcCccc
Q psy16259 525 DFCSKTFTRKEHLLNHVRQHTGESPHKCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYC 583 (896)
Q Consensus 525 ~~C~k~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C 583 (896)
..|...+.+......+...........+..++..+.....+..+.....+.++..+..+
T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (396)
T KOG2461|consen 186 KICHISILTSAMSSNSAEVLDSDSHNLADEESVSLKEQGQLLEPVEVASGPQPVKAGTG 244 (396)
T ss_pred cccccccccccCCCcchhhcccccCCccccccccccccccccccccccCcccccccCCC
Confidence 33444444444445555555555566666666666666666666666666655554443
No 116
>PF01690 PLRV_ORF5: Potato leaf roll virus readthrough protein; InterPro: IPR002929 This family consists mainly of the Potato leafroll virus (PLrV) read through protein otherwise known as the minor capsid protein. This is generated via a readthrough of open reading frame 3, the coat protein, allowing transcription of open reading frame 5 to give an extended coat protein with a large C-terminal addition or read through domain []. The read through protein is essential for the circulative aphid transmission of PLrV [] and Beet western yellows virus []. The N-terminal region of the luteovirus readthrough domain determines virus binding to Buchnera GroEL and is essential for virus persistence in the aphid [].; GO: 0019028 viral capsid
Probab=51.63 E-value=12 Score=41.80 Aligned_cols=6 Identities=33% Similarity=0.296 Sum_probs=2.3
Q ss_pred ccccCC
Q psy16259 241 STLGLA 246 (896)
Q Consensus 241 sslg~~ 246 (896)
+.-|+.
T Consensus 106 sceG~q 111 (465)
T PF01690_consen 106 SCEGYQ 111 (465)
T ss_pred Eeccee
Confidence 333443
No 117
>COG5188 PRP9 Splicing factor 3a, subunit 3 [RNA processing and modification]
Probab=51.57 E-value=10 Score=39.95 Aligned_cols=27 Identities=19% Similarity=0.598 Sum_probs=17.8
Q ss_pred ccCCCceecCCCC-Ccccchhhhhhhhh
Q psy16259 404 HTGETPYKCQYCG-KNFSRKEHMVNHVR 430 (896)
Q Consensus 404 h~~~~~~~C~~C~-~~f~~~~~L~~H~~ 430 (896)
|.-.+.|.|.+|| +++.....+.+|..
T Consensus 369 hgLd~ef~CEICgNyvy~GR~~FdrHF~ 396 (470)
T COG5188 369 HGLDIEFECEICGNYVYYGRDRFDRHFE 396 (470)
T ss_pred cCCCcceeeeecccccccchHHHHhhhh
Confidence 4445667788887 66666666776653
No 118
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=50.47 E-value=14 Score=35.04 Aligned_cols=36 Identities=19% Similarity=0.453 Sum_probs=17.8
Q ss_pred CCccccccCCCCCCChhhHhh-hhhcccCCCCccCCCCCCCC
Q psy16259 715 ETEHVCNICSKPFLKKEHLVN-HSRSHTGERPFACTDCGKSF 755 (896)
Q Consensus 715 ~~~~~C~~C~~~f~~~~~L~~-H~~~H~~~k~~~C~~C~~~F 755 (896)
..-|.|+.|+..|.....+.. ++ +..|.|+.||...
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea~~~~d~-----~~~f~Cp~Cg~~l 133 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEANQLLDM-----DGTFTCPRCGEEL 133 (147)
T ss_pred CcEEECcCCCCEeeHHHHHHhcCC-----CCcEECCCCCCEE
Confidence 345666666666654443332 11 2226666666543
No 119
>KOG0162|consensus
Probab=50.46 E-value=2.2e+02 Score=33.84 Aligned_cols=127 Identities=15% Similarity=0.163 Sum_probs=0.0
Q ss_pred CCcccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy16259 1 MPIQCELSQFPASHKTSTSMEVDTPPPPSSSRGSEPPSPNTGGVSGGFISTSPSQPSPQHGHHTLTSLTSPHQMPPPSPI 80 (896)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pppppp 80 (896)
+|-+++-+.--+..++.-++.-.....++-..+.+++.-.-.+.......++-|.+.|.......+...+...-+.--|-
T Consensus 923 lP~~Skps~k~p~~~tg~s~g~~~~~~~~r~~~~~~~~~~~~~~~paA~~~p~p~~~~~~~~~~~~~p~~~s~~~s~~~~ 1002 (1106)
T KOG0162|consen 923 LPPNSKPSRKKPRKATGYSSGRDAASTPTRRAPQNKQAYGQNGVSPAAKGSPLPAQKPVNTYNQRPPPVSTSTTTSQQPS 1002 (1106)
T ss_pred CCCCCCcCCcCcccCCCCCCCcccccCCCCCCCCCcccccCCCCCccccCCCCCCCCCCCccccCCCCCCcccccccccc
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCC
Q psy16259 81 ATSPPPPSASDKHDSQSSPHITQHTGPGIFQSPP---PTSHISKIPSPSS 127 (896)
Q Consensus 81 ppppp~p~~~~~~~~~~~~~~~~~~~~~~~~~pp---p~pp~~~~p~~~~ 127 (896)
.-|+.-|..-..-+....+.....+..+..+||| |+||||+-++..|
T Consensus 1003 ~~p~s~p~~~~~~~~~~~~~~~~~~s~~q~~pP~~g~P~PpPp~~~~k~p 1052 (1106)
T KOG0162|consen 1003 ARPSSKPTVFTKVPDAGASGNGRKPSGPQRPPPPAGRPKPPPPAKPPKNP 1052 (1106)
T ss_pred cCCCCCCccccccCcccCcccccCCCCCCCCCCCCCCCCccCCCCCCCCc
No 120
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=50.40 E-value=14 Score=37.44 Aligned_cols=25 Identities=24% Similarity=0.464 Sum_probs=14.2
Q ss_pred CcccCCCCCCCcccccccccccccc
Q psy16259 604 TPYQCQYCPKAFSRKDHLVNHVRQH 628 (896)
Q Consensus 604 ~~~~C~~C~k~f~~~~~L~~H~~~h 628 (896)
+++.|+.|+........|..-.|.|
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCcccccccceeeeecc
Confidence 5566666666655555555444444
No 121
>KOG4377|consensus
Probab=50.03 E-value=12 Score=40.63 Aligned_cols=23 Identities=26% Similarity=0.507 Sum_probs=18.8
Q ss_pred ccCC--ccCccccchhhHHhhhccc
Q psy16259 808 HVCP--HCSKSFVEHTCMIKHIRKN 830 (896)
Q Consensus 808 ~~C~--~C~~~f~~~~~l~~H~~~~ 830 (896)
|.|. -|+..|...+.+..|-|++
T Consensus 402 fhc~r~Gc~~tl~s~sqm~shkrkh 426 (480)
T KOG4377|consen 402 FHCDRLGCEATLYSVSQMASHKRKH 426 (480)
T ss_pred eeecccCCceEEEehhhhhhhhhhh
Confidence 4554 4899999999999999984
No 122
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=49.32 E-value=11 Score=27.45 Aligned_cols=27 Identities=19% Similarity=0.550 Sum_probs=15.7
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccc
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSF 817 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 817 (896)
|.|..||..|... ...+.+|+.||..-
T Consensus 3 Y~C~~Cg~~~~~~-----------~~~~irC~~CG~rI 29 (44)
T smart00659 3 YICGECGRENEIK-----------SKDVVRCRECGYRI 29 (44)
T ss_pred EECCCCCCEeecC-----------CCCceECCCCCceE
Confidence 6666776666532 23456677776544
No 123
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=48.92 E-value=14 Score=33.45 Aligned_cols=13 Identities=15% Similarity=0.153 Sum_probs=6.0
Q ss_pred CccCCCCCcccCC
Q psy16259 779 PFKCDVCPKDFTC 791 (896)
Q Consensus 779 ~~~C~~C~k~f~~ 791 (896)
|..|..||..|.-
T Consensus 26 p~vcP~cg~~~~~ 38 (129)
T TIGR02300 26 PAVSPYTGEQFPP 38 (129)
T ss_pred CccCCCcCCccCc
Confidence 4445555554433
No 124
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=48.46 E-value=9.9 Score=34.50 Aligned_cols=12 Identities=50% Similarity=1.101 Sum_probs=6.3
Q ss_pred cCCccCccccch
Q psy16259 809 VCPHCSKSFVEH 820 (896)
Q Consensus 809 ~C~~C~~~f~~~ 820 (896)
+|+.|.-+|++.
T Consensus 123 vCPvCkTSFKss 134 (140)
T PF05290_consen 123 VCPVCKTSFKSS 134 (140)
T ss_pred CCCccccccccc
Confidence 355555555543
No 125
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=48.31 E-value=12 Score=26.07 Aligned_cols=9 Identities=33% Similarity=0.921 Sum_probs=3.6
Q ss_pred cCccccccC
Q psy16259 328 CHVCGKAFD 336 (896)
Q Consensus 328 C~~C~~~f~ 336 (896)
|+.|+..|.
T Consensus 5 CP~C~~~f~ 13 (37)
T PF13719_consen 5 CPNCQTRFR 13 (37)
T ss_pred CCCCCceEE
Confidence 344444443
No 126
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=48.02 E-value=11 Score=28.50 Aligned_cols=11 Identities=27% Similarity=0.896 Sum_probs=4.7
Q ss_pred CcCccccCccc
Q psy16259 578 HKCNYCAKSFT 588 (896)
Q Consensus 578 ~~C~~C~k~f~ 588 (896)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 34444444443
No 127
>smart00349 KRAB krueppel associated box.
Probab=47.97 E-value=14 Score=26.29 Aligned_cols=35 Identities=14% Similarity=-0.005 Sum_probs=29.5
Q ss_pred eeeeeeecccccccccCChhhhhHhhhhccccccc
Q psy16259 200 QQSDLIKTNGTGDVNPDEFNQLLAYHEVFGKLQSS 234 (896)
Q Consensus 200 ~~~dv~~~~~~~e~~~l~~~q~~~y~~v~~e~~~~ 234 (896)
.+.++.+.+..++|..++..++..+.+.+.+.+..
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 36 (61)
T smart00349 2 TFEDVAVYFSQEEWEQLDPAQRNLYRDVMLENYSN 36 (61)
T ss_pred eeecceeecCHHHHhccCHHHHHHHHHHHHHhhcc
Confidence 45677778888899999999999999998888776
No 128
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=47.95 E-value=16 Score=35.18 Aligned_cols=34 Identities=24% Similarity=0.448 Sum_probs=20.8
Q ss_pred CCCCCcccCccccccCChHHHhhhhccccCCCccccCccCccc
Q psy16259 321 TERKPFTCHVCGKAFDFQAELMSHGKCHLGGSMYTCAVCFHVF 363 (896)
Q Consensus 321 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 363 (896)
.+..-|.|+.|+.+|+...++. ..|.|+.||...
T Consensus 105 ~~~~~Y~Cp~c~~r~tf~eA~~---------~~F~Cp~Cg~~L 138 (158)
T TIGR00373 105 TNNMFFICPNMCVRFTFNEAME---------LNFTCPRCGAML 138 (158)
T ss_pred cCCCeEECCCCCcEeeHHHHHH---------cCCcCCCCCCEe
Confidence 3445566777777776666663 246677776543
No 129
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=47.72 E-value=15 Score=23.35 Aligned_cols=9 Identities=44% Similarity=1.228 Sum_probs=4.5
Q ss_pred cCCCCCCCC
Q psy16259 747 ACTDCGKSF 755 (896)
Q Consensus 747 ~C~~C~~~F 755 (896)
.|+.||+.|
T Consensus 16 ~Cp~CG~~F 24 (26)
T PF10571_consen 16 FCPHCGYDF 24 (26)
T ss_pred cCCCCCCCC
Confidence 355555554
No 130
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=47.00 E-value=6.7 Score=28.80 Aligned_cols=28 Identities=21% Similarity=0.549 Sum_probs=15.0
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccc
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSF 817 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 817 (896)
|+|..||..|...... ..+.|+.||..+
T Consensus 4 y~C~~CG~~~~~~~~~----------~~~~Cp~CG~~~ 31 (46)
T PRK00398 4 YKCARCGREVELDEYG----------TGVRCPYCGYRI 31 (46)
T ss_pred EECCCCCCEEEECCCC----------CceECCCCCCeE
Confidence 6666666665432111 145677776554
No 131
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=46.93 E-value=12 Score=26.37 Aligned_cols=11 Identities=36% Similarity=1.150 Sum_probs=5.0
Q ss_pred ccCCCCCCCcc
Q psy16259 606 YQCQYCPKAFS 616 (896)
Q Consensus 606 ~~C~~C~k~f~ 616 (896)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (41)
T smart00834 6 YRCEDCGHTFE 16 (41)
T ss_pred EEcCCCCCEEE
Confidence 44444444443
No 132
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=46.43 E-value=20 Score=37.31 Aligned_cols=23 Identities=22% Similarity=0.624 Sum_probs=10.4
Q ss_pred ccCCCCCCCCCCchhHHHHHHhh
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSH 768 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H 768 (896)
|.|+.|...|-.--+.-.|...|
T Consensus 389 Y~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 389 YQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred eechhhhhhhhhhhHHHHHHHHh
Confidence 44444444444444444444333
No 133
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=45.05 E-value=8.5 Score=36.63 Aligned_cols=16 Identities=25% Similarity=0.629 Sum_probs=8.7
Q ss_pred ccCCCCCCCccccccc
Q psy16259 606 YQCQYCPKAFSRKDHL 621 (896)
Q Consensus 606 ~~C~~C~k~f~~~~~L 621 (896)
++|..||+.|.....+
T Consensus 29 ~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLACGKRFTTFERV 44 (154)
T ss_pred eeccccCCcceEeEec
Confidence 5566666666554433
No 134
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=44.92 E-value=11 Score=28.07 Aligned_cols=29 Identities=24% Similarity=0.672 Sum_probs=17.6
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccc
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSF 817 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 817 (896)
-|+|..||+.|. .-.......|++||..-
T Consensus 6 ~Y~C~~Cg~~~~----------~~~~~~~irCp~Cg~rI 34 (49)
T COG1996 6 EYKCARCGREVE----------LDQETRGIRCPYCGSRI 34 (49)
T ss_pred EEEhhhcCCeee----------hhhccCceeCCCCCcEE
Confidence 477888887771 11233556788887543
No 135
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=44.55 E-value=17 Score=35.74 Aligned_cols=33 Identities=24% Similarity=0.546 Sum_probs=19.0
Q ss_pred CCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCC
Q psy16259 715 ETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFP 756 (896)
Q Consensus 715 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~ 756 (896)
..-|.|+.|+..|....++. .-|.|+.||....
T Consensus 115 ~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 115 NMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEMLE 147 (178)
T ss_pred CCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCCe
Confidence 34566666666666655542 2366666665443
No 136
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=44.52 E-value=9.8 Score=35.09 Aligned_cols=24 Identities=42% Similarity=0.627 Sum_probs=13.3
Q ss_pred CCcccccccccChhHHHHHhhhhcCCC
Q psy16259 690 HKCDFCLKTFTRKEHLTNHLKQHAGET 716 (896)
Q Consensus 690 ~~C~~C~k~f~~~~~L~~H~~~h~~~~ 716 (896)
..|-+||+.|. .|++|++.|+|..
T Consensus 73 i~clecGk~~k---~LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFK---TLKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EES---BHHHHHHHTT-S-
T ss_pred eEEccCCcccc---hHHHHHHHccCCC
Confidence 46777777776 3477877777654
No 137
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.62 E-value=9.5 Score=30.79 Aligned_cols=32 Identities=22% Similarity=0.754 Sum_probs=21.9
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCCCCc-cCCccCcccc
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPH-VCPHCSKSFV 818 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~-~C~~C~~~f~ 818 (896)
.|+|..|+..| .+.+||+ +.|+ .|+.|+..|+
T Consensus 12 ~Y~c~~cg~~~----dvvq~~~----ddplt~ce~c~a~~k 44 (82)
T COG2331 12 SYECTECGNRF----DVVQAMT----DDPLTTCEECGARLK 44 (82)
T ss_pred EEeecccchHH----HHHHhcc----cCccccChhhChHHH
Confidence 48899998877 4555554 3444 5999987653
No 138
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=43.32 E-value=20 Score=35.24 Aligned_cols=33 Identities=21% Similarity=0.528 Sum_probs=20.1
Q ss_pred CCCCcccCccccccCChHHHhhhhccccCCCccccCccCccc
Q psy16259 322 ERKPFTCHVCGKAFDFQAELMSHGKCHLGGSMYTCAVCFHVF 363 (896)
Q Consensus 322 ~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 363 (896)
+..-|.|+.|+.+|+....+. ..|.|+.||...
T Consensus 114 ~~~~Y~Cp~C~~rytf~eA~~---------~~F~Cp~Cg~~L 146 (178)
T PRK06266 114 NNMFFFCPNCHIRFTFDEAME---------YGFRCPQCGEML 146 (178)
T ss_pred CCCEEECCCCCcEEeHHHHhh---------cCCcCCCCCCCC
Confidence 334567777777776666552 246677776554
No 139
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=42.84 E-value=86 Score=38.78 Aligned_cols=16 Identities=25% Similarity=0.573 Sum_probs=7.4
Q ss_pred ccccCChhhh-hHhhhh
Q psy16259 212 DVNPDEFNQL-LAYHEV 227 (896)
Q Consensus 212 e~~~l~~~q~-~~y~~v 227 (896)
..+.+++.|. ..|.|+
T Consensus 233 ~Ad~l~f~ef~~ay~dL 249 (1228)
T PRK12270 233 GAETMDFAQFWAAYEDI 249 (1228)
T ss_pred ccccCCHHHHHHHHHHH
Confidence 3444555554 345443
No 140
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=42.65 E-value=23 Score=33.57 Aligned_cols=37 Identities=16% Similarity=0.397 Sum_probs=18.2
Q ss_pred CCCcccCccccccCChHHHhhhhccccCCCccccCccCccc
Q psy16259 323 RKPFTCHVCGKAFDFQAELMSHGKCHLGGSMYTCAVCFHVF 363 (896)
Q Consensus 323 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f 363 (896)
...|.|+.|+..|.....+..- . ....|.|+.||...
T Consensus 97 ~~~Y~Cp~C~~~y~~~ea~~~~---d-~~~~f~Cp~Cg~~l 133 (147)
T smart00531 97 NAYYKCPNCQSKYTFLEANQLL---D-MDGTFTCPRCGEEL 133 (147)
T ss_pred CcEEECcCCCCEeeHHHHHHhc---C-CCCcEECCCCCCEE
Confidence 3456666666666654433220 0 12236666666543
No 141
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=42.52 E-value=13 Score=35.46 Aligned_cols=14 Identities=43% Similarity=0.778 Sum_probs=7.9
Q ss_pred CcCccccCcccchh
Q psy16259 578 HKCNYCAKSFTRKE 591 (896)
Q Consensus 578 ~~C~~C~k~f~~~~ 591 (896)
++|..||++|....
T Consensus 29 ~~c~~c~~~f~~~e 42 (154)
T PRK00464 29 RECLACGKRFTTFE 42 (154)
T ss_pred eeccccCCcceEeE
Confidence 55666666665543
No 142
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=42.50 E-value=18 Score=43.59 Aligned_cols=12 Identities=33% Similarity=0.905 Sum_probs=6.8
Q ss_pred CCceecCCCCCc
Q psy16259 407 ETPYKCQYCGKN 418 (896)
Q Consensus 407 ~~~~~C~~C~~~ 418 (896)
..+..|+.||..
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 345666666644
No 143
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=41.18 E-value=1e+02 Score=38.13 Aligned_cols=6 Identities=17% Similarity=-0.009 Sum_probs=2.6
Q ss_pred cccccC
Q psy16259 240 SSTLGL 245 (896)
Q Consensus 240 ~sslg~ 245 (896)
++++|.
T Consensus 270 ISN~G~ 275 (1228)
T PRK12270 270 LTNPGG 275 (1228)
T ss_pred EecCCc
Confidence 344444
No 144
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=40.96 E-value=17 Score=35.01 Aligned_cols=23 Identities=43% Similarity=0.998 Sum_probs=18.1
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCCCCccCCccC
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCS 814 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~ 814 (896)
-|.|.+||.. |-|+-|-+|++||
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCC
Confidence 4888888764 4567888899998
No 145
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=40.92 E-value=20 Score=32.50 Aligned_cols=35 Identities=26% Similarity=0.374 Sum_probs=26.8
Q ss_pred CccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCccccchhhHH
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFVEHTCMI 824 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~l~ 824 (896)
...|..||+.|.. .+..|-.|+.||..|.-...++
T Consensus 9 Kr~Cp~cg~kFYD-----------Lnk~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 9 KRICPNTGSKFYD-----------LNRRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred cccCCCcCccccc-----------cCCCCccCCCcCCccCcchhhc
Confidence 4679999999954 3457999999999986654443
No 146
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=40.63 E-value=13 Score=33.79 Aligned_cols=17 Identities=24% Similarity=0.460 Sum_probs=9.7
Q ss_pred CCCccccccCCCCCCCh
Q psy16259 714 GETEHVCNICSKPFLKK 730 (896)
Q Consensus 714 ~~~~~~C~~C~~~f~~~ 730 (896)
+.+-|+|.+|..+....
T Consensus 77 d~~lYeCnIC~etS~ee 93 (140)
T PF05290_consen 77 DPKLYECNICKETSAEE 93 (140)
T ss_pred CCCceeccCcccccchh
Confidence 34556666666655443
No 147
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=39.65 E-value=17 Score=36.96 Aligned_cols=94 Identities=21% Similarity=0.381 Sum_probs=45.5
Q ss_pred hcCCCCCCcccccccccChhHHHHHhhh--hcCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhH
Q psy16259 684 HTGESPHKCDFCLKTFTRKEHLTNHLKQ--HAGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNL 761 (896)
Q Consensus 684 H~~~~~~~C~~C~k~f~~~~~L~~H~~~--h~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l 761 (896)
.+|.+.|+|.+|+.-. -...--.|+.. ......|+|.-|++. ..|.|-.|...|-
T Consensus 137 ~hGGrif~CsfC~~fl-CEDDQFEHQAsCQvLe~E~~KC~SCNrl-----------------Gq~sCLRCK~cfC----- 193 (314)
T PF06524_consen 137 DHGGRIFKCSFCDNFL-CEDDQFEHQASCQVLESETFKCQSCNRL-----------------GQYSCLRCKICFC----- 193 (314)
T ss_pred cCCCeEEEeecCCCee-eccchhhhhhhhhhhhcccccccccccc-----------------cchhhhheeeeeh-----
Confidence 3566778888876432 22222344321 112235777777653 2245555544443
Q ss_pred HHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhh
Q psy16259 762 LFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSH 802 (896)
Q Consensus 762 ~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H 802 (896)
..|+|..-- .....+++.|+.||........|..=.|+|
T Consensus 194 ddHvrrKg~--ky~k~k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 194 DDHVRRKGF--KYEKGKPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred hhhhhhccc--ccccCCCCCCCCCCCcccccccceeeeecc
Confidence 234443210 011135666666666665555555544544
No 148
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=38.89 E-value=13 Score=30.18 Aligned_cols=19 Identities=32% Similarity=0.578 Sum_probs=10.2
Q ss_pred CCcccCC--CCCCCccccccc
Q psy16259 603 ETPYQCQ--YCPKAFSRKDHL 621 (896)
Q Consensus 603 ~~~~~C~--~C~k~f~~~~~L 621 (896)
+.-++|. .||..|.....+
T Consensus 25 ~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 25 ERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred eeeeecCCCCCCCEEEEEEEE
Confidence 3445565 566666554443
No 149
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=38.54 E-value=19 Score=25.87 Aligned_cols=11 Identities=36% Similarity=1.135 Sum_probs=5.3
Q ss_pred ccCCCCCCCcc
Q psy16259 606 YQCQYCPKAFS 616 (896)
Q Consensus 606 ~~C~~C~k~f~ 616 (896)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (42)
T PF09723_consen 6 YRCEECGHEFE 16 (42)
T ss_pred EEeCCCCCEEE
Confidence 44555554443
No 150
>PHA00626 hypothetical protein
Probab=38.24 E-value=13 Score=28.09 Aligned_cols=14 Identities=21% Similarity=0.368 Sum_probs=8.9
Q ss_pred CCccCCccCccccc
Q psy16259 806 RPHVCPHCSKSFVE 819 (896)
Q Consensus 806 ~~~~C~~C~~~f~~ 819 (896)
..|+|..||+.|..
T Consensus 22 nrYkCkdCGY~ft~ 35 (59)
T PHA00626 22 DDYVCCDCGYNDSK 35 (59)
T ss_pred cceEcCCCCCeech
Confidence 45677777766643
No 151
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=37.85 E-value=17 Score=33.58 Aligned_cols=27 Identities=33% Similarity=0.502 Sum_probs=15.3
Q ss_pred cCCccCCCCCcccCCchhHHHHhhhhcCCC
Q psy16259 777 ERPFKCDVCPKDFTCKGHLVSHLRSHLNER 806 (896)
Q Consensus 777 ~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~ 806 (896)
+....|-+|||.|+. |++|++.|+|-.
T Consensus 70 ~d~i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 70 PDYIICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp SS-EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred cCeeEEccCCcccch---HHHHHHHccCCC
Confidence 344678888888864 588888887653
No 152
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=37.67 E-value=21 Score=34.31 Aligned_cols=23 Identities=39% Similarity=0.918 Sum_probs=15.7
Q ss_pred ccccccCCCCCCChhhHhhhhhcccCCCCccCCCCC
Q psy16259 717 EHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCG 752 (896)
Q Consensus 717 ~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~ 752 (896)
.|.|.+||+.+ -|+-|-.|++||
T Consensus 134 ~~vC~vCGy~~-------------~ge~P~~CPiCg 156 (166)
T COG1592 134 VWVCPVCGYTH-------------EGEAPEVCPICG 156 (166)
T ss_pred EEEcCCCCCcc-------------cCCCCCcCCCCC
Confidence 57777777653 356677777777
No 153
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=37.55 E-value=82 Score=29.84 Aligned_cols=56 Identities=27% Similarity=0.355 Sum_probs=37.3
Q ss_pred CccCCCC-CcccCCchhHHHHhhhhcCCCCccCCccCccc-cchhhHHhhhccccCCCcC
Q psy16259 779 PFKCDVC-PKDFTCKGHLVSHLRSHLNERPHVCPHCSKSF-VEHTCMIKHIRKNHTLANV 836 (896)
Q Consensus 779 ~~~C~~C-~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f-~~~~~l~~H~~~~H~~~~~ 836 (896)
...|++| |..+.+.. ..--|.+.+.|+=.|..=+=.| -+...|++|.|..|+....
T Consensus 80 ~L~CPLCRG~V~GWtv--ve~AR~~LN~K~RsC~~e~C~F~GtY~eLrKHar~~HP~~rP 137 (162)
T PF07800_consen 80 ELACPLCRGEVKGWTV--VEPARRFLNAKKRSCSQESCSFSGTYSELRKHARSEHPSARP 137 (162)
T ss_pred cccCccccCceeceEE--chHHHHHhccCCccCcccccccccCHHHHHHHHHhhCCCCCC
Confidence 4679999 44544432 2335666677777776533334 4677899999999998654
No 154
>KOG4307|consensus
Probab=36.48 E-value=1.9e+02 Score=34.00 Aligned_cols=6 Identities=33% Similarity=-0.031 Sum_probs=2.3
Q ss_pred eeeeee
Q psy16259 201 QSDLIK 206 (896)
Q Consensus 201 ~~dv~~ 206 (896)
+.|+..
T Consensus 325 ~nd~rk 330 (944)
T KOG4307|consen 325 FNDGRK 330 (944)
T ss_pred cchhhh
Confidence 344333
No 155
>KOG2071|consensus
Probab=36.17 E-value=26 Score=40.31 Aligned_cols=41 Identities=32% Similarity=0.738 Sum_probs=0.0
Q ss_pred CCCCCcccccccccChhHHHHHhhhh------------------------------------------------------
Q psy16259 687 ESPHKCDFCLKTFTRKEHLTNHLKQH------------------------------------------------------ 712 (896)
Q Consensus 687 ~~~~~C~~C~k~f~~~~~L~~H~~~h------------------------------------------------------ 712 (896)
..+-+|..||.+|........||.+|
T Consensus 416 ~~pnqC~~CG~R~~~~ee~sk~md~H~dwh~r~n~~~r~s~~~~~sR~Wf~s~s~W~~~K~~e~t~e~~~~~~~~~~~~~ 495 (579)
T KOG2071|consen 416 DSPNQCKSCGLRFDDSEERSKHMDIHDDWHRRKNTTIRGSSKWQKSRSWFPSKSGWLAAKAGEETDEKEKVEHEELQIKK 495 (579)
T ss_pred CCcchhcccccccccchhhhhHhhhhhhhhhhhhhhcccccccccccceeechhhhccccccccccccccccchhhcchh
Q ss_pred -------------cCCCccccccCCCCC
Q psy16259 713 -------------AGETEHVCNICSKPF 727 (896)
Q Consensus 713 -------------~~~~~~~C~~C~~~f 727 (896)
.+++...|.+|+..|
T Consensus 496 ~~s~~~k~~~Vp~d~e~~~~C~IC~EkF 523 (579)
T KOG2071|consen 496 ELSLRSKYELVPADSERQASCPICQEKF 523 (579)
T ss_pred hhhhhccceecccCcccccCCccccccc
No 156
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=35.25 E-value=34 Score=21.70 Aligned_cols=19 Identities=21% Similarity=0.684 Sum_probs=9.3
Q ss_pred ccccCCcccCChHHHHHHHh
Q psy16259 663 KCHFCSKSFTRKEHLNNHIR 682 (896)
Q Consensus 663 ~C~~C~k~f~~~~~L~~H~~ 682 (896)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 355555555 3344455543
No 157
>KOG3408|consensus
Probab=34.80 E-value=25 Score=31.34 Aligned_cols=35 Identities=26% Similarity=0.435 Sum_probs=29.2
Q ss_pred ccccCCCCCCccccccCCCCCCCCChHHHHHHhhh
Q psy16259 285 DIKADLDLPVMGNHTCDLCGKFFPYRYQMIVHRRY 319 (896)
Q Consensus 285 ~~~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~ 319 (896)
....+.++++.+.|-|-.|.+-|.+...|..|.+.
T Consensus 45 ~~~~D~dlPG~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 45 NQEIDPDLPGGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred cCcCCCCCCCCceeehhhhhhhhcchHHHHHHHhc
Confidence 34456678888999999999999999999999764
No 158
>KOG4124|consensus
Probab=34.37 E-value=12 Score=39.52 Aligned_cols=27 Identities=26% Similarity=0.533 Sum_probs=19.9
Q ss_pred CCCccCCccCccccchhhHHhhhcccc
Q psy16259 805 ERPHVCPHCSKSFVEHTCMIKHIRKNH 831 (896)
Q Consensus 805 ~~~~~C~~C~~~f~~~~~l~~H~~~~H 831 (896)
.|+|+|++|.++++..-.|+-|....|
T Consensus 396 nk~~r~~i~~~~~k~~~~l~~~~~~~~ 422 (442)
T KOG4124|consen 396 NKPYRCEVCSKRYKNLNGLKYHRTHSH 422 (442)
T ss_pred cCcccChhhhhhhccCCCCCceeehhh
Confidence 478889999888887777776655433
No 159
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=34.00 E-value=24 Score=29.60 Aligned_cols=11 Identities=36% Similarity=1.213 Sum_probs=5.1
Q ss_pred ccCCCCCCCCC
Q psy16259 746 FACTDCGKSFP 756 (896)
Q Consensus 746 ~~C~~C~~~F~ 756 (896)
..|..||..|.
T Consensus 54 W~C~kCg~~fA 64 (89)
T COG1997 54 WKCRKCGAKFA 64 (89)
T ss_pred EEcCCCCCeec
Confidence 44444444443
No 160
>PF15269 zf-C2H2_7: Zinc-finger
Probab=33.82 E-value=23 Score=25.30 Aligned_cols=23 Identities=35% Similarity=0.723 Sum_probs=19.3
Q ss_pred CccCCCCCcccCCchhHHHHhhh
Q psy16259 779 PFKCDVCPKDFTCKGHLVSHLRS 801 (896)
Q Consensus 779 ~~~C~~C~k~f~~~~~L~~H~~~ 801 (896)
.|+|-+|..+...++.|-.||+.
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 37899999988888888888874
No 161
>KOG0119|consensus
Probab=33.48 E-value=6.8e+02 Score=28.56 Aligned_cols=122 Identities=15% Similarity=0.161 Sum_probs=0.0
Q ss_pred CCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCC
Q psy16259 18 TSMEVDTPPPPSSSRGSEP----PSPNTGGVSGGFISTSPSQPSPQHGH--HTLTSLTSPHQMPPPSPIATSPPPPSASD 91 (896)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~p----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ppppppppppp~p~~~~ 91 (896)
.....+.++-...+..-+| ++.............---|.+|+.+. +...-.++..|.||-+|+|...-...+..
T Consensus 397 Pg~~~p~~p~n~~p~~~pp~~~gps~q~~~~~sa~vp~g~~P~~Pp~~~~~p~~~~~ppg~p~pP~~p~Pg~~s~~~s~~ 476 (554)
T KOG0119|consen 397 PGTPIPRPPQNSAPSSIPPYGSGPSLQSASVHSAPVPGGLAPAYPPTSYAPPPQSGQPPGIPLPPHPPPPGMQSAQSSSL 476 (554)
T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCcccccccccCCCCcccCCCCCCccCCCCCccCCCCCCCCCCCCCCCCccccccccC
Q ss_pred C----------------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccc
Q psy16259 92 K----------------HDSQSSPHITQHTGPGIFQSPPPTSHISKIPSPSSSSPSSSSQLMQQ 139 (896)
Q Consensus 92 ~----------------~~~~~~~~~~~~~~~~~~~~ppp~pp~~~~p~~~~~~~~~~~~~~~~ 139 (896)
+ ...+++-+.........++++...+.-+|..++++.....+......
T Consensus 477 P~q~s~~~~pp~~r~a~~~a~Pg~p~~~~~~~~vPpp~g~~p~~pPgap~P~~~~~~s~~~~~P 540 (554)
T KOG0119|consen 477 PQQASTTSIPPGDRQAQAAAPPGAPFHGGNYNAVPPPPGLQPANPPGAPPPPPSPLFSLLNLAP 540 (554)
T ss_pred CcccccccCCcccccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhcccccCC
No 162
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=32.32 E-value=34 Score=26.40 Aligned_cols=8 Identities=50% Similarity=1.680 Sum_probs=4.5
Q ss_pred ccCCCCCC
Q psy16259 746 FACTDCGK 753 (896)
Q Consensus 746 ~~C~~C~~ 753 (896)
|.|+.||.
T Consensus 28 F~CPnCGe 35 (61)
T COG2888 28 FPCPNCGE 35 (61)
T ss_pred eeCCCCCc
Confidence 55555554
No 163
>PF15269 zf-C2H2_7: Zinc-finger
Probab=32.21 E-value=31 Score=24.71 Aligned_cols=23 Identities=35% Similarity=0.522 Sum_probs=19.7
Q ss_pred ccccCCCCCccCCHHHHHHHHhh
Q psy16259 465 PHVCQVCGKKYTRKEHLANHMRS 487 (896)
Q Consensus 465 ~~~C~~C~~~f~~~~~L~~H~~~ 487 (896)
.|+|-.|..+...++.|-.||+-
T Consensus 20 ~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 20 KYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred cceeecCCcccchHHHHHHHHHH
Confidence 37899999999999999999874
No 164
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=31.99 E-value=27 Score=23.45 Aligned_cols=8 Identities=25% Similarity=0.970 Sum_probs=3.2
Q ss_pred ccccCCCC
Q psy16259 719 VCNICSKP 726 (896)
Q Consensus 719 ~C~~C~~~ 726 (896)
.|..||..
T Consensus 2 ~C~~Cg~~ 9 (32)
T PF03604_consen 2 ICGECGAE 9 (32)
T ss_dssp BESSSSSS
T ss_pred CCCcCCCe
Confidence 34444443
No 165
>KOG2593|consensus
Probab=31.99 E-value=36 Score=37.63 Aligned_cols=38 Identities=26% Similarity=0.523 Sum_probs=20.3
Q ss_pred cCCCccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCC
Q psy16259 713 AGETEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGK 753 (896)
Q Consensus 713 ~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~ 753 (896)
+....|.|..|.+.|.....|+. +-.....|.|..|+-
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCGG 161 (436)
T ss_pred cccccccCCccccchhhhHHHHh---hcccCceEEEecCCC
Confidence 34455777777777665554432 111223466666664
No 166
>PRK04023 DNA polymerase II large subunit; Validated
Probab=31.40 E-value=46 Score=40.89 Aligned_cols=9 Identities=44% Similarity=0.988 Sum_probs=4.7
Q ss_pred cccccCCcc
Q psy16259 662 HKCHFCSKS 670 (896)
Q Consensus 662 ~~C~~C~k~ 670 (896)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 445555554
No 167
>KOG1044|consensus
Probab=30.06 E-value=19 Score=40.96 Aligned_cols=10 Identities=30% Similarity=0.889 Sum_probs=5.3
Q ss_pred ccCCccCccc
Q psy16259 550 HKCNYCAKSF 559 (896)
Q Consensus 550 ~~C~~C~~~f 559 (896)
|+|..|+...
T Consensus 161 fkc~~c~~vL 170 (670)
T KOG1044|consen 161 FKCKSCSAVL 170 (670)
T ss_pred eehhhhcccc
Confidence 5555555443
No 168
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=29.89 E-value=25 Score=31.08 Aligned_cols=10 Identities=30% Similarity=0.707 Sum_probs=4.9
Q ss_pred eecCcccccc
Q psy16259 382 YCCSVCMKTF 391 (896)
Q Consensus 382 ~~C~~C~~~F 391 (896)
+.|..|+..+
T Consensus 17 ~~C~~C~~~~ 26 (104)
T TIGR01384 17 YVCPSCGYEK 26 (104)
T ss_pred EECcCCCCcc
Confidence 4555555443
No 169
>PRK14873 primosome assembly protein PriA; Provisional
Probab=29.84 E-value=36 Score=40.97 Aligned_cols=7 Identities=43% Similarity=0.942 Sum_probs=3.2
Q ss_pred ccCCCCC
Q psy16259 746 FACTDCG 752 (896)
Q Consensus 746 ~~C~~C~ 752 (896)
..|.+||
T Consensus 411 l~Ch~CG 417 (665)
T PRK14873 411 PRCRWCG 417 (665)
T ss_pred eECCCCc
Confidence 4444444
No 170
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=29.55 E-value=28 Score=31.68 Aligned_cols=25 Identities=36% Similarity=0.348 Sum_probs=19.4
Q ss_pred CCcccccccccChhHHHHHhhhhcCCCc
Q psy16259 690 HKCDFCLKTFTRKEHLTNHLKQHAGETE 717 (896)
Q Consensus 690 ~~C~~C~k~f~~~~~L~~H~~~h~~~~~ 717 (896)
..|-++||.|. .|++|+.+|.|..+
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCCH
Confidence 46888888886 68888888887653
No 171
>KOG4167|consensus
Probab=29.26 E-value=11 Score=43.78 Aligned_cols=27 Identities=26% Similarity=0.434 Sum_probs=24.3
Q ss_pred CCccCCCCCcccCCchhHHHHhhhhcC
Q psy16259 778 RPFKCDVCPKDFTCKGHLVSHLRSHLN 804 (896)
Q Consensus 778 k~~~C~~C~k~f~~~~~L~~H~~~H~~ 804 (896)
.-|.|..|+|.|..-..+..||++|.-
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHHH
Confidence 358999999999999999999999963
No 172
>KOG3408|consensus
Probab=28.91 E-value=19 Score=32.13 Aligned_cols=24 Identities=25% Similarity=0.499 Sum_probs=16.4
Q ss_pred CCccCCCCCCCCCCchhHHHHHHh
Q psy16259 744 RPFACTDCGKSFPLKGNLLFHQRS 767 (896)
Q Consensus 744 k~~~C~~C~~~F~~~~~l~~H~~~ 767 (896)
..|.|-.|.+-|.+...|..|.++
T Consensus 56 GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 56 GQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred ceeehhhhhhhhcchHHHHHHHhc
Confidence 346677777777777777777654
No 173
>PRK04023 DNA polymerase II large subunit; Validated
Probab=28.46 E-value=56 Score=40.21 Aligned_cols=10 Identities=40% Similarity=0.890 Sum_probs=5.3
Q ss_pred CcccCccccc
Q psy16259 325 PFTCHVCGKA 334 (896)
Q Consensus 325 ~~~C~~C~~~ 334 (896)
...|..||..
T Consensus 626 ~RfCpsCG~~ 635 (1121)
T PRK04023 626 RRKCPSCGKE 635 (1121)
T ss_pred CccCCCCCCc
Confidence 3456666654
No 174
>KOG2272|consensus
Probab=28.12 E-value=53 Score=33.23 Aligned_cols=17 Identities=24% Similarity=0.667 Sum_probs=8.7
Q ss_pred eecCccccccCChHHHH
Q psy16259 382 YCCSVCMKTFARKEHLE 398 (896)
Q Consensus 382 ~~C~~C~~~F~~~~~L~ 398 (896)
|.|++|++...+...++
T Consensus 100 F~Cd~Cn~~Lad~gf~r 116 (332)
T KOG2272|consen 100 FRCDLCNKHLADQGFYR 116 (332)
T ss_pred chhHHHHHHHhhhhhHh
Confidence 55555555554444433
No 175
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=27.68 E-value=47 Score=38.71 Aligned_cols=10 Identities=30% Similarity=0.806 Sum_probs=4.7
Q ss_pred CCccCCCCCC
Q psy16259 744 RPFACTDCGK 753 (896)
Q Consensus 744 k~~~C~~C~~ 753 (896)
....|.+||+
T Consensus 239 ~~l~Ch~Cg~ 248 (505)
T TIGR00595 239 GKLRCHYCGY 248 (505)
T ss_pred CeEEcCCCcC
Confidence 3345555554
No 176
>PF12907 zf-met2: Zinc-binding
Probab=26.46 E-value=17 Score=25.75 Aligned_cols=26 Identities=19% Similarity=0.572 Sum_probs=12.6
Q ss_pred ccCCccCcccc---chhhHHhhhccccCC
Q psy16259 808 HVCPHCSKSFV---EHTCMIKHIRKNHTL 833 (896)
Q Consensus 808 ~~C~~C~~~f~---~~~~l~~H~~~~H~~ 833 (896)
++|.+|...|. ....|+.|...+|+.
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK 30 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPK 30 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCC
Confidence 34555554442 223455555555554
No 177
>KOG2272|consensus
Probab=26.26 E-value=36 Score=34.37 Aligned_cols=17 Identities=35% Similarity=0.821 Sum_probs=10.1
Q ss_pred ccccccccccccChhhh
Q psy16259 437 PHRCEICKKAFTRKEHY 453 (896)
Q Consensus 437 ~~~C~~C~~~f~~~~~L 453 (896)
-|.|..|.+-|.....+
T Consensus 221 HFvCa~CekPFlGHrHY 237 (332)
T KOG2272|consen 221 HFVCAKCEKPFLGHRHY 237 (332)
T ss_pred heeehhcCCcccchhhh
Confidence 36677777766554433
No 178
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=26.04 E-value=51 Score=24.77 Aligned_cols=11 Identities=27% Similarity=1.041 Sum_probs=5.5
Q ss_pred ccCCCCCCCcc
Q psy16259 606 YQCQYCPKAFS 616 (896)
Q Consensus 606 ~~C~~C~k~f~ 616 (896)
|+|..||+.+.
T Consensus 2 y~C~~CgyiYd 12 (50)
T cd00730 2 YECRICGYIYD 12 (50)
T ss_pred cCCCCCCeEEC
Confidence 44555555444
No 179
>KOG2593|consensus
Probab=25.99 E-value=61 Score=35.91 Aligned_cols=16 Identities=31% Similarity=0.686 Sum_probs=7.4
Q ss_pred CcccCccCCccCChHH
Q psy16259 493 PFKCEICTKSFTRKEH 508 (896)
Q Consensus 493 ~~~C~~C~~~f~~~~~ 508 (896)
-|.|..|.+.|.....
T Consensus 128 ~Y~Cp~C~kkyt~Lea 143 (436)
T KOG2593|consen 128 GYVCPNCQKKYTSLEA 143 (436)
T ss_pred cccCCccccchhhhHH
Confidence 3455555554444333
No 180
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=25.62 E-value=32 Score=27.63 Aligned_cols=27 Identities=30% Similarity=0.751 Sum_probs=11.1
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCCCccCCccCcccc
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSKSFV 818 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~ 818 (896)
-.|.+|++.|.. -.+.+.|..||..|=
T Consensus 10 ~~C~~C~~~F~~------------~~rrhhCr~CG~~vC 36 (69)
T PF01363_consen 10 SNCMICGKKFSL------------FRRRHHCRNCGRVVC 36 (69)
T ss_dssp SB-TTT--B-BS------------SS-EEE-TTT--EEE
T ss_pred CcCcCcCCcCCC------------ceeeEccCCCCCEEC
Confidence 347777777742 134556777777664
No 181
>KOG2807|consensus
Probab=25.19 E-value=1.1e+02 Score=32.59 Aligned_cols=18 Identities=33% Similarity=0.781 Sum_probs=8.8
Q ss_pred CcccccCCcccCChHHHH
Q psy16259 661 PHKCHFCSKSFTRKEHLN 678 (896)
Q Consensus 661 ~~~C~~C~k~f~~~~~L~ 678 (896)
|..|++|+-......+|.
T Consensus 290 P~eCpiC~ltLVss~hLA 307 (378)
T KOG2807|consen 290 PIECPICSLTLVSSPHLA 307 (378)
T ss_pred CccCCccceeEecchHHH
Confidence 445555555544444443
No 182
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=24.87 E-value=77 Score=22.64 Aligned_cols=24 Identities=29% Similarity=0.736 Sum_probs=16.1
Q ss_pred cccCCCCCCCCC--hHHHHHHhhhcC
Q psy16259 298 HTCDLCGKFFPY--RYQMIVHRRYHT 321 (896)
Q Consensus 298 ~~C~~C~~~f~~--~~~L~~H~~~h~ 321 (896)
.+|..||..|.. ...-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 578888887754 455566766653
No 183
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=24.67 E-value=35 Score=41.17 Aligned_cols=49 Identities=22% Similarity=0.750 Sum_probs=34.6
Q ss_pred cccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHH
Q psy16259 718 HVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVS 797 (896)
Q Consensus 718 ~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~ 797 (896)
..|..||.. ++|+.|+..+ ..|+.. ...+|..||..
T Consensus 436 l~C~~Cg~v-------------------~~Cp~Cd~~l------t~H~~~----------~~L~CH~Cg~~--------- 471 (730)
T COG1198 436 LLCRDCGYI-------------------AECPNCDSPL------TLHKAT----------GQLRCHYCGYQ--------- 471 (730)
T ss_pred eecccCCCc-------------------ccCCCCCcce------EEecCC----------CeeEeCCCCCC---------
Confidence 459889887 5799998763 344432 34678888753
Q ss_pred HhhhhcCCCCccCCccCcc
Q psy16259 798 HLRSHLNERPHVCPHCSKS 816 (896)
Q Consensus 798 H~~~H~~~~~~~C~~C~~~ 816 (896)
...|..|+.||-.
T Consensus 472 ------~~~p~~Cp~Cgs~ 484 (730)
T COG1198 472 ------EPIPQSCPECGSE 484 (730)
T ss_pred ------CCCCCCCCCCCCC
Confidence 3478899999854
No 184
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=24.10 E-value=31 Score=22.80 Aligned_cols=24 Identities=33% Similarity=0.913 Sum_probs=9.1
Q ss_pred cCCCCCcccCCchhHHHHhhhhcCCCCccCCccCc
Q psy16259 781 KCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCSK 815 (896)
Q Consensus 781 ~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 815 (896)
+|+.|+..+.. .+..-|.|+.|+.
T Consensus 4 ~Cp~C~se~~y-----------~D~~~~vCp~C~~ 27 (30)
T PF08274_consen 4 KCPLCGSEYTY-----------EDGELLVCPECGH 27 (30)
T ss_dssp --TTT-----E-----------E-SSSEEETTTTE
T ss_pred CCCCCCCccee-----------ccCCEEeCCcccc
Confidence 36666655543 3344566666654
No 185
>KOG4124|consensus
Probab=23.72 E-value=51 Score=35.05 Aligned_cols=26 Identities=19% Similarity=0.410 Sum_probs=17.2
Q ss_pred CCCccccccCC--CCCCCCChHHHHHHh
Q psy16259 292 LPVMGNHTCDL--CGKFFPYRYQMIVHR 317 (896)
Q Consensus 292 ~~~~~~~~C~~--C~~~f~~~~~L~~H~ 317 (896)
+...+.|.|.+ |++.+.....|..|.
T Consensus 344 ~~~~~~~~~~vp~~~~~~~n~ng~~~~~ 371 (442)
T KOG4124|consen 344 VVVDKPYKCPVPNCDKAYKNQNGLKYHK 371 (442)
T ss_pred EEecCCCCCCCCcchhhcccCcceeecc
Confidence 33455677754 888887777776664
No 186
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=23.65 E-value=62 Score=32.86 Aligned_cols=13 Identities=15% Similarity=0.585 Sum_probs=6.8
Q ss_pred ccccCCCCCccCC
Q psy16259 465 PHVCQVCGKKYTR 477 (896)
Q Consensus 465 ~~~C~~C~~~f~~ 477 (896)
.|.|..|+..|..
T Consensus 155 ef~C~~C~h~F~G 167 (278)
T PF15135_consen 155 EFHCPKCRHNFRG 167 (278)
T ss_pred eeecccccccchh
Confidence 3555555555543
No 187
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=23.43 E-value=44 Score=30.45 Aligned_cols=24 Identities=29% Similarity=0.378 Sum_probs=19.6
Q ss_pred ccCCCCCcccCCchhHHHHhhhhcCCC
Q psy16259 780 FKCDVCPKDFTCKGHLVSHLRSHLNER 806 (896)
Q Consensus 780 ~~C~~C~k~f~~~~~L~~H~~~H~~~~ 806 (896)
..|-++||.|+ .|++|+.+|.|--
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCCC
Confidence 56999999995 6899999987743
No 188
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=23.17 E-value=81 Score=23.07 Aligned_cols=8 Identities=38% Similarity=0.987 Sum_probs=3.5
Q ss_pred cccccCCC
Q psy16259 718 HVCNICSK 725 (896)
Q Consensus 718 ~~C~~C~~ 725 (896)
+.|+.||.
T Consensus 19 ~~CP~Cg~ 26 (46)
T PF12760_consen 19 FVCPHCGS 26 (46)
T ss_pred CCCCCCCC
Confidence 44444443
No 189
>KOG0696|consensus
Probab=23.15 E-value=30 Score=37.96 Aligned_cols=35 Identities=26% Similarity=0.540 Sum_probs=16.6
Q ss_pred ccCCccCccCCChhHHHHH-HHhhcCCCCcccccCC
Q psy16259 634 HKCSYCKQSFTRKDHLNNH-VRLHTGESPHKCHFCS 668 (896)
Q Consensus 634 ~~C~~C~~~F~~~~~L~~H-~~~h~~~~~~~C~~C~ 668 (896)
|.|+-=++.+.+-..-..| .++|+-..|--|++||
T Consensus 93 F~CPGadkg~dtDdpr~kHkf~~~tYssPTFCDhCG 128 (683)
T KOG0696|consen 93 FSCPGADKGPDTDDPRSKHKFKIHTYSSPTFCDHCG 128 (683)
T ss_pred EECCCCCCCCCCCCcccccceeeeecCCCchhhhHH
Confidence 4454444555444444444 2344444444555554
No 190
>PF09416 UPF1_Zn_bind: RNA helicase (UPF2 interacting domain); InterPro: IPR018999 UPF1 (or regulator of nonsense transcripts 1 homologue) is an essential RNA helicase that detects mRNAs containing premature stop codons and triggers their degradation. This domain contains 3 zinc binding motifs and forms interactions with another protein (UPF2) that is also involved nonsense-mediated mRNA decay (NMD) []. ; GO: 0003677 DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0000184 nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, 0005737 cytoplasm; PDB: 2IYK_B 2WJY_A 2WJV_A 2XZL_A.
Probab=22.49 E-value=61 Score=30.65 Aligned_cols=15 Identities=33% Similarity=0.620 Sum_probs=8.1
Q ss_pred cCCCCcccccCCccc
Q psy16259 657 TGESPHKCHFCSKSF 671 (896)
Q Consensus 657 ~~~~~~~C~~C~k~f 671 (896)
...-..+|..|+|-|
T Consensus 10 ~p~~vv~C~~c~kWF 24 (152)
T PF09416_consen 10 DPSCVVKCNTCNKWF 24 (152)
T ss_dssp -CCCEEEETTTTEEE
T ss_pred CcccEeEcCCCCcEe
Confidence 334445666666665
No 191
>PF14353 CpXC: CpXC protein
Probab=22.47 E-value=26 Score=32.24 Aligned_cols=22 Identities=23% Similarity=0.508 Sum_probs=13.3
Q ss_pred cccCCCCCCCcccccccccccc
Q psy16259 605 PYQCQYCPKAFSRKDHLVNHVR 626 (896)
Q Consensus 605 ~~~C~~C~k~f~~~~~L~~H~~ 626 (896)
.|.|+.||..|.-...+.-|-.
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~D~ 59 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYHDP 59 (128)
T ss_pred EEECCCCCCceecCCCEEEEcC
Confidence 3567777777666555555543
No 192
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=22.32 E-value=57 Score=35.01 Aligned_cols=78 Identities=19% Similarity=0.428 Sum_probs=40.9
Q ss_pred CccccccCCCCCCChhhHhhhhhcccCCCCccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhH
Q psy16259 716 TEHVCNICSKPFLKKEHLVNHSRSHTGERPFACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHL 795 (896)
Q Consensus 716 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~k~~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L 795 (896)
..-.|++||-.=.- + + .+.....|.|-..|..|+-... -...+|..||.. ..|
T Consensus 186 ~~~~CPvCGs~P~~-s-~-v~~~~~~G~RyL~CslC~teW~--------------------~~R~~C~~Cg~~----~~l 238 (309)
T PRK03564 186 QRQFCPVCGSMPVS-S-V-VQIGTTQGLRYLHCNLCESEWH--------------------VVRVKCSNCEQS----GKL 238 (309)
T ss_pred CCCCCCCCCCcchh-h-e-eeccCCCCceEEEcCCCCCccc--------------------ccCccCCCCCCC----Cce
Confidence 34568888765321 1 1 1223345667777888876432 123568888852 122
Q ss_pred HHH-hhhh-cCCCCccCCccCccccch
Q psy16259 796 VSH-LRSH-LNERPHVCPHCSKSFVEH 820 (896)
Q Consensus 796 ~~H-~~~H-~~~~~~~C~~C~~~f~~~ 820 (896)
... +..- .+.+...|..|+--++..
T Consensus 239 ~y~~~~~~~~~~r~e~C~~C~~YlK~~ 265 (309)
T PRK03564 239 HYWSLDSEQAAVKAESCGDCGTYLKIL 265 (309)
T ss_pred eeeeecCCCcceEeeecccccccceec
Confidence 211 0000 123557788888766653
No 193
>KOG4167|consensus
Probab=21.42 E-value=24 Score=41.26 Aligned_cols=26 Identities=23% Similarity=0.476 Sum_probs=21.7
Q ss_pred CCeecCccccccCChHHHHHHHhhcc
Q psy16259 380 RPYCCSVCMKTFARKEHLENHTRCHT 405 (896)
Q Consensus 380 ~~~~C~~C~~~F~~~~~L~~H~~~h~ 405 (896)
..|.|.+|++.|....+++.||+.|.
T Consensus 791 giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 791 GIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred ceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 45889999999988888888888874
No 194
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=21.38 E-value=62 Score=23.99 Aligned_cols=7 Identities=71% Similarity=1.683 Sum_probs=2.2
Q ss_pred cccCCCC
Q psy16259 466 HVCQVCG 472 (896)
Q Consensus 466 ~~C~~C~ 472 (896)
|.|++|+
T Consensus 35 w~CP~C~ 41 (47)
T PF00301_consen 35 WVCPVCG 41 (47)
T ss_dssp -B-TTTS
T ss_pred CcCcCCC
Confidence 4444443
No 195
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=21.36 E-value=30 Score=22.41 Aligned_cols=17 Identities=35% Similarity=0.606 Sum_probs=7.2
Q ss_pred ccCccccccCChHHHhhh
Q psy16259 327 TCHVCGKAFDFQAELMSH 344 (896)
Q Consensus 327 ~C~~C~~~f~~~~~L~~H 344 (896)
.|-.|++.| .......|
T Consensus 2 sCiDC~~~F-~~~~y~~H 18 (28)
T PF08790_consen 2 SCIDCSKDF-DGDSYKSH 18 (28)
T ss_dssp EETTTTEEE-EGGGTTT-
T ss_pred eeecCCCCc-CcCCcCCC
Confidence 355555555 33334444
No 196
>KOG2807|consensus
Probab=20.70 E-value=1.1e+02 Score=32.54 Aligned_cols=91 Identities=19% Similarity=0.330 Sum_probs=57.5
Q ss_pred CCccccccCCCCCCCCChHHHHHHhhhcCCCCCcccCccccccCChHHHhhhhc----------cccCC--CccccCccC
Q psy16259 293 PVMGNHTCDLCGKFFPYRYQMIVHRRYHTERKPFTCHVCGKAFDFQAELMSHGK----------CHLGG--SMYTCAVCF 360 (896)
Q Consensus 293 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~----------~h~~~--~~~~C~~C~ 360 (896)
...+.|.|+.|+...- ..|-.|++|+-+....-.|.+-.+ +-..+ +.-.|-.|+
T Consensus 272 ~~~~Gy~CP~CkakvC--------------sLP~eCpiC~ltLVss~hLARSyhhL~PL~~F~Eip~~~~~~~~~Cf~C~ 337 (378)
T KOG2807|consen 272 LSGGGYFCPQCKAKVC--------------SLPIECPICSLTLVSSPHLARSYHHLFPLKPFVEIPETEYNGSRFCFACQ 337 (378)
T ss_pred cccCceeCCcccCeee--------------cCCccCCccceeEecchHHHHHHHhhcCCcchhhccccccCCCcceeeec
Confidence 3456789999976432 357789999988877766655321 11111 112266772
Q ss_pred cccCChHHHHHHHhhccCCCCeecCccccccCChHHHHHHHhhccCCCceecCCCC
Q psy16259 361 HVFANEESLQRHTKRHSIDRPYCCSVCMKTFARKEHLENHTRCHTGETPYKCQYCG 416 (896)
Q Consensus 361 ~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~F~~~~~L~~H~~~h~~~~~~~C~~C~ 416 (896)
... .....|.|..|...|-.--+.-.|...| .|..|.
T Consensus 338 ~~~-------------~~~~~y~C~~Ck~~FCldCDv~iHesLh------~CpgCe 374 (378)
T KOG2807|consen 338 GEL-------------LSSGRYRCESCKNVFCLDCDVFIHESLH------NCPGCE 374 (378)
T ss_pred ccc-------------CCCCcEEchhccceeeccchHHHHhhhh------cCCCcC
Confidence 221 1334689999999998888888887766 466675
No 197
>PTZ00303 phosphatidylinositol kinase; Provisional
Probab=20.54 E-value=29 Score=40.73 Aligned_cols=13 Identities=31% Similarity=0.754 Sum_probs=9.8
Q ss_pred cccCCCCCCCCCh
Q psy16259 298 HTCDLCGKFFPYR 310 (896)
Q Consensus 298 ~~C~~C~~~f~~~ 310 (896)
..|..|++.|.+.
T Consensus 461 dtC~~C~kkFfSl 473 (1374)
T PTZ00303 461 DSCPSCGRAFISL 473 (1374)
T ss_pred CcccCcCCccccc
Confidence 4588888888654
No 198
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=20.52 E-value=43 Score=34.92 Aligned_cols=55 Identities=22% Similarity=0.540 Sum_probs=34.8
Q ss_pred ccCCCCCCCCCCchhHHHHHHhhcCCCCccccCCccCCCCCcccCCchhHHHHhhhhcCCCCccCCccC
Q psy16259 746 FACTDCGKSFPLKGNLLFHQRSHRKQNGEVLERPFKCDVCPKDFTCKGHLVSHLRSHLNERPHVCPHCS 814 (896)
Q Consensus 746 ~~C~~C~~~F~~~~~l~~H~~~H~~~~~~~~~k~~~C~~C~k~f~~~~~L~~H~~~H~~~~~~~C~~C~ 814 (896)
-.|-.|.-.|.....-.. ........|+|+.|...|-.--..-.|...|. |..|.
T Consensus 363 ~~Cf~CQ~~fp~~~~~~~--------~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh~------C~gCe 417 (421)
T COG5151 363 THCFVCQGPFPKPPVSPF--------DESTSSGRYQCELCKSTFCSDCDVFIHETLHF------CIGCE 417 (421)
T ss_pred ccceeccCCCCCCCCCcc--------cccccccceechhhhhhhhhhhHHHHHHHHhh------CCCCc
Confidence 346667776664422111 11111346999999999998888888887773 66664
No 199
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=20.19 E-value=44 Score=25.18 Aligned_cols=26 Identities=19% Similarity=0.364 Sum_probs=13.8
Q ss_pred CccccccCCCCCCChhhHhhhhhccc
Q psy16259 716 TEHVCNICSKPFLKKEHLVNHSRSHT 741 (896)
Q Consensus 716 ~~~~C~~C~~~f~~~~~L~~H~~~H~ 741 (896)
..|+|+.|+..|=..=++-.|..+|.
T Consensus 20 ~~y~C~~C~~~FC~dCD~fiHE~LH~ 45 (51)
T PF07975_consen 20 SRYRCPKCKNHFCIDCDVFIHETLHN 45 (51)
T ss_dssp EEE--TTTT--B-HHHHHTTTTTS-S
T ss_pred CeEECCCCCCccccCcChhhhccccC
Confidence 45777777777777777777766653
No 200
>KOG1044|consensus
Probab=20.01 E-value=88 Score=35.89 Aligned_cols=43 Identities=23% Similarity=0.486 Sum_probs=22.9
Q ss_pred CCCCCccCCccCcccCChhhhhcccccccCCCCCcCccccCcccc
Q psy16259 545 TGESPHKCNYCAKSFTRKEHLVNHVRQHTGDSPHKCNYCAKSFTR 589 (896)
Q Consensus 545 ~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~k~f~~ 589 (896)
++.....|..|++...-. .|+.+ ..|-..+-|.|..||.....
T Consensus 12 ~~~~~i~c~~c~~kc~ge-vlrv~-d~~fhi~cf~c~~cg~~la~ 54 (670)
T KOG1044|consen 12 TGKQGIKCDKCRKKCSGE-VLRVN-DNHFHINCFQCKKCGRNLAE 54 (670)
T ss_pred ccccceehhhhCCccccc-eeEee-ccccceeeeeccccCCCccc
Confidence 344455677887765432 22222 22333455778888776543
Done!