Query psy366
Match_columns 734
No_of_seqs 509 out of 4027
Neff 10.6
Searched_HMMs 46136
Date Fri Aug 16 22:22:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy366.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/366hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.1E-30 2.3E-35 267.9 2.5 234 429-707 605-936 (958)
2 KOG1074|consensus 100.0 4.7E-30 1E-34 263.2 5.3 199 531-733 604-934 (958)
3 KOG3608|consensus 99.9 3.6E-27 7.9E-32 217.7 13.1 233 459-731 135-377 (467)
4 KOG2462|consensus 99.9 5.5E-28 1.2E-32 218.6 4.9 135 589-728 128-265 (279)
5 KOG3608|consensus 99.9 2.4E-26 5.2E-31 212.3 14.1 194 344-586 178-376 (467)
6 KOG2462|consensus 99.9 1.6E-27 3.4E-32 215.7 5.0 135 531-670 129-264 (279)
7 KOG3623|consensus 99.9 2.1E-23 4.6E-28 210.4 4.8 114 430-553 211-330 (1007)
8 KOG3623|consensus 99.9 1.1E-22 2.5E-27 205.2 8.6 107 344-450 211-330 (1007)
9 KOG3576|consensus 99.7 1.1E-18 2.3E-23 149.0 2.1 114 617-732 115-238 (267)
10 KOG3576|consensus 99.7 6E-18 1.3E-22 144.5 1.7 86 559-645 114-199 (267)
11 PLN03086 PRLI-interacting fact 99.2 2.3E-11 5E-16 126.4 6.4 97 620-726 454-560 (567)
12 PLN03086 PRLI-interacting fact 99.2 3.1E-11 6.7E-16 125.4 6.2 146 533-702 408-565 (567)
13 PHA00733 hypothetical protein 99.0 1.7E-10 3.7E-15 97.2 4.5 52 675-729 71-122 (128)
14 KOG1146|consensus 99.0 6.8E-11 1.5E-15 130.2 1.7 77 60-136 438-539 (1406)
15 PHA00733 hypothetical protein 99.0 3.7E-10 7.9E-15 95.2 5.0 97 602-702 23-124 (128)
16 PHA02768 hypothetical protein; 98.9 5.4E-10 1.2E-14 75.7 1.8 44 649-694 5-48 (55)
17 KOG3993|consensus 98.7 2.3E-09 5E-14 103.4 1.1 168 533-704 268-485 (500)
18 KOG3993|consensus 98.7 1.8E-09 3.8E-14 104.2 -0.2 113 430-558 268-382 (500)
19 PHA02768 hypothetical protein; 98.7 6.3E-09 1.4E-13 70.6 1.9 42 88-131 6-47 (55)
20 PF13465 zf-H2C2_2: Zinc-finge 98.6 1.1E-08 2.3E-13 59.3 1.0 25 664-688 1-25 (26)
21 KOG1146|consensus 98.5 7.3E-08 1.6E-12 106.9 5.6 134 5-138 463-641 (1406)
22 PF13465 zf-H2C2_2: Zinc-finge 98.5 6.7E-08 1.4E-12 55.9 1.6 26 606-631 1-26 (26)
23 PHA00616 hypothetical protein 98.3 1.4E-07 3.1E-12 60.6 1.1 34 649-682 1-34 (44)
24 PHA00616 hypothetical protein 98.3 5.2E-07 1.1E-11 58.1 2.2 34 591-624 1-34 (44)
25 PHA00732 hypothetical protein 98.3 5.3E-07 1.1E-11 68.3 2.6 44 649-698 1-45 (79)
26 PHA00732 hypothetical protein 98.1 1.9E-06 4E-11 65.3 2.7 49 619-674 1-49 (79)
27 PF05605 zf-Di19: Drought indu 98.1 4E-06 8.7E-11 59.0 4.1 52 677-731 2-54 (54)
28 PF05605 zf-Di19: Drought indu 98.0 4.5E-06 9.7E-11 58.7 3.4 37 620-657 3-39 (54)
29 PF00096 zf-C2H2: Zinc finger, 97.8 1.1E-05 2.5E-10 45.3 1.5 23 707-729 1-23 (23)
30 PF00096 zf-C2H2: Zinc finger, 97.6 2.3E-05 5E-10 44.1 1.1 22 650-671 1-22 (23)
31 PF13894 zf-C2H2_4: C2H2-type 97.5 5.3E-05 1.2E-09 43.2 1.8 24 707-730 1-24 (24)
32 PF13912 zf-C2H2_6: C2H2-type 97.4 8.2E-05 1.8E-09 43.7 1.7 26 706-731 1-26 (27)
33 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00013 2.8E-09 59.9 2.2 71 622-699 2-72 (100)
34 PF09237 GAGA: GAGA factor; I 97.2 0.00019 4.2E-09 47.2 2.0 41 694-734 12-52 (54)
35 PF12756 zf-C2H2_2: C2H2 type 97.2 0.00014 3E-09 59.7 1.6 22 650-671 51-72 (100)
36 COG5189 SFP1 Putative transcri 97.1 0.00014 3.1E-09 68.1 0.7 23 704-726 396-418 (423)
37 PF13894 zf-C2H2_4: C2H2-type 97.1 0.00022 4.8E-09 40.5 1.3 21 651-671 2-22 (24)
38 COG5189 SFP1 Putative transcri 97.1 0.00013 2.9E-09 68.3 0.2 71 56-136 347-419 (423)
39 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00029 6.2E-09 41.4 1.5 23 650-672 2-24 (27)
40 KOG2231|consensus 96.9 0.0017 3.8E-08 69.4 6.3 104 564-681 117-240 (669)
41 smart00355 ZnF_C2H2 zinc finge 96.7 0.0015 3.2E-08 37.8 2.2 24 707-730 1-24 (26)
42 PF09237 GAGA: GAGA factor; I 96.6 0.00072 1.6E-08 44.6 0.8 31 84-114 21-51 (54)
43 PF13909 zf-H2C2_5: C2H2-type 96.4 0.002 4.4E-08 36.5 1.5 24 707-731 1-24 (24)
44 PF12874 zf-met: Zinc-finger o 96.3 0.0021 4.6E-08 36.8 1.4 23 707-729 1-23 (25)
45 smart00355 ZnF_C2H2 zinc finge 96.3 0.0025 5.4E-08 36.8 1.8 22 651-672 2-23 (26)
46 KOG2231|consensus 96.1 0.014 3E-07 62.7 7.3 123 534-672 117-262 (669)
47 PRK04860 hypothetical protein; 95.9 0.0037 7.9E-08 55.0 1.4 40 648-691 118-157 (160)
48 COG5236 Uncharacterized conser 95.9 0.0051 1.1E-07 58.7 2.4 130 87-226 151-306 (493)
49 COG5236 Uncharacterized conser 95.6 0.0064 1.4E-07 58.0 2.1 123 568-703 159-307 (493)
50 KOG2482|consensus 95.6 0.013 2.7E-07 56.2 4.1 153 574-729 127-357 (423)
51 PF12874 zf-met: Zinc-finger o 95.6 0.0058 1.3E-07 35.0 1.1 21 651-671 2-22 (25)
52 PF13909 zf-H2C2_5: C2H2-type 95.6 0.0053 1.2E-07 34.7 0.9 20 651-671 2-21 (24)
53 KOG2785|consensus 95.3 0.053 1.1E-06 53.4 6.9 80 371-450 3-89 (390)
54 PF12171 zf-C2H2_jaz: Zinc-fin 95.2 0.006 1.3E-07 35.7 0.3 22 707-728 2-23 (27)
55 KOG2785|consensus 95.2 0.049 1.1E-06 53.6 6.6 75 285-365 165-242 (390)
56 PF12171 zf-C2H2_jaz: Zinc-fin 94.9 0.017 3.7E-07 33.7 1.6 21 620-640 2-22 (27)
57 KOG2482|consensus 94.8 0.033 7.3E-07 53.4 4.0 132 5-138 193-357 (423)
58 PRK04860 hypothetical protein; 94.7 0.015 3.2E-07 51.2 1.5 38 87-128 119-156 (160)
59 COG5048 FOG: Zn-finger [Genera 94.5 0.012 2.5E-07 63.9 0.5 153 561-714 288-454 (467)
60 COG5048 FOG: Zn-finger [Genera 93.8 0.024 5.3E-07 61.4 1.1 141 590-733 288-445 (467)
61 PF13913 zf-C2HC_2: zinc-finge 93.1 0.052 1.1E-06 30.9 1.2 22 8-30 3-24 (25)
62 PF13913 zf-C2HC_2: zinc-finge 92.3 0.11 2.3E-06 29.6 1.7 19 708-727 4-22 (25)
63 smart00451 ZnF_U1 U1-like zinc 92.0 0.11 2.3E-06 32.6 1.7 23 706-728 3-25 (35)
64 TIGR00622 ssl1 transcription f 90.7 0.45 9.8E-06 38.6 4.4 22 617-638 13-34 (112)
65 KOG2893|consensus 90.0 0.097 2.1E-06 47.4 0.2 46 60-110 12-58 (341)
66 smart00451 ZnF_U1 U1-like zinc 89.6 0.21 4.5E-06 31.2 1.4 22 619-640 3-24 (35)
67 KOG4173|consensus 88.2 0.26 5.5E-06 43.8 1.5 77 563-643 80-170 (253)
68 KOG4173|consensus 88.0 0.29 6.2E-06 43.5 1.6 79 57-139 78-170 (253)
69 cd00350 rubredoxin_like Rubred 87.3 0.38 8.1E-06 29.6 1.4 9 676-684 16-24 (33)
70 KOG2893|consensus 85.7 0.27 5.9E-06 44.6 0.3 49 147-199 12-60 (341)
71 COG4049 Uncharacterized protei 80.5 0.81 1.8E-05 31.0 0.9 29 112-140 14-42 (65)
72 PF12013 DUF3505: Protein of u 79.1 2.9 6.3E-05 34.5 4.0 25 707-731 81-109 (109)
73 COG4049 Uncharacterized protei 78.9 0.75 1.6E-05 31.2 0.4 30 615-644 13-42 (65)
74 PF12013 DUF3505: Protein of u 78.5 1.9 4.1E-05 35.7 2.7 24 678-701 81-108 (109)
75 TIGR00622 ssl1 transcription f 77.0 2 4.4E-05 34.9 2.4 92 56-169 13-105 (112)
76 cd00729 rubredoxin_SM Rubredox 74.3 1.8 4E-05 26.8 1.2 8 677-684 18-25 (34)
77 PF13717 zinc_ribbon_4: zinc-r 73.8 2 4.4E-05 27.0 1.3 32 534-572 4-35 (36)
78 PF09986 DUF2225: Uncharacteri 73.6 1.2 2.6E-05 41.9 0.3 23 618-640 4-26 (214)
79 PF09538 FYDLN_acid: Protein o 70.7 2.2 4.7E-05 34.9 1.1 14 114-127 25-38 (108)
80 PF09986 DUF2225: Uncharacteri 70.4 1.4 3E-05 41.5 -0.0 12 708-719 50-61 (214)
81 PF10571 UPF0547: Uncharacteri 69.6 2.8 6E-05 24.1 1.1 10 651-660 16-25 (26)
82 PF13719 zinc_ribbon_5: zinc-r 68.1 3.1 6.8E-05 26.3 1.2 32 534-572 4-35 (37)
83 TIGR02098 MJ0042_CXXC MJ0042 f 65.6 3.7 7.9E-05 26.1 1.2 13 534-546 4-16 (38)
84 PF12907 zf-met2: Zinc-binding 64.5 2.4 5.3E-05 27.2 0.2 29 678-706 2-33 (40)
85 KOG2186|consensus 63.7 3.8 8.3E-05 38.2 1.4 46 592-640 4-49 (276)
86 COG1592 Rubrerythrin [Energy p 62.4 4.1 9E-05 36.0 1.3 24 429-466 134-157 (166)
87 PF15269 zf-C2H2_7: Zinc-finge 62.2 5.8 0.00013 25.6 1.6 27 141-167 16-42 (54)
88 PRK00398 rpoP DNA-directed RNA 61.3 4.8 0.0001 27.0 1.2 11 650-660 4-14 (46)
89 COG1592 Rubrerythrin [Energy p 61.0 4.1 8.8E-05 36.1 1.0 24 561-598 133-156 (166)
90 PF09538 FYDLN_acid: Protein o 59.8 7.2 0.00016 31.9 2.2 33 143-186 7-39 (108)
91 COG2888 Predicted Zn-ribbon RN 59.4 7.2 0.00016 27.3 1.7 9 706-714 50-58 (61)
92 PF03604 DNA_RNApol_7kD: DNA d 59.3 4.7 0.0001 24.5 0.8 11 620-630 1-11 (32)
93 smart00734 ZnF_Rad18 Rad18-lik 57.8 9.3 0.0002 22.0 1.8 19 60-79 3-21 (26)
94 KOG2186|consensus 57.6 4.8 0.00011 37.6 1.0 54 116-174 4-57 (276)
95 PF02176 zf-TRAF: TRAF-type zi 57.4 5.6 0.00012 28.4 1.1 26 662-687 23-52 (60)
96 TIGR00373 conserved hypothetic 56.4 9.7 0.00021 33.9 2.7 17 591-607 109-125 (158)
97 smart00614 ZnF_BED BED zinc fi 55.5 8.1 0.00017 26.5 1.6 20 679-698 20-44 (50)
98 PF02892 zf-BED: BED zinc fing 54.0 7.2 0.00016 25.9 1.1 10 651-660 18-27 (45)
99 TIGR00373 conserved hypothetic 53.9 10 0.00022 33.7 2.4 17 87-103 109-125 (158)
100 TIGR02300 FYDLN_acid conserved 53.0 7.8 0.00017 32.2 1.3 10 89-98 11-20 (129)
101 smart00531 TFIIE Transcription 52.4 12 0.00026 32.9 2.6 14 618-631 98-111 (147)
102 PHA00626 hypothetical protein 52.1 9.5 0.00021 26.3 1.4 13 676-688 22-34 (59)
103 PRK06266 transcription initiat 51.5 5.9 0.00013 36.0 0.5 17 591-607 117-133 (178)
104 PRK14890 putative Zn-ribbon RN 50.7 9.6 0.00021 26.8 1.3 7 592-598 26-32 (59)
105 PRK06266 transcription initiat 50.6 13 0.00027 33.9 2.5 37 613-660 111-147 (178)
106 smart00659 RPOLCX RNA polymera 50.1 9.6 0.00021 25.3 1.2 10 620-629 3-12 (44)
107 PF02176 zf-TRAF: TRAF-type zi 49.2 7.8 0.00017 27.7 0.7 42 617-661 7-54 (60)
108 smart00834 CxxC_CXXC_SSSS Puta 49.2 6 0.00013 25.6 0.1 10 650-659 6-15 (41)
109 COG5151 SSL1 RNA polymerase II 48.4 9.2 0.0002 36.8 1.2 49 116-168 363-411 (421)
110 PRK00464 nrdR transcriptional 47.9 6.2 0.00013 34.7 0.1 12 650-661 29-40 (154)
111 COG5151 SSL1 RNA polymerase II 47.3 15 0.00032 35.5 2.4 25 705-729 387-411 (421)
112 TIGR02605 CxxC_CxxC_SSSS putat 47.1 6.7 0.00015 27.1 0.1 13 648-660 4-16 (52)
113 smart00531 TFIIE Transcription 46.2 15 0.00034 32.2 2.3 40 527-572 94-133 (147)
114 TIGR02300 FYDLN_acid conserved 46.0 10 0.00022 31.5 1.0 14 618-631 25-38 (129)
115 COG1198 PriA Primosomal protei 44.7 13 0.00028 41.9 1.9 29 585-628 456-484 (730)
116 PRK09678 DNA-binding transcrip 43.3 7.3 0.00016 29.0 -0.2 14 676-689 26-41 (72)
117 COG0068 HypF Hydrogenase matur 42.2 4 8.6E-05 44.5 -2.4 76 593-685 103-181 (750)
118 PF12907 zf-met2: Zinc-binding 42.1 17 0.00037 23.4 1.4 14 575-588 17-30 (40)
119 PF05443 ROS_MUCR: ROS/MUCR tr 41.4 12 0.00026 31.9 0.8 23 707-732 73-95 (132)
120 COG1996 RPC10 DNA-directed RNA 41.1 8.8 0.00019 26.0 -0.0 10 620-629 7-16 (49)
121 PRK00464 nrdR transcriptional 40.0 8.7 0.00019 33.8 -0.3 12 174-185 29-40 (154)
122 KOG2272|consensus 39.9 22 0.00047 33.2 2.2 15 706-720 280-294 (332)
123 PF06524 NOA36: NOA36 protein; 39.1 13 0.00028 35.0 0.6 25 648-672 208-232 (314)
124 PF13451 zf-trcl: Probable zin 38.0 19 0.00042 24.4 1.2 38 676-713 3-40 (49)
125 COG1198 PriA Primosomal protei 37.5 22 0.00049 40.1 2.4 12 675-686 473-484 (730)
126 KOG2807|consensus 36.1 65 0.0014 31.7 4.8 24 649-672 345-368 (378)
127 KOG1280|consensus 35.8 26 0.00057 34.6 2.2 38 618-655 78-115 (381)
128 COG1997 RPL43A Ribosomal prote 34.9 22 0.00048 27.3 1.2 12 676-687 52-63 (89)
129 COG1594 RPB9 DNA-directed RNA 34.4 8.3 0.00018 31.9 -1.2 32 232-263 5-36 (113)
130 PRK04023 DNA polymerase II lar 33.7 34 0.00073 39.4 2.9 11 590-600 625-635 (1121)
131 KOG4167|consensus 32.5 13 0.00028 40.5 -0.4 25 706-730 792-816 (907)
132 KOG2593|consensus 30.9 37 0.0008 35.0 2.4 14 114-127 127-140 (436)
133 PF01363 FYVE: FYVE zinc finge 30.1 25 0.00053 26.0 0.9 26 650-687 10-35 (69)
134 PF05191 ADK_lid: Adenylate ki 29.9 24 0.00052 22.2 0.6 8 652-659 4-11 (36)
135 cd00730 rubredoxin Rubredoxin; 29.6 23 0.00049 24.3 0.5 13 250-262 2-14 (50)
136 PF04959 ARS2: Arsenite-resist 29.5 16 0.00034 34.2 -0.4 30 674-703 74-103 (214)
137 PF07754 DUF1610: Domain of un 29.0 23 0.00049 20.0 0.3 10 648-657 15-24 (24)
138 PF05443 ROS_MUCR: ROS/MUCR tr 29.0 28 0.00061 29.7 1.1 23 650-675 73-95 (132)
139 PRK04023 DNA polymerase II lar 28.8 35 0.00075 39.4 2.0 9 563-571 627-635 (1121)
140 PF08790 zf-LYAR: LYAR-type C2 27.9 20 0.00043 21.0 0.0 19 620-639 1-19 (28)
141 PF00301 Rubredoxin: Rubredoxi 27.7 22 0.00049 23.9 0.2 12 533-544 2-13 (47)
142 PF14353 CpXC: CpXC protein 27.7 15 0.00032 31.4 -0.8 20 532-551 38-57 (128)
143 KOG2807|consensus 27.3 71 0.0015 31.5 3.5 35 590-638 275-309 (378)
144 KOG4167|consensus 27.2 17 0.00036 39.8 -0.7 25 145-169 792-816 (907)
145 PF13878 zf-C2H2_3: zinc-finge 27.1 51 0.0011 21.5 1.8 23 707-729 14-38 (41)
146 TIGR01384 TFS_arch transcripti 26.9 32 0.00068 28.0 1.0 13 458-470 16-28 (104)
147 smart00440 ZnF_C2C2 C2C2 Zinc 26.4 19 0.00041 23.3 -0.3 8 678-685 29-36 (40)
148 COG3357 Predicted transcriptio 26.1 43 0.00093 25.9 1.5 28 648-685 57-84 (97)
149 PF08274 PhnA_Zn_Ribbon: PhnA 26.1 27 0.00058 20.9 0.3 10 679-688 4-13 (30)
150 COG4530 Uncharacterized protei 25.4 34 0.00074 27.3 0.9 12 86-97 25-36 (129)
151 PF09723 Zn-ribbon_8: Zinc rib 24.9 43 0.00093 21.9 1.2 8 287-294 27-34 (42)
152 KOG1842|consensus 24.7 35 0.00075 35.1 1.0 27 619-645 15-41 (505)
153 KOG4124|consensus 24.0 17 0.00038 35.7 -1.1 23 648-670 348-372 (442)
154 COG4957 Predicted transcriptio 23.7 42 0.0009 28.2 1.1 23 707-732 77-99 (148)
155 PF05290 Baculo_IE-1: Baculovi 23.4 57 0.0012 27.5 1.9 58 339-415 76-135 (140)
156 KOG2071|consensus 23.1 63 0.0014 34.9 2.6 42 340-381 415-523 (579)
157 PF04959 ARS2: Arsenite-resist 22.6 38 0.00082 31.7 0.8 29 648-676 76-104 (214)
158 COG4888 Uncharacterized Zn rib 22.4 29 0.00062 27.5 -0.0 14 648-661 21-34 (104)
159 PTZ00255 60S ribosomal protein 22.4 36 0.00078 26.6 0.5 12 676-687 53-64 (90)
160 KOG3408|consensus 22.3 59 0.0013 26.8 1.7 26 703-728 54-79 (129)
161 PF06524 NOA36: NOA36 protein; 21.8 52 0.0011 31.1 1.5 23 428-450 208-230 (314)
162 cd00065 FYVE FYVE domain; Zinc 21.7 58 0.0012 22.8 1.4 26 651-688 4-29 (57)
163 KOG2593|consensus 21.5 55 0.0012 33.7 1.7 42 524-570 120-161 (436)
164 KOG4377|consensus 21.3 45 0.00097 33.9 1.0 19 655-673 409-427 (480)
165 KOG1280|consensus 21.1 80 0.0017 31.5 2.6 40 530-569 77-116 (381)
166 PF08209 Sgf11: Sgf11 (transcr 20.5 44 0.00095 20.6 0.5 24 7-31 4-27 (33)
No 1
>KOG1074|consensus
Probab=99.96 E-value=1.1e-30 Score=267.86 Aligned_cols=234 Identities=22% Similarity=0.468 Sum_probs=159.6
Q ss_pred ccccccccccccChHHHHhHHhhhccCCCccccCcchhhhcCcHhHHHHhhhcCCCCcccccccCccccccccccccchh
Q psy366 429 PYKCDICEQHFKFNYDIQMHKRCVHSNIRPYQCTLCSASFKRSSHLKQHGKTHIKPEHDLKTKMNLFTCHQCQSTEEAGC 508 (734)
Q Consensus 429 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~c~~c~~~~~~~~ 508 (734)
|.+|-+|-++..-+..|+.|++ .|+|++||+|.+||+.|.++.+|+.|+-.|-.. +
T Consensus 605 PNqCiiC~rVlSC~saLqmHyr-tHtGERPFkCKiCgRAFtTkGNLkaH~~vHka~---------p-------------- 660 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYR-THTGERPFKCKICGRAFTTKGNLKAHMSVHKAK---------P-------------- 660 (958)
T ss_pred ccceeeeeecccchhhhhhhhh-cccCcCccccccccchhccccchhhcccccccC---------c--------------
Confidence 4679999999999999999999 999999999999999999999999998887311 0
Q ss_pred hhhhhhhhhHhhhhcccccCCCCccccc---cccccccchhHHHHHhhhhCC-----------CCCCcccccccccccCC
Q psy366 509 KCNMMLMKCQLLHSHLNTQDNKINYSCE---QCKVQFSCKSDMRKHAKTHLP-----------AIGRSYTCDQCGKQLSY 574 (734)
Q Consensus 509 ~~~~~~~~~~~l~~h~~~~~~~~~~~C~---~C~~~f~~~~~l~~H~~~h~~-----------~~~~~~~C~~C~~~f~~ 574 (734)
.-...+.|+ +|-+.|...-.|..|+++|.. .+.-.-+|..|.+.|..
T Consensus 661 -------------------~~R~q~ScP~~~ic~~kftn~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~ 721 (958)
T KOG1074|consen 661 -------------------PARVQFSCPSTFICQKKFTNAVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSD 721 (958)
T ss_pred -------------------cccccccCCchhhhcccccccccccceEEeecCCCCCCCcccccccchhcccchhhhcccc
Confidence 111346777 777778877788888777752 11223568888888877
Q ss_pred hHHHHhHhhhhc----------------CCCC----cccccchhHhhcCHHHHHhHhh----------------------
Q psy366 575 AKTLANHIKGVH----------------LKIK----KHSCENCAKSFFSLAELRQHSP---------------------- 612 (734)
Q Consensus 575 ~~~L~~H~~~~H----------------~~~~----~~~C~~C~~~f~~~~~l~~H~~---------------------- 612 (734)
...+..++. .| +++. +..+..|+..+.....+..+-.
T Consensus 722 a~~f~~~~s-e~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~ 800 (958)
T KOG1074|consen 722 ARSFSQQIS-EQPSPESEPDEQMDERTETEELDVTPPPPENSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGE 800 (958)
T ss_pred cccchhhhh-ccCCcccCCcccccccccccccccCCCccccccccccCcccccccccchhhhhcChhhhcCccccchhhh
Confidence 777777765 23 2222 5678888888776555444321
Q ss_pred -hcCCCCcc-ccccccccccChhHHh----hhhhh--------hcc----------------------------cccccc
Q psy366 613 -VHSEQKSF-VCELCGASFKQRTCLW----SHKKW--------NHE----------------------------ELIYKF 650 (734)
Q Consensus 613 -~h~~~~~~-~C~~C~~~f~~~~~l~----~H~~~--------h~~----------------------------~~~~~~ 650 (734)
.++++++. .+..++..-....... .-... -.. ......
T Consensus 801 ~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h 880 (958)
T KOG1074|consen 801 EDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAH 880 (958)
T ss_pred hcccCCCCcccccCCCcCCccccccCchhhcccccccccccccccccccCCCcchhhhhhccccccCCCCCCCCcccchh
Confidence 23455666 6666665443322111 00000 000 000126
Q ss_pred ccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcchHHHhhhhhccCCCCCc
Q psy366 651 ECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDMLRHKQDVHSTTRPY 707 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~ 707 (734)
.|.+||+.|.+.++|++|||+|+|++||.|.+|++.|+.+.+|+.||. .|.+..|+
T Consensus 881 ~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnLKvHMg-tH~w~q~~ 936 (958)
T KOG1074|consen 881 VCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNLKVHMG-THMWVQPP 936 (958)
T ss_pred hhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhhhhhhc-cccccCCC
Confidence 788888888888888888888888888888888888888888888884 67666554
No 2
>KOG1074|consensus
Probab=99.96 E-value=4.7e-30 Score=263.23 Aligned_cols=199 Identities=24% Similarity=0.458 Sum_probs=161.8
Q ss_pred CccccccccccccchhHHHHHhhhhCCCCCCcccccccccccCChHHHHhHhhhhcCCC----Cccccc---chhHhhcC
Q psy366 531 INYSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQLSYAKTLANHIKGVHLKI----KKHSCE---NCAKSFFS 603 (734)
Q Consensus 531 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~----~~~~C~---~C~~~f~~ 603 (734)
.+-.|-+|-+..+.++.|+.|+++| ++++||+|.+||+.|+++.+|+.||. +|... -.|.|+ +|.+.|..
T Consensus 604 dPNqCiiC~rVlSC~saLqmHyrtH--tGERPFkCKiCgRAFtTkGNLkaH~~-vHka~p~~R~q~ScP~~~ic~~kftn 680 (958)
T KOG1074|consen 604 DPNQCIICLRVLSCPSALQMHYRTH--TGERPFKCKICGRAFTTKGNLKAHMS-VHKAKPPARVQFSCPSTFICQKKFTN 680 (958)
T ss_pred Cccceeeeeecccchhhhhhhhhcc--cCcCccccccccchhccccchhhccc-ccccCccccccccCCchhhhcccccc
Confidence 3568999999999999999999999 88999999999999999999999998 77554 357899 99999999
Q ss_pred HHHHHhHhhhcCC-CC------------ccccccccccccChhHHhhhhhhhccc------------------ccccccc
Q psy366 604 LAELRQHSPVHSE-QK------------SFVCELCGASFKQRTCLWSHKKWNHEE------------------LIYKFEC 652 (734)
Q Consensus 604 ~~~l~~H~~~h~~-~~------------~~~C~~C~~~f~~~~~l~~H~~~h~~~------------------~~~~~~C 652 (734)
.-.|.+|+++|.+ .. .=+|..|.+.|.....+..++..+-+. ..-+..+
T Consensus 681 ~V~lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e 760 (958)
T KOG1074|consen 681 AVTLPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPE 760 (958)
T ss_pred cccccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCccc
Confidence 9999999999973 22 246999999998888888888766211 0014567
Q ss_pred ccccccccCchhHHHHhhhc-----------------------cCC----------------------------------
Q psy366 653 TLCGKKFVKNYSLYEHMKRH-----------------------SDV---------------------------------- 675 (734)
Q Consensus 653 ~~C~~~f~~~~~L~~H~~~H-----------------------~~~---------------------------------- 675 (734)
..|+..+.....+..+--++ +++
T Consensus 761 ~~~~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg 840 (958)
T KOG1074|consen 761 NSCGRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEG 840 (958)
T ss_pred cccccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhcccccccccc
Confidence 77887776655544432211 110
Q ss_pred -------------------------------------ccccCCCcCCcCCcchHHHhhhhhccCCCCCcccCcchhhcCC
Q psy366 676 -------------------------------------RPYKCDICGHGFKLNYDMLRHKQDVHSTTRPYLCTICSATFKT 718 (734)
Q Consensus 676 -------------------------------------~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~ 718 (734)
....|.+||+.|.+.+.|..|+ ++|+|++||.|.+|++.|..
T Consensus 841 ~~t~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~-rTHtg~KPF~C~fC~~aFtt 919 (958)
T KOG1074|consen 841 LATKTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHM-RTHTGPKPFFCHFCEEAFTT 919 (958)
T ss_pred cccccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhh-hcCCCCCCccchhhhhhhhh
Confidence 1167999999999999999999 78999999999999999999
Q ss_pred hHHHHHHHhhccCCC
Q psy366 719 ARALKQHGRTHIKSR 733 (734)
Q Consensus 719 ~~~l~~H~~~H~~~~ 733 (734)
+.+|+.||.+|+..-
T Consensus 920 rgnLKvHMgtH~w~q 934 (958)
T KOG1074|consen 920 RGNLKVHMGTHMWVQ 934 (958)
T ss_pred hhhhhhhhccccccC
Confidence 999999999998653
No 3
>KOG3608|consensus
Probab=99.95 E-value=3.6e-27 Score=217.69 Aligned_cols=233 Identities=23% Similarity=0.490 Sum_probs=196.4
Q ss_pred cccCc--chhhhcCcHhHHHHhhhcCCCCcccccccCccccccccccccchhhhhhhhhhhHhhhhcccccCCC-Ccccc
Q psy366 459 YQCTL--CSASFKRSSHLKQHGKTHIKPEHDLKTKMNLFTCHQCQSTEEAGCKCNMMLMKCQLLHSHLNTQDNK-INYSC 535 (734)
Q Consensus 459 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~c~~c~~~~~~~~~~~~~~~~~~~l~~h~~~~~~~-~~~~C 535 (734)
|.|.. |+..|.+...|..|+..|..-- ..+ -..+..++ ..+.|
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~c---------------------------------eyd-~~~~~~D~~pv~~C 180 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFC---------------------------------EYD-IQKTPEDERPVTMC 180 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhh---------------------------------hhh-hhhCCCCCCceeec
Confidence 66754 9999999999999988875210 000 00111122 23444
Q ss_pred --ccccccccchhHHHHHhhhhCCCCCCcccccccccccCChHHHHhHhhhhc-CCCCcccccchhHhhcCHHHHHhHhh
Q psy366 536 --EQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQLSYAKTLANHIKGVH-LKIKKHSCENCAKSFFSLAELRQHSP 612 (734)
Q Consensus 536 --~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H-~~~~~~~C~~C~~~f~~~~~l~~H~~ 612 (734)
..|.+.+.++..|+.|++.| ++++..-|+.||..|+++..|..|++..- ....+|.|..|.|.|.+...|..|++
T Consensus 181 ~W~~Ct~~~~~k~~LreH~r~H--s~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~ 258 (467)
T KOG3608|consen 181 NWAMCTKHMGNKYRLREHIRTH--SNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVV 258 (467)
T ss_pred cchhhhhhhccHHHHHHHHHhc--CCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHH
Confidence 56999999999999999999 78899999999999999999999987533 34568999999999999999999999
Q ss_pred hcCCCCccccccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhhhccCCccccCCC--cCCcCCcc
Q psy366 613 VHSEQKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDI--CGHGFKLN 690 (734)
Q Consensus 613 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~--C~~~f~~~ 690 (734)
.|- .-|+|+.|+.+....++|..|++..|.+ .+||+|..|++.|.+.+.|++|+.+|+ +..|+|+. |..+|.+.
T Consensus 259 rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~-dkpfKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~s~r~~ 334 (467)
T KOG3608|consen 259 RHV--NCYKCPLCDMTCSSASSLTTHIRYRHSK-DKPFKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHYSVRTY 334 (467)
T ss_pred Hhh--hcccccccccCCCChHHHHHHHHhhhcc-CCCccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcHHHHHH
Confidence 986 4699999999999999999999998887 499999999999999999999999998 77899998 99999999
Q ss_pred hHHHhhhhhccCCCC--CcccCcchhhcCChHHHHHHHhhccC
Q psy366 691 YDMLRHKQDVHSTTR--PYLCTICSATFKTARALKQHGRTHIK 731 (734)
Q Consensus 691 ~~L~~H~~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~H~~ 731 (734)
..|.+|++.+|.|.. +|.|..|++.|++..+|..|++.-++
T Consensus 335 ~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~ 377 (467)
T KOG3608|consen 335 TQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHG 377 (467)
T ss_pred HHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhc
Confidence 999999999996654 59999999999999999999765544
No 4
>KOG2462|consensus
Probab=99.94 E-value=5.5e-28 Score=218.61 Aligned_cols=135 Identities=27% Similarity=0.635 Sum_probs=115.2
Q ss_pred CCcccccchhHhhcCHHHHHhHhhhcCC---CCccccccccccccChhHHhhhhhhhccccccccccccccccccCchhH
Q psy366 589 IKKHSCENCAKSFFSLAELRQHSPVHSE---QKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSL 665 (734)
Q Consensus 589 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~---~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L 665 (734)
...|+|+.||+.+.+.++|.+|..+|-. .+.+.|++||+.|.+..+|..|+++|. -+++|.+|||.|....-|
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~----l~c~C~iCGKaFSRPWLL 203 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT----LPCECGICGKAFSRPWLL 203 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC----CCcccccccccccchHHh
Confidence 3457788888888888888888777743 567889999999999999999999887 358899999999999999
Q ss_pred HHHhhhccCCccccCCCcCCcCCcchHHHhhhhhccCCCCCcccCcchhhcCChHHHHHHHhh
Q psy366 666 YEHMKRHSDVRPYKCDICGHGFKLNYDMLRHKQDVHSTTRPYLCTICSATFKTARALKQHGRT 728 (734)
Q Consensus 666 ~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 728 (734)
+-|+|+|||||||.|..|+++|..+++|+.|+ .+|++.++|.|+.|+|+|..++.|..|...
T Consensus 204 QGHiRTHTGEKPF~C~hC~kAFADRSNLRAHm-QTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 204 QGHIRTHTGEKPFSCPHCGKAFADRSNLRAHM-QTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred hcccccccCCCCccCCcccchhcchHHHHHHH-HhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 99999999999999999999999999999998 578999999999999999999999998653
No 5
>KOG3608|consensus
Probab=99.94 E-value=2.4e-26 Score=212.34 Aligned_cols=194 Identities=28% Similarity=0.563 Sum_probs=171.3
Q ss_pred ccc--cccccccCChhhHHHHHhhhcCCCcccCcccccccCChHHHHHHHHhhcCCCCccccCcccccccCCchHHHhHh
Q psy366 344 HSC--DVCDKSFVNRAGLRLHKLIHSEERGFVCDLCGASFKQRPALWAHKKLHDAKLEYKFKCILCDRKFHRNSKLNSHM 421 (734)
Q Consensus 344 ~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~l~~H~ 421 (734)
+.| ..|.+.|.++..|+.|++.|++++...|+.||..|.++..|..|++........+|+|..|.|.|.+...|..|+
T Consensus 178 ~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv 257 (467)
T KOG3608|consen 178 TMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV 257 (467)
T ss_pred eeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH
Confidence 445 459999999999999999999999999999999999999999999888777778999999999999999999999
Q ss_pred hhcCCCCccccccccccccChHHHHhHHhhhccCCCccccCcchhhhcCcHhHHHHhhhcCCCCcccccccCcccccccc
Q psy366 422 RTHSDVRPYKCDICEQHFKFNYDIQMHKRCVHSNIRPYQCTLCSASFKRSSHLKQHGKTHIKPEHDLKTKMNLFTCHQCQ 501 (734)
Q Consensus 422 ~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~c~~c~ 501 (734)
..|. .-|+|+.|+......++|..|++..|+..+||+|+.|++.|.+.+.|.+|+.+|.
T Consensus 258 ~rHv--n~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~HS------------------- 316 (467)
T KOG3608|consen 258 VRHV--NCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQVHS------------------- 316 (467)
T ss_pred HHhh--hcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHhcc-------------------
Confidence 9986 4699999999999999999999999999999999999999999999999998874
Q ss_pred ccccchhhhhhhhhhhHhhhhcccccCCCCcccccc--ccccccchhHHHHHhhhhC-CCCCCcccccccccccCChHHH
Q psy366 502 STEEAGCKCNMMLMKCQLLHSHLNTQDNKINYSCEQ--CKVQFSCKSDMRKHAKTHL-PAIGRSYTCDQCGKQLSYAKTL 578 (734)
Q Consensus 502 ~~~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~--C~~~f~~~~~l~~H~~~h~-~~~~~~~~C~~C~~~f~~~~~L 578 (734)
+..|.|+. |.++|.+...|.+|++.++ +..+.+|.|-.|++.|++-.+|
T Consensus 317 ----------------------------~~~y~C~h~~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L 368 (467)
T KOG3608|consen 317 ----------------------------KTVYQCEHPDCHYSVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSL 368 (467)
T ss_pred ----------------------------ccceecCCCCCcHHHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhH
Confidence 23478877 8888888888999977554 5556678888888888888888
Q ss_pred HhHhhhhc
Q psy366 579 ANHIKGVH 586 (734)
Q Consensus 579 ~~H~~~~H 586 (734)
..|+...|
T Consensus 369 ~~HL~kkH 376 (467)
T KOG3608|consen 369 SAHLMKKH 376 (467)
T ss_pred HHHHHHhh
Confidence 88887666
No 6
>KOG2462|consensus
Probab=99.94 E-value=1.6e-27 Score=215.67 Aligned_cols=135 Identities=30% Similarity=0.597 Sum_probs=103.2
Q ss_pred CccccccccccccchhHHHHHhhhhCCCC-CCcccccccccccCChHHHHhHhhhhcCCCCcccccchhHhhcCHHHHHh
Q psy366 531 INYSCEQCKVQFSCKSDMRKHAKTHLPAI-GRSYTCDQCGKQLSYAKTLANHIKGVHLKIKKHSCENCAKSFFSLAELRQ 609 (734)
Q Consensus 531 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~ 609 (734)
..|.|+.||+.+.+.++|.+|.++|.+.. .+.+.|++|++.|.+...|..|++ .|+ -+++|.+|||+|....-|+-
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHir-TH~--l~c~C~iCGKaFSRPWLLQG 205 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIR-THT--LPCECGICGKAFSRPWLLQG 205 (279)
T ss_pred CceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhh-ccC--CCcccccccccccchHHhhc
Confidence 45889999999999999999998886544 345778888888888888888877 555 56778888888888888888
Q ss_pred HhhhcCCCCccccccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhh
Q psy366 610 HSPVHSEQKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMK 670 (734)
Q Consensus 610 H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~ 670 (734)
|+|+|+|||||.|+.|+++|.++++|+.||++|.+. ++|+|+.|+|+|..++.|.+|..
T Consensus 206 HiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~--K~~qC~~C~KsFsl~SyLnKH~E 264 (279)
T KOG2462|consen 206 HIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDV--KKHQCPRCGKSFALKSYLNKHSE 264 (279)
T ss_pred ccccccCCCCccCCcccchhcchHHHHHHHHhhcCC--ccccCcchhhHHHHHHHHHHhhh
Confidence 888888887777777777777777777777777666 66777777777777777777654
No 7
>KOG3623|consensus
Probab=99.88 E-value=2.1e-23 Score=210.45 Aligned_cols=114 Identities=22% Similarity=0.487 Sum_probs=94.9
Q ss_pred cccccccccccChHHHHhHHhhhccC-CCccccCcchhhhcCcHhHHHHhhhcCCCCcc-----cccccCcccccccccc
Q psy366 430 YKCDICEQHFKFNYDIQMHKRCVHSN-IRPYQCTLCSASFKRSSHLKQHGKTHIKPEHD-----LKTKMNLFTCHQCQST 503 (734)
Q Consensus 430 ~~C~~C~~~f~~~~~l~~H~~~~H~~-~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~-----~~~~~~~~~c~~c~~~ 503 (734)
..|++|++.+.....|+.|++..|.. +..|.|.+|.++|.++..|.+|+.+|...... .....+.|+|.+|++.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 57899999999899999998877754 44589999999999999999999988654432 2234567999999999
Q ss_pred ccchhhhhhhhhhhHhhhhcccccCCCCccccccccccccchhHHHHHhh
Q psy366 504 EEAGCKCNMMLMKCQLLHSHLNTQDNKINYSCEQCKVQFSCKSDMRKHAK 553 (734)
Q Consensus 504 ~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~ 553 (734)
|. ....|..|+++|.|++||.|+-|++.|+....+..||.
T Consensus 291 FK----------fKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FK----------FKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hh----------hHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 88 88899999999999999999999999998888888864
No 8
>KOG3623|consensus
Probab=99.87 E-value=1.1e-22 Score=205.24 Aligned_cols=107 Identities=30% Similarity=0.615 Sum_probs=94.1
Q ss_pred ccccccccccCChhhHHHHHhhhc--CCCcccCcccccccCChHHHHHHHHhhcCCC-----------CccccCcccccc
Q psy366 344 HSCDVCDKSFVNRAGLRLHKLIHS--EERGFVCDLCGASFKQRPALWAHKKLHDAKL-----------EYKFKCILCDRK 410 (734)
Q Consensus 344 ~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-----------~~~~~C~~C~~~ 410 (734)
..|++|++.+.....|+.|++.-+ .+..|.|.+|.++|..+..|.+|+.+|.... .++|+|..|+|.
T Consensus 211 ltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECgKA 290 (1007)
T KOG3623|consen 211 LTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECGKA 290 (1007)
T ss_pred hcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccchh
Confidence 689999999999999999987643 4566999999999999999999999886432 357999999999
Q ss_pred cCCchHHHhHhhhcCCCCccccccccccccChHHHHhHHh
Q psy366 411 FHRNSKLNSHMRTHSDVRPYKCDICEQHFKFNYDIQMHKR 450 (734)
Q Consensus 411 f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 450 (734)
|..+-.|+.|+|+|.|++||.|+.|++.|........|+.
T Consensus 291 FKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmS 330 (1007)
T KOG3623|consen 291 FKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMS 330 (1007)
T ss_pred hhhHHHHHhhheeecCCCCcCCcccccccccCCccccccc
Confidence 9999999999999999999999999999999988888875
No 9
>KOG3576|consensus
Probab=99.72 E-value=1.1e-18 Score=149.05 Aligned_cols=114 Identities=32% Similarity=0.652 Sum_probs=95.3
Q ss_pred CCccccccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcchHHHhh
Q psy366 617 QKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDMLRH 696 (734)
Q Consensus 617 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H 696 (734)
...|.|.+|++.|.....|.+|++-|..- +.+.|..||+.|.....|++|+|+|+|.+||+|..|+++|+++-+|..|
T Consensus 115 ~d~ftCrvCgK~F~lQRmlnrh~kch~~v--kr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 115 QDSFTCRVCGKKFGLQRMLNRHLKCHSDV--KRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCeeeeehhhhhhhHHHHHHHHhhhccHH--HHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 45577888888888888888888777655 7788888888888888999999999999999999999999999999999
Q ss_pred hhhccCCC----------CCcccCcchhhcCChHHHHHHHhhccCC
Q psy366 697 KQDVHSTT----------RPYLCTICSATFKTARALKQHGRTHIKS 732 (734)
Q Consensus 697 ~~~~H~~~----------~~~~C~~C~~~f~~~~~l~~H~~~H~~~ 732 (734)
.+++|+.. +.|.|..||.+-.....+..|++.|+..
T Consensus 193 l~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~ 238 (267)
T KOG3576|consen 193 LKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPF 238 (267)
T ss_pred HHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCC
Confidence 98888743 4588999999988889999998888743
No 10
>KOG3576|consensus
Probab=99.69 E-value=6e-18 Score=144.55 Aligned_cols=86 Identities=35% Similarity=0.684 Sum_probs=81.9
Q ss_pred CCCcccccccccccCChHHHHhHhhhhcCCCCcccccchhHhhcCHHHHHhHhhhcCCCCccccccccccccChhHHhhh
Q psy366 559 IGRSYTCDQCGKQLSYAKTLANHIKGVHLKIKKHSCENCAKSFFSLAELRQHSPVHSEQKSFVCELCGASFKQRTCLWSH 638 (734)
Q Consensus 559 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 638 (734)
.+..|.|.+|++.|.....|.+|++ .|...++|.|..|||.|.....|++|+|+|+|.+||+|..|+++|+++-+|..|
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~k-ch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcslesh 192 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLK-CHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESH 192 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhh-hccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHH
Confidence 4567999999999999999999999 899999999999999999999999999999999999999999999999999999
Q ss_pred hhhhccc
Q psy366 639 KKWNHEE 645 (734)
Q Consensus 639 ~~~h~~~ 645 (734)
++..|+.
T Consensus 193 l~kvhgv 199 (267)
T KOG3576|consen 193 LKKVHGV 199 (267)
T ss_pred HHHHcCc
Confidence 9988875
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.18 E-value=2.3e-11 Score=126.37 Aligned_cols=97 Identities=23% Similarity=0.494 Sum_probs=63.5
Q ss_pred cccccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhhhccCCccccCCCcCCcCC----------c
Q psy366 620 FVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFK----------L 689 (734)
Q Consensus 620 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~----------~ 689 (734)
+.|+.|++.|. ...|..|++.+| .++.|+ ||+.+ ....|..|+++|.+++|+.|..|++.|. .
T Consensus 454 ~~C~~Cgk~f~-~s~LekH~~~~H----kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~ 526 (567)
T PLN03086 454 VHCEKCGQAFQ-QGEMEKHMKVFH----EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDR 526 (567)
T ss_pred ccCCCCCCccc-hHHHHHHHHhcC----CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhh
Confidence 46777777764 466777777664 456777 77544 4567777777777777777777777773 1
Q ss_pred chHHHhhhhhccCCCCCcccCcchhhcCChHHHHHHH
Q psy366 690 NYDMLRHKQDVHSTTRPYLCTICSATFKTARALKQHG 726 (734)
Q Consensus 690 ~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 726 (734)
.+.|..|.. . .|.+++.|..||+.|+.+. +..|+
T Consensus 527 ~s~Lt~HE~-~-CG~rt~~C~~Cgk~Vrlrd-m~~H~ 560 (567)
T PLN03086 527 LRGMSEHES-I-CGSRTAPCDSCGRSVMLKE-MDIHQ 560 (567)
T ss_pred hhhHHHHHH-h-cCCcceEccccCCeeeehh-HHHHH
Confidence 346777763 2 3667777777777766553 56664
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.16 E-value=3.1e-11 Score=125.44 Aligned_cols=146 Identities=24% Similarity=0.499 Sum_probs=114.0
Q ss_pred cccccccccccchhHHHHHhhhhCCCCCCcccccc--cccccCChHHHHhHhhhhcCCCCcccccchhHhhcCHHHHHhH
Q psy366 533 YSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQ--CGKQLSYAKTLANHIKGVHLKIKKHSCENCAKSFFSLAELRQH 610 (734)
Q Consensus 533 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H 610 (734)
..|+-|...... ..|..|...-. -..-.|+. |+..|. +..|..| +.|+.|++.|. ...|..|
T Consensus 408 V~C~NC~~~i~l-~~l~lHe~~C~---r~~V~Cp~~~Cg~v~~-r~el~~H----------~~C~~Cgk~f~-~s~LekH 471 (567)
T PLN03086 408 VECRNCKHYIPS-RSIALHEAYCS---RHNVVCPHDGCGIVLR-VEEAKNH----------VHCEKCGQAFQ-QGEMEKH 471 (567)
T ss_pred EECCCCCCccch-hHHHHHHhhCC---CcceeCCcccccceee-ccccccC----------ccCCCCCCccc-hHHHHHH
Confidence 468888877654 44557764431 12346874 999883 4444555 47999999996 6889999
Q ss_pred hhhcCCCCccccccccccccChhHHhhhhhhhcccccccccccccccccc----------CchhHHHHhhhccCCccccC
Q psy366 611 SPVHSEQKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFV----------KNYSLYEHMKRHSDVRPYKC 680 (734)
Q Consensus 611 ~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~----------~~~~L~~H~~~H~~~~~~~C 680 (734)
+++++ +|+.|+ ||+.+ .+..|..|+.+|.+. +++.|++|++.|. ..+.|..|+.+. |.+++.|
T Consensus 472 ~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~--Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C 544 (567)
T PLN03086 472 MKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPL--RLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPC 544 (567)
T ss_pred HHhcC--CCccCC-CCCCc-chhHHHhhhhccCCC--CceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEc
Confidence 99986 899999 99755 678999999998888 8999999999995 235899999995 8999999
Q ss_pred CCcCCcCCcchHHHhhhhhccC
Q psy366 681 DICGHGFKLNYDMLRHKQDVHS 702 (734)
Q Consensus 681 ~~C~~~f~~~~~L~~H~~~~H~ 702 (734)
..||+.+. ...|..|+..+|.
T Consensus 545 ~~Cgk~Vr-lrdm~~H~~~~h~ 565 (567)
T PLN03086 545 DSCGRSVM-LKEMDIHQIAVHQ 565 (567)
T ss_pred cccCCeee-ehhHHHHHHHhhc
Confidence 99999995 6778999877775
No 13
>PHA00733 hypothetical protein
Probab=99.04 E-value=1.7e-10 Score=97.17 Aligned_cols=52 Identities=21% Similarity=0.534 Sum_probs=23.8
Q ss_pred CccccCCCcCCcCCcchHHHhhhhhccCCCCCcccCcchhhcCChHHHHHHHhhc
Q psy366 675 VRPYKCDICGHGFKLNYDMLRHKQDVHSTTRPYLCTICSATFKTARALKQHGRTH 729 (734)
Q Consensus 675 ~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~H 729 (734)
++||.|+.|++.|.+...|..|++ .| +.+|.|+.|++.|.....|..|++..
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r-~h--~~~~~C~~CgK~F~~~~sL~~H~~~~ 122 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIR-YT--EHSKVCPVCGKEFRNTDSTLDHVCKK 122 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHh-cC--CcCccCCCCCCccCCHHHHHHHHHHh
Confidence 444444444444444444444442 22 12344445555555444444444443
No 14
>KOG1146|consensus
Probab=99.03 E-value=6.8e-11 Score=130.15 Aligned_cols=77 Identities=21% Similarity=0.234 Sum_probs=55.6
Q ss_pred CccccccccCChHHHHHHHHHHhcCCCccccccccccccChhhHHHHHHHhcC-------------------------CC
Q psy366 60 MCDQCGKHFTSSQSLARHIKCVHLKIKHHWCDICGKSFFAASELKQHSFIHSE-------------------------QK 114 (734)
Q Consensus 60 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-------------------------~~ 114 (734)
.|.-|+..+.+...+..|+...+...+.|+|+.|++.|.....|..|||.-+- .+
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~ 517 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGK 517 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCC
Confidence 45667777888888888887777777889999999999999999999986221 13
Q ss_pred CccccchhhhccCcchHHHhhh
Q psy366 115 GFVCDLCGASFKQRSCLWSHKK 136 (734)
Q Consensus 115 ~~~C~~C~~~f~~~~~l~~H~~ 136 (734)
+|.|..|..++..+.+|.+|++
T Consensus 518 p~~C~~C~~stttng~Lsihlq 539 (1406)
T KOG1146|consen 518 PYPCRACNYSTTTNGNLSIHLQ 539 (1406)
T ss_pred cccceeeeeeeecchHHHHHHH
Confidence 4555556555555555555554
No 15
>PHA00733 hypothetical protein
Probab=99.01 E-value=3.7e-10 Score=95.19 Aligned_cols=97 Identities=22% Similarity=0.350 Sum_probs=80.2
Q ss_pred cCHHHHHhHhhhcCCCCccccccccccccChhHHhhh--hhh---hccccccccccccccccccCchhHHHHhhhccCCc
Q psy366 602 FSLAELRQHSPVHSEQKSFVCELCGASFKQRTCLWSH--KKW---NHEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVR 676 (734)
Q Consensus 602 ~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H--~~~---h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 676 (734)
.+...|..+-..-..++++.|.+|...|.....|..+ ++. ++++ +||.|+.||+.|.+.+.|..|++.| +.
T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~--kPy~C~~Cgk~Fss~s~L~~H~r~h--~~ 98 (128)
T PHA00733 23 VTLEELKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAV--SPYVCPLCLMPFSSSVSLKQHIRYT--EH 98 (128)
T ss_pred cCHHHhhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCC--CCccCCCCCCcCCCHHHHHHHHhcC--Cc
Confidence 3566666666655668899999999999998887776 222 2333 7999999999999999999999987 46
Q ss_pred cccCCCcCCcCCcchHHHhhhhhccC
Q psy366 677 PYKCDICGHGFKLNYDMLRHKQDVHS 702 (734)
Q Consensus 677 ~~~C~~C~~~f~~~~~L~~H~~~~H~ 702 (734)
+|.|++|++.|.....|.+|+...|+
T Consensus 99 ~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 99 SKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CccCCCCCCccCCHHHHHHHHHHhcC
Confidence 89999999999999999999987775
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.90 E-value=5.4e-10 Score=75.70 Aligned_cols=44 Identities=23% Similarity=0.558 Sum_probs=37.2
Q ss_pred ccccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcchHHH
Q psy366 649 KFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDML 694 (734)
Q Consensus 649 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~ 694 (734)
.|+|+.||+.|...++|..||++|+ +||+|..|++.|.+.+.|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4788999999999999999999987 7888999998888777664
No 17
>KOG3993|consensus
Probab=98.75 E-value=2.3e-09 Score=103.37 Aligned_cols=168 Identities=19% Similarity=0.261 Sum_probs=97.7
Q ss_pred cccccccccccchhHHHHHhhhhCCCCCCcccccccccccCChHHHHhHhhhhcC-------------------------
Q psy366 533 YSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQLSYAKTLANHIKGVHL------------------------- 587 (734)
Q Consensus 533 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~------------------------- 587 (734)
|.|..|...|.+.-.|.+|.-.. ..-..|+|++|+|+|+-..+|..|.+ .|.
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC~R--IV~vEYrCPEC~KVFsCPANLASHRR-WHKPR~eaa~a~~~P~k~~~~~rae~~e 344 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRCPR--IVHVEYRCPECDKVFSCPANLASHRR-WHKPRPEAAKAGSPPPKQAVETRAEVQE 344 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccCCe--eEEeeecCCcccccccCchhhhhhhc-ccCCchhhhhcCCCChhhhhhhhhhhhh
Confidence 55555555555555555552111 01123555555555555555555554 331
Q ss_pred --------CCCcccccchhHhhcCHHHHHhHhhhcCCCC-----------------ccccccccccccChhHHhhhhhhh
Q psy366 588 --------KIKKHSCENCAKSFFSLAELRQHSPVHSEQK-----------------SFVCELCGASFKQRTCLWSHKKWN 642 (734)
Q Consensus 588 --------~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~-----------------~~~C~~C~~~f~~~~~l~~H~~~h 642 (734)
.+.-|.|.+|+|.|.+.+.|+.|+.+|+... .+-|..|+-.+.....--.+...+
T Consensus 345 a~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~ 424 (500)
T KOG3993|consen 345 AERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYV 424 (500)
T ss_pred ccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeee
Confidence 1224778888888888888888877765311 123444444444433333333333
Q ss_pred ccccccccccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcchHHHhhhhhccCCC
Q psy366 643 HEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDMLRHKQDVHSTT 704 (734)
Q Consensus 643 ~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~~ 704 (734)
.+.. .-..|++|+..+.++..=-.+.+.-..+.-|.|.+|...|.+...|.+|+...|..+
T Consensus 425 a~sa-el~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~~ltrhin~~Hpse 485 (500)
T KOG3993|consen 425 AGSA-ELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSPGLTRHINKCHPSE 485 (500)
T ss_pred eccc-cccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCcchHhHhhhcChHH
Confidence 2221 124577888777777666666665556677888888888888888888887766543
No 18
>KOG3993|consensus
Probab=98.73 E-value=1.8e-09 Score=104.18 Aligned_cols=113 Identities=20% Similarity=0.372 Sum_probs=61.2
Q ss_pred cccccccccccChHHHHhHHhhhccCCCccccCcchhhhcCcHhHHHHhhhcCCCCcccccccCccccccccccccchhh
Q psy366 430 YKCDICEQHFKFNYDIQMHKRCVHSNIRPYQCTLCSASFKRSSHLKQHGKTHIKPEHDLKTKMNLFTCHQCQSTEEAGCK 509 (734)
Q Consensus 430 ~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~~~~~~~c~~c~~~~~~~~~ 509 (734)
|.|.+|...|.+...|.+|.- .---.-.|+|+.|++.|.-..+|..|.++|-.......++..+-+-. .+
T Consensus 268 yiCqLCK~kYeD~F~LAQHrC-~RIV~vEYrCPEC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~-----~~---- 337 (500)
T KOG3993|consen 268 YICQLCKEKYEDAFALAQHRC-PRIVHVEYRCPECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQA-----VE---- 337 (500)
T ss_pred HHHHHHHHhhhhHHHHhhccC-CeeEEeeecCCcccccccCchhhhhhhcccCCchhhhhcCCCChhhh-----hh----
Confidence 666666666666666666642 11111236777777777777777777777743322221111111000 00
Q ss_pred hhhhhhhhHhhhhc--ccccCCCCccccccccccccchhHHHHHhhhhCCC
Q psy366 510 CNMMLMKCQLLHSH--LNTQDNKINYSCEQCKVQFSCKSDMRKHAKTHLPA 558 (734)
Q Consensus 510 ~~~~~~~~~~l~~h--~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~ 558 (734)
......+- -.....+..|.|.+|++.|.....|++|+.+|+..
T Consensus 338 ------~rae~~ea~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~ 382 (500)
T KOG3993|consen 338 ------TRAEVQEAERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRA 382 (500)
T ss_pred ------hhhhhhhccccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhcc
Confidence 00000000 11113334799999999999999999998888643
No 19
>PHA02768 hypothetical protein; Provisional
Probab=98.71 E-value=6.3e-09 Score=70.59 Aligned_cols=42 Identities=17% Similarity=0.287 Sum_probs=22.3
Q ss_pred cccccccccccChhhHHHHHHHhcCCCCccccchhhhccCcchH
Q psy366 88 HWCDICGKSFFAASELKQHSFIHSEQKGFVCDLCGASFKQRSCL 131 (734)
Q Consensus 88 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 131 (734)
|.|+.||+.|...+.|..|+++|+ ++|+|..|++.|...+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 455555555555555555555554 355555555555544433
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.63 E-value=1.1e-08 Score=59.27 Aligned_cols=25 Identities=44% Similarity=1.060 Sum_probs=18.1
Q ss_pred hHHHHhhhccCCccccCCCcCCcCC
Q psy366 664 SLYEHMKRHSDVRPYKCDICGHGFK 688 (734)
Q Consensus 664 ~L~~H~~~H~~~~~~~C~~C~~~f~ 688 (734)
+|..||++|+|++||.|++|+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEES
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeC
Confidence 4667777777777777777777774
No 21
>KOG1146|consensus
Probab=98.54 E-value=7.3e-08 Score=106.94 Aligned_cols=134 Identities=16% Similarity=0.179 Sum_probs=82.8
Q ss_pred hhhhhcccchhhccCcHHHHHHHHhhcCCcchhHhhcccccc-ccc-ccCCCCCCCCCccccccccCChHHHHHHHHHH-
Q psy366 5 ISRLICSLCSETFKSKPRLRAHIWKCQMEPETILETLGHQFE-SQL-FTGPMEKCDIMCDQCGKHFTSSQSLARHIKCV- 81 (734)
Q Consensus 5 ~~~~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~- 81 (734)
.+-|+|+.|+..|+....|-.||+..+.+........+.... ... -+.....++|.|..|..+++.+..|-+|++..
T Consensus 463 ~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~LsihlqS~~ 542 (1406)
T KOG1146|consen 463 FKTLKCPKCNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLSIHLQSDL 542 (1406)
T ss_pred cccccCCccchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHHHHHHHHh
Confidence 356899999999999999999999755554443322222111 111 11122367899999999999999999998632
Q ss_pred hcCC-----------------------------------------CccccccccccccChhhHHHHHHH-hcCCCCcccc
Q psy366 82 HLKI-----------------------------------------KHHWCDICGKSFFAASELKQHSFI-HSEQKGFVCD 119 (734)
Q Consensus 82 h~~~-----------------------------------------~~~~C~~C~~~f~~~~~l~~H~~~-h~~~~~~~C~ 119 (734)
|..+ -.+.|.+|+.--.-..+|+.||.. ++...|..+.
T Consensus 543 h~~~lee~~~~~g~~v~~~~~~v~s~~P~~ag~~~~ags~~pktkP~~~C~vc~yetniarnlrihmtss~~s~~p~~~L 622 (1406)
T KOG1146|consen 543 HRNELEEAEENAGEQVRLLPASVTSAVPEEAGLGPSAGSSGPKTKPSWRCEVCSYETNIARNLRIHMTASPSSSPPSLVL 622 (1406)
T ss_pred hHHHHHHHHhccccchhhhhhhhcccCcccccCCCCCCCCCCCCCCCcchhhhcchhhhhhccccccccCCCCCChHHHh
Confidence 2110 124566666666666666666652 3333335555
Q ss_pred chhhhccCcchHHHhhhhh
Q psy366 120 LCGASFKQRSCLWSHKKWN 138 (734)
Q Consensus 120 ~C~~~f~~~~~l~~H~~~~ 138 (734)
.++-.+.....+..+.+.+
T Consensus 623 q~~it~~l~~~~~~~~~lp 641 (1406)
T KOG1146|consen 623 QQNITSSLASLLGGQGRLP 641 (1406)
T ss_pred hhcchhhccccccCcCCCC
Confidence 5655555555555554444
No 22
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.47 E-value=6.7e-08 Score=55.92 Aligned_cols=26 Identities=38% Similarity=0.883 Sum_probs=19.8
Q ss_pred HHHhHhhhcCCCCccccccccccccC
Q psy366 606 ELRQHSPVHSEQKSFVCELCGASFKQ 631 (734)
Q Consensus 606 ~l~~H~~~h~~~~~~~C~~C~~~f~~ 631 (734)
+|..|+++|+|++||+|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 46778888888888888888887753
No 23
>PHA00616 hypothetical protein
Probab=98.35 E-value=1.4e-07 Score=60.65 Aligned_cols=34 Identities=21% Similarity=0.429 Sum_probs=24.1
Q ss_pred ccccccccccccCchhHHHHhhhccCCccccCCC
Q psy366 649 KFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDI 682 (734)
Q Consensus 649 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~ 682 (734)
||+|+.||+.|..++.|..|++.|+|++|+.|++
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 4667777777777777777777777777776653
No 24
>PHA00616 hypothetical protein
Probab=98.27 E-value=5.2e-07 Score=58.11 Aligned_cols=34 Identities=21% Similarity=0.277 Sum_probs=26.7
Q ss_pred cccccchhHhhcCHHHHHhHhhhcCCCCcccccc
Q psy366 591 KHSCENCAKSFFSLAELRQHSPVHSEQKSFVCEL 624 (734)
Q Consensus 591 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 624 (734)
||+|+.||+.|...++|..|++.|+|++|+.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5778888888888888888888888888877765
No 25
>PHA00732 hypothetical protein
Probab=98.26 E-value=5.3e-07 Score=68.26 Aligned_cols=44 Identities=30% Similarity=0.757 Sum_probs=32.6
Q ss_pred ccccccccccccCchhHHHHhhh-ccCCccccCCCcCCcCCcchHHHhhhh
Q psy366 649 KFECTLCGKKFVKNYSLYEHMKR-HSDVRPYKCDICGHGFKLNYDMLRHKQ 698 (734)
Q Consensus 649 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~f~~~~~L~~H~~ 698 (734)
||.|+.||+.|.+.++|+.|++. |+ ++.|+.||++|. .|..|.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~ 45 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFY 45 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhc
Confidence 47788888888888888888874 54 357888888886 4777763
No 26
>PHA00732 hypothetical protein
Probab=98.11 E-value=1.9e-06 Score=65.33 Aligned_cols=49 Identities=29% Similarity=0.588 Sum_probs=27.6
Q ss_pred ccccccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhhhccC
Q psy366 619 SFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMKRHSD 674 (734)
Q Consensus 619 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~ 674 (734)
||.|+.||+.|.+..+|..|++.+|. ++.|+.||+.|. .|..|++++.+
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~----~~~C~~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT----LTKCPVCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC----CCccCCCCCEeC---ChhhhhcccCC
Confidence 45566666666666666666654322 135666666665 35566655543
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.10 E-value=4e-06 Score=58.95 Aligned_cols=52 Identities=25% Similarity=0.614 Sum_probs=33.7
Q ss_pred cccCCCcCCcCCcchHHHhhhhhccCCC-CCcccCcchhhcCChHHHHHHHhhccC
Q psy366 677 PYKCDICGHGFKLNYDMLRHKQDVHSTT-RPYLCTICSATFKTARALKQHGRTHIK 731 (734)
Q Consensus 677 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~H~~ 731 (734)
.|.|++|++. .+...|..|....|..+ +.+.|++|...+. .+|..||+.+++
T Consensus 2 ~f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 2 SFTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred CcCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 3677777774 34566777776667654 3577777777544 377777776653
No 28
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.04 E-value=4.5e-06 Score=58.71 Aligned_cols=37 Identities=24% Similarity=0.559 Sum_probs=15.8
Q ss_pred cccccccccccChhHHhhhhhhhccccccccccccccc
Q psy366 620 FVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGK 657 (734)
Q Consensus 620 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~ 657 (734)
|.|++|++. .+...|..|....|....+.+.||+|..
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~ 39 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSS 39 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchh
Confidence 444444442 2234444444444443333444444444
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.78 E-value=1.1e-05 Score=45.35 Aligned_cols=23 Identities=43% Similarity=0.922 Sum_probs=18.4
Q ss_pred cccCcchhhcCChHHHHHHHhhc
Q psy366 707 YLCTICSATFKTARALKQHGRTH 729 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H 729 (734)
|.|+.|++.|.++.+|+.||++|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 67888888888888888888775
No 30
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.62 E-value=2.3e-05 Score=44.07 Aligned_cols=22 Identities=41% Similarity=1.029 Sum_probs=11.1
Q ss_pred cccccccccccCchhHHHHhhh
Q psy366 650 FECTLCGKKFVKNYSLYEHMKR 671 (734)
Q Consensus 650 ~~C~~C~~~f~~~~~L~~H~~~ 671 (734)
|.|+.|++.|.++..|..|++.
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhH
Confidence 3455555555555555555544
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.53 E-value=5.3e-05 Score=43.18 Aligned_cols=24 Identities=42% Similarity=0.910 Sum_probs=17.7
Q ss_pred cccCcchhhcCChHHHHHHHhhcc
Q psy366 707 YLCTICSATFKTARALKQHGRTHI 730 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H~ 730 (734)
|.|++|++.|.+..+|+.|+++|+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 578888888888888888888764
No 32
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.42 E-value=8.2e-05 Score=43.74 Aligned_cols=26 Identities=38% Similarity=0.856 Sum_probs=22.1
Q ss_pred CcccCcchhhcCChHHHHHHHhhccC
Q psy366 706 PYLCTICSATFKTARALKQHGRTHIK 731 (734)
Q Consensus 706 ~~~C~~C~~~f~~~~~l~~H~~~H~~ 731 (734)
||.|..|++.|.+..+|..|++.|.+
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 68888999999999999999888864
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.30 E-value=0.00013 Score=59.95 Aligned_cols=71 Identities=23% Similarity=0.415 Sum_probs=12.1
Q ss_pred cccccccccChhHHhhhhhhhccccccccccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcchHHHhhhhh
Q psy366 622 CELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDMLRHKQD 699 (734)
Q Consensus 622 C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~ 699 (734)
|..|+..|.+...|..|+...|+-.. + ....+.....|..+++.-. ...+.|.+|++.|.+...|..|++.
T Consensus 2 C~~C~~~f~~~~~l~~H~~~~H~~~~-~-----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 2 CLFCDESFSSVDDLLQHMKKKHGFDI-P-----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp ------------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHH
T ss_pred cccccccccccccccccccccccccc-c-----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcC
Confidence 55666666666666666555554310 0 1111122223333332211 1245555555555555555555543
No 34
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=97.24 E-value=0.00019 Score=47.17 Aligned_cols=41 Identities=20% Similarity=0.451 Sum_probs=24.2
Q ss_pred HhhhhhccCCCCCcccCcchhhcCChHHHHHHHhhccCCCC
Q psy366 694 LRHKQDVHSTTRPYLCTICSATFKTARALKQHGRTHIKSRT 734 (734)
Q Consensus 694 ~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~ 734 (734)
..+....+..+.|-.|++|+..+.+..+|++|+..+++.+|
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTTS-
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcccC
Confidence 44443444566788888888888888888888888887765
No 35
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.22 E-value=0.00014 Score=59.73 Aligned_cols=22 Identities=41% Similarity=0.859 Sum_probs=10.7
Q ss_pred cccccccccccCchhHHHHhhh
Q psy366 650 FECTLCGKKFVKNYSLYEHMKR 671 (734)
Q Consensus 650 ~~C~~C~~~f~~~~~L~~H~~~ 671 (734)
+.|.+|++.|.+...|..||+.
T Consensus 51 ~~C~~C~~~f~s~~~l~~Hm~~ 72 (100)
T PF12756_consen 51 FRCPYCNKTFRSREALQEHMRS 72 (100)
T ss_dssp EEBSSSS-EESSHHHHHHHHHH
T ss_pred CCCCccCCCCcCHHHHHHHHcC
Confidence 4455555555555555555543
No 36
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.12 E-value=0.00014 Score=68.08 Aligned_cols=23 Identities=35% Similarity=0.879 Sum_probs=15.2
Q ss_pred CCCcccCcchhhcCChHHHHHHH
Q psy366 704 TRPYLCTICSATFKTARALKQHG 726 (734)
Q Consensus 704 ~~~~~C~~C~~~f~~~~~l~~H~ 726 (734)
.+||+|++|+|+|+..-.|+-|.
T Consensus 396 ~KPYrCevC~KRYKNlNGLKYHr 418 (423)
T COG5189 396 DKPYRCEVCDKRYKNLNGLKYHR 418 (423)
T ss_pred CCceeccccchhhccCccceecc
Confidence 35677777777777666666664
No 37
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.12 E-value=0.00022 Score=40.53 Aligned_cols=21 Identities=38% Similarity=0.926 Sum_probs=7.9
Q ss_pred ccccccccccCchhHHHHhhh
Q psy366 651 ECTLCGKKFVKNYSLYEHMKR 671 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~ 671 (734)
.|++|++.|.+...|..|+++
T Consensus 2 ~C~~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 2 QCPICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp E-SSTS-EESSHHHHHHHHHH
T ss_pred CCcCCCCcCCcHHHHHHHHHh
Confidence 344444444444444444433
No 38
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.08 E-value=0.00013 Score=68.26 Aligned_cols=71 Identities=21% Similarity=0.406 Sum_probs=39.2
Q ss_pred CCCCCccc--cccccCChHHHHHHHHHHhcCCCccccccccccccChhhHHHHHHHhcCCCCccccchhhhccCcchHHH
Q psy366 56 KCDIMCDQ--CGKHFTSSQSLARHIKCVHLKIKHHWCDICGKSFFAASELKQHSFIHSEQKGFVCDLCGASFKQRSCLWS 133 (734)
Q Consensus 56 ~~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 133 (734)
++||+|++ |+|.|++...|+-|+..-|...+...=+ .-..|...-...+||+|++|+++|+.-..|+.
T Consensus 347 ~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p----------~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKY 416 (423)
T COG5189 347 GKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENP----------SPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKY 416 (423)
T ss_pred CceecCCCCCchhhhccccchhhhhhccccCcccCCCC----------CccccccccccCCceeccccchhhccCcccee
Confidence 45666664 6666666666666665334222111100 00001111134588888888888888888887
Q ss_pred hhh
Q psy366 134 HKK 136 (734)
Q Consensus 134 H~~ 136 (734)
|..
T Consensus 417 Hr~ 419 (423)
T COG5189 417 HRK 419 (423)
T ss_pred ccc
Confidence 754
No 39
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.07 E-value=0.00029 Score=41.40 Aligned_cols=23 Identities=48% Similarity=0.896 Sum_probs=12.3
Q ss_pred cccccccccccCchhHHHHhhhc
Q psy366 650 FECTLCGKKFVKNYSLYEHMKRH 672 (734)
Q Consensus 650 ~~C~~C~~~f~~~~~L~~H~~~H 672 (734)
|+|..|++.|.+..+|..|++.|
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCccCCccCChhHHHHHhHHh
Confidence 45555555555555555555544
No 40
>KOG2231|consensus
Probab=96.88 E-value=0.0017 Score=69.35 Aligned_cols=104 Identities=26% Similarity=0.627 Sum_probs=61.3
Q ss_pred cccccccccCChHHHHhHhhhhcCCCCcccccchhH---------hhcCHHHHHhHhhhcCC-CCc----cccccccccc
Q psy366 564 TCDQCGKQLSYAKTLANHIKGVHLKIKKHSCENCAK---------SFFSLAELRQHSPVHSE-QKS----FVCELCGASF 629 (734)
Q Consensus 564 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~---------~f~~~~~l~~H~~~h~~-~~~----~~C~~C~~~f 629 (734)
.|..| -.|.+...|+.|+...|.. +.|.+|-. ...+...|..|++.-.. ++. -.|..|...|
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H~~---~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~~f 192 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQHKL---HLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHERF 192 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhhhh---hccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhhhh
Confidence 46666 6666777777777666642 34444431 12245666666654322 222 3477777777
Q ss_pred cChhHHhhhhhhhccccccccccccc------cccccCchhHHHHhhhccCCccccCC
Q psy366 630 KQRTCLWSHKKWNHEELIYKFECTLC------GKKFVKNYSLYEHMKRHSDVRPYKCD 681 (734)
Q Consensus 630 ~~~~~l~~H~~~h~~~~~~~~~C~~C------~~~f~~~~~L~~H~~~H~~~~~~~C~ 681 (734)
.....|..|++.+|. .|.+| +..|.....|..|.|.++ |.|+
T Consensus 193 ld~~el~rH~~~~h~------~chfC~~~~~~neyy~~~~dLe~HfR~~H----flCE 240 (669)
T KOG2231|consen 193 LDDDELYRHLRFDHE------FCHFCDYKTGQNEYYNDYDDLEEHFRKGH----FLCE 240 (669)
T ss_pred ccHHHHHHhhcccee------heeecCcccccchhcccchHHHHHhhhcC----cccc
Confidence 777777777776553 35555 344556666777766543 6666
No 41
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.65 E-value=0.0015 Score=37.83 Aligned_cols=24 Identities=50% Similarity=0.873 Sum_probs=16.9
Q ss_pred cccCcchhhcCChHHHHHHHhhcc
Q psy366 707 YLCTICSATFKTARALKQHGRTHI 730 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H~ 730 (734)
|.|+.|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 467777777777777777777654
No 42
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.64 E-value=0.00072 Score=44.57 Aligned_cols=31 Identities=26% Similarity=0.408 Sum_probs=14.1
Q ss_pred CCCccccccccccccChhhHHHHHHHhcCCC
Q psy366 84 KIKHHWCDICGKSFFAASELKQHSFIHSEQK 114 (734)
Q Consensus 84 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~ 114 (734)
.+.|-.||+|+..+....+|++|+.++++.+
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 3445555555555555555555555554444
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.38 E-value=0.002 Score=36.46 Aligned_cols=24 Identities=33% Similarity=0.571 Sum_probs=16.3
Q ss_pred cccCcchhhcCChHHHHHHHhhccC
Q psy366 707 YLCTICSATFKTARALKQHGRTHIK 731 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H~~ 731 (734)
|+|+.|++... ...|..|+++|++
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 57788887777 7788888877653
No 44
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=96.32 E-value=0.0021 Score=36.83 Aligned_cols=23 Identities=35% Similarity=0.896 Sum_probs=18.9
Q ss_pred cccCcchhhcCChHHHHHHHhhc
Q psy366 707 YLCTICSATFKTARALKQHGRTH 729 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H 729 (734)
|.|++|++.|.+...|+.|++.+
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 56888888888888888888765
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.31 E-value=0.0025 Score=36.83 Aligned_cols=22 Identities=45% Similarity=0.886 Sum_probs=10.6
Q ss_pred ccccccccccCchhHHHHhhhc
Q psy366 651 ECTLCGKKFVKNYSLYEHMKRH 672 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~H 672 (734)
.|+.|++.|.+...|..|++.|
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 2 RCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCcchhCCHHHHHHHHHHh
Confidence 3444444454444444444443
No 46
>KOG2231|consensus
Probab=96.12 E-value=0.014 Score=62.70 Aligned_cols=123 Identities=24% Similarity=0.495 Sum_probs=84.1
Q ss_pred ccccccccccchhHHHHHhh-hhCCCCCCccccccccc---------ccCChHHHHhHhhhhcCCCC----cccccchhH
Q psy366 534 SCEQCKVQFSCKSDMRKHAK-THLPAIGRSYTCDQCGK---------QLSYAKTLANHIKGVHLKIK----KHSCENCAK 599 (734)
Q Consensus 534 ~C~~C~~~f~~~~~l~~H~~-~h~~~~~~~~~C~~C~~---------~f~~~~~L~~H~~~~H~~~~----~~~C~~C~~ 599 (734)
.|..| ..|.+...|+.|+. .| +.+.|.+|-. ..-+...|..|+..-=.++. --.|..|..
T Consensus 117 ~~~~c-~~~~s~~~Lk~H~~~~H-----~~~~c~lC~~~~kif~~e~k~Yt~~el~~h~~~gd~d~~s~rGhp~C~~C~~ 190 (669)
T KOG2231|consen 117 ECLHC-TEFKSVENLKNHMRDQH-----KLHLCSLCLQNLKIFINERKLYTRAELNLHLMFGDPDDESCRGHPLCKFCHE 190 (669)
T ss_pred CCccc-cchhHHHHHHHHHHHhh-----hhhccccccccceeeeeeeehehHHHHHHHHhcCCCccccccCCccchhhhh
Confidence 57888 77889999999984 55 3456666532 22345667777652211221 246999999
Q ss_pred hhcCHHHHHhHhhhcCCCCccccccc------cccccChhHHhhhhhhhccccccccccc--ccc-ccccCchhHHHHhh
Q psy366 600 SFFSLAELRQHSPVHSEQKSFVCELC------GASFKQRTCLWSHKKWNHEELIYKFECT--LCG-KKFVKNYSLYEHMK 670 (734)
Q Consensus 600 ~f~~~~~l~~H~~~h~~~~~~~C~~C------~~~f~~~~~l~~H~~~h~~~~~~~~~C~--~C~-~~f~~~~~L~~H~~ 670 (734)
.|.....|..|++.++ |.|..| +..|.....|..|.+..| |.|. .|. +.|.....+.++++
T Consensus 191 ~fld~~el~rH~~~~h----~~chfC~~~~~~neyy~~~~dLe~HfR~~H------flCE~~~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 191 RFLDDDELYRHLRFDH----EFCHFCDYKTGQNEYYNDYDDLEEHFRKGH------FLCEEEFCRTKKFYVAFELEIELK 260 (669)
T ss_pred hhccHHHHHHhhccce----eheeecCcccccchhcccchHHHHHhhhcC------ccccccccccceeeehhHHHHHHH
Confidence 9999999999999876 777777 356788899999999877 4566 565 34444445566665
Q ss_pred hc
Q psy366 671 RH 672 (734)
Q Consensus 671 ~H 672 (734)
.|
T Consensus 261 ~~ 262 (669)
T KOG2231|consen 261 AH 262 (669)
T ss_pred hh
Confidence 44
No 47
>PRK04860 hypothetical protein; Provisional
Probab=95.86 E-value=0.0037 Score=55.01 Aligned_cols=40 Identities=18% Similarity=0.587 Sum_probs=32.7
Q ss_pred cccccccccccccCchhHHHHhhhccCCccccCCCcCCcCCcch
Q psy366 648 YKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNY 691 (734)
Q Consensus 648 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~~~~ 691 (734)
.+|.|. |+. ....+.+|.++|+|+++|.|..|+..|....
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~ 157 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKG 157 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEec
Confidence 578888 887 6678889999999999999999988886543
No 48
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.86 E-value=0.0051 Score=58.66 Aligned_cols=130 Identities=25% Similarity=0.400 Sum_probs=85.7
Q ss_pred cccccc--ccccccChhhHHHHHHHhcCCCCccccchh---hhcc------CcchHHHhhhhhcccCc--ccccCCcccc
Q psy366 87 HHWCDI--CGKSFFAASELKQHSFIHSEQKGFVCDLCG---ASFK------QRSCLWSHKKWNHEELS--YKFECTLCGK 153 (734)
Q Consensus 87 ~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---~~f~------~~~~l~~H~~~~~~~~~--~~~~C~~C~~ 153 (734)
.|.||. |..+......|+.|.+..++ .+.|++|- +.|. ++..|+.|...--.+.+ .--.|.+|..
T Consensus 151 ~F~CP~skc~~~C~~~k~lk~H~K~~H~--~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~ 228 (493)
T COG5236 151 SFKCPKSKCHRRCGSLKELKKHYKAQHG--FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKI 228 (493)
T ss_pred HhcCCchhhhhhhhhHHHHHHHHHhhcC--cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccc
Confidence 467765 77777777788888776443 46677773 2333 34556666543322211 1236999999
Q ss_pred cccChHHHHHHHHhcCCCCCCCCCccccc-------ccchhhhhhhhhcccccceecCcc--h----hhhhhchHHHHHH
Q psy366 154 KFVKNYSLYEHMKRHSDVRPYKCDICGHG-------FKLNYDMLRHKQDVHSNIIHNCNL--C----SATFKTARGLKQH 220 (734)
Q Consensus 154 ~f~~~~~L~~H~~~h~~~~~~~C~~C~~~-------f~~~~~l~~H~~~~h~~~~~~C~~--C----~~~f~~~~~l~~H 220 (734)
.|-+...|..|++.-+. +|-+|++. |.+..+|.+|....| |.|.+ | -..|.....|..|
T Consensus 229 ~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~~~h----y~ct~qtc~~~k~~vf~~~~el~~h 300 (493)
T COG5236 229 YFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFRNAH----YCCTFQTCRVGKCYVFPYHTELLEH 300 (493)
T ss_pred eecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhhcCc----eEEEEEEEecCcEEEeccHHHHHHH
Confidence 99999999999986432 57777653 788889999986555 43432 2 2468889999999
Q ss_pred Hhhhcc
Q psy366 221 EKKHMT 226 (734)
Q Consensus 221 ~~~h~~ 226 (734)
+...++
T Consensus 301 ~~~~h~ 306 (493)
T COG5236 301 LTRFHK 306 (493)
T ss_pred HHHHhh
Confidence 865544
No 49
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.65 E-value=0.0064 Score=58.02 Aligned_cols=123 Identities=28% Similarity=0.552 Sum_probs=78.4
Q ss_pred cccccCChHHHHhHhhhhcCCCCcccccchh---Hhh------cCHHHHHhHhhhcCCCCcc----ccccccccccChhH
Q psy366 568 CGKQLSYAKTLANHIKGVHLKIKKHSCENCA---KSF------FSLAELRQHSPVHSEQKSF----VCELCGASFKQRTC 634 (734)
Q Consensus 568 C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~---~~f------~~~~~l~~H~~~h~~~~~~----~C~~C~~~f~~~~~ 634 (734)
|..+......|+.|.+..|. .+.|.+|- +.| .+...|+.|...-..+..| .|..|...|-+...
T Consensus 159 c~~~C~~~k~lk~H~K~~H~---~~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdDDE 235 (493)
T COG5236 159 CHRRCGSLKELKKHYKAQHG---FVLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDE 235 (493)
T ss_pred hhhhhhhHHHHHHHHHhhcC---cEEhHhhhcCcccCccceeeeecccccccccCCccccCcCCCchhhhccceecChHH
Confidence 44444445555555554443 23444443 223 3456677776654444344 38888888888888
Q ss_pred Hhhhhhhhcccccccccccccccc-------ccCchhHHHHhhhccCCccccCCC--cC----CcCCcchHHHhhhhhcc
Q psy366 635 LWSHKKWNHEELIYKFECTLCGKK-------FVKNYSLYEHMKRHSDVRPYKCDI--CG----HGFKLNYDMLRHKQDVH 701 (734)
Q Consensus 635 l~~H~~~h~~~~~~~~~C~~C~~~-------f~~~~~L~~H~~~H~~~~~~~C~~--C~----~~f~~~~~L~~H~~~~H 701 (734)
|..|++..|.. |-+|++. |.+...|..|.+. ..|.|.+ |- ..|.....|+.|+.+.|
T Consensus 236 L~~HcR~~HE~------ChICD~v~p~~~QYFK~Y~~Le~HF~~----~hy~ct~qtc~~~k~~vf~~~~el~~h~~~~h 305 (493)
T COG5236 236 LRRHCRLRHEA------CHICDMVGPIRYQYFKSYEDLEAHFRN----AHYCCTFQTCRVGKCYVFPYHTELLEHLTRFH 305 (493)
T ss_pred HHHHHHhhhhh------hhhhhccCccchhhhhCHHHHHHHhhc----CceEEEEEEEecCcEEEeccHHHHHHHHHHHh
Confidence 88888887764 7777754 6677778888754 2366654 32 46788888999988888
Q ss_pred CC
Q psy366 702 ST 703 (734)
Q Consensus 702 ~~ 703 (734)
..
T Consensus 306 ~~ 307 (493)
T COG5236 306 KV 307 (493)
T ss_pred hc
Confidence 63
No 50
>KOG2482|consensus
Probab=95.65 E-value=0.013 Score=56.16 Aligned_cols=153 Identities=17% Similarity=0.258 Sum_probs=91.4
Q ss_pred ChHHHHhHhhhhcCCCCcccccchhHhhc-CHHHHHhHhhhcCC----------------------CCcccccccccccc
Q psy366 574 YAKTLANHIKGVHLKIKKHSCENCAKSFF-SLAELRQHSPVHSE----------------------QKSFVCELCGASFK 630 (734)
Q Consensus 574 ~~~~L~~H~~~~H~~~~~~~C~~C~~~f~-~~~~l~~H~~~h~~----------------------~~~~~C~~C~~~f~ 630 (734)
-+..|..+++..-......+|-.|...+. +.+....|+-.-++ -..+.|-.|.+.|+
T Consensus 127 L~eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lniGlpDniVyvnelLehLkekL~r~~CLyCekifr 206 (423)
T KOG2482|consen 127 LKEALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNIGLPDNIVYVNELLEHLKEKLERLRCLYCEKIFR 206 (423)
T ss_pred HHHHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhccCCCcceeeHHHHHHHHHHHHhhheeeeeccccC
Confidence 35677777775544445567888877663 55566666543222 12477888888888
Q ss_pred ChhHHhhhhhhhccccccc--------ccccc--ccccccCchhHHHHh--hhc---------------cCCcc--ccCC
Q psy366 631 QRTCLWSHKKWNHEELIYK--------FECTL--CGKKFVKNYSLYEHM--KRH---------------SDVRP--YKCD 681 (734)
Q Consensus 631 ~~~~l~~H~~~h~~~~~~~--------~~C~~--C~~~f~~~~~L~~H~--~~H---------------~~~~~--~~C~ 681 (734)
.+..|+.||+...-..+.| |.=.+ =|++. .....+. .+- .+..+ ..|-
T Consensus 207 dkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~---t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~CL 283 (423)
T KOG2482|consen 207 DKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSW---TIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVCL 283 (423)
T ss_pred CcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCcc---chhhhhhhhhhhccccccccchhhhhcCCCCccceEEE
Confidence 8888888876542211111 00000 01110 0011111 110 01223 5899
Q ss_pred CcCCcCCcchHHHhhhhhccCCC--------------------------CCcccCcchhhcCChHHHHHHHhhc
Q psy366 682 ICGHGFKLNYDMLRHKQDVHSTT--------------------------RPYLCTICSATFKTARALKQHGRTH 729 (734)
Q Consensus 682 ~C~~~f~~~~~L~~H~~~~H~~~--------------------------~~~~C~~C~~~f~~~~~l~~H~~~H 729 (734)
.|....-....|..||..+|.-. ..-.|..|+..|.....|..||.-+
T Consensus 284 fC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e~ 357 (423)
T KOG2482|consen 284 FCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVED 357 (423)
T ss_pred eeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhcccc
Confidence 99998888999999998888310 1236999999999999999998643
No 51
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.62 E-value=0.0058 Score=34.95 Aligned_cols=21 Identities=33% Similarity=0.790 Sum_probs=11.2
Q ss_pred ccccccccccCchhHHHHhhh
Q psy366 651 ECTLCGKKFVKNYSLYEHMKR 671 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~ 671 (734)
.|.+|++.|.+...|..|++.
T Consensus 2 ~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 2 YCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCcCCHHHHHHHHCc
Confidence 455555555555555555543
No 52
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.60 E-value=0.0053 Score=34.70 Aligned_cols=20 Identities=30% Similarity=0.607 Sum_probs=8.3
Q ss_pred ccccccccccCchhHHHHhhh
Q psy366 651 ECTLCGKKFVKNYSLYEHMKR 671 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~ 671 (734)
+|+.|++... ...|.+|+++
T Consensus 2 ~C~~C~y~t~-~~~l~~H~~~ 21 (24)
T PF13909_consen 2 KCPHCSYSTS-KSNLKRHLKR 21 (24)
T ss_dssp E-SSSS-EES-HHHHHHHHHH
T ss_pred CCCCCCCcCC-HHHHHHHHHh
Confidence 4444444444 4444444444
No 53
>KOG2785|consensus
Probab=95.25 E-value=0.053 Score=53.37 Aligned_cols=80 Identities=18% Similarity=0.294 Sum_probs=43.7
Q ss_pred cccCcccccccCChHHHHHHHHhhcC--CCC-cccc-CcccccccCCchHHHhH---hhhcCCCCccccccccccccChH
Q psy366 371 GFVCDLCGASFKQRPALWAHKKLHDA--KLE-YKFK-CILCDRKFHRNSKLNSH---MRTHSDVRPYKCDICEQHFKFNY 443 (734)
Q Consensus 371 ~~~C~~C~~~f~~~~~l~~H~~~h~~--~~~-~~~~-C~~C~~~f~~~~~l~~H---~~~h~~~~~~~C~~C~~~f~~~~ 443 (734)
.|.|.-|...|.+...-+.|+++--. +.. +... =++=-..|.....-..- ...-.++-++.|.+|.+.|.+..
T Consensus 3 ~ftC~tC~v~F~~ad~Qr~HyKSdWHRYNLKRkVA~lPPItaE~F~~k~~s~~~~~~~~~e~~~~~~~c~~c~k~~~s~~ 82 (390)
T KOG2785|consen 3 GFTCNTCNVEFDDADEQRAHYKSDWHRYNLKRKVASLPPITAEEFNEKVLSDDSEKEENLEEAESVVYCEACNKSFASPK 82 (390)
T ss_pred cceeeceeeeeccHHHHHHHhhhhHHHhhHHhHhhcCCCcCHHHHhHHHhhhhhhhhhhhhhcccceehHHhhccccChh
Confidence 47899999999999888888865211 100 0000 00111112111110000 00123456788888888888888
Q ss_pred HHHhHHh
Q psy366 444 DIQMHKR 450 (734)
Q Consensus 444 ~l~~H~~ 450 (734)
....|+.
T Consensus 83 a~~~hl~ 89 (390)
T KOG2785|consen 83 AHENHLK 89 (390)
T ss_pred hHHHHHH
Confidence 8888876
No 54
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=95.24 E-value=0.006 Score=35.66 Aligned_cols=22 Identities=32% Similarity=0.770 Sum_probs=13.7
Q ss_pred cccCcchhhcCChHHHHHHHhh
Q psy366 707 YLCTICSATFKTARALKQHGRT 728 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~ 728 (734)
|.|.+|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666654
No 55
>KOG2785|consensus
Probab=95.21 E-value=0.049 Score=53.59 Aligned_cols=75 Identities=15% Similarity=0.119 Sum_probs=46.4
Q ss_pred CcccccccccccCChHHHHHHHhhcCCCCccccccccccccCcchHHHHHhhhcCCCCCccccccc---cccCChhhHHH
Q psy366 285 ILSQCPVCKLTFTDKRKLKSHLKTHSIERSYTCHHCGKQLCGASNLSLHIKSVHLKIKDHSCDVCD---KSFVNRAGLRL 361 (734)
Q Consensus 285 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~---~~f~~~~~l~~ 361 (734)
++-.|-.|+..+.+...-..||..+++-..-.=. -......|..-+.. .-...+.|-.|+ +.|.+....+.
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdre----YL~D~~GLl~YLge--KV~~~~~CL~CN~~~~~f~sleavr~ 238 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDRE----YLTDEKGLLKYLGE--KVGIGFICLFCNELGRPFSSLEAVRA 238 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCcCCchH----hhhchhHHHHHHHH--HhccCceEEEeccccCcccccHHHHH
Confidence 4577999999999999999999887753110000 01111222222211 012346788888 88999999999
Q ss_pred HHhh
Q psy366 362 HKLI 365 (734)
Q Consensus 362 H~~~ 365 (734)
||..
T Consensus 239 HM~~ 242 (390)
T KOG2785|consen 239 HMRD 242 (390)
T ss_pred HHhh
Confidence 9875
No 56
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.87 E-value=0.017 Score=33.71 Aligned_cols=21 Identities=33% Similarity=0.675 Sum_probs=12.2
Q ss_pred cccccccccccChhHHhhhhh
Q psy366 620 FVCELCGASFKQRTCLWSHKK 640 (734)
Q Consensus 620 ~~C~~C~~~f~~~~~l~~H~~ 640 (734)
|.|..|++.|.+...|..|++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTT
T ss_pred CCcccCCCCcCCHHHHHHHHc
Confidence 455566666666655555554
No 57
>KOG2482|consensus
Probab=94.77 E-value=0.033 Score=53.38 Aligned_cols=132 Identities=20% Similarity=0.286 Sum_probs=70.6
Q ss_pred hhhhhcccchhhccCcHHHHHHHHhhc---CCcch-----------------hHhhcccc----cccccc---------c
Q psy366 5 ISRLICSLCSETFKSKPRLRAHIWKCQ---MEPET-----------------ILETLGHQ----FESQLF---------T 51 (734)
Q Consensus 5 ~~~~~C~~C~~~f~~~~~l~~H~~~~~---~~~~~-----------------~~~~~~~~----~~~~~~---------~ 51 (734)
.-+++|-.|-+.|+.+..|+.||++-. +.|.. ......+. .++... .
T Consensus 193 L~r~~CLyCekifrdkntLkeHMrkK~HrrinPknreYDkfyiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~e 272 (423)
T KOG2482|consen 193 LERLRCLYCEKIFRDKNTLKEHMRKKRHRRINPKNREYDKFYIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNE 272 (423)
T ss_pred HhhheeeeeccccCCcHHHHHHHHhccCcccCCCccccceEEEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhc
Confidence 346899999999999999999998632 12211 00000000 000000 1
Q ss_pred CCCCCCCCCccccccccCChHHHHHHHHHHhcCCCccccccccccccChhhHHHHHHHhcCCCCccccchhhhccCcchH
Q psy366 52 GPMEKCDIMCDQCGKHFTSSQSLARHIKCVHLKIKHHWCDICGKSFFAASELKQHSFIHSEQKGFVCDLCGASFKQRSCL 131 (734)
Q Consensus 52 ~~~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l 131 (734)
.+.+.....|-.|.....+...|..||+.+|.=.-.-.=+.=+-.|-..-.+..=.| .....-.|-.|.-.|.....|
T Consensus 273 d~a~a~~v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiR--kq~~~~~c~~cd~~F~~e~~l 350 (423)
T KOG2482|consen 273 DDAEALSVVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIR--KQKKKSRCAECDLSFWKEPGL 350 (423)
T ss_pred CCCCccceEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHH--HHhhccccccccccccCcchh
Confidence 111112268999999999999999999988831100000000011111111111111 112234577777888888888
Q ss_pred HHhhhhh
Q psy366 132 WSHKKWN 138 (734)
Q Consensus 132 ~~H~~~~ 138 (734)
..||..+
T Consensus 351 ~~hm~e~ 357 (423)
T KOG2482|consen 351 LIHMVED 357 (423)
T ss_pred hhhcccc
Confidence 8887643
No 58
>PRK04860 hypothetical protein; Provisional
Probab=94.69 E-value=0.015 Score=51.22 Aligned_cols=38 Identities=16% Similarity=0.488 Sum_probs=24.8
Q ss_pred ccccccccccccChhhHHHHHHHhcCCCCccccchhhhccCc
Q psy366 87 HHWCDICGKSFFAASELKQHSFIHSEQKGFVCDLCGASFKQR 128 (734)
Q Consensus 87 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 128 (734)
+|.|. |++ ....+.+|.++|.++++|.|..|+..|...
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 46676 666 455666777777777777777776666543
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.54 E-value=0.012 Score=63.90 Aligned_cols=153 Identities=24% Similarity=0.374 Sum_probs=96.9
Q ss_pred CcccccccccccCChHHHHhHhhh-hcCCC--Cccccc--chhHhhcCHHHHHhHhhhcCCCCccccccc--cccccChh
Q psy366 561 RSYTCDQCGKQLSYAKTLANHIKG-VHLKI--KKHSCE--NCAKSFFSLAELRQHSPVHSEQKSFVCELC--GASFKQRT 633 (734)
Q Consensus 561 ~~~~C~~C~~~f~~~~~L~~H~~~-~H~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C--~~~f~~~~ 633 (734)
.++.|..|...|.....|..|.+. .|.++ +++.|+ .|++.|.....+..|...|.+..++.+..- ...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 356677777777777777777774 67777 788888 688888888888888888888777776553 33333332
Q ss_pred HHhhhhhhh---ccccccccccc--cccccccCchhHHHHhhhccCCc--cccCCCcCCcCCcchHHHhhhhhccCCCCC
Q psy366 634 CLWSHKKWN---HEELIYKFECT--LCGKKFVKNYSLYEHMKRHSDVR--PYKCDICGHGFKLNYDMLRHKQDVHSTTRP 706 (734)
Q Consensus 634 ~l~~H~~~h---~~~~~~~~~C~--~C~~~f~~~~~L~~H~~~H~~~~--~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~ 706 (734)
.-..+.... .......+.+. .|-..+.....+..|...|...+ .+.+..|.+.|.....|..|+ ..|....+
T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 446 (467)
T COG5048 368 NNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHK-KIHTNHAP 446 (467)
T ss_pred CCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhhccCcccccccc-cccccCCc
Confidence 211111111 11111334443 36667777777777777776655 466777888888888888887 46666666
Q ss_pred cccCcchh
Q psy366 707 YLCTICSA 714 (734)
Q Consensus 707 ~~C~~C~~ 714 (734)
+.|..++.
T Consensus 447 ~~~~~~~~ 454 (467)
T COG5048 447 LLCSILKS 454 (467)
T ss_pred eeeccccc
Confidence 55554443
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.76 E-value=0.024 Score=61.35 Aligned_cols=141 Identities=23% Similarity=0.302 Sum_probs=106.5
Q ss_pred CcccccchhHhhcCHHHHHhHhh--hcCCC--Cccccc--cccccccChhHHhhhhhhhcccccccccccc--ccccccC
Q psy366 590 KKHSCENCAKSFFSLAELRQHSP--VHSEQ--KSFVCE--LCGASFKQRTCLWSHKKWNHEELIYKFECTL--CGKKFVK 661 (734)
Q Consensus 590 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~--C~~~f~~ 661 (734)
.++.|..|...|.....|..|.+ .|.++ +|+.|+ .|++.|.....+..|...+.+. .++.+.. +...+..
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 365 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSI--SPAKEKLLNSSSKFSP 365 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCC--CccccccccCcccccc
Confidence 57899999999999999999999 89999 999999 8999999999999999999887 5555544 4444444
Q ss_pred chh-----HHHHhhhccCCccccCCC--cCCcCCcchHHHhhhhhccCCCC--CcccCcchhhcCChHHHHHHHhhccCC
Q psy366 662 NYS-----LYEHMKRHSDVRPYKCDI--CGHGFKLNYDMLRHKQDVHSTTR--PYLCTICSATFKTARALKQHGRTHIKS 732 (734)
Q Consensus 662 ~~~-----L~~H~~~H~~~~~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~--~~~C~~C~~~f~~~~~l~~H~~~H~~~ 732 (734)
... .......-...+.+.+.. |...+.....+..|. ..|...+ .+.+..|++.|.....|..|++.|...
T Consensus 366 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (467)
T COG5048 366 LLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHI-ITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNH 444 (467)
T ss_pred ccCCCCccchhhccCccCCccccccccchhhhhcccccccccc-ccccccCCcCCCCCcchhhccCcccccccccccccC
Confidence 332 111111222334455553 667777777788886 4566554 578999999999999999999998754
Q ss_pred C
Q psy366 733 R 733 (734)
Q Consensus 733 ~ 733 (734)
.
T Consensus 445 ~ 445 (467)
T COG5048 445 A 445 (467)
T ss_pred C
Confidence 3
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=93.14 E-value=0.052 Score=30.87 Aligned_cols=22 Identities=32% Similarity=0.770 Sum_probs=18.8
Q ss_pred hhcccchhhccCcHHHHHHHHhh
Q psy366 8 LICSLCSETFKSKPRLRAHIWKC 30 (734)
Q Consensus 8 ~~C~~C~~~f~~~~~l~~H~~~~ 30 (734)
..|++||++| ..+.|..|+..|
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~C 24 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKIC 24 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHhc
Confidence 5799999999 689999998643
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.28 E-value=0.11 Score=29.62 Aligned_cols=19 Identities=32% Similarity=0.810 Sum_probs=9.5
Q ss_pred ccCcchhhcCChHHHHHHHh
Q psy366 708 LCTICSATFKTARALKQHGR 727 (734)
Q Consensus 708 ~C~~C~~~f~~~~~l~~H~~ 727 (734)
.|++||+.| ..+.|..|+.
T Consensus 4 ~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred cCCCCCCEE-CHHHHHHHHH
Confidence 455555555 4444555543
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.98 E-value=0.11 Score=32.59 Aligned_cols=23 Identities=22% Similarity=0.648 Sum_probs=16.7
Q ss_pred CcccCcchhhcCChHHHHHHHhh
Q psy366 706 PYLCTICSATFKTARALKQHGRT 728 (734)
Q Consensus 706 ~~~C~~C~~~f~~~~~l~~H~~~ 728 (734)
+|.|++|++.|....++..|++.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccccCCccCCHHHHHHHHCh
Confidence 46777777777777777777654
No 64
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=90.72 E-value=0.45 Score=38.55 Aligned_cols=22 Identities=23% Similarity=0.451 Sum_probs=13.7
Q ss_pred CCccccccccccccChhHHhhh
Q psy366 617 QKSFVCELCGASFKQRTCLWSH 638 (734)
Q Consensus 617 ~~~~~C~~C~~~f~~~~~l~~H 638 (734)
+-|-.|++||....+...|.+-
T Consensus 13 ~LP~~CpiCgLtLVss~HLARS 34 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARS 34 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHh
Confidence 3566677777666666666543
No 65
>KOG2893|consensus
Probab=90.00 E-value=0.097 Score=47.42 Aligned_cols=46 Identities=33% Similarity=0.667 Sum_probs=35.5
Q ss_pred CccccccccCChHHHHHHHHHHhcCCCccccccccccccChhhHHHH-HHHh
Q psy366 60 MCDQCGKHFTSSQSLARHIKCVHLKIKHHWCDICGKSFFAASELKQH-SFIH 110 (734)
Q Consensus 60 ~C~~C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H-~~~h 110 (734)
.|=.|++.|.+...|.+|.+.. -|+|-+|.|...+--.|..| |.+|
T Consensus 12 wcwycnrefddekiliqhqkak-----hfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKAK-----HFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhhhc-----cceeeeehhhhccCCCceeehhhhh
Confidence 3888999999999998887744 48899998887777777777 3444
No 66
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=89.59 E-value=0.21 Score=31.24 Aligned_cols=22 Identities=32% Similarity=0.759 Sum_probs=15.0
Q ss_pred ccccccccccccChhHHhhhhh
Q psy366 619 SFVCELCGASFKQRTCLWSHKK 640 (734)
Q Consensus 619 ~~~C~~C~~~f~~~~~l~~H~~ 640 (734)
+|.|++|+..|.+..++..|++
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4667777777777777776664
No 67
>KOG4173|consensus
Probab=88.25 E-value=0.26 Score=43.84 Aligned_cols=77 Identities=22% Similarity=0.402 Sum_probs=41.7
Q ss_pred ccccc--cccccCChHHHHhHhhhhcCCCCcccccchhHhhcCHHHHHhHhhhc----------CCCCcccccc--cccc
Q psy366 563 YTCDQ--CGKQLSYAKTLANHIKGVHLKIKKHSCENCAKSFFSLAELRQHSPVH----------SEQKSFVCEL--CGAS 628 (734)
Q Consensus 563 ~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h----------~~~~~~~C~~--C~~~ 628 (734)
|.|++ |...|.+......|....|++ .|..|.+.|++..-|..|+..- .|...|+|-+ |+..
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~K 155 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHGN----SCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEK 155 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhcccc----hhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhh
Confidence 44544 555555555555555544543 3556666666555555554321 2344566643 6666
Q ss_pred ccChhHHhhhhhhhc
Q psy366 629 FKQRTCLWSHKKWNH 643 (734)
Q Consensus 629 f~~~~~l~~H~~~h~ 643 (734)
|.+...-..|+-..|
T Consensus 156 FkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 156 FKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhHHHHhc
Confidence 666666666654443
No 68
>KOG4173|consensus
Probab=87.96 E-value=0.29 Score=43.53 Aligned_cols=79 Identities=23% Similarity=0.436 Sum_probs=48.4
Q ss_pred CCCCccc--cccccCChHHHHHHHHHHhcCCCccccccccccccChhhHHHHHHH-hc---------CCCCccc--cchh
Q psy366 57 CDIMCDQ--CGKHFTSSQSLARHIKCVHLKIKHHWCDICGKSFFAASELKQHSFI-HS---------EQKGFVC--DLCG 122 (734)
Q Consensus 57 ~~~~C~~--C~~~f~~~~~l~~H~~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~-h~---------~~~~~~C--~~C~ 122 (734)
..|.|++ |...|.+......|....|. -.|.+|.+.|++...|..|+.. |. |..-|.| +.|+
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~----~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG----NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc----chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 3466775 77788887777777654453 3688888888888888888653 21 2333455 2355
Q ss_pred hhccCcchHHHhhhhhc
Q psy366 123 ASFKQRSCLWSHKKWNH 139 (734)
Q Consensus 123 ~~f~~~~~l~~H~~~~~ 139 (734)
..|.+...-..|+...|
T Consensus 154 ~KFkT~r~RkdH~I~~H 170 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMH 170 (253)
T ss_pred hhhhhhhhhhhHHHHhc
Confidence 55555555555544433
No 69
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=87.32 E-value=0.38 Score=29.60 Aligned_cols=9 Identities=44% Similarity=1.531 Sum_probs=4.6
Q ss_pred ccccCCCcC
Q psy366 676 RPYKCDICG 684 (734)
Q Consensus 676 ~~~~C~~C~ 684 (734)
.|+.|++||
T Consensus 16 ~~~~CP~Cg 24 (33)
T cd00350 16 APWVCPVCG 24 (33)
T ss_pred CCCcCcCCC
Confidence 445555554
No 70
>KOG2893|consensus
Probab=85.68 E-value=0.27 Score=44.64 Aligned_cols=49 Identities=27% Similarity=0.510 Sum_probs=39.8
Q ss_pred cCCcccccccChHHHHHHHHhcCCCCCCCCCcccccccchhhhhhhhhccccc
Q psy366 147 ECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLNYDMLRHKQDVHSN 199 (734)
Q Consensus 147 ~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~ 199 (734)
-|-+|++.|.+..-|..|++. +-|+|-+|-+..-+-..|..|-..+|.+
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvhke 60 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHKE 60 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhhh
Confidence 499999999999999999864 3499999998887778888887666643
No 71
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=80.52 E-value=0.81 Score=31.03 Aligned_cols=29 Identities=24% Similarity=0.452 Sum_probs=16.9
Q ss_pred CCCCccccchhhhccCcchHHHhhhhhcc
Q psy366 112 EQKGFVCDLCGASFKQRSCLWSHKKWNHE 140 (734)
Q Consensus 112 ~~~~~~C~~C~~~f~~~~~l~~H~~~~~~ 140 (734)
|+..+.|+-||+.|....++.+|+...|+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 44555666666666666666666555443
No 72
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=79.11 E-value=2.9 Score=34.53 Aligned_cols=25 Identities=20% Similarity=0.533 Sum_probs=23.4
Q ss_pred ccc----CcchhhcCChHHHHHHHhhccC
Q psy366 707 YLC----TICSATFKTARALKQHGRTHIK 731 (734)
Q Consensus 707 ~~C----~~C~~~f~~~~~l~~H~~~H~~ 731 (734)
|.| ..|++...+...++.|++.+||
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 899 9999999999999999999876
No 73
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=78.94 E-value=0.75 Score=31.17 Aligned_cols=30 Identities=23% Similarity=0.471 Sum_probs=18.2
Q ss_pred CCCCccccccccccccChhHHhhhhhhhcc
Q psy366 615 SEQKSFVCELCGASFKQRTCLWSHKKWNHE 644 (734)
Q Consensus 615 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~ 644 (734)
.||--+.|+-||..|.....+..|+...|+
T Consensus 13 DGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 13 DGEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CCceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 455556666666666666666666655443
No 74
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=78.53 E-value=1.9 Score=35.66 Aligned_cols=24 Identities=29% Similarity=0.570 Sum_probs=15.5
Q ss_pred ccC----CCcCCcCCcchHHHhhhhhcc
Q psy366 678 YKC----DICGHGFKLNYDMLRHKQDVH 701 (734)
Q Consensus 678 ~~C----~~C~~~f~~~~~L~~H~~~~H 701 (734)
|.| ..|++...+...|.+|.+..|
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 666 666666666666666665554
No 75
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=77.03 E-value=2 Score=34.90 Aligned_cols=92 Identities=21% Similarity=0.308 Sum_probs=49.5
Q ss_pred CCCCCccccccccCChHHHHHHHHHHhc-CCCccccccccccccChhhHHHHHHHhcCCCCccccchhhhccCcchHHHh
Q psy366 56 KCDIMCDQCGKHFTSSQSLARHIKCVHL-KIKHHWCDICGKSFFAASELKQHSFIHSEQKGFVCDLCGASFKQRSCLWSH 134 (734)
Q Consensus 56 ~~~~~C~~C~~~f~~~~~l~~H~~~~h~-~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 134 (734)
.-|..|++||-+..+...|.+-. +|. ..++|. . -.+. .....-.|--|...|........
T Consensus 13 ~LP~~CpiCgLtLVss~HLARSy--HHLfPl~~f~--e----v~~~----------~~~~~~~C~~C~~~f~~~~~~~~- 73 (112)
T TIGR00622 13 ELPVECPICGLTLILSTHLARSY--HHLFPLKAFQ--E----IPLE----------EYNGSRFCFGCQGPFPKPPVSPF- 73 (112)
T ss_pred CCCCcCCcCCCEEeccchHHHhh--hccCCCcccc--c----cccc----------ccCCCCcccCcCCCCCCcccccc-
Confidence 34778999999999988888753 231 222221 0 0000 01111236666666654321110
Q ss_pred hhhhcccCcccccCCcccccccChHHHHHHHHhcC
Q psy366 135 KKWNHEELSYKFECTLCGKKFVKNYSLYEHMKRHS 169 (734)
Q Consensus 135 ~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 169 (734)
. .......|+|+.|...|-..-+.-.|...|.
T Consensus 74 --~-~~~~~~~y~C~~C~~~FC~dCD~fiHe~Lh~ 105 (112)
T TIGR00622 74 --D-ELKDSHRYVCAVCKNVFCVDCDVFVHESLHC 105 (112)
T ss_pred --c-ccccccceeCCCCCCccccccchhhhhhccC
Confidence 0 0111236778888888887777777776663
No 76
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=74.33 E-value=1.8 Score=26.76 Aligned_cols=8 Identities=75% Similarity=1.705 Sum_probs=3.9
Q ss_pred cccCCCcC
Q psy366 677 PYKCDICG 684 (734)
Q Consensus 677 ~~~C~~C~ 684 (734)
|..|++||
T Consensus 18 p~~CP~Cg 25 (34)
T cd00729 18 PEKCPICG 25 (34)
T ss_pred CCcCcCCC
Confidence 44555554
No 77
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=73.78 E-value=2 Score=27.01 Aligned_cols=32 Identities=25% Similarity=0.838 Sum_probs=18.8
Q ss_pred ccccccccccchhHHHHHhhhhCCCCCCccccccccccc
Q psy366 534 SCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQL 572 (734)
Q Consensus 534 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f 572 (734)
.|+.|+..|.-.++.. +..+...+|+.|+..|
T Consensus 4 ~Cp~C~~~y~i~d~~i-------p~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKI-------PPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHC-------CCCCcEEECCCCCCEe
Confidence 4666666666555432 2445566777777665
No 78
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=73.57 E-value=1.2 Score=41.95 Aligned_cols=23 Identities=26% Similarity=0.639 Sum_probs=13.2
Q ss_pred CccccccccccccChhHHhhhhh
Q psy366 618 KSFVCELCGASFKQRTCLWSHKK 640 (734)
Q Consensus 618 ~~~~C~~C~~~f~~~~~l~~H~~ 640 (734)
+.++|++|+..|.++.-.....+
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r 26 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIR 26 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCce
Confidence 44566666666666554444443
No 79
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=70.69 E-value=2.2 Score=34.86 Aligned_cols=14 Identities=36% Similarity=0.883 Sum_probs=6.3
Q ss_pred CCccccchhhhccC
Q psy366 114 KGFVCDLCGASFKQ 127 (734)
Q Consensus 114 ~~~~C~~C~~~f~~ 127 (734)
.|-.|+.||..|.-
T Consensus 25 ~PivCP~CG~~~~~ 38 (108)
T PF09538_consen 25 DPIVCPKCGTEFPP 38 (108)
T ss_pred CCccCCCCCCccCc
Confidence 34444444444433
No 80
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=70.41 E-value=1.4 Score=41.54 Aligned_cols=12 Identities=17% Similarity=0.440 Sum_probs=8.5
Q ss_pred ccCcchhhcCCh
Q psy366 708 LCTICSATFKTA 719 (734)
Q Consensus 708 ~C~~C~~~f~~~ 719 (734)
.||.||.+|...
T Consensus 50 vCP~CgyA~~~~ 61 (214)
T PF09986_consen 50 VCPHCGYAAFEE 61 (214)
T ss_pred ECCCCCCccccc
Confidence 578887777654
No 81
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=69.63 E-value=2.8 Score=24.12 Aligned_cols=10 Identities=40% Similarity=0.913 Sum_probs=6.0
Q ss_pred cccccccccc
Q psy366 651 ECTLCGKKFV 660 (734)
Q Consensus 651 ~C~~C~~~f~ 660 (734)
.|+.||..|.
T Consensus 16 ~Cp~CG~~F~ 25 (26)
T PF10571_consen 16 FCPHCGYDFE 25 (26)
T ss_pred cCCCCCCCCc
Confidence 4666666653
No 82
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=68.07 E-value=3.1 Score=26.34 Aligned_cols=32 Identities=34% Similarity=0.860 Sum_probs=16.2
Q ss_pred ccccccccccchhHHHHHhhhhCCCCCCccccccccccc
Q psy366 534 SCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQL 572 (734)
Q Consensus 534 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f 572 (734)
.|+.|+..|....+.. +..+...+|+.|+..|
T Consensus 4 ~CP~C~~~f~v~~~~l-------~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDKL-------PAGGRKVRCPKCGHVF 35 (37)
T ss_pred ECCCCCceEEcCHHHc-------ccCCcEEECCCCCcEe
Confidence 4555665555444321 1334456666666555
No 83
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=65.63 E-value=3.7 Score=26.15 Aligned_cols=13 Identities=31% Similarity=0.830 Sum_probs=6.7
Q ss_pred ccccccccccchh
Q psy366 534 SCEQCKVQFSCKS 546 (734)
Q Consensus 534 ~C~~C~~~f~~~~ 546 (734)
.|+.|+..|.-..
T Consensus 4 ~CP~C~~~~~v~~ 16 (38)
T TIGR02098 4 QCPNCKTSFRVVD 16 (38)
T ss_pred ECCCCCCEEEeCH
Confidence 4555555555443
No 84
>PF12907 zf-met2: Zinc-binding
Probab=64.47 E-value=2.4 Score=27.22 Aligned_cols=29 Identities=21% Similarity=0.443 Sum_probs=15.8
Q ss_pred ccCCCcCCcC---CcchHHHhhhhhccCCCCC
Q psy366 678 YKCDICGHGF---KLNYDMLRHKQDVHSTTRP 706 (734)
Q Consensus 678 ~~C~~C~~~f---~~~~~L~~H~~~~H~~~~~ 706 (734)
+.|.+|..+| .....|..|....|....+
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~ 33 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTF 33 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCH
Confidence 4566666544 3334466666666665443
No 85
>KOG2186|consensus
Probab=63.66 E-value=3.8 Score=38.22 Aligned_cols=46 Identities=24% Similarity=0.553 Sum_probs=28.5
Q ss_pred ccccchhHhhcCHHHHHhHhhhcCCCCccccccccccccChhHHhhhhh
Q psy366 592 HSCENCAKSFFSLAELRQHSPVHSEQKSFVCELCGASFKQRTCLWSHKK 640 (734)
Q Consensus 592 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~ 640 (734)
|.|..||....- ..+.+|+..-++ .-|.|..|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 456666665543 345556666655 55777777777766 56666654
No 86
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=62.36 E-value=4.1 Score=36.03 Aligned_cols=24 Identities=29% Similarity=0.818 Sum_probs=16.3
Q ss_pred ccccccccccccChHHHHhHHhhhccCCCccccCcchh
Q psy366 429 PYKCDICEQHFKFNYDIQMHKRCVHSNIRPYQCTLCSA 466 (734)
Q Consensus 429 ~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~C~~C~~ 466 (734)
.|.|++||+ ++.++.|-+|++||.
T Consensus 134 ~~vC~vCGy--------------~~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGY--------------THEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCC--------------cccCCCCCcCCCCCC
Confidence 577777776 444567777777773
No 87
>PF15269 zf-C2H2_7: Zinc-finger
Probab=62.24 E-value=5.8 Score=25.62 Aligned_cols=27 Identities=26% Similarity=0.557 Sum_probs=22.8
Q ss_pred cCcccccCCcccccccChHHHHHHHHh
Q psy366 141 ELSYKFECTLCGKKFVKNYSLYEHMKR 167 (734)
Q Consensus 141 ~~~~~~~C~~C~~~f~~~~~L~~H~~~ 167 (734)
.+++.|+|-.|..+...++.|-.||+.
T Consensus 16 gkp~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 16 GKPFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CCCccceeecCCcccchHHHHHHHHHH
Confidence 345678999999999999999999875
No 88
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=61.34 E-value=4.8 Score=27.00 Aligned_cols=11 Identities=27% Similarity=1.102 Sum_probs=5.7
Q ss_pred ccccccccccc
Q psy366 650 FECTLCGKKFV 660 (734)
Q Consensus 650 ~~C~~C~~~f~ 660 (734)
|.|+.||..|.
T Consensus 4 y~C~~CG~~~~ 14 (46)
T PRK00398 4 YKCARCGREVE 14 (46)
T ss_pred EECCCCCCEEE
Confidence 45555555544
No 89
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=61.00 E-value=4.1 Score=36.09 Aligned_cols=24 Identities=25% Similarity=0.682 Sum_probs=15.6
Q ss_pred CcccccccccccCChHHHHhHhhhhcCCCCcccccchh
Q psy366 561 RSYTCDQCGKQLSYAKTLANHIKGVHLKIKKHSCENCA 598 (734)
Q Consensus 561 ~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~ 598 (734)
+.|.|++||. ++.|+.|-+||+||
T Consensus 133 ~~~vC~vCGy--------------~~~ge~P~~CPiCg 156 (166)
T COG1592 133 KVWVCPVCGY--------------THEGEAPEVCPICG 156 (166)
T ss_pred CEEEcCCCCC--------------cccCCCCCcCCCCC
Confidence 3577777766 45556667777776
No 90
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=59.77 E-value=7.2 Score=31.86 Aligned_cols=33 Identities=33% Similarity=0.690 Sum_probs=23.4
Q ss_pred cccccCCcccccccChHHHHHHHHhcCCCCCCCCCcccccccch
Q psy366 143 SYKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFKLN 186 (734)
Q Consensus 143 ~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 186 (734)
+.+..|+.||..|-.. +..|..|+.||..|...
T Consensus 7 GtKR~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 7 GTKRTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred CCcccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 3456788888888663 34677888888887655
No 91
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=59.37 E-value=7.2 Score=27.32 Aligned_cols=9 Identities=44% Similarity=1.194 Sum_probs=5.9
Q ss_pred CcccCcchh
Q psy366 706 PYLCTICSA 714 (734)
Q Consensus 706 ~~~C~~C~~ 714 (734)
+|+|+.||.
T Consensus 50 ~Y~Cp~CGF 58 (61)
T COG2888 50 PYRCPKCGF 58 (61)
T ss_pred ceECCCcCc
Confidence 577777664
No 92
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=59.30 E-value=4.7 Score=24.53 Aligned_cols=11 Identities=36% Similarity=1.153 Sum_probs=6.1
Q ss_pred ccccccccccc
Q psy366 620 FVCELCGASFK 630 (734)
Q Consensus 620 ~~C~~C~~~f~ 630 (734)
|.|..||..+.
T Consensus 1 Y~C~~Cg~~~~ 11 (32)
T PF03604_consen 1 YICGECGAEVE 11 (32)
T ss_dssp EBESSSSSSE-
T ss_pred CCCCcCCCeeE
Confidence 45666666554
No 93
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=57.82 E-value=9.3 Score=21.95 Aligned_cols=19 Identities=16% Similarity=0.503 Sum_probs=11.5
Q ss_pred CccccccccCChHHHHHHHH
Q psy366 60 MCDQCGKHFTSSQSLARHIK 79 (734)
Q Consensus 60 ~C~~C~~~f~~~~~l~~H~~ 79 (734)
.||+|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 466776666 4456666654
No 94
>KOG2186|consensus
Probab=57.61 E-value=4.8 Score=37.59 Aligned_cols=54 Identities=31% Similarity=0.636 Sum_probs=37.8
Q ss_pred ccccchhhhccCcchHHHhhhhhcccCcccccCCcccccccChHHHHHHHHhcCCCCCC
Q psy366 116 FVCDLCGASFKQRSCLWSHKKWNHEELSYKFECTLCGKKFVKNYSLYEHMKRHSDVRPY 174 (734)
Q Consensus 116 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~ 174 (734)
|.|.+||.+-.-. .|.+|+-.-++. -|.|..|+++|.. .+...|..--+....|
T Consensus 4 FtCnvCgEsvKKp-~vekH~srCrn~---~fSCIDC~k~F~~-~sYknH~kCITEaQKY 57 (276)
T KOG2186|consen 4 FTCNVCGESVKKP-QVEKHMSRCRNA---YFSCIDCGKTFER-VSYKNHTKCITEAQKY 57 (276)
T ss_pred Eehhhhhhhcccc-chHHHHHhccCC---eeEEeeccccccc-chhhhhhhhcchHHHh
Confidence 6788888765543 466688877774 5888888888887 7777887655443333
No 95
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=57.35 E-value=5.6 Score=28.45 Aligned_cols=26 Identities=23% Similarity=0.530 Sum_probs=12.0
Q ss_pred chhHHHHhhhccCCccccCCC----cCCcC
Q psy366 662 NYSLYEHMKRHSDVRPYKCDI----CGHGF 687 (734)
Q Consensus 662 ~~~L~~H~~~H~~~~~~~C~~----C~~~f 687 (734)
...|..|+..--..++..|.+ |+..+
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~~~~GC~~~~ 52 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPYSPYGCKERV 52 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS----S--EE
T ss_pred HHHHHHHHHccCCCCcEECCCCCCCCCCcc
Confidence 335555555444455555555 55544
No 96
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=56.37 E-value=9.7 Score=33.86 Aligned_cols=17 Identities=18% Similarity=0.180 Sum_probs=7.8
Q ss_pred cccccchhHhhcCHHHH
Q psy366 591 KHSCENCAKSFFSLAEL 607 (734)
Q Consensus 591 ~~~C~~C~~~f~~~~~l 607 (734)
-|.||.|+..|+...++
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 34444444444444444
No 97
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=55.53 E-value=8.1 Score=26.47 Aligned_cols=20 Identities=30% Similarity=0.732 Sum_probs=9.9
Q ss_pred cCCCcCCcCCcc-----hHHHhhhh
Q psy366 679 KCDICGHGFKLN-----YDMLRHKQ 698 (734)
Q Consensus 679 ~C~~C~~~f~~~-----~~L~~H~~ 698 (734)
.|.+|++.+... ++|.+|+.
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 455565555333 34555543
No 98
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=53.96 E-value=7.2 Score=25.90 Aligned_cols=10 Identities=40% Similarity=1.056 Sum_probs=3.5
Q ss_pred cccccccccc
Q psy366 651 ECTLCGKKFV 660 (734)
Q Consensus 651 ~C~~C~~~f~ 660 (734)
.|.+|++.+.
T Consensus 18 ~C~~C~~~~~ 27 (45)
T PF02892_consen 18 KCKYCGKVIK 27 (45)
T ss_dssp EETTTTEE--
T ss_pred EeCCCCeEEe
Confidence 3444444443
No 99
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=53.93 E-value=10 Score=33.68 Aligned_cols=17 Identities=12% Similarity=0.127 Sum_probs=9.0
Q ss_pred ccccccccccccChhhH
Q psy366 87 HHWCDICGKSFFAASEL 103 (734)
Q Consensus 87 ~~~C~~C~~~f~~~~~l 103 (734)
.|.|+.|+..|....++
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 45555555555544444
No 100
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=53.00 E-value=7.8 Score=32.16 Aligned_cols=10 Identities=30% Similarity=0.820 Sum_probs=5.2
Q ss_pred cccccccccc
Q psy366 89 WCDICGKSFF 98 (734)
Q Consensus 89 ~C~~C~~~f~ 98 (734)
.|+.||++|.
T Consensus 11 ~Cp~cg~kFY 20 (129)
T TIGR02300 11 ICPNTGSKFY 20 (129)
T ss_pred cCCCcCcccc
Confidence 4555555554
No 101
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=52.44 E-value=12 Score=32.85 Aligned_cols=14 Identities=14% Similarity=0.558 Sum_probs=6.6
Q ss_pred CccccccccccccC
Q psy366 618 KSFVCELCGASFKQ 631 (734)
Q Consensus 618 ~~~~C~~C~~~f~~ 631 (734)
.-|.|+.|+..|..
T Consensus 98 ~~Y~Cp~C~~~y~~ 111 (147)
T smart00531 98 AYYKCPNCQSKYTF 111 (147)
T ss_pred cEEECcCCCCEeeH
Confidence 34445555544443
No 102
>PHA00626 hypothetical protein
Probab=52.08 E-value=9.5 Score=26.28 Aligned_cols=13 Identities=31% Similarity=0.552 Sum_probs=7.1
Q ss_pred ccccCCCcCCcCC
Q psy366 676 RPYKCDICGHGFK 688 (734)
Q Consensus 676 ~~~~C~~C~~~f~ 688 (734)
..|.|..||..|+
T Consensus 22 nrYkCkdCGY~ft 34 (59)
T PHA00626 22 DDYVCCDCGYNDS 34 (59)
T ss_pred cceEcCCCCCeec
Confidence 3455555555554
No 103
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=51.45 E-value=5.9 Score=35.97 Aligned_cols=17 Identities=24% Similarity=0.350 Sum_probs=8.4
Q ss_pred cccccchhHhhcCHHHH
Q psy366 591 KHSCENCAKSFFSLAEL 607 (734)
Q Consensus 591 ~~~C~~C~~~f~~~~~l 607 (734)
-|.|+.|+..|+...++
T Consensus 117 ~Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 117 FFFCPNCHIRFTFDEAM 133 (178)
T ss_pred EEECCCCCcEEeHHHHh
Confidence 35555555555544443
No 104
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=50.71 E-value=9.6 Score=26.82 Aligned_cols=7 Identities=43% Similarity=1.170 Sum_probs=2.9
Q ss_pred ccccchh
Q psy366 592 HSCENCA 598 (734)
Q Consensus 592 ~~C~~C~ 598 (734)
|.||.||
T Consensus 26 F~CPnCG 32 (59)
T PRK14890 26 FLCPNCG 32 (59)
T ss_pred eeCCCCC
Confidence 3444443
No 105
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=50.64 E-value=13 Score=33.86 Aligned_cols=37 Identities=24% Similarity=0.591 Sum_probs=24.4
Q ss_pred hcCCCCccccccccccccChhHHhhhhhhhcccccccccccccccccc
Q psy366 613 VHSEQKSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLCGKKFV 660 (734)
Q Consensus 613 ~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~ 660 (734)
.-.+..-|.|+.|+..|+...++. ..|.||.||....
T Consensus 111 ~e~~~~~Y~Cp~C~~rytf~eA~~-----------~~F~Cp~Cg~~L~ 147 (178)
T PRK06266 111 EEENNMFFFCPNCHIRFTFDEAME-----------YGFRCPQCGEMLE 147 (178)
T ss_pred hccCCCEEECCCCCcEEeHHHHhh-----------cCCcCCCCCCCCe
Confidence 334445678888888777776653 3477888876554
No 106
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=50.11 E-value=9.6 Score=25.26 Aligned_cols=10 Identities=30% Similarity=1.022 Sum_probs=4.1
Q ss_pred cccccccccc
Q psy366 620 FVCELCGASF 629 (734)
Q Consensus 620 ~~C~~C~~~f 629 (734)
|.|..||..|
T Consensus 3 Y~C~~Cg~~~ 12 (44)
T smart00659 3 YICGECGREN 12 (44)
T ss_pred EECCCCCCEe
Confidence 3444444433
No 107
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=49.21 E-value=7.8 Score=27.69 Aligned_cols=42 Identities=17% Similarity=0.356 Sum_probs=21.4
Q ss_pred CCcccccc--ccccccChhHHhhhhhhhcccccccccccc----ccccccC
Q psy366 617 QKSFVCEL--CGASFKQRTCLWSHKKWNHEELIYKFECTL----CGKKFVK 661 (734)
Q Consensus 617 ~~~~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~----C~~~f~~ 661 (734)
..+..|+. |...+. +..|..|+...-.. ++..|++ |+..+..
T Consensus 7 ~~~v~C~~~cc~~~i~-r~~l~~H~~~~C~~--~~v~C~~~~~GC~~~~~~ 54 (60)
T PF02176_consen 7 FRPVPCPNGCCNEMIP-RKELDDHLENECPK--RPVPCPYSPYGCKERVPR 54 (60)
T ss_dssp TSEEE-TT--S-BEEE-CCCHHHHHHTTSTT--SEEE-SS----S--EEEH
T ss_pred CCEeeCCCCCccccee-HHHHHHHHHccCCC--CcEECCCCCCCCCCccch
Confidence 34566766 333333 55677777655554 5567777 7766643
No 108
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=49.19 E-value=6 Score=25.58 Aligned_cols=10 Identities=40% Similarity=1.288 Sum_probs=4.7
Q ss_pred cccccccccc
Q psy366 650 FECTLCGKKF 659 (734)
Q Consensus 650 ~~C~~C~~~f 659 (734)
|.|+.||..|
T Consensus 6 y~C~~Cg~~f 15 (41)
T smart00834 6 YRCEDCGHTF 15 (41)
T ss_pred EEcCCCCCEE
Confidence 4444444444
No 109
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=48.40 E-value=9.2 Score=36.77 Aligned_cols=49 Identities=20% Similarity=0.464 Sum_probs=28.8
Q ss_pred ccccchhhhccCcchHHHhhhhhcccCcccccCCcccccccChHHHHHHHHhc
Q psy366 116 FVCDLCGASFKQRSCLWSHKKWNHEELSYKFECTLCGKKFVKNYSLYEHMKRH 168 (734)
Q Consensus 116 ~~C~~C~~~f~~~~~l~~H~~~~~~~~~~~~~C~~C~~~f~~~~~L~~H~~~h 168 (734)
-.|-.|.-.|+-... |.--. ...+..|.|+.|...|...-+.-.|...|
T Consensus 363 ~~Cf~CQ~~fp~~~~---~~~~~-~~ss~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 363 THCFVCQGPFPKPPV---SPFDE-STSSGRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred ccceeccCCCCCCCC---Ccccc-cccccceechhhhhhhhhhhHHHHHHHHh
Confidence 347777766655421 11000 01123688888888888887777777666
No 110
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=47.92 E-value=6.2 Score=34.66 Aligned_cols=12 Identities=50% Similarity=1.124 Sum_probs=5.5
Q ss_pred cccccccccccC
Q psy366 650 FECTLCGKKFVK 661 (734)
Q Consensus 650 ~~C~~C~~~f~~ 661 (734)
++|+.||++|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 344444444443
No 111
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=47.25 E-value=15 Score=35.46 Aligned_cols=25 Identities=28% Similarity=0.573 Sum_probs=16.3
Q ss_pred CCcccCcchhhcCChHHHHHHHhhc
Q psy366 705 RPYLCTICSATFKTARALKQHGRTH 729 (734)
Q Consensus 705 ~~~~C~~C~~~f~~~~~l~~H~~~H 729 (734)
..|+|+.|-..|-.-.+.-.|...|
T Consensus 387 ~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 387 GRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceechhhhhhhhhhhHHHHHHHHh
Confidence 3477777777776666666665555
No 112
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=47.06 E-value=6.7 Score=27.09 Aligned_cols=13 Identities=46% Similarity=1.377 Sum_probs=6.8
Q ss_pred ccccccccccccc
Q psy366 648 YKFECTLCGKKFV 660 (734)
Q Consensus 648 ~~~~C~~C~~~f~ 660 (734)
+.|.|+.||..|.
T Consensus 4 Yey~C~~Cg~~fe 16 (52)
T TIGR02605 4 YEYRCTACGHRFE 16 (52)
T ss_pred EEEEeCCCCCEeE
Confidence 3455555555554
No 113
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=46.22 E-value=15 Score=32.17 Aligned_cols=40 Identities=23% Similarity=0.678 Sum_probs=20.9
Q ss_pred cCCCCccccccccccccchhHHHHHhhhhCCCCCCccccccccccc
Q psy366 527 QDNKINYSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGKQL 572 (734)
Q Consensus 527 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~~f 572 (734)
..+...|.|+.|+..|+....+..- . . ...|.|+.||...
T Consensus 94 e~~~~~Y~Cp~C~~~y~~~ea~~~~---d--~-~~~f~Cp~Cg~~l 133 (147)
T smart00531 94 ETNNAYYKCPNCQSKYTFLEANQLL---D--M-DGTFTCPRCGEEL 133 (147)
T ss_pred ccCCcEEECcCCCCEeeHHHHHHhc---C--C-CCcEECCCCCCEE
Confidence 3444567777777776654433220 0 1 2236677776644
No 114
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=45.99 E-value=10 Score=31.48 Aligned_cols=14 Identities=21% Similarity=0.389 Sum_probs=6.7
Q ss_pred CccccccccccccC
Q psy366 618 KSFVCELCGASFKQ 631 (734)
Q Consensus 618 ~~~~C~~C~~~f~~ 631 (734)
.|-.|+.||..|.-
T Consensus 25 ~p~vcP~cg~~~~~ 38 (129)
T TIGR02300 25 RPAVSPYTGEQFPP 38 (129)
T ss_pred CCccCCCcCCccCc
Confidence 34455555554433
No 115
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=44.72 E-value=13 Score=41.85 Aligned_cols=29 Identities=24% Similarity=0.567 Sum_probs=16.3
Q ss_pred hcCCCCcccccchhHhhcCHHHHHhHhhhcCCCCcccccccccc
Q psy366 585 VHLKIKKHSCENCAKSFFSLAELRQHSPVHSEQKSFVCELCGAS 628 (734)
Q Consensus 585 ~H~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 628 (734)
.|.......|.+||. ....|..|+.||..
T Consensus 456 ~H~~~~~L~CH~Cg~---------------~~~~p~~Cp~Cgs~ 484 (730)
T COG1198 456 LHKATGQLRCHYCGY---------------QEPIPQSCPECGSE 484 (730)
T ss_pred EecCCCeeEeCCCCC---------------CCCCCCCCCCCCCC
Confidence 444445566666663 23456667777654
No 116
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=43.31 E-value=7.3 Score=29.00 Aligned_cols=14 Identities=29% Similarity=0.574 Sum_probs=6.8
Q ss_pred ccccCC--CcCCcCCc
Q psy366 676 RPYKCD--ICGHGFKL 689 (734)
Q Consensus 676 ~~~~C~--~C~~~f~~ 689 (734)
+-+.|. .||.+|..
T Consensus 26 ~Y~qC~N~eCg~tF~t 41 (72)
T PRK09678 26 RYHQCQNVNCSATFIT 41 (72)
T ss_pred eeeecCCCCCCCEEEE
Confidence 334555 55555543
No 117
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=42.16 E-value=4 Score=44.54 Aligned_cols=76 Identities=26% Similarity=0.586 Sum_probs=40.5
Q ss_pred cccchhHhhcCHHHHHhHhhhcCCCCcc-ccccccccccChhHHhhhh-hhhcccccccc-ccccccccccCchhHHHHh
Q psy366 593 SCENCAKSFFSLAELRQHSPVHSEQKSF-VCELCGASFKQRTCLWSHK-KWNHEELIYKF-ECTLCGKKFVKNYSLYEHM 669 (734)
Q Consensus 593 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~-~C~~C~~~f~~~~~l~~H~-~~h~~~~~~~~-~C~~C~~~f~~~~~L~~H~ 669 (734)
.|+.|-+...+..+-+.+. || .|..||-.|+--.+|--=. .+.. ..| .|+.|.+.+....+-+-
T Consensus 103 ~C~~Cl~Ei~dp~~rrY~Y-------PF~~CT~CGPRfTIi~alPYDR~nTsM----~~F~lC~~C~~EY~dP~nRRf-- 169 (750)
T COG0068 103 TCEDCLEEIFDPNSRRYLY-------PFINCTNCGPRFTIIEALPYDRENTSM----ADFPLCPFCDKEYKDPLNRRF-- 169 (750)
T ss_pred hhHHHHHHhcCCCCcceec-------cccccCCCCcceeeeccCCCCcccCcc----ccCcCCHHHHHHhcCcccccc--
Confidence 5888877665543322211 22 3888888777655542211 1111 122 57777777666543322
Q ss_pred hhccCCccccCCCcCC
Q psy366 670 KRHSDVRPYKCDICGH 685 (734)
Q Consensus 670 ~~H~~~~~~~C~~C~~ 685 (734)
| .+|..|+.||=
T Consensus 170 --H--AQp~aCp~CGP 181 (750)
T COG0068 170 --H--AQPIACPKCGP 181 (750)
T ss_pred --c--cccccCcccCC
Confidence 2 35667777774
No 118
>PF12907 zf-met2: Zinc-binding
Probab=42.08 E-value=17 Score=23.45 Aligned_cols=14 Identities=29% Similarity=0.380 Sum_probs=6.5
Q ss_pred hHHHHhHhhhhcCC
Q psy366 575 AKTLANHIKGVHLK 588 (734)
Q Consensus 575 ~~~L~~H~~~~H~~ 588 (734)
...|..|....|.+
T Consensus 17 ~~~L~eH~enKHpK 30 (40)
T PF12907_consen 17 EPQLKEHAENKHPK 30 (40)
T ss_pred HHHHHHHHHccCCC
Confidence 34455555444444
No 119
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=41.37 E-value=12 Score=31.86 Aligned_cols=23 Identities=48% Similarity=0.754 Sum_probs=13.3
Q ss_pred cccCcchhhcCChHHHHHHHhhccCC
Q psy366 707 YLCTICSATFKTARALKQHGRTHIKS 732 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H~~~ 732 (734)
..|-+||+.|.. |++|+++|+|-
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gl 95 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGL 95 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S
T ss_pred eEEccCCcccch---HHHHHHHccCC
Confidence 567777777765 37777777664
No 120
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=41.06 E-value=8.8 Score=25.95 Aligned_cols=10 Identities=30% Similarity=1.016 Sum_probs=5.5
Q ss_pred cccccccccc
Q psy366 620 FVCELCGASF 629 (734)
Q Consensus 620 ~~C~~C~~~f 629 (734)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 5555555555
No 121
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=40.02 E-value=8.7 Score=33.78 Aligned_cols=12 Identities=33% Similarity=0.822 Sum_probs=5.7
Q ss_pred CCCCcccccccc
Q psy366 174 YKCDICGHGFKL 185 (734)
Q Consensus 174 ~~C~~C~~~f~~ 185 (734)
++|+.||.+|.+
T Consensus 29 ~~c~~c~~~f~~ 40 (154)
T PRK00464 29 RECLACGKRFTT 40 (154)
T ss_pred eeccccCCcceE
Confidence 444455544443
No 122
>KOG2272|consensus
Probab=39.91 E-value=22 Score=33.21 Aligned_cols=15 Identities=13% Similarity=0.578 Sum_probs=11.0
Q ss_pred CcccCcchhhcCChH
Q psy366 706 PYLCTICSATFKTAR 720 (734)
Q Consensus 706 ~~~C~~C~~~f~~~~ 720 (734)
-|.|..|++...++.
T Consensus 280 cf~Cs~Cdkkl~~K~ 294 (332)
T KOG2272|consen 280 CFSCSTCDKKLTQKN 294 (332)
T ss_pred ccccccccccccccc
Confidence 378888888776654
No 123
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=39.06 E-value=13 Score=34.98 Aligned_cols=25 Identities=20% Similarity=0.384 Sum_probs=13.8
Q ss_pred cccccccccccccCchhHHHHhhhc
Q psy366 648 YKFECTLCGKKFVKNYSLYEHMKRH 672 (734)
Q Consensus 648 ~~~~C~~C~~~f~~~~~L~~H~~~H 672 (734)
+++.||.||........|..-.|+|
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR~h 232 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTRSH 232 (314)
T ss_pred CCCCCCCCCCcccccccceeeeecc
Confidence 4566666666655555555444444
No 124
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=37.97 E-value=19 Score=24.42 Aligned_cols=38 Identities=18% Similarity=0.458 Sum_probs=18.9
Q ss_pred ccccCCCcCCcCCcchHHHhhhhhccCCCCCcccCcch
Q psy366 676 RPYKCDICGHGFKLNYDMLRHKQDVHSTTRPYLCTICS 713 (734)
Q Consensus 676 ~~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~ 713 (734)
+++.|..||..|.....=+.-....--...|-.|+-|-
T Consensus 3 k~l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~CR 40 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSCR 40 (49)
T ss_pred eeEEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHHH
Confidence 45566666666655555444443211112355565553
No 125
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=37.52 E-value=22 Score=40.07 Aligned_cols=12 Identities=33% Similarity=0.750 Sum_probs=7.4
Q ss_pred CccccCCCcCCc
Q psy366 675 VRPYKCDICGHG 686 (734)
Q Consensus 675 ~~~~~C~~C~~~ 686 (734)
..|..|+.||..
T Consensus 473 ~~p~~Cp~Cgs~ 484 (730)
T COG1198 473 PIPQSCPECGSE 484 (730)
T ss_pred CCCCCCCCCCCC
Confidence 456667777643
No 126
>KOG2807|consensus
Probab=36.14 E-value=65 Score=31.72 Aligned_cols=24 Identities=21% Similarity=0.642 Sum_probs=13.1
Q ss_pred ccccccccccccCchhHHHHhhhc
Q psy366 649 KFECTLCGKKFVKNYSLYEHMKRH 672 (734)
Q Consensus 649 ~~~C~~C~~~f~~~~~L~~H~~~H 672 (734)
.|.|..|...|-..-+...|...|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred cEEchhccceeeccchHHHHhhhh
Confidence 355666665555555555554444
No 127
>KOG1280|consensus
Probab=35.83 E-value=26 Score=34.61 Aligned_cols=38 Identities=29% Similarity=0.481 Sum_probs=22.4
Q ss_pred CccccccccccccChhHHhhhhhhhccccccccccccc
Q psy366 618 KSFVCELCGASFKQRTCLWSHKKWNHEELIYKFECTLC 655 (734)
Q Consensus 618 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C 655 (734)
..|.|++|+..=.+...|..|+...|.+....-.|++|
T Consensus 78 qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c 115 (381)
T KOG1280|consen 78 QSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLC 115 (381)
T ss_pred ccccCCcccccccchhHHHHHhhhcCcccCcceeeecc
Confidence 35667777666666666666666666554333334444
No 128
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=34.89 E-value=22 Score=27.28 Aligned_cols=12 Identities=42% Similarity=0.999 Sum_probs=7.4
Q ss_pred ccccCCCcCCcC
Q psy366 676 RPYKCDICGHGF 687 (734)
Q Consensus 676 ~~~~C~~C~~~f 687 (734)
.-+.|..||..|
T Consensus 52 GIW~C~kCg~~f 63 (89)
T COG1997 52 GIWKCRKCGAKF 63 (89)
T ss_pred CeEEcCCCCCee
Confidence 346666666666
No 129
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=34.37 E-value=8.3 Score=31.91 Aligned_cols=32 Identities=19% Similarity=0.280 Sum_probs=19.9
Q ss_pred cccCCCccccccccCCCcccCcchhhhcCChH
Q psy366 232 CDQCHSTEEAGCKCNRQLLQCSLCSETFKSKP 263 (734)
Q Consensus 232 ~~~c~~~~~~~~~~~~~~~~C~~C~~~f~~~~ 263 (734)
|+.|++...+....+...+.|..|+.......
T Consensus 5 Cp~Cgsll~p~~~~~~~~l~C~kCgye~~~~~ 36 (113)
T COG1594 5 CPKCGSLLYPKKDDEGGKLVCRKCGYEEEASN 36 (113)
T ss_pred cCCccCeeEEeEcCCCcEEECCCCCcchhccc
Confidence 45555555444444445788988888776653
No 130
>PRK04023 DNA polymerase II large subunit; Validated
Probab=33.70 E-value=34 Score=39.44 Aligned_cols=11 Identities=27% Similarity=0.896 Sum_probs=6.6
Q ss_pred CcccccchhHh
Q psy366 590 KKHSCENCAKS 600 (734)
Q Consensus 590 ~~~~C~~C~~~ 600 (734)
....|+.||..
T Consensus 625 g~RfCpsCG~~ 635 (1121)
T PRK04023 625 GRRKCPSCGKE 635 (1121)
T ss_pred cCccCCCCCCc
Confidence 34567777654
No 131
>KOG4167|consensus
Probab=32.53 E-value=13 Score=40.51 Aligned_cols=25 Identities=24% Similarity=0.491 Sum_probs=23.5
Q ss_pred CcccCcchhhcCChHHHHHHHhhcc
Q psy366 706 PYLCTICSATFKTARALKQHGRTHI 730 (734)
Q Consensus 706 ~~~C~~C~~~f~~~~~l~~H~~~H~ 730 (734)
-|.|..||+.|-.-.+++.||++|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 4999999999999999999999996
No 132
>KOG2593|consensus
Probab=30.88 E-value=37 Score=34.96 Aligned_cols=14 Identities=29% Similarity=0.906 Sum_probs=6.4
Q ss_pred CCccccchhhhccC
Q psy366 114 KGFVCDLCGASFKQ 127 (734)
Q Consensus 114 ~~~~C~~C~~~f~~ 127 (734)
..|.|+.|++.|..
T Consensus 127 ~~Y~Cp~C~kkyt~ 140 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTS 140 (436)
T ss_pred ccccCCccccchhh
Confidence 34444444444444
No 133
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=30.11 E-value=25 Score=25.99 Aligned_cols=26 Identities=38% Similarity=0.767 Sum_probs=11.0
Q ss_pred cccccccccccCchhHHHHhhhccCCccccCCCcCCcC
Q psy366 650 FECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGF 687 (734)
Q Consensus 650 ~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f 687 (734)
-.|..|++.|.- -.+.+.|..||..|
T Consensus 10 ~~C~~C~~~F~~------------~~rrhhCr~CG~~v 35 (69)
T PF01363_consen 10 SNCMICGKKFSL------------FRRRHHCRNCGRVV 35 (69)
T ss_dssp SB-TTT--B-BS------------SS-EEE-TTT--EE
T ss_pred CcCcCcCCcCCC------------ceeeEccCCCCCEE
Confidence 367888888831 24556677777766
No 134
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=29.90 E-value=24 Score=22.22 Aligned_cols=8 Identities=38% Similarity=1.385 Sum_probs=3.7
Q ss_pred cccccccc
Q psy366 652 CTLCGKKF 659 (734)
Q Consensus 652 C~~C~~~f 659 (734)
|+.||..|
T Consensus 4 C~~Cg~~Y 11 (36)
T PF05191_consen 4 CPKCGRIY 11 (36)
T ss_dssp ETTTTEEE
T ss_pred cCCCCCcc
Confidence 44444444
No 135
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=29.65 E-value=23 Score=24.31 Aligned_cols=13 Identities=15% Similarity=0.769 Sum_probs=8.8
Q ss_pred ccCcchhhhcCCh
Q psy366 250 LQCSLCSETFKSK 262 (734)
Q Consensus 250 ~~C~~C~~~f~~~ 262 (734)
|+|..|+..+...
T Consensus 2 y~C~~CgyiYd~~ 14 (50)
T cd00730 2 YECRICGYIYDPA 14 (50)
T ss_pred cCCCCCCeEECCC
Confidence 6677777777643
No 136
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=29.45 E-value=16 Score=34.19 Aligned_cols=30 Identities=27% Similarity=0.495 Sum_probs=17.4
Q ss_pred CCccccCCCcCCcCCcchHHHhhhhhccCC
Q psy366 674 DVRPYKCDICGHGFKLNYDMLRHKQDVHST 703 (734)
Q Consensus 674 ~~~~~~C~~C~~~f~~~~~L~~H~~~~H~~ 703 (734)
++..|.|.+|+|.|....-+..|+...|..
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH~e 103 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKHPE 103 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-HH
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcCHH
Confidence 344577777777777777777776666653
No 137
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=29.00 E-value=23 Score=19.97 Aligned_cols=10 Identities=40% Similarity=0.939 Sum_probs=5.6
Q ss_pred cccccccccc
Q psy366 648 YKFECTLCGK 657 (734)
Q Consensus 648 ~~~~C~~C~~ 657 (734)
.+|.||.||+
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 3566666653
No 138
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=28.97 E-value=28 Score=29.65 Aligned_cols=23 Identities=39% Similarity=0.845 Sum_probs=10.6
Q ss_pred cccccccccccCchhHHHHhhhccCC
Q psy366 650 FECTLCGKKFVKNYSLYEHMKRHSDV 675 (734)
Q Consensus 650 ~~C~~C~~~f~~~~~L~~H~~~H~~~ 675 (734)
..|-.||+.|. .|.+|+++|.|.
T Consensus 73 i~clecGk~~k---~LkrHL~~~~gl 95 (132)
T PF05443_consen 73 IICLECGKKFK---TLKRHLRTHHGL 95 (132)
T ss_dssp EE-TBT--EES---BHHHHHHHTT-S
T ss_pred eEEccCCcccc---hHHHHHHHccCC
Confidence 45666666554 345666666554
No 139
>PRK04023 DNA polymerase II large subunit; Validated
Probab=28.76 E-value=35 Score=39.35 Aligned_cols=9 Identities=44% Similarity=1.195 Sum_probs=5.0
Q ss_pred ccccccccc
Q psy366 563 YTCDQCGKQ 571 (734)
Q Consensus 563 ~~C~~C~~~ 571 (734)
..|+.||..
T Consensus 627 RfCpsCG~~ 635 (1121)
T PRK04023 627 RKCPSCGKE 635 (1121)
T ss_pred ccCCCCCCc
Confidence 456666554
No 140
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=27.89 E-value=20 Score=21.03 Aligned_cols=19 Identities=32% Similarity=0.608 Sum_probs=9.4
Q ss_pred cccccccccccChhHHhhhh
Q psy366 620 FVCELCGASFKQRTCLWSHK 639 (734)
Q Consensus 620 ~~C~~C~~~f~~~~~l~~H~ 639 (734)
|.|-.|++.| .....+.|.
T Consensus 1 ~sCiDC~~~F-~~~~y~~Ht 19 (28)
T PF08790_consen 1 FSCIDCSKDF-DGDSYKSHT 19 (28)
T ss_dssp EEETTTTEEE-EGGGTTT--
T ss_pred CeeecCCCCc-CcCCcCCCC
Confidence 4566666666 334444443
No 141
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=27.71 E-value=22 Score=23.94 Aligned_cols=12 Identities=25% Similarity=0.772 Sum_probs=6.7
Q ss_pred cccccccccccc
Q psy366 533 YSCEQCKVQFSC 544 (734)
Q Consensus 533 ~~C~~C~~~f~~ 544 (734)
|.|..|++.+..
T Consensus 2 y~C~~CgyvYd~ 13 (47)
T PF00301_consen 2 YQCPVCGYVYDP 13 (47)
T ss_dssp EEETTTSBEEET
T ss_pred cCCCCCCEEEcC
Confidence 456666655543
No 142
>PF14353 CpXC: CpXC protein
Probab=27.71 E-value=15 Score=31.39 Aligned_cols=20 Identities=20% Similarity=0.617 Sum_probs=11.7
Q ss_pred ccccccccccccchhHHHHH
Q psy366 532 NYSCEQCKVQFSCKSDMRKH 551 (734)
Q Consensus 532 ~~~C~~C~~~f~~~~~l~~H 551 (734)
.|.|+.||..|.-...+.-|
T Consensus 38 ~~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 38 SFTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EEECCCCCCceecCCCEEEE
Confidence 45666676666655544444
No 143
>KOG2807|consensus
Probab=27.30 E-value=71 Score=31.46 Aligned_cols=35 Identities=14% Similarity=0.336 Sum_probs=23.1
Q ss_pred CcccccchhHhhcCHHHHHhHhhhcCCCCccccccccccccChhHHhhh
Q psy366 590 KKHSCENCAKSFFSLAELRQHSPVHSEQKSFVCELCGASFKQRTCLWSH 638 (734)
Q Consensus 590 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 638 (734)
.-|.||.|.... -.-|-.|++|+.......+|.+-
T Consensus 275 ~Gy~CP~Ckakv--------------CsLP~eCpiC~ltLVss~hLARS 309 (378)
T KOG2807|consen 275 GGYFCPQCKAKV--------------CSLPIECPICSLTLVSSPHLARS 309 (378)
T ss_pred CceeCCcccCee--------------ecCCccCCccceeEecchHHHHH
Confidence 357888885432 23677788888777777666543
No 144
>KOG4167|consensus
Probab=27.17 E-value=17 Score=39.76 Aligned_cols=25 Identities=48% Similarity=0.741 Sum_probs=23.6
Q ss_pred cccCCcccccccChHHHHHHHHhcC
Q psy366 145 KFECTLCGKKFVKNYSLYEHMKRHS 169 (734)
Q Consensus 145 ~~~C~~C~~~f~~~~~L~~H~~~h~ 169 (734)
.|.|..|++.|....++..||++|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 5899999999999999999999995
No 145
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=27.13 E-value=51 Score=21.46 Aligned_cols=23 Identities=22% Similarity=0.492 Sum_probs=13.2
Q ss_pred cccCcchhhcCC--hHHHHHHHhhc
Q psy366 707 YLCTICSATFKT--ARALKQHGRTH 729 (734)
Q Consensus 707 ~~C~~C~~~f~~--~~~l~~H~~~H 729 (734)
-.|+.||..|.. ..+-+.|.+-|
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH 38 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYH 38 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHH
Confidence 356666666654 44555565555
No 146
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=26.94 E-value=32 Score=28.04 Aligned_cols=13 Identities=23% Similarity=0.629 Sum_probs=8.5
Q ss_pred ccccCcchhhhcC
Q psy366 458 PYQCTLCSASFKR 470 (734)
Q Consensus 458 ~~~C~~C~~~f~~ 470 (734)
.+.|+.|+..+..
T Consensus 16 ~~~C~~C~~~~~~ 28 (104)
T TIGR01384 16 VYVCPSCGYEKEK 28 (104)
T ss_pred eEECcCCCCcccc
Confidence 4677777766543
No 147
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=26.43 E-value=19 Score=23.27 Aligned_cols=8 Identities=63% Similarity=1.647 Sum_probs=3.2
Q ss_pred ccCCCcCC
Q psy366 678 YKCDICGH 685 (734)
Q Consensus 678 ~~C~~C~~ 685 (734)
|.|..|+.
T Consensus 29 y~C~~C~~ 36 (40)
T smart00440 29 YVCTKCGH 36 (40)
T ss_pred EEeCCCCC
Confidence 33444443
No 148
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=26.11 E-value=43 Score=25.93 Aligned_cols=28 Identities=25% Similarity=0.611 Sum_probs=17.2
Q ss_pred cccccccccccccCchhHHHHhhhccCCccccCCCcCC
Q psy366 648 YKFECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGH 685 (734)
Q Consensus 648 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 685 (734)
.|-.|..||+.|.+.. -.+|-+|+.|..
T Consensus 57 ~Pa~CkkCGfef~~~~----------ik~pSRCP~CKS 84 (97)
T COG3357 57 RPARCKKCGFEFRDDK----------IKKPSRCPKCKS 84 (97)
T ss_pred cChhhcccCccccccc----------cCCcccCCcchh
Confidence 5667777777776521 134667777753
No 149
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=26.09 E-value=27 Score=20.93 Aligned_cols=10 Identities=40% Similarity=1.301 Sum_probs=1.7
Q ss_pred cCCCcCCcCC
Q psy366 679 KCDICGHGFK 688 (734)
Q Consensus 679 ~C~~C~~~f~ 688 (734)
+|+.|+..++
T Consensus 4 ~Cp~C~se~~ 13 (30)
T PF08274_consen 4 KCPLCGSEYT 13 (30)
T ss_dssp --TTT-----
T ss_pred CCCCCCCcce
Confidence 3555555554
No 150
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.37 E-value=34 Score=27.35 Aligned_cols=12 Identities=25% Similarity=0.277 Sum_probs=5.3
Q ss_pred Cccccccccccc
Q psy366 86 KHHWCDICGKSF 97 (734)
Q Consensus 86 ~~~~C~~C~~~f 97 (734)
.|..||.||++|
T Consensus 25 dPiVsPytG~s~ 36 (129)
T COG4530 25 DPIVSPYTGKSY 36 (129)
T ss_pred CccccCcccccc
Confidence 344444444444
No 151
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=24.90 E-value=43 Score=21.90 Aligned_cols=8 Identities=38% Similarity=1.012 Sum_probs=3.9
Q ss_pred cccccccc
Q psy366 287 SQCPVCKL 294 (734)
Q Consensus 287 ~~C~~C~~ 294 (734)
..|+.|+.
T Consensus 27 ~~CP~Cg~ 34 (42)
T PF09723_consen 27 VPCPECGS 34 (42)
T ss_pred CcCCCCCC
Confidence 34555554
No 152
>KOG1842|consensus
Probab=24.74 E-value=35 Score=35.09 Aligned_cols=27 Identities=26% Similarity=0.502 Sum_probs=15.5
Q ss_pred ccccccccccccChhHHhhhhhhhccc
Q psy366 619 SFVCELCGASFKQRTCLWSHKKWNHEE 645 (734)
Q Consensus 619 ~~~C~~C~~~f~~~~~l~~H~~~h~~~ 645 (734)
.|.|++|...|.+-..|..|+...|++
T Consensus 15 gflCPiC~~dl~~~~~L~~H~d~eH~~ 41 (505)
T KOG1842|consen 15 GFLCPICLLDLPNLSALNDHLDVEHFE 41 (505)
T ss_pred cccCchHhhhhhhHHHHHHHHhhhccc
Confidence 355666666666666666665555443
No 153
>KOG4124|consensus
Probab=23.98 E-value=17 Score=35.70 Aligned_cols=23 Identities=22% Similarity=0.537 Sum_probs=15.8
Q ss_pred ccccccc--ccccccCchhHHHHhh
Q psy366 648 YKFECTL--CGKKFVKNYSLYEHMK 670 (734)
Q Consensus 648 ~~~~C~~--C~~~f~~~~~L~~H~~ 670 (734)
++|.|++ |++.+.....|..|-.
T Consensus 348 ~~~~~~vp~~~~~~~n~ng~~~~~~ 372 (442)
T KOG4124|consen 348 KPYKCPVPNCDKAYKNQNGLKYHKL 372 (442)
T ss_pred CCCCCCCCcchhhcccCcceeeccc
Confidence 5677754 7777777777776643
No 154
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=23.70 E-value=42 Score=28.20 Aligned_cols=23 Identities=35% Similarity=0.472 Sum_probs=14.5
Q ss_pred cccCcchhhcCChHHHHHHHhhccCC
Q psy366 707 YLCTICSATFKTARALKQHGRTHIKS 732 (734)
Q Consensus 707 ~~C~~C~~~f~~~~~l~~H~~~H~~~ 732 (734)
..|-.+|+.|. +|++|+.+|.|-
T Consensus 77 IicLEDGkkfK---SLKRHL~t~~gm 99 (148)
T COG4957 77 IICLEDGKKFK---SLKRHLTTHYGL 99 (148)
T ss_pred EEEeccCcchH---HHHHHHhcccCC
Confidence 35666777664 366777776653
No 155
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=23.45 E-value=57 Score=27.47 Aligned_cols=58 Identities=28% Similarity=0.548 Sum_probs=32.6
Q ss_pred CCCCCccccccccccCChhhHHHHHhhhcCCCcccCcccccccC--ChHHHHHHHHhhcCCCCccccCcccccccCCch
Q psy366 339 LKIKDHSCDVCDKSFVNRAGLRLHKLIHSEERGFVCDLCGASFK--QRPALWAHKKLHDAKLEYKFKCILCDRKFHRNS 415 (734)
Q Consensus 339 ~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~--~~~~l~~H~~~h~~~~~~~~~C~~C~~~f~~~~ 415 (734)
...+.|+|.+|..+.....-| +|-.| ||+..- --.+|+.|-.++. .|++|..+|.+..
T Consensus 76 ~d~~lYeCnIC~etS~ee~FL----------KPneC--CgY~iCn~Cya~LWK~~~~yp-------vCPvCkTSFKss~ 135 (140)
T PF05290_consen 76 LDPKLYECNICKETSAEERFL----------KPNEC--CGYSICNACYANLWKFCNLYP-------VCPVCKTSFKSSS 135 (140)
T ss_pred cCCCceeccCcccccchhhcC----------Ccccc--cchHHHHHHHHHHHHHcccCC-------CCCcccccccccc
Confidence 345778888888766543322 33333 222211 1346777665443 4888888887654
No 156
>KOG2071|consensus
Probab=23.10 E-value=63 Score=34.88 Aligned_cols=42 Identities=29% Similarity=0.539 Sum_probs=0.0
Q ss_pred CCCCccccccccccCChhhHHHHHhhh-----------------------------------------------------
Q psy366 340 KIKDHSCDVCDKSFVNRAGLRLHKLIH----------------------------------------------------- 366 (734)
Q Consensus 340 ~~~~~~C~~C~~~f~~~~~l~~H~~~h----------------------------------------------------- 366 (734)
...+..|..||.+|.+......||..|
T Consensus 415 ~~~pnqC~~CG~R~~~~ee~sk~md~H~dwh~r~n~~~r~s~~~~~sR~Wf~s~s~W~~~K~~e~t~e~~~~~~~~~~~~ 494 (579)
T KOG2071|consen 415 KDSPNQCKSCGLRFDDSEERSKHMDIHDDWHRRKNTTIRGSSKWQKSRSWFPSKSGWLAAKAGEETDEKEKVEHEELQIK 494 (579)
T ss_pred cCCcchhcccccccccchhhhhHhhhhhhhhhhhhhhcccccccccccceeechhhhccccccccccccccccchhhcch
Q ss_pred --------------cCCCcccCccccccc
Q psy366 367 --------------SEERGFVCDLCGASF 381 (734)
Q Consensus 367 --------------~~~~~~~C~~C~~~f 381 (734)
++++...|++|+..|
T Consensus 495 ~~~s~~~k~~~Vp~d~e~~~~C~IC~EkF 523 (579)
T KOG2071|consen 495 KELSLRSKYELVPADSERQASCPICQEKF 523 (579)
T ss_pred hhhhhhccceecccCcccccCCccccccc
No 157
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=22.56 E-value=38 Score=31.70 Aligned_cols=29 Identities=28% Similarity=0.509 Sum_probs=22.6
Q ss_pred cccccccccccccCchhHHHHhhhccCCc
Q psy366 648 YKFECTLCGKKFVKNYSLYEHMKRHSDVR 676 (734)
Q Consensus 648 ~~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 676 (734)
..|.|+.|+|.|.-..-+..|+..-+.+.
T Consensus 76 ~K~~C~lc~KlFkg~eFV~KHI~nKH~e~ 104 (214)
T PF04959_consen 76 DKWRCPLCGKLFKGPEFVRKHIFNKHPEK 104 (214)
T ss_dssp EEEEE-SSS-EESSHHHHHHHHHHH-HHH
T ss_pred CEECCCCCCcccCChHHHHHHHhhcCHHH
Confidence 57999999999999999999998777754
No 158
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=22.44 E-value=29 Score=27.46 Aligned_cols=14 Identities=36% Similarity=0.819 Sum_probs=7.3
Q ss_pred cccccccccccccC
Q psy366 648 YKFECTLCGKKFVK 661 (734)
Q Consensus 648 ~~~~C~~C~~~f~~ 661 (734)
+.|.|+.||..-.+
T Consensus 21 k~FtCp~Cghe~vs 34 (104)
T COG4888 21 KTFTCPRCGHEKVS 34 (104)
T ss_pred ceEecCccCCeeee
Confidence 44666666544433
No 159
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=22.40 E-value=36 Score=26.60 Aligned_cols=12 Identities=17% Similarity=0.576 Sum_probs=7.4
Q ss_pred ccccCCCcCCcC
Q psy366 676 RPYKCDICGHGF 687 (734)
Q Consensus 676 ~~~~C~~C~~~f 687 (734)
-.+.|.-|++.|
T Consensus 53 GIW~C~~C~~~~ 64 (90)
T PTZ00255 53 GIWRCKGCKKTV 64 (90)
T ss_pred EEEEcCCCCCEE
Confidence 456666666665
No 160
>KOG3408|consensus
Probab=22.29 E-value=59 Score=26.81 Aligned_cols=26 Identities=35% Similarity=0.585 Sum_probs=17.9
Q ss_pred CCCCcccCcchhhcCChHHHHHHHhh
Q psy366 703 TTRPYLCTICSATFKTARALKQHGRT 728 (734)
Q Consensus 703 ~~~~~~C~~C~~~f~~~~~l~~H~~~ 728 (734)
|...|-|-.|.+-|.....|..|.++
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFKT 79 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHhc
Confidence 33347777777777777777777665
No 161
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=21.78 E-value=52 Score=31.13 Aligned_cols=23 Identities=26% Similarity=0.544 Sum_probs=11.9
Q ss_pred CccccccccccccChHHHHhHHh
Q psy366 428 RPYKCDICEQHFKFNYDIQMHKR 450 (734)
Q Consensus 428 ~~~~C~~C~~~f~~~~~l~~H~~ 450 (734)
+++.|+.|+.......+|..-.+
T Consensus 208 k~~PCPKCg~et~eTkdLSmStR 230 (314)
T PF06524_consen 208 KPIPCPKCGYETQETKDLSMSTR 230 (314)
T ss_pred CCCCCCCCCCcccccccceeeee
Confidence 45556666555555444444433
No 162
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=21.66 E-value=58 Score=22.79 Aligned_cols=26 Identities=38% Similarity=0.760 Sum_probs=17.5
Q ss_pred ccccccccccCchhHHHHhhhccCCccccCCCcCCcCC
Q psy366 651 ECTLCGKKFVKNYSLYEHMKRHSDVRPYKCDICGHGFK 688 (734)
Q Consensus 651 ~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~f~ 688 (734)
.|..|++.|.. ..+.+.|..||+.|-
T Consensus 4 ~C~~C~~~F~~------------~~rk~~Cr~Cg~~~C 29 (57)
T cd00065 4 SCMGCGKPFTL------------TRRRHHCRNCGRIFC 29 (57)
T ss_pred cCcccCccccC------------CccccccCcCcCCcC
Confidence 47778877763 245567777877764
No 163
>KOG2593|consensus
Probab=21.48 E-value=55 Score=33.75 Aligned_cols=42 Identities=19% Similarity=0.592 Sum_probs=25.7
Q ss_pred ccccCCCCccccccccccccchhHHHHHhhhhCCCCCCccccccccc
Q psy366 524 LNTQDNKINYSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCGK 570 (734)
Q Consensus 524 ~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~~ 570 (734)
++.-++...|.|+.|.+.|+....+..=. .....|.|..|+-
T Consensus 120 ~~d~t~~~~Y~Cp~C~kkyt~Lea~~L~~-----~~~~~F~C~~C~g 161 (436)
T KOG2593|consen 120 LRDDTNVAGYVCPNCQKKYTSLEALQLLD-----NETGEFHCENCGG 161 (436)
T ss_pred hhhccccccccCCccccchhhhHHHHhhc-----ccCceEEEecCCC
Confidence 33445556688888888887766554311 2234677887764
No 164
>KOG4377|consensus
Probab=21.26 E-value=45 Score=33.89 Aligned_cols=19 Identities=16% Similarity=0.431 Sum_probs=13.0
Q ss_pred ccccccCchhHHHHhhhcc
Q psy366 655 CGKKFVKNYSLYEHMKRHS 673 (734)
Q Consensus 655 C~~~f~~~~~L~~H~~~H~ 673 (734)
|+.++.+.+.+..|-|.|.
T Consensus 409 c~~tl~s~sqm~shkrkhe 427 (480)
T KOG4377|consen 409 CEATLYSVSQMASHKRKHE 427 (480)
T ss_pred CceEEEehhhhhhhhhhhh
Confidence 6677777777777766663
No 165
>KOG1280|consensus
Probab=21.07 E-value=80 Score=31.46 Aligned_cols=40 Identities=18% Similarity=0.454 Sum_probs=26.3
Q ss_pred CCccccccccccccchhHHHHHhhhhCCCCCCcccccccc
Q psy366 530 KINYSCEQCKVQFSCKSDMRKHAKTHLPAIGRSYTCDQCG 569 (734)
Q Consensus 530 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~~~C~~C~ 569 (734)
...|.|++|+.+-.+...|..|...-++.....-.|++|+
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~~icp~c~ 116 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTSVICPLCA 116 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcceeeeccc
Confidence 3467888888887778888888655444433445566664
No 166
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=20.46 E-value=44 Score=20.58 Aligned_cols=24 Identities=25% Similarity=0.596 Sum_probs=18.3
Q ss_pred hhhcccchhhccCcHHHHHHHHhhc
Q psy366 7 RLICSLCSETFKSKPRLRAHIWKCQ 31 (734)
Q Consensus 7 ~~~C~~C~~~f~~~~~l~~H~~~~~ 31 (734)
-+.|+.|++... ...+..|+.+|-
T Consensus 4 ~~~C~nC~R~v~-a~RfA~HLekCm 27 (33)
T PF08209_consen 4 YVECPNCGRPVA-ASRFAPHLEKCM 27 (33)
T ss_dssp EEE-TTTSSEEE-GGGHHHHHHHHT
T ss_pred eEECCCCcCCcc-hhhhHHHHHHHH
Confidence 478999999875 678999987663
Done!