Query psy7449
Match_columns 596
No_of_seqs 505 out of 3164
Neff 10.4
Searched_HMMs 46136
Date Fri Aug 16 18:35:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7449.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7449hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1074|consensus 100.0 1.6E-31 3.4E-36 267.9 8.1 112 59-170 169-408 (958)
2 KOG1074|consensus 100.0 5.2E-31 1.1E-35 264.2 7.6 236 348-584 605-936 (958)
3 KOG2462|consensus 99.9 1.4E-27 3.1E-32 211.7 5.8 134 375-577 129-265 (279)
4 KOG3608|consensus 99.9 3.1E-26 6.7E-31 207.6 7.0 193 229-432 181-379 (467)
5 KOG3608|consensus 99.9 3.2E-26 6.9E-31 207.5 6.0 163 228-427 135-314 (467)
6 KOG2462|consensus 99.9 6.3E-26 1.4E-30 201.3 6.9 142 56-199 127-271 (279)
7 KOG3623|consensus 99.9 9E-24 2E-28 208.6 9.1 83 495-577 890-972 (1007)
8 KOG3623|consensus 99.9 3.1E-23 6.6E-28 204.9 5.6 108 58-165 209-331 (1007)
9 KOG3576|consensus 99.7 5E-18 1.1E-22 142.5 0.5 117 460-583 113-240 (267)
10 KOG3576|consensus 99.6 3.8E-17 8.2E-22 137.3 1.8 116 56-171 114-240 (267)
11 PHA00733 hypothetical protein 99.2 1.6E-11 3.5E-16 101.1 3.0 93 487-581 28-125 (128)
12 PLN03086 PRLI-interacting fact 99.1 1.1E-10 2.4E-15 118.4 9.2 147 348-580 407-565 (567)
13 PLN03086 PRLI-interacting fact 99.0 8E-10 1.7E-14 112.3 6.1 93 86-184 452-554 (567)
14 PHA00733 hypothetical protein 98.9 9.6E-10 2.1E-14 90.6 4.7 53 113-167 71-123 (128)
15 PHA02768 hypothetical protein; 98.9 6.6E-10 1.4E-14 74.1 1.3 42 500-543 6-47 (55)
16 PHA02768 hypothetical protein; 98.8 1.4E-09 3E-14 72.6 1.8 44 527-572 5-48 (55)
17 KOG1146|consensus 98.8 3.4E-09 7.4E-14 114.3 4.4 77 499-580 1260-1353(1406)
18 KOG3993|consensus 98.7 1.3E-08 2.8E-13 96.3 3.9 84 348-432 267-383 (500)
19 KOG3993|consensus 98.7 8E-09 1.7E-13 97.7 1.7 171 376-580 267-483 (500)
20 PF13465 zf-H2C2_2: Zinc-finge 98.6 1.4E-08 3E-13 57.9 0.8 24 543-566 2-25 (26)
21 PF13465 zf-H2C2_2: Zinc-finge 98.5 2.6E-08 5.7E-13 56.7 1.1 26 514-539 1-26 (26)
22 PHA00616 hypothetical protein 98.4 6.6E-08 1.4E-12 61.2 0.6 33 527-559 1-33 (44)
23 PHA00732 hypothetical protein 98.2 7.5E-07 1.6E-11 66.1 2.7 46 499-550 1-47 (79)
24 PHA00616 hypothetical protein 98.1 1.2E-06 2.5E-11 55.6 1.8 34 255-288 1-34 (44)
25 PHA00732 hypothetical protein 98.1 1.6E-06 3.5E-11 64.3 2.8 49 527-581 1-50 (79)
26 PF05605 zf-Di19: Drought indu 98.0 4.9E-06 1.1E-10 57.4 3.0 50 377-429 3-53 (54)
27 PF05605 zf-Di19: Drought indu 97.9 7.7E-06 1.7E-10 56.4 3.1 50 528-580 3-54 (54)
28 PF00096 zf-C2H2: Zinc finger, 97.6 1.7E-05 3.7E-10 43.9 0.3 18 558-575 3-20 (23)
29 PF00096 zf-C2H2: Zinc finger, 97.5 6.3E-05 1.4E-09 41.5 1.6 23 256-278 1-23 (23)
30 PF13894 zf-C2H2_4: C2H2-type 97.4 6.3E-05 1.4E-09 42.0 1.4 24 556-579 1-24 (24)
31 PF12756 zf-C2H2_2: C2H2 type 97.4 7.7E-05 1.7E-09 59.6 2.0 74 501-579 1-74 (100)
32 PF13912 zf-C2H2_6: C2H2-type 97.3 0.00013 2.8E-09 42.1 1.3 26 555-580 1-26 (27)
33 COG5189 SFP1 Putative transcri 97.3 9.5E-05 2.1E-09 67.8 1.1 55 373-427 346-420 (423)
34 COG5189 SFP1 Putative transcri 97.2 0.00012 2.5E-09 67.3 1.2 54 526-579 348-422 (423)
35 PF13894 zf-C2H2_4: C2H2-type 97.1 0.00025 5.4E-09 39.5 1.7 24 406-429 1-24 (24)
36 PF13912 zf-C2H2_6: C2H2-type 97.1 0.00018 3.8E-09 41.5 0.7 24 527-550 1-24 (27)
37 PF12756 zf-C2H2_2: C2H2 type 97.1 0.00033 7.2E-09 55.9 2.3 24 143-166 50-73 (100)
38 PF09237 GAGA: GAGA factor; I 96.5 0.0015 3.3E-08 42.4 1.5 30 525-554 22-51 (54)
39 KOG2231|consensus 96.5 0.002 4.2E-08 67.3 3.1 22 350-371 184-205 (669)
40 smart00355 ZnF_C2H2 zinc finge 96.3 0.002 4.3E-08 36.5 1.4 18 558-575 3-20 (26)
41 PF13909 zf-H2C2_5: C2H2-type 96.3 0.0015 3.4E-08 36.3 0.7 24 556-580 1-24 (24)
42 PRK04860 hypothetical protein; 96.2 0.0018 3.9E-08 55.6 0.9 39 498-540 118-156 (160)
43 PF09237 GAGA: GAGA factor; I 96.1 0.0055 1.2E-07 39.9 2.5 39 393-431 12-50 (54)
44 KOG2231|consensus 96.0 0.0068 1.5E-07 63.4 4.1 147 377-576 100-260 (669)
45 smart00355 ZnF_C2H2 zinc finge 95.9 0.0086 1.9E-07 33.7 2.6 23 256-278 1-23 (26)
46 PRK04860 hypothetical protein; 95.8 0.0045 9.8E-08 53.2 1.7 40 527-570 119-158 (160)
47 COG5236 Uncharacterized conser 95.8 0.007 1.5E-07 56.6 2.8 133 256-432 152-308 (493)
48 PF12874 zf-met: Zinc-finger o 95.6 0.0053 1.1E-07 34.5 0.8 22 556-577 1-22 (25)
49 PF13909 zf-H2C2_5: C2H2-type 95.6 0.0044 9.5E-08 34.4 0.5 22 528-550 1-22 (24)
50 KOG1146|consensus 95.5 0.006 1.3E-07 67.6 1.8 53 378-430 438-490 (1406)
51 PF12874 zf-met: Zinc-finger o 95.5 0.0047 1E-07 34.7 0.5 22 500-521 1-22 (25)
52 COG5236 Uncharacterized conser 95.3 0.015 3.2E-07 54.6 3.1 73 501-581 222-307 (493)
53 KOG2785|consensus 95.2 0.026 5.7E-07 54.2 4.7 26 499-524 166-191 (390)
54 KOG2785|consensus 95.2 0.016 3.4E-07 55.7 3.0 54 375-428 165-243 (390)
55 KOG2482|consensus 95.1 0.04 8.6E-07 51.8 5.4 23 143-165 195-217 (423)
56 KOG2482|consensus 95.1 0.028 6.1E-07 52.8 4.3 181 241-427 129-356 (423)
57 PF12171 zf-C2H2_jaz: Zinc-fin 94.9 0.012 2.6E-07 33.8 0.8 22 500-521 2-23 (27)
58 PF12171 zf-C2H2_jaz: Zinc-fin 94.7 0.016 3.4E-07 33.2 1.0 22 528-549 2-23 (27)
59 COG5048 FOG: Zn-finger [Genera 93.5 0.1 2.2E-06 54.4 5.2 158 375-563 288-454 (467)
60 COG5048 FOG: Zn-finger [Genera 92.6 0.07 1.5E-06 55.7 2.4 154 114-290 288-453 (467)
61 PF13913 zf-C2HC_2: zinc-finge 92.2 0.11 2.4E-06 29.0 1.7 21 60-81 3-23 (25)
62 PF13913 zf-C2HC_2: zinc-finge 91.6 0.17 3.8E-06 28.2 2.1 20 256-276 3-22 (25)
63 smart00451 ZnF_U1 U1-like zinc 91.3 0.15 3.2E-06 31.3 1.8 24 555-578 3-26 (35)
64 KOG2893|consensus 90.1 0.12 2.7E-06 45.9 0.9 48 349-400 11-58 (341)
65 cd00350 rubredoxin_like Rubred 89.0 0.3 6.5E-06 29.5 1.7 10 500-509 2-11 (33)
66 cd00350 rubredoxin_like Rubred 88.5 0.35 7.6E-06 29.2 1.8 10 465-474 2-11 (33)
67 smart00451 ZnF_U1 U1-like zinc 88.0 0.39 8.5E-06 29.3 1.9 21 116-136 4-24 (35)
68 KOG4173|consensus 87.7 0.13 2.7E-06 44.8 -0.7 83 498-583 78-174 (253)
69 TIGR00622 ssl1 transcription f 86.7 1.9 4E-05 34.4 5.3 23 555-577 81-103 (112)
70 KOG2893|consensus 85.9 0.37 8.1E-06 42.9 1.2 42 229-274 12-53 (341)
71 KOG4173|consensus 85.8 0.19 4.2E-06 43.7 -0.5 81 349-432 80-173 (253)
72 COG4049 Uncharacterized protei 84.7 0.35 7.5E-06 32.2 0.4 29 552-580 14-42 (65)
73 TIGR00622 ssl1 transcription f 83.7 2.3 5E-05 33.8 4.6 19 308-327 14-32 (112)
74 KOG1280|consensus 83.4 1.3 2.8E-05 42.3 3.6 41 553-594 77-117 (381)
75 PF12013 DUF3505: Protein of u 83.0 1.2 2.6E-05 35.9 2.9 25 406-430 81-109 (109)
76 PF09986 DUF2225: Uncharacteri 82.0 0.65 1.4E-05 42.6 1.2 44 375-418 4-61 (214)
77 PF12013 DUF3505: Protein of u 81.1 1.3 2.9E-05 35.6 2.6 25 348-372 80-108 (109)
78 COG4049 Uncharacterized protei 80.9 0.76 1.6E-05 30.6 0.8 29 402-430 14-42 (65)
79 PHA00626 hypothetical protein 80.7 1.3 2.7E-05 29.8 1.8 12 499-510 23-34 (59)
80 cd00729 rubredoxin_SM Rubredox 77.8 1.7 3.7E-05 26.4 1.7 8 528-535 19-26 (34)
81 PF10571 UPF0547: Uncharacteri 77.5 1.9 4E-05 24.3 1.6 11 349-359 15-25 (26)
82 PF09986 DUF2225: Uncharacteri 75.4 1.6 3.4E-05 40.1 1.6 43 526-568 4-61 (214)
83 PF09538 FYDLN_acid: Protein o 74.2 2.3 4.9E-05 33.9 2.0 15 526-540 25-39 (108)
84 COG2888 Predicted Zn-ribbon RN 72.7 3.7 7.9E-05 28.2 2.3 9 499-507 27-35 (61)
85 PRK00464 nrdR transcriptional 72.4 1.4 3E-05 37.7 0.4 16 528-543 29-44 (154)
86 PRK00398 rpoP DNA-directed RNA 70.8 3.5 7.5E-05 27.1 1.9 13 464-476 3-15 (46)
87 PF03604 DNA_RNApol_7kD: DNA d 69.4 4.9 0.00011 24.0 2.1 12 465-476 1-12 (32)
88 PF09538 FYDLN_acid: Protein o 69.1 4.1 8.8E-05 32.5 2.3 30 256-296 10-39 (108)
89 PRK14890 putative Zn-ribbon RN 67.8 8 0.00017 26.7 3.1 10 499-508 25-34 (59)
90 PF02892 zf-BED: BED zinc fing 66.9 3.8 8.3E-05 26.6 1.5 26 554-579 15-44 (45)
91 KOG2186|consensus 66.4 3 6.4E-05 38.1 1.2 46 500-548 4-49 (276)
92 smart00659 RPOLCX RNA polymera 66.1 5.2 0.00011 26.0 2.0 11 465-475 3-13 (44)
93 COG1592 Rubrerythrin [Energy p 65.7 3.9 8.5E-05 35.3 1.8 22 349-383 135-156 (166)
94 KOG2186|consensus 65.6 3.2 6.9E-05 37.9 1.2 46 116-164 4-49 (276)
95 PF13719 zinc_ribbon_5: zinc-r 65.1 6.7 0.00015 24.3 2.3 10 256-265 26-35 (37)
96 TIGR02605 CxxC_CxxC_SSSS putat 65.0 3.6 7.8E-05 27.8 1.2 11 500-510 6-16 (52)
97 PRK00464 nrdR transcriptional 64.8 4.2 9.2E-05 34.8 1.8 19 499-517 28-46 (154)
98 PF09723 Zn-ribbon_8: Zinc rib 63.3 3.3 7.2E-05 26.5 0.7 13 172-184 6-18 (42)
99 PRK09678 DNA-binding transcrip 63.3 2.1 4.5E-05 31.2 -0.3 40 500-541 2-43 (72)
100 PF12907 zf-met2: Zinc-binding 63.2 4 8.7E-05 25.8 1.0 32 556-587 2-36 (40)
101 PF13717 zinc_ribbon_4: zinc-r 63.0 8.5 0.00019 23.7 2.4 10 256-265 26-35 (36)
102 PF01352 KRAB: KRAB box; Inte 62.8 3.5 7.5E-05 26.3 0.7 20 19-38 21-40 (41)
103 PRK09678 DNA-binding transcrip 61.6 3.2 6.9E-05 30.2 0.4 17 498-514 26-44 (72)
104 PF05443 ROS_MUCR: ROS/MUCR tr 61.1 3.7 8E-05 34.1 0.7 25 556-583 73-97 (132)
105 smart00531 TFIIE Transcription 60.9 5.8 0.00013 33.9 2.0 13 499-511 99-111 (147)
106 TIGR02098 MJ0042_CXXC MJ0042 f 59.4 9 0.0002 23.8 2.2 10 256-265 26-35 (38)
107 COG1592 Rubrerythrin [Energy p 59.2 6.5 0.00014 34.0 1.9 24 499-535 134-157 (166)
108 TIGR02300 FYDLN_acid conserved 58.7 7.7 0.00017 31.5 2.1 12 527-538 26-37 (129)
109 TIGR00373 conserved hypothetic 58.0 6.3 0.00014 34.1 1.7 17 527-543 109-125 (158)
110 TIGR00373 conserved hypothetic 57.6 9.2 0.0002 33.1 2.6 16 349-364 110-125 (158)
111 KOG2807|consensus 56.1 13 0.00028 35.5 3.4 24 255-278 345-368 (378)
112 smart00834 CxxC_CXXC_SSSS Puta 55.8 5.8 0.00013 25.0 0.9 10 144-153 6-15 (41)
113 PF15269 zf-C2H2_7: Zinc-finge 55.5 15 0.00032 23.5 2.5 27 344-370 16-42 (54)
114 TIGR02300 FYDLN_acid conserved 55.4 8.9 0.00019 31.2 2.0 31 59-100 9-39 (129)
115 COG1996 RPC10 DNA-directed RNA 55.3 6.3 0.00014 26.1 0.9 10 500-509 7-16 (49)
116 smart00531 TFIIE Transcription 55.1 13 0.00027 31.9 3.1 13 376-388 99-111 (147)
117 TIGR01384 TFS_arch transcripti 55.1 8.1 0.00018 30.7 1.8 11 556-566 91-101 (104)
118 PRK04023 DNA polymerase II lar 54.9 12 0.00026 41.7 3.5 11 556-566 664-674 (1121)
119 smart00734 ZnF_Rad18 Rad18-lik 54.8 11 0.00023 21.3 1.7 19 557-576 3-21 (26)
120 PF14353 CpXC: CpXC protein 54.2 2.2 4.8E-05 35.5 -1.7 13 465-477 2-14 (128)
121 COG1997 RPL43A Ribosomal prote 53.3 8.3 0.00018 28.9 1.4 13 404-416 52-64 (89)
122 PRK06266 transcription initiat 51.2 9.2 0.0002 33.8 1.7 16 528-543 118-133 (178)
123 smart00614 ZnF_BED BED zinc fi 50.9 10 0.00023 25.3 1.5 23 557-579 20-47 (50)
124 PRK06266 transcription initiat 50.4 14 0.0003 32.7 2.7 16 376-391 117-132 (178)
125 PF07754 DUF1610: Domain of un 48.1 16 0.00035 20.1 1.7 10 498-507 15-24 (24)
126 PF14353 CpXC: CpXC protein 47.8 4.7 0.0001 33.6 -0.7 19 528-546 39-57 (128)
127 PF06524 NOA36: NOA36 protein; 47.0 13 0.00028 34.2 1.8 14 278-291 137-150 (314)
128 PF15269 zf-C2H2_7: Zinc-finge 46.9 14 0.0003 23.5 1.5 24 498-521 19-42 (54)
129 PF06524 NOA36: NOA36 protein; 46.8 13 0.00028 34.2 1.9 60 226-303 170-229 (314)
130 PRK14714 DNA polymerase II lar 46.0 22 0.00048 40.9 3.8 41 500-566 680-720 (1337)
131 KOG2593|consensus 45.0 14 0.0003 37.0 1.9 37 523-562 124-160 (436)
132 COG5151 SSL1 RNA polymerase II 45.0 21 0.00046 33.7 3.0 47 378-427 364-410 (421)
133 KOG2071|consensus 44.4 16 0.00034 38.3 2.3 26 498-523 417-442 (579)
134 KOG2593|consensus 43.8 27 0.0006 35.0 3.7 15 58-72 127-141 (436)
135 PF05495 zf-CHY: CHY zinc fing 43.6 11 0.00023 27.6 0.7 11 465-475 11-21 (71)
136 PRK04023 DNA polymerase II lar 42.1 23 0.00049 39.7 3.1 8 256-263 639-646 (1121)
137 PF04959 ARS2: Arsenite-resist 42.0 11 0.00025 34.2 0.8 26 555-580 77-102 (214)
138 KOG4124|consensus 42.0 7.3 0.00016 37.3 -0.4 54 374-427 347-420 (442)
139 COG1198 PriA Primosomal protei 41.3 19 0.00041 39.6 2.4 9 282-290 474-482 (730)
140 PF01780 Ribosomal_L37ae: Ribo 40.5 9.9 0.00022 29.0 0.2 13 404-416 52-64 (90)
141 PF02176 zf-TRAF: TRAF-type zi 40.2 21 0.00046 24.7 1.8 6 103-108 26-31 (60)
142 PF12760 Zn_Tnp_IS1595: Transp 39.2 27 0.00059 22.8 2.1 11 553-563 35-45 (46)
143 COG4957 Predicted transcriptio 38.9 15 0.00033 30.0 1.0 23 557-582 78-100 (148)
144 PTZ00255 60S ribosomal protein 38.5 14 0.00031 28.1 0.7 14 404-417 53-66 (90)
145 PF13240 zinc_ribbon_2: zinc-r 38.2 18 0.00039 19.7 0.9 8 467-474 2-9 (23)
146 PF05443 ROS_MUCR: ROS/MUCR tr 38.0 12 0.00027 31.0 0.4 24 528-554 73-96 (132)
147 PF05191 ADK_lid: Adenylate ki 37.8 24 0.00051 21.8 1.5 10 528-537 22-31 (36)
148 PF03811 Zn_Tnp_IS1: InsA N-te 37.5 35 0.00075 21.1 2.2 19 487-505 17-35 (36)
149 PF08274 PhnA_Zn_Ribbon: PhnA 37.0 17 0.00036 21.4 0.7 11 466-476 4-14 (30)
150 TIGR00280 L37a ribosomal prote 36.7 15 0.00032 28.1 0.6 15 403-417 51-65 (91)
151 PF04959 ARS2: Arsenite-resist 36.3 19 0.00041 32.8 1.3 26 405-430 77-102 (214)
152 PF00301 Rubredoxin: Rubredoxi 36.2 36 0.00077 22.5 2.2 12 465-476 2-13 (47)
153 COG4530 Uncharacterized protei 35.6 23 0.00051 27.7 1.5 12 498-509 25-36 (129)
154 PRK14714 DNA polymerase II lar 34.3 37 0.0008 39.2 3.3 11 376-386 692-702 (1337)
155 COG1198 PriA Primosomal protei 33.9 32 0.0007 37.8 2.8 9 465-473 436-444 (730)
156 KOG2272|consensus 33.6 23 0.00049 32.4 1.3 28 172-199 100-127 (332)
157 KOG4377|consensus 33.5 32 0.0007 34.0 2.4 64 60-126 272-360 (480)
158 KOG2272|consensus 33.4 37 0.0008 31.1 2.6 127 58-189 98-239 (332)
159 PRK00432 30S ribosomal protein 33.2 33 0.00071 23.0 1.7 11 499-509 37-47 (50)
160 PF14446 Prok-RING_1: Prokaryo 32.3 30 0.00066 23.5 1.4 25 116-152 6-30 (54)
161 COG5151 SSL1 RNA polymerase II 32.2 73 0.0016 30.3 4.3 25 347-371 387-411 (421)
162 PF13453 zf-TFIIB: Transcripti 32.2 26 0.00057 22.2 1.1 20 554-573 18-37 (41)
163 PRK03976 rpl37ae 50S ribosomal 31.3 19 0.00041 27.5 0.4 15 403-417 52-66 (90)
164 PF01096 TFIIS_C: Transcriptio 30.9 22 0.00048 22.3 0.6 11 499-509 28-38 (39)
165 PRK03824 hypA hydrogenase nick 30.6 38 0.00083 28.4 2.1 14 463-476 69-82 (135)
166 PF02176 zf-TRAF: TRAF-type zi 30.4 53 0.0012 22.6 2.6 21 156-176 23-43 (60)
167 cd00730 rubredoxin Rubredoxin; 29.9 43 0.00093 22.5 1.8 12 465-476 2-13 (50)
168 KOG2807|consensus 29.6 63 0.0014 31.1 3.5 20 376-395 345-364 (378)
169 PF08790 zf-LYAR: LYAR-type C2 28.3 30 0.00064 19.9 0.7 9 501-509 2-10 (28)
170 PF09845 DUF2072: Zn-ribbon co 28.3 34 0.00074 28.1 1.4 13 464-476 1-13 (131)
171 smart00440 ZnF_C2C2 C2C2 Zinc 28.1 47 0.001 21.0 1.7 11 499-509 28-38 (40)
172 PF15135 UPF0515: Uncharacteri 28.1 65 0.0014 29.7 3.2 12 349-360 156-167 (278)
173 KOG4167|consensus 27.3 17 0.00038 38.7 -0.5 25 255-279 792-816 (907)
174 smart00661 RPOL9 RNA polymeras 27.0 53 0.0012 21.9 2.0 13 499-511 20-32 (52)
175 PF07975 C1_4: TFIIH C1-like d 26.7 32 0.00069 23.2 0.8 20 376-395 21-40 (51)
176 KOG3362|consensus 26.0 25 0.00055 29.1 0.3 22 499-520 129-150 (156)
177 PF13451 zf-trcl: Probable zin 25.8 29 0.00064 23.1 0.5 10 377-386 5-14 (49)
178 KOG4167|consensus 25.5 19 0.00042 38.4 -0.6 27 57-83 790-816 (907)
179 TIGR01562 FdhE formate dehydro 25.4 48 0.001 32.2 2.1 75 463-564 183-261 (305)
180 PF07649 C1_3: C1-like domain; 25.1 50 0.0011 19.1 1.4 11 226-236 14-24 (30)
181 PF08271 TF_Zn_Ribbon: TFIIB z 24.4 38 0.00083 21.7 0.9 9 406-414 20-28 (43)
182 PF04216 FdhE: Protein involve 24.1 16 0.00035 35.5 -1.4 75 464-563 172-246 (290)
183 COG3677 Transposase and inacti 23.8 54 0.0012 27.3 1.8 13 498-510 52-64 (129)
184 PRK03564 formate dehydrogenase 23.5 73 0.0016 31.0 2.9 78 461-566 184-263 (309)
185 COG2331 Uncharacterized protei 23.3 40 0.00086 24.6 0.8 10 144-153 13-22 (82)
186 PRK03681 hypA hydrogenase nick 23.3 52 0.0011 26.6 1.6 11 465-475 71-81 (114)
187 PF03833 PolC_DP2: DNA polymer 22.9 28 0.00061 38.2 0.0 11 527-537 680-690 (900)
188 TIGR00595 priA primosomal prot 22.8 56 0.0012 34.6 2.2 47 193-290 214-260 (505)
189 COG0068 HypF Hydrogenase matur 22.2 27 0.00058 37.6 -0.3 73 466-562 103-180 (750)
190 TIGR00100 hypA hydrogenase nic 22.2 50 0.0011 26.8 1.3 12 500-511 71-82 (115)
191 KOG4124|consensus 22.1 28 0.00062 33.5 -0.1 24 553-576 396-419 (442)
192 PF05290 Baculo_IE-1: Baculovi 22.1 49 0.0011 27.2 1.2 15 85-99 78-92 (140)
193 cd00065 FYVE FYVE domain; Zinc 21.9 65 0.0014 21.9 1.7 11 501-511 4-14 (57)
194 KOG3408|consensus 21.8 34 0.00074 27.5 0.3 25 524-548 54-78 (129)
195 KOG2461|consensus 21.6 2.2E+02 0.0049 28.9 6.0 69 494-562 326-394 (396)
196 PF01286 XPA_N: XPA protein N- 21.4 39 0.00085 20.5 0.4 8 467-474 6-13 (34)
197 KOG1701|consensus 21.0 20 0.00044 35.7 -1.4 36 285-320 396-438 (468)
198 PRK14873 primosome assembly pr 21.0 64 0.0014 35.4 2.2 46 193-290 384-429 (665)
199 PF11789 zf-Nse: Zinc-finger o 20.9 53 0.0011 22.7 1.0 15 499-513 24-38 (57)
200 COG3364 Zn-ribbon containing p 20.7 54 0.0012 25.4 1.1 14 376-389 2-15 (112)
201 PF06220 zf-U1: U1 zinc finger 20.6 73 0.0016 19.9 1.5 22 555-576 3-26 (38)
202 PRK12380 hydrogenase nickel in 20.2 64 0.0014 26.1 1.6 11 465-475 71-81 (113)
203 smart00064 FYVE Protein presen 20.2 66 0.0014 22.9 1.5 26 501-538 12-37 (68)
204 PF01363 FYVE: FYVE zinc finge 20.2 55 0.0012 23.5 1.1 10 465-474 10-19 (69)
205 TIGR01206 lysW lysine biosynth 20.1 49 0.0011 22.6 0.7 10 144-153 3-12 (54)
206 PF13878 zf-C2H2_3: zinc-finge 20.1 82 0.0018 20.0 1.7 10 117-126 15-24 (41)
No 1
>KOG1074|consensus
Probab=99.97 E-value=1.6e-31 Score=267.93 Aligned_cols=112 Identities=18% Similarity=0.193 Sum_probs=94.6
Q ss_pred ccccccccccccCchHHHhHHhhhcCCCCcccCcCcccccChhHHHHHHhhcCCC-------------------------
Q psy7449 59 KNTCCICGHVESGAWRLKRHFLKHTGEKSYECCWCTYKTAYENDLKKHQRKHTGE------------------------- 113 (596)
Q Consensus 59 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~------------------------- 113 (596)
+-.|..|...|.+...+..+++.|..+.++.|..|+..|....+|..+.-.+...
T Consensus 169 ~~~~~~s~~~~~t~s~~~~~lrqhl~qq~~~~~~~~q~~~a~~s~~~~~~~~~~~~e~Pq~~~e~~~kp~ng~~~p~~l~ 248 (958)
T KOG1074|consen 169 PPGVGSSHLNIPTISEELRVLRQHLIQQMQMTEQICQQVLALGSLGQTVGAPASPSELPQTGTESSTKPLNGLFSPIKLV 248 (958)
T ss_pred CCCCCccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccCCCChhhcCCCCCcccccccCCCCCccccc
Confidence 4468889999999999999999999999999999999888888877764322100
Q ss_pred --------------------------------------------------------------------------------
Q psy7449 114 -------------------------------------------------------------------------------- 113 (596)
Q Consensus 114 -------------------------------------------------------------------------------- 113 (596)
T Consensus 249 ~~~~~d~pS~s~~ss~a~~~~~~~~~~~~~~~~~~~~~s~~s~~S~~~~~~~~~~~~~~P~~l~~~~~~~~~~~~~s~sa 328 (958)
T KOG1074|consen 249 QTSKADSPSSSSSSSGAETPKQAFFHLYHELGSQMAFSSGGSLASHPPTPAPPPAAPAFPSPLGLLAQTEEAARGLSGSA 328 (958)
T ss_pred cCCccCCcccccccccccCccccccccccccccccchhhcCccccCCCCCCCCcccCCCCCCccccccchhhhccccccc
Confidence
Q ss_pred -----------------------CCcccCCCCccccChhHHHHHhhhccCCCCccCCCCCccccCchhHHhHhhhccCCC
Q psy7449 114 -----------------------KPYSCGHCPYTARSSSYMLTHLKVHSGEKAFKCEYCSYSSYSKSNLRTHMRTHIGEK 170 (596)
Q Consensus 114 -----------------------~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~ 170 (596)
-+.+|.+|.+.|.+.+.|..|++.|+|++||+|++||..|.++.+|+.|...|...-
T Consensus 329 ~p~~~k~~~~~~~~v~~~~~~~~~khkCr~CakvfgS~SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~e~~ 408 (958)
T KOG1074|consen 329 SPGLLKEKNGSYFSVEGPSEKPFFKHKCRFCAKVFGSDSALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHREKY 408 (958)
T ss_pred CcccCCcccccccccccCCccccccchhhhhHhhcCchhhhhhhhhccCCCCCeeecccccccccccceeeeeeeccccC
Confidence 006799999999999999999999999999999999999999999999998876543
No 2
>KOG1074|consensus
Probab=99.97 E-value=5.2e-31 Score=264.22 Aligned_cols=236 Identities=23% Similarity=0.328 Sum_probs=158.8
Q ss_pred CcccCCCCCcccCHHHHHHhhhhcCCCCccccCccccccccchhhhhhhhcccCCC----CcccCC---CCcccccchhh
Q psy7449 348 NYVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTHCRYSASRRDTLDHHIQVSHLYK----KRFQCP---VCDYRCTKKYN 420 (596)
Q Consensus 348 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~~~----~~~~C~---~C~~~f~~~~~ 420 (596)
.-+|-+|-++.+-.++|+.|.++|+||+||+|.+||++|.++.+|+.|| ..|... -.+.|+ +|-+.|.+...
T Consensus 605 PNqCiiC~rVlSC~saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~-~vHka~p~~R~q~ScP~~~ic~~kftn~V~ 683 (958)
T KOG1074|consen 605 PNQCIICLRVLSCPSALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHM-SVHKAKPPARVQFSCPSTFICQKKFTNAVT 683 (958)
T ss_pred ccceeeeeecccchhhhhhhhhcccCcCccccccccchhccccchhhcc-cccccCccccccccCCchhhhccccccccc
Confidence 4689999999999999999999999999999999999999999999995 456544 348999 99999999999
Q ss_pred HHHHhhhcccCCCC-----------CCCcccCCcccccccCCCCcccc---------------ccCCcc----ceecCCC
Q psy7449 421 LQMHALTKHDLDIK-----------SDGIIQDAGTYILDTALSPEKRV---------------KSRGHE----TVICHHC 470 (596)
Q Consensus 421 l~~H~~~~H~~~~~-----------~~~~~~~~~~~~~~~~l~~~~~~---------------~~~~~~----~~~C~~C 470 (596)
|..|++.|-++... .+.|..|.+.|.....+..++.. ..+++. +..+..|
T Consensus 684 lpQhIriH~~~~~s~g~~a~e~~~~adq~~~~qk~~~~a~~f~~~~se~~~~~s~~~~~~~~~t~t~~~~~tp~~~e~~~ 763 (958)
T KOG1074|consen 684 LPQHIRIHLGGQISNGGTAAEGILAADQCSSCQKTFSDARSFSQQISEQPSPESEPDEQMDERTETEELDVTPPPPENSC 763 (958)
T ss_pred ccceEEeecCCCCCCCcccccccchhcccchhhhcccccccchhhhhccCCcccCCcccccccccccccccCCCcccccc
Confidence 99999995433321 23456666666443333332220 111111 4556667
Q ss_pred Ccccccchhhhcccccc--cc--------------CCCCCCCCCcc-cCCCCCccccccchhh----hh-----------
Q psy7449 471 RDLLPYKIKDLVSHGEL--CS--------------GVPRPDGSFRF-VCFTCNYHTYKSGHMR----QH----------- 518 (596)
Q Consensus 471 ~~~~~~~~~~~~~~~~l--~~--------------h~~~~~~~~~~-~C~~C~~~f~~~~~l~----~H----------- 518 (596)
+..+.......+.-..- .. .....+++++. .+.+++-.-...-... .=
T Consensus 764 ~~~~~~e~~i~~~g~te~asa~~~~vg~~s~~~~~~~~~~T~~k~~~~~~~~~~~~~~~v~~~pvl~~~~~~~l~eg~~t 843 (958)
T KOG1074|consen 764 GRELEGEMAISVRGSTEEASANLDEVGTVSAAGEAGEEDDTSEKPTQASSFPGEILAPSVNMDPVLWNQETSMLNEGLAT 843 (958)
T ss_pred ccccCcccccccccchhhhhcChhhhcCccccchhhhhcccCCCCcccccCCCcCCccccccCchhhccccccccccccc
Confidence 76665532222111100 00 01112334455 5666653322111100 00
Q ss_pred ---hhccCCC------------------------CCccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChhhH
Q psy7449 519 ---INGHVGI------------------------KPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQL 571 (596)
Q Consensus 519 ---~~~H~~~------------------------~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l 571 (596)
..++.+. ....|.+||+.|...++|..||++|+|++||.|.+|++.|..+..|
T Consensus 844 ~~n~~t~~~~~~sv~qs~~~p~l~p~l~~~~pvnn~h~C~vCgk~FsSSsALqiH~rTHtg~KPF~C~fC~~aFttrgnL 923 (958)
T KOG1074|consen 844 KTNEITPEGPADSVIQSGGVPTLEPSLGRPGPVNNAHVCNVCGKQFSSSAALEIHMRTHTGPKPFFCHFCEEAFTTRGNL 923 (958)
T ss_pred ccccccCCCcchhhhhhccccccCCCCCCCCcccchhhhccchhcccchHHHHHhhhcCCCCCCccchhhhhhhhhhhhh
Confidence 0000000 1167999999999999999999999999999999999999999999
Q ss_pred HHHHhhcCCCCCC
Q psy7449 572 KYHLTTRHDLPDM 584 (596)
Q Consensus 572 ~~H~~~~H~~~~~ 584 (596)
+.||.+|+..-|.
T Consensus 924 KvHMgtH~w~q~~ 936 (958)
T KOG1074|consen 924 KVHMGTHMWVQPP 936 (958)
T ss_pred hhhhccccccCCC
Confidence 9999999865443
No 3
>KOG2462|consensus
Probab=99.94 E-value=1.4e-27 Score=211.67 Aligned_cols=134 Identities=31% Similarity=0.533 Sum_probs=119.3
Q ss_pred CccccCccccccccchhhhhhhhcccC---CCCcccCCCCcccccchhhHHHHhhhcccCCCCCCCcccCCcccccccCC
Q psy7449 375 KPFKCTHCRYSASRRDTLDHHIQVSHL---YKKRFQCPVCDYRCTKKYNLQMHALTKHDLDIKSDGIIQDAGTYILDTAL 451 (596)
Q Consensus 375 k~~~C~~C~~~f~~~~~L~~H~~~~H~---~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~l 451 (596)
..|+|+.||+.+.+.++|.+|.+ .|- ..+.+.|++|||.|.+...|.+|+++ |+
T Consensus 129 ~r~~c~eCgk~ysT~snLsrHkQ-~H~~~~s~ka~~C~~C~K~YvSmpALkMHirT-H~--------------------- 185 (279)
T KOG2462|consen 129 PRYKCPECGKSYSTSSNLSRHKQ-THRSLDSKKAFSCKYCGKVYVSMPALKMHIRT-HT--------------------- 185 (279)
T ss_pred Cceeccccccccccccccchhhc-ccccccccccccCCCCCceeeehHHHhhHhhc-cC---------------------
Confidence 45888888888888888888844 442 24669999999999999999999988 43
Q ss_pred CCccccccCCccceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCCCCccCC
Q psy7449 452 SPEKRVKSRGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCL 531 (596)
Q Consensus 452 ~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~ 531 (596)
.+++|.+|||.|...+.|+-|+|+|+|||||.|+
T Consensus 186 ----------------------------------------------l~c~C~iCGKaFSRPWLLQGHiRTHTGEKPF~C~ 219 (279)
T KOG2462|consen 186 ----------------------------------------------LPCECGICGKAFSRPWLLQGHIRTHTGEKPFSCP 219 (279)
T ss_pred ----------------------------------------------CCcccccccccccchHHhhcccccccCCCCccCC
Confidence 2478999999999999999999999999999999
Q ss_pred CCCccccCchhhhhhhhhccCCCcccCCccCcCccChhhHHHHHhh
Q psy7449 532 HCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQLKYHLTT 577 (596)
Q Consensus 532 ~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l~~H~~~ 577 (596)
+|+++|.++++|+.||++|.+.++|+|..|++.|..++.|.+|+..
T Consensus 220 hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 220 HCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred cccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhh
Confidence 9999999999999999999999999999999999999999999764
No 4
>KOG3608|consensus
Probab=99.93 E-value=3.1e-26 Score=207.57 Aligned_cols=193 Identities=26% Similarity=0.508 Sum_probs=178.1
Q ss_pred ecCCCCccCCChHHHHhhhhhcCCCCCccCccCcccccChHHHHHHHHHhc--CCCceeeCCCCCccccccccccccccc
Q psy7449 229 QCYSCEYHSLYSSRMIKHIRNHTGERPFQCKICDSTFKRKQTLDLHIKRHT--GERTYKCEFCDYTAARKDTLGVHVRIK 306 (596)
Q Consensus 229 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~~ 306 (596)
....|.+.+.+++.|++|++.|++++...|+.||.-|.++..|-.|+++-+ ...+|.|..|.|.|.++..|..|+..+
T Consensus 181 ~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv~rH 260 (467)
T KOG3608|consen 181 NWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHVVRH 260 (467)
T ss_pred cchhhhhhhccHHHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHHHHh
Confidence 446799999999999999999999999999999999999999999987643 456899999999999999999999999
Q ss_pred ccccccCCcccccchhhHHHHHHhhhcCCCCCCCCCCCCCCCcccCCCCCcccCHHHHHHhhhhcCCCCccccCc--ccc
Q psy7449 307 HKISICQNCGLILSTITEELLEHCRICSGKERPDSSTDKSFNYVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTH--CRY 384 (596)
Q Consensus 307 h~~~~C~~C~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~--C~~ 384 (596)
-.-|.|+.|+...+. .++|..|++.-+.+.+| |+|+.|++.+.+.+.|.+|..+|. +..|.|.. |.+
T Consensus 261 vn~ykCplCdmtc~~-~ssL~~H~r~rHs~dkp---------fKCd~Cd~~c~~esdL~kH~~~HS-~~~y~C~h~~C~~ 329 (467)
T KOG3608|consen 261 VNCYKCPLCDMTCSS-ASSLTTHIRYRHSKDKP---------FKCDECDTRCVRESDLAKHVQVHS-KTVYQCEHPDCHY 329 (467)
T ss_pred hhcccccccccCCCC-hHHHHHHHHhhhccCCC---------ccccchhhhhccHHHHHHHHHhcc-ccceecCCCCCcH
Confidence 999999999999998 89999999987777788 999999999999999999999988 67899998 999
Q ss_pred ccccchhhhhhhhcccCCC--CcccCCCCcccccchhhHHHHhhhcccCC
Q psy7449 385 SASRRDTLDHHIQVSHLYK--KRFQCPVCDYRCTKKYNLQMHALTKHDLD 432 (596)
Q Consensus 385 ~f~~~~~L~~H~~~~H~~~--~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 432 (596)
+|.....|.+|++.+|.+. .+|.|..|++.|++-.+|..|++..|+..
T Consensus 330 s~r~~~q~~~H~~evhEg~np~~Y~CH~Cdr~ft~G~~L~~HL~kkH~f~ 379 (467)
T KOG3608|consen 330 SVRTYTQMRRHFLEVHEGNNPILYACHCCDRFFTSGKSLSAHLMKKHGFR 379 (467)
T ss_pred HHHHHHHHHHHHHHhccCCCCCceeeecchhhhccchhHHHHHHHhhccc
Confidence 9999999999999988655 45999999999999999999999999876
No 5
>KOG3608|consensus
Probab=99.92 E-value=3.2e-26 Score=207.48 Aligned_cols=163 Identities=28% Similarity=0.566 Sum_probs=127.2
Q ss_pred eec--CCCCccCCChHHHHhhhhhcCC------------CCC-ccC--ccCcccccChHHHHHHHHHhcCCCceeeCCCC
Q psy7449 228 YQC--YSCEYHSLYSSRMIKHIRNHTG------------ERP-FQC--KICDSTFKRKQTLDLHIKRHTGERTYKCEFCD 290 (596)
Q Consensus 228 ~~C--~~C~~~f~~~~~l~~H~~~h~~------------~~~-~~C--~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 290 (596)
|.| ..|+..|.+...|..|+-.|.- ++| +.| ..|.+.|.++..|+.|++.|++++...|+.|+
T Consensus 135 f~C~WedCe~~F~s~~ef~dHV~~H~l~ceyd~~~~~~D~~pv~~C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp~Cg 214 (467)
T KOG3608|consen 135 FRCGWEDCEREFVSIVEFQDHVVKHALFCEYDIQKTPEDERPVTMCNWAMCTKHMGNKYRLREHIRTHSNEKVVACPHCG 214 (467)
T ss_pred hccChhhcCCcccCHHHHHHHHHHhhhhhhhhhhhCCCCCCceeeccchhhhhhhccHHHHHHHHHhcCCCeEEecchHH
Confidence 455 6899999999999999877642 122 456 46999999999999999999999999999999
Q ss_pred CcccccccccccccccccccccCCcccccchhhHHHHHHhhhcCCCCCCCCCCCCCCCcccCCCCCcccCHHHHHHhhhh
Q psy7449 291 YTAARKDTLGVHVRIKHKISICQNCGLILSTITEELLEHCRICSGKERPDSSTDKSFNYVCYTCDYHTGVRECMKKHIRT 370 (596)
Q Consensus 291 ~~f~~~~~l~~H~~~~h~~~~C~~C~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~ 370 (596)
..|+++..|-.|++.. ..|.. .+ |.|..|.+.|.+...|..|+..
T Consensus 215 ~~F~~~tkl~DH~rRq-----------------t~l~~---------n~---------fqC~~C~KrFaTeklL~~Hv~r 259 (467)
T KOG3608|consen 215 ELFRTKTKLFDHLRRQ-----------------TELNT---------NS---------FQCAQCFKRFATEKLLKSHVVR 259 (467)
T ss_pred HHhccccHHHHHHHhh-----------------hhhcC---------Cc---------hHHHHHHHHHhHHHHHHHHHHH
Confidence 9999988888877531 11100 12 8888888888888888888887
Q ss_pred cCCCCccccCccccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHhhh
Q psy7449 371 HTGEKPFKCTHCRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALT 427 (596)
Q Consensus 371 h~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 427 (596)
|.. .|+|+.|+.+....++|..||+..|...+||+|..|++.|.+.++|.+|+..
T Consensus 260 Hvn--~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~kH~~~ 314 (467)
T KOG3608|consen 260 HVN--CYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAKHVQV 314 (467)
T ss_pred hhh--cccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHHHHHh
Confidence 765 4788888888888888888888778777888888888888777777777765
No 6
>KOG2462|consensus
Probab=99.92 E-value=6.3e-26 Score=201.29 Aligned_cols=142 Identities=31% Similarity=0.535 Sum_probs=132.8
Q ss_pred CCCccccccccccccCchHHHhHHhhhc---CCCCcccCcCcccccChhHHHHHHhhcCCCCCcccCCCCccccChhHHH
Q psy7449 56 AQNKNTCCICGHVESGAWRLKRHFLKHT---GEKSYECCWCTYKTAYENDLKKHQRKHTGEKPYSCGHCPYTARSSSYML 132 (596)
Q Consensus 56 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~---~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~ 132 (596)
....|.|+.||+.+.+.+.|.+|.+.|- ..+.+.|+.||+.|.+...|+.|+++|+ .+.+|.+|||.|.....|.
T Consensus 127 ~~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~iCGKaFSRPWLLQ 204 (279)
T KOG2462|consen 127 KHPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECGICGKAFSRPWLLQ 204 (279)
T ss_pred cCCceeccccccccccccccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC--CCcccccccccccchHHhh
Confidence 4567899999999999999999999985 3678999999999999999999999997 6899999999999999999
Q ss_pred HHhhhccCCCCccCCCCCccccCchhHHhHhhhccCCCceecCCCccccchhHhhhcccccccccCC
Q psy7449 133 THLKVHSGEKAFKCEYCSYSSYSKSNLRTHMRTHIGEKPYKCSSCDFQATQKIQVKKHVLIICQHCD 199 (596)
Q Consensus 133 ~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~c~~c~ 199 (596)
-|+++|+|||||.|+.|++.|.++++|+.||++|.+.|.|+|..|++.|..++.|.+|..--|....
T Consensus 205 GHiRTHTGEKPF~C~hC~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFsl~SyLnKH~ES~C~~~~ 271 (279)
T KOG2462|consen 205 GHIRTHTGEKPFSCPHCGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFALKSYLNKHSESACLKYL 271 (279)
T ss_pred cccccccCCCCccCCcccchhcchHHHHHHHHhhcCCccccCcchhhHHHHHHHHHHhhhhcccccc
Confidence 9999999999999999999999999999999999999999999999999999999999876665443
No 7
>KOG3623|consensus
Probab=99.89 E-value=9e-24 Score=208.61 Aligned_cols=83 Identities=28% Similarity=0.621 Sum_probs=79.8
Q ss_pred CCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChhhHHHH
Q psy7449 495 DGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQLKYH 574 (596)
Q Consensus 495 ~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l~~H 574 (596)
+.+..|.|++|+|.|...+.|.+|.-.|+|.+||+|.+|.|.|..+..|..|+|.|.||+||+|+.|++.|.....+..|
T Consensus 890 te~gmyaCDqCDK~FqKqSSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHSGEKPfQCdKClKRFSHSGSYSQH 969 (1007)
T KOG3623|consen 890 TEDGMYACDQCDKAFQKQSSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHSGEKPFQCDKCLKRFSHSGSYSQH 969 (1007)
T ss_pred CccccchHHHHHHHHHhhHHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhccCCCcchhhhhhhhcccccchHhh
Confidence 45678999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred Hhh
Q psy7449 575 LTT 577 (596)
Q Consensus 575 ~~~ 577 (596)
|..
T Consensus 970 MNH 972 (1007)
T KOG3623|consen 970 MNH 972 (1007)
T ss_pred hcc
Confidence 863
No 8
>KOG3623|consensus
Probab=99.88 E-value=3.1e-23 Score=204.89 Aligned_cols=108 Identities=27% Similarity=0.549 Sum_probs=92.3
Q ss_pred CccccccccccccCchHHHhHHhhhc--CCCCcccCcCcccccChhHHHHHHhhcCC-------------CCCcccCCCC
Q psy7449 58 NKNTCCICGHVESGAWRLKRHFLKHT--GEKSYECCWCTYKTAYENDLKKHQRKHTG-------------EKPYSCGHCP 122 (596)
Q Consensus 58 ~~~~C~~C~~~f~~~~~l~~H~~~h~--~~~~~~C~~C~~~f~~~~~l~~H~~~h~~-------------~~~~~C~~C~ 122 (596)
...+|+.|++.+....+|..|++-.+ .+.-|.|..|.++|.++..|.+||.+|.. .+.|+|.+|+
T Consensus 209 qlltcpycdrgykrltslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~lRKFKCtECg 288 (1007)
T KOG3623|consen 209 QLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALLRKFKCTECG 288 (1007)
T ss_pred hhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccccchhhhccccccccc
Confidence 45689999999999999999987554 34568899999999999999999988853 3569999999
Q ss_pred ccccChhHHHHHhhhccCCCCccCCCCCccccCchhHHhHhhh
Q psy7449 123 YTARSSSYMLTHLKVHSGEKAFKCEYCSYSSYSKSNLRTHMRT 165 (596)
Q Consensus 123 ~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~ 165 (596)
|.|+.+-.|+.|+++|+|++||.|+.|++.|...+.+..||..
T Consensus 289 KAFKfKHHLKEHlRIHSGEKPfeCpnCkKRFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 289 KAFKFKHHLKEHLRIHSGEKPFECPNCKKRFSHSGSYSSHMSS 331 (1007)
T ss_pred hhhhhHHHHHhhheeecCCCCcCCcccccccccCCcccccccc
Confidence 9999999999999999999999999999999999999888753
No 9
>KOG3576|consensus
Probab=99.68 E-value=5e-18 Score=142.51 Aligned_cols=117 Identities=22% Similarity=0.446 Sum_probs=106.4
Q ss_pred CCccceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccC
Q psy7449 460 RGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQ 539 (596)
Q Consensus 460 ~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~ 539 (596)
.+...+.|.+|++.|.. ...|..|+.-|+..++|-|..||+.|.....|++|+|+|+|.+||+|..|+++|++
T Consensus 113 sd~d~ftCrvCgK~F~l-------QRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftq 185 (267)
T KOG3576|consen 113 SDQDSFTCRVCGKKFGL-------QRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQ 185 (267)
T ss_pred CCCCeeeeehhhhhhhH-------HHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHh
Confidence 34567999999999998 88999999999999999999999999999999999999999999999999999999
Q ss_pred chhhhhhhhhccC-----------CCcccCCccCcCccChhhHHHHHhhcCCCCC
Q psy7449 540 SSCLGRHMRIHTG-----------EKPYACSLCDLKFSQNSQLKYHLTTRHDLPD 583 (596)
Q Consensus 540 ~~~L~~H~~~H~~-----------~~~~~C~~C~~~F~~~~~l~~H~~~~H~~~~ 583 (596)
.-+|..|++.-+| ++.|.|+.||.+-.....+..|++.+|..-|
T Consensus 186 rcsleshl~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 186 RCSLESHLKKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred hccHHHHHHHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 9999999974443 4679999999999999999999999997654
No 10
>KOG3576|consensus
Probab=99.64 E-value=3.8e-17 Score=137.28 Aligned_cols=116 Identities=32% Similarity=0.546 Sum_probs=105.6
Q ss_pred CCCccccccccccccCchHHHhHHhhhcCCCCcccCcCcccccChhHHHHHHhhcCCCCCcccCCCCccccChhHHHHHh
Q psy7449 56 AQNKNTCCICGHVESGAWRLKRHFLKHTGEKSYECCWCTYKTAYENDLKKHQRKHTGEKPYSCGHCPYTARSSSYMLTHL 135 (596)
Q Consensus 56 ~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 135 (596)
++..|.|.+|++.|.-...|.+|++-|..-+.|.|..||+.|+...+|++|+|+|+|.+||+|..|++.|..+-+|..|+
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rthtgvrpykc~~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHTGVRPYKCSLCEKAFTQRCSLESHL 193 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhccccCccccchhhhhHHHHhhccHHHHH
Confidence 46689999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred hhccC-----------CCCccCCCCCccccCchhHHhHhhhccCCCc
Q psy7449 136 KVHSG-----------EKAFKCEYCSYSSYSKSNLRTHMRTHIGEKP 171 (596)
Q Consensus 136 ~~h~~-----------~~~~~C~~C~~~f~~~~~L~~H~~~h~~~~~ 171 (596)
+.-+| ++.|.|+.||.+-.....+..|++.|+...|
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~t~~~~e~~~~h~~~~hp~Sp 240 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGYTSERPEVYYLHLKLHHPFSP 240 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCCCCCChhHHHHHHHhcCCCCH
Confidence 75444 4678999999999999999999998876543
No 11
>PHA00733 hypothetical protein
Probab=99.16 E-value=1.6e-11 Score=101.12 Aligned_cols=93 Identities=18% Similarity=0.358 Sum_probs=76.3
Q ss_pred cccCCCCCCCCCcccCCCCCccccccchhhhh--hh---ccCCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCcc
Q psy7449 487 LCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQH--IN---GHVGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLC 561 (596)
Q Consensus 487 l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H--~~---~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C 561 (596)
|...-......+++.|.+|++.|.....|..| ++ .+.+.+||.|+.||+.|.+...|..|++.| +.+|.|+.|
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~C 105 (128)
T PHA00733 28 LKRYHSLTPEQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVC 105 (128)
T ss_pred hhhhhcCChhhhhHHHHHHhhhccChhhhcchHHHHhhcccCCCCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCC
Confidence 33333333445679999999999888877776 22 344588999999999999999999999987 457999999
Q ss_pred CcCccChhhHHHHHhhcCCC
Q psy7449 562 DLKFSQNSQLKYHLTTRHDL 581 (596)
Q Consensus 562 ~~~F~~~~~l~~H~~~~H~~ 581 (596)
+++|.....|..|+...|++
T Consensus 106 gK~F~~~~sL~~H~~~~h~~ 125 (128)
T PHA00733 106 GKEFRNTDSTLDHVCKKHNI 125 (128)
T ss_pred CCccCCHHHHHHHHHHhcCc
Confidence 99999999999999999984
No 12
>PLN03086 PRLI-interacting factor K; Provisional
Probab=99.15 E-value=1.1e-10 Score=118.39 Aligned_cols=147 Identities=18% Similarity=0.362 Sum_probs=112.3
Q ss_pred CcccCCCCCcccCHHHHHHhhhhcCCCCccccCc--cccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHh
Q psy7449 348 NYVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTH--CRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHA 425 (596)
Q Consensus 348 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~--C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~ 425 (596)
.-.|+-|....... .|..|...-.. ....|+. |+..| .+..|..| +.|+.|++.|. ...|..|+
T Consensus 407 ~V~C~NC~~~i~l~-~l~lHe~~C~r-~~V~Cp~~~Cg~v~-~r~el~~H----------~~C~~Cgk~f~-~s~LekH~ 472 (567)
T PLN03086 407 TVECRNCKHYIPSR-SIALHEAYCSR-HNVVCPHDGCGIVL-RVEEAKNH----------VHCEKCGQAFQ-QGEMEKHM 472 (567)
T ss_pred eEECCCCCCccchh-HHHHHHhhCCC-cceeCCccccccee-eccccccC----------ccCCCCCCccc-hHHHHHHH
Confidence 35799998876544 45577754332 3456884 99988 45555555 48999999996 67899998
Q ss_pred hhcccCCCCCCCcccCCcccccccCCCCccccccCCccceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCC
Q psy7449 426 LTKHDLDIKSDGIIQDAGTYILDTALSPEKRVKSRGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTC 505 (596)
Q Consensus 426 ~~~H~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C 505 (596)
..+| .++.|+ |
T Consensus 473 ~~~H--------------------------------------------------------------------kpv~Cp-C 483 (567)
T PLN03086 473 KVFH--------------------------------------------------------------------EPLQCP-C 483 (567)
T ss_pred HhcC--------------------------------------------------------------------CCccCC-C
Confidence 7743 246788 8
Q ss_pred CccccccchhhhhhhccCCCCCccCCCCCccccC----------chhhhhhhhhccCCCcccCCccCcCccChhhHHHHH
Q psy7449 506 NYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQ----------SSCLGRHMRIHTGEKPYACSLCDLKFSQNSQLKYHL 575 (596)
Q Consensus 506 ~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~----------~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l~~H~ 575 (596)
|+.+ ....|..|+.+|.+.+++.|..|++.|.. .+.|..|..+. |.+++.|..||+.|. ...|..|+
T Consensus 484 g~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~Vr-lrdm~~H~ 560 (567)
T PLN03086 484 GVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRSVM-LKEMDIHQ 560 (567)
T ss_pred CCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCeee-ehhHHHHH
Confidence 8655 56899999999999999999999999952 35799999885 999999999998754 55788998
Q ss_pred hhcCC
Q psy7449 576 TTRHD 580 (596)
Q Consensus 576 ~~~H~ 580 (596)
...|.
T Consensus 561 ~~~h~ 565 (567)
T PLN03086 561 IAVHQ 565 (567)
T ss_pred HHhhc
Confidence 88874
No 13
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.95 E-value=8e-10 Score=112.26 Aligned_cols=93 Identities=18% Similarity=0.416 Sum_probs=72.5
Q ss_pred CCcccCcCcccccChhHHHHHHhhcCCCCCcccCCCCccccChhHHHHHhhhccCCCCccCCCCCcccc----------C
Q psy7449 86 KSYECCWCTYKTAYENDLKKHQRKHTGEKPYSCGHCPYTARSSSYMLTHLKVHSGEKAFKCEYCSYSSY----------S 155 (596)
Q Consensus 86 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~----------~ 155 (596)
+.+.|+.|++.|. ...|..|++.++ +++.|+ |++.+ .+..|..|+.+|.+.+++.|..|++.|. .
T Consensus 452 ~H~~C~~Cgk~f~-~s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thCp~Kpi~C~fC~~~v~~g~~~~d~~d~ 526 (567)
T PLN03086 452 NHVHCEKCGQAFQ-QGEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTCPLRLITCRFCGDMVQAGGSAMDVRDR 526 (567)
T ss_pred cCccCCCCCCccc-hHHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccCCCCceeCCCCCCccccCccccchhhh
Confidence 3457888888885 577888888875 778888 88655 5688888888888888888888888884 1
Q ss_pred chhHHhHhhhccCCCceecCCCccccchh
Q psy7449 156 KSNLRTHMRTHIGEKPYKCSSCDFQATQK 184 (596)
Q Consensus 156 ~~~L~~H~~~h~~~~~~~C~~C~~~f~~~ 184 (596)
...|..|.... +.+++.|..||+.+..+
T Consensus 527 ~s~Lt~HE~~C-G~rt~~C~~Cgk~Vrlr 554 (567)
T PLN03086 527 LRGMSEHESIC-GSRTAPCDSCGRSVMLK 554 (567)
T ss_pred hhhHHHHHHhc-CCcceEccccCCeeeeh
Confidence 34788888775 78888888888877653
No 14
>PHA00733 hypothetical protein
Probab=98.93 E-value=9.6e-10 Score=90.64 Aligned_cols=53 Identities=21% Similarity=0.365 Sum_probs=30.2
Q ss_pred CCCcccCCCCccccChhHHHHHhhhccCCCCccCCCCCccccCchhHHhHhhhcc
Q psy7449 113 EKPYSCGHCPYTARSSSYMLTHLKVHSGEKAFKCEYCSYSSYSKSNLRTHMRTHI 167 (596)
Q Consensus 113 ~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~~h~ 167 (596)
++||.|+.|++.|.+...|..|++.+ ..+|.|+.|++.|.....|..|+...+
T Consensus 71 ~kPy~C~~Cgk~Fss~s~L~~H~r~h--~~~~~C~~CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 71 VSPYVCPLCLMPFSSSVSLKQHIRYT--EHSKVCPVCGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred CCCccCCCCCCcCCCHHHHHHHHhcC--CcCccCCCCCCccCCHHHHHHHHHHhc
Confidence 45555555665565555555555544 234566666666666666666655433
No 15
>PHA02768 hypothetical protein; Provisional
Probab=98.87 E-value=6.6e-10 Score=74.10 Aligned_cols=42 Identities=21% Similarity=0.312 Sum_probs=25.2
Q ss_pred ccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhh
Q psy7449 500 FVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCL 543 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L 543 (596)
|.|+.||+.|...++|..|+++|+ ++|+|..|++.|.+.+.|
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l 47 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEY 47 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceeccccee
Confidence 566666666666666666666665 456666666666555544
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.83 E-value=1.4e-09 Score=72.61 Aligned_cols=44 Identities=18% Similarity=0.288 Sum_probs=40.9
Q ss_pred CccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChhhHH
Q psy7449 527 PYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQLK 572 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l~ 572 (596)
.|+|++||+.|+..++|..|+++|+ ++|+|..|++.|.+...|.
T Consensus 5 ~y~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPICGEIYIKRKSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred ccCcchhCCeeccHHHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 4899999999999999999999999 7999999999999887764
No 17
>KOG1146|consensus
Probab=98.79 E-value=3.4e-09 Score=114.27 Aligned_cols=77 Identities=17% Similarity=0.235 Sum_probs=61.0
Q ss_pred cccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhhh-----------------ccCCCcccCCcc
Q psy7449 499 RFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMRI-----------------HTGEKPYACSLC 561 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~-----------------H~~~~~~~C~~C 561 (596)
.+.|..|++.|...-.+. |+-. ..+|.|..|...|.....|..|.+. +..-.+| |.+|
T Consensus 1260 e~~c~~~~~~~~~~~~~~-~l~~---~~~~~~~~~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~~c 1334 (1406)
T KOG1146|consen 1260 EGECGAVDELLTPSFGIS-TLDV---THRYLCRQCKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CLAC 1334 (1406)
T ss_pred cchhhhccccccCcccee-eccc---chhHHHHHHHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-chHH
Confidence 478999988888877777 6643 2468999999999999999888742 1223446 9999
Q ss_pred CcCccChhhHHHHHhhcCC
Q psy7449 562 DLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 562 ~~~F~~~~~l~~H~~~~H~ 580 (596)
...|.....|+.||+..+.
T Consensus 1335 ~~~~~~~~alqihm~~~~~ 1353 (1406)
T KOG1146|consen 1335 EVLLSGREALQIHMRSSAH 1353 (1406)
T ss_pred HhhcchhHHHHHHHHHhhh
Confidence 9999999999999998775
No 18
>KOG3993|consensus
Probab=98.68 E-value=1.3e-08 Score=96.33 Aligned_cols=84 Identities=20% Similarity=0.385 Sum_probs=54.6
Q ss_pred CcccCCCCCcccCHHHHHHhhhhcCCCCccccCccccccccchhhhhhhhcccC--------------------------
Q psy7449 348 NYVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTHCRYSASRRDTLDHHIQVSHL-------------------------- 401 (596)
Q Consensus 348 ~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~~H~-------------------------- 401 (596)
.|.|..|...|.+.-.|..|.-.-.-.-.|+|++|+|.|+-..+|..| ++.|.
T Consensus 267 dyiCqLCK~kYeD~F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASH-RRWHKPR~eaa~a~~~P~k~~~~~rae~~ea 345 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHRCPRIVHVEYRCPECDKVFSCPANLASH-RRWHKPRPEAAKAGSPPPKQAVETRAEVQEA 345 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhccCCeeEEeeecCCcccccccCchhhhhh-hcccCCchhhhhcCCCChhhhhhhhhhhhhc
Confidence 366666666666666666664211111236666666666666666666 44441
Q ss_pred -------CCCcccCCCCcccccchhhHHHHhhhcccCC
Q psy7449 402 -------YKKRFQCPVCDYRCTKKYNLQMHALTKHDLD 432 (596)
Q Consensus 402 -------~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~ 432 (596)
....|.|.+|++.|.+...|+.|+.++|...
T Consensus 346 ~rsg~dss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~ 383 (500)
T KOG3993|consen 346 ERSGDDSSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAP 383 (500)
T ss_pred cccCCcccCceeecHHhhhhhHHHHHHHHhHHhhhccc
Confidence 1234999999999999999999999977543
No 19
>KOG3993|consensus
Probab=98.66 E-value=8e-09 Score=97.70 Aligned_cols=171 Identities=23% Similarity=0.315 Sum_probs=106.7
Q ss_pred ccccCccccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHhhhcccCCCCCCCcccCCcccccccCCCCcc
Q psy7449 376 PFKCTHCRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALTKHDLDIKSDGIIQDAGTYILDTALSPEK 455 (596)
Q Consensus 376 ~~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~l~~~~ 455 (596)
.|.|..|...|.+...|.+|. -.-...-.|+|+.|+|.|.-..+|..|.+. |........ ...-.-+..
T Consensus 267 dyiCqLCK~kYeD~F~LAQHr-C~RIV~vEYrCPEC~KVFsCPANLASHRRW-HKPR~eaa~----a~~~P~k~~----- 335 (500)
T KOG3993|consen 267 DYICQLCKEKYEDAFALAQHR-CPRIVHVEYRCPECDKVFSCPANLASHRRW-HKPRPEAAK----AGSPPPKQA----- 335 (500)
T ss_pred HHHHHHHHHhhhhHHHHhhcc-CCeeEEeeecCCcccccccCchhhhhhhcc-cCCchhhhh----cCCCChhhh-----
Confidence 499999999999999999994 222222359999999999999999999988 543311110 000000000
Q ss_pred ccccCCccceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCC--------C-
Q psy7449 456 RVKSRGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGI--------K- 526 (596)
Q Consensus 456 ~~~~~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~--------~- 526 (596)
..++.+ +.... ++... ..+..|.|.+|+++|.+...|+.|+.+|... +
T Consensus 336 --~~~rae------------------~~ea~-rsg~d--ss~gi~~C~~C~KkFrRqAYLrKHqlthq~~~~~k~~a~~f 392 (500)
T KOG3993|consen 336 --VETRAE------------------VQEAE-RSGDD--SSSGIFSCHTCGKKFRRQAYLRKHQLTHQRAPLAKEKAPKF 392 (500)
T ss_pred --hhhhhh------------------hhhcc-ccCCc--ccCceeecHHhhhhhHHHHHHHHhHHhhhccccchhcccCc
Confidence 000000 00000 00000 2233688888888898888888887766411 0
Q ss_pred --------Cc-----------------------------cCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChh
Q psy7449 527 --------PY-----------------------------KCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNS 569 (596)
Q Consensus 527 --------~~-----------------------------~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~ 569 (596)
.+ .|++||-.+.++..--.+.+.-..+.-|.|.+|...|.+..
T Consensus 393 ~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~~sgg~~rlg~~~q~f~~ky~~atfyss~ 472 (500)
T KOG3993|consen 393 LLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSGSSGGYGRLGIAEQGFTCKYCPATFYSSP 472 (500)
T ss_pred chhhcccccccccccccccccccccccceeeeeccccccCCCCCCCCcccCCCCCccccccchhhccccccchHhhhcCc
Confidence 02 35555555555554444444333455699999999999999
Q ss_pred hHHHHHhhcCC
Q psy7449 570 QLKYHLTTRHD 580 (596)
Q Consensus 570 ~l~~H~~~~H~ 580 (596)
.|.+|+..-|.
T Consensus 473 ~ltrhin~~Hp 483 (500)
T KOG3993|consen 473 GLTRHINKCHP 483 (500)
T ss_pred chHhHhhhcCh
Confidence 99999998885
No 20
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.59 E-value=1.4e-08 Score=57.86 Aligned_cols=24 Identities=67% Similarity=1.244 Sum_probs=12.4
Q ss_pred hhhhhhhccCCCcccCCccCcCcc
Q psy7449 543 LGRHMRIHTGEKPYACSLCDLKFS 566 (596)
Q Consensus 543 L~~H~~~H~~~~~~~C~~C~~~F~ 566 (596)
|.+||++|+|++||.|++|++.|.
T Consensus 2 l~~H~~~H~~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 2 LRRHMRTHTGEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHHHSSSSSEEESSSSEEES
T ss_pred HHHHhhhcCCCCCCCCCCCcCeeC
Confidence 445555555555555555555543
No 21
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.54 E-value=2.6e-08 Score=56.71 Aligned_cols=26 Identities=38% Similarity=0.844 Sum_probs=24.0
Q ss_pred hhhhhhhccCCCCCccCCCCCccccC
Q psy7449 514 HMRQHINGHVGIKPYKCLHCSYKSTQ 539 (596)
Q Consensus 514 ~l~~H~~~H~~~~~~~C~~C~~~f~~ 539 (596)
+|..|+++|+|++||.|++|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 48899999999999999999999964
No 22
>PHA00616 hypothetical protein
Probab=98.41 E-value=6.6e-08 Score=61.22 Aligned_cols=33 Identities=15% Similarity=0.252 Sum_probs=20.0
Q ss_pred CccCCCCCccccCchhhhhhhhhccCCCcccCC
Q psy7449 527 PYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACS 559 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~ 559 (596)
||+|+.||+.|...+.|..|++.|+|++++.|+
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 456666666666666666666666666665554
No 23
>PHA00732 hypothetical protein
Probab=98.22 E-value=7.5e-07 Score=66.12 Aligned_cols=46 Identities=26% Similarity=0.457 Sum_probs=26.5
Q ss_pred cccCCCCCccccccchhhhhhhc-cCCCCCccCCCCCccccCchhhhhhhhhc
Q psy7449 499 RFVCFTCNYHTYKSGHMRQHING-HVGIKPYKCLHCSYKSTQSSCLGRHMRIH 550 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~H~~~-H~~~~~~~C~~C~~~f~~~~~L~~H~~~H 550 (596)
||.|..||+.|.+...|..|++. |. ++.|+.||+.|. .|..|++..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~---~~~C~~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT---LTKCPVCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC---CCccCCCCCEeC---Chhhhhccc
Confidence 35666666666666666666653 43 245666666665 355555443
No 24
>PHA00616 hypothetical protein
Probab=98.14 E-value=1.2e-06 Score=55.64 Aligned_cols=34 Identities=21% Similarity=0.476 Sum_probs=27.9
Q ss_pred CccCccCcccccChHHHHHHHHHhcCCCceeeCC
Q psy7449 255 PFQCKICDSTFKRKQTLDLHIKRHTGERTYKCEF 288 (596)
Q Consensus 255 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~ 288 (596)
||+|+.||+.|..++.|..|++.|++++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~hg~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVHKQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhcCCCccceeE
Confidence 5788888888888888888888888888887764
No 25
>PHA00732 hypothetical protein
Probab=98.13 E-value=1.6e-06 Score=64.35 Aligned_cols=49 Identities=27% Similarity=0.614 Sum_probs=42.4
Q ss_pred CccCCCCCccccCchhhhhhhhh-ccCCCcccCCccCcCccChhhHHHHHhhcCCC
Q psy7449 527 PYKCLHCSYKSTQSSCLGRHMRI-HTGEKPYACSLCDLKFSQNSQLKYHLTTRHDL 581 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L~~H~~~-H~~~~~~~C~~C~~~F~~~~~l~~H~~~~H~~ 581 (596)
||.|+.||+.|.+...|+.|++. |++ +.|+.||+.|. .|..|++++-|.
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~~~ 50 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQYDI 50 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccCCc
Confidence 68999999999999999999984 654 68999999998 599999777654
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=98.00 E-value=4.9e-06 Score=57.35 Aligned_cols=50 Identities=32% Similarity=0.581 Sum_probs=31.9
Q ss_pred cccCccccccccchhhhhhhhcccCCC-CcccCCCCcccccchhhHHHHhhhcc
Q psy7449 377 FKCTHCRYSASRRDTLDHHIQVSHLYK-KRFQCPVCDYRCTKKYNLQMHALTKH 429 (596)
Q Consensus 377 ~~C~~C~~~f~~~~~L~~H~~~~H~~~-~~~~C~~C~~~f~~~~~l~~H~~~~H 429 (596)
|.|++|++ -.+...|..|+...|..+ +.+.||+|...++ .+|..|+...|
T Consensus 3 f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 66777777 345566777766666554 3577777776544 37777776655
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.94 E-value=7.7e-06 Score=56.36 Aligned_cols=50 Identities=28% Similarity=0.572 Sum_probs=29.5
Q ss_pred ccCCCCCccccCchhhhhhhh-hccCC-CcccCCccCcCccChhhHHHHHhhcCC
Q psy7449 528 YKCLHCSYKSTQSSCLGRHMR-IHTGE-KPYACSLCDLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L~~H~~-~H~~~-~~~~C~~C~~~F~~~~~l~~H~~~~H~ 580 (596)
|.|++||+. .+...|..|.. .|..+ +.+.|++|...+. .+|..||...|+
T Consensus 3 f~CP~C~~~-~~~~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H~ 54 (54)
T PF05605_consen 3 FTCPYCGKG-FSESSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQHR 54 (54)
T ss_pred cCCCCCCCc-cCHHHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhcC
Confidence 666666663 34556666654 34433 3466777766544 367777776663
No 28
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.59 E-value=1.7e-05 Score=43.86 Aligned_cols=18 Identities=44% Similarity=0.955 Sum_probs=7.0
Q ss_pred CCccCcCccChhhHHHHH
Q psy7449 558 CSLCDLKFSQNSQLKYHL 575 (596)
Q Consensus 558 C~~C~~~F~~~~~l~~H~ 575 (596)
|+.|++.|.++..|..|+
T Consensus 3 C~~C~~~f~~~~~l~~H~ 20 (23)
T PF00096_consen 3 CPICGKSFSSKSNLKRHM 20 (23)
T ss_dssp ETTTTEEESSHHHHHHHH
T ss_pred CCCCCCccCCHHHHHHHH
Confidence 333333333333333333
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.48 E-value=6.3e-05 Score=41.53 Aligned_cols=23 Identities=39% Similarity=1.009 Sum_probs=15.3
Q ss_pred ccCccCcccccChHHHHHHHHHh
Q psy7449 256 FQCKICDSTFKRKQTLDLHIKRH 278 (596)
Q Consensus 256 ~~C~~C~~~f~~~~~l~~H~~~h 278 (596)
|.|+.|++.|.++..|..|++.|
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHHH
T ss_pred CCCCCCCCccCCHHHHHHHHhHC
Confidence 45667777777777777776653
No 30
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.45 E-value=6.3e-05 Score=42.02 Aligned_cols=24 Identities=33% Similarity=0.883 Sum_probs=14.9
Q ss_pred ccCCccCcCccChhhHHHHHhhcC
Q psy7449 556 YACSLCDLKFSQNSQLKYHLTTRH 579 (596)
Q Consensus 556 ~~C~~C~~~F~~~~~l~~H~~~~H 579 (596)
|.|++|++.|.+...|..|++++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 466677777777777777766655
No 31
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.41 E-value=7.7e-05 Score=59.64 Aligned_cols=74 Identities=18% Similarity=0.374 Sum_probs=22.3
Q ss_pred cCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChhhHHHHHhhcC
Q psy7449 501 VCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQLKYHLTTRH 579 (596)
Q Consensus 501 ~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~l~~H~~~~H 579 (596)
+|..|+..|.+...|..|+...++...- ....+.....|..+++.-. ...+.|.+|++.|.+...|..||+.++
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~----~~~~l~~~~~~~~~~~~~~-~~~~~C~~C~~~f~s~~~l~~Hm~~~~ 74 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIP----DQKYLVDPNRLLNYLRKKV-KESFRCPYCNKTFRSREALQEHMRSKH 74 (100)
T ss_dssp -----------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHTT
T ss_pred Cccccccccccccccccccccccccccc----ccccccccccccccccccc-CCCCCCCccCCCCcCHHHHHHHHcCcc
Confidence 4899999999999999999765543211 1222224444555544322 236899999999999999999999764
No 32
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.27 E-value=0.00013 Score=42.11 Aligned_cols=26 Identities=31% Similarity=0.627 Sum_probs=15.6
Q ss_pred cccCCccCcCccChhhHHHHHhhcCC
Q psy7449 555 PYACSLCDLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 555 ~~~C~~C~~~F~~~~~l~~H~~~~H~ 580 (596)
||.|..|++.|.+...|..|++.+++
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhcC
Confidence 45666666666666666666655543
No 33
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.26 E-value=9.5e-05 Score=67.84 Aligned_cols=55 Identities=29% Similarity=0.589 Sum_probs=46.0
Q ss_pred CCCccccCc--cccccccchhhhhhhhccc------------------CCCCcccCCCCcccccchhhHHHHhhh
Q psy7449 373 GEKPFKCTH--CRYSASRRDTLDHHIQVSH------------------LYKKRFQCPVCDYRCTKKYNLQMHALT 427 (596)
Q Consensus 373 ~~k~~~C~~--C~~~f~~~~~L~~H~~~~H------------------~~~~~~~C~~C~~~f~~~~~l~~H~~~ 427 (596)
++|||+|++ |++.+.+...|..|+..-| ...+||.|++|+|.+.+...|+.|...
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~H 420 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKH 420 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccc
Confidence 358999988 9999999999999976555 134789999999999999999998653
No 34
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.22 E-value=0.00012 Score=67.28 Aligned_cols=54 Identities=41% Similarity=0.795 Sum_probs=34.3
Q ss_pred CCccCCC--CCccccCchhhhhhhh-hc------------------cCCCcccCCccCcCccChhhHHHHHhhcC
Q psy7449 526 KPYKCLH--CSYKSTQSSCLGRHMR-IH------------------TGEKPYACSLCDLKFSQNSQLKYHLTTRH 579 (596)
Q Consensus 526 ~~~~C~~--C~~~f~~~~~L~~H~~-~H------------------~~~~~~~C~~C~~~F~~~~~l~~H~~~~H 579 (596)
|||+|++ |.|.+.....|+.|+. -| ..++||+|++|++.+....-|+.|...-|
T Consensus 348 KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYHr~Hsh 422 (423)
T COG5189 348 KPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYHRKHSH 422 (423)
T ss_pred ceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceecccccC
Confidence 5555554 5555555555555543 11 23578888888888888888888866443
No 35
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.15 E-value=0.00025 Score=39.51 Aligned_cols=24 Identities=42% Similarity=0.822 Sum_probs=14.1
Q ss_pred ccCCCCcccccchhhHHHHhhhcc
Q psy7449 406 FQCPVCDYRCTKKYNLQMHALTKH 429 (596)
Q Consensus 406 ~~C~~C~~~f~~~~~l~~H~~~~H 429 (596)
|.|++|++.|.+...|..|+.++|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 456666666666666666666543
No 36
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.10 E-value=0.00018 Score=41.54 Aligned_cols=24 Identities=33% Similarity=0.668 Sum_probs=13.3
Q ss_pred CccCCCCCccccCchhhhhhhhhc
Q psy7449 527 PYKCLHCSYKSTQSSCLGRHMRIH 550 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L~~H~~~H 550 (596)
||+|+.|++.|.+...|..|++.|
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTT
T ss_pred CCCCCccCCccCChhHHHHHhHHh
Confidence 355555555555555555555544
No 37
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.06 E-value=0.00033 Score=55.90 Aligned_cols=24 Identities=38% Similarity=0.890 Sum_probs=14.7
Q ss_pred CccCCCCCccccCchhHHhHhhhc
Q psy7449 143 AFKCEYCSYSSYSKSNLRTHMRTH 166 (596)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~~h 166 (596)
.+.|.+|++.|.+...|..|++.+
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHT
T ss_pred CCCCCccCCCCcCHHHHHHHHcCc
Confidence 466666666666666666666653
No 38
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.47 E-value=0.0015 Score=42.41 Aligned_cols=30 Identities=33% Similarity=0.452 Sum_probs=12.2
Q ss_pred CCCccCCCCCccccCchhhhhhhhhccCCC
Q psy7449 525 IKPYKCLHCSYKSTQSSCLGRHMRIHTGEK 554 (596)
Q Consensus 525 ~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~ 554 (596)
+.|-.|++|+..+++..+|++|+.++++.+
T Consensus 22 ~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 22 EQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp S--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred CCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 344455555555555555555554444433
No 39
>KOG2231|consensus
Probab=96.45 E-value=0.002 Score=67.27 Aligned_cols=22 Identities=18% Similarity=0.362 Sum_probs=10.9
Q ss_pred ccCCCCCcccCHHHHHHhhhhc
Q psy7449 350 VCYTCDYHTGVRECMKKHIRTH 371 (596)
Q Consensus 350 ~C~~C~~~f~~~~~l~~H~~~h 371 (596)
.|..|...|-....|.+|++.+
T Consensus 184 ~C~~C~~~fld~~el~rH~~~~ 205 (669)
T KOG2231|consen 184 LCKFCHERFLDDDELYRHLRFD 205 (669)
T ss_pred cchhhhhhhccHHHHHHhhccc
Confidence 3455555555555555555443
No 40
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.34 E-value=0.002 Score=36.49 Aligned_cols=18 Identities=39% Similarity=0.746 Sum_probs=6.8
Q ss_pred CCccCcCccChhhHHHHH
Q psy7449 558 CSLCDLKFSQNSQLKYHL 575 (596)
Q Consensus 558 C~~C~~~F~~~~~l~~H~ 575 (596)
|..|++.|.+...|..|+
T Consensus 3 C~~C~~~f~~~~~l~~H~ 20 (26)
T smart00355 3 CPECGKVFKSKSALKEHM 20 (26)
T ss_pred CCCCcchhCCHHHHHHHH
Confidence 333333333333333333
No 41
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.30 E-value=0.0015 Score=36.27 Aligned_cols=24 Identities=42% Similarity=0.755 Sum_probs=14.5
Q ss_pred ccCCccCcCccChhhHHHHHhhcCC
Q psy7449 556 YACSLCDLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 556 ~~C~~C~~~F~~~~~l~~H~~~~H~ 580 (596)
|+|+.|++... +..|.+|++.+|+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 56777776665 6667777766653
No 42
>PRK04860 hypothetical protein; Provisional
Probab=96.19 E-value=0.0018 Score=55.64 Aligned_cols=39 Identities=23% Similarity=0.461 Sum_probs=24.9
Q ss_pred CcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCc
Q psy7449 498 FRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQS 540 (596)
Q Consensus 498 ~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~ 540 (596)
.+|.|. |+. ....++.|.++|+++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 356776 665 556666666666666666666666666543
No 43
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.07 E-value=0.0055 Score=39.92 Aligned_cols=39 Identities=18% Similarity=0.294 Sum_probs=23.6
Q ss_pred hhhhhcccCCCCcccCCCCcccccchhhHHHHhhhcccC
Q psy7449 393 DHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALTKHDL 431 (596)
Q Consensus 393 ~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~ 431 (596)
..+.+..+..+.|-.|++|+..+....+|++|+...|..
T Consensus 12 ~~~~k~~~~S~~PatCP~C~a~~~~srnLrRHle~~H~~ 50 (54)
T PF09237_consen 12 TKKPKSKSQSEQPATCPICGAVIRQSRNLRRHLEIRHFK 50 (54)
T ss_dssp ----CCCCTTS--EE-TTT--EESSHHHHHHHHHHHTTT
T ss_pred hhHHHHhhccCCCCCCCcchhhccchhhHHHHHHHHhcc
Confidence 445555566677888888888888888888888776754
No 44
>KOG2231|consensus
Probab=95.97 E-value=0.0068 Score=63.38 Aligned_cols=147 Identities=20% Similarity=0.317 Sum_probs=85.4
Q ss_pred cccCccccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHhhhcccCCCCCCCcccCCcccccccCCCCccc
Q psy7449 377 FKCTHCRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALTKHDLDIKSDGIIQDAGTYILDTALSPEKR 456 (596)
Q Consensus 377 ~~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~~l~~~~~ 456 (596)
+.|.+|++.|.-.. ..-.|..| -.|.+...|+.|+...|..- .-..|..-.+.|...
T Consensus 100 ~~C~~C~~~~~~~~-------------~~~~~~~c-~~~~s~~~Lk~H~~~~H~~~-~c~lC~~~~kif~~e-------- 156 (669)
T KOG2231|consen 100 HSCHICDRRFRALY-------------NKKECLHC-TEFKSVENLKNHMRDQHKLH-LCSLCLQNLKIFINE-------- 156 (669)
T ss_pred hhcCccccchhhhc-------------ccCCCccc-cchhHHHHHHHHHHHhhhhh-ccccccccceeeeee--------
Confidence 78999998873211 12368888 88889999999998878543 111222222222211
Q ss_pred cccCCccceecCCCCcccccchhhhccccccccCCCCCCC-CC----cccCCCCCccccccchhhhhhhccCCCCCccCC
Q psy7449 457 VKSRGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDG-SF----RFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCL 531 (596)
Q Consensus 457 ~~~~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~-~~----~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~ 531 (596)
++.+. ..+|..|...... +. --.|..|...|.....|..|++.++ |.|.
T Consensus 157 --------------~k~Yt--------~~el~~h~~~gd~d~~s~rGhp~C~~C~~~fld~~el~rH~~~~h----~~ch 210 (669)
T KOG2231|consen 157 --------------RKLYT--------RAELNLHLMFGDPDDESCRGHPLCKFCHERFLDDDELYRHLRFDH----EFCH 210 (669)
T ss_pred --------------eeheh--------HHHHHHHHhcCCCccccccCCccchhhhhhhccHHHHHHhhccce----ehee
Confidence 11111 1222222111110 00 1369999999999999999998765 6677
Q ss_pred CCC------ccccCchhhhhhhhhccCCCcccCC--ccCc-CccChhhHHHHHh
Q psy7449 532 HCS------YKSTQSSCLGRHMRIHTGEKPYACS--LCDL-KFSQNSQLKYHLT 576 (596)
Q Consensus 532 ~C~------~~f~~~~~L~~H~~~H~~~~~~~C~--~C~~-~F~~~~~l~~H~~ 576 (596)
.|. .-|.....|..|.+.++ |.|+ .|.. .|...-.+..|++
T Consensus 211 fC~~~~~~neyy~~~~dLe~HfR~~H----flCE~~~C~~~~f~~~~~~ei~lk 260 (669)
T KOG2231|consen 211 FCDYKTGQNEYYNDYDDLEEHFRKGH----FLCEEEFCRTKKFYVAFELEIELK 260 (669)
T ss_pred ecCcccccchhcccchHHHHHhhhcC----ccccccccccceeeehhHHHHHHH
Confidence 774 45667778888887655 7787 5643 3333324444444
No 45
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.87 E-value=0.0086 Score=33.74 Aligned_cols=23 Identities=35% Similarity=0.857 Sum_probs=15.7
Q ss_pred ccCccCcccccChHHHHHHHHHh
Q psy7449 256 FQCKICDSTFKRKQTLDLHIKRH 278 (596)
Q Consensus 256 ~~C~~C~~~f~~~~~l~~H~~~h 278 (596)
|.|+.|++.|.+...|..|++.|
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHh
Confidence 45677777777777777776655
No 46
>PRK04860 hypothetical protein; Provisional
Probab=95.81 E-value=0.0045 Score=53.18 Aligned_cols=40 Identities=30% Similarity=0.706 Sum_probs=35.3
Q ss_pred CccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCccChhh
Q psy7449 527 PYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFSQNSQ 570 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~~~~~ 570 (596)
+|.|. |+. ....+++|.++|+|+++|.|..|+..|.....
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~g~~~YrC~~C~~~l~~~~~ 158 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVRGEAVYRCRRCGETLVFKGE 158 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhcCCccEECCCCCceeEEecc
Confidence 69998 998 77889999999999999999999998876543
No 47
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.76 E-value=0.007 Score=56.58 Aligned_cols=133 Identities=21% Similarity=0.342 Sum_probs=87.5
Q ss_pred ccCcc--CcccccChHHHHHHHHHhcCCCceeeCCCC---Cccc------ccccccccccccccccccCCcccccchhhH
Q psy7449 256 FQCKI--CDSTFKRKQTLDLHIKRHTGERTYKCEFCD---YTAA------RKDTLGVHVRIKHKISICQNCGLILSTITE 324 (596)
Q Consensus 256 ~~C~~--C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~---~~f~------~~~~l~~H~~~~h~~~~C~~C~~~~~~~~~ 324 (596)
|.|+. |.........|..|.+.-++. +.|.+|- +.|. ++..|..|...+-.. ..|
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~~H~~--~~C~~C~~nKk~F~~E~~lF~~~~Lr~H~~~G~~e-------~GF----- 217 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKAQHGF--VLCSECIGNKKDFWNEIRLFRSSTLRDHKNGGLEE-------EGF----- 217 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHhhcCc--EEhHhhhcCcccCccceeeeecccccccccCCccc-------cCc-----
Confidence 67765 666666778888888765442 6777774 2333 233444443321100 000
Q ss_pred HHHHHhhhcCCCCCCCCCCCCCCCcccCCCCCcccCHHHHHHhhhhcCCCCccccCcccc-------ccccchhhhhhhh
Q psy7449 325 ELLEHCRICSGKERPDSSTDKSFNYVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTHCRY-------SASRRDTLDHHIQ 397 (596)
Q Consensus 325 ~l~~h~~~~~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~-------~f~~~~~L~~H~~ 397 (596)
..--.|..|...|-+...|..|++..+. .|.+|++ -|.+...|..|.+
T Consensus 218 ---------------------KGHP~C~FC~~~FYdDDEL~~HcR~~HE----~ChICD~v~p~~~QYFK~Y~~Le~HF~ 272 (493)
T COG5236 218 ---------------------KGHPLCIFCKIYFYDDDELRRHCRLRHE----ACHICDMVGPIRYQYFKSYEDLEAHFR 272 (493)
T ss_pred ---------------------CCCchhhhccceecChHHHHHHHHhhhh----hhhhhhccCccchhhhhCHHHHHHHhh
Confidence 0114699999999999999999986552 4666654 3888899999987
Q ss_pred cccCCCCcccCCC--Cc----ccccchhhHHHHhhhcccCC
Q psy7449 398 VSHLYKKRFQCPV--CD----YRCTKKYNLQMHALTKHDLD 432 (596)
Q Consensus 398 ~~H~~~~~~~C~~--C~----~~f~~~~~l~~H~~~~H~~~ 432 (596)
..| |.|.. |- ..|.....|..|+..-|+..
T Consensus 273 ~~h-----y~ct~qtc~~~k~~vf~~~~el~~h~~~~h~~~ 308 (493)
T COG5236 273 NAH-----YCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVN 308 (493)
T ss_pred cCc-----eEEEEEEEecCcEEEeccHHHHHHHHHHHhhcc
Confidence 666 66644 42 46889999999998888754
No 48
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.57 E-value=0.0053 Score=34.47 Aligned_cols=22 Identities=32% Similarity=0.865 Sum_probs=14.3
Q ss_pred ccCCccCcCccChhhHHHHHhh
Q psy7449 556 YACSLCDLKFSQNSQLKYHLTT 577 (596)
Q Consensus 556 ~~C~~C~~~F~~~~~l~~H~~~ 577 (596)
|.|++|++.|.+...|..|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 4566666666666666666654
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.56 E-value=0.0044 Score=34.41 Aligned_cols=22 Identities=55% Similarity=1.120 Sum_probs=10.6
Q ss_pred ccCCCCCccccCchhhhhhhhhc
Q psy7449 528 YKCLHCSYKSTQSSCLGRHMRIH 550 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L~~H~~~H 550 (596)
|+|+.|+.... ...|.+|+++|
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 44555555554 55555555544
No 50
>KOG1146|consensus
Probab=95.54 E-value=0.006 Score=67.58 Aligned_cols=53 Identities=23% Similarity=0.432 Sum_probs=47.2
Q ss_pred ccCccccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHhhhccc
Q psy7449 378 KCTHCRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALTKHD 430 (596)
Q Consensus 378 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 430 (596)
.|.-|+..+.....+..|++..|...+.|.|+.|+..|.....|..|++..|.
T Consensus 438 e~~~~e~~~~s~r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp 490 (1406)
T KOG1146|consen 438 ELTKAEPLLESKRSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHP 490 (1406)
T ss_pred cccchhhhhhhhcccccceeeeecccccccCCccchhhhhHHHhhhccccccc
Confidence 46667788888889999999899989999999999999999999999999774
No 51
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.52 E-value=0.0047 Score=34.67 Aligned_cols=22 Identities=32% Similarity=0.628 Sum_probs=12.0
Q ss_pred ccCCCCCccccccchhhhhhhc
Q psy7449 500 FVCFTCNYHTYKSGHMRQHING 521 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~ 521 (596)
|.|.+|++.|.+...|+.|++.
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTT
T ss_pred CCCCCCCCCcCCHHHHHHHHCc
Confidence 3455555555555555555543
No 52
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=95.29 E-value=0.015 Score=54.56 Aligned_cols=73 Identities=26% Similarity=0.518 Sum_probs=55.3
Q ss_pred cCCCCCccccccchhhhhhhccCCCCCccCCCCCc-------cccCchhhhhhhhhccCCCcccCCc--cC----cCccC
Q psy7449 501 VCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSY-------KSTQSSCLGRHMRIHTGEKPYACSL--CD----LKFSQ 567 (596)
Q Consensus 501 ~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~-------~f~~~~~L~~H~~~H~~~~~~~C~~--C~----~~F~~ 567 (596)
.|..|...|.+-..|..|+|.-+ =.|.+|++ -|.....|..|.+.- -|.|.+ |- ..|..
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~H----E~ChICD~v~p~~~QYFK~Y~~Le~HF~~~----hy~ct~qtc~~~k~~vf~~ 293 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRH----EACHICDMVGPIRYQYFKSYEDLEAHFRNA----HYCCTFQTCRVGKCYVFPY 293 (493)
T ss_pred hhhhccceecChHHHHHHHHhhh----hhhhhhhccCccchhhhhCHHHHHHHhhcC----ceEEEEEEEecCcEEEecc
Confidence 59999999999999999998533 24667765 366777888887632 266643 42 46888
Q ss_pred hhhHHHHHhhcCCC
Q psy7449 568 NSQLKYHLTTRHDL 581 (596)
Q Consensus 568 ~~~l~~H~~~~H~~ 581 (596)
..+|+.|+...|+.
T Consensus 294 ~~el~~h~~~~h~~ 307 (493)
T COG5236 294 HTELLEHLTRFHKV 307 (493)
T ss_pred HHHHHHHHHHHhhc
Confidence 99999999999963
No 53
>KOG2785|consensus
Probab=95.22 E-value=0.026 Score=54.17 Aligned_cols=26 Identities=15% Similarity=0.270 Sum_probs=19.3
Q ss_pred cccCCCCCccccccchhhhhhhccCC
Q psy7449 499 RFVCFTCNYHTYKSGHMRQHINGHVG 524 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~H~~~H~~ 524 (596)
|-.|.+|++.|.+...-..||..++|
T Consensus 166 Pt~CLfC~~~~k~~e~~~~HM~~~Hg 191 (390)
T KOG2785|consen 166 PTDCLFCDKKSKSLEENLKHMFKEHG 191 (390)
T ss_pred CcceeecCCCcccHHHHHHHHhhccC
Confidence 45678888888777777788876665
No 54
>KOG2785|consensus
Probab=95.16 E-value=0.016 Score=55.65 Aligned_cols=54 Identities=19% Similarity=0.327 Sum_probs=43.0
Q ss_pred CccccCccccccccchhhhhhhhcccCCCC----------------------cccCCCCc---ccccchhhHHHHhhhc
Q psy7449 375 KPFKCTHCRYSASRRDTLDHHIQVSHLYKK----------------------RFQCPVCD---YRCTKKYNLQMHALTK 428 (596)
Q Consensus 375 k~~~C~~C~~~f~~~~~L~~H~~~~H~~~~----------------------~~~C~~C~---~~f~~~~~l~~H~~~~ 428 (596)
-|-.|-.|++.|.+...-..||...|+--- -|.|-.|. +.|.+....+.||..+
T Consensus 165 ~Pt~CLfC~~~~k~~e~~~~HM~~~HgffIPdreYL~D~~GLl~YLgeKV~~~~~CL~CN~~~~~f~sleavr~HM~~K 243 (390)
T KOG2785|consen 165 IPTDCLFCDKKSKSLEENLKHMFKEHGFFIPDREYLTDEKGLLKYLGEKVGIGFICLFCNELGRPFSSLEAVRAHMRDK 243 (390)
T ss_pred CCcceeecCCCcccHHHHHHHHhhccCCcCCchHhhhchhHHHHHHHHHhccCceEEEeccccCcccccHHHHHHHhhc
Confidence 356788888888888888888887775322 27888888 9999999999999764
No 55
>KOG2482|consensus
Probab=95.14 E-value=0.04 Score=51.79 Aligned_cols=23 Identities=35% Similarity=0.733 Sum_probs=21.0
Q ss_pred CccCCCCCccccCchhHHhHhhh
Q psy7449 143 AFKCEYCSYSSYSKSNLRTHMRT 165 (596)
Q Consensus 143 ~~~C~~C~~~f~~~~~L~~H~~~ 165 (596)
.++|-+|.+.|.++..|+.||+.
T Consensus 195 r~~CLyCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCLYCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheeeeeccccCCcHHHHHHHHh
Confidence 47899999999999999999975
No 56
>KOG2482|consensus
Probab=95.09 E-value=0.028 Score=52.76 Aligned_cols=181 Identities=18% Similarity=0.259 Sum_probs=89.3
Q ss_pred HHHHhhhhhcC-CCCCccCccCccccc-ChHHHHHHHHHhcCCCceeeCCCCCcccccccccccccccccccccCCcccc
Q psy7449 241 SRMIKHIRNHT-GERPFQCKICDSTFK-RKQTLDLHIKRHTGERTYKCEFCDYTAARKDTLGVHVRIKHKISICQNCGLI 318 (596)
Q Consensus 241 ~~l~~H~~~h~-~~~~~~C~~C~~~f~-~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~~h~~~~C~~C~~~ 318 (596)
..|..+++.-. .....+|-.|+..+. +++....|+-.-++-.. . =.-.......|..|++..-....|-.|.+.
T Consensus 129 eaLeqqQ~Eredt~fslqClFCn~e~lgnRs~~l~Hlf~~H~lni---G-lpDniVyvnelLehLkekL~r~~CLyCeki 204 (423)
T KOG2482|consen 129 EALEQQQKEREDTIFSLQCLFCNNEGLGNRSEILEHLFHVHGLNI---G-LPDNIVYVNELLEHLKEKLERLRCLYCEKI 204 (423)
T ss_pred HHHHHHHHHhcCCeeeeEEEEecchhcccHHHHHHHHHHHhhhcc---C-CCcceeeHHHHHHHHHHHHhhheeeeeccc
Confidence 34555554332 123457888877665 45666677543222100 0 000112233455666665556677777777
Q ss_pred cchhhHHHHHHhhhcCC-CCCCCCCCCCCCCccc--CCCCCcccC-HHHHH-----Hhhhh--------cCCCCc--ccc
Q psy7449 319 LSTITEELLEHCRICSG-KERPDSSTDKSFNYVC--YTCDYHTGV-RECMK-----KHIRT--------HTGEKP--FKC 379 (596)
Q Consensus 319 ~~~~~~~l~~h~~~~~~-~~~~~~~~~~~~~~~C--~~C~~~f~~-~~~l~-----~H~~~--------h~~~k~--~~C 379 (596)
|.. +..|..||+.-.- +..|....-..+ |.= ..=|++... .+.+. .+... -.+..+ ..|
T Consensus 205 frd-kntLkeHMrkK~HrrinPknreYDkf-yiINY~ev~ks~t~~~~e~dret~~d~~E~D~~wsDw~ed~a~a~~v~C 282 (423)
T KOG2482|consen 205 FRD-KNTLKEHMRKKRHRRINPKNREYDKF-YIINYLEVGKSWTIVHSEDDRETNEDINETDDTWSDWNEDDAEALSVVC 282 (423)
T ss_pred cCC-cHHHHHHHHhccCcccCCCccccceE-EEEeHhhcCCccchhhhhhhhhhhccccccccchhhhhcCCCCccceEE
Confidence 777 7777777764221 112211110000 000 000111100 00000 00000 001112 578
Q ss_pred CccccccccchhhhhhhhcccCC--------------------------CCcccCCCCcccccchhhHHHHhhh
Q psy7449 380 THCRYSASRRDTLDHHIQVSHLY--------------------------KKRFQCPVCDYRCTKKYNLQMHALT 427 (596)
Q Consensus 380 ~~C~~~f~~~~~L~~H~~~~H~~--------------------------~~~~~C~~C~~~f~~~~~l~~H~~~ 427 (596)
-.|.....+...|..||..+|.- ...-.|-.|+-.|..+..|..||..
T Consensus 283 LfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 283 LFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred EeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 88888888888888888777721 1124678888888888888888865
No 57
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.86 E-value=0.012 Score=33.75 Aligned_cols=22 Identities=23% Similarity=0.585 Sum_probs=13.7
Q ss_pred ccCCCCCccccccchhhhhhhc
Q psy7449 500 FVCFTCNYHTYKSGHMRQHING 521 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~ 521 (596)
|.|..|++.|.+...|..|+++
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 4566666666666666666653
No 58
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=94.68 E-value=0.016 Score=33.24 Aligned_cols=22 Identities=23% Similarity=0.466 Sum_probs=15.8
Q ss_pred ccCCCCCccccCchhhhhhhhh
Q psy7449 528 YKCLHCSYKSTQSSCLGRHMRI 549 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L~~H~~~ 549 (596)
|.|..|++.|.+...|..|++.
T Consensus 2 ~~C~~C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDACDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTTTTBBBSSHHHHHCCTTS
T ss_pred CCcccCCCCcCCHHHHHHHHcc
Confidence 5677777777777777777653
No 59
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=93.49 E-value=0.1 Score=54.42 Aligned_cols=158 Identities=22% Similarity=0.361 Sum_probs=83.3
Q ss_pred CccccCccccccccchhhhhhhhc-ccCCC--CcccCC--CCcccccchhhHHHHhhhcccCCCCCCCcccCCccccccc
Q psy7449 375 KPFKCTHCRYSASRRDTLDHHIQV-SHLYK--KRFQCP--VCDYRCTKKYNLQMHALTKHDLDIKSDGIIQDAGTYILDT 449 (596)
Q Consensus 375 k~~~C~~C~~~f~~~~~L~~H~~~-~H~~~--~~~~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~~~~~~~~~~~~~~~ 449 (596)
.++.|..|...|.....|..|... .|.++ +++.|+ .|++.|.....+..|...+.........-..+...+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 356677777777777777777543 56666 677777 5777777777777776663332211111111111111100
Q ss_pred CCCCccccccCCccceecCCCCcccccchhhhccccccccCCCCCCCCCcccC--CCCCccccccchhhhhhhccCCCC-
Q psy7449 450 ALSPEKRVKSRGHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVC--FTCNYHTYKSGHMRQHINGHVGIK- 526 (596)
Q Consensus 450 ~l~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C--~~C~~~f~~~~~l~~H~~~H~~~~- 526 (596)
.-..... + ...........+.+ ..|-..+.....+..|...|...+
T Consensus 368 ~~~~~~~------------------------------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (467)
T COG5048 368 NNEPPQS------------------------------L-QQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRP 416 (467)
T ss_pred CCCCccc------------------------------h-hhccCccCCccccccccchhhhhccccccccccccccccCC
Confidence 0000000 0 00000011111222 224455666666667766666555
Q ss_pred -CccCCCCCccccCchhhhhhhhhccCCCcccCCccCc
Q psy7449 527 -PYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDL 563 (596)
Q Consensus 527 -~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~ 563 (596)
.+.+..|++.|.....|..|++.|....++.|..++.
T Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (467)
T COG5048 417 YNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKS 454 (467)
T ss_pred cCCCCCcchhhccCcccccccccccccCCceeeccccc
Confidence 4566778888888888888888887777666655443
No 60
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.62 E-value=0.07 Score=55.68 Aligned_cols=154 Identities=24% Similarity=0.321 Sum_probs=90.4
Q ss_pred CCcccCCCCccccChhHHHHHhh--hccCC--CCccCC--CCCccccCchhHHhHhhhccCCCceecCCCc--cccchhH
Q psy7449 114 KPYSCGHCPYTARSSSYMLTHLK--VHSGE--KAFKCE--YCSYSSYSKSNLRTHMRTHIGEKPYKCSSCD--FQATQKI 185 (596)
Q Consensus 114 ~~~~C~~C~~~f~~~~~l~~H~~--~h~~~--~~~~C~--~C~~~f~~~~~L~~H~~~h~~~~~~~C~~C~--~~f~~~~ 185 (596)
.++.|..|...|.....|..|.+ .|.++ +++.|+ .|++.|.....+..|...|.+..++.+..-. ..+....
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 367 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367 (467)
T ss_pred cCCCCccccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccCCCccccccccCcccccccc
Confidence 46778888888888888888888 78888 888888 7888888888888888888877766554322 1111111
Q ss_pred hhhcccccccccCCCccccchhhhhhhhcccCCCCCCCCCceeec--CCCCccCCChHHHHhhhhhcCCCC--CccCccC
Q psy7449 186 QVKKHVLIICQHCDQSLFGTTQQLADHCRTCDKMPRPDRSYQYQC--YSCEYHSLYSSRMIKHIRNHTGER--PFQCKIC 261 (596)
Q Consensus 186 ~l~~H~~~~c~~c~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~C--~~C~~~f~~~~~l~~H~~~h~~~~--~~~C~~C 261 (596)
.-..+ +.+... ........+.+ ..|...+.....+..|...+.... .+.+..|
T Consensus 368 ~~~~~----------------~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (467)
T COG5048 368 NNEPP----------------QSLQQY-------KDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPC 424 (467)
T ss_pred CCCCc----------------cchhhc-------cCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcc
Confidence 10000 000000 00011111111 223444444455555555554443 4566778
Q ss_pred cccccChHHHHHHHHHhcCCCceeeCCCC
Q psy7449 262 DSTFKRKQTLDLHIKRHTGERTYKCEFCD 290 (596)
Q Consensus 262 ~~~f~~~~~l~~H~~~h~~~~~~~C~~C~ 290 (596)
.+.|.....+..|++.|....++.|..+.
T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (467)
T COG5048 425 SKSFNRHYNLIPHKKIHTNHAPLLCSILK 453 (467)
T ss_pred hhhccCcccccccccccccCCceeecccc
Confidence 88888888888888877766655554433
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=92.16 E-value=0.11 Score=29.04 Aligned_cols=21 Identities=33% Similarity=0.468 Sum_probs=15.7
Q ss_pred cccccccccccCchHHHhHHhh
Q psy7449 60 NTCCICGHVESGAWRLKRHFLK 81 (596)
Q Consensus 60 ~~C~~C~~~f~~~~~l~~H~~~ 81 (596)
..|+.||+.| ...+|..|+.+
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHHh
Confidence 4688888888 56778888654
No 62
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.58 E-value=0.17 Score=28.21 Aligned_cols=20 Identities=35% Similarity=0.747 Sum_probs=12.6
Q ss_pred ccCccCcccccChHHHHHHHH
Q psy7449 256 FQCKICDSTFKRKQTLDLHIK 276 (596)
Q Consensus 256 ~~C~~C~~~f~~~~~l~~H~~ 276 (596)
..|+.||+.| ..+.|..|+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3566777777 5566666654
No 63
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.26 E-value=0.15 Score=31.26 Aligned_cols=24 Identities=25% Similarity=0.746 Sum_probs=19.3
Q ss_pred cccCCccCcCccChhhHHHHHhhc
Q psy7449 555 PYACSLCDLKFSQNSQLKYHLTTR 578 (596)
Q Consensus 555 ~~~C~~C~~~F~~~~~l~~H~~~~ 578 (596)
+|.|++|++.|.+...+..|++..
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~gk 26 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLKGK 26 (35)
T ss_pred CeEccccCCccCCHHHHHHHHChH
Confidence 577888888888888888887643
No 64
>KOG2893|consensus
Probab=90.07 E-value=0.12 Score=45.86 Aligned_cols=48 Identities=23% Similarity=0.414 Sum_probs=41.1
Q ss_pred cccCCCCCcccCHHHHHHhhhhcCCCCccccCccccccccchhhhhhhhccc
Q psy7449 349 YVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTHCRYSASRRDTLDHHIQVSH 400 (596)
Q Consensus 349 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~~H 400 (596)
-.|.+|+..|.+...|..|++. |-|+|.+|.+...+--.|..|...+|
T Consensus 11 pwcwycnrefddekiliqhqka----khfkchichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 11 PWCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred ceeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceeehhhhh
Confidence 4699999999999999999875 44999999999888888888877666
No 65
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.95 E-value=0.3 Score=29.46 Aligned_cols=10 Identities=40% Similarity=1.059 Sum_probs=4.7
Q ss_pred ccCCCCCccc
Q psy7449 500 FVCFTCNYHT 509 (596)
Q Consensus 500 ~~C~~C~~~f 509 (596)
|.|..||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 4455555433
No 66
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=88.52 E-value=0.35 Score=29.16 Aligned_cols=10 Identities=20% Similarity=0.697 Sum_probs=4.7
Q ss_pred eecCCCCccc
Q psy7449 465 VICHHCRDLL 474 (596)
Q Consensus 465 ~~C~~C~~~~ 474 (596)
|+|.+||..+
T Consensus 2 ~~C~~CGy~y 11 (33)
T cd00350 2 YVCPVCGYIY 11 (33)
T ss_pred EECCCCCCEE
Confidence 4455555443
No 67
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=88.00 E-value=0.39 Score=29.31 Aligned_cols=21 Identities=29% Similarity=0.490 Sum_probs=11.3
Q ss_pred cccCCCCccccChhHHHHHhh
Q psy7449 116 YSCGHCPYTARSSSYMLTHLK 136 (596)
Q Consensus 116 ~~C~~C~~~f~~~~~l~~H~~ 136 (596)
|.|++|++.|.+...+..|++
T Consensus 4 ~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 4 FYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred eEccccCCccCCHHHHHHHHC
Confidence 455555555555555555544
No 68
>KOG4173|consensus
Probab=87.71 E-value=0.13 Score=44.80 Aligned_cols=83 Identities=24% Similarity=0.424 Sum_probs=65.0
Q ss_pred CcccCCC--CCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhh-hc---------cCCCcccCC--ccCc
Q psy7449 498 FRFVCFT--CNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMR-IH---------TGEKPYACS--LCDL 563 (596)
Q Consensus 498 ~~~~C~~--C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~-~H---------~~~~~~~C~--~C~~ 563 (596)
..|.|.+ |...|.....+..|...-+| ..|.+|.+.|++..-|-.|+. .| -|...|+|- -|+.
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~ 154 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTE 154 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhh
Confidence 3467755 77888888888888865443 359999999999999998875 34 356679995 4999
Q ss_pred CccChhhHHHHHhhcCCCCC
Q psy7449 564 KFSQNSQLKYHLTTRHDLPD 583 (596)
Q Consensus 564 ~F~~~~~l~~H~~~~H~~~~ 583 (596)
.|.+...-+.|+-..|..+.
T Consensus 155 KFkT~r~RkdH~I~~Hk~Pa 174 (253)
T KOG4173|consen 155 KFKTSRDRKDHMIRMHKYPA 174 (253)
T ss_pred hhhhhhhhhhHHHHhccCCc
Confidence 99999999999999997664
No 69
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=86.65 E-value=1.9 Score=34.36 Aligned_cols=23 Identities=22% Similarity=0.556 Sum_probs=13.1
Q ss_pred cccCCccCcCccChhhHHHHHhh
Q psy7449 555 PYACSLCDLKFSQNSQLKYHLTT 577 (596)
Q Consensus 555 ~~~C~~C~~~F~~~~~l~~H~~~ 577 (596)
.|+|+.|...|=-.-++-.|...
T Consensus 81 ~y~C~~C~~~FC~dCD~fiHe~L 103 (112)
T TIGR00622 81 RYVCAVCKNVFCVDCDVFVHESL 103 (112)
T ss_pred ceeCCCCCCccccccchhhhhhc
Confidence 46666666666555555555433
No 70
>KOG2893|consensus
Probab=85.88 E-value=0.37 Score=42.93 Aligned_cols=42 Identities=24% Similarity=0.578 Sum_probs=21.7
Q ss_pred ecCCCCccCCChHHHHhhhhhcCCCCCccCccCcccccChHHHHHH
Q psy7449 229 QCYSCEYHSLYSSRMIKHIRNHTGERPFQCKICDSTFKRKQTLDLH 274 (596)
Q Consensus 229 ~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H 274 (596)
.|-.|++.|.+..-|++|++. +-|+|.+|.+...+--.|..|
T Consensus 12 wcwycnrefddekiliqhqka----khfkchichkkl~sgpglsih 53 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQKA----KHFKCHICHKKLFSGPGLSIH 53 (341)
T ss_pred eeeecccccchhhhhhhhhhh----ccceeeeehhhhccCCCceee
Confidence 455555555555555555543 225555555554444444444
No 71
>KOG4173|consensus
Probab=85.82 E-value=0.19 Score=43.66 Aligned_cols=81 Identities=22% Similarity=0.414 Sum_probs=63.7
Q ss_pred cccCC--CCCcccCHHHHHHhhhhcCCCCccccCccccccccchhhhhhhhccc---------CCCCcccCCC--Ccccc
Q psy7449 349 YVCYT--CDYHTGVRECMKKHIRTHTGEKPFKCTHCRYSASRRDTLDHHIQVSH---------LYKKRFQCPV--CDYRC 415 (596)
Q Consensus 349 ~~C~~--C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~~~f~~~~~L~~H~~~~H---------~~~~~~~C~~--C~~~f 415 (596)
+.|++ |...|.+......|..+-++ -.|.+|.+.|.+..-|..|+...| .|...|+|-+ |+..|
T Consensus 80 ~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt~KF 156 (253)
T KOG4173|consen 80 FACQVAGCCQVFDALDDYEHHYHTLHG---NSCSFCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCTEKF 156 (253)
T ss_pred ccccccchHHHHhhhhhHHHhhhhccc---chhHHHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhhhhh
Confidence 66776 77788888878888765444 369999999999999999987777 3455688855 99999
Q ss_pred cchhhHHHHhhhcccCC
Q psy7449 416 TKKYNLQMHALTKHDLD 432 (596)
Q Consensus 416 ~~~~~l~~H~~~~H~~~ 432 (596)
.+...-..|+...|...
T Consensus 157 kT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 157 KTSRDRKDHMIRMHKYP 173 (253)
T ss_pred hhhhhhhhHHHHhccCC
Confidence 99999999998878643
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=84.74 E-value=0.35 Score=32.19 Aligned_cols=29 Identities=24% Similarity=0.564 Sum_probs=16.9
Q ss_pred CCCcccCCccCcCccChhhHHHHHhhcCC
Q psy7449 552 GEKPYACSLCDLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 552 ~~~~~~C~~C~~~F~~~~~l~~H~~~~H~ 580 (596)
||..+.|+-||+.|....+..+|+...|+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 44455666666666666666666655553
No 73
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=83.71 E-value=2.3 Score=33.84 Aligned_cols=19 Identities=32% Similarity=0.476 Sum_probs=10.3
Q ss_pred cccccCCcccccchhhHHHH
Q psy7449 308 KISICQNCGLILSTITEELL 327 (596)
Q Consensus 308 ~~~~C~~C~~~~~~~~~~l~ 327 (596)
-+..|+.||..... +..|.
T Consensus 14 LP~~CpiCgLtLVs-s~HLA 32 (112)
T TIGR00622 14 LPVECPICGLTLIL-STHLA 32 (112)
T ss_pred CCCcCCcCCCEEec-cchHH
Confidence 35566666666554 33443
No 74
>KOG1280|consensus
Probab=83.37 E-value=1.3 Score=42.28 Aligned_cols=41 Identities=20% Similarity=0.429 Sum_probs=29.2
Q ss_pred CCcccCCccCcCccChhhHHHHHhhcCCCCCCcceeeeeccC
Q psy7449 553 EKPYACSLCDLKFSQNSQLKYHLTTRHDLPDMSTCLSIECGA 594 (596)
Q Consensus 553 ~~~~~C~~C~~~F~~~~~l~~H~~~~H~~~~~~~c~~~~c~~ 594 (596)
...|.|++|+..=.+...|..|+...|.+.++. ++|-.|++
T Consensus 77 ~qSftCPyC~~~Gfte~~f~~Hv~s~Hpda~~~-~icp~c~~ 117 (381)
T KOG1280|consen 77 PQSFTCPYCGIMGFTERQFGTHVLSQHPEASTS-VICPLCAA 117 (381)
T ss_pred cccccCCcccccccchhHHHHHhhhcCcccCcc-eeeecccc
Confidence 345788888887777888888888888776644 55566654
No 75
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=83.02 E-value=1.2 Score=35.91 Aligned_cols=25 Identities=28% Similarity=0.696 Sum_probs=22.9
Q ss_pred ccC----CCCcccccchhhHHHHhhhccc
Q psy7449 406 FQC----PVCDYRCTKKYNLQMHALTKHD 430 (596)
Q Consensus 406 ~~C----~~C~~~f~~~~~l~~H~~~~H~ 430 (596)
|.| +.|+..+.+...+..|.+.+|+
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhcC
Confidence 899 9999999999999999998774
No 76
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=82.04 E-value=0.65 Score=42.58 Aligned_cols=44 Identities=20% Similarity=0.446 Sum_probs=27.7
Q ss_pred CccccCccccccccchhhhhhhhccc---------CCCCc-----ccCCCCcccccch
Q psy7449 375 KPFKCTHCRYSASRRDTLDHHIQVSH---------LYKKR-----FQCPVCDYRCTKK 418 (596)
Q Consensus 375 k~~~C~~C~~~f~~~~~L~~H~~~~H---------~~~~~-----~~C~~C~~~f~~~ 418 (596)
+.+.||+|+..|.++.-+....+... .+..| ..||.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 45778888888777665555544322 12223 5899999887654
No 77
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=81.13 E-value=1.3 Score=35.61 Aligned_cols=25 Identities=36% Similarity=0.729 Sum_probs=16.0
Q ss_pred Cccc----CCCCCcccCHHHHHHhhhhcC
Q psy7449 348 NYVC----YTCDYHTGVRECMKKHIRTHT 372 (596)
Q Consensus 348 ~~~C----~~C~~~f~~~~~l~~H~~~h~ 372 (596)
.|.| ..|++.+.+...+..|++.++
T Consensus 80 G~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 80 GYRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CeeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 4666 666666666666666666554
No 78
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=80.91 E-value=0.76 Score=30.64 Aligned_cols=29 Identities=17% Similarity=0.403 Sum_probs=17.7
Q ss_pred CCCcccCCCCcccccchhhHHHHhhhccc
Q psy7449 402 YKKRFQCPVCDYRCTKKYNLQMHALTKHD 430 (596)
Q Consensus 402 ~~~~~~C~~C~~~f~~~~~l~~H~~~~H~ 430 (596)
++.-+.||-||..|....+..+|+...|+
T Consensus 14 GE~~lrCPRC~~~FR~~K~Y~RHVNKaH~ 42 (65)
T COG4049 14 GEEFLRCPRCGMVFRRRKDYIRHVNKAHG 42 (65)
T ss_pred CceeeeCCchhHHHHHhHHHHHHhhHHhh
Confidence 34446666666666666666666655554
No 79
>PHA00626 hypothetical protein
Probab=80.69 E-value=1.3 Score=29.81 Aligned_cols=12 Identities=33% Similarity=1.052 Sum_probs=5.8
Q ss_pred cccCCCCCcccc
Q psy7449 499 RFVCFTCNYHTY 510 (596)
Q Consensus 499 ~~~C~~C~~~f~ 510 (596)
.|+|+.||..|+
T Consensus 23 rYkCkdCGY~ft 34 (59)
T PHA00626 23 DYVCCDCGYNDS 34 (59)
T ss_pred ceEcCCCCCeec
Confidence 355555554443
No 80
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=77.84 E-value=1.7 Score=26.35 Aligned_cols=8 Identities=38% Similarity=0.717 Sum_probs=3.5
Q ss_pred ccCCCCCc
Q psy7449 528 YKCLHCSY 535 (596)
Q Consensus 528 ~~C~~C~~ 535 (596)
..|++||.
T Consensus 19 ~~CP~Cg~ 26 (34)
T cd00729 19 EKCPICGA 26 (34)
T ss_pred CcCcCCCC
Confidence 34444443
No 81
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=77.47 E-value=1.9 Score=24.34 Aligned_cols=11 Identities=27% Similarity=0.694 Sum_probs=7.8
Q ss_pred cccCCCCCccc
Q psy7449 349 YVCYTCDYHTG 359 (596)
Q Consensus 349 ~~C~~C~~~f~ 359 (596)
-.|+.||+.|.
T Consensus 15 ~~Cp~CG~~F~ 25 (26)
T PF10571_consen 15 KFCPHCGYDFE 25 (26)
T ss_pred CcCCCCCCCCc
Confidence 56778877774
No 82
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=75.44 E-value=1.6 Score=40.09 Aligned_cols=43 Identities=19% Similarity=0.495 Sum_probs=25.0
Q ss_pred CCccCCCCCccccCchhhhhhhhh---cc-------CCCc-----ccCCccCcCccCh
Q psy7449 526 KPYKCLHCSYKSTQSSCLGRHMRI---HT-------GEKP-----YACSLCDLKFSQN 568 (596)
Q Consensus 526 ~~~~C~~C~~~f~~~~~L~~H~~~---H~-------~~~~-----~~C~~C~~~F~~~ 568 (596)
+.+.||+|+..|.+...+....++ .+ +.-| -.|+.||.+|...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~~r~~~~d~D~~~~Y~~vnP~~Y~V~vCP~CgyA~~~~ 61 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGKIRVIRRDSDFCPRYKGVNPLFYEVWVCPHCGYAAFEE 61 (214)
T ss_pred CceECCCCCCeeeeeEEEcCCceEeeecCCCccccCCCCCeeeeEEECCCCCCccccc
Confidence 446677777777666555444432 11 2223 3788888887654
No 83
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=74.18 E-value=2.3 Score=33.93 Aligned_cols=15 Identities=20% Similarity=0.534 Sum_probs=7.3
Q ss_pred CCccCCCCCccccCc
Q psy7449 526 KPYKCLHCSYKSTQS 540 (596)
Q Consensus 526 ~~~~C~~C~~~f~~~ 540 (596)
.|-.|+.||..|.-.
T Consensus 25 ~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 25 DPIVCPKCGTEFPPE 39 (108)
T ss_pred CCccCCCCCCccCcc
Confidence 344455555555433
No 84
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=72.67 E-value=3.7 Score=28.15 Aligned_cols=9 Identities=33% Similarity=0.936 Sum_probs=5.0
Q ss_pred cccCCCCCc
Q psy7449 499 RFVCFTCNY 507 (596)
Q Consensus 499 ~~~C~~C~~ 507 (596)
.|.|+.||.
T Consensus 27 ~F~CPnCGe 35 (61)
T COG2888 27 KFPCPNCGE 35 (61)
T ss_pred EeeCCCCCc
Confidence 355666653
No 85
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=72.44 E-value=1.4 Score=37.71 Aligned_cols=16 Identities=25% Similarity=0.486 Sum_probs=9.7
Q ss_pred ccCCCCCccccCchhh
Q psy7449 528 YKCLHCSYKSTQSSCL 543 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L 543 (596)
++|+.||+.|.+...+
T Consensus 29 ~~c~~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECLACGKRFTTFERV 44 (154)
T ss_pred eeccccCCcceEeEec
Confidence 6666666666655443
No 86
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=70.77 E-value=3.5 Score=27.06 Aligned_cols=13 Identities=15% Similarity=0.429 Sum_probs=8.3
Q ss_pred ceecCCCCccccc
Q psy7449 464 TVICHHCRDLLPY 476 (596)
Q Consensus 464 ~~~C~~C~~~~~~ 476 (596)
.|.|+.||..+..
T Consensus 3 ~y~C~~CG~~~~~ 15 (46)
T PRK00398 3 EYKCARCGREVEL 15 (46)
T ss_pred EEECCCCCCEEEE
Confidence 4677777766644
No 87
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=69.36 E-value=4.9 Score=23.96 Aligned_cols=12 Identities=25% Similarity=0.614 Sum_probs=6.7
Q ss_pred eecCCCCccccc
Q psy7449 465 VICHHCRDLLPY 476 (596)
Q Consensus 465 ~~C~~C~~~~~~ 476 (596)
|.|..||..+..
T Consensus 1 Y~C~~Cg~~~~~ 12 (32)
T PF03604_consen 1 YICGECGAEVEL 12 (32)
T ss_dssp EBESSSSSSE-B
T ss_pred CCCCcCCCeeEc
Confidence 456667666543
No 88
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=69.06 E-value=4.1 Score=32.52 Aligned_cols=30 Identities=17% Similarity=0.399 Sum_probs=16.9
Q ss_pred ccCccCcccccChHHHHHHHHHhcCCCceeeCCCCCccccc
Q psy7449 256 FQCKICDSTFKRKQTLDLHIKRHTGERTYKCEFCDYTAARK 296 (596)
Q Consensus 256 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 296 (596)
..|+.||+.|-.. +..|-.|+.||..|.-.
T Consensus 10 R~Cp~CG~kFYDL-----------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPSCGAKFYDL-----------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCCCCcchhccC-----------CCCCccCCCCCCccCcc
Confidence 3566666666542 23455666666666554
No 89
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=67.76 E-value=8 Score=26.67 Aligned_cols=10 Identities=30% Similarity=0.896 Sum_probs=5.5
Q ss_pred cccCCCCCcc
Q psy7449 499 RFVCFTCNYH 508 (596)
Q Consensus 499 ~~~C~~C~~~ 508 (596)
.|.|+.||..
T Consensus 25 ~F~CPnCG~~ 34 (59)
T PRK14890 25 KFLCPNCGEV 34 (59)
T ss_pred EeeCCCCCCe
Confidence 4566666543
No 90
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=66.87 E-value=3.8 Score=26.60 Aligned_cols=26 Identities=35% Similarity=0.602 Sum_probs=12.3
Q ss_pred CcccCCccCcCccCh----hhHHHHHhhcC
Q psy7449 554 KPYACSLCDLKFSQN----SQLKYHLTTRH 579 (596)
Q Consensus 554 ~~~~C~~C~~~F~~~----~~l~~H~~~~H 579 (596)
....|.+|++.+... ..|.+|++..|
T Consensus 15 ~~a~C~~C~~~~~~~~~~ts~l~~HL~~~h 44 (45)
T PF02892_consen 15 KKAKCKYCGKVIKYSSGGTSNLKRHLKKKH 44 (45)
T ss_dssp S-EEETTTTEE-----SSTHHHHHHHHHTT
T ss_pred CeEEeCCCCeEEeeCCCcHHHHHHhhhhhC
Confidence 334566666655543 45666665544
No 91
>KOG2186|consensus
Probab=66.43 E-value=3 Score=38.07 Aligned_cols=46 Identities=17% Similarity=0.475 Sum_probs=29.0
Q ss_pred ccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhh
Q psy7449 500 FVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMR 548 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~ 548 (596)
|.|..||..... ..+.+|+-.-+| .-|.|--||+.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKK-p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK-PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc-cchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 667777766543 345557766655 45777777777766 55666654
No 92
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=66.12 E-value=5.2 Score=25.97 Aligned_cols=11 Identities=27% Similarity=0.525 Sum_probs=6.3
Q ss_pred eecCCCCcccc
Q psy7449 465 VICHHCRDLLP 475 (596)
Q Consensus 465 ~~C~~C~~~~~ 475 (596)
|.|..||..|.
T Consensus 3 Y~C~~Cg~~~~ 13 (44)
T smart00659 3 YICGECGRENE 13 (44)
T ss_pred EECCCCCCEee
Confidence 55666665554
No 93
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=65.70 E-value=3.9 Score=35.32 Aligned_cols=22 Identities=45% Similarity=1.099 Sum_probs=14.4
Q ss_pred cccCCCCCcccCHHHHHHhhhhcCCCCccccCccc
Q psy7449 349 YVCYTCDYHTGVRECMKKHIRTHTGEKPFKCTHCR 383 (596)
Q Consensus 349 ~~C~~C~~~f~~~~~l~~H~~~h~~~k~~~C~~C~ 383 (596)
|.|++||+. +.++.|-+||+||
T Consensus 135 ~vC~vCGy~-------------~~ge~P~~CPiCg 156 (166)
T COG1592 135 WVCPVCGYT-------------HEGEAPEVCPICG 156 (166)
T ss_pred EEcCCCCCc-------------ccCCCCCcCCCCC
Confidence 777777764 3456666777776
No 94
>KOG2186|consensus
Probab=65.62 E-value=3.2 Score=37.88 Aligned_cols=46 Identities=15% Similarity=0.427 Sum_probs=22.8
Q ss_pred cccCCCCccccChhHHHHHhhhccCCCCccCCCCCccccCchhHHhHhh
Q psy7449 116 YSCGHCPYTARSSSYMLTHLKVHSGEKAFKCEYCSYSSYSKSNLRTHMR 164 (596)
Q Consensus 116 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~L~~H~~ 164 (596)
|.|..||.... +..+.+|+..-.+ .-|.|-.|++.|.. .++..|..
T Consensus 4 FtCnvCgEsvK-Kp~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVK-KPQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhcc-ccchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 44555555444 2334445554444 33555556655555 44455543
No 95
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=65.11 E-value=6.7 Score=24.32 Aligned_cols=10 Identities=30% Similarity=0.959 Sum_probs=4.6
Q ss_pred ccCccCcccc
Q psy7449 256 FQCKICDSTF 265 (596)
Q Consensus 256 ~~C~~C~~~f 265 (596)
.+|+.|+..|
T Consensus 26 vrC~~C~~~f 35 (37)
T PF13719_consen 26 VRCPKCGHVF 35 (37)
T ss_pred EECCCCCcEe
Confidence 4444444443
No 96
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=65.03 E-value=3.6 Score=27.81 Aligned_cols=11 Identities=18% Similarity=0.558 Sum_probs=5.6
Q ss_pred ccCCCCCcccc
Q psy7449 500 FVCFTCNYHTY 510 (596)
Q Consensus 500 ~~C~~C~~~f~ 510 (596)
|+|..||..|.
T Consensus 6 y~C~~Cg~~fe 16 (52)
T TIGR02605 6 YRCTACGHRFE 16 (52)
T ss_pred EEeCCCCCEeE
Confidence 45555555444
No 97
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=64.81 E-value=4.2 Score=34.81 Aligned_cols=19 Identities=16% Similarity=0.308 Sum_probs=14.1
Q ss_pred cccCCCCCccccccchhhh
Q psy7449 499 RFVCFTCNYHTYKSGHMRQ 517 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~ 517 (596)
.++|+.||++|.+...+..
T Consensus 28 ~~~c~~c~~~f~~~e~~~~ 46 (154)
T PRK00464 28 RRECLACGKRFTTFERVEL 46 (154)
T ss_pred eeeccccCCcceEeEeccC
Confidence 4889999999977665443
No 98
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=63.33 E-value=3.3 Score=26.55 Aligned_cols=13 Identities=23% Similarity=0.759 Sum_probs=6.5
Q ss_pred eecCCCccccchh
Q psy7449 172 YKCSSCDFQATQK 184 (596)
Q Consensus 172 ~~C~~C~~~f~~~ 184 (596)
|.|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 6 YRCEECGHEFEVL 18 (42)
T ss_pred EEeCCCCCEEEEE
Confidence 4555555555443
No 99
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=63.26 E-value=2.1 Score=31.17 Aligned_cols=40 Identities=15% Similarity=0.175 Sum_probs=18.1
Q ss_pred ccCCCCCccccccchhhhhhhccCCCCCccCC--CCCccccCch
Q psy7449 500 FVCFTCNYHTYKSGHMRQHINGHVGIKPYKCL--HCSYKSTQSS 541 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~--~C~~~f~~~~ 541 (596)
+.|+.||....-..+-..... ..++-++|. .||..|.+..
T Consensus 2 m~CP~Cg~~a~irtSr~~s~~--~~~~Y~qC~N~eCg~tF~t~e 43 (72)
T PRK09678 2 FHCPLCQHAAHARTSRYITDT--TKERYHQCQNVNCSATFITYE 43 (72)
T ss_pred ccCCCCCCccEEEEChhcChh--hheeeeecCCCCCCCEEEEEE
Confidence 456666654432222222211 233445665 5666665544
No 100
>PF12907 zf-met2: Zinc-binding
Probab=63.15 E-value=4 Score=25.75 Aligned_cols=32 Identities=28% Similarity=0.664 Sum_probs=20.9
Q ss_pred ccCCccCcCc---cChhhHHHHHhhcCCCCCCcce
Q psy7449 556 YACSLCDLKF---SQNSQLKYHLTTRHDLPDMSTC 587 (596)
Q Consensus 556 ~~C~~C~~~F---~~~~~l~~H~~~~H~~~~~~~c 587 (596)
+.|.+|-..| .+...|..|....|+..++..|
T Consensus 2 i~C~iC~qtF~~t~~~~~L~eH~enKHpK~~~~~C 36 (40)
T PF12907_consen 2 IICKICRQTFMQTTNEPQLKEHAENKHPKNTFEEC 36 (40)
T ss_pred cCcHHhhHHHHhcCCHHHHHHHHHccCCCCCHHHc
Confidence 4677777444 3445688888888876655544
No 101
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=63.00 E-value=8.5 Score=23.69 Aligned_cols=10 Identities=30% Similarity=0.982 Sum_probs=4.6
Q ss_pred ccCccCcccc
Q psy7449 256 FQCKICDSTF 265 (596)
Q Consensus 256 ~~C~~C~~~f 265 (596)
.+|+.|+..|
T Consensus 26 v~C~~C~~~f 35 (36)
T PF13717_consen 26 VRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEe
Confidence 4444444443
No 102
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=62.77 E-value=3.5 Score=26.30 Aligned_cols=20 Identities=15% Similarity=0.154 Sum_probs=14.6
Q ss_pred chhHHHHhhccccccccccc
Q psy7449 19 DYTSLFRRSKQKHVLHLWPC 38 (596)
Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~ 38 (596)
...+||+++|+||+.++.+.
T Consensus 21 ~Qk~ly~dvm~Eny~~l~sl 40 (41)
T PF01352_consen 21 AQKNLYRDVMLENYRNLVSL 40 (41)
T ss_dssp HHHHHHHHHHHHTTTS---S
T ss_pred eecccchhHHHHhhcccEec
Confidence 36789999999999887764
No 103
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=61.63 E-value=3.2 Score=30.20 Aligned_cols=17 Identities=12% Similarity=0.139 Sum_probs=9.1
Q ss_pred CcccCC--CCCccccccch
Q psy7449 498 FRFVCF--TCNYHTYKSGH 514 (596)
Q Consensus 498 ~~~~C~--~C~~~f~~~~~ 514 (596)
.-++|. .||.+|.....
T Consensus 26 ~Y~qC~N~eCg~tF~t~es 44 (72)
T PRK09678 26 RYHQCQNVNCSATFITYES 44 (72)
T ss_pred eeeecCCCCCCCEEEEEEE
Confidence 345565 56666654443
No 104
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=61.09 E-value=3.7 Score=34.05 Aligned_cols=25 Identities=44% Similarity=0.703 Sum_probs=11.4
Q ss_pred ccCCccCcCccChhhHHHHHhhcCCCCC
Q psy7449 556 YACSLCDLKFSQNSQLKYHLTTRHDLPD 583 (596)
Q Consensus 556 ~~C~~C~~~F~~~~~l~~H~~~~H~~~~ 583 (596)
-.|-.||+.|.. |++|++.|||+.|
T Consensus 73 i~clecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-H
T ss_pred eEEccCCcccch---HHHHHHHccCCCH
Confidence 355556665554 3666666666543
No 105
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=60.88 E-value=5.8 Score=33.92 Aligned_cols=13 Identities=15% Similarity=0.350 Sum_probs=6.2
Q ss_pred cccCCCCCccccc
Q psy7449 499 RFVCFTCNYHTYK 511 (596)
Q Consensus 499 ~~~C~~C~~~f~~ 511 (596)
.|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3445555544443
No 106
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=59.40 E-value=9 Score=23.77 Aligned_cols=10 Identities=20% Similarity=0.830 Sum_probs=4.4
Q ss_pred ccCccCcccc
Q psy7449 256 FQCKICDSTF 265 (596)
Q Consensus 256 ~~C~~C~~~f 265 (596)
..|+.|+..|
T Consensus 26 v~C~~C~~~~ 35 (38)
T TIGR02098 26 VRCGKCGHVW 35 (38)
T ss_pred EECCCCCCEE
Confidence 3444444443
No 107
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=59.24 E-value=6.5 Score=34.02 Aligned_cols=24 Identities=38% Similarity=0.827 Sum_probs=14.9
Q ss_pred cccCCCCCccccccchhhhhhhccCCCCCccCCCCCc
Q psy7449 499 RFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSY 535 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~ 535 (596)
.|.|++||.. +.|+.|-+||+||.
T Consensus 134 ~~vC~vCGy~-------------~~ge~P~~CPiCga 157 (166)
T COG1592 134 VWVCPVCGYT-------------HEGEAPEVCPICGA 157 (166)
T ss_pred EEEcCCCCCc-------------ccCCCCCcCCCCCC
Confidence 4677777642 44556677777763
No 108
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=58.65 E-value=7.7 Score=31.51 Aligned_cols=12 Identities=8% Similarity=-0.128 Sum_probs=5.7
Q ss_pred CccCCCCCcccc
Q psy7449 527 PYKCLHCSYKST 538 (596)
Q Consensus 527 ~~~C~~C~~~f~ 538 (596)
|-.|+.||..|.
T Consensus 26 p~vcP~cg~~~~ 37 (129)
T TIGR02300 26 PAVSPYTGEQFP 37 (129)
T ss_pred CccCCCcCCccC
Confidence 444555554443
No 109
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=57.96 E-value=6.3 Score=34.13 Aligned_cols=17 Identities=12% Similarity=0.235 Sum_probs=8.6
Q ss_pred CccCCCCCccccCchhh
Q psy7449 527 PYKCLHCSYKSTQSSCL 543 (596)
Q Consensus 527 ~~~C~~C~~~f~~~~~L 543 (596)
-|.|+.|+..|+...++
T Consensus 109 ~Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM 125 (158)
T ss_pred eEECCCCCcEeeHHHHH
Confidence 35555555555554444
No 110
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=57.62 E-value=9.2 Score=33.15 Aligned_cols=16 Identities=19% Similarity=0.376 Sum_probs=7.4
Q ss_pred cccCCCCCcccCHHHH
Q psy7449 349 YVCYTCDYHTGVRECM 364 (596)
Q Consensus 349 ~~C~~C~~~f~~~~~l 364 (596)
|.|+.|+..|+...++
T Consensus 110 Y~Cp~c~~r~tf~eA~ 125 (158)
T TIGR00373 110 FICPNMCVRFTFNEAM 125 (158)
T ss_pred EECCCCCcEeeHHHHH
Confidence 4444444444444444
No 111
>KOG2807|consensus
Probab=56.06 E-value=13 Score=35.47 Aligned_cols=24 Identities=21% Similarity=0.513 Sum_probs=14.7
Q ss_pred CccCccCcccccChHHHHHHHHHh
Q psy7449 255 PFQCKICDSTFKRKQTLDLHIKRH 278 (596)
Q Consensus 255 ~~~C~~C~~~f~~~~~l~~H~~~h 278 (596)
.|.|..|...|-..=..-.|...|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iHesLh 368 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIHESLH 368 (378)
T ss_pred cEEchhccceeeccchHHHHhhhh
Confidence 466777766666555555565554
No 112
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=55.83 E-value=5.8 Score=25.05 Aligned_cols=10 Identities=30% Similarity=0.949 Sum_probs=4.7
Q ss_pred ccCCCCCccc
Q psy7449 144 FKCEYCSYSS 153 (596)
Q Consensus 144 ~~C~~C~~~f 153 (596)
|+|..||..|
T Consensus 6 y~C~~Cg~~f 15 (41)
T smart00834 6 YRCEDCGHTF 15 (41)
T ss_pred EEcCCCCCEE
Confidence 4444444444
No 113
>PF15269 zf-C2H2_7: Zinc-finger
Probab=55.52 E-value=15 Score=23.47 Aligned_cols=27 Identities=22% Similarity=0.728 Sum_probs=21.0
Q ss_pred CCCCCcccCCCCCcccCHHHHHHhhhh
Q psy7449 344 DKSFNYVCYTCDYHTGVRECMKKHIRT 370 (596)
Q Consensus 344 ~~~~~~~C~~C~~~f~~~~~l~~H~~~ 370 (596)
.+.+.|+|-+|+++....+.|-.||+.
T Consensus 16 gkp~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 16 GKPFKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred CCCccceeecCCcccchHHHHHHHHHH
Confidence 445668888888888888888888763
No 114
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=55.44 E-value=8.9 Score=31.16 Aligned_cols=31 Identities=13% Similarity=0.029 Sum_probs=17.4
Q ss_pred ccccccccccccCchHHHhHHhhhcCCCCcccCcCcccccCh
Q psy7449 59 KNTCCICGHVESGAWRLKRHFLKHTGEKSYECCWCTYKTAYE 100 (596)
Q Consensus 59 ~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~ 100 (596)
+..|+.||+.|... +..|-.|+.||..|.-.
T Consensus 9 Kr~Cp~cg~kFYDL-----------nk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 9 KRICPNTGSKFYDL-----------NRRPAVSPYTGEQFPPE 39 (129)
T ss_pred cccCCCcCcccccc-----------CCCCccCCCcCCccCcc
Confidence 34566666666542 23456666666665443
No 115
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=55.32 E-value=6.3 Score=26.11 Aligned_cols=10 Identities=20% Similarity=0.763 Sum_probs=5.4
Q ss_pred ccCCCCCccc
Q psy7449 500 FVCFTCNYHT 509 (596)
Q Consensus 500 ~~C~~C~~~f 509 (596)
|.|..||+.|
T Consensus 7 Y~C~~Cg~~~ 16 (49)
T COG1996 7 YKCARCGREV 16 (49)
T ss_pred EEhhhcCCee
Confidence 5555555555
No 116
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=55.12 E-value=13 Score=31.87 Aligned_cols=13 Identities=23% Similarity=0.649 Sum_probs=6.3
Q ss_pred ccccCcccccccc
Q psy7449 376 PFKCTHCRYSASR 388 (596)
Q Consensus 376 ~~~C~~C~~~f~~ 388 (596)
-|.|+.|+..|..
T Consensus 99 ~Y~Cp~C~~~y~~ 111 (147)
T smart00531 99 YYKCPNCQSKYTF 111 (147)
T ss_pred EEECcCCCCEeeH
Confidence 3455555554443
No 117
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=55.06 E-value=8.1 Score=30.72 Aligned_cols=11 Identities=27% Similarity=0.957 Sum_probs=6.8
Q ss_pred ccCCccCcCcc
Q psy7449 556 YACSLCDLKFS 566 (596)
Q Consensus 556 ~~C~~C~~~F~ 566 (596)
|.|..|+..+.
T Consensus 91 y~C~~C~~~w~ 101 (104)
T TIGR01384 91 YKCTKCGYVWR 101 (104)
T ss_pred EEeCCCCCeeE
Confidence 66766665543
No 118
>PRK04023 DNA polymerase II large subunit; Validated
Probab=54.92 E-value=12 Score=41.71 Aligned_cols=11 Identities=18% Similarity=0.459 Sum_probs=6.9
Q ss_pred ccCCccCcCcc
Q psy7449 556 YACSLCDLKFS 566 (596)
Q Consensus 556 ~~C~~C~~~F~ 566 (596)
|.|+.||..-.
T Consensus 664 y~CPKCG~El~ 674 (1121)
T PRK04023 664 DECEKCGREPT 674 (1121)
T ss_pred CcCCCCCCCCC
Confidence 56777775443
No 119
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=54.79 E-value=11 Score=21.27 Aligned_cols=19 Identities=21% Similarity=0.550 Sum_probs=13.4
Q ss_pred cCCccCcCccChhhHHHHHh
Q psy7449 557 ACSLCDLKFSQNSQLKYHLT 576 (596)
Q Consensus 557 ~C~~C~~~F~~~~~l~~H~~ 576 (596)
.|++|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 577787776 5567777764
No 120
>PF14353 CpXC: CpXC protein
Probab=54.22 E-value=2.2 Score=35.52 Aligned_cols=13 Identities=23% Similarity=0.892 Sum_probs=8.2
Q ss_pred eecCCCCcccccc
Q psy7449 465 VICHHCRDLLPYK 477 (596)
Q Consensus 465 ~~C~~C~~~~~~~ 477 (596)
..|+.|+..|...
T Consensus 2 itCP~C~~~~~~~ 14 (128)
T PF14353_consen 2 ITCPHCGHEFEFE 14 (128)
T ss_pred cCCCCCCCeeEEE
Confidence 3577777776553
No 121
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=53.26 E-value=8.3 Score=28.89 Aligned_cols=13 Identities=15% Similarity=0.575 Sum_probs=7.0
Q ss_pred CcccCCCCccccc
Q psy7449 404 KRFQCPVCDYRCT 416 (596)
Q Consensus 404 ~~~~C~~C~~~f~ 416 (596)
..+.|..|+..|.
T Consensus 52 GIW~C~kCg~~fA 64 (89)
T COG1997 52 GIWKCRKCGAKFA 64 (89)
T ss_pred CeEEcCCCCCeec
Confidence 3455555555554
No 122
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=51.19 E-value=9.2 Score=33.85 Aligned_cols=16 Identities=19% Similarity=0.555 Sum_probs=7.6
Q ss_pred ccCCCCCccccCchhh
Q psy7449 528 YKCLHCSYKSTQSSCL 543 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L 543 (596)
|.|+.|+..|+...++
T Consensus 118 Y~Cp~C~~rytf~eA~ 133 (178)
T PRK06266 118 FFCPNCHIRFTFDEAM 133 (178)
T ss_pred EECCCCCcEEeHHHHh
Confidence 4455555554444443
No 123
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=50.90 E-value=10 Score=25.31 Aligned_cols=23 Identities=39% Similarity=0.820 Sum_probs=11.5
Q ss_pred cCCccCcCccCh-----hhHHHHHhhcC
Q psy7449 557 ACSLCDLKFSQN-----SQLKYHLTTRH 579 (596)
Q Consensus 557 ~C~~C~~~F~~~-----~~l~~H~~~~H 579 (596)
.|.+|++.+... +.|.+|++..|
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~~~h 47 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLRRKH 47 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHHhHC
Confidence 455555554433 35666665433
No 124
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=50.42 E-value=14 Score=32.73 Aligned_cols=16 Identities=19% Similarity=0.478 Sum_probs=7.7
Q ss_pred ccccCccccccccchh
Q psy7449 376 PFKCTHCRYSASRRDT 391 (596)
Q Consensus 376 ~~~C~~C~~~f~~~~~ 391 (596)
-|.|+.|+..|+....
T Consensus 117 ~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 117 FFFCPNCHIRFTFDEA 132 (178)
T ss_pred EEECCCCCcEEeHHHH
Confidence 3555555555444443
No 125
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=48.05 E-value=16 Score=20.12 Aligned_cols=10 Identities=30% Similarity=0.986 Sum_probs=6.2
Q ss_pred CcccCCCCCc
Q psy7449 498 FRFVCFTCNY 507 (596)
Q Consensus 498 ~~~~C~~C~~ 507 (596)
.+|.|+.||+
T Consensus 15 v~f~CPnCG~ 24 (24)
T PF07754_consen 15 VPFPCPNCGF 24 (24)
T ss_pred ceEeCCCCCC
Confidence 3567777763
No 126
>PF14353 CpXC: CpXC protein
Probab=47.81 E-value=4.7 Score=33.56 Aligned_cols=19 Identities=26% Similarity=0.378 Sum_probs=8.9
Q ss_pred ccCCCCCccccCchhhhhh
Q psy7449 528 YKCLHCSYKSTQSSCLGRH 546 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L~~H 546 (596)
|.|+.||..|.-...+..|
T Consensus 39 ~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 39 FTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred EECCCCCCceecCCCEEEE
Confidence 4455555555444444433
No 127
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=47.01 E-value=13 Score=34.22 Aligned_cols=14 Identities=50% Similarity=1.232 Sum_probs=9.6
Q ss_pred hcCCCceeeCCCCC
Q psy7449 278 HTGERTYKCEFCDY 291 (596)
Q Consensus 278 h~~~~~~~C~~C~~ 291 (596)
.+|.+.|+|.+|..
T Consensus 137 ~hGGrif~CsfC~~ 150 (314)
T PF06524_consen 137 DHGGRIFKCSFCDN 150 (314)
T ss_pred cCCCeEEEeecCCC
Confidence 34667788887764
No 128
>PF15269 zf-C2H2_7: Zinc-finger
Probab=46.86 E-value=14 Score=23.54 Aligned_cols=24 Identities=25% Similarity=0.811 Sum_probs=19.9
Q ss_pred CcccCCCCCccccccchhhhhhhc
Q psy7449 498 FRFVCFTCNYHTYKSGHMRQHING 521 (596)
Q Consensus 498 ~~~~C~~C~~~f~~~~~l~~H~~~ 521 (596)
..|+|-+|.++...++.|.+||+-
T Consensus 19 ~~ykcfqcpftc~~kshl~nhmky 42 (54)
T PF15269_consen 19 FKYKCFQCPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ccceeecCCcccchHHHHHHHHHH
Confidence 347899999999999999999863
No 129
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=46.85 E-value=13 Score=34.17 Aligned_cols=60 Identities=18% Similarity=0.459 Sum_probs=32.1
Q ss_pred ceeecCCCCccCCChHHHHhhhhhcCCCCCccCccCcccccChHHHHHHHHHhcCCCceeeCCCCCcccccccccccc
Q psy7449 226 YQYQCYSCEYHSLYSSRMIKHIRNHTGERPFQCKICDSTFKRKQTLDLHIKRHTGERTYKCEFCDYTAARKDTLGVHV 303 (596)
Q Consensus 226 ~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~H~ 303 (596)
..|+|..|++.- .|.|-.|...|-.....+.=.+. ...+++.|+.|+........|..-.
T Consensus 170 E~~KC~SCNrlG-----------------q~sCLRCK~cfCddHvrrKg~ky-~k~k~~PCPKCg~et~eTkdLSmSt 229 (314)
T PF06524_consen 170 ETFKCQSCNRLG-----------------QYSCLRCKICFCDDHVRRKGFKY-EKGKPIPCPKCGYETQETKDLSMST 229 (314)
T ss_pred cccccccccccc-----------------chhhhheeeeehhhhhhhccccc-ccCCCCCCCCCCCcccccccceeee
Confidence 457888886532 24555555445442222211221 2336788888887776666665433
No 130
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=45.98 E-value=22 Score=40.92 Aligned_cols=41 Identities=27% Similarity=0.656 Sum_probs=25.4
Q ss_pred ccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCccCcCcc
Q psy7449 500 FVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCDLKFS 566 (596)
Q Consensus 500 ~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~~~F~ 566 (596)
..|+.||... +.+|.|+.||....... ++ ...|+.|+....
T Consensus 680 ~fCP~CGs~t---------------e~vy~CPsCGaev~~de---------s~--a~~CP~CGtplv 720 (1337)
T PRK14714 680 NRCPDCGTHT---------------EPVYVCPDCGAEVPPDE---------SG--RVECPRCDVELT 720 (1337)
T ss_pred ccCcccCCcC---------------CCceeCccCCCccCCCc---------cc--cccCCCCCCccc
Confidence 3688888643 23578888888654321 12 346888885443
No 131
>KOG2593|consensus
Probab=44.99 E-value=14 Score=36.99 Aligned_cols=37 Identities=22% Similarity=0.512 Sum_probs=22.7
Q ss_pred CCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCccC
Q psy7449 523 VGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCD 562 (596)
Q Consensus 523 ~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~ 562 (596)
+....|.|+.|++.|+...+|+. +-.....|.|..|+
T Consensus 124 t~~~~Y~Cp~C~kkyt~Lea~~L---~~~~~~~F~C~~C~ 160 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSLEALQL---LDNETGEFHCENCG 160 (436)
T ss_pred cccccccCCccccchhhhHHHHh---hcccCceEEEecCC
Confidence 34456777777777776665543 22233457777775
No 132
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=44.97 E-value=21 Score=33.70 Aligned_cols=47 Identities=15% Similarity=0.323 Sum_probs=29.0
Q ss_pred ccCccccccccchhhhhhhhcccCCCCcccCCCCcccccchhhHHHHhhh
Q psy7449 378 KCTHCRYSASRRDTLDHHIQVSHLYKKRFQCPVCDYRCTKKYNLQMHALT 427 (596)
Q Consensus 378 ~C~~C~~~f~~~~~L~~H~~~~H~~~~~~~C~~C~~~f~~~~~l~~H~~~ 427 (596)
.|-.|.-.|+.... |-.-.-+....|+|+.|...|-..-+.-.|...
T Consensus 364 ~Cf~CQ~~fp~~~~---~~~~~~~ss~rY~Ce~CK~~FC~dCdvfiHe~L 410 (421)
T COG5151 364 HCFVCQGPFPKPPV---SPFDESTSSGRYQCELCKSTFCSDCDVFIHETL 410 (421)
T ss_pred cceeccCCCCCCCC---CcccccccccceechhhhhhhhhhhHHHHHHHH
Confidence 47677766654332 211122334568888888888887777777655
No 133
>KOG2071|consensus
Probab=44.38 E-value=16 Score=38.26 Aligned_cols=26 Identities=19% Similarity=0.259 Sum_probs=22.0
Q ss_pred CcccCCCCCccccccchhhhhhhccC
Q psy7449 498 FRFVCFTCNYHTYKSGHMRQHINGHV 523 (596)
Q Consensus 498 ~~~~C~~C~~~f~~~~~l~~H~~~H~ 523 (596)
.+-.|..||.+|........||..|.
T Consensus 417 ~pnqC~~CG~R~~~~ee~sk~md~H~ 442 (579)
T KOG2071|consen 417 SPNQCKSCGLRFDDSEERSKHMDIHD 442 (579)
T ss_pred CcchhcccccccccchhhhhHhhhhh
Confidence 46789999999999998888887774
No 134
>KOG2593|consensus
Probab=43.79 E-value=27 Score=34.96 Aligned_cols=15 Identities=13% Similarity=0.228 Sum_probs=7.4
Q ss_pred CccccccccccccCc
Q psy7449 58 NKNTCCICGHVESGA 72 (596)
Q Consensus 58 ~~~~C~~C~~~f~~~ 72 (596)
..|.|+.|++.|..-
T Consensus 127 ~~Y~Cp~C~kkyt~L 141 (436)
T KOG2593|consen 127 AGYVCPNCQKKYTSL 141 (436)
T ss_pred ccccCCccccchhhh
Confidence 345555555555433
No 135
>PF05495 zf-CHY: CHY zinc finger; InterPro: IPR008913 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Pirh2 is an eukaryotic ubiquitin protein ligase, which has been shown to promote p53 degradation in mammals. Pirh2 physically interacts with p53 and promotes ubiquitination of p53 independently of MDM2. Like MDM2, Pirh2 is thought to participate in an autoregulatory feedback loop that controls p53 function. Pirh2 proteins contain three distinct zinc fingers, the CHY-type, the CTCHY-type which is C-terminal to the CHY-type zinc finger and a RING finger. The CHY-type zinc finger has no currently known function []. As well as Pirh2, the CHY-type zinc finger is also found in the following proteins: Yeast helper of Tim protein 13. Hot13 may have a role in the assembly and recycling of the small Tims, a complex of the mitochondrial intermembrane space that participates in the TIM22 import pathway for assembly of the inner membrane [] Several plant hypothetical proteins that also contain haemerythrin cation binding domains Several protozoan hypothetical proteins that also contain a Myb domain The solution structure of this zinc finger has been solved and binds 3 zinc atoms as shown in the following schematic representation: ++---------+-----+ || | | CXHYxxxxxxxxxCCxxxxxCxxCHxxxxxHxxxxxxxxxxxCxxCxxxxxxxxxCxxC | | | | | | | | +-+-----------------+--+ +--+---------+--+ 'C': conserved cysteine involved in the binding of one zinc atom. 'H': conserved histidine involved in the binding of one zinc atom. More information about these proteins can be found at Protein of the Month: Zinc Fingers []; GO: 0008270 zinc ion binding; PDB: 2DKT_A 2K2C_A.
Probab=43.60 E-value=11 Score=27.55 Aligned_cols=11 Identities=18% Similarity=0.489 Sum_probs=5.6
Q ss_pred eecCCCCcccc
Q psy7449 465 VICHHCRDLLP 475 (596)
Q Consensus 465 ~~C~~C~~~~~ 475 (596)
.+++-|++.|+
T Consensus 11 ~~~~cC~~~y~ 21 (71)
T PF05495_consen 11 IRFPCCGKYYP 21 (71)
T ss_dssp EEETTTTEEES
T ss_pred EECCcccCeec
Confidence 44555555554
No 136
>PRK04023 DNA polymerase II large subunit; Validated
Probab=42.10 E-value=23 Score=39.71 Aligned_cols=8 Identities=25% Similarity=0.970 Sum_probs=4.1
Q ss_pred ccCccCcc
Q psy7449 256 FQCKICDS 263 (596)
Q Consensus 256 ~~C~~C~~ 263 (596)
+.|+.||.
T Consensus 639 frCP~CG~ 646 (1121)
T PRK04023 639 RRCPFCGT 646 (1121)
T ss_pred ccCCCCCC
Confidence 45555554
No 137
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=42.05 E-value=11 Score=34.20 Aligned_cols=26 Identities=23% Similarity=0.597 Sum_probs=14.9
Q ss_pred cccCCccCcCccChhhHHHHHhhcCC
Q psy7449 555 PYACSLCDLKFSQNSQLKYHLTTRHD 580 (596)
Q Consensus 555 ~~~C~~C~~~F~~~~~l~~H~~~~H~ 580 (596)
.|.|..|+|.|....-+.+|+...|.
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred EECCCCCCcccCChHHHHHHHhhcCH
Confidence 46666666666666666666666664
No 138
>KOG4124|consensus
Probab=42.00 E-value=7.3 Score=37.31 Aligned_cols=54 Identities=28% Similarity=0.570 Sum_probs=41.0
Q ss_pred CCccccCc--cccccccchhhhhhhhcccC------------------CCCcccCCCCcccccchhhHHHHhhh
Q psy7449 374 EKPFKCTH--CRYSASRRDTLDHHIQVSHL------------------YKKRFQCPVCDYRCTKKYNLQMHALT 427 (596)
Q Consensus 374 ~k~~~C~~--C~~~f~~~~~L~~H~~~~H~------------------~~~~~~C~~C~~~f~~~~~l~~H~~~ 427 (596)
+++|.|++ |.+.......|..|....|. ..++|.|++|.+.+.....|..|+..
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~~~~ 420 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYHRTH 420 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCceeeh
Confidence 47888976 99999988899888655552 23678999999888887777777654
No 139
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=41.30 E-value=19 Score=39.55 Aligned_cols=9 Identities=22% Similarity=0.549 Sum_probs=4.4
Q ss_pred CceeeCCCC
Q psy7449 282 RTYKCEFCD 290 (596)
Q Consensus 282 ~~~~C~~C~ 290 (596)
.|..|+.|+
T Consensus 474 ~p~~Cp~Cg 482 (730)
T COG1198 474 IPQSCPECG 482 (730)
T ss_pred CCCCCCCCC
Confidence 344555554
No 140
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=40.51 E-value=9.9 Score=28.95 Aligned_cols=13 Identities=15% Similarity=0.585 Sum_probs=8.2
Q ss_pred CcccCCCCccccc
Q psy7449 404 KRFQCPVCDYRCT 416 (596)
Q Consensus 404 ~~~~C~~C~~~f~ 416 (596)
..+.|..|++.|+
T Consensus 52 GIW~C~~C~~~~A 64 (90)
T PF01780_consen 52 GIWKCKKCGKKFA 64 (90)
T ss_dssp TEEEETTTTEEEE
T ss_pred EEeecCCCCCEEe
Confidence 3466777776664
No 141
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=40.23 E-value=21 Score=24.73 Aligned_cols=6 Identities=33% Similarity=0.545 Sum_probs=2.3
Q ss_pred HHHHHh
Q psy7449 103 LKKHQR 108 (596)
Q Consensus 103 l~~H~~ 108 (596)
|..|+.
T Consensus 26 l~~H~~ 31 (60)
T PF02176_consen 26 LDDHLE 31 (60)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 333333
No 142
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=39.17 E-value=27 Score=22.82 Aligned_cols=11 Identities=27% Similarity=0.761 Sum_probs=6.4
Q ss_pred CCcccCCccCc
Q psy7449 553 EKPYACSLCDL 563 (596)
Q Consensus 553 ~~~~~C~~C~~ 563 (596)
...|+|..|++
T Consensus 35 ~~~~~C~~C~~ 45 (46)
T PF12760_consen 35 RGRYRCKACRK 45 (46)
T ss_pred CCeEECCCCCC
Confidence 44566666654
No 143
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=38.93 E-value=15 Score=29.98 Aligned_cols=23 Identities=43% Similarity=0.495 Sum_probs=11.9
Q ss_pred cCCccCcCccChhhHHHHHhhcCCCC
Q psy7449 557 ACSLCDLKFSQNSQLKYHLTTRHDLP 582 (596)
Q Consensus 557 ~C~~C~~~F~~~~~l~~H~~~~H~~~ 582 (596)
.|-.+|+.|. +|++|+.+|+|+.
T Consensus 78 icLEDGkkfK---SLKRHL~t~~gmT 100 (148)
T COG4957 78 ICLEDGKKFK---SLKRHLTTHYGLT 100 (148)
T ss_pred EEeccCcchH---HHHHHHhcccCCC
Confidence 4444555543 3555666655544
No 144
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=38.52 E-value=14 Score=28.13 Aligned_cols=14 Identities=14% Similarity=0.439 Sum_probs=8.4
Q ss_pred CcccCCCCcccccc
Q psy7449 404 KRFQCPVCDYRCTK 417 (596)
Q Consensus 404 ~~~~C~~C~~~f~~ 417 (596)
..+.|..|++.|+-
T Consensus 53 GIW~C~~C~~~~AG 66 (90)
T PTZ00255 53 GIWRCKGCKKTVAG 66 (90)
T ss_pred EEEEcCCCCCEEeC
Confidence 45666666666643
No 145
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=38.25 E-value=18 Score=19.68 Aligned_cols=8 Identities=25% Similarity=0.908 Sum_probs=3.8
Q ss_pred cCCCCccc
Q psy7449 467 CHHCRDLL 474 (596)
Q Consensus 467 C~~C~~~~ 474 (596)
|+.||...
T Consensus 2 Cp~CG~~~ 9 (23)
T PF13240_consen 2 CPNCGAEI 9 (23)
T ss_pred CcccCCCC
Confidence 45555443
No 146
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=38.05 E-value=12 Score=30.98 Aligned_cols=24 Identities=42% Similarity=0.743 Sum_probs=12.7
Q ss_pred ccCCCCCccccCchhhhhhhhhccCCC
Q psy7449 528 YKCLHCSYKSTQSSCLGRHMRIHTGEK 554 (596)
Q Consensus 528 ~~C~~C~~~f~~~~~L~~H~~~H~~~~ 554 (596)
..|-+||+.|.. |++|++.|+|-.
T Consensus 73 i~clecGk~~k~---LkrHL~~~~glt 96 (132)
T PF05443_consen 73 IICLECGKKFKT---LKRHLRTHHGLT 96 (132)
T ss_dssp EE-TBT--EESB---HHHHHHHTT-S-
T ss_pred eEEccCCcccch---HHHHHHHccCCC
Confidence 567777777754 477777776643
No 147
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=37.82 E-value=24 Score=21.76 Aligned_cols=10 Identities=20% Similarity=0.335 Sum_probs=4.9
Q ss_pred ccCCCCCccc
Q psy7449 528 YKCLHCSYKS 537 (596)
Q Consensus 528 ~~C~~C~~~f 537 (596)
-.|+.||-.+
T Consensus 22 ~~Cd~cg~~L 31 (36)
T PF05191_consen 22 GVCDNCGGEL 31 (36)
T ss_dssp TBCTTTTEBE
T ss_pred CccCCCCCee
Confidence 4455555443
No 148
>PF03811 Zn_Tnp_IS1: InsA N-terminal domain; InterPro: IPR003220 Insertion elements are mobile elements in DNA, usually encoding proteins required for transposition, for example transposases. Protein InsA is absolutely required for transposition of insertion element 1. This entry represents a short zinc binding domain found in IS1 InsA family protein. It is found at the N terminus of the protein and may be a DNA-binding domain.; GO: 0006313 transposition, DNA-mediated
Probab=37.48 E-value=35 Score=21.06 Aligned_cols=19 Identities=26% Similarity=0.611 Sum_probs=9.9
Q ss_pred cccCCCCCCCCCcccCCCC
Q psy7449 487 LCSGVPRPDGSFRFVCFTC 505 (596)
Q Consensus 487 l~~h~~~~~~~~~~~C~~C 505 (596)
+..|-....|...|.|..|
T Consensus 17 v~k~G~~~~G~qryrC~~C 35 (36)
T PF03811_consen 17 VKKNGKSPSGHQRYRCKDC 35 (36)
T ss_pred ceeCCCCCCCCEeEecCcC
Confidence 3344444455555666655
No 149
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=36.98 E-value=17 Score=21.36 Aligned_cols=11 Identities=27% Similarity=0.682 Sum_probs=2.3
Q ss_pred ecCCCCccccc
Q psy7449 466 ICHHCRDLLPY 476 (596)
Q Consensus 466 ~C~~C~~~~~~ 476 (596)
.|+.|+..+.+
T Consensus 4 ~Cp~C~se~~y 14 (30)
T PF08274_consen 4 KCPLCGSEYTY 14 (30)
T ss_dssp --TTT-----E
T ss_pred CCCCCCCccee
Confidence 45666655544
No 150
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=36.68 E-value=15 Score=28.09 Aligned_cols=15 Identities=13% Similarity=0.443 Sum_probs=8.9
Q ss_pred CCcccCCCCcccccc
Q psy7449 403 KKRFQCPVCDYRCTK 417 (596)
Q Consensus 403 ~~~~~C~~C~~~f~~ 417 (596)
...+.|..|++.|.-
T Consensus 51 ~GIW~C~~C~~~~AG 65 (91)
T TIGR00280 51 TGIWTCRKCGAKFAG 65 (91)
T ss_pred eEEEEcCCCCCEEeC
Confidence 345667777666643
No 151
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=36.30 E-value=19 Score=32.79 Aligned_cols=26 Identities=23% Similarity=0.589 Sum_probs=11.5
Q ss_pred cccCCCCcccccchhhHHHHhhhccc
Q psy7449 405 RFQCPVCDYRCTKKYNLQMHALTKHD 430 (596)
Q Consensus 405 ~~~C~~C~~~f~~~~~l~~H~~~~H~ 430 (596)
.|.|++|+|.|.-..-...|+...|.
T Consensus 77 K~~C~lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 77 KWRCPLCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp EEEE-SSS-EESSHHHHHHHHHHH-H
T ss_pred EECCCCCCcccCChHHHHHHHhhcCH
Confidence 35555555555555555555554443
No 152
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=36.16 E-value=36 Score=22.50 Aligned_cols=12 Identities=17% Similarity=0.310 Sum_probs=8.1
Q ss_pred eecCCCCccccc
Q psy7449 465 VICHHCRDLLPY 476 (596)
Q Consensus 465 ~~C~~C~~~~~~ 476 (596)
|+|.+||..+..
T Consensus 2 y~C~~CgyvYd~ 13 (47)
T PF00301_consen 2 YQCPVCGYVYDP 13 (47)
T ss_dssp EEETTTSBEEET
T ss_pred cCCCCCCEEEcC
Confidence 677777766654
No 153
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.64 E-value=23 Score=27.67 Aligned_cols=12 Identities=8% Similarity=-0.299 Sum_probs=5.9
Q ss_pred CcccCCCCCccc
Q psy7449 498 FRFVCFTCNYHT 509 (596)
Q Consensus 498 ~~~~C~~C~~~f 509 (596)
.|..|++||++|
T Consensus 25 dPiVsPytG~s~ 36 (129)
T COG4530 25 DPIVSPYTGKSY 36 (129)
T ss_pred CccccCcccccc
Confidence 344455555554
No 154
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=34.25 E-value=37 Score=39.22 Aligned_cols=11 Identities=18% Similarity=0.673 Sum_probs=7.5
Q ss_pred ccccCcccccc
Q psy7449 376 PFKCTHCRYSA 386 (596)
Q Consensus 376 ~~~C~~C~~~f 386 (596)
+|.|+.||...
T Consensus 692 vy~CPsCGaev 702 (1337)
T PRK14714 692 VYVCPDCGAEV 702 (1337)
T ss_pred ceeCccCCCcc
Confidence 56777777754
No 155
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=33.94 E-value=32 Score=37.78 Aligned_cols=9 Identities=22% Similarity=1.021 Sum_probs=5.2
Q ss_pred eecCCCCcc
Q psy7449 465 VICHHCRDL 473 (596)
Q Consensus 465 ~~C~~C~~~ 473 (596)
..|..||..
T Consensus 436 l~C~~Cg~v 444 (730)
T COG1198 436 LLCRDCGYI 444 (730)
T ss_pred eecccCCCc
Confidence 356666644
No 156
>KOG2272|consensus
Probab=33.60 E-value=23 Score=32.42 Aligned_cols=28 Identities=14% Similarity=0.610 Sum_probs=14.4
Q ss_pred eecCCCccccchhHhhhcccccccccCC
Q psy7449 172 YKCSSCDFQATQKIQVKKHVLIICQHCD 199 (596)
Q Consensus 172 ~~C~~C~~~f~~~~~l~~H~~~~c~~c~ 199 (596)
|.|++|.+...+...++.--+..|..|+
T Consensus 100 F~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn 127 (332)
T KOG2272|consen 100 FRCDLCNKHLADQGFYRNQGRALCRECN 127 (332)
T ss_pred chhHHHHHHHhhhhhHhhcchHHhhhhh
Confidence 5566666555555555444444444443
No 157
>KOG4377|consensus
Probab=33.50 E-value=32 Score=34.03 Aligned_cols=64 Identities=23% Similarity=0.365 Sum_probs=37.7
Q ss_pred ccc--ccccccccCchHHHhHHhhhcC------------CCCccc--CcCcccccChhHHHHHHhhcCCC-------CCc
Q psy7449 60 NTC--CICGHVESGAWRLKRHFLKHTG------------EKSYEC--CWCTYKTAYENDLKKHQRKHTGE-------KPY 116 (596)
Q Consensus 60 ~~C--~~C~~~f~~~~~l~~H~~~h~~------------~~~~~C--~~C~~~f~~~~~l~~H~~~h~~~-------~~~ 116 (596)
|.| +.|+-.+.++..+.+|...|.. ...|.| ..|.+ +-++...|...|+.. --|
T Consensus 272 yhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gfrrthf 348 (480)
T KOG4377|consen 272 YHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGFRRTHF 348 (480)
T ss_pred hcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCceeccee
Confidence 445 4677777778888888887752 123566 34777 344566666655321 125
Q ss_pred ccCCCC--cccc
Q psy7449 117 SCGHCP--YTAR 126 (596)
Q Consensus 117 ~C~~C~--~~f~ 126 (596)
.|..++ .+|.
T Consensus 349 hC~r~gCTdtfK 360 (480)
T KOG4377|consen 349 HCQRIGCTDTFK 360 (480)
T ss_pred EEeccCCccccc
Confidence 676666 5555
No 158
>KOG2272|consensus
Probab=33.36 E-value=37 Score=31.10 Aligned_cols=127 Identities=15% Similarity=0.231 Sum_probs=60.0
Q ss_pred CccccccccccccCchHHHhHHh----------hhcCCCCcccCcCcccccChhHHHHHHhhcCCCCCcccCCCCccccC
Q psy7449 58 NKNTCCICGHVESGAWRLKRHFL----------KHTGEKSYECCWCTYKTAYENDLKKHQRKHTGEKPYSCGHCPYTARS 127 (596)
Q Consensus 58 ~~~~C~~C~~~f~~~~~l~~H~~----------~h~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~ 127 (596)
.-|.|++|.+...+...++.--+ .-.+...|.|..|-..... ..|.---..-+ .--|+|..|++...+
T Consensus 98 ~CF~Cd~Cn~~Lad~gf~rnqgr~LC~~Cn~k~Ka~~~g~YvC~KCh~~iD~-~~l~fr~d~yH-~yHFkCt~C~keL~s 175 (332)
T KOG2272|consen 98 ACFRCDLCNKHLADQGFYRNQGRALCRECNQKEKAKGRGRYVCQKCHAHIDE-QPLTFRGDPYH-PYHFKCTTCGKELTS 175 (332)
T ss_pred ccchhHHHHHHHhhhhhHhhcchHHhhhhhhhhcccccceeehhhhhhhccc-ccccccCCCCC-ccceecccccccccc
Confidence 34678899887777655543221 1112336777777544333 11111000011 122679999888775
Q ss_pred hhHHHHH----hhhccCCCCccCCCCCccccCch-hHHhHhhhccCCCceecCCCccccchhHhhhc
Q psy7449 128 SSYMLTH----LKVHSGEKAFKCEYCSYSSYSKS-NLRTHMRTHIGEKPYKCSSCDFQATQKIQVKK 189 (596)
Q Consensus 128 ~~~l~~H----~~~h~~~~~~~C~~C~~~f~~~~-~L~~H~~~h~~~~~~~C~~C~~~f~~~~~l~~ 189 (596)
.+.-..- ++-| ..+.+++||.-=.-.. ....-|-.|-...-|.|..|.+.|.......+
T Consensus 176 daRevk~eLyClrCh---D~mgipiCgaC~rpIeervi~amgKhWHveHFvCa~CekPFlGHrHYEk 239 (332)
T KOG2272|consen 176 DAREVKGELYCLRCH---DKMGIPICGACRRPIEERVIFAMGKHWHVEHFVCAKCEKPFLGHRHYEK 239 (332)
T ss_pred hhhhhccceeccccc---cccCCcccccccCchHHHHHHHhccccchhheeehhcCCcccchhhhhh
Confidence 4432111 0111 1244555553221111 11112223333445888888888876554443
No 159
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=33.19 E-value=33 Score=23.03 Aligned_cols=11 Identities=36% Similarity=0.969 Sum_probs=6.1
Q ss_pred cccCCCCCccc
Q psy7449 499 RFVCFTCNYHT 509 (596)
Q Consensus 499 ~~~C~~C~~~f 509 (596)
.+.|..||.++
T Consensus 37 r~~C~~Cgyt~ 47 (50)
T PRK00432 37 RWHCGKCGYTE 47 (50)
T ss_pred cEECCCcCCEE
Confidence 45666666543
No 160
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=32.27 E-value=30 Score=23.52 Aligned_cols=25 Identities=20% Similarity=0.562 Sum_probs=13.6
Q ss_pred cccCCCCccccChhHHHHHhhhccCCCCccCCCCCcc
Q psy7449 116 YSCGHCPYTARSSSYMLTHLKVHSGEKAFKCEYCSYS 152 (596)
Q Consensus 116 ~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~ 152 (596)
.+|+.|++.|.. ++....|+.|+..
T Consensus 6 ~~C~~Cg~~~~~------------~dDiVvCp~Cgap 30 (54)
T PF14446_consen 6 CKCPVCGKKFKD------------GDDIVVCPECGAP 30 (54)
T ss_pred ccChhhCCcccC------------CCCEEECCCCCCc
Confidence 356666666643 2334566666654
No 161
>COG5151 SSL1 RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair]
Probab=32.17 E-value=73 Score=30.27 Aligned_cols=25 Identities=20% Similarity=0.149 Sum_probs=13.4
Q ss_pred CCcccCCCCCcccCHHHHHHhhhhc
Q psy7449 347 FNYVCYTCDYHTGVRECMKKHIRTH 371 (596)
Q Consensus 347 ~~~~C~~C~~~f~~~~~l~~H~~~h 371 (596)
..|.|+.|...|-..-+.-.|...|
T Consensus 387 ~rY~Ce~CK~~FC~dCdvfiHe~Lh 411 (421)
T COG5151 387 GRYQCELCKSTFCSDCDVFIHETLH 411 (421)
T ss_pred cceechhhhhhhhhhhHHHHHHHHh
Confidence 3466666655555555555554443
No 162
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=32.15 E-value=26 Score=22.20 Aligned_cols=20 Identities=15% Similarity=0.292 Sum_probs=12.6
Q ss_pred CcccCCccCcCccChhhHHH
Q psy7449 554 KPYACSLCDLKFSQNSQLKY 573 (596)
Q Consensus 554 ~~~~C~~C~~~F~~~~~l~~ 573 (596)
..+.|+.|+-.+....+|.+
T Consensus 18 ~id~C~~C~G~W~d~~el~~ 37 (41)
T PF13453_consen 18 EIDVCPSCGGIWFDAGELEK 37 (41)
T ss_pred EEEECCCCCeEEccHHHHHH
Confidence 34567777776666666543
No 163
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=31.29 E-value=19 Score=27.46 Aligned_cols=15 Identities=13% Similarity=0.481 Sum_probs=9.0
Q ss_pred CCcccCCCCcccccc
Q psy7449 403 KKRFQCPVCDYRCTK 417 (596)
Q Consensus 403 ~~~~~C~~C~~~f~~ 417 (596)
...+.|..|++.|.-
T Consensus 52 ~GIW~C~~C~~~~AG 66 (90)
T PRK03976 52 TGIWECRKCGAKFAG 66 (90)
T ss_pred EEEEEcCCCCCEEeC
Confidence 345667777666643
No 164
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=30.94 E-value=22 Score=22.31 Aligned_cols=11 Identities=27% Similarity=0.706 Sum_probs=6.6
Q ss_pred cccCCCCCccc
Q psy7449 499 RFVCFTCNYHT 509 (596)
Q Consensus 499 ~~~C~~C~~~f 509 (596)
-|.|..||..|
T Consensus 28 fy~C~~C~~~w 38 (39)
T PF01096_consen 28 FYVCCNCGHRW 38 (39)
T ss_dssp EEEESSSTEEE
T ss_pred EEEeCCCCCee
Confidence 36666666554
No 165
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=30.58 E-value=38 Score=28.41 Aligned_cols=14 Identities=14% Similarity=0.539 Sum_probs=7.8
Q ss_pred cceecCCCCccccc
Q psy7449 463 ETVICHHCRDLLPY 476 (596)
Q Consensus 463 ~~~~C~~C~~~~~~ 476 (596)
..+.|..||..|..
T Consensus 69 ~~~~C~~CG~~~~~ 82 (135)
T PRK03824 69 AVLKCRNCGNEWSL 82 (135)
T ss_pred eEEECCCCCCEEec
Confidence 34566666655543
No 166
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=30.37 E-value=53 Score=22.65 Aligned_cols=21 Identities=19% Similarity=0.550 Sum_probs=10.0
Q ss_pred chhHHhHhhhccCCCceecCC
Q psy7449 156 KSNLRTHMRTHIGEKPYKCSS 176 (596)
Q Consensus 156 ~~~L~~H~~~h~~~~~~~C~~ 176 (596)
+..|..|+...-...+..|+.
T Consensus 23 r~~l~~H~~~~C~~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECPKRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTSTTSEEE-SS
T ss_pred HHHHHHHHHccCCCCcEECCC
Confidence 445555555433444455555
No 167
>cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer.
Probab=29.93 E-value=43 Score=22.47 Aligned_cols=12 Identities=17% Similarity=0.357 Sum_probs=7.7
Q ss_pred eecCCCCccccc
Q psy7449 465 VICHHCRDLLPY 476 (596)
Q Consensus 465 ~~C~~C~~~~~~ 476 (596)
|+|.+||..+..
T Consensus 2 y~C~~CgyiYd~ 13 (50)
T cd00730 2 YECRICGYIYDP 13 (50)
T ss_pred cCCCCCCeEECC
Confidence 567777766653
No 168
>KOG2807|consensus
Probab=29.58 E-value=63 Score=31.08 Aligned_cols=20 Identities=15% Similarity=0.386 Sum_probs=10.0
Q ss_pred ccccCccccccccchhhhhh
Q psy7449 376 PFKCTHCRYSASRRDTLDHH 395 (596)
Q Consensus 376 ~~~C~~C~~~f~~~~~L~~H 395 (596)
.|.|+.|...|-.--..-.|
T Consensus 345 ~y~C~~Ck~~FCldCDv~iH 364 (378)
T KOG2807|consen 345 RYRCESCKNVFCLDCDVFIH 364 (378)
T ss_pred cEEchhccceeeccchHHHH
Confidence 35555555555544444444
No 169
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=28.28 E-value=30 Score=19.94 Aligned_cols=9 Identities=22% Similarity=0.741 Sum_probs=4.3
Q ss_pred cCCCCCccc
Q psy7449 501 VCFTCNYHT 509 (596)
Q Consensus 501 ~C~~C~~~f 509 (596)
.|-.|++.|
T Consensus 2 sCiDC~~~F 10 (28)
T PF08790_consen 2 SCIDCSKDF 10 (28)
T ss_dssp EETTTTEEE
T ss_pred eeecCCCCc
Confidence 344455544
No 170
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=28.26 E-value=34 Score=28.15 Aligned_cols=13 Identities=15% Similarity=0.480 Sum_probs=9.1
Q ss_pred ceecCCCCccccc
Q psy7449 464 TVICHHCRDLLPY 476 (596)
Q Consensus 464 ~~~C~~C~~~~~~ 476 (596)
|.+|..||+.|.+
T Consensus 1 PH~Ct~Cg~~f~d 13 (131)
T PF09845_consen 1 PHQCTKCGRVFED 13 (131)
T ss_pred CcccCcCCCCcCC
Confidence 4567777777766
No 171
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=28.13 E-value=47 Score=20.99 Aligned_cols=11 Identities=27% Similarity=0.652 Sum_probs=6.1
Q ss_pred cccCCCCCccc
Q psy7449 499 RFVCFTCNYHT 509 (596)
Q Consensus 499 ~~~C~~C~~~f 509 (596)
-|.|..|+..+
T Consensus 28 fy~C~~C~~~w 38 (40)
T smart00440 28 FYVCTKCGHRW 38 (40)
T ss_pred EEEeCCCCCEe
Confidence 35666666544
No 172
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=28.09 E-value=65 Score=29.68 Aligned_cols=12 Identities=17% Similarity=0.426 Sum_probs=7.0
Q ss_pred cccCCCCCcccC
Q psy7449 349 YVCYTCDYHTGV 360 (596)
Q Consensus 349 ~~C~~C~~~f~~ 360 (596)
|.|+.|+..|..
T Consensus 156 f~C~~C~h~F~G 167 (278)
T PF15135_consen 156 FHCPKCRHNFRG 167 (278)
T ss_pred eecccccccchh
Confidence 566666665554
No 173
>KOG4167|consensus
Probab=27.29 E-value=17 Score=38.68 Aligned_cols=25 Identities=28% Similarity=0.677 Sum_probs=23.3
Q ss_pred CccCccCcccccChHHHHHHHHHhc
Q psy7449 255 PFQCKICDSTFKRKQTLDLHIKRHT 279 (596)
Q Consensus 255 ~~~C~~C~~~f~~~~~l~~H~~~h~ 279 (596)
-|.|..|++.|.....+..||++|.
T Consensus 792 iFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHHHHHHHHHHhhhhHHHHHHH
Confidence 3899999999999999999999985
No 174
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=27.02 E-value=53 Score=21.88 Aligned_cols=13 Identities=46% Similarity=0.981 Sum_probs=7.7
Q ss_pred cccCCCCCccccc
Q psy7449 499 RFVCFTCNYHTYK 511 (596)
Q Consensus 499 ~~~C~~C~~~f~~ 511 (596)
.|.|+.||..+.-
T Consensus 20 ~~vC~~Cg~~~~~ 32 (52)
T smart00661 20 RFVCRKCGYEEPI 32 (52)
T ss_pred EEECCcCCCeEEC
Confidence 4666666655543
No 175
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=26.67 E-value=32 Score=23.17 Aligned_cols=20 Identities=15% Similarity=0.413 Sum_probs=7.7
Q ss_pred ccccCccccccccchhhhhh
Q psy7449 376 PFKCTHCRYSASRRDTLDHH 395 (596)
Q Consensus 376 ~~~C~~C~~~f~~~~~L~~H 395 (596)
.|+|+.|...|-..-.+-.|
T Consensus 21 ~y~C~~C~~~FC~dCD~fiH 40 (51)
T PF07975_consen 21 RYRCPKCKNHFCIDCDVFIH 40 (51)
T ss_dssp EE--TTTT--B-HHHHHTTT
T ss_pred eEECCCCCCccccCcChhhh
Confidence 45555555555554444444
No 176
>KOG3362|consensus
Probab=25.98 E-value=25 Score=29.13 Aligned_cols=22 Identities=14% Similarity=0.508 Sum_probs=16.6
Q ss_pred cccCCCCCccccccchhhhhhh
Q psy7449 499 RFVCFTCNYHTYKSGHMRQHIN 520 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~~l~~H~~ 520 (596)
+|.|.-||-.+-+..-|..|..
T Consensus 129 ~ysC~~CG~kyCsv~C~~~Hne 150 (156)
T KOG3362|consen 129 KYSCVNCGTKYCSVRCLKTHNE 150 (156)
T ss_pred hhHHHhcCCceeechhhhhccc
Confidence 4778888888877777777754
No 177
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=25.84 E-value=29 Score=23.07 Aligned_cols=10 Identities=20% Similarity=0.677 Sum_probs=4.0
Q ss_pred cccCcccccc
Q psy7449 377 FKCTHCRYSA 386 (596)
Q Consensus 377 ~~C~~C~~~f 386 (596)
+.|..||..|
T Consensus 5 l~C~dCg~~F 14 (49)
T PF13451_consen 5 LTCKDCGAEF 14 (49)
T ss_pred EEcccCCCeE
Confidence 3344444333
No 178
>KOG4167|consensus
Probab=25.49 E-value=19 Score=38.37 Aligned_cols=27 Identities=22% Similarity=0.262 Sum_probs=23.6
Q ss_pred CCccccccccccccCchHHHhHHhhhc
Q psy7449 57 QNKNTCCICGHVESGAWRLKRHFLKHT 83 (596)
Q Consensus 57 ~~~~~C~~C~~~f~~~~~l~~H~~~h~ 83 (596)
.+.|.|.+|+++|....++..||++|.
T Consensus 790 ~giFpCreC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred CceeehHHHHHHHHHHhhhhHHHHHHH
Confidence 567889999999999999999999886
No 179
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=25.40 E-value=48 Score=32.17 Aligned_cols=75 Identities=19% Similarity=0.267 Sum_probs=39.5
Q ss_pred cceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchh
Q psy7449 463 ETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSC 542 (596)
Q Consensus 463 ~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~ 542 (596)
..-.|++||..-.. +.+ .......|.+-..|..|+-.- |. ..-+|+.||.. ..
T Consensus 183 ~~~~CPvCGs~P~~--------s~~-~~~~~~~G~RyL~CslC~teW------------~~--~R~~C~~Cg~~----~~ 235 (305)
T TIGR01562 183 SRTLCPACGSPPVA--------SMV-RQGGKETGLRYLSCSLCATEW------------HY--VRVKCSHCEES----KH 235 (305)
T ss_pred CCCcCCCCCChhhh--------hhh-cccCCCCCceEEEcCCCCCcc------------cc--cCccCCCCCCC----Cc
Confidence 34579999954311 000 111123566778899997422 21 23679999974 22
Q ss_pred hhhhhhh----ccCCCcccCCccCcC
Q psy7449 543 LGRHMRI----HTGEKPYACSLCDLK 564 (596)
Q Consensus 543 L~~H~~~----H~~~~~~~C~~C~~~ 564 (596)
|...... ..+.+...|..|+--
T Consensus 236 l~y~~~e~~~~~~~~r~e~C~~C~~Y 261 (305)
T TIGR01562 236 LAYLSLEHDAEKAVLKAETCDSCQGY 261 (305)
T ss_pred eeeEeecCCCCCcceEEeeccccccc
Confidence 3222211 123355789988743
No 180
>PF07649 C1_3: C1-like domain; InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=25.11 E-value=50 Score=19.15 Aligned_cols=11 Identities=27% Similarity=1.219 Sum_probs=4.4
Q ss_pred ceeecCCCCcc
Q psy7449 226 YQYQCYSCEYH 236 (596)
Q Consensus 226 ~~~~C~~C~~~ 236 (596)
..|.|.+|+..
T Consensus 14 ~~Y~C~~Cdf~ 24 (30)
T PF07649_consen 14 WFYRCSECDFD 24 (30)
T ss_dssp -EEE-TTT---
T ss_pred ceEECccCCCc
Confidence 56777777654
No 181
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=24.44 E-value=38 Score=21.69 Aligned_cols=9 Identities=33% Similarity=0.999 Sum_probs=4.2
Q ss_pred ccCCCCccc
Q psy7449 406 FQCPVCDYR 414 (596)
Q Consensus 406 ~~C~~C~~~ 414 (596)
+.|+.||..
T Consensus 20 ~vC~~CG~V 28 (43)
T PF08271_consen 20 LVCPNCGLV 28 (43)
T ss_dssp EEETTT-BB
T ss_pred EECCCCCCE
Confidence 455555544
No 182
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=24.13 E-value=16 Score=35.45 Aligned_cols=75 Identities=16% Similarity=0.237 Sum_probs=28.6
Q ss_pred ceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhh
Q psy7449 464 TVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCL 543 (596)
Q Consensus 464 ~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L 543 (596)
.-.|++||..-. .+.+..... .|.+-..|..|+-.. +. ...+|+.||..-...-.+
T Consensus 172 ~g~CPvCGs~P~--------~s~l~~~~~--~G~R~L~Cs~C~t~W------------~~--~R~~Cp~Cg~~~~~~l~~ 227 (290)
T PF04216_consen 172 RGYCPVCGSPPV--------LSVLRGGER--EGKRYLHCSLCGTEW------------RF--VRIKCPYCGNTDHEKLEY 227 (290)
T ss_dssp -SS-TTT---EE--------EEEEE--------EEEEEETTT--EE------------E----TTS-TTT---SS-EEE-
T ss_pred CCcCCCCCCcCc--------eEEEecCCC--CccEEEEcCCCCCee------------ee--cCCCCcCCCCCCCcceee
Confidence 357999995432 112221111 256677899997432 11 235799999875433222
Q ss_pred hhhhhhccCCCcccCCccCc
Q psy7449 544 GRHMRIHTGEKPYACSLCDL 563 (596)
Q Consensus 544 ~~H~~~H~~~~~~~C~~C~~ 563 (596)
.. ...-.+.+.+.|..|+-
T Consensus 228 ~~-~e~~~~~rve~C~~C~~ 246 (290)
T PF04216_consen 228 FT-VEGEPAYRVEVCESCGS 246 (290)
T ss_dssp --------SEEEEEETTTTE
T ss_pred Ee-cCCCCcEEEEECCcccc
Confidence 10 11222445578988874
No 183
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=23.79 E-value=54 Score=27.26 Aligned_cols=13 Identities=23% Similarity=0.534 Sum_probs=7.1
Q ss_pred CcccCCCCCcccc
Q psy7449 498 FRFVCFTCNYHTY 510 (596)
Q Consensus 498 ~~~~C~~C~~~f~ 510 (596)
..|+|..|++.|.
T Consensus 52 qRyrC~~C~~tf~ 64 (129)
T COG3677 52 QRYKCKSCGSTFT 64 (129)
T ss_pred cccccCCcCccee
Confidence 3455666655554
No 184
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=23.54 E-value=73 Score=30.99 Aligned_cols=78 Identities=19% Similarity=0.352 Sum_probs=41.9
Q ss_pred CccceecCCCCcccccchhhhccccccccCCCCCCCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCc
Q psy7449 461 GHETVICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQS 540 (596)
Q Consensus 461 ~~~~~~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~ 540 (596)
++..-.|++||..-.. +.+ +.....|.+-..|..|+-.- |. ..-+|+.||..
T Consensus 184 ~~~~~~CPvCGs~P~~--------s~v--~~~~~~G~RyL~CslC~teW------------~~--~R~~C~~Cg~~---- 235 (309)
T PRK03564 184 GEQRQFCPVCGSMPVS--------SVV--QIGTTQGLRYLHCNLCESEW------------HV--VRVKCSNCEQS---- 235 (309)
T ss_pred ccCCCCCCCCCCcchh--------hee--eccCCCCceEEEcCCCCCcc------------cc--cCccCCCCCCC----
Confidence 3445689999964321 111 11223566778899997422 21 23679999962
Q ss_pred hhhhhhhh-h-ccCCCcccCCccCcCcc
Q psy7449 541 SCLGRHMR-I-HTGEKPYACSLCDLKFS 566 (596)
Q Consensus 541 ~~L~~H~~-~-H~~~~~~~C~~C~~~F~ 566 (596)
..|..... . ..+.+...|..|+--.+
T Consensus 236 ~~l~y~~~~~~~~~~r~e~C~~C~~YlK 263 (309)
T PRK03564 236 GKLHYWSLDSEQAAVKAESCGDCGTYLK 263 (309)
T ss_pred CceeeeeecCCCcceEeeecccccccce
Confidence 22332211 1 12345578999975433
No 185
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.33 E-value=40 Score=24.57 Aligned_cols=10 Identities=20% Similarity=0.750 Sum_probs=6.2
Q ss_pred ccCCCCCccc
Q psy7449 144 FKCEYCSYSS 153 (596)
Q Consensus 144 ~~C~~C~~~f 153 (596)
|+|..|+..|
T Consensus 13 Y~c~~cg~~~ 22 (82)
T COG2331 13 YECTECGNRF 22 (82)
T ss_pred EeecccchHH
Confidence 6666666554
No 186
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=23.26 E-value=52 Score=26.65 Aligned_cols=11 Identities=18% Similarity=0.776 Sum_probs=5.9
Q ss_pred eecCCCCcccc
Q psy7449 465 VICHHCRDLLP 475 (596)
Q Consensus 465 ~~C~~C~~~~~ 475 (596)
..|..||..|+
T Consensus 71 ~~C~~Cg~~~~ 81 (114)
T PRK03681 71 CWCETCQQYVT 81 (114)
T ss_pred EEcccCCCeee
Confidence 45555555544
No 187
>PF03833 PolC_DP2: DNA polymerase II large subunit DP2; InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=22.88 E-value=28 Score=38.24 Aligned_cols=11 Identities=27% Similarity=0.766 Sum_probs=0.0
Q ss_pred CccCCCCCccc
Q psy7449 527 PYKCLHCSYKS 537 (596)
Q Consensus 527 ~~~C~~C~~~f 537 (596)
-|.|+.|+...
T Consensus 680 ~~~Cp~C~~~~ 690 (900)
T PF03833_consen 680 VYVCPDCGIEV 690 (900)
T ss_dssp -----------
T ss_pred ceecccccccc
Confidence 36677776654
No 188
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=22.76 E-value=56 Score=34.61 Aligned_cols=47 Identities=23% Similarity=0.684 Sum_probs=0.0
Q ss_pred cccccCCCccccchhhhhhhhcccCCCCCCCCCceeecCCCCccCCChHHHHhhhhhcCCCCCccCccCcccccChHHHH
Q psy7449 193 IICQHCDQSLFGTTQQLADHCRTCDKMPRPDRSYQYQCYSCEYHSLYSSRMIKHIRNHTGERPFQCKICDSTFKRKQTLD 272 (596)
Q Consensus 193 ~~c~~c~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~ 272 (596)
+.|..||.. ..|+.|+-.... |.......|..||
T Consensus 214 ~~C~~Cg~~--------------------------~~C~~C~~~l~~----------h~~~~~l~Ch~Cg---------- 247 (505)
T TIGR00595 214 LLCRSCGYI--------------------------LCCPNCDVSLTY----------HKKEGKLRCHYCG---------- 247 (505)
T ss_pred eEhhhCcCc--------------------------cCCCCCCCceEE----------ecCCCeEEcCCCc----------
Q ss_pred HHHHHhcCCCceeeCCCC
Q psy7449 273 LHIKRHTGERTYKCEFCD 290 (596)
Q Consensus 273 ~H~~~h~~~~~~~C~~C~ 290 (596)
+...-|..|+.|+
T Consensus 248 -----~~~~~~~~Cp~C~ 260 (505)
T TIGR00595 248 -----YQEPIPKTCPQCG 260 (505)
T ss_pred -----CcCCCCCCCCCCC
No 189
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=22.24 E-value=27 Score=37.58 Aligned_cols=73 Identities=18% Similarity=0.303 Sum_probs=0.0
Q ss_pred ecCCCCcccccchhhhccccccccCCCCCCCCCcc-----cCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCc
Q psy7449 466 ICHHCRDLLPYKIKDLVSHGELCSGVPRPDGSFRF-----VCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQS 540 (596)
Q Consensus 466 ~C~~C~~~~~~~~~~~~~~~~l~~h~~~~~~~~~~-----~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~ 540 (596)
.|+.|-+.+.+ .+++.| .|..||=+|+-...|---...-+-..--.|+.|.+.|...
T Consensus 103 ~C~~Cl~Ei~d------------------p~~rrY~YPF~~CT~CGPRfTIi~alPYDR~nTsM~~F~lC~~C~~EY~dP 164 (750)
T COG0068 103 TCEDCLEEIFD------------------PNSRRYLYPFINCTNCGPRFTIIEALPYDRENTSMADFPLCPFCDKEYKDP 164 (750)
T ss_pred hhHHHHHHhcC------------------CCCcceeccccccCCCCcceeeeccCCCCcccCccccCcCCHHHHHHhcCc
Q ss_pred hhhhhhhhhccCCCcccCCccC
Q psy7449 541 SCLGRHMRIHTGEKPYACSLCD 562 (596)
Q Consensus 541 ~~L~~H~~~H~~~~~~~C~~C~ 562 (596)
.+-+-| ..|..|+.||
T Consensus 165 ~nRRfH------AQp~aCp~CG 180 (750)
T COG0068 165 LNRRFH------AQPIACPKCG 180 (750)
T ss_pred cccccc------cccccCcccC
No 190
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=22.20 E-value=50 Score=26.81 Aligned_cols=12 Identities=17% Similarity=0.348 Sum_probs=6.3
Q ss_pred ccCCCCCccccc
Q psy7449 500 FVCFTCNYHTYK 511 (596)
Q Consensus 500 ~~C~~C~~~f~~ 511 (596)
..|..||..|..
T Consensus 71 ~~C~~Cg~~~~~ 82 (115)
T TIGR00100 71 CECEDCSEEVSP 82 (115)
T ss_pred EEcccCCCEEec
Confidence 455556555433
No 191
>KOG4124|consensus
Probab=22.09 E-value=28 Score=33.53 Aligned_cols=24 Identities=42% Similarity=0.959 Sum_probs=16.1
Q ss_pred CCcccCCccCcCccChhhHHHHHh
Q psy7449 553 EKPYACSLCDLKFSQNSQLKYHLT 576 (596)
Q Consensus 553 ~~~~~C~~C~~~F~~~~~l~~H~~ 576 (596)
.|+|+|++|.+.+.....|.-|..
T Consensus 396 nk~~r~~i~~~~~k~~~~l~~~~~ 419 (442)
T KOG4124|consen 396 NKPYRCEVCSKRYKNLNGLKYHRT 419 (442)
T ss_pred cCcccChhhhhhhccCCCCCceee
Confidence 567788888777776666655544
No 192
>PF05290 Baculo_IE-1: Baculovirus immediate-early protein (IE-0); InterPro: IPR007954 This entry contains the Baculovirus immediate-early protein IE-0.
Probab=22.05 E-value=49 Score=27.23 Aligned_cols=15 Identities=40% Similarity=0.669 Sum_probs=8.1
Q ss_pred CCCcccCcCcccccC
Q psy7449 85 EKSYECCWCTYKTAY 99 (596)
Q Consensus 85 ~~~~~C~~C~~~f~~ 99 (596)
.+.|.|.+|..+...
T Consensus 78 ~~lYeCnIC~etS~e 92 (140)
T PF05290_consen 78 PKLYECNICKETSAE 92 (140)
T ss_pred CCceeccCcccccch
Confidence 355666666555443
No 193
>cd00065 FYVE FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;
Probab=21.90 E-value=65 Score=21.95 Aligned_cols=11 Identities=18% Similarity=0.410 Sum_probs=6.5
Q ss_pred cCCCCCccccc
Q psy7449 501 VCFTCNYHTYK 511 (596)
Q Consensus 501 ~C~~C~~~f~~ 511 (596)
.|..|++.|..
T Consensus 4 ~C~~C~~~F~~ 14 (57)
T cd00065 4 SCMGCGKPFTL 14 (57)
T ss_pred cCcccCccccC
Confidence 36666666654
No 194
>KOG3408|consensus
Probab=21.81 E-value=34 Score=27.52 Aligned_cols=25 Identities=20% Similarity=0.526 Sum_probs=17.9
Q ss_pred CCCCccCCCCCccccCchhhhhhhh
Q psy7449 524 GIKPYKCLHCSYKSTQSSCLGRHMR 548 (596)
Q Consensus 524 ~~~~~~C~~C~~~f~~~~~L~~H~~ 548 (596)
|...|.|-.|.+-|.+..+|..|.+
T Consensus 54 G~GqfyCi~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 54 GGGQFYCIECARYFIDAKALKTHFK 78 (129)
T ss_pred CCceeehhhhhhhhcchHHHHHHHh
Confidence 4455777777777777777777765
No 195
>KOG2461|consensus
Probab=21.62 E-value=2.2e+02 Score=28.91 Aligned_cols=69 Identities=10% Similarity=-0.161 Sum_probs=33.6
Q ss_pred CCCCCcccCCCCCccccccchhhhhhhccCCCCCccCCCCCccccCchhhhhhhhhccCCCcccCCccC
Q psy7449 494 PDGSFRFVCFTCNYHTYKSGHMRQHINGHVGIKPYKCLHCSYKSTQSSCLGRHMRIHTGEKPYACSLCD 562 (596)
Q Consensus 494 ~~~~~~~~C~~C~~~f~~~~~l~~H~~~H~~~~~~~C~~C~~~f~~~~~L~~H~~~H~~~~~~~C~~C~ 562 (596)
+.+..+..+.++.+.+.....+..|...+.++.++.+..+...+.....+..+..+|+....+.+..|+
T Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 394 (396)
T KOG2461|consen 326 ATVSVWTGETIPVRTPAGQLIYTQSHSMEVAEPTDMAPNQIWKIYHTGVLGFLIITTDESECNNMSFVC 394 (396)
T ss_pred ccccccCcCcccccccccccchhhhhhcccCCCCcccccccccceeccccceeeeeccccccccccccC
Confidence 333444444555555555555555555555554444444444444444444455555555555554443
No 196
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=21.42 E-value=39 Score=20.51 Aligned_cols=8 Identities=25% Similarity=0.957 Sum_probs=1.9
Q ss_pred cCCCCccc
Q psy7449 467 CHHCRDLL 474 (596)
Q Consensus 467 C~~C~~~~ 474 (596)
|.+|++.|
T Consensus 6 C~eC~~~f 13 (34)
T PF01286_consen 6 CDECGKPF 13 (34)
T ss_dssp -TTT--EE
T ss_pred HhHhCCHH
Confidence 33333333
No 197
>KOG1701|consensus
Probab=21.04 E-value=20 Score=35.68 Aligned_cols=36 Identities=31% Similarity=0.704 Sum_probs=18.4
Q ss_pred eeCCCCCcccccccccccccc-------cccccccCCcccccc
Q psy7449 285 KCEFCDYTAARKDTLGVHVRI-------KHKISICQNCGLILS 320 (596)
Q Consensus 285 ~C~~C~~~f~~~~~l~~H~~~-------~h~~~~C~~C~~~~~ 320 (596)
+|.+|++-..-...-..=+++ |..-|.|+.||..++
T Consensus 396 rCs~C~~PI~P~~G~~etvRvvamdr~fHv~CY~CEDCg~~LS 438 (468)
T KOG1701|consen 396 RCSVCGNPILPRDGKDETVRVVAMDRDFHVNCYKCEDCGLLLS 438 (468)
T ss_pred chhhccCCccCCCCCcceEEEEEccccccccceehhhcCcccc
Confidence 577776655444333322222 334566666666654
No 198
>PRK14873 primosome assembly protein PriA; Provisional
Probab=21.02 E-value=64 Score=35.41 Aligned_cols=46 Identities=17% Similarity=0.585 Sum_probs=0.0
Q ss_pred cccccCCCccccchhhhhhhhcccCCCCCCCCCceeecCCCCccCCChHHHHhhhhhcCCCCCccCccCcccccChHHHH
Q psy7449 193 IICQHCDQSLFGTTQQLADHCRTCDKMPRPDRSYQYQCYSCEYHSLYSSRMIKHIRNHTGERPFQCKICDSTFKRKQTLD 272 (596)
Q Consensus 193 ~~c~~c~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~C~~C~~~f~~~~~l~~H~~~h~~~~~~~C~~C~~~f~~~~~l~ 272 (596)
+.|..||.. ..|+.|+-.... |.......|..||..
T Consensus 384 l~C~~Cg~~--------------------------~~C~~C~~~L~~----------h~~~~~l~Ch~CG~~-------- 419 (665)
T PRK14873 384 LACARCRTP--------------------------ARCRHCTGPLGL----------PSAGGTPRCRWCGRA-------- 419 (665)
T ss_pred eEhhhCcCe--------------------------eECCCCCCceeE----------ecCCCeeECCCCcCC--------
Q ss_pred HHHHHhcCCCceeeCCCC
Q psy7449 273 LHIKRHTGERTYKCEFCD 290 (596)
Q Consensus 273 ~H~~~h~~~~~~~C~~C~ 290 (596)
..++.|+.|+
T Consensus 420 --------~~p~~Cp~Cg 429 (665)
T PRK14873 420 --------APDWRCPRCG 429 (665)
T ss_pred --------CcCccCCCCc
No 199
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=20.92 E-value=53 Score=22.73 Aligned_cols=15 Identities=13% Similarity=0.064 Sum_probs=7.0
Q ss_pred cccCCCCCccccccc
Q psy7449 499 RFVCFTCNYHTYKSG 513 (596)
Q Consensus 499 ~~~C~~C~~~f~~~~ 513 (596)
|++...||-.|...+
T Consensus 24 PV~s~~C~H~fek~a 38 (57)
T PF11789_consen 24 PVKSKKCGHTFEKEA 38 (57)
T ss_dssp EEEESSS--EEEHHH
T ss_pred CcCcCCCCCeecHHH
Confidence 455556666665443
No 200
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=20.74 E-value=54 Score=25.42 Aligned_cols=14 Identities=29% Similarity=0.650 Sum_probs=8.5
Q ss_pred ccccCccccccccc
Q psy7449 376 PFKCTHCRYSASRR 389 (596)
Q Consensus 376 ~~~C~~C~~~f~~~ 389 (596)
|.+|..||..|.+-
T Consensus 2 pH~CtrCG~vf~~g 15 (112)
T COG3364 2 PHQCTRCGEVFDDG 15 (112)
T ss_pred Cceecccccccccc
Confidence 45666666666654
No 201
>PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=20.60 E-value=73 Score=19.89 Aligned_cols=22 Identities=27% Similarity=0.492 Sum_probs=7.9
Q ss_pred cccCCccCcCcc-Ch-hhHHHHHh
Q psy7449 555 PYACSLCDLKFS-QN-SQLKYHLT 576 (596)
Q Consensus 555 ~~~C~~C~~~F~-~~-~~l~~H~~ 576 (596)
.|-|++|+..|. +. ..-+.|.+
T Consensus 3 ryyCdyC~~~~~~d~~~~Rk~H~~ 26 (38)
T PF06220_consen 3 RYYCDYCKKYLTHDSPSIRKQHER 26 (38)
T ss_dssp S-B-TTT--B-S--SHHHHHHHT-
T ss_pred CeecccccceecCCChHHHHHhhc
Confidence 366777777773 32 23355543
No 202
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=20.22 E-value=64 Score=26.08 Aligned_cols=11 Identities=18% Similarity=0.749 Sum_probs=5.3
Q ss_pred eecCCCCcccc
Q psy7449 465 VICHHCRDLLP 475 (596)
Q Consensus 465 ~~C~~C~~~~~ 475 (596)
..|..||..|.
T Consensus 71 ~~C~~Cg~~~~ 81 (113)
T PRK12380 71 AWCWDCSQVVE 81 (113)
T ss_pred EEcccCCCEEe
Confidence 44555554443
No 203
>smart00064 FYVE Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=20.16 E-value=55 Score=23.47 Aligned_cols=10 Identities=20% Similarity=0.504 Sum_probs=2.9
Q ss_pred eecCCCCccc
Q psy7449 465 VICHHCRDLL 474 (596)
Q Consensus 465 ~~C~~C~~~~ 474 (596)
-.|.+|++.|
T Consensus 10 ~~C~~C~~~F 19 (69)
T PF01363_consen 10 SNCMICGKKF 19 (69)
T ss_dssp SB-TTT--B-
T ss_pred CcCcCcCCcC
Confidence 3455555555
No 205
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=20.08 E-value=49 Score=22.61 Aligned_cols=10 Identities=30% Similarity=0.816 Sum_probs=5.3
Q ss_pred ccCCCCCccc
Q psy7449 144 FKCEYCSYSS 153 (596)
Q Consensus 144 ~~C~~C~~~f 153 (596)
++|+.||..+
T Consensus 3 ~~CP~CG~~i 12 (54)
T TIGR01206 3 FECPDCGAEI 12 (54)
T ss_pred cCCCCCCCEE
Confidence 4555555544
No 206
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=20.07 E-value=82 Score=20.04 Aligned_cols=10 Identities=20% Similarity=0.570 Sum_probs=5.0
Q ss_pred ccCCCCcccc
Q psy7449 117 SCGHCPYTAR 126 (596)
Q Consensus 117 ~C~~C~~~f~ 126 (596)
.|+.||..|.
T Consensus 15 ~C~~CgM~Y~ 24 (41)
T PF13878_consen 15 TCPTCGMLYS 24 (41)
T ss_pred CCCCCCCEEC
Confidence 4555555443
Done!