Query psy11070
Match_columns 148
No_of_seqs 134 out of 1276
Neff 10.1
Searched_HMMs 46136
Date Fri Aug 16 17:16:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11070.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11070hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 5.4E-24 1.2E-28 146.4 4.3 107 29-144 128-240 (279)
2 KOG2462|consensus 99.9 3.1E-23 6.7E-28 142.7 6.9 104 29-141 159-265 (279)
3 KOG1074|consensus 99.6 3.2E-17 7E-22 126.9 0.0 56 85-145 604-659 (958)
4 KOG3576|consensus 99.6 2.3E-16 5E-21 104.6 3.6 89 50-145 111-199 (267)
5 KOG3623|consensus 99.6 1.6E-16 3.5E-21 121.7 1.3 81 54-141 892-972 (1007)
6 KOG3623|consensus 99.5 3.4E-15 7.5E-20 114.6 0.2 112 23-141 202-331 (1007)
7 KOG3576|consensus 99.5 6.2E-15 1.3E-19 97.9 0.8 106 32-143 118-236 (267)
8 KOG1074|consensus 99.3 8.7E-13 1.9E-17 102.8 -0.0 85 55-146 604-695 (958)
9 KOG3608|consensus 99.2 4.1E-12 8.9E-17 90.9 2.8 105 32-145 180-318 (467)
10 PHA00733 hypothetical protein 99.1 6.5E-11 1.4E-15 75.1 4.5 86 53-147 37-127 (128)
11 PHA02768 hypothetical protein; 98.9 6.1E-10 1.3E-14 59.3 2.0 40 57-102 6-45 (55)
12 KOG3608|consensus 98.9 4.4E-09 9.5E-14 75.6 6.3 81 57-143 264-345 (467)
13 PHA02768 hypothetical protein; 98.8 1.7E-09 3.8E-14 57.6 1.9 43 86-135 5-47 (55)
14 PF13465 zf-H2C2_2: Zinc-finge 98.8 2.7E-09 5.8E-14 48.8 2.1 22 103-125 1-22 (26)
15 PLN03086 PRLI-interacting fact 98.8 2.4E-08 5.2E-13 76.9 7.9 82 53-146 450-541 (567)
16 PF13465 zf-H2C2_2: Zinc-finge 98.7 2.4E-08 5.1E-13 45.6 2.7 26 73-100 1-26 (26)
17 PHA00733 hypothetical protein 98.6 1.1E-07 2.4E-12 60.3 5.4 56 52-113 69-124 (128)
18 PLN03086 PRLI-interacting fact 98.6 1E-07 2.3E-12 73.5 5.8 85 47-144 471-565 (567)
19 KOG3993|consensus 98.3 1.1E-07 2.4E-12 69.9 0.5 86 56-147 267-384 (500)
20 PHA00616 hypothetical protein 98.3 2.7E-07 5.8E-12 46.9 1.6 31 87-120 2-32 (44)
21 COG5189 SFP1 Putative transcri 98.3 3.2E-07 6.9E-12 65.4 1.7 73 53-140 346-419 (423)
22 PHA00616 hypothetical protein 98.3 4.8E-07 1E-11 46.0 1.7 33 56-90 1-33 (44)
23 PHA00732 hypothetical protein 98.1 3.6E-06 7.8E-11 48.8 2.8 48 56-113 1-49 (79)
24 PHA00732 hypothetical protein 98.1 4E-06 8.7E-11 48.6 2.9 46 86-142 1-47 (79)
25 PF00096 zf-C2H2: Zinc finger, 97.8 2E-05 4.2E-10 34.6 2.2 22 57-80 1-22 (23)
26 PF05605 zf-Di19: Drought indu 97.7 0.00011 2.4E-09 39.4 4.6 50 87-143 3-53 (54)
27 PF00096 zf-C2H2: Zinc finger, 97.7 4E-05 8.7E-10 33.5 2.4 22 118-141 1-22 (23)
28 KOG3993|consensus 97.6 1.5E-05 3.2E-10 59.1 0.6 27 115-143 456-482 (500)
29 PF13894 zf-C2H2_4: C2H2-type 97.6 7.6E-05 1.6E-09 32.7 2.6 23 118-142 1-23 (24)
30 PF05605 zf-Di19: Drought indu 97.5 0.00032 6.9E-09 37.7 4.4 49 56-111 2-52 (54)
31 PF13912 zf-C2H2_6: C2H2-type 97.4 7.9E-05 1.7E-09 34.0 1.4 25 117-143 1-25 (27)
32 PF12756 zf-C2H2_2: C2H2 type 97.3 0.00035 7.6E-09 42.0 3.3 74 58-143 1-74 (100)
33 PF13894 zf-C2H2_4: C2H2-type 97.3 0.00036 7.7E-09 30.5 2.5 22 57-80 1-22 (24)
34 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00035 7.6E-09 31.7 2.0 23 87-111 2-24 (27)
35 smart00355 ZnF_C2H2 zinc finge 96.9 0.0012 2.6E-08 29.1 2.6 23 118-142 1-23 (26)
36 PF09237 GAGA: GAGA factor; I 96.7 0.0026 5.7E-08 33.2 3.1 28 84-113 22-49 (54)
37 PRK04860 hypothetical protein; 96.7 0.0018 4E-08 42.7 2.9 37 86-129 119-155 (160)
38 smart00355 ZnF_C2H2 zinc finge 96.6 0.0026 5.7E-08 28.0 2.6 19 63-81 5-23 (26)
39 PF13909 zf-H2C2_5: C2H2-type 96.6 0.0023 5E-08 28.1 2.1 23 118-143 1-23 (24)
40 PF09237 GAGA: GAGA factor; I 96.4 0.0063 1.4E-07 31.8 3.3 33 52-86 20-52 (54)
41 PF12874 zf-met: Zinc-finger o 95.6 0.0078 1.7E-07 26.6 1.4 17 63-79 5-21 (25)
42 PF12874 zf-met: Zinc-finger o 95.5 0.014 3.1E-07 25.7 2.0 17 93-109 5-21 (25)
43 PF13909 zf-H2C2_5: C2H2-type 95.1 0.027 5.9E-07 24.5 2.2 22 57-81 1-22 (24)
44 COG5048 FOG: Zn-finger [Genera 94.4 0.0079 1.7E-07 45.1 -0.9 67 55-124 288-358 (467)
45 KOG4173|consensus 94.2 0.022 4.7E-07 38.8 1.1 85 55-146 78-173 (253)
46 PRK04860 hypothetical protein; 93.7 0.047 1E-06 36.1 1.9 39 56-102 119-157 (160)
47 PF12171 zf-C2H2_jaz: Zinc-fin 93.1 0.022 4.7E-07 25.7 -0.3 21 57-79 2-22 (27)
48 PF13913 zf-C2HC_2: zinc-finge 91.8 0.21 4.5E-06 22.1 2.1 20 57-79 3-22 (25)
49 smart00451 ZnF_U1 U1-like zinc 91.7 0.19 4E-06 23.9 2.0 22 117-140 3-24 (35)
50 COG5048 FOG: Zn-finger [Genera 90.5 0.05 1.1E-06 40.8 -1.1 57 85-146 288-350 (467)
51 KOG2893|consensus 90.2 0.056 1.2E-06 37.7 -0.9 33 93-131 15-47 (341)
52 KOG1146|consensus 89.7 0.12 2.7E-06 44.2 0.5 79 51-141 460-540 (1406)
53 PF12756 zf-C2H2_2: C2H2 type 88.0 0.6 1.3E-05 27.6 2.6 24 56-81 50-73 (100)
54 KOG1146|consensus 87.6 0.55 1.2E-05 40.5 2.9 22 117-140 589-610 (1406)
55 COG5189 SFP1 Putative transcri 87.3 0.38 8.3E-06 35.2 1.6 25 116-140 348-372 (423)
56 TIGR00373 conserved hypothetic 86.1 2.1 4.5E-05 28.3 4.6 45 82-138 105-149 (158)
57 PF02892 zf-BED: BED zinc fing 84.1 0.97 2.1E-05 22.8 1.9 26 115-140 14-41 (45)
58 PRK06266 transcription initiat 83.6 3.3 7.1E-05 27.9 4.7 43 83-137 114-156 (178)
59 COG2888 Predicted Zn-ribbon RN 78.3 3 6.5E-05 22.7 2.5 31 86-124 27-57 (61)
60 smart00614 ZnF_BED BED zinc fi 77.7 2.8 6E-05 21.7 2.3 22 119-140 20-44 (50)
61 COG4049 Uncharacterized protei 77.1 1.5 3.2E-05 23.5 1.0 26 82-109 13-38 (65)
62 PRK14890 putative Zn-ribbon RN 77.1 2.9 6.4E-05 22.7 2.2 30 86-124 25-55 (59)
63 cd00350 rubredoxin_like Rubred 76.3 1.8 3.9E-05 20.4 1.2 10 57-68 2-11 (33)
64 smart00531 TFIIE Transcription 76.2 9.9 0.00021 24.7 5.0 40 83-129 96-135 (147)
65 PF09538 FYDLN_acid: Protein o 73.7 1.8 4E-05 26.6 1.0 30 57-101 10-39 (108)
66 COG4049 Uncharacterized protei 73.4 2.3 4.9E-05 22.8 1.2 30 49-80 10-39 (65)
67 cd00729 rubredoxin_SM Rubredox 72.9 3.1 6.7E-05 19.8 1.5 9 116-124 17-25 (34)
68 PF13878 zf-C2H2_3: zinc-finge 72.8 5.3 0.00011 19.9 2.4 26 118-143 14-39 (41)
69 PF13451 zf-trcl: Probable zin 72.5 1.8 4E-05 22.6 0.7 20 115-134 2-21 (49)
70 COG0068 HypF Hydrogenase matur 71.9 0.48 1E-05 38.4 -2.2 61 58-130 125-186 (750)
71 KOG2186|consensus 71.3 2.7 6E-05 29.8 1.6 49 57-112 4-52 (276)
72 PF13719 zinc_ribbon_5: zinc-r 71.2 4.7 0.0001 19.5 2.0 9 117-125 25-33 (37)
73 TIGR02098 MJ0042_CXXC MJ0042 f 70.4 4.6 0.0001 19.4 1.9 7 63-69 7-13 (38)
74 KOG2785|consensus 68.5 7.2 0.00016 29.4 3.3 26 56-83 166-191 (390)
75 PF13717 zinc_ribbon_4: zinc-r 67.9 6.9 0.00015 18.8 2.2 8 63-70 7-14 (36)
76 KOG2231|consensus 66.5 8.2 0.00018 31.5 3.4 15 66-80 122-136 (669)
77 COG5236 Uncharacterized conser 64.6 5 0.00011 30.0 1.8 24 57-80 152-175 (493)
78 TIGR00373 conserved hypothetic 61.6 6.4 0.00014 25.9 1.8 35 52-99 105-139 (158)
79 PF12013 DUF3505: Protein of u 61.0 9.4 0.0002 23.3 2.4 31 47-82 74-108 (109)
80 KOG4173|consensus 60.5 3 6.4E-05 28.8 0.1 79 29-112 79-171 (253)
81 PRK06266 transcription initiat 60.3 5.3 0.00012 26.9 1.3 35 53-100 114-148 (178)
82 PF15269 zf-C2H2_7: Zinc-finge 60.2 8.5 0.00018 19.6 1.7 22 118-141 21-42 (54)
83 TIGR02300 FYDLN_acid conserved 59.6 5.8 0.00012 25.2 1.2 32 57-103 10-41 (129)
84 smart00531 TFIIE Transcription 59.2 13 0.00027 24.2 2.9 39 53-99 96-134 (147)
85 PF12013 DUF3505: Protein of u 59.1 11 0.00024 22.9 2.5 25 118-144 81-109 (109)
86 COG3364 Zn-ribbon containing p 58.8 7.8 0.00017 23.5 1.6 14 86-101 2-15 (112)
87 PRK00398 rpoP DNA-directed RNA 57.9 4.1 9E-05 20.7 0.3 8 118-125 22-29 (46)
88 COG1592 Rubrerythrin [Energy p 56.5 8.5 0.00018 25.7 1.7 10 115-124 147-156 (166)
89 PF05443 ROS_MUCR: ROS/MUCR tr 56.3 12 0.00027 23.9 2.4 29 53-86 69-97 (132)
90 smart00659 RPOLCX RNA polymera 55.4 13 0.00029 18.8 2.0 9 117-125 19-27 (44)
91 KOG2231|consensus 55.4 25 0.00054 28.9 4.4 21 88-110 184-204 (669)
92 KOG2593|consensus 54.6 18 0.00039 27.9 3.3 37 83-125 125-161 (436)
93 PF07754 DUF1610: Domain of un 54.5 7.6 0.00017 17.0 0.8 8 117-124 16-23 (24)
94 PF09845 DUF2072: Zn-ribbon co 54.4 10 0.00022 24.2 1.8 13 57-71 2-14 (131)
95 TIGR00622 ssl1 transcription f 53.8 17 0.00037 22.6 2.6 78 55-142 14-104 (112)
96 PF04959 ARS2: Arsenite-resist 53.1 12 0.00027 26.0 2.1 28 53-82 74-102 (214)
97 COG5236 Uncharacterized conser 52.6 29 0.00062 26.2 4.0 77 58-144 222-306 (493)
98 KOG4377|consensus 52.5 5.4 0.00012 30.4 0.3 76 52-130 267-360 (480)
99 KOG4167|consensus 50.7 3.9 8.5E-05 33.5 -0.6 25 56-82 792-816 (907)
100 PF09986 DUF2225: Uncharacteri 49.8 3.4 7.3E-05 28.7 -1.0 17 55-73 4-20 (214)
101 COG1996 RPC10 DNA-directed RNA 48.4 14 0.00031 19.3 1.4 11 86-98 6-16 (49)
102 COG1675 TFA1 Transcription ini 45.4 54 0.0012 22.2 4.2 42 81-134 108-149 (176)
103 COG4957 Predicted transcriptio 43.8 14 0.0003 23.7 1.1 25 57-86 77-101 (148)
104 PF04959 ARS2: Arsenite-resist 43.0 21 0.00046 24.9 2.0 25 116-142 76-100 (214)
105 PHA00626 hypothetical protein 41.6 38 0.00083 18.2 2.4 13 56-70 23-35 (59)
106 KOG2186|consensus 41.4 24 0.00051 25.3 2.1 46 87-140 4-49 (276)
107 KOG4167|consensus 41.1 6.9 0.00015 32.2 -0.6 26 86-113 792-817 (907)
108 PF11931 DUF3449: Domain of un 40.3 9.4 0.0002 26.2 0.0 23 115-138 99-121 (196)
109 smart00734 ZnF_Rad18 Rad18-lik 40.3 36 0.00078 15.0 1.9 19 58-79 3-21 (26)
110 COG5152 Uncharacterized conser 39.6 27 0.00058 24.1 2.0 41 93-140 213-255 (259)
111 PF11789 zf-Nse: Zinc-finger o 37.0 30 0.00066 18.5 1.7 34 85-125 23-56 (57)
112 PF14353 CpXC: CpXC protein 36.8 55 0.0012 20.4 3.1 9 58-68 3-11 (128)
113 TIGR02605 CxxC_CxxC_SSSS putat 35.7 8.7 0.00019 19.9 -0.6 11 57-69 6-16 (52)
114 KOG3408|consensus 34.8 29 0.00064 21.9 1.5 24 84-109 55-78 (129)
115 COG4530 Uncharacterized protei 34.2 39 0.00085 20.9 2.0 13 54-68 24-36 (129)
116 KOG2636|consensus 34.0 31 0.00066 26.9 1.8 28 109-138 394-421 (497)
117 KOG2593|consensus 33.9 37 0.0008 26.3 2.2 39 52-97 124-162 (436)
118 PF02176 zf-TRAF: TRAF-type zi 33.8 40 0.00086 17.7 1.9 21 101-122 23-43 (60)
119 KOG4124|consensus 33.2 16 0.00034 27.5 0.2 53 53-108 175-232 (442)
120 COG1198 PriA Primosomal protei 33.0 13 0.00029 30.8 -0.2 14 111-125 470-483 (730)
121 PF08790 zf-LYAR: LYAR-type C2 32.3 7.7 0.00017 17.7 -0.9 10 57-68 1-10 (28)
122 PF06397 Desulfoferrod_N: Desu 31.9 24 0.00052 17.1 0.7 10 116-125 5-14 (36)
123 KOG2785|consensus 29.4 73 0.0016 24.3 3.1 22 57-80 4-25 (390)
124 KOG3002|consensus 29.3 1.2E+02 0.0025 22.5 4.1 85 54-145 78-165 (299)
125 KOG2482|consensus 29.0 36 0.00079 25.6 1.5 23 56-80 195-217 (423)
126 COG1198 PriA Primosomal protei 27.9 57 0.0012 27.3 2.5 15 81-97 470-484 (730)
127 PF08274 PhnA_Zn_Ribbon: PhnA 27.6 34 0.00075 15.8 0.8 9 116-124 18-26 (30)
128 PF04423 Rad50_zn_hook: Rad50 26.9 49 0.0011 17.3 1.4 7 119-125 22-28 (54)
129 PF03604 DNA_RNApol_7kD: DNA d 26.7 40 0.00086 15.8 0.9 8 117-124 17-24 (32)
130 PF08209 Sgf11: Sgf11 (transcr 26.6 85 0.0018 14.8 2.3 22 117-141 4-25 (33)
131 smart00834 CxxC_CXXC_SSSS Puta 26.0 34 0.00074 16.4 0.7 12 57-70 6-17 (41)
132 PF10571 UPF0547: Uncharacteri 25.9 37 0.00081 15.0 0.7 11 57-69 15-25 (26)
133 PRK00464 nrdR transcriptional 25.7 9.2 0.0002 25.2 -1.8 16 57-74 29-44 (154)
134 KOG2071|consensus 25.2 49 0.0011 26.7 1.7 26 115-142 416-441 (579)
135 PF04780 DUF629: Protein of un 24.9 69 0.0015 25.3 2.4 26 56-83 57-83 (466)
136 KOG4727|consensus 23.8 49 0.0011 22.2 1.2 22 117-140 75-96 (193)
137 PRK09678 DNA-binding transcrip 23.1 27 0.00058 19.8 -0.0 6 93-98 34-39 (72)
138 COG3357 Predicted transcriptio 22.6 42 0.00091 20.0 0.7 14 55-70 57-70 (97)
139 KOG0801|consensus 22.3 19 0.00041 23.9 -0.9 20 102-125 127-146 (205)
140 KOG2482|consensus 21.8 58 0.0013 24.6 1.4 22 86-109 195-216 (423)
141 TIGR00143 hypF [NiFe] hydrogen 21.6 6.3 0.00014 32.5 -3.9 60 58-129 92-152 (711)
142 COG1773 Rubredoxin [Energy pro 21.2 51 0.0011 17.7 0.8 10 116-125 2-11 (55)
143 PF15135 UPF0515: Uncharacteri 21.1 60 0.0013 23.3 1.3 12 57-70 156-167 (278)
144 TIGR00319 desulf_FeS4 desulfof 21.0 57 0.0012 15.1 0.9 10 116-125 6-15 (34)
145 PF09723 Zn-ribbon_8: Zinc rib 21.0 47 0.001 16.4 0.6 29 57-96 6-34 (42)
146 PF03145 Sina: Seven in absent 20.9 85 0.0018 21.3 2.1 7 103-109 62-68 (198)
147 cd00974 DSRD Desulforedoxin (D 20.9 58 0.0012 15.2 0.9 10 116-125 3-12 (34)
148 PF07295 DUF1451: Protein of u 20.1 42 0.00092 21.9 0.4 13 118-131 131-143 (146)
No 1
>KOG2462|consensus
Probab=99.89 E-value=5.4e-24 Score=146.41 Aligned_cols=107 Identities=22% Similarity=0.448 Sum_probs=70.7
Q ss_pred HhhhhcCCCCcceeeccC---CCccC---CCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChH
Q psy11070 29 TEALQSLGINKEYFISST---GGKNS---KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMK 102 (148)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~---~~~~h---~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~ 102 (148)
...+.+..|++.|...++ |...| ...+.+.|. .|+|.|...-.|++|+++|+ .++.|. .||+.|.+++
T Consensus 128 ~~r~~c~eCgk~ysT~snLsrHkQ~H~~~~s~ka~~C~--~C~K~YvSmpALkMHirTH~--l~c~C~--iCGKaFSRPW 201 (279)
T KOG2462|consen 128 HPRYKCPECGKSYSTSSNLSRHKQTHRSLDSKKAFSCK--YCGKVYVSMPALKMHIRTHT--LPCECG--ICGKAFSRPW 201 (279)
T ss_pred CCceeccccccccccccccchhhcccccccccccccCC--CCCceeeehHHHhhHhhccC--CCcccc--cccccccchH
Confidence 344566677777777665 33333 124556677 77777777777777777765 456677 6777777777
Q ss_pred HHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHHHHHhhCC
Q psy11070 103 QYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVKAVANNSK 144 (148)
Q Consensus 103 ~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~h~~ 144 (148)
.|+-|+|+|+| ||||.|..|+| +|..+++|+-|+.+|+.
T Consensus 202 LLQGHiRTHTG-EKPF~C~hC~k--AFADRSNLRAHmQTHS~ 240 (279)
T KOG2462|consen 202 LLQGHIRTHTG-EKPFSCPHCGK--AFADRSNLRAHMQTHSD 240 (279)
T ss_pred HhhcccccccC-CCCccCCcccc--hhcchHHHHHHHHhhcC
Confidence 77777777777 67777777776 47777777777777654
No 2
>KOG2462|consensus
Probab=99.89 E-value=3.1e-23 Score=142.68 Aligned_cols=104 Identities=22% Similarity=0.337 Sum_probs=95.3
Q ss_pred HhhhhcCCCCcceeeccC---CCccCCCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHH
Q psy11070 29 TEALQSLGINKEYFISST---GGKNSKKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYS 105 (148)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~---~~~~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~ 105 (148)
.....+..|++.|-.... |+++|. -+++|. +|||.|.+.=-|+.|+|+|+|||||.|. .|+++|...++|+
T Consensus 159 ~ka~~C~~C~K~YvSmpALkMHirTH~--l~c~C~--iCGKaFSRPWLLQGHiRTHTGEKPF~C~--hC~kAFADRSNLR 232 (279)
T KOG2462|consen 159 KKAFSCKYCGKVYVSMPALKMHIRTHT--LPCECG--ICGKAFSRPWLLQGHIRTHTGEKPFSCP--HCGKAFADRSNLR 232 (279)
T ss_pred cccccCCCCCceeeehHHHhhHhhccC--CCcccc--cccccccchHHhhcccccccCCCCccCC--cccchhcchHHHH
Confidence 455678889999988866 888887 568888 9999999999999999999999999999 9999999999999
Q ss_pred HHHHHhcCCCCceeCCCCCCCcccccHHHHHHHHHh
Q psy11070 106 THLKEHSNVSAPYMCDYKGVCMSLTLKISFVKAVAN 141 (148)
Q Consensus 106 ~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~ 141 (148)
.|+++|.+ .|+|.|..|+| +|...+.|.+|...
T Consensus 233 AHmQTHS~-~K~~qC~~C~K--sFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 233 AHMQTHSD-VKKHQCPRCGK--SFALKSYLNKHSES 265 (279)
T ss_pred HHHHhhcC-CccccCcchhh--HHHHHHHHHHhhhh
Confidence 99999999 89999999998 79999999999764
No 3
>KOG1074|consensus
Probab=99.63 E-value=3.2e-17 Score=126.90 Aligned_cols=56 Identities=18% Similarity=0.279 Sum_probs=51.6
Q ss_pred CceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHHHHHhhCCC
Q psy11070 85 LTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVKAVANNSKY 145 (148)
Q Consensus 85 ~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~h~~~ 145 (148)
.|-+|. .|.++...+.+|+.|.|+|+| ||||+|++||+ +|+.+.+|+.|+.+|.-.
T Consensus 604 dPNqCi--iC~rVlSC~saLqmHyrtHtG-ERPFkCKiCgR--AFtTkGNLkaH~~vHka~ 659 (958)
T KOG1074|consen 604 DPNQCI--ICLRVLSCPSALQMHYRTHTG-ERPFKCKICGR--AFTTKGNLKAHMSVHKAK 659 (958)
T ss_pred Ccccee--eeeecccchhhhhhhhhcccC-cCccccccccc--hhccccchhhcccccccC
Confidence 457799 999999999999999999999 99999999999 599999999999998743
No 4
>KOG3576|consensus
Probab=99.63 E-value=2.3e-16 Score=104.62 Aligned_cols=89 Identities=22% Similarity=0.428 Sum_probs=81.5
Q ss_pred cCCCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccc
Q psy11070 50 NSKKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSL 129 (148)
Q Consensus 50 ~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f 129 (148)
..++...|.|. .|+|.|.....|.+|++-|...+.+.|. .||+.|..--.|++|+|+|+| .+||+|..|+| +|
T Consensus 111 sssd~d~ftCr--vCgK~F~lQRmlnrh~kch~~vkr~lct--~cgkgfndtfdlkrh~rthtg-vrpykc~~c~k--af 183 (267)
T KOG3576|consen 111 SSSDQDSFTCR--VCGKKFGLQRMLNRHLKCHSDVKRHLCT--FCGKGFNDTFDLKRHTRTHTG-VRPYKCSLCEK--AF 183 (267)
T ss_pred CCCCCCeeeee--hhhhhhhHHHHHHHHhhhccHHHHHHHh--hccCcccchhhhhhhhccccC-ccccchhhhhH--HH
Confidence 34556679999 9999999999999999999999999999 999999999999999999999 89999999998 69
Q ss_pred ccHHHHHHHHHhhCCC
Q psy11070 130 TLKISFVKAVANNSKY 145 (148)
Q Consensus 130 ~~~~~l~~h~~~h~~~ 145 (148)
+++-.|..|++.-++.
T Consensus 184 tqrcsleshl~kvhgv 199 (267)
T KOG3576|consen 184 TQRCSLESHLKKVHGV 199 (267)
T ss_pred HhhccHHHHHHHHcCc
Confidence 9999999999877664
No 5
>KOG3623|consensus
Probab=99.60 E-value=1.6e-16 Score=121.70 Aligned_cols=81 Identities=22% Similarity=0.467 Sum_probs=76.9
Q ss_pred CCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHH
Q psy11070 54 QQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKI 133 (148)
Q Consensus 54 ~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~ 133 (148)
+.+|.|. .|+|.|...+.|.+|.-.|+|.+||+|. +|.++|+.+-.|..|.|.|.| +|||.|+.|+| +|....
T Consensus 892 ~gmyaCD--qCDK~FqKqSSLaRHKYEHsGqRPyqC~--iCkKAFKHKHHLtEHkRLHSG-EKPfQCdKClK--RFSHSG 964 (1007)
T KOG3623|consen 892 DGMYACD--QCDKAFQKQSSLARHKYEHSGQRPYQCI--ICKKAFKHKHHLTEHKRLHSG-EKPFQCDKCLK--RFSHSG 964 (1007)
T ss_pred cccchHH--HHHHHHHhhHHHHHhhhhhcCCCCcccc--hhhHhhhhhhhhhhhhhhccC-CCcchhhhhhh--hccccc
Confidence 4579999 9999999999999999999999999999 999999999999999999999 99999999998 799999
Q ss_pred HHHHHHHh
Q psy11070 134 SFVKAVAN 141 (148)
Q Consensus 134 ~l~~h~~~ 141 (148)
.+..||..
T Consensus 965 SYSQHMNH 972 (1007)
T KOG3623|consen 965 SYSQHMNH 972 (1007)
T ss_pred chHhhhcc
Confidence 99999864
No 6
>KOG3623|consensus
Probab=99.49 E-value=3.4e-15 Score=114.56 Aligned_cols=112 Identities=22% Similarity=0.375 Sum_probs=88.9
Q ss_pred chhHHHHhhhhcCCCCcceeeccC---CCcc--CCCCCceeCCCcccccccCChHHHHHHHHHhcC-------------C
Q psy11070 23 DEEDAITEALQSLGINKEYFISST---GGKN--SKKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGK-------------P 84 (148)
Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~--h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~-------------~ 84 (148)
...+.......+..|.+-+.+... |++. ...+..|.|. .|.++|.+.+.|.+||.+|.. -
T Consensus 202 gtpdAfsqlltcpycdrgykrltslkeHikyrhekne~nfsC~--lCsytFAyRtQLErhm~~hkpg~dqa~sltqsa~l 279 (1007)
T KOG3623|consen 202 GTPDAFSQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEPNFSCM--LCSYTFAYRTQLERHMQLHKPGGDQAISLTQSALL 279 (1007)
T ss_pred CCcchhhhhhcchhHHHHHHHHHHHHHHHHHHHhhCCCCCcch--hhhhhhhhHHHHHHHHHhhcCCCcccccccchhhh
Confidence 334444445566677776665543 4433 2345569999 999999999999999999853 2
Q ss_pred CceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHHHHHh
Q psy11070 85 LTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVKAVAN 141 (148)
Q Consensus 85 ~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~ 141 (148)
+.|+|. .|+++|+.+-.|+.|+|+|.| +|||.|.-|+| +|.....+..|+..
T Consensus 280 RKFKCt--ECgKAFKfKHHLKEHlRIHSG-EKPfeCpnCkK--RFSHSGSySSHmSS 331 (1007)
T KOG3623|consen 280 RKFKCT--ECGKAFKFKHHLKEHLRIHSG-EKPFECPNCKK--RFSHSGSYSSHMSS 331 (1007)
T ss_pred cccccc--ccchhhhhHHHHHhhheeecC-CCCcCCccccc--ccccCCcccccccc
Confidence 569999 999999999999999999999 99999999998 79888888888753
No 7
>KOG3576|consensus
Probab=99.48 E-value=6.2e-15 Score=97.90 Aligned_cols=106 Identities=19% Similarity=0.255 Sum_probs=88.0
Q ss_pred hhcCCCCcceeeccC---CCccCCCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHH
Q psy11070 32 LQSLGINKEYFISST---GGKNSKKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHL 108 (148)
Q Consensus 32 ~~~~~~~~~~~~~~~---~~~~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~ 108 (148)
+.+.-|++.|....- |++.|+..|.+.|. .||+.|.....|++|+++|+|.+||.|. .|+++|.+.-.|-.|.
T Consensus 118 ftCrvCgK~F~lQRmlnrh~kch~~vkr~lct--~cgkgfndtfdlkrh~rthtgvrpykc~--~c~kaftqrcsleshl 193 (267)
T KOG3576|consen 118 FTCRVCGKKFGLQRMLNRHLKCHSDVKRHLCT--FCGKGFNDTFDLKRHTRTHTGVRPYKCS--LCEKAFTQRCSLESHL 193 (267)
T ss_pred eeeehhhhhhhHHHHHHHHhhhccHHHHHHHh--hccCcccchhhhhhhhccccCccccchh--hhhHHHHhhccHHHHH
Confidence 344557777766543 89999999999999 9999999999999999999999999999 9999999999999998
Q ss_pred H-HhcC---------CCCceeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 109 K-EHSN---------VSAPYMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 109 ~-~h~~---------~~k~~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
+ +|.. .++.|.|+.||. .-.....+..|++.|+
T Consensus 194 ~kvhgv~~~yaykerr~kl~vcedcg~--t~~~~e~~~~h~~~~h 236 (267)
T KOG3576|consen 194 KKVHGVQHQYAYKERRAKLYVCEDCGY--TSERPEVYYLHLKLHH 236 (267)
T ss_pred HHHcCchHHHHHHHhhhheeeecccCC--CCCChhHHHHHHHhcC
Confidence 6 4442 157799999996 3555566778887765
No 8
>KOG1074|consensus
Probab=99.25 E-value=8.7e-13 Score=102.83 Aligned_cols=85 Identities=20% Similarity=0.343 Sum_probs=77.0
Q ss_pred CceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCCC----CceeCC---CCCCCc
Q psy11070 55 QLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVS----APYMCD---YKGVCM 127 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~----k~~~C~---~c~~~~ 127 (148)
.|..|. +|.+.+.-...|++|+++|+|++||+|+ +||++|.++.+|+.|+.+|.. . -+|.|+ +|-+
T Consensus 604 dPNqCi--iC~rVlSC~saLqmHyrtHtGERPFkCK--iCgRAFtTkGNLkaH~~vHka-~p~~R~q~ScP~~~ic~~-- 676 (958)
T KOG1074|consen 604 DPNQCI--ICLRVLSCPSALQMHYRTHTGERPFKCK--ICGRAFTTKGNLKAHMSVHKA-KPPARVQFSCPSTFICQK-- 676 (958)
T ss_pred Ccccee--eeeecccchhhhhhhhhcccCcCccccc--cccchhccccchhhccccccc-CccccccccCCchhhhcc--
Confidence 468899 9999999999999999999999999999 999999999999999999986 3 247899 9987
Q ss_pred ccccHHHHHHHHHhhCCCC
Q psy11070 128 SLTLKISFVKAVANNSKYP 146 (148)
Q Consensus 128 ~f~~~~~l~~h~~~h~~~~ 146 (148)
.|.....|..|+++|.+-.
T Consensus 677 kftn~V~lpQhIriH~~~~ 695 (958)
T KOG1074|consen 677 KFTNAVTLPQHIRIHLGGQ 695 (958)
T ss_pred cccccccccceEEeecCCC
Confidence 6999999999999988544
No 9
>KOG3608|consensus
Probab=99.24 E-value=4.1e-12 Score=90.87 Aligned_cols=105 Identities=23% Similarity=0.339 Sum_probs=74.6
Q ss_pred hhcCCCCcceeeccC---CCccCCCCCceeCCCcccccccCChHHHHHH------------------------------H
Q psy11070 32 LQSLGINKEYFISST---GGKNSKKQQLWSCPIKNCNKRFTTLSNLKMH------------------------------M 78 (148)
Q Consensus 32 ~~~~~~~~~~~~~~~---~~~~h~~~k~~~C~~~~C~~~f~~~~~l~~h------------------------------~ 78 (148)
+.+..|-+.+..+.. |++.|+++|...|+ .||.-|.+.+.|-.| +
T Consensus 180 C~W~~Ct~~~~~k~~LreH~r~Hs~eKvvACp--~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~~Hv 257 (467)
T KOG3608|consen 180 CNWAMCTKHMGNKYRLREHIRTHSNEKVVACP--HCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLKSHV 257 (467)
T ss_pred ccchhhhhhhccHHHHHHHHHhcCCCeEEecc--hHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHHHHH
Confidence 455556555555543 67777777655555 555555555544444 4
Q ss_pred HHhcCCCceecCcCCcccccCChHHHHHHHH-HhcCCCCceeCCCCCCCcccccHHHHHHHHHhhCCC
Q psy11070 79 VRHGKPLTLCCEELGCGRKFQTMKQYSTHLK-EHSNVSAPYMCDYKGVCMSLTLKISFVKAVANNSKY 145 (148)
Q Consensus 79 ~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~-~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~h~~~ 145 (148)
+.|. ..|.|+ .|.......+.|..|++ .|.. +|||+|+.|++ +|.+.+.|.+|...|++.
T Consensus 258 ~rHv--n~ykCp--lCdmtc~~~ssL~~H~r~rHs~-dkpfKCd~Cd~--~c~~esdL~kH~~~HS~~ 318 (467)
T KOG3608|consen 258 VRHV--NCYKCP--LCDMTCSSASSLTTHIRYRHSK-DKPFKCDECDT--RCVRESDLAKHVQVHSKT 318 (467)
T ss_pred HHhh--hccccc--ccccCCCChHHHHHHHHhhhcc-CCCccccchhh--hhccHHHHHHHHHhcccc
Confidence 4443 357777 78888888888888887 4777 89999999997 799999999999999854
No 10
>PHA00733 hypothetical protein
Probab=99.14 E-value=6.5e-11 Score=75.06 Aligned_cols=86 Identities=13% Similarity=0.109 Sum_probs=71.5
Q ss_pred CCCceeCCCcccccccCChHHHHHH--HH---HhcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCc
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMH--MV---RHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCM 127 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h--~~---~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~ 127 (148)
.++++.|. .|.+.|.....|..+ +. .+.+.+||.|. .|++.|.....|..|++.|. .+|.|..|++
T Consensus 37 ~~~~~~~~--~~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~--~Cgk~Fss~s~L~~H~r~h~---~~~~C~~CgK-- 107 (128)
T PHA00733 37 EQKRLIRA--VVKTLIYNPQLLDESSYLYKLLTSKAVSPYVCP--LCLMPFSSSVSLKQHIRYTE---HSKVCPVCGK-- 107 (128)
T ss_pred hhhhHHHH--HHhhhccChhhhcchHHHHhhcccCCCCCccCC--CCCCcCCCHHHHHHHHhcCC---cCccCCCCCC--
Confidence 35779999 999888877766655 22 23457899999 99999999999999999763 5799999998
Q ss_pred ccccHHHHHHHHHhhCCCCC
Q psy11070 128 SLTLKISFVKAVANNSKYPV 147 (148)
Q Consensus 128 ~f~~~~~l~~h~~~h~~~~~ 147 (148)
.|.....|..|+...+++=|
T Consensus 108 ~F~~~~sL~~H~~~~h~~~~ 127 (128)
T PHA00733 108 EFRNTDSTLDHVCKKHNICV 127 (128)
T ss_pred ccCCHHHHHHHHHHhcCccc
Confidence 69999999999999888755
No 11
>PHA02768 hypothetical protein; Provisional
Probab=98.92 E-value=6.1e-10 Score=59.35 Aligned_cols=40 Identities=20% Similarity=0.475 Sum_probs=18.8
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMK 102 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~ 102 (148)
|.|+ .||+.|...++|..||+.|+ +++.|. .|++.|.+.+
T Consensus 6 y~C~--~CGK~Fs~~~~L~~H~r~H~--k~~kc~--~C~k~f~~~s 45 (55)
T PHA02768 6 YECP--ICGEIYIKRKSMITHLRKHN--TNLKLS--NCKRISLRTG 45 (55)
T ss_pred cCcc--hhCCeeccHHHHHHHHHhcC--CcccCC--cccceecccc
Confidence 3444 44444444444444444444 344444 4444444443
No 12
>KOG3608|consensus
Probab=98.90 E-value=4.4e-09 Score=75.63 Aligned_cols=81 Identities=20% Similarity=0.441 Sum_probs=48.4
Q ss_pred eeCCCcccccccCChHHHHHHHHH-hcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVR-HGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISF 135 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~-h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l 135 (148)
|+|+ .|+-+....+.|..||+. |+..+||+|. .|...|.+.+.|..|..+|.. -.|.|...++.++|+..-++
T Consensus 264 ykCp--lCdmtc~~~ssL~~H~r~rHs~dkpfKCd--~Cd~~c~~esdL~kH~~~HS~--~~y~C~h~~C~~s~r~~~q~ 337 (467)
T KOG3608|consen 264 YKCP--LCDMTCSSASSLTTHIRYRHSKDKPFKCD--ECDTRCVRESDLAKHVQVHSK--TVYQCEHPDCHYSVRTYTQM 337 (467)
T ss_pred cccc--ccccCCCChHHHHHHHHhhhccCCCcccc--chhhhhccHHHHHHHHHhccc--cceecCCCCCcHHHHHHHHH
Confidence 5566 666666666666666654 5566666666 666666666666666666653 45666664444455566666
Q ss_pred HHHHHhhC
Q psy11070 136 VKAVANNS 143 (148)
Q Consensus 136 ~~h~~~h~ 143 (148)
.+|.+-++
T Consensus 338 ~~H~~evh 345 (467)
T KOG3608|consen 338 RRHFLEVH 345 (467)
T ss_pred HHHHHHhc
Confidence 66655443
No 13
>PHA02768 hypothetical protein; Provisional
Probab=98.84 E-value=1.7e-09 Score=57.60 Aligned_cols=43 Identities=19% Similarity=0.223 Sum_probs=37.3
Q ss_pred ceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHH
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISF 135 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l 135 (148)
.|.|+ .||+.|.+.++|..|+++|+ ++|+|..|++ .|.+.+.|
T Consensus 5 ~y~C~--~CGK~Fs~~~~L~~H~r~H~---k~~kc~~C~k--~f~~~s~l 47 (55)
T PHA02768 5 GYECP--ICGEIYIKRKSMITHLRKHN---TNLKLSNCKR--ISLRTGEY 47 (55)
T ss_pred ccCcc--hhCCeeccHHHHHHHHHhcC---CcccCCcccc--eeccccee
Confidence 47899 99999999999999999996 5799999998 57666554
No 14
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.82 E-value=2.7e-09 Score=48.79 Aligned_cols=22 Identities=32% Similarity=0.819 Sum_probs=16.2
Q ss_pred HHHHHHHHhcCCCCceeCCCCCC
Q psy11070 103 QYSTHLKEHSNVSAPYMCDYKGV 125 (148)
Q Consensus 103 ~l~~h~~~h~~~~k~~~C~~c~~ 125 (148)
+|.+|+++|+| ++||.|+.|++
T Consensus 1 ~l~~H~~~H~~-~k~~~C~~C~k 22 (26)
T PF13465_consen 1 NLRRHMRTHTG-EKPYKCPYCGK 22 (26)
T ss_dssp HHHHHHHHHSS-SSSEEESSSSE
T ss_pred CHHHHhhhcCC-CCCCCCCCCcC
Confidence 46677777777 77777777776
No 15
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.80 E-value=2.4e-08 Score=76.89 Aligned_cols=82 Identities=16% Similarity=0.272 Sum_probs=65.5
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccc--
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLT-- 130 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~-- 130 (148)
-++.+.|. .|++.|. ...|..|+..++ +++.|+ |+..+ ....|..|+.+|.+ .+++.|..|+.. |.
T Consensus 450 l~~H~~C~--~Cgk~f~-~s~LekH~~~~H--kpv~Cp---Cg~~~-~R~~L~~H~~thCp-~Kpi~C~fC~~~--v~~g 517 (567)
T PLN03086 450 AKNHVHCE--KCGQAFQ-QGEMEKHMKVFH--EPLQCP---CGVVL-EKEQMVQHQASTCP-LRLITCRFCGDM--VQAG 517 (567)
T ss_pred cccCccCC--CCCCccc-hHHHHHHHHhcC--CCccCC---CCCCc-chhHHHhhhhccCC-CCceeCCCCCCc--cccC
Confidence 34567899 9999996 677999999874 788997 99755 56899999999999 899999999974 43
Q ss_pred --------cHHHHHHHHHhhCCCC
Q psy11070 131 --------LKISFVKAVANNSKYP 146 (148)
Q Consensus 131 --------~~~~l~~h~~~h~~~~ 146 (148)
..+.|..|...+...+
T Consensus 518 ~~~~d~~d~~s~Lt~HE~~CG~rt 541 (567)
T PLN03086 518 GSAMDVRDRLRGMSEHESICGSRT 541 (567)
T ss_pred ccccchhhhhhhHHHHHHhcCCcc
Confidence 2458999998876554
No 16
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.67 E-value=2.4e-08 Score=45.60 Aligned_cols=26 Identities=35% Similarity=0.654 Sum_probs=23.5
Q ss_pred HHHHHHHHhcCCCceecCcCCcccccCC
Q psy11070 73 NLKMHMVRHGKPLTLCCEELGCGRKFQT 100 (148)
Q Consensus 73 ~l~~h~~~h~~~~~~~c~~~~C~~~f~~ 100 (148)
+|..||+.|++++||.|+ .|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~--~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCP--YCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEES--SSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCC--CCcCeeCc
Confidence 488999999999999999 99999863
No 17
>PHA00733 hypothetical protein
Probab=98.61 E-value=1.1e-07 Score=60.34 Aligned_cols=56 Identities=25% Similarity=0.508 Sum_probs=49.6
Q ss_pred CCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcC
Q psy11070 52 KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSN 113 (148)
Q Consensus 52 ~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~ 113 (148)
.+.++|.|. .|++.|.....|..|++.| ..++.|. .|++.|.....|..|+....+
T Consensus 69 ~~~kPy~C~--~Cgk~Fss~s~L~~H~r~h--~~~~~C~--~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 69 KAVSPYVCP--LCLMPFSSSVSLKQHIRYT--EHSKVCP--VCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred CCCCCccCC--CCCCcCCCHHHHHHHHhcC--CcCccCC--CCCCccCCHHHHHHHHHHhcC
Confidence 347899999 9999999999999999976 3579999 999999999999999976544
No 18
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.59 E-value=1e-07 Score=73.45 Aligned_cols=85 Identities=21% Similarity=0.266 Sum_probs=66.4
Q ss_pred CCccCCCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccC----------ChHHHHHHHHHhcCCCC
Q psy11070 47 GGKNSKKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQ----------TMKQYSTHLKEHSNVSA 116 (148)
Q Consensus 47 ~~~~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~----------~~~~l~~h~~~h~~~~k 116 (148)
|...+. +++.|+ |++.+ ....|..|+..|.+.+++.|. .|+..|. ....|..|..+. | .+
T Consensus 471 H~~~~H--kpv~Cp---Cg~~~-~R~~L~~H~~thCp~Kpi~C~--fC~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G-~r 540 (567)
T PLN03086 471 HMKVFH--EPLQCP---CGVVL-EKEQMVQHQASTCPLRLITCR--FCGDMVQAGGSAMDVRDRLRGMSEHESIC-G-SR 540 (567)
T ss_pred HHHhcC--CCccCC---CCCCc-chhHHHhhhhccCCCCceeCC--CCCCccccCccccchhhhhhhHHHHHHhc-C-Cc
Confidence 444442 678887 99765 568899999999999999999 9999885 235899999986 6 79
Q ss_pred ceeCCCCCCCcccccHHHHHHHHHhhCC
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVANNSK 144 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~~h~~ 144 (148)
++.|..||+ .|..+ .+..|+-..+.
T Consensus 541 t~~C~~Cgk--~Vrlr-dm~~H~~~~h~ 565 (567)
T PLN03086 541 TAPCDSCGR--SVMLK-EMDIHQIAVHQ 565 (567)
T ss_pred ceEccccCC--eeeeh-hHHHHHHHhhc
Confidence 999999998 45444 67788765543
No 19
>KOG3993|consensus
Probab=98.35 E-value=1.1e-07 Score=69.93 Aligned_cols=86 Identities=16% Similarity=0.325 Sum_probs=70.0
Q ss_pred ceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCC---------------------
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNV--------------------- 114 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~--------------------- 114 (148)
.|.|. .|...|.....|..|...-.-...|.|+ .|+++|+-+.+|..|.|=|.-.
T Consensus 267 dyiCq--LCK~kYeD~F~LAQHrC~RIV~vEYrCP--EC~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~ 342 (500)
T KOG3993|consen 267 DYICQ--LCKEKYEDAFALAQHRCPRIVHVEYRCP--ECDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEV 342 (500)
T ss_pred HHHHH--HHHHhhhhHHHHhhccCCeeEEeeecCC--cccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhh
Confidence 49999 9999999999998887643334468999 9999999999999998855410
Q ss_pred -----------CCceeCCCCCCCcccccHHHHHHHHHhhCCCCC
Q psy11070 115 -----------SAPYMCDYKGVCMSLTLKISFVKAVANNSKYPV 147 (148)
Q Consensus 115 -----------~k~~~C~~c~~~~~f~~~~~l~~h~~~h~~~~~ 147 (148)
+--|.|..|+| .|++...|++|+-+|.+.+.
T Consensus 343 ~ea~rsg~dss~gi~~C~~C~K--kFrRqAYLrKHqlthq~~~~ 384 (500)
T KOG3993|consen 343 QEAERSGDDSSSGIFSCHTCGK--KFRRQAYLRKHQLTHQRAPL 384 (500)
T ss_pred hhccccCCcccCceeecHHhhh--hhHHHHHHHHhHHhhhcccc
Confidence 12489999998 79999999999999887543
No 20
>PHA00616 hypothetical protein
Probab=98.33 E-value=2.7e-07 Score=46.92 Aligned_cols=31 Identities=23% Similarity=0.359 Sum_probs=16.4
Q ss_pred eecCcCCcccccCChHHHHHHHHHhcCCCCceeC
Q psy11070 87 LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMC 120 (148)
Q Consensus 87 ~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C 120 (148)
|+|+ .||+.|...+.|..|++.|+| ++++.|
T Consensus 2 YqC~--~CG~~F~~~s~l~~H~r~~hg-~~~~~~ 32 (44)
T PHA00616 2 YQCL--RCGGIFRKKKEVIEHLLSVHK-QNKLTL 32 (44)
T ss_pred Cccc--hhhHHHhhHHHHHHHHHHhcC-CCccce
Confidence 4555 555555555555555555555 455444
No 21
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=98.29 E-value=3.2e-07 Score=65.38 Aligned_cols=73 Identities=19% Similarity=0.347 Sum_probs=50.6
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHH-hcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccccc
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVR-HGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTL 131 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~-h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~ 131 (148)
++|||+|+.+.|.|.|+....|+.|+.- |...+...-+.+.--..| -.. +|||.|++|+| +++.
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F------------~~~-~KPYrCevC~K--RYKN 410 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIF------------SAK-DKPYRCEVCDK--RYKN 410 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccc------------ccc-CCceeccccch--hhcc
Confidence 3599999999999999999999999875 543333332210100112 233 79999999998 6888
Q ss_pred HHHHHHHHH
Q psy11070 132 KISFVKAVA 140 (148)
Q Consensus 132 ~~~l~~h~~ 140 (148)
.-.|+-|+.
T Consensus 411 lNGLKYHr~ 419 (423)
T COG5189 411 LNGLKYHRK 419 (423)
T ss_pred Cccceeccc
Confidence 878877753
No 22
>PHA00616 hypothetical protein
Probab=98.27 E-value=4.8e-07 Score=46.01 Aligned_cols=33 Identities=21% Similarity=0.417 Sum_probs=31.0
Q ss_pred ceeCCCcccccccCChHHHHHHHHHhcCCCceecC
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCE 90 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~ 90 (148)
||.|+ .||+.|.....|..|++.|+|++++.|.
T Consensus 1 pYqC~--~CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCL--RCGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccc--hhhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 58899 9999999999999999999999998876
No 23
>PHA00732 hypothetical protein
Probab=98.07 E-value=3.6e-06 Score=48.84 Aligned_cols=48 Identities=27% Similarity=0.586 Sum_probs=39.0
Q ss_pred ceeCCCcccccccCChHHHHHHHHH-hcCCCceecCcCCcccccCChHHHHHHHHHhcC
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVR-HGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSN 113 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~-h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~ 113 (148)
||.|. .|++.|.....|..|++. |.+ +.|. .|++.|. .|..|..+..+
T Consensus 1 py~C~--~Cgk~F~s~s~Lk~H~r~~H~~---~~C~--~CgKsF~---~l~~H~~~~~~ 49 (79)
T PHA00732 1 MFKCP--ICGFTTVTLFALKQHARRNHTL---TKCP--VCNKSYR---RLNQHFYSQYD 49 (79)
T ss_pred CccCC--CCCCccCCHHHHHHHhhcccCC---CccC--CCCCEeC---ChhhhhcccCC
Confidence 57888 999999999999999984 653 4798 9999998 47788876654
No 24
>PHA00732 hypothetical protein
Probab=98.06 E-value=4e-06 Score=48.63 Aligned_cols=46 Identities=17% Similarity=0.257 Sum_probs=37.1
Q ss_pred ceecCcCCcccccCChHHHHHHHH-HhcCCCCceeCCCCCCCcccccHHHHHHHHHhh
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLK-EHSNVSAPYMCDYKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~-~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~h 142 (148)
||.|. .|++.|.+...|..|++ .|.+ +.|+.||+ .|. .|..|....
T Consensus 1 py~C~--~Cgk~F~s~s~Lk~H~r~~H~~----~~C~~CgK--sF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCP--ICGFTTVTLFALKQHARRNHTL----TKCPVCNK--SYR---RLNQHFYSQ 47 (79)
T ss_pred CccCC--CCCCccCCHHHHHHHhhcccCC----CccCCCCC--EeC---Chhhhhccc
Confidence 57899 99999999999999998 4655 58999998 465 477777543
No 25
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.81 E-value=2e-05 Score=34.59 Aligned_cols=22 Identities=55% Similarity=1.026 Sum_probs=11.8
Q ss_pred eeCCCcccccccCChHHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
|.|+ .|++.|.....|..|++.
T Consensus 1 y~C~--~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCP--ICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp EEET--TTTEEESSHHHHHHHHHH
T ss_pred CCCC--CCCCccCCHHHHHHHHhH
Confidence 3455 555555555555555544
No 26
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.71 E-value=0.00011 Score=39.45 Aligned_cols=50 Identities=18% Similarity=0.358 Sum_probs=25.2
Q ss_pred eecCcCCcccccCChHHHHHHHH-HhcCCCCceeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 87 LCCEELGCGRKFQTMKQYSTHLK-EHSNVSAPYMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 87 ~~c~~~~C~~~f~~~~~l~~h~~-~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
|.|+ .|++ ..+...|..|.. .|..+.+.+.|++|.. .+. .+|..|+..++
T Consensus 3 f~CP--~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~~--~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 3 FTCP--YCGK-GFSESSLVEHCEDEHRSESKNVVCPICSS--RVT--DNLIRHLNSQH 53 (54)
T ss_pred cCCC--CCCC-ccCHHHHHHHHHhHCcCCCCCccCCCchh--hhh--hHHHHHHHHhc
Confidence 4566 5666 344455666643 3443123466666664 222 25666665544
No 27
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.71 E-value=4e-05 Score=33.55 Aligned_cols=22 Identities=23% Similarity=0.425 Sum_probs=11.9
Q ss_pred eeCCCCCCCcccccHHHHHHHHHh
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVAN 141 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~ 141 (148)
|.|+.|++ .|.....|..|++.
T Consensus 1 y~C~~C~~--~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 1 YKCPICGK--SFSSKSNLKRHMRR 22 (23)
T ss_dssp EEETTTTE--EESSHHHHHHHHHH
T ss_pred CCCCCCCC--ccCCHHHHHHHHhH
Confidence 45555554 45555555555554
No 28
>KOG3993|consensus
Probab=97.64 E-value=1.5e-05 Score=59.09 Aligned_cols=27 Identities=7% Similarity=0.265 Sum_probs=19.0
Q ss_pred CCceeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 115 SAPYMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 115 ~k~~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
+.-|.|..|-- .|.....|.+|+..++
T Consensus 456 ~q~f~~ky~~a--tfyss~~ltrhin~~H 482 (500)
T KOG3993|consen 456 EQGFTCKYCPA--TFYSSPGLTRHINKCH 482 (500)
T ss_pred hhccccccchH--hhhcCcchHhHhhhcC
Confidence 34577888875 5777778888876544
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.61 E-value=7.6e-05 Score=32.72 Aligned_cols=23 Identities=17% Similarity=0.338 Sum_probs=13.3
Q ss_pred eeCCCCCCCcccccHHHHHHHHHhh
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~h 142 (148)
|.|++|++ .|.+...|..|+..|
T Consensus 1 ~~C~~C~~--~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGK--SFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS---EESSHHHHHHHHHHH
T ss_pred CCCcCCCC--cCCcHHHHHHHHHhh
Confidence 45666665 566666666666655
No 30
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.49 E-value=0.00032 Score=37.66 Aligned_cols=49 Identities=24% Similarity=0.481 Sum_probs=38.4
Q ss_pred ceeCCCcccccccCChHHHHHHHHH-hcCC-CceecCcCCcccccCChHHHHHHHHHh
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVR-HGKP-LTLCCEELGCGRKFQTMKQYSTHLKEH 111 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~-h~~~-~~~~c~~~~C~~~f~~~~~l~~h~~~h 111 (148)
.|.|+ .|++ ..+...|..|... |..+ +.+.|+ .|...+. .+|..|+..+
T Consensus 2 ~f~CP--~C~~-~~~~~~L~~H~~~~H~~~~~~v~CP--iC~~~~~--~~l~~Hl~~~ 52 (54)
T PF05605_consen 2 SFTCP--YCGK-GFSESSLVEHCEDEHRSESKNVVCP--ICSSRVT--DNLIRHLNSQ 52 (54)
T ss_pred CcCCC--CCCC-ccCHHHHHHHHHhHCcCCCCCccCC--Cchhhhh--hHHHHHHHHh
Confidence 48899 9999 5667889999775 6654 569999 9998654 4888998754
No 31
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.44 E-value=7.9e-05 Score=33.96 Aligned_cols=25 Identities=16% Similarity=0.357 Sum_probs=13.4
Q ss_pred ceeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
||.|..|++ .|.....|..|++.|.
T Consensus 1 ~~~C~~C~~--~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGK--TFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTE--EESSHHHHHHHHCTTT
T ss_pred CCCCCccCC--ccCChhHHHHHhHHhc
Confidence 355555554 4555555555555443
No 32
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=97.27 E-value=0.00035 Score=41.98 Aligned_cols=74 Identities=12% Similarity=0.270 Sum_probs=20.6
Q ss_pred eCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHH
Q psy11070 58 SCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVK 137 (148)
Q Consensus 58 ~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~ 137 (148)
.|. .|+..|.....|..|+....+...-. ...+.....+..+.+.-. ...+.|..|++ .|.....|..
T Consensus 1 ~C~--~C~~~f~~~~~l~~H~~~~H~~~~~~------~~~l~~~~~~~~~~~~~~--~~~~~C~~C~~--~f~s~~~l~~ 68 (100)
T PF12756_consen 1 QCL--FCDESFSSVDDLLQHMKKKHGFDIPD------QKYLVDPNRLLNYLRKKV--KESFRCPYCNK--TFRSREALQE 68 (100)
T ss_dssp ----------------------------------------------------------SSEEBSSSS---EESSHHHHHH
T ss_pred Ccc--cccccccccccccccccccccccccc------cccccccccccccccccc--CCCCCCCccCC--CCcCHHHHHH
Confidence 377 99999999999999987633321101 112223334444443322 24689999997 5889999999
Q ss_pred HHHhhC
Q psy11070 138 AVANNS 143 (148)
Q Consensus 138 h~~~h~ 143 (148)
|++.++
T Consensus 69 Hm~~~~ 74 (100)
T PF12756_consen 69 HMRSKH 74 (100)
T ss_dssp HHHHTT
T ss_pred HHcCcc
Confidence 998643
No 33
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=97.26 E-value=0.00036 Score=30.45 Aligned_cols=22 Identities=41% Similarity=0.880 Sum_probs=11.1
Q ss_pred eeCCCcccccccCChHHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
|.|+ .|++.|.+...|..|+..
T Consensus 1 ~~C~--~C~~~~~~~~~l~~H~~~ 22 (24)
T PF13894_consen 1 FQCP--ICGKSFRSKSELRQHMRT 22 (24)
T ss_dssp EE-S--STS-EESSHHHHHHHHHH
T ss_pred CCCc--CCCCcCCcHHHHHHHHHh
Confidence 3455 555555555555555554
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.17 E-value=0.00035 Score=31.74 Aligned_cols=23 Identities=26% Similarity=0.829 Sum_probs=13.0
Q ss_pred eecCcCCcccccCChHHHHHHHHHh
Q psy11070 87 LCCEELGCGRKFQTMKQYSTHLKEH 111 (148)
Q Consensus 87 ~~c~~~~C~~~f~~~~~l~~h~~~h 111 (148)
|.|. .|++.|.....|..|++.|
T Consensus 2 ~~C~--~C~~~F~~~~~l~~H~~~h 24 (27)
T PF13912_consen 2 FECD--ECGKTFSSLSALREHKRSH 24 (27)
T ss_dssp EEET--TTTEEESSHHHHHHHHCTT
T ss_pred CCCC--ccCCccCChhHHHHHhHHh
Confidence 4555 5555555555555555544
No 35
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.92 E-value=0.0012 Score=29.12 Aligned_cols=23 Identities=17% Similarity=0.309 Sum_probs=13.7
Q ss_pred eeCCCCCCCcccccHHHHHHHHHhh
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~h 142 (148)
|.|..|++ .|.....|..|++.|
T Consensus 1 ~~C~~C~~--~f~~~~~l~~H~~~H 23 (26)
T smart00355 1 YRCPECGK--VFKSKSALKEHMRTH 23 (26)
T ss_pred CCCCCCcc--hhCCHHHHHHHHHHh
Confidence 45666665 466666666666644
No 36
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.74 E-value=0.0026 Score=33.24 Aligned_cols=28 Identities=18% Similarity=0.341 Sum_probs=11.4
Q ss_pred CCceecCcCCcccccCChHHHHHHHHHhcC
Q psy11070 84 PLTLCCEELGCGRKFQTMKQYSTHLKEHSN 113 (148)
Q Consensus 84 ~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~ 113 (148)
+.|..|+ .|+..+.+..+|.+|+..+.+
T Consensus 22 ~~PatCP--~C~a~~~~srnLrRHle~~H~ 49 (54)
T PF09237_consen 22 EQPATCP--ICGAVIRQSRNLRRHLEIRHF 49 (54)
T ss_dssp S--EE-T--TT--EESSHHHHHHHHHHHTT
T ss_pred CCCCCCC--cchhhccchhhHHHHHHHHhc
Confidence 3444555 555555555555555544433
No 37
>PRK04860 hypothetical protein; Provisional
Probab=96.67 E-value=0.0018 Score=42.65 Aligned_cols=37 Identities=19% Similarity=0.362 Sum_probs=30.7
Q ss_pred ceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccc
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSL 129 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f 129 (148)
+|.|. |+. ....+.+|.++|++ +++|.|..|+..+.|
T Consensus 119 ~Y~C~---C~~---~~~~~rrH~ri~~g-~~~YrC~~C~~~l~~ 155 (160)
T PRK04860 119 PYRCK---CQE---HQLTVRRHNRVVRG-EAVYRCRRCGETLVF 155 (160)
T ss_pred EEEcC---CCC---eeCHHHHHHHHhcC-CccEECCCCCceeEE
Confidence 57885 876 67788999999999 999999999985444
No 38
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.64 E-value=0.0026 Score=27.96 Aligned_cols=19 Identities=47% Similarity=0.732 Sum_probs=10.5
Q ss_pred ccccccCChHHHHHHHHHh
Q psy11070 63 NCNKRFTTLSNLKMHMVRH 81 (148)
Q Consensus 63 ~C~~~f~~~~~l~~h~~~h 81 (148)
.|++.|.....|..|++.|
T Consensus 5 ~C~~~f~~~~~l~~H~~~H 23 (26)
T smart00355 5 ECGKVFKSKSALKEHMRTH 23 (26)
T ss_pred CCcchhCCHHHHHHHHHHh
Confidence 5555555555555555544
No 39
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=96.58 E-value=0.0023 Score=28.12 Aligned_cols=23 Identities=17% Similarity=0.307 Sum_probs=14.4
Q ss_pred eeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
|+|..|+. ... ...|..|++.++
T Consensus 1 y~C~~C~y--~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPHCSY--STS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SSSS---EES-HHHHHHHHHHHH
T ss_pred CCCCCCCC--cCC-HHHHHHHHHhhC
Confidence 57777773 445 667777777754
No 40
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.42 E-value=0.0063 Score=31.84 Aligned_cols=33 Identities=21% Similarity=0.330 Sum_probs=23.5
Q ss_pred CCCCceeCCCcccccccCChHHHHHHHHHhcCCCc
Q psy11070 52 KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLT 86 (148)
Q Consensus 52 ~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~ 86 (148)
..+.|-.|+ .|+..+.+..+|.+|+....+.+|
T Consensus 20 ~S~~PatCP--~C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCP--ICGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-T--TT--EESSHHHHHHHHHHHTTTS-
T ss_pred ccCCCCCCC--cchhhccchhhHHHHHHHHhcccC
Confidence 356789999 999999999999999988777665
No 41
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.65 E-value=0.0078 Score=26.56 Aligned_cols=17 Identities=35% Similarity=0.725 Sum_probs=8.4
Q ss_pred ccccccCChHHHHHHHH
Q psy11070 63 NCNKRFTTLSNLKMHMV 79 (148)
Q Consensus 63 ~C~~~f~~~~~l~~h~~ 79 (148)
.|++.|.....|..|++
T Consensus 5 ~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 5 ICNKSFSSENSLRQHLR 21 (25)
T ss_dssp TTTEEESSHHHHHHHHT
T ss_pred CCCCCcCCHHHHHHHHC
Confidence 55555555555544443
No 42
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=95.49 E-value=0.014 Score=25.67 Aligned_cols=17 Identities=24% Similarity=0.597 Sum_probs=9.0
Q ss_pred CcccccCChHHHHHHHH
Q psy11070 93 GCGRKFQTMKQYSTHLK 109 (148)
Q Consensus 93 ~C~~~f~~~~~l~~h~~ 109 (148)
.|++.|.....|..|++
T Consensus 5 ~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 5 ICNKSFSSENSLRQHLR 21 (25)
T ss_dssp TTTEEESSHHHHHHHHT
T ss_pred CCCCCcCCHHHHHHHHC
Confidence 55555555555555544
No 43
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=95.10 E-value=0.027 Score=24.55 Aligned_cols=22 Identities=45% Similarity=0.925 Sum_probs=11.4
Q ss_pred eeCCCcccccccCChHHHHHHHHHh
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRH 81 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h 81 (148)
|.|. .|+.... ...|..|++.+
T Consensus 1 y~C~--~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCP--HCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-S--SSS-EES-HHHHHHHHHHH
T ss_pred CCCC--CCCCcCC-HHHHHHHHHhh
Confidence 4555 6665555 55666665553
No 44
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.37 E-value=0.0079 Score=45.11 Aligned_cols=67 Identities=21% Similarity=0.332 Sum_probs=53.6
Q ss_pred CceeCCCcccccccCChHHHHHHHH--HhcCC--CceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCC
Q psy11070 55 QLWSCPIKNCNKRFTTLSNLKMHMV--RHGKP--LTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKG 124 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~~~~l~~h~~--~h~~~--~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~ 124 (148)
.++.|. .|...|.....|..|.+ .|+++ +++.|++..|++.|.+...+..|..+|.+ .+++.+..-.
T Consensus 288 ~~~~~~--~~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 358 (467)
T COG5048 288 LPIKSK--QCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS-ISPAKEKLLN 358 (467)
T ss_pred cCCCCc--cccCCccccccccccccccccccccCCceeeeccCCCccccccccccCCcccccC-CCcccccccc
Confidence 356777 88888888888888888 78888 88888855588888888888888888888 7777665544
No 45
>KOG4173|consensus
Probab=94.21 E-value=0.022 Score=38.76 Aligned_cols=85 Identities=25% Similarity=0.509 Sum_probs=64.2
Q ss_pred CceeCCCcccccccCChHHHHHHHHH-hcCCCceecCcCCcccccCChHHHHHHHHH-hc---------CCCCceeCCCC
Q psy11070 55 QLWSCPIKNCNKRFTTLSNLKMHMVR-HGKPLTLCCEELGCGRKFQTMKQYSTHLKE-HS---------NVSAPYMCDYK 123 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~~~~l~~h~~~-h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~-h~---------~~~k~~~C~~c 123 (148)
+.+.|+...|...|.....+..|..+ |+. .|. .|.+.|.+..-|..|+.- |. | .--|.|-+=
T Consensus 78 ~~~~cqvagc~~~~d~lD~~E~hY~~~h~~----sCs--~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG-~dMy~ClvE 150 (253)
T KOG4173|consen 78 PAFACQVAGCCQVFDALDDYEHHYHTLHGN----SCS--FCKRAFPTGHLLDAHILEWHDSLFQALVERG-QDMYQCLVE 150 (253)
T ss_pred ccccccccchHHHHhhhhhHHHhhhhcccc----hhH--HHHHhCCchhhhhHHHHHHHHHHHHHHHHcC-ccHHHHHHH
Confidence 45889988999999888888888765 432 588 899999999999888752 22 2 234788665
Q ss_pred CCCcccccHHHHHHHHHhhCCCC
Q psy11070 124 GVCMSLTLKISFVKAVANNSKYP 146 (148)
Q Consensus 124 ~~~~~f~~~~~l~~h~~~h~~~~ 146 (148)
||+..|++.-..+.|+-.-+++|
T Consensus 151 gCt~KFkT~r~RkdH~I~~Hk~P 173 (253)
T KOG4173|consen 151 GCTEKFKTSRDRKDHMIRMHKYP 173 (253)
T ss_pred hhhhhhhhhhhhhhHHHHhccCC
Confidence 55567999888999988777766
No 46
>PRK04860 hypothetical protein; Provisional
Probab=93.75 E-value=0.047 Score=36.09 Aligned_cols=39 Identities=15% Similarity=0.363 Sum_probs=31.4
Q ss_pred ceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChH
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMK 102 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~ 102 (148)
+|.|. |++ ....+..|.++|+++++|.|. .|+..|....
T Consensus 119 ~Y~C~---C~~---~~~~~rrH~ri~~g~~~YrC~--~C~~~l~~~~ 157 (160)
T PRK04860 119 PYRCK---CQE---HQLTVRRHNRVVRGEAVYRCR--RCGETLVFKG 157 (160)
T ss_pred EEEcC---CCC---eeCHHHHHHHHhcCCccEECC--CCCceeEEec
Confidence 47775 766 455688999999999999999 9998886543
No 47
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.09 E-value=0.022 Score=25.72 Aligned_cols=21 Identities=38% Similarity=0.808 Sum_probs=12.2
Q ss_pred eeCCCcccccccCChHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMV 79 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~ 79 (148)
|.|. .|++.|.....|..|++
T Consensus 2 ~~C~--~C~k~f~~~~~~~~H~~ 22 (27)
T PF12171_consen 2 FYCD--ACDKYFSSENQLKQHMK 22 (27)
T ss_dssp CBBT--TTTBBBSSHHHHHCCTT
T ss_pred CCcc--cCCCCcCCHHHHHHHHc
Confidence 4455 66666666666655543
No 48
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=91.75 E-value=0.21 Score=22.12 Aligned_cols=20 Identities=30% Similarity=0.725 Sum_probs=13.5
Q ss_pred eeCCCcccccccCChHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMV 79 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~ 79 (148)
..|+ .||+.| ..+.|..|+.
T Consensus 3 ~~C~--~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCP--ICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCC--CCCCEE-CHHHHHHHHH
Confidence 3577 888888 4556777754
No 49
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=91.66 E-value=0.19 Score=23.87 Aligned_cols=22 Identities=23% Similarity=0.366 Sum_probs=14.7
Q ss_pred ceeCCCCCCCcccccHHHHHHHHH
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVA 140 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~ 140 (148)
+|.|..|++ .|.....+..|+.
T Consensus 3 ~~~C~~C~~--~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNV--TFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCC--ccCCHHHHHHHHC
Confidence 466777776 4666666666654
No 50
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=90.46 E-value=0.05 Score=40.79 Aligned_cols=57 Identities=18% Similarity=0.398 Sum_probs=50.4
Q ss_pred CceecCcCCcccccCChHHHHHHHH--HhcCCC--CceeCC--CCCCCcccccHHHHHHHHHhhCCCC
Q psy11070 85 LTLCCEELGCGRKFQTMKQYSTHLK--EHSNVS--APYMCD--YKGVCMSLTLKISFVKAVANNSKYP 146 (148)
Q Consensus 85 ~~~~c~~~~C~~~f~~~~~l~~h~~--~h~~~~--k~~~C~--~c~~~~~f~~~~~l~~h~~~h~~~~ 146 (148)
.++.|. .|...|.....|..|.+ .|++ + +++.|. .|++ .|.+...+..|...|.+..
T Consensus 288 ~~~~~~--~~~~~~s~~~~l~~~~~~~~h~~-~~~~~~~~p~~~~~~--~~~~~~~~~~~~~~~~~~~ 350 (467)
T COG5048 288 LPIKSK--QCNISFSRSSPLTRHLRSVNHSG-ESLKPFSCPYSLCGK--LFSRNDALKRHILLHTSIS 350 (467)
T ss_pred cCCCCc--cccCCcccccccccccccccccc-ccCCceeeeccCCCc--cccccccccCCcccccCCC
Confidence 367788 99999999999999999 8999 8 999999 7997 6999999999988887654
No 51
>KOG2893|consensus
Probab=90.23 E-value=0.056 Score=37.75 Aligned_cols=33 Identities=21% Similarity=0.315 Sum_probs=25.3
Q ss_pred CcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccccc
Q psy11070 93 GCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTL 131 (148)
Q Consensus 93 ~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~ 131 (148)
.|++.|....-|..|++ .|-|+|-+|-| +.|+.
T Consensus 15 ycnrefddekiliqhqk-----akhfkchichk-kl~sg 47 (341)
T KOG2893|consen 15 YCNREFDDEKILIQHQK-----AKHFKCHICHK-KLFSG 47 (341)
T ss_pred ecccccchhhhhhhhhh-----hccceeeeehh-hhccC
Confidence 68888988888888776 57788988887 35543
No 52
>KOG1146|consensus
Probab=89.69 E-value=0.12 Score=44.18 Aligned_cols=79 Identities=18% Similarity=0.280 Sum_probs=43.8
Q ss_pred CCCCCceeCCCcccccccCChHHHHHHHHH-hcCCCceecCcCCcccccCChHHHHHHHHHh-cCCCCceeCCCCCCCcc
Q psy11070 51 SKKQQLWSCPIKNCNKRFTTLSNLKMHMVR-HGKPLTLCCEELGCGRKFQTMKQYSTHLKEH-SNVSAPYMCDYKGVCMS 128 (148)
Q Consensus 51 h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~-h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h-~~~~k~~~C~~c~~~~~ 128 (148)
++-.+.++|+ .|+..|+....|..||+. |.....-.|.| .-... .+ ....+. .+ -++|.|..|.. .
T Consensus 460 ~S~~kt~~cp--kc~~~yk~a~~L~vhmRskhp~~~~~~c~~-----gq~~~-~~-arg~~~~~~-~~p~~C~~C~~--s 527 (1406)
T KOG1146|consen 460 HSFFKTLKCP--KCNWHYKLAQTLGVHMRSKHPESQSAYCKA-----GQNHP-RL-ARGEVYRCP-GKPYPCRACNY--S 527 (1406)
T ss_pred ecccccccCC--ccchhhhhHHHhhhcccccccccchhHhHh-----ccccc-cc-cccccccCC-CCcccceeeee--e
Confidence 4455677777 888888877778777776 33221111211 00000 00 000111 12 37888998886 6
Q ss_pred cccHHHHHHHHHh
Q psy11070 129 LTLKISFVKAVAN 141 (148)
Q Consensus 129 f~~~~~l~~h~~~ 141 (148)
+.....|..|+..
T Consensus 528 tttng~LsihlqS 540 (1406)
T KOG1146|consen 528 TTTNGNLSIHLQS 540 (1406)
T ss_pred eecchHHHHHHHH
Confidence 7787888888764
No 53
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=88.04 E-value=0.6 Score=27.59 Aligned_cols=24 Identities=38% Similarity=0.771 Sum_probs=17.3
Q ss_pred ceeCCCcccccccCChHHHHHHHHHh
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRH 81 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h 81 (148)
.+.|. .|++.|.+...|..||+.+
T Consensus 50 ~~~C~--~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCP--YCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBS--SSS-EESSHHHHHHHHHHT
T ss_pred CCCCC--ccCCCCcCHHHHHHHHcCc
Confidence 47777 8888888888888887763
No 54
>KOG1146|consensus
Probab=87.61 E-value=0.55 Score=40.49 Aligned_cols=22 Identities=5% Similarity=0.135 Sum_probs=12.8
Q ss_pred ceeCCCCCCCcccccHHHHHHHHH
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVA 140 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~ 140 (148)
.+.|.+|+.. -.-.-+|..|+.
T Consensus 589 ~~~C~vc~ye--tniarnlrihmt 610 (1406)
T KOG1146|consen 589 SWRCEVCSYE--TNIARNLRIHMT 610 (1406)
T ss_pred Ccchhhhcch--hhhhhccccccc
Confidence 3789999853 333335555543
No 55
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=87.27 E-value=0.38 Score=35.15 Aligned_cols=25 Identities=16% Similarity=0.295 Sum_probs=13.0
Q ss_pred CceeCCCCCCCcccccHHHHHHHHH
Q psy11070 116 APYMCDYKGVCMSLTLKISFVKAVA 140 (148)
Q Consensus 116 k~~~C~~c~~~~~f~~~~~l~~h~~ 140 (148)
|||+|++.||.+.++..-.|+-|+.
T Consensus 348 KpykCpV~gC~K~YknqnGLKYH~l 372 (423)
T COG5189 348 KPYKCPVEGCNKKYKNQNGLKYHML 372 (423)
T ss_pred ceecCCCCCchhhhccccchhhhhh
Confidence 5555555554445555555555544
No 56
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=86.13 E-value=2.1 Score=28.26 Aligned_cols=45 Identities=9% Similarity=0.197 Sum_probs=32.4
Q ss_pred cCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHHH
Q psy11070 82 GKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVKA 138 (148)
Q Consensus 82 ~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h 138 (148)
.+..-|.|+ .|+..|+...++. --|.|+.||..+.+...+.+..-
T Consensus 105 ~~~~~Y~Cp--~c~~r~tf~eA~~----------~~F~Cp~Cg~~L~~~dn~~~i~~ 149 (158)
T TIGR00373 105 TNNMFFICP--NMCVRFTFNEAME----------LNFTCPRCGAMLDYLDNSEAIEK 149 (158)
T ss_pred cCCCeEECC--CCCcEeeHHHHHH----------cCCcCCCCCCEeeeccCHHHHHH
Confidence 445669999 9999998888774 23999999986555555554443
No 57
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=84.14 E-value=0.97 Score=22.77 Aligned_cols=26 Identities=12% Similarity=0.257 Sum_probs=14.6
Q ss_pred CCceeCCCCCCCcccc--cHHHHHHHHH
Q psy11070 115 SAPYMCDYKGVCMSLT--LKISFVKAVA 140 (148)
Q Consensus 115 ~k~~~C~~c~~~~~f~--~~~~l~~h~~ 140 (148)
.....|..|++.+... ..+.|..|++
T Consensus 14 ~~~a~C~~C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 14 KKKAKCKYCGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp SS-EEETTTTEE-----SSTHHHHHHHH
T ss_pred cCeEEeCCCCeEEeeCCCcHHHHHHhhh
Confidence 4456788888754453 5567888874
No 58
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=83.59 E-value=3.3 Score=27.92 Aligned_cols=43 Identities=14% Similarity=0.294 Sum_probs=29.9
Q ss_pred CCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHH
Q psy11070 83 KPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVK 137 (148)
Q Consensus 83 ~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~ 137 (148)
...-|.|+ .|+..|....++. --|.|+.||..+.+-..+.+..
T Consensus 114 ~~~~Y~Cp--~C~~rytf~eA~~----------~~F~Cp~Cg~~L~~~dn~~~~~ 156 (178)
T PRK06266 114 NNMFFFCP--NCHIRFTFDEAME----------YGFRCPQCGEMLEEYDNSELIK 156 (178)
T ss_pred CCCEEECC--CCCcEEeHHHHhh----------cCCcCCCCCCCCeecccHHHHH
Confidence 44568999 8999888887652 2499999997655544444433
No 59
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=78.31 E-value=3 Score=22.66 Aligned_cols=31 Identities=23% Similarity=0.443 Sum_probs=17.1
Q ss_pred ceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCC
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKG 124 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~ 124 (148)
.|.|+ .||...-....-- |.+ ..+|.|+.||
T Consensus 27 ~F~CP--nCGe~~I~Rc~~C---Rk~---g~~Y~Cp~CG 57 (61)
T COG2888 27 KFPCP--NCGEVEIYRCAKC---RKL---GNPYRCPKCG 57 (61)
T ss_pred EeeCC--CCCceeeehhhhH---HHc---CCceECCCcC
Confidence 47777 7775433332211 111 3678888887
No 60
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=77.68 E-value=2.8 Score=21.75 Aligned_cols=22 Identities=18% Similarity=0.303 Sum_probs=11.9
Q ss_pred eCCCCCCCcccc---cHHHHHHHHH
Q psy11070 119 MCDYKGVCMSLT---LKISFVKAVA 140 (148)
Q Consensus 119 ~C~~c~~~~~f~---~~~~l~~h~~ 140 (148)
.|..|++.++.. ..+.|.+|++
T Consensus 20 ~C~~C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 20 KCKYCGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EecCCCCEeeeCCCCCcHHHHHHHH
Confidence 466666643332 2356666666
No 61
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=77.06 E-value=1.5 Score=23.50 Aligned_cols=26 Identities=31% Similarity=0.579 Sum_probs=19.3
Q ss_pred cCCCceecCcCCcccccCChHHHHHHHH
Q psy11070 82 GKPLTLCCEELGCGRKFQTMKQYSTHLK 109 (148)
Q Consensus 82 ~~~~~~~c~~~~C~~~f~~~~~l~~h~~ 109 (148)
.|+.-+.|+ .|+..|.......+|..
T Consensus 13 DGE~~lrCP--RC~~~FR~~K~Y~RHVN 38 (65)
T COG4049 13 DGEEFLRCP--RCGMVFRRRKDYIRHVN 38 (65)
T ss_pred CCceeeeCC--chhHHHHHhHHHHHHhh
Confidence 456667788 88888888877777775
No 62
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=77.06 E-value=2.9 Score=22.68 Aligned_cols=30 Identities=27% Similarity=0.706 Sum_probs=17.8
Q ss_pred ceecCcCCcccc-cCChHHHHHHHHHhcCCCCceeCCCCC
Q psy11070 86 TLCCEELGCGRK-FQTMKQYSTHLKEHSNVSAPYMCDYKG 124 (148)
Q Consensus 86 ~~~c~~~~C~~~-f~~~~~l~~h~~~h~~~~k~~~C~~c~ 124 (148)
.|.|+ .||.. ..+=..- |.. ..+|.|+.||
T Consensus 25 ~F~CP--nCG~~~I~RC~~C----Rk~---~~~Y~CP~CG 55 (59)
T PRK14890 25 KFLCP--NCGEVIIYRCEKC----RKQ---SNPYTCPKCG 55 (59)
T ss_pred EeeCC--CCCCeeEeechhH----Hhc---CCceECCCCC
Confidence 47888 88876 3222111 111 4679999988
No 63
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=76.33 E-value=1.8 Score=20.41 Aligned_cols=10 Identities=30% Similarity=1.082 Sum_probs=5.5
Q ss_pred eeCCCccccccc
Q psy11070 57 WSCPIKNCNKRF 68 (148)
Q Consensus 57 ~~C~~~~C~~~f 68 (148)
|.|. .||..+
T Consensus 2 ~~C~--~CGy~y 11 (33)
T cd00350 2 YVCP--VCGYIY 11 (33)
T ss_pred EECC--CCCCEE
Confidence 4555 666554
No 64
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=76.22 E-value=9.9 Score=24.67 Aligned_cols=40 Identities=15% Similarity=0.326 Sum_probs=26.0
Q ss_pred CCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccc
Q psy11070 83 KPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSL 129 (148)
Q Consensus 83 ~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f 129 (148)
+..-|.|+ .|+..|.....+.. . . . +..|.|+.||.....
T Consensus 96 ~~~~Y~Cp--~C~~~y~~~ea~~~-~--d-~-~~~f~Cp~Cg~~l~~ 135 (147)
T smart00531 96 NNAYYKCP--NCQSKYTFLEANQL-L--D-M-DGTFTCPRCGEELEE 135 (147)
T ss_pred CCcEEECc--CCCCEeeHHHHHHh-c--C-C-CCcEECCCCCCEEEE
Confidence 44569999 99999886554332 0 1 1 233999999975433
No 65
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=73.71 E-value=1.8 Score=26.64 Aligned_cols=30 Identities=30% Similarity=0.847 Sum_probs=19.9
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCCh
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTM 101 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~ 101 (148)
..|+ .||++|.... ..|-.|+ .||..|...
T Consensus 10 R~Cp--~CG~kFYDLn-----------k~PivCP--~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCP--SCGAKFYDLN-----------KDPIVCP--KCGTEFPPE 39 (108)
T ss_pred ccCC--CCcchhccCC-----------CCCccCC--CCCCccCcc
Confidence 3677 8887775432 3566788 788777665
No 66
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=73.44 E-value=2.3 Score=22.78 Aligned_cols=30 Identities=17% Similarity=0.528 Sum_probs=25.1
Q ss_pred ccCCCCCceeCCCcccccccCChHHHHHHHHH
Q psy11070 49 KNSKKQQLWSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 49 ~~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
+...|+..+.|+ -|+..|.....+.+|...
T Consensus 10 ~~RDGE~~lrCP--RC~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 10 RDRDGEEFLRCP--RCGMVFRRRKDYIRHVNK 39 (65)
T ss_pred eccCCceeeeCC--chhHHHHHhHHHHHHhhH
Confidence 344577889999 999999999999999875
No 67
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=72.95 E-value=3.1 Score=19.82 Aligned_cols=9 Identities=44% Similarity=0.733 Sum_probs=4.9
Q ss_pred CceeCCCCC
Q psy11070 116 APYMCDYKG 124 (148)
Q Consensus 116 k~~~C~~c~ 124 (148)
.|..|++||
T Consensus 17 ~p~~CP~Cg 25 (34)
T cd00729 17 APEKCPICG 25 (34)
T ss_pred CCCcCcCCC
Confidence 344566665
No 68
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=72.80 E-value=5.3 Score=19.90 Aligned_cols=26 Identities=12% Similarity=-0.078 Sum_probs=13.6
Q ss_pred eeCCCCCCCcccccHHHHHHHHHhhC
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVANNS 143 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~h~ 143 (148)
-.|+.||=.+.......-..|.+.|.
T Consensus 14 ~~C~~CgM~Y~~~~~eD~~~H~~yH~ 39 (41)
T PF13878_consen 14 TTCPTCGMLYSPGSPEDEKLHKKYHD 39 (41)
T ss_pred cCCCCCCCEECCCCHHHHHHHHHHHh
Confidence 35666664444444555555555554
No 69
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=72.53 E-value=1.8 Score=22.57 Aligned_cols=20 Identities=15% Similarity=0.169 Sum_probs=15.8
Q ss_pred CCceeCCCCCCCcccccHHH
Q psy11070 115 SAPYMCDYKGVCMSLTLKIS 134 (148)
Q Consensus 115 ~k~~~C~~c~~~~~f~~~~~ 134 (148)
++.+.|..||..|.|+-...
T Consensus 2 Dk~l~C~dCg~~FvfTa~EQ 21 (49)
T PF13451_consen 2 DKTLTCKDCGAEFVFTAGEQ 21 (49)
T ss_pred CeeEEcccCCCeEEEehhHH
Confidence 57899999999888865543
No 70
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=71.93 E-value=0.48 Score=38.37 Aligned_cols=61 Identities=23% Similarity=0.480 Sum_probs=42.6
Q ss_pred eCCCcccccccCChHHHHHHHHHhcCCCce-ecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccc
Q psy11070 58 SCPIKNCNKRFTTLSNLKMHMVRHGKPLTL-CCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLT 130 (148)
Q Consensus 58 ~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~-~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~ 130 (148)
.|. .||.+|+....|..- |.+|....| .|+ .|.+.|..+.+-+ .| ..|..|+.||-...+.
T Consensus 125 ~CT--~CGPRfTIi~alPYD-R~nTsM~~F~lC~--~C~~EY~dP~nRR----fH---AQp~aCp~CGP~~~l~ 186 (750)
T COG0068 125 NCT--NCGPRFTIIEALPYD-RENTSMADFPLCP--FCDKEYKDPLNRR----FH---AQPIACPKCGPHLFLV 186 (750)
T ss_pred ccC--CCCcceeeeccCCCC-cccCccccCcCCH--HHHHHhcCccccc----cc---cccccCcccCCCeEEE
Confidence 488 999999987776432 334444443 488 8999998887644 44 4689999999644444
No 71
>KOG2186|consensus
Probab=71.32 E-value=2.7 Score=29.81 Aligned_cols=49 Identities=24% Similarity=0.596 Sum_probs=30.5
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHHHHHHHHHhc
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQYSTHLKEHS 112 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~ 112 (148)
|.|. .||.+.... .+..|+-.=++ ..|.|. -|+..|.+ .....|..--+
T Consensus 4 FtCn--vCgEsvKKp-~vekH~srCrn-~~fSCI--DC~k~F~~-~sYknH~kCIT 52 (276)
T KOG2186|consen 4 FTCN--VCGESVKKP-QVEKHMSRCRN-AYFSCI--DCGKTFER-VSYKNHTKCIT 52 (276)
T ss_pred Eehh--hhhhhcccc-chHHHHHhccC-CeeEEe--eccccccc-chhhhhhhhcc
Confidence 5666 777766544 35667665444 457777 77777776 55666655443
No 72
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=71.18 E-value=4.7 Score=19.53 Aligned_cols=9 Identities=22% Similarity=0.265 Sum_probs=4.0
Q ss_pred ceeCCCCCC
Q psy11070 117 PYMCDYKGV 125 (148)
Q Consensus 117 ~~~C~~c~~ 125 (148)
..+|+.|+.
T Consensus 25 ~vrC~~C~~ 33 (37)
T PF13719_consen 25 KVRCPKCGH 33 (37)
T ss_pred EEECCCCCc
Confidence 344444443
No 73
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=70.41 E-value=4.6 Score=19.43 Aligned_cols=7 Identities=43% Similarity=1.084 Sum_probs=3.2
Q ss_pred ccccccC
Q psy11070 63 NCNKRFT 69 (148)
Q Consensus 63 ~C~~~f~ 69 (148)
.|+..|.
T Consensus 7 ~C~~~~~ 13 (38)
T TIGR02098 7 NCKTSFR 13 (38)
T ss_pred CCCCEEE
Confidence 4444443
No 74
>KOG2785|consensus
Probab=68.47 E-value=7.2 Score=29.43 Aligned_cols=26 Identities=23% Similarity=0.371 Sum_probs=21.6
Q ss_pred ceeCCCcccccccCChHHHHHHHHHhcC
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRHGK 83 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h~~ 83 (148)
|--|. .|++.+.....-..||..++|
T Consensus 166 Pt~CL--fC~~~~k~~e~~~~HM~~~Hg 191 (390)
T KOG2785|consen 166 PTDCL--FCDKKSKSLEENLKHMFKEHG 191 (390)
T ss_pred Cccee--ecCCCcccHHHHHHHHhhccC
Confidence 45688 999999999888999988665
No 75
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=67.91 E-value=6.9 Score=18.85 Aligned_cols=8 Identities=25% Similarity=0.945 Sum_probs=3.6
Q ss_pred ccccccCC
Q psy11070 63 NCNKRFTT 70 (148)
Q Consensus 63 ~C~~~f~~ 70 (148)
.|+..|.-
T Consensus 7 ~C~~~y~i 14 (36)
T PF13717_consen 7 NCQAKYEI 14 (36)
T ss_pred CCCCEEeC
Confidence 44444443
No 76
>KOG2231|consensus
Probab=66.48 E-value=8.2 Score=31.50 Aligned_cols=15 Identities=40% Similarity=0.620 Sum_probs=9.0
Q ss_pred cccCChHHHHHHHHH
Q psy11070 66 KRFTTLSNLKMHMVR 80 (148)
Q Consensus 66 ~~f~~~~~l~~h~~~ 80 (148)
..|.....|+.|+..
T Consensus 122 ~~~~s~~~Lk~H~~~ 136 (669)
T KOG2231|consen 122 TEFKSVENLKNHMRD 136 (669)
T ss_pred cchhHHHHHHHHHHH
Confidence 334466777777743
No 77
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=64.56 E-value=5 Score=29.96 Aligned_cols=24 Identities=33% Similarity=0.772 Sum_probs=14.8
Q ss_pred eeCCCcccccccCChHHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
|.|+-..|..+......|+.|.+.
T Consensus 152 F~CP~skc~~~C~~~k~lk~H~K~ 175 (493)
T COG5236 152 FKCPKSKCHRRCGSLKELKKHYKA 175 (493)
T ss_pred hcCCchhhhhhhhhHHHHHHHHHh
Confidence 666655666655555666666665
No 78
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=61.65 E-value=6.4 Score=25.95 Aligned_cols=35 Identities=29% Similarity=0.674 Sum_probs=27.3
Q ss_pred CCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccC
Q psy11070 52 KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQ 99 (148)
Q Consensus 52 ~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~ 99 (148)
.+..-|.|+ .|+.+|+....+. ..|.|+ .||....
T Consensus 105 ~~~~~Y~Cp--~c~~r~tf~eA~~---------~~F~Cp--~Cg~~L~ 139 (158)
T TIGR00373 105 TNNMFFICP--NMCVRFTFNEAME---------LNFTCP--RCGAMLD 139 (158)
T ss_pred cCCCeEECC--CCCcEeeHHHHHH---------cCCcCC--CCCCEee
Confidence 344569999 9999998887764 269999 9997654
No 79
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=60.97 E-value=9.4 Score=23.28 Aligned_cols=31 Identities=16% Similarity=0.376 Sum_probs=23.6
Q ss_pred CCccCCCCCceeC----CCcccccccCChHHHHHHHHHhc
Q psy11070 47 GGKNSKKQQLWSC----PIKNCNKRFTTLSNLKMHMVRHG 82 (148)
Q Consensus 47 ~~~~h~~~k~~~C----~~~~C~~~f~~~~~l~~h~~~h~ 82 (148)
.+.++.| |.| . .|+....+...+..|.+.+.
T Consensus 74 gLp~~~G---~~C~~~~~--~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 74 GLPVYDG---YRCQCDPP--HCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred CCCCCCC---eeeecCCC--CCCcEeccHHHHHHHHHHhc
Confidence 3445544 888 6 99999999999999988754
No 80
>KOG4173|consensus
Probab=60.46 E-value=3 Score=28.75 Aligned_cols=79 Identities=24% Similarity=0.336 Sum_probs=55.6
Q ss_pred HhhhhcCCCCcceeeccC---CCccCCCCCceeCCCcccccccCChHHHHHHHHH-h---------cCCCceecCcCCcc
Q psy11070 29 TEALQSLGINKEYFISST---GGKNSKKQQLWSCPIKNCNKRFTTLSNLKMHMVR-H---------GKPLTLCCEELGCG 95 (148)
Q Consensus 29 ~~~~~~~~~~~~~~~~~~---~~~~h~~~k~~~C~~~~C~~~f~~~~~l~~h~~~-h---------~~~~~~~c~~~~C~ 95 (148)
...+...+|...|...-. |...-.+ ..|. .|.+.|.+..-|..|+.. | .|.-.|+|..++|+
T Consensus 79 ~~~cqvagc~~~~d~lD~~E~hY~~~h~---~sCs--~C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvEgCt 153 (253)
T KOG4173|consen 79 AFACQVAGCCQVFDALDDYEHHYHTLHG---NSCS--FCKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVEGCT 153 (253)
T ss_pred cccccccchHHHHhhhhhHHHhhhhccc---chhH--HHHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHHhhh
Confidence 334566677776665532 2211111 3799 999999999999999864 4 34556999877899
Q ss_pred cccCChHHHHHHH-HHhc
Q psy11070 96 RKFQTMKQYSTHL-KEHS 112 (148)
Q Consensus 96 ~~f~~~~~l~~h~-~~h~ 112 (148)
..|.+...-+.|+ +.|.
T Consensus 154 ~KFkT~r~RkdH~I~~Hk 171 (253)
T KOG4173|consen 154 EKFKTSRDRKDHMIRMHK 171 (253)
T ss_pred hhhhhhhhhhhHHHHhcc
Confidence 9999999999997 4554
No 81
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=60.30 E-value=5.3 Score=26.93 Aligned_cols=35 Identities=29% Similarity=0.744 Sum_probs=26.7
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCC
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQT 100 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~ 100 (148)
+..-|.|+ .|+.+|+....+. ..|.|+ .||.....
T Consensus 114 ~~~~Y~Cp--~C~~rytf~eA~~---------~~F~Cp--~Cg~~L~~ 148 (178)
T PRK06266 114 NNMFFFCP--NCHIRFTFDEAME---------YGFRCP--QCGEMLEE 148 (178)
T ss_pred CCCEEECC--CCCcEEeHHHHhh---------cCCcCC--CCCCCCee
Confidence 34569999 9999998877652 369999 99976543
No 82
>PF15269 zf-C2H2_7: Zinc-finger
Probab=60.20 E-value=8.5 Score=19.62 Aligned_cols=22 Identities=14% Similarity=0.149 Sum_probs=14.1
Q ss_pred eeCCCCCCCcccccHHHHHHHHHh
Q psy11070 118 YMCDYKGVCMSLTLKISFVKAVAN 141 (148)
Q Consensus 118 ~~C~~c~~~~~f~~~~~l~~h~~~ 141 (148)
|+|-+|. |....++.|-.|++.
T Consensus 21 ykcfqcp--ftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQCP--FTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeecCC--cccchHHHHHHHHHH
Confidence 5576666 455667777777653
No 83
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=59.58 E-value=5.8 Score=25.15 Aligned_cols=32 Identities=22% Similarity=0.392 Sum_probs=19.6
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccCChHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQTMKQ 103 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~~~~~ 103 (148)
..|+ .||++|.... ..|-.|+ .||..|.....
T Consensus 10 r~Cp--~cg~kFYDLn-----------k~p~vcP--~cg~~~~~~~~ 41 (129)
T TIGR02300 10 RICP--NTGSKFYDLN-----------RRPAVSP--YTGEQFPPEEA 41 (129)
T ss_pred ccCC--CcCccccccC-----------CCCccCC--CcCCccCcchh
Confidence 4677 7777775422 3466777 77777655533
No 84
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=59.16 E-value=13 Score=24.17 Aligned_cols=39 Identities=26% Similarity=0.692 Sum_probs=26.3
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccccC
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRKFQ 99 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~f~ 99 (148)
+..-|.|+ .|+..|.....+.. ... ...|.|+ .||....
T Consensus 96 ~~~~Y~Cp--~C~~~y~~~ea~~~---~d~-~~~f~Cp--~Cg~~l~ 134 (147)
T smart00531 96 NNAYYKCP--NCQSKYTFLEANQL---LDM-DGTFTCP--RCGEELE 134 (147)
T ss_pred CCcEEECc--CCCCEeeHHHHHHh---cCC-CCcEECC--CCCCEEE
Confidence 34569999 99999986554321 122 3349999 9997664
No 85
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=59.08 E-value=11 Score=22.93 Aligned_cols=25 Identities=20% Similarity=0.186 Sum_probs=21.4
Q ss_pred eeC----CCCCCCcccccHHHHHHHHHhhCC
Q psy11070 118 YMC----DYKGVCMSLTLKISFVKAVANNSK 144 (148)
Q Consensus 118 ~~C----~~c~~~~~f~~~~~l~~h~~~h~~ 144 (148)
|.| ..|+. .+.....+++|.+.++|
T Consensus 81 ~~C~~~~~~C~y--~~~~~~~m~~H~~~~Hg 109 (109)
T PF12013_consen 81 YRCQCDPPHCGY--ITRSKKTMRKHWRKEHG 109 (109)
T ss_pred eeeecCCCCCCc--EeccHHHHHHHHHHhcC
Confidence 889 88874 78899999999998775
No 86
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=58.77 E-value=7.8 Score=23.54 Aligned_cols=14 Identities=29% Similarity=0.653 Sum_probs=7.1
Q ss_pred ceecCcCCcccccCCh
Q psy11070 86 TLCCEELGCGRKFQTM 101 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~ 101 (148)
|++|. .||..|..-
T Consensus 2 pH~Ct--rCG~vf~~g 15 (112)
T COG3364 2 PHQCT--RCGEVFDDG 15 (112)
T ss_pred Cceec--ccccccccc
Confidence 34555 555555543
No 87
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=57.85 E-value=4.1 Score=20.68 Aligned_cols=8 Identities=38% Similarity=0.771 Sum_probs=3.9
Q ss_pred eeCCCCCC
Q psy11070 118 YMCDYKGV 125 (148)
Q Consensus 118 ~~C~~c~~ 125 (148)
..|+.||.
T Consensus 22 ~~Cp~CG~ 29 (46)
T PRK00398 22 VRCPYCGY 29 (46)
T ss_pred eECCCCCC
Confidence 45555553
No 88
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=56.48 E-value=8.5 Score=25.68 Aligned_cols=10 Identities=40% Similarity=0.767 Sum_probs=5.5
Q ss_pred CCceeCCCCC
Q psy11070 115 SAPYMCDYKG 124 (148)
Q Consensus 115 ~k~~~C~~c~ 124 (148)
+-|-.|++||
T Consensus 147 e~P~~CPiCg 156 (166)
T COG1592 147 EAPEVCPICG 156 (166)
T ss_pred CCCCcCCCCC
Confidence 4455555555
No 89
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=56.28 E-value=12 Score=23.91 Aligned_cols=29 Identities=31% Similarity=0.539 Sum_probs=16.5
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHHhcCCCc
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLT 86 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~ 86 (148)
.+.-..|- .||+.|+. |++|+..|.|-.|
T Consensus 69 ~~d~i~cl--ecGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 69 TPDYIICL--ECGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp -SS-EE-T--BT--EESB---HHHHHHHTT-S-H
T ss_pred ccCeeEEc--cCCcccch---HHHHHHHccCCCH
Confidence 34447788 99998875 5889888866543
No 90
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=55.40 E-value=13 Score=18.80 Aligned_cols=9 Identities=22% Similarity=0.246 Sum_probs=4.6
Q ss_pred ceeCCCCCC
Q psy11070 117 PYMCDYKGV 125 (148)
Q Consensus 117 ~~~C~~c~~ 125 (148)
+..|+.||.
T Consensus 19 ~irC~~CG~ 27 (44)
T smart00659 19 VVRCRECGY 27 (44)
T ss_pred ceECCCCCc
Confidence 345555553
No 91
>KOG2231|consensus
Probab=55.36 E-value=25 Score=28.87 Aligned_cols=21 Identities=24% Similarity=0.485 Sum_probs=13.6
Q ss_pred ecCcCCcccccCChHHHHHHHHH
Q psy11070 88 CCEELGCGRKFQTMKQYSTHLKE 110 (148)
Q Consensus 88 ~c~~~~C~~~f~~~~~l~~h~~~ 110 (148)
.|. .|...|.....|..|++.
T Consensus 184 ~C~--~C~~~fld~~el~rH~~~ 204 (669)
T KOG2231|consen 184 LCK--FCHERFLDDDELYRHLRF 204 (669)
T ss_pred cch--hhhhhhccHHHHHHhhcc
Confidence 455 666667777677666653
No 92
>KOG2593|consensus
Probab=54.63 E-value=18 Score=27.89 Aligned_cols=37 Identities=14% Similarity=0.468 Sum_probs=24.4
Q ss_pred CCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCC
Q psy11070 83 KPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGV 125 (148)
Q Consensus 83 ~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~ 125 (148)
....|.|+ .|.+.|.....++. .-.. .--|.|..|+-
T Consensus 125 ~~~~Y~Cp--~C~kkyt~Lea~~L---~~~~-~~~F~C~~C~g 161 (436)
T KOG2593|consen 125 NVAGYVCP--NCQKKYTSLEALQL---LDNE-TGEFHCENCGG 161 (436)
T ss_pred ccccccCC--ccccchhhhHHHHh---hccc-CceEEEecCCC
Confidence 45668888 88888887776642 1221 23588888874
No 93
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=54.46 E-value=7.6 Score=17.00 Aligned_cols=8 Identities=38% Similarity=0.978 Sum_probs=4.7
Q ss_pred ceeCCCCC
Q psy11070 117 PYMCDYKG 124 (148)
Q Consensus 117 ~~~C~~c~ 124 (148)
.|.|+.||
T Consensus 16 ~f~CPnCG 23 (24)
T PF07754_consen 16 PFPCPNCG 23 (24)
T ss_pred eEeCCCCC
Confidence 46666665
No 94
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=54.38 E-value=10 Score=24.17 Aligned_cols=13 Identities=23% Similarity=0.690 Sum_probs=7.1
Q ss_pred eeCCCcccccccCCh
Q psy11070 57 WSCPIKNCNKRFTTL 71 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~ 71 (148)
+.|. .||+.|..-
T Consensus 2 H~Ct--~Cg~~f~dg 14 (131)
T PF09845_consen 2 HQCT--KCGRVFEDG 14 (131)
T ss_pred cccC--cCCCCcCCC
Confidence 4555 666655543
No 95
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=53.84 E-value=17 Score=22.58 Aligned_cols=78 Identities=17% Similarity=0.221 Sum_probs=42.2
Q ss_pred CceeCCCcccccccCChHHHHHHHHHhc-CCCc------------eecCcCCcccccCChHHHHHHHHHhcCCCCceeCC
Q psy11070 55 QLWSCPIKNCNKRFTTLSNLKMHMVRHG-KPLT------------LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCD 121 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~~~~l~~h~~~h~-~~~~------------~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~ 121 (148)
-|-.|+ .|+-+.-....|.+..- |. ..++ -.|- .|...|........- .-.. ...|.|.
T Consensus 14 LP~~Cp--iCgLtLVss~HLARSyH-HLfPl~~f~ev~~~~~~~~~~C~--~C~~~f~~~~~~~~~--~~~~-~~~y~C~ 85 (112)
T TIGR00622 14 LPVECP--ICGLTLILSTHLARSYH-HLFPLKAFQEIPLEEYNGSRFCF--GCQGPFPKPPVSPFD--ELKD-SHRYVCA 85 (112)
T ss_pred CCCcCC--cCCCEEeccchHHHhhh-ccCCCcccccccccccCCCCccc--CcCCCCCCccccccc--cccc-ccceeCC
Confidence 346777 88887777777654211 10 1111 1377 788888765321100 0112 3468899
Q ss_pred CCCCCcccccHHHHHHHHHhh
Q psy11070 122 YKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 122 ~c~~~~~f~~~~~l~~h~~~h 142 (148)
.|.. .|-..-+.-.|...|
T Consensus 86 ~C~~--~FC~dCD~fiHe~Lh 104 (112)
T TIGR00622 86 VCKN--VFCVDCDVFVHESLH 104 (112)
T ss_pred CCCC--ccccccchhhhhhcc
Confidence 9987 455444555555444
No 96
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=53.09 E-value=12 Score=26.02 Aligned_cols=28 Identities=29% Similarity=0.729 Sum_probs=21.3
Q ss_pred CCCceeCCCcccccccCChHHHHHHHHH-hc
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMHMVR-HG 82 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h~~~-h~ 82 (148)
++..|.|. .|+|.|+-......|+.. |.
T Consensus 74 ~~~K~~C~--lc~KlFkg~eFV~KHI~nKH~ 102 (214)
T PF04959_consen 74 DEDKWRCP--LCGKLFKGPEFVRKHIFNKHP 102 (214)
T ss_dssp SSEEEEE---SSS-EESSHHHHHHHHHHH-H
T ss_pred cCCEECCC--CCCcccCChHHHHHHHhhcCH
Confidence 44559999 999999999999999986 53
No 97
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=52.64 E-value=29 Score=26.16 Aligned_cols=77 Identities=19% Similarity=0.446 Sum_probs=48.7
Q ss_pred eCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcc----cccCChHHHHHHHHHhcCCCCceeCCC----CCCCccc
Q psy11070 58 SCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCG----RKFQTMKQYSTHLKEHSNVSAPYMCDY----KGVCMSL 129 (148)
Q Consensus 58 ~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~----~~f~~~~~l~~h~~~h~~~~k~~~C~~----c~~~~~f 129 (148)
.|. .|...|.....|..|++.-. ++.+.|+ .-+ ..|..-..|-.|-+ .--|.|.. -|+-+.|
T Consensus 222 ~C~--FC~~~FYdDDEL~~HcR~~H-E~ChICD--~v~p~~~QYFK~Y~~Le~HF~-----~~hy~ct~qtc~~~k~~vf 291 (493)
T COG5236 222 LCI--FCKIYFYDDDELRRHCRLRH-EACHICD--MVGPIRYQYFKSYEDLEAHFR-----NAHYCCTFQTCRVGKCYVF 291 (493)
T ss_pred hhh--hccceecChHHHHHHHHhhh-hhhhhhh--ccCccchhhhhCHHHHHHHhh-----cCceEEEEEEEecCcEEEe
Confidence 577 89999999999999988632 4555555 222 23455555655544 23355543 2344578
Q ss_pred ccHHHHHHHHHhhCC
Q psy11070 130 TLKISFVKAVANNSK 144 (148)
Q Consensus 130 ~~~~~l~~h~~~h~~ 144 (148)
.....|..|+-.-++
T Consensus 292 ~~~~el~~h~~~~h~ 306 (493)
T COG5236 292 PYHTELLEHLTRFHK 306 (493)
T ss_pred ccHHHHHHHHHHHhh
Confidence 888888888765444
No 98
>KOG4377|consensus
Probab=52.48 E-value=5.4 Score=30.44 Aligned_cols=76 Identities=14% Similarity=0.256 Sum_probs=51.5
Q ss_pred CCCCceeCCCcccccccCChHHHHHHHHHhcCC------------CceecCcCCcccccCChHHHHHHHHHhcCC----C
Q psy11070 52 KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKP------------LTLCCEELGCGRKFQTMKQYSTHLKEHSNV----S 115 (148)
Q Consensus 52 ~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~------------~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~----~ 115 (148)
.+...|.|.-+.|+..+..+..+.+|...|... ..|.|....|.+ .-++...|-..|+.. -
T Consensus 267 ~~rEhyhcl~e~C~ykr~~k~DvirH~~~hkkrdnsL~dgf~rfs~syhC~~~~C~k---sTsdV~~h~nFht~~~n~Gf 343 (480)
T KOG4377|consen 267 AGREHYHCLNEYCFYKRGQKNDVIRHVEIHKKRDNSLIDGFHRFSNSYHCTGQICEK---STSDVLLHDNFHTDKRNNGF 343 (480)
T ss_pred ccchhhcccCccccccccchhhhHHHHHHHhhcccccccchhhcCccchhhhcccCc---ccccccccCccccccccCce
Confidence 345568898889998777788888998887532 225677667887 334455666666530 1
Q ss_pred C--ceeCCCCCCCcccc
Q psy11070 116 A--PYMCDYKGVCMSLT 130 (148)
Q Consensus 116 k--~~~C~~c~~~~~f~ 130 (148)
+ -|.|..||+++.|+
T Consensus 344 rrthfhC~r~gCTdtfK 360 (480)
T KOG4377|consen 344 RRTHFHCQRIGCTDTFK 360 (480)
T ss_pred ecceeEEeccCCccccc
Confidence 2 36799999876776
No 99
>KOG4167|consensus
Probab=50.73 E-value=3.9 Score=33.51 Aligned_cols=25 Identities=28% Similarity=0.686 Sum_probs=22.8
Q ss_pred ceeCCCcccccccCChHHHHHHHHHhc
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVRHG 82 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~h~ 82 (148)
-|.|. .|++.|....++..||+.|.
T Consensus 792 iFpCr--eC~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCR--ECGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehH--HHHHHHHHHhhhhHHHHHHH
Confidence 49999 99999999999999999985
No 100
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=49.83 E-value=3.4 Score=28.72 Aligned_cols=17 Identities=35% Similarity=0.724 Sum_probs=12.0
Q ss_pred CceeCCCcccccccCChHH
Q psy11070 55 QLWSCPIKNCNKRFTTLSN 73 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~~~~ 73 (148)
+...|+ .|++.|....-
T Consensus 4 k~~~CP--vC~~~F~~~~v 20 (214)
T PF09986_consen 4 KKITCP--VCGKEFKTKKV 20 (214)
T ss_pred CceECC--CCCCeeeeeEE
Confidence 456788 88888876643
No 101
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=48.35 E-value=14 Score=19.26 Aligned_cols=11 Identities=36% Similarity=0.960 Sum_probs=6.8
Q ss_pred ceecCcCCccccc
Q psy11070 86 TLCCEELGCGRKF 98 (148)
Q Consensus 86 ~~~c~~~~C~~~f 98 (148)
.|.|. .||+.|
T Consensus 6 ~Y~C~--~Cg~~~ 16 (49)
T COG1996 6 EYKCA--RCGREV 16 (49)
T ss_pred EEEhh--hcCCee
Confidence 35666 666666
No 102
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=45.40 E-value=54 Score=22.16 Aligned_cols=42 Identities=12% Similarity=0.164 Sum_probs=26.6
Q ss_pred hcCCCceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHH
Q psy11070 81 HGKPLTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKIS 134 (148)
Q Consensus 81 h~~~~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~ 134 (148)
-++..-|.|+ .|...|+.-.+. +..|.|+.||....+...+.
T Consensus 108 ~~~~~~y~C~--~~~~r~sfdeA~----------~~~F~Cp~Cg~~L~~~d~s~ 149 (176)
T COG1675 108 ETENNYYVCP--NCHVKYSFDEAM----------ELGFTCPKCGEDLEEYDSSE 149 (176)
T ss_pred hccCCceeCC--CCCCcccHHHHH----------HhCCCCCCCCchhhhccchH
Confidence 3455668998 887766655432 23389999997655544443
No 103
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=43.78 E-value=14 Score=23.75 Aligned_cols=25 Identities=28% Similarity=0.389 Sum_probs=19.8
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCc
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLT 86 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~ 86 (148)
..|- .+|+.|+ +|++|..+|.|--|
T Consensus 77 IicL--EDGkkfK---SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICL--EDGKKFK---SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEe--ccCcchH---HHHHHHhcccCCCH
Confidence 6788 9999885 48899998877554
No 104
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=42.96 E-value=21 Score=24.88 Aligned_cols=25 Identities=16% Similarity=0.133 Sum_probs=16.7
Q ss_pred CceeCCCCCCCcccccHHHHHHHHHhh
Q psy11070 116 APYMCDYKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 116 k~~~C~~c~~~~~f~~~~~l~~h~~~h 142 (148)
-.|.|..|+| .|+-..-..+|+..-
T Consensus 76 ~K~~C~lc~K--lFkg~eFV~KHI~nK 100 (214)
T PF04959_consen 76 DKWRCPLCGK--LFKGPEFVRKHIFNK 100 (214)
T ss_dssp EEEEE-SSS---EESSHHHHHHHHHHH
T ss_pred CEECCCCCCc--ccCChHHHHHHHhhc
Confidence 4578888886 677777777777653
No 105
>PHA00626 hypothetical protein
Probab=41.58 E-value=38 Score=18.21 Aligned_cols=13 Identities=15% Similarity=0.434 Sum_probs=8.3
Q ss_pred ceeCCCcccccccCC
Q psy11070 56 LWSCPIKNCNKRFTT 70 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~ 70 (148)
.|+|. .||..|+.
T Consensus 23 rYkCk--dCGY~ft~ 35 (59)
T PHA00626 23 DYVCC--DCGYNDSK 35 (59)
T ss_pred ceEcC--CCCCeech
Confidence 46677 77766653
No 106
>KOG2186|consensus
Probab=41.39 E-value=24 Score=25.31 Aligned_cols=46 Identities=22% Similarity=0.447 Sum_probs=34.2
Q ss_pred eecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCcccccHHHHHHHHH
Q psy11070 87 LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSLTLKISFVKAVA 140 (148)
Q Consensus 87 ~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h~~ 140 (148)
|.|. .||....-+ .+-.|+..-.+ ..|.|.-||+ .|-+ ...+.|..
T Consensus 4 FtCn--vCgEsvKKp-~vekH~srCrn--~~fSCIDC~k--~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCN--VCGESVKKP-QVEKHMSRCRN--AYFSCIDCGK--TFER-VSYKNHTK 49 (276)
T ss_pred Eehh--hhhhhcccc-chHHHHHhccC--CeeEEeeccc--cccc-chhhhhhh
Confidence 7798 999887766 45568876665 6799999998 4755 55667754
No 107
>KOG4167|consensus
Probab=41.12 E-value=6.9 Score=32.18 Aligned_cols=26 Identities=31% Similarity=0.731 Sum_probs=23.3
Q ss_pred ceecCcCCcccccCChHHHHHHHHHhcC
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLKEHSN 113 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~~h~~ 113 (148)
-|.|. .|++.|.-...+..||++|.-
T Consensus 792 iFpCr--eC~kvF~KiKSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 792 IFPCR--ECGKVFFKIKSRNAHMKTHRQ 817 (907)
T ss_pred eeehH--HHHHHHHHHhhhhHHHHHHHH
Confidence 38899 999999999999999999874
No 108
>PF11931 DUF3449: Domain of unknown function (DUF3449); InterPro: IPR024598 This presumed domain is functionally uncharacterised. It has two conserved sequence motifs: PIP and CEICG and contains a zinc-finger of the C2H2-type.; PDB: 4DGW_A.
Probab=40.31 E-value=9.4 Score=26.21 Aligned_cols=23 Identities=26% Similarity=0.401 Sum_probs=0.0
Q ss_pred CCceeCCCCCCCcccccHHHHHHH
Q psy11070 115 SAPYMCDYKGVCMSLTLKISFVKA 138 (148)
Q Consensus 115 ~k~~~C~~c~~~~~f~~~~~l~~h 138 (148)
.+-|.|.+||- +.|..+-.+.+|
T Consensus 99 ~~ey~CEICGN-~~Y~GrkaFekH 121 (196)
T PF11931_consen 99 GVEYKCEICGN-QSYKGRKAFEKH 121 (196)
T ss_dssp ------------------------
T ss_pred CCeeeeEeCCC-cceecHHHHHHh
Confidence 45566666665 244444444444
No 109
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=40.28 E-value=36 Score=14.97 Aligned_cols=19 Identities=21% Similarity=0.562 Sum_probs=10.3
Q ss_pred eCCCcccccccCChHHHHHHHH
Q psy11070 58 SCPIKNCNKRFTTLSNLKMHMV 79 (148)
Q Consensus 58 ~C~~~~C~~~f~~~~~l~~h~~ 79 (148)
.|+ .|++.+ ....+..|..
T Consensus 3 ~CP--iC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCP--VCFREV-PENLINSHLD 21 (26)
T ss_pred cCC--CCcCcc-cHHHHHHHHH
Confidence 466 777666 3344455543
No 110
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=39.55 E-value=27 Score=24.10 Aligned_cols=41 Identities=15% Similarity=0.233 Sum_probs=22.4
Q ss_pred CcccccCChHHHHHHHHHhcCCCCceeCCCCCCCc--ccccHHHHHHHHH
Q psy11070 93 GCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCM--SLTLKISFVKAVA 140 (148)
Q Consensus 93 ~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~--~f~~~~~l~~h~~ 140 (148)
.||..|...-++..- .+--.|.+||++. .|.-.+.|.+.+.
T Consensus 213 ~CGH~FC~~Cai~~y-------~kg~~C~~Cgk~t~G~f~V~~d~~kmL~ 255 (259)
T COG5152 213 ECGHSFCSLCAIRKY-------QKGDECGVCGKATYGRFWVVSDLQKMLN 255 (259)
T ss_pred hcchhHHHHHHHHHh-------ccCCcceecchhhccceeHHhhHHHHHh
Confidence 455555555444432 2234688888742 3555666666554
No 111
>PF11789 zf-Nse: Zinc-finger of the MIZ type in Nse subunit; PDB: 2YU4_A 3HTK_C.
Probab=36.99 E-value=30 Score=18.52 Aligned_cols=34 Identities=15% Similarity=0.336 Sum_probs=16.6
Q ss_pred CceecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCC
Q psy11070 85 LTLCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGV 125 (148)
Q Consensus 85 ~~~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~ 125 (148)
.|.... .|+..|....-+.. + .. .+...|++-|+
T Consensus 23 ~PV~s~--~C~H~fek~aI~~~-i---~~-~~~~~CPv~GC 56 (57)
T PF11789_consen 23 DPVKSK--KCGHTFEKEAILQY-I---QR-NGSKRCPVAGC 56 (57)
T ss_dssp SEEEES--SS--EEEHHHHHHH-C---TT-TS-EE-SCCC-
T ss_pred CCcCcC--CCCCeecHHHHHHH-H---Hh-cCCCCCCCCCC
Confidence 456666 77777776644332 2 22 45677877664
No 112
>PF14353 CpXC: CpXC protein
Probab=36.80 E-value=55 Score=20.45 Aligned_cols=9 Identities=44% Similarity=1.542 Sum_probs=5.1
Q ss_pred eCCCccccccc
Q psy11070 58 SCPIKNCNKRF 68 (148)
Q Consensus 58 ~C~~~~C~~~f 68 (148)
.|+ .|+..|
T Consensus 3 tCP--~C~~~~ 11 (128)
T PF14353_consen 3 TCP--HCGHEF 11 (128)
T ss_pred CCC--CCCCee
Confidence 455 666554
No 113
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=35.69 E-value=8.7 Score=19.87 Aligned_cols=11 Identities=36% Similarity=1.189 Sum_probs=5.8
Q ss_pred eeCCCcccccccC
Q psy11070 57 WSCPIKNCNKRFT 69 (148)
Q Consensus 57 ~~C~~~~C~~~f~ 69 (148)
|.|. .|+..|.
T Consensus 6 y~C~--~Cg~~fe 16 (52)
T TIGR02605 6 YRCT--ACGHRFE 16 (52)
T ss_pred EEeC--CCCCEeE
Confidence 4555 5555554
No 114
>KOG3408|consensus
Probab=34.75 E-value=29 Score=21.85 Aligned_cols=24 Identities=33% Similarity=0.546 Sum_probs=18.6
Q ss_pred CCceecCcCCcccccCChHHHHHHHH
Q psy11070 84 PLTLCCEELGCGRKFQTMKQYSTHLK 109 (148)
Q Consensus 84 ~~~~~c~~~~C~~~f~~~~~l~~h~~ 109 (148)
...|-|. .|.+-|.....|..|.+
T Consensus 55 ~GqfyCi--~CaRyFi~~~~l~~H~k 78 (129)
T KOG3408|consen 55 GGQFYCI--ECARYFIDAKALKTHFK 78 (129)
T ss_pred Cceeehh--hhhhhhcchHHHHHHHh
Confidence 3447788 88888888888888876
No 115
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.20 E-value=39 Score=20.90 Aligned_cols=13 Identities=15% Similarity=0.299 Sum_probs=6.9
Q ss_pred CCceeCCCccccccc
Q psy11070 54 QQLWSCPIKNCNKRF 68 (148)
Q Consensus 54 ~k~~~C~~~~C~~~f 68 (148)
..|..|+ .||++|
T Consensus 24 rdPiVsP--ytG~s~ 36 (129)
T COG4530 24 RDPIVSP--YTGKSY 36 (129)
T ss_pred CCccccC--cccccc
Confidence 3445555 555555
No 116
>KOG2636|consensus
Probab=34.02 E-value=31 Score=26.88 Aligned_cols=28 Identities=21% Similarity=0.310 Sum_probs=21.8
Q ss_pred HHhcCCCCceeCCCCCCCcccccHHHHHHH
Q psy11070 109 KEHSNVSAPYMCDYKGVCMSLTLKISFVKA 138 (148)
Q Consensus 109 ~~h~~~~k~~~C~~c~~~~~f~~~~~l~~h 138 (148)
+.|.- ..-|.|.+||. +++..+-.+.+|
T Consensus 394 KLHGL-~~ey~CEICGN-y~Y~GrkaF~RH 421 (497)
T KOG2636|consen 394 KLHGL-DIEYNCEICGN-YVYKGRKAFDRH 421 (497)
T ss_pred hhcCC-CcccceeeccC-ccccCcHHHHHH
Confidence 55666 78899999995 467777777777
No 117
>KOG2593|consensus
Probab=33.86 E-value=37 Score=26.29 Aligned_cols=39 Identities=31% Similarity=0.761 Sum_probs=27.0
Q ss_pred CCCCceeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCcccc
Q psy11070 52 KKQQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGRK 97 (148)
Q Consensus 52 ~~~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~~ 97 (148)
+...-|.|+ .|.+.|.....++ ..--....|.|. .|+-.
T Consensus 124 t~~~~Y~Cp--~C~kkyt~Lea~~---L~~~~~~~F~C~--~C~ge 162 (436)
T KOG2593|consen 124 TNVAGYVCP--NCQKKYTSLEALQ---LLDNETGEFHCE--NCGGE 162 (436)
T ss_pred cccccccCC--ccccchhhhHHHH---hhcccCceEEEe--cCCCc
Confidence 344569999 9999998777653 333234468898 88853
No 118
>PF02176 zf-TRAF: TRAF-type zinc finger; PDB: 2EOD_A 2YUC_A 3HCU_A 3HCS_B 3HCT_A.
Probab=33.80 E-value=40 Score=17.71 Aligned_cols=21 Identities=29% Similarity=0.597 Sum_probs=11.3
Q ss_pred hHHHHHHHHHhcCCCCceeCCC
Q psy11070 101 MKQYSTHLKEHSNVSAPYMCDY 122 (148)
Q Consensus 101 ~~~l~~h~~~h~~~~k~~~C~~ 122 (148)
...|..|+..--. .++..|..
T Consensus 23 r~~l~~H~~~~C~-~~~v~C~~ 43 (60)
T PF02176_consen 23 RKELDDHLENECP-KRPVPCPY 43 (60)
T ss_dssp CCCHHHHHHTTST-TSEEE-SS
T ss_pred HHHHHHHHHccCC-CCcEECCC
Confidence 3356666664444 45666766
No 119
>KOG4124|consensus
Probab=33.17 E-value=16 Score=27.51 Aligned_cols=53 Identities=9% Similarity=-0.050 Sum_probs=38.4
Q ss_pred CCCceeCCCcccccccCChHHHHHH-----HHHhcCCCceecCcCCcccccCChHHHHHHH
Q psy11070 53 KQQLWSCPIKNCNKRFTTLSNLKMH-----MVRHGKPLTLCCEELGCGRKFQTMKQYSTHL 108 (148)
Q Consensus 53 ~~k~~~C~~~~C~~~f~~~~~l~~h-----~~~h~~~~~~~c~~~~C~~~f~~~~~l~~h~ 108 (148)
.|+++.|..+.|.+++........| +...+-.+||+|+ |++++..+..|+.|-
T Consensus 175 EE~~~S~~vp~~~~~~~~~Ns~~~~S~~~~~T~~t~~~p~k~~---~~~~~~T~~~l~~HS 232 (442)
T KOG4124|consen 175 EEYRVSVVVPAAAAAAAAANSSDMSSDEASSTAETTGTPKKMP---ESLVMDTSSPLSDHS 232 (442)
T ss_pred ccccccccCchhhhhhhccccccccccccccccccccCCccCc---ccccccccchhhhcc
Confidence 5788899888888877655444333 2333456889996 999999999988774
No 120
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=32.98 E-value=13 Score=30.81 Aligned_cols=14 Identities=21% Similarity=0.332 Sum_probs=9.8
Q ss_pred hcCCCCceeCCCCCC
Q psy11070 111 HSNVSAPYMCDYKGV 125 (148)
Q Consensus 111 h~~~~k~~~C~~c~~ 125 (148)
|.. ..|..|+.||-
T Consensus 470 ~~~-~~p~~Cp~Cgs 483 (730)
T COG1198 470 YQE-PIPQSCPECGS 483 (730)
T ss_pred CCC-CCCCCCCCCCC
Confidence 444 57788888885
No 121
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=32.31 E-value=7.7 Score=17.68 Aligned_cols=10 Identities=50% Similarity=1.305 Sum_probs=6.1
Q ss_pred eeCCCccccccc
Q psy11070 57 WSCPIKNCNKRF 68 (148)
Q Consensus 57 ~~C~~~~C~~~f 68 (148)
|.|. .|++.|
T Consensus 1 ~sCi--DC~~~F 10 (28)
T PF08790_consen 1 FSCI--DCSKDF 10 (28)
T ss_dssp EEET--TTTEEE
T ss_pred Ceee--cCCCCc
Confidence 3456 666666
No 122
>PF06397 Desulfoferrod_N: Desulfoferrodoxin, N-terminal domain; InterPro: IPR004462 This domain is found as essentially the full length of desulforedoxin, a 37-residue homodimeric non-haem iron protein. It is also found as the N-terminal domain of desulfoferrodoxin (rbo), a homodimeric non-haem iron protein with 2 Fe atoms per monomer in different oxidation states. This domain binds the ferric rather than the ferrous Fe of desulfoferrodoxin. Neelaredoxin, a monomeric blue non-haem iron protein, lacks this domain.; GO: 0005506 iron ion binding; PDB: 1DFX_A 1VZI_B 2JI2_D 1VZH_B 2JI3_C 2JI1_C 1VZG_A 1CFW_A 2LK5_B 1DHG_B ....
Probab=31.93 E-value=24 Score=17.09 Aligned_cols=10 Identities=30% Similarity=0.541 Sum_probs=5.8
Q ss_pred CceeCCCCCC
Q psy11070 116 APYMCDYKGV 125 (148)
Q Consensus 116 k~~~C~~c~~ 125 (148)
+-|+|..||.
T Consensus 5 ~~YkC~~CGn 14 (36)
T PF06397_consen 5 EFYKCEHCGN 14 (36)
T ss_dssp EEEE-TTT--
T ss_pred cEEEccCCCC
Confidence 4689999995
No 123
>KOG2785|consensus
Probab=29.37 E-value=73 Score=24.32 Aligned_cols=22 Identities=23% Similarity=0.632 Sum_probs=17.5
Q ss_pred eeCCCcccccccCChHHHHHHHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
|+|. -|...|.....-+.|+++
T Consensus 4 ftC~--tC~v~F~~ad~Qr~HyKS 25 (390)
T KOG2785|consen 4 FTCN--TCNVEFDDADEQRAHYKS 25 (390)
T ss_pred ceee--ceeeeeccHHHHHHHhhh
Confidence 7788 888888888777777765
No 124
>KOG3002|consensus
Probab=29.30 E-value=1.2e+02 Score=22.48 Aligned_cols=85 Identities=20% Similarity=0.343 Sum_probs=50.8
Q ss_pred CCceeCCCcccccccCChHHHHHHHHHhcCCCceecC--cCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccc-c
Q psy11070 54 QQLWSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCE--ELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSL-T 130 (148)
Q Consensus 54 ~k~~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~--~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f-~ 130 (148)
+....|+ .|...+.....|.+-..... ....|+ ..+|.+.|..-.. ..|.+.-.. +||.|+..|-...+ -
T Consensus 78 ~~~~~CP--~Cr~~~g~~R~~amEkV~e~--~~vpC~~~~~GC~~~~~Y~~~-~~HE~~C~f--~~~~CP~p~~~C~~~G 150 (299)
T KOG3002|consen 78 KVSNKCP--TCRLPIGNIRCRAMEKVAEA--VLVPCKNAKLGCTKSFPYGEK-SKHEKVCEF--RPCSCPVPGAECKYTG 150 (299)
T ss_pred hhcccCC--ccccccccHHHHHHHHHHHh--ceecccccccCCceeeccccc-ccccccccc--CCcCCCCCcccCCccC
Confidence 4567889 89988886655433323222 233443 2268888887766 456655443 78888887321123 3
Q ss_pred cHHHHHHHHHhhCCC
Q psy11070 131 LKISFVKAVANNSKY 145 (148)
Q Consensus 131 ~~~~l~~h~~~h~~~ 145 (148)
....|..|.+..++.
T Consensus 151 ~~~~l~~H~~~~hk~ 165 (299)
T KOG3002|consen 151 SYKDLYAHLNDTHKS 165 (299)
T ss_pred cHHHHHHHHHhhChh
Confidence 445788887765544
No 125
>KOG2482|consensus
Probab=29.00 E-value=36 Score=25.59 Aligned_cols=23 Identities=35% Similarity=0.583 Sum_probs=21.1
Q ss_pred ceeCCCcccccccCChHHHHHHHHH
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVR 80 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~ 80 (148)
.+.|- .|.+.|..+..|+.||+.
T Consensus 195 r~~CL--yCekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCL--YCEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheee--eeccccCCcHHHHHHHHh
Confidence 37899 999999999999999985
No 126
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=27.86 E-value=57 Score=27.30 Aligned_cols=15 Identities=27% Similarity=0.806 Sum_probs=10.6
Q ss_pred hcCCCceecCcCCcccc
Q psy11070 81 HGKPLTLCCEELGCGRK 97 (148)
Q Consensus 81 h~~~~~~~c~~~~C~~~ 97 (148)
+....|..|+ .||..
T Consensus 470 ~~~~~p~~Cp--~Cgs~ 484 (730)
T COG1198 470 YQEPIPQSCP--ECGSE 484 (730)
T ss_pred CCCCCCCCCC--CCCCC
Confidence 3445678899 89864
No 127
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=27.63 E-value=34 Score=15.78 Aligned_cols=9 Identities=22% Similarity=0.416 Sum_probs=4.5
Q ss_pred CceeCCCCC
Q psy11070 116 APYMCDYKG 124 (148)
Q Consensus 116 k~~~C~~c~ 124 (148)
..|.|+.|+
T Consensus 18 ~~~vCp~C~ 26 (30)
T PF08274_consen 18 ELLVCPECG 26 (30)
T ss_dssp SSEEETTTT
T ss_pred CEEeCCccc
Confidence 345555554
No 128
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=26.89 E-value=49 Score=17.26 Aligned_cols=7 Identities=29% Similarity=0.558 Sum_probs=2.3
Q ss_pred eCCCCCC
Q psy11070 119 MCDYKGV 125 (148)
Q Consensus 119 ~C~~c~~ 125 (148)
.|+.|+.
T Consensus 22 ~CPlC~r 28 (54)
T PF04423_consen 22 CCPLCGR 28 (54)
T ss_dssp E-TTT--
T ss_pred cCCCCCC
Confidence 5555554
No 129
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=26.70 E-value=40 Score=15.80 Aligned_cols=8 Identities=38% Similarity=0.713 Sum_probs=4.2
Q ss_pred ceeCCCCC
Q psy11070 117 PYMCDYKG 124 (148)
Q Consensus 117 ~~~C~~c~ 124 (148)
+..|..||
T Consensus 17 ~irC~~CG 24 (32)
T PF03604_consen 17 PIRCPECG 24 (32)
T ss_dssp TSSBSSSS
T ss_pred cEECCcCC
Confidence 34555555
No 130
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=26.56 E-value=85 Score=14.83 Aligned_cols=22 Identities=14% Similarity=0.114 Sum_probs=11.8
Q ss_pred ceeCCCCCCCcccccHHHHHHHHHh
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVAN 141 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~~ 141 (148)
-+.|+.|++ -...+.+-.|+..
T Consensus 4 ~~~C~nC~R---~v~a~RfA~HLek 25 (33)
T PF08209_consen 4 YVECPNCGR---PVAASRFAPHLEK 25 (33)
T ss_dssp EEE-TTTSS---EEEGGGHHHHHHH
T ss_pred eEECCCCcC---CcchhhhHHHHHH
Confidence 467888876 2334445556554
No 131
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=25.96 E-value=34 Score=16.37 Aligned_cols=12 Identities=25% Similarity=1.007 Sum_probs=8.1
Q ss_pred eeCCCcccccccCC
Q psy11070 57 WSCPIKNCNKRFTT 70 (148)
Q Consensus 57 ~~C~~~~C~~~f~~ 70 (148)
|.|. .||..|..
T Consensus 6 y~C~--~Cg~~fe~ 17 (41)
T smart00834 6 YRCE--DCGHTFEV 17 (41)
T ss_pred EEcC--CCCCEEEE
Confidence 6677 77776653
No 132
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=25.89 E-value=37 Score=15.02 Aligned_cols=11 Identities=36% Similarity=0.921 Sum_probs=8.1
Q ss_pred eeCCCcccccccC
Q psy11070 57 WSCPIKNCNKRFT 69 (148)
Q Consensus 57 ~~C~~~~C~~~f~ 69 (148)
-.|+ .||..|.
T Consensus 15 ~~Cp--~CG~~F~ 25 (26)
T PF10571_consen 15 KFCP--HCGYDFE 25 (26)
T ss_pred CcCC--CCCCCCc
Confidence 4578 8888774
No 133
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=25.73 E-value=9.2 Score=25.18 Aligned_cols=16 Identities=44% Similarity=0.908 Sum_probs=10.9
Q ss_pred eeCCCcccccccCChHHH
Q psy11070 57 WSCPIKNCNKRFTTLSNL 74 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l 74 (148)
+.|+ .||++|.....+
T Consensus 29 ~~c~--~c~~~f~~~e~~ 44 (154)
T PRK00464 29 RECL--ACGKRFTTFERV 44 (154)
T ss_pred eecc--ccCCcceEeEec
Confidence 7777 777777665444
No 134
>KOG2071|consensus
Probab=25.19 E-value=49 Score=26.73 Aligned_cols=26 Identities=15% Similarity=0.204 Sum_probs=0.0
Q ss_pred CCceeCCCCCCCcccccHHHHHHHHHhh
Q psy11070 115 SAPYMCDYKGVCMSLTLKISFVKAVANN 142 (148)
Q Consensus 115 ~k~~~C~~c~~~~~f~~~~~l~~h~~~h 142 (148)
.+|-.|..||. +|.......+|+..|
T Consensus 416 ~~pnqC~~CG~--R~~~~ee~sk~md~H 441 (579)
T KOG2071|consen 416 DSPNQCKSCGL--RFDDSEERSKHMDIH 441 (579)
T ss_pred CCcchhccccc--ccccchhhhhHhhhh
No 135
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=24.86 E-value=69 Score=25.26 Aligned_cols=26 Identities=35% Similarity=0.889 Sum_probs=22.6
Q ss_pred ceeCCCcccccccCChHHHHHHHHH-hcC
Q psy11070 56 LWSCPIKNCNKRFTTLSNLKMHMVR-HGK 83 (148)
Q Consensus 56 ~~~C~~~~C~~~f~~~~~l~~h~~~-h~~ 83 (148)
-+.|+ .|.+.|.....+..|+.. |.+
T Consensus 57 FWiCp--~CskkF~d~~~~~~H~~~eH~~ 83 (466)
T PF04780_consen 57 FWICP--RCSKKFSDAESCLSHMEQEHPA 83 (466)
T ss_pred EeeCC--cccceeCCHHHHHHHHHHhhhh
Confidence 47899 999999999999999985 654
No 136
>KOG4727|consensus
Probab=23.75 E-value=49 Score=22.24 Aligned_cols=22 Identities=23% Similarity=0.480 Sum_probs=16.7
Q ss_pred ceeCCCCCCCcccccHHHHHHHHH
Q psy11070 117 PYMCDYKGVCMSLTLKISFVKAVA 140 (148)
Q Consensus 117 ~~~C~~c~~~~~f~~~~~l~~h~~ 140 (148)
.|-|.+|++ .|....++-.|+.
T Consensus 75 GyyCdVCdc--vvKDSinflDHiN 96 (193)
T KOG4727|consen 75 GYYCDVCDC--VVKDSINFLDHIN 96 (193)
T ss_pred ceeeeecce--eehhhHHHHHHhc
Confidence 489999998 5877777766653
No 137
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=23.13 E-value=27 Score=19.85 Aligned_cols=6 Identities=33% Similarity=1.043 Sum_probs=2.5
Q ss_pred Cccccc
Q psy11070 93 GCGRKF 98 (148)
Q Consensus 93 ~C~~~f 98 (148)
.||..|
T Consensus 34 eCg~tF 39 (72)
T PRK09678 34 NCSATF 39 (72)
T ss_pred CCCCEE
Confidence 344444
No 138
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=22.59 E-value=42 Score=20.01 Aligned_cols=14 Identities=21% Similarity=0.546 Sum_probs=8.2
Q ss_pred CceeCCCcccccccCC
Q psy11070 55 QLWSCPIKNCNKRFTT 70 (148)
Q Consensus 55 k~~~C~~~~C~~~f~~ 70 (148)
+|-.|. .||..|..
T Consensus 57 ~Pa~Ck--kCGfef~~ 70 (97)
T COG3357 57 RPARCK--KCGFEFRD 70 (97)
T ss_pred cChhhc--ccCccccc
Confidence 345566 66666654
No 139
>KOG0801|consensus
Probab=22.27 E-value=19 Score=23.95 Aligned_cols=20 Identities=20% Similarity=0.368 Sum_probs=15.0
Q ss_pred HHHHHHHHHhcCCCCceeCCCCCC
Q psy11070 102 KQYSTHLKEHSNVSAPYMCDYKGV 125 (148)
Q Consensus 102 ~~l~~h~~~h~~~~k~~~C~~c~~ 125 (148)
..|..|+..|.| ++|++|.|
T Consensus 127 ~slP~hi~~~~g----~KCPvC~K 146 (205)
T KOG0801|consen 127 DSLPVHIMDHSG----MKCPVCHK 146 (205)
T ss_pred hccceeeeccCC----ccCCcccc
Confidence 456667767777 78999997
No 140
>KOG2482|consensus
Probab=21.83 E-value=58 Score=24.56 Aligned_cols=22 Identities=23% Similarity=0.561 Sum_probs=20.3
Q ss_pred ceecCcCCcccccCChHHHHHHHH
Q psy11070 86 TLCCEELGCGRKFQTMKQYSTHLK 109 (148)
Q Consensus 86 ~~~c~~~~C~~~f~~~~~l~~h~~ 109 (148)
.+.|. .|.+.|..+..|+.|||
T Consensus 195 r~~CL--yCekifrdkntLkeHMr 216 (423)
T KOG2482|consen 195 RLRCL--YCEKIFRDKNTLKEHMR 216 (423)
T ss_pred hheee--eeccccCCcHHHHHHHH
Confidence 37799 99999999999999997
No 141
>TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF. A previously described regulatory effect of HypF mutatation is attributable to loss of activity of a regulatory hydrogenase. A zinc finger-like region CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported the regulatory hypothesis. However, more recent work (PUBMED:11375153) shows the direct effect is on the activity of expressed hydrogenases with nickel/iron centers, rather than on expression.
Probab=21.57 E-value=6.3 Score=32.55 Aligned_cols=60 Identities=22% Similarity=0.511 Sum_probs=38.3
Q ss_pred eCCCcccccccCChHHHHHHHHHhcCCCc-eecCcCCcccccCChHHHHHHHHHhcCCCCceeCCCCCCCccc
Q psy11070 58 SCPIKNCNKRFTTLSNLKMHMVRHGKPLT-LCCEELGCGRKFQTMKQYSTHLKEHSNVSAPYMCDYKGVCMSL 129 (148)
Q Consensus 58 ~C~~~~C~~~f~~~~~l~~h~~~h~~~~~-~~c~~~~C~~~f~~~~~l~~h~~~h~~~~k~~~C~~c~~~~~f 129 (148)
.|. .||.+|.....|..- +.++.-.. -.|+ .|.+.|..+.+. |.| -.+-.|..||-...+
T Consensus 92 nCt--~CGPr~~i~~~lpyd-r~~t~m~~f~~C~--~C~~ey~~p~~r----r~h---~~~~~C~~Cgp~l~l 152 (711)
T TIGR00143 92 SCT--HCGPRFTIIEALPYD-RENTSMADFPLCP--DCAKEYKDPLDR----RFH---AQPIACPRCGPQLNF 152 (711)
T ss_pred ccc--CCCCCeEEeecCCCC-CCCcCCCCCcCCH--HHHHHhcCCccc----cCC---CCCccCCCCCcEEEE
Confidence 488 999999877665422 22333322 3588 888888777543 333 356789999964444
No 142
>COG1773 Rubredoxin [Energy production and conversion]
Probab=21.21 E-value=51 Score=17.67 Aligned_cols=10 Identities=20% Similarity=0.491 Sum_probs=6.8
Q ss_pred CceeCCCCCC
Q psy11070 116 APYMCDYKGV 125 (148)
Q Consensus 116 k~~~C~~c~~ 125 (148)
+.|+|..||.
T Consensus 2 ~~~~C~~CG~ 11 (55)
T COG1773 2 KRWRCSVCGY 11 (55)
T ss_pred CceEecCCce
Confidence 3577888874
No 143
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=21.10 E-value=60 Score=23.27 Aligned_cols=12 Identities=33% Similarity=1.157 Sum_probs=8.6
Q ss_pred eeCCCcccccccCC
Q psy11070 57 WSCPIKNCNKRFTT 70 (148)
Q Consensus 57 ~~C~~~~C~~~f~~ 70 (148)
|.|+ .|+..|.-
T Consensus 156 f~C~--~C~h~F~G 167 (278)
T PF15135_consen 156 FHCP--KCRHNFRG 167 (278)
T ss_pred eecc--cccccchh
Confidence 7777 77777753
No 144
>TIGR00319 desulf_FeS4 desulfoferrodoxin FeS4 iron-binding domain. Neelaredoxin, a monomeric blue non-heme iron protein, lacks this domain.
Probab=21.00 E-value=57 Score=15.13 Aligned_cols=10 Identities=30% Similarity=0.637 Sum_probs=8.2
Q ss_pred CceeCCCCCC
Q psy11070 116 APYMCDYKGV 125 (148)
Q Consensus 116 k~~~C~~c~~ 125 (148)
+-|+|..||.
T Consensus 6 ~~ykC~~Cgn 15 (34)
T TIGR00319 6 QVYKCEVCGN 15 (34)
T ss_pred cEEEcCCCCc
Confidence 4689999985
No 145
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=20.96 E-value=47 Score=16.41 Aligned_cols=29 Identities=24% Similarity=0.744 Sum_probs=16.0
Q ss_pred eeCCCcccccccCChHHHHHHHHHhcCCCceecCcCCccc
Q psy11070 57 WSCPIKNCNKRFTTLSNLKMHMVRHGKPLTLCCEELGCGR 96 (148)
Q Consensus 57 ~~C~~~~C~~~f~~~~~l~~h~~~h~~~~~~~c~~~~C~~ 96 (148)
|.|. .||..|.....+ .. ..+-.|+ .||.
T Consensus 6 y~C~--~Cg~~fe~~~~~------~~-~~~~~CP--~Cg~ 34 (42)
T PF09723_consen 6 YRCE--ECGHEFEVLQSI------SE-DDPVPCP--ECGS 34 (42)
T ss_pred EEeC--CCCCEEEEEEEc------CC-CCCCcCC--CCCC
Confidence 6677 777776543221 11 3345677 6765
No 146
>PF03145 Sina: Seven in absentia protein family; InterPro: IPR018121 The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non-neuronal cell type. The Sina protein contains an N-terminal RING finger domain C3HC4-type. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 [] also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation [, ]. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is a C3HC4-type, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana.; GO: 0006511 ubiquitin-dependent protein catabolic process, 0007275 multicellular organismal development, 0005634 nucleus; PDB: 2A25_A 1K2F_B 2AN6_C.
Probab=20.93 E-value=85 Score=21.29 Aligned_cols=7 Identities=29% Similarity=0.586 Sum_probs=3.3
Q ss_pred HHHHHHH
Q psy11070 103 QYSTHLK 109 (148)
Q Consensus 103 ~l~~h~~ 109 (148)
.|..|..
T Consensus 62 ~l~~Hl~ 68 (198)
T PF03145_consen 62 ELLDHLR 68 (198)
T ss_dssp CHHHHHH
T ss_pred HHHHHHH
Confidence 4445544
No 147
>cd00974 DSRD Desulforedoxin (DSRD) domain; a small non-heme iron domain present in the desulforedoxin (rubredoxin oxidoreductase) and desulfoferrodoxin proteins of some archeael and bacterial methanogens and sulfate/sulfur reducers. Desulforedoxin is a small, single-domain homodimeric protein; each subunit contains an iron atom bound to four cysteinyl sulfur atoms, Fe(S-Cys)4, in a distorted tetrahedral coordination. Its metal center is similar to that found in rubredoxin type proteins. Desulforedoxin is regarded as a potential redox partner for rubredoxin. Desulfoferrodoxin forms a homodimeric protein, with each protomer comprised of two domains, the N-terminal DSRD domain and C-terminal superoxide reductase-like (SORL) domain. Each domain has a distinct iron center: the DSRD iron center I, Fe(S-Cys)4; and the SORL iron center II, Fe[His4Cys(Glu)].
Probab=20.87 E-value=58 Score=15.15 Aligned_cols=10 Identities=30% Similarity=0.657 Sum_probs=8.1
Q ss_pred CceeCCCCCC
Q psy11070 116 APYMCDYKGV 125 (148)
Q Consensus 116 k~~~C~~c~~ 125 (148)
+-|+|..||.
T Consensus 3 ~~ykC~~CGn 12 (34)
T cd00974 3 EVYKCEICGN 12 (34)
T ss_pred cEEEcCCCCc
Confidence 4689999995
No 148
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=20.06 E-value=42 Score=21.90 Aligned_cols=13 Identities=23% Similarity=0.181 Sum_probs=7.6
Q ss_pred eeCCCCCCCccccc
Q psy11070 118 YMCDYKGVCMSLTL 131 (148)
Q Consensus 118 ~~C~~c~~~~~f~~ 131 (148)
-.|+.||.. .|.+
T Consensus 131 ~~Cp~C~~~-~F~R 143 (146)
T PF07295_consen 131 PPCPKCGHT-EFTR 143 (146)
T ss_pred CCCCCCCCC-eeee
Confidence 367777763 4543
Done!