Query psy8702
Match_columns 194
No_of_seqs 190 out of 2086
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 17:04:57 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8702.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8702hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.9 2.9E-24 6.2E-29 169.1 3.3 125 16-167 126-254 (279)
2 KOG2462|consensus 99.9 7.4E-23 1.6E-27 161.2 3.2 110 18-157 159-268 (279)
3 KOG1074|consensus 99.6 6.9E-16 1.5E-20 136.6 5.7 57 16-80 601-657 (958)
4 KOG1074|consensus 99.6 5.4E-16 1.2E-20 137.2 3.0 56 17-80 350-405 (958)
5 KOG3576|consensus 99.5 6E-16 1.3E-20 117.3 -1.3 116 15-155 112-239 (267)
6 KOG3623|consensus 99.5 3.6E-14 7.7E-19 124.3 3.7 122 17-153 207-330 (1007)
7 KOG3623|consensus 99.4 9.4E-14 2E-18 121.7 2.9 83 15-121 889-971 (1007)
8 KOG3608|consensus 99.3 1.2E-13 2.5E-18 112.5 -1.3 128 24-161 183-324 (467)
9 KOG3608|consensus 99.3 2.8E-13 6E-18 110.4 -0.6 142 14-175 231-395 (467)
10 KOG3576|consensus 99.2 1.3E-12 2.9E-17 99.2 -2.0 84 53-153 114-198 (267)
11 PLN03086 PRLI-interacting fact 98.8 2.9E-09 6.4E-14 93.4 4.5 91 19-140 452-552 (567)
12 PHA02768 hypothetical protein; 98.8 3.8E-09 8.3E-14 64.3 1.9 43 21-73 6-48 (55)
13 PHA00733 hypothetical protein 98.6 2E-08 4.4E-13 72.6 3.2 89 16-125 36-124 (128)
14 PHA00733 hypothetical protein 98.5 1.9E-08 4.1E-13 72.8 0.2 79 53-153 37-120 (128)
15 PF13465 zf-H2C2_2: Zinc-finge 98.4 8.6E-08 1.9E-12 49.8 1.5 26 43-68 1-26 (26)
16 PHA02768 hypothetical protein; 98.4 9.5E-08 2.1E-12 58.2 1.8 41 102-148 7-47 (55)
17 PF13465 zf-H2C2_2: Zinc-finge 98.4 2.9E-08 6.4E-13 51.6 -0.9 25 115-140 1-25 (26)
18 PLN03086 PRLI-interacting fact 98.4 3.9E-07 8.4E-12 80.3 5.1 89 54-165 451-550 (567)
19 KOG3993|consensus 98.0 1E-06 2.2E-11 73.9 -0.8 53 20-80 267-319 (500)
20 PHA00616 hypothetical protein 97.9 4.1E-06 8.8E-11 48.7 1.3 32 21-60 2-33 (44)
21 PHA00616 hypothetical protein 97.9 3.9E-06 8.3E-11 48.7 0.9 34 56-90 1-34 (44)
22 KOG3993|consensus 97.7 1.7E-05 3.7E-10 66.7 2.1 124 18-153 293-479 (500)
23 PHA00732 hypothetical protein 97.6 3.3E-05 7.1E-10 51.1 2.3 46 21-80 2-48 (79)
24 PHA00732 hypothetical protein 97.5 0.00013 2.8E-09 48.2 3.2 47 56-124 1-48 (79)
25 PF00096 zf-C2H2: Zinc finger, 97.2 0.00012 2.7E-09 36.4 0.8 20 58-77 2-21 (23)
26 PF00096 zf-C2H2: Zinc finger, 97.2 0.00013 2.8E-09 36.3 0.7 23 21-51 1-23 (23)
27 PF05605 zf-Di19: Drought indu 97.1 0.00082 1.8E-08 41.0 3.5 51 19-80 1-53 (54)
28 COG5189 SFP1 Putative transcri 96.8 0.00072 1.6E-08 55.3 2.1 55 82-153 346-419 (423)
29 PF13894 zf-C2H2_4: C2H2-type 96.7 0.00092 2E-08 33.1 1.3 23 57-79 1-23 (24)
30 PF13912 zf-C2H2_6: C2H2-type 96.6 0.00092 2E-08 34.5 0.9 25 56-80 1-25 (27)
31 PF13894 zf-C2H2_4: C2H2-type 96.5 0.0012 2.7E-08 32.6 1.0 23 21-51 1-23 (24)
32 PF05605 zf-Di19: Drought indu 96.4 0.0044 9.5E-08 37.7 3.4 51 56-124 2-53 (54)
33 PF12756 zf-C2H2_2: C2H2 type 96.0 0.0039 8.5E-08 42.2 1.8 57 22-79 1-73 (100)
34 PF13912 zf-C2H2_6: C2H2-type 95.9 0.0036 7.9E-08 32.2 0.9 24 21-52 2-25 (27)
35 COG5048 FOG: Zn-finger [Genera 95.9 0.0074 1.6E-07 51.0 3.2 139 19-169 288-455 (467)
36 smart00355 ZnF_C2H2 zinc finge 95.8 0.0068 1.5E-07 30.2 1.7 24 57-80 1-24 (26)
37 smart00355 ZnF_C2H2 zinc finge 95.6 0.0084 1.8E-07 29.8 1.6 24 21-52 1-24 (26)
38 PF09237 GAGA: GAGA factor; I 95.5 0.01 2.2E-07 35.4 1.9 33 52-85 20-52 (54)
39 PRK04860 hypothetical protein; 95.4 0.011 2.4E-07 44.4 2.2 39 19-69 118-156 (160)
40 COG5189 SFP1 Putative transcri 95.2 0.0084 1.8E-07 49.2 1.3 52 17-76 346-418 (423)
41 PF12756 zf-C2H2_2: C2H2 type 94.9 0.0062 1.3E-07 41.2 -0.3 65 58-123 1-73 (100)
42 PF09237 GAGA: GAGA factor; I 94.8 0.026 5.5E-07 33.7 2.1 33 16-56 20-52 (54)
43 PF12874 zf-met: Zinc-finger o 94.4 0.017 3.6E-07 29.0 0.6 21 57-77 1-21 (25)
44 PF12874 zf-met: Zinc-finger o 94.2 0.017 3.6E-07 29.0 0.4 22 21-50 1-22 (25)
45 PF12171 zf-C2H2_jaz: Zinc-fin 93.7 0.012 2.6E-07 30.4 -0.7 19 58-76 3-21 (27)
46 PF12171 zf-C2H2_jaz: Zinc-fin 93.1 0.041 8.9E-07 28.3 0.8 21 20-48 1-21 (27)
47 PRK04860 hypothetical protein; 92.3 0.086 1.9E-06 39.6 1.8 32 56-92 119-150 (160)
48 KOG2893|consensus 90.9 0.1 2.2E-06 41.3 1.0 53 16-80 6-59 (341)
49 PF13909 zf-H2C2_5: C2H2-type 90.5 0.12 2.6E-06 25.6 0.7 22 57-79 1-22 (24)
50 PF09538 FYDLN_acid: Protein o 89.4 0.29 6.4E-06 34.2 2.2 34 17-69 6-39 (108)
51 KOG2231|consensus 89.1 0.4 8.7E-06 43.6 3.4 41 108-152 220-263 (669)
52 PF13913 zf-C2HC_2: zinc-finge 88.6 0.34 7.4E-06 24.5 1.6 18 58-76 4-21 (25)
53 KOG2893|consensus 87.6 0.24 5.2E-06 39.3 0.8 45 102-151 12-58 (341)
54 smart00451 ZnF_U1 U1-like zinc 86.5 0.49 1.1E-05 25.5 1.5 22 56-77 3-24 (35)
55 KOG2231|consensus 82.7 0.69 1.5E-05 42.2 1.5 56 100-160 182-240 (669)
56 COG5048 FOG: Zn-finger [Genera 82.5 0.76 1.6E-05 38.7 1.7 70 55-138 288-362 (467)
57 TIGR02300 FYDLN_acid conserved 82.4 1.1 2.4E-05 32.1 2.2 38 17-73 6-43 (129)
58 cd00350 rubredoxin_like Rubred 82.3 0.89 1.9E-05 24.5 1.3 13 21-33 2-14 (33)
59 PF09986 DUF2225: Uncharacteri 78.9 0.71 1.5E-05 36.3 0.3 21 55-75 4-24 (214)
60 KOG2186|consensus 77.2 1.6 3.6E-05 34.9 1.8 46 21-77 4-49 (276)
61 PF10571 UPF0547: Uncharacteri 73.8 3.1 6.7E-05 21.2 1.8 10 101-110 15-24 (26)
62 KOG1146|consensus 72.3 1.6 3.5E-05 42.6 0.8 98 23-152 439-538 (1406)
63 KOG2482|consensus 68.0 3.8 8.2E-05 34.4 2.0 66 57-122 280-356 (423)
64 PRK00464 nrdR transcriptional 66.7 1.6 3.6E-05 32.5 -0.3 19 55-73 27-45 (154)
65 TIGR02098 MJ0042_CXXC MJ0042 f 65.4 4.9 0.00011 22.1 1.6 34 21-67 3-36 (38)
66 PF06524 NOA36: NOA36 protein; 64.6 8.7 0.00019 31.1 3.4 95 53-157 139-234 (314)
67 PF09723 Zn-ribbon_8: Zinc rib 63.3 3.2 7E-05 23.6 0.6 13 21-33 6-18 (42)
68 cd00729 rubredoxin_SM Rubredox 62.7 5.1 0.00011 21.7 1.3 12 21-32 3-14 (34)
69 PF13719 zinc_ribbon_5: zinc-r 61.8 7.2 0.00016 21.5 1.8 33 22-67 4-36 (37)
70 PF13717 zinc_ribbon_4: zinc-r 61.4 7.3 0.00016 21.4 1.8 32 22-66 4-35 (36)
71 PRK00398 rpoP DNA-directed RNA 60.1 6.6 0.00014 22.7 1.5 14 20-33 3-16 (46)
72 COG1997 RPL43A Ribosomal prote 59.4 6.5 0.00014 26.3 1.5 14 55-68 52-65 (89)
73 TIGR02605 CxxC_CxxC_SSSS putat 59.3 3.8 8.2E-05 24.2 0.4 13 21-33 6-18 (52)
74 PRK06266 transcription initiat 58.1 5.2 0.00011 30.5 1.1 17 55-71 116-132 (178)
75 COG1592 Rubrerythrin [Energy p 57.5 7.4 0.00016 29.4 1.8 24 55-92 133-156 (166)
76 COG5236 Uncharacterized conser 57.3 13 0.00027 31.5 3.2 96 4-122 203-303 (493)
77 smart00834 CxxC_CXXC_SSSS Puta 56.7 6.5 0.00014 21.7 1.1 13 21-33 6-18 (41)
78 TIGR00622 ssl1 transcription f 56.3 28 0.00061 24.5 4.4 77 55-139 14-91 (112)
79 COG2888 Predicted Zn-ribbon RN 55.9 8.9 0.00019 23.7 1.6 9 84-92 49-57 (61)
80 PF15135 UPF0515: Uncharacteri 55.7 10 0.00022 30.5 2.3 11 55-65 111-121 (278)
81 KOG1146|consensus 54.7 6.9 0.00015 38.5 1.5 51 19-77 464-539 (1406)
82 smart00614 ZnF_BED BED zinc fi 53.7 12 0.00027 21.9 2.0 29 20-51 18-47 (50)
83 smart00531 TFIIE Transcription 53.6 10 0.00022 27.9 1.9 37 52-93 95-131 (147)
84 PF05443 ROS_MUCR: ROS/MUCR tr 53.0 9.7 0.00021 27.6 1.7 29 17-56 69-97 (132)
85 COG4049 Uncharacterized protei 47.8 7.1 0.00015 23.8 0.3 23 54-76 15-37 (65)
86 COG4530 Uncharacterized protei 47.3 12 0.00025 26.2 1.3 32 18-68 7-38 (129)
87 PF03604 DNA_RNApol_7kD: DNA d 46.5 16 0.00035 19.6 1.5 13 21-33 1-13 (32)
88 COG1571 Predicted DNA-binding 45.7 15 0.00032 31.9 2.0 37 16-71 346-382 (421)
89 PRK09678 DNA-binding transcrip 45.2 8.7 0.00019 24.8 0.4 42 21-72 2-45 (72)
90 smart00659 RPOLCX RNA polymera 43.1 17 0.00036 21.0 1.3 12 21-32 3-14 (44)
91 PF13248 zf-ribbon_3: zinc-rib 42.6 19 0.00042 18.0 1.4 7 101-107 17-23 (26)
92 PRK14890 putative Zn-ribbon RN 41.9 19 0.00041 22.2 1.5 9 57-65 26-34 (59)
93 smart00734 ZnF_Rad18 Rad18-lik 41.9 22 0.00047 18.0 1.5 20 57-77 2-21 (26)
94 KOG4167|consensus 40.4 11 0.00024 34.9 0.4 28 18-53 790-817 (907)
95 TIGR00686 phnA alkylphosphonat 40.3 19 0.00041 25.1 1.5 30 21-69 3-32 (109)
96 TIGR00373 conserved hypothetic 39.8 28 0.00062 25.9 2.5 34 49-92 102-135 (158)
97 KOG2186|consensus 39.8 12 0.00026 30.1 0.6 56 56-130 3-58 (276)
98 PTZ00255 60S ribosomal protein 38.4 15 0.00033 24.7 0.8 14 55-68 53-66 (90)
99 PHA00626 hypothetical protein 37.7 16 0.00034 22.3 0.7 16 54-69 21-36 (59)
100 PF07754 DUF1610: Domain of un 37.6 11 0.00025 18.8 0.1 10 127-137 15-24 (24)
101 PF02892 zf-BED: BED zinc fing 37.4 21 0.00046 20.1 1.2 12 56-67 16-27 (45)
102 KOG2593|consensus 36.9 23 0.00049 30.8 1.8 42 47-92 119-160 (436)
103 COG3677 Transposase and inacti 36.7 18 0.00038 26.1 1.0 16 54-69 51-66 (129)
104 COG4896 Uncharacterized protei 34.7 17 0.00037 22.6 0.5 30 108-138 11-40 (68)
105 PRK10220 hypothetical protein; 34.4 29 0.00063 24.2 1.7 31 21-70 4-34 (111)
106 PF04959 ARS2: Arsenite-resist 34.1 19 0.00042 28.3 0.9 18 16-33 73-90 (214)
107 COG5236 Uncharacterized conser 33.2 13 0.00028 31.4 -0.2 58 58-132 222-279 (493)
108 KOG3583|consensus 33.1 23 0.00049 28.0 1.1 24 171-194 254-277 (279)
109 TIGR00280 L37a ribosomal prote 32.8 18 0.00039 24.4 0.4 12 56-67 53-64 (91)
110 PF01780 Ribosomal_L37ae: Ribo 32.7 12 0.00026 25.3 -0.4 12 56-67 53-64 (90)
111 PRK03564 formate dehydrogenase 32.2 26 0.00056 29.3 1.3 49 55-108 186-234 (309)
112 COG1996 RPC10 DNA-directed RNA 31.8 25 0.00054 20.9 0.9 11 20-30 6-16 (49)
113 PF08274 PhnA_Zn_Ribbon: PhnA 30.9 19 0.00042 19.0 0.3 12 21-32 3-14 (30)
114 PF14446 Prok-RING_1: Prokaryo 30.5 39 0.00084 20.5 1.6 14 20-33 5-18 (54)
115 PF13451 zf-trcl: Probable zin 29.8 43 0.00093 19.9 1.6 17 55-71 3-19 (49)
116 PF05191 ADK_lid: Adenylate ki 29.4 36 0.00079 18.6 1.2 12 22-33 3-14 (36)
117 PRK14559 putative protein seri 29.2 83 0.0018 29.1 4.2 8 130-138 43-50 (645)
118 PF01286 XPA_N: XPA protein N- 29.1 25 0.00054 19.1 0.5 14 20-33 3-16 (34)
119 KOG2593|consensus 29.0 39 0.00084 29.4 1.9 39 16-65 124-162 (436)
120 COG1198 PriA Primosomal protei 28.3 26 0.00056 32.8 0.8 9 23-31 438-446 (730)
121 PF01363 FYVE: FYVE zinc finge 27.9 31 0.00068 21.4 0.9 16 17-32 6-21 (69)
122 COG4957 Predicted transcriptio 27.0 39 0.00084 24.6 1.3 25 21-56 77-101 (148)
123 PRK03976 rpl37ae 50S ribosomal 26.9 25 0.00054 23.7 0.4 12 56-67 54-65 (90)
124 COG1655 Uncharacterized protei 26.3 16 0.00034 29.1 -0.8 30 54-84 17-46 (267)
125 PF06906 DUF1272: Protein of u 25.9 31 0.00067 21.1 0.6 27 85-111 26-52 (57)
126 COG3357 Predicted transcriptio 25.7 46 0.001 22.5 1.4 28 54-92 56-83 (97)
127 PF15269 zf-C2H2_7: Zinc-finge 25.5 1.1E+02 0.0024 17.8 2.8 24 18-49 18-41 (54)
128 PF13878 zf-C2H2_3: zinc-finge 25.3 76 0.0017 17.8 2.1 13 21-33 14-26 (41)
129 KOG4124|consensus 25.2 14 0.0003 31.3 -1.4 58 18-75 347-417 (442)
130 PF04959 ARS2: Arsenite-resist 24.6 37 0.00081 26.7 1.0 28 53-80 74-101 (214)
131 TIGR00244 transcriptional regu 24.6 44 0.00096 24.7 1.3 11 129-140 29-39 (147)
132 KOG1813|consensus 24.4 48 0.001 27.5 1.6 45 55-109 240-284 (313)
133 PF12760 Zn_Tnp_IS1595: Transp 24.3 36 0.00077 19.5 0.6 9 55-63 36-44 (46)
134 PF13824 zf-Mss51: Zinc-finger 24.1 64 0.0014 19.6 1.7 14 124-138 10-23 (55)
135 PF12013 DUF3505: Protein of u 23.3 53 0.0011 22.5 1.5 24 57-80 81-108 (109)
136 COG1327 Predicted transcriptio 23.1 52 0.0011 24.4 1.4 11 129-140 29-39 (156)
137 PRK12496 hypothetical protein; 23.0 46 0.001 24.9 1.2 14 19-32 126-139 (164)
138 PF12773 DZR: Double zinc ribb 22.9 92 0.002 17.8 2.3 27 100-138 12-38 (50)
139 COG1198 PriA Primosomal protei 22.1 49 0.0011 31.0 1.4 23 115-138 462-484 (730)
140 PRK00432 30S ribosomal protein 20.8 68 0.0015 19.0 1.4 12 55-66 36-47 (50)
No 1
>KOG2462|consensus
Probab=99.89 E-value=2.9e-24 Score=169.14 Aligned_cols=125 Identities=22% Similarity=0.271 Sum_probs=82.3
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCC---CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTD---HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~---~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
.....+.|++|||.|.+. ++|.+|.++|-. .+.+.|++|+|.|.....|.+|+++|. .+.+|.+|
T Consensus 126 ~~~~r~~c~eCgk~ysT~--------snLsrHkQ~H~~~~s~ka~~C~~C~K~YvSmpALkMHirTH~---l~c~C~iC- 193 (279)
T KOG2462|consen 126 AKHPRYKCPECGKSYSTS--------SNLSRHKQTHRSLDSKKAFSCKYCGKVYVSMPALKMHIRTHT---LPCECGIC- 193 (279)
T ss_pred ccCCceeccccccccccc--------cccchhhcccccccccccccCCCCCceeeehHHHhhHhhccC---CCcccccc-
Confidence 355678999999999999 666666666533 445666666666666666666666665 55555544
Q ss_pred cccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCcc-CCCCCccccccCCCCcccccCCCCCcc
Q psy8702 93 SIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSR-SRTPSPPLRSHSPISIFRRHQSPSPIG 167 (194)
Q Consensus 93 ~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~-~~~~~~hl~~H~~~~~~~~~~~~~~~~ 167 (194)
||.|.+.=.|.-|+++|+|||||.|+. |+|.|. ..+|..||++|++++.|.+..+...|.
T Consensus 194 --------------GKaFSRPWLLQGHiRTHTGEKPF~C~h-C~kAFADRSNLRAHmQTHS~~K~~qC~~C~KsFs 254 (279)
T KOG2462|consen 194 --------------GKAFSRPWLLQGHIRTHTGEKPFSCPH-CGKAFADRSNLRAHMQTHSDVKKHQCPRCGKSFA 254 (279)
T ss_pred --------------cccccchHHhhcccccccCCCCccCCc-ccchhcchHHHHHHHHhhcCCccccCcchhhHHH
Confidence 466666666666666666666666664 666665 666666666666666666666555554
No 2
>KOG2462|consensus
Probab=99.86 E-value=7.4e-23 Score=161.20 Aligned_cols=110 Identities=25% Similarity=0.301 Sum_probs=100.8
Q ss_pred CCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccccccc
Q psy8702 18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSL 97 (194)
Q Consensus 18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~ 97 (194)
.....|++|||.|..- ..|..|+++|+ -+++|.+|||.|....-|+.|+|+|+ |||||.|..|
T Consensus 159 ~ka~~C~~C~K~YvSm--------pALkMHirTH~--l~c~C~iCGKaFSRPWLLQGHiRTHT-GEKPF~C~hC------ 221 (279)
T KOG2462|consen 159 KKAFSCKYCGKVYVSM--------PALKMHIRTHT--LPCECGICGKAFSRPWLLQGHIRTHT-GEKPFSCPHC------ 221 (279)
T ss_pred cccccCCCCCceeeeh--------HHHhhHhhccC--CCcccccccccccchHHhhccccccc-CCCCccCCcc------
Confidence 4557799999999999 99999999997 47999999999999999999999999 9999999887
Q ss_pred ccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCCCCcc
Q psy8702 98 KRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSPISIF 157 (194)
Q Consensus 98 ~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~~~~~ 157 (194)
+|.|...++|+.||+||.+.|+|+|.. |||. |.+.+.|.+|....-.
T Consensus 222 ---------~kAFADRSNLRAHmQTHS~~K~~qC~~-C~Ks---Fsl~SyLnKH~ES~C~ 268 (279)
T KOG2462|consen 222 ---------GKAFADRSNLRAHMQTHSDVKKHQCPR-CGKS---FALKSYLNKHSESACL 268 (279)
T ss_pred ---------cchhcchHHHHHHHHhhcCCccccCcc-hhhH---HHHHHHHHHhhhhccc
Confidence 589999999999999999999999996 9999 8888889998765444
No 3
>KOG1074|consensus
Probab=99.61 E-value=6.9e-16 Score=136.59 Aligned_cols=57 Identities=26% Similarity=0.542 Sum_probs=51.8
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
+....-+|-+|-++...+ +.|+.|.++|+||+||+|.+||+.|.++.+|+.|+.+|.
T Consensus 601 ~~TdPNqCiiC~rVlSC~--------saLqmHyrtHtGERPFkCKiCgRAFtTkGNLkaH~~vHk 657 (958)
T KOG1074|consen 601 KRTDPNQCIICLRVLSCP--------SALQMHYRTHTGERPFKCKICGRAFTTKGNLKAHMSVHK 657 (958)
T ss_pred ccCCccceeeeeecccch--------hhhhhhhhcccCcCccccccccchhccccchhhcccccc
Confidence 344455799999999999 999999999999999999999999999999999998776
No 4
>KOG1074|consensus
Probab=99.58 E-value=5.4e-16 Score=137.25 Aligned_cols=56 Identities=30% Similarity=0.560 Sum_probs=50.9
Q ss_pred CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
...+..|..|+|.|... +.|+.|.+.|++++||+|.+||..|.++.+|+.|...|.
T Consensus 350 ~~~khkCr~CakvfgS~--------SaLqiHlRSHTGERPfqCnvCG~~FSTkGNLKvH~~rH~ 405 (958)
T KOG1074|consen 350 PFFKHKCRFCAKVFGSD--------SALQIHLRSHTGERPFQCNVCGNRFSTKGNLKVHFQRHR 405 (958)
T ss_pred ccccchhhhhHhhcCch--------hhhhhhhhccCCCCCeeecccccccccccceeeeeeecc
Confidence 33456799999999999 999999999999999999999999999999999987665
No 5
>KOG3576|consensus
Probab=99.53 E-value=6e-16 Score=117.25 Aligned_cols=116 Identities=28% Similarity=0.331 Sum_probs=96.2
Q ss_pred CCCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702 15 AKDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI 94 (194)
Q Consensus 15 ~~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~ 94 (194)
..+...+.|.+|||.|... ..|.+|++.|.+.+.+-|..||+.|...-.|++|+++|+ |.+||+|..|+
T Consensus 112 ssd~d~ftCrvCgK~F~lQ--------Rmlnrh~kch~~vkr~lct~cgkgfndtfdlkrh~rtht-gvrpykc~~c~-- 180 (267)
T KOG3576|consen 112 SSDQDSFTCRVCGKKFGLQ--------RMLNRHLKCHSDVKRHLCTFCGKGFNDTFDLKRHTRTHT-GVRPYKCSLCE-- 180 (267)
T ss_pred CCCCCeeeeehhhhhhhHH--------HHHHHHhhhccHHHHHHHhhccCcccchhhhhhhhcccc-Cccccchhhhh--
Confidence 3456778899999999999 999999999999999999999999999999999999999 99999999985
Q ss_pred cccccCCCCCCCCCCCCCChhhHhhhccCC-----------CCCCCCChhhhCCCcc-CCCCCccccccCCCC
Q psy8702 95 GSLKRDNLPPAPSPTGSQSSEDIEIERTPS-----------EERSLEPSDMLGRRSR-SRTPSPPLRSHSPIS 155 (194)
Q Consensus 95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~-----------~ekp~~C~~~Cgk~f~-~~~~~~hl~~H~~~~ 155 (194)
|.|...-.|..|.+.-+ .+|.|.|++ ||..-. --....|++.|....
T Consensus 181 -------------kaftqrcsleshl~kvhgv~~~yaykerr~kl~vced-cg~t~~~~e~~~~h~~~~hp~S 239 (267)
T KOG3576|consen 181 -------------KAFTQRCSLESHLKKVHGVQHQYAYKERRAKLYVCED-CGYTSERPEVYYLHLKLHHPFS 239 (267)
T ss_pred -------------HHHHhhccHHHHHHHHcCchHHHHHHHhhhheeeecc-cCCCCCChhHHHHHHHhcCCCC
Confidence 79999999999975433 367899998 998822 222334566665433
No 6
>KOG3623|consensus
Probab=99.45 E-value=3.6e-14 Score=124.31 Aligned_cols=122 Identities=20% Similarity=0.165 Sum_probs=90.1
Q ss_pred CCCCcccccccccccccccccccccchHHHHHhHcC--CCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702 17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT--DHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI 94 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~--~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~ 94 (194)
-+.-..|++|.+.+.+- ..|+.|+..-+ .+-.|.|..|..+|....+|.+|+.+|..+.....-- . .
T Consensus 207 fsqlltcpycdrgykrl--------tslkeHikyrhekne~nfsC~lCsytFAyRtQLErhm~~hkpg~dqa~sl--t-q 275 (1007)
T KOG3623|consen 207 FSQLLTCPYCDRGYKRL--------TSLKEHIKYRHEKNEPNFSCMLCSYTFAYRTQLERHMQLHKPGGDQAISL--T-Q 275 (1007)
T ss_pred hhhhhcchhHHHHHHHH--------HHHHHHHHHHHhhCCCCCcchhhhhhhhhHHHHHHHHHhhcCCCcccccc--c-c
Confidence 33445789999999998 88999987543 3556899999999999999999999987432221000 0 0
Q ss_pred cccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCC
Q psy8702 95 GSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSP 153 (194)
Q Consensus 95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~ 153 (194)
....+...|.+|||.|..+..|++|+++|.|||||+|+. |+|+ |+.+.....|+.
T Consensus 276 sa~lRKFKCtECgKAFKfKHHLKEHlRIHSGEKPfeCpn-CkKR---FSHSGSySSHmS 330 (1007)
T KOG3623|consen 276 SALLRKFKCTECGKAFKFKHHLKEHLRIHSGEKPFECPN-CKKR---FSHSGSYSSHMS 330 (1007)
T ss_pred hhhhccccccccchhhhhHHHHHhhheeecCCCCcCCcc-cccc---cccCCccccccc
Confidence 111223345666899999999999999999999999996 9999 555555555543
No 7
>KOG3623|consensus
Probab=99.40 E-value=9.4e-14 Score=121.73 Aligned_cols=83 Identities=23% Similarity=0.353 Sum_probs=77.7
Q ss_pred CCCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccc
Q psy8702 15 AKDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESI 94 (194)
Q Consensus 15 ~~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~ 94 (194)
.-....|.|+.|+|+|... ++|.+|.-.|+|.+||+|.+|.|.|..+..|..|+|+|. |||||.|+.|
T Consensus 889 kte~gmyaCDqCDK~FqKq--------SSLaRHKYEHsGqRPyqC~iCkKAFKHKHHLtEHkRLHS-GEKPfQCdKC--- 956 (1007)
T KOG3623|consen 889 KTEDGMYACDQCDKAFQKQ--------SSLARHKYEHSGQRPYQCIICKKAFKHKHHLTEHKRLHS-GEKPFQCDKC--- 956 (1007)
T ss_pred cCccccchHHHHHHHHHhh--------HHHHHhhhhhcCCCCcccchhhHhhhhhhhhhhhhhhcc-CCCcchhhhh---
Confidence 3455678999999999999 999999999999999999999999999999999999999 9999999988
Q ss_pred cccccCCCCCCCCCCCCCChhhHhhhc
Q psy8702 95 GSLKRDNLPPAPSPTGSQSSEDIEIER 121 (194)
Q Consensus 95 ~~~~~~~~~~~C~k~F~~~~~L~~H~~ 121 (194)
+|.|.....+.+||.
T Consensus 957 ------------lKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 957 ------------LKRFSHSGSYSQHMN 971 (1007)
T ss_pred ------------hhhcccccchHhhhc
Confidence 589999999999985
No 8
>KOG3608|consensus
Probab=99.33 E-value=1.2e-13 Score=112.52 Aligned_cols=128 Identities=20% Similarity=0.257 Sum_probs=106.3
Q ss_pred ccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC-CCCCCCCCCcccccccccc---
Q psy8702 24 SLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA-VGDVPPESSLAESIGSLKR--- 99 (194)
Q Consensus 24 ~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~-~~~k~~~c~~c~~~~~~~~--- 99 (194)
..|.+.+.++ +.|++|++.|+++|...|+.||..|.++..|-.|.+..+ ....+|.|..|.+.....+
T Consensus 183 ~~Ct~~~~~k--------~~LreH~r~Hs~eKvvACp~Cg~~F~~~tkl~DH~rRqt~l~~n~fqC~~C~KrFaTeklL~ 254 (467)
T KOG3608|consen 183 AMCTKHMGNK--------YRLREHIRTHSNEKVVACPHCGELFRTKTKLFDHLRRQTELNTNSFQCAQCFKRFATEKLLK 254 (467)
T ss_pred hhhhhhhccH--------HHHHHHHHhcCCCeEEecchHHHHhccccHHHHHHHhhhhhcCCchHHHHHHHHHhHHHHHH
Confidence 8999999999 999999999999999999999999999999999977443 1456899998864333322
Q ss_pred --------CCCCCCCCCCCCCChhhHhhhcc-CCCCCCCCChhhhCCC-ccCCCCCccccccCCCCcccccC
Q psy8702 100 --------DNLPPAPSPTGSQSSEDIEIERT-PSEERSLEPSDMLGRR-SRSRTPSPPLRSHSPISIFRRHQ 161 (194)
Q Consensus 100 --------~~~~~~C~k~F~~~~~L~~H~~~-H~~ekp~~C~~~Cgk~-f~~~~~~~hl~~H~~~~~~~~~~ 161 (194)
.-.|+.|+-+....++|..|+++ |..+|||+|.+ |+++ |+...|..|+.+|+ +..|++..
T Consensus 255 ~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~-Cd~~c~~esdL~kH~~~HS-~~~y~C~h 324 (467)
T KOG3608|consen 255 SHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDE-CDTRCVRESDLAKHVQVHS-KTVYQCEH 324 (467)
T ss_pred HHHHHhhhcccccccccCCCChHHHHHHHHhhhccCCCccccc-hhhhhccHHHHHHHHHhcc-ccceecCC
Confidence 22388999999999999999874 88899999987 9999 55888889999998 45555544
No 9
>KOG3608|consensus
Probab=99.30 E-value=2.8e-13 Score=110.37 Aligned_cols=142 Identities=17% Similarity=0.135 Sum_probs=102.1
Q ss_pred CCCCCCCccccccccccccccccccc------------------ccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHH
Q psy8702 14 VAKDSVNPGLSLYTKSFTWKNLLTNV------------------LTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRR 74 (194)
Q Consensus 14 ~~~~~~~~~c~~Cgk~f~~~~~L~~H------------------~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~ 74 (194)
.....+.|.|..|.|.|.++..|.-| ..+.|.+|++. |...+||+|+.|++.+.+...|.+
T Consensus 231 t~l~~n~fqC~~C~KrFaTeklL~~Hv~rHvn~ykCplCdmtc~~~ssL~~H~r~rHs~dkpfKCd~Cd~~c~~esdL~k 310 (467)
T KOG3608|consen 231 TELNTNSFQCAQCFKRFATEKLLKSHVVRHVNCYKCPLCDMTCSSASSLTTHIRYRHSKDKPFKCDECDTRCVRESDLAK 310 (467)
T ss_pred hhhcCCchHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCCChHHHHHHHHhhhccCCCccccchhhhhccHHHHHH
Confidence 34445678888999999888777665 35667777765 666777777777777777777777
Q ss_pred hhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC-CC--CCCCChhhhCCCcc-CCCCCccccc
Q psy8702 75 HALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS-EE--RSLEPSDMLGRRSR-SRTPSPPLRS 150 (194)
Q Consensus 75 H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~-~e--kp~~C~~~Cgk~f~-~~~~~~hl~~ 150 (194)
|..+|. +-.|.|+.- .|...|.....++.|++.+. |. -+|.|. .|++.|. -++|..||.+
T Consensus 311 H~~~HS--~~~y~C~h~-------------~C~~s~r~~~q~~~H~~evhEg~np~~Y~CH-~Cdr~ft~G~~L~~HL~k 374 (467)
T KOG3608|consen 311 HVQVHS--KTVYQCEHP-------------DCHYSVRTYTQMRRHFLEVHEGNNPILYACH-CCDRFFTSGKSLSAHLMK 374 (467)
T ss_pred HHHhcc--ccceecCCC-------------CCcHHHHHHHHHHHHHHHhccCCCCCceeee-cchhhhccchhHHHHHHH
Confidence 777664 666777652 24568888999999986544 54 469998 5999955 4555555544
Q ss_pred cCCCCcccccCCCCCccccccccCC
Q psy8702 151 HSPISIFRRHQSPSPIGIYRRHSQS 175 (194)
Q Consensus 151 H~~~~~~~~~~~~~~~~~~~~~~~~ 175 (194)
- --.+++++.+...|+.+-+.
T Consensus 375 k----H~f~~PsGh~RFtYk~~edG 395 (467)
T KOG3608|consen 375 K----HGFRLPSGHKRFTYKVDEDG 395 (467)
T ss_pred h----hcccCCCCCCceeeeeccCc
Confidence 3 34678999999999988653
No 10
>KOG3576|consensus
Probab=99.18 E-value=1.3e-12 Score=99.20 Aligned_cols=84 Identities=18% Similarity=0.210 Sum_probs=74.6
Q ss_pred CCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCCh
Q psy8702 53 DHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPS 132 (194)
Q Consensus 53 ~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~ 132 (194)
+...|.|.+|+|.|....-|++|+.-|. .-+.+-|..| ||.|...-+|.+|+++|+|-+||+|.
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~kch~-~vkr~lct~c---------------gkgfndtfdlkrh~rthtgvrpykc~ 177 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLKCHS-DVKRHLCTFC---------------GKGFNDTFDLKRHTRTHTGVRPYKCS 177 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhhhcc-HHHHHHHhhc---------------cCcccchhhhhhhhccccCccccchh
Confidence 4567999999999999999999999998 8888989877 68999999999999999999999998
Q ss_pred hhhCCCcc-CCCCCccccccCC
Q psy8702 133 DMLGRRSR-SRTPSPPLRSHSP 153 (194)
Q Consensus 133 ~~Cgk~f~-~~~~~~hl~~H~~ 153 (194)
.|+|.|. ..+|..||+.=.|
T Consensus 178 -~c~kaftqrcsleshl~kvhg 198 (267)
T KOG3576|consen 178 -LCEKAFTQRCSLESHLKKVHG 198 (267)
T ss_pred -hhhHHHHhhccHHHHHHHHcC
Confidence 4999987 8888888865443
No 11
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.83 E-value=2.9e-09 Score=93.40 Aligned_cols=91 Identities=14% Similarity=0.193 Sum_probs=73.9
Q ss_pred CCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccc
Q psy8702 19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLK 98 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~ 98 (194)
..+.|+.||+.|. . ..|..|+.+++ +++.|+ ||+.+ ....|..|+.+|. +++++.|.+|+
T Consensus 452 ~H~~C~~Cgk~f~-~--------s~LekH~~~~H--kpv~Cp-Cg~~~-~R~~L~~H~~thC-p~Kpi~C~fC~------ 511 (567)
T PLN03086 452 NHVHCEKCGQAFQ-Q--------GEMEKHMKVFH--EPLQCP-CGVVL-EKEQMVQHQASTC-PLRLITCRFCG------ 511 (567)
T ss_pred cCccCCCCCCccc-h--------HHHHHHHHhcC--CCccCC-CCCCc-chhHHHhhhhccC-CCCceeCCCCC------
Confidence 3457899999886 4 67899999875 789999 99755 6689999999999 99999999885
Q ss_pred cCCCCCCCCCCCC----------CChhhHhhhccCCCCCCCCChhhhCCCcc
Q psy8702 99 RDNLPPAPSPTGS----------QSSEDIEIERTPSEERSLEPSDMLGRRSR 140 (194)
Q Consensus 99 ~~~~~~~C~k~F~----------~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~ 140 (194)
+.|. ..+.|..|+.+. |.+++.|.. ||+.+.
T Consensus 512 ---------~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~-Cgk~Vr 552 (567)
T PLN03086 512 ---------DMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDS-CGRSVM 552 (567)
T ss_pred ---------CccccCccccchhhhhhhHHHHHHhc-CCcceEccc-cCCeee
Confidence 4553 235789999885 999999975 999944
No 12
>PHA02768 hypothetical protein; Provisional
Probab=98.76 E-value=3.8e-09 Score=64.28 Aligned_cols=43 Identities=14% Similarity=0.260 Sum_probs=38.4
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHH
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLR 73 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~ 73 (194)
|.|+.||+.|... ++|..|+++|+ ++++|..|++.|.....|.
T Consensus 6 y~C~~CGK~Fs~~--------~~L~~H~r~H~--k~~kc~~C~k~f~~~s~l~ 48 (55)
T PHA02768 6 YECPICGEIYIKR--------KSMITHLRKHN--TNLKLSNCKRISLRTGEYI 48 (55)
T ss_pred cCcchhCCeeccH--------HHHHHHHHhcC--CcccCCcccceecccceeE
Confidence 6899999999999 99999999998 6899999999999776653
No 13
>PHA00733 hypothetical protein
Probab=98.64 E-value=2e-08 Score=72.63 Aligned_cols=89 Identities=13% Similarity=0.141 Sum_probs=69.4
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccccc
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIG 95 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~ 95 (194)
.......|.+|++.|.....|..+ +.|..|+..+ ++++|.|+.|++.|.....|..|++.+. .++.|..|
T Consensus 36 ~~~~~~~~~~~~~~~~~~~~l~~~--~~l~~~~~~~-~~kPy~C~~Cgk~Fss~s~L~~H~r~h~---~~~~C~~C---- 105 (128)
T PHA00733 36 PEQKRLIRAVVKTLIYNPQLLDES--SYLYKLLTSK-AVSPYVCPLCLMPFSSSVSLKQHIRYTE---HSKVCPVC---- 105 (128)
T ss_pred hhhhhHHHHHHhhhccChhhhcch--HHHHhhcccC-CCCCccCCCCCCcCCCHHHHHHHHhcCC---cCccCCCC----
Confidence 344556789999999998666642 2255555444 4789999999999999999999998764 46788766
Q ss_pred ccccCCCCCCCCCCCCCChhhHhhhccCCC
Q psy8702 96 SLKRDNLPPAPSPTGSQSSEDIEIERTPSE 125 (194)
Q Consensus 96 ~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ 125 (194)
++.|.....|..|+...++
T Consensus 106 -----------gK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 106 -----------GKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred -----------CCccCCHHHHHHHHHHhcC
Confidence 5899999999999876655
No 14
>PHA00733 hypothetical protein
Probab=98.52 E-value=1.9e-08 Score=72.76 Aligned_cols=79 Identities=15% Similarity=-0.038 Sum_probs=62.3
Q ss_pred CCCCccCCcCCCCcccchHHHHh--hh---hcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCC
Q psy8702 53 DHKPYECNSCGKVFRRNCDLRRH--AL---THAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEER 127 (194)
Q Consensus 53 ~~k~~~C~~C~k~f~~~~~L~~H--~~---~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ek 127 (194)
..+++.|.+|.+.|.....|..+ +. .+. +++||.|+.| |+.|.....|..|++.+ +.
T Consensus 37 ~~~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~-~~kPy~C~~C---------------gk~Fss~s~L~~H~r~h--~~ 98 (128)
T PHA00733 37 EQKRLIRAVVKTLIYNPQLLDESSYLYKLLTSK-AVSPYVCPLC---------------LMPFSSSVSLKQHIRYT--EH 98 (128)
T ss_pred hhhhHHHHHHhhhccChhhhcchHHHHhhcccC-CCCCccCCCC---------------CCcCCCHHHHHHHHhcC--Cc
Confidence 35789999999988887766665 21 223 5789999877 58999999999999987 46
Q ss_pred CCCChhhhCCCccCCCCCccccccCC
Q psy8702 128 SLEPSDMLGRRSRSRTPSPPLRSHSP 153 (194)
Q Consensus 128 p~~C~~~Cgk~f~~~~~~~hl~~H~~ 153 (194)
+|.|.. |++. |....+|..|..
T Consensus 99 ~~~C~~-CgK~---F~~~~sL~~H~~ 120 (128)
T PHA00733 99 SKVCPV-CGKE---FRNTDSTLDHVC 120 (128)
T ss_pred CccCCC-CCCc---cCCHHHHHHHHH
Confidence 899995 9999 777777777754
No 15
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.45 E-value=8.6e-08 Score=49.83 Aligned_cols=26 Identities=54% Similarity=1.080 Sum_probs=23.4
Q ss_pred hHHHHHhHcCCCCCccCCcCCCCccc
Q psy8702 43 NLKTHLLTHTDHKPYECNSCGKVFRR 68 (194)
Q Consensus 43 ~L~~H~~~h~~~k~~~C~~C~k~f~~ 68 (194)
+|.+|+++|++++||.|+.|++.|.+
T Consensus 1 ~l~~H~~~H~~~k~~~C~~C~k~F~~ 26 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPYCGKSFSN 26 (26)
T ss_dssp HHHHHHHHHSSSSSEEESSSSEEESS
T ss_pred CHHHHhhhcCCCCCCCCCCCcCeeCc
Confidence 47899999999999999999999863
No 16
>PHA02768 hypothetical protein; Provisional
Probab=98.44 E-value=9.5e-08 Score=58.19 Aligned_cols=41 Identities=15% Similarity=-0.013 Sum_probs=33.5
Q ss_pred CCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccc
Q psy8702 102 LPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPL 148 (194)
Q Consensus 102 ~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl 148 (194)
.|+.||+.|...+.|..|+++|+ ++|.|.. |++. |...+.|
T Consensus 7 ~C~~CGK~Fs~~~~L~~H~r~H~--k~~kc~~-C~k~---f~~~s~l 47 (55)
T PHA02768 7 ECPICGEIYIKRKSMITHLRKHN--TNLKLSN-CKRI---SLRTGEY 47 (55)
T ss_pred CcchhCCeeccHHHHHHHHHhcC--CcccCCc-ccce---eccccee
Confidence 35566899999999999999999 7999996 9999 5444444
No 17
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.41 E-value=2.9e-08 Score=51.61 Aligned_cols=25 Identities=16% Similarity=0.062 Sum_probs=22.4
Q ss_pred hhHhhhccCCCCCCCCChhhhCCCcc
Q psy8702 115 EDIEIERTPSEERSLEPSDMLGRRSR 140 (194)
Q Consensus 115 ~L~~H~~~H~~ekp~~C~~~Cgk~f~ 140 (194)
+|..|+++|+|++||.|+. |+++|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~-C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY-CGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS-SSEEES
T ss_pred CHHHHhhhcCCCCCCCCCC-CcCeeC
Confidence 5889999999999999996 999953
No 18
>PLN03086 PRLI-interacting factor K; Provisional
Probab=98.39 E-value=3.9e-07 Score=80.26 Aligned_cols=89 Identities=11% Similarity=0.150 Sum_probs=67.9
Q ss_pred CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChh
Q psy8702 54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSD 133 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~ 133 (194)
++.+.|+.|++.|. ...|..|+.+++ +++.|+ |+ +.+ ....|..|+.+|.+++++.|+.
T Consensus 451 ~~H~~C~~Cgk~f~-~s~LekH~~~~H---kpv~Cp-Cg---------------~~~-~R~~L~~H~~thCp~Kpi~C~f 509 (567)
T PLN03086 451 KNHVHCEKCGQAFQ-QGEMEKHMKVFH---EPLQCP-CG---------------VVL-EKEQMVQHQASTCPLRLITCRF 509 (567)
T ss_pred ccCccCCCCCCccc-hHHHHHHHHhcC---CCccCC-CC---------------CCc-chhHHHhhhhccCCCCceeCCC
Confidence 45678999999996 678999999876 789998 74 444 6799999999999999999995
Q ss_pred hhCCCccC-----------CCCCccccccCCCCcccccCCCCC
Q psy8702 134 MLGRRSRS-----------RTPSPPLRSHSPISIFRRHQSPSP 165 (194)
Q Consensus 134 ~Cgk~f~~-----------~~~~~hl~~H~~~~~~~~~~~~~~ 165 (194)
|++.|.. ..+..|.... |.+.+.+..++..
T Consensus 510 -C~~~v~~g~~~~d~~d~~s~Lt~HE~~C-G~rt~~C~~Cgk~ 550 (567)
T PLN03086 510 -CGDMVQAGGSAMDVRDRLRGMSEHESIC-GSRTAPCDSCGRS 550 (567)
T ss_pred -CCCccccCccccchhhhhhhHHHHHHhc-CCcceEccccCCe
Confidence 9999531 1344455553 6777777776543
No 19
>KOG3993|consensus
Probab=97.98 E-value=1e-06 Score=73.93 Aligned_cols=53 Identities=25% Similarity=0.338 Sum_probs=46.1
Q ss_pred CcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
.|.|..|...|.+. ..|.+|...-.....|+|++|+|.|+-..+|..|.++|.
T Consensus 267 dyiCqLCK~kYeD~--------F~LAQHrC~RIV~vEYrCPEC~KVFsCPANLASHRRWHK 319 (500)
T KOG3993|consen 267 DYICQLCKEKYEDA--------FALAQHRCPRIVHVEYRCPECDKVFSCPANLASHRRWHK 319 (500)
T ss_pred HHHHHHHHHhhhhH--------HHHhhccCCeeEEeeecCCcccccccCchhhhhhhcccC
Confidence 47789999999999 999999854444455999999999999999999999997
No 20
>PHA00616 hypothetical protein
Probab=97.93 E-value=4.1e-06 Score=48.66 Aligned_cols=32 Identities=16% Similarity=0.076 Sum_probs=16.4
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCC
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECN 60 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~ 60 (194)
|.|+.||+.|..+ +.|..|++.|++++++.|+
T Consensus 2 YqC~~CG~~F~~~--------s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 2 YQCLRCGGIFRKK--------KEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CccchhhHHHhhH--------HHHHHHHHHhcCCCcccee
Confidence 3455555555555 5555555555555554443
No 21
>PHA00616 hypothetical protein
Probab=97.91 E-value=3.9e-06 Score=48.75 Aligned_cols=34 Identities=24% Similarity=0.497 Sum_probs=32.0
Q ss_pred CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCc
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSL 90 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~ 90 (194)
||+|..||+.|.....|..|++.|+ |++++.|+.
T Consensus 1 pYqC~~CG~~F~~~s~l~~H~r~~h-g~~~~~~~~ 34 (44)
T PHA00616 1 MYQCLRCGGIFRKKKEVIEHLLSVH-KQNKLTLEY 34 (44)
T ss_pred CCccchhhHHHhhHHHHHHHHHHhc-CCCccceeE
Confidence 6899999999999999999999999 999999864
No 22
>KOG3993|consensus
Probab=97.71 E-value=1.7e-05 Score=66.73 Aligned_cols=124 Identities=26% Similarity=0.320 Sum_probs=80.9
Q ss_pred CCCcccccccccccccccccccccchHHHHHhHc---------------------------------CCCCCccCCcCCC
Q psy8702 18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH---------------------------------TDHKPYECNSCGK 64 (194)
Q Consensus 18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h---------------------------------~~~k~~~C~~C~k 64 (194)
...|.|++|+|.|.-. .+|..|.|-| ..+..|.|..|+|
T Consensus 293 ~vEYrCPEC~KVFsCP--------ANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~~C~K 364 (500)
T KOG3993|consen 293 HVEYRCPECDKVFSCP--------ANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCHTCGK 364 (500)
T ss_pred EeeecCCcccccccCc--------hhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecHHhhh
Confidence 3568899999999999 5555554432 1123589999999
Q ss_pred CcccchHHHHhhhhcCCCCCCC----------------CCCcccccccc--------------ccCCCCCCCCCCCCCCh
Q psy8702 65 VFRRNCDLRRHALTHAVGDVPP----------------ESSLAESIGSL--------------KRDNLPPAPSPTGSQSS 114 (194)
Q Consensus 65 ~f~~~~~L~~H~~~h~~~~k~~----------------~c~~c~~~~~~--------------~~~~~~~~C~k~F~~~~ 114 (194)
.|.+...|+.|+.+|......- -+..+..-... ....+|+.|+..+..+.
T Consensus 365 kFrRqAYLrKHqlthq~~~~~k~~a~~f~~s~~~~l~~~~~~~a~h~~a~~~~g~~vl~~a~sael~~pp~~~~ppsss~ 444 (500)
T KOG3993|consen 365 KFRRQAYLRKHQLTHQRAPLAKEKAPKFLLSRVIPLMHFNQAVATHSSASDSHGDEVLYVAGSAELELPPYDGSPPSSSG 444 (500)
T ss_pred hhHHHHHHHHhHHhhhccccchhcccCcchhhcccccccccccccccccccccccceeeeeccccccCCCCCCCCcccCC
Confidence 9999999999988876222111 11111100000 01112567777777776
Q ss_pred hhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccCC
Q psy8702 115 EDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHSP 153 (194)
Q Consensus 115 ~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~~ 153 (194)
.---+.+.-..++-|.|. +|.-. |..+.+|.+|..
T Consensus 445 ~sgg~~rlg~~~q~f~~k-y~~at---fyss~~ltrhin 479 (500)
T KOG3993|consen 445 SSGGYGRLGIAEQGFTCK-YCPAT---FYSSPGLTRHIN 479 (500)
T ss_pred CCCccccccchhhccccc-cchHh---hhcCcchHhHhh
Confidence 666666666667789998 49888 777778887754
No 23
>PHA00732 hypothetical protein
Probab=97.65 E-value=3.3e-05 Score=51.09 Aligned_cols=46 Identities=24% Similarity=0.285 Sum_probs=37.8
Q ss_pred cccccccccccccccccccccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
+.|+.||+.|... ..|..|++. |.+ +.|+.|++.|. .|..|.....
T Consensus 2 y~C~~Cgk~F~s~--------s~Lk~H~r~~H~~---~~C~~CgKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 2 FKCPICGFTTVTL--------FALKQHARRNHTL---TKCPVCNKSYR---RLNQHFYSQY 48 (79)
T ss_pred ccCCCCCCccCCH--------HHHHHHhhcccCC---CccCCCCCEeC---ChhhhhcccC
Confidence 5799999999999 999999984 654 58999999998 4777875543
No 24
>PHA00732 hypothetical protein
Probab=97.45 E-value=0.00013 Score=48.20 Aligned_cols=47 Identities=15% Similarity=0.171 Sum_probs=36.7
Q ss_pred CccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS 124 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~ 124 (194)
||.|..|++.|.....|..|++. |. + +.|+.| |+.|. .|..|..+..
T Consensus 1 py~C~~Cgk~F~s~s~Lk~H~r~~H~-~---~~C~~C---------------gKsF~---~l~~H~~~~~ 48 (79)
T PHA00732 1 MFKCPICGFTTVTLFALKQHARRNHT-L---TKCPVC---------------NKSYR---RLNQHFYSQY 48 (79)
T ss_pred CccCCCCCCccCCHHHHHHHhhcccC-C---CccCCC---------------CCEeC---ChhhhhcccC
Confidence 58899999999999999999985 54 3 456555 67887 5888885543
No 25
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.22 E-value=0.00012 Score=36.39 Aligned_cols=20 Identities=40% Similarity=0.908 Sum_probs=9.1
Q ss_pred cCCcCCCCcccchHHHHhhh
Q psy8702 58 ECNSCGKVFRRNCDLRRHAL 77 (194)
Q Consensus 58 ~C~~C~k~f~~~~~L~~H~~ 77 (194)
.|+.|++.|.....|..|++
T Consensus 2 ~C~~C~~~f~~~~~l~~H~~ 21 (23)
T PF00096_consen 2 KCPICGKSFSSKSNLKRHMR 21 (23)
T ss_dssp EETTTTEEESSHHHHHHHHH
T ss_pred CCCCCCCccCCHHHHHHHHh
Confidence 34444444444444444443
No 26
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.19 E-value=0.00013 Score=36.33 Aligned_cols=23 Identities=43% Similarity=0.509 Sum_probs=20.4
Q ss_pred cccccccccccccccccccccchHHHHHhHc
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH 51 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h 51 (194)
|.|+.|++.|..+ ..|..|++.|
T Consensus 1 y~C~~C~~~f~~~--------~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSK--------SNLKRHMRRH 23 (23)
T ss_dssp EEETTTTEEESSH--------HHHHHHHHHH
T ss_pred CCCCCCCCccCCH--------HHHHHHHhHC
Confidence 5799999999999 9999998764
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=97.06 E-value=0.00082 Score=41.03 Aligned_cols=51 Identities=24% Similarity=0.252 Sum_probs=38.4
Q ss_pred CCcccccccccccccccccccccchHHHHHhH-cCC-CCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTD-HKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~-~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
+.|.|+.||+.|+. ..|..|... |.. .+.+.|++|...+. .+|..|+..++
T Consensus 1 ~~f~CP~C~~~~~~---------~~L~~H~~~~H~~~~~~v~CPiC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 1 DSFTCPYCGKGFSE---------SSLVEHCEDEHRSESKNVVCPICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CCcCCCCCCCccCH---------HHHHHHHHhHCcCCCCCccCCCchhhhh--hHHHHHHHHhc
Confidence 35789999996554 789999765 554 35799999998644 48888987654
No 28
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=96.80 E-value=0.00072 Score=55.31 Aligned_cols=55 Identities=7% Similarity=-0.001 Sum_probs=41.0
Q ss_pred CCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhcc-C------------------CCCCCCCChhhhCCCccCC
Q psy8702 82 GDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERT-P------------------SEERSLEPSDMLGRRSRSR 142 (194)
Q Consensus 82 ~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~-H------------------~~ekp~~C~~~Cgk~f~~~ 142 (194)
+++||+|++-+ |.|.+.+..-|+.|+.- | ..+|||.|+ +|+|+ +
T Consensus 346 d~KpykCpV~g-------------C~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCe-vC~KR---Y 408 (423)
T COG5189 346 DGKPYKCPVEG-------------CNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCE-VCDKR---Y 408 (423)
T ss_pred cCceecCCCCC-------------chhhhccccchhhhhhccccCcccCCCCCccccccccccCCceecc-ccchh---h
Confidence 56999998754 57899999999988742 2 246999998 59999 6
Q ss_pred CCCccccccCC
Q psy8702 143 TPSPPLRSHSP 153 (194)
Q Consensus 143 ~~~~hl~~H~~ 153 (194)
.-..-|+-|.+
T Consensus 409 KNlNGLKYHr~ 419 (423)
T COG5189 409 KNLNGLKYHRK 419 (423)
T ss_pred ccCccceeccc
Confidence 55556666643
No 29
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.68 E-value=0.00092 Score=33.08 Aligned_cols=23 Identities=48% Similarity=1.054 Sum_probs=11.2
Q ss_pred ccCCcCCCCcccchHHHHhhhhc
Q psy8702 57 YECNSCGKVFRRNCDLRRHALTH 79 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~~h 79 (194)
|.|++|++.|.+...|..|+..+
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHH
T ss_pred CCCcCCCCcCCcHHHHHHHHHhh
Confidence 34555555555555555555443
No 30
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.60 E-value=0.00092 Score=34.53 Aligned_cols=25 Identities=44% Similarity=1.004 Sum_probs=19.2
Q ss_pred CccCCcCCCCcccchHHHHhhhhcC
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
+|.|..|++.|.+...|..|++.|.
T Consensus 1 ~~~C~~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETTTTEEESSHHHHHHHHCTTT
T ss_pred CCCCCccCCccCChhHHHHHhHHhc
Confidence 4678888888888888888877664
No 31
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.48 E-value=0.0012 Score=32.58 Aligned_cols=23 Identities=39% Similarity=0.469 Sum_probs=18.6
Q ss_pred cccccccccccccccccccccchHHHHHhHc
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH 51 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h 51 (194)
|.|+.|++.|.+. ..|..|+.++
T Consensus 1 ~~C~~C~~~~~~~--------~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPICGKSFRSK--------SELRQHMRTH 23 (24)
T ss_dssp EE-SSTS-EESSH--------HHHHHHHHHH
T ss_pred CCCcCCCCcCCcH--------HHHHHHHHhh
Confidence 4699999999999 9999999875
No 32
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.45 E-value=0.0044 Score=37.75 Aligned_cols=51 Identities=16% Similarity=0.216 Sum_probs=36.6
Q ss_pred CccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCC
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPS 124 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~ 124 (194)
.|.|+.|++ ......|..|... |..+.+.+.|++|. ..+ ..+|..|+..++
T Consensus 2 ~f~CP~C~~-~~~~~~L~~H~~~~H~~~~~~v~CPiC~---------------~~~--~~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPYCGK-GFSESSLVEHCEDEHRSESKNVVCPICS---------------SRV--TDNLIRHLNSQH 53 (54)
T ss_pred CcCCCCCCC-ccCHHHHHHHHHhHCcCCCCCccCCCch---------------hhh--hhHHHHHHHHhc
Confidence 489999999 5556899999765 55234578888885 333 358888987654
No 33
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=96.00 E-value=0.0039 Score=42.20 Aligned_cols=57 Identities=21% Similarity=0.340 Sum_probs=21.4
Q ss_pred cccccccccccccccccccc----------------chHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhhhc
Q psy8702 22 GLSLYTKSFTWKNLLTNVLT----------------SNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHALTH 79 (194)
Q Consensus 22 ~c~~Cgk~f~~~~~L~~H~~----------------s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h 79 (194)
+|..|+..|.....|..|.. ..+....+. .....+.|..|++.|.+...|..|++.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKHGFDIPDQKYLVDPNRLLNYLRK-KVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Ccccccccccccccccccccccccccccccccccccccccccccc-ccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 48899999999977777621 111111111 0113589999999999999999999864
No 34
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=95.89 E-value=0.0036 Score=32.20 Aligned_cols=24 Identities=25% Similarity=0.210 Sum_probs=21.4
Q ss_pred cccccccccccccccccccccchHHHHHhHcC
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT 52 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~ 52 (194)
+.|+.|++.|... ..|..|++.|.
T Consensus 2 ~~C~~C~~~F~~~--------~~l~~H~~~h~ 25 (27)
T PF13912_consen 2 FECDECGKTFSSL--------SALREHKRSHC 25 (27)
T ss_dssp EEETTTTEEESSH--------HHHHHHHCTTT
T ss_pred CCCCccCCccCCh--------hHHHHHhHHhc
Confidence 6799999999999 99999998764
No 35
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=95.86 E-value=0.0074 Score=50.99 Aligned_cols=139 Identities=22% Similarity=0.204 Sum_probs=92.6
Q ss_pred CCcccccccccccccccccccccchHHHHHh--HcCCC--CCccCC--cCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLL--THTDH--KPYECN--SCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~--~h~~~--k~~~C~--~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
..+.|..|...|... ..|..|.+ .|.++ +++.|. .|++.|.+...+..|..+|. +..++.+....
T Consensus 288 ~~~~~~~~~~~~s~~--------~~l~~~~~~~~h~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 358 (467)
T COG5048 288 LPIKSKQCNISFSRS--------SPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT-SISPAKEKLLN 358 (467)
T ss_pred cCCCCccccCCcccc--------ccccccccccccccccCCceeeeccCCCccccccccccCCccccc-CCCcccccccc
Confidence 356789999999999 88888888 79999 999999 79999999999999999998 76666665332
Q ss_pred cccccccCCC----------------------CCCCCCCCCCChhhHhhhccCCCCCCCCChh-hhCCCccCCCCCcccc
Q psy8702 93 SIGSLKRDNL----------------------PPAPSPTGSQSSEDIEIERTPSEERSLEPSD-MLGRRSRSRTPSPPLR 149 (194)
Q Consensus 93 ~~~~~~~~~~----------------------~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~-~Cgk~f~~~~~~~hl~ 149 (194)
.......... -..|-..+.....+..|...+....++.|.. .|++. +.....+.
T Consensus 359 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 435 (467)
T COG5048 359 SSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKS---FNRHYNLI 435 (467)
T ss_pred CccccccccCCCCccchhhccCccCCccccccccchhhhhccccccccccccccccCCcCCCCCcchhh---ccCccccc
Confidence 1111111000 1146677777788888888888777554432 47777 44444444
Q ss_pred ccCCCCcccccCCCCCcccc
Q psy8702 150 SHSPISIFRRHQSPSPIGIY 169 (194)
Q Consensus 150 ~H~~~~~~~~~~~~~~~~~~ 169 (194)
.|+....-.........+.|
T Consensus 436 ~~~~~~~~~~~~~~~~~~~~ 455 (467)
T COG5048 436 PHKKIHTNHAPLLCSILKSF 455 (467)
T ss_pred ccccccccCCceeecccccc
Confidence 44444433333333333333
No 36
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.76 E-value=0.0068 Score=30.20 Aligned_cols=24 Identities=46% Similarity=0.998 Sum_probs=16.2
Q ss_pred ccCCcCCCCcccchHHHHhhhhcC
Q psy8702 57 YECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
|.|..|++.|.....|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHHHhc
Confidence 456777777777777777766553
No 37
>smart00355 ZnF_C2H2 zinc finger.
Probab=95.58 E-value=0.0084 Score=29.84 Aligned_cols=24 Identities=38% Similarity=0.281 Sum_probs=20.8
Q ss_pred cccccccccccccccccccccchHHHHHhHcC
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHT 52 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~ 52 (194)
+.|+.|++.|... ..|..|++.|.
T Consensus 1 ~~C~~C~~~f~~~--------~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSK--------SALKEHMRTHX 24 (26)
T ss_pred CCCCCCcchhCCH--------HHHHHHHHHhc
Confidence 4699999999999 99999988664
No 38
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=95.50 E-value=0.01 Score=35.43 Aligned_cols=33 Identities=33% Similarity=0.622 Sum_probs=20.4
Q ss_pred CCCCCccCCcCCCCcccchHHHHhhhhcCCCCCC
Q psy8702 52 TDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVP 85 (194)
Q Consensus 52 ~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~ 85 (194)
..+.|..|++|+..+....+|++|+.+.+ +.+|
T Consensus 20 ~S~~PatCP~C~a~~~~srnLrRHle~~H-~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQSRNLRRHLEIRH-FKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSHHHHHHHHHHHT-TTS-
T ss_pred ccCCCCCCCcchhhccchhhHHHHHHHHh-cccC
Confidence 34678888888888888888888887666 5554
No 39
>PRK04860 hypothetical protein; Provisional
Probab=95.38 E-value=0.011 Score=44.37 Aligned_cols=39 Identities=13% Similarity=0.270 Sum_probs=28.5
Q ss_pred CCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702 19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN 69 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~ 69 (194)
-.|.|. |++ .. ..+.+|.++++++++|.|..|+..|...
T Consensus 118 ~~Y~C~-C~~---~~--------~~~rrH~ri~~g~~~YrC~~C~~~l~~~ 156 (160)
T PRK04860 118 FPYRCK-CQE---HQ--------LTVRRHNRVVRGEAVYRCRRCGETLVFK 156 (160)
T ss_pred EEEEcC-CCC---ee--------CHHHHHHHHhcCCccEECCCCCceeEEe
Confidence 456776 777 34 5678888888888888888888776643
No 40
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=95.23 E-value=0.0084 Score=49.21 Aligned_cols=52 Identities=29% Similarity=0.530 Sum_probs=36.1
Q ss_pred CCCCccc--ccccccccccccccccccchHHHHHhH-c------------------CCCCCccCCcCCCCcccchHHHHh
Q psy8702 17 DSVNPGL--SLYTKSFTWKNLLTNVLTSNLKTHLLT-H------------------TDHKPYECNSCGKVFRRNCDLRRH 75 (194)
Q Consensus 17 ~~~~~~c--~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h------------------~~~k~~~C~~C~k~f~~~~~L~~H 75 (194)
.+..|.| +-|.|.|+.. ..|+.|+.- | ...|||.|++|+|.|...-.|+.|
T Consensus 346 d~KpykCpV~gC~K~Yknq--------nGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCevC~KRYKNlNGLKYH 417 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQ--------NGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCDKRYKNLNGLKYH 417 (423)
T ss_pred cCceecCCCCCchhhhccc--------cchhhhhhccccCcccCCCCCccccccccccCCceeccccchhhccCccceec
Confidence 3466778 7899999999 444444431 1 224788888888888888887776
Q ss_pred h
Q psy8702 76 A 76 (194)
Q Consensus 76 ~ 76 (194)
.
T Consensus 418 r 418 (423)
T COG5189 418 R 418 (423)
T ss_pred c
Confidence 4
No 41
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=94.91 E-value=0.0062 Score=41.23 Aligned_cols=65 Identities=18% Similarity=0.124 Sum_probs=15.0
Q ss_pred cCCcCCCCcccchHHHHhhhhcCCCCCCCCCC----c---cccccccc-cCCCCCCCCCCCCCChhhHhhhccC
Q psy8702 58 ECNSCGKVFRRNCDLRRHALTHAVGDVPPESS----L---AESIGSLK-RDNLPPAPSPTGSQSSEDIEIERTP 123 (194)
Q Consensus 58 ~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~----~---c~~~~~~~-~~~~~~~C~k~F~~~~~L~~H~~~H 123 (194)
+|..|+..|.....|..|+...+ +-..-... . ........ ....|..|++.|.....|..|++.+
T Consensus 1 ~C~~C~~~f~~~~~l~~H~~~~H-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~C~~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 1 QCLFCDESFSSVDDLLQHMKKKH-GFDIPDQKYLVDPNRLLNYLRKKVKESFRCPYCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp -------------------------------------------------SSEEBSSSS-EESSHHHHHHHHHHT
T ss_pred Ccccccccccccccccccccccc-ccccccccccccccccccccccccCCCCCCCccCCCCcCHHHHHHHHcCc
Confidence 58899999999999999987544 32111110 0 00000000 1234667777777777777777643
No 42
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=94.76 E-value=0.026 Score=33.74 Aligned_cols=33 Identities=18% Similarity=0.202 Sum_probs=22.6
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP 56 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~ 56 (194)
.+....-|++|+..+... .+|++|+.+.++.+|
T Consensus 20 ~S~~PatCP~C~a~~~~s--------rnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 20 QSEQPATCPICGAVIRQS--------RNLRRHLEIRHFKKP 52 (54)
T ss_dssp TTS--EE-TTT--EESSH--------HHHHHHHHHHTTTS-
T ss_pred ccCCCCCCCcchhhccch--------hhHHHHHHHHhcccC
Confidence 345556799999999999 999999988777665
No 43
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.37 E-value=0.017 Score=29.04 Aligned_cols=21 Identities=33% Similarity=0.752 Sum_probs=12.3
Q ss_pred ccCCcCCCCcccchHHHHhhh
Q psy8702 57 YECNSCGKVFRRNCDLRRHAL 77 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~ 77 (194)
|.|.+|++.|.+...|..|++
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~ 21 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLR 21 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHT
T ss_pred CCCCCCCCCcCCHHHHHHHHC
Confidence 345666666666666666654
No 44
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.24 E-value=0.017 Score=29.05 Aligned_cols=22 Identities=27% Similarity=0.359 Sum_probs=19.0
Q ss_pred cccccccccccccccccccccchHHHHHhH
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLT 50 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~ 50 (194)
|.|+.|++.|... ..|..|++.
T Consensus 1 ~~C~~C~~~f~s~--------~~~~~H~~s 22 (25)
T PF12874_consen 1 FYCDICNKSFSSE--------NSLRQHLRS 22 (25)
T ss_dssp EEETTTTEEESSH--------HHHHHHHTT
T ss_pred CCCCCCCCCcCCH--------HHHHHHHCc
Confidence 5799999999999 888888764
No 45
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.69 E-value=0.012 Score=30.36 Aligned_cols=19 Identities=32% Similarity=0.805 Sum_probs=9.0
Q ss_pred cCCcCCCCcccchHHHHhh
Q psy8702 58 ECNSCGKVFRRNCDLRRHA 76 (194)
Q Consensus 58 ~C~~C~k~f~~~~~L~~H~ 76 (194)
-|..|++.|.+...|..|+
T Consensus 3 ~C~~C~k~f~~~~~~~~H~ 21 (27)
T PF12171_consen 3 YCDACDKYFSSENQLKQHM 21 (27)
T ss_dssp BBTTTTBBBSSHHHHHCCT
T ss_pred CcccCCCCcCCHHHHHHHH
Confidence 3444444444444444443
No 46
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.11 E-value=0.041 Score=28.28 Aligned_cols=21 Identities=24% Similarity=0.248 Sum_probs=16.3
Q ss_pred CcccccccccccccccccccccchHHHHH
Q psy8702 20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHL 48 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~ 48 (194)
.++|+.|++.|.+. ..|..|+
T Consensus 1 q~~C~~C~k~f~~~--------~~~~~H~ 21 (27)
T PF12171_consen 1 QFYCDACDKYFSSE--------NQLKQHM 21 (27)
T ss_dssp -CBBTTTTBBBSSH--------HHHHCCT
T ss_pred CCCcccCCCCcCCH--------HHHHHHH
Confidence 47899999999999 5555554
No 47
>PRK04860 hypothetical protein; Provisional
Probab=92.30 E-value=0.086 Score=39.58 Aligned_cols=32 Identities=25% Similarity=0.563 Sum_probs=28.7
Q ss_pred CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
+|.|. |+. ....+..|.++|. ++++|.|..|+
T Consensus 119 ~Y~C~-C~~---~~~~~rrH~ri~~-g~~~YrC~~C~ 150 (160)
T PRK04860 119 PYRCK-CQE---HQLTVRRHNRVVR-GEAVYRCRRCG 150 (160)
T ss_pred EEEcC-CCC---eeCHHHHHHHHhc-CCccEECCCCC
Confidence 59998 987 7788899999999 99999999886
No 48
>KOG2893|consensus
Probab=90.92 E-value=0.1 Score=41.33 Aligned_cols=53 Identities=23% Similarity=0.285 Sum_probs=41.0
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHh-hhhcC
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRH-ALTHA 80 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H-~~~h~ 80 (194)
+.-.++-|-+|++.|... .-|..|++. +.|+|-+|-|...+-..|..| +.+|.
T Consensus 6 kk~~kpwcwycnrefdde--------kiliqhqka----khfkchichkkl~sgpglsihcmqvhk 59 (341)
T KOG2893|consen 6 KKVDKPWCWYCNREFDDE--------KILIQHQKA----KHFKCHICHKKLFSGPGLSIHCMQVHK 59 (341)
T ss_pred cccCCceeeecccccchh--------hhhhhhhhh----ccceeeeehhhhccCCCceeehhhhhh
Confidence 444567899999999999 888888764 669999999887777777777 44443
No 49
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=90.52 E-value=0.12 Score=25.62 Aligned_cols=22 Identities=32% Similarity=0.739 Sum_probs=10.9
Q ss_pred ccCCcCCCCcccchHHHHhhhhc
Q psy8702 57 YECNSCGKVFRRNCDLRRHALTH 79 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~~h 79 (194)
|.|+.|+.... ...|..|++.+
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~ 22 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRH 22 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHH
T ss_pred CCCCCCCCcCC-HHHHHHHHHhh
Confidence 35566665554 55566665543
No 50
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=89.42 E-value=0.29 Score=34.22 Aligned_cols=34 Identities=18% Similarity=0.289 Sum_probs=27.9
Q ss_pred CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702 17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN 69 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~ 69 (194)
-+.|..|+.||+.|..- +..|..|+.||..|.-.
T Consensus 6 lGtKR~Cp~CG~kFYDL-------------------nk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 6 LGTKRTCPSCGAKFYDL-------------------NKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cCCcccCCCCcchhccC-------------------CCCCccCCCCCCccCcc
Confidence 45677899999999775 34788999999998876
No 51
>KOG2231|consensus
Probab=89.12 E-value=0.4 Score=43.62 Aligned_cols=41 Identities=10% Similarity=-0.043 Sum_probs=22.9
Q ss_pred CCCCCChhhHhhhccCCCCCCCCChh-hhCCC-cc-CCCCCccccccC
Q psy8702 108 PTGSQSSEDIEIERTPSEERSLEPSD-MLGRR-SR-SRTPSPPLRSHS 152 (194)
Q Consensus 108 k~F~~~~~L~~H~~~H~~ekp~~C~~-~Cgk~-f~-~~~~~~hl~~H~ 152 (194)
.-|..-..|..|-+. .-|.|++ .|--. |. .+.+..+|+.|.
T Consensus 220 eyy~~~~dLe~HfR~----~HflCE~~~C~~~~f~~~~~~ei~lk~~~ 263 (669)
T KOG2231|consen 220 EYYNDYDDLEEHFRK----GHFLCEEEFCRTKKFYVAFELEIELKAHN 263 (669)
T ss_pred hhcccchHHHHHhhh----cCccccccccccceeeehhHHHHHHHhhc
Confidence 445555666666443 3366653 35433 44 557777788665
No 52
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=88.63 E-value=0.34 Score=24.48 Aligned_cols=18 Identities=33% Similarity=0.803 Sum_probs=8.9
Q ss_pred cCCcCCCCcccchHHHHhh
Q psy8702 58 ECNSCGKVFRRNCDLRRHA 76 (194)
Q Consensus 58 ~C~~C~k~f~~~~~L~~H~ 76 (194)
.|+.||+.| ....|..|+
T Consensus 4 ~C~~CgR~F-~~~~l~~H~ 21 (25)
T PF13913_consen 4 PCPICGRKF-NPDRLEKHE 21 (25)
T ss_pred cCCCCCCEE-CHHHHHHHH
Confidence 455555555 334444444
No 53
>KOG2893|consensus
Probab=87.57 E-value=0.24 Score=39.33 Aligned_cols=45 Identities=7% Similarity=-0.058 Sum_probs=35.0
Q ss_pred CCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC-ccCCCCCcc-cccc
Q psy8702 102 LPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR-SRSRTPSPP-LRSH 151 (194)
Q Consensus 102 ~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~-f~~~~~~~h-l~~H 151 (194)
-|+.|.+.|....-|.+|++ .|-|+|. ||.|. |.--.|..| +..|
T Consensus 12 wcwycnrefddekiliqhqk----akhfkch-ichkkl~sgpglsihcmqvh 58 (341)
T KOG2893|consen 12 WCWYCNREFDDEKILIQHQK----AKHFKCH-ICHKKLFSGPGLSIHCMQVH 58 (341)
T ss_pred eeeecccccchhhhhhhhhh----hccceee-eehhhhccCCCceeehhhhh
Confidence 37788999999999999986 4679997 79999 665555555 5555
No 54
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=86.51 E-value=0.49 Score=25.47 Aligned_cols=22 Identities=18% Similarity=0.490 Sum_probs=16.2
Q ss_pred CccCCcCCCCcccchHHHHhhh
Q psy8702 56 PYECNSCGKVFRRNCDLRRHAL 77 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~ 77 (194)
+|.|++|++.|.....+..|+.
T Consensus 3 ~~~C~~C~~~~~~~~~~~~H~~ 24 (35)
T smart00451 3 GFYCKLCNVTFTDEISVEAHLK 24 (35)
T ss_pred CeEccccCCccCCHHHHHHHHC
Confidence 4678888888887777777764
No 55
>KOG2231|consensus
Probab=82.72 E-value=0.69 Score=42.17 Aligned_cols=56 Identities=11% Similarity=-0.145 Sum_probs=33.2
Q ss_pred CCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC---ccCCCCCccccccCCCCccccc
Q psy8702 100 DNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR---SRSRTPSPPLRSHSPISIFRRH 160 (194)
Q Consensus 100 ~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~---f~~~~~~~hl~~H~~~~~~~~~ 160 (194)
+.+|..|...|.....|..|++.++ |.|. +|.+. .-.|.....|..|-+..-|.+.
T Consensus 182 hp~C~~C~~~fld~~el~rH~~~~h----~~ch-fC~~~~~~neyy~~~~dLe~HfR~~HflCE 240 (669)
T KOG2231|consen 182 HPLCKFCHERFLDDDELYRHLRFDH----EFCH-FCDYKTGQNEYYNDYDDLEEHFRKGHFLCE 240 (669)
T ss_pred CccchhhhhhhccHHHHHHhhccce----ehee-ecCcccccchhcccchHHHHHhhhcCcccc
Confidence 3456677777777777777776543 4554 36432 1145555667777666666554
No 56
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=82.54 E-value=0.76 Score=38.66 Aligned_cols=70 Identities=16% Similarity=0.154 Sum_probs=57.4
Q ss_pred CCccCCcCCCCcccchHHHHhhh--hcCCCC--CCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCC
Q psy8702 55 KPYECNSCGKVFRRNCDLRRHAL--THAVGD--VPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLE 130 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H~~--~h~~~~--k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~ 130 (194)
.++.|..|...|.....|..|.+ .|. ++ +++.|+.- .|++.|.....+..|..+|.+..++.
T Consensus 288 ~~~~~~~~~~~~s~~~~l~~~~~~~~h~-~~~~~~~~~p~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (467)
T COG5048 288 LPIKSKQCNISFSRSSPLTRHLRSVNHS-GESLKPFSCPYS-------------LCGKLFSRNDALKRHILLHTSISPAK 353 (467)
T ss_pred cCCCCccccCCccccccccccccccccc-cccCCceeeecc-------------CCCccccccccccCCcccccCCCccc
Confidence 46889999999999999999999 788 88 89999731 23689999999999999999988877
Q ss_pred Chh-hhCCC
Q psy8702 131 PSD-MLGRR 138 (194)
Q Consensus 131 C~~-~Cgk~ 138 (194)
+.. .|.+.
T Consensus 354 ~~~~~~~~~ 362 (467)
T COG5048 354 EKLLNSSSK 362 (467)
T ss_pred cccccCccc
Confidence 743 34444
No 57
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=82.42 E-value=1.1 Score=32.12 Aligned_cols=38 Identities=13% Similarity=0.040 Sum_probs=29.1
Q ss_pred CCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHH
Q psy8702 17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLR 73 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~ 73 (194)
-+.|..|+.||+.|... +..|..|+.||..|.-...++
T Consensus 6 lGtKr~Cp~cg~kFYDL-------------------nk~p~vcP~cg~~~~~~~~~~ 43 (129)
T TIGR02300 6 LGTKRICPNTGSKFYDL-------------------NRRPAVSPYTGEQFPPEEALK 43 (129)
T ss_pred hCccccCCCcCcccccc-------------------CCCCccCCCcCCccCcchhhc
Confidence 35677899999999774 347899999999887664443
No 58
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=82.35 E-value=0.89 Score=24.53 Aligned_cols=13 Identities=0% Similarity=-0.225 Sum_probs=8.6
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
+.|..||..+...
T Consensus 2 ~~C~~CGy~y~~~ 14 (33)
T cd00350 2 YVCPVCGYIYDGE 14 (33)
T ss_pred EECCCCCCEECCC
Confidence 5677777776554
No 59
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=78.88 E-value=0.71 Score=36.29 Aligned_cols=21 Identities=29% Similarity=0.518 Sum_probs=13.3
Q ss_pred CCccCCcCCCCcccchHHHHh
Q psy8702 55 KPYECNSCGKVFRRNCDLRRH 75 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H 75 (194)
+...|++|++.|.++.-+...
T Consensus 4 k~~~CPvC~~~F~~~~vrs~~ 24 (214)
T PF09986_consen 4 KKITCPVCGKEFKTKKVRSGK 24 (214)
T ss_pred CceECCCCCCeeeeeEEEcCC
Confidence 456777777777776544433
No 60
>KOG2186|consensus
Probab=77.15 E-value=1.6 Score=34.94 Aligned_cols=46 Identities=20% Similarity=0.426 Sum_probs=37.7
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchHHHHhhh
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRHAL 77 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~ 77 (194)
+-|..||-.... ..+..|+....+ .-|.|..|++.|.. ..+..|..
T Consensus 4 FtCnvCgEsvKK---------p~vekH~srCrn-~~fSCIDC~k~F~~-~sYknH~k 49 (276)
T KOG2186|consen 4 FTCNVCGESVKK---------PQVEKHMSRCRN-AYFSCIDCGKTFER-VSYKNHTK 49 (276)
T ss_pred Eehhhhhhhccc---------cchHHHHHhccC-CeeEEeeccccccc-chhhhhhh
Confidence 568999988877 467779988777 67999999999998 67777865
No 61
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=73.82 E-value=3.1 Score=21.22 Aligned_cols=10 Identities=10% Similarity=-0.147 Sum_probs=5.0
Q ss_pred CCCCCCCCCC
Q psy8702 101 NLPPAPSPTG 110 (194)
Q Consensus 101 ~~~~~C~k~F 110 (194)
..|+.||..|
T Consensus 15 ~~Cp~CG~~F 24 (26)
T PF10571_consen 15 KFCPHCGYDF 24 (26)
T ss_pred CcCCCCCCCC
Confidence 3345555555
No 62
>KOG1146|consensus
Probab=72.27 E-value=1.6 Score=42.61 Aligned_cols=98 Identities=15% Similarity=0.161 Sum_probs=55.4
Q ss_pred cccccccccccccccccccchHHHHHhH-cCCCCCccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccC
Q psy8702 23 LSLYTKSFTWKNLLTNVLTSNLKTHLLT-HTDHKPYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRD 100 (194)
Q Consensus 23 c~~Cgk~f~~~~~L~~H~~s~L~~H~~~-h~~~k~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~ 100 (194)
|..|+..+..+ ..+..++.. +.-.+.+.|+.|+..|.....|..|++. |.+++. .+|.
T Consensus 439 ~~~~e~~~~s~--------r~~~~~t~~L~S~~kt~~cpkc~~~yk~a~~L~vhmRskhp~~~~----~~c~-------- 498 (1406)
T KOG1146|consen 439 LTKAEPLLESK--------RSLEGQTVVLHSFFKTLKCPKCNWHYKLAQTLGVHMRSKHPESQS----AYCK-------- 498 (1406)
T ss_pred ccchhhhhhhh--------cccccceeeeecccccccCCccchhhhhHHHhhhcccccccccch----hHhH--------
Confidence 34445455554 334444432 3445789999999999999999999986 432333 3332
Q ss_pred CCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCCccCCCCCccccccC
Q psy8702 101 NLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRRSRSRTPSPPLRSHS 152 (194)
Q Consensus 101 ~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~f~~~~~~~hl~~H~ 152 (194)
.+.....+.+-.-.--+.++|.|. .|..+ ++....|-.|+
T Consensus 499 --------~gq~~~~~arg~~~~~~~~p~~C~-~C~~s---tttng~Lsihl 538 (1406)
T KOG1146|consen 499 --------AGQNHPRLARGEVYRCPGKPYPCR-ACNYS---TTTNGNLSIHL 538 (1406)
T ss_pred --------hccccccccccccccCCCCcccce-eeeee---eecchHHHHHH
Confidence 222222222212222345788997 49888 44444444443
No 63
>KOG2482|consensus
Probab=68.01 E-value=3.8 Score=34.41 Aligned_cols=66 Identities=11% Similarity=0.013 Sum_probs=34.2
Q ss_pred ccCCcCCCCcccchHHHHhhhh-cCCC----CCCCCCCccc------cccccccCCCCCCCCCCCCCChhhHhhhcc
Q psy8702 57 YECNSCGKVFRRNCDLRRHALT-HAVG----DVPPESSLAE------SIGSLKRDNLPPAPSPTGSQSSEDIEIERT 122 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~~-h~~~----~k~~~c~~c~------~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~ 122 (194)
..|-.|....-+...|..|+.+ |.+. ...|.-.+.+ ..........|..|.-.|.....|..||.-
T Consensus 280 v~CLfC~~~~en~~~l~eHmk~vHe~Dl~Ki~sd~~Ln~YqrvrviNyiRkq~~~~~c~~cd~~F~~e~~l~~hm~e 356 (423)
T KOG2482|consen 280 VVCLFCTNFYENPVFLFEHMKIVHEFDLLKIQSDYSLNFYQRVRVINYIRKQKKKSRCAECDLSFWKEPGLLIHMVE 356 (423)
T ss_pred eEEEeeccchhhHHHHHHHHHHHHHhhHHhhccccccchhhhhhHHHHHHHHhhccccccccccccCcchhhhhccc
Confidence 4666677666666677777664 2211 1111111110 111111223367788888888888888753
No 64
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=66.71 E-value=1.6 Score=32.47 Aligned_cols=19 Identities=32% Similarity=0.613 Sum_probs=14.0
Q ss_pred CCccCCcCCCCcccchHHH
Q psy8702 55 KPYECNSCGKVFRRNCDLR 73 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~ 73 (194)
+.++|+.||+.|.+...+.
T Consensus 27 ~~~~c~~c~~~f~~~e~~~ 45 (154)
T PRK00464 27 RRRECLACGKRFTTFERVE 45 (154)
T ss_pred eeeeccccCCcceEeEecc
Confidence 3488999999998765443
No 65
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=65.37 E-value=4.9 Score=22.05 Aligned_cols=34 Identities=18% Similarity=0.283 Sum_probs=19.7
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcc
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFR 67 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~ 67 (194)
..|+.|+..|.-. ... .........|+.|+..|.
T Consensus 3 ~~CP~C~~~~~v~--------~~~-----~~~~~~~v~C~~C~~~~~ 36 (38)
T TIGR02098 3 IQCPNCKTSFRVV--------DSQ-----LGANGGKVRCGKCGHVWY 36 (38)
T ss_pred EECCCCCCEEEeC--------HHH-----cCCCCCEEECCCCCCEEE
Confidence 3578888877665 221 111223477888887664
No 66
>PF06524 NOA36: NOA36 protein; InterPro: IPR010531 This family consists of several NOA36 proteins which contain 29 highly conserved cysteine residues. The function of this protein is unknown.; GO: 0008270 zinc ion binding, 0005634 nucleus
Probab=64.64 E-value=8.7 Score=31.11 Aligned_cols=95 Identities=6% Similarity=0.076 Sum_probs=44.9
Q ss_pred CCCCccCCcCCCCcccchHHHHhhhh-cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCC
Q psy8702 53 DHKPYECNSCGKVFRRNCDLRRHALT-HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEP 131 (194)
Q Consensus 53 ~~k~~~C~~C~k~f~~~~~L~~H~~~-h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C 131 (194)
|.+.|.|..|..-.--..++. |+.. .......|+|..|+..+. .. |.-|.-.|-.....+.-. .....+++.|
T Consensus 139 GGrif~CsfC~~flCEDDQFE-HQAsCQvLe~E~~KC~SCNrlGq-~s---CLRCK~cfCddHvrrKg~-ky~k~k~~PC 212 (314)
T PF06524_consen 139 GGRIFKCSFCDNFLCEDDQFE-HQASCQVLESETFKCQSCNRLGQ-YS---CLRCKICFCDDHVRRKGF-KYEKGKPIPC 212 (314)
T ss_pred CCeEEEeecCCCeeeccchhh-hhhhhhhhhcccccccccccccc-hh---hhheeeeehhhhhhhccc-ccccCCCCCC
Confidence 446677777765443444433 3322 222445677777764331 11 222222222211111111 1234589999
Q ss_pred hhhhCCCccCCCCCccccccCCCCcc
Q psy8702 132 SDMLGRRSRSRTPSPPLRSHSPISIF 157 (194)
Q Consensus 132 ~~~Cgk~f~~~~~~~hl~~H~~~~~~ 157 (194)
+. ||.. ......|..-++...|
T Consensus 213 PK-Cg~e---t~eTkdLSmStR~hky 234 (314)
T PF06524_consen 213 PK-CGYE---TQETKDLSMSTRSHKY 234 (314)
T ss_pred CC-CCCc---ccccccceeeeecchh
Confidence 97 9999 4444444444443344
No 67
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=63.28 E-value=3.2 Score=23.61 Aligned_cols=13 Identities=8% Similarity=-0.171 Sum_probs=8.8
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
|.|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (42)
T PF09723_consen 6 YRCEECGHEFEVL 18 (42)
T ss_pred EEeCCCCCEEEEE
Confidence 5677777776654
No 68
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=62.74 E-value=5.1 Score=21.74 Aligned_cols=12 Identities=0% Similarity=-0.476 Sum_probs=7.2
Q ss_pred cccccccccccc
Q psy8702 21 PGLSLYTKSFTW 32 (194)
Q Consensus 21 ~~c~~Cgk~f~~ 32 (194)
|.|..||..+..
T Consensus 3 ~~C~~CG~i~~g 14 (34)
T cd00729 3 WVCPVCGYIHEG 14 (34)
T ss_pred EECCCCCCEeEC
Confidence 556677666544
No 69
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=61.76 E-value=7.2 Score=21.51 Aligned_cols=33 Identities=21% Similarity=0.417 Sum_probs=19.8
Q ss_pred ccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcc
Q psy8702 22 GLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFR 67 (194)
Q Consensus 22 ~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~ 67 (194)
.|+.|+..|.-. .. ++-.+.....|..|+..|.
T Consensus 4 ~CP~C~~~f~v~--------~~-----~l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 4 TCPNCQTRFRVP--------DD-----KLPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred ECCCCCceEEcC--------HH-----HcccCCcEEECCCCCcEee
Confidence 477788777665 11 1223345677888877663
No 70
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=61.44 E-value=7.3 Score=21.40 Aligned_cols=32 Identities=16% Similarity=0.392 Sum_probs=19.3
Q ss_pred ccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCc
Q psy8702 22 GLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVF 66 (194)
Q Consensus 22 ~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f 66 (194)
.|+.|+..|.-. .. ++-......+|..|+..|
T Consensus 4 ~Cp~C~~~y~i~--------d~-----~ip~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEID--------DE-----KIPPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCC--------HH-----HCCCCCcEEECCCCCCEe
Confidence 577788777665 11 122333457788887765
No 71
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=60.07 E-value=6.6 Score=22.67 Aligned_cols=14 Identities=0% Similarity=-0.111 Sum_probs=9.4
Q ss_pred Cccccccccccccc
Q psy8702 20 NPGLSLYTKSFTWK 33 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~ 33 (194)
.|.|+.||..|...
T Consensus 3 ~y~C~~CG~~~~~~ 16 (46)
T PRK00398 3 EYKCARCGREVELD 16 (46)
T ss_pred EEECCCCCCEEEEC
Confidence 35677787776554
No 72
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=59.38 E-value=6.5 Score=26.28 Aligned_cols=14 Identities=29% Similarity=0.831 Sum_probs=9.9
Q ss_pred CCccCCcCCCCccc
Q psy8702 55 KPYECNSCGKVFRR 68 (194)
Q Consensus 55 k~~~C~~C~k~f~~ 68 (194)
..+.|..||..|..
T Consensus 52 GIW~C~kCg~~fAG 65 (89)
T COG1997 52 GIWKCRKCGAKFAG 65 (89)
T ss_pred CeEEcCCCCCeecc
Confidence 45778888877764
No 73
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=59.32 E-value=3.8 Score=24.23 Aligned_cols=13 Identities=8% Similarity=-0.107 Sum_probs=8.4
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
|.|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (52)
T TIGR02605 6 YRCTACGHRFEVL 18 (52)
T ss_pred EEeCCCCCEeEEE
Confidence 5677777766554
No 74
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=58.07 E-value=5.2 Score=30.52 Aligned_cols=17 Identities=18% Similarity=0.315 Sum_probs=7.7
Q ss_pred CCccCCcCCCCcccchH
Q psy8702 55 KPYECNSCGKVFRRNCD 71 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~ 71 (194)
.-|.|+.|+..|+....
T Consensus 116 ~~Y~Cp~C~~rytf~eA 132 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEA 132 (178)
T ss_pred CEEECCCCCcEEeHHHH
Confidence 33445445444444433
No 75
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=57.54 E-value=7.4 Score=29.38 Aligned_cols=24 Identities=33% Similarity=0.768 Sum_probs=18.3
Q ss_pred CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
+.|.|.+||. ++. |+-|-.|+.|+
T Consensus 133 ~~~vC~vCGy-------------~~~-ge~P~~CPiCg 156 (166)
T COG1592 133 KVWVCPVCGY-------------THE-GEAPEVCPICG 156 (166)
T ss_pred CEEEcCCCCC-------------ccc-CCCCCcCCCCC
Confidence 3688999985 345 78888888886
No 76
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=57.33 E-value=13 Score=31.49 Aligned_cols=96 Identities=16% Similarity=0.084 Sum_probs=56.1
Q ss_pred CCCCCCCCCCCC-CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCC----CcccchHHHHhhhh
Q psy8702 4 PRKNHSNVGNVA-KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGK----VFRRNCDLRRHALT 78 (194)
Q Consensus 4 ~~r~~~~~~~~~-~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k----~f~~~~~L~~H~~~ 78 (194)
+.|+|.+.+... -=..-|.|..|.+-|... ..|..|++.-+. +-+.|+..+. -|..-.+|..|.
T Consensus 203 ~Lr~H~~~G~~e~GFKGHP~C~FC~~~FYdD--------DEL~~HcR~~HE-~ChICD~v~p~~~QYFK~Y~~Le~HF-- 271 (493)
T COG5236 203 TLRDHKNGGLEEEGFKGHPLCIFCKIYFYDD--------DELRRHCRLRHE-ACHICDMVGPIRYQYFKSYEDLEAHF-- 271 (493)
T ss_pred cccccccCCccccCcCCCchhhhccceecCh--------HHHHHHHHhhhh-hhhhhhccCccchhhhhCHHHHHHHh--
Confidence 456777665554 234558899999999998 888999875433 3344433331 233344454443
Q ss_pred cCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhcc
Q psy8702 79 HAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERT 122 (194)
Q Consensus 79 h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~ 122 (194)
...-|.|.+-.+.. .--..|.....|..|+..
T Consensus 272 ---~~~hy~ct~qtc~~---------~k~~vf~~~~el~~h~~~ 303 (493)
T COG5236 272 ---RNAHYCCTFQTCRV---------GKCYVFPYHTELLEHLTR 303 (493)
T ss_pred ---hcCceEEEEEEEec---------CcEEEeccHHHHHHHHHH
Confidence 33446676422111 001368888999999743
No 77
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=56.67 E-value=6.5 Score=21.75 Aligned_cols=13 Identities=8% Similarity=-0.169 Sum_probs=8.7
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
|.|..||..|...
T Consensus 6 y~C~~Cg~~fe~~ 18 (41)
T smart00834 6 YRCEDCGHTFEVL 18 (41)
T ss_pred EEcCCCCCEEEEE
Confidence 5677777776654
No 78
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=56.34 E-value=28 Score=24.47 Aligned_cols=77 Identities=10% Similarity=-0.016 Sum_probs=41.3
Q ss_pred CCccCCcCCCCcccchHHHHhhhhcC-CCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCChh
Q psy8702 55 KPYECNSCGKVFRRNCDLRRHALTHA-VGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSD 133 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H~~~h~-~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~ 133 (194)
-|..|++||-+......|.+. .|+ -.-++|+--.- ........|..|.+.|........ ..-.....|.|+.
T Consensus 14 LP~~CpiCgLtLVss~HLARS--yHHLfPl~~f~ev~~---~~~~~~~~C~~C~~~f~~~~~~~~--~~~~~~~~y~C~~ 86 (112)
T TIGR00622 14 LPVECPICGLTLILSTHLARS--YHHLFPLKAFQEIPL---EEYNGSRFCFGCQGPFPKPPVSPF--DELKDSHRYVCAV 86 (112)
T ss_pred CCCcCCcCCCEEeccchHHHh--hhccCCCcccccccc---cccCCCCcccCcCCCCCCcccccc--cccccccceeCCC
Confidence 467899999888888887763 222 01222321100 011123347778888875432110 0122345788986
Q ss_pred hhCCCc
Q psy8702 134 MLGRRS 139 (194)
Q Consensus 134 ~Cgk~f 139 (194)
|...|
T Consensus 87 -C~~~F 91 (112)
T TIGR00622 87 -CKNVF 91 (112)
T ss_pred -CCCcc
Confidence 98883
No 79
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=55.91 E-value=8.9 Score=23.71 Aligned_cols=9 Identities=11% Similarity=0.032 Sum_probs=4.9
Q ss_pred CCCCCCccc
Q psy8702 84 VPPESSLAE 92 (194)
Q Consensus 84 k~~~c~~c~ 92 (194)
.+|.|+.|+
T Consensus 49 ~~Y~Cp~CG 57 (61)
T COG2888 49 NPYRCPKCG 57 (61)
T ss_pred CceECCCcC
Confidence 455555554
No 80
>PF15135 UPF0515: Uncharacterised protein UPF0515
Probab=55.66 E-value=10 Score=30.46 Aligned_cols=11 Identities=27% Similarity=1.090 Sum_probs=5.8
Q ss_pred CCccCCcCCCC
Q psy8702 55 KPYECNSCGKV 65 (194)
Q Consensus 55 k~~~C~~C~k~ 65 (194)
+.|.|..|+..
T Consensus 111 rqFaC~~Cd~~ 121 (278)
T PF15135_consen 111 RQFACSSCDHM 121 (278)
T ss_pred eeeeccccchH
Confidence 44556666433
No 81
>KOG1146|consensus
Probab=54.74 E-value=6.9 Score=38.50 Aligned_cols=51 Identities=20% Similarity=0.316 Sum_probs=41.3
Q ss_pred CCcccccccccccccccccccccchHHHHHhHc-------------------------CCCCCccCCcCCCCcccchHHH
Q psy8702 19 VNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTH-------------------------TDHKPYECNSCGKVFRRNCDLR 73 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h-------------------------~~~k~~~C~~C~k~f~~~~~L~ 73 (194)
..+.|+.|+..|+.. ..|..|||.- -+.++|.|..|...+..+.+|.
T Consensus 464 kt~~cpkc~~~yk~a--------~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~~C~~stttng~Ls 535 (1406)
T KOG1146|consen 464 KTLKCPKCNWHYKLA--------QTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCRACNYSTTTNGNLS 535 (1406)
T ss_pred ccccCCccchhhhhH--------HHhhhcccccccccchhHhHhccccccccccccccCCCCcccceeeeeeeecchHHH
Confidence 667899999999999 6666666651 1236799999999999999999
Q ss_pred Hhhh
Q psy8702 74 RHAL 77 (194)
Q Consensus 74 ~H~~ 77 (194)
+|+.
T Consensus 536 ihlq 539 (1406)
T KOG1146|consen 536 IHLQ 539 (1406)
T ss_pred HHHH
Confidence 9976
No 82
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=53.75 E-value=12 Score=21.90 Aligned_cols=29 Identities=28% Similarity=0.252 Sum_probs=19.8
Q ss_pred CcccccccccccccccccccccchHHHHHh-Hc
Q psy8702 20 NPGLSLYTKSFTWKNLLTNVLTSNLKTHLL-TH 51 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~-~h 51 (194)
.=.|..|++.+...+. ..+++|.+|+. .|
T Consensus 18 ~a~C~~C~~~l~~~~~---~gTs~L~rHl~~~h 47 (50)
T smart00614 18 RAKCKYCGKKLSRSSK---GGTSNLRRHLRRKH 47 (50)
T ss_pred EEEecCCCCEeeeCCC---CCcHHHHHHHHhHC
Confidence 3468999999876532 34468888887 44
No 83
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=53.61 E-value=10 Score=27.89 Aligned_cols=37 Identities=14% Similarity=0.125 Sum_probs=20.8
Q ss_pred CCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCcccc
Q psy8702 52 TDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAES 93 (194)
Q Consensus 52 ~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~ 93 (194)
.+..-|.|+.|+..|.....+.. .. ....|.|+.|+.
T Consensus 95 ~~~~~Y~Cp~C~~~y~~~ea~~~----~d-~~~~f~Cp~Cg~ 131 (147)
T smart00531 95 TNNAYYKCPNCQSKYTFLEANQL----LD-MDGTFTCPRCGE 131 (147)
T ss_pred cCCcEEECcCCCCEeeHHHHHHh----cC-CCCcEECCCCCC
Confidence 34456778888877775544322 01 122377777763
No 84
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=53.00 E-value=9.7 Score=27.62 Aligned_cols=29 Identities=31% Similarity=0.195 Sum_probs=16.9
Q ss_pred CCCCcccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702 17 DSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP 56 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~ 56 (194)
......|-+|||.|.. |++|++.|.|-.|
T Consensus 69 ~~d~i~clecGk~~k~-----------LkrHL~~~~gltp 97 (132)
T PF05443_consen 69 TPDYIICLECGKKFKT-----------LKRHLRTHHGLTP 97 (132)
T ss_dssp -SS-EE-TBT--EESB-----------HHHHHHHTT-S-H
T ss_pred ccCeeEEccCCcccch-----------HHHHHHHccCCCH
Confidence 3444578999999855 7999999977443
No 85
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=47.78 E-value=7.1 Score=23.82 Aligned_cols=23 Identities=43% Similarity=0.807 Sum_probs=10.5
Q ss_pred CCCccCCcCCCCcccchHHHHhh
Q psy8702 54 HKPYECNSCGKVFRRNCDLRRHA 76 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~~~L~~H~ 76 (194)
|.-+.|+-||..|.....+..|+
T Consensus 15 E~~lrCPRC~~~FR~~K~Y~RHV 37 (65)
T COG4049 15 EEFLRCPRCGMVFRRRKDYIRHV 37 (65)
T ss_pred ceeeeCCchhHHHHHhHHHHHHh
Confidence 33344444444444444444443
No 86
>COG4530 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.29 E-value=12 Score=26.23 Aligned_cols=32 Identities=19% Similarity=0.110 Sum_probs=24.4
Q ss_pred CCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCccc
Q psy8702 18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRR 68 (194)
Q Consensus 18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~ 68 (194)
..+--|+.||+.|... +..|..|+.||++|..
T Consensus 7 GtKridPetg~KFYDL-------------------NrdPiVsPytG~s~P~ 38 (129)
T COG4530 7 GTKRIDPETGKKFYDL-------------------NRDPIVSPYTGKSYPR 38 (129)
T ss_pred cccccCccccchhhcc-------------------CCCccccCcccccchH
Confidence 3445689999998764 3478899999999854
No 87
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=46.52 E-value=16 Score=19.58 Aligned_cols=13 Identities=8% Similarity=-0.273 Sum_probs=6.6
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
|.|..||..+..+
T Consensus 1 Y~C~~Cg~~~~~~ 13 (32)
T PF03604_consen 1 YICGECGAEVELK 13 (32)
T ss_dssp EBESSSSSSE-BS
T ss_pred CCCCcCCCeeEcC
Confidence 3466666665543
No 88
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=45.70 E-value=15 Score=31.95 Aligned_cols=37 Identities=19% Similarity=0.323 Sum_probs=27.8
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccchH
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNCD 71 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~ 71 (194)
....+|.|+.||.+.... |.+-|.|+.||..+.....
T Consensus 346 ~~~~~p~Cp~Cg~~m~S~-------------------G~~g~rC~kCg~~~~~~~~ 382 (421)
T COG1571 346 YERVNPVCPRCGGRMKSA-------------------GRNGFRCKKCGTRARETLI 382 (421)
T ss_pred eEEcCCCCCccCCchhhc-------------------CCCCcccccccccCCcccc
Confidence 456778999999886654 4447999999988776543
No 89
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=45.21 E-value=8.7 Score=24.79 Aligned_cols=42 Identities=12% Similarity=0.174 Sum_probs=25.8
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCC--cCCCCcccchHH
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECN--SCGKVFRRNCDL 72 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~--~C~k~f~~~~~L 72 (194)
..|+.||..-.-. .+-... .-.-+.-++|. .||..|.....+
T Consensus 2 m~CP~Cg~~a~ir--------tSr~~s--~~~~~~Y~qC~N~eCg~tF~t~es~ 45 (72)
T PRK09678 2 FHCPLCQHAAHAR--------TSRYIT--DTTKERYHQCQNVNCSATFITYESV 45 (72)
T ss_pred ccCCCCCCccEEE--------EChhcC--hhhheeeeecCCCCCCCEEEEEEEE
Confidence 4689998765333 221111 11345678898 899999886554
No 90
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=43.07 E-value=17 Score=21.00 Aligned_cols=12 Identities=0% Similarity=-0.418 Sum_probs=6.9
Q ss_pred cccccccccccc
Q psy8702 21 PGLSLYTKSFTW 32 (194)
Q Consensus 21 ~~c~~Cgk~f~~ 32 (194)
|.|..||..|.-
T Consensus 3 Y~C~~Cg~~~~~ 14 (44)
T smart00659 3 YICGECGRENEI 14 (44)
T ss_pred EECCCCCCEeec
Confidence 456666665543
No 91
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=42.55 E-value=19 Score=18.02 Aligned_cols=7 Identities=14% Similarity=0.206 Sum_probs=3.0
Q ss_pred CCCCCCC
Q psy8702 101 NLPPAPS 107 (194)
Q Consensus 101 ~~~~~C~ 107 (194)
..|+.||
T Consensus 17 ~fC~~CG 23 (26)
T PF13248_consen 17 KFCPNCG 23 (26)
T ss_pred ccChhhC
Confidence 3344444
No 92
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=41.91 E-value=19 Score=22.23 Aligned_cols=9 Identities=44% Similarity=1.350 Sum_probs=4.4
Q ss_pred ccCCcCCCC
Q psy8702 57 YECNSCGKV 65 (194)
Q Consensus 57 ~~C~~C~k~ 65 (194)
|.|+.||..
T Consensus 26 F~CPnCG~~ 34 (59)
T PRK14890 26 FLCPNCGEV 34 (59)
T ss_pred eeCCCCCCe
Confidence 445555544
No 93
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=41.86 E-value=22 Score=17.96 Aligned_cols=20 Identities=15% Similarity=0.257 Sum_probs=13.1
Q ss_pred ccCCcCCCCcccchHHHHhhh
Q psy8702 57 YECNSCGKVFRRNCDLRRHAL 77 (194)
Q Consensus 57 ~~C~~C~k~f~~~~~L~~H~~ 77 (194)
..|++|++.+ ....++.|..
T Consensus 2 v~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 2 VQCPVCFREV-PENLINSHLD 21 (26)
T ss_pred CcCCCCcCcc-cHHHHHHHHH
Confidence 3578888776 4456666654
No 94
>KOG4167|consensus
Probab=40.36 E-value=11 Score=34.92 Aligned_cols=28 Identities=18% Similarity=0.019 Sum_probs=22.0
Q ss_pred CCCcccccccccccccccccccccchHHHHHhHcCC
Q psy8702 18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTD 53 (194)
Q Consensus 18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~ 53 (194)
...|.|.+|+|.|... .++..||++|.-
T Consensus 790 ~giFpCreC~kvF~Ki--------KSrNAHMK~Hr~ 817 (907)
T KOG4167|consen 790 TGIFPCRECGKVFFKI--------KSRNAHMKTHRQ 817 (907)
T ss_pred CceeehHHHHHHHHHH--------hhhhHHHHHHHH
Confidence 4568899999999998 777777777643
No 95
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=40.25 E-value=19 Score=25.12 Aligned_cols=30 Identities=23% Similarity=0.444 Sum_probs=23.2
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccc
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRN 69 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~ 69 (194)
|.|+.|+-.|++. ....+.|++|+..|...
T Consensus 3 p~CP~C~seytY~-------------------dg~~~iCpeC~~EW~~~ 32 (109)
T TIGR00686 3 PPCPKCNSEYTYH-------------------DGTQLICPSCLYEWNEN 32 (109)
T ss_pred CcCCcCCCcceEe-------------------cCCeeECcccccccccc
Confidence 6799999888876 23468999999877654
No 96
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=39.78 E-value=28 Score=25.88 Aligned_cols=34 Identities=12% Similarity=0.058 Sum_probs=24.3
Q ss_pred hHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 49 LTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 49 ~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
....+..-|.|+.|+..|+....+. ..|.|+.|+
T Consensus 102 ~~e~~~~~Y~Cp~c~~r~tf~eA~~----------~~F~Cp~Cg 135 (158)
T TIGR00373 102 EFETNNMFFICPNMCVRFTFNEAME----------LNFTCPRCG 135 (158)
T ss_pred hhccCCCeEECCCCCcEeeHHHHHH----------cCCcCCCCC
Confidence 3345556788999998888777763 258888876
No 97
>KOG2186|consensus
Probab=39.77 E-value=12 Score=30.13 Aligned_cols=56 Identities=13% Similarity=0.163 Sum_probs=41.0
Q ss_pred CccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCC
Q psy8702 56 PYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLE 130 (194)
Q Consensus 56 ~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~ 130 (194)
-|.|.+||.... +..+..|+.... + .-|.|.-| +++|.. .+...|...-++...|.
T Consensus 3 ~FtCnvCgEsvK-Kp~vekH~srCr-n-~~fSCIDC---------------~k~F~~-~sYknH~kCITEaQKYg 58 (276)
T KOG2186|consen 3 FFTCNVCGESVK-KPQVEKHMSRCR-N-AYFSCIDC---------------GKTFER-VSYKNHTKCITEAQKYG 58 (276)
T ss_pred EEehhhhhhhcc-ccchHHHHHhcc-C-CeeEEeec---------------cccccc-chhhhhhhhcchHHHhh
Confidence 378999998644 567777888776 4 77888766 579987 78888887766655553
No 98
>PTZ00255 60S ribosomal protein L37a; Provisional
Probab=38.40 E-value=15 Score=24.75 Aligned_cols=14 Identities=21% Similarity=0.610 Sum_probs=8.3
Q ss_pred CCccCCcCCCCccc
Q psy8702 55 KPYECNSCGKVFRR 68 (194)
Q Consensus 55 k~~~C~~C~k~f~~ 68 (194)
..+.|..|++.|..
T Consensus 53 GIW~C~~C~~~~AG 66 (90)
T PTZ00255 53 GIWRCKGCKKTVAG 66 (90)
T ss_pred EEEEcCCCCCEEeC
Confidence 34666666665553
No 99
>PHA00626 hypothetical protein
Probab=37.72 E-value=16 Score=22.34 Aligned_cols=16 Identities=25% Similarity=0.619 Sum_probs=12.1
Q ss_pred CCCccCCcCCCCcccc
Q psy8702 54 HKPYECNSCGKVFRRN 69 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~ 69 (194)
...|.|..||..|+..
T Consensus 21 snrYkCkdCGY~ft~~ 36 (59)
T PHA00626 21 SDDYVCCDCGYNDSKD 36 (59)
T ss_pred CcceEcCCCCCeechh
Confidence 3568898898877764
No 100
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=37.62 E-value=11 Score=18.80 Aligned_cols=10 Identities=10% Similarity=0.046 Sum_probs=6.9
Q ss_pred CCCCChhhhCC
Q psy8702 127 RSLEPSDMLGR 137 (194)
Q Consensus 127 kp~~C~~~Cgk 137 (194)
-+|.|+. ||+
T Consensus 15 v~f~CPn-CG~ 24 (24)
T PF07754_consen 15 VPFPCPN-CGF 24 (24)
T ss_pred ceEeCCC-CCC
Confidence 3578886 874
No 101
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=37.42 E-value=21 Score=20.09 Aligned_cols=12 Identities=42% Similarity=1.032 Sum_probs=4.2
Q ss_pred CccCCcCCCCcc
Q psy8702 56 PYECNSCGKVFR 67 (194)
Q Consensus 56 ~~~C~~C~k~f~ 67 (194)
...|..|++.+.
T Consensus 16 ~a~C~~C~~~~~ 27 (45)
T PF02892_consen 16 KAKCKYCGKVIK 27 (45)
T ss_dssp -EEETTTTEE--
T ss_pred eEEeCCCCeEEe
Confidence 344444544433
No 102
>KOG2593|consensus
Probab=36.93 E-value=23 Score=30.80 Aligned_cols=42 Identities=17% Similarity=0.180 Sum_probs=29.3
Q ss_pred HHhHcCCCCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 47 HLLTHTDHKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 47 H~~~h~~~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
-++--+....|.|+.|.+.|+....++. +-. ..-.|.|..|+
T Consensus 119 ~~~d~t~~~~Y~Cp~C~kkyt~Lea~~L---~~~-~~~~F~C~~C~ 160 (436)
T KOG2593|consen 119 RLRDDTNVAGYVCPNCQKKYTSLEALQL---LDN-ETGEFHCENCG 160 (436)
T ss_pred HhhhccccccccCCccccchhhhHHHHh---hcc-cCceEEEecCC
Confidence 3344456678999999999998766543 222 45678888886
No 103
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=36.73 E-value=18 Score=26.09 Aligned_cols=16 Identities=44% Similarity=1.124 Sum_probs=12.0
Q ss_pred CCCccCCcCCCCcccc
Q psy8702 54 HKPYECNSCGKVFRRN 69 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~ 69 (194)
...|.|..|++.|...
T Consensus 51 ~qRyrC~~C~~tf~~~ 66 (129)
T COG3677 51 HQRYKCKSCGSTFTVE 66 (129)
T ss_pred ccccccCCcCcceeee
Confidence 4668888888887754
No 104
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.71 E-value=17 Score=22.60 Aligned_cols=30 Identities=7% Similarity=-0.114 Sum_probs=20.6
Q ss_pred CCCCCChhhHhhhccCCCCCCCCChhhhCCC
Q psy8702 108 PTGSQSSEDIEIERTPSEERSLEPSDMLGRR 138 (194)
Q Consensus 108 k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~ 138 (194)
..+.....+..-+.....-+.|.|++ |.-+
T Consensus 11 v~~iD~rt~~tKrLrN~PIrtymC~e-C~~R 40 (68)
T COG4896 11 VDEIDNRTFKTKRLRNKPIRTYMCPE-CEHR 40 (68)
T ss_pred eeeecchhHHHHHhhCCCceeEechh-hHhh
Confidence 45666666665555555556799998 9877
No 105
>PRK10220 hypothetical protein; Provisional
Probab=34.42 E-value=29 Score=24.23 Aligned_cols=31 Identities=19% Similarity=0.490 Sum_probs=23.6
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCCcccch
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKVFRRNC 70 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~ 70 (194)
|.|+.|+-.|++. ....|.|++|+..|....
T Consensus 4 P~CP~C~seytY~-------------------d~~~~vCpeC~hEW~~~~ 34 (111)
T PRK10220 4 PHCPKCNSEYTYE-------------------DNGMYICPECAHEWNDAE 34 (111)
T ss_pred CcCCCCCCcceEc-------------------CCCeEECCcccCcCCccc
Confidence 6799999888876 234689999998776553
No 106
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=34.07 E-value=19 Score=28.32 Aligned_cols=18 Identities=22% Similarity=0.056 Sum_probs=11.6
Q ss_pred CCCCCccccccccccccc
Q psy8702 16 KDSVNPGLSLYTKSFTWK 33 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~ 33 (194)
....+|.|+.|+|.|.-.
T Consensus 73 ~~~~K~~C~lc~KlFkg~ 90 (214)
T PF04959_consen 73 EDEDKWRCPLCGKLFKGP 90 (214)
T ss_dssp SSSEEEEE-SSS-EESSH
T ss_pred HcCCEECCCCCCcccCCh
Confidence 455677788888888777
No 107
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=33.20 E-value=13 Score=31.42 Aligned_cols=58 Identities=16% Similarity=0.100 Sum_probs=35.6
Q ss_pred cCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCCCCCChhhHhhhccCCCCCCCCCh
Q psy8702 58 ECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPTGSQSSEDIEIERTPSEERSLEPS 132 (194)
Q Consensus 58 ~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~ 132 (194)
.|..|..-|.....|..|++..+ |+-|.|+.-+ +.=.+-|..-.+|..|-+ ..-|-|.
T Consensus 222 ~C~FC~~~FYdDDEL~~HcR~~H--E~ChICD~v~-----------p~~~QYFK~Y~~Le~HF~----~~hy~ct 279 (493)
T COG5236 222 LCIFCKIYFYDDDELRRHCRLRH--EACHICDMVG-----------PIRYQYFKSYEDLEAHFR----NAHYCCT 279 (493)
T ss_pred hhhhccceecChHHHHHHHHhhh--hhhhhhhccC-----------ccchhhhhCHHHHHHHhh----cCceEEE
Confidence 47888888888888888887655 5555554321 011234666677777754 2335564
No 108
>KOG3583|consensus
Probab=33.13 E-value=23 Score=27.97 Aligned_cols=24 Identities=38% Similarity=0.654 Sum_probs=20.0
Q ss_pred cccCCCCCCCCCCCCcccCCCCCC
Q psy8702 171 RHSQSPQPAHSSSIPFRHHSPIKF 194 (194)
Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~ 194 (194)
.|+|-|+-.+--.||.|||+|.+|
T Consensus 254 s~hq~P~~s~~~~~~~~~~~~~~~ 277 (279)
T KOG3583|consen 254 SHHQQPQYSHQQPMNPQHHSPMLL 277 (279)
T ss_pred hhccCccccccCCCCcccccchhh
Confidence 455778888889999999999875
No 109
>TIGR00280 L37a ribosomal protein L37a. This model finds eukaryotic ribosomal protein L37a and its archaeal orthologs. The nomeclature is tricky because eukaryotes have proteins called both L37 and L37a.
Probab=32.83 E-value=18 Score=24.45 Aligned_cols=12 Identities=33% Similarity=0.991 Sum_probs=7.4
Q ss_pred CccCCcCCCCcc
Q psy8702 56 PYECNSCGKVFR 67 (194)
Q Consensus 56 ~~~C~~C~k~f~ 67 (194)
.+.|..|++.|.
T Consensus 53 IW~C~~C~~~~A 64 (91)
T TIGR00280 53 IWTCRKCGAKFA 64 (91)
T ss_pred EEEcCCCCCEEe
Confidence 466666666554
No 110
>PF01780 Ribosomal_L37ae: Ribosomal L37ae protein family; InterPro: IPR002674 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This ribosomal protein is found in archaebacteria and eukaryotes []. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 4A1E_Y 4A17_Y 4A1C_Y 4A1A_Y 3O58_g 3IZS_m 3O5H_g 1S1I_9 3IZR_m 1YSH_D ....
Probab=32.70 E-value=12 Score=25.26 Aligned_cols=12 Identities=42% Similarity=1.140 Sum_probs=7.0
Q ss_pred CccCCcCCCCcc
Q psy8702 56 PYECNSCGKVFR 67 (194)
Q Consensus 56 ~~~C~~C~k~f~ 67 (194)
.|.|..|++.|.
T Consensus 53 IW~C~~C~~~~A 64 (90)
T PF01780_consen 53 IWKCKKCGKKFA 64 (90)
T ss_dssp EEEETTTTEEEE
T ss_pred EeecCCCCCEEe
Confidence 366666665554
No 111
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=32.16 E-value=26 Score=29.25 Aligned_cols=49 Identities=16% Similarity=0.021 Sum_probs=26.8
Q ss_pred CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCC
Q psy8702 55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSP 108 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k 108 (194)
..-.|++||..-... -+ +..... |.+-..|..|+. .=......|+.||.
T Consensus 186 ~~~~CPvCGs~P~~s-~v--~~~~~~-G~RyL~CslC~t-eW~~~R~~C~~Cg~ 234 (309)
T PRK03564 186 QRQFCPVCGSMPVSS-VV--QIGTTQ-GLRYLHCNLCES-EWHVVRVKCSNCEQ 234 (309)
T ss_pred CCCCCCCCCCcchhh-ee--eccCCC-CceEEEcCCCCC-cccccCccCCCCCC
Confidence 446799998642211 11 112233 666678887752 11223556889985
No 112
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=31.76 E-value=25 Score=20.89 Aligned_cols=11 Identities=0% Similarity=-0.133 Sum_probs=6.9
Q ss_pred Ccccccccccc
Q psy8702 20 NPGLSLYTKSF 30 (194)
Q Consensus 20 ~~~c~~Cgk~f 30 (194)
.|.|..||+.|
T Consensus 6 ~Y~C~~Cg~~~ 16 (49)
T COG1996 6 EYKCARCGREV 16 (49)
T ss_pred EEEhhhcCCee
Confidence 35566666666
No 113
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=30.87 E-value=19 Score=18.99 Aligned_cols=12 Identities=25% Similarity=0.551 Sum_probs=2.6
Q ss_pred cccccccccccc
Q psy8702 21 PGLSLYTKSFTW 32 (194)
Q Consensus 21 ~~c~~Cgk~f~~ 32 (194)
+.|+.|+-.+..
T Consensus 3 p~Cp~C~se~~y 14 (30)
T PF08274_consen 3 PKCPLCGSEYTY 14 (30)
T ss_dssp ---TTT-----E
T ss_pred CCCCCCCCccee
Confidence 457777766544
No 114
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=30.49 E-value=39 Score=20.49 Aligned_cols=14 Identities=14% Similarity=-0.012 Sum_probs=9.7
Q ss_pred Cccccccccccccc
Q psy8702 20 NPGLSLYTKSFTWK 33 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~ 33 (194)
...|+.||+.|...
T Consensus 5 ~~~C~~Cg~~~~~~ 18 (54)
T PF14446_consen 5 GCKCPVCGKKFKDG 18 (54)
T ss_pred CccChhhCCcccCC
Confidence 34578888887654
No 115
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=29.77 E-value=43 Score=19.87 Aligned_cols=17 Identities=29% Similarity=0.681 Sum_probs=11.1
Q ss_pred CCccCCcCCCCcccchH
Q psy8702 55 KPYECNSCGKVFRRNCD 71 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~ 71 (194)
+.+.|..||..|.....
T Consensus 3 k~l~C~dCg~~FvfTa~ 19 (49)
T PF13451_consen 3 KTLTCKDCGAEFVFTAG 19 (49)
T ss_pred eeEEcccCCCeEEEehh
Confidence 45667777777766544
No 116
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=29.39 E-value=36 Score=18.64 Aligned_cols=12 Identities=0% Similarity=-0.114 Sum_probs=8.0
Q ss_pred cccccccccccc
Q psy8702 22 GLSLYTKSFTWK 33 (194)
Q Consensus 22 ~c~~Cgk~f~~~ 33 (194)
.|+.||+.|...
T Consensus 3 ~C~~Cg~~Yh~~ 14 (36)
T PF05191_consen 3 ICPKCGRIYHIE 14 (36)
T ss_dssp EETTTTEEEETT
T ss_pred CcCCCCCccccc
Confidence 467777777654
No 117
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=29.19 E-value=83 Score=29.13 Aligned_cols=8 Identities=13% Similarity=0.202 Sum_probs=3.9
Q ss_pred CChhhhCCC
Q psy8702 130 EPSDMLGRR 138 (194)
Q Consensus 130 ~C~~~Cgk~ 138 (194)
-|++ ||..
T Consensus 43 fC~~-CG~~ 50 (645)
T PRK14559 43 HCPN-CGAE 50 (645)
T ss_pred cccc-cCCc
Confidence 4554 5544
No 118
>PF01286 XPA_N: XPA protein N-terminal; InterPro: IPR022652 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. Skin cells of individual's with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-A is the most severe form of the disease and is due to defects in a 30 kDa nuclear protein called XPA (or XPAC) []. The sequence of the XPA protein is conserved from higher eukaryotes [] to yeast (gene RAD14) []. XPA is a hydrophilic protein of 247 to 296 amino-acid residues which has a C4-type zinc finger motif in its central section. This entry contains the zinc-finger containing region in the XPA protein. It is found N-terminal to PF05181 from PFAM ; PDB: 1D4U_A 1XPA_A.
Probab=29.05 E-value=25 Score=19.14 Aligned_cols=14 Identities=21% Similarity=0.112 Sum_probs=7.4
Q ss_pred Cccccccccccccc
Q psy8702 20 NPGLSLYTKSFTWK 33 (194)
Q Consensus 20 ~~~c~~Cgk~f~~~ 33 (194)
.+.|.+||+.|..+
T Consensus 3 ~~~C~eC~~~f~dS 16 (34)
T PF01286_consen 3 YPKCDECGKPFMDS 16 (34)
T ss_dssp -EE-TTT--EES-S
T ss_pred CchHhHhCCHHHHH
Confidence 46789999999885
No 119
>KOG2593|consensus
Probab=28.96 E-value=39 Score=29.41 Aligned_cols=39 Identities=15% Similarity=0.283 Sum_probs=24.7
Q ss_pred CCCCCcccccccccccccccccccccchHHHHHhHcCCCCCccCCcCCCC
Q psy8702 16 KDSVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKPYECNSCGKV 65 (194)
Q Consensus 16 ~~~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~~~C~~C~k~ 65 (194)
-....|.|+.|.+.|+.- -.-+..-...-.|.|..|+-.
T Consensus 124 t~~~~Y~Cp~C~kkyt~L-----------ea~~L~~~~~~~F~C~~C~ge 162 (436)
T KOG2593|consen 124 TNVAGYVCPNCQKKYTSL-----------EALQLLDNETGEFHCENCGGE 162 (436)
T ss_pred cccccccCCccccchhhh-----------HHHHhhcccCceEEEecCCCc
Confidence 345668899999888663 333333333456888888753
No 120
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=28.32 E-value=26 Score=32.80 Aligned_cols=9 Identities=0% Similarity=-0.500 Sum_probs=4.0
Q ss_pred ccccccccc
Q psy8702 23 LSLYTKSFT 31 (194)
Q Consensus 23 c~~Cgk~f~ 31 (194)
|..||..+.
T Consensus 438 C~~Cg~v~~ 446 (730)
T COG1198 438 CRDCGYIAE 446 (730)
T ss_pred cccCCCccc
Confidence 444444433
No 121
>PF01363 FYVE: FYVE zinc finger; InterPro: IPR000306 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two zinc ions []. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. FYVE-type domains are divided into two known classes: FYVE domains that specifically bind to phosphatidylinositol 3-phosphate in lipid bilayers and FYVE-related domains of undetermined function []. Those that bind to phosphatidylinositol 3-phosphate are often found in proteins targeted to lipid membranes that are involved in regulating membrane traffic [, , ]. Most FYVE domains target proteins to endosomes by binding specifically to phosphatidylinositol-3-phosphate at the membrane surface. By contrast, the CARP2 FYVE-like domain is not optimized to bind to phosphoinositides or insert into lipid bilayers. FYVE domains are distinguished from other zinc fingers by three signature sequences: an N-terminal WxxD motif, a basic R(R/K)HHCR patch, and a C-terminal RVC motif. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0046872 metal ion binding; PDB: 1HYI_A 1JOC_B 1HYJ_A 1DVP_A 3ZYQ_A 4AVX_A 1VFY_A 3T7L_A 1X4U_A 1WFK_A ....
Probab=27.85 E-value=31 Score=21.38 Aligned_cols=16 Identities=25% Similarity=0.312 Sum_probs=7.0
Q ss_pred CCCCcccccccccccc
Q psy8702 17 DSVNPGLSLYTKSFTW 32 (194)
Q Consensus 17 ~~~~~~c~~Cgk~f~~ 32 (194)
+.+...|..|++.|..
T Consensus 6 d~~~~~C~~C~~~F~~ 21 (69)
T PF01363_consen 6 DSEASNCMICGKKFSL 21 (69)
T ss_dssp GGG-SB-TTT--B-BS
T ss_pred CCCCCcCcCcCCcCCC
Confidence 3455678888888843
No 122
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=26.98 E-value=39 Score=24.62 Aligned_cols=25 Identities=36% Similarity=0.296 Sum_probs=19.6
Q ss_pred cccccccccccccccccccccchHHHHHhHcCCCCC
Q psy8702 21 PGLSLYTKSFTWKNLLTNVLTSNLKTHLLTHTDHKP 56 (194)
Q Consensus 21 ~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~~h~~~k~ 56 (194)
..|-++||.|. +|++|+.+|.+--|
T Consensus 77 IicLEDGkkfK-----------SLKRHL~t~~gmTP 101 (148)
T COG4957 77 IICLEDGKKFK-----------SLKRHLTTHYGLTP 101 (148)
T ss_pred EEEeccCcchH-----------HHHHHHhcccCCCH
Confidence 46889999994 48999999877543
No 123
>PRK03976 rpl37ae 50S ribosomal protein L37Ae; Reviewed
Probab=26.86 E-value=25 Score=23.70 Aligned_cols=12 Identities=42% Similarity=1.093 Sum_probs=6.8
Q ss_pred CccCCcCCCCcc
Q psy8702 56 PYECNSCGKVFR 67 (194)
Q Consensus 56 ~~~C~~C~k~f~ 67 (194)
.+.|..|++.|.
T Consensus 54 IW~C~~C~~~~A 65 (90)
T PRK03976 54 IWECRKCGAKFA 65 (90)
T ss_pred EEEcCCCCCEEe
Confidence 456666665554
No 124
>COG1655 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.25 E-value=16 Score=29.13 Aligned_cols=30 Identities=27% Similarity=0.449 Sum_probs=18.5
Q ss_pred CCCccCCcCCCCcccchHHHHhhhhcCCCCC
Q psy8702 54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDV 84 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k 84 (194)
++...|++|+-.|....-+..-.++-+ |+-
T Consensus 17 kk~ieCPvC~tkFkkeev~tgsiRiia-gDl 46 (267)
T COG1655 17 KKTIECPVCNTKFKKEEVKTGSIRIIA-GDL 46 (267)
T ss_pred hceeccCcccchhhhhheeccceeEec-ccc
Confidence 456778888877776655555555544 443
No 125
>PF06906 DUF1272: Protein of unknown function (DUF1272); InterPro: IPR010696 This family consists of several hypothetical bacterial proteins of around 80 residues in length. This family contains a number of conserved cysteine residues and its function is unknown.
Probab=25.94 E-value=31 Score=21.09 Aligned_cols=27 Identities=7% Similarity=-0.018 Sum_probs=17.9
Q ss_pred CCCCCccccccccccCCCCCCCCCCCC
Q psy8702 85 PPESSLAESIGSLKRDNLPPAPSPTGS 111 (194)
Q Consensus 85 ~~~c~~c~~~~~~~~~~~~~~C~k~F~ 111 (194)
.|+|.+|..+........|++||=.|.
T Consensus 26 SfECTFC~~C~e~~l~~~CPNCgGelv 52 (57)
T PF06906_consen 26 SFECTFCADCAETMLNGVCPNCGGELV 52 (57)
T ss_pred eEeCcccHHHHHHHhcCcCcCCCCccc
Confidence 478888876555555677777776554
No 126
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=25.69 E-value=46 Score=22.47 Aligned_cols=28 Identities=25% Similarity=0.541 Sum_probs=19.8
Q ss_pred CCCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccc
Q psy8702 54 HKPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAE 92 (194)
Q Consensus 54 ~k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~ 92 (194)
.+|-.|..||..|.... -.+|..|+.|.
T Consensus 56 v~Pa~CkkCGfef~~~~-----------ik~pSRCP~CK 83 (97)
T COG3357 56 VRPARCKKCGFEFRDDK-----------IKKPSRCPKCK 83 (97)
T ss_pred ecChhhcccCccccccc-----------cCCcccCCcch
Confidence 36788999998888731 24567777775
No 127
>PF15269 zf-C2H2_7: Zinc-finger
Probab=25.45 E-value=1.1e+02 Score=17.77 Aligned_cols=24 Identities=17% Similarity=-0.013 Sum_probs=16.0
Q ss_pred CCCcccccccccccccccccccccchHHHHHh
Q psy8702 18 SVNPGLSLYTKSFTWKNLLTNVLTSNLKTHLL 49 (194)
Q Consensus 18 ~~~~~c~~Cgk~f~~~~~L~~H~~s~L~~H~~ 49 (194)
..+|.|=.|.-+...+ +.|-.||+
T Consensus 18 p~~ykcfqcpftc~~k--------shl~nhmk 41 (54)
T PF15269_consen 18 PFKYKCFQCPFTCNEK--------SHLFNHMK 41 (54)
T ss_pred CccceeecCCcccchH--------HHHHHHHH
Confidence 3456777888887788 55555554
No 128
>PF13878 zf-C2H2_3: zinc-finger of acetyl-transferase ESCO
Probab=25.32 E-value=76 Score=17.81 Aligned_cols=13 Identities=0% Similarity=-0.202 Sum_probs=8.3
Q ss_pred ccccccccccccc
Q psy8702 21 PGLSLYTKSFTWK 33 (194)
Q Consensus 21 ~~c~~Cgk~f~~~ 33 (194)
-.|+.||..|...
T Consensus 14 ~~C~~CgM~Y~~~ 26 (41)
T PF13878_consen 14 TTCPTCGMLYSPG 26 (41)
T ss_pred cCCCCCCCEECCC
Confidence 3577777776554
No 129
>KOG4124|consensus
Probab=25.20 E-value=14 Score=31.28 Aligned_cols=58 Identities=22% Similarity=0.291 Sum_probs=33.8
Q ss_pred CCCccc--ccccccccccccccccc-----------cchHHHHHhHcCCCCCccCCcCCCCcccchHHHHh
Q psy8702 18 SVNPGL--SLYTKSFTWKNLLTNVL-----------TSNLKTHLLTHTDHKPYECNSCGKVFRRNCDLRRH 75 (194)
Q Consensus 18 ~~~~~c--~~Cgk~f~~~~~L~~H~-----------~s~L~~H~~~h~~~k~~~C~~C~k~f~~~~~L~~H 75 (194)
...|.| +.|.+.+...+.|..|. ...-+-|+-.....|+|.|++|.+.+.....|..|
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~i~~~~~k~~~~l~~~ 417 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCEVCSKRYKNLNGLKYH 417 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccChhhhhhhccCCCCCce
Confidence 455666 89999999997776651 01111122222335677777777766665554444
No 130
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=24.63 E-value=37 Score=26.73 Aligned_cols=28 Identities=29% Similarity=0.692 Sum_probs=20.9
Q ss_pred CCCCccCCcCCCCcccchHHHHhhhhcC
Q psy8702 53 DHKPYECNSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 53 ~~k~~~C~~C~k~f~~~~~L~~H~~~h~ 80 (194)
++.-|.|..|+|.|.-...+.+|+..-+
T Consensus 74 ~~~K~~C~lc~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 74 DEDKWRCPLCGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp SSEEEEE-SSS-EESSHHHHHHHHHHH-
T ss_pred cCCEECCCCCCcccCChHHHHHHHhhcC
Confidence 3446999999999999999999987544
No 131
>TIGR00244 transcriptional regulator NrdR. Members of this almost entirely bacterial family contain an ATP cone domain (PFAM:PF03477). There is never more than one member per genome. Common gene symbols given include nrdR, ybaD, ribX and ytcG. The member from Streptomyces coelicolor is found upstream in the operon of the class II oxygen-independent ribonucleotide reductase gene nrdJ and was shown to repress nrdJ expression. Many members of this family are found near genes for riboflavin biosynthesis in Gram-negative bacteria, suggesting a role in that pathway. However, a phylogenetic profiling study associates members of this family with the presence of a palindromic signal with consensus acaCwAtATaTwGtgt, termed the NrdR-box, an upstream element for most operons for ribonucleotide reductase of all three classes in bacterial genomes.
Probab=24.57 E-value=44 Score=24.68 Aligned_cols=11 Identities=18% Similarity=-0.055 Sum_probs=6.1
Q ss_pred CCChhhhCCCcc
Q psy8702 129 LEPSDMLGRRSR 140 (194)
Q Consensus 129 ~~C~~~Cgk~f~ 140 (194)
=+|.. ||++|+
T Consensus 29 ReC~~-C~~RFT 39 (147)
T TIGR00244 29 RECLE-CHERFT 39 (147)
T ss_pred ccCCc-cCCccc
Confidence 45654 666644
No 132
>KOG1813|consensus
Probab=24.41 E-value=48 Score=27.45 Aligned_cols=45 Identities=16% Similarity=0.140 Sum_probs=23.0
Q ss_pred CCccCCcCCCCcccchHHHHhhhhcCCCCCCCCCCccccccccccCCCCCCCCCC
Q psy8702 55 KPYECNSCGKVFRRNCDLRRHALTHAVGDVPPESSLAESIGSLKRDNLPPAPSPT 109 (194)
Q Consensus 55 k~~~C~~C~k~f~~~~~L~~H~~~h~~~~k~~~c~~c~~~~~~~~~~~~~~C~k~ 109 (194)
-||.|.+|.+.|... +-+ .-.-|-|..|.. ...++...|.+|++.
T Consensus 240 ~Pf~c~icr~~f~~p--------Vvt-~c~h~fc~~ca~-~~~qk~~~c~vC~~~ 284 (313)
T KOG1813|consen 240 LPFKCFICRKYFYRP--------VVT-KCGHYFCEVCAL-KPYQKGEKCYVCSQQ 284 (313)
T ss_pred CCccccccccccccc--------hhh-cCCceeehhhhc-cccccCCcceecccc
Confidence 467777777766643 112 234455655542 333444555555543
No 133
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=24.27 E-value=36 Score=19.54 Aligned_cols=9 Identities=33% Similarity=1.114 Sum_probs=4.3
Q ss_pred CCccCCcCC
Q psy8702 55 KPYECNSCG 63 (194)
Q Consensus 55 k~~~C~~C~ 63 (194)
..|.|..|+
T Consensus 36 ~~~~C~~C~ 44 (46)
T PF12760_consen 36 GRYRCKACR 44 (46)
T ss_pred CeEECCCCC
Confidence 345555544
No 134
>PF13824 zf-Mss51: Zinc-finger of mitochondrial splicing suppressor 51
Probab=24.09 E-value=64 Score=19.64 Aligned_cols=14 Identities=21% Similarity=0.218 Sum_probs=9.4
Q ss_pred CCCCCCCChhhhCCC
Q psy8702 124 SEERSLEPSDMLGRR 138 (194)
Q Consensus 124 ~~ekp~~C~~~Cgk~ 138 (194)
.+.-.|.|+. ||-.
T Consensus 10 ~~~v~~~Cp~-cGip 23 (55)
T PF13824_consen 10 PAHVNFECPD-CGIP 23 (55)
T ss_pred ccccCCcCCC-CCCc
Confidence 3455688886 8866
No 135
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=23.33 E-value=53 Score=22.53 Aligned_cols=24 Identities=29% Similarity=0.668 Sum_probs=21.4
Q ss_pred ccC----CcCCCCcccchHHHHhhhhcC
Q psy8702 57 YEC----NSCGKVFRRNCDLRRHALTHA 80 (194)
Q Consensus 57 ~~C----~~C~k~f~~~~~L~~H~~~h~ 80 (194)
|.| ..|+....+...+..|.+.++
T Consensus 81 ~~C~~~~~~C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPHCGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCCCCcEeccHHHHHHHHHHhc
Confidence 899 899999999999999988765
No 136
>COG1327 Predicted transcriptional regulator, consists of a Zn-ribbon and ATP-cone domains [Transcription]
Probab=23.14 E-value=52 Score=24.45 Aligned_cols=11 Identities=27% Similarity=0.187 Sum_probs=6.2
Q ss_pred CCChhhhCCCcc
Q psy8702 129 LEPSDMLGRRSR 140 (194)
Q Consensus 129 ~~C~~~Cgk~f~ 140 (194)
-+|+. ||++|+
T Consensus 29 ReC~~-C~~RFT 39 (156)
T COG1327 29 RECLE-CGERFT 39 (156)
T ss_pred hcccc-cccccc
Confidence 45654 666644
No 137
>PRK12496 hypothetical protein; Provisional
Probab=23.02 E-value=46 Score=24.94 Aligned_cols=14 Identities=7% Similarity=-0.192 Sum_probs=10.8
Q ss_pred CCcccccccccccc
Q psy8702 19 VNPGLSLYTKSFTW 32 (194)
Q Consensus 19 ~~~~c~~Cgk~f~~ 32 (194)
..+.|..||+.|..
T Consensus 126 w~~~C~gC~~~~~~ 139 (164)
T PRK12496 126 WRKVCKGCKKKYPE 139 (164)
T ss_pred eeEECCCCCccccC
Confidence 44778999988865
No 138
>PF12773 DZR: Double zinc ribbon
Probab=22.87 E-value=92 Score=17.82 Aligned_cols=27 Identities=11% Similarity=0.145 Sum_probs=14.5
Q ss_pred CCCCCCCCCCCCCChhhHhhhccCCCCCCCCChhhhCCC
Q psy8702 100 DNLPPAPSPTGSQSSEDIEIERTPSEERSLEPSDMLGRR 138 (194)
Q Consensus 100 ~~~~~~C~k~F~~~~~L~~H~~~H~~ekp~~C~~~Cgk~ 138 (194)
...|+.||.... -.......|+. ||..
T Consensus 12 ~~fC~~CG~~l~-----------~~~~~~~~C~~-Cg~~ 38 (50)
T PF12773_consen 12 AKFCPHCGTPLP-----------PPDQSKKICPN-CGAE 38 (50)
T ss_pred ccCChhhcCChh-----------hccCCCCCCcC-CcCC
Confidence 444566665554 12233466775 7765
No 139
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=22.08 E-value=49 Score=31.04 Aligned_cols=23 Identities=9% Similarity=-0.197 Sum_probs=14.0
Q ss_pred hhHhhhccCCCCCCCCChhhhCCC
Q psy8702 115 EDIEIERTPSEERSLEPSDMLGRR 138 (194)
Q Consensus 115 ~L~~H~~~H~~ekp~~C~~~Cgk~ 138 (194)
.|.-|.--+....|..|++ ||-.
T Consensus 462 ~L~CH~Cg~~~~~p~~Cp~-Cgs~ 484 (730)
T COG1198 462 QLRCHYCGYQEPIPQSCPE-CGSE 484 (730)
T ss_pred eeEeCCCCCCCCCCCCCCC-CCCC
Confidence 3334443445567888887 8855
No 140
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=20.84 E-value=68 Score=18.97 Aligned_cols=12 Identities=25% Similarity=0.783 Sum_probs=9.0
Q ss_pred CCccCCcCCCCc
Q psy8702 55 KPYECNSCGKVF 66 (194)
Q Consensus 55 k~~~C~~C~k~f 66 (194)
..+.|..||..+
T Consensus 36 ~r~~C~~Cgyt~ 47 (50)
T PRK00432 36 DRWHCGKCGYTE 47 (50)
T ss_pred CcEECCCcCCEE
Confidence 468898888754
Done!