Query psy8460
Match_columns 86
No_of_seqs 122 out of 1037
Neff 6.7
Searched_HMMs 46136
Date Fri Aug 16 19:28:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8460.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8460hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0034|consensus 99.4 2E-12 4.4E-17 89.7 7.7 66 2-69 121-186 (187)
2 KOG0038|consensus 99.3 3.7E-12 8E-17 86.3 7.0 65 2-70 125-189 (189)
3 KOG0044|consensus 99.1 5.6E-10 1.2E-14 77.9 6.7 68 2-70 117-187 (193)
4 PF13499 EF-hand_7: EF-hand do 98.9 3.5E-09 7.5E-14 60.7 5.4 50 2-56 17-66 (66)
5 PF13833 EF-hand_8: EF-hand do 98.8 1.2E-08 2.5E-13 56.7 5.0 47 2-57 5-52 (54)
6 cd05023 S-100A11 S-100A11: S-1 98.7 3.5E-08 7.6E-13 61.0 5.4 53 2-58 28-80 (89)
7 cd05022 S-100A13 S-100A13: S-1 98.7 3.4E-08 7.3E-13 61.2 5.0 49 2-58 26-75 (89)
8 PF00036 EF-hand_1: EF hand; 98.7 1.6E-08 3.6E-13 50.7 2.6 27 32-58 2-28 (29)
9 cd05026 S-100Z S-100Z: S-100Z 98.7 7.7E-08 1.7E-12 59.6 5.5 53 2-58 29-81 (93)
10 cd05027 S-100B S-100B: S-100B 98.6 1.2E-07 2.6E-12 58.4 5.6 52 2-57 27-78 (88)
11 cd05029 S-100A6 S-100A6: S-100 98.6 1.5E-07 3.2E-12 58.0 5.9 51 2-58 29-79 (88)
12 COG5126 FRQ1 Ca2+-binding prot 98.6 1E-07 2.2E-12 64.9 5.0 52 1-61 108-159 (160)
13 cd05031 S-100A10_like S-100A10 98.5 3.5E-07 7.7E-12 56.3 5.3 57 2-62 27-83 (94)
14 cd05030 calgranulins Calgranul 98.5 4.8E-07 1E-11 55.4 5.6 49 2-58 27-79 (88)
15 PF13202 EF-hand_5: EF hand; P 98.4 3.9E-07 8.5E-12 44.1 2.9 24 33-56 2-25 (25)
16 cd05024 S-100A10 S-100A10: A s 98.4 1.2E-06 2.6E-11 54.7 5.5 53 2-58 24-76 (91)
17 KOG0027|consensus 98.4 1.3E-06 2.9E-11 57.6 6.0 49 2-59 102-150 (151)
18 cd05025 S-100A1 S-100A1: S-100 98.3 2E-06 4.4E-11 52.6 5.5 53 2-58 28-80 (92)
19 cd00213 S-100 S-100: S-100 dom 98.1 1E-05 2.3E-10 48.7 6.2 54 2-59 27-80 (88)
20 KOG0028|consensus 98.1 8.1E-06 1.8E-10 55.9 5.7 38 19-60 135-172 (172)
21 PTZ00183 centrin; Provisional 98.1 8.8E-06 1.9E-10 52.6 5.5 51 2-61 107-157 (158)
22 cd00052 EH Eps15 homology doma 97.9 5.4E-05 1.2E-09 42.6 5.6 47 2-59 16-62 (67)
23 PF13405 EF-hand_6: EF-hand do 97.9 1.6E-05 3.4E-10 39.7 2.9 26 33-58 3-28 (31)
24 KOG0027|consensus 97.8 5.8E-05 1.3E-09 49.8 5.9 49 2-59 25-73 (151)
25 cd00051 EFh EF-hand, calcium b 97.7 0.00018 3.9E-09 38.5 5.7 46 2-56 17-62 (63)
26 cd00252 SPARC_EC SPARC_EC; ext 97.7 0.00024 5.2E-09 45.9 7.0 29 30-58 80-108 (116)
27 smart00027 EH Eps15 homology d 97.7 0.00016 3.5E-09 44.4 5.5 58 2-72 27-84 (96)
28 PTZ00184 calmodulin; Provision 97.7 0.00018 3.9E-09 45.7 5.8 23 35-57 125-147 (149)
29 smart00054 EFh EF-hand, calciu 97.5 0.00016 3.4E-09 33.2 3.1 26 33-58 3-28 (29)
30 KOG4065|consensus 97.4 0.00016 3.4E-09 47.6 3.4 34 23-56 110-143 (144)
31 PF14788 EF-hand_10: EF hand; 97.4 0.00062 1.3E-08 38.4 4.8 47 2-57 2-48 (51)
32 PF13499 EF-hand_7: EF-hand do 97.3 0.00034 7.4E-09 39.6 3.0 27 32-58 2-28 (66)
33 PTZ00183 centrin; Provisional 97.1 0.002 4.4E-08 41.4 5.8 48 2-58 34-81 (158)
34 KOG0044|consensus 97.1 0.002 4.2E-08 45.2 5.9 28 30-57 64-91 (193)
35 KOG0030|consensus 97.0 0.0017 3.6E-08 43.8 4.7 46 2-57 105-150 (152)
36 PTZ00184 calmodulin; Provision 97.0 0.0032 6.9E-08 39.8 5.9 48 2-58 28-75 (149)
37 PF12763 EF-hand_4: Cytoskelet 96.8 0.004 8.7E-08 39.5 5.2 34 22-59 39-72 (104)
38 PF14658 EF-hand_9: EF-hand do 96.6 0.01 2.2E-07 35.0 5.5 46 5-58 18-64 (66)
39 KOG2643|consensus 96.6 0.0013 2.8E-08 51.2 2.0 49 2-58 405-453 (489)
40 smart00027 EH Eps15 homology d 96.6 0.0043 9.3E-08 38.0 4.0 35 23-58 4-38 (96)
41 COG5126 FRQ1 Ca2+-binding prot 96.5 0.0094 2E-07 40.7 5.7 46 2-57 37-82 (160)
42 PRK12309 transaldolase/EF-hand 96.4 0.012 2.6E-07 45.0 6.5 35 22-57 350-384 (391)
43 KOG4223|consensus 96.2 0.0078 1.7E-07 45.1 4.2 30 30-59 200-229 (325)
44 PLN02964 phosphatidylserine de 96.0 0.016 3.5E-07 47.0 5.5 49 2-59 196-244 (644)
45 cd00051 EFh EF-hand, calcium b 95.9 0.012 2.7E-07 31.0 3.2 26 33-58 3-28 (63)
46 cd00213 S-100 S-100: S-100 dom 95.9 0.028 6.1E-07 33.5 5.1 32 26-58 5-38 (88)
47 KOG0031|consensus 95.8 0.025 5.3E-07 38.8 4.9 35 19-57 130-164 (171)
48 KOG0028|consensus 95.7 0.041 8.9E-07 37.9 5.6 72 3-82 87-167 (172)
49 cd00052 EH Eps15 homology doma 95.6 0.014 3E-07 32.5 2.7 25 34-58 3-27 (67)
50 KOG0036|consensus 95.6 0.029 6.4E-07 43.6 5.1 50 2-60 99-148 (463)
51 cd05025 S-100A1 S-100A1: S-100 95.4 0.072 1.6E-06 32.2 5.6 29 30-58 9-39 (92)
52 KOG0037|consensus 95.2 0.075 1.6E-06 38.0 5.8 58 2-68 141-198 (221)
53 cd05022 S-100A13 S-100A13: S-1 94.8 0.052 1.1E-06 33.4 3.7 31 32-62 10-42 (89)
54 KOG0377|consensus 94.7 0.11 2.3E-06 41.2 5.9 51 2-57 564-614 (631)
55 cd00252 SPARC_EC SPARC_EC; ext 94.4 0.21 4.5E-06 32.2 5.9 28 29-56 47-74 (116)
56 cd05031 S-100A10_like S-100A10 93.9 0.11 2.4E-06 31.5 3.7 27 32-58 10-38 (94)
57 PF10591 SPARC_Ca_bdg: Secrete 93.8 0.07 1.5E-06 34.1 2.8 23 32-54 90-112 (113)
58 cd05029 S-100A6 S-100A6: S-100 93.7 0.27 5.9E-06 29.9 5.2 31 26-57 7-39 (88)
59 PF09824 ArsR: ArsR transcript 93.7 0.22 4.8E-06 34.0 5.1 62 2-73 87-148 (160)
60 PLN02964 phosphatidylserine de 93.6 0.1 2.2E-06 42.5 3.9 36 22-58 172-207 (644)
61 cd05027 S-100B S-100B: S-100B 93.6 0.13 2.8E-06 31.4 3.6 27 32-58 10-38 (88)
62 cd05026 S-100Z S-100Z: S-100Z 93.5 0.13 2.8E-06 31.5 3.6 26 33-58 13-40 (93)
63 KOG0041|consensus 93.4 0.17 3.8E-06 36.2 4.5 26 32-57 137-162 (244)
64 cd05023 S-100A11 S-100A11: S-1 93.2 0.19 4.1E-06 30.7 4.0 28 32-59 11-40 (89)
65 KOG0034|consensus 93.2 0.28 6.2E-06 34.1 5.3 27 32-58 106-132 (187)
66 KOG0041|consensus 93.1 0.11 2.3E-06 37.3 3.0 35 23-58 93-127 (244)
67 KOG4251|consensus 93.0 0.033 7.1E-07 41.2 0.4 21 36-56 146-166 (362)
68 KOG0037|consensus 92.5 0.35 7.5E-06 34.7 5.0 21 37-57 101-121 (221)
69 KOG4223|consensus 92.2 0.36 7.7E-06 36.4 4.9 29 30-58 113-141 (325)
70 PRK12309 transaldolase/EF-hand 92.1 0.73 1.6E-05 35.4 6.6 46 5-54 313-358 (391)
71 PF09279 EF-hand_like: Phospho 92.0 0.36 7.7E-06 28.5 4.0 54 2-58 16-69 (83)
72 KOG0046|consensus 91.1 0.53 1.2E-05 37.9 5.1 29 29-57 56-84 (627)
73 PF08976 DUF1880: Domain of un 89.8 0.35 7.6E-06 31.6 2.6 32 23-58 4-35 (118)
74 KOG0030|consensus 89.6 1.2 2.6E-05 30.2 5.1 77 2-84 66-150 (152)
75 KOG0036|consensus 89.5 1.2 2.6E-05 34.9 5.7 29 31-59 52-80 (463)
76 PF00427 PBS_linker_poly: Phyc 89.0 1.6 3.6E-05 28.8 5.4 69 2-76 5-74 (131)
77 KOG4666|consensus 88.6 0.7 1.5E-05 35.4 3.8 34 34-67 335-368 (412)
78 KOG1955|consensus 88.1 0.84 1.8E-05 36.8 4.1 33 21-57 260-292 (737)
79 COG4860 Uncharacterized protei 86.5 1.7 3.7E-05 29.6 4.4 56 1-66 93-148 (170)
80 KOG0377|consensus 86.3 0.8 1.7E-05 36.5 3.1 26 32-57 549-574 (631)
81 KOG0040|consensus 84.2 3.3 7.2E-05 37.5 6.0 35 22-57 2246-2280(2399)
82 KOG2643|consensus 80.6 0.89 1.9E-05 35.8 1.3 22 35-56 238-259 (489)
83 PF00404 Dockerin_1: Dockerin 79.8 2.2 4.7E-05 19.6 2.0 15 40-54 1-15 (21)
84 cd02977 ArsC_family Arsenate R 79.1 4.6 0.0001 24.7 4.0 30 35-64 62-91 (105)
85 KOG4251|consensus 77.8 2.4 5.3E-05 31.5 2.8 34 24-57 230-263 (362)
86 KOG2562|consensus 76.1 3.9 8.5E-05 32.5 3.6 28 30-57 311-342 (493)
87 KOG3866|consensus 74.6 4 8.7E-05 31.3 3.2 43 26-68 292-339 (442)
88 cd03035 ArsC_Yffb Arsenate Red 74.1 4.3 9.3E-05 25.3 2.9 30 35-64 60-89 (105)
89 KOG1029|consensus 73.4 2.6 5.7E-05 35.7 2.1 27 32-58 231-257 (1118)
90 KOG4004|consensus 73.0 2.3 4.9E-05 30.6 1.5 28 30-57 222-249 (259)
91 cd03032 ArsC_Spx Arsenate Redu 72.9 10 0.00022 23.7 4.4 28 36-64 64-91 (115)
92 PF05517 p25-alpha: p25-alpha 72.0 4.5 9.7E-05 27.0 2.7 33 21-57 36-68 (154)
93 PF02864 STAT_bind: STAT prote 71.5 6.3 0.00014 28.8 3.5 51 5-55 182-232 (254)
94 PF08766 DEK_C: DEK C terminal 71.3 15 0.00032 20.1 4.7 47 3-55 1-47 (54)
95 KOG4578|consensus 70.3 3.6 7.7E-05 31.7 2.1 32 30-61 370-401 (421)
96 cd05030 calgranulins Calgranul 69.9 11 0.00025 22.4 4.0 24 34-57 12-37 (88)
97 PF06207 DUF1002: Protein of u 69.0 27 0.00059 25.0 6.3 19 24-42 173-191 (225)
98 PF14203 DUF4319: Domain of un 67.8 12 0.00026 21.8 3.6 42 2-53 16-57 (64)
99 PF10591 SPARC_Ca_bdg: Secrete 67.7 2.1 4.5E-05 27.3 0.4 29 27-55 51-79 (113)
100 KOG0031|consensus 67.4 4.8 0.00011 27.7 2.1 53 30-82 101-162 (171)
101 KOG0038|consensus 67.2 7.2 0.00016 26.9 2.9 23 35-57 76-98 (189)
102 PF09373 PMBR: Pseudomurein-bi 66.8 5.5 0.00012 19.9 1.8 29 43-72 1-29 (33)
103 PF07637 PSD5: Protein of unkn 64.7 23 0.00049 20.1 4.4 56 8-63 4-59 (64)
104 TIGR00988 hip integration host 63.8 8.3 0.00018 23.2 2.5 45 3-52 2-48 (94)
105 PRK10026 arsenate reductase; P 62.9 16 0.00035 24.2 4.0 30 34-64 64-93 (141)
106 KOG1029|consensus 62.9 8.3 0.00018 32.9 3.0 28 31-58 50-77 (1118)
107 PRK13344 spxA transcriptional 60.1 17 0.00036 23.6 3.6 28 36-64 64-91 (132)
108 cd03034 ArsC_ArsC Arsenate Red 59.8 25 0.00054 21.9 4.3 28 36-64 63-90 (112)
109 PRK12559 transcriptional regul 59.6 20 0.00043 23.2 3.9 55 2-64 37-91 (131)
110 PF08006 DUF1700: Protein of u 59.2 42 0.0009 22.5 5.6 59 3-66 2-60 (181)
111 TIGR00014 arsC arsenate reduct 57.1 27 0.00059 21.8 4.1 28 36-64 64-91 (114)
112 KOG4065|consensus 56.9 16 0.00035 24.2 3.1 35 21-58 61-95 (144)
113 PF02269 TFIID-18kDa: Transcri 56.8 16 0.00034 22.5 2.9 43 29-71 37-79 (93)
114 PF12872 OST-HTH: OST-HTH/LOTU 56.5 34 0.00074 19.2 5.0 51 5-55 4-56 (74)
115 PRK01655 spxA transcriptional 56.5 23 0.0005 22.8 3.8 29 35-64 63-91 (131)
116 PF05042 Caleosin: Caleosin re 55.7 18 0.00039 25.1 3.3 33 31-63 97-129 (174)
117 PF09069 EF-hand_3: EF-hand; 55.5 38 0.00083 20.9 4.5 59 2-62 19-79 (90)
118 PF08269 Cache_2: Cache domain 54.8 21 0.00045 21.2 3.2 27 21-47 33-59 (95)
119 PF04876 Tenui_NCP: Tenuivirus 54.0 66 0.0014 22.1 5.7 43 21-66 126-168 (175)
120 KOG1707|consensus 53.5 24 0.00053 29.0 4.1 39 22-61 308-346 (625)
121 KOG1954|consensus 53.4 12 0.00026 29.6 2.3 24 30-53 477-500 (532)
122 PF03960 ArsC: ArsC family; I 53.1 5.4 0.00012 24.7 0.4 29 36-64 60-88 (110)
123 PRK00199 ihfB integration host 51.9 17 0.00036 21.9 2.4 45 3-52 2-48 (94)
124 KOG4347|consensus 50.5 19 0.00041 29.8 3.1 38 29-66 554-593 (671)
125 PF02885 Glycos_trans_3N: Glyc 50.4 33 0.00071 19.4 3.4 28 7-37 2-29 (66)
126 CHL00091 apcE phycobillisome l 49.9 78 0.0017 27.2 6.6 31 43-73 304-334 (877)
127 PF07492 Trehalase_Ca-bi: Neut 49.6 4.1 9E-05 20.4 -0.4 18 34-51 3-20 (30)
128 cd03036 ArsC_like Arsenate Red 49.2 30 0.00065 21.5 3.4 30 35-64 62-92 (111)
129 KOG3866|consensus 49.1 30 0.00066 26.6 3.8 49 1-57 223-271 (442)
130 PF12674 Zn_ribbon_2: Putative 47.1 47 0.001 20.0 3.9 34 2-37 42-75 (81)
131 KOG0169|consensus 45.6 33 0.00071 28.8 3.8 31 26-56 132-162 (746)
132 CHL00091 apcE phycobillisome l 45.3 22 0.00048 30.3 2.9 35 43-77 552-586 (877)
133 PF09312 SurA_N: SurA N-termin 44.7 27 0.00058 21.9 2.7 35 19-59 63-97 (118)
134 PF07553 Lipoprotein_Ltp: Host 44.3 42 0.00091 18.3 3.0 28 2-33 19-46 (48)
135 PF09435 DUF2015: Fungal prote 43.7 79 0.0017 20.9 4.7 37 3-46 86-123 (128)
136 smart00708 PhBP Insect pheromo 43.1 49 0.0011 19.2 3.5 36 25-60 29-65 (103)
137 COG0721 GatC Asp-tRNAAsn/Glu-t 43.1 60 0.0013 20.1 4.0 26 1-31 2-27 (96)
138 KOG0751|consensus 42.5 26 0.00057 28.5 2.8 27 37-63 81-109 (694)
139 PF09966 DUF2200: Uncharacteri 42.4 55 0.0012 21.1 3.8 41 3-60 24-65 (111)
140 PF12213 Dpoe2NT: DNA polymera 42.1 57 0.0012 19.2 3.6 46 6-56 23-70 (73)
141 PLN02952 phosphoinositide phos 41.5 1.3E+02 0.0027 24.8 6.5 54 3-59 55-111 (599)
142 TIGR01616 nitro_assoc nitrogen 40.8 63 0.0014 20.8 4.0 28 36-64 63-90 (126)
143 PF12174 RST: RCD1-SRO-TAF4 (R 40.8 41 0.00089 19.7 2.8 44 3-58 10-53 (70)
144 PF09832 DUF2059: Uncharacteri 39.8 42 0.00091 18.5 2.7 26 3-32 1-26 (64)
145 COG0776 HimA Bacterial nucleoi 39.7 93 0.002 19.3 4.5 32 1-38 1-32 (94)
146 KOG0046|consensus 39.5 68 0.0015 26.3 4.6 35 22-58 12-46 (627)
147 COG1460 Uncharacterized protei 39.2 1E+02 0.0022 20.0 4.6 30 2-36 80-109 (114)
148 TIGR01617 arsC_related transcr 38.8 18 0.00038 22.6 1.1 20 45-64 73-92 (117)
149 COG1715 Mrr Restriction endonu 38.3 74 0.0016 24.0 4.4 22 22-43 164-185 (308)
150 PF09682 Holin_LLH: Phage holi 38.0 1E+02 0.0022 19.2 4.5 33 7-40 73-105 (108)
151 cd03033 ArsC_15kD Arsenate Red 37.1 77 0.0017 19.9 3.9 28 36-64 62-89 (113)
152 TIGR00135 gatC glutamyl-tRNA(G 37.0 87 0.0019 18.7 4.0 26 2-32 1-26 (93)
153 PF09424 YqeY: Yqey-like prote 36.9 41 0.00088 22.3 2.7 22 21-42 89-110 (143)
154 PF11116 DUF2624: Protein of u 35.6 1.1E+02 0.0023 18.8 4.2 31 2-38 15-45 (85)
155 TIGR00987 himA integration hos 35.4 47 0.001 20.1 2.6 45 2-52 2-48 (96)
156 PF10955 DUF2757: Protein of u 34.9 40 0.00087 20.3 2.2 39 21-64 30-70 (76)
157 KOG2243|consensus 34.8 27 0.00059 32.5 1.9 26 35-60 4062-4087(5019)
158 PF01320 Colicin_Pyocin: Colic 34.7 1.1E+02 0.0024 18.7 4.4 31 3-36 11-41 (85)
159 PRK00034 gatC aspartyl/glutamy 34.3 1E+02 0.0023 18.3 4.1 25 2-31 3-27 (95)
160 PF10891 DUF2719: Protein of u 33.9 26 0.00055 21.4 1.2 21 43-63 32-52 (81)
161 COG4286 Uncharacterized conser 33.5 88 0.0019 23.6 4.1 64 20-84 103-171 (306)
162 KOG3555|consensus 33.1 13 0.00029 28.8 -0.1 28 32-59 284-311 (434)
163 PF03705 CheR_N: CheR methyltr 32.6 61 0.0013 17.2 2.6 32 25-61 25-56 (57)
164 KOG0042|consensus 32.2 86 0.0019 26.0 4.2 32 22-57 625-656 (680)
165 cd07978 TAF13 The TATA Binding 32.0 1.2E+02 0.0027 18.5 5.6 30 42-71 50-79 (92)
166 KOG0040|consensus 31.6 54 0.0012 30.5 3.1 26 33-58 2299-2324(2399)
167 PRK00285 ihfA integration host 31.4 54 0.0012 19.8 2.4 45 2-52 3-49 (99)
168 PRK10664 transcriptional regul 31.2 59 0.0013 19.6 2.5 44 3-52 2-47 (90)
169 KOG4403|consensus 31.1 72 0.0016 25.6 3.5 48 22-79 64-111 (575)
170 KOG0039|consensus 30.9 92 0.002 25.5 4.3 62 1-63 33-94 (646)
171 cd00591 HU_IHF Integration hos 30.7 58 0.0013 18.8 2.4 44 3-52 1-46 (87)
172 PF13946 DUF4214: Domain of un 30.3 1.2E+02 0.0025 17.6 5.2 38 22-66 36-73 (75)
173 PF07531 TAFH: NHR1 homology t 29.3 21 0.00046 22.4 0.3 28 44-71 38-71 (96)
174 PF09454 Vps23_core: Vps23 cor 29.2 36 0.00077 19.7 1.2 16 43-58 36-51 (65)
175 smart00549 TAFH TAF homology. 29.1 34 0.00075 21.4 1.2 28 44-71 37-70 (92)
176 PRK07077 hypothetical protein; 28.7 70 0.0015 23.0 2.9 26 41-66 181-206 (238)
177 PRK14981 DNA-directed RNA poly 28.5 1.3E+02 0.0029 18.9 4.0 28 3-35 80-107 (112)
178 PF09066 B2-adapt-app_C: Beta2 28.2 37 0.00081 20.9 1.3 43 43-85 3-47 (114)
179 COG4535 CorC Putative Mg2+ and 28.0 1E+02 0.0022 23.0 3.6 35 3-41 33-67 (293)
180 COG4086 Predicted secreted pro 27.7 1.9E+02 0.0041 21.8 5.0 35 3-38 211-245 (299)
181 PF14513 DAG_kinase_N: Diacylg 27.6 45 0.00097 22.2 1.6 19 40-58 42-60 (138)
182 COG1393 ArsC Arsenate reductas 27.5 1.7E+02 0.0036 18.6 5.2 55 2-64 38-92 (117)
183 PF10163 EnY2: Transcription f 26.8 1.5E+02 0.0032 17.7 4.7 49 2-57 14-62 (86)
184 PF11829 DUF3349: Protein of u 26.7 1.7E+02 0.0036 18.3 5.3 35 22-57 35-69 (96)
185 PF05598 DUF772: Transposase d 26.5 1.3E+02 0.0028 16.9 4.0 33 48-80 24-60 (77)
186 PF09851 SHOCT: Short C-termin 26.3 58 0.0013 15.8 1.6 13 43-55 13-25 (31)
187 PHA02943 hypothetical protein; 25.8 2.3E+02 0.005 19.5 5.4 56 29-84 77-135 (165)
188 PRK10753 transcriptional regul 25.6 80 0.0017 19.0 2.4 44 3-52 2-47 (90)
189 cd00038 CAP_ED effector domain 25.6 61 0.0013 18.2 1.9 21 46-66 94-114 (115)
190 PRK10665 nitrogen regulatory p 25.3 91 0.002 19.7 2.7 24 24-47 66-91 (112)
191 cd02518 GT2_SpsF SpsF is a gly 25.1 2.2E+02 0.0047 19.3 4.8 39 24-64 191-229 (233)
192 PLN02764 glycosyltransferase f 25.0 1.7E+02 0.0038 22.9 4.7 54 2-58 392-445 (453)
193 COG2055 Malate/L-lactate dehyd 24.8 3.3E+02 0.0071 21.0 6.1 53 2-58 6-61 (349)
194 PF13442 Cytochrome_CBB3: Cyto 24.7 1.3E+02 0.0028 16.3 3.7 29 5-33 35-65 (67)
195 PF11363 DUF3164: Protein of u 24.6 2E+02 0.0044 20.0 4.6 41 24-67 116-159 (195)
196 COG5562 Phage envelope protein 24.2 54 0.0012 21.9 1.6 21 39-59 81-101 (137)
197 PF09860 DUF2087: Uncharacteri 23.8 1.6E+02 0.0035 17.1 3.7 28 6-35 15-42 (71)
198 KOG4666|consensus 23.6 62 0.0014 25.1 2.0 28 30-57 259-286 (412)
199 PRK08032 fliD flagellar cappin 23.4 57 0.0012 25.5 1.8 27 40-66 347-376 (462)
200 PRK13403 ketol-acid reductoiso 23.4 1.7E+02 0.0037 22.3 4.2 44 3-61 269-312 (335)
201 PF06348 DUF1059: Protein of u 22.9 1.5E+02 0.0032 16.4 4.0 35 3-37 21-55 (57)
202 PRK05479 ketol-acid reductoiso 22.8 1.7E+02 0.0036 22.1 4.1 19 42-60 295-313 (330)
203 COG3793 TerB Tellurite resista 22.4 2.5E+02 0.0055 18.9 5.9 41 24-64 63-104 (144)
204 KOG0998|consensus 22.3 69 0.0015 27.2 2.2 36 22-58 276-311 (847)
205 KOG0751|consensus 22.3 1.1E+02 0.0025 25.1 3.3 28 31-58 109-136 (694)
206 PHA02554 13 neck protein; Prov 22.1 1.7E+02 0.0036 22.2 3.9 34 3-37 7-42 (311)
207 PF01839 FG-GAP: FG-GAP repeat 21.8 50 0.0011 16.5 0.8 9 38-46 7-15 (34)
208 PF13591 MerR_2: MerR HTH fami 21.6 1.1E+02 0.0023 18.1 2.4 20 47-66 1-20 (84)
209 PF06569 DUF1128: Protein of u 21.5 1.9E+02 0.0042 17.1 4.0 6 46-51 53-58 (71)
210 PF06919 Phage_T4_Gp30_7: Phag 21.4 2.4E+02 0.0051 18.3 4.0 28 10-41 94-121 (121)
211 TIGR03798 ocin_TIGR03798 bacte 21.3 1E+02 0.0022 17.3 2.1 22 48-69 3-24 (64)
212 PLN02222 phosphoinositide phos 21.2 3.1E+02 0.0067 22.5 5.5 51 2-58 40-90 (581)
213 PF09068 EF-hand_2: EF hand; 21.1 2.4E+02 0.0052 18.1 4.9 54 3-56 61-123 (127)
214 PF08557 Lipid_DES: Sphingolip 21.1 94 0.002 16.3 1.8 17 54-70 21-37 (39)
215 PLN02534 UDP-glycosyltransfera 20.9 1E+02 0.0022 24.3 2.8 56 2-58 429-486 (491)
216 COG3937 Uncharacterized conser 20.8 2.5E+02 0.0053 18.1 4.2 19 22-40 38-56 (108)
217 PF08726 EFhand_Ca_insen: Ca2+ 20.6 1.8E+02 0.004 16.9 3.2 36 34-72 10-45 (69)
218 PF06570 DUF1129: Protein of u 20.3 2.5E+02 0.0053 19.3 4.3 17 22-38 21-37 (206)
219 PRK06798 fliD flagellar cappin 20.2 74 0.0016 24.8 1.8 28 39-66 323-352 (440)
220 PF02671 PAH: Paired amphipath 20.2 89 0.0019 16.2 1.7 16 56-71 32-47 (47)
No 1
>KOG0034|consensus
Probab=99.38 E-value=2e-12 Score=89.70 Aligned_cols=66 Identities=32% Similarity=0.429 Sum_probs=58.2
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhccc
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQC 69 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~ 69 (86)
|+|+|+.++++..++. .... +++++..+++++|.++|.|+||+|||+||.+++.+.|++.+.|+..
T Consensus 121 I~reel~~iv~~~~~~-~~~~-~~e~~~~i~d~t~~e~D~d~DG~IsfeEf~~~v~~~P~~~~~m~~~ 186 (187)
T KOG0034|consen 121 ISREELKQILRMMVGE-NDDM-SDEQLEDIVDKTFEEADTDGDGKISFEEFCKVVEKQPDLLEKMTIR 186 (187)
T ss_pred CcHHHHHHHHHHHHcc-CCcc-hHHHHHHHHHHHHHHhCCCCCCcCcHHHHHHHHHcCccHHHHcCCC
Confidence 7999999999997753 2112 5999999999999999999999999999999999999999988654
No 2
>KOG0038|consensus
Probab=99.34 E-value=3.7e-12 Score=86.33 Aligned_cols=65 Identities=29% Similarity=0.390 Sum_probs=57.5
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCL 70 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l 70 (86)
+..+++.+.+.. ++. .++|++|+.-++++++.++|.||||++|++||.+++.+.|+|+.+|..+|
T Consensus 125 i~~~DL~~~l~~-lTr---~eLs~eEv~~i~ekvieEAD~DgDgkl~~~eFe~~i~raPDFlsTFHIRI 189 (189)
T KOG0038|consen 125 IGHDDLEKTLTS-LTR---DELSDEEVELICEKVIEEADLDGDGKLSFAEFEHVILRAPDFLSTFHIRI 189 (189)
T ss_pred ccHHHHHHHHHH-Hhh---ccCCHHHHHHHHHHHHHHhcCCCCCcccHHHHHHHHHhCcchHhhheecC
Confidence 346777787776 555 57999999999999999999999999999999999999999999997764
No 3
>KOG0044|consensus
Probab=99.06 E-value=5.6e-10 Score=77.89 Aligned_cols=68 Identities=22% Similarity=0.358 Sum_probs=55.9
Q ss_pred CCHHHHHHHHHHHhhcCCCC---CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccC
Q psy8460 2 GQKRSLMYLGRNSLISHPQD---EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCL 70 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~---~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l 70 (86)
|||+||..|++.++.- .+. +..++..++.++.+|+++|.|+||.||++||+.++..+|.+++++...+
T Consensus 117 It~~Eml~iv~~i~~m-~~~~~~~~~~~~~~~~v~~if~k~D~n~Dg~lT~eef~~~~~~d~~i~~~l~~~~ 187 (193)
T KOG0044|consen 117 ITKEEMLKIVQAIYQM-TGSKALPEDEETPEERVDKIFSKMDKNKDGKLTLEEFIEGCKADPSILRALEQDP 187 (193)
T ss_pred EcHHHHHHHHHHHHHH-cccccCCcccccHHHHHHHHHHHcCCCCCCcccHHHHHHHhhhCHHHHHHhhhcc
Confidence 7999999999987632 221 1224445588999999999999999999999999999999999997654
No 4
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=98.92 E-value=3.5e-09 Score=60.70 Aligned_cols=50 Identities=26% Similarity=0.376 Sum_probs=43.4
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
+|++|+..+++.... ..+++++++.++.+|+.+|.|+||.|+|+||.+++
T Consensus 17 i~~~el~~~~~~~~~-----~~~~~~~~~~~~~~~~~~D~d~dG~i~~~Ef~~~~ 66 (66)
T PF13499_consen 17 ISKEELRRALKHLGR-----DMSDEESDEMIDQIFREFDTDGDGRISFDEFLNFM 66 (66)
T ss_dssp EEHHHHHHHHHHTTS-----HSTHHHHHHHHHHHHHHHTTTSSSSEEHHHHHHHH
T ss_pred CCHHHHHHHHHHhcc-----cccHHHHHHHHHHHHHHhCCCCcCCCcHHHHhccC
Confidence 578999999998432 35688899999999999999999999999999875
No 5
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=98.82 E-value=1.2e-08 Score=56.67 Aligned_cols=47 Identities=26% Similarity=0.275 Sum_probs=38.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCC-CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDE-DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~-ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
+|++|+..++ ..+. .+ +++++++ .+|..+|.|+||.|+|+||+.++.
T Consensus 5 i~~~~~~~~l-~~~g----~~~~s~~e~~----~l~~~~D~~~~G~I~~~EF~~~~~ 52 (54)
T PF13833_consen 5 ITREEFRRAL-SKLG----IKDLSEEEVD----RLFREFDTDGDGYISFDEFISMMQ 52 (54)
T ss_dssp EEHHHHHHHH-HHTT----SSSSCHHHHH----HHHHHHTTSSSSSEEHHHHHHHHH
T ss_pred ECHHHHHHHH-HHhC----CCCCCHHHHH----HHHHhcccCCCCCCCHHHHHHHHH
Confidence 6899999999 4342 35 8898755 568999999999999999999875
No 6
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=98.72 E-value=3.5e-08 Score=60.96 Aligned_cols=53 Identities=19% Similarity=0.390 Sum_probs=40.9
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++|+..++...+++..+...++.++ +++++++|.|+||.|+|+||++++.+
T Consensus 28 Ls~~Elk~ll~~e~~~~~~~~~~~~~~----~~ll~~~D~d~DG~I~f~EF~~l~~~ 80 (89)
T cd05023 28 LSKTEFLSFMNTELASFTKNQKDPGVL----DRMMKKLDLNSDGQLDFQEFLNLIGG 80 (89)
T ss_pred ECHHHHHHHHHHhhhHhhcCCCCHHHH----HHHHHHcCCCCCCcCcHHHHHHHHHH
Confidence 689999999998764422234555654 45689999999999999999988764
No 7
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=98.70 E-value=3.4e-08 Score=61.20 Aligned_cols=49 Identities=24% Similarity=0.375 Sum_probs=39.9
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCCh-HHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPD-EGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~-ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++++|+..+++.-++ +.+++ ++ ++.+++.+|.|+||.|+|+||+.++.+
T Consensus 26 i~~~ELk~ll~~elg----~~ls~~~~----v~~mi~~~D~d~DG~I~F~EF~~l~~~ 75 (89)
T cd05022 26 LTASEFQELLTQQLP----HLLKDVEG----LEEKMKNLDVNQDSKLSFEEFWELIGE 75 (89)
T ss_pred ECHHHHHHHHHHHhh----hhccCHHH----HHHHHHHhCCCCCCCCcHHHHHHHHHH
Confidence 688999999997443 34777 55 456689999999999999999998875
No 8
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=98.69 E-value=1.6e-08 Score=50.70 Aligned_cols=27 Identities=37% Similarity=0.737 Sum_probs=23.9
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++++|+.+|.|+||.|+++||..++.+
T Consensus 2 ~~~~F~~~D~d~dG~I~~~Ef~~~~~~ 28 (29)
T PF00036_consen 2 LKEAFREFDKDGDGKIDFEEFKEMMKK 28 (29)
T ss_dssp HHHHHHHHSTTSSSEEEHHHHHHHHHH
T ss_pred HHHHHHHHCCCCCCcCCHHHHHHHHHh
Confidence 356799999999999999999998863
No 9
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=98.65 E-value=7.7e-08 Score=59.55 Aligned_cols=53 Identities=21% Similarity=0.253 Sum_probs=39.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++||..+++..+........++++ ++++++++|.|+||.|+|+||+.++..
T Consensus 29 Is~~EL~~ll~~~~~~~~~~~~~~~~----v~~i~~elD~n~dG~Idf~EF~~l~~~ 81 (93)
T cd05026 29 LSKGELKELLQRELTDFLSSQKDPML----VDKIMNDLDSNKDNEVDFNEFVVLVAA 81 (93)
T ss_pred ECHHHHHHHHHHHhHHhcccccCHHH----HHHHHHHhCCCCCCCCCHHHHHHHHHH
Confidence 68999999998754321112335554 456789999999999999999988864
No 10
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=98.61 E-value=1.2e-07 Score=58.37 Aligned_cols=52 Identities=29% Similarity=0.331 Sum_probs=41.1
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
++.+|+..+|+.-+++..|...++++++ .+++.+|.|+||.|+|+||+.++.
T Consensus 27 I~~~eL~~ll~~~~~~~lg~~~~~~~v~----~~i~~~D~n~dG~v~f~eF~~li~ 78 (88)
T cd05027 27 LKKSELKELINNELSHFLEEIKEQEVVD----KVMETLDSDGDGECDFQEFMAFVA 78 (88)
T ss_pred ECHHHHHHHHHHHhHHHhcCCCCHHHHH----HHHHHhCCCCCCcCcHHHHHHHHH
Confidence 6899999999985544334456777655 558899999999999999998775
No 11
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=98.61 E-value=1.5e-07 Score=57.99 Aligned_cols=51 Identities=20% Similarity=0.303 Sum_probs=40.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++|+.++++..+ . .|++++++++++ +++.+|.|+||+|+|+||+..+.+
T Consensus 29 Is~~EL~~~l~~~~-~-lg~k~t~~ev~~----m~~~~D~d~dG~Idf~EFv~lm~~ 79 (88)
T cd05029 29 LSKKELKELIQKEL-T-IGSKLQDAEIAK----LMEDLDRNKDQEVNFQEYVTFLGA 79 (88)
T ss_pred ECHHHHHHHHHHHH-h-cCCCCCHHHHHH----HHHHhcCCCCCCCcHHHHHHHHHH
Confidence 68899999997522 1 246788887665 588999999999999999988764
No 12
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=98.58 E-value=1e-07 Score=64.95 Aligned_cols=52 Identities=25% Similarity=0.405 Sum_probs=44.1
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
.||.+||..||+. .|++++++++++ +++.+|.|+||.|+|++|.+.+..+|.
T Consensus 108 ~Is~~eL~~vl~~-----lge~~~deev~~----ll~~~d~d~dG~i~~~eF~~~~~~~~~ 159 (160)
T COG5126 108 YISIGELRRVLKS-----LGERLSDEEVEK----LLKEYDEDGDGEIDYEEFKKLIKDSPT 159 (160)
T ss_pred eecHHHHHHHHHh-----hcccCCHHHHHH----HHHhcCCCCCceEeHHHHHHHHhccCC
Confidence 3788899999885 346899998665 589999999999999999999988774
No 13
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=98.49 E-value=3.5e-07 Score=56.27 Aligned_cols=57 Identities=19% Similarity=0.355 Sum_probs=42.2
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLL 62 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~ 62 (86)
+|++|+..+++..++...|...++++++ .+++.+|.|+||.|+|+||++++....-+
T Consensus 27 Is~~El~~~l~~~~g~~lg~~~s~~ei~----~~~~~~D~~~dg~I~f~eF~~l~~~~~~~ 83 (94)
T cd05031 27 LSRKELKKLMEKELSEFLKNQKDPMAVD----KIMKDLDQNRDGKVNFEEFVSLVAGLSIA 83 (94)
T ss_pred ECHHHHHHHHHHHhHHHhhccccHHHHH----HHHHHhCCCCCCcCcHHHHHHHHHHHHHH
Confidence 6889999999874432123356777655 56889999999999999999988754433
No 14
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=98.47 E-value=4.8e-07 Score=55.45 Aligned_cols=49 Identities=20% Similarity=0.320 Sum_probs=38.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCC----hHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDP----DEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls----~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++||..++...++. .++ +++ ++.+|+.+|.|+||.|+|+||+.++..
T Consensus 27 Is~~El~~ll~~~~g~----~~t~~~~~~~----v~~i~~~~D~d~dG~I~f~eF~~~~~~ 79 (88)
T cd05030 27 LYKKEFKQLVEKELPN----FLKKEKNQKA----IDKIFEDLDTNQDGQLSFEEFLVLVIK 79 (88)
T ss_pred CCHHHHHHHHHHHhhH----hhccCCCHHH----HHHHHHHcCCCCCCcCcHHHHHHHHHH
Confidence 7899999999864533 244 554 456689999999999999999998873
No 15
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=98.39 E-value=3.9e-07 Score=44.12 Aligned_cols=24 Identities=50% Similarity=0.794 Sum_probs=21.4
Q ss_pred HHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 33 ELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 33 d~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
+.+|+.+|.|+||.||++||.+++
T Consensus 2 ~~~F~~~D~d~DG~is~~E~~~~~ 25 (25)
T PF13202_consen 2 KDAFQQFDTDGDGKISFEEFQRLV 25 (25)
T ss_dssp HHHHHHHTTTSSSEEEHHHHHHHH
T ss_pred HHHHHHHcCCCCCcCCHHHHHHHC
Confidence 467999999999999999999864
No 16
>cd05024 S-100A10 S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=98.37 E-value=1.2e-06 Score=54.74 Aligned_cols=53 Identities=19% Similarity=0.315 Sum_probs=40.7
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.|+.|+...++.-+++.....-.+ ..|+++++..|.|+||.|+|.||+..+..
T Consensus 24 Lsk~Elk~Ll~~Elp~~l~~~~d~----~~vd~im~~LD~n~Dg~vdF~EF~~Lv~~ 76 (91)
T cd05024 24 LNRDDLQKLMEKEFSEFLKNQNDP----MAVDKIMKDLDDCRDGKVGFQSFFSLIAG 76 (91)
T ss_pred CCHHHHHHHHHHHhHHHHcCCCCH----HHHHHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 589999999987665422222233 35778899999999999999999988764
No 17
>KOG0027|consensus
Probab=98.36 E-value=1.3e-06 Score=57.64 Aligned_cols=49 Identities=22% Similarity=0.411 Sum_probs=40.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
||.+|+..+|.. .|++.++++++. +++++|.|+||.|+|+||++++...
T Consensus 102 Is~~el~~~l~~-----lg~~~~~~e~~~----mi~~~d~d~dg~i~f~ef~~~m~~~ 150 (151)
T KOG0027|consen 102 ISASELKKVLTS-----LGEKLTDEECKE----MIREVDVDGDGKVNFEEFVKMMSGK 150 (151)
T ss_pred CcHHHHHHHHHH-----hCCcCCHHHHHH----HHHhcCCCCCCeEeHHHHHHHHhcC
Confidence 688899998887 235788887665 5799999999999999999998753
No 18
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=98.31 E-value=2e-06 Score=52.62 Aligned_cols=53 Identities=23% Similarity=0.352 Sum_probs=39.5
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++|+..+++..+....+...++++++ .+++.+|.|++|.|+|+||+.++..
T Consensus 28 Is~~El~~~l~~~lg~~~~~~~s~~~v~----~i~~~~D~d~~G~I~f~eF~~l~~~ 80 (92)
T cd05025 28 LSKKELKDLLQTELSDFLDAQKDADAVD----KIMKELDENGDGEVDFQEFVVLVAA 80 (92)
T ss_pred ECHHHHHHHHHHHHHHHccCCCCHHHHH----HHHHHHCCCCCCcCcHHHHHHHHHH
Confidence 6899999999864422112235666544 5688999999999999999988874
No 19
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=98.15 E-value=1e-05 Score=48.75 Aligned_cols=54 Identities=26% Similarity=0.396 Sum_probs=39.5
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
+|.+|+..+++..++...+...+.++ ++.+++.+|.+++|.|+|+||+.++...
T Consensus 27 Is~~el~~~l~~~~g~~~~~~~~~~e----i~~i~~~~d~~~~g~I~f~eF~~~~~~~ 80 (88)
T cd00213 27 LSKKELKELLETELPNFLKNQKDPEA----VDKIMKDLDVNKDGKVDFQEFLVLIGKL 80 (88)
T ss_pred CcHHHHHHHHHHHhhhhccCCCCHHH----HHHHHHHhccCCCCcCcHHHHHHHHHHH
Confidence 68899999998644321112345665 4456899999999999999999988743
No 20
>KOG0028|consensus
Probab=98.12 E-value=8.1e-06 Score=55.90 Aligned_cols=38 Identities=18% Similarity=0.387 Sum_probs=32.9
Q ss_pred CCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCCh
Q psy8460 19 PQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEP 60 (86)
Q Consensus 19 ~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p 60 (86)
.|++++++++++| +.++|.|+||.|+-+||.++|.+.|
T Consensus 135 LgenltD~El~eM----IeEAd~d~dgevneeEF~~imk~t~ 172 (172)
T KOG0028|consen 135 LGENLTDEELMEM----IEEADRDGDGEVNEEEFIRIMKKTS 172 (172)
T ss_pred hCccccHHHHHHH----HHHhcccccccccHHHHHHHHhcCC
Confidence 4578999998877 5899999999999999999988754
No 21
>PTZ00183 centrin; Provisional
Probab=98.10 E-value=8.8e-06 Score=52.59 Aligned_cols=51 Identities=20% Similarity=0.285 Sum_probs=29.1
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
++++|+..++... +..++++++. .++..+|.|++|.|+|+||..++...|.
T Consensus 107 i~~~e~~~~l~~~-----~~~l~~~~~~----~~~~~~d~~~~g~i~~~ef~~~~~~~~~ 157 (158)
T PTZ00183 107 ISLKNLKRVAKEL-----GETITDEELQ----EMIDEADRNGDGEISEEEFYRIMKKTNL 157 (158)
T ss_pred CcHHHHHHHHHHh-----CCCCCHHHHH----HHHHHhCCCCCCcCcHHHHHHHHhcccC
Confidence 3445555555431 1235555444 3456667777777777777777766664
No 22
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=97.90 E-value=5.4e-05 Score=42.61 Aligned_cols=47 Identities=19% Similarity=0.105 Sum_probs=36.7
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
+|.+|+..+++.. + .++++ ++++++.+|.+++|.|+|+||+.++..-
T Consensus 16 i~~~el~~~l~~~--g-----~~~~~----~~~i~~~~d~~~~g~i~~~ef~~~~~~~ 62 (67)
T cd00052 16 ISGDEARPFLGKS--G-----LPRSV----LAQIWDLADTDKDGKLDKEEFAIAMHLI 62 (67)
T ss_pred CcHHHHHHHHHHc--C-----CCHHH----HHHHHHHhcCCCCCcCCHHHHHHHHHHH
Confidence 5788898888762 2 45555 4466899999999999999999887643
No 23
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=97.88 E-value=1.6e-05 Score=39.73 Aligned_cols=26 Identities=27% Similarity=0.496 Sum_probs=22.5
Q ss_pred HHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 33 ELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 33 d~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
..+|+.+|.|+||.|+.+||.+++.+
T Consensus 3 ~~~F~~~D~d~dG~I~~~el~~~l~~ 28 (31)
T PF13405_consen 3 REAFKMFDKDGDGFIDFEELRAILRK 28 (31)
T ss_dssp HHHHHHH-TTSSSEEEHHHHHHHHHH
T ss_pred HHHHHHHCCCCCCcCcHHHHHHHHHH
Confidence 46799999999999999999999874
No 24
>KOG0027|consensus
Probab=97.85 E-value=5.8e-05 Score=49.78 Aligned_cols=49 Identities=24% Similarity=0.371 Sum_probs=40.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
+|++|+..+++... ...+++++.. ++.+.|.|+||.|+++||..++.+.
T Consensus 25 i~~~el~~~lr~lg-----~~~t~~el~~----~~~~~D~dg~g~I~~~eF~~l~~~~ 73 (151)
T KOG0027|consen 25 ISVEELGAVLRSLG-----QNPTEEELRD----LIKEIDLDGDGTIDFEEFLDLMEKL 73 (151)
T ss_pred ccHHHHHHHHHHcC-----CCCCHHHHHH----HHHHhCCCCCCeEcHHHHHHHHHhh
Confidence 68899999998732 3478887665 5789999999999999999999853
No 25
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=97.72 E-value=0.00018 Score=38.48 Aligned_cols=46 Identities=24% Similarity=0.340 Sum_probs=34.9
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
++.+|+..+++... ...+.+++ ..+++.+|.+++|.|++++|...+
T Consensus 17 l~~~e~~~~l~~~~-----~~~~~~~~----~~~~~~~~~~~~~~l~~~ef~~~~ 62 (63)
T cd00051 17 ISADELKAALKSLG-----EGLSEEEI----DEMIREVDKDGDGKIDFEEFLELM 62 (63)
T ss_pred CcHHHHHHHHHHhC-----CCCCHHHH----HHHHHHhCCCCCCeEeHHHHHHHh
Confidence 57888888888632 23555544 457899999999999999998765
No 26
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=97.71 E-value=0.00024 Score=45.93 Aligned_cols=29 Identities=24% Similarity=0.410 Sum_probs=24.5
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
..++++|+.+|.|+||.||++||..++.+
T Consensus 80 ~~~~~f~~~~D~n~Dg~IS~~Ef~~cl~~ 108 (116)
T cd00252 80 HCIKPFFESCDLDKDGSISLDEWCYCFIK 108 (116)
T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHhC
Confidence 34677899999999999999999998844
No 27
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=97.67 E-value=0.00016 Score=44.39 Aligned_cols=58 Identities=16% Similarity=0.094 Sum_probs=42.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPS 72 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~ 72 (86)
+|.+|+..+++. . .+++++++ ++++.+|.+++|.|+|+||+.++..--.... +..||.
T Consensus 27 Is~~el~~~l~~-~------~~~~~ev~----~i~~~~d~~~~g~I~~~eF~~~~~~~~~~~~--g~~~~~ 84 (96)
T smart00027 27 VTGAQAKPILLK-S------GLPQTLLA----KIWNLADIDNDGELDKDEFALAMHLIYRKLN--GYPIPA 84 (96)
T ss_pred EeHHHHHHHHHH-c------CCCHHHHH----HHHHHhcCCCCCCcCHHHHHHHHHHHHHHHc--CCCCCc
Confidence 567788887775 1 25666544 5688999999999999999998886554433 556664
No 28
>PTZ00184 calmodulin; Provisional
Probab=97.65 E-value=0.00018 Score=45.70 Aligned_cols=23 Identities=30% Similarity=0.700 Sum_probs=12.9
Q ss_pred HHHhcCCCCCCcccHHHHHHHHh
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
+++.+|.+++|.|+|+||+..+.
T Consensus 125 ~~~~~d~~~~g~i~~~ef~~~~~ 147 (149)
T PTZ00184 125 MIREADVDGDGQINYEEFVKMMM 147 (149)
T ss_pred HHHhcCCCCCCcCcHHHHHHHHh
Confidence 34555666666666666655543
No 29
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=97.51 E-value=0.00016 Score=33.21 Aligned_cols=26 Identities=38% Similarity=0.673 Sum_probs=22.9
Q ss_pred HHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 33 ELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 33 d~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
..+|+.+|.+++|.|++.||..+++.
T Consensus 3 ~~~f~~~d~~~~g~i~~~e~~~~~~~ 28 (29)
T smart00054 3 KEAFRLFDKDGDGKIDFEEFKDLLKA 28 (29)
T ss_pred HHHHHHHCCCCCCcEeHHHHHHHHHh
Confidence 46789999999999999999988764
No 30
>KOG4065|consensus
Probab=97.44 E-value=0.00016 Score=47.60 Aligned_cols=34 Identities=21% Similarity=0.507 Sum_probs=31.1
Q ss_pred CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 23 DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 23 ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
.|+.|+..+||-+++.-|.|+||.|+|.||.+..
T Consensus 110 ~sE~Ele~~iD~vL~DdDfN~DG~IDYgEflK~q 143 (144)
T KOG4065|consen 110 SSEAELERLIDAVLDDDDFNGDGVIDYGEFLKRQ 143 (144)
T ss_pred CCHHHHHHHHHHHhcccccCCCceeeHHHHHhhc
Confidence 4778999999999999999999999999998753
No 31
>PF14788 EF-hand_10: EF hand; PDB: 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B 1DJZ_B 1DJY_B 1DJX_B 1QAT_A 1DJH_A ....
Probab=97.37 E-value=0.00062 Score=38.36 Aligned_cols=47 Identities=15% Similarity=0.111 Sum_probs=34.0
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.|-.|+..+|+.+- ..++++. +..+|+++|.+++|.+.-+||..+..
T Consensus 2 msf~Evk~lLk~~N-----I~~~~~y----A~~LFq~~D~s~~g~Le~~Ef~~Fy~ 48 (51)
T PF14788_consen 2 MSFKEVKKLLKMMN-----IEMDDEY----ARQLFQECDKSQSGRLEGEEFEEFYK 48 (51)
T ss_dssp BEHHHHHHHHHHTT---------HHH----HHHHHHHH-SSSSSEBEHHHHHHHHH
T ss_pred CCHHHHHHHHHHHc-----cCcCHHH----HHHHHHHhcccCCCCccHHHHHHHHH
Confidence 36789999999742 2466664 44679999999999999999987754
No 32
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=97.25 E-value=0.00034 Score=39.61 Aligned_cols=27 Identities=30% Similarity=0.534 Sum_probs=24.1
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+.++|+.+|.|+||.|+.+||..++..
T Consensus 2 l~~~F~~~D~d~~G~i~~~el~~~~~~ 28 (66)
T PF13499_consen 2 LKEAFKKFDKDGDGYISKEELRRALKH 28 (66)
T ss_dssp HHHHHHHHSTTSSSEEEHHHHHHHHHH
T ss_pred HHHHHHHHcCCccCCCCHHHHHHHHHH
Confidence 456799999999999999999998875
No 33
>PTZ00183 centrin; Provisional
Probab=97.09 E-value=0.002 Score=41.40 Aligned_cols=48 Identities=25% Similarity=0.337 Sum_probs=32.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++.+|+..+++.. + ..++.++ +..+++.+|.+++|.|+|+||+..+..
T Consensus 34 i~~~e~~~~l~~~--g---~~~~~~~----~~~l~~~~d~~~~g~i~~~eF~~~~~~ 81 (158)
T PTZ00183 34 IDPKELKVAMRSL--G---FEPKKEE----IKQMIADVDKDGSGKIDFEEFLDIMTK 81 (158)
T ss_pred ccHHHHHHHHHHh--C---CCCCHHH----HHHHHHHhCCCCCCcEeHHHHHHHHHH
Confidence 6777887777653 2 2344443 446678888888888888888877653
No 34
>KOG0044|consensus
Probab=97.08 E-value=0.002 Score=45.16 Aligned_cols=28 Identities=36% Similarity=0.665 Sum_probs=20.9
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
..++.+|+.+|.|+||.|+|.||+.++.
T Consensus 64 ~y~~~vF~~fD~~~dg~i~F~Efi~als 91 (193)
T KOG0044|consen 64 KYAELVFRTFDKNKDGTIDFLEFICALS 91 (193)
T ss_pred HHHHHHHHHhcccCCCCcCHHHHHHHHH
Confidence 4566778888888888888888776654
No 35
>KOG0030|consensus
Probab=97.00 E-value=0.0017 Score=43.82 Aligned_cols=46 Identities=20% Similarity=0.259 Sum_probs=29.1
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
|.-.|++++|.. .|++++++|++.+ +.- -.|++|.|+|+.|++.+.
T Consensus 105 i~~aeLRhvLtt-----lGekl~eeEVe~L----lag-~eD~nG~i~YE~fVk~i~ 150 (152)
T KOG0030|consen 105 IMGAELRHVLTT-----LGEKLTEEEVEEL----LAG-QEDSNGCINYEAFVKHIM 150 (152)
T ss_pred eeHHHHHHHHHH-----HHhhccHHHHHHH----Hcc-ccccCCcCcHHHHHHHHh
Confidence 334455555554 3467888876654 343 346788888888887664
No 36
>PTZ00184 calmodulin; Provisional
Probab=96.99 E-value=0.0032 Score=39.84 Aligned_cols=48 Identities=19% Similarity=0.317 Sum_probs=33.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++.+|+...+.. + + ...+++ .+..+++.+|.+++|.|+|+||+.++..
T Consensus 28 i~~~e~~~~l~~-~-~---~~~~~~----~~~~~~~~~d~~~~g~i~~~ef~~~l~~ 75 (149)
T PTZ00184 28 ITTKELGTVMRS-L-G---QNPTEA----ELQDMINEVDADGNGTIDFPEFLTLMAR 75 (149)
T ss_pred CCHHHHHHHHHH-h-C---CCCCHH----HHHHHHHhcCcCCCCcCcHHHHHHHHHH
Confidence 567777776654 2 1 234444 3456788899999999999999988763
No 37
>PF12763 EF-hand_4: Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=96.81 E-value=0.004 Score=39.50 Aligned_cols=34 Identities=24% Similarity=0.368 Sum_probs=26.8
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
+++.+. +.+||.-+|.|+||.++++||.-+|.--
T Consensus 39 ~L~~~~----L~~IW~LaD~~~dG~L~~~EF~iAm~Li 72 (104)
T PF12763_consen 39 GLPRDV----LAQIWNLADIDNDGKLDFEEFAIAMHLI 72 (104)
T ss_dssp TSSHHH----HHHHHHHH-SSSSSEEEHHHHHHHHHHH
T ss_pred CCCHHH----HHHHHhhhcCCCCCcCCHHHHHHHHHHH
Confidence 466664 4567999999999999999999888743
No 38
>PF14658 EF-hand_9: EF-hand domain
Probab=96.60 E-value=0.01 Score=35.05 Aligned_cols=46 Identities=20% Similarity=0.397 Sum_probs=34.5
Q ss_pred HHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCC-CcccHHHHHHHHhC
Q psy8460 5 RSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKD-GKISFQDFQQSVTD 58 (86)
Q Consensus 5 eEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~D-G~Is~eEF~~~l~~ 58 (86)
.++..+|+++ +. ...++.+++++ .+++|.+|. |.|+++.|..+|++
T Consensus 18 ~~l~~~Lra~-~~---~~p~e~~Lq~l----~~elDP~g~~~~v~~d~F~~iM~~ 64 (66)
T PF14658_consen 18 SDLITYLRAV-TG---RSPEESELQDL----INELDPEGRDGSVNFDTFLAIMRD 64 (66)
T ss_pred HHHHHHHHHH-cC---CCCcHHHHHHH----HHHhCCCCCCceEeHHHHHHHHHH
Confidence 4677778773 22 24666766654 689999998 99999999999874
No 39
>KOG2643|consensus
Probab=96.58 E-value=0.0013 Score=51.23 Aligned_cols=49 Identities=16% Similarity=0.320 Sum_probs=37.4
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++..+|.++.+. .++ .++|+. ++|-+|.-+|.|+||.+|++||+.+|.+
T Consensus 405 i~~~~f~raa~~-vtG---veLSdh----VvdvvF~IFD~N~Dg~LS~~EFl~Vmk~ 453 (489)
T KOG2643|consen 405 IDEKTFQRAAKV-VTG---VELSDH----VVDVVFTIFDENNDGTLSHKEFLAVMKR 453 (489)
T ss_pred CCHHHHHHHHHH-hcC---cccccc----eeeeEEEEEccCCCCcccHHHHHHHHHH
Confidence 455666666664 223 467776 5667899999999999999999999874
No 40
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=96.58 E-value=0.0043 Score=37.97 Aligned_cols=35 Identities=17% Similarity=0.226 Sum_probs=29.8
Q ss_pred CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 23 DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 23 ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|++++.. +..+|+.+|.|++|.|+++|+..++..
T Consensus 4 ls~~~~~~-l~~~F~~~D~d~~G~Is~~el~~~l~~ 38 (96)
T smart00027 4 ISPEDKAK-YEQIFRSLDKNQDGTVTGAQAKPILLK 38 (96)
T ss_pred CCHHHHHH-HHHHHHHhCCCCCCeEeHHHHHHHHHH
Confidence 67776664 568899999999999999999999865
No 41
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=96.52 E-value=0.0094 Score=40.67 Aligned_cols=46 Identities=13% Similarity=0.226 Sum_probs=23.2
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
++++|+..|++. ++. ..++.++. +++...|. |.|.|+|.+|+.+|.
T Consensus 37 I~~~el~~ilr~-lg~----~~s~~ei~----~l~~~~d~-~~~~idf~~Fl~~ms 82 (160)
T COG5126 37 IDRNELGKILRS-LGF----NPSEAEIN----KLFEEIDA-GNETVDFPEFLTVMS 82 (160)
T ss_pred CcHHHHHHHHHH-cCC----CCcHHHHH----HHHHhccC-CCCccCHHHHHHHHH
Confidence 466677777663 322 34444333 23445554 555555555555544
No 42
>PRK12309 transaldolase/EF-hand domain-containing protein; Provisional
Probab=96.44 E-value=0.012 Score=45.04 Aligned_cols=35 Identities=17% Similarity=0.259 Sum_probs=20.2
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.++.+|+.. ++.+|+.+|.|+||.|+++||.+++.
T Consensus 350 ~Is~~E~~~-~~~~F~~~D~d~DG~Is~eEf~~~~~ 384 (391)
T PRK12309 350 FITREEWLG-SDAVFDALDLNHDGKITPEEMRAGLG 384 (391)
T ss_pred cCcHHHHHH-HHHHHHHhCCCCCCCCcHHHHHHHHH
Confidence 355554422 35566666666666666666666554
No 43
>KOG4223|consensus
Probab=96.18 E-value=0.0078 Score=45.12 Aligned_cols=30 Identities=27% Similarity=0.473 Sum_probs=25.6
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
=+|...+...|.|+||+|+++||+.-+...
T Consensus 200 iVi~Etl~d~Dkn~DG~I~~eEfigd~~~~ 229 (325)
T KOG4223|consen 200 IVIAETLEDIDKNGDGKISLEEFIGDLYSH 229 (325)
T ss_pred HHHHHHHhhcccCCCCceeHHHHHhHHhhc
Confidence 456677889999999999999999888754
No 44
>PLN02964 phosphatidylserine decarboxylase
Probab=96.03 E-value=0.016 Score=46.99 Aligned_cols=49 Identities=8% Similarity=0.060 Sum_probs=35.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
++.+|+..++.. +. ...+++++. .+|+.+|.|+||.|+++||.+++...
T Consensus 196 IdfdEFl~lL~~-lg----~~~seEEL~----eaFk~fDkDgdG~Is~dEL~~vL~~~ 244 (644)
T PLN02964 196 LSFSEFSDLIKA-FG----NLVAANKKE----ELFKAADLNGDGVVTIDELAALLALQ 244 (644)
T ss_pred EcHHHHHHHHHH-hc----cCCCHHHHH----HHHHHhCCCCCCcCCHHHHHHHHHhc
Confidence 456677776664 21 235566544 56899999999999999999988873
No 45
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=95.93 E-value=0.012 Score=31.03 Aligned_cols=26 Identities=31% Similarity=0.489 Sum_probs=23.0
Q ss_pred HHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 33 ELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 33 d~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
..+|..+|.|++|.|+++||..++..
T Consensus 3 ~~~f~~~d~~~~g~l~~~e~~~~l~~ 28 (63)
T cd00051 3 REAFRLFDKDGDGTISADELKAALKS 28 (63)
T ss_pred HHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 46789999999999999999998864
No 46
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=95.91 E-value=0.028 Score=33.51 Aligned_cols=32 Identities=6% Similarity=0.167 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHHhcCC--CCCCcccHHHHHHHHhC
Q psy8460 26 EGVRDLVELALRKMDY--DKDGKISFQDFQQSVTD 58 (86)
Q Consensus 26 ee~~~~vd~i~~~~D~--d~DG~Is~eEF~~~l~~ 58 (86)
++++. +..+|..+|. |++|.|+++||..++..
T Consensus 5 ~~~~~-l~~~F~~~D~~~~~~G~Is~~el~~~l~~ 38 (88)
T cd00213 5 KAIET-IIDVFHKYSGKEGDKDTLSKKELKELLET 38 (88)
T ss_pred HHHHH-HHHHHHHHhhccCCCCcCcHHHHHHHHHH
Confidence 34444 5578999999 89999999999999864
No 47
>KOG0031|consensus
Probab=95.81 E-value=0.025 Score=38.85 Aligned_cols=35 Identities=29% Similarity=0.443 Sum_probs=26.1
Q ss_pred CCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 19 PQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 19 ~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.|+.++++|++. +++.+-.|..|.|.|.+|..++.
T Consensus 130 ~gDr~~~eEV~~----m~r~~p~d~~G~~dy~~~~~~it 164 (171)
T KOG0031|consen 130 MGDRFTDEEVDE----MYREAPIDKKGNFDYKAFTYIIT 164 (171)
T ss_pred hcccCCHHHHHH----HHHhCCcccCCceeHHHHHHHHH
Confidence 456788887554 47788888888888888887775
No 48
>KOG0028|consensus
Probab=95.69 E-value=0.041 Score=37.89 Aligned_cols=72 Identities=21% Similarity=0.281 Sum_probs=44.5
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC-Ch-----HHH---HHhcccCCCh
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD-EP-----LLL---EAFGQCLPSD 73 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~-~p-----~~~---~~~~~~l~~~ 73 (86)
+.+++..++...+. +.-+.+++. ..|+..|.|++|+||+.+|..++.. .| ++. +-...+.-+.
T Consensus 87 ~fe~f~~~mt~k~~----e~dt~eEi~----~afrl~D~D~~Gkis~~~lkrvakeLgenltD~El~eMIeEAd~d~dge 158 (172)
T KOG0028|consen 87 TFEDFRRVMTVKLG----ERDTKEEIK----KAFRLFDDDKTGKISQRNLKRVAKELGENLTDEELMEMIEEADRDGDGE 158 (172)
T ss_pred chHHHHHHHHHHHh----ccCcHHHHH----HHHHcccccCCCCcCHHHHHHHHHHhCccccHHHHHHHHHHhccccccc
Confidence 34455555444332 234777665 4589999999999999999988764 44 555 3333444444
Q ss_pred HHHHHHHHH
Q psy8460 74 AARQSFLST 82 (86)
Q Consensus 74 ~~~~~~~~~ 82 (86)
.-...|++.
T Consensus 159 vneeEF~~i 167 (172)
T KOG0028|consen 159 VNEEEFIRI 167 (172)
T ss_pred ccHHHHHHH
Confidence 445556544
No 49
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=95.63 E-value=0.014 Score=32.49 Aligned_cols=25 Identities=28% Similarity=0.316 Sum_probs=22.0
Q ss_pred HHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 34 LALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 34 ~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++|+.+|.|+||.|+.+|+..++..
T Consensus 3 ~~F~~~D~~~~G~i~~~el~~~l~~ 27 (67)
T cd00052 3 QIFRSLDPDGDGLISGDEARPFLGK 27 (67)
T ss_pred HHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 4688899999999999999988754
No 50
>KOG0036|consensus
Probab=95.59 E-value=0.029 Score=43.60 Aligned_cols=50 Identities=14% Similarity=0.173 Sum_probs=40.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCCh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEP 60 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p 60 (86)
+.-+|+.+.++.. |.++++++++. +++.+|.||++.|.++||...+.-.|
T Consensus 99 i~~~Ei~~~l~~~-----gi~l~de~~~k----~~e~~d~~g~~~I~~~e~rd~~ll~p 148 (463)
T KOG0036|consen 99 IDPNEIWRYLKDL-----GIQLSDEKAAK----FFEHMDKDGKATIDLEEWRDHLLLYP 148 (463)
T ss_pred cCHHHHHHHHHHh-----CCccCHHHHHH----HHHHhccCCCeeeccHHHHhhhhcCC
Confidence 4456788888773 24699998765 57899999999999999999998877
No 51
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=95.40 E-value=0.072 Score=32.18 Aligned_cols=29 Identities=10% Similarity=0.178 Sum_probs=24.6
Q ss_pred HHHHHHHHhcC-CCCCC-cccHHHHHHHHhC
Q psy8460 30 DLVELALRKMD-YDKDG-KISFQDFQQSVTD 58 (86)
Q Consensus 30 ~~vd~i~~~~D-~d~DG-~Is~eEF~~~l~~ 58 (86)
+-+.++|+.+| .|++| .|+..|+..++..
T Consensus 9 ~~l~~~F~~fDd~dg~G~~Is~~El~~~l~~ 39 (92)
T cd05025 9 ETLINVFHAHSGKEGDKYKLSKKELKDLLQT 39 (92)
T ss_pred HHHHHHHHHHhcccCCCCeECHHHHHHHHHH
Confidence 44567899997 99999 5999999999964
No 52
>KOG0037|consensus
Probab=95.21 E-value=0.075 Score=38.04 Aligned_cols=58 Identities=28% Similarity=0.254 Sum_probs=44.4
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcc
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQ 68 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~ 68 (86)
|+..||.+-|.. + |=.++++ +++.+++++|.-++|.|.|++|++++..-+.+.+.|+.
T Consensus 141 I~~sEL~~Al~~-~----Gy~Lspq----~~~~lv~kyd~~~~g~i~FD~FI~ccv~L~~lt~~Fr~ 198 (221)
T KOG0037|consen 141 IDSSELRQALTQ-L----GYRLSPQ----FYNLLVRKYDRFGGGRIDFDDFIQCCVVLQRLTEAFRR 198 (221)
T ss_pred ccHHHHHHHHHH-c----CcCCCHH----HHHHHHHHhccccCCceeHHHHHHHHHHHHHHHHHHHH
Confidence 566777776665 2 2357777 44556789997779999999999999988888887764
No 53
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=94.82 E-value=0.052 Score=33.39 Aligned_cols=31 Identities=19% Similarity=0.293 Sum_probs=26.4
Q ss_pred HHHHHHhcCC-CCCCcccHHHHHHHHhC-ChHH
Q psy8460 32 VELALRKMDY-DKDGKISFQDFQQSVTD-EPLL 62 (86)
Q Consensus 32 vd~i~~~~D~-d~DG~Is~eEF~~~l~~-~p~~ 62 (86)
+-.+|+.+|. +++|.|+..||..++.. -|++
T Consensus 10 l~~~F~~fd~~~~~g~i~~~ELk~ll~~elg~~ 42 (89)
T cd05022 10 LVSNFHKASVKGGKESLTASEFQELLTQQLPHL 42 (89)
T ss_pred HHHHHHHHhCCCCCCeECHHHHHHHHHHHhhhh
Confidence 3478999999 99999999999999987 4544
No 54
>KOG0377|consensus
Probab=94.66 E-value=0.11 Score=41.23 Aligned_cols=51 Identities=22% Similarity=0.271 Sum_probs=35.5
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
||-+|.....+-.-.. ....+++.++-+++ +-+|.|+||+|++.||.++.+
T Consensus 564 isldEF~~a~~l~~sh-~~~~i~~~~i~~la----~~mD~NkDG~IDlNEfLeAFr 614 (631)
T KOG0377|consen 564 ISLDEFRTAWKLLSSH-MNGAISDDEILELA----RSMDLNKDGKIDLNEFLEAFR 614 (631)
T ss_pred eeHHHHHHHHHHHHhh-cCCCcCHHHHHHHH----HhhccCCCCcccHHHHHHHHh
Confidence 4666766665532111 22368888776654 678999999999999998765
No 55
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=94.39 E-value=0.21 Score=32.17 Aligned_cols=28 Identities=25% Similarity=0.415 Sum_probs=23.9
Q ss_pred HHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 29 RDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 29 ~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
..-+.-.|..+|.|+||.||.+|...+.
T Consensus 47 ~~~l~w~F~~lD~d~DG~Ls~~EL~~~~ 74 (116)
T cd00252 47 KDPVGWMFNQLDGNYDGKLSHHELAPIR 74 (116)
T ss_pred HHHHHHHHHHHCCCCCCcCCHHHHHHHH
Confidence 3446778999999999999999999764
No 56
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=93.90 E-value=0.11 Score=31.55 Aligned_cols=27 Identities=11% Similarity=0.249 Sum_probs=22.8
Q ss_pred HHHHHHhcCC-CC-CCcccHHHHHHHHhC
Q psy8460 32 VELALRKMDY-DK-DGKISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D~-d~-DG~Is~eEF~~~l~~ 58 (86)
+..+|..+|. |+ ||.|+..|+..++..
T Consensus 10 l~~~F~~~D~~dg~dG~Is~~El~~~l~~ 38 (94)
T cd05031 10 LILTFHRYAGKDGDKNTLSRKELKKLMEK 38 (94)
T ss_pred HHHHHHHHhccCCCCCeECHHHHHHHHHH
Confidence 4467999997 97 699999999998864
No 57
>PF10591 SPARC_Ca_bdg: Secreted protein acidic and rich in cysteine Ca binding region; InterPro: IPR019577 This entry represents the calcium-binding domain found in SPARC (Secreted Protein Acidic and Rich in Cysteine) and Testican (also known as SPOCK; or SParc/Osteonectin, Cwcv and Kazal-like domains) proteins. SPARC proteins are down-regulated in various tumours and may have a tumour-suppressor function [, ]. Testican-3 appears to be a novel regulator that reduces the activity of matrix metalloproteinase (MMP) in adult T-cell leukemia (ATL) []. This cysteine-rich domain is responsible for the anti-spreading activity of human urothelial cells. This extracellular calcium-binding domain is rich in alpha-helices and contains two EF-hands that each coordinates one Ca2+ ion, forming a helix-loop-helix structure that not only drives the conformation of the protein but is also necessary for biological activity. The anti-spreading activity was dependent on the coordination of Ca2+ by a Glu residue at the Z position of EF-hand 2 []. ; GO: 0005509 calcium ion binding, 0007165 signal transduction, 0005578 proteinaceous extracellular matrix; PDB: 1BMO_A 1SRA_A 2V53_A 1NUB_B.
Probab=93.78 E-value=0.07 Score=34.10 Aligned_cols=23 Identities=35% Similarity=0.566 Sum_probs=15.5
Q ss_pred HHHHHHhcCCCCCCcccHHHHHH
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQ 54 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~ 54 (86)
+...++..|.|+||.||..|+..
T Consensus 90 ~~~F~~~CD~n~d~~Is~~EW~~ 112 (113)
T PF10591_consen 90 ARPFFRSCDVNKDGKISLDEWCN 112 (113)
T ss_dssp HHHHHHHH-TT-SSSEEHHHHHH
T ss_pred HHHHHHHcCCCCCCCCCHHHHcc
Confidence 34567888888888888888764
No 58
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=93.68 E-value=0.27 Score=29.92 Aligned_cols=31 Identities=13% Similarity=0.244 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHhcCC-CC-CCcccHHHHHHHHh
Q psy8460 26 EGVRDLVELALRKMDY-DK-DGKISFQDFQQSVT 57 (86)
Q Consensus 26 ee~~~~vd~i~~~~D~-d~-DG~Is~eEF~~~l~ 57 (86)
+.+..+| .+|.++|. || +|.|+.+||.+++.
T Consensus 7 ~~~~~~i-~~F~~y~~~~~~~g~Is~~EL~~~l~ 39 (88)
T cd05029 7 QAIGLLV-AIFHKYSGREGDKNTLSKKELKELIQ 39 (88)
T ss_pred HHHHHHH-HHHHHHHccCCCCCEECHHHHHHHHH
Confidence 3444444 67888998 77 89999999999995
No 59
>PF09824 ArsR: ArsR transcriptional regulator; InterPro: IPR014517 Members of this family of archaeal proteins are conserved transcriptional regulators belonging to the ArsR family.
Probab=93.65 E-value=0.22 Score=34.04 Aligned_cols=62 Identities=19% Similarity=0.255 Sum_probs=50.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCCh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPSD 73 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~~ 73 (86)
.|-+|+..++.-. -+++++++++++++.+++.. |..|.....+.+..+|-|++.+..+.|..
T Consensus 87 cs~~DLsdii~i~-------f~~deel~~~~e~i~~~v~~---Gn~Sl~~lsr~l~~sp~firglAKRs~~L 148 (160)
T PF09824_consen 87 CSMEDLSDIIYIA-------FMSDEELRDYVEKIEKEVEA---GNTSLSDLSRKLGISPVFIRGLAKRSPKL 148 (160)
T ss_pred eeHHHHHHHHhee-------ecCHHHHHHHHHHHHHHHHc---CCCcHHHHHHHhCCCHHHHHHHHHhccCc
Confidence 4567777777643 26789999999999998764 88999999999999999999998887753
No 60
>PLN02964 phosphatidylserine decarboxylase
Probab=93.59 E-value=0.1 Score=42.51 Aligned_cols=36 Identities=25% Similarity=0.537 Sum_probs=28.2
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
..++++. ..++.+|+.+|.|+||.|+++||+.++..
T Consensus 172 ~pte~e~-~fi~~mf~~~D~DgdG~IdfdEFl~lL~~ 207 (644)
T PLN02964 172 DPVETER-SFARRILAIVDYDEDGQLSFSEFSDLIKA 207 (644)
T ss_pred CCCHHHH-HHHHHHHHHhCCCCCCeEcHHHHHHHHHH
Confidence 3444432 24678899999999999999999998863
No 61
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=93.57 E-value=0.13 Score=31.38 Aligned_cols=27 Identities=7% Similarity=0.219 Sum_probs=23.1
Q ss_pred HHHHHHhcC-CCCCC-cccHHHHHHHHhC
Q psy8460 32 VELALRKMD-YDKDG-KISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D-~d~DG-~Is~eEF~~~l~~ 58 (86)
+..+|+.+| .|||| .|+.+|+..+++.
T Consensus 10 l~~aF~~fD~~dgdG~~I~~~eL~~ll~~ 38 (88)
T cd05027 10 LIDVFHQYSGREGDKHKLKKSELKELINN 38 (88)
T ss_pred HHHHHHHhcccCCCcCEECHHHHHHHHHH
Confidence 346799998 79999 5999999999975
No 62
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=93.53 E-value=0.13 Score=31.47 Aligned_cols=26 Identities=12% Similarity=0.161 Sum_probs=22.0
Q ss_pred HHHHHhcC-CCCCC-cccHHHHHHHHhC
Q psy8460 33 ELALRKMD-YDKDG-KISFQDFQQSVTD 58 (86)
Q Consensus 33 d~i~~~~D-~d~DG-~Is~eEF~~~l~~ 58 (86)
-++|+.+| .|||| .||..|+..++.+
T Consensus 13 ~~~F~~~dd~dgdg~~Is~~EL~~ll~~ 40 (93)
T cd05026 13 IRIFHNYSGKEGDRYKLSKGELKELLQR 40 (93)
T ss_pred HHHHHHHHccCCCCCEECHHHHHHHHHH
Confidence 35688998 88999 5999999999964
No 63
>KOG0041|consensus
Probab=93.44 E-value=0.17 Score=36.23 Aligned_cols=26 Identities=31% Similarity=0.560 Sum_probs=20.5
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
+..+++++|-|.||+|||.||.-+..
T Consensus 137 lK~mikeVded~dgklSfreflLIfr 162 (244)
T KOG0041|consen 137 LKNMIKEVDEDFDGKLSFREFLLIFR 162 (244)
T ss_pred HHHHHHHhhcccccchhHHHHHHHHH
Confidence 44567888999999999999886654
No 64
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=93.25 E-value=0.19 Score=30.70 Aligned_cols=28 Identities=21% Similarity=0.300 Sum_probs=23.6
Q ss_pred HHHHHHh-cCCCCCC-cccHHHHHHHHhCC
Q psy8460 32 VELALRK-MDYDKDG-KISFQDFQQSVTDE 59 (86)
Q Consensus 32 vd~i~~~-~D~d~DG-~Is~eEF~~~l~~~ 59 (86)
+-.+|+. +|.|||| .||.+||..++.+.
T Consensus 11 l~~~F~~y~~~dg~~~~Ls~~Elk~ll~~e 40 (89)
T cd05023 11 LIAVFQKYAGKDGDSYQLSKTEFLSFMNTE 40 (89)
T ss_pred HHHHHHHHhccCCCcCeECHHHHHHHHHHh
Confidence 3467888 8899987 99999999999875
No 65
>KOG0034|consensus
Probab=93.20 E-value=0.28 Score=34.09 Aligned_cols=27 Identities=33% Similarity=0.471 Sum_probs=22.5
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++-.|+-+|.++||.|+.+|+.+++..
T Consensus 106 l~faF~vYD~~~~G~I~reel~~iv~~ 132 (187)
T KOG0034|consen 106 LRFAFRVYDLDGDGFISREELKQILRM 132 (187)
T ss_pred HHHHHHHhcCCCCCcCcHHHHHHHHHH
Confidence 445688999999999999999988763
No 66
>KOG0041|consensus
Probab=93.08 E-value=0.11 Score=37.33 Aligned_cols=35 Identities=20% Similarity=0.424 Sum_probs=30.3
Q ss_pred CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 23 DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 23 ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.+-.++++ ...+|+.+|.|.||.|++-|...+|.+
T Consensus 93 FsrkqIk~-~~~~Fk~yDe~rDgfIdl~ELK~mmEK 127 (244)
T KOG0041|consen 93 FSRKQIKD-AESMFKQYDEDRDGFIDLMELKRMMEK 127 (244)
T ss_pred HHHHHHHH-HHHHHHHhcccccccccHHHHHHHHHH
Confidence 45577775 578999999999999999999999886
No 67
>KOG4251|consensus
Probab=93.03 E-value=0.033 Score=41.18 Aligned_cols=21 Identities=29% Similarity=0.680 Sum_probs=17.8
Q ss_pred HHhcCCCCCCcccHHHHHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l 56 (86)
|+..|.||||.|+++||.--.
T Consensus 146 FraVDpdgDGhvsWdEykvkF 166 (362)
T KOG4251|consen 146 FRAVDPDGDGHVSWDEYKVKF 166 (362)
T ss_pred eeeeCCCCCCceehhhhhhHH
Confidence 667899999999999997443
No 68
>KOG0037|consensus
Probab=92.53 E-value=0.35 Score=34.68 Aligned_cols=21 Identities=29% Similarity=0.555 Sum_probs=10.1
Q ss_pred HhcCCCCCCcccHHHHHHHHh
Q psy8460 37 RKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 37 ~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.-+|.|++|+|+++||.....
T Consensus 101 ~mfd~~~~G~i~f~EF~~Lw~ 121 (221)
T KOG0037|consen 101 SMFDRDNSGTIGFKEFKALWK 121 (221)
T ss_pred HHhcCCCCCccCHHHHHHHHH
Confidence 444555555555555544433
No 69
>KOG4223|consensus
Probab=92.19 E-value=0.36 Score=36.40 Aligned_cols=29 Identities=17% Similarity=0.328 Sum_probs=23.8
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.-+.+-|.+.|.|+||.|+++||......
T Consensus 113 ~~~~~~~~~~d~~~Dg~i~~eey~~~~~~ 141 (325)
T KOG4223|consen 113 EEAARRWDEYDKNKDGFITWEEYLPQTYG 141 (325)
T ss_pred HHHHHHHHHhccCccceeeHHHhhhhhhh
Confidence 33455678899999999999999988774
No 70
>PRK12309 transaldolase/EF-hand domain-containing protein; Provisional
Probab=92.08 E-value=0.73 Score=35.42 Aligned_cols=46 Identities=22% Similarity=0.229 Sum_probs=30.9
Q ss_pred HHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHH
Q psy8460 5 RSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQ 54 (86)
Q Consensus 5 eEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~ 54 (86)
..|..+++..+....| -+-++..+..+|+-+|.|+||.|+.+||..
T Consensus 313 ~~L~~~i~~~~~~~~~----~~~~~~~l~~aF~~~D~dgdG~Is~~E~~~ 358 (391)
T PRK12309 313 ETLEKLLAHRLARLEG----GEAFTHAAQEIFRLYDLDGDGFITREEWLG 358 (391)
T ss_pred HHHHHHHHHHHHHhhc----cChhhHHHHHHHHHhCCCCCCcCcHHHHHH
Confidence 4556666655532112 123335566789999999999999999963
No 71
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=92.00 E-value=0.36 Score=28.47 Aligned_cols=54 Identities=13% Similarity=0.153 Sum_probs=38.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+|.+++...|+..... ..++++++..++.+.-........+.+++++|...|..
T Consensus 16 mt~~~f~~FL~~eQ~~---~~~~~~~~~~li~~~~~~~~~~~~~~lt~~gF~~fL~S 69 (83)
T PF09279_consen 16 MTAEEFRRFLREEQGE---PRLTDEQAKELIEKFEPDERNRQKGQLTLEGFTRFLFS 69 (83)
T ss_dssp EEHHHHHHHHHHTSS----TTSSHHHHHHHHHHHHHHHHHHCTTEEEHHHHHHHHHS
T ss_pred CCHHHHHHHHHHHhcc---ccCcHHHHHHHHHHHccchhhcccCCcCHHHHHHHHCC
Confidence 4678888888863321 25788988888766544333346799999999999875
No 72
>KOG0046|consensus
Probab=91.08 E-value=0.53 Score=37.88 Aligned_cols=29 Identities=24% Similarity=0.514 Sum_probs=25.3
Q ss_pred HHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 29 RDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 29 ~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
++.++.++.+.+.|.||+|+|+||+....
T Consensus 56 ~eei~~~l~~~~~~~~g~v~fe~f~~~~~ 84 (627)
T KOG0046|consen 56 REEIKEILGEVGVDADGRVEFEEFVGIFL 84 (627)
T ss_pred HHHHHHHHhccCCCcCCccCHHHHHHHHH
Confidence 46677889999999999999999999655
No 73
>PF08976 DUF1880: Domain of unknown function (DUF1880); InterPro: IPR015070 This entry represents EF-hand calcium-binding domain-containing protein 6 that negatively regulates the androgen receptor by recruiting histone deacetylase complex, and protein DJ-1 antagonises this inhibition by abrogation of this complex [].; PDB: 1WLZ_C.
Probab=89.83 E-value=0.35 Score=31.55 Aligned_cols=32 Identities=19% Similarity=0.375 Sum_probs=21.9
Q ss_pred CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 23 DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 23 ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
|||++.+ +++.++-+|-.|++.|.||..-...
T Consensus 4 LtDeQFd----rLW~e~Pvn~~GrLkY~eFL~kfs~ 35 (118)
T PF08976_consen 4 LTDEQFD----RLWNEMPVNAKGRLKYQEFLSKFSS 35 (118)
T ss_dssp --HHHHH----HHHTTS-B-TTS-EEHHHHHHHT--
T ss_pred ccHHHhh----hhhhhCcCCccCCEeHHHHHHHccc
Confidence 7888755 6799999999999999999977663
No 74
>KOG0030|consensus
Probab=89.61 E-value=1.2 Score=30.19 Aligned_cols=77 Identities=17% Similarity=0.194 Sum_probs=47.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC------ChHHHHHhcc--cCCCh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD------EPLLLEAFGQ--CLPSD 73 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~------~p~~~~~~~~--~l~~~ 73 (86)
++.++.+-++.+.- + ..+..+-+ +.+ +-++.+|++++|.|.+.|+.+.+.+ ..++-+.+.- +..++
T Consensus 66 l~FE~fLpm~q~va-k-nk~q~t~e---dfv-egLrvFDkeg~G~i~~aeLRhvLttlGekl~eeEVe~Llag~eD~nG~ 139 (152)
T KOG0030|consen 66 LDFEEFLPMYQQVA-K-NKDQGTYE---DFV-EGLRVFDKEGNGTIMGAELRHVLTTLGEKLTEEEVEELLAGQEDSNGC 139 (152)
T ss_pred hhHHHHHHHHHHHH-h-ccccCcHH---HHH-HHHHhhcccCCcceeHHHHHHHHHHHHhhccHHHHHHHHccccccCCc
Confidence 45666777766633 2 12334444 344 4578999999999999999998874 3344444432 33456
Q ss_pred HHHHHHHHHhh
Q psy8460 74 AARQSFLSTLQ 84 (86)
Q Consensus 74 ~~~~~~~~~~~ 84 (86)
....+|.+...
T Consensus 140 i~YE~fVk~i~ 150 (152)
T KOG0030|consen 140 INYEAFVKHIM 150 (152)
T ss_pred CcHHHHHHHHh
Confidence 66666665543
No 75
>KOG0036|consensus
Probab=89.46 E-value=1.2 Score=34.88 Aligned_cols=29 Identities=17% Similarity=0.618 Sum_probs=20.4
Q ss_pred HHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 31 LVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 31 ~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
.+..+++.+|.|.||.++|+||.+-+..+
T Consensus 52 ~~~~l~~~~d~~~dg~vDy~eF~~Y~~~~ 80 (463)
T KOG0036|consen 52 AAKMLFSAMDANRDGRVDYSEFKRYLDNK 80 (463)
T ss_pred HHHHHHHhcccCcCCcccHHHHHHHHHHh
Confidence 34456777777777777777777777653
No 76
>PF00427 PBS_linker_poly: Phycobilisome Linker polypeptide; InterPro: IPR001297 Phycobilisomes (PBSs) are the major light-harvesting systems in cyanobacteria and red algae. PBS is a supercomplex that is composed of a core complex and multiple peripheral rod complexes. Typically, the core consists of two or five cylinders lying on the membrane with, in most cases, multiple rods radiating from the core to form a hemidiscoidal structure. The building units of the core cylinders and the peripheral rods are trimeric and hexameric discs, in which a monomer consists of a pair of related phycobiliproteins (PBPs), such as phycorerythrins, phycoerythrocyanins, phycocyanins, and allophycocyanins. The discs are connected to each other via specific linker polypeptides to form peripheral rods or core cylinders. Linker polypeptides share a conserved domain of ~180 residues, which can be present in one or multiple copies [, , , , ].; GO: 0015979 photosynthesis, 0030089 phycobilisome; PDB: 2L8V_A 2KY4_A 3OSJ_D 2L06_A 3NPH_B 2L3W_A 3PRU_C 3OHW_A.
Probab=88.96 E-value=1.6 Score=28.82 Aligned_cols=69 Identities=16% Similarity=0.128 Sum_probs=40.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCC-CChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCChHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDE-DPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPSDAAR 76 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~-ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~~~~~ 76 (86)
.|.+|+..++++.+..-.|.. ..+.+...-++..|+ +|.||-.||++.+.+++.+.+.|-...|..+.+
T Consensus 5 ~s~~~~~~vI~AaYrQVf~~~~~~~~er~~~lESqlr------ng~IsVreFVr~La~S~~yr~~f~~~~~~~R~i 74 (131)
T PF00427_consen 5 SSEEELEAVIRAAYRQVFGNDHPMESERLISLESQLR------NGQISVREFVRALAKSELYRKRFFEPNSNYRFI 74 (131)
T ss_dssp S-HHHHHHHHHHHHHHHHSSSSSHCSHHTHHHHHHHH------TTSS-HHHHHHHHHTSHHHHHHHTTTS-HHHHH
T ss_pred CCHHHHHHHHHHHHHHHhcCccchhhhccchHHHHHH------cCCCcHHHHHHHHHcCHHHHHHHcccccchHHH
Confidence 467788888886543211111 112121223333343 899999999999999999998776655544433
No 77
>KOG4666|consensus
Probab=88.60 E-value=0.7 Score=35.39 Aligned_cols=34 Identities=21% Similarity=0.234 Sum_probs=29.6
Q ss_pred HHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhc
Q psy8460 34 LALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFG 67 (86)
Q Consensus 34 ~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~ 67 (86)
.+|...+...||+|+|++|.+.+.+.|.+..++.
T Consensus 335 ~lf~~i~q~d~~ki~~~~f~~fa~~~p~~a~~~~ 368 (412)
T KOG4666|consen 335 VLFPSIEQKDDPKIYASNFRKFAATEPNLALSEL 368 (412)
T ss_pred ccchhhhcccCcceeHHHHHHHHHhCchhhhhhh
Confidence 3577788888999999999999999999996654
No 78
>KOG1955|consensus
Probab=88.10 E-value=0.84 Score=36.80 Aligned_cols=33 Identities=21% Similarity=0.363 Sum_probs=26.9
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.++.-+|+. .||+..|.|.||-+++.||..++.
T Consensus 260 Sklpi~ELs----hIWeLsD~d~DGALtL~EFcAAfH 292 (737)
T KOG1955|consen 260 SKLPIEELS----HIWELSDVDRDGALTLSEFCAAFH 292 (737)
T ss_pred ccCchHHHH----HHHhhcccCccccccHHHHHhhHh
Confidence 356666655 569999999999999999998764
No 79
>COG4860 Uncharacterized protein conserved in archaea [Function unknown]
Probab=86.54 E-value=1.7 Score=29.59 Aligned_cols=56 Identities=20% Similarity=0.274 Sum_probs=39.3
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHh
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~ 66 (86)
|+|-+|+..|+...+ ++++|+++..+++.+.+. +|++..+ ...+.+-.+|.++...
T Consensus 93 ~~Sl~dL~dii~~~f-------~sdeev~ey~~ei~~l~e-~g~ts~~--~vt~~Ln~~p~~irai 148 (170)
T COG4860 93 MGSLSDLADIIYAAF-------LSDEEVKEYEDEIKALME-EGNTSFL--DVTDTLNISPTLIRAI 148 (170)
T ss_pred EEeHHHHHHHHHHHh-------CCHHHHHHHHHHHHHHHH-cCCceEe--ehhhhcCCChHHHHHH
Confidence 578889999888643 789999999888888776 3444444 4555666666666544
No 80
>KOG0377|consensus
Probab=86.26 E-value=0.8 Score=36.45 Aligned_cols=26 Identities=27% Similarity=0.529 Sum_probs=22.6
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
++-+|.-+|.|+.|.||.+||..++.
T Consensus 549 LetiF~~iD~D~SG~isldEF~~a~~ 574 (631)
T KOG0377|consen 549 LETIFNIIDADNSGEISLDEFRTAWK 574 (631)
T ss_pred HHHHHHHhccCCCCceeHHHHHHHHH
Confidence 34579999999999999999998765
No 81
>KOG0040|consensus
Probab=84.20 E-value=3.3 Score=37.53 Aligned_cols=35 Identities=17% Similarity=0.542 Sum_probs=30.6
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
..|++.+.++ .-+|+.+|.+.+|.++|.+|.-+++
T Consensus 2246 GVtEe~L~EF-s~~fkhFDkek~G~Ldhq~F~sCLr 2280 (2399)
T KOG0040|consen 2246 GVTEEQLKEF-SMMFKHFDKEKNGRLDHQHFKSCLR 2280 (2399)
T ss_pred CCCHHHHHHH-HHHHHHhchhhccCCcHHHHHHHHH
Confidence 4888888874 5789999999999999999998876
No 82
>KOG2643|consensus
Probab=80.64 E-value=0.89 Score=35.83 Aligned_cols=22 Identities=27% Similarity=0.567 Sum_probs=19.7
Q ss_pred HHHhcCCCCCCcccHHHHHHHH
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l 56 (86)
-|+-+|.||||.|+.+||..+.
T Consensus 238 AFKMFD~dgnG~IdkeEF~~v~ 259 (489)
T KOG2643|consen 238 AFKMFDLDGNGEIDKEEFETVQ 259 (489)
T ss_pred eeeeeecCCCCcccHHHHHHHH
Confidence 3778999999999999999776
No 83
>PF00404 Dockerin_1: Dockerin type I repeat; InterPro: IPR018242 Gram-positive, thermophilic anaerobes such as Clostridium thermocellum or Clostridium cellulolyticum secretes a highly active and thermostable cellulase complex (cellulosome) responsible for the degradation of crystalline cellulose [, ]. The cellulosome contains at least 30 polypeptides, the majority of the enzymes are endoglucanases (3.2.1.4 from EC), but there are also some xylanases (3.2.1.8 from EC), beta-glucosidases (3.2.1.21 from EC) and endo-beta-1,3-1,4-glucanases (3.2.1.73 from EC). Complete sequence data for many of these enzymes has been obtained. A majority of these proteins contain a highly conserved type I dockerin domain of about 65 to 70 residues, which is generally (but not always) located in the C terminus. The dockerin domain is the binding partner of the cohesin domain (see IPR002102 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The dockerin domain contains a tandem repeat of two calcium-binding loop-helix motifs (distinct from EF-hand Ca-binding motifs). These motifs are about 24 amino acids in length. This entry represents these repeated Ca-binding motifs.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3P0D_J 1OHZ_B 2CCL_B 1DAV_A 1DAQ_A 2VN5_B 2VN6_B.
Probab=79.82 E-value=2.2 Score=19.62 Aligned_cols=15 Identities=33% Similarity=0.532 Sum_probs=11.5
Q ss_pred CCCCCCcccHHHHHH
Q psy8460 40 DYDKDGKISFQDFQQ 54 (86)
Q Consensus 40 D~d~DG~Is~eEF~~ 54 (86)
|.|+||+|+=-++.-
T Consensus 1 DvN~DG~vna~D~~~ 15 (21)
T PF00404_consen 1 DVNGDGKVNAIDLAL 15 (21)
T ss_dssp -TTSSSSSSHHHHHH
T ss_pred CCCCCCcCCHHHHHH
Confidence 689999999877653
No 84
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=79.14 E-value=4.6 Score=24.66 Aligned_cols=30 Identities=17% Similarity=0.362 Sum_probs=23.2
Q ss_pred HHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
.+++.+.+....+|-+|.++.+..+|.+++
T Consensus 62 ~~~~l~~~~~~~ls~~e~~~~l~~~p~Lik 91 (105)
T cd02977 62 PYRKLGLADKDELSDEEALELMAEHPKLIK 91 (105)
T ss_pred hHHHcCCccccCCCHHHHHHHHHhCcCeee
Confidence 345555553467899999999999999875
No 85
>KOG4251|consensus
Probab=77.78 E-value=2.4 Score=31.51 Aligned_cols=34 Identities=24% Similarity=0.316 Sum_probs=28.8
Q ss_pred ChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 24 PDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
|-.-+..||..+.+..|.|||..+|-.||+...-
T Consensus 230 SrgmLrfmVkeivrdlDqdgDkqlSvpeFislpv 263 (362)
T KOG4251|consen 230 SRGMLRFMVKEIVRDLDQDGDKQLSVPEFISLPV 263 (362)
T ss_pred hhhhHHHHHHHHHHHhccCCCeeecchhhhcCCC
Confidence 4446778999999999999999999999996554
No 86
>KOG2562|consensus
Probab=76.06 E-value=3.9 Score=32.46 Aligned_cols=28 Identities=14% Similarity=0.445 Sum_probs=22.0
Q ss_pred HHHHHHHH----hcCCCCCCcccHHHHHHHHh
Q psy8460 30 DLVELALR----KMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 30 ~~vd~i~~----~~D~d~DG~Is~eEF~~~l~ 57 (86)
-+||+||. ..-.-.+|+++|++|+-.+.
T Consensus 311 ~ivdRIFs~v~r~~~~~~eGrmdykdFv~Fil 342 (493)
T KOG2562|consen 311 RIVDRIFSQVPRGFTVKVEGRMDYKDFVDFIL 342 (493)
T ss_pred HHHHHHHhhccccceeeecCcccHHHHHHHHH
Confidence 57888888 45556789999999987775
No 87
>KOG3866|consensus
Probab=74.55 E-value=4 Score=31.27 Aligned_cols=43 Identities=21% Similarity=0.417 Sum_probs=32.2
Q ss_pred HHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC----Ch-HHHHHhcc
Q psy8460 26 EGVRDLVELALRKMDYDKDGKISFQDFQQSVTD----EP-LLLEAFGQ 68 (86)
Q Consensus 26 ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~----~p-~~~~~~~~ 68 (86)
++.-.|-+.+++..|+|.|.-|+++||++...+ .| +=++.+++
T Consensus 292 EErlRMREHVMk~vDtNqDRlvtleEFL~~t~~kef~~p~e~WEtl~q 339 (442)
T KOG3866|consen 292 EERLRMREHVMKQVDTNQDRLVTLEEFLNDTDNKEFNPPKEEWETLGQ 339 (442)
T ss_pred HHHHHHHHHHHHhcccchhhhhhHHHHHhhhhhcccCCcchhhhhhcc
Confidence 344456678899999999999999999988775 33 44555554
No 88
>cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine.
Probab=74.09 E-value=4.3 Score=25.30 Aligned_cols=30 Identities=17% Similarity=0.290 Sum_probs=22.1
Q ss_pred HHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
.+++.+.+....+|-+|.+..|..+|.+++
T Consensus 60 ~yk~l~l~~~~~~s~~e~~~~l~~~p~Lik 89 (105)
T cd03035 60 TWRKLDDAQKAALDAAKAIALMLEHPSLIK 89 (105)
T ss_pred HHHhCChhhhccCCHHHHHHHHHhCcCeee
Confidence 345555553345888999999999999875
No 89
>KOG1029|consensus
Probab=73.42 E-value=2.6 Score=35.70 Aligned_cols=27 Identities=26% Similarity=0.305 Sum_probs=22.7
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+..|+-..|.|+||+++-+||+-+|.-
T Consensus 231 LA~IW~LsDvd~DGkL~~dEfilam~l 257 (1118)
T KOG1029|consen 231 LAHIWTLSDVDGDGKLSADEFILAMHL 257 (1118)
T ss_pred HhhheeeeccCCCCcccHHHHHHHHHH
Confidence 346788999999999999999977753
No 90
>KOG4004|consensus
Probab=72.97 E-value=2.3 Score=30.64 Aligned_cols=28 Identities=14% Similarity=0.237 Sum_probs=22.7
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.-+.+.|...|.|+||.|+..||-..+.
T Consensus 222 ~c~~~f~e~cd~~nd~~ial~ew~~c~g 249 (259)
T KOG4004|consen 222 HCTTRFFETCDLDNDKYIALDEWAGCFG 249 (259)
T ss_pred hhchhhhhcccCCCCCceeHHHhhcccC
Confidence 3455789999999999999999876543
No 91
>cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif.
Probab=72.91 E-value=10 Score=23.71 Aligned_cols=28 Identities=18% Similarity=0.533 Sum_probs=21.9
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++.+.+. ..+|-+|.++.|..+|.+++
T Consensus 64 ~k~l~~~~-~~ls~~e~i~~l~~~p~Lik 91 (115)
T cd03032 64 FKNLNIDI-DELSLSELIRLISEHPSLLR 91 (115)
T ss_pred HHHcCCCc-ccCCHHHHHHHHHhChhhee
Confidence 45555544 56899999999999999975
No 92
>PF05517 p25-alpha: p25-alpha ; InterPro: IPR008907 This family encodes a 25 kDa protein that is phosphorylated by a Ser/Thr-Pro kinase []. It has been described as a brain specific protein, but it is found in Tetrahymena thermophila.; PDB: 1WLM_A 1PUL_A 2JRF_A.
Probab=71.98 E-value=4.5 Score=26.99 Aligned_cols=33 Identities=18% Similarity=0.388 Sum_probs=23.4
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
..++... +|.+|.++-..+..+|+|++|..+|.
T Consensus 36 ~k~t~td----vDiiF~Kvk~k~~~~I~f~~F~~aL~ 68 (154)
T PF05517_consen 36 KKLTSTD----VDIIFSKVKAKGARKITFEQFLEALA 68 (154)
T ss_dssp SSS-HHH----HHHHHHHHT-SS-SEEEHHHHHHHHH
T ss_pred CCCchHH----HHHHHHHhhcCCCcccCHHHHHHHHH
Confidence 3466664 55678987767778899999999986
No 93
>PF02864 STAT_bind: STAT protein, DNA binding domain; InterPro: IPR013801 The STAT protein (Signal Transducers and Activators of Transcription) family contains transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors, hence they act as signal transducers in the cytoplasm and transcription activators in the nucleus []. Binding of these factors to cell-surface receptors leads to receptor autophosphorylation at a tyrosine, the phosphotyrosine being recognised by the STAT SH2 domain, which mediates the recruitment of STAT proteins from the cytosol and their association with the activated receptor. The STAT proteins are then activated by phosphorylation via members of the JAK family of protein kinases, causing them to dimerise and translocated to the nucleus, where they bind to specific promoter sequences in target genes. In mammals, STATs comprise a family of seven structurally and functionally related proteins: Stat1, Stat2, Stat3, Stat4, Stat5a and Stat5b, Stat6. STAT proteins play a critical role in regulating innate and acquired host immune responses. Dysregulation of at least two STAT signalling cascades (i.e. Stat3 and Stat5) is associated with cellular transformation. Signalling through the JAK/STAT pathway is initiated when a cytokine binds to its corresponding receptor. This leads to conformational changes in the cytoplasmic portion of the receptor, initiating activation of receptor associated members of the JAK family of kinases. The JAKs, in turn, mediate phosphorylation at the specific receptor tyrosine residues, which then serve as docking sites for STATs and other signalling molecules. Once recruited to the receptor, STATs also become phosphorylated by JAKs, on a single tyrosine residue. Activated STATs dissociate from the receptor, dimerise, translocate to the nucleus and bind to members of the GAS (gamma activated site) family of enhancers. The seven STAT proteins identified in mammals range in size from 750 and 850 amino acids. The chromosomal distribution of these STATs, as well as the identification of STATs in more primitive eukaryotes, suggest that this family arose from a single primordial gene. STATs share structurally and functionally conserved domains including: an N-terminal domain that strengthens interactions between STAT dimers on adjacent DNA-binding sites; a coiled-coil STAT domain that is implicated in protein-protein interactions; a DNA-binding domain with an immunoglobulin-like fold similar to p53 tumour suppressor protein; an EF-hand-like linker domain connecting the DNA-binding and SH2 domains; an SH2 domain (IPR000980 from INTERPRO) that acts as a phosphorylation-dependent switch to control receptor recognition and DNA-binding; and a C-terminal transactivation domain []. The crystal structure of the N terminus of Stat4 reveals a dimer. The interface of this dimer is formed by a ring-shaped element consisting of five short helices. Several studies suggest that this N-terminal dimerisation promotes cooperativity of binding to tandem GAS elements and with the transcriptional coactivator CBP/p300. This entry represents the DNA-binding domain, which has an immunoglobulin-like structural fold.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0004871 signal transducer activity, 0006355 regulation of transcription, DNA-dependent, 0007165 signal transduction, 0005634 nucleus; PDB: 1YVL_A 1BF5_A 1Y1U_B 3CWG_B 1BG1_A.
Probab=71.49 E-value=6.3 Score=28.82 Aligned_cols=51 Identities=18% Similarity=0.066 Sum_probs=33.4
Q ss_pred HHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHH
Q psy8460 5 RSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQS 55 (86)
Q Consensus 5 eEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~ 55 (86)
.+|.++|..-+....|-.|+++++..+.++++..-....++.||+..|.+-
T Consensus 182 ~ql~~~L~~~F~~~~~R~L~~~~L~~L~~Kl~~~~~~~~~~~isw~~F~Ke 232 (254)
T PF02864_consen 182 PQLSEALSWQFSSETGRGLTDEQLQYLAEKLFGQNSSYNNMLISWSQFCKE 232 (254)
T ss_dssp HHHHHHHHHHHHHHSS----HHHHHHHHHHHHTSSS-GCC-EEEHHHHHTS
T ss_pred HHHHHHHHHHHHHhhCCCCCHHHHHHHHHHHhCCcccCCCceeEHHHhhhc
Confidence 456666665332223356999999999999999777778999999999743
No 94
>PF08766 DEK_C: DEK C terminal domain; InterPro: IPR014876 DEK is a chromatin associated protein that is linked with cancers and autoimmune disease. This domain is found at the C-terminal of DEK and is of clinical importance since it can reverse the characteristic abnormal DNA-mutagen sensitivity in fibroblasts from ataxia-telangiectasia (A-T) patients []. The structure of this domain shows it to be homologous to the E2F/DP transcription factor family []. This domain is also found in chitin synthase proteins like Q8TF96 from SWISSPROT, and in protein phosphatases such as Q6NN85 from SWISSPROT. ; PDB: 1Q1V_A.
Probab=71.28 E-value=15 Score=20.12 Aligned_cols=47 Identities=19% Similarity=0.183 Sum_probs=21.1
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQS 55 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~ 55 (86)
|.+++...++.++....-+.+|..+++..++.-| .+|.. ++..|++.
T Consensus 1 td~~i~~~i~~iL~~~dl~~vT~k~vr~~Le~~~-~~dL~-----~~K~~I~~ 47 (54)
T PF08766_consen 1 TDEEIREAIREILREADLDTVTKKQVREQLEERF-GVDLS-----SRKKFIKE 47 (54)
T ss_dssp -HHHHHHHHHHHHTTS-GGG--HHHHHHHHHHH--SS--S-----HHHHHHHH
T ss_pred CHHHHHHHHHHHHHhCCHhHhhHHHHHHHHHHHH-CCCcH-----HHHHHHHH
Confidence 4566666666666431113466666665554444 33332 55555543
No 95
>KOG4578|consensus
Probab=70.28 E-value=3.6 Score=31.67 Aligned_cols=32 Identities=22% Similarity=0.279 Sum_probs=24.7
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
.-..++++-.|+|+|-+||+.|+...+.-.++
T Consensus 370 kC~rk~~~yCDlNkDKkISl~Ew~~CL~~~~~ 401 (421)
T KOG4578|consen 370 KCSRKFFKYCDLNKDKKISLDEWRGCLGVEKE 401 (421)
T ss_pred HHhhhcchhcccCCCceecHHHHhhhhccccc
Confidence 34456788889999999999998888776554
No 96
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=69.93 E-value=11 Score=22.44 Aligned_cols=24 Identities=13% Similarity=0.202 Sum_probs=20.1
Q ss_pred HHHHhcCCC--CCCcccHHHHHHHHh
Q psy8460 34 LALRKMDYD--KDGKISFQDFQQSVT 57 (86)
Q Consensus 34 ~i~~~~D~d--~DG~Is~eEF~~~l~ 57 (86)
.+|.+++.. .+|.|+.+||..++.
T Consensus 12 ~~f~~y~~~~~~~~~Is~~El~~ll~ 37 (88)
T cd05030 12 NVFHQYSVRKGHPDTLYKKEFKQLVE 37 (88)
T ss_pred HHHHHHhccCCCcccCCHHHHHHHHH
Confidence 678888765 479999999999996
No 97
>PF06207 DUF1002: Protein of unknown function (DUF1002); InterPro: IPR009343 This protein family has no known function. Its members are about 300 amino acids in length. It has so far been detected in Firmicute bacteria and some archaebacteria.
Probab=69.01 E-value=27 Score=25.01 Aligned_cols=19 Identities=26% Similarity=0.534 Sum_probs=8.9
Q ss_pred ChHHHHHHHHHHHHhcCCC
Q psy8460 24 PDEGVRDLVELALRKMDYD 42 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~D~d 42 (86)
++++++.+|.++....+.+
T Consensus 173 t~~eI~~IV~~~~~~~~i~ 191 (225)
T PF06207_consen 173 TDEEIRNIVNNVLNNYNIN 191 (225)
T ss_pred CHHHHHHHHHHHHHHcCCC
Confidence 4444444444444444443
No 98
>PF14203 DUF4319: Domain of unknown function (DUF4319); PDB: 2L7K_A.
Probab=67.79 E-value=12 Score=21.82 Aligned_cols=42 Identities=26% Similarity=0.343 Sum_probs=29.7
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQ 53 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~ 53 (86)
+||..+..-++..++. +.+.++.+++++++.+.+. ++=+||-
T Consensus 16 g~R~~~i~~l~~~lp~-----m~d~em~eLa~~tl~KL~~-----mtD~ef~ 57 (64)
T PF14203_consen 16 GSREGTIENLTAALPY-----MDDSEMRELAESTLRKLDA-----MTDAEFA 57 (64)
T ss_dssp SHHHHHHHHHHHSTTT-----S--HHHHHHHHHHHHHHTT-------HHHHH
T ss_pred CCHHHHHHHHHHHHHH-----hchHHHHHHHHHHHHHHHh-----cCHHHHH
Confidence 6788888888886543 6788899999999999885 5555554
No 99
>PF10591 SPARC_Ca_bdg: Secreted protein acidic and rich in cysteine Ca binding region; InterPro: IPR019577 This entry represents the calcium-binding domain found in SPARC (Secreted Protein Acidic and Rich in Cysteine) and Testican (also known as SPOCK; or SParc/Osteonectin, Cwcv and Kazal-like domains) proteins. SPARC proteins are down-regulated in various tumours and may have a tumour-suppressor function [, ]. Testican-3 appears to be a novel regulator that reduces the activity of matrix metalloproteinase (MMP) in adult T-cell leukemia (ATL) []. This cysteine-rich domain is responsible for the anti-spreading activity of human urothelial cells. This extracellular calcium-binding domain is rich in alpha-helices and contains two EF-hands that each coordinates one Ca2+ ion, forming a helix-loop-helix structure that not only drives the conformation of the protein but is also necessary for biological activity. The anti-spreading activity was dependent on the coordination of Ca2+ by a Glu residue at the Z position of EF-hand 2 []. ; GO: 0005509 calcium ion binding, 0007165 signal transduction, 0005578 proteinaceous extracellular matrix; PDB: 1BMO_A 1SRA_A 2V53_A 1NUB_B.
Probab=67.75 E-value=2.1 Score=27.26 Aligned_cols=29 Identities=17% Similarity=0.373 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHhcCCCCCCcccHHHHHHH
Q psy8460 27 GVRDLVELALRKMDYDKDGKISFQDFQQS 55 (86)
Q Consensus 27 e~~~~vd~i~~~~D~d~DG~Is~eEF~~~ 55 (86)
.....|.-.|...|.|+||.++..|+..+
T Consensus 51 ~~~~~~~W~F~~LD~n~d~~L~~~El~~l 79 (113)
T PF10591_consen 51 ECKRVVHWKFCQLDRNKDGVLDRSELKPL 79 (113)
T ss_dssp GGHHHHHHHHHHH--T-SSEE-TTTTGGG
T ss_pred hhhhhhhhhHhhhcCCCCCccCHHHHHHH
Confidence 34466777899999999999999996543
No 100
>KOG0031|consensus
Probab=67.37 E-value=4.8 Score=27.74 Aligned_cols=53 Identities=15% Similarity=0.228 Sum_probs=34.2
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHhC------ChHHHHHhcccCC---ChHHHHHHHHH
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVTD------EPLLLEAFGQCLP---SDAARQSFLST 82 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~~------~p~~~~~~~~~l~---~~~~~~~~~~~ 82 (86)
+.|-.-|+.+|.+++|.|.-+.+..++.. ..++=+.++...| ++-+-++|-+.
T Consensus 101 ~~I~~AF~~FD~~~~G~I~~d~lre~Ltt~gDr~~~eEV~~m~r~~p~d~~G~~dy~~~~~~ 162 (171)
T KOG0031|consen 101 EVILNAFKTFDDEGSGKIDEDYLRELLTTMGDRFTDEEVDEMYREAPIDKKGNFDYKAFTYI 162 (171)
T ss_pred HHHHHHHHhcCccCCCccCHHHHHHHHHHhcccCCHHHHHHHHHhCCcccCCceeHHHHHHH
Confidence 55667788999999999998888888874 2333344554444 34344455443
No 101
>KOG0038|consensus
Probab=67.17 E-value=7.2 Score=26.88 Aligned_cols=23 Identities=26% Similarity=0.445 Sum_probs=18.5
Q ss_pred HHHhcCCCCCCcccHHHHHHHHh
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
|-.-+.-||+|.+||++|+.+.+
T Consensus 76 i~e~FSeDG~GnlsfddFlDmfS 98 (189)
T KOG0038|consen 76 ICEVFSEDGRGNLSFDDFLDMFS 98 (189)
T ss_pred HHHHhccCCCCcccHHHHHHHHH
Confidence 34456679999999999998775
No 102
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=66.83 E-value=5.5 Score=19.92 Aligned_cols=29 Identities=21% Similarity=0.340 Sum_probs=20.9
Q ss_pred CCCcccHHHHHHHHhCChHHHHHhcccCCC
Q psy8460 43 KDGKISFQDFQQSVTDEPLLLEAFGQCLPS 72 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~ 72 (86)
+.|.|+++||+.+..+--.|++. ..++|.
T Consensus 1 ~~~~i~~~~~~d~a~rv~~f~~~-ngRlPn 29 (33)
T PF09373_consen 1 SSGTISKEEYLDMASRVNNFYES-NGRLPN 29 (33)
T ss_pred CCceecHHHHHHHHHHHHHHHHH-cCCCCC
Confidence 36889999999998887777653 334553
No 103
>PF07637 PSD5: Protein of unknown function (DUF1595); InterPro: IPR013043 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013042 from INTERPRO.
Probab=64.68 E-value=23 Score=20.12 Aligned_cols=56 Identities=16% Similarity=0.044 Sum_probs=31.3
Q ss_pred HHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHH
Q psy8460 8 MYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLL 63 (86)
Q Consensus 8 ~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~ 63 (86)
.+++.........-+++++|++.+++-.-......+|-.-...+=+..+..+|.|+
T Consensus 4 ~~~l~~Fa~rAfRRp~~~~e~~~~~~~~~~~~~~g~~~~~a~~~~l~aiL~SP~FL 59 (64)
T PF07637_consen 4 REILRRFARRAFRRPLTDEEVDRYLALYDSARAQGEDFEEALKEALQAILCSPSFL 59 (64)
T ss_pred HHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHcCcchh
Confidence 34444433222334789998887664443333333333444555667777788875
No 104
>TIGR00988 hip integration host factor, beta subunit. This protein forms a site-specific DNA-binding heterodimer with the homologous integration host factor alpha subunit. It is closely related to the DNA-binding protein HU.
Probab=63.83 E-value=8.3 Score=23.23 Aligned_cols=45 Identities=22% Similarity=0.289 Sum_probs=30.5
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
++.|+...+....+ .+|..++..+++.++..+ -+..++.|.+.+|
T Consensus 2 ~k~eli~~i~~~~~-----~~s~~~v~~vv~~~~~~i~~~L~~g~~V~l~gf 48 (94)
T TIGR00988 2 TKSELIERIATQQS-----HLPAKDVEDAVKTMLEHMASALAQGDRIEIRGF 48 (94)
T ss_pred CHHHHHHHHHHHcC-----CCCHHHHHHHHHHHHHHHHHHHHcCCeEEEcCc
Confidence 67888777765321 378888888887777754 4556667776665
No 105
>PRK10026 arsenate reductase; Provisional
Probab=62.95 E-value=16 Score=24.21 Aligned_cols=30 Identities=13% Similarity=0.318 Sum_probs=23.4
Q ss_pred HHHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 34 LALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 34 ~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
..+++.+.+.+ .+|.+|.+..|..+|.+++
T Consensus 64 ~~yr~L~~~~~-~ls~~e~l~ll~~~P~LIK 93 (141)
T PRK10026 64 EPYEELGLAED-KFTDDQLIDFMLQHPILIN 93 (141)
T ss_pred chHHHcCCCcc-CCCHHHHHHHHHhCcccee
Confidence 34666666654 4899999999999999885
No 106
>KOG1029|consensus
Probab=62.94 E-value=8.3 Score=32.86 Aligned_cols=28 Identities=21% Similarity=0.421 Sum_probs=24.8
Q ss_pred HHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 31 LVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 31 ~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++.+|+...|+|+||+.+..||.-+|.-
T Consensus 50 VLaqIWALsDldkDGrmdi~EfSIAmkL 77 (1118)
T KOG1029|consen 50 VLAQIWALSDLDKDGRMDIREFSIAMKL 77 (1118)
T ss_pred HHHHHHHhhhcCccccchHHHHHHHHHH
Confidence 5678899999999999999999987763
No 107
>PRK13344 spxA transcriptional regulator Spx; Reviewed
Probab=60.11 E-value=17 Score=23.60 Aligned_cols=28 Identities=18% Similarity=0.438 Sum_probs=21.6
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++.+.+. ..+|-+|.+..+..+|.+++
T Consensus 64 ~k~L~~~~-~~ls~~e~i~ll~~~P~Lik 91 (132)
T PRK13344 64 AKALDCDI-EELSVNEVIDLIQENPRILK 91 (132)
T ss_pred HHhCCcch-hcCCHHHHHHHHHhCcccee
Confidence 45555443 56889999999999999974
No 108
>cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases.
Probab=59.82 E-value=25 Score=21.91 Aligned_cols=28 Identities=14% Similarity=0.438 Sum_probs=21.4
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++...+. ..++-+|.++.+..+|.+++
T Consensus 63 y~~l~~~~-~~ls~~e~i~ll~~~P~Lik 90 (112)
T cd03034 63 YKELGLAD-PELSDEELIDAMAAHPILIE 90 (112)
T ss_pred HHHcCCCc-cCCCHHHHHHHHHhCcCccc
Confidence 44545543 46889999999999999985
No 109
>PRK12559 transcriptional regulator Spx; Provisional
Probab=59.61 E-value=20 Score=23.22 Aligned_cols=55 Identities=13% Similarity=0.277 Sum_probs=33.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
.|++|+..++... + ...+++-.-=...+++.+.+.+ .+|-+|.+..|..+|.+++
T Consensus 37 ~s~~el~~~l~~~--~-----~g~~~lin~~~~~~k~l~~~~~-~ls~~e~i~ll~~~P~Lik 91 (131)
T PRK12559 37 MTVDELKSILRLT--E-----EGATEIISTRSKTFQDLNINIE-ELSLNEFYKLIIEHPLMLR 91 (131)
T ss_pred CCHHHHHHHHHHc--C-----CCHHHHHhcCcHHHHhCCCCcc-cCCHHHHHHHHHhCcceEe
Confidence 4667777776652 1 1111111111234567776653 5789999999999999985
No 110
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=59.23 E-value=42 Score=22.50 Aligned_cols=59 Identities=12% Similarity=0.095 Sum_probs=37.9
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHh
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~ 66 (86)
+|+|.++.|+..+.+ ++++|.+++++..=+.+|--+...-|.+|-++-+..-.++.+..
T Consensus 2 ~k~efL~~L~~~L~~-----lp~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~~iA~~i 60 (181)
T PF08006_consen 2 NKNEFLNELEKYLKK-----LPEEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPKEIAREI 60 (181)
T ss_pred CHHHHHHHHHHHHHc-----CCHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHHHHHHHH
Confidence 789999999987744 77777777766554444433333357777777766544444433
No 111
>TIGR00014 arsC arsenate reductase (glutaredoxin). composed of two polypeptides, the products of the arsA and arsB genes. The pump alone produces resistance to arsenite and antimonite. This protein, ArsC, catalyzes the reduction of arsenate to arsenite, and thus extends resistance to include arsenate.
Probab=57.13 E-value=27 Score=21.84 Aligned_cols=28 Identities=21% Similarity=0.465 Sum_probs=20.7
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++.+.+. ..++-+|.++.|..+|.+++
T Consensus 64 ~~~l~~~~-~~ls~~e~i~~l~~~P~Lik 91 (114)
T TIGR00014 64 YKELGLSD-PNLSDQELLDAMVAHPILLE 91 (114)
T ss_pred HHHcCCCc-cCCCHHHHHHHHHHCcCccc
Confidence 34444443 36788999999999999985
No 112
>KOG4065|consensus
Probab=56.87 E-value=16 Score=24.18 Aligned_cols=35 Identities=14% Similarity=0.131 Sum_probs=29.2
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.++|+++++- ..|+-.|+|+||.|+=-|.++++.-
T Consensus 61 a~mtpeqlqf---HYF~MHDldknn~lDGiEl~kAiTH 95 (144)
T KOG4065|consen 61 AKMTPEQLQF---HYFSMHDLDKNNFLDGIELLKAITH 95 (144)
T ss_pred hhCCHHHHhh---hhhhhhccCcCCcchHHHHHHHHHH
Confidence 5799998874 4688899999999999998887763
No 113
>PF02269 TFIID-18kDa: Transcription initiation factor IID, 18kD subunit; InterPro: IPR003195 This family includes the Spt3 yeast transcription factors and the 18 kDa subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone fold [].; GO: 0006366 transcription from RNA polymerase II promoter; PDB: 1BH9_A 1BH8_A.
Probab=56.82 E-value=16 Score=22.48 Aligned_cols=43 Identities=19% Similarity=0.236 Sum_probs=3.8
Q ss_pred HHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCC
Q psy8460 29 RDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLP 71 (86)
Q Consensus 29 ~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~ 71 (86)
.+++....+-+...|.|+|+.++|+=.++++|.-+.-+..-|.
T Consensus 37 ~~l~~~A~~~a~~rg~~~i~~eDl~F~lR~D~~Kl~Rl~~~L~ 79 (93)
T PF02269_consen 37 IELCQEAMEVAQRRGSKKIKVEDLLFLLRKDPKKLARLRELLS 79 (93)
T ss_dssp HHHHHHHHC----------------------------------
T ss_pred HHHHHHHHHHHhccccCcCcHHHHHHHHhcCHHHHHHHHHHHH
Confidence 3344444445666778899999999999999988765554444
No 114
>PF12872 OST-HTH: OST-HTH/LOTUS domain; PDB: 2KPM_A 3S93_B 3RCO_A 2KZV_A.
Probab=56.51 E-value=34 Score=19.15 Aligned_cols=51 Identities=18% Similarity=0.055 Sum_probs=30.9
Q ss_pred HHHHHHHHHHhhcCCCC--CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHH
Q psy8460 5 RSLMYLGRNSLISHPQD--EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQS 55 (86)
Q Consensus 5 eEm~~vL~~~l~~~~~~--~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~ 55 (86)
+++...++..+....++ .++-.++.....+.+..+|...=|.-++.+|++.
T Consensus 4 ~~~~~~l~~ll~~~~~~~g~v~ls~l~~~~~~~~~~f~~~~yG~~~l~~ll~~ 56 (74)
T PF12872_consen 4 EELKKLLRELLESQKGEDGWVSLSQLGQEYKKKYPDFDPRDYGFSSLSELLES 56 (74)
T ss_dssp HHHHHHHHHHHHHTCTTTSSEEHHHHHHHHHHHHTT--TCCTTSSSHHHHHHT
T ss_pred HHHHHHHHHHHHhCcCCCceEEHHHHHHHHHHHCCCCCccccCCCcHHHHHHh
Confidence 44555555544222222 4666667666667777888888888888888864
No 115
>PRK01655 spxA transcriptional regulator Spx; Reviewed
Probab=56.49 E-value=23 Score=22.80 Aligned_cols=29 Identities=31% Similarity=0.560 Sum_probs=22.7
Q ss_pred HHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
.+++.+.+. -.+|.+|.+..|..+|.+++
T Consensus 63 ~~k~l~~~~-~~ls~~e~i~ll~~~p~Lik 91 (131)
T PRK01655 63 VFQKLNVDV-ESLSLQDLIKLISDNPGLLR 91 (131)
T ss_pred HHHhCCCCc-ccCCHHHHHHHHHhCcceEe
Confidence 356666654 35889999999999999975
No 116
>PF05042 Caleosin: Caleosin related protein; InterPro: IPR007736 This family contains plant proteins related to caleosin. Caleosins contain calcium-binding domains and have an oleosin-like association with lipid bodies. Caleosins are present at relatively low levels and are mainly bound to microsomal membrane fractions at the early stages of seed development. As the seeds mature, overall levels of caleosins increased dramatically and they were associated almost exclusively with storage lipid bodies []. The calcium binding domain is probably related to the calcium-binding EF-hands motif IPR002048 from INTERPRO.
Probab=55.73 E-value=18 Score=25.10 Aligned_cols=33 Identities=6% Similarity=0.033 Sum_probs=27.6
Q ss_pred HHHHHHHhcCCCCCCcccHHHHHHHHhCChHHH
Q psy8460 31 LVELALRKMDYDKDGKISFQDFQQSVTDEPLLL 63 (86)
Q Consensus 31 ~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~ 63 (86)
-.+.||.+.+..+.+.+++.|..+++..+-+..
T Consensus 97 kFe~iF~kya~~~~d~LT~~E~~~m~~~nr~~~ 129 (174)
T PF05042_consen 97 KFEEIFSKYAKTGPDALTLRELWRMLKGNRNAN 129 (174)
T ss_pred HHHHHHHHhCCCCCCCcCHHHHHHHHHhccccC
Confidence 355679999999999999999999999866543
No 117
>PF09069 EF-hand_3: EF-hand; InterPro: IPR015154 Like other EF hand domains, this domain forms a helix-loop-helix motif, though since it does not contain the canonical pattern of calcium binding residues found in many EF hand domains, it does not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (IPR001202 from INTERPRO), enhancing dystroglycan binding []. ; PDB: 1EG3_A 1EG4_A.
Probab=55.53 E-value=38 Score=20.87 Aligned_cols=59 Identities=17% Similarity=0.122 Sum_probs=30.6
Q ss_pred CCHHHHHHHHHHHh--hcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHH
Q psy8460 2 GQKRSLMYLGRNSL--ISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLL 62 (86)
Q Consensus 2 ~sreEm~~vL~~~l--~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~ 62 (86)
+++.-+...|+..+ +...||..+--.++.-|...|... .+.-.|+.++|+..+..+|-.
T Consensus 19 ~~~~~l~~lL~d~lqip~~vgE~~aFg~~e~sv~sCF~~~--~~~~~I~~~~Fl~wl~~ePq~ 79 (90)
T PF09069_consen 19 MDQRKLGLLLHDVLQIPRAVGEGPAFGYIEPSVRSCFQQV--QLSPKITENQFLDWLMSEPQS 79 (90)
T ss_dssp B-HHHHHHHHHHHHHHHHHTT-GGGGT--HHHHHHHHHHT--TT-S-B-HHHHHHHHHT--TT
T ss_pred CcHHHHHHHHHHHHHHHHHhCccccccCcHHHHHHHhccc--CCCCccCHHHHHHHHHhCCCe
Confidence 34555555555433 111223222222556677888876 367789999999999998853
No 118
>PF08269 Cache_2: Cache domain; InterPro: IPR013163 Cache is an extracellular domain that is predicted to have a role in small-molecule recognition in a wide range of proteins, including the animal dihydropyridine-sensitive voltage-gated Ca2+ channel; alpha-2delta subunit, and various bacterial chemotaxis receptors. The name Cache comes from CAlcium channels and CHEmotaxis receptors. This domain consists of an N-terminal part with three predicted strands and an alpha-helix, and a C-terminal part with a strand dyad followed by a relatively unstructured region. The N-terminal portion of the (unpermuted) Cache domain contains three predicted strands that could form a sheet analogous to that present in the core of the PAS domain structure. Cache domains are particularly widespread in bacteria, with Vibrio cholerae. The animal calcium channel alpha-2delta subunits might have acquired a part of their extracellular domains from a bacterial source []. The Cache domain appears to have arisen from the GAF-PAS fold despite their divergent functions []. This entry is composed of the type 2 Cache domain.; PDB: 2QHK_A 4EXO_A.
Probab=54.80 E-value=21 Score=21.22 Aligned_cols=27 Identities=26% Similarity=0.396 Sum_probs=15.8
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCcc
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKI 47 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~I 47 (86)
.++|+++++..+...++.+..+++|.+
T Consensus 33 g~ls~eea~~~a~~~l~~~r~~~~gY~ 59 (95)
T PF08269_consen 33 GKLSEEEAQQQAREALRALRYGGDGYF 59 (95)
T ss_dssp T-----TTHHHHHHHHHH--SBTTB--
T ss_pred CCccHHHHHHHHHHHHhccccCCCCeE
Confidence 368999999999999999999888865
No 119
>PF04876 Tenui_NCP: Tenuivirus major non-capsid protein; InterPro: IPR006960 This entry contains the tenuivirus major non-capsid protein. Proteins accumulate in large amounts in tenuivirus infected cells. They are found in the inclusion bodies that are formed after infection [].
Probab=53.96 E-value=66 Score=22.14 Aligned_cols=43 Identities=7% Similarity=0.208 Sum_probs=34.2
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHh
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~ 66 (86)
..+|.+|+..+++.+++-+-.++ +.=|+|.....+-|.....+
T Consensus 126 n~MSk~Qik~L~~~Ii~~akae~---~dtE~Ye~vwkKmPaY~~ni 168 (175)
T PF04876_consen 126 NRMSKDQIKTLCEQIIEMAKAES---SDTEHYEKVWKKMPAYFSNI 168 (175)
T ss_pred chhhHHHHHHHHHHHHHHHhccC---CchHHHHHHHHHhhHHHHHH
Confidence 36999999999999999877654 34588888888888876544
No 120
>KOG1707|consensus
Probab=53.51 E-value=24 Score=28.96 Aligned_cols=39 Identities=26% Similarity=0.411 Sum_probs=30.9
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
++++..++ ++..+|..+|.|+||.++=.|+...-..-|.
T Consensus 308 ELs~~~~~-Fl~~~f~~~D~d~Dg~L~p~El~~LF~~~P~ 346 (625)
T KOG1707|consen 308 ELSPKGYR-FLVDVFEKFDRDNDGALSPEELKDLFSTAPG 346 (625)
T ss_pred eccHHHHH-HHHHHHHhccCCCCCCcCHHHHHHHhhhCCC
Confidence 46666555 6678999999999999999999877665554
No 121
>KOG1954|consensus
Probab=53.36 E-value=12 Score=29.58 Aligned_cols=24 Identities=25% Similarity=0.437 Sum_probs=22.0
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHH
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQ 53 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~ 53 (86)
.++.++++.+|.|.||.++-+||-
T Consensus 477 svlgkiwklad~d~dg~ld~eefa 500 (532)
T KOG1954|consen 477 SVLGKIWKLADIDKDGMLDDEEFA 500 (532)
T ss_pred hHHHhhhhhhcCCcccCcCHHHHH
Confidence 567889999999999999999996
No 122
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=53.13 E-value=5.4 Score=24.70 Aligned_cols=29 Identities=14% Similarity=0.314 Sum_probs=21.2
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++.+......+|-+|.+..|..+|.+++
T Consensus 60 ~k~l~~~~~~~~s~~e~i~~l~~~p~Lik 88 (110)
T PF03960_consen 60 YKELGKLKKDDLSDEELIELLLENPKLIK 88 (110)
T ss_dssp HHHTTHHHCTTSBHHHHHHHHHHSGGGB-
T ss_pred HhhhhhhhhhhhhhHHHHHHHHhChhhee
Confidence 45555223466899999999999999875
No 123
>PRK00199 ihfB integration host factor subunit beta; Reviewed
Probab=51.94 E-value=17 Score=21.94 Aligned_cols=45 Identities=18% Similarity=0.209 Sum_probs=29.6
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
++.||...+.... ..+|..++..+++.+++.+ -+..++.|.+..|
T Consensus 2 tk~eli~~ia~~~-----~~~s~~~~~~vv~~~~~~i~~~L~~g~~V~l~gf 48 (94)
T PRK00199 2 TKSELIERLAARN-----PHLSAKDVENAVKEILEEMSDALARGDRIEIRGF 48 (94)
T ss_pred CHHHHHHHHHHHc-----CCCCHHHHHHHHHHHHHHHHHHHHcCCeEEEcCC
Confidence 6788877776522 1378888888887777744 3455666766655
No 124
>KOG4347|consensus
Probab=50.47 E-value=19 Score=29.78 Aligned_cols=38 Identities=21% Similarity=0.277 Sum_probs=30.1
Q ss_pred HHHHHHHHHhcCCCCCCcccHHHHHHHHhC--ChHHHHHh
Q psy8460 29 RDLVELALRKMDYDKDGKISFQDFQQSVTD--EPLLLEAF 66 (86)
Q Consensus 29 ~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~--~p~~~~~~ 66 (86)
..++.++|+..|.+++|.|+|.+++.++.. .-+.++.+
T Consensus 554 ~~~~~rlF~l~D~s~~g~Ltf~~lv~gL~~l~~~~~~ek~ 593 (671)
T KOG4347|consen 554 LIFLERLFRLLDDSMTGLLTFKDLVSGLSILKAGDALEKL 593 (671)
T ss_pred HHHHHHHHHhcccCCcceeEHHHHHHHHHHHHhhhHHHHH
Confidence 357789999999999999999999988763 44444433
No 125
>PF02885 Glycos_trans_3N: Glycosyl transferase family, helical bundle domain Prosite entry for Thymidine and pyrimidine-nucleoside phosphorylases; InterPro: IPR017459 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. The glycosyl transferase family includes anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase (2.4.2.2 from EC) catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism. This N-terminal domain is found in various family 3 glycosyl transferases, including anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism.; PDB: 2DSJ_B 2ELC_B 2BPQ_A 1ZVW_B 3QR9_B 1V8G_B 2WK5_C 2J0F_C 2WK6_B 1UOU_A ....
Probab=50.43 E-value=33 Score=19.35 Aligned_cols=28 Identities=7% Similarity=0.038 Sum_probs=15.2
Q ss_pred HHHHHHHHhhcCCCCCCChHHHHHHHHHHHH
Q psy8460 7 LMYLGRNSLISHPQDEDPDEGVRDLVELALR 37 (86)
Q Consensus 7 m~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~ 37 (86)
+.++++....+ ..+|.+|+..+++.++.
T Consensus 2 ~~~~l~~l~~g---~~Ls~~e~~~~~~~i~~ 29 (66)
T PF02885_consen 2 IKEILKKLRDG---EDLSREEAKAAFDAILD 29 (66)
T ss_dssp HHHHHHHHHTT-------HHHHHHHHHHHHT
T ss_pred HHHHHHHHHcC---CCCCHHHHHHHHHHHHc
Confidence 34556664433 57888888887776654
No 126
>CHL00091 apcE phycobillisome linker protein
Probab=49.93 E-value=78 Score=27.22 Aligned_cols=31 Identities=19% Similarity=0.413 Sum_probs=25.3
Q ss_pred CCCcccHHHHHHHHhCChHHHHHhcccCCCh
Q psy8460 43 KDGKISFQDFQQSVTDEPLLLEAFGQCLPSD 73 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~~ 73 (86)
.+|.||-.||++.+.+++.+.+.|=...+..
T Consensus 304 rnG~IsVReFIR~LakS~~Yr~~f~~~~~~~ 334 (877)
T CHL00091 304 KNGQISIKEFIRALGKSEIYRKQFYEPFVNS 334 (877)
T ss_pred hcCCccHHHHHHHHhccHHHHHHhccCCCcc
Confidence 5899999999999999999987765544443
No 127
>PF07492 Trehalase_Ca-bi: Neutral trehalase Ca2+ binding domain; InterPro: IPR011120 Neutral trehalases mobilise trehalose accumulated by fungal cells as a protective and storage carbohydrate. This family represents a calcium-binding domain similar to EF hand. Residues 97 and 108 in O42893 from SWISSPROT have been implicated in this interaction. It is thought that this domain may provide a general mechanism for regulating neutral trehalase activity in yeasts and filamentous fungi [].; GO: 0004555 alpha,alpha-trehalase activity, 0005509 calcium ion binding, 0005993 trehalose catabolic process, 0005737 cytoplasm
Probab=49.57 E-value=4.1 Score=20.42 Aligned_cols=18 Identities=28% Similarity=0.342 Sum_probs=14.6
Q ss_pred HHHHhcCCCCCCcccHHH
Q psy8460 34 LALRKMDYDKDGKISFQD 51 (86)
Q Consensus 34 ~i~~~~D~d~DG~Is~eE 51 (86)
.++..-|.|+|-.|+.++
T Consensus 3 ~LL~qEDTDgn~qITIeD 20 (30)
T PF07492_consen 3 SLLEQEDTDGNFQITIED 20 (30)
T ss_pred hHhhccccCCCcEEEEec
Confidence 456778999999998765
No 128
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=49.22 E-value=30 Score=21.45 Aligned_cols=30 Identities=10% Similarity=0.299 Sum_probs=21.1
Q ss_pred HHHhcCCCCC-CcccHHHHHHHHhCChHHHH
Q psy8460 35 ALRKMDYDKD-GKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 35 i~~~~D~d~D-G~Is~eEF~~~l~~~p~~~~ 64 (86)
.+++.+.+.. ..++-+|.++.|..+|.+++
T Consensus 62 ~~~~l~~~~~~~~~s~~e~~~~l~~~p~Lik 92 (111)
T cd03036 62 SYRELGLKDKLPSLSEEEALELLSSDGMLIK 92 (111)
T ss_pred hHHhCCcccccccCCHHHHHHHHHhCcCeee
Confidence 3455555432 24688999999999998875
No 129
>KOG3866|consensus
Probab=49.05 E-value=30 Score=26.64 Aligned_cols=49 Identities=18% Similarity=0.136 Sum_probs=31.8
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.||++.+..|...+= ++-.....++ -.|.-.|.|+||.++-.|......
T Consensus 223 PGSkdQLkEVWEE~D-gLdpn~fdPK-------TFF~LHD~NsDGfldeqELEaLFt 271 (442)
T KOG3866|consen 223 PGSKDQLKEVWEESD-GLDPNQFDPK-------TFFALHDLNSDGFLDEQELEALFT 271 (442)
T ss_pred CCcHHHHHHHHHHhc-CCCcccCCcc-------hheeeeccCCcccccHHHHHHHHH
Confidence 378888888877632 2111233444 247788999999999888765544
No 130
>PF12674 Zn_ribbon_2: Putative zinc ribbon domain
Probab=47.13 E-value=47 Score=19.95 Aligned_cols=34 Identities=9% Similarity=-0.056 Sum_probs=19.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALR 37 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~ 37 (86)
+|.+||.......+.. .+ .+++++++.++...|.
T Consensus 42 ~t~eemie~~~~~~~~-~~-~~~~~~a~~~~~~~lp 75 (81)
T PF12674_consen 42 ITMEEMIEFCVPFMDE-FN-GMTPEEARKMMPRYLP 75 (81)
T ss_pred CCHHHHHHHHHHHHHH-hC-CCCHHHHHHHHHHHcc
Confidence 4667777776665544 11 2677766666555443
No 131
>KOG0169|consensus
Probab=45.62 E-value=33 Score=28.84 Aligned_cols=31 Identities=16% Similarity=0.364 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 26 EGVRDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 26 ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
+..+.++..+|+++|.+++|.+++.+=...+
T Consensus 132 ~~~~~wi~~~~~~ad~~~~~~~~~~~~~~~~ 162 (746)
T KOG0169|consen 132 SRREHWIHSIFQEADKNKNGHMSFDEVLDLL 162 (746)
T ss_pred chHHHHHHHHHHHHccccccccchhhHHHHH
Confidence 4455789999999999999999998855444
No 132
>CHL00091 apcE phycobillisome linker protein
Probab=45.35 E-value=22 Score=30.31 Aligned_cols=35 Identities=14% Similarity=0.193 Sum_probs=29.6
Q ss_pred CCCcccHHHHHHHHhCChHHHHHhcccCCChHHHH
Q psy8460 43 KDGKISFQDFQQSVTDEPLLLEAFGQCLPSDAARQ 77 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~~~~~~ 77 (86)
.+|.||-.||++.+.++..+.+.|-..++.++.+.
T Consensus 552 ~nG~IsvREFIR~LakS~~fr~~f~~~~~~~k~IE 586 (877)
T CHL00091 552 RRRQISVREFVRQLAKSSVFRSLYWSPLYICKAIE 586 (877)
T ss_pred hcCCccHHHHHHHHhccHHHHHhhccccCccchhh
Confidence 58999999999999999999988777777765553
No 133
>PF09312 SurA_N: SurA N-terminal domain; InterPro: IPR015391 The correct folding of outer membrane proteins in Gram negative bacteria is facilitated by the survival protein SurA []. This entry represents the domain found at the N terminus of the chaperone SurA. It is a helical domain of unknown function. The C terminus of the SurA protein folds back and forms part of this domain also but is not included in the current alignment. ; PDB: 3RGC_B 2PV3_B 1M5Y_A.
Probab=44.72 E-value=27 Score=21.91 Aligned_cols=35 Identities=14% Similarity=0.234 Sum_probs=18.2
Q ss_pred CCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 19 PQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 19 ~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
.|-..|+++++..++.+.+... +|.++|.+.+...
T Consensus 63 ~gI~vsd~evd~~i~~ia~~n~------ls~~ql~~~L~~~ 97 (118)
T PF09312_consen 63 LGIKVSDEEVDEAIANIAKQNN------LSVEQLRQQLEQQ 97 (118)
T ss_dssp CT----HHHHHHHHHHHHHHTT--------HHHHHHHCHHC
T ss_pred cCCCCCHHHHHHHHHHHHHHcC------CCHHHHHHHHHHc
Confidence 4456788888888877766432 3555555555543
No 134
>PF07553 Lipoprotein_Ltp: Host cell surface-exposed lipoprotein; InterPro: IPR011434 This domain is found as 1-3 copies in a small family of proteins of unknown function.
Probab=44.31 E-value=42 Score=18.31 Aligned_cols=28 Identities=14% Similarity=0.006 Sum_probs=17.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVE 33 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd 33 (86)
.|++.|...|... .|+..|+++++.-|+
T Consensus 19 ~Sk~~l~~QL~se----~ge~Ft~e~A~YAv~ 46 (48)
T PF07553_consen 19 MSKQGLYDQLTSE----YGEGFTEEEAQYAVD 46 (48)
T ss_pred CCHHHHHHHHHhh----cccCCCHHHHHHHHH
Confidence 3666666666552 456788887775544
No 135
>PF09435 DUF2015: Fungal protein of unknown function (DUF2015); InterPro: IPR018559 This entry represents uncharacterised proteins found in fungi.
Probab=43.69 E-value=79 Score=20.92 Aligned_cols=37 Identities=19% Similarity=0.215 Sum_probs=25.0
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHH-HHHHHHHHhcCCCCCCc
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVR-DLVELALRKMDYDKDGK 46 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~-~~vd~i~~~~D~d~DG~ 46 (86)
+++|++.|++.. .++-+|++ ...++.|++-+.+.||.
T Consensus 86 ak~EI~~IM~~~-------~v~FDeARliy~~~~f~~NgI~pdG~ 123 (128)
T PF09435_consen 86 AKREIRRIMKRR-------RVNFDEARLIYTERRFKKNGIGPDGR 123 (128)
T ss_pred HHHHHHHHHHHc-------CCCHHHHHHHHHHHHHHHcCCCCCCC
Confidence 467888887762 35544443 45677888888888884
No 136
>smart00708 PhBP Insect pheromone/odorant binding protein domains.
Probab=43.15 E-value=49 Score=19.16 Aligned_cols=36 Identities=25% Similarity=0.480 Sum_probs=25.3
Q ss_pred hHHHHHHHHHHHHhcC-CCCCCcccHHHHHHHHhCCh
Q psy8460 25 DEGVRDLVELALRKMD-YDKDGKISFQDFQQSVTDEP 60 (86)
Q Consensus 25 ~ee~~~~vd~i~~~~D-~d~DG~Is~eEF~~~l~~~p 60 (86)
+++..-++.=+++++. .|++|.|..+.+...+..+|
T Consensus 29 ~~~~kC~~~C~~~~~g~~~~~g~~~~~~~~~~~~~~~ 65 (103)
T smart00708 29 DEKVKCYIKCVLKKLGLVDDDGKFDAEKLLEQLKADD 65 (103)
T ss_pred CcccCcHHHHHHHHcCCcCCCCCcCHHHHHHHHHcCh
Confidence 4445556666777777 55579999999988887534
No 137
>COG0721 GatC Asp-tRNAAsn/Glu-tRNAGln amidotransferase C subunit [Translation, ribosomal structure and biogenesis]
Probab=43.11 E-value=60 Score=20.07 Aligned_cols=26 Identities=12% Similarity=0.086 Sum_probs=17.7
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHH
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDL 31 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~ 31 (86)
+++++++.++.+-.. -.+|+++++.+
T Consensus 2 ~i~~e~v~~la~Lar-----L~lseee~e~~ 27 (96)
T COG0721 2 AIDREEVKHLAKLAR-----LELSEEELEKF 27 (96)
T ss_pred ccCHHHHHHHHHHhh-----cccCHHHHHHH
Confidence 588999999877432 24777666644
No 138
>KOG0751|consensus
Probab=42.55 E-value=26 Score=28.55 Aligned_cols=27 Identities=48% Similarity=0.578 Sum_probs=20.6
Q ss_pred HhcCCCCCCcccHHHHHH--HHhCChHHH
Q psy8460 37 RKMDYDKDGKISFQDFQQ--SVTDEPLLL 63 (86)
Q Consensus 37 ~~~D~d~DG~Is~eEF~~--~l~~~p~~~ 63 (86)
.-+|..+||-|||+||.. .+...|+.+
T Consensus 81 ~iaD~tKDglisf~eF~afe~~lC~pDal 109 (694)
T KOG0751|consen 81 SIADQTKDGLISFQEFRAFESVLCAPDAL 109 (694)
T ss_pred hhhhhcccccccHHHHHHHHhhccCchHH
Confidence 467899999999999974 455666654
No 139
>PF09966 DUF2200: Uncharacterized protein conserved in bacteria (DUF2200); InterPro: IPR014580 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 3C9P_A.
Probab=42.40 E-value=55 Score=21.11 Aligned_cols=41 Identities=20% Similarity=0.260 Sum_probs=20.4
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHH-hCCh
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSV-TDEP 60 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l-~~~p 60 (86)
|++|+-+|+.= +++ .++++++.. -+..++|++|..-. .-||
T Consensus 24 t~~Evd~vi~W-LTG-----y~~~~l~~~-----------~~~~~~~~~FF~~AP~lNp 65 (111)
T PF09966_consen 24 TKEEVDQVIRW-LTG-----YDQEELQAQ-----------IESKVTFETFFAQAPALNP 65 (111)
T ss_dssp -HHHHHHHHHH-HH--------HHHHHHH-----------TTS--BHHHHHHT-S---G
T ss_pred CHHHHHHHHHH-Hhc-----CCHHHHHHH-----------HHcCCCHHHHHHHccccCc
Confidence 67777777764 555 667765543 23458899987654 3344
No 140
>PF12213 Dpoe2NT: DNA polymerases epsilon N terminal; InterPro: IPR024639 This domain is found in the N-terminal of DNA polymerase epsilon subunit B proteins. It contains a single completely conserved phenylalanine residue that may be functionally important. It forms a primarily alpha helical structure in which four helices are arranged in two hairpins with connecting loops containing beta strands which form a short parallel sheet. DNA polymerase epsilon is required in DNA replication for synthesis of the leading strand. The N-terminal domain has close structural relation to AAA+ protein C-terminal domains [].; PDB: 2V6Z_M.
Probab=42.12 E-value=57 Score=19.24 Aligned_cols=46 Identities=9% Similarity=0.083 Sum_probs=25.5
Q ss_pred HHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCC--CCcccHHHHHHHH
Q psy8460 6 SLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDK--DGKISFQDFQQSV 56 (86)
Q Consensus 6 Em~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~--DG~Is~eEF~~~l 56 (86)
|-..++...+. ++++.+.++++++++...+..+ +..|+.+.+..++
T Consensus 23 eA~~~l~~~l~-----~~~~~e~e~~l~~Ii~~v~kq~Lss~~v~~~~ie~Av 70 (73)
T PF12213_consen 23 EASKYLAEQLQ-----SLSEEEREDWLDKIIDAVQKQPLSSSIVDKEHIESAV 70 (73)
T ss_dssp HHHHHHHHHTT-----TS-TTTHHHHHHHHHHHHTTS--SSSEE-HHHHHHHH
T ss_pred HHHHHHHHHHc-----cCCHHHHHHHHHHHHHHHhcCCCCCCccCHHHHHHHH
Confidence 44444444442 3567777888888888776542 4455555554443
No 141
>PLN02952 phosphoinositide phospholipase C
Probab=41.47 E-value=1.3e+02 Score=24.79 Aligned_cols=54 Identities=9% Similarity=-0.015 Sum_probs=31.3
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcC---CCCCCcccHHHHHHHHhCC
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMD---YDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D---~d~DG~Is~eEF~~~l~~~ 59 (86)
|.+++...|+..... ...+.++++.++++++.... ..+.+.++++.|...+...
T Consensus 55 t~~~l~~FL~~~Q~e---~~~~~~~~~~i~~~~~~~~~~~~~~~~~~l~~~~F~~~l~s~ 111 (599)
T PLN02952 55 GADQLRRFLVLHQDE---LDCTLAEAQRIVEEVINRRHHVTRYTRHGLNLDDFFHFLLYD 111 (599)
T ss_pred CHHHHHHHHHHhCCC---cCCCHHHHHHHHHHHHhhccccccccccCcCHHHHHHHHcCc
Confidence 445555555542211 23566667777766655422 1234568999999999843
No 142
>TIGR01616 nitro_assoc nitrogenase-associated protein. This model describes a small family of uncharacterized proteins found so far in alpha and gamma proteobacteria and in Nostoc sp. PCC 7120, a cyanobacterium. The gene for this protein is associated with nitrogenase genes. This family shows sequence similarity to TIGR00014, a glutaredoxin-dependent arsenate reductase that converts arsentate to arsenite for disposal. This family is one of several included in Pfam model pfam03960.
Probab=40.83 E-value=63 Score=20.82 Aligned_cols=28 Identities=14% Similarity=0.273 Sum_probs=21.2
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
+++.+.+. ..+|-+|.+..|..+|.+++
T Consensus 63 ~r~l~~~~-~~ls~~e~i~lm~~~P~LIK 90 (126)
T TIGR01616 63 VKSGEVNP-DSIDEASALALMVSDPLLIR 90 (126)
T ss_pred hhhCCCCc-ccCCHHHHHHHHHhCcCeEe
Confidence 44555443 45788999999999999885
No 143
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=40.79 E-value=41 Score=19.72 Aligned_cols=44 Identities=16% Similarity=0.083 Sum_probs=26.3
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+..-|..++.+. +.+..... +...++.=+.++|+-+||++.++.
T Consensus 10 ~F~~L~~~l~~~--------l~~~~~~~----l~~~Y~~~k~~kIsR~~fvr~lR~ 53 (70)
T PF12174_consen 10 PFPMLFSALSKH--------LPPSKMDL----LQKHYEEFKKKKISREEFVRKLRQ 53 (70)
T ss_pred cHHHHHHHHHHH--------CCHHHHHH----HHHHHHHHHHCCCCHHHHHHHHHH
Confidence 444556666653 33333332 234444446899999999988774
No 144
>PF09832 DUF2059: Uncharacterized protein conserved in bacteria (DUF2059); InterPro: IPR018637 This entry contains proteins that have no known function. ; PDB: 2X3O_B 3OAO_A.
Probab=39.85 E-value=42 Score=18.54 Aligned_cols=26 Identities=4% Similarity=0.023 Sum_probs=17.8
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLV 32 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~v 32 (86)
++++|...+...+.. .+|++|++.++
T Consensus 1 d~~~~~~~~~~~y~~----~ft~~El~~i~ 26 (64)
T PF09832_consen 1 DPEKMIDQMAPIYAE----HFTEEELDAIL 26 (64)
T ss_dssp -HHHHHHHHHHHHHH----HS-HHHHHHHH
T ss_pred CHHHHHHHHHHHHHH----HCCHHHHHHHH
Confidence 356778888877754 58999888765
No 145
>COG0776 HimA Bacterial nucleoid DNA-binding protein [DNA replication, recombination, and repair]
Probab=39.67 E-value=93 Score=19.32 Aligned_cols=32 Identities=13% Similarity=0.043 Sum_probs=24.5
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHh
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRK 38 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~ 38 (86)
|.++.||.+.+-... .++..++...|+.+|..
T Consensus 1 ~mtKseli~~ia~~~------~l~k~~a~~~v~~~~~~ 32 (94)
T COG0776 1 TMTKSELIDAIAEKA------GLSKKDAEEAVDAFLEE 32 (94)
T ss_pred CCCHHHHHHHHHHHc------CCCHHHHHHHHHHHHHH
Confidence 578999999888732 38888888888877764
No 146
>KOG0046|consensus
Probab=39.45 E-value=68 Score=26.32 Aligned_cols=35 Identities=20% Similarity=0.422 Sum_probs=27.8
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.+|.+|+.++- .-|.+.| |++|.|+..|....+.+
T Consensus 12 ~~tq~El~~l~-~kF~~~d-~~~G~v~~~~l~~~f~k 46 (627)
T KOG0046|consen 12 QLTQEELRELK-EKFNKLD-DQKGYVTVYELPDAFKK 46 (627)
T ss_pred cccHHHHHHHH-HHHHhhc-CCCCeeehHHhHHHHHH
Confidence 58888888765 4567788 99999999998877765
No 147
>COG1460 Uncharacterized protein conserved in archaea [Function unknown]
Probab=39.24 E-value=1e+02 Score=20.02 Aligned_cols=30 Identities=10% Similarity=0.071 Sum_probs=20.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELAL 36 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~ 36 (86)
.|++|++.|+..- +-.+|+++++.+++-+.
T Consensus 80 ~t~~ElRsIla~e-----~~~~s~E~l~~Ildiv~ 109 (114)
T COG1460 80 RTPDELRSILAKE-----RVMLSDEELDKILDIVD 109 (114)
T ss_pred CCHHHHHHHHHHc-----cCCCCHHHHHHHHHHHH
Confidence 4778888887752 13578888888876543
No 148
>TIGR01617 arsC_related transcriptional regulator, Spx/MgsR family. This model represents a portion of the proteins within the larger set covered by Pfam model pfam03960. That larger family includes a glutaredoxin-dependent arsenate reductase (TIGR00014). Characterized members of this family include Spx and MgsR from Bacillus subtili. Spx is a global regulator for response to thiol-specific oxidative stress. It interacts with RNA polymerase. MgsR (modulator of the general stress response, also called YqgZ) provides a second level of regulation for more than a third of the proteins in the B. subtilis general stress regulon controlled by Sigma-B.
Probab=38.82 E-value=18 Score=22.61 Aligned_cols=20 Identities=20% Similarity=0.320 Sum_probs=17.6
Q ss_pred CcccHHHHHHHHhCChHHHH
Q psy8460 45 GKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 45 G~Is~eEF~~~l~~~p~~~~ 64 (86)
..+|.+|++..|..+|.+++
T Consensus 73 ~~ls~~e~~~~i~~~p~Lik 92 (117)
T TIGR01617 73 LDLSDKEALELLAEDPALLR 92 (117)
T ss_pred ccCCHHHHHHHHHhCcceEe
Confidence 56899999999999999874
No 149
>COG1715 Mrr Restriction endonuclease [Defense mechanisms]
Probab=38.34 E-value=74 Score=24.03 Aligned_cols=22 Identities=23% Similarity=0.226 Sum_probs=18.5
Q ss_pred CCChHHHHHHHHHHHHhcCCCC
Q psy8460 22 EDPDEGVRDLVELALRKMDYDK 43 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~ 43 (86)
.+|+...++++.+++..+...|
T Consensus 164 ~~sp~~Fe~lvvdvl~rmGYgg 185 (308)
T COG1715 164 KLSPAFFEELVVDVLERMGYGG 185 (308)
T ss_pred hcCHHHHHHHHHHHHHHhcCCc
Confidence 4789999999999999887763
No 150
>PF09682 Holin_LLH: Phage holin protein (Holin_LLH); InterPro: IPR010026 This entry represents the Bacteriophage LL-H, Orf107, holin protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This protein family represent one of a large number of mutually dissimilar families of phage holins. It is thought that the temporal precision of holin-mediated lysis may occur through the build-up of a holin oligomer which causes the lysis [].
Probab=38.00 E-value=1e+02 Score=19.20 Aligned_cols=33 Identities=18% Similarity=0.240 Sum_probs=20.8
Q ss_pred HHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcC
Q psy8460 7 LMYLGRNSLISHPQDEDPDEGVRDLVELALRKMD 40 (86)
Q Consensus 7 m~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D 40 (86)
-.+.+...+.. .|-++|+++++..++....++.
T Consensus 73 A~~~v~~~L~~-~gi~~t~~~i~~~IEaAV~~m~ 105 (108)
T PF09682_consen 73 AVQYVKERLKK-KGIKVTDEQIEGAIEAAVKEMN 105 (108)
T ss_pred HHHHHHHHHHH-cCCCCCHHHHHHHHHHHHHHHh
Confidence 34455555544 4557888888888777666553
No 151
>cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine.
Probab=37.13 E-value=77 Score=19.90 Aligned_cols=28 Identities=11% Similarity=0.214 Sum_probs=20.7
Q ss_pred HHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 36 LRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 36 ~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
++..+.+. ..+|-+|.+..|..+|.+++
T Consensus 62 ~r~~~~~~-~~ls~~e~~~ll~~~P~Lik 89 (113)
T cd03033 62 VKSGEVVP-EALDEEEALALMIADPLLIR 89 (113)
T ss_pred HHhcCCCc-cCCCHHHHHHHHHhCcceee
Confidence 44444433 45789999999999999885
No 152
>TIGR00135 gatC glutamyl-tRNA(Gln) and/or aspartyl-tRNA(Asn) amidotransferase, C subunit. This model has been revised to remove the candidate sequence from Methanococcus jannaschii, now part of a related model.
Probab=37.04 E-value=87 Score=18.74 Aligned_cols=26 Identities=12% Similarity=0.037 Sum_probs=16.4
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLV 32 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~v 32 (86)
+|++++.++.+-.. -.+++++++.+.
T Consensus 1 i~~~~v~~lA~La~-----L~l~eee~~~~~ 26 (93)
T TIGR00135 1 ISDEEVKHLAKLAR-----LELSEEEAESFA 26 (93)
T ss_pred CCHHHHHHHHHHhC-----CCCCHHHHHHHH
Confidence 57888888776432 247777765443
No 153
>PF09424 YqeY: Yqey-like protein; InterPro: IPR019004 Putative protein of unknown function; the authentic protein is detected in highly purified mitochondria in high-throughput studies; YOR215C is not an essential gene. ; PDB: 1NG6_A.
Probab=36.92 E-value=41 Score=22.26 Aligned_cols=22 Identities=18% Similarity=0.324 Sum_probs=15.0
Q ss_pred CCCChHHHHHHHHHHHHhcCCC
Q psy8460 21 DEDPDEGVRDLVELALRKMDYD 42 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d 42 (86)
..+|+++++.+|+.++.+.+..
T Consensus 89 ~~lseeEi~~~v~~~i~e~ga~ 110 (143)
T PF09424_consen 89 KQLSEEEIEAIVEEAIAELGAS 110 (143)
T ss_dssp ----HHHHHHHHHHHHHHTT--
T ss_pred CCCCHHHHHHHHHHHHHHhCCC
Confidence 4799999999999999988654
No 154
>PF11116 DUF2624: Protein of unknown function (DUF2624); InterPro: IPR020277 This entry contains proteins with no known function.
Probab=35.63 E-value=1.1e+02 Score=18.84 Aligned_cols=31 Identities=10% Similarity=0.063 Sum_probs=21.9
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRK 38 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~ 38 (86)
+|.+|++++-+. ++-++|+++++.++ .+++.
T Consensus 15 iT~~eLlkyskq-----y~i~it~~QA~~I~-~~lr~ 45 (85)
T PF11116_consen 15 ITAKELLKYSKQ-----YNISITKKQAEQIA-NILRG 45 (85)
T ss_pred CCHHHHHHHHHH-----hCCCCCHHHHHHHH-HHHhc
Confidence 577888887776 33579999988776 45553
No 155
>TIGR00987 himA integration host factor, alpha subunit. This protein forms a site-specific DNA-binding heterodimer with the integration host factor beta subunit. It is closely related to the DNA-binding protein HU.
Probab=35.44 E-value=47 Score=20.07 Aligned_cols=45 Identities=16% Similarity=0.242 Sum_probs=30.2
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
.++.||.+.+.... .++..+++.+++.++..+ -+...+.|.+..|
T Consensus 2 mtk~eli~~ia~~~------~~s~~~v~~vv~~~~~~i~~~L~~g~~V~l~gf 48 (96)
T TIGR00987 2 LTKAEMSEYLFDEL------GLSKREAKELVELFFEEIRRALENGEQVKLSGF 48 (96)
T ss_pred CCHHHHHHHHHHHh------CcCHHHHHHHHHHHHHHHHHHHHcCCeEEecCC
Confidence 47788887776632 378888888887777754 3445566766655
No 156
>PF10955 DUF2757: Protein of unknown function (DUF2757); InterPro: IPR020115 This entry contains proteins with no known function.
Probab=34.93 E-value=40 Score=20.30 Aligned_cols=39 Identities=21% Similarity=0.302 Sum_probs=27.5
Q ss_pred CCCChHHHHHHHHHHHHhcCCCCCCccc--HHHHHHHHhCChHHHH
Q psy8460 21 DEDPDEGVRDLVELALRKMDYDKDGKIS--FQDFQQSVTDEPLLLE 64 (86)
Q Consensus 21 ~~ls~ee~~~~vd~i~~~~D~d~DG~Is--~eEF~~~l~~~p~~~~ 64 (86)
..||++|.++|| ..|.+||=.|. .+.=..++..+|++.+
T Consensus 30 ~~Lt~eEr~dmI-----~~~~~G~i~V~~iCe~C~eaL~~nPe~~~ 70 (76)
T PF10955_consen 30 HHLTPEERQDMI-----SYDENGDIHVKVICEDCQEALERNPEYHE 70 (76)
T ss_pred ccCCHHHHhhhe-----EEcCCCCEEEEEecHHHHHHHHhCcchhh
Confidence 368999988874 45555554333 6777788999999875
No 157
>KOG2243|consensus
Probab=34.82 E-value=27 Score=32.48 Aligned_cols=26 Identities=27% Similarity=0.485 Sum_probs=22.8
Q ss_pred HHHhcCCCCCCcccHHHHHHHHhCCh
Q psy8460 35 ALRKMDYDKDGKISFQDFQQSVTDEP 60 (86)
Q Consensus 35 i~~~~D~d~DG~Is~eEF~~~l~~~p 60 (86)
.|++.|.||.|-||..+|-++|....
T Consensus 4062 tfkeydpdgkgiiskkdf~kame~~k 4087 (5019)
T KOG2243|consen 4062 TFKEYDPDGKGIISKKDFHKAMEGHK 4087 (5019)
T ss_pred cchhcCCCCCccccHHHHHHHHhccc
Confidence 47899999999999999999998643
No 158
>PF01320 Colicin_Pyocin: Colicin immunity protein / pyocin immunity protein; InterPro: IPR023802 Bacterial colicin and pyocin immunity proteins [, ] can bind specifically to the DNase-type colicins and pyocins and inhibit their bactericidal activity. The 1.8-angstrom crystal structure of the ImmE7 protein consists of four antiparallel alpha-helices []. Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. Pyocin protects a cell that harbours the plasmid ColE2 encoding colicin E2 against colicin E2; it is thus essential both for autonomous replication and colicin E2 immunity []. This entry represents the structural domain of colicin and pyocin immunity proteins.; GO: 0015643 toxin binding, 0030153 bacteriocin immunity; PDB: 1GXH_A 1GXG_A 1MZ8_C 2ERH_A 1ZNV_C 1AYI_A 1UNK_A 2JBG_A 7CEI_A 1CEI_A ....
Probab=34.66 E-value=1.1e+02 Score=18.71 Aligned_cols=31 Identities=10% Similarity=0.148 Sum_probs=18.4
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELAL 36 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~ 36 (86)
|++|++.+|+.+... ...++++...+++.+-
T Consensus 11 TE~EFl~~v~~i~~~---~~~~ee~~d~lv~hF~ 41 (85)
T PF01320_consen 11 TESEFLEFVKEIFNA---ELKTEEEHDELVDHFE 41 (85)
T ss_dssp BHHHHHHHHHHHHHT---CSSSCHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHcC---CCCCHHHHHHHHHHHH
Confidence 677888888776643 1245555665554443
No 159
>PRK00034 gatC aspartyl/glutamyl-tRNA amidotransferase subunit C; Reviewed
Probab=34.29 E-value=1e+02 Score=18.33 Aligned_cols=25 Identities=12% Similarity=0.086 Sum_probs=16.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDL 31 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~ 31 (86)
+|++++.++.+-.. -.+++++++.+
T Consensus 3 i~~e~i~~la~La~-----l~l~~ee~~~~ 27 (95)
T PRK00034 3 ITREEVKHLAKLAR-----LELSEEELEKF 27 (95)
T ss_pred CCHHHHHHHHHHhC-----CCCCHHHHHHH
Confidence 78999988877532 24777665544
No 160
>PF10891 DUF2719: Protein of unknown function (DUF2719); InterPro: IPR020122 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf56; it is a family of uncharacterised viral proteins.
Probab=33.88 E-value=26 Score=21.41 Aligned_cols=21 Identities=14% Similarity=0.173 Sum_probs=15.6
Q ss_pred CCCcccHHHHHHHHhCChHHH
Q psy8460 43 KDGKISFQDFQQSVTDEPLLL 63 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~~p~~~ 63 (86)
--|.|||+||+.+-.+-..++
T Consensus 32 APmSIS~eeY~~LH~~fN~i~ 52 (81)
T PF10891_consen 32 APMSISFEEYIRLHIKFNEIF 52 (81)
T ss_pred cccEeeHHHHHHHHHHHHHHH
Confidence 458899999998776654443
No 161
>COG4286 Uncharacterized conserved protein related to MYG1 family [Function unknown]
Probab=33.49 E-value=88 Score=23.56 Aligned_cols=64 Identities=19% Similarity=0.312 Sum_probs=42.1
Q ss_pred CCCCChHHHHHHHHHHHH----hcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCC-hHHHHHHHHHhh
Q psy8460 20 QDEDPDEGVRDLVELALR----KMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPS-DAARQSFLSTLQ 84 (86)
Q Consensus 20 ~~~ls~ee~~~~vd~i~~----~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~-~~~~~~~~~~~~ 84 (86)
+.+.|++.++.+.+.+-. -+|...||...|.+-.- ...-|+++++|.++-.+ ..+-+.|+.-|+
T Consensus 103 ~~~~~~~~~~~l~e~vy~~fv~~~Da~DNG~~~~~~i~p-f~sl~~iv~~~np~~~~e~~~d~~F~~al~ 171 (306)
T COG4286 103 GINRSEDDLETLYEKVYNTFVLGVDAIDNGISIYGEIEP-FRSLPDIVESFNPDWSDEDADDEGFLEALA 171 (306)
T ss_pred CCccchhhHHHHHHHHHHhhhcccccccCccccccccCc-ccchHHHHHhcCCCcCCCCcchHHHHHHHH
Confidence 445677777666665543 67988899888855444 45589999999876664 233346665543
No 162
>KOG3555|consensus
Probab=33.13 E-value=13 Score=28.79 Aligned_cols=28 Identities=21% Similarity=0.282 Sum_probs=17.9
Q ss_pred HHHHHHhcCCCCCCcccHHHHHHHHhCC
Q psy8460 32 VELALRKMDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 32 vd~i~~~~D~d~DG~Is~eEF~~~l~~~ 59 (86)
|...|.-.|...||.||=.|+.....+.
T Consensus 284 ikpFfnsCD~~kDg~iS~~EWC~CF~k~ 311 (434)
T KOG3555|consen 284 IKPFFNSCDTYKDGSISTNEWCYCFQKS 311 (434)
T ss_pred HHHHHhhhcccccCccccchhhhhhccC
Confidence 4455666677777777777766665543
No 163
>PF03705 CheR_N: CheR methyltransferase, all-alpha domain; InterPro: IPR022641 CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the N-terminal domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF01739 from PFAM. Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region [].; PDB: 1AF7_A 1BC5_A.
Probab=32.60 E-value=61 Score=17.22 Aligned_cols=32 Identities=13% Similarity=0.237 Sum_probs=18.7
Q ss_pred hHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 25 DEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 25 ~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
...++..+...++..+ .=||.+|...+..+|+
T Consensus 25 ~~~l~rRl~~rm~~~~-----~~~~~~y~~~L~~d~~ 56 (57)
T PF03705_consen 25 RSLLERRLARRMRALG-----LPSFAEYYELLRSDPD 56 (57)
T ss_dssp HHHHHHHHHHHHHHHT--------HHHHHHHHHH-T-
T ss_pred HHHHHHHHHHHHHHcC-----CCCHHHHHHHHHhCCC
Confidence 3455555556666544 4589999999988874
No 164
>KOG0042|consensus
Probab=32.21 E-value=86 Score=26.01 Aligned_cols=32 Identities=16% Similarity=0.307 Sum_probs=25.2
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
+.+++. .+++++++|.+-+|.++..||.+.++
T Consensus 625 ~~d~~~----~~~~l~ea~~~~~g~v~l~e~~q~~s 656 (680)
T KOG0042|consen 625 GWDEDR----LHEELQEADENLNGFVELREFLQLMS 656 (680)
T ss_pred CCCHHH----HHHHHHHHHHhhcceeeHHHHHHHHH
Confidence 455554 44567899999999999999998876
No 165
>cd07978 TAF13 The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 13 (TAF13) is one of several TAFs that bind TBP and is involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Several hy
Probab=31.95 E-value=1.2e+02 Score=18.50 Aligned_cols=30 Identities=20% Similarity=0.381 Sum_probs=24.6
Q ss_pred CCCCcccHHHHHHHHhCChHHHHHhcccCC
Q psy8460 42 DKDGKISFQDFQQSVTDEPLLLEAFGQCLP 71 (86)
Q Consensus 42 d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~ 71 (86)
-+.|+++.++|+=.++++|.-+.-++.-|.
T Consensus 50 ~r~~k~~~eD~~FliR~D~~Kl~Rl~~lL~ 79 (92)
T cd07978 50 RRRGKVKVEDLIFLLRKDPKKLARLRELLS 79 (92)
T ss_pred cCCCCCCHHHHHHHHhcCHHHHHHHHHHHH
Confidence 478999999999999999998866654444
No 166
>KOG0040|consensus
Probab=31.56 E-value=54 Score=30.47 Aligned_cols=26 Identities=31% Similarity=0.465 Sum_probs=22.6
Q ss_pred HHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 33 ELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 33 d~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+.++.-.|.+.||.|+..||+..|-+
T Consensus 2299 e~~ld~vDP~r~G~Vsl~dY~afmi~ 2324 (2399)
T KOG0040|consen 2299 EEILDLVDPNRDGYVSLQDYMAFMIS 2324 (2399)
T ss_pred HHHHHhcCCCCcCcccHHHHHHHHHh
Confidence 34578899999999999999998875
No 167
>PRK00285 ihfA integration host factor subunit alpha; Reviewed
Probab=31.45 E-value=54 Score=19.84 Aligned_cols=45 Identities=16% Similarity=0.223 Sum_probs=30.4
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
.+++||.+.+.... .++..++..+++.++..+ -+..++.|.+..|
T Consensus 3 mtk~el~~~ia~~~------~~s~~~v~~vl~~~~~~i~~~L~~g~~V~l~gf 49 (99)
T PRK00285 3 LTKADLAEALFEKV------GLSKREAKELVELFFEEIRDALENGEQVKLSGF 49 (99)
T ss_pred cCHHHHHHHHHHHh------CcCHHHHHHHHHHHHHHHHHHHHcCCeEEEcCC
Confidence 47888888776532 378888888888777754 3445566766655
No 168
>PRK10664 transcriptional regulator HU subunit beta; Provisional
Probab=31.17 E-value=59 Score=19.61 Aligned_cols=44 Identities=7% Similarity=0.141 Sum_probs=28.9
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
++.||...+.... .+|..++..+++.+++.. -+..+|.|.+..|
T Consensus 2 tK~eli~~ia~~~------~~s~~~~~~~v~~~~~~i~~~L~~~~~v~l~gf 47 (90)
T PRK10664 2 NKSQLIDKIAAGA------DISKAAAGRALDAIIASVTESLKEGDDVALVGF 47 (90)
T ss_pred CHHHHHHHHHHHh------CCCHHHHHHHHHHHHHHHHHHHhCCCEEEECCc
Confidence 6778877776531 488888888887777643 3445566665544
No 169
>KOG4403|consensus
Probab=31.06 E-value=72 Score=25.63 Aligned_cols=48 Identities=23% Similarity=0.293 Sum_probs=29.3
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCChHHHHHH
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPSDAARQSF 79 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~~~~~~~~ 79 (86)
+++-+.++ .|-+.+|-|.||.|..+|= +.-+.+-+..+=+..++.++|
T Consensus 64 klg~EAir----~iHrqmDDD~nG~Id~~ES------deFlrEdmky~~~~~kr~~~f 111 (575)
T KOG4403|consen 64 KLGYEAIR----DIHRQMDDDHNGSIDVEES------DEFLREDMKYRDSTRKRSEKF 111 (575)
T ss_pred hhhHHHHH----HHHHhcccccCCCcccccc------hHHHHHHhhcccchhhhhhhc
Confidence 45555544 4568999999999999882 222224445554555554443
No 170
>KOG0039|consensus
Probab=30.92 E-value=92 Score=25.50 Aligned_cols=62 Identities=18% Similarity=0.163 Sum_probs=46.3
Q ss_pred CCCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHH
Q psy8460 1 MGQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLL 63 (86)
Q Consensus 1 ~~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~ 63 (86)
+++.+|+..++...+--.. -....+...+....++.+.|.++.|.+.++++.-.+...|-..
T Consensus 33 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~ll~~~~~~~ 94 (646)
T KOG0039|consen 33 KLTEEEVRELIMSSISANW-LSLIKKQTEEYAALIMEELDPDHKGYITNEDLEILLLQIPTLL 94 (646)
T ss_pred CccHHHHHHHHHHHHHhhh-hhhhhhhhhHHHHHhhhhccccccceeeecchhHHHHhchHHH
Confidence 3567888888886542211 2345666777888899999999999999999998888777544
No 171
>cd00591 HU_IHF Integration host factor (IHF) and HU are small heterodimeric members of the DNABII protein family that bind and bend DNA, functioning as architectural factors in many cellular processes including transcription, site-specific recombination, and higher-order nucleoprotein complex assembly. The dimer subunits associate to form a compact globular core from which two beta ribbon arms (one from each subunit) protrude. The beta arms track and bind the DNA minor groove. Despite sequence and structural similarity, IHF and HU can be distinguished by their different DNA substrate preferences.
Probab=30.67 E-value=58 Score=18.81 Aligned_cols=44 Identities=16% Similarity=0.163 Sum_probs=26.4
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
||.|+-..+.... .++..++..+++.++..+ -+.+.+.|.+..|
T Consensus 1 ~K~~l~~~ia~~~------~~~~~~v~~vl~~~~~~i~~~L~~g~~V~l~~~ 46 (87)
T cd00591 1 TKSELIEAIAEKT------GLSKKDAEAAVDAFLDVITEALAKGEKVELPGF 46 (87)
T ss_pred CHHHHHHHHHHHh------CcCHHHHHHHHHHHHHHHHHHHhCCCeEEEeCC
Confidence 4677777666532 377777777777766643 3334445655544
No 172
>PF13946 DUF4214: Domain of unknown function (DUF4214)
Probab=30.26 E-value=1.2e+02 Score=17.61 Aligned_cols=38 Identities=21% Similarity=0.330 Sum_probs=28.8
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~ 66 (86)
...++.+..++.++ ..|.++..+++..+..+|++...+
T Consensus 36 ~~D~~Gl~yw~~~L-------~~G~~sr~~v~~~~~~S~Ef~~~~ 73 (75)
T PF13946_consen 36 EPDPEGLAYWLNQL-------NSGAMSRGDVANEFANSPEFQARF 73 (75)
T ss_pred CCCHHHHHHHHHHH-------HcCCCCHHHHHHHHHhCHHHHHHh
Confidence 45677777765433 367789999999999999998654
No 173
>PF07531 TAFH: NHR1 homology to TAF; InterPro: IPR003894 The TAF homology (TAFH) or Nervy homology region 1 (NHR1) domain is a domain of 95-100 amino acids present in eukaryotic proteins of the MTG/ETO family and whereof the core ~75-80 residues occur in TAF proteins. The transcription initiation TFIID complex is composed of TATA binding protein (TBP) and a number of TBP-associated factors (TAFs). The TAFH/NHR1 domain is named after fruit fly TATA-box-associated factor 110 (TAF110), human TAF105 and TAF130, and the fruit fly protein Nervy, which is a homologue of human MTG8/ETO [, ]. The human eight twenty-one (ETO or MTG8) and related myeloid transforming gene products MTGR1 and MTG16 as well as the Nervy protein contain the NHR1-4 domains. The NHR1/TAFH domain occurs in the N-terminal part of these proteins, while a MYND-type zinc finger forms the NHR4 domain []. The TAFH/NHR1 domain can be involved in protein-protein interactions, e.g in MTG8/ETO with HSP90 and Gfi-1 []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2P6V_A 2KNH_A 2PP4_A 2H7B_A.
Probab=29.30 E-value=21 Score=22.43 Aligned_cols=28 Identities=32% Similarity=0.702 Sum_probs=18.5
Q ss_pred CCcccHHHHHHHHhC------ChHHHHHhcccCC
Q psy8460 44 DGKISFQDFQQSVTD------EPLLLEAFGQCLP 71 (86)
Q Consensus 44 DG~Is~eEF~~~l~~------~p~~~~~~~~~l~ 71 (86)
||+|+-|||...+.. .|.++..+...+|
T Consensus 38 ~~~i~~EeF~~~Lq~~lns~pqP~lvPFLK~~lp 71 (96)
T PF07531_consen 38 DGKIEAEEFTSKLQEELNSSPQPYLVPFLKKSLP 71 (96)
T ss_dssp TTSS-HHHHHHHHHHHCTSS--TTHHHHHHHHHH
T ss_pred cCCCCHHHHHHHHHHHhcCCCCcchHHHHHHhHH
Confidence 689999999988874 4566655554444
No 174
>PF09454 Vps23_core: Vps23 core domain; InterPro: IPR017916 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ]. This entry represents the Steadiness box domain.; PDB: 2CAZ_A 2F66_D 2F6M_A 2P22_A.
Probab=29.22 E-value=36 Score=19.66 Aligned_cols=16 Identities=25% Similarity=0.544 Sum_probs=12.2
Q ss_pred CCCcccHHHHHHHHhC
Q psy8460 43 KDGKISFQDFQQSVTD 58 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~ 58 (86)
++|.|+++.|++.++.
T Consensus 36 ~~g~I~~d~~lK~vR~ 51 (65)
T PF09454_consen 36 QRGSIDLDTFLKQVRS 51 (65)
T ss_dssp HTTSS-HHHHHHHHHH
T ss_pred HcCCCCHHHHHHHHHH
Confidence 3688999999988774
No 175
>smart00549 TAFH TAF homology. Domain in Drosophila nervy, CBFA2T1, human TAF105, human TAF130, and Drosophila TAF110. Also known as nervy homology region 1 (NHR1).
Probab=29.09 E-value=34 Score=21.38 Aligned_cols=28 Identities=25% Similarity=0.650 Sum_probs=21.4
Q ss_pred CCcccHHHHHHHHhC------ChHHHHHhcccCC
Q psy8460 44 DGKISFQDFQQSVTD------EPLLLEAFGQCLP 71 (86)
Q Consensus 44 DG~Is~eEF~~~l~~------~p~~~~~~~~~l~ 71 (86)
||+|+-|||..-++. .|.++..+.-.+|
T Consensus 37 ~~~i~~EeF~~~Lq~~lns~~qP~lvPFLK~slp 70 (92)
T smart00549 37 NGTITAEEFTSRLQEALNSPLQPYLIPFLKNSLP 70 (92)
T ss_pred hCCCCHHHHHHHHHHHHcCCCCchhHHHHHHhhH
Confidence 789999999988874 5667666665555
No 176
>PRK07077 hypothetical protein; Provisional
Probab=28.68 E-value=70 Score=22.95 Aligned_cols=26 Identities=19% Similarity=0.345 Sum_probs=21.9
Q ss_pred CCCCCcccHHHHHHHHhCChHHHHHh
Q psy8460 41 YDKDGKISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 41 ~d~DG~Is~eEF~~~l~~~p~~~~~~ 66 (86)
.+.||.+++..|...+.+.|..+..|
T Consensus 181 ~~~~g~~~~~~~l~~l~r~P~~i~~L 206 (238)
T PRK07077 181 LRDDGSTDILPILRGLARQPSQLGAL 206 (238)
T ss_pred cCCCcCcCHHHHHHHHHhChHHHHHH
Confidence 35899999999999999999877544
No 177
>PRK14981 DNA-directed RNA polymerase subunit F; Provisional
Probab=28.50 E-value=1.3e+02 Score=18.90 Aligned_cols=28 Identities=7% Similarity=0.092 Sum_probs=15.3
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELA 35 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i 35 (86)
|++|++.++-. . + ..+++++++++++.+
T Consensus 80 ~~dElrai~~~-~-~---~~~~~e~l~~ILd~l 107 (112)
T PRK14981 80 TRDELRAIFAK-E-R---YTLSPEELDEILDIV 107 (112)
T ss_pred CHHHHHHHHHH-h-c---cCCCHHHHHHHHHHH
Confidence 45555555543 1 1 246777777666544
No 178
>PF09066 B2-adapt-app_C: Beta2-adaptin appendage, C-terminal sub-domain; InterPro: IPR015151 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. This entry represents a subdomain of the appendage (ear) domain of beta-adaptin from AP clathrin adaptor complexes. This domain has a three-layer arrangement, alpha-beta-alpha, with a bifurcated antiparallel beta-sheet []. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15) []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 1E42_B 2G30_A 2IV9_B 2IV8_A 3HS9_A 3H1Z_A.
Probab=28.25 E-value=37 Score=20.86 Aligned_cols=43 Identities=23% Similarity=0.251 Sum_probs=21.7
Q ss_pred CCCcccHHHHHHHHhCChHHH-HHhcccCC-ChHHHHHHHHHhhh
Q psy8460 43 KDGKISFQDFQQSVTDEPLLL-EAFGQCLP-SDAARQSFLSTLQA 85 (86)
Q Consensus 43 ~DG~Is~eEF~~~l~~~p~~~-~~~~~~l~-~~~~~~~~~~~~~~ 85 (86)
.||.++-++|.+....-|+-. ..+...++ ......++.+.|++
T Consensus 3 ~d~~~~~~~F~~~W~sl~~~~~~e~~~~~~~~~~~~~~i~~~L~~ 47 (114)
T PF09066_consen 3 EDGSMDPEEFQEMWKSLPDSNQQELSIQLNASVPSPDAIEEKLQA 47 (114)
T ss_dssp TT----HHHHHHHHHHS-GGG--EEEEEETT----HHHHHHHHHC
T ss_pred CCCccCHHHHHHHHHhCCcccceEEEEeccccCCcHHHHHHHHHH
Confidence 479999999999999877766 34444444 33334455555543
No 179
>COG4535 CorC Putative Mg2+ and Co2+ transporter CorC [Inorganic ion transport and metabolism]
Probab=28.04 E-value=1e+02 Score=22.99 Aligned_cols=35 Identities=26% Similarity=0.245 Sum_probs=25.8
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCC
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDY 41 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~ 41 (86)
+|+|++.+++..-.+ ++=+.+...|++-++.-+|.
T Consensus 33 nr~eLl~liRdse~n----~LiD~dt~~mlEGvm~iadl 67 (293)
T COG4535 33 NREELLELIRDSEQN----ELIDADTLDMLEGVMDIADL 67 (293)
T ss_pred CHHHHHHHHHHhhhc----cccChhHHHHHHHHHHHHHh
Confidence 689999999986533 46666667788888877665
No 180
>COG4086 Predicted secreted protein [Function unknown]
Probab=27.69 E-value=1.9e+02 Score=21.80 Aligned_cols=35 Identities=14% Similarity=0.061 Sum_probs=16.5
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHh
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRK 38 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~ 38 (86)
+++|+++++.....+ .+-++++.++..++.-.++.
T Consensus 211 ~~~dirkvv~dv~~~-ynvnltd~qvn~i~~~~~~~ 245 (299)
T COG4086 211 DPADIRKVVDDVANN-YNVNLTDTQVNQIVNLFLAM 245 (299)
T ss_pred CHHHHHHHHHHHHHH-cCCCCCHHHHHHHHHHHHHh
Confidence 344555555543322 33345555555555444443
No 181
>PF14513 DAG_kinase_N: Diacylglycerol kinase N-terminus; PDB: 1TUZ_A.
Probab=27.59 E-value=45 Score=22.15 Aligned_cols=19 Identities=16% Similarity=0.499 Sum_probs=14.1
Q ss_pred CCCCCCcccHHHHHHHHhC
Q psy8460 40 DYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 40 D~d~DG~Is~eEF~~~l~~ 58 (86)
-.+.++.|+|+.|..+|..
T Consensus 42 ~~~~~~~Id~egF~~Fm~~ 60 (138)
T PF14513_consen 42 KYNPEEPIDYEGFKLFMKT 60 (138)
T ss_dssp GGEETTEE-HHHHHHHHHH
T ss_pred ccCCCCCcCHHHHHHHHHH
Confidence 4556678999999998874
No 182
>COG1393 ArsC Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]
Probab=27.55 E-value=1.7e+02 Score=18.59 Aligned_cols=55 Identities=15% Similarity=0.248 Sum_probs=31.6
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
.|++|+..++... + .+-+++-.--...+++.+.+ ...++-++-...+..+|.+++
T Consensus 38 ~s~~eL~~~l~~~--g-----~~~~~li~t~~~~~r~L~~~-~~~~~~~~~~~~i~~~~~Lik 92 (117)
T COG1393 38 PSREELKKILSKL--G-----DGVEELINTRGTTYRELNLD-KEDLSDEELIEALLENPSLIK 92 (117)
T ss_pred CCHHHHHHHHHHc--C-----ccHHHHHHhccchHHHcCCc-ccccChHHHHHHHHhChhhcc
Confidence 4777777777752 1 11222222223456677744 345777777888888886653
No 183
>PF10163 EnY2: Transcription factor e(y)2; InterPro: IPR018783 Enhancer of yellow 2 (EnY2) is a small transcription factor which is combined in a complex with the TAFII40 protein []. This protein is conserved from protozoa to humans.; PDB: 4DHX_C 3FWC_P 3M99_C 3KIK_A 3KJL_C 3FWB_C 3MHS_B 3MHH_B.
Probab=26.84 E-value=1.5e+02 Score=17.70 Aligned_cols=49 Identities=27% Similarity=0.351 Sum_probs=34.1
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
|.++.+...|+..+.. .| . -++++.++..+++. .|...+++++.+..+.
T Consensus 14 Ge~~~L~~~L~~rL~e-~G--W-~d~vr~~~re~i~~---~g~~~~~~~~l~~~i~ 62 (86)
T PF10163_consen 14 GEYERLKELLRQRLIE-CG--W-RDEVRQLCREIIRE---RGIDNLTFEDLLEEIT 62 (86)
T ss_dssp THHHHHHHHHHHHHHH-TT--H-HHHHHHHHHHHHHH---H-TTTSBHHHHHHHHH
T ss_pred CcHHHHHHHHHHHHHH-CC--h-HHHHHHHHHHHHHh---hCCCCCCHHHHHHHHH
Confidence 5567788888877754 11 2 35677788888887 4566799999887665
No 184
>PF11829 DUF3349: Protein of unknown function (DUF3349); InterPro: IPR021784 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 99 to 124 amino acids in length. ; PDB: 2KVC_A 3OL3_B 3OL4_A 2LKY_A.
Probab=26.69 E-value=1.7e+02 Score=18.31 Aligned_cols=35 Identities=17% Similarity=0.088 Sum_probs=22.7
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
.+|++|+.+++..+....+..- ..+....++.-+.
T Consensus 35 ~Ltd~ev~~Va~~L~~~~~~~~-~~~dI~~~I~~vt 69 (96)
T PF11829_consen 35 RLTDDEVAEVAAELAARGDPPV-DRIDIGVAITRVT 69 (96)
T ss_dssp TS-HHHHHHHHHHHHHHTSS-B-SCCHHHHHHHHHC
T ss_pred cCCHHHHHHHHHHHHhcCCCCC-CHHHHHHHHHHHH
Confidence 5999999999999888665433 3455555554443
No 185
>PF05598 DUF772: Transposase domain (DUF772); InterPro: IPR008490 This presumed domain is found at the N terminus of transposase insH and other related transposases.
Probab=26.52 E-value=1.3e+02 Score=16.89 Aligned_cols=33 Identities=15% Similarity=0.241 Sum_probs=21.0
Q ss_pred cHHHHHHHHhCChHHHHHhc----ccCCChHHHHHHH
Q psy8460 48 SFQDFQQSVTDEPLLLEAFG----QCLPSDAARQSFL 80 (86)
Q Consensus 48 s~eEF~~~l~~~p~~~~~~~----~~l~~~~~~~~~~ 80 (86)
|..+....+..++.+....+ ...|+.+....|.
T Consensus 24 S~r~l~~~l~~~~~~r~~~g~~~~~~~pd~stl~rfr 60 (77)
T PF05598_consen 24 SDRELEERLRDNLSFRYFCGLSLEEPVPDHSTLSRFR 60 (77)
T ss_pred hHHHHHhhHhhhhHHHHHHhcccCCCCCChHHHHHHH
Confidence 55666666666777766666 4667765555554
No 186
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=26.29 E-value=58 Score=15.84 Aligned_cols=13 Identities=31% Similarity=0.644 Sum_probs=10.2
Q ss_pred CCCcccHHHHHHH
Q psy8460 43 KDGKISFQDFQQS 55 (86)
Q Consensus 43 ~DG~Is~eEF~~~ 55 (86)
.+|.||-+||.+.
T Consensus 13 ~~G~IseeEy~~~ 25 (31)
T PF09851_consen 13 DKGEISEEEYEQK 25 (31)
T ss_pred HcCCCCHHHHHHH
Confidence 3788999998764
No 187
>PHA02943 hypothetical protein; Provisional
Probab=25.79 E-value=2.3e+02 Score=19.53 Aligned_cols=56 Identities=14% Similarity=0.132 Sum_probs=44.2
Q ss_pred HHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCC---ChHHHHHHHHHhh
Q psy8460 29 RDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLP---SDAARQSFLSTLQ 84 (86)
Q Consensus 29 ~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~---~~~~~~~~~~~~~ 84 (86)
.++...+++-.-.++=..|+-....+.+.++|.-.+.|..-.| .+.++-+|++-.+
T Consensus 77 ~~~~Relwrlv~s~~~kfi~p~~l~~li~kd~~a~~~~ak~v~v~~r~a~il~fln~~l 135 (165)
T PHA02943 77 FEIKRELWRLVCNSRLKFITPSRLLRLIAKDTEAHNIFAKYVPVNSRHAHILSFLNSAL 135 (165)
T ss_pred HHHHHHHHHHHHhccccccChHHHHHHHHhCHHHHHHHHHhcCcccchHHHHHHHHHHH
Confidence 3444555556667778889999999999999999999998888 4778889987543
No 188
>PRK10753 transcriptional regulator HU subunit alpha; Provisional
Probab=25.63 E-value=80 Score=18.96 Aligned_cols=44 Identities=14% Similarity=0.224 Sum_probs=28.4
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhc--CCCCCCcccHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKM--DYDKDGKISFQDF 52 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~--D~d~DG~Is~eEF 52 (86)
++.||...+... ..+|..++..+++.++..+ -+..++.|.+..|
T Consensus 2 ~K~eli~~ia~~------~~~s~~~~~~~v~~~~~~i~~~L~~g~~V~i~gf 47 (90)
T PRK10753 2 NKTQLIDVIADK------AELSKTQAKAALESTLAAITESLKEGDAVQLVGF 47 (90)
T ss_pred CHHHHHHHHHHH------hCCCHHHHHHHHHHHHHHHHHHHHcCCeEEEcCC
Confidence 577777776652 1378888888888877754 3445555655443
No 189
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=25.56 E-value=61 Score=18.25 Aligned_cols=21 Identities=19% Similarity=0.355 Sum_probs=17.0
Q ss_pred cccHHHHHHHHhCChHHHHHh
Q psy8460 46 KISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 46 ~Is~eEF~~~l~~~p~~~~~~ 66 (86)
.|+++.|..++..+|.+...|
T Consensus 94 ~i~~~~~~~~~~~~~~~~~~~ 114 (115)
T cd00038 94 VLPRSDFRRLLQEYPELARRL 114 (115)
T ss_pred EEeHHHHHHHHHHCcHhHHhc
Confidence 578899999999999887644
No 190
>PRK10665 nitrogen regulatory protein P-II 2; Provisional
Probab=25.25 E-value=91 Score=19.71 Aligned_cols=24 Identities=21% Similarity=0.440 Sum_probs=18.3
Q ss_pred ChHHHHHHHHHHHHhc--CCCCCCcc
Q psy8460 24 PDEGVRDLVELALRKM--DYDKDGKI 47 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~--D~d~DG~I 47 (86)
++++++.+++-+.+.+ ..-|||+|
T Consensus 66 ~de~ve~vv~~I~~~a~TG~~GDGkI 91 (112)
T PRK10665 66 ADDQLDEVIDIISKAAYTGKIGDGKI 91 (112)
T ss_pred ChHhHHHHHHHHHHHhccCCCCCcEE
Confidence 5788888888877754 56788887
No 191
>cd02518 GT2_SpsF SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat. Spore coat polysaccharide biosynthesis protein F (spsF) is a glycosyltransferase implicated in the synthesis of the spore coat in a variety of bacteria challenged by stress as starvation. The spsF gene is expressed in the late stage of coat development responsible for a terminal step in coat formation that involves the glycosylation of the coat. SpsF gene mutation resulted in spores that appeared normal. But, the spores tended to aggregate and had abnormal adsorption properties, indicating a surface alteration.
Probab=25.08 E-value=2.2e+02 Score=19.28 Aligned_cols=39 Identities=18% Similarity=0.298 Sum_probs=27.9
Q ss_pred ChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChHHHH
Q psy8460 24 PDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~ 64 (86)
|+++++ .+..+.+.++.. +-.+|+++-++.+.++|.+.+
T Consensus 191 t~eD~~-~~~~~~~~~~~~-~~~~~~~~i~~~~~~~p~l~~ 229 (233)
T cd02518 191 TPEDFE-LIKEIYEALYPK-NPDFSLEDIIELLDKNPELFE 229 (233)
T ss_pred CHHHHH-HHHHHHHHhcCC-CCCCCHHHHHHHHHhChhHHH
Confidence 456664 556666666542 334899999999999999875
No 192
>PLN02764 glycosyltransferase family protein
Probab=25.04 E-value=1.7e+02 Score=22.89 Aligned_cols=54 Identities=15% Similarity=-0.036 Sum_probs=30.8
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++++++...++..+.+ +.-.-+++++.+.++=+.+...|...-++++|++.+.+
T Consensus 392 ~~~e~i~~av~~vm~~---~~~~g~~~r~~a~~~~~~~~~~GSS~~~l~~lv~~~~~ 445 (453)
T PLN02764 392 FSKESLRDAINSVMKR---DSEIGNLVKKNHTKWRETLASPGLLTGYVDNFIESLQD 445 (453)
T ss_pred cCHHHHHHHHHHHhcC---CchhHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence 4788888888887643 11123445555555545554445555556666666554
No 193
>COG2055 Malate/L-lactate dehydrogenases [Energy production and conversion]
Probab=24.82 E-value=3.3e+02 Score=20.97 Aligned_cols=53 Identities=17% Similarity=0.211 Sum_probs=36.3
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCC---CCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYD---KDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d---~DG~Is~eEF~~~l~~ 58 (86)
+|.+|+...+.+.+.. . ..++++.+.+++ +|-.+|.. .=|..-+.-|++.+..
T Consensus 6 ~~~e~L~~~~~~vl~~-~--G~~ee~A~~vA~-~lv~ad~~G~~SHGv~r~p~yi~~l~~ 61 (349)
T COG2055 6 VSAEELKALIEEVLRK-A--GVPEEDARAVAD-VLVAADLRGVDSHGVGRLPGYVRRLKA 61 (349)
T ss_pred ecHHHHHHHHHHHHHH-c--CCCHHHHHHHHH-HHHHHHhcCCcccchHHHHHHHHHHHc
Confidence 6889999999998876 3 477887776654 44445544 3455566677766653
No 194
>PF13442 Cytochrome_CBB3: Cytochrome C oxidase, cbb3-type, subunit III ; PDB: 1KB0_A 2DGE_D 2CE1_A 2CE0_A 2V07_A 1W2L_A 2ZOO_A 2ZBO_G 1DVV_A 2EXV_A ....
Probab=24.73 E-value=1.3e+02 Score=16.32 Aligned_cols=29 Identities=24% Similarity=0.266 Sum_probs=16.6
Q ss_pred HHHHHHHHHHhhcCCC--CCCChHHHHHHHH
Q psy8460 5 RSLMYLGRNSLISHPQ--DEDPDEGVRDLVE 33 (86)
Q Consensus 5 eEm~~vL~~~l~~~~~--~~ls~ee~~~~vd 33 (86)
+.+..++..-...++. ..+|++|++.++.
T Consensus 35 ~~l~~~i~~g~~~Mp~~~~~ls~~e~~~l~~ 65 (67)
T PF13442_consen 35 EELYNIIRNGRGGMPPFGGQLSDEEIEALAA 65 (67)
T ss_dssp HHHHHHHHHTBTTBSCTTTTSTHHHHHHHHH
T ss_pred HHHHHHHHhCcCCCCCCCCCCCHHHHHHHHH
Confidence 4555666643323322 2588888887764
No 195
>PF11363 DUF3164: Protein of unknown function (DUF3164); InterPro: IPR021505 This entry is represented by Bacteriophage B3, Orf6. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=24.57 E-value=2e+02 Score=20.00 Aligned_cols=41 Identities=27% Similarity=0.362 Sum_probs=28.2
Q ss_pred ChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHh---CChHHHHHhc
Q psy8460 24 PDEGVRDLVELALRKMDYDKDGKISFQDFQQSVT---DEPLLLEAFG 67 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~---~~p~~~~~~~ 67 (86)
++.++..+|...|+ .|+.|.|+...-....+ .+|.+.+.+.
T Consensus 116 ~~~~l~~lV~~af~---~dk~G~l~~~rIl~Lrrl~i~D~~w~~am~ 159 (195)
T PF11363_consen 116 ADPELRALVNRAFQ---VDKEGNLNTSRILGLRRLEIDDERWQEAMD 159 (195)
T ss_pred CChHHHHHHHHHHh---cCCCCCcCHHHHHHHHhccCCCHHHHHHHH
Confidence 35667777766554 58999999987665444 4777776654
No 196
>COG5562 Phage envelope protein [General function prediction only]
Probab=24.23 E-value=54 Score=21.93 Aligned_cols=21 Identities=14% Similarity=0.548 Sum_probs=17.1
Q ss_pred cCCCCCCcccHHHHHHHHhCC
Q psy8460 39 MDYDKDGKISFQDFQQSVTDE 59 (86)
Q Consensus 39 ~D~d~DG~Is~eEF~~~l~~~ 59 (86)
...+..|.++|+||...+.+.
T Consensus 81 l~~~qsGqttF~ef~~~la~A 101 (137)
T COG5562 81 LRRHQSGQTTFEEFCSALAEA 101 (137)
T ss_pred HHHHhcCCccHHHHHHHHHhC
Confidence 456789999999999887653
No 197
>PF09860 DUF2087: Uncharacterized protein conserved in bacteria (DUF2087); InterPro: IPR018656 This domain, found in various hypothetical prokaryotic proteins and transcriptional activators, has no known function.
Probab=23.83 E-value=1.6e+02 Score=17.14 Aligned_cols=28 Identities=11% Similarity=0.217 Sum_probs=17.8
Q ss_pred HHHHHHHHHhhcCCCCCCChHHHHHHHHHH
Q psy8460 6 SLMYLGRNSLISHPQDEDPDEGVRDLVELA 35 (86)
Q Consensus 6 Em~~vL~~~l~~~~~~~ls~ee~~~~vd~i 35 (86)
.++..+...+ . +|...|+.|+.+++..+
T Consensus 15 ~iL~~l~~~f-~-~g~~y~E~EVN~~L~~~ 42 (71)
T PF09860_consen 15 VILEYLASRF-E-PGREYSEKEVNEILKRF 42 (71)
T ss_pred HHHHHHHHhC-C-CCCccCHHHHHHHHHHH
Confidence 3444444444 2 56788999988876555
No 198
>KOG4666|consensus
Probab=23.60 E-value=62 Score=25.07 Aligned_cols=28 Identities=11% Similarity=0.229 Sum_probs=24.3
Q ss_pred HHHHHHHHhcCCCCCCcccHHHHHHHHh
Q psy8460 30 DLVELALRKMDYDKDGKISFQDFQQSVT 57 (86)
Q Consensus 30 ~~vd~i~~~~D~d~DG~Is~eEF~~~l~ 57 (86)
+....+|.-+|-+++|.++|.|....+.
T Consensus 259 d~l~~~f~LFde~~tg~~D~re~v~~la 286 (412)
T KOG4666|consen 259 DKLAPTFMLFDEGTTGNGDYRETVKTLA 286 (412)
T ss_pred hhhhhhhheecCCCCCcccHHHHhhhhe
Confidence 4566789999999999999999998765
No 199
>PRK08032 fliD flagellar capping protein; Reviewed
Probab=23.41 E-value=57 Score=25.45 Aligned_cols=27 Identities=26% Similarity=0.599 Sum_probs=22.3
Q ss_pred CCC-CCCcccHHH--HHHHHhCChHHHHHh
Q psy8460 40 DYD-KDGKISFQD--FQQSVTDEPLLLEAF 66 (86)
Q Consensus 40 D~d-~DG~Is~eE--F~~~l~~~p~~~~~~ 66 (86)
..+ .||.|++++ |.+.+..+|+-+..|
T Consensus 347 ~~~~~~G~L~lD~~kl~~al~~np~~V~~l 376 (462)
T PRK08032 347 TQDPSDGKLEIDDDKLTKALKEDPAGVKAL 376 (462)
T ss_pred eccCCCCeEEEcHHHHHHHHHHCHHHHHHH
Confidence 345 789999887 999999999987654
No 200
>PRK13403 ketol-acid reductoisomerase; Provisional
Probab=23.36 E-value=1.7e+02 Score=22.34 Aligned_cols=44 Identities=9% Similarity=0.222 Sum_probs=28.1
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhCChH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTDEPL 61 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~~p~ 61 (86)
+|+.|.++|+.+..+ +-.+++ +.+ ++.|.-+|....+...+.|-
T Consensus 269 ~k~~m~~~l~~iq~g--------~fa~~~----~~e---~~~g~~~~~~~~~~~~~h~i 312 (335)
T PRK13403 269 TKKEMKRVLTEIQQG--------EFAKKW----ILE---NQAGRPTYNAMKKAEQNHQL 312 (335)
T ss_pred HHHHHHHHHHHhcCC--------HHHHHH----HHH---hhCCCHHHHHHHHHHhcCcH
Confidence 467777777775432 122233 343 56888888888888887763
No 201
>PF06348 DUF1059: Protein of unknown function (DUF1059); InterPro: IPR009409 This entry consists of short hypothetical archaeal and bacterial proteins of unknown function.
Probab=22.94 E-value=1.5e+02 Score=16.41 Aligned_cols=35 Identities=11% Similarity=0.073 Sum_probs=17.2
Q ss_pred CHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALR 37 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~ 37 (86)
|.+|++..+..=...-+|....++++.+.|...++
T Consensus 21 tedEll~~~~~Ha~~~Hg~~~~~~el~~~ir~~I~ 55 (57)
T PF06348_consen 21 TEDELLEAVVEHAREVHGMTEIPEELREKIRSAIK 55 (57)
T ss_pred CHHHHHHHHHHHHHHhcCCccCCHHHHHHHHHHhh
Confidence 56777777664333324444334444444444443
No 202
>PRK05479 ketol-acid reductoisomerase; Provisional
Probab=22.77 E-value=1.7e+02 Score=22.12 Aligned_cols=19 Identities=21% Similarity=0.634 Sum_probs=14.1
Q ss_pred CCCCcccHHHHHHHHhCCh
Q psy8460 42 DKDGKISFQDFQQSVTDEP 60 (86)
Q Consensus 42 d~DG~Is~eEF~~~l~~~p 60 (86)
++.|..+|....+...++|
T Consensus 295 ~~~~~~~~~~~~~~~~~~~ 313 (330)
T PRK05479 295 NKAGRPTFKALRREEAEHP 313 (330)
T ss_pred hcCCcHHHHHHHHHHhcCc
Confidence 4578888888877777666
No 203
>COG3793 TerB Tellurite resistance protein [Inorganic ion transport and metabolism]
Probab=22.43 E-value=2.5e+02 Score=18.87 Aligned_cols=41 Identities=15% Similarity=0.198 Sum_probs=30.5
Q ss_pred ChHHHHHHHHHHHHhcCCCCC-CcccHHHHHHHHhCChHHHH
Q psy8460 24 PDEGVRDLVELALRKMDYDKD-GKISFQDFQQSVTDEPLLLE 64 (86)
Q Consensus 24 s~ee~~~~vd~i~~~~D~d~D-G~Is~eEF~~~l~~~p~~~~ 64 (86)
...++..+.+.+..-+|.|.+ |++.-.+.+..+..+++-.+
T Consensus 63 ~~~~i~~~~~~~~~~~~~d~~~gk~ea~~~I~~lk~d~e~Ae 104 (144)
T COG3793 63 DSNEINEIFETLVGSFDTDFEIGKREAMKEIEDLKHDTEAAE 104 (144)
T ss_pred cHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHhcCChHHHH
Confidence 344556666666667788888 88999999999998887544
No 204
>KOG0998|consensus
Probab=22.28 E-value=69 Score=27.21 Aligned_cols=36 Identities=17% Similarity=0.207 Sum_probs=27.3
Q ss_pred CCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 22 EDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.+++.+..... +||.+.|.++||.|+..++...+..
T Consensus 276 ~vsp~d~~~~~-~if~q~d~~~dG~I~s~~~~~~f~~ 311 (847)
T KOG0998|consen 276 KVSPSDKQKYS-KIFSQVDKDNDGSISSNEARNIFLP 311 (847)
T ss_pred ccChHHHHHHH-HHHHhccccCCCccccccccccccc
Confidence 46666666543 5899999999999999987766654
No 205
>KOG0751|consensus
Probab=22.26 E-value=1.1e+02 Score=25.05 Aligned_cols=28 Identities=29% Similarity=0.481 Sum_probs=23.7
Q ss_pred HHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 31 LVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 31 ~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
+-...|..+|..++|.+|+++|..+...
T Consensus 109 l~~~aFqlFDr~~~~~vs~~~~~~if~~ 136 (694)
T KOG0751|consen 109 LFEVAFQLFDRLGNGEVSFEDVADIFGQ 136 (694)
T ss_pred HHHHHHHHhcccCCCceehHHHHHHHhc
Confidence 3345789999999999999999988874
No 206
>PHA02554 13 neck protein; Provisional
Probab=22.13 E-value=1.7e+02 Score=22.20 Aligned_cols=34 Identities=24% Similarity=0.205 Sum_probs=26.8
Q ss_pred CHHHHHHHHHHHhhcCCC--CCCChHHHHHHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLISHPQ--DEDPDEGVRDLVELALR 37 (86)
Q Consensus 3 sreEm~~vL~~~l~~~~~--~~ls~ee~~~~vd~i~~ 37 (86)
|+.||.....+.|+. |- .+++++++.+.|++.++
T Consensus 7 sp~eLkD~iLRrLGA-Pii~Ievt~dQi~D~I~rALe 42 (311)
T PHA02554 7 NPRELKDYILRRLGA-PIINVEVTEDQIYDCIQRALE 42 (311)
T ss_pred CHHHHHHHHHHhcCC-CeeEeecCHHHHHHHHHHHHH
Confidence 788999988877744 31 46899999999988776
No 207
>PF01839 FG-GAP: FG-GAP repeat; InterPro: IPR013517 This region contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure []. The repeat is called the FG-GAP repeat after two conserved motifs in the repeat []. The FG-GAP repeats are found in the N terminus of integrin alpha chains, a region that has been shown to be important for ligand binding []. A putative Ca2+ binding motif is found in some of the repeats. ; PDB: 1L5G_A 3IJE_A 1M1X_A 1JV2_A 1U8C_A 3V4P_C 3V4V_C 3VI3_A 3VI4_C 2VDN_A ....
Probab=21.84 E-value=50 Score=16.45 Aligned_cols=9 Identities=33% Similarity=0.553 Sum_probs=7.1
Q ss_pred hcCCCCCCc
Q psy8460 38 KMDYDKDGK 46 (86)
Q Consensus 38 ~~D~d~DG~ 46 (86)
-.|.|+||.
T Consensus 7 ~gD~ngDG~ 15 (34)
T PF01839_consen 7 VGDFNGDGY 15 (34)
T ss_dssp EESTSSSSS
T ss_pred EEEcCCCCC
Confidence 368999996
No 208
>PF13591 MerR_2: MerR HTH family regulatory protein
Probab=21.57 E-value=1.1e+02 Score=18.11 Aligned_cols=20 Identities=30% Similarity=0.532 Sum_probs=17.1
Q ss_pred ccHHHHHHHHhCChHHHHHh
Q psy8460 47 ISFQDFQQSVTDEPLLLEAF 66 (86)
Q Consensus 47 Is~eEF~~~l~~~p~~~~~~ 66 (86)
||++||.....-+|.++..+
T Consensus 1 is~~e~~~~~~i~~~~l~~l 20 (84)
T PF13591_consen 1 ISLEEFCEACGIEPEFLREL 20 (84)
T ss_pred CCHHHHHHHHCcCHHHHHHH
Confidence 68899999999999988655
No 209
>PF06569 DUF1128: Protein of unknown function (DUF1128); InterPro: IPR009507 This family consists of several short, hypothetical bacterial proteins of unknown function.
Probab=21.48 E-value=1.9e+02 Score=17.14 Aligned_cols=6 Identities=17% Similarity=0.135 Sum_probs=2.6
Q ss_pred cccHHH
Q psy8460 46 KISFQD 51 (86)
Q Consensus 46 ~Is~eE 51 (86)
.+|-.|
T Consensus 53 ~fS~sE 58 (71)
T PF06569_consen 53 SFSPSE 58 (71)
T ss_pred CCCHHH
Confidence 344444
No 210
>PF06919 Phage_T4_Gp30_7: Phage Gp30.7 protein; InterPro: IPR009690 This family consists of several phage Gp30.7 proteins of 121 residues in length. Family members seem to be exclusively from the T4-like viruses. The function of this family is unknown.
Probab=21.40 E-value=2.4e+02 Score=18.28 Aligned_cols=28 Identities=18% Similarity=0.267 Sum_probs=20.2
Q ss_pred HHHHHhhcCCCCCCChHHHHHHHHHHHHhcCC
Q psy8460 10 LGRNSLISHPQDEDPDEGVRDLVELALRKMDY 41 (86)
Q Consensus 10 vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~ 41 (86)
.++.++.. .-++++.+.+-|-++.++|.
T Consensus 94 ~iRhiL~~----~a~~e~~EAi~D~V~NEf~i 121 (121)
T PF06919_consen 94 YIRHILGN----KAKPEHLEAIFDVVLNEFDI 121 (121)
T ss_pred HHHHHHhc----cCCHHHHHHHHHHHHhhccC
Confidence 35555543 57889999988888888763
No 211
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=21.28 E-value=1e+02 Score=17.26 Aligned_cols=22 Identities=23% Similarity=0.404 Sum_probs=17.8
Q ss_pred cHHHHHHHHhCChHHHHHhccc
Q psy8460 48 SFQDFQQSVTDEPLLLEAFGQC 69 (86)
Q Consensus 48 s~eEF~~~l~~~p~~~~~~~~~ 69 (86)
+...|+..+.++|.+.+-+.-.
T Consensus 3 ~l~~Fl~~~~~d~~L~~~l~~~ 24 (64)
T TIGR03798 3 QLKAFLEKVKTDPDLREKLKAA 24 (64)
T ss_pred HHHHHHHHHHcCHHHHHHHHHc
Confidence 3578999999999998877654
No 212
>PLN02222 phosphoinositide phospholipase C 2
Probab=21.25 E-value=3.1e+02 Score=22.52 Aligned_cols=51 Identities=10% Similarity=0.075 Sum_probs=30.0
Q ss_pred CCHHHHHHHHHHHhhcCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLISHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
.|.+++...|...... ...+.+.+++++++. . ..-+.+.++++.|...+..
T Consensus 40 mt~~~l~~FL~~~Q~~---~~~~~~~~~~ii~~~-~--~~~~~~~~~~~gF~~yL~s 90 (581)
T PLN02222 40 MTVDHLHRFLIDVQKQ---DKATREDAQSIINSA-S--SLLHRNGLHLDAFFKYLFG 90 (581)
T ss_pred cCHHHHHHHHHHhcCC---ccCCHHHHHHHHHhh-h--hhhhccCcCHHHHHHHhcC
Confidence 3566666666653321 134556666555442 1 1124567999999999875
No 213
>PF09068 EF-hand_2: EF hand; InterPro: IPR015153 Like other EF hand domains, this domain forms a helix-loop-helix motif, though since it does not contain the canonical pattern of calcium binding residues found in many EF hand domains, it does not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (IPR001202 from INTERPRO), enhancing dystroglycan binding []. ; PDB: 1EG3_A 1EG4_A.
Probab=21.12 E-value=2.4e+02 Score=18.13 Aligned_cols=54 Identities=24% Similarity=0.242 Sum_probs=30.8
Q ss_pred CHHHHHHHHHHHhh----cCCCC-CCC----hHHHHHHHHHHHHhcCCCCCCcccHHHHHHHH
Q psy8460 3 QKRSLMYLGRNSLI----SHPQD-EDP----DEGVRDLVELALRKMDYDKDGKISFQDFQQSV 56 (86)
Q Consensus 3 sreEm~~vL~~~l~----~~~~~-~ls----~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l 56 (86)
+-.||..+|..++. ..+.. +++ +.-++-.+.-++.-+|.+++|.|+--.|..++
T Consensus 61 ~v~~l~~~L~~iy~~l~~~~p~~~~i~~~~v~~a~~L~ln~Ll~vyD~~rtG~I~vls~KvaL 123 (127)
T PF09068_consen 61 SVSQLETLLSSIYEFLNKRLPTLHQIPSRPVDLAVDLLLNWLLNVYDSQRTGKIRVLSFKVAL 123 (127)
T ss_dssp EHHHHHHHHHHHHHHHHHHSTTS--HH-----HHHHHHHHHHHHHH-TT--SEEEHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCchhHHHHHHHHHHHHHHHhCCCCCCeeehhHHHHHH
Confidence 44677777777661 11111 111 24444556667789999999999988887665
No 214
>PF08557 Lipid_DES: Sphingolipid Delta4-desaturase (DES); InterPro: IPR013866 Sphingolipids are important membrane signalling molecules involved in many different cellular functions in eukaryotes. Sphingolipid delta 4-desaturase catalyses the formation of (E)-sphing-4-enine []. Some proteins in this entry have bifunctional delta 4-desaturase/C-4-hydroxylase activity. Delta 4-desaturated sphingolipids may play a role in early signalling required for entry into meiotic and spermatid differentiation pathways during Drosophila spermatogenesis []. This small protein associates with FA_desaturase IPR005804 from INTERPRO and appears to be specific to sphingolipid delta 4-desaturase. ; GO: 0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 0006633 fatty acid biosynthetic process, 0055114 oxidation-reduction process, 0016021 integral to membrane
Probab=21.07 E-value=94 Score=16.35 Aligned_cols=17 Identities=12% Similarity=0.212 Sum_probs=13.4
Q ss_pred HHHhCChHHHHHhcccC
Q psy8460 54 QSVTDEPLLLEAFGQCL 70 (86)
Q Consensus 54 ~~l~~~p~~~~~~~~~l 70 (86)
.++.+.|+|.+.++.+.
T Consensus 21 ~IL~k~PeIk~L~G~dp 37 (39)
T PF08557_consen 21 EILKKHPEIKKLMGPDP 37 (39)
T ss_pred HHHHhChHHHHHhCCCC
Confidence 56778899998888763
No 215
>PLN02534 UDP-glycosyltransferase
Probab=20.85 E-value=1e+02 Score=24.34 Aligned_cols=56 Identities=13% Similarity=0.123 Sum_probs=27.6
Q ss_pred CCHHHHHHHHHHHhh--cCCCCCCChHHHHHHHHHHHHhcCCCCCCcccHHHHHHHHhC
Q psy8460 2 GQKRSLMYLGRNSLI--SHPQDEDPDEGVRDLVELALRKMDYDKDGKISFQDFQQSVTD 58 (86)
Q Consensus 2 ~sreEm~~vL~~~l~--~~~~~~ls~ee~~~~vd~i~~~~D~d~DG~Is~eEF~~~l~~ 58 (86)
++++|+...++..+. +..|+.+- +-+.++-+..-+-+...|...-++++|++.+.+
T Consensus 429 v~~eev~~~v~~~m~~~~eeg~~~R-~rA~elk~~a~~Av~~GGSS~~nl~~fv~~i~~ 486 (491)
T PLN02534 429 VKKDEVEKAVKTLMDDGGEEGERRR-RRAQELGVMARKAMELGGSSHINLSILIQDVLK 486 (491)
T ss_pred cCHHHHHHHHHHHhccccccHHHHH-HHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHH
Confidence 578999999998764 21111111 111222222222334445555566677666653
No 216
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=20.78 E-value=2.5e+02 Score=18.08 Aligned_cols=19 Identities=26% Similarity=0.293 Sum_probs=11.4
Q ss_pred CCChHHHHHHHHHHHHhcC
Q psy8460 22 EDPDEGVRDLVELALRKMD 40 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~~D 40 (86)
+++.+|.+.+++.+++++-
T Consensus 38 eln~eEak~~vddl~~q~k 56 (108)
T COG3937 38 ELNAEEAKRFVDDLLRQAK 56 (108)
T ss_pred CCCHHHHHHHHHHHHHHHH
Confidence 4666666666666666443
No 217
>PF08726 EFhand_Ca_insen: Ca2+ insensitive EF hand; InterPro: IPR014837 EF hands are helix-loop-helix binding motifs involved in the regulation of many cellular processes. EF hands usually bind to Ca2+ ions, which cause a major conformational change that allows the protein to interact with its designated targets. This protein corresponds to an EF hand which has partially or entirely lost its calcium-binding properties. The calcium insensitive EF hand is still able to mediate protein-protein recognition []. ; PDB: 1H8B_A 1SJJ_B.
Probab=20.60 E-value=1.8e+02 Score=16.92 Aligned_cols=36 Identities=22% Similarity=0.254 Sum_probs=23.1
Q ss_pred HHHHhcCCCCCCcccHHHHHHHHhCChHHHHHhcccCCC
Q psy8460 34 LALRKMDYDKDGKISFQDFQQSVTDEPLLLEAFGQCLPS 72 (86)
Q Consensus 34 ~i~~~~D~d~DG~Is~eEF~~~l~~~p~~~~~~~~~l~~ 72 (86)
.-|+.+ .++.+.|+-+|+.+.+..... +.+..++|.
T Consensus 10 ~aFr~l-A~~KpyVT~~dLr~~l~pe~a--ey~~~~Mp~ 45 (69)
T PF08726_consen 10 EAFRAL-AGGKPYVTEEDLRRSLTPEQA--EYCISRMPP 45 (69)
T ss_dssp HHHHHH-CTSSSCEEHHHHHHHS-CCCH--HHHHCCSEC
T ss_pred HHHHHH-HcCCCcccHHHHHHHcCcHHH--HHHHHHCcc
Confidence 346777 667788888888888765444 445555553
No 218
>PF06570 DUF1129: Protein of unknown function (DUF1129); InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=20.28 E-value=2.5e+02 Score=19.25 Aligned_cols=17 Identities=24% Similarity=0.352 Sum_probs=11.7
Q ss_pred CCChHHHHHHHHHHHHh
Q psy8460 22 EDPDEGVRDLVELALRK 38 (86)
Q Consensus 22 ~ls~ee~~~~vd~i~~~ 38 (86)
..++++.++++++++.+
T Consensus 21 ~~~e~~~e~~L~eil~~ 37 (206)
T PF06570_consen 21 GVSEEEIEELLEEILPH 37 (206)
T ss_pred CCCHHHHHHHHHHHHHH
Confidence 57777777777766664
No 219
>PRK06798 fliD flagellar capping protein; Validated
Probab=20.23 E-value=74 Score=24.79 Aligned_cols=28 Identities=18% Similarity=0.557 Sum_probs=23.1
Q ss_pred cCCCCCCcccHHH--HHHHHhCChHHHHHh
Q psy8460 39 MDYDKDGKISFQD--FQQSVTDEPLLLEAF 66 (86)
Q Consensus 39 ~D~d~DG~Is~eE--F~~~l~~~p~~~~~~ 66 (86)
...+.||.+++++ |.+.+..+|+-++.|
T Consensus 323 I~~~~~G~L~lD~~kL~~al~~np~~V~~l 352 (440)
T PRK06798 323 IQVDKEGTMKVDEEALKKALKENPDAAKQF 352 (440)
T ss_pred eEeCCCCCEEEcHHHHHHHHHHCHHHHHHH
Confidence 3446789999887 999999999988755
No 220
>PF02671 PAH: Paired amphipathic helix repeat; InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=20.20 E-value=89 Score=16.19 Aligned_cols=16 Identities=44% Similarity=0.771 Sum_probs=8.8
Q ss_pred HhCChHHHHHhcccCC
Q psy8460 56 VTDEPLLLEAFGQCLP 71 (86)
Q Consensus 56 l~~~p~~~~~~~~~l~ 71 (86)
+..+|+++.-|..-+|
T Consensus 32 l~~hpdLl~~F~~FlP 47 (47)
T PF02671_consen 32 LRGHPDLLEEFNRFLP 47 (47)
T ss_dssp TTT-HHHHHHHHHHSS
T ss_pred HccCHHHHHHHHhhCc
Confidence 3457777776654443
Done!