Query psy15330
Match_columns 60
No_of_seqs 151 out of 1027
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 18:13:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15330.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15330hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0044|consensus 99.2 1.5E-11 3.3E-16 77.9 5.5 54 2-56 133-186 (193)
2 PF00036 EF-hand_1: EF hand; 99.1 5E-11 1.1E-15 55.1 2.6 26 19-44 2-27 (29)
3 KOG0034|consensus 98.8 3.5E-09 7.7E-14 66.8 3.7 45 12-56 142-186 (187)
4 PF13202 EF-hand_5: EF hand; P 98.8 5.5E-09 1.2E-13 46.7 2.9 25 19-43 1-25 (25)
5 PF13405 EF-hand_6: EF-hand do 98.6 5.8E-08 1.3E-12 44.8 2.9 28 18-45 1-28 (31)
6 PF13833 EF-hand_8: EF-hand do 98.5 1E-07 2.2E-12 48.2 3.2 31 14-44 22-52 (54)
7 PF13499 EF-hand_7: EF-hand do 98.5 2.4E-07 5.2E-12 48.3 3.8 31 13-43 36-66 (66)
8 cd05022 S-100A13 S-100A13: S-1 98.1 3.3E-06 7.1E-11 47.6 3.7 29 17-45 47-75 (89)
9 cd00252 SPARC_EC SPARC_EC; ext 98.1 3.7E-06 8E-11 49.5 3.8 30 16-45 79-108 (116)
10 cd05026 S-100Z S-100Z: S-100Z 98.1 3.6E-06 7.7E-11 47.4 3.5 30 15-44 51-80 (93)
11 PF13499 EF-hand_7: EF-hand do 98.1 3.1E-06 6.6E-11 44.0 3.0 28 18-45 1-28 (66)
12 cd05023 S-100A11 S-100A11: S-1 98.1 6E-06 1.3E-10 46.4 3.8 30 15-44 50-79 (89)
13 smart00054 EFh EF-hand, calciu 98.1 6E-06 1.3E-10 35.1 3.0 26 19-44 2-27 (29)
14 cd05029 S-100A6 S-100A6: S-100 98.1 6.4E-06 1.4E-10 46.2 3.6 32 13-44 47-78 (88)
15 cd05030 calgranulins Calgranul 98.1 5.8E-06 1.3E-10 46.1 3.5 31 14-44 48-78 (88)
16 COG5126 FRQ1 Ca2+-binding prot 98.0 5.3E-06 1.1E-10 51.5 3.1 31 14-44 89-119 (160)
17 cd05024 S-100A10 S-100A10: A s 98.0 8.7E-06 1.9E-10 46.4 3.8 29 16-44 47-75 (91)
18 KOG0044|consensus 97.9 1.6E-05 3.4E-10 50.6 3.8 32 13-44 60-91 (193)
19 cd05027 S-100B S-100B: S-100B 97.9 2.5E-05 5.4E-10 43.8 3.9 31 14-44 48-78 (88)
20 cd05031 S-100A10_like S-100A10 97.9 1.9E-05 4.1E-10 44.2 3.3 31 15-45 49-79 (94)
21 KOG0038|consensus 97.9 2E-05 4.4E-10 49.1 3.5 42 14-55 146-187 (189)
22 cd00052 EH Eps15 homology doma 97.8 3.1E-05 6.7E-10 39.8 3.4 30 15-44 31-60 (67)
23 cd05025 S-100A1 S-100A1: S-100 97.8 2.6E-05 5.6E-10 43.4 3.3 30 15-44 50-79 (92)
24 PTZ00183 centrin; Provisional 97.8 3.1E-05 6.7E-10 45.4 3.3 30 15-44 88-117 (158)
25 cd00051 EFh EF-hand, calcium b 97.7 7.8E-05 1.7E-09 36.4 3.6 29 15-43 34-62 (63)
26 KOG0027|consensus 97.7 5.8E-05 1.3E-09 45.2 3.3 31 15-45 83-113 (151)
27 cd00213 S-100 S-100: S-100 dom 97.6 9E-05 1.9E-09 40.6 3.4 31 15-45 49-79 (88)
28 smart00027 EH Eps15 homology d 97.5 0.00015 3.3E-09 40.5 3.5 29 16-44 43-71 (96)
29 PTZ00184 calmodulin; Provision 97.5 0.00019 4.1E-09 41.4 3.6 29 15-43 82-110 (149)
30 cd00052 EH Eps15 homology doma 97.5 0.00014 3.1E-09 37.2 2.7 26 20-45 2-27 (67)
31 COG5126 FRQ1 Ca2+-binding prot 97.4 0.0002 4.3E-09 44.5 3.7 35 14-48 125-159 (160)
32 smart00027 EH Eps15 homology d 97.4 0.00018 3.8E-09 40.3 3.1 30 16-45 9-38 (96)
33 PF14788 EF-hand_10: EF hand; 97.4 0.00031 6.7E-09 36.3 3.4 31 14-44 18-48 (51)
34 cd00213 S-100 S-100: S-100 dom 97.4 0.00029 6.4E-09 38.5 3.5 29 17-45 8-38 (88)
35 cd00051 EFh EF-hand, calcium b 97.2 0.00049 1.1E-08 33.5 3.2 27 19-45 2-28 (63)
36 KOG0027|consensus 97.2 0.00042 9.2E-09 41.4 3.3 32 14-45 41-72 (151)
37 PTZ00183 centrin; Provisional 97.2 0.00094 2E-08 39.0 4.5 30 15-44 15-44 (158)
38 KOG4065|consensus 97.2 0.00045 9.6E-09 41.6 3.0 33 10-42 110-142 (144)
39 cd05025 S-100A1 S-100A1: S-100 97.2 0.00056 1.2E-08 37.9 3.3 32 14-45 6-39 (92)
40 PRK12309 transaldolase/EF-hand 97.1 0.00054 1.2E-08 47.5 3.3 25 20-44 360-384 (391)
41 PF12763 EF-hand_4: Cytoskelet 97.1 0.00064 1.4E-08 39.4 2.9 31 14-44 40-70 (104)
42 cd05026 S-100Z S-100Z: S-100Z 97.0 0.00082 1.8E-08 37.7 2.9 29 17-45 10-40 (93)
43 PTZ00184 calmodulin; Provision 97.0 0.0012 2.7E-08 37.9 3.6 29 16-44 46-74 (149)
44 cd05031 S-100A10_like S-100A10 97.0 0.0013 2.9E-08 36.6 3.4 30 15-44 6-37 (94)
45 cd00252 SPARC_EC SPARC_EC; ext 96.9 0.0018 3.8E-08 38.1 3.8 28 16-43 47-74 (116)
46 cd05027 S-100B S-100B: S-100B 96.8 0.0012 2.7E-08 36.8 2.6 29 17-45 8-38 (88)
47 KOG4223|consensus 96.8 0.0018 3.9E-08 44.1 3.5 33 13-45 196-228 (325)
48 cd05022 S-100A13 S-100A13: S-1 96.8 0.0016 3.5E-08 36.6 2.7 29 17-45 8-37 (89)
49 PLN02964 phosphatidylserine de 96.6 0.003 6.4E-08 46.3 3.7 25 19-43 181-205 (644)
50 KOG0028|consensus 96.6 0.0043 9.4E-08 38.9 3.9 39 15-53 104-148 (172)
51 KOG0034|consensus 96.5 0.0047 1E-07 39.1 3.8 31 14-44 101-131 (187)
52 cd05023 S-100A11 S-100A11: S-1 96.3 0.0098 2.1E-07 33.3 4.1 31 16-46 8-40 (89)
53 PLN02964 phosphatidylserine de 96.1 0.0065 1.4E-07 44.6 3.4 27 19-45 217-243 (644)
54 KOG0041|consensus 96.0 0.0078 1.7E-07 39.3 2.9 28 18-45 100-127 (244)
55 PF10591 SPARC_Ca_bdg: Secrete 95.9 0.0063 1.4E-07 35.5 2.1 27 15-41 86-112 (113)
56 PRK12309 transaldolase/EF-hand 95.7 0.013 2.7E-07 40.8 3.2 27 14-40 331-357 (391)
57 KOG2643|consensus 95.6 0.0022 4.8E-08 45.4 -0.8 28 18-45 426-453 (489)
58 cd05029 S-100A6 S-100A6: S-100 95.5 0.015 3.3E-07 32.4 2.7 26 19-44 12-39 (88)
59 PF14658 EF-hand_9: EF-hand do 95.5 0.028 6E-07 30.4 3.4 31 15-45 33-64 (66)
60 KOG0037|consensus 95.5 0.025 5.5E-07 36.8 3.7 33 21-53 128-165 (221)
61 KOG0028|consensus 95.4 0.028 6.1E-07 35.3 3.8 34 13-46 138-171 (172)
62 KOG0377|consensus 95.1 0.023 4.9E-07 40.9 3.0 26 19-44 549-574 (631)
63 KOG0036|consensus 94.5 0.047 1E-06 38.7 3.2 41 13-53 47-88 (463)
64 KOG4223|consensus 94.4 0.058 1.3E-06 36.9 3.4 34 8-41 268-301 (325)
65 KOG2643|consensus 94.1 0.011 2.4E-07 42.0 -0.5 25 19-43 235-259 (489)
66 KOG0036|consensus 93.8 0.081 1.8E-06 37.6 3.3 34 14-47 115-148 (463)
67 KOG0030|consensus 93.6 0.16 3.5E-06 31.4 3.9 29 16-44 10-38 (152)
68 KOG1955|consensus 92.6 0.13 2.8E-06 37.6 2.9 31 14-44 262-292 (737)
69 KOG0037|consensus 92.5 0.2 4.3E-06 32.7 3.4 39 15-53 158-196 (221)
70 KOG0377|consensus 92.3 0.2 4.4E-06 36.3 3.5 30 15-44 585-614 (631)
71 KOG0041|consensus 92.1 0.14 2.9E-06 33.6 2.3 27 18-44 136-162 (244)
72 KOG4251|consensus 91.9 0.088 1.9E-06 35.6 1.3 20 22-41 145-164 (362)
73 KOG4578|consensus 91.6 0.16 3.4E-06 35.4 2.3 32 14-45 367-398 (421)
74 KOG0031|consensus 90.7 0.42 9.1E-06 30.0 3.3 28 17-44 32-59 (171)
75 KOG0046|consensus 90.6 0.4 8.8E-06 35.1 3.6 30 14-43 54-83 (627)
76 PF10591 SPARC_Ca_bdg: Secrete 90.5 0.098 2.1E-06 30.5 0.4 29 16-44 53-81 (113)
77 KOG4251|consensus 90.4 0.48 1E-05 32.2 3.6 43 3-45 84-129 (362)
78 KOG0031|consensus 90.3 0.58 1.3E-05 29.4 3.7 33 13-45 97-129 (171)
79 KOG4347|consensus 90.0 0.44 9.5E-06 35.4 3.4 32 14-45 552-583 (671)
80 KOG4004|consensus 89.7 0.26 5.7E-06 32.3 2.0 29 14-42 219-247 (259)
81 cd05030 calgranulins Calgranul 89.5 0.65 1.4E-05 25.5 3.2 28 17-44 8-37 (88)
82 PF05042 Caleosin: Caleosin re 89.1 0.45 9.7E-06 30.1 2.6 35 16-50 95-129 (174)
83 KOG1029|consensus 86.7 0.55 1.2E-05 36.1 2.2 30 14-43 226-255 (1118)
84 KOG4666|consensus 86.7 0.6 1.3E-05 32.6 2.3 35 19-53 333-367 (412)
85 KOG0751|consensus 84.5 1.4 2.9E-05 32.5 3.3 30 16-45 107-136 (694)
86 KOG0038|consensus 84.0 1.4 3.1E-05 27.7 2.9 26 19-44 73-98 (189)
87 PF14658 EF-hand_9: EF-hand do 83.8 0.68 1.5E-05 24.9 1.2 24 21-44 2-25 (66)
88 KOG1707|consensus 83.5 1.6 3.4E-05 32.4 3.2 32 14-45 312-343 (625)
89 PF09279 EF-hand_like: Phospho 82.2 2.6 5.7E-05 22.5 3.2 26 19-45 2-27 (83)
90 PF05517 p25-alpha: p25-alpha 81.9 1.8 3.8E-05 26.4 2.7 31 14-44 38-68 (154)
91 PF08976 DUF1880: Domain of un 80.2 2.3 5E-05 25.4 2.6 32 15-46 5-36 (118)
92 PF00404 Dockerin_1: Dockerin 79.6 2 4.2E-05 18.2 1.7 15 27-41 1-15 (21)
93 KOG0040|consensus 79.3 3 6.6E-05 34.5 3.7 29 17-45 2253-2281(2399)
94 KOG3866|consensus 78.4 1.5 3.2E-05 30.6 1.7 25 20-44 299-323 (442)
95 cd02977 ArsC_family Arsenate R 75.6 2 4.4E-05 23.9 1.5 31 21-51 61-91 (105)
96 PF12174 RST: RCD1-SRO-TAF4 (R 74.9 1.8 4E-05 23.4 1.1 29 17-45 25-53 (70)
97 PF09373 PMBR: Pseudomurein-bi 74.1 2.5 5.3E-05 19.4 1.3 16 30-45 1-16 (33)
98 KOG0040|consensus 73.7 4.2 9.1E-05 33.8 3.2 34 12-45 2291-2324(2399)
99 KOG0030|consensus 72.0 4.7 0.0001 25.0 2.5 30 14-44 86-115 (152)
100 KOG1029|consensus 65.4 7.7 0.00017 30.2 3.0 30 15-44 47-76 (1118)
101 PF00427 PBS_linker_poly: Phyc 64.6 9.6 0.00021 23.0 2.8 22 30-51 41-62 (131)
102 KOG1954|consensus 62.4 7.2 0.00016 28.1 2.2 28 14-41 474-501 (532)
103 KOG2562|consensus 62.3 10 0.00022 27.5 3.0 29 15-43 309-341 (493)
104 KOG4666|consensus 61.8 6.9 0.00015 27.5 2.0 28 17-44 259-286 (412)
105 KOG4065|consensus 60.5 14 0.00031 22.4 3.0 34 11-44 61-94 (144)
106 KOG3555|consensus 60.2 7 0.00015 27.7 1.9 28 15-42 248-275 (434)
107 cd02518 GT2_SpsF SpsF is a gly 59.7 25 0.00055 21.8 4.2 41 11-52 188-230 (233)
108 cd03035 ArsC_Yffb Arsenate Red 59.4 4.9 0.00011 22.8 0.9 29 23-51 61-89 (105)
109 cd03032 ArsC_Spx Arsenate Redu 58.5 11 0.00023 21.5 2.2 20 32-51 72-91 (115)
110 COG5562 Phage envelope protein 56.8 8.1 0.00018 23.6 1.5 24 22-45 77-100 (137)
111 PF08726 EFhand_Ca_insen: Ca2+ 56.6 19 0.00041 19.3 2.8 32 16-48 5-36 (69)
112 KOG0751|consensus 56.6 14 0.00029 27.6 2.8 28 23-50 80-109 (694)
113 PF09068 EF-hand_2: EF hand; 55.8 22 0.00047 21.0 3.3 30 14-43 94-123 (127)
114 PF13608 Potyvirid-P3: Protein 55.7 13 0.00029 26.4 2.7 38 16-54 288-326 (445)
115 KOG2871|consensus 55.6 22 0.00047 25.5 3.6 43 12-54 304-352 (449)
116 PF03960 ArsC: ArsC family; I 55.1 4.4 9.6E-05 22.8 0.2 21 31-51 68-88 (110)
117 PF01023 S_100: S-100/ICaBP ty 53.6 19 0.00041 17.5 2.3 27 19-45 8-36 (44)
118 PF09069 EF-hand_3: EF-hand; 51.3 23 0.0005 20.0 2.7 34 15-50 47-80 (90)
119 KOG3555|consensus 50.3 14 0.0003 26.2 2.0 30 16-45 281-310 (434)
120 cd03034 ArsC_ArsC Arsenate Red 49.8 17 0.00036 20.7 2.1 28 23-51 63-90 (112)
121 KOG2243|consensus 49.3 13 0.00029 31.5 2.0 26 21-46 4061-4086(5019)
122 KOG0169|consensus 48.5 24 0.00052 27.0 3.1 32 13-44 132-163 (746)
123 PF11333 DUF3135: Protein of u 46.6 23 0.0005 19.7 2.2 21 34-54 4-24 (83)
124 PRK13696 hypothetical protein; 46.5 22 0.00047 18.9 2.0 33 21-54 11-48 (62)
125 PF09454 Vps23_core: Vps23 cor 46.5 18 0.00038 19.1 1.7 16 30-45 36-51 (65)
126 PF08044 DUF1707: Domain of un 46.3 19 0.00042 18.2 1.8 14 30-43 20-33 (53)
127 cd08330 CARD_ASC_NALP1 Caspase 46.3 44 0.00095 18.2 3.3 39 13-53 42-81 (82)
128 PRK01655 spxA transcriptional 46.1 16 0.00035 21.4 1.6 28 23-51 64-91 (131)
129 PRK13344 spxA transcriptional 46.1 15 0.00032 21.7 1.5 29 22-51 63-91 (132)
130 PF09851 SHOCT: Short C-termin 44.9 19 0.00041 16.1 1.5 13 31-43 14-26 (31)
131 PRK08032 fliD flagellar cappin 44.3 15 0.00033 25.9 1.6 27 27-53 347-376 (462)
132 PRK12559 transcriptional regul 43.3 17 0.00038 21.4 1.5 30 21-51 62-91 (131)
133 PF06226 DUF1007: Protein of u 43.3 24 0.00053 22.3 2.2 24 21-44 54-77 (212)
134 PRK10026 arsenate reductase; P 43.0 16 0.00036 22.1 1.4 30 21-51 64-93 (141)
135 TIGR00014 arsC arsenate reduct 42.8 20 0.00044 20.4 1.7 20 32-51 72-91 (114)
136 PF07765 KIP1: KIP1-like prote 42.0 54 0.0012 18.0 3.2 39 14-54 22-60 (74)
137 PF13591 MerR_2: MerR HTH fami 41.1 37 0.00079 18.4 2.5 20 34-53 1-20 (84)
138 TIGR01617 arsC_related transcr 41.1 14 0.0003 21.0 0.8 20 32-51 73-92 (117)
139 PF07492 Trehalase_Ca-bi: Neut 40.5 6.5 0.00014 18.1 -0.5 18 20-37 2-19 (30)
140 smart00763 AAA_PrkA PrkA AAA d 40.1 27 0.00059 24.3 2.2 18 34-51 1-18 (361)
141 PF13220 DUF4028: Protein of u 39.8 53 0.0011 17.2 2.8 36 13-48 20-56 (65)
142 PRK06798 fliD flagellar cappin 39.7 19 0.0004 25.5 1.4 27 27-53 324-352 (440)
143 KOG0046|consensus 39.7 36 0.00077 25.5 2.8 27 18-45 20-46 (627)
144 KOG4403|consensus 39.5 23 0.0005 25.9 1.8 31 15-45 66-96 (575)
145 PF05383 La: La domain; Inter 39.2 34 0.00074 17.7 2.1 20 21-40 19-38 (61)
146 TIGR03296 M6dom_TIGR03296 M6 f 38.6 14 0.0003 24.5 0.6 16 23-38 92-107 (286)
147 PF07862 Nif11: Nitrogen fixat 37.3 40 0.00087 16.2 2.1 19 36-54 6-24 (49)
148 PF05901 Excalibur: Excalibur 36.7 19 0.0004 16.9 0.7 9 25-33 26-34 (37)
149 PF08298 AAA_PrkA: PrkA AAA do 35.9 30 0.00065 24.2 1.9 18 34-51 1-18 (358)
150 KOG2562|consensus 34.3 47 0.001 24.3 2.7 28 19-46 141-168 (493)
151 cd00038 CAP_ED effector domain 34.3 35 0.00075 17.5 1.7 21 33-53 94-114 (115)
152 PF07531 TAFH: NHR1 homology t 33.8 35 0.00076 19.6 1.7 15 31-45 38-52 (96)
153 PF10891 DUF2719: Protein of u 33.5 27 0.00058 19.5 1.1 15 31-45 33-47 (81)
154 cd03036 ArsC_like Arsenate Red 33.4 26 0.00056 19.8 1.1 29 23-51 63-92 (111)
155 cd03033 ArsC_15kD Arsenate Red 32.4 35 0.00076 19.6 1.6 29 22-51 61-89 (113)
156 TIGR01616 nitro_assoc nitrogen 32.2 32 0.00069 20.2 1.4 29 22-51 62-90 (126)
157 TIGR03798 ocin_TIGR03798 bacte 32.1 55 0.0012 16.8 2.2 20 35-54 3-22 (64)
158 COG1565 Uncharacterized conser 32.0 34 0.00073 24.1 1.7 19 29-47 13-31 (370)
159 PF10376 Mei5: Double-strand r 31.9 79 0.0017 20.5 3.3 35 16-50 184-218 (221)
160 cd08029 LA_like_fungal La-moti 31.4 83 0.0018 17.0 2.9 33 22-54 23-60 (76)
161 PRK07077 hypothetical protein; 30.6 67 0.0014 21.0 2.8 25 29-53 182-206 (238)
162 TIGR02675 tape_meas_nterm tape 29.9 63 0.0014 17.2 2.2 25 30-54 27-51 (75)
163 PRK07737 fliD flagellar cappin 29.9 38 0.00083 24.3 1.7 24 30-53 375-400 (501)
164 PF14071 YlbD_coat: Putative c 28.7 31 0.00068 20.7 0.9 17 37-53 1-17 (124)
165 PF03705 CheR_N: CheR methyltr 28.7 36 0.00078 16.6 1.1 17 32-48 40-56 (57)
166 PRK15455 PrkA family serine pr 28.6 83 0.0018 23.8 3.3 31 21-51 6-36 (644)
167 PF11363 DUF3164: Protein of u 28.4 1.6E+02 0.0035 18.7 4.4 38 17-54 119-159 (195)
168 PRK06664 fliD flagellar hook-a 27.6 44 0.00095 25.1 1.7 25 29-53 539-565 (661)
169 PLN02230 phosphoinositide phos 27.5 1.1E+02 0.0024 22.9 3.7 33 12-45 24-56 (598)
170 PF04380 BMFP: Membrane fusoge 26.6 62 0.0013 17.5 1.9 23 14-41 32-54 (79)
171 PRK12765 flagellar capping pro 26.4 39 0.00085 24.9 1.3 26 28-53 370-397 (595)
172 PF12631 GTPase_Cys_C: Catalyt 26.3 50 0.0011 17.3 1.4 15 12-26 58-72 (73)
173 PF08100 Dimerisation: Dimeris 25.5 36 0.00078 17.1 0.7 27 21-47 10-37 (51)
174 KOG0042|consensus 25.4 61 0.0013 24.5 2.1 30 15-44 627-656 (680)
175 smart00549 TAFH TAF homology. 25.1 54 0.0012 18.8 1.5 27 16-45 25-51 (92)
176 KOG1265|consensus 25.0 1.1E+02 0.0024 24.6 3.4 37 17-53 221-260 (1189)
177 PF07879 PHB_acc_N: PHB/PHA ac 24.8 40 0.00087 18.0 0.8 23 24-46 10-32 (64)
178 PF00027 cNMP_binding: Cyclic 24.8 44 0.00095 16.8 1.0 16 33-48 76-91 (91)
179 PF04876 Tenui_NCP: Tenuivirus 24.8 1.2E+02 0.0026 19.1 3.1 31 15-45 81-111 (175)
180 KOG0998|consensus 24.8 37 0.00081 26.1 1.0 30 16-45 282-311 (847)
181 PF09693 Phage_XkdX: Phage unc 24.5 67 0.0014 15.2 1.5 13 31-43 24-36 (40)
182 PF04651 Pox_A12: Poxvirus A12 24.1 72 0.0016 20.5 2.0 21 33-53 8-29 (189)
183 PF06480 FtsH_ext: FtsH Extrac 23.8 47 0.001 17.6 1.1 20 27-46 23-42 (110)
184 CHL00091 apcE phycobillisome l 23.4 82 0.0018 24.7 2.5 23 30-52 552-574 (877)
185 PF01839 FG-GAP: FG-GAP repeat 23.4 43 0.00094 15.3 0.7 8 26-33 8-15 (34)
186 PF09862 DUF2089: Protein of u 23.1 1.3E+02 0.0029 17.6 2.9 22 18-42 90-111 (113)
187 PF09012 FeoC: FeoC like trans 23.0 1E+02 0.0022 15.8 2.2 24 30-53 11-34 (69)
188 PF14410 GH-E: HNH/ENDO VII su 22.7 69 0.0015 17.0 1.5 30 20-50 27-56 (70)
189 cd06403 PB1_Par6 The PB1 domai 22.5 32 0.00068 19.2 0.2 23 23-45 11-33 (80)
190 COG1102 Cmk Cytidylate kinase 22.5 70 0.0015 20.4 1.7 31 21-53 33-63 (179)
191 COG2910 Putative NADH-flavin r 22.4 68 0.0015 21.0 1.7 20 25-44 175-194 (211)
192 PRK08724 fliD flagellar cappin 22.3 58 0.0013 24.7 1.5 27 27-53 562-590 (673)
193 PF09912 DUF2141: Uncharacteri 22.3 20 0.00044 20.6 -0.7 13 26-38 61-73 (112)
194 cd08032 LARP_7 La RNA-binding 22.3 77 0.0017 17.5 1.7 33 21-53 27-65 (82)
195 PLN02952 phosphoinositide phos 22.2 1.4E+02 0.0031 22.3 3.5 32 13-45 34-65 (599)
196 CHL00091 apcE phycobillisome l 22.2 93 0.002 24.5 2.6 22 30-51 304-325 (877)
197 PF03986 Autophagy_N: Autophag 22.0 59 0.0013 19.9 1.3 14 30-43 24-37 (145)
198 PF09066 B2-adapt-app_C: Beta2 22.0 58 0.0012 18.2 1.2 18 30-47 3-20 (114)
199 PF04433 SWIRM: SWIRM domain; 22.0 91 0.002 16.7 2.0 32 20-51 40-72 (86)
200 TIGR03697 NtcA_cyano global ni 21.4 92 0.002 18.3 2.1 21 33-53 72-92 (193)
201 COG1104 NifS Cysteine sulfinat 21.4 77 0.0017 22.4 1.9 21 26-46 121-141 (386)
202 KOG4070|consensus 21.2 39 0.00084 21.4 0.4 25 19-43 59-83 (180)
203 PRK11753 DNA-binding transcrip 21.2 97 0.0021 18.6 2.2 21 33-53 98-118 (211)
204 PF09280 XPC-binding: XPC-bind 20.6 1E+02 0.0022 15.9 1.9 16 36-51 10-25 (59)
205 PF10757 YbaJ: Biofilm formati 20.3 2.1E+02 0.0046 17.1 3.8 39 14-55 75-113 (122)
206 PRK14734 coaE dephospho-CoA ki 20.2 2.2E+02 0.0047 17.6 3.6 36 19-54 49-84 (200)
207 TIGR00984 3a0801s03tim44 mitoc 20.0 2.4E+02 0.0052 20.0 4.1 37 14-54 225-263 (378)
No 1
>KOG0044|consensus
Probab=99.25 E-value=1.5e-11 Score=77.87 Aligned_cols=54 Identities=63% Similarity=0.981 Sum_probs=47.7
Q ss_pred CCCCCCCCCCCCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhccC
Q psy15330 2 VGTVMKMPEDESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQCD 56 (60)
Q Consensus 2 ~g~~~~~~~~~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~~~ 56 (60)
+|+...+ .++.+++++++++|+++|.|+||.||++||+.+++++|.|+++|+..
T Consensus 133 ~~~~~~~-~~~~~~~~~v~~if~k~D~n~Dg~lT~eef~~~~~~d~~i~~~l~~~ 186 (193)
T KOG0044|consen 133 TGSKALP-EDEETPEERVDKIFSKMDKNKDGKLTLEEFIEGCKADPSILRALEQD 186 (193)
T ss_pred cccccCC-cccccHHHHHHHHHHHcCCCCCCcccHHHHHHHhhhCHHHHHHhhhc
Confidence 4554344 67889999999999999999999999999999999999999999764
No 2
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=99.13 E-value=5e-11 Score=55.13 Aligned_cols=26 Identities=42% Similarity=0.740 Sum_probs=24.4
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
++++|+.+|+|+||+||++||+.+++
T Consensus 2 ~~~~F~~~D~d~dG~I~~~Ef~~~~~ 27 (29)
T PF00036_consen 2 LKEAFREFDKDGDGKIDFEEFKEMMK 27 (29)
T ss_dssp HHHHHHHHSTTSSSEEEHHHHHHHHH
T ss_pred HHHHHHHHCCCCCCcCCHHHHHHHHH
Confidence 67899999999999999999999886
No 3
>KOG0034|consensus
Probab=98.83 E-value=3.5e-09 Score=66.79 Aligned_cols=45 Identities=27% Similarity=0.525 Sum_probs=41.2
Q ss_pred CCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhccC
Q psy15330 12 ESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQCD 56 (60)
Q Consensus 12 ~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~~~ 56 (60)
+...+.+++++|.++|.|+||+|||+||.+.+.+.|.+++.|++.
T Consensus 142 ~e~~~~i~d~t~~e~D~d~DG~IsfeEf~~~v~~~P~~~~~m~~~ 186 (187)
T KOG0034|consen 142 DEQLEDIVDKTFEEADTDGDGKISFEEFCKVVEKQPDLLEKMTIR 186 (187)
T ss_pred HHHHHHHHHHHHHHhCCCCCCcCcHHHHHHHHHcCccHHHHcCCC
Confidence 456778899999999999999999999999999999999998753
No 4
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=98.81 E-value=5.5e-09 Score=46.68 Aligned_cols=25 Identities=36% Similarity=0.628 Sum_probs=22.2
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
++.+|+.+|+|+||.||++||.+.+
T Consensus 1 l~~~F~~~D~d~DG~is~~E~~~~~ 25 (25)
T PF13202_consen 1 LKDAFQQFDTDGDGKISFEEFQRLV 25 (25)
T ss_dssp HHHHHHHHTTTSSSEEEHHHHHHHH
T ss_pred CHHHHHHHcCCCCCcCCHHHHHHHC
Confidence 3679999999999999999998753
No 5
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=98.59 E-value=5.8e-08 Score=44.76 Aligned_cols=28 Identities=29% Similarity=0.479 Sum_probs=24.1
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+++.+|+.+|.|+||.|+++||..++++
T Consensus 1 ~l~~~F~~~D~d~dG~I~~~el~~~l~~ 28 (31)
T PF13405_consen 1 RLREAFKMFDKDGDGFIDFEELRAILRK 28 (31)
T ss_dssp HHHHHHHHH-TTSSSEEEHHHHHHHHHH
T ss_pred CHHHHHHHHCCCCCCcCcHHHHHHHHHH
Confidence 3578999999999999999999998863
No 6
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=98.54 E-value=1e-07 Score=48.23 Aligned_cols=31 Identities=35% Similarity=0.718 Sum_probs=27.3
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..++.++.+|+.+|.|+||.|+|+||+..+.
T Consensus 22 ~s~~e~~~l~~~~D~~~~G~I~~~EF~~~~~ 52 (54)
T PF13833_consen 22 LSEEEVDRLFREFDTDGDGYISFDEFISMMQ 52 (54)
T ss_dssp SCHHHHHHHHHHHTTSSSSSEEHHHHHHHHH
T ss_pred CCHHHHHHHHHhcccCCCCCCCHHHHHHHHH
Confidence 3455699999999999999999999999875
No 7
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=98.49 E-value=2.4e-07 Score=48.29 Aligned_cols=31 Identities=35% Similarity=0.725 Sum_probs=27.0
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
...++.++.+|+.+|+|+||.||++||...+
T Consensus 36 ~~~~~~~~~~~~~~D~d~dG~i~~~Ef~~~~ 66 (66)
T PF13499_consen 36 EESDEMIDQIFREFDTDGDGRISFDEFLNFM 66 (66)
T ss_dssp HHHHHHHHHHHHHHTTTSSSSEEHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCCCcCCCcHHHHhccC
Confidence 4566788899999999999999999999764
No 8
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=98.15 E-value=3.3e-06 Score=47.65 Aligned_cols=29 Identities=34% Similarity=0.506 Sum_probs=26.3
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+.++.+|+.+|.|+||.|+|+||+..+.+
T Consensus 47 ~~v~~mi~~~D~d~DG~I~F~EF~~l~~~ 75 (89)
T cd05022 47 EGLEEKMKNLDVNQDSKLSFEEFWELIGE 75 (89)
T ss_pred HHHHHHHHHhCCCCCCCCcHHHHHHHHHH
Confidence 67899999999999999999999987753
No 9
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=98.13 E-value=3.7e-06 Score=49.54 Aligned_cols=30 Identities=27% Similarity=0.392 Sum_probs=23.4
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+..+..+|+.+|.|+||.||++||...+.+
T Consensus 79 e~~~~~f~~~~D~n~Dg~IS~~Ef~~cl~~ 108 (116)
T cd00252 79 EHCIKPFFESCDLDKDGSISLDEWCYCFIK 108 (116)
T ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHhC
Confidence 456677888888888888888888887744
No 10
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=98.13 E-value=3.6e-06 Score=47.40 Aligned_cols=30 Identities=27% Similarity=0.452 Sum_probs=27.0
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+..++++++.+|.|+||.|+|+||+..+.
T Consensus 51 ~~~~v~~i~~elD~n~dG~Idf~EF~~l~~ 80 (93)
T cd05026 51 DPMLVDKIMNDLDSNKDNEVDFNEFVVLVA 80 (93)
T ss_pred CHHHHHHHHHHhCCCCCCCCCHHHHHHHHH
Confidence 456789999999999999999999998875
No 11
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=98.12 E-value=3.1e-06 Score=44.02 Aligned_cols=28 Identities=36% Similarity=0.573 Sum_probs=25.1
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
++.++|+.+|+|+||.|+.+||...+..
T Consensus 1 ~l~~~F~~~D~d~~G~i~~~el~~~~~~ 28 (66)
T PF13499_consen 1 RLKEAFKKFDKDGDGYISKEELRRALKH 28 (66)
T ss_dssp HHHHHHHHHSTTSSSEEEHHHHHHHHHH
T ss_pred CHHHHHHHHcCCccCCCCHHHHHHHHHH
Confidence 3678999999999999999999987764
No 12
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=98.08 E-value=6e-06 Score=46.44 Aligned_cols=30 Identities=27% Similarity=0.540 Sum_probs=26.5
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+..++++|+.+|.|+||.|+|+||+..+.
T Consensus 50 ~~~~~~~ll~~~D~d~DG~I~f~EF~~l~~ 79 (89)
T cd05023 50 DPGVLDRMMKKLDLNSDGQLDFQEFLNLIG 79 (89)
T ss_pred CHHHHHHHHHHcCCCCCCcCcHHHHHHHHH
Confidence 346789999999999999999999998764
No 13
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=98.08 E-value=6e-06 Score=35.07 Aligned_cols=26 Identities=38% Similarity=0.668 Sum_probs=23.7
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+.++|+.+|.|++|.|++.||...++
T Consensus 2 ~~~~f~~~d~~~~g~i~~~e~~~~~~ 27 (29)
T smart00054 2 LKEAFRLFDKDGDGKIDFEEFKDLLK 27 (29)
T ss_pred HHHHHHHHCCCCCCcEeHHHHHHHHH
Confidence 46899999999999999999998875
No 14
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=98.06 E-value=6.4e-06 Score=46.19 Aligned_cols=32 Identities=16% Similarity=0.413 Sum_probs=28.0
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
...++.++++|+.+|.|+||.|+|+||+..+.
T Consensus 47 k~t~~ev~~m~~~~D~d~dG~Idf~EFv~lm~ 78 (88)
T cd05029 47 KLQDAEIAKLMEDLDRNKDQEVNFQEYVTFLG 78 (88)
T ss_pred CCCHHHHHHHHHHhcCCCCCCCcHHHHHHHHH
Confidence 34667899999999999999999999998764
No 15
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=98.06 E-value=5.8e-06 Score=46.11 Aligned_cols=31 Identities=48% Similarity=0.719 Sum_probs=28.0
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..+..++.+|+.+|.|+||.|+|+||+..+.
T Consensus 48 ~~~~~v~~i~~~~D~d~dG~I~f~eF~~~~~ 78 (88)
T cd05030 48 KNQKAIDKIFEDLDTNQDGQLSFEEFLVLVI 78 (88)
T ss_pred CCHHHHHHHHHHcCCCCCCcCcHHHHHHHHH
Confidence 3478899999999999999999999998875
No 16
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=98.03 E-value=5.3e-06 Score=51.47 Aligned_cols=31 Identities=29% Similarity=0.465 Sum_probs=17.4
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+++.+...|+.||+|+||+||..|+...++
T Consensus 89 ~~~Eel~~aF~~fD~d~dG~Is~~eL~~vl~ 119 (160)
T COG5126 89 DKEEELREAFKLFDKDHDGYISIGELRRVLK 119 (160)
T ss_pred CcHHHHHHHHHHhCCCCCceecHHHHHHHHH
Confidence 3445555555666666666666555555544
No 17
>cd05024 S-100A10 S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=98.03 E-value=8.7e-06 Score=46.43 Aligned_cols=29 Identities=24% Similarity=0.518 Sum_probs=26.0
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+..++++|+.+|.|+||.|+|.||+..+.
T Consensus 47 ~~~vd~im~~LD~n~Dg~vdF~EF~~Lv~ 75 (91)
T cd05024 47 PMAVDKIMKDLDDCRDGKVGFQSFFSLIA 75 (91)
T ss_pred HHHHHHHHHHhCCCCCCcCcHHHHHHHHH
Confidence 45789999999999999999999998664
No 18
>KOG0044|consensus
Probab=97.91 E-value=1.6e-05 Score=50.60 Aligned_cols=32 Identities=34% Similarity=0.602 Sum_probs=29.4
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.++..+++.+|+.+|+|+||.|+|.||+.+++
T Consensus 60 gd~~~y~~~vF~~fD~~~dg~i~F~Efi~als 91 (193)
T KOG0044|consen 60 GDASKYAELVFRTFDKNKDGTIDFLEFICALS 91 (193)
T ss_pred CCHHHHHHHHHHHhcccCCCCcCHHHHHHHHH
Confidence 57889999999999999999999999998775
No 19
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=97.88 E-value=2.5e-05 Score=43.75 Aligned_cols=31 Identities=23% Similarity=0.477 Sum_probs=27.0
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..++.++++++.+|.|+||.|+|+||+..+.
T Consensus 48 ~~~~~v~~~i~~~D~n~dG~v~f~eF~~li~ 78 (88)
T cd05027 48 KEQEVVDKVMETLDSDGDGECDFQEFMAFVA 78 (88)
T ss_pred CCHHHHHHHHHHhCCCCCCcCcHHHHHHHHH
Confidence 3456799999999999999999999998764
No 20
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=97.87 E-value=1.9e-05 Score=44.16 Aligned_cols=31 Identities=32% Similarity=0.647 Sum_probs=27.6
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.++.++.+|+.+|.|++|.|+|+||+..+..
T Consensus 49 s~~ei~~~~~~~D~~~dg~I~f~eF~~l~~~ 79 (94)
T cd05031 49 DPMAVDKIMKDLDQNRDGKVNFEEFVSLVAG 79 (94)
T ss_pred cHHHHHHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 4567899999999999999999999988753
No 21
>KOG0038|consensus
Probab=97.85 E-value=2e-05 Score=49.12 Aligned_cols=42 Identities=24% Similarity=0.376 Sum_probs=37.3
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhcc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQC 55 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~~ 55 (60)
..+..++++....|.||||+||+.||...+.+.|+.+..+.+
T Consensus 146 Ev~~i~ekvieEAD~DgDgkl~~~eFe~~i~raPDFlsTFHI 187 (189)
T KOG0038|consen 146 EVELICEKVIEEADLDGDGKLSFAEFEHVILRAPDFLSTFHI 187 (189)
T ss_pred HHHHHHHHHHHHhcCCCCCcccHHHHHHHHHhCcchHhhhee
Confidence 345568899999999999999999999999999999988764
No 22
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=97.82 E-value=3.1e-05 Score=39.79 Aligned_cols=30 Identities=30% Similarity=0.523 Sum_probs=25.9
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
++..++++|+.+|.+++|.|+++||+..+.
T Consensus 31 ~~~~~~~i~~~~d~~~~g~i~~~ef~~~~~ 60 (67)
T cd00052 31 PRSVLAQIWDLADTDKDGKLDKEEFAIAMH 60 (67)
T ss_pred CHHHHHHHHHHhcCCCCCcCCHHHHHHHHH
Confidence 455688999999999999999999998775
No 23
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=97.82 E-value=2.6e-05 Score=43.38 Aligned_cols=30 Identities=30% Similarity=0.594 Sum_probs=26.9
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+..++.+|+.+|.|++|.|+|+||+..+.
T Consensus 50 s~~~v~~i~~~~D~d~~G~I~f~eF~~l~~ 79 (92)
T cd05025 50 DADAVDKIMKELDENGDGEVDFQEFVVLVA 79 (92)
T ss_pred CHHHHHHHHHHHCCCCCCcCcHHHHHHHHH
Confidence 456789999999999999999999998775
No 24
>PTZ00183 centrin; Provisional
Probab=97.78 E-value=3.1e-05 Score=45.38 Aligned_cols=30 Identities=33% Similarity=0.567 Sum_probs=21.4
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+..+..+|+.+|+|++|.|+++||...+.
T Consensus 88 ~~~~l~~~F~~~D~~~~G~i~~~e~~~~l~ 117 (158)
T PTZ00183 88 PREEILKAFRLFDDDKTGKISLKNLKRVAK 117 (158)
T ss_pred cHHHHHHHHHHhCCCCCCcCcHHHHHHHHH
Confidence 345567777777777777777777776654
No 25
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=97.69 E-value=7.8e-05 Score=36.42 Aligned_cols=29 Identities=38% Similarity=0.762 Sum_probs=20.8
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
+.+.+..+|+.+|.+++|.|+++||...+
T Consensus 34 ~~~~~~~~~~~~~~~~~~~l~~~ef~~~~ 62 (63)
T cd00051 34 SEEEIDEMIREVDKDGDGKIDFEEFLELM 62 (63)
T ss_pred CHHHHHHHHHHhCCCCCCeEeHHHHHHHh
Confidence 44556677777887778888888777654
No 26
>KOG0027|consensus
Probab=97.66 E-value=5.8e-05 Score=45.21 Aligned_cols=31 Identities=29% Similarity=0.402 Sum_probs=27.6
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..+.+.++|+.||+|+||+||.+|+...+.+
T Consensus 83 ~~~el~eaF~~fD~d~~G~Is~~el~~~l~~ 113 (151)
T KOG0027|consen 83 SSEELKEAFRVFDKDGDGFISASELKKVLTS 113 (151)
T ss_pred cHHHHHHHHHHHccCCCCcCcHHHHHHHHHH
Confidence 4568899999999999999999999988764
No 27
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=97.61 E-value=9e-05 Score=40.62 Aligned_cols=31 Identities=29% Similarity=0.569 Sum_probs=27.6
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+..+..+|+.+|.|++|.|+|+||+..+..
T Consensus 49 ~~~ei~~i~~~~d~~~~g~I~f~eF~~~~~~ 79 (88)
T cd00213 49 DPEAVDKIMKDLDVNKDGKVDFQEFLVLIGK 79 (88)
T ss_pred CHHHHHHHHHHhccCCCCcCcHHHHHHHHHH
Confidence 4667899999999999999999999988764
No 28
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=97.51 E-value=0.00015 Score=40.51 Aligned_cols=29 Identities=28% Similarity=0.421 Sum_probs=19.1
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
++.++++|+.+|.+++|.|+|+||+..+.
T Consensus 43 ~~ev~~i~~~~d~~~~g~I~~~eF~~~~~ 71 (96)
T smart00027 43 QTLLAKIWNLADIDNDGELDKDEFALAMH 71 (96)
T ss_pred HHHHHHHHHHhcCCCCCCcCHHHHHHHHH
Confidence 34566677777777777777777776553
No 29
>PTZ00184 calmodulin; Provisional
Probab=97.47 E-value=0.00019 Score=41.36 Aligned_cols=29 Identities=21% Similarity=0.384 Sum_probs=21.5
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
.+..+..+|+.+|.|++|.|+.+||..++
T Consensus 82 ~~~~~~~~F~~~D~~~~g~i~~~e~~~~l 110 (149)
T PTZ00184 82 SEEEIKEAFKVFDRDGNGFISAAELRHVM 110 (149)
T ss_pred HHHHHHHHHHhhCCCCCCeEeHHHHHHHH
Confidence 44566777888888888888888887665
No 30
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=97.45 E-value=0.00014 Score=37.23 Aligned_cols=26 Identities=35% Similarity=0.432 Sum_probs=22.9
Q ss_pred HHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 20 DKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+++|+.+|+|++|.|+.+|+...+++
T Consensus 2 ~~~F~~~D~~~~G~i~~~el~~~l~~ 27 (67)
T cd00052 2 DQIFRSLDPDGDGLISGDEARPFLGK 27 (67)
T ss_pred hHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 46899999999999999999987643
No 31
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=97.44 E-value=0.0002 Score=44.47 Aligned_cols=35 Identities=26% Similarity=0.589 Sum_probs=27.8
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPS 48 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~ 48 (60)
.++++++.+++.+|+|+||+|+|+||++.+...|.
T Consensus 125 ~~deev~~ll~~~d~d~dG~i~~~eF~~~~~~~~~ 159 (160)
T COG5126 125 LSDEEVEKLLKEYDEDGDGEIDYEEFKKLIKDSPT 159 (160)
T ss_pred CCHHHHHHHHHhcCCCCCceEeHHHHHHHHhccCC
Confidence 35677888888888888888888888888776653
No 32
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=97.43 E-value=0.00018 Score=40.27 Aligned_cols=30 Identities=30% Similarity=0.423 Sum_probs=26.5
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
...+..+|+.+|.|++|.|+++|+...+++
T Consensus 9 ~~~l~~~F~~~D~d~~G~Is~~el~~~l~~ 38 (96)
T smart00027 9 KAKYEQIFRSLDKNQDGTVTGAQAKPILLK 38 (96)
T ss_pred HHHHHHHHHHhCCCCCCeEeHHHHHHHHHH
Confidence 356789999999999999999999998764
No 33
>PF14788 EF-hand_10: EF hand; PDB: 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B 1DJZ_B 1DJY_B 1DJX_B 1QAT_A 1DJH_A ....
Probab=97.38 E-value=0.00031 Score=36.29 Aligned_cols=31 Identities=39% Similarity=0.524 Sum_probs=25.9
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..+.+|..+|+..|++++|.+..+||.+.++
T Consensus 18 ~~~~yA~~LFq~~D~s~~g~Le~~Ef~~Fy~ 48 (51)
T PF14788_consen 18 MDDEYARQLFQECDKSQSGRLEGEEFEEFYK 48 (51)
T ss_dssp --HHHHHHHHHHH-SSSSSEBEHHHHHHHHH
T ss_pred cCHHHHHHHHHHhcccCCCCccHHHHHHHHH
Confidence 4567899999999999999999999998774
No 34
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=97.37 E-value=0.00029 Score=38.54 Aligned_cols=29 Identities=17% Similarity=0.228 Sum_probs=25.4
Q ss_pred HHHHHHHHhhCC--CCCCcccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMDK--NQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D~--d~dG~Is~~EF~~~~~~ 45 (60)
+.+.++|..+|+ |++|.|+++||...++.
T Consensus 8 ~~l~~~F~~~D~~~~~~G~Is~~el~~~l~~ 38 (88)
T cd00213 8 ETIIDVFHKYSGKEGDKDTLSKKELKELLET 38 (88)
T ss_pred HHHHHHHHHHhhccCCCCcCcHHHHHHHHHH
Confidence 447788999999 89999999999998754
No 35
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=97.24 E-value=0.00049 Score=33.46 Aligned_cols=27 Identities=37% Similarity=0.561 Sum_probs=24.2
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+..+|..+|.|++|.|+++||...++.
T Consensus 2 ~~~~f~~~d~~~~g~l~~~e~~~~l~~ 28 (63)
T cd00051 2 LREAFRLFDKDGDGTISADELKAALKS 28 (63)
T ss_pred HHHHHHHhCCCCCCcCcHHHHHHHHHH
Confidence 568899999999999999999988764
No 36
>KOG0027|consensus
Probab=97.23 E-value=0.00042 Score=41.45 Aligned_cols=32 Identities=22% Similarity=0.533 Sum_probs=28.0
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..+..+..+++.+|.|+||.|+++||...+.+
T Consensus 41 ~t~~el~~~~~~~D~dg~g~I~~~eF~~l~~~ 72 (151)
T KOG0027|consen 41 PTEEELRDLIKEIDLDGDGTIDFEEFLDLMEK 72 (151)
T ss_pred CCHHHHHHHHHHhCCCCCCeEcHHHHHHHHHh
Confidence 45678899999999999999999999988774
No 37
>PTZ00183 centrin; Provisional
Probab=97.19 E-value=0.00094 Score=39.01 Aligned_cols=30 Identities=17% Similarity=0.335 Sum_probs=20.8
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..+.+..+|..+|.|++|.|+++||...++
T Consensus 15 ~~~~~~~~F~~~D~~~~G~i~~~e~~~~l~ 44 (158)
T PTZ00183 15 QKKEIREAFDLFDTDGSGTIDPKELKVAMR 44 (158)
T ss_pred HHHHHHHHHHHhCCCCCCcccHHHHHHHHH
Confidence 334566777777777777777777766554
No 38
>KOG4065|consensus
Probab=97.19 E-value=0.00045 Score=41.65 Aligned_cols=33 Identities=27% Similarity=0.436 Sum_probs=28.2
Q ss_pred CCCCcHHHHHHHHHHhhCCCCCCcccHHHHHHH
Q psy15330 10 EDESTPEKRTDKIFRQMDKNQDGRLSLEEFIEG 42 (60)
Q Consensus 10 ~~~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~ 42 (60)
.++...+..++.+.+.-|.|+||.|+|.||.+.
T Consensus 110 ~sE~Ele~~iD~vL~DdDfN~DG~IDYgEflK~ 142 (144)
T KOG4065|consen 110 SSEAELERLIDAVLDDDDFNGDGVIDYGEFLKR 142 (144)
T ss_pred CCHHHHHHHHHHHhcccccCCCceeeHHHHHhh
Confidence 355667778888999999999999999999875
No 39
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=97.18 E-value=0.00056 Score=37.93 Aligned_cols=32 Identities=22% Similarity=0.267 Sum_probs=27.5
Q ss_pred cHHHHHHHHHHhhC-CCCCC-cccHHHHHHHHcC
Q psy15330 14 TPEKRTDKIFRQMD-KNQDG-RLSLEEFIEGAKS 45 (60)
Q Consensus 14 ~~~~~~~~if~~~D-~d~dG-~Is~~EF~~~~~~ 45 (60)
.+.+.+.++|+.|| .|++| .|+..|+...++.
T Consensus 6 ~~~~~l~~~F~~fDd~dg~G~~Is~~El~~~l~~ 39 (92)
T cd05025 6 TAMETLINVFHAHSGKEGDKYKLSKKELKDLLQT 39 (92)
T ss_pred HHHHHHHHHHHHHhcccCCCCeECHHHHHHHHHH
Confidence 45677889999997 99999 5999999998863
No 40
>PRK12309 transaldolase/EF-hand domain-containing protein; Provisional
Probab=97.11 E-value=0.00054 Score=47.45 Aligned_cols=25 Identities=36% Similarity=0.680 Sum_probs=18.1
Q ss_pred HHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 20 DKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+.+|+.+|.|+||.||++||..+++
T Consensus 360 ~~~F~~~D~d~DG~Is~eEf~~~~~ 384 (391)
T PRK12309 360 DAVFDALDLNHDGKITPEEMRAGLG 384 (391)
T ss_pred HHHHHHhCCCCCCCCcHHHHHHHHH
Confidence 6677777777777777777777664
No 41
>PF12763 EF-hand_4: Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=97.08 E-value=0.00064 Score=39.37 Aligned_cols=31 Identities=29% Similarity=0.515 Sum_probs=26.5
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+.+.+.+||...|.|+||+++++||.-+|.
T Consensus 40 L~~~~L~~IW~LaD~~~dG~L~~~EF~iAm~ 70 (104)
T PF12763_consen 40 LPRDVLAQIWNLADIDNDGKLDFEEFAIAMH 70 (104)
T ss_dssp SSHHHHHHHHHHH-SSSSSEEEHHHHHHHHH
T ss_pred CCHHHHHHHHhhhcCCCCCcCCHHHHHHHHH
Confidence 3557789999999999999999999998875
No 42
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=97.01 E-value=0.00082 Score=37.69 Aligned_cols=29 Identities=28% Similarity=0.302 Sum_probs=24.1
Q ss_pred HHHHHHHHhhC-CCCCC-cccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMD-KNQDG-RLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D-~d~dG-~Is~~EF~~~~~~ 45 (60)
..+.++|+.|| +|+|| +||.+|+...+++
T Consensus 10 ~~~~~~F~~~dd~dgdg~~Is~~EL~~ll~~ 40 (93)
T cd05026 10 DTLIRIFHNYSGKEGDRYKLSKGELKELLQR 40 (93)
T ss_pred HHHHHHHHHHHccCCCCCEECHHHHHHHHHH
Confidence 34567799999 79998 5999999998854
No 43
>PTZ00184 calmodulin; Provisional
Probab=96.99 E-value=0.0012 Score=37.93 Aligned_cols=29 Identities=17% Similarity=0.432 Sum_probs=22.7
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
...+..+|+.+|.|++|.|+++||+..+.
T Consensus 46 ~~~~~~~~~~~d~~~~g~i~~~ef~~~l~ 74 (149)
T PTZ00184 46 EAELQDMINEVDADGNGTIDFPEFLTLMA 74 (149)
T ss_pred HHHHHHHHHhcCcCCCCcCcHHHHHHHHH
Confidence 45677888888888888888888887665
No 44
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=96.96 E-value=0.0013 Score=36.59 Aligned_cols=30 Identities=13% Similarity=0.079 Sum_probs=25.4
Q ss_pred HHHHHHHHHHhhCC-CC-CCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDK-NQ-DGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~-d~-dG~Is~~EF~~~~~ 44 (60)
+...+..+|+.+|. |+ +|.|+.+|+...++
T Consensus 6 ~~~~l~~~F~~~D~~dg~dG~Is~~El~~~l~ 37 (94)
T cd05031 6 AMESLILTFHRYAGKDGDKNTLSRKELKKLME 37 (94)
T ss_pred HHHHHHHHHHHHhccCCCCCeECHHHHHHHHH
Confidence 44567889999997 98 69999999998775
No 45
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=96.90 E-value=0.0018 Score=38.12 Aligned_cols=28 Identities=32% Similarity=0.433 Sum_probs=24.3
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
...+.-.|..+|+|+||.||.+|+....
T Consensus 47 ~~~l~w~F~~lD~d~DG~Ls~~EL~~~~ 74 (116)
T cd00252 47 KDPVGWMFNQLDGNYDGKLSHHELAPIR 74 (116)
T ss_pred HHHHHHHHHHHCCCCCCcCCHHHHHHHH
Confidence 3457788999999999999999999764
No 46
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=96.82 E-value=0.0012 Score=36.84 Aligned_cols=29 Identities=21% Similarity=0.249 Sum_probs=25.3
Q ss_pred HHHHHHHHhhC-CCCCC-cccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMD-KNQDG-RLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D-~d~dG-~Is~~EF~~~~~~ 45 (60)
..+.++|+.+| .|+|| .|+.+|+...++.
T Consensus 8 ~~l~~aF~~fD~~dgdG~~I~~~eL~~ll~~ 38 (88)
T cd05027 8 VALIDVFHQYSGREGDKHKLKKSELKELINN 38 (88)
T ss_pred HHHHHHHHHhcccCCCcCEECHHHHHHHHHH
Confidence 45678999998 89999 5999999998875
No 47
>KOG4223|consensus
Probab=96.77 E-value=0.0018 Score=44.10 Aligned_cols=33 Identities=36% Similarity=0.534 Sum_probs=25.6
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
....-.+......+|+|+||+|+++||+..+-+
T Consensus 196 ~M~~iVi~Etl~d~Dkn~DG~I~~eEfigd~~~ 228 (325)
T KOG4223|consen 196 HMKDIVIAETLEDIDKNGDGKISLEEFIGDLYS 228 (325)
T ss_pred hHHHHHHHHHHhhcccCCCCceeHHHHHhHHhh
Confidence 344445677788999999999999999976654
No 48
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=96.76 E-value=0.0016 Score=36.61 Aligned_cols=29 Identities=17% Similarity=0.137 Sum_probs=25.8
Q ss_pred HHHHHHHHhhCC-CCCCcccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMDK-NQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D~-d~dG~Is~~EF~~~~~~ 45 (60)
.-+..+|+.||. +++|.|+..||+..+++
T Consensus 8 ~~l~~~F~~fd~~~~~g~i~~~ELk~ll~~ 37 (89)
T cd05022 8 ETLVSNFHKASVKGGKESLTASEFQELLTQ 37 (89)
T ss_pred HHHHHHHHHHhCCCCCCeECHHHHHHHHHH
Confidence 346789999999 99999999999998876
No 49
>PLN02964 phosphatidylserine decarboxylase
Probab=96.58 E-value=0.003 Score=46.31 Aligned_cols=25 Identities=32% Similarity=0.562 Sum_probs=14.7
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
++++|+.+|.|+||.|+++||+..+
T Consensus 181 i~~mf~~~D~DgdG~IdfdEFl~lL 205 (644)
T PLN02964 181 ARRILAIVDYDEDGQLSFSEFSDLI 205 (644)
T ss_pred HHHHHHHhCCCCCCeEcHHHHHHHH
Confidence 5566666666666666666665443
No 50
>KOG0028|consensus
Probab=96.57 E-value=0.0043 Score=38.92 Aligned_cols=39 Identities=23% Similarity=0.492 Sum_probs=29.7
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC------ChHHHHHh
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS------DPSIVRLL 53 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~------~~~il~~l 53 (60)
+.+.+...|+.+|.|++|+||+.+|+..+.. |.++..++
T Consensus 104 t~eEi~~afrl~D~D~~Gkis~~~lkrvakeLgenltD~El~eMI 148 (172)
T KOG0028|consen 104 TKEEIKKAFRLFDDDKTGKISQRNLKRVAKELGENLTDEELMEMI 148 (172)
T ss_pred cHHHHHHHHHcccccCCCCcCHHHHHHHHHHhCccccHHHHHHHH
Confidence 5567788899999999999999988876653 34666555
No 51
>KOG0034|consensus
Probab=96.50 E-value=0.0047 Score=39.07 Aligned_cols=31 Identities=26% Similarity=0.324 Sum_probs=26.9
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..++.+.-.|+-+|.|++|.|+.+|+...+.
T Consensus 101 ~~~~Kl~faF~vYD~~~~G~I~reel~~iv~ 131 (187)
T KOG0034|consen 101 SKREKLRFAFRVYDLDGDGFISREELKQILR 131 (187)
T ss_pred cHHHHHHHHHHHhcCCCCCcCcHHHHHHHHH
Confidence 3446888999999999999999999998765
No 52
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=96.31 E-value=0.0098 Score=33.26 Aligned_cols=31 Identities=23% Similarity=0.356 Sum_probs=25.5
Q ss_pred HHHHHHHHHh-hCCCCCC-cccHHHHHHHHcCC
Q psy15330 16 EKRTDKIFRQ-MDKNQDG-RLSLEEFIEGAKSD 46 (60)
Q Consensus 16 ~~~~~~if~~-~D~d~dG-~Is~~EF~~~~~~~ 46 (60)
-..+..+|+. +|+|++| .||.+||...+.+.
T Consensus 8 i~~l~~~F~~y~~~dg~~~~Ls~~Elk~ll~~e 40 (89)
T cd05023 8 IESLIAVFQKYAGKDGDSYQLSKTEFLSFMNTE 40 (89)
T ss_pred HHHHHHHHHHHhccCCCcCeECHHHHHHHHHHh
Confidence 3456788999 8899986 99999999988764
No 53
>PLN02964 phosphatidylserine decarboxylase
Probab=96.13 E-value=0.0065 Score=44.60 Aligned_cols=27 Identities=22% Similarity=0.443 Sum_probs=24.9
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+..+|+.+|+|++|.|+++||+..+..
T Consensus 217 L~eaFk~fDkDgdG~Is~dEL~~vL~~ 243 (644)
T PLN02964 217 KEELFKAADLNGDGVVTIDELAALLAL 243 (644)
T ss_pred HHHHHHHhCCCCCCcCCHHHHHHHHHh
Confidence 578999999999999999999998876
No 54
>KOG0041|consensus
Probab=95.99 E-value=0.0078 Score=39.28 Aligned_cols=28 Identities=25% Similarity=0.498 Sum_probs=24.9
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+..+|+.+|.+.||+|++.|.+.+|.+
T Consensus 100 ~~~~~Fk~yDe~rDgfIdl~ELK~mmEK 127 (244)
T KOG0041|consen 100 DAESMFKQYDEDRDGFIDLMELKRMMEK 127 (244)
T ss_pred HHHHHHHHhcccccccccHHHHHHHHHH
Confidence 4678999999999999999999988765
No 55
>PF10591 SPARC_Ca_bdg: Secreted protein acidic and rich in cysteine Ca binding region; InterPro: IPR019577 This entry represents the calcium-binding domain found in SPARC (Secreted Protein Acidic and Rich in Cysteine) and Testican (also known as SPOCK; or SParc/Osteonectin, Cwcv and Kazal-like domains) proteins. SPARC proteins are down-regulated in various tumours and may have a tumour-suppressor function [, ]. Testican-3 appears to be a novel regulator that reduces the activity of matrix metalloproteinase (MMP) in adult T-cell leukemia (ATL) []. This cysteine-rich domain is responsible for the anti-spreading activity of human urothelial cells. This extracellular calcium-binding domain is rich in alpha-helices and contains two EF-hands that each coordinates one Ca2+ ion, forming a helix-loop-helix structure that not only drives the conformation of the protein but is also necessary for biological activity. The anti-spreading activity was dependent on the coordination of Ca2+ by a Glu residue at the Z position of EF-hand 2 []. ; GO: 0005509 calcium ion binding, 0007165 signal transduction, 0005578 proteinaceous extracellular matrix; PDB: 1BMO_A 1SRA_A 2V53_A 1NUB_B.
Probab=95.92 E-value=0.0063 Score=35.49 Aligned_cols=27 Identities=41% Similarity=0.769 Sum_probs=19.8
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHH
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIE 41 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~ 41 (60)
++..+..+|+.-|.|+||.||+.||..
T Consensus 86 ~e~C~~~F~~~CD~n~d~~Is~~EW~~ 112 (113)
T PF10591_consen 86 PEHCARPFFRSCDVNKDGKISLDEWCN 112 (113)
T ss_dssp TGGGHHHHHHHH-TT-SSSEEHHHHHH
T ss_pred hHHHHHHHHHHcCCCCCCCCCHHHHcc
Confidence 444577888899999999999999864
No 56
>PRK12309 transaldolase/EF-hand domain-containing protein; Provisional
Probab=95.72 E-value=0.013 Score=40.77 Aligned_cols=27 Identities=30% Similarity=0.506 Sum_probs=24.3
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFI 40 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~ 40 (60)
.....+..+|+.+|.|+||.|+++||.
T Consensus 331 ~~~~~l~~aF~~~D~dgdG~Is~~E~~ 357 (391)
T PRK12309 331 AFTHAAQEIFRLYDLDGDGFITREEWL 357 (391)
T ss_pred hhhHHHHHHHHHhCCCCCCcCcHHHHH
Confidence 456678999999999999999999996
No 57
>KOG2643|consensus
Probab=95.60 E-value=0.0022 Score=45.44 Aligned_cols=28 Identities=39% Similarity=0.596 Sum_probs=24.8
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.++-+|.-||.|+||.||.+||+..+++
T Consensus 426 VvdvvF~IFD~N~Dg~LS~~EFl~Vmk~ 453 (489)
T KOG2643|consen 426 VVDVVFTIFDENNDGTLSHKEFLAVMKR 453 (489)
T ss_pred eeeeEEEEEccCCCCcccHHHHHHHHHH
Confidence 4667899999999999999999988764
No 58
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=95.55 E-value=0.015 Score=32.36 Aligned_cols=26 Identities=23% Similarity=0.220 Sum_probs=23.0
Q ss_pred HHHHHHhhCC-CC-CCcccHHHHHHHHc
Q psy15330 19 TDKIFRQMDK-NQ-DGRLSLEEFIEGAK 44 (60)
Q Consensus 19 ~~~if~~~D~-d~-dG~Is~~EF~~~~~ 44 (60)
+-.+|.++|. |+ +|.|+.+||...++
T Consensus 12 ~i~~F~~y~~~~~~~g~Is~~EL~~~l~ 39 (88)
T cd05029 12 LVAIFHKYSGREGDKNTLSKKELKELIQ 39 (88)
T ss_pred HHHHHHHHHccCCCCCEECHHHHHHHHH
Confidence 4578999999 78 89999999999985
No 59
>PF14658 EF-hand_9: EF-hand domain
Probab=95.47 E-value=0.028 Score=30.37 Aligned_cols=31 Identities=13% Similarity=0.404 Sum_probs=27.1
Q ss_pred HHHHHHHHHHhhCCCCC-CcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQD-GRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~d-G~Is~~EF~~~~~~ 45 (60)
.+..++.+.+.+|.++. |.|+|+.|...|+.
T Consensus 33 ~e~~Lq~l~~elDP~g~~~~v~~d~F~~iM~~ 64 (66)
T PF14658_consen 33 EESELQDLINELDPEGRDGSVNFDTFLAIMRD 64 (66)
T ss_pred cHHHHHHHHHHhCCCCCCceEeHHHHHHHHHH
Confidence 45688999999999987 99999999998863
No 60
>KOG0037|consensus
Probab=95.45 E-value=0.025 Score=36.84 Aligned_cols=33 Identities=24% Similarity=0.564 Sum_probs=19.4
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcC-----ChHHHHHh
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKS-----DPSIVRLL 53 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~-----~~~il~~l 53 (60)
.+|+.+|+|+.|.|+..|+..++.. +|.+.++|
T Consensus 128 ~vF~~~D~D~SG~I~~sEL~~Al~~~Gy~Lspq~~~~l 165 (221)
T KOG0037|consen 128 NVFRTYDRDRSGTIDSSELRQALTQLGYRLSPQFYNLL 165 (221)
T ss_pred HHHHhcccCCCCcccHHHHHHHHHHcCcCCCHHHHHHH
Confidence 4566666666666666666654432 45555554
No 61
>KOG0028|consensus
Probab=95.44 E-value=0.028 Score=35.32 Aligned_cols=34 Identities=24% Similarity=0.420 Sum_probs=29.4
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCC
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~ 46 (60)
+...+.+..+++.+|.|+||.|.-+||+..+++.
T Consensus 138 nltD~El~eMIeEAd~d~dgevneeEF~~imk~t 171 (172)
T KOG0028|consen 138 NLTDEELMEMIEEADRDGDGEVNEEEFIRIMKKT 171 (172)
T ss_pred cccHHHHHHHHHHhcccccccccHHHHHHHHhcC
Confidence 4456678899999999999999999999988764
No 62
>KOG0377|consensus
Probab=95.12 E-value=0.023 Score=40.93 Aligned_cols=26 Identities=35% Similarity=0.616 Sum_probs=23.2
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
++.+|+-+|.|+.|.||.+||...+.
T Consensus 549 LetiF~~iD~D~SG~isldEF~~a~~ 574 (631)
T KOG0377|consen 549 LETIFNIIDADNSGEISLDEFRTAWK 574 (631)
T ss_pred HHHHHHHhccCCCCceeHHHHHHHHH
Confidence 56899999999999999999997664
No 63
>KOG0036|consensus
Probab=94.48 E-value=0.047 Score=38.70 Aligned_cols=41 Identities=24% Similarity=0.450 Sum_probs=30.8
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCCh-HHHHHh
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDP-SIVRLL 53 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~-~il~~l 53 (60)
....+.+..+|+.+|.|.||.++|+||.+-+...+ .+.+++
T Consensus 47 ~~~~~~~~~l~~~~d~~~dg~vDy~eF~~Y~~~~E~~l~~~F 88 (463)
T KOG0036|consen 47 KPNYEAAKMLFSAMDANRDGRVDYSEFKRYLDNKELELYRIF 88 (463)
T ss_pred CCchHHHHHHHHhcccCcCCcccHHHHHHHHHHhHHHHHHHH
Confidence 34567788999999999999999999998776532 333443
No 64
>KOG4223|consensus
Probab=94.42 E-value=0.058 Score=36.93 Aligned_cols=34 Identities=26% Similarity=0.650 Sum_probs=28.5
Q ss_pred CCCCCCcHHHHHHHHHHhhCCCCCCcccHHHHHH
Q psy15330 8 MPEDESTPEKRTDKIFRQMDKNQDGRLSLEEFIE 41 (60)
Q Consensus 8 ~~~~~~~~~~~~~~if~~~D~d~dG~Is~~EF~~ 41 (60)
+|.+-..++.++..++-..|.|+||++|++|.+.
T Consensus 268 ~P~~~d~A~~EA~hL~~eaD~dkD~kLs~eEIl~ 301 (325)
T KOG4223|consen 268 LPSEQDHAKAEARHLLHEADEDKDGKLSKEEILE 301 (325)
T ss_pred CCCCccHHHHHHHHHhhhhccCccccccHHHHhh
Confidence 3445567888999999999999999999999875
No 65
>KOG2643|consensus
Probab=94.08 E-value=0.011 Score=42.01 Aligned_cols=25 Identities=24% Similarity=0.361 Sum_probs=21.8
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
....|+.||+|+||.|+.+||...+
T Consensus 235 F~IAFKMFD~dgnG~IdkeEF~~v~ 259 (489)
T KOG2643|consen 235 FRIAFKMFDLDGNGEIDKEEFETVQ 259 (489)
T ss_pred ceeeeeeeecCCCCcccHHHHHHHH
Confidence 4456999999999999999999765
No 66
>KOG0036|consensus
Probab=93.80 E-value=0.081 Score=37.57 Aligned_cols=34 Identities=26% Similarity=0.485 Sum_probs=26.2
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCCh
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDP 47 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~ 47 (60)
...+.++++|+.+|+++++.|.++||...+.-.|
T Consensus 115 l~de~~~k~~e~~d~~g~~~I~~~e~rd~~ll~p 148 (463)
T KOG0036|consen 115 LSDEKAAKFFEHMDKDGKATIDLEEWRDHLLLYP 148 (463)
T ss_pred cCHHHHHHHHHHhccCCCeeeccHHHHhhhhcCC
Confidence 4456677888888888888888888888776655
No 67
>KOG0030|consensus
Probab=93.58 E-value=0.16 Score=31.39 Aligned_cols=29 Identities=17% Similarity=0.266 Sum_probs=24.4
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
......+|..||..+||+|++..--..++
T Consensus 10 ~~e~ke~F~lfD~~gD~ki~~~q~gdvlR 38 (152)
T KOG0030|consen 10 MEEFKEAFLLFDRTGDGKISGSQVGDVLR 38 (152)
T ss_pred HHHHHHHHHHHhccCcccccHHHHHHHHH
Confidence 36678999999999999999988775554
No 68
>KOG1955|consensus
Probab=92.57 E-value=0.13 Score=37.64 Aligned_cols=31 Identities=29% Similarity=0.489 Sum_probs=26.6
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+-.++..||+.-|.|+||.+++.||...+.
T Consensus 262 lpi~ELshIWeLsD~d~DGALtL~EFcAAfH 292 (737)
T KOG1955|consen 262 LPIEELSHIWELSDVDRDGALTLSEFCAAFH 292 (737)
T ss_pred CchHHHHHHHhhcccCccccccHHHHHhhHh
Confidence 3456788999999999999999999997653
No 69
>KOG0037|consensus
Probab=92.49 E-value=0.2 Score=32.73 Aligned_cols=39 Identities=15% Similarity=0.333 Sum_probs=31.7
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHh
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l 53 (60)
++...+.+++++|.-++|.|.|++|+++|-.-+.+.+.+
T Consensus 158 spq~~~~lv~kyd~~~~g~i~FD~FI~ccv~L~~lt~~F 196 (221)
T KOG0037|consen 158 SPQFYNLLVRKYDRFGGGRIDFDDFIQCCVVLQRLTEAF 196 (221)
T ss_pred CHHHHHHHHHHhccccCCceeHHHHHHHHHHHHHHHHHH
Confidence 456788999999999999999999999887655555444
No 70
>KOG0377|consensus
Probab=92.27 E-value=0.2 Score=36.26 Aligned_cols=30 Identities=33% Similarity=0.648 Sum_probs=25.5
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
....+.++=+.+|.|+||.|++.||+++.+
T Consensus 585 ~~~~i~~la~~mD~NkDG~IDlNEfLeAFr 614 (631)
T KOG0377|consen 585 SDDEILELARSMDLNKDGKIDLNEFLEAFR 614 (631)
T ss_pred CHHHHHHHHHhhccCCCCcccHHHHHHHHh
Confidence 345667788899999999999999999875
No 71
>KOG0041|consensus
Probab=92.15 E-value=0.14 Score=33.60 Aligned_cols=27 Identities=26% Similarity=0.533 Sum_probs=23.1
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
-+..++...|.|.||+|||.||.-..+
T Consensus 136 ~lK~mikeVded~dgklSfreflLIfr 162 (244)
T KOG0041|consen 136 GLKNMIKEVDEDFDGKLSFREFLLIFR 162 (244)
T ss_pred HHHHHHHHhhcccccchhHHHHHHHHH
Confidence 467899999999999999999985443
No 72
>KOG4251|consensus
Probab=91.91 E-value=0.088 Score=35.59 Aligned_cols=20 Identities=35% Similarity=0.647 Sum_probs=13.7
Q ss_pred HHHhhCCCCCCcccHHHHHH
Q psy15330 22 IFRQMDKNQDGRLSLEEFIE 41 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~ 41 (60)
.|+..|.|+||.|+++||.-
T Consensus 145 hFraVDpdgDGhvsWdEykv 164 (362)
T KOG4251|consen 145 HFRAVDPDGDGHVSWDEYKV 164 (362)
T ss_pred heeeeCCCCCCceehhhhhh
Confidence 45667777777777777764
No 73
>KOG4578|consensus
Probab=91.58 E-value=0.16 Score=35.44 Aligned_cols=32 Identities=31% Similarity=0.482 Sum_probs=26.1
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+.+-..++|+.-|.|+|.+||+.||...+..
T Consensus 367 ~~rkC~rk~~~yCDlNkDKkISl~Ew~~CL~~ 398 (421)
T KOG4578|consen 367 KPRKCSRKFFKYCDLNKDKKISLDEWRGCLGV 398 (421)
T ss_pred cHHHHhhhcchhcccCCCceecHHHHhhhhcc
Confidence 45666788899999999999999999876653
No 74
>KOG0031|consensus
Probab=90.69 E-value=0.42 Score=30.05 Aligned_cols=28 Identities=25% Similarity=0.417 Sum_probs=23.4
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+..+.|..+|+|+||.|..+.....+.
T Consensus 32 qEfKEAF~~mDqnrDG~IdkeDL~d~~a 59 (171)
T KOG0031|consen 32 QEFKEAFNLMDQNRDGFIDKEDLRDMLA 59 (171)
T ss_pred HHHHHHHHHHhccCCCcccHHHHHHHHH
Confidence 3457889999999999999998886554
No 75
>KOG0046|consensus
Probab=90.59 E-value=0.4 Score=35.14 Aligned_cols=30 Identities=23% Similarity=0.418 Sum_probs=26.7
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
-.+++++.+....+.|.+|+|+|+||+..+
T Consensus 54 ~~~eei~~~l~~~~~~~~g~v~fe~f~~~~ 83 (627)
T KOG0046|consen 54 FVREEIKEILGEVGVDADGRVEFEEFVGIF 83 (627)
T ss_pred hhHHHHHHHHhccCCCcCCccCHHHHHHHH
Confidence 457889999999999999999999999844
No 76
>PF10591 SPARC_Ca_bdg: Secreted protein acidic and rich in cysteine Ca binding region; InterPro: IPR019577 This entry represents the calcium-binding domain found in SPARC (Secreted Protein Acidic and Rich in Cysteine) and Testican (also known as SPOCK; or SParc/Osteonectin, Cwcv and Kazal-like domains) proteins. SPARC proteins are down-regulated in various tumours and may have a tumour-suppressor function [, ]. Testican-3 appears to be a novel regulator that reduces the activity of matrix metalloproteinase (MMP) in adult T-cell leukemia (ATL) []. This cysteine-rich domain is responsible for the anti-spreading activity of human urothelial cells. This extracellular calcium-binding domain is rich in alpha-helices and contains two EF-hands that each coordinates one Ca2+ ion, forming a helix-loop-helix structure that not only drives the conformation of the protein but is also necessary for biological activity. The anti-spreading activity was dependent on the coordination of Ca2+ by a Glu residue at the Z position of EF-hand 2 []. ; GO: 0005509 calcium ion binding, 0007165 signal transduction, 0005578 proteinaceous extracellular matrix; PDB: 1BMO_A 1SRA_A 2V53_A 1NUB_B.
Probab=90.53 E-value=0.098 Score=30.45 Aligned_cols=29 Identities=28% Similarity=0.371 Sum_probs=19.4
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
...+.-.|..+|.|+||.|+..|+.....
T Consensus 53 ~~~~~W~F~~LD~n~d~~L~~~El~~l~~ 81 (113)
T PF10591_consen 53 KRVVHWKFCQLDRNKDGVLDRSELKPLRR 81 (113)
T ss_dssp HHHHHHHHHHH--T-SSEE-TTTTGGGGS
T ss_pred hhhhhhhHhhhcCCCCCccCHHHHHHHHH
Confidence 34566679999999999999999886543
No 77
>KOG4251|consensus
Probab=90.41 E-value=0.48 Score=32.15 Aligned_cols=43 Identities=19% Similarity=0.290 Sum_probs=31.4
Q ss_pred CCCCCCCCCCC---cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 3 GTVMKMPEDES---TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 3 g~~~~~~~~~~---~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
|...++.+.+. .....+..+|++.|.|.||+||-.|.++.+.+
T Consensus 84 gkdLggfDedaeprrsrrklmviFsKvDVNtDrkisAkEmqrwIme 129 (362)
T KOG4251|consen 84 GKDLGGFDEDAEPRRSRRKLMVIFSKVDVNTDRKISAKEMQRWIME 129 (362)
T ss_pred ccCCCCcccccchhHHHHHHHHHHhhcccCccccccHHHHHHHHHH
Confidence 44455444332 34455778999999999999999999987765
No 78
>KOG0031|consensus
Probab=90.31 E-value=0.58 Score=29.43 Aligned_cols=33 Identities=18% Similarity=0.337 Sum_probs=29.0
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..|++-+...|+.||.++.|.|.-+.+.+.+..
T Consensus 97 tdpe~~I~~AF~~FD~~~~G~I~~d~lre~Ltt 129 (171)
T KOG0031|consen 97 TDPEEVILNAFKTFDDEGSGKIDEDYLRELLTT 129 (171)
T ss_pred CCHHHHHHHHHHhcCccCCCccCHHHHHHHHHH
Confidence 457888999999999999999999988887764
No 79
>KOG4347|consensus
Probab=89.97 E-value=0.44 Score=35.44 Aligned_cols=32 Identities=19% Similarity=0.439 Sum_probs=27.9
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.....+.++|+.+|.+++|.|+|.+|+.++..
T Consensus 552 ~s~~~~~rlF~l~D~s~~g~Ltf~~lv~gL~~ 583 (671)
T KOG4347|consen 552 VSLIFLERLFRLLDDSMTGLLTFKDLVSGLSI 583 (671)
T ss_pred HHHHHHHHHHHhcccCCcceeEHHHHHHHHHH
Confidence 35567899999999999999999999988753
No 80
>KOG4004|consensus
Probab=89.74 E-value=0.26 Score=32.29 Aligned_cols=29 Identities=24% Similarity=0.391 Sum_probs=25.0
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEG 42 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~ 42 (60)
..+--+.++|+.-|.|+||+|+++||...
T Consensus 219 pme~c~~~f~e~cd~~nd~~ial~ew~~c 247 (259)
T KOG4004|consen 219 PMEHCTTRFFETCDLDNDKYIALDEWAGC 247 (259)
T ss_pred cHHhhchhhhhcccCCCCCceeHHHhhcc
Confidence 45566889999999999999999999863
No 81
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=89.50 E-value=0.65 Score=25.52 Aligned_cols=28 Identities=18% Similarity=0.221 Sum_probs=23.4
Q ss_pred HHHHHHHHhhCCC--CCCcccHHHHHHHHc
Q psy15330 17 KRTDKIFRQMDKN--QDGRLSLEEFIEGAK 44 (60)
Q Consensus 17 ~~~~~if~~~D~d--~dG~Is~~EF~~~~~ 44 (60)
..+-.+|.+++.. ++|.||.+||...+.
T Consensus 8 ~~~~~~f~~y~~~~~~~~~Is~~El~~ll~ 37 (88)
T cd05030 8 ETIINVFHQYSVRKGHPDTLYKKEFKQLVE 37 (88)
T ss_pred HHHHHHHHHHhccCCCcccCCHHHHHHHHH
Confidence 3466889999977 479999999999886
No 82
>PF05042 Caleosin: Caleosin related protein; InterPro: IPR007736 This family contains plant proteins related to caleosin. Caleosins contain calcium-binding domains and have an oleosin-like association with lipid bodies. Caleosins are present at relatively low levels and are mainly bound to microsomal membrane fractions at the early stages of seed development. As the seeds mature, overall levels of caleosins increased dramatically and they were associated almost exclusively with storage lipid bodies []. The calcium binding domain is probably related to the calcium-binding EF-hands motif IPR002048 from INTERPRO.
Probab=89.09 E-value=0.45 Score=30.10 Aligned_cols=35 Identities=20% Similarity=0.296 Sum_probs=30.0
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHH
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIV 50 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il 50 (60)
.+..++||.++++.+.+.+|+.|..++++.+....
T Consensus 95 p~kFe~iF~kya~~~~d~LT~~E~~~m~~~nr~~~ 129 (174)
T PF05042_consen 95 PQKFEEIFSKYAKTGPDALTLRELWRMLKGNRNAN 129 (174)
T ss_pred HHHHHHHHHHhCCCCCCCcCHHHHHHHHHhccccC
Confidence 45679999999999999999999999998876544
No 83
>KOG1029|consensus
Probab=86.73 E-value=0.55 Score=36.08 Aligned_cols=30 Identities=33% Similarity=0.570 Sum_probs=25.4
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
.+...+-.||..-|.|+||+++.+||+-.+
T Consensus 226 Lpq~~LA~IW~LsDvd~DGkL~~dEfilam 255 (1118)
T KOG1029|consen 226 LPQNQLAHIWTLSDVDGDGKLSADEFILAM 255 (1118)
T ss_pred CchhhHhhheeeeccCCCCcccHHHHHHHH
Confidence 455567789999999999999999999655
No 84
>KOG4666|consensus
Probab=86.70 E-value=0.6 Score=32.63 Aligned_cols=35 Identities=14% Similarity=0.314 Sum_probs=29.2
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHh
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l 53 (60)
+--+|+.+++-.+|+|++++|.+.+...|.+..+.
T Consensus 333 v~~lf~~i~q~d~~ki~~~~f~~fa~~~p~~a~~~ 367 (412)
T KOG4666|consen 333 VPVLFPSIEQKDDPKIYASNFRKFAATEPNLALSE 367 (412)
T ss_pred ccccchhhhcccCcceeHHHHHHHHHhCchhhhhh
Confidence 44678888898999999999999999998887554
No 85
>KOG0751|consensus
Probab=84.52 E-value=1.4 Score=32.53 Aligned_cols=30 Identities=17% Similarity=0.288 Sum_probs=20.0
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
...-..+|+.||+.++|.+|+++|....++
T Consensus 107 Dal~~~aFqlFDr~~~~~vs~~~~~~if~~ 136 (694)
T KOG0751|consen 107 DALFEVAFQLFDRLGNGEVSFEDVADIFGQ 136 (694)
T ss_pred hHHHHHHHHHhcccCCCceehHHHHHHHhc
Confidence 344556677777777777777777766553
No 86
>KOG0038|consensus
Probab=84.01 E-value=1.4 Score=27.71 Aligned_cols=26 Identities=19% Similarity=0.464 Sum_probs=21.7
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
-++|-+.|-.||+|.+||+.|+.+.+
T Consensus 73 k~ri~e~FSeDG~GnlsfddFlDmfS 98 (189)
T KOG0038|consen 73 KRRICEVFSEDGRGNLSFDDFLDMFS 98 (189)
T ss_pred HHHHHHHhccCCCCcccHHHHHHHHH
Confidence 34667788899999999999998764
No 87
>PF14658 EF-hand_9: EF-hand domain
Probab=83.76 E-value=0.68 Score=24.93 Aligned_cols=24 Identities=21% Similarity=0.501 Sum_probs=20.9
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
..|..+|.++.|.|....++..++
T Consensus 2 ~~F~~fD~~~tG~V~v~~l~~~Lr 25 (66)
T PF14658_consen 2 TAFDAFDTQKTGRVPVSDLITYLR 25 (66)
T ss_pred cchhhcCCcCCceEeHHHHHHHHH
Confidence 479999999999999999887655
No 88
>KOG1707|consensus
Probab=83.46 E-value=1.6 Score=32.42 Aligned_cols=32 Identities=25% Similarity=0.364 Sum_probs=27.3
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..-+++..+|.++|.|+||.++=+|+....+.
T Consensus 312 ~~~~Fl~~~f~~~D~d~Dg~L~p~El~~LF~~ 343 (625)
T KOG1707|consen 312 KGYRFLVDVFEKFDRDNDGALSPEELKDLFST 343 (625)
T ss_pred HHHHHHHHHHHhccCCCCCCcCHHHHHHHhhh
Confidence 34567899999999999999999999876654
No 89
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=82.16 E-value=2.6 Score=22.46 Aligned_cols=26 Identities=23% Similarity=0.405 Sum_probs=22.0
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+..+|+.+-. +.+.+|.++|.+.++.
T Consensus 2 i~~if~~ys~-~~~~mt~~~f~~FL~~ 27 (83)
T PF09279_consen 2 IEEIFRKYSS-DKEYMTAEEFRRFLRE 27 (83)
T ss_dssp HHHHHHHHCT-TSSSEEHHHHHHHHHH
T ss_pred HHHHHHHHhC-CCCcCCHHHHHHHHHH
Confidence 6789999955 8899999999998853
No 90
>PF05517 p25-alpha: p25-alpha ; InterPro: IPR008907 This family encodes a 25 kDa protein that is phosphorylated by a Ser/Thr-Pro kinase []. It has been described as a brain specific protein, but it is found in Tetrahymena thermophila.; PDB: 1WLM_A 1PUL_A 2JRF_A.
Probab=81.94 E-value=1.8 Score=26.36 Aligned_cols=31 Identities=19% Similarity=0.437 Sum_probs=23.5
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
-....++.+|.++-..+...|+|++|...+.
T Consensus 38 ~t~tdvDiiF~Kvk~k~~~~I~f~~F~~aL~ 68 (154)
T PF05517_consen 38 LTSTDVDIIFSKVKAKGARKITFEQFLEALA 68 (154)
T ss_dssp S-HHHHHHHHHHHT-SS-SEEEHHHHHHHHH
T ss_pred CchHHHHHHHHHhhcCCCcccCHHHHHHHHH
Confidence 3455689999998777778899999998774
No 91
>PF08976 DUF1880: Domain of unknown function (DUF1880); InterPro: IPR015070 This entry represents EF-hand calcium-binding domain-containing protein 6 that negatively regulates the androgen receptor by recruiting histone deacetylase complex, and protein DJ-1 antagonises this inhibition by abrogation of this complex [].; PDB: 1WLZ_C.
Probab=80.18 E-value=2.3 Score=25.38 Aligned_cols=32 Identities=28% Similarity=0.598 Sum_probs=22.4
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcCC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~ 46 (60)
.++..+++|.++-.|..|+|.|.||+.-+...
T Consensus 5 tDeQFdrLW~e~Pvn~~GrLkY~eFL~kfs~e 36 (118)
T PF08976_consen 5 TDEQFDRLWNEMPVNAKGRLKYQEFLSKFSSE 36 (118)
T ss_dssp -HHHHHHHHTTS-B-TTS-EEHHHHHHHT---
T ss_pred cHHHhhhhhhhCcCCccCCEeHHHHHHHcccc
Confidence 34567999999999999999999999876643
No 92
>PF00404 Dockerin_1: Dockerin type I repeat; InterPro: IPR018242 Gram-positive, thermophilic anaerobes such as Clostridium thermocellum or Clostridium cellulolyticum secretes a highly active and thermostable cellulase complex (cellulosome) responsible for the degradation of crystalline cellulose [, ]. The cellulosome contains at least 30 polypeptides, the majority of the enzymes are endoglucanases (3.2.1.4 from EC), but there are also some xylanases (3.2.1.8 from EC), beta-glucosidases (3.2.1.21 from EC) and endo-beta-1,3-1,4-glucanases (3.2.1.73 from EC). Complete sequence data for many of these enzymes has been obtained. A majority of these proteins contain a highly conserved type I dockerin domain of about 65 to 70 residues, which is generally (but not always) located in the C terminus. The dockerin domain is the binding partner of the cohesin domain (see IPR002102 from INTERPRO). The cohesin-dockerin interaction is the crucial interaction for complex formation in the cellulosome []. The dockerin domain contains a tandem repeat of two calcium-binding loop-helix motifs (distinct from EF-hand Ca-binding motifs). These motifs are about 24 amino acids in length. This entry represents these repeated Ca-binding motifs.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3P0D_J 1OHZ_B 2CCL_B 1DAV_A 1DAQ_A 2VN5_B 2VN6_B.
Probab=79.55 E-value=2 Score=18.17 Aligned_cols=15 Identities=27% Similarity=0.439 Sum_probs=11.2
Q ss_pred CCCCCCcccHHHHHH
Q psy15330 27 DKNQDGRLSLEEFIE 41 (60)
Q Consensus 27 D~d~dG~Is~~EF~~ 41 (60)
|.|+||.|+--.|.-
T Consensus 1 DvN~DG~vna~D~~~ 15 (21)
T PF00404_consen 1 DVNGDGKVNAIDLAL 15 (21)
T ss_dssp -TTSSSSSSHHHHHH
T ss_pred CCCCCCcCCHHHHHH
Confidence 789999998776653
No 93
>KOG0040|consensus
Probab=79.27 E-value=3 Score=34.52 Aligned_cols=29 Identities=31% Similarity=0.584 Sum_probs=25.2
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
....-+|+.||++++|.+++.+|..+++.
T Consensus 2253 ~EFs~~fkhFDkek~G~Ldhq~F~sCLrs 2281 (2399)
T KOG0040|consen 2253 KEFSMMFKHFDKEKNGRLDHQHFKSCLRS 2281 (2399)
T ss_pred HHHHHHHHHhchhhccCCcHHHHHHHHHh
Confidence 34567899999999999999999988774
No 94
>KOG3866|consensus
Probab=78.44 E-value=1.5 Score=30.64 Aligned_cols=25 Identities=36% Similarity=0.723 Sum_probs=21.7
Q ss_pred HHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 20 DKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+.+++.+|+|.|-.||++||+..-.
T Consensus 299 EHVMk~vDtNqDRlvtleEFL~~t~ 323 (442)
T KOG3866|consen 299 EHVMKQVDTNQDRLVTLEEFLNDTD 323 (442)
T ss_pred HHHHHhcccchhhhhhHHHHHhhhh
Confidence 4678899999999999999997654
No 95
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=75.57 E-value=2 Score=23.90 Aligned_cols=31 Identities=23% Similarity=0.389 Sum_probs=23.3
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
..|+.++.+....+|.+|+++.+.++|.+++
T Consensus 61 ~~~~~l~~~~~~~ls~~e~~~~l~~~p~Lik 91 (105)
T cd02977 61 TPYRKLGLADKDELSDEEALELMAEHPKLIK 91 (105)
T ss_pred chHHHcCCccccCCCHHHHHHHHHhCcCeee
Confidence 3455555544467899999999999999875
No 96
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=74.91 E-value=1.8 Score=23.37 Aligned_cols=29 Identities=17% Similarity=0.394 Sum_probs=21.4
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..+..+...++.=+.++||.+||++.++.
T Consensus 25 ~~~~~l~~~Y~~~k~~kIsR~~fvr~lR~ 53 (70)
T PF12174_consen 25 SKMDLLQKHYEEFKKKKISREEFVRKLRQ 53 (70)
T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHH
Confidence 34555556666667899999999987764
No 97
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=74.08 E-value=2.5 Score=19.45 Aligned_cols=16 Identities=31% Similarity=0.555 Sum_probs=13.4
Q ss_pred CCCcccHHHHHHHHcC
Q psy15330 30 QDGRLSLEEFIEGAKS 45 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~ 45 (60)
+.|.|+++||+.+.++
T Consensus 1 ~~~~i~~~~~~d~a~r 16 (33)
T PF09373_consen 1 SSGTISKEEYLDMASR 16 (33)
T ss_pred CCceecHHHHHHHHHH
Confidence 3689999999998865
No 98
>KOG0040|consensus
Probab=73.70 E-value=4.2 Score=33.77 Aligned_cols=34 Identities=32% Similarity=0.544 Sum_probs=28.1
Q ss_pred CCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 12 ESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 12 ~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+..|+-..+++....|.+.+|+||..+|+..|-+
T Consensus 2291 e~~~~p~fe~~ld~vDP~r~G~Vsl~dY~afmi~ 2324 (2399)
T KOG0040|consen 2291 EGEPEPEFEEILDLVDPNRDGYVSLQDYMAFMIS 2324 (2399)
T ss_pred cCCCChhHHHHHHhcCCCCcCcccHHHHHHHHHh
Confidence 3345556789999999999999999999987754
No 99
>KOG0030|consensus
Probab=72.03 E-value=4.7 Score=25.00 Aligned_cols=30 Identities=23% Similarity=0.184 Sum_probs=21.9
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+.++++ +-++.||+.++|.|...|+...+.
T Consensus 86 t~edfv-egLrvFDkeg~G~i~~aeLRhvLt 115 (152)
T KOG0030|consen 86 TYEDFV-EGLRVFDKEGNGTIMGAELRHVLT 115 (152)
T ss_pred cHHHHH-HHHHhhcccCCcceeHHHHHHHHH
Confidence 344443 456889999999999999887654
No 100
>KOG1029|consensus
Probab=65.38 E-value=7.7 Score=30.19 Aligned_cols=30 Identities=30% Similarity=0.520 Sum_probs=26.2
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
|.-.+-.||...|.|+||+++..||--.|+
T Consensus 47 P~~VLaqIWALsDldkDGrmdi~EfSIAmk 76 (1118)
T KOG1029|consen 47 PTPVLAQIWALSDLDKDGRMDIREFSIAMK 76 (1118)
T ss_pred ChHHHHHHHHhhhcCccccchHHHHHHHHH
Confidence 445678899999999999999999998775
No 101
>PF00427 PBS_linker_poly: Phycobilisome Linker polypeptide; InterPro: IPR001297 Phycobilisomes (PBSs) are the major light-harvesting systems in cyanobacteria and red algae. PBS is a supercomplex that is composed of a core complex and multiple peripheral rod complexes. Typically, the core consists of two or five cylinders lying on the membrane with, in most cases, multiple rods radiating from the core to form a hemidiscoidal structure. The building units of the core cylinders and the peripheral rods are trimeric and hexameric discs, in which a monomer consists of a pair of related phycobiliproteins (PBPs), such as phycorerythrins, phycoerythrocyanins, phycocyanins, and allophycocyanins. The discs are connected to each other via specific linker polypeptides to form peripheral rods or core cylinders. Linker polypeptides share a conserved domain of ~180 residues, which can be present in one or multiple copies [, , , , ].; GO: 0015979 photosynthesis, 0030089 phycobilisome; PDB: 2L8V_A 2KY4_A 3OSJ_D 2L06_A 3NPH_B 2L3W_A 3PRU_C 3OHW_A.
Probab=64.57 E-value=9.6 Score=22.98 Aligned_cols=22 Identities=18% Similarity=0.456 Sum_probs=16.7
Q ss_pred CCCcccHHHHHHHHcCChHHHH
Q psy15330 30 QDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.+|.||..||++++.+.+.-.+
T Consensus 41 rng~IsVreFVr~La~S~~yr~ 62 (131)
T PF00427_consen 41 RNGQISVREFVRALAKSELYRK 62 (131)
T ss_dssp HTTSS-HHHHHHHHHTSHHHHH
T ss_pred HcCCCcHHHHHHHHHcCHHHHH
Confidence 5899999999999987654443
No 102
>KOG1954|consensus
Probab=62.36 E-value=7.2 Score=28.12 Aligned_cols=28 Identities=36% Similarity=0.504 Sum_probs=24.1
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIE 41 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~ 41 (60)
.+...+-++|+..|.|+||.++-+||.-
T Consensus 474 lpnsvlgkiwklad~d~dg~ld~eefal 501 (532)
T KOG1954|consen 474 LPNSVLGKIWKLADIDKDGMLDDEEFAL 501 (532)
T ss_pred CchhHHHhhhhhhcCCcccCcCHHHHHH
Confidence 3455678999999999999999999973
No 103
>KOG2562|consensus
Probab=62.33 E-value=10 Score=27.54 Aligned_cols=29 Identities=24% Similarity=0.557 Sum_probs=17.8
Q ss_pred HHHHHHHHHH----hhCCCCCCcccHHHHHHHH
Q psy15330 15 PEKRTDKIFR----QMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 15 ~~~~~~~if~----~~D~d~dG~Is~~EF~~~~ 43 (60)
.+.++++||+ .+=.-.+|+++|++|.-.+
T Consensus 309 t~~ivdRIFs~v~r~~~~~~eGrmdykdFv~Fi 341 (493)
T KOG2562|consen 309 TERIVDRIFSQVPRGFTVKVEGRMDYKDFVDFI 341 (493)
T ss_pred hhHHHHHHHhhccccceeeecCcccHHHHHHHH
Confidence 3556677777 3444466777777776544
No 104
>KOG4666|consensus
Probab=61.79 E-value=6.9 Score=27.54 Aligned_cols=28 Identities=14% Similarity=0.221 Sum_probs=24.3
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.....+|..||.+++|.++|.|+..++.
T Consensus 259 d~l~~~f~LFde~~tg~~D~re~v~~la 286 (412)
T KOG4666|consen 259 DKLAPTFMLFDEGTTGNGDYRETVKTLA 286 (412)
T ss_pred hhhhhhhheecCCCCCcccHHHHhhhhe
Confidence 5567899999999999999999997654
No 105
>KOG4065|consensus
Probab=60.54 E-value=14 Score=22.41 Aligned_cols=34 Identities=21% Similarity=0.246 Sum_probs=28.2
Q ss_pred CCCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 11 DESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 11 ~~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
.+.++++.--..|+..|.|+||.|+-=|.++.+.
T Consensus 61 a~mtpeqlqfHYF~MHDldknn~lDGiEl~kAiT 94 (144)
T KOG4065|consen 61 AKMTPEQLQFHYFSMHDLDKNNFLDGIELLKAIT 94 (144)
T ss_pred hhCCHHHHhhhhhhhhccCcCCcchHHHHHHHHH
Confidence 3567777777889999999999999989887664
No 106
>KOG3555|consensus
Probab=60.21 E-value=7 Score=27.65 Aligned_cols=28 Identities=21% Similarity=0.176 Sum_probs=23.4
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHH
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEG 42 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~ 42 (60)
-...+-=+|.++|.|.||.++..|....
T Consensus 248 CKds~gWMFnklD~N~Dl~Ld~sEl~~I 275 (434)
T KOG3555|consen 248 CKDSLGWMFNKLDTNYDLLLDQSELRAI 275 (434)
T ss_pred hhhhhhhhhhccccccccccCHHHhhhh
Confidence 3456778899999999999999998753
No 107
>cd02518 GT2_SpsF SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat. Spore coat polysaccharide biosynthesis protein F (spsF) is a glycosyltransferase implicated in the synthesis of the spore coat in a variety of bacteria challenged by stress as starvation. The spsF gene is expressed in the late stage of coat development responsible for a terminal step in coat formation that involves the glycosylation of the coat. SpsF gene mutation resulted in spores that appeared normal. But, the spores tended to aggregate and had abnormal adsorption properties, indicating a surface alteration.
Probab=59.73 E-value=25 Score=21.78 Aligned_cols=41 Identities=24% Similarity=0.454 Sum_probs=29.6
Q ss_pred CCCcHHH--HHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHH
Q psy15330 11 DESTPEK--RTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRL 52 (60)
Q Consensus 11 ~~~~~~~--~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~ 52 (60)
|-.++++ .++.+++.++.. +-.+|++|-++.+.+||.|.+.
T Consensus 188 DiDt~eD~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~p~l~~~ 230 (233)
T cd02518 188 TVDTPEDFELIKEIYEALYPK-NPDFSLEDIIELLDKNPELFEI 230 (233)
T ss_pred ecCCHHHHHHHHHHHHHhcCC-CCCCCHHHHHHHHHhChhHHHH
Confidence 3345554 466777777653 3338999999999999999875
No 108
>cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine.
Probab=59.39 E-value=4.9 Score=22.81 Aligned_cols=29 Identities=24% Similarity=0.445 Sum_probs=20.3
Q ss_pred HHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 23 FRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 23 f~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
|+.+..+....+|-+|+++.+..+|.+++
T Consensus 61 yk~l~l~~~~~~s~~e~~~~l~~~p~Lik 89 (105)
T cd03035 61 WRKLDDAQKAALDAAKAIALMLEHPSLIK 89 (105)
T ss_pred HHhCChhhhccCCHHHHHHHHHhCcCeee
Confidence 34444442245788999999999998875
No 109
>cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif.
Probab=58.52 E-value=11 Score=21.47 Aligned_cols=20 Identities=40% Similarity=0.592 Sum_probs=17.2
Q ss_pred CcccHHHHHHHHcCChHHHH
Q psy15330 32 GRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 32 G~Is~~EF~~~~~~~~~il~ 51 (60)
..+|-+|+++.+..+|.+++
T Consensus 72 ~~ls~~e~i~~l~~~p~Lik 91 (115)
T cd03032 72 DELSLSELIRLISEHPSLLR 91 (115)
T ss_pred ccCCHHHHHHHHHhChhhee
Confidence 45889999999999999875
No 110
>COG5562 Phage envelope protein [General function prediction only]
Probab=56.76 E-value=8.1 Score=23.62 Aligned_cols=24 Identities=25% Similarity=0.454 Sum_probs=18.6
Q ss_pred HHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 22 IFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
+-.....+..|..||+||...+-+
T Consensus 77 i~~al~~~qsGqttF~ef~~~la~ 100 (137)
T COG5562 77 IKTALRRHQSGQTTFEEFCSALAE 100 (137)
T ss_pred HHHHHHHHhcCCccHHHHHHHHHh
Confidence 345567789999999999986643
No 111
>PF08726 EFhand_Ca_insen: Ca2+ insensitive EF hand; InterPro: IPR014837 EF hands are helix-loop-helix binding motifs involved in the regulation of many cellular processes. EF hands usually bind to Ca2+ ions, which cause a major conformational change that allows the protein to interact with its designated targets. This protein corresponds to an EF hand which has partially or entirely lost its calcium-binding properties. The calcium insensitive EF hand is still able to mediate protein-protein recognition []. ; PDB: 1H8B_A 1SJJ_B.
Probab=56.57 E-value=19 Score=19.33 Aligned_cols=32 Identities=9% Similarity=0.276 Sum_probs=25.2
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcCChH
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPS 48 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~ 48 (60)
.+.+...|+.+ .++.++||-+|+...+..+..
T Consensus 5 ~eqv~~aFr~l-A~~KpyVT~~dLr~~l~pe~a 36 (69)
T PF08726_consen 5 AEQVEEAFRAL-AGGKPYVTEEDLRRSLTPEQA 36 (69)
T ss_dssp CHHHHHHHHHH-CTSSSCEEHHHHHHHS-CCCH
T ss_pred HHHHHHHHHHH-HcCCCcccHHHHHHHcCcHHH
Confidence 34578899999 777899999999998876543
No 112
>KOG0751|consensus
Probab=56.56 E-value=14 Score=27.57 Aligned_cols=28 Identities=25% Similarity=0.434 Sum_probs=19.5
Q ss_pred HHhhCCCCCCcccHHHHHH--HHcCChHHH
Q psy15330 23 FRQMDKNQDGRLSLEEFIE--GAKSDPSIV 50 (60)
Q Consensus 23 f~~~D~d~dG~Is~~EF~~--~~~~~~~il 50 (60)
=.-.|.-+||.|||+||.. .+.=.|+.+
T Consensus 80 a~iaD~tKDglisf~eF~afe~~lC~pDal 109 (694)
T KOG0751|consen 80 ASIADQTKDGLISFQEFRAFESVLCAPDAL 109 (694)
T ss_pred HhhhhhcccccccHHHHHHHHhhccCchHH
Confidence 3456788999999999984 333344444
No 113
>PF09068 EF-hand_2: EF hand; InterPro: IPR015153 Like other EF hand domains, this domain forms a helix-loop-helix motif, though since it does not contain the canonical pattern of calcium binding residues found in many EF hand domains, it does not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (IPR001202 from INTERPRO), enhancing dystroglycan binding []. ; PDB: 1EG3_A 1EG4_A.
Probab=55.77 E-value=22 Score=21.03 Aligned_cols=30 Identities=10% Similarity=0.206 Sum_probs=22.0
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
..+-.++-++..+|.++.|.|+.-.|+.++
T Consensus 94 a~~L~ln~Ll~vyD~~rtG~I~vls~KvaL 123 (127)
T PF09068_consen 94 AVDLLLNWLLNVYDSQRTGKIRVLSFKVAL 123 (127)
T ss_dssp HHHHHHHHHHHHH-TT--SEEEHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCCCCCeeehhHHHHHH
Confidence 345556777999999999999999998776
No 114
>PF13608 Potyvirid-P3: Protein P3 of Potyviral polyprotein
Probab=55.73 E-value=13 Score=26.40 Aligned_cols=38 Identities=18% Similarity=0.420 Sum_probs=29.8
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC-ChHHHHHhc
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS-DPSIVRLLQ 54 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~-~~~il~~l~ 54 (60)
+..+..++ .+-.-..+..|++||.+.+++ +|+++..+.
T Consensus 288 ~~~i~~ly-~~~~~~~~~pt~eEF~e~v~~~~p~L~~~~~ 326 (445)
T PF13608_consen 288 EDEIEHLY-MLCKKHGKLPTEEEFLEYVEEVNPELLEFAE 326 (445)
T ss_pred HHHHHHHH-HHHHHhCCCCCHHHHHHHHHhcCchHHHHHH
Confidence 34566677 777777899999999998764 899888765
No 115
>KOG2871|consensus
Probab=55.63 E-value=22 Score=25.47 Aligned_cols=43 Identities=16% Similarity=0.309 Sum_probs=33.0
Q ss_pred CCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC------ChHHHHHhc
Q psy15330 12 ESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS------DPSIVRLLQ 54 (60)
Q Consensus 12 ~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~------~~~il~~l~ 54 (60)
+.++++.+.++|+..|.-++|.||.+-|...+.. .|+.+.++.
T Consensus 304 ~~~~s~q~rR~f~a~d~~d~nfis~s~~~~vm~~~N~~vse~a~v~l~~ 352 (449)
T KOG2871|consen 304 PENPSEQLRRNFHAYDPEDNNFISCSGLQIVMTALNRLVSEPAYVMLMR 352 (449)
T ss_pred CCCCCHHHHhhhhccCccCCCeeecHHHHHHHHHhcccccCHHHHHHhc
Confidence 4566788899999999999999999888876653 455555543
No 116
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=55.11 E-value=4.4 Score=22.83 Aligned_cols=21 Identities=33% Similarity=0.482 Sum_probs=17.0
Q ss_pred CCcccHHHHHHHHcCChHHHH
Q psy15330 31 DGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 31 dG~Is~~EF~~~~~~~~~il~ 51 (60)
...+|-+|+++.+..+|.+++
T Consensus 68 ~~~~s~~e~i~~l~~~p~Lik 88 (110)
T PF03960_consen 68 KDDLSDEELIELLLENPKLIK 88 (110)
T ss_dssp CTTSBHHHHHHHHHHSGGGB-
T ss_pred hhhhhhHHHHHHHHhChhhee
Confidence 457899999999999998865
No 117
>PF01023 S_100: S-100/ICaBP type calcium binding domain; InterPro: IPR013787 The calcium-binding domain found in S100 and CaBP-9k proteins is a subfamily of the EF-hand calcium-binding domain []. S100s are small dimeric acidic calcium and zinc-binding proteins abundant in the brain, with S100B playing an important role in modulating the proliferation and differentiation of neurons and glia cells []. S100 proteins have two different types of calcium-binding sites: a low affinity one with a special structure, and a 'normal' EF-hand type high-affinity site. Calbindin-D9k (CaBP-9k) also belong to this family of proteins, but it does not form dimers. CaBP-9k is a cytosolic protein expressed in a variety of tissues. Although its precise function is unknown, it appears to be under the control of the steroid hormones oestrogen and progesterone in the female reproductive system []. In the intestine, CaBP-9k may be involved in calcium absorption by mediating intracellular diffusion []. This entry represents a subdomain of the calcium-binding domain found in S100, CaBP-9k, and related proteins.; PDB: 2RGI_A 4DUQ_B 2KAY_B 2KAX_A 2CNP_A 1CNP_A 1A03_A 1JWD_B 2JTT_A 1XK4_B ....
Probab=53.61 E-value=19 Score=17.51 Aligned_cols=27 Identities=22% Similarity=0.241 Sum_probs=19.3
Q ss_pred HHHHHHhhC-CCC-CCcccHHHHHHHHcC
Q psy15330 19 TDKIFRQMD-KNQ-DGRLSLEEFIEGAKS 45 (60)
Q Consensus 19 ~~~if~~~D-~d~-dG~Is~~EF~~~~~~ 45 (60)
+-.+|.++- ++| ...+|..||++.+.+
T Consensus 8 iI~vFhkYa~~~Gd~~~Lsk~Elk~Ll~~ 36 (44)
T PF01023_consen 8 IIDVFHKYAGKEGDKDTLSKKELKELLEK 36 (44)
T ss_dssp HHHHHHHHHTSSSSTTSEEHHHHHHHHHH
T ss_pred HHHHHHHHhccCCCCCeEcHHHHHHHHHH
Confidence 456666654 444 478999999998864
No 118
>PF09069 EF-hand_3: EF-hand; InterPro: IPR015154 Like other EF hand domains, this domain forms a helix-loop-helix motif, though since it does not contain the canonical pattern of calcium binding residues found in many EF hand domains, it does not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (IPR001202 from INTERPRO), enhancing dystroglycan binding []. ; PDB: 1EG3_A 1EG4_A.
Probab=51.34 E-value=23 Score=19.98 Aligned_cols=34 Identities=18% Similarity=0.345 Sum_probs=22.7
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHH
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIV 50 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il 50 (60)
.+.-+...|+.. ...-.|+.++|+..+..+|..+
T Consensus 47 ~e~sv~sCF~~~--~~~~~I~~~~Fl~wl~~ePq~l 80 (90)
T PF09069_consen 47 IEPSVRSCFQQV--QLSPKITENQFLDWLMSEPQSL 80 (90)
T ss_dssp -HHHHHHHHHHT--TT-S-B-HHHHHHHHHT--TTT
T ss_pred cHHHHHHHhccc--CCCCccCHHHHHHHHHhCCCee
Confidence 566778889876 4677899999999999988643
No 119
>KOG3555|consensus
Probab=50.32 E-value=14 Score=26.21 Aligned_cols=30 Identities=23% Similarity=0.326 Sum_probs=25.7
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
|.-+.-+|..-|+.+||.||-.||.-...+
T Consensus 281 E~CikpFfnsCD~~kDg~iS~~EWC~CF~k 310 (434)
T KOG3555|consen 281 EACIKPFFNSCDTYKDGSISTNEWCYCFQK 310 (434)
T ss_pred hhHHHHHHhhhcccccCccccchhhhhhcc
Confidence 445788999999999999999999976655
No 120
>cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases.
Probab=49.81 E-value=17 Score=20.68 Aligned_cols=28 Identities=21% Similarity=0.487 Sum_probs=20.5
Q ss_pred HHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 23 FRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 23 f~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
|+.+..+. ..+|-+|+++.+..+|.+++
T Consensus 63 y~~l~~~~-~~ls~~e~i~ll~~~P~Lik 90 (112)
T cd03034 63 YKELGLAD-PELSDEELIDAMAAHPILIE 90 (112)
T ss_pred HHHcCCCc-cCCCHHHHHHHHHhCcCccc
Confidence 33443333 46899999999999998875
No 121
>KOG2243|consensus
Probab=49.27 E-value=13 Score=31.50 Aligned_cols=26 Identities=19% Similarity=0.459 Sum_probs=22.4
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCC
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~ 46 (60)
..|+.+|.||.|.||..+|.+.+...
T Consensus 4061 dtfkeydpdgkgiiskkdf~kame~~ 4086 (5019)
T KOG2243|consen 4061 DTFKEYDPDGKGIISKKDFHKAMEGH 4086 (5019)
T ss_pred ccchhcCCCCCccccHHHHHHHHhcc
Confidence 35788999999999999999988753
No 122
>KOG0169|consensus
Probab=48.47 E-value=24 Score=27.03 Aligned_cols=32 Identities=34% Similarity=0.586 Sum_probs=26.1
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
...+..+..+|+..|+|++|.+++.|-...++
T Consensus 132 ~~~~~wi~~~~~~ad~~~~~~~~~~~~~~~~~ 163 (746)
T KOG0169|consen 132 SRREHWIHSIFQEADKNKNGHMSFDEVLDLLK 163 (746)
T ss_pred chHHHHHHHHHHHHccccccccchhhHHHHHH
Confidence 34556788999999999999999998776654
No 123
>PF11333 DUF3135: Protein of unknown function (DUF3135); InterPro: IPR021482 This family of proteins with unkown function appears to be restricted to Proteobacteria.
Probab=46.64 E-value=23 Score=19.67 Aligned_cols=21 Identities=29% Similarity=0.422 Sum_probs=18.0
Q ss_pred ccHHHHHHHHcCChHHHHHhc
Q psy15330 34 LSLEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 34 Is~~EF~~~~~~~~~il~~l~ 54 (60)
-+|+|++...+.||+-+..|.
T Consensus 4 p~FD~L~~LA~~dPe~fe~lr 24 (83)
T PF11333_consen 4 PDFDELKELAQNDPEAFEQLR 24 (83)
T ss_pred CCHHHHHHHHHhCHHHHHHHH
Confidence 589999999999999887763
No 124
>PRK13696 hypothetical protein; Provisional
Probab=46.54 E-value=22 Score=18.93 Aligned_cols=33 Identities=12% Similarity=0.238 Sum_probs=20.1
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcC-----ChHHHHHhc
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKS-----DPSIVRLLQ 54 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~-----~~~il~~l~ 54 (60)
.+++.+..- .|..||.||+.-+-+ ...|+..++
T Consensus 11 d~Y~~L~~k-k~~~SFSevi~~L~~~~~~~~~~l~~~~G 48 (62)
T PRK13696 11 DVYEKLLEI-KGDKSFSEVIRELIEKKKGNLDKLMKAFG 48 (62)
T ss_pred HHHHHHHHH-hCCCCHHHHHHHHHHHhhccHHHHHHHHC
Confidence 344444433 355799999987662 345666554
No 125
>PF09454 Vps23_core: Vps23 core domain; InterPro: IPR017916 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ]. This entry represents the Steadiness box domain.; PDB: 2CAZ_A 2F66_D 2F6M_A 2P22_A.
Probab=46.51 E-value=18 Score=19.13 Aligned_cols=16 Identities=31% Similarity=0.702 Sum_probs=12.0
Q ss_pred CCCcccHHHHHHHHcC
Q psy15330 30 QDGRLSLEEFIEGAKS 45 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~ 45 (60)
+.|.|+++.|++.++.
T Consensus 36 ~~g~I~~d~~lK~vR~ 51 (65)
T PF09454_consen 36 QRGSIDLDTFLKQVRS 51 (65)
T ss_dssp HTTSS-HHHHHHHHHH
T ss_pred HcCCCCHHHHHHHHHH
Confidence 4688999999987753
No 126
>PF08044 DUF1707: Domain of unknown function (DUF1707); InterPro: IPR012551 This domain is found in a variety of actinomycetales proteins. All of the proteins containing this domain are hypothetical and probably membrane bound or associated. Currently, it is unclear to the function of this domain.
Probab=46.29 E-value=19 Score=18.21 Aligned_cols=14 Identities=57% Similarity=0.831 Sum_probs=11.8
Q ss_pred CCCcccHHHHHHHH
Q psy15330 30 QDGRLSLEEFIEGA 43 (60)
Q Consensus 30 ~dG~Is~~EF~~~~ 43 (60)
.+|+||.+||-+-+
T Consensus 20 a~GrL~~~Ef~~R~ 33 (53)
T PF08044_consen 20 AEGRLSLDEFDERL 33 (53)
T ss_pred HCCCCCHHHHHHHH
Confidence 58999999998654
No 127
>cd08330 CARD_ASC_NALP1 Caspase activation and recruitment domain found in Human ASC, NALP1, and similar proteins. Caspase activation and recruitment domain (CARD) similar to those found in human ASC (Apoptosis-associated speck-like protein containing a CARD) and NALP1 (CARD7, NLRP1). ASC, an adaptor molecule, and NALP1, a member of the Nod-like receptor (NLR) family, are involved in the assembly of the 'inflammasome', a multiprotein platform, which is responsible for caspase-1 activation and regulation of IL-1beta maturation. In general, CARDs are death domains (DDs) associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They se
Probab=46.27 E-value=44 Score=18.16 Aligned_cols=39 Identities=21% Similarity=0.300 Sum_probs=28.0
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHc-CChHHHHHh
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAK-SDPSIVRLL 53 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~-~~~~il~~l 53 (60)
.+..+.+.+++..+.+ -|...|.-|.+.++ .+|.+..-|
T Consensus 42 ~T~~~kar~Lld~l~~--kG~~A~~~F~~~L~e~~p~L~~~L 81 (82)
T cd08330 42 KTNQEKMRKLFSFVRS--WGASCKDIFYQILREEEPYLVEDL 81 (82)
T ss_pred CCcHHHHHHHHHHHHc--cCHHHHHHHHHHHHHhChHHHhHc
Confidence 4666777888876655 67778888888887 577776543
No 128
>PRK01655 spxA transcriptional regulator Spx; Reviewed
Probab=46.09 E-value=16 Score=21.41 Aligned_cols=28 Identities=25% Similarity=0.592 Sum_probs=20.1
Q ss_pred HHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 23 FRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 23 f~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
|+.+..+. ..+|.+|+++.+..+|.+++
T Consensus 64 ~k~l~~~~-~~ls~~e~i~ll~~~p~Lik 91 (131)
T PRK01655 64 FQKLNVDV-ESLSLQDLIKLISDNPGLLR 91 (131)
T ss_pred HHhCCCCc-ccCCHHHHHHHHHhCcceEe
Confidence 44444443 35788999999999998865
No 129
>PRK13344 spxA transcriptional regulator Spx; Reviewed
Probab=46.05 E-value=15 Score=21.72 Aligned_cols=29 Identities=24% Similarity=0.438 Sum_probs=20.7
Q ss_pred HHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 22 IFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.|+.++.+. ..+|-+|+++.+..+|.+++
T Consensus 63 ~~k~L~~~~-~~ls~~e~i~ll~~~P~Lik 91 (132)
T PRK13344 63 YAKALDCDI-EELSVNEVIDLIQENPRILK 91 (132)
T ss_pred HHHhCCcch-hcCCHHHHHHHHHhCcccee
Confidence 344444333 46888999999999998875
No 130
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=44.90 E-value=19 Score=16.14 Aligned_cols=13 Identities=31% Similarity=0.501 Sum_probs=10.9
Q ss_pred CCcccHHHHHHHH
Q psy15330 31 DGRLSLEEFIEGA 43 (60)
Q Consensus 31 dG~Is~~EF~~~~ 43 (60)
.|.||-+||.+.-
T Consensus 14 ~G~IseeEy~~~k 26 (31)
T PF09851_consen 14 KGEISEEEYEQKK 26 (31)
T ss_pred cCCCCHHHHHHHH
Confidence 6999999998653
No 131
>PRK08032 fliD flagellar capping protein; Reviewed
Probab=44.25 E-value=15 Score=25.94 Aligned_cols=27 Identities=30% Similarity=0.660 Sum_probs=22.3
Q ss_pred CCC-CCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 27 DKN-QDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 27 D~d-~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
..+ .+|.|++++ |.+.+..||+-+..|
T Consensus 347 ~~~~~~G~L~lD~~kl~~al~~np~~V~~l 376 (462)
T PRK08032 347 TQDPSDGKLEIDDDKLTKALKEDPAGVKAL 376 (462)
T ss_pred eccCCCCeEEEcHHHHHHHHHHCHHHHHHH
Confidence 345 789999998 999999999877654
No 132
>PRK12559 transcriptional regulator Spx; Provisional
Probab=43.31 E-value=17 Score=21.38 Aligned_cols=30 Identities=33% Similarity=0.516 Sum_probs=21.9
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
..|+.++.+. -.+|.+|.++.+..+|.|++
T Consensus 62 ~~~k~l~~~~-~~ls~~e~i~ll~~~P~Lik 91 (131)
T PRK12559 62 KTFQDLNINI-EELSLNEFYKLIIEHPLMLR 91 (131)
T ss_pred HHHHhCCCCc-ccCCHHHHHHHHHhCcceEe
Confidence 3455565554 35788999999999998875
No 133
>PF06226 DUF1007: Protein of unknown function (DUF1007); InterPro: IPR010412 This is a family of conserved bacterial proteins with unknown function.
Probab=43.28 E-value=24 Score=22.34 Aligned_cols=24 Identities=38% Similarity=0.553 Sum_probs=19.8
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
-++..+|.|+||.++-+|......
T Consensus 54 ~ll~~~D~~~dg~~~~~el~~l~~ 77 (212)
T PF06226_consen 54 YLLEGLDKDGDGKLDPEELAALAK 77 (212)
T ss_pred HHHHhhhhcccCCCCHHHHHHHHH
Confidence 445689999999999999887654
No 134
>PRK10026 arsenate reductase; Provisional
Probab=43.03 E-value=16 Score=22.06 Aligned_cols=30 Identities=10% Similarity=0.364 Sum_probs=21.9
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
..|+.+..+.+ .+|.+|+++.+..+|.|++
T Consensus 64 ~~yr~L~~~~~-~ls~~e~l~ll~~~P~LIK 93 (141)
T PRK10026 64 EPYEELGLAED-KFTDDQLIDFMLQHPILIN 93 (141)
T ss_pred chHHHcCCCcc-CCCHHHHHHHHHhCcccee
Confidence 34555555544 4799999999999998875
No 135
>TIGR00014 arsC arsenate reductase (glutaredoxin). composed of two polypeptides, the products of the arsA and arsB genes. The pump alone produces resistance to arsenite and antimonite. This protein, ArsC, catalyzes the reduction of arsenate to arsenite, and thus extends resistance to include arsenate.
Probab=42.85 E-value=20 Score=20.41 Aligned_cols=20 Identities=20% Similarity=0.403 Sum_probs=16.9
Q ss_pred CcccHHHHHHHHcCChHHHH
Q psy15330 32 GRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 32 G~Is~~EF~~~~~~~~~il~ 51 (60)
..+|-+|+++.+.++|.+++
T Consensus 72 ~~ls~~e~i~~l~~~P~Lik 91 (114)
T TIGR00014 72 PNLSDQELLDAMVAHPILLE 91 (114)
T ss_pred cCCCHHHHHHHHHHCcCccc
Confidence 36788999999999998875
No 136
>PF07765 KIP1: KIP1-like protein; InterPro: IPR011684 This is a group of sequences found exclusively in plants. They are similar to kinase interacting protein 1 (KIP1), which has been found to interact with the kinase domain of PRK1, a receptor-like kinase []. This particular region contains two coiled-coils, which are described as motifs involved in protein-protein interactions []. It has also been suggested that the coiled-coils of the protein allow it to dimerise in vivo [].
Probab=42.04 E-value=54 Score=18.02 Aligned_cols=39 Identities=13% Similarity=0.336 Sum_probs=26.8
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~ 54 (60)
..++.+..+.+.++.|+|-.=-..|. ..++.|.|+.++.
T Consensus 22 dmd~kvk~mlklieedgdSfakrAEm--yy~kRp~Li~~ve 60 (74)
T PF07765_consen 22 DMDEKVKAMLKLIEEDGDSFAKRAEM--YYKKRPELISLVE 60 (74)
T ss_pred HHHHHHHHHHHHhccCcchHHHhhHH--HhcccHHHHHHHH
Confidence 46778889999999877543333332 2567888888764
No 137
>PF13591 MerR_2: MerR HTH family regulatory protein
Probab=41.13 E-value=37 Score=18.42 Aligned_cols=20 Identities=45% Similarity=0.717 Sum_probs=16.7
Q ss_pred ccHHHHHHHHcCChHHHHHh
Q psy15330 34 LSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 34 Is~~EF~~~~~~~~~il~~l 53 (60)
||++||...+.-+++.+..+
T Consensus 1 is~~e~~~~~~i~~~~l~~l 20 (84)
T PF13591_consen 1 ISLEEFCEACGIEPEFLREL 20 (84)
T ss_pred CCHHHHHHHHCcCHHHHHHH
Confidence 78999999988888877665
No 138
>TIGR01617 arsC_related transcriptional regulator, Spx/MgsR family. This model represents a portion of the proteins within the larger set covered by Pfam model pfam03960. That larger family includes a glutaredoxin-dependent arsenate reductase (TIGR00014). Characterized members of this family include Spx and MgsR from Bacillus subtili. Spx is a global regulator for response to thiol-specific oxidative stress. It interacts with RNA polymerase. MgsR (modulator of the general stress response, also called YqgZ) provides a second level of regulation for more than a third of the proteins in the B. subtilis general stress regulon controlled by Sigma-B.
Probab=41.06 E-value=14 Score=21.01 Aligned_cols=20 Identities=35% Similarity=0.390 Sum_probs=17.0
Q ss_pred CcccHHHHHHHHcCChHHHH
Q psy15330 32 GRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 32 G~Is~~EF~~~~~~~~~il~ 51 (60)
..+|.+|+++.+..+|.+++
T Consensus 73 ~~ls~~e~~~~i~~~p~Lik 92 (117)
T TIGR01617 73 LDLSDKEALELLAEDPALLR 92 (117)
T ss_pred ccCCHHHHHHHHHhCcceEe
Confidence 56899999999999998864
No 139
>PF07492 Trehalase_Ca-bi: Neutral trehalase Ca2+ binding domain; InterPro: IPR011120 Neutral trehalases mobilise trehalose accumulated by fungal cells as a protective and storage carbohydrate. This family represents a calcium-binding domain similar to EF hand. Residues 97 and 108 in O42893 from SWISSPROT have been implicated in this interaction. It is thought that this domain may provide a general mechanism for regulating neutral trehalase activity in yeasts and filamentous fungi [].; GO: 0004555 alpha,alpha-trehalase activity, 0005509 calcium ion binding, 0005993 trehalose catabolic process, 0005737 cytoplasm
Probab=40.50 E-value=6.5 Score=18.09 Aligned_cols=18 Identities=17% Similarity=0.388 Sum_probs=13.7
Q ss_pred HHHHHhhCCCCCCcccHH
Q psy15330 20 DKIFRQMDKNQDGRLSLE 37 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~ 37 (60)
+.+...=|+|+|-+|+.+
T Consensus 2 ~~LL~qEDTDgn~qITIe 19 (30)
T PF07492_consen 2 RSLLEQEDTDGNFQITIE 19 (30)
T ss_pred hhHhhccccCCCcEEEEe
Confidence 345667799999999865
No 140
>smart00763 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain.
Probab=40.08 E-value=27 Score=24.33 Aligned_cols=18 Identities=33% Similarity=0.516 Sum_probs=15.7
Q ss_pred ccHHHHHHHHcCChHHHH
Q psy15330 34 LSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 34 Is~~EF~~~~~~~~~il~ 51 (60)
.||+||++.++.+|.+.+
T Consensus 1 ~s~~eyL~~~~~~p~~~~ 18 (361)
T smart00763 1 GSLAEYLDIVKEDPLVAA 18 (361)
T ss_pred CCHHHHHHHHhhChHHHH
Confidence 489999999999998764
No 141
>PF13220 DUF4028: Protein of unknown function (DUF4028)
Probab=39.85 E-value=53 Score=17.24 Aligned_cols=36 Identities=19% Similarity=0.365 Sum_probs=28.0
Q ss_pred CcHHHHHHHHHHhhCCCCC-CcccHHHHHHHHcCChH
Q psy15330 13 STPEKRTDKIFRQMDKNQD-GRLSLEEFIEGAKSDPS 48 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~d-G~Is~~EF~~~~~~~~~ 48 (60)
..-++++.++|++..-+.. |+-+|.|..+-.+.+.+
T Consensus 20 kng~~yvkkwf~khen~eelgrptfkevekdwkenre 56 (65)
T PF13220_consen 20 KNGDQYVKKWFSKHENNEELGRPTFKEVEKDWKENRE 56 (65)
T ss_pred ccHhHHHHHHHHhccchhhhCCCcHHHHHHHHHHhhh
Confidence 4577899999999877664 89999988877665543
No 142
>PRK06798 fliD flagellar capping protein; Validated
Probab=39.74 E-value=19 Score=25.52 Aligned_cols=27 Identities=15% Similarity=0.489 Sum_probs=22.4
Q ss_pred CCCCCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 27 DKNQDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 27 D~d~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
..+.+|.|++++ |.+.+..||+-+..|
T Consensus 324 ~~~~~G~L~lD~~kL~~al~~np~~V~~l 352 (440)
T PRK06798 324 QVDKEGTMKVDEEALKKALKENPDAAKQF 352 (440)
T ss_pred EeCCCCCEEEcHHHHHHHHHHCHHHHHHH
Confidence 346789999988 999999999887655
No 143
>KOG0046|consensus
Probab=39.65 E-value=36 Score=25.45 Aligned_cols=27 Identities=22% Similarity=0.431 Sum_probs=22.8
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+..-|.++| |++|+|+..|....+.+
T Consensus 20 ~l~~kF~~~d-~~~G~v~~~~l~~~f~k 46 (627)
T KOG0046|consen 20 ELKEKFNKLD-DQKGYVTVYELPDAFKK 46 (627)
T ss_pred HHHHHHHhhc-CCCCeeehHHhHHHHHH
Confidence 3567799999 99999999999887765
No 144
>KOG4403|consensus
Probab=39.46 E-value=23 Score=25.86 Aligned_cols=31 Identities=29% Similarity=0.372 Sum_probs=24.1
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.-+-+..|-+.||-|.+|.|..+|=-.+++.
T Consensus 66 g~EAir~iHrqmDDD~nG~Id~~ESdeFlrE 96 (575)
T KOG4403|consen 66 GYEAIRDIHRQMDDDHNGSIDVEESDEFLRE 96 (575)
T ss_pred hHHHHHHHHHhcccccCCCcccccchHHHHH
Confidence 3345778999999999999999875555543
No 145
>PF05383 La: La domain; InterPro: IPR006630 Human Ro ribonucleoproteins (RNPs) are composed of one of the four small Y RNAs and at least two proteins, Ro60 and La. The La protein is a 47 kDa polypeptide that frequently acts as an autoantigen in systemic lupus erythematosus and Sjogren's syndrome []. In the nucleus, La acts as a RNA polymerase III (RNAP III) transcription factor, while in the cytoplasm, La acts as a translation factor []. In the nucleus, La binds to the 3'UTR of nascent RNAP III transcripts to assist in folding and maturation []. In the cytoplasm, La recognises specific classes of mRNAs that contain a 5'-terminal oligopyrimidine (5'TOP) motif known to control protein synthesis []. The specific recognition is mediated by the N-terminal domain of La, which comprises a La motif and a RNA recognition motif (RRM). The La motif adopts an alpha/beta fold that comprises a winged-helix motif []. Homologous La domain-containing proteins have been identified in a wide range of organisms except Archaea, bacteria and viruses [].; PDB: 1S29_A 1YTY_B 2VOO_B 1S7A_A 2VOP_A 2VON_B 1ZH5_B 2VOD_A 2CQK_A.
Probab=39.22 E-value=34 Score=17.68 Aligned_cols=20 Identities=30% Similarity=0.503 Sum_probs=16.7
Q ss_pred HHHHhhCCCCCCcccHHHHH
Q psy15330 21 KIFRQMDKNQDGRLSLEEFI 40 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~ 40 (60)
-+.+.++.+++|.|+...+.
T Consensus 19 fL~~~~~~~~~g~Vpi~~i~ 38 (61)
T PF05383_consen 19 FLRSQMDSNPDGWVPISTIL 38 (61)
T ss_dssp HHHHHHCTTTTTBEEHHHHT
T ss_pred HHHHHHHhcCCCcEeHHHHH
Confidence 45678899999999998876
No 146
>TIGR03296 M6dom_TIGR03296 M6 family metalloprotease domain. This model describes a metalloproteinase domain, with a characteristic HExxH motif. Examples of this domain are found in proteins in the family of immune inhibitor A, which cleaves antibacterial peptides, and in other, only distantly related proteases. This model is built to be broader and more inclusive than Pfam model pfam05547.
Probab=38.62 E-value=14 Score=24.46 Aligned_cols=16 Identities=25% Similarity=0.528 Sum_probs=12.6
Q ss_pred HHhhCCCCCCcccHHH
Q psy15330 23 FRQMDKNQDGRLSLEE 38 (60)
Q Consensus 23 f~~~D~d~dG~Is~~E 38 (60)
|..+|.|+||.|.-.|
T Consensus 92 f~~yD~dgDG~vd~~d 107 (286)
T TIGR03296 92 FDRYDLDGDGNFDEPD 107 (286)
T ss_pred cccccccCCCccCCCC
Confidence 5679999999996533
No 147
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=37.34 E-value=40 Score=16.22 Aligned_cols=19 Identities=42% Similarity=0.712 Sum_probs=13.9
Q ss_pred HHHHHHHHcCChHHHHHhc
Q psy15330 36 LEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 36 ~~EF~~~~~~~~~il~~l~ 54 (60)
...|+..+..||.+..-|.
T Consensus 6 l~~Fl~~~~~d~~l~~~l~ 24 (49)
T PF07862_consen 6 LKAFLEKVKSDPELREQLK 24 (49)
T ss_pred HHHHHHHHhcCHHHHHHHH
Confidence 5677888888888776653
No 148
>PF05901 Excalibur: Excalibur calcium-binding domain; InterPro: IPR008613 Extracellular Ca2+-dependent nuclease YokF from Bacillus subtilis and several other surface-exposed proteins from diverse bacteria are encoded in the genomes in two paralogous forms that differ by a ~45 amino acid fragment, which comprises a novel conserved domain. Sequence analysis of this domain revealed a conserved DxDxDGxxCE motif, which is strikingly similar to the Ca2+-binding loop of the calmodulin-like EF-hand domains, suggesting an evolutionary relationship between them. Functions of many of the other proteins in which the novel domain, named Excalibur (extracellular calcium-binding region), is found, as well as a structural model of its conserved motif are consistent with the notion that the Excalibur domain binds calcium. This domain is but one more example of the diversity of structural contexts surrounding the EF-hand-like calcium-binding loop in bacteria. This loop is thus more widespread than hitherto recognised and the evolution of EF-hand-like domains is probably more complex than previously appreciated [].
Probab=36.67 E-value=19 Score=16.86 Aligned_cols=9 Identities=33% Similarity=0.807 Sum_probs=7.4
Q ss_pred hhCCCCCCc
Q psy15330 25 QMDKNQDGR 33 (60)
Q Consensus 25 ~~D~d~dG~ 33 (60)
.+|.|+||.
T Consensus 26 ~LDrD~DGi 34 (37)
T PF05901_consen 26 KLDRDGDGI 34 (37)
T ss_pred cccCCCCCC
Confidence 579999985
No 149
>PF08298 AAA_PrkA: PrkA AAA domain; InterPro: IPR013153 This is entry is found at the N terminus of PrkA proteins - bacterial and archaeal serine kinases approximately 630 residues in length. PrkA possesses the A-motif of nucleotide-binding proteins and exhibits distant homology to eukaryotic protein kinases []. Note that many of these are hypothetical.
Probab=35.88 E-value=30 Score=24.22 Aligned_cols=18 Identities=39% Similarity=0.709 Sum_probs=15.7
Q ss_pred ccHHHHHHHHcCChHHHH
Q psy15330 34 LSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 34 Is~~EF~~~~~~~~~il~ 51 (60)
+||.||++.++.+|.+..
T Consensus 1 ms~~eYL~~~~~~p~~~~ 18 (358)
T PF08298_consen 1 MSFAEYLELVKEDPSVAA 18 (358)
T ss_pred CCHHHHHHHHHhCcHhhh
Confidence 589999999999998764
No 150
>KOG2562|consensus
Probab=34.33 E-value=47 Score=24.31 Aligned_cols=28 Identities=21% Similarity=0.466 Sum_probs=24.2
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcCC
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~~ 46 (60)
...+|++++.|..|.|+..-|+....+.
T Consensus 141 s~~~f~k~~~d~~g~it~~~Fi~~~~~~ 168 (493)
T KOG2562|consen 141 SASTFRKIDGDDTGHITRDKFINYWMRG 168 (493)
T ss_pred chhhhhhhccCcCCceeHHHHHHHHHhh
Confidence 3578999999999999999999877654
No 151
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=34.30 E-value=35 Score=17.54 Aligned_cols=21 Identities=24% Similarity=0.297 Sum_probs=17.0
Q ss_pred cccHHHHHHHHcCChHHHHHh
Q psy15330 33 RLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 33 ~Is~~EF~~~~~~~~~il~~l 53 (60)
.|+.+.|.+.+.++|.+...+
T Consensus 94 ~i~~~~~~~~~~~~~~~~~~~ 114 (115)
T cd00038 94 VLPRSDFRRLLQEYPELARRL 114 (115)
T ss_pred EEeHHHHHHHHHHCcHhHHhc
Confidence 588899999998888877654
No 152
>PF07531 TAFH: NHR1 homology to TAF; InterPro: IPR003894 The TAF homology (TAFH) or Nervy homology region 1 (NHR1) domain is a domain of 95-100 amino acids present in eukaryotic proteins of the MTG/ETO family and whereof the core ~75-80 residues occur in TAF proteins. The transcription initiation TFIID complex is composed of TATA binding protein (TBP) and a number of TBP-associated factors (TAFs). The TAFH/NHR1 domain is named after fruit fly TATA-box-associated factor 110 (TAF110), human TAF105 and TAF130, and the fruit fly protein Nervy, which is a homologue of human MTG8/ETO [, ]. The human eight twenty-one (ETO or MTG8) and related myeloid transforming gene products MTGR1 and MTG16 as well as the Nervy protein contain the NHR1-4 domains. The NHR1/TAFH domain occurs in the N-terminal part of these proteins, while a MYND-type zinc finger forms the NHR4 domain []. The TAFH/NHR1 domain can be involved in protein-protein interactions, e.g in MTG8/ETO with HSP90 and Gfi-1 []. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2P6V_A 2KNH_A 2PP4_A 2H7B_A.
Probab=33.78 E-value=35 Score=19.59 Aligned_cols=15 Identities=33% Similarity=0.645 Sum_probs=11.6
Q ss_pred CCcccHHHHHHHHcC
Q psy15330 31 DGRLSLEEFIEGAKS 45 (60)
Q Consensus 31 dG~Is~~EF~~~~~~ 45 (60)
+|.|+.|||...+..
T Consensus 38 ~~~i~~EeF~~~Lq~ 52 (96)
T PF07531_consen 38 DGKIEAEEFTSKLQE 52 (96)
T ss_dssp TTSS-HHHHHHHHHH
T ss_pred cCCCCHHHHHHHHHH
Confidence 689999999987653
No 153
>PF10891 DUF2719: Protein of unknown function (DUF2719); InterPro: IPR020122 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf56; it is a family of uncharacterised viral proteins.
Probab=33.46 E-value=27 Score=19.52 Aligned_cols=15 Identities=27% Similarity=0.270 Sum_probs=11.4
Q ss_pred CCcccHHHHHHHHcC
Q psy15330 31 DGRLSLEEFIEGAKS 45 (60)
Q Consensus 31 dG~Is~~EF~~~~~~ 45 (60)
-+.||++||+..=.+
T Consensus 33 PmSIS~eeY~~LH~~ 47 (81)
T PF10891_consen 33 PMSISFEEYIRLHIK 47 (81)
T ss_pred ccEeeHHHHHHHHHH
Confidence 477999999975443
No 154
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=33.42 E-value=26 Score=19.80 Aligned_cols=29 Identities=28% Similarity=0.397 Sum_probs=19.2
Q ss_pred HHhhCCCCC-CcccHHHHHHHHcCChHHHH
Q psy15330 23 FRQMDKNQD-GRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 23 f~~~D~d~d-G~Is~~EF~~~~~~~~~il~ 51 (60)
|+.+..+.. ..+|.+|.++.+..+|.+++
T Consensus 63 ~~~l~~~~~~~~~s~~e~~~~l~~~p~Lik 92 (111)
T cd03036 63 YRELGLKDKLPSLSEEEALELLSSDGMLIK 92 (111)
T ss_pred HHhCCcccccccCCHHHHHHHHHhCcCeee
Confidence 444444322 23588889999998888765
No 155
>cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine.
Probab=32.38 E-value=35 Score=19.58 Aligned_cols=29 Identities=21% Similarity=0.213 Sum_probs=20.4
Q ss_pred HHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 22 IFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.|+....+. ..+|-+|.++.+..+|.+++
T Consensus 61 ~~r~~~~~~-~~ls~~e~~~ll~~~P~Lik 89 (113)
T cd03033 61 RVKSGEVVP-EALDEEEALALMIADPLLIR 89 (113)
T ss_pred HHHhcCCCc-cCCCHHHHHHHHHhCcceee
Confidence 344444333 45788999999999998875
No 156
>TIGR01616 nitro_assoc nitrogenase-associated protein. This model describes a small family of uncharacterized proteins found so far in alpha and gamma proteobacteria and in Nostoc sp. PCC 7120, a cyanobacterium. The gene for this protein is associated with nitrogenase genes. This family shows sequence similarity to TIGR00014, a glutaredoxin-dependent arsenate reductase that converts arsentate to arsenite for disposal. This family is one of several included in Pfam model pfam03960.
Probab=32.24 E-value=32 Score=20.24 Aligned_cols=29 Identities=17% Similarity=0.220 Sum_probs=20.6
Q ss_pred HHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 22 IFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.|+..+.+. ..+|-+|.++.|..+|.|++
T Consensus 62 ~~r~l~~~~-~~ls~~e~i~lm~~~P~LIK 90 (126)
T TIGR01616 62 RVKSGEVNP-DSIDEASALALMVSDPLLIR 90 (126)
T ss_pred HhhhCCCCc-ccCCHHHHHHHHHhCcCeEe
Confidence 445555443 45788999999999988775
No 157
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=32.14 E-value=55 Score=16.77 Aligned_cols=20 Identities=35% Similarity=0.665 Sum_probs=15.4
Q ss_pred cHHHHHHHHcCChHHHHHhc
Q psy15330 35 SLEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 35 s~~EF~~~~~~~~~il~~l~ 54 (60)
++..|+..+.+||.+..-|.
T Consensus 3 ~l~~Fl~~~~~d~~L~~~l~ 22 (64)
T TIGR03798 3 QLKAFLEKVKTDPDLREKLK 22 (64)
T ss_pred HHHHHHHHHHcCHHHHHHHH
Confidence 35788888889988877663
No 158
>COG1565 Uncharacterized conserved protein [Function unknown]
Probab=32.03 E-value=34 Score=24.12 Aligned_cols=19 Identities=42% Similarity=0.658 Sum_probs=16.1
Q ss_pred CCCCcccHHHHHHHHcCCh
Q psy15330 29 NQDGRLSLEEFIEGAKSDP 47 (60)
Q Consensus 29 d~dG~Is~~EF~~~~~~~~ 47 (60)
...|.|||.+|++.+..+|
T Consensus 13 ~~~g~i~f~~fM~~~L~~p 31 (370)
T COG1565 13 AQGGPISFSDFMELALYDP 31 (370)
T ss_pred hcCCCccHHHHHHHHHcCC
Confidence 4679999999999888765
No 159
>PF10376 Mei5: Double-strand recombination repair protein ; InterPro: IPR018468 Mei5 is one of a pair of meiosis-specific proteins which facilitate the loading of Dmc1 on to Rad51 on DNA at double-strand breaks during recombination. Recombination is carried out by a large protein complex based around the two RecA homologues, Rad51 and Dmc1 []. This complex may play both a catalytic and a structural role in the interaction between homologous chromosomes during meiosis. Mei5 is seen to contain a coiled-coli region.
Probab=31.95 E-value=79 Score=20.54 Aligned_cols=35 Identities=11% Similarity=0.296 Sum_probs=27.8
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHH
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIV 50 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il 50 (60)
...+.+++..+..+.+...|..|++..+.=|+.|+
T Consensus 184 q~~l~eL~~~~~~~e~~~~TM~eL~~~l~ID~~LI 218 (221)
T PF10376_consen 184 QEALYELQSEMSEEEGEKFTMGELIKRLGIDYDLI 218 (221)
T ss_pred HHHHHHHHHHHhhccccCccHHHHHHHhCCCcccc
Confidence 34566777888887788999999999988787765
No 160
>cd08029 LA_like_fungal La-motif domain of fungal proteins similar to the La autoantigen. This domain is found in fungal proteins related to the La autoantigen. A variety of La-related proteins (LARPs or La ribonucleoproteins), with differing domain architecture, appear to function as RNA-binding proteins in eukaryotic cellular processes.
Probab=31.37 E-value=83 Score=17.04 Aligned_cols=33 Identities=9% Similarity=0.251 Sum_probs=22.3
Q ss_pred HHHhhCCCCCCcccHHHHHH-----HHcCChHHHHHhc
Q psy15330 22 IFRQMDKNQDGRLSLEEFIE-----GAKSDPSIVRLLQ 54 (60)
Q Consensus 22 if~~~D~d~dG~Is~~EF~~-----~~~~~~~il~~l~ 54 (60)
+.+.+..+.+|.|+.+.+.. .+..+..|..+|.
T Consensus 23 Lr~~~~~~~~G~Vpl~~i~~F~rmk~l~~~~~i~~Al~ 60 (76)
T cd08029 23 LWTLTGGSNNGWVPIKTIASFKRMRRFQPLEAVVEALR 60 (76)
T ss_pred HHHHhccCCCCcEehHHHhCchHHHHcCCHHHHHHHHH
Confidence 44567789999999988873 3334455666553
No 161
>PRK07077 hypothetical protein; Provisional
Probab=30.61 E-value=67 Score=21.00 Aligned_cols=25 Identities=24% Similarity=0.433 Sum_probs=20.4
Q ss_pred CCCCcccHHHHHHHHcCChHHHHHh
Q psy15330 29 NQDGRLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 29 d~dG~Is~~EF~~~~~~~~~il~~l 53 (60)
+.||.+++..|...+.+.|..+.-|
T Consensus 182 ~~~g~~~~~~~l~~l~r~P~~i~~L 206 (238)
T PRK07077 182 RDDGSTDILPILRGLARQPSQLGAL 206 (238)
T ss_pred CCCcCcCHHHHHHHHHhChHHHHHH
Confidence 4899999999999888888766543
No 162
>TIGR02675 tape_meas_nterm tape measure domain. Proteins containing this domain are strictly bacterial, including bacteriophage and prophage regions of bacterial genomes. Most members are 800 to 1800 amino acids long, making them among the longest predicted proteins of their respective phage genomes, where they are encoded in tail protein regions. This roughly 80-residue domain described here usually begins between residue 100 and 250. Many members are known or predicted to act as phage tail tape measure proteins, a minor tail component that regulates tail length.
Probab=29.91 E-value=63 Score=17.22 Aligned_cols=25 Identities=24% Similarity=0.274 Sum_probs=20.9
Q ss_pred CCCcccHHHHHHHHcCChHHHHHhc
Q psy15330 30 QDGRLSLEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~~il~~l~ 54 (60)
..|+++-+||...+..-|.+.+.|.
T Consensus 27 ~~Gkv~~ee~n~~~e~~p~~~~~lA 51 (75)
T TIGR02675 27 ASGKLRGEEINSLLEALPGALQALA 51 (75)
T ss_pred HcCcccHHHHHHHHHHhHHHHHHHH
Confidence 5799999999999888887777663
No 163
>PRK07737 fliD flagellar capping protein; Validated
Probab=29.85 E-value=38 Score=24.32 Aligned_cols=24 Identities=29% Similarity=0.555 Sum_probs=20.4
Q ss_pred CCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 30 QDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 30 ~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
.+|.++++| |.+.+..||+-+..|
T Consensus 375 ~~G~L~iD~~kl~~Al~~n~~~V~~l 400 (501)
T PRK07737 375 DGGKLEIDETKLRQKIKENPDAVYQL 400 (501)
T ss_pred cCCeEEEcHHHHHHHHHHCHHHHHHH
Confidence 589999888 999999999877654
No 164
>PF14071 YlbD_coat: Putative coat protein
Probab=28.69 E-value=31 Score=20.70 Aligned_cols=17 Identities=41% Similarity=0.581 Sum_probs=13.7
Q ss_pred HHHHHHHcCChHHHHHh
Q psy15330 37 EEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 37 ~EF~~~~~~~~~il~~l 53 (60)
+||++.++++|.|+.=+
T Consensus 1 ~eFKeFVk~hPkii~eV 17 (124)
T PF14071_consen 1 EEFKEFVKKHPKIIQEV 17 (124)
T ss_pred ChhHHHHHhCchHHHHH
Confidence 47899999999988644
No 165
>PF03705 CheR_N: CheR methyltransferase, all-alpha domain; InterPro: IPR022641 CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the N-terminal domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF01739 from PFAM. Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region [].; PDB: 1AF7_A 1BC5_A.
Probab=28.69 E-value=36 Score=16.55 Aligned_cols=17 Identities=41% Similarity=0.589 Sum_probs=9.7
Q ss_pred CcccHHHHHHHHcCChH
Q psy15330 32 GRLSLEEFIEGAKSDPS 48 (60)
Q Consensus 32 G~Is~~EF~~~~~~~~~ 48 (60)
|.=||.+|...++.+|+
T Consensus 40 ~~~~~~~y~~~L~~d~~ 56 (57)
T PF03705_consen 40 GLPSFAEYYELLRSDPD 56 (57)
T ss_dssp T---HHHHHHHHHH-T-
T ss_pred CCCCHHHHHHHHHhCCC
Confidence 45588899888887763
No 166
>PRK15455 PrkA family serine protein kinase; Provisional
Probab=28.62 E-value=83 Score=23.84 Aligned_cols=31 Identities=26% Similarity=0.457 Sum_probs=24.1
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCChHHHH
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.+=+.++...+-..||.||++.|+.+|.+..
T Consensus 6 ~~~~~~~~~~~~~~sl~eyL~~vk~~p~~~~ 36 (644)
T PRK15455 6 HYQQRYEAAKEEEFSLQEYLELCKQDPSAYA 36 (644)
T ss_pred HHHHHHHHhhcccccHHHHHHHHhcChHHHh
Confidence 3334556667777899999999999998765
No 167
>PF11363 DUF3164: Protein of unknown function (DUF3164); InterPro: IPR021505 This entry is represented by Bacteriophage B3, Orf6. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=28.41 E-value=1.6e+02 Score=18.69 Aligned_cols=38 Identities=11% Similarity=0.212 Sum_probs=27.0
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHc---CChHHHHHhc
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAK---SDPSIVRLLQ 54 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~---~~~~il~~l~ 54 (60)
+...-+.+.|.+|+.|.|+...-+...+ .||...+.+.
T Consensus 119 ~l~~lV~~af~~dk~G~l~~~rIl~Lrrl~i~D~~w~~am~ 159 (195)
T PF11363_consen 119 ELRALVNRAFQVDKEGNLNTSRILGLRRLEIDDERWQEAMD 159 (195)
T ss_pred HHHHHHHHHHhcCCCCCcCHHHHHHHHhccCCCHHHHHHHH
Confidence 3445566778899999999998776544 3676666653
No 168
>PRK06664 fliD flagellar hook-associated protein FliD; Validated
Probab=27.65 E-value=44 Score=25.07 Aligned_cols=25 Identities=48% Similarity=0.566 Sum_probs=20.7
Q ss_pred CCCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 29 NQDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 29 d~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
+.+|.++++| |.+.+..||+-+..|
T Consensus 539 ~~dG~L~lDe~KL~~AL~~np~~V~~l 565 (661)
T PRK06664 539 IQDGYLELDEKKLDEVLKENPDSVREL 565 (661)
T ss_pred CCCccEEEcHHHHHHHHHhCHHHHHHH
Confidence 4689999998 999999999776654
No 169
>PLN02230 phosphoinositide phospholipase C 4
Probab=27.49 E-value=1.1e+02 Score=22.93 Aligned_cols=33 Identities=18% Similarity=0.284 Sum_probs=26.0
Q ss_pred CCcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 12 ESTPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 12 ~~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
...+...+..+|.++-.++ +.+|.++|...+..
T Consensus 24 ~~~p~~ei~~lf~~~s~~~-~~mt~~~l~~FL~~ 56 (598)
T PLN02230 24 ESGPVADVRDLFEKYADGD-AHMSPEQLQKLMAE 56 (598)
T ss_pred cCCCcHHHHHHHHHHhCCC-CccCHHHHHHHHHH
Confidence 3456778899999995444 89999999988764
No 170
>PF04380 BMFP: Membrane fusogenic activity; InterPro: IPR007475 BMFP consists of two structural domains, a coiled-coil C-terminal domain via which the protein self-associates as a trimer, and an N-terminal domain disordered at neutral pH but adopting an amphipathic alpha-helical structure in the presence of phospholipid vesicles, high ionic strength, acidic pH or SDS. BMFP interacts with phospholipid vesicles though the predicted amphipathic alpha-helix induced in the N-terminal half of the protein and promotes aggregation and fusion of vesicles in vitro.
Probab=26.63 E-value=62 Score=17.51 Aligned_cols=23 Identities=22% Similarity=0.315 Sum_probs=16.4
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHH
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIE 41 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~ 41 (60)
.....++..+.++| .+|.+||-.
T Consensus 32 ~~r~~l~~~l~kld-----lVtREEFd~ 54 (79)
T PF04380_consen 32 NIRARLQSALSKLD-----LVTREEFDA 54 (79)
T ss_pred HHHHHHHHHHHHCC-----CCcHHHHHH
Confidence 34456677777776 599999974
No 171
>PRK12765 flagellar capping protein; Provisional
Probab=26.44 E-value=39 Score=24.90 Aligned_cols=26 Identities=31% Similarity=0.537 Sum_probs=21.5
Q ss_pred CCCCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 28 KNQDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 28 ~d~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
.+.+|.|++++ |.+.+..||+-+..|
T Consensus 370 ~~~dG~L~lD~~kL~~AL~~np~~V~~l 397 (595)
T PRK12765 370 LNSDGTLSFDSSKFEQKVSEDPDSTQSF 397 (595)
T ss_pred ECCCCcEEEcHHHHHHHHHHCHHHHHHH
Confidence 45789999888 999999999877654
No 172
>PF12631 GTPase_Cys_C: Catalytic cysteine-containing C-terminus of GTPase, MnmE; PDB: 1XZQ_A 1XZP_A 2GJ8_D 3GEH_A 3GEI_B 3GEE_A.
Probab=26.34 E-value=50 Score=17.35 Aligned_cols=15 Identities=33% Similarity=0.483 Sum_probs=10.6
Q ss_pred CCcHHHHHHHHHHhh
Q psy15330 12 ESTPEKRTDKIFRQM 26 (60)
Q Consensus 12 ~~~~~~~~~~if~~~ 26 (60)
+...++.+++||+.|
T Consensus 58 ~~~~ediLd~IFs~F 72 (73)
T PF12631_consen 58 EVVTEDILDNIFSNF 72 (73)
T ss_dssp SS--HHHHHHHHCTS
T ss_pred CCChHHHHHHHHHhh
Confidence 356888999999876
No 173
>PF08100 Dimerisation: Dimerisation domain; InterPro: IPR012967 This domain is found at the N terminus of a variety of plant O-methyltransferases. It has been shown to mediate dimerisation of these proteins [].; GO: 0008168 methyltransferase activity, 0046983 protein dimerization activity; PDB: 1ZGJ_A 1ZG3_A 1ZHF_A 1ZGA_A 2QYO_A 1KYW_A 1KYZ_A 3REO_D 1FPX_A 1FP2_A ....
Probab=25.48 E-value=36 Score=17.09 Aligned_cols=27 Identities=30% Similarity=0.353 Sum_probs=18.4
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHc-CCh
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAK-SDP 47 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~-~~~ 47 (60)
.+|..+...+++.+|.+|....+. .+|
T Consensus 10 gI~dii~~~g~~~ls~~eia~~l~~~~p 37 (51)
T PF08100_consen 10 GIPDIIHNAGGGPLSLSEIAARLPTSNP 37 (51)
T ss_dssp THHHHHHHHTTS-BEHHHHHHTSTCT-T
T ss_pred CcHHHHHHcCCCCCCHHHHHHHcCCCCc
Confidence 345566666689999999998877 444
No 174
>KOG0042|consensus
Probab=25.43 E-value=61 Score=24.53 Aligned_cols=30 Identities=27% Similarity=0.450 Sum_probs=24.4
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHc
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~ 44 (60)
+++..+++.++.|.+.+|.++..||.+.++
T Consensus 627 d~~~~~~~l~ea~~~~~g~v~l~e~~q~~s 656 (680)
T KOG0042|consen 627 DEDRLHEELQEADENLNGFVELREFLQLMS 656 (680)
T ss_pred CHHHHHHHHHHHHHhhcceeeHHHHHHHHH
Confidence 456778888888999999999999887654
No 175
>smart00549 TAFH TAF homology. Domain in Drosophila nervy, CBFA2T1, human TAF105, human TAF130, and Drosophila TAF110. Also known as nervy homology region 1 (NHR1).
Probab=25.14 E-value=54 Score=18.75 Aligned_cols=27 Identities=19% Similarity=0.393 Sum_probs=17.7
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+.+..+...+ -||.|+-|||-..+..
T Consensus 25 ~~~Vr~LV~~L---~~~~i~~EeF~~~Lq~ 51 (92)
T smart00549 25 AERVRTLVLGL---VNGTITAEEFTSRLQE 51 (92)
T ss_pred HHHHHHHHHHH---HhCCCCHHHHHHHHHH
Confidence 34444444433 3799999999976653
No 176
>KOG1265|consensus
Probab=25.03 E-value=1.1e+02 Score=24.63 Aligned_cols=37 Identities=22% Similarity=0.471 Sum_probs=29.9
Q ss_pred HHHHHHHHhhCCCCCCcccHHHHHHHHc---CChHHHHHh
Q psy15330 17 KRTDKIFRQMDKNQDGRLSLEEFIEGAK---SDPSIVRLL 53 (60)
Q Consensus 17 ~~~~~if~~~D~d~dG~Is~~EF~~~~~---~~~~il~~l 53 (60)
-.+++||.++--++.-++|.++|+.++. +||.|-.+|
T Consensus 221 ~eie~iF~ki~~~~kpylT~~ql~dfln~~QrDpRLNeil 260 (1189)
T KOG1265|consen 221 PEIEEIFRKISGKKKPYLTKEQLVDFLNKKQRDPRLNEIL 260 (1189)
T ss_pred hhHHHHHHHhccCCCccccHHHHHHHHhhhccCcchhhhh
Confidence 3588999999999989999999888776 477666655
No 177
>PF07879 PHB_acc_N: PHB/PHA accumulation regulator DNA-binding domain; InterPro: IPR012909 This domain is found at the N terminus of the polyhydroxyalkanoate (PHA) synthesis regulators. These regulators have been shown to directly bind DNA and PHA []. The invariant nature of this domain compared to the C-terminal IPR007897 from INTERPRO domain(s) suggests that it contains the DNA-binding function.
Probab=24.85 E-value=40 Score=18.05 Aligned_cols=23 Identities=17% Similarity=0.333 Sum_probs=19.6
Q ss_pred HhhCCCCCCcccHHHHHHHHcCC
Q psy15330 24 RQMDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 24 ~~~D~d~dG~Is~~EF~~~~~~~ 46 (60)
+.+|+....+|++++..+.++..
T Consensus 10 RLYDT~~s~YiTL~di~~lV~~g 32 (64)
T PF07879_consen 10 RLYDTETSSYITLEDIAQLVREG 32 (64)
T ss_pred ccccCCCceeEeHHHHHHHHHCC
Confidence 46899999999999999888764
No 178
>PF00027 cNMP_binding: Cyclic nucleotide-binding domain; InterPro: IPR000595 Proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues [, , ]. The best studied of these proteins is the prokaryotic catabolite gene activator (also known as the cAMP receptor protein) (gene crp) where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure. There are six invariant amino acids in this domain, three of which are glycine residues that are thought to be essential for maintenance of the structural integrity of the beta-barrel. cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain. The cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain. The cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section. Vertebrate cyclic nucleotide-gated ion-channels also contain this domain. Two such cations channels have been fully characterised, one is found in rod cells where it plays a role in visual signal transduction.; PDB: 1O7F_A 2BYV_E 3E97_A 3U10_A 2H6B_A 3SHR_A 2OZ6_A 1WGP_A 3LA2_A 3LA3_B ....
Probab=24.83 E-value=44 Score=16.76 Aligned_cols=16 Identities=31% Similarity=0.669 Sum_probs=11.4
Q ss_pred cccHHHHHHHHcCChH
Q psy15330 33 RLSLEEFIEGAKSDPS 48 (60)
Q Consensus 33 ~Is~~EF~~~~~~~~~ 48 (60)
.|+.+.|.+.++++|+
T Consensus 76 ~i~~~~~~~~~~~~p~ 91 (91)
T PF00027_consen 76 RIPREDFLQLLQQDPE 91 (91)
T ss_dssp EEEHHHHHHHHHHSHH
T ss_pred EEeHHHHHHHHHhCcC
Confidence 3677777777777764
No 179
>PF04876 Tenui_NCP: Tenuivirus major non-capsid protein; InterPro: IPR006960 This entry contains the tenuivirus major non-capsid protein. Proteins accumulate in large amounts in tenuivirus infected cells. They are found in the inclusion bodies that are formed after infection [].
Probab=24.80 E-value=1.2e+02 Score=19.08 Aligned_cols=31 Identities=10% Similarity=0.250 Sum_probs=22.3
Q ss_pred HHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 15 PEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 15 ~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.+.+++.+.-.-|.|.+|.|++.+|...++.
T Consensus 81 ~~s~Lehllg~~~~~~n~~i~~~~ff~~lQ~ 111 (175)
T PF04876_consen 81 CHSFLEHLLGGEDDSTNGLIDIGKFFDILQP 111 (175)
T ss_pred HHHHHHHHhcCCcCCcccceeHHHHHHHHHH
Confidence 3445556665556677899999999987763
No 180
>KOG0998|consensus
Probab=24.80 E-value=37 Score=26.13 Aligned_cols=30 Identities=33% Similarity=0.438 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 16 EKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 16 ~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
.....+||...|.+.+|.|+..++...+..
T Consensus 282 ~~~~~~if~q~d~~~dG~I~s~~~~~~f~~ 311 (847)
T KOG0998|consen 282 KQKYSKIFSQVDKDNDGSISSNEARNIFLP 311 (847)
T ss_pred HHHHHHHHHhccccCCCccccccccccccc
No 181
>PF09693 Phage_XkdX: Phage uncharacterised protein (Phage_XkdX); InterPro: IPR010022 This entry is represented by Bacteriophage 69, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry identifies a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=24.49 E-value=67 Score=15.19 Aligned_cols=13 Identities=31% Similarity=0.432 Sum_probs=10.3
Q ss_pred CCcccHHHHHHHH
Q psy15330 31 DGRLSLEEFIEGA 43 (60)
Q Consensus 31 dG~Is~~EF~~~~ 43 (60)
-|.||-+||.+..
T Consensus 24 ~g~IT~eey~eIT 36 (40)
T PF09693_consen 24 AGWITKEEYKEIT 36 (40)
T ss_pred cCeECHHHHHHhh
Confidence 3789999998764
No 182
>PF04651 Pox_A12: Poxvirus A12 protein; InterPro: IPR006744 This family contains vaccinia virus protein A12 and its homologues. VVA12 is a virion protein though its function is unknown.
Probab=24.12 E-value=72 Score=20.49 Aligned_cols=21 Identities=33% Similarity=0.608 Sum_probs=16.4
Q ss_pred cccHHHHHHHHcC-ChHHHHHh
Q psy15330 33 RLSLEEFIEGAKS-DPSIVRLL 53 (60)
Q Consensus 33 ~Is~~EF~~~~~~-~~~il~~l 53 (60)
+=||++|++.+.+ .|.|-.+|
T Consensus 8 RSSYDDYIeTinKitPQLrtlL 29 (189)
T PF04651_consen 8 RSSYDDYIETINKITPQLRTLL 29 (189)
T ss_pred cccHHHHHHHHHHhCHHHHHHH
Confidence 3689999998877 67776666
No 183
>PF06480 FtsH_ext: FtsH Extracellular; InterPro: IPR011546 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain is found in the FtsH family of proteins that include FtsH a membrane-bound ATP-dependent protease universally conserved in prokaryotes []. The FtsH peptidases, which belong to MEROPS peptidase family M41 (clan MA(E)), efficiently degrade proteins that have a low thermodynamic stability - e.g. they lack robust unfoldase activity. This feature may be key and implies that this could be a criterion for degrading a protein. In Oenococcus oeni (Leuconostoc oenos) FtsH is involved in protection against environmental stress [], and shows increased expression under heat or osmotic stress. These two lines of evidence suggest that it is a fundamental prokaryotic self-protection mechanism that checks if proteins are correctly folded. The precise function of this N-terminal region is unclear. ; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0016021 integral to membrane; PDB: 2LNA_A.
Probab=23.84 E-value=47 Score=17.60 Aligned_cols=20 Identities=20% Similarity=0.383 Sum_probs=11.9
Q ss_pred CCCCCCcccHHHHHHHHcCC
Q psy15330 27 DKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 27 D~d~dG~Is~~EF~~~~~~~ 46 (60)
.....-.|+|.||++.+.++
T Consensus 23 ~~~~~~~i~YS~F~~~l~~g 42 (110)
T PF06480_consen 23 NNSQTKEISYSEFLQMLEKG 42 (110)
T ss_dssp ---SSEE--HHHHHHTGGGT
T ss_pred ccCCCcEECHHHHHHHHHcC
Confidence 34456679999999888754
No 184
>CHL00091 apcE phycobillisome linker protein
Probab=23.45 E-value=82 Score=24.74 Aligned_cols=23 Identities=17% Similarity=0.338 Sum_probs=18.2
Q ss_pred CCCcccHHHHHHHHcCChHHHHH
Q psy15330 30 QDGRLSLEEFIEGAKSDPSIVRL 52 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~~il~~ 52 (60)
.+|.||..||++++.+.....+.
T Consensus 552 ~nG~IsvREFIR~LakS~~fr~~ 574 (877)
T CHL00091 552 RRRQISVREFVRQLAKSSVFRSL 574 (877)
T ss_pred hcCCccHHHHHHHHhccHHHHHh
Confidence 68999999999999886544433
No 185
>PF01839 FG-GAP: FG-GAP repeat; InterPro: IPR013517 This region contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure []. The repeat is called the FG-GAP repeat after two conserved motifs in the repeat []. The FG-GAP repeats are found in the N terminus of integrin alpha chains, a region that has been shown to be important for ligand binding []. A putative Ca2+ binding motif is found in some of the repeats. ; PDB: 1L5G_A 3IJE_A 1M1X_A 1JV2_A 1U8C_A 3V4P_C 3V4V_C 3VI3_A 3VI4_C 2VDN_A ....
Probab=23.43 E-value=43 Score=15.25 Aligned_cols=8 Identities=50% Similarity=0.426 Sum_probs=6.4
Q ss_pred hCCCCCCc
Q psy15330 26 MDKNQDGR 33 (60)
Q Consensus 26 ~D~d~dG~ 33 (60)
.|.|+||+
T Consensus 8 gD~ngDG~ 15 (34)
T PF01839_consen 8 GDFNGDGY 15 (34)
T ss_dssp ESTSSSSS
T ss_pred EEcCCCCC
Confidence 48888887
No 186
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=23.11 E-value=1.3e+02 Score=17.64 Aligned_cols=22 Identities=27% Similarity=0.542 Sum_probs=11.9
Q ss_pred HHHHHHHhhCCCCCCcccHHHHHHH
Q psy15330 18 RTDKIFRQMDKNQDGRLSLEEFIEG 42 (60)
Q Consensus 18 ~~~~if~~~D~d~dG~Is~~EF~~~ 42 (60)
....|++.+. +|.||.+|=++.
T Consensus 90 ~~~~IL~~L~---~GeIs~eeA~~~ 111 (113)
T PF09862_consen 90 ERKEILDKLE---KGEISVEEALEI 111 (113)
T ss_pred hHHHHHHHHH---cCCCCHHHHHHH
Confidence 3344555444 566776665544
No 187
>PF09012 FeoC: FeoC like transcriptional regulator; InterPro: IPR015102 This entry contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a [Fe-S] dependent transcriptional repressor []. ; PDB: 1XN7_A 2K02_A.
Probab=23.02 E-value=1e+02 Score=15.79 Aligned_cols=24 Identities=29% Similarity=0.457 Sum_probs=17.1
Q ss_pred CCCcccHHHHHHHHcCChHHHHHh
Q psy15330 30 QDGRLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~~il~~l 53 (60)
..|.+|+.|....+..+|+.++.+
T Consensus 11 ~~~~~S~~eLa~~~~~s~~~ve~m 34 (69)
T PF09012_consen 11 ERGRVSLAELAREFGISPEAVEAM 34 (69)
T ss_dssp HS-SEEHHHHHHHTT--HHHHHHH
T ss_pred HcCCcCHHHHHHHHCcCHHHHHHH
Confidence 368899999999998888877654
No 188
>PF14410 GH-E: HNH/ENDO VII superfamily nuclease with conserved GHE residues
Probab=22.74 E-value=69 Score=17.04 Aligned_cols=30 Identities=20% Similarity=0.460 Sum_probs=21.3
Q ss_pred HHHHHhhCCCCCCcccHHHHHHHHcCChHHH
Q psy15330 20 DKIFRQMDKNQDGRLSLEEFIEGAKSDPSIV 50 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~EF~~~~~~~~~il 50 (60)
....+.......+.||.+||...+. +|+-.
T Consensus 27 ~e~~~~~~~~~~~~~t~ke~~d~~n-~p~ny 56 (70)
T PF14410_consen 27 VEYWRLVGRAEEGGITRKEFLDWYN-DPDNY 56 (70)
T ss_pred HHHHHHHHHHHhcCCCHHHHHHHHh-Cccce
Confidence 3445555555668899999999987 66544
No 189
>cd06403 PB1_Par6 The PB1 domain is an essential part of Par6 protein which in complex with Par3 and aPKC proteins is crucial for establishment of apical-basal polarity of animal cells. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The
Probab=22.52 E-value=32 Score=19.23 Aligned_cols=23 Identities=17% Similarity=0.349 Sum_probs=17.3
Q ss_pred HHhhCCCCCCcccHHHHHHHHcC
Q psy15330 23 FRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 23 f~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
|++|-.+.+-..+|+||...+.+
T Consensus 11 fRRFsl~r~~~~~f~ef~~ll~~ 33 (80)
T cd06403 11 FRRFSLDRNKPGKFEDFYKLLEH 33 (80)
T ss_pred EEEEEeccccCcCHHHHHHHHHH
Confidence 56677777778899999876653
No 190
>COG1102 Cmk Cytidylate kinase [Nucleotide transport and metabolism]
Probab=22.49 E-value=70 Score=20.42 Aligned_cols=31 Identities=42% Similarity=0.683 Sum_probs=24.1
Q ss_pred HHHHhhCCCCCCcccHHHHHHHHcCChHHHHHh
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l 53 (60)
.+|+.+-... -+|++||-+....||+|-..+
T Consensus 33 ~iFR~~A~e~--gmsl~ef~~~AE~~p~iD~~i 63 (179)
T COG1102 33 TIFREMARER--GMSLEEFSRYAEEDPEIDKEI 63 (179)
T ss_pred HHHHHHHHHc--CCCHHHHHHHHhcCchhhHHH
Confidence 6777765543 389999999999999986654
No 191
>COG2910 Putative NADH-flavin reductase [General function prediction only]
Probab=22.41 E-value=68 Score=20.96 Aligned_cols=20 Identities=10% Similarity=0.288 Sum_probs=16.1
Q ss_pred hhCCCCCCcccHHHHHHHHc
Q psy15330 25 QMDKNQDGRLSLEEFIEGAK 44 (60)
Q Consensus 25 ~~D~d~dG~Is~~EF~~~~~ 44 (60)
..|.+|+-+|||++|...+.
T Consensus 175 l~n~~G~SrIS~aDYAiA~l 194 (211)
T COG2910 175 LVNAKGESRISYADYAIAVL 194 (211)
T ss_pred EEcCCCceeeeHHHHHHHHH
Confidence 35778888999999987663
No 192
>PRK08724 fliD flagellar capping protein; Validated
Probab=22.33 E-value=58 Score=24.72 Aligned_cols=27 Identities=7% Similarity=0.142 Sum_probs=21.7
Q ss_pred CCCCCCcccHHH--HHHHHcCChHHHHHh
Q psy15330 27 DKNQDGRLSLEE--FIEGAKSDPSIVRLL 53 (60)
Q Consensus 27 D~d~dG~Is~~E--F~~~~~~~~~il~~l 53 (60)
..+.+|.+++++ |.+.+..||+-+..|
T Consensus 562 tt~~dG~LsiD~~KL~~AL~~npd~V~~L 590 (673)
T PRK08724 562 TTTRQGTLEINYDMLDRQLNNNFNKLEEF 590 (673)
T ss_pred eeCCCCcEEecHHHHHHHHHhCHHHHHHH
Confidence 346789999988 999999999876644
No 193
>PF09912 DUF2141: Uncharacterized protein conserved in bacteria (DUF2141); InterPro: IPR018673 This family of conserved hypothetical proteins has no known function.
Probab=22.32 E-value=20 Score=20.57 Aligned_cols=13 Identities=31% Similarity=0.491 Sum_probs=10.9
Q ss_pred hCCCCCCcccHHH
Q psy15330 26 MDKNQDGRLSLEE 38 (60)
Q Consensus 26 ~D~d~dG~Is~~E 38 (60)
.|.|+||+++...
T Consensus 61 hD~N~NgklD~n~ 73 (112)
T PF09912_consen 61 HDENGNGKLDTNF 73 (112)
T ss_pred EeCCCCCcCCcCC
Confidence 5999999998765
No 194
>cd08032 LARP_7 La RNA-binding domain of La-related protein 7. LARP7 is a component of the 7SK snRNP, a key factor in the regulation of RNA polymerase II transcription. 7SK functionality is dependent on the presence of LARP7, which is thought to stabilize the 7SK RNA by interacting with its 3' end. The release of 7SK RNA from P-TEFb/HEXIM/7SK complexes activates the cyclin-dependent kinase P-TEFb, which in turn phosphorylates the C-terminal domain of RNA pol II and mediates a transition into productive transcription elongation.
Probab=22.27 E-value=77 Score=17.52 Aligned_cols=33 Identities=24% Similarity=0.409 Sum_probs=22.5
Q ss_pred HHHHhhCCCCCCcccHHHHHH-----HHcCCh-HHHHHh
Q psy15330 21 KIFRQMDKNQDGRLSLEEFIE-----GAKSDP-SIVRLL 53 (60)
Q Consensus 21 ~if~~~D~d~dG~Is~~EF~~-----~~~~~~-~il~~l 53 (60)
-+.+.++.+.||.|+.+-+.. .+..+. .|.++|
T Consensus 27 fL~~~~~~~~dG~Vpl~~i~~F~rmk~lt~d~~~i~~Al 65 (82)
T cd08032 27 FLREQIEKSRDGYIDISLLVSFNKMKKLTTDGKLIARAL 65 (82)
T ss_pred HHHHHhcCCCCCCEeHHHHhcchHHHHHcCCHHHHHHHH
Confidence 456789999999999887763 444453 444454
No 195
>PLN02952 phosphoinositide phospholipase C
Probab=22.23 E-value=1.4e+02 Score=22.32 Aligned_cols=32 Identities=9% Similarity=0.230 Sum_probs=24.7
Q ss_pred CcHHHHHHHHHHhhCCCCCCcccHHHHHHHHcC
Q psy15330 13 STPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKS 45 (60)
Q Consensus 13 ~~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~ 45 (60)
..+...+..+|..+-.++ +.+|.++|...+..
T Consensus 34 ~~~r~ei~~lf~~~~~~~-~~mt~~~l~~FL~~ 65 (599)
T PLN02952 34 AEPPDDVKDVFCKFSVGG-GHMGADQLRRFLVL 65 (599)
T ss_pred CCChHHHHHHHHHHhCCC-CccCHHHHHHHHHH
Confidence 345678899999996544 68999999988754
No 196
>CHL00091 apcE phycobillisome linker protein
Probab=22.19 E-value=93 Score=24.46 Aligned_cols=22 Identities=23% Similarity=0.490 Sum_probs=17.8
Q ss_pred CCCcccHHHHHHHHcCChHHHH
Q psy15330 30 QDGRLSLEEFIEGAKSDPSIVR 51 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~~il~ 51 (60)
.+|.||..||++++.+.+.-.+
T Consensus 304 rnG~IsVReFIR~LakS~~Yr~ 325 (877)
T CHL00091 304 KNGQISIKEFIRALGKSEIYRK 325 (877)
T ss_pred hcCCccHHHHHHHHhccHHHHH
Confidence 6899999999999988654433
No 197
>PF03986 Autophagy_N: Autophagocytosis associated protein (Atg3), N-terminal domain ; InterPro: IPR007134 Proteins in this entry belong to the Atg3 group of proteins and the Atg3 conjugation enzymes. Autophagy is a degradative transport pathway that delivers cytosolic proteins to the lysosome (vacuole) [] and is induced by starvation []. Cytosolic proteins appear inside the vacuole enclosed in autophagic vesicles. Autophagy significantly differs from other transport pathways by using double membrane layered transport intermediates, called autophagosomes [, ]. The breakdown of vesicular transport intermediates is a unique feature of autophagy []. Autophagy can also function in the elimination of invading bacteria and antigens []. Atg3 is the E2 enzyme for the LC3 lipidation process []. It is essential for autophagocytosis. The super protein complex, the Atg16L complex, consists of multiple Atg12-Atg5 conjugates. Atg16L has an E3-like role in the LC3 lipidation reaction. The activated intermediate, LC3-Atg3 (E2), is recruited to the site where the lipidation takes place []. Atg3 catalyses the conjugation of Atg8 and phosphatidylethanolamine (PE). Atg3 has an alpha/beta-fold, and its core region is topologically similar to canonical E2 enzymes. Atg3 has two regions inserted in the core region and another with a long alpha-helical structure that protrudes from the core region as far as 30 A []. It interacts with atg8 through an intermediate thioester bond between Cys-288 and the C-terminal Gly of atg8. It also interacts with the C-terminal region of the E1-like atg7 enzyme. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the lysosome/vacuole. Atg3 is a ubiquitin like modifier that is topologically similar to the canonical E2 enzyme []. It catalyses the conjugation of Atg8 and phosphatidylethanolamine []. This domain is the N-terminal of Atg3 while the C-terminal is represented by IPR007135 from INTERPRO.; PDB: 3T7G_C 2DYT_A.
Probab=22.04 E-value=59 Score=19.85 Aligned_cols=14 Identities=29% Similarity=0.634 Sum_probs=8.6
Q ss_pred CCCcccHHHHHHHH
Q psy15330 30 QDGRLSLEEFIEGA 43 (60)
Q Consensus 30 ~dG~Is~~EF~~~~ 43 (60)
..|.||-+||+.+-
T Consensus 24 etG~iTPeEFV~AG 37 (145)
T PF03986_consen 24 ETGVITPEEFVAAG 37 (145)
T ss_dssp HHS---HHHHHHHH
T ss_pred ccceeCHHHHHHhh
Confidence 36999999999753
No 198
>PF09066 B2-adapt-app_C: Beta2-adaptin appendage, C-terminal sub-domain; InterPro: IPR015151 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. This entry represents a subdomain of the appendage (ear) domain of beta-adaptin from AP clathrin adaptor complexes. This domain has a three-layer arrangement, alpha-beta-alpha, with a bifurcated antiparallel beta-sheet []. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15) []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 1E42_B 2G30_A 2IV9_B 2IV8_A 3HS9_A 3H1Z_A.
Probab=22.03 E-value=58 Score=18.19 Aligned_cols=18 Identities=50% Similarity=0.586 Sum_probs=9.7
Q ss_pred CCCcccHHHHHHHHcCCh
Q psy15330 30 QDGRLSLEEFIEGAKSDP 47 (60)
Q Consensus 30 ~dG~Is~~EF~~~~~~~~ 47 (60)
.||.++.++|.+..+.-|
T Consensus 3 ~d~~~~~~~F~~~W~sl~ 20 (114)
T PF09066_consen 3 EDGSMDPEEFQEMWKSLP 20 (114)
T ss_dssp TT----HHHHHHHHHHS-
T ss_pred CCCccCHHHHHHHHHhCC
Confidence 368999999998776533
No 199
>PF04433 SWIRM: SWIRM domain; InterPro: IPR007526 The SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in eukaryotic chromosomal proteins. It is named after the proteins SWI3, RSC8 and MOIRA in which it was first recognised. This domain is predicted to mediate protein-protein interactions in the assembly of chromatin-protein complexes. The SWIRM domain can be linked to different domains, such as the ZZ-type zinc finger (IPR000433 from INTERPRO), the Myb DNA-binding domain (IPR001005 from INTERPRO), the HORMA domain (IPR003511 from INTERPRO), the amino-oxidase domain, the chromo domain (IPR000953 from INTERPRO), and the JAB1/PAD1 domain.; GO: 0005515 protein binding; PDB: 2Z3Y_A 2UXN_A 2Y48_A 2HKO_A 2XAF_A 2X0L_A 2XAJ_A 2UXX_A 2V1D_A 2L3D_A ....
Probab=22.02 E-value=91 Score=16.65 Aligned_cols=32 Identities=22% Similarity=0.235 Sum_probs=22.7
Q ss_pred HHHHHhhCCCCCCcccHHHHHHHHc-CChHHHH
Q psy15330 20 DKIFRQMDKNQDGRLSLEEFIEGAK-SDPSIVR 51 (60)
Q Consensus 20 ~~if~~~D~d~dG~Is~~EF~~~~~-~~~~il~ 51 (60)
+.++..+-.|..+.+|..+-.+.++ .|...+.
T Consensus 40 n~il~~w~~n~~~~lt~~~~~~~i~~~d~~~~~ 72 (86)
T PF04433_consen 40 NTILAEWRKNPNKYLTKTDARKLIKGIDVNKIR 72 (86)
T ss_dssp HHHHHHHHHHTTS---HHHHHHHTTSSSHHHHH
T ss_pred HHHHHHHHHCCCCcccHHHHHHHccccCHHHHH
Confidence 4677888889999999999999988 4665544
No 200
>TIGR03697 NtcA_cyano global nitrogen regulator NtcA, cyanobacterial. Members of this protein family, found in the cyanobacteria, are the global nitrogen regulator NtcA. This DNA-binding transcriptional regulator is required for expressing many different ammonia-repressible genes. The consensus NtcA-binding site is G T A N(8)T A C.
Probab=21.41 E-value=92 Score=18.32 Aligned_cols=21 Identities=24% Similarity=0.482 Sum_probs=17.3
Q ss_pred cccHHHHHHHHcCChHHHHHh
Q psy15330 33 RLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 33 ~Is~~EF~~~~~~~~~il~~l 53 (60)
.|+.+.|...+.++|.+...+
T Consensus 72 ~i~~~~~~~l~~~~p~l~~~~ 92 (193)
T TIGR03697 72 AVPIEQVEKAIEEDPDLSMLL 92 (193)
T ss_pred EeeHHHHHHHHHHChHHHHHH
Confidence 478899999999999887654
No 201
>COG1104 NifS Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]
Probab=21.39 E-value=77 Score=22.42 Aligned_cols=21 Identities=24% Similarity=0.493 Sum_probs=18.7
Q ss_pred hCCCCCCcccHHHHHHHHcCC
Q psy15330 26 MDKNQDGRLSLEEFIEGAKSD 46 (60)
Q Consensus 26 ~D~d~dG~Is~~EF~~~~~~~ 46 (60)
+..|.+|.|+.+++.+.++.+
T Consensus 121 l~V~~~G~v~~e~L~~al~~~ 141 (386)
T COG1104 121 LPVDSNGLVDLEQLEEALRPD 141 (386)
T ss_pred eCCCCCCeEcHHHHHHhcCCC
Confidence 567889999999999999876
No 202
>KOG4070|consensus
Probab=21.24 E-value=39 Score=21.37 Aligned_cols=25 Identities=24% Similarity=0.497 Sum_probs=18.8
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHH
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGA 43 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~ 43 (60)
+..+|.++---.-+.|+|+||.+.+
T Consensus 59 t~i~fsKvkg~~~~~~tf~~fkkal 83 (180)
T KOG4070|consen 59 TDIVFSKVKGKKARTITFEEFKKAL 83 (180)
T ss_pred cceeeeeccccccccccHHHHHHHH
Confidence 4566777777777899999996654
No 203
>PRK11753 DNA-binding transcriptional dual regulator Crp; Provisional
Probab=21.19 E-value=97 Score=18.55 Aligned_cols=21 Identities=24% Similarity=0.423 Sum_probs=17.1
Q ss_pred cccHHHHHHHHcCChHHHHHh
Q psy15330 33 RLSLEEFIEGAKSDPSIVRLL 53 (60)
Q Consensus 33 ~Is~~EF~~~~~~~~~il~~l 53 (60)
.|+.+.|.+.+..+|.+...+
T Consensus 98 ~i~~~~~~~l~~~~p~~~~~~ 118 (211)
T PRK11753 98 EISYKKFRQLIQVNPDILMAL 118 (211)
T ss_pred EEcHHHHHHHHHHCHHHHHHH
Confidence 478899999999999887554
No 204
>PF09280 XPC-binding: XPC-binding domain; InterPro: IPR015360 Members of this entry adopt a structure consisting of four alpha helices, arranged in an array. They bind specifically and directly to the xeroderma pigmentosum group C protein (XPC) to initiate nucleotide excision repair []. ; GO: 0003684 damaged DNA binding, 0006289 nucleotide-excision repair, 0043161 proteasomal ubiquitin-dependent protein catabolic process; PDB: 1PVE_A 1QZE_A 1OQY_A 1TP4_A 1X3W_B 3ESW_B 2QSG_X 2QSF_X 1X3Z_B 2QSH_X ....
Probab=20.63 E-value=1e+02 Score=15.90 Aligned_cols=16 Identities=6% Similarity=0.372 Sum_probs=7.6
Q ss_pred HHHHHHHHcCChHHHH
Q psy15330 36 LEEFIEGAKSDPSIVR 51 (60)
Q Consensus 36 ~~EF~~~~~~~~~il~ 51 (60)
|.+....++.||.++.
T Consensus 10 f~~lR~~vq~NP~lL~ 25 (59)
T PF09280_consen 10 FQQLRQLVQQNPQLLP 25 (59)
T ss_dssp HHHHHHHHHC-GGGHH
T ss_pred HHHHHHHHHHCHHHHH
Confidence 4445555555555443
No 205
>PF10757 YbaJ: Biofilm formation regulator YbaJ; InterPro: IPR019693 YbaJ regulates biofilm formation. It also has an important role in the regulation of motility in the biofilm. YbaJ functions in increasing conjugation, aggregation and decreasing the motility, resulting in an increase of biofilm [].
Probab=20.28 E-value=2.1e+02 Score=17.15 Aligned_cols=39 Identities=10% Similarity=0.130 Sum_probs=30.6
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhcc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQC 55 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~~ 55 (60)
..+++++..|..|- +-.|++.|..+.-+.+..+..+|.-
T Consensus 75 ~ideYLDeTy~LFs---sy~In~~dL~~Wqk~~~~L~~~~~~ 113 (122)
T PF10757_consen 75 LIDEYLDETYMLFS---SYGINDSDLQKWQKSNQRLFRCFSN 113 (122)
T ss_pred HHHHHHHHHHHHhc---CccCCHHHHHHHHHHHHHHHHHHcc
Confidence 45677777777774 4459999999998888999998864
No 206
>PRK14734 coaE dephospho-CoA kinase; Provisional
Probab=20.15 E-value=2.2e+02 Score=17.61 Aligned_cols=36 Identities=25% Similarity=0.259 Sum_probs=26.3
Q ss_pred HHHHHHhhCCCCCCcccHHHHHHHHcCChHHHHHhc
Q psy15330 19 TDKIFRQMDKNQDGRLSLEEFIEGAKSDPSIVRLLQ 54 (60)
Q Consensus 19 ~~~if~~~D~d~dG~Is~~EF~~~~~~~~~il~~l~ 54 (60)
+.+.|-.--.+.||.|+.....+.+-.||+.++.|+
T Consensus 49 l~~~fg~~~~~~~g~idR~~L~~~vF~~~~~~~~le 84 (200)
T PRK14734 49 LAEAFGDDILNPDGTLDRAGLAAKAFASPEQTALLN 84 (200)
T ss_pred HHHHhCccccCCCChhhHHHHHHHHhCCHHHHHHHH
Confidence 344444434567899999999988888888877664
No 207
>TIGR00984 3a0801s03tim44 mitochondrial import inner membrane, translocase subunit. translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tim proteins.
Probab=20.03 E-value=2.4e+02 Score=19.98 Aligned_cols=37 Identities=19% Similarity=0.289 Sum_probs=27.1
Q ss_pred cHHHHHHHHHHhhCCCCCCcccHHHHHHHHcCC--hHHHHHhc
Q psy15330 14 TPEKRTDKIFRQMDKNQDGRLSLEEFIEGAKSD--PSIVRLLQ 54 (60)
Q Consensus 14 ~~~~~~~~if~~~D~d~dG~Is~~EF~~~~~~~--~~il~~l~ 54 (60)
+....+-+-++.+|.+ .+.++|+.+++.. |.|+.++-
T Consensus 225 te~a~~l~eIk~~DPs----Fd~~~Fl~gar~aI~p~ILeAf~ 263 (378)
T TIGR00984 225 TEVSEVLTEFKKIDPT----FDKEHFLRFLREYIVPEILEAYV 263 (378)
T ss_pred CHHHHHHHHHHHhCCC----CCHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555667888765 7889999999887 77887753
Done!