Query 047021
Match_columns 388
No_of_seqs 17 out of 19
Neff 2.9
Searched_HMMs 46136
Date Fri Mar 29 06:50:27 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/047021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/047021hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR02168 SMC_prok_B chromosom 98.0 0.04 8.6E-07 58.9 34.5 73 207-279 864-936 (1179)
2 TIGR02169 SMC_prok_A chromosom 97.9 0.054 1.2E-06 58.3 41.8 105 4-109 674-778 (1164)
3 TIGR02168 SMC_prok_B chromosom 97.8 0.088 1.9E-06 56.3 34.3 22 335-356 993-1014(1179)
4 PF00038 Filament: Intermediat 97.5 0.084 1.8E-06 50.0 28.1 96 11-106 18-113 (312)
5 TIGR02169 SMC_prok_A chromosom 97.4 0.33 7E-06 52.5 34.3 37 209-245 427-463 (1164)
6 KOG0161 Myosin class II heavy 97.2 0.93 2E-05 54.6 34.6 345 5-356 937-1336(1930)
7 PF13514 AAA_27: AAA domain 97.0 0.94 2E-05 51.0 40.4 253 3-264 345-669 (1111)
8 COG1196 Smc Chromosome segrega 96.9 0.34 7.4E-06 54.7 24.4 76 5-81 731-806 (1163)
9 PF00038 Filament: Intermediat 95.9 1.6 3.4E-05 41.5 21.3 119 155-283 10-142 (312)
10 TIGR00606 rad50 rad50. This fa 95.9 4.8 0.0001 46.4 39.4 32 322-353 1062-1093(1311)
11 PRK11281 hypothetical protein; 95.8 5.2 0.00011 46.2 26.5 189 132-359 125-323 (1113)
12 PF14662 CCDC155: Coiled-coil 95.3 2.7 5.9E-05 40.0 20.5 166 2-180 6-189 (193)
13 PRK04863 mukB cell division pr 95.3 7.6 0.00016 46.1 25.4 182 68-249 435-671 (1486)
14 PRK02224 chromosome segregatio 95.2 5.6 0.00012 43.1 34.1 34 79-112 555-588 (880)
15 PF07888 CALCOCO1: Calcium bin 95.2 5.6 0.00012 43.0 31.3 225 5-284 144-376 (546)
16 PHA02562 46 endonuclease subun 95.0 4.8 0.0001 40.9 26.2 138 11-152 174-323 (562)
17 PF10174 Cast: RIM-binding pro 94.7 9.1 0.0002 42.8 31.4 84 10-107 2-85 (775)
18 KOG0161 Myosin class II heavy 94.5 16 0.00035 44.7 37.6 157 70-255 903-1059(1930)
19 PF09730 BicD: Microtubule-ass 93.3 16 0.00036 40.6 30.5 151 8-181 31-181 (717)
20 PF05622 HOOK: HOOK protein; 93.2 0.025 5.4E-07 60.4 0.0 132 8-150 190-330 (713)
21 COG1196 Smc Chromosome segrega 93.2 19 0.00042 41.2 41.3 42 328-369 970-1011(1163)
22 TIGR00606 rad50 rad50. This fa 92.9 23 0.00049 41.1 41.2 188 45-256 742-935 (1311)
23 PRK11637 AmiB activator; Provi 92.8 12 0.00026 37.9 25.2 40 73-112 94-133 (428)
24 PF15070 GOLGA2L5: Putative go 92.8 17 0.00037 39.6 27.3 79 9-92 99-177 (617)
25 PF13514 AAA_27: AAA domain 92.7 21 0.00046 40.5 35.0 161 70-244 668-836 (1111)
26 PF05557 MAD: Mitotic checkpoi 92.5 1.8 3.8E-05 46.7 12.5 138 202-362 503-647 (722)
27 PF09728 Taxilin: Myosin-like 92.1 14 0.0003 36.8 29.7 254 13-286 17-307 (309)
28 TIGR02680 conserved hypothetic 92.1 30 0.00065 40.7 31.1 121 11-148 742-864 (1353)
29 PF04849 HAP1_N: HAP1 N-termin 91.8 16 0.00035 37.0 17.8 166 3-183 81-254 (306)
30 TIGR01005 eps_transp_fam exopo 91.4 13 0.00029 39.9 17.5 90 54-147 237-335 (754)
31 PRK09039 hypothetical protein; 91.3 17 0.00038 36.4 18.4 35 43-77 70-104 (343)
32 PRK11637 AmiB activator; Provi 91.0 19 0.00042 36.4 21.6 88 5-93 48-135 (428)
33 PF07888 CALCOCO1: Calcium bin 91.0 27 0.00058 38.0 33.7 75 6-81 152-226 (546)
34 KOG0980 Actin-binding protein 91.0 35 0.00076 39.3 23.6 119 154-283 387-505 (980)
35 KOG1029 Endocytic adaptor prot 90.6 37 0.00079 39.0 26.3 228 6-249 319-582 (1118)
36 PF15619 Lebercilin: Ciliary p 90.4 15 0.00034 34.4 19.9 93 75-184 82-185 (194)
37 PF00769 ERM: Ezrin/radixin/mo 90.1 18 0.0004 34.7 15.8 29 327-355 174-202 (246)
38 PF04849 HAP1_N: HAP1 N-termin 90.1 22 0.00047 36.1 16.5 84 4-112 160-243 (306)
39 PF05557 MAD: Mitotic checkpoi 88.8 6.2 0.00013 42.6 12.5 23 163-185 566-588 (722)
40 PF01576 Myosin_tail_1: Myosin 88.5 0.14 2.9E-06 56.6 0.0 304 14-364 134-455 (859)
41 COG4477 EzrA Negative regulato 88.0 45 0.00098 36.5 21.3 125 4-150 347-476 (570)
42 PF09726 Macoilin: Transmembra 87.9 49 0.0011 36.7 22.0 179 16-242 451-648 (697)
43 PF12128 DUF3584: Protein of u 87.3 64 0.0014 37.4 35.8 218 5-223 615-869 (1201)
44 PF08317 Spc7: Spc7 kinetochor 86.6 27 0.00058 34.5 14.5 53 7-60 152-204 (325)
45 PRK09039 hypothetical protein; 86.3 39 0.00085 34.0 16.4 25 205-229 175-200 (343)
46 KOG4807 F-actin binding protei 86.3 52 0.0011 35.3 19.1 59 223-281 414-479 (593)
47 PF10174 Cast: RIM-binding pro 86.1 66 0.0014 36.3 32.2 103 5-113 239-353 (775)
48 PF07106 TBPIP: Tat binding pr 83.1 21 0.00046 31.7 11.0 80 125-214 69-157 (169)
49 PF08317 Spc7: Spc7 kinetochor 80.2 65 0.0014 31.9 18.4 107 92-212 180-286 (325)
50 TIGR03017 EpsF chain length de 79.2 72 0.0016 31.8 18.2 45 99-147 257-301 (444)
51 PRK03918 chromosome segregatio 79.1 1E+02 0.0022 33.5 33.8 28 157-184 406-433 (880)
52 TIGR03752 conj_TIGR03752 integ 78.3 11 0.00023 40.3 8.6 76 2-112 64-139 (472)
53 PF09726 Macoilin: Transmembra 77.6 1.2E+02 0.0027 33.7 21.2 46 4-49 460-505 (697)
54 KOG4674 Uncharacterized conser 77.4 2E+02 0.0042 35.9 36.8 155 202-357 391-561 (1822)
55 TIGR01005 eps_transp_fam exopo 76.5 37 0.0008 36.6 12.2 108 4-111 288-398 (754)
56 PF14992 TMCO5: TMCO5 family 76.4 91 0.002 31.5 17.8 158 8-185 8-182 (280)
57 PF11932 DUF3450: Protein of u 75.6 14 0.0003 34.9 7.9 54 5-58 57-113 (251)
58 KOG1962 B-cell receptor-associ 75.5 22 0.00048 34.5 9.3 56 198-253 154-209 (216)
59 PF15397 DUF4618: Domain of un 75.1 93 0.002 31.0 22.1 194 34-258 22-221 (258)
60 PRK10929 putative mechanosensi 74.9 1.8E+02 0.004 34.3 28.1 105 245-361 198-305 (1109)
61 PF12718 Tropomyosin_1: Tropom 73.0 70 0.0015 28.6 13.6 86 4-108 21-106 (143)
62 smart00338 BRLZ basic region l 72.8 8.8 0.00019 29.3 4.8 38 4-41 26-63 (65)
63 KOG0996 Structural maintenance 72.6 2.2E+02 0.0048 34.2 22.4 233 2-240 790-1032(1293)
64 PRK11281 hypothetical protein; 72.4 2.1E+02 0.0045 33.8 26.3 82 99-183 166-254 (1113)
65 TIGR03007 pepcterm_ChnLen poly 72.4 1.2E+02 0.0026 31.0 20.9 34 82-115 204-237 (498)
66 PF01576 Myosin_tail_1: Myosin 71.5 1.3 2.8E-05 49.2 0.0 70 205-274 380-449 (859)
67 PF06005 DUF904: Protein of un 70.6 56 0.0012 26.5 9.6 65 218-285 6-70 (72)
68 KOG0977 Nuclear envelope prote 70.5 1.7E+02 0.0037 32.1 15.6 50 35-85 165-214 (546)
69 PF11932 DUF3450: Protein of u 69.6 1E+02 0.0022 29.2 13.8 72 201-272 48-119 (251)
70 COG4942 Membrane-bound metallo 69.4 1.6E+02 0.0035 31.3 23.2 24 160-183 196-219 (420)
71 PF09755 DUF2046: Uncharacteri 69.0 1.4E+02 0.0031 30.6 23.9 61 7-67 23-97 (310)
72 PF00170 bZIP_1: bZIP transcri 68.7 13 0.00029 28.3 5.0 36 5-40 27-62 (64)
73 PF12777 MT: Microtubule-bindi 68.6 31 0.00068 34.3 8.8 108 25-132 8-121 (344)
74 TIGR03185 DNA_S_dndD DNA sulfu 67.6 1.8E+02 0.004 31.3 30.2 42 71-112 205-246 (650)
75 PF08614 ATG16: Autophagy prot 65.2 39 0.00085 30.9 8.2 145 96-251 31-179 (194)
76 PRK02224 chromosome segregatio 65.2 2.2E+02 0.0047 31.3 42.6 33 77-109 407-439 (880)
77 PF15035 Rootletin: Ciliary ro 64.8 14 0.00031 34.4 5.3 43 2-44 65-107 (182)
78 TIGR01843 type_I_hlyD type I s 64.0 1.5E+02 0.0031 28.8 18.1 169 13-182 76-265 (423)
79 TIGR03017 EpsF chain length de 63.8 1.3E+02 0.0029 30.0 12.2 80 6-85 256-335 (444)
80 PF05701 WEMBL: Weak chloropla 62.1 2.2E+02 0.0048 30.3 31.8 167 4-185 27-194 (522)
81 KOG0977 Nuclear envelope prote 62.0 2.5E+02 0.0054 30.9 24.2 83 30-112 61-157 (546)
82 PF01166 TSC22: TSC-22/dip/bun 60.9 9.8 0.00021 30.5 3.0 25 2-26 19-43 (59)
83 COG4372 Uncharacterized protei 60.0 2.5E+02 0.0055 30.3 24.0 87 10-111 94-180 (499)
84 PRK10884 SH3 domain-containing 59.8 1.1E+02 0.0023 29.2 10.2 33 83-115 140-172 (206)
85 KOG3977 Troponin I [Cytoskelet 58.6 23 0.0005 34.4 5.7 75 7-85 98-181 (221)
86 TIGR02231 conserved hypothetic 58.6 2.1E+02 0.0045 29.9 13.0 44 70-113 66-109 (525)
87 PF04012 PspA_IM30: PspA/IM30 58.5 1.5E+02 0.0032 27.2 16.5 137 9-146 28-183 (221)
88 PF14197 Cep57_CLD_2: Centroso 58.4 45 0.00098 26.8 6.5 39 30-69 3-41 (69)
89 COG4942 Membrane-bound metallo 58.3 2.6E+02 0.0056 29.8 28.3 105 11-130 38-142 (420)
90 TIGR00219 mreC rod shape-deter 58.0 32 0.00068 33.7 6.6 52 229-283 58-109 (283)
91 PF09755 DUF2046: Uncharacteri 56.3 2.4E+02 0.0053 29.0 22.5 91 21-115 109-204 (310)
92 PF13935 Ead_Ea22: Ead/Ea22-li 56.1 67 0.0015 28.3 7.8 59 227-285 71-132 (139)
93 KOG0249 LAR-interacting protei 54.8 1.6E+02 0.0035 33.8 11.9 91 17-110 162-258 (916)
94 PF12329 TMF_DNA_bd: TATA elem 54.7 27 0.00058 28.2 4.6 38 5-42 34-71 (74)
95 PF04859 DUF641: Plant protein 54.7 28 0.00061 31.3 5.2 46 2-47 78-123 (131)
96 PF15294 Leu_zip: Leucine zipp 54.2 1.7E+02 0.0038 29.5 11.1 124 235-385 130-254 (278)
97 KOG4674 Uncharacterized conser 54.1 5.5E+02 0.012 32.3 33.8 131 8-145 569-706 (1822)
98 PF12709 Kinetocho_Slk19: Cent 52.9 28 0.00062 29.6 4.7 38 4-41 49-86 (87)
99 PF06156 DUF972: Protein of un 52.7 79 0.0017 27.4 7.5 53 213-286 5-57 (107)
100 PF13870 DUF4201: Domain of un 52.7 1.7E+02 0.0038 26.2 12.9 90 15-112 46-135 (177)
101 PF08614 ATG16: Autophagy prot 52.6 1.4E+02 0.0031 27.3 9.6 37 6-42 118-154 (194)
102 KOG0612 Rho-associated, coiled 51.7 5.2E+02 0.011 31.4 30.1 81 22-106 465-549 (1317)
103 PHA02562 46 endonuclease subun 51.5 2.9E+02 0.0063 28.4 23.6 146 4-150 174-328 (562)
104 PRK10929 putative mechanosensi 50.7 5E+02 0.011 30.9 27.4 116 70-188 104-240 (1109)
105 PRK10246 exonuclease subunit S 50.6 4.5E+02 0.0098 30.3 31.1 25 199-223 774-798 (1047)
106 cd07429 Cby_like Chibby, a nuc 50.4 17 0.00037 31.9 3.1 25 5-29 73-97 (108)
107 PF05667 DUF812: Protein of un 49.9 3.9E+02 0.0085 29.4 23.7 139 4-156 335-475 (594)
108 PF00261 Tropomyosin: Tropomyo 49.5 2.3E+02 0.0051 26.7 27.2 103 154-256 125-230 (237)
109 PF05622 HOOK: HOOK protein; 48.7 5.8 0.00013 42.9 0.0 100 51-151 278-386 (713)
110 KOG0964 Structural maintenance 48.4 5.5E+02 0.012 30.7 28.8 169 71-256 268-444 (1200)
111 PF01724 DUF29: Domain of unkn 48.2 1.9E+02 0.0042 25.7 9.4 45 227-273 69-113 (139)
112 COG1579 Zn-ribbon protein, pos 48.0 2.9E+02 0.0062 27.3 20.7 96 25-144 10-105 (239)
113 PRK14154 heat shock protein Gr 48.0 93 0.002 30.0 7.9 56 323-378 54-121 (208)
114 PF08172 CASP_C: CASP C termin 47.9 33 0.00072 33.4 5.0 42 3-44 85-126 (248)
115 PRK04778 septation ring format 47.8 3.8E+02 0.0082 28.7 27.9 260 4-281 230-506 (569)
116 TIGR02231 conserved hypothetic 47.5 1.7E+02 0.0037 30.6 10.3 95 13-109 73-172 (525)
117 PF00769 ERM: Ezrin/radixin/mo 47.5 2.7E+02 0.0059 26.9 12.6 106 6-111 7-118 (246)
118 PF05377 FlaC_arch: Flagella a 46.9 38 0.00082 26.7 4.2 38 12-49 1-38 (55)
119 TIGR02449 conserved hypothetic 46.5 43 0.00093 27.1 4.6 38 4-41 7-44 (65)
120 KOG4643 Uncharacterized coiled 46.0 6E+02 0.013 30.5 27.5 103 8-112 174-310 (1195)
121 COG4985 ABC-type phosphate tra 45.8 1.6E+02 0.0034 29.8 9.2 65 23-87 162-233 (289)
122 PF12795 MscS_porin: Mechanose 45.2 2.7E+02 0.0058 26.2 23.3 186 76-290 32-217 (240)
123 PRK04863 mukB cell division pr 45.1 6.7E+02 0.014 30.7 38.5 27 162-188 462-488 (1486)
124 PF07989 Microtub_assoc: Micro 45.0 64 0.0014 26.3 5.4 53 235-287 5-58 (75)
125 PF04880 NUDE_C: NUDE protein, 45.0 27 0.0006 32.5 3.8 43 6-48 2-47 (166)
126 TIGR03752 conj_TIGR03752 integ 44.6 2.4E+02 0.0053 30.5 11.0 23 4-26 73-95 (472)
127 PF12128 DUF3584: Protein of u 44.3 5.9E+02 0.013 29.9 41.4 50 64-113 486-535 (1201)
128 PF15294 Leu_zip: Leucine zipp 43.6 3.6E+02 0.0079 27.3 18.2 174 203-382 2-205 (278)
129 PF08826 DMPK_coil: DMPK coile 43.5 77 0.0017 25.3 5.5 40 8-47 15-54 (61)
130 KOG0018 Structural maintenance 43.0 6.7E+02 0.014 30.1 25.5 226 13-273 222-473 (1141)
131 PF04156 IncA: IncA protein; 42.5 2.5E+02 0.0054 25.0 13.2 34 6-39 83-116 (191)
132 PF09730 BicD: Microtubule-ass 41.5 5.8E+02 0.013 29.0 26.5 140 218-361 113-305 (717)
133 PF05600 DUF773: Protein of un 41.2 4.6E+02 0.0099 28.2 12.4 114 235-372 96-229 (507)
134 PF05103 DivIVA: DivIVA protei 40.9 13 0.00028 31.0 0.9 19 94-112 105-123 (131)
135 PRK10698 phage shock protein P 40.4 3.3E+02 0.0072 25.9 13.8 68 70-144 115-182 (222)
136 COG2433 Uncharacterized conser 40.0 3.4E+02 0.0073 30.6 11.4 87 4-109 422-508 (652)
137 KOG4403 Cell surface glycoprot 39.9 5.5E+02 0.012 28.3 15.4 137 125-293 239-382 (575)
138 PF09403 FadA: Adhesion protei 39.9 2.8E+02 0.006 24.9 13.5 60 123-185 15-74 (126)
139 PF10168 Nup88: Nuclear pore c 39.7 6E+02 0.013 28.6 16.2 43 4-46 579-621 (717)
140 PF04111 APG6: Autophagy prote 39.0 4.1E+02 0.0089 26.6 11.5 88 203-292 51-138 (314)
141 PF06005 DUF904: Protein of un 38.9 1.4E+02 0.0031 24.2 6.6 36 5-40 19-54 (72)
142 PRK10884 SH3 domain-containing 38.1 2.9E+02 0.0062 26.4 9.4 20 127-146 92-111 (206)
143 PF00261 Tropomyosin: Tropomyo 37.7 3.6E+02 0.0078 25.5 22.4 197 71-286 11-218 (237)
144 PF15035 Rootletin: Ciliary ro 37.5 3.5E+02 0.0076 25.3 15.2 117 104-220 31-166 (182)
145 KOG4797 Transcriptional regula 37.2 33 0.00072 30.8 2.9 24 2-25 72-95 (123)
146 PF02994 Transposase_22: L1 tr 37.1 74 0.0016 32.4 5.7 45 3-47 143-187 (370)
147 smart00787 Spc7 Spc7 kinetocho 36.6 4.6E+02 0.01 26.5 18.2 102 98-213 181-282 (312)
148 KOG4643 Uncharacterized coiled 35.9 8.5E+02 0.018 29.3 31.6 85 203-287 489-600 (1195)
149 TIGR02894 DNA_bind_RsfA transc 35.6 96 0.0021 29.1 5.7 39 6-44 99-137 (161)
150 TIGR01010 BexC_CtrB_KpsE polys 35.6 4.4E+02 0.0096 26.0 14.4 83 62-148 164-262 (362)
151 PRK13922 rod shape-determining 35.2 3.6E+02 0.0078 25.6 9.7 70 210-283 42-111 (276)
152 PRK04406 hypothetical protein; 35.1 1.3E+02 0.0028 24.5 5.8 52 34-88 6-59 (75)
153 PF04102 SlyX: SlyX; InterPro 34.3 1.4E+02 0.0031 23.5 5.8 45 3-47 3-47 (69)
154 PF11336 DUF3138: Protein of u 34.0 39 0.00084 36.4 3.3 67 71-139 28-94 (514)
155 PRK10246 exonuclease subunit S 33.5 8.1E+02 0.018 28.4 42.6 20 289-308 498-517 (1047)
156 PF10168 Nup88: Nuclear pore c 33.4 7.4E+02 0.016 27.9 13.2 32 195-227 633-664 (717)
157 PRK02119 hypothetical protein; 33.1 1.5E+02 0.0032 24.0 5.8 53 33-88 3-57 (73)
158 TIGR02680 conserved hypothetic 33.0 9.3E+02 0.02 28.9 32.5 108 5-112 284-398 (1353)
159 PF09304 Cortex-I_coil: Cortex 32.8 3.6E+02 0.0077 24.0 12.6 57 101-168 42-98 (107)
160 PRK00736 hypothetical protein; 32.6 1.3E+02 0.0028 24.0 5.3 50 36-88 2-53 (68)
161 PRK11546 zraP zinc resistance 32.5 1.9E+02 0.0041 26.6 7.0 25 126-150 87-111 (143)
162 COG3206 GumC Uncharacterized p 32.4 5.7E+02 0.012 26.2 11.7 104 6-109 287-393 (458)
163 COG3880 Modulator of heat shoc 32.3 1E+02 0.0022 29.4 5.3 40 70-112 131-171 (176)
164 PF00992 Troponin: Troponin; 32.2 1.5E+02 0.0033 26.3 6.3 75 6-80 52-131 (132)
165 TIGR02449 conserved hypothetic 31.7 1.3E+02 0.0029 24.3 5.2 37 12-48 29-65 (65)
166 PF08429 PLU-1: PLU-1-like pro 31.7 4.3E+02 0.0094 25.6 9.8 79 206-285 202-294 (335)
167 PF06632 XRCC4: DNA double-str 31.6 3.6E+02 0.0078 27.7 9.5 49 1-49 134-182 (342)
168 PF14988 DUF4515: Domain of un 31.2 4.7E+02 0.01 24.9 17.7 114 131-258 57-177 (206)
169 KOG0946 ER-Golgi vesicle-tethe 31.2 9.4E+02 0.02 28.4 20.6 72 5-77 672-743 (970)
170 PF11336 DUF3138: Protein of u 31.1 1.2E+02 0.0027 32.8 6.3 79 117-223 14-104 (514)
171 TIGR02894 DNA_bind_RsfA transc 30.8 1.1E+02 0.0024 28.7 5.3 32 15-46 101-132 (161)
172 PF08537 NBP1: Fungal Nap bind 30.6 95 0.0021 32.0 5.2 39 5-43 183-221 (323)
173 PF03112 DUF244: Uncharacteriz 30.5 53 0.0012 30.7 3.2 32 228-273 75-106 (158)
174 PRK02793 phi X174 lysis protei 30.2 1.4E+02 0.0031 24.0 5.2 55 32-89 1-57 (72)
175 smart00751 BSD domain in trans 30.0 43 0.00094 24.7 2.1 25 232-256 5-29 (51)
176 COG2433 Uncharacterized conser 29.9 3.2E+02 0.0069 30.7 9.3 53 4-56 429-490 (652)
177 PRK14145 heat shock protein Gr 29.1 2.5E+02 0.0054 26.9 7.5 61 318-378 42-114 (196)
178 PF10883 DUF2681: Protein of u 29.1 1.6E+02 0.0034 25.1 5.5 36 1-36 20-55 (87)
179 KOG2129 Uncharacterized conser 28.3 4.2E+02 0.0091 28.9 9.6 83 20-110 203-299 (552)
180 PF00170 bZIP_1: bZIP transcri 28.2 2.5E+02 0.0054 21.3 6.0 43 204-246 21-63 (64)
181 PF09304 Cortex-I_coil: Cortex 28.0 4.3E+02 0.0094 23.5 9.5 60 15-82 34-93 (107)
182 PRK14143 heat shock protein Gr 27.8 4.7E+02 0.01 25.7 9.2 62 320-381 66-139 (238)
183 TIGR01010 BexC_CtrB_KpsE polys 27.8 3.9E+02 0.0084 26.4 8.8 10 10-19 220-229 (362)
184 PRK00295 hypothetical protein; 27.7 2E+02 0.0043 22.9 5.6 50 36-88 2-53 (68)
185 PF04111 APG6: Autophagy prote 27.7 6.3E+02 0.014 25.3 10.7 78 202-279 57-134 (314)
186 PF09789 DUF2353: Uncharacteri 27.7 3.7E+02 0.0081 27.6 8.8 97 2-113 131-227 (319)
187 PF10481 CENP-F_N: Cenp-F N-te 27.4 7.1E+02 0.015 25.8 15.2 118 206-350 64-191 (307)
188 PF12329 TMF_DNA_bd: TATA elem 27.3 2.8E+02 0.006 22.4 6.4 15 71-85 57-71 (74)
189 PF10805 DUF2730: Protein of u 27.1 1.7E+02 0.0037 24.8 5.5 33 24-56 71-103 (106)
190 TIGR01000 bacteriocin_acc bact 26.7 7.1E+02 0.015 25.6 11.7 20 127-146 290-309 (457)
191 PF06156 DUF972: Protein of un 26.1 1.7E+02 0.0038 25.3 5.4 44 3-46 7-57 (107)
192 PRK13922 rod shape-determining 26.0 1.2E+02 0.0027 28.7 4.9 39 4-42 69-110 (276)
193 PF13870 DUF4201: Domain of un 26.0 4.3E+02 0.0093 23.7 8.1 76 9-88 96-172 (177)
194 KOG0980 Actin-binding protein 25.5 1.2E+03 0.026 27.7 19.4 71 202-279 459-543 (980)
195 PF14389 Lzipper-MIP1: Leucine 25.4 3.9E+02 0.0085 22.1 8.3 27 126-152 6-32 (88)
196 PF03962 Mnd1: Mnd1 family; I 25.1 5.7E+02 0.012 23.9 10.4 36 16-51 67-102 (188)
197 PF14645 Chibby: Chibby family 25.1 76 0.0016 27.8 3.1 25 4-28 71-95 (116)
198 PF08826 DMPK_coil: DMPK coile 24.8 3.6E+02 0.0078 21.5 8.1 49 64-112 7-55 (61)
199 KOG1029 Endocytic adaptor prot 24.8 1.2E+03 0.026 27.6 13.1 26 2-27 491-516 (1118)
200 PF10779 XhlA: Haemolysin XhlA 24.4 2.5E+02 0.0054 22.1 5.6 43 6-48 8-50 (71)
201 PRK10997 yieM hypothetical pro 24.3 4.3E+02 0.0093 28.6 8.9 83 194-286 127-225 (487)
202 PRK03947 prefoldin subunit alp 24.2 4.6E+02 0.01 22.5 9.1 42 72-113 91-132 (140)
203 PF05377 FlaC_arch: Flagella a 23.7 2.2E+02 0.0048 22.6 5.0 39 5-43 1-39 (55)
204 PF02151 UVR: UvrB/uvrC motif; 23.6 89 0.0019 21.8 2.6 16 96-111 21-36 (36)
205 COG4372 Uncharacterized protei 23.2 9.9E+02 0.022 26.0 23.4 96 70-183 76-171 (499)
206 PF10506 MCC-bdg_PDZ: PDZ doma 22.9 2.7E+02 0.0059 22.6 5.6 48 8-56 2-49 (67)
207 PF10458 Val_tRNA-synt_C: Valy 22.9 3E+02 0.0065 21.3 5.7 24 154-180 5-28 (66)
208 PF04568 IATP: Mitochondrial A 22.8 2.6E+02 0.0057 24.2 5.8 47 195-243 50-96 (100)
209 cd00890 Prefoldin Prefoldin is 21.8 4.6E+02 0.0099 21.6 8.3 36 77-112 89-124 (129)
210 PF15619 Lebercilin: Ciliary p 21.7 6.9E+02 0.015 23.6 11.4 32 56-89 108-139 (194)
211 PF03999 MAP65_ASE1: Microtubu 21.5 31 0.00067 37.0 0.0 151 68-243 133-299 (619)
212 TIGR03185 DNA_S_dndD DNA sulfu 21.3 1.1E+03 0.023 25.7 31.0 78 162-241 390-467 (650)
213 KOG0946 ER-Golgi vesicle-tethe 21.2 1.4E+03 0.031 27.0 17.3 38 5-42 679-716 (970)
214 PF07106 TBPIP: Tat binding pr 21.2 1.8E+02 0.004 25.8 4.8 20 29-48 120-139 (169)
215 PF12240 Angiomotin_C: Angiomo 21.1 1.4E+02 0.003 29.1 4.2 38 6-43 131-168 (205)
216 KOG0804 Cytoplasmic Zn-finger 21.0 1.1E+03 0.024 25.8 14.7 20 71-90 381-400 (493)
217 PF14282 FlxA: FlxA-like prote 20.9 5.2E+02 0.011 21.9 8.2 25 123-147 14-38 (106)
218 PRK00409 recombination and DNA 20.9 1.1E+03 0.024 26.6 11.6 50 7-56 523-572 (782)
219 PF10234 Cluap1: Clusterin-ass 20.8 2E+02 0.0044 28.7 5.4 47 3-50 175-221 (267)
220 PF09789 DUF2353: Uncharacteri 20.4 9.6E+02 0.021 24.8 10.6 72 7-90 75-148 (319)
221 KOG0995 Centromere-associated 20.3 1.2E+03 0.027 26.1 23.4 207 126-361 257-465 (581)
222 PF04899 MbeD_MobD: MbeD/MobD 20.3 2.9E+02 0.0064 22.5 5.3 40 316-355 23-62 (70)
223 TIGR01069 mutS2 MutS2 family p 20.1 8.4E+02 0.018 27.5 10.5 50 7-56 518-567 (771)
224 PF10224 DUF2205: Predicted co 20.1 4.3E+02 0.0094 22.1 6.4 51 198-255 12-62 (80)
225 PF09766 FimP: Fms-interacting 20.0 2E+02 0.0044 29.2 5.3 51 5-56 102-152 (355)
226 PF06825 HSBP1: Heat shock fac 20.0 1.4E+02 0.003 23.4 3.3 15 34-48 30-44 (54)
227 PF08649 DASH_Dad1: DASH compl 20.0 2.8E+02 0.0062 22.1 5.0 41 222-265 4-44 (58)
No 1
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=97.97 E-value=0.04 Score=58.86 Aligned_cols=73 Identities=23% Similarity=0.191 Sum_probs=29.3
Q ss_pred HHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 207 LEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRG 279 (388)
Q Consensus 207 le~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~ 279 (388)
++..+.++..-+..++.....+-.+++.-..++..+..+-..+...+.....--..++.++.++-.+...+..
T Consensus 864 ~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~l~~~~~~~~~~~~~l~~ 936 (1179)
T TIGR02168 864 LEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEV 936 (1179)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333344444444444444333343333333333333333333334444444444444444433
No 2
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=97.91 E-value=0.054 Score=58.34 Aligned_cols=105 Identities=18% Similarity=0.326 Sum_probs=53.6
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
..+..+......++.+...+...+...+.+...+...+..+.. ..+.++..+..+..+.+....-.+.+...+..++.+
T Consensus 674 ~~l~~l~~~l~~l~~~l~~l~~~~~~~~~~l~~l~~~~~~~~~-~~~~l~~~l~~~~~~l~~~~~~~~~l~~~~~~~~~~ 752 (1164)
T TIGR02169 674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR-KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQE 752 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455566666666666666666666666666556555554432 234455555544444444444444455555555555
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSLQ 109 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~lQ 109 (388)
+...+.+--..+.+...+..+++.+.
T Consensus 753 ~~~~~~el~~l~~~i~~l~~~i~~l~ 778 (1164)
T TIGR02169 753 IENVKSELKELEARIEELEEDLHKLE 778 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55554444444444444444444433
No 3
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=97.76 E-value=0.088 Score=56.30 Aligned_cols=22 Identities=27% Similarity=0.213 Sum_probs=11.3
Q ss_pred hHHHHhHHHHHHHHHHHHHHHH
Q 047021 335 SRAEALSAQVLQLSAQLQQTTQ 356 (388)
Q Consensus 335 sRseaLsaqv~qLsAeL~ra~Q 356 (388)
.|-+.|+.|..-|.........
T Consensus 993 er~~~l~~q~~dL~~~~~~L~~ 1014 (1179)
T TIGR02168 993 EEYEELKERYDFLTAQKEDLTE 1014 (1179)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555555555554444333
No 4
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=97.53 E-value=0.084 Score=49.95 Aligned_cols=96 Identities=25% Similarity=0.389 Sum_probs=64.9
Q ss_pred HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021 11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE 90 (388)
Q Consensus 11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE 90 (388)
.|.-.||.+|+.|..+|.........-..++.+.=...+..+|+-|.+++.|+....+-.+.+-..+..++.|+.+.-..
T Consensus 18 ekVr~LE~~N~~Le~~i~~~~~~~~~~~~~~~~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e~~~ 97 (312)
T PF00038_consen 18 EKVRFLEQENKRLESEIEELREKKGEEVSRIKEMYEEELRELRRQIDDLSKEKARLELEIDNLKEELEDLRRKYEEELAE 97 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhhhhHHHHHHHHhcccccCcccccchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHHHHH
Confidence 34445566666666666655555444444455555666778889999999999999998998888888888888888888
Q ss_pred HhhhHhhHHHHHHHHH
Q 047021 91 QYRAEEDAAALRAELN 106 (388)
Q Consensus 91 QyRAEEDaAaLRAEln 106 (388)
...++.+-..||.++.
T Consensus 98 ~~~le~el~~lrk~ld 113 (312)
T PF00038_consen 98 RKDLEEELESLRKDLD 113 (312)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhhhhhhhh
Confidence 8888877777775554
No 5
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=97.35 E-value=0.33 Score=52.53 Aligned_cols=37 Identities=19% Similarity=0.372 Sum_probs=15.9
Q ss_pred HHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhh
Q 047021 209 KQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEE 245 (388)
Q Consensus 209 ~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~E 245 (388)
.++..+..-++.++.....+-.+|+.-..+++.+-.+
T Consensus 427 ~~~~~l~~~~~~l~~~~~~~~~~l~~l~~~~~~~~~~ 463 (1164)
T TIGR02169 427 AAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAAD 463 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333334444444444444444444444444333
No 6
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=97.20 E-value=0.93 Score=54.62 Aligned_cols=345 Identities=23% Similarity=0.277 Sum_probs=170.8
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh--hhHHH--------HhhhhhhhhhhHHHh---hh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV--PSLRK--------ALKDVAMEKDAAVVA---RE 71 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v--~slrk--------aL~~v~mEkDaA~va---re 71 (388)
.+.+|.+...++|..+.++.+++.+.+.++..|..+++.++.+.- ...+| -..+|..++|.+--. +.
T Consensus 937 e~~~l~~~~~~~E~~~~k~~~Ek~~~e~~~~~l~~e~~~~~e~~~kL~kekk~lEe~~~~l~~~l~~~eek~~~l~k~~~ 1016 (1930)
T KOG0161|consen 937 EVQELKEQLEELELTLQKLELEKNAAENKLKNLEEEINSLDENISKLSKEKKELEERIRELQDDLQAEEEKAKSLNKAKA 1016 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355666667777777778888888888888888888886544322 33333 344555555554332 45
Q ss_pred hHHHHHHHHHHHhHHHHHHHhhhHhh--HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021 72 DLSAQLRTLKKRLKEAEEEQYRAEED--AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ 149 (388)
Q Consensus 72 Dl~AQLR~~kkRlkEAEEEQyRAEED--aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~ 149 (388)
.+-++|..+..++.. +...|.|=+ --.|..++..+|-+.+.- -+ ..--...+....+-|+..++.++..+..
T Consensus 1017 kle~~l~~le~~le~--e~~~r~e~Ek~~rkle~el~~~~e~~~~~--~~--~~~el~~~l~kke~El~~l~~k~e~e~~ 1090 (1930)
T KOG0161|consen 1017 KLEQQLDDLEVTLER--EKRIRMELEKAKRKLEGELKDLQESIEEL--KK--QKEELDNQLKKKESELSQLQSKLEDEQA 1090 (1930)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhhhHHHHH--HH--HHHHHHHHHHHHHHHHHHHHHHhhHHHH
Confidence 555566655555543 333333333 123444444443333210 00 0000011122233344444444444444
Q ss_pred HHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh---------------------
Q 047021 150 LRQQERKQ--------------LTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA--------------------- 194 (388)
Q Consensus 150 lrqqeqqr--------------la~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a--------------------- 194 (388)
++.+-+.. |..|-...+-+...+..|...+..+...-.+..+..+
T Consensus 1091 ~~~~l~k~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ele~l~~~Lee~~~~t~~q~e~~~k~e~e~~~l~~~le 1170 (1930)
T KOG0161|consen 1091 EVAQLQKQIKELEARIKELEEELEAERASRAKAERQRRDLSEELEELKEELEEQGGTTAAQLELNKKREAEVQKLRRDLE 1170 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44331111 1111111122222222233333332221111100000
Q ss_pred ----hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q 047021 195 ----ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDC 270 (388)
Q Consensus 195 ----~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDc 270 (388)
.+......-|.++...++++...++.+.--.++|.-|=..=..|+..|-.+.+.++++-.++--....-|.|+.+-
T Consensus 1171 ee~~~~e~~~~~lr~~~~~~~~el~~qle~l~~~k~~lekek~~lq~e~~~l~~ev~~~~~~k~~~e~~~k~~E~~l~el 1250 (1930)
T KOG0161|consen 1171 EETLDHEAQIEELRKKHADSLAELQEQLEQLQKDKAKLEKEKSDLQREIADLAAELEQLSSEKKDLEKKDKKLEAQLSEL 1250 (1930)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHH
Confidence 1111122446677778888999999988888887777666667777788888888888888887777777666654
Q ss_pred HHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccc-cCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHH
Q 047021 271 LKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVS-QNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSA 349 (388)
Q Consensus 271 lkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~-~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsA 349 (388)
=.-..++...+..+= -|.+-+-.......+...+....... ...-......+..+|.+|-.+--+..+|..++.++.-
T Consensus 1251 q~k~~~~~~~~~~l~-~q~~~l~~E~~~l~~~lee~e~~~~~~~r~~~~~~~qle~~k~qle~e~r~k~~l~~~l~~l~~ 1329 (1930)
T KOG0161|consen 1251 QLKLDEQERLRNDLT-AKRSRLQNENEELSRQLEEAEAKLSALSRDKQALESQLEELKRQLEEETREKSALENALRQLEH 1329 (1930)
T ss_pred HHHHHHHHHHHHHHH-HHHHHhhhhHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444333322210 01111100000000000000000000 0012233346888899999888888888888888887
Q ss_pred HHHHHHH
Q 047021 350 QLQQTTQ 356 (388)
Q Consensus 350 eL~ra~Q 356 (388)
+.....-
T Consensus 1330 e~~~l~e 1336 (1930)
T KOG0161|consen 1330 ELDLLRE 1336 (1930)
T ss_pred HHHHHHH
Confidence 7665543
No 7
>PF13514 AAA_27: AAA domain
Probab=96.99 E-value=0.94 Score=50.96 Aligned_cols=253 Identities=19% Similarity=0.240 Sum_probs=137.6
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh-hHHHHhhhhhh-----------------hhh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP-SLRKALKDVAM-----------------EKD 64 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~-slrkaL~~v~m-----------------EkD 64 (388)
..+|..|-.....++.....+...+.+...+.+.+..++..+....+| .++.+|..+-= .+.
T Consensus 345 ~~~i~~l~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~l~~al~~~~~~~d~~~~~~~~~~~~~~~~~ 424 (1111)
T PF13514_consen 345 RERIRELLQEREQLEQALAQARRELEEAERELEQLQAELAALPAPPDPEALRAALEAAQRLGDLEARLQEAEQALEAAER 424 (1111)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCcccCCChHHHHHHHHHHhcccHHHHHHHHHHHHHHHHH
Confidence 457888889999999999999999999999999999999988887763 35555442211 111
Q ss_pred hHHHhhhhH-----------------HHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCC-----
Q 047021 65 AAVVAREDL-----------------SAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISS----- 122 (388)
Q Consensus 65 aA~vareDl-----------------~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~----- 122 (388)
....+-..| .+++....++..+.+...-++......++.++..+..+...-..+|-.+
T Consensus 425 ~l~~~l~~L~~w~~~~~~l~~~~~P~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~v~t~~~l 504 (1111)
T PF13514_consen 425 RLAAALAALGPWSGDLDALAALPLPSRETVEAFRAEFEELERQLRRARDRLEELEEELARLEARLRRLAAAGDVPTEEEL 504 (1111)
T ss_pred HHHHHHHhcCCCCCChHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHH
Confidence 111111111 1455566666667666666666666666666666655544333332211
Q ss_pred ----------------CCCChhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 123 ----------------MGISPDH---VQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFS 183 (388)
Q Consensus 123 ----------------~g~s~~~---~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~ 183 (388)
-..+++. .+.-=-.++|.....-+...-.+.-+.+++........+..+..+++..+..+.
T Consensus 505 ~~aR~~Rd~~W~~~~~~~~~~~~fe~a~~~aD~laD~~~~~a~~~a~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 584 (1111)
T PF13514_consen 505 AAARARRDAAWQLAALDAALAEAFEAAVREADELADRRLREAERAARLAQLRARLEEARARLARAQARLAAAEAALAALE 584 (1111)
T ss_pred HHHHHHHHHHhcccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 0111111 112223556666655555555555666666666666666666666666666554
Q ss_pred hh-------ch-hhhhhhhhhhhcch-----hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhh
Q 047021 184 RK-------AS-EEVSNAAASKAFSV-----EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLS 250 (388)
Q Consensus 184 ~~-------~s-e~~~~~a~~k~~S~-----eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Ls 250 (388)
.. .. |..| .+...|+.. +.-+.+...-..+.....++...++.|...++.-.. +..|.
T Consensus 585 ~~W~~~~~~~g~p~~p-~~~~~Wl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~l~~~~~--------~~~l~ 655 (1111)
T PF13514_consen 585 AAWAALWAAAGLPLSP-AEMRDWLARREAALEAAEELRAARAELEALRARRAAARAALAAALAALGP--------AEELA 655 (1111)
T ss_pred HHHHHHHhhcCCCCCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------cccHH
Confidence 32 11 1112 223444422 233333444344445555555566655555544322 34444
Q ss_pred hhHHHHHHHHHHHH
Q 047021 251 SSYQESMGIAKHWE 264 (388)
Q Consensus 251 ss~qea~~iaaqWE 264 (388)
.-+..+......|+
T Consensus 656 ~~l~~a~~~~~~~~ 669 (1111)
T PF13514_consen 656 ALLEEAEALLEEWE 669 (1111)
T ss_pred HHHHHHHHHHHHHH
Confidence 44444444444443
No 8
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=96.89 E-value=0.34 Score=54.74 Aligned_cols=76 Identities=22% Similarity=0.358 Sum_probs=42.2
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
..+.++.++.+++.+...+...+.+...+...+..+++.+.... +.++..+..++-.+++..-.++++-.+++..+
T Consensus 731 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 806 (1163)
T COG1196 731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL-AKLKEEIEELEEKRQALQEELEELEEELEEAE 806 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555555555555555555555544444 55555555555555666655666666666665
No 9
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=95.93 E-value=1.6 Score=41.47 Aligned_cols=119 Identities=29% Similarity=0.353 Sum_probs=81.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhh
Q 047021 155 RKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDN 234 (388)
Q Consensus 155 qqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~ 234 (388)
-.|||.=..++..|..+-..|+.+|..+.....+..+ .++..++..|+++-..|..+-.-+.+|..++|+
T Consensus 10 NdRla~YIekVr~LE~~N~~Le~~i~~~~~~~~~~~~----------~~~~~ye~el~~lr~~id~~~~eka~l~~e~~~ 79 (312)
T PF00038_consen 10 NDRLASYIEKVRFLEQENKRLESEIEELREKKGEEVS----------RIKEMYEEELRELRRQIDDLSKEKARLELEIDN 79 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH---------H----------HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhhhhHHHHHHHHhcccccCc----------ccccchhhHHHHhHHhhhhHHHHhhHHhhhhhh
Confidence 4577888889999999999999999998866433222 234555666666666666666666666666666
Q ss_pred hhHHHHHH--------------hhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 235 QSSEIERL--------------FEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK 283 (388)
Q Consensus 235 QS~EIErL--------------f~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek 283 (388)
-..+|+.+ =.+..+|...+.+++.+-..++++++---.+.+-++..-+.
T Consensus 80 l~~e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L~eEl~fl~~~hee 142 (312)
T PF00038_consen 80 LKEELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSLKEELEFLKQNHEE 142 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHHHHHHHHHHhhhhh
Confidence 66665543 45778888999999999999999998877777777765554
No 10
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=95.85 E-value=4.8 Score=46.37 Aligned_cols=32 Identities=9% Similarity=0.187 Sum_probs=19.4
Q ss_pred hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHH
Q 047021 322 EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQ 353 (388)
Q Consensus 322 e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~r 353 (388)
++..|...-+.-.+....|..+|.+|-.+|..
T Consensus 1062 ~~~~l~~~~a~l~g~~k~le~qi~~l~~eL~e 1093 (1311)
T TIGR00606 1062 NIDLIKRNHVLALGRQKGYEKEIKHFKKELRE 1093 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 34444444444455566677777777777743
No 11
>PRK11281 hypothetical protein; Provisional
Probab=95.79 E-value=5.2 Score=46.24 Aligned_cols=189 Identities=19% Similarity=0.137 Sum_probs=109.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhH
Q 047021 132 TLEKELATLKSQLQQESQLRQ-------QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDK 204 (388)
Q Consensus 132 ~lekEl~~l~~~lqqes~lrq-------qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDk 204 (388)
.||..+.++..+|++...--. .-+-+.++.|.+++....+.|++...+.... +..+++|..-+
T Consensus 125 qLEq~L~q~~~~Lq~~Q~~La~~NsqLi~~qT~PERAQ~~lsea~~RlqeI~~~L~~~~----------~~~~~l~~~~~ 194 (1113)
T PRK11281 125 QLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGK----------VGGKALRPSQR 194 (1113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCC----------CCCCcCCHHHH
Confidence 366666666555555333222 2455566777777888888888887665411 11223666667
Q ss_pred HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKL 284 (388)
Q Consensus 205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLekl 284 (388)
.+|..++.-+-.. ++-+-.| -..|+.+.+=++.=+.....--.+.+ .+...|...++.-
T Consensus 195 ~~l~ae~~~l~~~--------------~~~~~~~----l~~~~~l~~l~~~q~d~~~~~~~~~~---~~~~~lq~~in~k 253 (1113)
T PRK11281 195 VLLQAEQALLNAQ--------------NDLQRKS----LEGNTQLQDLLQKQRDYLTARIQRLE---HQLQLLQEAINSK 253 (1113)
T ss_pred HHHHHHHHHHHHH--------------HHHHHHH----HhcchHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 7776664433322 2222111 12677777777776666655444333 3344566666665
Q ss_pred HhhhhccCCCCccccccccCCCCCCccccCCCchhhh---hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhh
Q 047021 285 RTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKN---EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYN 359 (388)
Q Consensus 285 R~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~t---e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n 359 (388)
|.+++-..-... . .+.+.....++|.... .|..|-.+|.+--.|...|+.+-.+.-..|.+.+|+..
T Consensus 254 r~~~se~~~~~a----~----~~~~~~~~~~~p~i~~~~~~N~~Ls~~L~~~t~~~~~l~~~~~~~~~~l~~~~q~~~ 323 (1113)
T PRK11281 254 RLTLSEKTVQEA----Q----SQDEAARIQANPLVAQELEINLQLSQRLLKATEKLNTLTQQNLRVKNWLDRLTQSER 323 (1113)
T ss_pred HHHHHHHHHHHH----h----hhhhhcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555443311000 0 0000111222444433 48899999999999999999999999999999999987
No 12
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=95.33 E-value=2.7 Score=40.03 Aligned_cols=166 Identities=31% Similarity=0.349 Sum_probs=103.3
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
|++.|.+|+..|..|..+|.+|...+++++.-+.-|-.++-++..... |+++||.-- -+ -++=..-|++.=
T Consensus 6 L~~~v~dL~~~n~~L~~en~kL~~~ve~~ee~na~L~~e~~~L~~q~~-s~Qqal~~a-----K~---l~eEledLk~~~ 76 (193)
T PF14662_consen 6 LLSCVEDLQLNNQKLADENAKLQRSVETAEEGNAQLAEEITDLRKQLK-SLQQALQKA-----KA---LEEELEDLKTLA 76 (193)
T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH-----HH---HHHHHHHHHHHH
Confidence 689999999999999999999999999999999999999999877654 778877321 01 111122222222
Q ss_pred HHhHH----HHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH-HHHHHHHHH---
Q 047021 82 KRLKE----AEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL-QQESQLRQQ--- 153 (388)
Q Consensus 82 kRlkE----AEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l-qqes~lrqq--- 153 (388)
+-+.| .=..-|-.|...-.|-+++.+||..-+.... -.+..-+.+..|-.+-.+|+.++ .=++++.+.
T Consensus 77 ~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~kl~~----e~~~lk~~~~eL~~~~~~Lq~Ql~~~e~l~~~~da~ 152 (193)
T PF14662_consen 77 KSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGKLLA----ERDGLKKRSKELATEKATLQRQLCEFESLICQRDAI 152 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH----hhhhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 22111 0111122344556788888888877643322 12234566777777788888887 444444442
Q ss_pred --H----HHHH----HHHHHHHHHHHHHHHHHHHHHH
Q 047021 154 --E----RKQL----TEEQARISVLMSEKQELEEKLV 180 (388)
Q Consensus 154 --e----qqrl----a~Eq~~~a~l~aEkqeLeekl~ 180 (388)
+ .+-| ++=...+..|++||-.||+++.
T Consensus 153 l~e~t~~i~eL~~~ieEy~~~teeLR~e~s~LEeql~ 189 (193)
T PF14662_consen 153 LSERTQQIEELKKTIEEYRSITEELRLEKSRLEEQLS 189 (193)
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1 1112 2223345567777777777764
No 13
>PRK04863 mukB cell division protein MukB; Provisional
Probab=95.29 E-value=7.6 Score=46.15 Aligned_cols=182 Identities=20% Similarity=0.269 Sum_probs=106.2
Q ss_pred HhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC--CCCCCC----------------CCCCChhh
Q 047021 68 VAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG--SLGGIS----------------SMGISPDH 129 (388)
Q Consensus 68 vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~--~~s~~~----------------~~g~s~~~ 129 (388)
++-++|-.-+.....++.+.+.+--..|..-..+.+.+..+......- -.+.++ ...+-.++
T Consensus 435 ~SdEeLe~~LenF~aklee~e~qL~elE~kL~~lea~leql~~~~~~l~~~~Gkv~~~~a~~~~~~~~~~~~~~~~~~~~ 514 (1486)
T PRK04863 435 LTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQ 514 (1486)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCHHHHHHHHHHHHHHhHHHHHHHHh
Confidence 345777777778888888888777777777777777777777664321 122222 11133444
Q ss_pred HHHHHHHHHHHHHHHHH----HHHHHH----------------HHHHH-------H----HHHHHHHHHHHHHHHHHHHH
Q 047021 130 VQTLEKELATLKSQLQQ----ESQLRQ----------------QERKQ-------L----TEEQARISVLMSEKQELEEK 178 (388)
Q Consensus 130 ~~~lekEl~~l~~~lqq----es~lrq----------------qeqqr-------l----a~Eq~~~a~l~aEkqeLeek 178 (388)
...+...+++|+-+|++ +.++.+ +.... + +.--++-..+..+-++|...
T Consensus 515 ~~~~~~~~~~l~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~qL~~~ 594 (1486)
T PRK04863 515 LQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQAR 594 (1486)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 66788888888886554 222222 01111 1 11222335566677778888
Q ss_pred HHHHhhhchh-hhhhhhhhh-----hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhh
Q 047021 179 LVMFSRKASE-EVSNAAASK-----AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSL 249 (388)
Q Consensus 179 l~~~~~~~se-~~~~~a~~k-----~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~L 249 (388)
|..+++.+|. -....|..+ .-..+|...+......+..-.--+--.|+++=..|..--.+|++||.-+.+.
T Consensus 595 i~~l~~~ap~W~~a~~al~~L~eq~g~~~~~~~~v~~~mq~~~~~~~~~~~~~~~~~~~~~~L~~~i~~l~~~~~g~ 671 (1486)
T PRK04863 595 IQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEIERLSQPGGSE 671 (1486)
T ss_pred HHHHHHhChHHHhhHHHHHHHHHhcchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCc
Confidence 8888877776 222222222 1123666666655555555444455566677777777777788888765554
No 14
>PRK02224 chromosome segregation protein; Provisional
Probab=95.24 E-value=5.6 Score=43.09 Aligned_cols=34 Identities=32% Similarity=0.431 Sum_probs=19.1
Q ss_pred HHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 79 TLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 79 ~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
...++..+++.+.+.+.++...+..+++.+-...
T Consensus 555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 588 (880)
T PRK02224 555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERI 588 (880)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 3334445555566666666666666666655533
No 15
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=95.22 E-value=5.6 Score=42.97 Aligned_cols=225 Identities=24% Similarity=0.387 Sum_probs=111.4
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRL 84 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRl 84 (388)
+|...++.+.+|...|..|..+......+++.|+..|+.... ..-.|+.-.++++-....+.--++.|..++-.+..|+
T Consensus 144 qlE~~qkE~eeL~~~~~~Le~e~~~l~~~v~~l~~eL~~~~e-e~e~L~~~~kel~~~~e~l~~E~~~L~~q~~e~~~ri 222 (546)
T PF07888_consen 144 QLEECQKEKEELLKENEQLEEEVEQLREEVERLEAELEQEEE-EMEQLKQQQKELTESSEELKEERESLKEQLAEARQRI 222 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555666666666666666666666666666666664433 3445555555555544444433444444444444444
Q ss_pred HHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 85 KEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQAR 164 (388)
Q Consensus 85 kEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~ 164 (388)
.+-| .++.++.+...... --.+++..+..++...+.+|+ +||......
T Consensus 223 ~~LE--------------edi~~l~qk~~E~e--------~~~~~lk~~~~elEq~~~eLk----------~rLk~~~~~ 270 (546)
T PF07888_consen 223 RELE--------------EDIKTLTQKEKEQE--------KELDKLKELKAELEQLEAELK----------QRLKETVVQ 270 (546)
T ss_pred HHHH--------------HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHH
Confidence 4333 33334444430000 000111222222222222222 222222222
Q ss_pred H-------HHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhH
Q 047021 165 I-------SVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSS 237 (388)
Q Consensus 165 ~-------a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~ 237 (388)
. ..+..+...|.++|...-..... |......|.+.|.||+ +-|++.+.|+-....
T Consensus 271 ~~~~~~~~~~~~~e~e~LkeqLr~~qe~lqa-----------Sqq~~~~L~~EL~~~~-------~~RDrt~aeLh~aRL 332 (546)
T PF07888_consen 271 LKQEETQAQQLQQENEALKEQLRSAQEQLQA-----------SQQEAELLRKELSDAV-------NVRDRTMAELHQARL 332 (546)
T ss_pred HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHhhh
Confidence 2 23334444444444433211110 1233445666665554 457889999988888
Q ss_pred HHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH
Q 047021 238 EIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELR-GILEKL 284 (388)
Q Consensus 238 EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr-~lLekl 284 (388)
|.+.| +-.|+...-+..+-.++|..--. .|.+.-+.. +-+++|
T Consensus 333 e~aql---~~qLad~~l~lke~~~q~~qEk~-~l~~~~e~~k~~ie~L 376 (546)
T PF07888_consen 333 EAAQL---KLQLADASLELKEGRSQWAQEKQ-ALQHSAEADKDEIEKL 376 (546)
T ss_pred hHHHH---HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhHHHHHHH
Confidence 88888 55677766677777889986555 555544333 445443
No 16
>PHA02562 46 endonuclease subunit; Provisional
Probab=94.97 E-value=4.8 Score=40.93 Aligned_cols=138 Identities=15% Similarity=0.214 Sum_probs=77.9
Q ss_pred HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021 11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE 90 (388)
Q Consensus 11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE 90 (388)
.+..+++.++..|..++.....+.+.+.+.+.+++..... .+.++.=+.|.......++-+++..++..+.....+
T Consensus 174 ~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~~~~----~i~~l~~e~~~l~~~~~~l~~~l~~l~~~i~~l~~~ 249 (562)
T PHA02562 174 DKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGE----NIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMD 249 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3444566666666666666666666666666666554332 233444455566666666777777666666554322
Q ss_pred -------HhhhHhhHHHHHHHHHHHHHHHhcCCCC-CCC----CCCCChhhHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 91 -------QYRAEEDAAALRAELNSLQQQAMDGSLG-GIS----SMGISPDHVQTLEKELATLKSQLQQESQLRQ 152 (388)
Q Consensus 91 -------QyRAEEDaAaLRAEln~lQqq~~~~~~s-~~~----~~g~s~~~~~~lekEl~~l~~~lqqes~lrq 152 (388)
--.++.+-+.+...++.++....-.... -.+ ++..+++.+..+...+.+++.++..-.....
T Consensus 250 i~~~~~~L~~l~~~~~~~~~~l~~~~~~~~~~~~~~~Cp~C~~~~~~~~~~~~~l~d~i~~l~~~l~~l~~~i~ 323 (562)
T PHA02562 250 IEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAID 323 (562)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2235555555666666665544322111 111 3334477788888888888888776444433
No 17
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=94.71 E-value=9.1 Score=42.83 Aligned_cols=84 Identities=30% Similarity=0.423 Sum_probs=57.6
Q ss_pred HHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021 10 QKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE 89 (388)
Q Consensus 10 q~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE 89 (388)
+-++..+.++|..|..+++.+..+..+....++ ...-|.| +|+.++ |++-.|.++.++..++.--.
T Consensus 2 q~ql~~~q~E~e~L~~ele~~~~~l~~~~~~i~---~fwspEl---------krer~~--rkee~a~l~~~k~qlr~~q~ 67 (775)
T PF10174_consen 2 QAQLERLQRENERLRRELERKQSKLGSSMNSIK---TFWSPEL---------KRERAL--RKEEAAELSRLKEQLRVTQE 67 (775)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHhHHHHHHHhHh---cccchhh---------HHHHHH--HHHHHHHHHhHHHHHHHHHh
Confidence 345556667777777777766655555544332 2333543 344443 67788999999999988888
Q ss_pred HHhhhHhhHHHHHHHHHH
Q 047021 90 EQYRAEEDAAALRAELNS 107 (388)
Q Consensus 90 EQyRAEEDaAaLRAEln~ 107 (388)
+-.++.+...+|..+|.+
T Consensus 68 e~q~~~~ei~~LqeELr~ 85 (775)
T PF10174_consen 68 ENQKAQEEIQALQEELRA 85 (775)
T ss_pred hHHHHHHHHHHHHHHHHH
Confidence 888999999999988833
No 18
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=94.47 E-value=16 Score=44.69 Aligned_cols=157 Identities=28% Similarity=0.395 Sum_probs=83.6
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ 149 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~ 149 (388)
+..+-..|+.+..|+- -.|+..+.|-.+.+.+++-... -.+++.+++..+..+..+...-..
T Consensus 903 k~~le~~l~~~~~~~e-------~~ee~~~~le~~~~~~~~e~~~-----------l~~~~~~~E~~~~k~~~Ek~~~e~ 964 (1930)
T KOG0161|consen 903 KQELEKELKELKERLE-------EEEEKNAELERKKRKLEQEVQE-----------LKEQLEELELTLQKLELEKNAAEN 964 (1930)
T ss_pred HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444443 3344455566666665544321 123445555555555555555444
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHH
Q 047021 150 LRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLL 229 (388)
Q Consensus 150 lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL 229 (388)
-....+..++.=-..++.|..++.++++++..+............ .+ ..-+.||+.+|.||-.-.++=+..|..+
T Consensus 965 ~~~~l~~e~~~~~e~~~kL~kekk~lEe~~~~l~~~l~~~eek~~---~l-~k~~~kle~~l~~le~~le~e~~~r~e~- 1039 (1930)
T KOG0161|consen 965 KLKNLEEEINSLDENISKLSKEKKELEERIRELQDDLQAEEEKAK---SL-NKAKAKLEQQLDDLEVTLEREKRIRMEL- 1039 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence 444445555554566788888888888888887755333222110 01 2346777888777777776655554433
Q ss_pred HHhhhhhHHHHHHhhhhhhhhhhHHH
Q 047021 230 MEIDNQSSEIERLFEENSSLSSSYQE 255 (388)
Q Consensus 230 ~EID~QS~EIErLf~ENs~Lsss~qe 255 (388)
+ ..+-+|=+++..+-.+..+
T Consensus 1040 ---E---k~~rkle~el~~~~e~~~~ 1059 (1930)
T KOG0161|consen 1040 ---E---KAKRKLEGELKDLQESIEE 1059 (1930)
T ss_pred ---H---HHHHHHHHHHHHhhhHHHH
Confidence 3 3334444555444444444
No 19
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=93.28 E-value=16 Score=40.64 Aligned_cols=151 Identities=27% Similarity=0.292 Sum_probs=102.9
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA 87 (388)
.|..+..+|+.+.+-+...+.....+++-|....+++. +. .+++-.+-+.++.-+||.
T Consensus 31 ~~~~~i~~l~~elk~~~~~~~~~~~e~~rl~~~~~~~~--------~~--------------~~~~e~~~~~lr~e~ke~ 88 (717)
T PF09730_consen 31 YLQQRILELENELKQLRQELSNVQAENERLSQLNQELR--------KE--------------CEDLELERKRLREEIKEY 88 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HH--------------HHHHHHHHHHHHHHHHHH
Confidence 34556666666666666666666666666665555432 22 345555666667777777
Q ss_pred HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISV 167 (388)
Q Consensus 88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~ 167 (388)
-.-..|-=-|-.-|-.|-=+||.|+..=- ++.+-|----.+|..|+.|+.-+.+++.....|+---..+|.+...-+..
T Consensus 89 K~rE~rll~dyselEeENislQKqvs~Lk-~sQvefE~~Khei~rl~Ee~~~l~~qlee~~rLk~iae~qleEALesl~~ 167 (717)
T PF09730_consen 89 KFREARLLQDYSELEEENISLQKQVSVLK-QSQVEFEGLKHEIKRLEEEIELLNSQLEEAARLKEIAEKQLEEALESLKS 167 (717)
T ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77778888899999999999999994322 22222221234588899999999999999999988777777777655555
Q ss_pred HHHHHHHHHHHHHH
Q 047021 168 LMSEKQELEEKLVM 181 (388)
Q Consensus 168 l~aEkqeLeekl~~ 181 (388)
=+-.|-.|+..|..
T Consensus 168 EReqk~~LrkEL~~ 181 (717)
T PF09730_consen 168 EREQKNALRKELDQ 181 (717)
T ss_pred HHHHHHHHHHHHHH
Confidence 55556666666666
No 20
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=93.22 E-value=0.025 Score=60.44 Aligned_cols=132 Identities=26% Similarity=0.346 Sum_probs=0.0
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA 87 (388)
.|.++|.+++.+...|....+....+++.|..+++..+.. +.-...-+.+...|+-+++|.++..+.+.
T Consensus 190 ~l~q~~~el~~~i~~L~~e~~~L~~e~~~l~~~~~~~~~~-----------~~~~~~~~~~~~~~l~~ql~~L~~el~~~ 258 (713)
T PF05622_consen 190 ELAQRCHELEKQISDLQEEKESLQSENEELQERLSQLEGS-----------SEEPSQHLSVELADLRAQLRRLREELERL 258 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHhHHHhhhhhhhhhcccCCCCCC-----------CCCcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555565555555555555555666666666655510 01111112233457888888888877766
Q ss_pred HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCC--CCC-------CCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLG--GIS-------SMGISPDHVQTLEKELATLKSQLQQESQL 150 (388)
Q Consensus 88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s--~~~-------~~g~s~~~~~~lekEl~~l~~~lqqes~l 150 (388)
|+-.--.+.....+..+|..|++....-..- .+. -+.-..+....++.++..|+-.|.--.-+
T Consensus 259 e~~~~d~~~~~e~le~ei~~L~q~~~eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YKkKLed~~~l 330 (713)
T PF05622_consen 259 EEQRDDLKIELEELEKEIDELRQENEELQAEAREARALRDELDELREKADRADKLENEVEKYKKKLEDLEDL 330 (713)
T ss_dssp ------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5532222333445667777777776432111 111 12223333444666666666555543333
No 21
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=93.19 E-value=19 Score=41.19 Aligned_cols=42 Identities=14% Similarity=0.137 Sum_probs=27.1
Q ss_pred HHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHHHHHHH
Q 047021 328 GQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRLYKPVL 369 (388)
Q Consensus 328 ~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rly~PvL 369 (388)
.++-.-..|-+.|..+...|...........+.+-+..+.++
T Consensus 970 ee~e~~~~r~~~l~~~~~dl~~a~~~l~~~i~~~d~~~~~~f 1011 (1163)
T COG1196 970 EEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERF 1011 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455566677777777777777777766666666555544
No 22
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.88 E-value=23 Score=41.15 Aligned_cols=188 Identities=15% Similarity=0.164 Sum_probs=89.7
Q ss_pred hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHh------hhHhhHHHHHHHHHHHHHHHhcCCCC
Q 047021 45 EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQY------RAEEDAAALRAELNSLQQQAMDGSLG 118 (388)
Q Consensus 45 e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQy------RAEEDaAaLRAEln~lQqq~~~~~~s 118 (388)
....+|.+++-+.++.-+.+.+.-.-+++-+.|-.++..+..++.-.- |...+...++.+|..|......
T Consensus 742 ~~~eip~l~~~l~~le~~l~~~~~~le~~~~~l~~~~~~~~~~esL~~~v~~i~r~~~ei~~l~~qie~l~~~l~~---- 817 (1311)
T TIGR00606 742 KEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQG---- 817 (1311)
T ss_pred HHhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----
Confidence 345678888888777666555555555555555555555554443211 2244555555555555444421
Q ss_pred CCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhh
Q 047021 119 GISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKA 198 (388)
Q Consensus 119 ~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~ 198 (388)
+|. ...+..++.++..+..++..-..- +..-+.....+..+...|..++..+...-.. ...
T Consensus 818 ----~~~-~~s~~ele~ei~~~~~el~~l~~~-------~e~l~~e~e~~~~eI~~Lq~ki~el~~~klk------l~~- 878 (1311)
T TIGR00606 818 ----SDL-DRTVQQVNQEKQEKQHELDTVVSK-------IELNRKLIQDQQEQIQHLKSKTNELKSEKLQ------IGT- 878 (1311)
T ss_pred ----ccc-cCCHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHH-
Confidence 111 124566777777666555442111 1111222222333333333333322211000 000
Q ss_pred cchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021 199 FSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQES 256 (388)
Q Consensus 199 ~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea 256 (388)
....+..++.+|.++...++-+...=..+=.+|..-..+|+++..+...+.......
T Consensus 879 -~l~~r~~le~~L~el~~el~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 935 (1311)
T TIGR00606 879 -NLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETS 935 (1311)
T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 344666677776666655555544444454555555555555555555544444333
No 23
>PRK11637 AmiB activator; Provisional
Probab=92.84 E-value=12 Score=37.86 Aligned_cols=40 Identities=23% Similarity=0.231 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 73 LSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 73 l~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
+..++..+++++...+.+--.++.+-..++..++.+-..+
T Consensus 94 ~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra~ 133 (428)
T PRK11637 94 TQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDAA 133 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445555555555555555555555555554444333
No 24
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=92.81 E-value=17 Score=39.61 Aligned_cols=79 Identities=24% Similarity=0.297 Sum_probs=50.0
Q ss_pred HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
|..++....+.|..|+....+++.....|+.+|.++.... |...+-|. +|+-|.+-++| +++|=+.+|..|.|-+
T Consensus 99 L~~qlqaqv~~ne~Ls~L~~EqEerL~ELE~~le~~~e~~-~D~~kLLe--~lqsdk~t~SR--AlsQN~eLK~QL~Elq 173 (617)
T PF15070_consen 99 LEEQLQAQVENNEQLSRLNQEQEERLAELEEELERLQEQQ-EDRQKLLE--QLQSDKATASR--ALSQNRELKEQLAELQ 173 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHh--hhcccchHHHH--HHHhHHHHHHHHHHHH
Confidence 3333333344455555555666666666666666655433 44555554 36778888888 5888899999999987
Q ss_pred HHHh
Q 047021 89 EEQY 92 (388)
Q Consensus 89 EEQy 92 (388)
+-=.
T Consensus 174 ~~Fv 177 (617)
T PF15070_consen 174 DAFV 177 (617)
T ss_pred HHHH
Confidence 7333
No 25
>PF13514 AAA_27: AAA domain
Probab=92.74 E-value=21 Score=40.53 Aligned_cols=161 Identities=18% Similarity=0.283 Sum_probs=90.3
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC-CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG-SLGGISSMGISPDHVQTLEKELATLKSQLQQES 148 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~-~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes 148 (388)
-+...+.++.+..++..++.+--+++.....+..++...+++.... ...|+ +-+++|..+...-..+.++...+.+-.
T Consensus 668 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~l~~~gL-~~~~~~~~~~~~l~~l~~l~~~~~~~~ 746 (1111)
T PF13514_consen 668 WEQAAARREQLEEELQQLEQELEEAEAELQEAQEALEEWQEEWQEALAELGL-PADASPEEALEALELLEELREALAEIR 746 (1111)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455566666666666666666666666666666777666665322 22345 445677777776666665555544433
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhh----h---hhhhcchhhHHHHHHHHhHHHHHHHHH
Q 047021 149 QLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNA----A---ASKAFSVEDKEKLEKQLHDMAVVVERL 221 (388)
Q Consensus 149 ~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~----a---~~k~~S~eDkekle~Ql~dma~~VeRL 221 (388)
.+ ..++..+..+...++.++..+.....+..+.. + ....+.. -..-..++..+...++.+
T Consensus 747 ~~-----------~~ri~~~~~~~~~f~~~~~~L~~~l~~~~~~~~~~~~~~~L~~~l~~--a~~~~~~~~~l~~~~~~~ 813 (1111)
T PF13514_consen 747 EL-----------RRRIEQMEADLAAFEEQVAALAERLGPDLPEDPAEEALEALRARLEE--AREAQEERERLQEQLEEL 813 (1111)
T ss_pred HH-----------HHHHHHHHHHHHHHHHHHHHHHHHcCcccccCcHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 33 23556666777777777777665522211111 1 1222211 111123334444556666
Q ss_pred HhhHHHHHHHhhhhhHHHHHHhh
Q 047021 222 ESSRQKLLMEIDNQSSEIERLFE 244 (388)
Q Consensus 222 E~SRqKLL~EID~QS~EIErLf~ 244 (388)
+...+.+-.++..--.+|+.||.
T Consensus 814 ~~~~~~~~~~l~~~~~~l~~L~~ 836 (1111)
T PF13514_consen 814 EEELEQAEEELEELEAELAELLE 836 (1111)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 66666666677777777888887
No 26
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=92.46 E-value=1.8 Score=46.69 Aligned_cols=138 Identities=23% Similarity=0.314 Sum_probs=74.9
Q ss_pred hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHH--hhhhhhhhh-hHHHHHHHHHHHH----HHHHHHHHHH
Q 047021 202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERL--FEENSSLSS-SYQESMGIAKHWE----NQVKDCLKQN 274 (388)
Q Consensus 202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErL--f~ENs~Lss-s~qea~~iaaqWE----nQvqDclkQN 274 (388)
+....|... +.-|+....+|-.+|+.=..+|+++ -++...-.. -+.=...=++.|+ +.+.--=+-|
T Consensus 503 e~~~~L~~~-------~~~Le~e~~~L~~~~~~Le~~l~~~~L~g~~~~~~trVL~lr~NP~~~~~~~k~~~l~~L~~En 575 (722)
T PF05557_consen 503 EELNELQKE-------IEELERENERLRQELEELESELEKLTLQGEFNPSKTRVLHLRDNPTSKAEQIKKSTLEALQAEN 575 (722)
T ss_dssp HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHCCCT--BTTTEEEEEESS-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCceeeeeCCCcHHHHHHHHHHHHHHHHHHH
Confidence 444556666 7778888888888999888899983 122211000 0000011233343 2222222346
Q ss_pred HHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH
Q 047021 275 EELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQT 354 (388)
Q Consensus 275 eeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra 354 (388)
+.|+..|..+-. .+..|.. -+...+......++..|+.++.....|..-|--=..+-|.+.+.|
T Consensus 576 ~~L~~~l~~le~--~~~~~~~--------------~~p~~~~~~~~~e~~~l~~~~~~~ekr~~RLkevf~~ks~eFr~a 639 (722)
T PF05557_consen 576 EDLLARLRSLEE--GNSQPVD--------------AVPTSSLESQEKEIAELKAELASAEKRNQRLKEVFKAKSQEFREA 639 (722)
T ss_dssp HHHHHHHHHHTT--TT------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhccc--CCCCCcc--------------cccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 666665544421 1111111 111111233345788899999998888888888888889999999
Q ss_pred HHHhhhhH
Q 047021 355 TQAYNGLV 362 (388)
Q Consensus 355 ~QA~n~L~ 362 (388)
+..+.|+-
T Consensus 640 v~~llGyk 647 (722)
T PF05557_consen 640 VYSLLGYK 647 (722)
T ss_dssp HHHHHSEE
T ss_pred HHHHhcce
Confidence 98888853
No 27
>PF09728 Taxilin: Myosin-like coiled-coil protein; InterPro: IPR019132 Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca(2+)-dependent exocytosis in neuroendocrine cells []. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localises to the nucleus in osteoblasts and dimerises with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription [].
Probab=92.11 E-value=14 Score=36.82 Aligned_cols=254 Identities=22% Similarity=0.319 Sum_probs=150.5
Q ss_pred hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH-----
Q 047021 13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA----- 87 (388)
Q Consensus 13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA----- 87 (388)
+...++....|..+..+-..++..+++++..+++ .+-.+.+-...+.-|.+.++.+|+.|=.=-|-+++.-+..
T Consensus 17 l~~~eeK~~~L~kk~~ell~e~k~~~k~~~~~~K-k~~~l~kek~~l~~E~~k~~~~k~KLE~LCRELQk~Nk~lkeE~~ 95 (309)
T PF09728_consen 17 LSSPEEKLEALCKKYAELLEEMKRLQKQLKKLQK-KQEQLQKEKDQLQSELSKAILAKSKLESLCRELQKQNKKLKEESK 95 (309)
T ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444555555555555555555544432 2333444455556677788888887766656666554443
Q ss_pred ---HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC-----------------CCCCChhh----HHHHHHHHHHHHHH
Q 047021 88 ---EEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS-----------------SMGISPDH----VQTLEKELATLKSQ 143 (388)
Q Consensus 88 ---EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~-----------------~~g~s~~~----~~~lekEl~~l~~~ 143 (388)
.++.-+-.+=+.-+..-|+.||.++..+..-... .|.....| +...+.|++=+.+.
T Consensus 96 ~~~~eee~kR~el~~kFq~~L~dIq~~~ee~~~~~~k~~~eN~~L~eKlK~l~eQye~rE~~~~~~~k~keLE~Ql~~AK 175 (309)
T PF09728_consen 96 RRAREEEEKRKELSEKFQATLKDIQAQMEEQSERNIKLREENEELREKLKSLIEQYELREEHFEKLLKQKELEVQLAEAK 175 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 3344444555677888889999988776544332 22223333 33455566666667
Q ss_pred HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhcchhhHHHHHHHHhHHH
Q 047021 144 LQQESQLRQQERKQLTEEQA-------RISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAFSVEDKEKLEKQLHDMA 215 (388)
Q Consensus 144 lqqes~lrqqeqqrla~Eq~-------~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~S~eDkekle~Ql~dma 215 (388)
|++......++.++...+.. .+..+...=.+|..+|..-+.+-.+--+..+ ++.+|. .+.+.+..|.
T Consensus 176 l~q~~~~~~~e~~k~~~~~~~~l~~~~~~~~~~~~E~~Lr~QL~~Y~~Kf~efq~tL~kSNe~F~-----tfk~Emekm~ 250 (309)
T PF09728_consen 176 LEQQQEEAEQEKEKAKQEKEILLEEAAQVQTLKETEKELREQLNLYSEKFEEFQDTLNKSNEVFE-----TFKKEMEKMS 250 (309)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH-----HHHHHHHHHH
Confidence 77777777777777777777 8888888888888888888766555322222 455552 3334444444
Q ss_pred HHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 216 VVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT 286 (388)
Q Consensus 216 ~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~ 286 (388)
-.+.+||.--..+-.=.|.- |. .+-++++=-..|..++.-+-+|++.|.++.--|..
T Consensus 251 Kk~kklEKE~~~~k~k~e~~----------n~----~l~~m~eer~~~~~~~~~~~~k~~kLe~LcRaLQ~ 307 (309)
T PF09728_consen 251 KKIKKLEKENQTWKSKWEKS----------NK----ALIEMAEERQKLEKELEKLKKKIEKLEKLCRALQA 307 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHH----------hH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44666665443333222221 22 33444555566999999999999999988766654
No 28
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=92.08 E-value=30 Score=40.67 Aligned_cols=121 Identities=21% Similarity=0.230 Sum_probs=59.3
Q ss_pred HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh--hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 11 KKNAELEEKNSILSSKLQVKEVESESLQTRINEL--EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~--e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
.+.++|+++...|.-+|.....+...+..++..+ +....|+.+... .|...-+.....+.+..+++..+.
T Consensus 742 ~ri~el~~~IaeL~~~i~~l~~~l~~l~~r~~~L~~e~~~~Ps~~dL~--------~A~~~l~~A~~~~~~a~~~l~~a~ 813 (1353)
T TIGR02680 742 RRIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLR--------AAHRRAAEAERQAESAERELARAA 813 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchHHH--------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555555555555555555555555533 334556642211 111112222333444444444444
Q ss_pred HHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHH
Q 047021 89 EEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQES 148 (388)
Q Consensus 89 EEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes 148 (388)
++=-.+..+.+..+.++... +++. .++..++.+..+...+.+|...++.=.
T Consensus 814 ~~l~~a~~~~~~a~~~l~~a--------aa~l-~L~a~~~~l~~~~~aL~~y~~~l~~l~ 864 (1353)
T TIGR02680 814 RKAAAAAAAWKQARRELERD--------AADL-DLPTDPDALEAVGLALKRFGDHLHTLE 864 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHH--------HhcC-CCCCChhHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444321 1222 344466777778888877777766633
No 29
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=91.79 E-value=16 Score=36.98 Aligned_cols=166 Identities=20% Similarity=0.305 Sum_probs=92.5
Q ss_pred hhhh-HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh---HhhhhhhhhHHHHhhhhhhh---hhhHHHhhhhHHH
Q 047021 3 LSQL-KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN---ELEQNTVPSLRKALKDVAME---KDAAVVAREDLSA 75 (388)
Q Consensus 3 ~~~v-~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln---d~e~n~v~slrkaL~~v~mE---kDaA~vareDl~A 75 (388)
.++| |.|-++|..|.+.|..|...+...-..+.-|.-.|+ +|=+.....-..+..+-+-. |+.....----..
T Consensus 81 aA~iGqsLl~~N~~L~~~~~~le~~L~~~~e~v~qLrHeL~~kdeLL~~ys~~~ee~~~~~~~~~~~~~~~~~~~~~~~~ 160 (306)
T PF04849_consen 81 AARIGQSLLEQNQDLSERNEALEEQLGAALEQVEQLRHELSMKDELLQIYSNDDEESEPESSESTPLRRNESSLSSQKCI 160 (306)
T ss_pred HHHHhHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHhhhcccccCCCccccccccccccccch
Confidence 3455 778888888888888888888888777777777776 22111111111111110000 1110000111236
Q ss_pred HHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCC-CCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 76 QLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGI-SSMGISPDHVQTLEKELATLKSQLQQESQLRQQE 154 (388)
Q Consensus 76 QLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~-~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqe 154 (388)
++-.+++|+|.-|+|-+.=.+.|+-|..+-.+...+.-.- ..+. -.++....++..|..||+. +.+...+||
T Consensus 161 ~le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~~~EekEqqL-v~dcv~QL~~An~qia~LseELa~-----k~Ee~~rQQ- 233 (306)
T PF04849_consen 161 QLEALQEKLKSLEEENEQLRSEASQLKTETDTYEEKEQQL-VLDCVKQLSEANQQIASLSEELAR-----KTEENRRQQ- 233 (306)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhhccHHHHHH-HHHHHHHhhhcchhHHHHHHHHHH-----HHHHHHHHH-
Confidence 7889999999999988887777888876666554332111 1111 1445566778888888872 234444443
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 155 RKQLTEEQARISVLMSEKQELEEKLVMFS 183 (388)
Q Consensus 155 qqrla~Eq~~~a~l~aEkqeLeekl~~~~ 183 (388)
.-|.+|+++.-.|+.++..++
T Consensus 234 --------EEIt~LlsqivdlQ~r~k~~~ 254 (306)
T PF04849_consen 234 --------EEITSLLSQIVDLQQRCKQLA 254 (306)
T ss_pred --------HHHHHHHHHHHHHHHHHHHHh
Confidence 223455555555555444443
No 30
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=91.41 E-value=13 Score=39.92 Aligned_cols=90 Identities=22% Similarity=0.244 Sum_probs=47.9
Q ss_pred HHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHh---------hhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCC
Q 047021 54 KALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQY---------RAEEDAAALRAELNSLQQQAMDGSLGGISSMG 124 (388)
Q Consensus 54 kaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQy---------RAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g 124 (388)
.-|.++.-..-.|...+.+.-+++..+++.++.....-. ...---+.||.++..++.+...-+. .+|
T Consensus 237 ~~L~~l~~ql~~a~~~~~~a~a~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~i~~L~~~l~~l~~~~~~l~~----~y~ 312 (754)
T TIGR01005 237 QQLAELNTELSRARANRAAAEGTADSVKKALQNGGSLDVLPEVLSSQLKLEDLIQRLRERQAELRATIADLST----TML 312 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhhhcCcccccHHHHHHHHHHHHHHHHHHHHHH----hhC
Confidence 345555444444555555666666666666653211000 0011234566666666665432111 345
Q ss_pred CChhhHHHHHHHHHHHHHHHHHH
Q 047021 125 ISPDHVQTLEKELATLKSQLQQE 147 (388)
Q Consensus 125 ~s~~~~~~lekEl~~l~~~lqqe 147 (388)
+.+-.+..+..++++++.++.++
T Consensus 313 ~~hP~v~~l~~qi~~l~~~i~~e 335 (754)
T TIGR01005 313 ANHPRVVAAKSSLADLDAQIRSE 335 (754)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHH
Confidence 56666788888888888777653
No 31
>PRK09039 hypothetical protein; Validated
Probab=91.31 E-value=17 Score=36.43 Aligned_cols=35 Identities=26% Similarity=0.320 Sum_probs=19.2
Q ss_pred HhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHH
Q 047021 43 ELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQL 77 (388)
Q Consensus 43 d~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQL 77 (388)
+++++.--.+...|.+|.-..++|.-.|++|-+++
T Consensus 70 ~le~~~~~~l~~~l~~l~~~l~~a~~~r~~Le~~~ 104 (343)
T PRK09039 70 SLERQGNQDLQDSVANLRASLSAAEAERSRLQALL 104 (343)
T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555555566666555555555555555554433
No 32
>PRK11637 AmiB activator; Provisional
Probab=90.99 E-value=19 Score=36.39 Aligned_cols=88 Identities=15% Similarity=0.159 Sum_probs=41.7
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRL 84 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRl 84 (388)
+++++++...+++.+...+++++.....+...+...|+.+... +..+.+.++.+.=+-+...-.-.++-+.+...++.+
T Consensus 48 ~l~~l~~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~-i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l 126 (428)
T PRK11637 48 QLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRK-LRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLL 126 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566666666666555566655555555555555554332 233333443333333333333333344444444444
Q ss_pred HHHHHHHhh
Q 047021 85 KEAEEEQYR 93 (388)
Q Consensus 85 kEAEEEQyR 93 (388)
+.-=-..|+
T Consensus 127 ~~rlra~Y~ 135 (428)
T PRK11637 127 AAQLDAAFR 135 (428)
T ss_pred HHHHHHHHH
Confidence 444444455
No 33
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=90.98 E-value=27 Score=37.98 Aligned_cols=75 Identities=20% Similarity=0.315 Sum_probs=52.0
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
..+|.+.|..|+.++..|..+++....+.....++...+... +-.+......++.|+|-...-.+++.+.++.+.
T Consensus 152 ~eeL~~~~~~Le~e~~~l~~~v~~l~~eL~~~~ee~e~L~~~-~kel~~~~e~l~~E~~~L~~q~~e~~~ri~~LE 226 (546)
T PF07888_consen 152 KEELLKENEQLEEEVEQLREEVERLEAELEQEEEEMEQLKQQ-QKELTESSEELKEERESLKEQLAEARQRIRELE 226 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356777777777777777777777777777777766666654 334555666778888887777777666665554
No 34
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=90.95 E-value=35 Score=39.28 Aligned_cols=119 Identities=24% Similarity=0.295 Sum_probs=67.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhh
Q 047021 154 ERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEID 233 (388)
Q Consensus 154 eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID 233 (388)
++|+|..|. +++.+.-.+++.....+....-. ..+...-+ -.+++|+.+.-+.=+.+.+...---.+|=+.+-
T Consensus 387 e~eqLr~el---aql~a~r~q~eka~~~~ee~e~~---~l~~e~ry-~klkek~t~l~~~h~~lL~K~~di~kQle~~~~ 459 (980)
T KOG0980|consen 387 EQEQLRNEL---AQLLASRTQLEKAQVLVEEAENK---ALAAENRY-EKLKEKYTELRQEHADLLRKYDDIQKQLESAEQ 459 (980)
T ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666665 56666666666554332111000 00112222 356777776665555555544433333333222
Q ss_pred hhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 234 NQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK 283 (388)
Q Consensus 234 ~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek 283 (388)
+ |+.+-++|.+|...+-+.-..+-+||.-..+-.+..+.||-=|..
T Consensus 460 s----~~~~~~~~~~L~d~le~~~~~~~~~~~K~e~~~~~le~l~~El~~ 505 (980)
T KOG0980|consen 460 S----IDDVEEENTNLNDQLEELQRAAGRAETKTESQAKALESLRQELAL 505 (980)
T ss_pred h----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 2 337788999999999999999999998776665555555544433
No 35
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=90.65 E-value=37 Score=39.00 Aligned_cols=228 Identities=23% Similarity=0.265 Sum_probs=116.8
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHH--------HHHhhHhhhhhhhhHHHHhhh---hhhhh--------hhH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESL--------QTRINELEQNTVPSLRKALKD---VAMEK--------DAA 66 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L--------~~~Lnd~e~n~v~slrkaL~~---v~mEk--------DaA 66 (388)
-.++.++|++||+.++.|.-+-+.-..+++-. ++.....|++.--.|.|-|.. |-++| .+-
T Consensus 319 keNy~kGqaELerRRq~leeqqqreree~eqkEreE~ekkererqEqErk~qlElekqLerQReiE~qrEEerkkeie~r 398 (1118)
T KOG1029|consen 319 KENYEKGQAELERRRQALEEQQQREREEVEQKEREEEEKKERERQEQERKAQLELEKQLERQREIERQREEERKKEIERR 398 (1118)
T ss_pred HHhHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35789999999999998876544333333222 223334444444444444421 11111 011
Q ss_pred HHhhhhHHHHHHH--HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021 67 VVAREDLSAQLRT--LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL 144 (388)
Q Consensus 67 ~vareDl~AQLR~--~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l 144 (388)
-.||++|--|-+. .+-|.+|--..--|--|+--.+.|-.+.|++...+ ..+.++.|+..+.|+...+
T Consensus 399 Eaar~ElEkqRqlewErar~qem~~Qk~reqe~iv~~nak~~ql~~elet-----------Ln~k~qqls~kl~Dvr~~~ 467 (1118)
T KOG1029|consen 399 EAAREELEKQRQLEWERARRQEMLNQKNREQEWIVYLNAKKKQLQQELET-----------LNFKLQQLSGKLQDVRVDI 467 (1118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHhhhhhhheecc
Confidence 1234444433332 23333333333334445555555555566555422 2334444555555444433
Q ss_pred HHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhcchhhHHHHHH
Q 047021 145 QQE--------------SQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAFSVEDKEKLEK 209 (388)
Q Consensus 145 qqe--------------s~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~S~eDkekle~ 209 (388)
... -.-+.+.++|+.+=|..+-.|.-|+|.|..++-..-..+-+..+... -+.+| ..|+-+.+
T Consensus 468 tt~kt~ie~~~~q~e~~isei~qlqarikE~q~kl~~l~~Ekq~l~~qlkq~q~a~~~~~~~~s~L~aa~--~~ke~irq 545 (1118)
T KOG1029|consen 468 TTQKTEIEEVTKQRELMISEIDQLQARIKELQEKLQKLAPEKQELNHQLKQKQSAHKETTQRKSELEAAR--RKKELIRQ 545 (1118)
T ss_pred chHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHhhhhccCcchHHHHHHHHH--HHHHHHHH
Confidence 221 11112567777778888888888888888888776655554433221 13333 55777788
Q ss_pred HHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhh
Q 047021 210 QLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSL 249 (388)
Q Consensus 210 Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~L 249 (388)
+|.|-++.|+.=. .+=|.|||+=-.-.+.|-++-+..
T Consensus 546 ~ikdqldelskE~---esk~~eidi~n~qlkelk~~~~~q 582 (1118)
T KOG1029|consen 546 AIKDQLDELSKET---ESKLNEIDIFNNQLKELKEDVNSQ 582 (1118)
T ss_pred HHHHHHHHHHHHH---HHHHHhhhhHHHHHHHHHHHHHHH
Confidence 8777666555322 223556665444444444443333
No 36
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=90.45 E-value=15 Score=34.37 Aligned_cols=93 Identities=27% Similarity=0.428 Sum_probs=53.7
Q ss_pred HHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH--
Q 047021 75 AQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ-- 152 (388)
Q Consensus 75 AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq-- 152 (388)
.+.|.+.+++++.+.+=|+ ++.+++.|++-++..+... ...|..++..+...++..-.-.+
T Consensus 82 ~~~r~~~~klk~~~~el~k-------~~~~l~~L~~L~~dknL~e----------ReeL~~kL~~~~~~l~~~~~ki~~L 144 (194)
T PF15619_consen 82 EQERELERKLKDKDEELLK-------TKDELKHLKKLSEDKNLAE----------REELQRKLSQLEQKLQEKEKKIQEL 144 (194)
T ss_pred HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHcCCchh----------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667777788887777776 6777888888775544322 34455555555555444222111
Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021 153 ---------QERKQLTEEQARISVLMSEKQELEEKLVMFSR 184 (388)
Q Consensus 153 ---------qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~ 184 (388)
--+..++.|...+..++.+.+.|...+.-+..
T Consensus 145 ek~leL~~k~~~rql~~e~kK~~~~~~~~~~l~~ei~~L~~ 185 (194)
T PF15619_consen 145 EKQLELENKSFRRQLASEKKKHKEAQEEVKSLQEEIQRLNQ 185 (194)
T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 13444556666666666666666655555443
No 37
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=90.10 E-value=18 Score=34.71 Aligned_cols=29 Identities=17% Similarity=0.264 Sum_probs=18.9
Q ss_pred HHHHHHHhhHHHHhHHHHHHHHHHHHHHH
Q 047021 327 KGQLAKEQSRAEALSAQVLQLSAQLQQTT 355 (388)
Q Consensus 327 K~eL~kEqsRseaLsaqv~qLsAeL~ra~ 355 (388)
..+-+...-++.-|--|+..|..+|..+.
T Consensus 174 EeeR~t~~EKnk~lq~QL~~L~~EL~~~k 202 (246)
T PF00769_consen 174 EEERVTYAEKNKRLQEQLKELKSELEQLK 202 (246)
T ss_dssp GGC---HHHH-HHHHHHHHHHHHHHHTTB
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHh
Confidence 34445566677788889999999997665
No 38
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=90.10 E-value=22 Score=36.12 Aligned_cols=84 Identities=31% Similarity=0.347 Sum_probs=50.9
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
.++..||+|+..||++|..|.++......+-..++ +-|+..| .|.+.||+.+...
T Consensus 160 ~~le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~~~E----ekEqqLv---------------------~dcv~QL~~An~q 214 (306)
T PF04849_consen 160 IQLEALQEKLKSLEEENEQLRSEASQLKTETDTYE----EKEQQLV---------------------LDCVKQLSEANQQ 214 (306)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhhcc----HHHHHHH---------------------HHHHHHhhhcchh
Confidence 46788999999999999999886554333332221 1222222 2456666666666
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
+..--+|=-+--||...-.-||-+|+.|.
T Consensus 215 ia~LseELa~k~Ee~~rQQEEIt~Llsqi 243 (306)
T PF04849_consen 215 IASLSEELARKTEENRRQQEEITSLLSQI 243 (306)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 66666666666666655555555555443
No 39
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=88.79 E-value=6.2 Score=42.65 Aligned_cols=23 Identities=17% Similarity=0.333 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh
Q 047021 163 ARISVLMSEKQELEEKLVMFSRK 185 (388)
Q Consensus 163 ~~~a~l~aEkqeLeekl~~~~~~ 185 (388)
.-+..|.+|-+.|...+..+...
T Consensus 566 ~~l~~L~~En~~L~~~l~~le~~ 588 (722)
T PF05557_consen 566 STLEALQAENEDLLARLRSLEEG 588 (722)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTT
T ss_pred HHHHHHHHHHHHHHHHHHhcccC
Confidence 44567777888888777776544
No 40
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=88.53 E-value=0.14 Score=56.64 Aligned_cols=304 Identities=27% Similarity=0.347 Sum_probs=0.0
Q ss_pred HHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhh
Q 047021 14 AELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYR 93 (388)
Q Consensus 14 ~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyR 93 (388)
++|-.++.-++......+-++..|..+++|+-.+. .++.|+ |.-+--.+-.+-+||..++.++.+.+-.--.
T Consensus 134 ~eL~eqle~lqk~k~~lEK~k~~l~~e~~dL~~~l-~~~~k~-------k~~~Ek~~K~lE~qL~El~~klee~er~~~e 205 (859)
T PF01576_consen 134 AELNEQLEQLQKQKAKLEKEKSQLEAELDDLQAQL-DSLQKA-------KQEAEKKRKQLEAQLNELQAKLEESERQRNE 205 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH-HHHHHH-------HHHHHhHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555555555555555555543322 333333 3333333555677888888888887776666
Q ss_pred hHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 94 AEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQ 173 (388)
Q Consensus 94 AEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkq 173 (388)
.....+-|.+|+..|..+......... . ..-....|+..|.+++.+|..++..+.....++..-..-+..|.....
T Consensus 206 l~~~k~kL~~E~~eL~~qLee~e~~~~-~---l~r~k~~L~~qLeelk~~leeEtr~k~~L~~~l~~le~e~~~L~eqle 281 (859)
T PF01576_consen 206 LTEQKAKLQSENSELTRQLEEAESQLS-Q---LQREKSSLESQLEELKRQLEEETRAKQALEKQLRQLEHELEQLREQLE 281 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-H---HHHHHHHHHHHHHhhHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHHHh
Confidence 667777888888888888754432211 1 222344578888888888888776655433322211111111211111
Q ss_pred HHHHHHHHHhhhchhhhhhhhhhhh-c---------c-hhhHHHHHH-------HHhHHHHHHHHHHhhHHHHHHHhhhh
Q 047021 174 ELEEKLVMFSRKASEEVSNAAASKA-F---------S-VEDKEKLEK-------QLHDMAVVVERLESSRQKLLMEIDNQ 235 (388)
Q Consensus 174 eLeekl~~~~~~~se~~~~~a~~k~-~---------S-~eDkekle~-------Ql~dma~~VeRLE~SRqKLL~EID~Q 235 (388)
+-.+....+.+.-+....+.+.++. | . .+-|-+|.+ ++.+.-..+..|+-.+.+|-.||+-=
T Consensus 282 eE~e~k~~l~~qlsk~~~El~~~k~K~e~e~~~~~EelEeaKKkL~~~L~el~e~le~~~~~~~~LeK~k~rL~~EleDl 361 (859)
T PF01576_consen 282 EEEEAKSELERQLSKLNAELEQWKKKYEEEAEQRTEELEEAKKKLERKLQELQEQLEEANAKVSSLEKTKKRLQGELEDL 361 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred hhhhhHHHHHHHHHHHhhHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1111111111111111111111111 0 0 122334444 44444555667777777777777766
Q ss_pred hHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCC
Q 047021 236 SSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNG 315 (388)
Q Consensus 236 S~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~ 315 (388)
..++|+....++ .++.-.+-.=+++.+++.-++.+..|-.. ....
T Consensus 362 ~~eLe~~~~~~~--------------~LeKKqr~fDk~l~e~k~~~~~~~~e~d~---------------------~q~e 406 (859)
T PF01576_consen 362 TSELEKAQAAAA--------------ELEKKQRKFDKQLAEWKAKVEELQAERDA---------------------AQRE 406 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHH--------------HHHHHHHhHHHHHHHHHHHHHHHHHHHHH---------------------HHHH
Confidence 666666544444 44444444445555555544433321111 1122
Q ss_pred CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHH
Q 047021 316 SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRL 364 (388)
Q Consensus 316 ~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rl 364 (388)
...+.|++.+|+..|-......+.|.-+...|..+|.-.+-....-++.
T Consensus 407 ~r~~~te~~~Lk~~lee~~e~~e~lere~k~L~~El~dl~~q~~~~~k~ 455 (859)
T PF01576_consen 407 ARELETELFKLKNELEELQEQLEELERENKQLQDELEDLTSQLDDAGKS 455 (859)
T ss_dssp -------------------------------------------------
T ss_pred hHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccchhhhhhhccc
Confidence 5567788999999998888889999999999999988877666655554
No 41
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=88.03 E-value=45 Score=36.49 Aligned_cols=125 Identities=26% Similarity=0.418 Sum_probs=87.7
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
+.|+.+++++.+++....-+...++..++.--.|+..|++ +.++|.+|.=+- +++..-|+.++|
T Consensus 347 ~~vr~~e~eL~el~~~~~~i~~~~~~~~~~yS~lq~~l~~--------~~~~l~~i~~~q-------~~~~e~L~~Lrk- 410 (570)
T COG4477 347 GSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEE--------IEKALTDIEDEQ-------EKVQEHLTSLRK- 410 (570)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHH--------HHHHHHHHhhhH-------HHHHHHHHHHHH-
Confidence 4678888888888888888888888888888888877664 567776665442 345555555544
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC-----CCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS-----SMGISPDHVQTLEKELATLKSQLQQESQL 150 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~-----~~g~s~~~~~~lekEl~~l~~~lqqes~l 150 (388)
+--.|.+++-.++..|-.+-+-+-.....|+| -+-...++++++.+++.+.-....+-+.+
T Consensus 411 ------dEl~Are~l~~~~~~l~eikR~mek~nLPGlPe~~l~l~~~~~~~i~~l~~eLse~pinm~~v~~~ 476 (570)
T COG4477 411 ------DELEARENLERLKSKLHEIKRYMEKSNLPGLPETFLSLFFTAGHEIQDLMKELSEVPINMEAVSAL 476 (570)
T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCcHHHHHHHHhhhhHHHHHHHHHhhcCCcHHHHHHH
Confidence 45567889999999999999999888888886 12234455666666666554444444444
No 42
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=87.91 E-value=49 Score=36.73 Aligned_cols=179 Identities=31% Similarity=0.415 Sum_probs=90.2
Q ss_pred HHHHhHHhhhhHHHHHHhhHHHHHHhhHh------hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021 16 LEEKNSILSSKLQVKEVESESLQTRINEL------EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE 89 (388)
Q Consensus 16 le~~n~~L~sk~~~k~~~~~~L~~~Lnd~------e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE 89 (388)
|....+-+.+.|+.+..++|.|+++++.| |+.+.-+|+|=| ..|++ +|--+=+||...+|+-+++|+
T Consensus 451 l~~~Er~lk~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LEkrL---~eE~~----~R~~lEkQL~eErk~r~~ee~ 523 (697)
T PF09726_consen 451 LTNNERSLKSELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLEKRL---AEERR----QRASLEKQLQEERKARKEEEE 523 (697)
T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHH----HHHHHHHHHHHHHHHHhHHHH
Confidence 33333455566777777777777777744 333344444444 23333 345566777777776666443
Q ss_pred HHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChh----hHHHHHHHHHHHHHHHHH-HHHHHHHHHHH--HH---
Q 047021 90 EQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPD----HVQTLEKELATLKSQLQQ-ESQLRQQERKQ--LT--- 159 (388)
Q Consensus 90 EQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~----~~~~lekEl~~l~~~lqq-es~lrqqeqqr--la--- 159 (388)
+ +.|+=- . ..+.-+...| ..+.||.|+..|..+|+. +..++..+.+. +.
T Consensus 524 --------~-aar~~~---------~---~~~~r~e~~e~~r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~ 582 (697)
T PF09726_consen 524 --------K-AARALA---------Q---AQATRQECAESCRQRRRQLESELKKLRRELKQKEEQIRELESELQELRKYE 582 (697)
T ss_pred --------h-hhhccc---------c---chhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1 222200 0 0111122233 356688888887777766 33333333221 11
Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHH--HHHhHHHHHHHHHHhhHHHHHHHhhhhh
Q 047021 160 EE-QARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLE--KQLHDMAVVVERLESSRQKLLMEIDNQS 236 (388)
Q Consensus 160 ~E-q~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle--~Ql~dma~~VeRLE~SRqKLL~EID~QS 236 (388)
.| ...+-.||+..+.|.+|-+.+ .+.||.|.|=||+ .-|+|-.+.+|=+++ -|.++-
T Consensus 583 ~e~~~~~e~L~~aL~amqdk~~~L-------------E~sLsaEtriKldLfsaLg~akrq~ei~~~-------~~~~~d 642 (697)
T PF09726_consen 583 KESEKDTEVLMSALSAMQDKNQHL-------------ENSLSAETRIKLDLFSALGDAKRQLEIAQG-------QLRKKD 642 (697)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHH-------------HHhhhHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHH
Confidence 11 223344555544444443332 4557788887776 334444444443333 245566
Q ss_pred HHHHHH
Q 047021 237 SEIERL 242 (388)
Q Consensus 237 ~EIErL 242 (388)
.||+-|
T Consensus 643 ~ei~~l 648 (697)
T PF09726_consen 643 KEIEEL 648 (697)
T ss_pred HHHHHH
Confidence 777654
No 43
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=87.33 E-value=64 Score=37.42 Aligned_cols=218 Identities=22% Similarity=0.333 Sum_probs=110.6
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh-hhhHHHHHHHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA-REDLSAQLRTLKKR 83 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va-reDl~AQLR~~kkR 83 (388)
++..+.++..++|.......+++.....+.......+.-.+ +.+..++...+....+...+.-. ++.+-++|..+...
T Consensus 615 ~l~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~ 693 (1201)
T PF12128_consen 615 QLQSAEERQEELEKQLKQINKKIEELKREITQAEQELKQAE-QDLQRLKNEREQLKQEIEEAKEERKEQIEEQLNELEEE 693 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666677777777777777776666666666655443 34455555555555555544444 23333333333333
Q ss_pred hHHHHHHHhhhHh-------------------hHHHHHHHHHHHHHHHhcC--------------CCCCCCCCCCChhhH
Q 047021 84 LKEAEEEQYRAEE-------------------DAAALRAELNSLQQQAMDG--------------SLGGISSMGISPDHV 130 (388)
Q Consensus 84 lkEAEEEQyRAEE-------------------DaAaLRAEln~lQqq~~~~--------------~~s~~~~~g~s~~~~ 130 (388)
++....+.-...+ -.+.+.+.+..+.++.... -...+.+-||.+..+
T Consensus 694 l~~~~~e~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~d~~i~~i~~~i~~~~~~~~~~~~~le~~~~~eL~~~GvD~~~I 773 (1201)
T PF12128_consen 694 LKQLKQELEELLEELKEQLKELRNELKAQWQELEAELDEQIEQIKQEIAAAKQEAKEQLKELEQQYNQELAGKGVDPERI 773 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHH
Confidence 3322222211111 1122223333333322211 112344789999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHH
Q 047021 131 QTLEKELATLKSQLQQESQLRQ---QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKL 207 (388)
Q Consensus 131 ~~lekEl~~l~~~lqqes~lrq---qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekl 207 (388)
..+++++..+..+|+.-..-+. .=+.=+...-.....+..++..++.++..+......--..+...+.==...+..+
T Consensus 774 ~~l~~~i~~L~~~l~~ie~~r~~V~eY~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~l 853 (1201)
T PF12128_consen 774 QQLKQEIEQLEKELKRIEERRAEVIEYEDWLQEEWDKVDELREEKPELEEQLRDLEQELQELEQELNQLQKEVKQRRKEL 853 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999988544433 2222223333334455556666666666665544443333321111113444555
Q ss_pred HHHHhHHHHHHHHHHh
Q 047021 208 EKQLHDMAVVVERLES 223 (388)
Q Consensus 208 e~Ql~dma~~VeRLE~ 223 (388)
++.++.+.....+++.
T Consensus 854 e~~~~~~~~~~~~~~~ 869 (1201)
T PF12128_consen 854 EEELKALEEQLEQLEE 869 (1201)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5555555544444443
No 44
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=86.55 E-value=27 Score=34.50 Aligned_cols=53 Identities=19% Similarity=0.308 Sum_probs=31.7
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhh
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVA 60 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~ 60 (388)
..|+.....|..+...|...++....-...+..+.+.+.... ..|+....++.
T Consensus 152 ~~L~~~~~~L~~D~~~L~~~~~~l~~~~~~l~~~~~~L~~e~-~~Lk~~~~e~~ 204 (325)
T PF08317_consen 152 EGLEENLELLQEDYAKLDKQLEQLDELLPKLRERKAELEEEL-ENLKQLVEEIE 204 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhh
Confidence 345566666777777777777666666666666666655433 25555554433
No 45
>PRK09039 hypothetical protein; Validated
Probab=86.34 E-value=39 Score=33.99 Aligned_cols=25 Identities=20% Similarity=0.311 Sum_probs=13.5
Q ss_pred HHHHHHHhHHH-HHHHHHHhhHHHHH
Q 047021 205 EKLEKQLHDMA-VVVERLESSRQKLL 229 (388)
Q Consensus 205 ekle~Ql~dma-~~VeRLE~SRqKLL 229 (388)
+.|...|...- .-|..|+.-|..+.
T Consensus 175 ~~L~~~L~~a~~~~~~~l~~~~~~~~ 200 (343)
T PRK09039 175 ADLGRRLNVALAQRVQELNRYRSEFF 200 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 44444444432 33666777776663
No 46
>KOG4807 consensus F-actin binding protein, regulates actin cytoskeletal organization [Cytoskeleton]
Probab=86.27 E-value=52 Score=35.32 Aligned_cols=59 Identities=24% Similarity=0.343 Sum_probs=35.3
Q ss_pred hhHHHHHHHhhhhhHHHHHHhh-------hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 223 SSRQKLLMEIDNQSSEIERLFE-------ENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGIL 281 (388)
Q Consensus 223 ~SRqKLL~EID~QS~EIErLf~-------ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lL 281 (388)
+-|..-|.|+.+=--|+|=|++ ||..|...+-+-+-.-.+--+--|+-+.-|-+|-+-|
T Consensus 414 aLRrQyleelqsvqRELeVLSEQYSQKCLEnahLaqalEaerqaLRqCQrEnQELnaHNQELnnRL 479 (593)
T KOG4807|consen 414 ALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRL 479 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHhhHH
Confidence 3344455666555566666654 7888877776655555444444456666666666644
No 47
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=86.09 E-value=66 Score=36.34 Aligned_cols=103 Identities=16% Similarity=0.295 Sum_probs=61.9
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhh------------
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVARED------------ 72 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareD------------ 72 (388)
.+..|++.+..|+.++.+|.+.+.+....++-+.+.|-.-...+..+=-| |++=..-++|-|
T Consensus 239 ki~~lEr~l~~le~Ei~~L~~~~~~~~~~r~~~~k~le~~~s~~~~mK~k------~d~~~~eL~rk~~E~~~~qt~l~~ 312 (775)
T PF10174_consen 239 KIASLERMLRDLEDEIYRLRSRGELSEADRDRLDKQLEVYKSHSLAMKSK------MDRLKLELSRKKSELEALQTRLET 312 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccchHHHHHHHHHHHhhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46678888888888888888888887777777777776555444432212 333333344333
Q ss_pred HHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 73 LSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 73 l~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
+.-+.--++.++--..+.=-+++..+.-|.+++..|.....
T Consensus 313 ~~~~~~d~r~hi~~lkesl~~ke~~~~~Lqsdve~Lr~rle 353 (775)
T PF10174_consen 313 LEEQDSDMRQHIEVLKESLRAKEQEAEMLQSDVEALRFRLE 353 (775)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 33333334555554555555666666666666666665553
No 48
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=83.07 E-value=21 Score=31.71 Aligned_cols=80 Identities=31% Similarity=0.409 Sum_probs=52.7
Q ss_pred CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhh
Q 047021 125 ISPDHVQTLEKELATLKSQLQQESQLRQQERKQLT---------EEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAA 195 (388)
Q Consensus 125 ~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla---------~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~ 195 (388)
++++++..|..++.++..++..-..-.......|+ +=...+..|..+..+|+.+|..+.....
T Consensus 69 ~s~eel~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L~~~~t~~el~~~i~~l~~e~~~l~~kL~~l~~~~~-------- 140 (169)
T PF07106_consen 69 PSPEELAELDAEIKELREELAELKKEVKSLEAELASLSSEPTNEELREEIEELEEEIEELEEKLEKLRSGSK-------- 140 (169)
T ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------
Confidence 57888999999999998888774333333333333 2345567788888888888877764222
Q ss_pred hhhcchhhHHHHHHHHhHH
Q 047021 196 SKAFSVEDKEKLEKQLHDM 214 (388)
Q Consensus 196 ~k~~S~eDkekle~Ql~dm 214 (388)
..|.+|+.++.+..+.+
T Consensus 141 --~vs~ee~~~~~~~~~~~ 157 (169)
T PF07106_consen 141 --PVSPEEKEKLEKEYKKW 157 (169)
T ss_pred --CCCHHHHHHHHHHHHHH
Confidence 26678888887765443
No 49
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=80.18 E-value=65 Score=31.88 Aligned_cols=107 Identities=20% Similarity=0.312 Sum_probs=74.2
Q ss_pred hhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 92 YRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSE 171 (388)
Q Consensus 92 yRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aE 171 (388)
-+..+--++|..++..|++.... +-...++.+..+..+|..++..+..-.....+.+.++..=...+..+..+
T Consensus 180 ~~l~~~~~~L~~e~~~Lk~~~~e-------~~~~D~~eL~~lr~eL~~~~~~i~~~k~~l~el~~el~~l~~~i~~~~~~ 252 (325)
T PF08317_consen 180 PKLRERKAELEEELENLKQLVEE-------IESCDQEELEALRQELAEQKEEIEAKKKELAELQEELEELEEKIEELEEQ 252 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh-------hhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445567889999999887643 22347788999999999999888876666666666666666667777788
Q ss_pred HHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHh
Q 047021 172 KQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLH 212 (388)
Q Consensus 172 kqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~ 212 (388)
|+++...|+.+.+.- + ..+.+|..|-.+|...++
T Consensus 253 k~~l~~eI~e~~~~~-----~--~~r~~t~~Ev~~Lk~~~~ 286 (325)
T PF08317_consen 253 KQELLAEIAEAEKIR-----E--ECRGWTRSEVKRLKAKVD 286 (325)
T ss_pred HHHHHHHHHHHHHHH-----H--HhcCCCHHHHHHHHHHHH
Confidence 888877777544321 1 245567777777766643
No 50
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=79.21 E-value=72 Score=31.85 Aligned_cols=45 Identities=22% Similarity=0.355 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHH
Q 047021 99 AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQE 147 (388)
Q Consensus 99 AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqe 147 (388)
..|+.+|..++.+...-. ..+|+.+-.+..+..++++++.++..+
T Consensus 257 ~~l~~~l~~le~~l~~l~----~~y~~~hP~v~~l~~~i~~l~~~l~~e 301 (444)
T TIGR03017 257 QNLKTDIARAESKLAELS----QRLGPNHPQYKRAQAEINSLKSQLNAE 301 (444)
T ss_pred HHHHHHHHHHHHHHHHHH----HHhCCCCcHHHHHHHHHHHHHHHHHHH
Confidence 457777777776663321 134556666777888888777777554
No 51
>PRK03918 chromosome segregation protein; Provisional
Probab=79.07 E-value=1e+02 Score=33.51 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021 157 QLTEEQARISVLMSEKQELEEKLVMFSR 184 (388)
Q Consensus 157 rla~Eq~~~a~l~aEkqeLeekl~~~~~ 184 (388)
++..-..+...+..+..+|++.+..+..
T Consensus 406 ~i~~l~~~~~~~~~~i~eL~~~l~~L~~ 433 (880)
T PRK03918 406 EISKITARIGELKKEIKELKKAIEELKK 433 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444455556777777777777776653
No 52
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=78.29 E-value=11 Score=40.28 Aligned_cols=76 Identities=25% Similarity=0.417 Sum_probs=49.3
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
|+.+++++.++.+.|..+|+.|.. +|+.|.+|.++.+++. ..||... |
T Consensus 64 lva~~k~~r~~~~~l~~~N~~l~~-------eN~~L~~r~~~id~~i----~~av~~~----------~----------- 111 (472)
T TIGR03752 64 LVAEVKELRKRLAKLISENEALKA-------ENERLQKREQSIDQQI----QQAVQSE----------T----------- 111 (472)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhhhhHHHHH----HHHHHhh----------h-----------
Confidence 578899999999999999988875 4556677766666553 3333211 1
Q ss_pred HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
++...++....++-+.+...|..||.|.
T Consensus 112 ---~~~~~~~~ql~~~~~~~~~~l~~l~~~l 139 (472)
T TIGR03752 112 ---QELTKEIEQLKSERQQLQGLIDQLQRRL 139 (472)
T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2222333333444556788888898887
No 53
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=77.58 E-value=1.2e+02 Score=33.69 Aligned_cols=46 Identities=22% Similarity=0.314 Sum_probs=35.5
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV 49 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v 49 (388)
+.|..|++.|.+|+..+..|....+.--.-..+|+++|.+..+...
T Consensus 460 ~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LEkrL~eE~~~R~ 505 (697)
T PF09726_consen 460 SELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLEKRLAEERRQRA 505 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4577888888888888888887777777777888888887665543
No 54
>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown]
Probab=77.37 E-value=2e+02 Score=35.87 Aligned_cols=155 Identities=21% Similarity=0.244 Sum_probs=95.6
Q ss_pred hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhh-------hhhHHHHHHhhhhhhhhhhHHHHHHHH-------HHHHHHH
Q 047021 202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEID-------NQSSEIERLFEENSSLSSSYQESMGIA-------KHWENQV 267 (388)
Q Consensus 202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID-------~QS~EIErLf~ENs~Lsss~qea~~ia-------aqWEnQv 267 (388)
.++.++.+|+.....-++|+...=.-...|++ .|.++++|+-+.++.++..|+....=- ..-..|.
T Consensus 391 s~~~~~qqqle~~~lele~~~~~l~s~~eev~~~~p~lk~qr~~~e~~~~~~~~l~~el~~~~q~~~~~e~~~~~l~~~~ 470 (1822)
T KOG4674|consen 391 SKYSKLQQQLESLKLELERLQNILSSFKEEVKQKAPILKEQRSELERMQETKAELSEELDFSNQKIQKLEKELESLKKQL 470 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67889999999999999999988777777765 578899999999999999888776433 3344566
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCC-ccccCCCchhhh-hHHhhHHHHHHHhhHHHHhHHHHH
Q 047021 268 KDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKD-AVSQNGSETHKN-EFLALKGQLAKEQSRAEALSAQVL 345 (388)
Q Consensus 268 qDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~d-g~~~~~~~~~~t-e~l~LK~eL~kEqsRseaLsaqv~ 345 (388)
.||--||-.|+--.-+|-.+=..++..-.....|.-...+.+ -..+++++.+.+ ++..+++-.--...=.+=|+ .|+
T Consensus 471 ~~~~renk~l~~~~sdlsrqv~~Ll~el~e~~~~~~~~~~s~~~~~es~S~~iIse~Lv~F~nI~eLqekN~eLL~-~vR 549 (1822)
T KOG4674|consen 471 NDLERENKLLEQQISDLSRQVNVLLLELDELRKGSKITVSSDSTENESDSEEIISERLVEFSNINELQEKNVELLN-AVR 549 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccCccHHHHHHHHHHHhccHHHHHHHHHHHHH-HHH
Confidence 777777777765554443322222211111111110011111 122334555555 56666665443333444444 778
Q ss_pred HHHHHHHHHHHH
Q 047021 346 QLSAQLQQTTQA 357 (388)
Q Consensus 346 qLsAeL~ra~QA 357 (388)
-|+..+....+-
T Consensus 550 ~Lae~lE~~E~~ 561 (1822)
T KOG4674|consen 550 ELAEKLEAAEKT 561 (1822)
T ss_pred HHHHHHHHHhhh
Confidence 888777777665
No 55
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=76.47 E-value=37 Score=36.65 Aligned_cols=108 Identities=21% Similarity=0.272 Sum_probs=80.9
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
..+++|..+..+++.+...++.+.-.+|-....+..++++++++.-..+.+.+..+.-+-+++..-...|.+++..+++|
T Consensus 288 ~~i~~L~~~l~~l~~~~~~l~~~y~~~hP~v~~l~~qi~~l~~~i~~e~~~~~~~~~~~~~~a~~~~~~L~~~l~~~~~~ 367 (754)
T TIGR01005 288 DLIQRLRERQAELRATIADLSTTMLANHPRVVAAKSSLADLDAQIRSELQKITKSLLMQADAAQARESQLVSDVNQLKAA 367 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45788888899999999999999999999999999999999998877777777766666666665566888888888887
Q ss_pred hHHH---HHHHhhhHhhHHHHHHHHHHHHHH
Q 047021 84 LKEA---EEEQYRAEEDAAALRAELNSLQQQ 111 (388)
Q Consensus 84 lkEA---EEEQyRAEEDaAaLRAEln~lQqq 111 (388)
+... +-+-.+-+-|+.+.+.-.+.+.++
T Consensus 368 ~~~~~~~~~e~~~L~Re~~~~~~~Y~~ll~r 398 (754)
T TIGR01005 368 SAQAGEQQVDLDALQRDAAAKRQLYESYLTN 398 (754)
T ss_pred HHhCcHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7543 444444556666666665555443
No 56
>PF14992 TMCO5: TMCO5 family
Probab=76.36 E-value=91 Score=31.49 Aligned_cols=158 Identities=26% Similarity=0.321 Sum_probs=95.4
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA 87 (388)
+|+++.+.|.+.|+.+..||++++-.+.+|.+++--.+...-.+ .=.+.|++++..| +++-
T Consensus 8 dle~d~Q~ldE~Nq~lL~ki~~~E~~iq~Le~Eit~~~~~~~~~--e~e~~~~~~~e~~-----------------l~~l 68 (280)
T PF14992_consen 8 DLEKDEQRLDEANQSLLQKIQEKEGAIQSLEREITKMDHIADRS--EEEDIISEERETD-----------------LQEL 68 (280)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccCch--hHHhhhhhchHHH-----------------HHHH
Confidence 57888999999999999999999999999999998666555555 3346666666544 2344
Q ss_pred HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCC---CCCCCCCChh-----------hHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLG---GISSMGISPD-----------HVQTLEKELATLKSQLQQESQLRQQ 153 (388)
Q Consensus 88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s---~~~~~g~s~~-----------~~~~lekEl~~l~~~lqqes~lrqq 153 (388)
|.+.-+-|.-+..+---+..||........- .-+++..+++ ....++++++.+....++-..+--.
T Consensus 69 e~e~~~LE~~ne~l~~~~~elq~k~~e~~~~~~~e~~~~~~~lq~sk~~lqql~~~~~~qE~ei~kve~d~~~v~~l~eD 148 (280)
T PF14992_consen 69 ELETAKLEKENEHLSKSVQELQRKQDEQETNVQCEDPQLSQSLQFSKNKLQQLLESCASQEKEIAKVEDDYQQVHQLCED 148 (280)
T ss_pred HhhhHHHhhhhHhhhhhhhhhhhhhccccCCCCCCccchhcccHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5555666666666766667777775432211 1112222222 2445777777777666554333211
Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 047021 154 ---ERKQLTEEQARISVLMSEKQELEEKLVMFSRK 185 (388)
Q Consensus 154 ---eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~ 185 (388)
+-..+-+-..++-. .-|+--|+.++.+.-..
T Consensus 149 q~~~i~klkE~L~rmE~-ekE~~lLe~el~k~q~~ 182 (280)
T PF14992_consen 149 QANEIKKLKEKLRRMEE-EKEMLLLEKELSKYQMQ 182 (280)
T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhch
Confidence 22233333333333 44555667666665443
No 57
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=75.58 E-value=14 Score=34.91 Aligned_cols=54 Identities=22% Similarity=0.398 Sum_probs=25.1
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH---hhhhhhhhHHHHhhh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE---LEQNTVPSLRKALKD 58 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd---~e~n~v~slrkaL~~ 58 (388)
.+..|.+....|+..|+.+.+.+...+.++.+|+.++++ ..+..+|.+.+.++.
T Consensus 57 e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~ 113 (251)
T PF11932_consen 57 EYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDE 113 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444444555555444442 223444555444443
No 58
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=75.47 E-value=22 Score=34.45 Aligned_cols=56 Identities=27% Similarity=0.296 Sum_probs=50.3
Q ss_pred hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhH
Q 047021 198 AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSY 253 (388)
Q Consensus 198 ~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~ 253 (388)
.--.+|.++|+..+..-..-.+..++...-|..=+|.=.-|++||-+||+.|-+.+
T Consensus 154 ~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e~~~~EydrLlee~~~Lq~~i 209 (216)
T KOG1962|consen 154 DKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSEGLQDEYDRLLEEYSKLQEQI 209 (216)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHH
Confidence 33468999999999999999999999999999999999999999999999987654
No 59
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=75.09 E-value=93 Score=30.99 Aligned_cols=194 Identities=25% Similarity=0.252 Sum_probs=100.7
Q ss_pred hHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 34 SESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 34 ~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
|.-|.+.+.|.|..|....|+-|..=-+=+-+.-+-.---..+|-.++..|++.++ +.+.+.+.|..++..|.+..-
T Consensus 22 N~~L~~~IqdtE~st~~~Vr~lLqqy~~~~~~i~~le~~~~~~l~~ak~eLqe~ee---k~e~~l~~Lq~ql~~l~akI~ 98 (258)
T PF15397_consen 22 NKELIKEIQDTEDSTALKVRKLLQQYDIYRTAIDILEYSNHKQLQQAKAELQEWEE---KEESKLSKLQQQLEQLDAKIQ 98 (258)
T ss_pred hHHHHHHHHhHHhhHHHHHHHHHHHHHHHHHHHHHHHccChHHHHHHHHHHHHHHH---HHHhHHHHHHHHHHHHHHHHH
Confidence 44455555666777777777766443333332222222223456677777777776 466677777777777666553
Q ss_pred cC-----CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhch
Q 047021 114 DG-----SLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARI-SVLMSEKQELEEKLVMFSRKAS 187 (388)
Q Consensus 114 ~~-----~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~-a~l~aEkqeLeekl~~~~~~~s 187 (388)
-. ..++|- . .+...---.|++|.-++++-.-.-+.|..-|-+--... .++....+.-.+
T Consensus 99 k~~~el~~L~TYk---D--~EYPvK~vqIa~L~rqlq~lk~~qqdEldel~e~~~~el~~l~~~~q~k~~---------- 163 (258)
T PF15397_consen 99 KTQEELNFLSTYK---D--HEYPVKAVQIANLVRQLQQLKDSQQDELDELNEMRQMELASLSRKIQEKKE---------- 163 (258)
T ss_pred HHHHHHHHHHHHh---h--hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------
Confidence 22 233331 1 11111122456666666554443333333232211111 122222222222
Q ss_pred hhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHH
Q 047021 188 EEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMG 258 (388)
Q Consensus 188 e~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~ 258 (388)
..-+.++.++++.=.+.-+.+- ..-+.++.|||.+..+|+.|-++.+-|-..+++...
T Consensus 164 -~il~~~~~k~~~~~~~~l~~~~------------~~N~~m~kei~~~re~i~el~e~I~~L~~eV~~L~~ 221 (258)
T PF15397_consen 164 -EILSSAAEKTQSPMQPALLQRT------------LENQVMQKEIVQFREEIDELEEEIPQLRAEVEQLQA 221 (258)
T ss_pred -HHHHHHHHHHHhhchHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2223334455543222222221 234678899999999999999999999998887653
No 60
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=74.86 E-value=1.8e+02 Score=34.27 Aligned_cols=105 Identities=19% Similarity=0.133 Sum_probs=60.4
Q ss_pred hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchh---hh
Q 047021 245 ENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETH---KN 321 (388)
Q Consensus 245 ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~---~t 321 (388)
.|+++.+=++-=+......-.+.+ .+...|+..++..|.+++-..-....... + ...+.+|.. ..
T Consensus 198 s~~~~~~L~~~q~dl~~~~~~~l~---~~~~~Lq~~in~kR~~~se~~~~~~~~~~--------~-~~~~~~~~i~~~~~ 265 (1109)
T PRK10929 198 SANNRQELARLRSELAKKRSQQLD---AYLQALRNQLNSQRQREAERALESTELLA--------E-QSGDLPKSIVAQFK 265 (1109)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhH--------H-hhccCChHHHHHHH
Confidence 455555555555555444333332 34456667777766555433211100000 0 000112222 23
Q ss_pred hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021 322 EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL 361 (388)
Q Consensus 322 e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L 361 (388)
.|..|-.+|.+---|...|+.+-.+.-..+.+.+|++.++
T Consensus 266 ~N~~Ls~~L~~~t~~~n~l~~~~~~~~~~l~~~~q~~~~i 305 (1109)
T PRK10929 266 INRELSQALNQQAQRMDLIASQQRQAASQTLQVRQALNTL 305 (1109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4888999999998999999999999999999999987743
No 61
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=72.96 E-value=70 Score=28.59 Aligned_cols=86 Identities=22% Similarity=0.307 Sum_probs=56.6
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
.+++.|+.++..+|.+..-|+.|++..+.+++.+..+|.+...+.--+-.. ..+.-.+.+|
T Consensus 21 ~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~-------------------~~~~E~l~rr 81 (143)
T PF12718_consen 21 AKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKR-------------------KSNAEQLNRR 81 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH-------------------HHhHHHHHhh
Confidence 567888888888888888888888888888888888887665544322211 1112266677
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSL 108 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~l 108 (388)
++--|++=-+++......-..|...
T Consensus 82 iq~LEeele~ae~~L~e~~ekl~e~ 106 (143)
T PF12718_consen 82 IQLLEEELEEAEKKLKETTEKLREA 106 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7777777666666655555444443
No 62
>smart00338 BRLZ basic region leucin zipper.
Probab=72.79 E-value=8.8 Score=29.26 Aligned_cols=38 Identities=21% Similarity=0.427 Sum_probs=33.4
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI 41 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L 41 (388)
.++.+|+.+...|+.+|..|.+++.....++..|...+
T Consensus 26 ~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~~~ 63 (65)
T smart00338 26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 46789999999999999999999999999988887655
No 63
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=72.65 E-value=2.2e+02 Score=34.21 Aligned_cols=233 Identities=22% Similarity=0.296 Sum_probs=0.0
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
+.....+++...-++++....|...+.+.+...+.+.-...++-. .+..|.+.+ -+=-+++..+.+=...|+..+
T Consensus 790 ~~~~~~~~~~~~~~~ee~~~~lr~~~~~l~~~l~~~~~~~k~~~~-~~~~l~~~i----~~~E~~~~k~~~d~~~l~~~~ 864 (1293)
T KOG0996|consen 790 MSDKARQHQEQLHELEERVRKLRERIPELENRLEKLTASVKRLAE-LIEYLESQI----AELEAAVLKKVVDKKRLKELE 864 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHhHHHHHHHHHHHHHH-HHHHHHHHH----HHHHHHhhhccCcHHHHHHHH
Q ss_pred HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC----CCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS----SMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQ 157 (388)
Q Consensus 82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~----~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqr 157 (388)
+-+-+.+-|=--+-|.++. .+++..||...|.-..--.- .+--..+++..++.+|..+.+.++.+-...++-+..
T Consensus 865 ~~ie~l~kE~e~~qe~~~K-k~~i~~lq~~i~~i~~e~~q~qk~kv~~~~~~~~~l~~~i~k~~~~i~~s~~~i~k~q~~ 943 (1293)
T KOG0996|consen 865 EQIEELKKEVEELQEKAAK-KARIKELQNKIDEIGGEKVQAQKDKVEKINEQLDKLEADIAKLTVAIKTSDRNIAKAQKK 943 (1293)
T ss_pred HHHHHHHHHHHHHHHhhhH-HHHHHHHHHHHHHhhchhhHHhHHHHHHHHHHHHHHHHHHHHhHHHHhcCcccHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhchh---hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhH---HHHHHH
Q 047021 158 LTEEQARISVLMSEKQELEEKLVMFSRKASE---EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSR---QKLLME 231 (388)
Q Consensus 158 la~Eq~~~a~l~aEkqeLeekl~~~~~~~se---~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SR---qKLL~E 231 (388)
+-+=..-+-.+..+...|.+.+..+..+..+ ..-+.....+=-...-.-+-..+.++--++--|++.| +-.+++
T Consensus 944 l~~le~~~~~~e~e~~~L~e~~~~~~~k~~E~~~~~~e~~~~~~E~k~~~~~~k~~~e~i~k~~~~lk~~rId~~~K~e~ 1023 (1293)
T KOG0996|consen 944 LSELEREIEDTEKELDDLTEELKGLEEKAAELEKEYKEAEESLKEIKKELRDLKSELENIKKSENELKAERIDIENKLEA 1023 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHH
Q ss_pred hhhhhHHHH
Q 047021 232 IDNQSSEIE 240 (388)
Q Consensus 232 ID~QS~EIE 240 (388)
++.+-.||+
T Consensus 1024 ~~~~l~e~~ 1032 (1293)
T KOG0996|consen 1024 INGELNEIE 1032 (1293)
T ss_pred HHHHHHHHH
No 64
>PRK11281 hypothetical protein; Provisional
Probab=72.45 E-value=2.1e+02 Score=33.82 Aligned_cols=82 Identities=24% Similarity=0.225 Sum_probs=44.0
Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH
Q 047021 99 AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ-------QERKQLTEEQARISVLMSE 171 (388)
Q Consensus 99 AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq-------qeqqrla~Eq~~~a~l~aE 171 (388)
...+..+..+.++..++..+|- + .++.+...|+-|++-++++...+.+.-. =.+.+..--..++..+..+
T Consensus 166 sea~~RlqeI~~~L~~~~~~~~-~--l~~~~~~~l~ae~~~l~~~~~~~~~~l~~~~~l~~l~~~q~d~~~~~~~~~~~~ 242 (1113)
T PRK11281 166 YANSQRLQQIRNLLKGGKVGGK-A--LRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQ 242 (1113)
T ss_pred HHHHHHHHHHHHHHhCCCCCCC-c--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445556666666644444443 2 6777788888888877766433222211 1233333333445555555
Q ss_pred HHHHHHHHHHHh
Q 047021 172 KQELEEKLVMFS 183 (388)
Q Consensus 172 kqeLeekl~~~~ 183 (388)
.+.|.+.+..-.
T Consensus 243 ~~~lq~~in~kr 254 (1113)
T PRK11281 243 LQLLQEAINSKR 254 (1113)
T ss_pred HHHHHHHHHHHH
Confidence 566655555533
No 65
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=72.43 E-value=1.2e+02 Score=30.99 Aligned_cols=34 Identities=24% Similarity=0.203 Sum_probs=21.7
Q ss_pred HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC
Q 047021 82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG 115 (388)
Q Consensus 82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~ 115 (388)
.++.....+...++.+-+.+.+.+++|+++....
T Consensus 204 ~~l~~l~~~l~~~~~~l~~~~a~~~~l~~~l~~~ 237 (498)
T TIGR03007 204 SEISEAQEELEAARLELNEAIAQRDALKRQLGGE 237 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 3455555555566666777777777787776543
No 66
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=71.52 E-value=1.3 Score=49.23 Aligned_cols=70 Identities=24% Similarity=0.284 Sum_probs=0.0
Q ss_pred HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH
Q 047021 205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQN 274 (388)
Q Consensus 205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQN 274 (388)
-+++++|.+...-++.+...|+.+=-|...-+.||=+|..++..+...+-.+...-..+.+.+.|+..|.
T Consensus 380 r~fDk~l~e~k~~~~~~~~e~d~~q~e~r~~~te~~~Lk~~lee~~e~~e~lere~k~L~~El~dl~~q~ 449 (859)
T PF01576_consen 380 RKFDKQLAEWKAKVEELQAERDAAQREARELETELFKLKNELEELQEQLEELERENKQLQDELEDLTSQL 449 (859)
T ss_dssp ----------------------------------------------------------------------
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccchhhh
Confidence 3466777777777777777777777777777777777777666666666666655666666666655443
No 67
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=70.63 E-value=56 Score=26.52 Aligned_cols=65 Identities=29% Similarity=0.276 Sum_probs=39.2
Q ss_pred HHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 218 VERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLR 285 (388)
Q Consensus 218 VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR 285 (388)
.++||.-=++++.-|+.=.-||+.|-++|.+|++.-.+-.. .=+.--+|-=...+.|+++|.|+.
T Consensus 6 l~~LE~ki~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~---en~~L~~e~~~~~~rl~~LL~kl~ 70 (72)
T PF06005_consen 6 LEQLEEKIQQAVETIALLQMENEELKEKNNELKEENEELKE---ENEQLKQERNAWQERLRSLLGKLE 70 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhh
Confidence 45666666677777777777888888887777733322222 111222444455677788887764
No 68
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=70.53 E-value=1.7e+02 Score=32.08 Aligned_cols=50 Identities=24% Similarity=0.406 Sum_probs=35.1
Q ss_pred HHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021 35 ESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK 85 (388)
Q Consensus 35 ~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk 85 (388)
..|+.++.++.++. --|+..|..+-...|..++.|-|+..+.-++..+|.
T Consensus 165 ~~le~e~~~Lk~en-~rl~~~l~~~r~~ld~Etllr~d~~n~~q~Lleel~ 214 (546)
T KOG0977|consen 165 KALEDELKRLKAEN-SRLREELARARKQLDDETLLRVDLQNRVQTLLEELA 214 (546)
T ss_pred HHHHHHHHHHHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 33333333443332 346778888889999999999999988888888773
No 69
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=69.64 E-value=1e+02 Score=29.15 Aligned_cols=72 Identities=15% Similarity=0.279 Sum_probs=62.1
Q ss_pred hhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH
Q 047021 201 VEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLK 272 (388)
Q Consensus 201 ~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclk 272 (388)
.+++..|...++.+-..++.|+.-.++|=.-+++|..+|++|-..-..+....++...+-.++-..++.-+.
T Consensus 48 ~~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~L~~~v~ 119 (251)
T PF11932_consen 48 DDEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDELEQFVE 119 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 477888888888888889999999999999999999999999999999999999988888888777766444
No 70
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=69.42 E-value=1.6e+02 Score=31.32 Aligned_cols=24 Identities=21% Similarity=0.275 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 160 EEQARISVLMSEKQELEEKLVMFS 183 (388)
Q Consensus 160 ~Eq~~~a~l~aEkqeLeekl~~~~ 183 (388)
.++.+++.+..|++.++.++.+-.
T Consensus 196 ~q~~kl~~~~~E~kk~~~~l~~~l 219 (420)
T COG4942 196 AQQAKLAQLLEERKKTLAQLNSEL 219 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555555533
No 71
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=68.98 E-value=1.4e+02 Score=30.56 Aligned_cols=61 Identities=23% Similarity=0.410 Sum_probs=44.9
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh--------------hHHHHhhhhhhhhhhHH
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP--------------SLRKALKDVAMEKDAAV 67 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~--------------slrkaL~~v~mEkDaA~ 67 (388)
.+|..+...|..+|+.|...+.+-..+..+|+..+..+....|. +|=|-|..+.-||+..+
T Consensus 23 ~~l~~~~~sL~qen~~Lk~El~~ek~~~~~L~~e~~~lr~~sv~~~~~aEqEEE~isN~LlKkl~~l~keKe~L~ 97 (310)
T PF09755_consen 23 EQLRKRIESLQQENRVLKRELETEKARCKHLQEENRALREASVRIQAKAEQEEEFISNTLLKKLQQLKKEKETLA 97 (310)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 57888899999999999999999999999998888766555443 34444555555555554
No 72
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=68.74 E-value=13 Score=28.25 Aligned_cols=36 Identities=28% Similarity=0.417 Sum_probs=29.7
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTR 40 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~ 40 (388)
.+..|+.++..|+.+|..|...+.....++..|...
T Consensus 27 ~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e 62 (64)
T PF00170_consen 27 YIEELEEKVEELESENEELKKELEQLKKEIQSLKSE 62 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 578889999999999999988888888777777654
No 73
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=68.56 E-value=31 Score=34.25 Aligned_cols=108 Identities=23% Similarity=0.263 Sum_probs=51.3
Q ss_pred hhHHHHHHhhHHHHHHhhHhhhhhh---hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHH
Q 047021 25 SKLQVKEVESESLQTRINELEQNTV---PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAAL 101 (388)
Q Consensus 25 sk~~~k~~~~~~L~~~Lnd~e~n~v---~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaL 101 (388)
.||.+....++.|+.+|....-... ....+.++.|..++..|-.-+......=..+....+++.+.+-.|++|-+..
T Consensus 8 ~KL~et~~~V~~m~~~L~~~~~~L~~k~~e~e~ll~~i~~~~~~a~~~~~~~~~ee~~~~~~~~ei~~~~~~a~~~L~~a 87 (344)
T PF12777_consen 8 DKLKETEEQVEEMQEELEEKQPELEEKQKEAEELLEEIEKEQEEAEKKKAIVEEEEEEAEKQAKEIEEIKEEAEEELAEA 87 (344)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555553321111 1233455555555444443333333333344455556666666666666555
Q ss_pred HHHHHHHHHHHhcCCCCCCC---CCCCChhhHHH
Q 047021 102 RAELNSLQQQAMDGSLGGIS---SMGISPDHVQT 132 (388)
Q Consensus 102 RAEln~lQqq~~~~~~s~~~---~~g~s~~~~~~ 132 (388)
.-.|..-+..+..=..++++ +|+.||.-+..
T Consensus 88 ~P~L~~A~~al~~l~k~di~Eiks~~~PP~~V~~ 121 (344)
T PF12777_consen 88 EPALEEAQEALKSLDKSDISEIKSYANPPEAVKL 121 (344)
T ss_dssp HHHHHHHHHHHHCS-HHHHHHHHHSSS--HHHHH
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHhhCCCcHHHHH
Confidence 55555555555443333443 77777777554
No 74
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=67.63 E-value=1.8e+02 Score=31.28 Aligned_cols=42 Identities=26% Similarity=0.376 Sum_probs=30.2
Q ss_pred hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
.++..++..+.+++++.+.+.-..+++.+.+.+++..+++..
T Consensus 205 ~~~~~~~~~le~el~~l~~~~e~l~~~i~~l~~ele~a~~~l 246 (650)
T TIGR03185 205 SSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSL 246 (650)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346677777777777777777777777777777777666554
No 75
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=65.24 E-value=39 Score=30.90 Aligned_cols=145 Identities=28% Similarity=0.316 Sum_probs=32.0
Q ss_pred hhHHHHHHHHHHHHHHHhcC----CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 96 EDAAALRAELNSLQQQAMDG----SLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSE 171 (388)
Q Consensus 96 EDaAaLRAEln~lQqq~~~~----~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aE 171 (388)
+.++.|.++...|+...... +.|+..+.++++..+..++..+..+..+|..--.-+.+-.++|-. +..+
T Consensus 31 d~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~l~~ELael~r~~~el~~~L~~-------~~~~ 103 (194)
T PF08614_consen 31 DRTSLLKAENEQLQPEAESLPSSSSSSPSESGSVSSAQISSLEQKLAKLQEELAELYRSKGELAQQLVE-------LNDE 103 (194)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc-------cccc
Confidence 45667777777777643222 122222333445555566666555555444322222223333333 3333
Q ss_pred HHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhh
Q 047021 172 KQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSS 251 (388)
Q Consensus 172 kqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lss 251 (388)
.+.++.++..-......-.... +--......++..|.++...++.|.--=.-|-.++...-..+.+|=.||..|..
T Consensus 104 l~~l~~~~~~~~~~l~~l~~~~----~~L~~~~~~l~~~l~ek~k~~e~l~DE~~~L~l~~~~~e~k~~~l~~En~~Lv~ 179 (194)
T PF08614_consen 104 LQELEKELSEKERRLAELEAEL----AQLEEKIKDLEEELKEKNKANEILQDELQALQLQLNMLEEKLRKLEEENRELVE 179 (194)
T ss_dssp ----------HHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cchhhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344433333222211111111 111223344555555555555555555555556666666667777777776643
No 76
>PRK02224 chromosome segregation protein; Provisional
Probab=65.16 E-value=2.2e+02 Score=31.26 Aligned_cols=33 Identities=30% Similarity=0.333 Sum_probs=15.4
Q ss_pred HHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021 77 LRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQ 109 (388)
Q Consensus 77 LR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQ 109 (388)
+.....++.+++++--.+-++-+.+++.++.++
T Consensus 407 ~~~~e~~l~~l~~~~~~l~~~~~~~~~~~~~~~ 439 (880)
T PRK02224 407 LGNAEDFLEELREERDELREREAELEATLRTAR 439 (880)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444445455555444444
No 77
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=64.84 E-value=14 Score=34.38 Aligned_cols=43 Identities=28% Similarity=0.452 Sum_probs=38.9
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL 44 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~ 44 (388)
+++++.+=+.+|..|..-|.+|..+++.+...|+.|..+|.-+
T Consensus 65 ~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~N~~L~~dl~kl 107 (182)
T PF15035_consen 65 ALIRLEEEQQRSEELAQVNALLREQLEQARKANEALQEDLQKL 107 (182)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4678889999999999999999999999999999999887644
No 78
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=64.03 E-value=1.5e+02 Score=28.84 Aligned_cols=169 Identities=16% Similarity=0.171 Sum_probs=0.0
Q ss_pred hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhh-------------HHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021 13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPS-------------LRKALKDVAMEKDAAVVAREDLSAQLRT 79 (388)
Q Consensus 13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~s-------------lrkaL~~v~mEkDaA~vareDl~AQLR~ 79 (388)
+..++.+..-+..++.........|...++....+..|. +......+........-..+.+-+++..
T Consensus 76 ~~~~~~~l~~l~~~~~~l~a~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~ 155 (423)
T TIGR01843 76 ATDVEADAAELESQVLRLEAEVARLRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQ 155 (423)
T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCChHHhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHH--------HHHHH
Q 047021 80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQ--------ESQLR 151 (388)
Q Consensus 80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqq--------es~lr 151 (388)
++..+...+...-.++.....+..+++..+.-...+..|.. .+-.+..++..++.++..++.++.+ +..+.
T Consensus 156 ~~~~i~~~~~~l~~~~~~l~~~~~~~~~~~~L~~~g~is~~-~~~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~~~ 234 (423)
T TIGR01843 156 LEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRL-ELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQ 234 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHH-HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 152 QQERKQLTEEQARISVLMSEKQELEEKLVMF 182 (388)
Q Consensus 152 qqeqqrla~Eq~~~a~l~aEkqeLeekl~~~ 182 (388)
................+..+...++.++..+
T Consensus 235 ~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~ 265 (423)
T TIGR01843 235 QIEQTFREEVLEELTEAQARLAELRERLNKA 265 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 79
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=63.77 E-value=1.3e+02 Score=29.97 Aligned_cols=80 Identities=19% Similarity=0.241 Sum_probs=48.4
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK 85 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk 85 (388)
+.+|..+..+++.+...|...+-..|-..-.+..+++.++...-..+++....+.-+-..+.....++.+++...++++.
T Consensus 256 i~~l~~~l~~le~~l~~l~~~y~~~hP~v~~l~~~i~~l~~~l~~e~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~ 335 (444)
T TIGR03017 256 IQNLKTDIARAESKLAELSQRLGPNHPQYKRAQAEINSLKSQLNAEIKKVTSSVGTNSRILKQREAELREALENQKAKVL 335 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677777788888888888777777777777777777776655444444443333333333334444444444444433
No 80
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=62.07 E-value=2.2e+02 Score=30.26 Aligned_cols=167 Identities=23% Similarity=0.255 Sum_probs=93.5
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
.++...+..+..+.++...+..++..++.++--..++|..- +.+|-.|..-|..+..+++.|.-.-+ ..|-|
T Consensus 27 e~~~~~e~eL~~~qeel~~~k~~l~~~E~~k~~~l~ELe~a-kr~veel~~kLe~~~~~~~~a~~~~e-------~~k~r 98 (522)
T PF05701_consen 27 ERVKEKETELEKAQEELAKLKEQLEAAEREKAQALSELESA-KRTVEELKLKLEKAQAEEKQAEEDSE-------LAKFR 98 (522)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhH-------HhHHH
Confidence 56778888888899999999999999999988887777643 45677777778777777776654433 44556
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLT-EEQ 162 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla-~Eq 162 (388)
++|.+. --+......++++|...+.|-.. .++-....-+++..+..|++.+...-. ...=+..+-.+.+ .-.
T Consensus 99 ~~e~e~--~~~~~~~~~~k~ele~~~~q~~~----~~~eL~~~k~EL~~lr~e~~~~~~~k~-~A~~~aeea~~~a~~~~ 171 (522)
T PF05701_consen 99 AKELEQ--GIAEEASVAWKAELESAREQYAS----AVAELDSVKQELEKLRQELASALDAKN-AALKQAEEAVSAAEENE 171 (522)
T ss_pred HHHHhh--hhcccchHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH
Confidence 665543 23333334466666666655411 111111123334444444443322110 0111111111111 233
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh
Q 047021 163 ARISVLMSEKQELEEKLVMFSRK 185 (388)
Q Consensus 163 ~~~a~l~aEkqeLeekl~~~~~~ 185 (388)
.++..|..|...+.+-|.++...
T Consensus 172 ~kve~L~~Ei~~lke~l~~~~~a 194 (522)
T PF05701_consen 172 EKVEELSKEIIALKESLESAKLA 194 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666666666666665433
No 81
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=62.02 E-value=2.5e+02 Score=30.88 Aligned_cols=83 Identities=27% Similarity=0.375 Sum_probs=58.5
Q ss_pred HHHhhHHHHHHhhHhhhhh--------------hhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhH
Q 047021 30 KEVESESLQTRINELEQNT--------------VPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAE 95 (388)
Q Consensus 30 k~~~~~~L~~~Lnd~e~n~--------------v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAE 95 (388)
.+.+|..|+.+||.++... .+++++.|++++=+|--+.+==.-|..++-.+++++-+++.+--.++
T Consensus 61 LEaqN~~L~~di~~lr~~~~~~ts~ik~~ye~El~~ar~~l~e~~~~ra~~e~ei~kl~~e~~elr~~~~~~~k~~~~~r 140 (546)
T KOG0977|consen 61 LEAQNRKLEHDINLLRGVVGRETSGIKAKYEAELATARKLLDETARERAKLEIEITKLREELKELRKKLEKAEKERRGAR 140 (546)
T ss_pred HHHHHHHHHHHHHHHHhhccCCCcchhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhH
Confidence 3455566666666554332 35788889888777766666666677888889999999988888888
Q ss_pred hhHHHHHHHHHHHHHHH
Q 047021 96 EDAAALRAELNSLQQQA 112 (388)
Q Consensus 96 EDaAaLRAEln~lQqq~ 112 (388)
++.-....-|+.++...
T Consensus 141 e~~~~~~~~l~~leAe~ 157 (546)
T KOG0977|consen 141 EKLDDYLSRLSELEAEI 157 (546)
T ss_pred HHHHHHhhhhhhhhhHH
Confidence 88776666666665544
No 82
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=60.90 E-value=9.8 Score=30.47 Aligned_cols=25 Identities=32% Similarity=0.655 Sum_probs=21.3
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhh
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSK 26 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk 26 (388)
|-.++.+|+.+|.+||.||.+|.+.
T Consensus 19 LK~~I~eL~~~n~~Le~EN~~Lk~~ 43 (59)
T PF01166_consen 19 LKEQIAELEERNSQLEEENNLLKQN 43 (59)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4568899999999999999998764
No 83
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=60.02 E-value=2.5e+02 Score=30.27 Aligned_cols=87 Identities=28% Similarity=0.350 Sum_probs=61.7
Q ss_pred HHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021 10 QKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE 89 (388)
Q Consensus 10 q~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE 89 (388)
+....+.|.|+.+-.+++++...+.+....+++...+|. -||+-+. --+++|-+-++.||+---+
T Consensus 94 ~~~k~~~e~er~~~~~El~~~r~e~~~v~~~~~~a~~n~----~kAqQ~l-----------ar~t~Q~q~lqtrl~~l~~ 158 (499)
T COG4372 94 QGEKRAAETEREAARSELQKARQEREAVRQELAAARQNL----AKAQQEL-----------ARLTKQAQDLQTRLKTLAE 158 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHH-----------HHHHHHHHHHHHHHHHHHH
Confidence 334445566666666777777777777777776666653 2444222 1367888999999999888
Q ss_pred HHhhhHhhHHHHHHHHHHHHHH
Q 047021 90 EQYRAEEDAAALRAELNSLQQQ 111 (388)
Q Consensus 90 EQyRAEEDaAaLRAEln~lQqq 111 (388)
+-..+++.+-+|-|+-+.||..
T Consensus 159 qr~ql~aq~qsl~a~~k~LQ~s 180 (499)
T COG4372 159 QRRQLEAQAQSLQASQKQLQAS 180 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 8888899999999999999865
No 84
>PRK10884 SH3 domain-containing protein; Provisional
Probab=59.80 E-value=1.1e+02 Score=29.21 Aligned_cols=33 Identities=12% Similarity=0.128 Sum_probs=20.2
Q ss_pred HhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC
Q 047021 83 RLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG 115 (388)
Q Consensus 83 RlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~ 115 (388)
+.++..++-..+......|+++++.++...+..
T Consensus 140 ~n~~L~~~l~~~~~~~~~l~~~~~~~~~~~~~~ 172 (206)
T PRK10884 140 ENQKLKNQLIVAQKKVDAANLQLDDKQRTIIMQ 172 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333334444446666777888888888665443
No 85
>KOG3977 consensus Troponin I [Cytoskeleton]
Probab=58.61 E-value=23 Score=34.40 Aligned_cols=75 Identities=27% Similarity=0.428 Sum_probs=58.8
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHH--------hhhhHHHHH
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVV--------AREDLSAQL 77 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~v--------areDl~AQL 77 (388)
.+|.-+...||+++=-+.-+....+.++..|-..+|||+-+.| |+|++- ++=-|+--- ...|+-|+|
T Consensus 98 ~~l~arv~~leEEkYDi~~~v~qt~~EIndLtikvnDLRGKFvkPtLkkV----sks~~kf~ka~~~~k~~~k~DlRanL 173 (221)
T KOG3977|consen 98 RELHARVDALEEEKYDIEAKVTQTETEINDLTIKVNDLRGKFVKPTLKKV----SKSADKFLKALLGSKHKVKMDLRANL 173 (221)
T ss_pred HHHHHHHHHHHHhhcchhheeehhhhhHHHHHHHHHHhcccccCccHHHH----HhhhHHHHHHhhccchhhhHHHHHHH
Confidence 4677788888898888888888899999999999999999999 998763 333333221 267999999
Q ss_pred HHHHHHhH
Q 047021 78 RTLKKRLK 85 (388)
Q Consensus 78 R~~kkRlk 85 (388)
.+.||-=-
T Consensus 174 K~VKKed~ 181 (221)
T KOG3977|consen 174 KQVKKEDT 181 (221)
T ss_pred HHhhhhhH
Confidence 99998633
No 86
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=58.55 E-value=2.1e+02 Score=29.89 Aligned_cols=44 Identities=25% Similarity=0.275 Sum_probs=38.0
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
.+.....++.+++++++++.+..+++...+++.+.++-|..-..
T Consensus 66 ~~~~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~ 109 (525)
T TIGR02231 66 SRPDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIRE 109 (525)
T ss_pred CcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 35567789999999999999999999999999999988877653
No 87
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=58.47 E-value=1.5e+02 Score=27.19 Aligned_cols=137 Identities=19% Similarity=0.250 Sum_probs=63.3
Q ss_pred HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhh-hhHHHh-----------------h
Q 047021 9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEK-DAAVVA-----------------R 70 (388)
Q Consensus 9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEk-DaA~va-----------------r 70 (388)
|..-..+++.....+..-+.+.......+.++++..+....---.+|..-+.-.+ |.|..| -
T Consensus 28 l~q~ird~e~~l~~a~~~~a~~~a~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~al~~k~~~e~~~~~l~~~~ 107 (221)
T PF04012_consen 28 LEQAIRDMEEQLRKARQALARVMANQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREALQRKADLEEQAERLEQQL 107 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555555555555555666555555444433333321111111 111111 2
Q ss_pred hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCC-CCCCCCCCCChhhHHHHHHHHHHHHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGS-LGGISSMGISPDHVQTLEKELATLKSQLQQ 146 (388)
Q Consensus 71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~-~s~~~~~g~s~~~~~~lekEl~~l~~~lqq 146 (388)
+.+..++-.++..+.+.+..-....--...|.+..+.-+.+...+. .+|+ ++|.+.+.+..|+..+....++..-
T Consensus 108 ~~~~~~~~~l~~~l~~l~~kl~e~k~k~~~l~ar~~~a~a~~~~~~~~~~~-~~~~a~~~~er~e~ki~~~ea~a~a 183 (221)
T PF04012_consen 108 DQAEAQVEKLKEQLEELEAKLEELKSKREELKARENAAKAQKKVNEALASF-SVSSAMDSFERMEEKIEEMEARAEA 183 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccchHHHHHHHHHHHHHHHHHHHH
Confidence 2233333333334444333332222223334444444444443333 3344 3677778888888888866555443
No 88
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=58.42 E-value=45 Score=26.80 Aligned_cols=39 Identities=23% Similarity=0.273 Sum_probs=24.5
Q ss_pred HHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh
Q 047021 30 KEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA 69 (388)
Q Consensus 30 k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va 69 (388)
.+.++-+|+.+|+.+.++.- .-.-.++.++-|||.++..
T Consensus 3 Lea~~~~Lr~rLd~~~rk~~-~~~~~~k~L~~ERd~~~~~ 41 (69)
T PF14197_consen 3 LEAEIATLRNRLDSLTRKNS-VHEIENKRLRRERDSAERQ 41 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH
Confidence 45566677777776666543 2235566667788887654
No 89
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=58.29 E-value=2.6e+02 Score=29.85 Aligned_cols=105 Identities=22% Similarity=0.370 Sum_probs=58.6
Q ss_pred HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021 11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE 90 (388)
Q Consensus 11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE 90 (388)
++......+...+.++|....-+...|++.|.+++.. +.++..+|.....+.+ .+..++..+-+++.-.+..
T Consensus 38 ~~l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~-i~~~~~ql~~s~~~l~-------~~~~~I~~~~~~l~~l~~q 109 (420)
T COG4942 38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-IASLEAQLIETADDLK-------KLRKQIADLNARLNALEVQ 109 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHHH-------HHHhhHHHHHHHHHHHHHH
Confidence 4455555666666666666667777777777776643 4556666655444433 3334444444444433333
Q ss_pred HhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhH
Q 047021 91 QYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHV 130 (388)
Q Consensus 91 QyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~ 130 (388)
. .+....|-..|+++|+.- -|++ +...+++++.
T Consensus 110 ~---r~qr~~La~~L~A~~r~g-~~p~---~~ll~~~eda 142 (420)
T COG4942 110 E---REQRRRLAEQLAALQRSG-RNPP---PALLVSPEDA 142 (420)
T ss_pred H---HHHHHHHHHHHHHHHhcc-CCCC---chhhcChhhh
Confidence 3 444556667777777643 2221 2456777773
No 90
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=58.04 E-value=32 Score=33.73 Aligned_cols=52 Identities=19% Similarity=0.219 Sum_probs=38.5
Q ss_pred HHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 229 LMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK 283 (388)
Q Consensus 229 L~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek 283 (388)
...+.........|..||..|..-+. ...++|....++|...|++||.+|+-
T Consensus 58 ~~~~~~~~~~~~~l~~EN~~Lr~e~~---~l~~~~~~~~~~l~~EN~rLr~LL~~ 109 (283)
T TIGR00219 58 FDGISENLKDVNNLEYENYKLRQELL---KKNQQLEILTQNLKQENVRLRELLNS 109 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 34455566677788899998885443 33567777777899999999998875
No 91
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=56.32 E-value=2.4e+02 Score=28.95 Aligned_cols=91 Identities=26% Similarity=0.297 Sum_probs=39.4
Q ss_pred HHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHH---HHHHH--hhhH
Q 047021 21 SILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKE---AEEEQ--YRAE 95 (388)
Q Consensus 21 ~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkE---AEEEQ--yRAE 95 (388)
+.|+.||...--++..|.+.|-..-...|.-|++-+.++.-|. .+...=.-+||..|=.+.- .|.|- -|=-
T Consensus 109 n~L~rkl~qLr~EK~~lE~~Le~EqE~~V~kL~k~i~~Le~e~----~~~q~~le~Lr~EKVdlEn~LE~EQE~lvN~L~ 184 (310)
T PF09755_consen 109 NDLSRKLNQLRQEKVELENQLEQEQEYLVNKLQKKIERLEKEK----SAKQEELERLRREKVDLENTLEQEQEALVNRLW 184 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH----HHhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444333344455555554432222 1111222255554444322 12111 1112
Q ss_pred hhHHHHHHHHHHHHHHHhcC
Q 047021 96 EDAAALRAELNSLQQQAMDG 115 (388)
Q Consensus 96 EDaAaLRAEln~lQqq~~~~ 115 (388)
-...-|.++-..||......
T Consensus 185 Kqm~~l~~eKr~Lq~~l~~~ 204 (310)
T PF09755_consen 185 KQMDKLEAEKRRLQEKLEQP 204 (310)
T ss_pred HHHHHHHHHHHHHHHHHccc
Confidence 23455677777788877653
No 92
>PF13935 Ead_Ea22: Ead/Ea22-like protein
Probab=56.08 E-value=67 Score=28.32 Aligned_cols=59 Identities=20% Similarity=0.242 Sum_probs=47.5
Q ss_pred HHHHHhhhhhHHHHHHhhh--hhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Q 047021 227 KLLMEIDNQSSEIERLFEE--NSSLSSSYQESMGIAKHWENQV-KDCLKQNEELRGILEKLR 285 (388)
Q Consensus 227 KLL~EID~QS~EIErLf~E--Ns~Lsss~qea~~iaaqWEnQv-qDclkQNeeLr~lLeklR 285 (388)
-||.||+.-...|++++.+ |..++..+++++.--...+.-+ ..|..+++...++...++
T Consensus 71 ALLDElE~~~~~i~~~~~~~e~~~~a~~~~~l~~~Le~ae~~~~~~~~~~~~~~e~~~~~~~ 132 (139)
T PF13935_consen 71 ALLDELERAQQRIAELEQECENEDIALDVQKLRVELEAAEKRIAAELAEQAEAYEGEIADYA 132 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 4899999999999999977 9999999999988666666655 677778777777666644
No 93
>KOG0249 consensus LAR-interacting protein and related proteins [General function prediction only]
Probab=54.77 E-value=1.6e+02 Score=33.80 Aligned_cols=91 Identities=29% Similarity=0.391 Sum_probs=63.5
Q ss_pred HHHhHHhhhhHHHHHHhhHHHHHHhh-Hhhhh-----hhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021 17 EEKNSILSSKLQVKEVESESLQTRIN-ELEQN-----TVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE 90 (388)
Q Consensus 17 e~~n~~L~sk~~~k~~~~~~L~~~Ln-d~e~n-----~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE 90 (388)
++...+|...++++..+..-..+++. ..+-| ||+ .-|.+-.=|+-||.--++-|.-+++++||++-+..-+
T Consensus 162 eer~~kl~~~~qe~naeL~rarqreemneeh~~rlsdtvd---ErlqlhlkermaAle~kn~L~~e~~s~kk~l~~~~~~ 238 (916)
T KOG0249|consen 162 EERTRKLEEQLEELNAELQRARQREKMNEEHNKRLSDTVD---ERLQLHLKERMAALEDKNRLEQELESVKKQLEEMRHD 238 (916)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555544443 11111 332 3333333678888888999999999999999999999
Q ss_pred HhhhHhhHHHHHHHHHHHHH
Q 047021 91 QYRAEEDAAALRAELNSLQQ 110 (388)
Q Consensus 91 QyRAEEDaAaLRAEln~lQq 110 (388)
--|--.|--.||.|++.|-.
T Consensus 239 k~rl~~d~E~Lr~e~~qL~~ 258 (916)
T KOG0249|consen 239 KDKLRTDIEDLRGELDQLRR 258 (916)
T ss_pred HHHHhhhHHHHHHHHHHHHH
Confidence 88888999999999999874
No 94
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=54.74 E-value=27 Score=28.18 Aligned_cols=38 Identities=37% Similarity=0.442 Sum_probs=24.2
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN 42 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln 42 (388)
.|+.|..++.++|.....+..+++..+.+.+.|..+|.
T Consensus 34 ~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l~ 71 (74)
T PF12329_consen 34 TIKKLRAKIKELEKQIKELKKKLEELEKELESLEERLK 71 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 35566666666666666666666666666666666554
No 95
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=54.72 E-value=28 Score=31.33 Aligned_cols=46 Identities=22% Similarity=0.338 Sum_probs=41.4
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN 47 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n 47 (388)
|.+++++++..+...|.-.++|.+++..|+.++..|...|.++...
T Consensus 78 l~a~~~e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L~~~~~~ 123 (131)
T PF04859_consen 78 LAAEIQEQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKLDELNRA 123 (131)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5678899999999999999999999999999999999999987653
No 96
>PF15294 Leu_zip: Leucine zipper
Probab=54.17 E-value=1.7e+02 Score=29.47 Aligned_cols=124 Identities=27% Similarity=0.322 Sum_probs=83.1
Q ss_pred hhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCcccc-
Q 047021 235 QSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQ- 313 (388)
Q Consensus 235 QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~- 313 (388)
-+-||.||=+||..|.+-++ .-|.++-.|+.....|..-|.++|..|++.--.. ++...
T Consensus 130 l~kEi~rLq~EN~kLk~rl~-------~le~~at~~l~Ek~kl~~~L~~lq~~~~~~~~k~-------------~~~~~~ 189 (278)
T PF15294_consen 130 LNKEIDRLQEENEKLKERLK-------SLEKQATSALDEKSKLEAQLKELQDEQGDQKGKK-------------DLSFKA 189 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-------------cccccc
Confidence 56799999999999887665 3577888999999999999999999887654221 01111
Q ss_pred CCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHhhHHHHHHHhhhcccc
Q 047021 314 NGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRLYKPVLRNIESSLIKMKQDGSV 385 (388)
Q Consensus 314 ~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rly~PvL~~IEsrLikmkq~~~~ 385 (388)
........-+..+|.+|-+. ...++.+.-.|.-.|.-|.+- |-|.-.| |+.+|+-|-+.-|.+++
T Consensus 190 q~l~dLE~k~a~lK~e~ek~---~~d~~~~~k~L~e~L~~~Khe---lL~~Qeq-L~~aekeLekKfqqT~a 254 (278)
T PF15294_consen 190 QDLSDLENKMAALKSELEKA---LQDKESQQKALEETLQSCKHE---LLRVQEQ-LSLAEKELEKKFQQTAA 254 (278)
T ss_pred cchhhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH---HHhcchh-hhcchhhHHHHhCccHH
Confidence 00111111255566666665 334445666777777777654 5566666 88888888887776653
No 97
>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown]
Probab=54.10 E-value=5.5e+02 Score=32.33 Aligned_cols=131 Identities=23% Similarity=0.246 Sum_probs=85.0
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHH-hh--h---hhhhhhhHHH-hhhhHHHHHHHH
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKA-LK--D---VAMEKDAAVV-AREDLSAQLRTL 80 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrka-L~--~---v~mEkDaA~v-areDl~AQLR~~ 80 (388)
.+++.+..+-+....|.+.+...+..+++|++..+-+...++++.-.. ++ + ..++.++++- .=.|+-..|+..
T Consensus 569 ~~k~~~~~a~e~i~~L~~~l~e~~~~i~sLl~erd~y~e~l~~~e~~~~~k~nss~~~~t~~~~~~e~~l~qLe~~le~~ 648 (1822)
T KOG4674|consen 569 ILKETINEASEKIAELEKELEEQEQRIESLLTERDMYKELLAELEDSHQLKPNSSALDQTEAPRAKEKRLRQLENELESY 648 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCCchhhcccccchhHHHHHHHHHHHHHHH
Confidence 455566666666666777777777777777777777766666655551 11 1 2456666522 234555555543
Q ss_pred HHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHH
Q 047021 81 KKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQ 145 (388)
Q Consensus 81 kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lq 145 (388)
+ .|.-+-....+||-..|+.++-.|.-+...-.+ ..+.+.+.+.-|++.+..++.++-
T Consensus 649 ~---~E~~~~~~~l~e~~~~l~~ev~~ir~~l~k~~~----~~~fA~ekle~L~~~ie~~K~e~~ 706 (1822)
T KOG4674|consen 649 K---KEKRENLKKLQEDFDSLQKEVTAIRSQLEKLKN----ELNLAKEKLENLEKNLELTKEEVE 706 (1822)
T ss_pred H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3 355666777889999999999999877733221 344577888888888887777765
No 98
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=52.85 E-value=28 Score=29.62 Aligned_cols=38 Identities=21% Similarity=0.399 Sum_probs=32.2
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI 41 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L 41 (388)
.+|.+|+..+..+.++|..|..++++.-.+...|.+=|
T Consensus 49 k~v~~L~~e~~~l~~E~e~L~~~l~~e~~Ek~~Ll~ll 86 (87)
T PF12709_consen 49 KKVDELENENKALKRENEQLKKKLDTEREEKQELLKLL 86 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 57888999999999999999999999888888877644
No 99
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=52.74 E-value=79 Score=27.35 Aligned_cols=53 Identities=32% Similarity=0.501 Sum_probs=43.6
Q ss_pred HHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 213 DMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT 286 (388)
Q Consensus 213 dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~ 286 (388)
++-..|.+||..=..|+.+|..=...|-.|.+||..|. ..|..||..|.++..
T Consensus 5 ~l~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~---------------------~EN~~Lr~~l~~~~~ 57 (107)
T PF06156_consen 5 ELFDRLDQLEQQLGQLLEELEELKKQLQELLEENARLR---------------------IENEHLRERLEELEQ 57 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHhc
Confidence 55666888888888888888888889999999999885 367889998888765
No 100
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=52.65 E-value=1.7e+02 Score=26.17 Aligned_cols=90 Identities=24% Similarity=0.381 Sum_probs=59.7
Q ss_pred HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhh
Q 047021 15 ELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRA 94 (388)
Q Consensus 15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRA 94 (388)
.|.-+|..|..+|+++..+.-.|...... .-.+....++-|..+.-+ -..+-..|...+..+....++=|++
T Consensus 46 qLkien~~l~~kIeERn~eL~~Lk~~~~~-~v~~L~h~keKl~~~~~~-------~~~l~~~l~~~~~~~~~~r~~l~~~ 117 (177)
T PF13870_consen 46 QLKIENQQLNEKIEERNKELLKLKKKIGK-TVQILTHVKEKLHFLSEE-------LERLKQELKDREEELAKLREELYRV 117 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35557777777777776665555443321 112224455555444333 2345567777888888888899999
Q ss_pred HhhHHHHHHHHHHHHHHH
Q 047021 95 EEDAAALRAELNSLQQQA 112 (388)
Q Consensus 95 EEDaAaLRAEln~lQqq~ 112 (388)
..+-..+|...+.|+++.
T Consensus 118 k~~r~k~~~~~~~l~~~~ 135 (177)
T PF13870_consen 118 KKERDKLRKQNKKLRQQG 135 (177)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 999999999999998876
No 101
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=52.59 E-value=1.4e+02 Score=27.31 Aligned_cols=37 Identities=32% Similarity=0.404 Sum_probs=16.8
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN 42 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln 42 (388)
|..|+..++.|+.+...|...|.++...++.|.+++.
T Consensus 118 l~~l~~~~~~L~~~~~~l~~~l~ek~k~~e~l~DE~~ 154 (194)
T PF08614_consen 118 LAELEAELAQLEEKIKDLEEELKEKNKANEILQDELQ 154 (194)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444444444444444
No 102
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms]
Probab=51.70 E-value=5.2e+02 Score=31.40 Aligned_cols=81 Identities=17% Similarity=0.283 Sum_probs=42.5
Q ss_pred HhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhh---Hhh
Q 047021 22 ILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRA---EED 97 (388)
Q Consensus 22 ~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRA---EED 97 (388)
++..++++....+.+-...|++-+.++. .+.......+..++. -+.++.+++|..+..|+.+...--.+ .+-
T Consensus 465 ~~~keL~e~i~~lk~~~~el~~~q~~l~q~~~ke~~ek~~~~~~----~~~~l~~~~~~~~eele~~q~~~~~~~~~~~k 540 (1317)
T KOG0612|consen 465 EMDKELEETIEKLKSEESELQREQKALLQHEQKEVEEKLSEEEA----KKRKLEALVRQLEEELEDAQKKNDNAADSLEK 540 (1317)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3444444444444444455554333333 233333323333332 27899999999999999994433333 344
Q ss_pred HHHHHHHHH
Q 047021 98 AAALRAELN 106 (388)
Q Consensus 98 aAaLRAEln 106 (388)
..++|.+|.
T Consensus 541 v~~~rk~le 549 (1317)
T KOG0612|consen 541 VNSLRKQLE 549 (1317)
T ss_pred HHHHHHHHH
Confidence 444444443
No 103
>PHA02562 46 endonuclease subunit; Provisional
Probab=51.51 E-value=2.9e+02 Score=28.38 Aligned_cols=146 Identities=9% Similarity=0.189 Sum_probs=68.7
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh----HhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN----ELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRT 79 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln----d~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~ 79 (388)
.+++++++.+..+..+...+..++.........+....+ +++. ....+....+.+--+.+...-.-.++-..+-.
T Consensus 174 ~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~~~~~i~~l~~-e~~~l~~~~~~l~~~l~~l~~~i~~l~~~i~~ 252 (562)
T PHA02562 174 DKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN-KYDELVEEAKTIKAEIEELTDELLNLVMDIED 252 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 355667777777777777776666644433333322221 1110 11111222222211111111111111111111
Q ss_pred HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh----cCCCC-CCCCCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM----DGSLG-GISSMGISPDHVQTLEKELATLKSQLQQESQL 150 (388)
Q Consensus 80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~----~~~~s-~~~~~g~s~~~~~~lekEl~~l~~~lqqes~l 150 (388)
...-+++++.+--.++.+-..+...++-+...-. ..+.. +...++.-.+.+..++.++..+...+......
T Consensus 253 ~~~~L~~l~~~~~~~~~~l~~~~~~~~~~~~~~~Cp~C~~~~~~~~~~~~~l~d~i~~l~~~l~~l~~~i~~~~~~ 328 (562)
T PHA02562 253 PSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI 328 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2223455566666666666777777666642000 00111 12244566778888999999888888754443
No 104
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=50.71 E-value=5e+02 Score=30.89 Aligned_cols=116 Identities=18% Similarity=0.266 Sum_probs=59.0
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhH--------------HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDA--------------AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEK 135 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDa--------------AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lek 135 (388)
-.||-.++-...-.|.+..++....-... ...|..+..+..+.-+....+- + .++.+...++-
T Consensus 104 ~~~Leq~l~~~~~~L~~~q~~l~~~~~~~~~~~~~l~~~pq~~~~~~~~l~~i~~~L~~~~~~~~-~--l~~a~~~~lqa 180 (1109)
T PRK10929 104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNT-P--LAQAQLTALQA 180 (1109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHhhchhhHHHHHHHHHHHHHHHhCCCCCCC-c--ccHHHHHHHHH
Confidence 35566666666666666555544433333 4566677777766633222222 2 45556666777
Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHhhhchh
Q 047021 136 ELATLKSQLQQESQ--LRQQERKQLT-----EEQARISVLMSEKQELEEKLVMFSRKASE 188 (388)
Q Consensus 136 El~~l~~~lqqes~--lrqqeqqrla-----~Eq~~~a~l~aEkqeLeekl~~~~~~~se 188 (388)
|.+-++.+...... +-...++-|. --..++..+..+.+.|.+.+..-..+-++
T Consensus 181 e~~~l~~~~~~l~~~l~s~~~~~~L~~~q~dl~~~~~~~l~~~~~~Lq~~in~kR~~~se 240 (1109)
T PRK10929 181 ESAALKALVDELELAQLSANNRQELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAE 240 (1109)
T ss_pred HHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77666665443222 1112222222 22334455666666666666654444443
No 105
>PRK10246 exonuclease subunit SbcC; Provisional
Probab=50.58 E-value=4.5e+02 Score=30.35 Aligned_cols=25 Identities=12% Similarity=0.197 Sum_probs=19.9
Q ss_pred cchhhHHHHHHHHhHHHHHHHHHHh
Q 047021 199 FSVEDKEKLEKQLHDMAVVVERLES 223 (388)
Q Consensus 199 ~S~eDkekle~Ql~dma~~VeRLE~ 223 (388)
++..+.+.++..+.+.-..+..+..
T Consensus 774 ~~~~~~~~l~~~i~~~~~~~~~~~~ 798 (1047)
T PRK10246 774 LDEETLTQLEQLKQNLENQRQQAQT 798 (1047)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4668888888888888887777766
No 106
>cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins. Chibby(Cby) is a well-conserved nuclear protein that functions as part of the Wnt/beta-catenin signaling pathway. Specifically, Cby binds directly to beta-catenin by interacting with its central region, which harbors armadillo repeats. Cby-beta-catenin interactions may also involve 14-3-3 proteins. By competing with other binding partners of beta-catenin, the Tcf/Lef transcription factors, Cby inhibits transcriptional activation. Cby has been shown to play a role in adipocyte differentiation. The C-terminal region of Cby appears to contain an alpha-helical coiled-coil motif.
Probab=50.36 E-value=17 Score=31.89 Aligned_cols=25 Identities=40% Similarity=0.658 Sum_probs=21.8
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQV 29 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~ 29 (388)
.+..|.++|..|++||++|.-||+-
T Consensus 73 e~~rlkkk~~~LeEENNlLklKiev 97 (108)
T cd07429 73 EVLRLKKKNQQLEEENNLLKLKIEV 97 (108)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667889999999999999998864
No 107
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=49.89 E-value=3.9e+02 Score=29.43 Aligned_cols=139 Identities=24% Similarity=0.343 Sum_probs=83.2
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
.+|.+|......++.+.+++...+.....+..........++++.. +++-+.++--+.|..+ +.|-+.+-....|
T Consensus 335 ~~l~~l~~~i~~~~~~~~~l~~~~~q~~~e~~~~~~~~~~le~~~~--l~~k~~~lL~d~e~ni---~kL~~~v~~s~~r 409 (594)
T PF05667_consen 335 EQLDELESQIEELEAEIKMLKSSLKQLEEELEEKEAENEELEEELK--LKKKTVELLPDAEENI---AKLQALVEASEQR 409 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhcCcHHHH---HHHHHHHHHHHHH
Confidence 4566666666666777777777777666666666666666666554 5555555555555555 4455555555555
Q ss_pred hHHHHH--HHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 84 LKEAEE--EQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERK 156 (388)
Q Consensus 84 lkEAEE--EQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqq 156 (388)
+.+-.. |.||+ -|..|+..|....... -+.... -.+.++.+..++..+..+++.-..+..+...
T Consensus 410 l~~L~~qWe~~R~-----pL~~e~r~lk~~~~~~-~~e~~~---~~~~ik~~r~~~k~~~~e~~~Kee~~~qL~~ 475 (594)
T PF05667_consen 410 LVELAQQWEKHRA-----PLIEEYRRLKEKASNR-ESESKQ---KLQEIKELREEIKEIEEEIRQKEELYKQLVK 475 (594)
T ss_pred HHHHHHHHHHHHh-----HHHHHHHHHHHHHhhc-chHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 554332 44554 5677777777666322 222211 3466777888888888777776555554333
No 108
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=49.47 E-value=2.3e+02 Score=26.73 Aligned_cols=103 Identities=23% Similarity=0.265 Sum_probs=73.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--chhhhhhhhhhhhcchh-hHHHHHHHHhHHHHHHHHHHhhHHHHHH
Q 047021 154 ERKQLTEEQARISVLMSEKQELEEKLVMFSRK--ASEEVSNAAASKAFSVE-DKEKLEKQLHDMAVVVERLESSRQKLLM 230 (388)
Q Consensus 154 eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~--~se~~~~~a~~k~~S~e-Dkekle~Ql~dma~~VeRLE~SRqKLL~ 230 (388)
-...|..--.+...+.+...+|++.|..+... ..+...+.++.+--..+ .-..|..+|.+.-.-.+-.|.+=.+|=.
T Consensus 125 ~E~~Le~aEeR~e~~E~ki~eLE~el~~~~~~lk~lE~~~~~~~~re~~~e~~i~~L~~~lkeaE~Rae~aE~~v~~Le~ 204 (237)
T PF00261_consen 125 LEQELERAEERAEAAESKIKELEEELKSVGNNLKSLEASEEKASEREDEYEEKIRDLEEKLKEAENRAEFAERRVKKLEK 204 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556677778889999999999999998865 33444444444333333 3567788888887778888888888888
Q ss_pred HhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021 231 EIDNQSSEIERLFEENSSLSSSYQES 256 (388)
Q Consensus 231 EID~QS~EIErLf~ENs~Lsss~qea 256 (388)
+||.--.+|+..=..+..++..+-.+
T Consensus 205 ~id~le~eL~~~k~~~~~~~~eld~~ 230 (237)
T PF00261_consen 205 EIDRLEDELEKEKEKYKKVQEELDQT 230 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 88888888887777776666655444
No 109
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=48.71 E-value=5.8 Score=42.89 Aligned_cols=100 Identities=22% Similarity=0.333 Sum_probs=0.0
Q ss_pred hHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH--HHHHHHhhh-HhhHHHHHHHHHHHHHHHhcC---CCC---CCC
Q 047021 51 SLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK--EAEEEQYRA-EEDAAALRAELNSLQQQAMDG---SLG---GIS 121 (388)
Q Consensus 51 slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk--EAEEEQyRA-EEDaAaLRAEln~lQqq~~~~---~~s---~~~ 121 (388)
.|++-..+...+-+.+..-||++- -||..-.|+. |++=+-||. -+|...++.++..|.-+-+.. ... -+.
T Consensus 278 ~L~q~~~eL~~~A~~a~~LrDElD-~lR~~a~r~~klE~~ve~YKkKLed~~~lk~qvk~Lee~N~~l~e~~~~LEeel~ 356 (713)
T PF05622_consen 278 ELRQENEELQAEAREARALRDELD-ELREKADRADKLENEVEKYKKKLEDLEDLKRQVKELEEDNAVLLETKAMLEEELK 356 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444466543 2443333332 444466764 367888888888887765322 000 000
Q ss_pred CCCCChhhHHHHHHHHHHHHHHHHHHHHHH
Q 047021 122 SMGISPDHVQTLEKELATLKSQLQQESQLR 151 (388)
Q Consensus 122 ~~g~s~~~~~~lekEl~~l~~~lqqes~lr 151 (388)
.++---.++..+++++.+++..+-.+...+
T Consensus 357 ~~~~~~~qle~~k~qi~eLe~~l~~~~~~~ 386 (713)
T PF05622_consen 357 KARALKSQLEEYKKQIQELEQKLSEESRRA 386 (713)
T ss_dssp ------------------------------
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 111223445556666666666665554443
No 110
>KOG0964 consensus Structural maintenance of chromosome protein 3 (sister chromatid cohesion complex Cohesin, subunit SMC3) [Cell cycle control, cell division, chromosome partitioning]
Probab=48.41 E-value=5.5e+02 Score=30.74 Aligned_cols=169 Identities=21% Similarity=0.308 Sum_probs=92.6
Q ss_pred hhHHHHHHHHHHHhHH--HHHHHhhhHhh-----HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKE--AEEEQYRAEED-----AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQ 143 (388)
Q Consensus 71 eDl~AQLR~~kkRlkE--AEEEQyRAEED-----aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~ 143 (388)
+|+.++++.+...|+- +|-+|..|++- --.|-=+++.||+|...|.-+-- .....++.++-+|.+-+-+
T Consensus 268 ~~~~~~i~ele~~l~~l~~ekeq~~a~~t~~~k~kt~lel~~kdlq~~i~~n~q~r~----~~l~~l~~~~~ki~e~~~E 343 (1200)
T KOG0964|consen 268 EDLKCEIKELENKLTNLREEKEQLKARETKISKKKTKLELKIKDLQDQITGNEQQRN----LALHVLQKVKDKIEEKKDE 343 (1200)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHhhhhhhhhh----hHHHHHHHHHHHHHHHHHH
Confidence 3455555555544432 35566666644 33456678888888866543321 1223344455555544444
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHH-HHHHHhHHHHHHHHHH
Q 047021 144 LQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEK-LEKQLHDMAVVVERLE 222 (388)
Q Consensus 144 lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkek-le~Ql~dma~~VeRLE 222 (388)
|..---..+--...-...-.+++.|....+.|=.|- . ..+.| -|+++|.+ +.++|.++...+--..
T Consensus 344 L~~I~Pky~~l~~ee~~~~~rl~~l~~~~~~l~~Kq-------g-r~sqF-----ssk~eRDkwir~ei~~l~~~i~~~k 410 (1200)
T KOG0964|consen 344 LSKIEPKYNSLVDEEKRLKKRLAKLEQKQRDLLAKQ-------G-RYSQF-----SSKEERDKWIRSEIEKLKRGINDTK 410 (1200)
T ss_pred HHHhhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHhh-------c-ccccc-----CcHHHHHHHHHHHHHHHHHHHhhhh
Confidence 433111111111111222333444444433332221 0 11122 25566665 6788888888888888
Q ss_pred hhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021 223 SSRQKLLMEIDNQSSEIERLFEENSSLSSSYQES 256 (388)
Q Consensus 223 ~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea 256 (388)
..++.|=+||+.=-.+.+..-++=..|.+|..+.
T Consensus 411 e~e~~lq~e~~~~e~~l~~~~e~i~~l~~si~e~ 444 (1200)
T KOG0964|consen 411 EQENILQKEIEDLESELKEKLEEIKELESSINET 444 (1200)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhh
Confidence 8888888888888888888878777787777744
No 111
>PF01724 DUF29: Domain of unknown function DUF29; InterPro: IPR002636 This entry is represented by Ralstonia phage RSS1, Orf11. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of various hypothetical proteins from cyanobacteria, none of which are functionally described. The aligned region is approximately 120-140 amino acids long corresponding to almost the entire length of the proteins in the family.; PDB: 3FCN_A.
Probab=48.22 E-value=1.9e+02 Score=25.71 Aligned_cols=45 Identities=24% Similarity=0.368 Sum_probs=29.3
Q ss_pred HHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 227 KLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ 273 (388)
Q Consensus 227 KLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ 273 (388)
-=..+|+.+-.+|+++|.+|.+|..-+.++ +..-|.+-++++.+.
T Consensus 69 sW~~tI~~~R~~i~~~l~~sPSLk~~l~~~--l~~~Y~~A~~~a~~e 113 (139)
T PF01724_consen 69 SWRATIRNQRRQIEDLLEDSPSLKNYLEEI--LEEAYQDARKLAARE 113 (139)
T ss_dssp HHHHHHHHHHHHHHHH----GGGGGG--HH--HHHHHHHH-HHHHHH
T ss_pred hHHHHHHHHHHHHHHHhcccccHHHHHHHH--HHHHHHHHHHHHHHH
Confidence 345699999999999999999999988887 566676665555543
No 112
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=48.05 E-value=2.9e+02 Score=27.31 Aligned_cols=96 Identities=22% Similarity=0.338 Sum_probs=55.0
Q ss_pred hhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHH
Q 047021 25 SKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAE 104 (388)
Q Consensus 25 sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAE 104 (388)
-+|++-..+++++.-+.+ +.+++|+.+--|.+++- .-+-.++..+..-+.+=-+=+.|-..+|+.
T Consensus 10 ~~iq~lD~e~~rl~~~~~--------~~~~~l~k~~~e~e~~~-------~~~~~~~~e~e~le~qv~~~e~ei~~~r~r 74 (239)
T COG1579 10 LAIQKLDLEKDRLEPRIK--------EIRKALKKAKAELEALN-------KALEALEIELEDLENQVSQLESEIQEIRER 74 (239)
T ss_pred HHHHHHHHHHHHHHHhhh--------hhHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555554443 56777755555555443 223334444444444444556677777777
Q ss_pred HHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021 105 LNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL 144 (388)
Q Consensus 105 ln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l 144 (388)
++..+-..+ .+-...+..+|..|+..++.+.
T Consensus 75 ~~~~e~kl~---------~v~~~~e~~aL~~E~~~ak~r~ 105 (239)
T COG1579 75 IKRAEEKLS---------AVKDERELRALNIEIQIAKERI 105 (239)
T ss_pred HHHHHHHHh---------ccccHHHHHHHHHHHHHHHHHH
Confidence 777777662 2225677888888887655443
No 113
>PRK14154 heat shock protein GrpE; Provisional
Probab=48.02 E-value=93 Score=29.97 Aligned_cols=56 Identities=20% Similarity=0.259 Sum_probs=42.4
Q ss_pred HHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH------------HHHhhhhHHHHHHHHhhHHHHHHH
Q 047021 323 FLALKGQLAKEQSRAEALSAQVLQLSAQLQQT------------TQAYNGLVRLYKPVLRNIESSLIK 378 (388)
Q Consensus 323 ~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra------------~QA~n~L~Rly~PvL~~IEsrLik 378 (388)
+..|+.++.....+.+.|....+-+.|+..-. .-|.-+|.+-+.||+-|+|.-|-.
T Consensus 54 ~~~l~~el~~le~e~~elkd~~lRl~ADfeNyRKR~~kE~e~~~~~a~e~~~~~LLpVlDnLeRAL~~ 121 (208)
T PRK14154 54 REKLEGQLTRMERKVDEYKTQYLRAQAEMDNLRKRIEREKADIIKFGSKQLITDLLPVADSLIHGLES 121 (208)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhHHHHHHhc
Confidence 56677777777777777777777777776533 346778999999999999988754
No 114
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=47.89 E-value=33 Score=33.38 Aligned_cols=42 Identities=33% Similarity=0.341 Sum_probs=34.5
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL 44 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~ 44 (388)
++|=.-+..||.+||++++++..++.....++++|+.+=-.|
T Consensus 85 tsQRDRFR~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~kL 126 (248)
T PF08172_consen 85 TSQRDRFRQRNAELEEELRKQQQTISSLRREVESLRADNVKL 126 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455567889999999999999999999888888888874433
No 115
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=47.84 E-value=3.8e+02 Score=28.67 Aligned_cols=260 Identities=20% Similarity=0.248 Sum_probs=0.0
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH----hhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE----LEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRT 79 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd----~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~ 79 (388)
.|+.+|..|+.+|..+.=.+.- ...+.++..|.+++.+ +..-.+...+..++.|.-.=|..--.=+--..--..
T Consensus 230 ~ql~el~~gy~~m~~~gy~~~~--~~i~~~i~~l~~~i~~~~~~l~~l~l~~~~~~~~~i~~~Id~Lyd~lekE~~A~~~ 307 (569)
T PRK04778 230 DQLQELKAGYRELVEEGYHLDH--LDIEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKY 307 (569)
T ss_pred HHHHHHHHHHHHHHHcCCCCCC--CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhh---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDH---VQTLEKELATLKSQLQQESQLRQQERK 156 (388)
Q Consensus 80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~---~~~lekEl~~l~~~lqqes~lrqqeqq 156 (388)
..+....-...=..+.+..-.|..|+..+++ +++.+.+. .+.+++++..+..++..-..-......
T Consensus 308 vek~~~~l~~~l~~~~e~~~~l~~Ei~~l~~-----------sY~l~~~e~~~~~~lekeL~~Le~~~~~~~~~i~~~~~ 376 (569)
T PRK04778 308 VEKNSDTLPDFLEHAKEQNKELKEEIDRVKQ-----------SYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEI 376 (569)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH-----------ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh-hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHH--HHhhHHHHHHHhh
Q 047021 157 QLTEEQARISVLMSEKQELEEKLVMFSRKASE-EVSNAAASKAFSVEDKEKLEKQLHDMAVVVER--LESSRQKLLMEID 233 (388)
Q Consensus 157 rla~Eq~~~a~l~aEkqeLeekl~~~~~~~se-~~~~~a~~k~~S~eDkekle~Ql~dma~~VeR--LE~SRqKLL~EID 233 (388)
....-+.....+.....++++....+...-.. ...+.-+...+ .++...|+.+.+.|++ |-+.-+..+.-++
T Consensus 377 ~ysel~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL-----~~~~~~L~~ikr~l~k~~lpgip~~y~~~~~ 451 (569)
T PRK04778 377 AYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKL-----ERYRNKLHEIKRYLEKSNLPGLPEDYLEMFF 451 (569)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHcCCCCCcHHHHHHHH
Q ss_pred hhhHHHHHHhhh------hhhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 234 NQSSEIERLFEE------NSSLSS-SYQESMGIAKHWENQVKDCLKQNEELRGIL 281 (388)
Q Consensus 234 ~QS~EIErLf~E------Ns~Lss-s~qea~~iaaqWEnQvqDclkQNeeLr~lL 281 (388)
.=+.+|++|-.+ |-.--. -|.++..--..-+.|..|....-.-+..++
T Consensus 452 ~~~~~i~~l~~~L~~g~VNm~ai~~e~~e~~~~~~~L~~q~~dL~~~a~~lE~~I 506 (569)
T PRK04778 452 EVSDEIEALAEELEEKPINMEAVNRLLEEATEDVETLEEETEELVENATLTEQLI 506 (569)
T ss_pred HHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 116
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=47.53 E-value=1.7e+02 Score=30.55 Aligned_cols=95 Identities=23% Similarity=0.197 Sum_probs=44.1
Q ss_pred hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh---hHHHHhhhhhhhhhhHHHh--hhhHHHHHHHHHHHhHHH
Q 047021 13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP---SLRKALKDVAMEKDAAVVA--REDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~---slrkaL~~v~mEkDaA~va--reDl~AQLR~~kkRlkEA 87 (388)
..+|+.+.+.|..++.....+++.+..++.=++.+..+ ...+++. .=..|.+-.. -+-+..++..+..++.++
T Consensus 73 ~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (525)
T TIGR02231 73 LAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAK--RNEPDLKEWFQAFDFNGSEIERLLTEDREA 150 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc--cCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455555555555555555666666666555544321 1222110 0011211111 122334445555555555
Q ss_pred HHHHhhhHhhHHHHHHHHHHHH
Q 047021 88 EEEQYRAEEDAAALRAELNSLQ 109 (388)
Q Consensus 88 EEEQyRAEEDaAaLRAEln~lQ 109 (388)
+.+.-..++.-+.|+.+|+.+.
T Consensus 151 ~~~~~~~~~~l~~l~~~l~~l~ 172 (525)
T TIGR02231 151 ERRIRELEKQLSELQNELNALL 172 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 5555555555556666665554
No 117
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=47.49 E-value=2.7e+02 Score=26.89 Aligned_cols=106 Identities=21% Similarity=0.335 Sum_probs=64.3
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh------hHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP------SLRKALKDVAMEKDAAVVAREDLSAQLRT 79 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~------slrkaL~~v~mEkDaA~vareDl~AQLR~ 79 (388)
-++|+.++.++++++.+-+..+...+....-|...+...+..... .+...-.++..+.-.-.-.|+.|.+.++-
T Consensus 7 k~Ele~rL~q~eee~~~a~~~L~e~e~~a~~Leek~k~aeeea~~Le~k~~eaee~~~rL~~~~~~~~eEk~~Le~e~~e 86 (246)
T PF00769_consen 7 KQELEERLRQMEEEMRRAQEALEESEETAEELEEKLKQAEEEAEELEQKRQEAEEEKQRLEEEAEMQEEEKEQLEQELRE 86 (246)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 367889999999999999999998888888888887755543322 12222222222222222236677777777
Q ss_pred HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHH
Q 047021 80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQ 111 (388)
Q Consensus 80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq 111 (388)
....+..-+++--+.+..+..|+.++-.-+..
T Consensus 87 ~~~~i~~l~ee~~~ke~Ea~~lq~el~~ar~~ 118 (246)
T PF00769_consen 87 AEAEIARLEEESERKEEEAEELQEELEEARED 118 (246)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77777666666667777777777776665553
No 118
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=46.88 E-value=38 Score=26.74 Aligned_cols=38 Identities=21% Similarity=0.360 Sum_probs=31.7
Q ss_pred hhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021 12 KNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV 49 (388)
Q Consensus 12 ~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v 49 (388)
|..++|.+...+.+.+.+...+++.+-+.++++++|+.
T Consensus 1 Ri~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~envk 38 (55)
T PF05377_consen 1 RIDELENELPRIESSINTVKKENEEISESVEKIEENVK 38 (55)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34678888888888899999999999999998888873
No 119
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=46.49 E-value=43 Score=27.06 Aligned_cols=38 Identities=16% Similarity=0.204 Sum_probs=23.1
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI 41 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L 41 (388)
.+|-.|=..|.+|.++|..|..+......++..|....
T Consensus 7 ~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekn 44 (65)
T TIGR02449 7 AQVEHLLEYLERLKSENRLLRAQEKTWREERAQLLEKN 44 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666666666666666666666666665555443
No 120
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown]
Probab=45.99 E-value=6e+02 Score=30.48 Aligned_cols=103 Identities=24% Similarity=0.302 Sum_probs=59.6
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh--------------------------------h--hhHH
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT--------------------------------V--PSLR 53 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~--------------------------------v--~slr 53 (388)
.|.--|+++++.++.|+.++++|-.-+.-|.+.|+|++.-. + |.-+
T Consensus 174 hL~velAdle~kir~LrqElEEK~enll~lr~eLddleae~~klrqe~~e~l~ea~ra~~yrdeldalre~aer~d~~yk 253 (1195)
T KOG4643|consen 174 HLEVELADLEKKIRTLRQELEEKFENLLRLRNELDDLEAEISKLRQEIEEFLDEAHRADRYRDELDALREQAERPDTTYK 253 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhhcCCCccc
Confidence 45667788888888888888877766666666666543211 1 3344
Q ss_pred HHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 54 KALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 54 kaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
.+|-|+-+=||++.-+|+|=.+ ...-|-+=+++=.-|||--|-+.+-.+|=.++|+.
T Consensus 254 erlmDs~fykdRveelkedN~v--LleekeMLeeQLq~lrarse~~tleseiiqlkqkl 310 (1195)
T KOG4643|consen 254 ERLMDSDFYKDRVEELKEDNRV--LLEEKEMLEEQLQKLRARSEGATLESEIIQLKQKL 310 (1195)
T ss_pred hhhhhhHHHHHHHHHHHhhhHH--HHHHHHHHHHHHHHHHhccccCChHHHHHHHHHHH
Confidence 5555555566666666666221 11222233334445666555566666666666665
No 121
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=45.78 E-value=1.6e+02 Score=29.81 Aligned_cols=65 Identities=23% Similarity=0.393 Sum_probs=51.6
Q ss_pred hhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh-------hhhHHHHHHHHHHHhHHH
Q 047021 23 LSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA-------REDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 23 L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va-------reDl~AQLR~~kkRlkEA 87 (388)
|...++..--+.+.|...+|+++..-|..+-.-|+.+.++|-+..++ -.|+.|-+-.+++|++--
T Consensus 162 l~~eLqkr~~~v~~l~~q~~k~~~~qv~~in~qlErLRL~krrlQl~g~Ld~~~q~~~~ae~seLq~r~~~l 233 (289)
T COG4985 162 LERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLEKRRLQLNGQLDDEFQQHYVAEKSELQKRLAQL 233 (289)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666677889999999999999999999999999999877765 566777777777777543
No 122
>PF12795 MscS_porin: Mechanosensitive ion channel porin domain
Probab=45.23 E-value=2.7e+02 Score=26.20 Aligned_cols=186 Identities=23% Similarity=0.259 Sum_probs=92.4
Q ss_pred HHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 76 QLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQER 155 (388)
Q Consensus 76 QLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeq 155 (388)
+....+++..+-..---.|-.....++++|+.+... ...+... +.+.| +..|+..|....++|.....-.++..
T Consensus 32 ~~~~~~~~~~~~~~~i~~aP~~~~~l~~~l~~l~~~-~~~~~~~--~~~~s---~~eLeq~l~~~~~~L~~~q~~l~~~~ 105 (240)
T PF12795_consen 32 EIKKQKKRAAEYQKQIDQAPKEIRELQKELEALKSQ-DAPSKEI--LANLS---LEELEQRLSQEQAQLQELQEQLQQEN 105 (240)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhcc-ccccccC--cccCC---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444443333344555677788888888766 1112111 22223 45677777777777777666555555
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhh
Q 047021 156 KQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQ 235 (388)
Q Consensus 156 qrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~Q 235 (388)
-++..=+.+...+...+-+....+..+...-....++. ...++...+-.|...+.-+-.-+.-|
T Consensus 106 ~~l~~~~~~p~~aq~~l~~~~~~l~ei~~~L~~~~~~~--~~~l~~a~~~~l~ae~~~l~~~~~~l-------------- 169 (240)
T PF12795_consen 106 SQLIEIQTRPERAQQQLSEARQRLQEIRNQLQNLPPNG--ESPLSEAQRWLLQAELAALEAQIEML-------------- 169 (240)
T ss_pred HHHHHHHccHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cchhhHHHHHHHHHHHHHHHHHHHHH--------------
Confidence 55554444444444444444444444333311100110 13444444444444422222222221
Q ss_pred hHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q 047021 236 SSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQAS 290 (388)
Q Consensus 236 S~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~ 290 (388)
+..---|+++.+=|+-=+......-++... +..-|++.|+..|...+-
T Consensus 170 ----e~el~s~~~rq~L~~~qrdl~~~~~~~l~~---~l~~Lq~~ln~~R~~eae 217 (240)
T PF12795_consen 170 ----EQELLSNNNRQELLQLQRDLLKARIQRLQQ---QLQALQNLLNQKRRQEAE 217 (240)
T ss_pred ----HHHHHCcHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 222234666777777667766665555543 445678888887776553
No 123
>PRK04863 mukB cell division protein MukB; Provisional
Probab=45.08 E-value=6.7e+02 Score=30.74 Aligned_cols=27 Identities=19% Similarity=0.361 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhchh
Q 047021 162 QARISVLMSEKQELEEKLVMFSRKASE 188 (388)
Q Consensus 162 q~~~a~l~aEkqeLeekl~~~~~~~se 188 (388)
......+...++.++++...+.+....
T Consensus 462 E~kL~~lea~leql~~~~~~l~~~~Gk 488 (1486)
T PRK04863 462 EQKLSVAQAAHSQFEQAYQLVRKIAGE 488 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence 334455666667777777666655444
No 124
>PF07989 Microtub_assoc: Microtubule associated; InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=45.03 E-value=64 Score=26.28 Aligned_cols=53 Identities=28% Similarity=0.271 Sum_probs=38.3
Q ss_pred hhHHHHHHhhhhhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021 235 QSSEIERLFEENSSLSSSYQESMGI-AKHWENQVKDCLKQNEELRGILEKLRTE 287 (388)
Q Consensus 235 QS~EIErLf~ENs~Lsss~qea~~i-aaqWEnQvqDclkQNeeLr~lLeklR~E 287 (388)
|...|++|--||=+|-=-+--...- ...-...+.+++++|.+|++-++.|+.|
T Consensus 5 qe~~i~~L~KENF~LKLrI~fLee~l~~~~~~~~~~~~keNieLKve~~~L~~e 58 (75)
T PF07989_consen 5 QEEQIDKLKKENFNLKLRIYFLEERLQKLGPESIEELLKENIELKVEVESLKRE 58 (75)
T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHH
Confidence 6677999999998875332222111 1334677899999999999999988876
No 125
>PF04880 NUDE_C: NUDE protein, C-terminal conserved region; InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=45.01 E-value=27 Score=32.50 Aligned_cols=43 Identities=35% Similarity=0.537 Sum_probs=24.9
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHH---HhhHHHHHHhhHhhhhh
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKE---VESESLQTRINELEQNT 48 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~---~~~~~L~~~Lnd~e~n~ 48 (388)
|.+++.|...-=|.|-+|.++|++|+ ++++-|.+++-||.+-.
T Consensus 2 LeD~EsklN~AIERnalLE~ELdEKE~L~~~~QRLkDE~RDLKqEl 47 (166)
T PF04880_consen 2 LEDFESKLNQAIERNALLESELDEKENLREEVQRLKDELRDLKQEL 47 (166)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH--------------
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67899999999999999999999887 44444555555554433
No 126
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=44.56 E-value=2.4e+02 Score=30.47 Aligned_cols=23 Identities=22% Similarity=0.377 Sum_probs=14.6
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSK 26 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk 26 (388)
.++..|...|..|-.+|..|+.+
T Consensus 73 ~~~~~l~~~N~~l~~eN~~L~~r 95 (472)
T TIGR03752 73 KRLAKLISENEALKAENERLQKR 95 (472)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 45666666677777777666443
No 127
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=44.28 E-value=5.9e+02 Score=29.89 Aligned_cols=50 Identities=36% Similarity=0.425 Sum_probs=29.6
Q ss_pred hhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 64 DAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 64 DaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
..+.-+..++-.++..+++.-..|+.+--.++....-+++++..|+.+..
T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~l~~~L~ 535 (1201)
T PF12128_consen 486 NQAQQAVEELQAEEQELRKERDQAEEELRQARRELEELRAQIAELQRQLD 535 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 33333344555555555555555555555555666667777777777774
No 128
>PF15294 Leu_zip: Leucine zipper
Probab=43.64 E-value=3.6e+02 Score=27.28 Aligned_cols=174 Identities=18% Similarity=0.223 Sum_probs=99.1
Q ss_pred hHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhh--------------hhhhh--hhHHHHHHHHHHHH--
Q 047021 203 DKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEE--------------NSSLS--SSYQESMGIAKHWE-- 264 (388)
Q Consensus 203 Dkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~E--------------Ns~Ls--ss~qea~~iaaqWE-- 264 (388)
.+.|.+++|...-..+.=|.+|| |.|---=+.||..+|.. |..-. --++...+-|-.|-
T Consensus 2 ~r~kr~~~Lk~Vds~F~Dlk~sr---L~e~t~T~~EV~~~ldgL~~~v~~~vesEL~N~~htn~lllrql~~qAek~~lk 78 (278)
T PF15294_consen 2 ARSKREQHLKEVDSCFQDLKSSR---LREDTYTSDEVTEMLDGLQVVVKSEVESELINTSHTNVLLLRQLFSQAEKWYLK 78 (278)
T ss_pred chhHHHHHHHHHHHhhhHHHHHH---HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhHHHHHHHHHHHHHHHHHH
Confidence 36678888888877788888877 44522223365555533 22111 23677888889994
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHH
Q 047021 265 NQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQV 344 (388)
Q Consensus 265 nQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv 344 (388)
=|+-=.-..|.+|-.-.++...-+++........+.+ +.=+|-+++ .|++....||.+|+.+..+-.+|.-.+..+.
T Consensus 79 l~~diselEn~eLLe~i~~~E~~~~~~~~~~~~~~~~-~KL~pl~e~--g~~~ll~kEi~rLq~EN~kLk~rl~~le~~a 155 (278)
T PF15294_consen 79 LQTDISELENRELLEQIAEFEKQEFTSSFKPNQETSK-PKLEPLNES--GGSELLNKEIDRLQEENEKLKERLKSLEKQA 155 (278)
T ss_pred hcccHHHHHHHHHHHHHHHHHHhhhcccCCccccccc-ccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333445555554444444444433311111111 111122211 1256677789999999888888888777776
Q ss_pred HHHHHHHHHHHHHhhhhHH------------HHHHHHhhHHHHHHHhhhc
Q 047021 345 LQLSAQLQQTTQAYNGLVR------------LYKPVLRNIESSLIKMKQD 382 (388)
Q Consensus 345 ~qLsAeL~ra~QA~n~L~R------------ly~PvL~~IEsrLikmkq~ 382 (388)
....-+=.......+.|.. .--+-+.++|+-+..||-+
T Consensus 156 t~~l~Ek~kl~~~L~~lq~~~~~~~~k~~~~~~~q~l~dLE~k~a~lK~e 205 (278)
T PF15294_consen 156 TSALDEKSKLEAQLKELQDEQGDQKGKKDLSFKAQDLSDLENKMAALKSE 205 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccchhhHHHHHHHHHHH
Confidence 6666555555555555554 2234577788888888744
No 129
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=43.46 E-value=77 Score=25.26 Aligned_cols=40 Identities=25% Similarity=0.361 Sum_probs=27.7
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN 47 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n 47 (388)
.++..+..+-..|..+.++|++.++.|..|...+..++..
T Consensus 15 ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e 54 (61)
T PF08826_consen 15 AIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKE 54 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666666777777777777777777777777766543
No 130
>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning]
Probab=42.96 E-value=6.7e+02 Score=30.13 Aligned_cols=226 Identities=24% Similarity=0.304 Sum_probs=135.1
Q ss_pred hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh--h-hHHHHhhhhhhhhhhHHHhhhhHHHHHH----------H
Q 047021 13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV--P-SLRKALKDVAMEKDAAVVAREDLSAQLR----------T 79 (388)
Q Consensus 13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v--~-slrkaL~~v~mEkDaA~vareDl~AQLR----------~ 79 (388)
.-++|..+.++...+.....++..+..++|.-+...+ | ...|.++++ ..-|.-+--++..++..+ -
T Consensus 222 LfhvE~~i~k~~~els~~~~ei~~~~~~~d~~e~ei~~~k~e~~ki~re~-~~~Dk~i~~ke~~l~erp~li~~ke~~~~ 300 (1141)
T KOG0018|consen 222 LFHVEACIEKANDELSRLNAEIPKLKERMDKKEREIRVRKKERGKIRREL-QKVDKKISEKEEKLAERPELIKVKENASH 300 (1141)
T ss_pred HhhhhhhHhhhhHHHHHHhhhhHHHHhhhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhHHhhcchhhcc
Confidence 4467888888888888888888899999987766655 3 234555555 355666666777777621 2
Q ss_pred HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH-------
Q 047021 80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ------- 152 (388)
Q Consensus 80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq------- 152 (388)
.++|+.+++-.=|-++.|+-++..++..+.-+.-+-+++ -..+++||. +.++.+=
T Consensus 301 ~k~rl~~~~k~i~~~kk~~~~~~~~ie~~ek~l~av~~~-----------~~~fekei~-------~~~q~rg~~lnl~d 362 (1141)
T KOG0018|consen 301 LKKRLEEIEKDIETAKKDYRALKETIERLEKELKAVEGA-----------KEEFEKEIE-------ERSQERGSELNLKD 362 (1141)
T ss_pred chhHHHHhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHH-----------HHHHHHHHH-------HHHhhccccCCcch
Confidence 468888888888888888888888888887776332221 123445544 4443221
Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh---hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHH
Q 047021 153 ---QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASE---EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQ 226 (388)
Q Consensus 153 ---qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se---~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRq 226 (388)
++-.||-.|....+ .++|.++-..--. +...- ..-+..++..+..+-.+|+|+.--|.
T Consensus 363 ~~~~ey~rlk~ea~~~~---------~~el~~ln~~~r~~~~~ld~~-------~~~~~elE~r~k~l~~sver~~~~~~ 426 (1141)
T KOG0018|consen 363 DQVEEYERLKEEACKEA---------LEELEVLNRNMRSDQDTLDHE-------LERRAELEARIKQLKESVERLDKRRN 426 (1141)
T ss_pred HHHHHHHHHHHHHhhhh---------HHHHHHHHHHHHHHHHHHhhH-------HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 24445554443322 3334433322100 11110 12356677777777888888888888
Q ss_pred HHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 227 KLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ 273 (388)
Q Consensus 227 KLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ 273 (388)
+|+.=|++-+..-+-+-.+=..|...|-.+.+-.+---.-+.+|+-|
T Consensus 427 ~L~~~i~s~~~~~~e~~~d~~~l~~~~~~~~~~~~e~n~eL~~~~~q 473 (1141)
T KOG0018|consen 427 KLAAKITSLSRSYEELKHDLDSLESLVSSAEEEPYELNEELVEVLDQ 473 (1141)
T ss_pred HHHHHHHHHHHHHHHHhhcHHHHHHHHhhhhhhHHHHHHHHHHHHHH
Confidence 88888877666666555555555555555544444333333333333
No 131
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=42.53 E-value=2.5e+02 Score=25.05 Aligned_cols=34 Identities=29% Similarity=0.402 Sum_probs=15.5
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQT 39 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~ 39 (388)
+.++++..+++..+...++.++.+...+...+.+
T Consensus 83 ~~~~~~~l~~l~~el~~l~~~~~~~~~~l~~~~~ 116 (191)
T PF04156_consen 83 LSELQQQLQQLQEELDQLQERIQELESELEKLKE 116 (191)
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444444444333
No 132
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=41.48 E-value=5.8e+02 Score=29.02 Aligned_cols=140 Identities=26% Similarity=0.315 Sum_probs=92.5
Q ss_pred HHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC----
Q 047021 218 VERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMS---- 293 (388)
Q Consensus 218 VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~---- 293 (388)
|-=|+.|- -|++.=--||-||-+|+.-|-+.+.|++.+-.--|.|+-|||-=...=|.-=..||.|-...+.
T Consensus 113 vs~Lk~sQ----vefE~~Khei~rl~Ee~~~l~~qlee~~rLk~iae~qleEALesl~~EReqk~~LrkEL~~~~~~~~~ 188 (717)
T PF09730_consen 113 VSVLKQSQ----VEFEGLKHEIKRLEEEIELLNSQLEEAARLKEIAEKQLEEALESLKSEREQKNALRKELDQHLNIESI 188 (717)
T ss_pred HHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccc
Confidence 65666653 5888888999999999999999999999999999999999985433333322245555443221
Q ss_pred --CCcccc----------ccccCCCCCC------------------c--------cccCC-----------CchhhhhHH
Q 047021 294 --VNDKDI----------LRDFSEPSKD------------------A--------VSQNG-----------SETHKNEFL 324 (388)
Q Consensus 294 --~~~~~~----------~g~~~e~~~d------------------g--------~~~~~-----------~~~~~te~l 324 (388)
.++..+ .+..++|..| | .+... |.-+.+||.
T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~lv~DLfSEl~~~Eiq 268 (717)
T PF09730_consen 189 SYLSNLAISLDGLKFSEDPRAATEPNNDDEEENGGLNGGPGLAKGNGDNRMSTPRKSESFSPAPSLVSDLFSELNLSEIQ 268 (717)
T ss_pred ccccchhhcccccccccccccccCCCCchhhhcchhhccchhcccccccccCCCCCCCCCCCCCcccchhhhhcchHHHH
Confidence 111111 0111122222 1 00000 334567999
Q ss_pred hhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021 325 ALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL 361 (388)
Q Consensus 325 ~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L 361 (388)
+||.+|..--.--..|..-+.-.-.+|..+..+....
T Consensus 269 KL~qQL~qve~EK~~L~~~L~e~Q~qLe~a~~als~q 305 (717)
T PF09730_consen 269 KLKQQLLQVEREKSSLLSNLQESQKQLEHAQGALSEQ 305 (717)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999988778889888888888888887776543
No 133
>PF05600 DUF773: Protein of unknown function (DUF773); InterPro: IPR008491 This family contains several eukaryotic sequences which are thought to be CDK5 activator-binding proteins, however, the function of this family is unknown.
Probab=41.16 E-value=4.6e+02 Score=28.20 Aligned_cols=114 Identities=20% Similarity=0.271 Sum_probs=74.5
Q ss_pred hhHHHHHHhhhhh-hhhhhHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccc
Q 047021 235 QSSEIERLFEENS-SLSSSYQESMGI-----------AKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRD 302 (388)
Q Consensus 235 QS~EIErLf~ENs-~Lsss~qea~~i-----------aaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~ 302 (388)
-|.+|-|+|+.++ -|...++-.+-. -+..+.|+.||-|...++..-....+.+-......- |+.|
T Consensus 96 ~W~~Iv~~yeK~n~~L~E~a~~L~r~v~YeIP~lkKqi~k~~q~~~d~~kk~~e~~~~~~~~~~~~~~~c~~l--GI~G- 172 (507)
T PF05600_consen 96 DWQEIVKLYEKDNLYLAEAAQILVRNVNYEIPALKKQIAKCQQQLEDLDKKEEELQRSAAEARERYKKACKQL--GIKG- 172 (507)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHh--CCcc-
Confidence 3889999998654 455555544322 357889999999999998887777777666655433 2222
Q ss_pred cCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHH--------HHHHHhhH
Q 047021 303 FSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRL--------YKPVLRNI 372 (388)
Q Consensus 303 ~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rl--------y~PvL~~I 372 (388)
. . ++.||..-..-.-.+=.+|.+.++.|..|.+-|..+..+ ..|+|+-|
T Consensus 173 -------------~-n-------ir~ELl~l~~~LP~~~~~i~~~i~~l~~aie~Y~~f~~~~~~~~~~~~Lp~L~~v 229 (507)
T PF05600_consen 173 -------------E-N-------IREELLELVKELPSLFDEIVEAISDLQEAIEYYQAFVEFVHDESESEVLPLLRFV 229 (507)
T ss_pred -------------c-h-------hHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccchhhHHHHHH
Confidence 1 1 344444433334444455667778899999999998886 56776443
No 134
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=40.88 E-value=13 Score=30.97 Aligned_cols=19 Identities=42% Similarity=0.625 Sum_probs=0.0
Q ss_pred hHhhHHHHHHHHHHHHHHH
Q 047021 94 AEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 94 AEEDaAaLRAEln~lQqq~ 112 (388)
|...+..++.++..|.++.
T Consensus 105 A~~~~~~l~~~~~~lk~~~ 123 (131)
T PF05103_consen 105 ARAEAERLREEIEELKRQA 123 (131)
T ss_dssp -------------------
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4455555555555555443
No 135
>PRK10698 phage shock protein PspA; Provisional
Probab=40.36 E-value=3.3e+02 Score=25.90 Aligned_cols=68 Identities=15% Similarity=0.262 Sum_probs=30.0
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL 144 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l 144 (388)
-+.|-.+++.++.++.++...+ +.-..|..-..-+.++- ...+|+ ..|.+..-+..||..+..+.++-
T Consensus 115 ~~~L~~~l~~L~~ki~eak~k~-----~~L~aR~~~A~a~~~~~-~~~~~~-~~~~a~~~f~rmE~ki~~~Ea~a 182 (222)
T PRK10698 115 LARMKKEIGELENKLSETRARQ-----QALMLRHQAASSSRDVR-RQLDSG-KLDEAMARFESFERRIDQMEAEA 182 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHhCC-CcchHHHHHHHHHHHHHHHHHHH
Confidence 3455555666666666554332 22222322222222221 111222 22334455666777666554443
No 136
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=40.03 E-value=3.4e+02 Score=30.58 Aligned_cols=87 Identities=26% Similarity=0.431 Sum_probs=61.9
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR 83 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR 83 (388)
..++.+.++...|+++|+.|...+.++..+++-|..+|....+..--..++ |.-+.+++.= ...+.++
T Consensus 422 ~~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~~~~~---------~rei~~~~~~---I~~L~~~ 489 (652)
T COG2433 422 KRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK---------DREIRARDRR---IERLEKE 489 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---------hHHHHHHHHH---HHHHHHH
Confidence 467888999999999999999999999999999999999888776644443 3344444432 3455666
Q ss_pred hHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021 84 LKEAEEEQYRAEEDAAALRAELNSLQ 109 (388)
Q Consensus 84 lkEAEEEQyRAEEDaAaLRAEln~lQ 109 (388)
|.|.... .--|...|+.+-
T Consensus 490 L~e~~~~-------ve~L~~~l~~l~ 508 (652)
T COG2433 490 LEEKKKR-------VEELERKLAELR 508 (652)
T ss_pred HHHHHHH-------HHHHHHHHHHHH
Confidence 6655433 234666666665
No 137
>KOG4403 consensus Cell surface glycoprotein STIM, contains SAM domain [General function prediction only]
Probab=39.95 E-value=5.5e+02 Score=28.25 Aligned_cols=137 Identities=26% Similarity=0.338 Sum_probs=89.2
Q ss_pred CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhH
Q 047021 125 ISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDK 204 (388)
Q Consensus 125 ~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDk 204 (388)
.+-+|+..|-+.|.+| |...+=-+--|++|..++..-.....||+-||.++... +..++. +
T Consensus 239 ~akehv~km~kdle~L----q~aEqsl~dlQk~Lekar~e~rnvavek~~lerkl~ea-----~rl~el----------r 299 (575)
T KOG4403|consen 239 KAKEHVNKMMKDLEGL----QRAEQSLEDLQKRLEKAREEQRNVAVEKLDLERKLDEA-----PRLSEL----------R 299 (575)
T ss_pred HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhhhchhhhhhhHHHHHhhh-----hhhhhh----------h
Confidence 4556677777776644 44444445678888888887788889999999988711 111111 1
Q ss_pred HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhh------hhhhHHHHHHHHHHH-HHHHHHHHHHHHHH
Q 047021 205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSS------LSSSYQESMGIAKHW-ENQVKDCLKQNEEL 277 (388)
Q Consensus 205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~------Lsss~qea~~iaaqW-EnQvqDclkQNeeL 277 (388)
+-.+.+ -+| |=|.-+-.+-.+.|+=|+-||+ |-.=+|=+-++-.+. +.+=|+|-||.-+-
T Consensus 300 eg~e~e------------~~r-kelE~lR~~L~kAEkele~nS~wsaP~aLQ~wLq~T~E~E~q~~~kkrqnaekql~~A 366 (575)
T KOG4403|consen 300 EGVENE------------TSR-KELEQLRVALEKAEKELEANSSWSAPLALQKWLQLTHEVEVQYYNKKRQNAEKQLKEA 366 (575)
T ss_pred cchhHH------------HHH-HHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH
Confidence 111222 111 1111122344678888999964 444566677776664 56889999999999
Q ss_pred HHHHHHHHhhhhccCC
Q 047021 278 RGILEKLRTEQASFMS 293 (388)
Q Consensus 278 r~lLeklR~EQa~~~~ 293 (388)
+.+-||++.-+.+...
T Consensus 367 ke~~eklkKKrssv~g 382 (575)
T KOG4403|consen 367 KEMAEKLKKKRSSVFG 382 (575)
T ss_pred HHHHHHHHHhhcchhe
Confidence 9999999998877764
No 138
>PF09403 FadA: Adhesion protein FadA; InterPro: IPR018543 FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. ; PDB: 3ETZ_B 3ETY_A 2GL2_B 3ETX_C 3ETW_A.
Probab=39.94 E-value=2.8e+02 Score=24.87 Aligned_cols=60 Identities=27% Similarity=0.374 Sum_probs=37.6
Q ss_pred CCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 047021 123 MGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRK 185 (388)
Q Consensus 123 ~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~ 185 (388)
++.+.+....++.++..|.+++++ |.+.|.+|-.++.....++.++..++.+.-+.+...
T Consensus 15 ~sfaA~~~~~v~~~l~~LEae~q~---L~~kE~~r~~~~k~~ae~a~~~L~~~~~~~~~i~e~ 74 (126)
T PF09403_consen 15 ISFAATATASVESELNQLEAEYQQ---LEQKEEARYNEEKQEAEAAEAELAELKELYAEIEEK 74 (126)
T ss_dssp --------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHcccchHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 333444467788888888887764 777788888877777777777776666665555433
No 139
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=39.67 E-value=6e+02 Score=28.62 Aligned_cols=43 Identities=21% Similarity=0.382 Sum_probs=30.3
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQ 46 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~ 46 (388)
..+.+++++...+.+.-.+|..+++.....-+.|.+|++.+-+
T Consensus 579 ~~L~~l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~vl~ 621 (717)
T PF10168_consen 579 KELQELQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDRVLQ 621 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666677777777777777777777777888888775544
No 140
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=39.05 E-value=4.1e+02 Score=26.59 Aligned_cols=88 Identities=25% Similarity=0.316 Sum_probs=44.3
Q ss_pred hHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 203 DKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILE 282 (388)
Q Consensus 203 Dkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLe 282 (388)
+-.+|++.-..+..-++.||.-+..|..||.....|.+.|=.+-...=..|..-.-.-.+-.+.......|..-....|+
T Consensus 51 el~~le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~ 130 (314)
T PF04111_consen 51 ELEKLEQEEEELLQELEELEKEREELDQELEELEEELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQYEYASNQLD 130 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444445455555566666666666666655555555544333333333333333333334444445555566666677
Q ss_pred HHHhhhhccC
Q 047021 283 KLRTEQASFM 292 (388)
Q Consensus 283 klR~EQa~~~ 292 (388)
+||. .|..
T Consensus 131 ~L~k--tNv~ 138 (314)
T PF04111_consen 131 RLRK--TNVY 138 (314)
T ss_dssp CHHT----TT
T ss_pred HHHh--cCch
Confidence 7766 5555
No 141
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=38.93 E-value=1.4e+02 Score=24.17 Aligned_cols=36 Identities=31% Similarity=0.353 Sum_probs=19.7
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTR 40 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~ 40 (388)
.+.-|+.++.+|.++|..|.+...+...+|.-|..+
T Consensus 19 ti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e 54 (72)
T PF06005_consen 19 TIALLQMENEELKEKNNELKEENEELKEENEQLKQE 54 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 344455556666666555555555555555555533
No 142
>PRK10884 SH3 domain-containing protein; Provisional
Probab=38.13 E-value=2.9e+02 Score=26.36 Aligned_cols=20 Identities=30% Similarity=0.474 Sum_probs=14.7
Q ss_pred hhhHHHHHHHHHHHHHHHHH
Q 047021 127 PDHVQTLEKELATLKSQLQQ 146 (388)
Q Consensus 127 ~~~~~~lekEl~~l~~~lqq 146 (388)
-+.+..|++|+++++.+|.+
T Consensus 92 ~~rlp~le~el~~l~~~l~~ 111 (206)
T PRK10884 92 RTRVPDLENQVKTLTDKLNN 111 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34577788888888877665
No 143
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=37.68 E-value=3.6e+02 Score=25.50 Aligned_cols=197 Identities=21% Similarity=0.297 Sum_probs=104.5
Q ss_pred hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQL 150 (388)
Q Consensus 71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~l 150 (388)
|++-..+..+...|++++..-..||...++|.--|..|-....... +.+......+.++...+......
T Consensus 11 d~~~~~~~~~~~~l~~~~~~~~~aE~e~~~l~rri~~lE~~le~~e-----------erL~~~~~kL~~~e~~~de~er~ 79 (237)
T PF00261_consen 11 DEAEERLEEAEEKLKEAEKRAEKAEAEVASLQRRIQLLEEELERAE-----------ERLEEATEKLEEAEKRADESERA 79 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH-----------CCCCHHHHHHHHHHHHHHHHCHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH-----------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556677778888888888888888888888888877765553221 11222233333333333333333
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhc---------chhhHHHHHHHHhHHHHHHHH
Q 047021 151 RQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAF---------SVEDKEKLEKQLHDMAVVVER 220 (388)
Q Consensus 151 rqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~---------S~eDkekle~Ql~dma~~VeR 220 (388)
+..-..|...--.++..|.....+....+..+-.+..+...... ....+ .-.--..|+..|+.+.--+-.
T Consensus 80 ~k~lE~r~~~~eeri~~lE~~l~ea~~~~ee~e~k~~E~~rkl~~~E~~Le~aEeR~e~~E~ki~eLE~el~~~~~~lk~ 159 (237)
T PF00261_consen 80 RKVLENREQSDEERIEELEQQLKEAKRRAEEAERKYEEVERKLKVLEQELERAEERAEAAESKIKELEEELKSVGNNLKS 159 (237)
T ss_dssp HHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHH
Confidence 33222333333333334433333333333332222222111110 00000 011123455555555555555
Q ss_pred HHhhHHHHHHHhhhhhHHHHHHhhh-hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 221 LESSRQKLLMEIDNQSSEIERLFEE-NSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT 286 (388)
Q Consensus 221 LE~SRqKLL~EID~QS~EIErLf~E-Ns~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~ 286 (388)
|+.+-. +.++-+.-|++ -..|..-+.++-.=+-.=|+.|..+=++++.|.+-|..++.
T Consensus 160 lE~~~~--------~~~~re~~~e~~i~~L~~~lkeaE~Rae~aE~~v~~Le~~id~le~eL~~~k~ 218 (237)
T PF00261_consen 160 LEASEE--------KASEREDEYEEKIRDLEEKLKEAENRAEFAERRVKKLEKEIDRLEDELEKEKE 218 (237)
T ss_dssp HHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhhhh--------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555444 44444333433 46677888888888888889999999999999998887664
No 144
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=37.54 E-value=3.5e+02 Score=25.33 Aligned_cols=117 Identities=22% Similarity=0.263 Sum_probs=63.5
Q ss_pred HHHHHHHHHhcCCCCCCCCC--CCChhhHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 104 ELNSLQQQAMDGSLGGISSM--GISPDHVQTLEKELATLKSQLQQESQLRQ---QERKQLTEEQARISVLMSEKQELEEK 178 (388)
Q Consensus 104 Eln~lQqq~~~~~~s~~~~~--g~s~~~~~~lekEl~~l~~~lqqes~lrq---qeqqrla~Eq~~~a~l~aEkqeLeek 178 (388)
-..-|.++..+++.++.++. +...+|+..|+.=+.-|..+-+++.-|.+ ..++.|.....-...|..+.+-|...
T Consensus 31 rc~ele~~l~~~~~l~~~~~~~~~~~e~s~dLe~~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~N~~L~~dl~klt~~ 110 (182)
T PF15035_consen 31 RCAELEQQLSASQVLESPSQRRRSEEEHSPDLEEALIRLEEEQQRSEELAQVNALLREQLEQARKANEALQEDLQKLTQD 110 (182)
T ss_pred HHHHHHHHHhcccCcCcccccccccccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666655544433332 34566777888877777777777666544 24555555555555566666665555
Q ss_pred HHHHhh----hchh-hhhhhhhhhhcchh---------hHHHHHHHHhHHHHHHHH
Q 047021 179 LVMFSR----KASE-EVSNAAASKAFSVE---------DKEKLEKQLHDMAVVVER 220 (388)
Q Consensus 179 l~~~~~----~~se-~~~~~a~~k~~S~e---------Dkekle~Ql~dma~~VeR 220 (388)
...+.. +..+ ....-++..-||.+ .--.+.+++.+|....||
T Consensus 111 ~~~l~~eL~~ke~~~~~ee~~~~~y~~~eh~rll~LWr~v~~lRr~f~elr~~Ter 166 (182)
T PF15035_consen 111 WERLRDELEQKEAEWREEEENFNQYLSSEHSRLLSLWREVVALRRQFAELRTATER 166 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 555321 2112 12222233444443 334556677777766665
No 145
>KOG4797 consensus Transcriptional regulator [Transcription]
Probab=37.24 E-value=33 Score=30.79 Aligned_cols=24 Identities=33% Similarity=0.682 Sum_probs=20.1
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhh
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSS 25 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~s 25 (388)
|-.++++|.++|..||+||..|..
T Consensus 72 Lk~qI~eL~er~~~Le~EN~lLk~ 95 (123)
T KOG4797|consen 72 LKEQIRELEERNSALERENSLLKT 95 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 346789999999999999998864
No 146
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=37.10 E-value=74 Score=32.37 Aligned_cols=45 Identities=18% Similarity=0.398 Sum_probs=26.2
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN 47 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n 47 (388)
.++|.+++.+..++|.....+...++.++..+..|...|+|++-+
T Consensus 143 ~~Ri~e~Eeris~lEd~~~~i~~~~~~~~k~i~~l~~kl~DlEnr 187 (370)
T PF02994_consen 143 NSRIDELEERISELEDRIEEIEQAIKELEKRIKKLEDKLDDLENR 187 (370)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhh
Confidence 355666666666666666666655555555555666666665544
No 147
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=36.61 E-value=4.6e+02 Score=26.45 Aligned_cols=102 Identities=23% Similarity=0.332 Sum_probs=72.7
Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 98 AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEE 177 (388)
Q Consensus 98 aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLee 177 (388)
-+.|..|+..|++....-. ...++....+..+|+....++..-.....+-+.++..=...+.....+|+++..
T Consensus 181 ~~~L~~e~~~L~~~~~e~~-------~~d~~eL~~lk~~l~~~~~ei~~~~~~l~e~~~~l~~l~~~I~~~~~~k~e~~~ 253 (312)
T smart00787 181 KDALEEELRQLKQLEDELE-------DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253 (312)
T ss_pred HHHHHHHHHHHHHhHHHHH-------hCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4578888888887763211 125788888888998888888877777777777777777788888888888888
Q ss_pred HHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhH
Q 047021 178 KLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHD 213 (388)
Q Consensus 178 kl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~d 213 (388)
.|+.+.+. .--.+.||.-|-.+|..+++.
T Consensus 254 ~I~~ae~~-------~~~~r~~t~~Ei~~Lk~~~~~ 282 (312)
T smart00787 254 EIAEAEKK-------LEQCRGFTFKEIEKLKEQLKL 282 (312)
T ss_pred HHHHHHHH-------HHhcCCCCHHHHHHHHHHHHH
Confidence 88776542 123455666676677666443
No 148
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown]
Probab=35.90 E-value=8.5e+02 Score=29.33 Aligned_cols=85 Identities=28% Similarity=0.381 Sum_probs=50.2
Q ss_pred hHHHHHHHHhHHHHHHHHHHhhHHHHHHH----------hhhhhHHHHHHh----hhhhhhhhhHH---HHHHHHHHHH-
Q 047021 203 DKEKLEKQLHDMAVVVERLESSRQKLLME----------IDNQSSEIERLF----EENSSLSSSYQ---ESMGIAKHWE- 264 (388)
Q Consensus 203 Dkekle~Ql~dma~~VeRLE~SRqKLL~E----------ID~QS~EIErLf----~ENs~Lsss~q---ea~~iaaqWE- 264 (388)
+...+-+-|++=+.+.+||.++-.+|..- .+.+-.||+++. .||..|=.-++ .+..=.+..|
T Consensus 489 ~iknlnk~L~~r~~elsrl~a~~~elkeQ~kt~~~qye~~~~k~eeLe~~l~~lE~ENa~LlkqI~~Lk~t~qn~~~LEq 568 (1195)
T KOG4643|consen 489 QIKNLNKSLNNRDLELSRLHALKNELKEQYKTCDIQYELLSNKLEELEELLGNLEEENAHLLKQIQSLKTTSQNGALLEQ 568 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 35566777888899999999998888643 344445666654 47776643332 2211222222
Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHhh
Q 047021 265 ---------NQVKDCLKQNEELRGILEKLRTE 287 (388)
Q Consensus 265 ---------nQvqDclkQNeeLr~lLeklR~E 287 (388)
+-.+--+.-.+.||.+.++|+.|
T Consensus 569 ~~n~lE~~~~elkk~idaL~alrrhke~LE~e 600 (1195)
T KOG4643|consen 569 NNNDLELIHNELKKYIDALNALRRHKEKLEEE 600 (1195)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 12222333334778888888877
No 149
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=35.64 E-value=96 Score=29.14 Aligned_cols=39 Identities=28% Similarity=0.392 Sum_probs=24.2
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL 44 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~ 44 (388)
.+.+++.|..|+.++..|+.++...+.+++.|++++.-.
T Consensus 99 ~~~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~ 137 (161)
T TIGR02894 99 DQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTI 137 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555666666666666666666666666666666533
No 150
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=35.57 E-value=4.4e+02 Score=25.96 Aligned_cols=83 Identities=17% Similarity=0.251 Sum_probs=51.9
Q ss_pred hhhhHHHhhhhHHHHHHHHHHHhHHHHHH--Hhhh-------Hhh-------HHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 047021 62 EKDAAVVAREDLSAQLRTLKKRLKEAEEE--QYRA-------EED-------AAALRAELNSLQQQAMDGSLGGISSMGI 125 (388)
Q Consensus 62 EkDaA~vareDl~AQLR~~kkRlkEAEEE--QyRA-------EED-------aAaLRAEln~lQqq~~~~~~s~~~~~g~ 125 (388)
....+.-+++-+-.||-.+++++.+||.. .||. +.+ .+.|++++..++.+...-. + .+|+
T Consensus 164 ~~~~~~~a~~fl~~ql~~~~~~l~~ae~~l~~fr~~~~~~d~~~~~~~~~~~i~~L~~~l~~~~~~l~~l~--~--~~~~ 239 (362)
T TIGR01010 164 NERARKDTIAFAENEVKEAEQRLNATKAELLKYQIKNKVFDPKAQSSAQLSLISTLEGELIRVQAQLAQLR--S--ITPE 239 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--h--hCCC
Confidence 34445556777888888999999988853 3443 222 3456666666666653221 1 2344
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHH
Q 047021 126 SPDHVQTLEKELATLKSQLQQES 148 (388)
Q Consensus 126 s~~~~~~lekEl~~l~~~lqqes 148 (388)
.+-.+..+..+++.++.++..+.
T Consensus 240 ~~P~v~~l~~~i~~l~~~i~~e~ 262 (362)
T TIGR01010 240 QNPQVPSLQARIKSLRKQIDEQR 262 (362)
T ss_pred CCCchHHHHHHHHHHHHHHHHHH
Confidence 45567788888888888877644
No 151
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=35.17 E-value=3.6e+02 Score=25.65 Aligned_cols=70 Identities=19% Similarity=0.240 Sum_probs=43.0
Q ss_pred HHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 210 QLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK 283 (388)
Q Consensus 210 Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek 283 (388)
-+.++..-+.+.-..--..+.++-.....+..+-.||..|..-..+ -..|-.++..|..+|++||.+|+-
T Consensus 42 ~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~l~~en~~L~~e~~~----l~~~~~~~~~l~~en~~L~~lL~~ 111 (276)
T PRK13922 42 VVGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLE----LESRLQELEQLEAENARLRELLNL 111 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3344444444444444444555555556666666666666444333 345667778999999999998864
No 152
>PRK04406 hypothetical protein; Provisional
Probab=35.11 E-value=1.3e+02 Score=24.53 Aligned_cols=52 Identities=21% Similarity=0.357 Sum_probs=0.0
Q ss_pred hHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 34 SESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 34 ~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
.+.+..||.+||-... --.=..|++|-.+-+.-+ +.|..+|+.+..|+++.+
T Consensus 6 ~~~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I---~~L~~ql~~L~~rl~~~~ 59 (75)
T PRK04406 6 IEQLEERINDLECQLAFQEQTIEELNDALSQQQLLI---TKMQDQMKYVVGKVKNMD 59 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhc
No 153
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=34.26 E-value=1.4e+02 Score=23.48 Aligned_cols=45 Identities=16% Similarity=0.204 Sum_probs=31.8
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN 47 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n 47 (388)
-.||-+|+-+.+..|.-...|+.-+-.-...++.|..++..+...
T Consensus 3 e~Ri~~LE~~la~qe~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~r 47 (69)
T PF04102_consen 3 EERIEELEIKLAFQEDTIEELNDVVTEQQRQIDRLQRQLRLLRER 47 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457788888888888888777777766666666666666655443
No 154
>PF11336 DUF3138: Protein of unknown function (DUF3138); InterPro: IPR021485 This family of proteins with unknown function appear to be restricted to Proteobacteria.
Probab=33.99 E-value=39 Score=36.37 Aligned_cols=67 Identities=21% Similarity=0.312 Sum_probs=33.7
Q ss_pred hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELAT 139 (388)
Q Consensus 71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~ 139 (388)
++|.+||+.+|+|.+|-+-. ..|-.-+|.--|-+-.. .-+....++.-++.+..+|++.+|..++++
T Consensus 28 ~~L~~ql~aLq~~v~eL~~~-laa~~~aa~~gA~~~~~-~~a~~~aP~~~a~~~~T~d~~~~~~qqiAn 94 (514)
T PF11336_consen 28 KALQAQLQALQDQVNELRAK-LAAKPAAAPGGAAIGPA-ATAAAAAPSSDAQAGLTNDDATEMRQQIAN 94 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHH-HhcCCCCCCcccccccc-ccccccCCCcccccccChHHHHHHHHHHHh
Confidence 78999999999988874432 11111111100000000 000000111113566778999888888885
No 155
>PRK10246 exonuclease subunit SbcC; Provisional
Probab=33.51 E-value=8.1e+02 Score=28.37 Aligned_cols=20 Identities=15% Similarity=0.104 Sum_probs=14.1
Q ss_pred hccCCCCccccccccCCCCC
Q 047021 289 ASFMSVNDKDILRDFSEPSK 308 (388)
Q Consensus 289 a~~~~~~~~~~~g~~~e~~~ 308 (388)
+.+-|..||+.-|+..-|++
T Consensus 498 ~~L~~GePCPVCGS~~HP~~ 517 (1047)
T PRK10246 498 AQLQAGQPCPLCGSTSHPAV 517 (1047)
T ss_pred HhCCCCCCcCCCCcccCcCc
Confidence 46678889999887554433
No 156
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=33.36 E-value=7.4e+02 Score=27.89 Aligned_cols=32 Identities=25% Similarity=0.496 Sum_probs=19.4
Q ss_pred hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHH
Q 047021 195 ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQK 227 (388)
Q Consensus 195 ~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqK 227 (388)
+.+.| .++-+.+..+|++|+-.++.++.--++
T Consensus 633 AEr~~-~~EL~~~~~~l~~l~~si~~lk~k~~~ 664 (717)
T PF10168_consen 633 AEREF-KKELERMKDQLQDLKASIEQLKKKLDY 664 (717)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555 345667777777777777666544333
No 157
>PRK02119 hypothetical protein; Provisional
Probab=33.11 E-value=1.5e+02 Score=24.01 Aligned_cols=53 Identities=26% Similarity=0.474 Sum_probs=0.0
Q ss_pred hhHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 33 ESESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 33 ~~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
+...+..|+.+||-.+. --.=..|++|-.+-+.-+ +.|..+|+.+..|+++.+
T Consensus 3 ~~~~~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~i---d~L~~ql~~L~~rl~~~~ 57 (73)
T PRK02119 3 IQQNLENRIAELEMKIAFQENLLEELNQALIEQQFVI---DKMQVQLRYMANKLKDMQ 57 (73)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhc
No 158
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=32.99 E-value=9.3e+02 Score=28.90 Aligned_cols=108 Identities=20% Similarity=0.248 Sum_probs=61.1
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhh-------hhhhhhHHHhhhhHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDV-------AMEKDAAVVAREDLSAQL 77 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v-------~mEkDaA~vareDl~AQL 77 (388)
.+..++......+++...+..++.+.+.+.+.+..++..+...-..-..+-|.++ .-.-+...-.-.....++
T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~a~~~~~eL~el~~ql~~~~~~a~~~~~~~~~a~~~~ 363 (1353)
T TIGR02680 284 DLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRL 363 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555556666666666666666666666677777666544444333444322 111111111122235566
Q ss_pred HHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 78 RTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 78 R~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
+..+.++.+++..-|.++...+..+.++..+...+
T Consensus 364 e~~~~~~~~~~~r~~~~~~~l~~~~~el~~~a~~~ 398 (1353)
T TIGR02680 364 EEERRRLDEEAGRLDDAERELRAAREQLARAAERA 398 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 77777777777777777777777777776555444
No 159
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=32.79 E-value=3.6e+02 Score=24.01 Aligned_cols=57 Identities=25% Similarity=0.375 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 101 LRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVL 168 (388)
Q Consensus 101 LRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l 168 (388)
|++-+++|++|..+ -...+.+|+.+++++...+--+-+-+-+-+-|+++++...+-+
T Consensus 42 L~~~l~~L~~q~~s-----------~~qr~~eLqaki~ea~~~le~eK~ak~~l~~r~~k~~~dka~l 98 (107)
T PF09304_consen 42 LRNALQSLQAQNAS-----------RNQRIAELQAKIDEARRNLEDEKQAKLELESRLLKAQKDKAIL 98 (107)
T ss_dssp HHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHH
Confidence 77777788777632 2345777888888887777666555545556666666555544
No 160
>PRK00736 hypothetical protein; Provisional
Probab=32.61 E-value=1.3e+02 Score=23.97 Aligned_cols=50 Identities=26% Similarity=0.332 Sum_probs=0.0
Q ss_pred HHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 36 SLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 36 ~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
++..||.+||-++. --.=..|++|-.+-+..+ +.|..||+.+..|+++.+
T Consensus 2 ~~e~Ri~~LE~klafqe~tie~Ln~~v~~Qq~~i---~~L~~ql~~L~~rl~~~~ 53 (68)
T PRK00736 2 DAEERLTELEIRVAEQEKTIEELSDQLAEQWKTV---EQMRKKLDALTERFLSLE 53 (68)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhc
No 161
>PRK11546 zraP zinc resistance protein; Provisional
Probab=32.47 E-value=1.9e+02 Score=26.59 Aligned_cols=25 Identities=16% Similarity=0.340 Sum_probs=19.6
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 126 SPDHVQTLEKELATLKSQLQQESQL 150 (388)
Q Consensus 126 s~~~~~~lekEl~~l~~~lqqes~l 150 (388)
.+..|.+|.+||.+|...|......
T Consensus 87 D~~kI~aL~kEI~~Lr~kL~e~r~~ 111 (143)
T PRK11546 87 DSSKINAVAKEMENLRQSLDELRVK 111 (143)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455999999999999888765543
No 162
>COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=32.36 E-value=5.7e+02 Score=26.22 Aligned_cols=104 Identities=25% Similarity=0.325 Sum_probs=78.1
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK 85 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk 85 (388)
+++|.++.+.+......+...+-.+|-..-.+...++++++.....+++.+...--+.+++.--...|..++-.+++|+.
T Consensus 287 i~~Lr~~~~~~~~~~~~l~~~~~~~~p~~~~~~~q~~~~~~~~~~e~~~~~~~~~~~~~~l~~~~~~L~~~~~~l~~~~~ 366 (458)
T COG3206 287 IQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLS 366 (458)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccChHHHhHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 67788888888888888888999999999999999999999999888888888777777766666667666666666665
Q ss_pred HHHHHH---hhhHhhHHHHHHHHHHHH
Q 047021 86 EAEEEQ---YRAEEDAAALRAELNSLQ 109 (388)
Q Consensus 86 EAEEEQ---yRAEEDaAaLRAEln~lQ 109 (388)
..=.-| -+-+-|+.+.|.=+.++-
T Consensus 367 ~~~~~~~~l~~L~Re~~~~r~~ye~lL 393 (458)
T COG3206 367 KLPKLQVQLRELEREAEAARSLYETLL 393 (458)
T ss_pred hchHhhhHHHHHHHHHHHHHHHHHHHH
Confidence 443333 344556666666555543
No 163
>COG3880 Modulator of heat shock repressor CtsR, McsA [Signal transduction mechanisms]
Probab=32.33 E-value=1e+02 Score=29.45 Aligned_cols=40 Identities=25% Similarity=0.491 Sum_probs=32.4
Q ss_pred hhhHHHHHHHHHHHhHHHHH-HHhhhHhhHHHHHHHHHHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEE-EQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEE-EQyRAEEDaAaLRAEln~lQqq~ 112 (388)
..++.-+++.++.-|+..=+ |-| |.||.+|.+|++|++..
T Consensus 131 ~i~~~~~I~~L~e~Lq~~i~~Eef---EeAA~iRDqIr~Lk~k~ 171 (176)
T COG3880 131 KINPKRKIIALKEALQDLIEREEF---EEAAVIRDQIRALKAKN 171 (176)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhhc
Confidence 46777889999998888743 444 78999999999999884
No 164
>PF00992 Troponin: Troponin; InterPro: IPR001978 The troponin (Tn) complex regulates Ca2+ induced muscle contraction. Tn contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). This family includes troponin T and troponin I. Troponin I binds to actin and troponin T binds to tropomyosin [, , ].; PDB: 1J1D_C 1MXL_I 1LXF_I 2KRD_I 1J1E_F 2L1R_B 2KGB_I 1YTZ_I 1YV0_I 1VDJ_A ....
Probab=32.20 E-value=1.5e+02 Score=26.26 Aligned_cols=75 Identities=25% Similarity=0.380 Sum_probs=53.4
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHHh----hhhHHHHHHHH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVVA----REDLSAQLRTL 80 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~va----reDl~AQLR~~ 80 (388)
.++|..+...++++.--+..+..-...+++.|-.+.+|+....+ |+|++-=+.+--.+.+.--+ -+||-++|-.+
T Consensus 52 ~k~lh~ri~~leeEryd~E~kv~k~~~Ei~elk~kv~d~rgkfkkp~lkkvkk~~~~~~~~~l~~k~~~~~~lr~~lK~v 131 (132)
T PF00992_consen 52 CKELHERIDKLEEERYDLEEKVAKQDYEIEELKKKVNDLRGKFKKPTLKKVKKSADAMRRKLLGIKHKVSMDLRANLKEV 131 (132)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCSHCCHHHHHHHHHHHHHHHHHHCTTCSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHhHHHHHcccccchHHHHHHHHHHHHHHhhhCcccccHHHHHHHhhcc
Confidence 45677778888888888888888888888888888888888444 98887554444444443333 56666666554
No 165
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=31.69 E-value=1.3e+02 Score=24.32 Aligned_cols=37 Identities=27% Similarity=0.339 Sum_probs=29.9
Q ss_pred hhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh
Q 047021 12 KNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT 48 (388)
Q Consensus 12 ~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~ 48 (388)
....+..++..|..|.+.+--.+++|-.||-.+|+.|
T Consensus 29 q~~~~~~ER~~L~ekne~Ar~rvEamI~RLk~leq~~ 65 (65)
T TIGR02449 29 QEKTWREERAQLLEKNEQARQKVEAMITRLKALEQHT 65 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCC
Confidence 3445677888888999999999999999998888764
No 166
>PF08429 PLU-1: PLU-1-like protein; InterPro: IPR013637 This domain is found in the central region of lysine-specific demethylases, which are nuclear proteins that may have a role in DNA-binding and transcription, and are associated with malignant cancer phenotypes []. The domain is also found in various other Jumonji/ARID domain-containing proteins (see IPR013129 from INTERPRO, IPR001606 from INTERPRO). ; GO: 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process
Probab=31.67 E-value=4.3e+02 Score=25.61 Aligned_cols=79 Identities=24% Similarity=0.368 Sum_probs=58.2
Q ss_pred HHHHHHhHHHHHHHHHHhhHHHHHH-------HhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH-----
Q 047021 206 KLEKQLHDMAVVVERLESSRQKLLM-------EIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ----- 273 (388)
Q Consensus 206 kle~Ql~dma~~VeRLE~SRqKLL~-------EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ----- 273 (388)
++-..|+.+-...++.+..=..+|. .+|.=+.+.++++.....+. .+.+++.-+..|-.+++.++.+
T Consensus 202 ~~~~~L~~~l~~~~~We~ka~~~L~~~~~~l~~Le~l~~~~~~ipv~~~~~~-~L~~~l~kak~w~~~i~~ll~~~~~~~ 280 (335)
T PF08429_consen 202 KLMAELQELLKQGEEWEEKAKELLSRPRVSLEQLEALLEEAENIPVSLPSLD-KLKDALQKAKEWLRQIEELLEQNGSKR 280 (335)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhcCCCchHHHH-HHHHHHHHHHHHHHHHHHHhcccCCCC
Confidence 3444566666666666665555554 67777888888888887754 8999999999999999999998
Q ss_pred --HHHHHHHHHHHH
Q 047021 274 --NEELRGILEKLR 285 (388)
Q Consensus 274 --NeeLr~lLeklR 285 (388)
..+|+.++...+
T Consensus 281 p~~~el~~l~~~~~ 294 (335)
T PF08429_consen 281 PTLDELEELVAESE 294 (335)
T ss_pred CcHHHHHHHHHHHh
Confidence 466777665543
No 167
>PF06632 XRCC4: DNA double-strand break repair and V(D)J recombination protein XRCC4; InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants. XRCC4 binds to DNA, and to DNA ligase IV (LIG4) to form the LIG4-XRCC4 complex []. The LIG4-XRCC4 complex is responsible for the ligation step in the non-homologous end joining (NHEJ) pathway of DNA double-strand break repair. XRCC4 enhances the joining activity of LIG4. It is thought that XRCC4 and LIG4 are essential for alignment-based gap filling, as well as for final ligation of the breaks []. Binding of the LIG4-XRCC4 complex to DNA ends is dependent on the assembly of the DNA-dependent protein kinase complex DNA-PK to these DNA ends. ; GO: 0003677 DNA binding, 0006302 double-strand break repair, 0006310 DNA recombination, 0005634 nucleus; PDB: 3RWR_R 1IK9_B 3SR2_E 3Q4F_H 3II6_B 1FU1_A 3MUD_B.
Probab=31.59 E-value=3.6e+02 Score=27.73 Aligned_cols=49 Identities=22% Similarity=0.283 Sum_probs=35.2
Q ss_pred ChhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021 1 WLLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV 49 (388)
Q Consensus 1 ~l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v 49 (388)
|++..+..|+.+|.+|..+|..|.+.+.......+-+++.=+..|....
T Consensus 134 ~~l~~~~~l~~~~~~L~~enerL~~e~~~~~~qlE~~v~~K~~~E~~L~ 182 (342)
T PF06632_consen 134 WCLDANSRLQAENEHLQKENERLESEANKLLKQLEKFVNAKEEHEEDLY 182 (342)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6778888888888888888888877776666666666665555554443
No 168
>PF14988 DUF4515: Domain of unknown function (DUF4515)
Probab=31.23 E-value=4.7e+02 Score=24.88 Aligned_cols=114 Identities=25% Similarity=0.314 Sum_probs=69.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHH
Q 047021 131 QTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQ 210 (388)
Q Consensus 131 ~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Q 210 (388)
...+++++.++.+|+.=.-.+. .=.....|++.|++.+..+...++...... +.-| ..+|..|+++
T Consensus 57 lq~~k~~~~l~~eLq~l~~~~~-----------~k~~qe~eI~~Le~e~~~~~~e~~~~l~~~--~~qf-l~EK~~LEke 122 (206)
T PF14988_consen 57 LQKEKEQAKLQQELQALKEFRR-----------LKEQQEREIQTLEEELEKMRAEHAEKLQEA--ESQF-LQEKARLEKE 122 (206)
T ss_pred HHHHHHHHHHHHHHHHhHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH-HHHHHHHHHH
Confidence 4455666666666655333311 113456788889998888888777755554 5556 6899999999
Q ss_pred HhHHHH--HHHHHHhh-HHH----HHHHhhhhhHHHHHHhhhhhhhhhhHHHHHH
Q 047021 211 LHDMAV--VVERLESS-RQK----LLMEIDNQSSEIERLFEENSSLSSSYQESMG 258 (388)
Q Consensus 211 l~dma~--~VeRLE~S-RqK----LL~EID~QS~EIErLf~ENs~Lsss~qea~~ 258 (388)
++++-. +.+|-... .+| =++-+.+=+.---.+|.||.-|...+..-+.
T Consensus 123 ~~e~~i~~l~e~a~~el~~k~~ale~~A~~~l~e~~~~i~~EN~~L~k~L~~l~~ 177 (206)
T PF14988_consen 123 ASELKILQLGERAHKELKKKAQALELAAKKSLDEFTRSIKRENQQLRKELLQLIQ 177 (206)
T ss_pred HHHhhHHHhHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 865433 33332221 111 0113444444556789999999888766554
No 169
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=31.17 E-value=9.4e+02 Score=28.38 Aligned_cols=72 Identities=21% Similarity=0.302 Sum_probs=44.0
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQL 77 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQL 77 (388)
++.+|++.+.+|.-+|..|..+++........|.+.++.++-... -.+....+.+=+.|+--.--+.+-|++
T Consensus 672 ~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg-~~~~~~~~~~q~~e~~~t~~eel~a~~ 743 (970)
T KOG0946|consen 672 QIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLG-IISSKQRDLLQGAEASKTQNEELNAAL 743 (970)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccchhhHHhHHHhccCChHHHHHHH
Confidence 456677788888888888888888888888888888887765544 011122233333444444444444443
No 170
>PF11336 DUF3138: Protein of unknown function (DUF3138); InterPro: IPR021485 This family of proteins with unknown function appear to be restricted to Proteobacteria.
Probab=31.11 E-value=1.2e+02 Score=32.75 Aligned_cols=79 Identities=23% Similarity=0.371 Sum_probs=49.3
Q ss_pred CCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh--------
Q 047021 117 LGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASE-------- 188 (388)
Q Consensus 117 ~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se-------- 188 (388)
..+++..+-..+++.+|++++. .|..++.||+.+|++-...+|.
T Consensus 14 l~~~a~a~a~a~~i~~L~~ql~----------------------------aLq~~v~eL~~~laa~~~aa~~gA~~~~~~ 65 (514)
T PF11336_consen 14 LPGAAMAAATADQIKALQAQLQ----------------------------ALQDQVNELRAKLAAKPAAAPGGAAIGPAA 65 (514)
T ss_pred ccccccccCCHHHHHHHHHHHH----------------------------HHHHHHHHHHHHHhcCCCCCCccccccccc
Confidence 3455566667778888877766 3445566777777665443322
Q ss_pred ----hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHh
Q 047021 189 ----EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLES 223 (388)
Q Consensus 189 ----~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~ 223 (388)
.+|...+.-.|+.+|-..+.+|+-..-+-|+-|+-
T Consensus 66 ~a~~~aP~~~a~~~~T~d~~~~~~qqiAn~~lKv~~l~d 104 (514)
T PF11336_consen 66 TAAAAAPSSDAQAGLTNDDATEMRQQIANAQLKVESLED 104 (514)
T ss_pred cccccCCCcccccccChHHHHHHHHHHHhhhhhHHHHhh
Confidence 11222245668888888888887776666766653
No 171
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=30.80 E-value=1.1e+02 Score=28.72 Aligned_cols=32 Identities=22% Similarity=0.367 Sum_probs=16.1
Q ss_pred HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhh
Q 047021 15 ELEEKNSILSSKLQVKEVESESLQTRINELEQ 46 (388)
Q Consensus 15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~ 46 (388)
++..+|..|..++......++.|.+.+..+..
T Consensus 101 ~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~ 132 (161)
T TIGR02894 101 ALQKENERLKNQNESLQKRNEELEKELEKLRQ 132 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445555555555555555555555544443
No 172
>PF08537 NBP1: Fungal Nap binding protein NBP1; InterPro: IPR013743 NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae (Bakers yeast) to be essential for the G2/M transition of the cell cycle.
Probab=30.64 E-value=95 Score=32.00 Aligned_cols=39 Identities=15% Similarity=0.376 Sum_probs=23.4
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE 43 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd 43 (388)
.|.+|+.++.+++.++..+..+|..+...|.-|+.-|+|
T Consensus 183 k~~~l~~~l~~~~~eL~~~~k~L~faqekn~LlqslLdd 221 (323)
T PF08537_consen 183 KIDELEERLNDLEKELEITKKDLKFAQEKNALLQSLLDD 221 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344555555555556666666666666666666666664
No 173
>PF03112 DUF244: Uncharacterized protein family (ORF7) DUF; InterPro: IPR004335 Many of the proteins in this entry are Borrelia burgdorferi plasmid proteins of unknown function.
Probab=30.54 E-value=53 Score=30.74 Aligned_cols=32 Identities=41% Similarity=0.574 Sum_probs=28.1
Q ss_pred HHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021 228 LLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ 273 (388)
Q Consensus 228 LL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ 273 (388)
=|+|||+--.||++.-.||... |+..+|-||+
T Consensus 75 ~L~EI~~lq~ElnKiqnEn~k~--------------ekp~Kd~LK~ 106 (158)
T PF03112_consen 75 SLMEIDSLQTELNKIQNENKKR--------------EKPIKDLLKI 106 (158)
T ss_pred hHHHHHHHHHHHHHHHHHHHhh--------------hchHHHHHHH
Confidence 4899999999999999999876 7888888776
No 174
>PRK02793 phi X174 lysis protein; Provisional
Probab=30.17 E-value=1.4e+02 Score=24.00 Aligned_cols=55 Identities=24% Similarity=0.372 Sum_probs=0.0
Q ss_pred HhhHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021 32 VESESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE 89 (388)
Q Consensus 32 ~~~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE 89 (388)
+....+..||.+||-.+. --.=..|++|-.+-+..+ +.|..+|+.+..|+++.+.
T Consensus 1 m~~~~~e~Ri~~LE~~lafQe~tIe~Ln~~v~~Qq~~I---~~L~~~l~~L~~rl~~~~~ 57 (72)
T PRK02793 1 MQDSSLEARLAELESRLAFQEITIEELNVTVTAHEMEM---AKLRDHLRLLTEKLKASQP 57 (72)
T ss_pred CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhcc
No 175
>smart00751 BSD domain in transcription factors and synapse-associated proteins.
Probab=30.01 E-value=43 Score=24.71 Aligned_cols=25 Identities=40% Similarity=0.601 Sum_probs=22.0
Q ss_pred hhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021 232 IDNQSSEIERLFEENSSLSSSYQES 256 (388)
Q Consensus 232 ID~QS~EIErLf~ENs~Lsss~qea 256 (388)
+|.+..+|+.|+.+|.+|..-+.+.
T Consensus 5 ~~~~~~~i~~il~~~p~l~~~~~~l 29 (51)
T smart00751 5 LDAKKEEIESLLKENPLLKKLYNEL 29 (51)
T ss_pred ccCCHHHHHHHHHHCHHHHHHHHHH
Confidence 5788899999999999999888874
No 176
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=29.91 E-value=3.2e+02 Score=30.74 Aligned_cols=53 Identities=19% Similarity=0.340 Sum_probs=30.5
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh---------HhhhhhhhhHHHHh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN---------ELEQNTVPSLRKAL 56 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln---------d~e~n~v~slrkaL 56 (388)
..|+.|+..|.+|+.++..+...|...+.+.+.+..+.. ....+++-.|++.|
T Consensus 429 ~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~~~~~~rei~~~~~~I~~L~~~L 490 (652)
T COG2433 429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL 490 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH
Confidence 345556666666655555555555555555555544443 23456677888888
No 177
>PRK14145 heat shock protein GrpE; Provisional
Probab=29.15 E-value=2.5e+02 Score=26.87 Aligned_cols=61 Identities=20% Similarity=0.265 Sum_probs=46.4
Q ss_pred hhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH------------HHHhhhhHHHHHHHHhhHHHHHHH
Q 047021 318 THKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQT------------TQAYNGLVRLYKPVLRNIESSLIK 378 (388)
Q Consensus 318 ~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra------------~QA~n~L~Rly~PvL~~IEsrLik 378 (388)
....++..|+.++.+-..+...|...+.-+-|+..-. .-|.-+|.+-+.||+-|+|.-|-.
T Consensus 42 ~~~~e~~~l~~~l~~le~e~~el~d~~lR~~AEfeN~rkR~~kE~e~~~~~a~e~~~~~LLpV~DnLerAl~~ 114 (196)
T PRK14145 42 QTVDEIEELKQKLQQKEVEAQEYLDIAQRLKAEFENYRKRTEKEKSEMVEYGKEQVILELLPVMDNFERALAS 114 (196)
T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHHhc
Confidence 3445677788888877777788877777777776432 357788999999999999987743
No 178
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=29.12 E-value=1.6e+02 Score=25.08 Aligned_cols=36 Identities=28% Similarity=0.432 Sum_probs=28.7
Q ss_pred ChhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHH
Q 047021 1 WLLSQLKDFQKKNAELEEKNSILSSKLQVKEVESES 36 (388)
Q Consensus 1 ~l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~ 36 (388)
|+.=+++.+++.|+.|..+|..|..++..++.++.-
T Consensus 20 y~~~k~~ka~~~~~kL~~en~qlk~Ek~~~~~qvkn 55 (87)
T PF10883_consen 20 YLWWKVKKAKKQNAKLQKENEQLKTEKAVAETQVKN 55 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455678999999999999999999888776655443
No 179
>KOG2129 consensus Uncharacterized conserved protein H4 [Function unknown]
Probab=28.32 E-value=4.2e+02 Score=28.94 Aligned_cols=83 Identities=34% Similarity=0.394 Sum_probs=46.2
Q ss_pred hHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhh----hhhhhHHHh----------hhhHHHHHHHHHHHhH
Q 047021 20 NSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVA----MEKDAAVVA----------REDLSAQLRTLKKRLK 85 (388)
Q Consensus 20 n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~----mEkDaA~va----------reDl~AQLR~~kkRlk 85 (388)
-+.|-++++..+.++++|++.|| ..--+ |++ -+||+ |+.|-|--- =+-+-.+|+.++|-.+
T Consensus 203 vN~LwKrmdkLe~ekr~Lq~KlD-qpvs~-p~~---prdia~~~~~~gD~a~~~~~hi~~l~~EveRlrt~l~~Aqk~~~ 277 (552)
T KOG2129|consen 203 VNSLWKRMDKLEQEKRYLQKKLD-QPVST-PSL---PRDIAKIPDVHGDEAAAEKLHIDKLQAEVERLRTYLSRAQKSYQ 277 (552)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhc-CcccC-CCc---hhhhhcCccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35666777778888888888772 22222 322 25666 888866322 1334445555555444
Q ss_pred HHHHHHhhhHhhHHHHHHHHHHHHH
Q 047021 86 EAEEEQYRAEEDAAALRAELNSLQQ 110 (388)
Q Consensus 86 EAEEEQyRAEEDaAaLRAEln~lQq 110 (388)
| +=-||++||- ..|.|...||.
T Consensus 278 e-k~~qy~~Ee~--~~reen~rlQr 299 (552)
T KOG2129|consen 278 E-KLMQYRAEEV--DHREENERLQR 299 (552)
T ss_pred H-HHHHHHHHHh--hHHHHHHHHHH
Confidence 3 2347777764 34555555544
No 180
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=28.17 E-value=2.5e+02 Score=21.34 Aligned_cols=43 Identities=28% Similarity=0.451 Sum_probs=30.6
Q ss_pred HHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhh
Q 047021 204 KEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEEN 246 (388)
Q Consensus 204 kekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~EN 246 (388)
|.+-..++.+|-.-|..|++--+.|..+|+.=..++..|-.+|
T Consensus 21 R~RKk~~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e~ 63 (64)
T PF00170_consen 21 RQRKKQYIEELEEKVEELESENEELKKELEQLKKEIQSLKSEN 63 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4555566667777777777777777777777777777776665
No 181
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=27.99 E-value=4.3e+02 Score=23.49 Aligned_cols=60 Identities=27% Similarity=0.361 Sum_probs=35.1
Q ss_pred HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHH
Q 047021 15 ELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKK 82 (388)
Q Consensus 15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kk 82 (388)
+|-.+++-|.+-+++....+.+.++|++++..++ ..+++.|+ .--+||.+|...|--+.+
T Consensus 34 eL~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki-~ea~~~le-------~eK~ak~~l~~r~~k~~~ 93 (107)
T PF09304_consen 34 ELAKQKDQLRNALQSLQAQNASRNQRIAELQAKI-DEARRNLE-------DEKQAKLELESRLLKAQK 93 (107)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH-------HHHHHHHHHHHHHHHHHH
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH-------HHHHHHHHHHHHHHHHHh
Confidence 3444555666666667777777777777776653 34444442 234566666666544443
No 182
>PRK14143 heat shock protein GrpE; Provisional
Probab=27.79 E-value=4.7e+02 Score=25.65 Aligned_cols=62 Identities=19% Similarity=0.275 Sum_probs=46.7
Q ss_pred hhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHH------------HHHHhhhhHHHHHHHHhhHHHHHHHhhh
Q 047021 320 KNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQ------------TTQAYNGLVRLYKPVLRNIESSLIKMKQ 381 (388)
Q Consensus 320 ~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~r------------a~QA~n~L~Rly~PvL~~IEsrLikmkq 381 (388)
..++..|+.++..-....+.|.-+.+-+.|+..- -..|..+|.+-+.||+-|+|.-|..++.
T Consensus 66 ~~~~~~l~~el~~l~~e~~elkd~~lR~~AdfeN~RKR~~kE~e~~~~~a~~~~~~~lLpV~DnLerAl~~~~~ 139 (238)
T PRK14143 66 AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLRLQLKCNTLSEILPVVDNFERARQQLKP 139 (238)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhcccc
Confidence 3457778888777777777777777777776532 2357888999999999999998876543
No 183
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=27.75 E-value=3.9e+02 Score=26.37 Aligned_cols=10 Identities=0% Similarity=0.116 Sum_probs=3.6
Q ss_pred HHhhHHHHHH
Q 047021 10 QKKNAELEEK 19 (388)
Q Consensus 10 q~~~~~le~~ 19 (388)
+...++++.+
T Consensus 220 ~~~l~~~~~~ 229 (362)
T TIGR01010 220 EGELIRVQAQ 229 (362)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 184
>PRK00295 hypothetical protein; Provisional
Probab=27.72 E-value=2e+02 Score=22.93 Aligned_cols=50 Identities=36% Similarity=0.481 Sum_probs=0.0
Q ss_pred HHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021 36 SLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE 88 (388)
Q Consensus 36 ~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE 88 (388)
++..|+.+||-+.. --.=..|++|-.+-+.-+ +.|..||+.+..|+++.+
T Consensus 2 ~~e~Ri~~LE~kla~qE~tie~Ln~~v~~Qq~~I---~~L~~ql~~L~~rl~~~~ 53 (68)
T PRK00295 2 SLEERVTELESRQAFQDDTIQALNDVLVEQQRVI---ERLQLQMAALIKRQEEMV 53 (68)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhh
No 185
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=27.71 E-value=6.3e+02 Score=25.30 Aligned_cols=78 Identities=22% Similarity=0.265 Sum_probs=57.2
Q ss_pred hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRG 279 (388)
Q Consensus 202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~ 279 (388)
.+...+.+.|..+=.-.+.|..-..+|-.|.+....+.++.+.+.+.+---+.+...--....+|..-...+.+.|+.
T Consensus 57 ~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~~L~k 134 (314)
T PF04111_consen 57 QEEEELLQELEELEKEREELDQELEELEEELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQYEYASNQLDRLRK 134 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333444444444444455555555556666666667788999999999988888888888999999999999999885
No 186
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=27.69 E-value=3.7e+02 Score=27.62 Aligned_cols=97 Identities=23% Similarity=0.325 Sum_probs=67.3
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK 81 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k 81 (388)
|+.++..+..++.+|+.+.+-+.-.+++...++|......+-|-. .|--.| =..+.-+|-=|.|.+-=|-++
T Consensus 131 lV~qLEk~~~q~~qLe~d~qs~lDEkeEl~~ERD~yk~K~~RLN~----ELn~~L----~g~~~rivDIDaLi~ENRyL~ 202 (319)
T PF09789_consen 131 LVEQLEKLREQIEQLERDLQSLLDEKEELVTERDAYKCKAHRLNH----ELNYIL----NGDENRIVDIDALIMENRYLK 202 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHh----CCCCCCcccHHHHHHHHHHHH
Confidence 577888999999999999999999999999999998754443311 122222 222234445678889999999
Q ss_pred HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
.|++.++||- ..++.-|+-...-+.
T Consensus 203 erl~q~qeE~-------~l~k~~i~KYK~~le 227 (319)
T PF09789_consen 203 ERLKQLQEEK-------ELLKQTINKYKSALE 227 (319)
T ss_pred HHHHHHHHHH-------HHHHHHHHHHHHHHH
Confidence 9999988774 344444554444443
No 187
>PF10481 CENP-F_N: Cenp-F N-terminal domain; InterPro: IPR018463 Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore []. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal part of the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein [].
Probab=27.45 E-value=7.1e+02 Score=25.76 Aligned_cols=118 Identities=28% Similarity=0.361 Sum_probs=70.0
Q ss_pred HHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-------H
Q 047021 206 KLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEEL-------R 278 (388)
Q Consensus 206 kle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeL-------r 278 (388)
-|.+-.+-+....+.|+-+||||--|+-.--.-+-- -|-|+--|=+|.+.| +
T Consensus 64 ~LkREnq~l~e~c~~lek~rqKlshdlq~Ke~qv~~---------------------lEgQl~s~Kkqie~Leqelkr~K 122 (307)
T PF10481_consen 64 ALKRENQSLMESCENLEKTRQKLSHDLQVKESQVNF---------------------LEGQLNSCKKQIEKLEQELKRCK 122 (307)
T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHhhhHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 355667788888999999999999887544332221 244555555555544 4
Q ss_pred HHHHHHHhhhhccC-CCCccccccccCCCCCC-ccccCCCchh-hhhHHhhHHHHHHHhhHHHHhHHHHHHHHHH
Q 047021 279 GILEKLRTEQASFM-SVNDKDILRDFSEPSKD-AVSQNGSETH-KNEFLALKGQLAKEQSRAEALSAQVLQLSAQ 350 (388)
Q Consensus 279 ~lLeklR~EQa~~~-~~~~~~~~g~~~e~~~d-g~~~~~~~~~-~te~l~LK~eL~kEqsRseaLsaqv~qLsAe 350 (388)
.-||+.+.+..+.- +-+++.| |.+. .+.-+.++.| .+-+..|+...-+|--..-.|.++|..|-+.
T Consensus 123 sELErsQ~~~~~~~~sl~~~st------pqk~f~~p~tp~q~~~~sk~e~L~ekynkeveerkrle~e~k~lq~k 191 (307)
T PF10481_consen 123 SELERSQQAASSGDVSLNPCST------PQKSFATPLTPSQYYSDSKYEELQEKYNKEVEERKRLEAEVKALQAK 191 (307)
T ss_pred HHHHHHHHhhccCCccccccCC------chhhccCCCChhhhhhhhhHHHHHHHHHHHHHHHhhHHHHHHHHhcc
Confidence 55555554333111 1112211 1111 1112223333 3468899999999999999999999888653
No 188
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=27.33 E-value=2.8e+02 Score=22.38 Aligned_cols=15 Identities=47% Similarity=0.800 Sum_probs=6.4
Q ss_pred hhHHHHHHHHHHHhH
Q 047021 71 EDLSAQLRTLKKRLK 85 (388)
Q Consensus 71 eDl~AQLR~~kkRlk 85 (388)
+++...+-.++.|++
T Consensus 57 ~~~e~~~~~l~~~l~ 71 (74)
T PF12329_consen 57 EELEKELESLEERLK 71 (74)
T ss_pred HHHHHHHHHHHHHhh
Confidence 334444444444444
No 189
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=27.06 E-value=1.7e+02 Score=24.83 Aligned_cols=33 Identities=18% Similarity=0.162 Sum_probs=13.1
Q ss_pred hhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021 24 SSKLQVKEVESESLQTRINELEQNTVPSLRKAL 56 (388)
Q Consensus 24 ~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL 56 (388)
...+.+..-+...+..+++.+.+.+---|.+.|
T Consensus 71 ~l~l~el~G~~~~l~~~l~~v~~~~~lLlE~~l 103 (106)
T PF10805_consen 71 QLELAELRGELKELSARLQGVSHQLDLLLENEL 103 (106)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333333333344444444444443333333333
No 190
>TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein. This family represents an accessory protein that works with the bacteriocin maturation and ABC transport secretion protein described by TIGR01193.
Probab=26.71 E-value=7.1e+02 Score=25.56 Aligned_cols=20 Identities=20% Similarity=0.459 Sum_probs=10.3
Q ss_pred hhhHHHHHHHHHHHHHHHHH
Q 047021 127 PDHVQTLEKELATLKSQLQQ 146 (388)
Q Consensus 127 ~~~~~~lekEl~~l~~~lqq 146 (388)
.+++...+.++..++.++..
T Consensus 290 ~~~l~~~~~~l~~~~~~l~~ 309 (457)
T TIGR01000 290 KQEITDLNQKLLELESKIKS 309 (457)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44455555555555555543
No 191
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=26.10 E-value=1.7e+02 Score=25.28 Aligned_cols=44 Identities=32% Similarity=0.480 Sum_probs=23.0
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHH-------HHhhHHHHHHhhHhhh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVK-------EVESESLQTRINELEQ 46 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k-------~~~~~~L~~~Lnd~e~ 46 (388)
+.+|..|++....+-.+...|...+.+. .++|+-|..+|...+.
T Consensus 7 ~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~ 57 (107)
T PF06156_consen 7 FDRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELEQ 57 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3444444444444444444444444433 3466666777776655
No 192
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=25.97 E-value=1.2e+02 Score=28.74 Aligned_cols=39 Identities=33% Similarity=0.427 Sum_probs=28.9
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHH---HHHHhhHHHHHHhh
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQ---VKEVESESLQTRIN 42 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~---~k~~~~~~L~~~Ln 42 (388)
....++.+.|.+|+++|..|..++. ....+++.|.+-|+
T Consensus 69 ~~~~~l~~en~~L~~e~~~l~~~~~~~~~l~~en~~L~~lL~ 110 (276)
T PRK13922 69 ASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLN 110 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3567788888899988888888777 55666766666555
No 193
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=25.96 E-value=4.3e+02 Score=23.70 Aligned_cols=76 Identities=20% Similarity=0.242 Sum_probs=33.7
Q ss_pred HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhh-hhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021 9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAM-EKDAAVVAREDLSAQLRTLKKRLKEA 87 (388)
Q Consensus 9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~m-EkDaA~vareDl~AQLR~~kkRlkEA 87 (388)
+.....+.+.....+...+.....+.+.+.+..+.+....+..-.-+| | +=|..+-.-+++-..++.++++...+
T Consensus 96 l~~~l~~~~~~~~~~r~~l~~~k~~r~k~~~~~~~l~~~~~~~~~P~l----l~Dy~~~~~~~~~l~~~i~~l~rk~~~l 171 (177)
T PF13870_consen 96 LKQELKDREEELAKLREELYRVKKERDKLRKQNKKLRQQGGLLGVPAL----LRDYDKTKEEVEELRKEIKELERKVEIL 171 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444444444444444443332111111 2 23344444455555555666655555
Q ss_pred H
Q 047021 88 E 88 (388)
Q Consensus 88 E 88 (388)
+
T Consensus 172 ~ 172 (177)
T PF13870_consen 172 E 172 (177)
T ss_pred H
Confidence 4
No 194
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=25.47 E-value=1.2e+03 Score=27.71 Aligned_cols=71 Identities=23% Similarity=0.260 Sum_probs=40.0
Q ss_pred hhHHHHHHHHhHHHHHHHHHHhhHHHHH--------------HHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHH
Q 047021 202 EDKEKLEKQLHDMAVVVERLESSRQKLL--------------MEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQV 267 (388)
Q Consensus 202 eDkekle~Ql~dma~~VeRLE~SRqKLL--------------~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQv 267 (388)
++...++++...++.+||++..-++-+. .|.-.=--|+++|-..=+++++++ ++|-.|+
T Consensus 459 ~s~~~~~~~~~~L~d~le~~~~~~~~~~~K~e~~~~~le~l~~El~~l~~e~~~lq~~~~~~~qs~-------~~~~~~l 531 (980)
T KOG0980|consen 459 QSIDDVEEENTNLNDQLEELQRAAGRAETKTESQAKALESLRQELALLLIELEELQRTLSNLAQSH-------NNQLAQL 531 (980)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH-------HHHHHHH
Confidence 4444666677777777776655444321 122222235555555544444444 5688888
Q ss_pred HHHHHHHHHHHH
Q 047021 268 KDCLKQNEELRG 279 (388)
Q Consensus 268 qDclkQNeeLr~ 279 (388)
.|-++|-+.+-.
T Consensus 532 ~~~l~~KD~~~~ 543 (980)
T KOG0980|consen 532 EDLLKQKDRLAA 543 (980)
T ss_pred HHHHHhhHHHHH
Confidence 888888665543
No 195
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=25.45 E-value=3.9e+02 Score=22.15 Aligned_cols=27 Identities=41% Similarity=0.585 Sum_probs=21.4
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 126 SPDHVQTLEKELATLKSQLQQESQLRQ 152 (388)
Q Consensus 126 s~~~~~~lekEl~~l~~~lqqes~lrq 152 (388)
..+...+|++++..|+.+|+.+..+|.
T Consensus 6 ~~~~r~~LeqeV~~Lq~~L~~E~~~r~ 32 (88)
T PF14389_consen 6 LHERRSALEQEVAELQKQLQEEQDLRR 32 (88)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445677899999999999988877754
No 196
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=25.09 E-value=5.7e+02 Score=23.85 Aligned_cols=36 Identities=22% Similarity=0.262 Sum_probs=19.6
Q ss_pred HHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhh
Q 047021 16 LEEKNSILSSKLQVKEVESESLQTRINELEQNTVPS 51 (388)
Q Consensus 16 le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~s 51 (388)
+......|..+++....++..|...|.+......++
T Consensus 67 ~~~~~~~l~~~~~~~~~~i~~l~~~i~~~~~~r~~~ 102 (188)
T PF03962_consen 67 RQNKLEKLQKEIEELEKKIEELEEKIEEAKKGREES 102 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 344445555555556666666666666554444444
No 197
>PF14645 Chibby: Chibby family
Probab=25.09 E-value=76 Score=27.79 Aligned_cols=25 Identities=32% Similarity=0.573 Sum_probs=20.7
Q ss_pred hhhHHHHHhhHHHHHHhHHhhhhHH
Q 047021 4 SQLKDFQKKNAELEEKNSILSSKLQ 28 (388)
Q Consensus 4 ~~v~~lq~~~~~le~~n~~L~sk~~ 28 (388)
.-.+.+.++|..|++||++|.-|++
T Consensus 71 ~~~~~l~~~n~~L~EENN~Lklk~e 95 (116)
T PF14645_consen 71 EENQRLRKENQQLEEENNLLKLKIE 95 (116)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3457789999999999999987764
No 198
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=24.85 E-value=3.6e+02 Score=21.51 Aligned_cols=49 Identities=20% Similarity=0.313 Sum_probs=30.6
Q ss_pred hhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 64 DAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 64 DaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
|+=+.|+..+.-.|+.+|.--..++..-.-||..-..|-++|+.|..++
T Consensus 7 ~~EirakQ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~ 55 (61)
T PF08826_consen 7 EAEIRAKQAIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEM 55 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5666777777777766665555555555555556666666766666554
No 199
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.85 E-value=1.2e+03 Score=27.62 Aligned_cols=26 Identities=27% Similarity=0.437 Sum_probs=17.6
Q ss_pred hhhhhHHHHHhhHHHHHHhHHhhhhH
Q 047021 2 LLSQLKDFQKKNAELEEKNSILSSKL 27 (388)
Q Consensus 2 l~~~v~~lq~~~~~le~~n~~L~sk~ 27 (388)
|..++|++|+++..|--+..+|.-++
T Consensus 491 lqarikE~q~kl~~l~~Ekq~l~~ql 516 (1118)
T KOG1029|consen 491 LQARIKELQEKLQKLAPEKQELNHQL 516 (1118)
T ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHH
Confidence 45677888888777766666665444
No 200
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=24.35 E-value=2.5e+02 Score=22.12 Aligned_cols=43 Identities=21% Similarity=0.327 Sum_probs=23.7
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT 48 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~ 48 (388)
|...+.+...++...+.|.......+.++..+..+|++++.|+
T Consensus 8 l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n~ 50 (71)
T PF10779_consen 8 LNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSNT 50 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444555555555555666677777776665
No 201
>PRK10997 yieM hypothetical protein; Provisional
Probab=24.28 E-value=4.3e+02 Score=28.56 Aligned_cols=83 Identities=29% Similarity=0.331 Sum_probs=55.4
Q ss_pred hhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhh---hhHHHHHHhhhhhhh-------------hhhHHHHH
Q 047021 194 AASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDN---QSSEIERLFEENSSL-------------SSSYQESM 257 (388)
Q Consensus 194 a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~---QS~EIErLf~ENs~L-------------sss~qea~ 257 (388)
+++.-|-..=+..|..++.. ..++-+|..|.|||.||-. -+..+|..|.+|..- ..+++..
T Consensus 127 ~~~~~f~~~w~~~l~~~~~~--~~~~~~e~~re~~l~el~~r~e~~~~l~~v~~~~~~~~grlWd~s~g~l~~~d~~~~- 203 (487)
T PRK10997 127 ALHTLFLQRWRLSLVVQTTT--LNQQLLEQEREQLLAELQQRMTLSGQLEPVLAENDEAAGRLWDMSAGQLKRGDYQLI- 203 (487)
T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccccccccCcccccCHHHH-
Confidence 34555544445666666543 3578899999999999953 356677788877641 1122222
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 258 GIAKHWENQVKDCLKQNEELRGILEKLRT 286 (388)
Q Consensus 258 ~iaaqWEnQvqDclkQNeeLr~lLeklR~ 286 (388)
.+.-+-|++|.+|+.+.+.|=.
T Consensus 204 -------~~~a~~L~~~p~L~~ia~~LGR 225 (487)
T PRK10997 204 -------VQYGEFLKQQPELQKLAEQLGR 225 (487)
T ss_pred -------HHHHHHHHhCHHHHHHHHHhCc
Confidence 2456789999999999999643
No 202
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=24.17 E-value=4.6e+02 Score=22.52 Aligned_cols=42 Identities=29% Similarity=0.368 Sum_probs=30.8
Q ss_pred hHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021 72 DLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM 113 (388)
Q Consensus 72 Dl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~ 113 (388)
++..-+-.+++|+...++.--.=+.+.+.++.+++.+++.++
T Consensus 91 ~~~eA~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~ 132 (140)
T PRK03947 91 DLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQ 132 (140)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556667777777777777777788888888888877764
No 203
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=23.69 E-value=2.2e+02 Score=22.55 Aligned_cols=39 Identities=5% Similarity=0.220 Sum_probs=28.5
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE 43 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd 43 (388)
++.+|+++.-.++....-+.++++.....++.+.+...+
T Consensus 1 Ri~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~envk~ 39 (55)
T PF05377_consen 1 RIDELENELPRIESSINTVKKENEEISESVEKIEENVKD 39 (55)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467788888888877777777777777777776665543
No 204
>PF02151 UVR: UvrB/uvrC motif; InterPro: IPR001943 During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products []. UvrB and UvrC share a common domain of around 35 amino acids, the so called UVR domain. This domain in UvrB can interact with the homologous domain in UvrC throughout a putative coiled coil structure. This interaction is important for the incision of the damaged strand [].; GO: 0003677 DNA binding, 0004518 nuclease activity, 0006289 nucleotide-excision repair; PDB: 3PXG_D 3PXI_C 1E52_B 1QOJ_B 2D7D_B 2NMV_B.
Probab=23.58 E-value=89 Score=21.81 Aligned_cols=16 Identities=44% Similarity=0.684 Sum_probs=14.4
Q ss_pred hhHHHHHHHHHHHHHH
Q 047021 96 EDAAALRAELNSLQQQ 111 (388)
Q Consensus 96 EDaAaLRAEln~lQqq 111 (388)
|.||.+|-+|+.|+.|
T Consensus 21 E~Aa~~Rd~i~~l~~q 36 (36)
T PF02151_consen 21 EKAARLRDQIKALKKQ 36 (36)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcC
Confidence 6899999999999875
No 205
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=23.24 E-value=9.9e+02 Score=26.03 Aligned_cols=96 Identities=28% Similarity=0.401 Sum_probs=61.7
Q ss_pred hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021 70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ 149 (388)
Q Consensus 70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~ 149 (388)
=||+..|||.++--|+-|+++---+|-.--+.|.|+..--+ +-...+.+++.++-.+-...+
T Consensus 76 lddi~~qlr~~rtel~~a~~~k~~~e~er~~~~~El~~~r~------------------e~~~v~~~~~~a~~n~~kAqQ 137 (499)
T COG4372 76 LDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQ------------------EREAVRQELAAARQNLAKAQQ 137 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHHHHHHH
Confidence 58999999999999999999888888777777777754321 223344444443333323222
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021 150 LRQQERKQLTEEQARISVLMSEKQELEEKLVMFS 183 (388)
Q Consensus 150 lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~ 183 (388)
.-.+-..+.-.=+.++.+|-.+-++|+++.+++-
T Consensus 138 ~lar~t~Q~q~lqtrl~~l~~qr~ql~aq~qsl~ 171 (499)
T COG4372 138 ELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQ 171 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2222233333456677888888888888887754
No 206
>PF10506 MCC-bdg_PDZ: PDZ domain of MCC-2 bdg protein for Usher syndrome; InterPro: IPR019536 The entry represents a protein that has a high homology to the tumour suppressor Usher syndrome type-1C protein-binding protein 1, or known as MCC2 (mutated in colon cancer). MCC2 protein binds the first PDZ domain of AIE-75 with its C-terminal amino acids -DTFL. A possible role of MCC2 as a tumour suppressor has been put forward. The carboxyl terminus of the predicted protein was DTFL which matched the consensus motif X-S/T-X-phi (phi: hydrophobic amino acid residue) for binding to the PDZ domain of AIE-75 [, ].
Probab=22.87 E-value=2.7e+02 Score=22.60 Aligned_cols=48 Identities=33% Similarity=0.517 Sum_probs=39.9
Q ss_pred HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021 8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL 56 (388)
Q Consensus 8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL 56 (388)
.|+.+..+|...|.+|.+.+++-....+.|..+|---|-|.+- ||-|+
T Consensus 2 rL~~~ie~L~~~n~~L~~~le~~k~~se~Ls~~lgk~es~~~a-lrlal 49 (67)
T PF10506_consen 2 RLKRRIEELKSQNEMLSSTLEERKQQSEELSMDLGKYESNATA-LRLAL 49 (67)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH-HHHHH
Confidence 5778889999999999999999999999999998887777653 24444
No 207
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=22.85 E-value=3e+02 Score=21.32 Aligned_cols=24 Identities=38% Similarity=0.421 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 154 ERKQLTEEQARISVLMSEKQELEEKLV 180 (388)
Q Consensus 154 eqqrla~Eq~~~a~l~aEkqeLeekl~ 180 (388)
|..||..+. ..+..+...++.+|+
T Consensus 5 E~~rL~Kel---~kl~~~i~~~~~kL~ 28 (66)
T PF10458_consen 5 EIERLEKEL---EKLEKEIERLEKKLS 28 (66)
T ss_dssp HHHHHHHHH---HHHHHHHHHHHHHHC
T ss_pred HHHHHHHHH---HHHHHHHHHHHHHHc
Confidence 344444444 334444444555544
No 208
>PF04568 IATP: Mitochondrial ATPase inhibitor, IATP; InterPro: IPR007648 ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=22.78 E-value=2.6e+02 Score=24.20 Aligned_cols=47 Identities=28% Similarity=0.295 Sum_probs=33.3
Q ss_pred hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHh
Q 047021 195 ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLF 243 (388)
Q Consensus 195 ~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf 243 (388)
+..+|+.-+++.=+. +=-..--|+|+.-|.||-.||+.+-.+|++|=
T Consensus 50 ag~~f~krE~A~E~~--Y~r~~EkEqL~~Lk~kl~~e~~~~~k~i~~le 96 (100)
T PF04568_consen 50 AGGAFGKREAAQEEQ--YFRKKEKEQLKKLKEKLKEEIEHHRKEIDELE 96 (100)
T ss_dssp CCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hCCccchHHHhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455676544443222 22233479999999999999999999999873
No 209
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=21.77 E-value=4.6e+02 Score=21.58 Aligned_cols=36 Identities=28% Similarity=0.434 Sum_probs=25.7
Q ss_pred HHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021 77 LRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA 112 (388)
Q Consensus 77 LR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~ 112 (388)
...+++|+...+...-.-+.+...++.+++.+...+
T Consensus 89 ~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l 124 (129)
T cd00890 89 IEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEEL 124 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466677777777777777777777777777777655
No 210
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=21.75 E-value=6.9e+02 Score=23.61 Aligned_cols=32 Identities=38% Similarity=0.613 Sum_probs=19.8
Q ss_pred hhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021 56 LKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE 89 (388)
Q Consensus 56 L~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE 89 (388)
|+..+..|+-+- |+.|.-+|-.+..++.+++.
T Consensus 108 L~~L~~dknL~e--ReeL~~kL~~~~~~l~~~~~ 139 (194)
T PF15619_consen 108 LKKLSEDKNLAE--REELQRKLSQLEQKLQEKEK 139 (194)
T ss_pred HHHHHHcCCchh--HHHHHHHHHHHHHHHHHHHH
Confidence 444444444333 67777777777777776654
No 211
>PF03999 MAP65_ASE1: Microtubule associated protein (MAP65/ASE1 family); InterPro: IPR007145 This is a family of microtubule associated proteins. One of its members is the yeast anaphase spindle elongation protein.; PDB: 3NRX_A 3NRY_A.
Probab=21.53 E-value=31 Score=37.03 Aligned_cols=151 Identities=26% Similarity=0.393 Sum_probs=0.0
Q ss_pred HhhhhHH--HHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCC------------CCCCCCCCChhhHHHH
Q 047021 68 VAREDLS--AQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSL------------GGISSMGISPDHVQTL 133 (388)
Q Consensus 68 vareDl~--AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~------------s~~~~~g~s~~~~~~l 133 (388)
+-..|+. ..|..++..+..-++|--+--+....++.+|..|-....-... .+..+++++++.+..|
T Consensus 133 ~~~~~l~S~~~l~~l~~~l~~L~~e~~~R~~~v~~l~~~I~~l~~~L~~~~~~~~~e~~l~~~~~~~~~~~Ls~~~l~~L 212 (619)
T PF03999_consen 133 IDESDLPSLEELEELRQHLQRLQEEKERRLEEVRELREEIISLMEELGIDPERTSFEKDLLSYSEDEESFCLSDENLEKL 212 (619)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccchhhccccccccccCCCCHHHHHHH
Confidence 3456777 7899999999999988888888888899888877555433221 1122567777777777
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhh-hhhhhcchhhHHHHHHHHh
Q 047021 134 EKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNA-AASKAFSVEDKEKLEKQLH 212 (388)
Q Consensus 134 ekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~-a~~k~~S~eDkekle~Ql~ 212 (388)
..-+..|..+..+. ..++..+......|=+.|.. +.+....| ..+..+|...-..++..
T Consensus 213 ~~~~~~L~~~k~~r--------------~~~~~~l~~~i~~LW~~L~~----~~ee~~~F~~~~~~ls~~~i~~l~~E-- 272 (619)
T PF03999_consen 213 QELLQELEEEKEER--------------EEKLQELREKIEELWNRLDV----PEEEREAFLEENSGLSLDTIEALEEE-- 272 (619)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHhCC----CHHHHHHHhhccCcchHHHHHHHHHH--
Confidence 66665443322211 11223333333333333321 23344444 34566777666777777
Q ss_pred HHHHHHHHHHhhHHHHHHHh-hhhhHHHHHHh
Q 047021 213 DMAVVVERLESSRQKLLMEI-DNQSSEIERLF 243 (388)
Q Consensus 213 dma~~VeRLE~SRqKLL~EI-D~QS~EIErLf 243 (388)
|.||+.-|..-|-++ +.-..||+.|+
T Consensus 273 -----l~RL~~lK~~~lk~~I~~~R~ei~elW 299 (619)
T PF03999_consen 273 -----LERLEELKKQNLKEFIEKKRQEIEELW 299 (619)
T ss_dssp --------------------------------
T ss_pred -----HHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 888888887777664 66677777654
No 212
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=21.34 E-value=1.1e+03 Score=25.66 Aligned_cols=78 Identities=23% Similarity=0.272 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHH
Q 047021 162 QARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIER 241 (388)
Q Consensus 162 q~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIEr 241 (388)
+..+..+..+...+++.+..+.++-....+.... +.+ .+....+++.+.....-++.++.....+-.+|..--.+|.+
T Consensus 390 ~~~~~~~~~~~~~~e~el~~l~~~l~~~~~~e~i-~~l-~e~l~~l~~~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 467 (650)
T TIGR03185 390 QDAKSQLLKELRELEEELAEVDKKISTIPSEEQI-AQL-LEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDE 467 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCChHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556677777777777777777653321111101 111 14445555555555555555555555444444444444433
No 213
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=21.18 E-value=1.4e+03 Score=27.03 Aligned_cols=38 Identities=24% Similarity=0.246 Sum_probs=31.4
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN 42 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln 42 (388)
.++.|+..|.+|+++.+...|.........+.|..+|-
T Consensus 679 ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg 716 (970)
T KOG0946|consen 679 MEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLG 716 (970)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 46778888888888888888888888888888888887
No 214
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=21.18 E-value=1.8e+02 Score=25.85 Aligned_cols=20 Identities=25% Similarity=0.385 Sum_probs=9.2
Q ss_pred HHHHhhHHHHHHhhHhhhhh
Q 047021 29 VKEVESESLQTRINELEQNT 48 (388)
Q Consensus 29 ~k~~~~~~L~~~Lnd~e~n~ 48 (388)
+...++..|..+|..+..++
T Consensus 120 ~l~~e~~~l~~kL~~l~~~~ 139 (169)
T PF07106_consen 120 ELEEEIEELEEKLEKLRSGS 139 (169)
T ss_pred HHHHHHHHHHHHHHHHHhCC
Confidence 33444445555555444433
No 215
>PF12240 Angiomotin_C: Angiomotin C terminal; InterPro: IPR024646 This domain represents the C-terminal region of angiomotin. Angiomotin regulates the action of angiogenesis-inhibitor angiostatin []. The C-terminal region of angiomotin appears to be involved in directing the protein chemotactically [].
Probab=21.05 E-value=1.4e+02 Score=29.12 Aligned_cols=38 Identities=24% Similarity=0.305 Sum_probs=33.8
Q ss_pred hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021 6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE 43 (388)
Q Consensus 6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd 43 (388)
|.--..||.+||-..+-|-.+|.+|+.-+.+|+++.-.
T Consensus 131 l~~a~~K~qemE~RIK~LhaqI~EKDAmIkVLQqrs~~ 168 (205)
T PF12240_consen 131 LHMANRKCQEMENRIKALHAQIAEKDAMIKVLQQRSRK 168 (205)
T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 44557899999999999999999999999999998763
No 216
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=20.98 E-value=1.1e+03 Score=25.83 Aligned_cols=20 Identities=30% Similarity=0.243 Sum_probs=10.9
Q ss_pred hhHHHHHHHHHHHhHHHHHH
Q 047021 71 EDLSAQLRTLKKRLKEAEEE 90 (388)
Q Consensus 71 eDl~AQLR~~kkRlkEAEEE 90 (388)
|+..+|+-+--+|+..-..+
T Consensus 381 e~k~~q~q~k~~k~~kel~~ 400 (493)
T KOG0804|consen 381 ERKLQQLQTKLKKCQKELKE 400 (493)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 45566666655555444333
No 217
>PF14282 FlxA: FlxA-like protein
Probab=20.89 E-value=5.2e+02 Score=21.93 Aligned_cols=25 Identities=32% Similarity=0.513 Sum_probs=20.5
Q ss_pred CCCChhhHHHHHHHHHHHHHHHHHH
Q 047021 123 MGISPDHVQTLEKELATLKSQLQQE 147 (388)
Q Consensus 123 ~g~s~~~~~~lekEl~~l~~~lqqe 147 (388)
.|.++.+|..|++.|..|+.+|+.-
T Consensus 14 ~~~~~~~I~~L~~Qi~~Lq~ql~~l 38 (106)
T PF14282_consen 14 SGSSDSQIEQLQKQIKQLQEQLQEL 38 (106)
T ss_pred CCcHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445888999999999999998753
No 218
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=20.87 E-value=1.1e+03 Score=26.58 Aligned_cols=50 Identities=22% Similarity=0.247 Sum_probs=26.5
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL 56 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL 56 (388)
..|+....+++.+...+...+++.+.....+...++.++.+.-.-++++.
T Consensus 523 ~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~~~ 572 (782)
T PRK00409 523 ASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAE 572 (782)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555555555555555555555555555555444444444
No 219
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=20.83 E-value=2e+02 Score=28.73 Aligned_cols=47 Identities=28% Similarity=0.372 Sum_probs=40.5
Q ss_pred hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh
Q 047021 3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP 50 (388)
Q Consensus 3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~ 50 (388)
..+|..+++.+..+..+...|..||+-|..+.+-.++||..|. ++.|
T Consensus 175 ~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq-~vRP 221 (267)
T PF10234_consen 175 QQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQ-SVRP 221 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCh
Confidence 4578888899999999999999999999999999999999774 3444
No 220
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=20.36 E-value=9.6e+02 Score=24.76 Aligned_cols=72 Identities=22% Similarity=0.311 Sum_probs=40.5
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHH--HhhhhHHHHHHHHHHHh
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAV--VAREDLSAQLRTLKKRL 84 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~--vareDl~AQLR~~kkRl 84 (388)
.+....|..|..+...|..++.+..-++..|-..+++.+--..+ -++.. ..|++++.||-.++++.
T Consensus 75 ~~sre~Nk~L~~Ev~~Lrqkl~E~qGD~KlLR~~la~~r~~~~~------------~~~~~~~~ere~lV~qLEk~~~q~ 142 (319)
T PF09789_consen 75 SESREQNKKLKEEVEELRQKLNEAQGDIKLLREKLARQRVGDEG------------IGARHFPHEREDLVEQLEKLREQI 142 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHhhhhhhcc------------ccccccchHHHHHHHHHHHHHHHH
Confidence 34445555555555555555555555555555555443221111 11111 55999999998888887
Q ss_pred HHHHHH
Q 047021 85 KEAEEE 90 (388)
Q Consensus 85 kEAEEE 90 (388)
..-|-+
T Consensus 143 ~qLe~d 148 (319)
T PF09789_consen 143 EQLERD 148 (319)
T ss_pred HHHHHH
Confidence 766543
No 221
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning]
Probab=20.32 E-value=1.2e+03 Score=26.06 Aligned_cols=207 Identities=17% Similarity=0.243 Sum_probs=0.0
Q ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHH
Q 047021 126 SPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKE 205 (388)
Q Consensus 126 s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDke 205 (388)
++.-+.-|+++.++|+.-.+.-.--+.+--.+-..=.-.++.+-+|..+-|+.+..+...-.+=--.. -.+.||.+|-+
T Consensus 257 ~~~~~eslre~~~~L~~D~nK~~~y~~~~~~k~~~~~~~l~~l~~Eie~kEeE~e~lq~~~d~Lk~~I-e~Q~iS~~dve 335 (581)
T KOG0995|consen 257 DPGKEESLREKKARLQDDVNKFQAYVSQMKSKKQHMEKKLEMLKSEIEEKEEEIEKLQKENDELKKQI-ELQGISGEDVE 335 (581)
T ss_pred CcchHHHHHHHHHHHHhHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCHHHHH
Q ss_pred HHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021 206 KLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLR 285 (388)
Q Consensus 206 kle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR 285 (388)
++-.....+-.-|.++..-+++|--++=.+-++|++-|.+ -+.++.||-.+--+|.-.
T Consensus 336 ~mn~Er~~l~r~l~~i~~~~d~l~k~vw~~~l~~~~~f~~-----------------le~~~~~~~~l~~~i~l~----- 393 (581)
T KOG0995|consen 336 RMNLERNKLKRELNKIQSELDRLSKEVWELKLEIEDFFKE-----------------LEKKFIDLNSLIRRIKLG----- 393 (581)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHHH-----
Q ss_pred hhhhccCCCCccccccccCCCCCCccccCC--CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021 286 TEQASFMSVNDKDILRDFSEPSKDAVSQNG--SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL 361 (388)
Q Consensus 286 ~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~--~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L 361 (388)
...+..+....+..+.+.|+.-.+ -|..--.+....+++.++.-....|.--+-.+..-...-.|-..++
T Consensus 394 ------~~~~~~n~~~~pe~~~~~~~d~k~~V~~~l~el~~ei~~~~~~~~~~~~tLq~~~~~~~~~i~E~~~~l~~~ 465 (581)
T KOG0995|consen 394 ------IAENSKNLERNPERAATNGVDLKSYVKPLLKELLDEISEELHEAENELETLQEHFSNKASTIEEKIQILGEI 465 (581)
T ss_pred ------HHHHhccCCcCCccCccccccchhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 222
>PF04899 MbeD_MobD: MbeD/MobD like ; InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=20.29 E-value=2.9e+02 Score=22.53 Aligned_cols=40 Identities=25% Similarity=0.347 Sum_probs=33.5
Q ss_pred CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHH
Q 047021 316 SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTT 355 (388)
Q Consensus 316 ~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~ 355 (388)
..+|......|+-.|.....+-.+|+++|..|+-++..--
T Consensus 23 ~~~Wq~sy~~Lq~~~~~t~~~~a~L~~qv~~Ls~qv~~Ls 62 (70)
T PF04899_consen 23 QQEWQSSYADLQHMFEQTSQENAALSEQVNNLSQQVQRLS 62 (70)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667778889999999999999999999999988776543
No 223
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=20.14 E-value=8.4e+02 Score=27.55 Aligned_cols=50 Identities=22% Similarity=0.255 Sum_probs=27.3
Q ss_pred HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021 7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL 56 (388)
Q Consensus 7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL 56 (388)
.+|+....+++.....+...+.+.+.....|..+++.++...---+.+|-
T Consensus 518 ~~L~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~a~ 567 (771)
T TIGR01069 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELE 567 (771)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555555555555566666666555554444443333
No 224
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=20.08 E-value=4.3e+02 Score=22.11 Aligned_cols=51 Identities=25% Similarity=0.305 Sum_probs=38.8
Q ss_pred hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHH
Q 047021 198 AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQE 255 (388)
Q Consensus 198 ~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qe 255 (388)
.+..+.++.|-+++ --|-.+=+-|+.-||..-.|.++|-.||..|..=..-
T Consensus 12 ~~~~e~k~~Li~ei-------~~LQ~sL~~L~~Rve~Vk~E~~kL~~EN~~Lq~YI~n 62 (80)
T PF10224_consen 12 KLEKEEKEELIQEI-------LELQDSLEALSDRVEEVKEENEKLESENEYLQQYIGN 62 (80)
T ss_pred HHhHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677888888884 4455555678889999999999999999888654433
No 225
>PF09766 FimP: Fms-interacting protein; InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress []. This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes [].
Probab=20.04 E-value=2e+02 Score=29.16 Aligned_cols=51 Identities=25% Similarity=0.363 Sum_probs=37.1
Q ss_pred hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021 5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL 56 (388)
Q Consensus 5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL 56 (388)
+=+.|.+.+.+|+.+...|...+.++....++|...|.++...+.| +++.|
T Consensus 102 ~Rk~L~~~~~el~~~k~~l~~~~~~k~~~L~~l~~~L~~l~~a~~p-lq~~l 152 (355)
T PF09766_consen 102 QRKRLEEQLKELEQRKKKLQQENKKKKKFLDSLPPQLKSLKKAAKP-LQEYL 152 (355)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH-HHHHh
Confidence 3467777788888888888888888888888888888777666654 34444
No 226
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=20.04 E-value=1.4e+02 Score=23.42 Aligned_cols=15 Identities=27% Similarity=0.636 Sum_probs=8.8
Q ss_pred hHHHHHHhhHhhhhh
Q 047021 34 SESLQTRINELEQNT 48 (388)
Q Consensus 34 ~~~L~~~Lnd~e~n~ 48 (388)
.|-+-+|+++||++.
T Consensus 30 iDeM~~RIDdLE~si 44 (54)
T PF06825_consen 30 IDEMSSRIDDLEKSI 44 (54)
T ss_dssp HHHHHHHHHCCHHHH
T ss_pred HHHHHhhHHHHHHHH
Confidence 455556666666554
No 227
>PF08649 DASH_Dad1: DASH complex subunit Dad1; InterPro: IPR013958 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules []. Throughout the cell cycle Dad1 remains bound to kinetochores and its association is dependent on the Mis6 and Mal2 [].
Probab=20.03 E-value=2.8e+02 Score=22.10 Aligned_cols=41 Identities=24% Similarity=0.511 Sum_probs=33.1
Q ss_pred HhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHH
Q 047021 222 ESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWEN 265 (388)
Q Consensus 222 E~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEn 265 (388)
|.-|++|+.||- ..+|.++.-=+.|--|+..+++|+...+.
T Consensus 4 e~qR~~Li~eI~---~~~e~vl~nlN~LNRsLE~~i~VGkEF~~ 44 (58)
T PF08649_consen 4 ERQRDRLIQEIS---ESMESVLNNLNALNRSLESVISVGKEFES 44 (58)
T ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHhHHHHHHHhhhHHH
Confidence 456999999995 56778887778888999999998887654
Done!