Query psy11481
Match_columns 561
No_of_seqs 133 out of 530
Neff 2.5
Searched_HMMs 46136
Date Fri Aug 16 19:32:50 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11481.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11481hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2483|consensus 99.8 1.2E-18 2.6E-23 169.2 14.8 103 179-293 53-155 (232)
2 KOG1318|consensus 99.7 1.9E-16 4.1E-21 164.1 14.7 134 183-329 231-368 (411)
3 PF00010 HLH: Helix-loop-helix 99.4 1.7E-13 3.7E-18 103.9 6.4 53 185-237 1-55 (55)
4 KOG1319|consensus 99.4 1.5E-12 3.3E-17 125.3 10.4 88 183-270 60-151 (229)
5 cd00083 HLH Helix-loop-helix d 99.4 1.1E-12 2.4E-17 98.9 6.5 57 184-240 3-59 (60)
6 smart00353 HLH helix loop heli 99.2 3.2E-11 7E-16 89.9 6.3 52 190-241 1-52 (53)
7 KOG2588|consensus 98.8 5.8E-09 1.3E-13 117.1 5.6 95 158-254 244-343 (953)
8 KOG0561|consensus 98.1 2.8E-06 6E-11 87.2 4.2 70 184-254 59-128 (373)
9 KOG3561|consensus 98.0 5.2E-06 1.1E-10 93.1 5.4 55 185-239 20-75 (803)
10 KOG4029|consensus 97.9 1.1E-05 2.3E-10 77.4 3.8 61 183-243 107-168 (228)
11 KOG4304|consensus 97.8 1E-05 2.2E-10 80.1 2.8 61 184-244 31-96 (250)
12 KOG3960|consensus 97.8 7.7E-05 1.7E-09 75.3 7.8 63 178-241 111-173 (284)
13 KOG3582|consensus 97.1 0.0002 4.4E-09 79.8 2.3 87 184-270 650-738 (856)
14 PLN03217 transcription factor 96.4 0.0071 1.5E-07 53.3 6.0 47 197-243 19-68 (93)
15 KOG3910|consensus 96.0 0.0044 9.6E-08 67.6 3.2 73 183-267 524-597 (632)
16 KOG4447|consensus 95.4 0.0086 1.9E-07 57.4 2.2 54 185-239 78-131 (173)
17 PF06005 DUF904: Protein of un 94.7 0.17 3.8E-06 42.5 7.5 47 226-273 13-59 (72)
18 PF02344 Myc-LZ: Myc leucine z 94.6 0.071 1.5E-06 39.5 4.5 28 239-266 2-29 (32)
19 KOG3898|consensus 94.4 0.02 4.3E-07 57.1 1.8 56 184-239 71-126 (254)
20 KOG4395|consensus 93.9 0.062 1.3E-06 55.0 4.1 56 183-238 172-227 (285)
21 smart00338 BRLZ basic region l 92.2 0.69 1.5E-05 36.8 6.9 42 228-269 23-64 (65)
22 KOG4447|consensus 92.0 0.27 5.9E-06 47.5 5.2 76 192-268 29-104 (173)
23 PRK10884 SH3 domain-containing 91.9 0.52 1.1E-05 46.1 7.1 51 223-273 124-174 (206)
24 KOG4343|consensus 91.0 0.37 7.9E-06 53.7 5.6 44 227-270 298-341 (655)
25 PF00170 bZIP_1: bZIP transcri 90.4 1.5 3.3E-05 34.8 7.2 39 230-268 25-63 (64)
26 PRK15422 septal ring assembly 89.2 1.8 3.9E-05 37.8 7.2 47 226-273 13-66 (79)
27 COG3074 Uncharacterized protei 88.5 1.9 4.1E-05 37.4 6.8 47 226-273 13-66 (79)
28 KOG3119|consensus 88.4 1.6 3.6E-05 43.9 7.4 40 234-273 218-257 (269)
29 PF01920 Prefoldin_2: Prefoldi 87.9 1.6 3.4E-05 36.4 5.9 79 190-270 22-101 (106)
30 PF07716 bZIP_2: Basic region 86.7 3.3 7.2E-05 32.2 6.7 39 229-267 16-54 (54)
31 KOG4005|consensus 85.5 4.5 9.7E-05 41.8 8.7 77 184-270 58-143 (292)
32 PF05164 ZapA: Cell division p 85.0 9.1 0.0002 31.1 8.8 61 197-262 27-89 (89)
33 TIGR02894 DNA_bind_RsfA transc 84.8 3 6.5E-05 40.4 6.8 46 223-268 79-134 (161)
34 PF14775 NYD-SP28_assoc: Sperm 83.5 6.1 0.00013 32.3 7.1 56 203-263 2-58 (60)
35 smart00502 BBC B-Box C-termina 83.2 5 0.00011 33.4 6.7 76 195-270 14-90 (127)
36 PRK13922 rod shape-determining 83.1 3.6 7.8E-05 40.4 6.7 39 231-269 69-110 (276)
37 COG2433 Uncharacterized conser 83.0 12 0.00025 42.8 11.3 134 133-268 287-466 (652)
38 KOG4571|consensus 82.6 5.6 0.00012 41.6 8.1 42 228-269 238-279 (294)
39 PF11690 DUF3287: Protein of u 82.2 2.7 5.9E-05 38.4 5.1 57 227-284 38-96 (109)
40 PF10205 KLRAQ: Predicted coil 81.9 7.5 0.00016 35.3 7.7 38 233-270 35-72 (102)
41 PF04111 APG6: Autophagy prote 81.5 5 0.00011 41.3 7.4 41 228-268 47-87 (314)
42 PRK10722 hypothetical protein; 81.2 10 0.00022 38.9 9.3 79 196-291 124-209 (247)
43 PF15458 NTR2: Nineteen comple 80.6 16 0.00034 36.8 10.3 84 186-269 140-253 (254)
44 PF04977 DivIC: Septum formati 80.2 4.6 9.9E-05 32.1 5.3 31 234-264 20-50 (80)
45 PRK00888 ftsB cell division pr 80.0 4.1 9E-05 36.1 5.4 33 234-266 30-62 (105)
46 PRK14127 cell division protein 79.4 5.8 0.00013 36.0 6.2 40 231-270 30-69 (109)
47 PRK05771 V-type ATP synthase s 79.4 14 0.00031 40.9 10.4 79 194-277 49-134 (646)
48 COG3074 Uncharacterized protei 78.6 5.8 0.00013 34.5 5.6 28 238-265 39-66 (79)
49 PF06156 DUF972: Protein of un 78.2 7.4 0.00016 35.0 6.5 37 230-266 21-57 (107)
50 PF04880 NUDE_C: NUDE protein, 78.0 2.4 5.2E-05 40.9 3.5 39 226-268 9-47 (166)
51 PF08317 Spc7: Spc7 kinetochor 77.7 16 0.00034 37.5 9.4 72 196-268 185-267 (325)
52 PF12329 TMF_DNA_bd: TATA elem 77.6 7.5 0.00016 32.6 6.0 40 231-270 33-72 (74)
53 smart00338 BRLZ basic region l 76.9 13 0.00027 29.6 6.8 41 230-270 18-58 (65)
54 smart00787 Spc7 Spc7 kinetocho 76.3 17 0.00036 37.8 9.2 19 196-214 180-198 (312)
55 KOG3119|consensus 75.9 5 0.00011 40.5 5.3 32 239-270 216-247 (269)
56 COG4026 Uncharacterized protei 75.2 13 0.00029 38.3 8.1 38 233-270 144-181 (290)
57 KOG4196|consensus 75.0 8 0.00017 36.7 6.0 37 234-270 77-113 (135)
58 PRK13169 DNA replication intia 74.7 9.7 0.00021 34.7 6.3 37 230-266 21-57 (110)
59 TIGR02894 DNA_bind_RsfA transc 74.6 8.4 0.00018 37.4 6.2 40 231-270 104-143 (161)
60 PF10186 Atg14: UV radiation r 74.4 36 0.00079 32.7 10.4 36 233-268 65-100 (302)
61 KOG3863|consensus 73.9 7.8 0.00017 43.9 6.6 44 227-270 507-550 (604)
62 PF04977 DivIC: Septum formati 73.1 5.7 0.00012 31.5 4.0 33 238-270 17-49 (80)
63 TIGR02449 conserved hypothetic 73.0 11 0.00024 31.7 5.8 42 229-270 12-53 (65)
64 PF00170 bZIP_1: bZIP transcri 72.0 22 0.00047 28.3 7.0 42 229-270 17-58 (64)
65 PF10168 Nup88: Nuclear pore c 71.7 3.3 7.2E-05 47.2 3.2 39 238-276 632-670 (717)
66 PF11221 Med21: Subunit 21 of 71.5 13 0.00027 34.3 6.4 60 203-270 84-143 (144)
67 KOG3582|consensus 70.9 1 2.3E-05 51.6 -0.8 66 182-250 784-851 (856)
68 TIGR02449 conserved hypothetic 70.6 25 0.00054 29.7 7.3 36 233-268 2-37 (65)
69 KOG4196|consensus 70.5 8.9 0.00019 36.4 5.2 39 233-273 70-108 (135)
70 KOG2264|consensus 69.9 11 0.00023 43.3 6.5 32 236-267 119-150 (907)
71 PHA01750 hypothetical protein 69.9 19 0.00041 31.2 6.5 29 241-269 45-73 (75)
72 PF07544 Med9: RNA polymerase 69.5 10 0.00022 32.2 5.0 41 227-267 38-81 (83)
73 KOG2264|consensus 69.2 14 0.0003 42.4 7.1 33 233-265 95-127 (907)
74 PF02183 HALZ: Homeobox associ 69.2 14 0.00031 28.8 5.3 34 236-269 10-43 (45)
75 PF10883 DUF2681: Protein of u 68.9 24 0.00051 31.2 7.1 33 232-264 24-56 (87)
76 PF10211 Ax_dynein_light: Axon 68.7 30 0.00064 33.5 8.5 19 195-213 59-77 (189)
77 TIGR02209 ftsL_broad cell divi 68.7 24 0.00052 28.8 6.8 30 235-264 28-57 (85)
78 PF12709 Kinetocho_Slk19: Cent 68.5 14 0.00031 32.8 5.7 66 200-267 13-78 (87)
79 TIGR02231 conserved hypothetic 68.4 25 0.00053 37.9 8.7 43 228-270 128-170 (525)
80 PF09726 Macoilin: Transmembra 67.9 16 0.00035 41.8 7.5 39 231-269 538-576 (697)
81 PRK13922 rod shape-determining 67.7 59 0.0013 32.1 10.5 31 240-270 71-104 (276)
82 TIGR00219 mreC rod shape-deter 67.6 18 0.00039 36.8 7.1 44 226-269 58-108 (283)
83 KOG4571|consensus 67.4 38 0.00082 35.8 9.4 38 230-267 254-291 (294)
84 PF09789 DUF2353: Uncharacteri 67.2 43 0.00094 35.5 9.9 46 220-265 68-113 (319)
85 COG5019 CDC3 Septin family pro 67.1 34 0.00074 37.1 9.3 40 231-270 328-367 (373)
86 PF10226 DUF2216: Uncharacteri 66.5 34 0.00074 34.3 8.5 12 328-339 176-188 (195)
87 PF13815 Dzip-like_N: Iguana/D 66.4 19 0.00041 32.1 6.2 44 224-267 62-109 (118)
88 PF08172 CASP_C: CASP C termin 66.2 22 0.00048 36.0 7.3 51 220-273 78-128 (248)
89 TIGR02338 gimC_beta prefoldin, 66.1 38 0.00083 29.7 8.0 70 200-270 36-106 (110)
90 PF08826 DMPK_coil: DMPK coile 65.3 33 0.00071 28.6 6.9 30 236-265 30-59 (61)
91 PF07989 Microtub_assoc: Micro 65.0 19 0.0004 30.7 5.6 67 189-269 6-74 (75)
92 PF02996 Prefoldin: Prefoldin 64.7 21 0.00045 30.7 6.0 42 226-270 75-116 (120)
93 PF06005 DUF904: Protein of un 64.2 26 0.00057 29.6 6.3 30 233-262 27-56 (72)
94 PF05565 Sipho_Gp157: Siphovir 63.9 28 0.00062 32.6 7.2 43 229-271 45-87 (162)
95 PF11740 KfrA_N: Plasmid repli 63.7 64 0.0014 27.8 8.8 33 236-268 86-118 (120)
96 PRK10884 SH3 domain-containing 63.5 30 0.00065 34.1 7.6 34 237-270 131-164 (206)
97 PF04102 SlyX: SlyX; InterPro 63.2 15 0.00033 30.2 4.7 33 233-265 20-52 (69)
98 PF07888 CALCOCO1: Calcium bin 62.9 75 0.0016 36.0 11.2 51 220-270 406-456 (546)
99 cd00890 Prefoldin Prefoldin is 62.7 25 0.00055 30.2 6.1 38 233-270 89-126 (129)
100 COG4026 Uncharacterized protei 62.2 19 0.00041 37.3 6.0 6 204-209 79-84 (290)
101 cd00632 Prefoldin_beta Prefold 62.2 62 0.0013 28.0 8.4 50 221-270 53-102 (105)
102 KOG1962|consensus 61.6 45 0.00098 33.7 8.5 78 186-269 114-196 (216)
103 PF02996 Prefoldin: Prefoldin 61.6 22 0.00047 30.5 5.5 46 221-266 74-119 (120)
104 PF05008 V-SNARE: Vesicle tran 61.5 18 0.0004 29.3 4.8 66 197-269 5-71 (79)
105 COG1382 GimC Prefoldin, chaper 61.4 27 0.00059 32.4 6.4 39 232-270 71-109 (119)
106 PF13942 Lipoprotein_20: YfhG 61.2 64 0.0014 32.1 9.2 76 199-291 81-163 (179)
107 PRK11637 AmiB activator; Provi 60.8 26 0.00056 36.9 7.0 39 231-269 89-127 (428)
108 PF09730 BicD: Microtubule-ass 60.7 68 0.0015 37.4 10.7 69 192-262 76-145 (717)
109 PF11414 Suppressor_APC: Adeno 60.5 34 0.00075 29.9 6.5 39 230-268 6-44 (84)
110 PF07716 bZIP_2: Basic region 60.4 33 0.00072 26.7 5.9 32 228-259 22-53 (54)
111 PRK09413 IS2 repressor TnpA; R 60.3 19 0.0004 31.9 5.0 33 233-265 73-105 (121)
112 KOG0709|consensus 59.9 15 0.00033 40.6 5.3 52 219-270 260-318 (472)
113 PF04508 Pox_A_type_inc: Viral 59.8 12 0.00026 26.3 2.9 20 246-265 2-21 (23)
114 COG2433 Uncharacterized conser 59.6 19 0.00041 41.2 6.0 38 233-270 424-461 (652)
115 PRK10803 tol-pal system protei 59.5 25 0.00055 35.2 6.4 37 234-270 57-93 (263)
116 PRK03947 prefoldin subunit alp 59.4 40 0.00086 30.2 7.0 40 226-268 92-131 (140)
117 PRK11637 AmiB activator; Provi 59.2 32 0.00069 36.2 7.3 37 231-267 96-132 (428)
118 TIGR03752 conj_TIGR03752 integ 58.3 35 0.00077 37.9 7.7 34 236-269 107-140 (472)
119 PF06424 PRP1_N: PRP1 splicing 58.3 8.8 0.00019 36.0 2.8 37 239-275 84-124 (133)
120 PF09311 Rab5-bind: Rabaptin-l 58.2 6.3 0.00014 37.4 1.9 45 224-268 8-52 (181)
121 PRK09039 hypothetical protein; 58.1 74 0.0016 33.4 9.6 14 411-425 295-308 (343)
122 PF04156 IncA: IncA protein; 58.0 53 0.0011 30.5 7.8 26 226-251 125-150 (191)
123 PRK00888 ftsB cell division pr 58.0 23 0.0005 31.5 5.2 34 239-273 28-61 (105)
124 PF12097 DUF3573: Protein of u 57.3 30 0.00066 37.4 6.8 118 236-360 33-158 (383)
125 PF15066 CAGE1: Cancer-associa 57.2 13 0.00029 41.3 4.3 53 224-276 307-362 (527)
126 TIGR00293 prefoldin, archaeal 56.9 34 0.00074 30.0 6.1 38 233-270 88-125 (126)
127 PHA00728 hypothetical protein 56.8 12 0.00026 35.6 3.4 27 244-270 4-30 (151)
128 PRK09343 prefoldin subunit bet 56.5 80 0.0017 28.5 8.4 70 200-270 40-110 (121)
129 PRK00295 hypothetical protein; 56.2 36 0.00078 28.3 5.7 33 233-265 21-53 (68)
130 KOG0933|consensus 55.7 53 0.0012 39.8 8.9 80 194-284 964-1050(1174)
131 PF11853 DUF3373: Protein of u 55.5 9.1 0.0002 42.4 2.8 35 239-273 32-66 (489)
132 TIGR00219 mreC rod shape-deter 55.5 1E+02 0.0022 31.5 10.0 23 241-263 69-91 (283)
133 PF14916 CCDC92: Coiled-coil d 55.5 18 0.00038 30.2 3.8 37 223-265 5-41 (60)
134 PF05377 FlaC_arch: Flagella a 55.4 41 0.00089 27.8 5.8 36 233-268 2-37 (55)
135 TIGR02209 ftsL_broad cell divi 55.1 87 0.0019 25.6 7.8 37 234-270 20-56 (85)
136 PRK03947 prefoldin subunit alp 54.7 39 0.00084 30.3 6.1 43 223-268 96-138 (140)
137 PF05529 Bap31: B-cell recepto 54.7 23 0.00049 33.4 4.9 31 238-268 154-184 (192)
138 cd00632 Prefoldin_beta Prefold 54.6 54 0.0012 28.4 6.8 48 218-265 57-104 (105)
139 PF04156 IncA: IncA protein; 54.5 72 0.0016 29.6 8.1 66 199-265 106-171 (191)
140 PRK02119 hypothetical protein; 54.1 34 0.00074 28.8 5.3 33 233-265 25-57 (73)
141 PF14723 SSFA2_C: Sperm-specif 53.8 26 0.00056 34.7 5.2 47 224-273 127-173 (179)
142 TIGR03752 conj_TIGR03752 integ 53.6 37 0.0008 37.8 6.9 21 232-252 74-94 (472)
143 PF10234 Cluap1: Clusterin-ass 53.4 36 0.00078 35.2 6.4 44 227-270 179-222 (267)
144 cd00584 Prefoldin_alpha Prefol 53.3 43 0.00094 29.4 6.1 38 233-270 89-126 (129)
145 PF02344 Myc-LZ: Myc leucine z 53.3 30 0.00065 26.1 4.2 25 246-270 2-26 (32)
146 PRK04406 hypothetical protein; 53.2 35 0.00076 29.0 5.3 32 233-264 27-58 (75)
147 PF13600 DUF4140: N-terminal d 53.1 28 0.0006 29.6 4.8 32 233-264 72-103 (104)
148 PF04420 CHD5: CHD5-like prote 53.0 47 0.001 31.2 6.7 22 249-270 70-91 (161)
149 PF11932 DUF3450: Protein of u 53.0 36 0.00079 33.4 6.2 10 314-323 165-174 (251)
150 PTZ00454 26S protease regulato 53.0 36 0.00078 36.2 6.6 39 232-270 23-61 (398)
151 PF06637 PV-1: PV-1 protein (P 52.9 38 0.00082 37.2 6.7 57 223-279 323-383 (442)
152 PF09738 DUF2051: Double stran 52.9 31 0.00067 36.1 5.9 55 216-270 104-165 (302)
153 PF05008 V-SNARE: Vesicle tran 52.7 39 0.00085 27.4 5.4 43 227-269 35-78 (79)
154 TIGR01242 26Sp45 26S proteasom 52.4 28 0.0006 35.5 5.5 33 233-265 8-40 (364)
155 KOG1760|consensus 52.2 38 0.00083 32.1 5.8 41 233-273 83-123 (131)
156 PHA02557 22 prohead core prote 52.2 58 0.0012 34.1 7.6 55 199-267 123-177 (271)
157 PRK11281 hypothetical protein; 51.9 54 0.0012 39.7 8.4 70 197-268 38-110 (1113)
158 PRK00736 hypothetical protein; 51.8 42 0.00091 27.9 5.4 32 233-264 21-52 (68)
159 PRK02793 phi X174 lysis protei 51.5 40 0.00088 28.3 5.4 33 233-265 24-56 (72)
160 PTZ00454 26S protease regulato 51.3 39 0.00084 36.0 6.5 38 230-267 28-65 (398)
161 PF07888 CALCOCO1: Calcium bin 50.9 47 0.001 37.6 7.3 9 63-71 24-32 (546)
162 PF04420 CHD5: CHD5-like prote 50.7 29 0.00064 32.6 4.9 36 235-270 37-84 (161)
163 PF14197 Cep57_CLD_2: Centroso 50.6 52 0.0011 27.6 5.9 38 233-270 28-65 (69)
164 KOG2391|consensus 50.5 53 0.0011 35.6 7.2 31 233-263 248-278 (365)
165 PF02388 FemAB: FemAB family; 50.5 44 0.00095 35.3 6.7 12 202-213 218-229 (406)
166 PF04201 TPD52: Tumour protein 50.3 38 0.00083 33.0 5.7 43 234-276 32-86 (162)
167 KOG4797|consensus 50.3 75 0.0016 29.9 7.3 41 230-270 66-111 (123)
168 PF04111 APG6: Autophagy prote 50.3 54 0.0012 34.0 7.2 7 392-398 263-269 (314)
169 KOG3088|consensus 50.2 40 0.00087 35.8 6.2 45 233-281 59-103 (313)
170 PF08286 Spc24: Spc24 subunit 50.0 5.4 0.00012 35.4 0.0 31 239-269 14-44 (118)
171 PF03961 DUF342: Protein of un 49.9 51 0.0011 35.1 7.1 29 242-270 379-407 (451)
172 PF15397 DUF4618: Domain of un 49.9 1E+02 0.0022 32.0 8.9 18 277-297 238-255 (258)
173 PF06632 XRCC4: DNA double-str 49.8 58 0.0013 34.6 7.4 14 444-457 322-335 (342)
174 PRK04325 hypothetical protein; 49.8 44 0.00096 28.2 5.4 33 233-265 25-57 (74)
175 PF05103 DivIVA: DivIVA protei 49.6 9.9 0.00021 33.0 1.6 35 231-265 25-59 (131)
176 PRK14127 cell division protein 49.6 39 0.00085 30.8 5.4 25 246-270 38-62 (109)
177 KOG2991|consensus 49.5 31 0.00068 36.4 5.3 62 188-257 243-304 (330)
178 PF00038 Filament: Intermediat 49.2 97 0.0021 30.8 8.5 81 186-270 175-255 (312)
179 PF01763 Herpes_UL6: Herpesvir 48.9 37 0.00081 38.4 6.1 48 223-270 353-402 (557)
180 PF05529 Bap31: B-cell recepto 48.7 44 0.00096 31.5 5.8 35 234-268 157-191 (192)
181 PF07028 DUF1319: Protein of u 48.6 90 0.0019 29.6 7.6 45 226-270 41-85 (126)
182 PF05659 RPW8: Arabidopsis bro 48.4 47 0.001 31.1 5.9 56 189-245 57-112 (147)
183 TIGR01242 26Sp45 26S proteasom 48.4 34 0.00074 34.9 5.4 39 233-271 1-39 (364)
184 PF02183 HALZ: Homeobox associ 48.2 47 0.001 26.0 4.9 25 236-260 17-41 (45)
185 COG5293 Predicted ATPase [Gene 48.2 53 0.0011 37.1 7.0 85 186-270 326-424 (591)
186 PF06156 DUF972: Protein of un 47.9 89 0.0019 28.2 7.3 41 230-270 14-54 (107)
187 PF02403 Seryl_tRNA_N: Seryl-t 47.8 52 0.0011 28.2 5.6 32 232-263 30-61 (108)
188 PF08537 NBP1: Fungal Nap bind 47.7 2.5E+02 0.0054 30.3 11.5 65 203-270 151-221 (323)
189 PF10883 DUF2681: Protein of u 47.7 1.1E+02 0.0024 27.2 7.7 40 229-268 14-53 (87)
190 PF06103 DUF948: Bacterial pro 47.5 1E+02 0.0022 25.8 7.2 43 226-268 21-63 (90)
191 KOG0837|consensus 47.1 33 0.00071 35.9 5.0 41 230-270 226-266 (279)
192 KOG4010|consensus 47.0 39 0.00085 34.0 5.3 45 235-279 48-104 (208)
193 PF11932 DUF3450: Protein of u 46.7 73 0.0016 31.3 7.2 26 232-257 71-96 (251)
194 PF12718 Tropomyosin_1: Tropom 46.5 56 0.0012 30.4 6.0 37 233-269 23-59 (143)
195 PF10168 Nup88: Nuclear pore c 46.4 65 0.0014 37.2 7.7 84 186-270 539-625 (717)
196 KOG1510|consensus 46.2 93 0.002 30.0 7.4 60 203-270 71-130 (139)
197 PF15058 Speriolin_N: Sperioli 46.0 26 0.00056 35.2 3.9 24 244-267 11-34 (200)
198 PF05781 MRVI1: MRVI1 protein; 45.9 2.1E+02 0.0046 32.6 11.2 91 188-279 221-332 (538)
199 PF06305 DUF1049: Protein of u 45.7 28 0.0006 27.4 3.4 22 236-257 46-67 (68)
200 PF14282 FlxA: FlxA-like prote 45.7 67 0.0014 28.5 6.1 25 246-270 52-76 (106)
201 KOG4001|consensus 45.5 82 0.0018 32.5 7.3 75 196-270 125-217 (259)
202 COG1730 GIM5 Predicted prefold 45.4 61 0.0013 30.8 6.1 15 196-210 35-49 (145)
203 PF07407 Seadorna_VP6: Seadorn 45.3 29 0.00063 37.6 4.4 27 231-257 32-58 (420)
204 cd00890 Prefoldin Prefoldin is 45.1 64 0.0014 27.8 5.8 33 233-265 96-128 (129)
205 PRK13729 conjugal transfer pil 45.1 59 0.0013 36.3 6.8 32 236-267 95-126 (475)
206 PF04350 PilO: Pilus assembly 44.9 9.5 0.00021 33.2 0.7 38 233-270 8-48 (144)
207 PF07195 FliD_C: Flagellar hoo 44.7 35 0.00075 33.3 4.6 39 236-274 191-229 (239)
208 PF09304 Cortex-I_coil: Cortex 44.7 1.3E+02 0.0028 27.9 7.8 53 218-270 13-69 (107)
209 PF10458 Val_tRNA-synt_C: Valy 44.6 42 0.00091 27.2 4.3 28 243-270 2-29 (66)
210 PF10944 DUF2630: Protein of u 44.5 38 0.00082 29.9 4.3 44 230-273 7-50 (81)
211 COG4985 ABC-type phosphate tra 44.0 40 0.00087 35.1 5.1 30 242-271 218-247 (289)
212 KOG3558|consensus 44.0 24 0.00052 41.0 3.8 49 185-234 46-96 (768)
213 TIGR03185 DNA_S_dndD DNA sulfu 44.0 1.4E+02 0.0031 33.3 9.6 38 231-268 428-465 (650)
214 PF12709 Kinetocho_Slk19: Cent 43.9 49 0.0011 29.5 4.9 30 241-270 45-74 (87)
215 PF04201 TPD52: Tumour protein 43.7 70 0.0015 31.3 6.3 28 238-265 29-56 (162)
216 PF11853 DUF3373: Protein of u 43.6 22 0.00048 39.6 3.4 29 242-270 28-56 (489)
217 PF14389 Lzipper-MIP1: Leucine 43.5 43 0.00094 29.0 4.5 35 236-270 52-86 (88)
218 PF12938 M_domain: M domain of 43.4 98 0.0021 31.8 7.6 50 220-269 146-202 (235)
219 COG3883 Uncharacterized protei 43.4 81 0.0018 32.8 7.1 38 233-270 68-105 (265)
220 PF07047 OPA3: Optic atrophy 3 43.3 73 0.0016 29.3 6.2 21 250-270 110-130 (134)
221 PF07334 IFP_35_N: Interferon- 43.2 36 0.00079 29.6 3.9 17 250-266 12-28 (76)
222 COG1792 MreC Cell shape-determ 43.0 1.5E+02 0.0032 30.5 8.9 12 325-336 133-144 (284)
223 PF13874 Nup54: Nucleoporin co 43.0 1.1E+02 0.0025 28.0 7.4 64 203-267 31-94 (141)
224 PF05615 THOC7: Tho complex su 42.8 1.2E+02 0.0025 27.4 7.3 29 240-268 76-104 (139)
225 PF09789 DUF2353: Uncharacteri 42.8 82 0.0018 33.5 7.2 48 223-270 132-179 (319)
226 PF03980 Nnf1: Nnf1 ; InterPr 42.4 61 0.0013 28.1 5.3 33 234-266 76-108 (109)
227 PF11336 DUF3138: Protein of u 42.4 29 0.00063 38.6 4.0 26 245-270 25-50 (514)
228 PF15456 Uds1: Up-regulated Du 42.3 2.5E+02 0.0053 26.1 9.4 73 185-270 41-113 (124)
229 PRK00846 hypothetical protein; 42.2 70 0.0015 27.8 5.5 34 232-265 28-61 (77)
230 COG2919 Septum formation initi 42.2 86 0.0019 28.2 6.3 37 230-267 50-86 (117)
231 PRK12765 flagellar capping pro 42.0 30 0.00064 38.9 4.1 41 236-276 530-570 (595)
232 PF04999 FtsL: Cell division p 41.8 1.3E+02 0.0027 25.5 7.0 28 241-268 38-65 (97)
233 PRK10803 tol-pal system protei 41.5 63 0.0014 32.5 5.9 29 242-270 58-86 (263)
234 TIGR01069 mutS2 MutS2 family p 41.2 79 0.0017 36.6 7.3 45 39-83 298-345 (771)
235 PF04350 PilO: Pilus assembly 41.0 24 0.00052 30.7 2.6 36 234-269 2-40 (144)
236 PRK02195 V-type ATP synthase s 41.0 1.1E+02 0.0023 30.1 7.2 44 227-270 20-63 (201)
237 PF10186 Atg14: UV radiation r 40.9 87 0.0019 30.2 6.6 39 232-270 113-151 (302)
238 COG1256 FlgK Flagellar hook-as 40.7 1.4E+02 0.0031 33.5 9.1 73 196-270 109-188 (552)
239 PF04568 IATP: Mitochondrial A 40.4 1.3E+02 0.0029 27.2 7.1 35 232-268 65-99 (100)
240 PRK06664 fliD flagellar hook-a 40.4 98 0.0021 35.5 7.9 79 195-273 550-635 (661)
241 PF07352 Phage_Mu_Gam: Bacteri 40.3 1.1E+02 0.0023 28.2 6.8 50 226-276 5-62 (149)
242 PRK03992 proteasome-activating 40.0 93 0.002 32.6 7.1 40 231-270 8-47 (389)
243 COG1730 GIM5 Predicted prefold 40.0 1.3E+02 0.0027 28.8 7.3 26 242-267 112-137 (145)
244 KOG0977|consensus 39.9 89 0.0019 35.5 7.3 38 232-269 156-193 (546)
245 PF13815 Dzip-like_N: Iguana/D 39.9 1.5E+02 0.0033 26.4 7.5 45 226-270 61-105 (118)
246 PF03961 DUF342: Protein of un 39.6 72 0.0016 34.0 6.3 35 230-264 374-408 (451)
247 PRK15422 septal ring assembly 39.6 1E+02 0.0023 27.2 6.2 35 233-267 27-68 (79)
248 PRK10265 chaperone-modulator p 39.5 39 0.00085 29.5 3.7 41 222-262 49-95 (101)
249 PHA02621 agnoprotein; Provisio 39.2 12 0.00026 31.8 0.4 22 76-97 2-23 (68)
250 PF02050 FliJ: Flagellar FliJ 38.7 2.1E+02 0.0046 23.2 10.9 70 198-270 22-91 (123)
251 TIGR00293 prefoldin, archaeal 38.7 78 0.0017 27.8 5.5 36 234-269 2-37 (126)
252 KOG0995|consensus 38.6 1E+02 0.0022 35.3 7.5 9 75-83 61-69 (581)
253 PF11544 Spc42p: Spindle pole 38.6 76 0.0016 27.8 5.2 28 234-261 8-35 (76)
254 PRK09039 hypothetical protein; 38.1 1.8E+02 0.0039 30.5 8.8 38 233-270 146-183 (343)
255 PF04728 LPP: Lipoprotein leuc 38.1 1E+02 0.0023 25.6 5.6 37 233-269 5-41 (56)
256 PF14257 DUF4349: Domain of un 38.0 74 0.0016 31.3 5.7 48 233-280 141-197 (262)
257 PF11382 DUF3186: Protein of u 37.9 45 0.00098 34.3 4.4 22 247-268 41-62 (308)
258 KOG2070|consensus 37.7 3.6E+02 0.0077 31.2 11.3 208 17-264 433-656 (661)
259 TIGR01005 eps_transp_fam exopo 37.6 1.4E+02 0.0031 33.6 8.5 18 196-213 252-269 (754)
260 PF10224 DUF2205: Predicted co 37.6 1.6E+02 0.0035 25.7 7.1 24 241-264 40-63 (80)
261 PF11285 DUF3086: Protein of u 37.6 62 0.0014 34.0 5.3 27 233-259 6-32 (283)
262 PF14662 CCDC155: Coiled-coil 37.4 1.2E+02 0.0025 30.6 6.9 39 232-270 75-113 (193)
263 PLN02678 seryl-tRNA synthetase 37.4 84 0.0018 34.5 6.5 27 244-270 77-103 (448)
264 PF06810 Phage_GP20: Phage min 37.2 1.2E+02 0.0026 28.7 6.7 27 229-255 39-68 (155)
265 PF04102 SlyX: SlyX; InterPro 37.2 1.3E+02 0.0029 24.8 6.2 33 237-269 17-49 (69)
266 PF08614 ATG16: Autophagy prot 37.2 1.9E+02 0.004 27.7 8.1 25 249-273 155-182 (194)
267 PHA02562 46 endonuclease subun 36.9 84 0.0018 33.4 6.3 33 236-268 370-402 (562)
268 PF12718 Tropomyosin_1: Tropom 36.7 1.2E+02 0.0026 28.3 6.6 30 233-262 37-66 (143)
269 PRK08032 fliD flagellar cappin 36.7 1.4E+02 0.0029 32.5 7.9 38 236-273 404-441 (462)
270 PRK10929 putative mechanosensi 36.5 1.3E+02 0.0027 36.9 8.2 65 198-266 26-93 (1109)
271 COG1792 MreC Cell shape-determ 36.5 1.1E+02 0.0024 31.4 6.8 10 260-269 91-100 (284)
272 KOG4796|consensus 36.4 1.6E+02 0.0034 33.9 8.4 79 184-273 516-594 (604)
273 PRK12772 bifunctional flagella 36.2 1.3E+02 0.0029 34.0 7.9 88 225-354 458-554 (609)
274 PF01166 TSC22: TSC-22/dip/bun 36.1 78 0.0017 26.7 4.7 24 234-257 17-40 (59)
275 cd04769 HTH_MerR2 Helix-Turn-H 36.1 80 0.0017 27.7 5.1 34 233-266 81-114 (116)
276 KOG0804|consensus 36.1 2.4E+02 0.0053 31.8 9.6 28 187-214 339-366 (493)
277 PRK14011 prefoldin subunit alp 36.1 1.6E+02 0.0035 27.8 7.3 14 197-210 33-46 (144)
278 KOG0996|consensus 36.0 86 0.0019 38.6 6.8 74 196-270 515-595 (1293)
279 COG4079 Uncharacterized protei 35.9 5.1E+02 0.011 27.6 11.4 57 195-253 150-213 (293)
280 cd00584 Prefoldin_alpha Prefol 35.9 92 0.002 27.4 5.5 35 234-268 2-36 (129)
281 PF05103 DivIVA: DivIVA protei 35.7 19 0.00042 31.2 1.2 43 226-268 27-69 (131)
282 PF07412 Geminin: Geminin; In 35.6 75 0.0016 31.9 5.4 38 228-265 122-159 (200)
283 COG1422 Predicted membrane pro 35.6 1.6E+02 0.0034 29.9 7.5 22 238-259 72-93 (201)
284 PRK03992 proteasome-activating 35.5 1E+02 0.0022 32.4 6.6 38 229-266 13-50 (389)
285 TIGR02338 gimC_beta prefoldin, 35.3 1.8E+02 0.0039 25.6 7.1 45 220-264 63-107 (110)
286 PRK13729 conjugal transfer pil 35.2 97 0.0021 34.7 6.6 34 395-438 211-244 (475)
287 PF01736 Polyoma_agno: Polyoma 34.9 11 0.00024 31.5 -0.3 21 76-96 2-22 (62)
288 PF11180 DUF2968: Protein of u 34.7 1.5E+02 0.0033 29.8 7.2 119 147-270 39-186 (192)
289 PF07200 Mod_r: Modifier of ru 34.7 94 0.002 28.1 5.5 42 232-273 42-83 (150)
290 KOG3647|consensus 34.7 95 0.0021 33.1 6.1 32 239-270 134-165 (338)
291 PRK04778 septation ring format 34.7 79 0.0017 34.9 5.9 44 226-269 312-365 (569)
292 PF05384 DegS: Sensor protein 34.7 1.7E+02 0.0036 28.3 7.3 44 226-269 79-122 (159)
293 TIGR01834 PHA_synth_III_E poly 34.6 55 0.0012 34.8 4.5 47 224-270 260-314 (320)
294 PF07445 priB_priC: Primosomal 34.5 2.8E+02 0.0061 26.6 8.8 73 198-270 73-163 (173)
295 PRK11239 hypothetical protein; 34.4 65 0.0014 32.8 4.7 28 241-268 186-213 (215)
296 PRK13169 DNA replication intia 34.4 1.9E+02 0.0042 26.5 7.3 39 232-270 16-54 (110)
297 PF09006 Surfac_D-trimer: Lung 34.2 69 0.0015 25.9 3.9 23 248-270 2-24 (46)
298 TIGR01069 mutS2 MutS2 family p 34.2 91 0.002 36.1 6.5 8 448-455 721-728 (771)
299 PF13600 DUF4140: N-terminal d 34.2 1E+02 0.0022 26.2 5.3 29 239-267 71-99 (104)
300 KOG3335|consensus 34.1 80 0.0017 31.5 5.2 31 240-270 101-131 (181)
301 PF07926 TPR_MLP1_2: TPR/MLP1/ 34.1 1.7E+02 0.0036 26.6 6.9 36 233-268 86-121 (132)
302 TIGR02977 phageshock_pspA phag 34.0 1.6E+02 0.0034 28.8 7.2 18 197-214 65-82 (219)
303 PF11855 DUF3375: Protein of u 34.0 78 0.0017 34.5 5.6 46 194-239 66-115 (478)
304 PF08317 Spc7: Spc7 kinetochor 33.9 1.1E+02 0.0024 31.5 6.5 67 198-264 166-242 (325)
305 PF08614 ATG16: Autophagy prot 33.7 1.8E+02 0.004 27.7 7.4 20 242-261 162-181 (194)
306 PF05600 DUF773: Protein of un 33.6 83 0.0018 34.9 5.8 48 223-270 424-471 (507)
307 PF05812 Herpes_BLRF2: Herpesv 33.6 59 0.0013 30.4 4.0 22 241-262 6-27 (118)
308 PF10211 Ax_dynein_light: Axon 33.6 1.3E+02 0.0027 29.2 6.4 47 220-266 106-155 (189)
309 PF08286 Spc24: Spc24 subunit 33.5 15 0.00032 32.7 0.2 38 233-270 1-38 (118)
310 PF08961 DUF1875: Domain of un 33.5 14 0.0003 37.9 0.0 43 223-265 121-163 (243)
311 PF14282 FlxA: FlxA-like prote 33.5 1.5E+02 0.0032 26.4 6.3 14 255-268 54-67 (106)
312 PF07106 TBPIP: Tat binding pr 33.4 72 0.0016 29.6 4.6 15 253-267 117-131 (169)
313 PF10018 Med4: Vitamin-D-recep 33.3 1.8E+02 0.0039 27.8 7.3 35 236-270 27-61 (188)
314 KOG0971|consensus 33.2 1.9E+02 0.0042 35.3 8.8 20 194-213 338-357 (1243)
315 TIGR01462 greA transcription e 33.2 1.5E+02 0.0033 27.3 6.7 30 249-279 46-75 (151)
316 PF15290 Syntaphilin: Golgi-lo 33.1 1.3E+02 0.0028 32.1 6.7 26 246-271 118-143 (305)
317 PRK07737 fliD flagellar cappin 33.0 1.6E+02 0.0034 32.5 7.8 79 195-273 385-476 (501)
318 cd07429 Cby_like Chibby, a nuc 32.9 90 0.002 28.7 5.0 34 230-263 71-104 (108)
319 smart00340 HALZ homeobox assoc 32.9 94 0.002 24.9 4.4 17 249-265 16-32 (44)
320 cd01109 HTH_YyaN Helix-Turn-He 32.7 2.3E+02 0.005 24.6 7.3 24 245-268 86-109 (113)
321 PF04012 PspA_IM30: PspA/IM30 32.7 1.7E+02 0.0038 27.9 7.2 76 190-267 54-134 (221)
322 smart00434 TOP4c DNA Topoisome 32.7 2.3E+02 0.005 31.0 8.8 41 229-270 400-440 (445)
323 PF05597 Phasin: Poly(hydroxya 32.7 2.8E+02 0.006 26.1 8.2 18 191-208 50-67 (132)
324 PRK14011 prefoldin subunit alp 32.7 1.3E+02 0.0028 28.5 6.1 33 226-258 86-122 (144)
325 smart00340 HALZ homeobox assoc 32.6 67 0.0015 25.7 3.6 16 235-250 16-31 (44)
326 PF13863 DUF4200: Domain of un 32.6 1.7E+02 0.0037 25.5 6.5 30 241-270 77-106 (126)
327 PRK09413 IS2 repressor TnpA; R 32.5 97 0.0021 27.4 5.1 28 241-268 74-101 (121)
328 PF12325 TMF_TATA_bd: TATA ele 32.5 2.5E+02 0.0054 26.0 7.8 29 223-251 15-43 (120)
329 PRK00295 hypothetical protein; 32.5 1.7E+02 0.0037 24.4 6.1 35 236-270 17-51 (68)
330 PF05929 Phage_GPO: Phage caps 32.5 1.9E+02 0.004 30.3 7.7 79 191-270 163-253 (276)
331 PRK00409 recombination and DNA 32.5 1E+02 0.0022 35.8 6.5 7 41-47 305-311 (782)
332 KOG4005|consensus 32.2 1.1E+02 0.0024 32.1 6.0 78 182-267 69-147 (292)
333 PF12126 DUF3583: Protein of u 32.1 1.4E+02 0.003 32.1 6.8 67 203-269 18-85 (324)
334 PRK09343 prefoldin subunit bet 32.1 2.3E+02 0.0051 25.6 7.5 77 194-270 37-117 (121)
335 PF05278 PEARLI-4: Arabidopsis 32.1 1E+02 0.0022 32.3 5.8 26 245-270 214-239 (269)
336 PRK05771 V-type ATP synthase s 32.1 2.3E+02 0.005 31.7 8.9 27 241-267 96-122 (646)
337 PF12999 PRKCSH-like: Glucosid 32.0 2.3E+02 0.0049 28.0 7.9 16 190-205 120-135 (176)
338 TIGR01834 PHA_synth_III_E poly 32.0 1.9E+02 0.004 31.0 7.8 19 251-269 288-306 (320)
339 PF05615 THOC7: Tho complex su 32.0 2.7E+02 0.0059 25.1 7.9 51 220-270 62-113 (139)
340 PF11382 DUF3186: Protein of u 31.9 98 0.0021 31.9 5.7 35 233-267 34-68 (308)
341 PF08826 DMPK_coil: DMPK coile 31.8 2.9E+02 0.0064 23.1 7.4 37 233-269 20-56 (61)
342 PF14235 DUF4337: Domain of un 31.4 1.9E+02 0.004 27.6 7.0 32 236-267 71-102 (157)
343 PRK00736 hypothetical protein; 31.4 1.9E+02 0.0042 24.0 6.3 35 236-270 17-51 (68)
344 PF14817 HAUS5: HAUS augmin-li 31.3 66 0.0014 36.9 4.7 32 239-270 80-111 (632)
345 PF14303 NAM-associated: No ap 31.3 3.7E+02 0.008 24.0 8.6 43 246-288 104-147 (154)
346 PF15619 Lebercilin: Ciliary p 31.2 2E+02 0.0042 28.3 7.3 47 224-270 5-51 (194)
347 PF06785 UPF0242: Uncharacteri 31.1 1E+02 0.0022 33.6 5.8 22 233-254 136-157 (401)
348 PRK04406 hypothetical protein; 31.1 1.7E+02 0.0036 25.0 6.0 28 241-268 28-55 (75)
349 PF07334 IFP_35_N: Interferon- 30.9 87 0.0019 27.4 4.3 22 249-270 4-25 (76)
350 PF08232 Striatin: Striatin fa 30.8 1.4E+02 0.003 27.7 5.9 36 233-268 34-69 (134)
351 COG2919 Septum formation initi 30.7 2.6E+02 0.0055 25.2 7.4 47 224-270 36-82 (117)
352 TIGR03007 pepcterm_ChnLen poly 30.7 4.2E+02 0.0092 28.2 10.2 11 198-208 221-231 (498)
353 KOG0995|consensus 30.7 1.4E+02 0.003 34.3 7.0 23 228-250 277-299 (581)
354 PF10146 zf-C4H2: Zinc finger- 30.6 80 0.0017 31.9 4.7 50 218-267 47-103 (230)
355 PHA02562 46 endonuclease subun 30.5 5.5E+02 0.012 27.5 11.0 6 210-215 286-291 (562)
356 KOG2010|consensus 30.4 1.4E+02 0.003 32.6 6.6 35 236-270 166-200 (405)
357 PF12017 Tnp_P_element: Transp 30.4 1.2E+02 0.0027 30.7 6.0 30 225-254 11-41 (236)
358 PF14584 DUF4446: Protein of u 30.3 1E+02 0.0023 29.2 5.2 35 236-270 44-78 (151)
359 PF05010 TACC: Transforming ac 30.2 3.3E+02 0.0072 27.3 8.8 72 190-267 86-158 (207)
360 PF12795 MscS_porin: Mechanose 30.2 2E+02 0.0044 28.1 7.3 51 217-267 14-67 (240)
361 PF04380 BMFP: Membrane fusoge 30.1 82 0.0018 26.7 4.0 42 225-266 36-78 (79)
362 cd01106 HTH_TipAL-Mta Helix-Tu 30.0 1.5E+02 0.0033 25.3 5.7 10 187-196 39-48 (103)
363 PF04999 FtsL: Cell division p 29.9 3.4E+02 0.0074 22.9 7.8 34 233-266 37-70 (97)
364 cd07429 Cby_like Chibby, a nuc 29.9 1.2E+02 0.0026 28.0 5.2 34 237-270 71-104 (108)
365 PF04325 DUF465: Protein of un 29.9 74 0.0016 24.6 3.4 23 244-266 26-48 (49)
366 PF11559 ADIP: Afadin- and alp 29.7 2.8E+02 0.006 25.2 7.6 11 197-207 54-64 (151)
367 PHA03162 hypothetical protein; 29.7 66 0.0014 30.8 3.7 21 241-261 16-36 (135)
368 TIGR03545 conserved hypothetic 29.6 2.7E+02 0.0059 31.5 9.0 43 225-267 213-259 (555)
369 PF09849 DUF2076: Uncharacteri 29.6 2.7E+02 0.0058 28.7 8.2 17 197-213 6-22 (247)
370 TIGR02168 SMC_prok_B chromosom 29.6 2.3E+02 0.005 32.2 8.5 10 20-29 23-32 (1179)
371 KOG1265|consensus 29.5 85 0.0018 38.0 5.2 42 224-268 1128-1169(1189)
372 KOG4715|consensus 29.5 2.7E+02 0.0059 30.5 8.5 67 190-262 193-259 (410)
373 cd00187 TOP4c DNA Topoisomeras 29.4 1.8E+02 0.0038 32.0 7.3 25 245-269 405-429 (445)
374 KOG3560|consensus 29.2 46 0.001 38.2 3.0 41 194-234 34-75 (712)
375 PF10359 Fmp27_WPPW: RNA pol I 29.1 1.9E+02 0.0041 31.6 7.5 73 183-268 158-230 (475)
376 PF04859 DUF641: Plant protein 29.1 1E+02 0.0022 29.1 4.7 16 191-206 51-66 (131)
377 PF06632 XRCC4: DNA double-str 29.1 1.6E+02 0.0035 31.4 6.8 13 258-270 193-205 (342)
378 PRK02119 hypothetical protein; 29.1 2E+02 0.0044 24.3 6.1 35 236-270 21-55 (73)
379 PF13851 GAS: Growth-arrest sp 28.9 2.5E+02 0.0054 27.5 7.6 25 243-267 98-122 (201)
380 PF14645 Chibby: Chibby family 28.8 1.2E+02 0.0027 27.7 5.2 36 233-268 73-108 (116)
381 KOG4603|consensus 28.7 2E+02 0.0044 29.0 6.9 45 221-265 93-143 (201)
382 PF13747 DUF4164: Domain of un 28.6 2.7E+02 0.0059 24.3 7.0 21 233-253 41-61 (89)
383 PRK02793 phi X174 lysis protei 28.5 2.1E+02 0.0046 24.1 6.1 35 236-270 20-54 (72)
384 PF15546 DUF4653: Domain of un 28.5 74 0.0016 32.6 4.0 60 229-293 158-229 (239)
385 PRK00409 recombination and DNA 28.5 1.6E+02 0.0034 34.3 7.2 13 190-202 489-501 (782)
386 KOG1853|consensus 28.5 1.5E+02 0.0033 31.4 6.4 42 226-267 142-186 (333)
387 TIGR02473 flagell_FliJ flagell 28.3 3.4E+02 0.0073 23.7 7.7 46 222-267 59-104 (141)
388 PF15456 Uds1: Up-regulated Du 28.3 82 0.0018 29.1 4.0 26 245-270 22-47 (124)
389 TIGR03185 DNA_S_dndD DNA sulfu 28.3 2.7E+02 0.0058 31.2 8.7 15 521-535 604-618 (650)
390 PHA03155 hypothetical protein; 28.3 69 0.0015 30.0 3.5 20 242-261 12-31 (115)
391 PF08606 Prp19: Prp19/Pso4-lik 28.3 1.3E+02 0.0027 26.1 4.8 28 243-270 13-40 (70)
392 PF15460 SAS4: Something about 28.2 95 0.0021 28.0 4.3 50 233-290 12-62 (101)
393 PF07106 TBPIP: Tat binding pr 28.0 1.2E+02 0.0026 28.2 5.1 22 245-266 116-137 (169)
394 PF11083 Streptin-Immun: Lanti 28.0 1.5E+02 0.0034 27.1 5.6 20 222-241 14-33 (99)
395 KOG0946|consensus 28.0 1.6E+02 0.0035 35.4 7.0 19 185-203 581-599 (970)
396 KOG4378|consensus 28.0 6.2E+02 0.014 29.4 11.2 144 122-269 493-667 (673)
397 PF13805 Pil1: Eisosome compon 27.9 1.7E+02 0.0038 30.6 6.7 19 194-212 144-162 (271)
398 PRK00247 putative inner membra 27.9 3.6E+02 0.0079 29.8 9.3 83 176-261 274-361 (429)
399 TIGR03545 conserved hypothetic 27.9 1.3E+02 0.0027 34.0 6.1 86 186-273 163-254 (555)
400 KOG0837|consensus 27.9 1.3E+02 0.0028 31.7 5.7 27 236-262 239-265 (279)
401 PRK10328 DNA binding protein, 27.9 4.1E+02 0.0089 25.1 8.5 36 245-282 42-77 (134)
402 PF13863 DUF4200: Domain of un 27.7 2.5E+02 0.0054 24.4 6.7 30 236-265 79-108 (126)
403 PF10205 KLRAQ: Predicted coil 27.6 1.8E+02 0.004 26.6 6.0 34 236-269 31-64 (102)
404 TIGR03683 A-tRNA_syn_arch alan 27.5 1.6E+02 0.0035 35.0 7.1 8 340-347 874-881 (902)
405 PF00261 Tropomyosin: Tropomyo 27.4 1.7E+02 0.0038 28.7 6.3 38 233-270 178-215 (237)
406 PF12777 MT: Microtubule-bindi 27.4 2.3E+02 0.0051 29.4 7.5 46 224-269 235-280 (344)
407 PRK00226 greA transcription el 27.4 2.5E+02 0.0054 26.0 7.0 28 251-279 53-80 (157)
408 PF10224 DUF2205: Predicted co 27.4 2.2E+02 0.0047 25.0 6.1 38 232-269 24-61 (80)
409 COG3937 Uncharacterized conser 27.4 1.1E+02 0.0024 28.4 4.6 24 245-268 83-106 (108)
410 PF08232 Striatin: Striatin fa 27.3 1.7E+02 0.0037 27.1 5.9 29 242-270 36-64 (134)
411 COG1579 Zn-ribbon protein, pos 27.3 2.2E+02 0.0049 29.2 7.2 26 189-214 37-64 (239)
412 PF09812 MRP-L28: Mitochondria 27.2 5.7E+02 0.012 24.6 12.5 26 242-267 86-111 (157)
413 PF03955 Adeno_PIX: Adenovirus 27.2 36 0.00078 31.4 1.5 29 236-264 81-109 (109)
414 PF07989 Microtub_assoc: Micro 27.2 1.2E+02 0.0027 25.8 4.6 21 248-268 39-59 (75)
415 TIGR02231 conserved hypothetic 27.2 1.7E+02 0.0038 31.7 6.8 45 223-267 126-174 (525)
416 PRK00578 prfB peptide chain re 27.1 4.2E+02 0.0091 28.7 9.5 47 198-244 26-79 (367)
417 KOG1029|consensus 27.1 3.3E+02 0.0072 33.0 9.2 52 325-377 652-703 (1118)
418 PF07047 OPA3: Optic atrophy 3 27.0 4.8E+02 0.01 24.0 8.7 28 239-266 106-133 (134)
419 TIGR02492 flgK_ends flagellar 27.0 4.1E+02 0.0088 27.3 9.1 51 219-269 126-183 (322)
420 PF08657 DASH_Spc34: DASH comp 27.0 2.9E+02 0.0062 28.5 7.9 53 199-264 161-213 (259)
421 PF14661 HAUS6_N: HAUS augmin- 27.0 2.8E+02 0.0061 27.6 7.7 46 223-268 160-208 (247)
422 PF14971 DUF4510: Domain of un 27.0 1.4E+02 0.0031 29.3 5.5 67 183-253 67-141 (163)
423 PF08287 DASH_Spc19: Spc19; I 26.9 1.2E+02 0.0026 28.8 5.0 48 220-270 53-100 (153)
424 KOG4709|consensus 26.9 2.7E+02 0.0059 28.5 7.6 24 236-259 48-71 (217)
425 KOG2417|consensus 26.9 3.8E+02 0.0083 29.9 9.1 83 192-279 203-295 (462)
426 COG1842 PspA Phage shock prote 26.8 2E+02 0.0043 29.0 6.7 70 200-274 4-73 (225)
427 PRK05431 seryl-tRNA synthetase 26.8 1.3E+02 0.0028 32.4 5.8 32 242-273 70-101 (425)
428 PF04340 DUF484: Protein of un 26.8 1.8E+02 0.0039 28.2 6.2 39 232-270 48-89 (225)
429 PF04340 DUF484: Protein of un 26.7 79 0.0017 30.6 3.8 21 246-266 48-68 (225)
430 PF04012 PspA_IM30: PspA/IM30 26.7 2E+02 0.0044 27.4 6.5 41 232-273 31-71 (221)
431 PF15397 DUF4618: Domain of un 26.6 1.4E+02 0.0031 31.0 5.7 73 196-269 14-105 (258)
432 PF11285 DUF3086: Protein of u 26.6 1.4E+02 0.003 31.6 5.6 35 238-272 4-38 (283)
433 PRK10361 DNA recombination pro 26.5 2.5E+02 0.0054 31.5 7.9 20 472-491 325-344 (475)
434 PRK10265 chaperone-modulator p 26.5 1.8E+02 0.0039 25.5 5.6 25 248-273 74-98 (101)
435 PF09755 DUF2046: Uncharacteri 26.4 1.9E+02 0.0041 30.9 6.7 38 230-267 141-178 (310)
436 TIGR03017 EpsF chain length de 26.3 6.5E+02 0.014 26.2 10.5 18 218-235 279-296 (444)
437 PRK03918 chromosome segregatio 26.3 2.1E+02 0.0045 32.4 7.4 75 194-270 265-339 (880)
438 PF07798 DUF1640: Protein of u 26.3 1.8E+02 0.004 27.5 6.0 21 248-268 76-96 (177)
439 PF10146 zf-C4H2: Zinc finger- 26.2 5.7E+02 0.012 26.0 9.7 65 237-313 59-124 (230)
440 PRK01919 tatB sec-independent 26.2 2.3E+02 0.005 28.0 6.8 46 223-269 33-78 (169)
441 KOG0976|consensus 26.1 2.5E+02 0.0054 34.1 8.0 73 190-271 84-160 (1265)
442 PF05266 DUF724: Protein of un 26.0 2.2E+02 0.0047 28.0 6.6 44 227-270 106-149 (190)
443 KOG0982|consensus 26.0 4E+02 0.0087 30.2 9.2 64 205-268 314-383 (502)
444 KOG3915|consensus 26.0 2E+02 0.0043 32.9 7.0 13 222-234 508-520 (641)
445 PF13864 Enkurin: Calmodulin-b 25.9 4.4E+02 0.0095 22.8 8.0 38 232-269 45-91 (98)
446 PF06785 UPF0242: Uncharacteri 25.9 1.6E+02 0.0034 32.3 6.0 37 225-261 121-157 (401)
447 COG3027 zapA Cell division pro 25.8 3.5E+02 0.0076 24.5 7.4 63 200-269 32-96 (105)
448 PF11461 RILP: Rab interacting 25.8 1.3E+02 0.0028 25.3 4.3 32 239-270 4-35 (60)
449 PRK04325 hypothetical protein; 25.7 2.7E+02 0.0059 23.5 6.3 36 233-268 11-53 (74)
450 PF11460 DUF3007: Protein of u 25.7 81 0.0018 29.0 3.4 29 237-267 65-93 (104)
451 KOG0161|consensus 25.7 1.9E+02 0.004 37.5 7.5 44 226-269 1071-1114(1930)
452 PF04859 DUF641: Plant protein 25.7 1.9E+02 0.0042 27.2 6.0 36 233-268 82-117 (131)
453 COG3416 Uncharacterized protei 25.6 2.9E+02 0.0063 28.6 7.6 68 198-267 7-77 (233)
454 PF14645 Chibby: Chibby family 25.6 1.9E+02 0.0042 26.4 5.8 36 235-270 68-103 (116)
455 COG1579 Zn-ribbon protein, pos 25.6 1.7E+02 0.0038 30.0 6.1 36 233-268 105-140 (239)
456 cd04786 HTH_MerR-like_sg7 Heli 25.5 1.5E+02 0.0033 27.0 5.2 35 231-265 78-112 (131)
457 PF03962 Mnd1: Mnd1 family; I 25.3 1.5E+02 0.0033 28.7 5.5 57 194-266 75-131 (188)
458 cd01111 HTH_MerD Helix-Turn-He 25.3 1.8E+02 0.004 25.6 5.5 21 249-269 84-104 (107)
459 cd04766 HTH_HspR Helix-Turn-He 25.3 1.8E+02 0.0039 24.4 5.2 21 249-269 69-89 (91)
460 PF12329 TMF_DNA_bd: TATA elem 25.3 2.6E+02 0.0056 23.6 6.1 26 243-268 31-56 (74)
461 PF10454 DUF2458: Protein of u 25.1 2.2E+02 0.0047 27.1 6.3 49 222-270 4-52 (150)
462 PRK09973 putative outer membra 25.1 2.1E+02 0.0046 25.5 5.8 45 233-277 26-73 (85)
463 PF07996 T4SS: Type IV secreti 25.1 1.7E+02 0.0037 27.3 5.5 36 246-281 20-55 (195)
464 COG4192 Signal transduction hi 25.1 1.4E+02 0.003 34.2 5.7 72 195-279 85-156 (673)
465 PF14661 HAUS6_N: HAUS augmin- 25.0 3.2E+02 0.0068 27.2 7.7 39 223-261 170-208 (247)
466 PF04880 NUDE_C: NUDE protein, 25.0 56 0.0012 31.8 2.4 30 238-268 24-53 (166)
467 TIGR01005 eps_transp_fam exopo 24.9 5.5E+02 0.012 29.1 10.3 26 245-270 376-401 (754)
468 TIGR00344 alaS alanine--tRNA l 24.9 2E+02 0.0043 34.1 7.1 22 233-254 728-749 (851)
469 KOG0971|consensus 24.8 6E+02 0.013 31.5 10.8 38 233-270 1014-1051(1243)
470 COG3105 Uncharacterized protei 24.8 1.8E+02 0.0038 28.1 5.6 70 234-303 37-112 (138)
471 PF11598 COMP: Cartilage oligo 24.8 2E+02 0.0044 22.9 5.0 33 233-265 10-42 (45)
472 KOG0827|consensus 24.7 68 0.0015 35.5 3.2 83 157-243 62-144 (465)
473 PF12443 AKNA: AT-hook-contain 24.7 1E+02 0.0022 28.4 3.8 67 185-269 10-76 (106)
474 PRK06798 fliD flagellar cappin 24.7 82 0.0018 34.2 3.8 37 237-273 378-414 (440)
475 KOG1029|consensus 24.6 3.9E+02 0.0085 32.4 9.2 49 220-268 527-579 (1118)
476 PRK11546 zraP zinc resistance 24.5 3E+02 0.0064 26.5 7.0 22 247-268 91-112 (143)
477 PRK09108 type III secretion sy 24.5 3.1E+02 0.0067 29.3 7.9 90 225-356 197-295 (353)
478 KOG1899|consensus 24.5 1.7E+02 0.0037 34.4 6.3 28 351-378 345-372 (861)
479 PF06103 DUF948: Bacterial pro 24.5 4.2E+02 0.0091 22.2 7.3 42 226-267 28-69 (90)
480 KOG0286|consensus 24.5 85 0.0018 33.8 3.8 25 245-269 2-26 (343)
481 PF07321 YscO: Type III secret 24.4 4E+02 0.0087 25.5 7.9 65 196-269 48-112 (152)
482 COG3883 Uncharacterized protei 24.4 2.5E+02 0.0053 29.4 7.0 12 254-265 82-93 (265)
483 PF10174 Cast: RIM-binding pro 24.3 2E+02 0.0044 33.9 7.0 45 226-270 109-153 (775)
484 TIGR01837 PHA_granule_1 poly(h 24.3 3E+02 0.0064 25.0 6.7 21 193-213 39-59 (118)
485 cd04770 HTH_HMRTR Helix-Turn-H 24.1 1.7E+02 0.0037 25.6 5.1 24 233-256 81-104 (123)
486 cd04776 HTH_GnyR Helix-Turn-He 24.1 2.5E+02 0.0053 25.1 6.1 27 236-262 85-111 (118)
487 PF09766 FimP: Fms-interacting 24.1 1.7E+02 0.0037 30.9 5.9 45 225-269 95-139 (355)
488 PRK15396 murein lipoprotein; P 24.0 2.2E+02 0.0049 24.8 5.6 38 233-270 27-64 (78)
489 PF14257 DUF4349: Domain of un 24.0 1.7E+02 0.0036 28.9 5.5 37 243-279 167-203 (262)
490 PF08580 KAR9: Yeast cortical 24.0 2.7E+02 0.0057 32.4 7.8 18 198-215 195-212 (683)
491 smart00787 Spc7 Spc7 kinetocho 23.8 3.6E+02 0.0079 28.3 8.1 17 199-215 162-178 (312)
492 PF06216 RTBV_P46: Rice tungro 23.8 2.9E+02 0.0062 29.6 7.4 47 219-268 55-101 (389)
493 PF15003 HAUS2: HAUS augmin-li 23.8 3.7E+02 0.008 28.5 8.1 86 221-308 134-231 (277)
494 PRK00846 hypothetical protein; 23.7 1.7E+02 0.0036 25.5 4.8 36 233-268 15-57 (77)
495 PF13094 CENP-Q: CENP-Q, a CEN 23.7 3.5E+02 0.0075 25.0 7.2 45 226-270 36-80 (160)
496 PF09726 Macoilin: Transmembra 23.6 4.8E+02 0.01 30.4 9.7 81 190-270 425-513 (697)
497 PRK10698 phage shock protein P 23.6 2.7E+02 0.0059 27.6 6.9 41 227-267 95-135 (222)
498 PRK08724 fliD flagellar cappin 23.6 1.1E+02 0.0023 35.6 4.7 41 236-276 615-655 (673)
499 PRK05561 DNA topoisomerase IV 23.6 3.7E+02 0.008 31.4 8.8 83 185-269 369-461 (742)
500 PF01025 GrpE: GrpE; InterPro 23.6 2.6E+02 0.0056 25.4 6.3 38 233-270 20-57 (165)
No 1
>KOG2483|consensus
Probab=99.79 E-value=1.2e-18 Score=169.18 Aligned_cols=103 Identities=40% Similarity=0.524 Sum_probs=91.6
Q ss_pred CCCCCCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 179 RSGISGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQ 258 (561)
Q Consensus 179 RPg~SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQ 258 (561)
+...++.|+.||+|||+||++||+||+.||+.||.+.++++...+.||+||.+||++|++++.++++++|+|++|++.|+
T Consensus 53 ~~~~~~~R~~HN~LEk~RRahlk~~~~~Lk~~vP~~~~~~~~t~lsiL~kA~~~i~~l~~~~~~~~~~~e~l~~e~~~l~ 132 (232)
T KOG2483|consen 53 TSSAASSRAHHNALEKRRRAHLKDCFESLKDSVPLLNGETRSTTLSILDKALEHIQSLERKSATQQQDIEDLSRENRKLK 132 (232)
T ss_pred CCCCCcchhhhhhhhHHHHHHHHHHHHHHHHhCCCCCCcchhhhhHhhhhHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 33567899999999999999999999999999999877666668999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhhhcccccccCCCCCccccccccC
Q psy11481 259 QRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDN 293 (561)
Q Consensus 259 QRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQ 293 (561)
+||++|+.... |...++.++.+|
T Consensus 133 ~rl~ql~~~~s------------~a~~~~~~~~s~ 155 (232)
T KOG2483|consen 133 ARLEQLSLPQS------------PASSEAIRLDSQ 155 (232)
T ss_pred HHHHHhcCccc------------cchhhhhhcccc
Confidence 99999986543 366677777766
No 2
>KOG1318|consensus
Probab=99.70 E-value=1.9e-16 Score=164.06 Aligned_cols=134 Identities=19% Similarity=0.227 Sum_probs=115.9
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCC-CcCCcchHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQ-EEKKSSNLSILHSAIRYIQFLRRRERE---FEHEMERLAREKIHAQ 258 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLs-Dn~KaSKAsILrKAiDYIQkLQreEre---LE~EkERLRREneaLQ 258 (561)
..||..||++||+||++||+|+.+|..+||.+. +..|..|..||+++||||+.|++++++ .+...++|+..++.|.
T Consensus 231 r~Krd~HNeVERRRR~nIN~~IkeLg~liP~~~~~~~~~nKgtILk~s~dYIr~Lqq~~q~~~E~~~rqk~le~~n~~L~ 310 (411)
T KOG1318|consen 231 RRKRDNHNEVERRRRENINDRIKELGQLIPKCNSEDMKSNKGTILKASCDYIRELQQTLQRARELENRQKKLESTNQELA 310 (411)
T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHhCCCCCcchhhcccchhhHHHHHHHHHHHHHHHHHHHHHhhhhHHHhHHHHHH
Confidence 457999999999999999999999999999974 346777999999999999999999874 4456677889999999
Q ss_pred HHHHHHHHHHhhhhcccccccCCCCCccccccccCccCCCcccccCCCCCCcCCCCceeEeecCcceeeec
Q psy11481 259 QRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDNHHHESSLLSYGKERSYMDEDGGLVIVTNGSVGLERS 329 (561)
Q Consensus 259 QRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQa~~DSsilS~~~~~~~~~~~g~l~~vtn~~~~m~~~ 329 (561)
+|+++||.++...|.+++.+.+.....-+++.+++.... .++++.+.+++++++++.-
T Consensus 311 ~rieeLk~~~~~~~~~~~~~~~~~~~~~~e~~i~~e~~~-------------l~~~~~s~~~~h~~~~~~~ 368 (411)
T KOG1318|consen 311 LRIEELKSEAGRHGLQVEPSKLTNSLEGSEQIIKEESDS-------------LENEQQSQETLHTAEGPST 368 (411)
T ss_pred HHHHHHHHHHHHhcCccccccccccchhhhhhhhhcchh-------------hhcccccccccccCCCCcc
Confidence 999999999999999999999999888888888776544 5677777888887777665
No 3
>PF00010 HLH: Helix-loop-helix DNA-binding domain only nuclear translocator protein (Arnt).; InterPro: IPR011598 The helix-loop-helix (HLH) DNA-binding domain consists of a closed bundle of four helices in a left-handed twist with two crossover connections. The HLH domain directs dimerisation, and is juxtaposed to basic regions to create a DNA interaction interface surface that recognises specific DNA sequences. Basic region/HLH (bHLH) proteins regulate diverse biological pathways []. bHLH proteins include MyoD [], SREBPs (sterol regulatory element binding proteins) [], and yeast Pho4 (phosphatase system) []. In certain proteins the bHLH domain contains a leucine-zipper motif. The bHLH/leucine zipper (bHLHZip) domain specifies dimerisation within a network of proteins and determines sequence-specific DNA binding []. bHLHZip domains occur in the transcription factors Myc, Mad, Max and Usf [, ]. This entry is bHLHZip, which covers the bHLH domain and the leucine zipper motif, when present.; PDB: 1NLW_A 1NKP_D 1A93_A 2A93_A 1AM9_C 3U5V_A 1A0A_B 2QL2_C 1UKL_C 1AN4_B ....
Probab=99.45 E-value=1.7e-13 Score=103.94 Aligned_cols=53 Identities=38% Similarity=0.672 Sum_probs=49.5
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCC--CCcCCcchHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPA--QEEKKSSNLSILHSAIRYIQFLR 237 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsL--sDn~KaSKAsILrKAiDYIQkLQ 237 (561)
+|..||..||+||+.||+||++|+++||.. ....|.+|+.||.+|++||++||
T Consensus 1 rR~~h~~~Er~RR~~i~~~~~~L~~llp~~~~~~~~k~~K~~iL~~ai~yI~~Lq 55 (55)
T PF00010_consen 1 RRQKHNERERRRRDRINDCFDELRELLPSCSAGSSRKLSKASILQKAIDYIKQLQ 55 (55)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHCCSSHHCCTTSSSSHHHHHHHHHHHHHHHH
T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHhccchhccccccCCHHHHHHHHHHHHHHhC
Confidence 589999999999999999999999999995 25688999999999999999996
No 4
>KOG1319|consensus
Probab=99.39 E-value=1.5e-12 Score=125.30 Aligned_cols=88 Identities=27% Similarity=0.425 Sum_probs=79.2
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCc----CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEE----KKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQ 258 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn----~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQ 258 (561)
..||++|...||+||+.||.+|+.|+++||.+..+ .|.+|+.||+|+||||.+|+++..+.++|...|+++.-+|+
T Consensus 60 ~rrr~aHtqaEqkRRdAIk~GYddLq~LvP~cq~~ds~g~KlskA~ILqksidyi~~L~~~k~kqe~e~s~L~k~vtAL~ 139 (229)
T KOG1319|consen 60 DRRRRAHTQAEQKRRDAIKRGYDDLQTLVPTCQQQDSIGQKLSKAIILQKTIDYIQFLHKEKKKQEEEVSTLRKDVTALK 139 (229)
T ss_pred HHHHHHHHHHHHHHHHHHHhchHHHHHhccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45799999999999999999999999999998643 49999999999999999999999999999999999999997
Q ss_pred HHHHHHHHHHhh
Q psy11481 259 QRLALLKKELSA 270 (561)
Q Consensus 259 QRLeELKqELsQ 270 (561)
-=-.+.++.+..
T Consensus 140 iIk~~YEqM~~~ 151 (229)
T KOG1319|consen 140 IIKVNYEQMVKA 151 (229)
T ss_pred HHHHHHHHHHHh
Confidence 666677766654
No 5
>cd00083 HLH Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and
Probab=99.37 E-value=1.1e-12 Score=98.94 Aligned_cols=57 Identities=33% Similarity=0.507 Sum_probs=53.3
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRE 240 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreE 240 (561)
.+|..||..||+||+.||++|+.|+++||....+.|.+|+.||..|++||+.|+.+.
T Consensus 3 ~~r~~~~~~Er~RR~~~n~~~~~L~~llp~~~~~~k~~k~~iL~~a~~yI~~L~~~~ 59 (60)
T cd00083 3 SRREAHNLRERRRRERINDAFDELRSLLPTLPPSKKLSKAEILRKAVDYIKSLQELL 59 (60)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHh
Confidence 478899999999999999999999999999776789999999999999999999864
No 6
>smart00353 HLH helix loop helix domain.
Probab=99.21 E-value=3.2e-11 Score=89.89 Aligned_cols=52 Identities=35% Similarity=0.525 Sum_probs=48.0
Q ss_pred hHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRER 241 (561)
Q Consensus 190 NaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEr 241 (561)
|.+||+||+.||++|+.|+.+||......|.+|+.||..|++||++|+++.+
T Consensus 1 n~~Er~RR~~~n~~~~~L~~lip~~~~~~k~~k~~iL~~ai~yi~~L~~~~~ 52 (53)
T smart00353 1 NARERRRRRKINEAFDELRSLLPTLPNNKKLSKAEILRLAIEYIKSLQEELQ 52 (53)
T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHh
Confidence 6799999999999999999999997667899999999999999999998753
No 7
>KOG2588|consensus
Probab=98.78 E-value=5.8e-09 Score=117.08 Aligned_cols=95 Identities=31% Similarity=0.344 Sum_probs=74.8
Q ss_pred CcceeeecCCCCCCccccccCCCC-----CCCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHH
Q psy11481 158 SHAVALSTSPNNLTQNDMIKKRSG-----ISGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRY 232 (561)
Q Consensus 158 S~~VaLsnSp~~L~q~se~kKRPg-----~SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDY 232 (561)
+....++.++..++.....+|.|. +.++|..||.+|||+|..|||++.+||++||. ...|..|.++|++|++|
T Consensus 244 ~~a~~~~ispa~~~~~~~~ek~Pi~rl~~G~~kRtAHN~IEKRYRsSINDKI~eLk~lV~g--~~aKl~kSavLr~ai~~ 321 (953)
T KOG2588|consen 244 SEAEKTSISPASLPANEPKEKKPIKRLLPGGEKRTAHNIIEKRYRSSINDKIIELKDLVPG--TEAKLNKSAVLRKAIDY 321 (953)
T ss_pred ccccCcccCCCCCCcccccccCchhhcCCCCcccchhhHHHHHhhcchhHHHHHHHHhcCc--cHhhhhhhhhHHHHHHH
Confidence 444455555533333322332332 35899999999999999999999999999999 45799999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREn 254 (561)
|++|+..++.+.++.++++.++
T Consensus 322 i~dl~~~nq~lk~~~~~l~t~k 343 (953)
T KOG2588|consen 322 IEDLQGYNQKLKLENASLRTPK 343 (953)
T ss_pred HHHhhccccccchhhhhhhhHH
Confidence 9999999999999998888543
No 8
>KOG0561|consensus
Probab=98.08 E-value=2.8e-06 Score=87.23 Aligned_cols=70 Identities=29% Similarity=0.380 Sum_probs=61.2
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREn 254 (561)
=||..-|.=||+|=..||.+|..||.++|- .+++|.||++||+...|||..|+.+.-++..+.-+|+|.+
T Consensus 59 mRReIANsNERRRMQSINAGFqsLr~LlPr-~eGEKLSKAAILQQTa~yI~~Le~~Kt~ll~qn~elKr~~ 128 (373)
T KOG0561|consen 59 MRREIANSNERRRMQSINAGFQSLRALLPR-KEGEKLSKAAILQQTADYIHQLEGHKTELLPQNGELKRLK 128 (373)
T ss_pred HHHHhhcchHHHHHHhhhHHHHHHHHhcCc-ccchhhHHHHHHHHHHHHHHHHHhcccccccccchHHHHH
Confidence 378888999999999999999999999999 7899999999999999999999998777766555555443
No 9
>KOG3561|consensus
Probab=98.02 E-value=5.2e-06 Score=93.13 Aligned_cols=55 Identities=25% Similarity=0.490 Sum_probs=51.1
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCCC-CcCCcchHHHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPAQ-EEKKSSNLSILHSAIRYIQFLRRR 239 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsLs-Dn~KaSKAsILrKAiDYIQkLQre 239 (561)
.|+.||..|||||+.+|..+++|.++||.+. -+.|+.|.+||++|+++|+.+++.
T Consensus 20 ~Re~~~~~EKrRRdq~N~yI~ELs~Mvp~~~~~~RK~DK~tVLr~aV~~lr~~k~~ 75 (803)
T KOG3561|consen 20 KRENRSEIEKRRRDQMNKYIEELSEMVPTNASLSRKPDKLTVLRMAVDHLRLIKEQ 75 (803)
T ss_pred ccccchhHHHHHHHHHHHHHHHHHHhhhcchhcccCchHHHHHHHHHHHHHHHhhh
Confidence 3899999999999999999999999999964 358999999999999999999985
No 10
>KOG4029|consensus
Probab=97.87 E-value=1.1e-05 Score=77.35 Aligned_cols=61 Identities=26% Similarity=0.378 Sum_probs=54.9
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCC-cCCcchHHHHHHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQE-EKKSSNLSILHSAIRYIQFLRRREREF 243 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsD-n~KaSKAsILrKAiDYIQkLQreEreL 243 (561)
...|..+|+.||+|=..+|.+|.+||..||.... .+|.||+.+|+-|+.||++|.+-....
T Consensus 107 ~~~~~~~n~RER~Rv~~vN~~f~~Lr~~lP~~~~~~kklSKveTLr~A~~YI~~L~~lL~~~ 168 (228)
T KOG4029|consen 107 SAQRQARNARERQRVQSVNSAFAELRALLPTEPPQSKKLSKVETLRLATSYIRYLTKLLATQ 168 (228)
T ss_pred hhhhhhhhhhhhhcccchhhhhHHHHhcCCCCCCcccccCcccchHHHHHHHHHHHHHhccc
Confidence 4567888999999999999999999999999877 899999999999999999999865433
No 11
>KOG4304|consensus
Probab=97.81 E-value=1e-05 Score=80.14 Aligned_cols=61 Identities=26% Similarity=0.392 Sum_probs=52.1
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCCC-----cCCcchHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQE-----EKKSSNLSILHSAIRYIQFLRRREREFE 244 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsD-----n~KaSKAsILrKAiDYIQkLQreEreLE 244 (561)
.++..|=.||||||+-||.|+++||++|+.... ..|.-|+.||.-+++|+++|++......
T Consensus 31 ~rk~~Kpl~EKkRRaRIN~~L~eLK~Li~e~~~~~~~~~sklEKAdILEltV~hL~~l~~~~~~~~ 96 (250)
T KOG4304|consen 31 YRKVRKPLLEKKRRARINRCLDELKDLIPEALKKDGQRHSKLEKADILELTVNHLRQLQRSQQAAA 96 (250)
T ss_pred HhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhHHHHHHHHHHHHHHHHhccccccc
Confidence 466778899999999999999999999998642 2678899999999999999998765433
No 12
>KOG3960|consensus
Probab=97.76 E-value=7.7e-05 Score=75.26 Aligned_cols=63 Identities=30% Similarity=0.444 Sum_probs=53.9
Q ss_pred CCCCCCCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHH
Q psy11481 178 KRSGISGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRER 241 (561)
Q Consensus 178 KRPg~SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEr 241 (561)
|+.-..+||..-..-||+|=.++|+.|+.||..--. ..+.+..|+.||+.||+||..||+-.+
T Consensus 111 rks~svDRRKAATMRERRRLkKVNEAFE~LKRrT~~-NPNQRLPKVEILRsAI~YIE~Lq~LL~ 173 (284)
T KOG3960|consen 111 RKSTSVDRRKAATMRERRRLKKVNEAFETLKRRTSS-NPNQRLPKVEILRSAIRYIERLQALLQ 173 (284)
T ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-CccccccHHHHHHHHHHHHHHHHHHHH
Confidence 334457899999999999999999999999987766 467899999999999999999887543
No 13
>KOG3582|consensus
Probab=97.14 E-value=0.0002 Score=79.81 Aligned_cols=87 Identities=18% Similarity=0.261 Sum_probs=79.0
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCC--CcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQ--EEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLs--Dn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
.++.+|..+|++||.++|.||+.|-.++-+.. -+-|+++..-|++.++||..++.+...+.+|-..|++|..+++.-+
T Consensus 650 ~r~it~~Sa~qkrr~n~kl~~~~~Ns~~Sn~~~l~s~k~t~~~~~q~~vhYi~~t~~e~~~v~~e~~~lr~~~s~~~A~~ 729 (856)
T KOG3582|consen 650 NRPITHISAEQKRRFNIKLQFDVLNSLSSNSSSLSSIKTTNSNSLQKTVHYINVTQQERVPVQEEAHSLRKEISELNAVI 729 (856)
T ss_pred CCcccCccHHHHHHhhHHhhhHhhhhhccCcccccchhhhccccccCceeeeeccchhccccchhhhhhhhhhHHHHHHH
Confidence 57899999999999999999999999988753 2467888889999999999999999999999999999999999999
Q ss_pred HHHHHHHhh
Q psy11481 262 ALLKKELSA 270 (561)
Q Consensus 262 eELKqELsQ 270 (561)
..|++.+..
T Consensus 730 ~~~~q~p~a 738 (856)
T KOG3582|consen 730 SACQQPPPA 738 (856)
T ss_pred HHhhcCCCc
Confidence 999987765
No 14
>PLN03217 transcription factor ATBS1; Provisional
Probab=96.44 E-value=0.0071 Score=53.26 Aligned_cols=47 Identities=19% Similarity=0.384 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHhccCCCCCC---cCCcchHHHHHHHHHHHHHHHHHHHHH
Q psy11481 197 RAHLKECFEILKRQVPPAQE---EKKSSNLSILHSAIRYIQFLRRREREF 243 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPsLsD---n~KaSKAsILrKAiDYIQkLQreEreL 243 (561)
-+.|++.+..|+.++|.... ..|.|.+.+|+.+|.||+.|+++...+
T Consensus 19 ddqi~dLvsKLq~llPe~r~~r~s~k~saskvLqEtC~YIrsLhrEvDdL 68 (93)
T PLN03217 19 EDQINDLIIKLQQLLPELRDSRRSDKVSAARVLQDTCNYIRNLHREVDDL 68 (93)
T ss_pred HHHHHHHHHHHHHHChHHHhhhccccccHHHHHHHHHHHHHHHHHHHHHH
Confidence 57899999999999999754 457788999999999999998865443
No 15
>KOG3910|consensus
Probab=96.04 E-value=0.0044 Score=67.62 Aligned_cols=73 Identities=29% Similarity=0.358 Sum_probs=57.1
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKK-SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~K-aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
.+||...|+-||-|=.+||+.|.+|.++.=.=.+.+| ..|+.||..|++.|-.|+++.|| +.|+-+.
T Consensus 524 kERR~aNNARERlRVRDINeAfKELGRMCqlHlkSeKpQTKLgILhqAVsVIlsLEQQVRE------------RNLNPKa 591 (632)
T KOG3910|consen 524 KERRMANNARERLRVRDINEAFKELGRMCQLHLKSEKPQTKLGILHQAVSVILSLEQQVRE------------RNLNPKA 591 (632)
T ss_pred HHHHhhhhhhhheehhhHHHHHHHHHHHHHHhhcccCcchhhhHHHHHHHHHHHHHHHHHH------------ccCChhh
Confidence 4689999999999999999999999887653333344 79999999999999999987652 2355555
Q ss_pred HHHHHH
Q psy11481 262 ALLKKE 267 (561)
Q Consensus 262 eELKqE 267 (561)
+.||+.
T Consensus 592 aclkRR 597 (632)
T KOG3910|consen 592 ACLKRR 597 (632)
T ss_pred hhhhcc
Confidence 666654
No 16
>KOG4447|consensus
Probab=95.43 E-value=0.0086 Score=57.39 Aligned_cols=54 Identities=33% Similarity=0.506 Sum_probs=50.1
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRR 239 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQre 239 (561)
.|-.||.-||+|-..|++.|..||..+|.+ ...|.||...|+-|..||.+|-+-
T Consensus 78 qrv~anvrerqRtqsLn~AF~~lr~iiptl-PsdklSkiqtLklA~ryidfl~~v 131 (173)
T KOG4447|consen 78 QRVMANVRERQRTQSLNEAFAALRKIIPTL-PSDKLSKIQTLKLAARYIDFLYQV 131 (173)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHHhhcCCC-CccccccccchhhcccCCchhhhc
Confidence 588999999999999999999999999996 457999999999999999998764
No 17
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=94.65 E-value=0.17 Score=42.47 Aligned_cols=47 Identities=23% Similarity=0.428 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
+.+|+|-|..|+.+..+++.+...|..++..|++..+.|+++..+ |.
T Consensus 13 i~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e~~~-~~ 59 (72)
T PF06005_consen 13 IQQAVETIALLQMENEELKEKNNELKEENEELKEENEQLKQERNA-WQ 59 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-HH
Confidence 467999999999999999999999999999999999999999877 76
No 18
>PF02344 Myc-LZ: Myc leucine zipper domain; InterPro: IPR003327 This family consists of the leucine zipper dimerisation domain found in both cellular c-Myc proto-oncogenes and viral v-Myc oncogenes. Dimerisation via the leucine zipper motif with other basic helix-loop-helix-leucine zipper (b/HLH/lz) proteins is required for efficient DNA binding []. The Myc-Max dimer is a transactivating complex activating expression of growth related genes promoting cell proliferation. The dimerisation is facilitated via interdigitating leucine residues every 7th position of the alpha helix. Like charge repulsion of adjacent residues in this region preturbs the formation of homodimers with heterodimers being promoted by opposing charge attractions. It has been demonstrated that in transgenic mice the balance between oncogene-induced proliferation and apoptosis in a given tissue can be a critical determinant in the initiation and maintenance of the tumor [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1NKP_D 1A93_A 2A93_A.
Probab=94.60 E-value=0.071 Score=39.52 Aligned_cols=28 Identities=25% Similarity=0.432 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKq 266 (561)
+|+++..|+|+|+++.+.|+.+|++|++
T Consensus 2 dEqkL~sekeqLrrr~eqLK~kLeqlrn 29 (32)
T PF02344_consen 2 DEQKLISEKEQLRRRREQLKHKLEQLRN 29 (32)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH--
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 6788999999999999999999999985
No 19
>KOG3898|consensus
Probab=94.43 E-value=0.02 Score=57.11 Aligned_cols=56 Identities=25% Similarity=0.304 Sum_probs=50.7
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRR 239 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQre 239 (561)
-+|..=|+=||+|=-+||+.|+.||+++|...+..|.+|...|+.|..||..|.+-
T Consensus 71 ~rR~kaNaRER~RMH~LNdAld~LReviP~~~~~~klskIetl~~a~~yi~als~~ 126 (254)
T KOG3898|consen 71 LRRLKANARERTRMHDLNDALDALREVIPHGLHPPKLSKIETLRLAANYIAALSEV 126 (254)
T ss_pred hhcccccchhhccccchhHHHHHhHhhccCcCCCCCCCcchhHHhhhcchhhhccc
Confidence 35777799999999999999999999999878899999999999999999988763
No 20
>KOG4395|consensus
Probab=93.89 E-value=0.062 Score=54.96 Aligned_cols=56 Identities=32% Similarity=0.347 Sum_probs=50.6
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRR 238 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQr 238 (561)
..+|..-|..||+|=..||..|+.||..||.+....|.||-..|+-|--||--|-.
T Consensus 172 ~~rr~aanarErrrm~gLN~AfD~Lr~v~p~~~~d~~LSkyetLqmaq~yi~~l~~ 227 (285)
T KOG4395|consen 172 SHRRLAANARERRRMNGLNSAFDRLRLVVPDGDSDKKLSKYETLQMAQGYILALGC 227 (285)
T ss_pred HhhhcccchHHHHHhhhHHHHHHHHHHhcCCCCccchhhhhhHHHHHHHHHhhhHH
Confidence 45778889999999999999999999999998888999999999999999986654
No 21
>smart00338 BRLZ basic region leucin zipper.
Probab=92.19 E-value=0.69 Score=36.75 Aligned_cols=42 Identities=33% Similarity=0.376 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+-.+||..|+.+...++.+.+.|+.++..|+..+..|+.++.
T Consensus 23 rKk~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~~~~ 64 (65)
T smart00338 23 RKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSELE 64 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 445689999999888888888888888888888888887763
No 22
>KOG4447|consensus
Probab=91.98 E-value=0.27 Score=47.46 Aligned_cols=76 Identities=17% Similarity=0.186 Sum_probs=55.0
Q ss_pred HHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 192 LEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 192 LERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+|+.|...+++.|.-|+.++|.. +..+.++...|+.+.+||++|.+-.++.--.--+-+++-..|+...+.|++.+
T Consensus 29 ~e~~R~~~ls~~s~l~g~l~pgs-pa~gk~~~ktlr~~~~~~~~~dE~q~qrv~anvrerqRtqsLn~AF~~lr~ii 104 (173)
T KOG4447|consen 29 KERGRKRRLSDASTLLGKLEPGS-PADGKRGKKTLRIGTDSIQSLDELQKQRVMANVRERQRTQSLNEAFAALRKII 104 (173)
T ss_pred HHHhHHhhhhhhhhhccccCCCC-CCcccccccccccCCCchhhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhc
Confidence 69999999999999999999994 55566666669999999999987655332222223334456666777776654
No 23
>PRK10884 SH3 domain-containing protein; Provisional
Probab=91.92 E-value=0.52 Score=46.13 Aligned_cols=51 Identities=10% Similarity=0.143 Sum_probs=44.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
..-+..+-+.|..|+++++++.+|.+.++.|++.|+++++.++.....+|.
T Consensus 124 ~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~~~~~~~~~wf 174 (206)
T PRK10884 124 QQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQRTIIMQWF 174 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555677788999999999999999999999999999999999877786
No 24
>KOG4343|consensus
Probab=91.02 E-value=0.37 Score=53.72 Aligned_cols=44 Identities=23% Similarity=0.238 Sum_probs=40.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+|-.||...|+.+.+++++|-|+|++|+.+|++||++|..|...
T Consensus 298 kKKKEy~~~Le~rLq~ll~Ene~Lk~ENatLk~qL~~l~~En~~ 341 (655)
T KOG4343|consen 298 KKKKEYMLGLEARLQALLSENEQLKKENATLKRQLDELVSENQR 341 (655)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcCcc
Confidence 45679999999999999999999999999999999999998665
No 25
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=90.44 E-value=1.5 Score=34.82 Aligned_cols=39 Identities=31% Similarity=0.397 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
..||..|+.+...++.+.+.|+.++..|+..+..|+.++
T Consensus 25 k~~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e~ 63 (64)
T PF00170_consen 25 KQYIEELEEKVEELESENEELKKELEQLKKEIQSLKSEN 63 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 468888999888888888888888888888888888765
No 26
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=89.21 E-value=1.8 Score=37.78 Aligned_cols=47 Identities=21% Similarity=0.372 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhhhc
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKI-------HAQQRLALLKKELSARWE 273 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREne-------aLQQRLeELKqELsQqw~ 273 (561)
++.|+|-|.-|+-+-.++..+...|.+|++ +|.++.++||.+... |.
T Consensus 13 IqqAvdtI~LLqmEieELKekn~~L~~e~~~~~~~r~~L~~en~qLk~E~~~-Wq 66 (79)
T PRK15422 13 VQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNG-WQ 66 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-HH
Confidence 466888888888777777776666666644 488899999998777 87
No 27
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=88.53 E-value=1.9 Score=37.39 Aligned_cols=47 Identities=21% Similarity=0.399 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 226 LHSAIRYIQFLRRREREFE-------HEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE-------~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
++.|+|-|+-|+-+..++. .|...++..+++|+++-++||++-.. |.
T Consensus 13 iqqAvdTI~LLQmEieELKEknn~l~~e~q~~q~~reaL~~eneqlk~e~~~-WQ 66 (79)
T COG3074 13 VQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNG-WQ 66 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH
Confidence 4567777766665555554 44445555666677777777776544 76
No 28
>KOG3119|consensus
Probab=88.37 E-value=1.6 Score=43.91 Aligned_cols=40 Identities=23% Similarity=0.214 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
.+++.+...|+.|.+.|+.++++|+++|..|+.-+.+.|.
T Consensus 218 ~e~~~r~~~leken~~lr~~v~~l~~el~~~~~~~~~~~~ 257 (269)
T KOG3119|consen 218 DEMAHRVAELEKENEALRTQVEQLKKELATLRRLFLQLPK 257 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 4455555666666677777777777777777777666544
No 29
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=87.87 E-value=1.6 Score=36.36 Aligned_cols=79 Identities=23% Similarity=0.308 Sum_probs=59.9
Q ss_pred hHHHHhHHHHHHHHHHHHhccCCCCCCc-CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFEILKRQVPPAQEE-KKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 190 NaLERKRRd~LKdcFeeLRdlVPsLsDn-~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
..+++..+ .+....++|..+=+. ... ....++-|+...-+.+..|+++...++.+++.|+.+...+..++.+++.+|
T Consensus 22 ~~l~~~~~-~~~~~~~eL~~l~~~-~~~y~~vG~~fv~~~~~~~~~~L~~~~~~~~~~i~~l~~~~~~l~~~l~~~~~~l 99 (106)
T PF01920_consen 22 QQLERQLR-ELELTLEELEKLDDD-RKVYKSVGKMFVKQDKEEAIEELEERIEKLEKEIKKLEKQLKYLEKKLKELKKKL 99 (106)
T ss_dssp HHHHHHHH-HHHHHHHHHHTSSTT--EEEEEETTEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHH-HHHHHHHHHHhCCCc-chhHHHHhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444443 667778888876444 222 234566667767788999999999999999999999999999999999998
Q ss_pred hh
Q psy11481 269 SA 270 (561)
Q Consensus 269 sQ 270 (561)
-+
T Consensus 100 ~~ 101 (106)
T PF01920_consen 100 YE 101 (106)
T ss_dssp HC
T ss_pred HH
Confidence 65
No 30
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=86.72 E-value=3.3 Score=32.20 Aligned_cols=39 Identities=36% Similarity=0.464 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
|..|=..-+.....++.+++.|..++..|+++++.|++|
T Consensus 16 A~r~R~rkk~~~~~le~~~~~L~~en~~L~~~i~~L~~E 54 (54)
T PF07716_consen 16 ARRSRQRKKQREEELEQEVQELEEENEQLRQEIAQLERE 54 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 444555566666777888888889999999999988865
No 31
>KOG4005|consensus
Probab=85.54 E-value=4.5 Score=41.81 Aligned_cols=77 Identities=26% Similarity=0.328 Sum_probs=47.2
Q ss_pred Cchhhh-hHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-----
Q psy11481 184 GIREVH-NKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRY-IQFLRRREREFEHEMERLAREKIH----- 256 (561)
Q Consensus 184 eRRaTH-NaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDY-IQkLQreEreLE~EkERLRREnea----- 256 (561)
.+|-.| ..=||--|.+||++...=- -+|.+| .+.. -+|| |+.|.+++++++.|-+.||+.++-
T Consensus 58 r~RL~HLS~EEK~~RrKLKNRVAAQt-----aRDrKK-aRm~----eme~~i~dL~een~~L~~en~~Lr~~n~~L~~~n 127 (292)
T KOG4005|consen 58 RRRLDHLSWEEKVQRRKLKNRVAAQT-----ARDRKK-ARME----EMEYEIKDLTEENEILQNENDSLRAINESLLAKN 127 (292)
T ss_pred HHhhcccCHHHHHHHHHHHHHHHHhh-----hhhHHH-HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 467778 4555556888888764311 122122 1111 1344 899999999999888888876654
Q ss_pred --HHHHHHHHHHHHhh
Q psy11481 257 --AQQRLALLKKELSA 270 (561)
Q Consensus 257 --LQQRLeELKqELsQ 270 (561)
|..+|+.++++|..
T Consensus 128 ~el~~~le~~~~~l~~ 143 (292)
T KOG4005|consen 128 HELDSELELLRQELAE 143 (292)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 45556666666543
No 32
>PF05164 ZapA: Cell division protein ZapA; InterPro: IPR007838 This entry a structural domain found in the cell division protein ZapA, as well as in related proteins. This domain has a core structure consisting of two layers alpha/beta, and has a long C-terminal helix that forms dimeric parallel and tetrameric antiparallel coiled coils []. ZapA interacts with FtsZ, where FtsZ is part of a mid-cell cytokinetic structure termed the Z-ring that recruits a hierarchy of fission related proteins early in the bacterial cell cycle. ZapA drives the polymerisation and filament bundling of FtsZ, thereby contributing to the spatio-temporal tuning of the Z-ring.; PDB: 1T3U_B 1W2E_B 3HNW_A.
Probab=84.98 E-value=9.1 Score=31.15 Aligned_cols=61 Identities=23% Similarity=0.345 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHH
Q psy11481 197 RAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEME--RLAREKIHAQQRLA 262 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkE--RLRREneaLQQRLe 262 (561)
=..|++.+.+++...|.+. ..++.+| -|+.....+.+.+++.....+ +++++.+.|.++|+
T Consensus 27 a~~i~~~i~~~~~~~~~~~----~~~~~vl-aaLnla~e~~~~~~~~~~~~~~~~l~~~i~~L~~~le 89 (89)
T PF05164_consen 27 AELINEKINEIKKKYPKLS----PERLAVL-AALNLADELLKLKRELDELEELERLEERIEELNERLE 89 (89)
T ss_dssp HHHHHHHHHHHCTTCCTSS----HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCCCC----HHHHHHH-HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhhC
Confidence 4578999999999988742 3445555 456665555555554444444 66666666666553
No 33
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=84.85 E-value=3 Score=40.35 Aligned_cols=46 Identities=20% Similarity=0.367 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRR----------REREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 223 AsILrKAiDYIQkLQr----------eEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
...|...+.|++.|+. ++++++.|+++|+.+++.|+.+++.|++++
T Consensus 79 ~ltl~~vI~fLq~l~~~~~~~~~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~ 134 (161)
T TIGR02894 79 SLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRL 134 (161)
T ss_pred cCCHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3446777777777765 344444555555555555555555555444
No 34
>PF14775 NYD-SP28_assoc: Sperm tail C-terminal domain
Probab=83.53 E-value=6.1 Score=32.30 Aligned_cols=56 Identities=20% Similarity=0.261 Sum_probs=38.2
Q ss_pred HHHHHhccCCCCCCcCCcchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 203 CFEILKRQVPPAQEEKKSSNLSILHSAIR-YIQFLRRREREFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 203 cFeeLRdlVPsLsDn~KaSKAsILrKAiD-YIQkLQreEreLE~EkERLRREneaLQQRLeE 263 (561)
-++.+...+|. .+.--+..|.++.+ |-+-| .+..++..|++.|++++.+|+..|.+
T Consensus 2 yW~~~~~vip~----~~~~~W~~L~~~l~rY~~vL-~~R~~l~~e~~~L~~qN~eLr~lLkq 58 (60)
T PF14775_consen 2 YWERLANVIPD----EKIRLWDALENFLKRYNKVL-LDRAALIQEKESLEQQNEELRSLLKQ 58 (60)
T ss_pred HHHHHhhcCCh----HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 36778888887 24455777877765 44444 44566778888888888877766643
No 35
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=83.20 E-value=5 Score=33.37 Aligned_cols=76 Identities=16% Similarity=0.227 Sum_probs=50.5
Q ss_pred hHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhh
Q psy11481 195 NRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREK-IHAQQRLALLKKELSA 270 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREn-eaLQQRLeELKqELsQ 270 (561)
.++..+.+....|...+=.+......-+..|-...-+++..|+++++.+..+++..++++ ..|..+++.|...+.+
T Consensus 14 ~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~I~~~f~~l~~~L~~~e~~ll~~l~~~~~~~~~~l~~q~~~l~~~l~~ 90 (127)
T smart00502 14 KKAAELEDALKQLISIIQEVEENAADVEAQIKAAFDELRNALNKRKKQLLEDLEEQKENKLKVLEQQLESLTQKQEK 90 (127)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666666666666666555555566777777778889999999998888887654433 3455555555555543
No 36
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=83.06 E-value=3.6 Score=40.43 Aligned_cols=39 Identities=21% Similarity=0.213 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKI---HAQQRLALLKKELS 269 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREne---aLQQRLeELKqELs 269 (561)
....+|++++++|++|.++|+.++. +|+++.++|++.|.
T Consensus 69 ~~~~~l~~en~~L~~e~~~l~~~~~~~~~l~~en~~L~~lL~ 110 (276)
T PRK13922 69 ASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLN 110 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3456677777777777777777666 44555556665543
No 37
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=83.00 E-value=12 Score=42.84 Aligned_cols=134 Identities=18% Similarity=0.212 Sum_probs=74.9
Q ss_pred ccCccceee-cccccchhcc--cCccCCCcce--eeecCCCCCCccccccCCCCCCCchhhhhHHHHhHHHHHHHHHHHH
Q psy11481 133 MGSREVVAS-LSGETRLVAH--NNYTSNSHAV--ALSTSPNNLTQNDMIKKRSGISGIREVHNKLEKNRRAHLKECFEIL 207 (561)
Q Consensus 133 vGs~~~~~S-~S~~~avva~--~~~~~NS~~V--aLsnSp~~L~q~se~kKRPg~SeRRaTHNaLERKRRd~LKdcFeeL 207 (561)
.|-|-++++ +++.+..|-. .+|++--.+- .|++++-.. .....+-..++.+-|..--++=+.+. .++..|..+
T Consensus 287 lG~PvvVAtDVtp~P~~V~KiAasf~A~ly~P~~dLsveEK~~-~~r~~~~~~~ddH~RDALAAA~kAY~-~yk~kl~~v 364 (652)
T COG2433 287 LGKPVVVATDVTPAPETVKKIAASFNAVLYTPDRDLSVEEKQE-ALRTLKISVSDDHERDALAAAYKAYL-AYKPKLEKV 364 (652)
T ss_pred cCCceEEEccCCCChHHHHHHHHHcCCcccCCcccCCHHHHHH-HHhhcCCCCCCchHHHHHHHHHHHHH-HHHHHHHHH
Confidence 477888888 9999998877 7777763331 111111110 00112222345566665555544443 345577777
Q ss_pred hccCCCC---CCc---------------------------CC-----------cchHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 208 KRQVPPA---QEE---------------------------KK-----------SSNLSILHSAIRYIQFLRRREREFEHE 246 (561)
Q Consensus 208 RdlVPsL---sDn---------------------------~K-----------aSKAsILrKAiDYIQkLQreEreLE~E 246 (561)
-..+|.+ .|. .+ .+...-+.+--.-++.|+.+++.|+.+
T Consensus 365 Er~~~~~g~~~d~~rika~VIrG~~l~eal~~~~e~~~p~e~~~~~~~e~~ei~~~~~~i~~~~~~ve~l~~e~~~L~~~ 444 (652)
T COG2433 365 ERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRE 444 (652)
T ss_pred HHhcccccchhhHHHHHHHeecCCcHHHHHHHHHhhhccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7777765 210 01 111222223333466677777777777
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 247 MERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 247 kERLRREneaLQQRLeELKqEL 268 (561)
+++|++++++|+.+|++++.+.
T Consensus 445 ~ee~k~eie~L~~~l~~~~r~~ 466 (652)
T COG2433 445 LEELKREIEKLESELERFRREV 466 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 7777777777777777776665
No 38
>KOG4571|consensus
Probab=82.62 E-value=5.6 Score=41.62 Aligned_cols=42 Identities=24% Similarity=0.340 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
.|+.|=++=+.+...++-|.+.|.++|++||.|+.+|.+|+.
T Consensus 238 AAtRYRqKkRae~E~l~ge~~~Le~rN~~LK~qa~~lerEI~ 279 (294)
T KOG4571|consen 238 AATRYRQKKRAEKEALLGELEGLEKRNEELKDQASELEREIR 279 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466676666666666667777777777777777777776653
No 39
>PF11690 DUF3287: Protein of unknown function (DUF3287); InterPro: IPR021704 This eukaryotic family of proteins has no known function.
Probab=82.21 E-value=2.7 Score=38.38 Aligned_cols=57 Identities=16% Similarity=0.345 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhhhcccccccCCCCC
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHA--QQRLALLKKELSARWEHIDFNTLIPDN 284 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaL--QQRLeELKqELsQqw~hIDfStLlPd~ 284 (561)
..|.++|+++..+-+++-.|..+|.++++.| .+++..|+...- +|....|+.+.|..
T Consensus 38 ~ea~~F~~kV~~qH~~~~~e~r~L~kKi~~l~veRkmr~Les~p~-~W~~~gf~~~a~iP 96 (109)
T PF11690_consen 38 KEAYDFIDKVVDQHQRYCDERRKLRKKIQDLRVERKMRALESHPF-DWERRGFDKVAPIP 96 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChH-HHHHhchHHhcccc
Confidence 3478899999999999999999999999988 777888877744 49888888776644
No 40
>PF10205 KLRAQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019343 This entry represents a N-terminal 100 residues long domain, which contains a conserved KLRAQ motif. This domain is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain a C-terminal TTKRSYEDQ motif domain (IPR019348 from INTERPRO). The function of these proteins is not known.
Probab=81.87 E-value=7.5 Score=35.27 Aligned_cols=38 Identities=21% Similarity=0.338 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.-+...++++.|++-|.-.+..|-+|++.|+.++..
T Consensus 35 Lk~ke~~LRk~eqE~dSL~FrN~QL~kRV~~LQ~El~~ 72 (102)
T PF10205_consen 35 LKEKEQALRKLEQENDSLTFRNQQLTKRVEVLQEELEE 72 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444455777889999999999999999999999873
No 41
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=81.47 E-value=5 Score=41.28 Aligned_cols=41 Identities=29% Similarity=0.396 Sum_probs=25.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
...+-++.|+++++++.+|++.|++|..+|.+++++|+.+.
T Consensus 47 ~~~~el~~le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~ 87 (314)
T PF04111_consen 47 ELEEELEKLEQEEEELLQELEELEKEREELDQELEELEEEL 87 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445666777777777766666666666666666655553
No 42
>PRK10722 hypothetical protein; Provisional
Probab=81.24 E-value=10 Score=38.92 Aligned_cols=79 Identities=14% Similarity=0.247 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHH---HH----HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRER---EF----EHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEr---eL----E~EkERLRREneaLQQRLeELKqEL 268 (561)
||..+-+.++.....+|.- -.+=+.+++.----+-.|.++.. ++ +.++++|++++..|+.+|+.+.++|
T Consensus 124 err~~l~rl~~~~~~~p~~----lrPL~qlwr~~Q~l~l~LaeEr~Ry~rLQq~sD~qlD~lrqq~~~Lq~~L~~t~rKL 199 (247)
T PRK10722 124 ERRQIVERLNAYSLQIPAQ----VRPLYQLWRDGQALQLALAEERQRYQKLQQSSDSELDALRQQQQRLQYQLELTTRKL 199 (247)
T ss_pred HHHHHHHHHhhcccccchh----hhHHHHHHHHhhHHHHhHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667777888888888862 12223333332112333555555 55 6789999999999999999999998
Q ss_pred hhhhcccccccCCCCCccccccc
Q psy11481 269 SARWEHIDFNTLIPDNMEVDIPY 291 (561)
Q Consensus 269 sQqw~hIDfStLlPd~lEIDRvl 291 (561)
.. ...|+|.|
T Consensus 200 En-------------LTdIERqL 209 (247)
T PRK10722 200 EN-------------LTDIERQL 209 (247)
T ss_pred HH-------------HHHHHHHh
Confidence 75 45678887
No 43
>PF15458 NTR2: Nineteen complex-related protein 2
Probab=80.55 E-value=16 Score=36.76 Aligned_cols=84 Identities=18% Similarity=0.290 Sum_probs=52.5
Q ss_pred hhhhhHHHHhHHHHHHHHHHH-----------------HhccCCCCC---------CcCCcchHHHHHHH----HHHHHH
Q psy11481 186 REVHNKLEKNRRAHLKECFEI-----------------LKRQVPPAQ---------EEKKSSNLSILHSA----IRYIQF 235 (561)
Q Consensus 186 RaTHNaLERKRRd~LKdcFee-----------------LRdlVPsLs---------Dn~KaSKAsILrKA----iDYIQk 235 (561)
+......+++||..|...++. ||+-.-... ..++...+-.|..+ ...+..
T Consensus 140 ~ka~~~~~~~rR~~i~e~I~~~~~d~~de~~~~wE~aQ~rkg~~~~~~~~~~~~~~~ip~~tpLP~L~~~~~rL~~~l~~ 219 (254)
T PF15458_consen 140 KKAEREQKRRRREEIEEAINDDDDDSEDEERAEWEAAQLRKGMDGLSDEKDRYKPPPIPKITPLPSLSECLERLRESLSS 219 (254)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccchhhHHHHHHHHHHhhccCCccchhccCCCCCcccCCCCchhHHHHHHHHHHHH
Confidence 444556677788888887777 333222211 01222233333333 344666
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|+...+++...++.|++|+..|..|-.+|+..|.
T Consensus 220 le~~~~~~~~~l~~l~~E~~~I~~re~elq~~l~ 253 (254)
T PF15458_consen 220 LEDSKSQLQQQLESLEKEKEEIEEREKELQELLK 253 (254)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 7777777788888888888888888888887764
No 44
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=80.23 E-value=4.6 Score=32.07 Aligned_cols=31 Identities=23% Similarity=0.345 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
..++++..+++.++++|+.|++.|+++++.|
T Consensus 20 ~~~~~ei~~l~~~i~~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 20 YQLNQEIAELQKEIEELKKENEELKEEIERL 50 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4455656666666666666666666666665
No 45
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=80.02 E-value=4.1 Score=36.09 Aligned_cols=33 Identities=9% Similarity=-0.015 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
..|+++..+++++.++|++++++|+++++.|+.
T Consensus 30 ~~l~~q~~~~~~e~~~l~~~n~~L~~eI~~L~~ 62 (105)
T PRK00888 30 WRVNDQVAAQQQTNAKLKARNDQLFAEIDDLKG 62 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 345555555666666666666666666666654
No 46
>PRK14127 cell division protein GpsB; Provisional
Probab=79.41 E-value=5.8 Score=36.03 Aligned_cols=40 Identities=20% Similarity=0.408 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+|+..+-++-..+..|+.+|++|+..|+++|++++.++.+
T Consensus 30 ~FLd~V~~dye~l~~e~~~Lk~e~~~l~~~l~e~~~~~~~ 69 (109)
T PRK14127 30 KFLDDVIKDYEAFQKEIEELQQENARLKAQVDELTKQVSV 69 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 4666666666677778888888888888888888887764
No 47
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=79.36 E-value=14 Score=40.88 Aligned_cols=79 Identities=18% Similarity=0.325 Sum_probs=44.7
Q ss_pred HhHHHHHHHHHHHHhccCCCCCCc-------CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQEE-------KKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLsDn-------~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
.++...++..++.|++..+..... .+.+...+...+-+.+..+.++-.++.+++++|++|.++|++++++|+
T Consensus 49 ~~~~~~~~~~l~~L~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~l~~~~~~L~~~~~~l~~~~~~l~- 127 (646)
T PRK05771 49 RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLE- 127 (646)
T ss_pred HHHHHHHHHHHHHHHHhccccccchhhhccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-
Confidence 345666888888888887763210 111222222333445666666666666666666666666666655554
Q ss_pred HHhhhhccccc
Q psy11481 267 ELSARWEHIDF 277 (561)
Q Consensus 267 ELsQqw~hIDf 277 (561)
-|..+|+
T Consensus 128 ----~~~~ld~ 134 (646)
T PRK05771 128 ----PWGNFDL 134 (646)
T ss_pred ----hhhcCCC
Confidence 2766554
No 48
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=78.57 E-value=5.8 Score=34.50 Aligned_cols=28 Identities=18% Similarity=0.232 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELK 265 (561)
++.+.+++..+.|++|++.|+++-..-+
T Consensus 39 ~e~q~~q~~reaL~~eneqlk~e~~~WQ 66 (79)
T COG3074 39 QEVQNAQHQREALERENEQLKEEQNGWQ 66 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344445555555555555555444333
No 49
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=78.21 E-value=7.4 Score=34.99 Aligned_cols=37 Identities=27% Similarity=0.391 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
.+-|..|+..-.++.+|-.+|+.|+..|+.+|.++.+
T Consensus 21 ~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~ 57 (107)
T PF06156_consen 21 LEELEELKKQLQELLEENARLRIENEHLRERLEELEQ 57 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3457777777777888888888888888888888765
No 50
>PF04880 NUDE_C: NUDE protein, C-terminal conserved region; InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=77.97 E-value=2.4 Score=40.89 Aligned_cols=39 Identities=33% Similarity=0.338 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|..|+|---.|+.+. .|+|.|+.+++.|+.++..||+||
T Consensus 9 lN~AIERnalLE~EL----dEKE~L~~~~QRLkDE~RDLKqEl 47 (166)
T PF04880_consen 9 LNQAIERNALLESEL----DEKENLREEVQRLKDELRDLKQEL 47 (166)
T ss_dssp HHHHHHHHHHHHHHH----HHHHHHHHCH--------------
T ss_pred HHHHHHHhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 567888777787775 668888888888888888888887
No 51
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=77.66 E-value=16 Score=37.51 Aligned_cols=72 Identities=26% Similarity=0.302 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHH----HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIR----YIQ-------FLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiD----YIQ-------kLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
|.+.|..-+..|+..+..+..+ ...++.-|+..+. -|. .|+.+..+++.+++.+..++.+++.+|+++
T Consensus 185 ~~~~L~~e~~~Lk~~~~e~~~~-D~~eL~~lr~eL~~~~~~i~~~k~~l~el~~el~~l~~~i~~~~~~k~~l~~eI~e~ 263 (325)
T PF08317_consen 185 RKAELEEELENLKQLVEEIESC-DQEELEALRQELAEQKEEIEAKKKELAELQEELEELEEKIEELEEQKQELLAEIAEA 263 (325)
T ss_pred HHHHHHHHHHHHHHHHhhhhhc-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555666666655543221 2233333333221 133 344444444444444444444455555554
Q ss_pred HHHH
Q psy11481 265 KKEL 268 (561)
Q Consensus 265 KqEL 268 (561)
++.+
T Consensus 264 ~~~~ 267 (325)
T PF08317_consen 264 EKIR 267 (325)
T ss_pred HHHH
Confidence 4443
No 52
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=77.62 E-value=7.5 Score=32.65 Aligned_cols=40 Identities=23% Similarity=0.374 Sum_probs=33.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.-|++|+.++++++.++..|.+++..+...++.|+..+.+
T Consensus 33 ~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l~~ 72 (74)
T PF12329_consen 33 NTIKKLRAKIKELEKQIKELKKKLEELEKELESLEERLKR 72 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3588999999999999999999888888888888887754
No 53
>smart00338 BRLZ basic region leucin zipper.
Probab=76.93 E-value=13 Score=29.65 Aligned_cols=41 Identities=22% Similarity=0.307 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|=..=+.....++.+++.|..++..|+.+++.|+.++..
T Consensus 18 ~~~R~rKk~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~ 58 (65)
T smart00338 18 RRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEK 58 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444445555667777788888888888888888877654
No 54
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=76.26 E-value=17 Score=37.85 Aligned_cols=19 Identities=26% Similarity=0.272 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHhccCCCC
Q psy11481 196 RRAHLKECFEILKRQVPPA 214 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsL 214 (561)
|.+.|+.-+..|+..+-.+
T Consensus 180 ~~~~L~~e~~~L~~~~~e~ 198 (312)
T smart00787 180 RKDALEEELRQLKQLEDEL 198 (312)
T ss_pred HHHHHHHHHHHHHHhHHHH
Confidence 4555555555555555543
No 55
>KOG3119|consensus
Probab=75.91 E-value=5 Score=40.51 Aligned_cols=32 Identities=31% Similarity=0.510 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.++.+.+..|.+|++.|+.++++|+++|..
T Consensus 216 ~~~e~~~r~~~leken~~lr~~v~~l~~el~~ 247 (269)
T KOG3119|consen 216 KEDEMAHRVAELEKENEALRTQVEQLKKELAT 247 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455556666666666666666666666665
No 56
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=75.24 E-value=13 Score=38.32 Aligned_cols=38 Identities=32% Similarity=0.383 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.+++++.++..+.++|+.|.+++|.||..|+.++++
T Consensus 144 l~E~~~EkeeL~~eleele~e~ee~~erlk~le~E~s~ 181 (290)
T COG4026 144 LEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSR 181 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555666677777777777777777777777654
No 57
>KOG4196|consensus
Probab=75.04 E-value=8 Score=36.70 Aligned_cols=37 Identities=22% Similarity=0.384 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|+.++.++..|+++|++|+..++.+++.++..+.+
T Consensus 77 ~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~k~k~e~ 113 (135)
T KOG4196|consen 77 HELEKEKAELQQQVEKLKEENSRLRRELDAYKSKYEA 113 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555666666666666666666666666655443
No 58
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=74.72 E-value=9.7 Score=34.65 Aligned_cols=37 Identities=22% Similarity=0.261 Sum_probs=29.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
.+-+..|++.-.++.+|-.+|+.|++.|+.+|++++.
T Consensus 21 ~~el~~LK~~~~el~EEN~~L~iEN~~Lr~~l~~~~~ 57 (110)
T PRK13169 21 LKELGALKKQLAELLEENTALRLENDKLRERLEELEA 57 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3457778888888888888888888888888888743
No 59
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=74.62 E-value=8.4 Score=37.38 Aligned_cols=40 Identities=18% Similarity=0.322 Sum_probs=31.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..-+.|+++..+++.+++.|+.|+..|.+++..++.+...
T Consensus 104 ~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~~eDY~~ 143 (161)
T TIGR02894 104 KENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQT 143 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566788888888888888888888888888888877654
No 60
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=74.36 E-value=36 Score=32.71 Aligned_cols=36 Identities=31% Similarity=0.476 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+..++.+...++.++++++++++.+++++++++.++
T Consensus 65 ~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l 100 (302)
T PF10186_consen 65 IEELRERLERLRERIERLRKRIEQKRERLEELRESL 100 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444444444444444444443
No 61
>KOG3863|consensus
Probab=73.85 E-value=7.8 Score=43.91 Aligned_cols=44 Identities=25% Similarity=0.372 Sum_probs=40.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++-.|.|..|+.+...++.|+++|.+|+.++...|.+++++|+.
T Consensus 507 KRKLd~I~nLE~ev~~l~~eKeqLl~Er~~~d~~L~~~kqqls~ 550 (604)
T KOG3863|consen 507 KRKLDCILNLEDEVEKLQKEKEQLLRERDELDSTLGVMKQQLSE 550 (604)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678999999999999999999999999999999999999875
No 62
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=73.10 E-value=5.7 Score=31.53 Aligned_cols=33 Identities=21% Similarity=0.292 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+-.++..++++|+++++.++++.++|+.++.+
T Consensus 17 ~~~~~~~~ei~~l~~~i~~l~~e~~~L~~ei~~ 49 (80)
T PF04977_consen 17 SRYYQLNQEIAELQKEIEELKKENEELKEEIER 49 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344566777888888888888888888888776
No 63
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=73.00 E-value=11 Score=31.69 Aligned_cols=42 Identities=12% Similarity=0.275 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
=+.++..|++++..+.++...++.|...|..+.+.-+..+..
T Consensus 12 Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekne~Ar~rvEa 53 (65)
T TIGR02449 12 LLEYLERLKSENRLLRAQEKTWREERAQLLEKNEQARQKVEA 53 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355677788888888877777777777777777766666544
No 64
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=71.96 E-value=22 Score=28.32 Aligned_cols=42 Identities=31% Similarity=0.342 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|..|=..=+....+|+.+++.|..++..|+.+++.|+.++.+
T Consensus 17 Ar~~R~RKk~~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~ 58 (64)
T PF00170_consen 17 ARRSRQRKKQYIEELEEKVEELESENEELKKELEQLKKEIQS 58 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666667778888888888888888888888887654
No 65
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=71.70 E-value=3.3 Score=47.17 Aligned_cols=39 Identities=31% Similarity=0.492 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLKKELSARWEHID 276 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELKqELsQqw~hID 276 (561)
+.|+++.+|+++++.+.+.|+.+|++++.++.+|+.+++
T Consensus 632 ~AEr~~~~EL~~~~~~l~~l~~si~~lk~k~~~Q~~~i~ 670 (717)
T PF10168_consen 632 EAEREFKKELERMKDQLQDLKASIEQLKKKLDYQQRQIE 670 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 347888889999999999999999999999988777665
No 66
>PF11221 Med21: Subunit 21 of Mediator complex; InterPro: IPR021384 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med21 has been known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in Drosophila. The heterodimer of the two subunits Med7 and Med21 appears to act as a hinge between the middle and the tail regions of Mediator []. ; PDB: 1YKE_B 1YKH_B.
Probab=71.47 E-value=13 Score=34.31 Aligned_cols=60 Identities=20% Similarity=0.274 Sum_probs=45.5
Q ss_pred HHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 203 CFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 203 cFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-|+.|=+.+|.+.. |--. -.+-|+.|++++++.++|.++..+|.++|..+++++=+++.+
T Consensus 84 qIe~LIdsLPg~~~----see~----Q~~~i~~L~~E~~~~~~el~~~v~e~e~ll~~v~~~i~~ia~ 143 (144)
T PF11221_consen 84 QIEYLIDSLPGIEV----SEEE----QLKRIKELEEENEEAEEELQEAVKEAEELLKQVQELIREIAR 143 (144)
T ss_dssp HHHHHHHHSTTSSS-----HHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--
T ss_pred HHHHHHHhCCCCCC----CHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 68899999999542 2111 224699999999999999999999999998888887766543
No 67
>KOG3582|consensus
Probab=70.94 E-value=1 Score=51.56 Aligned_cols=66 Identities=15% Similarity=0.064 Sum_probs=54.5
Q ss_pred CCCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCC--cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 182 ISGIREVHNKLEKNRRAHLKECFEILKRQVPPAQE--EKKSSNLSILHSAIRYIQFLRRREREFEHEMERL 250 (561)
Q Consensus 182 ~SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsD--n~KaSKAsILrKAiDYIQkLQreEreLE~EkERL 250 (561)
++-.|+-|...+|+||.++++.|+.|-++.|.+.. ..+.++.+||. |.|+.++++.+...+.++.+
T Consensus 784 n~~v~a~sih~lrr~~~~~~dq~~sL~alrp~v~~~~~ql~S~tS~L~---dp~~~~eq~ska~~e~~~~k 851 (856)
T KOG3582|consen 784 NGMVSAGSIHALRRTRLNWLDQFCSLPALRPQVLLNLRQLLSSTSILT---DPIKQPEQASKAVTEKIEGK 851 (856)
T ss_pred cceeecchHHHHHHHHHHHhhccccHHHHHHHHHhhHHHhhhhhhccc---CcccchHHHHHHHHhhhhhh
Confidence 44568889999999999999999999999998743 45689999998 99999999887766655443
No 68
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=70.64 E-value=25 Score=29.68 Aligned_cols=36 Identities=19% Similarity=0.183 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
++.|..+-.++...-++|++||..|+++++.++.|-
T Consensus 2 L~~Le~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER 37 (65)
T TIGR02449 2 LQALAAQVEHLLEYLERLKSENRLLRAQEKTWREER 37 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777777777777888888888888888877764
No 69
>KOG4196|consensus
Probab=70.49 E-value=8.9 Score=36.39 Aligned_cols=39 Identities=21% Similarity=0.416 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
++.++++ .+||.++.+|++|++.|++.+++++.|+.. |+
T Consensus 70 ~KRv~Qk-~eLE~~k~~L~qqv~~L~~e~s~~~~E~da-~k 108 (135)
T KOG4196|consen 70 VKRVQQK-HELEKEKAELQQQVEKLKEENSRLRRELDA-YK 108 (135)
T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH
Confidence 4555554 457777778888888888888888888777 66
No 70
>KOG2264|consensus
Probab=69.91 E-value=11 Score=43.27 Aligned_cols=32 Identities=28% Similarity=0.309 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
|++..-+.+.|+++|+-++++.+.+++||.+.
T Consensus 119 lk~~i~~~q~eL~~Lk~~ieqaq~~~~El~~~ 150 (907)
T KOG2264|consen 119 LKRLIPQKQLELSALKGEIEQAQRQLEELRET 150 (907)
T ss_pred HHHHHHHhHHHHHHHHhHHHHHHHHHHHHHhh
Confidence 33333333445555555555555555555544
No 71
>PHA01750 hypothetical protein
Probab=69.88 E-value=19 Score=31.21 Aligned_cols=29 Identities=17% Similarity=0.340 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqELs 269 (561)
..+..|++.+..+++.|++++.++++.+.
T Consensus 45 dNL~~ei~~~kikqDnl~~qv~eik~k~d 73 (75)
T PHA01750 45 DNLKTEIEELKIKQDELSRQVEEIKRKLD 73 (75)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHhhc
Confidence 34445555555556666666666665543
No 72
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=69.50 E-value=10 Score=32.23 Aligned_cols=41 Identities=24% Similarity=0.263 Sum_probs=23.0
Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 227 HSAIRYIQF---LRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 227 rKAiDYIQk---LQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
++|..+|+. +.+.-.+.+.+++.|+++++..++-|+.++..
T Consensus 38 q~ar~~i~~lpgi~~s~eeq~~~i~~Le~~i~~k~~~L~~~~~~ 81 (83)
T PF07544_consen 38 QKARAAIRELPGIDRSVEEQEEEIEELEEQIRKKREVLQKFKER 81 (83)
T ss_pred HHHHHHHHhCCCccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444 33333445566677777776666666666654
No 73
>KOG2264|consensus
Probab=69.18 E-value=14 Score=42.44 Aligned_cols=33 Identities=12% Similarity=0.223 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+.+|+.+.+++..|++++...+++|++.|.+-|
T Consensus 95 L~ele~krqel~seI~~~n~kiEelk~~i~~~q 127 (907)
T KOG2264|consen 95 LTELEVKRQELNSEIEEINTKIEELKRLIPQKQ 127 (907)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhH
Confidence 444555555555555555444444444433333
No 74
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=69.18 E-value=14 Score=28.79 Aligned_cols=34 Identities=21% Similarity=0.326 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|+..=..+..+.+.|++|++.|+.++..|+..+.
T Consensus 10 LK~~yd~Lk~~~~~L~~E~~~L~aev~~L~~kl~ 43 (45)
T PF02183_consen 10 LKASYDSLKAEYDSLKKENEKLRAEVQELKEKLQ 43 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 5555556666777777777777777777776653
No 75
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=68.91 E-value=24 Score=31.24 Aligned_cols=33 Identities=18% Similarity=0.278 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
-++++++++.+|++|.++|+.|+..-+.++...
T Consensus 24 k~~ka~~~~~kL~~en~qlk~Ek~~~~~qvkn~ 56 (87)
T PF10883_consen 24 KVKKAKKQNAKLQKENEQLKTEKAVAETQVKNA 56 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 377888888888888888888888877777643
No 76
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=68.75 E-value=30 Score=33.45 Aligned_cols=19 Identities=37% Similarity=0.553 Sum_probs=14.5
Q ss_pred hHHHHHHHHHHHHhccCCC
Q psy11481 195 NRRAHLKECFEILKRQVPP 213 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPs 213 (561)
-|+.-...||++|=..|--
T Consensus 59 vr~~ly~~~F~ELIRQVTi 77 (189)
T PF10211_consen 59 VREELYSQCFDELIRQVTI 77 (189)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3566666699999888876
No 77
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=68.70 E-value=24 Score=28.82 Aligned_cols=30 Identities=23% Similarity=0.289 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 235 FLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 235 kLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
.+..+..+++.++++++.|+.+|+.+++.|
T Consensus 28 ~~~~~~~~~~~~~~~l~~en~~L~~ei~~l 57 (85)
T TIGR02209 28 QLNNELQKLQLEIDKLQKEWRDLQLEVAEL 57 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444444433
No 78
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=68.53 E-value=14 Score=32.77 Aligned_cols=66 Identities=20% Similarity=0.333 Sum_probs=38.9
Q ss_pred HHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 200 LKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 200 LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+...++.+..-+=.+..+.--.|+..|+|. |=..-+.+.++++.+++.|.+|++.|+.+|+.-+.|
T Consensus 13 v~~~ve~vA~eLh~~YssKHE~KV~~LKks--Ye~rwek~v~~L~~e~~~l~~E~e~L~~~l~~e~~E 78 (87)
T PF12709_consen 13 VEKAVEKVARELHALYSSKHETKVKALKKS--YEARWEKKVDELENENKALKRENEQLKKKLDTEREE 78 (87)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444443333345789999875 434455556666667777777777777766655444
No 79
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=68.37 E-value=25 Score=37.93 Aligned_cols=43 Identities=19% Similarity=0.182 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+-.++..+..++..++..++..|.++.++|+++|++|+++|.+
T Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~ 170 (525)
T TIGR02231 128 EWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNA 170 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3344455555666666666677777777777777777777644
No 80
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=67.87 E-value=16 Score=41.82 Aligned_cols=39 Identities=28% Similarity=0.476 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|.-..++++.+++|.|+.+|++|......++.+|+.++.
T Consensus 538 e~~e~~r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~ 576 (697)
T PF09726_consen 538 ECAESCRQRRRQLESELKKLRRELKQKEEQIRELESELQ 576 (697)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556678888999999999999999999999999988874
No 81
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=67.74 E-value=59 Score=32.13 Aligned_cols=31 Identities=26% Similarity=0.318 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhh
Q psy11481 240 EREFEHEMERLAREKIHAQQRLA---LLKKELSA 270 (561)
Q Consensus 240 EreLE~EkERLRREneaLQQRLe---ELKqELsQ 270 (561)
..++.+|.++|++|+.+|+.++. +|+.|+.+
T Consensus 71 ~~~l~~en~~L~~e~~~l~~~~~~~~~l~~en~~ 104 (276)
T PRK13922 71 LFDLREENEELKKELLELESRLQELEQLEAENAR 104 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44567778888888888877776 44555443
No 82
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=67.56 E-value=18 Score=36.76 Aligned_cols=44 Identities=11% Similarity=0.136 Sum_probs=22.4
Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHh
Q psy11481 226 LHSAIRYIQF---LRRREREFEHEMERLAREKI----HAQQRLALLKKELS 269 (561)
Q Consensus 226 LrKAiDYIQk---LQreEreLE~EkERLRREne----aLQQRLeELKqELs 269 (561)
.....+|+.. |++|+++|++|+.+|+++.+ +|+++.++|++.|.
T Consensus 58 ~~~~~~~~~~~~~l~~EN~~Lr~e~~~l~~~~~~~~~~l~~EN~rLr~LL~ 108 (283)
T TIGR00219 58 FDGISENLKDVNNLEYENYKLRQELLKKNQQLEILTQNLKQENVRLRELLN 108 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3344444443 55666666666555533222 25555555555543
No 83
>KOG4571|consensus
Probab=67.43 E-value=38 Score=35.77 Aligned_cols=38 Identities=21% Similarity=0.261 Sum_probs=33.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
..-++.|.++|++|..+.++|.||++.||+-+.+..++
T Consensus 254 ~ge~~~Le~rN~~LK~qa~~lerEI~ylKqli~e~~~~ 291 (294)
T KOG4571|consen 254 LGELEGLEKRNEELKDQASELEREIRYLKQLILEVYKK 291 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34688899999999999999999999999999887654
No 84
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=67.23 E-value=43 Score=35.48 Aligned_cols=46 Identities=11% Similarity=0.142 Sum_probs=30.9
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
.+=+.+|..+.+-.+.|+.+...+.+.+.+++-++..|+.+++..+
T Consensus 68 ~~La~lL~~sre~Nk~L~~Ev~~Lrqkl~E~qGD~KlLR~~la~~r 113 (319)
T PF09789_consen 68 KNLAQLLSESREQNKKLKEEVEELRQKLNEAQGDIKLLREKLARQR 113 (319)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHhhh
Confidence 4446788888888888877777777666666666555555555443
No 85
>COG5019 CDC3 Septin family protein [Cell division and chromosome partitioning / Cytoskeleton]
Probab=67.12 E-value=34 Score=37.07 Aligned_cols=40 Identities=23% Similarity=0.208 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+-.+.|++.++++.+|++++..+.++.+.+++.++.++.+
T Consensus 328 ~~e~~l~~~E~~l~~e~~~~n~~Le~~~~~l~~~e~~l~~ 367 (373)
T COG5019 328 EKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEK 367 (373)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3344555555555555555555555555566665555543
No 86
>PF10226 DUF2216: Uncharacterized conserved proteins (DUF2216); InterPro: IPR019359 Proteins in this entry are found in Metazoa and contain a coiled-coil domain. Some annotation suggests it might be PKR, the Hepatitis delta antigen-interacting protein A, but this could not be confirmed.
Probab=66.50 E-value=34 Score=34.30 Aligned_cols=12 Identities=58% Similarity=0.736 Sum_probs=4.8
Q ss_pred ecCC-CCCCCccC
Q psy11481 328 RSDG-GESGSSLG 339 (561)
Q Consensus 328 ~~~~-~~~~~~~~ 339 (561)
+|.| |.|+|+-|
T Consensus 176 rD~GdGSS~sS~g 188 (195)
T PF10226_consen 176 RDVGDGSSTSSTG 188 (195)
T ss_pred CCCCCCCCCCCCC
Confidence 4444 33334433
No 87
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=66.38 E-value=19 Score=32.05 Aligned_cols=44 Identities=20% Similarity=0.348 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 224 SILHSAIRYIQFLRR----REREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 224 sILrKAiDYIQkLQr----eEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
.+++=+++|+-..++ ....++.+++.+.++.+.|++++.+++.+
T Consensus 62 rLaQl~ieYLl~~q~~L~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~ 109 (118)
T PF13815_consen 62 RLAQLSIEYLLHCQEYLSSQLEQLEERLQELQQEIEKLKQKLKKQKEE 109 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556667888654443 33334444444444444444444444433
No 88
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=66.23 E-value=22 Score=35.98 Aligned_cols=51 Identities=24% Similarity=0.311 Sum_probs=40.1
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
.+=+.|+.+-.| ..++++.+||+|+.++++++..|+++++.||...-+=++
T Consensus 78 ~siLpIVtsQRD---RFR~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~kLYE 128 (248)
T PF08172_consen 78 SSILPIVTSQRD---RFRQRNAELEEELRKQQQTISSLRREVESLRADNVKLYE 128 (248)
T ss_pred ccHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666555 567889999999999999999999999999998765333
No 89
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=66.07 E-value=38 Score=29.67 Aligned_cols=70 Identities=20% Similarity=0.264 Sum_probs=50.6
Q ss_pred HHHHHHHHhccCCCCCCcCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 200 LKECFEILKRQVPPAQEEKK-SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 200 LKdcFeeLRdlVPsLsDn~K-aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.-+++|..+=|. ..+-| .-.+-|.+.--+-+..|+.+...++..++.|.++...|+.++.+++.+|.+
T Consensus 36 ~~~v~~eL~~l~~d-~~vyk~VG~vlv~~~~~e~~~~l~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l~~ 106 (110)
T TIGR02338 36 AEKALEELERLPDD-TPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQE 106 (110)
T ss_pred HHHHHHHHHcCCCc-chhHHHhchhhheecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456666655444 33333 344566666666788888888889999999999999999999999988765
No 90
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=65.33 E-value=33 Score=28.56 Aligned_cols=30 Identities=17% Similarity=0.245 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
++.+.++.+.....|..++..|+.++++++
T Consensus 30 ~e~kLqeaE~rn~eL~~ei~~L~~e~ee~r 59 (61)
T PF08826_consen 30 FESKLQEAEKRNRELEQEIERLKKEMEELR 59 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 334444444444444444444444544444
No 91
>PF07989 Microtub_assoc: Microtubule associated; InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=64.96 E-value=19 Score=30.65 Aligned_cols=67 Identities=21% Similarity=0.271 Sum_probs=44.7
Q ss_pred hhHHHHhHHH--HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 189 HNKLEKNRRA--HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 189 HNaLERKRRd--~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
-..+++=+.. +||-++--|.+.+... +-+.++.+.+++-++..+++.|++|...+++.|.++.+
T Consensus 6 e~~i~~L~KENF~LKLrI~fLee~l~~~--------------~~~~~~~~~keNieLKve~~~L~~el~~~~~~l~~a~~ 71 (75)
T PF07989_consen 6 EEQIDKLKKENFNLKLRIYFLEERLQKL--------------GPESIEELLKENIELKVEVESLKRELQEKKKLLKEAEK 71 (75)
T ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHhc--------------ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444 3455555555555431 22357788888888888999999999888888888877
Q ss_pred HHh
Q psy11481 267 ELS 269 (561)
Q Consensus 267 ELs 269 (561)
.+.
T Consensus 72 ~~e 74 (75)
T PF07989_consen 72 AIE 74 (75)
T ss_pred Hhh
Confidence 653
No 92
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=64.72 E-value=21 Score=30.66 Aligned_cols=42 Identities=31% Similarity=0.583 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+|.+| |+++.+.++.+++.|+.+...++.++..++..+.+
T Consensus 75 ~~eA~~~---l~~r~~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~ 116 (120)
T PF02996_consen 75 LEEAIEF---LKKRIKELEEQLEKLEKELAELQAQIEQLEQTLQQ 116 (120)
T ss_dssp HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHH
T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555 45566666677777777777777777777666554
No 93
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=64.16 E-value=26 Score=29.64 Aligned_cols=30 Identities=23% Similarity=0.295 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
|..|++++..+..+.+.|++|+..|++.-+
T Consensus 27 ~eeLke~n~~L~~e~~~L~~en~~L~~e~~ 56 (72)
T PF06005_consen 27 NEELKEKNNELKEENEELKEENEQLKQERN 56 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 444555544444444444444444443333
No 94
>PF05565 Sipho_Gp157: Siphovirus Gp157; InterPro: IPR008840 This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophage. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage [].
Probab=63.90 E-value=28 Score=32.62 Aligned_cols=43 Identities=23% Similarity=0.279 Sum_probs=38.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSAR 271 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQq 271 (561)
.+.||+.|+.+...+..|+++|+.....+..+++.|+.-|...
T Consensus 45 ~~~~Ik~~ea~~e~~k~E~krL~~rkk~~e~~~~~Lk~yL~~~ 87 (162)
T PF05565_consen 45 IAKVIKNLEADIEAIKAEIKRLQERKKSIENRIDRLKEYLLDA 87 (162)
T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3457889999999999999999999999999999999998773
No 95
>PF11740 KfrA_N: Plasmid replication region DNA-binding N-term; InterPro: IPR021104 The KfrA family of protiens are encoded on plasmids, generally in or near gene clusters invloved in stable inheritance functions. These proteins are thought to form an all-helical structure, consisting of an N-terminal helix-turn-helix DNA binding domain and an extended coiled-coil tail. The best-characterised KfrA protein, encoded on the broad host-range Plasmid RK2, is a site-specific DNA-binding protein whose operator overlaps its own promoter. The DNA-binding domain is essential for function, while the coiled-coil domain is probably responsible for formation of multimers, and may provide an example of a bridge to host structures required for plasmid partitioning []. This entry represents the N-terminal DNA-binding domain.
Probab=63.68 E-value=64 Score=27.77 Aligned_cols=33 Identities=27% Similarity=0.282 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|......++.+++.+..+..++..+++.|+.++
T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~e~ 118 (120)
T PF11740_consen 86 LEAARAELEQERAAAEAELAEAEAQAEELEAEL 118 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333334444444444444444445555444443
No 96
>PRK10884 SH3 domain-containing protein; Provisional
Probab=63.51 E-value=30 Score=34.15 Aligned_cols=34 Identities=18% Similarity=0.220 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++...++++|.++|++|.+.++++++.|+.++..
T Consensus 131 ~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~ 164 (206)
T PRK10884 131 DSVINGLKEENQKLKNQLIVAQKKVDAANLQLDD 164 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344455555555555555555555555544
No 97
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=63.22 E-value=15 Score=30.23 Aligned_cols=33 Identities=27% Similarity=0.334 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|.+.-.+++.++++|+++...|..||.+++
T Consensus 20 ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~ 52 (69)
T PF04102_consen 20 IEELNDVVTEQQRQIDRLQRQLRLLRERLRELE 52 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 556666666666667777777777766666665
No 98
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=62.90 E-value=75 Score=36.03 Aligned_cols=51 Identities=25% Similarity=0.238 Sum_probs=41.5
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++-.-|..+..-|..|+...+-++.|+++|++|+++|.+-+..|+.+|..
T Consensus 406 D~n~vqlsE~~rel~Elks~lrv~qkEKEql~~EkQeL~~yi~~Le~r~~~ 456 (546)
T PF07888_consen 406 DCNRVQLSENRRELQELKSSLRVAQKEKEQLQEEKQELLEYIERLEQRLDK 456 (546)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344446666777788888888888899999999999999999999998853
No 99
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=62.66 E-value=25 Score=30.22 Aligned_cols=38 Identities=29% Similarity=0.562 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.|+++...++.+++.|+++...+++++++++..+.+
T Consensus 89 ~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~~ 126 (129)
T cd00890 89 IEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEELQQ 126 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55666777777888888888888888888888887765
No 100
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=62.22 E-value=19 Score=37.31 Aligned_cols=6 Identities=50% Similarity=0.755 Sum_probs=2.2
Q ss_pred HHHHhc
Q psy11481 204 FEILKR 209 (561)
Q Consensus 204 FeeLRd 209 (561)
|++||.
T Consensus 79 ~eeLrg 84 (290)
T COG4026 79 FEELRG 84 (290)
T ss_pred HHHHHH
Confidence 333333
No 101
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=62.17 E-value=62 Score=28.03 Aligned_cols=50 Identities=20% Similarity=0.286 Sum_probs=35.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 221 SNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 221 SKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-.+.|...--+-+..|+.+...++.++++|..+...+..++.+++.+|..
T Consensus 53 G~vfv~~~~~ea~~~Le~~~e~le~~i~~l~~~~~~l~~~~~elk~~l~~ 102 (105)
T cd00632 53 GNVLVKQEKEEARTELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQ 102 (105)
T ss_pred hhHHhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445544555577777777778888888888888888888888877653
No 102
>KOG1962|consensus
Probab=61.63 E-value=45 Score=33.74 Aligned_cols=78 Identities=17% Similarity=0.124 Sum_probs=41.4
Q ss_pred hhhhhHHHHhHHHHH----HHHHHHHhccCCCCCCcCCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 186 REVHNKLEKNRRAHL----KECFEILKRQVPPAQEEKKSSN-LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQR 260 (561)
Q Consensus 186 RaTHNaLERKRRd~L----KdcFeeLRdlVPsLsDn~KaSK-AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQR 260 (561)
|+.|..+++-.+..= ...-+.|++.+-. .+| ..=..+..+-...|+++-++.+.+.+.++.++.+|+++
T Consensus 114 ~R~~~ll~~l~~l~~~~~~~~~~~~lk~~~~~------~~~~~~~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq 187 (216)
T KOG1962|consen 114 RRLHTLLRELATLRANEKAMKENEALKKQLEN------SSKLEEENDKLKADLEKLETELEKKQKKLEKAQKKVDALKKQ 187 (216)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhc------ccchhhhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 666777776653332 1233344443333 121 22233334445566666666666666666666666666
Q ss_pred HHHHHHHHh
Q psy11481 261 LALLKKELS 269 (561)
Q Consensus 261 LeELKqELs 269 (561)
.++++.|..
T Consensus 188 ~e~~~~Eyd 196 (216)
T KOG1962|consen 188 SEGLQDEYD 196 (216)
T ss_pred HHHcccHHH
Confidence 666666643
No 103
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=61.56 E-value=22 Score=30.50 Aligned_cols=46 Identities=20% Similarity=0.171 Sum_probs=35.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 221 SNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 221 SKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
+.-.-..-.-+-|+.|++..++++.+++.++.+...+++.++++.+
T Consensus 74 s~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~~~~ 119 (120)
T PF02996_consen 74 SLEEAIEFLKKRIKELEEQLEKLEKELAELQAQIEQLEQTLQQLYQ 119 (120)
T ss_dssp EHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3333344445578888888888999999999999999999888764
No 104
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=61.52 E-value=18 Score=29.27 Aligned_cols=66 Identities=20% Similarity=0.249 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHh
Q psy11481 197 RAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLA-REKIHAQQRLALLKKELS 269 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLR-REneaLQQRLeELKqELs 269 (561)
.+.|+..+..+... . ...|-..|+++-.+|...++..++|+.|+..+- .+...++.+|..++.+|.
T Consensus 5 ~~~i~~~l~~~~~~----~---~~~r~~~i~~~e~~l~ea~~~l~qMe~E~~~~p~s~r~~~~~kl~~yr~~l~ 71 (79)
T PF05008_consen 5 TAEIKSKLERIKNL----S---GEQRKSLIREIERDLDEAEELLKQMELEVRSLPPSERNQYKSKLRSYRSELK 71 (79)
T ss_dssp HHHHHHHHHHGGGS--------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCTS-HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhcc----C---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHH
Confidence 34555555555522 1 247777888888888888888888876665443 444445555555555543
No 105
>COG1382 GimC Prefoldin, chaperonin cofactor [Posttranslational modification, protein turnover, chaperones]
Probab=61.39 E-value=27 Score=32.43 Aligned_cols=39 Identities=23% Similarity=0.385 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-+..|+++...++.+++.|++.-+.++.|+++|+.++-+
T Consensus 71 ~~~eL~er~E~Le~ri~tLekQe~~l~e~l~eLq~~i~~ 109 (119)
T COG1382 71 AVDELEERKETLELRIKTLEKQEEKLQERLEELQSEIQK 109 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 478899999999999999999999999999999999866
No 106
>PF13942 Lipoprotein_20: YfhG lipoprotein
Probab=61.17 E-value=64 Score=32.07 Aligned_cols=76 Identities=12% Similarity=0.218 Sum_probs=44.7
Q ss_pred HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11481 199 HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREF-------EHEMERLAREKIHAQQRLALLKKELSAR 271 (561)
Q Consensus 199 ~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreL-------E~EkERLRREneaLQQRLeELKqELsQq 271 (561)
.+-+.++.++..+|.-- + +=..+-+.----+-.|-++..+| ..+++.|++++..|+.+|+.-.++|..
T Consensus 81 ~~~~~l~~y~~~~P~~v---R-PL~QlWRe~Q~lql~L~eEr~Ry~rLQqssD~~lD~Lr~qq~~Lq~qL~~T~RKLEn- 155 (179)
T PF13942_consen 81 QMVDRLNSYSLQFPASV---R-PLLQLWREQQVLQLQLSEERARYQRLQQSSDSELDALRQQQQRLQYQLDTTTRKLEN- 155 (179)
T ss_pred HHHHHHHHhhhhcCHHH---h-HHHHHHHHhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhh-
Confidence 45677888888888721 1 11111111111122233333332 367888888888888888888888763
Q ss_pred hcccccccCCCCCccccccc
Q psy11481 272 WEHIDFNTLIPDNMEVDIPY 291 (561)
Q Consensus 272 w~hIDfStLlPd~lEIDRvl 291 (561)
-..|+|+|
T Consensus 156 ------------LTDIERQL 163 (179)
T PF13942_consen 156 ------------LTDIERQL 163 (179)
T ss_pred ------------hhHHHHHH
Confidence 45678876
No 107
>PRK11637 AmiB activator; Provisional
Probab=60.79 E-value=26 Score=36.88 Aligned_cols=39 Identities=13% Similarity=0.200 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+-|..++++..+++.++++|++++..++.+|++++..+.
T Consensus 89 ~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~ 127 (428)
T PRK11637 89 RKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLLA 127 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555556666666666666666655555553
No 108
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=60.65 E-value=68 Score=37.36 Aligned_cols=69 Identities=22% Similarity=0.222 Sum_probs=30.8
Q ss_pred HHHhH-HHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 192 LEKNR-RAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 192 LERKR-Rd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
+||+| |+.||++=..=..++-...+-. ---.-|+|-+.+.+.-|-+-.-+.+|+.+|..|.+.|+.+++
T Consensus 76 ~~~~~lr~e~ke~K~rE~rll~dyselE--eENislQKqvs~Lk~sQvefE~~Khei~rl~Ee~~~l~~qle 145 (717)
T PF09730_consen 76 LERKRLREEIKEYKFREARLLQDYSELE--EENISLQKQVSVLKQSQVEFEGLKHEIKRLEEEIELLNSQLE 145 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhHHHH--HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444 7777774333223333322100 001235555555555444444455555555555555544443
No 109
>PF11414 Suppressor_APC: Adenomatous polyposis coli tumour suppressor protein; PDB: 1M5I_A.
Probab=60.54 E-value=34 Score=29.88 Aligned_cols=39 Identities=18% Similarity=0.230 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
..+++.|+++..-+..+++.++++.....++|..+++..
T Consensus 6 lk~mkeLEqEkd~LLqgLe~~Er~r~Wy~~qL~~vq~rq 44 (84)
T PF11414_consen 6 LKRMKELEQEKDVLLQGLEMEERERDWYQQQLQSVQERQ 44 (84)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457899999999999999999999999999999999864
No 110
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=60.37 E-value=33 Score=26.69 Aligned_cols=32 Identities=22% Similarity=0.240 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQ 259 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQ 259 (561)
+--.|+..|+.+...|+.+...|+.++..|++
T Consensus 22 rkk~~~~~le~~~~~L~~en~~L~~~i~~L~~ 53 (54)
T PF07716_consen 22 RKKQREEELEQEVQELEEENEQLRQEIAQLER 53 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34456677777777777777777777776654
No 111
>PRK09413 IS2 repressor TnpA; Reviewed
Probab=60.27 E-value=19 Score=31.89 Aligned_cols=33 Identities=6% Similarity=0.098 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
++.++++.++|+.++.+|+.|++.|++.++-..
T Consensus 73 ~~~~~~ei~~L~~el~~L~~E~diLKKa~~~~~ 105 (121)
T PRK09413 73 LAAAMKQIKELQRLLGKKTMENELLKEAVEYGR 105 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 445677777777777788777777777665444
No 112
>KOG0709|consensus
Probab=59.88 E-value=15 Score=40.64 Aligned_cols=52 Identities=21% Similarity=0.222 Sum_probs=41.1
Q ss_pred CcchHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 219 KSSNLSILHSAIRYIQFLRRR-------EREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 219 KaSKAsILrKAiDYIQkLQre-------EreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|.|+-.-=+|--+||..|+.+ +++|++++++|.++++.|.++|..||..+-+
T Consensus 260 K~SAQESRrkKkeYid~LE~rv~~~taeNqeL~kkV~~Le~~N~sLl~qL~klQt~v~q 318 (472)
T KOG0709|consen 260 KRSAQESRRKKKEYIDGLESRVSAFTAENQELQKKVEELELSNRSLLAQLKKLQTLVIQ 318 (472)
T ss_pred hhhhHHHHHhHhhHHHHHhhhhhhcccCcHHHHHHHHHHhhccHHHHHHHHHHHHHHhh
Confidence 445555556778999999875 6788888888999999999999988887655
No 113
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=59.83 E-value=12 Score=26.26 Aligned_cols=20 Identities=25% Similarity=0.353 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy11481 246 EMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 246 EkERLRREneaLQQRLeELK 265 (561)
||++|+.++..|.++|++++
T Consensus 2 E~~rlr~rI~dLer~L~~C~ 21 (23)
T PF04508_consen 2 EMNRLRNRISDLERQLSECR 21 (23)
T ss_pred hHHHHHHHHHHHHHHHHHHh
Confidence 67777777777777777665
No 114
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=59.58 E-value=19 Score=41.21 Aligned_cols=38 Identities=26% Similarity=0.323 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|..|.++.++++.|+..|+++++++++.++.|+.+|.+
T Consensus 424 i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~ 461 (652)
T COG2433 424 IKKLEETVERLEEENSELKRELEELKREIEKLESELER 461 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444555566666666666666666666666654
No 115
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=59.54 E-value=25 Score=35.25 Aligned_cols=37 Identities=11% Similarity=0.151 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|+++...++.|+.+||=++++++.+|++++++--+
T Consensus 57 ~~l~~ql~~lq~ev~~LrG~~E~~~~~l~~~~~rq~~ 93 (263)
T PRK10803 57 TQLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQ 93 (263)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 3455555556666666666666666666655554333
No 116
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=59.39 E-value=40 Score=30.24 Aligned_cols=40 Identities=33% Similarity=0.517 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+..|++|+ +++...++..++.|+.+...+++++++++..+
T Consensus 92 ~~eA~~~l---~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l 131 (140)
T PRK03947 92 LDEAIEIL---DKRKEELEKALEKLEEALQKLASRIAQLAQEL 131 (140)
T ss_pred HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 66666653 34444444555555555555555555555444
No 117
>PRK11637 AmiB activator; Provisional
Probab=59.18 E-value=32 Score=36.20 Aligned_cols=37 Identities=14% Similarity=0.234 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
.-|..++++..+++.++++++++...++.++..+-+.
T Consensus 96 ~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra 132 (428)
T PRK11637 96 NTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDA 132 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555555544444444444433
No 118
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=58.35 E-value=35 Score=37.92 Aligned_cols=34 Identities=24% Similarity=0.433 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|+.+++++++|.++|+.|+..|+.+|.+|+++|.
T Consensus 107 v~~~~~~~~~~~~ql~~~~~~~~~~l~~l~~~l~ 140 (472)
T TIGR03752 107 VQSETQELTKEIEQLKSERQQLQGLIDQLQRRLA 140 (472)
T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4445556666777777777777777777777663
No 119
>PF06424 PRP1_N: PRP1 splicing factor, N-terminal; InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=58.28 E-value=8.8 Score=36.00 Aligned_cols=37 Identities=38% Similarity=0.603 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hhhhccc
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKEL----SARWEHI 275 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqEL----sQqw~hI 275 (561)
++.+...++++.++++-..+++++.||+.| ..+|+.|
T Consensus 84 re~~~~~e~e~~~~~~pkI~~QFaDLKR~La~VS~eeW~~I 124 (133)
T PF06424_consen 84 REAREKEEIEKYRKENPKIQQQFADLKRSLATVSEEEWENI 124 (133)
T ss_pred hhhhhhhHHHhhhccCchHHHHHHHHHHHHccCCHHHHhcC
Confidence 344555666677777778899999999999 3569855
No 120
>PF09311 Rab5-bind: Rabaptin-like protein; InterPro: IPR015390 This domain is predominantly found in Rabaptin and allows for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes []. ; PDB: 3NF1_A 3CEQ_B 3EDT_H 1X79_C 1TU3_F.
Probab=58.16 E-value=6.3 Score=37.43 Aligned_cols=45 Identities=18% Similarity=0.296 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+.....+++..|..+.+++..++.+|.+|+..|+.++..-+..|
T Consensus 8 q~~~~l~~~L~~l~~erqkl~~qv~rL~qEN~~Lr~el~~tq~~l 52 (181)
T PF09311_consen 8 QVMRALQQHLQSLEAERQKLRAQVRRLCQENDWLRGELANTQQKL 52 (181)
T ss_dssp HHHHHHHHHHHHHHHCCHHHHT-----------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556789999999999999999999999999999999888888
No 121
>PRK09039 hypothetical protein; Validated
Probab=58.09 E-value=74 Score=33.35 Aligned_cols=14 Identities=50% Similarity=0.669 Sum_probs=9.2
Q ss_pred hhhhhcccCCCCcCC
Q psy11481 411 SVINTLSASNGVPAS 425 (561)
Q Consensus 411 ~~~~~~~~~~~~~~~ 425 (561)
+|.+.|. .+|||..
T Consensus 295 aV~~~Li-~~Gi~~~ 308 (343)
T PRK09039 295 SVVKFLI-ALGVPAD 308 (343)
T ss_pred HHHHHHH-HCCCCHH
Confidence 4666666 6788763
No 122
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=58.00 E-value=53 Score=30.48 Aligned_cols=26 Identities=27% Similarity=0.362 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLA 251 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLR 251 (561)
++...+-++.+++..+++.+|+.+|+
T Consensus 125 ~~~~~~~l~~l~~~~~~~~~e~~~l~ 150 (191)
T PF04156_consen 125 LKSVEERLDSLDESIKELEKEIRELQ 150 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444
No 123
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=57.96 E-value=23 Score=31.49 Aligned_cols=34 Identities=9% Similarity=0.075 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
+-.+++++++++++|+.+|+++.++|+.++.. |.
T Consensus 28 ~~~~l~~q~~~~~~e~~~l~~~n~~L~~eI~~-L~ 61 (105)
T PRK00888 28 DYWRVNDQVAAQQQTNAKLKARNDQLFAEIDD-LK 61 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hh
Confidence 55677888899999999999999999999876 54
No 124
>PF12097 DUF3573: Protein of unknown function (DUF3573); InterPro: IPR021956 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 372 to 530 amino acids in length.
Probab=57.33 E-value=30 Score=37.45 Aligned_cols=118 Identities=15% Similarity=0.136 Sum_probs=58.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCCCCCccccccccCccCCCcccccCCC--------C
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDNHHHESSLLSYGKE--------R 307 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQa~~DSsilS~~~~--------~ 307 (561)
...++...++++.+||+++..||++|..|+++... ...-||.+-.+++.-- -.. ...+...++-..- -
T Consensus 33 ~~~~~~~~~~~i~~Lq~QI~~Lq~ei~~l~~~~~~-n~~~~f~tYsSkv~~~-~~~--~~~~~~~ls~~~~~~~~~~di~ 108 (383)
T PF12097_consen 33 TTQSNQNDQQEISELQKQIQQLQAEINQLEEQNNS-NNSSQFTTYSSKVDDN-TLG--GNGETKVLSNAQFNNFDASDIM 108 (383)
T ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCcccceehhhhhccc-ccc--ccCccccccccccCCcchhhhh
Confidence 33444445567788888888888888888877322 2344566654433110 010 0111111111000 0
Q ss_pred CCcCCCCceeEeecCcceeeecCCCCCCCccCCCCCcccCCCCCccccccCCc
Q psy11481 308 SYMDEDGGLVIVTNGSVGLERSDGGESGSSLGGGGEMISGRGEGEYGRAGSSS 360 (561)
Q Consensus 308 ~~~~~~g~l~~vtn~~~~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (561)
.+...+...+..-+..++=-.+ ..|+--+|+.|.|--+|+--|+-+-+.+
T Consensus 109 ~n~~~~~~~i~l~~~~~ggif~---~~g~IdVG~apaITTqGqvtylGsySgN 158 (383)
T PF12097_consen 109 SNINFSQSIIDLGSTPLGGIFS---SNGGIDVGNAPAITTQGQVTYLGSYSGN 158 (383)
T ss_pred hccccccceeecCCCCcCceec---CCCCeecCCCcceeecceEEEEeeccCC
Confidence 1112222222222221111111 2345668899999999999888765544
No 125
>PF15066 CAGE1: Cancer-associated gene protein 1 family
Probab=57.25 E-value=13 Score=41.32 Aligned_cols=53 Identities=13% Similarity=0.283 Sum_probs=44.5
Q ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy11481 224 SILHSAI---RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHID 276 (561)
Q Consensus 224 sILrKAi---DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hID 276 (561)
.-|.++. +-|++|+..+++.+..|..|+..+..|.+++++||-...+|-.-+|
T Consensus 307 qpleedmaLNEvL~kLk~tn~kQq~~IqdLq~sN~yLe~kvkeLQ~k~~kQqvfvD 362 (527)
T PF15066_consen 307 QPLEEDMALNEVLQKLKHTNRKQQNRIQDLQCSNLYLEKKVKELQMKITKQQVFVD 362 (527)
T ss_pred CCcHHHHHHHHHHHHHHhhhHHHHHHHHHhhhccHHHHHHHHHHHHHhhhhhHHHH
Confidence 3444544 4599999999999999999999999999999999999877655555
No 126
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=56.88 E-value=34 Score=29.96 Aligned_cols=38 Identities=29% Similarity=0.478 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.|+++...++..++.|.++...++++++++...+.+
T Consensus 88 ~~~l~~~~~~l~~~~~~l~~~l~~l~~~~~~i~~~l~~ 125 (126)
T TIGR00293 88 IEFLKKRIEELEKAIEKLQEALAELASRAQQLEQEAQQ 125 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 44566777778888888888888888888888877654
No 127
>PHA00728 hypothetical protein
Probab=56.85 E-value=12 Score=35.56 Aligned_cols=27 Identities=26% Similarity=0.361 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 244 EHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 244 E~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|+|+|++|+++|+++|++|..-+..
T Consensus 4 ~teveql~keneelkkkla~leal~nn 30 (151)
T PHA00728 4 LTEVEQLKKENEELKKKLAELEALMNN 30 (151)
T ss_pred hhHHHHHHHhHHHHHHHHHHHHHHHcC
Confidence 357899999999999999999977654
No 128
>PRK09343 prefoldin subunit beta; Provisional
Probab=56.54 E-value=80 Score=28.53 Aligned_cols=70 Identities=16% Similarity=0.223 Sum_probs=41.7
Q ss_pred HHHHHHHHhccCCCCCCcCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 200 LKECFEILKRQVPPAQEEKK-SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 200 LKdcFeeLRdlVPsLsDn~K-aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.....++|..+=+. ..--| .-++-|...--+-+..|..+..-++.++++|.++.+.|+.++.+++.+|.+
T Consensus 40 ~~~~~~EL~~L~~d-~~VYk~VG~vlv~qd~~e~~~~l~~r~E~ie~~ik~lekq~~~l~~~l~e~q~~l~~ 110 (121)
T PRK09343 40 INKALEELEKLPDD-TPIYKIVGNLLVKVDKTKVEKELKERKELLELRSRTLEKQEKKLREKLKELQAKINE 110 (121)
T ss_pred HHHHHHHHHcCCCc-chhHHHhhHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555544333 21111 344555555555566777777777777777777777777777777777643
No 129
>PRK00295 hypothetical protein; Provisional
Probab=56.19 E-value=36 Score=28.30 Aligned_cols=33 Identities=24% Similarity=0.142 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|-+.--+++.++++|+++...|.+||.++.
T Consensus 21 ie~Ln~~v~~Qq~~I~~L~~ql~~L~~rl~~~~ 53 (68)
T PRK00295 21 IQALNDVLVEQQRVIERLQLQMAALIKRQEEMV 53 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 455555555555566666666666655555554
No 130
>KOG0933|consensus
Probab=55.69 E-value=53 Score=39.85 Aligned_cols=80 Identities=25% Similarity=0.438 Sum_probs=56.2
Q ss_pred HhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhhh
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE-LSARW 272 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE-LsQqw 272 (561)
|.|=..|.+.|+.|+..|-. -....|.++=+-+..|+.+....+.++..+++-+..| ++.+++ |...|
T Consensus 964 re~l~~Lq~k~~~l~k~vn~-------~~m~mle~~E~~~~~lk~k~~~Ie~Dk~kI~ktI~~l----De~k~~~L~kaw 1032 (1174)
T KOG0933|consen 964 REELKKLQEKKEKLEKTVNP-------KNMDMLERAEEKEAALKTKKEIIEKDKSKIKKTIEKL----DEKKREELNKAW 1032 (1174)
T ss_pred HHHHHHhhHHHHHHHhhcCH-------HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH----HHHHHHHHHHHH
Confidence 34556788888888877643 2366777777777777777777766666665555443 555554 77889
Q ss_pred cccc------cccCCCCC
Q psy11481 273 EHID------FNTLIPDN 284 (561)
Q Consensus 273 ~hID------fStLlPd~ 284 (561)
.+++ |++|+|+.
T Consensus 1033 ~~VN~dFG~IFs~LLPga 1050 (1174)
T KOG0933|consen 1033 EKVNKDFGSIFSTLLPGA 1050 (1174)
T ss_pred HHHhhhHHHHHHHhCCCc
Confidence 9998 99999976
No 131
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=55.54 E-value=9.1 Score=42.41 Aligned_cols=35 Identities=11% Similarity=0.187 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
+.++|++|+++|++|+..|++|+++.++.-...|.
T Consensus 32 kie~L~kql~~Lk~q~~~l~~~v~k~e~~s~~d~l 66 (489)
T PF11853_consen 32 KIEALKKQLEELKAQQDDLNDRVDKVEKHSAGDNL 66 (489)
T ss_pred HHHHHHHHHHHHHHhhcccccccchhhHhhhcCcE
Confidence 55555555556666555666666555554333344
No 132
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=55.54 E-value=1e+02 Score=31.46 Aligned_cols=23 Identities=26% Similarity=0.394 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeE 263 (561)
+++++|.++|++|+.+|++++..
T Consensus 69 ~~l~~EN~~Lr~e~~~l~~~~~~ 91 (283)
T TIGR00219 69 NNLEYENYKLRQELLKKNQQLEI 91 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44566777777776666555443
No 133
>PF14916 CCDC92: Coiled-coil domain of unknown function
Probab=55.54 E-value=18 Score=30.25 Aligned_cols=37 Identities=24% Similarity=0.278 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+.-+.+.+. +|+++-... +..|-.|++.|+++..+|+
T Consensus 5 v~s~e~~i~---FLq~eH~~t---L~~LH~EIe~Lq~~~~dL~ 41 (60)
T PF14916_consen 5 VQSLEKSIL---FLQQEHAQT---LKGLHAEIERLQKRNKDLT 41 (60)
T ss_pred HHHHHHHHH---HHHHHHHHH---HHHHHHHHHHHHHhccccc
Confidence 344555544 455443322 3334444444444555554
No 134
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=55.44 E-value=41 Score=27.77 Aligned_cols=36 Identities=11% Similarity=0.236 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|..|+.+--+++..+..+++|++.++..++.+++..
T Consensus 2 i~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~env 37 (55)
T PF05377_consen 2 IDELENELPRIESSINTVKKENEEISESVEKIEENV 37 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456778888888888888888888888888888775
No 135
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=55.10 E-value=87 Score=25.57 Aligned_cols=37 Identities=19% Similarity=0.247 Sum_probs=27.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-..+-+.+.+..++++++++.+.++.+.++|+.|...
T Consensus 20 v~~~~~~~~~~~~~~~~~~~~~~l~~en~~L~~ei~~ 56 (85)
T TIGR02209 20 VSAQHQTRQLNNELQKLQLEIDKLQKEWRDLQLEVAE 56 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345556677788888888888888888888888765
No 136
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=54.73 E-value=39 Score=30.33 Aligned_cols=43 Identities=23% Similarity=0.337 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
..+|.+- |..|.+..++++.+++.++++.+++++.+.++..+.
T Consensus 96 ~~~l~~~---~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~~l~~~~ 138 (140)
T PRK03947 96 IEILDKR---KEELEKALEKLEEALQKLASRIAQLAQELQQLQQEA 138 (140)
T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4555554 445556666677788888888888888888887664
No 137
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=54.66 E-value=23 Score=33.37 Aligned_cols=31 Identities=23% Similarity=0.443 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+++++..+|+++|++|....+..++.||++.
T Consensus 154 ~~~~~~~~ei~~lk~el~~~~~~~~~LkkQ~ 184 (192)
T PF05529_consen 154 EENKKLSEEIEKLKKELEKKEKEIEALKKQS 184 (192)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444555555555555555555555443
No 138
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=54.61 E-value=54 Score=28.40 Aligned_cols=48 Identities=13% Similarity=0.132 Sum_probs=40.4
Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 218 KKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 218 ~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
.+.++-.++..=-+.+..|+.+.++++.+++.+.++...++.+|.+++
T Consensus 57 v~~~~~ea~~~Le~~~e~le~~i~~l~~~~~~l~~~~~elk~~l~~~~ 104 (105)
T cd00632 57 VKQEKEEARTELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQAQ 104 (105)
T ss_pred hhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 356777777777778888888889999999999999999999998886
No 139
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=54.54 E-value=72 Score=29.60 Aligned_cols=66 Identities=24% Similarity=0.229 Sum_probs=29.2
Q ss_pred HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 199 HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 199 ~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
.+..++..+++....+. ..+.++-.-|...-+-++.+.++-++++.+.++++++.+.++.++++++
T Consensus 106 ~~~~~l~~~~~~~~~~~-~~~~~~~~~l~~l~~~~~~~~~e~~~l~~~~~~~~~~~~~~~~~~~~~~ 171 (191)
T PF04156_consen 106 ELESELEKLKEDLQELR-ELLKSVEERLDSLDESIKELEKEIRELQKELQDSREEVQELRSQLERLQ 171 (191)
T ss_pred HHHHHHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444432 2223334444444444555555555555444444444444444444443
No 140
>PRK02119 hypothetical protein; Provisional
Probab=54.12 E-value=34 Score=28.84 Aligned_cols=33 Identities=9% Similarity=0.115 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|-+.-.+.+.++++|+++.+.|.+||.+++
T Consensus 25 ie~LN~~v~~Qq~~id~L~~ql~~L~~rl~~~~ 57 (73)
T PRK02119 25 LEELNQALIEQQFVIDKMQVQLRYMANKLKDMQ 57 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 455555555555566666666666655555544
No 141
>PF14723 SSFA2_C: Sperm-specific antigen 2 C-terminus
Probab=53.76 E-value=26 Score=34.72 Aligned_cols=47 Identities=21% Similarity=0.271 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
.|++.-.-+-+.|-++|+ .|.|+|+.=..+++++++||+.+|...-.
T Consensus 127 ~l~~QQalvy~hMSeeER---~EaeQLQsLR~avRqElqELE~QL~DRl~ 173 (179)
T PF14723_consen 127 HLMRQQALVYRHMSEEER---EEAEQLQSLRSAVRQELQELEFQLEDRLL 173 (179)
T ss_pred HHHHhHHHHHhcCCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334443444455555554 57788888888999999999999876433
No 142
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=53.63 E-value=37 Score=37.80 Aligned_cols=21 Identities=24% Similarity=0.306 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAR 252 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRR 252 (561)
-++.|.++++++.+|.++|++
T Consensus 74 ~~~~l~~~N~~l~~eN~~L~~ 94 (472)
T TIGR03752 74 RLAKLISENEALKAENERLQK 94 (472)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 355555555555555555544
No 143
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=53.37 E-value=36 Score=35.23 Aligned_cols=44 Identities=23% Similarity=0.266 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.-+.|..|..++..++..+++-+.|.+..++||+.||.=-++
T Consensus 179 ~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq~vRPA 222 (267)
T PF10234_consen 179 QQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQSVRPA 222 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChH
Confidence 34455677888888888888999999999999999888865555
No 144
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=53.32 E-value=43 Score=29.45 Aligned_cols=38 Identities=26% Similarity=0.531 Sum_probs=28.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.|+++.+.++..++.|+.+...++++++.++..+.+
T Consensus 89 ~~~l~~r~~~l~~~~~~l~~~l~~l~~~~~~~~~~l~~ 126 (129)
T cd00584 89 IEFLDKKIEELTKQIEKLQKELAKLKDQINTLEAELQE 126 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35566777777888888888888888888888877654
No 145
>PF02344 Myc-LZ: Myc leucine zipper domain; InterPro: IPR003327 This family consists of the leucine zipper dimerisation domain found in both cellular c-Myc proto-oncogenes and viral v-Myc oncogenes. Dimerisation via the leucine zipper motif with other basic helix-loop-helix-leucine zipper (b/HLH/lz) proteins is required for efficient DNA binding []. The Myc-Max dimer is a transactivating complex activating expression of growth related genes promoting cell proliferation. The dimerisation is facilitated via interdigitating leucine residues every 7th position of the alpha helix. Like charge repulsion of adjacent residues in this region preturbs the formation of homodimers with heterodimers being promoted by opposing charge attractions. It has been demonstrated that in transgenic mice the balance between oncogene-induced proliferation and apoptosis in a given tissue can be a critical determinant in the initiation and maintenance of the tumor [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1NKP_D 1A93_A 2A93_A.
Probab=53.31 E-value=30 Score=26.10 Aligned_cols=25 Identities=28% Similarity=0.389 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 246 EMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqELsQ 270 (561)
+-.+|-.|++.|+++-++||+.|.|
T Consensus 2 dEqkL~sekeqLrrr~eqLK~kLeq 26 (32)
T PF02344_consen 2 DEQKLISEKEQLRRRREQLKHKLEQ 26 (32)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4467889999999999999999887
No 146
>PRK04406 hypothetical protein; Provisional
Probab=53.22 E-value=35 Score=29.02 Aligned_cols=32 Identities=6% Similarity=0.036 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
|..|-+.-.+++.++++|+++.+.|.+||.++
T Consensus 27 Ie~LN~~v~~Qq~~I~~L~~ql~~L~~rl~~~ 58 (75)
T PRK04406 27 IEELNDALSQQQLLITKMQDQMKYVVGKVKNM 58 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44444444444444455544444444444443
No 147
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=53.12 E-value=28 Score=29.63 Aligned_cols=32 Identities=22% Similarity=0.317 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
++.|+++.++++.+++.++.+...++++++-|
T Consensus 72 ~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~L 103 (104)
T PF13600_consen 72 LKELEEELEALEDELAALQDEIQALEAQIAFL 103 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 55566666666666666666666666655544
No 148
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=53.04 E-value=47 Score=31.22 Aligned_cols=22 Identities=32% Similarity=0.410 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q psy11481 249 RLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 249 RLRREneaLQQRLeELKqELsQ 270 (561)
+|+|+.++|.++|+++++++..
T Consensus 70 Kl~Rk~~kl~~el~~~~~~~~~ 91 (161)
T PF04420_consen 70 KLNRKLDKLEEELEKLNKSLSS 91 (161)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3555555555555555555544
No 149
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=52.97 E-value=36 Score=33.41 Aligned_cols=10 Identities=30% Similarity=0.501 Sum_probs=4.4
Q ss_pred CceeEeecCc
Q psy11481 314 GGLVIVTNGS 323 (561)
Q Consensus 314 g~l~~vtn~~ 323 (561)
|..+++-.|+
T Consensus 165 g~~i~~~~~~ 174 (251)
T PF11932_consen 165 GRTIEVYQGT 174 (251)
T ss_pred CCceeEEEEE
Confidence 3444444443
No 150
>PTZ00454 26S protease regulatory subunit 6B-like protein; Provisional
Probab=52.95 E-value=36 Score=36.19 Aligned_cols=39 Identities=15% Similarity=0.188 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+.|+++.+.++.+.+.|+.+...+++++..++.++.+
T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 61 (398)
T PTZ00454 23 KLKELEKELEFLDIQEEYIKEEQKNLKRELIRAKEEVKR 61 (398)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555666556666666666666554
No 151
>PF06637 PV-1: PV-1 protein (PLVAP); InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=52.95 E-value=38 Score=37.23 Aligned_cols=57 Identities=18% Similarity=0.024 Sum_probs=39.0
Q ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy11481 223 LSILHSAIRYIQFLRRRE----REFEHEMERLAREKIHAQQRLALLKKELSARWEHIDFNT 279 (561)
Q Consensus 223 AsILrKAiDYIQkLQreE----reLE~EkERLRREneaLQQRLeELKqELsQqw~hIDfSt 279 (561)
-...+.|..--.+|+.+- +-..+|+..|++|++.|.++|++.++++.|.-+.+|+++
T Consensus 323 ek~~KEAqareaklqaec~rQ~qlaLEEKaaLrkerd~L~keLeekkreleql~~q~~v~~ 383 (442)
T PF06637_consen 323 EKAGKEAQAREAKLQAECARQTQLALEEKAALRKERDSLAKELEEKKRELEQLKMQLAVKT 383 (442)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 333444443333444432 224477889999999999999999999998777777554
No 152
>PF09738 DUF2051: Double stranded RNA binding protein (DUF2051); InterPro: IPR019139 This entry represents transcriptional repressors which preferentially bind to the GC-rich consensus sequence (5'-AGCCCCCGGCG-3') and may regulate expression of TNF, EGFR and PDGFA. They may control smooth muscle cell proliferation following artery injury through PDGFA repression and may also bind double-stranded RNA. They interact with the leucine-rich repeat domain of human flightless-I (FliI) protein.
Probab=52.95 E-value=31 Score=36.05 Aligned_cols=55 Identities=25% Similarity=0.389 Sum_probs=35.6
Q ss_pred CcCCcc---hHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 216 EEKKSS---NLSILHSAIRY----IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 216 Dn~KaS---KAsILrKAiDY----IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|+.|.+ .+-.|+.-++- +-.|+++.++...+.+++++....|+.++++|+.+|.+
T Consensus 104 DNek~~l~yqvd~Lkd~lee~eE~~~~~~re~~eK~~elEr~K~~~d~L~~e~~~Lre~L~~ 165 (302)
T PF09738_consen 104 DNEKSALMYQVDLLKDKLEELEETLAQLQREYREKIRELERQKRAHDSLREELDELREQLKQ 165 (302)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555643 34555555442 44455555555667778888888888888888888765
No 153
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=52.72 E-value=39 Score=27.39 Aligned_cols=43 Identities=21% Similarity=0.217 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 227 HSAIRYIQFLRRREREFE-HEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE-~EkERLRREneaLQQRLeELKqELs 269 (561)
..|-+.|+.|+-+-+.+- .+...+..+....+..|+.|+++|.
T Consensus 35 ~ea~~~l~qMe~E~~~~p~s~r~~~~~kl~~yr~~l~~lk~~l~ 78 (79)
T PF05008_consen 35 DEAEELLKQMELEVRSLPPSERNQYKSKLRSYRSELKKLKKELK 78 (79)
T ss_dssp HHHHHHHHHHHHHHCTS-HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 344455555555554443 4455555666666666666666553
No 154
>TIGR01242 26Sp45 26S proteasome subunit P45 family. Many proteins may score above the trusted cutoff because an internal
Probab=52.43 E-value=28 Score=35.55 Aligned_cols=33 Identities=30% Similarity=0.400 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+++|+++.++++.|.+.+++|...+++++++++
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 40 (364)
T TIGR01242 8 IRKLEDEKRSLEKEKIRLERELERLRSEIERLR 40 (364)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444555555555555555555555544444443
No 155
>KOG1760|consensus
Probab=52.23 E-value=38 Score=32.14 Aligned_cols=41 Identities=29% Similarity=0.377 Sum_probs=35.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
-..|++....++++++.|+.+.+....|+++||..|=+.+.
T Consensus 83 ~~~LEe~ke~l~k~i~~les~~e~I~~~m~~LK~~LYaKFg 123 (131)
T KOG1760|consen 83 QDQLEEKKETLEKEIEELESELESISARMDELKKVLYAKFG 123 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 45688888899999999999999999999999999865443
No 156
>PHA02557 22 prohead core protein; Provisional
Probab=52.18 E-value=58 Score=34.13 Aligned_cols=55 Identities=24% Similarity=0.333 Sum_probs=44.2
Q ss_pred HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 199 HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 199 ~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
-||+.|.+..=.||. ..+|.|..|+++.++++.+.+.|..++.+|+.++..++++
T Consensus 123 GLK~lF~Ehnv~vpe--------------e~vdvV~em~~~L~E~e~~~~~l~~en~~l~e~i~~~~r~ 177 (271)
T PHA02557 123 GLKELFVEHNVVVPE--------------EKVDVVAEMEEELDEMEEELNELFEENVALEEYINEVKRE 177 (271)
T ss_pred HHHHHHHHhCcCCcH--------------HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555565 2567889999999999999999999999999999998887
No 157
>PRK11281 hypothetical protein; Provisional
Probab=51.93 E-value=54 Score=39.72 Aligned_cols=70 Identities=17% Similarity=0.262 Sum_probs=46.2
Q ss_pred HHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 197 RAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRERE---FEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEre---LE~EkERLRREneaLQQRLeELKqEL 268 (561)
++.|+..+++|++. ...+..+..-..-|+.|.+++++.++.+++ +++.+++..++.++.+++|++++++.
T Consensus 38 ~~~iq~~l~~~~~~--~~~~~~~k~~~~~l~~tL~~L~qi~~~~~~~~~L~k~l~~Ap~~l~~a~~~Le~Lk~~~ 110 (1113)
T PRK11281 38 EADVQAQLDALNKQ--KLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDN 110 (1113)
T ss_pred HHHHHHHHHHhhcC--CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhhccc
Confidence 44667777777774 322322333478899999987666655544 66677777777777888888777643
No 158
>PRK00736 hypothetical protein; Provisional
Probab=51.84 E-value=42 Score=27.92 Aligned_cols=32 Identities=19% Similarity=0.154 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
|..|-+.-.+++.++++|+++...|..|+.++
T Consensus 21 ie~Ln~~v~~Qq~~i~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00736 21 IEELSDQLAEQWKTVEQMRKKLDALTERFLSL 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44444444444445555555555554444443
No 159
>PRK02793 phi X174 lysis protein; Provisional
Probab=51.49 E-value=40 Score=28.29 Aligned_cols=33 Identities=21% Similarity=0.171 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|-+.--+++.++++|+++...|..||.+++
T Consensus 24 Ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~ 56 (72)
T PRK02793 24 IEELNVTVTAHEMEMAKLRDHLRLLTEKLKASQ 56 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 444444445555555555555555555554443
No 160
>PTZ00454 26S protease regulatory subunit 6B-like protein; Provisional
Probab=51.35 E-value=39 Score=35.97 Aligned_cols=38 Identities=24% Similarity=0.367 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
...++.|+.+++.++.+.+++++|.+.++.+++.|+.-
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (398)
T PTZ00454 28 EKELEFLDIQEEYIKEEQKNLKRELIRAKEEVKRIQSV 65 (398)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC
Confidence 35677888888888888888888888888888888643
No 161
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=50.91 E-value=47 Score=37.56 Aligned_cols=9 Identities=44% Similarity=0.674 Sum_probs=5.0
Q ss_pred cceEeeCCC
Q psy11481 63 NHITFTNGL 71 (561)
Q Consensus 63 ~~~~~~~~~ 71 (561)
-|.|||.++
T Consensus 24 C~Ytlt~~~ 32 (546)
T PF07888_consen 24 CHYTLTPGF 32 (546)
T ss_pred EEEecCCCC
Confidence 456666554
No 162
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=50.67 E-value=29 Score=32.55 Aligned_cols=36 Identities=25% Similarity=0.373 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHhh
Q psy11481 235 FLRRREREFEHEMERLAREKI------------HAQQRLALLKKELSA 270 (561)
Q Consensus 235 kLQreEreLE~EkERLRREne------------aLQQRLeELKqELsQ 270 (561)
..-+++++++.|+.+|++|.. .|+++++.++.||.+
T Consensus 37 ~~~~~~~~l~~Ei~~l~~E~~~iS~qDeFAkwaKl~Rk~~kl~~el~~ 84 (161)
T PF04420_consen 37 KSSKEQRQLRKEILQLKRELNAISAQDEFAKWAKLNRKLDKLEEELEK 84 (161)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTS-TTTSHHHHHHHHHHHHHHHHHHHH
T ss_pred cccHHHHHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555543 356666666666655
No 163
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=50.57 E-value=52 Score=27.63 Aligned_cols=38 Identities=24% Similarity=0.265 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.|.+++......+.....++..|+.+++.|+++|..
T Consensus 28 ~k~L~~ERd~~~~~l~~a~~e~~~Lk~E~e~L~~el~~ 65 (69)
T PF14197_consen 28 NKRLRRERDSAERQLGDAYEENNKLKEENEALRKELEE 65 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45556666566666666666667777777777766543
No 164
>KOG2391|consensus
Probab=50.51 E-value=53 Score=35.58 Aligned_cols=31 Identities=13% Similarity=0.133 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeE 263 (561)
+++|+.++.++|.++..|++..+-|+.+.+|
T Consensus 248 ~~kL~~~~etLEqq~~~L~~niDIL~~k~~e 278 (365)
T KOG2391|consen 248 KQKLVAMKETLEQQLQSLQKNIDILKSKVRE 278 (365)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 3444444444444444444444444444444
No 165
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=50.46 E-value=44 Score=35.32 Aligned_cols=12 Identities=33% Similarity=0.282 Sum_probs=9.7
Q ss_pred HHHHHHhccCCC
Q psy11481 202 ECFEILKRQVPP 213 (561)
Q Consensus 202 dcFeeLRdlVPs 213 (561)
+-|+.|.+..+.
T Consensus 218 ~Yf~~l~~~f~d 229 (406)
T PF02388_consen 218 EYFENLYDAFGD 229 (406)
T ss_dssp HHHHHHHHHCCC
T ss_pred HHHHHHHHhcCC
Confidence 578888888865
No 166
>PF04201 TPD52: Tumour protein D52 family; InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family []. These proteins have an N-terminal coiled-coil that allows members to form homo- and heterodimers with each other [].
Probab=50.34 E-value=38 Score=33.01 Aligned_cols=43 Identities=26% Similarity=0.442 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-------HHHHH-----HHHHHHHHhhhhcccc
Q psy11481 234 QFLRRREREFEHEMERLAREKI-------HAQQR-----LALLKKELSARWEHID 276 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREne-------aLQQR-----LeELKqELsQqw~hID 276 (561)
..|+++..+.|+||..|++-.. +|+++ +.+|++.+.++|..+.
T Consensus 32 eeLr~EL~KvEeEI~TLrqvL~aKer~~~eLKrkLGit~l~elkqnlskg~~~vq 86 (162)
T PF04201_consen 32 EELRSELAKVEEEIQTLRQVLAAKERHCAELKRKLGITPLSELKQNLSKGWHDVQ 86 (162)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHCCchHHHHHHHHHHHhHhhh
Confidence 4455555566666665554433 44555 3566777778887444
No 167
>KOG4797|consensus
Probab=50.32 E-value=75 Score=29.85 Aligned_cols=41 Identities=29% Similarity=0.384 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhh
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQ-----QRLALLKKELSA 270 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQ-----QRLeELKqELsQ 270 (561)
.|-+.-|+++.+++++...+|++|+.-|+ .+|++|..++.+
T Consensus 66 REEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~spe~L~ql~~~~~~ 111 (123)
T KOG4797|consen 66 REEVEVLKEQIRELEERNSALERENSLLKTLASPEQLAQLPAQLSP 111 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHhccc
Confidence 34456667777777777777777777776 466666666554
No 168
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=50.25 E-value=54 Score=33.95 Aligned_cols=7 Identities=43% Similarity=1.170 Sum_probs=1.2
Q ss_pred cccCccC
Q psy11481 392 HIQLPYR 398 (561)
Q Consensus 392 ~~~~~~~ 398 (561)
.+.|||+
T Consensus 263 ~~~lPy~ 269 (314)
T PF04111_consen 263 SFELPYK 269 (314)
T ss_dssp ----SS-
T ss_pred cccccee
Confidence 4555554
No 169
>KOG3088|consensus
Probab=50.20 E-value=40 Score=35.81 Aligned_cols=45 Identities=20% Similarity=0.159 Sum_probs=30.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCC
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHIDFNTLI 281 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hIDfStLl 281 (561)
-+.|.+++++|++.-|+|+|+-++|++|-+++++ + .+++.-++-+
T Consensus 59 a~~~~~kq~eL~~rqeEL~Rke~ELdRREr~~a~-~---g~~~~~nNWP 103 (313)
T KOG3088|consen 59 AKDLAKKQAELLKKQEELRRKEQELDRRERALAR-A---GIVIRENNWP 103 (313)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhh-c---cCcccccCCC
Confidence 3456667777777777888888888888777776 1 3444444444
No 170
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=50.02 E-value=5.4 Score=35.41 Aligned_cols=31 Identities=23% Similarity=0.281 Sum_probs=0.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+...++.++++|+.|...|++++++|+.+..
T Consensus 14 ~~~~LE~~l~~l~~el~~L~~~l~eLe~~~~ 44 (118)
T PF08286_consen 14 ELSDLESELESLQSELEELKEELEELEEQEV 44 (118)
T ss_dssp -----------------------------HT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3344445555555555555555555555543
No 171
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=49.94 E-value=51 Score=35.12 Aligned_cols=29 Identities=24% Similarity=0.332 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++...+.+|.++..+|+.++.+|+.++.+
T Consensus 379 ~l~~~~~~l~~~~~~l~~~~~~l~~~l~~ 407 (451)
T PF03961_consen 379 KLKEKKKELKEELKELKEELKELKEELER 407 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34444455555555556666666666554
No 172
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=49.86 E-value=1e+02 Score=31.98 Aligned_cols=18 Identities=22% Similarity=0.209 Sum_probs=9.2
Q ss_pred cccCCCCCccccccccCccCC
Q psy11481 277 FNTLIPDNMEVDIPYDNHHHE 297 (561)
Q Consensus 277 fStLlPd~lEIDRvlRQa~~D 297 (561)
--.|.||+ |-.++-+.+|
T Consensus 238 rpKCTPDm---dV~LnIP~ee 255 (258)
T PF15397_consen 238 RPKCTPDM---DVILNIPTEE 255 (258)
T ss_pred CCCCCCCc---hhhcCCcccc
Confidence 35667776 4444444333
No 173
>PF06632 XRCC4: DNA double-strand break repair and V(D)J recombination protein XRCC4; InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants. XRCC4 binds to DNA, and to DNA ligase IV (LIG4) to form the LIG4-XRCC4 complex []. The LIG4-XRCC4 complex is responsible for the ligation step in the non-homologous end joining (NHEJ) pathway of DNA double-strand break repair. XRCC4 enhances the joining activity of LIG4. It is thought that XRCC4 and LIG4 are essential for alignment-based gap filling, as well as for final ligation of the breaks []. Binding of the LIG4-XRCC4 complex to DNA ends is dependent on the assembly of the DNA-dependent protein kinase complex DNA-PK to these DNA ends. ; GO: 0003677 DNA binding, 0006302 double-strand break repair, 0006310 DNA recombination, 0005634 nucleus; PDB: 3RWR_R 1IK9_B 3SR2_E 3Q4F_H 3II6_B 1FU1_A 3MUD_B.
Probab=49.79 E-value=58 Score=34.61 Aligned_cols=14 Identities=36% Similarity=0.373 Sum_probs=0.0
Q ss_pred cccCCcceeEEecc
Q psy11481 444 VSTNGMSTATIIKS 457 (561)
Q Consensus 444 ~~~~~~~~~~~~~~ 457 (561)
.++-|||.-|...+
T Consensus 322 ~~~~~~~~~~~~~~ 335 (342)
T PF06632_consen 322 FSAENMSLETLKNS 335 (342)
T ss_dssp --------------
T ss_pred ccCCCCcHHHhhcC
Confidence 66677777666543
No 174
>PRK04325 hypothetical protein; Provisional
Probab=49.78 E-value=44 Score=28.18 Aligned_cols=33 Identities=12% Similarity=0.025 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|-+.-.+++.++++|+++...|..||.+++
T Consensus 25 Ie~LN~vv~~Qq~~I~~L~~ql~~L~~rl~~~~ 57 (74)
T PRK04325 25 IDGLNATVARQQQTLDLLQAQLRLLYQQMRDAN 57 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 444445555555555555555555555555543
No 175
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=49.61 E-value=9.9 Score=32.97 Aligned_cols=35 Identities=20% Similarity=0.257 Sum_probs=16.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+||..|..+-..+..++..|+.++..|+.+|.+++
T Consensus 25 ~fl~~l~~~~~~l~~e~~~L~~~~~~l~~~l~~~~ 59 (131)
T PF05103_consen 25 DFLDELAEELERLQRENAELKEEIEELQAQLEELR 59 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHCCCCT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Confidence 45555555555555555555555555555555444
No 176
>PRK14127 cell division protein GpsB; Provisional
Probab=49.57 E-value=39 Score=30.82 Aligned_cols=25 Identities=28% Similarity=0.292 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 246 EMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+.|.+|+..|+.+++.|+.++.+
T Consensus 38 dye~l~~e~~~Lk~e~~~l~~~l~e 62 (109)
T PRK14127 38 DYEAFQKEIEELQQENARLKAQVDE 62 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444445555555555555555443
No 177
>KOG2991|consensus
Probab=49.47 E-value=31 Score=36.36 Aligned_cols=62 Identities=19% Similarity=0.272 Sum_probs=36.2
Q ss_pred hhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 188 VHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHA 257 (561)
Q Consensus 188 THNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaL 257 (561)
.--+|-+..-++||..|++|-+.+-.|.+...-.... |-.|+++.++-+.++++|++.++.+
T Consensus 243 ~eLAmQKs~seElkssq~eL~dfm~eLdedVEgmqsT--------iliLQq~Lketr~~Iq~l~k~~~q~ 304 (330)
T KOG2991|consen 243 IELAMQKSQSEELKSSQEELYDFMEELDEDVEGMQST--------ILILQQKLKETRKEIQRLKKGLEQV 304 (330)
T ss_pred HHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHhcchhh--------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456667788888888888887766654332222222 2345555555555666665554443
No 178
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=49.17 E-value=97 Score=30.75 Aligned_cols=81 Identities=20% Similarity=0.307 Sum_probs=49.2
Q ss_pred hhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 186 REVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 186 RaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..+...=.+.|..+..+|..==..+-. .......-+..+.+=+..++.+-..++.+++.|+.++..|..+|.+++
T Consensus 175 R~~ye~~~~~~~~e~e~~y~~k~~~l~~----~~~~~~~~~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le 250 (312)
T PF00038_consen 175 RAQYEEIAQKNREELEEWYQSKLEELRQ----QSEKSSEELESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELE 250 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhhhhhhhhhhcccccccccc----cccccccccchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHH
Confidence 4555555555556666655532222111 011234556666677777777777777788888888888888887777
Q ss_pred HHHhh
Q psy11481 266 KELSA 270 (561)
Q Consensus 266 qELsQ 270 (561)
.++..
T Consensus 251 ~~~~~ 255 (312)
T PF00038_consen 251 QRLDE 255 (312)
T ss_dssp HHHHH
T ss_pred HHHHH
Confidence 66543
No 179
>PF01763 Herpes_UL6: Herpesvirus UL6 like; InterPro: IPR002660 This family consists of various proteins from the Herpesviridae that are similar to Human herpesvirus 1 (HHV-1) UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome [].; GO: 0006323 DNA packaging
Probab=48.87 E-value=37 Score=38.36 Aligned_cols=48 Identities=19% Similarity=0.223 Sum_probs=39.8
Q ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 223 LSILHSAIR--YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 223 AsILrKAiD--YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..=+++.+. -|+-|+++-.++-.+++.|+++++.+.+|+.+++.+|.+
T Consensus 353 ~~~~r~~v~nsI~kcLe~qIn~qf~tIe~Lk~~n~~~~~kl~~~e~~L~r 402 (557)
T PF01763_consen 353 QQAFRDSVSNSINKCLEGQINNQFDTIEDLKEENQDLEKKLRELESELSR 402 (557)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444432 488889999999999999999999999999999999987
No 180
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=48.68 E-value=44 Score=31.46 Aligned_cols=35 Identities=20% Similarity=0.248 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+.++++.+++++|++..+.|.++|+.|.+.|++|.
T Consensus 157 ~~~~~ei~~lk~el~~~~~~~~~LkkQ~~~l~~ey 191 (192)
T PF05529_consen 157 KKLSEEIEKLKKELEKKEKEIEALKKQSEGLQKEY 191 (192)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 45666666666667777777777777777776653
No 181
>PF07028 DUF1319: Protein of unknown function (DUF1319); InterPro: IPR010746 This entry is represented by Commelina yellow mottle virus, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains a number of viral proteins of unknown function approximately 200 residues long. Family members seem to be restricted to badnaviruses.
Probab=48.59 E-value=90 Score=29.56 Aligned_cols=45 Identities=18% Similarity=0.204 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+++-.+-|+.+..=.+.++.+++.|..++..|+.+|..|++++-+
T Consensus 41 ~~~~~~~lk~~~ki~~~Qr~~l~~l~~~l~~l~~eL~~Lr~~~l~ 85 (126)
T PF07028_consen 41 QKKLLEELKNLSKIQESQRSELKELKQELDVLSKELQALRKEYLE 85 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555667777767777788889999999999999999988765
No 182
>PF05659 RPW8: Arabidopsis broad-spectrum mildew resistance protein RPW8; InterPro: IPR008808 This entry represents the RPW8 domain found in several broad-spectrum mildew resistance proteins from Arabidopsis thaliana and other dicots. Plant disease resistance (R) genes control the recognition of specific pathogens and activate subsequent defence responses. The R protein-mediated defences typically involve a rapid, localized necrosis, or hypersensitive response (HR), at the site of infection, and the localised formation of antimicrobial chemicals and proteins that restrict growth of the pathogen. The A. thaliana locus Resistance to Powdery Mildew 8 (RPW8) contains two naturally polymorphic, dominant R genes: RPW8.1 and RPW8.2, which individually control resistance to a broad range of powdery mildew pathogens. They induce localised, salicylic acid-dependent defences similar to those induced by R genes that control specific resistance. Apparently, broad-spectrum resistance mediated by RPW8 uses the same mechanisms as specific resistance [, ]. RPW8.1 and RPW8.2 share similarity with an ~150 amino acid module forming the N terminus of a group of disease resistance proteins, which have a nucleotide-binding site (NBS) and leucine-rich repeats (LRRs) [, ]. The RPW8 domain sequences contain a predicted N-terminal transmembrane (TM) region or possibly a signal peptide, and a coiled-coil (CC) motif [].
Probab=48.43 E-value=47 Score=31.15 Aligned_cols=56 Identities=13% Similarity=0.249 Sum_probs=43.5
Q ss_pred hhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 189 HNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEH 245 (561)
Q Consensus 189 HNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~ 245 (561)
-.++++.||..++.-.+.|++..=-...+.|..++.++++.. |.+++++-|+.+..
T Consensus 57 ~~eld~~~~ee~e~L~~~L~~g~~LV~k~sk~~r~n~~kk~~-y~~Ki~~le~~l~~ 112 (147)
T PF05659_consen 57 NVELDRPRQEEIERLKELLEKGKELVEKCSKVRRWNLYKKPR-YARKIEELEESLRR 112 (147)
T ss_pred hhhcCCchhHHHHHHHHHHHHHHHHHHHhccccHHHHHhhHh-HHHHHHHHHHHHHH
Confidence 457778889999998888888777666777888888887755 88888877776653
No 183
>TIGR01242 26Sp45 26S proteasome subunit P45 family. Many proteins may score above the trusted cutoff because an internal
Probab=48.42 E-value=34 Score=34.90 Aligned_cols=39 Identities=33% Similarity=0.439 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSAR 271 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQq 271 (561)
|-.|+.+.++++.+++.++.|...++++++++++++.+-
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 39 (364)
T TIGR01242 1 ISELDVRIRKLEDEKRSLEKEKIRLERELERLRSEIERL 39 (364)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346778888999999999999999999999999999773
No 184
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=48.25 E-value=47 Score=25.98 Aligned_cols=25 Identities=20% Similarity=0.310 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQR 260 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQR 260 (561)
|+.+...+..|.+.|+.|+..|+.+
T Consensus 17 Lk~~~~~L~~E~~~L~aev~~L~~k 41 (45)
T PF02183_consen 17 LKAEYDSLKKENEKLRAEVQELKEK 41 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4444444444444444444444443
No 185
>COG5293 Predicted ATPase [General function prediction only]
Probab=48.19 E-value=53 Score=37.07 Aligned_cols=85 Identities=15% Similarity=0.186 Sum_probs=62.4
Q ss_pred hhhhhHHHHhHHHHHHHHHHHHhccCCCCC------CcCCcchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH--
Q psy11481 186 REVHNKLEKNRRAHLKECFEILKRQVPPAQ------EEKKSSNLSILHS--AIRYIQFLRRREREFEHEMERLAREKI-- 255 (561)
Q Consensus 186 RaTHNaLERKRRd~LKdcFeeLRdlVPsLs------Dn~KaSKAsILrK--AiDYIQkLQreEreLE~EkERLRREne-- 255 (561)
-+.|+.+=-.||+.+++.+.+++.-+-.+. .+.+++=+.+|+. +++-++.|+++-...+.|+..++.+++
T Consensus 326 ~~F~r~~~e~R~~yl~~ei~~i~~dLk~~n~~~~~l~~~rae~l~~Lk~~g~~e~y~~l~ee~~~~~~elae~~~rie~l 405 (591)
T COG5293 326 IAFNRAITEERHDYLQEEIAEIEGDLKEVNAELDDLGKRRAEGLAFLKNRGVFEKYQTLCEEIIALRGELAELEYRIEPL 405 (591)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHhhhHHHHHHhhhHH
Confidence 567899999999999999998887766542 1244666778865 777788888887777766666655444
Q ss_pred ----HHHHHHHHHHHHHhh
Q psy11481 256 ----HAQQRLALLKKELSA 270 (561)
Q Consensus 256 ----aLQQRLeELKqELsQ 270 (561)
++.+.|..||+++-+
T Consensus 406 ~k~~~~~~~i~~lkhe~l~ 424 (591)
T COG5293 406 RKLHALDQYIGTLKHECLD 424 (591)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 555778888888755
No 186
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=47.95 E-value=89 Score=28.24 Aligned_cols=41 Identities=15% Similarity=0.199 Sum_probs=32.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-+-|..|-++-.++..++..|-.|+.+|+-+.+.|+..|.+
T Consensus 14 e~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~ 54 (107)
T PF06156_consen 14 EQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEE 54 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34467777777777788888888888888888888888876
No 187
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=47.80 E-value=52 Score=28.21 Aligned_cols=32 Identities=25% Similarity=0.354 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeE 263 (561)
.|..|.++.+++..+++.|+.+...+.+++..
T Consensus 30 ~i~~ld~~~r~l~~~~e~lr~~rN~~sk~I~~ 61 (108)
T PF02403_consen 30 EIIELDQERRELQQELEELRAERNELSKEIGK 61 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 34444444444444444444444444333333
No 188
>PF08537 NBP1: Fungal Nap binding protein NBP1; InterPro: IPR013743 NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae (Bakers yeast) to be essential for the G2/M transition of the cell cycle.
Probab=47.72 E-value=2.5e+02 Score=30.29 Aligned_cols=65 Identities=26% Similarity=0.342 Sum_probs=41.4
Q ss_pred HHHHHhccCCCCCC------cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 203 CFEILKRQVPPAQE------EKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 203 cFeeLRdlVPsLsD------n~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+++|+.-.+.... +.-..++.+|++ =|++|+++..+++.|++-+++.....+++.+-|+.-|..
T Consensus 151 mL~qLr~g~~~~~~~~~~~~~~~~D~v~LLqk---k~~~l~~~l~~~~~eL~~~~k~L~faqekn~LlqslLdd 221 (323)
T PF08537_consen 151 MLEQLRRGRSKNRHNRPRNPSSNSDRVILLQK---KIDELEERLNDLEKELEITKKDLKFAQEKNALLQSLLDD 221 (323)
T ss_pred HHHHHhcCCCCCCcccccCCCcchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 67888888776432 112355566655 355666666667777777777766666776666666654
No 189
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=47.69 E-value=1.1e+02 Score=27.16 Aligned_cols=40 Identities=10% Similarity=0.228 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
..-++-++.-+-++++.+.++|++|++.|+.+.+.-+.+.
T Consensus 14 ~~~i~~y~~~k~~ka~~~~~kL~~en~qlk~Ek~~~~~qv 53 (87)
T PF10883_consen 14 VALILAYLWWKVKKAKKQNAKLQKENEQLKTEKAVAETQV 53 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445667777778888888888888888888887777775
No 190
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=47.53 E-value=1e+02 Score=25.82 Aligned_cols=43 Identities=7% Similarity=0.156 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|.+..+-++.+.+.-++++++.+.+.+|...+-++..++....
T Consensus 21 l~~l~~~l~~~~~ti~~l~~~~~~i~~e~~~ll~~~n~l~~dv 63 (90)
T PF06103_consen 21 LKKLKKTLDEVNKTIDTLQEQVDPITKEINDLLHNTNELLEDV 63 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444444444444444444433
No 191
>KOG0837|consensus
Probab=47.15 E-value=33 Score=35.94 Aligned_cols=41 Identities=20% Similarity=0.222 Sum_probs=31.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.|.|..|+++-+.+..+..+|..+.-.|+..++++++....
T Consensus 226 LdrisrLEdkv~~lk~~n~~L~~~l~~l~~~v~e~k~~V~~ 266 (279)
T KOG0837|consen 226 LDRISRLEDKVKTLKIYNRDLASELSKLKEQVAELKQKVME 266 (279)
T ss_pred HHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 56788888887777777777777777777788888777654
No 192
>KOG4010|consensus
Probab=47.02 E-value=39 Score=34.04 Aligned_cols=45 Identities=22% Similarity=0.328 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHhhhhccccccc
Q psy11481 235 FLRRREREFEHEMERLAREKIHAQQRL------------ALLKKELSARWEHIDFNT 279 (561)
Q Consensus 235 kLQreEreLE~EkERLRREneaLQQRL------------eELKqELsQqw~hIDfSt 279 (561)
.|+.+..+.|+||..|++=.-+-++.+ .+|++.+++.|..++.++
T Consensus 48 elr~EL~kvEeEI~TLrqVLaAKerH~~ELKRKLGlt~~~EL~qnisksw~d~q~st 104 (208)
T KOG4010|consen 48 ELRTELAKVEEEIVTLRQVLAAKERHAAELKRKLGLTVLKELKQNISKSWKDVQAST 104 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcchHHHHHHHHHHHHhhhhhHH
Confidence 555556666666666644333322333 345555667777555443
No 193
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=46.70 E-value=73 Score=31.35 Aligned_cols=26 Identities=23% Similarity=0.304 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHA 257 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaL 257 (561)
|+++|++....++.++++|+++++.+
T Consensus 71 ~~~~l~~~v~~q~~el~~L~~qi~~~ 96 (251)
T PF11932_consen 71 YNEQLERQVASQEQELASLEQQIEQI 96 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444443333
No 194
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=46.50 E-value=56 Score=30.42 Aligned_cols=37 Identities=19% Similarity=0.387 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
++.|.++..+.+.++..|++.+..|..+|+.++.+|.
T Consensus 23 ~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~ 59 (143)
T PF12718_consen 23 VKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLK 59 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5555555555555555555555555555555555543
No 195
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=46.44 E-value=65 Score=37.17 Aligned_cols=84 Identities=14% Similarity=0.149 Sum_probs=54.3
Q ss_pred hhhhhHHHHhHHHHHH---HHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 186 REVHNKLEKNRRAHLK---ECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 186 RaTHNaLERKRRd~LK---dcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
+..+-+.+.=|...|. ...+++...+..+.. .+.--...|.+..+.++.|++...++.+.+++....++.|.+|++
T Consensus 539 ~lL~~a~~vlreeYi~~~~~ar~ei~~rv~~Lk~-~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~ 617 (717)
T PF10168_consen 539 ELLSQATKVLREEYIEKQDLAREEIQRRVKLLKQ-QKEQQLKELQELQEERKSLRESAEKLAERYEEAKDKQEKLMKRVD 617 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444443322 244555555555422 233346677777778888888888888888888888888888888
Q ss_pred HHHHHHhh
Q psy11481 263 LLKKELSA 270 (561)
Q Consensus 263 ELKqELsQ 270 (561)
++.+.+..
T Consensus 618 ~vl~~l~~ 625 (717)
T PF10168_consen 618 RVLQLLNS 625 (717)
T ss_pred HHHHHHhc
Confidence 88777644
No 196
>KOG1510|consensus
Probab=46.18 E-value=93 Score=29.95 Aligned_cols=60 Identities=18% Similarity=0.264 Sum_probs=45.7
Q ss_pred HHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 203 CFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 203 cFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-|+.|=+.+|.... +-..=|.| |++|+.++++...|++.+-.+.+.|..++..+-....+
T Consensus 71 qId~LIdsLP~~~~----~~e~Ql~~----i~kLq~en~e~~~el~~~v~~~e~Ll~~vq~~le~~a~ 130 (139)
T KOG1510|consen 71 QIDTLIDSLPGEEG----SAEAQLEK----IKKLQEENEEVALELEELVSKGEKLLEQVQSLLEDIAD 130 (139)
T ss_pred HHHHHHHhCCCccc----CHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47888888998432 22223333 99999999999999999999999998888877776655
No 197
>PF15058 Speriolin_N: Speriolin N terminus
Probab=45.99 E-value=26 Score=35.24 Aligned_cols=24 Identities=29% Similarity=0.587 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 244 EHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 244 E~EkERLRREneaLQQRLeELKqE 267 (561)
.|+||||-+||++||+++.-++.+
T Consensus 11 rhqierLv~ENeeLKKlVrLirEN 34 (200)
T PF15058_consen 11 RHQIERLVRENEELKKLVRLIREN 34 (200)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHH
Confidence 455666666666666655544433
No 198
>PF05781 MRVI1: MRVI1 protein; InterPro: IPR008677 This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene []. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP []. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins [].
Probab=45.93 E-value=2.1e+02 Score=32.62 Aligned_cols=91 Identities=19% Similarity=0.294 Sum_probs=58.1
Q ss_pred hhhHHHHhHHHHHHHHHHHHhccCCCCCC-cCCcchHHHHHHHHHHHHHHHHH-------------HHH-------HHHH
Q psy11481 188 VHNKLEKNRRAHLKECFEILKRQVPPAQE-EKKSSNLSILHSAIRYIQFLRRR-------------ERE-------FEHE 246 (561)
Q Consensus 188 THNaLERKRRd~LKdcFeeLRdlVPsLsD-n~KaSKAsILrKAiDYIQkLQre-------------Ere-------LE~E 246 (561)
.-|..|.+=+.+|.+|...|..+.|.|.+ +.-.-.+.=|.+.++.|..+-.+ |.+ +..-
T Consensus 221 ~RdlaEeNl~kEi~~~~~~l~~l~~lc~~d~e~~e~~~kl~~~l~~l~~~~~rvss~AE~lGAv~QE~R~SkAvevM~qh 300 (538)
T PF05781_consen 221 SRDLAEENLKKEIENCLKLLESLAPLCWEDNESREIIQKLQKSLDVLHQCATRVSSRAEMLGAVHQESRVSKAVEVMIQH 300 (538)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHH
Confidence 35788888889999999999999999763 33233333344444443333221 111 2345
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy11481 247 MERLAREKIHAQQRLALLKKELSARWEHIDFNT 279 (561)
Q Consensus 247 kERLRREneaLQQRLeELKqELsQqw~hIDfSt 279 (561)
+|.|+|..+..-+.|+++|+-|.|. ..+.|..
T Consensus 301 venLkr~~~kehaeL~E~k~~l~qn-~~r~~~~ 332 (538)
T PF05781_consen 301 VENLKRMYEKEHAELEELKKLLLQN-SSRSYGP 332 (538)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc-cccccCc
Confidence 6778888888888889999887773 3444444
No 199
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=45.66 E-value=28 Score=27.39 Aligned_cols=22 Identities=36% Similarity=0.596 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHA 257 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaL 257 (561)
++++.++++++++++++|+++|
T Consensus 46 ~r~~~~~~~k~l~~le~e~~~l 67 (68)
T PF06305_consen 46 LRRRIRRLRKELKKLEKELEQL 67 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 4444455555555555554443
No 200
>PF14282 FlxA: FlxA-like protein
Probab=45.66 E-value=67 Score=28.48 Aligned_cols=25 Identities=16% Similarity=0.255 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 246 EMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++.|+.++..|+.+|.+|+.+..+
T Consensus 52 q~q~Lq~QI~~LqaQI~qlq~q~~~ 76 (106)
T PF14282_consen 52 QIQLLQAQIQQLQAQIAQLQSQQAE 76 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555666666666666655543
No 201
>KOG4001|consensus
Probab=45.48 E-value=82 Score=32.46 Aligned_cols=75 Identities=25% Similarity=0.390 Sum_probs=40.0
Q ss_pred HHHHHHHHHHHHhccCCC-CC----------CcCCcch---HHHHHHHHHH--HHHHH--HHHHHHHHHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPP-AQ----------EEKKSSN---LSILHSAIRY--IQFLR--RREREFEHEMERLAREKIHA 257 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPs-Ls----------Dn~KaSK---AsILrKAiDY--IQkLQ--reEreLE~EkERLRREneaL 257 (561)
||+.-..||++|-..|.. +. |..+++- -.++..++.| =+-|+ ++...+..++..|+-++++|
T Consensus 125 rreLYsQcFDElIRqvs~scveRGlll~rvRDEIrMt~aAYqtlyeSsvAfGmRKALqae~ek~~~~~~~k~le~~k~~L 204 (259)
T KOG4001|consen 125 RRELYSQCFDELIRQVSVSCVERGLLLVRVRDEIRMTFAAYQTLYESSVAFGMRKALQAENEKTRATTEWKVLEDKKKEL 204 (259)
T ss_pred cHHHHHHHHHHHHHHcchhHHhcceeEEEehHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHhhhHHHH
Confidence 788888899999777543 21 2222222 1233334333 12222 22334455556666666666
Q ss_pred HHHHHHHHHHHhh
Q psy11481 258 QQRLALLKKELSA 270 (561)
Q Consensus 258 QQRLeELKqELsQ 270 (561)
..++++++..+.+
T Consensus 205 e~~ia~~k~K~e~ 217 (259)
T KOG4001|consen 205 ELKIAQLKKKLET 217 (259)
T ss_pred HHHHHHHHHHHHH
Confidence 6666666666544
No 202
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=45.41 E-value=61 Score=30.84 Aligned_cols=15 Identities=20% Similarity=0.275 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHhcc
Q psy11481 196 RRAHLKECFEILKRQ 210 (561)
Q Consensus 196 RRd~LKdcFeeLRdl 210 (561)
....+..|.++|+.+
T Consensus 35 ~~~e~~~~~~tl~~l 49 (145)
T COG1730 35 AISELQTAIETLENL 49 (145)
T ss_pred HHHHHHHHHHHHHHH
Confidence 355666777777543
No 203
>PF07407 Seadorna_VP6: Seadornavirus VP6 protein; InterPro: IPR009982 This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown.
Probab=45.26 E-value=29 Score=37.57 Aligned_cols=27 Identities=19% Similarity=0.052 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHA 257 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaL 257 (561)
|-.-.|++++.+|.+|.+.|+.|.++|
T Consensus 32 ~e~~aLr~EN~~LKkEN~~Lk~eVerL 58 (420)
T PF07407_consen 32 DENFALRMENHSLKKENNDLKIEVERL 58 (420)
T ss_pred hhhhhHHHHhHHHHHHHHHHHHHHHHH
Confidence 334445555554444444444444444
No 204
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=45.15 E-value=64 Score=27.75 Aligned_cols=33 Identities=21% Similarity=0.415 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
|..|+.+..+++.+++.++++...++..++++.
T Consensus 96 ~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~~~~ 128 (129)
T cd00890 96 LETLEKQIEKLEKQLEKLQDQITELQEELQQLQ 128 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 556777777888888888888888888887764
No 205
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=45.09 E-value=59 Score=36.29 Aligned_cols=32 Identities=13% Similarity=0.323 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
|.++.++++++++.|+.|+.+|+++++.++.+
T Consensus 95 ~saq~~dle~KIkeLEaE~~~Lk~Ql~a~~~~ 126 (475)
T PRK13729 95 LNKQRGDDQRRIEKLGQDNAALAEQVKALGAN 126 (475)
T ss_pred HhhhhhhHHHHHHHHHHHHHHHHHHHHhhhcC
Confidence 33555555666666666666666666544433
No 206
>PF04350 PilO: Pilus assembly protein, PilO; PDB: 2RJZ_B.
Probab=44.94 E-value=9.5 Score=33.22 Aligned_cols=38 Identities=29% Similarity=0.381 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFE---HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE---~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+..|+++.++.. ..++.++++...++++++++++.|+.
T Consensus 8 ~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~lP~ 48 (144)
T PF04350_consen 8 IQQLQQELAQLKEKVANLEELKKQLEQLEQQLEELLKKLPA 48 (144)
T ss_dssp ----HHHHHHTGGG-SSHHHHHHHHHHHHHHHHHHHHCTTG
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 444444443333 24455666666667777777766665
No 207
>PF07195 FliD_C: Flagellar hook-associated protein 2 C-terminus; InterPro: IPR010809 The flagellar hook-associated protein 2 (HAP2 or FliD) forms the distal end of the flagella, and plays a role in mucin specific adhesion of the bacteria []. This alignment covers the C-terminal region of the flagellar hook-associated protein 2.; GO: 0007155 cell adhesion, 0009288 bacterial-type flagellum
Probab=44.75 E-value=35 Score=33.30 Aligned_cols=39 Identities=13% Similarity=0.286 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEH 274 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~h 274 (561)
+..++..++.+++++.+++..+++|++.++..|-++|-.
T Consensus 191 i~~~~~~l~~~~~~~~~~i~~~~~rl~~~~~~l~~qf~~ 229 (239)
T PF07195_consen 191 ITSRIDSLNSQIKSLDKQIEDLEERLESKEERLRKQFSA 229 (239)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456677788888888888888888888888888776653
No 208
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=44.70 E-value=1.3e+02 Score=27.87 Aligned_cols=53 Identities=15% Similarity=0.280 Sum_probs=36.2
Q ss_pred CCcchHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 218 KKSSNLSILHSAIRYI----QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 218 ~KaSKAsILrKAiDYI----QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
....++.-|.+.++-+ ..|..+...++.....|+.++....+|+++|+.++..
T Consensus 13 el~n~La~Le~slE~~K~S~~eL~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki~e 69 (107)
T PF09304_consen 13 ELQNRLASLERSLEDEKTSQGELAKQKDQLRNALQSLQAQNASRNQRIAELQAKIDE 69 (107)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666777776654 4455556667777777888888888888888877754
No 209
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=44.65 E-value=42 Score=27.20 Aligned_cols=28 Identities=36% Similarity=0.500 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 243 FEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 243 LE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.|+++|+++...++..++.+++.|+.
T Consensus 2 ~~~E~~rL~Kel~kl~~~i~~~~~kL~n 29 (66)
T PF10458_consen 2 VEAEIERLEKELEKLEKEIERLEKKLSN 29 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 4678888888888888888888888864
No 210
>PF10944 DUF2630: Protein of unknown function (DUF2630); InterPro: IPR020311 This entry contains proteins with no known function.
Probab=44.52 E-value=38 Score=29.94 Aligned_cols=44 Identities=23% Similarity=0.331 Sum_probs=30.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
...|..|-.+|++|......=.-.-..-++||.+|+.+|.|=|+
T Consensus 7 l~rI~~LV~EE~~LR~~~~~g~~~~~~e~~RL~~lE~~LDQCWD 50 (81)
T PF10944_consen 7 LARINELVAEEHELRSRLQAGEIDSDEEHARLRQLEVELDQCWD 50 (81)
T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcHHHHHHHHHHHHHHHHHHH
Confidence 35699999999888765433222223345688999999999896
No 211
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=44.04 E-value=40 Score=35.13 Aligned_cols=30 Identities=23% Similarity=0.337 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSAR 271 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQq 271 (561)
.+..|+.+|+++...|+.+|..|+.|+.++
T Consensus 218 ~~~ae~seLq~r~~~l~~~L~~L~~e~~r~ 247 (289)
T COG4985 218 HYVAEKSELQKRLAQLQTELDALRAELERQ 247 (289)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhc
Confidence 455788888899999999999999888874
No 212
>KOG3558|consensus
Probab=44.00 E-value=24 Score=41.00 Aligned_cols=49 Identities=22% Similarity=0.354 Sum_probs=40.8
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCCCCc--CCcchHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPAQEE--KKSSNLSILHSAIRYIQ 234 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn--~KaSKAsILrKAiDYIQ 234 (561)
||+.-..+-|.||-+=++-|.+|+.+|| |..+ .-..|++|++=++-|++
T Consensus 46 rkEkSRdAARsRRsKEn~~FyeLa~~lP-lp~aisshLDkaSimRLtISyLR 96 (768)
T KOG3558|consen 46 RKEKSRDAARSRRSKENEEFYELAKLLP-LPAAISSHLDKASIMRLTISYLR 96 (768)
T ss_pred HhhhhhhhhhhhcccchHHHHHHHHhCC-CcchhhhhhhhHHHHHHHHHHHH
Confidence 3555567788999999999999999999 4543 45799999999999987
No 213
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=43.98 E-value=1.4e+02 Score=33.31 Aligned_cols=38 Identities=21% Similarity=0.195 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+.+..++++..+++.+++.++++...++++++++++++
T Consensus 428 e~l~~l~~~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 465 (650)
T TIGR03185 428 EELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTL 465 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444444444443
No 214
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=43.85 E-value=49 Score=29.46 Aligned_cols=30 Identities=23% Similarity=0.375 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+.+..+++|+.++..|.+.+++|+.+|..
T Consensus 45 ~rwek~v~~L~~e~~~l~~E~e~L~~~l~~ 74 (87)
T PF12709_consen 45 ARWEKKVDELENENKALKRENEQLKKKLDT 74 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 557788889999999999999999998865
No 215
>PF04201 TPD52: Tumour protein D52 family; InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family []. These proteins have an N-terminal coiled-coil that allows members to form homo- and heterodimers with each other [].
Probab=43.68 E-value=70 Score=31.29 Aligned_cols=28 Identities=21% Similarity=0.279 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELK 265 (561)
++..+++.|+.+++.|+.-|+|-|...+
T Consensus 29 eE~eeLr~EL~KvEeEI~TLrqvL~aKe 56 (162)
T PF04201_consen 29 EEREELRSELAKVEEEIQTLRQVLAAKE 56 (162)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345577777777788877776655433
No 216
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=43.61 E-value=22 Score=39.56 Aligned_cols=29 Identities=10% Similarity=0.194 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++++++|+|++|+++|++++++|++++.+
T Consensus 28 ~~~qkie~L~kql~~Lk~q~~~l~~~v~k 56 (489)
T PF11853_consen 28 DLLQKIEALKKQLEELKAQQDDLNDRVDK 56 (489)
T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccch
Confidence 33447777777777777777776666544
No 217
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=43.46 E-value=43 Score=28.96 Aligned_cols=35 Identities=23% Similarity=0.336 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|-.+-+++..|+.-|+.|+..|.+++..|..++-+
T Consensus 52 lp~~~keLL~EIA~lE~eV~~LE~~v~~L~~~l~~ 86 (88)
T PF14389_consen 52 LPKKAKELLEEIALLEAEVAKLEQKVLSLYRQLFQ 86 (88)
T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44455666777777888888888888888877754
No 218
>PF12938 M_domain: M domain of GW182
Probab=43.38 E-value=98 Score=31.81 Aligned_cols=50 Identities=18% Similarity=0.409 Sum_probs=43.9
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEH-------EMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~-------EkERLRREneaLQQRLeELKqELs 269 (561)
...+.+|..=...|+.|+..+.+|.. +..++..++..+|++|..|++++.
T Consensus 146 pqtL~LLnQLLq~I~~Lq~~Q~~L~~~~~~~~~~~~q~~~~I~~~kqqI~~lqnQIa 202 (235)
T PF12938_consen 146 PQTLTLLNQLLQQIKRLQQQQQNLQRQGNASGQEEQQLAVQINKTKQQIQQLQNQIA 202 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56688999989999999999888865 778889999999999999999987
No 219
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.38 E-value=81 Score=32.84 Aligned_cols=38 Identities=16% Similarity=0.241 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|+.++.+..+.+.++++|+.++..|+.+|.+.+.-|..
T Consensus 68 ~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r~~~l~~ 105 (265)
T COG3883 68 IDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKK 105 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56666666666677777777777777777766666644
No 220
>PF07047 OPA3: Optic atrophy 3 protein (OPA3); InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=43.27 E-value=73 Score=29.25 Aligned_cols=21 Identities=14% Similarity=0.268 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHhh
Q psy11481 250 LAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 250 LRREneaLQQRLeELKqELsQ 270 (561)
++.+.+.|+.++++|+.++.+
T Consensus 110 ~~~~l~~L~~~i~~L~~~~~~ 130 (134)
T PF07047_consen 110 LQERLEELEERIEELEEQVEK 130 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 334455555555555555443
No 221
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=43.17 E-value=36 Score=29.62 Aligned_cols=17 Identities=24% Similarity=0.198 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy11481 250 LAREKIHAQQRLALLKK 266 (561)
Q Consensus 250 LRREneaLQQRLeELKq 266 (561)
|++|.+.|..+|.++++
T Consensus 12 Lk~eiqkle~ELq~~~~ 28 (76)
T PF07334_consen 12 LKEEIQKLEAELQQNKR 28 (76)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 33333333333333333
No 222
>COG1792 MreC Cell shape-determining protein [Cell envelope biogenesis, outer membrane]
Probab=43.05 E-value=1.5e+02 Score=30.52 Aligned_cols=12 Identities=17% Similarity=0.324 Sum_probs=6.5
Q ss_pred eeeecCCCCCCC
Q psy11481 325 GLERSDGGESGS 336 (561)
Q Consensus 325 ~m~~~~~~~~~~ 336 (561)
++-+|.|-..|=
T Consensus 133 ~ivId~Gs~~GV 144 (284)
T COG1792 133 TIVIDKGSNDGI 144 (284)
T ss_pred EEEEecCcccCc
Confidence 445666655543
No 223
>PF13874 Nup54: Nucleoporin complex subunit 54; PDB: 3T97_B.
Probab=42.98 E-value=1.1e+02 Score=27.99 Aligned_cols=64 Identities=23% Similarity=0.286 Sum_probs=26.9
Q ss_pred HHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 203 CFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 203 cFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+|++|+..+=. ++..-.-=-..|....+-+..|++........++++++++.+|.+|+-.+-..
T Consensus 31 GF~dL~~R~~~-Q~~~~~~~~~~l~~i~~~l~~L~~~~~~~~~rl~~~r~r~~~L~hR~l~v~~~ 94 (141)
T PF13874_consen 31 GFEDLKKRVEA-QEEEIAQHRERLKEINDKLEELQKHDLETSARLEEARRRHQELSHRLLRVLRK 94 (141)
T ss_dssp ----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56666555544 21100011233333444455665555666667777777777777776554433
No 224
>PF05615 THOC7: Tho complex subunit 7; InterPro: IPR008501 The Tho complex (THOC) is involved in transcription elongation and mRNA export from the nucleus []. This entry represents the subunit THOC7, which is found in higher eukaryotes, and the non-homologous subunit Mft1p found in yeast. The funtions of these subunits are unknown, and it is not known if these subunits are functionally equivalent.
Probab=42.84 E-value=1.2e+02 Score=27.40 Aligned_cols=29 Identities=28% Similarity=0.423 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 240 EREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 240 EreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
...|+.+.++...+++..+..+++|+.+|
T Consensus 76 ~e~Y~~~~~~i~~~i~~~k~~ie~lk~~L 104 (139)
T PF05615_consen 76 RENYEQLNEEIEQEIEQAKKEIEELKEEL 104 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444
No 225
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=42.83 E-value=82 Score=33.47 Aligned_cols=48 Identities=23% Similarity=0.203 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.=|.++..-|..|+.+-+.+..|++++..|.++.+.+...|-+||+.
T Consensus 132 V~qLEk~~~q~~qLe~d~qs~lDEkeEl~~ERD~yk~K~~RLN~ELn~ 179 (319)
T PF09789_consen 132 VEQLEKLREQIEQLERDLQSLLDEKEELVTERDAYKCKAHRLNHELNY 179 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566778888899999999999999999999999999999999999865
No 226
>PF03980 Nnf1: Nnf1 ; InterPro: IPR007128 NNF1 is an essential yeast gene required for proper spindle orientation, nucleolar and nuclear envelope structure and mRNA export [].
Probab=42.44 E-value=61 Score=28.12 Aligned_cols=33 Identities=9% Similarity=0.290 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
..+++...++...++.++.++.+|.++|+++++
T Consensus 76 ~~~~~~~~~L~~~l~~l~~eN~~L~~~i~~~r~ 108 (109)
T PF03980_consen 76 PYKKKEREQLNARLQELEEENEALAEEIQEQRK 108 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 334555566777888888888888888888765
No 227
>PF11336 DUF3138: Protein of unknown function (DUF3138); InterPro: IPR021485 This family of proteins with unknown function appear to be restricted to Proteobacteria.
Probab=42.43 E-value=29 Score=38.65 Aligned_cols=26 Identities=19% Similarity=0.295 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+++.|++|.++||+|+.||+.+|..
T Consensus 25 ~~i~~L~~ql~aLq~~v~eL~~~laa 50 (514)
T PF11336_consen 25 DQIKALQAQLQALQDQVNELRAKLAA 50 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 45667777777777777777777754
No 228
>PF15456 Uds1: Up-regulated During Septation
Probab=42.29 E-value=2.5e+02 Score=26.09 Aligned_cols=73 Identities=26% Similarity=0.278 Sum_probs=35.7
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
-|..++ +|+++|+.... |......-....+.+.-.++ ++ +++.......++++..|...+..|+.++
T Consensus 41 lr~kl~-le~k~RdAa~s----l~~l~~~~~~~~~~~~~~~~-~~-------eeel~~~~rk~ee~~~eL~~le~R~~~~ 107 (124)
T PF15456_consen 41 LRRKLA-LESKIRDAAHS----LSRLYSSSSRRARFSRESSL-KA-------EEELAESDRKCEELAQELWKLENRLAEV 107 (124)
T ss_pred HHHHHH-HHHHHHHHHHH----HHHhcCCCccccCCCcchHH-HH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 466676 88888885544 33333331111222222222 22 2233333344555555556666666666
Q ss_pred HHHHhh
Q psy11481 265 KKELSA 270 (561)
Q Consensus 265 KqELsQ 270 (561)
++.|-+
T Consensus 108 ~~rLLe 113 (124)
T PF15456_consen 108 RQRLLE 113 (124)
T ss_pred HHHHHH
Confidence 666544
No 229
>PRK00846 hypothetical protein; Provisional
Probab=42.19 E-value=70 Score=27.77 Aligned_cols=34 Identities=15% Similarity=0.011 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
-|..|-+...+++.++++|+++.+.|..||.+++
T Consensus 28 tIe~LN~~v~~qq~~I~~L~~ql~~L~~rL~~~~ 61 (77)
T PRK00846 28 ALTELSEALADARLTGARNAELIRHLLEDLGKVR 61 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3666666666677777777777777777777666
No 230
>COG2919 Septum formation initiator [Cell division and chromosome partitioning]
Probab=42.19 E-value=86 Score=28.19 Aligned_cols=37 Identities=22% Similarity=0.299 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
.+|++ +.++...+++|.+.|.+++..|.++++.|+..
T Consensus 50 ~~~~~-l~~qi~~~~~e~~~L~~~~~~l~~ei~~L~dg 86 (117)
T COG2919 50 ADVLQ-LQRQIAAQQAELEKLSARNTALEAEIKDLKDG 86 (117)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 34444 44455567777777777777777777777755
No 231
>PRK12765 flagellar capping protein; Provisional
Probab=42.03 E-value=30 Score=38.87 Aligned_cols=41 Identities=17% Similarity=0.383 Sum_probs=34.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHID 276 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hID 276 (561)
|+..+.-+..++++|.++++.+++||+.+...|-+||...|
T Consensus 530 l~~~~~~l~~~~~~l~~~~~~~~~rl~~~~~r~~~qf~alD 570 (595)
T PRK12765 530 LTKYDESLTNEIKSLTTSKESTQELIDTKYETMANKWLQYD 570 (595)
T ss_pred chhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566778899999999999999999999999887787554
No 232
>PF04999 FtsL: Cell division protein FtsL; InterPro: IPR007082 In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein IPR007060 from INTERPRO [], however this interaction may be indirect [].; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=41.83 E-value=1.3e+02 Score=25.54 Aligned_cols=28 Identities=25% Similarity=0.322 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+++..+++++++|.+.|+..-+.|+-|.
T Consensus 38 ~~~~~~l~~l~~~~~~l~~e~~~L~lE~ 65 (97)
T PF04999_consen 38 RQLFYELQQLEKEIDQLQEENERLRLEI 65 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444444443
No 233
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=41.45 E-value=63 Score=32.54 Aligned_cols=29 Identities=3% Similarity=0.090 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+++.++++|++|+..||=+++++..+|.+
T Consensus 58 ~l~~ql~~lq~ev~~LrG~~E~~~~~l~~ 86 (263)
T PRK10803 58 QLQQQLSDNQSDIDSLRGQIQENQYQLNQ 86 (263)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 45555666666666666666666655543
No 234
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=41.24 E-value=79 Score=36.65 Aligned_cols=45 Identities=22% Similarity=0.146 Sum_probs=27.0
Q ss_pred CCCCCCCccccccCCCceeEeecc-c--ceEeeCCCCccccccccccc
Q psy11481 39 PDPAAHPLSHLHHSLPTTTIINTT-N--HITFTNGLPSSLLHRPLERK 83 (561)
Q Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~l~~r~l~~k 83 (561)
-.-+.||+--.....|.+..++.. + .||=.||-..+-++|.+-+-
T Consensus 298 l~~~rhPll~~~~~vp~di~l~~~~~~liItGpNg~GKSTlLK~i~~~ 345 (771)
T TIGR01069 298 LENARHPLLKEPKVVPFTLNLKFEKRVLAITGPNTGGKTVTLKTLGLL 345 (771)
T ss_pred EccccCceecCCceEeceeEeCCCceEEEEECCCCCCchHHHHHHHHH
Confidence 355778875322345666666533 2 34445888888888877543
No 235
>PF04350 PilO: Pilus assembly protein, PilO; PDB: 2RJZ_B.
Probab=41.01 E-value=24 Score=30.73 Aligned_cols=36 Identities=17% Similarity=0.351 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHh
Q psy11481 234 QFLRRREREFEHEMERLAR---EKIHAQQRLALLKKELS 269 (561)
Q Consensus 234 QkLQreEreLE~EkERLRR---EneaLQQRLeELKqELs 269 (561)
+.++.+..+++.++++++. ...+++++++++++++.
T Consensus 2 ~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~l~ 40 (144)
T PF04350_consen 2 KTLQAQIQQLQQELAQLKEKVANLEELKKQLEQLEQQLE 40 (144)
T ss_dssp ----------HHHHHHTGGG-SSHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 3444455555554444332 23344444445554444
No 236
>PRK02195 V-type ATP synthase subunit D; Provisional
Probab=40.97 E-value=1.1e+02 Score=30.06 Aligned_cols=44 Identities=20% Similarity=0.353 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.-|..+-.-|+.+...+..|+-++.++..+++++++++-++...
T Consensus 20 ~~a~rg~~lLk~KR~~Li~e~~~~~~~~~~lr~~~~~~~~~a~~ 63 (201)
T PRK02195 20 KMLERYLPTLKLKKAQLQAEVRRAKAEAAELEQEYQKLRQAIEA 63 (201)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34677888999999999999999999999999999888777543
No 237
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=40.89 E-value=87 Score=30.17 Aligned_cols=39 Identities=18% Similarity=0.263 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..+...+...+...+++..+++...++.++..-+.++-+
T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~~r~~l~~ 151 (302)
T PF10186_consen 113 LVESRQEQLEELQNELEERKQRLSQLQSQLARRRRQLIQ 151 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444455555555555555555555555444
No 238
>COG1256 FlgK Flagellar hook-associated protein [Cell motility and secretion]
Probab=40.70 E-value=1.4e+02 Score=33.55 Aligned_cols=73 Identities=14% Similarity=0.207 Sum_probs=41.0
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREF-------EHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreL-------E~EkERLRREneaLQQRLeELKqEL 268 (561)
=+..|++.|..|.++.-. + ..-+.|..||.||-.....++..-+.| ..|+..--.|+..|-++|++|-++.
T Consensus 109 l~~~L~~ff~s~q~la~~-P-~~~a~r~~vl~~a~~l~~~in~~~~~L~~l~~~i~~~I~~~V~~vNsLl~qIa~lN~qI 186 (552)
T COG1256 109 LSTLLNDFFNSLQELASN-P-SDTAARQAVLSKAQTLVNQINNTYEQLTDLRKDINAEIAATVDEVNSLLKQIADLNKQI 186 (552)
T ss_pred HHHHHHHHHHHHHHHHhC-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356667777777776655 2 234778888888866554444433333 3344444444555555555555554
Q ss_pred hh
Q psy11481 269 SA 270 (561)
Q Consensus 269 sQ 270 (561)
.+
T Consensus 187 ~~ 188 (552)
T COG1256 187 RK 188 (552)
T ss_pred HH
Confidence 33
No 239
>PF04568 IATP: Mitochondrial ATPase inhibitor, IATP; InterPro: IPR007648 ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=40.43 E-value=1.3e+02 Score=27.19 Aligned_cols=35 Identities=20% Similarity=0.370 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
||++.+++ +|++-+++|.+|+..-++.|++|++.+
T Consensus 65 Y~r~~EkE--qL~~Lk~kl~~e~~~~~k~i~~le~~I 99 (100)
T PF04568_consen 65 YFRKKEKE--QLKKLKEKLKEEIEHHRKEIDELEKHI 99 (100)
T ss_dssp HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 55544432 344445556666666777777777654
No 240
>PRK06664 fliD flagellar hook-associated protein FliD; Validated
Probab=40.43 E-value=98 Score=35.51 Aligned_cols=79 Identities=8% Similarity=0.149 Sum_probs=42.9
Q ss_pred hHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 195 NRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQF-------LRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQk-------LQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+=++.|.+-++.++.+.=.-.+......-.|..+--+|+.. +..+...++.+++++.++++.++.||+.++..
T Consensus 550 KL~~AL~~np~~V~~lF~~~~~g~~~~~~Gla~~l~~~l~~~t~~~G~i~~r~~~l~~~i~~l~~~i~~~e~rl~~~e~r 629 (661)
T PRK06664 550 KLDEVLKENPDSVRELFAFDSNGDAVMDNGVAKMLLEYLSPYTQAGGIIYNKVKGLDERIADNNKKIEEYEKKLESKERK 629 (661)
T ss_pred HHHHHHHhCHHHHHHHhcCCCcccccccCcHHHHHHHHHHHHHcCCCceehHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555656666666665321111111122444444444433 34455667777777777777777777777777
Q ss_pred Hhhhhc
Q psy11481 268 LSARWE 273 (561)
Q Consensus 268 LsQqw~ 273 (561)
|-++|-
T Consensus 630 l~~QFt 635 (661)
T PRK06664 630 LKGKYL 635 (661)
T ss_pred HHHHHH
Confidence 665554
No 241
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=40.31 E-value=1.1e+02 Score=28.25 Aligned_cols=50 Identities=28% Similarity=0.285 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy11481 226 LHSAIRYIQFLRRREREFEHEM--------ERLAREKIHAQQRLALLKKELSARWEHID 276 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~Ek--------ERLRREneaLQQRLeELKqELsQqw~hID 276 (561)
+..|+.=|..|+++..+++.++ +.++.+...|+.+++.|+..+.+ |..-+
T Consensus 5 a~~al~ki~~l~~~~~~i~~~~~~~I~~i~~~~~~~~~~l~~~i~~l~~~l~~-y~e~~ 62 (149)
T PF07352_consen 5 ADWALRKIAELQREIARIEAEANDEIARIKEWYEAEIAPLQNRIEYLEGLLQA-YAEAN 62 (149)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH-HHHCT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHC
Confidence 4556667777777766655444 34666677788888888888776 76433
No 242
>PRK03992 proteasome-activating nucleotidase; Provisional
Probab=40.03 E-value=93 Score=32.61 Aligned_cols=40 Identities=25% Similarity=0.289 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++++.|+++.++++.++++|..+...++++++.+++++.+
T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 47 (389)
T PRK03992 8 ERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEK 47 (389)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556666666666666666666777777777777654
No 243
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=39.97 E-value=1.3e+02 Score=28.82 Aligned_cols=26 Identities=15% Similarity=0.294 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqE 267 (561)
+++.++++|.++...+++.+.++.++
T Consensus 112 ~l~~~l~~l~~~~~~l~~~~q~~~q~ 137 (145)
T COG1730 112 KLQQALAELAQRIEQLEQEAQQLQQK 137 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444445555555555444444
No 244
>KOG0977|consensus
Probab=39.94 E-value=89 Score=35.47 Aligned_cols=38 Identities=34% Similarity=0.509 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
-|..++++.+.++.|+.+|++|+..|...|+.++.+|.
T Consensus 156 e~~~~krr~~~le~e~~~Lk~en~rl~~~l~~~r~~ld 193 (546)
T KOG0977|consen 156 EINTLKRRIKALEDELKRLKAENSRLREELARARKQLD 193 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH
Confidence 35556666667777777777777777777776666554
No 245
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=39.89 E-value=1.5e+02 Score=26.43 Aligned_cols=45 Identities=22% Similarity=0.290 Sum_probs=32.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++=|---|++|-.....+..+++.|+.+.+.++++.++|++++.+
T Consensus 61 frLaQl~ieYLl~~q~~L~~~~~~l~~~~~~~~~~~~~l~~~~~~ 105 (118)
T PF13815_consen 61 FRLAQLSIEYLLHCQEYLSSQLEQLEERLQELQQEIEKLKQKLKK 105 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333477777777777778888888888888888888777765
No 246
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=39.61 E-value=72 Score=34.03 Aligned_cols=35 Identities=17% Similarity=0.310 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
.+.++.|++...++.+++++|+.+...|+.+++.+
T Consensus 374 ~~~~~~l~~~~~~l~~~~~~l~~~~~~l~~~l~~~ 408 (451)
T PF03961_consen 374 KEQLKKLKEKKKELKEELKELKEELKELKEELERS 408 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34455556666666666666666666666666666
No 247
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=39.59 E-value=1e+02 Score=27.19 Aligned_cols=35 Identities=20% Similarity=0.378 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMER-------LAREKIHAQQRLALLKKE 267 (561)
Q Consensus 233 IQkLQreEreLE~EkER-------LRREneaLQQRLeELKqE 267 (561)
|..|++++..+..|++. |++|++.|++....-+..
T Consensus 27 ieELKekn~~L~~e~~~~~~~r~~L~~en~qLk~E~~~Wqer 68 (79)
T PRK15422 27 IEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQER 68 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 44455554444444332 444555554444443333
No 248
>PRK10265 chaperone-modulator protein CbpM; Provisional
Probab=39.48 E-value=39 Score=29.54 Aligned_cols=41 Identities=22% Similarity=0.216 Sum_probs=21.5
Q ss_pred hHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 222 NLSILHSAIRYIQFLRRRE------REFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 222 KAsILrKAiDYIQkLQreE------reLE~EkERLRREneaLQQRLe 262 (561)
-+.++++|++--..|.-.. -.+..++++|++|++.|++||.
T Consensus 49 ~l~r~~~a~rL~~dl~in~~gialvl~LLd~i~~Lr~el~~L~~~l~ 95 (101)
T PRK10265 49 AAIVVQRAVRLRHELALDWPGIAVALTLLDEIAHLKQENRLLRQRLS 95 (101)
T ss_pred HHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667777665544443322 2344555555555555555443
No 249
>PHA02621 agnoprotein; Provisional
Probab=39.15 E-value=12 Score=31.77 Aligned_cols=22 Identities=23% Similarity=0.300 Sum_probs=19.8
Q ss_pred ccccccccceeEEEeecCCccc
Q psy11481 76 LHRPLERKEELLLTTTSNGAHF 97 (561)
Q Consensus 76 ~~r~l~~k~~l~~~~~~~g~~f 97 (561)
.+|||+|+.+.-..-||+|+.-
T Consensus 2 vlrqlsrqasvkv~ktwtgtkk 23 (68)
T PHA02621 2 VLRQLSRQASVKVGKTWTGTKK 23 (68)
T ss_pred chHHhhhhheeeeccccccchH
Confidence 5899999999999999999853
No 250
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=38.74 E-value=2.1e+02 Score=23.21 Aligned_cols=70 Identities=23% Similarity=0.330 Sum_probs=46.2
Q ss_pred HHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 198 AHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|......+...+-... ...-+.-+.-.-.||..|......++.+++.++.+.+.++..+.+...+.-.
T Consensus 22 ~~L~~~~~~~~~~~~~~~---~~~s~~~~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~r~~l~~a~~~~k~ 91 (123)
T PF02050_consen 22 EQLQQERQEYQEQLSESQ---QGVSVAQLRNYQRYISALEQAIQQQQQELERLEQEVEQAREELQEARRERKK 91 (123)
T ss_dssp HHHHHHHHHHHHT--------SGGGHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhcc---CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444333321 1334566667888999999999999999999999888888888777666543
No 251
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=38.74 E-value=78 Score=27.76 Aligned_cols=36 Identities=17% Similarity=0.194 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+.|..+-+.++.+++.|+.....|+.++++++.-+.
T Consensus 2 qql~~q~~ql~~~i~~l~~~i~~l~~~i~e~~~~~~ 37 (126)
T TIGR00293 2 QQLAAELQILQQQVESLQAQIAALRALIAELETAIE 37 (126)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666667777777777777777777777766543
No 252
>KOG0995|consensus
Probab=38.62 E-value=1e+02 Score=35.31 Aligned_cols=9 Identities=44% Similarity=0.449 Sum_probs=5.1
Q ss_pred ccccccccc
Q psy11481 75 LLHRPLERK 83 (561)
Q Consensus 75 l~~r~l~~k 83 (561)
.--|||+.|
T Consensus 61 ~dpRpl~dk 69 (581)
T KOG0995|consen 61 RDPRPLSDK 69 (581)
T ss_pred CCCCCcccH
Confidence 344666666
No 253
>PF11544 Spc42p: Spindle pole body component Spc42p; InterPro: IPR021611 Spc42p is a 42kDa component of the S.cerevisiae spindle body that localises to the electron dense central region of the SPB. Spc42p is a phosphoprotein which forms a polymeric layer at the periphery of the SPB central plaque. This functions during SPB duplication and also facilitates the attachment of the SPB to the nuclear membrane. ; PDB: 2Q6Q_B.
Probab=38.59 E-value=76 Score=27.82 Aligned_cols=28 Identities=18% Similarity=0.243 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRL 261 (561)
+.|+.+....+.|+++|..-+..|+.+|
T Consensus 8 k~L~~kL~~K~eEI~rLn~lv~sLR~KL 35 (76)
T PF11544_consen 8 KELKKKLNDKQEEIDRLNILVGSLRGKL 35 (76)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444333
No 254
>PRK09039 hypothetical protein; Validated
Probab=38.11 E-value=1.8e+02 Score=30.55 Aligned_cols=38 Identities=24% Similarity=0.358 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|..|+++...++.+++.++++..+.+.+++.|+++|..
T Consensus 146 I~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L~~ 183 (343)
T PRK09039 146 IAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNV 183 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444555555555555443
No 255
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=38.06 E-value=1e+02 Score=25.55 Aligned_cols=37 Identities=14% Similarity=0.218 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|.+|-.+-+.|..++.+|..++..|+..+...|.|..
T Consensus 5 id~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAa 41 (56)
T PF04728_consen 5 IDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAA 41 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666666666666666666666666643
No 256
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=37.96 E-value=74 Score=31.30 Aligned_cols=48 Identities=17% Similarity=0.324 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccC
Q psy11481 233 IQFLRRREREFEH---------EMERLAREKIHAQQRLALLKKELSARWEHIDFNTL 280 (561)
Q Consensus 233 IQkLQreEreLE~---------EkERLRREneaLQQRLeELKqELsQqw~hIDfStL 280 (561)
|+.|+.++++|++ ++-.+++|....+.+|++++.++..=-..++|+++
T Consensus 141 l~~l~~~~~rl~~ll~ka~~~~d~l~ie~~L~~v~~eIe~~~~~~~~l~~~v~~sti 197 (262)
T PF14257_consen 141 LKNLEAEEERLLELLEKAKTVEDLLEIERELSRVRSEIEQLEGQLKYLDDRVDYSTI 197 (262)
T ss_pred HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhceEEE
Confidence 5555555555443 33355666666666666666665442234556664
No 257
>PF11382 DUF3186: Protein of unknown function (DUF3186); InterPro: IPR021522 This bacterial family of proteins has no known function.
Probab=37.86 E-value=45 Score=34.31 Aligned_cols=22 Identities=23% Similarity=0.345 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 247 MERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 247 kERLRREneaLQQRLeELKqEL 268 (561)
.++|++|+++|++++++++.++
T Consensus 41 ~~~lr~e~~~l~~~~~~~~~~~ 62 (308)
T PF11382_consen 41 FDSLREENDELRAELDALQAQL 62 (308)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444443
No 258
>KOG2070|consensus
Probab=37.66 E-value=3.6e+02 Score=31.22 Aligned_cols=208 Identities=17% Similarity=0.169 Sum_probs=0.0
Q ss_pred CCcccccCCccCCccceeeecCCCCCCCCccc--cccCCCceeEeecccceEeeCCCCcccccccccccceeEEEeecCC
Q psy11481 17 PSSCYTNGVSGSNLGSVITTLSPDPAAHPLSH--LHHSLPTTTIINTTNHITFTNGLPSSLLHRPLERKEELLLTTTSNG 94 (561)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~r~l~~k~~l~~~~~~~g 94 (561)
||+|-....+ .-.-.+|.|++.| -||.-|+++|-|- .-+--+.+.+-.++-. -.|
T Consensus 433 PSs~~ads~n---------~a~~s~a~h~l~~kp~~h~tP~~~~q~~-----------~p~~ppk~~kp~s~S~---~~p 489 (661)
T KOG2070|consen 433 PSSKHADSKN---------PAPLSPAYHTLPHKPSHHGTPHTTIQNW-----------GPLEPPKTPKPWSLSC---LRP 489 (661)
T ss_pred cccccccCCC---------CCCCCcccCcCCCCCCCCCCCCCCcccc-----------CCCCCCCCCCCcchhh---cCC
Q ss_pred -ccccccccceecccccCCCCChhhhhhcccccccceeeccCccceeecccccc--hhcc----cCccCCCcceeeecCC
Q psy11481 95 -AHFVESVSNVQQPVISTPVTHPAAVMQRHYTLHIEDEMMGSREVVASLSGETR--LVAH----NNYTSNSHAVALSTSP 167 (561)
Q Consensus 95 -~~f~~s~s~VQqp~~s~~~tqp~~~~qr~~~~~~~~~vvGs~~~~~S~S~~~a--vva~----~~~~~NS~~VaLsnSp 167 (561)
.-|+ ++-+|++-.-++.+.- +---+|.-++-..+-+.. .-.+ ++++.+..+-++..++
T Consensus 490 NmKfa-------------~pL~~saa~g~~~~~s--s~~k~stk~n~t~s~~~~~~~~~~~ps~~~~~~r~~t~a~~~~e 554 (661)
T KOG2070|consen 490 NMKFA-------------PPLRPSAALGYKEDLS--SRNKRSTKSNKTMSKLLPKRKPERKPSDEEFASRKSTAALEEDE 554 (661)
T ss_pred CcccC-------------CCCCcchhhhcCCCCc--ccccccccCcccHhhcCcccCCCCCCchhhhhhhcccccccccc
Q ss_pred CCCCccccccCCCCCCCchhhhhHHHHh-------HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHH
Q psy11481 168 NNLTQNDMIKKRSGISGIREVHNKLEKN-------RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRE 240 (561)
Q Consensus 168 ~~L~q~se~kKRPg~SeRRaTHNaLERK-------RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreE 240 (561)
-+...-+....-+..+..+.+|+..-++ =-++=|-..++-|...=...+++.. +-.+..-.|-|++||+++
T Consensus 555 ie~q~lkvle~YCt~~~~qqt~~~s~~~~~~p~~l~~e~eki~~ee~r~~~~~vleeksl--vdtvyalkd~v~~lqqd~ 632 (661)
T KOG2070|consen 555 IEAQILKVLEAYCTSAKTQQTLNSSNRKYSQPQVLLPEEEKILMEETRSNGQSVLEEKSL--VDTVYALKDEVSELQQDN 632 (661)
T ss_pred chhHHHHHHHHHhhcCCCcCCcccchhhcccceehhhhHHHHHHHhcccccceeecccch--hHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeEL 264 (561)
+++...+|.-++-.+.|++-|-.+
T Consensus 633 ~kmkk~leeEqkaRrdLe~ll~k~ 656 (661)
T KOG2070|consen 633 KKMKKVLEEEQKARRDLEKLLRKM 656 (661)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
No 259
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=37.62 E-value=1.4e+02 Score=33.59 Aligned_cols=18 Identities=17% Similarity=0.226 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHhccCCC
Q psy11481 196 RRAHLKECFEILKRQVPP 213 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPs 213 (561)
+|......++.|++.+..
T Consensus 252 ~~~~a~a~~~~l~~~l~~ 269 (754)
T TIGR01005 252 NRAAAEGTADSVKKALQN 269 (754)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 566666677777766654
No 260
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=37.62 E-value=1.6e+02 Score=25.70 Aligned_cols=24 Identities=33% Similarity=0.326 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeEL 264 (561)
...+.|.+.|+.|++.|++=|..|
T Consensus 40 e~Vk~E~~kL~~EN~~Lq~YI~nL 63 (80)
T PF10224_consen 40 EEVKEENEKLESENEYLQQYIGNL 63 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444455555555555555544
No 261
>PF11285 DUF3086: Protein of unknown function (DUF3086); InterPro: IPR021437 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=37.61 E-value=62 Score=34.00 Aligned_cols=27 Identities=22% Similarity=0.476 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQ 259 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQ 259 (561)
.+.|+++.+.++.|+++|+++++.+++
T Consensus 6 L~eL~qrk~~Lq~eIe~LerR~~ri~~ 32 (283)
T PF11285_consen 6 LKELEQRKQALQIEIEQLERRRERIEK 32 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666666666666666665555443
No 262
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=37.43 E-value=1.2e+02 Score=30.59 Aligned_cols=39 Identities=15% Similarity=0.226 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++.|+++.+.+.++-.++.+|+..|-.+++.|+.+..+
T Consensus 75 ~~~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~k 113 (193)
T PF14662_consen 75 LAKSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGK 113 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 456666667777777777777777777777777777554
No 263
>PLN02678 seryl-tRNA synthetase
Probab=37.41 E-value=84 Score=34.45 Aligned_cols=27 Identities=22% Similarity=0.218 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 244 EHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 244 E~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++..+|++|...|..++.+++.++.+
T Consensus 77 ~~~~~~Lk~ei~~le~~~~~~~~~l~~ 103 (448)
T PLN02678 77 IAETKELKKEITEKEAEVQEAKAALDA 103 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444445555555555555555444
No 264
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=37.19 E-value=1.2e+02 Score=28.69 Aligned_cols=27 Identities=22% Similarity=0.368 Sum_probs=12.2
Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHH
Q psy11481 229 AIRYIQFLRR---REREFEHEMERLAREKI 255 (561)
Q Consensus 229 AiDYIQkLQr---eEreLE~EkERLRREne 255 (561)
+-.-|..|+. .+.+++.+++.|+.++.
T Consensus 39 ~d~~i~~Lk~~~~d~eeLk~~i~~lq~~~~ 68 (155)
T PF06810_consen 39 ADKQIKDLKKSAKDNEELKKQIEELQAKNK 68 (155)
T ss_pred HHHHHHHHHhccCCHHHHHHHHHHHHHHHH
Confidence 3334444444 33444444444444444
No 265
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=37.19 E-value=1.3e+02 Score=24.80 Aligned_cols=33 Identities=21% Similarity=0.282 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+....++...+-+.++++..|++++..|..++.
T Consensus 17 e~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~ 49 (69)
T PF04102_consen 17 EDTIEELNDVVTEQQRQIDRLQRQLRLLRERLR 49 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444555555555555555555443
No 266
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=37.17 E-value=1.9e+02 Score=27.68 Aligned_cols=25 Identities=28% Similarity=0.436 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHH---Hhhhhc
Q psy11481 249 RLAREKIHAQQRLALLKKE---LSARWE 273 (561)
Q Consensus 249 RLRREneaLQQRLeELKqE---LsQqw~ 273 (561)
-|+-+...+..++..|+.| |-+.||
T Consensus 155 ~L~l~~~~~e~k~~~l~~En~~Lv~Rwm 182 (194)
T PF08614_consen 155 ALQLQLNMLEEKLRKLEEENRELVERWM 182 (194)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333334444555555554 344566
No 267
>PHA02562 46 endonuclease subunit; Provisional
Probab=36.93 E-value=84 Score=33.43 Aligned_cols=33 Identities=21% Similarity=0.128 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|+.+..+++.++++|..+...++..++++.++.
T Consensus 370 l~~~~~~~~~~l~~l~~~l~~~~~~~~~~~ke~ 402 (562)
T PHA02562 370 LQAEFVDNAEELAKLQDELDKIVKTKSELVKEK 402 (562)
T ss_pred HHhhhhchHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333444444444444444444444443
No 268
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=36.69 E-value=1.2e+02 Score=28.26 Aligned_cols=30 Identities=23% Similarity=0.541 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
|..|+.+...++.++++++.+...++..++
T Consensus 37 I~sL~~K~~~lE~eld~~~~~l~~~k~~le 66 (143)
T PF12718_consen 37 ITSLQKKNQQLEEELDKLEEQLKEAKEKLE 66 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555555555555555554444444333333
No 269
>PRK08032 fliD flagellar capping protein; Reviewed
Probab=36.68 E-value=1.4e+02 Score=32.48 Aligned_cols=38 Identities=5% Similarity=0.190 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
|..+...+..++++|.++++.++.||+.++..|-+||-
T Consensus 404 l~~~~~~l~~~i~~l~~~i~~~~~rl~~~e~rl~~qF~ 441 (462)
T PRK08032 404 IKTATDGVNKTLKKLTKQYNAVSDSIDATIARYKAQFT 441 (462)
T ss_pred chhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455677788888888888888888888877766554
No 270
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=36.54 E-value=1.3e+02 Score=36.85 Aligned_cols=65 Identities=12% Similarity=0.257 Sum_probs=41.5
Q ss_pred HHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 198 AHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFL---RRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkL---QreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
+.|+..+++|++. ... .+..-+..|+.|.+++.+. +++.++|++.+++.-++.++++++|+.+++
T Consensus 26 ~~iq~~l~~~~~~--~~~--~~k~~~~~l~~tl~~l~~~~~~~~~~~~~~~~i~~ap~~~~~~~~~l~~~~~ 93 (1109)
T PRK10929 26 KQITQELEQAKAA--KTP--AQAEIVEALQSALNWLEERKGSLERAKQYQQVIDNFPKLSAELRQQLNNERD 93 (1109)
T ss_pred HHHHHHHHHhhcC--CCh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhc
Confidence 4677777777663 211 2445577788888875544 445555667777777777777777765543
No 271
>COG1792 MreC Cell shape-determining protein [Cell envelope biogenesis, outer membrane]
Probab=36.47 E-value=1.1e+02 Score=31.43 Aligned_cols=10 Identities=20% Similarity=0.242 Sum_probs=4.2
Q ss_pred HHHHHHHHHh
Q psy11481 260 RLALLKKELS 269 (561)
Q Consensus 260 RLeELKqELs 269 (561)
+++.|++|+.
T Consensus 91 ~~~~l~~EN~ 100 (284)
T COG1792 91 EVESLEEENK 100 (284)
T ss_pred HHHHHHHHHH
Confidence 3344444433
No 272
>KOG4796|consensus
Probab=36.39 E-value=1.6e+02 Score=33.92 Aligned_cols=79 Identities=23% Similarity=0.355 Sum_probs=46.2
Q ss_pred CchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 184 GIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 184 eRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeE 263 (561)
+=|++|-.+|+-+|.-+. .=.+|+..++......+.-+..||+. |=+. + .+..+.+|+ ++-+.|..+|+.
T Consensus 516 EYreLharve~vs~rF~~-Lea~L~srls~gS~ey~~i~~qI~qE---Yeki-~-~dp~y~eeK----~RceYLhsKLaH 585 (604)
T KOG4796|consen 516 EYRELHARVETVSRRFRQ-LEAQLKSRLSPGSPEYKQIEKQILQE---YEKI-R-KDPNYMEEK----QRCEYLHSKLAH 585 (604)
T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHhccCCCCCcHHHHHHHHHHH---HHHh-h-cCccHHHHH----HHHHHHHHHHHH
Confidence 459999999988775433 22344445555455566666666643 4221 1 334444444 345667777777
Q ss_pred HHHHHhhhhc
Q psy11481 264 LKKELSARWE 273 (561)
Q Consensus 264 LKqELsQqw~ 273 (561)
+|+.+.. |+
T Consensus 586 IK~lI~e-fD 594 (604)
T KOG4796|consen 586 IKTLIGE-FD 594 (604)
T ss_pred HHHHHHH-HH
Confidence 7776665 55
No 273
>PRK12772 bifunctional flagellar biosynthesis protein FliR/FlhB; Provisional
Probab=36.19 E-value=1.3e+02 Score=34.05 Aligned_cols=88 Identities=13% Similarity=0.186 Sum_probs=57.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHhhhhcccccccCCCCCccccccccCccCCC
Q psy11481 225 ILHSAIRYIQFLRRREREFEHEMERLAREKI------HAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDNHHHES 298 (561)
Q Consensus 225 ILrKAiDYIQkLQreEreLE~EkERLRREne------aLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQa~~DS 298 (561)
++--++||+-+-.+-++++.-.+++.++|.. +.|.|..++++|+.++-| +..|
T Consensus 458 ~via~~D~~~q~~~~~k~lkMskqEvK~E~Ke~EGdP~iK~r~R~~~re~~~~~m-------------~~~V-------- 516 (609)
T PRK12772 458 IIIAVADYVYQKYQYNKDLRMTKQEVKEEYKQDEGDPQIKAKIKQKQREMAMQRM-------------MQEV-------- 516 (609)
T ss_pred HHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHhccCCHHHHHHHHHHHHHHHHhhh-------------hccC--------
Confidence 3445688877777777777767777777766 467888999999887433 0111
Q ss_pred cccccCCCCCCcCCCCceeEeecCc---ceeeecCCCCCCCccCCCCCcccCCCCCccc
Q psy11481 299 SLLSYGKERSYMDEDGGLVIVTNGS---VGLERSDGGESGSSLGGGGEMISGRGEGEYG 354 (561)
Q Consensus 299 silS~~~~~~~~~~~g~l~~vtn~~---~~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (561)
+ .-.|+|||-| +++.-| ....+.|++--+|.++.-
T Consensus 517 ------------~--~AdvVitNPTH~AVAL~Y~-------~~~~~aP~VvAKG~d~~A 554 (609)
T PRK12772 517 ------------P--KATVVVTNPTHIAVALKYE-------EGKDEAPKVVAKGADYVA 554 (609)
T ss_pred ------------C--CCcEEEECCCceEEEeEeC-------CCCCCCCEEEEeeCcHHH
Confidence 1 1246777765 777777 334566777777766543
No 274
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=36.09 E-value=78 Score=26.70 Aligned_cols=24 Identities=21% Similarity=0.273 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHA 257 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaL 257 (561)
..|+++..+++..+.+|+.|+..|
T Consensus 17 evLK~~I~eL~~~n~~Le~EN~~L 40 (59)
T PF01166_consen 17 EVLKEQIAELEERNSQLEEENNLL 40 (59)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444433
No 275
>cd04769 HTH_MerR2 Helix-Turn-Helix DNA binding domain of MerR2-like transcription regulators. Helix-turn-helix (HTH) transcription regulator MerR2 and related proteins. MerR2 in Bacillus cereus RC607 regulates resistance to organomercurials. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=36.09 E-value=80 Score=27.67 Aligned_cols=34 Identities=21% Similarity=0.182 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
...|+++.+++++++++|++....|...++.++.
T Consensus 81 ~~~l~~~~~~l~~~i~~l~~~~~~l~~~~~~~~~ 114 (116)
T cd04769 81 QQALEDKKQEIRAQITELQQLLARLDAFEASLKD 114 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4557777777777777777766666666665553
No 276
>KOG0804|consensus
Probab=36.07 E-value=2.4e+02 Score=31.84 Aligned_cols=28 Identities=14% Similarity=0.339 Sum_probs=17.0
Q ss_pred hhhhHHHHhHHHHHHHHHHHHhccCCCC
Q psy11481 187 EVHNKLEKNRRAHLKECFEILKRQVPPA 214 (561)
Q Consensus 187 aTHNaLERKRRd~LKdcFeeLRdlVPsL 214 (561)
..|+.+|+.-=.+.|..|+.+...+-.+
T Consensus 339 ~~~~e~~qsqlen~k~~~e~~~~e~~~l 366 (493)
T KOG0804|consen 339 QIMSEYEQSQLENQKQYYELLITEADSL 366 (493)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHhh
Confidence 5568888854555555666655555443
No 277
>PRK14011 prefoldin subunit alpha; Provisional
Probab=36.05 E-value=1.6e+02 Score=27.84 Aligned_cols=14 Identities=14% Similarity=0.235 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHhcc
Q psy11481 197 RAHLKECFEILKRQ 210 (561)
Q Consensus 197 Rd~LKdcFeeLRdl 210 (561)
+..+..|.+.|..+
T Consensus 33 ~~e~~~~ie~L~~l 46 (144)
T PRK14011 33 KMELLKSIESMEGL 46 (144)
T ss_pred HHHHHHHHHHHHcc
Confidence 44556777777644
No 278
>KOG0996|consensus
Probab=35.99 E-value=86 Score=38.65 Aligned_cols=74 Identities=23% Similarity=0.237 Sum_probs=45.4
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKI-------HAQQRLALLKKEL 268 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREne-------aLQQRLeELKqEL 268 (561)
|-+.....+++|+..+-... ..+.-+...|.+-..-|..++.+.++.+.++.+++++.. .+++|+++++..+
T Consensus 515 ~~~~~~~~~e~lk~~L~~~~-~~~~e~~~~l~~~k~~l~~~k~e~~~~~k~l~~~~~e~~~~~~~~~~~rqrveE~ks~~ 593 (1293)
T KOG0996|consen 515 RHETGLKKVEELKGKLLASS-ESLKEKKTELDDLKEELPSLKQELKEKEKELPKLRKEERNLKSQLNKLRQRVEEAKSSL 593 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445555666666555522 233334555556666677777787777777777777766 6666666666655
Q ss_pred hh
Q psy11481 269 SA 270 (561)
Q Consensus 269 sQ 270 (561)
+.
T Consensus 594 ~~ 595 (1293)
T KOG0996|consen 594 SS 595 (1293)
T ss_pred Hh
Confidence 43
No 279
>COG4079 Uncharacterized protein conserved in archaea [Function unknown]
Probab=35.89 E-value=5.1e+02 Score=27.60 Aligned_cols=57 Identities=25% Similarity=0.345 Sum_probs=35.0
Q ss_pred hHHHHHHHHHHHHhccCCCCCC-------cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 195 NRRAHLKECFEILKRQVPPAQE-------EKKSSNLSILHSAIRYIQFLRRREREFEHEMERLARE 253 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPsLsD-------n~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRRE 253 (561)
.-++.|++||+.++..-|.... +.+..++.=|.+ +-|..|++-.++|.+||-.-.+|
T Consensus 150 d~~dal~elfe~vss~tpsVskeydiy~vs~~~d~~~rl~k--kDie~L~k~R~~L~~emld~~~e 213 (293)
T COG4079 150 DAVDALMELFETVSSKTPSVSKEYDIYQVSSNVDPVLRLVK--KDIETLRKHRRRLAEEMLDHIRE 213 (293)
T ss_pred HHHHHHHHHHHHhhcCCCcccceeEEEEecCCcCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH
Confidence 3478999999999998888653 233333333333 45666666666666665444443
No 280
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=35.85 E-value=92 Score=27.42 Aligned_cols=35 Identities=17% Similarity=0.244 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+.|....+.++.+++.|+.+...|+.+++++..-+
T Consensus 2 ~~l~~~~~~l~~~i~~l~~~~~~l~~~~~e~~~~~ 36 (129)
T cd00584 2 EQLAAQLQVLQQEIEELQQELARLNEAIAEYEQAK 36 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555566666666666666666666555443
No 281
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=35.69 E-value=19 Score=31.18 Aligned_cols=43 Identities=21% Similarity=0.312 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|..-.+.+..|.+++..+..++++|+.+...++.+.+.|++.|
T Consensus 27 l~~l~~~~~~l~~e~~~L~~~~~~l~~~l~~~~~~~~~l~~~l 69 (131)
T PF05103_consen 27 LDELAEELERLQRENAELKEEIEELQAQLEELREEEESLQRAL 69 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCCCCT---------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHhh
Confidence 4444445555555555566655555555555555555555544
No 282
>PF07412 Geminin: Geminin; InterPro: IPR022786 This family contains the eukaryotic protein geminin (approximately 200 residues long). Geminin inhibits DNA replication by preventing the incorporation of MCM complex into prereplication complex, and is degraded during the mitotic phase of the cell cycle. It has been proposed that geminin inhibits DNA replication during S, G2, and M phases and that geminin destruction at the metaphase-anaphase transition permits replication in the succeeding cell cycle []. ; GO: 0008156 negative regulation of DNA replication; PDB: 1T6F_B 2LP0_B 1UII_B 2WVR_B 2ZXX_B.
Probab=35.63 E-value=75 Score=31.93 Aligned_cols=38 Identities=13% Similarity=0.182 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 228 SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
.|++-..+|+++...++.|+.+|+.|+..|+.-.+.++
T Consensus 122 eaL~ENe~Lh~~ie~~~eEi~~lk~en~~L~elae~~~ 159 (200)
T PF07412_consen 122 EALEENEKLHKEIEQKDEEIAKLKEENEELKELAEHVQ 159 (200)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444455555555555556666666655554433333
No 283
>COG1422 Predicted membrane protein [Function unknown]
Probab=35.60 E-value=1.6e+02 Score=29.89 Aligned_cols=22 Identities=23% Similarity=0.383 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 238 RREREFEHEMERLAREKIHAQQ 259 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQ 259 (561)
++-++++++++++++|.+++++
T Consensus 72 ekm~~~qk~m~efq~e~~eA~~ 93 (201)
T COG1422 72 EKMKELQKMMKEFQKEFREAQE 93 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555566666666666554
No 284
>PRK03992 proteasome-activating nucleotidase; Provisional
Probab=35.51 E-value=1e+02 Score=32.36 Aligned_cols=38 Identities=32% Similarity=0.334 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
+..-+++|+.+.+++..+++.++++.+.++++++.|+.
T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 50 (389)
T PRK03992 13 LEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50 (389)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 33457778888888888888888888888888877775
No 285
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=35.31 E-value=1.8e+02 Score=25.57 Aligned_cols=45 Identities=13% Similarity=0.166 Sum_probs=28.1
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
.++..++..--+=|..++.+.+.++..++.|+++...++++|.++
T Consensus 63 ~~~~e~~~~l~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l~~~ 107 (110)
T TIGR02338 63 TDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQEA 107 (110)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444455556677777777777777777777777655
No 286
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=35.23 E-value=97 Score=34.68 Aligned_cols=34 Identities=32% Similarity=0.555 Sum_probs=19.4
Q ss_pred CccCCCCCCCchhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcc
Q psy11481 395 LPYRPTQSTCPSVIQSSVINTLSASNGVPASAGYNGKPATSPAM 438 (561)
Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (561)
.+|-|+-|--+.|+ -+|+-|.+|-+|+--+.|.+
T Consensus 211 ~~YiPsgSf~~avl----------l~G~DA~tg~~aq~nP~Pv~ 244 (475)
T PRK13729 211 LPYIPSGSFAKAML----------IEGADANASVTGNESTVPMQ 244 (475)
T ss_pred CceeCCCCeEEEEE----------ecccccCccccccCCCccEE
Confidence 35666655444332 25677777777766555544
No 287
>PF01736 Polyoma_agno: Polyomavirus agnoprotein; InterPro: IPR002643 This family consists of the DNA-binding protein or agnoprotein from various polyomaviruses. This protein is highly basic and can bind single stranded and double stranded DNA []. Mutations in the agnoprotein produce smaller viral plaques, hence its function is not essential for growth in tissue culture cells but something has slowed in the normal replication cycle []. There is also evidence suggesting that the agnogene and agnoprotein act as regulators of structural protein synthesis [].; GO: 0003677 DNA binding
Probab=34.85 E-value=11 Score=31.51 Aligned_cols=21 Identities=24% Similarity=0.366 Sum_probs=19.3
Q ss_pred ccccccccceeEEEeecCCcc
Q psy11481 76 LHRPLERKEELLLTTTSNGAH 96 (561)
Q Consensus 76 ~~r~l~~k~~l~~~~~~~g~~ 96 (561)
.+|||+|+.+.-..-||||+.
T Consensus 2 VLRqLSRqAsVkv~ktWtgtK 22 (62)
T PF01736_consen 2 VLRQLSRQASVKVGKTWTGTK 22 (62)
T ss_pred hHHHHhHhhhheeccccccch
Confidence 589999999999999999985
No 288
>PF11180 DUF2968: Protein of unknown function (DUF2968); InterPro: IPR021350 This family of proteins has no known function.
Probab=34.74 E-value=1.5e+02 Score=29.77 Aligned_cols=119 Identities=18% Similarity=0.200 Sum_probs=56.4
Q ss_pred chhcccCccCCCcceeeecCCCCCC-------cccc---ccCCCCCCCchhhhhHHHHh--------HHHHHHHHHHHHh
Q psy11481 147 RLVAHNNYTSNSHAVALSTSPNNLT-------QNDM---IKKRSGISGIREVHNKLEKN--------RRAHLKECFEILK 208 (561)
Q Consensus 147 avva~~~~~~NS~~VaLsnSp~~L~-------q~se---~kKRPg~SeRRaTHNaLERK--------RRd~LKdcFeeLR 208 (561)
.+.-|+-+-|-+-|.+|-+.+++++ ++.. -|..-...--+..++..++- ||..|-.--..+.
T Consensus 39 ~ltElRTtyNGsYGASLlF~~~eltYYVALfq~k~fWRViKt~d~~~AE~~Y~~F~~Qt~~LA~~eirR~~LeAQka~~e 118 (192)
T PF11180_consen 39 QLTELRTTYNGSYGASLLFYPKELTYYVALFQQKAFWRVIKTQDEARAEAIYRDFAQQTARLADVEIRRAQLEAQKAQLE 118 (192)
T ss_pred chhhhhhhccCCccceeeecCCcceeeeeeeecCceeEeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445566666678888888888652 2222 11111111123333333332 5555555555555
Q ss_pred ccCCCCCCcCCcchHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 209 RQVPPAQEEKKSSNLSILH-----------SAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 209 dlVPsLsDn~KaSKAsILr-----------KAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++|-.- ..++.-|+ +..++=+..+++-+.|+.|....+.+.+.|++++..|+.+...
T Consensus 119 R~ia~~-----~~ra~~LqaDl~~~~~Q~~~va~~Q~q~r~ea~aL~~e~~aaqaQL~~lQ~qv~~Lq~q~~~ 186 (192)
T PF11180_consen 119 RLIAES-----EARANRLQADLQIARQQQQQVAARQQQARQEAQALEAERRAAQAQLRQLQRQVRQLQRQANE 186 (192)
T ss_pred HHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 444331 11222222 1233444444555555555555566666666666666655543
No 289
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=34.74 E-value=94 Score=28.06 Aligned_cols=42 Identities=19% Similarity=0.176 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
-...|-+.+-.++.+++.++.+...+...+..|+.++..-|.
T Consensus 42 ~n~~lAe~nL~~~~~l~~~r~~l~~~~~~~~~L~~~~~~k~~ 83 (150)
T PF07200_consen 42 ENEELAEQNLSLEPELEELRSQLQELYEELKELESEYQEKEQ 83 (150)
T ss_dssp HHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555566666666555556666655544444
No 290
>KOG3647|consensus
Probab=34.68 E-value=95 Score=33.11 Aligned_cols=32 Identities=25% Similarity=0.278 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++..+...+++-++|.+.+++||+.|+.--++
T Consensus 134 dea~L~~Kierrk~ElEr~rkRle~LqsiRP~ 165 (338)
T KOG3647|consen 134 DEAALGSKIERRKAELERTRKRLEALQSIRPA 165 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcchH
Confidence 34556678888888889999999888765444
No 291
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=34.68 E-value=79 Score=34.91 Aligned_cols=44 Identities=14% Similarity=0.373 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLARE----------KIHAQQRLALLKKELS 269 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRRE----------neaLQQRLeELKqELs 269 (561)
+.+-.+++..++++++.+..|+++|++. .+.++++|++++.++.
T Consensus 312 ~~~l~~~l~~~~e~~~~l~~Ei~~l~~sY~l~~~e~~~~~~lekeL~~Le~~~~ 365 (569)
T PRK04778 312 SDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYD 365 (569)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHccccCchhHHHHHHHHHHHHHHHHHHH
Confidence 3445667788888888888888888777 7777777777777665
No 292
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=34.67 E-value=1.7e+02 Score=28.29 Aligned_cols=44 Identities=16% Similarity=0.200 Sum_probs=32.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+++|.+-.+.++-+..-++++-.+|+++.+.|..+|..|+.-+.
T Consensus 79 ik~AYe~A~~lQ~~L~~~re~E~qLr~rRD~LErrl~~l~~tie 122 (159)
T PF05384_consen 79 IKEAYEEAHELQVRLAMLREREKQLRERRDELERRLRNLEETIE 122 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55677777777777777777777888888888888887776654
No 293
>TIGR01834 PHA_synth_III_E poly(R)-hydroxyalkanoic acid synthase, class III, PhaE subunit. This model represents the PhaE subunit of the heterodimeric class (class III) of polymerase for poly(R)-hydroxyalkanoic acids (PHAs), carbon and energy storage polymers of many bacteria. The most common PHA is polyhydroxybutyrate but about 150 different constituent hydroxyalkanoic acids (HAs) have been identified in various species. This model must be designated subfamily to indicate the heterogeneity of PHAs.
Probab=34.61 E-value=55 Score=34.80 Aligned_cols=47 Identities=13% Similarity=0.170 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 224 SILHSAIRYIQFLRRREREFE--------HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE--------~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++....+|=+.+++.-.++. .|++.+.+++.+|++|+.+|+++|.+
T Consensus 260 ~lvna~m~lr~~~qe~~e~~L~~LnlPTRsElDe~~krL~ELrR~vr~L~k~l~~ 314 (320)
T TIGR01834 260 KFINALMRLRIQQQEIVEALLKMLNLPTRSELDEAHQRIQQLRREVKSLKKRLGD 314 (320)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444455555554444433 57777777777777777777777654
No 294
>PF07445 priB_priC: Primosomal replication protein priB and priC; InterPro: IPR010890 This family contains the bacterial primosomal replication proteins priB and priC (approximately 180 residues long). In Escherichia coli, these function in the assembly of the primosome [].
Probab=34.46 E-value=2.8e+02 Score=26.61 Aligned_cols=73 Identities=25% Similarity=0.284 Sum_probs=45.7
Q ss_pred HHHHHHHHHHhccCCCCCCcCCcc------hH----HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHH
Q psy11481 198 AHLKECFEILKRQVPPAQEEKKSS------NL----SILHSAIRYIQFLRRREREFEH--------EMERLAREKIHAQQ 259 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLsDn~KaS------KA----sILrKAiDYIQkLQreEreLE~--------EkERLRREneaLQQ 259 (561)
+.|-.-|+.|...+-...-..+.+ .. .=|.+=-||-+.|.....+.+. +..++++++.++..
T Consensus 73 EkL~~Q~~AL~r~l~t~~lr~~~~~~~~~~~~~~Lyq~L~~hqe~erRL~~mi~~~e~~l~~~~~~~~~~lq~ei~a~e~ 152 (173)
T PF07445_consen 73 EKLVAQIEALQRELATQSLRKKESKPSSRKPIHQLYQRLAQHQEYERRLLAMIQEREQQLEQAQSFEQQQLQQEILALEQ 152 (173)
T ss_pred HHHHHHHHHHHHHHHhccCccCCccccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHH
Confidence 356667888888877643211111 11 2244456677777766555443 55678888888888
Q ss_pred HHHHHHHHHhh
Q psy11481 260 RLALLKKELSA 270 (561)
Q Consensus 260 RLeELKqELsQ 270 (561)
||..|++-+..
T Consensus 153 RL~RCr~Ai~~ 163 (173)
T PF07445_consen 153 RLQRCRQAIEK 163 (173)
T ss_pred HHHHHHHHHHH
Confidence 88888877654
No 295
>PRK11239 hypothetical protein; Provisional
Probab=34.39 E-value=65 Score=32.77 Aligned_cols=28 Identities=29% Similarity=0.334 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+++.++.+|+.|+.+|++++++|..++
T Consensus 186 ~~Le~rv~~Le~eva~L~~~l~~l~~~~ 213 (215)
T PRK11239 186 GDLQARVEALEIEVAELKQRLDSLLAHL 213 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3467777888888888888888887765
No 296
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=34.39 E-value=1.9e+02 Score=26.48 Aligned_cols=39 Identities=13% Similarity=0.196 Sum_probs=29.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-|..|.++-.++..++..|-.|+.+|+...+.|+..|.+
T Consensus 16 ~l~~l~~el~~LK~~~~el~EEN~~L~iEN~~Lr~~l~~ 54 (110)
T PRK13169 16 NLGVLLKELGALKKQLAELLEENTALRLENDKLRERLEE 54 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466666667777777777778888888888888887775
No 297
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=34.24 E-value=69 Score=25.86 Aligned_cols=23 Identities=17% Similarity=0.179 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 248 ERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 248 ERLRREneaLQQRLeELKqELsQ 270 (561)
+.|+++.++|+.++..||..+++
T Consensus 2 ~aLrqQv~aL~~qv~~Lq~~fs~ 24 (46)
T PF09006_consen 2 NALRQQVEALQGQVQRLQAAFSQ 24 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555556666666666655554
No 298
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=34.23 E-value=91 Score=36.14 Aligned_cols=8 Identities=38% Similarity=0.559 Sum_probs=3.0
Q ss_pred CcceeEEe
Q psy11481 448 GMSTATII 455 (561)
Q Consensus 448 ~~~~~~~~ 455 (561)
|+....||
T Consensus 721 g~~~v~II 728 (771)
T TIGR01069 721 GYEVVLII 728 (771)
T ss_pred CCCEEEEE
Confidence 33333333
No 299
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=34.19 E-value=1e+02 Score=26.22 Aligned_cols=29 Identities=28% Similarity=0.387 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+.++++++++.|+.+..+++.+++.++.+
T Consensus 71 ~~~~l~~~l~~l~~~~~~~~~~~~~~~~~ 99 (104)
T PF13600_consen 71 ELKELEEELEALEDELAALQDEIQALEAQ 99 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444444443
No 300
>KOG3335|consensus
Probab=34.07 E-value=80 Score=31.47 Aligned_cols=31 Identities=23% Similarity=0.194 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 240 EREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 240 EreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+.+.|+++|+.+++.|..+++++++++.+
T Consensus 101 ee~~~~e~~elr~~~~~l~~~i~~~~~~~~~ 131 (181)
T KOG3335|consen 101 EEKRKQEIMELRLKVEKLENAIAELTKFFSQ 131 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344566777777777777777777777655
No 301
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=34.07 E-value=1.7e+02 Score=26.56 Aligned_cols=36 Identities=25% Similarity=0.305 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
-..|...+...+.++..|++++..+++|+++|..+.
T Consensus 86 ~~~l~~~e~sw~~qk~~le~e~~~~~~r~~dL~~QN 121 (132)
T PF07926_consen 86 KAELEESEASWEEQKEQLEKELSELEQRIEDLNEQN 121 (132)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555566677777777777778877777663
No 302
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=34.00 E-value=1.6e+02 Score=28.78 Aligned_cols=18 Identities=11% Similarity=0.108 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHhccCCCC
Q psy11481 197 RAHLKECFEILKRQVPPA 214 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPsL 214 (561)
+..+.+|...-+..|-..
T Consensus 65 ~~~~~~~~~~A~~Al~~G 82 (219)
T TIGR02977 65 EAQVADWQEKAELALSKG 82 (219)
T ss_pred HHHHHHHHHHHHHHHHCC
Confidence 344455555555555553
No 303
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=33.99 E-value=78 Score=34.52 Aligned_cols=46 Identities=24% Similarity=0.397 Sum_probs=29.8
Q ss_pred HhHHHHHHHHHHH----HhccCCCCCCcCCcchHHHHHHHHHHHHHHHHH
Q psy11481 194 KNRRAHLKECFEI----LKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRR 239 (561)
Q Consensus 194 RKRRd~LKdcFee----LRdlVPsLsDn~KaSKAsILrKAiDYIQkLQre 239 (561)
+.=|+.|++|... |++..|...+...--...=..+|++|+..|++.
T Consensus 66 ~~a~~yl~~W~~~~~GwL~r~~~~~~~e~~y~lT~~a~~Al~~l~~L~~~ 115 (478)
T PF11855_consen 66 KSARAYLRDWVRADKGWLRRRYDEGSDEEHYELTPAAEKALRFLERLEER 115 (478)
T ss_pred ccHHHHHHHHHHHhhhHHHhccCCCCCCeeEEeCHHHHHHHHHHHHcCCC
Confidence 3457888888887 777777644434434444556788888777654
No 304
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=33.94 E-value=1.1e+02 Score=31.46 Aligned_cols=67 Identities=27% Similarity=0.337 Sum_probs=29.1
Q ss_pred HHHHHHHHHHhccCCCCCCcCC--cchHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 198 AHLKECFEILKRQVPPAQEEKK--SSNLSILHSAIR--------YIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLsDn~K--aSKAsILrKAiD--------YIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
..|....+.|...+|.+..... ..++.-|+.+.+ .+..|+.+-.++..+++.++++...|+.+++++
T Consensus 166 ~~L~~~~~~l~~~~~~l~~~~~~L~~e~~~Lk~~~~e~~~~D~~eL~~lr~eL~~~~~~i~~~k~~l~el~~el~~l 242 (325)
T PF08317_consen 166 AKLDKQLEQLDELLPKLRERKAELEEELENLKQLVEEIESCDQEELEALRQELAEQKEEIEAKKKELAELQEELEEL 242 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555666666666432100 122333333222 345555555555555553333333333333333
No 305
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=33.67 E-value=1.8e+02 Score=27.72 Aligned_cols=20 Identities=20% Similarity=0.350 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy11481 242 EFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRL 261 (561)
.++...++|+.|+..|-+|.
T Consensus 162 ~~e~k~~~l~~En~~Lv~Rw 181 (194)
T PF08614_consen 162 MLEEKLRKLEEENRELVERW 181 (194)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333334444444443333
No 306
>PF05600 DUF773: Protein of unknown function (DUF773); InterPro: IPR008491 This family contains several eukaryotic sequences which are thought to be CDK5 activator-binding proteins, however, the function of this family is unknown.
Probab=33.63 E-value=83 Score=34.91 Aligned_cols=48 Identities=19% Similarity=0.261 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+..|+..-.|+..|-...++.....+++++....+.+|..+++.++.+
T Consensus 424 L~~Ik~SprYvdrl~~~L~qk~~~~~k~~~~~~~l~~kr~e~~~e~~~ 471 (507)
T PF05600_consen 424 LFMIKSSPRYVDRLVESLQQKLKQEEKLRRKREDLEEKRQEAQEEQQE 471 (507)
T ss_pred HHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444445555554444444444444444444444444444444433
No 307
>PF05812 Herpes_BLRF2: Herpesvirus BLRF2 protein; InterPro: IPR008642 This family consists of several herpes virus BLRF2 tegument proteins.; PDB: 2OA5_B 2H3R_D.
Probab=33.56 E-value=59 Score=30.41 Aligned_cols=22 Identities=23% Similarity=0.418 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLe 262 (561)
.+|.+|+.+|+-||.+|+++|.
T Consensus 6 EeLaaeL~kLqmENk~LKkkl~ 27 (118)
T PF05812_consen 6 EELAAELQKLQMENKALKKKLR 27 (118)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666655543
No 308
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=33.55 E-value=1.3e+02 Score=29.23 Aligned_cols=47 Identities=26% Similarity=0.220 Sum_probs=21.1
Q ss_pred cchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 220 SSNLSILHSAIRY---IQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 220 aSKAsILrKAiDY---IQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
.+....++|+++. ...|+.+..+++.+++.|+.+...|+.+++.+++
T Consensus 106 s~~~f~~rk~l~~e~~~~~l~~~i~~L~~e~~~L~~~~~~l~~~~e~~ek 155 (189)
T PF10211_consen 106 SSIAFGMRKALQAEQGKQELEEEIEELEEEKEELEKQVQELKNKCEQLEK 155 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555543 2234444444444444444444444444444433
No 309
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=33.54 E-value=15 Score=32.66 Aligned_cols=38 Identities=32% Similarity=0.321 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|.+|.++..++.+++..|+.+...|+.++++|+.++.+
T Consensus 1 ~~~Ld~~k~~laK~~~~LE~~l~~l~~el~~L~~~l~e 38 (118)
T PF08286_consen 1 IQELDNEKFRLAKELSDLESELESLQSELEELKEELEE 38 (118)
T ss_dssp --------------------------------------
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56788888888888999999999999999999988755
No 310
>PF08961 DUF1875: Domain of unknown function (DUF1875); InterPro: IPR015056 MIT can be found in the Nuclear receptor-binding factor 2, it has no known function. ; PDB: 2CRB_A.
Probab=33.54 E-value=14 Score=37.86 Aligned_cols=43 Identities=28% Similarity=0.169 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
-.+|..-.--|..|++-..-+..|.++|++|++.|+.+-+.|.
T Consensus 121 KT~IEEQ~T~I~dLrrlVe~L~aeNErLr~EnkqL~ae~arL~ 163 (243)
T PF08961_consen 121 KTRIEEQATKIADLRRLVEFLLAENERLRRENKQLKAENARLL 163 (243)
T ss_dssp -------------------------------------------
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3455555556888998888888999999999999988888884
No 311
>PF14282 FlxA: FlxA-like protein
Probab=33.52 E-value=1.5e+02 Score=26.35 Aligned_cols=14 Identities=14% Similarity=0.230 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHHH
Q psy11481 255 IHAQQRLALLKKEL 268 (561)
Q Consensus 255 eaLQQRLeELKqEL 268 (561)
..|+.+|..|+.+|
T Consensus 54 q~Lq~QI~~LqaQI 67 (106)
T PF14282_consen 54 QLLQAQIQQLQAQI 67 (106)
T ss_pred HHHHHHHHHHHHHH
Confidence 33334444444333
No 312
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=33.41 E-value=72 Score=29.58 Aligned_cols=15 Identities=13% Similarity=0.211 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHHHH
Q psy11481 253 EKIHAQQRLALLKKE 267 (561)
Q Consensus 253 EneaLQQRLeELKqE 267 (561)
++.+|++++++|+..
T Consensus 117 ~i~~l~~e~~~l~~k 131 (169)
T PF07106_consen 117 EIEELEEEIEELEEK 131 (169)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333333333333
No 313
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=33.28 E-value=1.8e+02 Score=27.84 Aligned_cols=35 Identities=23% Similarity=0.293 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.+..+++.|.+.|..++..+..+|.+++++|..
T Consensus 27 ~~~~I~~L~~e~~~ld~~i~~~~~~L~~~~~~L~~ 61 (188)
T PF10018_consen 27 NQARIQQLRAEIEELDEQIRDILKQLKEARKELRT 61 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444455555555555543
No 314
>KOG0971|consensus
Probab=33.22 E-value=1.9e+02 Score=35.27 Aligned_cols=20 Identities=35% Similarity=0.456 Sum_probs=13.0
Q ss_pred HhHHHHHHHHHHHHhccCCC
Q psy11481 194 KNRRAHLKECFEILKRQVPP 213 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPs 213 (561)
+.|=+.|-.-++-||..+-.
T Consensus 338 kEr~deletdlEILKaEmee 357 (1243)
T KOG0971|consen 338 KERVDELETDLEILKAEMEE 357 (1243)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 34677777777777765543
No 315
>TIGR01462 greA transcription elongation factor GreA. In the Chlamydias and some spirochetes, the region described by this model is found as the C-terminal region of a much larger protein.
Probab=33.22 E-value=1.5e+02 Score=27.33 Aligned_cols=30 Identities=23% Similarity=0.171 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy11481 249 RLAREKIHAQQRLALLKKELSARWEHIDFNT 279 (561)
Q Consensus 249 RLRREneaLQQRLeELKqELsQqw~hIDfSt 279 (561)
.-++++..+.+|+.+|+++|.. +.-+|...
T Consensus 46 aak~~~~~~e~ri~~L~~~L~~-a~iv~~~~ 75 (151)
T TIGR01462 46 AAKEEQGFNEGRIAELEDLLAN-AQVIDDSK 75 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh-CcccCccc
Confidence 3355666788899999999887 55445443
No 316
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=33.12 E-value=1.3e+02 Score=32.13 Aligned_cols=26 Identities=12% Similarity=0.284 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy11481 246 EMERLAREKIHAQQRLALLKKELSAR 271 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqELsQq 271 (561)
-+++.|+|+.+|+|-++-.+..|...
T Consensus 118 ALKEARkEIkQLkQvieTmrssL~ek 143 (305)
T PF15290_consen 118 ALKEARKEIKQLKQVIETMRSSLAEK 143 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhchh
Confidence 34678899999999999999998763
No 317
>PRK07737 fliD flagellar capping protein; Validated
Probab=32.99 E-value=1.6e+02 Score=32.45 Aligned_cols=79 Identities=9% Similarity=0.116 Sum_probs=41.3
Q ss_pred hHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 195 NRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLR-------------RREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQ-------------reEreLE~EkERLRREneaLQQRL 261 (561)
+=++.|++-++.+.++.-.-.......+-.|..+--+++..+- .++..+..++.++.+++..++.||
T Consensus 385 kl~~Al~~n~~~V~~lF~~~~~~~~~~~~Gia~~l~~~l~~~~~~~~~~~g~g~~~~~~~~l~~~i~~l~~~i~~~~~rl 464 (501)
T PRK07737 385 KLRQKIKENPDAVYQLFNSGGSSSNYNEKGIARRLRDTLKETIKSIEQKAGNTTMTNQQFAIGKDLNQIETQIDRFQDRL 464 (501)
T ss_pred HHHHHHHHCHHHHHHHhcCCCCCcccccCcHHHHHHHHHHHHHhhhhhhcCCccccchhHHHHHHHHHHHHHHHHHHHHH
Confidence 4455566666666666533111011122334444444333322 233456667777777777777777
Q ss_pred HHHHHHHhhhhc
Q psy11481 262 ALLKKELSARWE 273 (561)
Q Consensus 262 eELKqELsQqw~ 273 (561)
+.++..|-+||-
T Consensus 465 ~~~e~ry~~qf~ 476 (501)
T PRK07737 465 KQIEDRYYKKFS 476 (501)
T ss_pred HHHHHHHHHHHH
Confidence 777777655544
No 318
>cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins. Chibby(Cby) is a well-conserved nuclear protein that functions as part of the Wnt/beta-catenin signaling pathway. Specifically, Cby binds directly to beta-catenin by interacting with its central region, which harbors armadillo repeats. Cby-beta-catenin interactions may also involve 14-3-3 proteins. By competing with other binding partners of beta-catenin, the Tcf/Lef transcription factors, Cby inhibits transcriptional activation. Cby has been shown to play a role in adipocyte differentiation. The C-terminal region of Cby appears to contain an alpha-helical coiled-coil motif.
Probab=32.95 E-value=90 Score=28.70 Aligned_cols=34 Identities=21% Similarity=0.138 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLAL 263 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeE 263 (561)
.+....|+++++.+++|-.-|+-+++.|-..|++
T Consensus 71 ~~e~~rlkkk~~~LeEENNlLklKievLLDMLte 104 (108)
T cd07429 71 GREVLRLKKKNQQLEEENNLLKLKIEVLLDMLAE 104 (108)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677777776666666666666555555554
No 319
>smart00340 HALZ homeobox associated leucin zipper.
Probab=32.94 E-value=94 Score=24.94 Aligned_cols=17 Identities=24% Similarity=0.321 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy11481 249 RLAREKIHAQQRLALLK 265 (561)
Q Consensus 249 RLRREneaLQQRLeELK 265 (561)
.|..|++.|++++++|+
T Consensus 16 ~LteeNrRL~ke~~eLr 32 (44)
T smart00340 16 SLTEENRRLQKEVQELR 32 (44)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44445555555555555
No 320
>cd01109 HTH_YyaN Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB. Putative helix-turn-helix (HTH) MerR-like transcription regulators of Bacillus subtilis, YyaN and YraB, and related proteins; N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=32.74 E-value=2.3e+02 Score=24.62 Aligned_cols=24 Identities=25% Similarity=0.314 Sum_probs=11.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 245 HEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqEL 268 (561)
.++++|++++..|++.++.|...+
T Consensus 86 ~~~~~l~~~i~~l~~~~~~l~~~~ 109 (113)
T cd01109 86 EHREELEEQIAELQETLAYLDYKI 109 (113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444555555555555444
No 321
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=32.74 E-value=1.7e+02 Score=27.88 Aligned_cols=76 Identities=13% Similarity=0.170 Sum_probs=34.4
Q ss_pred hHHHHhH---HHHHHHHHHHHhccCCCCCCcCCcchHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNR---RAHLKECFEILKRQVPPAQEEKKSSNLSILHSA--IRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 190 NaLERKR---Rd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKA--iDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
..+|+++ +..+..+......++=... ....+..+.++. -+.+..|++.-..+....++|+.....|+.+|.++
T Consensus 54 ~~le~~~~~~~~~~~~~~~~A~~Al~~g~--edLAr~al~~k~~~e~~~~~l~~~~~~~~~~~~~l~~~l~~l~~kl~e~ 131 (221)
T PF04012_consen 54 KRLERKLDEAEEEAEKWEKQAELALAAGR--EDLAREALQRKADLEEQAERLEQQLDQAEAQVEKLKEQLEELEAKLEEL 131 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444443 3344455555555555432 223333333221 22344444444444455555555555555555555
Q ss_pred HHH
Q psy11481 265 KKE 267 (561)
Q Consensus 265 KqE 267 (561)
+.+
T Consensus 132 k~k 134 (221)
T PF04012_consen 132 KSK 134 (221)
T ss_pred HHH
Confidence 544
No 322
>smart00434 TOP4c DNA Topoisomerase IV. Bacterial DNA topoisomerase IV, GyrA, ParC
Probab=32.73 E-value=2.3e+02 Score=30.99 Aligned_cols=41 Identities=24% Similarity=0.247 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 229 AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-.+||-.++=.. =-..|+++|++|..+|++++++|++-+..
T Consensus 400 q~~~IL~m~L~~-LT~~e~~kL~~e~~~l~~ei~~l~~~l~~ 440 (445)
T smart00434 400 QADAILDMRLRR-LTKLEVEKLEKELKELEKEIEDLEKILAS 440 (445)
T ss_pred HHHHHHHhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 456665554332 12356788889999999999999887754
No 323
>PF05597 Phasin: Poly(hydroxyalcanoate) granule associated protein (phasin); InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.
Probab=32.71 E-value=2.8e+02 Score=26.07 Aligned_cols=18 Identities=39% Similarity=0.431 Sum_probs=13.9
Q ss_pred HHHHhHHHHHHHHHHHHh
Q psy11481 191 KLEKNRRAHLKECFEILK 208 (561)
Q Consensus 191 aLERKRRd~LKdcFeeLR 208 (561)
.+|++.|..+.+..+.++
T Consensus 50 ~~e~~~~~~~~e~~~~~~ 67 (132)
T PF05597_consen 50 KLEKKTRKKAEEQVEEAR 67 (132)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 467778888888777777
No 324
>PRK14011 prefoldin subunit alpha; Provisional
Probab=32.70 E-value=1.3e+02 Score=28.47 Aligned_cols=33 Identities=18% Similarity=0.284 Sum_probs=14.6
Q ss_pred HHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQ----FLRRREREFEHEMERLAREKIHAQ 258 (561)
Q Consensus 226 LrKAiDYIQ----kLQreEreLE~EkERLRREneaLQ 258 (561)
+.+|++|++ .|++..+++...++++.++.++++
T Consensus 86 ~~eA~~~~~~ri~~l~~~~~~l~~~i~~~~~~~~~l~ 122 (144)
T PRK14011 86 VSEVIEDFKKSVEELDKTKKEGNKKIEELNKEITKLR 122 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666533 344444444444444444444444
No 325
>smart00340 HALZ homeobox associated leucin zipper.
Probab=32.60 E-value=67 Score=25.73 Aligned_cols=16 Identities=25% Similarity=0.488 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy11481 235 FLRRREREFEHEMERL 250 (561)
Q Consensus 235 kLQreEreLE~EkERL 250 (561)
.|.+++++++.|+++|
T Consensus 16 ~LteeNrRL~ke~~eL 31 (44)
T smart00340 16 SLTEENRRLQKEVQEL 31 (44)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4445555444444444
No 326
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=32.59 E-value=1.7e+02 Score=25.46 Aligned_cols=30 Identities=23% Similarity=0.337 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+...++.+|+.+...|+.++..++..+..
T Consensus 77 ~~k~~ei~~l~~~l~~l~~~~~k~e~~l~~ 106 (126)
T PF13863_consen 77 EEKEAEIKKLKAELEELKSEISKLEEKLEE 106 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344456666666666666666666666654
No 327
>PRK09413 IS2 repressor TnpA; Reviewed
Probab=32.51 E-value=97 Score=27.42 Aligned_cols=28 Identities=18% Similarity=0.138 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+++.|+++|++++..|+.+++-||+.+
T Consensus 74 ~~~~~ei~~L~~el~~L~~E~diLKKa~ 101 (121)
T PRK09413 74 AAAMKQIKELQRLLGKKTMENELLKEAV 101 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677777777777777777777665
No 328
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=32.49 E-value=2.5e+02 Score=25.98 Aligned_cols=29 Identities=14% Similarity=0.263 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLA 251 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLR 251 (561)
+.++.+=-.-|+.++.+...++.++.+|.
T Consensus 15 ~~~ve~L~s~lr~~E~E~~~l~~el~~l~ 43 (120)
T PF12325_consen 15 VQLVERLQSQLRRLEGELASLQEELARLE 43 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444433334433333333333333333
No 329
>PRK00295 hypothetical protein; Provisional
Probab=32.49 E-value=1.7e+02 Score=24.40 Aligned_cols=35 Identities=14% Similarity=0.054 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++....++-..+-+.++++..|++++..|..++..
T Consensus 17 qE~tie~Ln~~v~~Qq~~I~~L~~ql~~L~~rl~~ 51 (68)
T PRK00295 17 QDDTIQALNDVLVEQQRVIERLQLQMAALIKRQEE 51 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333444455555666666666666666655543
No 330
>PF05929 Phage_GPO: Phage capsid scaffolding protein (GPO) serine peptidase; InterPro: IPR009228 This entry is represented by Bacteriophage P2, GpO. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacteriophage capsid scaffolding protein (GpO) and some related bacterial sequences. GpO is thought to function in both the assembly of proheads and the cleavage of GpN [].; GO: 0019069 viral capsid assembly
Probab=32.46 E-value=1.9e+02 Score=30.31 Aligned_cols=79 Identities=19% Similarity=0.244 Sum_probs=45.6
Q ss_pred HHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHH
Q psy11481 191 KLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRR------------EREFEHEMERLAREKIHAQ 258 (561)
Q Consensus 191 aLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQre------------EreLE~EkERLRREneaLQ 258 (561)
..|-.+....+..|..++.+.-.- +........-|..|++-|..-..+ ++++....+.|..++.+|.
T Consensus 163 e~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~ave~ia~~~~~~~~~~~~~ls~~~~~~~~~~~~l~~~~~~~~ 241 (276)
T PF05929_consen 163 EFEEEQEEEGKSLFSKVKALFKKK-EASDDEQFADLQQAVEAIAEQQQELEEAFEEQLSEQETQVAELKQELKEQHEALT 241 (276)
T ss_pred hhhhhcccchhhhhHHhhhhhcCC-cCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHH
Confidence 334444455677777777776552 222233455666666655433332 2222233445555668888
Q ss_pred HHHHHHHHHHhh
Q psy11481 259 QRLALLKKELSA 270 (561)
Q Consensus 259 QRLeELKqELsQ 270 (561)
.++++|+..|+.
T Consensus 242 ~~f~~L~~~L~~ 253 (276)
T PF05929_consen 242 EDFAALKEKLSS 253 (276)
T ss_pred HHHHHHHHHhhC
Confidence 888999888876
No 331
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=32.46 E-value=1e+02 Score=35.79 Aligned_cols=7 Identities=57% Similarity=0.814 Sum_probs=3.9
Q ss_pred CCCCCcc
Q psy11481 41 PAAHPLS 47 (561)
Q Consensus 41 ~~~~~~~ 47 (561)
-+.|||-
T Consensus 305 ~~rHPll 311 (782)
T PRK00409 305 QARHPLL 311 (782)
T ss_pred CcCCcee
Confidence 3556663
No 332
>KOG4005|consensus
Probab=32.20 E-value=1.1e+02 Score=32.11 Aligned_cols=78 Identities=23% Similarity=0.227 Sum_probs=58.5
Q ss_pred CCCchhhhhHH-HHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 182 ISGIREVHNKL-EKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQR 260 (561)
Q Consensus 182 ~SeRRaTHNaL-ERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQR 260 (561)
+..||...|-+ -+.-||.=|.+.+++-..|-.|.+. -.||+- -.+.|++.++.+..+-.+|..+.+.|++.
T Consensus 69 K~~RrKLKNRVAAQtaRDrKKaRm~eme~~i~dL~ee-----n~~L~~---en~~Lr~~n~~L~~~n~el~~~le~~~~~ 140 (292)
T KOG4005|consen 69 KVQRRKLKNRVAAQTARDRKKARMEEMEYEIKDLTEE-----NEILQN---ENDSLRAINESLLAKNHELDSELELLRQE 140 (292)
T ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHH-----HHHHHH---HHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 34567777654 4556888899999988888776442 234443 34567888888888889999999999999
Q ss_pred HHHHHHH
Q psy11481 261 LALLKKE 267 (561)
Q Consensus 261 LeELKqE 267 (561)
|.+|+++
T Consensus 141 l~~~~~~ 147 (292)
T KOG4005|consen 141 LAELKQQ 147 (292)
T ss_pred HHhhHHH
Confidence 9999876
No 333
>PF12126 DUF3583: Protein of unknown function (DUF3583); InterPro: IPR021978 This domain is found in eukaryotes, and is typically between 302 and 338 amino acids in length. It is found in association with PF00097 from PFAM and PF00643 from PFAM. Most members are promyelocytic leukemia proteins, and this family lies towards the C terminus.
Probab=32.10 E-value=1.4e+02 Score=32.06 Aligned_cols=67 Identities=16% Similarity=0.225 Sum_probs=40.5
Q ss_pred HHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh
Q psy11481 203 CFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEME-RLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 203 cFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkE-RLRREneaLQQRLeELKqELs 269 (561)
.-.+++..|-.|....+-..--|=.+...+|...+.+|++|.+.++ +-+|+.+++-.+|..|..-|.
T Consensus 18 ahaqm~sav~qL~~~r~~teelIr~rVrq~V~hVqaqEreLLe~v~~rYqR~y~ema~~L~~LeavLq 85 (324)
T PF12126_consen 18 AHAQMRSAVSQLGRARADTEELIRARVRQVVAHVQAQERELLEAVEARYQRDYEEMAGQLGRLEAVLQ 85 (324)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 3334444444432222333345556677788888889888887764 556667777666666665554
No 334
>PRK09343 prefoldin subunit beta; Provisional
Probab=32.09 E-value=2.3e+02 Score=25.62 Aligned_cols=77 Identities=14% Similarity=0.112 Sum_probs=44.5
Q ss_pred HhHHHHHHHHHHHHhccCCCCC----CcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQ----EEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLs----Dn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
.++=..-.+-|+.|.+--+-.. .-.+.+|..+...--+=|.+++.+.+.++.+.+.|+++...++.+|.++-+.+.
T Consensus 37 ~~e~~~~~~EL~~L~~d~~VYk~VG~vlv~qd~~e~~~~l~~r~E~ie~~ik~lekq~~~l~~~l~e~q~~l~~ll~~~~ 116 (121)
T PRK09343 37 LREINKALEELEKLPDDTPIYKIVGNLLVKVDKTKVEKELKERKELLELRSRTLEKQEKKLREKLKELQAKINEMLSKYY 116 (121)
T ss_pred HHHHHHHHHHHHcCCCcchhHHHhhHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3344444555555544332211 112344444443333334444477788888888888888888888888877654
Q ss_pred h
Q psy11481 270 A 270 (561)
Q Consensus 270 Q 270 (561)
.
T Consensus 117 ~ 117 (121)
T PRK09343 117 P 117 (121)
T ss_pred C
Confidence 3
No 335
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=32.09 E-value=1e+02 Score=32.26 Aligned_cols=26 Identities=19% Similarity=0.357 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+++.+.+++...+++|+.+++..|.+
T Consensus 214 EeL~~~Eke~~e~~~~i~e~~~rl~~ 239 (269)
T PF05278_consen 214 EELKQKEKEVKEIKERITEMKGRLGE 239 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444445555555555555555544
No 336
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=32.07 E-value=2.3e+02 Score=31.70 Aligned_cols=27 Identities=22% Similarity=0.378 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqE 267 (561)
.++++++++|.++.++|++++++|+++
T Consensus 96 ~~~~~~i~~l~~~~~~L~~~~~~l~~~ 122 (646)
T PRK05771 96 EKIEKEIKELEEEISELENEIKELEQE 122 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555555544
No 337
>PF12999 PRKCSH-like: Glucosidase II beta subunit-like
Probab=32.03 E-value=2.3e+02 Score=28.05 Aligned_cols=16 Identities=38% Similarity=0.356 Sum_probs=7.5
Q ss_pred hHHHHhHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFE 205 (561)
Q Consensus 190 NaLERKRRd~LKdcFe 205 (561)
.++-+++|..+....+
T Consensus 120 ~e~~~~~~~~~~~~~~ 135 (176)
T PF12999_consen 120 AELGKEYREELEEEEE 135 (176)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4444555555444433
No 338
>TIGR01834 PHA_synth_III_E poly(R)-hydroxyalkanoic acid synthase, class III, PhaE subunit. This model represents the PhaE subunit of the heterodimeric class (class III) of polymerase for poly(R)-hydroxyalkanoic acids (PHAs), carbon and energy storage polymers of many bacteria. The most common PHA is polyhydroxybutyrate but about 150 different constituent hydroxyalkanoic acids (HAs) have been identified in various species. This model must be designated subfamily to indicate the heterogeneity of PHAs.
Probab=32.00 E-value=1.9e+02 Score=30.99 Aligned_cols=19 Identities=32% Similarity=0.426 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q psy11481 251 AREKIHAQQRLALLKKELS 269 (561)
Q Consensus 251 RREneaLQQRLeELKqELs 269 (561)
|+|++++.+||.+|++++-
T Consensus 288 RsElDe~~krL~ELrR~vr 306 (320)
T TIGR01834 288 RSELDEAHQRIQQLRREVK 306 (320)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4555566666666665543
No 339
>PF05615 THOC7: Tho complex subunit 7; InterPro: IPR008501 The Tho complex (THOC) is involved in transcription elongation and mRNA export from the nucleus []. This entry represents the subunit THOC7, which is found in higher eukaryotes, and the non-homologous subunit Mft1p found in yeast. The funtions of these subunits are unknown, and it is not known if these subunits are functionally equivalent.
Probab=31.98 E-value=2.7e+02 Score=25.08 Aligned_cols=51 Identities=14% Similarity=0.123 Sum_probs=36.6
Q ss_pred cchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 220 SSNLSILHSAIRY-IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 220 aSKAsILrKAiDY-IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|...+-.+.+- ...-++...+.+.+++..+.+++.|+.+|++.|..+..
T Consensus 62 ~~k~q~~~~~n~~e~e~Y~~~~~~i~~~i~~~k~~ie~lk~~L~~ak~~r~~ 113 (139)
T PF05615_consen 62 ILKSQLILEMNKRERENYEQLNEEIEQEIEQAKKEIEELKEELEEAKRVRQN 113 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4454555444332 45566777778889999999999999999988877654
No 340
>PF11382 DUF3186: Protein of unknown function (DUF3186); InterPro: IPR021522 This bacterial family of proteins has no known function.
Probab=31.92 E-value=98 Score=31.92 Aligned_cols=35 Identities=20% Similarity=0.125 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+..|+++..++++|.++|+.|+++|+.+++...+-
T Consensus 34 ~~~l~~~~~~lr~e~~~l~~~~~~~~~~~~~~d~f 68 (308)
T PF11382_consen 34 IDSLEDQFDSLREENDELRAELDALQAQLNAADQF 68 (308)
T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455555555555555555555555555554443
No 341
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=31.81 E-value=2.9e+02 Score=23.07 Aligned_cols=37 Identities=19% Similarity=0.364 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+...+..+...+..++.-..++..|.++++.|+.++.
T Consensus 20 L~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~e 56 (61)
T PF08826_consen 20 LTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEME 56 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555555555566666666666666666666654
No 342
>PF14235 DUF4337: Domain of unknown function (DUF4337)
Probab=31.42 E-value=1.9e+02 Score=27.65 Aligned_cols=32 Identities=13% Similarity=0.209 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
++++..+|+.|+++.++|.++|+++-++.+++
T Consensus 71 ~~~~i~~Y~~~~~~~~~e~~~l~~~A~~~e~~ 102 (157)
T PF14235_consen 71 YQKKIARYKKEKARYKSEAEELEAKAKEAEAE 102 (157)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44455555556666666655555555555444
No 343
>PRK00736 hypothetical protein; Provisional
Probab=31.41 E-value=1.9e+02 Score=24.04 Aligned_cols=35 Identities=9% Similarity=0.178 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++....+|-..+-+.++++..|++++..|..++..
T Consensus 17 qe~tie~Ln~~v~~Qq~~i~~L~~ql~~L~~rl~~ 51 (68)
T PRK00736 17 QEKTIEELSDQLAEQWKTVEQMRKKLDALTERFLS 51 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333444455555566666666666666655543
No 344
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=31.32 E-value=66 Score=36.88 Aligned_cols=32 Identities=34% Similarity=0.489 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+++++++++|+.|+..|++.|+.+..++..
T Consensus 80 ~r~~L~~everLraei~~l~~~I~~~e~e~~~ 111 (632)
T PF14817_consen 80 RRRELEKEVERLRAEIQELDKEIESREREVSR 111 (632)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556667777777777777777776666654
No 345
>PF14303 NAM-associated: No apical meristem-associated C-terminal domain
Probab=31.28 E-value=3.7e+02 Score=23.97 Aligned_cols=43 Identities=14% Similarity=0.078 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HhhhhcccccccCCCCCcccc
Q psy11481 246 EMERLAREKIHAQQRLALLKKE-LSARWEHIDFNTLIPDNMEVD 288 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqE-LsQqw~hIDfStLlPd~lEID 288 (561)
++..+.++..+++++..+++.. ....-|.+|+|.++|+..+..
T Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~e~~IM~~D~s~md~~~r~y~ 147 (154)
T PF14303_consen 104 EIAEQEKEKNEIEKEKAEAEERKEENKIMSKDTSGMDEEQREYY 147 (154)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCHHHHHHH
Confidence 3333334444444433333322 223368899999966554433
No 346
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=31.17 E-value=2e+02 Score=28.34 Aligned_cols=47 Identities=19% Similarity=0.272 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.||..-...|+.|+.+..+++.++++|+.|+..|++-...-.+.|.+
T Consensus 5 rvlSar~~ki~~L~n~l~elq~~l~~l~~ENk~Lk~lq~Rq~kAL~k 51 (194)
T PF15619_consen 5 RVLSARLHKIKELQNELAELQRKLQELRKENKTLKQLQKRQEKALQK 51 (194)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666667899999999999999999999999998665554444543
No 347
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=31.12 E-value=1e+02 Score=33.63 Aligned_cols=22 Identities=23% Similarity=0.437 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREn 254 (561)
|.++++++++++.+++.+.+|.
T Consensus 136 i~~~~EEn~~lqlqL~~l~~e~ 157 (401)
T PF06785_consen 136 IRHLREENQCLQLQLDALQQEC 157 (401)
T ss_pred HHHHHHHHHHHHHhHHHHHHHH
Confidence 4445555555554444444433
No 348
>PRK04406 hypothetical protein; Provisional
Probab=31.09 E-value=1.7e+02 Score=25.00 Aligned_cols=28 Identities=7% Similarity=0.141 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+|-..+-+.++++..|+++|..|..++
T Consensus 28 e~LN~~v~~Qq~~I~~L~~ql~~L~~rl 55 (75)
T PRK04406 28 EELNDALSQQQLLITKMQDQMKYVVGKV 55 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444445555555555555555554
No 349
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=30.90 E-value=87 Score=27.35 Aligned_cols=22 Identities=18% Similarity=0.348 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q psy11481 249 RLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 249 RLRREneaLQQRLeELKqELsQ 270 (561)
.|++|+..|+.+|..|+.+|.+
T Consensus 4 ei~eEn~~Lk~eiqkle~ELq~ 25 (76)
T PF07334_consen 4 EIQEENARLKEEIQKLEAELQQ 25 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666655555554
No 350
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=30.77 E-value=1.4e+02 Score=27.70 Aligned_cols=36 Identities=22% Similarity=0.205 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|.+|+-+.+.++.-++.|.+.+..|...|.+.+..+
T Consensus 34 Ia~LEGE~r~~e~l~~dL~rrIkMLE~aLkqER~k~ 69 (134)
T PF08232_consen 34 IAFLEGERRGQENLKKDLKRRIKMLEYALKQERAKY 69 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 556666666666666666666666655555555444
No 351
>COG2919 Septum formation initiator [Cell division and chromosome partitioning]
Probab=30.75 E-value=2.6e+02 Score=25.23 Aligned_cols=47 Identities=9% Similarity=0.060 Sum_probs=32.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..+.-.+.+...+-.....+..+++.+++|++.|.++-..|+++...
T Consensus 36 ~~f~~~~~~g~~~~~~~~~l~~qi~~~~~e~~~L~~~~~~l~~ei~~ 82 (117)
T COG2919 36 ALFQYLAWFGKNGAADVLQLQRQIAAQQAELEKLSARNTALEAEIKD 82 (117)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555666666666777777777777777777777777777654
No 352
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=30.69 E-value=4.2e+02 Score=28.17 Aligned_cols=11 Identities=18% Similarity=0.012 Sum_probs=4.4
Q ss_pred HHHHHHHHHHh
Q psy11481 198 AHLKECFEILK 208 (561)
Q Consensus 198 d~LKdcFeeLR 208 (561)
..+...++.|+
T Consensus 221 ~~~~a~~~~l~ 231 (498)
T TIGR03007 221 NEAIAQRDALK 231 (498)
T ss_pred HHHHHHHHHHH
Confidence 33344444443
No 353
>KOG0995|consensus
Probab=30.66 E-value=1.4e+02 Score=34.30 Aligned_cols=23 Identities=17% Similarity=0.445 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 228 SAIRYIQFLRRREREFEHEMERL 250 (561)
Q Consensus 228 KAiDYIQkLQreEreLE~EkERL 250 (561)
|+..|...++.+.+.++..+++|
T Consensus 277 K~~~y~~~~~~k~~~~~~~l~~l 299 (581)
T KOG0995|consen 277 KFQAYVSQMKSKKQHMEKKLEML 299 (581)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHH
Confidence 34455555555555444433333
No 354
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=30.58 E-value=80 Score=31.85 Aligned_cols=50 Identities=18% Similarity=0.250 Sum_probs=25.7
Q ss_pred CCcchHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 218 KKSSNLSILHSAIRYIQFL-------RRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 218 ~KaSKAsILrKAiDYIQkL-------QreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+|..=+.+|+..-.=|..| +.+..+.+..+.++..|...|+.++++++.+
T Consensus 47 Er~~h~eeLrqI~~DIn~lE~iIkqa~~er~~~~~~i~r~~eey~~Lk~~in~~R~e 103 (230)
T PF10146_consen 47 ERMAHVEELRQINQDINTLENIIKQAESERNKRQEKIQRLYEEYKPLKDEINELRKE 103 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4666667776554444444 4444444444445555555555555555544
No 355
>PHA02562 46 endonuclease subunit; Provisional
Probab=30.49 E-value=5.5e+02 Score=27.47 Aligned_cols=6 Identities=17% Similarity=0.373 Sum_probs=3.3
Q ss_pred cCCCCC
Q psy11481 210 QVPPAQ 215 (561)
Q Consensus 210 lVPsLs 215 (561)
..|.|.
T Consensus 286 ~Cp~C~ 291 (562)
T PHA02562 286 VCPTCT 291 (562)
T ss_pred CCCCCC
Confidence 556654
No 356
>KOG2010|consensus
Probab=30.43 E-value=1.4e+02 Score=32.63 Aligned_cols=35 Identities=26% Similarity=0.239 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-.++.+++-.|.+++++-+..|+.+.++||++|.|
T Consensus 166 S~Re~eek~kE~er~Kh~~s~Lq~~~~elKe~l~Q 200 (405)
T KOG2010|consen 166 SYRENEEKSKELERQKHMCSVLQHKMEELKEGLRQ 200 (405)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445556678899999999999999999999988
No 357
>PF12017 Tnp_P_element: Transposase protein; InterPro: IPR021896 Protein in this family are transposases found in insects. This region is about 230 amino acids in length and is found associated with PF05485 from PFAM.
Probab=30.38 E-value=1.2e+02 Score=30.68 Aligned_cols=30 Identities=23% Similarity=0.422 Sum_probs=16.4
Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 225 ILHSAI-RYIQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 225 ILrKAi-DYIQkLQreEreLE~EkERLRREn 254 (561)
+|.++. ..++.|+.+.++++.++.+|+++.
T Consensus 11 ~ln~~~~~e~~~Lk~kir~le~~l~~Lk~~l 41 (236)
T PF12017_consen 11 ILNRTLKIENKKLKKKIRRLEKELKKLKQKL 41 (236)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444432 346666666666666666655544
No 358
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=30.33 E-value=1e+02 Score=29.23 Aligned_cols=35 Identities=20% Similarity=0.285 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|++.-.++..++++++++.+.+++++++++..+..
T Consensus 44 lE~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 78 (151)
T PF14584_consen 44 LEDLLNELFDQIDELKEELEELEKRIEELEEKLRN 78 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44445555667777777788888888888877765
No 359
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=30.23 E-value=3.3e+02 Score=27.31 Aligned_cols=72 Identities=21% Similarity=0.312 Sum_probs=43.1
Q ss_pred hHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAI-RYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 190 NaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAi-DYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
|.+|+.-. .|-.+|+.++..+-.+.. .=..|++.+ +|...+++.+++|+.-+.....+.+.....+++++++
T Consensus 86 ~s~E~sfs-dl~~ryek~K~vi~~~k~-----NEE~Lkk~~~ey~~~l~~~eqry~aLK~hAeekL~~ANeei~~v~~~ 158 (207)
T PF05010_consen 86 NSLEKSFS-DLHKRYEKQKEVIEGYKK-----NEETLKKCIEEYEERLKKEEQRYQALKAHAEEKLEKANEEIAQVRSK 158 (207)
T ss_pred HHHHhhHH-HHHHHHHHHHHHHHHHHH-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55666544 455577888887766532 233455544 8999999999888855444444444444444444433
No 360
>PF12795 MscS_porin: Mechanosensitive ion channel porin domain
Probab=30.20 E-value=2e+02 Score=28.13 Aligned_cols=51 Identities=18% Similarity=0.414 Sum_probs=36.2
Q ss_pred cCCcchHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 217 EKKSSNLSILHSAIRYIQFLRR---REREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 217 n~KaSKAsILrKAiDYIQkLQr---eEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
..+..-+..|..|.+++...++ +..+|+..+++.=++...++++|+.++..
T Consensus 14 ~~~~~~i~~l~~al~~L~~~~~~~~~~~~~~~~i~~aP~~~~~l~~~l~~l~~~ 67 (240)
T PF12795_consen 14 PEQKALIQDLQQALSFLDEIKKQKKRAAEYQKQIDQAPKEIRELQKELEALKSQ 67 (240)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhcc
Confidence 3455668889999998665554 44556677777777777777777777655
No 361
>PF04380 BMFP: Membrane fusogenic activity; InterPro: IPR007475 BMFP consists of two structural domains, a coiled-coil C-terminal domain via which the protein self-associates as a trimer, and an N-terminal domain disordered at neutral pH but adopting an amphipathic alpha-helical structure in the presence of phospholipid vesicles, high ionic strength, acidic pH or SDS. BMFP interacts with phospholipid vesicles though the predicted amphipathic alpha-helix induced in the N-terminal half of the protein and promotes aggregation and fusion of vesicles in vitro.
Probab=30.08 E-value=82 Score=26.75 Aligned_cols=42 Identities=24% Similarity=0.287 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 225 ILHSAIRYIQFLRRRERE-FEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 225 ILrKAiDYIQkLQreEre-LE~EkERLRREneaLQQRLeELKq 266 (561)
++++++.-...-.++|-+ +.+.+.+++.++++|..||++|+.
T Consensus 36 ~l~~~l~kldlVtREEFd~q~~~L~~~r~kl~~LEarl~~LE~ 78 (79)
T PF04380_consen 36 RLQSALSKLDLVTREEFDAQKAVLARTREKLEALEARLAALEA 78 (79)
T ss_pred HHHHHHHHCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 334443333333333333 234445666666666666666653
No 362
>cd01106 HTH_TipAL-Mta Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA. Helix-turn-helix (HTH) TipAL, Mta, and SkgA transcription regulators, and related proteins, N-terminal domain. TipAL regulates resistance to and activation by numerous cyclic thiopeptide antibiotics, such as thiostrepton. Mta is a global transcriptional regulator; the N-terminal DNA-binding domain of Mta interacts directly with the promoters of mta, bmr, blt, and ydfK, and induces transcription of these multidrug-efflux transport genes. SkgA has been shown to control stationary-phase expression of catalase-peroxidase in Caulobacter crescentus. These proteins are comprised of distinct domains that harbor an N-terminal active (DNA-binding) site and a regulatory (effector-binding) site. The conserved N-terminal domain of these transcription regulators contains winged HTH motifs that mediate DNA binding. These proteins share the N-terminal DNA binding domain with other transcrip
Probab=30.05 E-value=1.5e+02 Score=25.26 Aligned_cols=10 Identities=20% Similarity=0.265 Sum_probs=4.6
Q ss_pred hhhhHHHHhH
Q psy11481 187 EVHNKLEKNR 196 (561)
Q Consensus 187 aTHNaLERKR 196 (561)
=....+++-|
T Consensus 39 y~~~di~~l~ 48 (103)
T cd01106 39 YTEEDLERLQ 48 (103)
T ss_pred eCHHHHHHHH
Confidence 3345555443
No 363
>PF04999 FtsL: Cell division protein FtsL; InterPro: IPR007082 In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein IPR007060 from INTERPRO [], however this interaction may be indirect [].; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=29.94 E-value=3.4e+02 Score=22.93 Aligned_cols=34 Identities=21% Similarity=0.267 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
+..+..+..+++.|.++|+.|+..|+-+++.|..
T Consensus 37 ~~~~~~~l~~l~~~~~~l~~e~~~L~lE~~~l~~ 70 (97)
T PF04999_consen 37 SRQLFYELQQLEKEIDQLQEENERLRLEIATLSS 70 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 4455555666666777777777777666666553
No 364
>cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins. Chibby(Cby) is a well-conserved nuclear protein that functions as part of the Wnt/beta-catenin signaling pathway. Specifically, Cby binds directly to beta-catenin by interacting with its central region, which harbors armadillo repeats. Cby-beta-catenin interactions may also involve 14-3-3 proteins. By competing with other binding partners of beta-catenin, the Tcf/Lef transcription factors, Cby inhibits transcriptional activation. Cby has been shown to play a role in adipocyte differentiation. The C-terminal region of Cby appears to contain an alpha-helical coiled-coil motif.
Probab=29.91 E-value=1.2e+02 Score=27.98 Aligned_cols=34 Identities=15% Similarity=0.218 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.++..++.++..+|++|+..|+-+++-|--.|.+
T Consensus 71 ~~e~~rlkkk~~~LeEENNlLklKievLLDMLte 104 (108)
T cd07429 71 GREVLRLKKKNQQLEEENNLLKLKIEVLLDMLAE 104 (108)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556667778888999999998888888877765
No 365
>PF04325 DUF465: Protein of unknown function (DUF465); InterPro: IPR007420 Family members are found in small bacterial proteins, and also in the heavy chains of eukaryotic myosin and kinesin, C-terminal of the motor domain. Members of this family may form coiled coil structures.; PDB: 1ZHC_A.
Probab=29.88 E-value=74 Score=24.56 Aligned_cols=23 Identities=30% Similarity=0.564 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 244 EHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 244 E~EkERLRREneaLQQRLeELKq 266 (561)
..++++|+++.-.|+.+|..+.+
T Consensus 26 d~~l~~LKk~kL~LKDei~~ll~ 48 (49)
T PF04325_consen 26 DEELERLKKEKLRLKDEIYRLLR 48 (49)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHc
Confidence 35667777777777777776654
No 366
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=29.72 E-value=2.8e+02 Score=25.25 Aligned_cols=11 Identities=27% Similarity=0.398 Sum_probs=4.2
Q ss_pred HHHHHHHHHHH
Q psy11481 197 RAHLKECFEIL 207 (561)
Q Consensus 197 Rd~LKdcFeeL 207 (561)
|..+.+.+..|
T Consensus 54 ~e~l~~~~~~l 64 (151)
T PF11559_consen 54 REDLSDKLRRL 64 (151)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 367
>PHA03162 hypothetical protein; Provisional
Probab=29.66 E-value=66 Score=30.80 Aligned_cols=21 Identities=14% Similarity=0.374 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 241 REFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 241 reLE~EkERLRREneaLQQRL 261 (561)
.+|.+|+.+|+-||.+|+++|
T Consensus 16 EeLaaeL~kLqmENK~LKkkl 36 (135)
T PHA03162 16 EDLAAEIAKLQLENKALKKKI 36 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 345566666666666665555
No 368
>TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair.
Probab=29.65 E-value=2.7e+02 Score=31.45 Aligned_cols=43 Identities=21% Similarity=0.206 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 225 ILHSAIRYIQFLRRREREFE----HEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 225 ILrKAiDYIQkLQreEreLE----~EkERLRREneaLQQRLeELKqE 267 (561)
=|.++.+-+++|+.+-+... ..+++|+.+.+.+++++++|++.
T Consensus 213 ~i~~~~~e~d~lk~e~~~~~~~i~~~~~~l~~~~~~~~~~~~~lk~a 259 (555)
T TIGR03545 213 ELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKA 259 (555)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhc
Confidence 34445555555555544432 33334455555555556666554
No 369
>PF09849 DUF2076: Uncharacterized protein conserved in bacteria (DUF2076); InterPro: IPR018648 This family of hypothetical prokaryotic proteins has no known function but includes putative perimplasmic ligand-binding sensor proteins.
Probab=29.63 E-value=2.7e+02 Score=28.66 Aligned_cols=17 Identities=24% Similarity=0.366 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHhccCCC
Q psy11481 197 RAHLKECFEILKRQVPP 213 (561)
Q Consensus 197 Rd~LKdcFeeLRdlVPs 213 (561)
|..|.+.|..||.+=..
T Consensus 6 ~qLI~~lf~RL~~ae~~ 22 (247)
T PF09849_consen 6 RQLIDDLFSRLKQAEAQ 22 (247)
T ss_pred HHHHHHHHHHHHhccCC
Confidence 67889999999987666
No 370
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=29.56 E-value=2.3e+02 Score=32.22 Aligned_cols=10 Identities=30% Similarity=0.421 Sum_probs=4.1
Q ss_pred ccccCCccCC
Q psy11481 20 CYTNGVSGSN 29 (561)
Q Consensus 20 ~~~~~~~~~~ 29 (561)
...|++.|.|
T Consensus 23 ~~~~~i~G~N 32 (1179)
T TIGR02168 23 KGITGIVGPN 32 (1179)
T ss_pred CCcEEEECCC
Confidence 3344444433
No 371
>KOG1265|consensus
Probab=29.53 E-value=85 Score=38.00 Aligned_cols=42 Identities=19% Similarity=0.265 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.-+++.+++|+.|.+... ...++|.+.+.+-.++|++-++.|
T Consensus 1128 s~i~~~V~e~krL~~~~~---k~~e~L~k~~~~~leql~e~~kal 1169 (1189)
T KOG1265|consen 1128 SNIKEFVEERKRLAEKQS---KRQEQLVKKHLEVLEQLAEEEKAL 1169 (1189)
T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhHHH
Confidence 345666777777766543 344555555555555555443333
No 372
>KOG4715|consensus
Probab=29.49 E-value=2.7e+02 Score=30.46 Aligned_cols=67 Identities=16% Similarity=0.348 Sum_probs=45.4
Q ss_pred hHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 190 NaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
-..+|++| .|.+.|.. ..||..+-..-..+..+|++ .++.|--..++|++|+.++++.+++-++++.
T Consensus 193 arf~rN~r-LIsei~Se--svVpDvrsVVtt~RMqvlkr---Qv~SL~~HQ~KLEaEL~q~Ee~hq~kKrk~~ 259 (410)
T KOG4715|consen 193 ARFQRNHR-LISEILSE--SVVPDVRSVVTTARMQVLKR---QVQSLMVHQRKLEAELLQIEERHQEKKRKFL 259 (410)
T ss_pred hhhhhhhH-HHHHHhcc--ccccchhhhhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444 46666653 57777655455677888887 4667777778888888888887777776664
No 373
>cd00187 TOP4c DNA Topoisomerase, subtype IIA; domain A'; bacterial DNA topoisomerase IV (C subunit, ParC), bacterial DNA gyrases (A subunit, GyrA),mammalian DNA toposiomerases II. DNA topoisomerases are essential enzymes that regulate the conformational changes in DNA topology by catalysing the concerted breakage and rejoining of DNA strands during normal cellular growth.
Probab=29.38 E-value=1.8e+02 Score=32.01 Aligned_cols=25 Identities=28% Similarity=0.294 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELs 269 (561)
.|+++|++|..+|++++++|+.-+.
T Consensus 405 ~e~~kL~~E~~~l~~ei~~l~~~l~ 429 (445)
T cd00187 405 LEREKLLKELKELEAEIEDLEKILA 429 (445)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4567888888888888888887773
No 374
>KOG3560|consensus
Probab=29.22 E-value=46 Score=38.17 Aligned_cols=41 Identities=27% Similarity=0.442 Sum_probs=35.1
Q ss_pred HhHHHHHHHHHHHHhccCCCCCC-cCCcchHHHHHHHHHHHH
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQE-EKKSSNLSILHSAIRYIQ 234 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLsD-n~KaSKAsILrKAiDYIQ 234 (561)
|+-|+-||.-++.|..++|.-.+ -.|..|++||+=++.|++
T Consensus 34 KRHRdRLNaELD~lAsLLPfpqdiisKLDkLSVLRLSVSyLr 75 (712)
T KOG3560|consen 34 KRHRDRLNAELDHLASLLPFPQDIISKLDKLSVLRLSVSYLR 75 (712)
T ss_pred hhHHHHhhhHHHHHHHhcCCCHHHHhhhhhhhhhhhhHHHHH
Confidence 44599999999999999999543 368999999999999975
No 375
>PF10359 Fmp27_WPPW: RNA pol II promoter Fmp27 protein domain; InterPro: IPR019449 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a domain within FMP27 that contains characteristic HQR and WPPW sequence motifs.
Probab=29.13 E-value=1.9e+02 Score=31.59 Aligned_cols=73 Identities=18% Similarity=0.148 Sum_probs=43.6
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLe 262 (561)
.+.|.+.-.|-+.|-+.|+..+..+...+-.+..... =..++.+-+.++.+++.|++++..|+..|+
T Consensus 158 ~~p~~vQ~~L~~~Rl~~L~~qi~~~~~~l~~~~~~~~-------------~~~~~~~~~~l~~~~~~l~~~~~~l~~~l~ 224 (475)
T PF10359_consen 158 NDPRRVQIELIQERLDELEEQIEKHEEKLGELELNPD-------------DPELKSDIEELERHISSLKERIEFLENMLE 224 (475)
T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-------------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4568888889999999999988888887776421000 112333333444445555555555555555
Q ss_pred HHHHHH
Q psy11481 263 LLKKEL 268 (561)
Q Consensus 263 ELKqEL 268 (561)
.|+...
T Consensus 225 ~l~~~~ 230 (475)
T PF10359_consen 225 DLEDSE 230 (475)
T ss_pred HHhhhc
Confidence 555443
No 376
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=29.11 E-value=1e+02 Score=29.09 Aligned_cols=16 Identities=25% Similarity=0.227 Sum_probs=9.8
Q ss_pred HHHHhHHHHHHHHHHH
Q psy11481 191 KLEKNRRAHLKECFEI 206 (561)
Q Consensus 191 aLERKRRd~LKdcFee 206 (561)
..|=+|=..||.+|-.
T Consensus 51 VsEL~~Ls~LK~~y~~ 66 (131)
T PF04859_consen 51 VSELRRLSELKRRYRK 66 (131)
T ss_pred HHHHHHHHHHHHHHHc
Confidence 3455566677777754
No 377
>PF06632 XRCC4: DNA double-strand break repair and V(D)J recombination protein XRCC4; InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants. XRCC4 binds to DNA, and to DNA ligase IV (LIG4) to form the LIG4-XRCC4 complex []. The LIG4-XRCC4 complex is responsible for the ligation step in the non-homologous end joining (NHEJ) pathway of DNA double-strand break repair. XRCC4 enhances the joining activity of LIG4. It is thought that XRCC4 and LIG4 are essential for alignment-based gap filling, as well as for final ligation of the breaks []. Binding of the LIG4-XRCC4 complex to DNA ends is dependent on the assembly of the DNA-dependent protein kinase complex DNA-PK to these DNA ends. ; GO: 0003677 DNA binding, 0006302 double-strand break repair, 0006310 DNA recombination, 0005634 nucleus; PDB: 3RWR_R 1IK9_B 3SR2_E 3Q4F_H 3II6_B 1FU1_A 3MUD_B.
Probab=29.10 E-value=1.6e+02 Score=31.39 Aligned_cols=13 Identities=15% Similarity=0.386 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHhh
Q psy11481 258 QQRLALLKKELSA 270 (561)
Q Consensus 258 QQRLeELKqELsQ 270 (561)
+.+|.+|++.|..
T Consensus 193 K~KIR~lq~~L~~ 205 (342)
T PF06632_consen 193 KAKIRELQRLLAS 205 (342)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 3556666666654
No 378
>PRK02119 hypothetical protein; Provisional
Probab=29.05 E-value=2e+02 Score=24.29 Aligned_cols=35 Identities=11% Similarity=0.041 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+++...++-..+-+.++++..|+++|..|..+|..
T Consensus 21 QE~tie~LN~~v~~Qq~~id~L~~ql~~L~~rl~~ 55 (73)
T PRK02119 21 QENLLEELNQALIEQQFVIDKMQVQLRYMANKLKD 55 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33344444455555666666666666666665543
No 379
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=28.92 E-value=2.5e+02 Score=27.51 Aligned_cols=25 Identities=32% Similarity=0.442 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 243 FEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 243 LE~EkERLRREneaLQQRLeELKqE 267 (561)
++.++..|+.|++.|.+|++.++++
T Consensus 98 ~ek~l~~Lk~e~evL~qr~~kle~E 122 (201)
T PF13851_consen 98 LEKELKDLKWEHEVLEQRFEKLEQE 122 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444443
No 380
>PF14645 Chibby: Chibby family
Probab=28.78 E-value=1.2e+02 Score=27.66 Aligned_cols=36 Identities=22% Similarity=0.128 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+.|+++++++++|.+-|+-+++-|-..|++-..++
T Consensus 73 ~~~l~~~n~~L~EENN~Lklk~elLlDMLtettae~ 108 (116)
T PF14645_consen 73 NQRLRKENQQLEEENNLLKLKIELLLDMLTETTAEA 108 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445777777777777777777777777777655543
No 381
>KOG4603|consensus
Probab=28.70 E-value=2e+02 Score=29.03 Aligned_cols=45 Identities=24% Similarity=0.369 Sum_probs=23.7
Q ss_pred chHHHHHHHHHH----HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 221 SNLSILHSAIRY----IQFLRRRE--REFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 221 SKAsILrKAiDY----IQkLQreE--reLE~EkERLRREneaLQQRLeELK 265 (561)
-|+.-|++.|.| |+.|.... .++++++..|++|....+.||..++
T Consensus 93 ek~q~l~~t~s~veaEik~L~s~Lt~eemQe~i~~L~kev~~~~erl~~~k 143 (201)
T KOG4603|consen 93 EKVQSLQQTCSYVEAEIKELSSALTTEEMQEEIQELKKEVAGYRERLKNIK 143 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555 33443321 3445555566666666666665555
No 382
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=28.61 E-value=2.7e+02 Score=24.31 Aligned_cols=21 Identities=19% Similarity=0.308 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLARE 253 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRRE 253 (561)
|+.|.....++.+++++...+
T Consensus 41 i~~l~~dr~rLa~eLD~~~ar 61 (89)
T PF13747_consen 41 IQRLDADRSRLAQELDQAEAR 61 (89)
T ss_pred HHHHHhhHHHHHHHHHhHHHH
Confidence 344444444444444443333
No 383
>PRK02793 phi X174 lysis protein; Provisional
Probab=28.51 E-value=2.1e+02 Score=24.07 Aligned_cols=35 Identities=20% Similarity=0.190 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++....+|-..+-+.++++..|++++..|..+|..
T Consensus 20 Qe~tIe~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~ 54 (72)
T PRK02793 20 QEITIEELNVTVTAHEMEMAKLRDHLRLLTEKLKA 54 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33333444445555566666666666666665543
No 384
>PF15546 DUF4653: Domain of unknown function (DUF4653)
Probab=28.49 E-value=74 Score=32.56 Aligned_cols=60 Identities=25% Similarity=0.251 Sum_probs=38.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHhhhhcccccccCCCCCccccccccC
Q psy11481 229 AIRYIQFLRRREREFEHEMER------------LAREKIHAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDN 293 (561)
Q Consensus 229 AiDYIQkLQreEreLE~EkER------------LRREneaLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQ 293 (561)
-.+-++.|+.+|+=-+.|||+ |-+=++.||+|+..-|..|... |.+++.+..|+-|-+.|
T Consensus 158 va~Avq~Lq~~Er~KeqEkEKHhv~LvMYRRLALl~Wir~LQ~~~~dQQ~RLQeS-----FDtILdnRKELiRclqq 229 (239)
T PF15546_consen 158 VAEAVQQLQAQERYKEQEKEKHHVHLVMYRRLALLRWIRGLQHQLVDQQNRLQES-----FDTILDNRKELIRCLQQ 229 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhHHHHHHHHHc
Confidence 344567777777655555553 2344555666666666666553 55888888888887744
No 385
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=28.48 E-value=1.6e+02 Score=34.28 Aligned_cols=13 Identities=38% Similarity=0.473 Sum_probs=6.1
Q ss_pred hHHHHhHHHHHHH
Q psy11481 190 NKLEKNRRAHLKE 202 (561)
Q Consensus 190 NaLERKRRd~LKd 202 (561)
|+++-.||..|-.
T Consensus 489 ~a~~iA~~~Glp~ 501 (782)
T PRK00409 489 NAFEIAKRLGLPE 501 (782)
T ss_pred HHHHHHHHhCcCH
Confidence 4455455444433
No 386
>KOG1853|consensus
Probab=28.45 E-value=1.5e+02 Score=31.43 Aligned_cols=42 Identities=31% Similarity=0.368 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRRERE---FEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 226 LrKAiDYIQkLQreEre---LE~EkERLRREneaLQQRLeELKqE 267 (561)
|..||+-+.+|+.+..+ +..++++|+.|-+-|+|.|+--.++
T Consensus 142 LnqAIErnAfLESELdEke~llesvqRLkdEardlrqelavr~kq 186 (333)
T KOG1853|consen 142 LNQAIERNAFLESELDEKEVLLESVQRLKDEARDLRQELAVRTKQ 186 (333)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 66788888888876543 4467778888888888877755533
No 387
>TIGR02473 flagell_FliJ flagellar export protein FliJ. Members of this family are the FliJ protein found, in nearly every case, in the midst of other flagellar biosynthesis genes in bacgterial genomes. Typically the fliJ gene is found adjacent to the gene for the flagellum-specific ATPase FliI. Sequence scoring in the gray zone between trusted and noise cutoffs include both probable FliJ proteins and components of bacterial type III secretion systems.
Probab=28.34 E-value=3.4e+02 Score=23.65 Aligned_cols=46 Identities=24% Similarity=0.307 Sum_probs=34.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 222 NLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 222 KAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
.+.-|..--.||..|......++.++..++.+.+..++.+.+...+
T Consensus 59 ~~~~l~~~~~f~~~l~~~i~~q~~~l~~~~~~~e~~r~~l~~a~~~ 104 (141)
T TIGR02473 59 SALELSNYQRFIRQLDQRIQQQQQELALLQQEVEAKRERLLEARRE 104 (141)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555667778899888888888888888888777777776665554
No 388
>PF15456 Uds1: Up-regulated During Septation
Probab=28.33 E-value=82 Score=29.13 Aligned_cols=26 Identities=31% Similarity=0.487 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+|++.|++|...|..|++.+++.+.-
T Consensus 22 eEVe~LKkEl~~L~~R~~~lr~kl~l 47 (124)
T PF15456_consen 22 EEVEELKKELRSLDSRLEYLRRKLAL 47 (124)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56788888888888888888887654
No 389
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=28.32 E-value=2.7e+02 Score=31.24 Aligned_cols=15 Identities=33% Similarity=0.514 Sum_probs=7.4
Q ss_pred ccceeeccccccccC
Q psy11481 521 SHQIILKPSLAQVNG 535 (561)
Q Consensus 521 ~~~~~~~~~~~~~~~ 535 (561)
++|+|+=..-.++.|
T Consensus 604 ~~QvIils~d~e~~~ 618 (650)
T TIGR03185 604 SHQVLLLSTDEEVDE 618 (650)
T ss_pred CCeEEEEechHhhCH
Confidence 466665444444433
No 390
>PHA03155 hypothetical protein; Provisional
Probab=28.31 E-value=69 Score=29.98 Aligned_cols=20 Identities=30% Similarity=0.531 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy11481 242 EFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRL 261 (561)
+|.+|+.+|+-||.+|+++|
T Consensus 12 eLaaeL~kL~~ENK~LKkkl 31 (115)
T PHA03155 12 ELEKELQKLKIENKALKKKL 31 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 45556666666665555544
No 391
>PF08606 Prp19: Prp19/Pso4-like; InterPro: IPR013915 This region is found specifically in PRP19-like protein. The region represented by this protein covers the sequence implicated in self-interaction and a coiled-coiled motif []. PRP19-like proteins form an oligomer that is necessary for spliceosome assembly [].
Probab=28.26 E-value=1.3e+02 Score=26.11 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 243 FEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 243 LE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
++.|.+-+.-|.=.|+++|+++++||+.
T Consensus 13 lQnEWDa~mLE~f~LRk~l~~~rqELs~ 40 (70)
T PF08606_consen 13 LQNEWDALMLENFTLRKQLDQTRQELSH 40 (70)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456777777777778888888888775
No 392
>PF15460 SAS4: Something about silencing, SAS, complex subunit 4
Probab=28.25 E-value=95 Score=28.03 Aligned_cols=50 Identities=26% Similarity=0.351 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCCCCCcccccc
Q psy11481 233 IQFLRRREREFE-HEMERLAREKIHAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIP 290 (561)
Q Consensus 233 IQkLQreEreLE-~EkERLRREneaLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRv 290 (561)
=+.++++|+++. .|+++++.|.+.|...|+.|+. .|....+|.+..|..+
T Consensus 12 Hrrmer~Ekrm~n~ek~r~~~E~d~L~~lle~L~~--------~dW~r~l~~iT~I~d~ 62 (101)
T PF15460_consen 12 HRRMERQEKRMRNIEKERAQHEKDRLQRLLEKLQG--------HDWLRVLPGITGINDP 62 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC--------CCHHHHHhccccccCc
Confidence 345555555554 6888899999888888888873 2434666666555443
No 393
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=28.02 E-value=1.2e+02 Score=28.16 Aligned_cols=22 Identities=27% Similarity=0.421 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 245 HEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKq 266 (561)
.++++|+.|+..|..||+.|+.
T Consensus 116 ~~i~~l~~e~~~l~~kL~~l~~ 137 (169)
T PF07106_consen 116 EEIEELEEEIEELEEKLEKLRS 137 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh
Confidence 3444555555555555555553
No 394
>PF11083 Streptin-Immun: Lantibiotic streptin immunity protein; InterPro: IPR021112 Streptococcal species produce a lantibiotic, streptin, in a similar manner to the production of nisin and subtilin by other lactic acid bacteria, in order to compete against competing bacteria within the environment []. The immunity protein protects the bacterium from destruction by its own lantibiotic. In general, there is little homology between the immunity proteins of different genera of bacteria.
Probab=28.02 E-value=1.5e+02 Score=27.10 Aligned_cols=20 Identities=10% Similarity=0.218 Sum_probs=13.4
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy11481 222 NLSILHSAIRYIQFLRRRER 241 (561)
Q Consensus 222 KAsILrKAiDYIQkLQreEr 241 (561)
|.++|.|.++|.-.|+.++.
T Consensus 14 kiatLNKmAEvLinlks~~~ 33 (99)
T PF11083_consen 14 KIATLNKMAEVLINLKSDDP 33 (99)
T ss_pred HHHHHHHHHHHHHhcccCCH
Confidence 66777777777766655443
No 395
>KOG0946|consensus
Probab=28.01 E-value=1.6e+02 Score=35.39 Aligned_cols=19 Identities=16% Similarity=0.179 Sum_probs=11.5
Q ss_pred chhhhhHHHHhHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKEC 203 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdc 203 (561)
+-..-+.+++-+|.++-.+
T Consensus 581 ke~f~srL~~lsr~e~ysr 599 (970)
T KOG0946|consen 581 KENFISRLQRLSRHELYSR 599 (970)
T ss_pred HHHHHHHHHHhhHhHHHHH
Confidence 3344466777777766555
No 396
>KOG4378|consensus
Probab=28.00 E-value=6.2e+02 Score=29.45 Aligned_cols=144 Identities=19% Similarity=0.194 Sum_probs=66.2
Q ss_pred cccccccceee--c-cCccceee-cc--cccchhcccCccCCCcceeeecCCCCC--CccccccCCCCCCC-chhhhhHH
Q psy11481 122 RHYTLHIEDEM--M-GSREVVAS-LS--GETRLVAHNNYTSNSHAVALSTSPNNL--TQNDMIKKRSGISG-IREVHNKL 192 (561)
Q Consensus 122 r~~~~~~~~~v--v-Gs~~~~~S-~S--~~~avva~~~~~~NS~~VaLsnSp~~L--~q~se~kKRPg~Se-RRaTHNaL 192 (561)
|++.++||+.. . |+.--+++ +- +-.+...+.|+++|+.+..---++... .-+-+..=....++ +-++--+-
T Consensus 493 ~~~s~~~e~~sd~~~~s~kl~t~gv~a~gn~~~~~lSN~TRns~~~~~~i~~~s~~~~L~~~p~i~ksss~n~p~~~~a~ 572 (673)
T KOG4378|consen 493 EEQSANIEVASDTGGGSDKLNTPGVDAEGNRRLRLLSNATRNSTPHHANITPQSSNPLLKPQPLIAKSSSGNLPAQMDAD 572 (673)
T ss_pred HHhhcceeeeccCCCCcccccCCCCCCcccccccccccccccCCcccccCCCcccCccccCCcccccCCCCCCchhhhhh
Confidence 66777777652 1 11111232 22 233344447889998776421111111 00000000001122 23333344
Q ss_pred HHh----HHHHHHHHHHHHhccCCCCC------CcCCcchH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHH--
Q psy11481 193 EKN----RRAHLKECFEILKRQVPPAQ------EEKKSSNL---------SILHSAIRYIQFLRRREREFEHEMERLA-- 251 (561)
Q Consensus 193 ERK----RRd~LKdcFeeLRdlVPsLs------Dn~KaSKA---------sILrKAiDYIQkLQreEreLE~EkERLR-- 251 (561)
|+. -|+.|++..+.-+..+|... ...+.... .|-.-+++.|++.+-. +.||.+|.
T Consensus 573 ~ag~~s~l~~~v~qs~~~~~q~~~~~~fs~q~~q~~~~~tlddfq~~~hrdirNl~~ell~Qfhm~----~~Ems~ller 648 (673)
T KOG4378|consen 573 WAGEFSELRDFVDQSCEKVEQELEYVTFSNQRLQANKMTTLDDFQVENHRDIRNLALELLLQFHMF----MREMSRLLER 648 (673)
T ss_pred hhhhhHHHHHHHHhhhhhHHhhcccchhHHHHHHHHhhhhHHHHHHHhHHHHHHHHHHHHHHHHHH----HHHHHHHHHH
Confidence 443 36667777777776666532 01111111 1112233333333333 34566666
Q ss_pred -HHHHHHHHHHHHHHHHHh
Q psy11481 252 -REKIHAQQRLALLKKELS 269 (561)
Q Consensus 252 -REneaLQQRLeELKqELs 269 (561)
.|++.|+++|+-|++|..
T Consensus 649 y~eNe~l~aelk~lreenq 667 (673)
T KOG4378|consen 649 YNENEMLKAELKFLREENQ 667 (673)
T ss_pred hhhhHHHHHHHHHHHHhhh
Confidence 788888888888887754
No 397
>PF13805 Pil1: Eisosome component PIL1; PDB: 3PLT_B.
Probab=27.93 E-value=1.7e+02 Score=30.56 Aligned_cols=19 Identities=32% Similarity=0.527 Sum_probs=12.9
Q ss_pred HhHHHHHHHHHHHHhccCC
Q psy11481 194 KNRRAHLKECFEILKRQVP 212 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVP 212 (561)
|.||..|.+.+..|+..=|
T Consensus 144 R~~r~~l~d~I~kLk~k~P 162 (271)
T PF13805_consen 144 RDRRRKLQDEIAKLKYKDP 162 (271)
T ss_dssp HHHHHHHHHHHHHHHHH-T
T ss_pred HHHhHHHHHHHHHHHhcCC
Confidence 4567788888888876544
No 398
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=27.91 E-value=3.6e+02 Score=29.85 Aligned_cols=83 Identities=17% Similarity=0.184 Sum_probs=44.2
Q ss_pred ccCCCCCCCchhhhhHHHHhHHHHHHH----HHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHH-HHHHHHHHHH
Q psy11481 176 IKKRSGISGIREVHNKLEKNRRAHLKE----CFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRRE-REFEHEMERL 250 (561)
Q Consensus 176 ~kKRPg~SeRRaTHNaLERKRRd~LKd----cFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreE-reLE~EkERL 250 (561)
.++-|-..+-++.|-+.-++||+..+. .+...|..+-.+ ..+.+..-|+....++++-++++ ++-..+++++
T Consensus 274 ~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~~~~~~~~~~~~~~~~~~~~~~~k~~~k~~ 350 (429)
T PRK00247 274 ERKYPLTDEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMI---ITPWRAPELHAENAEIKKTRTAEKNEAKARKKEI 350 (429)
T ss_pred HHhcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334454445556555555555555543 222233332222 24667777888887877766665 3333455566
Q ss_pred HHHHHHHHHHH
Q psy11481 251 AREKIHAQQRL 261 (561)
Q Consensus 251 RREneaLQQRL 261 (561)
++|.++.++++
T Consensus 351 ~~~~~~~~~~~ 361 (429)
T PRK00247 351 AQKRRAAEREI 361 (429)
T ss_pred HHHHHHHHHHH
Confidence 66555555333
No 399
>TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair.
Probab=27.88 E-value=1.3e+02 Score=34.00 Aligned_cols=86 Identities=16% Similarity=0.204 Sum_probs=40.8
Q ss_pred hhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCc--chHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 186 REVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKS--SNLSILHSA----IRYIQFLRRREREFEHEMERLAREKIHAQQ 259 (561)
Q Consensus 186 RaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~Ka--SKAsILrKA----iDYIQkLQreEreLE~EkERLRREneaLQQ 259 (561)
+..=+.+|+.. +.....|.+..+.+|. .+..+. .++..|++. ..-|++.+++-+++..|++..+...+.+++
T Consensus 163 ~~~~~~~~~~~-k~~~~~w~~~~~~Lp~-~~~~~~yk~~v~~i~~~~ik~p~~i~~~~~e~d~lk~e~~~~~~~i~~~~~ 240 (555)
T TIGR03545 163 VETAEEIEKSL-KAMQQKWKKRKKDLPN-KQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKN 240 (555)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHhcCC-chhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444432 2334456666667775 222221 123333332 112444555555555566555555555555
Q ss_pred HHHHHHHHHhhhhc
Q psy11481 260 RLALLKKELSARWE 273 (561)
Q Consensus 260 RLeELKqELsQqw~ 273 (561)
+|+..++++.+++.
T Consensus 241 ~l~~~~~~~~~~~~ 254 (555)
T TIGR03545 241 DLQNDKKQLKADLA 254 (555)
T ss_pred HHHHhHHHHHHHHH
Confidence 66555555555444
No 400
>KOG0837|consensus
Probab=27.86 E-value=1.3e+02 Score=31.73 Aligned_cols=27 Identities=19% Similarity=0.400 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLe 262 (561)
|+-+++++.++..+|++...++++++.
T Consensus 239 lk~~n~~L~~~l~~l~~~v~e~k~~V~ 265 (279)
T KOG0837|consen 239 LKIYNRDLASELSKLKEQVAELKQKVM 265 (279)
T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444444
No 401
>PRK10328 DNA binding protein, nucleoid-associated; Provisional
Probab=27.86 E-value=4.1e+02 Score=25.06 Aligned_cols=36 Identities=22% Similarity=0.274 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCCC
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELSARWEHIDFNTLIP 282 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELsQqw~hIDfStLlP 282 (561)
++.++.+++..+.++.|+++++.+.+ .+|++..|..
T Consensus 42 ~~~~~~~~~~~er~~~l~~i~~~~~~--~Git~eeL~~ 77 (134)
T PRK10328 42 EEEEQQQRELAERQEKINTWLELMKA--DGINPEELLG 77 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--hCCCHHHHhh
Confidence 44455567777778888888888776 5677777644
No 402
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=27.66 E-value=2.5e+02 Score=24.43 Aligned_cols=30 Identities=20% Similarity=0.280 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
...+..++..+++.|+.++..+..++..++
T Consensus 79 k~~ei~~l~~~l~~l~~~~~k~e~~l~~~~ 108 (126)
T PF13863_consen 79 KEAEIKKLKAELEELKSEISKLEEKLEEYK 108 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333334444444444444444444444443
No 403
>PF10205 KLRAQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019343 This entry represents a N-terminal 100 residues long domain, which contains a conserved KLRAQ motif. This domain is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain a C-terminal TTKRSYEDQ motif domain (IPR019348 from INTERPRO). The function of these proteins is not known.
Probab=27.61 E-value=1.8e+02 Score=26.64 Aligned_cols=34 Identities=24% Similarity=0.307 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|+++.+..+..+.+++.|++.|.=|..+|.+...
T Consensus 31 L~e~Lk~ke~~LRk~eqE~dSL~FrN~QL~kRV~ 64 (102)
T PF10205_consen 31 LKEQLKEKEQALRKLEQENDSLTFRNQQLTKRVE 64 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455555555555555555555555555543
No 404
>TIGR03683 A-tRNA_syn_arch alanyl-tRNA synthetase. This family of alanyl-tRNA synthetases is limited to the archaea, and is a subset of those sequences identified by the model pfam07973 covering the second additional domain (SAD) of alanyl and threonyl tRNA synthetases.
Probab=27.45 E-value=1.6e+02 Score=34.99 Aligned_cols=8 Identities=25% Similarity=0.480 Sum_probs=3.3
Q ss_pred CCCCcccC
Q psy11481 340 GGGEMISG 347 (561)
Q Consensus 340 ~~~~~~~~ 347 (561)
||.+.++|
T Consensus 874 GGk~~~AQ 881 (902)
T TIGR03683 874 GGKPDLAQ 881 (902)
T ss_pred CCchhhee
Confidence 33344444
No 405
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=27.44 E-value=1.7e+02 Score=28.71 Aligned_cols=38 Identities=24% Similarity=0.274 Sum_probs=28.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|+.|..+.++++...+...+.+..|.++++.|..+|..
T Consensus 178 i~~L~~~lkeaE~Rae~aE~~v~~Le~~id~le~eL~~ 215 (237)
T PF00261_consen 178 IRDLEEKLKEAENRAEFAERRVKKLEKEIDRLEDELEK 215 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77788887777777777777777777777777777765
No 406
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=27.39 E-value=2.3e+02 Score=29.36 Aligned_cols=46 Identities=15% Similarity=0.203 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 224 SILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 224 sILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
.-|...-+-+..++.+...++.+.+...+|++.|+.+++..+..|.
T Consensus 235 ~~L~~~~~~l~~l~~~l~~l~~~~~~~~~e~~~l~~~~~~~~~kl~ 280 (344)
T PF12777_consen 235 EQLAEKQAELAELEEKLAALQKEYEEAQKEKQELEEEIEETERKLE 280 (344)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 3344444445555555555555555555555555555555555543
No 407
>PRK00226 greA transcription elongation factor GreA; Reviewed
Probab=27.38 E-value=2.5e+02 Score=26.01 Aligned_cols=28 Identities=29% Similarity=0.254 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy11481 251 AREKIHAQQRLALLKKELSARWEHIDFNT 279 (561)
Q Consensus 251 RREneaLQQRLeELKqELsQqw~hIDfSt 279 (561)
+.++..+..|+.+|+++|.. ..-+|...
T Consensus 53 ~~~~~~~~~ri~~l~~~L~~-a~ii~~~~ 80 (157)
T PRK00226 53 KEEQGFIEGRIRELEDKLSN-AEVIDPSK 80 (157)
T ss_pred HHHHHHHHHHHHHHHHHHHh-CeecCccc
Confidence 44555678899999999877 55555443
No 408
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=27.38 E-value=2.2e+02 Score=24.97 Aligned_cols=38 Identities=13% Similarity=0.149 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
.|..||.....+-..++..+.|++.|++.-+-|+.=+.
T Consensus 24 ei~~LQ~sL~~L~~Rve~Vk~E~~kL~~EN~~Lq~YI~ 61 (80)
T PF10224_consen 24 EILELQDSLEALSDRVEEVKEENEKLESENEYLQQYIG 61 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46777777777777777777777777777777765443
No 409
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=27.37 E-value=1.1e+02 Score=28.39 Aligned_cols=24 Identities=25% Similarity=0.346 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 245 HEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqEL 268 (561)
.+++.|+.++++|++|+..|+.++
T Consensus 83 ~~~~~l~~rvd~Lerqv~~Lenk~ 106 (108)
T COG3937 83 SEMDELTERVDALERQVADLENKL 106 (108)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHh
Confidence 355667777788888888887665
No 410
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=27.29 E-value=1.7e+02 Score=27.07 Aligned_cols=29 Identities=21% Similarity=0.205 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.||-|...++.-+..|.+|+..|+..|-+
T Consensus 36 ~LEGE~r~~e~l~~dL~rrIkMLE~aLkq 64 (134)
T PF08232_consen 36 FLEGERRGQENLKKDLKRRIKMLEYALKQ 64 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444555555555555544
No 411
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=27.26 E-value=2.2e+02 Score=29.20 Aligned_cols=26 Identities=31% Similarity=0.286 Sum_probs=11.2
Q ss_pred hhHHHHhHHH--HHHHHHHHHhccCCCC
Q psy11481 189 HNKLEKNRRA--HLKECFEILKRQVPPA 214 (561)
Q Consensus 189 HNaLERKRRd--~LKdcFeeLRdlVPsL 214 (561)
-+.+|+.+++ .+..-++.|+..|-.+
T Consensus 37 ~~e~e~~~~~~~~~~~e~e~le~qv~~~ 64 (239)
T COG1579 37 KAELEALNKALEALEIELEDLENQVSQL 64 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444433 2333444555554443
No 412
>PF09812 MRP-L28: Mitochondrial ribosomal protein L28; InterPro: IPR019192 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Members of this family are components of the mitochondrial large ribosomal subunit. Mature mitochondrial ribosomes consist of a small (37S) and a large (54S) subunit. The 37S subunit contains at least 33 different proteins and 1 molecule of RNA (15S). The 54S subunit contains at least 45 different proteins and 1 molecule of RNA (21S) [, ].
Probab=27.25 E-value=5.7e+02 Score=24.57 Aligned_cols=26 Identities=27% Similarity=0.265 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqE 267 (561)
+.+.+..+|++..+.-+..|++|+..
T Consensus 86 ~~~~~~~~l~~~~~sq~~AleeLr~~ 111 (157)
T PF09812_consen 86 QREAREQQLRRQYESQQKALEELRLE 111 (157)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 34466777888888888889999866
No 413
>PF03955 Adeno_PIX: Adenovirus hexon-associated protein (IX); InterPro: IPR005641 Hexon (IPR000736 from INTERPRO) is the major coat protein from adenovirus type 2. Hexon forms a homo-trimer, 240 copies of which are present in the capsid, organised so that 12 lie on each of the 20 facets of this structure. The central 9 hexons in a facet are cemented together by 12 copies of protein IX []. Protein IX is not neccessarily required for viral replication, but has been shown to affect several processes including DNA-packaging capacity, thermostability, and the transcriptional activity of several promoters. For more information see [].; GO: 0031423 hexon binding, 0044423 virion part; PDB: 3IYN_T.
Probab=27.21 E-value=36 Score=31.41 Aligned_cols=29 Identities=24% Similarity=0.267 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
|+.+...++.++..|..+.++|+++|++|
T Consensus 81 l~~~~~~~~~~l~~l~a~Le~l~~~L~~~ 109 (109)
T PF03955_consen 81 LKANLTALEDKLTALLAQLEALKQQLAEL 109 (109)
T ss_dssp TTSTHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 33333333444455555555555565553
No 414
>PF07989 Microtub_assoc: Microtubule associated; InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=27.21 E-value=1.2e+02 Score=25.78 Aligned_cols=21 Identities=38% Similarity=0.551 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 248 ERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 248 ERLRREneaLQQRLeELKqEL 268 (561)
+.+.+++..|+..++.|++++
T Consensus 39 ~~~~keNieLKve~~~L~~el 59 (75)
T PF07989_consen 39 EELLKENIELKVEVESLKREL 59 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344444445555554444444
No 415
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=27.19 E-value=1.7e+02 Score=31.66 Aligned_cols=45 Identities=24% Similarity=0.307 Sum_probs=35.1
Q ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRY----IQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 223 AsILrKAiDY----IQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
+.-|.+..+| +..|..+.++++.++++|+++...|+++|+.|...
T Consensus 126 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~l~~~ 174 (525)
T TIGR02231 126 LKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTG 174 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC
Confidence 3344444444 67888888899999999999999999999988763
No 416
>PRK00578 prfB peptide chain release factor 2; Validated
Probab=27.11 E-value=4.2e+02 Score=28.70 Aligned_cols=47 Identities=28% Similarity=0.427 Sum_probs=26.1
Q ss_pred HHHHHHHHHHhccC--CCCCCcC-C----cchHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 198 AHLKECFEILKRQV--PPAQEEK-K----SSNLSILHSAIRYIQFLRRREREFE 244 (561)
Q Consensus 198 d~LKdcFeeLRdlV--PsLsDn~-K----aSKAsILrKAiDYIQkLQreEreLE 244 (561)
+.++..|.+|...+ |.+-++. + .-+...|.+.++.++.|++...+++
T Consensus 26 ~~~~~~~~~l~~~l~~p~~~~d~~~~~~l~ke~~~L~~iv~~~~~l~~~~~e~~ 79 (367)
T PRK00578 26 DALKERLEELEAEAEDPDFWNDQERAQKVTKELSSLKAKLDTLEELRQRLDDLE 79 (367)
T ss_pred HHHHHHHHHHHHHhcCCccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555677777666 3332222 1 1335666666777777776665544
No 417
>KOG1029|consensus
Probab=27.10 E-value=3.3e+02 Score=32.99 Aligned_cols=52 Identities=17% Similarity=0.092 Sum_probs=24.3
Q ss_pred eeeecCCCCCCCccCCCCCcccCCCCCccccccCCcccccccccccccccccc
Q psy11481 325 GLERSDGGESGSSLGGGGEMISGRGEGEYGRAGSSSGASNLLKCSESTLYSSN 377 (561)
Q Consensus 325 ~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (561)
+.++.|+.+.+-.-..-.-.+++-.+.|-| |+-.++.-+|--|--..||--+
T Consensus 652 avq~~d~~~~a~~~ad~~~~~a~~pa~e~~-av~as~~~~~~~vkyrAly~Fe 703 (1118)
T KOG1029|consen 652 AVQRFDADAGAKQTADPFAQIAQAPAHEKG-AVDASAFNIHDTVKYRALYPFE 703 (1118)
T ss_pred HHhhhccccccccccChhhHhhcCCCCCcc-ccccccccccceEEEeeecccc
Confidence 445555555444444434444555555544 4444433344444444555433
No 418
>PF07047 OPA3: Optic atrophy 3 protein (OPA3); InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=27.02 E-value=4.8e+02 Score=24.01 Aligned_cols=28 Identities=14% Similarity=0.317 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKq 266 (561)
++.+++++++.|+.++++|+.+++++++
T Consensus 106 Ke~~~~~~l~~L~~~i~~L~~~~~~~~~ 133 (134)
T PF07047_consen 106 KEEELQERLEELEERIEELEEQVEKQQE 133 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3455668889999999999999988874
No 419
>TIGR02492 flgK_ends flagellar hook-associated protein FlgK. The flagellar hook-associated protein FlgK of bacterial flagella has conserved N- and C-terminal domains. The central region is highly variable in length and sequence, and often contains substantial runs of low-complexity sequence. This model is built from an alignment of FlgK sequences with the central region excised. Note that several other proteins of the flagellar apparatus also are homologous in the N- and C-terminal regions to FlgK, but are excluded from this model.
Probab=27.02 E-value=4.1e+02 Score=27.34 Aligned_cols=51 Identities=20% Similarity=0.260 Sum_probs=29.2
Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 219 KSSNLSILHSAIRYIQFLRRREREFE-------HEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 219 KaSKAsILrKAiDYIQkLQreEreLE-------~EkERLRREneaLQQRLeELKqELs 269 (561)
...+..+|.+|-.....++.--.+++ .+++..-.++..|-++|++|-+++.
T Consensus 126 ~~~r~~vl~~a~~l~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~ll~~Ia~lN~~I~ 183 (322)
T TIGR02492 126 EALRQAVLESAQALANSFNQTSNELQDLRKGINAEIKSAVTEINSLLKQIASLNKEIQ 183 (322)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678888888766555554444433 4444444455555556666655554
No 420
>PF08657 DASH_Spc34: DASH complex subunit Spc34 ; InterPro: IPR013966 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=27.01 E-value=2.9e+02 Score=28.50 Aligned_cols=53 Identities=23% Similarity=0.209 Sum_probs=32.0
Q ss_pred HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 199 HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 199 ~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
-|-.+++.|-...|.. .|.+.|..|+++-..+..+++.|+.++.+-+.+|+.+
T Consensus 161 vLL~~ae~L~~vYP~~-------------ga~eki~~Lr~~y~~l~~~i~~lE~~VaeQ~~qL~~~ 213 (259)
T PF08657_consen 161 VLLRGAEKLCNVYPLP-------------GAREKIAALRQRYNQLSNSIAYLEAEVAEQEAQLERM 213 (259)
T ss_pred HHHHHHHHHHHhCCCh-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666676666662 4666677777776666666666655555444444443
No 421
>PF14661 HAUS6_N: HAUS augmin-like complex subunit 6 N-terminus
Probab=27.01 E-value=2.8e+02 Score=27.59 Aligned_cols=46 Identities=17% Similarity=0.275 Sum_probs=30.3
Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRRERE---FEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEre---LE~EkERLRREneaLQQRLeELKqEL 268 (561)
+.|+.....|.+.++++... +++..+.|..+.++|.++..++++++
T Consensus 160 a~~~~~r~~~~~~~~~~~~~~~~~~~~aq~L~~k~r~l~~~~~~~~~~~ 208 (247)
T PF14661_consen 160 ARILAHRNSFLQILQEKDAARQKYQEFAQLLRKKYRELSAECAELQAQL 208 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45555556667777776655 66666667777777777777766666
No 422
>PF14971 DUF4510: Domain of unknown function (DUF4510)
Probab=27.01 E-value=1.4e+02 Score=29.32 Aligned_cols=67 Identities=24% Similarity=0.320 Sum_probs=40.9
Q ss_pred CCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCC--------cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 183 SGIREVHNKLEKNRRAHLKECFEILKRQVPPAQE--------EKKSSNLSILHSAIRYIQFLRRREREFEHEMERLARE 253 (561)
Q Consensus 183 SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsD--------n~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRRE 253 (561)
+.+|+.--.+|+.= ..|..+++ ++..|.... ..+.....=| +|.+.|+.|+.+|..++..+.+|+.+
T Consensus 67 aar~a~qeaV~~EL-~aLq~~we--~~~~Paqp~gp~Rlv~~e~ga~~~~gL-~aaevi~~Lrs~EA~Le~~L~rLQ~q 141 (163)
T PF14971_consen 67 AARRALQEAVEQEL-AALQRAWE--QGRGPAQPHGPHRLVRREAGAAGGQGL-RAAEVIRALRSQEACLEAVLRRLQGQ 141 (163)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHh--cCCCCCCCCCCceeeecCCCCCCCCCc-cHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34666666666663 44444444 345555221 1122222223 36899999999999999999988763
No 423
>PF08287 DASH_Spc19: Spc19; InterPro: IPR013251 Spc19 is a component of the DASH complex. The DASH complex associates with the spindle pole body and is important for spindle and kinetochore integrity during cell division [, ].
Probab=26.92 E-value=1.2e+02 Score=28.82 Aligned_cols=48 Identities=21% Similarity=0.243 Sum_probs=32.3
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 220 SSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 220 aSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
..|..+...---+|..|-. ++++++++|+++.+.|+++.+-.+..|..
T Consensus 53 ~Aq~~l~~EI~P~I~~Ll~---k~e~~l~kL~Rr~~tL~ak~EL~~~RL~~ 100 (153)
T PF08287_consen 53 AAQQSLRDEIEPQINHLLD---KAEKHLEKLQRREETLKAKCELQQGRLSN 100 (153)
T ss_pred HHHHHHHHhccHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3444444444445655544 46788889999999999998866666655
No 424
>KOG4709|consensus
Probab=26.92 E-value=2.7e+02 Score=28.49 Aligned_cols=24 Identities=29% Similarity=0.441 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQ 259 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQ 259 (561)
++++...+++..+++++|.++|++
T Consensus 48 veRrK~Aqeqikeq~ReerielRk 71 (217)
T KOG4709|consen 48 VERRKAAQEQIKEQLREERIELRK 71 (217)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333344444455555555554
No 425
>KOG2417|consensus
Probab=26.85 E-value=3.8e+02 Score=29.86 Aligned_cols=83 Identities=18% Similarity=0.235 Sum_probs=43.4
Q ss_pred HHHhHHHHHHHHHHHHhccCCCCCCcCCcc---h-HHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 192 LEKNRRAHLKECFEILKRQVPPAQEEKKSS---N-LSILHSAI------RYIQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 192 LERKRRd~LKdcFeeLRdlVPsLsDn~KaS---K-AsILrKAi------DYIQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
..||||..+... +.+.+.+. ....++| + +..+..+. +-|+.||+|-.-+|+--++|=-|.-+|++..
T Consensus 203 isKKkk~a~~~l--~~~~l~~e-~~~k~pSff~r~w~~~~~~~~~~~~~~~i~~lq~EV~~LEeLsrqLFLE~~eLr~~k 279 (462)
T KOG2417|consen 203 ISKKKKMAMAQL--EEKRLQSE-KVQKEPSFFRRFWGMFSSSVQDNTLSSDIKLLQQEVEPLEELSRQLFLELVELRQMK 279 (462)
T ss_pred HHHHHHHHHHHH--HHHhcCch-hcccCCcHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555443 66666665 2233444 2 33333332 2355555555555555555555555555544
Q ss_pred HHHHHHHhhhhccccccc
Q psy11481 262 ALLKKELSARWEHIDFNT 279 (561)
Q Consensus 262 eELKqELsQqw~hIDfSt 279 (561)
+. .+.+++|++.=|+=
T Consensus 280 er--~~~SkTfkG~yfN~ 295 (462)
T KOG2417|consen 280 ER--VAFSKTFKGKYFNV 295 (462)
T ss_pred HH--HHHHHHhhhHhhHh
Confidence 43 34678888776543
No 426
>COG1842 PspA Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
Probab=26.85 E-value=2e+02 Score=28.98 Aligned_cols=70 Identities=7% Similarity=0.150 Sum_probs=0.0
Q ss_pred HHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy11481 200 LKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEH 274 (561)
Q Consensus 200 LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~h 274 (561)
++..+.-++..|-.+-+.-..++ .+|.. ||+..+.+-.+......++..++..|..++++++....+ |.+
T Consensus 4 ~~r~~~~~~a~~~~~~dk~EDp~-~~l~Q---~ird~~~~l~~ar~~~A~~~a~~k~~e~~~~~~~~~~~k-~e~ 73 (225)
T COG1842 4 FSRLKDLVKANINELLDKAEDPE-KMLEQ---AIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEK-LEE 73 (225)
T ss_pred HHHHHHHHHHHHHHHHHhhcCHH-HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH
No 427
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=26.82 E-value=1.3e+02 Score=32.37 Aligned_cols=32 Identities=22% Similarity=0.173 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 242 EFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 242 eLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
++.++..+|+++..+|.+++.+++.++.+.+.
T Consensus 70 ~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 101 (425)
T PRK05431 70 ALIAEVKELKEEIKALEAELDELEAELEELLL 101 (425)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555566666666666666666666655444
No 428
>PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=26.80 E-value=1.8e+02 Score=28.17 Aligned_cols=39 Identities=41% Similarity=0.486 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhh
Q psy11481 232 YIQFLRRREREFEHEMERL---AREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 232 YIQkLQreEreLE~EkERL---RREneaLQQRLeELKqELsQ 270 (561)
-+..|++++++++.++++| .++|+.+.+++.++...|-+
T Consensus 48 Q~~~LR~~~~~L~~~l~~Li~~Ar~Ne~~~~~~~~l~l~LL~ 89 (225)
T PF04340_consen 48 QLERLRERNRQLEEQLEELIENARENEAIFQRLHRLVLALLA 89 (225)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4677888888888777654 67899999999888888754
No 429
>PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=26.71 E-value=79 Score=30.55 Aligned_cols=21 Identities=24% Similarity=0.392 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 246 EMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKq 266 (561)
++++||++++.|+++|++|-+
T Consensus 48 Q~~~LR~~~~~L~~~l~~Li~ 68 (225)
T PF04340_consen 48 QLERLRERNRQLEEQLEELIE 68 (225)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555555555543
No 430
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=26.70 E-value=2e+02 Score=27.43 Aligned_cols=41 Identities=17% Similarity=0.343 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 232 YIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 232 YIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
||+.+++...+++..+.+.......|++++++++.+... |.
T Consensus 31 ~ird~e~~l~~a~~~~a~~~a~~~~le~~~~~~~~~~~~-~~ 71 (221)
T PF04012_consen 31 AIRDMEEQLRKARQALARVMANQKRLERKLDEAEEEAEK-WE 71 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH
Confidence 677777777777777777777777777777777777665 65
No 431
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=26.63 E-value=1.4e+02 Score=30.98 Aligned_cols=73 Identities=15% Similarity=0.265 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchH-------HHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNL-------SILHSAIRYI------------QFLRRREREFEHEMERLAREKIH 256 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKA-------sILrKAiDYI------------QkLQreEreLE~EkERLRREnea 256 (561)
+-+.|...=..|...|....+ .-+.|+ .++..|++-| ..|++-+.+.+.++..|+++.++
T Consensus 14 h~~~L~~~N~~L~~~IqdtE~-st~~~Vr~lLqqy~~~~~~i~~le~~~~~~l~~ak~eLqe~eek~e~~l~~Lq~ql~~ 92 (258)
T PF15397_consen 14 HEDFLTKLNKELIKEIQDTED-STALKVRKLLQQYDIYRTAIDILEYSNHKQLQQAKAELQEWEEKEESKLSKLQQQLEQ 92 (258)
T ss_pred HHHHHHHhhHHHHHHHHhHHh-hHHHHHHHHHHHHHHHHHHHHHHHccChHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 455566666677777777433 222222 2333343322 23444455566677788888888
Q ss_pred HHHHHHHHHHHHh
Q psy11481 257 AQQRLALLKKELS 269 (561)
Q Consensus 257 LQQRLeELKqELs 269 (561)
|.++|...+.+|.
T Consensus 93 l~akI~k~~~el~ 105 (258)
T PF15397_consen 93 LDAKIQKTQEELN 105 (258)
T ss_pred HHHHHHHHHHHHH
Confidence 8888888888864
No 432
>PF11285 DUF3086: Protein of unknown function (DUF3086); InterPro: IPR021437 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=26.59 E-value=1.4e+02 Score=31.62 Aligned_cols=35 Identities=20% Similarity=0.300 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLKKELSARW 272 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELKqELsQqw 272 (561)
.-.+++++.+..|+.|+++|.+|-+++++|+...+
T Consensus 4 ~~L~eL~qrk~~Lq~eIe~LerR~~ri~~EmrtsF 38 (283)
T PF11285_consen 4 EALKELEQRKQALQIEIEQLERRRERIEKEMRTSF 38 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
Confidence 34456666666677777777777777766665433
No 433
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=26.54 E-value=2.5e+02 Score=31.52 Aligned_cols=20 Identities=20% Similarity=0.463 Sum_probs=10.3
Q ss_pred ccCCcccccccccceeeeec
Q psy11481 472 LNSLPGLVSAHHSAQVLFTS 491 (561)
Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~ 491 (561)
+..-|+|+..-....|+.+|
T Consensus 325 l~~dp~L~e~A~~k~Vilvs 344 (475)
T PRK10361 325 LDRQPELITEALKNNIMLVS 344 (475)
T ss_pred HHhChHHHHHHHhCCEEEeC
Confidence 34456666544444555544
No 434
>PRK10265 chaperone-modulator protein CbpM; Provisional
Probab=26.48 E-value=1.8e+02 Score=25.47 Aligned_cols=25 Identities=32% Similarity=0.347 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 248 ERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 248 ERLRREneaLQQRLeELKqELsQqw~ 273 (561)
-.|-.|+++|++++.+|+++|.. +.
T Consensus 74 l~LLd~i~~Lr~el~~L~~~l~~-~~ 98 (101)
T PRK10265 74 LTLLDEIAHLKQENRLLRQRLSR-FV 98 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-Hh
Confidence 46778888888888888888776 44
No 435
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=26.41 E-value=1.9e+02 Score=30.92 Aligned_cols=38 Identities=26% Similarity=0.395 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 230 IRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 230 iDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
..+|..|+.+...++.++++|++|+..|...|+.=+..
T Consensus 141 ~k~i~~Le~e~~~~q~~le~Lr~EKVdlEn~LE~EQE~ 178 (310)
T PF09755_consen 141 QKKIERLEKEKSAKQEELERLRREKVDLENTLEQEQEA 178 (310)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 34566666666666777788888888877777654444
No 436
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=26.31 E-value=6.5e+02 Score=26.23 Aligned_cols=18 Identities=6% Similarity=0.073 Sum_probs=10.3
Q ss_pred CCcchHHHHHHHHHHHHH
Q psy11481 218 KKSSNLSILHSAIRYIQF 235 (561)
Q Consensus 218 ~KaSKAsILrKAiDYIQk 235 (561)
++++++.-|+..++-++.
T Consensus 279 ~~hP~v~~l~~~i~~l~~ 296 (444)
T TIGR03017 279 PNHPQYKRAQAEINSLKS 296 (444)
T ss_pred CCCcHHHHHHHHHHHHHH
Confidence 356666666665555444
No 437
>PRK03918 chromosome segregation protein; Provisional
Probab=26.28 E-value=2.1e+02 Score=32.43 Aligned_cols=75 Identities=19% Similarity=0.196 Sum_probs=0.0
Q ss_pred HhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
......++..+..|....+. ......+...+.++.+....|.+...+++.++.+++++...|+.++++++..+..
T Consensus 265 ~~el~~l~~~l~~l~~~~~~--~~~l~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~l~~~~~~l~~~l~~~e~~~~~ 339 (880)
T PRK03918 265 EERIEELKKEIEELEEKVKE--LKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEER 339 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 438
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=26.27 E-value=1.8e+02 Score=27.47 Aligned_cols=21 Identities=29% Similarity=0.411 Sum_probs=8.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy11481 248 ERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 248 ERLRREneaLQQRLeELKqEL 268 (561)
..|+.+.+.|+++++.|+++|
T Consensus 76 ~~lr~~~e~L~~eie~l~~~L 96 (177)
T PF07798_consen 76 AELRSENEKLQREIEKLRQEL 96 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444333
No 439
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=26.24 E-value=5.7e+02 Score=25.98 Aligned_cols=65 Identities=15% Similarity=0.099 Sum_probs=33.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCCCCCccccccccC-ccCCCcccccCCCCCCcCCC
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDN-HHHESSLLSYGKERSYMDED 313 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQ-a~~DSsilS~~~~~~~~~~~ 313 (561)
+++...++..+.+++.|.+..+..+..+..++.. -..+||+.++. -..+.....+..+.+.+.+|
T Consensus 59 ~~DIn~lE~iIkqa~~er~~~~~~i~r~~eey~~------------Lk~~in~~R~e~lgl~~Lp~l~eE~~~~~~~~ 124 (230)
T PF10146_consen 59 NQDINTLENIIKQAESERNKRQEKIQRLYEEYKP------------LKDEINELRKEYLGLEPLPSLEEEELSKISPD 124 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHcCCCCCCccccccccccCHH
Confidence 4444445555555666666655555555555443 33466666655 55555544443323444434
No 440
>PRK01919 tatB sec-independent translocase; Provisional
Probab=26.23 E-value=2.3e+02 Score=28.05 Aligned_cols=46 Identities=15% Similarity=0.188 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
...+.++..|+..++++-.+- .+.++|++....++....++++.+.
T Consensus 33 Gk~i~k~Rr~~~d~K~ev~~E-~e~dElrk~~~~~e~~~~~v~~si~ 78 (169)
T PRK01919 33 GALFGRAQRYINDVKAEVSRE-IELDELRKMKTDFESAARDVENTIH 78 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHH-HhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456677777777777654332 2444555555554444444444433
No 441
>KOG0976|consensus
Probab=26.10 E-value=2.5e+02 Score=34.14 Aligned_cols=73 Identities=22% Similarity=0.173 Sum_probs=49.6
Q ss_pred hHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRY----IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 190 NaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDY----IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
.+.+|..|...+..=++|+. +-+...||+-.|.- .+.||.....++.++.+-+-|.+.+..|+.+|.
T Consensus 84 tqetriyRrdv~llEddlk~---------~~sQiriLQn~c~~lE~ekq~lQ~ti~~~q~d~ke~etelE~~~srlh~le 154 (1265)
T KOG0976|consen 84 TQETRIYRRDVNLLEDDLKH---------HESQIRILQNKCLRLEMEKQKLQDTIQGAQDDKKENEIEIENLNSRLHKLE 154 (1265)
T ss_pred HHHHHHHHHHHHHhHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 34556656555554444443 24557777776643 566777777777777777788888888899999
Q ss_pred HHHhhh
Q psy11481 266 KELSAR 271 (561)
Q Consensus 266 qELsQq 271 (561)
++|++.
T Consensus 155 ~eLsAk 160 (1265)
T KOG0976|consen 155 DELSAK 160 (1265)
T ss_pred HHHhhh
Confidence 888873
No 442
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=26.04 E-value=2.2e+02 Score=28.00 Aligned_cols=44 Identities=16% Similarity=0.215 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.+..++.+.|+++-.+-+.+..++..+...|.++|.+||.+...
T Consensus 106 ~~~~e~~k~le~~~~~~~~~~~~~e~~i~~Le~ki~el~~~~~~ 149 (190)
T PF05266_consen 106 EKLLEERKKLEKKIEEKEAELKELESEIKELEMKILELQRQAAK 149 (190)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHH
No 443
>KOG0982|consensus
Probab=26.01 E-value=4e+02 Score=30.18 Aligned_cols=64 Identities=17% Similarity=0.222 Sum_probs=36.1
Q ss_pred HHHhccCCCCC------CcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 205 EILKRQVPPAQ------EEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 205 eeLRdlVPsLs------Dn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
.+||..++.+. ++.+.--...|.-+.+-|..+|+..+++-..+.+.++|+++.+.=+++|.++|
T Consensus 314 telRs~~arlksl~dklaee~qr~sd~LE~lrlql~~eq~l~~rm~d~Lrrfq~ekeatqELieelrkel 383 (502)
T KOG0982|consen 314 TELRSLIARLKSLADKLAEEDQRSSDLLEALRLQLICEQKLRVRMNDILRRFQEEKEATQELIEELRKEL 383 (502)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 45666666553 12232234455556666666666555555555566666666666666666665
No 444
>KOG3915|consensus
Probab=25.95 E-value=2e+02 Score=32.89 Aligned_cols=13 Identities=23% Similarity=0.202 Sum_probs=8.2
Q ss_pred hHHHHHHHHHHHH
Q psy11481 222 NLSILHSAIRYIQ 234 (561)
Q Consensus 222 KAsILrKAiDYIQ 234 (561)
-..+|+.|+|-++
T Consensus 508 iq~llkva~dnar 520 (641)
T KOG3915|consen 508 IQGLLKVAIDNAR 520 (641)
T ss_pred HHHHHHHHHHHHH
Confidence 3556777777654
No 445
>PF13864 Enkurin: Calmodulin-binding
Probab=25.90 E-value=4.4e+02 Score=22.83 Aligned_cols=38 Identities=21% Similarity=0.363 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHh
Q psy11481 232 YIQFLRRREREFEHEMER---------LAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 232 YIQkLQreEreLE~EkER---------LRREneaLQQRLeELKqELs 269 (561)
.+..|+..-.++..+... .++.+++|.++|.+|...+.
T Consensus 45 lL~~Lk~~~~el~~ey~~lp~~~DT~~~~~rK~~lE~~L~qlE~dI~ 91 (98)
T PF13864_consen 45 LLEGLKKNWDELNKEYQKLPFSIDTLRKKRRKEELEKELKQLEKDIK 91 (98)
T ss_pred HHHHHHHHHHHHHHHHHhCCcccCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 356677776666666654 56777778888888776654
No 446
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=25.86 E-value=1.6e+02 Score=32.33 Aligned_cols=37 Identities=22% Similarity=0.112 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 225 ILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 225 ILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
|+.|+-..+++|+.--+++++|..+|+-+.++++++.
T Consensus 121 vf~k~k~~~q~LE~li~~~~EEn~~lqlqL~~l~~e~ 157 (401)
T PF06785_consen 121 VFMKTKGDIQHLEGLIRHLREENQCLQLQLDALQQEC 157 (401)
T ss_pred HHHHhcchHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3333444444444444444444444444444444433
No 447
>COG3027 zapA Cell division protein ZapA (stimulator of FtsZ polymerization and Z-ring component) [Cell cycle control, cell division, chromosome partitioning]
Probab=25.85 E-value=3.5e+02 Score=24.50 Aligned_cols=63 Identities=17% Similarity=0.216 Sum_probs=36.4
Q ss_pred HHHHHHHHhccCCCCCCcCCcchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 200 LKECFEILKRQVPPAQEEKKSSNLSILHS--AIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 200 LKdcFeeLRdlVPsLsDn~KaSKAsILrK--AiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
+.....+|+.--+.+. ..++.|+.- .++-+.+|+++.+.++. .+++++..++++++.-.....
T Consensus 32 lD~kv~eik~~~~~~~----~~rl~vmaAlNv~~eL~~l~~k~~~~~~---~~~q~i~~~~~~~~~Al~~~a 96 (105)
T COG3027 32 LDDKVRELKESNGVLD----TERLAVMAALNVMHELLKLKEKLRDIEA---SLEQRIRKLDQALENALTTLA 96 (105)
T ss_pred HHHHHHHHHhcCcccc----HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 4555668888888753 445555532 44456667776665544 455555556666555444443
No 448
>PF11461 RILP: Rab interacting lysosomal protein; InterPro: IPR021563 RILP contains a domain which contains two coiled-coil regions and is found mainly in the cytosol. RILP is recruited onto late endosomal and lysosomal membranes by Rab7 and acts as a downstream effector of Rab7. This recruitment process is important for phagosome maturation and fusion with late endosomes and lysosomes. ; PDB: 1YHN_B.
Probab=25.76 E-value=1.3e+02 Score=25.30 Aligned_cols=32 Identities=28% Similarity=0.278 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 239 REREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 239 eEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.++..+|.+.|+.+...|+..|+.+|.++..
T Consensus 4 ELr~VL~ERNeLK~~v~~leEEL~~yk~~~~~ 35 (60)
T PF11461_consen 4 ELREVLQERNELKARVFLLEEELAYYKSELLP 35 (60)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHHHHSS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCC
Confidence 44667788888888888888888888887654
No 449
>PRK04325 hypothetical protein; Provisional
Probab=25.75 E-value=2.7e+02 Score=23.53 Aligned_cols=36 Identities=22% Similarity=0.138 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEME-------RLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkE-------RLRREneaLQQRLeELKqEL 268 (561)
|..|+.+-.-++.-++ +.++++..|+++|..|..+|
T Consensus 11 i~~LE~klAfQE~tIe~LN~vv~~Qq~~I~~L~~ql~~L~~rl 53 (74)
T PRK04325 11 ITELEIQLAFQEDLIDGLNATVARQQQTLDLLQAQLRLLYQQM 53 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555444444444 44455555555555554444
No 450
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes.
Probab=25.70 E-value=81 Score=28.99 Aligned_cols=29 Identities=28% Similarity=0.540 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
.++.++|++..++++.| ++++|+++|..+
T Consensus 65 ~~Q~k~Ye~a~~~~~~~--~lqkRle~l~~e 93 (104)
T PF11460_consen 65 MQQRKDYEEAVDQLTNE--ELQKRLEELSPE 93 (104)
T ss_pred HHHHHHHHHHHHHHhHH--HHHHHHHhCCHH
Confidence 34556677777776654 677777776543
No 451
>KOG0161|consensus
Probab=25.68 E-value=1.9e+02 Score=37.52 Aligned_cols=44 Identities=18% Similarity=0.262 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|.+.-.-+..|+.+..+...++.+|++++.+|+.++++|+.+|.
T Consensus 1071 l~kke~El~~l~~k~e~e~~~~~~l~k~i~eL~~~i~el~e~le 1114 (1930)
T KOG0161|consen 1071 LKKKESELSQLQSKLEDEQAEVAQLQKQIKELEARIKELEEELE 1114 (1930)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344444444444444455555555555555555544443
No 452
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=25.66 E-value=1.9e+02 Score=27.25 Aligned_cols=36 Identities=19% Similarity=0.247 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
|+.++...+.++.-+++|+.|.+.-..++..|+.+|
T Consensus 82 ~~e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L 117 (131)
T PF04859_consen 82 IQEQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKL 117 (131)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444444444444443
No 453
>COG3416 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.62 E-value=2.9e+02 Score=28.58 Aligned_cols=68 Identities=16% Similarity=0.260 Sum_probs=40.0
Q ss_pred HHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 198 AHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRY---IQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDY---IQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
..|++.|..||..--.-.| ...-.-|-..+.+. --+|-+..--+|+-++++..++.+|++|++.|++.
T Consensus 7 qlle~lf~rlk~a~~~~rD--~~Ae~lI~~~~~~qP~a~Y~laQ~vliqE~ALk~a~~~i~eLe~ri~~lq~~ 77 (233)
T COG3416 7 QLLENLFHRLKKAEANERD--PQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRIAILQAG 77 (233)
T ss_pred HHHHHHHHHHhhcccCCCC--hHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 4678889999887655222 11222222222111 12344444445566778888888888899888875
No 454
>PF14645 Chibby: Chibby family
Probab=25.62 E-value=1.9e+02 Score=26.45 Aligned_cols=36 Identities=17% Similarity=0.207 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 235 FLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 235 kLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
...++..++.++.++|++|+..|+-+++-|--.|..
T Consensus 68 ~~~~~~~~l~~~n~~L~EENN~Lklk~elLlDMLte 103 (116)
T PF14645_consen 68 ADGEENQRLRKENQQLEEENNLLKLKIELLLDMLTE 103 (116)
T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356677888889999999999999998888877765
No 455
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=25.57 E-value=1.7e+02 Score=29.98 Aligned_cols=36 Identities=33% Similarity=0.437 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+..|..+-.++..+++.|+.++..|+.++..+++.+
T Consensus 105 ~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~ 140 (239)
T COG1579 105 INSLEDELAELMEEIEKLEKEIEDLKERLERLEKNL 140 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333444444444444444444444444433
No 456
>cd04786 HTH_MerR-like_sg7 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 7) with a conserved cysteine present in the C-terminal portion of the protein. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic su
Probab=25.55 E-value=1.5e+02 Score=27.04 Aligned_cols=35 Identities=23% Similarity=0.314 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 231 RYIQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 231 DYIQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+....|+++..++++++++|++-...|...++.++
T Consensus 78 ~~~~~l~~k~~~i~~~i~~L~~~~~~L~~~i~~~~ 112 (131)
T cd04786 78 ELLAALERKVADIEALEARLAQNKAQLLVLIDLIE 112 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34556777766666666666665555555554444
No 457
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=25.33 E-value=1.5e+02 Score=28.70 Aligned_cols=57 Identities=19% Similarity=0.310 Sum_probs=0.0
Q ss_pred HhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 194 KNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKK 266 (561)
Q Consensus 194 RKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKq 266 (561)
++.-+.++..+..|++.+=....+...+ .++.++..+++.|++++..|+.+|+.++.
T Consensus 75 ~~~~~~~~~~i~~l~~~i~~~~~~r~~~----------------~eR~~~l~~l~~l~~~~~~l~~el~~~~~ 131 (188)
T PF03962_consen 75 QKEIEELEKKIEELEEKIEEAKKGREES----------------EEREELLEELEELKKELKELKKELEKYSE 131 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccc----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
No 458
>cd01111 HTH_MerD Helix-Turn-Helix DNA binding domain of the MerD transcription regulator. Helix-turn-helix (HTH) transcription regulator MerD. The putative secondary regulator of mercury resistance (mer) operons, MerD, has been shown to down-regulate the expression of this operon in gram-negative bacteria. It binds to the same operator DNA as MerR that activates transcription of the operon in the presence of mercury ions. The MerD protein shares the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily, which promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are conserved and contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules such as metal ions, drugs,
Probab=25.33 E-value=1.8e+02 Score=25.57 Aligned_cols=21 Identities=19% Similarity=0.230 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy11481 249 RLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 249 RLRREneaLQQRLeELKqELs 269 (561)
.+..+.++.+++|+.++.+|.
T Consensus 84 ~~~~~l~~~~~~L~~l~~~L~ 104 (107)
T cd01111 84 QLRQKIEVRRAALNALTTQLA 104 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333334444444455555444
No 459
>cd04766 HTH_HspR Helix-Turn-Helix DNA binding domain of the HspR transcription regulator. Helix-turn-helix (HTH) transcription regulator HspR, N-terminal domain. Heat shock protein regulators (HspR) have been shown to regulate expression of specific regulons in response to high temperature or high osmolarity in Streptomyces and Helicobacter, respectively. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=25.27 E-value=1.8e+02 Score=24.40 Aligned_cols=21 Identities=24% Similarity=0.284 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy11481 249 RLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 249 RLRREneaLQQRLeELKqELs 269 (561)
.|.++.+.|+++|++|++.+.
T Consensus 69 ~l~~~~~~l~~~l~~l~~~~~ 89 (91)
T cd04766 69 ELEEELAELRAELDELRARLR 89 (91)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 366666667777777776654
No 460
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=25.26 E-value=2.6e+02 Score=23.62 Aligned_cols=26 Identities=19% Similarity=0.377 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 243 FEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 243 LE~EkERLRREneaLQQRLeELKqEL 268 (561)
+...+..|+.++..+...+..|+..+
T Consensus 31 ~~~~IKKLr~~~~e~e~~~~~l~~~~ 56 (74)
T PF12329_consen 31 LNNTIKKLRAKIKELEKQIKELKKKL 56 (74)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444443
No 461
>PF10454 DUF2458: Protein of unknown function (DUF2458); InterPro: IPR018858 This entry represents a family of uncharacterised proteins.
Probab=25.15 E-value=2.2e+02 Score=27.14 Aligned_cols=49 Identities=20% Similarity=0.306 Sum_probs=29.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 222 NLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 222 KAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+..--..|..||-.+-.++......+++|.+++..+++++.+-+..|-+
T Consensus 4 ~It~w~~ALryv~~~v~~n~~~~~~Ir~Li~~Q~~~Er~w~~~Re~l~~ 52 (150)
T PF10454_consen 4 TITTWPAALRYVMKTVAQNPEFLQRIRRLIKEQHDHERQWWEGREALIA 52 (150)
T ss_pred hhhcHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455677776666666666666666666666666666655555543
No 462
>PRK09973 putative outer membrane lipoprotein; Provisional
Probab=25.14 E-value=2.1e+02 Score=25.47 Aligned_cols=45 Identities=16% Similarity=0.293 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhhhccccc
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL---SARWEHIDF 277 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL---sQqw~hIDf 277 (561)
+.+|..+.+.+..+++++..+...++.....-+.|- .++.+++||
T Consensus 26 vdqLss~V~~L~~kvdql~~dv~~a~aaa~aAk~EA~RAN~RiDN~~~ 73 (85)
T PRK09973 26 VNQLASNVQTLNAKIARLEQDMKALRPQIYAAKSEANRANTRLDAQDY 73 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHH
Confidence 556666666677777777777777777766665553 333555554
No 463
>PF07996 T4SS: Type IV secretion system proteins; InterPro: IPR014158 This entry contains VirB5, a protein that is involved in the type IV DNA secretion systems typified by the Agrobacterium Ti plasmid vir system where it interacts with several other proteins essential for proper pilus formation []. VirB5 is homologous to the IncN (N-type) conjugation system protein TraC [] as well as the P-type protein TrbJ and the F-type protein TraE [].; PDB: 1R8I_A.
Probab=25.12 E-value=1.7e+02 Score=27.29 Aligned_cols=36 Identities=11% Similarity=0.192 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccCC
Q psy11481 246 EMERLAREKIHAQQRLALLKKELSARWEHIDFNTLI 281 (561)
Q Consensus 246 EkERLRREneaLQQRLeELKqELsQqw~hIDfStLl 281 (561)
++.++.++.+.++++|+++++.+..--..+||..+.
T Consensus 20 q~~~~~~q~~q~~~Ql~~~k~q~~s~tG~r~~~~~~ 55 (195)
T PF07996_consen 20 QLAQWKQQLEQLKQQLQQAKQQYNSLTGNRGLGNLL 55 (195)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHTT--GGGSS
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHh
Confidence 344444455556666677776665544556666665
No 464
>COG4192 Signal transduction histidine kinase regulating phosphoglycerate transport system [Signal transduction mechanisms]
Probab=25.06 E-value=1.4e+02 Score=34.23 Aligned_cols=72 Identities=18% Similarity=0.385 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Q psy11481 195 NRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEH 274 (561)
Q Consensus 195 KRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~h 274 (561)
.||.+++.-|..+++++-.+..+... -++.+-++.+.+.+++.++---..-+++|++.+-.++. |.|
T Consensus 85 q~y~~~~q~l~~i~~l~~~L~~ne~~-----------~L~~~lq~~r~ll~qi~~~v~~~~~v~~~l~~~~~qid--WLH 151 (673)
T COG4192 85 QQYQRLQQDLTAIHALLASLPQNEQA-----------ALQQGLQELRQLLQQINDLVSARIDVDRRLALFAEQID--WLH 151 (673)
T ss_pred HHHHHHHHHHHHHHHHHHhcchhHHH-----------HHHHHHHHHHHHHHHHHHHhhhheeHHHHHHHHHHHHH--HHH
Q ss_pred ccccc
Q psy11481 275 IDFNT 279 (561)
Q Consensus 275 IDfSt 279 (561)
-||-+
T Consensus 152 ~Df~~ 156 (673)
T COG4192 152 QDFGM 156 (673)
T ss_pred Hhhhh
No 465
>PF14661 HAUS6_N: HAUS augmin-like complex subunit 6 N-terminus
Probab=25.03 E-value=3.2e+02 Score=27.25 Aligned_cols=39 Identities=21% Similarity=0.152 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 223 LSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 223 AsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
..++++....++.+++..+.+..++++|.+|...+++++
T Consensus 170 ~~~~~~~~~~~~~~~~~aq~L~~k~r~l~~~~~~~~~~~ 208 (247)
T PF14661_consen 170 LQILQEKDAARQKYQEFAQLLRKKYRELSAECAELQAQL 208 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444455555555555555556666655555555
No 466
>PF04880 NUDE_C: NUDE protein, C-terminal conserved region; InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=24.98 E-value=56 Score=31.82 Aligned_cols=30 Identities=23% Similarity=0.442 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 238 RREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 238 reEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
++...|+.+.+||+.|.+.|+++| .++..+
T Consensus 24 dEKE~L~~~~QRLkDE~RDLKqEl-~V~ek~ 53 (166)
T PF04880_consen 24 DEKENLREEVQRLKDELRDLKQEL-IVQEKL 53 (166)
T ss_dssp HHHHHHHHCH---------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHh
Confidence 334556677777777777777777 555443
No 467
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=24.91 E-value=5.5e+02 Score=29.08 Aligned_cols=26 Identities=12% Similarity=0.166 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELsQ 270 (561)
.|..+|+||.+..++..+.|.+.+.+
T Consensus 376 ~e~~~L~Re~~~~~~~Y~~ll~r~~e 401 (754)
T TIGR01005 376 VDLDALQRDAAAKRQLYESYLTNYRQ 401 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455556655555555555554443
No 468
>TIGR00344 alaS alanine--tRNA ligase. The model describes alanine--tRNA ligase. This enzyme catalyzes the reaction (tRNAala + L-alanine + ATP = L-alanyl-tRNAala + pyrophosphate + AMP).
Probab=24.89 E-value=2e+02 Score=34.05 Aligned_cols=22 Identities=18% Similarity=0.231 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREn 254 (561)
++.|+++.+++++++++|+++.
T Consensus 728 v~~l~~e~k~l~k~~~~l~~~l 749 (851)
T TIGR00344 728 IERFFEEWKALQKELESLKLKI 749 (851)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555554444444433
No 469
>KOG0971|consensus
Probab=24.83 E-value=6e+02 Score=31.46 Aligned_cols=38 Identities=32% Similarity=0.493 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
-..|+++++++++-|+-|+++..+|.+.-++||++|..
T Consensus 1014 ~a~lr~Ke~efeetmdaLq~di~~lEsek~elKqrl~~ 1051 (1243)
T KOG0971|consen 1014 QALLRKKEKEFEETMDALQADIDQLESEKAELKQRLNS 1051 (1243)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhh
Confidence 34566677788888888888888888888888888743
No 470
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.83 E-value=1.8e+02 Score=28.10 Aligned_cols=70 Identities=21% Similarity=0.293 Sum_probs=35.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----cccc--cccCCCCCccccccccCccCCCccccc
Q psy11481 234 QFLRRREREFEHEMERLAREKIHAQQRLALLKKELSARW----EHID--FNTLIPDNMEVDIPYDNHHHESSLLSY 303 (561)
Q Consensus 234 QkLQreEreLE~EkERLRREneaLQQRLeELKqELsQqw----~hID--fStLlPd~lEIDRvlRQa~~DSsilS~ 303 (561)
+.++.+.++...+.+..++|...-=.+-++|-+.|.+.+ .|+. =++|+|+...-+++.++-..+.--...
T Consensus 37 ~~~q~ELe~~K~~ld~~rqel~~HFa~sAeLlktl~~dYqklyqHmA~ss~~Llpel~~~~~~~~~l~~~~~~~~~ 112 (138)
T COG3105 37 QKLQYELEKVKAQLDEYRQELVKHFARSAELLKTLAQDYQKLYQHMAKSSTSLLPELSAKDNPFRNLLAEEEASND 112 (138)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhCccccccccHHHHHHHhhhcccc
Confidence 344444444444444444444433344444444444322 1111 467889888888887665555443333
No 471
>PF11598 COMP: Cartilage oligomeric matrix protein; InterPro: IPR024665 Thrombospondins are adhesive glycoproteins that mediate cell-to-cell and cell-to-matrix interactions. Cartilage oligomeric matrix protein may play a role in the structural integrity of cartilage via its interaction with other extracellular matrix proteins such as collagen and fibronectin [, ]. Thrombospondin 3 and 4 and cartilage oligomeric matrix proteins contain a five-stranded coiled-coil domain represented by this entry. This domain has a binding site between two internal rings formed by Leu37 and Thr40 [].; PDB: 1MZ9_D 1FBM_A 1VDF_E.
Probab=24.75 E-value=2e+02 Score=22.92 Aligned_cols=33 Identities=9% Similarity=0.129 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLK 265 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELK 265 (561)
+..|.+--+++.+++++..+|...||.-+++++
T Consensus 10 l~~l~~~l~elk~~l~~Q~kE~~~LRntI~eC~ 42 (45)
T PF11598_consen 10 LSELNQMLQELKELLRQQIKETRFLRNTIMECQ 42 (45)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT-T
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 455555556667777788888888888777653
No 472
>KOG0827|consensus
Probab=24.75 E-value=68 Score=35.53 Aligned_cols=83 Identities=13% Similarity=0.105 Sum_probs=40.7
Q ss_pred CCcceeeecCCCCCCccccccCCCCCCCchhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHH
Q psy11481 157 NSHAVALSTSPNNLTQNDMIKKRSGISGIREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFL 236 (561)
Q Consensus 157 NS~~VaLsnSp~~L~q~se~kKRPg~SeRRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkL 236 (561)
|+.++-.+.-++..-...+-+++-| .-|..-...+..||+++- -.+.|++..-. ....+.+|+.-|+|+.-|...|
T Consensus 62 N~~~~d~vvEe~~Vld~~~~K~~~g--kvk~~~s~~~~~K~dn~e-~~d~l~~t~e~-~~a~vv~~~~al~ki~~~~~~L 137 (465)
T KOG0827|consen 62 NPSTVDHVVEESVVLDWDVGKKCGG--KVKYSVSPGWPYKSDNLE-LQDLLGHTAED-TIAIVVGKIGALKKIVVYSGDL 137 (465)
T ss_pred chhhhhhhhccchhhhHHHHHhhcc--chhhhcCcCCccccccHH-HHHHHhhhHHH-HHHHHHHHHHHhhhHHHhHHHH
Confidence 4555555555555555555555554 334444555555555431 22223322222 1223455566666666666655
Q ss_pred HHHHHHH
Q psy11481 237 RRREREF 243 (561)
Q Consensus 237 QreEreL 243 (561)
+++-+.+
T Consensus 138 k~~~~~l 144 (465)
T KOG0827|consen 138 KEDFQDL 144 (465)
T ss_pred HHHHHHH
Confidence 5554443
No 473
>PF12443 AKNA: AT-hook-containing transcription factor; InterPro: IPR022150 This domain family is found in eukaryotes, and is approximately 110 amino acids in length. This family contains a transcription factor which regulates the expression of the costimulatory molecules on lymphocytes.
Probab=24.69 E-value=1e+02 Score=28.44 Aligned_cols=67 Identities=12% Similarity=0.223 Sum_probs=34.1
Q ss_pred chhhhhHHHHhHHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALL 264 (561)
Q Consensus 185 RRaTHNaLERKRRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeEL 264 (561)
.|+.|-+||. +.++.|++.-...-.. .-.+.-.|+.+ |.+++ +++.+|..+-++|+.++++.
T Consensus 10 l~~~~~aL~g---E~~~a~~~l~~t~q~~----~s~~sp~~f~~----~~ege-------~~~qkL~eqteeLK~kvqe~ 71 (106)
T PF12443_consen 10 LHAAQEALEG---ESETACRKLSSTSQKD----DSSGSPGIFDK----IREGE-------QMIQKLGEQTEELKDKVQEF 71 (106)
T ss_pred HHHHHHHhhh---HHHHHHHhcCCCcccc----CCCCCccccCc----chHHH-------HHHHHHHHHHHHHHHHHHHH
Confidence 4677788887 6666666543322221 11222344443 33333 34445555555556666666
Q ss_pred HHHHh
Q psy11481 265 KKELS 269 (561)
Q Consensus 265 KqELs 269 (561)
+++..
T Consensus 72 sk~i~ 76 (106)
T PF12443_consen 72 SKRIE 76 (106)
T ss_pred hcCcC
Confidence 66544
No 474
>PRK06798 fliD flagellar capping protein; Validated
Probab=24.66 E-value=82 Score=34.16 Aligned_cols=37 Identities=5% Similarity=0.225 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy11481 237 RRREREFEHEMERLAREKIHAQQRLALLKKELSARWE 273 (561)
Q Consensus 237 QreEreLE~EkERLRREneaLQQRLeELKqELsQqw~ 273 (561)
..+.+.++.++++|+++.+.+..|++.++..|-++|-
T Consensus 378 ~~r~~~l~~~i~~l~~~~~~~e~rl~~~e~~l~~qf~ 414 (440)
T PRK06798 378 GERSKSIDNRVSKLDLKITDIDTQNKQKQDNIVDKYQ 414 (440)
T ss_pred ehhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566666667777666666666666666655443
No 475
>KOG1029|consensus
Probab=24.56 E-value=3.9e+02 Score=32.43 Aligned_cols=49 Identities=24% Similarity=0.197 Sum_probs=30.1
Q ss_pred cchHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 220 SSNLSILHSAIRY----IQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 220 aSKAsILrKAiDY----IQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
......|.+|... ++.|+.+..+++.|++.-..|.+-++.++.+|+..+
T Consensus 527 ~~~~s~L~aa~~~ke~irq~ikdqldelskE~esk~~eidi~n~qlkelk~~~ 579 (1118)
T KOG1029|consen 527 TQRKSELEAARRKKELIRQAIKDQLDELSKETESKLNEIDIFNNQLKELKEDV 579 (1118)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH
Confidence 4456677776554 666777777777666655555555555555555443
No 476
>PRK11546 zraP zinc resistance protein; Provisional
Probab=24.54 E-value=3e+02 Score=26.47 Aligned_cols=22 Identities=23% Similarity=0.276 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy11481 247 MERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 247 kERLRREneaLQQRLeELKqEL 268 (561)
++.|.+|+..|+++|.+++-++
T Consensus 91 I~aL~kEI~~Lr~kL~e~r~~~ 112 (143)
T PRK11546 91 INAVAKEMENLRQSLDELRVKR 112 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555666665555443
No 477
>PRK09108 type III secretion system protein HrcU; Validated
Probab=24.54 E-value=3.1e+02 Score=29.27 Aligned_cols=90 Identities=18% Similarity=0.077 Sum_probs=59.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHhhhhcccccccCCCCCccccccccCccCCC
Q psy11481 225 ILHSAIRYIQFLRRREREFEHEMERLAREKI------HAQQRLALLKKELSARWEHIDFNTLIPDNMEVDIPYDNHHHES 298 (561)
Q Consensus 225 ILrKAiDYIQkLQreEreLE~EkERLRREne------aLQQRLeELKqELsQqw~hIDfStLlPd~lEIDRvlRQa~~DS 298 (561)
++--++||.-+-.+-++++.-.+++.++|.. +.|+|+.++++|+.++-| ++.|
T Consensus 197 ~via~~D~~~qr~~~~k~lkMSkqEvK~E~K~~EGdP~iK~rrRq~~re~a~~~m-------------~~~V-------- 255 (353)
T PRK09108 197 LLVGAADWKIQRWLFIRDNRMSKDEVKREHKESEGDPHIKGERKRLARELAFAPP-------------RQRV-------- 255 (353)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHhccCCHHHHHHHHHHHHHHHHhHH-------------hccC--------
Confidence 4445678877777777777766666666665 467888899988876433 0111
Q ss_pred cccccCCCCCCcCCCCceeEeecCc---ceeeecCCCCCCCccCCCCCcccCCCCCccccc
Q psy11481 299 SLLSYGKERSYMDEDGGLVIVTNGS---VGLERSDGGESGSSLGGGGEMISGRGEGEYGRA 356 (561)
Q Consensus 299 silS~~~~~~~~~~~g~l~~vtn~~---~~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 356 (561)
+ .-.|+|||-| +++.-| ....+.|++--+|.++.-..
T Consensus 256 ------------~--~AdvVItNPTH~AVAL~Y~-------~~~~~AP~VvAKG~d~~A~~ 295 (353)
T PRK09108 256 ------------A--RANVVVVNPTHYAVALRYA-------PDEHPLPRVIAKGVDDGALA 295 (353)
T ss_pred ------------C--CCcEEEECCCceEEEeEeC-------CCCCCCCEEEEEeCcHHHHH
Confidence 1 1357788875 788877 44567788887777765443
No 478
>KOG1899|consensus
Probab=24.50 E-value=1.7e+02 Score=34.43 Aligned_cols=28 Identities=25% Similarity=0.031 Sum_probs=20.1
Q ss_pred CccccccCCccccccccccccccccccc
Q psy11481 351 GEYGRAGSSSGASNLLKCSESTLYSSNS 378 (561)
Q Consensus 351 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (561)
++.....+++.-.-.=+|+|+++++|-|
T Consensus 345 p~~~~~~s~~p~~ssp~v~P~~~~~S~s 372 (861)
T KOG1899|consen 345 PRSTVQSSSSPNMSSPAVSPQHNWSSAS 372 (861)
T ss_pred ccccccCCCCCcCCCCCCCCcccccccc
Confidence 6666666666666677899998887765
No 479
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=24.49 E-value=4.2e+02 Score=22.15 Aligned_cols=42 Identities=12% Similarity=0.128 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
|++...-++.|+++-..+..|.+++-++...+...++.....
T Consensus 28 l~~~~~ti~~l~~~~~~i~~e~~~ll~~~n~l~~dv~~k~~~ 69 (90)
T PF06103_consen 28 LDEVNKTIDTLQEQVDPITKEINDLLHNTNELLEDVNEKLEK 69 (90)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444455555555555555555555555555554444444333
No 480
>KOG0286|consensus
Probab=24.47 E-value=85 Score=33.78 Aligned_cols=25 Identities=24% Similarity=0.401 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 245 HEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 245 ~EkERLRREneaLQQRLeELKqELs 269 (561)
+|+++|++|.+.|+++|++-++.+.
T Consensus 2 ~~~~~l~~Eae~L~~qi~~~r~~~~ 26 (343)
T KOG0286|consen 2 EELEQLRQEAEQLKNQIRDARKKLN 26 (343)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4678888888888888888887764
No 481
>PF07321 YscO: Type III secretion protein YscO; InterPro: IPR009929 This family contains the bacterial type III secretion protein YscO, which is approximately 150 residues long. YscO has been shown to be required for high-level expression and secretion of the anti-host proteins V antigen and Yops in Yersinia pestis [].
Probab=24.39 E-value=4e+02 Score=25.52 Aligned_cols=65 Identities=20% Similarity=0.277 Sum_probs=41.0
Q ss_pred HHHHHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 196 RRAHLKECFEILKRQVPPAQEEKKSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 196 RRd~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
|+..-...|..+...+=.+. =|.+.-.-|..|...+.+++..++++.+..+..+++|.++++++.
T Consensus 48 r~~ee~rly~~~~~~~v~~k---------ele~~~~qv~~Lr~~e~~le~~~~~a~~~~~~e~~~l~~a~~~~~ 112 (152)
T PF07321_consen 48 RQREEERLYAEIQGKVVSLK---------ELEKWQQQVASLREREAELEQQLAEAEEQLEQERQALEEARKQLQ 112 (152)
T ss_pred HHHHHHHHHHHHHHhHhhHH---------HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445556666655444422 233333457778888888888888777777777777777766654
No 482
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.38 E-value=2.5e+02 Score=29.44 Aligned_cols=12 Identities=33% Similarity=0.263 Sum_probs=4.3
Q ss_pred HHHHHHHHHHHH
Q psy11481 254 KIHAQQRLALLK 265 (561)
Q Consensus 254 neaLQQRLeELK 265 (561)
...|+.+|++++
T Consensus 82 ik~l~~eI~~~~ 93 (265)
T COG3883 82 IKKLQKEIAELK 93 (265)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 483
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=24.35 E-value=2e+02 Score=33.92 Aligned_cols=45 Identities=22% Similarity=0.311 Sum_probs=42.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|.++..-+..|+.+..++..|+++|+...+.++.+++.+++.|..
T Consensus 109 ld~~~~q~~rl~~E~er~~~El~~lr~~lE~~q~~~e~~q~~l~~ 153 (775)
T PF10174_consen 109 LDKAQEQFERLQAERERLQRELERLRKTLEELQLRIETQQQTLDK 153 (775)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999999999999999999999999999876
No 484
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=24.29 E-value=3e+02 Score=24.96 Aligned_cols=21 Identities=19% Similarity=0.358 Sum_probs=10.6
Q ss_pred HHhHHHHHHHHHHHHhccCCC
Q psy11481 193 EKNRRAHLKECFEILKRQVPP 213 (561)
Q Consensus 193 ERKRRd~LKdcFeeLRdlVPs 213 (561)
+...|..+.+..+.+++-+..
T Consensus 39 ~ee~k~~~~e~~~~~~e~~~~ 59 (118)
T TIGR01837 39 EKRGQKRFDESVDAAREEVKT 59 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHhh
Confidence 344555555555555544433
No 485
>cd04770 HTH_HMRTR Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators. Helix-turn-helix (HTH) heavy metal resistance transcription regulators (HMRTR): MerR1 (mercury), CueR (copper), CadR (cadmium), PbrR (lead), ZntR (zinc), and other related proteins. These transcription regulators mediate responses to heavy metal stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=24.14 E-value=1.7e+02 Score=25.56 Aligned_cols=24 Identities=17% Similarity=0.235 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMERLAREKIH 256 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREnea 256 (561)
...|+++.+++++++++|++....
T Consensus 81 ~~~l~~~~~~l~~~i~~l~~~~~~ 104 (123)
T cd04770 81 RALLEEKLAEVEAKIAELQALRAE 104 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555555555555554443333
No 486
>cd04776 HTH_GnyR Helix-Turn-Helix DNA binding domain of the regulatory protein GnyR. Putative helix-turn-helix (HTH) regulatory protein, GnyR, and other related proteins. GnyR belongs to the gnyRDBHAL cluster, which is involved in acyclic isoprenoid degradation in Pseudomonas aeruginosa. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=24.13 E-value=2.5e+02 Score=25.06 Aligned_cols=27 Identities=11% Similarity=0.204 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLA 262 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLe 262 (561)
|+++..++++++++|++....|...++
T Consensus 85 l~~~~~~l~~~~~~l~~~~~~L~~~~~ 111 (118)
T cd04776 85 IEKRRAELEQQRRDIDAALAELDAAEE 111 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333334444333333333333
No 487
>PF09766 FimP: Fms-interacting protein; InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress []. This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes [].
Probab=24.09 E-value=1.7e+02 Score=30.89 Aligned_cols=45 Identities=20% Similarity=0.253 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy11481 225 ILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELS 269 (561)
Q Consensus 225 ILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELs 269 (561)
-|..=.+-=+.|.++.++++.+++.|.+++...+.+|..|...|.
T Consensus 95 RL~~EL~~Rk~L~~~~~el~~~k~~l~~~~~~k~~~L~~l~~~L~ 139 (355)
T PF09766_consen 95 RLEFELEQRKRLEEQLKELEQRKKKLQQENKKKKKFLDSLPPQLK 139 (355)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 333333334455556666666666677777766666666666653
No 488
>PRK15396 murein lipoprotein; Provisional
Probab=24.05 E-value=2.2e+02 Score=24.77 Aligned_cols=38 Identities=11% Similarity=0.200 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
+.+|..+.+.+..++++|..+...++.....-+.|-.+
T Consensus 27 vd~LssqV~~L~~kvdql~~dv~~~~~~~~~a~~eA~r 64 (78)
T PRK15396 27 IDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAAR 64 (78)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677777777777788888888887777777766544
No 489
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=24.05 E-value=1.7e+02 Score=28.88 Aligned_cols=37 Identities=19% Similarity=0.235 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc
Q psy11481 243 FEHEMERLAREKIHAQQRLALLKKELSARWEHIDFNT 279 (561)
Q Consensus 243 LE~EkERLRREneaLQQRLeELKqELsQqw~hIDfSt 279 (561)
+|.|+.+.+.|++.++.++..|..+..=.=..|.|..
T Consensus 167 ie~~L~~v~~eIe~~~~~~~~l~~~v~~sti~i~l~~ 203 (262)
T PF14257_consen 167 IERELSRVRSEIEQLEGQLKYLDDRVDYSTITISLYE 203 (262)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhceEEEEEEEEe
Confidence 4455556666666666667777766653334444443
No 490
>PF08580 KAR9: Yeast cortical protein KAR9; InterPro: IPR013889 The KAR9 protein in Saccharomyces cerevisiae (Baker's yeast) is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules []. KAR9 localises at the shmoo tip in mating cells and at the tip of the growing bud in anaphase [].
Probab=24.02 E-value=2.7e+02 Score=32.36 Aligned_cols=18 Identities=17% Similarity=0.211 Sum_probs=10.7
Q ss_pred HHHHHHHHHHhccCCCCC
Q psy11481 198 AHLKECFEILKRQVPPAQ 215 (561)
Q Consensus 198 d~LKdcFeeLRdlVPsLs 215 (561)
+.|.+.|.+|.+.+..|.
T Consensus 195 esl~~~ll~L~arm~PLr 212 (683)
T PF08580_consen 195 ESLYSSLLALFARMQPLR 212 (683)
T ss_pred HHHHHHHHHHHhccchHH
Confidence 345566666666666653
No 491
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=23.83 E-value=3.6e+02 Score=28.31 Aligned_cols=17 Identities=24% Similarity=0.307 Sum_probs=10.9
Q ss_pred HHHHHHHHHhccCCCCC
Q psy11481 199 HLKECFEILKRQVPPAQ 215 (561)
Q Consensus 199 ~LKdcFeeLRdlVPsLs 215 (561)
.|....+.|..++|.+.
T Consensus 162 ~L~~~~~~l~~~~~~l~ 178 (312)
T smart00787 162 LLMKELELLNSIKPKLR 178 (312)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 35666666777777654
No 492
>PF06216 RTBV_P46: Rice tungro bacilliform virus P46 protein; InterPro: IPR009347 This family consists of several Rice tungro bacilliform virus P46 proteins. The function of this family is unknown.
Probab=23.83 E-value=2.9e+02 Score=29.56 Aligned_cols=47 Identities=17% Similarity=0.256 Sum_probs=26.8
Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 219 KSSNLSILHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 219 KaSKAsILrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqEL 268 (561)
+.-|+-|| .|||-.++-+.+.+..|-+.|+.++-.|+.+.+++++++
T Consensus 55 rvrklh~~---~~~~y~~e~e~~sy~~e~~~l~~qvs~l~~~~~~~r~~~ 101 (389)
T PF06216_consen 55 RVRKLHII---SDYIYNKEFERQSYSNEWISLNDQVSHLQHQNSEQRQQI 101 (389)
T ss_pred HHHHHHHH---HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444 578777777766666665555555555555444444444
No 493
>PF15003 HAUS2: HAUS augmin-like complex subunit 2
Probab=23.76 E-value=3.7e+02 Score=28.50 Aligned_cols=86 Identities=14% Similarity=0.244 Sum_probs=52.9
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cccCCCCCccccc
Q psy11481 221 SNLSILHSAIRYIQFLRRREREFE---------HEMERLAREKIHAQQRLALLKKELSARWEHID--FNTLIPDNMEVDI 289 (561)
Q Consensus 221 SKAsILrKAiDYIQkLQreEreLE---------~EkERLRREneaLQQRLeELKqELsQqw~hID--fStLlPd~lEIDR 289 (561)
.-+..|..|+.+|..|++.-...+ ..++..-+....+-.+.+++-..+.+ |.... ++.+.|.....+.
T Consensus 134 ~vVeLL~laa~fi~~Le~~LetIrwip~~~~~~~~m~~aL~ki~~lvae~E~l~e~ilk-wRe~~ke~~~~~~~~~~~~~ 212 (277)
T PF15003_consen 134 YVVELLELAASFIEKLEEHLETIRWIPNFDENPSNMDKALAKIDALVAECEELAEQILK-WREQQKEVSSYIPKMLAEEN 212 (277)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcccccccc
Confidence 447889999999999998876433 34455555566666666666666555 65443 5556666666666
Q ss_pred cccCccCC-CcccccCCCCC
Q psy11481 290 PYDNHHHE-SSLLSYGKERS 308 (561)
Q Consensus 290 vlRQa~~D-SsilS~~~~~~ 308 (561)
++.- .+| .++.+..++++
T Consensus 213 ~~~~-~~d~~~~t~~~~~~~ 231 (277)
T PF15003_consen 213 PLHK-PHDSDCITPPWRAES 231 (277)
T ss_pred cCCc-CCCCCccCCccccCC
Confidence 6643 333 34445554433
No 494
>PRK00846 hypothetical protein; Provisional
Probab=23.71 E-value=1.7e+02 Score=25.51 Aligned_cols=36 Identities=14% Similarity=0.080 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH
Q psy11481 233 IQFLRRREREFEHEMER-------LAREKIHAQQRLALLKKEL 268 (561)
Q Consensus 233 IQkLQreEreLE~EkER-------LRREneaLQQRLeELKqEL 268 (561)
|..|+.+-.-++.-++. +++++..|++++..|..+|
T Consensus 15 i~~LE~rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L~~rL 57 (77)
T PRK00846 15 LVELETRLSFQEQALTELSEALADARLTGARNAELIRHLLEDL 57 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44454444444444444 4444444444444444444
No 495
>PF13094 CENP-Q: CENP-Q, a CENPA-CAD centromere complex subunit
Probab=23.67 E-value=3.5e+02 Score=24.98 Aligned_cols=45 Identities=16% Similarity=0.182 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 226 LHSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 226 LrKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|.-+.+-|..|+++-++.+...++-.+..+.|++....+++++.+
T Consensus 36 L~~~~~~l~lLq~e~~~~e~~le~d~~~L~~Le~~~~~~~~e~~~ 80 (160)
T PF13094_consen 36 LAANLHQLELLQEEIEKEEAALERDYEYLQELEKNAKALEREREE 80 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 496
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=23.62 E-value=4.8e+02 Score=30.43 Aligned_cols=81 Identities=15% Similarity=0.217 Sum_probs=0.0
Q ss_pred hHHHHhHHH--HHHHHHHHHhccCCCCCCcCCcchHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 190 NKLEKNRRA--HLKECFEILKRQVPPAQEEKKSSNLSILHSAIRY------IQFLRRREREFEHEMERLAREKIHAQQRL 261 (561)
Q Consensus 190 NaLERKRRd--~LKdcFeeLRdlVPsLsDn~KaSKAsILrKAiDY------IQkLQreEreLE~EkERLRREneaLQQRL 261 (561)
.+++|-|.+ ..|..=++||..|-.+..+++..|..+-+.-.+. +..|.+..++=...++.|+|+..+.+++-
T Consensus 425 ~dvkkLraeLq~~Rq~E~ELRsqis~l~~~Er~lk~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LEkrL~eE~~~R 504 (697)
T PF09726_consen 425 ADVKKLRAELQSSRQSEQELRSQISSLTNNERSLKSELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLEKRLAEERRQR 504 (697)
T ss_pred HHHHHHHHHHHhhhhhHHHHHHHHhhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHhh
Q psy11481 262 ALLKKELSA 270 (561)
Q Consensus 262 eELKqELsQ 270 (561)
..|+++|.+
T Consensus 505 ~~lEkQL~e 513 (697)
T PF09726_consen 505 ASLEKQLQE 513 (697)
T ss_pred HHHHHHHHH
No 497
>PRK10698 phage shock protein PspA; Provisional
Probab=23.62 E-value=2.7e+02 Score=27.61 Aligned_cols=41 Identities=12% Similarity=0.181 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 227 HSAIRYIQFLRRREREFEHEMERLAREKIHAQQRLALLKKE 267 (561)
Q Consensus 227 rKAiDYIQkLQreEreLE~EkERLRREneaLQQRLeELKqE 267 (561)
..+.+-|..|+.+....+...++|+.....|+.+|++++.+
T Consensus 95 ~~~~~~~~~l~~~~~~~~~~~~~L~~~l~~L~~ki~eak~k 135 (222)
T PRK10698 95 QKLTDLIATLEHEVTLVDETLARMKKEIGELENKLSETRAR 135 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 498
>PRK08724 fliD flagellar capping protein; Validated
Probab=23.59 E-value=1.1e+02 Score=35.60 Aligned_cols=41 Identities=22% Similarity=0.472 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc
Q psy11481 236 LRRREREFEHEMERLAREKIHAQQRLALLKKELSARWEHID 276 (561)
Q Consensus 236 LQreEreLE~EkERLRREneaLQQRLeELKqELsQqw~hID 276 (561)
|..+.+.+..++.+|.++++.|+.|++.++.+|-+||-.+|
T Consensus 615 I~~R~~sL~~~i~~l~dqi~~Le~Rle~~E~Ry~~QFtAMD 655 (673)
T PRK08724 615 IRTREKSLREQNYRLNDDQVALDRRMESLEKRTHAKFAAMQ 655 (673)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 499
>PRK05561 DNA topoisomerase IV subunit A; Validated
Probab=23.58 E-value=3.7e+02 Score=31.42 Aligned_cols=83 Identities=16% Similarity=0.141 Sum_probs=0.0
Q ss_pred chhhhhHHHHhHHHHHH----HHHHHHhccCCCCCCcCCcchHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy11481 185 IREVHNKLEKNRRAHLK----ECFEILKRQVPPAQEEKKSSNLSILHS------AIRYIQFLRRREREFEHEMERLAREK 254 (561)
Q Consensus 185 RRaTHNaLERKRRd~LK----dcFeeLRdlVPsLsDn~KaSKAsILrK------AiDYIQkLQreEreLE~EkERLRREn 254 (561)
+|..+..-+-+.|.++- ..++.+...|--+..... .|..+.++ -.+||-.++-..-.-++ .++|++|.
T Consensus 369 rr~~~~l~k~~~r~~~l~g~~~~~~~id~vI~iir~s~~-ak~~l~~~f~~~~~qa~~Il~m~L~~Lt~le-~~kl~~E~ 446 (742)
T PRK05561 369 RRSQFRLDKVEKRLHILEGLLIAFLNIDEVIRIIRESDE-PKANLMARFDLSEIQAEAILELRLRRLAKLE-EIEIRKEQ 446 (742)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhcCcc-HHHHHHHHhCCCHHHHHHHHHhhHHHHHHHH-HHHHHHHH
Q ss_pred HHHHHHHHHHHHHHh
Q psy11481 255 IHAQQRLALLKKELS 269 (561)
Q Consensus 255 eaLQQRLeELKqELs 269 (561)
.+|++++++|+.-|.
T Consensus 447 ~~l~~ei~~l~~iL~ 461 (742)
T PRK05561 447 DELRKEIAELEAILA 461 (742)
T ss_pred HHHHHHHHHHHHHhC
No 500
>PF01025 GrpE: GrpE; InterPro: IPR000740 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes the grpE protein. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold []. DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. The X-ray crystal structure of GrpE in complex with the ATPase domain of DnaK revealed that GrpE is an asymmetric homodimer, bent in a manner that favours extensive contacts with only one DnaKATPase monomer []. GrpE does not actively compete for the atomic positions occupied by the nucleotide. GrpE and ADP mutually reduce one another's affinity for DnaK 200-fold, and ATP instantly dissociates GrpE from DnaK.; GO: 0000774 adenyl-nucleotide exchange factor activity, 0042803 protein homodimerization activity, 0051087 chaperone binding, 0006457 protein folding; PDB: 3A6M_A 4ANI_A 1DKG_B.
Probab=23.58 E-value=2.6e+02 Score=25.44 Aligned_cols=38 Identities=24% Similarity=0.275 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy11481 233 IQFLRRREREFEHEMERLAREKIHAQQRLALLKKELSA 270 (561)
Q Consensus 233 IQkLQreEreLE~EkERLRREneaLQQRLeELKqELsQ 270 (561)
|..|+++-+++..+..++..+.+.+++|+...+.++-+
T Consensus 20 l~~l~~~~~~l~~~~~r~~ae~en~~~r~~~e~~~~~~ 57 (165)
T PF01025_consen 20 LEELEKEIEELKERLLRLQAEFENYRKRLEKEKEEAKK 57 (165)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Done!