Query 012498
Match_columns 462
No_of_seqs 15 out of 17
Neff 2.3
Searched_HMMs 46136
Date Fri Mar 29 03:16:37 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/012498.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/012498hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00038 Filament: Intermediat 97.4 0.13 2.9E-06 48.8 28.3 117 12-138 13-130 (312)
2 PRK09039 hypothetical protein; 97.3 0.056 1.2E-06 54.1 20.6 115 43-185 17-148 (343)
3 PF10174 Cast: RIM-binding pro 96.8 0.65 1.4E-05 51.9 25.0 115 16-139 2-135 (775)
4 PHA02562 46 endonuclease subun 96.4 0.84 1.8E-05 46.4 21.4 75 15-89 172-247 (562)
5 KOG0161 Myosin class II heavy 95.2 11 0.00025 46.3 35.4 182 58-240 1297-1482(1930)
6 COG1196 Smc Chromosome segrega 94.7 10 0.00022 43.5 32.8 49 259-307 969-1017(1163)
7 PF05667 DUF812: Protein of un 93.7 13 0.00028 40.7 21.2 88 239-329 447-534 (594)
8 TIGR02168 SMC_prok_B chromosom 93.7 12 0.00027 40.5 32.3 19 233-251 966-984 (1179)
9 TIGR02169 SMC_prok_A chromosom 93.6 13 0.00029 40.6 33.4 20 234-253 953-972 (1164)
10 TIGR00606 rad50 rad50. This fa 92.6 25 0.00054 41.0 27.3 45 54-98 223-267 (1311)
11 PRK10884 SH3 domain-containing 92.1 2.6 5.6E-05 40.3 11.8 72 11-98 87-158 (206)
12 PF15070 GOLGA2L5: Putative go 92.0 13 0.00029 40.8 18.4 173 11-198 44-216 (617)
13 TIGR02168 SMC_prok_B chromosom 92.0 21 0.00046 38.8 34.8 25 13-37 673-697 (1179)
14 PRK04863 mukB cell division pr 91.9 36 0.00078 41.1 25.6 45 51-98 282-326 (1486)
15 PF08614 ATG16: Autophagy prot 91.8 2.2 4.8E-05 39.2 10.7 42 80-121 78-119 (194)
16 PF00038 Filament: Intermediat 91.7 13 0.00027 35.6 29.8 194 113-319 63-278 (312)
17 TIGR02169 SMC_prok_A chromosom 91.5 25 0.00054 38.5 36.0 33 56-88 231-263 (1164)
18 PF10174 Cast: RIM-binding pro 91.2 31 0.00068 39.1 28.8 174 11-184 136-339 (775)
19 PF12325 TMF_TATA_bd: TATA ele 90.6 7.3 0.00016 34.7 12.3 101 42-168 11-111 (120)
20 PRK02224 chromosome segregatio 90.4 31 0.00066 37.8 32.4 26 233-258 483-508 (880)
21 PRK09039 hypothetical protein; 89.6 25 0.00055 35.6 21.0 59 15-83 44-102 (343)
22 PRK02224 chromosome segregatio 89.6 36 0.00077 37.3 33.9 24 233-256 476-499 (880)
23 PF05557 MAD: Mitotic checkpoi 88.1 1.9 4.1E-05 46.7 8.2 124 151-290 501-636 (722)
24 KOG0161 Myosin class II heavy 86.0 1E+02 0.0023 38.6 38.6 180 11-190 1316-1514(1930)
25 PRK11637 AmiB activator; Provi 85.9 44 0.00095 34.1 22.9 35 52-86 37-71 (428)
26 PF10168 Nup88: Nuclear pore c 85.5 20 0.00044 40.0 14.3 28 294-321 683-710 (717)
27 PF00261 Tropomyosin: Tropomyo 85.3 35 0.00076 32.5 16.1 51 141-191 178-228 (237)
28 PRK03918 chromosome segregatio 84.2 68 0.0015 34.9 30.9 63 15-77 410-481 (880)
29 KOG0612 Rho-associated, coiled 84.1 1.1E+02 0.0023 37.1 25.9 93 58-150 468-561 (1317)
30 PF04912 Dynamitin: Dynamitin 83.9 53 0.0011 33.4 15.9 136 10-162 87-225 (388)
31 PF12718 Tropomyosin_1: Tropom 83.5 34 0.00074 30.9 16.2 123 132-281 7-129 (143)
32 PRK03918 chromosome segregatio 82.5 80 0.0017 34.4 29.6 47 136-182 235-281 (880)
33 COG1196 Smc Chromosome segrega 82.0 1.1E+02 0.0023 35.6 38.3 41 402-443 974-1015(1163)
34 PF04849 HAP1_N: HAP1 N-termin 81.0 61 0.0013 33.4 14.6 79 12-98 162-246 (306)
35 KOG0995 Centromere-associated 81.0 99 0.0021 34.5 23.1 175 50-286 216-390 (581)
36 TIGR03185 DNA_S_dndD DNA sulfu 80.6 90 0.0019 33.8 23.8 47 13-62 265-311 (650)
37 PF09726 Macoilin: Transmembra 80.4 1.1E+02 0.0023 34.5 26.0 90 152-274 544-633 (697)
38 PF15070 GOLGA2L5: Putative go 80.0 1E+02 0.0023 34.2 26.1 68 15-98 2-69 (617)
39 PF08232 Striatin: Striatin fa 79.8 4.4 9.6E-05 36.2 5.6 56 113-182 6-61 (134)
40 PF14662 CCDC155: Coiled-coil 79.2 10 0.00022 36.8 8.1 73 12-84 97-180 (193)
41 KOG0999 Microtubule-associated 78.6 1.2E+02 0.0027 34.3 21.1 211 57-293 10-241 (772)
42 cd00632 Prefoldin_beta Prefold 77.9 41 0.00088 28.3 11.1 56 63-130 7-62 (105)
43 smart00787 Spc7 Spc7 kinetocho 77.2 88 0.0019 31.8 16.2 122 56-181 138-260 (312)
44 PHA02562 46 endonuclease subun 76.5 97 0.0021 31.9 24.5 70 171-256 259-330 (562)
45 PF12718 Tropomyosin_1: Tropom 74.8 65 0.0014 29.1 13.9 37 53-89 26-62 (143)
46 PRK11637 AmiB activator; Provi 74.8 1E+02 0.0023 31.5 25.6 35 56-90 90-124 (428)
47 PF01486 K-box: K-box region; 74.7 15 0.00033 30.5 7.0 74 14-87 9-100 (100)
48 PRK10884 SH3 domain-containing 74.7 50 0.0011 31.7 11.3 26 50-75 88-113 (206)
49 TIGR00606 rad50 rad50. This fa 74.3 1.9E+02 0.0041 34.2 35.8 106 147-257 495-602 (1311)
50 KOG0963 Transcription factor/C 73.0 1.7E+02 0.0037 33.1 26.6 56 269-325 374-430 (629)
51 PF10458 Val_tRNA-synt_C: Valy 73.0 33 0.00071 27.0 8.1 58 15-72 2-63 (66)
52 PF09728 Taxilin: Myosin-like 71.9 1.2E+02 0.0025 30.7 16.1 108 60-184 41-152 (309)
53 PF09755 DUF2046: Uncharacteri 71.8 1.3E+02 0.0028 31.2 21.5 36 230-265 227-266 (310)
54 KOG0963 Transcription factor/C 71.5 1.8E+02 0.004 32.8 21.8 112 108-258 164-275 (629)
55 PF01920 Prefoldin_2: Prefoldi 71.2 53 0.0011 26.4 9.3 31 227-257 57-87 (106)
56 KOG0804 Cytoplasmic Zn-finger 71.0 64 0.0014 35.2 12.1 42 143-184 379-420 (493)
57 PF01920 Prefoldin_2: Prefoldi 70.7 25 0.00054 28.3 7.2 74 12-85 14-99 (106)
58 PF10186 Atg14: UV radiation r 70.4 93 0.002 29.0 17.3 72 13-86 23-94 (302)
59 KOG0979 Structural maintenance 70.2 1.7E+02 0.0037 34.8 16.0 166 12-208 197-366 (1072)
60 PF05911 DUF869: Plant protein 69.8 1.8E+02 0.0039 33.4 15.8 59 120-181 111-169 (769)
61 PF12240 Angiomotin_C: Angiomo 69.6 1.1E+02 0.0024 30.2 12.4 47 119-165 100-155 (205)
62 PF02050 FliJ: Flagellar FliJ 69.1 54 0.0012 25.8 12.2 80 14-98 16-95 (123)
63 PF06657 Cep57_MT_bd: Centroso 67.8 27 0.00058 29.0 6.9 54 227-280 12-74 (79)
64 KOG0977 Nuclear envelope prote 67.5 1.4E+02 0.003 33.2 13.9 147 135-304 38-188 (546)
65 PF07888 CALCOCO1: Calcium bin 66.9 2.1E+02 0.0046 31.8 27.6 238 6-275 195-453 (546)
66 PF07889 DUF1664: Protein of u 66.5 76 0.0017 28.8 10.0 86 222-325 27-122 (126)
67 PF08172 CASP_C: CASP C termin 65.5 77 0.0017 31.3 10.7 41 144-184 84-124 (248)
68 PF06248 Zw10: Centromere/kine 64.2 2.1E+02 0.0045 30.7 18.4 52 12-64 9-62 (593)
69 PF05308 Mito_fiss_reg: Mitoch 62.6 6.7 0.00015 38.7 2.9 22 230-251 120-141 (253)
70 PF09738 DUF2051: Double stran 61.8 1.9E+02 0.0042 29.5 14.7 86 92-191 83-171 (302)
71 PF04822 Takusan: Takusan; In 61.3 24 0.00052 30.1 5.6 64 10-88 19-82 (84)
72 PF09730 BicD: Microtubule-ass 60.9 3E+02 0.0065 31.5 15.3 36 53-88 32-67 (717)
73 KOG4643 Uncharacterized coiled 60.6 3.7E+02 0.0081 32.5 21.3 108 73-187 213-328 (1195)
74 TIGR02680 conserved hypothetic 60.5 3.7E+02 0.008 32.3 19.0 113 42-161 256-383 (1353)
75 COG2825 HlpA Outer membrane pr 60.0 1.5E+02 0.0032 27.7 13.2 47 146-201 97-143 (170)
76 PF04977 DivIC: Septum formati 58.9 21 0.00045 27.5 4.5 36 53-88 15-50 (80)
77 PF05700 BCAS2: Breast carcino 56.9 1.7E+02 0.0037 27.9 11.1 90 15-110 106-195 (221)
78 PF01576 Myosin_tail_1: Myosin 56.4 3.7 8E-05 46.1 0.0 155 16-186 207-368 (859)
79 KOG0250 DNA repair protein RAD 55.3 4.5E+02 0.0097 31.7 26.2 147 101-276 306-452 (1074)
80 COG0419 SbcC ATPase involved i 55.2 3.6E+02 0.0078 30.5 31.3 38 60-97 272-309 (908)
81 PF15035 Rootletin: Ciliary ro 54.9 1.9E+02 0.0042 27.4 11.1 85 11-98 17-114 (182)
82 PF15397 DUF4618: Domain of un 54.0 2.5E+02 0.0054 28.4 15.0 100 154-257 7-106 (258)
83 PF04111 APG6: Autophagy prote 53.8 1.9E+02 0.0042 29.1 11.4 37 133-169 86-122 (314)
84 PF13514 AAA_27: AAA domain 53.7 4.1E+02 0.0089 30.8 20.3 28 12-39 745-772 (1111)
85 PF03962 Mnd1: Mnd1 family; I 53.1 2E+02 0.0044 27.1 10.8 47 143-192 107-153 (188)
86 PRK04778 septation ring format 52.1 3.3E+02 0.0073 29.3 25.7 82 59-140 253-339 (569)
87 KOG2991 Splicing regulator [RN 51.7 3E+02 0.0066 28.7 17.0 194 14-284 67-267 (330)
88 PF06005 DUF904: Protein of un 50.8 1.4E+02 0.0031 24.6 8.4 59 248-306 6-67 (72)
89 PF07139 DUF1387: Protein of u 50.4 3.1E+02 0.0068 28.5 13.7 113 54-202 149-264 (302)
90 PF11802 CENP-K: Centromere-as 49.8 3.1E+02 0.0066 28.2 12.4 191 14-221 56-257 (268)
91 cd00632 Prefoldin_beta Prefold 49.5 1.6E+02 0.0034 24.8 11.0 45 206-255 42-86 (105)
92 PF11629 Mst1_SARAH: C termina 49.1 59 0.0013 25.9 5.5 38 268-305 9-46 (49)
93 PF09789 DUF2353: Uncharacteri 48.4 51 0.0011 34.0 6.6 67 11-77 80-148 (319)
94 PF10473 CENP-F_leu_zip: Leuci 48.1 2.3E+02 0.005 26.2 14.6 28 62-89 52-79 (140)
95 TIGR02338 gimC_beta prefoldin, 47.5 1.8E+02 0.0039 24.8 10.6 94 63-183 11-104 (110)
96 PF05622 HOOK: HOOK protein; 46.7 6.5 0.00014 42.8 0.0 122 64-186 269-403 (713)
97 PF13851 GAS: Growth-arrest sp 45.7 2.7E+02 0.0059 26.4 12.7 96 12-115 57-154 (201)
98 PF02403 Seryl_tRNA_N: Seryl-t 45.6 1.6E+02 0.0034 24.5 8.0 25 13-37 39-63 (108)
99 PRK11281 hypothetical protein; 45.2 6.2E+02 0.013 30.4 20.4 162 14-185 125-331 (1113)
100 PF07083 DUF1351: Protein of u 44.8 2.9E+02 0.0062 26.4 12.3 110 135-254 60-170 (215)
101 PRK15178 Vi polysaccharide exp 44.6 2.9E+02 0.0062 29.8 11.5 105 11-137 280-384 (434)
102 PF05064 Nsp1_C: Nsp1-like C-t 44.4 74 0.0016 27.7 6.1 29 106-135 28-56 (116)
103 KOG2129 Uncharacterized conser 44.4 2.8E+02 0.0061 30.6 11.4 13 86-98 260-272 (552)
104 TIGR03007 pepcterm_ChnLen poly 44.1 2.7E+02 0.0059 28.6 10.9 29 12-48 249-277 (498)
105 PF09304 Cortex-I_coil: Cortex 43.8 2.5E+02 0.0054 25.4 10.6 34 144-177 56-89 (107)
106 TIGR02231 conserved hypothetic 43.5 2.7E+02 0.0058 29.3 11.0 43 137-179 129-171 (525)
107 KOG0996 Structural maintenance 43.5 7.1E+02 0.015 30.6 28.2 155 152-323 857-1021(1293)
108 PF12128 DUF3584: Protein of u 42.8 6.4E+02 0.014 29.8 33.0 64 101-164 785-848 (1201)
109 COG1579 Zn-ribbon protein, pos 42.6 3.6E+02 0.0079 27.0 16.5 60 131-190 95-154 (239)
110 PF06005 DUF904: Protein of un 42.1 1E+02 0.0022 25.5 6.2 40 412-451 18-57 (72)
111 TIGR01843 type_I_hlyD type I s 41.9 3.4E+02 0.0074 26.5 22.4 37 50-86 125-161 (423)
112 PF12808 Mto2_bdg: Micro-tubul 41.7 42 0.0009 26.7 3.7 28 227-254 24-51 (52)
113 PF03962 Mnd1: Mnd1 family; I 41.5 3.1E+02 0.0067 25.9 10.8 70 11-85 70-140 (188)
114 COG2433 Uncharacterized conser 40.7 4.2E+02 0.0091 30.3 12.3 91 58-180 418-508 (652)
115 PF07200 Mod_r: Modifier of ru 40.6 2.5E+02 0.0054 24.5 10.5 39 57-98 29-67 (150)
116 PRK09343 prefoldin subunit bet 40.5 2.6E+02 0.0055 24.6 10.4 94 64-184 16-109 (121)
117 PF04156 IncA: IncA protein; 39.7 2.8E+02 0.0061 24.9 15.4 18 168-185 166-183 (191)
118 COG1579 Zn-ribbon protein, pos 39.3 4.1E+02 0.0089 26.6 19.0 178 146-374 38-226 (239)
119 KOG0933 Structural maintenance 39.0 8E+02 0.017 29.9 27.8 58 12-76 669-729 (1174)
120 TIGR03789 pdsO proteobacterial 36.9 94 0.002 30.6 6.2 52 382-434 79-130 (239)
121 PF01017 STAT_alpha: STAT prot 36.5 2.9E+02 0.0062 25.6 8.9 95 55-162 2-98 (182)
122 TIGR02338 gimC_beta prefoldin, 36.3 2.7E+02 0.0059 23.7 9.5 78 12-89 19-108 (110)
123 PF12325 TMF_TATA_bd: TATA ele 36.2 3.2E+02 0.007 24.5 12.4 98 61-183 15-112 (120)
124 PF09789 DUF2353: Uncharacteri 36.0 5.3E+02 0.011 26.9 19.0 21 17-37 16-36 (319)
125 PF07047 OPA3: Optic atrophy 3 35.6 72 0.0016 28.4 4.8 34 134-167 100-133 (134)
126 PF05529 Bap31: B-cell recepto 35.4 2.5E+02 0.0054 25.7 8.3 38 139-176 154-191 (192)
127 PF05529 Bap31: B-cell recepto 35.4 2.2E+02 0.0048 26.1 8.0 65 16-82 117-181 (192)
128 PF00170 bZIP_1: bZIP transcri 34.7 1.5E+02 0.0032 22.9 5.8 37 146-182 26-62 (64)
129 PF10186 Atg14: UV radiation r 34.7 3.8E+02 0.0083 25.0 15.2 39 146-184 63-101 (302)
130 PF08317 Spc7: Spc7 kinetochor 34.7 4.8E+02 0.011 26.1 16.5 52 12-73 151-202 (325)
131 PF09832 DUF2059: Uncharacteri 34.3 1E+02 0.0023 23.3 4.9 43 90-133 4-46 (64)
132 KOG0642 Cell-cycle nuclear pro 34.3 29 0.00063 38.4 2.5 44 126-181 33-76 (577)
133 PF13851 GAS: Growth-arrest sp 34.2 4.2E+02 0.009 25.2 16.7 73 103-200 68-141 (201)
134 PF05667 DUF812: Protein of un 34.0 7.1E+02 0.015 27.8 18.6 40 205-255 378-417 (594)
135 KOG4657 Uncharacterized conser 33.5 1.3E+02 0.0028 30.5 6.6 68 16-86 50-117 (246)
136 PF10474 DUF2451: Protein of u 32.9 4.7E+02 0.01 25.4 10.6 81 214-300 72-154 (234)
137 PF02996 Prefoldin: Prefoldin 32.9 1.2E+02 0.0027 25.1 5.5 79 11-89 4-118 (120)
138 PLN02939 transferase, transfer 32.1 9.5E+02 0.021 28.7 17.4 30 128-157 152-181 (977)
139 KOG0996 Structural maintenance 32.0 1.1E+03 0.023 29.3 21.0 124 64-187 860-1011(1293)
140 PF07106 TBPIP: Tat binding pr 31.9 3.8E+02 0.0083 24.1 10.5 76 11-88 73-150 (169)
141 PF04999 FtsL: Cell division p 31.9 1.2E+02 0.0026 24.9 5.2 43 45-87 25-67 (97)
142 PF03980 Nnf1: Nnf1 ; InterPr 31.2 94 0.002 26.1 4.6 47 41-87 59-105 (109)
143 KOG3215 Uncharacterized conser 30.9 5.7E+02 0.012 25.8 12.3 94 58-166 29-123 (222)
144 PF10805 DUF2730: Protein of u 30.5 98 0.0021 26.6 4.6 39 230-268 63-106 (106)
145 PF09403 FadA: Adhesion protei 30.5 4.2E+02 0.0091 24.1 12.2 65 55-125 27-96 (126)
146 PF01166 TSC22: TSC-22/dip/bun 30.5 43 0.00093 27.5 2.3 32 236-268 11-42 (59)
147 PF03148 Tektin: Tektin family 30.2 6.3E+02 0.014 26.1 17.9 192 233-453 72-285 (384)
148 TIGR01005 eps_transp_fam exopo 30.0 7.7E+02 0.017 27.1 18.1 48 133-184 346-393 (754)
149 PF08317 Spc7: Spc7 kinetochor 30.0 5.8E+02 0.013 25.6 16.9 97 56-162 143-239 (325)
150 PRK00409 recombination and DNA 29.9 8.8E+02 0.019 27.7 14.4 61 37-97 493-555 (782)
151 TIGR02209 ftsL_broad cell divi 29.7 1.4E+02 0.0031 23.5 5.1 30 58-87 27-56 (85)
152 PF07321 YscO: Type III secret 29.5 3.5E+02 0.0076 25.2 8.3 49 50-98 76-124 (152)
153 PRK04778 septation ring format 29.4 7.5E+02 0.016 26.7 30.4 76 116-201 73-157 (569)
154 PF06156 DUF972: Protein of un 29.4 1E+02 0.0023 27.0 4.6 38 54-91 14-51 (107)
155 PF07798 DUF1640: Protein of u 28.5 4.7E+02 0.01 24.0 10.1 73 233-305 74-158 (177)
156 PF06810 Phage_GP20: Phage min 28.5 1.6E+02 0.0035 27.0 5.9 59 125-184 37-99 (155)
157 PHA02047 phage lambda Rz1-like 28.3 2.7E+02 0.0058 25.1 6.9 57 233-314 28-84 (101)
158 PF09726 Macoilin: Transmembra 28.3 5.6E+02 0.012 29.1 11.0 94 11-110 539-635 (697)
159 PF12711 Kinesin-relat_1: Kine 28.0 87 0.0019 27.0 3.8 45 40-85 10-60 (86)
160 PF13094 CENP-Q: CENP-Q, a CEN 27.7 3.2E+02 0.0069 24.4 7.5 34 224-257 19-52 (160)
161 PF05622 HOOK: HOOK protein; 27.5 20 0.00044 39.1 0.0 105 14-118 402-523 (713)
162 PRK05431 seryl-tRNA synthetase 27.4 2.2E+02 0.0047 29.8 7.3 22 14-35 39-60 (425)
163 PF02183 HALZ: Homeobox associ 27.4 1.4E+02 0.003 22.8 4.4 37 59-98 2-38 (45)
164 cd00890 Prefoldin Prefoldin is 27.2 3.7E+02 0.0079 22.4 7.6 41 48-88 87-127 (129)
165 COG1711 DNA replication initia 27.1 1.9E+02 0.0042 28.9 6.5 82 231-323 31-112 (223)
166 PF06698 DUF1192: Protein of u 27.1 73 0.0016 25.8 3.0 37 214-252 12-48 (59)
167 PRK10929 putative mechanosensi 27.1 1.2E+03 0.026 28.2 28.6 56 12-75 67-122 (1109)
168 PF01813 ATP-synt_D: ATP synth 27.0 4.3E+02 0.0094 24.4 8.5 36 123-163 11-46 (196)
169 KOG0976 Rho/Rac1-interacting s 27.0 1.2E+03 0.026 28.2 15.8 142 12-185 346-494 (1265)
170 cd07628 BAR_Atg24p The Bin/Amp 26.6 5.2E+02 0.011 24.0 10.1 79 12-122 95-178 (185)
171 KOG1656 Protein involved in gl 26.4 4.5E+02 0.0097 26.5 8.8 114 222-340 5-152 (221)
172 PF07926 TPR_MLP1_2: TPR/MLP1/ 26.3 4.5E+02 0.0097 23.1 15.4 75 98-175 53-127 (132)
173 PF07352 Phage_Mu_Gam: Bacteri 26.3 4.3E+02 0.0093 23.6 8.1 60 142-201 6-66 (149)
174 smart00338 BRLZ basic region l 26.2 2.8E+02 0.006 21.4 6.0 38 146-183 26-63 (65)
175 KOG0612 Rho-associated, coiled 26.2 1.3E+03 0.029 28.6 29.3 242 14-273 469-755 (1317)
176 PF02183 HALZ: Homeobox associ 26.2 80 0.0017 24.0 3.0 22 235-256 22-43 (45)
177 KOG0483 Transcription factor H 26.0 80 0.0017 30.5 3.7 32 56-87 106-137 (198)
178 TIGR00309 V_ATPase_subD H(+)-t 25.9 5.7E+02 0.012 24.2 12.4 54 223-278 119-175 (209)
179 KOG0018 Structural maintenance 25.9 1.3E+03 0.028 28.3 15.9 63 280-343 487-561 (1141)
180 KOG4673 Transcription factor T 25.8 1.2E+03 0.025 27.7 17.2 71 114-184 368-440 (961)
181 PF12711 Kinesin-relat_1: Kine 25.8 2.3E+02 0.0049 24.6 5.9 58 395-455 9-66 (86)
182 PF05266 DUF724: Protein of un 25.7 5.9E+02 0.013 24.3 10.7 36 112-147 87-122 (190)
183 PRK00373 V-type ATP synthase s 25.7 5.7E+02 0.012 24.1 9.1 36 124-164 22-57 (204)
184 PF11365 DUF3166: Protein of u 25.7 91 0.002 27.4 3.6 33 60-93 13-45 (96)
185 PF04065 Not3: Not1 N-terminal 25.5 1.6E+02 0.0035 29.1 5.7 82 230-325 127-208 (233)
186 PF15294 Leu_zip: Leucine zipp 25.1 6.5E+02 0.014 25.9 9.9 91 230-320 130-225 (278)
187 PF14131 DUF4298: Domain of un 24.8 3.2E+02 0.007 23.0 6.6 63 156-219 3-70 (90)
188 PF08077 Cm_res_leader: Chlora 24.8 11 0.00024 24.2 -1.5 11 41-51 2-13 (17)
189 cd00890 Prefoldin Prefoldin is 24.7 4.1E+02 0.0089 22.1 10.1 29 228-256 83-111 (129)
190 PRK13694 hypothetical protein; 24.5 2.6E+02 0.0056 24.4 6.0 36 11-46 13-48 (83)
191 PF07111 HCR: Alpha helical co 24.5 1.2E+03 0.025 27.3 22.8 33 223-255 240-272 (739)
192 PF15397 DUF4618: Domain of un 24.3 7.7E+02 0.017 25.1 18.0 26 231-256 199-224 (258)
193 PF05911 DUF869: Plant protein 24.3 1.2E+03 0.025 27.2 17.3 53 132-184 610-662 (769)
194 PF14552 Tautomerase_2: Tautom 24.2 63 0.0014 26.9 2.3 36 191-226 46-82 (82)
195 PF08172 CASP_C: CASP C termin 24.0 7.2E+02 0.016 24.7 12.0 33 149-181 2-34 (248)
196 smart00502 BBC B-Box C-termina 23.6 3.9E+02 0.0084 21.4 13.0 59 214-272 61-124 (127)
197 KOG0946 ER-Golgi vesicle-tethe 23.6 1.3E+03 0.029 27.6 15.3 118 57-193 666-832 (970)
198 PF09006 Surfac_D-trimer: Lung 23.4 1.2E+02 0.0027 23.8 3.6 24 234-257 1-24 (46)
199 COG3883 Uncharacterized protei 23.2 8.2E+02 0.018 25.1 19.3 74 125-201 34-110 (265)
200 COG2900 SlyX Uncharacterized p 23.1 3.9E+02 0.0084 22.8 6.7 49 150-201 5-60 (72)
201 PLN02939 transferase, transfer 22.9 1.4E+03 0.03 27.5 19.3 182 17-201 150-385 (977)
202 KOG4117 Heat shock factor bind 22.5 1.1E+02 0.0024 25.9 3.4 29 13-42 37-65 (73)
203 PF10473 CENP-F_leu_zip: Leuci 22.1 6.4E+02 0.014 23.4 16.2 30 56-85 25-54 (140)
204 PF12341 DUF3639: Protein of u 22.1 8.3 0.00018 27.0 -2.7 16 41-56 9-24 (27)
205 PF02388 FemAB: FemAB family; 22.0 4.3E+02 0.0093 27.3 8.1 48 230-281 240-287 (406)
206 TIGR00414 serS seryl-tRNA synt 21.9 4.4E+02 0.0095 27.6 8.3 22 14-35 41-62 (418)
207 PRK15041 methyl-accepting chem 21.7 9.7E+02 0.021 25.4 16.4 31 40-70 391-423 (554)
208 smart00340 HALZ homeobox assoc 21.7 1.3E+02 0.0029 23.6 3.4 33 61-93 4-36 (44)
209 PF00170 bZIP_1: bZIP transcri 21.6 2.7E+02 0.0058 21.5 5.2 34 420-453 27-60 (64)
210 TIGR03007 pepcterm_ChnLen poly 21.6 9E+02 0.02 24.9 19.1 61 11-73 162-222 (498)
211 PRK10636 putative ABC transpor 21.5 3.8E+02 0.0081 29.2 8.0 68 18-88 564-631 (638)
212 KOG0933 Structural maintenance 21.3 1.6E+03 0.034 27.6 27.6 52 54-105 676-728 (1174)
213 KOG3091 Nuclear pore complex, 21.3 9.4E+02 0.02 26.9 10.7 73 15-106 374-448 (508)
214 KOG4360 Uncharacterized coiled 21.2 4.3E+02 0.0094 29.8 8.3 119 165-295 157-303 (596)
215 PF00015 MCPsignal: Methyl-acc 21.2 5.6E+02 0.012 22.4 13.5 48 25-72 41-106 (213)
216 PLN02678 seryl-tRNA synthetase 21.2 3.9E+02 0.0084 28.7 7.9 16 310-325 303-319 (448)
217 PF15035 Rootletin: Ciliary ro 21.1 4.1E+02 0.0088 25.2 7.2 44 144-187 65-115 (182)
218 PF14193 DUF4315: Domain of un 21.1 2.5E+02 0.0055 23.9 5.3 38 234-278 3-40 (83)
219 PF12709 Kinetocho_Slk19: Cent 21.0 2.6E+02 0.0057 24.4 5.4 41 150-190 46-86 (87)
220 COG3707 AmiR Response regulato 20.9 1.6E+02 0.0035 28.8 4.6 42 53-96 123-173 (194)
221 PF06785 UPF0242: Uncharacteri 20.9 9.2E+02 0.02 26.1 10.3 52 53-104 139-190 (401)
222 KOG3958 Putative dynamitin [Cy 20.7 6E+02 0.013 27.1 8.8 42 10-51 87-133 (371)
223 PF05377 FlaC_arch: Flagella a 20.5 2E+02 0.0044 23.2 4.3 32 230-261 12-43 (55)
224 PF01025 GrpE: GrpE; InterPro 20.4 2.6E+02 0.0056 24.6 5.5 52 234-285 13-66 (165)
225 PF05823 Gp-FAR-1: Nematode fa 20.2 4.5E+02 0.0098 24.1 7.1 50 204-256 19-68 (154)
226 KOG2685 Cystoskeletal protein 20.1 1.2E+03 0.025 25.6 18.1 107 204-310 192-303 (421)
No 1
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=97.38 E-value=0.13 Score=48.79 Aligned_cols=117 Identities=17% Similarity=0.238 Sum_probs=87.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
-++-+.||..||.+...|...|..+---.+. ||-+ -...+.+|..|+.++..++.++-.|+-++..+..
T Consensus 13 la~YIekVr~LE~~N~~Le~~i~~~~~~~~~~~~~~----------~~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~ 82 (312)
T PF00038_consen 13 LASYIEKVRFLEQENKRLESEIEELREKKGEEVSRI----------KEMYEEELRELRRQIDDLSKEKARLELEIDNLKE 82 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH---------HHH----------HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHhcccccCccc----------ccchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHH
Confidence 4677899999999999999999999876422 2211 2456888999999999999999999999998877
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhH
Q 012498 91 IKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAK 138 (462)
Q Consensus 91 iK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaK 138 (462)
--..+-.-|..+...+..+|.++.=+..-+-.+.+.|...=-+++-.+
T Consensus 83 e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L~ 130 (312)
T PF00038_consen 83 ELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSLK 130 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHHH
Confidence 666666667788889999999988888777777777766544444444
No 2
>PRK09039 hypothetical protein; Validated
Probab=97.27 E-value=0.056 Score=54.08 Aligned_cols=115 Identities=21% Similarity=0.231 Sum_probs=65.8
Q ss_pred CchHHHhhHH-----------------HHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHh
Q 012498 43 PSYLAVATRM-----------------HFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIK 105 (462)
Q Consensus 43 pgyl~vATRM-----------------~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~K 105 (462)
||||++-|-+ +++-..++++++..|..+++. |+++-+-+.+.
T Consensus 17 pg~vd~~~~ll~~~~f~l~~f~~~q~fLs~~i~~~~~eL~~L~~qIa~---------------------L~e~L~le~~~ 75 (343)
T PRK09039 17 PGFVDALSTLLLVIMFLLTVFVVAQFFLSREISGKDSALDRLNSQIAE---------------------LADLLSLERQG 75 (343)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH---------------------HHHHHHHHHHH
Confidence 9999987754 356677777777777776655 55555555555
Q ss_pred hHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHh
Q 012498 106 NMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNES 185 (462)
Q Consensus 106 n~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~ 185 (462)
+..++..+.=.+.....|=++|+. .| ..-. .......+.+.|+..++..+..++..-...+.+...+..|.+.
T Consensus 76 ~~~l~~~l~~l~~~l~~a~~~r~~--Le--~~~~---~~~~~~~~~~~~~~~l~~~L~~~k~~~se~~~~V~~L~~qI~a 148 (343)
T PRK09039 76 NQDLQDSVANLRASLSAAEAERSR--LQ--ALLA---ELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAA 148 (343)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHH--HH--HHHh---hhhhhcchHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 555555555555555544444431 11 1000 0011233556666666666666666666666666666666663
No 3
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=96.78 E-value=0.65 Score=51.91 Aligned_cols=115 Identities=27% Similarity=0.400 Sum_probs=67.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHH---HhhhhhHH----HHHHHHHHH-------HHhhhhhcch
Q 012498 16 MARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHF---QRTAGLEQ----EIEILKQKI-------AACARENSNL 81 (462)
Q Consensus 16 ~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~---qRta~LEQ----eiE~Lkkkl-------~~c~ren~nL 81 (462)
.+++..++.|.|-|++++|.. +.-.|+.--++-| .|+ -|..++.. ++..++.++ ...-.+.++|
T Consensus 2 q~ql~~~q~E~e~L~~ele~~-~~~l~~~~~~i~~-fwspElkrer~~rkee~a~l~~~k~qlr~~q~e~q~~~~ei~~L 79 (775)
T PF10174_consen 2 QAQLERLQRENERLRRELERK-QSKLGSSMNSIKT-FWSPELKRERALRKEEAAELSRLKEQLRVTQEENQKAQEEIQAL 79 (775)
T ss_pred ccHHHHHHHHHHHHHHHHHHH-HhHHHHHHHhHhc-ccchhhHHHHHHHHHHHHHHHhHHHHHHHHHhhHHHHHHHHHHH
Confidence 468889999999999999987 4444544444433 222 12222222 233344444 4444455677
Q ss_pred HHHHHHH----HHHHHHHHHHHH-HHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHH
Q 012498 82 QEELSEA----YRIKGQLADLHA-AEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKE 139 (462)
Q Consensus 82 QEELsEA----YRiK~qLadLh~-ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE 139 (462)
|+|| .+ ||+..++-.-.+ .+-... +++ =+|-+.+..||||....|.+....
T Consensus 80 qeEL-r~q~e~~rL~~~~e~~~~e~e~l~~--ld~----~~~q~~rl~~E~er~~~El~~lr~ 135 (775)
T PF10174_consen 80 QEEL-RAQRELNRLQQELEKAQYEFESLQE--LDK----AQEQFERLQAERERLQRELERLRK 135 (775)
T ss_pred HHHH-HHhhHHHHHHHHhhhcccccchhhh--hhh----HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 8888 55 555555443311 111111 222 367788889999999999888773
No 4
>PHA02562 46 endonuclease subunit; Provisional
Probab=96.43 E-value=0.84 Score=46.42 Aligned_cols=75 Identities=15% Similarity=0.158 Sum_probs=48.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
+..++.+++.+.+.|+..|+.+=-+-++ +.++.....-....++.++.+++++..+....-.+-.+|++++.+.+
T Consensus 172 ~k~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~~~~~i~~l~~e~~~l~~~~~~l~~~l~~l~~~i~~l~ 247 (562)
T PHA02562 172 NKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLV 247 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4556666666666666666666444443 45555555555566777777777777777777677777777776664
No 5
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=95.18 E-value=11 Score=46.30 Aligned_cols=182 Identities=22% Similarity=0.256 Sum_probs=114.7
Q ss_pred hhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHH-HHH
Q 012498 58 AGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVME-AEK 136 (462)
Q Consensus 58 a~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmE-aEk 136 (462)
.+++.+|+.++.++..-+|.+++|...+..+-+=+..|-+.+--+...-.++++++.==-+-++++-+.=+..+.. .|.
T Consensus 1297 ~~~~~qle~~k~qle~e~r~k~~l~~~l~~l~~e~~~l~e~leee~e~~~~l~r~lsk~~~e~~~~~~k~e~~~~~~~ee 1376 (1930)
T KOG0161|consen 1297 QALESQLEELKRQLEEETREKSALENALRQLEHELDLLREQLEEEQEAKNELERKLSKANAELAQWKKKFEEEVLQRLEE 1376 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4677889999999999999999999988887776666666666666666666666655444455544444444443 344
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccch---hh
Q 012498 137 AKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDK---CA 213 (462)
Q Consensus 137 aKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~K---cs 213 (462)
+.|.-...-..+.+.+++++.+...+....+..-.||.++..+.--++....++. |.+..+...+-.=..|..+ -+
T Consensus 1377 lee~kk~l~~~lq~~qe~~e~~~~~~~~Lek~k~~l~~el~d~~~d~~~~~~~~~-~le~k~k~f~k~l~e~k~~~e~l~ 1455 (1930)
T KOG0161|consen 1377 LEELKKKLQQRLQELEEQIEAANAKNASLEKAKNRLQQELEDLQLDLERSRAAVA-ALEKKQKRFEKLLAEWKKKLEKLQ 1455 (1930)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455567888899999999999999888888877777665544432222221 2222222221111344444 45
Q ss_pred hhccccccccccCCcchHHHHHHHHHH
Q 012498 214 CLLLDSAEMWSFNDTSTSKYISALEDE 240 (462)
Q Consensus 214 ~LL~Ds~~~Wsfn~tstskyisaLEeE 240 (462)
..++.+...|.=-+|+..++-.+|++-
T Consensus 1456 ~Eld~aq~e~r~~~tel~kl~~~lee~ 1482 (1930)
T KOG0161|consen 1456 AELDAAQRELRQLSTELQKLKNALEEL 1482 (1930)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 556666666766667666665555544
No 6
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=94.72 E-value=10 Score=43.51 Aligned_cols=49 Identities=8% Similarity=0.128 Sum_probs=25.7
Q ss_pred HHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhh
Q 012498 259 LEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEE 307 (462)
Q Consensus 259 LeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~e 307 (462)
++-...+.++.+.|..+..-+++=...-...+......-|...|.....
T Consensus 969 iee~e~~~~r~~~l~~~~~dl~~a~~~l~~~i~~~d~~~~~~f~~~f~~ 1017 (1163)
T COG1196 969 IEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERFKETFDK 1017 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5556667777777766665554444444444444444444444443333
No 7
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=93.70 E-value=13 Score=40.72 Aligned_cols=88 Identities=16% Similarity=0.243 Sum_probs=62.6
Q ss_pred HHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhhcchhhhhhHHH
Q 012498 239 DELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDV 318 (462)
Q Consensus 239 eE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s~i~s~v~~ 318 (462)
+++..++.++..+...+|-==+.-+-|.+.+..|-|. ..-......|.++-+---+|+++|.+||.|-+. |..=||.
T Consensus 447 ~~ik~~r~~~k~~~~e~~~Kee~~~qL~~e~e~~~k~--~~Rs~Yt~RIlEIv~NI~KQk~eI~KIl~DTr~-lQkeiN~ 523 (594)
T PF05667_consen 447 QEIKELREEIKEIEEEIRQKEELYKQLVKELEKLPKD--VNRSAYTRRILEIVKNIRKQKEEIEKILSDTRE-LQKEINS 523 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC--CCHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHH
Confidence 5666777777777777776655555565555555544 344555667888888888999999999999875 5667899
Q ss_pred HHhhhcccccc
Q 012498 319 IEEKTQHCDDV 329 (462)
Q Consensus 319 ieekl~~~~n~ 329 (462)
+..||.-.+.|
T Consensus 524 l~gkL~RtF~v 534 (594)
T PF05667_consen 524 LTGKLDRTFTV 534 (594)
T ss_pred HHHHHHhHHHH
Confidence 99999444455
No 8
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=93.69 E-value=12 Score=40.48 Aligned_cols=19 Identities=11% Similarity=0.198 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHhHHHH
Q 012498 233 YISALEDELEKTRSSVENL 251 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~L 251 (462)
.|..|+.+++.|.+.|+.+
T Consensus 966 ~~~~l~~~i~~lg~aiee~ 984 (1179)
T TIGR02168 966 DEEEARRRLKRLENKIKEL 984 (1179)
T ss_pred CHHHHHHHHHHHHHHHHHc
Confidence 3455555555555544443
No 9
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=93.60 E-value=13 Score=40.58 Aligned_cols=20 Identities=15% Similarity=0.468 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHhHHHHHh
Q 012498 234 ISALEDELEKTRSSVENLQS 253 (462)
Q Consensus 234 isaLEeE~e~lr~~i~~LQs 253 (462)
++.++.+++.+.+.|+++-.
T Consensus 953 ~~~l~~~l~~l~~~i~~l~~ 972 (1164)
T TIGR02169 953 LEDVQAELQRVEEEIRALEP 972 (1164)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 45777888888888877665
No 10
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.62 E-value=25 Score=41.01 Aligned_cols=45 Identities=16% Similarity=0.197 Sum_probs=26.2
Q ss_pred HHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH
Q 012498 54 FQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 54 ~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL 98 (462)
-.+.+.++..++.++.....|..+-..+++.+.+.+.+...+..+
T Consensus 223 r~~l~~~q~kie~~~~~~~~le~ei~~l~~~~~~l~~~~~~~~~l 267 (1311)
T TIGR00606 223 RDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKL 267 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455555666666666666666666666666666555554444
No 11
>PRK10884 SH3 domain-containing protein; Provisional
Probab=92.12 E-value=2.6 Score=40.25 Aligned_cols=72 Identities=21% Similarity=0.296 Sum_probs=57.7
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
...++..|+..|+.|-.+|+..+..+=-+ +.+|++.|.+.+....+.+.....+|..|.++|..
T Consensus 87 ~~p~~~~rlp~le~el~~l~~~l~~~~~~-------------~~~~~~~l~~~~~~~~~~~~~L~~~n~~L~~~l~~--- 150 (206)
T PRK10884 87 TTPSLRTRVPDLENQVKTLTDKLNNIDNT-------------WNQRTAEMQQKVAQSDSVINGLKEENQKLKNQLIV--- 150 (206)
T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHhH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---
Confidence 34567788899999999988877774322 67999999999999999999999999999999987
Q ss_pred HHHHHHHH
Q 012498 91 IKGQLADL 98 (462)
Q Consensus 91 iK~qLadL 98 (462)
.+..+..|
T Consensus 151 ~~~~~~~l 158 (206)
T PRK10884 151 AQKKVDAA 158 (206)
T ss_pred HHHHHHHH
Confidence 35555444
No 12
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=92.04 E-value=13 Score=40.82 Aligned_cols=173 Identities=19% Similarity=0.278 Sum_probs=92.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
.....+.||+.||+.--+|+.-+...= ....|+-.+..-.=+-.++..|.++++.|..++.+-+++|..|-.-.. .
T Consensus 44 Ek~~~~~~V~eLE~sL~eLk~q~~~~~-~~~~pa~pse~E~~Lq~E~~~L~kElE~L~~qlqaqv~~ne~Ls~L~~---E 119 (617)
T PF15070_consen 44 EKEHDISRVQELERSLSELKNQMAEPP-PPEPPAGPSEVEQQLQAEAEHLRKELESLEEQLQAQVENNEQLSRLNQ---E 119 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccC-CccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H
Confidence 356677888888887777765443311 222222111111123446777999999999999999999987733222 3
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhH
Q 012498 91 IKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNA 170 (462)
Q Consensus 91 iK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~ 170 (462)
-+..|++|-..--....+.+- -++-+|+.=++| .-+-+|-..-..+-+++.+++.+.-.++.+.. ++..
T Consensus 120 qEerL~ELE~~le~~~e~~~D----~~kLLe~lqsdk----~t~SRAlsQN~eLK~QL~Elq~~Fv~ltne~~---elt~ 188 (617)
T PF15070_consen 120 QEERLAELEEELERLQEQQED----RQKLLEQLQSDK----ATASRALSQNRELKEQLAELQDAFVKLTNENM---ELTS 188 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHHHHhhhcccc----hHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhh---HhhH
Confidence 356666662210000111111 112222221111 12333433334444555555555544433221 3457
Q ss_pred hHhhhHHHHHHhhHhHHHHHHHHHHHhh
Q 012498 171 TLRFDLEKQEELNESFKEVINKFYEIRQ 198 (462)
Q Consensus 171 aLQ~dl~~~~eq~e~~~kVI~KFyeiR~ 198 (462)
+||.+.-+-++-...+-.+=.|...++-
T Consensus 189 ~lq~Eq~~~keL~~kl~~l~~~l~~~~e 216 (617)
T PF15070_consen 189 ALQSEQHVKKELQKKLGELQEKLHNLKE 216 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7888887777777777677777776664
No 13
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=92.02 E-value=21 Score=38.79 Aligned_cols=25 Identities=28% Similarity=0.353 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q 012498 13 EALMARIQQLEHERDELRKDIEQLC 37 (462)
Q Consensus 13 e~l~~RI~qLe~ERdEL~KDIEqLC 37 (462)
..+...+..++.+.+++++.++.+-
T Consensus 673 ~~l~~e~~~l~~~~~~l~~~l~~~~ 697 (1179)
T TIGR02168 673 LERRREIEELEEKIEELEEKIAELE 697 (1179)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555666666666665555543
No 14
>PRK04863 mukB cell division protein MukB; Provisional
Probab=91.87 E-value=36 Score=41.13 Aligned_cols=45 Identities=22% Similarity=0.366 Sum_probs=31.3
Q ss_pred HHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH
Q 012498 51 RMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 51 RM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL 98 (462)
|.++.-++|..+......++|...-..-..+.+++. -|+.++..|
T Consensus 282 R~liEEAag~r~rk~eA~kkLe~tE~nL~rI~diL~---ELe~rL~kL 326 (1486)
T PRK04863 282 RVHLEEALELRRELYTSRRQLAAEQYRLVEMARELA---ELNEAESDL 326 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH
Confidence 677888888888777777777776666666666663 456666655
No 15
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=91.82 E-value=2.2 Score=39.21 Aligned_cols=42 Identities=38% Similarity=0.366 Sum_probs=2.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHH
Q 012498 80 NLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMA 121 (462)
Q Consensus 80 nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA 121 (462)
.||+||+++||.+++++.--...-.++.++++...=-++.++
T Consensus 78 ~l~~ELael~r~~~el~~~L~~~~~~l~~l~~~~~~~~~~l~ 119 (194)
T PF08614_consen 78 KLQEELAELYRSKGELAQQLVELNDELQELEKELSEKERRLA 119 (194)
T ss_dssp ------------------------------------HHHHHH
T ss_pred cccccccccccccccccccccccccccchhhhhHHHHHHHHH
Confidence 489999999999999996655444555555554443333333
No 16
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=91.71 E-value=13 Score=35.63 Aligned_cols=194 Identities=16% Similarity=0.145 Sum_probs=96.6
Q ss_pred HHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHH
Q 012498 113 VKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINK 192 (462)
Q Consensus 113 vkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~K 192 (462)
|.-...--|+.-.++|+.-.+++..+.+=+.-.+.....+.-+..+.+.+++..-....|+..+..+++......++-.
T Consensus 63 id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L~eEl~fl~~~he- 141 (312)
T PF00038_consen 63 IDDLSKEKARLELEIDNLKEELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSLKEELEFLKQNHE- 141 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred hhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHHHHHHHHHHhhhh-
Confidence 3333444467777778877887777766666666667777777777777777777777777777777777763333322
Q ss_pred HHHHhhhhhhhhcccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHH------------HhHHHHHhhhhhh--
Q 012498 193 FYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTR------------SSVENLQSKLRMG-- 258 (462)
Q Consensus 193 FyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr------------~~i~~LQskLR~G-- 258 (462)
-+-.++.-.-. -..+.+++++.++..+..|..+-.+.+... .++..++.....+
T Consensus 142 --------eEi~~L~~~~~----~~~~~e~~~~~~~dL~~~L~eiR~~ye~~~~~~~~e~e~~y~~k~~~l~~~~~~~~~ 209 (312)
T PF00038_consen 142 --------EEIEELREQIQ----SSVTVEVDQFRSSDLSAALREIRAQYEEIAQKNREELEEWYQSKLEELRQQSEKSSE 209 (312)
T ss_dssp --------HHHHTTSTT--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred --------hhhhhhhhccc----cccceeecccccccchhhhhhHHHHHHHHHhhhhhhhhhhccccccccccccccccc
Confidence 11111111111 233445555555555666655544433221 3333443333221
Q ss_pred ----HHHHH-HhHHhHHHHHHhhhh---hHHHHHHHHHHHHHhhhHHHHHHHHhhhhcchhhhhhHHHH
Q 012498 259 ----LEIEN-HLKKSVRELEKKIIH---SDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVI 319 (462)
Q Consensus 259 ----LeIen-hLkk~vr~Lekkqi~---~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s~i~s~v~~i 319 (462)
.--|. .+++.+..|+....- -...+.+.|.++.+.|...+......+..=...|..+-..+
T Consensus 210 ~~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~~~~i~~le~el~~l~~~~ 278 (312)
T PF00038_consen 210 ELESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEYQAEIAELEEELAELREEM 278 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHhhhccchhHHHHHHHH
Confidence 11111 234444444433321 24566778888888888777665555444444444444444
No 17
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=91.45 E-value=25 Score=38.54 Aligned_cols=33 Identities=30% Similarity=0.455 Sum_probs=18.2
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
+...+..+++.+..++.....+-..+.+++.+.
T Consensus 231 ~~~~~~~~~~~~~~~l~~~~~~~~~l~~~l~~~ 263 (1164)
T TIGR02169 231 EKEALERQKEAIERQLASLEEELEKLTEEISEL 263 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455556666666665555555555555443
No 18
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=91.16 E-value=31 Score=39.12 Aligned_cols=174 Identities=21% Similarity=0.263 Sum_probs=103.6
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcCCc-hHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCM--QQAGPS-YLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCM--QQaGpg-yl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
..+.+-.||.-++.++|...-.|+.|=- |-.||+ +-...+.-...|.++++..+..|+..+.---.++.-+.++|..
T Consensus 136 ~lE~~q~~~e~~q~~l~~~~eei~kL~e~L~~~g~~~~~~~~~~~~~~~~~~~e~~~~~le~lle~~e~~~~~~r~~l~~ 215 (775)
T PF10174_consen 136 TLEELQLRIETQQQTLDKADEEIEKLQEMLQSKGLSAEAEEEDNEALRRIREAEARIMRLESLLERKEKEHMEAREQLHR 215 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH
Confidence 4677888899999999999999988754 777844 5566666667799999999988888777777777666666665
Q ss_pred HHHHHHH------HHHHHH------HHHHhhH-HHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHH---------
Q 012498 88 AYRIKGQ------LADLHA------AEVIKNM-EAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMS--------- 145 (462)
Q Consensus 88 AYRiK~q------LadLh~------ae~~Kn~-e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~--------- 145 (462)
.|....- +-.+.- +++.++. .+|-.+.-.++.++.+=++||--.-++|--+-.-..|-
T Consensus 216 ~~~~~~~~a~t~alq~~ie~Kd~ki~~lEr~l~~le~Ei~~L~~~~~~~~~~r~~~~k~le~~~s~~~~mK~k~d~~~~e 295 (775)
T PF10174_consen 216 RLQMERDDAETEALQTVIEEKDTKIASLERMLRDLEDEIYRLRSRGELSEADRDRLDKQLEVYKSHSLAMKSKMDRLKLE 295 (775)
T ss_pred HhhcCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 5543211 111111 2333332 25677777777777777788776333333222222222
Q ss_pred -----HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 146 -----QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 146 -----qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
+.+..++.|++.+.+.-.+.+.=-+.|+.++.....+.+
T Consensus 296 L~rk~~E~~~~qt~l~~~~~~~~d~r~hi~~lkesl~~ke~~~~ 339 (775)
T PF10174_consen 296 LSRKKSELEALQTRLETLEEQDSDMRQHIEVLKESLRAKEQEAE 339 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 223334444444444444444444444444444444444
No 19
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=90.64 E-value=7.3 Score=34.74 Aligned_cols=101 Identities=23% Similarity=0.300 Sum_probs=71.0
Q ss_pred CCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHH
Q 012498 42 GPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMA 121 (462)
Q Consensus 42 Gpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA 121 (462)
|...+....||.++ ...+|-|+-.||..++...++...+.+|+....+--..+.. .......+++
T Consensus 11 ~~~~~~~ve~L~s~-lr~~E~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~~~----~~~~~~~L~~---------- 75 (120)
T PF12325_consen 11 GGPSVQLVERLQSQ-LRRLEGELASLQEELARLEAERDELREEIVKLMEENEELRA----LKKEVEELEQ---------- 75 (120)
T ss_pred CCchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHH----------
Confidence 33445666777654 66788899999999999999999999998886655444422 2233333333
Q ss_pred HHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh
Q 012498 122 AAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ 168 (462)
Q Consensus 122 ~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~ 168 (462)
+......+-.++++-+-+-.++++|++.++.+.|.+
T Consensus 76 -----------el~~l~~ry~t~LellGEK~E~veEL~~Dv~DlK~m 111 (120)
T PF12325_consen 76 -----------ELEELQQRYQTLLELLGEKSEEVEELRADVQDLKEM 111 (120)
T ss_pred -----------HHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHH
Confidence 334455777888888888888889998888888854
No 20
>PRK02224 chromosome segregation protein; Provisional
Probab=90.43 E-value=31 Score=37.75 Aligned_cols=26 Identities=19% Similarity=0.337 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHhHHHHHhhhhhh
Q 012498 233 YISALEDELEKTRSSVENLQSKLRMG 258 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~LQskLR~G 258 (462)
-++.|+.+++.++..++.+.+.+...
T Consensus 483 ~~~~le~~l~~~~~~~e~l~~~~~~~ 508 (880)
T PRK02224 483 ELEDLEEEVEEVEERLERAEDLVEAE 508 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555666666666666555555543
No 21
>PRK09039 hypothetical protein; Validated
Probab=89.63 E-value=25 Score=35.60 Aligned_cols=59 Identities=25% Similarity=0.317 Sum_probs=47.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQE 83 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQE 83 (462)
|...|..++.|-++|..-|-. ++..--|=-.|++.|+++|..++.++....+.+.-|+.
T Consensus 44 Ls~~i~~~~~eL~~L~~qIa~----------L~e~L~le~~~~~~l~~~l~~l~~~l~~a~~~r~~Le~ 102 (343)
T PRK09039 44 LSREISGKDSALDRLNSQIAE----------LADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQA 102 (343)
T ss_pred HHHHHhhHHHHHHHHHHHHHH----------HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556788888899999988876 77778888899999999999999988876666554444
No 22
>PRK02224 chromosome segregation protein; Provisional
Probab=89.57 E-value=36 Score=37.25 Aligned_cols=24 Identities=33% Similarity=0.538 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHhHHHHHhhhh
Q 012498 233 YISALEDELEKTRSSVENLQSKLR 256 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~LQskLR 256 (462)
-|..++.+...+.+.++.+..++.
T Consensus 476 ~~~~~~~~~~~le~~l~~~~~~~e 499 (880)
T PRK02224 476 RVEELEAELEDLEEEVEEVEERLE 499 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555566666666666666554
No 23
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=88.12 E-value=1.9 Score=46.71 Aligned_cols=124 Identities=21% Similarity=0.205 Sum_probs=63.0
Q ss_pred HHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCCcch
Q 012498 151 FQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTST 230 (462)
Q Consensus 151 ~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tst 230 (462)
..+++..+++.+....+-+..|+.++..++.+.+.. .+|+ .--...-|+=.|=+.|...|-+. -
T Consensus 501 ~~e~~~~L~~~~~~Le~e~~~L~~~~~~Le~~l~~~--------~L~g-----~~~~~~trVL~lr~NP~~~~~~~---k 564 (722)
T PF05557_consen 501 LSEELNELQKEIEELERENERLRQELEELESELEKL--------TLQG-----EFNPSKTRVLHLRDNPTSKAEQI---K 564 (722)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------CCCT-------BTTTEEEEEESS-HHHHHHHH---H
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------hhcc-----ccCCCCceeeeeCCCcHHHHHHH---H
Confidence 344555555555555555666666666665555411 0111 00122335555556665555443 2
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHhhhhh--------hHHHH----HHhHHhHHHHHHhhhhhHHHHHHHHHHH
Q 012498 231 SKYISALEDELEKTRSSVENLQSKLRM--------GLEIE----NHLKKSVRELEKKIIHSDKFISNAIAEL 290 (462)
Q Consensus 231 skyisaLEeE~e~lr~~i~~LQskLR~--------GLeIe----nhLkk~vr~Lekkqi~~dk~i~ngi~~l 290 (462)
..-+.+|..|++.|++.+..|...-.. |+..- +-|+..+..++|+..-+-.++...+.++
T Consensus 565 ~~~l~~L~~En~~L~~~l~~le~~~~~~~~~~p~~~~~~~~~e~~~l~~~~~~~ekr~~RLkevf~~ks~eF 636 (722)
T PF05557_consen 565 KSTLEALQAENEDLLARLRSLEEGNSQPVDAVPTSSLESQEKEIAELKAELASAEKRNQRLKEVFKAKSQEF 636 (722)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHTTTT----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhcccCCCCCcccccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345778888888888888666532211 12221 2256666666666665655555555544
No 24
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=85.96 E-value=1e+02 Score=38.56 Aligned_cols=180 Identities=24% Similarity=0.278 Sum_probs=97.3
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHH-------HHHhhcCCchHHHhhHHHH-----HhhhhhHHHHHHHHHHHHHhhhhh
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQ-------LCMQQAGPSYLAVATRMHF-----QRTAGLEQEIEILKQKIAACAREN 78 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEq-------LCMQQaGpgyl~vATRM~~-----qRta~LEQeiE~Lkkkl~~c~ren 78 (462)
..-.+...+.+++||.+.|++=+|- |=-+-+-..--++.+|+-+ +|+..++-....+..++.++....
T Consensus 1316 ~k~~l~~~l~~l~~e~~~l~e~leee~e~~~~l~r~lsk~~~e~~~~~~k~e~~~~~~~eelee~kk~l~~~lq~~qe~~ 1395 (1930)
T KOG0161|consen 1316 EKSALENALRQLEHELDLLREQLEEEQEAKNELERKLSKANAELAQWKKKFEEEVLQRLEELEELKKKLQQRLQELEEQI 1395 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3456778899999999988875442 1112222333344555544 344444444333333333322111
Q ss_pred cchHHHHHHHHHHHHH----HHHHH---HHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHH
Q 012498 79 SNLQEELSEAYRIKGQ----LADLH---AAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEF 151 (462)
Q Consensus 79 ~nLQEELsEAYRiK~q----LadLh---~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~ 151 (462)
-.+.---..-=+.|.. +.|+- +.-.+....+|++.+=|.+-+|.-=-..|...-|-+-+..-...-..++..+
T Consensus 1396 e~~~~~~~~Lek~k~~l~~el~d~~~d~~~~~~~~~~le~k~k~f~k~l~e~k~~~e~l~~Eld~aq~e~r~~~tel~kl 1475 (1930)
T KOG0161|consen 1396 EAANAKNASLEKAKNRLQQELEDLQLDLERSRAAVAALEKKQKRFEKLLAEWKKKLEKLQAELDAAQRELRQLSTELQKL 1475 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 1110000000011222 12221 1233445677888887777766544444444444444444444444677777
Q ss_pred HHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHH
Q 012498 152 QTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190 (462)
Q Consensus 152 ~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI 190 (462)
..+++|...++....+.|..|+.++..++.+....-+.+
T Consensus 1476 ~~~lee~~e~~e~l~renk~l~~ei~dl~~~~~e~~k~v 1514 (1930)
T KOG0161|consen 1476 KNALEELLEQLEELRRENKNLSQEIEDLEEQKDEGGKRV 1514 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 888888888888999999999999888888877444433
No 25
>PRK11637 AmiB activator; Provisional
Probab=85.88 E-value=44 Score=34.13 Aligned_cols=35 Identities=14% Similarity=0.182 Sum_probs=20.6
Q ss_pred HHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHH
Q 012498 52 MHFQRTAGLEQEIEILKQKIAACARENSNLQEELS 86 (462)
Q Consensus 52 M~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELs 86 (462)
+++.-++.++++++.+++++...-.+-..++.++.
T Consensus 37 ~~~~~~~~~~~~l~~l~~qi~~~~~~i~~~~~~~~ 71 (428)
T PRK11637 37 AFSAHASDNRDQLKSIQQDIAAKEKSVRQQQQQRA 71 (428)
T ss_pred hhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444566777788887776655444444444444
No 26
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=85.46 E-value=20 Score=39.97 Aligned_cols=28 Identities=11% Similarity=0.200 Sum_probs=24.6
Q ss_pred hhHHHHHHHHhhhhcchhhhhhHHHHHh
Q 012498 294 HSQLRVHVVNSLEEGRSHIKSISDVIEE 321 (462)
Q Consensus 294 h~~~R~~Im~lL~ee~s~i~s~v~~iee 321 (462)
=..|+..|-++|.+....|+.+|+.|..
T Consensus 683 ~~~Q~~~I~~iL~~~~~~I~~~v~~ik~ 710 (717)
T PF10168_consen 683 SESQKRTIKEILKQQGEEIDELVKQIKN 710 (717)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3568889999999999999999998864
No 27
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=85.26 E-value=35 Score=32.45 Aligned_cols=51 Identities=24% Similarity=0.263 Sum_probs=30.8
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHH
Q 012498 141 EELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVIN 191 (462)
Q Consensus 141 Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~ 191 (462)
-....+++.+.+.|.+.++..+....+..+.|.-+|...++....+.+-++
T Consensus 178 i~~L~~~lkeaE~Rae~aE~~v~~Le~~id~le~eL~~~k~~~~~~~~eld 228 (237)
T PF00261_consen 178 IRDLEEKLKEAENRAEFAERRVKKLEKEIDRLEDELEKEKEKYKKVQEELD 228 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445666666666666666666666666666666666666655555443
No 28
>PRK03918 chromosome segregation protein; Provisional
Probab=84.17 E-value=68 Score=34.92 Aligned_cols=63 Identities=25% Similarity=0.325 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---------HHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhh
Q 012498 15 LMARIQQLEHERDELRKDIEQL---------CMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARE 77 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqL---------CMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~re 77 (462)
+..++.+++.+.++|.+-++.| |-+.=||.|-.-.+-=+-++...|+.+|+.+++++..+..+
T Consensus 410 l~~~~~~~~~~i~eL~~~l~~L~~~~~~Cp~c~~~L~~~~~~el~~~~~~ei~~l~~~~~~l~~~~~~l~~~ 481 (880)
T PRK03918 410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKE 481 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555555444322 33444444433332334445555666666666655554443
No 29
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms]
Probab=84.13 E-value=1.1e+02 Score=37.11 Aligned_cols=93 Identities=18% Similarity=0.062 Sum_probs=44.8
Q ss_pred hhhHHHHHHHHHHHHHhhh-hhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHH
Q 012498 58 AGLEQEIEILKQKIAACAR-ENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEK 136 (462)
Q Consensus 58 a~LEQeiE~Lkkkl~~c~r-en~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEk 136 (462)
++|++.|+.++.....|.| +---+|.+.+++-+.=++..+.-..--..+.+++.+++=-|-..+.++-+-+++.-+.-.
T Consensus 468 keL~e~i~~lk~~~~el~~~q~~l~q~~~ke~~ek~~~~~~~~~~l~~~~~~~~eele~~q~~~~~~~~~~~kv~~~rk~ 547 (1317)
T KOG0612|consen 468 KELEETIEKLKSEESELQREQKALLQHEQKEVEEKLSEEEAKKRKLEALVRQLEEELEDAQKKNDNAADSLEKVNSLRKQ 547 (1317)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHH
Confidence 4555555555555555554 222244444444444344333333333444444544444455555555555555555555
Q ss_pred hHHHHHHHHHHHHH
Q 012498 137 AKEKEELMSQKFNE 150 (462)
Q Consensus 137 aKE~Ee~m~qk~~~ 150 (462)
+.+.+..|..++..
T Consensus 548 le~~~~d~~~e~~~ 561 (1317)
T KOG0612|consen 548 LEEAELDMRAESED 561 (1317)
T ss_pred HHHhhhhhhhhHHH
Confidence 55555555544443
No 30
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=83.91 E-value=53 Score=33.41 Aligned_cols=136 Identities=21% Similarity=0.262 Sum_probs=72.6
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 10 NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 10 ~~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
.+.|++.+|+.-|.+|-.||..+++.+=-...+.. =..++ ...+.+.++.|+++|... .|.+=|..
T Consensus 87 ~e~Es~~~kl~RL~~Ev~EL~eEl~~~~~~~~~~~-~e~~~------~~~l~~~~~~L~~~L~~l-----~l~~~lg~-- 152 (388)
T PF04912_consen 87 SEKESPEQKLQRLRREVEELKEELEKRKADSKESD-EEKIS------PEELAQQLEELSKQLDSL-----KLEELLGE-- 152 (388)
T ss_pred CCcCCHHHHHHHHHHHHHHHHHHHHHHhhcccccc-cccCC------hhhHHHHHHHHHHHHHHh-----hcccccch--
Confidence 45799999999999999999999998643222111 00000 122344566666666555 11111111
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhh--hhhhHHHHHhHHHHH-HHHHHHHHHHHHHHHHhHHH
Q 012498 90 RIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAER--DNSVMEAEKAKEKEE-LMSQKFNEFQTRLEELSSEN 162 (462)
Q Consensus 90 RiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAER--D~slmEaEkaKE~Ee-~m~qk~~~~~~R~~E~~s~~ 162 (462)
.++.++..+.-.-...+-.++.-|++..+++-..- |...-|.-...+... .-+++++.|+.|+..+++.+
T Consensus 153 ---~~~~~~~~~~~~~~~kl~~~l~~~k~~~~~~~~~~~~~~ityel~~~p~~~~~~~la~~a~LE~RL~~LE~~l 225 (388)
T PF04912_consen 153 ---ETAQDLSDPQKALSKKLLSQLESFKSSSGAGSSPANSDHITYELYYPPEQAKSQQLARAADLEKRLARLESAL 225 (388)
T ss_pred ---hhhcccccchhhHHHHHHHhhhhcccccccCCCCCCCCceeeeeecCcccchhhHHHHHHHHHHHHHHHHHHh
Confidence 22333333344445566667777754333211111 111112111222222 24689999999999998776
No 31
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=83.50 E-value=34 Score=30.90 Aligned_cols=123 Identities=26% Similarity=0.359 Sum_probs=83.3
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccch
Q 012498 132 MEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDK 211 (462)
Q Consensus 132 mEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~K 211 (462)
+|++-|-++-+..-+++.+++.|.......+.....-|..|..++..+.++......-+.
T Consensus 7 ~E~d~a~~r~e~~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~le-------------------- 66 (143)
T PF12718_consen 7 LEADNAQDRAEELEAKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLE-------------------- 66 (143)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------------
Confidence 455556666666668888888888888888877777777777777777776663332222
Q ss_pred hhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHH
Q 012498 212 CACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDK 281 (462)
Q Consensus 212 cs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk 281 (462)
++...-+=+. +..+-|.-||++++.....+.-..-+||=.=.=-.|+-|+|..||.+..-|.+
T Consensus 67 ------e~~~~~~~~E-~l~rriq~LEeele~ae~~L~e~~ekl~e~d~~ae~~eRkv~~le~~~~~~E~ 129 (143)
T PF12718_consen 67 ------ESEKRKSNAE-QLNRRIQLLEEELEEAEKKLKETTEKLREADVKAEHFERKVKALEQERDQWEE 129 (143)
T ss_pred ------hHHHHHHhHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhHHHHHH
Confidence 1111001111 57788999999999998888888887774322334889999999987766554
No 32
>PRK03918 chromosome segregation protein; Provisional
Probab=82.48 E-value=80 Score=34.43 Aligned_cols=47 Identities=21% Similarity=0.307 Sum_probs=21.3
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHh
Q 012498 136 KAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEEL 182 (462)
Q Consensus 136 kaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq 182 (462)
.+++..+....++..++.++..+++.+.+...--..++..+..+.+.
T Consensus 235 ~~~~~~~~l~~~~~~l~~~~~~l~~~i~~l~~el~~l~~~l~~l~~~ 281 (880)
T PRK03918 235 ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444445555555555555544444333333444444444333
No 33
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=81.99 E-value=1.1e+02 Score=35.61 Aligned_cols=41 Identities=22% Similarity=0.208 Sum_probs=25.4
Q ss_pred HHHHHHH-HHHhhhHHHHHHHHHHHHhhhhhhhHHHHHHHHHH
Q 012498 402 QQEERHL-LERNVNSALQKKIEELQRNLFQVTTEKVKALMELA 443 (462)
Q Consensus 402 QqeER~l-lE~~~n~~lq~~ieeLqrnl~QVt~EKVkaLmElA 443 (462)
.-++||- |.++.... ..-.+.|+.-+..++.++...+|+.-
T Consensus 974 ~~~~r~~~l~~~~~dl-~~a~~~l~~~i~~~d~~~~~~f~~~f 1015 (1163)
T COG1196 974 EVEERYEELKSQREDL-EEAKEKLLEVIEELDKEKRERFKETF 1015 (1163)
T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344442 33333333 33477788888888888888888863
No 34
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=81.05 E-value=61 Score=33.41 Aligned_cols=79 Identities=30% Similarity=0.441 Sum_probs=53.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHH------HhhhhhHHHHHHHHHHHHHhhhhhcchHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHF------QRTAGLEQEIEILKQKIAACARENSNLQEEL 85 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~------qRta~LEQeiE~Lkkkl~~c~ren~nLQEEL 85 (462)
.+.|-.+++.||.|-..||...-+|=.--+ .| -=--+|+. -++|+ +.|-.|..-|+.++.+|...|+|.
T Consensus 162 le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~--~~-EekEqqLv~dcv~QL~~An--~qia~LseELa~k~Ee~~rQQEEI 236 (306)
T PF04849_consen 162 LEALQEKLKSLEEENEQLRSEASQLKTETD--TY-EEKEQQLVLDCVKQLSEAN--QQIASLSEELARKTEENRRQQEEI 236 (306)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHh--hc-cHHHHHHHHHHHHHhhhcc--hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 588999999999998888887766632111 00 00011111 12333 347888889999999999999998
Q ss_pred HHHHHHHHHHHHH
Q 012498 86 SEAYRIKGQLADL 98 (462)
Q Consensus 86 sEAYRiK~qLadL 98 (462)
+ ++-+|++||
T Consensus 237 t---~Llsqivdl 246 (306)
T PF04849_consen 237 T---SLLSQIVDL 246 (306)
T ss_pred H---HHHHHHHHH
Confidence 7 678899988
No 35
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning]
Probab=81.04 E-value=99 Score=34.54 Aligned_cols=175 Identities=25% Similarity=0.326 Sum_probs=113.3
Q ss_pred hHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhh
Q 012498 50 TRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDN 129 (462)
Q Consensus 50 TRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~ 129 (462)
+-|+.+=-.+|++.-...-+++++|...|.+|.|-++++--..+..+-| .-+-..+..+|.=||..|-+
T Consensus 216 ~~~~~Elk~~l~~~~~~i~~~ie~l~~~n~~l~e~i~e~ek~~~~~esl----re~~~~L~~D~nK~~~y~~~------- 284 (581)
T KOG0995|consen 216 SELEDELKHRLEKYFTSIANEIEDLKKTNRELEEMINEREKDPGKEESL----REKKARLQDDVNKFQAYVSQ------- 284 (581)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchHHHH----HHHHHHHHhHHHHHHHHHHH-------
Confidence 4455555667888777788999999999999999999888887777655 22334478899999988765
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhccccc
Q 012498 130 SVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWE 209 (462)
Q Consensus 130 slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~ 209 (462)
|+ -+-..|-++++...+-+++-++.+...+..|+.|+.-++.+ ++|..
T Consensus 285 --~~-----~k~~~~~~~l~~l~~Eie~kEeE~e~lq~~~d~Lk~~Ie~Q-------------------------~iS~~ 332 (581)
T KOG0995|consen 285 --MK-----SKKQHMEKKLEMLKSEIEEKEEEIEKLQKENDELKKQIELQ-------------------------GISGE 332 (581)
T ss_pred --HH-----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------------------CCCHH
Confidence 43 44556778888888888887777777776666655433332 22221
Q ss_pred chhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHH
Q 012498 210 DKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNA 286 (462)
Q Consensus 210 ~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ng 286 (462)
+==- ...=-..|..+++.+...+|.|++++- +.+--.......+|++-+.+++.+++=
T Consensus 333 dve~----------------mn~Er~~l~r~l~~i~~~~d~l~k~vw---~~~l~~~~~f~~le~~~~~~~~l~~~i 390 (581)
T KOG0995|consen 333 DVER----------------MNLERNKLKRELNKIQSELDRLSKEVW---ELKLEIEDFFKELEKKFIDLNSLIRRI 390 (581)
T ss_pred HHHH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHH---hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1000 000113466666666666676666542 222222445567888888888877763
No 36
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=80.56 E-value=90 Score=33.77 Aligned_cols=47 Identities=23% Similarity=0.348 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHH
Q 012498 13 EALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQ 62 (462)
Q Consensus 13 e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQ 62 (462)
+.+.+++.+++.++++.++.+.++| +|+++++.++-.+.+=-.-++.
T Consensus 265 ~~Le~ei~~le~e~~e~~~~l~~l~---~~~~p~~l~~~ll~~~~~q~~~ 311 (650)
T TIGR03185 265 EQLERQLKEIEAARKANRAQLRELA---ADPLPLLLIPNLLDSTKAQLQK 311 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cccCCHhhhHHHHHHHHHHHHH
Confidence 3566677777777777777665554 7788888887665543333333
No 37
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=80.42 E-value=1.1e+02 Score=34.52 Aligned_cols=90 Identities=27% Similarity=0.358 Sum_probs=56.2
Q ss_pred HHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCCcchH
Q 012498 152 QTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTS 231 (462)
Q Consensus 152 ~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tsts 231 (462)
..|..++++.+ ..|++||...+|+.. .......++|.+.-+- +.-+- ...
T Consensus 544 r~r~~~lE~E~-------~~lr~elk~kee~~~---~~e~~~~~lr~~~~e~-----~~~~e---------------~L~ 593 (697)
T PF09726_consen 544 RQRRRQLESEL-------KKLRRELKQKEEQIR---ELESELQELRKYEKES-----EKDTE---------------VLM 593 (697)
T ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHH---HHHHHHHHHHHHHhhh-----hhhHH---------------HHH
Confidence 44555555444 356677777777766 4444556677653110 00011 144
Q ss_pred HHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHH
Q 012498 232 KYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEK 274 (462)
Q Consensus 232 kyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lek 274 (462)
..++++++.+..|.++++ ..=||=|+++-.|-.--|.||-
T Consensus 594 ~aL~amqdk~~~LE~sLs---aEtriKldLfsaLg~akrq~ei 633 (697)
T PF09726_consen 594 SALSAMQDKNQHLENSLS---AETRIKLDLFSALGDAKRQLEI 633 (697)
T ss_pred HHHHHHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHH
Confidence 578899999999988765 4667788889988666666663
No 38
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=79.99 E-value=1e+02 Score=34.15 Aligned_cols=68 Identities=28% Similarity=0.410 Sum_probs=51.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQ 94 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~q 94 (462)
|+.-|+||+-|||+..--+ .--..+|-||.+.|-.++.+|++....-.+.=..|...|++ +|.+
T Consensus 2 l~e~l~qlq~Erd~ya~~l-------------k~e~a~~qqr~~qmseev~~L~eEk~~~~~~V~eLE~sL~e---Lk~q 65 (617)
T PF15070_consen 2 LMESLKQLQAERDQYAQQL-------------KEESAQWQQRMQQMSEEVRTLKEEKEHDISRVQELERSLSE---LKNQ 65 (617)
T ss_pred hHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHh
Confidence 4566899999999854322 22345799999999999999999777777777777777777 6777
Q ss_pred HHHH
Q 012498 95 LADL 98 (462)
Q Consensus 95 LadL 98 (462)
++..
T Consensus 66 ~~~~ 69 (617)
T PF15070_consen 66 MAEP 69 (617)
T ss_pred hccc
Confidence 7644
No 39
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=79.85 E-value=4.4 Score=36.18 Aligned_cols=56 Identities=23% Similarity=0.272 Sum_probs=42.7
Q ss_pred HHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHh
Q 012498 113 VKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEEL 182 (462)
Q Consensus 113 vkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq 182 (462)
++|-|+--+. -|||.+-||.|||. +..|+..|+.....++.+|.+|..-..+|+--
T Consensus 6 l~fLQ~Ew~r--~ErdR~~WeiERaE------------mkarIa~LEGE~r~~e~l~~dL~rrIkMLE~a 61 (134)
T PF08232_consen 6 LHFLQTEWHR--FERDRNQWEIERAE------------MKARIAFLEGERRGQENLKKDLKRRIKMLEYA 61 (134)
T ss_pred HHHHHHHHHH--HHHHHHHhHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566665444 38999999999986 66788888899988888888877777666543
No 40
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=79.21 E-value=10 Score=36.77 Aligned_cols=73 Identities=32% Similarity=0.397 Sum_probs=52.7
Q ss_pred hHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHH---HHHHHHHHHHHhhhhhcc
Q 012498 12 SEALMARIQQLEH-------ERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQ---EIEILKQKIAACARENSN 80 (462)
Q Consensus 12 ~e~l~~RI~qLe~-------ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQ---eiE~Lkkkl~~c~ren~n 80 (462)
+-+|.+.|.-|+. ++|.|.+++++||+.-++ ++=|-+.++...+|-+-+.. .|+.|++-+..++.=+.-
T Consensus 97 ~q~L~~~i~~Lqeen~kl~~e~~~lk~~~~eL~~~~~~Lq~Ql~~~e~l~~~~da~l~e~t~~i~eL~~~ieEy~~~tee 176 (193)
T PF14662_consen 97 QQSLVAEIETLQEENGKLLAERDGLKKRSKELATEKATLQRQLCEFESLICQRDAILSERTQQIEELKKTIEEYRSITEE 176 (193)
T ss_pred HHHHHHHHHHHHHHHhHHHHhhhhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 4456666665554 899999999999999888 88888889999999888753 466666655555544444
Q ss_pred hHHH
Q 012498 81 LQEE 84 (462)
Q Consensus 81 LQEE 84 (462)
|.-|
T Consensus 177 LR~e 180 (193)
T PF14662_consen 177 LRLE 180 (193)
T ss_pred HHHH
Confidence 4333
No 41
>KOG0999 consensus Microtubule-associated protein Bicaudal-D [Intracellular trafficking, secretion, and vesicular transport]
Probab=78.64 E-value=1.2e+02 Score=34.26 Aligned_cols=211 Identities=27% Similarity=0.315 Sum_probs=125.8
Q ss_pred hhhhHHHHHHHHHHHHHhhhhhcchH----HHHHHHHHHHHHHHHH------HHHHHHhhHHHHHHHHHhhhhHHHHHhh
Q 012498 57 TAGLEQEIEILKQKIAACARENSNLQ----EELSEAYRIKGQLADL------HAAEVIKNMEAEKQVKFFQGCMAAAFAE 126 (462)
Q Consensus 57 ta~LEQeiE~Lkkkl~~c~ren~nLQ----EELsEAYRiK~qLadL------h~ae~~Kn~e~EkqvkFfQs~vA~AFAE 126 (462)
.--|.+||+.|-++|...+++-..-- +=|-|--.+|-|+++| -+-|+-+.+++=-|.+--+-.||..=-+
T Consensus 10 ve~lr~eierLT~el~q~t~e~~qaAeyGL~lLeeK~~Lkqq~eEleaeyd~~R~Eldqtkeal~q~~s~hkk~~~~g~e 89 (772)
T KOG0999|consen 10 VEKLRQEIERLTEELEQTTEEKIQAAEYGLELLEEKEDLKQQLEELEAEYDLARTELDQTKEALGQYRSQHKKVARDGEE 89 (772)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchh
Confidence 34456666666666666555533211 1122333455555533 4567777777766666667778888888
Q ss_pred hhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcc
Q 012498 127 RDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLET 206 (462)
Q Consensus 127 RD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~ 206 (462)
|.-||++---+| |+...+++.+++.-+. .+..+|+.-.+.++.+.+|..+|-+.-..+-.-
T Consensus 90 ~EesLLqESaak--E~~yl~kI~eleneLK--------------q~r~el~~~q~E~erl~~~~sd~~e~~~~~E~q--- 150 (772)
T KOG0999|consen 90 REESLLQESAAK--EEYYLQKILELENELK--------------QLRQELTNVQEENERLEKVHSDLKESNAAVEDQ--- 150 (772)
T ss_pred hHHHHHHHHHHh--HHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhhcchhhHHH---
Confidence 988998855555 5566666666554333 234667788888888888888887654422110
Q ss_pred cccchhhhhccccccccccCCcc-hHHHHHHHHHHHHHHHHhHHHHHhh-hhh-hHHHHHH--------hHHhHHHHHHh
Q 012498 207 SWEDKCACLLLDSAEMWSFNDTS-TSKYISALEDELEKTRSSVENLQSK-LRM-GLEIENH--------LKKSVRELEKK 275 (462)
Q Consensus 207 s~~~Kcs~LL~Ds~~~Wsfn~ts-tskyisaLEeE~e~lr~~i~~LQsk-LR~-GLeIenh--------Lkk~vr~Lekk 275 (462)
- .=|.|-.--+-|-.+- .|.| +-|||||=+|...|++|.++ +-. ||-+|+. |.-.+.....=
T Consensus 151 -----R-~rlr~elKe~KfRE~RllseY-SELEEENIsLQKqVs~LR~sQVEyEglkheikRleEe~elln~q~ee~~~L 223 (772)
T KOG0999|consen 151 -----R-RRLRDELKEYKFREARLLSEY-SELEEENISLQKQVSNLRQSQVEYEGLKHEIKRLEEETELLNSQLEEAIRL 223 (772)
T ss_pred -----H-HHHHHHHHHHHHHHHHHHHHH-HHHHHhcchHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 0 0111212223444432 4445 67999999999999888654 222 6655553 33344444444
Q ss_pred hhhhHHHHHHHHHHHHHh
Q 012498 276 IIHSDKFISNAIAELRLC 293 (462)
Q Consensus 276 qi~~dk~i~ngi~~lq~~ 293 (462)
..+.++-+..+|-.||.-
T Consensus 224 k~IAekQlEEALeTlq~E 241 (772)
T KOG0999|consen 224 KEIAEKQLEEALETLQQE 241 (772)
T ss_pred HHHHHHHHHHHHHHHHhH
Confidence 455777788888887754
No 42
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=77.93 E-value=41 Score=28.27 Aligned_cols=56 Identities=13% Similarity=0.328 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhh
Q 012498 63 EIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNS 130 (462)
Q Consensus 63 eiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~s 130 (462)
.++.|+.++..+...-.-|.-++.|+..+..-|..| +..-+.| -.|..+|-++|..
T Consensus 7 ~~q~l~~~~~~l~~~~~~l~~~~~E~~~v~~EL~~l-----------~~d~~vy-~~VG~vfv~~~~~ 62 (105)
T cd00632 7 QLQQLQQQLQAYIVQRQKVEAQLNENKKALEELEKL-----------ADDAEVY-KLVGNVLVKQEKE 62 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-----------CCcchHH-HHhhhHHhhccHH
Confidence 366777778778777778888888888887777655 2334444 4677888888764
No 43
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=77.23 E-value=88 Score=31.77 Aligned_cols=122 Identities=20% Similarity=0.237 Sum_probs=79.0
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhh-hhhhhHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAE-RDNSVMEA 134 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAE-RD~slmEa 134 (462)
|+.-++-=++.|...+.+.-.|...|-..+..+=.++-.|-+.|..=-.+-..+.+.+..+++|=..-+.. | ..|
T Consensus 138 R~kllegLk~~L~~~~~~l~~D~~~L~~~~~~l~~~~~~l~~~~~~L~~e~~~L~~~~~e~~~~d~~eL~~lk-~~l--- 213 (312)
T smart00787 138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAK-EKL--- 213 (312)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhCCHHHHHHHH-HHH---
Confidence 55555555667777888888888888888877778888887777655555555555555555553322211 1 111
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHH
Q 012498 135 EKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEE 181 (462)
Q Consensus 135 EkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~e 181 (462)
.+....-+.+.+++.+++.++.++.+.+.+-+.....++.+++..+.
T Consensus 214 ~~~~~ei~~~~~~l~e~~~~l~~l~~~I~~~~~~k~e~~~~I~~ae~ 260 (312)
T smart00787 214 KKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEK 260 (312)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 11223345566788888888888888888777777777766666544
No 44
>PHA02562 46 endonuclease subunit; Provisional
Probab=76.55 E-value=97 Score=31.90 Aligned_cols=70 Identities=23% Similarity=0.354 Sum_probs=34.9
Q ss_pred hHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCC--cchHHHHHHHHHHHHHHHHhH
Q 012498 171 TLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFND--TSTSKYISALEDELEKTRSSV 248 (462)
Q Consensus 171 aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~--tstskyisaLEeE~e~lr~~i 248 (462)
.++.++...+.....+.+. .+||+ ...+| .-|.--+.++ .+ .+...-|+.|+.+++.+..++
T Consensus 259 ~l~~~~~~~~~~l~~~~~~-~~~~~---~~~~C------p~C~~~~~~~------~~~~~~l~d~i~~l~~~l~~l~~~i 322 (562)
T PHA02562 259 KLNTAAAKIKSKIEQFQKV-IKMYE---KGGVC------PTCTQQISEG------PDRITKIKDKLKELQHSLEKLDTAI 322 (562)
T ss_pred HHHHHHHHHHHHHHHHHHH-HHHhc---CCCCC------CCCCCcCCCc------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555566666655555444 34555 22233 1244444443 11 123345666666666666666
Q ss_pred HHHHhhhh
Q 012498 249 ENLQSKLR 256 (462)
Q Consensus 249 ~~LQskLR 256 (462)
+.++...+
T Consensus 323 ~~~~~~~~ 330 (562)
T PHA02562 323 DELEEIMD 330 (562)
T ss_pred HHHHHHHH
Confidence 65555444
No 45
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=74.83 E-value=65 Score=29.12 Aligned_cols=37 Identities=35% Similarity=0.303 Sum_probs=23.2
Q ss_pred HHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 53 HFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 53 ~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
+-+|...+|++|..|++|+...-.+=..+++.|+++-
T Consensus 26 le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k 62 (143)
T PF12718_consen 26 LEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAK 62 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666677777777766666655566666666553
No 46
>PRK11637 AmiB activator; Provisional
Probab=74.81 E-value=1e+02 Score=31.47 Aligned_cols=35 Identities=11% Similarity=0.208 Sum_probs=18.0
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
....++++|..++.++.....+=..++.++...+.
T Consensus 90 ~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~ 124 (428)
T PRK11637 90 KLRETQNTLNQLNKQIDELNASIAKLEQQQAAQER 124 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555555555555555555555555443
No 47
>PF01486 K-box: K-box region; InterPro: IPR002487 MADS genes in plants encode key developmental regulators of vegetative and reproductive development. The majority of the plant MADS proteins share a stereotypical MIKC structure. It comprises (from N- to C-terminal) an N-terminal domain, which is, however, present only in a minority of proteins; a MADS domain (see PDOC00302 from PROSITEDOC, IPR002100 from INTERPRO), which is the major determinant of DNA-binding but which also performs dimerisation and accessory factor binding functions; a weakly conserved intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNA-binding dimers; a keratin-like (K-box) domain, which promotes protein dimerisation; and a C-terminal (C) domain, which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes. The 80-amino acid K-box domain was originally identified as a region with low but significant similarity to a region of keratin, which is part of the coiled-coil sequence constituting the central rod-shaped domain of keratin [, , ]. The K-box protein-protein interaction domain which mediates heterodimerization of MIKC-type MADS proteins contains several heptad repeats in which the first and the fourth positions are occupied by hydrophobic amino acids suggesting that the K-box domain forms three amphipathic alpha-helices referred to as K1, K2, and K3 [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=74.74 E-value=15 Score=30.55 Aligned_cols=74 Identities=27% Similarity=0.359 Sum_probs=54.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh---hcCCch--H-------------HHhhHHHHHhhhhhHHHHHHHHHHHHHhh
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCMQ---QAGPSY--L-------------AVATRMHFQRTAGLEQEIEILKQKIAACA 75 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCMQ---QaGpgy--l-------------~vATRM~~qRta~LEQeiE~Lkkkl~~c~ 75 (462)
......+.+.+|-+.|++.|+.|... --|++. + ....|+-++.+.-|..+|++|++|...+.
T Consensus 9 ~~~~~~e~~~~e~~~L~~~~~~L~~~~R~~~GedL~~Ls~~eL~~LE~~Le~aL~~VR~rK~~~l~~~i~~l~~ke~~l~ 88 (100)
T PF01486_consen 9 LWDSQHEELQQEIAKLRKENESLQKELRHLMGEDLESLSLKELQQLEQQLESALKRVRSRKDQLLMEQIEELKKKERELE 88 (100)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445566667777777777777664 345432 2 23567888888899999999999999999
Q ss_pred hhhcchHHHHHH
Q 012498 76 RENSNLQEELSE 87 (462)
Q Consensus 76 ren~nLQEELsE 87 (462)
.+|..|+..+.|
T Consensus 89 ~en~~L~~~~~e 100 (100)
T PF01486_consen 89 EENNQLRQKIEE 100 (100)
T ss_pred HHHHHHHHHhcC
Confidence 999999988754
No 48
>PRK10884 SH3 domain-containing protein; Provisional
Probab=74.69 E-value=50 Score=31.74 Aligned_cols=26 Identities=23% Similarity=0.295 Sum_probs=18.9
Q ss_pred hHHHHHhhhhhHHHHHHHHHHHHHhh
Q 012498 50 TRMHFQRTAGLEQEIEILKQKIAACA 75 (462)
Q Consensus 50 TRM~~qRta~LEQeiE~Lkkkl~~c~ 75 (462)
|.-...|...||+++..|+.+|+...
T Consensus 88 ~p~~~~rlp~le~el~~l~~~l~~~~ 113 (206)
T PRK10884 88 TPSLRTRVPDLENQVKTLTDKLNNID 113 (206)
T ss_pred CccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455778888888888888777644
No 49
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=74.29 E-value=1.9e+02 Score=34.16 Aligned_cols=106 Identities=12% Similarity=0.214 Sum_probs=56.7
Q ss_pred HHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhc--ccccchhhhhccccccccc
Q 012498 147 KFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLE--TSWEDKCACLLLDSAEMWS 224 (462)
Q Consensus 147 k~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~--~s~~~Kcs~LL~Ds~~~Ws 224 (462)
....+..++.+..+.+......-+.|+.++.....+.+...++=-+.=++....-.-.. -++.++-.-++. .|.
T Consensus 495 ~~~~~~~~i~~~~~~~~~le~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~----~~~ 570 (1311)
T TIGR00606 495 LTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLG----YFP 570 (1311)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCC
Confidence 44566666666666676666666677766666655555443332222222221111111 111122222332 332
Q ss_pred cCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhh
Q 012498 225 FNDTSTSKYISALEDELEKTRSSVENLQSKLRM 257 (462)
Q Consensus 225 fn~tstskyisaLEeE~e~lr~~i~~LQskLR~ 257 (462)
-+ .....++.++..++..++..++.++.++.-
T Consensus 571 ~~-~~l~~~~~~~~~el~~~~~~~~~~~~el~~ 602 (1311)
T TIGR00606 571 NK-KQLEDWLHSKSKEINQTRDRLAKLNKELAS 602 (1311)
T ss_pred Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 11 456778888888888888888888777643
No 50
>KOG0963 consensus Transcription factor/CCAAT displacement protein CDP1 [Transcription]
Probab=73.01 E-value=1.7e+02 Score=33.08 Aligned_cols=56 Identities=16% Similarity=0.245 Sum_probs=40.4
Q ss_pred HHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhhcchhhhhhHHHHHhhh-cc
Q 012498 269 VRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHIKSISDVIEEKT-QH 325 (462)
Q Consensus 269 vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s~i~s~v~~ieekl-~~ 325 (462)
+..||--.+--.+++.+-.+.|+.--+..-.+||.+-.. .+++...++++.+-+ ++
T Consensus 374 ~~~leslLl~knr~lq~e~a~Lr~~n~~~~~~~~~~~~~-~~el~~~~~~~ke~i~kl 430 (629)
T KOG0963|consen 374 AKTLESLLLEKNRKLQNENASLRVANSGLSGRITELSKK-GEELEAKATEQKELIAKL 430 (629)
T ss_pred cchHHHHHHHHHhhhhHHHHHHhccccccchhHHHHHhh-hhhhHHHHHHHHHHHHHH
Confidence 334444444456788899999999999888888887554 457777888887776 54
No 51
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=72.95 E-value=33 Score=26.98 Aligned_cols=58 Identities=28% Similarity=0.442 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhH----HHHHhhhhhHHHHHHHHHHHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATR----MHFQRTAGLEQEIEILKQKIA 72 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATR----M~~qRta~LEQeiE~Lkkkl~ 72 (462)
+.+-|.-|+++.+.+.++|+.+=--=+.|||++=|.. -...+-+.++.+++.+...|.
T Consensus 2 ~~~E~~rL~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~ 63 (66)
T PF10458_consen 2 VEAEIERLEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALE 63 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456778999999999999999888889999886653 234455666666766666554
No 52
>PF09728 Taxilin: Myosin-like coiled-coil protein; InterPro: IPR019132 Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca(2+)-dependent exocytosis in neuroendocrine cells []. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localises to the nucleus in osteoblasts and dimerises with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription [].
Probab=71.86 E-value=1.2e+02 Score=30.71 Aligned_cols=108 Identities=31% Similarity=0.422 Sum_probs=64.5
Q ss_pred hHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHH
Q 012498 60 LEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKE 139 (462)
Q Consensus 60 LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE 139 (462)
++.++..++++......+..+++.|++.+--.|+.|-.|.+-==-.|+.+- |-+..-+..-.+
T Consensus 41 ~~k~~~~~~Kk~~~l~kek~~l~~E~~k~~~~k~KLE~LCRELQk~Nk~lk-----------------eE~~~~~~eee~ 103 (309)
T PF09728_consen 41 LQKQLKKLQKKQEQLQKEKDQLQSELSKAILAKSKLESLCRELQKQNKKLK-----------------EESKRRAREEEE 103 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------------HHHHHHHHHHHH
Confidence 677788889999999999999999999999999999988553333343332 222222333344
Q ss_pred HHHHHHHHHH----HHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 140 KEELMSQKFN----EFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 140 ~Ee~m~qk~~----~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
+-..|+.+|. +++.++++.......+..-|..|...+..+.+|-+
T Consensus 104 kR~el~~kFq~~L~dIq~~~ee~~~~~~k~~~eN~~L~eKlK~l~eQye 152 (309)
T PF09728_consen 104 KRKELSEKFQATLKDIQAQMEEQSERNIKLREENEELREKLKSLIEQYE 152 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555553 33444444444444444444444444444444333
No 53
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=71.75 E-value=1.3e+02 Score=31.23 Aligned_cols=36 Identities=22% Similarity=0.253 Sum_probs=27.1
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhhh----hHHHHHHh
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLRM----GLEIENHL 265 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR~----GLeIenhL 265 (462)
.+.+|..|-+|+..||+.+..-|..--+ -+..+.|+
T Consensus 227 ~~shI~~Lr~EV~RLR~qL~~sq~e~~~k~~~~~~eek~i 266 (310)
T PF09755_consen 227 LSSHIRSLRQEVSRLRQQLAASQQEHSEKMAQYLQEEKEI 266 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5779999999999999999887765433 24555554
No 54
>KOG0963 consensus Transcription factor/CCAAT displacement protein CDP1 [Transcription]
Probab=71.54 E-value=1.8e+02 Score=32.84 Aligned_cols=112 Identities=16% Similarity=0.257 Sum_probs=62.8
Q ss_pred HHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHH
Q 012498 108 EAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFK 187 (462)
Q Consensus 108 e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~ 187 (462)
..++-|-|-|...++-+|+|-.-|.+ .+..|-.++...++-+..+++. +.+-|..+..++..-+
T Consensus 164 ~ie~~a~~~e~~~~q~~~e~e~~L~~------~~~~~~~q~~~le~ki~~lq~a-------~~~t~~el~~~~s~~d--- 227 (629)
T KOG0963|consen 164 FIENAANETEEKLEQEWAEREAGLKD------EEQNLQEQLEELEKKISSLQSA-------IEDTQNELFDLKSKYD--- 227 (629)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH-------HHhhhhHHHHHHHhhh---
Confidence 34555667777777777777555443 3333334444444444444433 3333444333322211
Q ss_pred HHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhh
Q 012498 188 EVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMG 258 (462)
Q Consensus 188 kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~G 258 (462)
.+. ..-.+=-+.++.|-++ +..-|-.||.|++.|+.++.+--+..+.|
T Consensus 228 -------------ee~--~~k~aev~lim~eLe~--------aq~ri~~lE~e~e~L~~ql~~~N~~~~~~ 275 (629)
T KOG0963|consen 228 -------------EEV--AAKAAEVSLIMTELED--------AQQRIVFLEREVEQLREQLAKANSSKKLA 275 (629)
T ss_pred -------------hhh--HHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHhhhhhhhhc
Confidence 111 1111222345555544 67789999999999999998888877776
No 55
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=71.19 E-value=53 Score=26.44 Aligned_cols=31 Identities=23% Similarity=0.464 Sum_probs=25.3
Q ss_pred CcchHHHHHHHHHHHHHHHHhHHHHHhhhhh
Q 012498 227 DTSTSKYISALEDELEKTRSSVENLQSKLRM 257 (462)
Q Consensus 227 ~tstskyisaLEeE~e~lr~~i~~LQskLR~ 257 (462)
-.+...++..|++..+.+...|++|..++.-
T Consensus 57 ~~~~~~~~~~L~~~~~~~~~~i~~l~~~~~~ 87 (106)
T PF01920_consen 57 KQDKEEAIEELEERIEKLEKEIKKLEKQLKY 87 (106)
T ss_dssp EEEHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3578889999999999999888888877653
No 56
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=70.97 E-value=64 Score=35.24 Aligned_cols=42 Identities=19% Similarity=0.241 Sum_probs=27.9
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 143 LMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 143 ~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
.|.+++.+++.++...++....+++.|-.|+.++-....+..
T Consensus 379 ~~e~k~~q~q~k~~k~~kel~~~~E~n~~l~knq~vw~~kl~ 420 (493)
T KOG0804|consen 379 IVERKLQQLQTKLKKCQKELKEEREENKKLIKNQDVWRGKLK 420 (493)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 455677777777777777777777777777766655444433
No 57
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=70.75 E-value=25 Score=28.31 Aligned_cols=74 Identities=27% Similarity=0.381 Sum_probs=50.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH--------HHhhcCCchH----HHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhc
Q 012498 12 SEALMARIQQLEHERDELRKDIEQL--------CMQQAGPSYL----AVATRMHFQRTAGLEQEIEILKQKIAACARENS 79 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqL--------CMQQaGpgyl----~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~ 79 (462)
...+..+|.+|+++.+++.-=++.| |+...|+-|| +-+.-++-.+...++.+|+.|++++..+...=.
T Consensus 14 l~~~~~q~~~l~~~~~~~~~~~~eL~~l~~~~~~y~~vG~~fv~~~~~~~~~~L~~~~~~~~~~i~~l~~~~~~l~~~l~ 93 (106)
T PF01920_consen 14 LQQLEQQIQQLERQLRELELTLEELEKLDDDRKVYKSVGKMFVKQDKEEAIEELEERIEKLEKEIKKLEKQLKYLEKKLK 93 (106)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHTSSTT-EEEEEETTEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHHHhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456667777777666655444433 7888888876 346677888888888888888887776665554
Q ss_pred chHHHH
Q 012498 80 NLQEEL 85 (462)
Q Consensus 80 nLQEEL 85 (462)
+++..|
T Consensus 94 ~~~~~l 99 (106)
T PF01920_consen 94 ELKKKL 99 (106)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 444444
No 58
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=70.44 E-value=93 Score=28.97 Aligned_cols=72 Identities=22% Similarity=0.347 Sum_probs=39.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHH
Q 012498 13 EALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELS 86 (462)
Q Consensus 13 e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELs 86 (462)
..+...|.++..+++.|+..|+.+=....++.. ...+.+......++..+..++..+....++....++.+.
T Consensus 23 ~~~~~~l~~~~~~~~~l~~~i~~~l~~~~~~~~--~~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~ 94 (302)
T PF10186_consen 23 LELRSELQQLKEENEELRRRIEEILESDSNGQL--LEIQQLKREIEELRERLERLRERIERLRKRIEQKRERLE 94 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456677888999999999999987653333321 122222223333444444444444444444444444433
No 59
>KOG0979 consensus Structural maintenance of chromosome protein SMC5/Spr18, SMC superfamily [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning; Replication, recombination and repair]
Probab=70.24 E-value=1.7e+02 Score=34.82 Aligned_cols=166 Identities=19% Similarity=0.139 Sum_probs=97.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHH-HHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELS-EAYR 90 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELs-EAYR 90 (462)
...-++.|++|+.+-|.|.||+|.+|-=+.--++|.+- .+--- +=.+++ -.+-..++- .-=|
T Consensus 197 ~~~~~~~l~~L~~~~~~l~kdVE~~rer~~~~~~Ie~l----~~k~~-----~v~y~~--------~~~ey~~~k~~~~r 259 (1072)
T KOG0979|consen 197 LTTKTEKLNRLEDEIDKLEKDVERVRERERKKSKIELL----EKKKK-----WVEYKK--------HDREYNAYKQAKDR 259 (1072)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhcc-----ccchHh--------hhHHHHHHHHHHHH
Confidence 44456788899999999999999999766666664332 11100 001111 011122222 2236
Q ss_pred HHHHHHHHHH---HHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHh
Q 012498 91 IKGQLADLHA---AEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKK 167 (462)
Q Consensus 91 iK~qLadLh~---ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~ 167 (462)
.|..+-+|-. .=..+-+++|+ -++-.++.=+..-+++-++..+--...-+|.+++.++.+....+...|.
T Consensus 260 ~k~~~r~l~k~~~pi~~~~eeLe~-------~~~et~~~~s~~~~~~~e~~~k~~~~~ek~~~~~~~v~~~~~~le~lk~ 332 (1072)
T KOG0979|consen 260 AKKELRKLEKEIKPIEDKKEELES-------EKKETRSKISQKQRELNEALAKVQEKFEKLKEIEDEVEEKKNKLESLKK 332 (1072)
T ss_pred HHHHHHHHHHhhhhhhhhhhhHHh-------HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777766633 22345566666 3456667777777888888888888888888888888888777766665
Q ss_pred hhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccc
Q 012498 168 QNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSW 208 (462)
Q Consensus 168 ~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~ 208 (462)
.-...|.++.. ..|.|.--=..++....|.+..+
T Consensus 333 ~~~~rq~~i~~-------~~k~i~~~q~el~~~~~~e~~~~ 366 (1072)
T KOG0979|consen 333 AAEKRQKRIEK-------AKKMILDAQAELQETEDPENPVE 366 (1072)
T ss_pred HHHHHHHHHHH-------HHHHHHHHHhhhhhcCCccccch
Confidence 55555544443 34555444444444444444333
No 60
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=69.79 E-value=1.8e+02 Score=33.36 Aligned_cols=59 Identities=27% Similarity=0.410 Sum_probs=42.7
Q ss_pred HHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHH
Q 012498 120 MAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEE 181 (462)
Q Consensus 120 vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~e 181 (462)
+..++.+|++.|+|..+.|-.-+ +.|..+..|++-.++.+.-+|+-=..|+-+|+.+.+
T Consensus 111 l~~~l~~~~~~i~~l~~~~~~~e---~~~~~l~~~l~~~eken~~Lkye~~~~~keleir~~ 169 (769)
T PF05911_consen 111 LSKALQEKEKLIAELSEEKSQAE---AEIEDLMARLESTEKENSSLKYELHVLSKELEIRNE 169 (769)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566788888888877775544 577788888888888887777766666666665543
No 61
>PF12240 Angiomotin_C: Angiomotin C terminal; InterPro: IPR024646 This domain represents the C-terminal region of angiomotin. Angiomotin regulates the action of angiogenesis-inhibitor angiostatin []. The C-terminal region of angiomotin appears to be involved in directing the protein chemotactically [].
Probab=69.56 E-value=1.1e+02 Score=30.22 Aligned_cols=47 Identities=30% Similarity=0.456 Sum_probs=32.9
Q ss_pred hHHHHHhhhhhhhH-HHH--Hh----HHHHHHHH--HHHHHHHHHHHHHhHHHHHH
Q 012498 119 CMAAAFAERDNSVM-EAE--KA----KEKEELMS--QKFNEFQTRLEELSSENIEL 165 (462)
Q Consensus 119 ~vA~AFAERD~slm-EaE--ka----KE~Ee~m~--qk~~~~~~R~~E~~s~~~~q 165 (462)
-.|+|-|+||++++ +.. +. |+.|+... .++.+.+.|++.|.+.+.+-
T Consensus 100 Aaa~aa~~rdttiI~~s~~~s~~~s~r~~eel~~a~~K~qemE~RIK~LhaqI~EK 155 (205)
T PF12240_consen 100 AAATAAAQRDTTIINHSPSESYNSSLREEEELHMANRKCQEMENRIKALHAQIAEK 155 (205)
T ss_pred HHhhhHHHHHHHHHhcCCCCCCCccccchHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 44888899999554 333 33 44666555 46789999999998887543
No 62
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=69.12 E-value=54 Score=25.77 Aligned_cols=80 Identities=28% Similarity=0.349 Sum_probs=53.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKG 93 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~ 93 (462)
....+|..|+..++++...+...| + |. -....+++..=...|+..|..++..+..+-.+=...++.|.+|++=..
T Consensus 16 ~~~~~l~~L~~~~~~~~~~~~~~~-~--~~--s~~~~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~r~~l~~a~~~~k 90 (123)
T PF02050_consen 16 EAEEQLEQLQQERQEYQEQLSESQ-Q--GV--SVAQLRNYQRYISALEQAIQQQQQELERLEQEVEQAREELQEARRERK 90 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHT------S--GG--GHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcc-C--CC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555555444 2 32 123445566667789999999999999999999999999999998777
Q ss_pred HHHHH
Q 012498 94 QLADL 98 (462)
Q Consensus 94 qLadL 98 (462)
.+..|
T Consensus 91 ~~e~L 95 (123)
T PF02050_consen 91 KLEKL 95 (123)
T ss_dssp HHHHH
T ss_pred HHHHH
Confidence 77777
No 63
>PF06657 Cep57_MT_bd: Centrosome microtubule-binding domain of Cep57; InterPro: IPR010597 This entry is thought to represent a centrosomal protein of 57 kDa (Cep57-related protein). It is required for spindle microtubule attachment to both kinetochores and centrosomes and functions to tether minus-ends of spindle microtubules to centrosomes. It may act by forming ring-like structures around microtubules, or by serving as a cross-linker or scaffold at the attachment site [].
Probab=67.76 E-value=27 Score=29.04 Aligned_cols=54 Identities=22% Similarity=0.331 Sum_probs=44.9
Q ss_pred CcchHHHHHHHHHHHHHHHHhHHHHHhhhh---------hhHHHHHHhHHhHHHHHHhhhhhH
Q 012498 227 DTSTSKYISALEDELEKTRSSVENLQSKLR---------MGLEIENHLKKSVRELEKKIIHSD 280 (462)
Q Consensus 227 ~tstskyisaLEeE~e~lr~~i~~LQskLR---------~GLeIenhLkk~vr~Lekkqi~~d 280 (462)
+.+.+..|.+|+.|++-++-....|+..++ ..-.+++||.+-|..||.|--.+-
T Consensus 12 ~~~Ls~vl~~LqDE~~hm~~e~~~L~~~~~~~d~s~~~~~R~~L~~~l~~lv~~mE~K~dQI~ 74 (79)
T PF06657_consen 12 GEALSEVLKALQDEFGHMKMEHQELQDEYKQMDPSLGRRKRRDLEQELEELVKRMEAKADQIY 74 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556899999999999999988888866654 467899999999999999865443
No 64
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=67.51 E-value=1.4e+02 Score=33.18 Aligned_cols=147 Identities=22% Similarity=0.264 Sum_probs=0.0
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhh
Q 012498 135 EKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCAC 214 (462)
Q Consensus 135 EkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~ 214 (462)
++.| +.+.+++.|+--|=...--++.+|..|++|+..+..---.-..=|.-+|+.=-....-
T Consensus 38 ~rEK-------~El~~LNDRLA~YIekVR~LEaqN~~L~~di~~lr~~~~~~ts~ik~~ye~El~~ar~----------- 99 (546)
T KOG0977|consen 38 EREK-------KELQELNDRLAVYIEKVRFLEAQNRKLEHDINLLRGVVGRETSGIKAKYEAELATARK----------- 99 (546)
T ss_pred HHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCcchhHHhhhhHHHHHH-----------
Q ss_pred hccccccccccCC-cchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhh---hHHHHHHHHHHH
Q 012498 215 LLLDSAEMWSFND-TSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIH---SDKFISNAIAEL 290 (462)
Q Consensus 215 LL~Ds~~~Wsfn~-tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~---~dk~i~ngi~~l 290 (462)
+|++.+. + +....=|..|++|++.++.++++.+.-++..=+=--+....+..+|.+..+ .-+.+..-+..|
T Consensus 100 ~l~e~~~-----~ra~~e~ei~kl~~e~~elr~~~~~~~k~~~~~re~~~~~~~~l~~leAe~~~~krr~~~le~e~~~L 174 (546)
T KOG0977|consen 100 LLDETAR-----ERAKLEIEITKLREELKELRKKLEKAEKERRGAREKLDDYLSRLSELEAEINTLKRRIKALEDELKRL 174 (546)
T ss_pred HHHHHHH-----HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHH
Q ss_pred HHhhhHHHHHHHHh
Q 012498 291 RLCHSQLRVHVVNS 304 (462)
Q Consensus 291 q~~h~~~R~~Im~l 304 (462)
+.--+..|.+|-.+
T Consensus 175 k~en~rl~~~l~~~ 188 (546)
T KOG0977|consen 175 KAENSRLREELARA 188 (546)
T ss_pred HHHhhhhHHHHHHH
No 65
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=66.90 E-value=2.1e+02 Score=31.76 Aligned_cols=238 Identities=19% Similarity=0.264 Sum_probs=0.0
Q ss_pred hhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-----HhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhh-c
Q 012498 6 KEKENESEALMARIQQLEHERDELRKDIEQLC-----MQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACAREN-S 79 (462)
Q Consensus 6 ~e~~~~~e~l~~RI~qLe~ERdEL~KDIEqLC-----MQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren-~ 79 (462)
++....++.+...+..|..++.++++.|.+|= |.|-+ .=..++..+.. .+..+.|.++..|.+-+++. .
T Consensus 195 kel~~~~e~l~~E~~~L~~q~~e~~~ri~~LEedi~~l~qk~----~E~e~~~~~lk-~~~~elEq~~~eLk~rLk~~~~ 269 (546)
T PF07888_consen 195 KELTESSEELKEERESLKEQLAEARQRIRELEEDIKTLTQKE----KEQEKELDKLK-ELKAELEQLEAELKQRLKETVV 269 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH
Q ss_pred chHHHHHHHHHHHHHHHHHH---HHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHH
Q 012498 80 NLQEELSEAYRIKGQLADLH---AAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLE 156 (462)
Q Consensus 80 nLQEELsEAYRiK~qLadLh---~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~ 156 (462)
.++..+..+.+.+..+..|- +..-..-...++++-|...-.+.|-+-||+.+-|--.++=..+.+-.++++...-++
T Consensus 270 ~~~~~~~~~~~~~~e~e~LkeqLr~~qe~lqaSqq~~~~L~~EL~~~~~~RDrt~aeLh~aRLe~aql~~qLad~~l~lk 349 (546)
T PF07888_consen 270 QLKQEETQAQQLQQENEALKEQLRSAQEQLQASQQEAELLRKELSDAVNVRDRTMAELHQARLEAAQLKLQLADASLELK 349 (546)
T ss_pred HHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH
Q ss_pred HHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHH---HHHHHHhhhhhhh--hcccccchhhhhccccccccccCCcchH
Q 012498 157 ELSSENIELKKQNATLRFDLEKQEELNESFKEVI---NKFYEIRQQSLEV--LETSWEDKCACLLLDSAEMWSFNDTSTS 231 (462)
Q Consensus 157 E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI---~KFyeiR~~~~e~--~~~s~~~Kcs~LL~Ds~~~Wsfn~tsts 231 (462)
|..++-...+. +|++.....++..+.+..=+ ++-|.=-...+.. ..+.-+.-|...
T Consensus 350 e~~~q~~qEk~---~l~~~~e~~k~~ie~L~~el~~~e~~lqEer~E~qkL~~ql~ke~D~n~v---------------- 410 (546)
T PF07888_consen 350 EGRSQWAQEKQ---ALQHSAEADKDEIEKLSRELQMLEEHLQEERMERQKLEKQLGKEKDCNRV---------------- 410 (546)
T ss_pred HHHHHHHHHHH---HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHH----------------
Q ss_pred HHHHHHHHHHHHHHHhHHHHHhhhhh-hHHHHHH------hHHhHHHHHHh
Q 012498 232 KYISALEDELEKTRSSVENLQSKLRM-GLEIENH------LKKSVRELEKK 275 (462)
Q Consensus 232 kyisaLEeE~e~lr~~i~~LQskLR~-GLeIenh------Lkk~vr~Lekk 275 (462)
.+--.+-.|.-|++-||| +.|=|+. |...|+-||.+
T Consensus 411 --------qlsE~~rel~Elks~lrv~qkEKEql~~EkQeL~~yi~~Le~r 453 (546)
T PF07888_consen 411 --------QLSENRRELQELKSSLRVAQKEKEQLQEEKQELLEYIERLEQR 453 (546)
T ss_pred --------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 66
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=66.48 E-value=76 Score=28.82 Aligned_cols=86 Identities=24% Similarity=0.414 Sum_probs=62.5
Q ss_pred ccccCC------cchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhh---hhhHHHHHHHHHHHHH
Q 012498 222 MWSFND------TSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKI---IHSDKFISNAIAELRL 292 (462)
Q Consensus 222 ~Wsfn~------tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkq---i~~dk~i~ngi~~lq~ 292 (462)
-|||+| -|.++.++++-.+++.+-.+|..-. .||..++..|..|+ .-..+.|.+.+++++.
T Consensus 27 Gws~sD~M~vTrr~m~~A~~~v~kql~~vs~~l~~tK----------khLsqRId~vd~klDe~~ei~~~i~~eV~~v~~ 96 (126)
T PF07889_consen 27 GWSFSDLMFVTRRSMSDAVASVSKQLEQVSESLSSTK----------KHLSQRIDRVDDKLDEQKEISKQIKDEVTEVRE 96 (126)
T ss_pred CCchhHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHh
Confidence 488886 3788888888888777777766543 57777777777766 3367788888898888
Q ss_pred hhhHHHHHHHHhhhhcchhhhhhHHHHHhhh-cc
Q 012498 293 CHSQLRVHVVNSLEEGRSHIKSISDVIEEKT-QH 325 (462)
Q Consensus 293 ~h~~~R~~Im~lL~ee~s~i~s~v~~ieekl-~~ 325 (462)
.=++-+..|-+ +..+|-.++.|| .+
T Consensus 97 dv~~i~~dv~~--------v~~~V~~Le~ki~~i 122 (126)
T PF07889_consen 97 DVSQIGDDVDS--------VQQMVEGLEGKIDEI 122 (126)
T ss_pred hHHHHHHHHHH--------HHHHHHHHHHHHHHH
Confidence 87777776644 566777777777 54
No 67
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=65.50 E-value=77 Score=31.29 Aligned_cols=41 Identities=22% Similarity=0.273 Sum_probs=37.4
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 144 m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
+..+=.-|-.|..||+.++..++.....|+.++..++.-|-
T Consensus 84 VtsQRDRFR~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~ 124 (248)
T PF08172_consen 84 VTSQRDRFRQRNAELEEELRKQQQTISSLRREVESLRADNV 124 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667789999999999999999999999999999999988
No 68
>PF06248 Zw10: Centromere/kinetochore Zw10; InterPro: IPR009361 Zeste white 10 (ZW10) was initially identified as a mitotic checkpoint protein involved in chromosome segregation, and then implicated in targeting cytoplasmic dynein and dynactin to mitotic kinetochores, but it is also important in non-dividing cells. These include cytoplasmic dynein targeting to Golgi and other membranes, and SNARE-mediated ER-Golgi trafficking [, ]. Dominant-negative ZW10, anti-ZW10 antibody, and ZW10 RNA interference (RNAi) cause Golgi dispersal. ZW10 RNAi also disperse endosomes and lysosomes []. Drosophila kinetochore components Rough deal (Rod) and Zw10 are required for the proper functioning of the metaphase checkpoint in flies []. The eukaryotic spindle assembly checkpoint (SAC) monitors microtubule attachment to kinetochores and prevents anaphase onset until all kinetochores are aligned on the metaphase plate. It is an essential surveillance mechanism that ensures high fidelity chromosome segregation during mitosis. In higher eukaryotes, cytoplasmic dynein is involved in silencing the SAC by removing the checkpoint proteins Mad2 and the Rod-Zw10-Zwilch complex (RZZ) from aligned kinetochores [, , ].; GO: 0007067 mitosis, 0000775 chromosome, centromeric region, 0005634 nucleus
Probab=64.17 E-value=2.1e+02 Score=30.67 Aligned_cols=52 Identities=19% Similarity=0.378 Sum_probs=34.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHH--HhhHHHHHhhhhhHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLA--VATRMHFQRTAGLEQEI 64 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~--vATRM~~qRta~LEQei 64 (462)
.|.+..+|..|.++.++++..|-..---..+ .|.. ..++-+.-|+..|..||
T Consensus 9 ~edl~~~I~~L~~~i~~~k~eV~~~I~~~y~-df~~~~~~~~~L~~~~~~l~~eI 62 (593)
T PF06248_consen 9 KEDLRKSISRLSRRIEELKEEVHSMINKKYS-DFSPSLQSAKDLIERSKSLAREI 62 (593)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhHHHHHHHHHHHHHHH
Confidence 6788999999999999999998766554433 2322 22333455666666666
No 69
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=62.64 E-value=6.7 Score=38.74 Aligned_cols=22 Identities=41% Similarity=0.617 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHHHHHHhHHHH
Q 012498 230 TSKYISALEDELEKTRSSVENL 251 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~L 251 (462)
-.+=|+|||.||-.||++|+++
T Consensus 120 AlqKIsALEdELs~LRaQIA~I 141 (253)
T PF05308_consen 120 ALQKISALEDELSRLRAQIAKI 141 (253)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3456899999999999999976
No 70
>PF09738 DUF2051: Double stranded RNA binding protein (DUF2051); InterPro: IPR019139 This entry represents transcriptional repressors which preferentially bind to the GC-rich consensus sequence (5'-AGCCCCCGGCG-3') and may regulate expression of TNF, EGFR and PDGFA. They may control smooth muscle cell proliferation following artery injury through PDGFA repression and may also bind double-stranded RNA. They interact with the leucine-rich repeat domain of human flightless-I (FliI) protein.
Probab=61.76 E-value=1.9e+02 Score=29.53 Aligned_cols=86 Identities=21% Similarity=0.301 Sum_probs=57.3
Q ss_pred HHHHHHH---HHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh
Q 012498 92 KGQLADL---HAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ 168 (462)
Q Consensus 92 K~qLadL---h~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~ 168 (462)
|..|+++ |+.+..-|-+|.- |+-+-+-++.-.|-+=+.|-..+++++.-.++-..++..+|+.
T Consensus 83 k~~l~evEekyrkAMv~naQLDN--------------ek~~l~yqvd~Lkd~lee~eE~~~~~~re~~eK~~elEr~K~~ 148 (302)
T PF09738_consen 83 KDSLAEVEEKYRKAMVSNAQLDN--------------EKSALMYQVDLLKDKLEELEETLAQLQREYREKIRELERQKRA 148 (302)
T ss_pred HHHHHHHHHHHHHHHHHHhhhch--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6666655 5555555555421 1222233555555555555566666666667777788999999
Q ss_pred hHhHhhhHHHHHHhhHhHHHHHH
Q 012498 169 NATLRFDLEKQEELNESFKEVIN 191 (462)
Q Consensus 169 n~aLQ~dl~~~~eq~e~~~kVI~ 191 (462)
.+.|+.++..++++..---..|.
T Consensus 149 ~d~L~~e~~~Lre~L~~rdeli~ 171 (302)
T PF09738_consen 149 HDSLREELDELREQLKQRDELIE 171 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999876656664
No 71
>PF04822 Takusan: Takusan; InterPro: IPR006907 This family includes several uncharacterised muridae (mouse and rat) proteins.
Probab=61.32 E-value=24 Score=30.09 Aligned_cols=64 Identities=28% Similarity=0.363 Sum_probs=49.0
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 10 NESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 10 ~~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
...|.|+..++....||||||+=.- -..||. ..-| +--+.|.||-+=...-.+.++|+.+.++|
T Consensus 19 k~lE~L~~eL~~it~ERnELr~~L~-----~~~~~~--~n~R--------~n~~ye~Lk~q~~~vM~dl~~l~~~~~ea 82 (84)
T PF04822_consen 19 KELERLKFELQKITKERNELRDILA-----LYTEGS--LNNR--------PNPEYEMLKSQHEEVMSDLHKLEMEITEA 82 (84)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-----HhcCCC--cccC--------CChHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4578899999999999999996322 123444 3334 56678889888888899999999999887
No 72
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=60.91 E-value=3e+02 Score=31.52 Aligned_cols=36 Identities=31% Similarity=0.356 Sum_probs=22.7
Q ss_pred HHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 53 HFQRTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 53 ~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
+.+|.+.|+.|+-.++..+.....||..|.....+.
T Consensus 32 ~~~~i~~l~~elk~~~~~~~~~~~e~~rl~~~~~~~ 67 (717)
T PF09730_consen 32 LQQRILELENELKQLRQELSNVQAENERLSQLNQEL 67 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666677777777776666666666665554443
No 73
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown]
Probab=60.65 E-value=3.7e+02 Score=32.45 Aligned_cols=108 Identities=23% Similarity=0.332 Sum_probs=66.0
Q ss_pred HhhhhhcchHHHHHHHHHHHHHHHHH----HHHHHH--hhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHH
Q 012498 73 ACARENSNLQEELSEAYRIKGQLADL----HAAEVI--KNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQ 146 (462)
Q Consensus 73 ~c~ren~nLQEELsEAYRiK~qLadL----h~ae~~--Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~q 146 (462)
.|-+=|++-++=|.+|-|.+.+-.++ ++||.. +-++.=-+.-||-+-|-- +++||.++=+|| ++|-.
T Consensus 213 e~~klrqe~~e~l~ea~ra~~yrdeldalre~aer~d~~ykerlmDs~fykdRvee--lkedN~vLleek-----eMLee 285 (1195)
T KOG4643|consen 213 EISKLRQEIEEFLDEAHRADRYRDELDALREQAERPDTTYKERLMDSDFYKDRVEE--LKEDNRVLLEEK-----EMLEE 285 (1195)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhhcCCCccchhhhhhHHHHHHHHH--HHhhhHHHHHHH-----HHHHH
Confidence 45566677777778888887776655 445554 334444466777766644 578888876544 34455
Q ss_pred HHHHHHHHH--HHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHH
Q 012498 147 KFNEFQTRL--EELSSENIELKKQNATLRFDLEKQEELNESFK 187 (462)
Q Consensus 147 k~~~~~~R~--~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~ 187 (462)
++..+..|- -+++|.+...|..-+.++++.....-+|+.++
T Consensus 286 QLq~lrarse~~tleseiiqlkqkl~dm~~erdtdr~kteeL~ 328 (1195)
T KOG4643|consen 286 QLQKLRARSEGATLESEIIQLKQKLDDMRSERDTDRHKTEELH 328 (1195)
T ss_pred HHHHHHhccccCChHHHHHHHHHHHHHHHHhhhhHHHHHHHHH
Confidence 666666666 45666666666655555555555555555433
No 74
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=60.46 E-value=3.7e+02 Score=32.32 Aligned_cols=113 Identities=18% Similarity=0.190 Sum_probs=58.7
Q ss_pred CCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH-----------HHHHHHHHHHHHH----Hhh
Q 012498 42 GPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR-----------IKGQLADLHAAEV----IKN 106 (462)
Q Consensus 42 Gpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR-----------iK~qLadLh~ae~----~Kn 106 (462)
--+|..+..|...+..-.-..+++.++.++..+..+-.+.++++.++=. ++..+..|.+-.. ..-
T Consensus 256 y~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~a~~~~~eL 335 (1353)
T TIGR02680 256 YRRYARTMLRRRATRLRSAQTQYDQLSRDLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEEL 335 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Confidence 3456665555555554444555666666666666666666666555544 3333333322111 111
Q ss_pred HHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHH
Q 012498 107 MEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSE 161 (462)
Q Consensus 107 ~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~ 161 (462)
.+++.+++-.+...+.+... ++++..+.+..-+...+...|+.+..+.
T Consensus 336 ~el~~ql~~~~~~a~~~~~~-------~~~a~~~~e~~~~~~~~~~~r~~~~~~~ 383 (1353)
T TIGR02680 336 ERARADAEALQAAAADARQA-------IREAESRLEEERRRLDEEAGRLDDAERE 383 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 25556776666666555543 2334445555555555666666555444
No 75
>COG2825 HlpA Outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=60.05 E-value=1.5e+02 Score=27.68 Aligned_cols=47 Identities=23% Similarity=0.308 Sum_probs=27.8
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhh
Q 012498 146 QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSL 201 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~ 201 (462)
++...|..-..+++.. .-+.......+..+....+|+.|.+..+++.
T Consensus 97 ~~~~~~~~k~~~~~~~---------~~~~~~e~~~~~~~~i~~ai~~~a~~~gy~~ 143 (170)
T COG2825 97 KLVNAFNKKQQEYEKD---------LNRREAEEEQKLLEKIQRAIESVAEKGGYSL 143 (170)
T ss_pred HHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHhCCcce
Confidence 3445555544444333 2344455556666777788888888776554
No 76
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=58.89 E-value=21 Score=27.46 Aligned_cols=36 Identities=33% Similarity=0.486 Sum_probs=29.2
Q ss_pred HHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 53 HFQRTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 53 ~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
-..+...+.++|..|++++.....+|..|++++...
T Consensus 15 ~~~~~~~~~~ei~~l~~~i~~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 15 GYSRYYQLNQEIAELQKEIEELKKENEELKEEIERL 50 (80)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344566788889999999999999999999888764
No 77
>PF05700 BCAS2: Breast carcinoma amplified sequence 2 (BCAS2); InterPro: IPR008409 This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins []. BCAS2 is a putative spliceosome associated protein [].
Probab=56.94 E-value=1.7e+02 Score=27.87 Aligned_cols=90 Identities=30% Similarity=0.344 Sum_probs=59.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQ 94 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~q 94 (462)
+..---+|+|.+.-+.. .| .|++-|+.---+...-+-.--..|++++..+++++..+++...+-|.+... .++ .
T Consensus 106 l~na~a~lehq~~R~~N-Le--Ll~~~g~naW~~~n~~Le~~~~~le~~l~~~k~~ie~vN~~RK~~Q~~~~~--~L~-~ 179 (221)
T PF05700_consen 106 LDNAYAQLEHQRLRLEN-LE--LLSKYGENAWLIHNEQLEAMLKRLEKELAKLKKEIEEVNRERKRRQEEAGE--ELR-Y 179 (221)
T ss_pred HHHHHHHHHHHHHHHHH-HH--HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH--HHH-H
Confidence 33333467776655432 22 577888543334444444556778888888999999999888888877433 333 6
Q ss_pred HHHHHHHHHHhhHHHH
Q 012498 95 LADLHAAEVIKNMEAE 110 (462)
Q Consensus 95 LadLh~ae~~Kn~e~E 110 (462)
|..-|..-+.||-++|
T Consensus 180 Le~~W~~~v~kn~eie 195 (221)
T PF05700_consen 180 LEQRWKELVSKNLEIE 195 (221)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 6666777788887776
No 78
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=56.37 E-value=3.7 Score=46.05 Aligned_cols=155 Identities=23% Similarity=0.284 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHH
Q 012498 16 MARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQL 95 (462)
Q Consensus 16 ~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qL 95 (462)
......|+.|.++|.+.++..=.| ++.+||- -..|++.++.++..|...++...+|+..|..+=.=...|
T Consensus 207 ~~~k~kL~~E~~eL~~qLee~e~~------~~~l~r~----k~~L~~qLeelk~~leeEtr~k~~L~~~l~~le~e~~~L 276 (859)
T PF01576_consen 207 TEQKAKLQSENSELTRQLEEAESQ------LSQLQRE----KSSLESQLEELKRQLEEETRAKQALEKQLRQLEHELEQL 276 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHH----HHHHHHHHHhhHHHHHhHhhhhhhhHHHHHHHHHHHHHH
Confidence 334444555555665555554433 2233332 345888899999999999999999988776654322222
Q ss_pred HHHHHHHHHhhHH-------HHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh
Q 012498 96 ADLHAAEVIKNME-------AEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ 168 (462)
Q Consensus 96 adLh~ae~~Kn~e-------~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~ 168 (462)
-+...-+-..-.+ +..++.|+...+-+.+..|-..+-|+- .-+..++.+.+..++++.+.+...++.
T Consensus 277 ~eqleeE~e~k~~l~~qlsk~~~El~~~k~K~e~e~~~~~EelEeaK------KkL~~~L~el~e~le~~~~~~~~LeK~ 350 (859)
T PF01576_consen 277 REQLEEEEEAKSELERQLSKLNAELEQWKKKYEEEAEQRTEELEEAK------KKLERKLQELQEQLEEANAKVSSLEKT 350 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHhhhhhhHHHHHHHHHHHhhHHHHHHHHHHHHhhhhHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 2222222222233 444566666666666665554444432 345678999999999999999999999
Q ss_pred hHhHhhhHHHHHHhhHhH
Q 012498 169 NATLRFDLEKQEELNESF 186 (462)
Q Consensus 169 n~aLQ~dl~~~~eq~e~~ 186 (462)
...|+.++..+.-..+..
T Consensus 351 k~rL~~EleDl~~eLe~~ 368 (859)
T PF01576_consen 351 KKRLQGELEDLTSELEKA 368 (859)
T ss_dssp ------------------
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 999999888877666643
No 79
>KOG0250 consensus DNA repair protein RAD18 (SMC family protein) [Replication, recombination and repair]
Probab=55.31 E-value=4.5e+02 Score=31.69 Aligned_cols=147 Identities=19% Similarity=0.235 Sum_probs=96.5
Q ss_pred HHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHH
Q 012498 101 AEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180 (462)
Q Consensus 101 ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~ 180 (462)
....++.++.+...=++-.++..-.|=|.-=-|++-+++.=......+++++.-..+.++.+.+.|.--+.|...++.++
T Consensus 306 ~~~~k~~~~r~k~teiea~i~~~~~e~~~~d~Ei~~~r~~~~~~~re~~~~~~~~~~~~n~i~~~k~~~d~l~k~I~~~~ 385 (1074)
T KOG0250|consen 306 EKQGKIEEARQKLTEIEAKIGELKDEVDAQDEEIEEARKDLDDLRREVNDLKEEIREIENSIRKLKKEVDRLEKQIADLE 385 (1074)
T ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666666666677777777666555555666666666666677777777777777777777776666666666666
Q ss_pred HhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHH
Q 012498 181 ELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLE 260 (462)
Q Consensus 181 eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~GLe 260 (462)
+++....+.= -..-++|- ....+-|..||+.+.+|+.+...+.++++.|=+
T Consensus 386 ~~~~~~~~~~--------------~~e~e~k~---------------~~L~~evek~e~~~~~L~~e~~~~~~~~~~~~e 436 (1074)
T KOG0250|consen 386 KQTNNELGSE--------------LEERENKL---------------EQLKKEVEKLEEQINSLREELNEVKEKAKEEEE 436 (1074)
T ss_pred HHHHhhhhhh--------------HHHHHHHH---------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 6662111100 00011111 126678899999999999999999999999865
Q ss_pred HHHHhHHhHHHHHHhh
Q 012498 261 IENHLKKSVRELEKKI 276 (462)
Q Consensus 261 IenhLkk~vr~Lekkq 276 (462)
=--|++..++.|.+++
T Consensus 437 e~~~i~~~i~~l~k~i 452 (1074)
T KOG0250|consen 437 EKEHIEGEILQLRKKI 452 (1074)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5556666666666665
No 80
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=55.18 E-value=3.6e+02 Score=30.54 Aligned_cols=38 Identities=18% Similarity=0.225 Sum_probs=18.6
Q ss_pred hHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHH
Q 012498 60 LEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLAD 97 (462)
Q Consensus 60 LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLad 97 (462)
.+..+..+...+..+-....+|.+.-.+....+.++..
T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~L~~~~~e~~~~~~~~~~ 309 (908)
T COG0419 272 REEELRELERLLEELEEKIERLEELEREIEELEEELEG 309 (908)
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444555555555555555544444
No 81
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=54.91 E-value=1.9e+02 Score=27.36 Aligned_cols=85 Identities=22% Similarity=0.335 Sum_probs=54.8
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHH---HHHHhhcC--------CchHHHhhHHH--HHhhhhhHHHHHHHHHHHHHhhhh
Q 012498 11 ESEALMARIQQLEHERDELRKDIE---QLCMQQAG--------PSYLAVATRMH--FQRTAGLEQEIEILKQKIAACARE 77 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIE---qLCMQQaG--------pgyl~vATRM~--~qRta~LEQeiE~Lkkkl~~c~re 77 (462)
....|-++|.|..+-+.+|..=+. .+|..... |..-.+.+|.- -||.++|+|-...|+.+|..+...
T Consensus 17 Lv~~LQ~KV~qYr~rc~ele~~l~~~~~l~~~~~~~~~~~e~s~dLe~~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~ 96 (182)
T PF15035_consen 17 LVQRLQAKVLQYRKRCAELEQQLSASQVLESPSQRRRSEEEHSPDLEEALIRLEEEQQRSEELAQVNALLREQLEQARKA 96 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCcCcccccccccccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Confidence 345566777778887777765441 12221110 11112223322 379999999999999999999999
Q ss_pred hcchHHHHHHHHHHHHHHHHH
Q 012498 78 NSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 78 n~nLQEELsEAYRiK~qLadL 98 (462)
|..|.++|. ++...+..+
T Consensus 97 N~~L~~dl~---klt~~~~~l 114 (182)
T PF15035_consen 97 NEALQEDLQ---KLTQDWERL 114 (182)
T ss_pred HHHHHHHHH---HHHHHHHHH
Confidence 999999986 455555543
No 82
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=54.03 E-value=2.5e+02 Score=28.42 Aligned_cols=100 Identities=21% Similarity=0.294 Sum_probs=58.4
Q ss_pred HHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccCCcchHHH
Q 012498 154 RLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFNDTSTSKY 233 (462)
Q Consensus 154 R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn~tstsky 233 (462)
-+.||.+++......|..|....--.++.+-.-..-..-=|++=.......+.++...=.-+- ++.=+|.+. ..+=
T Consensus 7 sl~el~~h~~~L~~~N~~L~~~IqdtE~st~~~Vr~lLqqy~~~~~~i~~le~~~~~~l~~ak---~eLqe~eek-~e~~ 82 (258)
T PF15397_consen 7 SLQELKKHEDFLTKLNKELIKEIQDTEDSTALKVRKLLQQYDIYRTAIDILEYSNHKQLQQAK---AELQEWEEK-EESK 82 (258)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHhHHhhHHHHHHHHHHHHHHHHHHHHHHHccChHHHHHHH---HHHHHHHHH-HHhH
Confidence 367888888888889988888777666655433333322233322222222222222111000 011111122 4556
Q ss_pred HHHHHHHHHHHHHhHHHHHhhhhh
Q 012498 234 ISALEDELEKTRSSVENLQSKLRM 257 (462)
Q Consensus 234 isaLEeE~e~lr~~i~~LQskLR~ 257 (462)
++.|+.+++.|.+.|.+.|-.|++
T Consensus 83 l~~Lq~ql~~l~akI~k~~~el~~ 106 (258)
T PF15397_consen 83 LSKLQQQLEQLDAKIQKTQEELNF 106 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 889999999999999999999998
No 83
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=53.77 E-value=1.9e+02 Score=29.15 Aligned_cols=37 Identities=38% Similarity=0.434 Sum_probs=22.2
Q ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhh
Q 012498 133 EAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQN 169 (462)
Q Consensus 133 EaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n 169 (462)
|.+..++.|+.....++.|+..+.+.+......+.+-
T Consensus 86 e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~ 122 (314)
T PF04111_consen 86 ELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQY 122 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444456666677777887777776655544444333
No 84
>PF13514 AAA_27: AAA domain
Probab=53.73 E-value=4.1e+02 Score=30.82 Aligned_cols=28 Identities=21% Similarity=0.366 Sum_probs=21.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQ 39 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQ 39 (462)
...+..||.+++.+.+.+...+..|+-.
T Consensus 745 ~~~~~~ri~~~~~~~~~f~~~~~~L~~~ 772 (1111)
T PF13514_consen 745 IRELRRRIEQMEADLAAFEEQVAALAER 772 (1111)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456677888888888888888888853
No 85
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=53.09 E-value=2e+02 Score=27.07 Aligned_cols=47 Identities=23% Similarity=0.287 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHH
Q 012498 143 LMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINK 192 (462)
Q Consensus 143 ~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~K 192 (462)
.+++++.++..+..++++.+......+ ..-+..+++.+...+.-+|.
T Consensus 107 ~~l~~l~~l~~~~~~l~~el~~~~~~D---p~~i~~~~~~~~~~~~~anr 153 (188)
T PF03962_consen 107 ELLEELEELKKELKELKKELEKYSEND---PEKIEKLKEEIKIAKEAANR 153 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC---HHHHHHHHHHHHHHHHHHHH
Confidence 466778888888888877776443322 22344445555544444443
No 86
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=52.14 E-value=3.3e+02 Score=29.30 Aligned_cols=82 Identities=21% Similarity=0.230 Sum_probs=40.0
Q ss_pred hhHHHHHHHHHHHHHhhhh--hcchHHHHHHHHHHHHHHHHHH---HHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHH
Q 012498 59 GLEQEIEILKQKIAACARE--NSNLQEELSEAYRIKGQLADLH---AAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVME 133 (462)
Q Consensus 59 ~LEQeiE~Lkkkl~~c~re--n~nLQEELsEAYRiK~qLadLh---~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmE 133 (462)
+++.+|+.+++++..|... +..|-.--..-=.|..++..|| ..|..-.+.+++...-..+.+..+=..=+.-.-|
T Consensus 253 ~i~~~i~~l~~~i~~~~~~l~~l~l~~~~~~~~~i~~~Id~Lyd~lekE~~A~~~vek~~~~l~~~l~~~~e~~~~l~~E 332 (569)
T PRK04778 253 DIEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELKEE 332 (569)
T ss_pred ChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555554321 1222222222233444444443 4666666666666666666665554444444444
Q ss_pred HHHhHHH
Q 012498 134 AEKAKEK 140 (462)
Q Consensus 134 aEkaKE~ 140 (462)
.+..++.
T Consensus 333 i~~l~~s 339 (569)
T PRK04778 333 IDRVKQS 339 (569)
T ss_pred HHHHHHc
Confidence 4444443
No 87
>KOG2991 consensus Splicing regulator [RNA processing and modification]
Probab=51.71 E-value=3e+02 Score=28.71 Aligned_cols=194 Identities=22% Similarity=0.269 Sum_probs=105.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKG 93 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~ 93 (462)
.+..--.+++.-|+||++---+ +---.|+.|+- .+||.-|+-+|++||. .--++|.
T Consensus 67 ~~~seq~~~~~a~~elq~~ks~----~Q~e~~v~a~e---~~~~rll~d~i~nLk~-----------------se~~lkq 122 (330)
T KOG2991|consen 67 VRLSEQDFKVMARDELQLRKSW----KQYEAYVQALE---GKYTRLLSDDITNLKE-----------------SEEKLKQ 122 (330)
T ss_pred hhhHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhc---CcccchhHHHHHhhHH-----------------HHHHHHH
Confidence 3444445667778888653111 11134555543 3888889999999987 2235666
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHh
Q 012498 94 QLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLR 173 (462)
Q Consensus 94 qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ 173 (462)
|+++- +.+|....-.++.-+.-+-|+.|++-+.|.+-.---
T Consensus 123 Q~~~a---------------------------------------~RrE~ilv~rlA~kEQEmqe~~sqi~~lK~qq~Ps~ 163 (330)
T KOG2991|consen 123 QQQEA---------------------------------------ARRENILVMRLATKEQEMQECTSQIQYLKQQQQPSV 163 (330)
T ss_pred HHHHH---------------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCcHH
Confidence 65554 344455555666677777788888877765432111
Q ss_pred hhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhccccccccccC-CcchHHHHH----HHHHHHHHHHHhH
Q 012498 174 FDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSFN-DTSTSKYIS----ALEDELEKTRSSV 248 (462)
Q Consensus 174 ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsfn-~tstskyis----aLEeE~e~lr~~i 248 (462)
+.+ .+-.+--.||-||.-=...++.-+--.++ +-+-..-|.|. ++-|-|-+= -|.+||+.|-...
T Consensus 164 ~ql-----R~~llDPAinl~F~rlK~ele~tk~Klee-----~QnelsAwkFTPdS~tGK~LMAKCR~L~qENeElG~q~ 233 (330)
T KOG2991|consen 164 AQL-----RSTLLDPAINLFFLRLKGELEQTKDKLEE-----AQNELSAWKFTPDSKTGKMLMAKCRTLQQENEELGHQA 233 (330)
T ss_pred HHH-----HHHhhChHHHHHHHHHHHHHHHHHHHHHH-----HHhhhheeeecCCCcchHHHHHHHHHHHHHHHHHHhhh
Confidence 111 11223367888887655555542111111 12333459998 455666553 4788888775433
Q ss_pred HHHHhhhhh-hHHHHHHhHHhHH-HHHHhhhhhHHHHH
Q 012498 249 ENLQSKLRM-GLEIENHLKKSVR-ELEKKIIHSDKFIS 284 (462)
Q Consensus 249 ~~LQskLR~-GLeIenhLkk~vr-~Lekkqi~~dk~i~ 284 (462)
+ +=|+ -|+||=-++|.-. +|-+.+--+++||.
T Consensus 234 s----~Gria~Le~eLAmQKs~seElkssq~eL~dfm~ 267 (330)
T KOG2991|consen 234 S----EGRIAELEIELAMQKSQSEELKSSQEELYDFME 267 (330)
T ss_pred h----cccHHHHHHHHHHHHhhHHHHHHhHHHHHHHHH
Confidence 2 2222 2566655555433 34444444555553
No 88
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=50.84 E-value=1.4e+02 Score=24.63 Aligned_cols=59 Identities=17% Similarity=0.195 Sum_probs=38.2
Q ss_pred HHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhh---HHHHHHHHHHHHHhhhHHHHHHHHhhh
Q 012498 248 VENLQSKLRMGLEIENHLKKSVRELEKKIIHS---DKFISNAIAELRLCHSQLRVHVVNSLE 306 (462)
Q Consensus 248 i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~---dk~i~ngi~~lq~~h~~~R~~Im~lL~ 306 (462)
++.|..|+...++--..|+..+..|..+..-+ ..-++.....|++.|.....+|.++|.
T Consensus 6 l~~LE~ki~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e~~~~~~rl~~LL~ 67 (72)
T PF06005_consen 6 LEQLEEKIQQAVETIALLQMENEELKEKNNELKEENEELKEENEQLKQERNAWQERLRSLLG 67 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666666666667777777776654332 233444556677888888888877775
No 89
>PF07139 DUF1387: Protein of unknown function (DUF1387); InterPro: IPR009816 This family represents a conserved region approximately 300 residues long within a number of hypothetical proteins of unknown function that seem to be restricted to mammals.
Probab=50.39 E-value=3.1e+02 Score=28.47 Aligned_cols=113 Identities=26% Similarity=0.391 Sum_probs=71.1
Q ss_pred HHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH-HHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhH
Q 012498 54 FQRTAGLEQEIEILKQKIAACARENSNLQEELSEAY-RIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVM 132 (462)
Q Consensus 54 ~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY-RiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slm 132 (462)
-.+..++|.-+-.|+.=+..++|=+..|.||.--++ +||..+++| |+| +.+|--+||
T Consensus 149 KKlg~nIEKSvKDLqRctvSL~RYr~~lkee~d~S~k~ik~~F~~l------------------~~c----L~dREvaLl 206 (302)
T PF07139_consen 149 KKLGPNIEKSVKDLQRCTVSLTRYRVVLKEEMDSSIKKIKQTFAEL------------------QSC----LMDREVALL 206 (302)
T ss_pred cccCccHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH------------------HHH----HHHHHHHHH
Confidence 356789999999999999999999999999996654 899999999 333 456777766
Q ss_pred -HHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhH-hHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhh
Q 012498 133 -EAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNA-TLRFDLEKQEELNESFKEVINKFYEIRQQSLE 202 (462)
Q Consensus 133 -EaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~-aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e 202 (462)
|-.|+| +|+|. -+..=+++.+|| |++-| |-|| -++|.--+..=|.-|--=|.++-+
T Consensus 207 ~EmdkVK--~EAme-iL~aRqkkAeeL-------krltd~A~~M----sE~Ql~ELRadIK~fvs~rk~de~ 264 (302)
T PF07139_consen 207 AEMDKVK--AEAME-ILDARQKKAEEL-------KRLTDRASQM----SEEQLAELRADIKHFVSERKYDEE 264 (302)
T ss_pred HHHHHHH--HHHHH-HHHHHHHHHHHH-------HHHHHHHhhc----CHHHHHHHHHHHHHHhhhhhhHHH
Confidence 444444 45552 122223333333 33321 2222 133333344556666666666554
No 90
>PF11802 CENP-K: Centromere-associated protein K; InterPro: IPR020993 Cenp-K is one of seven new Cenp-A-nucleosome distal (CAD) centromere components (the others being Cenp-L, Cenp-O, Cenp-P, Cenp-Q, Cenp-R and Cenp-S) that are identified as assembling on the Cenp-A nucleosome associated complex, NAC []. The Cenp-A NAC is essential, as disruption of the complex causes errors of chromosome alignment and segregation that preclude cell survival despite continued centromere-derived mitotic checkpoint signalling. Cenp-K is centromere-associated through its interaction with one or more components of the Cenp-A NAC.; GO: 0005634 nucleus
Probab=49.76 E-value=3.1e+02 Score=28.15 Aligned_cols=191 Identities=18% Similarity=0.203 Sum_probs=98.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKG 93 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~ 93 (462)
-++.|.+.|..|.+-.+|.-..+.- --|++|-...+=+++|. +..|+.-|+.+-..|..|.+.|-..=-.=.
T Consensus 56 ll~~~~k~L~aE~~qwqk~~peii~--~n~~VL~~lgkeelqkl------~~eLe~vLs~~q~KnekLke~LerEq~wL~ 127 (268)
T PF11802_consen 56 LLMMRVKCLTAELEQWQKRTPEIIP--LNPEVLLTLGKEELQKL------ISELEMVLSTVQSKNEKLKEDLEREQQWLD 127 (268)
T ss_pred HHHHHHHHHHHHHHHHHhcCCCcCC--CCHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667777777776666665443331 11555555555555553 334444555556666677766653322222
Q ss_pred HHHHHHHHHHHhhHHHHHHH-HHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHH---HHHhHHHHHHHhhh
Q 012498 94 QLADLHAAEVIKNMEAEKQV-KFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRL---EELSSENIELKKQN 169 (462)
Q Consensus 94 qLadLh~ae~~Kn~e~Ekqv-kFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~---~E~~s~~~~qk~~n 169 (462)
+--.++.+--..-.++..++ .|.=+.|.+++..+ -.++|+-.+.+...+-+|-.-- -.-+....+-+.-.
T Consensus 128 Eqqql~~sL~~r~~elk~~~~~~se~rv~~el~~K------~~~~k~~~e~Ll~~LgeFLeeHfPlp~~~~~~~Kkk~~~ 201 (268)
T PF11802_consen 128 EQQQLLESLNKRHEELKNQVETFSESRVFQELKTK------IEKIKEYKEKLLSFLGEFLEEHFPLPDEQGNAKKKKKGE 201 (268)
T ss_pred HHHHHHHHHHHHHHHHHHhhhccchHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhccc
Confidence 22233444444455565555 56666666666554 4455566666666666664321 11111122222222
Q ss_pred HhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhh-hccc------ccchhhhhcccccc
Q 012498 170 ATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV-LETS------WEDKCACLLLDSAE 221 (462)
Q Consensus 170 ~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~-~~~s------~~~Kcs~LL~Ds~~ 221 (462)
+.--.++..+.+-+| ..||+.++.-.-.-.- .+.. +.-.|+|-+.+|.|
T Consensus 202 ~e~~~~~~~l~eilE---~LmN~l~~~p~DpYv~i~~~~WPpyie~LlR~GIa~rHP~D 257 (268)
T PF11802_consen 202 DEPSAQLITLREILE---ILMNKLLDSPHDPYVKIDDSFWPPYIELLLRSGIALRHPED 257 (268)
T ss_pred cccchhhhHHHHHHH---HHHHHhcCCCCCCceecCcccChHHHHHHHHcCCeeeCCCC
Confidence 233445555554444 8899988765532222 4433 34567777776665
No 91
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=49.52 E-value=1.6e+02 Score=24.77 Aligned_cols=45 Identities=11% Similarity=0.107 Sum_probs=25.8
Q ss_pred ccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhh
Q 012498 206 TSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKL 255 (462)
Q Consensus 206 ~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskL 255 (462)
+..+.+|-.++++.-. -.+....+..|+..++.+...++.+..++
T Consensus 42 l~~d~~vy~~VG~vfv-----~~~~~ea~~~Le~~~e~le~~i~~l~~~~ 86 (105)
T cd00632 42 LADDAEVYKLVGNVLV-----KQEKEEARTELKERLETIELRIKRLERQE 86 (105)
T ss_pred CCCcchHHHHhhhHHh-----hccHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456666655554222 34555666666666666666666655554
No 92
>PF11629 Mst1_SARAH: C terminal SARAH domain of Mst1; InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name. In the Sav (Salvador) and Hpo (Hippo) families, the SARAH domain mediates signal transduction from Hpo via the Sav scaffolding protein to the downstream component Wts (Warts); the phosphorylation of Wts by Hpo triggers cell cycle arrest and apoptosis by down-regulating cyclin E, Diap 1 and other targets []. The SARAH domain is also involved in dimerisation, as in the human Hpo orthologue, Mst1, which homodimerises via its C-terminal SARAH domain. The SARAH domain is found associated with other domains, such as protein kinase domains, WW/rsp5/WWP domain (IPR001202 from INTERPRO), C1 domain (IPR002219 from INTERPRO), LIM domain (IPR001781 from INTERPRO), or the Ras-associating (RA) domain (IPR000159 from INTERPRO).; GO: 0004674 protein serine/threonine kinase activity; PDB: 2JO8_A.
Probab=49.05 E-value=59 Score=25.86 Aligned_cols=38 Identities=24% Similarity=0.349 Sum_probs=32.9
Q ss_pred hHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhh
Q 012498 268 SVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSL 305 (462)
Q Consensus 268 ~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL 305 (462)
++.+|+.+.+.+|..|.--|.+|+..|..-|.=|..-+
T Consensus 9 s~~eL~~rl~~LD~~ME~Eieelr~RY~~KRqPIldAi 46 (49)
T PF11629_consen 9 SYEELQQRLASLDPEMEQEIEELRQRYQAKRQPILDAI 46 (49)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhccHHHHH
Confidence 46789999999999999999999999999998876544
No 93
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=48.39 E-value=51 Score=34.04 Aligned_cols=67 Identities=19% Similarity=0.300 Sum_probs=45.4
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhh
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAG--PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARE 77 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG--pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~re 77 (462)
.|-.|..-|..|.+.-.|++.||.-|=|+.|- +|.-.+.+|-++..-..|-..+|.+++|....-||
T Consensus 80 ~Nk~L~~Ev~~Lrqkl~E~qGD~KlLR~~la~~r~~~~~~~~~~~~~ere~lV~qLEk~~~q~~qLe~d 148 (319)
T PF09789_consen 80 QNKKLKEEVEELRQKLNEAQGDIKLLREKLARQRVGDEGIGARHFPHEREDLVEQLEKLREQIEQLERD 148 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHhchHHHHHHHHHhhhhhhccccccccchHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677778888888889999999888774433 33344667766655556666688888866544333
No 94
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=48.07 E-value=2.3e+02 Score=26.21 Aligned_cols=28 Identities=29% Similarity=0.362 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 62 QEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 62 QeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
.+|++|+.+++..+.+...|..||.-..
T Consensus 52 ~eie~L~~el~~lt~el~~L~~EL~~l~ 79 (140)
T PF10473_consen 52 AEIETLEEELEELTSELNQLELELDTLR 79 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555555555444433
No 95
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=47.51 E-value=1.8e+02 Score=24.79 Aligned_cols=94 Identities=20% Similarity=0.318 Sum_probs=55.2
Q ss_pred HHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHH
Q 012498 63 EIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEE 142 (462)
Q Consensus 63 eiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee 142 (462)
....++.++..+...-..|.-++.|+-.+..-|..| ....+.|- .|...|-++|..=+-
T Consensus 11 ~~q~~q~~~~~l~~q~~~le~~~~E~~~v~~eL~~l-----------~~d~~vyk-~VG~vlv~~~~~e~~--------- 69 (110)
T TIGR02338 11 QLQQLQQQLQAVATQKQQVEAQLKEAEKALEELERL-----------PDDTPVYK-SVGNLLVKTDKEEAI--------- 69 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-----------CCcchhHH-HhchhhheecHHHHH---------
Confidence 356667777777777778888888888888777766 23444453 467788887754221
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhh
Q 012498 143 LMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELN 183 (462)
Q Consensus 143 ~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~ 183 (462)
..++.|++.++..+......-..|+..+..+..+.
T Consensus 70 ------~~l~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l 104 (110)
T TIGR02338 70 ------QELKEKKETLELRVKTLQRQEERLREQLKELQEKI 104 (110)
T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444444433
No 96
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=46.65 E-value=6.5 Score=42.75 Aligned_cols=122 Identities=23% Similarity=0.354 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH---------HHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHH-
Q 012498 64 IEILKQKIAACARENSNLQEELSEAYRIKGQLADL---------HAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVME- 133 (462)
Q Consensus 64 iE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL---------h~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmE- 133 (462)
++.+.+.+..+..+|..|+-.-.+|-.+|..|.-| ..+++.+-++=-..+.||..-| ..+-|+-..+|+
T Consensus 269 ~e~le~ei~~L~q~~~eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YKkKLed~~~lk~qv-k~Lee~N~~l~e~ 347 (713)
T PF05622_consen 269 LEELEKEIDELRQENEELQAEAREARALRDELDELREKADRADKLENEVEKYKKKLEDLEDLKRQV-KELEEDNAVLLET 347 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH
Confidence 33445555556666666666666666666666544 1223333333333455555554 333333333332
Q ss_pred ---HHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhH
Q 012498 134 ---AEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186 (462)
Q Consensus 134 ---aEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~ 186 (462)
.|..-.+-.+...++......+-+++..+.+...-.+.|.+++..+++.++.+
T Consensus 348 ~~~LEeel~~~~~~~~qle~~k~qi~eLe~~l~~~~~~~~~l~~e~~~L~ek~~~l 403 (713)
T PF05622_consen 348 KAMLEEELKKARALKSQLEEYKKQIQELEQKLSEESRRADKLEFENKQLEEKLEAL 403 (713)
T ss_dssp --------------------------------------------------------
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 11111122233344444555555555555555555666666776666666543
No 97
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=45.74 E-value=2.7e+02 Score=26.44 Aligned_cols=96 Identities=20% Similarity=0.313 Sum_probs=55.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhh--HHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVAT--RMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vAT--RM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
|..|..=+..++.|+.+|++++.+.=--.. ..-..=+ +..-+...+|+.+-+.|..+...+-+|...|+.
T Consensus 57 N~~L~epL~~a~~e~~eL~k~L~~y~kdK~--~L~~~k~rl~~~ek~l~~Lk~e~evL~qr~~kle~ErdeL~~------ 128 (201)
T PF13851_consen 57 NKRLSEPLKKAEEEVEELRKQLKNYEKDKQ--SLQNLKARLKELEKELKDLKWEHEVLEQRFEKLEQERDELYR------ 128 (201)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------
Confidence 455666677888999999998875322111 1100000 112334445555555555555555555443333
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHH
Q 012498 90 RIKGQLADLHAAEVIKNMEAEKQVKF 115 (462)
Q Consensus 90 RiK~qLadLh~ae~~Kn~e~EkqvkF 115 (462)
|.-+.+-|..+..-+||.=||+.+.=
T Consensus 129 kf~~~i~evqQk~~~kn~lLEkKl~~ 154 (201)
T PF13851_consen 129 KFESAIQEVQQKTGLKNLLLEKKLQA 154 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455668888888899999988764
No 98
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=45.61 E-value=1.6e+02 Score=24.45 Aligned_cols=25 Identities=36% Similarity=0.575 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q 012498 13 EALMARIQQLEHERDELRKDIEQLC 37 (462)
Q Consensus 13 e~l~~RI~qLe~ERdEL~KDIEqLC 37 (462)
-.+..++..|.++|+++.|.|-++=
T Consensus 39 r~l~~~~e~lr~~rN~~sk~I~~~~ 63 (108)
T PF02403_consen 39 RELQQELEELRAERNELSKEIGKLK 63 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHh
Confidence 3566677778888888888876653
No 99
>PRK11281 hypothetical protein; Provisional
Probab=45.16 E-value=6.2e+02 Score=30.39 Aligned_cols=162 Identities=17% Similarity=0.212 Sum_probs=83.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----hhcCCchHHHhhHHHH--HhhhhhHHH---------------HHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCM----QQAGPSYLAVATRMHF--QRTAGLEQE---------------IEILKQKIA 72 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCM----QQaGpgyl~vATRM~~--qRta~LEQe---------------iE~Lkkkl~ 72 (462)
.|.+++.+++.+..+.++|..++=- +|.-|-- +-|||-. +|+..+.+. ...|+..+.
T Consensus 125 qLEq~L~q~~~~Lq~~Q~~La~~NsqLi~~qT~PER--AQ~~lsea~~RlqeI~~~L~~~~~~~~~l~~~~~~~l~ae~~ 202 (1113)
T PRK11281 125 QLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPER--AQAALYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQA 202 (1113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchHH--HHHHHHHHHHHHHHHHHHHhCCCCCCCcCCHHHHHHHHHHHH
Confidence 3888888888888888888876633 4444554 3333322 122222211 222344444
Q ss_pred HhhhhhcchHHHHH------HHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHH---HHhhhhhhhHHHHH-------
Q 012498 73 ACARENSNLQEELS------EAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAA---AFAERDNSVMEAEK------- 136 (462)
Q Consensus 73 ~c~ren~nLQEELs------EAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~---AFAERD~slmEaEk------- 136 (462)
+...+|.-++.||. +-|+.+..+... +-..+|.++.+.|..+.. .-+|- .+-||+.
T Consensus 203 ~l~~~~~~~~~~l~~~~~l~~l~~~q~d~~~~------~~~~~~~~~~~lq~~in~kr~~~se~--~~~~a~~~~~~~~~ 274 (1113)
T PRK11281 203 LLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTA------RIQRLEHQLQLLQEAINSKRLTLSEK--TVQEAQSQDEAARI 274 (1113)
T ss_pred HHHHHHHHHHHHHhcchHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhhhhhhccc
Confidence 44444555555443 223333322222 334567777777766554 22221 2222211
Q ss_pred --------hHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHh
Q 012498 137 --------AKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNES 185 (462)
Q Consensus 137 --------aKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~ 185 (462)
.-+.-..+++.+.+.-+|+..+..+...-|..=+.+.-.+..++||.+.
T Consensus 275 ~~~p~i~~~~~~N~~Ls~~L~~~t~~~~~l~~~~~~~~~~l~~~~q~~~~i~eqi~~ 331 (1113)
T PRK11281 275 QANPLVAQELEINLQLSQRLLKATEKLNTLTQQNLRVKNWLDRLTQSERNIKEQISV 331 (1113)
T ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1122345666666666666666666666666666666666666666653
No 100
>PF07083 DUF1351: Protein of unknown function (DUF1351); InterPro: IPR009785 This entry is represented by Lactobacillus prophage Lj928, Orf309. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial and phage proteins of around 230 residues in length. The function of this family is unknown.
Probab=44.80 E-value=2.9e+02 Score=26.43 Aligned_cols=110 Identities=20% Similarity=0.328 Sum_probs=65.4
Q ss_pred HHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH-hHHHHHHHHHHHhhhhhhhhcccccchhh
Q 012498 135 EKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE-SFKEVINKFYEIRQQSLEVLETSWEDKCA 213 (462)
Q Consensus 135 EkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e-~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs 213 (462)
.+.|+....++.=+.+|+.++.++...+.+- .+.+-..+...+++-. .=+.+|..+|+=.|....-.-..|+++
T Consensus 60 ~~RK~ikk~~~~P~~~Fe~~~K~l~~~i~~~---~~~I~~~ik~~Ee~~k~~k~~~i~~~~~~~~~~~~v~~~~fe~~-- 134 (215)
T PF07083_consen 60 DKRKEIKKEYSKPIKEFEAKIKELIAPIDEA---SDKIDEQIKEFEEKEKEEKREKIKEYFEEMAEEYGVDPEPFERI-- 134 (215)
T ss_pred HHHHHHHHHHhchHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCChHHHhhh--
Confidence 3556778888899999999999998777543 3444444443333322 123455555555443332222334433
Q ss_pred hhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhh
Q 012498 214 CLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSK 254 (462)
Q Consensus 214 ~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQsk 254 (462)
-...|.=.++|..+.+..+..-..++...+.-+-..
T Consensus 135 -----~~~~wlnks~s~kk~~eei~~~i~~~~~~~~~~~~~ 170 (215)
T PF07083_consen 135 -----IKPKWLNKSYSLKKIEEEIDDQIDKIKQDLEEIKAA 170 (215)
T ss_pred -----cchHHhhcCCcHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456687778888887777777666665555444433
No 101
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=44.62 E-value=2.9e+02 Score=29.82 Aligned_cols=105 Identities=13% Similarity=0.089 Sum_probs=67.5
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR 90 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR 90 (462)
..++.+.-|..||.+.-+++-+.-+|=.. ..|.. -.-..+-.|.++|++.|...+.++++-.. +.++-.-
T Consensus 280 ~a~~~~~lI~~Le~qLa~~~aeL~~L~~~-~~p~s--PqV~~l~~rI~aLe~QIa~er~kl~~~~g-~~~la~~------ 349 (434)
T PRK15178 280 TITAIYQLIAGFETQLAEAKAEYAQLMVN-GLDQN--PLIPRLSAKIKVLEKQIGEQRNRLSNKLG-SQGSSES------ 349 (434)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCCCC--CchhHHHHHHHHHHHHHHHHHHHhhcCCC-CCchhHH------
Confidence 46788899999999999999888877332 23333 11245667889999999999999974321 1122111
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHh
Q 012498 91 IKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKA 137 (462)
Q Consensus 91 iK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEka 137 (462)
+ +.=.+++-+..|=|...+.|.+--+++-+||.+.
T Consensus 350 ----l--------aeYe~L~le~efAe~~y~sAlaaLE~AR~EA~RQ 384 (434)
T PRK15178 350 ----L--------SLFEDLRLQSEIAKARWESALQTLQQGKLQALRE 384 (434)
T ss_pred ----H--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 1 1113344455555666677777777777777653
No 102
>PF05064 Nsp1_C: Nsp1-like C-terminal region; InterPro: IPR007758 The NSP1-like protein appears to be an essential component of the nuclear pore complex, for example preribosome nuclear export requires the Nup82p-Nup159p-Nsp1p complex. The C-terminal of Nsp1 is involved in binding Nup82 [], probably via coiled-coil formation [, ]. The family is related to the rotavirus nonstructural protein NSP1 which is the least conserved protein in the rotavirus genome. Its function in the replication process is not fully understood.; GO: 0017056 structural constituent of nuclear pore, 0005643 nuclear pore; PDB: 3T97_C.
Probab=44.45 E-value=74 Score=27.73 Aligned_cols=29 Identities=31% Similarity=0.320 Sum_probs=6.8
Q ss_pred hHHHHHHHHHhhhhHHHHHhhhhhhhHHHH
Q 012498 106 NMEAEKQVKFFQGCMAAAFAERDNSVMEAE 135 (462)
Q Consensus 106 n~e~EkqvkFfQs~vA~AFAERD~slmEaE 135 (462)
+.+|++|+|.|.. .|.-++..|..||+..
T Consensus 28 ~~eLe~q~k~F~~-qA~~V~~wDr~Lv~n~ 56 (116)
T PF05064_consen 28 NKELEEQEKEFNE-QATQVNAWDRQLVENG 56 (116)
T ss_dssp ----------------------TCHHHHHH
T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHH
Confidence 5788999999985 5788999999999854
No 103
>KOG2129 consensus Uncharacterized conserved protein H4 [Function unknown]
Probab=44.41 E-value=2.8e+02 Score=30.63 Aligned_cols=13 Identities=23% Similarity=0.337 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHH
Q 012498 86 SEAYRIKGQLADL 98 (462)
Q Consensus 86 sEAYRiK~qLadL 98 (462)
+|.-|+|++|+.-
T Consensus 260 ~EveRlrt~l~~A 272 (552)
T KOG2129|consen 260 AEVERLRTYLSRA 272 (552)
T ss_pred HHHHHHHHHHHHH
Confidence 4666777777643
No 104
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=44.11 E-value=2.7e+02 Score=28.62 Aligned_cols=29 Identities=31% Similarity=0.323 Sum_probs=20.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAV 48 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~v 48 (462)
...+..++.+|+.++.+|.. .-||.+=.|
T Consensus 249 ~~~l~~~l~~l~~~l~~l~~--------~y~~~hP~v 277 (498)
T TIGR03007 249 NSELDGRIEALEKQLDALRL--------RYTDKHPDV 277 (498)
T ss_pred CCchHHHHHHHHHHHHHHHH--------HhcccChHH
Confidence 45677889999888888874 346666444
No 105
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=43.76 E-value=2.5e+02 Score=25.41 Aligned_cols=34 Identities=21% Similarity=0.257 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHH
Q 012498 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLE 177 (462)
Q Consensus 144 m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~ 177 (462)
..+.+++++..+.++-+.+.+.|...+.|+..+.
T Consensus 56 ~~qr~~eLqaki~ea~~~le~eK~ak~~l~~r~~ 89 (107)
T PF09304_consen 56 RNQRIAELQAKIDEARRNLEDEKQAKLELESRLL 89 (107)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466666777777776666666655555555444
No 106
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=43.53 E-value=2.7e+02 Score=29.34 Aligned_cols=43 Identities=19% Similarity=0.119 Sum_probs=22.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHH
Q 012498 137 AKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQ 179 (462)
Q Consensus 137 aKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~ 179 (462)
..+.-..+.+++.++..++.+++..+.+.++.-..|+.+|..+
T Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~l 171 (525)
T TIGR02231 129 WFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNAL 171 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3344445555555555555555555555555545555555444
No 107
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=43.49 E-value=7.1e+02 Score=30.62 Aligned_cols=155 Identities=23% Similarity=0.223 Sum_probs=81.4
Q ss_pred HHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhh------hccc----ccchhhhhcccccc
Q 012498 152 QTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV------LETS----WEDKCACLLLDSAE 221 (462)
Q Consensus 152 ~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~------~~~s----~~~Kcs~LL~Ds~~ 221 (462)
..|+++++..+.+.++--+++|-.-++ +++.+.+...|..-+.++.+-... ..+. --.||++-+--|.
T Consensus 857 ~~~l~~~~~~ie~l~kE~e~~qe~~~K-k~~i~~lq~~i~~i~~e~~q~qk~kv~~~~~~~~~l~~~i~k~~~~i~~s~- 934 (1293)
T KOG0996|consen 857 KKRLKELEEQIEELKKEVEELQEKAAK-KARIKELQNKIDEIGGEKVQAQKDKVEKINEQLDKLEADIAKLTVAIKTSD- 934 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhH-HHHHHHHHHHHHHhhchhhHHhHHHHHHHHHHHHHHHHHHHHhHHHHhcCc-
Confidence 345566666666666666666644444 566666666666666554332211 1111 1123444333222
Q ss_pred ccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHH
Q 012498 222 MWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHV 301 (462)
Q Consensus 222 ~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~I 301 (462)
|. -+...+-++-|+.+.+.+..+++.|-.. .+|+...+-++++.- +=-.++|-+++.-|...+..+
T Consensus 935 -~~--i~k~q~~l~~le~~~~~~e~e~~~L~e~-------~~~~~~k~~E~~~~~----~e~~~~~~E~k~~~~~~k~~~ 1000 (1293)
T KOG0996|consen 935 -RN--IAKAQKKLSELEREIEDTEKELDDLTEE-------LKGLEEKAAELEKEY----KEAEESLKEIKKELRDLKSEL 1000 (1293)
T ss_pred -cc--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhhhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHH
Confidence 11 1234555666666666666666665433 234444444444432 123567777777777777777
Q ss_pred HHhhhhcchhhhhhHHHHHhhh
Q 012498 302 VNSLEEGRSHIKSISDVIEEKT 323 (462)
Q Consensus 302 m~lL~ee~s~i~s~v~~ieekl 323 (462)
-++=+.+-..-...|+ |+.|+
T Consensus 1001 e~i~k~~~~lk~~rId-~~~K~ 1021 (1293)
T KOG0996|consen 1001 ENIKKSENELKAERID-IENKL 1021 (1293)
T ss_pred HHHHHHHHHHHHhhcc-HHHHH
Confidence 6665555444444555 66666
No 108
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=42.78 E-value=6.4e+02 Score=29.84 Aligned_cols=64 Identities=17% Similarity=0.237 Sum_probs=39.4
Q ss_pred HHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHH
Q 012498 101 AEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIE 164 (462)
Q Consensus 101 ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~ 164 (462)
.++..-.+-+..|.=|+.-+..-|..+|.-.-+.-..++.....-+++..++.++....+....
T Consensus 785 ~~l~~ie~~r~~V~eY~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~ 848 (1201)
T PF12128_consen 785 KELKRIEERRAEVIEYEDWLQEEWDKVDELREEKPELEEQLRDLEQELQELEQELNQLQKEVKQ 848 (1201)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555556667778888888888776433333344444445556777777776666555543
No 109
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=42.55 E-value=3.6e+02 Score=26.97 Aligned_cols=60 Identities=25% Similarity=0.273 Sum_probs=38.2
Q ss_pred hHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHH
Q 012498 131 VMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190 (462)
Q Consensus 131 lmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI 190 (462)
--|-..+|++.....-++.++..+..+++..+...+.--..+..++...++-.+.-...|
T Consensus 95 ~~E~~~ak~r~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~~e~~~~~e~e~~~i 154 (239)
T COG1579 95 NIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI 154 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356666777777777777777777777777776666655555555555554444333333
No 110
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=42.08 E-value=1e+02 Score=25.45 Aligned_cols=40 Identities=28% Similarity=0.210 Sum_probs=24.0
Q ss_pred hhhHHHHHHHHHHHHhhhhhhhHHHHHHHHHHhHHHHHHH
Q 012498 412 NVNSALQKKIEELQRNLFQVTTEKVKALMELAQLKQDYQL 451 (462)
Q Consensus 412 ~~n~~lq~~ieeLqrnl~QVt~EKVkaLmElAqLkq~y~l 451 (462)
.++..||..+++|.+.-.+..++.-..--|..+|++++.-
T Consensus 18 eti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e~~~ 57 (72)
T PF06005_consen 18 ETIALLQMENEELKEKNNELKEENEELKEENEQLKQERNA 57 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556666666665555555566666667777776654
No 111
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=41.94 E-value=3.4e+02 Score=26.47 Aligned_cols=37 Identities=14% Similarity=0.178 Sum_probs=20.7
Q ss_pred hHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHH
Q 012498 50 TRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELS 86 (462)
Q Consensus 50 TRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELs 86 (462)
...+..+.+.+....+.++.++.....+-..++.++.
T Consensus 125 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~i~ 161 (423)
T TIGR01843 125 PELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELA 161 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455556666666666666666655544444444443
No 112
>PF12808 Mto2_bdg: Micro-tubular organiser Mto1 C-term Mto2-binding region; InterPro: IPR024545 This domain occurs at the C terminus of microtubule organising proteins in both budding and fission fungi. In Schizosaccharomyces pombe it has been shown to interact with the Mto2p protein, an interaction which is critical for anchoring the cytokinetic actin ring to the medial region of the cell and for proper coordination of mitosis with cytokinesis [, ].
Probab=41.73 E-value=42 Score=26.66 Aligned_cols=28 Identities=32% Similarity=0.431 Sum_probs=24.7
Q ss_pred CcchHHHHHHHHHHHHHHHHhHHHHHhh
Q 012498 227 DTSTSKYISALEDELEKTRSSVENLQSK 254 (462)
Q Consensus 227 ~tstskyisaLEeE~e~lr~~i~~LQsk 254 (462)
-+++++=|+.|+.||..|++.+..+|+.
T Consensus 24 ~~~a~~rl~~l~~EN~~Lr~eL~~~r~~ 51 (52)
T PF12808_consen 24 RSAARKRLSKLEGENRLLRAELERLRSR 51 (52)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4568899999999999999999998863
No 113
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=41.46 E-value=3.1e+02 Score=25.86 Aligned_cols=70 Identities=23% Similarity=0.364 Sum_probs=37.8
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEEL 85 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEEL 85 (462)
..+.|.+.|..++.+..+|...|+.. .+| |.. ..-.....+-..|++++..|+++|....+-+...-+++
T Consensus 70 ~~~~l~~~~~~~~~~i~~l~~~i~~~---~~~r~~~--~eR~~~l~~l~~l~~~~~~l~~el~~~~~~Dp~~i~~~ 140 (188)
T PF03962_consen 70 KLEKLQKEIEELEKKIEELEEKIEEA---KKGREES--EEREELLEELEELKKELKELKKELEKYSENDPEKIEKL 140 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---Hhccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHH
Confidence 35666777777777777777777776 333 222 22223344445555555555555554444443333333
No 114
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=40.72 E-value=4.2e+02 Score=30.28 Aligned_cols=91 Identities=25% Similarity=0.291 Sum_probs=52.8
Q ss_pred hhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHh
Q 012498 58 AGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKA 137 (462)
Q Consensus 58 a~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEka 137 (462)
...+.+|..+.+++....++|.+|+-++-+--++-.-| +.++.=|.-. ++-+
T Consensus 418 ~~~~~~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L--------------~~~l~~~~r~------------~~~~-- 469 (652)
T COG2433 418 TVYEKRIKKLEETVERLEEENSELKRELEELKREIEKL--------------ESELERFRRE------------VRDK-- 469 (652)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHH------------HHHH--
Confidence 66777888888888888999999988876544332222 2211111110 1111
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHH
Q 012498 138 KEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQE 180 (462)
Q Consensus 138 KE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~ 180 (462)
.-...++...+.|+..|+..+.+.+.--+.|-..|+.++
T Consensus 470 ----~~~~rei~~~~~~I~~L~~~L~e~~~~ve~L~~~l~~l~ 508 (652)
T COG2433 470 ----VRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELR 508 (652)
T ss_pred ----HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 122234555566666666666666666666666666554
No 115
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=40.61 E-value=2.5e+02 Score=24.55 Aligned_cols=39 Identities=36% Similarity=0.436 Sum_probs=25.8
Q ss_pred hhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH
Q 012498 57 TAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 57 ta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL 98 (462)
...+.++++.+........+.|..++.+|.+ .|+++..+
T Consensus 29 ~~~~~~~~~~l~~~n~~lAe~nL~~~~~l~~---~r~~l~~~ 67 (150)
T PF07200_consen 29 VQELQQEREELLAENEELAEQNLSLEPELEE---LRSQLQEL 67 (150)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHH----HHHHH---HHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccchHHHH---HHHHHHHH
Confidence 3456777888888888888888888888876 56677666
No 116
>PRK09343 prefoldin subunit beta; Provisional
Probab=40.46 E-value=2.6e+02 Score=24.60 Aligned_cols=94 Identities=20% Similarity=0.288 Sum_probs=51.4
Q ss_pred HHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHH
Q 012498 64 IEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEEL 143 (462)
Q Consensus 64 iE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~ 143 (462)
++.+++++..+...-..|.-++.|+-....-|..| +.+-+.|- .|...|--.|.+=+
T Consensus 16 ~q~lq~~l~~~~~q~~~le~q~~e~~~~~~EL~~L-----------~~d~~VYk-~VG~vlv~qd~~e~----------- 72 (121)
T PRK09343 16 LQQLQQQLERLLQQKSQIDLELREINKALEELEKL-----------PDDTPIYK-IVGNLLVKVDKTKV----------- 72 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-----------CCcchhHH-HhhHHHhhccHHHH-----------
Confidence 44555566666666666666666666655555544 23344443 36666665554322
Q ss_pred HHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 144 MSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 144 m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
..++++|++-+.+.+.........|+..+..+..+..
T Consensus 73 ----~~~l~~r~E~ie~~ik~lekq~~~l~~~l~e~q~~l~ 109 (121)
T PRK09343 73 ----EKELKERKELLELRSRTLEKQEKKLREKLKELQAKIN 109 (121)
T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1344455555555555555555666666666655555
No 117
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=39.74 E-value=2.8e+02 Score=24.90 Aligned_cols=18 Identities=28% Similarity=0.244 Sum_probs=7.6
Q ss_pred hhHhHhhhHHHHHHhhHh
Q 012498 168 QNATLRFDLEKQEELNES 185 (462)
Q Consensus 168 ~n~aLQ~dl~~~~eq~e~ 185 (462)
.-..++.++..+.++...
T Consensus 166 ~~~~~~~~~~~l~~~~~~ 183 (191)
T PF04156_consen 166 QLERLQENLQQLEEKIQE 183 (191)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333344444444444443
No 118
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=39.30 E-value=4.1e+02 Score=26.62 Aligned_cols=178 Identities=18% Similarity=0.286 Sum_probs=99.9
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhhhcccccchhhhhcccccccccc
Q 012498 146 QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEVLETSWEDKCACLLLDSAEMWSF 225 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~Kcs~LL~Ds~~~Wsf 225 (462)
..+.....++.+++-.+.+.+.+-..++.++....++....- . .+
T Consensus 38 ~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e----------------------~----kl--------- 82 (239)
T COG1579 38 AELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAE----------------------E----KL--------- 82 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------H----HH---------
Confidence 445555566666666666666666666666666655544110 0 01
Q ss_pred CCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHh-------HHHHHHhhhhhHHHHHHHHHHHH---Hhhh
Q 012498 226 NDTSTSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKS-------VRELEKKIIHSDKFISNAIAELR---LCHS 295 (462)
Q Consensus 226 n~tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~-------vr~Lekkqi~~dk~i~ngi~~lq---~~h~ 295 (462)
.+.++.+-.+||..|.++++..+..|-..|.=-.+...+|.+. +..+|+...-+-.-+...+..+. +-|.
T Consensus 83 ~~v~~~~e~~aL~~E~~~ak~r~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~~e~~~~~e~e~~~i~e~~~~~~ 162 (239)
T COG1579 83 SAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEIREEGQELS 162 (239)
T ss_pred hccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1445888888888888888888777777776655555555443 33444444444444555555554 4566
Q ss_pred HHHHHHHHhhhhcchhhhhhHHHHHhhhccccccccccccCCCcccccccccccccceec-cCCCCccccCCCCCCcchh
Q 012498 296 QLRVHVVNSLEEGRSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETKLDEFECRDVHIN-NDADTNLVSQRNDPAYCDI 374 (462)
Q Consensus 296 ~~R~~Im~lL~ee~s~i~s~v~~ieekl~~~~n~~~E~n~~~pq~e~~~~e~ec~dVhv~-~d~~p~~~~k~~~p~~~~~ 374 (462)
..|+++..=|..+ ++..++.-..-.-+ .+..|+....|..-||- |+..-+.+.+.|.+.-|..
T Consensus 163 ~~~~~L~~~l~~e------ll~~yeri~~~~kg----------~gvvpl~g~~C~GC~m~l~~~~~~~V~~~d~iv~CP~ 226 (239)
T COG1579 163 SKREELKEKLDPE------LLSEYERIRKNKKG----------VGVVPLEGRVCGGCHMKLPSQTLSKVRKKDEIVFCPY 226 (239)
T ss_pred HHHHHHHHhcCHH------HHHHHHHHHhcCCC----------ceEEeecCCcccCCeeeecHHHHHHHhcCCCCccCCc
Confidence 6666665544432 11222221122112 34556667788888874 4444455666676666654
No 119
>KOG0933 consensus Structural maintenance of chromosome protein 2 (chromosome condensation complex Condensin, subunit E) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=38.97 E-value=8e+02 Score=29.89 Aligned_cols=58 Identities=21% Similarity=0.338 Sum_probs=34.6
Q ss_pred hHHHHHHHHHHHH---HHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhh
Q 012498 12 SEALMARIQQLEH---ERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACAR 76 (462)
Q Consensus 12 ~e~l~~RI~qLe~---ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~r 76 (462)
.+++...|+.|-. +-..-++|++.+=-|=++-- -.++-..-|.|+++...-+|+.|.+
T Consensus 669 ~a~~L~~l~~l~~~~~~~~~~q~el~~le~eL~~le-------~~~~kf~~l~~ql~l~~~~l~l~~~ 729 (1174)
T KOG0933|consen 669 GADLLRQLQKLKQAQKELRAIQKELEALERELKSLE-------AQSQKFRDLKQQLELKLHELALLEK 729 (1174)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555443 44445677777666554421 1234455688888888888887754
No 120
>TIGR03789 pdsO proteobacterial sortase system OmpA family protein. A newly defined histidine kinase (TIGR03785) and response regulator (TIGR03787) gene pair occurs exclusively in Proteobacteria, mostly of marine origin, nearly all of which contain a subfamily 6 sortase (TIGR03784) and its single dedicated target protein (TIGR03788) adjacent to to the sortase. This protein family shows up in only in those species with the histidine kinase/response regulator gene pair, and often adjacent to that pair. It belongs to the OmpA protein family (pfam00691). Its function is unknown. We assign the gene symbol pdsO, for Proteobacterial Dedicated Sortase system OmpA family protein.
Probab=36.91 E-value=94 Score=30.64 Aligned_cols=52 Identities=19% Similarity=0.199 Sum_probs=35.3
Q ss_pred chHHHHHHHHHHHHHHHhhcHHHHHHHHHHhhhHHHHHHHHHHHHhhhhhhhH
Q 012498 382 ASETLAQALQEKVAALLLLSQQEERHLLERNVNSALQKKIEELQRNLFQVTTE 434 (462)
Q Consensus 382 ~s~alAqAL~EKveALlLlSQqeER~llE~~~n~~lq~~ieeLqrnl~QVt~E 434 (462)
..+++ +.|..+=..|+-|||++.++.-=.+-+...|.++++||+..-|..++
T Consensus 79 ~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 130 (239)
T TIGR03789 79 NDEQQ-QHIAQQRQQMVALTQKQQALEQLEAEYQQAQVHLETLQQDQQQLLEE 130 (239)
T ss_pred CcHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 44555 77888878888888888887655555555566677777766664433
No 121
>PF01017 STAT_alpha: STAT protein, all-alpha domain; InterPro: IPR013800 The STAT protein (Signal Transducers and Activators of Transcription) family contains transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors, hence they act as signal transducers in the cytoplasm and transcription activators in the nucleus []. Binding of these factors to cell-surface receptors leads to receptor autophosphorylation at a tyrosine, the phosphotyrosine being recognised by the STAT SH2 domain, which mediates the recruitment of STAT proteins from the cytosol and their association with the activated receptor. The STAT proteins are then activated by phosphorylation via members of the JAK family of protein kinases, causing them to dimerise and translocated to the nucleus, where they bind to specific promoter sequences in target genes. In mammals, STATs comprise a family of seven structurally and functionally related proteins: Stat1, Stat2, Stat3, Stat4, Stat5a and Stat5b, Stat6. STAT proteins play a critical role in regulating innate and acquired host immune responses. Dysregulation of at least two STAT signalling cascades (i.e. Stat3 and Stat5) is associated with cellular transformation. Signalling through the JAK/STAT pathway is initiated when a cytokine binds to its corresponding receptor. This leads to conformational changes in the cytoplasmic portion of the receptor, initiating activation of receptor associated members of the JAK family of kinases. The JAKs, in turn, mediate phosphorylation at the specific receptor tyrosine residues, which then serve as docking sites for STATs and other signalling molecules. Once recruited to the receptor, STATs also become phosphorylated by JAKs, on a single tyrosine residue. Activated STATs dissociate from the receptor, dimerise, translocate to the nucleus and bind to members of the GAS (gamma activated site) family of enhancers. The seven STAT proteins identified in mammals range in size from 750 and 850 amino acids. The chromosomal distribution of these STATs, as well as the identification of STATs in more primitive eukaryotes, suggest that this family arose from a single primordial gene. STATs share structurally and functionally conserved domains including: an N-terminal domain that strengthens interactions between STAT dimers on adjacent DNA-binding sites; a coiled-coil STAT domain that is implicated in protein-protein interactions; a DNA-binding domain with an immunoglobulin-like fold similar to p53 tumour suppressor protein; an EF-hand-like linker domain connecting the DNA-binding and SH2 domains; an SH2 domain (IPR000980 from INTERPRO) that acts as a phosphorylation-dependent switch to control receptor recognition and DNA-binding; and a C-terminal transactivation domain []. The crystal structure of the N terminus of Stat4 reveals a dimer. The interface of this dimer is formed by a ring-shaped element consisting of five short helices. Several studies suggest that this N-terminal dimerisation promotes cooperativity of binding to tandem GAS elements and with the transcriptional coactivator CBP/p300. This entry represents the all-alpha helical domain, which consists of four long helices arranged in a bundle with a left-handed twist (coiled-coil), which in turn forms a right-handed superhelix.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0004871 signal transducer activity, 0006355 regulation of transcription, DNA-dependent, 0007165 signal transduction, 0005634 nucleus; PDB: 1YVL_A 1BF5_A 3CWG_B 1BG1_A 1Y1U_B.
Probab=36.46 E-value=2.9e+02 Score=25.61 Aligned_cols=95 Identities=22% Similarity=0.346 Sum_probs=49.8
Q ss_pred HhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHH-HHHHHHHHHHhhH-HHHHHHHHhhhhHHHHHhhhhhhhH
Q 012498 55 QRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQ-LADLHAAEVIKNM-EAEKQVKFFQGCMAAAFAERDNSVM 132 (462)
Q Consensus 55 qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~q-LadLh~ae~~Kn~-e~EkqvkFfQs~vA~AFAERD~slm 132 (462)
.|-..+++.+..|+++.-..-.++..|++ +-|.|-++++ |-.+...+ .|. .....++-.+..+.+-+.
T Consensus 2 ~~~~ei~~~l~~l~~~vq~~e~~~k~Le~-~QE~f~~~~q~lq~~~~~~--~~~~~~~~~~~~~~~~~~~~~~------- 71 (182)
T PF01017_consen 2 EKQQEIEQKLQDLRNRVQETENDIKSLED-LQEEFDFQYQTLQQLQETE--QNSNALKEQLKQEQQQLQQMLN------- 71 (182)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCTTTTT----STTTHHHHHCCCCCHHHHHHHH-------
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcccc--chhhhhHHHHHHHHHHHHHHHH-------
Confidence 34556777777787777777777777754 5688888886 21221111 111 112222222222222222
Q ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHhHHH
Q 012498 133 EAEKAKEKEELMSQKFNEFQTRLEELSSEN 162 (462)
Q Consensus 133 EaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~ 162 (462)
....+...+..++.+.=..++.+++.+
T Consensus 72 ---~L~~~R~~lv~~l~~~~~~~~~lq~~l 98 (182)
T PF01017_consen 72 ---ELDQKRKELVSKLKETLNCLEQLQSQL 98 (182)
T ss_dssp ---HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 223344556666777777777776554
No 122
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=36.31 E-value=2.7e+02 Score=23.69 Aligned_cols=78 Identities=23% Similarity=0.296 Sum_probs=50.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH--------HHhhcCCchHH----HhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhc
Q 012498 12 SEALMARIQQLEHERDELRKDIEQL--------CMQQAGPSYLA----VATRMHFQRTAGLEQEIEILKQKIAACARENS 79 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqL--------CMQQaGpgyl~----vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~ 79 (462)
...+...+.+|+.+..|...=++.| |.-..||-+|- -|--=+--|...++-.|..|.+++..+...=.
T Consensus 19 ~~~l~~q~~~le~~~~E~~~v~~eL~~l~~d~~vyk~VG~vlv~~~~~e~~~~l~~r~e~ie~~i~~lek~~~~l~~~l~ 98 (110)
T TIGR02338 19 LQAVATQKQQVEAQLKEAEKALEELERLPDDTPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQEERLREQLK 98 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcchhHHHhchhhheecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566777778887777776655544 67777776642 22233455666777777777777777766666
Q ss_pred chHHHHHHHH
Q 012498 80 NLQEELSEAY 89 (462)
Q Consensus 80 nLQEELsEAY 89 (462)
++|..|-+++
T Consensus 99 e~q~~l~~~~ 108 (110)
T TIGR02338 99 ELQEKIQEAL 108 (110)
T ss_pred HHHHHHHHHh
Confidence 6666666654
No 123
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=36.16 E-value=3.2e+02 Score=24.53 Aligned_cols=98 Identities=24% Similarity=0.245 Sum_probs=57.9
Q ss_pred HHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHH
Q 012498 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEK 140 (462)
Q Consensus 61 EQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~ 140 (462)
-+-++.|+..|..+-.|...|+++++..=+-|..+++=--+-...|.++ ...+..
T Consensus 15 ~~~ve~L~s~lr~~E~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~-------------------------~~~~~~ 69 (120)
T PF12325_consen 15 VQLVERLQSQLRRLEGELASLQEELARLEAERDELREEIVKLMEENEEL-------------------------RALKKE 69 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------HHHHHH
Confidence 3557888888888888888888888877777777763211111111111 112222
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhh
Q 012498 141 EELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELN 183 (462)
Q Consensus 141 Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~ 183 (462)
-..+-+++.+++.|...+=--+-+-.+.+..|+.|+..+++--
T Consensus 70 ~~~L~~el~~l~~ry~t~LellGEK~E~veEL~~Dv~DlK~my 112 (120)
T PF12325_consen 70 VEELEQELEELQQRYQTLLELLGEKSEEVEELRADVQDLKEMY 112 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHH
Confidence 2344466666666655444444455567788888888776543
No 124
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=35.96 E-value=5.3e+02 Score=26.92 Aligned_cols=21 Identities=29% Similarity=0.408 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q 012498 17 ARIQQLEHERDELRKDIEQLC 37 (462)
Q Consensus 17 ~RI~qLe~ERdEL~KDIEqLC 37 (462)
+-+..-+.|||.....+|||=
T Consensus 16 ~eLe~cq~ErDqyKlMAEqLq 36 (319)
T PF09789_consen 16 QELEKCQSERDQYKLMAEQLQ 36 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344444558888887777774
No 125
>PF07047 OPA3: Optic atrophy 3 protein (OPA3); InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=35.58 E-value=72 Score=28.45 Aligned_cols=34 Identities=29% Similarity=0.492 Sum_probs=28.3
Q ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHh
Q 012498 134 AEKAKEKEELMSQKFNEFQTRLEELSSENIELKK 167 (462)
Q Consensus 134 aEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~ 167 (462)
+.|.+.+|+...+.+..++.++++++..+.+|+.
T Consensus 100 ~~ke~~Ke~~~~~~l~~L~~~i~~L~~~~~~~~~ 133 (134)
T PF07047_consen 100 ARKEAKKEEELQERLEELEERIEELEEQVEKQQE 133 (134)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4566677888889999999999999998887764
No 126
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=35.38 E-value=2.5e+02 Score=25.71 Aligned_cols=38 Identities=26% Similarity=0.376 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhH
Q 012498 139 EKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDL 176 (462)
Q Consensus 139 E~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl 176 (462)
+.......++.+....++..+.+++..|.+...|+.++
T Consensus 154 ~~~~~~~~ei~~lk~el~~~~~~~~~LkkQ~~~l~~ey 191 (192)
T PF05529_consen 154 EENKKLSEEIEKLKKELEKKEKEIEALKKQSEGLQKEY 191 (192)
T ss_pred hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 34444556667777777777777776666666665543
No 127
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=35.35 E-value=2.2e+02 Score=26.06 Aligned_cols=65 Identities=26% Similarity=0.342 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchH
Q 012498 16 MARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQ 82 (462)
Q Consensus 16 ~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQ 82 (462)
+.|+-.+-++...+++.++.+=-|-.+.. ..+.+......+.+..||++|+++|.....|...|+
T Consensus 117 I~r~~~li~~l~~~~~~~~~~~kq~~~~~--~~~~~~~~~~~~~~~~ei~~lk~el~~~~~~~~~Lk 181 (192)
T PF05529_consen 117 IRRVHSLIKELIKLEEKLEALKKQAESAS--EAAEKLLKEENKKLSEEIEKLKKELEKKEKEIEALK 181 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhh--hhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555555544443321 123333556677788888888887777555544444
No 128
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=34.73 E-value=1.5e+02 Score=22.86 Aligned_cols=37 Identities=35% Similarity=0.478 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHh
Q 012498 146 QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEEL 182 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq 182 (462)
+.+.+++.++..+++.....+..+..|...+..+..+
T Consensus 26 ~~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e 62 (64)
T PF00170_consen 26 QYIEELEEKVEELESENEELKKELEQLKKEIQSLKSE 62 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4455666666666666665555555555555555443
No 129
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=34.69 E-value=3.8e+02 Score=24.96 Aligned_cols=39 Identities=31% Similarity=0.346 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 146 QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
.+....+.|+..+...+..+++.....+..+..+.+.++
T Consensus 63 ~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~ 101 (302)
T PF10186_consen 63 REIEELRERLERLRERIERLRKRIEQKRERLEELRESLE 101 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455566666666666666666666666666666666665
No 130
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=34.67 E-value=4.8e+02 Score=26.12 Aligned_cols=52 Identities=33% Similarity=0.396 Sum_probs=29.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAA 73 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~ 73 (462)
.+.|..++..|+.+..-|.++++++ +...--...|-++|+.++..|+.....
T Consensus 151 ~~~L~~~~~~L~~D~~~L~~~~~~l----------~~~~~~l~~~~~~L~~e~~~Lk~~~~e 202 (325)
T PF08317_consen 151 KEGLEENLELLQEDYAKLDKQLEQL----------DELLPKLRERKAELEEELENLKQLVEE 202 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3455555555555555565555554 122223345667788888888775443
No 131
>PF09832 DUF2059: Uncharacterized protein conserved in bacteria (DUF2059); InterPro: IPR018637 This entry contains proteins that have no known function. ; PDB: 2X3O_B 3OAO_A.
Probab=34.33 E-value=1e+02 Score=23.35 Aligned_cols=43 Identities=14% Similarity=0.353 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHH
Q 012498 90 RIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVME 133 (462)
Q Consensus 90 RiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmE 133 (462)
+++..+++.|...+ -..|+..=+.||.|-+.+.|...-.+++.
T Consensus 4 ~~~~~~~~~y~~~f-t~~El~~i~~FY~Sp~Gqk~~~~~~~~~~ 46 (64)
T PF09832_consen 4 KMIDQMAPIYAEHF-TEEELDAILAFYESPLGQKIVAKEPALMQ 46 (64)
T ss_dssp HHHHHHHHHHHHHS--HHHHHHHHHHHHSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHC-CHHHHHHHHHHHCCHHhHHHHHHhHHHHH
Confidence 34556666665554 45688999999999999999887776665
No 132
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=34.32 E-value=29 Score=38.38 Aligned_cols=44 Identities=27% Similarity=0.277 Sum_probs=34.7
Q ss_pred hhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHH
Q 012498 126 ERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEE 181 (462)
Q Consensus 126 ERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~e 181 (462)
|||.++||+|+|. .+-|+-.||-.-..|+.+.-.||+..++++-
T Consensus 33 E~dr~~WElERaE------------lqariAfLqgErk~qenlk~dl~rR~kmlE~ 76 (577)
T KOG0642|consen 33 ERDRARWELERAE------------LQARIAFLQGERKGQENLKMDLVRRIKMLEF 76 (577)
T ss_pred hhhhhheehhhhh------------HHHHHHHHhcchhhhHHHHHHHHHHHhcccc
Confidence 8999999999986 5667777777778888887777777766643
No 133
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=34.25 E-value=4.2e+02 Score=25.23 Aligned_cols=73 Identities=26% Similarity=0.353 Sum_probs=46.7
Q ss_pred HHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHh
Q 012498 103 VIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEEL 182 (462)
Q Consensus 103 ~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq 182 (462)
...+.++.++++||++ |+ +.+..+..|+..++..+...+.-+..|...+..+...
T Consensus 68 ~~e~~eL~k~L~~y~k---------dK----------------~~L~~~k~rl~~~ek~l~~Lk~e~evL~qr~~kle~E 122 (201)
T PF13851_consen 68 EEEVEELRKQLKNYEK---------DK----------------QSLQNLKARLKELEKELKDLKWEHEVLEQRFEKLEQE 122 (201)
T ss_pred HHHHHHHHHHHHHHHH---------HH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456677888887754 22 3455666777777777777777777777777776665
Q ss_pred hHhHH-HHHHHHHHHhhhh
Q 012498 183 NESFK-EVINKFYEIRQQS 200 (462)
Q Consensus 183 ~e~~~-kVI~KFyeiR~~~ 200 (462)
-+-+. +.-..+++|.+.+
T Consensus 123 rdeL~~kf~~~i~evqQk~ 141 (201)
T PF13851_consen 123 RDELYRKFESAIQEVQQKT 141 (201)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 55333 4444556666643
No 134
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=34.01 E-value=7.1e+02 Score=27.82 Aligned_cols=40 Identities=25% Similarity=0.307 Sum_probs=28.9
Q ss_pred cccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhh
Q 012498 205 ETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKL 255 (462)
Q Consensus 205 ~~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskL 255 (462)
.+....|..-||.|+.. .|+.|+.-++.-.+.+..|+++.
T Consensus 378 ~~~l~~k~~~lL~d~e~-----------ni~kL~~~v~~s~~rl~~L~~qW 417 (594)
T PF05667_consen 378 ELKLKKKTVELLPDAEE-----------NIAKLQALVEASEQRLVELAQQW 417 (594)
T ss_pred HHHHHHHHHHHhcCcHH-----------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666677877766 67888888888888888887764
No 135
>KOG4657 consensus Uncharacterized conserved protein [Function unknown]
Probab=33.51 E-value=1.3e+02 Score=30.47 Aligned_cols=68 Identities=22% Similarity=0.266 Sum_probs=47.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHH
Q 012498 16 MARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELS 86 (462)
Q Consensus 16 ~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELs 86 (462)
.+++-+-..|-.-|-+|.++-=-+-. -...+.|+=+. |-+++||||-.+|.+|...++-|+-|.+|+.
T Consensus 50 ar~lS~~~~e~e~l~~~l~etene~~--~~neL~~ek~~-~q~~ieqeik~~q~elEvl~~n~Q~lkeE~d 117 (246)
T KOG4657|consen 50 ARALSQSQVELENLKADLRETENELV--KVNELKTEKEA-RQMGIEQEIKATQSELEVLRRNLQLLKEEKD 117 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34555555566666666665432211 23335566554 4468999999999999999999999999998
No 136
>PF10474 DUF2451: Protein of unknown function C-terminus (DUF2451); InterPro: IPR019514 This protein is found in eukaryotes but its function is not known. The N-terminal domain of some members is PF10475 from PFAM (DUF2450).
Probab=32.87 E-value=4.7e+02 Score=25.44 Aligned_cols=81 Identities=14% Similarity=0.229 Sum_probs=57.2
Q ss_pred hhccccccccccCCcc--hHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHH
Q 012498 214 CLLLDSAEMWSFNDTS--TSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELR 291 (462)
Q Consensus 214 ~LL~Ds~~~Wsfn~ts--tskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq 291 (462)
++-.=+...|..++.. -|.||+.|=++.......++.+-...++--++.+.|-..+=. ..-..++.|.|.++
T Consensus 72 i~~~Ia~vKWdvkev~~qhs~YVd~l~~~~~~f~~rL~~i~~~~~i~~~~~~~lw~~~i~------~~~~~Lveg~s~vk 145 (234)
T PF10474_consen 72 ILNSIANVKWDVKEVMSQHSSYVDQLVQEFQQFSERLDEISKQGPIPPEVQNVLWDRLIF------FAFETLVEGYSRVK 145 (234)
T ss_pred HHHHHHHcCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH------HHHHHHHHHHHhcc
Confidence 3334456679999644 499999999999999999988776666666666554332211 24455678888888
Q ss_pred HhhhHHHHH
Q 012498 292 LCHSQLRVH 300 (462)
Q Consensus 292 ~~h~~~R~~ 300 (462)
++-..-|+-
T Consensus 146 KCs~eGRal 154 (234)
T PF10474_consen 146 KCSNEGRAL 154 (234)
T ss_pred CCChhhHHH
Confidence 888877764
No 137
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=32.86 E-value=1.2e+02 Score=25.09 Aligned_cols=79 Identities=29% Similarity=0.435 Sum_probs=60.6
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHH-hh------------c------------------CCch-----HHHhhHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCM-QQ------------A------------------GPSY-----LAVATRMHF 54 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCM-QQ------------a------------------Gpgy-----l~vATRM~~ 54 (462)
..+.+.++|..|+...+++..=++.|.- +. + |.|| +.=|...+.
T Consensus 4 ~l~~l~~~~~~l~~~~~e~~~~~~~l~~l~~~~~~~~~lvplg~~~~v~g~i~~~~~vlV~lG~~~~vE~s~~eA~~~l~ 83 (120)
T PF02996_consen 4 ELENLQQQIEQLEEQIEEYEEAKETLEELKKEKKEHEILVPLGSGVFVPGKIPDTDKVLVSLGAGYYVEMSLEEAIEFLK 83 (120)
T ss_dssp CCHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--TT-EEEEEECTTEEEEEE-SSTTEEEEEEETTEEEEEEHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeeecCCCCeEEEEEeCCCCEEEEEeeCCeEEEecHHHHHHHHH
Confidence 3567889999999999988888888874 43 1 2222 234778888
Q ss_pred HhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 55 QRTAGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 55 qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
.|...|+..++.+.+++......-..++..+++.|
T Consensus 84 ~r~~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~~~ 118 (120)
T PF02996_consen 84 KRIKELEEQLEKLEKELAELQAQIEQLEQTLQQLY 118 (120)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 99999999999999988888888888888777765
No 138
>PLN02939 transferase, transferring glycosyl groups
Probab=32.11 E-value=9.5e+02 Score=28.75 Aligned_cols=30 Identities=27% Similarity=0.341 Sum_probs=21.1
Q ss_pred hhhhHHHHHhHHHHHHHHHHHHHHHHHHHH
Q 012498 128 DNSVMEAEKAKEKEELMSQKFNEFQTRLEE 157 (462)
Q Consensus 128 D~slmEaEkaKE~Ee~m~qk~~~~~~R~~E 157 (462)
=.++-+.+|.--..|+.-.+++-++.|+.|
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (977)
T PLN02939 152 LQALEDLEKILTEKEALQGKINILEMRLSE 181 (977)
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHhhh
Confidence 344555566554456666899999999998
No 139
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=32.00 E-value=1.1e+03 Score=29.27 Aligned_cols=124 Identities=19% Similarity=0.295 Sum_probs=56.2
Q ss_pred HHHHHHHHHHhhhhhcchHHHHHHHH---HHHHHHHHHHH----HHHHhhHHHHHHHHHhhhhHHHH-----Hhhh----
Q 012498 64 IEILKQKIAACARENSNLQEELSEAY---RIKGQLADLHA----AEVIKNMEAEKQVKFFQGCMAAA-----FAER---- 127 (462)
Q Consensus 64 iE~Lkkkl~~c~ren~nLQEELsEAY---RiK~qLadLh~----ae~~Kn~e~EkqvkFfQs~vA~A-----FAER---- 127 (462)
...++++++..-+|-.++||+-+.-- +++..+..+++ +--+|-..+=.|..++-.-+|.. -+.|
T Consensus 860 l~~~~~~ie~l~kE~e~~qe~~~Kk~~i~~lq~~i~~i~~e~~q~qk~kv~~~~~~~~~l~~~i~k~~~~i~~s~~~i~k 939 (1293)
T KOG0996|consen 860 LKELEEQIEELKKEVEELQEKAAKKARIKELQNKIDEIGGEKVQAQKDKVEKINEQLDKLEADIAKLTVAIKTSDRNIAK 939 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhchhhHHhHHHHHHHHHHHHHHHHHHHHhHHHHhcCcccHHH
Confidence 34555666666666666665544311 22233333322 33444455555666664433321 1122
Q ss_pred -hhhhHHHHH----hHHHHHHHHH-------HHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHH
Q 012498 128 -DNSVMEAEK----AKEKEELMSQ-------KFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFK 187 (462)
Q Consensus 128 -D~slmEaEk----aKE~Ee~m~q-------k~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~ 187 (462)
++.+-+.|+ .++.-+..-. +..+.+.++.|.+..+.+.+..-.++-.++...+.....+.
T Consensus 940 ~q~~l~~le~~~~~~e~e~~~L~e~~~~~~~k~~E~~~~~~e~~~~~~E~k~~~~~~k~~~e~i~k~~~~lk 1011 (1293)
T KOG0996|consen 940 AQKKLSELEREIEDTEKELDDLTEELKGLEEKAAELEKEYKEAEESLKEIKKELRDLKSELENIKKSENELK 1011 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 122222222 2222223333 34445555555555555555555555555555554444333
No 140
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=31.93 E-value=3.8e+02 Score=24.09 Aligned_cols=76 Identities=28% Similarity=0.305 Sum_probs=51.8
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcch-HHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAG-PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNL-QEELSEA 88 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG-pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nL-QEELsEA 88 (462)
+...+...|.+|+.+-.+|++++-.|--+-+. -+.+ .|-=+-..++.|+++|+.|..+|...-..+... .+|...+
T Consensus 73 el~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L~~~~--t~~el~~~i~~l~~e~~~l~~kL~~l~~~~~~vs~ee~~~~ 150 (169)
T PF07106_consen 73 ELAELDAEIKELREELAELKKEVKSLEAELASLSSEP--TNEELREEIEELEEEIEELEEKLEKLRSGSKPVSPEEKEKL 150 (169)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHH
Confidence 46677888999999999999999888876655 1111 111245667889999999999998876643332 3344433
No 141
>PF04999 FtsL: Cell division protein FtsL; InterPro: IPR007082 In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein IPR007060 from INTERPRO [], however this interaction may be indirect [].; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=31.86 E-value=1.2e+02 Score=24.85 Aligned_cols=43 Identities=21% Similarity=0.247 Sum_probs=32.0
Q ss_pred hHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 45 YLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 45 yl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
+.++++-+....+..+..+++.++++......||.+|+=|.+.
T Consensus 25 ~~a~~~v~~~~~~~~~~~~l~~l~~~~~~l~~e~~~L~lE~~~ 67 (97)
T PF04999_consen 25 ISALGVVYSRHQSRQLFYELQQLEKEIDQLQEENERLRLEIAT 67 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344555555557777788999999999999999999877653
No 142
>PF03980 Nnf1: Nnf1 ; InterPro: IPR007128 NNF1 is an essential yeast gene required for proper spindle orientation, nucleolar and nuclear envelope structure and mRNA export [].
Probab=31.22 E-value=94 Score=26.12 Aligned_cols=47 Identities=21% Similarity=0.210 Sum_probs=39.2
Q ss_pred cCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 41 AGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 41 aGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
++|..+..-+-.-+..+..+.+.++.|...|...-.+|..|.+++.+
T Consensus 59 ~~~~~l~P~~~i~a~l~~~~~~~~~~L~~~l~~l~~eN~~L~~~i~~ 105 (109)
T PF03980_consen 59 VWRHSLTPEEDIRAHLAPYKKKEREQLNARLQELEEENEALAEEIQE 105 (109)
T ss_pred CCCCCCChHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666677777778888999999999999999999999999875
No 143
>KOG3215 consensus Uncharacterized conserved protein [Function unknown]
Probab=30.89 E-value=5.7e+02 Score=25.78 Aligned_cols=94 Identities=23% Similarity=0.221 Sum_probs=58.3
Q ss_pred hhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHH-HHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHH
Q 012498 58 AGLEQEIEILKQKIAACARENSNLQEELSEAYRI-KGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEK 136 (462)
Q Consensus 58 a~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRi-K~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEk 136 (462)
+|=++-++.|.++....-.|-..=-+++++|-|| |.-|+.|- ||+-++--.-.==+.-+.|++-
T Consensus 29 ~~~dr~v~~l~ksf~~~~~E~~kee~~y~ea~ri~Ka~L~~Ls---------------q~E~~mlKtqrv~e~nlre~e~ 93 (222)
T KOG3215|consen 29 DGGDRLVEHLEKSFVLAKAEIEKEEKEYSEAKRIRKALLASLS---------------QDEPSMLKTQRVIEMNLREIEN 93 (222)
T ss_pred CCCcHHHHHHHHHHHHHHHHhhhhhhchhHHHHHHHHHHHHHh---------------hcccchHHHHHHHHHHHHHHHH
Confidence 3445667777777665555544444459999999 55577773 3333333333333444566666
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Q 012498 137 AKEKEELMSQKFNEFQTRLEELSSENIELK 166 (462)
Q Consensus 137 aKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk 166 (462)
--+..+.|-++|.+-..-++.+-.++.+.|
T Consensus 94 ~~q~k~Eiersi~~a~~kie~lkkql~eaK 123 (222)
T KOG3215|consen 94 LVQKKLEIERSIQKARNKIELLKKQLHEAK 123 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 666667777777777777777766665555
No 144
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=30.55 E-value=98 Score=26.64 Aligned_cols=39 Identities=33% Similarity=0.515 Sum_probs=23.2
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhh-----hhHHHHHHhHHh
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLR-----MGLEIENHLKKS 268 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR-----~GLeIenhLkk~ 268 (462)
|.+=+..|+-++..++-.++.+-..|+ ++|.+||+||++
T Consensus 63 t~~dv~~L~l~l~el~G~~~~l~~~l~~v~~~~~lLlE~~lk~~ 106 (106)
T PF10805_consen 63 TRDDVHDLQLELAELRGELKELSARLQGVSHQLDLLLENELKKD 106 (106)
T ss_pred CHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 344455555555555555555555544 379999998763
No 145
>PF09403 FadA: Adhesion protein FadA; InterPro: IPR018543 FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. ; PDB: 3ETZ_B 3ETY_A 2GL2_B 3ETX_C 3ETW_A.
Probab=30.48 E-value=4.2e+02 Score=24.13 Aligned_cols=65 Identities=26% Similarity=0.408 Sum_probs=36.9
Q ss_pred HhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHH-----HHHHhhHHHHHHHHHhhhhHHHHHh
Q 012498 55 QRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHA-----AEVIKNMEAEKQVKFFQGCMAAAFA 125 (462)
Q Consensus 55 qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~-----ae~~Kn~e~EkqvkFfQs~vA~AFA 125 (462)
-+-.+||.+.+.|-++ |+.--.++=..|=..-..|+++.. .+...-......||||..-.-.-..
T Consensus 27 ~~l~~LEae~q~L~~k------E~~r~~~~k~~ae~a~~~L~~~~~~~~~i~e~~~kl~~~~~~r~yk~eYk~llk 96 (126)
T PF09403_consen 27 SELNQLEAEYQQLEQK------EEARYNEEKQEAEAAEAELAELKELYAEIEEKIEKLKQDSKVRWYKDEYKELLK 96 (126)
T ss_dssp HHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHGGGSTTHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcchhHHHHHHHHHHH
Confidence 3456677777777663 444444455555555566665533 3344455666788888755443333
No 146
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=30.46 E-value=43 Score=27.50 Aligned_cols=32 Identities=31% Similarity=0.455 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHh
Q 012498 236 ALEDELEKTRSSVENLQSKLRMGLEIENHLKKS 268 (462)
Q Consensus 236 aLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~ 268 (462)
|.-||+|.||.+|..|+.+.+ -|+.||.+.|.
T Consensus 11 AVrEEVevLK~~I~eL~~~n~-~Le~EN~~Lk~ 42 (59)
T PF01166_consen 11 AVREEVEVLKEQIAELEERNS-QLEEENNLLKQ 42 (59)
T ss_dssp T-TTSHHHHHHHHHHHHHHHH-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHh
Confidence 456899999999999998776 48899987654
No 147
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=30.15 E-value=6.3e+02 Score=26.08 Aligned_cols=192 Identities=21% Similarity=0.281 Sum_probs=105.9
Q ss_pred HHHHHHHHHHHHHHhHHHHHhhhhh------------h-----HHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhh
Q 012498 233 YISALEDELEKTRSSVENLQSKLRM------------G-----LEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHS 295 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~LQskLR~------------G-----LeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~ 295 (462)
=|.+|......+...++.+.--|.| | =+.|..|.+-+..++.-+.++.+.+.....-|+..-
T Consensus 72 Ei~~L~~~K~~le~aL~~~~~pl~i~~ecL~~R~~R~~~dlv~D~ve~eL~kE~~li~~~~~lL~~~l~~~~eQl~~lr- 150 (384)
T PF03148_consen 72 EIDLLEEEKRRLEKALEALRKPLSIAQECLSLREKRPGIDLVHDEVEKELLKEVELIENIKRLLQRTLEQAEEQLRLLR- 150 (384)
T ss_pred HHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHhCCCCcccCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence 4555666655555555555544443 2 345677899999999888888888777766554322
Q ss_pred HHHHHHHHhhhhcchhhhhhHHHHHhhh-ccccccccccccCCCcccccccccccccceeccCCCCccccCCC-CCCcch
Q 012498 296 QLRVHVVNSLEEGRSHIKSISDVIEEKT-QHCDDVIRGQNTGTYQRETKLDEFECRDVHINNDADTNLVSQRN-DPAYCD 373 (462)
Q Consensus 296 ~~R~~Im~lL~ee~s~i~s~v~~ieekl-~~~~n~~~E~n~~~pq~e~~~~e~ec~dVhv~~d~~p~~~~k~~-~p~~~~ 373 (462)
.-+. .+-.-+.+|. -+.+|. .+ ..+ .+.+.++...++ |...|+.. .|..-.
T Consensus 151 -----------~ar~---~Le~Dl~dK~~A~~ID~---~~-------~~L-~~~S~~i~~~~~--~~r~~~~~~tp~~W~ 203 (384)
T PF03148_consen 151 -----------AARY---RLEKDLSDKFEALEIDT---QC-------LSL-NNNSTNISYKPG--STRIPKNSSTPESWE 203 (384)
T ss_pred -----------HHHH---HHHHHHHHHHHHHHHHH---HH-------HhC-CCccCCCcccCC--cccccccCCChHHHH
Confidence 1111 2223344444 333332 11 001 111233333332 22222222 222211
Q ss_pred h-hhcccC--CchHHHHHHHHHHHHHHHhhcHHHHHHHHHHhhhHHHHHHHHHHHHhhhhhhhHHHHHHHHHHhHHHHHH
Q 012498 374 I-EADRKG--EASETLAQALQEKVAALLLLSQQEERHLLERNVNSALQKKIEELQRNLFQVTTEKVKALMELAQLKQDYQ 450 (462)
Q Consensus 374 ~-~~d~~~--d~s~alAqAL~EKveALlLlSQqeER~llE~~~n~~lq~~ieeLqrnl~QVt~EKVkaLmElAqLkq~y~ 450 (462)
. ..+.+. ..-.+-+..|++-|..++-=+.. .-.--=..||.+|...|.|.+.-..+....+-+++-|++.+...+.
T Consensus 204 ~~s~~ni~~a~~e~~~S~~LR~~i~~~l~~~~~-dl~~Q~~~vn~al~~Ri~et~~ak~~Le~ql~~~~~ei~~~e~~i~ 282 (384)
T PF03148_consen 204 EFSNENIQRAEKERQSSAQLREDIDSILEQTAN-DLRAQADAVNAALRKRIHETQEAKNELEWQLKKTLQEIAEMEKNIE 282 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 0 011111 22223345566666655432221 1122234689999999999999999999999999999999999988
Q ss_pred Hhh
Q 012498 451 LLQ 453 (462)
Q Consensus 451 lL~ 453 (462)
.|+
T Consensus 283 ~L~ 285 (384)
T PF03148_consen 283 DLE 285 (384)
T ss_pred HHH
Confidence 876
No 148
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=30.05 E-value=7.7e+02 Score=27.06 Aligned_cols=48 Identities=19% Similarity=0.378 Sum_probs=31.7
Q ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 133 EAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 133 EaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
+.+-++.+++.+.+++++++.|+..+...-.+.. .|+++.+..+..-+
T Consensus 346 ~~~~a~~~~~~L~~~l~~~~~~~~~~~~~~~e~~----~L~Re~~~~~~~Y~ 393 (754)
T TIGR01005 346 QADAAQARESQLVSDVNQLKAASAQAGEQQVDLD----ALQRDAAAKRQLYE 393 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCcHhHHHHH----HHHHHHHHHHHHHH
Confidence 4566778888889999999999887755443322 45555555554444
No 149
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=30.02 E-value=5.8e+02 Score=25.59 Aligned_cols=97 Identities=18% Similarity=0.214 Sum_probs=50.5
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAE 135 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaE 135 (462)
|+.-++.=++.|...+.+.-.|...|...+..+-.++-.+.+. ...++.++.=.+..++. ...-|.. |.+
T Consensus 143 R~~ll~gl~~~L~~~~~~L~~D~~~L~~~~~~l~~~~~~l~~~-------~~~L~~e~~~Lk~~~~e-~~~~D~~--eL~ 212 (325)
T PF08317_consen 143 RMQLLEGLKEGLEENLELLQEDYAKLDKQLEQLDELLPKLRER-------KAELEEELENLKQLVEE-IESCDQE--ELE 212 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHhh-hhhcCHH--HHH
Confidence 6666666666777777777777777766666655555555544 34445555544444433 4444443 333
Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHhHHH
Q 012498 136 KAKEKEELMSQKFNEFQTRLEELSSEN 162 (462)
Q Consensus 136 kaKE~Ee~m~qk~~~~~~R~~E~~s~~ 162 (462)
.+|..=.....++..+...+.+++..+
T Consensus 213 ~lr~eL~~~~~~i~~~k~~l~el~~el 239 (325)
T PF08317_consen 213 ALRQELAEQKEEIEAKKKELAELQEEL 239 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444333333344443333333333333
No 150
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=29.93 E-value=8.8e+02 Score=27.66 Aligned_cols=61 Identities=16% Similarity=0.216 Sum_probs=38.7
Q ss_pred HHhhcC--CchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHH
Q 012498 37 CMQQAG--PSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLAD 97 (462)
Q Consensus 37 CMQQaG--pgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLad 97 (462)
|...+| |..|.-|..++......++.=|+.|..+....-.+...+...+.++=+.+..|..
T Consensus 493 iA~~~Glp~~ii~~A~~~~~~~~~~~~~li~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~ 555 (782)
T PRK00409 493 IAKRLGLPENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEE 555 (782)
T ss_pred HHHHhCcCHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456667 5566777788888887888877777775555544555555555555555555443
No 151
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=29.65 E-value=1.4e+02 Score=23.53 Aligned_cols=30 Identities=30% Similarity=0.395 Sum_probs=25.0
Q ss_pred hhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 58 AGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 58 a~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
..+..++.++++++.....+|..|+.|.+.
T Consensus 27 ~~~~~~~~~~~~~~~~l~~en~~L~~ei~~ 56 (85)
T TIGR02209 27 RQLNNELQKLQLEIDKLQKEWRDLQLEVAE 56 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 367788888888888888899999988764
No 152
>PF07321 YscO: Type III secretion protein YscO; InterPro: IPR009929 This family contains the bacterial type III secretion protein YscO, which is approximately 150 residues long. YscO has been shown to be required for high-level expression and secretion of the anti-host proteins V antigen and Yops in Yersinia pestis [].
Probab=29.55 E-value=3.5e+02 Score=25.15 Aligned_cols=49 Identities=20% Similarity=0.277 Sum_probs=42.2
Q ss_pred hHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH
Q 012498 50 TRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 50 TRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL 98 (462)
..++..+.+.|++.++...+++.++..-=...+.++.+|.|.+..++.|
T Consensus 76 v~~Lr~~e~~le~~~~~a~~~~~~e~~~l~~a~~~~~~a~r~~eKf~eL 124 (152)
T PF07321_consen 76 VASLREREAELEQQLAEAEEQLEQERQALEEARKQLQQARRQQEKFAEL 124 (152)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456778889999999999999999888888899999999999887766
No 153
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=29.40 E-value=7.5e+02 Score=26.72 Aligned_cols=76 Identities=18% Similarity=0.333 Sum_probs=41.6
Q ss_pred hhhhHHHHHhhhhhhhHHHHH---------hHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhH
Q 012498 116 FQGCMAAAFAERDNSVMEAEK---------AKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESF 186 (462)
Q Consensus 116 fQs~vA~AFAERD~slmEaEk---------aKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~ 186 (462)
|+--|...|++=|..|.+||. |+..-...-+.+..++.++......+.+...+. +++-...
T Consensus 73 ~~~i~~~~~~~ie~~l~~ae~~~~~~~f~~a~~~~~~~~~~l~~~e~~~~~i~~~l~~l~~~e----------~~nr~~v 142 (569)
T PRK04778 73 WDEIVTNSLPDIEEQLFEAEELNDKFRFRKAKHEINEIESLLDLIEEDIEQILEELQELLESE----------EKNREEV 142 (569)
T ss_pred HHHHHHhhhhhHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHH
Confidence 566678888888888888885 443333333344444444444333333333332 3333334
Q ss_pred HHHHHHHHHHhhhhh
Q 012498 187 KEVINKFYEIRQQSL 201 (462)
Q Consensus 187 ~kVI~KFyeiR~~~~ 201 (462)
..+-++|-++|..-+
T Consensus 143 ~~l~~~y~~~rk~ll 157 (569)
T PRK04778 143 EQLKDLYRELRKSLL 157 (569)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456677778887544
No 154
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=29.36 E-value=1e+02 Score=27.03 Aligned_cols=38 Identities=26% Similarity=0.239 Sum_probs=29.6
Q ss_pred HHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHH
Q 012498 54 FQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRI 91 (462)
Q Consensus 54 ~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRi 91 (462)
.+.+..|-.+|+.||+.+....-||..|+-|....++.
T Consensus 14 e~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~ 51 (107)
T PF06156_consen 14 EQQLGQLLEELEELKKQLQELLEENARLRIENEHLRER 51 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666778899999999999999999887765543
No 155
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=28.49 E-value=4.7e+02 Score=24.02 Aligned_cols=73 Identities=27% Similarity=0.354 Sum_probs=39.6
Q ss_pred HHHHHHHHHHHHHHhHHHHHhhhhh-----------hHHHHH-HhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHH
Q 012498 233 YISALEDELEKTRSSVENLQSKLRM-----------GLEIEN-HLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVH 300 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~LQskLR~-----------GLeIen-hLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~ 300 (462)
-++.|..+.+.|+..+++|.++|+- =+..+. ......+.++.+..-.+.=|...|++|+..--..|..
T Consensus 74 ~~~~lr~~~e~L~~eie~l~~~L~~ei~~l~a~~klD~n~eK~~~r~e~~~~~~ki~e~~~ki~~ei~~lr~~iE~~K~~ 153 (177)
T PF07798_consen 74 EFAELRSENEKLQREIEKLRQELREEINKLRAEVKLDLNLEKGRIREEQAKQELKIQELNNKIDTEIANLRTEIESLKWD 153 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666666666655554 222221 2333344444444445555666677777766666666
Q ss_pred HHHhh
Q 012498 301 VVNSL 305 (462)
Q Consensus 301 Im~lL 305 (462)
+++.+
T Consensus 154 ~lr~~ 158 (177)
T PF07798_consen 154 TLRWL 158 (177)
T ss_pred HHHHH
Confidence 66543
No 156
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=28.45 E-value=1.6e+02 Score=27.01 Aligned_cols=59 Identities=17% Similarity=0.350 Sum_probs=32.5
Q ss_pred hhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHH----HHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 125 AERDNSVMEAEKAKEKEELMSQKFNEFQTRLE----ELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 125 AERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~----E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
.+||+.|-.-.+...--+.+-+++.+++.... +|+..+...+ ++.++..-|......+.
T Consensus 37 ~~~d~~i~~Lk~~~~d~eeLk~~i~~lq~~~~~~~~~~e~~l~~~~-~~~ai~~al~~akakn~ 99 (155)
T PF06810_consen 37 KEADKQIKDLKKSAKDNEELKKQIEELQAKNKTAKEEYEAKLAQMK-KDSAIKSALKGAKAKNP 99 (155)
T ss_pred HHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHcCCCCH
Confidence 34444444443322233344466666666555 6666666555 56677666666655554
No 157
>PHA02047 phage lambda Rz1-like protein
Probab=28.31 E-value=2.7e+02 Score=25.13 Aligned_cols=57 Identities=18% Similarity=0.345 Sum_probs=44.0
Q ss_pred HHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhhcchhh
Q 012498 233 YISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRSHI 312 (462)
Q Consensus 233 yisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s~i 312 (462)
|.-.-.++-+.+.++++.++-++ +|+++.|..|+.| -.++|.+|.+-|+...+|-
T Consensus 28 ~~g~~h~~a~~la~qLE~a~~r~-------~~~Q~~V~~l~~k------------------ae~~t~Ei~~aL~~n~~Wa 82 (101)
T PHA02047 28 ALGIAHEEAKRQTARLEALEVRY-------ATLQRHVQAVEAR------------------TNTQRQEVDRALDQNRPWA 82 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHH------------------HHHHHHHHHHHHHhCCCcc
Confidence 33344667788888888776554 3788899888887 4578999999999999997
Q ss_pred hh
Q 012498 313 KS 314 (462)
Q Consensus 313 ~s 314 (462)
++
T Consensus 83 D~ 84 (101)
T PHA02047 83 DR 84 (101)
T ss_pred cC
Confidence 65
No 158
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=28.25 E-value=5.6e+02 Score=29.10 Aligned_cols=94 Identities=24% Similarity=0.222 Sum_probs=62.7
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHH---HHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQ---LCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEq---LCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
-.|++..|+++||.|-+.||.|+-+ -|+.--.-+ -.-|++- ..=++|+|.|---|++.-..|+-|..-||-
T Consensus 539 ~~e~~r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~---~~lr~~~---~e~~~~~e~L~~aL~amqdk~~~LE~sLsa 612 (697)
T PF09726_consen 539 CAESCRQRRRQLESELKKLRRELKQKEEQIRELESEL---QELRKYE---KESEKDTEVLMSALSAMQDKNQHLENSLSA 612 (697)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH---hhhhhhHHHHHHHHHHHHHHHHHHHHhhhH
Confidence 4678999999999999999988743 343211100 0012211 224678999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHH
Q 012498 88 AYRIKGQLADLHAAEVIKNMEAE 110 (462)
Q Consensus 88 AYRiK~qLadLh~ae~~Kn~e~E 110 (462)
-=|||--|=--.|.+.-+-..++
T Consensus 613 EtriKldLfsaLg~akrq~ei~~ 635 (697)
T PF09726_consen 613 ETRIKLDLFSALGDAKRQLEIAQ 635 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999755544444443333333
No 159
>PF12711 Kinesin-relat_1: Kinesin motor; InterPro: IPR024658 Kinesin [, , ] is a microtubule-associated force-producing protein that may play a role in organelle transport. The kinesin motor activity is directed toward the microtubule's plus end. Kinesin is an oligomeric complex composed of two heavy chains and two light chains. The maintenance of the quaternary structure does not require interchain disulphide bonds. The heavy chain is composed of three structural domains: a large globular N-terminal domain which is responsible for the motor activity of kinesin (it is known to hydrolyse ATP, to bind and move on microtubules), a central alpha-helical coiled coil domain that mediates the heavy chain dimerisation; and a small globular C-terminal domain which interacts with other proteins (such as the kinesin light chains), vesicles and membranous organelles. A number of proteins have been recently found that contain a domain similar to that of the kinesin 'motor' domain [, ]: Drosophila melanogaster claret segregational protein (ncd). Ncd is required for normal chromosomal segregation in meiosis, in females, and in early mitotic divisions of the embryo. The ncd motor activity is directed toward the microtubule's minus end. Homo sapiens CENP-E []. CENP-E is a protein that associates with kinetochores during chromosome congression, relocates to the spindle midzone at anaphase, and is quantitatively discarded at the end of the cell division. CENP-E is probably an important motor molecule in chromosome movement and/or spindle elongation. H. sapiens mitotic kinesin-like protein-1 (MKLP-1), a motor protein whose activity is directed toward the microtubule's plus end. Saccharomyces cerevisiae KAR3 protein, which is essential for nuclear fusion during mating. KAR3 may mediate microtubule sliding during nuclear fusion and possibly mitosis. S. cerevisiae CIN8 and KIP1 proteins which are required for the assembly of the mitotic spindle. Both proteins seem to interact with spindle microtubules to produce an outwardly directed force acting upon the poles. Emericella nidulans (Aspergillus nidulans) bimC, which plays an important role in nuclear division. A. nidulans klpA. Caenorhabditis elegans unc-104, which may be required for the transport of substances needed for neuronal cell differentiation. C. elegans osm-3. Xenopus laevis Eg5, which may be involved in mitosis. Arabidopsis thaliana KatA, KatB and katC. Chlamydomonas reinhardtii FLA10/KHP1 and KLP1. Both proteins seem to play a role in the rotation or twisting of the microtubules of the flagella. C. elegans hypothetical protein T09A5.2. Kinesin-like proteins KLP2 (or KIF15) also contain a kinesin 'motor' domain. They are involved in mitotic spindle assembly, playing a role in positioning spindle poles during mitosis, specifically at prometaphase []. This entry represents a domain of unknown function found in this type of kinesin-like proteins.
Probab=28.03 E-value=87 Score=27.05 Aligned_cols=45 Identities=27% Similarity=0.392 Sum_probs=31.1
Q ss_pred hcCCchHHHhhHHHHHhhhhhHHHHHHHHHHH------HHhhhhhcchHHHH
Q 012498 40 QAGPSYLAVATRMHFQRTAGLEQEIEILKQKI------AACARENSNLQEEL 85 (462)
Q Consensus 40 QaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl------~~c~ren~nLQEEL 85 (462)
..-.|-++.-+.+.-.. .+|..||+.|+.|+ .-+.-||..|++|+
T Consensus 10 ~~~~g~l~~~~~~~~e~-~~L~eEI~~Lr~qve~nPevtr~A~EN~rL~ee~ 60 (86)
T PF12711_consen 10 KLLDGKLPSESYLEEEN-EALKEEIQLLREQVEHNPEVTRFAMENIRLREEL 60 (86)
T ss_pred HHhcCCCCccchhHHHH-HHHHHHHHHHHHHHHhCHHHHHHHHHHHHHHHHH
Confidence 33344444556666666 88999999999765 45666888877776
No 160
>PF13094 CENP-Q: CENP-Q, a CENPA-CAD centromere complex subunit
Probab=27.66 E-value=3.2e+02 Score=24.45 Aligned_cols=34 Identities=26% Similarity=0.269 Sum_probs=28.0
Q ss_pred ccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhh
Q 012498 224 SFNDTSTSKYISALEDELEKTRSSVENLQSKLRM 257 (462)
Q Consensus 224 sfn~tstskyisaLEeE~e~lr~~i~~LQskLR~ 257 (462)
+|+=.+..+...+||..+.....+|+.||..++-
T Consensus 19 ~~~~e~ll~~~~~LE~qL~~~~~~l~lLq~e~~~ 52 (160)
T PF13094_consen 19 SFDYEQLLDRKRALERQLAANLHQLELLQEEIEK 52 (160)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444567889999999999999999999987764
No 161
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=27.45 E-value=20 Score=39.07 Aligned_cols=105 Identities=30% Similarity=0.329 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHH------------Hh----hHHHHHhhhhhHHHHHHHHHHHHHhhhh
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCMQQAGPSYLA------------VA----TRMHFQRTAGLEQEIEILKQKIAACARE 77 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~------------vA----TRM~~qRta~LEQeiE~Lkkkl~~c~re 77 (462)
++......|..|||.|+--++.|-.-+++++.++ .+ +.=...|..-|+.|-..|+.+.++...+
T Consensus 402 ~l~~eke~l~~e~~~L~e~~eeL~~~~~~~~~l~~~~~~~~~~~~~l~~El~~~~l~erl~rLe~ENk~Lk~~~e~~~~e 481 (713)
T PF05622_consen 402 ALEEEKERLQEERDSLRETNEELECSQAQQEQLSQSGEESSSSGDNLSAELNPAELRERLLRLEHENKRLKEKQEESEEE 481 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccccccchhhhccchHHHHHHHHHHHHHHHHHHHhccchhh
Confidence 3334444555577777777777654333211111 11 1113456677888888887777666443
Q ss_pred h-cchHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhh
Q 012498 78 N-SNLQEELSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQG 118 (462)
Q Consensus 78 n-~nLQEELsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs 118 (462)
. .-|+.+|.+|-+.+..|-.-+...-.+..+++.|+.=-|.
T Consensus 482 ~~~~L~~~Leda~~~~~~Le~~~~~~~~~~~~lq~qle~lq~ 523 (713)
T PF05622_consen 482 KLEELQSQLEDANRRKEKLEEENREANEKILELQSQLEELQK 523 (713)
T ss_dssp ------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3 4688888888888888877666655566666666654443
No 162
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=27.43 E-value=2.2e+02 Score=29.84 Aligned_cols=22 Identities=36% Similarity=0.625 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQ 35 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEq 35 (462)
.+..++..|.++|+++.|.|-.
T Consensus 39 ~l~~~~~~lr~~rn~~sk~i~~ 60 (425)
T PRK05431 39 ELQTELEELQAERNALSKEIGQ 60 (425)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4566777788888888888865
No 163
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=27.43 E-value=1.4e+02 Score=22.76 Aligned_cols=37 Identities=22% Similarity=0.407 Sum_probs=27.8
Q ss_pred hhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH
Q 012498 59 GLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL 98 (462)
Q Consensus 59 ~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL 98 (462)
.||.|-+.||..-.....+|..|+.|-.. +++++..|
T Consensus 2 QlE~Dy~~LK~~yd~Lk~~~~~L~~E~~~---L~aev~~L 38 (45)
T PF02183_consen 2 QLERDYDALKASYDSLKAEYDSLKKENEK---LRAEVQEL 38 (45)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH
Confidence 37888888888888888888888887654 56666555
No 164
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=27.18 E-value=3.7e+02 Score=22.37 Aligned_cols=41 Identities=29% Similarity=0.370 Sum_probs=28.0
Q ss_pred HhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 48 VATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 48 vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
=|....-.|...|+..++++.+.+......=..++..+.+.
T Consensus 87 eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~~~ 127 (129)
T cd00890 87 EAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEELQQL 127 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 36667777777788878888777776666655555555543
No 165
>COG1711 DNA replication initiation complex subunit, GINS family [Replication, recombination, and repair]
Probab=27.12 E-value=1.9e+02 Score=28.91 Aligned_cols=82 Identities=22% Similarity=0.321 Sum_probs=49.3
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhhcch
Q 012498 231 SKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGRS 310 (462)
Q Consensus 231 skyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s 310 (462)
-+||++||.+.+.-.+. .--|+.+-+- .|+ -++..+|.+=+ .-+.|++.-.+.++.-.- |-+|..+|+.
T Consensus 31 ~~~I~eLe~~~~~~~~~-~D~e~~~~~~-~~e-t~~~~~r~ifq--rR~~Kiv~~A~~~~~~~~------~~~Lt~eEk~ 99 (223)
T COG1711 31 RSFIKELEDEAGRAEEA-RDIEKYLLTD-RIE-TAKSDARSIFQ--RRYGKIVSRAIYDVPGET------ISNLTPEEKE 99 (223)
T ss_pred HHHHHHHHHHhhccccc-cCHHHHHHHH-HHH-HHHHHHHHHHH--HHHHHHHHHHHHhccccc------hhcCCHHHHH
Confidence 35899998887665544 2222222222 111 12333333222 236788888777765432 8889999999
Q ss_pred hhhhhHHHHHhhh
Q 012498 311 HIKSISDVIEEKT 323 (462)
Q Consensus 311 ~i~s~v~~ieekl 323 (462)
.+.++++.|++--
T Consensus 100 ly~~l~~~I~~e~ 112 (223)
T COG1711 100 LYEDLVNFIEDER 112 (223)
T ss_pred HHHHHHHHHhhch
Confidence 9999999987644
No 166
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=27.12 E-value=73 Score=25.83 Aligned_cols=37 Identities=14% Similarity=0.240 Sum_probs=28.4
Q ss_pred hhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHH
Q 012498 214 CLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQ 252 (462)
Q Consensus 214 ~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQ 252 (462)
.++...-+.||+.+ ...||+.|+.|...+++.+++=+
T Consensus 12 ~~ig~dLs~lSv~E--L~~RIa~L~aEI~R~~~~~~~K~ 48 (59)
T PF06698_consen 12 HEIGEDLSLLSVEE--LEERIALLEAEIARLEAAIAKKS 48 (59)
T ss_pred cccCCCchhcCHHH--HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566667788774 46699999999999998877644
No 167
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=27.11 E-value=1.2e+03 Score=28.24 Aligned_cols=56 Identities=18% Similarity=0.208 Sum_probs=31.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhh
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACA 75 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ 75 (462)
.+.+...|.+...+-.++++.|+. ..+..|.|.+-.+. ..|||.+......|...-
T Consensus 67 ~~~~~~~i~~ap~~~~~~~~~l~~--~~~~~~~~~~~~s~------~~Leq~l~~~~~~L~~~q 122 (1109)
T PRK10929 67 AKQYQQVIDNFPKLSAELRQQLNN--ERDEPRSVPPNMST------DALEQEILQVSSQLLEKS 122 (1109)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHh--hhcccccccccCCH------HHHHHHHHHHHHHHHHHH
Confidence 455666666666677788888886 45555655333222 455555554444444333
No 168
>PF01813 ATP-synt_D: ATP synthase subunit D ; InterPro: IPR002699 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. The V-ATPases (or V1V0-ATPase) and A-ATPases (or A1A0-ATPase) are each composed of two linked complexes: the V1 or A1 complex contains the catalytic core that hydrolyses/synthesizes ATP, and the V0 or A0 complex that forms the membrane-spanning pore. The V- and A-ATPases both contain rotary motors, one that drives proton translocation across the membrane and one that drives ATP synthesis/hydrolysis [, , ]. The V- and A-ATPases more closely resemble one another in subunit structure than they do the F-ATPases, although the function of A-ATPases is closer to that of F-ATPases. This entry represents the D subunit found in V1 and A1 complexes of V- and A-ATPases, respectively. Subunit D appears to be located in the central stalk, whereas subunits E and G form part of the peripheral stalk connecting V1 and V0. This subunit is the most likely homologue to the gamma subunit of the F1 complex in F-ATPases, which undergoes rotation during ATP hydrolysis and serves an essential function in rotary catalysis [, ]. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0042626 ATPase activity, coupled to transmembrane movement of substances, 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0033178 proton-transporting two-sector ATPase complex, catalytic domain; PDB: 3A5C_G 3A5D_G 3J0J_G 3AON_A.
Probab=26.97 E-value=4.3e+02 Score=24.43 Aligned_cols=36 Identities=22% Similarity=0.410 Sum_probs=26.2
Q ss_pred HHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHH
Q 012498 123 AFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENI 163 (462)
Q Consensus 123 AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~ 163 (462)
.+|.|=+.+++ .|-+++..+|..+-..+.++...+.
T Consensus 11 ~~a~rg~~lLk-----~Krd~L~~e~~~~~~~~~~~r~~~~ 46 (196)
T PF01813_consen 11 KLAKRGHKLLK-----KKRDALIREFRKLIKEAEELREELE 46 (196)
T ss_dssp HHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677777887 7788888888888777777655553
No 169
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms]
Probab=26.95 E-value=1.2e+03 Score=28.23 Aligned_cols=142 Identities=22% Similarity=0.285 Sum_probs=88.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHh----hhhhHHHHHHHHHHHHHhh---hhhcchHHH
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQR----TAGLEQEIEILKQKIAACA---RENSNLQEE 84 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qR----ta~LEQeiE~Lkkkl~~c~---ren~nLQEE 84 (462)
.|++-....+||++||.+--|+-.| |+- --.-|-..|| .|.+++.|+-||.++.+.+ ++......|
T Consensus 346 ~egfddk~~eLEKkrd~al~dvr~i--~e~-----k~nve~elqsL~~l~aerqeQidelKn~if~~e~~~~dhe~~kne 418 (1265)
T KOG0976|consen 346 AEGFDDKLNELEKKRDMALMDVRSI--QEK-----KENVEEELQSLLELQAERQEQIDELKNHIFRLEQGKKDHEAAKNE 418 (1265)
T ss_pred hcchhHHHHHHHHHHHHHHHhHHHH--HHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccchhHHHHHH
Confidence 4667777889999999999988765 331 1233444444 4677788999999887764 344445556
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHH
Q 012498 85 LSEAYRIKGQLADLHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIE 164 (462)
Q Consensus 85 LsEAYRiK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~ 164 (462)
|++|-- =+|+.|++++ -+++|.--||.-- |-++.+ ++- .+.+.++.--|++-+..+...
T Consensus 419 L~~a~e----kld~mgthl~---mad~Q~s~fk~Lk------------e~aegs-rrr-aIeQcnemv~rir~l~~sle~ 477 (1265)
T KOG0976|consen 419 LQEALE----KLDLMGTHLS---MADYQLSNFKVLK------------EHAEGS-RRR-AIEQCNEMVDRIRALMDSLEK 477 (1265)
T ss_pred HHHHHH----HHHHHhHHHH---HHHHHHhhHHHHH------------Hhhhhh-Hhh-HHHHHHHHHHHHHHHhhChhh
Confidence 666642 2466677665 4688888888643 333333 222 334567777888888777766
Q ss_pred HHhhhHhHhhhHHHHHHhhHh
Q 012498 165 LKKQNATLRFDLEKQEELNES 185 (462)
Q Consensus 165 qk~~n~aLQ~dl~~~~eq~e~ 185 (462)
|+..- -++.+++..|+.
T Consensus 478 qrKVe----qe~emlKaen~r 494 (1265)
T KOG0976|consen 478 QRKVE----QEYEMLKAENER 494 (1265)
T ss_pred hcchH----HHHHHHHHHHHH
Confidence 66433 334444444443
No 170
>cd07628 BAR_Atg24p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Atg24p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Atg24p is involved in membrane fusion events at the vacuolar surface during pexophagy. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=26.63 E-value=5.2e+02 Score=23.98 Aligned_cols=79 Identities=29% Similarity=0.430 Sum_probs=46.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHH-
Q 012498 12 SEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYR- 90 (462)
Q Consensus 12 ~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYR- 90 (462)
-.....-|+.+=+.|+-.+-|-|.|+ -|+ |+.+++.-+.....+. ..+-.|+..-=+
T Consensus 95 ~~~y~~s~k~~lk~R~~kq~d~e~l~------e~l-------------l~~~ve~a~~~~e~f~---~~~~~E~~rF~~~ 152 (185)
T cd07628 95 LLHYILSLKNLIKLRDQKQLDYEELS------DYL-------------LTDEVENAKETSDAFN---KEVLKEYPNFERI 152 (185)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHH------HHH-------------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 34445556666667777777777777 333 6666776666555443 233333332222
Q ss_pred ----HHHHHHHHHHHHHHhhHHHHHHHHHhhhhHHH
Q 012498 91 ----IKGQLADLHAAEVIKNMEAEKQVKFFQGCMAA 122 (462)
Q Consensus 91 ----iK~qLadLh~ae~~Kn~e~EkqvkFfQs~vA~ 122 (462)
+|..|.++ +..|+.||++++..
T Consensus 153 k~~elk~~l~~~----------a~~qi~~y~~~~~~ 178 (185)
T cd07628 153 KKQEIKDSLGAL----------ADGHIDFYQGLVED 178 (185)
T ss_pred HHHHHHHHHHHH----------HHHHHHHHHHHHHH
Confidence 33444444 67899999998653
No 171
>KOG1656 consensus Protein involved in glucose derepression and pre-vacuolar endosome protein sorting [Intracellular trafficking, secretion, and vesicular transport]
Probab=26.42 E-value=4.5e+02 Score=26.48 Aligned_cols=114 Identities=18% Similarity=0.212 Sum_probs=71.8
Q ss_pred ccccCC------cchHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHH----Hh----------HHhHHHHHHhhhhhHH
Q 012498 222 MWSFND------TSTSKYISALEDELEKTRSSVENLQSKLRMGLEIEN----HL----------KKSVRELEKKIIHSDK 281 (462)
Q Consensus 222 ~Wsfn~------tstskyisaLEeE~e~lr~~i~~LQskLR~GLeIen----hL----------kk~vr~Lekkqi~~dk 281 (462)
+|-|+| ++....|--|.+-.+.|-.+=.-|-.+ ++=|+++ |. .|+-+..|+..+++|+
T Consensus 5 ~~~FG~~k~~~~~t~~eaI~kLrEteemL~KKqe~Le~k--i~~e~e~~A~k~~tkNKR~AlqaLkrKK~~E~qL~qidG 82 (221)
T KOG1656|consen 5 SRLFGGMKQEAKPTPQEAIQKLRETEEMLEKKQEFLEKK--IEQEVENNARKYGTKNKRMALQALKRKKRYEKQLAQIDG 82 (221)
T ss_pred HHHhCcccccCCCChHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhh
Confidence 567775 345678888888888777666666543 3445555 21 4555678888888998
Q ss_pred HHHH---HHHHHHHhhhHHHHHHHHhhhhcchhhhhh-----HHHHHhhh-cccc-----ccccccccCCCcc
Q 012498 282 FISN---AIAELRLCHSQLRVHVVNSLEEGRSHIKSI-----SDVIEEKT-QHCD-----DVIRGQNTGTYQR 340 (462)
Q Consensus 282 ~i~n---gi~~lq~~h~~~R~~Im~lL~ee~s~i~s~-----v~~ieekl-~~~~-----n~~~E~n~~~pq~ 340 (462)
+..+ -...|.+-+. -++++.-+..+.+.+|++ ||.|.+-. .|.. .-|++-+ ++|.|
T Consensus 83 ~l~tie~Qr~alEnA~~--n~Evl~~m~~~A~AmK~~h~~mDiDkVdd~MdeI~eQqe~a~eIseAi-S~Pvg 152 (221)
T KOG1656|consen 83 TLSTIEFQREALENANT--NTEVLDAMGSAAKAMKAAHKNMDIDKVDDLMDEIAEQQEVAEEISEAI-SAPVG 152 (221)
T ss_pred HHHHHHHHHHHHHcccc--cHHHHHHHHHHHHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHHH-hCccc
Confidence 8643 2334444443 368888899999888876 34444433 3322 3345556 88876
No 172
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=26.34 E-value=4.5e+02 Score=23.09 Aligned_cols=75 Identities=19% Similarity=0.230 Sum_probs=47.7
Q ss_pred HHHHHHHhhHHHHHHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhh
Q 012498 98 LHAAEVIKNMEAEKQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFD 175 (462)
Q Consensus 98 Lh~ae~~Kn~e~EkqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~d 175 (462)
+|+...-.-..+..++.=++.-++..=+++|.+--+.+..+ ......=..++..+.++++.+.+...+|.-|-.-
T Consensus 53 ~Ha~~~~~L~~lr~e~~~~~~~~~~l~~~~~~a~~~l~~~e---~sw~~qk~~le~e~~~~~~r~~dL~~QN~lLh~Q 127 (132)
T PF07926_consen 53 KHAEDIKELQQLREELQELQQEINELKAEAESAKAELEESE---ASWEEQKEQLEKELSELEQRIEDLNEQNKLLHDQ 127 (132)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46655555666777777788888888888887766654333 3333334455666666666666667777665433
No 173
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=26.34 E-value=4.3e+02 Score=23.59 Aligned_cols=60 Identities=12% Similarity=0.210 Sum_probs=49.1
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHHHH-hhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhh
Q 012498 142 ELMSQKFNEFQTRLEELSSENIELK-KQNATLRFDLEKQEELNESFKEVINKFYEIRQQSL 201 (462)
Q Consensus 142 e~m~qk~~~~~~R~~E~~s~~~~qk-~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~ 201 (462)
...++++.+++..+.++++.+.++- +++..++...+.+....+-+-..|.-|++-.-...
T Consensus 6 ~~al~ki~~l~~~~~~i~~~~~~~I~~i~~~~~~~~~~l~~~i~~l~~~l~~y~e~~r~e~ 66 (149)
T PF07352_consen 6 DWALRKIAELQREIARIEAEANDEIARIKEWYEAEIAPLQNRIEYLEGLLQAYAEANRDEL 66 (149)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCTHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHhc
Confidence 4456899999999999999887665 77888888889999999989999999988765443
No 174
>smart00338 BRLZ basic region leucin zipper.
Probab=26.23 E-value=2.8e+02 Score=21.37 Aligned_cols=38 Identities=34% Similarity=0.460 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhh
Q 012498 146 QKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELN 183 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~ 183 (462)
+.+.+++.++..+++...+.......|+.++..++.++
T Consensus 26 ~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~~~ 63 (65)
T smart00338 26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 35556666666666666555555555555555555443
No 175
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms]
Probab=26.19 E-value=1.3e+03 Score=28.56 Aligned_cols=242 Identities=22% Similarity=0.243 Sum_probs=108.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-hhcCCchHHHhhHHHHHhh---hhhHHHHHHHHHHHHHhhhhhcchHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQLCM-QQAGPSYLAVATRMHFQRT---AGLEQEIEILKQKIAACARENSNLQEELSEAY 89 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEqLCM-QQaGpgyl~vATRM~~qRt---a~LEQeiE~Lkkkl~~c~ren~nLQEELsEAY 89 (462)
-|..-|.++.-++.+|++ +|.-. |+ -.+.+++++.+=. ..|+-++..++..|....+.|.|++..+...-
T Consensus 469 eL~e~i~~lk~~~~el~~--~q~~l~q~----~~ke~~ek~~~~~~~~~~l~~~~~~~~eele~~q~~~~~~~~~~~kv~ 542 (1317)
T KOG0612|consen 469 ELEETIEKLKSEESELQR--EQKALLQH----EQKEVEEKLSEEEAKKRKLEALVRQLEEELEDAQKKNDNAADSLEKVN 542 (1317)
T ss_pred HHHHHHHHHHHHHHHHHH--HHHHHHHH----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Confidence 344445555556666664 22111 11 1235555555422 24455555666666666667777766666655
Q ss_pred HHHHHHH---HH-------------HHHHHHhhHHHHHH--------HHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHH
Q 012498 90 RIKGQLA---DL-------------HAAEVIKNMEAEKQ--------VKFFQGCMAAAFAERDNSVMEAEKAKEKEELMS 145 (462)
Q Consensus 90 RiK~qLa---dL-------------h~ae~~Kn~e~Ekq--------vkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~ 145 (462)
-.+.+|. +. |.+++++-++-+.. ..--|.+--.---++-.-..++|+.++..-..+
T Consensus 543 ~~rk~le~~~~d~~~e~~~~~kl~~~~~e~~~~iq~~~e~~~~~~d~l~~le~~k~~ls~~~~~~~~~~e~~~~~~~~~~ 622 (1317)
T KOG0612|consen 543 SLRKQLEEAELDMRAESEDAGKLRKHSKELSKQIQQELEENRDLEDKLSLLEESKSKLSKENKKLRSELEKERRQRTEIS 622 (1317)
T ss_pred HHHHHHHHhhhhhhhhHHHHhhHhhhhhhhhHHHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5555554 11 33344333322221 111111111111122222334455555555555
Q ss_pred HHHHHHHHHHHHHhHHHHH----------HHhhhHhHhhhHHH--HHHhhHhHHHHHHHHHHHhhhhhhhhcccccch-h
Q 012498 146 QKFNEFQTRLEELSSENIE----------LKKQNATLRFDLEK--QEELNESFKEVINKFYEIRQQSLEVLETSWEDK-C 212 (462)
Q Consensus 146 qk~~~~~~R~~E~~s~~~~----------qk~~n~aLQ~dl~~--~~eq~e~~~kVI~KFyeiR~~~~e~~~~s~~~K-c 212 (462)
-.+.+++.++..+++.... .++.|..-..+.++ ++.+.+--++++..+++ +-..+|.-+-...+ |
T Consensus 623 e~~~~l~~~i~sL~~~~~~~~~~l~k~~el~r~~~e~~~~~ek~~~e~~~e~~lk~~q~~~e--q~~~E~~~~~L~~~e~ 700 (1317)
T KOG0612|consen 623 EIIAELKEEISSLEETLKAGKKELLKVEELKRENQERISDSEKEALEIKLERKLKMLQNELE--QENAEHHRLRLQDKEA 700 (1317)
T ss_pred HHHHHHHhHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhHHH
Confidence 5566666666655554432 22222222233333 44445544555555443 22223311111111 1
Q ss_pred hhhccccccccccCCcchHHHHHH----HHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHH
Q 012498 213 ACLLLDSAEMWSFNDTSTSKYISA----LEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELE 273 (462)
Q Consensus 213 s~LL~Ds~~~Wsfn~tstskyisa----LEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Le 273 (462)
.+ -....|--.+-++--|..+ ++.+++.|++.. +|++ +=.|||.++.+.+.
T Consensus 701 ~~---~e~~~~lseek~ar~k~e~~~~~i~~e~e~L~~d~--~~~~-----~~~~~l~r~~~~~~ 755 (1317)
T KOG0612|consen 701 QM---KEIESKLSEEKSAREKAENLLLEIEAELEYLSNDY--KQSQ-----EKLNELRRSKDQLI 755 (1317)
T ss_pred HH---HHHHHHhcccccHHHHHHHHHHHHHHHHHHHhhhh--hhhc-----cchhhhhhhHHHHH
Confidence 11 1224455556566666666 666666666532 3333 44566655444443
No 176
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=26.17 E-value=80 Score=24.03 Aligned_cols=22 Identities=41% Similarity=0.604 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHhHHHHHhhhh
Q 012498 235 SALEDELEKTRSSVENLQSKLR 256 (462)
Q Consensus 235 saLEeE~e~lr~~i~~LQskLR 256 (462)
.+|..|++.|++.|..|..++.
T Consensus 22 ~~L~~E~~~L~aev~~L~~kl~ 43 (45)
T PF02183_consen 22 DSLKKENEKLRAEVQELKEKLQ 43 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 5788888888888888887765
No 177
>KOG0483 consensus Transcription factor HEX, contains HOX and HALZ domains [Transcription]
Probab=26.01 E-value=80 Score=30.54 Aligned_cols=32 Identities=38% Similarity=0.551 Sum_probs=29.5
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEELSE 87 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEELsE 87 (462)
++..||.|-+.||..+....++|.-||.|..+
T Consensus 106 K~kqlE~d~~~Lk~~~~~l~~~~~~Lq~e~~e 137 (198)
T KOG0483|consen 106 KTKQLEKDYESLKRQLESLRSENDRLQSEVQE 137 (198)
T ss_pred cchhhhhhHHHHHHHHHHHhhhhhHHHHHHHH
Confidence 57899999999999999999999999998765
No 178
>TIGR00309 V_ATPase_subD H(+)-transporting ATP synthase, vacuolar type, subunit D. Although this ATPase can run backwards, using a proton gradient to synthesize ATP, the primary biological role is to acidify some compartment, such as yeast vacuole (a lysosomal homolog) or the interior of a prokaryote.
Probab=25.94 E-value=5.7e+02 Score=24.19 Aligned_cols=54 Identities=26% Similarity=0.355 Sum_probs=33.2
Q ss_pred cccCCcc--hHHHHHHHHHHHHHHHHhHHHHHhhhhh-hHHHHHHhHHhHHHHHHhhhh
Q 012498 223 WSFNDTS--TSKYISALEDELEKTRSSVENLQSKLRM-GLEIENHLKKSVRELEKKIIH 278 (462)
Q Consensus 223 Wsfn~ts--tskyisaLEeE~e~lr~~i~~LQskLR~-GLeIenhLkk~vr~Lekkqi~ 278 (462)
+++.+|+ +...+.++++-++.+ -.++.+++.++. +-||. --+++|++||+..|=
T Consensus 119 y~l~~t~~~~d~a~~~~~~~l~~l-i~lA~~e~~~~~L~~eI~-~T~RRVNALE~vvIP 175 (209)
T TIGR00309 119 YGLLFTSYKVDEAAEIYEEAVELI-VELAEIETTIRLLAEEIE-ITKRRVNALEHVIIP 175 (209)
T ss_pred cCcccCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhh
Confidence 7776554 556677776655443 345555555544 33333 348999999998753
No 179
>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning]
Probab=25.88 E-value=1.3e+03 Score=28.27 Aligned_cols=63 Identities=11% Similarity=0.071 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHhhhhc------------chhhhhhHHHHHhhhccccccccccccCCCccccc
Q 012498 280 DKFISNAIAELRLCHSQLRVHVVNSLEEG------------RSHIKSISDVIEEKTQHCDDVIRGQNTGTYQRETK 343 (462)
Q Consensus 280 dk~i~ngi~~lq~~h~~~R~~Im~lL~ee------------~s~i~s~v~~ieekl~~~~n~~~E~n~~~pq~e~~ 343 (462)
+......|..|+..++-.-.+|+.+-.-- ..+.++||-.-+..-.-||+.+-|+- .+|.-=.|
T Consensus 487 ~~~~~eave~lKr~fPgv~GrviDLc~pt~kkyeiAvt~~Lgk~~daIiVdte~ta~~CI~ylKeqr-~~~~TFlP 561 (1141)
T KOG0018|consen 487 RSRKQEAVEALKRLFPGVYGRVIDLCQPTQKKYEIAVTVVLGKNMDAIIVDTEATARDCIQYLKEQR-LEPMTFLP 561 (1141)
T ss_pred HHHHHHHHHHHHHhCCCccchhhhcccccHHHHHHHHHHHHhcccceEEeccHHHHHHHHHHHHHhc-cCCccccc
Confidence 34555667777777766666666555443 23444444333443366666666665 55544444
No 180
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription]
Probab=25.83 E-value=1.2e+03 Score=27.73 Aligned_cols=71 Identities=25% Similarity=0.267 Sum_probs=54.9
Q ss_pred HHhhhhHHHHHhhh--hhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 114 KFFQGCMAAAFAER--DNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 114 kFfQs~vA~AFAER--D~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
+..++..|.|.-+- ||+..|+-+.|+.+-+++---.+|.+|+.+++...--.-+-.|||.++...+++...
T Consensus 368 qll~~e~~ka~lee~~~n~~~e~~~~k~~~s~~ssl~~e~~QRva~lEkKvqa~~kERDalr~e~kslk~ela 440 (961)
T KOG4673|consen 368 QLLADEIAKAMLEEEQLNSVTEDLKRKSNESEVSSLREEYHQRVATLEKKVQALTKERDALRREQKSLKKELA 440 (961)
T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccchHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 34455556666555 899999999999999999999999999999988776655667888888776655443
No 181
>PF12711 Kinesin-relat_1: Kinesin motor; InterPro: IPR024658 Kinesin [, , ] is a microtubule-associated force-producing protein that may play a role in organelle transport. The kinesin motor activity is directed toward the microtubule's plus end. Kinesin is an oligomeric complex composed of two heavy chains and two light chains. The maintenance of the quaternary structure does not require interchain disulphide bonds. The heavy chain is composed of three structural domains: a large globular N-terminal domain which is responsible for the motor activity of kinesin (it is known to hydrolyse ATP, to bind and move on microtubules), a central alpha-helical coiled coil domain that mediates the heavy chain dimerisation; and a small globular C-terminal domain which interacts with other proteins (such as the kinesin light chains), vesicles and membranous organelles. A number of proteins have been recently found that contain a domain similar to that of the kinesin 'motor' domain [, ]: Drosophila melanogaster claret segregational protein (ncd). Ncd is required for normal chromosomal segregation in meiosis, in females, and in early mitotic divisions of the embryo. The ncd motor activity is directed toward the microtubule's minus end. Homo sapiens CENP-E []. CENP-E is a protein that associates with kinetochores during chromosome congression, relocates to the spindle midzone at anaphase, and is quantitatively discarded at the end of the cell division. CENP-E is probably an important motor molecule in chromosome movement and/or spindle elongation. H. sapiens mitotic kinesin-like protein-1 (MKLP-1), a motor protein whose activity is directed toward the microtubule's plus end. Saccharomyces cerevisiae KAR3 protein, which is essential for nuclear fusion during mating. KAR3 may mediate microtubule sliding during nuclear fusion and possibly mitosis. S. cerevisiae CIN8 and KIP1 proteins which are required for the assembly of the mitotic spindle. Both proteins seem to interact with spindle microtubules to produce an outwardly directed force acting upon the poles. Emericella nidulans (Aspergillus nidulans) bimC, which plays an important role in nuclear division. A. nidulans klpA. Caenorhabditis elegans unc-104, which may be required for the transport of substances needed for neuronal cell differentiation. C. elegans osm-3. Xenopus laevis Eg5, which may be involved in mitosis. Arabidopsis thaliana KatA, KatB and katC. Chlamydomonas reinhardtii FLA10/KHP1 and KLP1. Both proteins seem to play a role in the rotation or twisting of the microtubules of the flagella. C. elegans hypothetical protein T09A5.2. Kinesin-like proteins KLP2 (or KIF15) also contain a kinesin 'motor' domain. They are involved in mitotic spindle assembly, playing a role in positioning spindle poles during mitosis, specifically at prometaphase []. This entry represents a domain of unknown function found in this type of kinesin-like proteins.
Probab=25.77 E-value=2.3e+02 Score=24.57 Aligned_cols=58 Identities=28% Similarity=0.327 Sum_probs=43.9
Q ss_pred HHHHhhcHHHHHHHHHHhhhHHHHHHHHHHHHhhhhhhhHHHHHHHHHHhHHHHHHHhhhc
Q 012498 395 AALLLLSQQEERHLLERNVNSALQKKIEELQRNLFQVTTEKVKALMELAQLKQDYQLLQEY 455 (462)
Q Consensus 395 eALlLlSQqeER~llE~~~n~~lq~~ieeLqrnl~QVt~EKVkaLmElAqLkq~y~lL~~~ 455 (462)
++++-=+.--+-|+++.+ ..|...|+-|+..+-. .++=.+.-||--.|+++-.+|+.|
T Consensus 9 E~~~~g~l~~~~~~~~e~--~~L~eEI~~Lr~qve~-nPevtr~A~EN~rL~ee~rrl~~f 66 (86)
T PF12711_consen 9 EKLLDGKLPSESYLEEEN--EALKEEIQLLREQVEH-NPEVTRFAMENIRLREELRRLQSF 66 (86)
T ss_pred HHHhcCCCCccchhHHHH--HHHHHHHHHHHHHHHh-CHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333334455676666 7788889999999888 777778999999999999998865
No 182
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=25.74 E-value=5.9e+02 Score=24.31 Aligned_cols=36 Identities=22% Similarity=0.339 Sum_probs=24.5
Q ss_pred HHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHH
Q 012498 112 QVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQK 147 (462)
Q Consensus 112 qvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk 147 (462)
.|+|.|+-+-...+=+|...-=.+..|..|+.+.++
T Consensus 87 nV~~l~~RL~kLL~lk~~~~~~~e~~k~le~~~~~~ 122 (190)
T PF05266_consen 87 NVKFLRSRLNKLLSLKDDQEKLLEERKKLEKKIEEK 122 (190)
T ss_pred ccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 588999988888888886555555555555555444
No 183
>PRK00373 V-type ATP synthase subunit D; Reviewed
Probab=25.71 E-value=5.7e+02 Score=24.07 Aligned_cols=36 Identities=22% Similarity=0.383 Sum_probs=26.2
Q ss_pred HhhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHH
Q 012498 124 FAERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIE 164 (462)
Q Consensus 124 FAERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~ 164 (462)
.|.|=..+++ .|.+++..+|..+-..+.++...+.+
T Consensus 22 ~a~rg~~lLk-----~Krd~L~~e~~~~~~~~~~~r~~~~~ 57 (204)
T PRK00373 22 LAERGHKLLK-----DKRDELIMEFFDILDEAKKLREEVEE 57 (204)
T ss_pred HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666 77888888888888888877666543
No 184
>PF11365 DUF3166: Protein of unknown function (DUF3166); InterPro: IPR021507 This eukaryotic family of proteins has no known function.
Probab=25.71 E-value=91 Score=27.40 Aligned_cols=33 Identities=39% Similarity=0.639 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHH
Q 012498 60 LEQEIEILKQKIAACARENSNLQEELSEAYRIKG 93 (462)
Q Consensus 60 LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~ 93 (462)
.|.|-+-|.++++-.-.+|..|..||+. |+.+.
T Consensus 13 vEEEa~LlRRkl~ele~eN~~l~~EL~k-yk~~~ 45 (96)
T PF11365_consen 13 VEEEAELLRRKLSELEDENKQLTEELNK-YKSKY 45 (96)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhc
Confidence 3788999999999999999999999998 76653
No 185
>PF04065 Not3: Not1 N-terminal domain, CCR4-Not complex component ; InterPro: IPR007207 The Ccr4-Not complex (Not1, Not2, Not3, Not4 and Not5) is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID []. This domain is the N-terminal region of the Not proteins.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=25.45 E-value=1.6e+02 Score=29.12 Aligned_cols=82 Identities=16% Similarity=0.195 Sum_probs=50.7
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHHHHHHHHHHHHHhhhHHHHHHHHhhhhcc
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDKFISNAIAELRLCHSQLRVHVVNSLEEGR 309 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~ 309 (462)
++..|+.|..++|.+.+.++.|++..+=| ++-... +.+...+..+| +-.++|...=-.|+.+|..+.
T Consensus 127 l~~~Id~L~~QiE~~E~E~E~L~~~~kKk----k~~~~~----~~r~~~l~~~i-----erhk~Hi~kLE~lLR~L~N~~ 193 (233)
T PF04065_consen 127 LKDSIDELNRQIEQLEAEIESLSSQKKKK----KKDSTK----QERIEELESRI-----ERHKFHIEKLELLLRLLDNDE 193 (233)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC----ccCccc----hhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHcCC
Confidence 67789999999999999999999865432 111111 11111122222 224566666667888999998
Q ss_pred hhhhhhHHHHHhhhcc
Q 012498 310 SHIKSISDVIEEKTQH 325 (462)
Q Consensus 310 s~i~s~v~~ieekl~~ 325 (462)
..-.. |+.|.+-|+.
T Consensus 194 l~~e~-V~~ikediey 208 (233)
T PF04065_consen 194 LDPEQ-VEDIKEDIEY 208 (233)
T ss_pred CCHHH-HHHHHHHHHH
Confidence 76644 4457777733
No 186
>PF15294 Leu_zip: Leucine zipper
Probab=25.14 E-value=6.5e+02 Score=25.93 Aligned_cols=91 Identities=19% Similarity=0.345 Sum_probs=58.6
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHH-----HHHHHHHHHHHhhhHHHHHHHHh
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDK-----FISNAIAELRLCHSQLRVHVVNS 304 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk-----~i~ngi~~lq~~h~~~R~~Im~l 304 (462)
..+-|..|.+||++|++.+..++..--..++=-.-|+...+.|+..+.-... +-.--|++|.+--......+-+-
T Consensus 130 l~kEi~rLq~EN~kLk~rl~~le~~at~~l~Ek~kl~~~L~~lq~~~~~~~~k~~~~~~~q~l~dLE~k~a~lK~e~ek~ 209 (278)
T PF15294_consen 130 LNKEIDRLQEENEKLKERLKSLEKQATSALDEKSKLEAQLKELQDEQGDQKGKKDLSFKAQDLSDLENKMAALKSELEKA 209 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccchhhHHHHHHHHHHHHHHH
Confidence 6788999999999999988888876554444444467777777773322111 22234556666656666777677
Q ss_pred hhhcchhhhhhHHHHH
Q 012498 305 LEEGRSHIKSISDVIE 320 (462)
Q Consensus 305 L~ee~s~i~s~v~~ie 320 (462)
+.+..++.+++-..+.
T Consensus 210 ~~d~~~~~k~L~e~L~ 225 (278)
T PF15294_consen 210 LQDKESQQKALEETLQ 225 (278)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 7777777666554443
No 187
>PF14131 DUF4298: Domain of unknown function (DUF4298)
Probab=24.82 E-value=3.2e+02 Score=23.00 Aligned_cols=63 Identities=21% Similarity=0.238 Sum_probs=30.0
Q ss_pred HHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhh--hh---hcccccchhhhhcccc
Q 012498 156 EELSSENIELKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSL--EV---LETSWEDKCACLLLDS 219 (462)
Q Consensus 156 ~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~--e~---~~~s~~~Kcs~LL~Ds 219 (462)
.+.++-..+...+...|+..+...++.-... .-+.+||-=..... +. .++..+.+|+||=-|.
T Consensus 3 ~eme~~y~~~~~~l~~le~~l~~~~~~~~~~-~~L~~YY~s~~w~~d~e~~e~g~~~~~~~~gVLSEDa 70 (90)
T PF14131_consen 3 QEMEKIYNEWCELLEELEEALEKWQEAQPDY-RKLRDYYGSEEWMEDYEASEQGDLPTDGKCGVLSEDA 70 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHCcHhHHHHHHHHhCCCCCCCcccCccCchH
Confidence 3333334444444444444444444444333 33445772111111 11 4677889999985553
No 188
>PF08077 Cm_res_leader: Chloramphenicol resistance gene leader peptide; InterPro: IPR012537 This family consists of chloramphenicol (Cm) resistance gene leader peptides. Inducible resistance to Cm in both Gram-positive and Gram-negative bacteria is controlled by translation attenuation. In translation attenuation, the ribosome-binding-site (RBS) for the resistance determinant is sequestered in a secondary structure domain within the mRNA. Preceding the secondary structure is a short, translated ORF termed the leader. Ribosome stalling in the leader causes the destabilisation of the downstream secondary structure, allowing initiation of translation of the Cm resistance gene [].
Probab=24.82 E-value=11 Score=24.15 Aligned_cols=11 Identities=64% Similarity=0.927 Sum_probs=9.5
Q ss_pred cC-CchHHHhhH
Q 012498 41 AG-PSYLAVATR 51 (462)
Q Consensus 41 aG-pgyl~vATR 51 (462)
+| ||-++|.||
T Consensus 2 sgvpgalavvtr 13 (17)
T PF08077_consen 2 SGVPGALAVVTR 13 (17)
T ss_pred CCCCceEEEEEE
Confidence 56 999999987
No 189
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=24.69 E-value=4.1e+02 Score=22.08 Aligned_cols=29 Identities=28% Similarity=0.398 Sum_probs=20.4
Q ss_pred cchHHHHHHHHHHHHHHHHhHHHHHhhhh
Q 012498 228 TSTSKYISALEDELEKTRSSVENLQSKLR 256 (462)
Q Consensus 228 tstskyisaLEeE~e~lr~~i~~LQskLR 256 (462)
.+....+.-|+...+.+.+.++.|++.+.
T Consensus 83 ~~~~eA~~~l~~r~~~l~~~~~~l~~~~~ 111 (129)
T cd00890 83 KSLEEAIEFLKKRLETLEKQIEKLEKQLE 111 (129)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677777777777777777777776654
No 190
>PRK13694 hypothetical protein; Provisional
Probab=24.55 E-value=2.6e+02 Score=24.39 Aligned_cols=36 Identities=25% Similarity=0.522 Sum_probs=32.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYL 46 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl 46 (462)
...+.+.||..||.|...+.-||--+----.|-||=
T Consensus 13 ~Lr~fIERIERLEeEkk~i~~dikdVyaEAK~~GfD 48 (83)
T PRK13694 13 QLRAFIERIERLEEEKKTISDDIKDVYAEAKGNGFD 48 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc
Confidence 346788999999999999999999998888899993
No 191
>PF07111 HCR: Alpha helical coiled-coil rod protein (HCR); InterPro: IPR009800 This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation [].; GO: 0030154 cell differentiation, 0005634 nucleus, 0005737 cytoplasm
Probab=24.53 E-value=1.2e+03 Score=27.32 Aligned_cols=33 Identities=15% Similarity=0.351 Sum_probs=25.9
Q ss_pred cccCCcchHHHHHHHHHHHHHHHHhHHHHHhhh
Q 012498 223 WSFNDTSTSKYISALEDELEKTRSSVENLQSKL 255 (462)
Q Consensus 223 Wsfn~tstskyisaLEeE~e~lr~~i~~LQskL 255 (462)
|.--.--...-|..|+++.+.|.+.++.||-.|
T Consensus 240 we~Er~~L~~tVq~L~edR~~L~~T~ELLqVRv 272 (739)
T PF07111_consen 240 WEPEREELLETVQHLQEDRDALQATAELLQVRV 272 (739)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 554444466779999999999999999998544
No 192
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=24.26 E-value=7.7e+02 Score=25.09 Aligned_cols=26 Identities=35% Similarity=0.546 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHhhhh
Q 012498 231 SKYISALEDELEKTRSSVENLQSKLR 256 (462)
Q Consensus 231 skyisaLEeE~e~lr~~i~~LQskLR 256 (462)
...|+.|++++..|++.|..|+...+
T Consensus 199 re~i~el~e~I~~L~~eV~~L~~~~~ 224 (258)
T PF15397_consen 199 REEIDELEEEIPQLRAEVEQLQAQAQ 224 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 45799999999999999999998765
No 193
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=24.26 E-value=1.2e+03 Score=27.19 Aligned_cols=53 Identities=28% Similarity=0.342 Sum_probs=32.8
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhH
Q 012498 132 MEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNE 184 (462)
Q Consensus 132 mEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e 184 (462)
++-.++...=+.-..+|.+.+.+++++++.+.-.+..|..+-..+...++.++
T Consensus 610 ~~L~~~~d~lE~~~~qL~E~E~~L~eLq~eL~~~keS~s~~E~ql~~~~e~~e 662 (769)
T PF05911_consen 610 MELASCQDQLESLKNQLKESEQKLEELQSELESAKESNSLAETQLKAMKESYE 662 (769)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333334444555667777777777777777777777666666655544443
No 194
>PF14552 Tautomerase_2: Tautomerase enzyme; PDB: 2AAG_C 2AAL_A 2AAJ_A 1MWW_C.
Probab=24.17 E-value=63 Score=26.87 Aligned_cols=36 Identities=28% Similarity=0.397 Sum_probs=23.9
Q ss_pred HHHHHHhhhhhhh-hcccccchhhhhccccccccccC
Q 012498 191 NKFYEIRQQSLEV-LETSWEDKCACLLLDSAEMWSFN 226 (462)
Q Consensus 191 ~KFyeiR~~~~e~-~~~s~~~Kcs~LL~Ds~~~Wsfn 226 (462)
.+||..=...+.. ..++++|=.-+|..-+.++|||+
T Consensus 46 ~~ly~~l~~~L~~~~gi~p~Dv~I~l~e~~~edWSFg 82 (82)
T PF14552_consen 46 KALYRALAERLAEKLGIRPEDVMIVLVENPREDWSFG 82 (82)
T ss_dssp HHHHHHHHHHHHHHH---GGGEEEEEEEE-GGGEEEC
T ss_pred HHHHHHHHHHHHHHcCCCHHHEEEEEEECCcccCCCC
Confidence 3555554444543 78999999999999999999996
No 195
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=24.02 E-value=7.2e+02 Score=24.70 Aligned_cols=33 Identities=39% Similarity=0.434 Sum_probs=29.6
Q ss_pred HHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHH
Q 012498 149 NEFQTRLEELSSENIELKKQNATLRFDLEKQEE 181 (462)
Q Consensus 149 ~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~e 181 (462)
+++++.+.++++.+.+++++|..|-.||+....
T Consensus 2 ~~lq~~l~~l~~~~~~~~~L~~kLE~DL~~~~~ 34 (248)
T PF08172_consen 2 EELQKELSELEAKLEEQKELNAKLENDLAKVQA 34 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 567889999999999999999999999998753
No 196
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=23.61 E-value=3.9e+02 Score=21.40 Aligned_cols=59 Identities=19% Similarity=0.177 Sum_probs=37.1
Q ss_pred hhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhhhh-----HHHHHHhHHhHHHH
Q 012498 214 CLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLRMG-----LEIENHLKKSVREL 272 (462)
Q Consensus 214 ~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR~G-----LeIenhLkk~vr~L 272 (462)
.||.+-...+.=...+....+..|+..++.+...++-.+.-|.-| |...+++..+++.|
T Consensus 61 ~ll~~l~~~~~~~~~~l~~q~~~l~~~l~~l~~~~~~~e~~l~~~~~~e~L~~~~~i~~rl~~l 124 (127)
T smart00502 61 QLLEDLEEQKENKLKVLEQQLESLTQKQEKLSHAINFTEEALNSGDPTELLLSKKLIIERLQNL 124 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHH
Confidence 344443333333334567788888888888888888888888775 34445555555444
No 197
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=23.59 E-value=1.3e+03 Score=27.61 Aligned_cols=118 Identities=25% Similarity=0.202 Sum_probs=0.0
Q ss_pred hhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHH-------------------------------HHHHHHh
Q 012498 57 TAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADL-------------------------------HAAEVIK 105 (462)
Q Consensus 57 ta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadL-------------------------------h~ae~~K 105 (462)
..+|.-+|++++.+.....-+|-.|++++-.---.++||-|. -++.+++
T Consensus 666 I~~lD~~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg~~~~~~~~~~q~~e~~~t~~eel~a~~~e 745 (970)
T KOG0946|consen 666 IRELDYQIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLGIISSKQRDLLQGAEASKTQNEELNAALSE 745 (970)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccchhhHHhHHHhccCChHHHHHHHHH
Q ss_pred hHHHH-HHHHHhhhhHHHHHhhhhhhhHHHHHhHHHHHHHHHHHHHHHH-----------------HHHHHhHHHHHHHh
Q 012498 106 NMEAE-KQVKFFQGCMAAAFAERDNSVMEAEKAKEKEELMSQKFNEFQT-----------------RLEELSSENIELKK 167 (462)
Q Consensus 106 n~e~E-kqvkFfQs~vA~AFAERD~slmEaEkaKE~Ee~m~qk~~~~~~-----------------R~~E~~s~~~~qk~ 167 (462)
++.++ +| +=..|.-++-.++.+.|...+. -+-|+-....+.+.
T Consensus 746 ~k~l~~~q-------------------~~l~~~L~k~~~~~es~k~~~~~a~~~~~~~~~~~~~qeqv~El~~~l~e~~~ 806 (970)
T KOG0946|consen 746 NKKLENDQ-------------------ELLTKELNKKNADIESFKATQRSAELSQGSLNDNLGDQEQVIELLKNLSEEST 806 (970)
T ss_pred HHHHHHHH-------------------HHHHHHHHhhhHHHHHHHHHHhhhhcccchhhhhhhhHHHHHHHHHhhhhhhh
Q ss_pred hhHhHhhhHHHHHHhhHhHHHHHHHH
Q 012498 168 QNATLRFDLEKQEELNESFKEVINKF 193 (462)
Q Consensus 168 ~n~aLQ~dl~~~~eq~e~~~kVI~KF 193 (462)
.+..+|.++..+++|.+-...-|.-|
T Consensus 807 ~l~~~q~e~~~~keq~~t~~~~tsa~ 832 (970)
T KOG0946|consen 807 RLQELQSELTQLKEQIQTLLERTSAA 832 (970)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
No 198
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=23.42 E-value=1.2e+02 Score=23.82 Aligned_cols=24 Identities=29% Similarity=0.469 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHhHHHHHhhhhh
Q 012498 234 ISALEDELEKTRSSVENLQSKLRM 257 (462)
Q Consensus 234 isaLEeE~e~lr~~i~~LQskLR~ 257 (462)
|+||.++++.|..++..||+.+..
T Consensus 1 i~aLrqQv~aL~~qv~~Lq~~fs~ 24 (46)
T PF09006_consen 1 INALRQQVEALQGQVQRLQAAFSQ 24 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHH
Confidence 678888888888888888877654
No 199
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.24 E-value=8.2e+02 Score=25.05 Aligned_cols=74 Identities=20% Similarity=0.335 Sum_probs=42.8
Q ss_pred hhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHHH---HHHHHhhhhh
Q 012498 125 AERDNSVMEAEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVIN---KFYEIRQQSL 201 (462)
Q Consensus 125 AERD~slmEaEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI~---KFyeiR~~~~ 201 (462)
+--|..+-++.+.+- .+-.++..+..-+++.++...+.+.-++.++.++..++.+...+..=|. +-|.=|-|+.
T Consensus 34 ~~~ds~l~~~~~~~~---~~q~ei~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r~~~l~~raRAm 110 (265)
T COG3883 34 QNQDSKLSELQKEKK---NIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRARAM 110 (265)
T ss_pred HhhHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344667777666552 2224455555666666666666666677777777666666665555553 2333454444
No 200
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.12 E-value=3.9e+02 Score=22.81 Aligned_cols=49 Identities=16% Similarity=0.255 Sum_probs=31.7
Q ss_pred HHHHHHHHHhHHHHHHH----hhhHhH---hhhHHHHHHhhHhHHHHHHHHHHHhhhhh
Q 012498 150 EFQTRLEELSSENIELK----KQNATL---RFDLEKQEELNESFKEVINKFYEIRQQSL 201 (462)
Q Consensus 150 ~~~~R~~E~~s~~~~qk----~~n~aL---Q~dl~~~~eq~e~~~kVI~KFyeiR~~~~ 201 (462)
.++.|+.+++...--|. ++|++| |+.++++.+|.. -+++||-+++....
T Consensus 5 ~lE~Ri~eLE~r~AfQE~tieeLn~~laEq~~~i~k~q~qlr---~L~~kl~~~~~~~~ 60 (72)
T COG2900 5 ELEARIIELEIRLAFQEQTIEELNDALAEQQLVIDKLQAQLR---LLTEKLKDLQPSAI 60 (72)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhhccccc
Confidence 56777777777665554 456655 445555555555 78899988876444
No 201
>PLN02939 transferase, transferring glycosyl groups
Probab=22.93 E-value=1.4e+03 Score=27.53 Aligned_cols=182 Identities=20% Similarity=0.227 Sum_probs=94.2
Q ss_pred HHHHHHHH------HHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHH------------------
Q 012498 17 ARIQQLEH------ERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIA------------------ 72 (462)
Q Consensus 17 ~RI~qLe~------ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~------------------ 72 (462)
+|++-|++ |.+.|+.-|--|=|-=|-.+--...|-----||.-||..+|+|++.|.
T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (977)
T PLN02939 150 ARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELD 229 (977)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhhhhhhhccccchhhHHHHHHHhhhhhccccccccccccHHHHHH
Confidence 45554444 889999999999997766322111121122345556666666655442
Q ss_pred HhhhhhcchHHHHHHHHHHHHHHHHH-------HH--HH----HHhhHHHHHHHHHhhhhHHHHHhhhhhhhHH------
Q 012498 73 ACARENSNLQEELSEAYRIKGQLADL-------HA--AE----VIKNMEAEKQVKFFQGCMAAAFAERDNSVME------ 133 (462)
Q Consensus 73 ~c~ren~nLQEELsEAYRiK~qLadL-------h~--ae----~~Kn~e~EkqvkFfQs~vA~AFAERD~slmE------ 133 (462)
-.-.||--|.+.+ --+|..|.+. ++ +| -+--.++|+..--.|.-|+.--.=++-++||
T Consensus 230 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (977)
T PLN02939 230 VLKEENMLLKDDI---QFLKAELIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVENLQ 306 (977)
T ss_pred HHHHHhHHHHHHH---HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhccchhHHHHHHHHHHHH
Confidence 1222333333222 1123333322 00 11 1223456666655666665555555556666
Q ss_pred -----HHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhh------hHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhh
Q 012498 134 -----AEKAKEKEELMSQKFNEFQTRLEELSSENIELKKQ------NATLRFDLEKQEELNESFKEVINKFYEIRQQSL 201 (462)
Q Consensus 134 -----aEkaKE~Ee~m~qk~~~~~~R~~E~~s~~~~qk~~------n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~ 201 (462)
+-+.-|+.-.++++-++++.++..+++.+.+-.-. -+.||..+.-++++.+.+..-|+-+-++-+.+.
T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 385 (977)
T PLN02939 307 DLLDRATNQVEKAALVLDQNQDLRDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQESI 385 (977)
T ss_pred HHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHhhHhhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 33334455566777777777777777665443211 133555555666666655555555555544444
No 202
>KOG4117 consensus Heat shock factor binding protein [Transcription; Posttranslational modification, protein turnover, chaperones]
Probab=22.54 E-value=1.1e+02 Score=25.90 Aligned_cols=29 Identities=38% Similarity=0.709 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Q 012498 13 EALMARIQQLEHERDELRKDIEQLCMQQAG 42 (462)
Q Consensus 13 e~l~~RI~qLe~ERdEL~KDIEqLCMQQaG 42 (462)
.-.++||...-.--|.|.|.|--| |+|||
T Consensus 37 DQII~RiDDM~~riDDLEKnIaDL-m~qag 65 (73)
T KOG4117|consen 37 DQIIGRIDDMSSRIDDLEKNIADL-MTQAG 65 (73)
T ss_pred HHHHHHHhhhhhhhHHHHHHHHHH-HHHcc
Confidence 346778888888889999999887 88898
No 203
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=22.13 E-value=6.4e+02 Score=23.38 Aligned_cols=30 Identities=20% Similarity=0.341 Sum_probs=18.2
Q ss_pred hhhhhHHHHHHHHHHHHHhhhhhcchHHHH
Q 012498 56 RTAGLEQEIEILKQKIAACARENSNLQEEL 85 (462)
Q Consensus 56 Rta~LEQeiE~Lkkkl~~c~ren~nLQEEL 85 (462)
|+-+||.|++..+........+|-|-+.++
T Consensus 25 ~v~~LEreLe~~q~~~e~~~~daEn~k~ei 54 (140)
T PF10473_consen 25 HVESLERELEMSQENKECLILDAENSKAEI 54 (140)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 455666666666666666666666655544
No 204
>PF12341 DUF3639: Protein of unknown function (DUF3639) ; InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=22.06 E-value=8.3 Score=26.98 Aligned_cols=16 Identities=50% Similarity=0.688 Sum_probs=13.0
Q ss_pred cCCchHHHhhHHHHHh
Q 012498 41 AGPSYLAVATRMHFQR 56 (462)
Q Consensus 41 aGpgyl~vATRM~~qR 56 (462)
+||+|++|||.-.+-|
T Consensus 9 ~g~~~vavaTS~~~lR 24 (27)
T PF12341_consen 9 AGDSWVAVATSAGYLR 24 (27)
T ss_pred ccCCEEEEEeCCCeEE
Confidence 7999999999765544
No 205
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=21.95 E-value=4.3e+02 Score=27.28 Aligned_cols=48 Identities=29% Similarity=0.530 Sum_probs=34.1
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhhhHH
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIHSDK 281 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~~dk 281 (462)
..+|++.|+++++.+.+.+++|..+|.-.= +.+++.+.+++....+++
T Consensus 240 ~~~~~~~l~~~~~~~~~~i~~l~~~l~~~~----k~~~k~~~~~~q~~~~~k 287 (406)
T PF02388_consen 240 GKEYLESLQEKLEKLEKEIEKLEEKLEKNP----KKKNKLKELEEQLASLEK 287 (406)
T ss_dssp CHHHHHHHHHHHHHHHHHHHHHHHHHHH-T----HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCc----chhhHHHHHHHHHHHHHH
Confidence 568999999999999999999998764432 445555555555544444
No 206
>TIGR00414 serS seryl-tRNA synthetase. This model represents the seryl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam model tRNA-synt_2b. The seryl-tRNA synthetases of two archaeal species, Methanococcus jannaschii and Methanobacterium thermoautotrophicum, differ considerably and are included in a different model.
Probab=21.89 E-value=4.4e+02 Score=27.58 Aligned_cols=22 Identities=36% Similarity=0.675 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q 012498 14 ALMARIQQLEHERDELRKDIEQ 35 (462)
Q Consensus 14 ~l~~RI~qLe~ERdEL~KDIEq 35 (462)
.+..++..|++||+.+-|.|-+
T Consensus 41 ~~~~~~~~l~~erN~~sk~i~~ 62 (418)
T TIGR00414 41 KLLSEIEELQAKRNELSKQIGK 62 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4556677777788888888765
No 207
>PRK15041 methyl-accepting chemotaxis protein I; Provisional
Probab=21.71 E-value=9.7e+02 Score=25.36 Aligned_cols=31 Identities=32% Similarity=0.392 Sum_probs=24.4
Q ss_pred hcCCchHHHhh--HHHHHhhhhhHHHHHHHHHH
Q 012498 40 QAGPSYLAVAT--RMHFQRTAGLEQEIEILKQK 70 (462)
Q Consensus 40 QaGpgyl~vAT--RM~~qRta~LEQeiE~Lkkk 70 (462)
-+|-||=.||. |=++.||+.--++|..+=..
T Consensus 391 E~GrGFAVVA~EVR~LA~~s~~at~~I~~~i~~ 423 (554)
T PRK15041 391 EQGRGFAVVAGEVRNLAQRSAQAAREIKSLIED 423 (554)
T ss_pred CCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36789988885 77999999988888876543
No 208
>smart00340 HALZ homeobox associated leucin zipper.
Probab=21.68 E-value=1.3e+02 Score=23.55 Aligned_cols=33 Identities=33% Similarity=0.386 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHH
Q 012498 61 EQEIEILKQKIAACARENSNLQEELSEAYRIKG 93 (462)
Q Consensus 61 EQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~ 93 (462)
|-|-|-||+=-...+.||..||.|+.|-.++|.
T Consensus 4 EvdCe~LKrcce~LteeNrRL~ke~~eLralk~ 36 (44)
T smart00340 4 EVDCELLKRCCESLTEENRRLQKEVQELRALKL 36 (44)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 446677888888889999999999999887764
No 209
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=21.60 E-value=2.7e+02 Score=21.45 Aligned_cols=34 Identities=35% Similarity=0.441 Sum_probs=29.2
Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhHHHHHHHhh
Q 012498 420 KIEELQRNLFQVTTEKVKALMELAQLKQDYQLLQ 453 (462)
Q Consensus 420 ~ieeLqrnl~QVt~EKVkaLmElAqLkq~y~lL~ 453 (462)
.|++|+..+..++.+-...-.++..|++++..|.
T Consensus 27 ~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~ 60 (64)
T PF00170_consen 27 YIEELEEKVEELESENEELKKELEQLKKEIQSLK 60 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5788888888888888888899999999988775
No 210
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=21.57 E-value=9e+02 Score=24.92 Aligned_cols=61 Identities=11% Similarity=0.214 Sum_probs=34.2
Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHH
Q 012498 11 ESEALMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAA 73 (462)
Q Consensus 11 ~~e~l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~ 73 (462)
..+-+..++.+++.+-++..+-+... +++.|- ++.-.+-...+|.+.+++++...+.++.+
T Consensus 162 ~~~fl~~ql~~~~~~L~~ae~~l~~f-~~~~~~-~~~~~~~~~~~~l~~l~~~l~~~~~~l~~ 222 (498)
T TIGR03007 162 AQRFIDEQIKTYEKKLEAAENRLKAF-KQENGG-ILPDQEGDYYSEISEAQEELEAARLELNE 222 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHhCcc-cCccchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 34556667777777777777777766 555552 22222334445666666655555444433
No 211
>PRK10636 putative ABC transporter ATP-binding protein; Provisional
Probab=21.53 E-value=3.8e+02 Score=29.19 Aligned_cols=68 Identities=18% Similarity=0.196 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHH
Q 012498 18 RIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTAGLEQEIEILKQKIAACARENSNLQEELSEA 88 (462)
Q Consensus 18 RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEA 88 (462)
+|..||.+-.+|.+.|+.|=.+-+.|.+. +.--..+.+.|-++++.+++++..+..+=..|.++|.|+
T Consensus 564 ~~~~~e~~i~~le~~~~~l~~~l~~~~~~---~~~~~~~~~~~~~~~~~~~~~l~~~~~~w~~l~~~~~~~ 631 (638)
T PRK10636 564 EIARLEKEMEKLNAQLAQAEEKLGDSELY---DQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQM 631 (638)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCchhc---ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666666666666665555555321 111112455566666666666666555555555555443
No 212
>KOG0933 consensus Structural maintenance of chromosome protein 2 (chromosome condensation complex Condensin, subunit E) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=21.34 E-value=1.6e+03 Score=27.63 Aligned_cols=52 Identities=21% Similarity=0.378 Sum_probs=30.8
Q ss_pred HHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHH-HHHHHHHHh
Q 012498 54 FQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLA-DLHAAEVIK 105 (462)
Q Consensus 54 ~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLa-dLh~ae~~K 105 (462)
+|--+..+-+|+.-++.|.+..|+=..|+--=..--++|.||. .+|+..+.+
T Consensus 676 l~~l~~~~~~~~~~q~el~~le~eL~~le~~~~kf~~l~~ql~l~~~~l~l~~ 728 (1174)
T KOG0933|consen 676 LQKLKQAQKELRAIQKELEALERELKSLEAQSQKFRDLKQQLELKLHELALLE 728 (1174)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445556666777777777777765555443333445777776 445544443
No 213
>KOG3091 consensus Nuclear pore complex, p54 component (sc Nup57) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=21.26 E-value=9.4e+02 Score=26.91 Aligned_cols=73 Identities=23% Similarity=0.339 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCchHHHhhHHHHHhhh--hhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHH
Q 012498 15 LMARIQQLEHERDELRKDIEQLCMQQAGPSYLAVATRMHFQRTA--GLEQEIEILKQKIAACARENSNLQEELSEAYRIK 92 (462)
Q Consensus 15 l~~RI~qLe~ERdEL~KDIEqLCMQQaGpgyl~vATRM~~qRta--~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK 92 (462)
-.++|.++.+.--+|.+-|=++-.+|.+ .|-- .|--+=|.|.+||- +|+.++..---+|
T Consensus 374 ~~~KI~~~k~r~~~Ls~RiLRv~ikqei------------lr~~G~~L~~~EE~Lr~Kld-------tll~~ln~Pnq~k 434 (508)
T KOG3091|consen 374 AVAKIEEAKNRHVELSHRILRVMIKQEI------------LRKRGYALTPDEEELRAKLD-------TLLAQLNAPNQLK 434 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HhccCCcCCccHHHHHHHHH-------HHHHHhcChHHHH
Confidence 3445555555555555555444444332 2222 34445567777774 4455555567789
Q ss_pred HHHHHHHHHHHHhh
Q 012498 93 GQLADLHAAEVIKN 106 (462)
Q Consensus 93 ~qLadLh~ae~~Kn 106 (462)
..|+.|+-....+|
T Consensus 435 ~Rl~~L~e~~r~q~ 448 (508)
T KOG3091|consen 435 ARLDELYEILRMQN 448 (508)
T ss_pred HHHHHHHHHHHhhc
Confidence 99999976666665
No 214
>KOG4360 consensus Uncharacterized coiled coil protein [Function unknown]
Probab=21.20 E-value=4.3e+02 Score=29.77 Aligned_cols=119 Identities=24% Similarity=0.228 Sum_probs=0.0
Q ss_pred HHhhhHhHhhhHHHHHHhhHhHHHHHHHHHHHhhhhhhh----hcccccchhhhhccccccccccC------C----cch
Q 012498 165 LKKQNATLRFDLEKQEELNESFKEVINKFYEIRQQSLEV----LETSWEDKCACLLLDSAEMWSFN------D----TST 230 (462)
Q Consensus 165 qk~~n~aLQ~dl~~~~eq~e~~~kVI~KFyeiR~~~~e~----~~~s~~~Kcs~LL~Ds~~~Wsfn------~----tst 230 (462)
|..+-++||-.|-.+++.|. |-++-.| ..+++++|=+.+..|-.-.-.+- | .+-
T Consensus 157 ~~~~~EaL~ekLk~~~een~------------~lr~k~~llk~Et~~~~~keq~~y~~~~KelrdtN~q~~s~~eel~~k 224 (596)
T KOG4360|consen 157 QRELLEALQEKLKPLEEENT------------QLRSKAMLLKTETLTYEEKEQQLYGDCVKELRDTNTQARSGQEELQSK 224 (596)
T ss_pred HHHHHHHHHhhcCChHHHHH------------HHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHHhHHHHHhhhhh------------hHHHHHH--hHHhHHHHHHhhhhhHHHHHHHHHHHHHhhh
Q 012498 231 SKYISALEDELEKTRSSVENLQSKLRM------------GLEIENH--LKKSVRELEKKIIHSDKFISNAIAELRLCHS 295 (462)
Q Consensus 231 skyisaLEeE~e~lr~~i~~LQskLR~------------GLeIenh--Lkk~vr~Lekkqi~~dk~i~ngi~~lq~~h~ 295 (462)
.+=.+-+.||+.+|-+.|.-+|-|+|+ +.-+--| |.-..++||-|-+-.-.+....=.+|++.|+
T Consensus 225 t~el~~q~Ee~skLlsql~d~qkk~k~~~~Ekeel~~~Lq~~~da~~ql~aE~~EleDkyAE~m~~~~EaeeELk~lrs 303 (596)
T KOG4360|consen 225 TKELSRQQEENSKLLSQLVDLQKKIKYLRHEKEELDEHLQAYKDAQRQLTAELEELEDKYAECMQMLHEAEEELKCLRS 303 (596)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
No 215
>PF00015 MCPsignal: Methyl-accepting chemotaxis protein (MCP) signalling domain; InterPro: IPR004089 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides). MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues. This entry represents the signalling domain found in several methyl-accepting chemotaxis proteins. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs.; GO: 0004871 signal transducer activity, 0007165 signal transduction, 0016020 membrane; PDB: 2CH7_A 3ZX6_B 1QU7_A 3G6B_B 3UR1_C 3G67_B.
Probab=21.18 E-value=5.6e+02 Score=22.41 Aligned_cols=48 Identities=25% Similarity=0.340 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHHh----------------hcCCchHHHhh--HHHHHhhhhhHHHHHHHHHHHH
Q 012498 25 ERDELRKDIEQLCMQ----------------QAGPSYLAVAT--RMHFQRTAGLEQEIEILKQKIA 72 (462)
Q Consensus 25 ERdEL~KDIEqLCMQ----------------QaGpgyl~vAT--RM~~qRta~LEQeiE~Lkkkl~ 72 (462)
.-++.-+.|..+.-| .+|+||-.||- |=++.+|...=.+|..+=..+.
T Consensus 41 ~i~~~~~~i~~ia~qt~lLalNAsIEAaraGe~G~gF~vvA~eir~LA~~t~~~~~~I~~~i~~i~ 106 (213)
T PF00015_consen 41 DISEILSLINEIAEQTNLLALNASIEAARAGEAGRGFAVVADEIRKLAEQTSESAKEISEIIEEIQ 106 (213)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHTCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhHhhhhhccccchhcccchhHHHHHHHHHHhhhhhhhHHHHHHHHHhhhh
Confidence 344455566666655 36789988885 4577888777777766544333
No 216
>PLN02678 seryl-tRNA synthetase
Probab=21.16 E-value=3.9e+02 Score=28.69 Aligned_cols=16 Identities=6% Similarity=-0.114 Sum_probs=10.1
Q ss_pred hhhhhhHHHHHhhh-cc
Q 012498 310 SHIKSISDVIEEKT-QH 325 (462)
Q Consensus 310 s~i~s~v~~ieekl-~~ 325 (462)
.++..+++..++-+ .+
T Consensus 303 ~~~e~~l~~~~~i~~~L 319 (448)
T PLN02678 303 EMHEEMLKNSEDFYQSL 319 (448)
T ss_pred HHHHHHHHHHHHHHHHc
Confidence 45666777666666 44
No 217
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=21.12 E-value=4.1e+02 Score=25.24 Aligned_cols=44 Identities=30% Similarity=0.300 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHHH-------HhHHHHHHHhhhHhHhhhHHHHHHhhHhHH
Q 012498 144 MSQKFNEFQTRLEE-------LSSENIELKKQNATLRFDLEKQEELNESFK 187 (462)
Q Consensus 144 m~qk~~~~~~R~~E-------~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~ 187 (462)
++.++.+=+.|-++ |-.+++..+..|++|+.|+..++.+-..+.
T Consensus 65 ~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~N~~L~~dl~klt~~~~~l~ 115 (182)
T PF15035_consen 65 ALIRLEEEQQRSEELAQVNALLREQLEQARKANEALQEDLQKLTQDWERLR 115 (182)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56677777777777 666677778889999999999888777543
No 218
>PF14193 DUF4315: Domain of unknown function (DUF4315)
Probab=21.06 E-value=2.5e+02 Score=23.94 Aligned_cols=38 Identities=29% Similarity=0.444 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHhHHHHHhhhhhhHHHHHHhHHhHHHHHHhhhh
Q 012498 234 ISALEDELEKTRSSVENLQSKLRMGLEIENHLKKSVRELEKKIIH 278 (462)
Q Consensus 234 isaLEeE~e~lr~~i~~LQskLR~GLeIenhLkk~vr~Lekkqi~ 278 (462)
|.-+..++++.+.+|+.+|.+||. |.++-+++|.-+|+
T Consensus 3 leKi~~eieK~k~Kiae~Q~rlK~-------Le~qk~E~EN~EIv 40 (83)
T PF14193_consen 3 LEKIRAEIEKTKEKIAELQARLKE-------LEAQKTEAENLEIV 40 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHH
Confidence 566788999999999999999885 34555566655544
No 219
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=21.04 E-value=2.6e+02 Score=24.40 Aligned_cols=41 Identities=24% Similarity=0.524 Sum_probs=31.4
Q ss_pred HHHHHHHHHhHHHHHHHhhhHhHhhhHHHHHHhhHhHHHHH
Q 012498 150 EFQTRLEELSSENIELKKQNATLRFDLEKQEELNESFKEVI 190 (462)
Q Consensus 150 ~~~~R~~E~~s~~~~qk~~n~aLQ~dl~~~~eq~e~~~kVI 190 (462)
.+++|+.+++..+....+-|..|+..+..-.+.-..+++++
T Consensus 46 rwek~v~~L~~e~~~l~~E~e~L~~~l~~e~~Ek~~Ll~ll 86 (87)
T PF12709_consen 46 RWEKKVDELENENKALKRENEQLKKKLDTEREEKQELLKLL 86 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 47888999999998888888888888876666555555543
No 220
>COG3707 AmiR Response regulator with putative antiterminator output domain [Signal transduction mechanisms]
Probab=20.92 E-value=1.6e+02 Score=28.75 Aligned_cols=42 Identities=29% Similarity=0.487 Sum_probs=34.3
Q ss_pred HHHhhhhhHHHHHHHHHHHH---------HhhhhhcchHHHHHHHHHHHHHHH
Q 012498 53 HFQRTAGLEQEIEILKQKIA---------ACARENSNLQEELSEAYRIKGQLA 96 (462)
Q Consensus 53 ~~qRta~LEQeiE~Lkkkl~---------~c~ren~nLQEELsEAYRiK~qLa 96 (462)
-|..+..|++|.+++|++|+ |.+=.+.|+-|+ |||+.=+.+|
T Consensus 123 rf~~~~~L~~el~~~k~~L~~rK~ierAKglLM~~~g~sE~--EAy~~lR~~A 173 (194)
T COG3707 123 RFEERRALRRELAKLKDRLEERKVIERAKGLLMKRRGLSEE--EAYKLLRRTA 173 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHH--HHHHHHHHHH
Confidence 57788899999999999997 456677888875 8998877666
No 221
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=20.88 E-value=9.2e+02 Score=26.13 Aligned_cols=52 Identities=23% Similarity=0.302 Sum_probs=36.6
Q ss_pred HHHhhhhhHHHHHHHHHHHHHhhhhhcchHHHHHHHHHHHHHHHHHHHHHHH
Q 012498 53 HFQRTAGLEQEIEILKQKIAACARENSNLQEELSEAYRIKGQLADLHAAEVI 104 (462)
Q Consensus 53 ~~qRta~LEQeiE~Lkkkl~~c~ren~nLQEELsEAYRiK~qLadLh~ae~~ 104 (462)
+-++++-|+=.++.++...+.-.-|++.|-.||+||-|.+..|++=|+|-+.
T Consensus 139 ~~EEn~~lqlqL~~l~~e~~Ekeeesq~LnrELaE~layqq~L~~eyQatf~ 190 (401)
T PF06785_consen 139 LREENQCLQLQLDALQQECGEKEEESQTLNRELAEALAYQQELNDEYQATFV 190 (401)
T ss_pred HHHHHHHHHHhHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence 4444555555555555555555557899999999999999999998765543
No 222
>KOG3958 consensus Putative dynamitin [Cytoskeleton]
Probab=20.69 E-value=6e+02 Score=27.10 Aligned_cols=42 Identities=24% Similarity=0.275 Sum_probs=30.3
Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcC---CchHHHhhH
Q 012498 10 NESEALMARIQQLEHERDELRKDIEQLCMQ--QAG---PSYLAVATR 51 (462)
Q Consensus 10 ~~~e~l~~RI~qLe~ERdEL~KDIEqLCMQ--QaG---pgyl~vATR 51 (462)
...|-+..+.+.|.||-.||-..+|+|=.= .|- -.|+.+|+-
T Consensus 87 ~~kETp~qK~qRll~Ev~eL~~eve~ik~dk~~a~Eek~t~~l~A~v 133 (371)
T KOG3958|consen 87 GVKETPQQKYQRLLHEVQELTTEVEKIKTDKESATEEKLTPVLLAKV 133 (371)
T ss_pred CcccCHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhhhcchHHHHHH
Confidence 346677888999999999999999988543 111 356666653
No 223
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=20.45 E-value=2e+02 Score=23.23 Aligned_cols=32 Identities=22% Similarity=0.326 Sum_probs=23.0
Q ss_pred hHHHHHHHHHHHHHHHHhHHHHHhhhhhhHHH
Q 012498 230 TSKYISALEDELEKTRSSVENLQSKLRMGLEI 261 (462)
Q Consensus 230 tskyisaLEeE~e~lr~~i~~LQskLR~GLeI 261 (462)
.+.-|+.++.|++.++.+++.+..++|.-+.|
T Consensus 12 ~~~~i~tvk~en~~i~~~ve~i~envk~ll~l 43 (55)
T PF05377_consen 12 IESSINTVKKENEEISESVEKIEENVKDLLSL 43 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33347888888888888888888887765443
No 224
>PF01025 GrpE: GrpE; InterPro: IPR000740 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes the grpE protein. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold []. DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. The X-ray crystal structure of GrpE in complex with the ATPase domain of DnaK revealed that GrpE is an asymmetric homodimer, bent in a manner that favours extensive contacts with only one DnaKATPase monomer []. GrpE does not actively compete for the atomic positions occupied by the nucleotide. GrpE and ADP mutually reduce one another's affinity for DnaK 200-fold, and ATP instantly dissociates GrpE from DnaK.; GO: 0000774 adenyl-nucleotide exchange factor activity, 0042803 protein homodimerization activity, 0051087 chaperone binding, 0006457 protein folding; PDB: 3A6M_A 4ANI_A 1DKG_B.
Probab=20.41 E-value=2.6e+02 Score=24.62 Aligned_cols=52 Identities=35% Similarity=0.526 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHHhHHHHHhhh-hhhHHHHHHhHHhHHHHHHhh-hhhHHHHHH
Q 012498 234 ISALEDELEKTRSSVENLQSKL-RMGLEIENHLKKSVRELEKKI-IHSDKFISN 285 (462)
Q Consensus 234 isaLEeE~e~lr~~i~~LQskL-R~GLeIenhLkk~vr~Lekkq-i~~dk~i~n 285 (462)
+..++.++..+.++++.|+..+ |.--+++|..++-.+..+... -...+|+..
T Consensus 13 ~~~~~~~l~~l~~~~~~l~~~~~r~~ae~en~~~r~~~e~~~~~~~~~~~~~~~ 66 (165)
T PF01025_consen 13 IEELEEELEELEKEIEELKERLLRLQAEFENYRKRLEKEKEEAKKYALEKFLKD 66 (165)
T ss_dssp HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455555556666665553 444577888776666554333 234555544
No 225
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=20.22 E-value=4.5e+02 Score=24.11 Aligned_cols=50 Identities=20% Similarity=0.262 Sum_probs=31.6
Q ss_pred hcccccchhhhhccccccccccCCcchHHHHHHHHHHHHHHHHhHHHHHhhhh
Q 012498 204 LETSWEDKCACLLLDSAEMWSFNDTSTSKYISALEDELEKTRSSVENLQSKLR 256 (462)
Q Consensus 204 ~~~s~~~Kcs~LL~Ds~~~Wsfn~tstskyisaLEeE~e~lr~~i~~LQskLR 256 (462)
.++|.++|..+- +-..+|. +-+++-.+|++|.+...+|-+++.+|...++
T Consensus 19 ~~Lt~eeK~~lk--ev~~~~~-~~~~~de~i~~LK~ksP~L~~k~~~l~~~~k 68 (154)
T PF05823_consen 19 KNLTPEEKAELK--EVAKNYA-KFKNEDEMIAALKEKSPSLYEKAEKLRDKLK 68 (154)
T ss_dssp HH--TTTHHHHH--HHHTT--------TTHHHHHHHH-HHHHHHHHHHHHHHH
T ss_pred HcCCHHHHHHHH--HHHHHcc-ccCCHHHHHHHHHHhCHHHHHHHHHHHHHHH
Confidence 678999998764 4444553 2245778999999999999999998866554
No 226
>KOG2685 consensus Cystoskeletal protein Tektin [Cytoskeleton]
Probab=20.08 E-value=1.2e+03 Score=25.64 Aligned_cols=107 Identities=14% Similarity=0.161 Sum_probs=72.5
Q ss_pred hcccccchhhhhcccccccccc-CCcchHHHHHHHHHHHHHHHHhHHHHHhhh----hhhHHHHHHhHHhHHHHHHhhhh
Q 012498 204 LETSWEDKCACLLLDSAEMWSF-NDTSTSKYISALEDELEKTRSSVENLQSKL----RMGLEIENHLKKSVRELEKKIIH 278 (462)
Q Consensus 204 ~~~s~~~Kcs~LL~Ds~~~Wsf-n~tstskyisaLEeE~e~lr~~i~~LQskL----R~GLeIenhLkk~vr~Lekkqi~ 278 (462)
.-++.++||..|=.+|..---| +++-...-++++|.=.+-..+-++.-|+.. ..|-.+++.|-.-++.|.....-
T Consensus 192 eA~~ID~~c~~L~~~S~~I~~~p~~~R~~~~~~s~e~W~~fs~~nl~~ae~er~~S~~LR~~l~~~l~~tan~lr~Q~~~ 271 (421)
T KOG2685|consen 192 EAYEIDEKCLALNNNSPNISYKPDPTRVPPNSSSPESWAKFSGDNLDRAERERAASAALREALDQTLRETANDLRTQADA 271 (421)
T ss_pred hhheechhhhhhcCCCCCeeccCCCccCCCCCCCHHHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3478899999998887654222 222122222234444444444444444432 23456677788889999999999
Q ss_pred hHHHHHHHHHHHHHhhhHHHHHHHHhhhhcch
Q 012498 279 SDKFISNAIAELRLCHSQLRVHVVNSLEEGRS 310 (462)
Q Consensus 279 ~dk~i~ngi~~lq~~h~~~R~~Im~lL~ee~s 310 (462)
.+.-+.++|++.+......-.+.-+.|++-..
T Consensus 272 ve~af~~ri~etqdar~kL~~ql~k~leEi~~ 303 (421)
T KOG2685|consen 272 VELAFKKRIRETQDARNKLEWQLAKTLEEIAD 303 (421)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 99999999999999988888888888877443
Done!