Query psy21
Match_columns 288
No_of_seqs 310 out of 1434
Neff 6.6
Searched_HMMs 46136
Date Sat Aug 17 00:14:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy21.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/21hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0977|consensus 100.0 1.1E-36 2.3E-41 298.9 9.3 166 2-179 372-543 (546)
2 PF00038 Filament: Intermediat 99.7 1.7E-17 3.8E-22 155.0 14.0 120 163-284 171-312 (312)
3 PF00932 LTD: Lamin Tail Domai 99.6 1.7E-14 3.7E-19 116.1 9.5 99 67-175 4-108 (116)
4 KOG0977|consensus 99.5 2.6E-14 5.7E-19 141.4 10.0 122 163-286 251-394 (546)
5 PF00038 Filament: Intermediat 96.7 0.033 7.1E-07 52.1 13.0 71 209-279 212-290 (312)
6 KOG4642|consensus 95.5 0.01 2.2E-07 54.2 2.9 78 204-283 188-283 (284)
7 PRK11637 AmiB activator; Provi 94.9 0.24 5.2E-06 48.7 10.8 67 193-259 69-135 (428)
8 COG1579 Zn-ribbon protein, pos 94.9 0.83 1.8E-05 41.8 13.2 98 184-281 30-150 (239)
9 COG1579 Zn-ribbon protein, pos 94.4 0.9 1.9E-05 41.5 12.4 68 206-276 89-156 (239)
10 PRK11637 AmiB activator; Provi 94.2 0.66 1.4E-05 45.6 12.0 78 185-262 54-131 (428)
11 PRK09039 hypothetical protein; 94.0 0.46 9.9E-06 45.6 10.1 9 233-241 171-179 (343)
12 PF07888 CALCOCO1: Calcium bin 93.9 2.3 4.9E-05 43.3 15.1 38 231-268 217-254 (546)
13 PF08614 ATG16: Autophagy prot 93.2 0.67 1.5E-05 40.7 9.0 77 189-265 106-182 (194)
14 PRK10884 SH3 domain-containing 93.0 1.9 4.2E-05 38.5 11.7 49 205-253 124-172 (206)
15 PRK09039 hypothetical protein; 92.7 1.6 3.4E-05 42.0 11.4 39 205-243 129-167 (343)
16 TIGR02894 DNA_bind_RsfA transc 92.6 2.9 6.4E-05 35.9 11.6 47 217-263 101-147 (161)
17 PF10473 CENP-F_leu_zip: Leuci 92.2 3.1 6.7E-05 35.0 11.3 36 190-225 8-43 (140)
18 PF10146 zf-C4H2: Zinc finger- 92.1 4.9 0.00011 36.5 13.3 64 199-262 32-95 (230)
19 PF10146 zf-C4H2: Zinc finger- 91.6 4.3 9.4E-05 36.9 12.4 89 189-277 15-103 (230)
20 PF14662 CCDC155: Coiled-coil 91.4 3.7 8E-05 36.3 11.2 75 204-278 72-146 (193)
21 KOG0161|consensus 91.1 3.7 8E-05 47.6 13.7 100 182-281 1066-1172(1930)
22 PRK15178 Vi polysaccharide exp 91.0 5.6 0.00012 39.5 13.4 79 189-272 260-363 (434)
23 PF04156 IncA: IncA protein; 90.8 4.2 9.1E-05 35.2 11.2 21 201-221 90-110 (191)
24 PF11932 DUF3450: Protein of u 90.5 7.8 0.00017 35.3 13.2 23 204-226 61-83 (251)
25 COG4942 Membrane-bound metallo 89.9 3.4 7.4E-05 40.7 10.7 67 184-250 44-110 (420)
26 PF10234 Cluap1: Clusterin-ass 89.8 6.9 0.00015 36.4 12.1 52 225-276 209-260 (267)
27 PF11559 ADIP: Afadin- and alp 89.5 7.1 0.00015 32.6 11.2 71 189-259 63-133 (151)
28 COG3883 Uncharacterized protei 89.4 4 8.6E-05 37.9 10.2 63 193-255 46-108 (265)
29 PF08317 Spc7: Spc7 kinetochor 89.0 8.7 0.00019 36.5 12.6 48 232-279 221-268 (325)
30 PF04849 HAP1_N: HAP1 N-termin 88.9 5.5 0.00012 37.7 10.9 21 256-276 284-304 (306)
31 TIGR02338 gimC_beta prefoldin, 88.7 7.1 0.00015 31.1 10.1 77 186-262 11-109 (110)
32 COG4026 Uncharacterized protei 88.5 2.9 6.4E-05 37.9 8.4 63 200-262 136-198 (290)
33 PF04156 IncA: IncA protein; 88.5 15 0.00033 31.6 13.8 28 192-219 95-122 (191)
34 PF08317 Spc7: Spc7 kinetochor 88.4 3.1 6.7E-05 39.5 9.1 47 205-251 208-261 (325)
35 PF10186 Atg14: UV radiation r 88.3 9.9 0.00022 34.7 12.3 48 203-250 60-107 (302)
36 PRK02119 hypothetical protein; 88.2 5 0.00011 29.9 8.3 46 207-252 3-48 (73)
37 PF00345 PapD_N: Pili and flag 88.2 0.88 1.9E-05 36.5 4.6 54 82-135 14-72 (122)
38 PF03148 Tektin: Tektin family 88.0 11 0.00024 36.6 12.9 95 184-278 246-368 (384)
39 PF12718 Tropomyosin_1: Tropom 87.6 9.1 0.0002 32.1 10.5 62 190-251 19-97 (143)
40 PF09726 Macoilin: Transmembra 87.5 12 0.00027 39.4 13.6 36 244-279 548-584 (697)
41 PF13851 GAS: Growth-arrest sp 87.2 17 0.00036 32.3 12.5 86 187-272 29-124 (201)
42 COG3883 Uncharacterized protei 86.7 7.9 0.00017 35.9 10.4 57 194-250 40-96 (265)
43 KOG4687|consensus 86.6 7.9 0.00017 36.2 10.2 109 171-279 8-128 (389)
44 PF11559 ADIP: Afadin- and alp 86.2 15 0.00032 30.7 11.2 58 201-258 61-118 (151)
45 COG3074 Uncharacterized protei 86.0 5.7 0.00012 29.5 7.2 46 191-236 24-69 (79)
46 COG2433 Uncharacterized conser 85.9 2.9 6.3E-05 42.9 7.6 49 204-252 420-468 (652)
47 PF09726 Macoilin: Transmembra 85.7 5.6 0.00012 41.9 10.0 36 205-240 459-494 (697)
48 PF09304 Cortex-I_coil: Cortex 85.6 17 0.00037 29.2 11.1 41 212-252 36-76 (107)
49 PF12325 TMF_TATA_bd: TATA ele 85.5 11 0.00024 30.9 9.6 78 185-262 30-117 (120)
50 PF04871 Uso1_p115_C: Uso1 / p 85.4 17 0.00038 30.2 11.0 49 231-279 59-108 (136)
51 PRK04406 hypothetical protein; 85.0 10 0.00022 28.4 8.5 41 210-250 8-48 (75)
52 PF08826 DMPK_coil: DMPK coile 84.8 3.8 8.1E-05 29.6 5.8 36 191-226 17-52 (61)
53 COG2433 Uncharacterized conser 84.8 10 0.00022 39.1 10.9 71 192-262 422-509 (652)
54 KOG4603|consensus 84.2 5.7 0.00012 34.6 7.6 36 192-227 79-114 (201)
55 COG1196 Smc Chromosome segrega 84.1 18 0.0004 40.2 13.6 10 123-132 644-653 (1163)
56 KOG0250|consensus 83.7 8.7 0.00019 41.9 10.4 43 240-282 747-789 (1074)
57 PF08581 Tup_N: Tup N-terminal 83.6 12 0.00026 28.4 8.4 65 184-251 10-74 (79)
58 PF08647 BRE1: BRE1 E3 ubiquit 83.5 12 0.00026 29.2 8.7 47 232-278 22-68 (96)
59 PF12329 TMF_DNA_bd: TATA elem 83.3 13 0.00029 27.6 8.5 57 194-250 14-70 (74)
60 PF11932 DUF3450: Protein of u 83.3 18 0.0004 32.8 11.2 37 190-226 54-90 (251)
61 KOG1962|consensus 83.2 12 0.00025 33.8 9.5 59 204-262 149-207 (216)
62 PRK02224 chromosome segregatio 83.1 13 0.00027 39.8 11.6 23 256-278 420-442 (880)
63 PF12325 TMF_TATA_bd: TATA ele 83.1 24 0.00052 28.9 12.2 8 165-172 24-31 (120)
64 KOG4807|consensus 82.9 43 0.00094 33.0 13.8 37 243-282 458-494 (593)
65 PRK02224 chromosome segregatio 82.8 11 0.00025 40.1 11.1 46 240-285 411-456 (880)
66 PF04102 SlyX: SlyX; InterPro 82.5 5.8 0.00013 29.1 6.2 45 207-251 5-49 (69)
67 PRK00295 hypothetical protein; 82.5 11 0.00024 27.6 7.6 39 208-246 7-45 (68)
68 PF04111 APG6: Autophagy prote 82.3 20 0.00044 34.0 11.4 41 207-247 51-91 (314)
69 COG1196 Smc Chromosome segrega 82.2 23 0.0005 39.5 13.4 6 103-108 98-103 (1163)
70 PRK00736 hypothetical protein; 82.1 11 0.00025 27.6 7.6 41 208-248 7-47 (68)
71 PRK02793 phi X174 lysis protei 82.1 13 0.00029 27.5 8.0 38 211-248 6-43 (72)
72 PF06005 DUF904: Protein of un 81.9 15 0.00032 27.3 8.2 24 204-227 23-46 (72)
73 PF04111 APG6: Autophagy prote 81.9 19 0.00041 34.2 11.0 21 204-224 62-82 (314)
74 PF02050 FliJ: Flagellar FliJ 81.2 22 0.00049 27.2 10.5 28 231-258 63-90 (123)
75 KOG0976|consensus 81.2 20 0.00044 38.3 11.5 62 190-251 104-165 (1265)
76 smart00787 Spc7 Spc7 kinetocho 81.1 12 0.00027 35.5 9.4 6 161-166 144-149 (312)
77 PF14282 FlxA: FlxA-like prote 80.8 10 0.00022 30.2 7.5 24 230-253 54-77 (106)
78 PRK09343 prefoldin subunit bet 80.7 29 0.00063 28.2 10.5 33 229-261 80-112 (121)
79 KOG2264|consensus 80.7 9.7 0.00021 39.1 8.8 66 190-269 91-159 (907)
80 PRK04325 hypothetical protein; 80.6 11 0.00024 28.1 7.2 39 208-246 11-49 (74)
81 PF06818 Fez1: Fez1; InterPro 80.6 20 0.00042 32.1 9.9 87 165-253 4-106 (202)
82 PF14662 CCDC155: Coiled-coil 80.2 27 0.00058 31.0 10.5 82 186-267 61-152 (193)
83 KOG0250|consensus 80.2 34 0.00073 37.6 13.2 25 87-111 113-141 (1074)
84 PF12718 Tropomyosin_1: Tropom 80.1 25 0.00054 29.5 10.1 40 211-250 19-58 (143)
85 PRK04406 hypothetical protein; 80.0 11 0.00024 28.2 7.0 47 204-250 9-55 (75)
86 PF08826 DMPK_coil: DMPK coile 80.0 18 0.00039 26.1 7.8 45 203-247 15-59 (61)
87 PRK02793 phi X174 lysis protei 79.9 13 0.00028 27.6 7.3 48 204-251 6-53 (72)
88 KOG0995|consensus 79.9 32 0.00069 35.3 12.2 37 231-268 357-393 (581)
89 PF07798 DUF1640: Protein of u 79.9 39 0.00084 29.1 13.0 24 205-228 72-95 (177)
90 PF09789 DUF2353: Uncharacteri 79.6 15 0.00033 35.1 9.4 38 239-276 131-168 (319)
91 PF15397 DUF4618: Domain of un 79.6 34 0.00073 31.8 11.4 38 185-225 63-100 (258)
92 KOG0971|consensus 79.4 30 0.00066 37.5 12.2 37 186-222 404-440 (1243)
93 PRK04325 hypothetical protein; 79.3 20 0.00043 26.7 8.2 44 209-252 5-48 (74)
94 PF15070 GOLGA2L5: Putative go 79.2 41 0.00088 35.1 13.2 29 232-260 85-113 (617)
95 PF05384 DegS: Sensor protein 79.0 28 0.00061 29.9 10.1 76 184-259 76-151 (159)
96 PF06785 UPF0242: Uncharacteri 78.9 39 0.00084 32.6 11.7 22 211-232 132-153 (401)
97 PRK00295 hypothetical protein; 78.8 18 0.00039 26.5 7.8 42 211-252 3-44 (68)
98 PF10186 Atg14: UV radiation r 78.8 37 0.0008 30.9 11.7 16 204-219 75-90 (302)
99 COG3074 Uncharacterized protei 78.7 23 0.00049 26.4 8.0 31 195-225 14-44 (79)
100 PF15290 Syntaphilin: Golgi-lo 78.6 19 0.00042 33.7 9.5 57 194-250 77-140 (305)
101 PF15254 CCDC14: Coiled-coil d 78.5 41 0.00089 35.8 12.8 71 205-275 426-500 (861)
102 KOG0996|consensus 78.5 53 0.0011 36.6 13.9 14 233-246 864-877 (1293)
103 PF10805 DUF2730: Protein of u 78.4 14 0.00031 29.3 7.7 23 229-251 67-89 (106)
104 PRK10884 SH3 domain-containing 78.4 18 0.00039 32.3 9.1 21 113-135 40-60 (206)
105 PF06005 DUF904: Protein of un 78.3 25 0.00055 26.1 8.9 49 185-233 11-59 (72)
106 PRK02119 hypothetical protein; 78.1 14 0.0003 27.5 7.0 48 204-251 7-54 (73)
107 PRK00846 hypothetical protein; 78.0 20 0.00042 27.2 7.8 44 205-248 12-55 (77)
108 PF10473 CENP-F_leu_zip: Leuci 78.0 41 0.00088 28.3 11.7 18 227-244 87-104 (140)
109 PF07889 DUF1664: Protein of u 77.3 34 0.00075 28.2 9.8 33 201-233 70-102 (126)
110 PF11611 DUF4352: Domain of un 77.2 4.6 0.0001 31.8 4.6 77 82-159 32-121 (123)
111 KOG2264|consensus 77.1 18 0.0004 37.1 9.6 19 260-278 130-148 (907)
112 PF06810 Phage_GP20: Phage min 77.1 22 0.00049 30.2 9.0 23 191-213 26-48 (155)
113 PF10805 DUF2730: Protein of u 76.7 18 0.00038 28.8 7.8 12 236-247 67-78 (106)
114 COG4026 Uncharacterized protei 76.5 24 0.00051 32.2 9.2 67 195-261 138-204 (290)
115 KOG0971|consensus 76.2 48 0.001 36.0 12.6 31 242-272 397-427 (1243)
116 KOG0933|consensus 76.2 22 0.00048 38.8 10.3 21 250-270 887-907 (1174)
117 PRK00736 hypothetical protein; 76.1 24 0.00052 25.8 7.7 42 211-252 3-44 (68)
118 KOG0964|consensus 76.1 30 0.00065 37.7 11.2 56 227-282 265-320 (1200)
119 PRK15422 septal ring assembly 76.1 31 0.00067 26.2 8.3 23 201-223 20-42 (79)
120 COG4942 Membrane-bound metallo 76.1 26 0.00056 34.7 10.2 75 185-259 52-126 (420)
121 TIGR00606 rad50 rad50. This fa 75.8 50 0.0011 37.3 13.7 44 237-280 1050-1093(1311)
122 KOG0976|consensus 75.7 26 0.00055 37.5 10.4 63 191-253 84-146 (1265)
123 KOG0978|consensus 75.7 16 0.00034 38.5 9.0 76 205-281 572-647 (698)
124 KOG0161|consensus 75.6 26 0.00056 41.1 11.3 42 240-281 1694-1735(1930)
125 PF07058 Myosin_HC-like: Myosi 75.6 11 0.00023 35.8 7.0 74 206-284 35-126 (351)
126 PRK00846 hypothetical protein; 75.6 30 0.00066 26.1 8.3 44 210-253 10-53 (77)
127 PF10212 TTKRSYEDQ: Predicted 75.5 37 0.00081 34.5 11.3 23 232-254 485-507 (518)
128 PF04102 SlyX: SlyX; InterPro 75.4 22 0.00047 26.0 7.4 40 212-251 3-42 (69)
129 TIGR03017 EpsF chain length de 75.3 68 0.0015 31.3 13.1 8 272-279 356-363 (444)
130 PRK15422 septal ring assembly 75.1 17 0.00037 27.6 6.7 55 185-239 11-72 (79)
131 KOG0804|consensus 74.8 21 0.00045 35.6 9.1 23 184-206 353-375 (493)
132 PF05911 DUF869: Plant protein 74.7 24 0.00052 37.7 10.2 26 232-257 678-703 (769)
133 PRK10803 tol-pal system protei 74.5 19 0.00041 33.2 8.5 23 211-233 59-81 (263)
134 PF09744 Jnk-SapK_ap_N: JNK_SA 74.5 55 0.0012 28.1 10.9 50 165-221 14-65 (158)
135 PF13805 Pil1: Eisosome compon 74.4 76 0.0016 29.7 12.3 24 202-225 134-157 (271)
136 PF06785 UPF0242: Uncharacteri 74.3 40 0.00088 32.4 10.5 23 201-223 94-116 (401)
137 KOG0288|consensus 74.0 58 0.0012 32.3 11.8 56 168-223 17-72 (459)
138 PF06818 Fez1: Fez1; InterPro 74.0 48 0.001 29.6 10.5 72 207-279 32-104 (202)
139 KOG0933|consensus 74.0 55 0.0012 35.9 12.5 52 229-280 789-840 (1174)
140 KOG4643|consensus 73.9 24 0.00052 38.5 9.9 65 205-269 263-343 (1195)
141 PF10481 CENP-F_N: Cenp-F N-te 73.9 24 0.00053 33.0 8.8 64 191-254 45-129 (307)
142 PF10267 Tmemb_cc2: Predicted 73.7 25 0.00054 34.6 9.4 18 7-24 62-79 (395)
143 TIGR01843 type_I_hlyD type I s 73.3 53 0.0011 31.4 11.6 27 233-259 209-235 (423)
144 PF01920 Prefoldin_2: Prefoldi 72.5 41 0.00089 25.7 9.7 31 194-224 7-37 (106)
145 KOG0996|consensus 72.5 39 0.00085 37.5 11.1 88 192-279 493-594 (1293)
146 PF13870 DUF4201: Domain of un 72.4 61 0.0013 27.7 12.8 95 185-279 56-169 (177)
147 KOG2391|consensus 72.3 28 0.00062 33.5 9.1 35 194-228 234-268 (365)
148 PF05266 DUF724: Protein of un 72.1 69 0.0015 28.2 11.0 55 205-259 130-184 (190)
149 PRK13729 conjugal transfer pil 72.0 12 0.00026 37.5 6.9 26 225-250 95-120 (475)
150 KOG2685|consensus 71.8 70 0.0015 31.6 11.8 92 184-275 273-392 (421)
151 COG1382 GimC Prefoldin, chaper 71.6 55 0.0012 26.8 10.1 34 186-219 14-47 (119)
152 TIGR01843 type_I_hlyD type I s 71.3 75 0.0016 30.3 12.2 10 124-133 64-73 (423)
153 smart00637 CBD_II CBD_II domai 71.2 22 0.00048 26.9 6.9 45 88-134 12-72 (92)
154 TIGR03007 pepcterm_ChnLen poly 70.9 78 0.0017 31.4 12.6 20 242-261 356-375 (498)
155 KOG4809|consensus 70.8 63 0.0014 33.1 11.5 13 266-278 425-437 (654)
156 PRK11020 hypothetical protein; 70.7 41 0.00088 27.3 8.3 60 204-263 3-68 (118)
157 PF07926 TPR_MLP1_2: TPR/MLP1/ 70.0 36 0.00077 27.9 8.3 13 239-251 103-115 (132)
158 COG4372 Uncharacterized protei 69.6 80 0.0017 31.1 11.5 35 189-223 134-168 (499)
159 PF13851 GAS: Growth-arrest sp 69.6 81 0.0017 27.9 12.5 30 230-259 110-139 (201)
160 TIGR03495 phage_LysB phage lys 69.0 58 0.0013 27.2 9.3 16 206-221 40-55 (135)
161 KOG1962|consensus 68.0 49 0.0011 29.9 9.2 53 211-263 149-201 (216)
162 PF00261 Tropomyosin: Tropomyo 67.9 93 0.002 28.0 12.5 26 233-258 196-221 (237)
163 PF02403 Seryl_tRNA_N: Seryl-t 67.3 58 0.0012 25.4 8.9 63 190-252 27-99 (108)
164 PF10234 Cluap1: Clusterin-ass 67.1 55 0.0012 30.5 9.7 74 193-266 145-222 (267)
165 PF05700 BCAS2: Breast carcino 67.1 68 0.0015 28.7 10.2 12 165-176 124-135 (221)
166 KOG0804|consensus 67.1 1.1E+02 0.0024 30.6 12.2 25 198-222 360-384 (493)
167 smart00787 Spc7 Spc7 kinetocho 67.0 1.1E+02 0.0023 29.2 11.9 38 157-194 106-145 (312)
168 KOG0946|consensus 66.8 63 0.0014 34.7 10.9 28 249-276 738-765 (970)
169 PF07061 Swi5: Swi5; InterPro 66.6 39 0.00085 25.8 7.3 63 204-276 5-67 (83)
170 TIGR00606 rad50 rad50. This fa 66.0 56 0.0012 36.9 11.4 29 249-277 938-966 (1311)
171 PF07076 DUF1344: Protein of u 66.0 17 0.00036 26.3 4.8 37 78-135 9-48 (61)
172 PF03670 UPF0184: Uncharacteri 65.5 50 0.0011 25.3 7.5 43 210-252 30-72 (83)
173 PRK01156 chromosome segregatio 65.4 79 0.0017 34.0 12.0 14 235-248 696-709 (895)
174 PF15294 Leu_zip: Leucine zipp 65.3 67 0.0015 30.1 9.9 73 185-257 132-213 (278)
175 PF05701 WEMBL: Weak chloropla 65.3 69 0.0015 32.5 10.9 67 185-251 281-354 (522)
176 PRK09841 cryptic autophosphory 65.3 52 0.0011 34.7 10.4 48 231-278 343-390 (726)
177 KOG4643|consensus 64.9 54 0.0012 36.0 10.2 72 211-282 175-246 (1195)
178 PF12329 TMF_DNA_bd: TATA elem 64.9 54 0.0012 24.3 7.6 24 206-229 33-56 (74)
179 TIGR01005 eps_transp_fam exopo 64.2 1E+02 0.0022 32.5 12.4 20 243-262 378-397 (754)
180 PRK11546 zraP zinc resistance 64.1 28 0.00061 29.4 6.6 22 232-253 87-108 (143)
181 KOG2629|consensus 63.7 46 0.00099 31.4 8.4 8 267-274 180-187 (300)
182 PRK10803 tol-pal system protei 63.7 52 0.0011 30.3 9.0 32 196-227 58-89 (263)
183 PF05278 PEARLI-4: Arabidopsis 63.5 90 0.0019 29.2 10.3 39 185-223 169-210 (269)
184 PF07385 DUF1498: Protein of u 63.5 19 0.00041 32.7 5.7 45 88-135 122-168 (225)
185 PF06156 DUF972: Protein of un 63.4 32 0.00068 27.6 6.5 47 192-238 8-54 (107)
186 COG1730 GIM5 Predicted prefold 63.3 92 0.002 26.3 10.4 32 189-220 10-41 (145)
187 PF01920 Prefoldin_2: Prefoldi 63.2 65 0.0014 24.6 8.4 33 187-219 7-39 (106)
188 PRK11519 tyrosine kinase; Prov 63.0 71 0.0015 33.7 10.9 16 235-250 347-362 (719)
189 PF07200 Mod_r: Modifier of ru 62.8 45 0.00098 27.6 7.8 85 190-277 39-136 (150)
190 PRK09841 cryptic autophosphory 62.6 2E+02 0.0044 30.3 14.2 28 240-267 369-396 (726)
191 KOG0288|consensus 62.6 1.4E+02 0.003 29.7 11.8 42 184-225 26-67 (459)
192 PF14197 Cep57_CLD_2: Centroso 62.6 59 0.0013 23.9 8.9 17 206-222 5-21 (69)
193 TIGR03185 DNA_S_dndD DNA sulfu 62.2 72 0.0016 33.2 10.6 7 102-108 101-107 (650)
194 PF13870 DUF4201: Domain of un 62.1 1E+02 0.0022 26.4 10.8 22 232-253 96-117 (177)
195 PF09787 Golgin_A5: Golgin sub 62.0 98 0.0021 31.3 11.3 71 208-278 276-346 (511)
196 PF09744 Jnk-SapK_ap_N: JNK_SA 61.9 76 0.0016 27.2 9.0 50 204-253 87-136 (158)
197 PF14915 CCDC144C: CCDC144C pr 61.7 1E+02 0.0023 29.2 10.4 43 221-263 71-113 (305)
198 PRK04863 mukB cell division pr 61.5 92 0.002 35.9 11.9 47 232-278 374-420 (1486)
199 PF10212 TTKRSYEDQ: Predicted 61.5 1.5E+02 0.0034 30.1 12.3 91 185-275 416-514 (518)
200 TIGR01010 BexC_CtrB_KpsE polys 61.5 1.5E+02 0.0033 28.2 12.5 20 231-250 246-265 (362)
201 KOG4657|consensus 60.2 1.2E+02 0.0027 27.6 10.2 33 205-237 92-124 (246)
202 COG4372 Uncharacterized protei 60.2 1.8E+02 0.004 28.7 12.6 12 123-134 35-46 (499)
203 KOG4005|consensus 59.8 1.1E+02 0.0024 28.2 9.9 24 204-227 95-118 (292)
204 PF09755 DUF2046: Uncharacteri 59.7 1.6E+02 0.0035 28.0 11.8 55 214-268 114-169 (310)
205 COG2900 SlyX Uncharacterized p 59.4 72 0.0016 23.8 8.1 26 213-238 8-33 (72)
206 KOG2391|consensus 59.1 29 0.00062 33.4 6.4 11 162-172 207-217 (365)
207 PF15372 DUF4600: Domain of un 59.1 1E+02 0.0023 25.6 9.2 40 238-277 69-108 (129)
208 KOG4196|consensus 59.1 1.1E+02 0.0023 25.6 9.6 72 191-263 46-117 (135)
209 PF03173 CHB_HEX: Putative car 58.9 24 0.00051 30.4 5.4 18 117-134 81-99 (164)
210 KOG3433|consensus 58.7 1.2E+02 0.0026 26.8 9.6 112 155-270 40-159 (203)
211 PF10226 DUF2216: Uncharacteri 58.7 1.1E+02 0.0024 27.1 9.5 39 233-271 107-145 (195)
212 PF09478 CBM49: Carbohydrate b 58.6 26 0.00056 26.1 5.0 30 100-133 45-75 (80)
213 PF14263 DUF4354: Domain of un 58.6 27 0.00058 28.8 5.4 44 88-133 53-102 (124)
214 KOG4674|consensus 58.6 92 0.002 36.4 11.1 72 206-279 1243-1314(1822)
215 cd00632 Prefoldin_beta Prefold 58.4 87 0.0019 24.4 10.2 18 233-250 76-93 (105)
216 PRK12704 phosphodiesterase; Pr 58.4 1.7E+02 0.0038 29.7 12.4 20 240-259 116-135 (520)
217 COG3524 KpsE Capsule polysacch 58.4 1.8E+02 0.0038 28.0 12.0 30 231-260 255-293 (372)
218 KOG3156|consensus 58.3 76 0.0016 28.6 8.5 36 194-229 103-139 (220)
219 PF10498 IFT57: Intra-flagella 58.2 1.1E+02 0.0023 29.8 10.3 33 234-266 301-336 (359)
220 KOG2751|consensus 57.9 99 0.0021 30.8 10.0 70 204-273 181-250 (447)
221 PRK14692 lagellar hook-associa 57.8 2.5E+02 0.0054 30.1 13.6 93 121-228 613-710 (749)
222 TIGR03007 pepcterm_ChnLen poly 57.4 1.1E+02 0.0025 30.2 10.8 23 229-251 326-348 (498)
223 PF03962 Mnd1: Mnd1 family; I 57.2 71 0.0015 28.0 8.2 9 160-168 33-41 (188)
224 PF14874 PapD-like: Flagellar- 57.2 30 0.00064 26.4 5.3 45 86-134 24-68 (102)
225 PF04849 HAP1_N: HAP1 N-termin 56.7 1.1E+02 0.0024 29.1 9.8 47 228-274 235-281 (306)
226 PF04728 LPP: Lipoprotein leuc 56.7 69 0.0015 22.8 8.0 12 242-253 39-50 (56)
227 KOG4360|consensus 56.4 1.7E+02 0.0036 29.9 11.4 11 205-215 239-249 (596)
228 PF14197 Cep57_CLD_2: Centroso 56.2 79 0.0017 23.2 8.4 12 237-248 50-61 (69)
229 PF05622 HOOK: HOOK protein; 55.6 3.9 8.4E-05 43.0 0.0 123 155-277 285-420 (713)
230 PRK08412 flgL flagellar hook-a 55.6 2.1E+02 0.0045 31.0 12.7 78 71-170 639-721 (827)
231 PF15233 SYCE1: Synaptonemal c 55.6 1.2E+02 0.0026 25.2 10.7 38 186-223 21-58 (134)
232 PF07200 Mod_r: Modifier of ru 55.5 1.2E+02 0.0026 25.1 12.8 56 205-260 33-88 (150)
233 PRK13729 conjugal transfer pil 55.4 29 0.00062 34.9 6.0 7 166-172 57-63 (475)
234 PF08172 CASP_C: CASP C termin 55.3 79 0.0017 29.1 8.5 48 211-258 84-137 (248)
235 PF09730 BicD: Microtubule-ass 55.1 1.3E+02 0.0027 32.1 10.9 30 187-216 36-65 (717)
236 PF10174 Cast: RIM-binding pro 55.0 2.7E+02 0.0059 29.9 13.5 40 184-223 293-332 (775)
237 PF10458 Val_tRNA-synt_C: Valy 55.0 76 0.0017 22.7 7.6 46 206-251 4-63 (66)
238 PRK14160 heat shock protein Gr 54.9 1.2E+02 0.0026 27.3 9.4 18 204-221 73-90 (211)
239 PF12128 DUF3584: Protein of u 54.9 76 0.0016 35.6 9.9 12 265-276 695-706 (1201)
240 KOG0995|consensus 54.9 1.7E+02 0.0036 30.3 11.2 25 63-87 93-122 (581)
241 PF06637 PV-1: PV-1 protein (P 54.8 2.2E+02 0.0047 28.1 11.5 69 151-219 241-312 (442)
242 PF15066 CAGE1: Cancer-associa 54.7 1.5E+02 0.0032 30.0 10.6 35 192-226 317-351 (527)
243 PF04108 APG17: Autophagy prot 54.7 1.9E+02 0.0041 28.4 11.7 52 225-279 302-353 (412)
244 PF05266 DUF724: Protein of un 54.6 1.5E+02 0.0033 26.1 11.3 9 204-212 91-99 (190)
245 KOG4807|consensus 54.6 1.9E+02 0.0041 28.7 11.1 22 257-278 553-574 (593)
246 TIGR02338 gimC_beta prefoldin, 54.4 1.1E+02 0.0023 24.2 9.9 37 189-225 7-43 (110)
247 PF08181 DegQ: DegQ (SacQ) fam 54.1 54 0.0012 21.8 5.1 34 206-239 4-37 (46)
248 PF05667 DUF812: Protein of un 53.2 70 0.0015 33.2 8.6 28 193-220 329-356 (594)
249 cd04776 HTH_GnyR Helix-Turn-He 53.2 1.1E+02 0.0024 24.5 8.2 31 214-244 81-111 (118)
250 KOG4571|consensus 53.1 32 0.0007 32.4 5.6 32 201-232 250-281 (294)
251 PLN03188 kinesin-12 family pro 53.0 2.4E+02 0.0052 32.0 12.8 23 256-278 1230-1252(1320)
252 PF14257 DUF4349: Domain of un 53.0 87 0.0019 28.5 8.5 14 239-252 167-180 (262)
253 PRK13169 DNA replication intia 52.9 59 0.0013 26.2 6.4 44 194-237 10-53 (110)
254 PF09006 Surfac_D-trimer: Lung 52.9 49 0.0011 22.6 5.0 25 209-233 2-26 (46)
255 PF14282 FlxA: FlxA-like prote 52.9 64 0.0014 25.6 6.6 6 190-195 31-36 (106)
256 COG4946 Uncharacterized protei 52.8 43 0.00093 33.9 6.6 65 70-137 398-477 (668)
257 KOG4360|consensus 52.8 2.2E+02 0.0047 29.2 11.5 30 200-229 213-242 (596)
258 KOG4848|consensus 52.7 1.7E+02 0.0037 26.1 12.1 48 218-265 137-184 (225)
259 PHA02047 phage lambda Rz1-like 52.5 69 0.0015 25.3 6.4 33 186-218 28-60 (101)
260 TIGR03319 YmdA_YtgF conserved 52.2 2.6E+02 0.0056 28.5 12.4 18 240-257 110-127 (514)
261 PF04582 Reo_sigmaC: Reovirus 52.2 26 0.00057 33.5 5.0 73 198-270 83-155 (326)
262 PF13514 AAA_27: AAA domain 51.9 81 0.0018 35.0 9.4 42 235-276 934-975 (1111)
263 PF15188 CCDC-167: Coiled-coil 51.8 84 0.0018 24.2 6.8 28 225-252 41-68 (85)
264 PF07246 Phlebovirus_NSM: Phle 51.6 76 0.0016 29.5 7.7 42 225-266 207-248 (264)
265 KOG1029|consensus 51.4 1.1E+02 0.0025 32.8 9.7 8 79-86 238-245 (1118)
266 TIGR00998 8a0101 efflux pump m 51.4 1.1E+02 0.0024 28.3 9.2 16 205-220 107-122 (334)
267 PF05667 DUF812: Protein of un 51.4 88 0.0019 32.5 9.0 36 186-221 329-364 (594)
268 PRK15396 murein lipoprotein; P 51.4 1E+02 0.0023 23.3 7.9 18 207-224 26-43 (78)
269 PF09304 Cortex-I_coil: Cortex 51.4 1.3E+02 0.0028 24.2 11.9 36 204-239 35-70 (107)
270 PF00635 Motile_Sperm: MSP (Ma 51.3 31 0.00066 26.4 4.6 43 86-134 22-64 (109)
271 TIGR01010 BexC_CtrB_KpsE polys 51.2 2.2E+02 0.0048 27.0 11.4 22 182-203 181-202 (362)
272 PRK15178 Vi polysaccharide exp 51.0 2.5E+02 0.0054 28.1 11.7 23 207-229 315-337 (434)
273 PF03962 Mnd1: Mnd1 family; I 50.7 1.2E+02 0.0025 26.7 8.5 38 214-251 111-152 (188)
274 KOG0980|consensus 50.4 2.2E+02 0.0047 31.0 11.6 8 215-222 468-475 (980)
275 PF10506 MCC-bdg_PDZ: PDZ doma 50.3 1E+02 0.0022 22.7 8.4 60 204-263 3-65 (67)
276 PRK05771 V-type ATP synthase s 50.3 2.4E+02 0.0052 29.3 12.1 50 229-278 217-267 (646)
277 KOG0249|consensus 50.3 1.2E+02 0.0026 32.3 9.5 16 262-277 285-300 (916)
278 PF05701 WEMBL: Weak chloropla 50.1 1.4E+02 0.003 30.4 10.1 36 184-219 294-329 (522)
279 PHA01750 hypothetical protein 50.0 75 0.0016 23.4 5.9 40 185-226 30-69 (75)
280 PRK10361 DNA recombination pro 49.8 1.5E+02 0.0032 30.1 10.0 24 204-227 65-88 (475)
281 PF04799 Fzo_mitofusin: fzo-li 49.7 90 0.002 27.2 7.5 11 205-215 126-136 (171)
282 KOG0964|consensus 49.7 1.7E+02 0.0037 32.3 10.8 28 75-111 77-104 (1200)
283 KOG3809|consensus 49.5 85 0.0018 31.4 8.0 19 260-278 544-562 (583)
284 TIGR02894 DNA_bind_RsfA transc 49.4 92 0.002 26.9 7.4 8 185-192 57-64 (161)
285 KOG4677|consensus 48.9 1.4E+02 0.003 30.0 9.4 75 205-279 308-386 (554)
286 TIGR02680 conserved hypothetic 48.8 2.6E+02 0.0056 32.0 12.9 21 11-31 565-585 (1353)
287 KOG1029|consensus 48.8 3.9E+02 0.0083 29.0 13.2 12 86-97 196-207 (1118)
288 PF05010 TACC: Transforming ac 48.8 2E+02 0.0043 25.7 11.2 51 229-279 120-171 (207)
289 TIGR03017 EpsF chain length de 48.7 2.2E+02 0.0047 27.7 11.0 26 240-265 341-366 (444)
290 PF08232 Striatin: Striatin fa 48.7 85 0.0018 26.0 7.0 45 205-249 24-68 (134)
291 KOG2077|consensus 48.6 79 0.0017 32.7 7.8 50 204-253 327-376 (832)
292 TIGR02977 phageshock_pspA phag 48.5 1.3E+02 0.0029 26.7 8.7 65 187-251 87-151 (219)
293 PF15254 CCDC14: Coiled-coil d 48.5 2.1E+02 0.0046 30.7 11.1 45 240-284 454-498 (861)
294 PF06034 DUF919: Nucleopolyhed 48.4 62 0.0013 23.5 5.3 49 201-249 3-53 (62)
295 PF05278 PEARLI-4: Arabidopsis 48.3 1.3E+02 0.0029 28.0 8.8 58 191-248 192-249 (269)
296 TIGR00998 8a0101 efflux pump m 48.1 1E+02 0.0022 28.6 8.3 24 204-227 99-122 (334)
297 PF09738 DUF2051: Double stran 47.7 1E+02 0.0022 29.3 8.1 57 195-251 108-164 (302)
298 PF15456 Uds1: Up-regulated Du 47.6 1.6E+02 0.0034 24.2 8.7 33 191-224 21-53 (124)
299 PF10211 Ax_dynein_light: Axon 47.0 2E+02 0.0043 25.2 10.4 22 116-137 6-27 (189)
300 PRK09343 prefoldin subunit bet 46.7 1.6E+02 0.0034 23.9 9.3 19 233-251 77-95 (121)
301 COG4985 ABC-type phosphate tra 46.2 1.4E+02 0.0031 27.4 8.4 66 145-216 129-196 (289)
302 PF04645 DUF603: Protein of un 46.2 91 0.002 27.2 6.8 85 186-270 76-176 (181)
303 PF11544 Spc42p: Spindle pole 45.9 65 0.0014 24.3 5.2 39 185-223 12-50 (76)
304 PF03961 DUF342: Protein of un 45.9 1E+02 0.0022 30.5 8.3 14 121-134 208-221 (451)
305 PF08172 CASP_C: CASP C termin 45.6 63 0.0014 29.7 6.3 44 184-227 92-135 (248)
306 PRK11519 tyrosine kinase; Prov 45.5 3.9E+02 0.0085 28.2 14.2 6 126-131 195-200 (719)
307 PF04899 MbeD_MobD: MbeD/MobD 45.3 1.2E+02 0.0027 22.4 8.0 10 187-196 8-17 (70)
308 PF15030 DUF4527: Protein of u 45.0 1.9E+02 0.0041 26.8 9.0 12 267-278 121-132 (277)
309 PF05384 DegS: Sensor protein 45.0 2E+02 0.0043 24.7 11.7 79 184-262 26-122 (159)
310 cd04779 HTH_MerR-like_sg4 Heli 44.6 1.6E+02 0.0035 24.3 8.0 50 191-240 59-108 (134)
311 TIGR02231 conserved hypothetic 44.0 1.8E+02 0.0038 29.4 9.8 29 232-260 143-171 (525)
312 TIGR03545 conserved hypothetic 43.9 1.4E+02 0.003 30.8 9.0 32 224-255 223-254 (555)
313 PF07705 CARDB: CARDB; InterP 43.7 59 0.0013 24.0 5.0 43 87-134 24-66 (101)
314 PRK04778 septation ring format 43.6 3.1E+02 0.0068 28.1 11.6 7 207-213 356-362 (569)
315 KOG0982|consensus 43.5 3.2E+02 0.0069 27.4 10.9 64 207-270 298-368 (502)
316 PF04508 Pox_A_type_inc: Viral 43.4 38 0.00082 19.7 2.8 13 209-221 4-16 (23)
317 KOG1760|consensus 43.1 1.9E+02 0.0042 23.9 9.6 32 190-221 21-52 (131)
318 PF01576 Myosin_tail_1: Myosin 43.0 8 0.00017 41.7 0.0 98 184-281 10-114 (859)
319 PF10779 XhlA: Haemolysin XhlA 43.0 1.3E+02 0.0028 21.9 7.4 37 206-242 13-49 (71)
320 KOG4571|consensus 42.8 1.1E+02 0.0024 28.8 7.4 36 217-252 252-287 (294)
321 PF04977 DivIC: Septum formati 42.8 46 0.001 24.0 4.1 10 210-219 28-37 (80)
322 PF05529 Bap31: B-cell recepto 42.5 1.6E+02 0.0034 25.5 8.1 30 228-257 162-191 (192)
323 PF08657 DASH_Spc34: DASH comp 42.3 1E+02 0.0022 28.6 7.1 36 189-224 177-212 (259)
324 COG3879 Uncharacterized protei 42.3 93 0.002 28.7 6.7 29 210-238 54-82 (247)
325 PF09730 BicD: Microtubule-ass 42.1 2.3E+02 0.005 30.2 10.4 38 238-275 80-117 (717)
326 PF05546 She9_MDM33: She9 / Md 42.0 2.1E+02 0.0045 25.7 8.7 83 188-277 12-97 (207)
327 COG3879 Uncharacterized protei 41.9 1.8E+02 0.0039 26.9 8.5 31 203-233 54-84 (247)
328 COG3206 GumC Uncharacterized p 41.9 3.5E+02 0.0077 26.6 11.7 56 205-260 341-399 (458)
329 PF09787 Golgin_A5: Golgin sub 41.8 3E+02 0.0064 27.9 11.0 22 205-226 287-308 (511)
330 PF04012 PspA_IM30: PspA/IM30 41.7 2.4E+02 0.0053 24.7 12.1 16 204-219 56-71 (221)
331 COG2900 SlyX Uncharacterized p 41.6 1.5E+02 0.0032 22.2 7.3 36 217-252 5-40 (72)
332 PF00553 CBM_2: Cellulose bind 41.6 91 0.002 24.1 5.8 26 83-110 11-39 (101)
333 TIGR02231 conserved hypothetic 41.6 1.9E+02 0.004 29.2 9.5 41 211-251 129-169 (525)
334 KOG3091|consensus 41.5 3E+02 0.0065 28.0 10.6 38 186-223 356-393 (508)
335 cd07909 YciF YciF bacterial st 41.3 1.1E+02 0.0024 25.8 6.6 12 268-279 104-115 (147)
336 PF07139 DUF1387: Protein of u 41.2 3.2E+02 0.007 26.0 12.1 41 240-280 242-284 (302)
337 PF15188 CCDC-167: Coiled-coil 40.9 1.5E+02 0.0033 22.8 6.7 46 206-251 5-53 (85)
338 KOG0243|consensus 40.8 2.7E+02 0.0059 30.9 10.9 10 76-85 274-283 (1041)
339 PF04859 DUF641: Plant protein 40.6 1E+02 0.0022 25.6 6.2 44 224-267 84-127 (131)
340 PRK10636 putative ABC transpor 40.3 1.2E+02 0.0026 31.5 8.1 57 206-262 563-626 (638)
341 PF01576 Myosin_tail_1: Myosin 40.0 9.5 0.00021 41.1 0.0 83 197-279 417-506 (859)
342 PF07407 Seadorna_VP6: Seadorn 39.9 1.8E+02 0.004 28.1 8.4 32 191-222 31-62 (420)
343 PF15070 GOLGA2L5: Putative go 39.8 2.9E+02 0.0063 28.9 10.7 27 233-259 201-227 (617)
344 KOG0999|consensus 39.7 2.8E+02 0.006 28.8 10.1 26 242-267 122-147 (772)
345 KOG0946|consensus 39.7 4.3E+02 0.0093 28.7 11.7 43 206-248 671-713 (970)
346 PF10779 XhlA: Haemolysin XhlA 39.5 1.5E+02 0.0032 21.6 7.4 43 208-250 8-50 (71)
347 KOG0709|consensus 39.2 66 0.0014 32.3 5.6 48 190-237 270-317 (472)
348 PHA03011 hypothetical protein; 39.2 1.7E+02 0.0037 23.4 6.8 47 204-250 69-115 (120)
349 PRK10246 exonuclease subunit S 38.9 4.2E+02 0.009 29.4 12.3 18 270-287 492-510 (1047)
350 PF04314 DUF461: Protein of un 38.9 1.2E+02 0.0026 23.9 6.2 78 86-172 18-105 (110)
351 smart00502 BBC B-Box C-termina 38.9 1.8E+02 0.0039 22.3 8.4 19 206-224 21-39 (127)
352 KOG3091|consensus 38.8 2.6E+02 0.0057 28.4 9.7 26 187-212 343-368 (508)
353 PF05483 SCP-1: Synaptonemal c 38.6 5.1E+02 0.011 27.6 12.0 40 229-268 631-670 (786)
354 KOG0981|consensus 38.6 44 0.00095 34.5 4.4 18 155-172 612-629 (759)
355 KOG0239|consensus 38.4 3.2E+02 0.0068 28.9 10.8 14 11-24 16-29 (670)
356 PF07544 Med9: RNA polymerase 38.3 75 0.0016 24.0 4.7 18 206-223 28-45 (83)
357 PF14326 DUF4384: Domain of un 38.2 1.3E+02 0.0027 22.4 6.0 31 64-94 9-39 (83)
358 PF07716 bZIP_2: Basic region 38.1 1.1E+02 0.0024 20.9 5.2 21 204-224 30-50 (54)
359 PF05008 V-SNARE: Vesicle tran 37.9 93 0.002 22.7 5.2 55 198-252 24-79 (79)
360 KOG0243|consensus 37.8 3.4E+02 0.0074 30.2 11.1 25 238-262 487-511 (1041)
361 PF08702 Fib_alpha: Fibrinogen 37.8 2.2E+02 0.0047 24.0 7.9 29 226-254 103-131 (146)
362 PF08581 Tup_N: Tup N-terminal 37.8 1.8E+02 0.0039 22.0 10.2 18 263-280 58-75 (79)
363 KOG0999|consensus 37.6 2.2E+02 0.0047 29.5 9.0 38 238-275 153-190 (772)
364 PF15035 Rootletin: Ciliary ro 37.4 2.8E+02 0.0061 24.2 9.0 11 187-197 28-38 (182)
365 PF06698 DUF1192: Protein of u 37.3 1.3E+02 0.0028 21.6 5.4 22 208-229 23-44 (59)
366 KOG0982|consensus 37.2 4.4E+02 0.0096 26.4 12.2 11 209-219 314-324 (502)
367 PF15619 Lebercilin: Ciliary p 37.1 2.9E+02 0.0064 24.3 11.9 27 193-219 13-39 (194)
368 KOG0994|consensus 37.1 1.6E+02 0.0034 33.3 8.3 15 6-20 1419-1433(1758)
369 KOG0612|consensus 37.0 4E+02 0.0087 30.2 11.4 17 9-26 287-303 (1317)
370 COG1730 GIM5 Predicted prefold 37.0 2.6E+02 0.0056 23.6 10.6 32 193-224 7-38 (145)
371 PF10174 Cast: RIM-binding pro 37.0 4.3E+02 0.0093 28.5 11.6 25 195-219 318-342 (775)
372 PRK00106 hypothetical protein; 36.9 4.9E+02 0.011 26.8 12.4 26 235-260 126-151 (535)
373 KOG4674|consensus 36.7 2.8E+02 0.006 32.7 10.6 48 185-232 199-250 (1822)
374 KOG4593|consensus 36.7 5.5E+02 0.012 27.3 12.4 32 242-273 238-269 (716)
375 PRK05431 seryl-tRNA synthetase 36.6 2E+02 0.0044 28.4 8.7 62 191-252 27-98 (425)
376 cd07602 BAR_RhoGAP_OPHN1-like 36.3 3E+02 0.0065 24.6 8.9 21 203-223 6-26 (207)
377 PF06483 ChiC: Chitinase C; I 36.2 77 0.0017 27.8 5.0 22 113-134 116-137 (180)
378 PF05010 TACC: Transforming ac 36.1 3.2E+02 0.0069 24.5 11.3 30 194-223 25-54 (207)
379 PF14257 DUF4349: Domain of un 36.0 1.1E+02 0.0024 27.7 6.4 10 99-108 74-83 (262)
380 PF07160 DUF1395: Protein of u 35.9 2.6E+02 0.0056 25.6 8.7 24 206-229 22-45 (243)
381 PF12128 DUF3584: Protein of u 35.9 1.9E+02 0.0041 32.5 9.3 27 230-256 681-707 (1201)
382 PRK00888 ftsB cell division pr 35.9 54 0.0012 26.0 3.8 17 206-222 41-57 (105)
383 KOG0998|consensus 35.8 98 0.0021 33.5 6.7 78 205-282 504-584 (847)
384 PRK14143 heat shock protein Gr 35.7 2.2E+02 0.0048 26.0 8.2 32 197-228 72-103 (238)
385 PF07334 IFP_35_N: Interferon- 35.4 82 0.0018 23.8 4.4 23 239-261 12-34 (76)
386 COG5633 Predicted periplasmic 35.4 1.2E+02 0.0027 24.8 5.7 61 76-137 47-109 (123)
387 PRK14148 heat shock protein Gr 35.3 2.8E+02 0.0062 24.5 8.6 25 201-225 49-73 (195)
388 PF06280 DUF1034: Fn3-like dom 35.2 91 0.002 24.4 5.0 15 120-134 61-75 (112)
389 PF08308 PEGA: PEGA domain; I 35.1 1.3E+02 0.0029 21.2 5.5 51 81-135 15-65 (71)
390 PRK10698 phage shock protein P 34.6 2.6E+02 0.0056 25.1 8.4 10 239-248 171-180 (222)
391 PLN02678 seryl-tRNA synthetase 34.5 2.7E+02 0.0059 27.8 9.2 62 191-252 32-103 (448)
392 PF02994 Transposase_22: L1 tr 34.4 1.6E+02 0.0036 28.5 7.6 23 204-226 142-164 (370)
393 PF10046 BLOC1_2: Biogenesis o 34.4 2.2E+02 0.0048 22.1 11.9 29 235-264 67-95 (99)
394 TIGR00414 serS seryl-tRNA synt 34.3 2.3E+02 0.0051 27.9 8.7 25 229-253 78-102 (418)
395 PF10168 Nup88: Nuclear pore c 34.3 6E+02 0.013 27.1 12.4 23 201-223 581-603 (717)
396 PRK06798 fliD flagellar cappin 34.2 3.6E+02 0.0077 26.8 10.0 50 203-252 376-429 (440)
397 PLN02939 transferase, transfer 34.2 2.7E+02 0.0059 30.8 9.7 53 209-261 327-385 (977)
398 PF09766 FimP: Fms-interacting 34.1 3E+02 0.0065 26.5 9.3 45 232-280 89-133 (355)
399 PRK12705 hypothetical protein; 34.1 5.3E+02 0.011 26.4 13.1 24 206-229 70-93 (508)
400 PF04582 Reo_sigmaC: Reovirus 33.9 14 0.0003 35.4 0.0 54 193-246 64-117 (326)
401 PF03961 DUF342: Protein of un 33.8 2.2E+02 0.0048 28.1 8.5 19 117-135 221-239 (451)
402 KOG3809|consensus 33.7 2.8E+02 0.006 28.0 8.8 15 248-262 549-563 (583)
403 PRK00888 ftsB cell division pr 33.6 1E+02 0.0022 24.4 5.0 23 207-229 35-57 (105)
404 KOG4673|consensus 33.4 3.6E+02 0.0077 28.8 9.9 66 190-255 688-767 (961)
405 cd07636 BAR_GRAF The Bin/Amphi 33.4 3E+02 0.0064 24.7 8.4 25 201-225 4-28 (207)
406 PF02994 Transposase_22: L1 tr 33.4 1.6E+02 0.0034 28.6 7.2 25 210-234 141-165 (370)
407 PF09177 Syntaxin-6_N: Syntaxi 33.4 1.3E+02 0.0027 23.2 5.4 21 209-229 42-62 (97)
408 PF12126 DUF3583: Protein of u 33.3 4.3E+02 0.0094 25.1 11.6 35 231-265 72-111 (324)
409 PF05700 BCAS2: Breast carcino 33.3 3.5E+02 0.0076 24.1 10.6 11 11-21 27-37 (221)
410 PRK06975 bifunctional uroporph 33.2 2.7E+02 0.0059 29.2 9.4 74 212-285 377-454 (656)
411 PRK03947 prefoldin subunit alp 33.2 1.9E+02 0.0041 23.6 6.8 42 208-249 96-137 (140)
412 TIGR03495 phage_LysB phage lys 33.1 2.9E+02 0.0063 23.1 9.9 33 185-217 26-58 (135)
413 PF04859 DUF641: Plant protein 32.7 1.8E+02 0.0039 24.2 6.4 44 204-247 78-121 (131)
414 KOG3433|consensus 32.5 3.6E+02 0.0077 23.9 9.1 66 203-268 120-185 (203)
415 PF13874 Nup54: Nucleoporin co 32.4 2.4E+02 0.0052 23.3 7.3 21 204-224 70-90 (141)
416 PF06030 DUF916: Bacterial pro 32.4 1.2E+02 0.0027 24.5 5.4 18 118-135 82-99 (121)
417 KOG0979|consensus 32.4 4E+02 0.0086 29.6 10.4 139 125-275 590-729 (1072)
418 PF07610 DUF1573: Protein of u 32.3 1.5E+02 0.0032 19.5 5.3 40 89-134 3-42 (45)
419 TIGR02449 conserved hypothetic 32.2 2E+02 0.0044 21.0 7.9 61 200-260 1-61 (65)
420 PF06657 Cep57_MT_bd: Centroso 32.1 2.2E+02 0.0048 21.4 6.7 14 249-262 54-67 (79)
421 COG2959 HemX Uncharacterized e 32.1 3.7E+02 0.008 26.4 9.3 77 209-285 86-167 (391)
422 TIGR00634 recN DNA repair prot 32.1 4.5E+02 0.0098 26.8 10.7 17 103-119 95-111 (563)
423 PRK14011 prefoldin subunit alp 32.0 3.1E+02 0.0067 23.1 10.2 38 186-223 4-41 (144)
424 COG3096 MukB Uncharacterized p 32.0 5.7E+02 0.012 27.9 11.1 30 233-262 1079-1108(1480)
425 PF05546 She9_MDM33: She9 / Md 31.9 3.8E+02 0.0082 24.1 10.1 24 191-214 31-54 (207)
426 PRK10476 multidrug resistance 31.8 3.9E+02 0.0086 25.0 9.6 55 206-260 114-178 (346)
427 KOG0249|consensus 31.8 2.9E+02 0.0062 29.6 8.9 18 183-200 168-185 (916)
428 PF06698 DUF1192: Protein of u 31.5 89 0.0019 22.4 3.9 22 231-252 25-46 (59)
429 KOG3850|consensus 31.5 5.1E+02 0.011 25.6 10.1 21 7-27 98-118 (455)
430 PF10883 DUF2681: Protein of u 31.4 92 0.002 24.1 4.2 51 207-257 31-81 (87)
431 PRK11281 hypothetical protein; 31.3 4.2E+02 0.0092 29.8 10.8 15 267-281 204-218 (1113)
432 PF11853 DUF3373: Protein of u 31.3 49 0.0011 33.5 3.4 23 214-236 32-54 (489)
433 PF11518 DUF3221: Protein of u 31.2 27 0.00059 27.9 1.3 23 124-148 78-100 (108)
434 PLN02320 seryl-tRNA synthetase 31.2 2.8E+02 0.006 28.3 8.7 64 191-254 92-164 (502)
435 KOG4348|consensus 31.1 2E+02 0.0043 28.9 7.4 16 262-277 608-623 (627)
436 PF07439 DUF1515: Protein of u 31.0 2.7E+02 0.0058 22.5 6.8 20 231-250 44-63 (112)
437 PRK13923 putative spore coat p 31.0 3.6E+02 0.0078 23.5 10.9 52 227-283 111-162 (170)
438 PF15619 Lebercilin: Ciliary p 30.9 3.7E+02 0.0081 23.7 12.3 11 265-275 174-184 (194)
439 PF06548 Kinesin-related: Kine 30.9 5.7E+02 0.012 25.8 12.6 25 255-279 459-483 (488)
440 KOG3647|consensus 30.9 2.8E+02 0.0061 26.1 7.9 10 188-197 56-65 (338)
441 PRK14127 cell division protein 30.6 85 0.0019 25.3 4.1 18 209-226 47-64 (109)
442 TIGR01069 mutS2 MutS2 family p 30.5 5.7E+02 0.012 27.4 11.4 16 187-202 513-528 (771)
443 KOG0994|consensus 30.5 2.1E+02 0.0046 32.3 8.0 70 188-257 1685-1754(1758)
444 PF10168 Nup88: Nuclear pore c 30.4 5.3E+02 0.011 27.5 11.0 81 173-253 567-665 (717)
445 KOG4680|consensus 30.4 2E+02 0.0044 24.3 6.3 67 72-152 34-124 (153)
446 KOG0612|consensus 30.3 4.8E+02 0.01 29.6 10.7 85 189-273 462-547 (1317)
447 COG5293 Predicted ATPase [Gene 30.3 6E+02 0.013 25.9 12.4 107 164-279 295-403 (591)
448 PF00769 ERM: Ezrin/radixin/mo 30.1 4.3E+02 0.0092 24.1 9.2 80 195-274 8-87 (246)
449 PF10018 Med4: Vitamin-D-recep 30.0 2.1E+02 0.0046 24.8 6.9 54 186-239 9-62 (188)
450 PF07889 DUF1664: Protein of u 30.0 3.2E+02 0.0069 22.6 11.8 84 183-272 37-120 (126)
451 PF11855 DUF3375: Protein of u 30.0 4.5E+02 0.0098 26.4 10.1 74 183-256 120-214 (478)
452 PF09738 DUF2051: Double stran 30.0 4.9E+02 0.011 24.7 9.8 85 194-278 79-163 (302)
453 PF05308 Mito_fiss_reg: Mitoch 30.0 56 0.0012 30.2 3.3 21 234-254 122-142 (253)
454 TIGR02449 conserved hypothetic 29.9 2.2E+02 0.0048 20.8 8.6 64 207-270 1-64 (65)
455 PRK11677 hypothetical protein; 29.8 3.1E+02 0.0068 22.8 7.5 47 213-262 29-75 (134)
456 TIGR00219 mreC rod shape-deter 29.7 1.8E+02 0.0038 27.1 6.7 45 220-264 66-110 (283)
457 KOG4809|consensus 29.7 3.4E+02 0.0074 28.1 8.9 73 188-260 327-399 (654)
458 PF14723 SSFA2_C: Sperm-specif 29.6 3.9E+02 0.0083 23.4 9.3 73 187-259 100-177 (179)
459 KOG0979|consensus 29.6 8.3E+02 0.018 27.3 12.6 113 175-287 245-357 (1072)
460 PF03148 Tektin: Tektin family 29.6 4.5E+02 0.0097 25.5 9.7 86 166-251 210-296 (384)
461 TIGR00414 serS seryl-tRNA synt 29.4 3.6E+02 0.0078 26.6 9.1 71 191-261 29-103 (418)
462 PF07902 Gp58: gp58-like prote 29.3 1.7E+02 0.0036 30.5 6.8 74 200-274 262-335 (601)
463 KOG3335|consensus 29.2 86 0.0019 27.5 4.1 58 186-248 91-148 (181)
464 PF09728 Taxilin: Myosin-like 29.2 5E+02 0.011 24.5 10.5 84 187-270 221-308 (309)
465 PF05531 NPV_P10: Nucleopolyhe 29.1 1.5E+02 0.0033 22.3 4.9 53 191-243 10-65 (75)
466 PRK13182 racA polar chromosome 29.1 2.1E+02 0.0046 24.8 6.6 57 194-250 87-148 (175)
467 PRK00409 recombination and DNA 29.1 6E+02 0.013 27.3 11.3 84 174-259 507-595 (782)
468 PF10458 Val_tRNA-synt_C: Valy 29.0 2.1E+02 0.0046 20.4 5.6 44 204-247 9-66 (66)
469 PRK06975 bifunctional uroporph 29.0 2.7E+02 0.0058 29.2 8.5 64 197-260 344-411 (656)
470 PF06339 Ectoine_synth: Ectoin 28.9 86 0.0019 25.9 3.9 41 93-134 11-51 (126)
471 PF14193 DUF4315: Domain of un 28.9 1.8E+02 0.0038 22.3 5.4 55 194-248 3-62 (83)
472 PF06156 DUF972: Protein of un 28.7 2E+02 0.0044 22.9 6.0 46 234-279 8-53 (107)
473 COG3685 Uncharacterized protei 28.5 3.9E+02 0.0085 23.2 9.5 96 182-278 9-120 (167)
474 PF12709 Kinetocho_Slk19: Cent 28.3 2.7E+02 0.0059 21.5 6.3 41 223-263 45-85 (87)
475 PF07926 TPR_MLP1_2: TPR/MLP1/ 28.3 3.3E+02 0.0071 22.2 10.5 75 204-278 1-75 (132)
476 PRK14158 heat shock protein Gr 28.3 3.8E+02 0.0081 23.8 8.1 60 210-269 37-96 (194)
477 PF14931 IFT20: Intraflagellar 28.3 3.3E+02 0.0071 22.2 11.5 92 183-274 18-120 (120)
478 cd00632 Prefoldin_beta Prefold 28.2 2.8E+02 0.0062 21.5 9.6 74 187-260 1-96 (105)
479 cd07597 BAR_SNX8 The Bin/Amphi 28.2 2.6E+02 0.0057 25.3 7.4 115 141-257 70-196 (246)
480 KOG3119|consensus 28.2 3.9E+02 0.0085 24.7 8.7 66 202-267 190-255 (269)
481 COG2847 Copper(I)-binding prot 28.1 2.1E+02 0.0045 24.4 6.2 49 85-133 44-100 (151)
482 PRK12704 phosphodiesterase; Pr 28.0 6.6E+02 0.014 25.6 12.7 99 172-270 50-153 (520)
483 KOG0978|consensus 27.9 7.7E+02 0.017 26.3 14.1 109 171-279 509-618 (698)
484 PF12072 DUF3552: Domain of un 27.8 4.2E+02 0.009 23.2 9.5 78 183-260 69-146 (201)
485 PF10267 Tmemb_cc2: Predicted 27.8 6E+02 0.013 25.1 11.6 89 176-266 222-315 (395)
486 PF02055 Glyco_hydro_30: O-Gly 27.7 3.3E+02 0.0071 27.6 8.6 65 63-134 427-494 (496)
487 PRK13731 conjugal transfer sur 27.7 60 0.0013 29.8 3.0 40 87-130 54-104 (243)
488 PRK01203 prefoldin subunit alp 27.6 3.6E+02 0.0078 22.4 9.8 74 188-261 3-121 (130)
489 PF10828 DUF2570: Protein of u 27.6 3.1E+02 0.0067 21.7 9.3 68 194-261 20-87 (110)
490 COG4717 Uncharacterized conser 27.5 3.2E+02 0.007 29.8 8.6 80 192-276 774-853 (984)
491 PRK11147 ABC transporter ATPas 27.3 2E+02 0.0044 29.7 7.2 53 208-260 570-628 (635)
492 PRK11578 macrolide transporter 27.2 2.7E+02 0.0058 26.5 7.7 88 171-260 86-184 (370)
493 KOG0993|consensus 27.2 6.5E+02 0.014 25.2 10.6 88 174-261 409-496 (542)
494 KOG3208|consensus 27.0 4.9E+02 0.011 23.7 8.6 59 202-261 54-122 (231)
495 KOG0712|consensus 27.0 1.2E+02 0.0026 29.3 5.1 54 98-152 264-317 (337)
496 KOG0018|consensus 27.0 3.7E+02 0.008 30.0 9.1 81 185-268 655-735 (1141)
497 PF04420 CHD5: CHD5-like prote 27.0 2.9E+02 0.0063 23.4 7.1 56 184-239 39-99 (161)
498 PF08286 Spc24: Spc24 subunit 26.9 23 0.00049 28.6 0.2 43 208-250 1-43 (118)
499 PF07111 HCR: Alpha helical co 26.9 8E+02 0.017 26.2 13.0 107 174-281 485-595 (739)
500 KOG3335|consensus 26.9 71 0.0015 28.0 3.2 53 175-229 91-143 (181)
No 1
>KOG0977|consensus
Probab=100.00 E-value=1.1e-36 Score=298.88 Aligned_cols=166 Identities=36% Similarity=0.638 Sum_probs=138.2
Q ss_pred ccchhhhhHHHHhhhhhhhhcccccCCCCCCccC---CCCCCC-cCCcccchhhhhhcccccccccceeEEeeCCCCCeE
Q psy21 2 GVVFETRIEEVQKKRKTELLCMNVSGAGSSSVAG---TPSRGS-VAPSKKRKLNILEEYESEERTSSGFQVTSSAKGNME 77 (288)
Q Consensus 2 ~~~ld~ei~~yrklle~ee~r~~~~~~~~~~~~~---~~~~~~-~~~~~kr~r~~~~~~~s~~~~~~~~~~~~sa~G~v~ 77 (288)
|++||+||++|||||||||.| +|++.+.+.. +++... +++.+.|. ..+ .+..+.+++++|+|+|.
T Consensus 372 ki~Ld~EI~~YRkLLegee~r---~~~~~s~~~~~~s~~s~~~~~~~~~~r~--~~g------e~~~~ss~~r~a~g~v~ 440 (546)
T KOG0977|consen 372 KISLDAEIAAYRKLLEGEEER---TGSLSSLVVQNVSTSSPRAVRDTRSSRT--VIG------ESESRSSYSRSAKGNVA 440 (546)
T ss_pred HhHHHhHHHHHHHHhccccCC---CCccceeEEeecccCCCcccccCCCccc--ccC------ccccccccccccCCCcc
Confidence 689999999999999999999 4545444431 111111 12222222 111 12233489999999999
Q ss_pred EEeecCCCcEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEEecccCCCCCCCCCceeeecc-cc
Q psy21 78 ITEVDPDGKFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVWSFNVEGATHEPPHNIVMKNQ-KF 156 (288)
Q Consensus 78 I~e~d~~GkfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIwa~~~~~~~~~pp~dlv~~~~-~w 156 (288)
|.||||+||||++.|++.+++++|||+|+|.+||...++|+||++++|+||++|||||+++ |+.|+||+++||+.+ +|
T Consensus 441 i~e~d~~gk~i~~~n~~~~~~~~g~~kl~r~~~~~~~i~~~f~~~~~~~~g~~v~i~~a~~-G~~~~Pp~~lV~~~~~~w 519 (546)
T KOG0977|consen 441 IHECDPEGKFIRLNNKSSEEESIGGWKLRRKIDGKREIVFKFPSGYVLKPGASVTIWAADA-GAVHNPPESLVMKGENTW 519 (546)
T ss_pred eeecccccceeeeccccccccCCCcceEEEecCCcceEEEECCCCceecCCceEEEeecCC-CCccCCCcceeecCCccc
Confidence 9999999999999999999999999999999998788999999999999999999999999 999999999999998 99
Q ss_pred cCccchhhheeccchh-HHHHHHH
Q psy21 157 GSGDEISTILLNTDGE-LANQERI 179 (288)
Q Consensus 157 g~g~~i~t~L~d~~ge-vA~~~~~ 179 (288)
|.|+++.|.|+|.+|+ +|.+.+.
T Consensus 520 g~g~~~~t~l~n~~ge~~as~~~~ 543 (546)
T KOG0977|consen 520 GIGASVRTILYNSEGEEVASHSQV 543 (546)
T ss_pred ccCCCcceeeccccceeeeeeeee
Confidence 9999999999999999 9988653
No 2
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=99.75 E-value=1.7e-17 Score=155.03 Aligned_cols=120 Identities=37% Similarity=0.493 Sum_probs=95.1
Q ss_pred hhheeccchhHHHHHHHHhhhhhHHHhHH------------------HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 163 STILLNTDGELANQERIIDNLRKEQSSLL------------------LELKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 163 ~t~L~d~~gevA~~~~~~~~~~~~~~~~~------------------eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
...++..|..++..++... ..||+.++ +|+..++.++++++.+|..|+..+..|+.++.+
T Consensus 171 L~eiR~~ye~~~~~~~~e~--e~~y~~k~~~l~~~~~~~~~~~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~ 248 (312)
T PF00038_consen 171 LREIRAQYEEIAQKNREEL--EEWYQSKLEELRQQSEKSSEELESAKEELKELRRQIQSLQAELESLRAKNASLERQLRE 248 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhHHHHHHHHHhhhhhhh--hhhcccccccccccccccccccchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHH
Confidence 3444455555666555554 45555554 445556666777888888888888888888888
Q ss_pred HHHH----HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccccC
Q psy21 225 LEDQ----IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEESS 284 (288)
Q Consensus 225 le~~----l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe~R 284 (288)
++.. +..++..|..++.+|..++.+|+.++++|++|||+||+||+||+|||+|||||++|
T Consensus 249 le~~~~~~~~~~~~~i~~le~el~~l~~~~~~~~~ey~~Ll~~K~~Ld~EIatYR~LLEgEE~R 312 (312)
T PF00038_consen 249 LEQRLDEEREEYQAEIAELEEELAELREEMARQLREYQELLDVKLALDAEIATYRKLLEGEESR 312 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTCHC--
T ss_pred HHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhCcccC
Confidence 7754 46789999999999999999999999999999999999999999999999999987
No 3
>PF00932 LTD: Lamin Tail Domain; InterPro: IPR001322 Intermediate filaments (IF) are primordial components of the cytoskeleton and the nuclear envelope []. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups, type I: acidic cytokeratins, type II: basic cytokeratins, type III: vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin, type IV: neurofilaments L, H and M, alpha-internexin and nestin, and type V: nuclear lamins A, B1, B2 and C. The lamins are components of the nuclear lamina, a fibrous layer on the nucleoplasmic side of the inner nuclear membrane that may provide a framework for the nuclear envelope and may interact with chromatin. All IF proteins are structurally similar in that they consist of a central rod domain arranged in coiled-coil alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. The C-terminal domain has been charcterised for the lamins.; PDB: 3UMN_A 2KPW_A 3JT0_A 2LLL_A 1UFG_A 3GEF_D 1IFR_A 1IVT_A.
Probab=99.56 E-value=1.7e-14 Score=116.10 Aligned_cols=99 Identities=27% Similarity=0.594 Sum_probs=76.5
Q ss_pred EEeeCCCCCeEEEeecCCC---cEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEEecccCCCCC
Q psy21 67 QVTSSAKGNMEITEVDPDG---KFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVWSFNVEGATH 143 (288)
Q Consensus 67 ~~~~sa~G~v~I~e~d~~G---kfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIwa~~~~~~~~ 143 (288)
..+.++.+.|.|.|++.+| +||.|.|.++.+++|+||+|.+. ...|.||++.+|.||+.+.||+.+. +
T Consensus 4 ~~~~~~~~~v~I~Ei~~~~~~~e~VEl~N~~~~~vdL~gw~L~~~-----~~~~~~~~~~~l~pg~~~~v~~~~~----~ 74 (116)
T PF00932_consen 4 ATPASAAGDVVINEIMPDGSDNEWVELYNPGDSTVDLSGWKLVDS-----NNTYTFPPGTTLAPGEYVVVWTGGG----N 74 (116)
T ss_dssp -EEEEETSSEEEEEEETTS---SEEEEEE-SSS-EEGTT-EEEEE-----EEEEE--TT-EE-TTEEEEEEECCS----S
T ss_pred cccCCCCCcEEEEEECCCCCcEEEEEEEECCCCcEeeccEEEEEC-----CccEEccCCcCcCCCCEEEEEECCC----C
Confidence 3566788999999999999 99999999999999999999987 4689999999999999999999553 5
Q ss_pred CCCCceeeecc--cccCccchhhheeccchh-HHH
Q psy21 144 EPPHNIVMKNQ--KFGSGDEISTILLNTDGE-LAN 175 (288)
Q Consensus 144 ~pp~dlv~~~~--~wg~g~~i~t~L~d~~ge-vA~ 175 (288)
.++.+++|... .|+.+++ .+.|+|+.|+ |+.
T Consensus 75 ~~~~~~~~~~~~~~~~~~gd-~i~L~d~~G~~vd~ 108 (116)
T PF00932_consen 75 NPPFDLYWGSSTFGLNNSGD-TITLYDPSGEVVDS 108 (116)
T ss_dssp BTTTEEEETTS-SSSSSSEE-EEEEE-TTS-EEEE
T ss_pred CCCcceeeeccCccEeCCCe-EEEEECCCCCEEEE
Confidence 67888888776 8888655 4689999999 654
No 4
>KOG0977|consensus
Probab=99.53 E-value=2.6e-14 Score=141.36 Aligned_cols=122 Identities=25% Similarity=0.342 Sum_probs=100.8
Q ss_pred hhheeccchhHHHHHHHHhhhhhHHHhHH------------------HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 163 STILLNTDGELANQERIIDNLRKEQSSLL------------------LELKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 163 ~t~L~d~~gevA~~~~~~~~~~~~~~~~~------------------eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
+-.++++|......++.+. ..||+.+. ||+...+..+.+|+.+|.+|...+..|+.+|++
T Consensus 251 i~eiRaqye~~~~~nR~di--E~~Y~~kI~~i~~~~~~~~~~~~~~rEEl~~~R~~i~~Lr~klselE~~n~~L~~~I~d 328 (546)
T KOG0977|consen 251 IREIRAQYEAISRQNRKDI--ESWYKRKIQEIRTSAERANVEQNYAREELRRIRSRISGLRAKLSELESRNSALEKRIED 328 (546)
T ss_pred HHHHHHHHHHHHHHhHHHH--HHHHHHHHHHHHhhhccccchhHHHHHHHHHHHhcccchhhhhccccccChhHHHHHHH
Confidence 3455566666444445444 55666554 556667777888999999999999999999999
Q ss_pred HHHHHH----HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccccCCC
Q psy21 225 LEDQIE----YEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEESSLP 286 (288)
Q Consensus 225 le~~l~----~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe~Rl~ 286 (288)
|+.++. .+.+.+...+.++.+||.+|+.++.|||.|||+|+.||+||++||+||+||+.|-+
T Consensus 329 L~~ql~e~~r~~e~~L~~kd~~i~~mReec~~l~~Elq~LlD~ki~Ld~EI~~YRkLLegee~r~~ 394 (546)
T KOG0977|consen 329 LEYQLDEDQRSFEQALNDKDAEIAKMREECQQLSVELQKLLDTKISLDAEIAAYRKLLEGEEERTG 394 (546)
T ss_pred HHhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHhhchHhHHHhHHHHHHHHhccccCCCC
Confidence 998775 46778888999999999999999999999999999999999999999999998854
No 5
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=96.72 E-value=0.033 Score=52.08 Aligned_cols=71 Identities=17% Similarity=0.412 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh--------hhhhhhhhhHHHhhhhhh
Q psy21 209 NELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM--------EIKINLDVEIAQYRNLME 279 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL--------~vKl~Le~EIatYrkLLE 279 (288)
..++..+..+...+..++.++..++.....|+..+..+.......+.+|+.-+ .++..+..=+..|..||+
T Consensus 212 ~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~~~~i~~le~el~~l~~~~~~~~~ey~~Ll~ 290 (312)
T PF00038_consen 212 ESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEYQAEIAELEEELAELREEMARQLREYQELLD 290 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred chhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHhhhccchhHHHHHHHHHHHHHHHHHHHH
Confidence 33333333344444444444455555555555555555555555555555433 334444444455665553
No 6
>KOG4642|consensus
Probab=95.51 E-value=0.01 Score=54.23 Aligned_cols=78 Identities=13% Similarity=0.178 Sum_probs=61.4
Q ss_pred chhHHHHHHHHHHHHHHHHHH-------HHH-----------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy21 204 YDTEMNELRAANYRLTERNKE-------LED-----------QIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKI 265 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~-------le~-----------~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl 265 (288)
...+|++|.+.++.+....+- +.- -+...++.|....+.+...+..+..+|.+||.++|++|
T Consensus 188 ~~kel~elf~~v~e~rk~rEvpd~lcgkIt~el~~~pvi~psgIty~ra~I~Ehl~rvghfdpvtr~~Lte~q~ipN~al 267 (284)
T KOG4642|consen 188 TTKELSELFSKVDEKRKKREVPDYLCGKITLELMREPVITPSGITYDRADIEEHLQRVGHFDPVTRWPLTEYQLIPNLAL 267 (284)
T ss_pred HHHHHHHHHHHHHHHhccccccchhhhhhhHHhhcCCccCccccchhHHHHHHHHHHhccCCchhcccCCHHhhccchHH
Confidence 456777777777776665431 111 13456888999999999999999999999999999999
Q ss_pred hhhhhHHHhhhhhhcccc
Q psy21 266 NLDVEIAQYRNLMEAEES 283 (288)
Q Consensus 266 ~Le~EIatYrkLLEgEe~ 283 (288)
+++ |++|++++++-..
T Consensus 268 kev--Ia~fl~~n~w~~~ 283 (284)
T KOG4642|consen 268 KEV--IAAFLKENEWADD 283 (284)
T ss_pred HHH--HHHHHHhcccccc
Confidence 999 9999999987543
No 7
>PRK11637 AmiB activator; Provisional
Probab=94.94 E-value=0.24 Score=48.67 Aligned_cols=67 Identities=12% Similarity=0.105 Sum_probs=42.1
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
++.....++..+..+|..+...+..++.++..++.++...+..|..++.++...+..+..+++.+..
T Consensus 69 ~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra~Y~ 135 (428)
T PRK11637 69 QRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDAAFR 135 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333334444445555555555555555556666666677777777777777777778777776655
No 8
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=94.86 E-value=0.83 Score=41.76 Aligned_cols=98 Identities=20% Similarity=0.298 Sum_probs=51.6
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHHHhhHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQ-----------------------IEYEQSKFARME 240 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~-----------------------l~~~q~~i~~le 240 (288)
...+..+.+|+...+..+..++.++.+++.++.+++..++++..+ +.-++..+..|+
T Consensus 30 ~~~l~k~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~aL~~E~~~ak~r~~~le 109 (239)
T COG1579 30 RKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLE 109 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556666666666666666666666666666666555543322 223344444455
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 241 AEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 241 ~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
.+|..+.......-.+-..|..-..+++-.++.++..++++
T Consensus 110 ~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~~e~~~~~e~e 150 (239)
T COG1579 110 DELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55555555444433333344444445555666666655543
No 9
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=94.43 E-value=0.9 Score=41.55 Aligned_cols=68 Identities=26% Similarity=0.366 Sum_probs=45.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
.++..|+..++.++.++..++.++.+....+..++.++..++..+.++-+ .+...+-.++.+|+.-+.
T Consensus 89 ~e~~aL~~E~~~ak~r~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~---~~~e~~~~~e~e~~~i~e 156 (239)
T COG1579 89 RELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEK---NLAEAEARLEEEVAEIRE 156 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 45555555555555566667777888888888888888888887776653 344455667777776554
No 10
>PRK11637 AmiB activator; Provisional
Probab=94.21 E-value=0.66 Score=45.58 Aligned_cols=78 Identities=8% Similarity=0.176 Sum_probs=50.4
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
..++....+++.+..+...+..+|..+...+..++.++..++.++...+..|..++.+|..++.++..+...+..++.
T Consensus 54 ~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlr 131 (428)
T PRK11637 54 QDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLD 131 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444455555555555566666666666666666666666677777777777777777777777777666666654
No 11
>PRK09039 hypothetical protein; Validated
Probab=93.99 E-value=0.46 Score=45.64 Aligned_cols=9 Identities=33% Similarity=0.427 Sum_probs=3.3
Q ss_pred HHHHhhHHH
Q psy21 233 QSKFARMEA 241 (288)
Q Consensus 233 q~~i~~le~ 241 (288)
+.+|..++.
T Consensus 171 ~~~i~~L~~ 179 (343)
T PRK09039 171 QAKIADLGR 179 (343)
T ss_pred HHHHHHHHH
Confidence 333333333
No 12
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=93.88 E-value=2.3 Score=43.30 Aligned_cols=38 Identities=29% Similarity=0.450 Sum_probs=18.6
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 231 YEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
.....|..|+.++..+......+-+....|.+++..++
T Consensus 217 e~~~ri~~LEedi~~l~qk~~E~e~~~~~lk~~~~elE 254 (546)
T PF07888_consen 217 EARQRIRELEEDIKTLTQKEKEQEKELDKLKELKAELE 254 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666666666665555333333334444443333
No 13
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=93.18 E-value=0.67 Score=40.72 Aligned_cols=77 Identities=23% Similarity=0.298 Sum_probs=43.2
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKI 265 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl 265 (288)
.+..++......+..++.++..|+..+..|+..+.+....++..++.+..|.-++..+...+.+.-.|+.+|++-=|
T Consensus 106 ~l~~~~~~~~~~l~~l~~~~~~L~~~~~~l~~~l~ek~k~~e~l~DE~~~L~l~~~~~e~k~~~l~~En~~Lv~Rwm 182 (194)
T PF08614_consen 106 ELEKELSEKERRLAELEAELAQLEEKIKDLEEELKEKNKANEILQDELQALQLQLNMLEEKLRKLEEENRELVERWM 182 (194)
T ss_dssp --------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444455555555555555555555555555666677777777777777777777778888776543
No 14
>PRK10884 SH3 domain-containing protein; Provisional
Probab=93.02 E-value=1.9 Score=38.51 Aligned_cols=49 Identities=18% Similarity=0.235 Sum_probs=30.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
+.++......+..|+.+++.+..+++..+..+..++.++..++.+...+
T Consensus 124 ~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~~~~~~~~~ 172 (206)
T PRK10884 124 QQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQRTIIMQ 172 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445555555666666666666667777777777777776665544
No 15
>PRK09039 hypothetical protein; Validated
Probab=92.68 E-value=1.6 Score=42.01 Aligned_cols=39 Identities=13% Similarity=0.003 Sum_probs=17.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEI 243 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el 243 (288)
+.+..+.+.++..|..+|+.|+.+++.+++.|...+.+.
T Consensus 129 k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~ 167 (343)
T PRK09039 129 KQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRD 167 (343)
T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444444444444444443
No 16
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=92.56 E-value=2.9 Score=35.89 Aligned_cols=47 Identities=28% Similarity=0.493 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy21 217 RLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEI 263 (288)
Q Consensus 217 ~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~v 263 (288)
.+..+...++.++..++.++..|+.++..+..+..-...+|+.|++|
T Consensus 101 ~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~~eDY~~L~~I 147 (161)
T TIGR02894 101 ALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDI 147 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33445556777788889999999999999999999999999999997
No 17
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=92.21 E-value=3.1 Score=35.01 Aligned_cols=36 Identities=17% Similarity=0.178 Sum_probs=20.5
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
..+++++.+....+++..|..|.+.+...+.....+
T Consensus 8 v~~kLK~~~~e~dsle~~v~~LEreLe~~q~~~e~~ 43 (140)
T PF10473_consen 8 VEEKLKESESEKDSLEDHVESLERELEMSQENKECL 43 (140)
T ss_pred HHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHhHHHH
Confidence 345566666666666666666666655555544433
No 18
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=92.09 E-value=4.9 Score=36.55 Aligned_cols=64 Identities=19% Similarity=0.307 Sum_probs=32.0
Q ss_pred HhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 199 GRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 199 ~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
.-+.+++.|++.|.........+|+.....+..+.+.|...+.+..+.+..+.+...+|..|.+
T Consensus 32 ~~L~e~~kE~~~L~~Er~~h~eeLrqI~~DIn~lE~iIkqa~~er~~~~~~i~r~~eey~~Lk~ 95 (230)
T PF10146_consen 32 KCLEEYRKEMEELLQERMAHVEELRQINQDINTLENIIKQAESERNKRQEKIQRLYEEYKPLKD 95 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555555555554444444444445555555555555555555555555444
No 19
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=91.64 E-value=4.3 Score=36.90 Aligned_cols=89 Identities=17% Similarity=0.162 Sum_probs=55.3
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
....++......+..-..=|.+++..+..|..+-.....+|.+.+..|..||..+.+++.+..+....-+.|.+....|-
T Consensus 15 k~k~~i~~e~~~~e~ee~~L~e~~kE~~~L~~Er~~h~eeLrqI~~DIn~lE~iIkqa~~er~~~~~~i~r~~eey~~Lk 94 (230)
T PF10146_consen 15 KLKNEILQEVESLENEEKCLEEYRKEMEELLQERMAHVEELRQINQDINTLENIIKQAESERNKRQEKIQRLYEEYKPLK 94 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444433344445556666666666666666666677777777777777777777777666666666666666666
Q ss_pred hhHHHhhhh
Q psy21 269 VEIAQYRNL 277 (288)
Q Consensus 269 ~EIatYrkL 277 (288)
-+|..-|+=
T Consensus 95 ~~in~~R~e 103 (230)
T PF10146_consen 95 DEINELRKE 103 (230)
T ss_pred HHHHHHHHH
Confidence 666655543
No 20
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=91.36 E-value=3.7 Score=36.29 Aligned_cols=75 Identities=20% Similarity=0.277 Sum_probs=37.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
|+.-+..|+.....|.++...++.+...+-+.|..|..+...+..+.......+++|-.-+..|-.-|..|-.|+
T Consensus 72 Lk~~~~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~kl~~e~~~lk~~~~eL~~~~~~Lq~Ql~~~e~l~ 146 (193)
T PF14662_consen 72 LKTLAKSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGKLLAERDGLKKRSKELATEKATLQRQLCEFESLI 146 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhhhHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 333333444444444444444554444445555555555555555555555555555555555555554444443
No 21
>KOG0161|consensus
Probab=91.13 E-value=3.7 Score=47.61 Aligned_cols=100 Identities=23% Similarity=0.314 Sum_probs=70.9
Q ss_pred hhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHH
Q psy21 182 NLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYE-------QSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 182 ~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~-------q~~i~~le~el~~lr~~~~~ql 254 (288)
.+...+..+..|+..+..++.+....+..+.+.+..|+.++.+++..++.. ......|..++..++.++..+.
T Consensus 1066 el~~~l~kke~El~~l~~k~e~e~~~~~~l~k~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ele~l~~~Lee~~ 1145 (1930)
T KOG0161|consen 1066 ELDNQLKKKESELSQLQSKLEDEQAEVAQLQKQIKELEARIKELEEELEAERASRAKAERQRRDLSEELEELKEELEEQG 1145 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334555556666777777777777777777777778887777776655443 3344556666777777777776
Q ss_pred HHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 255 KDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 255 ~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
..-...+.+....+.|+..-++-|+-+
T Consensus 1146 ~~t~~q~e~~~k~e~e~~~l~~~leee 1172 (1930)
T KOG0161|consen 1146 GTTAAQLELNKKREAEVQKLRRDLEEE 1172 (1930)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 666777888888999999999888754
No 22
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=90.98 E-value=5.6 Score=39.49 Aligned_cols=79 Identities=19% Similarity=0.222 Sum_probs=47.1
Q ss_pred hHHHHHHHHHHhhCC--chhHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHhhHHHHHHHHHHHHH-
Q psy21 189 SLLLELKETKGRSGK--YDTEMNELRAANYRLTERNKELEDQ--------------IEYEQSKFARMEAEIKRLQTEMT- 251 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~--l~~ei~~L~~~~~~Le~~l~~le~~--------------l~~~q~~i~~le~el~~lr~~~~- 251 (288)
.+...|...+.+.+- -..+...+-..+..|+.++.+++.+ +..++..|..|+.+++..+..+.
T Consensus 260 ~Ar~aL~~fRn~~gvlDP~~~a~~~~~lI~~Le~qLa~~~aeL~~L~~~~~p~sPqV~~l~~rI~aLe~QIa~er~kl~~ 339 (434)
T PRK15178 260 AARLELLKIQHIQKDIDPKETITAIYQLIAGFETQLAEAKAEYAQLMVNGLDQNPLIPRLSAKIKVLEKQIGEQRNRLSN 339 (434)
T ss_pred HHHHHHHHHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCchhHHHHHHHHHHHHHHHHHHHhhc
Confidence 334444444444432 2455555556666666666555443 34567788888888888887774
Q ss_pred --------HHHHHHHHHhhhhhhhhhhHH
Q psy21 252 --------EQLKDYRELMEIKINLDVEIA 272 (288)
Q Consensus 252 --------~ql~eyq~LL~vKl~Le~EIa 272 (288)
..+.+|+.| .||.|+|
T Consensus 340 ~~g~~~la~~laeYe~L-----~le~efA 363 (434)
T PRK15178 340 KLGSQGSSESLSLFEDL-----RLQSEIA 363 (434)
T ss_pred CCCCCchhHHHHHHHHH-----HHHHHHH
Confidence 256677764 4555655
No 23
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=90.82 E-value=4.2 Score=35.17 Aligned_cols=21 Identities=14% Similarity=0.237 Sum_probs=8.0
Q ss_pred hCCchhHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTER 221 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~ 221 (288)
+..+..+++++...++.+...
T Consensus 90 l~~l~~el~~l~~~~~~~~~~ 110 (191)
T PF04156_consen 90 LQQLQEELDQLQERIQELESE 110 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333
No 24
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=90.53 E-value=7.8 Score=35.26 Aligned_cols=23 Identities=22% Similarity=0.302 Sum_probs=9.2
Q ss_pred chhHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le 226 (288)
++.|++.|+..+..++..+...+
T Consensus 61 l~~e~e~L~~~~~~l~~~v~~q~ 83 (251)
T PF11932_consen 61 LEREIENLEVYNEQLERQVASQE 83 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444443333333
No 25
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=89.86 E-value=3.4 Score=40.72 Aligned_cols=67 Identities=18% Similarity=0.306 Sum_probs=45.1
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
...+.....++.....+.+.|+.+|.+++..+..++.++.+.+..+...+..|..++..|..+..+-
T Consensus 44 q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~~~~~l~~l~~q~ 110 (420)
T COG4942 44 QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQE 110 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHHHH
Confidence 4555556666666666667777888888888888888777776666666666666655555555544
No 26
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=89.78 E-value=6.9 Score=36.39 Aligned_cols=52 Identities=17% Similarity=0.219 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 225 LEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 225 le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
.+.+|+.+|..--..-.|...+..++..+-..|-.-+..-..|+.++..|.+
T Consensus 209 ~qKRL~sLq~vRPAfmdEyEklE~EL~~lY~~Y~~kfRNl~yLe~qle~~~~ 260 (267)
T PF10234_consen 209 NQKRLQSLQSVRPAFMDEYEKLEEELQKLYEIYVEKFRNLDYLEHQLEEYNR 260 (267)
T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 3334555554444445666666666666666666666555667777666654
No 27
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=89.53 E-value=7.1 Score=32.62 Aligned_cols=71 Identities=14% Similarity=0.350 Sum_probs=36.1
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
....++..+...++.|+.++..+++.+..++.....++..+......+.....+++.++..+......|..
T Consensus 63 ~l~~d~~~l~~~~~rL~~~~~~~ere~~~~~~~~~~l~~~~~~~~~~~k~~kee~~klk~~~~~~~tq~~~ 133 (151)
T PF11559_consen 63 RLRSDIERLQNDVERLKEQLEELERELASAEEKERQLQKQLKSLEAKLKQEKEELQKLKNQLQQRKTQYEH 133 (151)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444455555555555555555544555555555555555555555555555554444444
No 28
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=89.36 E-value=4 Score=37.91 Aligned_cols=63 Identities=16% Similarity=0.237 Sum_probs=28.4
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLK 255 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~ 255 (288)
+.+....++++|..+|+++...++.+..++..++..+..++..|..++..|.....-+..++|
T Consensus 46 ~~~~~q~ei~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r~~~l~~raR 108 (265)
T COG3883 46 EKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRAR 108 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333344444444444444444444444444444444444454444444444444444444
No 29
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=88.95 E-value=8.7 Score=36.46 Aligned_cols=48 Identities=29% Similarity=0.344 Sum_probs=28.2
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
....|.....++.+++.++.....+-..+=.-|..|..+|+.-.+.++
T Consensus 221 ~~~~i~~~k~~l~el~~el~~l~~~i~~~~~~k~~l~~eI~e~~~~~~ 268 (325)
T PF08317_consen 221 QKEEIEAKKKELAELQEELEELEEKIEELEEQKQELLAEIAEAEKIRE 268 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444455555555555555556666667777777777666654
No 30
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=88.92 E-value=5.5 Score=37.74 Aligned_cols=21 Identities=24% Similarity=0.358 Sum_probs=10.5
Q ss_pred HHHHHhhhhhhhhhhHHHhhh
Q psy21 256 DYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 256 eyq~LL~vKl~Le~EIatYrk 276 (288)
-|.+++..-.-..-|+.+||+
T Consensus 284 kY~E~~~mL~EaQEElk~lR~ 304 (306)
T PF04849_consen 284 KYAECMAMLHEAQEELKTLRK 304 (306)
T ss_pred HHHHHHHHHHHHHHHHHHhhC
Confidence 444444444444455556653
No 31
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=88.70 E-value=7.1 Score=31.05 Aligned_cols=77 Identities=16% Similarity=0.242 Sum_probs=43.7
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHH----------------------HHHHHHHHHHHHHHHhhHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTER----------------------NKELEDQIEYEQSKFARMEAEI 243 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~----------------------l~~le~~l~~~q~~i~~le~el 243 (288)
.|+...++++.+..+++.++.++.+....+..|+.- +.+++.+++.+...|..++..+
T Consensus 11 ~~q~~q~~~~~l~~q~~~le~~~~E~~~v~~eL~~l~~d~~vyk~VG~vlv~~~~~e~~~~l~~r~e~ie~~i~~lek~~ 90 (110)
T TIGR02338 11 QLQQLQQQLQAVATQKQQVEAQLKEAEKALEELERLPDDTPVYKSVGNLLVKTDKEEAIQELKEKKETLELRVKTLQRQE 90 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcchhHHHhchhhheecHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555555555555544431 1123344555666677777777
Q ss_pred HHHHHHHHHHHHHHHHHhh
Q psy21 244 KRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 244 ~~lr~~~~~ql~eyq~LL~ 262 (288)
..++..+...-..++.++.
T Consensus 91 ~~l~~~l~e~q~~l~~~~~ 109 (110)
T TIGR02338 91 ERLREQLKELQEKIQEALA 109 (110)
T ss_pred HHHHHHHHHHHHHHHHHhc
Confidence 7777777766666666653
No 32
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=88.53 E-value=2.9 Score=37.85 Aligned_cols=63 Identities=22% Similarity=0.304 Sum_probs=43.7
Q ss_pred hhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 200 RSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 200 ~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
...+++.+++++.+.+..|-.++.++++.+...|..+.+++.+.+++........-+|..|-+
T Consensus 136 ~~ee~kekl~E~~~EkeeL~~eleele~e~ee~~erlk~le~E~s~LeE~~~~l~~ev~~L~~ 198 (290)
T COG4026 136 DYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKK 198 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHH
Confidence 334456667777777777777777788777777777777777777776666665556665543
No 33
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=88.46 E-value=15 Score=31.60 Aligned_cols=28 Identities=21% Similarity=0.336 Sum_probs=12.3
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
+|+......+.++..++..++.....+.
T Consensus 95 ~el~~l~~~~~~~~~~l~~~~~~~~~~~ 122 (191)
T PF04156_consen 95 EELDQLQERIQELESELEKLKEDLQELR 122 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 3444444444444444444444444444
No 34
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=88.41 E-value=3.1 Score=39.53 Aligned_cols=47 Identities=23% Similarity=0.366 Sum_probs=22.8
Q ss_pred hhHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNK-------ELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~-------~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
..+|..++..+..+..++. +++.++...+..|..+..+...+..+|.
T Consensus 208 ~~eL~~lr~eL~~~~~~i~~~k~~l~el~~el~~l~~~i~~~~~~k~~l~~eI~ 261 (325)
T PF08317_consen 208 QEELEALRQELAEQKEEIEAKKKELAELQEELEELEEKIEELEEQKQELLAEIA 261 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555554444444444 4444444555555555555444444443
No 35
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=88.26 E-value=9.9 Score=34.69 Aligned_cols=48 Identities=23% Similarity=0.443 Sum_probs=22.3
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 203 KYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
.++.++..++.++..|...+..+..++...+..+..+...+...+..+
T Consensus 60 ~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~l 107 (302)
T PF10186_consen 60 QLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSRL 107 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555444444444444444444444444444433
No 36
>PRK02119 hypothetical protein; Provisional
Probab=88.23 E-value=5 Score=29.88 Aligned_cols=46 Identities=13% Similarity=0.080 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
++..+..++..||..+...+..++.++..+.....+|..++..+..
T Consensus 3 ~~~~~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~id~L~~ql~~ 48 (73)
T PRK02119 3 IQQNLENRIAELEMKIAFQENLLEELNQALIEQQFVIDKMQVQLRY 48 (73)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666666666666666666666666555555555555544433
No 37
>PF00345 PapD_N: Pili and flagellar-assembly chaperone, PapD N-terminal domain; InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=88.19 E-value=0.88 Score=36.50 Aligned_cols=54 Identities=13% Similarity=0.206 Sum_probs=39.4
Q ss_pred cCCCcEEEEEeCCccceeccCeEEEEE-eCC--cccEEEEecCCeEecCC--cEEEEEe
Q psy21 82 DPDGKFVKLYNKGSEEQSLGNWQLVRK-VQG--EVKTSFKFHRTLKIEAG--GTVTVWS 135 (288)
Q Consensus 82 d~~GkfV~l~N~s~~~~~L~gw~l~r~-v~~--~~~~~y~Fp~~~~L~~g--~~vtIwa 135 (288)
+..+..|.|.|+++++.-+--|..... .++ ......-+|+-|.|+|| +.|.||-
T Consensus 14 ~~~~~~i~v~N~~~~~~~vq~~v~~~~~~~~~~~~~~~~vsPp~~~L~pg~~q~vRv~~ 72 (122)
T PF00345_consen 14 SQRSASITVTNNSDQPYLVQVWVYDQDDEDEDEPTDPFIVSPPIFRLEPGESQTVRVYR 72 (122)
T ss_dssp TSSEEEEEEEESSSSEEEEEEEEEETTSTTSSSSSSSEEEESSEEEEETTEEEEEEEEE
T ss_pred CCCEEEEEEEcCCCCcEEEEEEEEcCCCcccccccccEEEeCCceEeCCCCcEEEEEEe
Confidence 444679999999999988888887610 111 12246789999999999 6777766
No 38
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=88.00 E-value=11 Score=36.63 Aligned_cols=95 Identities=19% Similarity=0.218 Sum_probs=63.6
Q ss_pred hhHHHhHHHHHHHHHHhh----CCchhHHHHHHHHHHHHHHHHHHHHHHH-------H-----------------HHHHH
Q psy21 184 RKEQSSLLLELKETKGRS----GKYDTEMNELRAANYRLTERNKELEDQI-------E-----------------YEQSK 235 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~----~~l~~ei~~L~~~~~~Le~~l~~le~~l-------~-----------------~~q~~ 235 (288)
...+.....|++..+.++ .....||.++.+.+..|+..+.+.++.+ + .+-..
T Consensus 246 n~al~~Ri~et~~ak~~Le~ql~~~~~ei~~~e~~i~~L~~ai~~k~~~lkvaqTRL~~R~~RP~vElcrD~~q~~L~~E 325 (384)
T PF03148_consen 246 NAALRKRIHETQEAKNELEWQLKKTLQEIAEMEKNIEDLEKAIRDKEGPLKVAQTRLENRTQRPNVELCRDPPQYGLIEE 325 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhhHhcCCchHHHHhhHHHHHHHH
Confidence 345555566666665543 3356777777777777777766533211 0 12344
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 236 FARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 236 i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
+..+..-+..|...+..--..++.|...++.|+.+|+.=..-|
T Consensus 326 v~~l~~~i~~L~~~L~~a~~~l~~L~~~~~~Le~di~~K~~sL 368 (384)
T PF03148_consen 326 VKELRESIEALQEKLDEAEASLQKLERTRLRLEEDIAVKNNSL 368 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667777778888888888899999999999999998654433
No 39
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=87.58 E-value=9.1 Score=32.14 Aligned_cols=62 Identities=21% Similarity=0.361 Sum_probs=36.0
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHH-----------------HHHHHHHhhHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQI-----------------EYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l-----------------~~~q~~i~~le~el~~lr~~~~ 251 (288)
+-+.++.+-.+......+|.+|+..++.|+.++..++..| ..++..|..||.+|......+.
T Consensus 19 ~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~~~~~E~l~rriq~LEeele~ae~~L~ 97 (143)
T PF12718_consen 19 LEAKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKRKSNAEQLNRRIQLLEEELEEAEKKLK 97 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHhhHHHHHHHHHHHHHHHH
Confidence 3344444444455566677777777777776666544332 2455666666777666665553
No 40
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=87.46 E-value=12 Score=39.35 Aligned_cols=36 Identities=19% Similarity=0.375 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhh-hh
Q psy21 244 KRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNL-ME 279 (288)
Q Consensus 244 ~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkL-LE 279 (288)
+++..++.+..+|+...=+.+..|+.|+..||+. -|
T Consensus 548 ~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~~e 584 (697)
T PF09726_consen 548 RQLESELKKLRRELKQKEEQIRELESELQELRKYEKE 584 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3333444444455555566677788888888876 54
No 41
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=87.19 E-value=17 Score=32.30 Aligned_cols=86 Identities=24% Similarity=0.279 Sum_probs=39.4
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH-------HHHHHH---HHHHhhHHHHHHHHHHHHHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELE-------DQIEYE---QSKFARMEAEIKRLQTEMTEQLKD 256 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le-------~~l~~~---q~~i~~le~el~~lr~~~~~ql~e 256 (288)
+++.++++...+.+-...+..+.++...+.+|...|..++ ..+..+ ...+..+...+..+..++...--+
T Consensus 29 IksLKeei~emkk~e~~~~k~m~ei~~eN~~L~epL~~a~~e~~eL~k~L~~y~kdK~~L~~~k~rl~~~ek~l~~Lk~e 108 (201)
T PF13851_consen 29 IKSLKEEIAEMKKKEERNEKLMAEISQENKRLSEPLKKAEEEVEELRKQLKNYEKDKQSLQNLKARLKELEKELKDLKWE 108 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555555566666666665555544332 222222 222233333444444444444444
Q ss_pred HHHHhhhhhhhhhhHH
Q psy21 257 YRELMEIKINLDVEIA 272 (288)
Q Consensus 257 yq~LL~vKl~Le~EIa 272 (288)
|+.|..---.|+.|-.
T Consensus 109 ~evL~qr~~kle~Erd 124 (201)
T PF13851_consen 109 HEVLEQRFEKLEQERD 124 (201)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5554444444444433
No 42
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=86.73 E-value=7.9 Score=35.94 Aligned_cols=57 Identities=18% Similarity=0.356 Sum_probs=25.4
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
+..+....+.++.+|+.|..++..+...+.+++.++...+..|..++.+|..++..|
T Consensus 40 l~~~~~~~~~~q~ei~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I 96 (265)
T COG3883 40 LSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENI 96 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333444444444444444444444444444444444444444444444443
No 43
>KOG4687|consensus
Probab=86.61 E-value=7.9 Score=36.20 Aligned_cols=109 Identities=17% Similarity=0.221 Sum_probs=76.8
Q ss_pred hh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHhh
Q psy21 171 GE-LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQS-----------KFAR 238 (288)
Q Consensus 171 ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~-----------~i~~ 238 (288)
|. +|++......+...+.++.+-+..+...+...+.|-..|......|+..+..++..++-.-+ .-.+
T Consensus 8 ga~iae~k~e~sAlhqK~~aKtdairiL~QdLEkfe~Ekd~~a~~aETLeln~ealere~eLlaa~gc~a~~e~gterqd 87 (389)
T KOG4687|consen 8 GAEIAELKKEFSALHQKCGAKTDAIRILGQDLEKFENEKDGLAARAETLELNLEALERELELLAACGCDAKIEFGTERQD 87 (389)
T ss_pred cHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHhhhHHHHhcCCCchhhccchhhH
Confidence 55 88888777766778888999999999988888888888888888888888777765442211 1123
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 239 MEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 239 le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
|-+.|...+.+-.+..-+-+.|++.|.-|--.+.++|..-.
T Consensus 88 Laa~i~etkeeNlkLrTd~eaL~dq~adLhgD~elfReTeA 128 (389)
T KOG4687|consen 88 LAADIEETKEENLKLRTDREALLDQKADLHGDCELFRETEA 128 (389)
T ss_pred HHHHHHHHHHHhHhhhHHHHHHHHHHHHHhchHHHHHHHHH
Confidence 33344444444444455778888888888777777776543
No 44
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=86.22 E-value=15 Score=30.70 Aligned_cols=58 Identities=17% Similarity=0.367 Sum_probs=29.9
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYR 258 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq 258 (288)
+..+..++..|...+..|+.++..++..+...+.....++.++..+...+...-.|.+
T Consensus 61 ~~~l~~d~~~l~~~~~rL~~~~~~~ere~~~~~~~~~~l~~~~~~~~~~~k~~kee~~ 118 (151)
T PF11559_consen 61 LRRLRSDIERLQNDVERLKEQLEELERELASAEEKERQLQKQLKSLEAKLKQEKEELQ 118 (151)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345556666666666666666666655554444444444444444444443333333
No 45
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=86.04 E-value=5.7 Score=29.48 Aligned_cols=46 Identities=17% Similarity=0.295 Sum_probs=22.1
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKF 236 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i 236 (288)
+-|+.+++.+.++|..+.++++.....|+.+...+..+-..+|..|
T Consensus 24 QmEieELKEknn~l~~e~q~~q~~reaL~~eneqlk~e~~~WQerl 69 (79)
T COG3074 24 QMEIEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNGWQERL 69 (79)
T ss_pred HHHHHHHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444555555555555555555444444444444443
No 46
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=85.88 E-value=2.9 Score=42.87 Aligned_cols=49 Identities=27% Similarity=0.546 Sum_probs=39.7
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
...++..+.+++..|+.++++|+..+..++..|..|+.+|.+++..+..
T Consensus 420 ~~~~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~ 468 (652)
T COG2433 420 YEKRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRD 468 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567778888888888888888888888888888888888888877753
No 47
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=85.72 E-value=5.6 Score=41.86 Aligned_cols=36 Identities=19% Similarity=0.406 Sum_probs=17.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARME 240 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le 240 (288)
+.+|+.|+..++.|+..+.+|....+.....+..||
T Consensus 459 k~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LE 494 (697)
T PF09726_consen 459 KSELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLE 494 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555555555555444443333333333
No 48
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=85.61 E-value=17 Score=29.17 Aligned_cols=41 Identities=15% Similarity=0.259 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 212 RAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 212 ~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
-++...|+..+..|+.+.......+..|++.+..++..+..
T Consensus 36 ~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki~ea~~~le~ 76 (107)
T PF09304_consen 36 AKQKDQLRNALQSLQAQNASRNQRIAELQAKIDEARRNLED 76 (107)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344433444433333334444444444444444443
No 49
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=85.47 E-value=11 Score=30.87 Aligned_cols=78 Identities=24% Similarity=0.354 Sum_probs=36.5
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH-------HhhHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTE---RNKELEDQIEYEQSK-------FARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~---~l~~le~~l~~~q~~-------i~~le~el~~lr~~~~~ql 254 (288)
..+...++++.++..+-..+..||-.|-..+..+.. ++..++.++..++.+ ++.-.+++..++.++...-
T Consensus 30 ~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~~ry~t~LellGEK~E~veEL~~Dv~DlK 109 (120)
T PF12325_consen 30 GELASLQEELARLEAERDELREEIVKLMEENEELRALKKEVEELEQELEELQQRYQTLLELLGEKSEEVEELRADVQDLK 109 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHH
Confidence 344444444444444444444444444444433322 222333333333322 2333666777777777665
Q ss_pred HHHHHHhh
Q psy21 255 KDYRELME 262 (288)
Q Consensus 255 ~eyq~LL~ 262 (288)
.=|....+
T Consensus 110 ~myr~Qi~ 117 (120)
T PF12325_consen 110 EMYREQID 117 (120)
T ss_pred HHHHHHHH
Confidence 55655443
No 50
>PF04871 Uso1_p115_C: Uso1 / p115 like vesicle tethering protein, C terminal region; InterPro: IPR006955 This domain identifies a group of proteins, which are described as: General vesicular transport factor, Transcytosis associate protein (TAP) and Vesicle docking protein. This myosin-shaped molecule consists of an N-terminal globular head region, a coiled-coil tail which mediates dimerisation, and a short C-terminal acidic region []. p115 tethers COP1 vesicles to the Golgi by binding the coiled coil proteins giantin (on the vesicles) and GM130 (on the Golgi), via its C-terminal acidic region. It is required for intercisternal transport in the Golgi stack. This domain is found in the acidic C-terminal region, which binds to the golgins giantin and GM130. p115 is thought to juxtapose two membranes by binding giantin with one acidic region, and GM130 with another [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport, 0005737 cytoplasm, 0016020 membrane
Probab=85.40 E-value=17 Score=30.23 Aligned_cols=49 Identities=20% Similarity=0.332 Sum_probs=32.6
Q ss_pred HHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 231 YEQSKFARMEAEIKRLQTEM-TEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~-~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
.....+..++.....++.+. ...-.|..+||=+=--|+.-|..|+..|.
T Consensus 59 ~~~~~~~~l~~~~~kl~~E~~~~~q~EldDLL~ll~Dle~K~~kyk~rLk 108 (136)
T PF04871_consen 59 ELASEVKELEAEKEKLKEEARKEAQSELDDLLVLLGDLEEKRKKYKERLK 108 (136)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHhHHHHHHHHHHHHH
Confidence 33444444555555555433 23345888888888889999999999884
No 51
>PRK04406 hypothetical protein; Provisional
Probab=85.01 E-value=10 Score=28.45 Aligned_cols=41 Identities=20% Similarity=0.177 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
.+..++..||..+.-.+..++.++..+.....+|..++..+
T Consensus 8 ~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I~~L~~ql 48 (75)
T PRK04406 8 QLEERINDLECQLAFQEQTIEELNDALSQQQLLITKMQDQM 48 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555555444444445444444444444444433
No 52
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=84.79 E-value=3.8 Score=29.62 Aligned_cols=36 Identities=19% Similarity=0.296 Sum_probs=14.4
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le 226 (288)
.+||+..+.....++++|++-..++..|+.++..|+
T Consensus 17 ~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~ 52 (61)
T PF08826_consen 17 QEELTKVKSANLAFESKLQEAEKRNRELEQEIERLK 52 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444333334444444444444433333333
No 53
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=84.79 E-value=10 Score=39.07 Aligned_cols=71 Identities=18% Similarity=0.301 Sum_probs=34.8
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHH-----------------HHHHHHHhhHHHHHHHHHHHHHHHH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQI-----------------EYEQSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l-----------------~~~q~~i~~le~el~~lr~~~~~ql 254 (288)
.+++.+..+++.++.++.+|++.+..++..+..|+.++ +..+..|..|+.+|..-+..+...-
T Consensus 422 ~~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~~~~~~rei~~~~~~I~~L~~~L~e~~~~ve~L~ 501 (652)
T COG2433 422 KRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEELE 501 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444444444444433332 2345556666666666655555544
Q ss_pred HHHHHHhh
Q psy21 255 KDYRELME 262 (288)
Q Consensus 255 ~eyq~LL~ 262 (288)
+....|..
T Consensus 502 ~~l~~l~k 509 (652)
T COG2433 502 RKLAELRK 509 (652)
T ss_pred HHHHHHHH
Confidence 45555443
No 54
>KOG4603|consensus
Probab=84.22 E-value=5.7 Score=34.56 Aligned_cols=36 Identities=19% Similarity=0.207 Sum_probs=17.5
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
+|++.+.+.+..|+.++++|+.+....+++|.+|..
T Consensus 79 eel~~ld~~i~~l~ek~q~l~~t~s~veaEik~L~s 114 (201)
T KOG4603|consen 79 EELQVLDGKIVALTEKVQSLQQTCSYVEAEIKELSS 114 (201)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555444444444555555544444444444443
No 55
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=84.12 E-value=18 Score=40.19 Aligned_cols=10 Identities=30% Similarity=0.501 Sum_probs=4.6
Q ss_pred eEecCCcEEE
Q psy21 123 LKIEAGGTVT 132 (288)
Q Consensus 123 ~~L~~g~~vt 132 (288)
.++.|+..+|
T Consensus 644 ~~~~~~G~~t 653 (1163)
T COG1196 644 DLVEPSGSIT 653 (1163)
T ss_pred cEEeCCeeee
Confidence 3444445544
No 56
>KOG0250|consensus
Probab=83.73 E-value=8.7 Score=41.94 Aligned_cols=43 Identities=16% Similarity=0.156 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccc
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEE 282 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe 282 (288)
+.+|....+.++....+...+..-+.-+....++++.=|.||-
T Consensus 747 ~~eIe~~~~~~e~l~~e~e~~~~e~~e~~~~~~~~~~~l~~e~ 789 (1074)
T KOG0250|consen 747 EKEIEEKEAPLEKLKEELEHIELEAQELEEYYAAGREKLQGEI 789 (1074)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444455555555555555543
No 57
>PF08581 Tup_N: Tup N-terminal; InterPro: IPR013890 The N-terminal region of the Tup protein has been shown to interact with the Ssn6 transcriptional co-repressor []. ; PDB: 3VP9_B 3VP8_B.
Probab=83.64 E-value=12 Score=28.38 Aligned_cols=65 Identities=25% Similarity=0.332 Sum_probs=28.5
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
+..|.+...|+...+.+..+++.+|+.--..++.+...+-+||....... ...|.+|.+++.+++
T Consensus 10 r~Ef~~~~~e~~~~k~~~~e~e~ki~~Qi~Em~~ir~~v~eLE~~h~kmK---~~YEeEI~rLr~eLe 74 (79)
T PF08581_consen 10 RQEFENLSQEANSYKHQKDEYEHKINSQIQEMQQIRQKVYELEQAHRKMK---QQYEEEIARLRRELE 74 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence 34444444445555544444444444444444444444444443322111 223555555555443
No 58
>PF08647 BRE1: BRE1 E3 ubiquitin ligase; InterPro: IPR013956 BRE1 is an E3 ubiquitin ligase that has been shown to act as a transcriptional activator through direct activator interactions [].
Probab=83.47 E-value=12 Score=29.18 Aligned_cols=47 Identities=21% Similarity=0.273 Sum_probs=26.0
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
....+..+|+.+.++..+++..-++|-..+..|=.|+.|+...+.++
T Consensus 22 k~~~~~~lE~k~~rl~~Ek~kadqkyfa~mr~~d~l~~e~k~L~~~~ 68 (96)
T PF08647_consen 22 KVKELTILEQKKLRLEAEKAKADQKYFAAMRSKDALDNEMKKLNTQL 68 (96)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 33344444555555555555555566666666666666665555554
No 59
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=83.32 E-value=13 Score=27.63 Aligned_cols=57 Identities=23% Similarity=0.321 Sum_probs=28.4
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
|..+....+.|...-..++..|..|...+.+++..+..+...+...+.++..++..+
T Consensus 14 Ia~L~eEGekLSk~el~~~~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l 70 (74)
T PF12329_consen 14 IAQLMEEGEKLSKKELKLNNTIKKLRAKIKELEKQIKELKKKLEELEKELESLEERL 70 (74)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333333333333333444445555555555555555555555555555555555443
No 60
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=83.26 E-value=18 Score=32.80 Aligned_cols=37 Identities=16% Similarity=0.151 Sum_probs=16.0
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le 226 (288)
..+++..+..++..++.....+.+.+..++.++.+++
T Consensus 54 L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~ 90 (251)
T PF11932_consen 54 LLAEYRQLEREIENLEVYNEQLERQVASQEQELASLE 90 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444444444443333
No 61
>KOG1962|consensus
Probab=83.24 E-value=12 Score=33.84 Aligned_cols=59 Identities=24% Similarity=0.362 Sum_probs=36.9
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
++.+.+.+.....-|+.++++.+..++..+.....+..+...+..+..+-+.||+.|-+
T Consensus 149 ~~~~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e~~~~EydrLlee~~~Lq~ 207 (216)
T KOG1962|consen 149 LEEENDKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSEGLQDEYDRLLEEYSKLQE 207 (216)
T ss_pred hhhhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHH
Confidence 55666666666666666666666666666666666666666666666666666655543
No 62
>PRK02224 chromosome segregation protein; Provisional
Probab=83.13 E-value=13 Score=39.81 Aligned_cols=23 Identities=22% Similarity=0.302 Sum_probs=9.7
Q ss_pred HHHHHhhhhhhhhhhHHHhhhhh
Q psy21 256 DYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 256 eyq~LL~vKl~Le~EIatYrkLL 278 (288)
+++.|-.-...+...+..|++.|
T Consensus 420 ~~~~l~~~~~~~~~~~~~~~~~l 442 (880)
T PRK02224 420 ERDELREREAELEATLRTARERV 442 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333334444444444443
No 63
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=83.09 E-value=24 Score=28.86 Aligned_cols=8 Identities=38% Similarity=0.493 Sum_probs=3.3
Q ss_pred heeccchh
Q psy21 165 ILLNTDGE 172 (288)
Q Consensus 165 ~L~d~~ge 172 (288)
.|+--.||
T Consensus 24 ~lr~~E~E 31 (120)
T PF12325_consen 24 QLRRLEGE 31 (120)
T ss_pred HHHHHHHH
Confidence 33344444
No 64
>KOG4807|consensus
Probab=82.91 E-value=43 Score=32.98 Aligned_cols=37 Identities=27% Similarity=0.388 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccc
Q psy21 243 IKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEE 282 (288)
Q Consensus 243 l~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe 282 (288)
|.++..+-...+.-.|+|-| .|-.||...|.||-|.+
T Consensus 458 LRqCQrEnQELnaHNQELnn---RLaaEItrLRtlltgdG 494 (593)
T KOG4807|consen 458 LRQCQRENQELNAHNQELNN---RLAAEITRLRTLLTGDG 494 (593)
T ss_pred HHHHHHhhHHHHHHHHHHhh---HHHHHHHHHHHHhccCC
Confidence 33333333333333444433 57889999999997654
No 65
>PRK02224 chromosome segregation protein; Provisional
Probab=82.80 E-value=11 Score=40.13 Aligned_cols=46 Identities=20% Similarity=0.148 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccccCC
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEESSL 285 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe~Rl 285 (288)
+..+..++.++.....++..+-...-.+...|..-+.+|....|.+
T Consensus 411 e~~l~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~~~~~l~~~~Cp~ 456 (880)
T PRK02224 411 EDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPE 456 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC
Confidence 3444444444444444444444444444444444444554433433
No 66
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=82.52 E-value=5.8 Score=29.06 Aligned_cols=45 Identities=18% Similarity=0.311 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
.|.+|+.++.-++..+++|...+..-+..|..|+.++..+...+.
T Consensus 5 Ri~~LE~~la~qe~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~ 49 (69)
T PF04102_consen 5 RIEELEIKLAFQEDTIEELNDVVTEQQRQIDRLQRQLRLLRERLR 49 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455555555555555555555555555555555555544443
No 67
>PRK00295 hypothetical protein; Provisional
Probab=82.47 E-value=11 Score=27.65 Aligned_cols=39 Identities=18% Similarity=0.262 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRL 246 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~l 246 (288)
|.+|+-++.-.+..+.+|...+..-+..|..|+.++..+
T Consensus 7 i~~LE~kla~qE~tie~Ln~~v~~Qq~~I~~L~~ql~~L 45 (68)
T PRK00295 7 VTELESRQAFQDDTIQALNDVLVEQQRVIERLQLQMAAL 45 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444433333344444443433344444444444433
No 68
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=82.28 E-value=20 Score=33.95 Aligned_cols=41 Identities=24% Similarity=0.387 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQ 247 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr 247 (288)
+++.|+.....|..++.+++.+.+.....+..++.+...+.
T Consensus 51 el~~le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~ 91 (314)
T PF04111_consen 51 ELEKLEQEEEELLQELEELEKEREELDQELEELEEELEELD 91 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333333333333333333333
No 69
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=82.20 E-value=23 Score=39.45 Aligned_cols=6 Identities=17% Similarity=0.423 Sum_probs=2.3
Q ss_pred eEEEEE
Q psy21 103 WQLVRK 108 (288)
Q Consensus 103 w~l~r~ 108 (288)
|.+.|.
T Consensus 98 i~v~Rr 103 (1163)
T COG1196 98 ISVTRR 103 (1163)
T ss_pred EEEEEE
Confidence 334333
No 70
>PRK00736 hypothetical protein; Provisional
Probab=82.12 E-value=11 Score=27.55 Aligned_cols=41 Identities=20% Similarity=0.260 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
|.+|+-++.-.+..+++|...+..-+..|..|..++..+..
T Consensus 7 i~~LE~klafqe~tie~Ln~~v~~Qq~~i~~L~~ql~~L~~ 47 (68)
T PRK00736 7 LTELEIRVAEQEKTIEELSDQLAEQWKTVEQMRKKLDALTE 47 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444444444444444444444433
No 71
>PRK02793 phi X174 lysis protein; Provisional
Probab=82.12 E-value=13 Score=27.52 Aligned_cols=38 Identities=26% Similarity=0.165 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
+..++..||..+.-.+..++.++..+.....++..+..
T Consensus 6 ~e~Ri~~LE~~lafQe~tIe~Ln~~v~~Qq~~I~~L~~ 43 (72)
T PRK02793 6 LEARLAELESRLAFQEITIEELNVTVTAHEMEMAKLRD 43 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444444444444444333
No 72
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=81.92 E-value=15 Score=27.34 Aligned_cols=24 Identities=38% Similarity=0.555 Sum_probs=11.2
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
|+.++.+|+..+..|..+...|..
T Consensus 23 Lq~e~eeLke~n~~L~~e~~~L~~ 46 (72)
T PF06005_consen 23 LQMENEELKEKNNELKEENEELKE 46 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHH
Confidence 444445555554444444444443
No 73
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=81.86 E-value=19 Score=34.18 Aligned_cols=21 Identities=24% Similarity=0.268 Sum_probs=8.1
Q ss_pred chhHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~ 224 (288)
+..+|..|+.....|..++..
T Consensus 62 l~~eL~~LE~e~~~l~~el~~ 82 (314)
T PF04111_consen 62 LLQELEELEKEREELDQELEE 82 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333
No 74
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=81.21 E-value=22 Score=27.22 Aligned_cols=28 Identities=21% Similarity=0.483 Sum_probs=11.7
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHH
Q psy21 231 YEQSKFARMEAEIKRLQTEMTEQLKDYR 258 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~~~ql~eyq 258 (288)
..+..+..++.++..++..+.....++.
T Consensus 63 ~~~~~~~~~~~~~~~~r~~l~~a~~~~k 90 (123)
T PF02050_consen 63 QQQQELERLEQEVEQAREELQEARRERK 90 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444444433333
No 75
>KOG0976|consensus
Probab=81.18 E-value=20 Score=38.26 Aligned_cols=62 Identities=16% Similarity=0.209 Sum_probs=37.6
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
+.-.+..+...++.+++|.+.|+.++++++.++.+.+-+++.....+..++.+|..-+.+|.
T Consensus 104 ~~sQiriLQn~c~~lE~ekq~lQ~ti~~~q~d~ke~etelE~~~srlh~le~eLsAk~~eIf 165 (1265)
T KOG0976|consen 104 HESQIRILQNKCLRLEMEKQKLQDTIQGAQDDKKENEIEIENLNSRLHKLEDELSAKAHDIF 165 (1265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHhhhhHHHH
Confidence 33344444445556677777777777777777766665665555555555555555554443
No 76
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=81.07 E-value=12 Score=35.49 Aligned_cols=6 Identities=17% Similarity=0.086 Sum_probs=2.2
Q ss_pred chhhhe
Q psy21 161 EISTIL 166 (288)
Q Consensus 161 ~i~t~L 166 (288)
++...|
T Consensus 144 gLk~~L 149 (312)
T smart00787 144 GLKEGL 149 (312)
T ss_pred HHHHHH
Confidence 333333
No 77
>PF14282 FlxA: FlxA-like protein
Probab=80.77 E-value=10 Score=30.15 Aligned_cols=24 Identities=29% Similarity=0.501 Sum_probs=12.3
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 230 EYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 230 ~~~q~~i~~le~el~~lr~~~~~q 253 (288)
..++..|..|+++|+.+..+...+
T Consensus 54 q~Lq~QI~~LqaQI~qlq~q~~~~ 77 (106)
T PF14282_consen 54 QLLQAQIQQLQAQIAQLQSQQAEQ 77 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555555555555544433
No 78
>PRK09343 prefoldin subunit beta; Provisional
Probab=80.69 E-value=29 Score=28.19 Aligned_cols=33 Identities=18% Similarity=0.275 Sum_probs=17.7
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
++-....|..++.....++..+...-..++.++
T Consensus 80 ~E~ie~~ik~lekq~~~l~~~l~e~q~~l~~ll 112 (121)
T PRK09343 80 KELLELRSRTLEKQEKKLREKLKELQAKINEML 112 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455555566665555555555555555444
No 79
>KOG2264|consensus
Probab=80.68 E-value=9.7 Score=39.06 Aligned_cols=66 Identities=21% Similarity=0.318 Sum_probs=32.3
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh---hhhhh
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM---EIKIN 266 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL---~vKl~ 266 (288)
-..|+.++..+-|.|..+|++++..+..| +..|..-+.+|.+++.++++..+.|.+|+ +.|+.
T Consensus 91 Vs~EL~ele~krqel~seI~~~n~kiEel--------------k~~i~~~q~eL~~Lk~~ieqaq~~~~El~~~n~pkl~ 156 (907)
T KOG2264|consen 91 VSLELTELEVKRQELNSEIEEINTKIEEL--------------KRLIPQKQLELSALKGEIEQAQRQLEELRETNNPKLF 156 (907)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHH--------------HHHHHHhHHHHHHHHhHHHHHHHHHHHHHhhcCCcee
Confidence 34455555444444555555444444333 33333344455555555555545555544 56666
Q ss_pred hhh
Q psy21 267 LDV 269 (288)
Q Consensus 267 Le~ 269 (288)
|-.
T Consensus 157 LP~ 159 (907)
T KOG2264|consen 157 LPF 159 (907)
T ss_pred ecc
Confidence 643
No 80
>PRK04325 hypothetical protein; Provisional
Probab=80.57 E-value=11 Score=28.10 Aligned_cols=39 Identities=15% Similarity=0.201 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRL 246 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~l 246 (288)
|.+|+-++.-.+..+++|...+..-+..|..|..+++.+
T Consensus 11 i~~LE~klAfQE~tIe~LN~vv~~Qq~~I~~L~~ql~~L 49 (74)
T PRK04325 11 ITELEIQLAFQEDLIDGLNATVARQQQTLDLLQAQLRLL 49 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333333444444333333
No 81
>PF06818 Fez1: Fez1; InterPro: IPR009638 This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5 []. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth []. Note that some family members contain more than one copy of this region.; GO: 0005737 cytoplasm, 0016020 membrane
Probab=80.56 E-value=20 Score=32.07 Aligned_cols=87 Identities=16% Similarity=0.218 Sum_probs=58.1
Q ss_pred heeccchhHHHH--HHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH-------HHHHHH------
Q psy21 165 ILLNTDGELANQ--ERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK-------ELEDQI------ 229 (288)
Q Consensus 165 ~L~d~~gevA~~--~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~-------~le~~l------ 229 (288)
.++-.+|||+-. ..+++ ...+..+-.||-.+++++.+++.++......+..|...+. ..+.++
T Consensus 4 evCqk~GEIsLLKqQLke~--q~E~~~K~~Eiv~Lr~ql~e~~~~l~~~~~~~~~l~~~~~~K~~ELE~ce~ELqr~~~E 81 (202)
T PF06818_consen 4 EVCQKSGEISLLKQQLKES--QAEVNQKDSEIVSLRAQLRELRAELRNKESQIQELQDSLRTKQLELEVCENELQRKKNE 81 (202)
T ss_pred hHhhhhhhHHHHHHHHHHH--HHHHHHHHhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhHhHHHhHHHHHHHhCH
Confidence 345677885543 33444 5667777888888888887777777777777766665443 122222
Q ss_pred -HHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 230 -EYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 230 -~~~q~~i~~le~el~~lr~~~~~q 253 (288)
+.++.++..++.++..++......
T Consensus 82 a~lLrekl~~le~El~~Lr~~l~~~ 106 (202)
T PF06818_consen 82 AELLREKLGQLEAELAELREELACA 106 (202)
T ss_pred HHHhhhhhhhhHHHHHHHHHHHHhh
Confidence 335677888888999888888775
No 82
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=80.25 E-value=27 Score=30.98 Aligned_cols=82 Identities=20% Similarity=0.249 Sum_probs=46.1
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHhhHHHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQS----------KFARMEAEIKRLQTEMTEQLK 255 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~----------~i~~le~el~~lr~~~~~ql~ 255 (288)
.+++..+|+..++.-..+++.+-..|.++...++.+.+.+...++.+|. .+..--.+|..-...+..|+.
T Consensus 61 ~aK~l~eEledLk~~~~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~kl~~e~~~lk~~~~eL~~~~~~Lq~Ql~ 140 (193)
T PF14662_consen 61 KAKALEEELEDLKTLAKSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGKLLAERDGLKKRSKELATEKATLQRQLC 140 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhhhHHHHHHHHHHhhHHHHHHHH
Confidence 3344466666666666566555555555555555555544443333222 122224455555667778888
Q ss_pred HHHHHhhhhhhh
Q psy21 256 DYRELMEIKINL 267 (288)
Q Consensus 256 eyq~LL~vKl~L 267 (288)
+|..|++.+=++
T Consensus 141 ~~e~l~~~~da~ 152 (193)
T PF14662_consen 141 EFESLICQRDAI 152 (193)
T ss_pred HHHHHHHHHHHH
Confidence 888888877544
No 83
>KOG0250|consensus
Probab=80.16 E-value=34 Score=37.60 Aligned_cols=25 Identities=24% Similarity=0.317 Sum_probs=12.1
Q ss_pred EEEEEeCCccce---eccCe-EEEEEeCC
Q psy21 87 FVKLYNKGSEEQ---SLGNW-QLVRKVQG 111 (288)
Q Consensus 87 fV~l~N~s~~~~---~L~gw-~l~r~v~~ 111 (288)
.|.|.|.|.... --|+- .|.|.+..
T Consensus 113 sItL~N~G~~Afk~eiyG~~IiIER~I~~ 141 (1074)
T KOG0250|consen 113 SITLSNSGLDAFKPEIYGNSIIIERTIRR 141 (1074)
T ss_pred EEEEecCCcccCChhhcCCeeEEEEeecc
Confidence 466777664222 23332 35555553
No 84
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=80.14 E-value=25 Score=29.49 Aligned_cols=40 Identities=28% Similarity=0.417 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
+...+..|+......+.++..++.++..+|.++..+...+
T Consensus 19 ~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l 58 (143)
T PF12718_consen 19 LEAKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQL 58 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333344444444444444444444444433
No 85
>PRK04406 hypothetical protein; Provisional
Probab=80.01 E-value=11 Score=28.23 Aligned_cols=47 Identities=17% Similarity=0.275 Sum_probs=26.4
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
++..|.+|+-++.-.+..+++|...+..-+..|..|..+++.+...+
T Consensus 9 le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I~~L~~ql~~L~~rl 55 (75)
T PRK04406 9 LEERINDLECQLAFQEQTIEELNDALSQQQLLITKMQDQMKYVVGKV 55 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555555555555555666666655554433
No 86
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=80.00 E-value=18 Score=26.11 Aligned_cols=45 Identities=18% Similarity=0.357 Sum_probs=29.8
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy21 203 KYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQ 247 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr 247 (288)
.+..||...+..+..++..|++.+.+...+...|..|+.++..++
T Consensus 15 ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~ee~r 59 (61)
T PF08826_consen 15 AIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEMEELR 59 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 356667777777777777777777666666666666666655544
No 87
>PRK02793 phi X174 lysis protein; Provisional
Probab=79.94 E-value=13 Score=27.57 Aligned_cols=48 Identities=13% Similarity=0.208 Sum_probs=30.4
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
++..|.+|+-++.-.+..+.+|...+..-+..|..|..++..+...+.
T Consensus 6 ~e~Ri~~LE~~lafQe~tIe~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~ 53 (72)
T PRK02793 6 LEARLAELESRLAFQEITIEELNVTVTAHEMEMAKLRDHLRLLTEKLK 53 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445566666666666666666666666666666666666666655443
No 88
>KOG0995|consensus
Probab=79.94 E-value=32 Score=35.26 Aligned_cols=37 Identities=19% Similarity=0.337 Sum_probs=21.0
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 231 YEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
.++..+-.++-+++..-.++...+.+|..|.. |+.|.
T Consensus 357 ~l~k~vw~~~l~~~~~f~~le~~~~~~~~l~~-~i~l~ 393 (581)
T KOG0995|consen 357 RLSKEVWELKLEIEDFFKELEKKFIDLNSLIR-RIKLG 393 (581)
T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH
Confidence 33444444455556666666777777766654 34444
No 89
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=79.91 E-value=39 Score=29.15 Aligned_cols=24 Identities=29% Similarity=0.494 Sum_probs=12.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQ 228 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~ 228 (288)
+.++..|+.....|..++..++.+
T Consensus 72 k~~~~~lr~~~e~L~~eie~l~~~ 95 (177)
T PF07798_consen 72 KSEFAELRSENEKLQREIEKLRQE 95 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555555555544443
No 90
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=79.61 E-value=15 Score=35.06 Aligned_cols=38 Identities=18% Similarity=0.386 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 239 MEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 239 le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
+=.+|..++..+...-+||+.++|-|--|-.|-..|+.
T Consensus 131 lV~qLEk~~~q~~qLe~d~qs~lDEkeEl~~ERD~yk~ 168 (319)
T PF09789_consen 131 LVEQLEKLREQIEQLERDLQSLLDEKEELVTERDAYKC 168 (319)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455667778888888999999999999999999984
No 91
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=79.60 E-value=34 Score=31.75 Aligned_cols=38 Identities=16% Similarity=0.249 Sum_probs=17.0
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
..++.+..||++...+. +.++..|+.+...|.+.+...
T Consensus 63 ~~l~~ak~eLqe~eek~---e~~l~~Lq~ql~~l~akI~k~ 100 (258)
T PF15397_consen 63 KQLQQAKAELQEWEEKE---ESKLSKLQQQLEQLDAKIQKT 100 (258)
T ss_pred HHHHHHHHHHHHHHHHH---HhHHHHHHHHHHHHHHHHHHH
Confidence 34445555555544322 334444444444444444433
No 92
>KOG0971|consensus
Probab=79.35 E-value=30 Score=37.45 Aligned_cols=37 Identities=19% Similarity=0.223 Sum_probs=15.7
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERN 222 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l 222 (288)
.++.+..|+.+++..-..|..+++.+..++..|..++
T Consensus 404 elE~k~sE~~eL~r~kE~Lsr~~d~aEs~iadlkEQV 440 (1243)
T KOG0971|consen 404 ELEKKNSELEELRRQKERLSRELDQAESTIADLKEQV 440 (1243)
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444443344444444444444444443
No 93
>PRK04325 hypothetical protein; Provisional
Probab=79.34 E-value=20 Score=26.73 Aligned_cols=44 Identities=20% Similarity=0.169 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 209 NELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
+.+..++..||..+.-.+..++.++..+.....+|..+...+..
T Consensus 5 ~~~e~Ri~~LE~klAfQE~tIe~LN~vv~~Qq~~I~~L~~ql~~ 48 (74)
T PRK04325 5 QEMEDRITELEIQLAFQEDLIDGLNATVARQQQTLDLLQAQLRL 48 (74)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666666666666666666666666666666666555443
No 94
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=79.23 E-value=41 Score=35.06 Aligned_cols=29 Identities=17% Similarity=0.385 Sum_probs=16.4
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
+++.+..|..++..+...+..+.++.+.|
T Consensus 85 Lq~E~~~L~kElE~L~~qlqaqv~~ne~L 113 (617)
T PF15070_consen 85 LQAEAEHLRKELESLEEQLQAQVENNEQL 113 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555656666565555555554
No 95
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=78.95 E-value=28 Score=29.86 Aligned_cols=76 Identities=16% Similarity=0.231 Sum_probs=50.0
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
...++.+|++-..+..++..++.+-..|+.+-+.|+..+..++..++....-+..+..-|.-+..++......+.+
T Consensus 76 E~dik~AYe~A~~lQ~~L~~~re~E~qLr~rRD~LErrl~~l~~tierAE~l~sqi~vvl~yL~~dl~~v~~~~e~ 151 (159)
T PF05384_consen 76 EEDIKEAYEEAHELQVRLAMLREREKQLRERRDELERRLRNLEETIERAENLVSQIGVVLNYLSGDLQQVSEQIED 151 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 4566677777777777777777777777777777777777777666655555555555555566655554433333
No 96
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=78.92 E-value=39 Score=32.56 Aligned_cols=22 Identities=27% Similarity=0.224 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYE 232 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~ 232 (288)
|+..+..+..+.+.++.+|..+
T Consensus 132 LE~li~~~~EEn~~lqlqL~~l 153 (401)
T PF06785_consen 132 LEGLIRHLREENQCLQLQLDAL 153 (401)
T ss_pred HHHHHHHHHHHHHHHHHhHHHH
Confidence 3333333333333333344333
No 97
>PRK00295 hypothetical protein; Provisional
Probab=78.85 E-value=18 Score=26.47 Aligned_cols=42 Identities=24% Similarity=0.271 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
+..++..||..+.-.+..++.++..+.....+|..++..+..
T Consensus 3 ~e~Ri~~LE~kla~qE~tie~Ln~~v~~Qq~~I~~L~~ql~~ 44 (68)
T PRK00295 3 LEERVTELESRQAFQDDTIQALNDVLVEQQRVIERLQLQMAA 44 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666677777777777777777777777777666655544
No 98
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=78.76 E-value=37 Score=30.89 Aligned_cols=16 Identities=13% Similarity=0.111 Sum_probs=5.9
Q ss_pred chhHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLT 219 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le 219 (288)
++.++..++..+....
T Consensus 75 l~~~i~~~~~~i~~~r 90 (302)
T PF10186_consen 75 LRERIERLRKRIEQKR 90 (302)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333333333333333
No 99
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=78.66 E-value=23 Score=26.36 Aligned_cols=31 Identities=16% Similarity=0.245 Sum_probs=18.9
Q ss_pred HHHHHhhCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 195 KETKGRSGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 195 ~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
+.....+.=|++||.+|+..++.|..+.+++
T Consensus 14 qqAvdTI~LLQmEieELKEknn~l~~e~q~~ 44 (79)
T COG3074 14 QQAIDTITLLQMEIEELKEKNNSLSQEVQNA 44 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHhHHHHHHH
Confidence 3334444456777778877777776655443
No 100
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=78.55 E-value=19 Score=33.69 Aligned_cols=57 Identities=23% Similarity=0.425 Sum_probs=34.2
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHH-----HHHHHHHHHHHHH--HHHHHHhhHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYR-----LTERNKELEDQIE--YEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~-----Le~~l~~le~~l~--~~q~~i~~le~el~~lr~~~ 250 (288)
|++...++++-..||.+|+.+..+ +|+++-.+|+||+ ..+..|..|.+-+..+|..+
T Consensus 77 Lkes~~~l~dRetEI~eLksQL~RMrEDWIEEECHRVEAQLALKEARkEIkQLkQvieTmrssL 140 (305)
T PF15290_consen 77 LKESENRLHDRETEIDELKSQLARMREDWIEEECHRVEAQLALKEARKEIKQLKQVIETMRSSL 140 (305)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 455666666777788888877654 4556666777654 34445555555554444433
No 101
>PF15254 CCDC14: Coiled-coil domain-containing protein 14
Probab=78.50 E-value=41 Score=35.79 Aligned_cols=71 Identities=25% Similarity=0.245 Sum_probs=38.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH----HHHHhhhhhhhhhhHHHhh
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKD----YRELMEIKINLDVEIAQYR 275 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~e----yq~LL~vKl~Le~EIatYr 275 (288)
..||-.|+..+..|+.++.+....++.+|.+-..|-.-+...+.+-.+.... -+.|+..|-..|+|++--+
T Consensus 426 n~El~sLqSlN~~Lq~ql~es~k~~e~lq~kneellk~~e~q~~Enk~~~~~~~ekd~~l~~~kq~~d~e~~rik 500 (861)
T PF15254_consen 426 NLELFSLQSLNMSLQNQLQESLKSQELLQSKNEELLKVIENQKEENKRLRKMFQEKDQELLENKQQFDIETTRIK 500 (861)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 4567777777777777776655544444444444433333333333332222 3566666777776665433
No 102
>KOG0996|consensus
Probab=78.46 E-value=53 Score=36.57 Aligned_cols=14 Identities=14% Similarity=0.494 Sum_probs=6.1
Q ss_pred HHHHhhHHHHHHHH
Q psy21 233 QSKFARMEAEIKRL 246 (288)
Q Consensus 233 q~~i~~le~el~~l 246 (288)
+..|..++.++..+
T Consensus 864 ~~~ie~l~kE~e~~ 877 (1293)
T KOG0996|consen 864 EEQIEELKKEVEEL 877 (1293)
T ss_pred HHHHHHHHHHHHHH
Confidence 34444444444444
No 103
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=78.39 E-value=14 Score=29.30 Aligned_cols=23 Identities=22% Similarity=0.381 Sum_probs=9.6
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHH
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~ 251 (288)
+..++-.+..++.++..+.+.+.
T Consensus 67 v~~L~l~l~el~G~~~~l~~~l~ 89 (106)
T PF10805_consen 67 VHDLQLELAELRGELKELSARLQ 89 (106)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHH
Confidence 33344444444444444444443
No 104
>PRK10884 SH3 domain-containing protein; Provisional
Probab=78.37 E-value=18 Score=32.32 Aligned_cols=21 Identities=19% Similarity=0.300 Sum_probs=12.3
Q ss_pred ccEEEEecCCeEecCCcEEEEEe
Q psy21 113 VKTSFKFHRTLKIEAGGTVTVWS 135 (288)
Q Consensus 113 ~~~~y~Fp~~~~L~~g~~vtIwa 135 (288)
+...|++- -+|++|..|+|-.
T Consensus 40 Pg~~y~Iv--~~l~~G~~v~vl~ 60 (206)
T PRK10884 40 PGDQYRIV--GTLNAGEEVTLLQ 60 (206)
T ss_pred CCCCCceE--EEEcCCCEEEEEE
Confidence 33444442 3577777777765
No 105
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=78.26 E-value=25 Score=26.11 Aligned_cols=49 Identities=18% Similarity=0.199 Sum_probs=24.5
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQ 233 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q 233 (288)
..++.+.+.+..++..+..|+.+-..|...+..|..++..+......++
T Consensus 11 ~ki~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e~~~~~ 59 (72)
T PF06005_consen 11 EKIQQAVETIALLQMENEELKEKNNELKEENEELKEENEQLKQERNAWQ 59 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555555555555555555555555554444333333
No 106
>PRK02119 hypothetical protein; Provisional
Probab=78.06 E-value=14 Score=27.50 Aligned_cols=48 Identities=13% Similarity=0.198 Sum_probs=29.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
++..|.+|+-++.-.+..+.+|...+..-+..|..|..++..+...+.
T Consensus 7 ~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~id~L~~ql~~L~~rl~ 54 (73)
T PRK02119 7 LENRIAELEMKIAFQENLLEELNQALIEQQFVIDKMQVQLRYMANKLK 54 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556666666666666666666666666666666666666655443
No 107
>PRK00846 hypothetical protein; Provisional
Probab=77.99 E-value=20 Score=27.15 Aligned_cols=44 Identities=18% Similarity=0.131 Sum_probs=19.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
...|.+|+.++.-.+..+++|...+...+..|..|..++..+..
T Consensus 12 e~Ri~~LE~rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L~~ 55 (77)
T PRK00846 12 EARLVELETRLSFQEQALTELSEALADARLTGARNAELIRHLLE 55 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444444444444444433
No 108
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=77.97 E-value=41 Score=28.31 Aligned_cols=18 Identities=17% Similarity=0.289 Sum_probs=7.7
Q ss_pred HHHHHHHHHHhhHHHHHH
Q psy21 227 DQIEYEQSKFARMEAEIK 244 (288)
Q Consensus 227 ~~l~~~q~~i~~le~el~ 244 (288)
..++..+.+|..|+....
T Consensus 87 k~lq~~q~kv~eLE~~~~ 104 (140)
T PF10473_consen 87 KELQKKQEKVSELESLNS 104 (140)
T ss_pred HHHHHHHHHHHHHHHHhH
Confidence 334444444444444333
No 109
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=77.34 E-value=34 Score=28.24 Aligned_cols=33 Identities=9% Similarity=0.266 Sum_probs=14.3
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKELEDQIEYEQ 233 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q 233 (288)
++.+..++++..........+..++...+.+..
T Consensus 70 Id~vd~klDe~~ei~~~i~~eV~~v~~dv~~i~ 102 (126)
T PF07889_consen 70 IDRVDDKLDEQKEISKQIKDEVTEVREDVSQIG 102 (126)
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHH
Confidence 344444444444444444444444443333333
No 110
>PF11611 DUF4352: Domain of unknown function (DUF4352); InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=77.16 E-value=4.6 Score=31.78 Aligned_cols=77 Identities=17% Similarity=0.222 Sum_probs=31.8
Q ss_pred cCCCcEE----EEEeCCccceecc--CeEEEEEeCCcccEEEEe------cCCeEecCCcEEEEEecccCCCCCCCCCce
Q psy21 82 DPDGKFV----KLYNKGSEEQSLG--NWQLVRKVQGEVKTSFKF------HRTLKIEAGGTVTVWSFNVEGATHEPPHNI 149 (288)
Q Consensus 82 d~~GkfV----~l~N~s~~~~~L~--gw~l~r~v~~~~~~~y~F------p~~~~L~~g~~vtIwa~~~~~~~~~pp~dl 149 (288)
...++|| .|+|+++++..+. .+.|...-+..-...+.+ +....|+||.+++.|-.=. -....+|..|
T Consensus 32 ~~g~~fv~v~v~v~N~~~~~~~~~~~~f~l~d~~g~~~~~~~~~~~~~~~~~~~~i~pG~~~~g~l~F~-vp~~~~~~~l 110 (123)
T PF11611_consen 32 KEGNKFVVVDVTVKNNGDEPLDFSPSDFKLYDSDGNKYDPDFSASSNDNDLFSETIKPGESVTGKLVFE-VPKDDKPYTL 110 (123)
T ss_dssp ---SEEEEEEEEEEE-SSS-EEEEGGGEEEE-TT--B--EEE-CCCTTTB--EEEE-TT-EEEEEEEEE-ESTT-GG-EE
T ss_pred CCCCEEEEEEEEEEECCCCcEEecccceEEEeCCCCEEcccccchhccccccccEECCCCEEEEEEEEE-ECCCCccEEE
Confidence 4456665 5689998888774 567763222212222322 1246999999999885332 1222234666
Q ss_pred eeecc-cccCc
Q psy21 150 VMKNQ-KFGSG 159 (288)
Q Consensus 150 v~~~~-~wg~g 159 (288)
..... .|..+
T Consensus 111 ~~~~~~~~~~~ 121 (123)
T PF11611_consen 111 EYSPDISWSDK 121 (123)
T ss_dssp EE-H-------
T ss_pred EEecCcccccc
Confidence 66544 57643
No 111
>KOG2264|consensus
Probab=77.14 E-value=18 Score=37.13 Aligned_cols=19 Identities=21% Similarity=0.226 Sum_probs=7.1
Q ss_pred HhhhhhhhhhhHHHhhhhh
Q psy21 260 LMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 260 LL~vKl~Le~EIatYrkLL 278 (288)
|...|...+.-=-+|+.|.
T Consensus 130 L~~Lk~~ieqaq~~~~El~ 148 (907)
T KOG2264|consen 130 LSALKGEIEQAQRQLEELR 148 (907)
T ss_pred HHHHHhHHHHHHHHHHHHH
Confidence 3333333333333344443
No 112
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=77.11 E-value=22 Score=30.21 Aligned_cols=23 Identities=17% Similarity=0.216 Sum_probs=9.9
Q ss_pred HHHHHHHHHhhCCchhHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRA 213 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~ 213 (288)
.+++..+..++.+...+|.+|+.
T Consensus 26 ~~e~~~~k~ql~~~d~~i~~Lk~ 48 (155)
T PF06810_consen 26 KEERDNLKTQLKEADKQIKDLKK 48 (155)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 34444444444444444444443
No 113
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=76.73 E-value=18 Score=28.77 Aligned_cols=12 Identities=17% Similarity=0.396 Sum_probs=4.5
Q ss_pred HhhHHHHHHHHH
Q psy21 236 FARMEAEIKRLQ 247 (288)
Q Consensus 236 i~~le~el~~lr 247 (288)
+..|+-++..++
T Consensus 67 v~~L~l~l~el~ 78 (106)
T PF10805_consen 67 VHDLQLELAELR 78 (106)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 114
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=76.51 E-value=24 Score=32.17 Aligned_cols=67 Identities=22% Similarity=0.303 Sum_probs=33.2
Q ss_pred HHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 195 KETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 195 ~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
..++.+++.+..+-.+|...+..|+.+..+++.++..+....+.|+..+..+-.+..+.-.-+.+|.
T Consensus 138 ee~kekl~E~~~EkeeL~~eleele~e~ee~~erlk~le~E~s~LeE~~~~l~~ev~~L~~r~~ELe 204 (290)
T COG4026 138 EELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELE 204 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHhc
Confidence 3344444445455555555555555555555555555544455555555555544444444444443
No 115
>KOG0971|consensus
Probab=76.22 E-value=48 Score=36.02 Aligned_cols=31 Identities=26% Similarity=0.372 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhHH
Q psy21 242 EIKRLQTEMTEQLKDYRELMEIKINLDVEIA 272 (288)
Q Consensus 242 el~~lr~~~~~ql~eyq~LL~vKl~Le~EIa 272 (288)
..+.+..+++..-.|..+|-.+|-.|..+|.
T Consensus 397 d~qK~~kelE~k~sE~~eL~r~kE~Lsr~~d 427 (1243)
T KOG0971|consen 397 DHQKLQKELEKKNSELEELRRQKERLSRELD 427 (1243)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 3444444555555555555555555544443
No 116
>KOG0933|consensus
Probab=76.20 E-value=22 Score=38.78 Aligned_cols=21 Identities=14% Similarity=0.196 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHhhhhhhhhhh
Q psy21 250 MTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 250 ~~~ql~eyq~LL~vKl~Le~E 270 (288)
|...+.+-+.++.-|...+.+
T Consensus 887 i~~~~~~~e~~~~e~~~~~l~ 907 (1174)
T KOG0933|consen 887 ISGLLTSQEKCLSEKSDGELE 907 (1174)
T ss_pred HhhhhhHHHHHHHHhhcccch
Confidence 333344444444444444433
No 117
>PRK00736 hypothetical protein; Provisional
Probab=76.13 E-value=24 Score=25.84 Aligned_cols=42 Identities=17% Similarity=0.169 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
+..++..||..+...+..++.++..+.....+|..+...+..
T Consensus 3 ~e~Ri~~LE~klafqe~tie~Ln~~v~~Qq~~i~~L~~ql~~ 44 (68)
T PRK00736 3 AEERLTELEIRVAEQEKTIEELSDQLAEQWKTVEQMRKKLDA 44 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445566677777777777777777777767666666655443
No 118
>KOG0964|consensus
Probab=76.10 E-value=30 Score=37.73 Aligned_cols=56 Identities=16% Similarity=0.285 Sum_probs=46.0
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccc
Q psy21 227 DQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEE 282 (288)
Q Consensus 227 ~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe 282 (288)
..+.++...|..|+..|..++.+-++.-.+|...+.-|+.|+.+|.....=++|.+
T Consensus 265 d~~~~~~~~i~ele~~l~~l~~ekeq~~a~~t~~~k~kt~lel~~kdlq~~i~~n~ 320 (1200)
T KOG0964|consen 265 DESEDLKCEIKELENKLTNLREEKEQLKARETKISKKKTKLELKIKDLQDQITGNE 320 (1200)
T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHhhhhh
Confidence 34556677788888888888888888888999999999999999998887777653
No 119
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=76.07 E-value=31 Score=26.22 Aligned_cols=23 Identities=22% Similarity=0.310 Sum_probs=12.0
Q ss_pred hCCchhHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
+.-|++||.+|+..+..|..+..
T Consensus 20 I~LLqmEieELKekn~~L~~e~~ 42 (79)
T PRK15422 20 ITLLQMEIEELKEKNNSLSQEVQ 42 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33345555555555555555443
No 120
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=76.06 E-value=26 Score=34.70 Aligned_cols=75 Identities=17% Similarity=0.205 Sum_probs=44.8
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
+.+.....+...+..++.+++.+|..+..++...+..+..++..+++....+..++.+...-+..++.++.-++.
T Consensus 52 ~~i~~~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~~~~~l~~l~~q~r~qr~~La~~L~A~~r 126 (420)
T COG4942 52 KKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEVQEREQRRRLAEQLAALQR 126 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 333344444555555555666666666666666666666666666666666666666665666666666654443
No 121
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=75.75 E-value=50 Score=37.32 Aligned_cols=44 Identities=11% Similarity=0.213 Sum_probs=32.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhc
Q psy21 237 ARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEA 280 (288)
Q Consensus 237 ~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEg 280 (288)
..+..+...+..+.......+..|+.-+-.|+..|+.+..-|+.
T Consensus 1050 ~~~~~e~~~l~~~~~~l~~~~a~l~g~~k~le~qi~~l~~eL~e 1093 (1311)
T TIGR00606 1050 LQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELRE 1093 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 44556666666666666667778888888888888888877754
No 122
>KOG0976|consensus
Probab=75.74 E-value=26 Score=37.54 Aligned_cols=63 Identities=11% Similarity=0.079 Sum_probs=41.5
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
.+|-..+|..+.-++.++-.+++++.-|+..+.++|.+++.+|+.|..++.++.+...+++.-
T Consensus 84 tqetriyRrdv~llEddlk~~~sQiriLQn~c~~lE~ekq~lQ~ti~~~q~d~ke~etelE~~ 146 (1265)
T KOG0976|consen 84 TQETRIYRRDVNLLEDDLKHHESQIRILQNKCLRLEMEKQKLQDTIQGAQDDKKENEIEIENL 146 (1265)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344455555555566677777777777777777777777777777777766666666555443
No 123
>KOG0978|consensus
Probab=75.70 E-value=16 Score=38.45 Aligned_cols=76 Identities=25% Similarity=0.349 Sum_probs=42.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
+.+++...+....++.++.+....+..+..+..++++++..++..+.+.-.+-. .-+.=-.|-.||..|+.+|-.-
T Consensus 572 q~~~ek~~~~le~i~~~~~e~~~ele~~~~k~~rleEE~e~L~~kle~~k~~~~-~~s~d~~L~EElk~yK~~LkCs 647 (698)
T KOG0978|consen 572 QIELEKSEAKLEQIQEQYAELELELEIEKFKRKRLEEELERLKRKLERLKKEES-GASADEVLAEELKEYKELLKCS 647 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccccHHHHHHHHHHHhceeCC
Confidence 344444444444444444444445555555566666666666666655433222 2333345778999999988643
No 124
>KOG0161|consensus
Probab=75.62 E-value=26 Score=41.05 Aligned_cols=42 Identities=31% Similarity=0.367 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
|.++..+...+..-..--..|...|..|+.+|++|..=|+-+
T Consensus 1694 E~e~~E~~e~i~~~~~~~s~l~~~KrklE~~i~~l~~elee~ 1735 (1930)
T KOG0161|consen 1694 ELELEELAERVNELNAQNSSLTAEKRKLEAEIAQLQSELEEE 1735 (1930)
T ss_pred HHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHHHH
Confidence 444444444444433334567778889999999988777643
No 125
>PF07058 Myosin_HC-like: Myosin II heavy chain-like; InterPro: IPR009768 This family represents a conserved region within a number of myosin II heavy chain-like proteins that seem to be specific to Arabidopsis thaliana.
Probab=75.57 E-value=11 Score=35.78 Aligned_cols=74 Identities=18% Similarity=0.249 Sum_probs=43.5
Q ss_pred hHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH----HH----HHHHhhhhhhh
Q psy21 206 TEMNELRAANYRLTERNK----------ELEDQIEYEQSKFARMEAEIKRLQTEMTEQL----KD----YRELMEIKINL 267 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~----------~le~~l~~~q~~i~~le~el~~lr~~~~~ql----~e----yq~LL~vKl~L 267 (288)
.|+..|-.++..||..+- +++++++.++..-..||.||++.+-...+.. .| ...+|.||.-|
T Consensus 35 ~EVEKLsqTi~ELEEaiLagGaaaNavrdYqrq~~elneEkrtLeRELARaKV~aNRVA~vvANEWKD~nDkvMPVKqWL 114 (351)
T PF07058_consen 35 LEVEKLSQTIRELEEAILAGGAAANAVRDYQRQVQELNEEKRTLERELARAKVSANRVATVVANEWKDENDKVMPVKQWL 114 (351)
T ss_pred HHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcccccccCCccccHHHHH
Confidence 567777777777776542 3444444444444455666666655443332 12 23578888888
Q ss_pred hhhHHHhhhhhhccccC
Q psy21 268 DVEIAQYRNLMEAEESS 284 (288)
Q Consensus 268 e~EIatYrkLLEgEe~R 284 (288)
|- |++|.||=.|
T Consensus 115 EE-----RR~lQgEmQ~ 126 (351)
T PF07058_consen 115 EE-----RRFLQGEMQQ 126 (351)
T ss_pred HH-----HHHHHHHHHH
Confidence 64 7788777544
No 126
>PRK00846 hypothetical protein; Provisional
Probab=75.57 E-value=30 Score=26.11 Aligned_cols=44 Identities=16% Similarity=0.013 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
.+..++..||..+.-.+..++.++..+......+..++..+...
T Consensus 10 ~le~Ri~~LE~rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L 53 (77)
T PRK00846 10 ALEARLVELETRLSFQEQALTELSEALADARLTGARNAELIRHL 53 (77)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45556666666666666666666666666666666666655543
No 127
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=75.52 E-value=37 Score=34.45 Aligned_cols=23 Identities=9% Similarity=0.283 Sum_probs=10.2
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql 254 (288)
|..+|..|-+.|..|...++.|-
T Consensus 485 YE~QLs~MSEHLasmNeqL~~Q~ 507 (518)
T PF10212_consen 485 YEEQLSMMSEHLASMNEQLAKQR 507 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444443
No 128
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=75.40 E-value=22 Score=26.00 Aligned_cols=40 Identities=23% Similarity=0.189 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 212 RAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 212 ~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
..++..||..+...+..++.++..+.....+|..++..+.
T Consensus 3 e~Ri~~LE~~la~qe~~ie~Ln~~v~~Qq~~I~~L~~~l~ 42 (69)
T PF04102_consen 3 EERIEELEIKLAFQEDTIEELNDVVTEQQRQIDRLQRQLR 42 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555555555555555555555544443
No 129
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=75.29 E-value=68 Score=31.27 Aligned_cols=8 Identities=25% Similarity=0.468 Sum_probs=3.7
Q ss_pred HHhhhhhh
Q psy21 272 AQYRNLME 279 (288)
Q Consensus 272 atYrkLLE 279 (288)
..|..||+
T Consensus 356 ~~y~~ll~ 363 (444)
T TIGR03017 356 RAYDAAMQ 363 (444)
T ss_pred HHHHHHHH
Confidence 34544443
No 130
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=75.05 E-value=17 Score=27.58 Aligned_cols=55 Identities=11% Similarity=0.106 Sum_probs=28.0
Q ss_pred hHHHhHHHHHHHHHHhhC-------CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy21 185 KEQSSLLLELKETKGRSG-------KYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~-------~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l 239 (288)
..++.+.+.+.-+++.+. .+..+++.++.....|+.+...+..+...++..+..|
T Consensus 11 ~KIqqAvdtI~LLqmEieELKekn~~L~~e~~~~~~~r~~L~~en~qLk~E~~~WqerLr~L 72 (79)
T PRK15422 11 AKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQAL 72 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444 4444555555555556666555555555555554443
No 131
>KOG0804|consensus
Probab=74.83 E-value=21 Score=35.58 Aligned_cols=23 Identities=17% Similarity=0.208 Sum_probs=10.6
Q ss_pred hhHHHhHHHHHHHHHHhhCCchh
Q psy21 184 RKEQSSLLLELKETKGRSGKYDT 206 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ 206 (288)
+..|..+..|...+.....+++.
T Consensus 353 k~~~e~~~~e~~~l~~~~~~~e~ 375 (493)
T KOG0804|consen 353 KQYYELLITEADSLKQESSDLEA 375 (493)
T ss_pred HHHHHHHHHHHHhhhhhhhHHHH
Confidence 34444455555554444444433
No 132
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=74.71 E-value=24 Score=37.66 Aligned_cols=26 Identities=23% Similarity=0.229 Sum_probs=12.3
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
++.++..|+.+|..-|.........|
T Consensus 678 l~~Ki~~Le~Ele~er~~~~e~~~kc 703 (769)
T PF05911_consen 678 LQSKISSLEEELEKERALSEELEAKC 703 (769)
T ss_pred HHHHHHHHHHHHHHHHhcchhhhhHH
Confidence 44455555555555444433333333
No 133
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=74.52 E-value=19 Score=33.23 Aligned_cols=23 Identities=22% Similarity=0.125 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQ 233 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q 233 (288)
|..+++.|+.++.+|.++++..+
T Consensus 59 l~~ql~~lq~ev~~LrG~~E~~~ 81 (263)
T PRK10803 59 LQQQLSDNQSDIDSLRGQIQENQ 81 (263)
T ss_pred HHHHHHHHHHHHHHHhhHHHHHH
Confidence 33334444444444444433333
No 134
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=74.52 E-value=55 Score=28.07 Aligned_cols=50 Identities=22% Similarity=0.254 Sum_probs=25.6
Q ss_pred heeccchh-HH-HHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHH
Q psy21 165 ILLNTDGE-LA-NQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTER 221 (288)
Q Consensus 165 ~L~d~~ge-vA-~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~ 221 (288)
.|++.+|. ++ ..-.+-. ...|.|-.+..+.+....++..|+..+.+|..+
T Consensus 14 ~lId~~G~e~v~~LmP~VV-------~vLE~Le~~~~~n~~~~~e~~~L~~d~e~L~~q 65 (158)
T PF09744_consen 14 RLIDRYGEEAVKGLMPKVV-------RVLELLESLASRNQEHEVELELLREDNEQLETQ 65 (158)
T ss_pred HHHHHhChhHHHHHHHHHH-------HHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHH
Confidence 45677777 33 2222211 244555555555555555566666555555543
No 135
>PF13805 Pil1: Eisosome component PIL1; PDB: 3PLT_B.
Probab=74.44 E-value=76 Score=29.67 Aligned_cols=24 Identities=13% Similarity=0.221 Sum_probs=12.2
Q ss_pred CCchhHHHHHHHHHHHHHHHHHHH
Q psy21 202 GKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 202 ~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
.+.+..++..+...+.|..+|..+
T Consensus 134 R~~E~sl~p~R~~r~~l~d~I~kL 157 (271)
T PF13805_consen 134 RNREESLQPSRDRRRKLQDEIAKL 157 (271)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHhHHHHHhHHHHHHHHHH
Confidence 334455555555555555555443
No 136
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=74.30 E-value=40 Score=32.43 Aligned_cols=23 Identities=17% Similarity=0.215 Sum_probs=12.3
Q ss_pred hCCchhHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
++.-+.|.+.|+.+++.|.+++-
T Consensus 94 ~~e~q~e~~qL~~qnqkL~nqL~ 116 (401)
T PF06785_consen 94 VEERQQESEQLQSQNQKLKNQLF 116 (401)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHH
Confidence 33344555556666666555553
No 137
>KOG0288|consensus
Probab=74.04 E-value=58 Score=32.25 Aligned_cols=56 Identities=25% Similarity=0.299 Sum_probs=30.0
Q ss_pred ccchhHHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 168 NTDGELANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 168 d~~gevA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
+.+.++|.+......+....-....|-+..+..++..+.+|+.|+..+..|..++-
T Consensus 17 ~~~~~laq~~k~~s~~~aq~~~~~a~~~ai~a~~~~~E~~l~~Lq~e~~~l~e~~v 72 (459)
T KOG0288|consen 17 DLNTELAQCEKAQSRLSAQLVILRAESRAIKAKLQEKELELNRLQEENTQLNEERV 72 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333555444444333333334445555566666667777777776666665543
No 138
>PF06818 Fez1: Fez1; InterPro: IPR009638 This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5 []. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth []. Note that some family members contain more than one copy of this region.; GO: 0005737 cytoplasm, 0016020 membrane
Probab=74.01 E-value=48 Score=29.62 Aligned_cols=72 Identities=25% Similarity=0.349 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhHHHhhhhhh
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKIN-LDVEIAQYRNLME 279 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~-Le~EIatYrkLLE 279 (288)
+|-.|+..+..+...+...+.++..++..+..-..++..+..++.+...+- .||.-|+. |+.||+.-|.-+.
T Consensus 32 Eiv~Lr~ql~e~~~~l~~~~~~~~~l~~~~~~K~~ELE~ce~ELqr~~~Ea-~lLrekl~~le~El~~Lr~~l~ 104 (202)
T PF06818_consen 32 EIVSLRAQLRELRAELRNKESQIQELQDSLRTKQLELEVCENELQRKKNEA-ELLREKLGQLEAELAELREELA 104 (202)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhHhHHHhHHHHHHHhCHH-HHhhhhhhhhHHHHHHHHHHHH
Confidence 333333333333333333333333333333333444444444444443332 33333333 3666666665443
No 139
>KOG0933|consensus
Probab=73.95 E-value=55 Score=35.92 Aligned_cols=52 Identities=13% Similarity=0.245 Sum_probs=38.4
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhc
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEA 280 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEg 280 (288)
+.++...|..+...++.-..++...-++|+.|.--=-.|..||..+...|++
T Consensus 789 lkdl~keik~~k~~~e~~~~~~ek~~~e~e~l~lE~e~l~~e~~~~k~~l~~ 840 (1174)
T KOG0933|consen 789 LKDLEKEIKTAKQRAEESSKELEKRENEYERLQLEHEELEKEISSLKQQLEQ 840 (1174)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555666666666777777888888888887777888888888888865
No 140
>KOG4643|consensus
Probab=73.94 E-value=24 Score=38.51 Aligned_cols=65 Identities=25% Similarity=0.248 Sum_probs=41.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 205 DTEMNELRAANYRLTERNKELEDQI----------------EYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l----------------~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
+..+.+|+..++.|.++-+-|+.+| =.++.++..|+.+..-.+..++..+.|.+.|...+-+|+
T Consensus 263 kdRveelkedN~vLleekeMLeeQLq~lrarse~~tleseiiqlkqkl~dm~~erdtdr~kteeL~eEnstLq~q~eqL~ 342 (1195)
T KOG4643|consen 263 KDRVEELKEDNRVLLEEKEMLEEQLQKLRARSEGATLESEIIQLKQKLDDMRSERDTDRHKTEELHEENSTLQVQKEQLD 342 (1195)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHhccccCChHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhh
Confidence 5566666665554444433333333 345677777777777777777887888888887776665
Q ss_pred h
Q psy21 269 V 269 (288)
Q Consensus 269 ~ 269 (288)
.
T Consensus 343 ~ 343 (1195)
T KOG4643|consen 343 G 343 (1195)
T ss_pred h
Confidence 3
No 141
>PF10481 CENP-F_N: Cenp-F N-terminal domain; InterPro: IPR018463 Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore []. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal part of the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein [].
Probab=73.92 E-value=24 Score=32.95 Aligned_cols=64 Identities=23% Similarity=0.358 Sum_probs=39.6
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHhhHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELED---------------------QIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~---------------------~l~~~q~~i~~le~el~~lr~~ 249 (288)
-.-++..+.++.+-++++..|++.++.|-..+.+++. ++......|..|+.++.+++.+
T Consensus 45 EAaLqKQKqK~e~ek~e~s~LkREnq~l~e~c~~lek~rqKlshdlq~Ke~qv~~lEgQl~s~Kkqie~Leqelkr~KsE 124 (307)
T PF10481_consen 45 EAALQKQKQKVEEEKNEYSALKRENQSLMESCENLEKTRQKLSHDLQVKESQVNFLEGQLNSCKKQIEKLEQELKRCKSE 124 (307)
T ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHHHHHHHHHHhhHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345556666666677777777777777776665543 3344445566666666666666
Q ss_pred HHHHH
Q psy21 250 MTEQL 254 (288)
Q Consensus 250 ~~~ql 254 (288)
+++..
T Consensus 125 LErsQ 129 (307)
T PF10481_consen 125 LERSQ 129 (307)
T ss_pred HHHHH
Confidence 66543
No 142
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=73.71 E-value=25 Score=34.61 Aligned_cols=18 Identities=22% Similarity=0.189 Sum_probs=12.3
Q ss_pred hhhHHHHhhhhhhhhccc
Q psy21 7 TRIEEVQKKRKTELLCMN 24 (288)
Q Consensus 7 ~ei~~yrklle~ee~r~~ 24 (288)
.+|+..+|.||.=..||.
T Consensus 62 ~~i~~lqkkL~~y~~~l~ 79 (395)
T PF10267_consen 62 QTIAQLQKKLEQYHKRLK 79 (395)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 367777777777777764
No 143
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=73.29 E-value=53 Score=31.37 Aligned_cols=27 Identities=26% Similarity=0.468 Sum_probs=13.6
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 233 QSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
+..+...+.++..++.++.....++..
T Consensus 209 ~~~l~~~~~~l~~~~~~l~~~~~~~~~ 235 (423)
T TIGR01843 209 QGELGRLEAELEVLKRQIDELQLERQQ 235 (423)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455555555555555544444443
No 144
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=72.51 E-value=41 Score=25.73 Aligned_cols=31 Identities=13% Similarity=0.181 Sum_probs=12.9
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
+..+..++..+..++..|...+..++..+.+
T Consensus 7 ~~~l~~~l~~~~~q~~~l~~~~~~~~~~~~e 37 (106)
T PF01920_consen 7 FQELNQQLQQLEQQIQQLERQLRELELTLEE 37 (106)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333434444444444444444443333
No 145
>KOG0996|consensus
Probab=72.47 E-value=39 Score=37.53 Aligned_cols=88 Identities=14% Similarity=0.229 Sum_probs=52.3
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQI--------------EYEQSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l--------------~~~q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
..+.+.+++++-.+.++..|...-..+.....+++..| ......|.++..++.+...++.....+-
T Consensus 493 ~~~n~~~~e~~vaesel~~L~~~~~~~~~~~e~lk~~L~~~~~~~~e~~~~l~~~k~~l~~~k~e~~~~~k~l~~~~~e~ 572 (1293)
T KOG0996|consen 493 KQVNEARSELDVAESELDILLSRHETGLKKVEELKGKLLASSESLKEKKTELDDLKEELPSLKQELKEKEKELPKLRKEE 572 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHhHHHHHHHH
Confidence 33444555555555666555555444444444444333 2334445555556666666666666677
Q ss_pred HHHhhhhhhhhhhHHHhhhhhh
Q psy21 258 RELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 258 q~LL~vKl~Le~EIatYrkLLE 279 (288)
+.|-.....+..++..|+.-+.
T Consensus 573 ~~~~~~~~~~rqrveE~ks~~~ 594 (1293)
T KOG0996|consen 573 RNLKSQLNKLRQRVEEAKSSLS 594 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 7777777888888888887443
No 146
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=72.40 E-value=61 Score=27.71 Aligned_cols=95 Identities=23% Similarity=0.265 Sum_probs=47.2
Q ss_pred hHHHhHHHHHHHHHHhhCC-------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHH-------HHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGK-------YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAE-------IKRLQTEM 250 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~-------l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~e-------l~~lr~~~ 250 (288)
..+..+..|+..++..+.. .+..+..+......+..++...+..+...+..+.....+ ...++..+
T Consensus 56 ~kIeERn~eL~~Lk~~~~~~v~~L~h~keKl~~~~~~~~~l~~~l~~~~~~~~~~r~~l~~~k~~r~k~~~~~~~l~~~~ 135 (177)
T PF13870_consen 56 EKIEERNKELLKLKKKIGKTVQILTHVKEKLHFLSEELERLKQELKDREEELAKLREELYRVKKERDKLRKQNKKLRQQG 135 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3344445555555544322 345555555555555555554444444444444333333 33333322
Q ss_pred H-----HHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 251 T-----EQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 251 ~-----~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
. ..+.+|....+---.|..+|..+++-++
T Consensus 136 ~~~~~P~ll~Dy~~~~~~~~~l~~~i~~l~rk~~ 169 (177)
T PF13870_consen 136 GLLGVPALLRDYDKTKEEVEELRKEIKELERKVE 169 (177)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1 2234666666665667777777776554
No 147
>KOG2391|consensus
Probab=72.27 E-value=28 Score=33.46 Aligned_cols=35 Identities=14% Similarity=0.161 Sum_probs=16.8
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQ 228 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~ 228 (288)
...++...++++.-.++|...+..||.+...++..
T Consensus 234 q~slkRt~EeL~~G~~kL~~~~etLEqq~~~L~~n 268 (365)
T KOG2391|consen 234 QESLKRTEEELNIGKQKLVAMKETLEQQLQSLQKN 268 (365)
T ss_pred HHHHHhhHHHHHhhHHHHHHHHHHHHHHHHHHHhh
Confidence 33333444444555555555555555555544443
No 148
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=72.13 E-value=69 Score=28.24 Aligned_cols=55 Identities=18% Similarity=0.402 Sum_probs=28.3
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
+.+|.+|...+-.|+++..-+....+.....|++++.....+..++...-.+|+.
T Consensus 130 e~~i~~Le~ki~el~~~~~~~~~~ke~~~~ei~~lks~~~~l~~~~~~~e~~F~~ 184 (190)
T PF05266_consen 130 ESEIKELEMKILELQRQAAKLKEKKEAKDKEISRLKSEAEALKEEIENAELEFQS 184 (190)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444333333334444556666666666666666665555554
No 149
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=72.00 E-value=12 Score=37.51 Aligned_cols=26 Identities=15% Similarity=0.276 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 225 LEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 225 le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
+..+.++++.+|..++.++..|+.++
T Consensus 95 ~saq~~dle~KIkeLEaE~~~Lk~Ql 120 (475)
T PRK13729 95 LNKQRGDDQRRIEKLGQDNAALAEQV 120 (475)
T ss_pred HhhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 33444444555555666666665555
No 150
>KOG2685|consensus
Probab=71.85 E-value=70 Score=31.61 Aligned_cols=92 Identities=12% Similarity=0.139 Sum_probs=57.0
Q ss_pred hhHHHhHHHHHHHHHHhhCC----chhHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHH----HH
Q psy21 184 RKEQSSLLLELKETKGRSGK----YDTEMNELRAANYRLTERNKELED--------------------QIEYEQ----SK 235 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~----l~~ei~~L~~~~~~Le~~l~~le~--------------------~l~~~q----~~ 235 (288)
...+...+.|.+..+.+++. .-.||.+....|..|+..|..-++ .....| ..
T Consensus 273 e~af~~ri~etqdar~kL~~ql~k~leEi~~~e~~I~~le~airdK~~pLKVAqTRle~Rt~RPnvELCrD~AQ~~L~~E 352 (421)
T KOG2685|consen 273 ELAFKKRIRETQDARNKLEWQLAKTLEEIADAENNIEALERAIRDKEGPLKVAQTRLENRTYRPNVELCRDQAQYRLVDE 352 (421)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhcccccHHHHHHHHHHcccCCchHHHHhHHHHHHHHH
Confidence 34555556666666655433 345677777777777776652111 011112 23
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhh
Q psy21 236 FARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYR 275 (288)
Q Consensus 236 i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYr 275 (288)
+..+..-++.++..+..--...+.|-+.+.+|+.+|+.=.
T Consensus 353 V~~l~~t~~~L~~kL~eA~~~l~~L~~~~~rLe~di~~k~ 392 (421)
T KOG2685|consen 353 VHELDDTVAALKEKLDEAEDSLKLLVNHRARLERDIAIKA 392 (421)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3445666667777777767788899999999999998654
No 151
>COG1382 GimC Prefoldin, chaperonin cofactor [Posttranslational modification, protein turnover, chaperones]
Probab=71.62 E-value=55 Score=26.82 Aligned_cols=34 Identities=15% Similarity=0.211 Sum_probs=19.0
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
.|+...+++.....+.+.++.+|.+....+..|+
T Consensus 14 q~QqLq~ql~~~~~qk~~le~qL~E~~~al~Ele 47 (119)
T COG1382 14 QLQQLQQQLQKVILQKQQLEAQLKEIEKALEELE 47 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4445555555555555556666666555544444
No 152
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=71.28 E-value=75 Score=30.31 Aligned_cols=10 Identities=20% Similarity=0.498 Sum_probs=5.1
Q ss_pred EecCCcEEEE
Q psy21 124 KIEAGGTVTV 133 (288)
Q Consensus 124 ~L~~g~~vtI 133 (288)
.++.|+.+-.
T Consensus 64 ~V~kG~~L~~ 73 (423)
T TIGR01843 64 RVKAGQVLVE 73 (423)
T ss_pred EecCCCeEEE
Confidence 3466665543
No 153
>smart00637 CBD_II CBD_II domain.
Probab=71.20 E-value=22 Score=26.94 Aligned_cols=45 Identities=22% Similarity=0.403 Sum_probs=26.2
Q ss_pred EEEEeCCccceeccCeEEEEEeCCcccE-------------EEEecC---CeEecCCcEEEEE
Q psy21 88 VKLYNKGSEEQSLGNWQLVRKVQGEVKT-------------SFKFHR---TLKIEAGGTVTVW 134 (288)
Q Consensus 88 V~l~N~s~~~~~L~gw~l~r~v~~~~~~-------------~y~Fp~---~~~L~~g~~vtIw 134 (288)
|.|.|+++. ++.+|.|.-.++++..+ .|.|++ +-.|.||++++|=
T Consensus 12 v~vtN~~~~--~~~~W~v~~~~~~~~~i~~~Wn~~~~~~g~~~~~~~~~wn~~i~~G~s~~~g 72 (92)
T smart00637 12 VTVTNTGSS--AINGWTVTFDLPGGQTVTNSWNATVSQSGGHVTATNASWNGTIAPGGSVSFG 72 (92)
T ss_pred EEEEeCCCC--cccCeEEEEEcCCCcEEeeeEEEEEEecCCEEEEecCccccccCCCCEEEEE
Confidence 567888765 36788887666542211 355542 2366677766653
No 154
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=70.90 E-value=78 Score=31.40 Aligned_cols=20 Identities=20% Similarity=0.511 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy21 242 EIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 242 el~~lr~~~~~ql~eyq~LL 261 (288)
++..+..+..-.-..|..|+
T Consensus 356 el~~L~Re~~~~~~~Y~~l~ 375 (498)
T TIGR03007 356 ELTQLNRDYEVNKSNYEQLL 375 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333
No 155
>KOG4809|consensus
Probab=70.81 E-value=63 Score=33.13 Aligned_cols=13 Identities=31% Similarity=0.485 Sum_probs=9.6
Q ss_pred hhhhhHHHhhhhh
Q psy21 266 NLDVEIAQYRNLM 278 (288)
Q Consensus 266 ~Le~EIatYrkLL 278 (288)
.||-|+.-|++++
T Consensus 425 ~le~e~~~y~de~ 437 (654)
T KOG4809|consen 425 QLEKEASYYRDEC 437 (654)
T ss_pred HHHHHHHHHHHHH
Confidence 5777788887765
No 156
>PRK11020 hypothetical protein; Provisional
Probab=70.73 E-value=41 Score=27.34 Aligned_cols=60 Identities=22% Similarity=0.263 Sum_probs=36.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHhhHHHHHHHHHHHHHHHH-HHHHHHhhh
Q psy21 204 YDTEMNELRAANYRLTERNKELED-----QIEYEQSKFARMEAEIKRLQTEMTEQL-KDYRELMEI 263 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~-----~l~~~q~~i~~le~el~~lr~~~~~ql-~eyq~LL~v 263 (288)
++.||+.|..+++.+...+..++. -+.++...+..|+.++++++..-...+ .+-+.|.+.
T Consensus 3 ~K~Eiq~L~drLD~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I~~lk~~~~~~lske~~~l~~l 68 (118)
T PRK11020 3 EKNEIKRLSDRLDAIRHKLAAASLRGDAEKYAQFEKEKATLEAEIARLKEVQSQKLSKEAQKLMKL 68 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 455666666666666666655442 245556666677777777776555443 355666654
No 157
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=70.03 E-value=36 Score=27.91 Aligned_cols=13 Identities=23% Similarity=0.549 Sum_probs=5.2
Q ss_pred HHHHHHHHHHHHH
Q psy21 239 MEAEIKRLQTEMT 251 (288)
Q Consensus 239 le~el~~lr~~~~ 251 (288)
|+.++..+...+.
T Consensus 103 le~e~~~~~~r~~ 115 (132)
T PF07926_consen 103 LEKELSELEQRIE 115 (132)
T ss_pred HHHHHHHHHHHHH
Confidence 3444444443333
No 158
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=69.60 E-value=80 Score=31.13 Aligned_cols=35 Identities=14% Similarity=0.194 Sum_probs=16.7
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
.+..++.++..+.++++.++..|-.+-..|+++.+
T Consensus 134 kAqQ~lar~t~Q~q~lqtrl~~l~~qr~ql~aq~q 168 (499)
T COG4372 134 KAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQ 168 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444555555555444444444444443
No 159
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=69.55 E-value=81 Score=27.92 Aligned_cols=30 Identities=17% Similarity=0.403 Sum_probs=16.6
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 230 EYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 230 ~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
+-+...+..++.+...+.......+.++|.
T Consensus 110 evL~qr~~kle~ErdeL~~kf~~~i~evqQ 139 (201)
T PF13851_consen 110 EVLEQRFEKLEQERDELYRKFESAIQEVQQ 139 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455555566666666555555555554
No 160
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=69.02 E-value=58 Score=27.24 Aligned_cols=16 Identities=13% Similarity=-0.029 Sum_probs=6.4
Q ss_pred hHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTER 221 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~ 221 (288)
..+..+.+++..|...
T Consensus 40 ~~l~~~~~qL~~l~~~ 55 (135)
T TIGR03495 40 AELASKANQLIVLLAL 55 (135)
T ss_pred HHHHHHHhHHHHHHHH
Confidence 3334444444444333
No 161
>KOG1962|consensus
Probab=67.99 E-value=49 Score=29.88 Aligned_cols=53 Identities=23% Similarity=0.392 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEI 263 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~v 263 (288)
|+..++.+.+.+.-++.+++..+..+...+.+...++.+.+...+||-.|+.-
T Consensus 149 ~~~~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e~~~~EydrLlee 201 (216)
T KOG1962|consen 149 LEEENDKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSEGLQDEYDRLLEE 201 (216)
T ss_pred hhhhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHH
Confidence 55556666666677777777788888888888888888888888899988865
No 162
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=67.94 E-value=93 Score=28.02 Aligned_cols=26 Identities=27% Similarity=0.647 Sum_probs=12.2
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHH
Q psy21 233 QSKFARMEAEIKRLQTEMTEQLKDYR 258 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~~ql~eyq 258 (288)
...+..|+.++..+..++...-..|.
T Consensus 196 E~~v~~Le~~id~le~eL~~~k~~~~ 221 (237)
T PF00261_consen 196 ERRVKKLEKEIDRLEDELEKEKEKYK 221 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455555555555544444443
No 163
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=67.31 E-value=58 Score=25.38 Aligned_cols=63 Identities=22% Similarity=0.355 Sum_probs=35.3
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKEL----------EDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l----------e~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
..+++-.+-.+...+..+++.|++....+..++..+ ..+.......|..++.++..+..++..
T Consensus 27 ~vd~i~~ld~~~r~l~~~~e~lr~~rN~~sk~I~~~~~~~~~~~~l~~e~~~lk~~i~~le~~~~~~e~~l~~ 99 (108)
T PF02403_consen 27 DVDEIIELDQERRELQQELEELRAERNELSKEIGKLKKAGEDAEELKAEVKELKEEIKELEEQLKELEEELNE 99 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHTTCCTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555556666666666666666655433 334445555566666666555555543
No 164
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=67.13 E-value=55 Score=30.50 Aligned_cols=74 Identities=23% Similarity=0.317 Sum_probs=45.2
Q ss_pred HHHHHHHhhCCchhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy21 193 ELKETKGRSGKYDTEMNELRA----ANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKIN 266 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~----~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~ 266 (288)
+++..+.+.-....+++++++ .+..+..+++.++.++.+....-..|+..+.+.+.++++..+-++.|-+++=+
T Consensus 145 ~lr~~R~~a~~r~~e~~~iE~~l~~ai~~~~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq~vRPA 222 (267)
T PF10234_consen 145 ELREERQRALARPLELNEIEKALKEAIKAVQQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQSVRPA 222 (267)
T ss_pred hHHHHHHHHHcCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChH
Confidence 344444433333444544443 33444555556666666666666777777777788888777777777777643
No 165
>PF05700 BCAS2: Breast carcinoma amplified sequence 2 (BCAS2); InterPro: IPR008409 This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins []. BCAS2 is a putative spliceosome associated protein [].
Probab=67.08 E-value=68 Score=28.69 Aligned_cols=12 Identities=42% Similarity=0.221 Sum_probs=6.6
Q ss_pred heeccchhHHHH
Q psy21 165 ILLNTDGELANQ 176 (288)
Q Consensus 165 ~L~d~~gevA~~ 176 (288)
.|.+.+|.-||.
T Consensus 124 eLl~~~g~naW~ 135 (221)
T PF05700_consen 124 ELLSKYGENAWL 135 (221)
T ss_pred HHHHHHhHHHHH
Confidence 455666664443
No 166
>KOG0804|consensus
Probab=67.06 E-value=1.1e+02 Score=30.61 Aligned_cols=25 Identities=12% Similarity=0.131 Sum_probs=11.8
Q ss_pred HHhhCCchhHHHHHHHHHHHHHHHH
Q psy21 198 KGRSGKYDTEMNELRAANYRLTERN 222 (288)
Q Consensus 198 ~~~~~~l~~ei~~L~~~~~~Le~~l 222 (288)
.....+++.+.+.+++..+.++..+
T Consensus 360 ~~e~~~l~~~~~~~e~~kk~~e~k~ 384 (493)
T KOG0804|consen 360 ITEADSLKQESSDLEAEKKIVERKL 384 (493)
T ss_pred HHHHHhhhhhhhHHHHHHHHHHHHH
Confidence 3334444555555555544444443
No 167
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=66.96 E-value=1.1e+02 Score=29.15 Aligned_cols=38 Identities=11% Similarity=0.042 Sum_probs=17.5
Q ss_pred cCccchhhheeccchhHHHHHHHHh--hhhhHHHhHHHHH
Q psy21 157 GSGDEISTILLNTDGELANQERIID--NLRKEQSSLLLEL 194 (288)
Q Consensus 157 g~g~~i~t~L~d~~gevA~~~~~~~--~~~~~~~~~~eEl 194 (288)
...+++...+-++--.|=++.+..+ .|..|=....+.|
T Consensus 106 ~a~~d~r~lm~~Qf~lvK~~aRl~ak~~WYeWR~kllegL 145 (312)
T smart00787 106 SASPDVKLLMDKQFQLVKTFARLEAKKMWYEWRMKLLEGL 145 (312)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456655444333333555555544 4444443333333
No 168
>KOG0946|consensus
Probab=66.82 E-value=63 Score=34.67 Aligned_cols=28 Identities=18% Similarity=0.224 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 249 EMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 249 ~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
++...+-+|+.|.+----|..+...|..
T Consensus 738 el~a~~~e~k~l~~~q~~l~~~L~k~~~ 765 (970)
T KOG0946|consen 738 ELNAALSENKKLENDQELLTKELNKKNA 765 (970)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhH
Confidence 3444555666665544444444444433
No 169
>PF07061 Swi5: Swi5; InterPro: IPR010760 This entry represents Swi5 and is involved in meiotic DNA repair synthesis and meiotic joint molecule formation []. It is known to interact with Swi2, Rhp51 and Swi6 [].
Probab=66.61 E-value=39 Score=25.77 Aligned_cols=63 Identities=24% Similarity=0.328 Sum_probs=35.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
+..++.+|+.++..++.++.++...+.. .-...-...-+.|++|.++=|+=+.|=-=||..|.
T Consensus 5 l~~~~~~L~~~~~~l~~~i~~~~~~l~~----------~~~~~v~~hI~lLheYNeiKD~gQ~Lig~iA~~rg 67 (83)
T PF07061_consen 5 LEAEIQELKEQIEQLEKEISELEAELIE----------DPEKIVKRHIKLLHEYNEIKDIGQGLIGLIADQRG 67 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccc----------CHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHcC
Confidence 4444555555555555554444443211 11112223345688999999998888877777664
No 170
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=66.01 E-value=56 Score=36.92 Aligned_cols=29 Identities=10% Similarity=0.077 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHhhhhhhhhhhHHHhhhh
Q psy21 249 EMTEQLKDYRELMEIKINLDVEIAQYRNL 277 (288)
Q Consensus 249 ~~~~ql~eyq~LL~vKl~Le~EIatYrkL 277 (288)
.+...+..|+..++.-..|.-+|-.|-..
T Consensus 938 ~~~~~~~~~~~~~~~~~~~~~~i~~y~~~ 966 (1311)
T TIGR00606 938 KAQDKVNDIKEKVKNIHGYMKDIENKIQD 966 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 34444556666666666677777776543
No 171
>PF07076 DUF1344: Protein of unknown function (DUF1344); InterPro: IPR009780 This family consists of several short, hypothetical bacterial proteins of around 80 residues in length. Members of this family are found in Rhizobium, Agrobacterium and Brucella species. The function of this family is unknown.
Probab=65.95 E-value=17 Score=26.30 Aligned_cols=37 Identities=27% Similarity=0.540 Sum_probs=26.9
Q ss_pred EEeecCCCcEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeE---ecCCcEEEEEe
Q psy21 78 ITEVDPDGKFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLK---IEAGGTVTVWS 135 (288)
Q Consensus 78 I~e~d~~GkfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~---L~~g~~vtIwa 135 (288)
|.++|++..-|.|. || .+|++|+.|. |+||.+|-|.=
T Consensus 9 I~~id~~~~titLd------------------DG---ksy~lp~ef~~~~L~~G~kV~V~y 48 (61)
T PF07076_consen 9 IKSIDPETMTITLD------------------DG---KSYKLPEEFDFDGLKPGMKVVVFY 48 (61)
T ss_pred EEEEcCCceEEEec------------------CC---CEEECCCcccccccCCCCEEEEEE
Confidence 56777776666653 22 6899998884 67999998864
No 172
>PF03670 UPF0184: Uncharacterised protein family (UPF0184); InterPro: IPR022788 This family of proteins has no known function.
Probab=65.48 E-value=50 Score=25.35 Aligned_cols=43 Identities=12% Similarity=0.187 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
.++.++.+|..-|..||.+..++..++..|-+.-++.|.++..
T Consensus 30 ~ins~LD~Lns~LD~LE~rnD~l~~~L~~LLesnrq~R~e~~~ 72 (83)
T PF03670_consen 30 AINSMLDQLNSCLDHLEQRNDHLHAQLQELLESNRQIRLEFQE 72 (83)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333334433333333333333333344444433
No 173
>PRK01156 chromosome segregation protein; Provisional
Probab=65.41 E-value=79 Score=33.97 Aligned_cols=14 Identities=43% Similarity=0.546 Sum_probs=5.3
Q ss_pred HHhhHHHHHHHHHH
Q psy21 235 KFARMEAEIKRLQT 248 (288)
Q Consensus 235 ~i~~le~el~~lr~ 248 (288)
.+..++..+..++.
T Consensus 696 ~~~~l~~~i~~l~~ 709 (895)
T PRK01156 696 NRARLESTIEILRT 709 (895)
T ss_pred HHHHHHHHHHHHHh
Confidence 33333333333333
No 174
>PF15294 Leu_zip: Leucine zipper
Probab=65.35 E-value=67 Score=30.12 Aligned_cols=73 Identities=22% Similarity=0.298 Sum_probs=36.0
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHhhHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYE---------QSKFARMEAEIKRLQTEMTEQLK 255 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~---------q~~i~~le~el~~lr~~~~~ql~ 255 (288)
+.+....+|...++.++.+++.+-...-.....|+.+|.+++...... ...+..||..++.++.+++..+.
T Consensus 132 kEi~rLq~EN~kLk~rl~~le~~at~~l~Ek~kl~~~L~~lq~~~~~~~~k~~~~~~~q~l~dLE~k~a~lK~e~ek~~~ 211 (278)
T PF15294_consen 132 KEIDRLQEENEKLKERLKSLEKQATSALDEKSKLEAQLKELQDEQGDQKGKKDLSFKAQDLSDLENKMAALKSELEKALQ 211 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccchhhHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555444444444445555555444311111 22345556666666666555544
Q ss_pred HH
Q psy21 256 DY 257 (288)
Q Consensus 256 ey 257 (288)
+-
T Consensus 212 d~ 213 (278)
T PF15294_consen 212 DK 213 (278)
T ss_pred HH
Confidence 33
No 175
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=65.32 E-value=69 Score=32.51 Aligned_cols=67 Identities=25% Similarity=0.316 Sum_probs=41.1
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHhhHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIE-------YEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~-------~~q~~i~~le~el~~lr~~~~ 251 (288)
..+.....||...+..++..+.++..|+..+..|..+|......+. .....+..|+.+|..++.++.
T Consensus 281 ~~l~s~~~ELe~ak~~L~~~k~E~~~L~~~vesL~~ELe~~K~el~~lke~e~~a~~~v~~L~~eL~~~r~eLe 354 (522)
T PF05701_consen 281 SSLASAKKELEEAKKELEKAKEEASSLRASVESLRSELEKEKEELERLKEREKEASSEVSSLEAELNKTRSELE 354 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhHHHHHHHHHHHHH
Confidence 3456666777777777777777777777777777776654443332 223445556666666666553
No 176
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=65.25 E-value=52 Score=34.72 Aligned_cols=48 Identities=21% Similarity=0.280 Sum_probs=21.9
Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 231 YEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
.++.++..|+.++..++.++...-..=+++..++-..++-=..|..||
T Consensus 343 ~l~~~~~~L~~~~~~l~~~~~~~p~~e~~~~~L~R~~~~~~~lY~~lL 390 (726)
T PRK09841 343 ALLEKRQTLEQERKRLNKRVSAMPSTQQEVLRLSRDVEAGRAVYLQLL 390 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455556666666655554433333333333333333333444444
No 177
>KOG4643|consensus
Probab=64.93 E-value=54 Score=35.95 Aligned_cols=72 Identities=18% Similarity=0.278 Sum_probs=51.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccc
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEE 282 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe 282 (288)
|.-++..++..|.-|+.+|+..-..+.++..+|..+++++....+|-+++++-+-+.++++..-..|-++-+
T Consensus 175 L~velAdle~kir~LrqElEEK~enll~lr~eLddleae~~klrqe~~e~l~ea~ra~~yrdeldalre~ae 246 (1195)
T KOG4643|consen 175 LEVELADLEKKIRTLRQELEEKFENLLRLRNELDDLEAEISKLRQEIEEFLDEAHRADRYRDELDALREQAE 246 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhh
Confidence 333444444444555555555556667788888899999998888999999999999988777777766644
No 178
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=64.89 E-value=54 Score=24.32 Aligned_cols=24 Identities=29% Similarity=0.441 Sum_probs=9.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
.-|-.|+..+..++.++..+...+
T Consensus 33 ~~IKKLr~~~~e~e~~~~~l~~~~ 56 (74)
T PF12329_consen 33 NTIKKLRAKIKELEKQIKELKKKL 56 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333333333333333
No 179
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=64.19 E-value=1e+02 Score=32.47 Aligned_cols=20 Identities=15% Similarity=0.330 Sum_probs=7.5
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy21 243 IKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 243 l~~lr~~~~~ql~eyq~LL~ 262 (288)
+.++..+.+-...-|+.||.
T Consensus 378 ~~~L~Re~~~~~~~Y~~ll~ 397 (754)
T TIGR01005 378 LDALQRDAAAKRQLYESYLT 397 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333444433
No 180
>PRK11546 zraP zinc resistance protein; Provisional
Probab=64.07 E-value=28 Score=29.42 Aligned_cols=22 Identities=23% Similarity=0.421 Sum_probs=12.8
Q ss_pred HHHHHhhHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~q 253 (288)
.+.+|..+..|+.+++.++..+
T Consensus 87 D~~kI~aL~kEI~~Lr~kL~e~ 108 (143)
T PRK11546 87 DSSKINAVAKEMENLRQSLDEL 108 (143)
T ss_pred CHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666666666666655443
No 181
>KOG2629|consensus
Probab=63.73 E-value=46 Score=31.37 Aligned_cols=8 Identities=38% Similarity=0.513 Sum_probs=3.1
Q ss_pred hhhhHHHh
Q psy21 267 LDVEIAQY 274 (288)
Q Consensus 267 Le~EIatY 274 (288)
|+-||++-
T Consensus 180 ~esei~~I 187 (300)
T KOG2629|consen 180 LESEINTI 187 (300)
T ss_pred HHHHHHHH
Confidence 33344333
No 182
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=63.70 E-value=52 Score=30.28 Aligned_cols=32 Identities=6% Similarity=0.133 Sum_probs=17.4
Q ss_pred HHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH
Q psy21 196 ETKGRSGKYDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 196 ~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
++..+++.++.||..|+.+++.+.-+++.+..
T Consensus 58 ~l~~ql~~lq~ev~~LrG~~E~~~~~l~~~~~ 89 (263)
T PRK10803 58 QLQQQLSDNQSDIDSLRGQIQENQYQLNQVVE 89 (263)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 34445555555666666665555555555444
No 183
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=63.52 E-value=90 Score=29.15 Aligned_cols=39 Identities=21% Similarity=-0.014 Sum_probs=22.1
Q ss_pred hHHHhHHHHHHHHHHhh---CCchhHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRS---GKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~---~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
.|+..+.+|+...+... +....+.+...+.+.....++.
T Consensus 169 ~WLR~~L~Ei~Ea~e~~~~~~~~e~eke~~~r~l~~~~~ELe 210 (269)
T PF05278_consen 169 DWLRSKLEEILEAKEIYDQHETREEEKEEKDRKLELKKEELE 210 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 78888888888775543 2344444444444444433333
No 184
>PF07385 DUF1498: Protein of unknown function (DUF1498); InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=63.52 E-value=19 Score=32.67 Aligned_cols=45 Identities=18% Similarity=0.316 Sum_probs=22.0
Q ss_pred EEEEeCCccceeccCeEEEEEeCCcccEEEEecCC--eEecCCcEEEEEe
Q psy21 88 VKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRT--LKIEAGGTVTVWS 135 (288)
Q Consensus 88 V~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~--~~L~~g~~vtIwa 135 (288)
|.|.|..........=-+.-.+|| ..++||++ ++|.||++||+.-
T Consensus 122 i~l~~s~~~~~~~~~~~v~V~~DG---~~~t~~aG~~l~L~PGESiTL~P 168 (225)
T PF07385_consen 122 IELYNSDPDGELDADTDVTVPVDG---IRRTVPAGTQLRLNPGESITLPP 168 (225)
T ss_dssp EEEEEB--TTSSB-SS-EEEEETT---EEEEE-TT-EEEE-TT-EEEE-T
T ss_pred EEEEeccCCCccccCCCeEEecCC---cEEEecCCceEEeCCCCeEeeCC
Confidence 677777632211111123334565 56666654 6889999999853
No 185
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=63.39 E-value=32 Score=27.57 Aligned_cols=47 Identities=21% Similarity=0.236 Sum_probs=21.8
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFAR 238 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~ 238 (288)
+.+..+..+++.+-.+|.+|+..+..|..+...|.-+.+.++..+..
T Consensus 8 ~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~ 54 (107)
T PF06156_consen 8 DRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEE 54 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444555555555555555444444444444444433
No 186
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=63.34 E-value=92 Score=26.34 Aligned_cols=32 Identities=19% Similarity=0.210 Sum_probs=14.6
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHH
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTE 220 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~ 220 (288)
...++++.+..+++.++.++..|+..+..++.
T Consensus 10 ~l~a~lq~l~~qie~L~~~i~~l~~~~~e~~~ 41 (145)
T COG1730 10 ELAAQLQILQSQIESLQAQIAALNAAISELQT 41 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444443
No 187
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=63.21 E-value=65 Score=24.56 Aligned_cols=33 Identities=15% Similarity=0.189 Sum_probs=15.3
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
|+....++.....++..++.++..+.-....|+
T Consensus 7 ~~~l~~~l~~~~~q~~~l~~~~~~~~~~~~eL~ 39 (106)
T PF01920_consen 7 FQELNQQLQQLEQQIQQLERQLRELELTLEELE 39 (106)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444455555555555544444444333
No 188
>PRK11519 tyrosine kinase; Provisional
Probab=63.00 E-value=71 Score=33.69 Aligned_cols=16 Identities=25% Similarity=0.304 Sum_probs=6.8
Q ss_pred HHhhHHHHHHHHHHHH
Q psy21 235 KFARMEAEIKRLQTEM 250 (288)
Q Consensus 235 ~i~~le~el~~lr~~~ 250 (288)
.+..|+.++..++.++
T Consensus 347 ~~~~L~~~~~~l~~~~ 362 (719)
T PRK11519 347 KRKALEDEKAKLNGRV 362 (719)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444444433
No 189
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=62.84 E-value=45 Score=27.60 Aligned_cols=85 Identities=27% Similarity=0.357 Sum_probs=41.3
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHhhHHHHHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYE-------------QSKFARMEAEIKRLQTEMTEQLKD 256 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~-------------q~~i~~le~el~~lr~~~~~ql~e 256 (288)
...+...+...+-+++.+|..++..+..+-..+..++.+.... .+....|...+.....+.....
T Consensus 39 l~~~n~~lAe~nL~~~~~l~~~r~~l~~~~~~~~~L~~~~~~k~~~~~~l~~~~s~~~l~~~L~~~~~e~eeeSe~la-- 116 (150)
T PF07200_consen 39 LLAENEELAEQNLSLEPELEELRSQLQELYEELKELESEYQEKEQQQDELSSNYSPDALLARLQAAASEAEEESEELA-- 116 (150)
T ss_dssp HHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHC--
T ss_pred HHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHH--
Confidence 3444444444444555666666666655555555544433211 2223334444444444333332
Q ss_pred HHHHhhhhhhhhhhHHHhhhh
Q psy21 257 YRELMEIKINLDVEIAQYRNL 277 (288)
Q Consensus 257 yq~LL~vKl~Le~EIatYrkL 277 (288)
+..++.++.++.-|..|+..
T Consensus 117 -e~fl~g~~d~~~Fl~~f~~~ 136 (150)
T PF07200_consen 117 -EEFLDGEIDVDDFLKQFKEK 136 (150)
T ss_dssp --S-SSSHHHHHHHHHHHHHH
T ss_pred -HHHhCCCCCHHHHHHHHHHH
Confidence 44577888888888888753
No 190
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=62.64 E-value=2e+02 Score=30.34 Aligned_cols=28 Identities=21% Similarity=0.354 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKINL 267 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl~L 267 (288)
+.++.++..+.+-.-.-|..||.-...+
T Consensus 369 e~~~~~L~R~~~~~~~lY~~lL~r~~e~ 396 (726)
T PRK09841 369 QQEVLRLSRDVEAGRAVYLQLLNRQQEL 396 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555555555555555555554433
No 191
>KOG0288|consensus
Probab=62.61 E-value=1.4e+02 Score=29.69 Aligned_cols=42 Identities=29% Similarity=0.296 Sum_probs=19.1
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
.+.++...+++-.++.....+...++..+..++.|+.++..+
T Consensus 26 ~k~~s~~~aq~~~~~a~~~ai~a~~~~~E~~l~~Lq~e~~~l 67 (459)
T KOG0288|consen 26 EKAQSRLSAQLVILRAESRAIKAKLQEKELELNRLQEENTQL 67 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333444444444444555555555555555554443
No 192
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=62.61 E-value=59 Score=23.89 Aligned_cols=17 Identities=35% Similarity=0.423 Sum_probs=6.8
Q ss_pred hHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERN 222 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l 222 (288)
.++..|+..+.++..++
T Consensus 5 a~~~~Lr~rLd~~~rk~ 21 (69)
T PF14197_consen 5 AEIATLRNRLDSLTRKN 21 (69)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33444444444444333
No 193
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=62.21 E-value=72 Score=33.16 Aligned_cols=7 Identities=43% Similarity=0.819 Sum_probs=3.9
Q ss_pred CeEEEEE
Q psy21 102 NWQLVRK 108 (288)
Q Consensus 102 gw~l~r~ 108 (288)
.|+|+|.
T Consensus 101 ~y~i~R~ 107 (650)
T TIGR03185 101 EYTLVRS 107 (650)
T ss_pred EEEEEEE
Confidence 3566655
No 194
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=62.05 E-value=1e+02 Score=26.36 Aligned_cols=22 Identities=18% Similarity=0.317 Sum_probs=9.6
Q ss_pred HHHHHhhHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~q 253 (288)
....|...+..+..++.++...
T Consensus 96 l~~~l~~~~~~~~~~r~~l~~~ 117 (177)
T PF13870_consen 96 LKQELKDREEELAKLREELYRV 117 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444433
No 195
>PF09787 Golgin_A5: Golgin subfamily A member 5; InterPro: IPR019177 This entry represents a family of proteins involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterised: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1 [].
Probab=61.99 E-value=98 Score=31.29 Aligned_cols=71 Identities=24% Similarity=0.367 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
+..|+.....+..++..++.++..+...+.+++.++..-...+..+.++.+..+.-=..-+.|+.-|..-|
T Consensus 276 l~~l~~E~~~~~ee~~~l~~Qi~~l~~e~~d~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~e~~l~~~el 346 (511)
T PF09787_consen 276 LEELKQERDHLQEEIQLLERQIEQLRAELQDLEAQLEGEQESFREQPQELSQQLEPELTTEAELRLYYQEL 346 (511)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHH
Confidence 33444444444444444444444444444444444444333333444444443322222255555554443
No 196
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=61.95 E-value=76 Score=27.20 Aligned_cols=50 Identities=22% Similarity=0.340 Sum_probs=29.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
.+.+-.+|..++..|+.+...|+..+.++...+.++++.-.+++.+..+.
T Consensus 87 ~~~e~k~L~~~v~~Le~e~r~L~~~~~~~~~q~~rlee~e~~l~~e~~~l 136 (158)
T PF09744_consen 87 WRQERKDLQSQVEQLEEENRQLELKLKNLSDQSSRLEEREAELKKEYNRL 136 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccccchhHHHHHHHHHHH
Confidence 45566667777777777776666666555555555554444444433333
No 197
>PF14915 CCDC144C: CCDC144C protein coiled-coil region
Probab=61.73 E-value=1e+02 Score=29.16 Aligned_cols=43 Identities=21% Similarity=0.361 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy21 221 RNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEI 263 (288)
Q Consensus 221 ~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~v 263 (288)
++.-|...|......-.+||.++...+..++..+.++..-.-.
T Consensus 71 ENt~L~SkLe~EKq~kerLEtEiES~rsRLaaAi~d~dqsq~s 113 (305)
T PF14915_consen 71 ENTMLNSKLEKEKQNKERLETEIESYRSRLAAAIQDHDQSQTS 113 (305)
T ss_pred HHHHHhHHHHHhHHHHHHHHHHHHHHHHHHHHHHhhHHHHHhh
Confidence 3333333444444444556666666666666666666554433
No 198
>PRK04863 mukB cell division protein MukB; Provisional
Probab=61.55 E-value=92 Score=35.90 Aligned_cols=47 Identities=13% Similarity=0.116 Sum_probs=22.1
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
.+..+..++.++..+..++...-.++.++....-.+..++..|...+
T Consensus 374 leeeleeleeEleelEeeLeeLqeqLaelqqel~elQ~el~q~qq~i 420 (1486)
T PRK04863 374 ADEQQEENEARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAV 420 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444443444444444445555555665443
No 199
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=61.55 E-value=1.5e+02 Score=30.15 Aligned_cols=91 Identities=16% Similarity=0.214 Sum_probs=46.3
Q ss_pred hHHHhHHHHHHHHHHh----hCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH---HH
Q psy21 185 KEQSSLLLELKETKGR----SGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLK---DY 257 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~----~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~---ey 257 (288)
..|..++++|...... ......|-..|..+....+.+...+..++......+..|+.||...+..-+.|+. |+
T Consensus 416 ~~Y~~RI~eLt~qlQ~adSKa~~f~~Ec~aL~~rL~~aE~ek~~l~eeL~~a~~~i~~LqDEL~TTr~NYE~QLs~MSEH 495 (518)
T PF10212_consen 416 SYYMSRIEELTSQLQHADSKAVHFYAECRALQKRLESAEKEKESLEEELKEANQNISRLQDELETTRRNYEEQLSMMSEH 495 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 3444444444443332 2234445555555555555555555555555666666666666666665555533 44
Q ss_pred HHHhhhhhhh-hhhHHHhh
Q psy21 258 RELMEIKINL-DVEIAQYR 275 (288)
Q Consensus 258 q~LL~vKl~L-e~EIatYr 275 (288)
-.=||-|+.- .-||.+++
T Consensus 496 LasmNeqL~~Q~eeI~~LK 514 (518)
T PF10212_consen 496 LASMNEQLAKQREEIQTLK 514 (518)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 4555555443 23444443
No 200
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=61.47 E-value=1.5e+02 Score=28.17 Aligned_cols=20 Identities=10% Similarity=0.277 Sum_probs=10.7
Q ss_pred HHHHHHhhHHHHHHHHHHHH
Q psy21 231 YEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~ 250 (288)
..+.++..++.++......+
T Consensus 246 ~l~~~i~~l~~~i~~e~~~i 265 (362)
T TIGR01010 246 SLQARIKSLRKQIDEQRNQL 265 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 34555566666665554443
No 201
>KOG4657|consensus
Probab=60.22 E-value=1.2e+02 Score=27.58 Aligned_cols=33 Identities=9% Similarity=0.130 Sum_probs=15.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFA 237 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~ 237 (288)
+.+|.+++..+..|+..++-++.++.++...|.
T Consensus 92 eqeik~~q~elEvl~~n~Q~lkeE~dd~keiIs 124 (246)
T KOG4657|consen 92 EQEIKATQSELEVLRRNLQLLKEEKDDSKEIIS 124 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH
Confidence 344444444444444444444444444444443
No 202
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=60.17 E-value=1.8e+02 Score=28.73 Aligned_cols=12 Identities=8% Similarity=0.116 Sum_probs=7.8
Q ss_pred eEecCCcEEEEE
Q psy21 123 LKIEAGGTVTVW 134 (288)
Q Consensus 123 ~~L~~g~~vtIw 134 (288)
|.|.|.++-++-
T Consensus 35 F~LRPkqTAvlv 46 (499)
T COG4372 35 FGLRPKQTAVLV 46 (499)
T ss_pred hccCcccceeeh
Confidence 667777775544
No 203
>KOG4005|consensus
Probab=59.76 E-value=1.1e+02 Score=28.20 Aligned_cols=24 Identities=25% Similarity=0.371 Sum_probs=11.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
++.+|-+|...++.|..+++.|..
T Consensus 95 me~~i~dL~een~~L~~en~~Lr~ 118 (292)
T KOG4005|consen 95 MEYEIKDLTEENEILQNENDSLRA 118 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444445555555555444444433
No 204
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=59.73 E-value=1.6e+02 Score=28.01 Aligned_cols=55 Identities=27% Similarity=0.449 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 214 ANYRLTERNKELEDQIEYEQS-KFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 214 ~~~~Le~~l~~le~~l~~~q~-~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
.+..|..+-.+++..|+..|. .+..|...|..+..++...-.+...|..-|+-|+
T Consensus 114 kl~qLr~EK~~lE~~Le~EqE~~V~kL~k~i~~Le~e~~~~q~~le~Lr~EKVdlE 169 (310)
T PF09755_consen 114 KLNQLRQEKVELENQLEQEQEYLVNKLQKKIERLEKEKSAKQEELERLRREKVDLE 169 (310)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhHH
Confidence 333333333344444443332 2344445555555444444444445555554443
No 205
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=59.36 E-value=72 Score=23.81 Aligned_cols=26 Identities=27% Similarity=0.103 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 213 AANYRLTERNKELEDQIEYEQSKFAR 238 (288)
Q Consensus 213 ~~~~~Le~~l~~le~~l~~~q~~i~~ 238 (288)
+++..||.++.--+..++.++..+..
T Consensus 8 ~Ri~eLE~r~AfQE~tieeLn~~laE 33 (72)
T COG2900 8 ARIIELEIRLAFQEQTIEELNDALAE 33 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333333
No 206
>KOG2391|consensus
Probab=59.15 E-value=29 Score=33.42 Aligned_cols=11 Identities=18% Similarity=0.256 Sum_probs=5.6
Q ss_pred hhhheeccchh
Q psy21 162 ISTILLNTDGE 172 (288)
Q Consensus 162 i~t~L~d~~ge 172 (288)
|.|.++..+.+
T Consensus 207 irasvisa~~e 217 (365)
T KOG2391|consen 207 IRASVISAVRE 217 (365)
T ss_pred hhHHHHHHHHH
Confidence 34455555555
No 207
>PF15372 DUF4600: Domain of unknown function (DUF4600)
Probab=59.15 E-value=1e+02 Score=25.57 Aligned_cols=40 Identities=20% Similarity=0.294 Sum_probs=22.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhh
Q psy21 238 RMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNL 277 (288)
Q Consensus 238 ~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkL 277 (288)
.|+.++......+.+..+.|...-+.+-..=.||+.+..+
T Consensus 69 ~Le~qlk~~e~rLeQEsKAyhk~ndeRr~ylaEi~~~s~~ 108 (129)
T PF15372_consen 69 SLENQLKDYEWRLEQESKAYHKANDERRQYLAEISQTSAL 108 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhhh
Confidence 3455555555555555566666665555555566555543
No 208
>KOG4196|consensus
Probab=59.08 E-value=1.1e+02 Score=25.61 Aligned_cols=72 Identities=19% Similarity=0.209 Sum_probs=37.6
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEI 263 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~v 263 (288)
.||+-+++.+-..|+..--...-.+.+++ +-.+||.+-..++..+..|..++.+++.+......-|+.|.+.
T Consensus 46 reEVvrlKQrRRTLKNRGYA~sCR~KRv~-Qk~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~k~k~e~l~~~ 117 (135)
T KOG4196|consen 46 REEVVRLKQRRRTLKNRGYAQSCRVKRVQ-QKHELEKEKAELQQQVEKLKEENSRLRRELDAYKSKYEALQNS 117 (135)
T ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 55666665555444433222222222221 1124454445555666666666777777766666677776654
No 209
>PF03173 CHB_HEX: Putative carbohydrate binding domain; InterPro: IPR004866 This domain represents the N-terminal domain in chitobiases and beta-hexosaminidases 3.2.1.52 from EC. Chitobiases degrade chitin, which forms the exoskeleton in insects and crustaceans, and which is one of the most abundant polysaccharides on earth []. Beta-hexosaminidases are composed of either a HexA/HexB heterodimer or a HexB homodimer, and can hydrolyse diverse substrates, including GM(2)-gangliosides; mutations in this enzyme are associated with Tay-Sachs disease []. HexB is structurally similar to chitobiase, consisting of a beta sandwich structure; this structure is similar to that found in the cellulose-binding domain of cellulase from Cellulomonas fimi (IPR001919 from INTERPRO), suggesting that it may function as a carbohydrate-binding domain.; GO: 0030246 carbohydrate binding; PDB: 1C7T_A 1QBA_A 1QBB_A 1C7S_A.
Probab=58.93 E-value=24 Score=30.42 Aligned_cols=18 Identities=11% Similarity=0.218 Sum_probs=9.7
Q ss_pred EEecCCeE-ecCCcEEEEE
Q psy21 117 FKFHRTLK-IEAGGTVTVW 134 (288)
Q Consensus 117 y~Fp~~~~-L~~g~~vtIw 134 (288)
..-.++|. |+||++++|=
T Consensus 81 l~Pt~~F~gl~~Ges~~I~ 99 (164)
T PF03173_consen 81 LTPTAGFKGLAPGESLEIP 99 (164)
T ss_dssp EEE-TT---B-TTEEEEEE
T ss_pred EeECCCCCccCCCCEEEEE
Confidence 34446674 9999987764
No 210
>KOG3433|consensus
Probab=58.74 E-value=1.2e+02 Score=26.83 Aligned_cols=112 Identities=13% Similarity=0.145 Sum_probs=57.9
Q ss_pred cccCccchhhheeccchh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH-------HH
Q psy21 155 KFGSGDEISTILLNTDGE-LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKE-------LE 226 (288)
Q Consensus 155 ~wg~g~~i~t~L~d~~ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~-------le 226 (288)
.|.+-.++.-.|+| +|. .+.+..... .=| .=..+-+...+...+.|..+++.+.+....|...+.. .|
T Consensus 40 v~~tvKdvLQsLvD-D~lV~~eKIgtSn--yyw-sfps~a~~~~ks~~qeLe~~L~~~~qk~~tl~e~~en~K~~~e~tE 115 (203)
T KOG3433|consen 40 VWQTVKDVLQSLVD-DGLVIKEKIGTSN--YYW-SFPSEAICDRKSVLQELESQLATGSQKKATLGESIENRKAGREETE 115 (203)
T ss_pred ehhHHHHHHHHHhc-cchHHHHHhcccc--ccc-ccchHHHHHHHHHHHHHHHHHHHhhhhHhHHHHHHHHHHhhhhhhH
Confidence 46777778888887 444 333322221 001 0122223334444555556666666555555554433 33
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy21 227 DQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 227 ~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~E 270 (288)
..-..+...+..+..++..++.++......-+....-|.++.-+
T Consensus 116 er~~el~kklnslkk~~e~lr~el~k~~e~dpqv~~k~~~~~K~ 159 (203)
T KOG3433|consen 116 ERTDELTKKLNSLKKILESLRWELAKIQETDPQVFEKKVHLEKT 159 (203)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHH
Confidence 33334455555566667777777766555555555666666555
No 211
>PF10226 DUF2216: Uncharacterized conserved proteins (DUF2216); InterPro: IPR019359 Proteins in this entry are found in Metazoa and contain a coiled-coil domain. Some annotation suggests it might be PKR, the Hepatitis delta antigen-interacting protein A, but this could not be confirmed.
Probab=58.66 E-value=1.1e+02 Score=27.07 Aligned_cols=39 Identities=26% Similarity=0.406 Sum_probs=27.8
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhH
Q psy21 233 QSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEI 271 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EI 271 (288)
+..+..+...|..+.......++|..+|-.+=+.||-|-
T Consensus 107 r~eV~~Y~~KL~eLE~kq~~L~rEN~eLKElcl~LDeer 145 (195)
T PF10226_consen 107 RQEVAQYQQKLKELEDKQEELIRENLELKELCLYLDEER 145 (195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhccc
Confidence 445555666677776666777788888888878887764
No 212
>PF09478 CBM49: Carbohydrate binding domain CBM49; InterPro: IPR019028 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This domain is found at the C-terminal of cellulases and in vitro binding studies have shown it to binds to crystalline cellulose []. ; GO: 0030246 carbohydrate binding, 0005576 extracellular region
Probab=58.63 E-value=26 Score=26.12 Aligned_cols=30 Identities=17% Similarity=0.294 Sum_probs=21.7
Q ss_pred ccCeEEEEEeCCcccEEEEecCCe-EecCCcEEEE
Q psy21 100 LGNWQLVRKVQGEVKTSFKFHRTL-KIEAGGTVTV 133 (288)
Q Consensus 100 L~gw~l~r~v~~~~~~~y~Fp~~~-~L~~g~~vtI 133 (288)
-.=|-|.+.-+ ..|.||+-. .|+||++.+.
T Consensus 45 ~~iW~l~~~~~----~~y~lPs~~~~i~pg~s~~F 75 (80)
T PF09478_consen 45 GSIWGLDKVSG----NTYTLPSYQPTIKPGQSFTF 75 (80)
T ss_pred hhheeEEeccC----CEEECCccccccCCCCEEEE
Confidence 44588877323 489999854 8999999863
No 213
>PF14263 DUF4354: Domain of unknown function (DUF4354); PDB: 3NRF_B 3SB3_A.
Probab=58.63 E-value=27 Score=28.82 Aligned_cols=44 Identities=16% Similarity=0.197 Sum_probs=27.1
Q ss_pred EEEEeCCccceeccCeEEEEEeCCcccEEEEecC---C---eEecCCcEEEE
Q psy21 88 VKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHR---T---LKIEAGGTVTV 133 (288)
Q Consensus 88 V~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~---~---~~L~~g~~vtI 133 (288)
|.+.|.++++++|....++-.-+.+ ..|++.- + =.|+||+.|+=
T Consensus 53 V~vaN~s~~~idLsk~Cf~a~~~~g--k~f~ldTVd~~L~~g~lK~g~s~kG 102 (124)
T PF14263_consen 53 VTVANLSDKDIDLSKMCFKAYSPDG--KEFKLDTVDEELTSGTLKPGESVKG 102 (124)
T ss_dssp EEEEE-SSS-EE-TT-EEEEEETTS---EEEEEEE-GGGG-SEE-TT-EEEE
T ss_pred EEEecCCCCccccccchhhhccccC--CEEEecccchhhhhccccCCCceeE
Confidence 8899999999999999998776543 3454421 1 27899999873
No 214
>KOG4674|consensus
Probab=58.60 E-value=92 Score=36.44 Aligned_cols=72 Identities=21% Similarity=0.294 Sum_probs=46.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
..+++|+..++.++..+..++..+......+....+++..++.++.+-.+-||+|+.-- =+..+..|++|..
T Consensus 1243 ~k~qEl~~~i~kl~~el~plq~~l~el~~e~~~~~ael~~l~~e~~~wK~R~q~L~~k~--k~~d~~~~~kL~~ 1314 (1822)
T KOG4674|consen 1243 EKIQELRDKIEKLNFELAPLQNELKELKAELQEKVAELKKLEEENDRWKQRNQDLLEKY--KDSDKNDYEKLKS 1314 (1822)
T ss_pred HHHHHHHHHHHHHHhhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcCCHHHHHHHHH
Confidence 33444444444444444445555555666666677777788888888888889988752 3455678888764
No 215
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=58.41 E-value=87 Score=24.44 Aligned_cols=18 Identities=22% Similarity=0.444 Sum_probs=6.8
Q ss_pred HHHHhhHHHHHHHHHHHH
Q psy21 233 QSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~ 250 (288)
...+..++.++..+..+|
T Consensus 76 e~~i~~l~~~~~~l~~~~ 93 (105)
T cd00632 76 ELRIKRLERQEEDLQEKL 93 (105)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 216
>PRK12704 phosphodiesterase; Provisional
Probab=58.37 E-value=1.7e+02 Score=29.73 Aligned_cols=20 Identities=25% Similarity=0.451 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy21 240 EAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~ 259 (288)
+.+|.....+++....+|..
T Consensus 116 e~~Le~re~eLe~~~~~~~~ 135 (520)
T PRK12704 116 EKELEQKQQELEKKEEELEE 135 (520)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33333333444443333333
No 217
>COG3524 KpsE Capsule polysaccharide export protein [Cell envelope biogenesis, outer membrane]
Probab=58.35 E-value=1.8e+02 Score=27.96 Aligned_cols=30 Identities=7% Similarity=0.276 Sum_probs=16.5
Q ss_pred HHHHHHhhHHHHHHHHHHH---------HHHHHHHHHHH
Q psy21 231 YEQSKFARMEAEIKRLQTE---------MTEQLKDYREL 260 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~---------~~~ql~eyq~L 260 (288)
.+.+.+..++.+|.+-+.. +..+..|||+|
T Consensus 255 ~LkarieSlrkql~qe~q~isag~~~~sl~~qaAefq~l 293 (372)
T COG3524 255 GLKARIESLRKQLLQEKQAISAGGSSQSLSNQAAEFQRL 293 (372)
T ss_pred hHHHHHHHHHHHHHHHHHHhcCCCCccchhHHHHHHHHH
Confidence 3455555555555554443 34566677765
No 218
>KOG3156|consensus
Probab=58.27 E-value=76 Score=28.62 Aligned_cols=36 Identities=19% Similarity=0.320 Sum_probs=19.7
Q ss_pred HHHHHHhhCCc-hhHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKY-DTEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 194 l~~~~~~~~~l-~~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
+...|..+.++ +.|...|++.+..|...++.+...+
T Consensus 103 f~kiRsel~S~e~sEF~~lr~e~EklkndlEk~ks~l 139 (220)
T KOG3156|consen 103 FAKIRSELVSIERSEFANLRAENEKLKNDLEKLKSSL 139 (220)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444554443 4566666666666666665555444
No 219
>PF10498 IFT57: Intra-flagellar transport protein 57 ; InterPro: IPR019530 Eukaryotic cilia and flagella are specialised organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bi-directional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated []. This protein is known as Huntington-interacting protein-1 in humans.
Probab=58.18 E-value=1.1e+02 Score=29.78 Aligned_cols=33 Identities=27% Similarity=0.491 Sum_probs=16.0
Q ss_pred HHHhhHHHHHHHHHHHHHHH---HHHHHHHhhhhhh
Q psy21 234 SKFARMEAEIKRLQTEMTEQ---LKDYRELMEIKIN 266 (288)
Q Consensus 234 ~~i~~le~el~~lr~~~~~q---l~eyq~LL~vKl~ 266 (288)
..+..+..+|.+++.+|+.. +-+-..|..+|-+
T Consensus 301 ~~L~~IseeLe~vK~emeerg~~mtD~sPlv~IKqA 336 (359)
T PF10498_consen 301 RELAEISEELEQVKQEMEERGSSMTDGSPLVKIKQA 336 (359)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHH
Confidence 33444445555555555443 2233456666654
No 220
>KOG2751|consensus
Probab=57.91 E-value=99 Score=30.78 Aligned_cols=70 Identities=20% Similarity=0.298 Sum_probs=38.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQ 273 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIat 273 (288)
+..++..+.....+|..++.+++.+-.+....+..++..--++..++.+..++|....---+..+-|+++
T Consensus 181 ~~~e~~~l~~eE~~L~q~lk~le~~~~~l~~~l~e~~~~~~~~~e~~~~~~~ey~~~~~q~~~~~del~S 250 (447)
T KOG2751|consen 181 LLKELKNLKEEEERLLQQLEELEKEEAELDHQLKELEFKAERLNEEEDQYWREYNNFQRQLIEHQDELDS 250 (447)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccchHHH
Confidence 4556666666666666666666554444455555555555555556666666665444333444444443
No 221
>PRK14692 lagellar hook-associated protein FlgL; Provisional
Probab=57.81 E-value=2.5e+02 Score=30.09 Aligned_cols=93 Identities=11% Similarity=0.157 Sum_probs=45.2
Q ss_pred CCeEecCCcEEEEEecccCCCCCCCCCceeeecccccCccchhhheeccchh-----HHHHHHHHhhhhhHHHhHHHHHH
Q psy21 121 RTLKIEAGGTVTVWSFNVEGATHEPPHNIVMKNQKFGSGDEISTILLNTDGE-----LANQERIIDNLRKEQSSLLLELK 195 (288)
Q Consensus 121 ~~~~L~~g~~vtIwa~~~~~~~~~pp~dlv~~~~~wg~g~~i~t~L~d~~ge-----vA~~~~~~~~~~~~~~~~~eEl~ 195 (288)
+.|.+.+|-+++|-. |+.++ |...++++..|.+.... .+.+...-...-..+....+-+.
T Consensus 613 ~~~~~~~n~~l~i~~---------p~~dv------F~~Ld~lI~AL~sg~~~~~~~~~d~~~aglq~aL~~LD~~~D~V~ 677 (749)
T PRK14692 613 PNFSFSANNSLTIDE---------PNVDI------IKDLDSMIDAVLKGNMRADSESEDPRNTGMQGALERLDHLADHVS 677 (749)
T ss_pred ceEEEcCCceEEecC---------ChhhH------HHHHHHHHHHHhCCCcccccccchhhhhHHHHHHHHHHHHHHHHH
Confidence 445556666666622 44454 56667777777655321 11111110100234444555555
Q ss_pred HHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 196 ETKGRSGKYDTEMNELRAANYRLTERNKELEDQ 228 (288)
Q Consensus 196 ~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~ 228 (288)
..+.+++....+|+.+......+...+.++...
T Consensus 678 ~~rA~vGAr~NrlE~~~~r~e~~~l~l~~~lS~ 710 (749)
T PRK14692 678 KLNTTMGAYHNTIEGVNTRTSFLSVNVQSIKSN 710 (749)
T ss_pred HHHHHHhHHHHHHHHHHHhhHHHHHHHHHHHHh
Confidence 555555555555555555544444444444333
No 222
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=57.36 E-value=1.1e+02 Score=30.23 Aligned_cols=23 Identities=26% Similarity=0.437 Sum_probs=9.8
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHH
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~ 251 (288)
+..++..+..+..++.+++.++.
T Consensus 326 ~~~l~~~~~~l~~~~~~~~~~~~ 348 (498)
T TIGR03007 326 IASLEARVAELTARIERLESLLR 348 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444444444444444433
No 223
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=57.25 E-value=71 Score=28.01 Aligned_cols=9 Identities=11% Similarity=0.386 Sum_probs=4.8
Q ss_pred cchhhheec
Q psy21 160 DEISTILLN 168 (288)
Q Consensus 160 ~~i~t~L~d 168 (288)
-++...|+|
T Consensus 33 Kdvlq~LvD 41 (188)
T PF03962_consen 33 KDVLQSLVD 41 (188)
T ss_pred HHHHHHHhc
Confidence 344555666
No 224
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=57.22 E-value=30 Score=26.37 Aligned_cols=45 Identities=16% Similarity=0.038 Sum_probs=28.6
Q ss_pred cEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 86 KFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 86 kfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
+.|.|.|.|.... -|++...-.......+.+++ -.|.||.+++|-
T Consensus 24 ~~v~l~N~s~~p~---~f~v~~~~~~~~~~~v~~~~-g~l~PG~~~~~~ 68 (102)
T PF14874_consen 24 RTVTLTNTSSIPA---RFRVRQPESLSSFFSVEPPS-GFLAPGESVELE 68 (102)
T ss_pred EEEEEEECCCCCE---EEEEEeCCcCCCCEEEECCC-CEECCCCEEEEE
Confidence 5799999997764 46665432122344445444 579999877664
No 225
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=56.75 E-value=1.1e+02 Score=29.11 Aligned_cols=47 Identities=19% Similarity=0.233 Sum_probs=20.0
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHh
Q psy21 228 QIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQY 274 (288)
Q Consensus 228 ~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatY 274 (288)
++..+.+.|..++..+..+-.+.+....-.+..-+.-..|-.|+..+
T Consensus 235 EIt~LlsqivdlQ~r~k~~~~EnEeL~q~L~~ske~Q~~L~aEL~el 281 (306)
T PF04849_consen 235 EITSLLSQIVDLQQRCKQLAAENEELQQHLQASKESQRQLQAELQEL 281 (306)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444433333333333334444555555433
No 226
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=56.66 E-value=69 Score=22.75 Aligned_cols=12 Identities=17% Similarity=0.246 Sum_probs=4.4
Q ss_pred HHHHHHHHHHHH
Q psy21 242 EIKRLQTEMTEQ 253 (288)
Q Consensus 242 el~~lr~~~~~q 253 (288)
|-.+.-..|.-+
T Consensus 39 EAaRAN~RlDN~ 50 (56)
T PF04728_consen 39 EAARANQRLDNI 50 (56)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHhh
Confidence 333333333333
No 227
>KOG4360|consensus
Probab=56.40 E-value=1.7e+02 Score=29.93 Aligned_cols=11 Identities=9% Similarity=0.135 Sum_probs=4.0
Q ss_pred hhHHHHHHHHH
Q psy21 205 DTEMNELRAAN 215 (288)
Q Consensus 205 ~~ei~~L~~~~ 215 (288)
..+|-+++..+
T Consensus 239 lsql~d~qkk~ 249 (596)
T KOG4360|consen 239 LSQLVDLQKKI 249 (596)
T ss_pred HHHHHhhHHHH
Confidence 33333333333
No 228
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=56.16 E-value=79 Score=23.24 Aligned_cols=12 Identities=17% Similarity=0.374 Sum_probs=4.5
Q ss_pred hhHHHHHHHHHH
Q psy21 237 ARMEAEIKRLQT 248 (288)
Q Consensus 237 ~~le~el~~lr~ 248 (288)
..|.+++..++.
T Consensus 50 ~~Lk~E~e~L~~ 61 (69)
T PF14197_consen 50 NKLKEENEALRK 61 (69)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 229
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=55.61 E-value=3.9 Score=43.00 Aligned_cols=123 Identities=18% Similarity=0.175 Sum_probs=0.0
Q ss_pred cccCccchhhheeccchh---HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH---HHHH
Q psy21 155 KFGSGDEISTILLNTDGE---LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKE---LEDQ 228 (288)
Q Consensus 155 ~wg~g~~i~t~L~d~~ge---vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~---le~~ 228 (288)
.|.....-...|.|.-.. .|.+..+-...-..|+.+.+++.-++.++..|+.+...|..++..|+.++.. +..+
T Consensus 285 eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YKkKLed~~~lk~qvk~Lee~N~~l~e~~~~LEeel~~~~~~~~q 364 (713)
T PF05622_consen 285 ELQAEAREARALRDELDELREKADRADKLENEVEKYKKKLEDLEDLKRQVKELEEDNAVLLETKAMLEEELKKARALKSQ 364 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHH
Confidence 565555555666666555 2222222111224566777777777777777777776666677777766643 3334
Q ss_pred HHHHHHHHhhHHHHHHHHH-------HHHHHHHHHHHHHhhhhhhhhhhHHHhhhh
Q psy21 229 IEYEQSKFARMEAEIKRLQ-------TEMTEQLKDYRELMEIKINLDVEIAQYRNL 277 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr-------~~~~~ql~eyq~LL~vKl~Le~EIatYrkL 277 (288)
+..+...|..++..+.... .++......+..|..-|..|..|+...+-.
T Consensus 365 le~~k~qi~eLe~~l~~~~~~~~~l~~e~~~L~ek~~~l~~eke~l~~e~~~L~e~ 420 (713)
T PF05622_consen 365 LEEYKKQIQELEQKLSEESRRADKLEFENKQLEEKLEALEEEKERLQEERDSLRET 420 (713)
T ss_dssp --------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445455555444443333 233333344455555556666666655543
No 230
>PRK08412 flgL flagellar hook-associated protein FlgL; Validated
Probab=55.61 E-value=2.1e+02 Score=31.04 Aligned_cols=78 Identities=24% Similarity=0.304 Sum_probs=40.8
Q ss_pred CCCCCeEEEeecCCCcEEEEEeCCccceeccCeEEEEEeCCcccEEEE-----ecCCeEecCCcEEEEEecccCCCCCCC
Q psy21 71 SAKGNMEITEVDPDGKFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFK-----FHRTLKIEAGGTVTVWSFNVEGATHEP 145 (288)
Q Consensus 71 sa~G~v~I~e~d~~GkfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~-----Fp~~~~L~~g~~vtIwa~~~~~~~~~p 145 (288)
.|||.|.+. .|-+|+++.--++.... +..+.-. |. ....|- -.+.+.+.++..|||- . |
T Consensus 639 ~ak~~v~~~-ld~~g~~~~~d~~~s~t----~~~~~~~-d~-~~~~f~~~~~~~~~~~~f~ann~iti~---~------p 702 (827)
T PRK08412 639 QAKGNVSVN-LNEQGKLIIQDNMHSNT----KMQFMLY-DS-DANDFSQALHSDKPSLTFNANNALTID---D------P 702 (827)
T ss_pred HhhccEEEe-ecCCCcEEEEeccCCCc----ceEEEEe-cC-CCCcccCCCCCCCCceEEecCCEEEec---C------C
Confidence 467777764 77778776655443221 1111110 11 001111 1245666788999884 2 3
Q ss_pred CCceeeecccccCccchhhheeccc
Q psy21 146 PHNIVMKNQKFGSGDEISTILLNTD 170 (288)
Q Consensus 146 p~dlv~~~~~wg~g~~i~t~L~d~~ 170 (288)
-.++ |...+++...|.+++
T Consensus 703 ~~dI------F~tLd~lI~AL~sg~ 721 (827)
T PRK08412 703 SVNF------FDQLDNIITAVRKGI 721 (827)
T ss_pred CccH------HHHHHHHHHHHhCCC
Confidence 3344 556677777777664
No 231
>PF15233 SYCE1: Synaptonemal complex central element protein 1
Probab=55.57 E-value=1.2e+02 Score=25.24 Aligned_cols=38 Identities=18% Similarity=0.162 Sum_probs=30.7
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
.-+...+|+.+++.-...|..|+++|+.....|+.-+.
T Consensus 21 aKKk~~EELgEa~~l~eaL~~ELDsL~~EkvhLeeiln 58 (134)
T PF15233_consen 21 AKKKSSEELGEAQALWEALQRELDSLNGEKVHLEEILN 58 (134)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 44566788888888888899999999998888876554
No 232
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=55.46 E-value=1.2e+02 Score=25.06 Aligned_cols=56 Identities=34% Similarity=0.433 Sum_probs=24.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
...+..+...+..|...+-.++.++...+..+...-.++..+..+.......|+.+
T Consensus 33 ~~~~~~l~~~n~~lAe~nL~~~~~l~~~r~~l~~~~~~~~~L~~~~~~k~~~~~~l 88 (150)
T PF07200_consen 33 QQEREELLAENEELAEQNLSLEPELEELRSQLQELYEELKELESEYQEKEQQQDEL 88 (150)
T ss_dssp HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444444444444444444444444444444444444
No 233
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=55.36 E-value=29 Score=34.91 Aligned_cols=7 Identities=14% Similarity=0.278 Sum_probs=2.7
Q ss_pred eeccchh
Q psy21 166 LLNTDGE 172 (288)
Q Consensus 166 L~d~~ge 172 (288)
++|+.-.
T Consensus 57 vV~~~Fd 63 (475)
T PRK13729 57 VVDTTFD 63 (475)
T ss_pred eecchhH
Confidence 3444333
No 234
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=55.28 E-value=79 Score=29.07 Aligned_cols=48 Identities=23% Similarity=0.478 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH------HHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM------TEQLKDYR 258 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~------~~ql~eyq 258 (288)
+.++-++....+.+||.++...+..+..|..++..++.+- .+.++-|+
T Consensus 84 VtsQRDRFR~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~kLYEKiRylqSY~ 137 (248)
T PF08172_consen 84 VTSQRDRFRQRNAELEEELRKQQQTISSLRREVESLRADNVKLYEKIRYLQSYN 137 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCc
Confidence 4455667777777888888878888888888888888775 34455666
No 235
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=55.09 E-value=1.3e+02 Score=32.08 Aligned_cols=30 Identities=23% Similarity=0.162 Sum_probs=12.5
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANY 216 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~ 216 (288)
+.....|++..+..+...+.|.+.|...++
T Consensus 36 i~~l~~elk~~~~~~~~~~~e~~rl~~~~~ 65 (717)
T PF09730_consen 36 ILELENELKQLRQELSNVQAENERLSQLNQ 65 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444444333333333
No 236
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=55.01 E-value=2.7e+02 Score=29.93 Aligned_cols=40 Identities=20% Similarity=0.162 Sum_probs=23.1
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
...+..+..|+..+..++.++..+-.+++..+..|...+.
T Consensus 293 ~~eL~rk~~E~~~~qt~l~~~~~~~~d~r~hi~~lkesl~ 332 (775)
T PF10174_consen 293 KLELSRKKSELEALQTRLETLEEQDSDMRQHIEVLKESLR 332 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 3445555556666666666665566666666665555554
No 237
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=55.00 E-value=76 Score=22.74 Aligned_cols=46 Identities=24% Similarity=0.458 Sum_probs=21.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHhhHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQI--------------EYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l--------------~~~q~~i~~le~el~~lr~~~~ 251 (288)
.|+..|.+.+..++.++..++..| +..+.++..++.++..+...+.
T Consensus 4 ~E~~rL~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~ 63 (66)
T PF10458_consen 4 AEIERLEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALE 63 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555544444332 2334455555555555554443
No 238
>PRK14160 heat shock protein GrpE; Provisional
Probab=54.94 E-value=1.2e+02 Score=27.30 Aligned_cols=18 Identities=17% Similarity=0.318 Sum_probs=6.8
Q ss_pred chhHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTER 221 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~ 221 (288)
++.++.+++....++.+.
T Consensus 73 l~~e~~elkd~~lR~~Ae 90 (211)
T PRK14160 73 LENELEALKDRLLRTVAE 90 (211)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 239
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=54.93 E-value=76 Score=35.62 Aligned_cols=12 Identities=17% Similarity=0.235 Sum_probs=5.7
Q ss_pred hhhhhhHHHhhh
Q psy21 265 INLDVEIAQYRN 276 (288)
Q Consensus 265 l~Le~EIatYrk 276 (288)
-.+..|+..+..
T Consensus 695 ~~~~~e~~~~~~ 706 (1201)
T PF12128_consen 695 KQLKQELEELLE 706 (1201)
T ss_pred HHHHHHHHHHHH
Confidence 334455555543
No 240
>KOG0995|consensus
Probab=54.85 E-value=1.7e+02 Score=30.27 Aligned_cols=25 Identities=32% Similarity=0.385 Sum_probs=12.8
Q ss_pred cceeEEeeCCCCCeEE-----EeecCCCcE
Q psy21 63 SSGFQVTSSAKGNMEI-----TEVDPDGKF 87 (288)
Q Consensus 63 ~~~~~~~~sa~G~v~I-----~e~d~~Gkf 87 (288)
+-+..+..|.|+.+.| .-+||+=.|
T Consensus 93 S~k~l~~PS~KdF~~iFkfLY~~Ldp~y~f 122 (581)
T KOG0995|consen 93 SIKLLMKPSVKDFIAIFKFLYGFLDPDYEF 122 (581)
T ss_pred hhhhcCCCccccHHHHHHHHHhccCCCccc
Confidence 3444555555555554 234555555
No 241
>PF06637 PV-1: PV-1 protein (PLVAP); InterPro: IPR009538 This family consists of several PV-1 (PLVAP) proteins, which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms []. The function of this family is unknown.
Probab=54.82 E-value=2.2e+02 Score=28.07 Aligned_cols=69 Identities=19% Similarity=0.203 Sum_probs=31.4
Q ss_pred eecc-cccCccchhhheeccchh-HHHHHHHHhhhhhHHHhHHHHHHHH-HHhhCCchhHHHHHHHHHHHHH
Q psy21 151 MKNQ-KFGSGDEISTILLNTDGE-LANQERIIDNLRKEQSSLLLELKET-KGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 151 ~~~~-~wg~g~~i~t~L~d~~ge-vA~~~~~~~~~~~~~~~~~eEl~~~-~~~~~~l~~ei~~L~~~~~~Le 219 (288)
|... .-.+.+++-=.|+.+.+- ++.-.+.=.++..-..+|.+|+.+. +..+...-.|=.+|++++-.++
T Consensus 241 WRDSii~R~Ld~~~y~ly~~l~~el~siRr~Cd~lP~~m~tKveelar~Lr~~I~~VarENs~LqrQKle~e 312 (442)
T PF06637_consen 241 WRDSIIPRSLDNLGYSLYHPLGPELESIRRTCDHLPKIMTTKVEELARSLRAGIERVARENSDLQRQKLEAE 312 (442)
T ss_pred HHHHHHhhhhhcCCcccCCCCcchHHHHHHHHhhchHHHHHHHHHHHHHHhhhHHHHHHhhhHHHHHHHHHH
Confidence 4433 333344443334444444 4443333334566667788887653 3333333333334444433333
No 242
>PF15066 CAGE1: Cancer-associated gene protein 1 family
Probab=54.75 E-value=1.5e+02 Score=29.97 Aligned_cols=35 Identities=26% Similarity=0.369 Sum_probs=17.0
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le 226 (288)
|-|+.++.....-+..|++|+..+-.|+..+.+|+
T Consensus 317 EvL~kLk~tn~kQq~~IqdLq~sN~yLe~kvkeLQ 351 (527)
T PF15066_consen 317 EVLQKLKHTNRKQQNRIQDLQCSNLYLEKKVKELQ 351 (527)
T ss_pred HHHHHHHhhhHHHHHHHHHhhhccHHHHHHHHHHH
Confidence 33444444444444555555555555555544443
No 243
>PF04108 APG17: Autophagy protein Apg17 ; InterPro: IPR007240 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Autophagy protein 17 (Apg17) is required for activating Apg1 protein kinases. This entry also contains Autophagy protein 11 which is involved in cytoplasm to vacuole transport (Cvt) and pexophagy. ; GO: 0006914 autophagy
Probab=54.70 E-value=1.9e+02 Score=28.42 Aligned_cols=52 Identities=12% Similarity=0.231 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 225 LEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 225 le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
....+..+-..+...+..+...+..+...+.+..+|-..-.... .+|..||-
T Consensus 302 ~~~~l~~yl~~~~~~~~~~~~~~~~i~~~~~~l~~L~~~Y~~F~---~aY~~LL~ 353 (412)
T PF04108_consen 302 FGERLPSYLAAFHDFEERWEEEKESIQAYIDELEQLCEFYEGFL---SAYDSLLL 353 (412)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH
Confidence 33345556667777777777777777777777777766544333 58888774
No 244
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=54.64 E-value=1.5e+02 Score=26.07 Aligned_cols=9 Identities=22% Similarity=0.416 Sum_probs=3.6
Q ss_pred chhHHHHHH
Q psy21 204 YDTEMNELR 212 (288)
Q Consensus 204 l~~ei~~L~ 212 (288)
++..|+.|-
T Consensus 91 l~~RL~kLL 99 (190)
T PF05266_consen 91 LRSRLNKLL 99 (190)
T ss_pred HHHHHHHHH
Confidence 344444433
No 245
>KOG4807|consensus
Probab=54.61 E-value=1.9e+02 Score=28.69 Aligned_cols=22 Identities=27% Similarity=0.472 Sum_probs=14.8
Q ss_pred HHHHhhhhhhhhhhHHHhhhhh
Q psy21 257 YRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 257 yq~LL~vKl~Le~EIatYrkLL 278 (288)
|-+|--+|..-|.+|.....-|
T Consensus 553 YtELSiaKakadcdIsrLKEqL 574 (593)
T KOG4807|consen 553 YTELSIAKAKADCDISRLKEQL 574 (593)
T ss_pred HHHHHHHHHhhhccHHHHHHHH
Confidence 5667777788888887655433
No 246
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=54.43 E-value=1.1e+02 Score=24.23 Aligned_cols=37 Identities=16% Similarity=0.153 Sum_probs=14.9
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
....+++.++.+++.+..++..|.......+.-+.++
T Consensus 7 ~~~~~~q~~q~~~~~l~~q~~~le~~~~E~~~v~~eL 43 (110)
T TIGR02338 7 NQLAQLQQLQQQLQAVATQKQQVEAQLKEAEKALEEL 43 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444433333333333
No 247
>PF08181 DegQ: DegQ (SacQ) family; InterPro: IPR012554 This family consists of the DegQ (formerly sacQ) regulatory peptides. The DegQ family of peptides control the rates of synthesis of a class of both secreted and intracellular degradative enzymes in Bacillus subtilis. DegQ is 46 amino acids long and activates the synthesis of degradative enzymes. The expression of this peptide was shown to be subjected both to catabolite repression and DegS-DegU-mediated control. Thus allowing an increase in the rate of synthesis of degQ under conditions of nitrogen starvation [].
Probab=54.10 E-value=54 Score=21.77 Aligned_cols=34 Identities=18% Similarity=0.331 Sum_probs=25.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l 239 (288)
.+|.+|+...-+|+.++++....+...+..|..+
T Consensus 4 ~~ieelkqll~rle~eirett~sl~ninksidq~ 37 (46)
T PF08181_consen 4 KKIEELKQLLWRLENEIRETTDSLRNINKSIDQY 37 (46)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence 4677888888888888888887777766655543
No 248
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=53.25 E-value=70 Score=33.19 Aligned_cols=28 Identities=25% Similarity=0.357 Sum_probs=11.1
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLTE 220 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le~ 220 (288)
|+..+..+++.+..++..+...+..+..
T Consensus 329 el~~l~~~l~~l~~~i~~~~~~~~~l~~ 356 (594)
T PF05667_consen 329 ELEELQEQLDELESQIEELEAEIKMLKS 356 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333444444444444433333333
No 249
>cd04776 HTH_GnyR Helix-Turn-Helix DNA binding domain of the regulatory protein GnyR. Putative helix-turn-helix (HTH) regulatory protein, GnyR, and other related proteins. GnyR belongs to the gnyRDBHAL cluster, which is involved in acyclic isoprenoid degradation in Pseudomonas aeruginosa. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=53.22 E-value=1.1e+02 Score=24.48 Aligned_cols=31 Identities=23% Similarity=0.269 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHH
Q psy21 214 ANYRLTERNKELEDQIEYEQSKFARMEAEIK 244 (288)
Q Consensus 214 ~~~~Le~~l~~le~~l~~~q~~i~~le~el~ 244 (288)
....|..++..++.+++.++..+..++..+.
T Consensus 81 ~~~~l~~~~~~l~~~~~~l~~~~~~L~~~~~ 111 (118)
T cd04776 81 MLEKIEKRRAELEQQRRDIDAALAELDAAEE 111 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444443333
No 250
>KOG4571|consensus
Probab=53.11 E-value=32 Score=32.36 Aligned_cols=32 Identities=28% Similarity=0.369 Sum_probs=17.9
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKELEDQIEYE 232 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~ 232 (288)
.+++..|+.+|.++++.|..+..+++.+++-+
T Consensus 250 ~E~l~ge~~~Le~rN~~LK~qa~~lerEI~yl 281 (294)
T KOG4571|consen 250 KEALLGELEGLEKRNEELKDQASELEREIRYL 281 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566666666666666555555544433
No 251
>PLN03188 kinesin-12 family protein; Provisional
Probab=53.05 E-value=2.4e+02 Score=32.01 Aligned_cols=23 Identities=22% Similarity=0.453 Sum_probs=15.8
Q ss_pred HHHHHhhhhhhhhhhHHHhhhhh
Q psy21 256 DYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 256 eyq~LL~vKl~Le~EIatYrkLL 278 (288)
-|+.+=+.|-.-|+||.|-..+|
T Consensus 1230 ~~k~~~klkrkh~~e~~t~~q~~ 1252 (1320)
T PLN03188 1230 AYKQIDKLKRKHENEISTLNQLV 1252 (1320)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 35555566777777888877766
No 252
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=53.00 E-value=87 Score=28.46 Aligned_cols=14 Identities=29% Similarity=0.695 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHHHH
Q psy21 239 MEAEIKRLQTEMTE 252 (288)
Q Consensus 239 le~el~~lr~~~~~ 252 (288)
+|.+|.+.+.+++.
T Consensus 167 ie~~L~~v~~eIe~ 180 (262)
T PF14257_consen 167 IERELSRVRSEIEQ 180 (262)
T ss_pred HHHHHHHHHHHHHH
Confidence 44444444444443
No 253
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=52.95 E-value=59 Score=26.22 Aligned_cols=44 Identities=18% Similarity=0.210 Sum_probs=20.0
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFA 237 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~ 237 (288)
+..+..+++.+-.++.+|+..+..|..+...|+-+.+.++..+.
T Consensus 10 l~~le~~l~~l~~el~~LK~~~~el~EEN~~L~iEN~~Lr~~l~ 53 (110)
T PRK13169 10 LDDLEQNLGVLLKELGALKKQLAELLEENTALRLENDKLRERLE 53 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444555555544444444444444444444443
No 254
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=52.93 E-value=49 Score=22.57 Aligned_cols=25 Identities=20% Similarity=0.195 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 209 NELRAANYRLTERNKELEDQIEYEQ 233 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~~l~~~q 233 (288)
+.|+.++..|+.+++.|+..+..++
T Consensus 2 ~aLrqQv~aL~~qv~~Lq~~fs~yK 26 (46)
T PF09006_consen 2 NALRQQVEALQGQVQRLQAAFSQYK 26 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566566666555555554443
No 255
>PF14282 FlxA: FlxA-like protein
Probab=52.93 E-value=64 Score=25.56 Aligned_cols=6 Identities=33% Similarity=0.379 Sum_probs=2.2
Q ss_pred HHHHHH
Q psy21 190 LLLELK 195 (288)
Q Consensus 190 ~~eEl~ 195 (288)
..++|+
T Consensus 31 Lq~ql~ 36 (106)
T PF14282_consen 31 LQEQLQ 36 (106)
T ss_pred HHHHHH
Confidence 333333
No 256
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=52.80 E-value=43 Score=33.92 Aligned_cols=65 Identities=23% Similarity=0.359 Sum_probs=43.5
Q ss_pred eCCCCCeEEEeecCCCcEEEEEeCCc----cceeccCeEEEEEe-----------CCcccEEEEecCCeEecCCcEEEEE
Q psy21 70 SSAKGNMEITEVDPDGKFVKLYNKGS----EEQSLGNWQLVRKV-----------QGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 70 ~sa~G~v~I~e~d~~GkfV~l~N~s~----~~~~L~gw~l~r~v-----------~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
-..-|+|.-.-++++||||.+.|.-- -|++-|+-++..+- ++..=+.|-||++++. |+|+|+
T Consensus 398 e~~lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idkS~~~lItdf~~~~nsr~iAYafP~gy~t---q~Ikly 474 (668)
T COG4946 398 EKDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIAYAFPEGYYT---QSIKLY 474 (668)
T ss_pred eCCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEEEecCcceee---eeEEEE
Confidence 34457788888999999999999862 35555555544332 2234467899998887 567776
Q ss_pred ecc
Q psy21 135 SFN 137 (288)
Q Consensus 135 a~~ 137 (288)
-..
T Consensus 475 dm~ 477 (668)
T COG4946 475 DMD 477 (668)
T ss_pred ecC
Confidence 443
No 257
>KOG4360|consensus
Probab=52.80 E-value=2.2e+02 Score=29.18 Aligned_cols=30 Identities=20% Similarity=0.198 Sum_probs=12.9
Q ss_pred hhCCchhHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 200 RSGKYDTEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 200 ~~~~l~~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
++.++..+|+.+-..+.++..++..|..+|
T Consensus 213 q~~s~~eel~~kt~el~~q~Ee~skLlsql 242 (596)
T KOG4360|consen 213 QARSGQEELQSKTKELSRQQEENSKLLSQL 242 (596)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444444444444433
No 258
>KOG4848|consensus
Probab=52.73 E-value=1.7e+02 Score=26.09 Aligned_cols=48 Identities=8% Similarity=0.225 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy21 218 LTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKI 265 (288)
Q Consensus 218 Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl 265 (288)
+...+..+-..+.++++.+..-+++-+......++..+|-|+++.-++
T Consensus 137 iak~m~K~pq~~a~~~a~~~k~e~~a~a~~~r~erli~eiqe~fGy~v 184 (225)
T KOG4848|consen 137 IAKNMKKYPQTLAKYEASLVKQEQEADAKEVRLERLIREIQEYFGYWV 184 (225)
T ss_pred HHHHHHHhHHHHHHHHHHHHHhHHHhhHHHHHHHHHHHHHHHHhCccC
Confidence 333333333445566666666666666666666677777777776655
No 259
>PHA02047 phage lambda Rz1-like protein
Probab=52.54 E-value=69 Score=25.31 Aligned_cols=33 Identities=12% Similarity=0.033 Sum_probs=13.0
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRL 218 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~L 218 (288)
|+-..-++.+.+..+++.++.++..+++.++.|
T Consensus 28 ~~g~~h~~a~~la~qLE~a~~r~~~~Q~~V~~l 60 (101)
T PHA02047 28 ALGIAHEEAKRQTARLEALEVRYATLQRHVQAV 60 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333344444444433343433333333333
No 260
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=52.19 E-value=2.6e+02 Score=28.46 Aligned_cols=18 Identities=17% Similarity=0.364 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy21 240 EAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 240 e~el~~lr~~~~~ql~ey 257 (288)
+.+|.....++.....++
T Consensus 110 e~~L~~re~eLee~~~e~ 127 (514)
T TIGR03319 110 EKELSNKEKNLDEKEEEL 127 (514)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 261
>PF04582 Reo_sigmaC: Reovirus sigma C capsid protein; InterPro: IPR007662 Protein sigmaC in its native state was shown to be a homotrimer. It was demonstrated that the sigmaC subunits are not covalently bound via disulphide linkages and the formation of an intrachain disulphide bond between the two cysteine residues of the sigmaC polypeptide may have a negative effect on oligomer stability. The susceptibility of the trimer to pH, temperature, ionic strength, chemical denaturants and detergents indicates that hydrophobic interactions contribute much more to oligomer stability than do ionic interactions and hydrogen bonding [].; PDB: 2VRS_C 2JJL_A 2BSF_A 2BT7_A 2BT8_A.
Probab=52.18 E-value=26 Score=33.50 Aligned_cols=73 Identities=12% Similarity=0.191 Sum_probs=25.4
Q ss_pred HHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy21 198 KGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 198 ~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~E 270 (288)
...+..++..|++|...+..|...+...+..+.++|..+..++-++..++.++..+.-.-.+|=+.=-+||..
T Consensus 83 sstV~~lq~Sl~~lsssVs~lS~~ls~h~ssIS~Lqs~v~~lsTdvsNLksdVSt~aL~ItdLe~RV~~LEs~ 155 (326)
T PF04582_consen 83 SSTVTSLQSSLSSLSSSVSSLSSTLSDHSSSISDLQSSVSALSTDVSNLKSDVSTQALNITDLESRVKALESG 155 (326)
T ss_dssp -----------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHhhhHHhhhhhhhhhhhhHHHHHHhhhhhhhhhhhhhhhhhhhcchHhhHHHHHHHHhcC
Confidence 3333333333344444444444444444556667788888888888888888877755555544433344443
No 262
>PF13514 AAA_27: AAA domain
Probab=51.89 E-value=81 Score=35.01 Aligned_cols=42 Identities=24% Similarity=0.334 Sum_probs=28.3
Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhh
Q psy21 235 KFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRN 276 (288)
Q Consensus 235 ~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrk 276 (288)
.+..+..++..+..+|.....+|-.+--...-|..-|..|+.
T Consensus 934 ~~a~l~~e~e~~~a~l~~~~~~~~~~~la~~lL~~a~~~~r~ 975 (1111)
T PF13514_consen 934 DAAELEQEREEAEAELEELAEEWAALRLAAELLEEAIERYRE 975 (1111)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344566777777777777777777666666666666666663
No 263
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=51.82 E-value=84 Score=24.20 Aligned_cols=28 Identities=29% Similarity=0.378 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 225 LEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 225 le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
++.++......+...|.+|..+|.+-..
T Consensus 41 lE~E~~~l~~~l~~~E~eL~~LrkENrK 68 (85)
T PF15188_consen 41 LEKELNELKEKLENNEKELKLLRKENRK 68 (85)
T ss_pred HHHHHHHHHHHhhccHHHHHHHHHhhhh
Confidence 3444455566666667777777765443
No 264
>PF07246 Phlebovirus_NSM: Phlebovirus nonstructural protein NS-M; InterPro: IPR009879 This entry consists of several Phlebovirus nonstructural NS-M proteins, which represent the N-terminal region of the M polyprotein precursor. The function of this family is unknown.
Probab=51.61 E-value=76 Score=29.53 Aligned_cols=42 Identities=14% Similarity=0.181 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhh
Q psy21 225 LEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKIN 266 (288)
Q Consensus 225 le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~ 266 (288)
+..+.+..+.....|+.++.+++.++..+..+......+..+
T Consensus 207 ~~~r~~~lr~~~~~l~~el~~aK~~~~~~~~~~~~~~sv~~a 248 (264)
T PF07246_consen 207 LEARESGLRNESKWLEHELSDAKEDMIRLRNDISDFTSVPQA 248 (264)
T ss_pred HHHhHhhhHHHHHHHHHHHHHHHHHHHHHHhcccchhhHHHH
Confidence 334444555666677888888888888877665554444443
No 265
>KOG1029|consensus
Probab=51.41 E-value=1.1e+02 Score=32.79 Aligned_cols=8 Identities=63% Similarity=1.049 Sum_probs=4.3
Q ss_pred EeecCCCc
Q psy21 79 TEVDPDGK 86 (288)
Q Consensus 79 ~e~d~~Gk 86 (288)
++||-||+
T Consensus 238 sDvd~DGk 245 (1118)
T KOG1029|consen 238 SDVDGDGK 245 (1118)
T ss_pred eccCCCCc
Confidence 45555554
No 266
>TIGR00998 8a0101 efflux pump membrane protein (multidrug resistance protein A).
Probab=51.40 E-value=1.1e+02 Score=28.33 Aligned_cols=16 Identities=13% Similarity=0.187 Sum_probs=6.3
Q ss_pred hhHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTE 220 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~ 220 (288)
+.++..++..+...+.
T Consensus 107 ~~~i~~~~~~~~~a~~ 122 (334)
T TIGR00998 107 QAKVESLKIKLEQARE 122 (334)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444433333
No 267
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=51.39 E-value=88 Score=32.46 Aligned_cols=36 Identities=22% Similarity=0.313 Sum_probs=15.6
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTER 221 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~ 221 (288)
.++...+++..+..++..+..++..+...+..+..+
T Consensus 329 el~~l~~~l~~l~~~i~~~~~~~~~l~~~~~q~~~e 364 (594)
T PF05667_consen 329 ELEELQEQLDELESQIEELEAEIKMLKSSLKQLEEE 364 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444444444444444444333
No 268
>PRK15396 murein lipoprotein; Provisional
Probab=51.39 E-value=1e+02 Score=23.26 Aligned_cols=18 Identities=11% Similarity=0.285 Sum_probs=7.0
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKE 224 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~ 224 (288)
+++.|..+++.|..+...
T Consensus 26 kvd~LssqV~~L~~kvdq 43 (78)
T PRK15396 26 KIDQLSSDVQTLNAKVDQ 43 (78)
T ss_pred hHHHHHHHHHHHHHHHHH
Confidence 333344444333333333
No 269
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=51.37 E-value=1.3e+02 Score=24.24 Aligned_cols=36 Identities=22% Similarity=0.243 Sum_probs=17.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l 239 (288)
|-.+-+.|++.++.|+.+......++.++++.|..+
T Consensus 35 L~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki~ea 70 (107)
T PF09304_consen 35 LAKQKDQLRNALQSLQAQNASRNQRIAELQAKIDEA 70 (107)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444455555555555444444444444444444
No 270
>PF00635 Motile_Sperm: MSP (Major sperm protein) domain; InterPro: IPR000535 Major sperm proteins (MSP) are central components in molecular interactions underlying sperm motility in Caenorhabditis elegans, whose sperm employ an amoebae-like crawling motion using a MSP-containing lamellipod, rather than the flagellar-based swimming motion associated with other sperm. These proteins oligomerise to form an extensive filament system that extends from sperm villipoda, along the leading edge of the pseudopod. About 30 MSP isoforms may exist in C. elegans. MSPs form a fibrous network, whereby MSP dimers form helical subfilaments that coil around one another to produce filaments, which in turn form supercoils to produce bundles. The crystal structure of MSP from C. elegans reveals an immunoglobulin (Ig)-like seven-stranded beta sandwich fold []. ; GO: 0005198 structural molecule activity; PDB: 1MSP_A 3MSP_B 2BVU_B 2MSP_C 1Z9O_F 1Z9L_A 3IKK_A 1WIC_A 2CRI_A 2RR3_A ....
Probab=51.33 E-value=31 Score=26.44 Aligned_cols=43 Identities=14% Similarity=0.019 Sum_probs=27.2
Q ss_pred cEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 86 KFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 86 kfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
..|.|.|.++..+ .|+|+-.... ..+-.|..-+|+||++++|-
T Consensus 22 ~~l~l~N~s~~~i---~fKiktt~~~---~y~v~P~~G~i~p~~~~~i~ 64 (109)
T PF00635_consen 22 CELTLTNPSDKPI---AFKIKTTNPN---RYRVKPSYGIIEPGESVEIT 64 (109)
T ss_dssp EEEEEEE-SSSEE---EEEEEES-TT---TEEEESSEEEE-TTEEEEEE
T ss_pred EEEEEECCCCCcE---EEEEEcCCCc---eEEecCCCEEECCCCEEEEE
Confidence 3689999998865 4677755332 33445666699999776654
No 271
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=51.22 E-value=2.2e+02 Score=27.00 Aligned_cols=22 Identities=9% Similarity=0.009 Sum_probs=15.5
Q ss_pred hhhhHHHhHHHHHHHHHHhhCC
Q psy21 182 NLRKEQSSLLLELKETKGRSGK 203 (288)
Q Consensus 182 ~~~~~~~~~~eEl~~~~~~~~~ 203 (288)
..+..+..+...+...+.+++.
T Consensus 181 ~~~~~l~~ae~~l~~fr~~~~~ 202 (362)
T TIGR01010 181 EAEQRLNATKAELLKYQIKNKV 202 (362)
T ss_pred HHHHHHHHHHHHHHHHHHhCCC
Confidence 3367777777778887777654
No 272
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=51.03 E-value=2.5e+02 Score=28.07 Aligned_cols=23 Identities=17% Similarity=0.143 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l 229 (288)
+|..|+.+|..|+.+|.....++
T Consensus 315 qV~~l~~rI~aLe~QIa~er~kl 337 (434)
T PRK15178 315 LIPRLSAKIKVLEKQIGEQRNRL 337 (434)
T ss_pred chhHHHHHHHHHHHHHHHHHHHh
Confidence 44444444555555544443333
No 273
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=50.71 E-value=1.2e+02 Score=26.66 Aligned_cols=38 Identities=24% Similarity=0.287 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHhhHHHHHHHHHHHHH
Q psy21 214 ANYRLTERNKELEDQIEYE----QSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 214 ~~~~Le~~l~~le~~l~~~----q~~i~~le~el~~lr~~~~ 251 (288)
.+..|+.++..++.+++.+ ...|..+..++..++....
T Consensus 111 ~l~~l~~~~~~l~~el~~~~~~Dp~~i~~~~~~~~~~~~~an 152 (188)
T PF03962_consen 111 ELEELKKELKELKKELEKYSENDPEKIEKLKEEIKIAKEAAN 152 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444322 3345555555555555443
No 274
>KOG0980|consensus
Probab=50.37 E-value=2.2e+02 Score=31.04 Aligned_cols=8 Identities=25% Similarity=0.331 Sum_probs=2.9
Q ss_pred HHHHHHHH
Q psy21 215 NYRLTERN 222 (288)
Q Consensus 215 ~~~Le~~l 222 (288)
+-.|..++
T Consensus 468 ~~~L~d~l 475 (980)
T KOG0980|consen 468 NTNLNDQL 475 (980)
T ss_pred HHHHHHHH
Confidence 33333333
No 275
>PF10506 MCC-bdg_PDZ: PDZ domain of MCC-2 bdg protein for Usher syndrome; InterPro: IPR019536 The entry represents a protein that has a high homology to the tumour suppressor Usher syndrome type-1C protein-binding protein 1, or known as MCC2 (mutated in colon cancer). MCC2 protein binds the first PDZ domain of AIE-75 with its C-terminal amino acids -DTFL. A possible role of MCC2 as a tumour suppressor has been put forward. The carboxyl terminus of the predicted protein was DTFL which matched the consensus motif X-S/T-X-phi (phi: hydrophobic amino acid residue) for binding to the PDZ domain of AIE-75 [, ].
Probab=50.32 E-value=1e+02 Score=22.70 Aligned_cols=60 Identities=18% Similarity=0.276 Sum_probs=37.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH---HHHHHHHHHhhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT---EQLKDYRELMEI 263 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~---~ql~eyq~LL~v 263 (288)
|+..|++|...++.|...+++...+-+.+-..+...|+....++.... +-..-|+.|+.+
T Consensus 3 L~~~ie~L~~~n~~L~~~le~~k~~se~Ls~~lgk~es~~~alrlal~ys~r~~e~~~~llal 65 (67)
T PF10506_consen 3 LKRRIEELKSQNEMLSSTLEERKQQSEELSMDLGKYESNATALRLALKYSERCKEAYEVLLAL 65 (67)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456677788888888777777766666666666666666666644332 223345555543
No 276
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=50.31 E-value=2.4e+02 Score=29.30 Aligned_cols=50 Identities=24% Similarity=0.313 Sum_probs=35.8
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh-hhhhhhhhHHHhhhhh
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME-IKINLDVEIAQYRNLM 278 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~-vKl~Le~EIatYrkLL 278 (288)
+..++.++..++.++..++.++.+...+|..++. .+..|+.+..-|..+.
T Consensus 217 l~~l~~~l~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 267 (646)
T PRK05771 217 IREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALS 267 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445666777788888888888887777766666 7777777777776554
No 277
>KOG0249|consensus
Probab=50.30 E-value=1.2e+02 Score=32.32 Aligned_cols=16 Identities=13% Similarity=-0.093 Sum_probs=10.7
Q ss_pred hhhhhhhhhHHHhhhh
Q psy21 262 EIKINLDVEIAQYRNL 277 (288)
Q Consensus 262 ~vKl~Le~EIatYrkL 277 (288)
+-|..++.+++...+|
T Consensus 285 ~~k~S~~~rrp~~grL 300 (916)
T KOG0249|consen 285 NYKTSGVRRRPRKGRL 300 (916)
T ss_pred hhhhhhhhhhhhhhhH
Confidence 4677777777766655
No 278
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=50.07 E-value=1.4e+02 Score=30.36 Aligned_cols=36 Identities=17% Similarity=0.164 Sum_probs=17.1
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
+..++...+|+..++..+.+|+.+|...+..+..+.
T Consensus 294 k~~L~~~k~E~~~L~~~vesL~~ELe~~K~el~~lk 329 (522)
T PF05701_consen 294 KKELEKAKEEASSLRASVESLRSELEKEKEELERLK 329 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444455555555555555544444444444333
No 279
>PHA01750 hypothetical protein
Probab=50.01 E-value=75 Score=23.37 Aligned_cols=40 Identities=18% Similarity=0.298 Sum_probs=18.8
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le 226 (288)
+.++.+.+|+-. ..+..|+.||++++..++.|+.++.++.
T Consensus 30 q~lkdAvkeIV~--~ELdNL~~ei~~~kikqDnl~~qv~eik 69 (75)
T PHA01750 30 QALKDAVKEIVN--SELDNLKTEIEELKIKQDELSRQVEEIK 69 (75)
T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 444444444322 2233455555555555555555544443
No 280
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=49.80 E-value=1.5e+02 Score=30.06 Aligned_cols=24 Identities=21% Similarity=0.342 Sum_probs=10.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
+..++..++..+..++.++.+++.
T Consensus 65 ~~~~l~~~~~~~~~~~~~~~~l~~ 88 (475)
T PRK10361 65 LNNEVRSLQSINTSLEADLREVTT 88 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444443
No 281
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=49.68 E-value=90 Score=27.17 Aligned_cols=11 Identities=18% Similarity=0.380 Sum_probs=4.1
Q ss_pred hhHHHHHHHHH
Q psy21 205 DTEMNELRAAN 215 (288)
Q Consensus 205 ~~ei~~L~~~~ 215 (288)
+.||.+|..++
T Consensus 126 ~~eI~~L~~~i 136 (171)
T PF04799_consen 126 EDEIKQLEKEI 136 (171)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 282
>KOG0964|consensus
Probab=49.66 E-value=1.7e+02 Score=32.28 Aligned_cols=28 Identities=29% Similarity=0.366 Sum_probs=15.8
Q ss_pred CeEEEeecCCCcEEEEEeCCccceeccCeEEEEEeCC
Q psy21 75 NMEITEVDPDGKFVKLYNKGSEEQSLGNWQLVRKVQG 111 (288)
Q Consensus 75 ~v~I~e~d~~GkfV~l~N~s~~~~~L~gw~l~r~v~~ 111 (288)
.|.|.=+|+|-.|. .+..++ .|+|+||-
T Consensus 77 ~VEIvF~nsdnr~~----~~k~Ev-----~lrRtVGl 104 (1200)
T KOG0964|consen 77 SVEIVFDNSDNRLP----RGKSEV-----SLRRTVGL 104 (1200)
T ss_pred EEEEEEeCcccccC----CCCCeE-----EEEEeecc
Confidence 46666666666665 333333 56666653
No 283
>KOG3809|consensus
Probab=49.50 E-value=85 Score=31.41 Aligned_cols=19 Identities=32% Similarity=0.394 Sum_probs=9.3
Q ss_pred HhhhhhhhhhhHHHhhhhh
Q psy21 260 LMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 260 LL~vKl~Le~EIatYrkLL 278 (288)
|-++-..|+.+|+.|+-++
T Consensus 544 L~~~la~lq~~I~d~~e~i 562 (583)
T KOG3809|consen 544 LYNILANLQKEINDTKEEI 562 (583)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344444555555555444
No 284
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=49.45 E-value=92 Score=26.85 Aligned_cols=8 Identities=13% Similarity=0.102 Sum_probs=3.0
Q ss_pred hHHHhHHH
Q psy21 185 KEQSSLLL 192 (288)
Q Consensus 185 ~~~~~~~e 192 (288)
+.|+...+
T Consensus 57 kqY~~~i~ 64 (161)
T TIGR02894 57 KQYEEAIE 64 (161)
T ss_pred HHHHHHHH
Confidence 33333333
No 285
>KOG4677|consensus
Probab=48.93 E-value=1.4e+02 Score=30.03 Aligned_cols=75 Identities=17% Similarity=0.202 Sum_probs=39.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH----HHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLK----DYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~----eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
++|+.+++-...--.+.++.++.++..++..|.++|++...+......|++ +-+..++.+.-|..+...++-=.|
T Consensus 308 ~~E~ee~rve~~~s~ed~~~~q~q~~~Lrs~~~d~EAq~r~l~s~~~~q~~~~h~~ka~~~~~~~~l~~~~ec~~~e~e 386 (554)
T KOG4677|consen 308 RKEFEETRVELPFSAEDSAHIQDQYTLLRSQIIDIEAQDRHLESAGQTQIFRKHPRKASILNMPLVLTLFYECFYHETE 386 (554)
T ss_pred HHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHhhhHhhhhhhchHHHHHHHHHHHHHHH
Confidence 455555554433333444445555555555555555555555444444433 445667777777766666554443
No 286
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=48.81 E-value=2.6e+02 Score=31.99 Aligned_cols=21 Identities=14% Similarity=0.063 Sum_probs=10.6
Q ss_pred HHHhhhhhhhhcccccCCCCC
Q psy21 11 EVQKKRKTELLCMNVSGAGSS 31 (288)
Q Consensus 11 ~yrklle~ee~r~~~~~~~~~ 31 (288)
.-.+-|+.|..++.-.|.+.+
T Consensus 565 e~~~el~~e~~~~e~~~~~~P 585 (1353)
T TIGR02680 565 EERDALRTERERLEQGTDRDP 585 (1353)
T ss_pred HHHHHHHHHHhccccCCCCCC
Confidence 333445666555554444443
No 287
>KOG1029|consensus
Probab=48.79 E-value=3.9e+02 Score=29.04 Aligned_cols=12 Identities=25% Similarity=0.487 Sum_probs=7.1
Q ss_pred cEEEEEeCCccc
Q psy21 86 KFVKLYNKGSEE 97 (288)
Q Consensus 86 kfV~l~N~s~~~ 97 (288)
||-.|+|+-|+.
T Consensus 196 KY~QlFNa~Dkt 207 (1118)
T KOG1029|consen 196 KYRQLFNALDKT 207 (1118)
T ss_pred HHHHHhhhcccc
Confidence 455667766543
No 288
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=48.76 E-value=2e+02 Score=25.75 Aligned_cols=51 Identities=16% Similarity=0.310 Sum_probs=21.3
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQLK-DYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~ql~-eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
+.++...+...+.....++.....++. -..++=.++-....|+.+.+..|.
T Consensus 120 ~~ey~~~l~~~eqry~aLK~hAeekL~~ANeei~~v~~~~~~e~~aLqa~lk 171 (207)
T PF05010_consen 120 IEEYEERLKKEEQRYQALKAHAEEKLEKANEEIAQVRSKHQAELLALQASLK 171 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 334444444444444444443333322 233333344444455555554443
No 289
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=48.72 E-value=2.2e+02 Score=27.72 Aligned_cols=26 Identities=19% Similarity=0.305 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKI 265 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl 265 (288)
+.++..+..+.+-...-|..|+.-.-
T Consensus 341 ~~~~~~L~r~~~~~~~~y~~ll~r~~ 366 (444)
T TIGR03017 341 RDEMSVLQRDVENAQRAYDAAMQRYT 366 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445555555555556666666443
No 290
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=48.70 E-value=85 Score=25.99 Aligned_cols=45 Identities=16% Similarity=0.175 Sum_probs=26.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~ 249 (288)
+.|-++|+++|..|+-+....+....++...|.-||-.|.+-|..
T Consensus 24 eiERaEmkarIa~LEGE~r~~e~l~~dL~rrIkMLE~aLkqER~k 68 (134)
T PF08232_consen 24 EIERAEMKARIAFLEGERRGQENLKKDLKRRIKMLEYALKQERAK 68 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556666666666666666655555555566666665555544
No 291
>KOG2077|consensus
Probab=48.58 E-value=79 Score=32.72 Aligned_cols=50 Identities=16% Similarity=0.215 Sum_probs=23.1
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
|-.++++|...+.-|..++...+..-..++.+|..+|++|..++++....
T Consensus 327 LIakVDeL~~E~~vLrgElea~kqak~Klee~i~elEEElk~~k~ea~~a 376 (832)
T KOG2077|consen 327 LIAKVDELTCEKDVLRGELEAVKQAKLKLEEKIRELEEELKKAKAEAEDA 376 (832)
T ss_pred HHHHHHhhccHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444443333322334455555555555555544433
No 292
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=48.54 E-value=1.3e+02 Score=26.72 Aligned_cols=65 Identities=9% Similarity=0.193 Sum_probs=0.0
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
...+.++...+..+...++.++..++..+..|+.++..|+..+...+.+-..|-......+....
T Consensus 87 Ar~Al~~k~~~~~~~~~l~~~~~~~~~~v~~l~~~l~~L~~ki~~~k~k~~~l~ar~~~A~a~~~ 151 (219)
T TIGR02977 87 ARAALIEKQKAQELAEALERELAAVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQAASSRLD 151 (219)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 293
>PF15254 CCDC14: Coiled-coil domain-containing protein 14
Probab=48.46 E-value=2.1e+02 Score=30.69 Aligned_cols=45 Identities=13% Similarity=0.234 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhccccC
Q psy21 240 EAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEESS 284 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe~R 284 (288)
..--.++...|+.|..|++.|.++-.-=|.|+-.-++.++-|-.|
T Consensus 454 q~kneellk~~e~q~~Enk~~~~~~~ekd~~l~~~kq~~d~e~~r 498 (861)
T PF15254_consen 454 QSKNEELLKVIENQKEENKRLRKMFQEKDQELLENKQQFDIETTR 498 (861)
T ss_pred HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 333344555667777888888887777777777777766655443
No 294
>PF06034 DUF919: Nucleopolyhedrovirus protein of unknown function (DUF919); InterPro: IPR009265 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf29. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several short baculovirus proteins of unknown function.
Probab=48.39 E-value=62 Score=23.47 Aligned_cols=49 Identities=14% Similarity=0.350 Sum_probs=31.9
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHhhHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKELED--QIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~le~--~l~~~q~~i~~le~el~~lr~~ 249 (288)
..+|+.+++++......+..+.+..|. .++-....+..++..|..+|.+
T Consensus 3 ~~~L~~QLd~I~~~K~~l~ik~~H~Ekl~kitK~p~El~~i~~kl~~~R~~ 53 (62)
T PF06034_consen 3 NRSLTQQLDEINQMKRQLTIKSQHWEKLKKITKNPKELQEIEKKLQELRQN 53 (62)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHH
Confidence 356788888888888888887776664 3333444455555566555543
No 295
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=48.35 E-value=1.3e+02 Score=28.02 Aligned_cols=58 Identities=21% Similarity=0.252 Sum_probs=28.9
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
..+.......+...+.++..+.......+..+.++..++......++.++.+..++..
T Consensus 192 e~eke~~~r~l~~~~~ELe~~~EeL~~~Eke~~e~~~~i~e~~~rl~~l~~~~~~l~k 249 (269)
T PF05278_consen 192 EEEKEEKDRKLELKKEELEELEEELKQKEKEVKEIKERITEMKGRLGELEMESTRLSK 249 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444445555555555555555555555555555555555555444444
No 296
>TIGR00998 8a0101 efflux pump membrane protein (multidrug resistance protein A).
Probab=48.09 E-value=1e+02 Score=28.60 Aligned_cols=24 Identities=13% Similarity=0.201 Sum_probs=11.5
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
++.++..++..+..++.++...+.
T Consensus 99 ~~~~~~~~~~~i~~~~~~~~~a~~ 122 (334)
T TIGR00998 99 LEITVQQLQAKVESLKIKLEQARE 122 (334)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444555555555555544443
No 297
>PF09738 DUF2051: Double stranded RNA binding protein (DUF2051); InterPro: IPR019139 This entry represents transcriptional repressors which preferentially bind to the GC-rich consensus sequence (5'-AGCCCCCGGCG-3') and may regulate expression of TNF, EGFR and PDGFA. They may control smooth muscle cell proliferation following artery injury through PDGFA repression and may also bind double-stranded RNA. They interact with the leucine-rich repeat domain of human flightless-I (FliI) protein.
Probab=47.69 E-value=1e+02 Score=29.29 Aligned_cols=57 Identities=12% Similarity=0.210 Sum_probs=32.0
Q ss_pred HHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 195 KETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 195 ~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
..+.=++..|+.+|.+|...+..+..++.+--..++.....+..|..++..++.++.
T Consensus 108 ~~l~yqvd~Lkd~lee~eE~~~~~~re~~eK~~elEr~K~~~d~L~~e~~~Lre~L~ 164 (302)
T PF09738_consen 108 SALMYQVDLLKDKLEELEETLAQLQREYREKIRELERQKRAHDSLREELDELREQLK 164 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444555566666666666666666554444455555555566666666666553
No 298
>PF15456 Uds1: Up-regulated During Septation
Probab=47.62 E-value=1.6e+02 Score=24.20 Aligned_cols=33 Identities=21% Similarity=0.275 Sum_probs=21.1
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
++|+..++.++..|...+..+++++. |+..+++
T Consensus 21 ~eEVe~LKkEl~~L~~R~~~lr~kl~-le~k~Rd 53 (124)
T PF15456_consen 21 FEEVEELKKELRSLDSRLEYLRRKLA-LESKIRD 53 (124)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH
Confidence 56666666666666666777766665 5555544
No 299
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=46.98 E-value=2e+02 Score=25.18 Aligned_cols=22 Identities=9% Similarity=-0.142 Sum_probs=15.2
Q ss_pred EEEecCCeEecCCcEEEEEecc
Q psy21 116 SFKFHRTLKIEAGGTVTVWSFN 137 (288)
Q Consensus 116 ~y~Fp~~~~L~~g~~vtIwa~~ 137 (288)
..-|||+..-..|+....+.+.
T Consensus 6 ~~i~ppr~~~~~~~~~~q~vS~ 27 (189)
T PF10211_consen 6 NSILPPREWEEDGQLWIQFVSS 27 (189)
T ss_pred HhhCCchhhhcCCeeeEeeeCC
Confidence 3468888777777777666544
No 300
>PRK09343 prefoldin subunit beta; Provisional
Probab=46.75 E-value=1.6e+02 Score=23.89 Aligned_cols=19 Identities=11% Similarity=0.191 Sum_probs=8.1
Q ss_pred HHHHhhHHHHHHHHHHHHH
Q psy21 233 QSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~ 251 (288)
..++.-++.++..+.....
T Consensus 77 ~~r~E~ie~~ik~lekq~~ 95 (121)
T PRK09343 77 KERKELLELRSRTLEKQEK 95 (121)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444333
No 301
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=46.25 E-value=1.4e+02 Score=27.43 Aligned_cols=66 Identities=17% Similarity=0.246 Sum_probs=29.2
Q ss_pred CCCceeeecccccCccchhhheeccchh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCc-hhHHHHHHHHHH
Q psy21 145 PPHNIVMKNQKFGSGDEISTILLNTDGE-LANQERIIDNLRKEQSSLLLELKETKGRSGKY-DTEMNELRAANY 216 (288)
Q Consensus 145 pp~dlv~~~~~wg~g~~i~t~L~d~~ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l-~~ei~~L~~~~~ 216 (288)
|-.-+|.+-..||..-..... ++.+|. ++-- -....++....+...++.++..+ +.++.-++.+..
T Consensus 129 P~gl~vleR~~~G~~yG~ll~-~~~ngq~l~Gd-----~l~~eLqkr~~~v~~l~~q~~k~~~~qv~~in~qlE 196 (289)
T COG4985 129 PQGLAVLERSRGGQLYGKLLR-FDSNGQELDGD-----PLERELQKRLLEVETLRDQVDKMVEQQVRVINSQLE 196 (289)
T ss_pred cchheeeeeccCcchhhheee-eccCCCcccCc-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455554466654332222 355555 5411 11234444555555555555443 233334443333
No 302
>PF04645 DUF603: Protein of unknown function, DUF603; InterPro: IPR006739 This family includes several uncharacterised proteins from Borrelia species.
Probab=46.17 E-value=91 Score=27.21 Aligned_cols=85 Identities=15% Similarity=0.163 Sum_probs=49.1
Q ss_pred HHHhHHHHHHHHHHhh---------CCchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhhHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRS---------GKYDTEMNELRAANYRLTERNKELEDQIE-----YEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~---------~~l~~ei~~L~~~~~~Le~~l~~le~~l~-----~~q~~i~~le~el~~lr~~~~ 251 (288)
.+...+.|+..++.++ +-+..+|.++...+.-|+..|..|+..+. .....|..|+.+|..+..+.+
T Consensus 76 ~a~~~Kse~~~~r~~L~l~FI~sf~~Y~~leL~s~~~ei~~L~~kI~~L~~~in~~~k~~~n~~i~slk~EL~d~iKe~e 155 (181)
T PF04645_consen 76 EARNAKSELEMERSNLELSFIDSFNQYKNLELKSIKKEIEILRLKISSLQKEINKNKKKDLNEEIESLKSELNDLIKERE 155 (181)
T ss_pred HHHHHHhHHHHHHHHHhhHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHHHH
Confidence 3344555555555543 23456777777777777777776665543 234456677777777766554
Q ss_pred HHH--HHHHHHhhhhhhhhhh
Q psy21 252 EQL--KDYRELMEIKINLDVE 270 (288)
Q Consensus 252 ~ql--~eyq~LL~vKl~Le~E 270 (288)
-.- -=|+++...+.+-|.|
T Consensus 156 ~~emeLyyecMkkL~~a~~~e 176 (181)
T PF04645_consen 156 IREMELYYECMKKLAKAHEVE 176 (181)
T ss_pred HHHHHHHHHHHHHHhhhhhhh
Confidence 432 2456666665555554
No 303
>PF11544 Spc42p: Spindle pole body component Spc42p; InterPro: IPR021611 Spc42p is a 42kDa component of the S.cerevisiae spindle body that localises to the electron dense central region of the SPB. Spc42p is a phosphoprotein which forms a polymeric layer at the periphery of the SPB central plaque. This functions during SPB duplication and also facilitates the attachment of the SPB to the nuclear membrane. ; PDB: 2Q6Q_B.
Probab=45.90 E-value=65 Score=24.32 Aligned_cols=39 Identities=13% Similarity=0.093 Sum_probs=18.4
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
..+..+.+|+.++..-+++++.++..+...+..|+.++.
T Consensus 12 ~kL~~K~eEI~rLn~lv~sLR~KLiKYt~LnkkLq~~~~ 50 (76)
T PF11544_consen 12 KKLNDKQEEIDRLNILVGSLRGKLIKYTELNKKLQDQLL 50 (76)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555544444444444444433
No 304
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=45.88 E-value=1e+02 Score=30.48 Aligned_cols=14 Identities=29% Similarity=0.617 Sum_probs=6.5
Q ss_pred CCeEecCCcEEEEE
Q psy21 121 RTLKIEAGGTVTVW 134 (288)
Q Consensus 121 ~~~~L~~g~~vtIw 134 (288)
++|.++|+..|+|+
T Consensus 208 ~g~~V~A~g~I~I~ 221 (451)
T PF03961_consen 208 SGFSVKAGGDITIK 221 (451)
T ss_pred CCCEEEecCcEEEE
Confidence 34444444444444
No 305
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=45.64 E-value=63 Score=29.70 Aligned_cols=44 Identities=27% Similarity=0.390 Sum_probs=29.4
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELED 227 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~ 227 (288)
+......-+|+..+..++..|+.|++.|++-+-.|-+.++-++.
T Consensus 92 R~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~kLYEKiRylqS 135 (248)
T PF08172_consen 92 RQRNAELEEELRKQQQTISSLRREVESLRADNVKLYEKIRYLQS 135 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44444455566666666777778888888777777777766543
No 306
>PRK11519 tyrosine kinase; Provisional
Probab=45.46 E-value=3.9e+02 Score=28.17 Aligned_cols=6 Identities=17% Similarity=0.346 Sum_probs=2.6
Q ss_pred cCCcEE
Q psy21 126 EAGGTV 131 (288)
Q Consensus 126 ~~g~~v 131 (288)
.||..+
T Consensus 195 ~~~~~~ 200 (719)
T PRK11519 195 RPGTEF 200 (719)
T ss_pred CCCCEE
Confidence 345433
No 307
>PF04899 MbeD_MobD: MbeD/MobD like ; InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=45.31 E-value=1.2e+02 Score=22.36 Aligned_cols=10 Identities=40% Similarity=0.311 Sum_probs=4.2
Q ss_pred HHhHHHHHHH
Q psy21 187 QSSLLLELKE 196 (288)
Q Consensus 187 ~~~~~eEl~~ 196 (288)
+-++.++|+.
T Consensus 8 Ll~ale~Lq~ 17 (70)
T PF04899_consen 8 LLSALEELQQ 17 (70)
T ss_pred HHHHHHHHHH
Confidence 3344444443
No 308
>PF15030 DUF4527: Protein of unknown function (DUF4527)
Probab=45.02 E-value=1.9e+02 Score=26.77 Aligned_cols=12 Identities=33% Similarity=0.534 Sum_probs=6.9
Q ss_pred hhhhHHHhhhhh
Q psy21 267 LDVEIAQYRNLM 278 (288)
Q Consensus 267 Le~EIatYrkLL 278 (288)
=|+=.|.|-..+
T Consensus 121 ~DvALaEYaAtF 132 (277)
T PF15030_consen 121 NDVALAEYAATF 132 (277)
T ss_pred HHHHHHHHHHHh
Confidence 355566676554
No 309
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=45.01 E-value=2e+02 Score=24.67 Aligned_cols=79 Identities=24% Similarity=0.335 Sum_probs=44.2
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH--------H-------HHHHHHHhh---HHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQ--------I-------EYEQSKFAR---MEAEIKR 245 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~--------l-------~~~q~~i~~---le~el~~ 245 (288)
+..+.....||...+.++...-.+++.|...-......|.++-.. + .++|..+.- -|.+|..
T Consensus 26 R~E~~~l~~EL~evk~~v~~~I~evD~Le~~er~aR~rL~eVS~~f~~ysE~dik~AYe~A~~lQ~~L~~~re~E~qLr~ 105 (159)
T PF05384_consen 26 RQEYERLRKELEEVKEEVSEVIEEVDKLEKRERQARQRLAEVSRNFDRYSEEDIKEAYEEAHELQVRLAMLREREKQLRE 105 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555566655555555555555555544444455433221 1 123333332 3778888
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy21 246 LQTEMTEQLKDYRELME 262 (288)
Q Consensus 246 lr~~~~~ql~eyq~LL~ 262 (288)
.|.++++.++.....+.
T Consensus 106 rRD~LErrl~~l~~tie 122 (159)
T PF05384_consen 106 RRDELERRLRNLEETIE 122 (159)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 88888888887776655
No 310
>cd04779 HTH_MerR-like_sg4 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 4). Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=44.55 E-value=1.6e+02 Score=24.32 Aligned_cols=50 Identities=16% Similarity=0.210 Sum_probs=27.2
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARME 240 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le 240 (288)
..|+++.-............+...++.+..+++.++.++..++.....++
T Consensus 59 L~eI~~~l~~~~~~~~~~~~~~~~~~~l~~~i~~Le~~l~~L~~~~~~l~ 108 (134)
T cd04779 59 LAEIKDQLEEVQRSDKEQREVAQEVQLVCDQIDGLEHRLKQLKPIASQTD 108 (134)
T ss_pred HHHHHHHHHhhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555554433333344445566666666666666666655555555443
No 311
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=44.04 E-value=1.8e+02 Score=29.36 Aligned_cols=29 Identities=17% Similarity=0.271 Sum_probs=12.7
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
+...+..++.++..++.++....++...|
T Consensus 143 ~~~~~~~~~~~~~~~~~~l~~l~~~l~~l 171 (525)
T TIGR02231 143 LLTEDREAERRIRELEKQLSELQNELNAL 171 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 33344444444444444444444444333
No 312
>TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair.
Probab=43.91 E-value=1.4e+02 Score=30.78 Aligned_cols=32 Identities=9% Similarity=0.265 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy21 224 ELEDQIEYEQSKFARMEAEIKRLQTEMTEQLK 255 (288)
Q Consensus 224 ~le~~l~~~q~~i~~le~el~~lr~~~~~ql~ 255 (288)
.+..++....+.+..+..++...+.++..++.
T Consensus 223 ~lk~e~~~~~~~i~~~~~~l~~~~~~~~~~~~ 254 (555)
T TIGR03545 223 KLKKEGKADKQKIKSAKNDLQNDKKQLKADLA 254 (555)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 33344444444555554455555544444443
No 313
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=43.67 E-value=59 Score=24.01 Aligned_cols=43 Identities=19% Similarity=0.246 Sum_probs=25.4
Q ss_pred EEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 87 FVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 87 fV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
.|.|.|.|..+ .+++.+.-.+++........ ..|.||.+.+|-
T Consensus 24 ~~~V~N~G~~~--~~~~~v~~~~~~~~~~~~~i---~~L~~g~~~~v~ 66 (101)
T PF07705_consen 24 TVTVKNNGTAD--AENVTVRLYLDGNSVSTVTI---PSLAPGESETVT 66 (101)
T ss_dssp EEEEEE-SSS---BEEEEEEEEETTEEEEEEEE---SEB-TTEEEEEE
T ss_pred EEEEEECCCCC--CCCEEEEEEECCceeccEEE---CCcCCCcEEEEE
Confidence 46788888766 66777777777644333333 257777766653
No 314
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=43.64 E-value=3.1e+02 Score=28.06 Aligned_cols=7 Identities=14% Similarity=0.458 Sum_probs=2.5
Q ss_pred HHHHHHH
Q psy21 207 EMNELRA 213 (288)
Q Consensus 207 ei~~L~~ 213 (288)
+|.++..
T Consensus 356 eL~~Le~ 362 (569)
T PRK04778 356 QLESLEK 362 (569)
T ss_pred HHHHHHH
Confidence 3333333
No 315
>KOG0982|consensus
Probab=43.52 E-value=3.2e+02 Score=27.39 Aligned_cols=64 Identities=22% Similarity=0.210 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------hhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKF-------ARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i-------~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~E 270 (288)
++..++.+++.|+.+..++...++.+...+ .++..+|..++.++..-.+.|....++-++...|
T Consensus 298 e~Enlqmr~qqleeentelRs~~arlksl~dklaee~qr~sd~LE~lrlql~~eq~l~~rm~d~Lrrfq~e 368 (502)
T KOG0982|consen 298 EKENLQMRDQQLEEENTELRSLIARLKSLADKLAEEDQRSSDLLEALRLQLICEQKLRVRMNDILRRFQEE 368 (502)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444455555555555544443332222222 2233344444444444444444444444443333
No 316
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=43.39 E-value=38 Score=19.74 Aligned_cols=13 Identities=31% Similarity=0.330 Sum_probs=5.0
Q ss_pred HHHHHHHHHHHHH
Q psy21 209 NELRAANYRLTER 221 (288)
Q Consensus 209 ~~L~~~~~~Le~~ 221 (288)
+.|+..|..|+.+
T Consensus 4 ~rlr~rI~dLer~ 16 (23)
T PF04508_consen 4 NRLRNRISDLERQ 16 (23)
T ss_pred HHHHHHHHHHHHH
Confidence 3333333333333
No 317
>KOG1760|consensus
Probab=43.14 E-value=1.9e+02 Score=23.92 Aligned_cols=32 Identities=13% Similarity=0.229 Sum_probs=16.5
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTER 221 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~ 221 (288)
+.++..++-++...++.+|...+..+..|+..
T Consensus 21 ~iN~Fsrl~~R~~~lk~dik~~k~~~enledA 52 (131)
T KOG1760|consen 21 NINEFSRLNSRKDDLKADIKEAKTEIENLEDA 52 (131)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444455555555555555555555543
No 318
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=43.01 E-value=8 Score=41.68 Aligned_cols=98 Identities=18% Similarity=0.244 Sum_probs=0.0
Q ss_pred hhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhhHHHHHHHHHHHHHHHHHH
Q psy21 184 RKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQ-------SKFARMEAEIKRLQTEMTEQLKD 256 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q-------~~i~~le~el~~lr~~~~~ql~e 256 (288)
...+..+-+|+..+..++.+-...+..|.+.+..|+.+|.+|+..|.... ..-..|..+|..++..+......
T Consensus 10 ~~~l~kke~El~~~~~~~e~e~~~~~~l~k~~kelq~~i~el~eeLe~Er~~R~kaek~r~dL~~ELe~l~~~Lee~~~~ 89 (859)
T PF01576_consen 10 EEQLKKKEEELSQLNSKLEDEQALRAQLQKKIKELQARIEELEEELESERQARAKAEKQRRDLSEELEELKERLEEAGGA 89 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCc
Confidence 34455566777777777777777777788888888888887777665332 23355777788888877777777
Q ss_pred HHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 257 YRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 257 yq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
-..-..+.-..|.|++..|+-|+..
T Consensus 90 t~aq~E~~kkrE~El~~Lrr~LEe~ 114 (859)
T PF01576_consen 90 TQAQIELNKKREAELAKLRRDLEEA 114 (859)
T ss_dssp -------------------------
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 7777778888889999999888754
No 319
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=42.96 E-value=1.3e+02 Score=21.89 Aligned_cols=37 Identities=14% Similarity=0.289 Sum_probs=15.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAE 242 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~e 242 (288)
..++.+...++.|+......+.++......|..++.-
T Consensus 13 ~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n 49 (71)
T PF10779_consen 13 TKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSN 49 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344444444444444443333444444444433
No 320
>KOG4571|consensus
Probab=42.85 E-value=1.1e+02 Score=28.82 Aligned_cols=36 Identities=25% Similarity=0.266 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 217 RLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 217 ~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
.|+.++..|+.+-+.++.+..++|.|++.+|.-|..
T Consensus 252 ~l~ge~~~Le~rN~~LK~qa~~lerEI~ylKqli~e 287 (294)
T KOG4571|consen 252 ALLGELEGLEKRNEELKDQASELEREIRYLKQLILE 287 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444445555555566666666666666665543
No 321
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=42.76 E-value=46 Score=24.04 Aligned_cols=10 Identities=30% Similarity=0.258 Sum_probs=3.6
Q ss_pred HHHHHHHHHH
Q psy21 210 ELRAANYRLT 219 (288)
Q Consensus 210 ~L~~~~~~Le 219 (288)
+|+.+++.+.
T Consensus 28 ~l~~~i~~l~ 37 (80)
T PF04977_consen 28 ELQKEIEELK 37 (80)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 322
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=42.55 E-value=1.6e+02 Score=25.46 Aligned_cols=30 Identities=27% Similarity=0.417 Sum_probs=16.7
Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 228 QIEYEQSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 228 ~l~~~q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
+++.+...+...+.++..++.+.+...+||
T Consensus 162 ei~~lk~el~~~~~~~~~LkkQ~~~l~~ey 191 (192)
T PF05529_consen 162 EIEKLKKELEKKEKEIEALKKQSEGLQKEY 191 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 444444445555556666666666655555
No 323
>PF08657 DASH_Spc34: DASH complex subunit Spc34 ; InterPro: IPR013966 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=42.34 E-value=1e+02 Score=28.55 Aligned_cols=36 Identities=8% Similarity=0.047 Sum_probs=23.5
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 189 SLLLELKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
.+.+-+..++.+++.+..+|+.|++.+..-+.+|+.
T Consensus 177 ga~eki~~Lr~~y~~l~~~i~~lE~~VaeQ~~qL~~ 212 (259)
T PF08657_consen 177 GAREKIAALRQRYNQLSNSIAYLEAEVAEQEAQLER 212 (259)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666777777777777777777766665555543
No 324
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=42.30 E-value=93 Score=28.68 Aligned_cols=29 Identities=17% Similarity=0.226 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQSKFAR 238 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~~i~~ 238 (288)
.|...+.+++.+...|..++.+++..+.+
T Consensus 54 ~L~~e~~s~Q~~~~~L~~ev~~~~~~~~s 82 (247)
T COG3879 54 DLVKELRSLQKKVNTLAAEVEDLENKLDS 82 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333333333
No 325
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=42.14 E-value=2.3e+02 Score=30.20 Aligned_cols=38 Identities=32% Similarity=0.496 Sum_probs=19.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhh
Q psy21 238 RMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYR 275 (288)
Q Consensus 238 ~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYr 275 (288)
.|-.++..++..=.+.++||.+|=.--+.|..-|+.+|
T Consensus 80 ~lr~e~ke~K~rE~rll~dyselEeENislQKqvs~Lk 117 (717)
T PF09730_consen 80 RLREEIKEYKFREARLLQDYSELEEENISLQKQVSVLK 117 (717)
T ss_pred HHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 33334444444445666666666555555555444444
No 326
>PF05546 She9_MDM33: She9 / Mdm33 family; InterPro: IPR008839 Members of this family are mitochondrial inner membrane proteins with a role in inner mitochondrial membrane organisation and biogenesis []. The yeast Mdm33 protein assembles into an oligomeric complex in the inner membrane where it performs homotypic protein-protein interactions. It has been suggested that Mdm33 plays a distinct role, possibly involved in fission of the mitochondrial inner membrane [].
Probab=42.00 E-value=2.1e+02 Score=25.72 Aligned_cols=83 Identities=11% Similarity=0.194 Sum_probs=41.1
Q ss_pred HhHHHHHHHHHHhhCCch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy21 188 SSLLLELKETKGRSGKYD--TEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKI 265 (288)
Q Consensus 188 ~~~~eEl~~~~~~~~~l~--~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl 265 (288)
...+..+-..-..++++. ..|..|+..+..++..+......+..... .....-..-..-.+|-.+||.-|.
T Consensus 12 d~lq~~i~~as~~lNd~TGYs~Ie~LK~~i~~~E~~l~~~r~~~~~aK~-------~Y~~ai~~Rs~sQrEvn~LLqRK~ 84 (207)
T PF05546_consen 12 DSLQETIFTASQALNDVTGYSEIEKLKKSIEELEDELEAARQEVREAKA-------AYDDAIQQRSSSQREVNELLQRKH 84 (207)
T ss_pred HHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHhccc
Confidence 334444444444555553 56777777777777666655544433222 222222222233356677777776
Q ss_pred h-hhhhHHHhhhh
Q psy21 266 N-LDVEIAQYRNL 277 (288)
Q Consensus 266 ~-Le~EIatYrkL 277 (288)
. -+..+.-|-.|
T Consensus 85 sWs~~DleRFT~L 97 (207)
T PF05546_consen 85 SWSPADLERFTEL 97 (207)
T ss_pred CCChHHHHHHHHH
Confidence 5 33333344443
No 327
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=41.93 E-value=1.8e+02 Score=26.86 Aligned_cols=31 Identities=16% Similarity=0.289 Sum_probs=17.4
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 203 KYDTEMNELRAANYRLTERNKELEDQIEYEQ 233 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q 233 (288)
.+..++.++++....|..++..++..+.+++
T Consensus 54 ~L~~e~~s~Q~~~~~L~~ev~~~~~~~~s~~ 84 (247)
T COG3879 54 DLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555556666655555555554
No 328
>COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=41.86 E-value=3.5e+02 Score=26.62 Aligned_cols=56 Identities=20% Similarity=0.283 Sum_probs=23.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIE---YEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~---~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
..++..++.+.+.|+.++..++.++. ..+..+.+|+.+++..+.-.+..+.-||++
T Consensus 341 ~~~~~~l~~~~~~L~~~~~~l~~~~~~~~~~~~~l~~L~Re~~~~r~~ye~lL~r~qe~ 399 (458)
T COG3206 341 PNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAEAARSLYETLLQRYQEL 399 (458)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhchHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444332 223334444444444444444444444443
No 329
>PF09787 Golgin_A5: Golgin subfamily A member 5; InterPro: IPR019177 This entry represents a family of proteins involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterised: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1 [].
Probab=41.76 E-value=3e+02 Score=27.86 Aligned_cols=22 Identities=23% Similarity=0.265 Sum_probs=9.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH
Q psy21 205 DTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le 226 (288)
+.+++.|+.++..|..++++.+
T Consensus 287 ~ee~~~l~~Qi~~l~~e~~d~e 308 (511)
T PF09787_consen 287 QEEIQLLERQIEQLRAELQDLE 308 (511)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444433
No 330
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=41.74 E-value=2.4e+02 Score=24.71 Aligned_cols=16 Identities=6% Similarity=0.219 Sum_probs=6.9
Q ss_pred chhHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLT 219 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le 219 (288)
++.++.++...+..++
T Consensus 56 le~~~~~~~~~~~~~~ 71 (221)
T PF04012_consen 56 LERKLDEAEEEAEKWE 71 (221)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444444444444433
No 331
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=41.64 E-value=1.5e+02 Score=22.18 Aligned_cols=36 Identities=17% Similarity=0.314 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 217 RLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 217 ~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
.|+++|.+||.+++.-...|..|...+...+..+++
T Consensus 5 ~lE~Ri~eLE~r~AfQE~tieeLn~~laEq~~~i~k 40 (72)
T COG2900 5 ELEARIIELEIRLAFQEQTIEELNDALAEQQLVIDK 40 (72)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444443334444444444444444433
No 332
>PF00553 CBM_2: Cellulose binding domain; InterPro: IPR001919 The microbial degradation of cellulose and xylans requires several types of enzyme such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) []. Structurally, cellulases and xylanases generally consist of a catalytic domain joined to a cellulose-binding domain (CBD) by a short linker sequence rich in proline and/or hydroxy-amino acids. The CBD domain is found either at the N-terminal or at the C-terminal extremity of these enzymes. As it is shown in the following schematic representation, there are two conserved cysteines in this CBD domain - one at each extremity of the domain - which have been shown [] to be involved in a disulphide bond. There are also four conserved tryptophan, two are involved in cellulose binding. The CBD of a number of bacterial cellulases has been shown to consist of about 105 amino acid residues [, ]. +-------------------------------------------------+ | | xCxxxxWxxxxxNxxxWxxxxxxxWxxxxxxxxWNxxxxxGxxxxxxxxxxCx 'C': conserved cysteine involved in a disulphide bond. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 2CZN_A 2CWR_A 1HEH_C 1HEJ_C 3NDZ_E 3NDY_E 2XBD_A 1E5C_A 1XBD_A 1E5B_A ....
Probab=41.57 E-value=91 Score=24.11 Aligned_cols=26 Identities=27% Similarity=0.532 Sum_probs=19.5
Q ss_pred CCCcE---EEEEeCCccceeccCeEEEEEeC
Q psy21 83 PDGKF---VKLYNKGSEEQSLGNWQLVRKVQ 110 (288)
Q Consensus 83 ~~Gkf---V~l~N~s~~~~~L~gw~l~r~v~ 110 (288)
-+|.| |.|.|+++. ++.||+|.-.++
T Consensus 11 W~~Gf~~~v~v~N~~~~--~i~~W~v~~~~~ 39 (101)
T PF00553_consen 11 WGGGFQGEVTVTNNGSS--PINGWTVTFTFP 39 (101)
T ss_dssp SSSEEEEEEEEEESSSS--TEESEEEEEEES
T ss_pred cCCCeEEEEEEEECCCC--ccCCEEEEEEeC
Confidence 34445 899999885 567999987776
No 333
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=41.55 E-value=1.9e+02 Score=29.20 Aligned_cols=41 Identities=15% Similarity=0.095 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 211 LRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 211 L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
+....+.+..++.++..++......+..++.+|..++.++.
T Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~ 169 (525)
T TIGR02231 129 WFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELN 169 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333334444444444444444444444444444443
No 334
>KOG3091|consensus
Probab=41.48 E-value=3e+02 Score=27.97 Aligned_cols=38 Identities=11% Similarity=0.116 Sum_probs=22.8
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
.+....|++.++...+-+.-.+|.+++++...|..++-
T Consensus 356 ri~~i~e~v~eLqk~~ad~~~KI~~~k~r~~~Ls~RiL 393 (508)
T KOG3091|consen 356 RINAIGERVTELQKHHADAVAKIEEAKNRHVELSHRIL 393 (508)
T ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 33344555555554444556677777777777776653
No 335
>cd07909 YciF YciF bacterial stress response protein, ferritin-like iron-binding domain. YciF is a bacterial protein of unknown function that is up-regulated when bacteria experience stress conditions, and is highly conserved in a broad range of bacterial species. YciF has a ferritin-like domain. Ferritin-like, diiron-carboxylate proteins participate in a range of functions including iron regulation, mono-oxygenation, and reactive radical production. These proteins are characterized by the fact that they catalyze dioxygen-dependent oxidation-hydroxylation reactions within diiron centers; one exception is manganese catalase, which catalyzes peroxide-dependent oxidation-reduction within a dimanganese center. Diiron-carboxylate proteins are further characterized by the presence of duplicate metal ligands, glutamates and histidines (ExxH) and two additional glutamates within a four-helix bundle. Outside of these conserved residues there is little obvious homology. Members include bacterio
Probab=41.29 E-value=1.1e+02 Score=25.81 Aligned_cols=12 Identities=42% Similarity=0.498 Sum_probs=9.4
Q ss_pred hhhHHHhhhhhh
Q psy21 268 DVEIAQYRNLME 279 (288)
Q Consensus 268 e~EIatYrkLLE 279 (288)
-.|||.|..|..
T Consensus 104 HyEIA~YgtL~~ 115 (147)
T cd07909 104 HYEIAGYGTLRA 115 (147)
T ss_pred HHHHHHHHHHHH
Confidence 368999998864
No 336
>PF07139 DUF1387: Protein of unknown function (DUF1387); InterPro: IPR009816 This family represents a conserved region approximately 300 residues long within a number of hypothetical proteins of unknown function that seem to be restricted to mammals.
Probab=41.18 E-value=3.2e+02 Score=25.96 Aligned_cols=41 Identities=12% Similarity=0.303 Sum_probs=26.0
Q ss_pred HHHHHHHHHHHHHHH--HHHHHHhhhhhhhhhhHHHhhhhhhc
Q psy21 240 EAEIKRLQTEMTEQL--KDYRELMEIKINLDVEIAQYRNLMEA 280 (288)
Q Consensus 240 e~el~~lr~~~~~ql--~eyq~LL~vKl~Le~EIatYrkLLEg 280 (288)
|.+|..||++|..-. +.|.+=|..=.++-..+.+..++...
T Consensus 242 E~Ql~ELRadIK~fvs~rk~de~lg~~~rf~~d~~~l~~~i~~ 284 (302)
T PF07139_consen 242 EEQLAELRADIKHFVSERKYDEELGRAARFTCDPEQLKKSIMS 284 (302)
T ss_pred HHHHHHHHHHHHHHhhhhhhHHHHhHhhhcccCHHHHHHHHHh
Confidence 889999999996653 35666665555555555555555443
No 337
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=40.90 E-value=1.5e+02 Score=22.82 Aligned_cols=46 Identities=15% Similarity=0.271 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhhHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYE---QSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~---q~~i~~le~el~~lr~~~~ 251 (288)
.||+.++..+.++..+++.++.++... ...-..+|.|+..+...+.
T Consensus 5 ~eId~lEekl~~cr~~le~ve~rL~~~eLs~e~R~~lE~E~~~l~~~l~ 53 (85)
T PF15188_consen 5 KEIDGLEEKLAQCRRRLEAVESRLRRRELSPEARRSLEKELNELKEKLE 53 (85)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHcccCCChHHHHHHHHHHHHHHHHhh
Confidence 466777777777777777666655322 1122335555555555443
No 338
>KOG0243|consensus
Probab=40.77 E-value=2.7e+02 Score=30.86 Aligned_cols=10 Identities=40% Similarity=0.813 Sum_probs=7.6
Q ss_pred eEEEeecCCC
Q psy21 76 MEITEVDPDG 85 (288)
Q Consensus 76 v~I~e~d~~G 85 (288)
|.|.|.++.|
T Consensus 274 vhike~t~~g 283 (1041)
T KOG0243|consen 274 VHIKENTPEG 283 (1041)
T ss_pred EEEecCCCcc
Confidence 6778888883
No 339
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=40.62 E-value=1e+02 Score=25.63 Aligned_cols=44 Identities=20% Similarity=0.260 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy21 224 ELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINL 267 (288)
Q Consensus 224 ~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~L 267 (288)
+.+..+..+...+..|+.++..-+.++.....+..++......|
T Consensus 84 e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L~~~~~~n~~L 127 (131)
T PF04859_consen 84 EQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKLDELNRANKSL 127 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33334444444455555555555555555444445544444443
No 340
>PRK10636 putative ABC transporter ATP-binding protein; Provisional
Probab=40.27 E-value=1.2e+02 Score=31.51 Aligned_cols=57 Identities=12% Similarity=0.138 Sum_probs=33.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH---HH----HHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEY---EQ----SKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~---~q----~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
.++..|...+..|+.++.+++.++.+ ++ ..+..+..++..++.+++....++.+|-.
T Consensus 563 ~~~~~~e~~i~~le~~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~w~~l~~ 626 (638)
T PRK10636 563 KEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQE 626 (638)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCchhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35555666666666666666665532 11 14566666677777777666666666554
No 341
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=40.04 E-value=9.5 Score=41.12 Aligned_cols=83 Identities=24% Similarity=0.328 Sum_probs=0.0
Q ss_pred HHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh
Q psy21 197 TKGRSGKYDTEMNELRAANYRLTERNKELEDQI-------EYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDV 269 (288)
Q Consensus 197 ~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l-------~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~ 269 (288)
++..+..+..++..+.+.+..|..+|.++..++ ..+......|+.++..++..+...-..++.+-+.++.|++
T Consensus 417 Lk~~lee~~e~~e~lere~k~L~~El~dl~~q~~~~~k~v~eLek~kr~LE~e~~El~~~leE~E~~l~~~E~~~lRl~~ 496 (859)
T PF01576_consen 417 LKNELEELQEQLEELERENKQLQDELEDLTSQLDDAGKSVHELEKAKRRLEQEKEELQEQLEEAEDALEAEEQKKLRLQV 496 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHhhccchhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444455555555555555555544333 3345555666777777777777776777778888888888
Q ss_pred hHHHhhhhhh
Q psy21 270 EIAQYRNLME 279 (288)
Q Consensus 270 EIatYrkLLE 279 (288)
++..+|.=++
T Consensus 497 el~~~r~e~e 506 (859)
T PF01576_consen 497 ELQQLRQEIE 506 (859)
T ss_dssp ----------
T ss_pred HHHHHHHHHH
Confidence 8888876553
No 342
>PF07407 Seadorna_VP6: Seadornavirus VP6 protein; InterPro: IPR009982 This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown.
Probab=39.88 E-value=1.8e+02 Score=28.05 Aligned_cols=32 Identities=19% Similarity=0.119 Sum_probs=21.2
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERN 222 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l 222 (288)
..|+-.+++...+|+.|+++|+..+.+||++.
T Consensus 31 ~~e~~aLr~EN~~LKkEN~~Lk~eVerLE~e~ 62 (420)
T PF07407_consen 31 IDENFALRMENHSLKKENNDLKIEVERLENEM 62 (420)
T ss_pred hhhhhhHHHHhHHHHHHHHHHHHHHHHHHHHh
Confidence 34555666666667777777777777776654
No 343
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=39.85 E-value=2.9e+02 Score=28.88 Aligned_cols=27 Identities=11% Similarity=0.346 Sum_probs=12.2
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 233 QSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
+.+++.++..+..++..+...-.|++.
T Consensus 201 ~~kl~~l~~~l~~~~e~le~K~qE~~~ 227 (617)
T PF15070_consen 201 QKKLGELQEKLHNLKEKLELKSQEAQS 227 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHH
Confidence 344445555555555444443333333
No 344
>KOG0999|consensus
Probab=39.70 E-value=2.8e+02 Score=28.82 Aligned_cols=26 Identities=23% Similarity=0.194 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy21 242 EIKRLQTEMTEQLKDYRELMEIKINL 267 (288)
Q Consensus 242 el~~lr~~~~~ql~eyq~LL~vKl~L 267 (288)
+|...+.+.++....++++-+.+..+
T Consensus 122 el~~~q~E~erl~~~~sd~~e~~~~~ 147 (772)
T KOG0999|consen 122 ELTNVQEENERLEKVHSDLKESNAAV 147 (772)
T ss_pred HHHHHHHHHHHHHHHHHHhhhcchhh
Confidence 33333334444444555555444433
No 345
>KOG0946|consensus
Probab=39.66 E-value=4.3e+02 Score=28.74 Aligned_cols=43 Identities=19% Similarity=0.364 Sum_probs=16.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
.+++.++.....|+.+.++++..+.+....++.+..++..++.
T Consensus 671 ~~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~ 713 (970)
T KOG0946|consen 671 YQIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQLDLLKN 713 (970)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333333333333333333333333333
No 346
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=39.51 E-value=1.5e+02 Score=21.58 Aligned_cols=43 Identities=12% Similarity=0.278 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
+..++..+..++..+..+|...+.....+..+..+|..+....
T Consensus 8 l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n~ 50 (71)
T PF10779_consen 8 LNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSNT 50 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333344333333333444444444444433
No 347
>KOG0709|consensus
Probab=39.18 E-value=66 Score=32.26 Aligned_cols=48 Identities=19% Similarity=0.142 Sum_probs=33.5
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFA 237 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~ 237 (288)
+++-+..+..++.....|=++|+++++.|+.+++.|-.+|..+|+.+.
T Consensus 270 KkeYid~LE~rv~~~taeNqeL~kkV~~Le~~N~sLl~qL~klQt~v~ 317 (472)
T KOG0709|consen 270 KKEYIDGLESRVSAFTAENQELQKKVEELELSNRSLLAQLKKLQTLVI 317 (472)
T ss_pred HhhHHHHHhhhhhhcccCcHHHHHHHHHHhhccHHHHHHHHHHHHHHh
Confidence 445555566666666677778888888888887777777777776554
No 348
>PHA03011 hypothetical protein; Provisional
Probab=39.16 E-value=1.7e+02 Score=23.38 Aligned_cols=47 Identities=23% Similarity=0.284 Sum_probs=20.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
|..+-++|...-+-++.++..++.-+++--+.|.-|-+++.+++..+
T Consensus 69 L~~qYN~L~dEYn~i~Ne~k~~~~iIQdn~d~I~~LraeIDkLK~ni 115 (120)
T PHA03011 69 LIAQYNELLDEYNLIENEIKDLEIIIQDNDDEIHFLRAEIDKLKENI 115 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444433333333444444444444443
No 349
>PRK10246 exonuclease subunit SbcC; Provisional
Probab=38.94 E-value=4.2e+02 Score=29.41 Aligned_cols=18 Identities=11% Similarity=0.209 Sum_probs=13.6
Q ss_pred hHHHhhhhh-hccccCCCC
Q psy21 270 EIAQYRNLM-EAEESSLPT 287 (288)
Q Consensus 270 EIatYrkLL-EgEe~Rl~~ 287 (288)
.++.||.-| +|+.|.+|+
T Consensus 492 ~~~~Lr~~L~~GePCPVCG 510 (1047)
T PRK10246 492 DLEAQRAQLQAGQPCPLCG 510 (1047)
T ss_pred HHHHHHHhCCCCCCcCCCC
Confidence 356688666 899999886
No 350
>PF04314 DUF461: Protein of unknown function (DUF461); InterPro: IPR007410 This entry represents a domain found in of proteins of unknown function, including DR1885 from Deinococcus radiodurans and CC3502 from Caulobacter crescentus (Caulobacter vibrioides), which share a potential metal binding motif H(M)X10MX21HXM. DR1885 was found to bind copper(I) through a histidine and three Mets in a cupredoxin-like fold []. The surface location of the copper-binding site as well as the type of coordination are well poised for metal transfer chemistry, suggesting that DR1885 might transfer copper, taking the role of Cox17 in bacteria (Cox17 being an accessory protein required for correct assembly of eukaryotic cyochrome c oxidase). ; PDB: 2K6W_A 2K6Z_A 2K6Y_A 2K70_A 1X9L_A 2JQA_A.
Probab=38.91 E-value=1.2e+02 Score=23.91 Aligned_cols=78 Identities=23% Similarity=0.316 Sum_probs=38.5
Q ss_pred cEEEEEeCCccceeccCe--------EEEEEeCC-cccEEEEecCCeEecCCcEEEEEecccCCCCCCCCCceeeecc-c
Q psy21 86 KFVKLYNKGSEEQSLGNW--------QLVRKVQG-EVKTSFKFHRTLKIEAGGTVTVWSFNVEGATHEPPHNIVMKNQ-K 155 (288)
Q Consensus 86 kfV~l~N~s~~~~~L~gw--------~l~r~v~~-~~~~~y~Fp~~~~L~~g~~vtIwa~~~~~~~~~pp~dlv~~~~-~ 155 (288)
-|..|.|+++++..|-|= .|.+.+.. +....-.. +.+.|+||++|.. +.| + ..-++++-. .
T Consensus 18 ~y~ti~N~g~~~~~L~~v~s~~a~~v~lh~~~~~~g~~~m~~v-~~i~ipa~~~v~l-~pg---g----~HlmL~g~~~~ 88 (110)
T PF04314_consen 18 AYFTITNNGDQDDRLVGVSSPAAARVELHETVMEDGVMKMRPV-DSIPIPAGSTVEL-KPG---G----YHLMLMGLKRP 88 (110)
T ss_dssp EEEEEE-CSSSEEEEEEEE-TTCCEEEEEEECCCCCEEEECCS-S-EEEETT-EEEE--CC---C----CEEEEECESS-
T ss_pred EEEEEEeCCCCCeEEEEEEcCCCceEEEEEEEccCCeEEEEEC-CCEEECCCCeEEe-cCC---C----EEEEEeCCccc
Confidence 488888888877655443 23333321 11222233 3688888888876 211 1 233344444 5
Q ss_pred ccCccchhhheeccchh
Q psy21 156 FGSGDEISTILLNTDGE 172 (288)
Q Consensus 156 wg~g~~i~t~L~d~~ge 172 (288)
+..|+.+.-+|.=.+|.
T Consensus 89 l~~G~~v~ltL~f~~gg 105 (110)
T PF04314_consen 89 LKPGDTVPLTLTFEDGG 105 (110)
T ss_dssp B-TTEEEEEEEEETTTE
T ss_pred CCCCCEEEEEEEECCCC
Confidence 66777776666655443
No 351
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=38.90 E-value=1.8e+02 Score=22.35 Aligned_cols=19 Identities=11% Similarity=0.190 Sum_probs=7.6
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKE 224 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~ 224 (288)
..+..+......++.+...
T Consensus 21 ~~~~~l~~~~~~l~~~~~~ 39 (127)
T smart00502 21 DALKQLISIIQEVEENAAD 39 (127)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444333
No 352
>KOG3091|consensus
Probab=38.84 E-value=2.6e+02 Score=28.35 Aligned_cols=26 Identities=19% Similarity=0.160 Sum_probs=13.7
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELR 212 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~ 212 (288)
++...++++..+.++.....++.+|+
T Consensus 343 ~K~Q~q~~~~~r~ri~~i~e~v~eLq 368 (508)
T KOG3091|consen 343 LKVQDQEVKQHRIRINAIGERVTELQ 368 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344455555555555555555555
No 353
>PF05483 SCP-1: Synaptonemal complex protein 1 (SCP-1); InterPro: IPR008827 Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase [].; GO: 0007130 synaptonemal complex assembly, 0000795 synaptonemal complex
Probab=38.63 E-value=5.1e+02 Score=27.57 Aligned_cols=40 Identities=18% Similarity=0.292 Sum_probs=23.3
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
+..+.-.|..|+.++..++.........|+.=+..|-.++
T Consensus 631 ~~~~eikVn~L~~E~e~~kk~~eE~~~~~~keie~K~~~e 670 (786)
T PF05483_consen 631 SNVYEIKVNKLQEELENLKKKHEEETDKYQKEIESKSISE 670 (786)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhhhhH
Confidence 3445566666666666666666555556665555554443
No 354
>KOG0981|consensus
Probab=38.60 E-value=44 Score=34.50 Aligned_cols=18 Identities=17% Similarity=0.324 Sum_probs=8.3
Q ss_pred cccCccchhhheeccchh
Q psy21 155 KFGSGDEISTILLNTDGE 172 (288)
Q Consensus 155 ~wg~g~~i~t~L~d~~ge 172 (288)
+++-.+..+++|||.+.-
T Consensus 612 ~YnrANr~VAIlCNHQR~ 629 (759)
T KOG0981|consen 612 SYNRANRTVAILCNHQRA 629 (759)
T ss_pred HHhhccceeeeeeccccc
Confidence 344444445555554444
No 355
>KOG0239|consensus
Probab=38.42 E-value=3.2e+02 Score=28.92 Aligned_cols=14 Identities=0% Similarity=-0.130 Sum_probs=6.0
Q ss_pred HHHhhhhhhhhccc
Q psy21 11 EVQKKRKTELLCMN 24 (288)
Q Consensus 11 ~yrklle~ee~r~~ 24 (288)
+++.-+...+.|.+
T Consensus 16 ~~~~~~~~~~~~~~ 29 (670)
T KOG0239|consen 16 TSDDTLSNPKKRFE 29 (670)
T ss_pred cccccccccccccC
Confidence 33344444444444
No 356
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=38.27 E-value=75 Score=24.00 Aligned_cols=18 Identities=11% Similarity=0.074 Sum_probs=7.4
Q ss_pred hHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNK 223 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~ 223 (288)
..+.+|+.+++.....+.
T Consensus 28 ~~~~~lk~Klq~ar~~i~ 45 (83)
T PF07544_consen 28 TATGSLKHKLQKARAAIR 45 (83)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 334444444444444333
No 357
>PF14326 DUF4384: Domain of unknown function (DUF4384)
Probab=38.20 E-value=1.3e+02 Score=22.40 Aligned_cols=31 Identities=32% Similarity=0.508 Sum_probs=26.6
Q ss_pred ceeEEeeCCCCCeEEEeecCCCcEEEEEeCC
Q psy21 64 SGFQVTSSAKGNMEITEVDPDGKFVKLYNKG 94 (288)
Q Consensus 64 ~~~~~~~sa~G~v~I~e~d~~GkfV~l~N~s 94 (288)
.++.++.+..|-|.|-.+|++|+...|.=+.
T Consensus 9 v~~~~~~~~~~Yl~l~~~~~~G~v~~L~Pn~ 39 (83)
T PF14326_consen 9 VRFRVTSNRDGYLYLFYIDADGKVTLLFPNR 39 (83)
T ss_pred EEEEEEeCCCeEEEEEEECCCCCEEEEecCc
Confidence 5678888999999999999999999998443
No 358
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=38.08 E-value=1.1e+02 Score=20.90 Aligned_cols=21 Identities=24% Similarity=0.376 Sum_probs=9.0
Q ss_pred chhHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~ 224 (288)
++.++..|...+..|..++..
T Consensus 30 le~~~~~L~~en~~L~~~i~~ 50 (54)
T PF07716_consen 30 LEQEVQELEEENEQLRQEIAQ 50 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444333
No 359
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=37.92 E-value=93 Score=22.67 Aligned_cols=55 Identities=16% Similarity=0.263 Sum_probs=0.0
Q ss_pred HHhhCCchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 198 KGRSGKYDTEMNELRAANYRLTERNKELE-DQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 198 ~~~~~~l~~ei~~L~~~~~~Le~~l~~le-~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
+..+...+..|++....+..++.++.++- ..-..++.++..+..++..++.++.+
T Consensus 24 ~~~i~~~e~~l~ea~~~l~qMe~E~~~~p~s~r~~~~~kl~~yr~~l~~lk~~l~~ 79 (79)
T PF05008_consen 24 KSLIREIERDLDEAEELLKQMELEVRSLPPSERNQYKSKLRSYRSELKKLKKELKK 79 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHCTS-HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHhcC
No 360
>KOG0243|consensus
Probab=37.84 E-value=3.4e+02 Score=30.16 Aligned_cols=25 Identities=4% Similarity=0.263 Sum_probs=10.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 238 RMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 238 ~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
.++..|+..-.+|...-.+|+.+-.
T Consensus 487 ~~k~~L~~~~~el~~~~ee~~~~~~ 511 (1041)
T KOG0243|consen 487 KLKSKLQNKNKELESLKEELQQAKA 511 (1041)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444333
No 361
>PF08702 Fib_alpha: Fibrinogen alpha/beta chain family; InterPro: IPR012290 Fibrinogen plays key roles in both blood clotting and platelet aggregation. During blood clot formation, the conversion of soluble fibrinogen to insoluble fibrin is triggered by thrombin, resulting in the polymerisation of fibrin, which forms a soft clot; this is then converted to a hard clot by factor XIIIA, which cross-links fibrin molecules. Platelet aggregation involves the binding of the platelet protein receptor integrin alpha(IIb)-beta(3) to the C-terminal D domain of fibrinogen []. In addition to platelet aggregation, platelet-fibrinogen interaction mediates both adhesion and fibrin clot retraction. Fibrinogen occurs as a dimer, where each monomer is composed of three non-identical chains, alpha, beta and gamma, linked together by several disulphide bonds []. The N-terminals of all six chains come together to form the centre of the molecule (E domain), from which the monomers extend in opposite directions as coiled coils, followed by C-terminal globular domains (D domains). Therefore, the domain composition is: D-coil-E-coil-D. At each end, the C-terminal of the alpha chain extends beyond the D domain as a protuberance that is important for cross-linking the molecule. During clot formation, the N-terminal fragments of the alpha and beta chains (within the E domain) in fibrinogen are cleaved by thrombin, releasing fibrinopeptides A and B, respectively, and producing fibrin. This cleavage results in the exposure of four binding sites on the E domain, each of which can bind to a D domain from different fibrin molecules. The binding of fibrin molecules produces a polymer consisting of a lattice network of fibrins that form a long, branching, flexible fibre [, ]. Fibrin fibres interact with platelets to increase the size of the clot, as well as with several different proteins and cells, thereby promoting the inflammatory response and concentrating the cells required for wound repair at the site of damage. This entry represents the coiled-coil domain and part of the N-terminal E domain found in all three fibrinogen polypeptides, namely the alpha, beta and gamma chains. More information about these proteins can be found at Protein of the Month: Fibrinogen [].; GO: 0005102 receptor binding, 0030674 protein binding, bridging, 0007165 signal transduction, 0030168 platelet activation, 0051258 protein polymerization, 0005577 fibrinogen complex; PDB: 1LWU_D 1N73_D 1M1J_B 1JY2_R 1JY3_R 1RF0_A 2H43_D 1RE4_D 2XNY_D 2HPC_D ....
Probab=37.79 E-value=2.2e+02 Score=23.96 Aligned_cols=29 Identities=21% Similarity=0.298 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy21 226 EDQIEYEQSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 226 e~~l~~~q~~i~~le~el~~lr~~~~~ql 254 (288)
...+..+|..+......+.+|+.++..+.
T Consensus 103 ~~~I~~Lq~~~~~~~~ki~~Le~~i~~~~ 131 (146)
T PF08702_consen 103 PSNIRVLQNILRSNRQKIQRLEQDIDQQE 131 (146)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555555555555555555555443
No 362
>PF08581 Tup_N: Tup N-terminal; InterPro: IPR013890 The N-terminal region of the Tup protein has been shown to interact with the Ssn6 transcriptional co-repressor []. ; PDB: 3VP9_B 3VP8_B.
Probab=37.77 E-value=1.8e+02 Score=22.00 Aligned_cols=18 Identities=33% Similarity=0.468 Sum_probs=11.5
Q ss_pred hhhhhhhhHHHhhhhhhc
Q psy21 263 IKINLDVEIAQYRNLMEA 280 (288)
Q Consensus 263 vKl~Le~EIatYrkLLEg 280 (288)
+|-.-|-||+-.++-|+.
T Consensus 58 mK~~YEeEI~rLr~eLe~ 75 (79)
T PF08581_consen 58 MKQQYEEEIARLRRELEQ 75 (79)
T ss_dssp HHHHHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 444556678777776653
No 363
>KOG0999|consensus
Probab=37.58 E-value=2.2e+02 Score=29.55 Aligned_cols=38 Identities=29% Similarity=0.490 Sum_probs=24.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhh
Q psy21 238 RMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYR 275 (288)
Q Consensus 238 ~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYr 275 (288)
+|..+|...+..-++.+.||.+|=--...|..-++..|
T Consensus 153 rlr~elKe~KfRE~RllseYSELEEENIsLQKqVs~LR 190 (772)
T KOG0999|consen 153 RLRDELKEYKFREARLLSEYSELEEENISLQKQVSNLR 190 (772)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHh
Confidence 34455555555557788888888776666666666555
No 364
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=37.43 E-value=2.8e+02 Score=24.20 Aligned_cols=11 Identities=18% Similarity=0.117 Sum_probs=5.1
Q ss_pred HHhHHHHHHHH
Q psy21 187 QSSLLLELKET 197 (288)
Q Consensus 187 ~~~~~eEl~~~ 197 (288)
|...+.++...
T Consensus 28 Yr~rc~ele~~ 38 (182)
T PF15035_consen 28 YRKRCAELEQQ 38 (182)
T ss_pred HHHHHHHHHHH
Confidence 44445554443
No 365
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=37.35 E-value=1.3e+02 Score=21.59 Aligned_cols=22 Identities=27% Similarity=0.234 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l 229 (288)
+.+|...|..|+.+|..++..+
T Consensus 23 v~EL~~RIa~L~aEI~R~~~~~ 44 (59)
T PF06698_consen 23 VEELEERIALLEAEIARLEAAI 44 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555554444443
No 366
>KOG0982|consensus
Probab=37.16 E-value=4.4e+02 Score=26.42 Aligned_cols=11 Identities=45% Similarity=0.501 Sum_probs=4.0
Q ss_pred HHHHHHHHHHH
Q psy21 209 NELRAANYRLT 219 (288)
Q Consensus 209 ~~L~~~~~~Le 219 (288)
.+|+..+..|.
T Consensus 314 telRs~~arlk 324 (502)
T KOG0982|consen 314 TELRSLIARLK 324 (502)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 367
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=37.12 E-value=2.9e+02 Score=24.32 Aligned_cols=27 Identities=26% Similarity=0.425 Sum_probs=11.6
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
.++.+..++.+++.+++++...+..|.
T Consensus 13 ki~~L~n~l~elq~~l~~l~~ENk~Lk 39 (194)
T PF15619_consen 13 KIKELQNELAELQRKLQELRKENKTLK 39 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444333
No 368
>KOG0994|consensus
Probab=37.06 E-value=1.6e+02 Score=33.27 Aligned_cols=15 Identities=13% Similarity=0.087 Sum_probs=5.9
Q ss_pred hhhhHHHHhhhhhhh
Q psy21 6 ETRIEEVQKKRKTEL 20 (288)
Q Consensus 6 d~ei~~yrklle~ee 20 (288)
|.+-.--+|+-|.|+
T Consensus 1419 ~~~~~l~~~~ae~eq 1433 (1758)
T KOG0994|consen 1419 DADTQLRSKLAEAEQ 1433 (1758)
T ss_pred hHHHHHHHHHHHHHH
Confidence 333333344444443
No 369
>KOG0612|consensus
Probab=37.04 E-value=4e+02 Score=30.20 Aligned_cols=17 Identities=12% Similarity=0.141 Sum_probs=12.1
Q ss_pred hHHHHhhhhhhhhccccc
Q psy21 9 IEEVQKKRKTELLCMNVS 26 (288)
Q Consensus 9 i~~yrklle~ee~r~~~~ 26 (288)
..+|.|.+... ++|+++
T Consensus 287 veTY~KIm~hk-~~l~FP 303 (1317)
T KOG0612|consen 287 VETYGKIMNHK-ESLSFP 303 (1317)
T ss_pred HHHHHHHhchh-hhcCCC
Confidence 46899998873 456665
No 370
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=37.00 E-value=2.6e+02 Score=23.64 Aligned_cols=32 Identities=22% Similarity=0.232 Sum_probs=15.0
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~ 224 (288)
++.++..+++-++.+++.|++.+..|...+.+
T Consensus 7 ~le~l~a~lq~l~~qie~L~~~i~~l~~~~~e 38 (145)
T COG1730 7 ELEELAAQLQILQSQIESLQAQIAALNAAISE 38 (145)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444555555544444444443
No 371
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=36.96 E-value=4.3e+02 Score=28.48 Aligned_cols=25 Identities=4% Similarity=0.020 Sum_probs=9.2
Q ss_pred HHHHHhhCCchhHHHHHHHHHHHHH
Q psy21 195 KETKGRSGKYDTEMNELRAANYRLT 219 (288)
Q Consensus 195 ~~~~~~~~~l~~ei~~L~~~~~~Le 219 (288)
..++..+.-++..+...+.....|.
T Consensus 318 ~d~r~hi~~lkesl~~ke~~~~~Lq 342 (775)
T PF10174_consen 318 SDMRQHIEVLKESLRAKEQEAEMLQ 342 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333333333
No 372
>PRK00106 hypothetical protein; Provisional
Probab=36.86 E-value=4.9e+02 Score=26.78 Aligned_cols=26 Identities=12% Similarity=0.216 Sum_probs=11.6
Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 235 KFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 235 ~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
.+...+.+|.....++..+..+|..+
T Consensus 126 eLe~kekeLe~reeeLee~~~~~~~~ 151 (535)
T PRK00106 126 TLESKEQSLTDKSKHIDEREEQVEKL 151 (535)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444443
No 373
>KOG4674|consensus
Probab=36.68 E-value=2.8e+02 Score=32.74 Aligned_cols=48 Identities=23% Similarity=0.291 Sum_probs=23.2
Q ss_pred hHHHhHHHHHHHHHHh----hCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGR----SGKYDTEMNELRAANYRLTERNKELEDQIEYE 232 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~----~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~ 232 (288)
..+.+..+++..++.+ +..++++|.++......++..+..+..+...+
T Consensus 199 ~eL~~~~ekll~~~re~s~~~~~L~~~L~~~~~~~~~~q~~~~~l~q~~~eL 250 (1822)
T KOG4674|consen 199 RELSKVNEKLLSLRREHSIEVEQLEEKLSDLKESLAELQEKNKSLKQQNEEL 250 (1822)
T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555 33345555555555555555444444333333
No 374
>KOG4593|consensus
Probab=36.66 E-value=5.5e+02 Score=27.31 Aligned_cols=32 Identities=28% Similarity=0.444 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHH
Q psy21 242 EIKRLQTEMTEQLKDYRELMEIKINLDVEIAQ 273 (288)
Q Consensus 242 el~~lr~~~~~ql~eyq~LL~vKl~Le~EIat 273 (288)
++..+...|..++++...|----+.+--|.++
T Consensus 238 ele~i~~~~~dqlqel~~l~~a~~q~~ee~~~ 269 (716)
T KOG4593|consen 238 ELEAINKNMKDQLQELEELERALSQLREELAT 269 (716)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555544444333333334443
No 375
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=36.60 E-value=2e+02 Score=28.39 Aligned_cols=62 Identities=26% Similarity=0.366 Sum_probs=29.3
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKE----------LEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~----------le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
.+++-.+-.+...+..+++.|+++...+..++.. +..+...+...|..++.++..+..++..
T Consensus 27 vd~i~~ld~~~r~l~~~~~~lr~~rn~~sk~i~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~ 98 (425)
T PRK05431 27 VDELLELDEERRELQTELEELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEE 98 (425)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444445555555555444444432 2233344455555556555555555544
No 376
>cd07602 BAR_RhoGAP_OPHN1-like The Bin/Amphiphysin/Rvs (BAR) domain of Oligophrenin1-like Rho GTPase Activating Proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. This subfamily is composed of Rho and Rac GTPase activating proteins (GAPs) with similarity to oligophrenin1 (OPHN1). Members contain an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, and a Rho GAP domain. Some members contain a C-terminal SH3 domain. Vertebrates harbor at least three Rho GAPs in this subfamily including OPHN1, GTPase Regulator Associated with Focal adhesion kinase (GRAF), GRAF2, and an uncharacterized protein called GAP10-like. OPHN1, GRAF and GRAF2 show GAP activity towards RhoA and Cdc42. In addition, OPHN1 is active towards Rac. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The BAR domains of OPHN1
Probab=36.26 E-value=3e+02 Score=24.62 Aligned_cols=21 Identities=19% Similarity=0.207 Sum_probs=10.5
Q ss_pred CchhHHHHHHHHHHHHHHHHH
Q psy21 203 KYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~ 223 (288)
+.+.++.+++..+..|-..+.
T Consensus 6 ~~E~ele~l~~~ikkLiK~ck 26 (207)
T cd07602 6 EHEAELERTNKAIKELIKECK 26 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555544443
No 377
>PF06483 ChiC: Chitinase C; InterPro: IPR009470 This ~170 aa region is found at the C-terminal to the catalytic domain (IPR001223 from INTERPRO) found in members of glycoside hydrolase family 18.
Probab=36.19 E-value=77 Score=27.78 Aligned_cols=22 Identities=18% Similarity=0.258 Sum_probs=17.1
Q ss_pred ccEEEEecCCeEecCCcEEEEE
Q psy21 113 VKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 113 ~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
..+.|++|.--.|.||.+|.+=
T Consensus 116 Hrvs~tlp~wqslapG~s~~~~ 137 (180)
T PF06483_consen 116 HRVSFTLPAWQSLAPGASVELD 137 (180)
T ss_pred EEEEEECCCccccCCCCEEEEe
Confidence 3567888887889999888763
No 378
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=36.14 E-value=3.2e+02 Score=24.46 Aligned_cols=30 Identities=17% Similarity=0.113 Sum_probs=13.9
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
...++.++..+......+...+..++..+.
T Consensus 25 ~~~l~~k~~e~~~~~~~m~~i~~e~Ek~i~ 54 (207)
T PF05010_consen 25 EQELKKKYEELHKENQEMRKIMEEYEKTIA 54 (207)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444444443
No 379
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=35.95 E-value=1.1e+02 Score=27.74 Aligned_cols=10 Identities=20% Similarity=0.252 Sum_probs=4.7
Q ss_pred eccCeEEEEE
Q psy21 99 SLGNWQLVRK 108 (288)
Q Consensus 99 ~L~gw~l~r~ 108 (288)
..||+.....
T Consensus 74 ~~gG~i~~~~ 83 (262)
T PF14257_consen 74 SYGGYIESSS 83 (262)
T ss_pred HcCCEEEEEe
Confidence 3455544443
No 380
>PF07160 DUF1395: Protein of unknown function (DUF1395); InterPro: IPR009829 This family consists of several hypothetical eukaryotic proteins of around 250 residues in length. The function of this family is unknown.; PDB: 4AJ5_G.
Probab=35.91 E-value=2.6e+02 Score=25.57 Aligned_cols=24 Identities=8% Similarity=0.263 Sum_probs=10.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
.-|..+...+..++..+++++..+
T Consensus 22 ~~L~~i~~~~~~i~~~l~~~~~~l 45 (243)
T PF07160_consen 22 DTLSKIDQEVSAIEELLNDIEQEL 45 (243)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444433
No 381
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=35.89 E-value=1.9e+02 Score=32.53 Aligned_cols=27 Identities=30% Similarity=0.543 Sum_probs=14.2
Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy21 230 EYEQSKFARMEAEIKRLQTEMTEQLKD 256 (288)
Q Consensus 230 ~~~q~~i~~le~el~~lr~~~~~ql~e 256 (288)
+..+..+..++.++..+..++...+.+
T Consensus 681 ~~~~~~l~~l~~~l~~~~~e~~~~~~~ 707 (1201)
T PF12128_consen 681 EQIEEQLNELEEELKQLKQELEELLEE 707 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555566665555555544433
No 382
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=35.88 E-value=54 Score=26.00 Aligned_cols=17 Identities=24% Similarity=0.178 Sum_probs=6.4
Q ss_pred hHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERN 222 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l 222 (288)
.+++.|+.+++.|..++
T Consensus 41 ~e~~~l~~~n~~L~~eI 57 (105)
T PRK00888 41 QTNAKLKARNDQLFAEI 57 (105)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33333333333333333
No 383
>KOG0998|consensus
Probab=35.76 E-value=98 Score=33.54 Aligned_cols=78 Identities=15% Similarity=0.243 Sum_probs=42.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH---HHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 205 DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQL---KDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 205 ~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql---~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
..+|..+...+..++.+...++.-+......+..+..+|..+..+|.+.. .....-+++-+.+..+...++++..+.
T Consensus 504 ~~ei~~~~~~ln~~~qq~~~l~~~v~~~~~~ve~l~~~L~~~~~~~~~~~s~~~~l~~~~~~~~~~~~~~~~~~k~~n~~ 583 (847)
T KOG0998|consen 504 NREISSLEKELNELQQQLSVLEGSVKAIESQVENLQKELLDLIYEMADTRSKSTLLDDSFKVGMELFEQLLKGSKLVNGK 583 (847)
T ss_pred hhhHHHHHHHHhhhHHHHhHHhhhhhhhhhhhhhhHhHHHHHHHHHHhhcccchhhhhhhhhhhhhhhhhhhhhhccccc
Confidence 34666666666666665554544444444446666666666666665432 233334444445555556666666554
Q ss_pred c
Q psy21 282 E 282 (288)
Q Consensus 282 e 282 (288)
.
T Consensus 584 ~ 584 (847)
T KOG0998|consen 584 D 584 (847)
T ss_pred c
Confidence 3
No 384
>PRK14143 heat shock protein GrpE; Provisional
Probab=35.69 E-value=2.2e+02 Score=25.99 Aligned_cols=32 Identities=13% Similarity=0.231 Sum_probs=17.1
Q ss_pred HHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 197 TKGRSGKYDTEMNELRAANYRLTERNKELEDQ 228 (288)
Q Consensus 197 ~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~ 228 (288)
+..++..++.++.+|+.+..++.+...++..+
T Consensus 72 l~~el~~l~~e~~elkd~~lR~~AdfeN~RKR 103 (238)
T PRK14143 72 LEQELESLKQELEELNSQYMRIAADFDNFRKR 103 (238)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344445556666666655666555554443
No 385
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=35.44 E-value=82 Score=23.79 Aligned_cols=23 Identities=17% Similarity=0.494 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy21 239 MEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 239 le~el~~lr~~~~~ql~eyq~LL 261 (288)
|+.+|+.+.+++....++||..-
T Consensus 12 Lk~eiqkle~ELq~~~~~~qIk~ 34 (76)
T PF07334_consen 12 LKEEIQKLEAELQQNKREFQIKE 34 (76)
T ss_pred HHHHHHHHHHHHHHHHhhhhhcc
Confidence 34444444444444445555443
No 386
>COG5633 Predicted periplasmic lipoprotein [General function prediction only]
Probab=35.36 E-value=1.2e+02 Score=24.80 Aligned_cols=61 Identities=16% Similarity=0.194 Sum_probs=37.1
Q ss_pred eEEEeecCCC-cEEEEEeCCccceeccCeEEE-EEeCCcccEEEEecCCeEecCCcEEEEEecc
Q psy21 76 MEITEVDPDG-KFVKLYNKGSEEQSLGNWQLV-RKVQGEVKTSFKFHRTLKIEAGGTVTVWSFN 137 (288)
Q Consensus 76 v~I~e~d~~G-kfV~l~N~s~~~~~L~gw~l~-r~v~~~~~~~y~Fp~~~~L~~g~~vtIwa~~ 137 (288)
..+.++|-.- .-++|.|+...++.+.= ++= ....|-....--=|..++|+|+.+|+|....
T Consensus 47 ~~l~~sd~~~~~~s~l~N~~q~pv~v~Y-rfYWYD~~Gle~~~~es~~si~l~~~e~vsi~~~~ 109 (123)
T COG5633 47 PVLSESDGQPSASSVLKNKRQEPVTVHY-RFYWYDAQGLEQNPLESPRSITLPGHEAVSIYLSK 109 (123)
T ss_pred CeeeeeccccceeEEEeccccCceEEEE-EEEEEcCCCceeccccCCcceEecCCceEEEEccc
Confidence 4455666555 68899999988875431 111 0111212223334667999999999998743
No 387
>PRK14148 heat shock protein GrpE; Provisional
Probab=35.27 E-value=2.8e+02 Score=24.53 Aligned_cols=25 Identities=4% Similarity=0.202 Sum_probs=10.7
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
+..++.++.+|+....++.+..+.+
T Consensus 49 l~~l~~e~~elkd~~lR~~Ae~eN~ 73 (195)
T PRK14148 49 IKELEDSCDQFKDEALRAKAEMENI 73 (195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444444333
No 388
>PF06280 DUF1034: Fn3-like domain (DUF1034); InterPro: IPR010435 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain of unknown function is present in bacterial and plant peptidases belonging to MEROPS peptidase family S8 (subfamily S8A subtilisin, clan SB). It is C-terminal to and adjacent to the S8 peptidase domain and can be found in conjunction with the PA (Protease associated) domain (IPR003137 from INTERPRO) and additionally in Gram-positive bacteria with the surface protein anchor domain (IPR001899 from INTERPRO).; GO: 0004252 serine-type endopeptidase activity, 0005618 cell wall, 0016020 membrane; PDB: 3EIF_A 1XF1_B.
Probab=35.20 E-value=91 Score=24.37 Aligned_cols=15 Identities=33% Similarity=0.353 Sum_probs=9.5
Q ss_pred cCCeEecCCcEEEEE
Q psy21 120 HRTLKIEAGGTVTVW 134 (288)
Q Consensus 120 p~~~~L~~g~~vtIw 134 (288)
|..++|+||++.+|-
T Consensus 61 ~~~vTV~ag~s~~v~ 75 (112)
T PF06280_consen 61 PDTVTVPAGQSKTVT 75 (112)
T ss_dssp -EEEEE-TTEEEEEE
T ss_pred CCeEEECCCCEEEEE
Confidence 356788888777764
No 389
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=35.12 E-value=1.3e+02 Score=21.16 Aligned_cols=51 Identities=22% Similarity=0.260 Sum_probs=29.5
Q ss_pred ecCCCcEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEEe
Q psy21 81 VDPDGKFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVWS 135 (288)
Q Consensus 81 ~d~~GkfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIwa 135 (288)
|-.||+|+--....-.+.+-|-+.|.-..+|....+ ..+.++||+.++|-.
T Consensus 15 V~vdg~~~G~tp~~~~~l~~G~~~v~v~~~Gy~~~~----~~v~v~~~~~~~v~~ 65 (71)
T PF08308_consen 15 VYVDGKYIGTTPLTLKDLPPGEHTVTVEKPGYEPYT----KTVTVKPGETTTVNV 65 (71)
T ss_pred EEECCEEeccCcceeeecCCccEEEEEEECCCeeEE----EEEEECCCCEEEEEE
Confidence 444666665333223446677777777776633322 245777888877743
No 390
>PRK10698 phage shock protein PspA; Provisional
Probab=34.62 E-value=2.6e+02 Score=25.07 Aligned_cols=10 Identities=20% Similarity=0.537 Sum_probs=4.0
Q ss_pred HHHHHHHHHH
Q psy21 239 MEAEIKRLQT 248 (288)
Q Consensus 239 le~el~~lr~ 248 (288)
+|.-+.++.+
T Consensus 171 mE~ki~~~Ea 180 (222)
T PRK10698 171 FERRIDQMEA 180 (222)
T ss_pred HHHHHHHHHH
Confidence 3444443333
No 391
>PLN02678 seryl-tRNA synthetase
Probab=34.47 E-value=2.7e+02 Score=27.84 Aligned_cols=62 Identities=15% Similarity=0.180 Sum_probs=32.3
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKE----------LEDQIEYEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~----------le~~l~~~q~~i~~le~el~~lr~~~~~ 252 (288)
.+++-.+-.+...+..+++.|++....+..++.. +..++..+...|..++.++..+..++..
T Consensus 32 id~il~ld~~~r~l~~~~e~lr~erN~~sk~I~~~k~~~~~~~~l~~~~~~Lk~ei~~le~~~~~~~~~l~~ 103 (448)
T PLN02678 32 VDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAALDA 103 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444554444445555666666665555555532 2223344455555566665555555543
No 392
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=34.43 E-value=1.6e+02 Score=28.47 Aligned_cols=23 Identities=26% Similarity=0.459 Sum_probs=9.3
Q ss_pred chhHHHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELE 226 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le 226 (288)
+...|+++...+..|+..+.+++
T Consensus 142 l~~Ri~e~Eeris~lEd~~~~i~ 164 (370)
T PF02994_consen 142 LNSRIDELEERISELEDRIEEIE 164 (370)
T ss_dssp --HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhHHHHHHHHHHHHh
Confidence 33444444444444444444333
No 393
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=34.37 E-value=2.2e+02 Score=22.08 Aligned_cols=29 Identities=21% Similarity=0.463 Sum_probs=12.1
Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy21 235 KFARMEAEIKRLQTEMTEQLKDYRELMEIK 264 (288)
Q Consensus 235 ~i~~le~el~~lr~~~~~ql~eyq~LL~vK 264 (288)
+|..++.++..+.. +...|.+|-.-|-.|
T Consensus 67 ~Id~Ie~~V~~LE~-~v~~LD~ysk~LE~k 95 (99)
T PF10046_consen 67 QIDQIEEQVTELEQ-TVYELDEYSKELESK 95 (99)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHH
Confidence 33334444443322 333445555444443
No 394
>TIGR00414 serS seryl-tRNA synthetase. This model represents the seryl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam model tRNA-synt_2b. The seryl-tRNA synthetases of two archaeal species, Methanococcus jannaschii and Methanobacterium thermoautotrophicum, differ considerably and are included in a different model.
Probab=34.33 E-value=2.3e+02 Score=27.87 Aligned_cols=25 Identities=16% Similarity=0.429 Sum_probs=12.8
Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 229 IEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 229 l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
+..+...|..++.++..+..++...
T Consensus 78 ~~~l~~~~~~~~~~~~~~~~~~~~~ 102 (418)
T TIGR00414 78 LKELKEELTELSAALKALEAELQDK 102 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555554443
No 395
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=34.27 E-value=6e+02 Score=27.08 Aligned_cols=23 Identities=26% Similarity=0.349 Sum_probs=11.0
Q ss_pred hCCchhHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
++.++.++..|+..-..|.++++
T Consensus 581 L~~l~e~~~~l~~~ae~LaeR~e 603 (717)
T PF10168_consen 581 LQELQEERKSLRESAEKLAERYE 603 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555544444444443
No 396
>PRK06798 fliD flagellar capping protein; Validated
Probab=34.17 E-value=3.6e+02 Score=26.84 Aligned_cols=50 Identities=10% Similarity=0.226 Sum_probs=36.5
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHhhHHHHHHHHHHHHHH
Q psy21 203 KYDTEMNELRAANYRLTERNKELEDQIE----YEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~~le~~l~----~~q~~i~~le~el~~lr~~~~~ 252 (288)
.+..++..|..++.+|+.++..++.+++ .+..+...||..+++|..++..
T Consensus 376 ~i~~r~~~l~~~i~~l~~~~~~~e~rl~~~e~~l~~qf~ale~~ms~lnsQ~s~ 429 (440)
T PRK06798 376 IIGERSKSIDNRVSKLDLKITDIDTQNKQKQDNIVDKYQKLESTLAALDSQLKT 429 (440)
T ss_pred eeehhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4577888888888888888888887764 3455667777777777665543
No 397
>PLN02939 transferase, transferring glycosyl groups
Probab=34.16 E-value=2.7e+02 Score=30.78 Aligned_cols=53 Identities=23% Similarity=0.323 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHHHHHHHH------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 209 NELRAANYRLTERNKELED------QIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~------~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
++|+.+++.|++.+.+... .++-+|.++..+++.++....+|..+++-||+..
T Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 385 (977)
T PLN02939 327 QDLRDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQESI 385 (977)
T ss_pred hHHHHHHHHHHHHHHHhhHhhhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 4555566666665554221 2455788899999999999999999988888743
No 398
>PF09766 FimP: Fms-interacting protein; InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress []. This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes [].
Probab=34.11 E-value=3e+02 Score=26.50 Aligned_cols=45 Identities=22% Similarity=0.328 Sum_probs=31.3
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhc
Q psy21 232 EQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEA 280 (288)
Q Consensus 232 ~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEg 280 (288)
.+-.+.+|+-||.+.+ . ...++++|..-|-.|..||..=++-|..
T Consensus 89 H~lml~RL~~EL~~Rk-~---L~~~~~el~~~k~~l~~~~~~k~~~L~~ 133 (355)
T PF09766_consen 89 HQLMLARLEFELEQRK-R---LEEQLKELEQRKKKLQQENKKKKKFLDS 133 (355)
T ss_pred HHHHHHHHHHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666766666532 2 2357788999999999999887776654
No 399
>PRK12705 hypothetical protein; Provisional
Probab=34.07 E-value=5.3e+02 Score=26.35 Aligned_cols=24 Identities=29% Similarity=0.246 Sum_probs=11.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
.++.+.+..++..+.++...+.++
T Consensus 70 ~e~~~~~~~~~~~e~rl~~~e~~l 93 (508)
T PRK12705 70 QEARREREELQREEERLVQKEEQL 93 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555555444433333
No 400
>PF04582 Reo_sigmaC: Reovirus sigma C capsid protein; InterPro: IPR007662 Protein sigmaC in its native state was shown to be a homotrimer. It was demonstrated that the sigmaC subunits are not covalently bound via disulphide linkages and the formation of an intrachain disulphide bond between the two cysteine residues of the sigmaC polypeptide may have a negative effect on oligomer stability. The susceptibility of the trimer to pH, temperature, ionic strength, chemical denaturants and detergents indicates that hydrophobic interactions contribute much more to oligomer stability than do ionic interactions and hydrogen bonding [].; PDB: 2VRS_C 2JJL_A 2BSF_A 2BT7_A 2BT8_A.
Probab=33.88 E-value=14 Score=35.41 Aligned_cols=54 Identities=17% Similarity=0.244 Sum_probs=0.0
Q ss_pred HHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy21 193 ELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRL 246 (288)
Q Consensus 193 El~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~l 246 (288)
.+..+...+.+++.+|.+|..++..|+..+.++...+.++...+...+..|..+
T Consensus 64 ~L~~l~~sl~~~~s~L~sLsstV~~lq~Sl~~lsssVs~lS~~ls~h~ssIS~L 117 (326)
T PF04582_consen 64 DLQDLASSLADMTSELNSLSSTVTSLQSSLSSLSSSVSSLSSTLSDHSSSISDL 117 (326)
T ss_dssp ------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHhhhhhhhhhhhhHHHH
Confidence 334444444445555555555555555555555554444433333333333333
No 401
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=33.85 E-value=2.2e+02 Score=28.13 Aligned_cols=19 Identities=32% Similarity=0.365 Sum_probs=11.3
Q ss_pred EEecCCeEecCCcEEEEEe
Q psy21 117 FKFHRTLKIEAGGTVTVWS 135 (288)
Q Consensus 117 y~Fp~~~~L~~g~~vtIwa 135 (288)
.-+-.+..|.+|..|+|.+
T Consensus 221 ~G~V~~a~I~a~g~I~i~~ 239 (451)
T PF03961_consen 221 KGNVEGATIEAGGNITIHG 239 (451)
T ss_pred EEEEeeEEEEECCcEEEeC
Confidence 3333456666777777763
No 402
>KOG3809|consensus
Probab=33.65 E-value=2.8e+02 Score=27.96 Aligned_cols=15 Identities=13% Similarity=0.441 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHhh
Q psy21 248 TEMTEQLKDYRELME 262 (288)
Q Consensus 248 ~~~~~ql~eyq~LL~ 262 (288)
+++++.+.+|+++++
T Consensus 549 a~lq~~I~d~~e~i~ 563 (583)
T KOG3809|consen 549 ANLQKEINDTKEEIS 563 (583)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444556666554
No 403
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=33.61 E-value=1e+02 Score=24.41 Aligned_cols=23 Identities=26% Similarity=0.406 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l 229 (288)
++.+++..++.|+.+...|+.++
T Consensus 35 q~~~~~~e~~~l~~~n~~L~~eI 57 (105)
T PRK00888 35 QVAAQQQTNAKLKARNDQLFAEI 57 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444433333
No 404
>KOG4673|consensus
Probab=33.44 E-value=3.6e+02 Score=28.83 Aligned_cols=66 Identities=17% Similarity=0.211 Sum_probs=39.8
Q ss_pred HHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy21 190 LLLELKETKGRSGKYDTEMNELRAANYRLTERNKE--------------LEDQIEYEQSKFARMEAEIKRLQTEMTEQLK 255 (288)
Q Consensus 190 ~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~--------------le~~l~~~q~~i~~le~el~~lr~~~~~ql~ 255 (288)
..+|+-.++-.+-+...++.=|++...+|..++.. ++.+++.++..+.+++.++++.+....++++
T Consensus 688 ekqElL~~~~~l~s~~~q~sllraE~~~l~~~le~e~nr~~~~~~e~~~~qeE~~~l~~r~~~le~e~r~~k~~~~q~lq 767 (961)
T KOG4673|consen 688 EKQELLSLNFSLPSSPIQLSLLRAEQGQLSKSLEKERNRAAENRQEYLAAQEEADTLEGRANQLEVEIRELKRKHKQELQ 767 (961)
T ss_pred hHHHHHHHhcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34445455545555566667777777777665542 1223445566677777777777777666554
No 405
>cd07636 BAR_GRAF The Bin/Amphiphysin/Rvs (BAR) domain of GTPase Regulator Associated with Focal adhesion kinase. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. GTPase Regulator Associated with Focal adhesion kinase (GRAF), also called Rho GTPase activating protein 26 (ARHGAP26), is a GAP with activity towards RhoA and Cdc42 and is only weakly active towards Rac1. It influences Rho-mediated cytoskeletal rearrangements and binds focal adhesion kinase (FAK), which is a critical component of integrin signaling. GRAF contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, a Rho GAP domain, and a C-terminal SH3 domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The BAR domain of GRAF directly interacts with its Rho GAP domain and inhibits its activity. Autoinhibited GRAF is capable o
Probab=33.44 E-value=3e+02 Score=24.68 Aligned_cols=25 Identities=16% Similarity=0.158 Sum_probs=13.8
Q ss_pred hCCchhHHHHHHHHHHHHHHHHHHH
Q psy21 201 SGKYDTEMNELRAANYRLTERNKEL 225 (288)
Q Consensus 201 ~~~l~~ei~~L~~~~~~Le~~l~~l 225 (288)
+.+++.+++.++..+..|-..+..+
T Consensus 4 l~~~E~~le~~~k~ik~liK~~k~~ 28 (207)
T cd07636 4 LKSHEAELDKTNKFIKELIKDGKSL 28 (207)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556666666666655554433
No 406
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=33.37 E-value=1.6e+02 Score=28.64 Aligned_cols=25 Identities=40% Similarity=0.412 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQS 234 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~ 234 (288)
+|..+++.++..+.+++..+.....
T Consensus 141 ~l~~Ri~e~Eeris~lEd~~~~i~~ 165 (370)
T PF02994_consen 141 SLNSRIDELEERISELEDRIEEIEQ 165 (370)
T ss_dssp ---HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHhh
Confidence 3444445555555555544443333
No 407
>PF09177 Syntaxin-6_N: Syntaxin 6, N-terminal; InterPro: IPR015260 Members of this entry, which are found in the amino terminus of various SNARE proteins, adopt a structure consisting of an antiparallel three-helix bundle. Their exact function has not been determined, though it is known that they regulate the SNARE motif, as well as mediate various protein-protein interactions involved in membrane-transport []. ; GO: 0048193 Golgi vesicle transport, 0016020 membrane; PDB: 1LVF_B 2C5I_T 2C5J_A 2C5K_T 4DND_A.
Probab=33.36 E-value=1.3e+02 Score=23.18 Aligned_cols=21 Identities=29% Similarity=0.351 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy21 209 NELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~~l 229 (288)
.+|+..+..++.+|.+|+..+
T Consensus 42 ~eL~~~l~~ie~~L~DL~~aV 62 (97)
T PF09177_consen 42 RELRNALQSIEWDLEDLEEAV 62 (97)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555555555543
No 408
>PF12126 DUF3583: Protein of unknown function (DUF3583); InterPro: IPR021978 This domain is found in eukaryotes, and is typically between 302 and 338 amino acids in length. It is found in association with PF00097 from PFAM and PF00643 from PFAM. Most members are promyelocytic leukemia proteins, and this family lies towards the C terminus.
Probab=33.29 E-value=4.3e+02 Score=25.13 Aligned_cols=35 Identities=26% Similarity=0.530 Sum_probs=22.2
Q ss_pred HHHHHHhhHHHHHHHHHH--HHHHHHHHH---HHHhhhhh
Q psy21 231 YEQSKFARMEAEIKRLQT--EMTEQLKDY---RELMEIKI 265 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~--~~~~ql~ey---q~LL~vKl 265 (288)
++..++.+|++-|++++. .+...++-| |+.|++.-
T Consensus 72 ema~~L~~LeavLqRir~G~~LVekM~~YASDQEVLdMh~ 111 (324)
T PF12126_consen 72 EMAGQLGRLEAVLQRIRTGGALVEKMKLYASDQEVLDMHG 111 (324)
T ss_pred HHHHHHhHHHHHHHHHHhHHHHHHHHHHhcchHHHHHHHH
Confidence 445567777888888776 345555555 66666543
No 409
>PF05700 BCAS2: Breast carcinoma amplified sequence 2 (BCAS2); InterPro: IPR008409 This family consists of several eukaryotic sequences of unknown function. The mammalian members of this family are annotated as breast carcinoma amplified sequence 2 (BCAS2) proteins []. BCAS2 is a putative spliceosome associated protein [].
Probab=33.28 E-value=3.5e+02 Score=24.08 Aligned_cols=11 Identities=9% Similarity=0.069 Sum_probs=5.3
Q ss_pred HHHhhhhhhhh
Q psy21 11 EVQKKRKTELL 21 (288)
Q Consensus 11 ~yrklle~ee~ 21 (288)
+-..|++.|=.
T Consensus 27 ~a~~lI~eE~~ 37 (221)
T PF05700_consen 27 AAEALIEEEMR 37 (221)
T ss_pred HHHHHHHHHHh
Confidence 34455654433
No 410
>PRK06975 bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
Probab=33.23 E-value=2.7e+02 Score=29.17 Aligned_cols=74 Identities=12% Similarity=0.190 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHh---hhhhhhhhhHHHhhhhhhccccCC
Q psy21 212 RAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTE-MTEQLKDYRELM---EIKINLDVEIAQYRNLMEAEESSL 285 (288)
Q Consensus 212 ~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~-~~~ql~eyq~LL---~vKl~Le~EIatYrkLLEgEe~Rl 285 (288)
+.....+..++..+|.++...+..+..|+..++.+... -...+.|=.-|+ +-|+.|+-.+.+=..||+.-+.||
T Consensus 377 ~~~~~~~~~~l~~le~~l~~~~~~~~~L~~~~~~l~~~r~dW~laEae~Ll~lA~q~L~l~~dv~~A~~~L~~AD~~L 454 (656)
T PRK06975 377 QASVHQLDSQFAQLDGKLADAQSAQQALEQQYQDLSRNRDDWMIAEVEQMLSSASQQLQLTGNVQLALIALQNADARL 454 (656)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhhHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 33334444445555555555555555555555433221 122233333333 457777777777777776544443
No 411
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=33.20 E-value=1.9e+02 Score=23.57 Aligned_cols=42 Identities=24% Similarity=0.377 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~ 249 (288)
+.-|.+++..|+..+..++..+...+..+..+...+.++..+
T Consensus 96 ~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~~l~~~ 137 (140)
T PRK03947 96 IEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQE 137 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 412
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=33.14 E-value=2.9e+02 Score=23.08 Aligned_cols=33 Identities=15% Similarity=0.040 Sum_probs=16.9
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHH
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYR 217 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~ 217 (288)
..+..+...++.....+..+..+|..|....+.
T Consensus 26 ~~~~~a~~~~~~~~~~l~~~~~qL~~l~~~a~~ 58 (135)
T TIGR03495 26 ADLERANRVLKAQQAELASKANQLIVLLALAKR 58 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 444444455555555555555555555544433
No 413
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=32.69 E-value=1.8e+02 Score=24.19 Aligned_cols=44 Identities=18% Similarity=0.278 Sum_probs=27.7
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQ 247 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr 247 (288)
+..++++++..+...+..+..|+.++..-...|..|..+|..+.
T Consensus 78 l~a~~~e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L~~~~ 121 (131)
T PF04859_consen 78 LAAEIQEQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKLDELN 121 (131)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45677777777777777777776666655555555555554443
No 414
>KOG3433|consensus
Probab=32.55 E-value=3.6e+02 Score=23.94 Aligned_cols=66 Identities=12% Similarity=0.113 Sum_probs=27.9
Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q psy21 203 KYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLD 268 (288)
Q Consensus 203 ~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le 268 (288)
.+..+++.|+..+..+..++..++..-.+.-..+..++......-..+..+..-.++-+--|+.++
T Consensus 120 el~kklnslkk~~e~lr~el~k~~e~dpqv~~k~~~~~K~~~eaanrwtDnI~il~dy~~rkf~~e 185 (203)
T KOG3433|consen 120 ELTKKLNSLKKILESLRWELAKIQETDPQVFEKKVHLEKTMAEAANRWTDNIFILIDYLYRKFGLE 185 (203)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhcCCC
Confidence 344555555554444544444333211111122233333333333344555544555555555554
No 415
>PF13874 Nup54: Nucleoporin complex subunit 54; PDB: 3T97_B.
Probab=32.43 E-value=2.4e+02 Score=23.29 Aligned_cols=21 Identities=19% Similarity=0.240 Sum_probs=11.1
Q ss_pred chhHHHHHHHHHHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNKE 224 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~ 224 (288)
...+|.+++++...|...+-.
T Consensus 70 ~~~rl~~~r~r~~~L~hR~l~ 90 (141)
T PF13874_consen 70 TSARLEEARRRHQELSHRLLR 90 (141)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555555554433
No 416
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=32.42 E-value=1.2e+02 Score=24.52 Aligned_cols=18 Identities=17% Similarity=0.172 Sum_probs=12.1
Q ss_pred EecCCeEecCCcEEEEEe
Q psy21 118 KFHRTLKIEAGGTVTVWS 135 (288)
Q Consensus 118 ~Fp~~~~L~~g~~vtIwa 135 (288)
++|..+.|+||++.+|=.
T Consensus 82 ~~~~~Vtl~~~~sk~V~~ 99 (121)
T PF06030_consen 82 KIPKEVTLPPNESKTVTF 99 (121)
T ss_pred cCCcEEEECCCCEEEEEE
Confidence 445557888887776644
No 417
>KOG0979|consensus
Probab=32.41 E-value=4e+02 Score=29.57 Aligned_cols=139 Identities=18% Similarity=0.169 Sum_probs=0.0
Q ss_pred ecCCcEEEEEecccCCCCCCCCCceeeecccccCccchhhheeccchh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCC
Q psy21 125 IEAGGTVTVWSFNVEGATHEPPHNIVMKNQKFGSGDEISTILLNTDGE-LANQERIIDNLRKEQSSLLLELKETKGRSGK 203 (288)
Q Consensus 125 L~~g~~vtIwa~~~~~~~~~pp~dlv~~~~~wg~g~~i~t~L~d~~ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~ 203 (288)
+..|..+-|..... |..+--+.+-.-+..++..+..+.-.+.+--.+ +-.+ ...+.....++.+...+.+.
T Consensus 590 i~g~~~~~i~~S~y-gs~~v~~~~~~lk~~~f~~~~~~l~~~~~~~ee~~~~~-------~~~~~~~~~~~r~lee~~~k 661 (1072)
T KOG0979|consen 590 IAGGRSKSIKKSAY-GSRQVITRNDPLKSRNFFSVSPVLEELDNRIEEEIQKL-------KAEIDIRSSTLRELEEKKQK 661 (1072)
T ss_pred hhcCchhhhhhhcc-ccceeeecCCcchhhhhhccchHHHHHHHHHHHHHHHH-------HHHHhhhhHHHHHHHHHHHH
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYR 275 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYr 275 (288)
...++......++.+......++..+..++.. +..+..+..+|.+|-.+|.....-|.-=.-++-.|.
T Consensus 662 ~~k~le~~~~~~~~~~~er~~~~~~~~~~~~r----~~~ie~~~~~l~~qkee~~~~~~~~I~~~~~~~~~~ 729 (1072)
T KOG0979|consen 662 ERKELEEEQKKLKLLKRERTKLNSELKSYQQR----KERIENLVVDLDRQEEEYAASEAKKILDTEDMRIQS 729 (1072)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
No 418
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=32.31 E-value=1.5e+02 Score=19.48 Aligned_cols=40 Identities=13% Similarity=0.134 Sum_probs=25.5
Q ss_pred EEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 89 KLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 89 ~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
.|.|+|+.+..+ .+..-.=.=.+..++. -.|+||.+.+|-
T Consensus 3 ~~~N~g~~~L~I-----~~v~tsCgCt~~~~~~-~~i~PGes~~i~ 42 (45)
T PF07610_consen 3 EFTNTGDSPLVI-----TDVQTSCGCTTAEYSK-KPIAPGESGKIK 42 (45)
T ss_pred EEEECCCCcEEE-----EEeeEccCCEEeeCCc-ceECCCCEEEEE
Confidence 578999988744 3322222235566655 679999887763
No 419
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=32.20 E-value=2e+02 Score=20.98 Aligned_cols=61 Identities=13% Similarity=0.133 Sum_probs=0.0
Q ss_pred hhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 200 RSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 200 ~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
.++.+..+|+.|-..-+.|..++.-|..++..++..-..|-+.....+..++..+.-...|
T Consensus 1 ~L~~Le~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekne~Ar~rvEamI~RLk~l 61 (65)
T TIGR02449 1 ELQALAAQVEHLLEYLERLKSENRLLRAQEKTWREERAQLLEKNEQARQKVEAMITRLKAL 61 (65)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
No 420
>PF06657 Cep57_MT_bd: Centrosome microtubule-binding domain of Cep57; InterPro: IPR010597 This entry is thought to represent a centrosomal protein of 57 kDa (Cep57-related protein). It is required for spindle microtubule attachment to both kinetochores and centrosomes and functions to tether minus-ends of spindle microtubules to centrosomes. It may act by forming ring-like structures around microtubules, or by serving as a cross-linker or scaffold at the attachment site [].
Probab=32.07 E-value=2.2e+02 Score=21.37 Aligned_cols=14 Identities=21% Similarity=0.515 Sum_probs=5.1
Q ss_pred HHHHHHHHHHHHhh
Q psy21 249 EMTEQLKDYRELME 262 (288)
Q Consensus 249 ~~~~ql~eyq~LL~ 262 (288)
++++.+.+....|.
T Consensus 54 ~L~~~l~~lv~~mE 67 (79)
T PF06657_consen 54 DLEQELEELVKRME 67 (79)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333333333333
No 421
>COG2959 HemX Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=32.06 E-value=3.7e+02 Score=26.41 Aligned_cols=77 Identities=17% Similarity=0.237 Sum_probs=44.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH--HHHHHHHHHHHHh---hhhhhhhhhHHHhhhhhhcccc
Q psy21 209 NELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT--EMTEQLKDYRELM---EIKINLDVEIAQYRNLMEAEES 283 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~--~~~~ql~eyq~LL---~vKl~Le~EIatYrkLLEgEe~ 283 (288)
..+...+...+.++..++.+++..+..|..+......++. --...+.|-.-|| ..|+.|+..|.|=-.||++-..
T Consensus 86 ~ql~~~~~~~q~el~~l~~~~~~~~~ql~e~Q~~v~~is~~~~~dWllaEad~Ll~lA~rkL~l~~DV~TAv~lLk~aD~ 165 (391)
T COG2959 86 AQLESLIAQQQAELDRLERQLETLQKQLSELQKKVATISGSDRKDWLLAEADFLLKLAGRKLVLDQDVTTAVALLKSADA 165 (391)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhccCChhhHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHH
Confidence 4444444555555555555555555555555555444441 1222334444444 4588999999998888887665
Q ss_pred CC
Q psy21 284 SL 285 (288)
Q Consensus 284 Rl 285 (288)
||
T Consensus 166 ~L 167 (391)
T COG2959 166 RL 167 (391)
T ss_pred HH
Confidence 54
No 422
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=32.06 E-value=4.5e+02 Score=26.76 Aligned_cols=17 Identities=12% Similarity=0.112 Sum_probs=7.7
Q ss_pred eEEEEEeCCcccEEEEe
Q psy21 103 WQLVRKVQGEVKTSFKF 119 (288)
Q Consensus 103 w~l~r~v~~~~~~~y~F 119 (288)
+.|.|.+.......|.+
T Consensus 95 ~ii~R~i~~~grs~~~i 111 (563)
T TIGR00634 95 VILRRSISRDGRSRAYL 111 (563)
T ss_pred EEEEEEEcCCCceEEEE
Confidence 45566654322334433
No 423
>PRK14011 prefoldin subunit alpha; Provisional
Probab=32.03 E-value=3.1e+02 Score=23.06 Aligned_cols=38 Identities=13% Similarity=0.201 Sum_probs=20.6
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNK 223 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~ 223 (288)
.++....+++.+..+++.+..+|+.|+.....+...+.
T Consensus 4 elq~~~~~l~~~~~qie~L~~si~~L~~a~~e~~~~ie 41 (144)
T PRK14011 4 ELQNQFMALEVYNQQVQKLQEELSSIDMMKMELLKSIE 41 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555555666665566666665555555544443
No 424
>COG3096 MukB Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning]
Probab=31.95 E-value=5.7e+02 Score=27.87 Aligned_cols=30 Identities=27% Similarity=0.439 Sum_probs=17.7
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 233 QSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 233 q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
...+.-.|.+...+...|.+.-++|+.+-.
T Consensus 1079 EkqlT~~E~E~~~L~~~~rK~ErDY~~~Re 1108 (1480)
T COG3096 1079 EKQLTFCEAEMDNLTRKLRKLERDYFEMRE 1108 (1480)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHH
Confidence 334444566666666666666677776543
No 425
>PF05546 She9_MDM33: She9 / Mdm33 family; InterPro: IPR008839 Members of this family are mitochondrial inner membrane proteins with a role in inner mitochondrial membrane organisation and biogenesis []. The yeast Mdm33 protein assembles into an oligomeric complex in the inner membrane where it performs homotypic protein-protein interactions. It has been suggested that Mdm33 plays a distinct role, possibly involved in fission of the mitochondrial inner membrane [].
Probab=31.94 E-value=3.8e+02 Score=24.08 Aligned_cols=24 Identities=17% Similarity=0.268 Sum_probs=10.1
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAA 214 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~ 214 (288)
+.++..++..+..++.++.+.+..
T Consensus 31 Ys~Ie~LK~~i~~~E~~l~~~r~~ 54 (207)
T PF05546_consen 31 YSEIEKLKKSIEELEDELEAARQE 54 (207)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444333
No 426
>PRK10476 multidrug resistance protein MdtN; Provisional
Probab=31.79 E-value=3.9e+02 Score=25.03 Aligned_cols=55 Identities=7% Similarity=0.060 Sum_probs=24.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 206 TEMNELRAANYRLTERNKELEDQIEYE----------QSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 206 ~ei~~L~~~~~~Le~~l~~le~~l~~~----------q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
.++..++.++...+.++...+..++.+ +..+...+.++...+.++.....++..+
T Consensus 114 ~~~~~~~~~i~~a~~~l~~a~~~~~R~~~L~~~g~vS~~~~~~a~~~~~~a~~~l~~a~~~~~~~ 178 (346)
T PRK10476 114 SNAASANEQVERARANAKLATRTLERLEPLLAKGYVSAQQVDQARTAQRDAEVSLNQALLQAQAA 178 (346)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555444333333222 2223333444555555554444444443
No 427
>KOG0249|consensus
Probab=31.75 E-value=2.9e+02 Score=29.59 Aligned_cols=18 Identities=28% Similarity=0.416 Sum_probs=10.3
Q ss_pred hhhHHHhHHHHHHHHHHh
Q psy21 183 LRKEQSSLLLELKETKGR 200 (288)
Q Consensus 183 ~~~~~~~~~eEl~~~~~~ 200 (288)
....++.+..||.+++.+
T Consensus 168 l~~~~qe~naeL~rarqr 185 (916)
T KOG0249|consen 168 LEEQLEELNAELQRARQR 185 (916)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 345555566666666554
No 428
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=31.47 E-value=89 Score=22.40 Aligned_cols=22 Identities=32% Similarity=0.480 Sum_probs=11.4
Q ss_pred HHHHHHhhHHHHHHHHHHHHHH
Q psy21 231 YEQSKFARMEAEIKRLQTEMTE 252 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~~~ 252 (288)
+++..|..|++|+.+++..+..
T Consensus 25 EL~~RIa~L~aEI~R~~~~~~~ 46 (59)
T PF06698_consen 25 ELEERIALLEAEIARLEAAIAK 46 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555543
No 429
>KOG3850|consensus
Probab=31.46 E-value=5.1e+02 Score=25.62 Aligned_cols=21 Identities=19% Similarity=0.085 Sum_probs=16.7
Q ss_pred hhhHHHHhhhhhhhhcccccC
Q psy21 7 TRIEEVQKKRKTELLCMNVSG 27 (288)
Q Consensus 7 ~ei~~yrklle~ee~r~~~~~ 27 (288)
.+|+-.+|.|+.=+.||+.--
T Consensus 98 htiaqlqkkL~~y~~rLkeie 118 (455)
T KOG3850|consen 98 HTIAQLQKKLEQYHRRLKEIE 118 (455)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 478888999998888887543
No 430
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=31.36 E-value=92 Score=24.10 Aligned_cols=51 Identities=16% Similarity=0.192 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
++..|..+++.|..+..-.+.++.+++.+=..-|.-..--+..+..+|+..
T Consensus 31 ~~~kL~~en~qlk~Ek~~~~~qvkn~~vrqknee~~~~~sr~~V~d~L~q~ 81 (87)
T PF10883_consen 31 QNAKLQKENEQLKTEKAVAETQVKNAKVRQKNEENTRRLSRDSVIDQLQQH 81 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhccCCHHHHHHHHHHc
Confidence 355666666666666655555555443222221222222344445555443
No 431
>PRK11281 hypothetical protein; Provisional
Probab=31.30 E-value=4.2e+02 Score=29.79 Aligned_cols=15 Identities=20% Similarity=0.286 Sum_probs=7.3
Q ss_pred hhhhHHHhhhhhhcc
Q psy21 267 LDVEIAQYRNLMEAE 281 (288)
Q Consensus 267 Le~EIatYrkLLEgE 281 (288)
|+.++...+..|.|-
T Consensus 204 l~~~~~~~~~~l~~~ 218 (1113)
T PRK11281 204 LNAQNDLQRKSLEGN 218 (1113)
T ss_pred HHHHHHHHHHHHhcc
Confidence 444455555555543
No 432
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=31.25 E-value=49 Score=33.49 Aligned_cols=23 Identities=17% Similarity=0.131 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy21 214 ANYRLTERNKELEDQIEYEQSKF 236 (288)
Q Consensus 214 ~~~~Le~~l~~le~~l~~~q~~i 236 (288)
+|+.|+.+|++|+.++..++.++
T Consensus 32 kie~L~kql~~Lk~q~~~l~~~v 54 (489)
T PF11853_consen 32 KIEALKKQLEELKAQQDDLNDRV 54 (489)
T ss_pred HHHHHHHHHHHHHHhhccccccc
Confidence 44444444444444444333333
No 433
>PF11518 DUF3221: Protein of unknown function (DUF3221); InterPro: IPR021598 This family of proteins with unknown function appears to be restricted to Bacillus. Some members in this family of proteins are annotated as YobA however this cannot be confirmed. YobA is a protein with unknown function. ; PDB: 2JQO_A.
Probab=31.25 E-value=27 Score=27.86 Aligned_cols=23 Identities=30% Similarity=0.673 Sum_probs=13.7
Q ss_pred EecCCcEEEEEecccCCCCCCCCCc
Q psy21 124 KIEAGGTVTVWSFNVEGATHEPPHN 148 (288)
Q Consensus 124 ~L~~g~~vtIwa~~~~~~~~~pp~d 148 (288)
-|+-|++|+||+... ....||-.
T Consensus 78 ~lk~G~KVkVw~~~i--leS~P~~a 100 (108)
T PF11518_consen 78 NLKVGQKVKVWSSEI--LESYPPKA 100 (108)
T ss_dssp G--TTEEEEEEEEEE--E-SSS-EE
T ss_pred hcccCCEEEEEeCCc--eecCCCcc
Confidence 468999999999654 33455543
No 434
>PLN02320 seryl-tRNA synthetase
Probab=31.18 E-value=2.8e+02 Score=28.31 Aligned_cols=64 Identities=17% Similarity=0.200 Sum_probs=35.3
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNK---------ELEDQIEYEQSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~---------~le~~l~~~q~~i~~le~el~~lr~~~~~ql 254 (288)
.+++..+-.+...+..+++.|++....+..++. ++..+...+...|..+|.++..+..++...+
T Consensus 92 vd~l~~ld~~~r~~~~~~~~lr~ern~~sk~i~~~~~~~~~~~l~~~~k~lk~~i~~le~~~~~~~~~l~~~~ 164 (502)
T PLN02320 92 LELVLELYENMLALQKEVERLRAERNAVANKMKGKLEPSERQALVEEGKNLKEGLVTLEEDLVKLTDELQLEA 164 (502)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444455566666665555555443 2333445566667777777766666665543
No 435
>KOG4348|consensus
Probab=31.09 E-value=2e+02 Score=28.93 Aligned_cols=16 Identities=25% Similarity=0.559 Sum_probs=9.1
Q ss_pred hhhhhhhhhHHHhhhh
Q psy21 262 EIKINLDVEIAQYRNL 277 (288)
Q Consensus 262 ~vKl~Le~EIatYrkL 277 (288)
.+.+.|+|||....+-
T Consensus 608 ~mr~~lemei~~lkka 623 (627)
T KOG4348|consen 608 TMRSNLEMEIEKLKKA 623 (627)
T ss_pred HHHhhhHhhHHHHHHH
Confidence 4455666666655543
No 436
>PF07439 DUF1515: Protein of unknown function (DUF1515); InterPro: IPR010889 This family consists of several hypothetical bacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Rhizobium species. The function of this family is unknown.
Probab=31.03 E-value=2.7e+02 Score=22.55 Aligned_cols=20 Identities=10% Similarity=0.293 Sum_probs=8.3
Q ss_pred HHHHHHhhHHHHHHHHHHHH
Q psy21 231 YEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 231 ~~q~~i~~le~el~~lr~~~ 250 (288)
.+...+..+|..+..+++++
T Consensus 44 ElV~Rv~~lEs~~~~lk~dV 63 (112)
T PF07439_consen 44 ELVERVTTLESSVSTLKADV 63 (112)
T ss_pred HHHHHHHHHHHHHHHHHhhH
Confidence 33334444444444444444
No 437
>PRK13923 putative spore coat protein regulator protein YlbO; Provisional
Probab=30.99 E-value=3.6e+02 Score=23.48 Aligned_cols=52 Identities=25% Similarity=0.316 Sum_probs=35.8
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhhcccc
Q psy21 227 DQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLMEAEES 283 (288)
Q Consensus 227 ~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgEe~ 283 (288)
.++..++..+..|+.++..++.+..-.-.+|+.|+++- +-|.=..+++.++.
T Consensus 111 ~e~~kl~~~~e~L~~e~~~L~~~~~~~~eDy~~Li~Im-----~rark~~~~~~~e~ 162 (170)
T PRK13923 111 EQIGKLQEEEEKLSWENQTLKQELAITEEDYRALIVIM-----NRARRMAILVEDER 162 (170)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHcchhhhhhh
Confidence 34455667777788888888888888888999999983 43444444555433
No 438
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=30.94 E-value=3.7e+02 Score=23.66 Aligned_cols=11 Identities=27% Similarity=0.335 Sum_probs=5.1
Q ss_pred hhhhhhHHHhh
Q psy21 265 INLDVEIAQYR 275 (288)
Q Consensus 265 l~Le~EIatYr 275 (288)
-.|..||..+.
T Consensus 174 ~~l~~ei~~L~ 184 (194)
T PF15619_consen 174 KSLQEEIQRLN 184 (194)
T ss_pred HHHHHHHHHHH
Confidence 34445554444
No 439
>PF06548 Kinesin-related: Kinesin-related; InterPro: IPR010544 This entry represents a domain within kinesin-related proteins from higher plants. Many proteins containing this domain also contain the IPR001752 from INTERPRO domain. Kinesins are ATP-driven microtubule motor proteins that produce directed force []. Some family members are associated with the phragmoplast, a structure composed mainly of microtubules that executes cytokinesis in higher plants [].
Probab=30.90 E-value=5.7e+02 Score=25.80 Aligned_cols=25 Identities=16% Similarity=0.355 Sum_probs=20.0
Q ss_pred HHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 255 KDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 255 ~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
+-|+.+=+.|-..++||.|-..+|-
T Consensus 459 k~~kqiekLK~kh~~Ei~t~kq~la 483 (488)
T PF06548_consen 459 KAKKQIEKLKRKHKMEISTMKQYLA 483 (488)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3567777788889999999988774
No 440
>KOG3647|consensus
Probab=30.87 E-value=2.8e+02 Score=26.10 Aligned_cols=10 Identities=20% Similarity=0.365 Sum_probs=4.0
Q ss_pred HhHHHHHHHH
Q psy21 188 SSLLLELKET 197 (288)
Q Consensus 188 ~~~~eEl~~~ 197 (288)
..+.+|+...
T Consensus 56 ~~k~~e~r~~ 65 (338)
T KOG3647|consen 56 GDKIEELRKA 65 (338)
T ss_pred HHHHHHHHHH
Confidence 3344444433
No 441
>PRK14127 cell division protein GpsB; Provisional
Probab=30.64 E-value=85 Score=25.26 Aligned_cols=18 Identities=39% Similarity=0.510 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy21 209 NELRAANYRLTERNKELE 226 (288)
Q Consensus 209 ~~L~~~~~~Le~~l~~le 226 (288)
.+|+.++.+|+.++.+++
T Consensus 47 ~~Lk~e~~~l~~~l~e~~ 64 (109)
T PRK14127 47 EELQQENARLKAQVDELT 64 (109)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 442
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=30.50 E-value=5.7e+02 Score=27.41 Aligned_cols=16 Identities=13% Similarity=0.077 Sum_probs=6.6
Q ss_pred HHhHHHHHHHHHHhhC
Q psy21 187 QSSLLLELKETKGRSG 202 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~ 202 (288)
+....+++.+.+.+++
T Consensus 513 ~~~li~~L~~~~~~~e 528 (771)
T TIGR01069 513 INVLIEKLSALEKELE 528 (771)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3334444444444433
No 443
>KOG0994|consensus
Probab=30.48 E-value=2.1e+02 Score=32.31 Aligned_cols=70 Identities=14% Similarity=0.135 Sum_probs=0.0
Q ss_pred HhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 188 SSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 188 ~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
..+++-.+.++.+-..|-.+-++....++.|+....+-+..|.++++.|..|+.++..+...|..+..=|
T Consensus 1685 ~~ar~rAe~L~~eA~~Ll~~a~~kl~~l~dLe~~y~~~~~~L~~~~aeL~~Le~r~~~vl~~I~~rv~~y 1754 (1758)
T KOG0994|consen 1685 QAARERAEQLRTEAEKLLGQANEKLDRLKDLELEYLRNEQALEDKAAELAGLEKRVESVLDHINERVLYY 1754 (1758)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhhHHHHHHHHHHHHhhhhhhh
No 444
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=30.41 E-value=5.3e+02 Score=27.50 Aligned_cols=81 Identities=17% Similarity=0.228 Sum_probs=0.0
Q ss_pred HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHH
Q psy21 173 LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELED------------------QIEYEQS 234 (288)
Q Consensus 173 vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~------------------~l~~~q~ 234 (288)
+..........-.++....++.+.++..-+.|..++.+...+-..|..++..+-. +|+.++.
T Consensus 567 v~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~vl~~l~~~~P~LS~AEr~~~~EL~~~~~ 646 (717)
T PF10168_consen 567 VKLLKQQKEQQLKELQELQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDRVLQLLNSQLPVLSEAEREFKKELERMKD 646 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHHH
Q ss_pred HHhhHHHHHHHHHHHHHHH
Q psy21 235 KFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 235 ~i~~le~el~~lr~~~~~q 253 (288)
++..|...+.+++..+..|
T Consensus 647 ~l~~l~~si~~lk~k~~~Q 665 (717)
T PF10168_consen 647 QLQDLKASIEQLKKKLDYQ 665 (717)
T ss_pred HHHHHHHHHHHHHHHHHHH
No 445
>KOG4680|consensus
Probab=30.40 E-value=2e+02 Score=24.32 Aligned_cols=67 Identities=22% Similarity=0.394 Sum_probs=0.0
Q ss_pred CCCCeEEEeecC------------------------CCcEEEEEeCCccceeccCeEEEEEeCCcccEEEEecCCeEecC
Q psy21 72 AKGNMEITEVDP------------------------DGKFVKLYNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEA 127 (288)
Q Consensus 72 a~G~v~I~e~d~------------------------~GkfV~l~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~ 127 (288)
.++.|+|.+||. +|+|| | ++.++||.|..+.-+-=+.+ .+-++|
T Consensus 34 d~~~VkV~~Vni~PnP~a~Ge~aTf~i~~ntg~tIs~Gk~V-I------eV~y~gi~ihsethDLCdet-----sCPVep 101 (153)
T KOG4680|consen 34 DEYEVKVKEVNISPNPPARGENATFSISGNTGETISEGKYV-I------EVSYGGIRIHSETHDLCDET-----SCPVEP 101 (153)
T ss_pred ccceEEEEEEecCCCCCCCCCccEEEEecccccEeeCCeEE-E------EEEEeeEEEeeccccccccc-----cCCcCc
Q ss_pred CcEEEEEecccCCCCCCCCCceeee
Q psy21 128 GGTVTVWSFNVEGATHEPPHNIVMK 152 (288)
Q Consensus 128 g~~vtIwa~~~~~~~~~pp~dlv~~ 152 (288)
|.-+.=.+.-- .--.||+..+.+
T Consensus 102 G~f~~~hsq~L--Pg~tPPG~Y~lk 124 (153)
T KOG4680|consen 102 GDFLVAHSQVL--PGYTPPGSYVLK 124 (153)
T ss_pred CceeeeeeEec--cCcCCCceEEEE
No 446
>KOG0612|consensus
Probab=30.34 E-value=4.8e+02 Score=29.63 Aligned_cols=85 Identities=22% Similarity=0.250 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHhhCCchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy21 189 SLLLELKETKGRSGKYDTEMNELR-AANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINL 267 (288)
Q Consensus 189 ~~~eEl~~~~~~~~~l~~ei~~L~-~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~L 267 (288)
.+-++.+++...+..++.+.++|+ .+..-++.+..+.+.++....+++..++.++.+++.++....+..+.+.+..-.+
T Consensus 462 ~~~~~~keL~e~i~~lk~~~~el~~~q~~l~q~~~ke~~ek~~~~~~~~~~l~~~~~~~~eele~~q~~~~~~~~~~~kv 541 (1317)
T KOG0612|consen 462 ELEEMDKELEETIEKLKSEESELQREQKALLQHEQKEVEEKLSEEEAKKRKLEALVRQLEEELEDAQKKNDNAADSLEKV 541 (1317)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q ss_pred hhhHHH
Q psy21 268 DVEIAQ 273 (288)
Q Consensus 268 e~EIat 273 (288)
..+-.+
T Consensus 542 ~~~rk~ 547 (1317)
T KOG0612|consen 542 NSLRKQ 547 (1317)
T ss_pred HHHHHH
No 447
>COG5293 Predicted ATPase [General function prediction only]
Probab=30.33 E-value=6e+02 Score=25.87 Aligned_cols=107 Identities=15% Similarity=0.107 Sum_probs=0.0
Q ss_pred hheeccchhHHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy21 164 TILLNTDGELANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEI 243 (288)
Q Consensus 164 t~L~d~~gevA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el 243 (288)
+.+..++.-+.-+..+-..+....+..++++...-..+ ...+-.-|..+|.++++.|...++.+..+..++
T Consensus 295 ~~~~~pd~i~~~ye~vg~~fpg~Vkk~~e~v~~F~r~~---------~e~R~~yl~~ei~~i~~dLk~~n~~~~~l~~~r 365 (591)
T COG5293 295 QILFCPDEIQVLYEEVGVLFPGQVKKDFEHVIAFNRAI---------TEERHDYLQEEIAEIEGDLKEVNAELDDLGKRR 365 (591)
T ss_pred hccCChHHHHHHHHHhhhcChHHHHHhHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHH--HHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 244 KRLQTEMTEQ--LKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 244 ~~lr~~~~~q--l~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
+..-+-+... +.+|+.|+..-.++..|+|.-+.-.|
T Consensus 366 ae~l~~Lk~~g~~e~y~~l~ee~~~~~~elae~~~rie 403 (591)
T COG5293 366 AEGLAFLKNRGVFEKYQTLCEEIIALRGELAELEYRIE 403 (591)
T ss_pred HHHHHHHHhCCcHHHHHHHHHHHHHHhhhHHHHHHhhh
No 448
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=30.14 E-value=4.3e+02 Score=24.08 Aligned_cols=80 Identities=19% Similarity=0.216 Sum_probs=0.0
Q ss_pred HHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHh
Q psy21 195 KETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQY 274 (288)
Q Consensus 195 ~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatY 274 (288)
..+..++..++.+....+.....-+.....|+..+...+..-..|+.....+...+.+....-...-.-|..|+.+|..|
T Consensus 8 ~Ele~rL~q~eee~~~a~~~L~e~e~~a~~Leek~k~aeeea~~Le~k~~eaee~~~rL~~~~~~~~eEk~~Le~e~~e~ 87 (246)
T PF00769_consen 8 QELEERLRQMEEEMRRAQEALEESEETAEELEEKLKQAEEEAEELEQKRQEAEEEKQRLEEEAEMQEEEKEQLEQELREA 87 (246)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 449
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=29.98 E-value=2.1e+02 Score=24.81 Aligned_cols=54 Identities=17% Similarity=0.222 Sum_probs=0.0
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l 239 (288)
.......+.-....+|+....+|..|++.+..|..++.++...|...+..|..+
T Consensus 9 ~~d~~L~~~L~~l~~hq~~~~~I~~L~~e~~~ld~~i~~~~~~L~~~~~~L~~~ 62 (188)
T PF10018_consen 9 EADDELSSALEELQEHQENQARIQQLRAEIEELDEQIRDILKQLKEARKELRTL 62 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 450
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=29.97 E-value=3.2e+02 Score=22.55 Aligned_cols=84 Identities=12% Similarity=0.257 Sum_probs=0.0
Q ss_pred hhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 183 LRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 183 ~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
++.-+..+++-+...-.++ -+.|..+...|..+|+.+...+++..+.......++..++.+++..-.+-..+-.
T Consensus 37 Trr~m~~A~~~v~kql~~v------s~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV~~v~~dv~~i~~dv~~v~~ 110 (126)
T PF07889_consen 37 TRRSMSDAVASVSKQLEQV------SESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEVTEVREDVSQIGDDVDSVQQ 110 (126)
T ss_pred HHHhHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Q ss_pred hhhhhhhhHH
Q psy21 263 IKINLDVEIA 272 (288)
Q Consensus 263 vKl~Le~EIa 272 (288)
.=..||--|.
T Consensus 111 ~V~~Le~ki~ 120 (126)
T PF07889_consen 111 MVEGLEGKID 120 (126)
T ss_pred HHHHHHHHHH
No 451
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=29.97 E-value=4.5e+02 Score=26.37 Aligned_cols=74 Identities=15% Similarity=0.238 Sum_probs=0.0
Q ss_pred hhhHHHhHHHHHHHHHHhhCCc-hhHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHHHhhHHH
Q psy21 183 LRKEQSSLLLELKETKGRSGKY-DTEMNELRAANYRLTERNKELED--------------------QIEYEQSKFARMEA 241 (288)
Q Consensus 183 ~~~~~~~~~eEl~~~~~~~~~l-~~ei~~L~~~~~~Le~~l~~le~--------------------~l~~~q~~i~~le~ 241 (288)
+...+.+.++.++.+....+.. ...|..|+++++.|+.+|..++. ....+-..+..++.
T Consensus 120 TeSRl~tv~~~l~~la~~~~~Dp~~Ri~~Le~e~~~i~~EI~~l~aG~~~~ld~~~~~er~~~i~~la~~L~~DFr~V~~ 199 (478)
T PF11855_consen 120 TESRLNTVFDALRQLAEGTDPDPERRIAELEREIAEIDAEIDRLEAGDVPVLDDTQARERARQILQLARELPADFRRVED 199 (478)
T ss_pred cHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHH
Q psy21 242 EIKRLQTEMTEQLKD 256 (288)
Q Consensus 242 el~~lr~~~~~ql~e 256 (288)
.+.++..++-.++.+
T Consensus 200 ~~r~l~r~lr~~i~~ 214 (478)
T PF11855_consen 200 NFRELDRALRERIID 214 (478)
T ss_pred HHHHHHHHHHHHHhh
No 452
>PF09738 DUF2051: Double stranded RNA binding protein (DUF2051); InterPro: IPR019139 This entry represents transcriptional repressors which preferentially bind to the GC-rich consensus sequence (5'-AGCCCCCGGCG-3') and may regulate expression of TNF, EGFR and PDGFA. They may control smooth muscle cell proliferation following artery injury through PDGFA repression and may also bind double-stranded RNA. They interact with the leucine-rich repeat domain of human flightless-I (FliI) protein.
Probab=29.97 E-value=4.9e+02 Score=24.69 Aligned_cols=85 Identities=18% Similarity=0.314 Sum_probs=0.0
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQ 273 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIat 273 (288)
++.++..+..++...-.--=.+.+|-++-..+.=++.-+.+.+..+++.+.+++.++....+++..+-..--.|-.|++.
T Consensus 79 ~r~lk~~l~evEekyrkAMv~naQLDNek~~l~yqvd~Lkd~lee~eE~~~~~~re~~eK~~elEr~K~~~d~L~~e~~~ 158 (302)
T PF09738_consen 79 LRDLKDSLAEVEEKYRKAMVSNAQLDNEKSALMYQVDLLKDKLEELEETLAQLQREYREKIRELERQKRAHDSLREELDE 158 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred hhhhh
Q psy21 274 YRNLM 278 (288)
Q Consensus 274 YrkLL 278 (288)
.+.-|
T Consensus 159 Lre~L 163 (302)
T PF09738_consen 159 LREQL 163 (302)
T ss_pred HHHHH
No 453
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=29.96 E-value=56 Score=30.18 Aligned_cols=21 Identities=24% Similarity=0.544 Sum_probs=0.0
Q ss_pred HHHhhHHHHHHHHHHHHHHHH
Q psy21 234 SKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 234 ~~i~~le~el~~lr~~~~~ql 254 (288)
.+|+.||.||..||++|+...
T Consensus 122 qKIsALEdELs~LRaQIA~IV 142 (253)
T PF05308_consen 122 QKISALEDELSRLRAQIAKIV 142 (253)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
No 454
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=29.93 E-value=2.2e+02 Score=20.76 Aligned_cols=64 Identities=14% Similarity=0.110 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh
Q psy21 207 EMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 207 ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~E 270 (288)
++..|..++..|-..+..+..+-..+...+..+..+-+.+...++.-..--..|+.-==+||.+
T Consensus 1 ~L~~Le~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekne~Ar~rvEamI~RLk~leq~ 64 (65)
T TIGR02449 1 ELQALAAQVEHLLEYLERLKSENRLLRAQEKTWREERAQLLEKNEQARQKVEAMITRLKALEQH 64 (65)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC
No 455
>PRK11677 hypothetical protein; Provisional
Probab=29.75 E-value=3.1e+02 Score=22.82 Aligned_cols=47 Identities=19% Similarity=0.234 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 213 AANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 213 ~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
++...|+.+++..+.++++++..+..-=.+-+.|=..|++ +|+.|..
T Consensus 29 ~~q~~le~eLe~~k~ele~YkqeV~~HFa~TA~Ll~~L~~---~Y~~Ly~ 75 (134)
T PRK11677 29 RQQQALQYELEKNKAELEEYRQELVSHFARSAELLDTMAK---DYRQLYQ 75 (134)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHH
No 456
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=29.72 E-value=1.8e+02 Score=27.14 Aligned_cols=45 Identities=13% Similarity=0.189 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy21 220 ERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIK 264 (288)
Q Consensus 220 ~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vK 264 (288)
..+.++..+.+.++.++..+.+++.....++.+....++.||+.+
T Consensus 66 ~~~~~l~~EN~~Lr~e~~~l~~~~~~~~~~l~~EN~rLr~LL~~~ 110 (283)
T TIGR00219 66 KDVNNLEYENYKLRQELLKKNQQLEILTQNLKQENVRLRELLNSP 110 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc
No 457
>KOG4809|consensus
Probab=29.67 E-value=3.4e+02 Score=28.07 Aligned_cols=73 Identities=12% Similarity=0.183 Sum_probs=0.0
Q ss_pred HhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 188 SSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 188 ~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
..+.||+...+...++|+..|+.|++....-+.++-.++.....+...+..+..+|..+.-.++++-.+.-.+
T Consensus 327 rE~~EeIe~~~ke~kdLkEkv~~lq~~l~eke~sl~dlkehassLas~glk~ds~Lk~leIalEqkkEec~km 399 (654)
T KOG4809|consen 327 RERLEEIESFRKENKDLKEKVNALQAELTEKESSLIDLKEHASSLASAGLKRDSKLKSLEIALEQKKEECSKM 399 (654)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHH
No 458
>PF14723 SSFA2_C: Sperm-specific antigen 2 C-terminus
Probab=29.65 E-value=3.9e+02 Score=23.42 Aligned_cols=73 Identities=21% Similarity=0.263 Sum_probs=0.0
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANYRLTERN-----KELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRE 259 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l-----~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~ 259 (288)
|..-..|++.-+.-++..+.++.+|+-.+-+-++-. .+-..+.+++|.--..+-+|++++..++...+..+.+
T Consensus 100 ~~~T~~Elq~mr~~ln~FR~qm~dlE~~l~~QQalvy~hMSeeER~EaeQLQsLR~avRqElqELE~QL~DRl~~l~e 177 (179)
T PF14723_consen 100 YSCTVQELQQMRRSLNSFREQMMDLELHLMRQQALVYRHMSEEEREEAEQLQSLRSAVRQELQELEFQLEDRLLQLRE 177 (179)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
No 459
>KOG0979|consensus
Probab=29.61 E-value=8.3e+02 Score=27.25 Aligned_cols=113 Identities=11% Similarity=0.187 Sum_probs=0.0
Q ss_pred HHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy21 175 NQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQL 254 (288)
Q Consensus 175 ~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql 254 (288)
.+...-.........++++++.+.....-..+.+.+|+..+..+..++......+......+...-..+..++..+....
T Consensus 245 ~~~~ey~~~k~~~~r~k~~~r~l~k~~~pi~~~~eeLe~~~~et~~~~s~~~~~~~e~~~k~~~~~ek~~~~~~~v~~~~ 324 (1072)
T KOG0979|consen 245 KHDREYNAYKQAKDRAKKELRKLEKEIKPIEDKKEELESEKKETRSKISQKQRELNEALAKVQEKFEKLKEIEDEVEEKK 324 (1072)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhHHhHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHhhhhhhhhhhHHHhhhhhhccccCCCC
Q psy21 255 KDYRELMEIKINLDVEIAQYRNLMEAEESSLPT 287 (288)
Q Consensus 255 ~eyq~LL~vKl~Le~EIatYrkLLEgEe~Rl~~ 287 (288)
.+...|.+--......|..-++.+.--..+++.
T Consensus 325 ~~le~lk~~~~~rq~~i~~~~k~i~~~q~el~~ 357 (1072)
T KOG0979|consen 325 NKLESLKKAAEKRQKRIEKAKKMILDAQAELQE 357 (1072)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
No 460
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=29.61 E-value=4.5e+02 Score=25.54 Aligned_cols=86 Identities=10% Similarity=0.158 Sum_probs=0.0
Q ss_pred eeccchhHHHHHHHHhhhhhHHHhHHHHHHHHHHhhCC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHH
Q psy21 166 LLNTDGELANQERIIDNLRKEQSSLLLELKETKGRSGK-YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIK 244 (288)
Q Consensus 166 L~d~~gevA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~-l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~ 244 (288)
+....-+++.-...+..+...+.....++..+...+.. +...|.+.....+.|+.++..+..++......|..|+..+.
T Consensus 210 i~~a~~e~~~S~~LR~~i~~~l~~~~~dl~~Q~~~vn~al~~Ri~et~~ak~~Le~ql~~~~~ei~~~e~~i~~L~~ai~ 289 (384)
T PF03148_consen 210 IQRAEKERQSSAQLREDIDSILEQTANDLRAQADAVNAALRKRIHETQEAKNELEWQLKKTLQEIAEMEKNIEDLEKAIR 289 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHH
Q psy21 245 RLQTEMT 251 (288)
Q Consensus 245 ~lr~~~~ 251 (288)
....-|.
T Consensus 290 ~k~~~lk 296 (384)
T PF03148_consen 290 DKEGPLK 296 (384)
T ss_pred HHHhhHH
No 461
>TIGR00414 serS seryl-tRNA synthetase. This model represents the seryl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam model tRNA-synt_2b. The seryl-tRNA synthetases of two archaeal species, Methanococcus jannaschii and Methanobacterium thermoautotrophicum, differ considerably and are included in a different model.
Probab=29.38 E-value=3.6e+02 Score=26.57 Aligned_cols=71 Identities=18% Similarity=0.272 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQ----IEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~----l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
.+++-.+-.+...+..+++.|++....+..++...... .+.+.+.+..+..++..+..++...-.++..++
T Consensus 29 vd~i~~ld~~~r~~~~~~~~l~~erN~~sk~i~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 103 (418)
T TIGR00414 29 LEKLIALDDERKKLLSEIEELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKL 103 (418)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 462
>PF07902 Gp58: gp58-like protein; InterPro: IPR012892 Sequences found in this entry are derived from a number of bacteriophage and prophage proteins. They are similar to gp58 (Q38355 from SWISSPROT), a minor structural protein of Lactococcus delbrueckii bacteriophage LL-H [].
Probab=29.32 E-value=1.7e+02 Score=30.46 Aligned_cols=74 Identities=5% Similarity=0.163 Sum_probs=0.0
Q ss_pred hhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHh
Q psy21 200 RSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQY 274 (288)
Q Consensus 200 ~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatY 274 (288)
++.+.+..++.++...++++.+|++.+..+..++..+..++..+......+..+......+...++.. .++.++
T Consensus 262 ei~~~~gaVS~vqQtads~~~rLq~aE~n~SSL~QTV~GLqs~Vs~~~~kisSritQls~~I~~~VT~-~~l~S~ 335 (601)
T PF07902_consen 262 EISNREGAVSRVQQTADSYQRRLQDAEGNYSSLTQTVRGLQSTVSDPDTKISSRITQLSSQIEQRVTR-SDLQSQ 335 (601)
T ss_pred hhhhhhhHHHHHHHHHHHHHhHHHHhHhHHHHHHHHHHHHHhhhhccccchhHHHHHHHHHHHHHhhh-hhhHhH
No 463
>KOG3335|consensus
Probab=29.25 E-value=86 Score=27.46 Aligned_cols=58 Identities=26% Similarity=0.237 Sum_probs=0.0
Q ss_pred HHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Q psy21 186 EQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 186 ~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~ 248 (288)
|.++..+-.++... +.++.+|+..+..|+.+++++...+.++...+...+.++...+.
T Consensus 91 ~R~~~~e~~kee~~-----~~e~~elr~~~~~l~~~i~~~~~~~~~L~~~l~~~~~el~~~~q 148 (181)
T KOG3335|consen 91 WRQARKERKKEEKR-----KQEIMELRLKVEKLENAIAELTKFFSQLHSKLNKPESELKPIRQ 148 (181)
T ss_pred HHhhhcchhhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccccccccc
No 464
>PF09728 Taxilin: Myosin-like coiled-coil protein; InterPro: IPR019132 Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca(2+)-dependent exocytosis in neuroendocrine cells []. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localises to the nucleus in osteoblasts and dimerises with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription [].
Probab=29.17 E-value=5e+02 Score=24.55 Aligned_cols=84 Identities=20% Similarity=0.288 Sum_probs=0.0
Q ss_pred HHhHHHHHHHHHHhhCCc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhh
Q psy21 187 QSSLLLELKETKGRSGKY----DTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELME 262 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l----~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~ 262 (288)
|..++++++.+-.+-+++ +.|+..+...+-.|+.+...+....+..+..|..+-.+......++...-..-+.|-+
T Consensus 221 Y~~Kf~efq~tL~kSNe~F~tfk~Emekm~Kk~kklEKE~~~~k~k~e~~n~~l~~m~eer~~~~~~~~~~~~k~~kLe~ 300 (309)
T PF09728_consen 221 YSEKFEEFQDTLNKSNEVFETFKKEMEKMSKKIKKLEKENQTWKSKWEKSNKALIEMAEERQKLEKELEKLKKKIEKLEK 300 (309)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred hhhhhhhh
Q psy21 263 IKINLDVE 270 (288)
Q Consensus 263 vKl~Le~E 270 (288)
+--+|-.|
T Consensus 301 LcRaLQ~e 308 (309)
T PF09728_consen 301 LCRALQAE 308 (309)
T ss_pred HHHHHhhC
No 465
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=29.14 E-value=1.5e+02 Score=22.28 Aligned_cols=53 Identities=11% Similarity=0.202 Sum_probs=0.0
Q ss_pred HHHHHHHHHhhCCchhHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhHHHHH
Q psy21 191 LLELKETKGRSGKYDTEMNELRAANYR---LTERNKELEDQIEYEQSKFARMEAEI 243 (288)
Q Consensus 191 ~eEl~~~~~~~~~l~~ei~~L~~~~~~---Le~~l~~le~~l~~~q~~i~~le~el 243 (288)
.+.++..-.++..++.+++.++..... |...+..+..++...+.++.....=|
T Consensus 10 r~dIk~vd~KVdaLq~~V~~l~~~~~~v~~l~~klDa~~~~l~~l~~~V~~I~~iL 65 (75)
T PF05531_consen 10 RQDIKAVDDKVDALQTQVDDLESNLPDVTELNKKLDAQSAQLTTLNTKVNEIQDIL 65 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHh
No 466
>PRK13182 racA polar chromosome segregation protein; Reviewed
Probab=29.11 E-value=2.1e+02 Score=24.84 Aligned_cols=57 Identities=12% Similarity=0.247 Sum_probs=0.0
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELE-----DQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le-----~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
+..+..+.+.+..+|++|+...+.+...+..++ .+++++...|..||+.++++....
T Consensus 87 ~~lLe~~~~~l~~ri~eLe~~l~~kad~vvsYqll~hr~e~ee~~~~l~~le~~~~~~e~~~ 148 (175)
T PRK13182 87 FEQLEAQLNTITRRLDELERQLQQKADDVVSYQLLQHRREMEEMLERLQKLEARLKKLEPIY 148 (175)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHhhc
No 467
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=29.07 E-value=6e+02 Score=27.29 Aligned_cols=84 Identities=21% Similarity=0.283 Sum_probs=0.0
Q ss_pred HHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hhHHHHHHHHHH
Q psy21 174 ANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKF-----ARMEAEIKRLQT 248 (288)
Q Consensus 174 A~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i-----~~le~el~~lr~ 248 (288)
|....... ...+....+.+.+.+.+++....++..++.....+.++++.....+......+ ...+..+.+++.
T Consensus 507 A~~~~~~~--~~~~~~li~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~~~~~a~~~l~~a~~ 584 (782)
T PRK00409 507 AKKLIGED--KEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK 584 (782)
T ss_pred HHHHHhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHH
Q psy21 249 EMTEQLKDYRE 259 (288)
Q Consensus 249 ~~~~ql~eyq~ 259 (288)
++...+++.+.
T Consensus 585 ~~~~~i~~lk~ 595 (782)
T PRK00409 585 EADEIIKELRQ 595 (782)
T ss_pred HHHHHHHHHHH
No 468
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=28.97 E-value=2.1e+02 Score=20.39 Aligned_cols=44 Identities=16% Similarity=0.369 Sum_probs=0.0
Q ss_pred chhHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHhhHHHHHHHHH
Q psy21 204 YDTEMNELRAANYRLTERNK--------------ELEDQIEYEQSKFARMEAEIKRLQ 247 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~--------------~le~~l~~~q~~i~~le~el~~lr 247 (288)
|+.++..++..+..++..|. .-...++.++..+..++..|..|.
T Consensus 9 L~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~~Lk 66 (66)
T PF10458_consen 9 LEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALEQLK 66 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
No 469
>PRK06975 bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
Probab=28.96 E-value=2.7e+02 Score=29.20 Aligned_cols=64 Identities=17% Similarity=0.287 Sum_probs=0.0
Q ss_pred HHHhhCCchhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 197 TKGRSGKYDTEMNELRAANYRLTERNKE----LEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 197 ~~~~~~~l~~ei~~L~~~~~~Le~~l~~----le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
...++..++.+|..+..+...+..+.+. ++..+...+.++..+|.++...+.+....-.-|++|
T Consensus 344 ~~q~~~~~~~~l~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~l~~le~~l~~~~~~~~~L~~~~~~l 411 (656)
T PRK06975 344 LNRKVDRLDQELVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQQYQDL 411 (656)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 470
>PF06339 Ectoine_synth: Ectoine synthase; InterPro: IPR010462 This family consists of several bacterial ectoine synthase proteins. The ectABC genes encode the diaminobutyric acid acetyltransferase (EctA), the diaminobutyric acid aminotransferase (EctB), and the ectoine synthase (EctC). Together these proteins constitute the ectoine biosynthetic pathway [].; GO: 0016836 hydro-lyase activity, 0006596 polyamine biosynthetic process
Probab=28.93 E-value=86 Score=25.95 Aligned_cols=41 Identities=22% Similarity=0.364 Sum_probs=0.0
Q ss_pred CCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 93 KGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 93 ~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
.++.++.=++|.=+|-+=..+..-|.|+. .+|.||....+|
T Consensus 11 ~Tdr~V~~~~w~SrRlll~~DgmGFS~h~-T~i~aGtet~~~ 51 (126)
T PF06339_consen 11 GTDRDVDAENWESRRLLLKDDGMGFSFHE-TTIYAGTETHIH 51 (126)
T ss_pred CCceeEEcCCceEEEEEEccCCCCEEEEE-EEEeCCCeeEEE
No 471
>PF14193 DUF4315: Domain of unknown function (DUF4315)
Probab=28.86 E-value=1.8e+02 Score=22.26 Aligned_cols=55 Identities=16% Similarity=0.244 Sum_probs=0.0
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHhhHHHHHHHHHH
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELED-----QIEYEQSKFARMEAEIKRLQT 248 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~-----~l~~~q~~i~~le~el~~lr~ 248 (288)
+.++...++-.+.+|.+++..+..|+.+..++|+ .+..+.-....|-.-|...+.
T Consensus 3 leKi~~eieK~k~Kiae~Q~rlK~Le~qk~E~EN~EIv~~VR~~~mtp~eL~~~L~~~~~ 62 (83)
T PF14193_consen 3 LEKIRAEIEKTKEKIAELQARLKELEAQKTEAENLEIVQMVRSMKMTPEELAAFLRAMKS 62 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHh
No 472
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=28.66 E-value=2e+02 Score=22.89 Aligned_cols=46 Identities=26% Similarity=0.449 Sum_probs=0.0
Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 234 SKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 234 ~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
+.+..+++++..+-.+|......-..|+-=-..|.+|-...|..|.
T Consensus 8 ~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~ 53 (107)
T PF06156_consen 8 DRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLE 53 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 473
>COG3685 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.49 E-value=3.9e+02 Score=23.15 Aligned_cols=96 Identities=18% Similarity=0.175 Sum_probs=0.0
Q ss_pred hhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH--------HHHHHHHHHHHHHH
Q psy21 182 NLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARM--------EAEIKRLQTEMTEQ 253 (288)
Q Consensus 182 ~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l--------e~el~~lr~~~~~q 253 (288)
.+..|+...+---++.-.-+..+-..++ --....+.+..+++++++++.++..+..+ -..+..+-.+.+..
T Consensus 9 ~f~~~LrD~y~aEkq~~kaL~kma~~~~-~~~Lka~~E~Hl~ET~~qi~rLe~Vfe~~g~~~~~~~cda~~giiaegq~i 87 (167)
T COG3685 9 LFIDTLRDIYAAEKQILKALPKMARRAQ-YPELKAAIEKHLEETKGQIERLEQVFERLGKKARRVTCDAMEGLIAEGQEI 87 (167)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc-CHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccccchHHHHHHHHHHHHHH
Q ss_pred HHHHHHHhhhhhhhhh--------hHHHhhhhh
Q psy21 254 LKDYRELMEIKINLDV--------EIAQYRNLM 278 (288)
Q Consensus 254 l~eyq~LL~vKl~Le~--------EIatYrkLL 278 (288)
+.+|..=--++..|-. ||+.|+.|+
T Consensus 88 ~~~~~~~evlda~L~~aaq~vEhyEIA~YgtL~ 120 (167)
T COG3685 88 MEEFKSNEVLDAGLIGAAQKVEHYEIACYGTLI 120 (167)
T ss_pred HHhccccHHHHHHHHHHHHHHHHHHHHHHHHHH
No 474
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=28.35 E-value=2.7e+02 Score=21.53 Aligned_cols=41 Identities=12% Similarity=0.315 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy21 223 KELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEI 263 (288)
Q Consensus 223 ~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~v 263 (288)
...+.++..++..+..+..++.+++.++.....|=+.||.+
T Consensus 45 ~rwek~v~~L~~e~~~l~~E~e~L~~~l~~e~~Ek~~Ll~l 85 (87)
T PF12709_consen 45 ARWEKKVDELENENKALKRENEQLKKKLDTEREEKQELLKL 85 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
No 475
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=28.31 E-value=3.3e+02 Score=22.15 Aligned_cols=75 Identities=16% Similarity=0.266 Sum_probs=0.0
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHhhhhh
Q psy21 204 YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEIAQYRNLM 278 (288)
Q Consensus 204 l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EIatYrkLL 278 (288)
++.++..|+..+..+......++.++..++..+.........+...-++.+.-+-..+..=..|-.+...++.-+
T Consensus 1 ~~~e~~~l~~e~~~~~~~~~~~~~~~~~~~~dl~~q~~~a~~Aq~~YE~El~~Ha~~~~~L~~lr~e~~~~~~~~ 75 (132)
T PF07926_consen 1 FESELSSLQSELQRLKEQEEDAEEQLQSLREDLESQAKIAQEAQQKYERELVKHAEDIKELQQLREELQELQQEI 75 (132)
T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH
No 476
>PRK14158 heat shock protein GrpE; Provisional
Probab=28.28 E-value=3.8e+02 Score=23.75 Aligned_cols=60 Identities=12% Similarity=0.149 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh
Q psy21 210 ELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDV 269 (288)
Q Consensus 210 ~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~ 269 (288)
.....+..|+.++..++.++..+...+.++.++....|..+.+...+...--.-++..++
T Consensus 37 ~~~~~~~~le~~l~~le~e~~el~d~~lR~~AefeN~RkR~~kE~e~~~~~a~~~~~~~l 96 (194)
T PRK14158 37 AAADRIKELEEALAAKEAEAAANWDKYLRERADLENYRKRVQKEKEELLKYGNESLILEI 96 (194)
T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 477
>PF14931 IFT20: Intraflagellar transport complex B, subunit 20
Probab=28.27 E-value=3.3e+02 Score=22.19 Aligned_cols=92 Identities=14% Similarity=0.196 Sum_probs=0.0
Q ss_pred hhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHhhHHHHHHHHHHHHH
Q psy21 183 LRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKE-----------LEDQIEYEQSKFARMEAEIKRLQTEMT 251 (288)
Q Consensus 183 ~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~-----------le~~l~~~q~~i~~le~el~~lr~~~~ 251 (288)
..+.-....++......++.+.+.=+..+-..+..+-..++. ++....+-......+...|...+.+++
T Consensus 18 ~~~~t~~Lk~ec~~F~~ki~~F~~iv~~~~~~~~~~A~~VE~eKlkAIG~RN~l~s~~k~R~~~~q~lq~~I~Ek~~eLE 97 (120)
T PF14931_consen 18 KADQTQELKEECKEFVEKISEFQKIVKGFIEILDELAKRVENEKLKAIGARNLLKSEAKQREAQQQQLQALIAEKKMELE 97 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHhhhhhhhhhhHHHh
Q psy21 252 EQLKDYRELMEIKINLDVEIAQY 274 (288)
Q Consensus 252 ~ql~eyq~LL~vKl~Le~EIatY 274 (288)
+.-.+|+.|..+..--..=|..+
T Consensus 98 Rl~~E~~sL~kve~eQ~~~i~~~ 120 (120)
T PF14931_consen 98 RLRSEYESLQKVEQEQNELIQKL 120 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhC
No 478
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=28.25 E-value=2.8e+02 Score=21.46 Aligned_cols=74 Identities=19% Similarity=0.236 Sum_probs=0.0
Q ss_pred HHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH----------------------HHHHHHHHHhhHHHHHH
Q psy21 187 QSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELED----------------------QIEYEQSKFARMEAEIK 244 (288)
Q Consensus 187 ~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~----------------------~l~~~q~~i~~le~el~ 244 (288)
++....+++.++.+++.+..+++.|.......+.-+.+++. ....+...+..++.++.
T Consensus 1 ~q~~~~~~q~l~~~~~~l~~~~~~l~~~~~E~~~v~~EL~~l~~d~~vy~~VG~vfv~~~~~ea~~~Le~~~e~le~~i~ 80 (105)
T cd00632 1 VQEQLAQLQQLQQQLQAYIVQRQKVEAQLNENKKALEELEKLADDAEVYKLVGNVLVKQEKEEARTELKERLETIELRIK 80 (105)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcchHHHHhhhHHhhccHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHH
Q psy21 245 RLQTEMTEQLKDYREL 260 (288)
Q Consensus 245 ~lr~~~~~ql~eyq~L 260 (288)
.+...+...-.+..+|
T Consensus 81 ~l~~~~~~l~~~~~el 96 (105)
T cd00632 81 RLERQEEDLQEKLKEL 96 (105)
T ss_pred HHHHHHHHHHHHHHHH
No 479
>cd07597 BAR_SNX8 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 8. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX8 and the yeast counterpart Mvp1p are involved in sorting and delivery of late-Golgi proteins, such as carboxypeptidase Y, to vacuoles. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=28.23 E-value=2.6e+02 Score=25.31 Aligned_cols=115 Identities=16% Similarity=0.062 Sum_probs=0.0
Q ss_pred CCCCCCCceeeecc-cccCccchhhheeccchhHHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHH----HHHHHH
Q psy21 141 ATHEPPHNIVMKNQ-KFGSGDEISTILLNTDGELANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMN----ELRAAN 215 (288)
Q Consensus 141 ~~~~pp~dlv~~~~-~wg~g~~i~t~L~d~~gevA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~----~L~~~~ 215 (288)
...+++..-.|.+. .|+...+....+-+.-+..+......+ ..|...-.|.++.++.-+.+++.=.. .....+
T Consensus 70 ~~~~~~~~~~~~~~~~~~~l~~~l~~~s~~~~~~s~~~~~~a--~~~~~~vlE~Lk~~~d~l~S~r~lf~R~~k~~~~~i 147 (246)
T cd07597 70 GELTARLYPWAGDSDTWGDINEGLSSLSKHFQLLSDLSEDEA--RAEEDGVLEKLKLQLDLLVSLRDLFERHEKLSLNNI 147 (246)
T ss_pred ccCCCCCCCccCCCccHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcccccH
Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHHHHH
Q psy21 216 YRLTERNKELEDQIEYE-------QSKFARMEAEIKRLQTEMTEQLKDY 257 (288)
Q Consensus 216 ~~Le~~l~~le~~l~~~-------q~~i~~le~el~~lr~~~~~ql~ey 257 (288)
++|+..+...+..+..+ ...+..+++.+..-+.+|..|+.-|
T Consensus 148 ~~l~~ri~~~~~kl~~l~~~~~~~~~e~ekl~~~i~~d~~~i~~q~~R~ 196 (246)
T cd07597 148 QRLLKRIELNKKKLESLRAKPDVKGAEVDKLEASIIKDKESIANQLNRS 196 (246)
T ss_pred HHHHHHHHHHHHHHHHhhcCCCCchhHHHHHHHHHhccHHHHHHHHHHH
No 480
>KOG3119|consensus
Probab=28.19 E-value=3.9e+02 Score=24.68 Aligned_cols=66 Identities=17% Similarity=0.183 Sum_probs=0.0
Q ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy21 202 GKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINL 267 (288)
Q Consensus 202 ~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~L 267 (288)
........-.++.+.++......-.....+.+.++..|+.|...++.++.+..++...|-+++..+
T Consensus 190 ~~~~~y~err~rNN~A~~kSR~~~k~~~~e~~~r~~~leken~~lr~~v~~l~~el~~~~~~~~~~ 255 (269)
T KOG3119|consen 190 KKDPEYKERRRRNNEAVRKSRDKRKQKEDEMAHRVAELEKENEALRTQVEQLKKELATLRRLFLQL 255 (269)
T ss_pred cCCHHHHHHHHhhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
No 481
>COG2847 Copper(I)-binding protein [Inorganic ion transport and metabolism]
Probab=28.11 E-value=2.1e+02 Score=24.43 Aligned_cols=49 Identities=22% Similarity=0.371 Sum_probs=0.0
Q ss_pred CcEEEEEeCCccceec--------cCeEEEEEeCCcccEEEEecCCeEecCCcEEEE
Q psy21 85 GKFVKLYNKGSEEQSL--------GNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTV 133 (288)
Q Consensus 85 GkfV~l~N~s~~~~~L--------~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtI 133 (288)
|-|+.|.|+++++..| ..-.|-..+.++..-.-+==+++.|+||++|..
T Consensus 44 a~~~ti~N~~~~~~~Lv~v~s~~a~~~ElHe~i~~~gvmkMr~v~~i~Ipa~~~v~l 100 (151)
T COG2847 44 AAFMTITNNGDKDDRLVGVSSPIAARAELHETIHDGGVMKMRKVPGIVIPAGGTVEL 100 (151)
T ss_pred eEEEEEeCCCCCCceEEEEecCccceeEEEEEEecCCeEEEEEcCcEEECCCceEEe
No 482
>PRK12704 phosphodiesterase; Provisional
Probab=28.04 E-value=6.6e+02 Score=25.60 Aligned_cols=99 Identities=16% Similarity=0.210 Sum_probs=0.0
Q ss_pred h-HHHHHHHHhhhhhHHHhHHHHHHHH----HHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Q psy21 172 E-LANQERIIDNLRKEQSSLLLELKET----KGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRL 246 (288)
Q Consensus 172 e-vA~~~~~~~~~~~~~~~~~eEl~~~----~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~l 246 (288)
+ .+.........+..+....+|+... +.++...+..|..-...+..-+..|...+..|...+..+...+.++...
T Consensus 50 eAke~~ke~~leaeeE~~~~R~Ele~e~~~~e~~L~qrE~rL~~Ree~Le~r~e~Lekke~eL~~re~~Le~re~eLe~~ 129 (520)
T PRK12704 50 EAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKK 129 (520)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhh
Q psy21 247 QTEMTEQLKDYRELMEIKINLDVE 270 (288)
Q Consensus 247 r~~~~~ql~eyq~LL~vKl~Le~E 270 (288)
+.++.....+...-|--=..|..|
T Consensus 130 ~~~~~~~~~~~~~~l~~~a~lt~~ 153 (520)
T PRK12704 130 EEELEELIEEQLQELERISGLTAE 153 (520)
T ss_pred HHHHHHHHHHHHHHHHHHhCCCHH
No 483
>KOG0978|consensus
Probab=27.86 E-value=7.7e+02 Score=26.30 Aligned_cols=109 Identities=17% Similarity=0.198 Sum_probs=0.0
Q ss_pred hh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy21 171 GE-LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 171 ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~ 249 (288)
++ +++-..........+...-+.+..+-...+.+..++.-+...+..+.....++...+..++......+..+.++...
T Consensus 509 ~~~i~~l~~~~~~~~~~i~~leeq~~~lt~~~~~l~~el~~~~~~le~~kk~~~e~~~~~~~Lq~~~ek~~~~le~i~~~ 588 (698)
T KOG0978|consen 509 EEQILTLKASVDKLELKIGKLEEQERGLTSNESKLIKELTTLTQSLEMLKKKAQEAKQSLEDLQIELEKSEAKLEQIQEQ 588 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHhhhhhhhhhhHHHhhhhhh
Q psy21 250 MTEQLKDYRELMEIKINLDVEIAQYRNLME 279 (288)
Q Consensus 250 ~~~ql~eyq~LL~vKl~Le~EIatYrkLLE 279 (288)
....-.+...+--.--+|+-|+...++-|+
T Consensus 589 ~~e~~~ele~~~~k~~rleEE~e~L~~kle 618 (698)
T KOG0978|consen 589 YAELELELEIEKFKRKRLEEELERLKRKLE 618 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 484
>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=27.79 E-value=4.2e+02 Score=23.20 Aligned_cols=78 Identities=18% Similarity=0.257 Sum_probs=0.0
Q ss_pred hhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 183 LRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 183 ~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
....+.....++.....++..-+..|+.-...+..-+..+...+..|......+...+.++..+..+....|.+.-.|
T Consensus 69 ~E~E~~~~~~el~~~E~rl~~rE~~L~~~~~~L~~~e~~l~~~~~~l~~~~~~l~~~~~e~~~~~~~~~~~Le~iAgl 146 (201)
T PF12072_consen 69 LERELKERRKELQRLEKRLQQREEQLDRRLEQLEKREEELEKKEEELEQRKEELEEREEELEELIEEQQQELEEIAGL 146 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC
No 485
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=27.75 E-value=6e+02 Score=25.07 Aligned_cols=89 Identities=19% Similarity=0.168 Sum_probs=0.0
Q ss_pred HHHHHhhhhhHHHhHHHHHHH-HHHhhCCchhHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhhHHHHHHHHHHHH
Q psy21 176 QERIIDNLRKEQSSLLLELKE-TKGRSGKYDTEMNELRAANYRLTERNKELED----QIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 176 ~~~~~~~~~~~~~~~~eEl~~-~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~----~l~~~q~~i~~le~el~~lr~~~ 250 (288)
...+.. ...++..++.|+. ....++-+...+++-+-+..+||+++.++-. ++..++..++.+|+.++=...+-
T Consensus 222 ~eik~~--~~~L~~~~e~Lk~~~~~e~~~~~~~LqEEr~R~erLEeqlNd~~elHq~Ei~~LKqeLa~~EEK~~Yqs~eR 299 (395)
T PF10267_consen 222 REIKES--QSRLEESIEKLKEQYQREYQFILEALQEERYRYERLEEQLNDLTELHQNEIYNLKQELASMEEKMAYQSYER 299 (395)
T ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHhhhhhh
Q psy21 251 TEQLKDYRELMEIKIN 266 (288)
Q Consensus 251 ~~ql~eyq~LL~vKl~ 266 (288)
.+-+.|--+-...|+.
T Consensus 300 aRdi~E~~Es~qtRis 315 (395)
T PF10267_consen 300 ARDIWEVMESCQTRIS 315 (395)
T ss_pred HhHHHHHHHHHHHHHH
No 486
>PF02055 Glyco_hydro_30: O-Glycosyl hydrolase family 30; InterPro: IPR001139 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 30 GH30 from CAZY comprises enzymes with only one known activity; glucosylceramidase (3.2.1.45 from EC). Family 30 encompasses the mammalian glucosylceramidases. Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase), cleaves the glucosidic bonds of glucosylceramide and synthetic beta-glucosides []. Any one of over 50 different mutations in the gene of glucocerebrosidase have been found to affect activity of this hydrolase, producing variants of Gaucher disease, the most prevalent lysosomal storage disease [, ].; GO: 0004348 glucosylceramidase activity, 0006665 sphingolipid metabolic process, 0007040 lysosome organization, 0005764 lysosome; PDB: 2VT0_B 1NOF_A 2Y24_A 2WCG_B 2J25_A 3GXM_D 1Y7V_B 2NT0_C 3GXF_C 3GXD_A ....
Probab=27.74 E-value=3.3e+02 Score=27.61 Aligned_cols=65 Identities=25% Similarity=0.428 Sum_probs=0.0
Q ss_pred cceeEEeeCCCC-CeEEEe-ecCCCcEEEE-EeCCccceeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEE
Q psy21 63 SSGFQVTSSAKG-NMEITE-VDPDGKFVKL-YNKGSEEQSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVW 134 (288)
Q Consensus 63 ~~~~~~~~sa~G-~v~I~e-~d~~GkfV~l-~N~s~~~~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIw 134 (288)
+.++....+..+ .|.... .+|||..|.+ .|.++.++ .+.-.|.+.....-.| .+.|.|+..||++
T Consensus 427 a~RI~st~~~~~~~l~~vAF~nPDGs~vvVv~N~~~~~~-----~~~v~v~~~~~~~~~~--~~~lp~~s~~t~~ 494 (496)
T PF02055_consen 427 AVRIGSTSSSSDSGLEAVAFLNPDGSIVVVVLNRGDSDQ-----NFSVTVKDGSKGNNHF--NVTLPPRSIVTTY 494 (496)
T ss_dssp -EEEEEEESSSTTTEEEEEEEETTSEEEEEEEE-SSS-E-----EEEEEEECTTTEE--E--EEEEE-TTEEEEE
T ss_pred CEEEEeeccCCCCceeEEEEECCCCCEEEEEEcCCCCcc-----ceEEEEecCCcceeEE--EEEeCCCceEEEE
No 487
>PRK13731 conjugal transfer surface exclusion protein TraT; Provisional
Probab=27.66 E-value=60 Score=29.80 Aligned_cols=40 Identities=18% Similarity=0.278 Sum_probs=0.0
Q ss_pred EEEEEeCCcccee-----------ccCeEEEEEeCCcccEEEEecCCeEecCCcE
Q psy21 87 FVKLYNKGSEEQS-----------LGNWQLVRKVQGEVKTSFKFHRTLKIEAGGT 130 (288)
Q Consensus 87 fV~l~N~s~~~~~-----------L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~ 130 (288)
||.+.|+|+++.. -.||+|. ++-++..|-+.. .+|+.|..
T Consensus 54 yv~vrNTSd~~~~~l~~~i~~~L~~kGY~iv---~~P~~A~Y~lQa-NVL~~~K~ 104 (243)
T PRK13731 54 FLQIKNTSDKDMSGLQGKIADAVKAKGYQVV---TSPDKAYYWIQA-NVLKADKM 104 (243)
T ss_pred EEEEeeCCCcchHHHHHHHHHHHHhCCeEEe---cChhhceeeeee-eehhcccC
No 488
>PRK01203 prefoldin subunit alpha; Provisional
Probab=27.63 E-value=3.6e+02 Score=22.40 Aligned_cols=74 Identities=14% Similarity=0.236 Sum_probs=0.0
Q ss_pred HhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHH----------------------------------------
Q psy21 188 SSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELED---------------------------------------- 227 (288)
Q Consensus 188 ~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~---------------------------------------- 227 (288)
....++++-+..+++.++.+++.|+.....+...+..++.
T Consensus 3 ~~~~~~~~~~~~q~e~l~~ql~~L~~a~se~~~~ie~L~~~~~~~~~eiLVPLg~slYV~gki~d~~kVlVdIGTGy~VE 82 (130)
T PRK01203 3 RDVEAQLNYIESLISSVDSQIDSLNKTLSEVQQTISFLSDNELDNSKELLISIGSGIFADGNIKKDKDLIVPIGSGVYIA 82 (130)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccCCCCeEEEEccCCceEeEEecCCCeEEEEcCCCeEEE
Q ss_pred -----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 228 -----QIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 228 -----~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
.+..++..+..|+.-+......+.....-|..|.
T Consensus 83 K~~e~kie~L~~~ie~Le~~i~~K~~~l~~i~~~~~~l~ 121 (130)
T PRK01203 83 EERERTIERLKENLEDLKDSIQKLNDQRKTLVDQYNTVY 121 (130)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 489
>PF10828 DUF2570: Protein of unknown function (DUF2570); InterPro: IPR022538 This entry is represented by Bacteriophage IME08, pseT.3. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This is a family of proteins with unknown function.
Probab=27.55 E-value=3.1e+02 Score=21.65 Aligned_cols=68 Identities=13% Similarity=0.186 Sum_probs=0.0
Q ss_pred HHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 194 LKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 194 l~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
+--+..++..++.+...+..++..-+..+..+...+...+..+......-++++.+.+...+.+..-+
T Consensus 20 l~~qs~~i~~L~a~n~~q~~tI~qq~~~~~~L~~~~~~~r~~~~~~~~~~qq~r~~~e~~~e~ik~~l 87 (110)
T PF10828_consen 20 LWYQSQRIDRLRAENKAQAQTIQQQEDANQELKAQLQQNRQAVEEQQKREQQLRQQSEERRESIKTAL 87 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 490
>COG4717 Uncharacterized conserved protein [Function unknown]
Probab=27.54 E-value=3.2e+02 Score=29.85 Aligned_cols=80 Identities=19% Similarity=0.255 Sum_probs=0.0
Q ss_pred HHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhH
Q psy21 192 LELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIKINLDVEI 271 (288)
Q Consensus 192 eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vKl~Le~EI 271 (288)
+|+..+...+..+..++.+|++++..+..++..+|. -..++.+..+-..++.++....++|-.|--.+..|+.=|
T Consensus 774 ~E~~~lEe~~d~~~ee~~el~a~v~~~~~qi~~lE~-----g~~~a~lr~~~~slk~~l~e~ar~Wasl~~~~~vl~e~l 848 (984)
T COG4717 774 EELALLEEAIDALDEEVEELHAQVAALSRQIAQLEG-----GGTVAELRQRRESLKEDLEEKARKWASLRLAVQVLEEAL 848 (984)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHhhh
Q psy21 272 AQYRN 276 (288)
Q Consensus 272 atYrk 276 (288)
--|++
T Consensus 849 ~~~ke 853 (984)
T COG4717 849 RLFKE 853 (984)
T ss_pred HHHHh
No 491
>PRK11147 ABC transporter ATPase component; Reviewed
Probab=27.28 E-value=2e+02 Score=29.72 Aligned_cols=53 Identities=15% Similarity=0.240 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhHHHHHHHHHHHHHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQS------KFARMEAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~------~i~~le~el~~lr~~~~~ql~eyq~L 260 (288)
+..|+..+..|+.++.+++.++.+... .+..+..++..++.++.....++++|
T Consensus 570 ~~~~e~~i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~ 628 (635)
T PRK11147 570 LEQLPQLLEDLEAEIEALQAQVADADFFSQPHEQTQKVLADLADAEQELEVAFERWEEL 628 (635)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCchhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 492
>PRK11578 macrolide transporter subunit MacA; Provisional
Probab=27.24 E-value=2.7e+02 Score=26.47 Aligned_cols=88 Identities=14% Similarity=0.167 Sum_probs=0.0
Q ss_pred hh-HHHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHhhH
Q psy21 171 GE-LANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTE----------RNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 171 ge-vA~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~----------~l~~le~~l~~~q~~i~~l 239 (288)
|+ ++....... ...+..+..++..+..++...+.+++..+...++.+. ++.+.+.++...+..+..+
T Consensus 86 G~~L~~ld~~~~--~~~~~~~~a~l~~~~~~l~~a~~~l~~a~~~~~r~~~L~~~g~is~~~~~~~~~~~~~~~~~~~~~ 163 (370)
T PRK11578 86 DQLLGVIDPEQA--ENQIKEVEATLMELRAQRQQAEAELKLARVTLSRQQRLAKTQAVSQQDLDTAATELAVKQAQIGTI 163 (370)
T ss_pred CCEEEEECcHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy21 240 EAEIKRLQTEMTEQLKDYREL 260 (288)
Q Consensus 240 e~el~~lr~~~~~ql~eyq~L 260 (288)
++++..++..+.........+
T Consensus 164 ~~~l~~~~~~l~~~~~~l~~~ 184 (370)
T PRK11578 164 DAQIKRNQASLDTAKTNLDYT 184 (370)
T ss_pred HHHHHHHHHHHHHHHHHHhcC
No 493
>KOG0993|consensus
Probab=27.21 E-value=6.5e+02 Score=25.23 Aligned_cols=88 Identities=17% Similarity=0.239 Sum_probs=0.0
Q ss_pred HHHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy21 174 ANQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQ 253 (288)
Q Consensus 174 A~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~q 253 (288)
+.-......+...-....+++...-+--.++..+|..|+-++......-++++.++...-......++.+..++.++.+.
T Consensus 409 ~ave~l~ql~~~~r~~~~~~l~a~ehv~e~l~~ei~~L~eqle~e~~~~~~le~ql~~~ve~c~~~~aS~~slk~e~erl 488 (542)
T KOG0993|consen 409 AAVEQLAQLYKQRRTSLQQELDASEHVQEDLVKEIQSLQEQLEKERQSEQELEWQLDDDVEQCSNCDASFASLKVEPERL 488 (542)
T ss_pred hhHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHH
Q ss_pred HHHHHHHh
Q psy21 254 LKDYRELM 261 (288)
Q Consensus 254 l~eyq~LL 261 (288)
-+....+.
T Consensus 489 ~qq~eqi~ 496 (542)
T KOG0993|consen 489 HQQCEQIF 496 (542)
T ss_pred HHHHHHHH
No 494
>KOG3208|consensus
Probab=27.04 E-value=4.9e+02 Score=23.74 Aligned_cols=59 Identities=8% Similarity=0.253 Sum_probs=0.0
Q ss_pred CCchhHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHh
Q psy21 202 GKYDTEMNELRAANYRLTERNKE----------LEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELM 261 (288)
Q Consensus 202 ~~l~~ei~~L~~~~~~Le~~l~~----------le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL 261 (288)
.++..||+.|-.+.+.+.....+ ....|+..+..+.++.++....+.. -.+.+|...||
T Consensus 54 ks~~~eie~LLeql~~vndsm~~~~~s~a~~aa~~htL~RHrEILqdy~qef~rir~n-~~a~~e~~~Ll 122 (231)
T KOG3208|consen 54 KSLENEIEGLLEQLQDVNDSMNDCASSPANSAAVMHTLQRHREILQDYTQEFRRIRSN-IDAKRERESLL 122 (231)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHhhccCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH
No 495
>KOG0712|consensus
Probab=27.04 E-value=1.2e+02 Score=29.28 Aligned_cols=54 Identities=17% Similarity=0.123 Sum_probs=0.0
Q ss_pred eeccCeEEEEEeCCcccEEEEecCCeEecCCcEEEEEecccCCCCCCCCCceeee
Q psy21 98 QSLGNWQLVRKVQGEVKTSFKFHRTLKIEAGGTVTVWSFNVEGATHEPPHNIVMK 152 (288)
Q Consensus 98 ~~L~gw~l~r~v~~~~~~~y~Fp~~~~L~~g~~vtIwa~~~~~~~~~pp~dlv~~ 152 (288)
..|.||...-+..++....+.-+|+-||+||....|...|. -...+|..||+.+
T Consensus 264 eal~G~~~~~~~ldGr~l~~~~~pg~vi~~~~~~~v~~~gm-p~~~~~~g~lyi~ 317 (337)
T KOG0712|consen 264 EALCGFQRVWETLDGRLLKLSSKPGEVISPGDTKRVEGEGM-PIFRNPKGDLYIK 317 (337)
T ss_pred hccccceEEEEccCCceEEEecCCCceeChhHEEeecCCCc-ccccCCCCcEEEE
No 496
>KOG0018|consensus
Probab=27.02 E-value=3.7e+02 Score=30.03 Aligned_cols=81 Identities=21% Similarity=0.155 Sum_probs=0.0
Q ss_pred hHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy21 185 KEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEMTEQLKDYRELMEIK 264 (288)
Q Consensus 185 ~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~~~ql~eyq~LL~vK 264 (288)
+.+..+++-+...-..++.-+.++.+.+..+..|+..+..+...+......++..+.|++.....|. ..-..+++++
T Consensus 655 ~~L~~~k~rl~eel~ei~~~~~e~~~v~~~i~~le~~~~~~~~~~~~~k~~l~~~~~El~~~~~~i~---~~~p~i~~i~ 731 (1141)
T KOG0018|consen 655 DQLKEKKERLLEELKEIQKRRKEVSSVESKIHGLEMRLKYSKLDLEQLKRSLEQNELELQRTESEID---EFGPEISEIK 731 (1141)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhCchHHHHH
Q ss_pred hhhh
Q psy21 265 INLD 268 (288)
Q Consensus 265 l~Le 268 (288)
..|+
T Consensus 732 r~l~ 735 (1141)
T KOG0018|consen 732 RKLQ 735 (1141)
T ss_pred HHHH
No 497
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=26.97 E-value=2.9e+02 Score=23.38 Aligned_cols=56 Identities=23% Similarity=0.271 Sum_probs=0.0
Q ss_pred hhHHHhHHHHHHHHHHhhCCch-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy21 184 RKEQSSLLLELKETKGRSGKYD-----TEMNELRAANYRLTERNKELEDQIEYEQSKFARM 239 (288)
Q Consensus 184 ~~~~~~~~eEl~~~~~~~~~l~-----~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~l 239 (288)
...+..++.|+.+++.+..... .+-..|+++.+.++.+++.+..++...+..+...
T Consensus 39 ~~~~~~l~~Ei~~l~~E~~~iS~qDeFAkwaKl~Rk~~kl~~el~~~~~~~~~~~~~~~~~ 99 (161)
T PF04420_consen 39 SKEQRQLRKEILQLKRELNAISAQDEFAKWAKLNRKLDKLEEELEKLNKSLSSEKSSFDKS 99 (161)
T ss_dssp HHHHHHHHHHHHHHHHHHTTS-TTTSHHHHHHHHHHHHHHHHHHHHHHHHHHHTCHHHHHH
T ss_pred cHHHHHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 498
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=26.95 E-value=23 Score=28.55 Aligned_cols=43 Identities=26% Similarity=0.498 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Q psy21 208 MNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTEM 250 (288)
Q Consensus 208 i~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~~ 250 (288)
|.+|...+.++..++.++|..+..++..+..|..++..+..+.
T Consensus 1 ~~~Ld~~k~~laK~~~~LE~~l~~l~~el~~L~~~l~eLe~~~ 43 (118)
T PF08286_consen 1 IQELDNEKFRLAKELSDLESELESLQSELEELKEELEELEEQE 43 (118)
T ss_dssp ------------------------------------------H
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
No 499
>PF07111 HCR: Alpha helical coiled-coil rod protein (HCR); InterPro: IPR009800 This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation [].; GO: 0030154 cell differentiation, 0005634 nucleus, 0005737 cytoplasm
Probab=26.90 E-value=8e+02 Score=26.20 Aligned_cols=107 Identities=21% Similarity=0.275 Sum_probs=0.0
Q ss_pred HHHHHHHhhhhhHHHhHHHHHHHHHHhhCC----chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Q psy21 174 ANQERIIDNLRKEQSSLLLELKETKGRSGK----YDTEMNELRAANYRLTERNKELEDQIEYEQSKFARMEAEIKRLQTE 249 (288)
Q Consensus 174 A~~~~~~~~~~~~~~~~~eEl~~~~~~~~~----l~~ei~~L~~~~~~Le~~l~~le~~l~~~q~~i~~le~el~~lr~~ 249 (288)
..+++..+.+.--..-...++...+.+... +....+.|+...+.-...+.+++.++.-..........+-..++.+
T Consensus 485 eERdRl~aeLqlSa~liqqeV~~ArEqgeaE~~~Lse~aqqLE~~Lq~~qe~la~l~~QL~~Ar~~lqes~eea~~lR~E 564 (739)
T PF07111_consen 485 EERDRLDAELQLSARLIQQEVGRAREQGEAERQQLSEVAQQLEQELQEKQESLAELEEQLEAARKSLQESTEEAAELRRE 564 (739)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHHhhhhhhhhhhHHHhhhhhhcc
Q psy21 250 MTEQLKDYRELMEIKINLDVEIAQYRNLMEAE 281 (288)
Q Consensus 250 ~~~ql~eyq~LL~vKl~Le~EIatYrkLLEgE 281 (288)
+.+|...|..=|..|++ ++|--.-.+|-+-|
T Consensus 565 L~~QQ~~y~~alqekvs-evEsrl~E~L~~~E 595 (739)
T PF07111_consen 565 LTQQQEVYERALQEKVS-EVESRLREQLSEME 595 (739)
T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH
No 500
>KOG3335|consensus
Probab=26.85 E-value=71 Score=27.97 Aligned_cols=53 Identities=17% Similarity=0.194 Sum_probs=0.0
Q ss_pred HHHHHHhhhhhHHHhHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHHHHH
Q psy21 175 NQERIIDNLRKEQSSLLLELKETKGRSGKYDTEMNELRAANYRLTERNKELEDQI 229 (288)
Q Consensus 175 ~~~~~~~~~~~~~~~~~eEl~~~~~~~~~l~~ei~~L~~~~~~Le~~l~~le~~l 229 (288)
++..... ...-+....|+.+++.+++.++.+|+++++..+.|-..+.+.+..+
T Consensus 91 ~R~~~~e--~~kee~~~~e~~elr~~~~~l~~~i~~~~~~~~~L~~~l~~~~~el 143 (181)
T KOG3335|consen 91 WRQARKE--RKKEEKRKQEIMELRLKVEKLENAIAELTKFFSQLHSKLNKPESEL 143 (181)
T ss_pred HHhhhcc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccccc
Done!