Query 000473
Match_columns 1471
No_of_seqs 549 out of 4687
Neff 7.1
Searched_HMMs 46136
Date Fri Mar 29 10:01:15 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/000473.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/000473hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0319 WD40-repeat-containing 100.0 6E-36 1.3E-40 355.3 40.3 544 17-764 63-617 (775)
2 KOG0271 Notchless-like WD40 re 100.0 1.8E-35 4E-40 328.2 26.9 360 14-705 113-479 (480)
3 KOG0319 WD40-repeat-containing 100.0 7.8E-33 1.7E-37 328.9 40.0 511 14-708 103-620 (775)
4 KOG0306 WD40-repeat-containing 100.0 9.3E-31 2E-35 310.5 49.0 580 17-765 66-663 (888)
5 KOG0271 Notchless-like WD40 re 100.0 3E-31 6.5E-36 294.8 26.2 196 500-729 238-461 (480)
6 KOG0306 WD40-repeat-containing 100.0 3.9E-28 8.6E-33 288.3 47.3 500 15-708 106-665 (888)
7 KOG0291 WD40-repeat-containing 100.0 1.2E-24 2.5E-29 259.2 53.5 524 20-769 18-553 (893)
8 KOG0272 U4/U6 small nuclear ri 100.0 9.2E-28 2E-32 271.0 22.5 157 508-705 302-458 (459)
9 KOG0291 WD40-repeat-containing 100.0 7.2E-25 1.6E-29 260.9 46.8 424 15-717 144-622 (893)
10 KOG0286 G-protein beta subunit 100.0 5.6E-26 1.2E-30 246.5 32.7 200 506-764 142-343 (343)
11 KOG0272 U4/U6 small nuclear ri 100.0 6.5E-28 1.4E-32 272.2 16.7 242 477-764 189-458 (459)
12 KOG0318 WD40 repeat stress pro 100.0 2.9E-23 6.2E-28 239.7 54.4 533 14-765 57-601 (603)
13 KOG1408 WD40 repeat protein [F 100.0 3.8E-25 8.2E-30 260.1 38.5 604 5-766 58-713 (1080)
14 KOG0318 WD40 repeat stress pro 99.9 2E-23 4.4E-28 240.9 44.8 405 19-707 193-602 (603)
15 KOG1539 WD repeat protein [Gen 99.9 3.1E-23 6.7E-28 249.9 44.7 535 6-772 106-654 (910)
16 KOG0263 Transcription initiati 99.9 5.9E-25 1.3E-29 265.5 21.4 210 501-769 443-652 (707)
17 KOG0295 WD40 repeat-containing 99.9 4.6E-24 1E-28 237.5 24.1 151 527-706 249-405 (406)
18 KOG0273 Beta-transducin family 99.9 9.4E-23 2E-27 233.1 30.4 168 506-707 356-523 (524)
19 KOG0273 Beta-transducin family 99.9 1.6E-22 3.4E-27 231.3 31.2 161 528-720 332-495 (524)
20 KOG0286 G-protein beta subunit 99.9 3E-22 6.5E-27 217.7 31.8 141 527-705 201-343 (343)
21 KOG0315 G-protein beta subunit 99.9 6.1E-22 1.3E-26 210.9 29.9 161 527-725 138-306 (311)
22 KOG0263 Transcription initiati 99.9 2.5E-23 5.3E-28 251.5 19.8 224 500-767 369-608 (707)
23 KOG1063 RNA polymerase II elon 99.9 1.6E-20 3.4E-25 223.1 42.1 522 15-730 53-675 (764)
24 KOG0265 U5 snRNP-specific prot 99.9 1.1E-21 2.3E-26 214.3 29.2 140 528-705 189-336 (338)
25 KOG0276 Vesicle coat complex C 99.9 6.8E-23 1.5E-27 239.7 18.8 231 478-770 28-261 (794)
26 KOG0279 G protein beta subunit 99.9 9.4E-22 2E-26 213.0 23.9 277 478-831 31-310 (315)
27 KOG0281 Beta-TrCP (transducin 99.9 1.4E-23 3E-28 230.9 9.5 221 477-770 209-432 (499)
28 cd00200 WD40 WD40 domain, foun 99.9 2.2E-20 4.8E-25 210.5 35.9 141 527-705 149-289 (289)
29 KOG0296 Angio-associated migra 99.9 1.7E-20 3.6E-25 209.6 32.9 161 14-250 62-222 (399)
30 KOG0279 G protein beta subunit 99.9 1.5E-21 3.2E-26 211.5 22.4 212 502-767 8-223 (315)
31 KOG0295 WD40 repeat-containing 99.9 1.6E-21 3.4E-26 217.5 18.8 213 478-729 165-386 (406)
32 PLN00181 protein SPA1-RELATED; 99.9 1.8E-19 3.9E-24 238.2 41.3 141 527-706 632-792 (793)
33 KOG0266 WD40 repeat-containing 99.9 1E-20 2.2E-25 234.1 27.4 232 478-767 174-410 (456)
34 KOG0276 Vesicle coat complex C 99.9 1.1E-20 2.5E-25 221.2 21.9 191 500-728 88-278 (794)
35 KOG0284 Polyadenylation factor 99.9 8.3E-22 1.8E-26 222.1 11.7 227 477-766 110-337 (464)
36 KOG0293 WD40 repeat-containing 99.9 2.9E-20 6.3E-25 209.3 23.4 84 14-126 222-305 (519)
37 KOG0281 Beta-TrCP (transducin 99.9 6.1E-22 1.3E-26 218.1 9.7 220 478-767 250-478 (499)
38 KOG0315 G-protein beta subunit 99.9 3.2E-20 6.8E-25 198.0 22.1 228 477-767 12-289 (311)
39 KOG0285 Pleiotropic regulator 99.9 8.1E-21 1.8E-25 210.6 18.3 187 500-728 142-328 (460)
40 KOG0285 Pleiotropic regulator 99.9 5.9E-21 1.3E-25 211.7 17.1 224 478-766 166-389 (460)
41 KOG0292 Vesicle coat complex C 99.9 1.6E-19 3.4E-24 217.9 30.4 340 16-709 9-350 (1202)
42 KOG0313 Microtubule binding pr 99.8 9.9E-20 2.2E-24 204.1 24.8 274 431-764 126-416 (423)
43 KOG1539 WD repeat protein [Gen 99.8 2.1E-18 4.6E-23 208.6 37.5 471 19-708 162-649 (910)
44 KOG0316 Conserved WD40 repeat- 99.8 2.8E-20 6.2E-25 196.8 17.4 233 479-776 33-267 (307)
45 KOG0266 WD40 repeat-containing 99.8 1E-19 2.2E-24 225.2 23.5 201 509-768 159-366 (456)
46 KOG0274 Cdc4 and related F-box 99.8 9E-20 2E-24 225.9 21.6 224 477-770 220-445 (537)
47 KOG0265 U5 snRNP-specific prot 99.8 1.3E-19 2.7E-24 198.3 19.5 208 504-769 42-249 (338)
48 KOG0292 Vesicle coat complex C 99.8 9.3E-20 2E-24 219.9 19.3 233 479-774 25-288 (1202)
49 KOG2106 Uncharacterized conser 99.8 7.7E-17 1.7E-21 185.8 41.1 485 14-765 102-625 (626)
50 KOG1408 WD40 repeat protein [F 99.8 2.2E-17 4.8E-22 195.4 36.8 188 478-708 519-714 (1080)
51 KOG0284 Polyadenylation factor 99.8 2.1E-20 4.5E-25 210.9 10.8 217 482-764 157-378 (464)
52 KOG0282 mRNA splicing factor [ 99.8 1.7E-19 3.8E-24 207.6 18.0 162 14-250 212-374 (503)
53 PTZ00421 coronin; Provisional 99.8 3E-18 6.5E-23 212.1 28.5 217 505-767 71-291 (493)
54 KOG0313 Microtubule binding pr 99.8 4.7E-19 1E-23 198.8 18.5 232 477-766 117-376 (423)
55 KOG0274 Cdc4 and related F-box 99.8 1.2E-18 2.6E-23 216.0 22.6 222 477-770 263-486 (537)
56 PTZ00420 coronin; Provisional 99.8 9.8E-18 2.1E-22 208.5 29.0 236 483-766 52-293 (568)
57 KOG0275 Conserved WD40 repeat- 99.8 1.6E-19 3.6E-24 197.1 10.7 234 469-764 222-507 (508)
58 KOG0645 WD40 repeat protein [G 99.8 1.5E-17 3.2E-22 180.0 24.6 206 502-765 7-224 (312)
59 KOG0282 mRNA splicing factor [ 99.8 5.2E-19 1.1E-23 203.7 13.4 225 479-764 231-503 (503)
60 KOG0275 Conserved WD40 repeat- 99.8 9E-20 2E-24 199.1 5.7 166 522-718 220-389 (508)
61 KOG1407 WD40 repeat protein [F 99.8 1.1E-16 2.4E-21 172.4 28.0 144 532-706 166-310 (313)
62 PTZ00420 coronin; Provisional 99.8 2.1E-17 4.5E-22 205.6 25.7 142 560-724 64-214 (568)
63 PTZ00421 coronin; Provisional 99.8 1.6E-17 3.4E-22 205.7 24.5 171 565-769 70-248 (493)
64 cd00200 WD40 WD40 domain, foun 99.8 4.6E-17 1E-21 183.4 25.8 223 479-764 67-289 (289)
65 KOG0277 Peroxisomal targeting 99.8 4.2E-18 9.1E-23 182.5 15.9 190 526-766 74-265 (311)
66 KOG0645 WD40 repeat protein [G 99.8 4.6E-17 1E-21 176.2 23.6 205 478-728 30-242 (312)
67 KOG0277 Peroxisomal targeting 99.8 5.6E-18 1.2E-22 181.5 15.7 209 478-728 76-287 (311)
68 KOG0296 Angio-associated migra 99.8 3.7E-17 8E-22 183.1 22.5 207 501-754 56-262 (399)
69 PLN00181 protein SPA1-RELATED; 99.8 2.7E-16 5.8E-21 208.2 34.2 192 477-714 547-745 (793)
70 KOG0301 Phospholipase A2-activ 99.7 1.3E-17 2.9E-22 198.6 17.1 227 478-772 28-255 (745)
71 KOG0267 Microtubule severing p 99.7 2.3E-18 5E-23 205.4 8.8 222 485-775 14-235 (825)
72 KOG1445 Tumor-specific antigen 99.7 1.2E-15 2.5E-20 178.8 30.3 198 481-708 599-799 (1012)
73 KOG0293 WD40 repeat-containing 99.7 6.1E-17 1.3E-21 182.9 19.1 243 458-767 226-514 (519)
74 KOG0305 Anaphase promoting com 99.7 7E-17 1.5E-21 193.8 20.1 229 477-767 231-462 (484)
75 KOG0283 WD40 repeat-containing 99.7 6.8E-17 1.5E-21 197.8 20.0 214 507-765 265-531 (712)
76 KOG0269 WD40 repeat-containing 99.7 1.4E-17 3.1E-22 200.3 13.7 231 478-762 103-336 (839)
77 KOG0301 Phospholipase A2-activ 99.7 7.3E-17 1.6E-21 192.4 19.1 215 478-766 74-288 (745)
78 KOG0973 Histone transcription 99.7 2.8E-16 6.1E-21 196.6 25.1 239 502-765 62-311 (942)
79 KOG0308 Conserved WD40 repeat- 99.7 1.3E-16 2.8E-21 188.9 20.9 241 429-709 46-287 (735)
80 KOG0264 Nucleosome remodeling 99.7 2E-16 4.4E-21 182.1 19.5 240 479-766 141-404 (422)
81 KOG0288 WD40 repeat protein Ti 99.7 1.3E-16 2.8E-21 180.9 17.1 125 557-705 329-459 (459)
82 KOG1063 RNA polymerase II elon 99.7 2E-14 4.3E-19 171.8 36.2 233 478-768 331-650 (764)
83 KOG0643 Translation initiation 99.7 4.4E-15 9.6E-20 160.5 27.8 143 527-707 161-317 (327)
84 KOG0310 Conserved WD40 repeat- 99.7 1.3E-16 2.8E-21 184.5 17.0 207 504-769 63-271 (487)
85 KOG0288 WD40 repeat protein Ti 99.7 2.2E-16 4.8E-21 179.0 18.3 234 477-763 189-458 (459)
86 KOG0300 WD40 repeat-containing 99.7 1.1E-16 2.3E-21 175.1 14.6 213 500-772 139-392 (481)
87 KOG0269 WD40 repeat-containing 99.7 5.5E-17 1.2E-21 195.3 13.2 195 525-766 100-296 (839)
88 KOG0294 WD40 repeat-containing 99.7 6.4E-16 1.4E-20 170.4 20.0 212 503-766 37-281 (362)
89 KOG0264 Nucleosome remodeling 99.7 2.5E-16 5.4E-21 181.3 17.4 221 507-772 122-353 (422)
90 KOG2048 WD40 repeat protein [G 99.7 1.3E-12 2.9E-17 156.5 48.7 503 17-708 26-549 (691)
91 KOG0316 Conserved WD40 repeat- 99.7 6.2E-16 1.3E-20 164.2 18.3 182 502-729 10-193 (307)
92 KOG0308 Conserved WD40 repeat- 99.7 2.1E-16 4.6E-21 187.1 15.7 234 478-768 40-287 (735)
93 KOG0647 mRNA export protein (c 99.7 1.3E-14 2.8E-19 159.4 26.9 85 12-126 23-108 (347)
94 KOG0267 Microtubule severing p 99.7 7.8E-17 1.7E-21 192.5 8.9 185 503-729 64-248 (825)
95 KOG0640 mRNA cleavage stimulat 99.7 5E-16 1.1E-20 169.9 14.1 223 500-766 103-335 (430)
96 KOG0643 Translation initiation 99.7 5.8E-15 1.3E-19 159.6 21.7 205 505-769 6-223 (327)
97 KOG2106 Uncharacterized conser 99.6 4.1E-12 8.8E-17 147.4 43.8 137 564-728 362-500 (626)
98 KOG0300 WD40 repeat-containing 99.6 3.7E-15 7.9E-20 163.3 17.9 210 478-719 163-399 (481)
99 KOG1332 Vesicle coat complex C 99.6 1.4E-15 3E-20 162.5 13.7 196 525-765 23-240 (299)
100 KOG0278 Serine/threonine kinas 99.6 1.7E-15 3.6E-20 162.0 13.9 220 480-765 76-296 (334)
101 KOG0289 mRNA splicing factor [ 99.6 4.1E-15 9E-20 169.4 17.9 189 485-719 241-431 (506)
102 KOG0289 mRNA splicing factor [ 99.6 4.1E-15 8.9E-20 169.5 16.9 200 511-769 221-422 (506)
103 KOG0302 Ribosome Assembly prot 99.6 2.4E-15 5.2E-20 168.9 14.0 168 500-708 202-379 (440)
104 KOG0973 Histone transcription 99.6 5.4E-15 1.2E-19 185.2 18.5 190 509-728 13-222 (942)
105 KOG0646 WD40 repeat protein [G 99.6 1.1E-14 2.4E-19 168.0 19.0 214 479-728 97-328 (476)
106 KOG1446 Histone H3 (Lys4) meth 99.6 7.2E-14 1.6E-18 155.2 24.8 207 478-730 29-286 (311)
107 KOG0640 mRNA cleavage stimulat 99.6 4.4E-15 9.6E-20 162.6 14.2 177 500-718 163-346 (430)
108 KOG0641 WD40 repeat protein [G 99.6 7.4E-14 1.6E-18 147.1 22.3 216 505-765 85-348 (350)
109 KOG0270 WD40 repeat-containing 99.6 9.1E-15 2E-19 167.7 16.7 205 526-766 193-404 (463)
110 KOG2048 WD40 repeat protein [G 99.6 8.1E-12 1.8E-16 149.9 41.6 109 98-250 33-142 (691)
111 KOG0772 Uncharacterized conser 99.6 4.9E-14 1.1E-18 163.3 21.8 203 478-719 284-503 (641)
112 KOG1407 WD40 repeat protein [F 99.6 1.1E-14 2.4E-19 157.1 15.4 185 505-729 16-282 (313)
113 KOG0302 Ribosome Assembly prot 99.6 2.1E-14 4.5E-19 161.5 17.3 201 527-765 167-377 (440)
114 KOG0646 WD40 repeat protein [G 99.6 4.1E-14 8.8E-19 163.3 20.0 223 482-766 58-307 (476)
115 KOG0641 WD40 repeat protein [G 99.6 1.2E-13 2.6E-18 145.5 21.3 217 508-765 31-302 (350)
116 KOG0294 WD40 repeat-containing 99.6 7E-14 1.5E-18 154.5 20.2 193 478-709 56-283 (362)
117 KOG0310 Conserved WD40 repeat- 99.6 3.4E-14 7.4E-19 164.8 18.2 198 481-728 86-288 (487)
118 KOG0283 WD40 repeat-containing 99.6 6.1E-14 1.3E-18 172.3 20.5 209 500-769 360-579 (712)
119 KOG0299 U3 snoRNP-associated p 99.6 4.8E-14 1E-18 162.4 18.2 167 506-716 199-365 (479)
120 KOG0772 Uncharacterized conser 99.6 7.8E-14 1.7E-18 161.7 18.1 215 504-772 162-400 (641)
121 KOG0305 Anaphase promoting com 99.6 5.7E-14 1.2E-18 168.9 17.8 200 483-729 195-397 (484)
122 KOG2096 WD40 repeat protein [G 99.5 8.6E-13 1.9E-17 145.6 24.1 120 561-705 269-400 (420)
123 KOG0278 Serine/threonine kinas 99.5 1.5E-14 3.3E-19 154.8 9.3 205 505-770 10-217 (334)
124 TIGR03866 PQQ_ABC_repeats PQQ- 99.5 3.8E-11 8.3E-16 139.0 38.1 122 573-718 159-290 (300)
125 KOG1446 Histone H3 (Lys4) meth 99.5 1.7E-12 3.6E-17 144.5 24.9 209 505-770 10-266 (311)
126 KOG0268 Sof1-like rRNA process 99.5 6.4E-14 1.4E-18 156.9 13.7 240 478-767 82-346 (433)
127 KOG0650 WD40 repeat nucleolar 99.5 5.8E-13 1.3E-17 156.8 22.0 143 523-704 574-732 (733)
128 KOG0268 Sof1-like rRNA process 99.5 3.4E-14 7.4E-19 159.0 11.3 201 505-765 62-301 (433)
129 KOG1036 Mitotic spindle checkp 99.5 5.9E-12 1.3E-16 139.5 27.7 95 527-647 191-294 (323)
130 KOG0303 Actin-binding protein 99.5 1E-13 2.2E-18 156.8 13.9 135 564-720 75-216 (472)
131 KOG1274 WD40 repeat protein [G 99.5 1.7E-12 3.7E-17 159.9 24.7 186 477-708 68-263 (933)
132 KOG0647 mRNA export protein (c 99.5 1.1E-12 2.3E-17 144.6 20.2 205 509-765 27-280 (347)
133 KOG4283 Transcription-coupled 99.5 7.1E-13 1.5E-17 144.9 18.4 229 506-765 40-275 (397)
134 TIGR03866 PQQ_ABC_repeats PQQ- 99.5 1.3E-10 2.7E-15 134.7 37.5 89 574-679 210-300 (300)
135 KOG2445 Nuclear pore complex c 99.5 3.5E-12 7.5E-17 140.8 22.4 207 506-765 10-317 (361)
136 KOG1332 Vesicle coat complex C 99.5 3.3E-13 7.1E-18 144.6 13.6 198 478-707 73-286 (299)
137 KOG0321 WD40 repeat-containing 99.5 5.7E-13 1.2E-17 158.2 16.2 165 503-708 94-302 (720)
138 KOG1273 WD40 repeat protein [G 99.5 6.1E-12 1.3E-16 138.9 21.9 238 19-309 26-280 (405)
139 KOG0299 U3 snoRNP-associated p 99.4 1E-12 2.2E-17 151.6 16.2 215 503-766 136-356 (479)
140 KOG1036 Mitotic spindle checkp 99.4 3E-12 6.4E-17 141.8 19.1 199 478-728 28-283 (323)
141 KOG0303 Actin-binding protein 99.4 4.4E-12 9.6E-17 143.7 20.0 174 505-709 77-251 (472)
142 KOG4283 Transcription-coupled 99.4 3.5E-12 7.7E-17 139.5 17.6 168 506-711 98-280 (397)
143 KOG1274 WD40 repeat protein [G 99.4 8.9E-12 1.9E-16 153.8 22.9 229 478-770 28-266 (933)
144 KOG0642 Cell-cycle nuclear pro 99.4 1.2E-12 2.6E-17 154.2 14.8 211 504-765 289-560 (577)
145 KOG4378 Nuclear protein COP1 [ 99.4 2.9E-12 6.3E-17 147.9 17.3 203 482-729 98-303 (673)
146 KOG1273 WD40 repeat protein [G 99.4 8.1E-12 1.8E-16 138.0 19.9 170 523-724 31-297 (405)
147 KOG0639 Transducin-like enhanc 99.4 2.5E-12 5.5E-17 148.5 14.9 207 469-725 474-680 (705)
148 KOG0270 WD40 repeat-containing 99.4 5.9E-12 1.3E-16 144.9 17.5 164 506-708 240-405 (463)
149 KOG2055 WD40 repeat protein [G 99.4 7.9E-11 1.7E-15 136.1 25.7 141 528-707 359-512 (514)
150 KOG0639 Transducin-like enhanc 99.4 1.7E-12 3.8E-17 149.7 12.2 208 479-728 434-642 (705)
151 KOG0307 Vesicle coat complex C 99.4 2.2E-12 4.7E-17 162.7 12.6 198 527-767 82-285 (1049)
152 KOG0642 Cell-cycle nuclear pro 99.4 8.6E-12 1.9E-16 147.2 16.0 211 477-720 308-574 (577)
153 KOG1034 Transcriptional repres 99.3 1.3E-11 2.9E-16 137.2 15.0 102 527-657 107-211 (385)
154 KOG2096 WD40 repeat protein [G 99.3 2.9E-11 6.4E-16 133.7 17.6 207 503-766 80-308 (420)
155 KOG0307 Vesicle coat complex C 99.3 3.4E-12 7.4E-17 161.0 11.7 217 478-731 83-308 (1049)
156 KOG0649 WD40 repeat protein [G 99.3 1E-10 2.2E-15 125.4 19.3 188 503-725 56-255 (325)
157 KOG4378 Nuclear protein COP1 [ 99.3 3.3E-11 7.2E-16 139.4 16.4 237 479-776 51-290 (673)
158 KOG2445 Nuclear pore complex c 99.3 1.8E-10 3.9E-15 127.5 20.9 214 458-707 65-318 (361)
159 KOG1034 Transcriptional repres 99.3 1E-10 2.2E-15 130.3 17.3 125 568-710 87-214 (385)
160 KOG0322 G-protein beta subunit 99.3 1.4E-10 3.1E-15 125.9 17.9 120 507-656 203-322 (323)
161 KOG0649 WD40 repeat protein [G 99.3 2.1E-10 4.6E-15 123.0 18.6 163 527-720 24-199 (325)
162 KOG4328 WD40 protein [Function 99.3 3.7E-11 8E-16 138.7 13.6 165 505-707 182-399 (498)
163 KOG4328 WD40 protein [Function 99.2 1.5E-10 3.3E-15 133.8 17.4 212 483-723 208-466 (498)
164 KOG1963 WD40 repeat protein [G 99.2 6.8E-09 1.5E-13 128.8 32.3 224 22-302 22-316 (792)
165 KOG0321 WD40 repeat-containing 99.2 1.1E-10 2.3E-15 139.3 15.2 153 527-710 66-251 (720)
166 KOG1445 Tumor-specific antigen 99.2 5E-11 1.1E-15 140.6 11.4 148 531-710 599-753 (1012)
167 KOG1009 Chromatin assembly com 99.2 1.1E-10 2.5E-15 133.2 13.4 156 527-711 28-199 (434)
168 KOG2919 Guanine nucleotide-bin 99.2 1.1E-09 2.4E-14 121.8 19.6 217 478-729 126-350 (406)
169 COG2319 FOG: WD40 repeat [Gene 99.2 4.4E-09 9.6E-14 124.4 25.0 198 484-727 133-336 (466)
170 KOG1009 Chromatin assembly com 99.2 2.5E-10 5.4E-15 130.5 13.3 138 570-729 13-175 (434)
171 KOG2919 Guanine nucleotide-bin 99.2 2.1E-09 4.5E-14 119.7 20.0 191 527-769 125-330 (406)
172 COG2319 FOG: WD40 repeat [Gene 99.1 7.3E-09 1.6E-13 122.6 26.0 202 483-728 85-293 (466)
173 KOG1007 WD repeat protein TSSC 99.1 2E-09 4.2E-14 118.3 18.7 188 479-708 138-362 (370)
174 KOG0644 Uncharacterized conser 99.1 3.4E-11 7.3E-16 146.3 5.6 190 502-729 183-405 (1113)
175 KOG1963 WD40 repeat protein [G 99.1 1E-08 2.2E-13 127.3 26.5 168 13-250 202-377 (792)
176 KOG1188 WD40 repeat protein [G 99.1 1.2E-09 2.5E-14 122.8 14.9 190 527-769 42-245 (376)
177 KOG0290 Conserved WD40 repeat- 99.1 2.2E-09 4.7E-14 118.1 15.1 212 507-767 94-319 (364)
178 KOG1538 Uncharacterized conser 99.0 9.2E-08 2E-12 114.2 28.6 278 12-377 8-294 (1081)
179 KOG1517 Guanine nucleotide bin 99.0 2.1E-08 4.5E-13 125.4 23.1 191 477-709 1179-1383 (1387)
180 KOG0322 G-protein beta subunit 99.0 8.6E-10 1.9E-14 119.9 9.8 152 527-706 167-322 (323)
181 KOG1188 WD40 repeat protein [G 99.0 3.2E-09 6.8E-14 119.4 14.2 199 477-716 42-251 (376)
182 KOG2394 WD40 protein DMR-N9 [G 99.0 2.9E-09 6.3E-14 125.1 12.9 138 570-729 219-384 (636)
183 KOG1007 WD repeat protein TSSC 99.0 5.1E-09 1.1E-13 115.1 13.7 164 508-708 122-290 (370)
184 KOG0290 Conserved WD40 repeat- 99.0 6.9E-09 1.5E-13 114.3 14.6 166 508-709 149-320 (364)
185 KOG1587 Cytoplasmic dynein int 99.0 2.9E-08 6.3E-13 123.3 21.8 258 400-708 245-517 (555)
186 KOG0650 WD40 repeat nucleolar 99.0 1.7E-08 3.6E-13 120.0 18.1 167 562-766 392-595 (733)
187 KOG0644 Uncharacterized conser 99.0 2.3E-10 5.1E-15 139.2 2.6 120 561-707 181-300 (1113)
188 KOG1310 WD40 repeat protein [G 98.9 1.7E-09 3.6E-14 126.8 9.2 125 563-708 43-179 (758)
189 KOG0771 Prolactin regulatory e 98.9 3.9E-09 8.5E-14 121.9 11.3 190 20-250 148-356 (398)
190 KOG2394 WD40 protein DMR-N9 [G 98.9 2E-08 4.2E-13 118.4 16.6 152 510-679 220-384 (636)
191 PF08662 eIF2A: Eukaryotic tra 98.9 5.4E-08 1.2E-12 107.2 18.9 124 569-715 58-186 (194)
192 KOG0974 WD-repeat protein WDR6 98.9 2.1E-08 4.5E-13 126.4 15.8 143 527-710 147-291 (967)
193 KOG2055 WD40 repeat protein [G 98.9 7.6E-08 1.7E-12 111.9 18.3 175 510-728 214-394 (514)
194 KOG0771 Prolactin regulatory e 98.8 2.6E-08 5.6E-13 115.3 14.1 169 527-730 158-335 (398)
195 KOG1310 WD40 repeat protein [G 98.8 6.3E-09 1.4E-13 122.1 8.8 166 503-707 44-231 (758)
196 KOG2110 Uncharacterized conser 98.7 3.6E-07 7.8E-12 104.2 18.6 145 527-710 99-251 (391)
197 KOG1272 WD40-repeat-containing 98.7 1.3E-08 2.9E-13 118.0 7.3 202 511-764 211-419 (545)
198 KOG1240 Protein kinase contain 98.7 6.6E-07 1.4E-11 114.1 22.1 237 497-766 1036-1334 (1431)
199 KOG1538 Uncharacterized conser 98.7 6.1E-08 1.3E-12 115.7 11.8 146 556-730 39-187 (1081)
200 KOG4547 WD40 repeat-containing 98.7 3E-07 6.5E-12 110.5 17.8 143 527-708 72-221 (541)
201 KOG1587 Cytoplasmic dynein int 98.7 2.3E-07 4.9E-12 115.6 17.4 206 478-710 258-475 (555)
202 PRK01742 tolB translocation pr 98.7 4.9E-07 1.1E-11 112.0 19.0 164 523-729 255-425 (429)
203 KOG0974 WD-repeat protein WDR6 98.7 2.3E-07 5E-12 117.2 16.0 146 591-769 145-291 (967)
204 KOG1524 WD40 repeat-containing 98.7 7.8E-08 1.7E-12 112.9 10.8 150 503-690 98-247 (737)
205 KOG1523 Actin-related protein 98.7 2.6E-07 5.5E-12 103.6 14.1 159 509-706 10-175 (361)
206 KOG4227 WD40 repeat protein [G 98.6 1.7E-07 3.6E-12 106.3 11.3 125 563-710 49-182 (609)
207 KOG1524 WD40 repeat-containing 98.6 1.2E-07 2.5E-12 111.5 10.3 162 527-730 77-238 (737)
208 KOG1517 Guanine nucleotide bin 98.6 1.3E-06 2.7E-11 109.9 19.7 197 478-709 1079-1289 (1387)
209 KOG1272 WD40-repeat-containing 98.6 5.5E-08 1.2E-12 113.0 6.9 134 560-717 200-333 (545)
210 PF02239 Cytochrom_D1: Cytochr 98.6 0.00018 3.9E-09 87.2 37.0 95 556-666 255-356 (369)
211 KOG4547 WD40 repeat-containing 98.6 2.6E-06 5.6E-11 102.6 19.8 122 591-728 70-193 (541)
212 KOG1523 Actin-related protein 98.6 1.9E-06 4.1E-11 96.8 17.1 192 479-710 26-239 (361)
213 KOG4227 WD40 repeat protein [G 98.5 9.7E-07 2.1E-11 100.3 14.3 173 501-709 48-227 (609)
214 KOG1240 Protein kinase contain 98.5 4.3E-06 9.4E-11 107.0 21.4 153 554-729 1033-1248 (1431)
215 PF08662 eIF2A: Eukaryotic tra 98.5 5.9E-06 1.3E-10 91.1 19.5 136 603-765 39-178 (194)
216 KOG2139 WD40 repeat protein [G 98.5 2.9E-06 6.3E-11 96.4 16.7 182 478-689 113-300 (445)
217 KOG2110 Uncharacterized conser 98.5 0.00012 2.6E-09 84.2 29.4 95 534-658 152-249 (391)
218 KOG2139 WD40 repeat protein [G 98.5 2.6E-06 5.6E-11 96.8 15.9 153 527-719 112-281 (445)
219 KOG2111 Uncharacterized conser 98.5 5.1E-06 1.1E-10 93.5 17.9 177 484-709 74-258 (346)
220 KOG1334 WD40 repeat protein [G 98.5 1.3E-06 2.9E-11 102.3 13.6 112 13-155 139-254 (559)
221 PF02239 Cytochrom_D1: Cytochr 98.4 0.00012 2.5E-09 88.8 30.2 103 601-716 249-356 (369)
222 KOG3881 Uncharacterized conser 98.4 7.8E-06 1.7E-10 94.1 18.0 205 477-714 117-327 (412)
223 PRK01742 tolB translocation pr 98.4 7.1E-06 1.5E-10 101.7 18.7 172 485-713 184-367 (429)
224 KOG2321 WD40 repeat protein [G 98.3 5.3E-06 1.2E-10 98.9 14.7 160 557-747 162-335 (703)
225 KOG1409 Uncharacterized conser 98.3 1E-05 2.2E-10 91.7 16.2 194 478-711 39-274 (404)
226 PRK11028 6-phosphogluconolacto 98.3 8.3E-05 1.8E-09 88.8 23.4 115 572-709 127-260 (330)
227 KOG1912 WD40 repeat protein [G 98.3 0.00012 2.6E-09 90.1 23.6 109 592-715 438-559 (1062)
228 KOG2321 WD40 repeat protein [G 98.3 0.00011 2.3E-09 88.2 22.7 155 508-690 174-334 (703)
229 KOG1334 WD40 repeat protein [G 98.2 2.3E-06 5.1E-11 100.4 8.7 232 478-764 157-464 (559)
230 PRK03629 tolB translocation pr 98.2 6.8E-05 1.5E-09 93.0 21.7 168 523-728 250-426 (429)
231 KOG0280 Uncharacterized conser 98.2 2E-05 4.4E-10 87.8 14.1 148 527-709 135-286 (339)
232 KOG3881 Uncharacterized conser 98.2 7.3E-05 1.6E-09 86.4 18.0 190 527-767 117-321 (412)
233 PRK04922 tolB translocation pr 98.1 0.00013 2.9E-09 90.6 22.0 160 523-720 255-423 (433)
234 PRK11028 6-phosphogluconolacto 98.1 0.00011 2.4E-09 87.7 20.5 116 573-711 177-310 (330)
235 KOG4497 Uncharacterized conser 98.1 0.00015 3.2E-09 81.9 19.5 102 572-690 320-423 (447)
236 KOG1409 Uncharacterized conser 98.1 1.7E-05 3.6E-10 90.1 11.8 101 530-658 170-271 (404)
237 PRK05137 tolB translocation pr 98.1 0.00016 3.5E-09 89.9 21.9 131 561-713 192-328 (435)
238 KOG2111 Uncharacterized conser 98.1 0.00016 3.5E-09 81.7 19.2 100 531-658 155-257 (346)
239 PRK02889 tolB translocation pr 98.1 0.00014 2.9E-09 90.3 20.9 162 523-722 247-417 (427)
240 KOG1064 RAVE (regulator of V-A 98.1 3.5E-06 7.5E-11 110.8 6.9 180 478-717 2223-2408 (2439)
241 PRK04922 tolB translocation pr 98.1 0.00015 3.2E-09 90.2 20.8 125 566-712 243-373 (433)
242 KOG3914 WD repeat protein WDR4 98.1 3.5E-05 7.6E-10 89.4 13.3 109 592-716 123-232 (390)
243 PRK05137 tolB translocation pr 98.1 0.00027 5.8E-09 88.0 22.3 166 505-710 197-369 (435)
244 KOG2695 WD40 repeat protein [G 98.0 1E-05 2.2E-10 91.7 8.1 126 569-718 251-387 (425)
245 PRK02889 tolB translocation pr 98.0 0.00018 3.8E-09 89.3 20.0 165 507-713 193-366 (427)
246 KOG2315 Predicted translation 98.0 0.0068 1.5E-07 73.5 31.5 124 569-714 269-396 (566)
247 PRK03629 tolB translocation pr 98.0 0.0004 8.6E-09 86.2 22.6 122 572-716 244-371 (429)
248 KOG4190 Uncharacterized conser 98.0 1.2E-05 2.6E-10 94.7 8.1 179 503-716 729-915 (1034)
249 PF00400 WD40: WD domain, G-be 98.0 2.5E-05 5.4E-10 62.3 6.7 38 561-607 2-39 (39)
250 PF00400 WD40: WD domain, G-be 97.9 2.7E-05 5.9E-10 62.1 6.2 39 659-705 1-39 (39)
251 KOG1064 RAVE (regulator of V-A 97.9 3.4E-05 7.3E-10 101.9 10.2 147 527-717 2222-2376 (2439)
252 PRK01029 tolB translocation pr 97.8 0.0017 3.7E-08 80.5 22.7 134 561-715 271-411 (428)
253 TIGR02800 propeller_TolB tol-p 97.8 0.0013 2.8E-08 81.1 21.1 124 566-711 229-358 (417)
254 KOG1354 Serine/threonine prote 97.7 0.00023 5E-09 80.9 12.1 134 562-719 155-314 (433)
255 KOG2695 WD40 repeat protein [G 97.7 0.00014 3E-09 82.8 10.1 120 527-669 266-388 (425)
256 KOG2066 Vacuolar assembly/sort 97.7 0.00026 5.5E-09 88.3 13.3 176 523-745 47-225 (846)
257 TIGR02800 propeller_TolB tol-p 97.7 0.002 4.3E-08 79.4 21.4 158 523-718 241-407 (417)
258 KOG0309 Conserved WD40 repeat- 97.7 0.00017 3.7E-09 88.3 10.3 161 572-768 69-234 (1081)
259 PRK00178 tolB translocation pr 97.6 0.0022 4.9E-08 79.6 20.1 129 563-713 191-325 (430)
260 COG5354 Uncharacterized protei 97.6 0.038 8.2E-07 66.5 28.6 97 569-690 314-420 (561)
261 KOG0280 Uncharacterized conser 97.6 0.00095 2.1E-08 74.9 14.4 117 574-712 125-247 (339)
262 PRK00178 tolB translocation pr 97.6 0.0073 1.6E-07 75.0 24.2 120 570-711 242-367 (430)
263 PRK01029 tolB translocation pr 97.5 0.003 6.6E-08 78.3 19.6 125 569-713 229-365 (428)
264 PRK04792 tolB translocation pr 97.5 0.0059 1.3E-07 76.3 22.3 161 523-721 269-438 (448)
265 KOG2041 WD40 repeat protein [G 97.5 0.00025 5.3E-09 86.4 9.5 169 510-708 15-187 (1189)
266 KOG3914 WD repeat protein WDR4 97.5 0.00037 8.1E-09 81.2 10.4 87 564-666 145-232 (390)
267 PRK04792 tolB translocation pr 97.5 0.0042 9.2E-08 77.6 19.8 128 564-713 211-344 (448)
268 KOG4714 Nucleoporin [Nuclear s 97.4 0.00023 4.9E-09 78.5 6.8 94 602-708 160-255 (319)
269 KOG2315 Predicted translation 97.4 0.13 2.9E-06 62.7 29.9 118 527-681 286-412 (566)
270 PF10282 Lactonase: Lactonase, 97.3 0.23 5.1E-06 59.9 31.8 123 571-715 192-332 (345)
271 KOG4190 Uncharacterized conser 97.3 0.00052 1.1E-08 81.4 8.3 173 12-250 731-908 (1034)
272 COG4946 Uncharacterized protei 97.3 0.015 3.2E-07 69.0 19.9 128 567-714 356-484 (668)
273 KOG2041 WD40 repeat protein [G 97.3 0.063 1.4E-06 66.4 25.7 119 563-707 497-619 (1189)
274 KOG4714 Nucleoporin [Nuclear s 97.3 0.00086 1.9E-08 74.1 8.6 74 572-658 181-255 (319)
275 KOG1275 PAB-dependent poly(A) 97.1 0.0025 5.3E-08 80.6 11.6 139 527-705 189-340 (1118)
276 COG5354 Uncharacterized protei 97.1 0.59 1.3E-05 56.8 29.9 124 565-708 269-396 (561)
277 KOG4497 Uncharacterized conser 97.1 0.0087 1.9E-07 68.1 14.0 147 527-712 63-245 (447)
278 PLN02919 haloacid dehalogenase 97.1 0.031 6.6E-07 76.7 22.3 117 572-711 741-892 (1057)
279 KOG4532 WD40-like repeat conta 97.0 0.032 7E-07 62.3 17.8 148 527-715 130-290 (344)
280 KOG1912 WD40 repeat protein [G 97.0 0.013 2.8E-07 73.1 15.9 139 523-680 23-166 (1062)
281 PLN02919 haloacid dehalogenase 97.0 0.081 1.7E-06 72.7 25.6 117 574-713 686-839 (1057)
282 PF04762 IKI3: IKI3 family; I 97.0 2.4 5.2E-05 57.7 38.5 115 570-707 426-564 (928)
283 KOG0882 Cyclophilin-related pe 96.9 0.0015 3.2E-08 76.9 7.0 170 507-716 7-240 (558)
284 KOG1354 Serine/threonine prote 96.9 0.0057 1.2E-07 70.0 11.0 171 506-711 161-363 (433)
285 TIGR03300 assembly_YfgL outer 96.9 0.49 1.1E-05 57.7 29.1 109 591-719 241-350 (377)
286 KOG0309 Conserved WD40 repeat- 96.9 0.0022 4.7E-08 79.0 8.0 127 561-709 105-234 (1081)
287 PF10282 Lactonase: Lactonase, 96.8 0.31 6.8E-06 58.8 26.2 211 18-300 88-313 (345)
288 COG2706 3-carboxymuconate cycl 96.8 0.3 6.5E-06 57.2 23.9 207 20-303 92-315 (346)
289 TIGR02658 TTQ_MADH_Hv methylam 96.8 0.24 5.1E-06 59.6 23.7 76 24-131 53-147 (352)
290 PF11768 DUF3312: Protein of u 96.7 0.0096 2.1E-07 73.2 11.9 77 619-711 257-333 (545)
291 KOG2314 Translation initiation 96.7 0.79 1.7E-05 56.1 26.9 104 570-690 445-558 (698)
292 PF11768 DUF3312: Protein of u 96.7 0.006 1.3E-07 75.0 9.6 83 2-122 249-331 (545)
293 COG4946 Uncharacterized protei 96.6 0.1 2.2E-06 62.2 18.4 146 507-688 357-507 (668)
294 KOG1832 HIV-1 Vpr-binding prot 96.6 0.0018 3.9E-08 80.9 4.2 131 561-718 1092-1226 (1516)
295 COG5170 CDC55 Serine/threonine 96.5 0.014 3E-07 66.0 10.3 161 567-753 169-365 (460)
296 KOG1832 HIV-1 Vpr-binding prot 96.4 0.0014 3.1E-08 81.8 2.0 162 504-716 1096-1264 (1516)
297 PF13360 PQQ_2: PQQ-like domai 96.3 0.13 2.8E-06 58.0 17.0 113 591-719 36-152 (238)
298 TIGR02658 TTQ_MADH_Hv methylam 96.1 0.13 2.8E-06 61.7 16.6 103 601-717 27-146 (352)
299 PRK04043 tolB translocation pr 96.1 0.43 9.2E-06 59.2 21.7 122 572-715 189-317 (419)
300 COG2706 3-carboxymuconate cycl 96.1 2.4 5.2E-05 49.9 25.8 87 37-155 15-108 (346)
301 PF13360 PQQ_2: PQQ-like domai 96.1 0.54 1.2E-05 52.9 20.7 104 591-713 122-236 (238)
302 smart00320 WD40 WD40 repeats. 96.0 0.016 3.4E-07 43.5 5.6 39 659-705 2-40 (40)
303 KOG4532 WD40-like repeat conta 96.0 0.16 3.5E-06 56.9 15.0 121 592-728 129-259 (344)
304 smart00320 WD40 WD40 repeats. 96.0 0.017 3.7E-07 43.3 5.5 38 561-607 3-40 (40)
305 KOG1645 RING-finger-containing 95.9 0.021 4.6E-07 66.8 7.9 94 603-710 175-269 (463)
306 KOG0882 Cyclophilin-related pe 95.8 0.071 1.5E-06 63.3 12.1 208 17-301 10-224 (558)
307 TIGR03300 assembly_YfgL outer 95.8 0.25 5.4E-06 60.2 17.5 107 591-715 105-216 (377)
308 PRK04043 tolB translocation pr 95.8 0.67 1.5E-05 57.5 21.2 140 557-717 220-366 (419)
309 PF08450 SGL: SMP-30/Gluconola 95.7 1.2 2.7E-05 50.8 21.9 132 570-727 85-232 (246)
310 PF15492 Nbas_N: Neuroblastoma 95.6 2.2 4.8E-05 48.9 22.4 129 573-711 46-192 (282)
311 KOG2314 Translation initiation 95.6 0.074 1.6E-06 64.6 11.0 112 574-712 214-339 (698)
312 KOG1275 PAB-dependent poly(A) 95.5 0.17 3.6E-06 64.9 14.1 99 591-706 187-295 (1118)
313 KOG2114 Vacuolar assembly/sort 95.4 0.56 1.2E-05 60.2 18.3 163 527-729 37-224 (933)
314 KOG4640 Anaphase-promoting com 95.4 0.057 1.2E-06 66.8 9.5 78 622-714 21-99 (665)
315 PF07433 DUF1513: Protein of u 95.2 1.7 3.7E-05 50.9 20.2 126 563-716 49-256 (305)
316 KOG1645 RING-finger-containing 95.1 0.029 6.2E-07 65.8 5.7 123 561-707 184-315 (463)
317 PF15492 Nbas_N: Neuroblastoma 94.8 2.3 4.9E-05 48.8 19.2 167 523-716 51-268 (282)
318 KOG2114 Vacuolar assembly/sort 94.6 1.9 4E-05 55.7 19.8 127 570-720 125-257 (933)
319 PF14783 BBS2_Mid: Ciliary BBS 94.5 1.9 4.2E-05 43.0 15.6 100 573-690 2-105 (111)
320 KOG2066 Vacuolar assembly/sort 93.9 1 2.3E-05 57.5 15.6 142 527-708 85-234 (846)
321 KOG4640 Anaphase-promoting com 93.7 0.27 5.8E-06 61.1 9.8 78 572-665 22-100 (665)
322 PF07433 DUF1513: Protein of u 93.5 9.3 0.0002 45.0 21.4 156 16-222 4-171 (305)
323 PF08596 Lgl_C: Lethal giant l 93.2 12 0.00027 46.0 23.1 225 511-771 3-295 (395)
324 KOG1008 Uncharacterized conser 93.0 0.035 7.6E-07 68.4 0.8 166 527-729 72-253 (783)
325 KOG3617 WD40 and TPR repeat-co 92.7 0.18 3.8E-06 63.8 6.3 99 591-707 27-131 (1416)
326 KOG4649 PQQ (pyrrolo-quinoline 92.5 4.1 9E-05 46.0 15.7 109 591-715 23-131 (354)
327 KOG2444 WD40 repeat protein [G 92.4 0.23 4.9E-06 55.1 6.0 103 527-657 72-177 (238)
328 KOG3621 WD40 repeat-containing 92.3 0.51 1.1E-05 59.4 9.4 104 591-708 45-155 (726)
329 PF08553 VID27: VID27 cytoplas 92.3 1.9 4.2E-05 56.7 15.0 131 555-706 509-646 (794)
330 PF08450 SGL: SMP-30/Gluconola 92.2 13 0.00029 42.3 20.7 109 573-707 42-164 (246)
331 KOG4649 PQQ (pyrrolo-quinoline 91.9 17 0.00037 41.3 19.5 93 591-689 63-155 (354)
332 KOG1920 IkappaB kinase complex 91.5 10 0.00022 51.1 20.0 158 510-710 69-276 (1265)
333 PF14783 BBS2_Mid: Ciliary BBS 91.5 11 0.00023 37.9 15.8 91 527-652 17-109 (111)
334 KOG2444 WD40 repeat protein [G 91.4 0.47 1E-05 52.7 6.9 104 591-707 70-177 (238)
335 PF04762 IKI3: IKI3 family; I 91.3 75 0.0016 43.7 54.4 92 6-131 62-160 (928)
336 COG5170 CDC55 Serine/threonine 91.3 0.76 1.7E-05 52.5 8.6 153 506-690 169-358 (460)
337 PRK11138 outer membrane biogen 91.2 3 6.6E-05 51.2 14.9 112 591-714 160-281 (394)
338 PF08596 Lgl_C: Lethal giant l 91.0 9 0.00019 47.2 18.3 183 507-718 84-301 (395)
339 KOG3617 WD40 and TPR repeat-co 90.3 0.47 1E-05 60.2 6.4 70 573-657 62-131 (1416)
340 PF12894 Apc4_WD40: Anaphase-p 89.8 0.84 1.8E-05 38.4 5.5 38 9-46 4-41 (47)
341 PRK11138 outer membrane biogen 89.0 11 0.00023 46.4 17.1 98 591-706 294-393 (394)
342 KOG2079 Vacuolar assembly/sort 88.4 1.5 3.3E-05 57.7 9.1 94 591-690 99-196 (1206)
343 KOG2079 Vacuolar assembly/sort 88.1 1.1 2.3E-05 59.0 7.4 81 640-728 100-181 (1206)
344 KOG1008 Uncharacterized conser 86.5 0.28 6E-06 60.9 1.1 113 572-707 156-275 (783)
345 PF04053 Coatomer_WDAD: Coatom 86.4 9.4 0.0002 47.7 14.3 103 573-707 71-173 (443)
346 PF12894 Apc4_WD40: Anaphase-p 85.8 1.8 4E-05 36.4 5.3 34 669-711 11-44 (47)
347 PF08553 VID27: VID27 cytoplas 85.5 9 0.00019 50.8 13.9 101 526-656 543-646 (794)
348 KOG3621 WD40 repeat-containing 85.0 6.2 0.00013 50.2 11.5 102 527-658 47-155 (726)
349 cd00216 PQQ_DH Dehydrogenases 84.2 21 0.00046 45.3 16.4 119 592-716 111-273 (488)
350 cd00216 PQQ_DH Dehydrogenases 84.0 42 0.00091 42.7 18.9 103 599-715 173-327 (488)
351 KOG1916 Nuclear protein, conta 83.1 0.67 1.5E-05 59.5 2.2 124 591-728 195-352 (1283)
352 PF00780 CNH: CNH domain; Int 83.0 64 0.0014 37.2 18.6 145 527-715 9-173 (275)
353 KOG2395 Protein involved in va 81.8 10 0.00022 47.0 11.1 131 556-706 362-499 (644)
354 PRK02888 nitrous-oxide reducta 81.4 20 0.00043 46.2 14.1 93 600-708 295-405 (635)
355 PRK02888 nitrous-oxide reducta 80.0 24 0.00051 45.5 14.1 106 594-710 239-354 (635)
356 PF02897 Peptidase_S9_N: Proly 79.6 85 0.0019 38.7 19.1 116 573-709 126-262 (414)
357 KOG1916 Nuclear protein, conta 79.5 0.71 1.5E-05 59.3 0.7 133 566-720 128-283 (1283)
358 KOG1920 IkappaB kinase complex 78.8 39 0.00084 46.0 15.7 107 591-709 207-324 (1265)
359 PF11715 Nup160: Nucleoporin N 78.1 12 0.00025 48.3 11.2 31 101-131 229-259 (547)
360 PF06433 Me-amine-dh_H: Methyl 78.0 7.5 0.00016 46.3 8.4 18 114-131 69-86 (342)
361 COG0823 TolB Periplasmic compo 77.0 4.9 0.00011 49.9 7.0 69 1316-1400 243-311 (425)
362 COG0823 TolB Periplasmic compo 76.5 22 0.00047 44.3 12.4 122 575-717 242-368 (425)
363 PF12234 Rav1p_C: RAVE protein 73.1 69 0.0015 41.7 15.7 102 591-706 41-155 (631)
364 PF00930 DPPIV_N: Dipeptidyl p 71.8 8.6 0.00019 46.6 7.2 83 600-690 22-121 (353)
365 COG3391 Uncharacterized conser 71.8 77 0.0017 38.9 15.5 117 574-713 77-196 (381)
366 COG3391 Uncharacterized conser 71.4 1.1E+02 0.0024 37.6 16.7 123 572-714 117-246 (381)
367 PF03178 CPSF_A: CPSF A subuni 70.9 38 0.00083 40.3 12.4 112 568-707 86-202 (321)
368 PF03088 Str_synth: Strictosid 70.5 15 0.00033 35.4 7.0 50 1336-1399 36-86 (89)
369 PRK13616 lipoprotein LpqB; Pro 69.1 33 0.00071 44.7 11.8 103 571-703 350-472 (591)
370 PF05096 Glu_cyclase_2: Glutam 69.0 1.1E+02 0.0023 35.7 14.6 120 572-718 46-168 (264)
371 PF14655 RAB3GAP2_N: Rab3 GTPa 68.9 38 0.00083 41.9 11.8 87 615-717 301-408 (415)
372 PF06977 SdiA-regulated: SdiA- 68.7 88 0.0019 36.1 14.0 112 564-690 15-138 (248)
373 COG3386 Gluconolactonase [Carb 68.4 68 0.0015 38.3 13.4 114 561-690 153-276 (307)
374 PF07676 PD40: WD40-like Beta 68.4 10 0.00022 30.0 4.6 29 1370-1398 8-38 (39)
375 PF04841 Vps16_N: Vps16, N-ter 67.9 1.8E+02 0.0039 36.2 17.6 52 669-728 216-268 (410)
376 KOG4499 Ca2+-binding protein R 67.1 96 0.0021 35.2 13.0 102 574-689 161-274 (310)
377 PF12348 CLASP_N: CLASP N term 65.3 77 0.0017 35.6 12.8 143 1144-1306 53-204 (228)
378 PRK13616 lipoprotein LpqB; Pro 64.3 1.2E+02 0.0026 39.6 15.5 98 573-690 399-516 (591)
379 TIGR03074 PQQ_membr_DH membran 64.2 3E+02 0.0064 37.2 19.4 61 648-716 413-486 (764)
380 PF00780 CNH: CNH domain; Int 63.7 2.8E+02 0.006 31.9 20.0 146 528-720 108-268 (275)
381 COG3386 Gluconolactonase [Carb 62.3 3.4E+02 0.0073 32.5 19.3 108 604-725 144-260 (307)
382 KOG2956 CLIP-associating prote 62.2 39 0.00085 41.6 9.7 76 1221-1298 392-468 (516)
383 PF06433 Me-amine-dh_H: Methyl 61.4 26 0.00056 41.9 8.0 62 650-719 270-332 (342)
384 PF10313 DUF2415: Uncharacteri 61.2 28 0.0006 28.9 5.7 30 671-708 2-34 (43)
385 TIGR03075 PQQ_enz_alc_DH PQQ-d 60.3 1.4E+02 0.0031 38.4 15.1 60 649-717 441-500 (527)
386 PF11715 Nup160: Nucleoporin N 60.1 33 0.00071 44.2 9.6 40 671-718 216-259 (547)
387 PF10313 DUF2415: Uncharacteri 59.8 21 0.00045 29.6 4.8 34 1371-1404 1-36 (43)
388 PF15390 DUF4613: Domain of un 59.4 1.2E+02 0.0027 38.6 13.4 118 571-707 57-186 (671)
389 PF03178 CPSF_A: CPSF A subuni 59.4 3.7E+02 0.0079 32.0 20.2 112 509-657 88-202 (321)
390 KOG4499 Ca2+-binding protein R 58.9 3.2E+02 0.007 31.2 17.7 92 624-728 160-262 (310)
391 KOG2395 Protein involved in va 58.0 1.1E+02 0.0025 38.3 12.6 148 103-303 347-496 (644)
392 TIGR03074 PQQ_membr_DH membran 55.3 1.8E+02 0.0038 39.3 15.1 125 591-716 194-353 (764)
393 PF05694 SBP56: 56kDa selenium 53.5 1E+02 0.0022 38.3 11.2 99 600-708 221-343 (461)
394 PF14761 HPS3_N: Hermansky-Pud 53.5 1.2E+02 0.0027 34.0 11.1 62 1317-1395 22-84 (215)
395 PF14870 PSII_BNR: Photosynthe 52.9 2.6E+02 0.0057 33.3 14.5 120 570-715 144-268 (302)
396 PF02897 Peptidase_S9_N: Proly 52.9 55 0.0012 40.4 9.6 75 1315-1404 128-213 (414)
397 COG3490 Uncharacterized protei 52.8 2.7E+02 0.0059 32.7 13.7 103 599-714 199-317 (366)
398 TIGR03032 conserved hypothetic 52.4 1E+02 0.0022 36.6 10.7 88 101-207 212-299 (335)
399 PF10168 Nup88: Nuclear pore c 52.4 2E+02 0.0044 38.4 14.9 71 623-708 86-180 (717)
400 PF14655 RAB3GAP2_N: Rab3 GTPa 52.1 99 0.0022 38.4 11.2 95 563-666 300-407 (415)
401 PF07569 Hira: TUP1-like enhan 50.4 84 0.0018 35.5 9.6 63 639-710 22-98 (219)
402 PF01731 Arylesterase: Arylest 50.0 63 0.0014 31.0 7.1 49 1336-1400 35-83 (86)
403 PF04841 Vps16_N: Vps16, N-ter 50.0 6E+02 0.013 31.6 19.6 56 621-682 216-272 (410)
404 TIGR02276 beta_rpt_yvtn 40-res 48.5 50 0.0011 26.2 5.5 32 679-717 1-32 (42)
405 COG3490 Uncharacterized protei 48.3 2.9E+02 0.0062 32.5 13.0 86 591-688 80-180 (366)
406 PF05096 Glu_cyclase_2: Glutam 47.5 2.7E+02 0.0059 32.5 13.0 105 99-250 53-159 (264)
407 PF14727 PHTB1_N: PTHB1 N-term 47.0 5.1E+02 0.011 32.4 16.3 54 186-248 151-204 (418)
408 TIGR03075 PQQ_enz_alc_DH PQQ-d 46.5 2.5E+02 0.0055 36.1 14.2 124 591-716 69-198 (527)
409 PF12768 Rax2: Cortical protei 46.4 2.5E+02 0.0054 33.1 12.9 98 601-710 16-126 (281)
410 PF14761 HPS3_N: Hermansky-Pud 46.1 2E+02 0.0044 32.4 11.3 102 592-709 29-165 (215)
411 PF04053 Coatomer_WDAD: Coatom 45.3 55 0.0012 41.0 7.8 80 591-680 117-207 (443)
412 PF10647 Gmad1: Lipoprotein Lp 44.3 3.1E+02 0.0067 31.7 13.3 114 572-706 25-143 (253)
413 PF00930 DPPIV_N: Dipeptidyl p 42.6 52 0.0011 39.8 6.9 107 648-765 22-130 (353)
414 PRK10115 protease 2; Provision 42.3 6.6E+02 0.014 33.6 17.5 112 573-706 129-254 (686)
415 PHA02713 hypothetical protein; 41.9 1.1E+02 0.0023 39.7 10.0 73 639-719 464-545 (557)
416 PF13645 YkuD_2: L,D-transpept 40.9 74 0.0016 34.7 6.8 80 1287-1381 4-84 (176)
417 PF14583 Pectate_lyase22: Olig 40.5 5.9E+02 0.013 31.4 15.0 175 516-720 36-236 (386)
418 COG5167 VID27 Protein involved 39.9 1.4E+02 0.0031 37.2 9.6 60 639-707 573-632 (776)
419 PF14583 Pectate_lyase22: Olig 39.7 2E+02 0.0042 35.4 10.8 151 556-716 16-185 (386)
420 KOG4460 Nuclear pore complex, 38.1 2.8E+02 0.006 35.0 11.6 76 623-712 105-203 (741)
421 PF05694 SBP56: 56kDa selenium 37.4 4.5E+02 0.0097 32.9 13.3 44 111-154 221-265 (461)
422 smart00036 CNH Domain found in 36.5 1E+02 0.0023 36.5 8.0 57 1323-1400 14-72 (302)
423 PF14781 BBS2_N: Ciliary BBSom 36.4 5.3E+02 0.011 27.1 12.6 108 20-155 2-116 (136)
424 KOG3630 Nuclear pore complex, 35.7 1.6E+02 0.0035 40.2 9.8 100 593-706 116-227 (1405)
425 PF07569 Hira: TUP1-like enhan 33.9 2.3E+02 0.0049 32.1 9.7 24 591-614 22-45 (219)
426 PF02985 HEAT: HEAT repeat; I 33.9 42 0.0009 25.4 2.6 24 1006-1029 2-25 (31)
427 PF08728 CRT10: CRT10; InterP 33.1 2.4E+02 0.0052 37.4 10.8 102 591-707 114-246 (717)
428 PF12234 Rav1p_C: RAVE protein 31.8 3.1E+02 0.0067 36.0 11.4 56 642-705 43-102 (631)
429 COG3204 Uncharacterized protei 31.4 5.6E+02 0.012 30.4 12.2 118 567-707 82-210 (316)
430 TIGR02604 Piru_Ver_Nterm putat 30.6 7.9E+02 0.017 29.9 14.5 110 602-719 48-184 (367)
431 KOG1242 Protein containing ada 30.5 1.3E+03 0.029 29.9 17.6 250 984-1318 196-451 (569)
432 KOG2973 Uncharacterized conser 30.1 3E+02 0.0065 32.7 9.7 32 1148-1180 7-42 (353)
433 PF07676 PD40: WD40-like Beta 28.6 1.4E+02 0.003 23.5 5.0 27 17-43 9-38 (39)
434 TIGR02276 beta_rpt_yvtn 40-res 28.4 1.8E+02 0.0038 22.9 5.7 38 639-677 3-41 (42)
435 PF12755 Vac14_Fab1_bd: Vacuol 27.2 1.5E+02 0.0033 29.1 6.0 51 1256-1306 5-55 (97)
436 PRK13684 Ycf48-like protein; P 26.2 7.3E+02 0.016 29.9 13.0 118 570-715 172-295 (334)
437 TIGR02604 Piru_Ver_Nterm putat 25.8 5.2E+02 0.011 31.5 11.7 58 622-686 124-200 (367)
438 PF08728 CRT10: CRT10; InterP 25.7 3.8E+02 0.0082 35.6 10.7 120 527-657 116-246 (717)
439 KOG3630 Nuclear pore complex, 24.9 1.3E+02 0.0027 41.2 6.3 105 569-689 154-263 (1405)
440 PF14500 MMS19_N: Dos2-interac 24.9 4.5E+02 0.0097 30.7 10.3 145 1149-1314 4-156 (262)
441 PF12717 Cnd1: non-SMC mitotic 24.8 5.6E+02 0.012 27.7 10.6 99 1122-1248 8-111 (178)
442 PHA03098 kelch-like protein; P 24.4 5.9E+02 0.013 32.6 12.5 155 527-710 345-514 (534)
443 PF10193 Telomere_reg-2: Telom 24.4 2.3E+02 0.0051 28.6 6.9 69 1233-1301 1-76 (114)
444 PHA02872 EFc gene family prote 24.3 2.4E+02 0.0052 28.1 6.5 96 1281-1398 17-122 (124)
445 KOG2109 WD40 repeat protein [G 24.2 76 0.0016 40.7 4.0 53 1335-1400 293-345 (788)
446 PF10647 Gmad1: Lipoprotein Lp 23.1 1.2E+03 0.026 26.9 17.1 104 572-690 67-186 (253)
447 PF14781 BBS2_N: Ciliary BBSom 22.9 9E+02 0.02 25.4 13.4 112 593-717 12-135 (136)
448 COG4590 ABC-type uncharacteriz 22.6 4.2E+02 0.009 32.9 9.4 33 14-47 218-250 (733)
449 KOG1062 Vesicle coat complex A 21.8 8.7E+02 0.019 32.5 12.5 162 1136-1307 171-379 (866)
450 KOG3616 Selective LIM binding 21.5 1.4E+02 0.003 38.7 5.4 31 17-47 15-45 (1636)
451 KOG1059 Vesicle coat complex A 21.4 4.2E+02 0.0091 34.9 9.5 38 1136-1173 136-173 (877)
452 PF03022 MRJP: Major royal jel 20.8 1.5E+02 0.0033 35.0 5.5 39 1336-1385 87-145 (287)
453 PF06977 SdiA-regulated: SdiA- 20.5 1.4E+03 0.029 26.5 20.1 163 510-716 22-209 (248)
454 COG4590 ABC-type uncharacteriz 20.1 3.9E+02 0.0085 33.1 8.5 108 591-714 280-393 (733)
No 1
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=100.00 E-value=6e-36 Score=355.26 Aligned_cols=544 Identities=15% Similarity=0.139 Sum_probs=364.6
Q ss_pred ceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecc-cccceeEeeeccccccccCccccccccccccccc
Q 000473 17 HRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCG-HSAPIADLSICYPAMVSRDGKAEHWKAENSSNVM 95 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~G-H~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~ 95 (1471)
..||+++++||++.|++.+..+.+++|++.+ + +.+..+-+ |++||--++
T Consensus 63 d~ita~~l~~d~~~L~~a~rs~llrv~~L~t---g--k~irswKa~He~Pvi~ma------------------------- 112 (775)
T KOG0319|consen 63 DEITALALTPDEEVLVTASRSQLLRVWSLPT---G--KLIRSWKAIHEAPVITMA------------------------- 112 (775)
T ss_pred hhhheeeecCCccEEEEeeccceEEEEEccc---c--hHhHhHhhccCCCeEEEE-------------------------
Confidence 5699999999999999999999999999985 3 34444555 999999998
Q ss_pred ccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCC-CeEEEEcceecccCCccccccccccccc
Q 000473 96 GKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSN-PRYVCIGCCFIDTNQLSDHHSFESVEGD 174 (1471)
Q Consensus 96 ~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~-~~ll~~G~~~id~~~~~~~h~~~~i~~~ 174 (1471)
|.|.+..|++|+.||.++|||+..+.|....+-++ |....+.+.+.. .+++.+|..
T Consensus 113 --~~~~g~LlAtggaD~~v~VWdi~~~~~th~fkG~g--GvVssl~F~~~~~~~lL~sg~~------------------- 169 (775)
T KOG0319|consen 113 --FDPTGTLLATGGADGRVKVWDIKNGYCTHSFKGHG--GVVSSLLFHPHWNRWLLASGAT------------------- 169 (775)
T ss_pred --EcCCCceEEeccccceEEEEEeeCCEEEEEecCCC--ceEEEEEeCCccchhheeecCC-------------------
Confidence 57778899999999999999999999999977652 334445555532 356777765
Q ss_pred ccccccCCCCCCCCCceEEEEeCcceEE-EEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCc
Q 000473 175 LVSEDKEVPMKNPPKCTLVIVDTYGLTI-VQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHL 253 (1471)
Q Consensus 175 ~~~~d~~~~~~~~~~~~I~v~D~~t~~~-l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~ 253 (1471)
++.+.+||..+... +.++.. +.+. ++.++|++ |+ ..++.++.|..+-|||+..-+.
T Consensus 170 --------------D~~v~vwnl~~~~tcl~~~~~-H~S~--vtsL~~~~---d~--~~~ls~~RDkvi~vwd~~~~~~- 226 (775)
T KOG0319|consen 170 --------------DGTVRVWNLNDKRTCLHTMIL-HKSA--VTSLAFSE---DS--LELLSVGRDKVIIVWDLVQYKK- 226 (775)
T ss_pred --------------CceEEEEEcccCchHHHHHHh-hhhh--eeeeeecc---CC--ceEEEeccCcEEEEeehhhhhh-
Confidence 48999999886554 333333 3444 89999984 33 3477779999999999954320
Q ss_pred ccccCCCcccCCCcccceeccCCcccCceEEEEecC-----CcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeecCC
Q 000473 254 DREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATC-----GNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEGGS 328 (1471)
Q Consensus 254 ~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~-----g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~~~ 328 (1471)
...-++| +..+.+.+-++ |..+.++++.+ .++.|+... +.. +.....+
T Consensus 227 --l~~lp~y----------------e~~E~vv~l~~~~~~~~~~~~TaG~~g-~~~~~d~es--~~~------~~~~~~~ 279 (775)
T KOG0319|consen 227 --LKTLPLY----------------ESLESVVRLREELGGKGEYIITAGGSG-VVQYWDSES--GKC------VYKQRQS 279 (775)
T ss_pred --hheechh----------------hheeeEEEechhcCCcceEEEEecCCc-eEEEEeccc--chh------hhhhccC
Confidence 1111122 22344444444 55777777766 566677665 111 0001111
Q ss_pred CCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEeec
Q 000473 329 TNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFIQM 408 (1471)
Q Consensus 329 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~~~ 408 (1471)
+.+-+..+..... ...+.+-+.+....+|...... .+....+-.+.-.+..|...
T Consensus 280 ~~~e~~~~~~~~~----------------~~~~l~vtaeQnl~l~d~~~l~---------i~k~ivG~ndEI~Dm~~lG~ 334 (775)
T KOG0319|consen 280 DSEEIDHLLAIES----------------MSQLLLVTAEQNLFLYDEDELT---------IVKQIVGYNDEILDMKFLGP 334 (775)
T ss_pred Cchhhhcceeccc----------------cCceEEEEccceEEEEEccccE---------EehhhcCCchhheeeeecCC
Confidence 1111222222222 1133334444445555221000 00000011111122223322
Q ss_pred ceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccc
Q 000473 409 SLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVP 488 (1471)
Q Consensus 409 ~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~ 488 (1471)
....+.+.+ . .|.++++.++. ..|+ +..|....+-.... -.+| +-+++.+.+.+++
T Consensus 335 e~~~laVAT------N----s~~lr~y~~~~------~~c~-ii~GH~e~vlSL~~----~~~g---~llat~sKD~svi 390 (775)
T KOG0319|consen 335 EESHLAVAT------N----SPELRLYTLPT------SYCQ-IIPGHTEAVLSLDV----WSSG---DLLATGSKDKSVI 390 (775)
T ss_pred ccceEEEEe------C----CCceEEEecCC------CceE-EEeCchhheeeeee----cccC---cEEEEecCCceEE
Confidence 212222221 1 24466664322 2334 33343221110000 0122 1367888899999
Q ss_pred cccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCC---cceEEE
Q 000473 489 RSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNS---HVSRQY 565 (1471)
Q Consensus 489 ~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s---~~~~~~ 565 (1471)
+|++.++...-.+.....+|...|+++++. .-.+..+++++.|+++++|.+.. .. +... ..+..+
T Consensus 391 lWr~~~~~~~~~~~a~~~gH~~svgava~~---~~~asffvsvS~D~tlK~W~l~~--s~-------~~~~~~~~~~~~t 458 (775)
T KOG0319|consen 391 LWRLNNNCSKSLCVAQANGHTNSVGAVAGS---KLGASFFVSVSQDCTLKLWDLPK--SK-------ETAFPIVLTCRYT 458 (775)
T ss_pred EEEecCCcchhhhhhhhcccccccceeeec---ccCccEEEEecCCceEEEecCCC--cc-------cccccceehhhHH
Confidence 999966544445666678999999998862 22345899999999999955441 10 0111 011123
Q ss_pred EecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE
Q 000473 566 FLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 566 l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
-..|...|+|++++|+ ..+++|||.|++.++|++.....+.++.+|+..|+++.|+|. .++++|+
T Consensus 459 ~~aHdKdIN~Vaia~n---------dkLiAT~SqDktaKiW~le~~~l~~vLsGH~RGvw~V~Fs~~------dq~laT~ 523 (775)
T KOG0319|consen 459 ERAHDKDINCVAIAPN---------DKLIATGSQDKTAKIWDLEQLRLLGVLSGHTRGVWCVSFSKN------DQLLATC 523 (775)
T ss_pred HHhhcccccceEecCC---------CceEEecccccceeeecccCceEEEEeeCCccceEEEEeccc------cceeEec
Confidence 3579999999999997 899999999999999999999999999999999999999999 8999999
Q ss_pred eCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceee
Q 000473 646 GEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFD 725 (1471)
Q Consensus 646 s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~ 725 (1471)
|.|++|+||.+.+..|+.+|.||...|..+.|-.+|..|++++.| |-+++|++++++|++++.+|..+|...
T Consensus 524 SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~~~~qliS~~ad--------GliKlWnikt~eC~~tlD~H~DrvWaL 595 (775)
T KOG0319|consen 524 SGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIRNGKQLISAGAD--------GLIKLWNIKTNECEMTLDAHNDRVWAL 595 (775)
T ss_pred cCCceEEEEEeccceeeeeecCccceeEeeeeeeCCcEEEeccCC--------CcEEEEeccchhhhhhhhhccceeEEE
Confidence 999999999999999999999999999999999999999999999 999999999999999999999999987
Q ss_pred eeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 726 HFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 726 ~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
...+.. ..+++-..|+++-.|.
T Consensus 596 ~~~~~~-----------------~~~~tgg~Dg~i~~wk 617 (775)
T KOG0319|consen 596 SVSPLL-----------------DMFVTGGGDGRIIFWK 617 (775)
T ss_pred eecCcc-----------------ceeEecCCCeEEEEee
Confidence 433211 1233344589999986
No 2
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=100.00 E-value=1.8e-35 Score=328.18 Aligned_cols=360 Identities=19% Similarity=0.211 Sum_probs=269.0
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
.+...|-|++|+|||+.||||+.|.++++||+.+ ..|..+..||...|.|++
T Consensus 113 GH~e~Vl~~~fsp~g~~l~tGsGD~TvR~WD~~T-----eTp~~t~KgH~~WVlcva----------------------- 164 (480)
T KOG0271|consen 113 GHGEAVLSVQFSPTGSRLVTGSGDTTVRLWDLDT-----ETPLFTCKGHKNWVLCVA----------------------- 164 (480)
T ss_pred CCCCcEEEEEecCCCceEEecCCCceEEeeccCC-----CCcceeecCCccEEEEEE-----------------------
Confidence 4556799999999999999999999999999996 468889999999999999
Q ss_pred ccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEc-------CCCCeEEEEcceecccCCccccc
Q 000473 94 VMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTL-------PSNPRYVCIGCCFIDTNQLSDHH 166 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~-------s~~~~ll~~G~~~id~~~~~~~h 166 (1471)
++||++.||||+.||+|++||-.+|+++-+ .++ |.-.-|..+ .+.+++++.+.-
T Consensus 165 ----wsPDgk~iASG~~dg~I~lwdpktg~~~g~-~l~---gH~K~It~Lawep~hl~p~~r~las~sk----------- 225 (480)
T KOG0271|consen 165 ----WSPDGKKIASGSKDGSIRLWDPKTGQQIGR-ALR---GHKKWITALAWEPLHLVPPCRRLASSSK----------- 225 (480)
T ss_pred ----ECCCcchhhccccCCeEEEecCCCCCcccc-ccc---CcccceeEEeecccccCCCccceecccC-----------
Confidence 688899999999999999999999987743 333 222222211 145555554432
Q ss_pred ccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEE
Q 000473 167 SFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVP 246 (1471)
Q Consensus 167 ~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~ 246 (1471)
++.|+|||+..++++.++.. +..+ |+|+..- |+ +.++.++.|++|++|+
T Consensus 226 ----------------------Dg~vrIWd~~~~~~~~~lsg-HT~~--VTCvrwG-----G~-gliySgS~DrtIkvw~ 274 (480)
T KOG0271|consen 226 ----------------------DGSVRIWDTKLGTCVRTLSG-HTAS--VTCVRWG-----GE-GLIYSGSQDRTIKVWR 274 (480)
T ss_pred ----------------------CCCEEEEEccCceEEEEecc-Cccc--eEEEEEc-----CC-ceEEecCCCceEEEEE
Confidence 38999999999999998876 5444 8888732 22 6788889999999999
Q ss_pred CCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeec
Q 000473 247 ISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEG 326 (1471)
Q Consensus 247 l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~ 326 (1471)
...+.. + +.+.+|...++.++++++ +.+ ..+. +
T Consensus 275 a~dG~~---------------~---r~lkGHahwvN~lalsTd-----------y~L---Rtga----f----------- 307 (480)
T KOG0271|consen 275 ALDGKL---------------C---RELKGHAHWVNHLALSTD-----------YVL---RTGA----F----------- 307 (480)
T ss_pred ccchhH---------------H---Hhhcccchheeeeeccch-----------hhh---hccc----c-----------
Confidence 876521 1 123334444444443211 111 0000 0
Q ss_pred CCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEe
Q 000473 327 GSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFI 406 (1471)
Q Consensus 327 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~ 406 (1471)
...+ .. |.
T Consensus 308 ------------~~t~----------------~~------------------------------~~-------------- 315 (480)
T KOG0271|consen 308 ------------DHTG----------------RK------------------------------PK-------------- 315 (480)
T ss_pred ------------cccc----------------cc------------------------------CC--------------
Confidence 0000 00 00
Q ss_pred ecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCc
Q 000473 407 QMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDT 486 (1471)
Q Consensus 407 ~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~ 486 (1471)
| .. ..+. +.+.+ |
T Consensus 316 -----------------------~-~s--e~~~---------~Al~r------Y-------------------------- 328 (480)
T KOG0271|consen 316 -----------------------S-FS--EEQK---------KALER------Y-------------------------- 328 (480)
T ss_pred -----------------------C-hH--HHHH---------HHHHH------H--------------------------
Confidence 0 00 0000 00000 0
Q ss_pred cccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE
Q 000473 487 VPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF 566 (1471)
Q Consensus 487 v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l 566 (1471)
+ .+. .-.|.++++|+.|+++.+ |+ ..++.+++..+
T Consensus 329 ----~-------------------~~~--------~~~~erlVSgsDd~tlfl--W~------------p~~~kkpi~rm 363 (480)
T KOG0271|consen 329 ----E-------------------AVL--------KDSGERLVSGSDDFTLFL--WN------------PFKSKKPITRM 363 (480)
T ss_pred ----H-------------------Hhh--------ccCcceeEEecCCceEEE--ec------------ccccccchhhh
Confidence 0 000 011126999999999999 55 12344678888
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
.||..-|+.+.|+|| +++++|+|-|.+|++||..+|+.+.+|++|.++|.+++|+.| .++++||+
T Consensus 364 tgHq~lVn~V~fSPd---------~r~IASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsaD------sRLlVS~S 428 (480)
T KOG0271|consen 364 TGHQALVNHVSFSPD---------GRYIASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSAD------SRLLVSGS 428 (480)
T ss_pred hchhhheeeEEECCC---------ccEEEEeecccceeeeeCCCcchhhhhhhccceeEEEEeccC------ccEEEEcC
Confidence 999999999999998 899999999999999999999999999999999999999999 89999999
Q ss_pred CCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 647 EDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 647 ~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
.|.++++|++++.+....++||.+.|.++.|+|||..+++|+.| ..+++|.
T Consensus 429 kDsTLKvw~V~tkKl~~DLpGh~DEVf~vDwspDG~rV~sggkd--------kv~~lw~ 479 (480)
T KOG0271|consen 429 KDSTLKVWDVRTKKLKQDLPGHADEVFAVDWSPDGQRVASGGKD--------KVLRLWR 479 (480)
T ss_pred CCceEEEEEeeeeeecccCCCCCceEEEEEecCCCceeecCCCc--------eEEEeec
Confidence 99999999999999999999999999999999999999999998 9999994
No 3
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=100.00 E-value=7.8e-33 Score=328.85 Aligned_cols=511 Identities=15% Similarity=0.178 Sum_probs=341.2
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
.++.+|--++|.|.|..|+||+.||.|++||+.. + ...+.|.||.+.|.++.| +|
T Consensus 103 ~He~Pvi~ma~~~~g~LlAtggaD~~v~VWdi~~---~--~~th~fkG~gGvVssl~F-~~------------------- 157 (775)
T KOG0319|consen 103 IHEAPVITMAFDPTGTLLATGGADGRVKVWDIKN---G--YCTHSFKGHGGVVSSLLF-HP------------------- 157 (775)
T ss_pred ccCCCeEEEEEcCCCceEEeccccceEEEEEeeC---C--EEEEEecCCCceEEEEEe-CC-------------------
Confidence 3667899999999999999999999999999983 3 356889999999999985 11
Q ss_pred ccccccCCCCEEEEEeCCCeEEEEEcCCCe-EEEeeeCCCCCCCCcEEEEcCCCC-eEEEEcceecccCCcccccccccc
Q 000473 94 VMGKSSLDNGALISACTDGVLCVWSRSSGH-CRRRRKLPPWVGSPSVICTLPSNP-RYVCIGCCFIDTNQLSDHHSFESV 171 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas~DG~I~VWdv~~G~-ci~~~~l~~~~g~~~~i~~~s~~~-~ll~~G~~~id~~~~~~~h~~~~i 171 (1471)
.+....|++|..|+++++||+.+++ |+.....+ -+...-..+.+++ .+++.|.+
T Consensus 158 -----~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H---~S~vtsL~~~~d~~~~ls~~RD---------------- 213 (775)
T KOG0319|consen 158 -----HWNRWLLASGATDGTVRVWNLNDKRTCLHTMILH---KSAVTSLAFSEDSLELLSVGRD---------------- 213 (775)
T ss_pred -----ccchhheeecCCCceEEEEEcccCchHHHHHHhh---hhheeeeeeccCCceEEEeccC----------------
Confidence 1223679999999999999999654 45555555 3444444555555 56666665
Q ss_pred cccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCC
Q 000473 172 EGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKES 251 (1471)
Q Consensus 172 ~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~ 251 (1471)
..+.+||..+.+.+.++.- ..+ +-.+.+...+.++++..+++++.+|.+++|+.+...
T Consensus 214 ------------------kvi~vwd~~~~~~l~~lp~-ye~---~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~ 271 (775)
T KOG0319|consen 214 ------------------KVIIVWDLVQYKKLKTLPL-YES---LESVVRLREELGGKGEYIITAGGSGVVQYWDSESGK 271 (775)
T ss_pred ------------------cEEEEeehhhhhhhheech-hhh---eeeEEEechhcCCcceEEEEecCCceEEEEecccch
Confidence 7889999988888777765 322 344444433223333567777999999999998764
Q ss_pred CcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeecCCCCc
Q 000473 252 HLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEGGSTNS 331 (1471)
Q Consensus 252 ~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~~~~~~ 331 (1471)
........+ ..| + ...+.+...++.++..+....++ +|... .....++. ..+.
T Consensus 272 ~~~~~~~~~--~~e--~------------~~~~~~~~~~~~l~vtaeQnl~l--~d~~~----l~i~k~iv-----G~nd 324 (775)
T KOG0319|consen 272 CVYKQRQSD--SEE--I------------DHLLAIESMSQLLLVTAEQNLFL--YDEDE----LTIVKQIV-----GYND 324 (775)
T ss_pred hhhhhccCC--chh--h------------hcceeccccCceEEEEccceEEE--EEccc----cEEehhhc-----CCch
Confidence 322111111 000 1 12233334455555545544333 54433 10000000 1111
Q ss_pred eeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEeeccee
Q 000473 332 YVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFIQMSLY 411 (1471)
Q Consensus 332 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~~~~~~ 411 (1471)
-....-|.... ...++|.+..+..++|.++...- . -+ ++..+..+++.-...|..
T Consensus 325 EI~Dm~~lG~e---------------~~~laVATNs~~lr~y~~~~~~c--~-----ii---~GH~e~vlSL~~~~~g~l 379 (775)
T KOG0319|consen 325 EILDMKFLGPE---------------ESHLAVATNSPELRLYTLPTSYC--Q-----II---PGHTEAVLSLDVWSSGDL 379 (775)
T ss_pred hheeeeecCCc---------------cceEEEEeCCCceEEEecCCCce--E-----EE---eCchhheeeeeecccCcE
Confidence 11122233222 23678888899999996653321 1 11 112222233332233334
Q ss_pred eEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCcccccc
Q 000473 412 LLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSE 491 (1471)
Q Consensus 412 L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd 491 (1471)
+.+.. . +..+.+|.+..+ .....|--...|....+-.++. +..|- --+.+.+.+.++++|+
T Consensus 380 lat~s-----K------D~svilWr~~~~--~~~~~~~a~~~gH~~svgava~----~~~~a--sffvsvS~D~tlK~W~ 440 (775)
T KOG0319|consen 380 LATGS-----K------DKSVILWRLNNN--CSKSLCVAQANGHTNSVGAVAG----SKLGA--SFFVSVSQDCTLKLWD 440 (775)
T ss_pred EEEec-----C------CceEEEEEecCC--cchhhhhhhhcccccccceeee----cccCc--cEEEEecCCceEEEec
Confidence 44332 1 234889976111 1111100111111100000000 00110 0266778889999999
Q ss_pred ccCCCCCC-----CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE
Q 000473 492 HVDSRQAG-----DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF 566 (1471)
Q Consensus 492 ~~~~~~~g-----~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l 566 (1471)
+...++.. ++..+...|.+.|+|+++.+... .+++|+.|.+.+|| + .+......+|
T Consensus 441 l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndk----LiAT~SqDktaKiW--~-------------le~~~l~~vL 501 (775)
T KOG0319|consen 441 LPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDK----LIATGSQDKTAKIW--D-------------LEQLRLLGVL 501 (775)
T ss_pred CCCcccccccceehhhHHHHhhcccccceEecCCCc----eEEecccccceeee--c-------------ccCceEEEEe
Confidence 98743322 33445678999999999665555 89999999999993 3 3455788999
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
.||+..|+|+.|+|. .+.++|+|.|+||++|.+.+..|+.+|.||...|....|-.+ +..|+|++
T Consensus 502 sGH~RGvw~V~Fs~~---------dq~laT~SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~~------~~qliS~~ 566 (775)
T KOG0319|consen 502 SGHTRGVWCVSFSKN---------DQLLATCSGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIRN------GKQLISAG 566 (775)
T ss_pred eCCccceEEEEeccc---------cceeEeccCCceEEEEEeccceeeeeecCccceeEeeeeeeC------CcEEEecc
Confidence 999999999999997 899999999999999999999999999999999999999988 99999999
Q ss_pred CCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 647 EDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 647 ~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.||-++||+++++.|++++.+|.+.|+++.-+|.+.++++|+.| |.|.+|.=-|
T Consensus 567 adGliKlWnikt~eC~~tlD~H~DrvWaL~~~~~~~~~~tgg~D--------g~i~~wkD~T 620 (775)
T KOG0319|consen 567 ADGLIKLWNIKTNECEMTLDAHNDRVWALSVSPLLDMFVTGGGD--------GRIIFWKDVT 620 (775)
T ss_pred CCCcEEEEeccchhhhhhhhhccceeEEEeecCccceeEecCCC--------eEEEEeecCc
Confidence 99999999999999999999999999999999999999999998 9999996433
No 4
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=100.00 E-value=9.3e-31 Score=310.49 Aligned_cols=580 Identities=16% Similarity=0.121 Sum_probs=374.9
Q ss_pred ceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccc
Q 000473 17 HRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMG 96 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~ 96 (1471)
..|||+.-+||...||.|.+||.|++|++.+ . .+..++.||+++|+++.
T Consensus 66 ~evt~l~~~~d~l~lAVGYaDGsVqif~~~s---~--~~~~tfngHK~AVt~l~-------------------------- 114 (888)
T KOG0306|consen 66 AEVTCLRSSDDILLLAVGYADGSVQIFSLES---E--EILITFNGHKAAVTTLK-------------------------- 114 (888)
T ss_pred ceEEEeeccCCcceEEEEecCceEEeeccCC---C--ceeeeecccccceEEEE--------------------------
Confidence 4699999999999999999999999999984 2 46678889999999998
Q ss_pred cccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccccccc
Q 000473 97 KSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLV 176 (1471)
Q Consensus 97 ~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~ 176 (1471)
+...|.+|+|||.|+.|.|||+..-.-+.+ +.++-.+-+...+..++..+++++.+
T Consensus 115 -fd~~G~rlaSGskDt~IIvwDlV~E~Gl~r--L~GHkd~iT~~~F~~~~~~lvS~sKD--------------------- 170 (888)
T KOG0306|consen 115 -FDKIGTRLASGSKDTDIIVWDLVGEEGLFR--LRGHKDSITQALFLNGDSFLVSVSKD--------------------- 170 (888)
T ss_pred -EcccCceEeecCCCccEEEEEeccceeeEE--eecchHHHhHHhccCCCeEEEEeccC---------------------
Confidence 567788999999999999999975443433 33111333444566678888888876
Q ss_pred ccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCcccc
Q 000473 177 SEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDRE 256 (1471)
Q Consensus 177 ~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~~ 256 (1471)
..|.+||..+..+..+... +. +.+..+++.. ..+++++.|+.++||++.-.. .+
T Consensus 171 -------------s~iK~WdL~tqhCf~Thvd-~r--~Eiw~l~~~~-------~~lvt~~~dse~~v~~L~~~~---D~ 224 (888)
T KOG0306|consen 171 -------------SMIKFWDLETQHCFETHVD-HR--GEIWALVLDE-------KLLVTAGTDSELKVWELAFED---DE 224 (888)
T ss_pred -------------ceEEEEecccceeeeEEec-cc--ceEEEEEEec-------ceEEEEecCCceEEEEeeccc---cc
Confidence 7899999999999988776 22 3378888761 236666999999999995441 11
Q ss_pred cCCCcccCCCcccceeccCCcccCceEEEEecC--CcEEEEEeCCeEE--EEEcCCCcceeee----eeecceeEeecCC
Q 000473 257 EGNGLCKSSSQLDMAILQNGVVEGGHLVSVATC--GNIIALVLKDHCI--FRLLGSGSTIGEI----CFVDNLFCLEGGS 328 (1471)
Q Consensus 257 ~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~--g~~l~~~~~~~~~--~~l~d~~~~ige~----~~~~~~l~~~~~~ 328 (1471)
.....+..-++.+.+..+ +.++.+.+..+ ++++++.+.+..+ +++.......+.. .........+...
T Consensus 225 ~~~~~~~s~~~~G~~~rq----sk~R~i~l~~d~s~r~~~c~g~d~~~e~frI~s~~E~~k~l~Kk~k~~Kkka~t~e~~ 300 (888)
T KOG0306|consen 225 KETNRYISTKLRGTFIRQ----SKGREINLVTDFSDRFLVCQGADKVIELFRIRSKEEIAKILSKKLKRAKKKAETEENE 300 (888)
T ss_pred ccccccceeeccceeeec----cCCceeEEeecCcccEEEEecchhhhhheeecCHHHHHHHHHHHHHHhhhhccccccc
Confidence 111111111111111111 12344444444 6777776665431 2222221100000 0000000000000
Q ss_pred ---CCc-----eeeeeEeecchhhhhhcccc-cccccccceEEEEcCCCcEEEEEeecCCCCCcc-cCeeeecCccCCCC
Q 000473 329 ---TNS-----YVIGAMFLERVVAEKIENTM-GVCTTFYENFAVWDNRGSAIVYAISYMNEKFDY-EPHFEIPAVSYPSG 398 (1471)
Q Consensus 329 ---~~~-----~~~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~-~~~~~ip~v~~~~~ 398 (1471)
... .-+..+.... .+.... -+.+... +..|--.+..++.|.++........ ...-.+ ...+.+.
T Consensus 301 ~~v~~sl~~~i~r~~~ir~~~----kiks~dv~~~~~~~-~~lv~l~nNtv~~ysl~~s~~~~p~~~~~~~i-~~~GHR~ 374 (888)
T KOG0306|consen 301 DDVEKSLSDEIKRLETIRTSA----KIKSFDVTPSGGTE-NTLVLLANNTVEWYSLENSGKTSPEADRTSNI-EIGGHRS 374 (888)
T ss_pred cchhhhHHHHHHHHHheechh----heeEEEEEecCCcc-eeEEEeecCceEEEEeccCCCCCcccccccee-eeccchh
Confidence 000 0000000000 000000 0011111 2333355667778888763322110 000011 1234555
Q ss_pred ceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCccccee
Q 000473 399 VKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKS 478 (1471)
Q Consensus 399 ~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l 478 (1471)
+...+.++.++..+.+... + .+.+|... ..+..+.+.++.....-+++. |- .+
T Consensus 375 dVRsl~vS~d~~~~~Sga~------~------SikiWn~~-----t~kciRTi~~~y~l~~~Fvpg------d~----~I 427 (888)
T KOG0306|consen 375 DVRSLCVSSDSILLASGAG------E------SIKIWNRD-----TLKCIRTITCGYILASKFVPG------DR----YI 427 (888)
T ss_pred heeEEEeecCceeeeecCC------C------cEEEEEcc-----CcceeEEeccccEEEEEecCC------Cc----eE
Confidence 6566667777666655421 2 37888632 122234555552222222221 11 35
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
.....+|.+.++|.... .+..+...|.+.+.++...+++. .+++|+.|.+|++|++.+.... ++..-.+-
T Consensus 428 v~G~k~Gel~vfdlaS~----~l~Eti~AHdgaIWsi~~~pD~~----g~vT~saDktVkfWdf~l~~~~--~gt~~k~l 497 (888)
T KOG0306|consen 428 VLGTKNGELQVFDLASA----SLVETIRAHDGAIWSISLSPDNK----GFVTGSADKTVKFWDFKLVVSV--PGTQKKVL 497 (888)
T ss_pred EEeccCCceEEEEeehh----hhhhhhhccccceeeeeecCCCC----ceEEecCCcEEEEEeEEEEecc--Ccccceee
Confidence 55667778888887653 45667789999999999888887 8999999999999766543221 11000000
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+-+..++| .-.+.|.|+.++|| +++|+.+=-|.+|++|-+++-+..-.+.||.-||.++..+||
T Consensus 498 sl~~~rtL-el~ddvL~v~~Spd---------gk~LaVsLLdnTVkVyflDtlKFflsLYGHkLPV~smDIS~D------ 561 (888)
T KOG0306|consen 498 SLKHTRTL-ELEDDVLCVSVSPD---------GKLLAVSLLDNTVKVYFLDTLKFFLSLYGHKLPVLSMDISPD------ 561 (888)
T ss_pred eeccceEE-eccccEEEEEEcCC---------CcEEEEEeccCeEEEEEecceeeeeeecccccceeEEeccCC------
Confidence 00111122 23567999999998 899999999999999999999999999999999999999999
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
+..++|||.|++|++|-++-|.|.+.+.+|.+.|++|.|-|...++.+++.| +.|+-||.+..++++++.||
T Consensus 562 SklivTgSADKnVKiWGLdFGDCHKS~fAHdDSvm~V~F~P~~~~FFt~gKD--------~kvKqWDg~kFe~iq~L~~H 633 (888)
T KOG0306|consen 562 SKLIVTGSADKNVKIWGLDFGDCHKSFFAHDDSVMSVQFLPKTHLFFTCGKD--------GKVKQWDGEKFEEIQKLDGH 633 (888)
T ss_pred cCeEEeccCCCceEEeccccchhhhhhhcccCceeEEEEcccceeEEEecCc--------ceEEeechhhhhhheeeccc
Confidence 9999999999999999999999999999999999999999999999999999 99999999999999999999
Q ss_pred CCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 719 ASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 719 ~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
...+.+.... .+|++++ +.+.|.++|.|.-
T Consensus 634 ~~ev~cLav~------------~~G~~vv-----s~shD~sIRlwE~ 663 (888)
T KOG0306|consen 634 HSEVWCLAVS------------PNGSFVV-----SSSHDKSIRLWER 663 (888)
T ss_pred hheeeeeEEc------------CCCCeEE-----eccCCceeEeeec
Confidence 9988887444 2344433 4455999999963
No 5
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.98 E-value=3e-31 Score=294.81 Aligned_cols=196 Identities=25% Similarity=0.224 Sum_probs=168.4
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEe
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAH 579 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~s 579 (1471)
.+...+.+|...|+|+..-.+. .+.+|+.|++|++|+ ...|++.+.|+||...|+.++.+
T Consensus 238 ~~~~~lsgHT~~VTCvrwGG~g-----liySgS~DrtIkvw~---------------a~dG~~~r~lkGHahwvN~lals 297 (480)
T KOG0271|consen 238 TCVRTLSGHTASVTCVRWGGEG-----LIYSGSQDRTIKVWR---------------ALDGKLCRELKGHAHWVNHLALS 297 (480)
T ss_pred eEEEEeccCccceEEEEEcCCc-----eEEecCCCceEEEEE---------------ccchhHHHhhcccchheeeeecc
Confidence 3444566788889988744333 699999999999933 34578899999999999999988
Q ss_pred cCCCCc------ccCc---------------------CCCEEEEEECCCcEEEEECC-CCceEEEEeccCCCEEEEEECC
Q 000473 580 RMVGTA------KGWS---------------------FNEVLVSGSMDCSIRIWDLG-SGNLITVMHHHVAPVRQIILSP 631 (1471)
Q Consensus 580 pd~~~~------~~~~---------------------~~~~L~SGs~DgtI~lWDl~-tg~~l~~~~~H~~~V~~l~fsp 631 (1471)
.|..-. .+++ .++.|+||+.|+++.+|+-. +.+++.++.+|..-|..+.|+|
T Consensus 298 Tdy~LRtgaf~~t~~~~~~~se~~~~Al~rY~~~~~~~~erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~lVn~V~fSP 377 (480)
T KOG0271|consen 298 TDYVLRTGAFDHTGRKPKSFSEEQKKALERYEAVLKDSGERLVSGSDDFTLFLWNPFKSKKPITRMTGHQALVNHVSFSP 377 (480)
T ss_pred chhhhhccccccccccCCChHHHHHHHHHHHHHhhccCcceeEEecCCceEEEecccccccchhhhhchhhheeeEEECC
Confidence 543100 0111 35679999999999999975 4568899999999999999999
Q ss_pred CCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 632 PQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
| +.+++|++-|++|+|||-++|+.+..|+||-+.|+.|+|+.|.++|++|+.| .+++|||+++.++
T Consensus 378 d------~r~IASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsaDsRLlVS~SkD--------sTLKvw~V~tkKl 443 (480)
T KOG0271|consen 378 D------GRYIASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSADSRLLVSGSKD--------STLKVWDVRTKKL 443 (480)
T ss_pred C------ccEEEEeecccceeeeeCCCcchhhhhhhccceeEEEEeccCccEEEEcCCC--------ceEEEEEeeeeee
Confidence 9 9999999999999999999999999999999999999999999999999998 9999999999999
Q ss_pred EEEEeCCCCCceeeeeee
Q 000473 712 ERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 712 ~~~l~gH~~~v~~~~~~~ 729 (1471)
...|.||...|..+++.+
T Consensus 444 ~~DLpGh~DEVf~vDwsp 461 (480)
T KOG0271|consen 444 KQDLPGHADEVFAVDWSP 461 (480)
T ss_pred cccCCCCCceEEEEEecC
Confidence 999999999999998884
No 6
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.97 E-value=3.9e-28 Score=288.34 Aligned_cols=500 Identities=15% Similarity=0.174 Sum_probs=321.8
Q ss_pred CCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccc
Q 000473 15 PSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNV 94 (1471)
Q Consensus 15 p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~ 94 (1471)
+...||++.|..+|..|||||.|+.|++||+-. ..-+..|.||...||..-
T Consensus 106 HK~AVt~l~fd~~G~rlaSGskDt~IIvwDlV~-----E~Gl~rL~GHkd~iT~~~------------------------ 156 (888)
T KOG0306|consen 106 HKAAVTTLKFDKIGTRLASGSKDTDIIVWDLVG-----EEGLFRLRGHKDSITQAL------------------------ 156 (888)
T ss_pred cccceEEEEEcccCceEeecCCCccEEEEEecc-----ceeeEEeecchHHHhHHh------------------------
Confidence 445699999999999999999999999999972 234567899999999886
Q ss_pred cccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCC-CeEEEEcce-ecccCCcccc--ccccc
Q 000473 95 MGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSN-PRYVCIGCC-FIDTNQLSDH--HSFES 170 (1471)
Q Consensus 95 ~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~-~~ll~~G~~-~id~~~~~~~--h~~~~ 170 (1471)
|..+..+|+|.|.|+.|++||+.+.+|..+.--+ ...+-.+.-+ .++++.|.. -+..|.+... ..+ +
T Consensus 157 ---F~~~~~~lvS~sKDs~iK~WdL~tqhCf~Thvd~-----r~Eiw~l~~~~~~lvt~~~dse~~v~~L~~~~D~~~-~ 227 (888)
T KOG0306|consen 157 ---FLNGDSFLVSVSKDSMIKFWDLETQHCFETHVDH-----RGEIWALVLDEKLLVTAGTDSELKVWELAFEDDEKE-T 227 (888)
T ss_pred ---ccCCCeEEEEeccCceEEEEecccceeeeEEecc-----cceEEEEEEecceEEEEecCCceEEEEeeccccccc-c
Confidence 4456788999999999999999999999874222 2333333312 344444431 1222222100 000 0
Q ss_pred ccccccccccCC-CCCCCCCceEEEEeCcceEEE-----------EEeec------------------------------
Q 000473 171 VEGDLVSEDKEV-PMKNPPKCTLVIVDTYGLTIV-----------QTVFH------------------------------ 208 (1471)
Q Consensus 171 i~~~~~~~d~~~-~~~~~~~~~I~v~D~~t~~~l-----------~tl~s------------------------------ 208 (1471)
.... ...... ..+.+....+...-..+.+.+ +.+++
T Consensus 228 ~~~~--s~~~~G~~~rqsk~R~i~l~~d~s~r~~~c~g~d~~~e~frI~s~~E~~k~l~Kk~k~~Kkka~t~e~~~~v~~ 305 (888)
T KOG0306|consen 228 NRYI--STKLRGTFIRQSKGREINLVTDFSDRFLVCQGADKVIELFRIRSKEEIAKILSKKLKRAKKKAETEENEDDVEK 305 (888)
T ss_pred cccc--eeeccceeeeccCCceeEEeecCcccEEEEecchhhhhheeecCHHHHHHHHHHHHHHhhhhccccccccchhh
Confidence 0000 000000 000000001111111111110 00000
Q ss_pred ----------CccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCcccccCCCcccCCCcccceeccCCcc
Q 000473 209 ----------GNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVV 278 (1471)
Q Consensus 209 ----------~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~ 278 (1471)
.-.....|..+++.+++ +....++.-+++++.++.++... +..+++. .+.. ....+|-
T Consensus 306 sl~~~i~r~~~ir~~~kiks~dv~~~~---~~~~~lv~l~nNtv~~ysl~~s~---~~~p~~~-----~~~~-i~~~GHR 373 (888)
T KOG0306|consen 306 SLSDEIKRLETIRTSAKIKSFDVTPSG---GTENTLVLLANNTVEWYSLENSG---KTSPEAD-----RTSN-IEIGGHR 373 (888)
T ss_pred hHHHHHHHHHheechhheeEEEEEecC---CcceeEEEeecCceEEEEeccCC---CCCcccc-----ccce-eeeccch
Confidence 00012357788888643 22335555788999999998731 1111111 1111 1234588
Q ss_pred cCceEEEEecCCcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeecCCCCceeeeeEeecchhhhhhccccccccccc
Q 000473 279 EGGHLVSVATCGNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFY 358 (1471)
Q Consensus 279 ~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 358 (1471)
.+++.+++|.+...+++++++..++|..++...+..+ +++ +.++..|... +
T Consensus 374 ~dVRsl~vS~d~~~~~Sga~~SikiWn~~t~kciRTi------------~~~-y~l~~~Fvpg----------------d 424 (888)
T KOG0306|consen 374 SDVRSLCVSSDSILLASGAGESIKIWNRDTLKCIRTI------------TCG-YILASKFVPG----------------D 424 (888)
T ss_pred hheeEEEeecCceeeeecCCCcEEEEEccCcceeEEe------------ccc-cEEEEEecCC----------------C
Confidence 8899999999999999999999888776666544333 111 5566677644 3
Q ss_pred ceEEEEcCCCcEEEEEeecCCCC----CcccCeeeecCccCCCCceeeEEEeecceeeEEeeeeeccccccccccCeeEE
Q 000473 359 ENFAVWDNRGSAIVYAISYMNEK----FDYEPHFEIPAVSYPSGVKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISV 434 (1471)
Q Consensus 359 ~~~~vw~~~G~~~vy~l~~~~~~----~~~~~~~~ip~v~~~~~~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~v 434 (1471)
..++++..+|...+|.+...... .+...+|.+ ...+++...+.... +.++++
T Consensus 425 ~~Iv~G~k~Gel~vfdlaS~~l~Eti~AHdgaIWsi-------------~~~pD~~g~vT~sa-----------DktVkf 480 (888)
T KOG0306|consen 425 RYIVLGTKNGELQVFDLASASLVETIRAHDGAIWSI-------------SLSPDNKGFVTGSA-----------DKTVKF 480 (888)
T ss_pred ceEEEeccCCceEEEEeehhhhhhhhhccccceeee-------------eecCCCCceEEecC-----------CcEEEE
Confidence 47889999999999988533211 111122211 11122222222110 123444
Q ss_pred EEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccccccccCCCCCCCccccccccCccEEE
Q 000473 435 WSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSS 514 (1471)
Q Consensus 435 wsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts 514 (1471)
|+...-.+..+.+.|. +++- .+..-.-+..|.|
T Consensus 481 Wdf~l~~~~~gt~~k~------------------------------------lsl~-----------~~rtLel~ddvL~ 513 (888)
T KOG0306|consen 481 WDFKLVVSVPGTQKKV------------------------------------LSLK-----------HTRTLELEDDVLC 513 (888)
T ss_pred EeEEEEeccCccccee------------------------------------eeec-----------cceEEeccccEEE
Confidence 4321000000000000 0000 0001122456888
Q ss_pred EEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEE
Q 000473 515 SMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVL 594 (1471)
Q Consensus 515 ~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L 594 (1471)
+.+.|+.. .++.|--|.++.|+..| +.+..-.|+||.-+|.|+.++|| +..+
T Consensus 514 v~~Spdgk----~LaVsLLdnTVkVyflD---------------tlKFflsLYGHkLPV~smDIS~D---------Skli 565 (888)
T KOG0306|consen 514 VSVSPDGK----LLAVSLLDNTVKVYFLD---------------TLKFFLSLYGHKLPVLSMDISPD---------SKLI 565 (888)
T ss_pred EEEcCCCc----EEEEEeccCeEEEEEec---------------ceeeeeeecccccceeEEeccCC---------cCeE
Confidence 88555555 69999999999997766 34667789999999999999998 8999
Q ss_pred EEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEE
Q 000473 595 VSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAK 674 (1471)
Q Consensus 595 ~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~ 674 (1471)
+|||.|.+|++|-++-|.|...|.+|...|.++.|.|. .+.|.++|.|+.|+-||-+..++++.+.+|...|+|
T Consensus 566 vTgSADKnVKiWGLdFGDCHKS~fAHdDSvm~V~F~P~------~~~FFt~gKD~kvKqWDg~kFe~iq~L~~H~~ev~c 639 (888)
T KOG0306|consen 566 VTGSADKNVKIWGLDFGDCHKSFFAHDDSVMSVQFLPK------THLFFTCGKDGKVKQWDGEKFEEIQKLDGHHSEVWC 639 (888)
T ss_pred EeccCCCceEEeccccchhhhhhhcccCceeEEEEccc------ceeEEEecCcceEEeechhhhhhheeeccchheeee
Confidence 99999999999999999999999999999999999998 889999999999999999999999999999999999
Q ss_pred EEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 675 VVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 675 v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
++.+|+|.|+++++.| .+|++|....
T Consensus 640 Lav~~~G~~vvs~shD--------~sIRlwE~td 665 (888)
T KOG0306|consen 640 LAVSPNGSFVVSSSHD--------KSIRLWERTD 665 (888)
T ss_pred eEEcCCCCeEEeccCC--------ceeEeeeccC
Confidence 9999999999999999 9999998644
No 7
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.96 E-value=1.2e-24 Score=259.15 Aligned_cols=524 Identities=15% Similarity=0.125 Sum_probs=320.1
Q ss_pred EEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccccccccc
Q 000473 20 TATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMGKSS 99 (1471)
Q Consensus 20 tava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~~~s 99 (1471)
--+.|++||..+++- -.+.|-++|+.. -+....-+.+...|++++ .|
T Consensus 18 Gnl~ft~dG~sviSP-vGNrvsv~dLkn-----N~S~Tl~~e~~~NI~~ia---------------------------lS 64 (893)
T KOG0291|consen 18 GNLVFTKDGNSVISP-VGNRVSVFDLKN-----NKSYTLPLETRYNITRIA---------------------------LS 64 (893)
T ss_pred CcEEECCCCCEEEec-cCCEEEEEEccC-----CcceeEEeecCCceEEEE---------------------------eC
Confidence 357899999888764 346799999984 233444557888999998 78
Q ss_pred CCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccccccccc
Q 000473 100 LDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLVSED 179 (1471)
Q Consensus 100 ~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~~~d 179 (1471)
|++.+|++.-++|...+-++.....++....- .+....-++|+++++++|+.
T Consensus 65 p~g~lllavdE~g~~~lvs~~~r~Vlh~f~fk----~~v~~i~fSPng~~fav~~g------------------------ 116 (893)
T KOG0291|consen 65 PDGTLLLAVDERGRALLVSLLSRSVLHRFNFK----RGVGAIKFSPNGKFFAVGCG------------------------ 116 (893)
T ss_pred CCceEEEEEcCCCcEEEEecccceeeEEEeec----CccceEEECCCCcEEEEEec------------------------
Confidence 99999999999999999998777777665543 23455678899999988875
Q ss_pred cCCCCCCCCCceEEEEeCcce-------EEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCC
Q 000473 180 KEVPMKNPPKCTLVIVDTYGL-------TIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESH 252 (1471)
Q Consensus 180 ~~~~~~~~~~~~I~v~D~~t~-------~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~ 252 (1471)
.-+.||..... -.+...+-|+ -+.|..++++.|. ..+++|+.|-.++||.++..++
T Consensus 117 ----------n~lqiw~~P~~~~~~~~pFvl~r~~~g~--fddi~si~Ws~DS-----r~l~~gsrD~s~rl~~v~~~k~ 179 (893)
T KOG0291|consen 117 ----------NLLQIWHAPGEIKNEFNPFVLHRTYLGH--FDDITSIDWSDDS-----RLLVTGSRDLSARLFGVDGNKN 179 (893)
T ss_pred ----------ceeEEEecCcchhcccCcceEeeeecCC--ccceeEEEeccCC-----ceEEeccccceEEEEEeccccc
Confidence 34555554321 1122222223 3458899988432 3456669999999999997741
Q ss_pred cccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEE-EEcCCCcceeeeeeecceeEeecCCCCc
Q 000473 253 LDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIF-RLLGSGSTIGEICFVDNLFCLEGGSTNS 331 (1471)
Q Consensus 253 ~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~-~l~d~~~~ige~~~~~~~l~~~~~~~~~ 331 (1471)
-.. .-.++|...+..--|..+...+.+++.++.++ |-++..- .+. -..+.. .+
T Consensus 180 ---~~~-------------~~l~gHkd~VvacfF~~~~~~l~tvskdG~l~~W~~~~~P--~~~------~~~~kd--~e 233 (893)
T KOG0291|consen 180 ---LFT-------------YALNGHKDYVVACFFGANSLDLYTVSKDGALFVWTCDLRP--PEL------DKAEKD--EE 233 (893)
T ss_pred ---cce-------------EeccCCCcceEEEEeccCcceEEEEecCceEEEEEecCCC--ccc------cccccc--cc
Confidence 001 12344666666666777788888888887643 3333111 000 000000 00
Q ss_pred eeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEeeccee
Q 000473 332 YVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFIQMSLY 411 (1471)
Q Consensus 332 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~~~~~~ 411 (1471)
| .++.......+. +-.=.+|....+-++ ++....+ ..+.+.+.-++.+.-...+-+
T Consensus 234 ----g----~~d~~~~~~~Ee-----k~~~~~~~k~~k~~l------n~~~~kv-----taa~fH~~t~~lvvgFssG~f 289 (893)
T KOG0291|consen 234 ----G----SDDEEMDEDGEE-----KTHKIFWYKTKKHYL------NQNSSKV-----TAAAFHKGTNLLVVGFSSGEF 289 (893)
T ss_pred ----c----cccccccccchh-----hhcceEEEEEEeeee------cccccce-----eeeeccCCceEEEEEecCCee
Confidence 0 000000000110 000011111100000 0000000 000111111111110011111
Q ss_pred -eEEeeeeeccccccccccCeeE-EEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCcccc
Q 000473 412 -LLRMETVCFHVEETSQWRPYIS-VWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPR 489 (1471)
Q Consensus 412 -L~~v~s~~~~~~~~~~~~P~v~-vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~ 489 (1471)
|+-+ |-.. +..+... +. ++.. ..| +..|.+ -....+.-+.+-+
T Consensus 290 ~Lyel--------------P~f~lih~LSis---~~---~I~t-----~~~--------N~tGDW--iA~g~~klgQLlV 334 (893)
T KOG0291|consen 290 GLYEL--------------PDFNLIHSLSIS---DQ---KILT-----VSF--------NSTGDW--IAFGCSKLGQLLV 334 (893)
T ss_pred EEEec--------------CCceEEEEeecc---cc---eeeE-----EEe--------cccCCE--EEEcCCccceEEE
Confidence 1111 1000 0000000 00 0000 000 111110 0011122244556
Q ss_pred ccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecC
Q 000473 490 SEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGH 569 (1471)
Q Consensus 490 Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH 569 (1471)
|+.... ..+...++|...++++.|.++.. .+++|++||.|+| ||..++.|..+|..|
T Consensus 335 weWqsE----sYVlKQQgH~~~i~~l~YSpDgq----~iaTG~eDgKVKv---------------Wn~~SgfC~vTFteH 391 (893)
T KOG0291|consen 335 WEWQSE----SYVLKQQGHSDRITSLAYSPDGQ----LIATGAEDGKVKV---------------WNTQSGFCFVTFTEH 391 (893)
T ss_pred EEeecc----ceeeeccccccceeeEEECCCCc----EEEeccCCCcEEE---------------EeccCceEEEEeccC
Confidence 665443 22334578999999999666665 8999999999999 446678999999999
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccC-CCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHV-APVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
+..|+.+.|+.. ++.++|.|.||+|+.||+...+..++|..-. -...+++..|. |..++.|+.|
T Consensus 392 ts~Vt~v~f~~~---------g~~llssSLDGtVRAwDlkRYrNfRTft~P~p~QfscvavD~s------GelV~AG~~d 456 (893)
T KOG0291|consen 392 TSGVTAVQFTAR---------GNVLLSSSLDGTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPS------GELVCAGAQD 456 (893)
T ss_pred CCceEEEEEEec---------CCEEEEeecCCeEEeeeecccceeeeecCCCceeeeEEEEcCC------CCEEEeeccc
Confidence 999999999986 8999999999999999999999988887543 23568888888 8999999988
Q ss_pred C-cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeee
Q 000473 649 F-SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHF 727 (1471)
Q Consensus 649 g-sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~ 727 (1471)
. .|.+|++++|+.+-.+.||.+||.+++|+|++..|++|+.| .+||+||+-......+--.+...++.+.|
T Consensus 457 ~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWD--------kTVRiW~if~s~~~vEtl~i~sdvl~vsf 528 (893)
T KOG0291|consen 457 SFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWD--------KTVRIWDIFSSSGTVETLEIRSDVLAVSF 528 (893)
T ss_pred eEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEecccc--------ceEEEEEeeccCceeeeEeeccceeEEEE
Confidence 5 79999999999999999999999999999999999999988 99999999765322222345567777777
Q ss_pred eeccccccccceEEcCCccccccceeeccCCceEeecccccc
Q 000473 728 CKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 728 ~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
.| +|.......+ ||.+-.|+.+.-.
T Consensus 529 rP------------dG~elaVaTl-----dgqItf~d~~~~~ 553 (893)
T KOG0291|consen 529 RP------------DGKELAVATL-----DGQITFFDIKEAV 553 (893)
T ss_pred cC------------CCCeEEEEEe-----cceEEEEEhhhce
Confidence 74 2333333333 8889999876543
No 8
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.96 E-value=9.2e-28 Score=271.02 Aligned_cols=157 Identities=20% Similarity=0.212 Sum_probs=139.3
Q ss_pred cCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCccc
Q 000473 508 KEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKG 587 (1471)
Q Consensus 508 h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~ 587 (1471)
|...|.++++.++.. .+++|+.|..-+| ||+.++.++-.|.||...|.+++|+|+
T Consensus 302 Hs~~v~~iaf~~DGS----L~~tGGlD~~~Rv---------------WDlRtgr~im~L~gH~k~I~~V~fsPN------ 356 (459)
T KOG0272|consen 302 HSKGVFSIAFQPDGS----LAATGGLDSLGRV---------------WDLRTGRCIMFLAGHIKEILSVAFSPN------ 356 (459)
T ss_pred cccccceeEecCCCc----eeeccCccchhhe---------------eecccCcEEEEecccccceeeEeECCC------
Confidence 444455554333332 5677777776666 567788999999999999999999997
Q ss_pred CcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC
Q 000473 588 WSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG 667 (1471)
Q Consensus 588 ~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g 667 (1471)
|..|+|||.|++++|||++..+++.++.+|..-|+.|.|+|.. |.+|+|++.|++++||.-+++.+++.+.|
T Consensus 357 ---Gy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~nlVS~Vk~~p~~-----g~fL~TasyD~t~kiWs~~~~~~~ksLaG 428 (459)
T KOG0272|consen 357 ---GYHLATGSSDNTCKVWDLRMRSELYTIPAHSNLVSQVKYSPQE-----GYFLVTASYDNTVKIWSTRTWSPLKSLAG 428 (459)
T ss_pred ---ceEEeecCCCCcEEEeeecccccceecccccchhhheEecccC-----CeEEEEcccCcceeeecCCCcccchhhcC
Confidence 9999999999999999999999999999999999999999964 88999999999999999999999999999
Q ss_pred CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 668 HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 668 h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
|.+.|.++..+||+.++++++.| .++++|.
T Consensus 429 He~kV~s~Dis~d~~~i~t~s~D--------RT~KLW~ 458 (459)
T KOG0272|consen 429 HEGKVISLDISPDSQAIATSSFD--------RTIKLWR 458 (459)
T ss_pred CccceEEEEeccCCceEEEeccC--------ceeeecc
Confidence 99999999999999999999999 9999995
No 9
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.96 E-value=7.2e-25 Score=260.94 Aligned_cols=424 Identities=17% Similarity=0.183 Sum_probs=282.6
Q ss_pred CCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccc
Q 000473 15 PSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNV 94 (1471)
Q Consensus 15 p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~ 94 (1471)
+-..|+++.||.|-++|++||.|-++++|+++. .+.-....+.||+.+|.+.-
T Consensus 144 ~fddi~si~Ws~DSr~l~~gsrD~s~rl~~v~~---~k~~~~~~l~gHkd~Vvacf------------------------ 196 (893)
T KOG0291|consen 144 HFDDITSIDWSDDSRLLVTGSRDLSARLFGVDG---NKNLFTYALNGHKDYVVACF------------------------ 196 (893)
T ss_pred CccceeEEEeccCCceEEeccccceEEEEEecc---ccccceEeccCCCcceEEEE------------------------
Confidence 334699999999999999999999999999984 22222446779999998875
Q ss_pred cccccCCCCEEEEEeCCCeEEEEEcCCC--eEEE---------------------------eeeCCCCC--CCCcEEEEc
Q 000473 95 MGKSSLDNGALISACTDGVLCVWSRSSG--HCRR---------------------------RRKLPPWV--GSPSVICTL 143 (1471)
Q Consensus 95 ~~~~s~d~~~LaSas~DG~I~VWdv~~G--~ci~---------------------------~~~l~~~~--g~~~~i~~~ 143 (1471)
|..++.-+.+.|.||.+++|-.+.. .... ..+-++-. ++-....+|
T Consensus 197 ---F~~~~~~l~tvskdG~l~~W~~~~~P~~~~~~~kd~eg~~d~~~~~~~Eek~~~~~~~k~~k~~ln~~~~kvtaa~f 273 (893)
T KOG0291|consen 197 ---FGANSLDLYTVSKDGALFVWTCDLRPPELDKAEKDEEGSDDEEMDEDGEEKTHKIFWYKTKKHYLNQNSSKVTAAAF 273 (893)
T ss_pred ---eccCcceEEEEecCceEEEEEecCCCcccccccccccccccccccccchhhhcceEEEEEEeeeecccccceeeeec
Confidence 5667778999999999999987611 0000 00000000 011222344
Q ss_pred CCCCeEEEEcceecccCCcccccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEee
Q 000473 144 PSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVS 223 (1471)
Q Consensus 144 s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~ 223 (1471)
++...++++|.. ++...+++..+..+++.+.-+. . +|..+.|..
T Consensus 274 H~~t~~lvvgFs---------------------------------sG~f~LyelP~f~lih~LSis~-~--~I~t~~~N~ 317 (893)
T KOG0291|consen 274 HKGTNLLVVGFS---------------------------------SGEFGLYELPDFNLIHSLSISD-Q--KILTVSFNS 317 (893)
T ss_pred cCCceEEEEEec---------------------------------CCeeEEEecCCceEEEEeeccc-c--eeeEEEecc
Confidence 455555555543 3666678888877777776522 2 378888764
Q ss_pred ecCCCCceeEEEE-eCCCcEEEEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEE
Q 000473 224 LGEDMGKHYGLMV-DSVGRLQLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCI 302 (1471)
Q Consensus 224 ~~~d~~~~~llva-s~dG~V~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~ 302 (1471)
.+ +.+.+| +.-|.+-||+...+. -+-.+++|......++++|||+.+++++.++ .
T Consensus 318 tG-----DWiA~g~~klgQLlVweWqsEs------------------YVlKQQgH~~~i~~l~YSpDgq~iaTG~eDg-K 373 (893)
T KOG0291|consen 318 TG-----DWIAFGCSKLGQLLVWEWQSES------------------YVLKQQGHSDRITSLAYSPDGQLIATGAEDG-K 373 (893)
T ss_pred cC-----CEEEEcCCccceEEEEEeeccc------------------eeeeccccccceeeEEECCCCcEEEeccCCC-c
Confidence 43 678887 666899999998762 1244677888999999999999999998887 4
Q ss_pred EEEcCCCcceeeeeeecceeEeecCCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCC
Q 000473 303 FRLLGSGSTIGEICFVDNLFCLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKF 382 (1471)
Q Consensus 303 ~~l~d~~~~ige~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~ 382 (1471)
+++||..+ -+|.. .|.+... .+
T Consensus 374 VKvWn~~S----------gfC~v-----------TFteHts----------------~V--------------------- 395 (893)
T KOG0291|consen 374 VKVWNTQS----------GFCFV-----------TFTEHTS----------------GV--------------------- 395 (893)
T ss_pred EEEEeccC----------ceEEE-----------EeccCCC----------------ce---------------------
Confidence 56676554 11211 1111100 00
Q ss_pred cccCeeeecCccCCCCceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeecc
Q 000473 383 DYEPHFEIPAVSYPSGVKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVN 462 (1471)
Q Consensus 383 ~~~~~~~ip~v~~~~~~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~ 462 (1471)
--+.|...++.+++
T Consensus 396 ------------------t~v~f~~~g~~lls------------------------------------------------ 409 (893)
T KOG0291|consen 396 ------------------TAVQFTARGNVLLS------------------------------------------------ 409 (893)
T ss_pred ------------------EEEEEEecCCEEEE------------------------------------------------
Confidence 00111122222221
Q ss_pred ccccccCCCCcccceeecccccCccccccccCCCCCCCcccccccc-CccEEEEEeeccccccCCEEEEEEcCC-cEEEE
Q 000473 463 NSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHK-EKIVSSSMVISESFYAPYAIVYGFFSG-EIEVI 540 (1471)
Q Consensus 463 ~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h-~~~Vts~~~is~~~f~P~~lv~Gs~DG-~I~V~ 540 (1471)
++-+|+|+.||+.+-+ ...+|... +....|++.-+... .+..|..|. .|.|
T Consensus 410 ------------------sSLDGtVRAwDlkRYr----NfRTft~P~p~QfscvavD~sGe----lV~AG~~d~F~Ifv- 462 (893)
T KOG0291|consen 410 ------------------SSLDGTVRAWDLKRYR----NFRTFTSPEPIQFSCVAVDPSGE----LVCAGAQDSFEIFV- 462 (893)
T ss_pred ------------------eecCCeEEeeeecccc----eeeeecCCCceeeeEEEEcCCCC----EEEeeccceEEEEE-
Confidence 2223344444443321 11222211 12234454333333 344444432 3555
Q ss_pred EecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCce-EEEEec
Q 000473 541 QFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL-ITVMHH 619 (1471)
Q Consensus 541 ~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~-l~~~~~ 619 (1471)
|++++|+.+-.|.||.++|.+++|+|+ +..|+|||+|.|||+||+....- ..++ .
T Consensus 463 --------------WS~qTGqllDiLsGHEgPVs~l~f~~~---------~~~LaS~SWDkTVRiW~if~s~~~vEtl-~ 518 (893)
T KOG0291|consen 463 --------------WSVQTGQLLDILSGHEGPVSGLSFSPD---------GSLLASGSWDKTVRIWDIFSSSGTVETL-E 518 (893)
T ss_pred --------------EEeecCeeeehhcCCCCcceeeEEccc---------cCeEEeccccceEEEEEeeccCceeeeE-e
Confidence 457789999999999999999999998 89999999999999999975532 3333 4
Q ss_pred cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC--------------------CCCCcEEEEEcC
Q 000473 620 HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG--------------------HPNYPAKVVWDC 679 (1471)
Q Consensus 620 H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g--------------------h~~~V~~v~~sp 679 (1471)
+...|..+.|+|+ |+.+|.+..||.|.+||.+.+..+..+.| .....+.+++++
T Consensus 519 i~sdvl~vsfrPd------G~elaVaTldgqItf~d~~~~~q~~~IdgrkD~~~gR~~~D~~ta~~sa~~K~Ftti~ySa 592 (893)
T KOG0291|consen 519 IRSDVLAVSFRPD------GKELAVATLDGQITFFDIKEAVQVGSIDGRKDLSGGRKETDRITAENSAKGKTFTTICYSA 592 (893)
T ss_pred eccceeEEEEcCC------CCeEEEEEecceEEEEEhhhceeeccccchhhccccccccceeehhhcccCCceEEEEEcC
Confidence 6678999999999 99999999999999999987655533332 123678999999
Q ss_pred CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 680 PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 680 dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
||.+|++|+.. ..|.+||+.++-+++.+.-
T Consensus 593 DG~~IlAgG~s--------n~iCiY~v~~~vllkkfqi 622 (893)
T KOG0291|consen 593 DGKCILAGGES--------NSICIYDVPEGVLLKKFQI 622 (893)
T ss_pred CCCEEEecCCc--------ccEEEEECchhheeeeEEe
Confidence 99999999998 8999999999999887763
No 10
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.95 E-value=5.6e-26 Score=246.54 Aligned_cols=200 Identities=21% Similarity=0.276 Sum_probs=174.1
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
.+|.+.++|+.|..+. .|++|+.|.++.+ ||+++++..+.|.||++.|.+|.++|.
T Consensus 142 ~gHtgylScC~f~dD~-----~ilT~SGD~TCal---------------WDie~g~~~~~f~GH~gDV~slsl~p~---- 197 (343)
T KOG0286|consen 142 AGHTGYLSCCRFLDDN-----HILTGSGDMTCAL---------------WDIETGQQTQVFHGHTGDVMSLSLSPS---- 197 (343)
T ss_pred cCccceeEEEEEcCCC-----ceEecCCCceEEE---------------EEcccceEEEEecCCcccEEEEecCCC----
Confidence 4566777777766655 4999999999988 568889999999999999999999994
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF 665 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~ 665 (1471)
+++.++||+.|++.++||++.+.+.+.|.+|...|.+|.|.|+ |.-|++|++|++.++||++..+.+..+
T Consensus 198 ----~~ntFvSg~cD~~aklWD~R~~~c~qtF~ghesDINsv~ffP~------G~afatGSDD~tcRlyDlRaD~~~a~y 267 (343)
T KOG0286|consen 198 ----DGNTFVSGGCDKSAKLWDVRSGQCVQTFEGHESDINSVRFFPS------GDAFATGSDDATCRLYDLRADQELAVY 267 (343)
T ss_pred ----CCCeEEecccccceeeeeccCcceeEeecccccccceEEEccC------CCeeeecCCCceeEEEeecCCcEEeee
Confidence 2899999999999999999999999999999999999999999 999999999999999999999998888
Q ss_pred cCC--CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcC
Q 000473 666 PGH--PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNG 743 (1471)
Q Consensus 666 ~gh--~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g 743 (1471)
... ..+|++|+||-.|++|++|..| .++.|||.-.++.+..|.||..+|.++..++. .--+.+|
T Consensus 268 s~~~~~~gitSv~FS~SGRlLfagy~d--------~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~D------G~av~Tg 333 (343)
T KOG0286|consen 268 SHDSIICGITSVAFSKSGRLLFAGYDD--------FTCNVWDTLKGERVGVLAGHENRVSCLGVSPD------GMAVATG 333 (343)
T ss_pred ccCcccCCceeEEEcccccEEEeeecC--------CceeEeeccccceEEEeeccCCeeEEEEECCC------CcEEEec
Confidence 632 3478999999999999999888 99999999999999999999999999877731 1123344
Q ss_pred CccccccceeeccCCceEeec
Q 000473 744 NTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 744 ~~~~s~~l~~~~~D~tir~w~ 764 (1471)
++ |.++|+|.
T Consensus 334 SW-----------Ds~lriW~ 343 (343)
T KOG0286|consen 334 SW-----------DSTLRIWA 343 (343)
T ss_pred ch-----------hHheeecC
Confidence 44 88899884
No 11
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.95 E-value=6.5e-28 Score=272.22 Aligned_cols=242 Identities=14% Similarity=0.182 Sum_probs=209.5
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc------cc----
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL------FE---- 546 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~------l~---- 546 (1471)
.+++.+.+|.+++|+..+. +..++|.+|...|.++.+.|... ...+++|+.||++++|.++. +.
T Consensus 189 ~laT~swsG~~kvW~~~~~----~~~~~l~gH~~~v~~~~fhP~~~--~~~lat~s~Dgtvklw~~~~e~~l~~l~gH~~ 262 (459)
T KOG0272|consen 189 HLATGSWSGLVKVWSVPQC----NLLQTLRGHTSRVGAAVFHPVDS--DLNLATASADGTVKLWKLSQETPLQDLEGHLA 262 (459)
T ss_pred eEEEeecCCceeEeecCCc----ceeEEEeccccceeeEEEccCCC--ccceeeeccCCceeeeccCCCcchhhhhcchh
Confidence 4677888899999998764 67789999999999998555420 01799999999999976532 11
Q ss_pred ---------------cC--CCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC
Q 000473 547 ---------------RH--NSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG 609 (1471)
Q Consensus 547 ---------------~~--d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~ 609 (1471)
.. |.+-++||+.++.......||...|.+++|+|| |.+++|||.|..-+|||++
T Consensus 263 RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~ElL~QEGHs~~v~~iaf~~D---------GSL~~tGGlD~~~RvWDlR 333 (459)
T KOG0272|consen 263 RVSRVAFHPSGKFLGTASFDSTWRLWDLETKSELLLQEGHSKGVFSIAFQPD---------GSLAATGGLDSLGRVWDLR 333 (459)
T ss_pred hheeeeecCCCceeeecccccchhhcccccchhhHhhcccccccceeEecCC---------CceeeccCccchhheeecc
Confidence 11 445578999999888888999999999999998 9999999999999999999
Q ss_pred CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC-CCCEEEEEE
Q 000473 610 SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC-PRGYIACLC 688 (1471)
Q Consensus 610 tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp-dg~~L~sgs 688 (1471)
+|+++..+.+|..+|.+|.|+|+ |..++|||.|++++|||++..+++.++++|.+-|+.|+|+| .|.+|+|++
T Consensus 334 tgr~im~L~gH~k~I~~V~fsPN------Gy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~nlVS~Vk~~p~~g~fL~Tas 407 (459)
T KOG0272|consen 334 TGRCIMFLAGHIKEILSVAFSPN------GYHLATGSSDNTCKVWDLRMRSELYTIPAHSNLVSQVKYSPQEGYFLVTAS 407 (459)
T ss_pred cCcEEEEecccccceeeEeECCC------ceEEeecCCCCcEEEeeecccccceecccccchhhheEecccCCeEEEEcc
Confidence 99999999999999999999999 99999999999999999999999999999999999999999 788999999
Q ss_pred cCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 689 RDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 689 ~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
.| ++++||..+++++++++.||.+.|+.+...+ ++.. +.+.+-|.++|.|.
T Consensus 408 yD--------~t~kiWs~~~~~~~ksLaGHe~kV~s~Dis~------------d~~~-----i~t~s~DRT~KLW~ 458 (459)
T KOG0272|consen 408 YD--------NTVKIWSTRTWSPLKSLAGHEGKVISLDISP------------DSQA-----IATSSFDRTIKLWR 458 (459)
T ss_pred cC--------cceeeecCCCcccchhhcCCccceEEEEecc------------CCce-----EEEeccCceeeecc
Confidence 98 9999999999999999999999999986662 1222 33444599999995
No 12
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.95 E-value=2.9e-23 Score=239.66 Aligned_cols=533 Identities=12% Similarity=0.070 Sum_probs=311.4
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
-++|.||...+||.|-++|+|...|.|++||.. ..+.-++..+.--+++|.+|+
T Consensus 57 EH~~~vtVAkySPsG~yiASGD~sG~vRIWdtt---~~~hiLKnef~v~aG~I~Di~----------------------- 110 (603)
T KOG0318|consen 57 EHAHQVTVAKYSPSGFYIASGDVSGKVRIWDTT---QKEHILKNEFQVLAGPIKDIS----------------------- 110 (603)
T ss_pred cccceeEEEEeCCCceEEeecCCcCcEEEEecc---Ccceeeeeeeeecccccccce-----------------------
Confidence 457999999999999999999999999999987 334556666777788999998
Q ss_pred ccccccCCCCEEEEEeCC----CeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCC-CC-eEEEEcceecccCCcccccc
Q 000473 94 VMGKSSLDNGALISACTD----GVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPS-NP-RYVCIGCCFIDTNQLSDHHS 167 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas~D----G~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~-~~-~ll~~G~~~id~~~~~~~h~ 167 (1471)
++.|+++++..++. |.+.+||. |.-+-...-+ ......+.+.+ ++ |++++|.+
T Consensus 111 ----Wd~ds~RI~avGEGrerfg~~F~~DS--G~SvGei~Gh---Sr~ins~~~KpsRPfRi~T~sdD------------ 169 (603)
T KOG0318|consen 111 ----WDFDSKRIAAVGEGRERFGHVFLWDS--GNSVGEITGH---SRRINSVDFKPSRPFRIATGSDD------------ 169 (603)
T ss_pred ----eCCCCcEEEEEecCccceeEEEEecC--CCccceeecc---ceeEeeeeccCCCceEEEeccCC------------
Confidence 67788889887764 45778875 3322222111 11112233332 33 45555544
Q ss_pred cccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEEC
Q 000473 168 FESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPI 247 (1471)
Q Consensus 168 ~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l 247 (1471)
..|.+|+-.=.+.-.+... +...|.++.++ ||+. ..+.++.||.+.++|=
T Consensus 170 ----------------------n~v~ffeGPPFKFk~s~r~---HskFV~~VRys---PDG~--~Fat~gsDgki~iyDG 219 (603)
T KOG0318|consen 170 ----------------------NTVAFFEGPPFKFKSSFRE---HSKFVNCVRYS---PDGS--RFATAGSDGKIYIYDG 219 (603)
T ss_pred ----------------------CeEEEeeCCCeeeeecccc---cccceeeEEEC---CCCC--eEEEecCCccEEEEcC
Confidence 5555655433333333332 23358899988 4554 3666799999999995
Q ss_pred CCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCe-EEEEEcCCCcceeeeeeecceeEeec
Q 000473 248 SKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDH-CIFRLLGSGSTIGEICFVDNLFCLEG 326 (1471)
Q Consensus 248 ~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~-~~~~l~d~~~~ige~~~~~~~l~~~~ 326 (1471)
..+ +.+.+- ....+|.+++.+++.+||+..+++++.+. +.+|......++.++....+
T Consensus 220 ktg--------e~vg~l-------~~~~aHkGsIfalsWsPDs~~~~T~SaDkt~KIWdVs~~slv~t~~~~~~------ 278 (603)
T KOG0318|consen 220 KTG--------EKVGEL-------EDSDAHKGSIFALSWSPDSTQFLTVSADKTIKIWDVSTNSLVSTWPMGST------ 278 (603)
T ss_pred CCc--------cEEEEe-------cCCCCccccEEEEEECCCCceEEEecCCceEEEEEeeccceEEEeecCCc------
Confidence 544 222221 22446888899999999999999987765 34433333444444422211
Q ss_pred CCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCc-eeeEEE
Q 000473 327 GSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGV-KFSIHF 405 (1471)
Q Consensus 327 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~-~~~i~f 405 (1471)
-....+|+..- .+ .++.=+-+|....++..... .. .+-..++. --.+.-
T Consensus 279 --v~dqqvG~lWq-kd-----------------~lItVSl~G~in~ln~~d~~-~~---------~~i~GHnK~ITaLtv 328 (603)
T KOG0318|consen 279 --VEDQQVGCLWQ-KD-----------------HLITVSLSGTINYLNPSDPS-VL---------KVISGHNKSITALTV 328 (603)
T ss_pred --hhceEEEEEEe-CC-----------------eEEEEEcCcEEEEecccCCC-hh---------heecccccceeEEEE
Confidence 01123344333 11 23223334443333322111 00 00011111 112333
Q ss_pred eecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccC
Q 000473 406 IQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQD 485 (1471)
Q Consensus 406 ~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~ 485 (1471)
..++.++++.. . +.++.-|..... ..+++..++..-.+--.. ....| .+.+.+.+.
T Consensus 329 ~~d~~~i~Sgs-----y------DG~I~~W~~~~g-----~~~~~~g~~h~nqI~~~~----~~~~~----~~~t~g~Dd 384 (603)
T KOG0318|consen 329 SPDGKTIYSGS-----Y------DGHINSWDSGSG-----TSDRLAGKGHTNQIKGMA----ASESG----ELFTIGWDD 384 (603)
T ss_pred cCCCCEEEeec-----c------CceEEEEecCCc-----cccccccccccceEEEEe----ecCCC----cEEEEecCC
Confidence 45556666543 2 335777763221 111222222100000000 00011 244445556
Q ss_pred ccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE
Q 000473 486 TVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY 565 (1471)
Q Consensus 486 ~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~ 565 (1471)
+++.-+...+.-.......+...+. +++..++.. .++..+. +.|.+ ++. ......
T Consensus 385 ~l~~~~~~~~~~t~~~~~~lg~QP~---~lav~~d~~----~avv~~~-~~iv~--l~~---------------~~~~~~ 439 (603)
T KOG0318|consen 385 TLRVISLKDNGYTKSEVVKLGSQPK---GLAVLSDGG----TAVVACI-SDIVL--LQD---------------QTKVSS 439 (603)
T ss_pred eEEEEecccCcccccceeecCCCce---eEEEcCCCC----EEEEEec-CcEEE--Eec---------------CCccee
Confidence 6665554432211111111111111 223223322 2333333 33333 110 000111
Q ss_pred EecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE--EEEeccCCCEEEEEECCCCCCCCCCCEEE
Q 000473 566 FLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI--TVMHHHVAPVRQIILSPPQTEHPWSDCFL 643 (1471)
Q Consensus 566 l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l--~~~~~H~~~V~~l~fspd~~~~~~~~~l~ 643 (1471)
.. -.-.+.+++++|+ +..++.|+.|+.|+++.+..+++. .....|.++|++++|+|+ +.+|+
T Consensus 440 ~~-~~y~~s~vAv~~~---------~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd------~~yla 503 (603)
T KOG0318|consen 440 IP-IGYESSAVAVSPD---------GSEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPD------GAYLA 503 (603)
T ss_pred ec-cccccceEEEcCC---------CCEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCC------CcEEE
Confidence 11 1234689999998 899999999999999999865533 456789999999999999 99999
Q ss_pred EEeCCCcEEEEECCCCcEEEEe-cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCC
Q 000473 644 SVGEDFSVALASLETLRVERMF-PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASH 721 (1471)
Q Consensus 644 S~s~DgsV~lWdl~t~~~l~~~-~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~ 721 (1471)
++...+.|.+||+.+++..... .-|...|.+++|+|+..++++|+.| -.|.||+++.- +.+.....|...
T Consensus 504 ~~Da~rkvv~yd~~s~~~~~~~w~FHtakI~~~aWsP~n~~vATGSlD--------t~Viiysv~kP~~~i~iknAH~~g 575 (603)
T KOG0318|consen 504 AGDASRKVVLYDVASREVKTNRWAFHTAKINCVAWSPNNKLVATGSLD--------TNVIIYSVKKPAKHIIIKNAHLGG 575 (603)
T ss_pred EeccCCcEEEEEcccCceecceeeeeeeeEEEEEeCCCceEEEecccc--------ceEEEEEccChhhheEeccccccC
Confidence 9999999999999988874333 3499999999999999999999999 89999999863 335556678877
Q ss_pred ceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 722 SMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 722 v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
|..+.|-++ ..+++..+|..+|.|++
T Consensus 576 Vn~v~wlde------------------~tvvSsG~Da~iK~W~v 601 (603)
T KOG0318|consen 576 VNSVAWLDE------------------STVVSSGQDANIKVWNV 601 (603)
T ss_pred ceeEEEecC------------------ceEEeccCcceeEEecc
Confidence 777666532 23455566999999986
No 13
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.95 E-value=3.8e-25 Score=260.08 Aligned_cols=604 Identities=13% Similarity=0.121 Sum_probs=347.3
Q ss_pred eeeEecCCCCCC---------ceEEEEEEcCCCCeEEEE--eCCCcEEEEEccCCCCCceeeeEEecccccceeEeeecc
Q 000473 5 SVACIWSGTPPS---------HRVTATSALTQPPTLYTG--GSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICY 73 (1471)
Q Consensus 5 ~~~~lw~~~~p~---------h~Vtava~SpDg~~LaTG--s~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~ 73 (1471)
=|++|+...-.+ -.+||+|||++|+|+||| |....+++|+++. ...++.|..|+..|+|++
T Consensus 58 CvVVlfn~~~~tQ~hlvnssRk~~t~vAfS~~GryvatGEcG~~pa~kVw~la~-----h~vVAEfvdHKY~vtcva--- 129 (1080)
T KOG1408|consen 58 CVVVLFNVDSCTQSHLVNSSRKPLTCVAFSQNGRYVATGECGRTPASKVWSLAF-----HGVVAEFVDHKYNVTCVA--- 129 (1080)
T ss_pred cEEEEEcccccchhheecccCcceeEEEEcCCCcEEEecccCCCccceeeeecc-----ccchhhhhhccccceeee---
Confidence 367777665443 259999999999999999 4778899999985 346778899999999999
Q ss_pred ccccccCcccccccccccccccccccCCCCEEEEEeC--CCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeE-E
Q 000473 74 PAMVSRDGKAEHWKAENSSNVMGKSSLDNGALISACT--DGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRY-V 150 (1471)
Q Consensus 74 ~~~~s~dg~~~~~~~~~~~~~~~~~s~d~~~LaSas~--DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~l-l 150 (1471)
|+|.++|++|.+. |-.|.+||+.........+. .+...+..|++++-| +
T Consensus 130 ------------------------Fsp~~kyvvSVGsQHDMIVnv~dWr~N~~~asnki----ss~Vsav~fsEdgSYfv 181 (1080)
T KOG1408|consen 130 ------------------------FSPGNKYVVSVGSQHDMIVNVNDWRVNSSGASNKI----SSVVSAVAFSEDGSYFV 181 (1080)
T ss_pred ------------------------ecCCCcEEEeeccccceEEEhhhhhhccccccccc----ceeEEEEEEccCCceee
Confidence 7899999999664 77888999875544444333 234566788999865 4
Q ss_pred EEcceecccCCcccccccc----cccccccccccCCCCCCCCCceEEE-----EeCcceEEEEEeecCccccCC------
Q 000473 151 CIGCCFIDTNQLSDHHSFE----SVEGDLVSEDKEVPMKNPPKCTLVI-----VDTYGLTIVQTVFHGNLSIGP------ 215 (1471)
Q Consensus 151 ~~G~~~id~~~~~~~h~~~----~i~~~~~~~d~~~~~~~~~~~~I~v-----~D~~t~~~l~tl~s~~~s~~~------ 215 (1471)
++|...+..|.+.....++ ..-....+-+.+........|.+-+ |.......+-.+.+.++--.|
T Consensus 182 T~gnrHvk~wyl~~~~KykdpiPl~gRs~~lg~lr~n~f~avaCg~gicAestfait~qGhLvEFSsRRLLDKWVqcRTT 261 (1080)
T KOG1408|consen 182 TSGNRHVKLWYLQIQSKYKDPIPLPGRSYFLGNLRFNEFLAVACGVGICAESTFAITAQGHLVEFSSRRLLDKWVQCRTT 261 (1080)
T ss_pred eeeeeeEEEEEeeccccccCCccccchhhhccccccchhhhhhhcCcccccceEEEecccceeeechhhhhhhhhhhhcc
Confidence 5677667767665554331 1111111112111111111121111 111111111222221211122
Q ss_pred -eEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCccccc-----C---CCcccCCCcccceeccCCcccCceEEEE
Q 000473 216 -WKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDREE-----G---NGLCKSSSQLDMAILQNGVVEGGHLVSV 286 (1471)
Q Consensus 216 -i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~~~-----~---~~l~~~e~~i~~v~~~~~~~~~~~~vs~ 286 (1471)
.++++++. ..+++|.++|+|++|+.++..-....+ + ..+.+.+ ++.. .-.++.+.+-.++.|
T Consensus 262 nAnCIcVs~-------r~I~cgCa~g~vrlFnp~tL~y~~Tlpr~halg~d~a~~~q~~-~~~s-~~~~a~fPD~IA~~F 332 (1080)
T KOG1408|consen 262 NANCICVSS-------RLIACGCAKGMVRLFNPETLDYAGTLPRSHALGSDTANLSQPE-PKNS-ESSPAIFPDAIACQF 332 (1080)
T ss_pred ccceeeeec-------ceEEEeeccceeeecCcchhhhccccccccccccchhhccccc-cccc-ccCcccCCceeEEEe
Confidence 25667661 457888999999999977642111111 0 1111111 1100 112223333456667
Q ss_pred ecCCcEEEEEeCCeEEEEEcCCCc--ceeee-eeecceeEeecCCCCceeeeeEeecchhhhhhcccccccccccceEEE
Q 000473 287 ATCGNIIALVLKDHCIFRLLGSGS--TIGEI-CFVDNLFCLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAV 363 (1471)
Q Consensus 287 s~~g~~l~~~~~~~~~~~l~d~~~--~ige~-~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~v 363 (1471)
.+....+.++-.++-++ +||..+ ..|+. .+..+-.|.=. ...+..+ ++.....|-. .+-|..
T Consensus 333 det~~klscVYndhSlY-vWDvrD~~kvgk~~s~lyHS~ciW~---------Ve~~p~n----v~~~~~aclp-~~cF~T 397 (1080)
T KOG1408|consen 333 DETTDKLSCVYNDHSLY-VWDVRDVNKVGKCSSMLYHSACIWD---------VENLPCN----VHSPTAACLP-RGCFTT 397 (1080)
T ss_pred cCCCceEEEEEcCceEE-EEeccccccccceeeeeeccceeee---------ecccccc----ccCcccccCC-ccceeE
Confidence 76655555554544332 366555 12222 11111122100 0000000 0000000000 124566
Q ss_pred EcCCCcEEEEEeecC--CCCCcccCee-eecCccCCCCceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEcccc
Q 000473 364 WDNRGSAIVYAISYM--NEKFDYEPHF-EIPAVSYPSGVKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQK 440 (1471)
Q Consensus 364 w~~~G~~~vy~l~~~--~~~~~~~~~~-~ip~v~~~~~~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~ 440 (1471)
.+.||.+++|.+.+. ++.+++..+. +..++++..+. -+++.-.....++......+.|.+-+-.+
T Consensus 398 CSsD~TIRlW~l~~ctnn~vyrRNils~~l~ki~y~d~~---------~q~~~d~~~~~fdka~~s~~d~r~G~R~~--- 465 (1080)
T KOG1408|consen 398 CSSDGTIRLWDLAFCTNNQVYRRNILSANLSKIPYEDST---------QQIMHDASAGIFDKALVSTCDSRFGFRAL--- 465 (1080)
T ss_pred ecCCCcEEEeecccccccceeecccchhhhhcCccccCc---------hhhhhhccCCcccccchhhcCcccceEEE---
Confidence 666666666666542 2222221111 11111111111 00010000000111111111111111000
Q ss_pred CCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccccccccCCCCCCCccccccccCccEEEEEeecc
Q 000473 441 HSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISE 520 (1471)
Q Consensus 441 ~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~ 520 (1471)
.+.++|. .++..-..+.++++++.+- .....+..|...|.|+.|...
T Consensus 466 --------------------------~vSp~gq---hLAsGDr~GnlrVy~Lq~l----~~~~~~eAHesEilcLeyS~p 512 (1080)
T KOG1408|consen 466 --------------------------AVSPDGQ---HLASGDRGGNLRVYDLQEL----EYTCFMEAHESEILCLEYSFP 512 (1080)
T ss_pred --------------------------EECCCcc---eecccCccCceEEEEehhh----hhhhheecccceeEEEeecCc
Confidence 1223441 2444455678888887653 334456889999999885432
Q ss_pred ccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC
Q 000473 521 SFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD 600 (1471)
Q Consensus 521 ~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D 600 (1471)
.. .-..|++|+.|.-|.|++.. ..-.+++++.+|...|+++.|.-. + .+..++|++.|
T Consensus 513 ~~-~~kLLASasrdRlIHV~Dv~--------------rny~l~qtld~HSssITsvKFa~~-----g--ln~~MiscGAD 570 (1080)
T KOG1408|consen 513 VL-TNKLLASASRDRLIHVYDVK--------------RNYDLVQTLDGHSSSITSVKFACN-----G--LNRKMISCGAD 570 (1080)
T ss_pred hh-hhHhhhhccCCceEEEEecc--------------cccchhhhhcccccceeEEEEeec-----C--CceEEEeccCc
Confidence 21 11278999999999994422 122457889999999999999764 1 14689999999
Q ss_pred CcEEEEECCCCceEEEEecc-----CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC---CCCCc
Q 000473 601 CSIRIWDLGSGNLITVMHHH-----VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG---HPNYP 672 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H-----~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g---h~~~V 672 (1471)
+.|.+--.....-...|..| ...+..+++.|. ..++++++.|+.|+|+|+++++..+.|.| |.+..
T Consensus 571 ksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~------~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~l 644 (1080)
T KOG1408|consen 571 KSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPT------SKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDL 644 (1080)
T ss_pred hhhheehhccccCceeccccccccccceEEEeeeCCC------cceEEEEecccceEEEeccccceeeeecccccCCCce
Confidence 98764333211111222222 246889999998 88999999999999999999999999975 66778
Q ss_pred EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccce
Q 000473 673 AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLL 752 (1471)
Q Consensus 673 ~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~ 752 (1471)
..|...|.|.||++.|.| .++.++|..+|+++..+.||...|+.+.|.+.- .++.
T Consensus 645 IKv~lDPSgiY~atScsd--------ktl~~~Df~sgEcvA~m~GHsE~VTG~kF~nDC-----------------kHlI 699 (1080)
T KOG1408|consen 645 IKVILDPSGIYLATSCSD--------KTLCFVDFVSGECVAQMTGHSEAVTGVKFLNDC-----------------KHLI 699 (1080)
T ss_pred EEEEECCCccEEEEeecC--------CceEEEEeccchhhhhhcCcchheeeeeecccc-----------------hhhe
Confidence 899999999999999999 999999999999999999999999998887421 2345
Q ss_pred eeccCCceEeeccc
Q 000473 753 PIHEDGTFRQSQIQ 766 (1471)
Q Consensus 753 ~~~~D~tir~w~l~ 766 (1471)
.++.|+.|-+|.+.
T Consensus 700 SvsgDgCIFvW~lp 713 (1080)
T KOG1408|consen 700 SVSGDGCIFVWKLP 713 (1080)
T ss_pred eecCCceEEEEECc
Confidence 66779999999874
No 14
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.94 E-value=2e-23 Score=240.86 Aligned_cols=405 Identities=14% Similarity=0.154 Sum_probs=265.1
Q ss_pred EEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEec---ccccceeEeeeccccccccCccccccccccccccc
Q 000473 19 VTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLC---GHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVM 95 (1471)
Q Consensus 19 Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~---GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~ 95 (1471)
|.|+.|+|||..+||.+.||+|.++|-.+ .+.+..|. +|++.|-+|+
T Consensus 193 V~~VRysPDG~~Fat~gsDgki~iyDGkt-----ge~vg~l~~~~aHkGsIfals------------------------- 242 (603)
T KOG0318|consen 193 VNCVRYSPDGSRFATAGSDGKIYIYDGKT-----GEKVGELEDSDAHKGSIFALS------------------------- 242 (603)
T ss_pred eeeEEECCCCCeEEEecCCccEEEEcCCC-----ccEEEEecCCCCccccEEEEE-------------------------
Confidence 99999999999999999999999999875 33555666 8999999999
Q ss_pred ccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccccc
Q 000473 96 GKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDL 175 (1471)
Q Consensus 96 ~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~ 175 (1471)
++||++.++|+|.|.+++|||+.+++|+....+...++...--+-. ..+.++++.-
T Consensus 243 --WsPDs~~~~T~SaDkt~KIWdVs~~slv~t~~~~~~v~dqqvG~lW-qkd~lItVSl--------------------- 298 (603)
T KOG0318|consen 243 --WSPDSTQFLTVSADKTIKIWDVSTNSLVSTWPMGSTVEDQQVGCLW-QKDHLITVSL--------------------- 298 (603)
T ss_pred --ECCCCceEEEecCCceEEEEEeeccceEEEeecCCchhceEEEEEE-eCCeEEEEEc---------------------
Confidence 7999999999999999999999999999887765221111111111 1344444433
Q ss_pred cccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCccc
Q 000473 176 VSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDR 255 (1471)
Q Consensus 176 ~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~ 255 (1471)
++.|-.+++....+++++.. +... |+++++++ ++ ..++.|+.||.|.-|++..+.
T Consensus 299 -------------~G~in~ln~~d~~~~~~i~G-HnK~--ITaLtv~~---d~--~~i~SgsyDG~I~~W~~~~g~---- 353 (603)
T KOG0318|consen 299 -------------SGTINYLNPSDPSVLKVISG-HNKS--ITALTVSP---DG--KTIYSGSYDGHINSWDSGSGT---- 353 (603)
T ss_pred -------------CcEEEEecccCCChhheecc-cccc--eeEEEEcC---CC--CEEEeeccCceEEEEecCCcc----
Confidence 38899999999887777665 6666 99999884 44 458889999999999998773
Q ss_pred ccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeecCCCCceeee
Q 000473 256 EEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEGGSTNSYVIG 335 (1471)
Q Consensus 256 ~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~~~~~~~~~~ 335 (1471)
..++.+ ..|...+..++.+..+.++-+.-++... ..+.+. +
T Consensus 354 --~~~~~g-----------~~h~nqI~~~~~~~~~~~~t~g~Dd~l~--~~~~~~------------------------~ 394 (603)
T KOG0318|consen 354 --SDRLAG-----------KGHTNQIKGMAASESGELFTIGWDDTLR--VISLKD------------------------N 394 (603)
T ss_pred --cccccc-----------ccccceEEEEeecCCCcEEEEecCCeEE--EEeccc------------------------C
Confidence 111111 1133333444433334443333333211 111111 0
Q ss_pred eEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEeecceeeEEe
Q 000473 336 AMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFIQMSLYLLRM 415 (1471)
Q Consensus 336 g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~~~~~~L~~v 415 (1471)
++.. . . +.++..+ |.. ++-..++..++
T Consensus 395 ~~t~-~------------------~-----------~~~lg~Q------------P~~---------lav~~d~~~av-- 421 (603)
T KOG0318|consen 395 GYTK-S------------------E-----------VVKLGSQ------------PKG---------LAVLSDGGTAV-- 421 (603)
T ss_pred cccc-c------------------c-----------eeecCCC------------cee---------EEEcCCCCEEE--
Confidence 0000 0 0 0000000 000 00000000000
Q ss_pred eeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccccccccCC
Q 000473 416 ETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDS 495 (1471)
Q Consensus 416 ~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~ 495 (1471)
+. +.+. -.+.++. .+ . ..
T Consensus 422 ----------------v~----~~~~-------iv~l~~~---------------~~-----~---------------~~ 439 (603)
T KOG0318|consen 422 ----------------VA----CISD-------IVLLQDQ---------------TK-----V---------------SS 439 (603)
T ss_pred ----------------EE----ecCc-------EEEEecC---------------Cc-----c---------------ee
Confidence 00 0000 0111110 00 0 00
Q ss_pred CCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEE
Q 000473 496 RQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLC 575 (1471)
Q Consensus 496 ~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~ 575 (1471)
.+ -.-..+++++.++.. .++.|++||.+.|+... + ........+..|.++|++
T Consensus 440 ~~----------~~y~~s~vAv~~~~~----~vaVGG~Dgkvhvysl~---g----------~~l~ee~~~~~h~a~iT~ 492 (603)
T KOG0318|consen 440 IP----------IGYESSAVAVSPDGS----EVAVGGQDGKVHVYSLS---G----------DELKEEAKLLEHRAAITD 492 (603)
T ss_pred ec----------cccccceEEEcCCCC----EEEEecccceEEEEEec---C----------CcccceeeeecccCCceE
Confidence 00 012244566555554 79999999999996543 1 111344567789999999
Q ss_pred EEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE-EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEE
Q 000473 576 LAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI-TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALA 654 (1471)
Q Consensus 576 la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l-~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lW 654 (1471)
++|+|| +.+|++|...+.|.+||..+.+.. ..+.-|+..|.+++|+|+ ...+|||+-|-.|.+|
T Consensus 493 vaySpd---------~~yla~~Da~rkvv~yd~~s~~~~~~~w~FHtakI~~~aWsP~------n~~vATGSlDt~Viiy 557 (603)
T KOG0318|consen 493 VAYSPD---------GAYLAAGDASRKVVLYDVASREVKTNRWAFHTAKINCVAWSPN------NKLVATGSLDTNVIIY 557 (603)
T ss_pred EEECCC---------CcEEEEeccCCcEEEEEcccCceecceeeeeeeeEEEEEeCCC------ceEEEeccccceEEEE
Confidence 999998 899999999999999999988764 344459999999999999 7899999999999999
Q ss_pred ECCCCc-EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 655 SLETLR-VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 655 dl~t~~-~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
+++... .+.....|...|+.+.|- +...+++.+.| ..|++|++.
T Consensus 558 sv~kP~~~i~iknAH~~gVn~v~wl-de~tvvSsG~D--------a~iK~W~v~ 602 (603)
T KOG0318|consen 558 SVKKPAKHIIIKNAHLGGVNSVAWL-DESTVVSSGQD--------ANIKVWNVT 602 (603)
T ss_pred EccChhhheEeccccccCceeEEEe-cCceEEeccCc--------ceeEEeccc
Confidence 997643 355556798899999996 45688999998 899999874
No 15
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.94 E-value=3.1e-23 Score=249.90 Aligned_cols=535 Identities=16% Similarity=0.160 Sum_probs=330.1
Q ss_pred eeEecCCCCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCce-ee-eEEecccccceeEeeeccccccccCccc
Q 000473 6 VACIWSGTPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEI-KP-VAMLCGHSAPIADLSICYPAMVSRDGKA 83 (1471)
Q Consensus 6 ~~~lw~~~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~-~~-~~~L~GH~~~Vt~La~c~~~~~s~dg~~ 83 (1471)
+.++|.... ..|.-+. |=|.+|+++..++.+.+|+... ..+. .. ...+.+...-|++|. +|.
T Consensus 106 i~~~~~~~~--a~v~~l~--~fGe~lia~d~~~~l~vw~~s~--~~~e~~l~~~~~~~~~~~Ital~--HP~-------- 169 (910)
T KOG1539|consen 106 IRHTTLLHG--AKVHLLL--PFGEHLIAVDISNILFVWKTSS--IQEELYLQSTFLKVEGDFITALL--HPS-------- 169 (910)
T ss_pred EEEEecccc--ceEEEEe--eecceEEEEEccCcEEEEEecc--ccccccccceeeeccCCceeeEe--cch--------
Confidence 344444433 4454443 3478888888999999999873 1111 11 111222222288885 565
Q ss_pred ccccccccccccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcC--CCCeEEEEcceecccCC
Q 000473 84 EHWKAENSSNVMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLP--SNPRYVCIGCCFIDTNQ 161 (1471)
Q Consensus 84 ~~~~~~~~~~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s--~~~~ll~~G~~~id~~~ 161 (1471)
.|.++ ++-|+.+|.|.+||+.+|+.+...+.. +..|.... |--.++++|.-
T Consensus 170 -TYLNK---------------IvvGs~~G~lql~Nvrt~K~v~~f~~~-----~s~IT~ieqsPaLDVVaiG~~------ 222 (910)
T KOG1539|consen 170 -TYLNK---------------IVVGSSQGRLQLWNVRTGKVVYTFQEF-----FSRITAIEQSPALDVVAIGLE------ 222 (910)
T ss_pred -hheee---------------EEEeecCCcEEEEEeccCcEEEEeccc-----ccceeEeccCCcceEEEEecc------
Confidence 45555 889999999999999999999886543 24444444 44567888875
Q ss_pred cccccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCc
Q 000473 162 LSDHHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGR 241 (1471)
Q Consensus 162 ~~~~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~ 241 (1471)
+|+|.+++...++++.++.+ .+ +.|+.++|.. ||+ .-+++|+..|.
T Consensus 223 ---------------------------~G~ViifNlK~dkil~sFk~-d~--g~VtslSFrt---DG~-p~las~~~~G~ 268 (910)
T KOG1539|consen 223 ---------------------------NGTVIIFNLKFDKILMSFKQ-DW--GRVTSLSFRT---DGN-PLLASGRSNGD 268 (910)
T ss_pred ---------------------------CceEEEEEcccCcEEEEEEc-cc--cceeEEEecc---CCC-eeEEeccCCce
Confidence 59999999999999999987 33 3499999984 444 23344477799
Q ss_pred EEEEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeC-CeEEEEEcCCCcceeeeeeecc
Q 000473 242 LQLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLK-DHCIFRLLGSGSTIGEICFVDN 320 (1471)
Q Consensus 242 V~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~-~~~~~~l~d~~~~ige~~~~~~ 320 (1471)
+-+||+++.+. +. ...+.|.+++....|.+..-++.+... ++-.+|.+|+++ |. | +
T Consensus 269 m~~wDLe~kkl---------------~~--v~~nah~~sv~~~~fl~~epVl~ta~~DnSlk~~vfD~~d--g~---p-R 325 (910)
T KOG1539|consen 269 MAFWDLEKKKL---------------IN--VTRNAHYGSVTGATFLPGEPVLVTAGADNSLKVWVFDSGD--GV---P-R 325 (910)
T ss_pred EEEEEcCCCee---------------ee--eeeccccCCcccceecCCCceEeeccCCCceeEEEeeCCC--Cc---c-h
Confidence 99999998742 11 113446777777888888666666544 455677888777 21 2 3
Q ss_pred eeEeecCCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecC--CCCCcccCeeeecCccCCCC
Q 000473 321 LFCLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYM--NEKFDYEPHFEIPAVSYPSG 398 (1471)
Q Consensus 321 ~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~--~~~~~~~~~~~ip~v~~~~~ 398 (1471)
+|-+..+....+..-.++.+.+ ..+.-...|.+.+.|.+.-. +..... .+++.....
T Consensus 326 ~LR~R~GHs~Pp~~irfy~~~g----------------~~ilsa~~Drt~r~fs~~~e~~~~~l~~---~~~~~~~kk-- 384 (910)
T KOG1539|consen 326 LLRSRGGHSAPPSCIRFYGSQG----------------HFILSAKQDRTLRSFSVISESQSQELGQ---LHNKKRAKK-- 384 (910)
T ss_pred heeeccCCCCCchheeeeccCc----------------EEEEecccCcchhhhhhhHHHHhHhhcc---ccccccccc--
Confidence 3433222211111111111110 01111122222222221100 000000 000000000
Q ss_pred ceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCccccee
Q 000473 399 VKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKS 478 (1471)
Q Consensus 399 ~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l 478 (1471)
..+ ........+|++. + -..+.....|..- +
T Consensus 385 --------------~~~-----~~~~~~k~p~i~~-f---------------a~~~~RE~~W~Nv--------------~ 415 (910)
T KOG1539|consen 385 --------------VNV-----FSTEKLKLPPIVE-F---------------AFENAREKEWDNV--------------I 415 (910)
T ss_pred --------------ccc-----cchhhhcCCccee-e---------------ecccchhhhhcce--------------e
Confidence 000 0001111112211 1 1111122344221 2
Q ss_pred ecccccCccccccccCCCCCCCcc---ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGR---DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~---~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
+.......++.|+..+... |.-. ..++.....++|++...+.. ..+.|+..|.|.+++
T Consensus 416 ~~h~~~~~~~tW~~~n~~~-G~~~L~~~~~~~~~~~~~av~vs~CGN----F~~IG~S~G~Id~fN-------------- 476 (910)
T KOG1539|consen 416 TAHKGKRSAYTWNFRNKTS-GRHVLDPKRFKKDDINATAVCVSFCGN----FVFIGYSKGTIDRFN-------------- 476 (910)
T ss_pred EEecCcceEEEEeccCccc-ccEEecCccccccCcceEEEEEeccCc----eEEEeccCCeEEEEE--------------
Confidence 2333344566777766433 2110 11222346788888777776 488999999999944
Q ss_pred ccCCcceEEEE---ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCC
Q 000473 556 KVNSHVSRQYF---LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPP 632 (1471)
Q Consensus 556 d~~s~~~~~~l---~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd 632 (1471)
.++|.....+ ..|.++|+.++.... ++.++|++.||-+++||...+.++..+..- .++..+.++..
T Consensus 477 -mQSGi~r~sf~~~~ah~~~V~gla~D~~---------n~~~vsa~~~Gilkfw~f~~k~l~~~l~l~-~~~~~iv~hr~ 545 (910)
T KOG1539|consen 477 -MQSGIHRKSFGDSPAHKGEVTGLAVDGT---------NRLLVSAGADGILKFWDFKKKVLKKSLRLG-SSITGIVYHRV 545 (910)
T ss_pred -cccCeeecccccCccccCceeEEEecCC---------CceEEEccCcceEEEEecCCcceeeeeccC-CCcceeeeeeh
Confidence 4466666666 589999999998765 789999999999999999988887777543 35677777766
Q ss_pred CCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 633 QTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
...++.+..|-.|+++|..+.+.++.|.||.+.|++++|||||++|++++.| ++|++||+.||.++
T Consensus 546 ------s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrWlisasmD--------~tIr~wDlpt~~lI 611 (910)
T KOG1539|consen 546 ------SDLLAIALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRWLISASMD--------STIRTWDLPTGTLI 611 (910)
T ss_pred ------hhhhhhhcCceeEEEEEchhhhhhHHhhccccceeeeEeCCCCcEEEEeecC--------CcEEEEeccCccee
Confidence 7799999999999999999999999999999999999999999999999999 99999999999999
Q ss_pred EEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccC-CceEeeccccccccc
Q 000473 713 RVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHED-GTFRQSQIQNDERGV 772 (1471)
Q Consensus 713 ~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D-~tir~w~l~~~~~~~ 772 (1471)
..+.-.. ..+.+.|. ++|++.+..+. | .-+..|.-+...+..
T Consensus 612 D~~~vd~-~~~sls~S------------PngD~LAT~Hv-----d~~gIylWsNkslF~~v 654 (910)
T KOG1539|consen 612 DGLLVDS-PCTSLSFS------------PNGDFLATVHV-----DQNGIYLWSNKSLFKSV 654 (910)
T ss_pred eeEecCC-cceeeEEC------------CCCCEEEEEEe-----cCceEEEEEchhHheec
Confidence 8776443 33444444 34455444333 5 357788644444333
No 16
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.93 E-value=5.9e-25 Score=265.45 Aligned_cols=210 Identities=17% Similarity=0.230 Sum_probs=185.3
Q ss_pred ccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEec
Q 000473 501 GRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHR 580 (1471)
Q Consensus 501 ~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~sp 580 (1471)
...++.+|.++|....+.|+.. .|+++++|+++++|. ..+..+.-.++||..+|+++.|+|
T Consensus 443 ~~~~L~GH~GPVyg~sFsPd~r----fLlScSED~svRLWs---------------l~t~s~~V~y~GH~~PVwdV~F~P 503 (707)
T KOG0263|consen 443 TSRTLYGHSGPVYGCSFSPDRR----FLLSCSEDSSVRLWS---------------LDTWSCLVIYKGHLAPVWDVQFAP 503 (707)
T ss_pred eeEEeecCCCceeeeeeccccc----ceeeccCCcceeeee---------------cccceeEEEecCCCcceeeEEecC
Confidence 3445789999999988555554 699999999999944 345567788899999999999999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR 660 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~ 660 (1471)
- |.+++|||.|++.++|....-.+++.|.+|-+.|.|+.|+|+ .+++++||.|++||+||+.+|.
T Consensus 504 ~---------GyYFatas~D~tArLWs~d~~~PlRifaghlsDV~cv~FHPN------s~Y~aTGSsD~tVRlWDv~~G~ 568 (707)
T KOG0263|consen 504 R---------GYYFATASHDQTARLWSTDHNKPLRIFAGHLSDVDCVSFHPN------SNYVATGSSDRTVRLWDVSTGN 568 (707)
T ss_pred C---------ceEEEecCCCceeeeeecccCCchhhhcccccccceEEECCc------ccccccCCCCceEEEEEcCCCc
Confidence 6 899999999999999999999999999999999999999999 8999999999999999999999
Q ss_pred EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceE
Q 000473 661 VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSV 740 (1471)
Q Consensus 661 ~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v 740 (1471)
.++.|.||.++|.+++|+|+|+||++|+.| |.|.+||+.+|.++..+.||++.+..+.|+.
T Consensus 569 ~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed--------~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~----------- 629 (707)
T KOG0263|consen 569 SVRIFTGHKGPVTALAFSPCGRYLASGDED--------GLIKIWDLANGSLVKQLKGHTGTIYSLSFSR----------- 629 (707)
T ss_pred EEEEecCCCCceEEEEEcCCCceEeecccC--------CcEEEEEcCCCcchhhhhcccCceeEEEEec-----------
Confidence 999999999999999999999999999999 9999999999999999999999888898883
Q ss_pred EcCCccccccceeeccCCceEeecccccc
Q 000473 741 LNGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 741 ~~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
.|+..+++. .|.++|.|++....
T Consensus 630 -dg~vLasgg-----~DnsV~lWD~~~~~ 652 (707)
T KOG0263|consen 630 -DGNVLASGG-----ADNSVRLWDLTKVI 652 (707)
T ss_pred -CCCEEEecC-----CCCeEEEEEchhhc
Confidence 223333333 39999999985433
No 17
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.92 E-value=4.6e-24 Score=237.50 Aligned_cols=151 Identities=21% Similarity=0.312 Sum_probs=133.9
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc------cCcCCCEEEEEECC
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK------GWSFNEVLVSGSMD 600 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~------~~~~~~~L~SGs~D 600 (1471)
.+++++.|.++++|. ..++.+...+.+|.-+|.|++|.|....++ +...++++++||.|
T Consensus 249 i~As~s~dqtl~vW~---------------~~t~~~k~~lR~hEh~vEci~wap~~~~~~i~~at~~~~~~~~l~s~SrD 313 (406)
T KOG0295|consen 249 IIASCSNDQTLRVWV---------------VATKQCKAELREHEHPVECIAWAPESSYPSISEATGSTNGGQVLGSGSRD 313 (406)
T ss_pred EEEecCCCceEEEEE---------------eccchhhhhhhccccceEEEEecccccCcchhhccCCCCCccEEEeeccc
Confidence 467778888999833 445677788999999999999988753222 11235799999999
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCC
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCP 680 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spd 680 (1471)
++|++||+.+|.++.++.+|...|..++|+|. |++|+|+.+|+++++||+++++|...++.|+..|+++.|+.+
T Consensus 314 ktIk~wdv~tg~cL~tL~ghdnwVr~~af~p~------Gkyi~ScaDDktlrvwdl~~~~cmk~~~ah~hfvt~lDfh~~ 387 (406)
T KOG0295|consen 314 KTIKIWDVSTGMCLFTLVGHDNWVRGVAFSPG------GKYILSCADDKTLRVWDLKNLQCMKTLEAHEHFVTSLDFHKT 387 (406)
T ss_pred ceEEEEeccCCeEEEEEecccceeeeeEEcCC------CeEEEEEecCCcEEEEEeccceeeeccCCCcceeEEEecCCC
Confidence 99999999999999999999999999999998 999999999999999999999999999999999999999999
Q ss_pred CCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 681 RGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 681 g~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
..|+++|+-| .++++|.-
T Consensus 388 ~p~VvTGsVd--------qt~KvwEc 405 (406)
T KOG0295|consen 388 APYVVTGSVD--------QTVKVWEC 405 (406)
T ss_pred CceEEecccc--------ceeeeeec
Confidence 9999999999 99999974
No 18
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.92 E-value=9.4e-23 Score=233.11 Aligned_cols=168 Identities=17% Similarity=0.255 Sum_probs=142.2
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
.+|.+.|.++.+.+... .|++++.|++++| |. .....+...|.+|+..|..+.|+|+....
T Consensus 356 ~GH~g~V~alk~n~tg~----LLaS~SdD~Tlki--Ws-------------~~~~~~~~~l~~Hskei~t~~wsp~g~v~ 416 (524)
T KOG0273|consen 356 IGHHGEVNALKWNPTGS----LLASCSDDGTLKI--WS-------------MGQSNSVHDLQAHSKEIYTIKWSPTGPVT 416 (524)
T ss_pred ecccCceEEEEECCCCc----eEEEecCCCeeEe--ee-------------cCCCcchhhhhhhccceeeEeecCCCCcc
Confidence 45667777777444443 7999999999999 33 23345677889999999999999974333
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF 665 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~ 665 (1471)
.....+..+++++.|++|++||+..|.++++|..|+.+|.+|+|+|+ |.++++|+.||.|.+|+.++++.++.+
T Consensus 417 ~n~~~~~~l~sas~dstV~lwdv~~gv~i~~f~kH~~pVysvafS~~------g~ylAsGs~dg~V~iws~~~~~l~~s~ 490 (524)
T KOG0273|consen 417 SNPNMNLMLASASFDSTVKLWDVESGVPIHTLMKHQEPVYSVAFSPN------GRYLASGSLDGCVHIWSTKTGKLVKSY 490 (524)
T ss_pred CCCcCCceEEEeecCCeEEEEEccCCceeEeeccCCCceEEEEecCC------CcEEEecCCCCeeEeccccchheeEee
Confidence 23445779999999999999999999999999999999999999999 999999999999999999999999998
Q ss_pred cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 666 PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 666 ~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.+. +.|..++|+.+|++|..+-.| +.+.+-|++
T Consensus 491 ~~~-~~Ifel~Wn~~G~kl~~~~sd--------~~vcvldlr 523 (524)
T KOG0273|consen 491 QGT-GGIFELCWNAAGDKLGACASD--------GSVCVLDLR 523 (524)
T ss_pred cCC-CeEEEEEEcCCCCEEEEEecC--------CCceEEEec
Confidence 764 459999999999999887777 999998875
No 19
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.91 E-value=1.6e-22 Score=231.33 Aligned_cols=161 Identities=24% Similarity=0.323 Sum_probs=144.9
Q ss_pred EEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE
Q 000473 528 IVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD 607 (1471)
Q Consensus 528 lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD 607 (1471)
+++.+.||.|+|+..+ ...++.++.||++.|.++.|.|. +.+|+|+|.|+|+++|.
T Consensus 332 F~ts~td~~i~V~kv~---------------~~~P~~t~~GH~g~V~alk~n~t---------g~LLaS~SdD~TlkiWs 387 (524)
T KOG0273|consen 332 FATSSTDGCIHVCKVG---------------EDRPVKTFIGHHGEVNALKWNPT---------GSLLASCSDDGTLKIWS 387 (524)
T ss_pred EeecCCCceEEEEEec---------------CCCcceeeecccCceEEEEECCC---------CceEEEecCCCeeEeee
Confidence 5566678889986544 23678899999999999999997 89999999999999999
Q ss_pred CCCCceEEEEeccCCCEEEEEECCCCCC--CC-CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEE
Q 000473 608 LGSGNLITVMHHHVAPVRQIILSPPQTE--HP-WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYI 684 (1471)
Q Consensus 608 l~tg~~l~~~~~H~~~V~~l~fspd~~~--~~-~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L 684 (1471)
.....+.+.|.+|...|..+.|+|..+. .| .+..+++++.|++|++||+..+.+++.|..|..+|++++|+|+|+|+
T Consensus 388 ~~~~~~~~~l~~Hskei~t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~gv~i~~f~kH~~pVysvafS~~g~yl 467 (524)
T KOG0273|consen 388 MGQSNSVHDLQAHSKEIYTIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVESGVPIHTLMKHQEPVYSVAFSPNGRYL 467 (524)
T ss_pred cCCCcchhhhhhhccceeeEeecCCCCccCCCcCCceEEEeecCCeEEEEEccCCceeEeeccCCCceEEEEecCCCcEE
Confidence 9999999999999999999999998533 33 35589999999999999999999999999999999999999999999
Q ss_pred EEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCC
Q 000473 685 ACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTAS 720 (1471)
Q Consensus 685 ~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~ 720 (1471)
++|+.| |.|.+|++++|++.+.+.+...
T Consensus 468 AsGs~d--------g~V~iws~~~~~l~~s~~~~~~ 495 (524)
T KOG0273|consen 468 ASGSLD--------GCVHIWSTKTGKLVKSYQGTGG 495 (524)
T ss_pred EecCCC--------CeeEeccccchheeEeecCCCe
Confidence 999999 9999999999999999988764
No 20
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.91 E-value=3e-22 Score=217.70 Aligned_cols=141 Identities=19% Similarity=0.245 Sum_probs=129.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++|+.|+..++ ||+.++.++++|.||...|+++.|+|+ +.-+++||.|++.++|
T Consensus 201 tFvSg~cD~~akl---------------WD~R~~~c~qtF~ghesDINsv~ffP~---------G~afatGSDD~tcRly 256 (343)
T KOG0286|consen 201 TFVSGGCDKSAKL---------------WDVRSGQCVQTFEGHESDINSVRFFPS---------GDAFATGSDDATCRLY 256 (343)
T ss_pred eEEecccccceee---------------eeccCcceeEeecccccccceEEEccC---------CCeeeecCCCceeEEE
Confidence 5788888998888 456778899999999999999999997 8999999999999999
Q ss_pred ECCCCceEEEEecc--CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEE
Q 000473 607 DLGSGNLITVMHHH--VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYI 684 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H--~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L 684 (1471)
|++..+.+..|... ..+|++++|+-. |+++.+|..|.++.+||.-.++.+..+.||..+|.++..+|||--+
T Consensus 257 DlRaD~~~a~ys~~~~~~gitSv~FS~S------GRlLfagy~d~~c~vWDtlk~e~vg~L~GHeNRvScl~~s~DG~av 330 (343)
T KOG0286|consen 257 DLRADQELAVYSHDSIICGITSVAFSKS------GRLLFAGYDDFTCNVWDTLKGERVGVLAGHENRVSCLGVSPDGMAV 330 (343)
T ss_pred eecCCcEEeeeccCcccCCceeEEEccc------ccEEEeeecCCceeEeeccccceEEEeeccCCeeEEEEECCCCcEE
Confidence 99999988888642 358999999998 9999999999999999999999999999999999999999999999
Q ss_pred EEEEcCCCCCCCCCCEEEEEE
Q 000473 685 ACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 685 ~sgs~D~sg~~D~~gtV~VWD 705 (1471)
++|+.| .+++||.
T Consensus 331 ~TgSWD--------s~lriW~ 343 (343)
T KOG0286|consen 331 ATGSWD--------STLRIWA 343 (343)
T ss_pred Eecchh--------HheeecC
Confidence 999988 8999994
No 21
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.91 E-value=6.1e-22 Score=210.95 Aligned_cols=161 Identities=17% Similarity=0.324 Sum_probs=136.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE-ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF-LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l-~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
-|++|..+|.|+| ||+ ....+.+.+ ..-.-.|.++.+.|| +..++.+..-|+..+
T Consensus 138 eLis~dqsg~irv--WDl-------------~~~~c~~~liPe~~~~i~sl~v~~d---------gsml~a~nnkG~cyv 193 (311)
T KOG0315|consen 138 ELISGDQSGNIRV--WDL-------------GENSCTHELIPEDDTSIQSLTVMPD---------GSMLAAANNKGNCYV 193 (311)
T ss_pred eEEeecCCCcEEE--EEc-------------cCCccccccCCCCCcceeeEEEcCC---------CcEEEEecCCccEEE
Confidence 4777778888888 442 112222222 223357889999998 899999999999999
Q ss_pred EECCCCc------eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEc
Q 000473 606 WDLGSGN------LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWD 678 (1471)
Q Consensus 606 WDl~tg~------~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~s 678 (1471)
|++-++. ++++|+.|.+-+..+.++|+ +++|+++|.|++|+||+.++. +....+.+|...++..+||
T Consensus 194 W~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd------~k~lat~ssdktv~iwn~~~~~kle~~l~gh~rWvWdc~FS 267 (311)
T KOG0315|consen 194 WRLLNHQTASELEPVHKFQAHNGHILRCLLSPD------VKYLATCSSDKTVKIWNTDDFFKLELVLTGHQRWVWDCAFS 267 (311)
T ss_pred EEccCCCccccceEhhheecccceEEEEEECCC------CcEEEeecCCceEEEEecCCceeeEEEeecCCceEEeeeec
Confidence 9997643 66889999999999999999 899999999999999999988 7778899999999999999
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceee
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFD 725 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~ 725 (1471)
.||.||++|+.| +.+++||+..|+.++...||....++.
T Consensus 268 ~dg~YlvTassd--------~~~rlW~~~~~k~v~qy~gh~K~~vc~ 306 (311)
T KOG0315|consen 268 ADGEYLVTASSD--------HTARLWDLSAGKEVRQYQGHHKAAVCV 306 (311)
T ss_pred cCccEEEecCCC--------CceeecccccCceeeecCCcccccEEE
Confidence 999999999998 999999999999999999998766654
No 22
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.90 E-value=2.5e-23 Score=251.52 Aligned_cols=224 Identities=23% Similarity=0.321 Sum_probs=188.9
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc-----cccC-----------CCCCCccccCCcceE
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL-----FERH-----------NSPGASLKVNSHVSR 563 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~-----l~~~-----------d~~~~~~d~~s~~~~ 563 (1471)
-+.+++.--...++|..+..+.. .+|+|+.|..|++|.+.. +... |-..+.-|..+....
T Consensus 369 ic~YT~~nt~~~v~ca~fSddss----mlA~Gf~dS~i~~~Sl~p~kl~~lk~~~~l~~~d~~sad~~~~~~D~~~~~~~ 444 (707)
T KOG0263|consen 369 ICMYTFHNTYQGVTCAEFSDDSS----MLACGFVDSSVRVWSLTPKKLKKLKDASDLSNIDTESADVDVDMLDDDSSGTS 444 (707)
T ss_pred EEEEEEEEcCCcceeEeecCCcc----hhhccccccEEEEEecchhhhccccchhhhccccccccchhhhhccccCCcee
Confidence 44555555556788887555555 899999999999976651 1111 001122344445566
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEE
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFL 643 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~ 643 (1471)
+++.||+++|....|+|+ .++|+|+|.|++||+|++.+..++-.+++|..||+.+.|+|. |.+||
T Consensus 445 ~~L~GH~GPVyg~sFsPd---------~rfLlScSED~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P~------GyYFa 509 (707)
T KOG0263|consen 445 RTLYGHSGPVYGCSFSPD---------RRFLLSCSEDSSVRLWSLDTWSCLVIYKGHLAPVWDVQFAPR------GYYFA 509 (707)
T ss_pred EEeecCCCceeeeeeccc---------ccceeeccCCcceeeeecccceeEEEecCCCcceeeEEecCC------ceEEE
Confidence 779999999999999998 899999999999999999999999999999999999999998 99999
Q ss_pred EEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCce
Q 000473 644 SVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSM 723 (1471)
Q Consensus 644 S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~ 723 (1471)
|+|.|++-++|......+++.|.||.+.|.|+.|+|+..|+++|+.| .+||+||+.+|..+|.+.||.+.|+
T Consensus 510 tas~D~tArLWs~d~~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD--------~tVRlWDv~~G~~VRiF~GH~~~V~ 581 (707)
T KOG0263|consen 510 TASHDQTARLWSTDHNKPLRIFAGHLSDVDCVSFHPNSNYVATGSSD--------RTVRLWDVSTGNSVRIFTGHKGPVT 581 (707)
T ss_pred ecCCCceeeeeecccCCchhhhcccccccceEEECCcccccccCCCC--------ceEEEEEcCCCcEEEEecCCCCceE
Confidence 99999999999999999999999999999999999999999999998 9999999999999999999999999
Q ss_pred eeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 724 FDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 724 ~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
.+.|++ +|.+.+++. +|+.|+.|++.+
T Consensus 582 al~~Sp------------~Gr~LaSg~-----ed~~I~iWDl~~ 608 (707)
T KOG0263|consen 582 ALAFSP------------CGRYLASGD-----EDGLIKIWDLAN 608 (707)
T ss_pred EEEEcC------------CCceEeecc-----cCCcEEEEEcCC
Confidence 998883 344544443 499999999844
No 23
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.90 E-value=1.6e-20 Score=223.06 Aligned_cols=522 Identities=14% Similarity=0.151 Sum_probs=313.6
Q ss_pred CCceEEEEEEcCCCC---eEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccc
Q 000473 15 PSHRVTATSALTQPP---TLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENS 91 (1471)
Q Consensus 15 p~h~Vtava~SpDg~---~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~ 91 (1471)
+..+|+|+.+.|+.+ ++++|+.||+|++|.+.. .....+..+.||...+.|+.-
T Consensus 53 H~a~VnC~~~l~~s~~~a~~vsG~sD~~v~lW~l~~---~~~~~i~~~~g~~~~~~cv~a-------------------- 109 (764)
T KOG1063|consen 53 HVARVNCVHWLPTSEIVAEMVSGDSDGRVILWKLRD---EYLIKIYTIQGHCKECVCVVA-------------------- 109 (764)
T ss_pred CccceEEEEEcccccccceEEEccCCCcEEEEEEee---hheEEEEeecCcceeEEEEEe--------------------
Confidence 456799999999876 899999999999999983 334556677888988888861
Q ss_pred ccccccccCCCCEEEE-EeCCCeEEEEEcCCCe--EEEeeeCCCCCCCCcEEEEcC-CCCeEEEEcceecccCCcccccc
Q 000473 92 SNVMGKSSLDNGALIS-ACTDGVLCVWSRSSGH--CRRRRKLPPWVGSPSVICTLP-SNPRYVCIGCCFIDTNQLSDHHS 167 (1471)
Q Consensus 92 ~~~~~~~s~d~~~LaS-as~DG~I~VWdv~~G~--ci~~~~l~~~~g~~~~i~~~s-~~~~ll~~G~~~id~~~~~~~h~ 167 (1471)
...+.+ .+.|+++.+||....+ |.+........-.|..+.+.+ ++.-++++|..
T Consensus 110 ----------~~~~~~~~~ad~~v~vw~~~~~e~~~~~~~rf~~k~~ipLcL~~~~~~~~~lla~Ggs------------ 167 (764)
T KOG1063|consen 110 ----------RSSVMTCKAADGTVSVWDKQQDEVFLLAVLRFEIKEAIPLCLAALKNNKTFLLACGGS------------ 167 (764)
T ss_pred ----------eeeEEEeeccCceEEEeecCCCceeeehheehhhhhHhhHHHhhhccCCcEEEEecCc------------
Confidence 111222 3789999999996555 333322211112344444555 33456778875
Q ss_pred cccccccccccccCCCCCCCCCceEEEEeCcce--EEEEEeecCccccCCeEEEEEeeecCCCCceeEEEE--eCCCcEE
Q 000473 168 FESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGL--TIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMV--DSVGRLQ 243 (1471)
Q Consensus 168 ~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~--~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llva--s~dG~V~ 243 (1471)
+..|.++--.+. ..+..+.. +.|||..++|.. +.+ ++++++ +.|..||
T Consensus 168 ---------------------~~~v~~~s~~~d~f~~v~el~G---H~DWIrsl~f~~--~~~--~~~~laS~SQD~yIR 219 (764)
T KOG1063|consen 168 ---------------------KFVVDLYSSSADSFARVAELEG---HTDWIRSLAFAR--LGG--DDLLLASSSQDRYIR 219 (764)
T ss_pred ---------------------ceEEEEeccCCcceeEEEEeec---cchhhhhhhhhc--cCC--CcEEEEecCCceEEE
Confidence 233434333322 22333333 579999999873 222 234444 9999999
Q ss_pred EEECCCCCCccc---ccCCCcccCC--Cc----cc----ceeccCCcccCceEEEEecCCcEEEEE-eCCeEEEEEcCCC
Q 000473 244 LVPISKESHLDR---EEGNGLCKSS--SQ----LD----MAILQNGVVEGGHLVSVATCGNIIALV-LKDHCIFRLLGSG 309 (1471)
Q Consensus 244 vW~l~~~~~~~~---~~~~~l~~~e--~~----i~----~v~~~~~~~~~~~~vs~s~~g~~l~~~-~~~~~~~~l~d~~ 309 (1471)
+|.+.-....+. +....++.+. .. +. .=....+|..-+.++-..|.+..|++. .++..++|-.+..
T Consensus 220 iW~i~~~~~~~~~~~e~~~t~~~~~~~f~~l~~i~~~is~eall~GHeDWV~sv~W~p~~~~LLSASaDksmiiW~pd~~ 299 (764)
T KOG1063|consen 220 IWRIVLGDDEDSNEREDSLTTLSNLPVFMILEEIQYRISFEALLMGHEDWVYSVWWHPEGLDLLSASADKSMIIWKPDEN 299 (764)
T ss_pred EEEEEecCCccccccccccccccCCceeeeeeeEEEEEehhhhhcCcccceEEEEEccchhhheecccCcceEEEecCCc
Confidence 999875521110 0000111111 00 10 001123566778888888888555554 4455677544444
Q ss_pred cceeeeeeecceeEe-ecCCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCee
Q 000473 310 STIGEICFVDNLFCL-EGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHF 388 (1471)
Q Consensus 310 ~~ige~~~~~~~l~~-~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~ 388 (1471)
+-+ +. ...++ +.+.....+.++.+...+ +-++-|+..|..++|+ . .+ ...|
T Consensus 300 tGi----Wv-~~vRlGe~gg~a~GF~g~lw~~n~----------------~~ii~~g~~Gg~hlWk-t-~d-----~~~w 351 (764)
T KOG1063|consen 300 TGI----WV-DVVRLGEVGGSAGGFWGGLWSPNS----------------NVIIAHGRTGGFHLWK-T-KD-----KTFW 351 (764)
T ss_pred cce----EE-EEEEeecccccccceeeEEEcCCC----------------CEEEEecccCcEEEEe-c-cC-----ccce
Confidence 311 22 11222 222112235566665332 2456677788888887 1 11 1223
Q ss_pred -eecCccCCCCceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccc
Q 000473 389 -EIPAVSYPSGVKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFL 467 (1471)
Q Consensus 389 -~ip~v~~~~~~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~ 467 (1471)
..|.+.+.-++.-.+.+.+.|.||.++.. +.+++++.-- ++ ...
T Consensus 352 ~~~~~iSGH~~~V~dv~W~psGeflLsvs~-----------DQTTRlFa~w------g~----------q~~-------- 396 (764)
T KOG1063|consen 352 TQEPVISGHVDGVKDVDWDPSGEFLLSVSL-----------DQTTRLFARW------GR----------QQE-------- 396 (764)
T ss_pred eeccccccccccceeeeecCCCCEEEEecc-----------ccceeeeccc------cc----------ccc--------
Confidence 22233333344456677777777777652 2234444211 00 000
Q ss_pred cCCCCcccceeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc---
Q 000473 468 DENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL--- 544 (1471)
Q Consensus 468 ~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~--- 544 (1471)
|.- ....+-|+-.++|+.++.... ++++|.+..-++++....
T Consensus 397 ----------------------wHE---------iaRPQiHGyDl~c~~~vn~~~----~FVSgAdEKVlRvF~aPk~fv 441 (764)
T KOG1063|consen 397 ----------------------WHE---------IARPQIHGYDLTCLSFVNEDL----QFVSGADEKVLRVFEAPKSFV 441 (764)
T ss_pred ----------------------eee---------ecccccccccceeeehccCCc----eeeecccceeeeeecCcHHHH
Confidence 111 111234555566665544321 456666555566643210
Q ss_pred -----ccc-----CC-----------------------CCCCc------------------cccC-------CcceEEEE
Q 000473 545 -----FER-----HN-----------------------SPGAS------------------LKVN-------SHVSRQYF 566 (1471)
Q Consensus 545 -----l~~-----~d-----------------------~~~~~------------------~d~~-------s~~~~~~l 566 (1471)
+.+ .+ ..+.. +..+ --..++.|
T Consensus 442 ~~l~~i~g~~~~~~~~~p~gA~VpaLGLSnKa~~~~e~~~G~~~~~~~et~~~~~p~~L~ePP~EdqLq~~tLwPEv~KL 521 (764)
T KOG1063|consen 442 KSLMAICGKCFKGSDELPDGANVPALGLSNKAFFPGETNTGGEAAVCAETPLAAAPCELTEPPTEDQLQQNTLWPEVHKL 521 (764)
T ss_pred HHHHHHhCccccCchhcccccccccccccCCCCcccccccccccceeeecccccCchhccCCChHHHHHHhccchhhHHh
Confidence 000 00 00000 0000 00123567
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCC-----cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCE
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDC-----SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dg-----tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~ 641 (1471)
+||...|+|++.+|+ +++++|+.... .|++|+..+......+.+|.-.|+.++|+|| +++
T Consensus 522 YGHGyEv~~l~~s~~---------gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~HsLTVT~l~FSpd------g~~ 586 (764)
T KOG1063|consen 522 YGHGYEVYALAISPT---------GNLIASACKSSLKEHAVIRLWNTANWLQVQELEGHSLTVTRLAFSPD------GRY 586 (764)
T ss_pred ccCceeEEEEEecCC---------CCEEeehhhhCCccceEEEEEeccchhhhheecccceEEEEEEECCC------CcE
Confidence 899999999999997 89999987553 6899999999988999999999999999999 999
Q ss_pred EEEEeCCCcEEEEECCCCcE----EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC--eEEE--
Q 000473 642 FLSVGEDFSVALASLETLRV----ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG--ARER-- 713 (1471)
Q Consensus 642 l~S~s~DgsV~lWdl~t~~~----l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg--~~~~-- 713 (1471)
|+++|.|+++.||....... ......|..-|++..|+|++.+++|++.| .+|+||..... +.+.
T Consensus 587 LLsvsRDRt~sl~~~~~~~~~e~~fa~~k~HtRIIWdcsW~pde~~FaTaSRD--------K~VkVW~~~~~~d~~i~~~ 658 (764)
T KOG1063|consen 587 LLSVSRDRTVSLYEVQEDIKDEFRFACLKAHTRIIWDCSWSPDEKYFATASRD--------KKVKVWEEPDLRDKYISRF 658 (764)
T ss_pred EEEeecCceEEeeeeecccchhhhhccccccceEEEEcccCcccceeEEecCC--------ceEEEEeccCchhhhhhhh
Confidence 99999999999998854321 12256799999999999999999999999 99999998876 3332
Q ss_pred EEeCCCCCceeeeeeec
Q 000473 714 VLRGTASHSMFDHFCKG 730 (1471)
Q Consensus 714 ~l~gH~~~v~~~~~~~~ 730 (1471)
....+...|+.+.+++-
T Consensus 659 a~~~~~~aVTAv~~~~~ 675 (764)
T KOG1063|consen 659 ACLKFSLAVTAVAYLPV 675 (764)
T ss_pred chhccCCceeeEEeecc
Confidence 23457778888888853
No 24
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.90 E-value=1.1e-21 Score=214.31 Aligned_cols=140 Identities=20% Similarity=0.336 Sum_probs=120.5
Q ss_pred EEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE
Q 000473 528 IVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD 607 (1471)
Q Consensus 528 lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD 607 (1471)
+++|.-|+.|++ |+ ...+....++.||.+.|+.+..+|+ +.++.|-++|.++++||
T Consensus 189 v~sggIdn~ikv--Wd-------------~r~~d~~~~lsGh~DtIt~lsls~~---------gs~llsnsMd~tvrvwd 244 (338)
T KOG0265|consen 189 VISGGIDNDIKV--WD-------------LRKNDGLYTLSGHADTITGLSLSRY---------GSFLLSNSMDNTVRVWD 244 (338)
T ss_pred eeeccccCceee--ec-------------cccCcceEEeecccCceeeEEeccC---------CCccccccccceEEEEE
Confidence 556777889998 44 3344667899999999999999997 89999999999999999
Q ss_pred CCC----CceEEEEeccCC----CEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC
Q 000473 608 LGS----GNLITVMHHHVA----PVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 608 l~t----g~~l~~~~~H~~----~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp 679 (1471)
++. .+++..|.+|.- -.....|+|+ +..+-.|+.|+.|.+||....+.+..++||.+.|.++.|+|
T Consensus 245 ~rp~~p~~R~v~if~g~~hnfeknlL~cswsp~------~~~i~ags~dr~vyvwd~~~r~~lyklpGh~gsvn~~~Fhp 318 (338)
T KOG0265|consen 245 VRPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPN------GTKITAGSADRFVYVWDTTSRRILYKLPGHYGSVNEVDFHP 318 (338)
T ss_pred ecccCCCCceEEEeecchhhhhhhcceeeccCC------CCccccccccceEEEeecccccEEEEcCCcceeEEEeeecC
Confidence 973 346888887643 3456788998 88999999999999999999999999999999999999999
Q ss_pred CCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 680 PRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 680 dg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
...+|.+++.| .+|++=+
T Consensus 319 ~e~iils~~sd--------k~i~lge 336 (338)
T KOG0265|consen 319 TEPIILSCSSD--------KTIYLGE 336 (338)
T ss_pred CCcEEEEeccC--------ceeEeec
Confidence 99999999988 8888733
No 25
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.90 E-value=6.8e-23 Score=239.67 Aligned_cols=231 Identities=20% Similarity=0.231 Sum_probs=196.0
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.++..++.+.+|+.... .+...+.-...+|.+..++.... .+++|+.|+.|+|++++
T Consensus 28 ~la~LynG~V~IWnyetq----tmVksfeV~~~PvRa~kfiaRkn----Wiv~GsDD~~IrVfnyn-------------- 85 (794)
T KOG0276|consen 28 ILAALYNGDVQIWNYETQ----TMVKSFEVSEVPVRAAKFIARKN----WIVTGSDDMQIRVFNYN-------------- 85 (794)
T ss_pred EEEeeecCeeEEEecccc----eeeeeeeecccchhhheeeeccc----eEEEecCCceEEEEecc--------------
Confidence 344556788999998764 34455665667788877666555 79999999999997766
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC-CceEEEEeccCCCEEEEEECCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS-GNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t-g~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
++..+..|..|.+.+.|++.||. ..+++|+|.|-+|++||-+. ..+.++|.+|...|.+++|+|..
T Consensus 86 -t~ekV~~FeAH~DyIR~iavHPt---------~P~vLtsSDDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD--- 152 (794)
T KOG0276|consen 86 -TGEKVKTFEAHSDYIRSIAVHPT---------LPYVLTSSDDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKD--- 152 (794)
T ss_pred -cceeeEEeeccccceeeeeecCC---------CCeEEecCCccEEEEeeccCceeeeeEEcCcceEEEEEEecCCC---
Confidence 45678999999999999999997 78999999999999999875 46789999999999999999984
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCC--CCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCP--RGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spd--g~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
.+.|+|+|-|++|++|++....+..++.||...|++|.+-+. ..||++|+.| .+|+|||.+|.+|+++
T Consensus 153 --~ntFaS~sLDrTVKVWslgs~~~nfTl~gHekGVN~Vdyy~~gdkpylIsgaDD--------~tiKvWDyQtk~CV~T 222 (794)
T KOG0276|consen 153 --PNTFASASLDRTVKVWSLGSPHPNFTLEGHEKGVNCVDYYTGGDKPYLISGADD--------LTIKVWDYQTKSCVQT 222 (794)
T ss_pred --ccceeeeeccccEEEEEcCCCCCceeeeccccCcceEEeccCCCcceEEecCCC--------ceEEEeecchHHHHHH
Confidence 679999999999999999999999999999999999999664 4699999999 9999999999999999
Q ss_pred EeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 715 LRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 715 l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
+.||...|-.+.|.|+... +.+.++|||+|+|+..++..
T Consensus 223 LeGHt~Nvs~v~fhp~lpi-----------------iisgsEDGTvriWhs~Ty~l 261 (794)
T KOG0276|consen 223 LEGHTNNVSFVFFHPELPI-----------------IISGSEDGTVRIWNSKTYKL 261 (794)
T ss_pred hhcccccceEEEecCCCcE-----------------EEEecCCccEEEecCcceeh
Confidence 9999999988888865431 22445699999999766554
No 26
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.89 E-value=9.4e-22 Score=213.01 Aligned_cols=277 Identities=17% Similarity=0.163 Sum_probs=206.7
Q ss_pred eecccccCccccccccC-CCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 478 SDLTFCQDTVPRSEHVD-SRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~-~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
+...+.+.++-+|++.. ....|.....+.+|...|+.+.+.++.. +.++|+.||++++ ||
T Consensus 31 l~sasrDk~ii~W~L~~dd~~~G~~~r~~~GHsH~v~dv~~s~dg~----~alS~swD~~lrl---------------WD 91 (315)
T KOG0279|consen 31 LVSASRDKTIIVWKLTSDDIKYGVPVRRLTGHSHFVSDVVLSSDGN----FALSASWDGTLRL---------------WD 91 (315)
T ss_pred EEEcccceEEEEEEeccCccccCceeeeeeccceEecceEEccCCc----eEEeccccceEEE---------------EE
Confidence 34445556666777543 3445666778899999999999888887 6999999999999 44
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc--CCCEEEEEECCCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH--VAPVRQIILSPPQT 634 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H--~~~V~~l~fspd~~ 634 (1471)
..++++.+.|.||+..|.+++|++| ++.++|||.|.+|++|++. |.+..++..+ .+-|.++.|+|++
T Consensus 92 l~~g~~t~~f~GH~~dVlsva~s~d---------n~qivSGSrDkTiklwnt~-g~ck~t~~~~~~~~WVscvrfsP~~- 160 (315)
T KOG0279|consen 92 LATGESTRRFVGHTKDVLSVAFSTD---------NRQIVSGSRDKTIKLWNTL-GVCKYTIHEDSHREWVSCVRFSPNE- 160 (315)
T ss_pred ecCCcEEEEEEecCCceEEEEecCC---------CceeecCCCcceeeeeeec-ccEEEEEecCCCcCcEEEEEEcCCC-
Confidence 6677899999999999999999998 8999999999999999986 5566666554 6889999999983
Q ss_pred CCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 635 EHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 635 ~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
..-+|+++|.|++|++||+++.+...++.||.+.++.+++||||...++|+.| |.+.+||++.++.+..
T Consensus 161 ---~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpDGslcasGgkd--------g~~~LwdL~~~k~lys 229 (315)
T KOG0279|consen 161 ---SNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPDGSLCASGGKD--------GEAMLWDLNEGKNLYS 229 (315)
T ss_pred ---CCcEEEEccCCceEEEEccCCcchhhccccccccEEEEEECCCCCEEecCCCC--------ceEEEEEccCCceeEe
Confidence 24589999999999999999999999999999999999999999999999998 9999999999999777
Q ss_pred EeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccccccccccccCCCCCccccccCCCCCCC
Q 000473 715 LRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGVAFSTISEPSASHVRKGNSGKPS 794 (1471)
Q Consensus 715 l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 794 (1471)
+. |...|-...|.| ....+....+..||+|++..... ++.-+.+..
T Consensus 230 l~-a~~~v~sl~fsp------------------nrywL~~at~~sIkIwdl~~~~~--------------v~~l~~d~~- 275 (315)
T KOG0279|consen 230 LE-AFDIVNSLCFSP------------------NRYWLCAATATSIKIWDLESKAV--------------VEELKLDGI- 275 (315)
T ss_pred cc-CCCeEeeEEecC------------------CceeEeeccCCceEEEeccchhh--------------hhhcccccc-
Confidence 65 444444444553 11222333466799999732110 110011100
Q ss_pred CCCcccccccccccccCCCCCcceEEEechhhccccc
Q 000473 795 LNTRIGLQRKKQTIKCSCPYPGIATLSFDLASLMFPY 831 (1471)
Q Consensus 795 ~~~~~~~~~~~~~~~~~~~~~~~~~l~fd~e~l~~~~ 831 (1471)
.|........-+-+.|+..|++.+.=+-++.|...
T Consensus 276 --g~s~~~~~~~clslaws~dG~tLf~g~td~~irv~ 310 (315)
T KOG0279|consen 276 --GPSSKAGDPICLSLAWSADGQTLFAGYTDNVIRVW 310 (315)
T ss_pred --ccccccCCcEEEEEEEcCCCcEEEeeecCCcEEEE
Confidence 00000001123345688899999888887776544
No 27
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.89 E-value=1.4e-23 Score=230.86 Aligned_cols=221 Identities=23% Similarity=0.271 Sum_probs=189.4
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
++.....++++++||...- .+...+.+|.+.|.|+. |....+++|+.|.+|+||+
T Consensus 209 kiVSGlrDnTikiWD~n~~----~c~~~L~GHtGSVLCLq------yd~rviisGSSDsTvrvWD--------------- 263 (499)
T KOG0281|consen 209 KIVSGLRDNTIKIWDKNSL----ECLKILTGHTGSVLCLQ------YDERVIVSGSSDSTVRVWD--------------- 263 (499)
T ss_pred hhhcccccCceEEeccccH----HHHHhhhcCCCcEEeee------ccceEEEecCCCceEEEEe---------------
Confidence 4667778999999997653 66778899999999976 5555799999999999944
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc---eEEEEeccCCCEEEEEECCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN---LITVMHHHVAPVRQIILSPPQ 633 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~---~l~~~~~H~~~V~~l~fspd~ 633 (1471)
+++++++.++-+|...|..+.|+ +.+++|+|.|.+|.+||+.+.. +.+++.+|...|..+.|+.
T Consensus 264 v~tge~l~tlihHceaVLhlrf~-----------ng~mvtcSkDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~-- 330 (499)
T KOG0281|consen 264 VNTGEPLNTLIHHCEAVLHLRFS-----------NGYMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDD-- 330 (499)
T ss_pred ccCCchhhHHhhhcceeEEEEEe-----------CCEEEEecCCceeEEEeccCchHHHHHHHHhhhhhheeeecccc--
Confidence 55788999999999999999997 5799999999999999998765 3467889999999999964
Q ss_pred CCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 634 TEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 634 ~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
++++|++.|.+|++|++.++++++++.||...|-|+.+. |+++++|+.| .+|++||++.|.+++
T Consensus 331 ------kyIVsASgDRTikvW~~st~efvRtl~gHkRGIAClQYr--~rlvVSGSSD--------ntIRlwdi~~G~cLR 394 (499)
T KOG0281|consen 331 ------KYIVSASGDRTIKVWSTSTCEFVRTLNGHKRGIACLQYR--DRLVVSGSSD--------NTIRLWDIECGACLR 394 (499)
T ss_pred ------ceEEEecCCceEEEEeccceeeehhhhcccccceehhcc--CeEEEecCCC--------ceEEEEeccccHHHH
Confidence 499999999999999999999999999999999998885 7889888888 999999999999999
Q ss_pred EEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 714 VLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 714 ~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
.+.||..-+-++.|.++ -+.+|.+ ||++|+|+++.-++
T Consensus 395 vLeGHEeLvRciRFd~k--------rIVSGaY-----------DGkikvWdl~aald 432 (499)
T KOG0281|consen 395 VLEGHEELVRCIRFDNK--------RIVSGAY-----------DGKIKVWDLQAALD 432 (499)
T ss_pred HHhchHHhhhheeecCc--------eeeeccc-----------cceEEEEecccccC
Confidence 99999998888888742 1233333 99999999865443
No 28
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.89 E-value=2.2e-20 Score=210.51 Aligned_cols=141 Identities=26% Similarity=0.388 Sum_probs=124.7
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++++.||.+.+| + ..+++....+..|...|.++.|+|+ ++.+++++.|+.|++|
T Consensus 149 ~l~~~~~~~~i~i~--d-------------~~~~~~~~~~~~~~~~i~~~~~~~~---------~~~l~~~~~~~~i~i~ 204 (289)
T cd00200 149 FVASSSQDGTIKLW--D-------------LRTGKCVATLTGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLW 204 (289)
T ss_pred EEEEEcCCCcEEEE--E-------------ccccccceeEecCccccceEEECCC---------cCEEEEecCCCcEEEE
Confidence 35556668888883 3 3344566778889999999999997 7788899899999999
Q ss_pred ECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEE
Q 000473 607 DLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIAC 686 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~s 686 (1471)
|+.+++.+..+..|...|.++.|+|+ +..+++++.|+.|++||+.+++.+..+.+|...|.+++|+|++.+|++
T Consensus 205 d~~~~~~~~~~~~~~~~i~~~~~~~~------~~~~~~~~~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~ 278 (289)
T cd00200 205 DLSTGKCLGTLRGHENGVNSVAFSPD------GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLAS 278 (289)
T ss_pred ECCCCceecchhhcCCceEEEEEcCC------CcEEEEEcCCCcEEEEEcCCceeEEEccccCCcEEEEEECCCCCEEEE
Confidence 99999999999899999999999998 788998888999999999999999999999999999999999999999
Q ss_pred EEcCCCCCCCCCCEEEEEE
Q 000473 687 LCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 687 gs~D~sg~~D~~gtV~VWD 705 (1471)
++.| |.|++||
T Consensus 279 ~~~d--------~~i~iw~ 289 (289)
T cd00200 279 GSAD--------GTIRIWD 289 (289)
T ss_pred ecCC--------CeEEecC
Confidence 9998 9999996
No 29
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.89 E-value=1.7e-20 Score=209.57 Aligned_cols=161 Identities=16% Similarity=0.118 Sum_probs=128.2
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
.++..|-+++.+|+-+.+||||.|-.-.||++.+ ++ ....+.||+.+|+|+.
T Consensus 62 ~H~~svFavsl~P~~~l~aTGGgDD~AflW~~~~---ge--~~~eltgHKDSVt~~~----------------------- 113 (399)
T KOG0296|consen 62 KHTDSVFAVSLHPNNNLVATGGGDDLAFLWDIST---GE--FAGELTGHKDSVTCCS----------------------- 113 (399)
T ss_pred hcCCceEEEEeCCCCceEEecCCCceEEEEEccC---Cc--ceeEecCCCCceEEEE-----------------------
Confidence 4566799999999999999999999999999985 43 5667889999999999
Q ss_pred ccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccc
Q 000473 94 VMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEG 173 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~ 173 (1471)
|+.|+.+||||.-+|.|.||.+.+|.......-+ .....=...+|..+++..|..
T Consensus 114 ----FshdgtlLATGdmsG~v~v~~~stg~~~~~~~~e---~~dieWl~WHp~a~illAG~~------------------ 168 (399)
T KOG0296|consen 114 ----FSHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQE---VEDIEWLKWHPRAHILLAGST------------------ 168 (399)
T ss_pred ----EccCceEEEecCCCccEEEEEcccCceEEEeecc---cCceEEEEecccccEEEeecC------------------
Confidence 7899999999999999999999999866554322 111122245578888888886
Q ss_pred cccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCC
Q 000473 174 DLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKE 250 (1471)
Q Consensus 174 ~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~ 250 (1471)
+|.+++|...+....+.+.. +.++ +++-.|. |+|++ ++++..||+|++|++.+.
T Consensus 169 ---------------DGsvWmw~ip~~~~~kv~~G-h~~~--ct~G~f~---pdGKr--~~tgy~dgti~~Wn~ktg 222 (399)
T KOG0296|consen 169 ---------------DGSVWMWQIPSQALCKVMSG-HNSP--CTCGEFI---PDGKR--ILTGYDDGTIIVWNPKTG 222 (399)
T ss_pred ---------------CCcEEEEECCCcceeeEecC-CCCC--ccccccc---CCCce--EEEEecCceEEEEecCCC
Confidence 49999999998766655544 5555 7888888 45654 888888999999998755
No 30
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.88 E-value=1.5e-21 Score=211.51 Aligned_cols=212 Identities=16% Similarity=0.172 Sum_probs=179.3
Q ss_pred cccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecC
Q 000473 502 RDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRM 581 (1471)
Q Consensus 502 ~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd 581 (1471)
..++.+|.+.|++++.... .|+.++.++.|.++.+|... .-|...|.+++.|.||...|..+..++|
T Consensus 8 ~~tl~gh~d~Vt~la~~~~---~~~~l~sasrDk~ii~W~L~----------~dd~~~G~~~r~~~GHsH~v~dv~~s~d 74 (315)
T KOG0279|consen 8 RGTLEGHTDWVTALAIKIK---NSDILVSASRDKTIIVWKLT----------SDDIKYGVPVRRLTGHSHFVSDVVLSSD 74 (315)
T ss_pred eeeecCCCceEEEEEeecC---CCceEEEcccceEEEEEEec----------cCccccCceeeeeeccceEecceEEccC
Confidence 3457899999999886665 34689999999999995543 1256678899999999999999999998
Q ss_pred CCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE
Q 000473 582 VGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV 661 (1471)
Q Consensus 582 ~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~ 661 (1471)
+++.+|+|.|+++++||+.+|+..+.|.+|+..|.+++|+|| ...++||+.|++|++|+.. +.|
T Consensus 75 ---------g~~alS~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~d------n~qivSGSrDkTiklwnt~-g~c 138 (315)
T KOG0279|consen 75 ---------GNFALSASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTD------NRQIVSGSRDKTIKLWNTL-GVC 138 (315)
T ss_pred ---------CceEEeccccceEEEEEecCCcEEEEEEecCCceEEEEecCC------CceeecCCCcceeeeeeec-ccE
Confidence 899999999999999999999999999999999999999999 8899999999999999987 455
Q ss_pred EEEecCC--CCCcEEEEEcCC--CCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeecccccccc
Q 000473 662 ERMFPGH--PNYPAKVVWDCP--RGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSIS 737 (1471)
Q Consensus 662 l~~~~gh--~~~V~~v~~spd--g~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~s 737 (1471)
..++..+ ...|.|+.|+|+ ..+|++++.| ++|++||+++.++...+.||++-+..+.+.| .
T Consensus 139 k~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~D--------ktvKvWnl~~~~l~~~~~gh~~~v~t~~vSp-------D 203 (315)
T KOG0279|consen 139 KYTIHEDSHREWVSCVRFSPNESNPIIVSASWD--------KTVKVWNLRNCQLRTTFIGHSGYVNTVTVSP-------D 203 (315)
T ss_pred EEEEecCCCcCcEEEEEEcCCCCCcEEEEccCC--------ceEEEEccCCcchhhccccccccEEEEEECC-------C
Confidence 5555444 789999999998 6899999988 9999999999999999999999888876663 2
Q ss_pred ceEEcCCccccccceeeccCCceEeecccc
Q 000473 738 GSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 738 g~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
|+ .-.++ .+|+.+..|+|..
T Consensus 204 Gs-----lcasG-----gkdg~~~LwdL~~ 223 (315)
T KOG0279|consen 204 GS-----LCASG-----GKDGEAMLWDLNE 223 (315)
T ss_pred CC-----EEecC-----CCCceEEEEEccC
Confidence 22 22222 2488999999843
No 31
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.87 E-value=1.6e-21 Score=217.47 Aligned_cols=213 Identities=17% Similarity=0.165 Sum_probs=186.8
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+++.+++-.+++|+.... -++.....+|.-.|.++.+.+... .+++++.|.+|+. |+.
T Consensus 165 l~tcSsDl~~~LWd~~~~---~~c~ks~~gh~h~vS~V~f~P~gd----~ilS~srD~tik~---------------We~ 222 (406)
T KOG0295|consen 165 LATCSSDLSAKLWDFDTF---FRCIKSLIGHEHGVSSVFFLPLGD----HILSCSRDNTIKA---------------WEC 222 (406)
T ss_pred EEecCCccchhheeHHHH---HHHHHHhcCcccceeeEEEEecCC----eeeecccccceeE---------------Eec
Confidence 444444445999998753 246667789999999999777765 7999999999998 446
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC--
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE-- 635 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~-- 635 (1471)
+++.++.+|.+|..-|.-++.+.| +.+++|+|.|.++++|-+.++++...++.|.-+|.+++|.|....
T Consensus 223 ~tg~cv~t~~~h~ewvr~v~v~~D---------Gti~As~s~dqtl~vW~~~t~~~k~~lR~hEh~vEci~wap~~~~~~ 293 (406)
T KOG0295|consen 223 DTGYCVKTFPGHSEWVRMVRVNQD---------GTIIASCSNDQTLRVWVVATKQCKAELREHEHPVECIAWAPESSYPS 293 (406)
T ss_pred ccceeEEeccCchHhEEEEEecCC---------eeEEEecCCCceEEEEEeccchhhhhhhccccceEEEEecccccCcc
Confidence 678999999999999999999887 899999999999999999999999999999999999999987422
Q ss_pred ----CC---CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 636 ----HP---WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 636 ----~~---~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.+ .++++.+++.|++|++||+.++.|+.++.||...|..++|+|.|+||++..+| ++++|||+++
T Consensus 294 i~~at~~~~~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghdnwVr~~af~p~Gkyi~ScaDD--------ktlrvwdl~~ 365 (406)
T KOG0295|consen 294 ISEATGSTNGGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHDNWVRGVAFSPGGKYILSCADD--------KTLRVWDLKN 365 (406)
T ss_pred hhhccCCCCCccEEEeecccceEEEEeccCCeEEEEEecccceeeeeEEcCCCeEEEEEecC--------CcEEEEEecc
Confidence 11 24699999999999999999999999999999999999999999999998888 9999999999
Q ss_pred CeEEEEEeCCCCCceeeeeee
Q 000473 709 GARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 709 g~~~~~l~gH~~~v~~~~~~~ 729 (1471)
+++..++..|..-+...+|.+
T Consensus 366 ~~cmk~~~ah~hfvt~lDfh~ 386 (406)
T KOG0295|consen 366 LQCMKTLEAHEHFVTSLDFHK 386 (406)
T ss_pred ceeeeccCCCcceeEEEecCC
Confidence 999999999998888887774
No 32
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.87 E-value=1.8e-19 Score=238.25 Aligned_cols=141 Identities=14% Similarity=0.191 Sum_probs=117.9
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCc-ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSH-VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~-~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
.+++|+.||.|++|+.. +. .+...+.+|...|+++.|. + +.+|++|+.|++|++
T Consensus 632 ~latgs~dg~I~iwD~~---------------~~~~~~~~~~~h~~~V~~v~f~-~---------~~~lvs~s~D~~iki 686 (793)
T PLN00181 632 SLAFGSADHKVYYYDLR---------------NPKLPLCTMIGHSKTVSYVRFV-D---------SSTLVSSSTDNTLKL 686 (793)
T ss_pred EEEEEeCCCeEEEEECC---------------CCCccceEecCCCCCEEEEEEe-C---------CCEEEEEECCCEEEE
Confidence 47889999999994322 22 2456778999999999997 3 688999999999999
Q ss_pred EECCC------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe-------------c
Q 000473 606 WDLGS------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF-------------P 666 (1471)
Q Consensus 606 WDl~t------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~-------------~ 666 (1471)
||+.. +++++.+.+|...+..+.|+|+ +++|++|+.|+.|++|+.....++..+ .
T Consensus 687 Wd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~------~~~lasgs~D~~v~iw~~~~~~~~~s~~~~~~~~~~~~~~~ 760 (793)
T PLN00181 687 WDLSMSISGINETPLHSFMGHTNVKNFVGLSVS------DGYIATGSETNEVFVYHKAFPMPVLSYKFKTIDPVSGLEVD 760 (793)
T ss_pred EeCCCCccccCCcceEEEcCCCCCeeEEEEcCC------CCEEEEEeCCCEEEEEECCCCCceEEEecccCCcccccccC
Confidence 99974 3578899999999999999998 899999999999999998765443222 2
Q ss_pred CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 667 GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 667 gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
.|...|.+++|+|++.+|++|+.| |+|+|||+
T Consensus 761 ~~~~~V~~v~ws~~~~~lva~~~d--------G~I~i~~~ 792 (793)
T PLN00181 761 DASQFISSVCWRGQSSTLVAANST--------GNIKILEM 792 (793)
T ss_pred CCCcEEEEEEEcCCCCeEEEecCC--------CcEEEEec
Confidence 345679999999999999999998 99999997
No 33
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.87 E-value=1e-20 Score=234.05 Aligned_cols=232 Identities=18% Similarity=0.262 Sum_probs=195.3
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+..+..++++++|+...... .+...+.+|...|+.+++.++.. .+++|+.|++|+| |+ .
T Consensus 174 l~~~~~~~~i~~~~~~~~~~--~~~~~l~~h~~~v~~~~fs~d~~----~l~s~s~D~tiri--wd-------------~ 232 (456)
T KOG0266|consen 174 LAAASSDGLIRIWKLEGIKS--NLLRELSGHTRGVSDVAFSPDGS----YLLSGSDDKTLRI--WD-------------L 232 (456)
T ss_pred EEEccCCCcEEEeecccccc--hhhccccccccceeeeEECCCCc----EEEEecCCceEEE--ee-------------c
Confidence 66677788999999855432 34555688999999998555555 7999999999999 44 3
Q ss_pred -CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCC
Q 000473 558 -NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 558 -~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
+.+..++++.||...|++++|+|+ +++++||+.|++|++||+++++++..+.+|.++|.+++|+++
T Consensus 233 ~~~~~~~~~l~gH~~~v~~~~f~p~---------g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d---- 299 (456)
T KOG0266|consen 233 KDDGRNLKTLKGHSTYVTSVAFSPD---------GNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGISGLAFSPD---- 299 (456)
T ss_pred cCCCeEEEEecCCCCceEEEEecCC---------CCEEEEecCCCcEEEEeccCCeEEEeeeccCCceEEEEECCC----
Confidence 345788999999999999999998 799999999999999999999999999999999999999999
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCc--EEEEecCCCCC--cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 637 PWSDCFLSVGEDFSVALASLETLR--VERMFPGHPNY--PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~--~l~~~~gh~~~--V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
++.+++++.|+.|++||+.++. ++..+.++... ++.++|+|++.|+++++.| +.+++||+..+...
T Consensus 300 --~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d--------~~~~~w~l~~~~~~ 369 (456)
T KOG0266|consen 300 --GNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLD--------RTLKLWDLRSGKSV 369 (456)
T ss_pred --CCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCC--------CeEEEEEccCCcce
Confidence 9999999999999999999999 67888888776 8999999999999999999 99999999999999
Q ss_pred EEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 713 RVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 713 ~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
+.+.+|...+.+. |+... ......++..+.|+.+..|++..
T Consensus 370 ~~~~~~~~~~~~~-~~~~~-------------~~~~~~i~sg~~d~~v~~~~~~s 410 (456)
T KOG0266|consen 370 GTYTGHSNLVRCI-FSPTL-------------STGGKLIYSGSEDGSVYVWDSSS 410 (456)
T ss_pred eeecccCCcceeE-ecccc-------------cCCCCeEEEEeCCceEEEEeCCc
Confidence 9999999764222 33110 11223355666799999999753
No 34
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.86 E-value=1.1e-20 Score=221.25 Aligned_cols=191 Identities=18% Similarity=0.173 Sum_probs=171.5
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEe
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAH 579 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~s 579 (1471)
.....|..|...|.|++++|... .+++++.|-.|++|+|+. .-.+.++|.||+..|.+++|.
T Consensus 88 ekV~~FeAH~DyIR~iavHPt~P----~vLtsSDDm~iKlW~we~--------------~wa~~qtfeGH~HyVMqv~fn 149 (794)
T KOG0276|consen 88 EKVKTFEAHSDYIRSIAVHPTLP----YVLTSSDDMTIKLWDWEN--------------EWACEQTFEGHEHYVMQVAFN 149 (794)
T ss_pred eeeEEeeccccceeeeeecCCCC----eEEecCCccEEEEeeccC--------------ceeeeeEEcCcceEEEEEEec
Confidence 44567899999999999888777 699999999999988872 225789999999999999999
Q ss_pred cCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC
Q 000473 580 RMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL 659 (1471)
Q Consensus 580 pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~ 659 (1471)
|. +.+.++|+|-|+||++|.+.+..+..++.+|...|.++.|-|. .+..+++||++|.+|++||.++.
T Consensus 150 Pk--------D~ntFaS~sLDrTVKVWslgs~~~nfTl~gHekGVN~Vdyy~~----gdkpylIsgaDD~tiKvWDyQtk 217 (794)
T KOG0276|consen 150 PK--------DPNTFASASLDRTVKVWSLGSPHPNFTLEGHEKGVNCVDYYTG----GDKPYLISGADDLTIKVWDYQTK 217 (794)
T ss_pred CC--------CccceeeeeccccEEEEEcCCCCCceeeeccccCcceEEeccC----CCcceEEecCCCceEEEeecchH
Confidence 97 4789999999999999999999999999999999999999885 11349999999999999999999
Q ss_pred cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 660 RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
.|++++.||...|..+.|+|.-..+++|++| |+|+||+-.|.+++.++.--..++.++...
T Consensus 218 ~CV~TLeGHt~Nvs~v~fhp~lpiiisgsED--------GTvriWhs~Ty~lE~tLn~gleRvW~I~~~ 278 (794)
T KOG0276|consen 218 SCVQTLEGHTNNVSFVFFHPELPIIISGSED--------GTVRIWNSKTYKLEKTLNYGLERVWCIAAH 278 (794)
T ss_pred HHHHHhhcccccceEEEecCCCcEEEEecCC--------ccEEEecCcceehhhhhhcCCceEEEEeec
Confidence 9999999999999999999999999999999 999999999999998888777777777544
No 35
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.86 E-value=8.3e-22 Score=222.07 Aligned_cols=227 Identities=15% Similarity=0.076 Sum_probs=190.5
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
++.+.+.++-+.+|+...- .....++.|...|+++.+.++.. .+++|..+|.|++ |+. +
T Consensus 110 RLltgs~SGEFtLWNg~~f----nFEtilQaHDs~Vr~m~ws~~g~----wmiSgD~gG~iKy--Wqp-----------n 168 (464)
T KOG0284|consen 110 RLLTGSQSGEFTLWNGTSF----NFETILQAHDSPVRTMKWSHNGT----WMISGDKGGMIKY--WQP-----------N 168 (464)
T ss_pred eeEeecccccEEEecCcee----eHHHHhhhhcccceeEEEccCCC----EEEEcCCCceEEe--ccc-----------c
Confidence 4777778888889986321 12233588999999999777776 7999999999999 551 1
Q ss_pred cCCcceEEEEecCC-ccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC
Q 000473 557 VNSHVSRQYFLGHT-GAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~-~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
...++.+.+|. ..|.+++|+|. ...++|+|.|++|+|||....+...++.+|.-.|.++.|+|.
T Consensus 169 ---mnnVk~~~ahh~eaIRdlafSpn---------DskF~t~SdDg~ikiWdf~~~kee~vL~GHgwdVksvdWHP~--- 233 (464)
T KOG0284|consen 169 ---MNNVKIIQAHHAEAIRDLAFSPN---------DSKFLTCSDDGTIKIWDFRMPKEERVLRGHGWDVKSVDWHPT--- 233 (464)
T ss_pred ---hhhhHHhhHhhhhhhheeccCCC---------CceeEEecCCCeEEEEeccCCchhheeccCCCCcceeccCCc---
Confidence 13345556665 89999999997 789999999999999999988888899999999999999998
Q ss_pred CCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 636 HPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
...++++|.|..|++||.++++|+.++.+|...|..+.|.|++++|++++.| ..++++|+++.+.++++
T Consensus 234 ---kgLiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n~N~Llt~skD--------~~~kv~DiR~mkEl~~~ 302 (464)
T KOG0284|consen 234 ---KGLIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPNGNWLLTGSKD--------QSCKVFDIRTMKELFTY 302 (464)
T ss_pred ---cceeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCCCCeeEEccCC--------ceEEEEehhHhHHHHHh
Confidence 7899999999999999999999999999999999999999999999999999 99999999999999999
Q ss_pred eCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 716 RGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 716 ~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
+||...++.+.+.|-. .+.. .+...|+.+..|.+.
T Consensus 303 r~Hkkdv~~~~WhP~~-----~~lf-----------tsgg~Dgsvvh~~v~ 337 (464)
T KOG0284|consen 303 RGHKKDVTSLTWHPLN-----ESLF-----------TSGGSDGSVVHWVVG 337 (464)
T ss_pred hcchhhheeecccccc-----ccce-----------eeccCCCceEEEecc
Confidence 9999999998777521 1222 222338888888775
No 36
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.86 E-value=2.9e-20 Score=209.32 Aligned_cols=84 Identities=23% Similarity=0.306 Sum_probs=76.3
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
.++..|--+.||++|++|||++.|.+.++|++.. ....+...++.||..+|..+.
T Consensus 222 ~htdEVWfl~FS~nGkyLAsaSkD~Taiiw~v~~--d~~~kl~~tlvgh~~~V~yi~----------------------- 276 (519)
T KOG0293|consen 222 DHTDEVWFLQFSHNGKYLASASKDSTAIIWIVVY--DVHFKLKKTLVGHSQPVSYIM----------------------- 276 (519)
T ss_pred hCCCcEEEEEEcCCCeeEeeccCCceEEEEEEec--CcceeeeeeeecccCceEEEE-----------------------
Confidence 4556799999999999999999999999999985 455778899999999999998
Q ss_pred ccccccCCCCEEEEEeCCCeEEEEEcCCCeEEE
Q 000473 94 VMGKSSLDNGALISACTDGVLCVWSRSSGHCRR 126 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~ 126 (1471)
+|||..||++++.|-.+.+||+.+|.++.
T Consensus 277 ----wSPDdryLlaCg~~e~~~lwDv~tgd~~~ 305 (519)
T KOG0293|consen 277 ----WSPDDRYLLACGFDEVLSLWDVDTGDLRH 305 (519)
T ss_pred ----ECCCCCeEEecCchHheeeccCCcchhhh
Confidence 79999999999999999999999998775
No 37
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.86 E-value=6.1e-22 Score=218.07 Aligned_cols=220 Identities=17% Similarity=0.202 Sum_probs=185.1
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+...++++++++||..+. .+..++.+|...|..+. |+...+++.+.|.+|.||+.+ . .
T Consensus 250 iisGSSDsTvrvWDv~tg----e~l~tlihHceaVLhlr------f~ng~mvtcSkDrsiaVWdm~--s----------p 307 (499)
T KOG0281|consen 250 IVSGSSDSTVRVWDVNTG----EPLNTLIHHCEAVLHLR------FSNGYMVTCSKDRSIAVWDMA--S----------P 307 (499)
T ss_pred EEecCCCceEEEEeccCC----chhhHHhhhcceeEEEE------EeCCEEEEecCCceeEEEecc--C----------c
Confidence 556678899999999885 45677888999998876 767789999999999994433 1 1
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
....+.+.|.||..+|+.+.|+ .++++|+|.|.+|++|++.++++++++.+|...|-|+.+.
T Consensus 308 s~it~rrVLvGHrAaVNvVdfd-----------~kyIVsASgDRTikvW~~st~efvRtl~gHkRGIAClQYr------- 369 (499)
T KOG0281|consen 308 TDITLRRVLVGHRAAVNVVDFD-----------DKYIVSASGDRTIKVWSTSTCEFVRTLNGHKRGIACLQYR------- 369 (499)
T ss_pred hHHHHHHHHhhhhhheeeeccc-----------cceEEEecCCceEEEEeccceeeehhhhcccccceehhcc-------
Confidence 1225567789999999999986 5799999999999999999999999999999999999874
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe-------
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA------- 710 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~------- 710 (1471)
++.++||+.|.+|++||+..|.|+++++||+.-|.++.|. .+.+++|..| |+|+|||+.++.
T Consensus 370 -~rlvVSGSSDntIRlwdi~~G~cLRvLeGHEeLvRciRFd--~krIVSGaYD--------GkikvWdl~aaldpra~~~ 438 (499)
T KOG0281|consen 370 -DRLVVSGSSDNTIRLWDIECGACLRVLEGHEELVRCIRFD--NKRIVSGAYD--------GKIKVWDLQAALDPRAPAS 438 (499)
T ss_pred -CeEEEecCCCceEEEEeccccHHHHHHhchHHhhhheeec--Cceeeecccc--------ceEEEEecccccCCccccc
Confidence 6799999999999999999999999999999999999995 6789888888 999999998864
Q ss_pred --EEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 711 --RERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 711 --~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
++.++..|+++|...+|.. . .+++++-|.+|.+|+.-+
T Consensus 439 ~~Cl~~lv~hsgRVFrLQFD~---f----------------qIvsssHddtILiWdFl~ 478 (499)
T KOG0281|consen 439 TLCLRTLVEHSGRVFRLQFDE---F----------------QIISSSHDDTILIWDFLN 478 (499)
T ss_pred chHHHhhhhccceeEEEeecc---e----------------EEEeccCCCeEEEEEcCC
Confidence 4567778888888887762 1 234555689999999644
No 38
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.86 E-value=3.2e-20 Score=197.99 Aligned_cols=228 Identities=18% Similarity=0.215 Sum_probs=185.2
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+++.+.+-+++.|+... |.|..+++.....|+++.+.++.. .|+.|+ .-.|++++.+
T Consensus 12 iLvsA~YDhTIRfWqa~t----G~C~rTiqh~dsqVNrLeiTpdk~----~LAaa~-~qhvRlyD~~------------- 69 (311)
T KOG0315|consen 12 ILVSAGYDHTIRFWQALT----GICSRTIQHPDSQVNRLEITPDKK----DLAAAG-NQHVRLYDLN------------- 69 (311)
T ss_pred EEEeccCcceeeeeehhc----CeEEEEEecCccceeeEEEcCCcc----hhhhcc-CCeeEEEEcc-------------
Confidence 366778889999999876 577888888888999999777776 465554 5688884433
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
..+..++.++.+|+..|+.+.|+.+ ++++.|||.||+++|||++...+-+.| .|..+|+++..+|+
T Consensus 70 S~np~Pv~t~e~h~kNVtaVgF~~d---------grWMyTgseDgt~kIWdlR~~~~qR~~-~~~spVn~vvlhpn---- 135 (311)
T KOG0315|consen 70 SNNPNPVATFEGHTKNVTAVGFQCD---------GRWMYTGSEDGTVKIWDLRSLSCQRNY-QHNSPVNTVVLHPN---- 135 (311)
T ss_pred CCCCCceeEEeccCCceEEEEEeec---------CeEEEecCCCceEEEEeccCcccchhc-cCCCCcceEEecCC----
Confidence 1223578999999999999999987 999999999999999999986665555 56799999999998
Q ss_pred CCCCEEEEEeCCCcEEEEECCCC-------------------------------------------------cEEEEecC
Q 000473 637 PWSDCFLSVGEDFSVALASLETL-------------------------------------------------RVERMFPG 667 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~-------------------------------------------------~~l~~~~g 667 (1471)
+.-|+++..+|.|++||+.+. .++..|+.
T Consensus 136 --QteLis~dqsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~a 213 (311)
T KOG0315|consen 136 --QTELISGDQSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQA 213 (311)
T ss_pred --cceEEeecCCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheec
Confidence 677888888888888888642 34556778
Q ss_pred CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCCceeeeeeeccccccccceEEcCCcc
Q 000473 668 HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTS 746 (1471)
Q Consensus 668 h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~ 746 (1471)
|.+.+..+.+|||++||++++.| .+++||+.++. +++..+.||+.-+.-..|+. +|.
T Consensus 214 h~~~il~C~lSPd~k~lat~ssd--------ktv~iwn~~~~~kle~~l~gh~rWvWdc~FS~------------dg~-- 271 (311)
T KOG0315|consen 214 HNGHILRCLLSPDVKYLATCSSD--------KTVKIWNTDDFFKLELVLTGHQRWVWDCAFSA------------DGE-- 271 (311)
T ss_pred ccceEEEEEECCCCcEEEeecCC--------ceEEEEecCCceeeEEEeecCCceEEeeeecc------------Ccc--
Confidence 88999999999999999999999 99999999998 88999999998888877771 222
Q ss_pred ccccceeeccCCceEeecccc
Q 000473 747 VSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 747 ~s~~l~~~~~D~tir~w~l~~ 767 (1471)
-+++.+.|++.|.|++..
T Consensus 272 ---YlvTassd~~~rlW~~~~ 289 (311)
T KOG0315|consen 272 ---YLVTASSDHTARLWDLSA 289 (311)
T ss_pred ---EEEecCCCCceeeccccc
Confidence 245666699999998743
No 39
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.86 E-value=8.1e-21 Score=210.58 Aligned_cols=187 Identities=19% Similarity=0.210 Sum_probs=171.9
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEe
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAH 579 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~s 579 (1471)
.+...+.+|-+.|.|+++-+.+. .+++|+.|++|.| ||+.+++...++.||...|..++++
T Consensus 142 Kl~rVi~gHlgWVr~vavdP~n~----wf~tgs~DrtikI---------------wDlatg~LkltltGhi~~vr~vavS 202 (460)
T KOG0285|consen 142 KLYRVISGHLGWVRSVAVDPGNE----WFATGSADRTIKI---------------WDLATGQLKLTLTGHIETVRGVAVS 202 (460)
T ss_pred eehhhhhhccceEEEEeeCCCce----eEEecCCCceeEE---------------EEcccCeEEEeecchhheeeeeeec
Confidence 44556789999999999666655 7999999999999 4577889999999999999999999
Q ss_pred cCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC
Q 000473 580 RMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL 659 (1471)
Q Consensus 580 pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~ 659 (1471)
+- ..++++++.|+.|+.||+...+.++.+.+|-..|.++..+|. -+.+++++.|.++++||+++.
T Consensus 203 ~r---------HpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS~V~~L~lhPT------ldvl~t~grDst~RvWDiRtr 267 (460)
T KOG0285|consen 203 KR---------HPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLSGVYCLDLHPT------LDVLVTGGRDSTIRVWDIRTR 267 (460)
T ss_pred cc---------CceEEEecCCCeeEEEechhhhhHHHhccccceeEEEecccc------ceeEEecCCcceEEEeeeccc
Confidence 86 689999999999999999999999999999999999999998 789999999999999999999
Q ss_pred cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 660 RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
..+..+.||..+|..|.+.|.+..+++|+.| ++|++||++.|+...++..|...|.+....
T Consensus 268 ~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D--------~tvrlWDl~agkt~~tlt~hkksvral~lh 328 (460)
T KOG0285|consen 268 ASVHVLSGHTNPVASVMCQPTDPQVITGSHD--------STVRLWDLRAGKTMITLTHHKKSVRALCLH 328 (460)
T ss_pred ceEEEecCCCCcceeEEeecCCCceEEecCC--------ceEEEeeeccCceeEeeecccceeeEEecC
Confidence 9999999999999999999999999999999 999999999999999999998777775444
No 40
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.86 E-value=5.9e-21 Score=211.66 Aligned_cols=224 Identities=17% Similarity=0.235 Sum_probs=192.8
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+.+.+.++++||+.. |++..++.+|-..|..+++..-.. ++.++++|+.|+- | |+
T Consensus 166 f~tgs~DrtikIwDlat----g~LkltltGhi~~vr~vavS~rHp----YlFs~gedk~VKC--w-------------DL 222 (460)
T KOG0285|consen 166 FATGSADRTIKIWDLAT----GQLKLTLTGHIETVRGVAVSKRHP----YLFSAGEDKQVKC--W-------------DL 222 (460)
T ss_pred EEecCCCceeEEEEccc----CeEEEeecchhheeeeeeecccCc----eEEEecCCCeeEE--E-------------ec
Confidence 56667789999999987 578888999999999988555444 6999999999998 3 34
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
+..+.++.+.||-..|.||+.+|. -.+|+||+.|.++|+||+++...++++.+|..+|.++.+.|-
T Consensus 223 e~nkvIR~YhGHlS~V~~L~lhPT---------ldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V~~~~~----- 288 (460)
T KOG0285|consen 223 EYNKVIRHYHGHLSGVYCLDLHPT---------LDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASVMCQPT----- 288 (460)
T ss_pred hhhhhHHHhccccceeEEEecccc---------ceeEEecCCcceEEEeeecccceEEEecCCCCcceeEEeecC-----
Confidence 556788899999999999999997 799999999999999999999999999999999999999987
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
...++|||.|++|++||++.|+....+..|...|.+++.+|....++++|.| .|+-|++..|..++.+.|
T Consensus 289 -dpqvit~S~D~tvrlWDl~agkt~~tlt~hkksvral~lhP~e~~fASas~d---------nik~w~~p~g~f~~nlsg 358 (460)
T KOG0285|consen 289 -DPQVITGSHDSTVRLWDLRAGKTMITLTHHKKSVRALCLHPKENLFASASPD---------NIKQWKLPEGEFLQNLSG 358 (460)
T ss_pred -CCceEEecCCceEEEeeeccCceeEeeecccceeeEEecCCchhhhhccCCc---------cceeccCCccchhhcccc
Confidence 6689999999999999999999999999999999999999999999999987 799999999999999999
Q ss_pred CCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 718 TASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 718 H~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
|.+-+...... ..|.+..|.. .+.+..|+-+
T Consensus 359 h~~iintl~~n-------sD~v~~~G~d-----------ng~~~fwdwk 389 (460)
T KOG0285|consen 359 HNAIINTLSVN-------SDGVLVSGGD-----------NGSIMFWDWK 389 (460)
T ss_pred ccceeeeeeec-------cCceEEEcCC-----------ceEEEEEecC
Confidence 99766555222 1344444443 6677788753
No 41
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.86 E-value=1.6e-19 Score=217.93 Aligned_cols=340 Identities=16% Similarity=0.175 Sum_probs=243.3
Q ss_pred CceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccccc
Q 000473 16 SHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVM 95 (1471)
Q Consensus 16 ~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~ 95 (1471)
+.+|-.++|+|..+.+.|+=..|.|.+||..- ...+..+.+|.+||..+.
T Consensus 9 SsRvKglsFHP~rPwILtslHsG~IQlWDYRM-----~tli~rFdeHdGpVRgv~------------------------- 58 (1202)
T KOG0292|consen 9 SSRVKGLSFHPKRPWILTSLHSGVIQLWDYRM-----GTLIDRFDEHDGPVRGVD------------------------- 58 (1202)
T ss_pred cccccceecCCCCCEEEEeecCceeeeehhhh-----hhHHhhhhccCCccceee-------------------------
Confidence 47899999999999999999999999999873 456777889999999998
Q ss_pred ccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccccc
Q 000473 96 GKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDL 175 (1471)
Q Consensus 96 ~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~ 175 (1471)
|.|.++++|||++|-.|+||+..+.+|+.+...| .+....+.+.++-+|++.+..+
T Consensus 59 --FH~~qplFVSGGDDykIkVWnYk~rrclftL~GH--lDYVRt~~FHheyPWIlSASDD-------------------- 114 (1202)
T KOG0292|consen 59 --FHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGH--LDYVRTVFFHHEYPWILSASDD-------------------- 114 (1202)
T ss_pred --ecCCCCeEEecCCccEEEEEecccceehhhhccc--cceeEEeeccCCCceEEEccCC--------------------
Confidence 6888999999999999999999999999765433 1223333344567799888776
Q ss_pred cccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCccc
Q 000473 176 VSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDR 255 (1471)
Q Consensus 176 ~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~ 255 (1471)
-+|+||+.++.+++.++.. +.+ +|-|..|.|. + +.++.|+-|.+|||||+..-+.
T Consensus 115 --------------QTIrIWNwqsr~~iavltG-HnH--YVMcAqFhpt---E--DlIVSaSLDQTVRVWDisGLRk--- 169 (1202)
T KOG0292|consen 115 --------------QTIRIWNWQSRKCIAVLTG-HNH--YVMCAQFHPT---E--DLIVSASLDQTVRVWDISGLRK--- 169 (1202)
T ss_pred --------------CeEEEEeccCCceEEEEec-Cce--EEEeeccCCc---c--ceEEEecccceEEEEeecchhc---
Confidence 7999999999999999886 333 2667677642 2 4477779999999999986520
Q ss_pred ccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCcceeeeeeecceeEeecCCCCceeee
Q 000473 256 EEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGSTIGEICFVDNLFCLEGGSTNSYVIG 335 (1471)
Q Consensus 256 ~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~~ige~~~~~~~l~~~~~~~~~~~~~ 335 (1471)
.+. ..+ .+ .+ ++ . +
T Consensus 170 ------------------k~~--~pg-~~-------------e~--~~------~------------------------~ 183 (1202)
T KOG0292|consen 170 ------------------KNK--APG-SL-------------ED--QM------R------------------------G 183 (1202)
T ss_pred ------------------cCC--CCC-Cc-------------hh--hh------h------------------------c
Confidence 110 000 00 00 00 0 0
Q ss_pred eEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEEEeecceeeEEe
Q 000473 336 AMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIHFIQMSLYLLRM 415 (1471)
Q Consensus 336 g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~f~~~~~~L~~v 415 (1471)
.. . .++-|+. ++
T Consensus 184 ~~-~--------------------------------------~~dLfg~------~D----------------------- 195 (1202)
T KOG0292|consen 184 QQ-G--------------------------------------NSDLFGQ------TD----------------------- 195 (1202)
T ss_pred cc-c--------------------------------------chhhcCC------cC-----------------------
Confidence 00 0 0000000 00
Q ss_pred eeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccccccccCC
Q 000473 416 ETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDS 495 (1471)
Q Consensus 416 ~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~ 495 (1471)
.++ |.+
T Consensus 196 --------------aVV----------------K~V-------------------------------------------- 201 (1202)
T KOG0292|consen 196 --------------AVV----------------KHV-------------------------------------------- 201 (1202)
T ss_pred --------------eee----------------eee--------------------------------------------
Confidence 001 112
Q ss_pred CCCCCccccccccCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccE
Q 000473 496 RQAGDGRDDFVHKEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAV 573 (1471)
Q Consensus 496 ~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V 573 (1471)
+.||...|+..+ |+|. .+++|+.|..|++|+|+ ..+.|.++ +..||.+.|
T Consensus 202 ---------LEGHDRGVNwaA------fhpTlpliVSG~DDRqVKlWrmn-------etKaWEvD------tcrgH~nnV 253 (1202)
T KOG0292|consen 202 ---------LEGHDRGVNWAA------FHPTLPLIVSGADDRQVKLWRMN-------ETKAWEVD------TCRGHYNNV 253 (1202)
T ss_pred ---------ecccccccceEE------ecCCcceEEecCCcceeeEEEec-------cccceeeh------hhhcccCCc
Confidence 234666677766 6664 89999999999998887 23346654 457999999
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEE
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVAL 653 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~l 653 (1471)
.|+.|||. .++++|.|.|++|+|||+...+.+++|+...+.-+.++.+|. .+.|+.| .|+-+.+
T Consensus 254 ssvlfhp~---------q~lIlSnsEDksirVwDm~kRt~v~tfrrendRFW~laahP~------lNLfAAg-HDsGm~V 317 (1202)
T KOG0292|consen 254 SSVLFHPH---------QDLILSNSEDKSIRVWDMTKRTSVQTFRRENDRFWILAAHPE------LNLFAAG-HDSGMIV 317 (1202)
T ss_pred ceEEecCc---------cceeEecCCCccEEEEecccccceeeeeccCCeEEEEEecCC------cceeeee-cCCceEE
Confidence 99999997 789999999999999999999999999988889999999998 7777655 5777777
Q ss_pred EECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 654 ASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 654 Wdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
+-++..++. .+.+.++ .+.+- | ..|+-+|+.|.
T Consensus 318 FkleRErpa------------~~v~~n~-LfYvk--d--------~~i~~~d~~t~ 350 (1202)
T KOG0292|consen 318 FKLERERPA------------YAVNGNG-LFYVK--D--------RFIRSYDLRTQ 350 (1202)
T ss_pred EEEcccCce------------EEEcCCE-EEEEc--c--------ceEEeeecccc
Confidence 877643332 2233222 22222 4 78888888874
No 42
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.85 E-value=9.9e-20 Score=204.09 Aligned_cols=274 Identities=19% Similarity=0.254 Sum_probs=207.9
Q ss_pred eeEEEEccccCCCCCcceeEeccCCce--EeeccccccccCCCCcccceeecccccCccccccccCCCCCCCcccccccc
Q 000473 431 YISVWSLSQKHSGPGKQCRMVGEGFSF--VDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHK 508 (1471)
Q Consensus 431 ~v~vwsl~~~~~~~~~~~k~l~~g~~~--~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h 508 (1471)
..++|+..++. .+ .+.+.++.+ ..|.+.. ...+ .+.+++.+.++++|....+...-.+...-.||
T Consensus 126 ~~riWd~~Gk~---~~--~~~Ght~~ik~v~~v~~n----~~~~----~fvsas~Dqtl~Lw~~~~~~~~~~~~~~~~GH 192 (423)
T KOG0313|consen 126 TSRIWDLKGKS---IK--TIVGHTGPIKSVAWVIKN----SSSC----LFVSASMDQTLRLWKWNVGENKVKALKVCRGH 192 (423)
T ss_pred eeEEEecCCce---EE--EEecCCcceeeeEEEecC----Cccc----eEEEecCCceEEEEEecCchhhhhHHhHhccc
Confidence 47899865542 11 133444444 6776654 1122 47888899999999998876655555555799
Q ss_pred CccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc-----cccCCC-----CCCccccCCcceEEEEecCCccEEEEEE
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL-----FERHNS-----PGASLKVNSHVSRQYFLGHTGAVLCLAA 578 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~-----l~~~d~-----~~~~~d~~s~~~~~~l~gH~~~V~~la~ 578 (1471)
...|-++...++.. ++++|+.|..+.||+... ++.... ..+.....+..++-++.||+++|.++.|
T Consensus 193 k~~V~sVsv~~sgt----r~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r~P~vtl~GHt~~Vs~V~w 268 (423)
T KOG0313|consen 193 KRSVDSVSVDSSGT----RFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQKREKEGGTRTPLVTLEGHTEPVSSVVW 268 (423)
T ss_pred ccceeEEEecCCCC----eEEeecccceeeecccCCCccccccccchhhhhhhhhhhcccccCceEEecccccceeeEEE
Confidence 99999998888777 899999999999965211 111100 0011123345678899999999999999
Q ss_pred ecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC
Q 000473 579 HRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET 658 (1471)
Q Consensus 579 spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t 658 (1471)
++ ...+.|+|.|.+|+.||+.++.+..++.+. .++.++.++|. .+++++|+.|..++|||.++
T Consensus 269 ~d----------~~v~yS~SwDHTIk~WDletg~~~~~~~~~-ksl~~i~~~~~------~~Ll~~gssdr~irl~DPR~ 331 (423)
T KOG0313|consen 269 SD----------ATVIYSVSWDHTIKVWDLETGGLKSTLTTN-KSLNCISYSPL------SKLLASGSSDRHIRLWDPRT 331 (423)
T ss_pred cC----------CCceEeecccceEEEEEeecccceeeeecC-cceeEeecccc------cceeeecCCCCceeecCCCC
Confidence 86 588999999999999999999998887654 57899999998 89999999999999999987
Q ss_pred Cc---EEEEecCCCCCcEEEEEcCCCCEE-EEEEcCCCCCCCCCCEEEEEECCCCe-EEEEEeCCCCCceeeeeeecccc
Q 000473 659 LR---VERMFPGHPNYPAKVVWDCPRGYI-ACLCRDHSRTSDAVDVLFIWDVKTGA-RERVLRGTASHSMFDHFCKGISM 733 (1471)
Q Consensus 659 ~~---~l~~~~gh~~~V~~v~~spdg~~L-~sgs~D~sg~~D~~gtV~VWDi~tg~-~~~~l~gH~~~v~~~~~~~~~~~ 733 (1471)
+. ..++|.||.+.|.++.|+|...|+ ++|+.| +++++||+++-. .+..+.+|...+..+.+..
T Consensus 332 ~~gs~v~~s~~gH~nwVssvkwsp~~~~~~~S~S~D--------~t~klWDvRS~k~plydI~~h~DKvl~vdW~~---- 399 (423)
T KOG0313|consen 332 GDGSVVSQSLIGHKNWVSSVKWSPTNEFQLVSGSYD--------NTVKLWDVRSTKAPLYDIAGHNDKVLSVDWNE---- 399 (423)
T ss_pred CCCceeEEeeecchhhhhheecCCCCceEEEEEecC--------CeEEEEEeccCCCcceeeccCCceEEEEeccC----
Confidence 53 457899999999999999987755 556666 999999999876 8999999999999987773
Q ss_pred ccccceEEcCCccccccceeeccCCceEeec
Q 000473 734 NSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 734 ~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
++.+.+|.. |.++|++.
T Consensus 400 ---~~~IvSGGa-----------D~~l~i~~ 416 (423)
T KOG0313|consen 400 ---GGLIVSGGA-----------DNKLRIFK 416 (423)
T ss_pred ---CceEEeccC-----------cceEEEec
Confidence 233444443 88888754
No 43
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.85 E-value=2.1e-18 Score=208.58 Aligned_cols=471 Identities=13% Similarity=0.138 Sum_probs=289.9
Q ss_pred EEEEEEcCCC--CeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccc
Q 000473 19 VTATSALTQP--PTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMG 96 (1471)
Q Consensus 19 Vtava~SpDg--~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~ 96 (1471)
||++. +|+- .-|+.|+.+|.+.+||+++ .+.+..+.+|...||++.
T Consensus 162 Ital~-HP~TYLNKIvvGs~~G~lql~Nvrt-----~K~v~~f~~~~s~IT~ie-------------------------- 209 (910)
T KOG1539|consen 162 ITALL-HPSTYLNKIVVGSSQGRLQLWNVRT-----GKVVYTFQEFFSRITAIE-------------------------- 209 (910)
T ss_pred eeeEe-cchhheeeEEEeecCCcEEEEEecc-----CcEEEEecccccceeEec--------------------------
Confidence 66553 4542 4467899999999999996 457788999999999997
Q ss_pred cccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccccccc
Q 000473 97 KSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLV 176 (1471)
Q Consensus 97 ~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~ 176 (1471)
-+|-=..+|-|..||+|.+.|+..++.+...+.+ | |..+.+.+=+...-++++|+.
T Consensus 210 -qsPaLDVVaiG~~~G~ViifNlK~dkil~sFk~d-~-g~VtslSFrtDG~p~las~~~--------------------- 265 (910)
T KOG1539|consen 210 -QSPALDVVAIGLENGTVIIFNLKFDKILMSFKQD-W-GRVTSLSFRTDGNPLLASGRS--------------------- 265 (910)
T ss_pred -cCCcceEEEEeccCceEEEEEcccCcEEEEEEcc-c-cceeEEEeccCCCeeEEeccC---------------------
Confidence 3455677999999999999999999999888775 2 333444443334456777765
Q ss_pred ccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEE-EeCCCcEEEEECCCCCCccc
Q 000473 177 SEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLM-VDSVGRLQLVPISKESHLDR 255 (1471)
Q Consensus 177 ~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llv-as~dG~V~vW~l~~~~~~~~ 255 (1471)
+|.+.+||...-++...+...+ -+.+....|.+.. -+++ ++.|+.+++|-.+.+. +
T Consensus 266 ------------~G~m~~wDLe~kkl~~v~~nah--~~sv~~~~fl~~e------pVl~ta~~DnSlk~~vfD~~d---g 322 (910)
T KOG1539|consen 266 ------------NGDMAFWDLEKKKLINVTRNAH--YGSVTGATFLPGE------PVLVTAGADNSLKVWVFDSGD---G 322 (910)
T ss_pred ------------CceEEEEEcCCCeeeeeeeccc--cCCcccceecCCC------ceEeeccCCCceeEEEeeCCC---C
Confidence 4889999998877776666433 2336666666332 2454 4999999999998652 1
Q ss_pred ccCCCcccCCCcccceeccCCcccCceEEEEe-cCCcEEEEEeCCeEEEEEcCCCc--ceeeeeeecceeEeecCCCC--
Q 000473 256 EEGNGLCKSSSQLDMAILQNGVVEGGHLVSVA-TCGNIIALVLKDHCIFRLLGSGS--TIGEICFVDNLFCLEGGSTN-- 330 (1471)
Q Consensus 256 ~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s-~~g~~l~~~~~~~~~~~l~d~~~--~ige~~~~~~~l~~~~~~~~-- 330 (1471)
. ++ .+ +...+|++.-..|.|- .+|..+.++..++ .+|.+.... .-++.... +... ...+.+
T Consensus 323 ~--pR------~L---R~R~GHs~Pp~~irfy~~~g~~ilsa~~Dr-t~r~fs~~~e~~~~~l~~~-~~~~-~~kk~~~~ 388 (910)
T KOG1539|consen 323 V--PR------LL---RSRGGHSAPPSCIRFYGSQGHFILSAKQDR-TLRSFSVISESQSQELGQL-HNKK-RAKKVNVF 388 (910)
T ss_pred c--ch------he---eeccCCCCCchheeeeccCcEEEEecccCc-chhhhhhhHHHHhHhhccc-cccc-cccccccc
Confidence 1 11 22 3355566666677766 4578888877776 344443321 00111000 0000 000000
Q ss_pred ------ceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCCceeeEE
Q 000473 331 ------SYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSGVKFSIH 404 (1471)
Q Consensus 331 ------~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~~~~~i~ 404 (1471)
.+.+..+..+. +... .. ++.+...-.+..++.|+ +.+...+...+ +..+..+++
T Consensus 389 ~~~~~k~p~i~~fa~~~--~RE~--~W------~Nv~~~h~~~~~~~tW~--~~n~~~G~~~L---~~~~~~~~~----- 448 (910)
T KOG1539|consen 389 STEKLKLPPIVEFAFEN--AREK--EW------DNVITAHKGKRSAYTWN--FRNKTSGRHVL---DPKRFKKDD----- 448 (910)
T ss_pred chhhhcCCcceeeeccc--chhh--hh------cceeEEecCcceEEEEe--ccCcccccEEe---cCccccccC-----
Confidence 00000000000 0000 00 00111111111222222 22211111000 000000000
Q ss_pred EeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeeccccc
Q 000473 405 FIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQ 484 (1471)
Q Consensus 405 f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~ 484 (1471)
....+ +. .+.|++| ..+..+.
T Consensus 449 -----~~~~a-----------------v~-vs~CGNF------------------------------------~~IG~S~ 469 (910)
T KOG1539|consen 449 -----INATA-----------------VC-VSFCGNF------------------------------------VFIGYSK 469 (910)
T ss_pred -----cceEE-----------------EE-EeccCce------------------------------------EEEeccC
Confidence 00000 00 0112221 1122233
Q ss_pred CccccccccCCCCCCCccccc---cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcc
Q 000473 485 DTVPRSEHVDSRQAGDGRDDF---VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV 561 (1471)
Q Consensus 485 ~~v~~Wd~~~~~~~g~~~~~~---~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~ 561 (1471)
+.+..++.....+. ..+ ..|...|+.+++-.-+. .+++++.+|.+..|++. ...
T Consensus 470 G~Id~fNmQSGi~r----~sf~~~~ah~~~V~gla~D~~n~----~~vsa~~~Gilkfw~f~---------------~k~ 526 (910)
T KOG1539|consen 470 GTIDRFNMQSGIHR----KSFGDSPAHKGEVTGLAVDGTNR----LLVSAGADGILKFWDFK---------------KKV 526 (910)
T ss_pred CeEEEEEcccCeee----cccccCccccCceeEEEecCCCc----eEEEccCcceEEEEecC---------------Ccc
Confidence 45555554443332 223 47889999998766665 79999999999995443 222
Q ss_pred eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCE
Q 000473 562 SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 562 ~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~ 641 (1471)
....+. -...++++.+|.. ...++.+..|..|++.|..+.+.++.|.+|++.|+++.|+|| |++
T Consensus 527 l~~~l~-l~~~~~~iv~hr~---------s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~D------grW 590 (910)
T KOG1539|consen 527 LKKSLR-LGSSITGIVYHRV---------SDLLAIALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPD------GRW 590 (910)
T ss_pred eeeeec-cCCCcceeeeeeh---------hhhhhhhcCceeEEEEEchhhhhhHHhhccccceeeeEeCCC------CcE
Confidence 333332 1235778888875 688999999999999999999999999999999999999999 999
Q ss_pred EEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 642 FLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 642 l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+++++.|++|++||+.++.++-.+. -..+++.+.|+|+|+||+|...| ..-||+|--++
T Consensus 591 lisasmD~tIr~wDlpt~~lID~~~-vd~~~~sls~SPngD~LAT~Hvd-------~~gIylWsNks 649 (910)
T KOG1539|consen 591 LISASMDSTIRTWDLPTGTLIDGLL-VDSPCTSLSFSPNGDFLATVHVD-------QNGIYLWSNKS 649 (910)
T ss_pred EEEeecCCcEEEEeccCcceeeeEe-cCCcceeeEECCCCCEEEEEEec-------CceEEEEEchh
Confidence 9999999999999999999988775 45678999999999999999988 25699997554
No 44
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.84 E-value=2.8e-20 Score=196.76 Aligned_cols=233 Identities=18% Similarity=0.163 Sum_probs=196.3
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
.+..++.++++|+... |.++.++.+|...|...+..+++. .+++|+.|..+.+ ||+.
T Consensus 33 ltcGsdrtvrLWNp~r----g~liktYsghG~EVlD~~~s~Dns----kf~s~GgDk~v~v---------------wDV~ 89 (307)
T KOG0316|consen 33 LTCGSDRTVRLWNPLR----GALIKTYSGHGHEVLDAALSSDNS----KFASCGGDKAVQV---------------WDVN 89 (307)
T ss_pred EEcCCCceEEeecccc----cceeeeecCCCceeeecccccccc----ccccCCCCceEEE---------------EEcc
Confidence 4555778999998654 688899999999999888777776 8999999999998 5688
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC--CceEEEEeccCCCEEEEEECCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS--GNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t--g~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
+|+..+.|.||.+.|+.++|..+ ...++|||.|.++++||-++ .++++.+......|.++.+..
T Consensus 90 TGkv~Rr~rgH~aqVNtV~fNee---------sSVv~SgsfD~s~r~wDCRS~s~ePiQildea~D~V~Si~v~~----- 155 (307)
T KOG0316|consen 90 TGKVDRRFRGHLAQVNTVRFNEE---------SSVVASGSFDSSVRLWDCRSRSFEPIQILDEAKDGVSSIDVAE----- 155 (307)
T ss_pred cCeeeeecccccceeeEEEecCc---------ceEEEeccccceeEEEEcccCCCCccchhhhhcCceeEEEecc-----
Confidence 99999999999999999999986 78999999999999999875 568889988889999999854
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
+.++.|+.||+++.||++.|+...-+-|| +|+++.|+++++..+.++.| +++++-|-.||++++.+.
T Consensus 156 ---heIvaGS~DGtvRtydiR~G~l~sDy~g~--pit~vs~s~d~nc~La~~l~--------stlrLlDk~tGklL~sYk 222 (307)
T KOG0316|consen 156 ---HEIVAGSVDGTVRTYDIRKGTLSSDYFGH--PITSVSFSKDGNCSLASSLD--------STLRLLDKETGKLLKSYK 222 (307)
T ss_pred ---cEEEeeccCCcEEEEEeecceeehhhcCC--cceeEEecCCCCEEEEeecc--------ceeeecccchhHHHHHhc
Confidence 58999999999999999999877666554 69999999999999999999 999999999999999999
Q ss_pred CCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccccccccc
Q 000473 717 GTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGVAFST 776 (1471)
Q Consensus 717 gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~~~~~ 776 (1471)
||.........|- +. ....++..++||.+-.|+|.+-.+..+.+.
T Consensus 223 Ghkn~eykldc~l----~q-----------sdthV~sgSEDG~Vy~wdLvd~~~~sk~~~ 267 (307)
T KOG0316|consen 223 GHKNMEYKLDCCL----NQ-----------SDTHVFSGSEDGKVYFWDLVDETQISKLSV 267 (307)
T ss_pred ccccceeeeeeee----cc-----------cceeEEeccCCceEEEEEeccceeeeeecc
Confidence 9999988886661 11 112345566799999999865544444433
No 45
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.84 E-value=1e-19 Score=225.17 Aligned_cols=201 Identities=22% Similarity=0.318 Sum_probs=170.5
Q ss_pred CccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcc--eEEEEecCCccEEEEEEecCCCCcc
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV--SRQYFLGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~--~~~~l~gH~~~V~~la~spd~~~~~ 586 (1471)
...|++..+.++.. .+++++.|+.+++|... ..+ ..+.+.+|...|.+++|+|+
T Consensus 159 ~~sv~~~~fs~~g~----~l~~~~~~~~i~~~~~~---------------~~~~~~~~~l~~h~~~v~~~~fs~d----- 214 (456)
T KOG0266|consen 159 CPSVTCVDFSPDGR----ALAAASSDGLIRIWKLE---------------GIKSNLLRELSGHTRGVSDVAFSPD----- 214 (456)
T ss_pred cCceEEEEEcCCCC----eEEEccCCCcEEEeecc---------------cccchhhccccccccceeeeEECCC-----
Confidence 56788877444444 79999999999995442 112 45666899999999999998
Q ss_pred cCcCCCEEEEEECCCcEEEEEC-CCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe
Q 000473 587 GWSFNEVLVSGSMDCSIRIWDL-GSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF 665 (1471)
Q Consensus 587 ~~~~~~~L~SGs~DgtI~lWDl-~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~ 665 (1471)
++++++|+.|++|++||+ ..+.+++++.+|...|++++|+|+ ++.++||+.|++|++||+++++++..+
T Consensus 215 ----~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~------g~~i~Sgs~D~tvriWd~~~~~~~~~l 284 (456)
T KOG0266|consen 215 ----GSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPD------GNLLVSGSDDGTVRIWDVRTGECVRKL 284 (456)
T ss_pred ----CcEEEEecCCceEEEeeccCCCeEEEEecCCCCceEEEEecCC------CCEEEEecCCCcEEEEeccCCeEEEee
Confidence 889999999999999999 566899999999999999999999 899999999999999999999999999
Q ss_pred cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe--EEEEEeCCCCC--ceeeeeeeccccccccceEE
Q 000473 666 PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA--RERVLRGTASH--SMFDHFCKGISMNSISGSVL 741 (1471)
Q Consensus 666 ~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~--~~~~l~gH~~~--v~~~~~~~~~~~~~~sg~v~ 741 (1471)
.+|.+.|++++|++++.+|++++.| +.|+|||+.+++ +...+.++... +..+.|++
T Consensus 285 ~~hs~~is~~~f~~d~~~l~s~s~d--------~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp------------ 344 (456)
T KOG0266|consen 285 KGHSDGISGLAFSPDGNLLVSASYD--------GTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSP------------ 344 (456)
T ss_pred eccCCceEEEEECCCCCEEEEcCCC--------ccEEEEECCCCceeeeecccCCCCCCceeEEEECC------------
Confidence 9999999999999999999999888 999999999999 67888888876 55666663
Q ss_pred cCCccccccceeeccCCceEeeccccc
Q 000473 742 NGNTSVSSLLLPIHEDGTFRQSQIQND 768 (1471)
Q Consensus 742 ~g~~~~s~~l~~~~~D~tir~w~l~~~ 768 (1471)
+|.. ++....|++++.|++...
T Consensus 345 ~~~~-----ll~~~~d~~~~~w~l~~~ 366 (456)
T KOG0266|consen 345 NGKY-----LLSASLDRTLKLWDLRSG 366 (456)
T ss_pred CCcE-----EEEecCCCeEEEEEccCC
Confidence 1222 334445889999998643
No 46
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.83 E-value=9e-20 Score=225.93 Aligned_cols=224 Identities=21% Similarity=0.207 Sum_probs=190.7
Q ss_pred eeecccccCccccccccCCCCCCCcccc-ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDD-FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~-~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
.+...+.++++++|+..+. ..... +.+|.+.|.++.+.+... .+++|+.|.+++| |
T Consensus 220 ~~~~~s~~~tl~~~~~~~~----~~i~~~l~GH~g~V~~l~~~~~~~----~lvsgS~D~t~rv---------------W 276 (537)
T KOG0274|consen 220 FFKSGSDDSTLHLWDLNNG----YLILTRLVGHFGGVWGLAFPSGGD----KLVSGSTDKTERV---------------W 276 (537)
T ss_pred eEEecCCCceeEEeecccc----eEEEeeccCCCCCceeEEEecCCC----EEEEEecCCcEEe---------------E
Confidence 4566677888999998774 44555 899999999998776444 7999999999999 4
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
|..++.+..++.||.+.|.|+... +.++++||.|.+|++|++.++.+++.+.+|.++|.++.+.
T Consensus 277 d~~sg~C~~~l~gh~stv~~~~~~-----------~~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~----- 340 (537)
T KOG0274|consen 277 DCSTGECTHSLQGHTSSVRCLTID-----------PFLLVSGSRDNTVKVWDVTNGACLNLLRGHTGPVNCVQLD----- 340 (537)
T ss_pred ecCCCcEEEEecCCCceEEEEEcc-----------CceEeeccCCceEEEEeccCcceEEEeccccccEEEEEec-----
Confidence 466889999999999999999875 4678999999999999999999999999999999999986
Q ss_pred CCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEE
Q 000473 636 HPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERV 714 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~ 714 (1471)
+..+++|+.|++|++||+.++++++.+.||..+|+++.+.+. .++++|+.| ++|++||++++ +++.+
T Consensus 341 ---~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~V~sl~~~~~-~~~~Sgs~D--------~~IkvWdl~~~~~c~~t 408 (537)
T KOG0274|consen 341 ---EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTGRVYSLIVDSE-NRLLSGSLD--------TTIKVWDLRTKRKCIHT 408 (537)
T ss_pred ---CCEEEEEecCceEEEEEhhhceeeeeecCCcceEEEEEecCc-ceEEeeeec--------cceEeecCCchhhhhhh
Confidence 349999999999999999999999999999999999988765 889999999 99999999999 99999
Q ss_pred EeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 715 LRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 715 l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
+.+|.+-+..+.+- ++. ++....|+++++|+....+.
T Consensus 409 l~~h~~~v~~l~~~--------------~~~-----Lvs~~aD~~Ik~WD~~~~~~ 445 (537)
T KOG0274|consen 409 LQGHTSLVSSLLLR--------------DNF-----LVSSSADGTIKLWDAEEGEC 445 (537)
T ss_pred hcCCcccccccccc--------------cce-----eEeccccccEEEeecccCce
Confidence 99999877444221 122 33444599999998765543
No 47
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.83 E-value=1.3e-19 Score=198.28 Aligned_cols=208 Identities=17% Similarity=0.153 Sum_probs=175.7
Q ss_pred cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 504 DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 504 ~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
.+.+|.+.|....+.|+.. .+++|+.|..|.+|+-. ..-+...+++||.++|..+.|.+|
T Consensus 42 ~l~gh~geI~~~~F~P~gs----~~aSgG~Dr~I~LWnv~--------------gdceN~~~lkgHsgAVM~l~~~~d-- 101 (338)
T KOG0265|consen 42 LLPGHKGEIYTIKFHPDGS----CFASGGSDRAIVLWNVY--------------GDCENFWVLKGHSGAVMELHGMRD-- 101 (338)
T ss_pred hcCCCcceEEEEEECCCCC----eEeecCCcceEEEEecc--------------ccccceeeeccccceeEeeeeccC--
Confidence 3578999999998555544 79999999999995421 112456778899999999999997
Q ss_pred CcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE
Q 000473 584 TAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER 663 (1471)
Q Consensus 584 ~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~ 663 (1471)
+..++|+|.|.+|+.||+++|++++++++|.+-|.++. |. ++...+++|++.|++++|||+++..+++
T Consensus 102 -------~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~~vNs~~--p~---rrg~~lv~SgsdD~t~kl~D~R~k~~~~ 169 (338)
T KOG0265|consen 102 -------GSHILSCGTDKTVRGWDAETGKRIRKHKGHTSFVNSLD--PS---RRGPQLVCSGSDDGTLKLWDIRKKEAIK 169 (338)
T ss_pred -------CCEEEEecCCceEEEEecccceeeehhccccceeeecC--cc---ccCCeEEEecCCCceEEEEeecccchhh
Confidence 89999999999999999999999999999999999987 43 1225688999999999999999999888
Q ss_pred EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcC
Q 000473 664 MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNG 743 (1471)
Q Consensus 664 ~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g 743 (1471)
+++ ...+++++.|.-++..+++|+-| +.|++||++.+....+++||...++.+... ..|
T Consensus 170 t~~-~kyqltAv~f~d~s~qv~sggId--------n~ikvWd~r~~d~~~~lsGh~DtIt~lsls------------~~g 228 (338)
T KOG0265|consen 170 TFE-NKYQLTAVGFKDTSDQVISGGID--------NDIKVWDLRKNDGLYTLSGHADTITGLSLS------------RYG 228 (338)
T ss_pred ccc-cceeEEEEEecccccceeecccc--------CceeeeccccCcceEEeecccCceeeEEec------------cCC
Confidence 875 34569999999999999999999 999999999999999999999999887444 345
Q ss_pred CccccccceeeccCCceEeecccccc
Q 000473 744 NTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 744 ~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
+...+..+ |.++|+|+++.+-
T Consensus 229 s~llsnsM-----d~tvrvwd~rp~~ 249 (338)
T KOG0265|consen 229 SFLLSNSM-----DNTVRVWDVRPFA 249 (338)
T ss_pred Cccccccc-----cceEEEEEecccC
Confidence 66666666 9999999986554
No 48
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.83 E-value=9.3e-20 Score=219.88 Aligned_cols=233 Identities=20% Similarity=0.187 Sum_probs=194.5
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
.++..++.+++||..= +.+...|.+|.++|..+.+.+... .+|+|+.|-.|+||++.
T Consensus 25 LtslHsG~IQlWDYRM----~tli~rFdeHdGpVRgv~FH~~qp----lFVSGGDDykIkVWnYk--------------- 81 (1202)
T KOG0292|consen 25 LTSLHSGVIQLWDYRM----GTLIDRFDEHDGPVRGVDFHPTQP----LFVSGGDDYKIKVWNYK--------------- 81 (1202)
T ss_pred EEeecCceeeeehhhh----hhHHhhhhccCCccceeeecCCCC----eEEecCCccEEEEEecc---------------
Confidence 3455678999999754 367788999999999998444444 89999999999995554
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+.++..+|.||-+.|..+.||+. -.+++|+|.|.||++|+..+++++..+.||...|.|..|+|.
T Consensus 82 ~rrclftL~GHlDYVRt~~FHhe---------yPWIlSASDDQTIrIWNwqsr~~iavltGHnHYVMcAqFhpt------ 146 (1202)
T KOG0292|consen 82 TRRCLFTLLGHLDYVRTVFFHHE---------YPWILSASDDQTIRIWNWQSRKCIAVLTGHNHYVMCAQFHPT------ 146 (1202)
T ss_pred cceehhhhccccceeEEeeccCC---------CceEEEccCCCeEEEEeccCCceEEEEecCceEEEeeccCCc------
Confidence 56889999999999999999997 689999999999999999999999999999999999999997
Q ss_pred CCEEEEEeCCCcEEEEECCCC---------------------------c--EEEEecCCCCCcEEEEEcCCCCEEEEEEc
Q 000473 639 SDCFLSVGEDFSVALASLETL---------------------------R--VERMFPGHPNYPAKVVWDCPRGYIACLCR 689 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~---------------------------~--~l~~~~gh~~~V~~v~~spdg~~L~sgs~ 689 (1471)
.+.++|+|-|.+||+||+... . ..+.+.||...|+-++|+|.-..|++|+.
T Consensus 147 EDlIVSaSLDQTVRVWDisGLRkk~~~pg~~e~~~~~~~~~~dLfg~~DaVVK~VLEGHDRGVNwaAfhpTlpliVSG~D 226 (1202)
T KOG0292|consen 147 EDLIVSASLDQTVRVWDISGLRKKNKAPGSLEDQMRGQQGNSDLFGQTDAVVKHVLEGHDRGVNWAAFHPTLPLIVSGAD 226 (1202)
T ss_pred cceEEEecccceEEEEeecchhccCCCCCCchhhhhccccchhhcCCcCeeeeeeecccccccceEEecCCcceEEecCC
Confidence 789999999999999998531 1 12456799999999999999999999999
Q ss_pred CCCCCCCCCCEEEEEECCCCe--EEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 690 DHSRTSDAVDVLFIWDVKTGA--RERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 690 D~sg~~D~~gtV~VWDi~tg~--~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
| ..|++|.+..-+ .+.+.+||...|.++-|.+.. ..+++.++|+++|+|++..
T Consensus 227 D--------RqVKlWrmnetKaWEvDtcrgH~nnVssvlfhp~q-----------------~lIlSnsEDksirVwDm~k 281 (1202)
T KOG0292|consen 227 D--------RQVKLWRMNETKAWEVDTCRGHYNNVSSVLFHPHQ-----------------DLILSNSEDKSIRVWDMTK 281 (1202)
T ss_pred c--------ceeeEEEeccccceeehhhhcccCCcceEEecCcc-----------------ceeEecCCCccEEEEeccc
Confidence 8 999999986543 356789999999998787531 2355667799999999865
Q ss_pred ccccccc
Q 000473 768 DERGVAF 774 (1471)
Q Consensus 768 ~~~~~~~ 774 (1471)
...-..+
T Consensus 282 Rt~v~tf 288 (1202)
T KOG0292|consen 282 RTSVQTF 288 (1202)
T ss_pred ccceeee
Confidence 4433333
No 49
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.82 E-value=7.7e-17 Score=185.79 Aligned_cols=485 Identities=13% Similarity=0.101 Sum_probs=286.8
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCC--------cEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccc
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDG--------SILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEH 85 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG--------~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~ 85 (1471)
.|+..|-|++++||.-.+++|-..| .|++||-.+ ...+..+-+-...|+|++| .-
T Consensus 102 GH~ddikc~~vHPdri~vatGQ~ag~~g~~~~phvriWdsv~-----L~TL~V~g~f~~GV~~vaF--sk---------- 164 (626)
T KOG2106|consen 102 GHNDDIKCMAVHPDRIRVATGQGAGTSGRPLQPHVRIWDSVT-----LSTLHVIGFFDRGVTCVAF--SK---------- 164 (626)
T ss_pred CCCCceEEEeecCCceeeccCcccccCCCcCCCeeeeccccc-----ceeeeeeccccccceeeee--cc----------
Confidence 5667899999999988888885444 599999663 3445555566678999996 10
Q ss_pred ccccccccccccccCCCCEEEE--EeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEE-EcCCCCeEE-EEcceecccCC
Q 000473 86 WKAENSSNVMGKSSLDNGALIS--ACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVIC-TLPSNPRYV-CIGCCFIDTNQ 161 (1471)
Q Consensus 86 ~~~~~~~~~~~~~s~d~~~LaS--as~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~-~~s~~~~ll-~~G~~~id~~~ 161 (1471)
...+.+|.. -+.+--|.|||+..++.....+-. ......+ +.|.+..++ .+|.
T Consensus 165 -------------~~~G~~l~~vD~s~~h~lSVWdWqk~~~~~~vk~s---ne~v~~a~FHPtd~nliit~Gk------- 221 (626)
T KOG2106|consen 165 -------------INGGSLLCAVDDSNPHMLSVWDWQKKAKLGPVKTS---NEVVFLATFHPTDPNLIITCGK------- 221 (626)
T ss_pred -------------cCCCceEEEecCCCccccchhhchhhhccCcceec---cceEEEEEeccCCCcEEEEeCC-------
Confidence 112233332 355667999999977644332211 1112233 445555444 4454
Q ss_pred cccccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEe--ecCccccCCeEEEEEeeecCCCCceeEEEEeCC
Q 000473 162 LSDHHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTV--FHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSV 239 (1471)
Q Consensus 162 ~~~~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl--~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~d 239 (1471)
+.+..|+..++.+..+. +.+... ..|.+++|.+ + ++++.|+++
T Consensus 222 ----------------------------~H~~Fw~~~~~~l~k~~~~fek~ek-k~Vl~v~F~e---n---gdviTgDS~ 266 (626)
T KOG2106|consen 222 ----------------------------GHLYFWTLRGGSLVKRQGIFEKREK-KFVLCVTFLE---N---GDVITGDSG 266 (626)
T ss_pred ----------------------------ceEEEEEccCCceEEEeeccccccc-eEEEEEEEcC---C---CCEEeecCC
Confidence 56778888777665543 333333 4588888873 3 459999999
Q ss_pred CcEEEEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCc-ceeeeeee
Q 000473 240 GRLQLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGS-TIGEICFV 318 (1471)
Q Consensus 240 G~V~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~-~ige~~~~ 318 (1471)
|.|.||+..... +. +++ . +|++++..+...-+|.++. +.+++- +-+||..- .+.++..|
T Consensus 267 G~i~Iw~~~~~~---------~~---k~~-----~-aH~ggv~~L~~lr~GtllS-GgKDRk-i~~Wd~~y~k~r~~elP 326 (626)
T KOG2106|consen 267 GNILIWSKGTNR---------IS---KQV-----H-AHDGGVFSLCMLRDGTLLS-GGKDRK-IILWDDNYRKLRETELP 326 (626)
T ss_pred ceEEEEeCCCce---------EE---eEe-----e-ecCCceEEEEEecCccEee-cCccce-EEeccccccccccccCc
Confidence 999999986541 11 122 2 4788888888777888776 666653 33566321 22222222
Q ss_pred cceeEeecCCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecCccCCCC
Q 000473 319 DNLFCLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPAVSYPSG 398 (1471)
Q Consensus 319 ~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~v~~~~~ 398 (1471)
+.. ..+-...-.. ..+.|++....+
T Consensus 327 e~~----------G~iRtv~e~~-----------------~di~vGTtrN~i---------------------------- 351 (626)
T KOG2106|consen 327 EQF----------GPIRTVAEGK-----------------GDILVGTTRNFI---------------------------- 351 (626)
T ss_pred hhc----------CCeeEEecCC-----------------CcEEEeeccceE----------------------------
Confidence 110 0000000000 012222111000
Q ss_pred ceeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCccccee
Q 000473 399 VKFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKS 478 (1471)
Q Consensus 399 ~~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l 478 (1471)
|++..+. ++. .+.+|.....|-....+..+ .+
T Consensus 352 --------------------------------------L~Gt~~~-~f~--~~v~gh~delwgla~hps~~-------q~ 383 (626)
T KOG2106|consen 352 --------------------------------------LQGTLEN-GFT--LTVQGHGDELWGLATHPSKN-------QL 383 (626)
T ss_pred --------------------------------------EEeeecC-Cce--EEEEecccceeeEEcCCChh-------he
Confidence 0010000 000 11111111234333211111 36
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC-EEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY-AIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~-~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
.+...++.+++|+ ..++ .-+ ..-.....|.. |+|. .++.|+..|.-.|.+ .
T Consensus 384 ~T~gqdk~v~lW~--~~k~----~wt-~~~~d~~~~~~------fhpsg~va~Gt~~G~w~V~d---------------~ 435 (626)
T KOG2106|consen 384 LTCGQDKHVRLWN--DHKL----EWT-KIIEDPAECAD------FHPSGVVAVGTATGRWFVLD---------------T 435 (626)
T ss_pred eeccCcceEEEcc--CCce----eEE-EEecCceeEee------ccCcceEEEeeccceEEEEe---------------c
Confidence 6777889999999 2211 101 11123445544 7776 689999999888732 3
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc-eEEEE-eccCCCEEEEEECCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN-LITVM-HHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~-~l~~~-~~H~~~V~~l~fspd~~~ 635 (1471)
++...+..-.. +.++++++|+|+ +.+|+.|+.|+.|.++-+.... ...+. +.|..+|+.+.|++|
T Consensus 436 e~~~lv~~~~d-~~~ls~v~ysp~---------G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~~gs~ithLDwS~D--- 502 (626)
T KOG2106|consen 436 ETQDLVTIHTD-NEQLSVVRYSPD---------GAFLAVGSHDNHIYIYRVSANGRKYSRVGKCSGSPITHLDWSSD--- 502 (626)
T ss_pred ccceeEEEEec-CCceEEEEEcCC---------CCEEEEecCCCeEEEEEECCCCcEEEEeeeecCceeEEeeecCC---
Confidence 33333333333 789999999998 9999999999999999987543 33222 235589999999999
Q ss_pred CCCCCEEEEEeCCCcEEEEECCCCc--------------EEEEe----cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCC
Q 000473 636 HPWSDCFLSVGEDFSVALASLETLR--------------VERMF----PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDA 697 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t~~--------------~l~~~----~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~ 697 (1471)
++++.+.+.|-.|..|....-+ |..-| ..|...|..++-+.+++.+++|...
T Consensus 503 ---s~~~~~~S~d~eiLyW~~~~~~~~ts~kDvkW~t~~c~lGF~v~g~s~~t~i~a~~rs~~~~~lA~gdd~------- 572 (626)
T KOG2106|consen 503 ---SQFLVSNSGDYEILYWKPSECKQITSVKDVKWATYTCTLGFEVFGGSDGTDINAVARSHCEKLLASGDDF------- 572 (626)
T ss_pred ---CceEEeccCceEEEEEccccCcccceecceeeeeeEEEEEEEEecccCCchHHHhhhhhhhhhhhccccC-------
Confidence 9999999999999999443221 11111 1233455666666667777766665
Q ss_pred CCEEEEEECCC---CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 698 VDVLFIWDVKT---GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 698 ~gtV~VWDi~t---g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
|+|+++...- ...-+.+.||++++..+.|...+.. +...-.|.++.+|.+
T Consensus 573 -g~v~lf~yPc~s~rA~~he~~ghs~~vt~V~Fl~~d~~-----------------li~tg~D~Si~qW~l 625 (626)
T KOG2106|consen 573 -GKVHLFSYPCSSPRAPSHEYGGHSSHVTNVAFLCKDSH-----------------LISTGKDTSIMQWRL 625 (626)
T ss_pred -ceEEEEccccCCCcccceeeccccceeEEEEEeeCCce-----------------EEecCCCceEEEEEe
Confidence 9999998753 3456788999999999988733221 222234888899975
No 50
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.82 E-value=2.2e-17 Score=195.38 Aligned_cols=188 Identities=14% Similarity=0.170 Sum_probs=150.8
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
++..+.+..+.++|...+- .+.+++.+|...||++.+..... ..+.++++.|..|.....+. .
T Consensus 519 LASasrdRlIHV~Dv~rny---~l~qtld~HSssITsvKFa~~gl--n~~MiscGADksimFr~~qk------------~ 581 (1080)
T KOG1408|consen 519 LASASRDRLIHVYDVKRNY---DLVQTLDGHSSSITSVKFACNGL--NRKMISCGADKSIMFRVNQK------------A 581 (1080)
T ss_pred hhhccCCceEEEEeccccc---chhhhhcccccceeEEEEeecCC--ceEEEeccCchhhheehhcc------------c
Confidence 5566777889999986652 46788899999999998655442 01677777888776422220 0
Q ss_pred CCcceEEEEecC-----CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec---cCCCEEEEEE
Q 000473 558 NSHVSRQYFLGH-----TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH---HVAPVRQIIL 629 (1471)
Q Consensus 558 ~s~~~~~~l~gH-----~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~---H~~~V~~l~f 629 (1471)
.++ ..|..| ...++.+++.|. .+++++++.|..|++||+.+|+..+.|++ |.|....+..
T Consensus 582 ~~g---~~f~r~t~t~~ktTlYDm~Vdp~---------~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~l 649 (1080)
T KOG1408|consen 582 SSG---RLFPRHTQTLSKTTLYDMAVDPT---------SKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVIL 649 (1080)
T ss_pred cCc---eeccccccccccceEEEeeeCCC---------cceEEEEecccceEEEeccccceeeeecccccCCCceEEEEE
Confidence 011 112222 235788899886 78999999999999999999999999986 6678888999
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.|. |.++++.+.|+++.++|.-+|+|+..+.||...|+.+.|.+|-++|++.+.| |.|.||.+..
T Consensus 650 DPS------giY~atScsdktl~~~Df~sgEcvA~m~GHsE~VTG~kF~nDCkHlISvsgD--------gCIFvW~lp~ 714 (1080)
T KOG1408|consen 650 DPS------GIYLATSCSDKTLCFVDFVSGECVAQMTGHSEAVTGVKFLNDCKHLISVSGD--------GCIFVWKLPL 714 (1080)
T ss_pred CCC------ccEEEEeecCCceEEEEeccchhhhhhcCcchheeeeeecccchhheeecCC--------ceEEEEECch
Confidence 998 9999999999999999999999999999999999999999999999999888 9999999854
No 51
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.82 E-value=2.1e-20 Score=210.91 Aligned_cols=217 Identities=15% Similarity=0.200 Sum_probs=176.0
Q ss_pred cccCccccccccCCCCCCCccccccccC-ccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 482 FCQDTVPRSEHVDSRQAGDGRDDFVHKE-KIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 482 ~~~~~v~~Wd~~~~~~~g~~~~~~~~h~-~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
...+.+|.|+..- +..+.+.+|. ..|++++ |+|+ .+++++.||+|+||++.
T Consensus 157 D~gG~iKyWqpnm-----nnVk~~~ahh~eaIRdla------fSpnDskF~t~SdDg~ikiWdf~--------------- 210 (464)
T KOG0284|consen 157 DKGGMIKYWQPNM-----NNVKIIQAHHAEAIRDLA------FSPNDSKFLTCSDDGTIKIWDFR--------------- 210 (464)
T ss_pred CCCceEEecccch-----hhhHHhhHhhhhhhheec------cCCCCceeEEecCCCeEEEEecc---------------
Confidence 3457899998643 2344455555 8899988 6665 89999999999994332
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
-.+..+.|.||.-.|.|++|||. ..+++|||.|..|++||.++|+++.++.+|...|.++.|+|+
T Consensus 211 ~~kee~vL~GHgwdVksvdWHP~---------kgLiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n------ 275 (464)
T KOG0284|consen 211 MPKEERVLRGHGWDVKSVDWHPT---------KGLIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPN------ 275 (464)
T ss_pred CCchhheeccCCCCcceeccCCc---------cceeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCC------
Confidence 23456778999999999999997 789999999999999999999999999999999999999999
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE-EEe
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDVKTGARER-VLR 716 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~-~l~ 716 (1471)
+++|+|+|.|.+++++|+++.+.++.+++|...|+++.|+|-. .+|.+||.| |.|..|.+..-+.+. .-.
T Consensus 276 ~N~Llt~skD~~~kv~DiR~mkEl~~~r~Hkkdv~~~~WhP~~~~lftsgg~D--------gsvvh~~v~~~~p~~~i~~ 347 (464)
T KOG0284|consen 276 GNWLLTGSKDQSCKVFDIRTMKELFTYRGHKKDVTSLTWHPLNESLFTSGGSD--------GSVVHWVVGLEEPLGEIPP 347 (464)
T ss_pred CCeeEEccCCceEEEEehhHhHHHHHhhcchhhheeeccccccccceeeccCC--------CceEEEeccccccccCCCc
Confidence 8999999999999999999999999999999999999999964 567777777 999999998444443 345
Q ss_pred CCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 717 GTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 717 gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
+|...+....+.| +. .... +-+.|.++|.|.
T Consensus 348 AHd~~iwsl~~hP------lG------hil~-----tgsnd~t~rfw~ 378 (464)
T KOG0284|consen 348 AHDGEIWSLAYHP------LG------HILA-----TGSNDRTVRFWT 378 (464)
T ss_pred ccccceeeeeccc------cc------eeEe-----ecCCCcceeeec
Confidence 7877787776663 11 1111 223389999996
No 52
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.82 E-value=1.7e-19 Score=207.60 Aligned_cols=162 Identities=20% Similarity=0.250 Sum_probs=131.9
Q ss_pred CCCceEEEEEEcC-CCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccc
Q 000473 14 PPSHRVTATSALT-QPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSS 92 (1471)
Q Consensus 14 ~p~h~Vtava~Sp-Dg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~ 92 (1471)
.++--|+++.|.| .+.+|++|+.|+.|+|||+-. ....+.++.||+.+|.+++
T Consensus 212 gH~kgvsai~~fp~~~hLlLS~gmD~~vklW~vy~----~~~~lrtf~gH~k~Vrd~~---------------------- 265 (503)
T KOG0282|consen 212 GHTKGVSAIQWFPKKGHLLLSGGMDGLVKLWNVYD----DRRCLRTFKGHRKPVRDAS---------------------- 265 (503)
T ss_pred CCccccchhhhccceeeEEEecCCCceEEEEEEec----Ccceehhhhcchhhhhhhh----------------------
Confidence 3446699999999 899999999999999999983 3567788999999999998
Q ss_pred cccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccc
Q 000473 93 NVMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVE 172 (1471)
Q Consensus 93 ~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~ 172 (1471)
++.++..|.|+|-|+.|++||+++|+|+.+..+. ..|..+.+.+.+..++.+|..
T Consensus 266 -----~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~---~~~~cvkf~pd~~n~fl~G~s----------------- 320 (503)
T KOG0282|consen 266 -----FNNCGTSFLSASFDRFLKLWDTETGQVLSRFHLD---KVPTCVKFHPDNQNIFLVGGS----------------- 320 (503)
T ss_pred -----ccccCCeeeeeecceeeeeeccccceEEEEEecC---CCceeeecCCCCCcEEEEecC-----------------
Confidence 6788888999999999999999999999988774 344455555555577777776
Q ss_pred ccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCC
Q 000473 173 GDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKE 250 (1471)
Q Consensus 173 ~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~ 250 (1471)
++.|+-||..++++++..-. ++ +.|..+.|.+ +++ ..+.++.|+.+++|+...+
T Consensus 321 ----------------d~ki~~wDiRs~kvvqeYd~-hL--g~i~~i~F~~---~g~--rFissSDdks~riWe~~~~ 374 (503)
T KOG0282|consen 321 ----------------DKKIRQWDIRSGKVVQEYDR-HL--GAILDITFVD---EGR--RFISSSDDKSVRIWENRIP 374 (503)
T ss_pred ----------------CCcEEEEeccchHHHHHHHh-hh--hheeeeEEcc---CCc--eEeeeccCccEEEEEcCCC
Confidence 48999999999998877665 43 3488888883 444 3677799999999998765
No 53
>PTZ00421 coronin; Provisional
Probab=99.81 E-value=3e-18 Score=212.09 Aligned_cols=217 Identities=16% Similarity=0.230 Sum_probs=163.1
Q ss_pred ccccCccEEEEEeec-cccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 505 FVHKEKIVSSSMVIS-ESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is-~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
+.+|.+.|+++.+.+ +. +.+++|+.||+|++|+... .+ .......++..+.+|...|.+++|+|+
T Consensus 71 l~GH~~~V~~v~fsP~d~----~~LaSgS~DgtIkIWdi~~--~~------~~~~~~~~l~~L~gH~~~V~~l~f~P~-- 136 (493)
T PTZ00421 71 LLGQEGPIIDVAFNPFDP----QKLFTASEDGTIMGWGIPE--EG------LTQNISDPIVHLQGHTKKVGIVSFHPS-- 136 (493)
T ss_pred EeCCCCCEEEEEEcCCCC----CEEEEEeCCCEEEEEecCC--Cc------cccccCcceEEecCCCCcEEEEEeCcC--
Confidence 678999999998444 22 2799999999999954431 00 000112456789999999999999996
Q ss_pred CcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE
Q 000473 584 TAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER 663 (1471)
Q Consensus 584 ~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~ 663 (1471)
.+.+|++|+.|++|++||+.+++.+..+.+|...|.++.|+|+ +..|++++.|++|++||+++++.+.
T Consensus 137 ------~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spd------G~lLatgs~Dg~IrIwD~rsg~~v~ 204 (493)
T PTZ00421 137 ------AMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLD------GSLLCTTSKDKKLNIIDPRDGTIVS 204 (493)
T ss_pred ------CCCEEEEEeCCCEEEEEECCCCeEEEEEcCCCCceEEEEEECC------CCEEEEecCCCEEEEEECCCCcEEE
Confidence 1579999999999999999999999999999999999999999 9999999999999999999999999
Q ss_pred EecCCCCC-cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe-EEEEEeCCCCCceeeeeeeccccccccceEE
Q 000473 664 MFPGHPNY-PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA-RERVLRGTASHSMFDHFCKGISMNSISGSVL 741 (1471)
Q Consensus 664 ~~~gh~~~-V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~-~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~ 741 (1471)
.+.+|.+. +..+.|.+++..+++++.+ ++.| +.|++||+++.. .+.....+....+...+.+
T Consensus 205 tl~~H~~~~~~~~~w~~~~~~ivt~G~s--~s~D--r~VklWDlr~~~~p~~~~~~d~~~~~~~~~~d------------ 268 (493)
T PTZ00421 205 SVEAHASAKSQRCLWAKRKDLIITLGCS--KSQQ--RQIMLWDTRKMASPYSTVDLDQSSALFIPFFD------------ 268 (493)
T ss_pred EEecCCCCcceEEEEcCCCCeEEEEecC--CCCC--CeEEEEeCCCCCCceeEeccCCCCceEEEEEc------------
Confidence 99999875 4578899998888877643 2233 899999998754 4444444443333322221
Q ss_pred cCCccccccceeec-cCCceEeecccc
Q 000473 742 NGNTSVSSLLLPIH-EDGTFRQSQIQN 767 (1471)
Q Consensus 742 ~g~~~~s~~l~~~~-~D~tir~w~l~~ 767 (1471)
.+.. .++... .|+++|.|++.+
T Consensus 269 ---~d~~-~L~lggkgDg~Iriwdl~~ 291 (493)
T PTZ00421 269 ---EDTN-LLYIGSKGEGNIRCFELMN 291 (493)
T ss_pred ---CCCC-EEEEEEeCCCeEEEEEeeC
Confidence 0111 122222 489999999854
No 54
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.81 E-value=4.7e-19 Score=198.76 Aligned_cols=232 Identities=16% Similarity=0.197 Sum_probs=185.2
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+.++++++++++|+.. |.....+.+|.+.|.++..+..+. ....+++++.|.++++|.|+. +.
T Consensus 117 ~IltgsYDg~~riWd~~-----Gk~~~~~~Ght~~ik~v~~v~~n~-~~~~fvsas~Dqtl~Lw~~~~-------~~--- 180 (423)
T KOG0313|consen 117 WILTGSYDGTSRIWDLK-----GKSIKTIVGHTGPIKSVAWVIKNS-SSCLFVSASMDQTLRLWKWNV-------GE--- 180 (423)
T ss_pred eEEEeecCCeeEEEecC-----CceEEEEecCCcceeeeEEEecCC-ccceEEEecCCceEEEEEecC-------ch---
Confidence 36788899999999975 577788899999999765544332 011599999999999988762 11
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC-------------------------CC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG-------------------------SG 611 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~-------------------------tg 611 (1471)
..-+..+.-.||...|-++...++ +..++|||.|.++++|+.. ++
T Consensus 181 -~~~~~~~~~~GHk~~V~sVsv~~s---------gtr~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r 250 (423)
T KOG0313|consen 181 -NKVKALKVCRGHKRSVDSVSVDSS---------GTRFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQKREKEGGTR 250 (423)
T ss_pred -hhhhHHhHhcccccceeEEEecCC---------CCeEEeecccceeeecccCCCccccccccchhhhhhhhhhhccccc
Confidence 011223334599999999999887 8999999999999999932 12
Q ss_pred ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCC
Q 000473 612 NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDH 691 (1471)
Q Consensus 612 ~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~ 691 (1471)
.++-.+.+|+.+|.++.|++. ..++|++.|++|+.||+.++++...+.+. ...+++..+|..++|++||.|
T Consensus 251 ~P~vtl~GHt~~Vs~V~w~d~-------~v~yS~SwDHTIk~WDletg~~~~~~~~~-ksl~~i~~~~~~~Ll~~gssd- 321 (423)
T KOG0313|consen 251 TPLVTLEGHTEPVSSVVWSDA-------TVIYSVSWDHTIKVWDLETGGLKSTLTTN-KSLNCISYSPLSKLLASGSSD- 321 (423)
T ss_pred CceEEecccccceeeEEEcCC-------CceEeecccceEEEEEeecccceeeeecC-cceeEeecccccceeeecCCC-
Confidence 366788999999999999885 58899999999999999999998887654 458999999999999999999
Q ss_pred CCCCCCCCEEEEEECCCCe---EEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 692 SRTSDAVDVLFIWDVKTGA---RERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 692 sg~~D~~gtV~VWDi~tg~---~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
..+++||.+++. ..+.+.||++-|-.+.+||....+-+ +.+-|+++|.|+++
T Consensus 322 -------r~irl~DPR~~~gs~v~~s~~gH~nwVssvkwsp~~~~~~~----------------S~S~D~t~klWDvR 376 (423)
T KOG0313|consen 322 -------RHIRLWDPRTGDGSVVSQSLIGHKNWVSSVKWSPTNEFQLV----------------SGSYDNTVKLWDVR 376 (423)
T ss_pred -------CceeecCCCCCCCceeEEeeecchhhhhheecCCCCceEEE----------------EEecCCeEEEEEec
Confidence 999999999863 35789999998888888876544433 23339999999974
No 55
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.80 E-value=1.2e-18 Score=216.00 Aligned_cols=222 Identities=20% Similarity=0.326 Sum_probs=188.3
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
++...+.+.++++||... |++..++.+|...|.++. ..+..++.|+.|.+|+| |+
T Consensus 263 ~lvsgS~D~t~rvWd~~s----g~C~~~l~gh~stv~~~~------~~~~~~~sgs~D~tVkV---------------W~ 317 (537)
T KOG0274|consen 263 KLVSGSTDKTERVWDCST----GECTHSLQGHTSSVRCLT------IDPFLLVSGSRDNTVKV---------------WD 317 (537)
T ss_pred EEEEEecCCcEEeEecCC----CcEEEEecCCCceEEEEE------ccCceEeeccCCceEEE---------------Ee
Confidence 466777899999999655 688999999999999976 34447888999999999 34
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
+++++.++.+.||.++|.|+.++ +.++++|+.|++|++||+.++++++++.+|.+.|.++.+.+.
T Consensus 318 v~n~~~l~l~~~h~~~V~~v~~~-----------~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~V~sl~~~~~---- 382 (537)
T KOG0274|consen 318 VTNGACLNLLRGHTGPVNCVQLD-----------EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTGRVYSLIVDSE---- 382 (537)
T ss_pred ccCcceEEEeccccccEEEEEec-----------CCEEEEEecCceEEEEEhhhceeeeeecCCcceEEEEEecCc----
Confidence 55788999999999999999986 689999999999999999999999999999999999988662
Q ss_pred CCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 637 PWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
..+++|+.|++|++||+++. +|+..+.+|..-|..+.+ .+++|++++.| ++|++||..++++++++
T Consensus 383 ---~~~~Sgs~D~~IkvWdl~~~~~c~~tl~~h~~~v~~l~~--~~~~Lvs~~aD--------~~Ik~WD~~~~~~~~~~ 449 (537)
T KOG0274|consen 383 ---NRLLSGSLDTTIKVWDLRTKRKCIHTLQGHTSLVSSLLL--RDNFLVSSSAD--------GTIKLWDAEEGECLRTL 449 (537)
T ss_pred ---ceEEeeeeccceEeecCCchhhhhhhhcCCccccccccc--ccceeEecccc--------ccEEEeecccCceeeee
Confidence 48999999999999999999 999999999998866655 47899999998 99999999999999999
Q ss_pred eC-CCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 716 RG-TASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 716 ~g-H~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
.+ |...+....+. ...++....|++++.|+++....
T Consensus 450 ~~~~~~~v~~l~~~-------------------~~~il~s~~~~~~~l~dl~~~~~ 486 (537)
T KOG0274|consen 450 EGRHVGGVSALALG-------------------KEEILCSSDDGSVKLWDLRSGTL 486 (537)
T ss_pred ccCCcccEEEeecC-------------------cceEEEEecCCeeEEEecccCch
Confidence 99 55656554221 22345566799999999865443
No 56
>PTZ00420 coronin; Provisional
Probab=99.80 E-value=9.8e-18 Score=208.55 Aligned_cols=236 Identities=11% Similarity=0.077 Sum_probs=169.4
Q ss_pred ccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcce
Q 000473 483 CQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVS 562 (1471)
Q Consensus 483 ~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~ 562 (1471)
..+.+++|+.... .....+.+|...|.++.+.+.. ++.+++|+.||+|+||+.. .+. ... .....+
T Consensus 52 ~~gvI~L~~~~r~----~~v~~L~gH~~~V~~lafsP~~---~~lLASgS~DgtIrIWDi~--t~~----~~~-~~i~~p 117 (568)
T PTZ00420 52 LIGAIRLENQMRK----PPVIKLKGHTSSILDLQFNPCF---SEILASGSEDLTIRVWEIP--HND----ESV-KEIKDP 117 (568)
T ss_pred ceeEEEeeecCCC----ceEEEEcCCCCCEEEEEEcCCC---CCEEEEEeCCCeEEEEECC--CCC----ccc-cccccc
Confidence 4567888887653 2345678999999999844421 1389999999999995433 110 000 000134
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
...+.+|.+.|.+++|+|+ ...+|++|+.|++|++||+.+++.+..+. |...|.++.|+|+ |..|
T Consensus 118 ~~~L~gH~~~V~sVaf~P~--------g~~iLaSgS~DgtIrIWDl~tg~~~~~i~-~~~~V~Slswspd------G~lL 182 (568)
T PTZ00420 118 QCILKGHKKKISIIDWNPM--------NYYIMCSSGFDSFVNIWDIENEKRAFQIN-MPKKLSSLKWNIK------GNLL 182 (568)
T ss_pred eEEeecCCCcEEEEEECCC--------CCeEEEEEeCCCeEEEEECCCCcEEEEEe-cCCcEEEEEECCC------CCEE
Confidence 5678999999999999997 13467899999999999999999887775 5678999999999 9999
Q ss_pred EEEeCCCcEEEEECCCCcEEEEecCCCCCcEE-----EEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEe
Q 000473 643 LSVGEDFSVALASLETLRVERMFPGHPNYPAK-----VVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLR 716 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~-----v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~ 716 (1471)
++++.|+.|+|||+++++.+..+.+|.+.+.. ..|++++.+|++++.| +.+ +++|+|||+++ ++++..+.
T Consensus 183 at~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d--~~~--~R~VkLWDlr~~~~pl~~~~ 258 (568)
T PTZ00420 183 SGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFS--KNN--MREMKLWDLKNTTSALVTMS 258 (568)
T ss_pred EEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcC--CCC--ccEEEEEECCCCCCceEEEE
Confidence 99999999999999999999999999876533 3456899999998877 111 25899999995 66666655
Q ss_pred CCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 717 GTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 717 gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
.+.+...+.-+.+ ..+|. .++....|+++|.|++.
T Consensus 259 ld~~~~~L~p~~D-----~~tg~----------l~lsGkGD~tIr~~e~~ 293 (568)
T PTZ00420 259 IDNASAPLIPHYD-----ESTGL----------IYLIGKGDGNCRYYQHS 293 (568)
T ss_pred ecCCccceEEeee-----CCCCC----------EEEEEECCCeEEEEEcc
Confidence 4443222221211 11121 23344469999999873
No 57
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.79 E-value=1.6e-19 Score=197.12 Aligned_cols=234 Identities=21% Similarity=0.240 Sum_probs=191.7
Q ss_pred CCCCcccceeecccccCccccccccCCCCCCC----ccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc
Q 000473 469 ENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGD----GRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL 544 (1471)
Q Consensus 469 ~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~----~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~ 544 (1471)
.+||. .+.+.+-++.+.+|+..+.+-... ....|.-+...|.|+.+..+.. .+++|+.||.|+||
T Consensus 222 SPDgq---yLvsgSvDGFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~FSRDsE----MlAsGsqDGkIKvW---- 290 (508)
T KOG0275|consen 222 SPDGQ---YLVSGSVDGFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCISFSRDSE----MLASGSQDGKIKVW---- 290 (508)
T ss_pred CCCCc---eEeeccccceeeeehhccchhhhhhhhhhhcceeecccceEEEeecccHH----HhhccCcCCcEEEE----
Confidence 45663 467778889999999998776543 2334556778999988655554 89999999999993
Q ss_pred cccCCCCCCccccCCcceEEEEe-cCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCC
Q 000473 545 FERHNSPGASLKVNSHVSRQYFL-GHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP 623 (1471)
Q Consensus 545 l~~~d~~~~~~d~~s~~~~~~l~-gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~ 623 (1471)
.+.+|.+++.|. .|+..|+|+.|+.| +..++|+|.|.++++--+.+|+++..|++|...
T Consensus 291 -----------ri~tG~ClRrFdrAHtkGvt~l~FSrD---------~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSy 350 (508)
T KOG0275|consen 291 -----------RIETGQCLRRFDRAHTKGVTCLSFSRD---------NSQILSASFDQTVRIHGLKSGKCLKEFRGHSSY 350 (508)
T ss_pred -----------EEecchHHHHhhhhhccCeeEEEEccC---------cchhhcccccceEEEeccccchhHHHhcCcccc
Confidence 356788888886 89999999999997 789999999999999999999999999999999
Q ss_pred EEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC-----------------------------------
Q 000473 624 VRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH----------------------------------- 668 (1471)
Q Consensus 624 V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh----------------------------------- 668 (1471)
|+...|.++ |+.+++++.||+|++|+.++.+|+.+|...
T Consensus 351 vn~a~ft~d------G~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsntv~imn~qG 424 (508)
T KOG0275|consen 351 VNEATFTDD------GHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSNTVYIMNMQG 424 (508)
T ss_pred ccceEEcCC------CCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCCeEEEEeccc
Confidence 999999999 999999999999999999987666555311
Q ss_pred ------------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccc
Q 000473 669 ------------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSI 736 (1471)
Q Consensus 669 ------------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~ 736 (1471)
.+...+.+.+|.|.++.+.++| +.+|.+.+.+|.+++++.-|...++...-.|..
T Consensus 425 QvVrsfsSGkREgGdFi~~~lSpkGewiYcigED--------~vlYCF~~~sG~LE~tl~VhEkdvIGl~HHPHq----- 491 (508)
T KOG0275|consen 425 QVVRSFSSGKREGGDFINAILSPKGEWIYCIGED--------GVLYCFSVLSGKLERTLPVHEKDVIGLTHHPHQ----- 491 (508)
T ss_pred eEEeeeccCCccCCceEEEEecCCCcEEEEEccC--------cEEEEEEeecCceeeeeecccccccccccCccc-----
Confidence 1122345779999999999999 999999999999999999999999987555421
Q ss_pred cceEEcCCccccccceeeccCCceEeec
Q 000473 737 SGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 737 sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
+.+-+-++|+.++.|.
T Consensus 492 ------------NllAsYsEDgllKLWk 507 (508)
T KOG0275|consen 492 ------------NLLASYSEDGLLKLWK 507 (508)
T ss_pred ------------chhhhhcccchhhhcC
Confidence 2233456799888884
No 58
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.79 E-value=1.5e-17 Score=180.02 Aligned_cols=206 Identities=17% Similarity=0.234 Sum_probs=171.8
Q ss_pred cccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE-ecCCccEEEEE
Q 000473 502 RDDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF-LGHTGAVLCLA 577 (1471)
Q Consensus 502 ~~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l-~gH~~~V~~la 577 (1471)
.+.+.+|.+++..++ ++|. .|++++.|..|+| |.+-. .++-.+...+ .+|+..|.+++
T Consensus 7 ~~~~~gh~~r~W~~a------whp~~g~ilAscg~Dk~vri--w~~~~----------~~s~~ck~vld~~hkrsVRsvA 68 (312)
T KOG0645|consen 7 EQKLSGHKDRVWSVA------WHPGKGVILASCGTDKAVRI--WSTSS----------GDSWTCKTVLDDGHKRSVRSVA 68 (312)
T ss_pred EEeecCCCCcEEEEE------eccCCceEEEeecCCceEEE--EecCC----------CCcEEEEEeccccchheeeeee
Confidence 355688999999988 5555 6999999999999 44210 1122344444 57999999999
Q ss_pred EecCCCCcccCcCCCEEEEEECCCcEEEEECCCC--ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE
Q 000473 578 AHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG--NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALAS 655 (1471)
Q Consensus 578 ~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg--~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd 655 (1471)
|+|. +++|++||.|.++.+|.-..+ +++.++.+|...|.+++|+++ |++||+++.|++|-+|.
T Consensus 69 wsp~---------g~~La~aSFD~t~~Iw~k~~~efecv~~lEGHEnEVK~Vaws~s------G~~LATCSRDKSVWiWe 133 (312)
T KOG0645|consen 69 WSPH---------GRYLASASFDATVVIWKKEDGEFECVATLEGHENEVKCVAWSAS------GNYLATCSRDKSVWIWE 133 (312)
T ss_pred ecCC---------CcEEEEeeccceEEEeecCCCceeEEeeeeccccceeEEEEcCC------CCEEEEeeCCCeEEEEE
Confidence 9997 899999999999999986644 588999999999999999999 99999999999999999
Q ss_pred CCC---CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC---CCeEEEEEeCCCCCceeeeeee
Q 000473 656 LET---LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK---TGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 656 l~t---~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~---tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
+.. .+|...+.+|...|..+.|+|....|++++.| .+|++|+-. .-++.+++.||...|....|.+
T Consensus 134 ~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYD--------nTIk~~~~~~dddW~c~~tl~g~~~TVW~~~F~~ 205 (312)
T KOG0645|consen 134 IDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYD--------NTIKVYRDEDDDDWECVQTLDGHENTVWSLAFDN 205 (312)
T ss_pred ecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccC--------CeEEEEeecCCCCeeEEEEecCccceEEEEEecC
Confidence 974 46788999999999999999999999999998 999999876 3478999999999888887773
Q ss_pred ccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 730 GISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 730 ~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.| ..++.++.|+++++|.+
T Consensus 206 ------------~G-----~rl~s~sdD~tv~Iw~~ 224 (312)
T KOG0645|consen 206 ------------IG-----SRLVSCSDDGTVSIWRL 224 (312)
T ss_pred ------------CC-----ceEEEecCCcceEeeee
Confidence 12 23556777999999975
No 59
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.79 E-value=5.2e-19 Score=203.71 Aligned_cols=225 Identities=16% Similarity=0.169 Sum_probs=184.5
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
...+-++.+++|++... +.+..++.+|...|..+++..+.. .+.+++.|+.+++ ||++
T Consensus 231 LS~gmD~~vklW~vy~~---~~~lrtf~gH~k~Vrd~~~s~~g~----~fLS~sfD~~lKl---------------wDtE 288 (503)
T KOG0282|consen 231 LSGGMDGLVKLWNVYDD---RRCLRTFKGHRKPVRDASFNNCGT----SFLSASFDRFLKL---------------WDTE 288 (503)
T ss_pred EecCCCceEEEEEEecC---cceehhhhcchhhhhhhhccccCC----eeeeeecceeeee---------------eccc
Confidence 34456678999999873 578999999999999988666665 8999999999998 5688
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+|++...+.- ...++|+.|+|+ +.+.+++|+.|+.|+.||+++++.++.+..|-+.|..+.|-++
T Consensus 289 TG~~~~~f~~-~~~~~cvkf~pd--------~~n~fl~G~sd~ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~~------ 353 (503)
T KOG0282|consen 289 TGQVLSRFHL-DKVPTCVKFHPD--------NQNIFLVGGSDKKIRQWDIRSGKVVQEYDRHLGAILDITFVDE------ 353 (503)
T ss_pred cceEEEEEec-CCCceeeecCCC--------CCcEEEEecCCCcEEEEeccchHHHHHHHhhhhheeeeEEccC------
Confidence 8988877642 235789999998 2589999999999999999999999999999999999999999
Q ss_pred CCEEEEEeCCCcEEEEECCCCcE----------------------------------------------EEEecCCC--C
Q 000473 639 SDCFLSVGEDFSVALASLETLRV----------------------------------------------ERMFPGHP--N 670 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~----------------------------------------------l~~~~gh~--~ 670 (1471)
+..|++.++|+++++|+.+..-. ...|.||. +
T Consensus 354 g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~vaG 433 (503)
T KOG0282|consen 354 GRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSVAG 433 (503)
T ss_pred CceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEEEecccccccCHhhhhcceeccC
Confidence 99999999999999999875311 11233553 3
Q ss_pred CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCcccccc
Q 000473 671 YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSL 750 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~ 750 (1471)
.-..+.|||||++|++|..| |.+++||.+|-+++..+.+|...++.+.+.|... .-
T Consensus 434 ys~~v~fSpDG~~l~SGdsd--------G~v~~wdwkt~kl~~~lkah~~~ci~v~wHP~e~-----Sk----------- 489 (503)
T KOG0282|consen 434 YSCQVDFSPDGRTLCSGDSD--------GKVNFWDWKTTKLVSKLKAHDQPCIGVDWHPVEP-----SK----------- 489 (503)
T ss_pred ceeeEEEcCCCCeEEeecCC--------ccEEEeechhhhhhhccccCCcceEEEEecCCCc-----ce-----------
Confidence 55778999999999999888 9999999999999999999988888888886322 12
Q ss_pred ceeeccCCceEeec
Q 000473 751 LLPIHEDGTFRQSQ 764 (1471)
Q Consensus 751 l~~~~~D~tir~w~ 764 (1471)
+.++.-||.|+.|+
T Consensus 490 vat~~w~G~Ikiwd 503 (503)
T KOG0282|consen 490 VATCGWDGLIKIWD 503 (503)
T ss_pred eEecccCceeEecC
Confidence 22333388899885
No 60
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.78 E-value=9e-20 Score=199.14 Aligned_cols=166 Identities=24% Similarity=0.380 Sum_probs=143.9
Q ss_pred cccCC--EEEEEEcCCcEEEEEecccccCCCCCCcc-ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE
Q 000473 522 FYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASL-KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS 598 (1471)
Q Consensus 522 ~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~-d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs 598 (1471)
.|+|+ ++++|+-||-|.||++. .++.. |.+ -+....|.-|.++|.|+.|+.| ...+++|+
T Consensus 220 ~FSPDgqyLvsgSvDGFiEVWny~-------~GKlrKDLk-YQAqd~fMMmd~aVlci~FSRD---------sEMlAsGs 282 (508)
T KOG0275|consen 220 RFSPDGQYLVSGSVDGFIEVWNYT-------TGKLRKDLK-YQAQDNFMMMDDAVLCISFSRD---------SEMLASGS 282 (508)
T ss_pred eeCCCCceEeeccccceeeeehhc-------cchhhhhhh-hhhhcceeecccceEEEeeccc---------HHHhhccC
Confidence 36776 89999999999995553 12111 111 0112235568899999999997 79999999
Q ss_pred CCCcEEEEECCCCceEEEEe-ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 599 MDCSIRIWDLGSGNLITVMH-HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 599 ~DgtI~lWDl~tg~~l~~~~-~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
.||.|++|.+.+|.|+++|. .|+..|+++.|+.| +..++|++-|.+|++--+++|+++..|.||.+.|+.+.|
T Consensus 283 qDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD------~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyvn~a~f 356 (508)
T KOG0275|consen 283 QDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRD------NSQILSASFDQTVRIHGLKSGKCLKEFRGHSSYVNEATF 356 (508)
T ss_pred cCCcEEEEEEecchHHHHhhhhhccCeeEEEEccC------cchhhcccccceEEEeccccchhHHHhcCccccccceEE
Confidence 99999999999999999998 89999999999999 788999999999999999999999999999999999999
Q ss_pred cCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 678 DCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 678 spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
.+||.++++++.| |+|+||+.+|++|+.++..-
T Consensus 357 t~dG~~iisaSsD--------gtvkvW~~KtteC~~Tfk~~ 389 (508)
T KOG0275|consen 357 TDDGHHIISASSD--------GTVKVWHGKTTECLSTFKPL 389 (508)
T ss_pred cCCCCeEEEecCC--------ccEEEecCcchhhhhhccCC
Confidence 9999999999999 99999999999999888743
No 61
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.78 E-value=1.1e-16 Score=172.39 Aligned_cols=144 Identities=20% Similarity=0.302 Sum_probs=123.7
Q ss_pred EcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC
Q 000473 532 FFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG 611 (1471)
Q Consensus 532 s~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg 611 (1471)
...|.|.|..|. +.++++.++.|.....|+.|+|+ |++|++|+.|-.+.+||+..-
T Consensus 166 ~GlG~v~ILsyp---------------sLkpv~si~AH~snCicI~f~p~---------GryfA~GsADAlvSLWD~~EL 221 (313)
T KOG1407|consen 166 NGLGCVEILSYP---------------SLKPVQSIKAHPSNCICIEFDPD---------GRYFATGSADALVSLWDVDEL 221 (313)
T ss_pred cCCceEEEEecc---------------ccccccccccCCcceEEEEECCC---------CceEeeccccceeeccChhHh
Confidence 345888886665 34678899999999999999998 999999999999999999988
Q ss_pred ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCC
Q 000473 612 NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDH 691 (1471)
Q Consensus 612 ~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~ 691 (1471)
-|++.|.-+.-+|..+.|+.+ |++||++|+|+.|-|=++++|..+..++ +.++...|+|+|...+|+-+|.|-
T Consensus 222 iC~R~isRldwpVRTlSFS~d------g~~lASaSEDh~IDIA~vetGd~~~eI~-~~~~t~tVAWHPk~~LLAyA~ddk 294 (313)
T KOG1407|consen 222 ICERCISRLDWPVRTLSFSHD------GRMLASASEDHFIDIAEVETGDRVWEIP-CEGPTFTVAWHPKRPLLAYACDDK 294 (313)
T ss_pred hhheeeccccCceEEEEeccC------cceeeccCccceEEeEecccCCeEEEee-ccCCceeEEecCCCceeeEEecCC
Confidence 899999999999999999999 9999999999999999999999999887 788899999999999999999982
Q ss_pred CC-CCCCCCEEEEEEC
Q 000473 692 SR-TSDAVDVLFIWDV 706 (1471)
Q Consensus 692 sg-~~D~~gtV~VWDi 706 (1471)
.+ ++...|+|+++-+
T Consensus 295 ~~d~~reag~vKiFG~ 310 (313)
T KOG1407|consen 295 DGDSNREAGTVKIFGL 310 (313)
T ss_pred CCccccccceeEEecC
Confidence 22 0111156666643
No 62
>PTZ00420 coronin; Provisional
Probab=99.77 E-value=2.1e-17 Score=205.64 Aligned_cols=142 Identities=20% Similarity=0.279 Sum_probs=125.1
Q ss_pred cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc--------eEEEEeccCCCEEEEEECC
Q 000473 560 HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN--------LITVMHHHVAPVRQIILSP 631 (1471)
Q Consensus 560 ~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~--------~l~~~~~H~~~V~~l~fsp 631 (1471)
..++..+.+|.+.|.+++|+|+ .+.+|+||+.|++|++||+.++. ++..+.+|...|.+++|+|
T Consensus 64 ~~~v~~L~gH~~~V~~lafsP~--------~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P 135 (568)
T PTZ00420 64 KPPVIKLKGHTSSILDLQFNPC--------FSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNP 135 (568)
T ss_pred CceEEEEcCCCCCEEEEEEcCC--------CCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECC
Confidence 3567889999999999999996 26899999999999999998642 3457889999999999999
Q ss_pred CCCCCCCCC-EEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 632 PQTEHPWSD-CFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 632 d~~~~~~~~-~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
+ +. .+++++.|++|+|||+++++.+..+. |...|.+++|+|+|.+|+++|.| ++|+|||+++++
T Consensus 136 ~------g~~iLaSgS~DgtIrIWDl~tg~~~~~i~-~~~~V~SlswspdG~lLat~s~D--------~~IrIwD~Rsg~ 200 (568)
T PTZ00420 136 M------NYYIMCSSGFDSFVNIWDIENEKRAFQIN-MPKKLSSLKWNIKGNLLSGTCVG--------KHMHIIDPRKQE 200 (568)
T ss_pred C------CCeEEEEEeCCCeEEEEECCCCcEEEEEe-cCCcEEEEEECCCCCEEEEEecC--------CEEEEEECCCCc
Confidence 8 55 56899999999999999998877775 66789999999999999999998 999999999999
Q ss_pred EEEEEeCCCCCcee
Q 000473 711 RERVLRGTASHSMF 724 (1471)
Q Consensus 711 ~~~~l~gH~~~v~~ 724 (1471)
.+.++.+|.+.+..
T Consensus 201 ~i~tl~gH~g~~~s 214 (568)
T PTZ00420 201 IASSFHIHDGGKNT 214 (568)
T ss_pred EEEEEecccCCcee
Confidence 99999999876543
No 63
>PTZ00421 coronin; Provisional
Probab=99.77 E-value=1.6e-17 Score=205.73 Aligned_cols=171 Identities=16% Similarity=0.232 Sum_probs=140.9
Q ss_pred EEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-------ceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 565 YFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-------NLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 565 ~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-------~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
.+.||++.|++++|+|. ++++|++|+.|++|++||+.++ +++..+.+|...|.+++|+|+.
T Consensus 70 ~l~GH~~~V~~v~fsP~--------d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~---- 137 (493)
T PTZ00421 70 ILLGQEGPIIDVAFNPF--------DPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSA---- 137 (493)
T ss_pred eEeCCCCCEEEEEEcCC--------CCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCC----
Confidence 47899999999999993 1789999999999999999765 3577899999999999999972
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
++.|++++.|++|+|||+++++.+..+.+|...|.+++|+|++.+|++++.| ++|+|||+++++.+..+.+
T Consensus 138 -~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~D--------g~IrIwD~rsg~~v~tl~~ 208 (493)
T PTZ00421 138 -MNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKD--------KKLNIIDPRDGTIVSSVEA 208 (493)
T ss_pred -CCEEEEEeCCCEEEEEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecCC--------CEEEEEECCCCcEEEEEec
Confidence 4699999999999999999999999999999999999999999999999999 9999999999999999999
Q ss_pred CCCCce-eeeeeeccccccccceEEcCCccccccceeeccCCceEeecccccc
Q 000473 718 TASHSM-FDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 718 H~~~v~-~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
|.+... ...|++. .+.++.... .-..|+++++|++++..
T Consensus 209 H~~~~~~~~~w~~~------~~~ivt~G~-------s~s~Dr~VklWDlr~~~ 248 (493)
T PTZ00421 209 HASAKSQRCLWAKR------KDLIITLGC-------SKSQQRQIMLWDTRKMA 248 (493)
T ss_pred CCCCcceEEEEcCC------CCeEEEEec-------CCCCCCeEEEEeCCCCC
Confidence 986433 2334421 112211111 11348899999986544
No 64
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.77 E-value=4.6e-17 Score=183.43 Aligned_cols=223 Identities=22% Similarity=0.259 Sum_probs=182.0
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
.+...++.+.+|+.... +....+..|...+.++.+.+... .++++..||.+.+ |+ ..
T Consensus 67 ~~~~~~~~i~i~~~~~~----~~~~~~~~~~~~i~~~~~~~~~~----~~~~~~~~~~i~~--~~-------------~~ 123 (289)
T cd00200 67 ASGSSDKTIRLWDLETG----ECVRTLTGHTSYVSSVAFSPDGR----ILSSSSRDKTIKV--WD-------------VE 123 (289)
T ss_pred EEEcCCCeEEEEEcCcc----cceEEEeccCCcEEEEEEcCCCC----EEEEecCCCeEEE--EE-------------CC
Confidence 33444677888888764 23445667778898888555533 5666767999999 33 33
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+++....+.+|.+.|.++.|+|+ +.++++++.|+.|++||+.+++.+..+..|...|.++.|+|+
T Consensus 124 ~~~~~~~~~~~~~~i~~~~~~~~---------~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~------ 188 (289)
T cd00200 124 TGKCLTTLRGHTDWVNSVAFSPD---------GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD------ 188 (289)
T ss_pred CcEEEEEeccCCCcEEEEEEcCc---------CCEEEEEcCCCcEEEEEccccccceeEecCccccceEEECCC------
Confidence 45667778899999999999996 789999988999999999999999999999999999999999
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
++.+++++.|+.|++||+++++.+..+..|...+.+++|+|++.++++++.| |.|++||+.+++....+.+|
T Consensus 189 ~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--------~~i~i~~~~~~~~~~~~~~~ 260 (289)
T cd00200 189 GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED--------GTIRVWDLRTGECVQTLSGH 260 (289)
T ss_pred cCEEEEecCCCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEEEcCC--------CcEEEEEcCCceeEEEcccc
Confidence 8889999999999999999999999998899999999999998888887767 99999999999999999999
Q ss_pred CCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 719 ASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 719 ~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
...+..+.+.+. ...++....|+.+++|+
T Consensus 261 ~~~i~~~~~~~~-----------------~~~l~~~~~d~~i~iw~ 289 (289)
T cd00200 261 TNSVTSLAWSPD-----------------GKRLASGSADGTIRIWD 289 (289)
T ss_pred CCcEEEEEECCC-----------------CCEEEEecCCCeEEecC
Confidence 887777766631 11234455688898884
No 65
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.77 E-value=4.2e-18 Score=182.48 Aligned_cols=190 Identities=18% Similarity=0.200 Sum_probs=156.7
Q ss_pred CEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 526 YAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 526 ~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
+.+++++.||++++ |++ .....+++.++.|...|.++.|.+. .++.++++|.|++|++
T Consensus 74 ~~~~~a~GDGSLrl--~d~------------~~~s~Pi~~~kEH~~EV~Svdwn~~--------~r~~~ltsSWD~TiKL 131 (311)
T KOG0277|consen 74 NQVIAASGDGSLRL--FDL------------TMPSKPIHKFKEHKREVYSVDWNTV--------RRRIFLTSSWDGTIKL 131 (311)
T ss_pred ceEEEEecCceEEE--ecc------------CCCCcchhHHHhhhhheEEeccccc--------cceeEEeeccCCceEe
Confidence 37889999999999 542 1123578899999999999999886 3778888899999999
Q ss_pred EECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC-CCCEE
Q 000473 606 WDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC-PRGYI 684 (1471)
Q Consensus 606 WDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp-dg~~L 684 (1471)
||...++-+.+|.+|...|.+.+|+|.. +++|+++|.|++.++||++.......++.|...+.++.|+. +...|
T Consensus 132 W~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~-----~nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~~Eil~cdw~ky~~~vl 206 (311)
T KOG0277|consen 132 WDPNRPNSVQTFNGHNSCIYQAAFSPHI-----PNLFASASGDGTLRLWDVRSPGKFMSIEAHNSEILCCDWSKYNHNVL 206 (311)
T ss_pred ecCCCCcceEeecCCccEEEEEecCCCC-----CCeEEEccCCceEEEEEecCCCceeEEEeccceeEeecccccCCcEE
Confidence 9999999999999999999999999985 88999999999999999987555556999999999999986 45677
Q ss_pred EEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEee
Q 000473 685 ACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQS 763 (1471)
Q Consensus 685 ~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w 763 (1471)
+||+.| +.||+||++. ..++.++.||.-.|-.+.|.|.-. +...+.+ =|-|+|+|
T Consensus 207 ~Tg~vd--------~~vr~wDir~~r~pl~eL~gh~~AVRkvk~Sph~~-----------~lLaSas-----YDmT~riw 262 (311)
T KOG0277|consen 207 ATGGVD--------NLVRGWDIRNLRTPLFELNGHGLAVRKVKFSPHHA-----------SLLASAS-----YDMTVRIW 262 (311)
T ss_pred EecCCC--------ceEEEEehhhccccceeecCCceEEEEEecCcchh-----------hHhhhcc-----ccceEEec
Confidence 888777 9999999987 457889999998777776665211 2222333 39999999
Q ss_pred ccc
Q 000473 764 QIQ 766 (1471)
Q Consensus 764 ~l~ 766 (1471)
+..
T Consensus 263 ~~~ 265 (311)
T KOG0277|consen 263 DPE 265 (311)
T ss_pred ccc
Confidence 863
No 66
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.77 E-value=4.6e-17 Score=176.19 Aligned_cols=205 Identities=17% Similarity=0.185 Sum_probs=165.3
Q ss_pred eecccccCccccccccCCCCCCCccccc-cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDF-VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~-~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
+++.+.+..+++|++..... -.+...+ .+|...|.+++..|... .|++|+.|.++.| |..-
T Consensus 30 lAscg~Dk~vriw~~~~~~s-~~ck~vld~~hkrsVRsvAwsp~g~----~La~aSFD~t~~I--w~k~----------- 91 (312)
T KOG0645|consen 30 LASCGTDKAVRIWSTSSGDS-WTCKTVLDDGHKRSVRSVAWSPHGR----YLASASFDATVVI--WKKE----------- 91 (312)
T ss_pred EEeecCCceEEEEecCCCCc-EEEEEeccccchheeeeeeecCCCc----EEEEeeccceEEE--eecC-----------
Confidence 56667788999999874211 1222222 57889999999665555 7999999999999 5411
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC---ceEEEEeccCCCEEEEEECCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG---NLITVMHHHVAPVRQIILSPPQ 633 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg---~~l~~~~~H~~~V~~l~fspd~ 633 (1471)
-...+++.+|.||.+.|.|++|+++ |.+|+++|.|++|-+|.+..+ ++...++.|+..|..+.|+|.
T Consensus 92 ~~efecv~~lEGHEnEVK~Vaws~s---------G~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt- 161 (312)
T KOG0645|consen 92 DGEFECVATLEGHENEVKCVAWSAS---------GNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPT- 161 (312)
T ss_pred CCceeEEeeeeccccceeEEEEcCC---------CCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCC-
Confidence 1234778999999999999999997 999999999999999999754 477899999999999999997
Q ss_pred CCCCCCCEEEEEeCCCcEEEEECC---CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 634 TEHPWSDCFLSVGEDFSVALASLE---TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 634 ~~~~~~~~l~S~s~DgsV~lWdl~---t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
..+|+|+|.|.+|++|.-. ...+++++.+|...|++++|+|.|..|++++.| ++|+||-..+.
T Consensus 162 -----~dlL~S~SYDnTIk~~~~~~dddW~c~~tl~g~~~TVW~~~F~~~G~rl~s~sdD--------~tv~Iw~~~~~- 227 (312)
T KOG0645|consen 162 -----EDLLFSCSYDNTIKVYRDEDDDDWECVQTLDGHENTVWSLAFDNIGSRLVSCSDD--------GTVSIWRLYTD- 227 (312)
T ss_pred -----cceeEEeccCCeEEEEeecCCCCeeEEEEecCccceEEEEEecCCCceEEEecCC--------cceEeeeeccC-
Confidence 7799999999999999876 457899999999999999999999999999998 99999986532
Q ss_pred EEEEEe-CCCCCceeeeee
Q 000473 711 RERVLR-GTASHSMFDHFC 728 (1471)
Q Consensus 711 ~~~~l~-gH~~~v~~~~~~ 728 (1471)
+. -|+..+..+.++
T Consensus 228 ----~~~~~sr~~Y~v~W~ 242 (312)
T KOG0645|consen 228 ----LSGMHSRALYDVPWD 242 (312)
T ss_pred ----cchhcccceEeeeec
Confidence 22 244455545444
No 67
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.76 E-value=5.6e-18 Score=181.51 Aligned_cols=209 Identities=17% Similarity=0.139 Sum_probs=166.5
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+...++++++||+..+ ...+..+++|...|.++.+..... ..+++++.||+|++ |+ .
T Consensus 76 ~~~a~GDGSLrl~d~~~~---s~Pi~~~kEH~~EV~Svdwn~~~r---~~~ltsSWD~TiKL--W~-------------~ 134 (311)
T KOG0277|consen 76 VIAASGDGSLRLFDLTMP---SKPIHKFKEHKREVYSVDWNTVRR---RIFLTSSWDGTIKL--WD-------------P 134 (311)
T ss_pred EEEEecCceEEEeccCCC---CcchhHHHhhhhheEEeccccccc---eeEEeeccCCceEe--ec-------------C
Confidence 445556778999995433 346778899999999987444333 26888899999999 43 3
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
..++.++++.||...|...+|+|.. +++++++|.|++.++||++.......+..|...|.++.|+.-+
T Consensus 135 ~r~~Sv~Tf~gh~~~Iy~a~~sp~~--------~nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~~Eil~cdw~ky~---- 202 (311)
T KOG0277|consen 135 NRPNSVQTFNGHNSCIYQAAFSPHI--------PNLFASASGDGTLRLWDVRSPGKFMSIEAHNSEILCCDWSKYN---- 202 (311)
T ss_pred CCCcceEeecCCccEEEEEecCCCC--------CCeEEEccCCceEEEEEecCCCceeEEEeccceeEeecccccC----
Confidence 3457789999999999999999973 8999999999999999998644444489999999999998763
Q ss_pred CCCEEEEEeCCCcEEEEECCCCc-EEEEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCCe-EEEE
Q 000473 638 WSDCFLSVGEDFSVALASLETLR-VERMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTGA-RERV 714 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~-~l~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg~-~~~~ 714 (1471)
.+.++||+.|+.|+.||++..+ ++..+.||.-.|.+|+|||... .|++++.| -+++|||.+.+. ++.+
T Consensus 203 -~~vl~Tg~vd~~vr~wDir~~r~pl~eL~gh~~AVRkvk~Sph~~~lLaSasYD--------mT~riw~~~~~ds~~e~ 273 (311)
T KOG0277|consen 203 -HNVLATGGVDNLVRGWDIRNLRTPLFELNGHGLAVRKVKFSPHHASLLASASYD--------MTVRIWDPERQDSAIET 273 (311)
T ss_pred -CcEEEecCCCceEEEEehhhccccceeecCCceEEEEEecCcchhhHhhhcccc--------ceEEecccccchhhhhh
Confidence 6799999999999999998765 5788899999999999999754 66777777 899999998554 5566
Q ss_pred EeCCCCCceeeeee
Q 000473 715 LRGTASHSMFDHFC 728 (1471)
Q Consensus 715 l~gH~~~v~~~~~~ 728 (1471)
..-|+.-+..+++.
T Consensus 274 ~~~HtEFv~g~Dws 287 (311)
T KOG0277|consen 274 VDHHTEFVCGLDWS 287 (311)
T ss_pred hhccceEEeccccc
Confidence 66777655555444
No 68
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.76 E-value=3.7e-17 Score=183.05 Aligned_cols=207 Identities=14% Similarity=0.145 Sum_probs=174.5
Q ss_pred ccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEec
Q 000473 501 GRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHR 580 (1471)
Q Consensus 501 ~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~sp 580 (1471)
...+|..|...|.++...|... .+++|+.|..-.+ |+..++.....+.||++.|+|+.|+.
T Consensus 56 S~~tF~~H~~svFavsl~P~~~----l~aTGGgDD~Afl---------------W~~~~ge~~~eltgHKDSVt~~~Fsh 116 (399)
T KOG0296|consen 56 SLVTFDKHTDSVFAVSLHPNNN----LVATGGGDDLAFL---------------WDISTGEFAGELTGHKDSVTCCSFSH 116 (399)
T ss_pred ceeehhhcCCceEEEEeCCCCc----eEEecCCCceEEE---------------EEccCCcceeEecCCCCceEEEEEcc
Confidence 3456899999999999776554 7999999998888 34556677888999999999999999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR 660 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~ 660 (1471)
+ +.+|+||+++|.|+||+..+|.....+..-...+..+.|+|. ++.|+.|+.||++-+|.+.++.
T Consensus 117 d---------gtlLATGdmsG~v~v~~~stg~~~~~~~~e~~dieWl~WHp~------a~illAG~~DGsvWmw~ip~~~ 181 (399)
T KOG0296|consen 117 D---------GTLLATGDMSGKVLVFKVSTGGEQWKLDQEVEDIEWLKWHPR------AHILLAGSTDGSVWMWQIPSQA 181 (399)
T ss_pred C---------ceEEEecCCCccEEEEEcccCceEEEeecccCceEEEEeccc------ccEEEeecCCCcEEEEECCCcc
Confidence 7 899999999999999999999999888877788999999997 9999999999999999999988
Q ss_pred EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceE
Q 000473 661 VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSV 740 (1471)
Q Consensus 661 ~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v 740 (1471)
..+.+.||..++++=.|.|+|+.++++..| |+|++||.+||+++..+.+.... +.+.+..+..+..+
T Consensus 182 ~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~d--------gti~~Wn~ktg~p~~~~~~~e~~-----~~~~~~~~~~~~~~ 248 (399)
T KOG0296|consen 182 LCKVMSGHNSPCTCGEFIPDGKRILTGYDD--------GTIIVWNPKTGQPLHKITQAEGL-----ELPCISLNLAGSTL 248 (399)
T ss_pred eeeEecCCCCCcccccccCCCceEEEEecC--------ceEEEEecCCCceeEEecccccC-----cCCcccccccccee
Confidence 889999999999999999999999999998 99999999999999998854421 12223344455556
Q ss_pred EcCCccccccceee
Q 000473 741 LNGNTSVSSLLLPI 754 (1471)
Q Consensus 741 ~~g~~~~s~~l~~~ 754 (1471)
..|+..+...++..
T Consensus 249 ~~g~~e~~~~~~~~ 262 (399)
T KOG0296|consen 249 TKGNSEGVACGVNN 262 (399)
T ss_pred EeccCCccEEEEcc
Confidence 66666665555443
No 69
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.76 E-value=2.7e-16 Score=208.22 Aligned_cols=192 Identities=20% Similarity=0.181 Sum_probs=158.7
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.++++..++++++||.... +....+.+|...|+++.+.+.. ++.+++|+.||.|++|+
T Consensus 547 ~las~~~Dg~v~lWd~~~~----~~~~~~~~H~~~V~~l~~~p~~---~~~L~Sgs~Dg~v~iWd--------------- 604 (793)
T PLN00181 547 QVASSNFEGVVQVWDVARS----QLVTEMKEHEKRVWSIDYSSAD---PTLLASGSDDGSVKLWS--------------- 604 (793)
T ss_pred EEEEEeCCCeEEEEECCCC----eEEEEecCCCCCEEEEEEcCCC---CCEEEEEcCCCEEEEEE---------------
Confidence 4677788999999998753 4556678999999999854321 12799999999999933
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc-eEEEEeccCCCEEEEEECCCCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN-LITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~-~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
+.++.....+..| ..|.++.|+++ ++.+|++|+.|++|++||+.+++ .+..+.+|...|.++.|. +
T Consensus 605 ~~~~~~~~~~~~~-~~v~~v~~~~~--------~g~~latgs~dg~I~iwD~~~~~~~~~~~~~h~~~V~~v~f~-~--- 671 (793)
T PLN00181 605 INQGVSIGTIKTK-ANICCVQFPSE--------SGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFV-D--- 671 (793)
T ss_pred CCCCcEEEEEecC-CCeEEEEEeCC--------CCCEEEEEeCCCeEEEEECCCCCccceEecCCCCCEEEEEEe-C---
Confidence 3445666677655 67999999764 28999999999999999998766 567888999999999996 4
Q ss_pred CCCCCEEEEEeCCCcEEEEECCC------CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 636 HPWSDCFLSVGEDFSVALASLET------LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t------~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
+..+++++.|++|++||++. .++++.+.+|...+..++|+|++.+|++|+.| ++|++|+...+
T Consensus 672 ---~~~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~D--------~~v~iw~~~~~ 740 (793)
T PLN00181 672 ---SSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSET--------NEVFVYHKAFP 740 (793)
T ss_pred ---CCEEEEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeCC--------CEEEEEECCCC
Confidence 67999999999999999974 36788999999999999999999999999999 99999998776
Q ss_pred eEEEE
Q 000473 710 ARERV 714 (1471)
Q Consensus 710 ~~~~~ 714 (1471)
..+..
T Consensus 741 ~~~~s 745 (793)
T PLN00181 741 MPVLS 745 (793)
T ss_pred CceEE
Confidence 55543
No 70
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.75 E-value=1.3e-17 Score=198.61 Aligned_cols=227 Identities=17% Similarity=0.207 Sum_probs=177.5
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEE-EEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSS-SMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts-~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
+...+.++++++|+-...... ....+.+|.+-|.. .+|.+... .+++.|+.|+.|.++.
T Consensus 28 i~s~sRd~t~~vw~~~~~~~l--~~~~~~~~~g~i~~~i~y~e~~~---~~l~~g~~D~~i~v~~--------------- 87 (745)
T KOG0301|consen 28 IISGSRDGTVKVWAKKGKQYL--ETHAFEGPKGFIANSICYAESDK---GRLVVGGMDTTIIVFK--------------- 87 (745)
T ss_pred EeecCCCCceeeeeccCcccc--cceecccCcceeeccceeccccC---cceEeecccceEEEEe---------------
Confidence 344556788999987554332 12335556666665 55443221 1699999999999833
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
.....+..+|+||...|.|+....+ +. ++|||+|.|+++|-. +++...+.+|..+|++++.-|+
T Consensus 88 ~~~~~P~~~LkgH~snVC~ls~~~~---------~~-~iSgSWD~TakvW~~--~~l~~~l~gH~asVWAv~~l~e---- 151 (745)
T KOG0301|consen 88 LSQAEPLYTLKGHKSNVCSLSIGED---------GT-LISGSWDSTAKVWRI--GELVYSLQGHTASVWAVASLPE---- 151 (745)
T ss_pred cCCCCchhhhhccccceeeeecCCc---------Cc-eEecccccceEEecc--hhhhcccCCcchheeeeeecCC----
Confidence 3345788999999999999987654 44 999999999999975 6777889999999999999997
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
+ .++|||.|++|++|.- ++++++|.||.+.|+.+++-++..+ ++++.| |.|+.|++ +|+++.++.
T Consensus 152 --~-~~vTgsaDKtIklWk~--~~~l~tf~gHtD~VRgL~vl~~~~f-lScsND--------g~Ir~w~~-~ge~l~~~~ 216 (745)
T KOG0301|consen 152 --N-TYVTGSADKTIKLWKG--GTLLKTFSGHTDCVRGLAVLDDSHF-LSCSND--------GSIRLWDL-DGEVLLEMH 216 (745)
T ss_pred --C-cEEeccCcceeeeccC--CchhhhhccchhheeeeEEecCCCe-EeecCC--------ceEEEEec-cCceeeeee
Confidence 4 8999999999999975 8899999999999999999987665 466777 99999999 899999999
Q ss_pred CCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccccc
Q 000473 717 GTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGV 772 (1471)
Q Consensus 717 gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~ 772 (1471)
||++.+..+... .....++++.+|+++|+|+.....+-+
T Consensus 217 ghtn~vYsis~~-----------------~~~~~Ivs~gEDrtlriW~~~e~~q~I 255 (745)
T KOG0301|consen 217 GHTNFVYSISMA-----------------LSDGLIVSTGEDRTLRIWKKDECVQVI 255 (745)
T ss_pred ccceEEEEEEec-----------------CCCCeEEEecCCceEEEeecCceEEEE
Confidence 999988776422 123456788899999999876444433
No 71
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.74 E-value=2.3e-18 Score=205.43 Aligned_cols=222 Identities=16% Similarity=0.239 Sum_probs=184.1
Q ss_pred CccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEE
Q 000473 485 DTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQ 564 (1471)
Q Consensus 485 ~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~ 564 (1471)
...++|++.. +..|...|.+........ .+++|+.|-.+-+ |. +....++.
T Consensus 14 t~Lr~~~~~~----------~~~hsaav~~lk~~~s~r----~~~~Gg~~~k~~L--~~-------------i~kp~~i~ 64 (825)
T KOG0267|consen 14 TKLRVWDTRE----------FVAHSAAVGCLKIRKSSR----SLVTGGEDEKVNL--WA-------------IGKPNAIT 64 (825)
T ss_pred eccccccchh----------hhhhhhhhceeeeeccce----eeccCCCceeecc--cc-------------ccCCchhh
Confidence 3445676543 567778888876655555 5888888776666 33 22223344
Q ss_pred EEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEE
Q 000473 565 YFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLS 644 (1471)
Q Consensus 565 ~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S 644 (1471)
.|.+|..+|.||.|++. +.+|+.|+.||+|++||+..++.++++.+|...+.+|.|+|- +.++++
T Consensus 65 S~~~hespIeSl~f~~~---------E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~------~~~~a~ 129 (825)
T KOG0267|consen 65 SLTGHESPIESLTFDTS---------ERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPY------GEFFAS 129 (825)
T ss_pred eeeccCCcceeeecCcc---------hhhhcccccCCceeeeehhhhhhhhhhhccccCcceeeeccc------eEEecc
Confidence 58999999999999986 789999999999999999999999999999999999999998 889999
Q ss_pred EeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCcee
Q 000473 645 VGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMF 724 (1471)
Q Consensus 645 ~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~ 724 (1471)
|+.|..+++||.+...|.+.+.+|...|..+.|+|+|+++++|++| .+++|||...|+....+.+|...+..
T Consensus 130 gStdtd~~iwD~Rk~Gc~~~~~s~~~vv~~l~lsP~Gr~v~~g~ed--------~tvki~d~~agk~~~ef~~~e~~v~s 201 (825)
T KOG0267|consen 130 GSTDTDLKIWDIRKKGCSHTYKSHTRVVDVLRLSPDGRWVASGGED--------NTVKIWDLTAGKLSKEFKSHEGKVQS 201 (825)
T ss_pred ccccccceehhhhccCceeeecCCcceeEEEeecCCCceeeccCCc--------ceeeeecccccccccccccccccccc
Confidence 9999999999999999999999999999999999999999999998 99999999999999999999999998
Q ss_pred eeeeeccccccccceEEcCCccccccceeeccCCceEeecccccccccccc
Q 000473 725 DHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGVAFS 775 (1471)
Q Consensus 725 ~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~~~~ 775 (1471)
..|.+.... +-+-+.|.++|+|+++.|+-..+..
T Consensus 202 le~hp~e~L-----------------la~Gs~d~tv~f~dletfe~I~s~~ 235 (825)
T KOG0267|consen 202 LEFHPLEVL-----------------LAPGSSDRTVRFWDLETFEVISSGK 235 (825)
T ss_pred cccCchhhh-----------------hccCCCCceeeeeccceeEEeeccC
Confidence 887753211 1123349999999999887655443
No 72
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=99.74 E-value=1.2e-15 Score=178.80 Aligned_cols=198 Identities=16% Similarity=0.211 Sum_probs=142.6
Q ss_pred ccccCccccccccCCCCC-CCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCC
Q 000473 481 TFCQDTVPRSEHVDSRQA-GDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNS 559 (1471)
Q Consensus 481 s~~~~~v~~Wd~~~~~~~-g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s 559 (1471)
....+.+-++++..+-.. ...+-.+ -+...|+...+ ..|.+.+|+.|++||.|++|... .++ ..-..
T Consensus 599 ~g~gG~iai~el~~PGrLPDgv~p~l-~Ngt~vtDl~W---dPFD~~rLAVa~ddg~i~lWr~~--a~g------l~e~~ 666 (1012)
T KOG1445|consen 599 AGSGGVIAIYELNEPGRLPDGVMPGL-FNGTLVTDLHW---DPFDDERLAVATDDGQINLWRLT--ANG------LPENE 666 (1012)
T ss_pred cCCCceEEEEEcCCCCCCCccccccc-ccCceeeeccc---CCCChHHeeecccCceEEEEEec--cCC------CCccc
Confidence 334566777776553211 0111111 12234444331 12444599999999999995543 221 11122
Q ss_pred cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCC
Q 000473 560 HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 560 ~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
..+...+.+|...|+++.|||- -.++|++++.|.+|++||+.+++....|.+|++.|..++|+|+ |
T Consensus 667 ~tPe~~lt~h~eKI~slRfHPL--------AadvLa~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpd------G 732 (1012)
T KOG1445|consen 667 MTPEKILTIHGEKITSLRFHPL--------AADVLAVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPD------G 732 (1012)
T ss_pred CCcceeeecccceEEEEEecch--------hhhHhhhhhccceeeeeehhhhhhhheeccCcCceeEEEECCC------C
Confidence 3567789999999999999996 2689999999999999999999999999999999999999999 9
Q ss_pred CEEEEEeCCCcEEEEECCCCcE-EEEecCCCC-CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 640 DCFLSVGEDFSVALASLETLRV-ERMFPGHPN-YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~~~-l~~~~gh~~-~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+.+++++.||+|++|..++++. ++.-.|..+ .--.|.|--||+++++.+.| ..+...|.+||.++
T Consensus 733 r~~AtVcKDg~~rVy~Prs~e~pv~Eg~gpvgtRgARi~wacdgr~viv~Gfd----k~SeRQv~~Y~Aq~ 799 (1012)
T KOG1445|consen 733 RRIATVCKDGTLRVYEPRSREQPVYEGKGPVGTRGARILWACDGRIVIVVGFD----KSSERQVQMYDAQT 799 (1012)
T ss_pred cceeeeecCceEEEeCCCCCCCccccCCCCccCcceeEEEEecCcEEEEeccc----ccchhhhhhhhhhh
Confidence 9999999999999999987653 443333222 33578899999999999988 22236799999875
No 73
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.74 E-value=6.1e-17 Score=182.86 Aligned_cols=243 Identities=19% Similarity=0.203 Sum_probs=179.5
Q ss_pred EeeccccccccCCCCcccceeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcE
Q 000473 458 VDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEI 537 (1471)
Q Consensus 458 ~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I 537 (1471)
.+|+..+ .++|. ++++.+.+.++-+|++....+ -.+..++.+|...|..+++.|+.. ++++++.|..+
T Consensus 226 EVWfl~F----S~nGk---yLAsaSkD~Taiiw~v~~d~~-~kl~~tlvgh~~~V~yi~wSPDdr----yLlaCg~~e~~ 293 (519)
T KOG0293|consen 226 EVWFLQF----SHNGK---YLASASKDSTAIIWIVVYDVH-FKLKKTLVGHSQPVSYIMWSPDDR----YLLACGFDEVL 293 (519)
T ss_pred cEEEEEE----cCCCe---eEeeccCCceEEEEEEecCcc-eeeeeeeecccCceEEEEECCCCC----eEEecCchHhe
Confidence 4676665 55673 578888899999999988766 466788899999999999555444 45544445555
Q ss_pred EEEEecccccCCCCCCccccCCcceEEEE-ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEE
Q 000473 538 EVIQFDLFERHNSPGASLKVNSHVSRQYF-LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITV 616 (1471)
Q Consensus 538 ~V~~~~~l~~~d~~~~~~d~~s~~~~~~l-~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~ 616 (1471)
.+ ||+.+|.....+ .+|...+.+++|.|| +..+++|+.|+++..||++ |+.+..
T Consensus 294 ~l---------------wDv~tgd~~~~y~~~~~~S~~sc~W~pD---------g~~~V~Gs~dr~i~~wdlD-gn~~~~ 348 (519)
T KOG0293|consen 294 SL---------------WDVDTGDLRHLYPSGLGFSVSSCAWCPD---------GFRFVTGSPDRTIIMWDLD-GNILGN 348 (519)
T ss_pred ee---------------ccCCcchhhhhcccCcCCCcceeEEccC---------CceeEecCCCCcEEEecCC-cchhhc
Confidence 55 556777665554 345689999999998 8999999999999999986 222211
Q ss_pred Eec------------------------------------------cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEE
Q 000473 617 MHH------------------------------------------HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALA 654 (1471)
Q Consensus 617 ~~~------------------------------------------H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lW 654 (1471)
+++ -..+|+++..+.+ +++++.-=.+..+.+|
T Consensus 349 W~gvr~~~v~dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~lise~~~its~~iS~d------~k~~LvnL~~qei~LW 422 (519)
T KOG0293|consen 349 WEGVRDPKVHDLAITYDGKYVLLVTVDKKIRLYNREARVDRGLISEEQPITSFSISKD------GKLALVNLQDQEIHLW 422 (519)
T ss_pred ccccccceeEEEEEcCCCcEEEEEecccceeeechhhhhhhccccccCceeEEEEcCC------CcEEEEEcccCeeEEe
Confidence 111 1235666666666 7777777778888999
Q ss_pred ECCCCcEEEEecCCCC--CcEEEEEcC-CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeecc
Q 000473 655 SLETLRVERMFPGHPN--YPAKVVWDC-PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGI 731 (1471)
Q Consensus 655 dl~t~~~l~~~~gh~~--~V~~v~~sp-dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~ 731 (1471)
|++..+.++.+.||.. .+-.-+|-. +..++++|++| +.||||+..+|.++.++.||...|.++.+.|..
T Consensus 423 Dl~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED--------~kvyIWhr~sgkll~~LsGHs~~vNcVswNP~~ 494 (519)
T KOG0293|consen 423 DLEENKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSED--------SKVYIWHRISGKLLAVLSGHSKTVNCVSWNPAD 494 (519)
T ss_pred ecchhhHHHHhhcccccceEEEeccCCCCcceEEecCCC--------ceEEEEEccCCceeEeecCCcceeeEEecCCCC
Confidence 9998888899999976 455567754 44799999998 999999999999999999999888888766532
Q ss_pred ccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 732 SMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 732 ~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
. ..+-+.+.|+|||+|....
T Consensus 495 p----------------~m~ASasDDgtIRIWg~~~ 514 (519)
T KOG0293|consen 495 P----------------EMFASASDDGTIRIWGPSD 514 (519)
T ss_pred H----------------HHhhccCCCCeEEEecCCc
Confidence 2 1122334599999998643
No 74
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.73 E-value=7e-17 Score=193.76 Aligned_cols=229 Identities=17% Similarity=0.163 Sum_probs=186.5
Q ss_pred eeecccccCccccccccCCCCCCCccccccc-cCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVH-KEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~-h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
.+++...++.+++||....+. ...+.+ |...|.+++ +....+.+|+.||.|.+++...
T Consensus 231 ~LavG~~~g~v~iwD~~~~k~----~~~~~~~h~~rvg~la------W~~~~lssGsr~~~I~~~dvR~----------- 289 (484)
T KOG0305|consen 231 HLAVGTSDGTVQIWDVKEQKK----TRTLRGSHASRVGSLA------WNSSVLSSGSRDGKILNHDVRI----------- 289 (484)
T ss_pred EEEEeecCCeEEEEehhhccc----cccccCCcCceeEEEe------ccCceEEEecCCCcEEEEEEec-----------
Confidence 477788889999999877644 334455 889999987 5556899999999999944331
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
. .....++.+|...|..+.|++| +.+++||+.|+.+.|||....+++..+..|.+.|.+++|+|-.
T Consensus 290 --~-~~~~~~~~~H~qeVCgLkws~d---------~~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q-- 355 (484)
T KOG0305|consen 290 --S-QHVVSTLQGHRQEVCGLKWSPD---------GNQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQ-- 355 (484)
T ss_pred --c-hhhhhhhhcccceeeeeEECCC---------CCeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCc--
Confidence 0 0112248899999999999998 8999999999999999998889999999999999999999974
Q ss_pred CCCCCEEEEEe--CCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 636 HPWSDCFLSVG--EDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 636 ~~~~~~l~S~s--~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
...||+|+ .|+.|++||..+++.+..... ...|..+.|++..+-|+++. |+++ +.|.||+..+..++.
T Consensus 356 ---~~lLAsGGGs~D~~i~fwn~~~g~~i~~vdt-gsQVcsL~Wsk~~kEi~sth----G~s~--n~i~lw~~ps~~~~~ 425 (484)
T KOG0305|consen 356 ---SGLLATGGGSADRCIKFWNTNTGARIDSVDT-GSQVCSLIWSKKYKELLSTH----GYSE--NQITLWKYPSMKLVA 425 (484)
T ss_pred ---cCceEEcCCCcccEEEEEEcCCCcEeccccc-CCceeeEEEcCCCCEEEEec----CCCC--CcEEEEeccccceee
Confidence 66899875 699999999999999888754 45699999999998887753 4455 799999999999999
Q ss_pred EEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 714 VLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 714 ~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
.+.||..+|+.....| +|.+. +..+.|.|+|.|++..
T Consensus 426 ~l~gH~~RVl~la~SP------------dg~~i-----~t~a~DETlrfw~~f~ 462 (484)
T KOG0305|consen 426 ELLGHTSRVLYLALSP------------DGETI-----VTGAADETLRFWNLFD 462 (484)
T ss_pred eecCCcceeEEEEECC------------CCCEE-----EEecccCcEEeccccC
Confidence 9999999988875553 23333 3444599999999854
No 75
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.73 E-value=6.8e-17 Score=197.85 Aligned_cols=214 Identities=19% Similarity=0.126 Sum_probs=165.0
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc-----cccCC--------------------C-----------
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL-----FERHN--------------------S----------- 550 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~-----l~~~d--------------------~----------- 550 (1471)
.|.+.|.++.+..+.. +||+|++||.|+||.... +..++ .
T Consensus 265 ah~gaIw~mKFS~DGK----yLAsaGeD~virVWkVie~e~~~~~~~~~~~~~~~~~~~s~~~p~~s~~~~~~~~~s~~~ 340 (712)
T KOG0283|consen 265 AHKGAIWAMKFSHDGK----YLASAGEDGVIRVWKVIESERMRVAEGDSSCMYFEYNANSQIEPSTSSEEKISSRTSSSR 340 (712)
T ss_pred ccCCcEEEEEeCCCCc----eeeecCCCceEEEEEEeccchhcccccccchhhhhhhhccccCccccccccccccccccc
Confidence 8999999999666665 799999999999964321 00000 0
Q ss_pred -----CC-Cccc---cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccC
Q 000473 551 -----PG-ASLK---VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHV 621 (1471)
Q Consensus 551 -----~~-~~~d---~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~ 621 (1471)
+. ...+ .-..++.+.|.||++.|..|.|+- +++|+|+|+|.|||||++...+|+.+| .|.
T Consensus 341 ~~~~s~~~~~p~~~f~f~ekP~~ef~GHt~DILDlSWSK----------n~fLLSSSMDKTVRLWh~~~~~CL~~F-~Hn 409 (712)
T KOG0283|consen 341 KGSQSPCVLLPLKAFVFSEKPFCEFKGHTADILDLSWSK----------NNFLLSSSMDKTVRLWHPGRKECLKVF-SHN 409 (712)
T ss_pred cccCCccccCCCccccccccchhhhhccchhheeccccc----------CCeeEeccccccEEeecCCCcceeeEE-ecC
Confidence 00 0000 012246677899999999999996 699999999999999999999999999 699
Q ss_pred CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEE
Q 000473 622 APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVL 701 (1471)
Q Consensus 622 ~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV 701 (1471)
..|+||+|+|.. .++|+||+-|+.||||++...+.+.-...+ .-|++++|.|||++.++|+.+ |.+
T Consensus 410 dfVTcVaFnPvD-----DryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~-~lITAvcy~PdGk~avIGt~~--------G~C 475 (712)
T KOG0283|consen 410 DFVTCVAFNPVD-----DRYFISGSLDGKVRLWSISDKKVVDWNDLR-DLITAVCYSPDGKGAVIGTFN--------GYC 475 (712)
T ss_pred CeeEEEEecccC-----CCcEeecccccceEEeecCcCeeEeehhhh-hhheeEEeccCCceEEEEEec--------cEE
Confidence 999999999974 789999999999999999988877665544 679999999999999999999 999
Q ss_pred EEEECCCCeEEEEE--eCC------CCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 702 FIWDVKTGARERVL--RGT------ASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 702 ~VWDi~tg~~~~~l--~gH------~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
++|+.+.-++.... .-| ..+|+..+|++... . .++-.+.|.+||++++
T Consensus 476 ~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~---------------~-~vLVTSnDSrIRI~d~ 531 (712)
T KOG0283|consen 476 RFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDP---------------D-EVLVTSNDSRIRIYDG 531 (712)
T ss_pred EEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCCC---------------C-eEEEecCCCceEEEec
Confidence 99999876655333 222 12677777774211 1 3344556999999998
No 76
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.73 E-value=1.4e-17 Score=200.33 Aligned_cols=231 Identities=16% Similarity=0.168 Sum_probs=181.8
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+++.+..+.+.+||+...-. ....+.|..|...++++.+.+ +.|+.|++|+.||.|++ || .
T Consensus 103 IAT~s~nG~i~vWdlnk~~r-nk~l~~f~EH~Rs~~~ldfh~---tep~iliSGSQDg~vK~--~D-------------l 163 (839)
T KOG0269|consen 103 IATCSTNGVISVWDLNKSIR-NKLLTVFNEHERSANKLDFHS---TEPNILISGSQDGTVKC--WD-------------L 163 (839)
T ss_pred heeecCCCcEEEEecCcccc-chhhhHhhhhccceeeeeecc---CCccEEEecCCCceEEE--Ee-------------e
Confidence 66777888999999877421 245567899999999988655 55679999999999999 33 3
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-ceEEEEeccCCCEEEEEECCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-NLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
.....+.++.+....|..+.|+|. .+..++++...|.+++||++.. ++...|.+|.|+|.++.|+|+
T Consensus 164 R~~~S~~t~~~nSESiRDV~fsp~--------~~~~F~s~~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPn---- 231 (839)
T KOG0269|consen 164 RSKKSKSTFRSNSESIRDVKFSPG--------YGNKFASIHDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPN---- 231 (839)
T ss_pred ecccccccccccchhhhceeeccC--------CCceEEEecCCceEEEeeccCchhHHHHhhcccCceEEEeecCC----
Confidence 445667888889999999999996 3789999999999999999865 456788999999999999998
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecC-CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEE
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPG-HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERV 714 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g-h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~ 714 (1471)
+.+|||||.|++|+|||..+.+.-....- -..+|.+|+|.|+..+.+..|.- -.|..|+|||++.. -+-++
T Consensus 232 --r~~lATGGRDK~vkiWd~t~~~~~~~~tInTiapv~rVkWRP~~~~hLAtcsm-----v~dtsV~VWDvrRPYIP~~t 304 (839)
T KOG0269|consen 232 --REWLATGGRDKMVKIWDMTDSRAKPKHTINTIAPVGRVKWRPARSYHLATCSM-----VVDTSVHVWDVRRPYIPYAT 304 (839)
T ss_pred --CceeeecCCCccEEEEeccCCCccceeEEeecceeeeeeeccCccchhhhhhc-----cccceEEEEeecccccccee
Confidence 89999999999999999987654332222 34579999999998877655543 12378999999764 35688
Q ss_pred EeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEe
Q 000473 715 LRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQ 762 (1471)
Q Consensus 715 l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~ 762 (1471)
+..|+..+..+.+- ......++.++.|+++.+
T Consensus 305 ~~eH~~~vt~i~W~----------------~~d~~~l~s~sKD~tv~q 336 (839)
T KOG0269|consen 305 FLEHTDSVTGIAWD----------------SGDRINLWSCSKDGTVLQ 336 (839)
T ss_pred eeccCccccceecc----------------CCCceeeEeecCccHHHH
Confidence 88999777776444 233456778888988765
No 77
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.73 E-value=7.3e-17 Score=192.44 Aligned_cols=215 Identities=20% Similarity=0.256 Sum_probs=172.0
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.....+.++-+|...+..| .+++.+|...|.|+....+. .+++|+.|.+.+||.-
T Consensus 74 l~~g~~D~~i~v~~~~~~~P----~~~LkgH~snVC~ls~~~~~-----~~iSgSWD~TakvW~~--------------- 129 (745)
T KOG0301|consen 74 LVVGGMDTTIIVFKLSQAEP----LYTLKGHKSNVCSLSIGEDG-----TLISGSWDSTAKVWRI--------------- 129 (745)
T ss_pred eEeecccceEEEEecCCCCc----hhhhhccccceeeeecCCcC-----ceEecccccceEEecc---------------
Confidence 33444555666777777654 56789999999997644444 4899999999999432
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
+++...+.||+..|++++.-|+ +.++|||.|.+|++|.- ++++.+|.+|+.-|+.+++-|+
T Consensus 130 --~~l~~~l~gH~asVWAv~~l~e----------~~~vTgsaDKtIklWk~--~~~l~tf~gHtD~VRgL~vl~~----- 190 (745)
T KOG0301|consen 130 --GELVYSLQGHTASVWAVASLPE----------NTYVTGSADKTIKLWKG--GTLLKTFSGHTDCVRGLAVLDD----- 190 (745)
T ss_pred --hhhhcccCCcchheeeeeecCC----------CcEEeccCcceeeeccC--CchhhhhccchhheeeeEEecC-----
Confidence 2556669999999999999885 58999999999999985 7889999999999999999986
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
..|+|++.||.|++|++ +|+++..+.||...|.++...++++.++++++| ++++||+.. ++.++++-
T Consensus 191 --~~flScsNDg~Ir~w~~-~ge~l~~~~ghtn~vYsis~~~~~~~Ivs~gED--------rtlriW~~~--e~~q~I~l 257 (745)
T KOG0301|consen 191 --SHFLSCSNDGSIRLWDL-DGEVLLEMHGHTNFVYSISMALSDGLIVSTGED--------RTLRIWKKD--ECVQVITL 257 (745)
T ss_pred --CCeEeecCCceEEEEec-cCceeeeeeccceEEEEEEecCCCCeEEEecCC--------ceEEEeecC--ceEEEEec
Confidence 47899999999999999 799999999999999999988888999999999 999999976 88888876
Q ss_pred CCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 718 TASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 718 H~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
..-.+..+ -| ..+|.+..|. +||.+|+|...
T Consensus 258 PttsiWsa-~~------L~NgDIvvg~-----------SDG~VrVfT~~ 288 (745)
T KOG0301|consen 258 PTTSIWSA-KV------LLNGDIVVGG-----------SDGRVRVFTVD 288 (745)
T ss_pred CccceEEE-EE------eeCCCEEEec-----------cCceEEEEEec
Confidence 55445554 22 1234444443 48888887653
No 78
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.73 E-value=2.8e-16 Score=196.59 Aligned_cols=239 Identities=15% Similarity=0.137 Sum_probs=186.3
Q ss_pred cccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccC----CCCCCccccCCcceEEEEecCCccEEEEE
Q 000473 502 RDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERH----NSPGASLKVNSHVSRQYFLGHTGAVLCLA 577 (1471)
Q Consensus 502 ~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~----d~~~~~~d~~s~~~~~~l~gH~~~V~~la 577 (1471)
......|.+.|+|+.+.++.. ++++|++|..|.||.... .+. -..+...+++.-+.+..|.||.+.|..+.
T Consensus 62 l~~m~~h~~sv~CVR~S~dG~----~lAsGSDD~~v~iW~~~~-~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~ 136 (942)
T KOG0973|consen 62 LCTMDDHDGSVNCVRFSPDGS----YLASGSDDRLVMIWERAE-IGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVN 136 (942)
T ss_pred heeeccccCceeEEEECCCCC----eEeeccCcceEEEeeecc-cCCcccccccccccccceeeEEEEEecCCCccceec
Confidence 345578999999988444444 899999999999976653 111 11234456677788999999999999999
Q ss_pred EecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC
Q 000473 578 AHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE 657 (1471)
Q Consensus 578 ~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~ 657 (1471)
|+|+ +.+|+|+|.|++|.+||..+.+++.++.+|.+.|..+.|.|- |++|||-+.|++|++|.+.
T Consensus 137 Wsp~---------~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~------Gky~ASqsdDrtikvwrt~ 201 (942)
T KOG0973|consen 137 WSPD---------DSLLVSVSLDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPI------GKYFASQSDDRTLKVWRTS 201 (942)
T ss_pred cCCC---------ccEEEEecccceEEEEccccceeeeeeecccccccceEECCc------cCeeeeecCCceEEEEEcc
Confidence 9998 899999999999999999999999999999999999999998 9999999999999999988
Q ss_pred CCcEEEEecCCCC------CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeecc
Q 000473 658 TLRVERMFPGHPN------YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGI 731 (1471)
Q Consensus 658 t~~~l~~~~gh~~------~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~ 731 (1471)
+..+.+.+.++-. ....+.|||||.||++.-.- .+. ..++.|.+-.+.+....+-||.+.+..+.|.|.+
T Consensus 202 dw~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~-n~~---~~~~~IieR~tWk~~~~LvGH~~p~evvrFnP~l 277 (942)
T KOG0973|consen 202 DWGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAV-NGG---KSTIAIIERGTWKVDKDLVGHSAPVEVVRFNPKL 277 (942)
T ss_pred cceeeEeeccchhhCCCcceeeecccCCCcCeecchhhc-cCC---cceeEEEecCCceeeeeeecCCCceEEEEeChHH
Confidence 8777777765433 56899999999999987552 322 2589999999999999999999999999999854
Q ss_pred ccc-cccceEEcCCccccccceeeccCCceEeecc
Q 000473 732 SMN-SISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 732 ~~~-~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
-.. .-.|...-.+ ---.-+-.-++|+++.+|.-
T Consensus 278 fe~~~~ng~~~~~~-~~y~i~AvgSqDrSlSVW~T 311 (942)
T KOG0973|consen 278 FERNNKNGTSTQPN-CYYCIAAVGSQDRSLSVWNT 311 (942)
T ss_pred hccccccCCccCCC-cceEEEEEecCCccEEEEec
Confidence 221 1122211110 00012223456999999973
No 79
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.73 E-value=1.3e-16 Score=188.90 Aligned_cols=241 Identities=17% Similarity=0.194 Sum_probs=186.1
Q ss_pred cCeeEEEEccccCCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCccccccccCCCCCCCcccccccc
Q 000473 429 RPYISVWSLSQKHSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHK 508 (1471)
Q Consensus 429 ~P~v~vwsl~~~~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h 508 (1471)
+..++.|+.....+.. .+ ..+..=....+|+...++.- +|. .+...+++.+|++|+..... .-+..++..|
T Consensus 46 Dg~i~~W~~~~d~~~~-s~-~~~asme~HsDWVNDiiL~~--~~~---tlIS~SsDtTVK~W~~~~~~--~~c~stir~H 116 (735)
T KOG0308|consen 46 DGIIRLWSVTQDSNEP-ST-PYIASMEHHSDWVNDIILCG--NGK---TLISASSDTTVKVWNAHKDN--TFCMSTIRTH 116 (735)
T ss_pred CceEEEeccccccCCc-cc-chhhhhhhhHhHHhhHHhhc--CCC---ceEEecCCceEEEeecccCc--chhHhhhhcc
Confidence 3458899743322110 01 12222223478987764422 331 36778889999999986643 2577888999
Q ss_pred CccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEe-cCCccEEEEEEecCCCCccc
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFL-GHTGAVLCLAAHRMVGTAKG 587 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~-gH~~~V~~la~spd~~~~~~ 587 (1471)
...|.|++++....+ .+|+|+-|+.|.+|+.+..... ..+. .+ ......+. ||...|++++..+.
T Consensus 117 ~DYVkcla~~ak~~~---lvaSgGLD~~IflWDin~~~~~-l~~s-~n---~~t~~sl~sG~k~siYSLA~N~t------ 182 (735)
T KOG0308|consen 117 KDYVKCLAYIAKNNE---LVASGGLDRKIFLWDINTGTAT-LVAS-FN---NVTVNSLGSGPKDSIYSLAMNQT------ 182 (735)
T ss_pred cchheeeeecccCce---eEEecCCCccEEEEEccCcchh-hhhh-cc---ccccccCCCCCccceeeeecCCc------
Confidence 999999998544432 8999999999999666522110 0000 11 12233343 99999999999886
Q ss_pred CcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC
Q 000473 588 WSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG 667 (1471)
Q Consensus 588 ~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g 667 (1471)
+..+++|+..+.+++||.++++.+..+.+|+.-|..+..++| |..++|+|.|++|+|||+...+|+.++..
T Consensus 183 ---~t~ivsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~~dD------Gt~~ls~sSDgtIrlWdLgqQrCl~T~~v 253 (735)
T KOG0308|consen 183 ---GTIIVSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDD------GTRLLSASSDGTIRLWDLGQQRCLATYIV 253 (735)
T ss_pred ---ceEEEecCcccceEEeccccccceeeeeccccceEEEEEcCC------CCeEeecCCCceEEeeeccccceeeeEEe
Confidence 789999999999999999999999999999999999999999 99999999999999999999999999999
Q ss_pred CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 668 HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 668 h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
|...|+++..+|+=.++.+|+.| |.|+.=|+++.
T Consensus 254 H~e~VWaL~~~~sf~~vYsG~rd--------~~i~~Tdl~n~ 287 (735)
T KOG0308|consen 254 HKEGVWALQSSPSFTHVYSGGRD--------GNIYRTDLRNP 287 (735)
T ss_pred ccCceEEEeeCCCcceEEecCCC--------CcEEecccCCc
Confidence 99999999999999999999998 99999999985
No 80
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.71 E-value=2e-16 Score=182.13 Aligned_cols=240 Identities=17% Similarity=0.127 Sum_probs=176.4
Q ss_pred ecccccCccccccccCCCCCC------CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAG------DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPG 552 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g------~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~ 552 (1471)
++.+..+.+.+||........ .....+.+|.+.=..+.... +.+-.+++|+.|+.|.+|+.+ ...
T Consensus 141 At~t~~~dv~Vfd~tk~~s~~~~~~~~~Pdl~L~gH~~eg~glsWn~---~~~g~Lls~~~d~~i~lwdi~--~~~---- 211 (422)
T KOG0264|consen 141 ATKTSSGDVYVFDYTKHPSKPKASGECRPDLRLKGHEKEGYGLSWNR---QQEGTLLSGSDDHTICLWDIN--AES---- 211 (422)
T ss_pred EecCCCCCEEEEEeccCCCcccccccCCCceEEEeeccccccccccc---ccceeEeeccCCCcEEEEecc--ccc----
Confidence 344455667777765533221 12235677777544433111 222389999999999995443 111
Q ss_pred CccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC--CCceEEEEeccCCCEEEEEEC
Q 000473 553 ASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG--SGNLITVMHHHVAPVRQIILS 630 (1471)
Q Consensus 553 ~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~--tg~~l~~~~~H~~~V~~l~fs 630 (1471)
-+.....+...+.+|...|..++|++.+ ..++++.+.|+.+.|||++ +.++.+...+|.++|.+++|+
T Consensus 212 --~~~~~~~p~~~~~~h~~~VeDV~~h~~h--------~~lF~sv~dd~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fn 281 (422)
T KOG0264|consen 212 --KEDKVVDPKTIFSGHEDVVEDVAWHPLH--------EDLFGSVGDDGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFN 281 (422)
T ss_pred --cCCccccceEEeecCCcceehhhccccc--------hhhheeecCCCeEEEEEcCCCCCCCcccccccCCceeEEEeC
Confidence 0111234567889999999999999974 7899999999999999999 566678888999999999999
Q ss_pred CCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 631 PPQTEHPWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 631 pd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
|-. +..|||||.|++|+|||+++. ++++.+.+|...|.+|.|+|+.. .|++++.| +.+.|||+..
T Consensus 282 p~~-----~~ilAT~S~D~tV~LwDlRnL~~~lh~~e~H~dev~~V~WSPh~etvLASSg~D--------~rl~vWDls~ 348 (422)
T KOG0264|consen 282 PFN-----EFILATGSADKTVALWDLRNLNKPLHTFEGHEDEVFQVEWSPHNETVLASSGTD--------RRLNVWDLSR 348 (422)
T ss_pred CCC-----CceEEeccCCCcEEEeechhcccCceeccCCCcceEEEEeCCCCCceeEecccC--------CcEEEEeccc
Confidence 985 779999999999999999985 46899999999999999999865 66777777 9999999864
Q ss_pred C--------------eEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 709 G--------------ARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 709 g--------------~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
- +++-.-.||++.|.-..++|... =.+.++++|+.+.+|+..
T Consensus 349 ig~eq~~eda~dgppEllF~HgGH~~kV~DfsWnp~eP----------------W~I~SvaeDN~LqIW~~s 404 (422)
T KOG0264|consen 349 IGEEQSPEDAEDGPPELLFIHGGHTAKVSDFSWNPNEP----------------WTIASVAEDNILQIWQMA 404 (422)
T ss_pred cccccChhhhccCCcceeEEecCcccccccccCCCCCC----------------eEEEEecCCceEEEeecc
Confidence 1 12355568988877776774221 124466779999999863
No 81
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.71 E-value=1.3e-16 Score=180.94 Aligned_cols=125 Identities=15% Similarity=0.179 Sum_probs=105.2
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc----CCCEEEEEECCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH----VAPVRQIILSPP 632 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H----~~~V~~l~fspd 632 (1471)
..+..+.+....|. .|++|..+++ +..+.+.+.|.++.+.|+++.+..+.|..- ...++.+.|+|+
T Consensus 329 ~Rs~~~~~sv~~gg-~vtSl~ls~~---------g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfSpd 398 (459)
T KOG0288|consen 329 IRSADKTRSVPLGG-RVTSLDLSMD---------GLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFSPD 398 (459)
T ss_pred ccCCceeeEeecCc-ceeeEeeccC---------CeEEeeecCCCceeeeecccccEEEEeeccccccccccceeEECCC
Confidence 33444555555554 8999999987 788888899999999999999988887642 234889999999
Q ss_pred CCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCC--cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 633 QTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNY--PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~--V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+.++++||.||+|.||++.++++...+...... |+++.|+|.|.+|++++.+ +.+.+|.
T Consensus 399 ------~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Llsadk~--------~~v~lW~ 459 (459)
T KOG0288|consen 399 ------GSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNPSGSGLLSADKQ--------KAVTLWT 459 (459)
T ss_pred ------CceeeeccCCCcEEEEEccCceEEEEeccCCCCcceEEEEEcCCCchhhcccCC--------cceEecC
Confidence 999999999999999999999999888765554 9999999999999999888 9999993
No 82
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.71 E-value=2e-14 Score=171.79 Aligned_cols=233 Identities=11% Similarity=0.057 Sum_probs=166.4
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEE-EecccccCCCCCCccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVI-QFDLFERHNSPGASLK 556 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~-~~~~l~~~d~~~~~~d 556 (1471)
+...+..|-.++|...++..- .....+.||.+.|+.+...+... .+++.+.|.+-+++ +|. ..+.|.
T Consensus 331 ii~~g~~Gg~hlWkt~d~~~w-~~~~~iSGH~~~V~dv~W~psGe----flLsvs~DQTTRlFa~wg-------~q~~wH 398 (764)
T KOG1063|consen 331 IIAHGRTGGFHLWKTKDKTFW-TQEPVISGHVDGVKDVDWDPSGE----FLLSVSLDQTTRLFARWG-------RQQEWH 398 (764)
T ss_pred EEEecccCcEEEEeccCccce-eeccccccccccceeeeecCCCC----EEEEeccccceeeecccc-------ccccee
Confidence 445566778889984432211 22234589999999998555554 69999999999984 231 000111
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC--------------------------
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS-------------------------- 610 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t-------------------------- 610 (1471)
.+..-+-|....+|+++-+. ...++||.....+|+++...
T Consensus 399 -----EiaRPQiHGyDl~c~~~vn~---------~~~FVSgAdEKVlRvF~aPk~fv~~l~~i~g~~~~~~~~~p~gA~V 464 (764)
T KOG1063|consen 399 -----EIARPQIHGYDLTCLSFVNE---------DLQFVSGADEKVLRVFEAPKSFVKSLMAICGKCFKGSDELPDGANV 464 (764)
T ss_pred -----eecccccccccceeeehccC---------CceeeecccceeeeeecCcHHHHHHHHHHhCccccCchhccccccc
Confidence 11112347778999999873 57889999999999998640
Q ss_pred -----------------Cc---------------------------------eEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 611 -----------------GN---------------------------------LITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 611 -----------------g~---------------------------------~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
|. .++++.||.-.|.+++.+|+ ++
T Consensus 465 paLGLSnKa~~~~e~~~G~~~~~~~et~~~~~p~~L~ePP~EdqLq~~tLwPEv~KLYGHGyEv~~l~~s~~------gn 538 (764)
T KOG1063|consen 465 PALGLSNKAFFPGETNTGGEAAVCAETPLAAAPCELTEPPTEDQLQQNTLWPEVHKLYGHGYEVYALAISPT------GN 538 (764)
T ss_pred ccccccCCCCcccccccccccceeeecccccCchhccCCChHHHHHHhccchhhHHhccCceeEEEEEecCC------CC
Confidence 00 11245689999999999999 99
Q ss_pred EEEEEeCC-----CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE-E--
Q 000473 641 CFLSVGED-----FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR-E-- 712 (1471)
Q Consensus 641 ~l~S~s~D-----gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~-~-- 712 (1471)
.++|++.. ..|+||+..+...++.+.+|.-.|+.++|+|||+||++.|.| .++.+|....... .
T Consensus 539 liASaCKS~~~ehAvI~lw~t~~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsRD--------Rt~sl~~~~~~~~~e~~ 610 (764)
T KOG1063|consen 539 LIASACKSSLKEHAVIRLWNTANWLQVQELEGHSLTVTRLAFSPDGRYLLSVSRD--------RTVSLYEVQEDIKDEFR 610 (764)
T ss_pred EEeehhhhCCccceEEEEEeccchhhhheecccceEEEEEEECCCCcEEEEeecC--------ceEEeeeeecccchhhh
Confidence 99999864 458999999999999999999999999999999999999999 9999998754322 2
Q ss_pred -EEEeCCCCCceeeeeeeccccccccceEEcCCcccccc-ceeeccCCceEeeccccc
Q 000473 713 -RVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSL-LLPIHEDGTFRQSQIQND 768 (1471)
Q Consensus 713 -~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~-l~~~~~D~tir~w~l~~~ 768 (1471)
..++.|+--+....+. .... +++.++|.++++|.....
T Consensus 611 fa~~k~HtRIIWdcsW~------------------pde~~FaTaSRDK~VkVW~~~~~ 650 (764)
T KOG1063|consen 611 FACLKAHTRIIWDCSWS------------------PDEKYFATASRDKKVKVWEEPDL 650 (764)
T ss_pred hccccccceEEEEcccC------------------cccceeEEecCCceEEEEeccCc
Confidence 2355666333333333 2222 778999999999987543
No 83
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.71 E-value=4.4e-15 Score=160.48 Aligned_cols=143 Identities=16% Similarity=0.170 Sum_probs=110.5
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCc-ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSH-VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~-~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
.|++|-+||.|.+|+ ..++ +.+..-..|+..|+.+.+++| ..++++||.|.+-++
T Consensus 161 ~ii~Ghe~G~is~~d---------------a~~g~~~v~s~~~h~~~Ind~q~s~d---------~T~FiT~s~Dttakl 216 (327)
T KOG0643|consen 161 TIIAGHEDGSISIYD---------------ARTGKELVDSDEEHSSKINDLQFSRD---------RTYFITGSKDTTAKL 216 (327)
T ss_pred EEEEecCCCcEEEEE---------------cccCceeeechhhhccccccccccCC---------cceEEecccCcccee
Confidence 588999999999933 3343 445555779999999999998 899999999999999
Q ss_pred EECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC-CcEEEEECCCC------------cEEEEecCCCCCc
Q 000473 606 WDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED-FSVALASLETL------------RVERMFPGHPNYP 672 (1471)
Q Consensus 606 WDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D-gsV~lWdl~t~------------~~l~~~~gh~~~V 672 (1471)
||..+-+.+.++.. ..||++.+++|.. ...++-|+.| .-|.--+-+.| +.+....||-++|
T Consensus 217 ~D~~tl~v~Kty~t-e~PvN~aaisP~~-----d~VilgGGqeA~dVTTT~~r~GKFEArFyh~i~eEEigrvkGHFGPI 290 (327)
T KOG0643|consen 217 VDVRTLEVLKTYTT-ERPVNTAAISPLL-----DHVILGGGQEAMDVTTTSTRAGKFEARFYHLIFEEEIGRVKGHFGPI 290 (327)
T ss_pred eeccceeeEEEeee-cccccceeccccc-----ceEEecCCceeeeeeeecccccchhhhHHHHHHHHHhccccccccCc
Confidence 99999999988865 4689999999972 3344444444 12222222333 3455678999999
Q ss_pred EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 673 AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 673 ~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
++|+|+|+|+-.++|++| |.||+.-.+
T Consensus 291 NsvAfhPdGksYsSGGED--------G~VR~h~Fd 317 (327)
T KOG0643|consen 291 NSVAFHPDGKSYSSGGED--------GYVRLHHFD 317 (327)
T ss_pred ceeEECCCCcccccCCCC--------ceEEEEEec
Confidence 999999999999999998 999997554
No 84
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.71 E-value=1.3e-16 Score=184.54 Aligned_cols=207 Identities=21% Similarity=0.203 Sum_probs=165.4
Q ss_pred cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 504 DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 504 ~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
++..-...|.+..+-++.. .+++|.+.|.|+|++ ..+...++.+++|+.+|..+.|+|.
T Consensus 63 ~~srFk~~v~s~~fR~DG~----LlaaGD~sG~V~vfD---------------~k~r~iLR~~~ah~apv~~~~f~~~-- 121 (487)
T KOG0310|consen 63 TFSRFKDVVYSVDFRSDGR----LLAAGDESGHVKVFD---------------MKSRVILRQLYAHQAPVHVTKFSPQ-- 121 (487)
T ss_pred hHHhhccceeEEEeecCCe----EEEccCCcCcEEEec---------------cccHHHHHHHhhccCceeEEEeccc--
Confidence 3445557788888777776 789999999999943 2233456778999999999999996
Q ss_pred CcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEE
Q 000473 584 TAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVE 662 (1471)
Q Consensus 584 ~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l 662 (1471)
++..|++|++|+.+++||+.+......+.+|+..|.+.+|+|.+ ++.++|||.||+|++||++.. ..+
T Consensus 122 ------d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~-----~hivvtGsYDg~vrl~DtR~~~~~v 190 (487)
T KOG0310|consen 122 ------DNTMLVSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCGDISPAN-----DHIVVTGSYDGKVRLWDTRSLTSRV 190 (487)
T ss_pred ------CCeEEEecCCCceEEEEEcCCcEEEEEecCCcceeEeeccccCC-----CeEEEecCCCceEEEEEeccCCcee
Confidence 48899999999999999999988767899999999999999985 679999999999999999987 555
Q ss_pred EEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE-EEEeCCCCCceeeeeeeccccccccceEE
Q 000473 663 RMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE-RVLRGTASHSMFDHFCKGISMNSISGSVL 741 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~-~~l~gH~~~v~~~~~~~~~~~~~~sg~v~ 741 (1471)
.++ .|..+|..|.+-|.|..+++++.+ .|+|||+-+|..+ ..+..|...|++..+..
T Consensus 191 ~el-nhg~pVe~vl~lpsgs~iasAgGn---------~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s------------ 248 (487)
T KOG0310|consen 191 VEL-NHGCPVESVLALPSGSLIASAGGN---------SVKVWDLTTGGQLLTSMFNHNKTVTCLRLAS------------ 248 (487)
T ss_pred EEe-cCCCceeeEEEcCCCCEEEEcCCC---------eEEEEEecCCceehhhhhcccceEEEEEeec------------
Confidence 565 499999999999999999998874 8999999966554 44445998999886651
Q ss_pred cCCccccccceeeccCCceEeecccccc
Q 000473 742 NGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 742 ~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
++.-..++.+ |+.+++++.-+++
T Consensus 249 ~~~rLlS~sL-----D~~VKVfd~t~~K 271 (487)
T KOG0310|consen 249 DSTRLLSGSL-----DRHVKVFDTTNYK 271 (487)
T ss_pred CCceEeeccc-----ccceEEEEccceE
Confidence 1122233333 9999998865444
No 85
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.71 E-value=2.2e-16 Score=179.04 Aligned_cols=234 Identities=19% Similarity=0.187 Sum_probs=178.2
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+++...+..|++|++...+. ....++.+..+.|+++.+-+++. .++.++.|+..++ |+
T Consensus 189 tlatgg~Dr~Ik~W~v~~~k~--~~~~tLaGs~g~it~~d~d~~~~----~~iAas~d~~~r~---------------Wn 247 (459)
T KOG0288|consen 189 TLATGGSDRIIKLWNVLGEKS--ELISTLAGSLGNITSIDFDSDNK----HVIAASNDKNLRL---------------WN 247 (459)
T ss_pred hhhhcchhhhhhhhhcccchh--hhhhhhhccCCCcceeeecCCCc----eEEeecCCCceee---------------ee
Confidence 466777888999999887652 46677788888999998666665 6888999999999 34
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCC-Cccc-----------------------Cc------CCCEEEEEECCCcEEEE
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVG-TAKG-----------------------WS------FNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~-~~~~-----------------------~~------~~~~L~SGs~DgtI~lW 606 (1471)
+.+.+...+|.||++.|+++.|...+. .-++ .+ ....++||-.|++|++|
T Consensus 248 vd~~r~~~TLsGHtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~kt~l~~S~cnDI~~~~~~~~SgH~DkkvRfw 327 (459)
T KOG0288|consen 248 VDSLRLRHTLSGHTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSKTVLPGSQCNDIVCSISDVISGHFDKKVRFW 327 (459)
T ss_pred ccchhhhhhhcccccceeeehhhccccceeeccccchhhhhhhhhhheeccccccccccceEecceeeeecccccceEEE
Confidence 556678889999999999999866431 0000 00 13344566666666666
Q ss_pred ECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC----CCCcEEEEEcCCCC
Q 000473 607 DLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH----PNYPAKVVWDCPRG 682 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh----~~~V~~v~~spdg~ 682 (1471)
|++++.+......+. .|+++..+++ +..+++.+.|.++.+.|+++....+.+... ....+.+.|||++.
T Consensus 328 D~Rs~~~~~sv~~gg-~vtSl~ls~~------g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfSpd~~ 400 (459)
T KOG0288|consen 328 DIRSADKTRSVPLGG-RVTSLDLSMD------GLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFSPDGS 400 (459)
T ss_pred eccCCceeeEeecCc-ceeeEeeccC------CeEEeeecCCCceeeeecccccEEEEeeccccccccccceeEECCCCc
Confidence 666666666666655 8999999999 889999999999999999999888777532 22489999999999
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCC--ceeeeeeeccccccccceEEcCCccccccceeeccCCce
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASH--SMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTF 760 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~--v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~ti 760 (1471)
|+++|+.| |.|+||++.+|+++..+....+. ++++.|++ .| ..++.+.++..+
T Consensus 401 YvaAGS~d--------gsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~------------sG-----~~Llsadk~~~v 455 (459)
T KOG0288|consen 401 YVAAGSAD--------GSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNP------------SG-----SGLLSADKQKAV 455 (459)
T ss_pred eeeeccCC--------CcEEEEEccCceEEEEeccCCCCcceEEEEEcC------------CC-----chhhcccCCcce
Confidence 99999999 99999999999999999887765 77777773 11 124455667777
Q ss_pred Eee
Q 000473 761 RQS 763 (1471)
Q Consensus 761 r~w 763 (1471)
..|
T Consensus 456 ~lW 458 (459)
T KOG0288|consen 456 TLW 458 (459)
T ss_pred Eec
Confidence 777
No 86
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.70 E-value=1.1e-16 Score=175.13 Aligned_cols=213 Identities=16% Similarity=0.169 Sum_probs=177.9
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEe
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAH 579 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~s 579 (1471)
++...+.+|...|..++.-+... .+.+++.|.+-+| |.++++.++.++.||.+.|+|++||
T Consensus 139 ~lvre~~GHkDGiW~Vaa~~tqp----i~gtASADhTA~i---------------Ws~Esg~CL~~Y~GH~GSVNsikfh 199 (481)
T KOG0300|consen 139 RLVRELEGHKDGIWHVAADSTQP----ICGTASADHTARI---------------WSLESGACLATYTGHTGSVNSIKFH 199 (481)
T ss_pred eehhhhcccccceeeehhhcCCc----ceeecccccceeE---------------EeeccccceeeecccccceeeEEec
Confidence 44556788999998876554443 6778888888888 4577899999999999999999999
Q ss_pred cCCCCcccCcCCCEEEEEECCCcEEEEECC------C------------------------------C----ceEEEEec
Q 000473 580 RMVGTAKGWSFNEVLVSGSMDCSIRIWDLG------S------------------------------G----NLITVMHH 619 (1471)
Q Consensus 580 pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~------t------------------------------g----~~l~~~~~ 619 (1471)
+. +.+++++|.|++..+|... . + -++..|.+
T Consensus 200 ~s---------~~L~lTaSGD~taHIW~~av~~~vP~~~a~~~hSsEeE~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltg 270 (481)
T KOG0300|consen 200 NS---------GLLLLTASGDETAHIWKAAVNWEVPSNNAPSDHSSEEEEEHSDEHNRDTDSSEKSDGHTIRVPLMRLTG 270 (481)
T ss_pred cc---------cceEEEccCCcchHHHHHhhcCcCCCCCCCCCCCchhhhhcccccccccccccccCCceeeeeeeeeec
Confidence 86 8999999999999999721 0 0 14567899
Q ss_pred cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCC
Q 000473 620 HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVD 699 (1471)
Q Consensus 620 H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~g 699 (1471)
|.+.|.+..|... |+.+++++.|.+-.+||++++++++.+.||....+.++-+|..+++++.+.| .
T Consensus 271 H~~vV~a~dWL~g------g~Q~vTaSWDRTAnlwDVEtge~v~~LtGHd~ELtHcstHptQrLVvTsSrD--------t 336 (481)
T KOG0300|consen 271 HRAVVSACDWLAG------GQQMVTASWDRTANLWDVETGEVVNILTGHDSELTHCSTHPTQRLVVTSSRD--------T 336 (481)
T ss_pred cccceEehhhhcC------cceeeeeeccccceeeeeccCceeccccCcchhccccccCCcceEEEEeccC--------c
Confidence 9999999999887 8999999999999999999999999999999999999999999999999999 8
Q ss_pred EEEEEECCCC-eEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccccc
Q 000473 700 VLFIWDVKTG-ARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGV 772 (1471)
Q Consensus 700 tV~VWDi~tg-~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~ 772 (1471)
+.++||++.. +-+.++.||+..|+.+.|... . .+++-+.|.++++|+|+|.....
T Consensus 337 TFRLWDFReaI~sV~VFQGHtdtVTS~vF~~d-------------d-----~vVSgSDDrTvKvWdLrNMRspl 392 (481)
T KOG0300|consen 337 TFRLWDFREAIQSVAVFQGHTDTVTSVVFNTD-------------D-----RVVSGSDDRTVKVWDLRNMRSPL 392 (481)
T ss_pred eeEeccchhhcceeeeecccccceeEEEEecC-------------C-----ceeecCCCceEEEeeeccccCcc
Confidence 9999999843 457889999999999877731 1 12333459999999998876544
No 87
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.70 E-value=5.5e-17 Score=195.32 Aligned_cols=195 Identities=19% Similarity=0.221 Sum_probs=157.0
Q ss_pred CCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEE
Q 000473 525 PYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIR 604 (1471)
Q Consensus 525 P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~ 604 (1471)
-+.|++++.+|.|.+|+.... ...+....|..|+..|+++.||+.+ ..+|+|||.||+|+
T Consensus 100 ~NlIAT~s~nG~i~vWdlnk~------------~rnk~l~~f~EH~Rs~~~ldfh~te--------p~iliSGSQDg~vK 159 (839)
T KOG0269|consen 100 SNLIATCSTNGVISVWDLNKS------------IRNKLLTVFNEHERSANKLDFHSTE--------PNILISGSQDGTVK 159 (839)
T ss_pred hhhheeecCCCcEEEEecCcc------------ccchhhhHhhhhccceeeeeeccCC--------ccEEEecCCCceEE
Confidence 347999999999999544310 1134556789999999999999863 78999999999999
Q ss_pred EEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC-CcEEEEecCCCCCcEEEEEcCCCCE
Q 000473 605 IWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET-LRVERMFPGHPNYPAKVVWDCPRGY 683 (1471)
Q Consensus 605 lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t-~~~l~~~~gh~~~V~~v~~spdg~~ 683 (1471)
+||++..+-..++.+....|+.|.|+|.. ++.|+++.+.|.+++||++. .++..++..|.++|.++.|+|++.+
T Consensus 160 ~~DlR~~~S~~t~~~nSESiRDV~fsp~~-----~~~F~s~~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPnr~~ 234 (839)
T KOG0269|consen 160 CWDLRSKKSKSTFRSNSESIRDVKFSPGY-----GNKFASIHDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPNREW 234 (839)
T ss_pred EEeeecccccccccccchhhhceeeccCC-----CceEEEecCCceEEEeeccCchhHHHHhhcccCceEEEeecCCCce
Confidence 99999999999999999999999999985 88999999999999999975 4678889999999999999999999
Q ss_pred EEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC-CCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEe
Q 000473 684 IACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT-ASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQ 762 (1471)
Q Consensus 684 L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH-~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~ 762 (1471)
||||+.| +.|+|||+.+++.-....-+ .+.+..+.|-|+...+..+ .. ..-|..|++
T Consensus 235 lATGGRD--------K~vkiWd~t~~~~~~~~tInTiapv~rVkWRP~~~~hLAt-----------cs---mv~dtsV~V 292 (839)
T KOG0269|consen 235 LATGGRD--------KMVKIWDMTDSRAKPKHTINTIAPVGRVKWRPARSYHLAT-----------CS---MVVDTSVHV 292 (839)
T ss_pred eeecCCC--------ccEEEEeccCCCccceeEEeecceeeeeeeccCccchhhh-----------hh---ccccceEEE
Confidence 9999999 99999999876543333333 3466677788765433221 11 122778999
Q ss_pred eccc
Q 000473 763 SQIQ 766 (1471)
Q Consensus 763 w~l~ 766 (1471)
|+++
T Consensus 293 WDvr 296 (839)
T KOG0269|consen 293 WDVR 296 (839)
T ss_pred Eeec
Confidence 9964
No 88
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.70 E-value=6.4e-16 Score=170.38 Aligned_cols=212 Identities=19% Similarity=0.203 Sum_probs=160.5
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
..+..|...++++++.. ..+++|+.|-+|+|++.. ....+..+..|.+.|+++.|.+..
T Consensus 37 F~~~aH~~sitavAVs~------~~~aSGssDetI~IYDm~---------------k~~qlg~ll~HagsitaL~F~~~~ 95 (362)
T KOG0294|consen 37 FAFSAHAGSITALAVSG------PYVASGSSDETIHIYDMR---------------KRKQLGILLSHAGSITALKFYPPL 95 (362)
T ss_pred ccccccccceeEEEecc------eeEeccCCCCcEEEEecc---------------chhhhcceeccccceEEEEecCCc
Confidence 34678999999988432 269999999999994433 345567788999999999998762
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
...+|+||+.||.|.+|+...++++..+++|.+.|+.++++|. ++..++++.|+.+++||+-+|+.-
T Consensus 96 -------S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS------~KLALsVg~D~~lr~WNLV~Gr~a 162 (362)
T KOG0294|consen 96 -------SKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPS------GKLALSVGGDQVLRTWNLVRGRVA 162 (362)
T ss_pred -------chhheeeecCCCcEEEEEcCCeEEeeeecccccccceeEecCC------CceEEEEcCCceeeeehhhcCccc
Confidence 1349999999999999999999999999999999999999998 999999999999999999887643
Q ss_pred EEecCCCCCcEEEEEcCCCCEEEEEEcCC---------------------------------CCCCCCCCEEEEEECCCC
Q 000473 663 RMFPGHPNYPAKVVWDCPRGYIACLCRDH---------------------------------SRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~~L~sgs~D~---------------------------------sg~~D~~gtV~VWDi~tg 709 (1471)
..+.- ....+.|.|+|.|.+++.+..+. .|..| +.|.+||...+
T Consensus 163 ~v~~L-~~~at~v~w~~~Gd~F~v~~~~~i~i~q~d~A~v~~~i~~~~r~l~~~~l~~~~L~vG~d~--~~i~~~D~ds~ 239 (362)
T KOG0294|consen 163 FVLNL-KNKATLVSWSPQGDHFVVSGRNKIDIYQLDNASVFREIENPKRILCATFLDGSELLVGGDN--EWISLKDTDSD 239 (362)
T ss_pred eeecc-CCcceeeEEcCCCCEEEEEeccEEEEEecccHhHhhhhhccccceeeeecCCceEEEecCC--ceEEEeccCCC
Confidence 33321 11122355555555444433321 22222 89999999999
Q ss_pred eEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 710 ARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 710 ~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
.+...+.+|.++|-.+.+-. +. ....++++++||.|++|++.
T Consensus 240 ~~~~~~~AH~~RVK~i~~~~----~~-----------~~~~lvTaSSDG~I~vWd~~ 281 (362)
T KOG0294|consen 240 TPLTEFLAHENRVKDIASYT----NP-----------EHEYLVTASSDGFIKVWDID 281 (362)
T ss_pred ccceeeecchhheeeeEEEe----cC-----------CceEEEEeccCceEEEEEcc
Confidence 99999999999988874331 00 01346788889999999974
No 89
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.70 E-value=2.5e-16 Score=181.34 Aligned_cols=221 Identities=16% Similarity=0.191 Sum_probs=173.4
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~ 586 (1471)
.|.+.|+...++|... +.+++++.+|.+.|+++..... ..-......+-.+|.||++.-+.|+|++.
T Consensus 122 ~h~gEVnRaRymPQnp---~iVAt~t~~~dv~Vfd~tk~~s-----~~~~~~~~~Pdl~L~gH~~eg~glsWn~~----- 188 (422)
T KOG0264|consen 122 NHDGEVNRARYMPQNP---NIVATKTSSGDVYVFDYTKHPS-----KPKASGECRPDLRLKGHEKEGYGLSWNRQ----- 188 (422)
T ss_pred cCCccchhhhhCCCCC---cEEEecCCCCCEEEEEeccCCC-----cccccccCCCceEEEeecccccccccccc-----
Confidence 4788899888777665 4788889999999966652111 00001123455689999998888999886
Q ss_pred cCcCCCEEEEEECCCcEEEEECCCCc-------eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC--
Q 000473 587 GWSFNEVLVSGSMDCSIRIWDLGSGN-------LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE-- 657 (1471)
Q Consensus 587 ~~~~~~~L~SGs~DgtI~lWDl~tg~-------~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~-- 657 (1471)
..-.|++|+.|++|++||+.... ....|.+|...|..++|+|-+ .+.|++++.|+.+.|||++
T Consensus 189 ---~~g~Lls~~~d~~i~lwdi~~~~~~~~~~~p~~~~~~h~~~VeDV~~h~~h-----~~lF~sv~dd~~L~iwD~R~~ 260 (422)
T KOG0264|consen 189 ---QEGTLLSGSDDHTICLWDINAESKEDKVVDPKTIFSGHEDVVEDVAWHPLH-----EDLFGSVGDDGKLMIWDTRSN 260 (422)
T ss_pred ---cceeEeeccCCCcEEEEeccccccCCccccceEEeecCCcceehhhccccc-----hhhheeecCCCeEEEEEcCCC
Confidence 36799999999999999997533 457889999999999999985 6799999999999999999
Q ss_pred CCcEEEEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCCceeeeeeecccccc
Q 000473 658 TLRVERMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASHSMFDHFCKGISMNS 735 (1471)
Q Consensus 658 t~~~l~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~v~~~~~~~~~~~~~ 735 (1471)
+.++.+...+|.++|.|++|+|-+. .||+|+.| ++|++||+++- +++.++.+|...|..+.|.|....
T Consensus 261 ~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D--------~tV~LwDlRnL~~~lh~~e~H~dev~~V~WSPh~et-- 330 (422)
T KOG0264|consen 261 TSKPSHSVKAHSAEVNCVAFNPFNEFILATGSAD--------KTVALWDLRNLNKPLHTFEGHEDEVFQVEWSPHNET-- 330 (422)
T ss_pred CCCCcccccccCCceeEEEeCCCCCceEEeccCC--------CcEEEeechhcccCceeccCCCcceEEEEeCCCCCc--
Confidence 5667788889999999999999665 55777777 99999999984 578999999999999999964321
Q ss_pred ccceEEcCCccccccceeeccCCceEeeccccccccc
Q 000473 736 ISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGV 772 (1471)
Q Consensus 736 ~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~ 772 (1471)
. +-+...|+++.+|++.......
T Consensus 331 ---v-----------LASSg~D~rl~vWDls~ig~eq 353 (422)
T KOG0264|consen 331 ---V-----------LASSGTDRRLNVWDLSRIGEEQ 353 (422)
T ss_pred ---e-----------eEecccCCcEEEEecccccccc
Confidence 1 1233359999999997655443
No 90
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.69 E-value=1.3e-12 Score=156.52 Aligned_cols=503 Identities=11% Similarity=0.042 Sum_probs=272.2
Q ss_pred ceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccc-cceeEeeeccccccccCccccccccccccccc
Q 000473 17 HRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHS-APIADLSICYPAMVSRDGKAEHWKAENSSNVM 95 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~-~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~ 95 (1471)
..|+|+|++.+...||.+=.||.|.+|++.. + --....+.||. ..|.+|+|
T Consensus 26 s~I~slA~s~kS~~lAvsRt~g~IEiwN~~~--~--w~~~~vi~g~~drsIE~L~W------------------------ 77 (691)
T KOG2048|consen 26 SEIVSLAYSHKSNQLAVSRTDGNIEIWNLSN--N--WFLEPVIHGPEDRSIESLAW------------------------ 77 (691)
T ss_pred cceEEEEEeccCCceeeeccCCcEEEEccCC--C--ceeeEEEecCCCCceeeEEE------------------------
Confidence 3499999999999999999999999999984 2 22334455655 68999985
Q ss_pred ccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccccc
Q 000473 96 GKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDL 175 (1471)
Q Consensus 96 ~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~ 175 (1471)
. ++.+|.|.+.+|+|+-||+.+++.+...... |.+.--.+..+.+..+++||+
T Consensus 78 ---~-e~~RLFS~g~sg~i~EwDl~~lk~~~~~d~~---gg~IWsiai~p~~~~l~Igcd-------------------- 130 (691)
T KOG2048|consen 78 ---A-EGGRLFSSGLSGSITEWDLHTLKQKYNIDSN---GGAIWSIAINPENTILAIGCD-------------------- 130 (691)
T ss_pred ---c-cCCeEEeecCCceEEEEecccCceeEEecCC---CcceeEEEeCCccceEEeecC--------------------
Confidence 2 4667999999999999999999988776554 444434456677788899986
Q ss_pred cccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCcc-
Q 000473 176 VSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLD- 254 (1471)
Q Consensus 176 ~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~- 254 (1471)
++.+..++...+++.+.-.-.+. .+.+-++++. +++. .++.|+.||.|++||..++....
T Consensus 131 -------------dGvl~~~s~~p~~I~~~r~l~rq-~sRvLslsw~---~~~~--~i~~Gs~Dg~Iriwd~~~~~t~~~ 191 (691)
T KOG2048|consen 131 -------------DGVLYDFSIGPDKITYKRSLMRQ-KSRVLSLSWN---PTGT--KIAGGSIDGVIRIWDVKSGQTLHI 191 (691)
T ss_pred -------------CceEEEEecCCceEEEEeecccc-cceEEEEEec---CCcc--EEEecccCceEEEEEcCCCceEEE
Confidence 35566666666666543322122 2235566655 2333 37778999999999998773110
Q ss_pred -cccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCc--ceeeee-eecceeEeecCCCC
Q 000473 255 -REEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGS--TIGEIC-FVDNLFCLEGGSTN 330 (1471)
Q Consensus 255 -~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~--~ige~~-~~~~~l~~~~~~~~ 330 (1471)
...-.++.+.+. .-+.+|.+-.+ .+|+++++.+ .+++||... .+..+. .-.+++|.....++
T Consensus 192 ~~~~~d~l~k~~~------------~iVWSv~~Lrd-~tI~sgDS~G-~V~FWd~~~gTLiqS~~~h~adVl~Lav~~~~ 257 (691)
T KOG2048|consen 192 ITMQLDRLSKREP------------TIVWSVLFLRD-STIASGDSAG-TVTFWDSIFGTLIQSHSCHDADVLALAVADNE 257 (691)
T ss_pred eeecccccccCCc------------eEEEEEEEeec-CcEEEecCCc-eEEEEcccCcchhhhhhhhhcceeEEEEcCCC
Confidence 000111111110 01234444433 4455565554 344576654 222221 11133333322211
Q ss_pred ceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeecCCCCCcccCeeeecC--ccCCCCceeeEEEeec
Q 000473 331 SYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISYMNEKFDYEPHFEIPA--VSYPSGVKFSIHFIQM 408 (1471)
Q Consensus 331 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~~~~~~~~~~~~~ip~--v~~~~~~~~~i~f~~~ 408 (1471)
.++...+.++.++.|+....... |.+.. -.+.++.|....+..
T Consensus 258 ----------------------------d~vfsaGvd~~ii~~~~~~~~~~------wv~~~~r~~h~hdvrs~av~~~- 302 (691)
T KOG2048|consen 258 ----------------------------DRVFSAGVDPKIIQYSLTTNKSE------WVINSRRDLHAHDVRSMAVIEN- 302 (691)
T ss_pred ----------------------------CeEEEccCCCceEEEEecCCccc------eeeeccccCCcccceeeeeecc-
Confidence 25556666777776766543221 21111 011222222111111
Q ss_pred ceeeEEeeeeeccccccccccCeeEEEEcccc--CCCCCcceeEeccCCceEeeccccccccCCCCcccceeecccccCc
Q 000473 409 SLYLLRMETVCFHVEETSQWRPYISVWSLSQK--HSGPGKQCRMVGEGFSFVDWVNNSTFLDENEGSCTGKSDLTFCQDT 486 (1471)
Q Consensus 409 ~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~--~~~~~~~~k~l~~g~~~~~w~~~~~~~~~~dG~~i~~l~~s~~~~~ 486 (1471)
.+++.. .+..+. .+..+....+.. .+....-+++...... ++...-....
T Consensus 303 --~l~sgG-----~d~~l~-i~~s~~~~~~~h~~~~~~p~~~~v~~a~~~--------------------~L~~~w~~h~ 354 (691)
T KOG2048|consen 303 --ALISGG-----RDFTLA-ICSSREFKNMDHRQKNLFPASDRVSVAPEN--------------------RLLVLWKAHG 354 (691)
T ss_pred --eEEecc-----eeeEEE-EccccccCchhhhccccccccceeecCccc--------------------eEEEEecccc
Confidence 111100 000000 000000000000 0000000011111100 1111112334
Q ss_pred cccccccCCCCCC-----CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcc
Q 000473 487 VPRSEHVDSRQAG-----DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV 561 (1471)
Q Consensus 487 v~~Wd~~~~~~~g-----~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~ 561 (1471)
+.+|.+....+.| .+.+........++|.+..|+.. .++.+.- .+.+|+..+- . + ++. -+
T Consensus 355 v~lwrlGS~~~~g~~~~~~Llkl~~k~~~nIs~~aiSPdg~----~Ia~st~-~~~~iy~L~~---~--~----~vk-~~ 419 (691)
T KOG2048|consen 355 VDLWRLGSVILQGEYNYIHLLKLFTKEKENISCAAISPDGN----LIAISTV-SRTKIYRLQP---D--P----NVK-VI 419 (691)
T ss_pred ccceeccCcccccccChhhheeeecCCccceeeeccCCCCC----EEEEeec-cceEEEEecc---C--c----cee-EE
Confidence 4555554442222 22222223345577766555554 5666543 3455544431 0 0 000 00
Q ss_pred eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE-CCCcEEEEECCCCc--eEEEEe--ccCCCEEEEEECCCCCCC
Q 000473 562 SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS-MDCSIRIWDLGSGN--LITVMH--HHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 562 ~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs-~DgtI~lWDl~tg~--~l~~~~--~H~~~V~~l~fspd~~~~ 636 (1471)
.+.....-.-.+..+.|.-| +..++-.+ .++.+.+.++.++. .+..+. ....+|..+..+|+
T Consensus 420 ~v~~~~~~~~~a~~i~ftid---------~~k~~~~s~~~~~le~~el~~ps~kel~~~~~~~~~~~I~~l~~Ssd---- 486 (691)
T KOG2048|consen 420 NVDDVPLALLDASAISFTID---------KNKLFLVSKNIFSLEEFELETPSFKELKSIQSQAKCPSISRLVVSSD---- 486 (691)
T ss_pred EeccchhhhccceeeEEEec---------CceEEEEecccceeEEEEecCcchhhhhccccccCCCcceeEEEcCC----
Confidence 11111112234566777766 45555444 77888888887654 232222 34578999999999
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC-CCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC-PRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp-dg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
|++||..+.++.|.+|++++++.......-...|++++|+| +.+.|+.+..| +.|+=+|++.
T Consensus 487 --G~yiaa~~t~g~I~v~nl~~~~~~~l~~rln~~vTa~~~~~~~~~~lvvats~--------nQv~efdi~~ 549 (691)
T KOG2048|consen 487 --GNYIAAISTRGQIFVYNLETLESHLLKVRLNIDVTAAAFSPFVRNRLVVATSN--------NQVFEFDIEA 549 (691)
T ss_pred --CCEEEEEeccceEEEEEcccceeecchhccCcceeeeeccccccCcEEEEecC--------CeEEEEecch
Confidence 99999999999999999999887655545568899999994 56788888887 9999999954
No 91
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.69 E-value=6.2e-16 Score=164.20 Aligned_cols=182 Identities=15% Similarity=0.101 Sum_probs=158.1
Q ss_pred cccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecC
Q 000473 502 RDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRM 581 (1471)
Q Consensus 502 ~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd 581 (1471)
...+..|.+.|.++.|.-+.. +.++++.|.+|++ |+...+.+++++.||...|..++.+.|
T Consensus 10 ~~~l~~~qgaV~avryN~dGn----Y~ltcGsdrtvrL---------------WNp~rg~liktYsghG~EVlD~~~s~D 70 (307)
T KOG0316|consen 10 LSILDCAQGAVRAVRYNVDGN----YCLTCGSDRTVRL---------------WNPLRGALIKTYSGHGHEVLDAALSSD 70 (307)
T ss_pred ceeecccccceEEEEEccCCC----EEEEcCCCceEEe---------------ecccccceeeeecCCCceeeecccccc
Confidence 344667889999998777776 5778888999998 446667899999999999999998876
Q ss_pred CCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC--
Q 000473 582 VGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-- 659 (1471)
Q Consensus 582 ~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-- 659 (1471)
+..+++|+.|..|.+||+.+|+.+++|.+|.+.|..+.|+.+ ...++||+-|.++++||.++.
T Consensus 71 ---------nskf~s~GgDk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNee------sSVv~SgsfD~s~r~wDCRS~s~ 135 (307)
T KOG0316|consen 71 ---------NSKFASCGGDKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEE------SSVVASGSFDSSVRLWDCRSRSF 135 (307)
T ss_pred ---------ccccccCCCCceEEEEEcccCeeeeecccccceeeEEEecCc------ceEEEeccccceeEEEEcccCCC
Confidence 789999999999999999999999999999999999999988 789999999999999999764
Q ss_pred cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeee
Q 000473 660 RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
++++.+......|.+|... +..|++|+.| |++|.||++.|++---+.||. +.++.|.+
T Consensus 136 ePiQildea~D~V~Si~v~--~heIvaGS~D--------GtvRtydiR~G~l~sDy~g~p--it~vs~s~ 193 (307)
T KOG0316|consen 136 EPIQILDEAKDGVSSIDVA--EHEIVAGSVD--------GTVRTYDIRKGTLSSDYFGHP--ITSVSFSK 193 (307)
T ss_pred CccchhhhhcCceeEEEec--ccEEEeeccC--------CcEEEEEeecceeehhhcCCc--ceeEEecC
Confidence 6788888888889999886 5578999998 999999999999988888874 66665663
No 92
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.69 E-value=2.1e-16 Score=187.11 Aligned_cols=234 Identities=13% Similarity=0.146 Sum_probs=185.2
Q ss_pred eecccccCccccccccCCCC--CCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQ--AGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~--~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
+.+.+.++.++.|+...... .......+..|...|+.+....... .+++++.|-+|++|+..
T Consensus 40 LfTgGRDg~i~~W~~~~d~~~~s~~~~asme~HsDWVNDiiL~~~~~----tlIS~SsDtTVK~W~~~------------ 103 (735)
T KOG0308|consen 40 LFTGGRDGIIRLWSVTQDSNEPSTPYIASMEHHSDWVNDIILCGNGK----TLISASSDTTVKVWNAH------------ 103 (735)
T ss_pred EEecCCCceEEEeccccccCCcccchhhhhhhhHhHHhhHHhhcCCC----ceEEecCCceEEEeecc------------
Confidence 55566778888888755332 1234667888999888776555554 79999999999994433
Q ss_pred ccCCcceEEEEecCCccEEEEEE-ecCCCCcccCcCCCEEEEEECCCcEEEEECCCC--ceEEEE--------e-ccCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAA-HRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG--NLITVM--------H-HHVAP 623 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~-spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg--~~l~~~--------~-~H~~~ 623 (1471)
....-+..++..|++.|.|+++ .++ ..+++|||-|+.|.+||+++| ++++++ . ++..+
T Consensus 104 -~~~~~c~stir~H~DYVkcla~~ak~---------~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~s 173 (735)
T KOG0308|consen 104 -KDNTFCMSTIRTHKDYVKCLAYIAKN---------NELVASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDS 173 (735)
T ss_pred -cCcchhHhhhhcccchheeeeecccC---------ceeEEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccc
Confidence 1111456778899999999999 544 789999999999999999988 333333 2 78889
Q ss_pred EEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEE
Q 000473 624 VRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFI 703 (1471)
Q Consensus 624 V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~V 703 (1471)
|.+++.++. |..|++|+..+.+++||.++++.+..+.||..-|.++..++||..+++++.| |+|++
T Consensus 174 iYSLA~N~t------~t~ivsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~~dDGt~~ls~sSD--------gtIrl 239 (735)
T KOG0308|consen 174 IYSLAMNQT------GTIIVSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDDGTRLLSASSD--------GTIRL 239 (735)
T ss_pred eeeeecCCc------ceEEEecCcccceEEeccccccceeeeeccccceEEEEEcCCCCeEeecCCC--------ceEEe
Confidence 999999998 8899999999999999999999999999999999999999999999999998 99999
Q ss_pred EECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccc
Q 000473 704 WDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQND 768 (1471)
Q Consensus 704 WDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~ 768 (1471)
||+...+++.++..|...|...+..+. -..++.-.+|+.+..=+|+++
T Consensus 240 WdLgqQrCl~T~~vH~e~VWaL~~~~s-----------------f~~vYsG~rd~~i~~Tdl~n~ 287 (735)
T KOG0308|consen 240 WDLGQQRCLATYIVHKEGVWALQSSPS-----------------FTHVYSGGRDGNIYRTDLRNP 287 (735)
T ss_pred eeccccceeeeEEeccCceEEEeeCCC-----------------cceEEecCCCCcEEecccCCc
Confidence 999999999999999988877633211 112334455777777666654
No 93
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.68 E-value=1.3e-14 Score=159.40 Aligned_cols=85 Identities=24% Similarity=0.415 Sum_probs=70.1
Q ss_pred CCCCCceEEEEEEcCC-CCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccc
Q 000473 12 GTPPSHRVTATSALTQ-PPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAEN 90 (1471)
Q Consensus 12 ~~~p~h~Vtava~SpD-g~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~ 90 (1471)
+.||...|++++|||. ...++.||.||+|++|++.. .+...++ ....|.++|.+++
T Consensus 23 ~~pP~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~--~g~~~~k-a~~~~~~PvL~v~-------------------- 79 (347)
T KOG0647|consen 23 PNPPEDSISALAFSPQADNLLAAGSWDGTVRIWEVQN--SGQLVPK-AQQSHDGPVLDVC-------------------- 79 (347)
T ss_pred CCCcccchheeEeccccCceEEecccCCceEEEEEec--CCcccch-hhhccCCCeEEEE--------------------
Confidence 4688999999999994 45666899999999999984 2334443 3467999999998
Q ss_pred cccccccccCCCCEEEEEeCCCeEEEEEcCCCeEEE
Q 000473 91 SSNVMGKSSLDNGALISACTDGVLCVWSRSSGHCRR 126 (1471)
Q Consensus 91 ~~~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~ 126 (1471)
++.|+..+++|+.|+++++||+.+|+...
T Consensus 80 -------WsddgskVf~g~~Dk~~k~wDL~S~Q~~~ 108 (347)
T KOG0647|consen 80 -------WSDDGSKVFSGGCDKQAKLWDLASGQVSQ 108 (347)
T ss_pred -------EccCCceEEeeccCCceEEEEccCCCeee
Confidence 67788889999999999999999996553
No 94
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.67 E-value=7.8e-17 Score=192.53 Aligned_cols=185 Identities=20% Similarity=0.230 Sum_probs=166.7
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
..+.+|...|.|+.+..... .++.|..+|+|++ | |++..+.+++|.||...+..|.|+|-
T Consensus 64 ~S~~~hespIeSl~f~~~E~----LlaagsasgtiK~--w-------------DleeAk~vrtLtgh~~~~~sv~f~P~- 123 (825)
T KOG0267|consen 64 TSLTGHESPIESLTFDTSER----LLAAGSASGTIKV--W-------------DLEEAKIVRTLTGHLLNITSVDFHPY- 123 (825)
T ss_pred heeeccCCcceeeecCcchh----hhcccccCCceee--e-------------ehhhhhhhhhhhccccCcceeeeccc-
Confidence 34789999999988444443 7999999999999 4 45566778899999999999999995
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
+.+.++|+.|+.+++||++...|.+.+.+|...|..+.|+|+ |.+++++++|.++++||+..|+..
T Consensus 124 --------~~~~a~gStdtd~~iwD~Rk~Gc~~~~~s~~~vv~~l~lsP~------Gr~v~~g~ed~tvki~d~~agk~~ 189 (825)
T KOG0267|consen 124 --------GEFFASGSTDTDLKIWDIRKKGCSHTYKSHTRVVDVLRLSPD------GRWVASGGEDNTVKIWDLTAGKLS 189 (825)
T ss_pred --------eEEeccccccccceehhhhccCceeeecCCcceeEEEeecCC------CceeeccCCcceeeeecccccccc
Confidence 888999999999999999988899999999999999999999 999999999999999999999999
Q ss_pred EEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeee
Q 000473 663 RMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
..|.+|...+..+.|+|.+-.+++|+.| ++|++||++|.+.+....+....+.+..|.+
T Consensus 190 ~ef~~~e~~v~sle~hp~e~Lla~Gs~d--------~tv~f~dletfe~I~s~~~~~~~v~~~~fn~ 248 (825)
T KOG0267|consen 190 KEFKSHEGKVQSLEFHPLEVLLAPGSSD--------RTVRFWDLETFEVISSGKPETDGVRSLAFNP 248 (825)
T ss_pred cccccccccccccccCchhhhhccCCCC--------ceeeeeccceeEEeeccCCccCCceeeeecC
Confidence 9999999999999999999999998888 9999999999999888887777777776764
No 95
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.67 E-value=5e-16 Score=169.94 Aligned_cols=223 Identities=17% Similarity=0.249 Sum_probs=167.9
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCC--CCccccCC-cceEEEEecCCccEEEE
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSP--GASLKVNS-HVSRQYFLGHTGAVLCL 576 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~--~~~~d~~s-~~~~~~l~gH~~~V~~l 576 (1471)
.+...+..|...+.+.++.++.. .+++|+.|.+|+|.+...+.....+ ...-+.+. ...+++|..|.+.|++|
T Consensus 103 yEt~ylt~HK~~cR~aafs~DG~----lvATGsaD~SIKildvermlaks~~~em~~~~~qa~hPvIRTlYDH~devn~l 178 (430)
T KOG0640|consen 103 YETKYLTSHKSPCRAAAFSPDGS----LVATGSADASIKILDVERMLAKSKPKEMISGDTQARHPVIRTLYDHVDEVNDL 178 (430)
T ss_pred cceEEEeecccceeeeeeCCCCc----EEEccCCcceEEEeehhhhhhhcchhhhccCCcccCCceEeehhhccCcccce
Confidence 44455678888888888555554 7999999999999654321111000 00111222 36789999999999999
Q ss_pred EEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe--ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEE
Q 000473 577 AAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH--HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALA 654 (1471)
Q Consensus 577 a~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~--~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lW 654 (1471)
.|||. ...|+||+.|++|+++|+......+.|+ ....+|.++.|+|. |.+++.|.+-.+++||
T Consensus 179 ~FHPr---------e~ILiS~srD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPs------GefllvgTdHp~~rlY 243 (430)
T KOG0640|consen 179 DFHPR---------ETILISGSRDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPS------GEFLLVGTDHPTLRLY 243 (430)
T ss_pred eecch---------hheEEeccCCCeEEEEecccHHHHHHHHHhhccceeeeEeecCC------CceEEEecCCCceeEE
Confidence 99997 8999999999999999997543322222 34568999999998 9999999999999999
Q ss_pred ECCCCcEEEEe---cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe-CCCCCceee-eeee
Q 000473 655 SLETLRVERMF---PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR-GTASHSMFD-HFCK 729 (1471)
Q Consensus 655 dl~t~~~l~~~---~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~-gH~~~v~~~-~~~~ 729 (1471)
|+++.+|.... .+|.+.|++|.+++.+++-++|+.| |.|++||--+++|++++. .|.+..++. .|.+
T Consensus 244 dv~T~QcfvsanPd~qht~ai~~V~Ys~t~~lYvTaSkD--------G~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftk 315 (430)
T KOG0640|consen 244 DVNTYQCFVSANPDDQHTGAITQVRYSSTGSLYVTASKD--------GAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTK 315 (430)
T ss_pred eccceeEeeecCcccccccceeEEEecCCccEEEEeccC--------CcEEeeccccHHHHHHHHhhcCCceeeeEEEcc
Confidence 99999987654 3688999999999999999999999 999999999999998874 677655553 3441
Q ss_pred ccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 730 GISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 730 ~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
+|.+. ++...|.+++.|.+.
T Consensus 316 ------------n~kyi-----LsSG~DS~vkLWEi~ 335 (430)
T KOG0640|consen 316 ------------NGKYI-----LSSGKDSTVKLWEIS 335 (430)
T ss_pred ------------CCeEE-----eecCCcceeeeeeec
Confidence 22332 333348889999873
No 96
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.66 E-value=5.8e-15 Score=159.55 Aligned_cols=205 Identities=17% Similarity=0.193 Sum_probs=169.3
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
+++|+.+++.+.|..+.. .|.+++.|.+..||.- ..|+.+-++.||++.|+|++..-+
T Consensus 6 l~GHERplTqiKyN~eGD----LlFscaKD~~~~vw~s---------------~nGerlGty~GHtGavW~~Did~~--- 63 (327)
T KOG0643|consen 6 LQGHERPLTQIKYNREGD----LLFSCAKDSTPTVWYS---------------LNGERLGTYDGHTGAVWCCDIDWD--- 63 (327)
T ss_pred cccCccccceEEecCCCc----EEEEecCCCCceEEEe---------------cCCceeeeecCCCceEEEEEecCC---
Confidence 578999999999776665 8999999999999432 246788999999999999999876
Q ss_pred cccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-----CCcEEEEECC--
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-----DFSVALASLE-- 657 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-----DgsV~lWdl~-- 657 (1471)
..++++|+.|.++++||+.+|+.+..++. ..+|..+.|+++ |++++...+ -+.|.++|++
T Consensus 64 ------s~~liTGSAD~t~kLWDv~tGk~la~~k~-~~~Vk~~~F~~~------gn~~l~~tD~~mg~~~~v~~fdi~~~ 130 (327)
T KOG0643|consen 64 ------SKHLITGSADQTAKLWDVETGKQLATWKT-NSPVKRVDFSFG------GNLILASTDKQMGYTCFVSVFDIRDD 130 (327)
T ss_pred ------cceeeeccccceeEEEEcCCCcEEEEeec-CCeeEEEeeccC------CcEEEEEehhhcCcceEEEEEEccCC
Confidence 79999999999999999999999998864 458999999998 776665543 3678999998
Q ss_pred -----CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCCceeeeeeecc
Q 000473 658 -----TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASHSMFDHFCKGI 731 (1471)
Q Consensus 658 -----t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~v~~~~~~~~~ 731 (1471)
..++...++.+.+.++.+.|.|-+++|++|..| |.|.+||+++| +.+....-|.+.+.-+++.+..
T Consensus 131 ~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~ii~Ghe~--------G~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~ 202 (327)
T KOG0643|consen 131 SSDIDSEEPYLKIPTPDSKITSALWGPLGETIIAGHED--------GSISIYDARTGKELVDSDEEHSSKINDLQFSRDR 202 (327)
T ss_pred hhhhcccCceEEecCCccceeeeeecccCCEEEEecCC--------CcEEEEEcccCceeeechhhhccccccccccCCc
Confidence 456788889999999999999999999999999 99999999998 4567778888877777666432
Q ss_pred ccccccceEEcCCccccccceeeccCCceEeecccccc
Q 000473 732 SMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 732 ~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
. .+++.+.|.+.+.|+...++
T Consensus 203 T-----------------~FiT~s~Dttakl~D~~tl~ 223 (327)
T KOG0643|consen 203 T-----------------YFITGSKDTTAKLVDVRTLE 223 (327)
T ss_pred c-----------------eEEecccCccceeeecccee
Confidence 2 23455668888888865443
No 97
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.65 E-value=4.1e-12 Score=147.35 Aligned_cols=137 Identities=15% Similarity=0.176 Sum_probs=108.1
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEE
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFL 643 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~ 643 (1471)
....||.+..+.++.||+ .+.+++++.|+.+++|+ ..+++.+.. -..++.++.|+|. | .++
T Consensus 362 ~~v~gh~delwgla~hps---------~~q~~T~gqdk~v~lW~--~~k~~wt~~-~~d~~~~~~fhps------g-~va 422 (626)
T KOG2106|consen 362 LTVQGHGDELWGLATHPS---------KNQLLTCGQDKHVRLWN--DHKLEWTKI-IEDPAECADFHPS------G-VVA 422 (626)
T ss_pred EEEEecccceeeEEcCCC---------hhheeeccCcceEEEcc--CCceeEEEE-ecCceeEeeccCc------c-eEE
Confidence 455799999999999997 78999999999999999 445544332 3468899999998 7 999
Q ss_pred EEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCC-CC
Q 000473 644 SVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTA-SH 721 (1471)
Q Consensus 644 S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~-~~ 721 (1471)
.|...|...+.|.++...+..-.. ..++++|+|+|+|.+|++|+.| +.||||-+.. |.....+.-|+ +.
T Consensus 423 ~Gt~~G~w~V~d~e~~~lv~~~~d-~~~ls~v~ysp~G~~lAvgs~d--------~~iyiy~Vs~~g~~y~r~~k~~gs~ 493 (626)
T KOG2106|consen 423 VGTATGRWFVLDTETQDLVTIHTD-NEQLSVVRYSPDGAFLAVGSHD--------NHIYIYRVSANGRKYSRVGKCSGSP 493 (626)
T ss_pred EeeccceEEEEecccceeEEEEec-CCceEEEEEcCCCCEEEEecCC--------CeEEEEEECCCCcEEEEeeeecCce
Confidence 999999999999998655544443 7889999999999999999999 9999999864 55544443333 44
Q ss_pred ceeeeee
Q 000473 722 SMFDHFC 728 (1471)
Q Consensus 722 v~~~~~~ 728 (1471)
++-.+|.
T Consensus 494 ithLDwS 500 (626)
T KOG2106|consen 494 ITHLDWS 500 (626)
T ss_pred eEEeeec
Confidence 4445555
No 98
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.65 E-value=3.7e-15 Score=163.26 Aligned_cols=210 Identities=13% Similarity=0.211 Sum_probs=165.4
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEE----ecccccC-----
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQ----FDLFERH----- 548 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~----~~~l~~~----- 548 (1471)
+.+.+.+-+.++|.++. |.|...+.+|.+.|+++.+.+... .+++|+.|++-.||. |..-...
T Consensus 163 ~gtASADhTA~iWs~Es----g~CL~~Y~GH~GSVNsikfh~s~~----L~lTaSGD~taHIW~~av~~~vP~~~a~~~h 234 (481)
T KOG0300|consen 163 CGTASADHTARIWSLES----GACLATYTGHTGSVNSIKFHNSGL----LLLTASGDETAHIWKAAVNWEVPSNNAPSDH 234 (481)
T ss_pred eeecccccceeEEeecc----ccceeeecccccceeeEEeccccc----eEEEccCCcchHHHHHhhcCcCCCCCCCCCC
Confidence 44566778899999987 467788899999999999544443 789999999999975 3221100
Q ss_pred ---------CCCCCccc--c-CC----cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc
Q 000473 549 ---------NSPGASLK--V-NS----HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN 612 (1471)
Q Consensus 549 ---------d~~~~~~d--~-~s----~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~ 612 (1471)
|.+.+.-+ . .. ..++..|.||.+.|.+..|-.. ++.++++|+|.+..+||+++|+
T Consensus 235 SsEeE~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltgH~~vV~a~dWL~g---------g~Q~vTaSWDRTAnlwDVEtge 305 (481)
T KOG0300|consen 235 SSEEEEEHSDEHNRDTDSSEKSDGHTIRVPLMRLTGHRAVVSACDWLAG---------GQQMVTASWDRTANLWDVETGE 305 (481)
T ss_pred CchhhhhcccccccccccccccCCceeeeeeeeeeccccceEehhhhcC---------cceeeeeeccccceeeeeccCc
Confidence 11111001 0 01 1356678999999999999764 8999999999999999999999
Q ss_pred eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCC
Q 000473 613 LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDH 691 (1471)
Q Consensus 613 ~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~ 691 (1471)
.++.+.+|....+.+.-+|. .+++++.+.|.+.++||.+.. ..+..|.||...|+++.|.-+. .+++|+.|
T Consensus 306 ~v~~LtGHd~ELtHcstHpt------QrLVvTsSrDtTFRLWDFReaI~sV~VFQGHtdtVTS~vF~~dd-~vVSgSDD- 377 (481)
T KOG0300|consen 306 VVNILTGHDSELTHCSTHPT------QRLVVTSSRDTTFRLWDFREAIQSVAVFQGHTDTVTSVVFNTDD-RVVSGSDD- 377 (481)
T ss_pred eeccccCcchhccccccCCc------ceEEEEeccCceeEeccchhhcceeeeecccccceeEEEEecCC-ceeecCCC-
Confidence 99999999999998888887 899999999999999999853 4578899999999999998765 46788887
Q ss_pred CCCCCCCCEEEEEECCCCe-EEEEEeCCC
Q 000473 692 SRTSDAVDVLFIWDVKTGA-RERVLRGTA 719 (1471)
Q Consensus 692 sg~~D~~gtV~VWDi~tg~-~~~~l~gH~ 719 (1471)
.+|+|||+++.+ .+.++.-.+
T Consensus 378 -------rTvKvWdLrNMRsplATIRtdS 399 (481)
T KOG0300|consen 378 -------RTVKVWDLRNMRSPLATIRTDS 399 (481)
T ss_pred -------ceEEEeeeccccCcceeeecCC
Confidence 999999998754 566666443
No 99
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.64 E-value=1.4e-15 Score=162.54 Aligned_cols=196 Identities=18% Similarity=0.228 Sum_probs=157.3
Q ss_pred CCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEE-ecCCCCcccCcCCCEEEEEECCCcE
Q 000473 525 PYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAA-HRMVGTAKGWSFNEVLVSGSMDCSI 603 (1471)
Q Consensus 525 P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~-spd~~~~~~~~~~~~L~SGs~DgtI 603 (1471)
-.+|++++.|++|+|+.-. . +. ..+.+.+|.||.++|+.++| ||. +|.+|+|++.||.|
T Consensus 23 gkrlATcsSD~tVkIf~v~--~---------n~-~s~ll~~L~Gh~GPVwqv~wahPk--------~G~iLAScsYDgkV 82 (299)
T KOG1332|consen 23 GKRLATCSSDGTVKIFEVR--N---------NG-QSKLLAELTGHSGPVWKVAWAHPK--------FGTILASCSYDGKV 82 (299)
T ss_pred cceeeeecCCccEEEEEEc--C---------CC-CceeeeEecCCCCCeeEEeecccc--------cCcEeeEeecCceE
Confidence 3489999999999994332 0 11 12678899999999999999 665 59999999999999
Q ss_pred EEEECCCCc--eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC---cEEEEecCCCCCcEEEEEc
Q 000473 604 RIWDLGSGN--LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL---RVERMFPGHPNYPAKVVWD 678 (1471)
Q Consensus 604 ~lWDl~tg~--~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~---~~l~~~~gh~~~V~~v~~s 678 (1471)
.+|.-..|+ ..+.+..|...|++++|.|.+ -|-.|++++.||.|.|.+.++. ...+....|...|++|.|.
T Consensus 83 IiWke~~g~w~k~~e~~~h~~SVNsV~waphe----ygl~LacasSDG~vsvl~~~~~g~w~t~ki~~aH~~GvnsVswa 158 (299)
T KOG1332|consen 83 IIWKEENGRWTKAYEHAAHSASVNSVAWAPHE----YGLLLACASSDGKVSVLTYDSSGGWTTSKIVFAHEIGVNSVSWA 158 (299)
T ss_pred EEEecCCCchhhhhhhhhhcccceeecccccc----cceEEEEeeCCCcEEEEEEcCCCCccchhhhhccccccceeeec
Confidence 999988775 346677899999999999973 2668999999999999998754 2234556799999999999
Q ss_pred CC---C-----------CEEEEEEcCCCCCCCCCCEEEEEECCCC--eEEEEEeCCCCCceeeeeeeccccccccceEEc
Q 000473 679 CP---R-----------GYIACLCRDHSRTSDAVDVLFIWDVKTG--ARERVLRGTASHSMFDHFCKGISMNSISGSVLN 742 (1471)
Q Consensus 679 pd---g-----------~~L~sgs~D~sg~~D~~gtV~VWDi~tg--~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~ 742 (1471)
|. | +.|++|+.| ..|+||+..++ .++++|.+|..-|--+.+||.+.
T Consensus 159 pa~~~g~~~~~~~~~~~krlvSgGcD--------n~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~g---------- 220 (299)
T KOG1332|consen 159 PASAPGSLVDQGPAAKVKRLVSGGCD--------NLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVG---------- 220 (299)
T ss_pred CcCCCccccccCcccccceeeccCCc--------cceeeeecCCcchhhhhhhhhcchhhhhhhhccccC----------
Confidence 87 4 569999998 99999999876 46788999999888888886432
Q ss_pred CCccccccceeeccCCceEeecc
Q 000473 743 GNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 743 g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.....+.++++|+++-+|..
T Consensus 221 ---l~~s~iAS~SqDg~viIwt~ 240 (299)
T KOG1332|consen 221 ---LPKSTIASCSQDGTVIIWTK 240 (299)
T ss_pred ---CCceeeEEecCCCcEEEEEe
Confidence 22344567778999999974
No 100
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.64 E-value=1.7e-15 Score=162.04 Aligned_cols=220 Identities=19% Similarity=0.251 Sum_probs=174.7
Q ss_pred cccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCC
Q 000473 480 LTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNS 559 (1471)
Q Consensus 480 ~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s 559 (1471)
..+.+=+.++|+... |....+|. |.-.|.++++..+.. .|++|+.+.-++|++.+. .
T Consensus 76 saaadftakvw~a~t----gdelhsf~-hkhivk~~af~~ds~----~lltgg~ekllrvfdln~--------------p 132 (334)
T KOG0278|consen 76 SAAADFTAKVWDAVT----GDELHSFE-HKHIVKAVAFSQDSN----YLLTGGQEKLLRVFDLNR--------------P 132 (334)
T ss_pred hhcccchhhhhhhhh----hhhhhhhh-hhheeeeEEecccch----hhhccchHHHhhhhhccC--------------C
Confidence 334556789999876 34555543 667788888666665 799999999999954441 1
Q ss_pred cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCC
Q 000473 560 HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 560 ~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
..+...+.||++.|..+.|... .+.++|.+.|++||+||.++|..++++.- ..+|+++.++++ |
T Consensus 133 ~App~E~~ghtg~Ir~v~wc~e---------D~~iLSSadd~tVRLWD~rTgt~v~sL~~-~s~VtSlEvs~d------G 196 (334)
T KOG0278|consen 133 KAPPKEISGHTGGIRTVLWCHE---------DKCILSSADDKTVRLWDHRTGTEVQSLEF-NSPVTSLEVSQD------G 196 (334)
T ss_pred CCCchhhcCCCCcceeEEEecc---------CceEEeeccCCceEEEEeccCcEEEEEec-CCCCcceeeccC------C
Confidence 1334567899999999999775 67888889999999999999999988854 457999999998 6
Q ss_pred CEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE-eCC
Q 000473 640 DCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL-RGT 718 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l-~gH 718 (1471)
++ ++.+.-++|++||..+...+..+.. +..|.+...+|+..++++|+.| ..++.||..||+.+..+ .||
T Consensus 197 ~i-lTia~gssV~Fwdaksf~~lKs~k~-P~nV~SASL~P~k~~fVaGged--------~~~~kfDy~TgeEi~~~nkgh 266 (334)
T KOG0278|consen 197 RI-LTIAYGSSVKFWDAKSFGLLKSYKM-PCNVESASLHPKKEFFVAGGED--------FKVYKFDYNTGEEIGSYNKGH 266 (334)
T ss_pred CE-EEEecCceeEEeccccccceeeccC-ccccccccccCCCceEEecCcc--------eEEEEEeccCCceeeecccCC
Confidence 64 5666788999999999998888764 3458899999999999999999 99999999999998886 999
Q ss_pred CCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 719 ASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 719 ~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
-+.|-++.|.|. |..-.++ ++||++|+|+.
T Consensus 267 ~gpVhcVrFSPd------------GE~yAsG-----SEDGTirlWQt 296 (334)
T KOG0278|consen 267 FGPVHCVRFSPD------------GELYASG-----SEDGTIRLWQT 296 (334)
T ss_pred CCceEEEEECCC------------Cceeecc-----CCCceEEEEEe
Confidence 999999999952 2222222 34999999974
No 101
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.64 E-value=4.1e-15 Score=169.44 Aligned_cols=189 Identities=17% Similarity=0.145 Sum_probs=157.6
Q ss_pred CccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEE
Q 000473 485 DTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQ 564 (1471)
Q Consensus 485 ~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~ 564 (1471)
..+-++|... .+....+.||...|+++.+.+... .+++++.|-.|+||... ......
T Consensus 241 ~~av~~d~~s----~q~l~~~~Gh~kki~~v~~~~~~~----~v~~aSad~~i~vws~~---------------~~s~~~ 297 (506)
T KOG0289|consen 241 KTAVLFDKPS----NQILATLKGHTKKITSVKFHKDLD----TVITASADEIIRVWSVP---------------LSSEPT 297 (506)
T ss_pred CceEEEecch----hhhhhhccCcceEEEEEEeccchh----heeecCCcceEEeeccc---------------cccCcc
Confidence 3444444333 467788999999999998444443 78999999999994322 122345
Q ss_pred EEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc--CCCEEEEEECCCCCCCCCCCEE
Q 000473 565 YFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH--VAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 565 ~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H--~~~V~~l~fspd~~~~~~~~~l 642 (1471)
....|.++|+.+..||. +.+|++++.|++..+-|+.+|..+...... .-.+++.+|+|| |..|
T Consensus 298 ~~~~h~~~V~~ls~h~t---------geYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpD------gLif 362 (506)
T KOG0289|consen 298 SSRPHEEPVTGLSLHPT---------GEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPD------GLIF 362 (506)
T ss_pred ccccccccceeeeeccC---------CcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCC------ceEE
Confidence 56789999999999997 899999999999999999999988776553 235899999999 9999
Q ss_pred EEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCC
Q 000473 643 LSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
.+|..|+.|++||+.++..+..|++|.++|..++|+.+|=||++++.| +.|++||++.-+..+++.-..
T Consensus 363 gtgt~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENGY~Lat~add--------~~V~lwDLRKl~n~kt~~l~~ 431 (506)
T KOG0289|consen 363 GTGTPDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENGYWLATAADD--------GSVKLWDLRKLKNFKTIQLDE 431 (506)
T ss_pred eccCCCceEEEEEcCCccccccCCCCCCceeEEEeccCceEEEEEecC--------CeEEEEEehhhcccceeeccc
Confidence 999999999999999999999999999999999999999999999999 899999998877666665444
No 102
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.63 E-value=4.1e-15 Score=169.47 Aligned_cols=200 Identities=19% Similarity=0.221 Sum_probs=168.7
Q ss_pred cEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcC
Q 000473 511 IVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSF 590 (1471)
Q Consensus 511 ~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~ 590 (1471)
.++++..++... .+.+|+.|....+++ ..+++.+.+|.||+..|+.+.++++
T Consensus 221 gi~ald~~~s~~----~ilTGG~d~~av~~d---------------~~s~q~l~~~~Gh~kki~~v~~~~~--------- 272 (506)
T KOG0289|consen 221 GITALDIIPSSS----KILTGGEDKTAVLFD---------------KPSNQILATLKGHTKKITSVKFHKD--------- 272 (506)
T ss_pred CeeEEeecCCCC----cceecCCCCceEEEe---------------cchhhhhhhccCcceEEEEEEeccc---------
Confidence 466655555423 689999998888833 4456778899999999999999997
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC-
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP- 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~- 669 (1471)
...+++++.|..|++|.............|.++|+.+..+|. |.+|++++.|++..+.|+++++++.......
T Consensus 273 ~~~v~~aSad~~i~vws~~~~s~~~~~~~h~~~V~~ls~h~t------geYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s 346 (506)
T KOG0289|consen 273 LDTVITASADEIIRVWSVPLSSEPTSSRPHEEPVTGLSLHPT------GEYLLSASNDGTWAFSDISSGSQLTVVSDETS 346 (506)
T ss_pred hhheeecCCcceEEeeccccccCccccccccccceeeeeccC------CcEEEEecCCceEEEEEccCCcEEEEEeeccc
Confidence 788999999999999999888888888899999999999998 9999999999999999999999987765322
Q ss_pred -CCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCcccc
Q 000473 670 -NYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVS 748 (1471)
Q Consensus 670 -~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s 748 (1471)
-.+++.+|+|||..+.+|..| |.|+|||+.++.....+.||++.|..+.|.. +|-+
T Consensus 347 ~v~~ts~~fHpDgLifgtgt~d--------~~vkiwdlks~~~~a~Fpght~~vk~i~FsE------------NGY~--- 403 (506)
T KOG0289|consen 347 DVEYTSAAFHPDGLIFGTGTPD--------GVVKIWDLKSQTNVAKFPGHTGPVKAISFSE------------NGYW--- 403 (506)
T ss_pred cceeEEeeEcCCceEEeccCCC--------ceEEEEEcCCccccccCCCCCCceeEEEecc------------CceE---
Confidence 357999999999999999998 9999999999999999999999999998882 2223
Q ss_pred ccceeeccCCceEeecccccc
Q 000473 749 SLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 749 ~~l~~~~~D~tir~w~l~~~~ 769 (1471)
+...+.|+.++.|+|++.+
T Consensus 404 --Lat~add~~V~lwDLRKl~ 422 (506)
T KOG0289|consen 404 --LATAADDGSVKLWDLRKLK 422 (506)
T ss_pred --EEEEecCCeEEEEEehhhc
Confidence 2344457779999997655
No 103
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.62 E-value=2.4e-15 Score=168.95 Aligned_cols=168 Identities=24% Similarity=0.296 Sum_probs=132.9
Q ss_pred CccccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEE
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCL 576 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~l 576 (1471)
....++.+|...=.++. |+|. .+++|-.-+.|++ |..-. ..|.++ .+.|.+|+..|-.|
T Consensus 202 ~Pl~t~~ghk~EGy~Ld------WSp~~~g~LlsGDc~~~I~l--w~~~~------g~W~vd----~~Pf~gH~~SVEDL 263 (440)
T KOG0302|consen 202 RPLFTFNGHKGEGYGLD------WSPIKTGRLLSGDCVKGIHL--WEPST------GSWKVD----QRPFTGHTKSVEDL 263 (440)
T ss_pred CceEEecccCccceeee------cccccccccccCccccceEe--eeecc------Cceeec----Cccccccccchhhh
Confidence 45566778877766666 5663 5777766677777 44211 135543 24467899999999
Q ss_pred EEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC---ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEE
Q 000473 577 AAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG---NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVAL 653 (1471)
Q Consensus 577 a~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg---~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~l 653 (1471)
+|+|.+ ...|+|||.|++|+|||++.+ .++.+ +.|.+.|..|.|+.+ -.+|++|+.||+++|
T Consensus 264 qWSptE--------~~vfaScS~DgsIrIWDiRs~~~~~~~~~-kAh~sDVNVISWnr~------~~lLasG~DdGt~~i 328 (440)
T KOG0302|consen 264 QWSPTE--------DGVFASCSCDGSIRIWDIRSGPKKAAVST-KAHNSDVNVISWNRR------EPLLASGGDDGTLSI 328 (440)
T ss_pred ccCCcc--------CceEEeeecCceEEEEEecCCCccceeEe-eccCCceeeEEccCC------cceeeecCCCceEEE
Confidence 999973 789999999999999999988 34444 889999999999987 459999999999999
Q ss_pred EECCC---CcEEEEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 654 ASLET---LRVERMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 654 Wdl~t---~~~l~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
||++. ++++..|..|..+|++|.|+|... .|++++.| ..|.+||+..
T Consensus 329 wDLR~~~~~~pVA~fk~Hk~pItsieW~p~e~s~iaasg~D--------~QitiWDlsv 379 (440)
T KOG0302|consen 329 WDLRQFKSGQPVATFKYHKAPITSIEWHPHEDSVIAASGED--------NQITIWDLSV 379 (440)
T ss_pred EEhhhccCCCcceeEEeccCCeeEEEeccccCceEEeccCC--------CcEEEEEeec
Confidence 99975 678899999999999999999754 66677777 8999999853
No 104
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.62 E-value=5.4e-15 Score=185.23 Aligned_cols=190 Identities=23% Similarity=0.286 Sum_probs=158.8
Q ss_pred CccEEEEEeeccccccCCEEEEEE--cCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGF--FSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs--~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~ 586 (1471)
...|.++.+.++.. ++++|+ .||.++||+-+.+.. ....++..-.+.+.+...|.+.|+|+.|+||
T Consensus 13 ~~~IfSIdv~pdg~----~~aTgGq~~d~~~~iW~~~~vl~---~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~d----- 80 (942)
T KOG0973|consen 13 EKSIFSIDVHPDGV----KFATGGQVLDGGIVIWSQDPVLD---EKEEKNENLPKHLCTMDDHDGSVNCVRFSPD----- 80 (942)
T ss_pred CeeEEEEEecCCce----eEecCCccccccceeeccccccc---hhhhhhcccchhheeeccccCceeEEEECCC-----
Confidence 44577777555555 899999 899999966543221 1112333224556778899999999999998
Q ss_pred cCcCCCEEEEEECCCcEEEEECCC------------------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 587 GWSFNEVLVSGSMDCSIRIWDLGS------------------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 587 ~~~~~~~L~SGs~DgtI~lWDl~t------------------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
+++|++||+|+.|.+|+... ++....+.+|.+.|..+.|+|+ +.+++|++.|
T Consensus 81 ----G~~lAsGSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~------~~~lvS~s~D 150 (942)
T KOG0973|consen 81 ----GSYLASGSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPD------DSLLVSVSLD 150 (942)
T ss_pred ----CCeEeeccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCC------ccEEEEeccc
Confidence 99999999999999999772 2367789999999999999999 9999999999
Q ss_pred CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 649 FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 649 gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
++|.+||.++.+++..+.+|.+.|..+.|+|-|+||++-+.| ++|+||++.+...++.++++-.++..-.+.
T Consensus 151 nsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~Gky~ASqsdD--------rtikvwrt~dw~i~k~It~pf~~~~~~T~f 222 (942)
T KOG0973|consen 151 NSVIIWNAKTFELLKVLRGHQSLVKGVSWDPIGKYFASQSDD--------RTLKVWRTSDWGIEKSITKPFEESPLTTFF 222 (942)
T ss_pred ceEEEEccccceeeeeeecccccccceEECCccCeeeeecCC--------ceEEEEEcccceeeEeeccchhhCCCccee
Confidence 999999999999999999999999999999999999999999 999999988888999999998777664443
No 105
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.62 E-value=1.1e-14 Score=167.96 Aligned_cols=214 Identities=16% Similarity=0.145 Sum_probs=177.8
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
......+.+.+|.+.. |.+...+.+|-..|||+.+..+.. .+++|+.||.|.+|....+...+.. .
T Consensus 97 ~ag~i~g~lYlWelss----G~LL~v~~aHYQ~ITcL~fs~dgs----~iiTgskDg~V~vW~l~~lv~a~~~------~ 162 (476)
T KOG0646|consen 97 LAGTISGNLYLWELSS----GILLNVLSAHYQSITCLKFSDDGS----HIITGSKDGAVLVWLLTDLVSADND------H 162 (476)
T ss_pred EeecccCcEEEEEecc----ccHHHHHHhhccceeEEEEeCCCc----EEEecCCCccEEEEEEEeecccccC------C
Confidence 3344667788898776 577888899999999999777776 7999999999999776555543221 1
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+.++.+.|.+|+-+|+.+...+. + ...+++|.|.|.++++||+..|.++.++. ...++.++++.|.
T Consensus 163 ~~~p~~~f~~HtlsITDl~ig~G-----g--~~~rl~TaS~D~t~k~wdlS~g~LLlti~-fp~si~av~lDpa------ 228 (476)
T KOG0646|consen 163 SVKPLHIFSDHTLSITDLQIGSG-----G--TNARLYTASEDRTIKLWDLSLGVLLLTIT-FPSSIKAVALDPA------ 228 (476)
T ss_pred CccceeeeccCcceeEEEEecCC-----C--ccceEEEecCCceEEEEEeccceeeEEEe-cCCcceeEEEccc------
Confidence 45788999999999999998764 1 25799999999999999999999988774 4568999999998
Q ss_pred CCEEEEEeCCCcEEEEECCCC----------------cEEEEecCCCC--CcEEEEEcCCCCEEEEEEcCCCCCCCCCCE
Q 000473 639 SDCFLSVGEDFSVALASLETL----------------RVERMFPGHPN--YPAKVVWDCPRGYIACLCRDHSRTSDAVDV 700 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~----------------~~l~~~~gh~~--~V~~v~~spdg~~L~sgs~D~sg~~D~~gt 700 (1471)
...+..|+.||.|.+.++... ..+..+.||.+ .|+|++.+-||..|++|+.| |.
T Consensus 229 e~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais~DgtlLlSGd~d--------g~ 300 (476)
T KOG0646|consen 229 ERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAISTDGTLLLSGDED--------GK 300 (476)
T ss_pred ccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEEecCccEEEeeCCC--------CC
Confidence 789999999999998887432 35667889988 99999999999999999999 99
Q ss_pred EEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 701 LFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 701 V~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
|.|||+.+.+++|++..-.+.|...++.
T Consensus 301 VcvWdi~S~Q~iRtl~~~kgpVtnL~i~ 328 (476)
T KOG0646|consen 301 VCVWDIYSKQCIRTLQTSKGPVTNLQIN 328 (476)
T ss_pred EEEEecchHHHHHHHhhhccccceeEee
Confidence 9999999999999998767777777654
No 106
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.62 E-value=7.2e-14 Score=155.25 Aligned_cols=207 Identities=18% Similarity=0.261 Sum_probs=153.5
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEc--CCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFF--SGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~--DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
+.+++.+.+++++|..+.+. ..++..+.-.+....+.. +++.++.++. |.+|+...
T Consensus 29 litss~dDsl~LYd~~~g~~----~~ti~skkyG~~~~~Fth----~~~~~i~sStk~d~tIryLs-------------- 86 (311)
T KOG1446|consen 29 LITSSEDDSLRLYDSLSGKQ----VKTINSKKYGVDLACFTH----HSNTVIHSSTKEDDTIRYLS-------------- 86 (311)
T ss_pred EEEecCCCeEEEEEcCCCce----eeEeecccccccEEEEec----CCceEEEccCCCCCceEEEE--------------
Confidence 45566777899999887544 233333323333333332 2336777776 77888732
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc---------------
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH--------------- 620 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H--------------- 620 (1471)
+.+.+.++.|.||...|+.|..+|. ++.++|+|.|++|++||++..++...+..-
T Consensus 87 -l~dNkylRYF~GH~~~V~sL~~sP~---------~d~FlS~S~D~tvrLWDlR~~~cqg~l~~~~~pi~AfDp~GLifA 156 (311)
T KOG1446|consen 87 -LHDNKYLRYFPGHKKRVNSLSVSPK---------DDTFLSSSLDKTVRLWDLRVKKCQGLLNLSGRPIAAFDPEGLIFA 156 (311)
T ss_pred -eecCceEEEcCCCCceEEEEEecCC---------CCeEEecccCCeEEeeEecCCCCceEEecCCCcceeECCCCcEEE
Confidence 3456899999999999999999997 799999999999999999965544332211
Q ss_pred ------------------------------CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 621 ------------------------------VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 621 ------------------------------~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
....+.+.|+|+ |++++-....+.+.+.|--+|..+..+.++..
T Consensus 157 ~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~d------GK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~ 230 (311)
T KOG1446|consen 157 LANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPD------GKSILLSTNASFIYLLDAFDGTVKSTFSGYPN 230 (311)
T ss_pred EecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCC------CCEEEEEeCCCcEEEEEccCCcEeeeEeeccC
Confidence 233456667776 77777777777777777777787788877765
Q ss_pred Cc---EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC-CCCCceeeeeeec
Q 000473 671 YP---AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG-TASHSMFDHFCKG 730 (1471)
Q Consensus 671 ~V---~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g-H~~~v~~~~~~~~ 730 (1471)
.- ....|+||++++++|+.| |+|++|++++|..+..+.| +...+.+++|.|.
T Consensus 231 ~~~~~~~a~ftPds~Fvl~gs~d--------g~i~vw~~~tg~~v~~~~~~~~~~~~~~~fnP~ 286 (311)
T KOG1446|consen 231 AGNLPLSATFTPDSKFVLSGSDD--------GTIHVWNLETGKKVAVLRGPNGGPVSCVRFNPR 286 (311)
T ss_pred CCCcceeEEECCCCcEEEEecCC--------CcEEEEEcCCCcEeeEecCCCCCCccccccCCc
Confidence 32 678999999999999988 9999999999999999999 6888888877753
No 107
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.61 E-value=4.4e-15 Score=162.63 Aligned_cols=177 Identities=16% Similarity=0.266 Sum_probs=147.8
Q ss_pred CccccccccCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEE
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLA 577 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la 577 (1471)
..+.++-.|...|+++. |+|. .|++|+.|++|+++++.. ..-.+..++| .-...|.++.
T Consensus 163 PvIRTlYDH~devn~l~------FHPre~ILiS~srD~tvKlFDfsK------------~saKrA~K~~-qd~~~vrsiS 223 (430)
T KOG0640|consen 163 PVIRTLYDHVDEVNDLD------FHPRETILISGSRDNTVKLFDFSK------------TSAKRAFKVF-QDTEPVRSIS 223 (430)
T ss_pred ceEeehhhccCccccee------ecchhheEEeccCCCeEEEEeccc------------HHHHHHHHHh-hccceeeeEe
Confidence 35567788999999988 6665 899999999999966541 0001122222 2356799999
Q ss_pred EecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEE---EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEE
Q 000473 578 AHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITV---MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALA 654 (1471)
Q Consensus 578 ~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~---~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lW 654 (1471)
|||. |.+|+.|....+++++|+++-++... -.+|++.|+++.+++. +...++++.||.|+||
T Consensus 224 fHPs---------GefllvgTdHp~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t------~~lYvTaSkDG~Iklw 288 (430)
T KOG0640|consen 224 FHPS---------GEFLLVGTDHPTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSST------GSLYVTASKDGAIKLW 288 (430)
T ss_pred ecCC---------CceEEEecCCCceeEEeccceeEeeecCcccccccceeEEEecCC------ccEEEEeccCCcEEee
Confidence 9996 99999999999999999999887643 2469999999999998 9999999999999999
Q ss_pred ECCCCcEEEEec-CCCC-CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 655 SLETLRVERMFP-GHPN-YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 655 dl~t~~~l~~~~-gh~~-~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
|--+++|++++. .|.+ .|.+..|..+++|+++.+.| ..|++|++.||+++.++.|.
T Consensus 289 DGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG~D--------S~vkLWEi~t~R~l~~YtGA 346 (430)
T KOG0640|consen 289 DGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSGKD--------STVKLWEISTGRMLKEYTGA 346 (430)
T ss_pred ccccHHHHHHHHhhcCCceeeeEEEccCCeEEeecCCc--------ceeeeeeecCCceEEEEecC
Confidence 999999998885 4544 78999999999999999999 99999999999999999876
No 108
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.61 E-value=7.4e-14 Score=147.06 Aligned_cols=216 Identities=18% Similarity=0.174 Sum_probs=168.9
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc----cccC-------C------------------------
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL----FERH-------N------------------------ 549 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~----l~~~-------d------------------------ 549 (1471)
-+.|.+.|.|.++.++.. .+++|+.|.+|++.+++. +.+. |
T Consensus 85 ~khhkgsiyc~~ws~~ge----liatgsndk~ik~l~fn~dt~~~~g~dle~nmhdgtirdl~fld~~~s~~~il~s~ga 160 (350)
T KOG0641|consen 85 NKHHKGSIYCTAWSPCGE----LIATGSNDKTIKVLPFNADTCNATGHDLEFNMHDGTIRDLAFLDDPESGGAILASAGA 160 (350)
T ss_pred ccccCccEEEEEecCccC----eEEecCCCceEEEEecccccccccCcceeeeecCCceeeeEEecCCCcCceEEEecCC
Confidence 367889999998666665 899999999999976631 0000 0
Q ss_pred --CCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec--c-----
Q 000473 550 --SPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH--H----- 620 (1471)
Q Consensus 550 --~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~--H----- 620 (1471)
.....-|...++..+.+.||++-|.++- += ++-.++||+.|.+|++||++-..++.++.. |
T Consensus 161 gdc~iy~tdc~~g~~~~a~sghtghilaly-sw---------n~~m~~sgsqdktirfwdlrv~~~v~~l~~~~~~~gle 230 (350)
T KOG0641|consen 161 GDCKIYITDCGRGQGFHALSGHTGHILALY-SW---------NGAMFASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLE 230 (350)
T ss_pred CcceEEEeecCCCCcceeecCCcccEEEEE-Ee---------cCcEEEccCCCceEEEEeeeccceeeeccCcccCCCcc
Confidence 0001224556777888899999998874 21 278999999999999999998888876643 2
Q ss_pred CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCE
Q 000473 621 VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDV 700 (1471)
Q Consensus 621 ~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gt 700 (1471)
...|.++++.|. |+.+++|-.|.+..+||++.+++++.|..|...|.+|.|+|...||++++.| ..
T Consensus 231 ssavaav~vdps------grll~sg~~dssc~lydirg~r~iq~f~phsadir~vrfsp~a~yllt~syd--------~~ 296 (350)
T KOG0641|consen 231 SSAVAAVAVDPS------GRLLASGHADSSCMLYDIRGGRMIQRFHPHSADIRCVRFSPGAHYLLTCSYD--------MK 296 (350)
T ss_pred cceeEEEEECCC------cceeeeccCCCceEEEEeeCCceeeeeCCCccceeEEEeCCCceEEEEeccc--------ce
Confidence 257999999998 9999999999999999999999999999999999999999999999999998 89
Q ss_pred EEEEECCCC----eEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 701 LFIWDVKTG----ARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 701 V~VWDi~tg----~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
|++-|++.- -.+.+...|...++...+.+.. -++++.+.|.+.-.|-+
T Consensus 297 ikltdlqgdla~el~~~vv~ehkdk~i~~rwh~~d-----------------~sfisssadkt~tlwa~ 348 (350)
T KOG0641|consen 297 IKLTDLQGDLAHELPIMVVAEHKDKAIQCRWHPQD-----------------FSFISSSADKTATLWAL 348 (350)
T ss_pred EEEeecccchhhcCceEEEEeccCceEEEEecCcc-----------------ceeeeccCcceEEEecc
Confidence 999998631 2356667898888888777421 12345566889989865
No 109
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.61 E-value=9.1e-15 Score=167.68 Aligned_cols=205 Identities=21% Similarity=0.247 Sum_probs=150.0
Q ss_pred CEEEEEEcCCcEEEEEecccccCCCCCCcccc------CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC
Q 000473 526 YAIVYGFFSGEIEVIQFDLFERHNSPGASLKV------NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM 599 (1471)
Q Consensus 526 ~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~------~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~ 599 (1471)
++++.|+.|-.|.||+.+.....-....+-.. ..++.-..-.||++.|..|+|... ..+.|+|||.
T Consensus 193 NyvAiGtmdp~IeIWDLDI~d~v~P~~~LGs~~sk~~~k~~k~~~~~~gHTdavl~Ls~n~~--------~~nVLaSgsa 264 (463)
T KOG0270|consen 193 NYVAIGTMDPEIEIWDLDIVDAVLPCVTLGSKASKKKKKKGKRSNSASGHTDAVLALSWNRN--------FRNVLASGSA 264 (463)
T ss_pred ceEEEeccCceeEEeccccccccccceeechhhhhhhhhhcccccccccchHHHHHHHhccc--------cceeEEecCC
Confidence 47999999999999766532211000000000 011111123479999999998765 4789999999
Q ss_pred CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC
Q 000473 600 DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 600 DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp 679 (1471)
|.+|++||+.+|++..++..|++.|.++.|+|.. ...+++|+.|++|+|.|.+........-...+.|..++|.|
T Consensus 265 D~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~-----p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~ 339 (463)
T KOG0270|consen 265 DKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYE-----PSVLLSGSYDGTVALKDCRDPSNSGKEWKFDGEVEKVAWDP 339 (463)
T ss_pred CceEEEEEcCCCCcceehhhcCCceeEEEecCCC-----ceEEEeccccceEEeeeccCccccCceEEeccceEEEEecC
Confidence 9999999999999999999999999999999986 78999999999999999985332221122346799999999
Q ss_pred CCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCC
Q 000473 680 PRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDG 758 (1471)
Q Consensus 680 dg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~ 758 (1471)
.....+..+.| ||+||-+|++. |+++.++..|.+.+-.+.+.. ...+.+.+.+.|+
T Consensus 340 ~se~~f~~~td-------dG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~----------------~~p~~l~t~s~d~ 396 (463)
T KOG0270|consen 340 HSENSFFVSTD-------DGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNI----------------QTPGLLSTASTDK 396 (463)
T ss_pred CCceeEEEecC-------CceEEeeecCCCCCceeEEEeccCCcceEEecC----------------CCCcceeeccccc
Confidence 87655444444 29999999997 599999999998777764431 1233344556699
Q ss_pred ceEeeccc
Q 000473 759 TFRQSQIQ 766 (1471)
Q Consensus 759 tir~w~l~ 766 (1471)
.++.|++.
T Consensus 397 ~Vklw~~~ 404 (463)
T KOG0270|consen 397 VVKLWKFD 404 (463)
T ss_pred eEEEEeec
Confidence 99999874
No 110
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.60 E-value=8.1e-12 Score=149.86 Aligned_cols=109 Identities=15% Similarity=0.043 Sum_probs=79.2
Q ss_pred ccCCCCEEEEEeCCCeEEEEEcCCCeEEEe-eeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccccccc
Q 000473 98 SSLDNGALISACTDGVLCVWSRSSGHCRRR-RKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLV 176 (1471)
Q Consensus 98 ~s~d~~~LaSas~DG~I~VWdv~~G~ci~~-~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~ 176 (1471)
++.+...||-+-+||.|.+|++..+=+.+. +..| ......-....+.+||+.+|..
T Consensus 33 ~s~kS~~lAvsRt~g~IEiwN~~~~w~~~~vi~g~--~drsIE~L~W~e~~RLFS~g~s--------------------- 89 (691)
T KOG2048|consen 33 YSHKSNQLAVSRTDGNIEIWNLSNNWFLEPVIHGP--EDRSIESLAWAEGGRLFSSGLS--------------------- 89 (691)
T ss_pred EeccCCceeeeccCCcEEEEccCCCceeeEEEecC--CCCceeeEEEccCCeEEeecCC---------------------
Confidence 466666799999999999999998755543 3332 1222333345588999999887
Q ss_pred ccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCC
Q 000473 177 SEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKE 250 (1471)
Q Consensus 177 ~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~ 250 (1471)
+.|.-||+.+++.+..... ..+.|..+++.|. + ..+.++..||.+...++..+
T Consensus 90 -------------g~i~EwDl~~lk~~~~~d~---~gg~IWsiai~p~---~--~~l~IgcddGvl~~~s~~p~ 142 (691)
T KOG2048|consen 90 -------------GSITEWDLHTLKQKYNIDS---NGGAIWSIAINPE---N--TILAIGCDDGVLYDFSIGPD 142 (691)
T ss_pred -------------ceEEEEecccCceeEEecC---CCcceeEEEeCCc---c--ceEEeecCCceEEEEecCCc
Confidence 9999999999999988765 3455999998743 2 44777778997766666554
No 111
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.60 E-value=4.9e-14 Score=163.31 Aligned_cols=203 Identities=15% Similarity=0.083 Sum_probs=146.8
Q ss_pred eecccccCccccccccCCCCCCCccccc--cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDF--VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~--~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
+.+++.++++++|++.+.+...+....- .+.+..++++++.++.. .++.|+.||+|.+|+.. .|
T Consensus 284 FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~----~iAagc~DGSIQ~W~~~----------~~ 349 (641)
T KOG0772|consen 284 FLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGK----LIAAGCLDGSIQIWDKG----------SR 349 (641)
T ss_pred eEEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcc----hhhhcccCCceeeeecC----------Cc
Confidence 4556677788888888876543332222 23345566666333333 79999999999994432 12
Q ss_pred ccCCcceEEEEecCCc--cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-ceEEEEecc--CCCEEEEEEC
Q 000473 556 KVNSHVSRQYFLGHTG--AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-NLITVMHHH--VAPVRQIILS 630 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~--~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-~~l~~~~~H--~~~V~~l~fs 630 (1471)
-+. ..+..-..|.. .|+|+.|++| +++|+|-|.|.++++||++.. +++.++.+- .-+-+.++|+
T Consensus 350 ~v~--p~~~vk~AH~~g~~Itsi~FS~d---------g~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FS 418 (641)
T KOG0772|consen 350 TVR--PVMKVKDAHLPGQDITSISFSYD---------GNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFS 418 (641)
T ss_pred ccc--cceEeeeccCCCCceeEEEeccc---------cchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccC
Confidence 221 33444567887 8999999998 999999999999999999864 456555442 2345678999
Q ss_pred CCCCCCCCCCEEEEEeC------CCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 631 PPQTEHPWSDCFLSVGE------DFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 631 pd~~~~~~~~~l~S~s~------DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
|+ .+++++|.. -+++.++|-.+...++.+.-....|..+.|+|-=+.|++|+.| |+++||
T Consensus 419 Pd------~kli~TGtS~~~~~~~g~L~f~d~~t~d~v~ki~i~~aSvv~~~WhpkLNQi~~gsgd--------G~~~vy 484 (641)
T KOG0772|consen 419 PD------DKLILTGTSAPNGMTAGTLFFFDRMTLDTVYKIDISTASVVRCLWHPKLNQIFAGSGD--------GTAHVY 484 (641)
T ss_pred CC------ceEEEecccccCCCCCceEEEEeccceeeEEEecCCCceEEEEeecchhhheeeecCC--------CceEEE
Confidence 99 889998854 4789999999999998887778889999999999999999998 999987
Q ss_pred ECCC----CeEEEEEeCCC
Q 000473 705 DVKT----GARERVLRGTA 719 (1471)
Q Consensus 705 Di~t----g~~~~~l~gH~ 719 (1471)
==.+ |.++.+...|.
T Consensus 485 Ydp~~S~RGak~cv~k~~r 503 (641)
T KOG0772|consen 485 YDPNESIRGAKLCVVKPPR 503 (641)
T ss_pred ECccccccchhheeecCcc
Confidence 4332 44445555444
No 112
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.60 E-value=1.1e-14 Score=157.14 Aligned_cols=185 Identities=19% Similarity=0.215 Sum_probs=152.2
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
+.+|...|.+++...+.. ++++|+.|+++.||+.+.+ ........+||++.|-.++|+|.+
T Consensus 16 ~~~~~~~v~Sv~wn~~g~----~lasgs~dktv~v~n~e~~-------------r~~~~~~~~gh~~svdql~w~~~~-- 76 (313)
T KOG1407|consen 16 LQGHVQKVHSVAWNCDGT----KLASGSFDKTVSVWNLERD-------------RFRKELVYRGHTDSVDQLCWDPKH-- 76 (313)
T ss_pred hhhhhhcceEEEEcccCc----eeeecccCCceEEEEecch-------------hhhhhhcccCCCcchhhheeCCCC--
Confidence 567888899998777777 8999999999999655411 112223457999999999999873
Q ss_pred cccCcCCCEEEEEECCCcEEEEECCCCceEE-------------------------------------------------
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLGSGNLIT------------------------------------------------- 615 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~------------------------------------------------- 615 (1471)
...+++++.|.+|++||++++++..
T Consensus 77 ------~d~~atas~dk~ir~wd~r~~k~~~~i~~~~eni~i~wsp~g~~~~~~~kdD~it~id~r~~~~~~~~~~~~e~ 150 (313)
T KOG1407|consen 77 ------PDLFATASGDKTIRIWDIRSGKCTARIETKGENINITWSPDGEYIAVGNKDDRITFIDARTYKIVNEEQFKFEV 150 (313)
T ss_pred ------CcceEEecCCceEEEEEeccCcEEEEeeccCcceEEEEcCCCCEEEEecCcccEEEEEecccceeehhccccee
Confidence 7899999999999999998665332
Q ss_pred ---------------------------------EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 616 ---------------------------------VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 616 ---------------------------------~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
.++.|.....|+.|+|+ |++||+|+.|..|-|||++..-|+
T Consensus 151 ne~~w~~~nd~Fflt~GlG~v~ILsypsLkpv~si~AH~snCicI~f~p~------GryfA~GsADAlvSLWD~~ELiC~ 224 (313)
T KOG1407|consen 151 NEISWNNSNDLFFLTNGLGCVEILSYPSLKPVQSIKAHPSNCICIEFDPD------GRYFATGSADALVSLWDVDELICE 224 (313)
T ss_pred eeeeecCCCCEEEEecCCceEEEEeccccccccccccCCcceEEEEECCC------CceEeeccccceeeccChhHhhhh
Confidence 24456667778888888 999999999999999999999999
Q ss_pred EEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeee
Q 000473 663 RMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
+.+..+.-+|..+.|+.||++|++|++| ..|-|=+++||..+..+. +.+....+.+.|
T Consensus 225 R~isRldwpVRTlSFS~dg~~lASaSED--------h~IDIA~vetGd~~~eI~-~~~~t~tVAWHP 282 (313)
T KOG1407|consen 225 RCISRLDWPVRTLSFSHDGRMLASASED--------HFIDIAEVETGDRVWEIP-CEGPTFTVAWHP 282 (313)
T ss_pred eeeccccCceEEEEeccCcceeeccCcc--------ceEEeEecccCCeEEEee-ccCCceeEEecC
Confidence 9999999999999999999999999999 899999999999888775 334444555554
No 113
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.59 E-value=2.1e-14 Score=161.49 Aligned_cols=201 Identities=17% Similarity=0.217 Sum_probs=158.1
Q ss_pred EEEEEEcCCcEEEEEecccccC-CCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERH-NSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~-d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
..++-++.|.+.||........ ..++..-.....+++.++.+|.+.=+.|+|+|-. ...|+||..-+.|++
T Consensus 167 ~~aswse~G~V~Vw~l~~~l~~l~~~~~~~~~s~~~Pl~t~~ghk~EGy~LdWSp~~--------~g~LlsGDc~~~I~l 238 (440)
T KOG0302|consen 167 LCASWSENGRVQVWDLAPHLNALSEPGLEVKDSEFRPLFTFNGHKGEGYGLDWSPIK--------TGRLLSGDCVKGIHL 238 (440)
T ss_pred eeeeecccCcEEEEEchhhhhhhcCccccccccccCceEEecccCccceeeeccccc--------ccccccCccccceEe
Confidence 5677788999999654321111 1111111113347889999999999999999951 456899999999999
Q ss_pred EECCCCceE---EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC---cEEEEecCCCCCcEEEEEcC
Q 000473 606 WDLGSGNLI---TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL---RVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 606 WDl~tg~~l---~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~---~~l~~~~gh~~~V~~v~~sp 679 (1471)
|...+|... ..|.+|+..|..|.|+|.+ ...|+|||.||+|+|||++.+ .++.. ..|.+.|+.|.|+.
T Consensus 239 w~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE-----~~vfaScS~DgsIrIWDiRs~~~~~~~~~-kAh~sDVNVISWnr 312 (440)
T KOG0302|consen 239 WEPSTGSWKVDQRPFTGHTKSVEDLQWSPTE-----DGVFASCSCDGSIRIWDIRSGPKKAAVST-KAHNSDVNVISWNR 312 (440)
T ss_pred eeeccCceeecCccccccccchhhhccCCcc-----CceEEeeecCceEEEEEecCCCccceeEe-eccCCceeeEEccC
Confidence 999987754 4677899999999999986 779999999999999999988 45444 78999999999999
Q ss_pred CCCEEEEEEcCCCCCCCCCCEEEEEECCC---CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeecc
Q 000473 680 PRGYIACLCRDHSRTSDAVDVLFIWDVKT---GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHE 756 (1471)
Q Consensus 680 dg~~L~sgs~D~sg~~D~~gtV~VWDi~t---g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~ 756 (1471)
...+|++|+.| |+++|||+++ ++++..+.-|.++++++.|.|... +. +....+
T Consensus 313 ~~~lLasG~Dd--------Gt~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~p~e~-----s~-----------iaasg~ 368 (440)
T KOG0302|consen 313 REPLLASGGDD--------GTLSIWDLRQFKSGQPVATFKYHKAPITSIEWHPHED-----SV-----------IAASGE 368 (440)
T ss_pred CcceeeecCCC--------ceEEEEEhhhccCCCcceeEEeccCCeeEEEeccccC-----ce-----------EEeccC
Confidence 99999999988 9999999986 678899999999999999985311 11 223345
Q ss_pred CCceEeecc
Q 000473 757 DGTFRQSQI 765 (1471)
Q Consensus 757 D~tir~w~l 765 (1471)
|.++-+|+|
T Consensus 369 D~QitiWDl 377 (440)
T KOG0302|consen 369 DNQITIWDL 377 (440)
T ss_pred CCcEEEEEe
Confidence 999999998
No 114
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.59 E-value=4.1e-14 Score=163.33 Aligned_cols=223 Identities=15% Similarity=0.160 Sum_probs=173.0
Q ss_pred cccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcc
Q 000473 482 FCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV 561 (1471)
Q Consensus 482 ~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~ 561 (1471)
.....+.+|.+..... ..+. ..-++.|.|+.-.++.. .++.|...|.|.+ |.+.+|.
T Consensus 58 ~~rp~l~vw~i~k~~~---~~q~-~v~Pg~v~al~s~n~G~----~l~ag~i~g~lYl---------------WelssG~ 114 (476)
T KOG0646|consen 58 LKRPLLHVWEILKKDQ---VVQY-IVLPGPVHALASSNLGY----FLLAGTISGNLYL---------------WELSSGI 114 (476)
T ss_pred ccCccccccccCchhh---hhhh-cccccceeeeecCCCce----EEEeecccCcEEE---------------EEecccc
Confidence 3344678888765432 1111 12356788877555554 5777779999999 3456788
Q ss_pred eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC---------CCceEEEEeccCCCEEEEEECCC
Q 000473 562 SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG---------SGNLITVMHHHVAPVRQIILSPP 632 (1471)
Q Consensus 562 ~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~---------tg~~l~~~~~H~~~V~~l~fspd 632 (1471)
.+..+.+|-..|+|+.|+-| +.+|+|||.||.|.+|++. +-++++.|..|+-+|+.+...+.
T Consensus 115 LL~v~~aHYQ~ITcL~fs~d---------gs~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~G 185 (476)
T KOG0646|consen 115 LLNVLSAHYQSITCLKFSDD---------GSHIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSG 185 (476)
T ss_pred HHHHHHhhccceeEEEEeCC---------CcEEEecCCCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCC
Confidence 88889999999999999987 8999999999999999874 34678999999999999998775
Q ss_pred CCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC----
Q 000473 633 QTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT---- 708 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t---- 708 (1471)
. ....++|+|.|+++++||+..+..+..+. ....+.+++.+|.++.+.+|..+ |.|.+-++.+
T Consensus 186 g----~~~rl~TaS~D~t~k~wdlS~g~LLlti~-fp~si~av~lDpae~~~yiGt~~--------G~I~~~~~~~~~~~ 252 (476)
T KOG0646|consen 186 G----TNARLYTASEDRTIKLWDLSLGVLLLTIT-FPSSIKAVALDPAERVVYIGTEE--------GKIFQNLLFKLSGQ 252 (476)
T ss_pred C----ccceEEEecCCceEEEEEeccceeeEEEe-cCCcceeEEEcccccEEEecCCc--------ceEEeeehhcCCcc
Confidence 2 14589999999999999999999888775 45679999999999999999999 9999988753
Q ss_pred ------------CeEEEEEeCCCC--CceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 709 ------------GARERVLRGTAS--HSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 709 ------------g~~~~~l~gH~~--~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
+..++.+.||.. .+++...| .+|+...++ .+||++++|++.
T Consensus 253 ~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais------------~DgtlLlSG-----d~dg~VcvWdi~ 307 (476)
T KOG0646|consen 253 SAGVNQKGRHEENTQINVLVGHENESAITCLAIS------------TDGTLLLSG-----DEDGKVCVWDIY 307 (476)
T ss_pred cccccccccccccceeeeeccccCCcceeEEEEe------------cCccEEEee-----CCCCCEEEEecc
Confidence 235678889988 77777655 233333333 349999999974
No 115
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.59 E-value=1.2e-13 Score=145.54 Aligned_cols=217 Identities=15% Similarity=0.186 Sum_probs=156.0
Q ss_pred cCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCc-ceEEEEecCCccEEEEEEecCCCC
Q 000473 508 KEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSH-VSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 508 h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~-~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
....|.+++ |+|. ..+.|+...+.+|..+.-+... +++..-..... ...+.-+.|.+.|.|.+|+|+
T Consensus 31 dsqairav~------fhp~g~lyavgsnskt~ric~yp~l~~~-r~~hea~~~pp~v~~kr~khhkgsiyc~~ws~~--- 100 (350)
T KOG0641|consen 31 DSQAIRAVA------FHPAGGLYAVGSNSKTFRICAYPALIDL-RHAHEAAKQPPSVLCKRNKHHKGSIYCTAWSPC--- 100 (350)
T ss_pred chhheeeEE------ecCCCceEEeccCCceEEEEccccccCc-ccccccccCCCeEEeeeccccCccEEEEEecCc---
Confidence 456788877 6665 7889999999999877543321 01110011111 112233568999999999998
Q ss_pred cccCcCCCEEEEEECCCcEEEEECCCCce-----EEEEeccCCCEEEEEECCCCCC------------------------
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLGSGNL-----ITVMHHHVAPVRQIILSPPQTE------------------------ 635 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~tg~~-----l~~~~~H~~~V~~l~fspd~~~------------------------ 635 (1471)
+++|++|+.|++|++.-++...+ -..|.-|.+.|..++|-.+...
T Consensus 101 ------geliatgsndk~ik~l~fn~dt~~~~g~dle~nmhdgtirdl~fld~~~s~~~il~s~gagdc~iy~tdc~~g~ 174 (350)
T KOG0641|consen 101 ------GELIATGSNDKTIKVLPFNADTCNATGHDLEFNMHDGTIRDLAFLDDPESGGAILASAGAGDCKIYITDCGRGQ 174 (350)
T ss_pred ------cCeEEecCCCceEEEEecccccccccCcceeeeecCCceeeeEEecCCCcCceEEEecCCCcceEEEeecCCCC
Confidence 89999999999999976653221 1345567777777776443110
Q ss_pred ---------------CC-CCCEEEEEeCCCcEEEEECCCCcEEEEecC--C-----CCCcEEEEEcCCCCEEEEEEcCCC
Q 000473 636 ---------------HP-WSDCFLSVGEDFSVALASLETLRVERMFPG--H-----PNYPAKVVWDCPRGYIACLCRDHS 692 (1471)
Q Consensus 636 ---------------~~-~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g--h-----~~~V~~v~~spdg~~L~sgs~D~s 692 (1471)
.. .+-.|++|+.|.+|++||++-..+++++.. | .+.|.+|+..|.|++|++|-.|
T Consensus 175 ~~~a~sghtghilalyswn~~m~~sgsqdktirfwdlrv~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~d-- 252 (350)
T KOG0641|consen 175 GFHALSGHTGHILALYSWNGAMFASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHAD-- 252 (350)
T ss_pred cceeecCCcccEEEEEEecCcEEEccCCCceEEEEeeeccceeeeccCcccCCCcccceeEEEEECCCcceeeeccCC--
Confidence 01 145799999999999999999999887642 2 2478999999999999999998
Q ss_pred CCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 693 RTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 693 g~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.+..+||++.|+.++.+..|++.+-++.|.|.. ..+++++-|..||.=+|
T Consensus 253 ------ssc~lydirg~r~iq~f~phsadir~vrfsp~a-----------------~yllt~syd~~ikltdl 302 (350)
T KOG0641|consen 253 ------SSCMLYDIRGGRMIQRFHPHSADIRCVRFSPGA-----------------HYLLTCSYDMKIKLTDL 302 (350)
T ss_pred ------CceEEEEeeCCceeeeeCCCccceeEEEeCCCc-----------------eEEEEecccceEEEeec
Confidence 899999999999999999999999999998632 12334444777877655
No 116
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.58 E-value=7e-14 Score=154.51 Aligned_cols=193 Identities=16% Similarity=0.188 Sum_probs=148.3
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
++..+.+.+++++|...... ...+..|.+.|+++.+..+.. -+.|++|++||.|.+|+ +
T Consensus 56 ~aSGssDetI~IYDm~k~~q----lg~ll~HagsitaL~F~~~~S--~shLlS~sdDG~i~iw~---------------~ 114 (362)
T KOG0294|consen 56 VASGSSDETIHIYDMRKRKQ----LGILLSHAGSITALKFYPPLS--KSHLLSGSDDGHIIIWR---------------V 114 (362)
T ss_pred EeccCCCCcEEEEeccchhh----hcceeccccceEEEEecCCcc--hhheeeecCCCcEEEEE---------------c
Confidence 45566788999999877644 344567899999987443331 11599999999999933 3
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC--
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE-- 635 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~-- 635 (1471)
.+.....++++|.+.|+.+++||. +++.+|-+.|+.+++||+-+|+.-....--. .-+.|.|+|....
T Consensus 115 ~~W~~~~slK~H~~~Vt~lsiHPS---------~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~-~at~v~w~~~Gd~F~ 184 (362)
T KOG0294|consen 115 GSWELLKSLKAHKGQVTDLSIHPS---------GKLALSVGGDQVLRTWNLVRGRVAFVLNLKN-KATLVSWSPQGDHFV 184 (362)
T ss_pred CCeEEeeeecccccccceeEecCC---------CceEEEEcCCceeeeehhhcCccceeeccCC-cceeeEEcCCCCEEE
Confidence 444678899999999999999996 8999999999999999998876433222111 0111334433110
Q ss_pred -------------------------------CCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE--cCCCC
Q 000473 636 -------------------------------HPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW--DCPRG 682 (1471)
Q Consensus 636 -------------------------------~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~--spdg~ 682 (1471)
-..+..+++|++|+.|++||..+..+...+.+|..+|..+.+ .|++.
T Consensus 185 v~~~~~i~i~q~d~A~v~~~i~~~~r~l~~~~l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~ 264 (362)
T KOG0294|consen 185 VSGRNKIDIYQLDNASVFREIENPKRILCATFLDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHE 264 (362)
T ss_pred EEeccEEEEEecccHhHhhhhhccccceeeeecCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecCCce
Confidence 113568999999999999999999999999999999999985 67889
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
||++++.| |.|+|||++..
T Consensus 265 ~lvTaSSD--------G~I~vWd~~~~ 283 (362)
T KOG0294|consen 265 YLVTASSD--------GFIKVWDIDME 283 (362)
T ss_pred EEEEeccC--------ceEEEEEcccc
Confidence 99999999 99999999876
No 117
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.58 E-value=3.4e-14 Score=164.76 Aligned_cols=198 Identities=17% Similarity=0.159 Sum_probs=156.4
Q ss_pred ccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 481 TFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 481 s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
.-.+|.++++| .+.+ .....+.+|..+|..+. |+|. .+++|+.|+.+++|+ .
T Consensus 86 GD~sG~V~vfD-~k~r---~iLR~~~ah~apv~~~~------f~~~d~t~l~s~sDd~v~k~~d---------------~ 140 (487)
T KOG0310|consen 86 GDESGHVKVFD-MKSR---VILRQLYAHQAPVHVTK------FSPQDNTMLVSGSDDKVVKYWD---------------L 140 (487)
T ss_pred cCCcCcEEEec-cccH---HHHHHHhhccCceeEEE------ecccCCeEEEecCCCceEEEEE---------------c
Confidence 33457888888 4431 23455688988888876 5554 788888888888833 3
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-ceEEEEeccCCCEEEEEECCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-NLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
.+......+.||++.|.|.+|+|. .+..++|||.||+|++||+++. ..+..+ .|..+|..+.+-|.
T Consensus 141 s~a~v~~~l~~htDYVR~g~~~~~--------~~hivvtGsYDg~vrl~DtR~~~~~v~el-nhg~pVe~vl~lps---- 207 (487)
T KOG0310|consen 141 STAYVQAELSGHTDYVRCGDISPA--------NDHIVVTGSYDGKVRLWDTRSLTSRVVEL-NHGCPVESVLALPS---- 207 (487)
T ss_pred CCcEEEEEecCCcceeEeeccccC--------CCeEEEecCCCceEEEEEeccCCceeEEe-cCCCceeeEEEcCC----
Confidence 344445578999999999999996 3779999999999999999987 555555 69999999999998
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCc-EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 637 PWSDCFLSVGEDFSVALASLETLR-VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~-~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
|..|++++. ..|++||+.+|. .+..+..|...|+|+++..++..|++|+-| +.|+|||+.+.+.+..+
T Consensus 208 --gs~iasAgG-n~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD--------~~VKVfd~t~~Kvv~s~ 276 (487)
T KOG0310|consen 208 --GSLIASAGG-NSVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSLD--------RHVKVFDTTNYKVVHSW 276 (487)
T ss_pred --CCEEEEcCC-CeEEEEEecCCceehhhhhcccceEEEEEeecCCceEeecccc--------cceEEEEccceEEEEee
Confidence 889999874 589999999655 455555599999999999999999999999 99999998888888777
Q ss_pred eCCCCCceeeeee
Q 000473 716 RGTASHSMFDHFC 728 (1471)
Q Consensus 716 ~gH~~~v~~~~~~ 728 (1471)
.-. +.++.+...
T Consensus 277 ~~~-~pvLsiavs 288 (487)
T KOG0310|consen 277 KYP-GPVLSIAVS 288 (487)
T ss_pred ecc-cceeeEEec
Confidence 643 466665444
No 118
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.57 E-value=6.1e-14 Score=172.25 Aligned_cols=209 Identities=13% Similarity=0.150 Sum_probs=160.8
Q ss_pred CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEe
Q 000473 500 DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAH 579 (1471)
Q Consensus 500 ~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~s 579 (1471)
.....+.||.+.|..+.+...+ .|++++.|.++++ | .+....++.+| .|.+.|+|++|+
T Consensus 360 kP~~ef~GHt~DILDlSWSKn~-----fLLSSSMDKTVRL--W-------------h~~~~~CL~~F-~HndfVTcVaFn 418 (712)
T KOG0283|consen 360 KPFCEFKGHTADILDLSWSKNN-----FLLSSSMDKTVRL--W-------------HPGRKECLKVF-SHNDFVTCVAFN 418 (712)
T ss_pred cchhhhhccchhheecccccCC-----eeEeccccccEEe--e-------------cCCCcceeeEE-ecCCeeEEEEec
Confidence 4456678999999887755444 5999999999998 4 35555777766 499999999999
Q ss_pred cCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC
Q 000473 580 RMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL 659 (1471)
Q Consensus 580 pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~ 659 (1471)
|- +.++++||+-|+.||||++...+...-...+ .-|++++|.|+ |+..+.|+-+|.+++|+....
T Consensus 419 Pv--------DDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~-~lITAvcy~Pd------Gk~avIGt~~G~C~fY~t~~l 483 (712)
T KOG0283|consen 419 PV--------DDRYFISGSLDGKVRLWSISDKKVVDWNDLR-DLITAVCYSPD------GKGAVIGTFNGYCRFYDTEGL 483 (712)
T ss_pred cc--------CCCcEeecccccceEEeecCcCeeEeehhhh-hhheeEEeccC------CceEEEEEeccEEEEEEccCC
Confidence 96 4899999999999999999876655444444 78999999999 999999999999999999988
Q ss_pred cEEEEecC--C------CCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCc--eeeeee
Q 000473 660 RVERMFPG--H------PNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHS--MFDHFC 728 (1471)
Q Consensus 660 ~~l~~~~g--h------~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v--~~~~~~ 728 (1471)
+....+.- | ...|+.+.|.|... .+++.+.| ..|||+|.++..++..+.|+...- +.+.|.
T Consensus 484 k~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSnD--------SrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs 555 (712)
T KOG0283|consen 484 KLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTSND--------SRIRIYDGRDKDLVHKFKGFRNTSSQISASFS 555 (712)
T ss_pred eEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEecCC--------CceEEEeccchhhhhhhcccccCCcceeeeEc
Confidence 77655431 1 12799999987654 46677777 899999999999999999876422 223333
Q ss_pred eccccccccceEEcCCccccccceeeccCCceEeecccccc
Q 000473 729 KGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 729 ~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~ 769 (1471)
.+|. .++..++|.-+.+|++..+.
T Consensus 556 ------------~Dgk-----~IVs~seDs~VYiW~~~~~~ 579 (712)
T KOG0283|consen 556 ------------SDGK-----HIVSASEDSWVYIWKNDSFN 579 (712)
T ss_pred ------------cCCC-----EEEEeecCceEEEEeCCCCc
Confidence 1222 34455579999999975444
No 119
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.57 E-value=4.8e-14 Score=162.36 Aligned_cols=167 Identities=19% Similarity=0.202 Sum_probs=144.4
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
.+|...+.++++.++.. ++++|..|..|.| |+.++.+.++.+.||.+.|.+++|-..
T Consensus 199 ~~h~keil~~avS~Dgk----ylatgg~d~~v~I---------------w~~~t~ehv~~~~ghr~~V~~L~fr~g---- 255 (479)
T KOG0299|consen 199 KGHVKEILTLAVSSDGK----YLATGGRDRHVQI---------------WDCDTLEHVKVFKGHRGAVSSLAFRKG---- 255 (479)
T ss_pred ccccceeEEEEEcCCCc----EEEecCCCceEEE---------------ecCcccchhhcccccccceeeeeeecC----
Confidence 37889999999888888 6999999999998 445677888899999999999999764
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF 665 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~ 665 (1471)
...|.+++.|++|++|+++....+.++.+|...|.+|..... ++++..|+.|+++++|++. -+....|
T Consensus 256 -----t~~lys~s~Drsvkvw~~~~~s~vetlyGHqd~v~~IdaL~r------eR~vtVGgrDrT~rlwKi~-eesqlif 323 (479)
T KOG0299|consen 256 -----TSELYSASADRSVKVWSIDQLSYVETLYGHQDGVLGIDALSR------ERCVTVGGRDRTVRLWKIP-EESQLIF 323 (479)
T ss_pred -----ccceeeeecCCceEEEehhHhHHHHHHhCCccceeeechhcc------cceEEeccccceeEEEecc-ccceeee
Confidence 678999999999999999998889999999999999988776 6777777799999999995 3444678
Q ss_pred cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 666 PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 666 ~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.+|.+.+.|++|-.+ ..+++|+.| |.|.+|++.+.+++.+..
T Consensus 324 rg~~~sidcv~~In~-~HfvsGSdn--------G~IaLWs~~KKkplf~~~ 365 (479)
T KOG0299|consen 324 RGGEGSIDCVAFIND-EHFVSGSDN--------GSIALWSLLKKKPLFTSR 365 (479)
T ss_pred eCCCCCeeeEEEecc-cceeeccCC--------ceEEEeeecccCceeEee
Confidence 899999999999765 467789988 999999999988887654
No 120
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.56 E-value=7.8e-14 Score=161.67 Aligned_cols=215 Identities=18% Similarity=0.203 Sum_probs=153.6
Q ss_pred cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 504 DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 504 ~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
.+.+|...|+++.+-+... ++++|+.|-+|+.|+++-+. ...+ ..+.++ ..-+..|+++.|++.
T Consensus 162 ~l~hgtk~Vsal~~Dp~Ga----R~~sGs~Dy~v~~wDf~gMd---as~~-----~fr~l~--P~E~h~i~sl~ys~T-- 225 (641)
T KOG0772|consen 162 QLKHGTKIVSALAVDPSGA----RFVSGSLDYTVKFWDFQGMD---ASMR-----SFRQLQ--PCETHQINSLQYSVT-- 225 (641)
T ss_pred eccCCceEEEEeeecCCCc----eeeeccccceEEEEeccccc---ccch-----hhhccC--cccccccceeeecCC--
Confidence 3578889999988555554 99999999999996665221 1111 111122 223446899999997
Q ss_pred CcccCcCCCEEEEEECCCcEEEEECCCCceEE-------------EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCc
Q 000473 584 TAKGWSFNEVLVSGSMDCSIRIWDLGSGNLIT-------------VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFS 650 (1471)
Q Consensus 584 ~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~-------------~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dgs 650 (1471)
+..|+..+.....+|+|-. |..+. .-++|...+++..|+|.. .+.|+|++.|++
T Consensus 226 -------g~~iLvvsg~aqakl~DRd-G~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~-----k~~FlT~s~Dgt 292 (641)
T KOG0772|consen 226 -------GDQILVVSGSAQAKLLDRD-GFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDN-----KEEFLTCSYDGT 292 (641)
T ss_pred -------CCeEEEEecCcceeEEccC-CceeeeeeccchhhhhhhccCCceeeeeccccccCc-----ccceEEecCCCc
Confidence 6666666667789999954 33322 235799999999999985 679999999999
Q ss_pred EEEEECCCCc-EEEEec-----CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe---EEEEEeCCCC-
Q 000473 651 VALASLETLR-VERMFP-----GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA---RERVLRGTAS- 720 (1471)
Q Consensus 651 V~lWdl~t~~-~l~~~~-----gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~---~~~~l~gH~~- 720 (1471)
+|+||+...+ .+..+. +..-+++.++|+|||+.|++||.| |+|.+||..... ...+-..|..
T Consensus 293 lRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~iAagc~D--------GSIQ~W~~~~~~v~p~~~vk~AH~~g 364 (641)
T KOG0772|consen 293 LRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKLIAAGCLD--------GSIQIWDKGSRTVRPVMKVKDAHLPG 364 (641)
T ss_pred EEEEecCCchhheeEEeeccCCCcccCceeeecCCCcchhhhcccC--------CceeeeecCCcccccceEeeeccCCC
Confidence 9999997644 333332 233478999999999999999999 999999975432 1233456765
Q ss_pred -CceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccccc
Q 000473 721 -HSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGV 772 (1471)
Q Consensus 721 -~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~ 772 (1471)
.+.++.|. .+|+...+... |+++|+|+|+++.+..
T Consensus 365 ~~Itsi~FS------------~dg~~LlSRg~-----D~tLKvWDLrq~kkpL 400 (641)
T KOG0772|consen 365 QDITSISFS------------YDGNYLLSRGF-----DDTLKVWDLRQFKKPL 400 (641)
T ss_pred CceeEEEec------------cccchhhhccC-----CCceeeeeccccccch
Confidence 67777776 33455444444 9999999999887654
No 121
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.55 E-value=5.7e-14 Score=168.92 Aligned_cols=200 Identities=20% Similarity=0.196 Sum_probs=162.9
Q ss_pred ccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcce
Q 000473 483 CQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVS 562 (1471)
Q Consensus 483 ~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~ 562 (1471)
....+.+|+.....- .....-+...|+++.+..+.. .|++|..+|.+.| | |..+.+.
T Consensus 195 lg~~vylW~~~s~~v----~~l~~~~~~~vtSv~ws~~G~----~LavG~~~g~v~i--w-------------D~~~~k~ 251 (484)
T KOG0305|consen 195 LGQSVYLWSASSGSV----TELCSFGEELVTSVKWSPDGS----HLAVGTSDGTVQI--W-------------DVKEQKK 251 (484)
T ss_pred ecceEEEEecCCCce----EEeEecCCCceEEEEECCCCC----EEEEeecCCeEEE--E-------------ehhhccc
Confidence 344577888766431 122222367899998777666 7999999999999 3 3445566
Q ss_pred EEEEec-CCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEE-EeccCCCEEEEEECCCCCCCCCCC
Q 000473 563 RQYFLG-HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITV-MHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 563 ~~~l~g-H~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~-~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
.+.+.+ |...|-|++|. +..+.+|+.|+.|..+|++..+.... +.+|...|..+.|+++ +.
T Consensus 252 ~~~~~~~h~~rvg~laW~-----------~~~lssGsr~~~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d------~~ 314 (484)
T KOG0305|consen 252 TRTLRGSHASRVGSLAWN-----------SSVLSSGSRDGKILNHDVRISQHVVSTLQGHRQEVCGLKWSPD------GN 314 (484)
T ss_pred cccccCCcCceeEEEecc-----------CceEEEecCCCcEEEEEEecchhhhhhhhcccceeeeeEECCC------CC
Confidence 777888 99999999997 57899999999999999998876655 8899999999999999 99
Q ss_pred EEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCC
Q 000473 641 CFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
+++||+.|+.+.|||.....+...+..|.+.|.+++|+|-. ..||+|+.- . |++|++||..+|++++.+...
T Consensus 315 ~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGGGs----~--D~~i~fwn~~~g~~i~~vdtg- 387 (484)
T KOG0305|consen 315 QLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGGGS----A--DRCIKFWNTNTGARIDSVDTG- 387 (484)
T ss_pred eeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcCCC----c--ccEEEEEEcCCCcEecccccC-
Confidence 99999999999999998889999999999999999999964 577776532 3 499999999999999887643
Q ss_pred CCceeeeeee
Q 000473 720 SHSMFDHFCK 729 (1471)
Q Consensus 720 ~~v~~~~~~~ 729 (1471)
++|..+.|.+
T Consensus 388 sQVcsL~Wsk 397 (484)
T KOG0305|consen 388 SQVCSLIWSK 397 (484)
T ss_pred CceeeEEEcC
Confidence 3666666764
No 122
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.54 E-value=8.6e-13 Score=145.59 Aligned_cols=120 Identities=19% Similarity=0.161 Sum_probs=92.3
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC-------CceEEEE----eccCCCEEEEEE
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS-------GNLITVM----HHHVAPVRQIIL 629 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t-------g~~l~~~----~~H~~~V~~l~f 629 (1471)
..+..|+||...|..++|+++ ...++|.|.||++++||++- .+.+.++ ..-++.-..+.+
T Consensus 269 ~rvf~LkGH~saV~~~aFsn~---------S~r~vtvSkDG~wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~RL~l 339 (420)
T KOG2096|consen 269 KRVFSLKGHQSAVLAAAFSNS---------STRAVTVSKDGKWRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPVRLEL 339 (420)
T ss_pred hhhheeccchhheeeeeeCCC---------cceeEEEecCCcEEEeeccceEecCCCchHhhcCCcchhhcCCCceEEEe
Confidence 345678999999999999997 79999999999999999862 1223222 222333448999
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEec-CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFP-GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~-gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+|+ |+.|+. +.-.+++++..++|+..-.+. .|...|.++.|+++|+|++++++ ..++|.-
T Consensus 340 sP~------g~~lA~-s~gs~l~~~~se~g~~~~~~e~~h~~~Is~is~~~~g~~~atcGd---------r~vrv~~ 400 (420)
T KOG2096|consen 340 SPS------GDSLAV-SFGSDLKVFASEDGKDYPELEDIHSTTISSISYSSDGKYIATCGD---------RYVRVIR 400 (420)
T ss_pred CCC------CcEEEe-ecCCceEEEEcccCccchhHHHhhcCceeeEEecCCCcEEeeecc---------eeeeeec
Confidence 998 776654 445689999999988776664 58889999999999999988654 5777754
No 123
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.53 E-value=1.5e-14 Score=154.76 Aligned_cols=205 Identities=17% Similarity=0.207 Sum_probs=166.4
Q ss_pred ccccCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
-.+|..+|-.++|.+- .|+ .|++++.||.=.+. +-++|.-+.+|.||.++|+......+
T Consensus 10 c~ghtrpvvdl~~s~i---tp~g~flisa~kd~~pmlr---------------~g~tgdwigtfeghkgavw~~~l~~n- 70 (334)
T KOG0278|consen 10 CHGHTRPVVDLAFSPI---TPDGYFLISASKDGKPMLR---------------NGDTGDWIGTFEGHKGAVWSATLNKN- 70 (334)
T ss_pred EcCCCcceeEEeccCC---CCCceEEEEeccCCCchhc---------------cCCCCCcEEeeeccCcceeeeecCch-
Confidence 4678888888774331 122 79999999876541 23456778999999999999888765
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc-E
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR-V 661 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~-~ 661 (1471)
....+|++.|.+.++||.-+|..++.| .|..-|.+++|+.| .+.|++|+.++.+|++|++..+ +
T Consensus 71 --------a~~aasaaadftakvw~a~tgdelhsf-~hkhivk~~af~~d------s~~lltgg~ekllrvfdln~p~Ap 135 (334)
T KOG0278|consen 71 --------ATRAASAAADFTAKVWDAVTGDELHSF-EHKHIVKAVAFSQD------SNYLLTGGQEKLLRVFDLNRPKAP 135 (334)
T ss_pred --------hhhhhhhcccchhhhhhhhhhhhhhhh-hhhheeeeEEeccc------chhhhccchHHHhhhhhccCCCCC
Confidence 678899999999999999999999999 57788999999999 8999999999999999998654 5
Q ss_pred EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEE
Q 000473 662 ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVL 741 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~ 741 (1471)
...+.+|.+.|..+.|...++.+++...| ++||+||.+||..++++.-.. +|......
T Consensus 136 p~E~~ghtg~Ir~v~wc~eD~~iLSSadd--------~tVRLWD~rTgt~v~sL~~~s-~VtSlEvs------------- 193 (334)
T KOG0278|consen 136 PKEISGHTGGIRTVLWCHEDKCILSSADD--------KTVRLWDHRTGTEVQSLEFNS-PVTSLEVS------------- 193 (334)
T ss_pred chhhcCCCCcceeEEEeccCceEEeeccC--------CceEEEEeccCcEEEEEecCC-CCcceeec-------------
Confidence 67889999999999999988889888777 999999999999999887543 44444333
Q ss_pred cCCccccccceeeccCCceEeeccccccc
Q 000473 742 NGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 742 ~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
-.+.+++++-.++++.|+.++|+.
T Consensus 194 -----~dG~ilTia~gssV~Fwdaksf~~ 217 (334)
T KOG0278|consen 194 -----QDGRILTIAYGSSVKFWDAKSFGL 217 (334)
T ss_pred -----cCCCEEEEecCceeEEeccccccc
Confidence 234566777788899999887764
No 124
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.53 E-value=3.8e-11 Score=139.05 Aligned_cols=122 Identities=12% Similarity=0.108 Sum_probs=95.0
Q ss_pred EEEEEEecCCCCcccCcCCCEE-EEEECCCcEEEEECCCCceEEEEeccCC-------CEEEEEECCCCCCCCCCCEE-E
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVL-VSGSMDCSIRIWDLGSGNLITVMHHHVA-------PVRQIILSPPQTEHPWSDCF-L 643 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L-~SGs~DgtI~lWDl~tg~~l~~~~~H~~-------~V~~l~fspd~~~~~~~~~l-~ 643 (1471)
..+++|+|+ +++| +++..|+.|++||+.+++.+..+..+.. ....++|+|+ ++.+ +
T Consensus 159 ~~~~~~s~d---------g~~l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~d------g~~~~~ 223 (300)
T TIGR03866 159 PRFAEFTAD---------GKELWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKD------GKTAFV 223 (300)
T ss_pred ccEEEECCC---------CCEEEEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCC------CCEEEE
Confidence 456889987 6666 5666799999999999988777653321 1346889998 7764 4
Q ss_pred EEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEc-CCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 644 SVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCR-DHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 644 S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~-D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
+.+.++.+.+||+++++.+..+. +...+.+++|+|+|++|++++. + |+|+|||++++++++.+...
T Consensus 224 ~~~~~~~i~v~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~~--------~~i~v~d~~~~~~~~~~~~~ 290 (300)
T TIGR03866 224 ALGPANRVAVVDAKTYEVLDYLL-VGQRVWQLAFTPDEKYLLTTNGVS--------NDVSVIDVAALKVIKSIKVG 290 (300)
T ss_pred EcCCCCeEEEEECCCCcEEEEEE-eCCCcceEEECCCCCEEEEEcCCC--------CeEEEEECCCCcEEEEEEcc
Confidence 45667789999999998876653 4457899999999999988754 5 89999999999998888754
No 125
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.53 E-value=1.7e-12 Score=144.51 Aligned_cols=209 Identities=14% Similarity=0.174 Sum_probs=157.2
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
+..-...|+++.+..+.. .+++.++|.++++ ++ ..+++..+++..++..|..+.|.+.
T Consensus 10 f~~~~~~i~sl~fs~~G~----~litss~dDsl~L--Yd-------------~~~g~~~~ti~skkyG~~~~~Fth~--- 67 (311)
T KOG1446|consen 10 FRETNGKINSLDFSDDGL----LLITSSEDDSLRL--YD-------------SLSGKQVKTINSKKYGVDLACFTHH--- 67 (311)
T ss_pred cccCCCceeEEEecCCCC----EEEEecCCCeEEE--EE-------------cCCCceeeEeecccccccEEEEecC---
Confidence 444457889988555554 7888899999999 33 4467888888888888888888654
Q ss_pred cccCcCCCEEEEEEC--CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 585 AKGWSFNEVLVSGSM--DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~--DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
...++-++. |.+||.-++.+.+.++.|.||...|.+|..+|- ++.|+|++.|++|++||++..+|.
T Consensus 68 ------~~~~i~sStk~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~------~d~FlS~S~D~tvrLWDlR~~~cq 135 (311)
T KOG1446|consen 68 ------SNTVIHSSTKEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPK------DDTFLSSSLDKTVRLWDLRVKKCQ 135 (311)
T ss_pred ------CceEEEccCCCCCceEEEEeecCceEEEcCCCCceEEEEEecCC------CCeEEecccCCeEEeeEecCCCCc
Confidence 455555555 999999999999999999999999999999998 899999999999999999854322
Q ss_pred EEec---------------------------------------------CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCC
Q 000473 663 RMFP---------------------------------------------GHPNYPAKVVWDCPRGYIACLCRDHSRTSDA 697 (1471)
Q Consensus 663 ~~~~---------------------------------------------gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~ 697 (1471)
..+. +-....+.+.|+|||++|+.+...
T Consensus 136 g~l~~~~~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~~------- 208 (311)
T KOG1446|consen 136 GLLNLSGRPIAAFDPEGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTNA------- 208 (311)
T ss_pred eEEecCCCcceeECCCCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeCC-------
Confidence 2111 112346789999999999998887
Q ss_pred CCEEEEEECCCCeEEEEEeCCCCCc-eeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 698 VDVLFIWDVKTGARERVLRGTASHS-MFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 698 ~gtV~VWDi~tg~~~~~l~gH~~~v-~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
+.+++-|.-+|....++.+|...- +...+| .. +++.+ ++..+.||++.+|+++.-++
T Consensus 209 -s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~----ft------Pds~F-----vl~gs~dg~i~vw~~~tg~~ 266 (311)
T KOG1446|consen 209 -SFIYLLDAFDGTVKSTFSGYPNAGNLPLSAT----FT------PDSKF-----VLSGSDDGTIHVWNLETGKK 266 (311)
T ss_pred -CcEEEEEccCCcEeeeEeeccCCCCcceeEE----EC------CCCcE-----EEEecCCCcEEEEEcCCCcE
Confidence 899999999999999999987544 222222 00 11222 23334489999999854443
No 126
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.53 E-value=6.4e-14 Score=156.90 Aligned_cols=240 Identities=14% Similarity=0.127 Sum_probs=182.6
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecc-----------cc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDL-----------FE 546 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~-----------l~ 546 (1471)
+++.+.+|-+++|+..+. .+..++..|.+.|.-+.+.. ..+++.+.|.+|+.|..+. +.
T Consensus 82 ~aSGs~DG~VkiWnlsqR----~~~~~f~AH~G~V~Gi~v~~------~~~~tvgdDKtvK~wk~~~~p~~tilg~s~~~ 151 (433)
T KOG0268|consen 82 VASGSCDGEVKIWNLSQR----ECIRTFKAHEGLVRGICVTQ------TSFFTVGDDKTVKQWKIDGPPLHTILGKSVYL 151 (433)
T ss_pred hhccccCceEEEEehhhh----hhhheeecccCceeeEEecc------cceEEecCCcceeeeeccCCcceeeecccccc
Confidence 456677889999998874 56778999999999887444 3588889999999975321 00
Q ss_pred cCC------------CCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE
Q 000473 547 RHN------------SPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI 614 (1471)
Q Consensus 547 ~~d------------~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l 614 (1471)
+.| ....+||..-..+++.+.--.+.+.++.|+|.. ...|++|+.|++|.++|++++.++
T Consensus 152 gIdh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvE--------TsILas~~sDrsIvLyD~R~~~Pl 223 (433)
T KOG0268|consen 152 GIDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVE--------TSILASCASDRSIVLYDLRQASPL 223 (433)
T ss_pred ccccccccccccccCceeeecccccCCccceeecCCCceeEEecCCCc--------chheeeeccCCceEEEecccCCcc
Confidence 101 112478887777888888778889999999972 678999999999999999999988
Q ss_pred EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCC
Q 000473 615 TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSR 693 (1471)
Q Consensus 615 ~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg 693 (1471)
+.... +-.-..|+|+|+ +-.|+++++|..+..+|++.. +++..+.+|.+.|.+|.|+|.|+-+++|+.|
T Consensus 224 ~KVi~-~mRTN~IswnPe------afnF~~a~ED~nlY~~DmR~l~~p~~v~~dhvsAV~dVdfsptG~EfvsgsyD--- 293 (433)
T KOG0268|consen 224 KKVIL-TMRTNTICWNPE------AFNFVAANEDHNLYTYDMRNLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSYD--- 293 (433)
T ss_pred ceeee-eccccceecCcc------ccceeeccccccceehhhhhhcccchhhcccceeEEEeccCCCcchhcccccc---
Confidence 66532 223467899996 889999999999999999875 5678889999999999999999999999998
Q ss_pred CCCCCCEEEEEECCCCeEEEEEeC-CCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 694 TSDAVDVLFIWDVKTGARERVLRG-TASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 694 ~~D~~gtV~VWDi~tg~~~~~l~g-H~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
.+|+||.+..|+---++.. --.+|+++.|. ..+-.|++|+ .|+.+|.|..+.
T Consensus 294 -----ksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S------~Dskyi~SGS-----------dd~nvRlWka~A 346 (433)
T KOG0268|consen 294 -----KSIRIFPVNHGHSRDIYHTKRMQHVFCVKYS------MDSKYIISGS-----------DDGNVRLWKAKA 346 (433)
T ss_pred -----ceEEEeecCCCcchhhhhHhhhheeeEEEEe------ccccEEEecC-----------CCcceeeeecch
Confidence 9999999988765433321 12356666665 1222333333 389999998643
No 127
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.52 E-value=5.8e-13 Score=156.84 Aligned_cols=143 Identities=14% Similarity=0.194 Sum_probs=111.7
Q ss_pred ccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC
Q 000473 523 YAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD 600 (1471)
Q Consensus 523 f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D 600 (1471)
|+|. .++.++. ..|+|+++. ....+..+......|..+++||. |..|+.|+.|
T Consensus 574 FHPs~p~lfVaTq-~~vRiYdL~---------------kqelvKkL~tg~kwiS~msihp~---------GDnli~gs~d 628 (733)
T KOG0650|consen 574 FHPSKPYLFVATQ-RSVRIYDLS---------------KQELVKKLLTGSKWISSMSIHPN---------GDNLILGSYD 628 (733)
T ss_pred ecCCCceEEEEec-cceEEEehh---------------HHHHHHHHhcCCeeeeeeeecCC---------CCeEEEecCC
Confidence 5554 4555554 578884332 12334444444556889999996 8999999999
Q ss_pred CcEEEEECCCC-ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC------CC---cEEEEecCCCC
Q 000473 601 CSIRIWDLGSG-NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE------TL---RVERMFPGHPN 670 (1471)
Q Consensus 601 gtI~lWDl~tg-~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~------t~---~~l~~~~gh~~ 670 (1471)
+.++++|+.-. ++.+++..|...|++|+|++. -.+||||+.|+++.|+.-. .. -++..+.||..
T Consensus 629 ~k~~WfDldlsskPyk~lr~H~~avr~Va~H~r------yPLfas~sdDgtv~Vfhg~VY~Dl~qnpliVPlK~L~gH~~ 702 (733)
T KOG0650|consen 629 KKMCWFDLDLSSKPYKTLRLHEKAVRSVAFHKR------YPLFASGSDDGTVIVFHGMVYNDLLQNPLIVPLKRLRGHEK 702 (733)
T ss_pred CeeEEEEcccCcchhHHhhhhhhhhhhhhhccc------cceeeeecCCCcEEEEeeeeehhhhcCCceEeeeeccCcee
Confidence 99999999744 567888899999999999998 7899999999999988532 11 24577889976
Q ss_pred C----cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 671 Y----PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 671 ~----V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
. |..+.|+|...+|++++.| |+|++|
T Consensus 703 ~~~~gVLd~~wHP~qpWLfsAGAd--------~tirlf 732 (733)
T KOG0650|consen 703 TNDLGVLDTIWHPRQPWLFSAGAD--------GTIRLF 732 (733)
T ss_pred ecccceEeecccCCCceEEecCCC--------ceEEee
Confidence 5 8999999999999999999 999998
No 128
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.52 E-value=3.4e-14 Score=159.04 Aligned_cols=201 Identities=16% Similarity=0.242 Sum_probs=155.2
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
+.+|...|.|++-.+.. +..+++|+.||.|++ | |.....+..+|..|.|.|..|.+.
T Consensus 62 L~gHrdGV~~lakhp~~---ls~~aSGs~DG~Vki--W-------------nlsqR~~~~~f~AH~G~V~Gi~v~----- 118 (433)
T KOG0268|consen 62 LDGHRDGVSCLAKHPNK---LSTVASGSCDGEVKI--W-------------NLSQRECIRTFKAHEGLVRGICVT----- 118 (433)
T ss_pred ccccccccchhhcCcch---hhhhhccccCceEEE--E-------------ehhhhhhhheeecccCceeeEEec-----
Confidence 57899999998733322 126899999999999 3 344557788999999999999986
Q ss_pred cccCcCCCEEEEEECCCcEEEEECCC---------------------------C-----------ceEEEEeccCCCEEE
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLGS---------------------------G-----------NLITVMHHHVAPVRQ 626 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~t---------------------------g-----------~~l~~~~~H~~~V~~ 626 (1471)
...+++++.|.+|+.|-+.- | .++..|.--...|.+
T Consensus 119 ------~~~~~tvgdDKtvK~wk~~~~p~~tilg~s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~s 192 (433)
T KOG0268|consen 119 ------QTSFFTVGDDKTVKQWKIDGPPLHTILGKSVYLGIDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSISS 192 (433)
T ss_pred ------ccceEEecCCcceeeeeccCCcceeeeccccccccccccccccccccCceeeecccccCCccceeecCCCceeE
Confidence 35688888999988887431 1 133334444556788
Q ss_pred EEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 627 IILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 627 l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+.|+|.+ -..|++++.|++|.|||++++.+++...- ...-+.|+|+|.+-.+++|.+| ..+|.+|+
T Consensus 193 vkfNpvE-----TsILas~~sDrsIvLyD~R~~~Pl~KVi~-~mRTN~IswnPeafnF~~a~ED--------~nlY~~Dm 258 (433)
T KOG0268|consen 193 VKFNPVE-----TSILASCASDRSIVLYDLRQASPLKKVIL-TMRTNTICWNPEAFNFVAANED--------HNLYTYDM 258 (433)
T ss_pred EecCCCc-----chheeeeccCCceEEEecccCCccceeee-eccccceecCccccceeecccc--------ccceehhh
Confidence 8888885 56899999999999999999998876542 3346789999977777888888 89999999
Q ss_pred CC-CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 707 KT-GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 707 ~t-g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
+. .+.+.+..+|.+.|+.++|.| +|..-++++. |.+||++..
T Consensus 259 R~l~~p~~v~~dhvsAV~dVdfsp------------tG~Efvsgsy-----DksIRIf~~ 301 (433)
T KOG0268|consen 259 RNLSRPLNVHKDHVSAVMDVDFSP------------TGQEFVSGSY-----DKSIRIFPV 301 (433)
T ss_pred hhhcccchhhcccceeEEEeccCC------------Ccchhccccc-----cceEEEeec
Confidence 87 457889999999999998883 4556566666 999998765
No 129
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.51 E-value=5.9e-12 Score=139.50 Aligned_cols=95 Identities=18% Similarity=0.245 Sum_probs=74.5
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCc---------cEEEEEEecCCCCcccCcCCCEEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTG---------AVLCLAAHRMVGTAKGWSFNEVLVSG 597 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~---------~V~~la~spd~~~~~~~~~~~~L~SG 597 (1471)
..++++-||.+.|-.++.- -........|+.|.. +|++++|||- -+.|+||
T Consensus 191 Gy~~sSieGRVavE~~d~s-----------~~~~skkyaFkCHr~~~~~~~~~yPVNai~Fhp~---------~~tfaTg 250 (323)
T KOG1036|consen 191 GYVVSSIEGRVAVEYFDDS-----------EEAQSKKYAFKCHRLSEKDTEIIYPVNAIAFHPI---------HGTFATG 250 (323)
T ss_pred ceEEEeecceEEEEccCCc-----------hHHhhhceeEEeeecccCCceEEEEeceeEeccc---------cceEEec
Confidence 5778888898888555521 011122345566632 6999999996 5789999
Q ss_pred ECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 598 SMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 598 s~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
+.||.|.+||+.+.+.+..|......|.+++|+.+ |.+||.++.
T Consensus 251 GsDG~V~~Wd~~~rKrl~q~~~~~~SI~slsfs~d------G~~LAia~s 294 (323)
T KOG1036|consen 251 GSDGIVNIWDLFNRKRLKQLAKYETSISSLSFSMD------GSLLAIASS 294 (323)
T ss_pred CCCceEEEccCcchhhhhhccCCCCceEEEEeccC------CCeEEEEec
Confidence 99999999999999999999888888999999999 899998864
No 130
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.51 E-value=1e-13 Score=156.75 Aligned_cols=135 Identities=19% Similarity=0.297 Sum_probs=123.4
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-------ceEEEEeccCCCEEEEEECCCCCCC
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-------NLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-------~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
-.+.||+++|..++|.|. +.+.|+|||.|.+|.||++..+ +++..+.+|...|--++|+|..
T Consensus 75 P~v~GHt~~vLDi~w~Pf--------nD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA--- 143 (472)
T KOG0303|consen 75 PLVCGHTAPVLDIDWCPF--------NDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTA--- 143 (472)
T ss_pred CCccCccccccccccCcc--------CCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccc---
Confidence 346799999999999997 3789999999999999999643 3567889999999999999984
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.+.|+|++.|.+|.+||+.+|+.+..+. |..-|+++.|+-||.+|+++|.| ..|||||.++|+.+.+-.
T Consensus 144 --~NVLlsag~Dn~v~iWnv~tgeali~l~-hpd~i~S~sfn~dGs~l~TtckD--------KkvRv~dpr~~~~v~e~~ 212 (472)
T KOG0303|consen 144 --PNVLLSAGSDNTVSIWNVGTGEALITLD-HPDMVYSMSFNRDGSLLCTTCKD--------KKVRVIDPRRGTVVSEGV 212 (472)
T ss_pred --hhhHhhccCCceEEEEeccCCceeeecC-CCCeEEEEEeccCCceeeeeccc--------ceeEEEcCCCCcEeeecc
Confidence 6799999999999999999999988877 99999999999999999999999 999999999999999988
Q ss_pred CCCC
Q 000473 717 GTAS 720 (1471)
Q Consensus 717 gH~~ 720 (1471)
+|.+
T Consensus 213 ~heG 216 (472)
T KOG0303|consen 213 AHEG 216 (472)
T ss_pred cccC
Confidence 8875
No 131
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.50 E-value=1.7e-12 Score=159.93 Aligned_cols=186 Identities=18% Similarity=0.225 Sum_probs=151.6
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+.+++.+.++..+.......-+ .+.+-.-.++.+++..+.. .++.|+.|-.|++.+.+
T Consensus 68 ~f~~~s~~~tv~~y~fps~~~~~----iL~Rftlp~r~~~v~g~g~----~iaagsdD~~vK~~~~~------------- 126 (933)
T KOG1274|consen 68 HFLTGSEQNTVLRYKFPSGEEDT----ILARFTLPIRDLAVSGSGK----MIAAGSDDTAVKLLNLD------------- 126 (933)
T ss_pred ceEEeeccceEEEeeCCCCCccc----eeeeeeccceEEEEecCCc----EEEeecCceeEEEEecc-------------
Confidence 57788888999988877654422 2233345677777666655 79999999999995543
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc--------CCCEEEEE
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH--------VAPVRQII 628 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H--------~~~V~~l~ 628 (1471)
+.....++.||.++|.+|.|+|. +++|++.+.||.|++||+.++.+.+++.+- ...+..++
T Consensus 127 --D~s~~~~lrgh~apVl~l~~~p~---------~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~a 195 (933)
T KOG1274|consen 127 --DSSQEKVLRGHDAPVLQLSYDPK---------GNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLA 195 (933)
T ss_pred --ccchheeecccCCceeeeeEcCC---------CCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeee
Confidence 23456789999999999999997 899999999999999999999877666531 45678899
Q ss_pred ECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC--CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 629 LSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG--HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 629 fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g--h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
|+|+ +..|+..+.|++|++|+..++.....+.. +...+..+.|+|.|.|||++..| |.|.|||.
T Consensus 196 W~Pk------~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~--------g~I~vWnv 261 (933)
T KOG1274|consen 196 WHPK------GGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLD--------GQILVWNV 261 (933)
T ss_pred ecCC------CCeEEeeccCCeEEEEccCCceeheeecccccccceEEEEEcCCCcEEeeeccC--------CcEEEEec
Confidence 9999 88899999999999999999988776653 34458999999999999999999 99999999
Q ss_pred CC
Q 000473 707 KT 708 (1471)
Q Consensus 707 ~t 708 (1471)
++
T Consensus 262 ~t 263 (933)
T KOG1274|consen 262 DT 263 (933)
T ss_pred cc
Confidence 97
No 132
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.50 E-value=1.1e-12 Score=144.55 Aligned_cols=205 Identities=18% Similarity=0.227 Sum_probs=147.5
Q ss_pred CccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 509 EKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
...|.+++ |+|. .+++|+.||+|++|..+. .+ . ..+ +....|.++|.+++|+.|
T Consensus 27 ~DsIS~l~------FSP~~~~~~~A~SWD~tVR~wevq~------~g----~--~~~-ka~~~~~~PvL~v~Wsdd---- 83 (347)
T KOG0647|consen 27 EDSISALA------FSPQADNLLAAGSWDGTVRIWEVQN------SG----Q--LVP-KAQQSHDGPVLDVCWSDD---- 83 (347)
T ss_pred ccchheeE------eccccCceEEecccCCceEEEEEec------CC----c--ccc-hhhhccCCCeEEEEEccC----
Confidence 46677777 5552 677999999999965541 00 0 011 345679999999999987
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF 665 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~ 665 (1471)
+..+++|+.|+.+++||+.+++ ...+..|.++|..+.|-+... -.||+|||.|+++++||.+...++.++
T Consensus 84 -----gskVf~g~~Dk~~k~wDL~S~Q-~~~v~~Hd~pvkt~~wv~~~~----~~cl~TGSWDKTlKfWD~R~~~pv~t~ 153 (347)
T KOG0647|consen 84 -----GSKVFSGGCDKQAKLWDLASGQ-VSQVAAHDAPVKTCHWVPGMN----YQCLVTGSWDKTLKFWDTRSSNPVATL 153 (347)
T ss_pred -----CceEEeeccCCceEEEEccCCC-eeeeeecccceeEEEEecCCC----cceeEecccccceeecccCCCCeeeee
Confidence 8999999999999999999995 567789999999999987621 359999999999999999875444332
Q ss_pred c-------------------------------------CCCC----CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 666 P-------------------------------------GHPN----YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 666 ~-------------------------------------gh~~----~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
. .+.+ .+++|+...|.+..+.|+-. |.+-|-
T Consensus 154 ~LPeRvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiE--------Grv~iq 225 (347)
T KOG0647|consen 154 QLPERVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIE--------GRVAIQ 225 (347)
T ss_pred eccceeeehhccCceeEEEecCCcEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeec--------ceEEEE
Confidence 2 1111 35778888887777888888 899998
Q ss_pred ECCCC--eEEEEEeCCCCCce--e-eeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 705 DVKTG--ARERVLRGTASHSM--F-DHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 705 Di~tg--~~~~~l~gH~~~v~--~-~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.+..+ +.--+++-|+.... . +-..+++.++.+.|. ++++..||++..|+-
T Consensus 226 ~id~~~~~~nFtFkCHR~~~~~~~~VYaVNsi~FhP~hgt-----------lvTaGsDGtf~FWDk 280 (347)
T KOG0647|consen 226 YIDDPNPKDNFTFKCHRSTNSVNDDVYAVNSIAFHPVHGT-----------LVTAGSDGTFSFWDK 280 (347)
T ss_pred ecCCCCccCceeEEEeccCCCCCCceEEecceEeecccce-----------EEEecCCceEEEecc
Confidence 88876 33345666663221 1 222344445554444 466777999999984
No 133
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.50 E-value=7.1e-13 Score=144.89 Aligned_cols=229 Identities=18% Similarity=0.183 Sum_probs=156.7
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcce---EEEEecCCccEEEEEEecCC
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVS---RQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~---~~~l~gH~~~V~~la~spd~ 582 (1471)
..|.+.|+++.+-.... .++++|+.||.|.||+.+.... . -...-+...++ .+.-.+|...|..+.|-|-
T Consensus 40 r~HgGsvNsL~id~teg---rymlSGgadgsi~v~Dl~n~t~--~-e~s~li~k~~c~v~~~h~~~Hky~iss~~WyP~- 112 (397)
T KOG4283|consen 40 RPHGGSVNSLQIDLTEG---RYMLSGGADGSIAVFDLQNATD--Y-EASGLIAKHKCIVAKQHENGHKYAISSAIWYPI- 112 (397)
T ss_pred ccCCCccceeeeccccc---eEEeecCCCccEEEEEeccccc--h-hhccceeheeeeccccCCccceeeeeeeEEeee-
Confidence 45788888876433322 2799999999999965542110 0 00000001111 1223579999999999886
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC-EEEEEeCCCcEEEEECCCCcE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD-CFLSVGEDFSVALASLETLRV 661 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~-~l~S~s~DgsV~lWdl~t~~~ 661 (1471)
+...+.|+|-|.+++|||..|-+..-.|+ ..+.|.+-+++|-.. .+ ++++|..|-.|+|-|+.+|.+
T Consensus 113 -------DtGmFtssSFDhtlKVWDtnTlQ~a~~F~-me~~VYshamSp~a~----sHcLiA~gtr~~~VrLCDi~SGs~ 180 (397)
T KOG4283|consen 113 -------DTGMFTSSSFDHTLKVWDTNTLQEAVDFK-MEGKVYSHAMSPMAM----SHCLIAAGTRDVQVRLCDIASGSF 180 (397)
T ss_pred -------cCceeecccccceEEEeecccceeeEEee-cCceeehhhcChhhh----cceEEEEecCCCcEEEEeccCCcc
Confidence 36789999999999999999988877775 356788888888631 23 566677788999999999999
Q ss_pred EEEecCCCCCcEEEEEcCCCCEE-EEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCCc-eeeeeeeccccccccc
Q 000473 662 ERMFPGHPNYPAKVVWDCPRGYI-ACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASHS-MFDHFCKGISMNSISG 738 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg~~L-~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~v-~~~~~~~~~~~~~~sg 738 (1471)
-+.+.||.+.|.+|.|+|...++ ++|+.| |.|++||++.. -+.+++.-|...- ..+...+. -...+.|
T Consensus 181 sH~LsGHr~~vlaV~Wsp~~e~vLatgsaD--------g~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~a-h~gkvng 251 (397)
T KOG4283|consen 181 SHTLSGHRDGVLAVEWSPSSEWVLATGSAD--------GAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTA-HYGKVNG 251 (397)
T ss_pred eeeeccccCceEEEEeccCceeEEEecCCC--------ceEEEEEeecccceeEEeecccCccCcccccccc-ccceeee
Confidence 99999999999999999988865 667777 99999999875 6788888887411 11111100 0001122
Q ss_pred eEEcCCccccccceeeccCCceEeecc
Q 000473 739 SVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 739 ~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.. + ++....+..+..|.++|.|+.
T Consensus 252 la--~-tSd~~~l~~~gtd~r~r~wn~ 275 (397)
T KOG4283|consen 252 LA--W-TSDARYLASCGTDDRIRVWNM 275 (397)
T ss_pred ee--e-cccchhhhhccCccceEEeec
Confidence 21 1 222233456666999999986
No 134
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.48 E-value=1.3e-10 Score=134.72 Aligned_cols=89 Identities=15% Similarity=0.173 Sum_probs=70.6
Q ss_pred EEEEEecCCCCcccCcCCCE-EEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE-eCCCcE
Q 000473 574 LCLAAHRMVGTAKGWSFNEV-LVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV-GEDFSV 651 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~-L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~-s~DgsV 651 (1471)
..++|+|+ +++ +++.+.++.+.+||+.+++.+..+. +...+.++.|+|+ ++.|+++ +.++.|
T Consensus 210 ~~i~~s~d---------g~~~~~~~~~~~~i~v~d~~~~~~~~~~~-~~~~~~~~~~~~~------g~~l~~~~~~~~~i 273 (300)
T TIGR03866 210 VGIKLTKD---------GKTAFVALGPANRVAVVDAKTYEVLDYLL-VGQRVWQLAFTPD------EKYLLTTNGVSNDV 273 (300)
T ss_pred cceEECCC---------CCEEEEEcCCCCeEEEEECCCCcEEEEEE-eCCCcceEEECCC------CCEEEEEcCCCCeE
Confidence 45788887 565 4555567789999999988776553 4457999999999 8888776 569999
Q ss_pred EEEECCCCcEEEEecCCCCCcEEEEEcC
Q 000473 652 ALASLETLRVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 652 ~lWdl~t~~~l~~~~gh~~~V~~v~~sp 679 (1471)
++||+++++++..+... ..++.++++|
T Consensus 274 ~v~d~~~~~~~~~~~~~-~~~~~~~~~~ 300 (300)
T TIGR03866 274 SVIDVAALKVIKSIKVG-RLPWGVVVRP 300 (300)
T ss_pred EEEECCCCcEEEEEEcc-cccceeEeCC
Confidence 99999999999988754 5568888876
No 135
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.48 E-value=3.5e-12 Score=140.81 Aligned_cols=207 Identities=13% Similarity=0.168 Sum_probs=151.6
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEE-ecCCCC
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAA-HRMVGT 584 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~-spd~~~ 584 (1471)
.+|..-|.++.+.... .++++++.|++++||+.+. +..+-.+....+.|.+.|..+.| ||.
T Consensus 10 s~h~DlihdVs~D~~G----RRmAtCSsDq~vkI~d~~~-----------~s~~W~~Ts~Wrah~~Si~rV~WAhPE--- 71 (361)
T KOG2445|consen 10 SGHKDLIHDVSFDFYG----RRMATCSSDQTVKIWDSTS-----------DSGTWSCTSSWRAHDGSIWRVVWAHPE--- 71 (361)
T ss_pred cCCcceeeeeeecccC----ceeeeccCCCcEEEEeccC-----------CCCceEEeeeEEecCCcEEEEEecCcc---
Confidence 4677778887633222 2999999999999955431 11222445567889999999999 555
Q ss_pred cccCcCCCEEEEEECCCcEEEEECC---------CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLG---------SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALAS 655 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~---------tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd 655 (1471)
+|+.+++++.|+++.||.-. ......++....+.|+.+.|.|.+ .|-.+++++.||++|||+
T Consensus 72 -----fGqvvA~cS~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~FaP~h----lGLklA~~~aDG~lRIYE 142 (361)
T KOG2445|consen 72 -----FGQVVATCSYDRTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVKFAPKH----LGLKLAAASADGILRIYE 142 (361)
T ss_pred -----ccceEEEEecCCceeeeeecccccccccceeEEEEEeecCCcceeEEEecchh----cceEEEEeccCcEEEEEe
Confidence 59999999999999999862 123456777888999999999973 266888999999999986
Q ss_pred CCCC-------------------------------------------------------------------cEEEEecCC
Q 000473 656 LETL-------------------------------------------------------------------RVERMFPGH 668 (1471)
Q Consensus 656 l~t~-------------------------------------------------------------------~~l~~~~gh 668 (1471)
.-+. ..+..+.+|
T Consensus 143 A~dp~nLs~W~Lq~Ei~~~~~pp~~~~~~~~CvsWn~sr~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~ 222 (361)
T KOG2445|consen 143 APDPMNLSQWTLQHEIQNVIDPPGKNKQPCFCVSWNPSRMHEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELPDH 222 (361)
T ss_pred cCCccccccchhhhhhhhccCCcccccCcceEEeeccccccCceEEEEcccCCccccceEEEEecCCcceeeeehhcCCC
Confidence 4320 122345588
Q ss_pred CCCcEEEEEcCCC----CEEEEEEcCCCCCCCCCCEEEEEECCCC--------------------eEEEEEeCCCCCcee
Q 000473 669 PNYPAKVVWDCPR----GYIACLCRDHSRTSDAVDVLFIWDVKTG--------------------ARERVLRGTASHSMF 724 (1471)
Q Consensus 669 ~~~V~~v~~spdg----~~L~sgs~D~sg~~D~~gtV~VWDi~tg--------------------~~~~~l~gH~~~v~~ 724 (1471)
..+|+.++|.|+- ..|+++|.| | |+||.++.. +.+..+.+|.++|..
T Consensus 223 ~dpI~di~wAPn~Gr~y~~lAvA~kD--------g-v~I~~v~~~~s~i~~ee~~~~~~~~~l~v~~vs~~~~H~~~VWr 293 (361)
T KOG2445|consen 223 TDPIRDISWAPNIGRSYHLLAVATKD--------G-VRIFKVKVARSAIEEEEVLAPDLMTDLPVEKVSELDDHNGEVWR 293 (361)
T ss_pred CCcceeeeeccccCCceeeEEEeecC--------c-EEEEEEeeccchhhhhcccCCCCccccceEEeeeccCCCCceEE
Confidence 8999999999973 478999998 8 999999731 245667899999998
Q ss_pred eeeeeccccccccceEEcCCccccccceeeccCCceEeecc
Q 000473 725 DHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 725 ~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
+.|. .+|++ ..+. ..||++|.|+.
T Consensus 294 v~wN-------mtGti-----LsSt-----GdDG~VRLWka 317 (361)
T KOG2445|consen 294 VRWN-------MTGTI-----LSST-----GDDGCVRLWKA 317 (361)
T ss_pred EEEe-------eeeeE-----Eeec-----CCCceeeehhh
Confidence 8766 24443 3333 34999999975
No 136
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.47 E-value=3.3e-13 Score=144.65 Aligned_cols=198 Identities=18% Similarity=0.234 Sum_probs=149.3
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC----EEEEEEcCCcEEEEEecccccCCCCCC
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY----AIVYGFFSGEIEVIQFDLFERHNSPGA 553 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~----~lv~Gs~DG~I~V~~~~~l~~~d~~~~ 553 (1471)
++..+.++.+-+|...+.+- ........|...|+++. |.|+ .|+||+.||+|.|..++.- .
T Consensus 73 LAScsYDgkVIiWke~~g~w--~k~~e~~~h~~SVNsV~------wapheygl~LacasSDG~vsvl~~~~~-------g 137 (299)
T KOG1332|consen 73 LASCSYDGKVIIWKEENGRW--TKAYEHAAHSASVNSVA------WAPHEYGLLLACASSDGKVSVLTYDSS-------G 137 (299)
T ss_pred eeEeecCceEEEEecCCCch--hhhhhhhhhcccceeec------ccccccceEEEEeeCCCcEEEEEEcCC-------C
Confidence 66677888889998887532 33455678999999987 6665 7999999999999777621 1
Q ss_pred ccccCCcceEEEEecCCccEEEEEEecCCCCccc-------CcCCCEEEEEECCCcEEEEECCCCc--eEEEEeccCCCE
Q 000473 554 SLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKG-------WSFNEVLVSGSMDCSIRIWDLGSGN--LITVMHHHVAPV 624 (1471)
Q Consensus 554 ~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~-------~~~~~~L~SGs~DgtI~lWDl~tg~--~l~~~~~H~~~V 624 (1471)
.|+. .+....|.-.|+++.|.|.. .+| ...-+.|+||+.|..|++|+...++ +-++|.+|.+-|
T Consensus 138 ~w~t-----~ki~~aH~~GvnsVswapa~--~~g~~~~~~~~~~~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~dwV 210 (299)
T KOG1332|consen 138 GWTT-----SKIVFAHEIGVNSVSWAPAS--APGSLVDQGPAAKVKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKDWV 210 (299)
T ss_pred Cccc-----hhhhhccccccceeeecCcC--CCccccccCcccccceeeccCCccceeeeecCCcchhhhhhhhhcchhh
Confidence 2332 23456899999999999862 111 0112679999999999999998764 446799999999
Q ss_pred EEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cE--EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEE
Q 000473 625 RQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RV--ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVL 701 (1471)
Q Consensus 625 ~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~--l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV 701 (1471)
+.++|.|..... ..+++|++.||+|.||..+.. +. ...+......++.+.||+.|+.|++++.| +.|
T Consensus 211 RDVAwaP~~gl~--~s~iAS~SqDg~viIwt~~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~Gd--------Nkv 280 (299)
T KOG1332|consen 211 RDVAWAPSVGLP--KSTIASCSQDGTVIIWTKDEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGD--------NKV 280 (299)
T ss_pred hhhhhccccCCC--ceeeEEecCCCcEEEEEecCccCcccccccccCCcceEEEEEeccccEEEEecCC--------cEE
Confidence 999999974221 248999999999999987521 11 12333456789999999999999999988 999
Q ss_pred EEEECC
Q 000473 702 FIWDVK 707 (1471)
Q Consensus 702 ~VWDi~ 707 (1471)
++|.-.
T Consensus 281 tlwke~ 286 (299)
T KOG1332|consen 281 TLWKEN 286 (299)
T ss_pred EEEEeC
Confidence 999743
No 137
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.47 E-value=5.7e-13 Score=158.23 Aligned_cols=165 Identities=16% Similarity=0.243 Sum_probs=118.0
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE--EecCCccEEEEEEec
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY--FLGHTGAVLCLAAHR 580 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~--l~gH~~~V~~la~sp 580 (1471)
..+..|...|..+..++... .||+++.|.+|+. | |++....... +.||++.|.+++|+|
T Consensus 94 k~~~aH~nAifDl~wapge~----~lVsasGDsT~r~--W-------------dvk~s~l~G~~~~~GH~~SvkS~cf~~ 154 (720)
T KOG0321|consen 94 KKPLAHKNAIFDLKWAPGES----LLVSASGDSTIRP--W-------------DVKTSRLVGGRLNLGHTGSVKSECFMP 154 (720)
T ss_pred cccccccceeEeeccCCCce----eEEEccCCceeee--e-------------eeccceeecceeecccccccchhhhcc
Confidence 44578889998888666554 8999999999998 3 3444455544 899999999999999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCc---------------------------eEEEEeccCCCEEE---EEEC
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN---------------------------LITVMHHHVAPVRQ---IILS 630 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~---------------------------~l~~~~~H~~~V~~---l~fs 630 (1471)
+ +...|++|+.||.|.|||++-.. .++....|...|.+ +.+.
T Consensus 155 ~--------n~~vF~tGgRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~f 226 (720)
T KOG0321|consen 155 T--------NPAVFCTGGRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLF 226 (720)
T ss_pred C--------CCcceeeccCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhccccccccccCceeeeeEEEEE
Confidence 7 37899999999999999986211 12233345444554 4444
Q ss_pred CCCCCCCCCCEEEEEeC-CCcEEEEECCCCcEE--------EEecCC---CCCcEEEEEcCCCCEEEEEEcCCCCCCCCC
Q 000473 631 PPQTEHPWSDCFLSVGE-DFSVALASLETLRVE--------RMFPGH---PNYPAKVVWDCPRGYIACLCRDHSRTSDAV 698 (1471)
Q Consensus 631 pd~~~~~~~~~l~S~s~-DgsV~lWdl~t~~~l--------~~~~gh---~~~V~~v~~spdg~~L~sgs~D~sg~~D~~ 698 (1471)
-| ...|+|+|. |+.|+|||++...+. ..++.| .-.+.++..+..|.+|++.|.|
T Consensus 227 kD------e~tlaSaga~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD-------- 292 (720)
T KOG0321|consen 227 KD------ESTLASAGAADSTIKVWDLRKNYTAYRQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTD-------- 292 (720)
T ss_pred ec------cceeeeccCCCcceEEEeecccccccccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecC--------
Confidence 55 578998887 999999999875432 222333 2246677777778888888888
Q ss_pred CEEEEEECCC
Q 000473 699 DVLFIWDVKT 708 (1471)
Q Consensus 699 gtV~VWDi~t 708 (1471)
++||.||+.+
T Consensus 293 ~sIy~ynm~s 302 (720)
T KOG0321|consen 293 NSIYFYNMRS 302 (720)
T ss_pred CcEEEEeccc
Confidence 6666666643
No 138
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.45 E-value=6.1e-12 Score=138.95 Aligned_cols=238 Identities=16% Similarity=0.170 Sum_probs=152.7
Q ss_pred EEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccccc
Q 000473 19 VTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMGKS 98 (1471)
Q Consensus 19 Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~~~ 98 (1471)
-+|+.|++-|.+||.|+.||.|.+||+.+ ..+-.+|.+|.-+|++|+ +
T Consensus 26 a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T-----~~iar~lsaH~~pi~sl~---------------------------W 73 (405)
T KOG1273|consen 26 AECCQFSRWGDYLAVGCANGRVVIYDFDT-----FRIARMLSAHVRPITSLC---------------------------W 73 (405)
T ss_pred cceEEeccCcceeeeeccCCcEEEEEccc-----cchhhhhhccccceeEEE---------------------------e
Confidence 68999999999999999999999999986 345578899999999999 6
Q ss_pred cCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCC--eEEEEcc----eecccCCccccccc----
Q 000473 99 SLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNP--RYVCIGC----CFIDTNQLSDHHSF---- 168 (1471)
Q Consensus 99 s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~--~ll~~G~----~~id~~~~~~~h~~---- 168 (1471)
|+|+..|+|+|.|..+.+||+.+|.|+.+..++ +|.--+.+.|.. ..+++-. ..++... ..|+.
T Consensus 74 S~dgr~LltsS~D~si~lwDl~~gs~l~rirf~----spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~--~~h~~Lp~d 147 (405)
T KOG1273|consen 74 SRDGRKLLTSSRDWSIKLWDLLKGSPLKRIRFD----SPVWGAQWHPRKRNKCVATIMEESPVVIDFSD--PKHSVLPKD 147 (405)
T ss_pred cCCCCEeeeecCCceeEEEeccCCCceeEEEcc----CccceeeeccccCCeEEEEEecCCcEEEEecC--CceeeccCC
Confidence 778888999999999999999999999887764 333333333222 2222111 1111111 11222
Q ss_pred -----cccccc-ccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcE
Q 000473 169 -----ESVEGD-LVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRL 242 (1471)
Q Consensus 169 -----~~i~~~-~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V 242 (1471)
+..-.+ ..-..+++...+..+|.+.++|..|++++..+.-.. +..|..+-|+- .+ ..+++-++|..|
T Consensus 148 ~d~dln~sas~~~fdr~g~yIitGtsKGkllv~~a~t~e~vas~rits--~~~IK~I~~s~---~g--~~liiNtsDRvI 220 (405)
T KOG1273|consen 148 DDGDLNSSASHGVFDRRGKYIITGTSKGKLLVYDAETLECVASFRITS--VQAIKQIIVSR---KG--RFLIINTSDRVI 220 (405)
T ss_pred CccccccccccccccCCCCEEEEecCcceEEEEecchheeeeeeeech--heeeeEEEEec---cC--cEEEEecCCceE
Confidence 111111 112234556678889999999999998887654311 23477777662 11 234444999999
Q ss_pred EEEECCCCCCcccccCCCcccCC-CcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCC
Q 000473 243 QLVPISKESHLDREEGNGLCKSS-SQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSG 309 (1471)
Q Consensus 243 ~vW~l~~~~~~~~~~~~~l~~~e-~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~ 309 (1471)
|++++..-... +..++. +.+ |-.++|+..+. ..+.|+.+|.++..++...--+-+|+..
T Consensus 221 R~ye~~di~~~-~r~~e~--e~~~K~qDvVNk~~W-----k~ccfs~dgeYv~a~s~~aHaLYIWE~~ 280 (405)
T KOG1273|consen 221 RTYEISDIDDE-GRDGEV--EPEHKLQDVVNKLQW-----KKCCFSGDGEYVCAGSARAHALYIWEKS 280 (405)
T ss_pred EEEehhhhccc-CccCCc--ChhHHHHHHHhhhhh-----hheeecCCccEEEeccccceeEEEEecC
Confidence 99998743110 011111 111 33445555554 6788888898887766544333345543
No 139
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.45 E-value=1e-12 Score=151.56 Aligned_cols=215 Identities=16% Similarity=0.204 Sum_probs=165.9
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc----C-CcceEEEE-ecCCccEEEE
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV----N-SHVSRQYF-LGHTGAVLCL 576 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~----~-s~~~~~~l-~gH~~~V~~l 576 (1471)
.....|.-.++++++.++.. +.+.++.||+|.- |+.+.+-+... .|.. . .+.+.+.- .+|...+.++
T Consensus 136 ~~~~~H~~s~~~vals~d~~----~~fsask~g~i~k--w~v~tgk~~~~-i~~~~ev~k~~~~~~k~~r~~h~keil~~ 208 (479)
T KOG0299|consen 136 RVIGKHQLSVTSVALSPDDK----RVFSASKDGTILK--WDVLTGKKDRY-IIERDEVLKSHGNPLKESRKGHVKEILTL 208 (479)
T ss_pred eeeccccCcceEEEeecccc----ceeecCCCcceee--eehhcCccccc-ccccchhhhhccCCCCcccccccceeEEE
Confidence 44578889999998666665 6899999997766 65433321000 1111 0 11111111 4899999999
Q ss_pred EEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEEC
Q 000473 577 AAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASL 656 (1471)
Q Consensus 577 a~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl 656 (1471)
++++| +++|++|+.|..|.|||..+.+.++.|.+|.+.|.+++|... .+.+.+++.|++|++|++
T Consensus 209 avS~D---------gkylatgg~d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~fr~g------t~~lys~s~Drsvkvw~~ 273 (479)
T KOG0299|consen 209 AVSSD---------GKYLATGGRDRHVQIWDCDTLEHVKVFKGHRGAVSSLAFRKG------TSELYSASADRSVKVWSI 273 (479)
T ss_pred EEcCC---------CcEEEecCCCceEEEecCcccchhhcccccccceeeeeeecC------ccceeeeecCCceEEEeh
Confidence 99998 899999999999999999999999999999999999999876 678999999999999999
Q ss_pred CCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccc
Q 000473 657 ETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSI 736 (1471)
Q Consensus 657 ~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~ 736 (1471)
+....+.++-||.+.|..+.-..-++.+-+|+.| +++++|++.. +.--.+.||.+.+-++.|.++
T Consensus 274 ~~~s~vetlyGHqd~v~~IdaL~reR~vtVGgrD--------rT~rlwKi~e-esqlifrg~~~sidcv~~In~------ 338 (479)
T KOG0299|consen 274 DQLSYVETLYGHQDGVLGIDALSRERCVTVGGRD--------RTVRLWKIPE-ESQLIFRGGEGSIDCVAFIND------ 338 (479)
T ss_pred hHhHHHHHHhCCccceeeechhcccceEEecccc--------ceeEEEeccc-cceeeeeCCCCCeeeEEEecc------
Confidence 9999999999999999999988888888888898 9999999943 333466788766666655532
Q ss_pred cceEEcCCccccccceeeccCCceEeeccc
Q 000473 737 SGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 737 sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
.++++-+.||.|-.|.+.
T Consensus 339 ------------~HfvsGSdnG~IaLWs~~ 356 (479)
T KOG0299|consen 339 ------------EHFVSGSDNGSIALWSLL 356 (479)
T ss_pred ------------cceeeccCCceEEEeeec
Confidence 233344458999999874
No 140
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.45 E-value=3e-12 Score=141.85 Aligned_cols=199 Identities=17% Similarity=0.189 Sum_probs=145.3
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+++.++.+++++...... .. ...|..++.++++..+. .+++|+.||.|+.++..
T Consensus 28 LLvssWDgslrlYdv~~~~l----~~-~~~~~~plL~c~F~d~~-----~~~~G~~dg~vr~~Dln-------------- 83 (323)
T KOG1036|consen 28 LLVSSWDGSLRLYDVPANSL----KL-KFKHGAPLLDCAFADES-----TIVTGGLDGQVRRYDLN-------------- 83 (323)
T ss_pred EEEEeccCcEEEEeccchhh----hh-heecCCceeeeeccCCc-----eEEEeccCceEEEEEec--------------
Confidence 55666778888888766422 22 23467788887744433 69999999999995544
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
++ ....+..|..+|.|+.+.+. ...+++||+|++|++||.+.......+. ....|.++...
T Consensus 84 -~~-~~~~igth~~~i~ci~~~~~---------~~~vIsgsWD~~ik~wD~R~~~~~~~~d-~~kkVy~~~v~------- 144 (323)
T KOG1036|consen 84 -TG-NEDQIGTHDEGIRCIEYSYE---------VGCVISGSWDKTIKFWDPRNKVVVGTFD-QGKKVYCMDVS------- 144 (323)
T ss_pred -CC-cceeeccCCCceEEEEeecc---------CCeEEEcccCccEEEEeccccccccccc-cCceEEEEecc-------
Confidence 22 23456679999999999986 7889999999999999998654444443 33467777654
Q ss_pred CCCEEEEEeCCCcEEEEECCCC---------------cEEE---------------------------------EecCCC
Q 000473 638 WSDCFLSVGEDFSVALASLETL---------------RVER---------------------------------MFPGHP 669 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~---------------~~l~---------------------------------~~~gh~ 669 (1471)
++.|+.|+.|..|.+||+++. ++++ .|..|.
T Consensus 145 -g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSieGRVavE~~d~s~~~~skkyaFkCHr 223 (323)
T KOG1036|consen 145 -GNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIEGRVAVEYFDDSEEAQSKKYAFKCHR 223 (323)
T ss_pred -CCEEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeecceEEEEccCCchHHhhhceeEEeee
Confidence 447777777888888887653 1222 222332
Q ss_pred C---------CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 670 N---------YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 670 ~---------~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
. +|++++|+|-.++|+||+.| |.|.+||+.+.+.+..+.+-...+-...|+
T Consensus 224 ~~~~~~~~~yPVNai~Fhp~~~tfaTgGsD--------G~V~~Wd~~~rKrl~q~~~~~~SI~slsfs 283 (323)
T KOG1036|consen 224 LSEKDTEIIYPVNAIAFHPIHGTFATGGSD--------GIVNIWDLFNRKRLKQLAKYETSISSLSFS 283 (323)
T ss_pred cccCCceEEEEeceeEeccccceEEecCCC--------ceEEEccCcchhhhhhccCCCCceEEEEec
Confidence 1 68999999999999999999 999999999999999988875555555566
No 141
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.44 E-value=4.4e-12 Score=143.65 Aligned_cols=174 Identities=17% Similarity=0.200 Sum_probs=140.3
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGT 584 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~ 584 (1471)
+.+|.+.|..+.+.+ |+-+.+++|++|.+|.||... .++ ...+. -+++..|.||..+|.-++|||.
T Consensus 77 v~GHt~~vLDi~w~P---fnD~vIASgSeD~~v~vW~IP--e~~----l~~~l--tepvv~L~gH~rrVg~V~wHPt--- 142 (472)
T KOG0303|consen 77 VCGHTAPVLDIDWCP---FNDCVIASGSEDTKVMVWQIP--ENG----LTRDL--TEPVVELYGHQRRVGLVQWHPT--- 142 (472)
T ss_pred ccCccccccccccCc---cCCceeecCCCCceEEEEECC--Ccc----cccCc--ccceEEEeecceeEEEEeeccc---
Confidence 578999888876433 222389999999999995432 111 00111 2678899999999999999996
Q ss_pred cccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEE
Q 000473 585 AKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERM 664 (1471)
Q Consensus 585 ~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~ 664 (1471)
-.+.|+|++.|.+|.+||+.+|+.+-++. |...|+++.|+.+ |..+++.+.|+.|||||.++++.+..
T Consensus 143 -----A~NVLlsag~Dn~v~iWnv~tgeali~l~-hpd~i~S~sfn~d------Gs~l~TtckDKkvRv~dpr~~~~v~e 210 (472)
T KOG0303|consen 143 -----APNVLLSAGSDNTVSIWNVGTGEALITLD-HPDMVYSMSFNRD------GSLLCTTCKDKKVRVIDPRRGTVVSE 210 (472)
T ss_pred -----chhhHhhccCCceEEEEeccCCceeeecC-CCCeEEEEEeccC------CceeeeecccceeEEEcCCCCcEeee
Confidence 27899999999999999999999888887 9999999999999 99999999999999999999999998
Q ss_pred ecCCCC-CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 665 FPGHPN-YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 665 ~~gh~~-~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
-.+|.+ ....+.|-.++.++-||-. .-++ ..+-+||..+-
T Consensus 211 ~~~heG~k~~Raifl~~g~i~tTGfs---r~se--Rq~aLwdp~nl 251 (472)
T KOG0303|consen 211 GVAHEGAKPARAIFLASGKIFTTGFS---RMSE--RQIALWDPNNL 251 (472)
T ss_pred cccccCCCcceeEEeccCceeeeccc---cccc--cceeccCcccc
Confidence 888876 5677889889985544432 3444 89999997653
No 142
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.43 E-value=3.5e-12 Score=139.52 Aligned_cols=168 Identities=19% Similarity=0.289 Sum_probs=129.1
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
.+|.-.|++....| +....+.+++.|.+++| || ..+.+....|+ -.+.|.+-+++|.-
T Consensus 98 ~~Hky~iss~~WyP---~DtGmFtssSFDhtlKV--WD-------------tnTlQ~a~~F~-me~~VYshamSp~a--- 155 (397)
T KOG4283|consen 98 NGHKYAISSAIWYP---IDTGMFTSSSFDHTLKV--WD-------------TNTLQEAVDFK-MEGKVYSHAMSPMA--- 155 (397)
T ss_pred ccceeeeeeeEEee---ecCceeecccccceEEE--ee-------------cccceeeEEee-cCceeehhhcChhh---
Confidence 45666677654322 11127889999999999 33 33333333343 34568888888861
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEE
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERM 664 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~ 664 (1471)
....++++|..|-.|++-|+.+|.+-+++.+|.+.|.++.|+|.. .-.|++|+.||.|++||++.- .|.+.
T Consensus 156 ---~sHcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~~vlaV~Wsp~~-----e~vLatgsaDg~irlWDiRrasgcf~~ 227 (397)
T KOG4283|consen 156 ---MSHCLIAAGTRDVQVRLCDIASGSFSHTLSGHRDGVLAVEWSPSS-----EWVLATGSADGAIRLWDIRRASGCFRV 227 (397)
T ss_pred ---hcceEEEEecCCCcEEEEeccCCcceeeeccccCceEEEEeccCc-----eeEEEecCCCceEEEEEeecccceeEE
Confidence 114588899999999999999999999999999999999999983 337889999999999999864 34444
Q ss_pred e--------------cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 665 F--------------PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 665 ~--------------~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
+ ..|.+.|..++|+.++.++++.+.| ..+++|++.+|+-
T Consensus 228 lD~hn~k~~p~~~~n~ah~gkvngla~tSd~~~l~~~gtd--------~r~r~wn~~~G~n 280 (397)
T KOG4283|consen 228 LDQHNTKRPPILKTNTAHYGKVNGLAWTSDARYLASCGTD--------DRIRVWNMESGRN 280 (397)
T ss_pred eecccCccCccccccccccceeeeeeecccchhhhhccCc--------cceEEeecccCcc
Confidence 3 3566778999999999999999998 8999999999864
No 143
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.42 E-value=8.9e-12 Score=153.75 Aligned_cols=229 Identities=17% Similarity=0.180 Sum_probs=175.5
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+..+++.++.|+..+... .-.++..+...|.+++..+. .+++|+.+++|.++.++.
T Consensus 28 i~tcgsdg~ir~~~~~sd~e---~P~ti~~~g~~v~~ia~~s~------~f~~~s~~~tv~~y~fps------------- 85 (933)
T KOG1274|consen 28 ICTCGSDGDIRKWKTNSDEE---EPETIDISGELVSSIACYSN------HFLTGSEQNTVLRYKFPS------------- 85 (933)
T ss_pred EEEecCCCceEEeecCCccc---CCchhhccCceeEEEeeccc------ceEEeeccceEEEeeCCC-------------
Confidence 45566778889998766511 11122236677777663333 589999999999976652
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
+.....|..-+-++.+++|+.+ |.+++.||.|-.|++-++.+......+.+|.++|.+|.|+|.
T Consensus 86 --~~~~~iL~Rftlp~r~~~v~g~---------g~~iaagsdD~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~----- 149 (933)
T KOG1274|consen 86 --GEEDTILARFTLPIRDLAVSGS---------GKMIAAGSDDTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDPK----- 149 (933)
T ss_pred --CCccceeeeeeccceEEEEecC---------CcEEEeecCceeEEEEeccccchheeecccCCceeeeeEcCC-----
Confidence 2222233334567899999876 899999999999999999999999999999999999999999
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCC--------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGH--------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh--------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
+++||+.+.||.|++||++++.+..++.+- ...+..++|+|++..+++-+.| +.|++|+..++
T Consensus 150 -~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d--------~~Vkvy~r~~w 220 (933)
T KOG1274|consen 150 -GNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVD--------NTVKVYSRKGW 220 (933)
T ss_pred -CCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccC--------CeEEEEccCCc
Confidence 999999999999999999999887766532 2346789999998888888888 99999999999
Q ss_pred eEEEEEeCCCC--CceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccc
Q 000473 710 ARERVLRGTAS--HSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDER 770 (1471)
Q Consensus 710 ~~~~~l~gH~~--~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~ 770 (1471)
+....+.+-.. .....+|. ++|.+.+++.+ |+.|.+|+.+..++
T Consensus 221 e~~f~Lr~~~~ss~~~~~~ws------------PnG~YiAAs~~-----~g~I~vWnv~t~~~ 266 (933)
T KOG1274|consen 221 ELQFKLRDKLSSSKFSDLQWS------------PNGKYIAASTL-----DGQILVWNVDTHER 266 (933)
T ss_pred eeheeecccccccceEEEEEc------------CCCcEEeeecc-----CCcEEEEecccchh
Confidence 99888876332 23334444 34566666666 99999999986555
No 144
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=99.42 E-value=1.2e-12 Score=154.23 Aligned_cols=211 Identities=19% Similarity=0.194 Sum_probs=157.3
Q ss_pred cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 504 DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 504 ~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
++..|...++++.+.+... -++++++||.|.+|+++. ...-....-+++.+|.+|.++|.|+++.++
T Consensus 289 tl~s~~d~ir~l~~~~sep----~lit~sed~~lk~WnLqk-------~~~s~~~~~epi~tfraH~gPVl~v~v~~n-- 355 (577)
T KOG0642|consen 289 TLRSHDDCIRALAFHPSEP----VLITASEDGTLKLWNLQK-------AKKSAEKDVEPILTFRAHEGPVLCVVVPSN-- 355 (577)
T ss_pred eeecchhhhhhhhcCCCCC----eEEEeccccchhhhhhcc-------cCCccccceeeeEEEecccCceEEEEecCC--
Confidence 3455666677766333222 799999999999955531 111122334788999999999999999987
Q ss_pred CcccCcCCCEEEEEECCCcEEEEECCC----------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEE
Q 000473 584 TAKGWSFNEVLVSGSMDCSIRIWDLGS----------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVAL 653 (1471)
Q Consensus 584 ~~~~~~~~~~L~SGs~DgtI~lWDl~t----------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~l 653 (1471)
++++.||+.||+|+.|++.. ..+...+.||++.|+.+++++. .+.|++++.||+|++
T Consensus 356 -------~~~~ysgg~Dg~I~~w~~p~n~dp~ds~dp~vl~~~l~Ghtdavw~l~~s~~------~~~Llscs~DgTvr~ 422 (577)
T KOG0642|consen 356 -------GEHCYSGGIDGTIRCWNLPPNQDPDDSYDPSVLSGTLLGHTDAVWLLALSST------KDRLLSCSSDGTVRL 422 (577)
T ss_pred -------ceEEEeeccCceeeeeccCCCCCcccccCcchhccceeccccceeeeeeccc------ccceeeecCCceEEe
Confidence 89999999999999996541 1245788999999999999998 678999999999999
Q ss_pred EECCCCcE--------------------------------------------EEEecC-----C--CCCcEEEEEcCCCC
Q 000473 654 ASLETLRV--------------------------------------------ERMFPG-----H--PNYPAKVVWDCPRG 682 (1471)
Q Consensus 654 Wdl~t~~~--------------------------------------------l~~~~g-----h--~~~V~~v~~spdg~ 682 (1471)
|+.....+ +..+.. . ...+..|.++|...
T Consensus 423 w~~~~~~~~~f~~~~e~g~Plsvd~~ss~~a~~~~s~~~~~~~~~~~ev~s~~~~~~s~~~~~~~~~~~in~vVs~~~~~ 502 (577)
T KOG0642|consen 423 WEPTEESPCTFGEPKEHGYPLSVDRTSSRPAHSLASFRFGYTSIDDMEVVSDLLIFESSASPGPRRYPQINKVVSHPTAD 502 (577)
T ss_pred eccCCcCccccCCccccCCcceEeeccchhHhhhhhcccccccchhhhhhhheeeccccCCCcccccCccceEEecCCCC
Confidence 98754322 011100 0 12467789999999
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEe
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQ 762 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~ 762 (1471)
+.+++..| +.|+++|..+|.++..+..|...+..+.+.+ +|-. +...+.|++++.
T Consensus 503 ~~~~~hed--------~~Ir~~dn~~~~~l~s~~a~~~svtslai~~------------ng~~-----l~s~s~d~sv~l 557 (577)
T KOG0642|consen 503 ITFTAHED--------RSIRFFDNKTGKILHSMVAHKDSVTSLAIDP------------NGPY-----LMSGSHDGSVRL 557 (577)
T ss_pred eeEecccC--------CceecccccccccchheeeccceecceeecC------------CCce-----EEeecCCceeeh
Confidence 99999998 9999999999999999999988777775442 1122 223344999999
Q ss_pred ecc
Q 000473 763 SQI 765 (1471)
Q Consensus 763 w~l 765 (1471)
|++
T Consensus 558 ~kl 560 (577)
T KOG0642|consen 558 WKL 560 (577)
T ss_pred hhc
Confidence 986
No 145
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.42 E-value=2.9e-12 Score=147.94 Aligned_cols=203 Identities=15% Similarity=0.078 Sum_probs=161.3
Q ss_pred cccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcc
Q 000473 482 FCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV 561 (1471)
Q Consensus 482 ~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~ 561 (1471)
...+.|++||+.. ..+...+++|...|+++.|.-... .|++++..|.|.|... .++.
T Consensus 98 G~~~~Vkiwdl~~----kl~hr~lkdh~stvt~v~YN~~De----yiAsvs~gGdiiih~~---------------~t~~ 154 (673)
T KOG4378|consen 98 GQSGCVKIWDLRA----KLIHRFLKDHQSTVTYVDYNNTDE----YIASVSDGGDIIIHGT---------------KTKQ 154 (673)
T ss_pred CcCceeeehhhHH----HHHhhhccCCcceeEEEEecCCcc----eeEEeccCCcEEEEec---------------ccCc
Confidence 3557899999874 244566789999999998765555 7999999999998432 2333
Q ss_pred eEEEEecC-CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEE-eccCCCEEEEEECCCCCCCCCC
Q 000473 562 SRQYFLGH-TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVM-HHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 562 ~~~~l~gH-~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~-~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
....|..- ...|.-|.|+|. ...+|.+++.+|.|.+||+....+++.+ ..|..|...|+|+|.+ .
T Consensus 155 ~tt~f~~~sgqsvRll~ys~s--------kr~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsn-----e 221 (673)
T KOG4378|consen 155 KTTTFTIDSGQSVRLLRYSPS--------KRFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSN-----E 221 (673)
T ss_pred cccceecCCCCeEEEeecccc--------cceeeEeeccCCeEEEEeccCCCcccchhhhccCCcCcceecCCc-----c
Confidence 33344332 345567889985 2568889999999999999877766544 5799999999999985 6
Q ss_pred CEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCC
Q 000473 640 DCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGT 718 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH 718 (1471)
.+|+++|.|+.|.+||....+....+. ...+...|+|.++|-+|+.|... |.|+.||++. ..++.++..|
T Consensus 222 ~l~vsVG~Dkki~~yD~~s~~s~~~l~-y~~Plstvaf~~~G~~L~aG~s~--------G~~i~YD~R~~k~Pv~v~sah 292 (673)
T KOG4378|consen 222 ALLVSVGYDKKINIYDIRSQASTDRLT-YSHPLSTVAFSECGTYLCAGNSK--------GELIAYDMRSTKAPVAVRSAH 292 (673)
T ss_pred ceEEEecccceEEEeecccccccceee-ecCCcceeeecCCceEEEeecCC--------ceEEEEecccCCCCceEeeec
Confidence 699999999999999999776655443 35578999999999999999888 9999999985 6789999999
Q ss_pred CCCceeeeeee
Q 000473 719 ASHSMFDHFCK 729 (1471)
Q Consensus 719 ~~~v~~~~~~~ 729 (1471)
.+.|+++.|-+
T Consensus 293 ~~sVt~vafq~ 303 (673)
T KOG4378|consen 293 DASVTRVAFQP 303 (673)
T ss_pred ccceeEEEeee
Confidence 99999988774
No 146
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.42 E-value=8.1e-12 Score=137.95 Aligned_cols=170 Identities=19% Similarity=0.324 Sum_probs=130.6
Q ss_pred ccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC
Q 000473 523 YAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD 600 (1471)
Q Consensus 523 f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D 600 (1471)
|++. .++.|+.||.+.||+++ +...-+.|.+|..+|+|++|++| |+.|+|+|.|
T Consensus 31 Fs~~G~~lAvGc~nG~vvI~D~~---------------T~~iar~lsaH~~pi~sl~WS~d---------gr~LltsS~D 86 (405)
T KOG1273|consen 31 FSRWGDYLAVGCANGRVVIYDFD---------------TFRIARMLSAHVRPITSLCWSRD---------GRKLLTSSRD 86 (405)
T ss_pred eccCcceeeeeccCCcEEEEEcc---------------ccchhhhhhccccceeEEEecCC---------CCEeeeecCC
Confidence 5543 89999999999995554 23445778999999999999998 8999999999
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCC-----------------------------------------CCCCCC
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQ-----------------------------------------TEHPWS 639 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~-----------------------------------------~~~~~~ 639 (1471)
..|++||+..|.+++.+. ...+|+...|+|.. ...+.|
T Consensus 87 ~si~lwDl~~gs~l~rir-f~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g 165 (405)
T KOG1273|consen 87 WSIKLWDLLKGSPLKRIR-FDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRG 165 (405)
T ss_pred ceeEEEeccCCCceeEEE-ccCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCC
Confidence 999999999999888774 34566666666531 125678
Q ss_pred CEEEEEeCCCcEEEEECCCCcEEEEecCCC-CC-----------------------------------------------
Q 000473 640 DCFLSVGEDFSVALASLETLRVERMFPGHP-NY----------------------------------------------- 671 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~-~~----------------------------------------------- 671 (1471)
+++.+|...|.+.++|..+.+++..+.--. ..
T Consensus 166 ~yIitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g~~liiNtsDRvIR~ye~~di~~~~r~~e~e~~~K~qD 245 (405)
T KOG1273|consen 166 KYIITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSRKGRFLIINTSDRVIRTYEISDIDDEGRDGEVEPEHKLQD 245 (405)
T ss_pred CEEEEecCcceEEEEecchheeeeeeeechheeeeEEEEeccCcEEEEecCCceEEEEehhhhcccCccCCcChhHHHHH
Confidence 899999999999999988776655443111 11
Q ss_pred ------cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCcee
Q 000473 672 ------PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMF 724 (1471)
Q Consensus 672 ------V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~ 724 (1471)
-.+++|+.+|.|+..|+.- ...+|||.-..|.+++.|.|..+....
T Consensus 246 vVNk~~Wk~ccfs~dgeYv~a~s~~-------aHaLYIWE~~~GsLVKILhG~kgE~l~ 297 (405)
T KOG1273|consen 246 VVNKLQWKKCCFSGDGEYVCAGSAR-------AHALYIWEKSIGSLVKILHGTKGEELL 297 (405)
T ss_pred HHhhhhhhheeecCCccEEEecccc-------ceeEEEEecCCcceeeeecCCchhhee
Confidence 1346778888888776643 268999999999999999999866555
No 147
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.40 E-value=2.5e-12 Score=148.47 Aligned_cols=207 Identities=14% Similarity=0.181 Sum_probs=158.0
Q ss_pred CCCCcccceeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccC
Q 000473 469 ENEGSCTGKSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERH 548 (1471)
Q Consensus 469 ~~dG~~i~~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~ 548 (1471)
-+||. .+.+..+-+++.+||+..+.+..+.. +........+++..++.. ..++++.||.|.| ||
T Consensus 474 ~pdgr---tLivGGeastlsiWDLAapTprikae--ltssapaCyALa~spDak----vcFsccsdGnI~v--wD----- 537 (705)
T KOG0639|consen 474 LPDGR---TLIVGGEASTLSIWDLAAPTPRIKAE--LTSSAPACYALAISPDAK----VCFSCCSDGNIAV--WD----- 537 (705)
T ss_pred cCCCc---eEEeccccceeeeeeccCCCcchhhh--cCCcchhhhhhhcCCccc----eeeeeccCCcEEE--EE-----
Confidence 34562 46777788999999998877743221 111112223333233332 5677789999999 33
Q ss_pred CCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEE
Q 000473 549 NSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQII 628 (1471)
Q Consensus 549 d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~ 628 (1471)
+.....++.|.||++.+.|+.+++| |..|-||+-|.+||.||+++++.+... .....|.++-
T Consensus 538 --------Lhnq~~VrqfqGhtDGascIdis~d---------GtklWTGGlDntvRcWDlregrqlqqh-dF~SQIfSLg 599 (705)
T KOG0639|consen 538 --------LHNQTLVRQFQGHTDGASCIDISKD---------GTKLWTGGLDNTVRCWDLREGRQLQQH-DFSSQIFSLG 599 (705)
T ss_pred --------cccceeeecccCCCCCceeEEecCC---------CceeecCCCccceeehhhhhhhhhhhh-hhhhhheecc
Confidence 3445778999999999999999997 899999999999999999988765332 2346799999
Q ss_pred ECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 629 LSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 629 fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
++|. +++++.|-+.+.|.|.... +...+++.-|.+.|.+++|.+-|+++++.+.| +-+..|.+.-
T Consensus 600 ~cP~------~dWlavGMens~vevlh~s-kp~kyqlhlheScVLSlKFa~cGkwfvStGkD--------nlLnawrtPy 664 (705)
T KOG0639|consen 600 YCPT------GDWLAVGMENSNVEVLHTS-KPEKYQLHLHESCVLSLKFAYCGKWFVSTGKD--------NLLNAWRTPY 664 (705)
T ss_pred cCCC------ccceeeecccCcEEEEecC-CccceeecccccEEEEEEecccCceeeecCch--------hhhhhccCcc
Confidence 9998 8999999999999998865 44457778899999999999999999999999 8999999988
Q ss_pred CeEEEEEeCCCCCceee
Q 000473 709 GARERVLRGTASHSMFD 725 (1471)
Q Consensus 709 g~~~~~l~gH~~~v~~~ 725 (1471)
|..+-.... .+.|++.
T Consensus 665 GasiFqskE-~SsVlsC 680 (705)
T KOG0639|consen 665 GASIFQSKE-SSSVLSC 680 (705)
T ss_pred ccceeeccc-cCcceee
Confidence 887766543 3345554
No 148
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.39 E-value=5.9e-12 Score=144.88 Aligned_cols=164 Identities=14% Similarity=0.147 Sum_probs=133.0
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
.+|...|-++...... -+.|++|+.|.+|++ ||+.++++..++..|.+.|.++.|+|.
T Consensus 240 ~gHTdavl~Ls~n~~~---~nVLaSgsaD~TV~l---------------WD~~~g~p~~s~~~~~k~Vq~l~wh~~---- 297 (463)
T KOG0270|consen 240 SGHTDAVLALSWNRNF---RNVLASGSADKTVKL---------------WDVDTGKPKSSITHHGKKVQTLEWHPY---- 297 (463)
T ss_pred ccchHHHHHHHhcccc---ceeEEecCCCceEEE---------------EEcCCCCcceehhhcCCceeEEEecCC----
Confidence 4677666665433222 248999999999999 456788999999999999999999996
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEE
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERM 664 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~ 664 (1471)
...+|++||.|++|.+.|.+........-...+.|..++|.|.. ...++++..||+|+-+|+|.. +++.+
T Consensus 298 ----~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~~s-----e~~f~~~tddG~v~~~D~R~~~~~vwt 368 (463)
T KOG0270|consen 298 ----EPSVLLSGSYDGTVALKDCRDPSNSGKEWKFDGEVEKVAWDPHS-----ENSFFVSTDDGTVYYFDIRNPGKPVWT 368 (463)
T ss_pred ----CceEEEeccccceEEeeeccCccccCceEEeccceEEEEecCCC-----ceeEEEecCCceEEeeecCCCCCceeE
Confidence 38999999999999999998533332222345679999999974 568888999999999999875 89999
Q ss_pred ecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 665 FPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 665 ~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+..|.++|.++++++.-. +|++++.| ++|++|++..
T Consensus 369 ~~AHd~~ISgl~~n~~~p~~l~t~s~d--------~~Vklw~~~~ 405 (463)
T KOG0270|consen 369 LKAHDDEISGLSVNIQTPGLLSTASTD--------KVVKLWKFDV 405 (463)
T ss_pred EEeccCCcceEEecCCCCcceeecccc--------ceEEEEeecC
Confidence 999999999999988654 67777777 9999999864
No 149
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.38 E-value=7.9e-11 Score=136.09 Aligned_cols=141 Identities=15% Similarity=0.228 Sum_probs=106.3
Q ss_pred EEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccE--EEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 528 IVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAV--LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 528 lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V--~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
|+..+.+|.|.+ |+ +....+++.+.. .+.| ++++.+++ +.+|++||..|.|.|
T Consensus 359 l~~~~~~GeV~v--~n-------------l~~~~~~~rf~D-~G~v~gts~~~S~n---------g~ylA~GS~~GiVNI 413 (514)
T KOG2055|consen 359 LLASGGTGEVYV--WN-------------LRQNSCLHRFVD-DGSVHGTSLCISLN---------GSYLATGSDSGIVNI 413 (514)
T ss_pred EEEEcCCceEEE--Ee-------------cCCcceEEEEee-cCccceeeeeecCC---------CceEEeccCcceEEE
Confidence 444556777777 33 333355555542 2233 45566665 789999999999999
Q ss_pred EECCC------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe--CCCcEEEEECCCCcEEEEecCCC---CCcEE
Q 000473 606 WDLGS------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG--EDFSVALASLETLRVERMFPGHP---NYPAK 674 (1471)
Q Consensus 606 WDl~t------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s--~DgsV~lWdl~t~~~l~~~~gh~---~~V~~ 674 (1471)
||.++ .+++..+..-+..|+++.|+|+ ++.+|-+| .+..+||..+.+......++... +.|+|
T Consensus 414 Yd~~s~~~s~~PkPik~~dNLtt~Itsl~Fn~d------~qiLAiaS~~~knalrLVHvPS~TVFsNfP~~n~~vg~vtc 487 (514)
T KOG2055|consen 414 YDGNSCFASTNPKPIKTVDNLTTAITSLQFNHD------AQILAIASRVKKNALRLVHVPSCTVFSNFPTSNTKVGHVTC 487 (514)
T ss_pred eccchhhccCCCCchhhhhhhheeeeeeeeCcc------hhhhhhhhhccccceEEEeccceeeeccCCCCCCcccceEE
Confidence 99753 4567777777888999999999 78766555 57899999998877777776443 46899
Q ss_pred EEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 675 VVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 675 v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
++|+|.+.||++|..+ |.|.+|.+.
T Consensus 488 ~aFSP~sG~lAvGNe~--------grv~l~kL~ 512 (514)
T KOG2055|consen 488 MAFSPNSGYLAVGNEA--------GRVHLFKLH 512 (514)
T ss_pred EEecCCCceEEeecCC--------CceeeEeec
Confidence 9999999999999998 999999864
No 150
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.38 E-value=1.7e-12 Score=149.75 Aligned_cols=208 Identities=14% Similarity=0.125 Sum_probs=156.6
Q ss_pred ecccccCccccccccCCCCCCCccc-cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRD-DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~-~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
..+...+-|++||+..+.......+ ........+.++.+.++.+ .|++|++-.++.| ||+-.
T Consensus 434 VyTgGkgcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgr----tLivGGeastlsi--WDLAa----------- 496 (705)
T KOG0639|consen 434 VYTGGKGCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGR----TLIVGGEASTLSI--WDLAA----------- 496 (705)
T ss_pred eEecCCCeEEEeeccCCCCCCccccccccCcccceeeeEecCCCc----eEEeccccceeee--eeccC-----------
Confidence 4455668899999976532211110 0112345677777666665 8999999889999 44211
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
.+......+..-.-...+|+.+|| .++.+++..||.|.|||+.....++.|.+|+..+.||.++++
T Consensus 497 pTprikaeltssapaCyALa~spD---------akvcFsccsdGnI~vwDLhnq~~VrqfqGhtDGascIdis~d----- 562 (705)
T KOG0639|consen 497 PTPRIKAELTSSAPACYALAISPD---------AKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISKD----- 562 (705)
T ss_pred CCcchhhhcCCcchhhhhhhcCCc---------cceeeeeccCCcEEEEEcccceeeecccCCCCCceeEEecCC-----
Confidence 112222233333445778899998 899999999999999999999999999999999999999999
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
|..|-||+-|.+||.||+++++.+.... ..+.|.++...|++.+|++|-++ +.|.|-.. ++..-..+.-
T Consensus 563 -GtklWTGGlDntvRcWDlregrqlqqhd-F~SQIfSLg~cP~~dWlavGMen--------s~vevlh~-skp~kyqlhl 631 (705)
T KOG0639|consen 563 -GTKLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGYCPTGDWLAVGMEN--------SNVEVLHT-SKPEKYQLHL 631 (705)
T ss_pred -CceeecCCCccceeehhhhhhhhhhhhh-hhhhheecccCCCccceeeeccc--------CcEEEEec-CCccceeecc
Confidence 9999999999999999999998765532 45679999999999999999988 77766654 3444566778
Q ss_pred CCCCceeeeee
Q 000473 718 TASHSMFDHFC 728 (1471)
Q Consensus 718 H~~~v~~~~~~ 728 (1471)
|.+.|+.+.|.
T Consensus 632 heScVLSlKFa 642 (705)
T KOG0639|consen 632 HESCVLSLKFA 642 (705)
T ss_pred cccEEEEEEec
Confidence 88888888777
No 151
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.36 E-value=2.2e-12 Score=162.73 Aligned_cols=198 Identities=19% Similarity=0.204 Sum_probs=154.2
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.|+.|.+||.|.+|+-+-+.. -.....+.++..|++.|..|.|++. .+++|+||+.||.|.||
T Consensus 82 lIaGG~edG~I~ly~p~~~~~---------~~~~~~la~~~~h~G~V~gLDfN~~--------q~nlLASGa~~geI~iW 144 (1049)
T KOG0307|consen 82 LIAGGLEDGNIVLYDPASIIA---------NASEEVLATKSKHTGPVLGLDFNPF--------QGNLLASGADDGEILIW 144 (1049)
T ss_pred eeeccccCCceEEecchhhcc---------CcchHHHhhhcccCCceeeeecccc--------CCceeeccCCCCcEEEe
Confidence 588889999999954331100 1122456677889999999999997 26799999999999999
Q ss_pred ECCCCceEEEEe--ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC--CcEEEEEcCCCC
Q 000473 607 DLGSGNLITVMH--HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN--YPAKVVWDCPRG 682 (1471)
Q Consensus 607 Dl~tg~~l~~~~--~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~--~V~~v~~spdg~ 682 (1471)
|+..-+.-..+- .-...|.+++|+... .+.|++++.++.+.|||++..+.+..+..|.. .+..++|+|++.
T Consensus 145 Dlnn~~tP~~~~~~~~~~eI~~lsWNrkv-----qhILAS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~a 219 (1049)
T KOG0307|consen 145 DLNKPETPFTPGSQAPPSEIKCLSWNRKV-----SHILASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDHA 219 (1049)
T ss_pred ccCCcCCCCCCCCCCCcccceEeccchhh-----hHHhhccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCCc
Confidence 998754333331 235679999998764 78999999999999999999888887776655 467899999875
Q ss_pred -EEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCce
Q 000473 683 -YIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTF 760 (1471)
Q Consensus 683 -~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~ti 760 (1471)
.|++++.| |..-.|.+||++. -..++++++|...++.+.+|+. ....++++.+|+++
T Consensus 220 Tql~~As~d-----d~~PviqlWDlR~assP~k~~~~H~~GilslsWc~~----------------D~~lllSsgkD~~i 278 (1049)
T KOG0307|consen 220 TQLLVASGD-----DSAPVIQLWDLRFASSPLKILEGHQRGILSLSWCPQ----------------DPRLLLSSGKDNRI 278 (1049)
T ss_pred eeeeeecCC-----CCCceeEeecccccCCchhhhcccccceeeeccCCC----------------CchhhhcccCCCCe
Confidence 56666666 6668999999985 4678999999999999989943 23456778889999
Q ss_pred Eeecccc
Q 000473 761 RQSQIQN 767 (1471)
Q Consensus 761 r~w~l~~ 767 (1471)
..|+.++
T Consensus 279 i~wN~~t 285 (1049)
T KOG0307|consen 279 ICWNPNT 285 (1049)
T ss_pred eEecCCC
Confidence 9998654
No 152
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=99.35 E-value=8.6e-12 Score=147.18 Aligned_cols=211 Identities=18% Similarity=0.185 Sum_probs=161.8
Q ss_pred eeecccccCccccccccCCCCCC----CccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCC
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAG----DGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPG 552 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g----~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~ 552 (1471)
.+.+.+.++++++|++....+.. ..+.+|.+|.+.|-|+.+.+... .+.+|+.||+|+.|+.. .++|. -
T Consensus 308 ~lit~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v~~n~~----~~ysgg~Dg~I~~w~~p--~n~dp-~ 380 (577)
T KOG0642|consen 308 VLITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVVPSNGE----HCYSGGIDGTIRCWNLP--PNQDP-D 380 (577)
T ss_pred eEEEeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEecCCce----EEEeeccCceeeeeccC--CCCCc-c
Confidence 46677788999999995433322 45678899999999999877776 79999999999995433 22211 0
Q ss_pred CccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe---ccC--------
Q 000473 553 ASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH---HHV-------- 621 (1471)
Q Consensus 553 ~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~---~H~-------- 621 (1471)
...|. .....+|.||++.|+.+++|+. ...|+++|.|||++.|+.....+ .+|. .|.
T Consensus 381 ds~dp--~vl~~~l~Ghtdavw~l~~s~~---------~~~Llscs~DgTvr~w~~~~~~~-~~f~~~~e~g~Plsvd~~ 448 (577)
T KOG0642|consen 381 DSYDP--SVLSGTLLGHTDAVWLLALSST---------KDRLLSCSSDGTVRLWEPTEESP-CTFGEPKEHGYPLSVDRT 448 (577)
T ss_pred cccCc--chhccceeccccceeeeeeccc---------ccceeeecCCceEEeeccCCcCc-cccCCccccCCcceEeec
Confidence 01111 1345678999999999999986 67899999999999999864443 1110 000
Q ss_pred -----------------------------------------CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc
Q 000473 622 -----------------------------------------APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR 660 (1471)
Q Consensus 622 -----------------------------------------~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~ 660 (1471)
..+..+..+ |......++..|+.|+++|..+++
T Consensus 449 ss~~a~~~~s~~~~~~~~~~~ev~s~~~~~~s~~~~~~~~~~~in~vVs~------~~~~~~~~~hed~~Ir~~dn~~~~ 522 (577)
T KOG0642|consen 449 SSRPAHSLASFRFGYTSIDDMEVVSDLLIFESSASPGPRRYPQINKVVSH------PTADITFTAHEDRSIRFFDNKTGK 522 (577)
T ss_pred cchhHhhhhhcccccccchhhhhhhheeeccccCCCcccccCccceEEec------CCCCeeEecccCCceecccccccc
Confidence 112233333 447889999999999999999999
Q ss_pred EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCC
Q 000473 661 VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTAS 720 (1471)
Q Consensus 661 ~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~ 720 (1471)
+++....|...++++++.|+|.+|++++.| +.|++|.+....++.....|..
T Consensus 523 ~l~s~~a~~~svtslai~~ng~~l~s~s~d--------~sv~l~kld~k~~~~es~~~r~ 574 (577)
T KOG0642|consen 523 ILHSMVAHKDSVTSLAIDPNGPYLMSGSHD--------GSVRLWKLDVKTCVLESTAHRK 574 (577)
T ss_pred cchheeeccceecceeecCCCceEEeecCC--------ceeehhhccchheeeccccccc
Confidence 999999999999999999999999999999 9999999998888888777764
No 153
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.33 E-value=1.3e-11 Score=137.24 Aligned_cols=102 Identities=27% Similarity=0.376 Sum_probs=91.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+|.|+.-|.|+|.+ ..++++...+.||.+.|+.+.++|+ ..++++|+|.|.+||+|
T Consensus 107 ~la~~G~~GvIrVid---------------~~~~~~~~~~~ghG~sINeik~~p~--------~~qlvls~SkD~svRlw 163 (385)
T KOG1034|consen 107 FLAAGGYLGVIRVID---------------VVSGQCSKNYRGHGGSINEIKFHPD--------RPQLVLSASKDHSVRLW 163 (385)
T ss_pred eEEeecceeEEEEEe---------------cchhhhccceeccCccchhhhcCCC--------CCcEEEEecCCceEEEE
Confidence 467777889999943 4456788889999999999999998 37899999999999999
Q ss_pred ECCCCceEEEEe---ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC
Q 000473 607 DLGSGNLITVMH---HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE 657 (1471)
Q Consensus 607 Dl~tg~~l~~~~---~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~ 657 (1471)
++++..++..|- +|.+.|.++.|+++ |..|+|+|.|+++++|++.
T Consensus 164 nI~~~~Cv~VfGG~egHrdeVLSvD~~~~------gd~i~ScGmDhslk~W~l~ 211 (385)
T KOG1034|consen 164 NIQTDVCVAVFGGVEGHRDEVLSVDFSLD------GDRIASCGMDHSLKLWRLN 211 (385)
T ss_pred eccCCeEEEEecccccccCcEEEEEEcCC------CCeeeccCCcceEEEEecC
Confidence 999999998874 69999999999999 9999999999999999997
No 154
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.33 E-value=2.9e-11 Score=133.70 Aligned_cols=207 Identities=15% Similarity=0.130 Sum_probs=153.1
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE-E-ecCCccEEEEEEec
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY-F-LGHTGAVLCLAAHR 580 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~-l-~gH~~~V~~la~sp 580 (1471)
..+++|.+.|+++++.++.. .+++.+.|+.|++|+.+.+.. +..++++. + .+| -+.+.|.|
T Consensus 80 ~~LKgH~~~vt~~~FsSdGK----~lat~~~Dr~Ir~w~~~DF~~----------~eHr~~R~nve~dh---pT~V~Fap 142 (420)
T KOG2096|consen 80 SVLKGHKKEVTDVAFSSDGK----KLATISGDRSIRLWDVRDFEN----------KEHRCIRQNVEYDH---PTRVVFAP 142 (420)
T ss_pred hhhhccCCceeeeEEcCCCc----eeEEEeCCceEEEEecchhhh----------hhhhHhhccccCCC---ceEEEECC
Confidence 45789999999999888887 799999999999965542211 11122211 1 234 57789999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCC---ceEE---------EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG---NLIT---------VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg---~~l~---------~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
| -..++++.-...++++|-+... ..-+ .-+.|.-.|..+-..-. +.+++|++.|
T Consensus 143 D--------c~s~vv~~~~g~~l~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~------~k~imsas~d 208 (420)
T KOG2096|consen 143 D--------CKSVVVSVKRGNKLCVYKLVKKTDGSGSHHFVHIDNLEFERKHQVDIINIGIAGN------AKYIMSASLD 208 (420)
T ss_pred C--------cceEEEEEccCCEEEEEEeeecccCCCCcccccccccccchhcccceEEEeecCC------ceEEEEecCC
Confidence 8 2567888888889999987532 2111 11235556666665555 7899999999
Q ss_pred CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC---CCe-----EEEEEeCCCC
Q 000473 649 FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK---TGA-----RERVLRGTAS 720 (1471)
Q Consensus 649 gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~---tg~-----~~~~l~gH~~ 720 (1471)
..|.||+++ |+.+..+......-...+.||+|+|+++++.. --|+||++- .|+ .+..|.||++
T Consensus 209 t~i~lw~lk-Gq~L~~idtnq~~n~~aavSP~GRFia~~gFT--------pDVkVwE~~f~kdG~fqev~rvf~LkGH~s 279 (420)
T KOG2096|consen 209 TKICLWDLK-GQLLQSIDTNQSSNYDAAVSPDGRFIAVSGFT--------PDVKVWEPIFTKDGTFQEVKRVFSLKGHQS 279 (420)
T ss_pred CcEEEEecC-CceeeeeccccccccceeeCCCCcEEEEecCC--------CCceEEEEEeccCcchhhhhhhheeccchh
Confidence 999999999 89888887766677888999999999998887 679999973 332 3567899999
Q ss_pred CceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccc
Q 000473 721 HSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQ 766 (1471)
Q Consensus 721 ~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~ 766 (1471)
.|....|.+. +..++++++||++|+|+..
T Consensus 280 aV~~~aFsn~-----------------S~r~vtvSkDG~wriwdtd 308 (420)
T KOG2096|consen 280 AVLAAAFSNS-----------------STRAVTVSKDGKWRIWDTD 308 (420)
T ss_pred heeeeeeCCC-----------------cceeEEEecCCcEEEeecc
Confidence 9998877731 3346788899999999863
No 155
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.33 E-value=3.4e-12 Score=160.98 Aligned_cols=217 Identities=15% Similarity=0.151 Sum_probs=163.9
Q ss_pred eecccccCccccccccCCCC--CCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQ--AGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~--~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
++....+|.+.+|+...-.. .-..+.....|.+.|..+.+.. |.++.+++|..||+|.||+...+... ..
T Consensus 83 IaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~---~q~nlLASGa~~geI~iWDlnn~~tP---~~-- 154 (1049)
T KOG0307|consen 83 IAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNP---FQGNLLASGADDGEILIWDLNKPETP---FT-- 154 (1049)
T ss_pred eeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeeccc---cCCceeeccCCCCcEEEeccCCcCCC---CC--
Confidence 45556677888888765211 1134455678999999976433 44468999999999999554422110 00
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCC--CEEEEEECCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVA--PVRQIILSPPQ 633 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~--~V~~l~fspd~ 633 (1471)
. + -..-.+.|.||+|... ..+.|+||+.+|.+.|||++..+.+-.+..|.+ .+..+.|+|+.
T Consensus 155 -~--~-----~~~~~~eI~~lsWNrk--------vqhILAS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~ 218 (1049)
T KOG0307|consen 155 -P--G-----SQAPPSEIKCLSWNRK--------VSHILASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDH 218 (1049)
T ss_pred -C--C-----CCCCcccceEeccchh--------hhHHhhccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCC
Confidence 0 0 0113467999999875 378999999999999999999988888877765 47789999995
Q ss_pred CCCCCCCEEEEEeCC---CcEEEEECCC-CcEEEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 634 TEHPWSDCFLSVGED---FSVALASLET-LRVERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 634 ~~~~~~~~l~S~s~D---gsV~lWdl~t-~~~l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
...+++++.| -.|.+||++. ..+++.+.+|...|.++.|.+.+ .+|++++.| +.|.+|+.+|
T Consensus 219 -----aTql~~As~dd~~PviqlWDlR~assP~k~~~~H~~GilslsWc~~D~~lllSsgkD--------~~ii~wN~~t 285 (1049)
T KOG0307|consen 219 -----ATQLLVASGDDSAPVIQLWDLRFASSPLKILEGHQRGILSLSWCPQDPRLLLSSGKD--------NRIICWNPNT 285 (1049)
T ss_pred -----ceeeeeecCCCCCceeEeecccccCCchhhhcccccceeeeccCCCCchhhhcccCC--------CCeeEecCCC
Confidence 4566666655 4799999985 45678889999999999999987 788888888 9999999999
Q ss_pred CeEEEEEeCCCCCceeeeeeecc
Q 000473 709 GARERVLRGTASHSMFDHFCKGI 731 (1471)
Q Consensus 709 g~~~~~l~gH~~~v~~~~~~~~~ 731 (1471)
|+.+..+.....-..-++||++.
T Consensus 286 gEvl~~~p~~~nW~fdv~w~pr~ 308 (1049)
T KOG0307|consen 286 GEVLGELPAQGNWCFDVQWCPRN 308 (1049)
T ss_pred ceEeeecCCCCcceeeeeecCCC
Confidence 99999998877777778999753
No 156
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=99.31 E-value=1e-10 Score=125.44 Aligned_cols=188 Identities=13% Similarity=0.086 Sum_probs=133.3
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
..+++|.+.+..++ |+-+.|++|+ ||.|+-|.|.........-++|.... +.++=.-.--.|+++...|.
T Consensus 56 v~eqahdgpiy~~~------f~d~~Lls~g-dG~V~gw~W~E~~es~~~K~lwe~~~--P~~~~~~evPeINam~ldP~- 125 (325)
T KOG0649|consen 56 VPEQAHDGPIYYLA------FHDDFLLSGG-DGLVYGWEWNEEEESLATKRLWEVKI--PMQVDAVEVPEINAMWLDPS- 125 (325)
T ss_pred eeccccCCCeeeee------eehhheeecc-CceEEEeeehhhhhhccchhhhhhcC--ccccCcccCCccceeEeccC-
Confidence 34588999999987 3333455544 69999999985544222233455432 22210111235788888886
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
.+-++.++.|+.+.-||+++|+..+++++|+..|.++.-... ...++||++||++|+||+++++++
T Consensus 126 --------enSi~~AgGD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~------~~qilsG~EDGtvRvWd~kt~k~v 191 (325)
T KOG0649|consen 126 --------ENSILFAGGDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNA------NGQILSGAEDGTVRVWDTKTQKHV 191 (325)
T ss_pred --------CCcEEEecCCeEEEEEEecCCEEEEEEcCCcceeeeeeeccc------CcceeecCCCccEEEEecccccee
Confidence 455556668999999999999999999999999999987544 457899999999999999999999
Q ss_pred EEecCCCC----------CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE--eCCCCCceee
Q 000473 663 RMFPGHPN----------YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL--RGTASHSMFD 725 (1471)
Q Consensus 663 ~~~~gh~~----------~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l--~gH~~~v~~~ 725 (1471)
.++..... .|.++ .-+.++|++|+. -.+.+|.+++.++..++ .+|...+++.
T Consensus 192 ~~ie~yk~~~~lRp~~g~wigal--a~~edWlvCGgG---------p~lslwhLrsse~t~vfpipa~v~~v~F~ 255 (325)
T KOG0649|consen 192 SMIEPYKNPNLLRPDWGKWIGAL--AVNEDWLVCGGG---------PKLSLWHLRSSESTCVFPIPARVHLVDFV 255 (325)
T ss_pred EEeccccChhhcCcccCceeEEE--eccCceEEecCC---------CceeEEeccCCCceEEEecccceeEeeee
Confidence 88864322 23344 446779998876 48999999998776655 4555555544
No 157
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.30 E-value=3.3e-11 Score=139.38 Aligned_cols=237 Identities=14% Similarity=0.108 Sum_probs=167.8
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
+.+..+..+++|......+--.......+ ..-.|+.+.+... .+++|+.+|.++| |+ .+
T Consensus 51 as~~gdk~~~~~~K~g~~~~Vp~~~k~~g--d~~~Cv~~~s~S~----y~~sgG~~~~Vki--wd-------------l~ 109 (673)
T KOG4378|consen 51 ASMAGDKVMRIKEKDGKTPEVPRVRKLTG--DNAFCVACASQSL----YEISGGQSGCVKI--WD-------------LR 109 (673)
T ss_pred eecCCceeEEEecccCCCCccceeecccc--chHHHHhhhhcce----eeeccCcCceeee--hh-------------hH
Confidence 44556677788887665332222222222 2334554444444 6899999999999 33 34
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc-CCCEEEEEECCCCCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH-VAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H-~~~V~~l~fspd~~~~~ 637 (1471)
...+++.+++|+..|+++.|.-. ..+|++++..|.|.+..+.++..-..|..- ...|.-+.|+|..
T Consensus 110 ~kl~hr~lkdh~stvt~v~YN~~---------DeyiAsvs~gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~sk---- 176 (673)
T KOG4378|consen 110 AKLIHRFLKDHQSTVTYVDYNNT---------DEYIASVSDGGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSK---- 176 (673)
T ss_pred HHHHhhhccCCcceeEEEEecCC---------cceeEEeccCCcEEEEecccCccccceecCCCCeEEEeeccccc----
Confidence 44677889999999999999875 799999999999999999999888788655 4456688999973
Q ss_pred CCCEEEEEeCCCcEEEEECCCCcEEEEe-cCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 638 WSDCFLSVGEDFSVALASLETLRVERMF-PGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 638 ~~~~l~S~s~DgsV~lWdl~t~~~l~~~-~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
...|.+++.||.|.|||++...++..+ ..|..+...|+|+|... .|++-+.| ..|++||+...+...++
T Consensus 177 -r~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~D--------kki~~yD~~s~~s~~~l 247 (673)
T KOG4378|consen 177 -RFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYD--------KKINIYDIRSQASTDRL 247 (673)
T ss_pred -ceeeEeeccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEeccc--------ceEEEeeccccccccee
Confidence 447888999999999999988877554 57999999999999755 66777777 99999999987776666
Q ss_pred eCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeeccccccccccccc
Q 000473 716 RGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQNDERGVAFST 776 (1471)
Q Consensus 716 ~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~~~~~~~~~ 776 (1471)
.... +-..+.|.+ +|.+.+.+.. .|.+.+++++...+..+-++
T Consensus 248 ~y~~-Plstvaf~~------------~G~~L~aG~s-----~G~~i~YD~R~~k~Pv~v~s 290 (673)
T KOG4378|consen 248 TYSH-PLSTVAFSE------------CGTYLCAGNS-----KGELIAYDMRSTKAPVAVRS 290 (673)
T ss_pred eecC-CcceeeecC------------CceEEEeecC-----CceEEEEecccCCCCceEee
Confidence 5432 223333441 2223222222 67788888876665554443
No 158
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.30 E-value=1.8e-10 Score=127.52 Aligned_cols=214 Identities=14% Similarity=0.211 Sum_probs=149.6
Q ss_pred EeeccccccccCCCCcccceeecccccCccccccccCCC--CCC---CccccccccCccEEEEEeeccccccCC----EE
Q 000473 458 VDWVNNSTFLDENEGSCTGKSDLTFCQDTVPRSEHVDSR--QAG---DGRDDFVHKEKIVSSSMVISESFYAPY----AI 528 (1471)
Q Consensus 458 ~~w~~~~~~~~~~dG~~i~~l~~s~~~~~v~~Wd~~~~~--~~g---~~~~~~~~h~~~Vts~~~is~~~f~P~----~l 528 (1471)
.+|.-+. -|. .+++.+.++++.+|+-.... ..+ ....++...+..|+.+. |.|. .+
T Consensus 65 V~WAhPE------fGq---vvA~cS~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~------FaP~hlGLkl 129 (361)
T KOG2445|consen 65 VVWAHPE------FGQ---VVATCSYDRTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVK------FAPKHLGLKL 129 (361)
T ss_pred EEecCcc------ccc---eEEEEecCCceeeeeecccccccccceeEEEEEeecCCcceeEEE------ecchhcceEE
Confidence 5676654 342 37788889999999974221 111 12234455556677765 7776 79
Q ss_pred EEEEcCCcEEEEEecccccCCCCCCccccCCcc--eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC-----C
Q 000473 529 VYGFFSGEIEVIQFDLFERHNSPGASLKVNSHV--SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD-----C 601 (1471)
Q Consensus 529 v~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~--~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D-----g 601 (1471)
++++.||.++|+... ... ....|.+.... ..-....|..+..|+.|+| ++-..++|+.|+.+ +
T Consensus 130 A~~~aDG~lRIYEA~--dp~--nLs~W~Lq~Ei~~~~~pp~~~~~~~~CvsWn~------sr~~~p~iAvgs~e~a~~~~ 199 (361)
T KOG2445|consen 130 AAASADGILRIYEAP--DPM--NLSQWTLQHEIQNVIDPPGKNKQPCFCVSWNP------SRMHEPLIAVGSDEDAPHLN 199 (361)
T ss_pred EEeccCcEEEEEecC--Ccc--ccccchhhhhhhhccCCcccccCcceEEeecc------ccccCceEEEEcccCCcccc
Confidence 999999999996432 111 11123332110 0111224667788999987 34457889999877 5
Q ss_pred cEEEEECCCCc----eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC------------------
Q 000473 602 SIRIWDLGSGN----LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL------------------ 659 (1471)
Q Consensus 602 tI~lWDl~tg~----~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~------------------ 659 (1471)
.+++|....+. .+..+.+|+.+|+.++|.|+. .+.-+.+++++.|| |+||.++..
T Consensus 200 ~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~--Gr~y~~lAvA~kDg-v~I~~v~~~~s~i~~ee~~~~~~~~~l 276 (361)
T KOG2445|consen 200 KVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNI--GRSYHLLAVATKDG-VRIFKVKVARSAIEEEEVLAPDLMTDL 276 (361)
T ss_pred ceEEEEecCCcceeeeehhcCCCCCcceeeeecccc--CCceeeEEEeecCc-EEEEEEeeccchhhhhcccCCCCcccc
Confidence 78888875432 345677999999999999983 34456899999999 999999731
Q ss_pred --cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 660 --RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 660 --~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
+.+..+.+|.+.|+.+.|.-.|..|++.+.| |.||+|...
T Consensus 277 ~v~~vs~~~~H~~~VWrv~wNmtGtiLsStGdD--------G~VRLWkan 318 (361)
T KOG2445|consen 277 PVEKVSELDDHNGEVWRVRWNMTGTILSSTGDD--------GCVRLWKAN 318 (361)
T ss_pred ceEEeeeccCCCCceEEEEEeeeeeEEeecCCC--------ceeeehhhh
Confidence 2455678999999999999999999999988 999999753
No 159
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.27 E-value=1e-10 Score=130.31 Aligned_cols=125 Identities=22% Similarity=0.255 Sum_probs=111.5
Q ss_pred cCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 568 GHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 568 gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
.|...-..++|.-+. . ....+++.|+.-|.|++.|+.++++...+.+|.+.|..+.+.|+. .++++|+|.
T Consensus 87 d~~Esfytcsw~yd~--~---~~~p~la~~G~~GvIrVid~~~~~~~~~~~ghG~sINeik~~p~~-----~qlvls~Sk 156 (385)
T KOG1034|consen 87 DHDESFYTCSWSYDS--N---TGNPFLAAGGYLGVIRVIDVVSGQCSKNYRGHGGSINEIKFHPDR-----PQLVLSASK 156 (385)
T ss_pred CCCcceEEEEEEecC--C---CCCeeEEeecceeEEEEEecchhhhccceeccCccchhhhcCCCC-----CcEEEEecC
Confidence 366777888887762 1 126789999999999999999999999999999999999999985 689999999
Q ss_pred CCcEEEEECCCCcEEEEe---cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 648 DFSVALASLETLRVERMF---PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 648 DgsV~lWdl~t~~~l~~~---~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
|.+||+||+++..|+..| .||.+.|.++.|+++|.+|++++.| .++++|++...+
T Consensus 157 D~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmD--------hslk~W~l~~~~ 214 (385)
T KOG1034|consen 157 DHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMD--------HSLKLWRLNVKE 214 (385)
T ss_pred CceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCc--------ceEEEEecChhH
Confidence 999999999999999887 4799999999999999999999999 999999998543
No 160
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=99.27 E-value=1.4e-10 Score=125.89 Aligned_cols=120 Identities=18% Similarity=0.174 Sum_probs=90.2
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~ 586 (1471)
.|.+.|.++.|-+... +=+.|+.+..+.+++++ |+..+.+....+.-.+-.|..+.+-||
T Consensus 203 sh~qpvlsldyas~~~----rGisgga~dkl~~~Sl~-----------~s~gslq~~~e~~lknpGv~gvrIRpD----- 262 (323)
T KOG0322|consen 203 SHKQPVLSLDYASSCD----RGISGGADDKLVMYSLN-----------HSTGSLQIRKEITLKNPGVSGVRIRPD----- 262 (323)
T ss_pred hccCcceeeeechhhc----CCcCCCccccceeeeec-----------cccCcccccceEEecCCCccceEEccC-----
Confidence 4556666666443333 55667777777776554 222111111122222335788888887
Q ss_pred cCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEEC
Q 000473 587 GWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASL 656 (1471)
Q Consensus 587 ~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl 656 (1471)
++.++|+++|+.|||+.-++.+++..++.|.+.|.+++|+|+ ...++.++.|..|.+|++
T Consensus 263 ----~KIlATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd------~~lmAaaskD~rISLWkL 322 (323)
T KOG0322|consen 263 ----GKILATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPD------CELMAAASKDARISLWKL 322 (323)
T ss_pred ----CcEEeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCC------CchhhhccCCceEEeeec
Confidence 899999999999999999999999999999999999999999 789999999999999986
No 161
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=99.26 E-value=2.1e-10 Score=123.04 Aligned_cols=163 Identities=22% Similarity=0.357 Sum_probs=125.4
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++.|..+|+|.+.....+..+.. ..+....+....+|.++|+.++|+ ..+|++|+. |.|+=|
T Consensus 24 ~l~agn~~G~iav~sl~sl~s~sa-----~~~gk~~iv~eqahdgpiy~~~f~-----------d~~Lls~gd-G~V~gw 86 (325)
T KOG0649|consen 24 YLFAGNLFGDIAVLSLKSLDSGSA-----EPPGKLKIVPEQAHDGPIYYLAFH-----------DDFLLSGGD-GLVYGW 86 (325)
T ss_pred EEEEecCCCeEEEEEehhhhcccc-----CCCCCcceeeccccCCCeeeeeee-----------hhheeeccC-ceEEEe
Confidence 689999999999987765544311 111224455668999999999998 367888865 999988
Q ss_pred ECCCCc-------eE-EEEecc-----CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcE
Q 000473 607 DLGSGN-------LI-TVMHHH-----VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 607 Dl~tg~-------~l-~~~~~H-----~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
.-+... +. .....| ...|+++.+.|. .+.++.++.|+.+.-||+++|+..+++.||.++|-
T Consensus 87 ~W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~------enSi~~AgGD~~~y~~dlE~G~i~r~~rGHtDYvH 160 (325)
T KOG0649|consen 87 EWNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPS------ENSILFAGGDGVIYQVDLEDGRIQREYRGHTDYVH 160 (325)
T ss_pred eehhhhhhccchhhhhhcCccccCcccCCccceeEeccC------CCcEEEecCCeEEEEEEecCCEEEEEEcCCcceee
Confidence 654211 11 111122 246888999888 66777777999999999999999999999999999
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCC
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTAS 720 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~ 720 (1471)
+++-......+++|++| |++||||.+|++.+.++.....
T Consensus 161 ~vv~R~~~~qilsG~ED--------GtvRvWd~kt~k~v~~ie~yk~ 199 (325)
T KOG0649|consen 161 SVVGRNANGQILSGAED--------GTVRVWDTKTQKHVSMIEPYKN 199 (325)
T ss_pred eeeecccCcceeecCCC--------ccEEEEeccccceeEEeccccC
Confidence 99997777788899999 9999999999999988875543
No 162
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.26 E-value=3.7e-11 Score=138.71 Aligned_cols=165 Identities=17% Similarity=0.203 Sum_probs=127.7
Q ss_pred ccccCccEEEEEeeccccccCC----EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEec
Q 000473 505 FVHKEKIVSSSMVISESFYAPY----AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHR 580 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~----~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~sp 580 (1471)
..-+..+|++++ |+|. .++.|...|.|-+|++++ .+ .+ ..-+..+.+|.++|.++.|+|
T Consensus 182 ~kv~~~Rit~l~------fHPt~~~~lva~GdK~G~VG~Wn~~~-~~-------~d---~d~v~~f~~hs~~Vs~l~F~P 244 (498)
T KOG4328|consen 182 AKVTDRRITSLA------FHPTENRKLVAVGDKGGQVGLWNFGT-QE-------KD---KDGVYLFTPHSGPVSGLKFSP 244 (498)
T ss_pred eEecccceEEEE------ecccCcceEEEEccCCCcEEEEecCC-CC-------Cc---cCceEEeccCCccccceEecC
Confidence 456778999998 6664 788899999999965531 11 11 133567889999999999999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCc---------------------------------------------eEE
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN---------------------------------------------LIT 615 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~---------------------------------------------~l~ 615 (1471)
. +...+++.|.||+|++=|++.+. ...
T Consensus 245 ~--------n~s~i~ssSyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~s~~~ 316 (498)
T KOG4328|consen 245 A--------NTSQIYSSSYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRTDGSEYE 316 (498)
T ss_pred C--------ChhheeeeccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeecCCccch
Confidence 7 36788999999999999886321 011
Q ss_pred EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE----EEEecCCCCCcEEEEEcCCCCEEEEEEcCC
Q 000473 616 VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV----ERMFPGHPNYPAKVVWDCPRGYIACLCRDH 691 (1471)
Q Consensus 616 ~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~----l~~~~gh~~~V~~v~~spdg~~L~sgs~D~ 691 (1471)
.+.-|...|..++++|.. ..+++|+|.|++.+|||++.... +.....|...|.++.|||.+..|+|.|.|
T Consensus 317 ~~~lh~kKI~sv~~NP~~-----p~~laT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~~D- 390 (498)
T KOG4328|consen 317 NLRLHKKKITSVALNPVC-----PWFLATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTCQD- 390 (498)
T ss_pred hhhhhhcccceeecCCCC-----chheeecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEeeccC-
Confidence 123466789999999984 56999999999999999986432 23334699999999999999999999999
Q ss_pred CCCCCCCCEEEEEECC
Q 000473 692 SRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 692 sg~~D~~gtV~VWDi~ 707 (1471)
..|+|||..
T Consensus 391 -------~~IRv~dss 399 (498)
T KOG4328|consen 391 -------NEIRVFDSS 399 (498)
T ss_pred -------CceEEeecc
Confidence 999999973
No 163
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.24 E-value=1.5e-10 Score=133.78 Aligned_cols=212 Identities=19% Similarity=0.211 Sum_probs=151.7
Q ss_pred ccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEeccc-----ccC--C---
Q 000473 483 CQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLF-----ERH--N--- 549 (1471)
Q Consensus 483 ~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l-----~~~--d--- 549 (1471)
..|.|-+|+.....+...-...+..|.+.|+++. |+|. .+.+.+.||+|+..++... ... +
T Consensus 208 K~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~------F~P~n~s~i~ssSyDGtiR~~D~~~~i~e~v~s~~~d~~~ 281 (498)
T KOG4328|consen 208 KGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLK------FSPANTSQIYSSSYDGTIRLQDFEGNISEEVLSLDTDNIW 281 (498)
T ss_pred CCCcEEEEecCCCCCccCceEEeccCCccccceE------ecCCChhheeeeccCceeeeeeecchhhHHHhhcCcccee
Confidence 3466777887433332334455778999999988 5553 8999999999998765311 000 0
Q ss_pred -----------------CCC--CccccCCcce-EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC
Q 000473 550 -----------------SPG--ASLKVNSHVS-RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG 609 (1471)
Q Consensus 550 -----------------~~~--~~~d~~s~~~-~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~ 609 (1471)
.-+ ..||..++.. ...+.-|...|+.++++|.+ ..+|+|+|.|+++++||++
T Consensus 282 fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~s~~~~~~lh~kKI~sv~~NP~~--------p~~laT~s~D~T~kIWD~R 353 (498)
T KOG4328|consen 282 FSSLDFSAESRSVLFGDNVGNFNVIDLRTDGSEYENLRLHKKKITSVALNPVC--------PWFLATASLDQTAKIWDLR 353 (498)
T ss_pred eeeccccCCCccEEEeecccceEEEEeecCCccchhhhhhhcccceeecCCCC--------chheeecccCcceeeeehh
Confidence 011 2344443322 34456688899999999973 7899999999999999997
Q ss_pred CCc----eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC----CCcEEEEecCCCC------CcEEE
Q 000473 610 SGN----LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE----TLRVERMFPGHPN------YPAKV 675 (1471)
Q Consensus 610 tg~----~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~----t~~~l~~~~gh~~------~V~~v 675 (1471)
.-. ++-....|..+|.+..|+|. +..|+|.+.|..|+|||.. .-.+..++. |.. .+...
T Consensus 354 ~l~~K~sp~lst~~HrrsV~sAyFSPs------~gtl~TT~~D~~IRv~dss~~sa~~~p~~~I~-Hn~~t~RwlT~fKA 426 (498)
T KOG4328|consen 354 QLRGKASPFLSTLPHRRSVNSAYFSPS------GGTLLTTCQDNEIRVFDSSCISAKDEPLGTIP-HNNRTGRWLTPFKA 426 (498)
T ss_pred hhcCCCCcceecccccceeeeeEEcCC------CCceEeeccCCceEEeecccccccCCccceee-ccCcccccccchhh
Confidence 422 23334479999999999999 5669999999999999983 333334432 322 24567
Q ss_pred EEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCce
Q 000473 676 VWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSM 723 (1471)
Q Consensus 676 ~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~ 723 (1471)
+|.|+..++++|-.- ..|-|+|-..|+.+..+.+.....+
T Consensus 427 ~W~P~~~li~vg~~~--------r~IDv~~~~~~q~v~el~~P~~~tI 466 (498)
T KOG4328|consen 427 AWDPDYNLIVVGRYP--------RPIDVFDGNGGQMVCELHDPESSTI 466 (498)
T ss_pred eeCCCccEEEEeccC--------cceeEEcCCCCEEeeeccCcccccc
Confidence 899999999999887 7899999999998888887765333
No 164
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=99.23 E-value=6.8e-09 Score=128.83 Aligned_cols=224 Identities=17% Similarity=0.203 Sum_probs=140.6
Q ss_pred EEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccccccCC
Q 000473 22 TSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMGKSSLD 101 (1471)
Q Consensus 22 va~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~~~s~d 101 (1471)
-.||+|+++|+.. .+.+|.++...+ ...+..|++|.++++.+.+ .| -.++
T Consensus 22 avfSnD~k~l~~~-~~~~V~VyS~~T-----g~~i~~l~~~~a~l~s~~~-~~-----------------------~~~~ 71 (792)
T KOG1963|consen 22 AVFSNDAKFLFLC-TGNFVKVYSTAT-----GECITSLEDHTAPLTSVIV-LP-----------------------SSEN 71 (792)
T ss_pred cccccCCcEEEEe-eCCEEEEEecch-----HhhhhhcccccCccceeee-cC-----------------------CCcc
Confidence 4699999998755 467899999986 3455689999999999983 00 1223
Q ss_pred CCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcE---------EE-----------EcC-----------------
Q 000473 102 NGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSV---------IC-----------TLP----------------- 144 (1471)
Q Consensus 102 ~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~---------i~-----------~~s----------------- 144 (1471)
..++.+++.||+|++||..+|.+++.........+-.. .. .++
T Consensus 72 ~~~~~~~sl~G~I~vwd~~~~~Llkt~~~~~~v~~~~~~~~~a~~s~~~~~s~~~~~~~~~~s~~~~~q~~~~~~~t~~~ 151 (792)
T KOG1963|consen 72 ANYLIVCSLDGTIRVWDWSDGELLKTFDNNLPVHALVYKPAQADISANVYVSVEDYSILTTFSKKLSKQSSRFVLATFDS 151 (792)
T ss_pred ceEEEEEecCccEEEecCCCcEEEEEEecCCceeEEEechhHhCccceeEeecccceeeeecccccccceeeeEeeeccc
Confidence 47888999999999999999998877543211100000 00 000
Q ss_pred ------------------CCCeE--EE--Ecceec--ccCC-------cccccccccccccccccccCCCCCCCCCceEE
Q 000473 145 ------------------SNPRY--VC--IGCCFI--DTNQ-------LSDHHSFESVEGDLVSEDKEVPMKNPPKCTLV 193 (1471)
Q Consensus 145 ------------------~~~~l--l~--~G~~~i--d~~~-------~~~~h~~~~i~~~~~~~d~~~~~~~~~~~~I~ 193 (1471)
+.+.+ ++ ++.+.+ ..++ ..+.|+|. +.-...++.+.+......+|+|.
T Consensus 152 ~~~d~~~~~~~~~~I~~~~~ge~~~i~~~~~~~~~~v~~~~~~~~~~~~~~~Htf~-~t~~~~spn~~~~Aa~d~dGrI~ 230 (792)
T KOG1963|consen 152 AKGDFLKEHQEPKSIVDNNSGEFKGIVHMCKIHIYFVPKHTKHTSSRDITVHHTFN-ITCVALSPNERYLAAGDSDGRIL 230 (792)
T ss_pred cchhhhhhhcCCccEEEcCCceEEEEEEeeeEEEEEecccceeeccchhhhhhccc-ceeEEeccccceEEEeccCCcEE
Confidence 01111 00 000111 1100 11124443 33334455666666777788999
Q ss_pred EEeCcc---eEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCcccccCCCcccCCCcccc
Q 000473 194 IVDTYG---LTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDREEGNGLCKSSSQLDM 270 (1471)
Q Consensus 194 v~D~~t---~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~~~~~~l~~~e~~i~~ 270 (1471)
+|-... ...-.++. +|+.+.|.++.|++++ .+++.|+..|.+.+|.+++++ +++
T Consensus 231 vw~d~~~~~~~~t~t~l--HWH~~~V~~L~fS~~G-----~~LlSGG~E~VLv~Wq~~T~~--------------kqf-- 287 (792)
T KOG1963|consen 231 VWRDFGSSDDSETCTLL--HWHHDEVNSLSFSSDG-----AYLLSGGREGVLVLWQLETGK--------------KQF-- 287 (792)
T ss_pred EEeccccccccccceEE--EecccccceeEEecCC-----ceEeecccceEEEEEeecCCC--------------ccc--
Confidence 997655 22233444 5777889999999654 679999999999999999873 111
Q ss_pred eeccCCcccCceEEEEecCCcEEEEEeCCeEE
Q 000473 271 AILQNGVVEGGHLVSVATCGNIIALVLKDHCI 302 (1471)
Q Consensus 271 v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~ 302 (1471)
+.-..+.+..+.+|||+...+++..|.-+
T Consensus 288 ---LPRLgs~I~~i~vS~ds~~~sl~~~DNqI 316 (792)
T KOG1963|consen 288 ---LPRLGSPILHIVVSPDSDLYSLVLEDNQI 316 (792)
T ss_pred ---ccccCCeeEEEEEcCCCCeEEEEecCceE
Confidence 22133445778889999888888877644
No 165
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.22 E-value=1.1e-10 Score=139.32 Aligned_cols=153 Identities=20% Similarity=0.235 Sum_probs=115.2
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.|+.+-+||.|.+++-.. .....+ .........|.+.|..+.|-|. +..|++++.|.++++|
T Consensus 66 iLavadE~G~i~l~dt~~--------~~fr~e-e~~lk~~~aH~nAifDl~wapg---------e~~lVsasGDsT~r~W 127 (720)
T KOG0321|consen 66 ILAVADEDGGIILFDTKS--------IVFRLE-ERQLKKPLAHKNAIFDLKWAPG---------ESLLVSASGDSTIRPW 127 (720)
T ss_pred eEEEecCCCceeeecchh--------hhcchh-hhhhcccccccceeEeeccCCC---------ceeEEEccCCceeeee
Confidence 789999999999943210 001111 1123445689999999999985 7899999999999999
Q ss_pred ECCCCceEEE--EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc-------EEE--------------
Q 000473 607 DLGSGNLITV--MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR-------VER-------------- 663 (1471)
Q Consensus 607 Dl~tg~~l~~--~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~-------~l~-------------- 663 (1471)
|+++.++... +.+|.+.|.+++|.|.+ ...||+|+.||.|.|||++... +.+
T Consensus 128 dvk~s~l~G~~~~~GH~~SvkS~cf~~~n-----~~vF~tGgRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp 202 (720)
T KOG0321|consen 128 DVKTSRLVGGRLNLGHTGSVKSECFMPTN-----PAVFCTGGRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKP 202 (720)
T ss_pred eeccceeecceeecccccccchhhhccCC-----CcceeeccCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCch
Confidence 9999998866 99999999999999985 6799999999999999997432 000
Q ss_pred ------EecCCCCCcEE---EEEcCCCCEEEEEEc-CCCCCCCCCCEEEEEECCCCe
Q 000473 664 ------MFPGHPNYPAK---VVWDCPRGYIACLCR-DHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 664 ------~~~gh~~~V~~---v~~spdg~~L~sgs~-D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
.-..|...|.. +.+.-|+..|++++. | +.|+|||++...
T Consensus 203 ~~kr~~k~kA~s~ti~ssvTvv~fkDe~tlaSaga~D--------~~iKVWDLRk~~ 251 (720)
T KOG0321|consen 203 LKKRIRKWKAASNTIFSSVTVVLFKDESTLASAGAAD--------STIKVWDLRKNY 251 (720)
T ss_pred hhccccccccccCceeeeeEEEEEeccceeeeccCCC--------cceEEEeecccc
Confidence 01122333444 556678889998887 5 999999998754
No 166
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=99.21 E-value=5e-11 Score=140.61 Aligned_cols=148 Identities=18% Similarity=0.288 Sum_probs=123.1
Q ss_pred EEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC
Q 000473 531 GFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS 610 (1471)
Q Consensus 531 Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t 610 (1471)
.+..|.|-|+..+ +++++.|-. .-..+ ....|+.++|.|- +.+.|+.++.||.|++|.+..
T Consensus 599 ~g~gG~iai~el~------~PGrLPDgv---~p~l~--Ngt~vtDl~WdPF--------D~~rLAVa~ddg~i~lWr~~a 659 (1012)
T KOG1445|consen 599 AGSGGVIAIYELN------EPGRLPDGV---MPGLF--NGTLVTDLHWDPF--------DDERLAVATDDGQINLWRLTA 659 (1012)
T ss_pred cCCCceEEEEEcC------CCCCCCccc---ccccc--cCceeeecccCCC--------ChHHeeecccCceEEEEEecc
Confidence 4557888885544 355554431 11112 2346999999995 478999999999999999976
Q ss_pred Cc-------eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCE
Q 000473 611 GN-------LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGY 683 (1471)
Q Consensus 611 g~-------~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~ 683 (1471)
+. +-..+..|...|+++.|+|-. .+.|++++.|.+|+|||+++++....+.||.+.|..++|+|+|+.
T Consensus 660 ~gl~e~~~tPe~~lt~h~eKI~slRfHPLA-----advLa~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~ 734 (1012)
T KOG1445|consen 660 NGLPENEMTPEKILTIHGEKITSLRFHPLA-----ADVLAVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRR 734 (1012)
T ss_pred CCCCcccCCcceeeecccceEEEEEecchh-----hhHhhhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcc
Confidence 43 456788999999999999974 679999999999999999999999999999999999999999999
Q ss_pred EEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 684 IACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 684 L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
+++-|.| |+|+||+.++++
T Consensus 735 ~AtVcKD--------g~~rVy~Prs~e 753 (1012)
T KOG1445|consen 735 IATVCKD--------GTLRVYEPRSRE 753 (1012)
T ss_pred eeeeecC--------ceEEEeCCCCCC
Confidence 9999999 999999998875
No 167
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=99.20 E-value=1.1e-10 Score=133.24 Aligned_cols=156 Identities=20% Similarity=0.210 Sum_probs=129.4
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++|+.|..|++|... .+.+..+ ..+-..+..|..|+..|+++.|+|+ +++|+||+.+|.|.+|
T Consensus 28 ~laT~G~D~~iriW~v~--r~~~~~~----~~~V~y~s~Ls~H~~aVN~vRf~p~---------gelLASg~D~g~v~lW 92 (434)
T KOG1009|consen 28 KLATAGGDKDIRIWKVN--RSEPGGG----DMKVEYLSSLSRHTRAVNVVRFSPD---------GELLASGGDGGEVFLW 92 (434)
T ss_pred ceecccCccceeeeeee--ecCCCCC----ceeEEEeecccCCcceeEEEEEcCC---------cCeeeecCCCceEEEE
Confidence 69999999999995543 2110000 0122456678899999999999998 8999999999999999
Q ss_pred ECC--------C--------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 607 DLG--------S--------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 607 Dl~--------t--------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
-.. + +....++.+|...|..++|+|+ ++.+++++.|.++++||+..|..+..+.+|..
T Consensus 93 k~~~~~~~~~d~e~~~~ke~w~v~k~lr~h~~diydL~Ws~d------~~~l~s~s~dns~~l~Dv~~G~l~~~~~dh~~ 166 (434)
T KOG1009|consen 93 KQGDVRIFDADTEADLNKEKWVVKKVLRGHRDDIYDLAWSPD------SNFLVSGSVDNSVRLWDVHAGQLLAILDDHEH 166 (434)
T ss_pred EecCcCCccccchhhhCccceEEEEEecccccchhhhhccCC------CceeeeeeccceEEEEEeccceeEeecccccc
Confidence 876 3 2234677889999999999999 89999999999999999999999999999999
Q ss_pred CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 671 YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
.|.-++|.|..+|+++-+.| ...++.++...+.
T Consensus 167 yvqgvawDpl~qyv~s~s~d--------r~~~~~~~~~~~~ 199 (434)
T KOG1009|consen 167 YVQGVAWDPLNQYVASKSSD--------RHPEGFSAKLKQV 199 (434)
T ss_pred ccceeecchhhhhhhhhccC--------cccceeeeeeeee
Confidence 99999999999999999988 6677777655443
No 168
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.18 E-value=1.1e-09 Score=121.84 Aligned_cols=217 Identities=15% Similarity=0.190 Sum_probs=155.0
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
++.++.+.-|++||....+. ++....-.|-..+++.- ...|+|+ .|++|+ ...|+| +++ .++++-.
T Consensus 126 ~a~ssr~~PIh~wdaftG~l--raSy~~ydh~de~taAh---sL~Fs~DGeqlfaGy-krcirv--Fdt----~RpGr~c 193 (406)
T KOG2919|consen 126 FAVSSRDQPIHLWDAFTGKL--RASYRAYDHQDEYTAAH---SLQFSPDGEQLFAGY-KRCIRV--FDT----SRPGRDC 193 (406)
T ss_pred eeeccccCceeeeecccccc--ccchhhhhhHHhhhhhe---eEEecCCCCeEeecc-cceEEE--eec----cCCCCCC
Confidence 56677778899999887654 22223334555555421 1237777 666665 578999 442 1233322
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
+..+-.. +--.|..+.+.|++|+|+ +...++.|+.-.++-++.-..+.++..+.+|.+.|+.+.|.++
T Consensus 194 ~vy~t~~-~~k~gq~giisc~a~sP~--------~~~~~a~gsY~q~~giy~~~~~~pl~llggh~gGvThL~~~ed--- 261 (406)
T KOG2919|consen 194 PVYTTVT-KGKFGQKGIISCFAFSPM--------DSKTLAVGSYGQRVGIYNDDGRRPLQLLGGHGGGVTHLQWCED--- 261 (406)
T ss_pred cchhhhh-cccccccceeeeeeccCC--------CCcceeeecccceeeeEecCCCCceeeecccCCCeeeEEeccC---
Confidence 2211000 001245778999999998 3678999999999999988889999999999999999999999
Q ss_pred CCCCCEEEEEeC-CCcEEEEECCC-CcEEEEecCCCC-CcEEEEE--cCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-C
Q 000473 636 HPWSDCFLSVGE-DFSVALASLET-LRVERMFPGHPN-YPAKVVW--DCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-G 709 (1471)
Q Consensus 636 ~~~~~~l~S~s~-DgsV~lWdl~t-~~~l~~~~gh~~-~V~~v~~--spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g 709 (1471)
|+.|.+|+. |-.|..||++. +.++..+.+|.. .-..|-| .|++++|++|+.| |.|++||+++ |
T Consensus 262 ---Gn~lfsGaRk~dkIl~WDiR~~~~pv~~L~rhv~~TNQRI~FDld~~~~~LasG~td--------G~V~vwdlk~~g 330 (406)
T KOG2919|consen 262 ---GNKLFSGARKDDKILCWDIRYSRDPVYALERHVGDTNQRILFDLDPKGEILASGDTD--------GSVRVWDLKDLG 330 (406)
T ss_pred ---cCeecccccCCCeEEEEeehhccchhhhhhhhccCccceEEEecCCCCceeeccCCC--------ccEEEEecCCCC
Confidence 899999875 78899999985 456777888876 4456666 6889999999998 9999999998 7
Q ss_pred eEEEEEeCCCCCceeeeeee
Q 000473 710 ARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 710 ~~~~~l~gH~~~v~~~~~~~ 729 (1471)
....++..|...+..+.+.|
T Consensus 331 n~~sv~~~~sd~vNgvslnP 350 (406)
T KOG2919|consen 331 NEVSVTGNYSDTVNGVSLNP 350 (406)
T ss_pred CcccccccccccccceecCc
Confidence 76777766766555554443
No 169
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=99.15 E-value=4.4e-09 Score=124.39 Aligned_cols=198 Identities=24% Similarity=0.311 Sum_probs=155.0
Q ss_pred cCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEc-CCcEEEEEecccccCCCCCCccccCCcce
Q 000473 484 QDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFF-SGEIEVIQFDLFERHNSPGASLKVNSHVS 562 (1471)
Q Consensus 484 ~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~-DG~I~V~~~~~l~~~d~~~~~~d~~s~~~ 562 (1471)
++.+++|+... .......+..|...|.++.+.+... .++.+.. |+.+++ |+ ...+..
T Consensus 133 d~~~~~~~~~~---~~~~~~~~~~~~~~v~~~~~~~~~~----~~~~~~~~~~~~~~--~~-------------~~~~~~ 190 (466)
T COG2319 133 DGTVKLWDLST---PGKLIRTLEGHSESVTSLAFSPDGK----LLASGSSLDGTIKL--WD-------------LRTGKP 190 (466)
T ss_pred CccEEEEEecC---CCeEEEEEecCcccEEEEEECCCCC----EEEecCCCCCceEE--EE-------------cCCCce
Confidence 55778888765 1233455678888999877555443 5777765 999998 33 333567
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCC-EEEEEECCCcEEEEECCCCceEE-EEeccCCCEEEEEECCCCCCCCCCC
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNE-VLVSGSMDCSIRIWDLGSGNLIT-VMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~-~L~SGs~DgtI~lWDl~tg~~l~-~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
...+.+|...|.+++|+|+ +. .+++++.|++|++||...+..+. .+.+|...+ ...|+|+ +.
T Consensus 191 ~~~~~~~~~~v~~~~~~~~---------~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~-~~~~~~~------~~ 254 (466)
T COG2319 191 LSTLAGHTDPVSSLAFSPD---------GGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPD------GS 254 (466)
T ss_pred EEeeccCCCceEEEEEcCC---------cceEEEEecCCCcEEEEECCCCcEEeeecCCCCcce-eEeECCC------CC
Confidence 7888899999999999986 55 66666999999999999888888 788998875 4489998 77
Q ss_pred EEEEEeCCCcEEEEECCCCcE-EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe--C
Q 000473 641 CFLSVGEDFSVALASLETLRV-ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR--G 717 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~t~~~-l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~--g 717 (1471)
.+++++.|+.+++|+++.... +..+.+|...+.++.|.|++..+++++.| +.+.+||..++....... +
T Consensus 255 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~d--------~~~~~~~~~~~~~~~~~~~~~ 326 (466)
T COG2319 255 LLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSD--------GTVRLWDLETGKLLSSLTLKG 326 (466)
T ss_pred EEEEecCCCcEEEeeecCCCcEEEEEecCCccEEEEEECCCCCEEEEeeCC--------CcEEEEEcCCCceEEEeeecc
Confidence 888999999999999987665 55557888999999999999999987777 789999999988777776 7
Q ss_pred CCCCceeeee
Q 000473 718 TASHSMFDHF 727 (1471)
Q Consensus 718 H~~~v~~~~~ 727 (1471)
|...+....+
T Consensus 327 ~~~~~~~~~~ 336 (466)
T COG2319 327 HEGPVSSLSF 336 (466)
T ss_pred cCCceEEEEE
Confidence 7764444433
No 170
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=99.15 E-value=2.5e-10 Score=130.46 Aligned_cols=138 Identities=19% Similarity=0.292 Sum_probs=119.5
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc---------eEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN---------LITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~---------~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
..+|..+.|+++ ....++||+.|..|++|-+..++ .+..+..|...|+.+.|+|+ |+
T Consensus 13 ~~pv~s~dfq~n--------~~~~laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~------ge 78 (434)
T KOG1009|consen 13 HEPVYSVDFQKN--------SLNKLATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPD------GE 78 (434)
T ss_pred CCceEEEEeccC--------cccceecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCC------cC
Confidence 357899999886 24599999999999999987432 34677889999999999999 99
Q ss_pred EEEEEeCCCcEEEEECC--------C--------CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 641 CFLSVGEDFSVALASLE--------T--------LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~--------t--------~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
.++||++++.|.+|-.. + ....+.+.+|...|..++|+|++.++++++.| .++++|
T Consensus 79 lLASg~D~g~v~lWk~~~~~~~~~d~e~~~~ke~w~v~k~lr~h~~diydL~Ws~d~~~l~s~s~d--------ns~~l~ 150 (434)
T KOG1009|consen 79 LLASGGDGGEVFLWKQGDVRIFDADTEADLNKEKWVVKKVLRGHRDDIYDLAWSPDSNFLVSGSVD--------NSVRLW 150 (434)
T ss_pred eeeecCCCceEEEEEecCcCCccccchhhhCccceEEEEEecccccchhhhhccCCCceeeeeecc--------ceEEEE
Confidence 99999999999999765 2 23456778999999999999999999999999 999999
Q ss_pred ECCCCeEEEEEeCCCCCceeeeeee
Q 000473 705 DVKTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 705 Di~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
|+..|+++..+.+|..-+-++.+.+
T Consensus 151 Dv~~G~l~~~~~dh~~yvqgvawDp 175 (434)
T KOG1009|consen 151 DVHAGQLLAILDDHEHYVQGVAWDP 175 (434)
T ss_pred EeccceeEeeccccccccceeecch
Confidence 9999999999999998887775553
No 171
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.15 E-value=2.1e-09 Score=119.73 Aligned_cols=191 Identities=16% Similarity=0.171 Sum_probs=138.5
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEe--cCCcc---EEEEEEecCCCCcccCcCCCEEEEEECCC
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFL--GHTGA---VLCLAAHRMVGTAKGWSFNEVLVSGSMDC 601 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~--gH~~~---V~~la~spd~~~~~~~~~~~~L~SGs~Dg 601 (1471)
.+++.+.+.-|.+ ||.-+|+....+. .|.+. ..||+|+|| |..|+.| ...
T Consensus 125 l~a~ssr~~PIh~---------------wdaftG~lraSy~~ydh~de~taAhsL~Fs~D---------GeqlfaG-ykr 179 (406)
T KOG2919|consen 125 LFAVSSRDQPIHL---------------WDAFTGKLRASYRAYDHQDEYTAAHSLQFSPD---------GEQLFAG-YKR 179 (406)
T ss_pred eeeeccccCceee---------------eeccccccccchhhhhhHHhhhhheeEEecCC---------CCeEeec-ccc
Confidence 5666667777777 3444555555443 35443 468999998 8888877 567
Q ss_pred cEEEEEC-CCCceEEE-------EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcE
Q 000473 602 SIRIWDL-GSGNLITV-------MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 602 tI~lWDl-~tg~~l~~-------~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
+|+++|+ +.|.-... -.+..+-|.+++|+|.. ..+++.++.-..+-|+.-..++++..+.||.+.|+
T Consensus 180 cirvFdt~RpGr~c~vy~t~~~~k~gq~giisc~a~sP~~-----~~~~a~gsY~q~~giy~~~~~~pl~llggh~gGvT 254 (406)
T KOG2919|consen 180 CIRVFDTSRPGRDCPVYTTVTKGKFGQKGIISCFAFSPMD-----SKTLAVGSYGQRVGIYNDDGRRPLQLLGGHGGGVT 254 (406)
T ss_pred eEEEeeccCCCCCCcchhhhhcccccccceeeeeeccCCC-----CcceeeecccceeeeEecCCCCceeeecccCCCee
Confidence 8999999 55542211 12346789999999984 66999999999999999999999999999999999
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCC-CceeeeeeeccccccccceEEcCCccccccc
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTAS-HSMFDHFCKGISMNSISGSVLNGNTSVSSLL 751 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~-~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l 751 (1471)
.+.|.++|+.|++|..- +..|..||++. +..+-.|.+|.. ..-.+.|.- ...+..+.+|++
T Consensus 255 hL~~~edGn~lfsGaRk-------~dkIl~WDiR~~~~pv~~L~rhv~~TNQRI~FDl----d~~~~~LasG~t------ 317 (406)
T KOG2919|consen 255 HLQWCEDGNKLFSGARK-------DDKILCWDIRYSRDPVYALERHVGDTNQRILFDL----DPKGEILASGDT------ 317 (406)
T ss_pred eEEeccCcCeecccccC-------CCeEEEEeehhccchhhhhhhhccCccceEEEec----CCCCceeeccCC------
Confidence 99999999999999875 36999999985 566777777775 222333331 112222233333
Q ss_pred eeeccCCceEeecccccc
Q 000473 752 LPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 752 ~~~~~D~tir~w~l~~~~ 769 (1471)
||.|++|+++.+-
T Consensus 318 -----dG~V~vwdlk~~g 330 (406)
T KOG2919|consen 318 -----DGSVRVWDLKDLG 330 (406)
T ss_pred -----CccEEEEecCCCC
Confidence 8999999987644
No 172
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=99.14 E-value=7.3e-09 Score=122.56 Aligned_cols=202 Identities=20% Similarity=0.292 Sum_probs=147.6
Q ss_pred ccCccccccccCCCCCCCccccccccC-ccEEEEEeeccccccCCEEEE-EEcCCcEEEEEecccccCCCCCCccccCC-
Q 000473 483 CQDTVPRSEHVDSRQAGDGRDDFVHKE-KIVSSSMVISESFYAPYAIVY-GFFSGEIEVIQFDLFERHNSPGASLKVNS- 559 (1471)
Q Consensus 483 ~~~~v~~Wd~~~~~~~g~~~~~~~~h~-~~Vts~~~is~~~f~P~~lv~-Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s- 559 (1471)
.++.+.+|+..... .....+..+. ..+....+. ..... ..++. +..|+.+.+|+. ..
T Consensus 85 ~d~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~d~~~~~~~~---------------~~~ 144 (466)
T COG2319 85 SDGTIKLWDLDNGE---KLIKSLEGLHDSSVSKLALS-SPDGN-SILLASSSLDGTVKLWDL---------------STP 144 (466)
T ss_pred CCCcEEEEEcCCCc---eeEEEEeccCCCceeeEEEE-CCCcc-eEEeccCCCCccEEEEEe---------------cCC
Confidence 45667777766643 1122223322 244444331 11111 12333 344888888433 22
Q ss_pred cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC-CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCC
Q 000473 560 HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM-DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 560 ~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~-DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
......+.+|...|.+++|+|+ +..+++++. |+.+++|++..+..+..+.+|...|.+++|+|+
T Consensus 145 ~~~~~~~~~~~~~v~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~------ 209 (466)
T COG2319 145 GKLIRTLEGHSESVTSLAFSPD---------GKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPD------ 209 (466)
T ss_pred CeEEEEEecCcccEEEEEECCC---------CCEEEecCCCCCceEEEEcCCCceEEeeccCCCceEEEEEcCC------
Confidence 4667788999999999999997 668888885 999999999998999999999999999999988
Q ss_pred CC-EEEEEeCCCcEEEEECCCCcEEE-EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE-EEEE
Q 000473 639 SD-CFLSVGEDFSVALASLETLRVER-MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR-ERVL 715 (1471)
Q Consensus 639 ~~-~l~S~s~DgsV~lWdl~t~~~l~-~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~-~~~l 715 (1471)
+. .+++++.|+.|++||...+..+. .+.+|...+ ...|+|++.++++++.| +.+++||++.... ...+
T Consensus 210 ~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d--------~~~~~~~~~~~~~~~~~~ 280 (466)
T COG2319 210 GGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSD--------GTIRLWDLRSSSSLLRTL 280 (466)
T ss_pred cceEEEEecCCCcEEEEECCCCcEEeeecCCCCcce-eEeECCCCCEEEEecCC--------CcEEEeeecCCCcEEEEE
Confidence 66 66666999999999999888888 788998875 44899999899988888 9999999987664 5555
Q ss_pred eCCCCCceeeeee
Q 000473 716 RGTASHSMFDHFC 728 (1471)
Q Consensus 716 ~gH~~~v~~~~~~ 728 (1471)
.+|...+....+.
T Consensus 281 ~~~~~~v~~~~~~ 293 (466)
T COG2319 281 SGHSSSVLSVAFS 293 (466)
T ss_pred ecCCccEEEEEEC
Confidence 7786666665455
No 173
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=99.13 E-value=2e-09 Score=118.30 Aligned_cols=188 Identities=16% Similarity=0.187 Sum_probs=132.9
Q ss_pred ecccccCccccccccCCCCCCCcc--ccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGR--DDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGA 553 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~--~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~ 553 (1471)
.++..+..+.+|+........... ..-.++....++- .++|. .-+....|+++.. ||
T Consensus 138 lasm~dn~i~l~~l~ess~~vaev~ss~s~e~~~~ftsg------~WspHHdgnqv~tt~d~tl~~--~D---------- 199 (370)
T KOG1007|consen 138 LASMDDNNIVLWSLDESSKIVAEVLSSESAEMRHSFTSG------AWSPHHDGNQVATTSDSTLQF--WD---------- 199 (370)
T ss_pred eEEeccCceEEEEcccCcchheeecccccccccceeccc------ccCCCCccceEEEeCCCcEEE--EE----------
Confidence 445557888999987765421111 0112233344443 36663 3444456788887 33
Q ss_pred ccccCCcceEEEE-ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC-CceEEEEeccCCCEEEEEECC
Q 000473 554 SLKVNSHVSRQYF-LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS-GNLITVMHHHVAPVRQIILSP 631 (1471)
Q Consensus 554 ~~d~~s~~~~~~l-~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t-g~~l~~~~~H~~~V~~l~fsp 631 (1471)
..+......+ ..|...|..+.|.|+. ..+|+||+.||.|++||.+. ..++..+.+|..-|+++.|+|
T Consensus 200 ---~RT~~~~~sI~dAHgq~vrdlDfNpnk--------q~~lvt~gDdgyvriWD~R~tk~pv~el~~HsHWvW~VRfn~ 268 (370)
T KOG1007|consen 200 ---LRTMKKNNSIEDAHGQRVRDLDFNPNK--------QHILVTCGDDGYVRIWDTRKTKFPVQELPGHSHWVWAVRFNP 268 (370)
T ss_pred ---ccchhhhcchhhhhcceeeeccCCCCc--------eEEEEEcCCCccEEEEeccCCCccccccCCCceEEEEEEecC
Confidence 3333333333 5688899999999972 67999999999999999974 557899999999999999999
Q ss_pred CCCCCCCCCEEEEEeCCCcEEEEECCCC-----------------------------cEEEEecCCCCCcEEEEEcCCCC
Q 000473 632 PQTEHPWSDCFLSVGEDFSVALASLETL-----------------------------RVERMFPGHPNYPAKVVWDCPRG 682 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~DgsV~lWdl~t~-----------------------------~~l~~~~gh~~~V~~v~~spdg~ 682 (1471)
.. .++++|||.|..|.||....- ..+.++..|...|++++|+..+.
T Consensus 269 ~h-----dqLiLs~~SDs~V~Lsca~svSSE~qi~~~~dese~e~~dseer~kpL~dg~l~tydehEDSVY~~aWSsadP 343 (370)
T KOG1007|consen 269 EH-----DQLILSGGSDSAVNLSCASSVSSEQQIEFEDDESESEDEDSEERVKPLQDGQLETYDEHEDSVYALAWSSADP 343 (370)
T ss_pred cc-----ceEEEecCCCceeEEEeccccccccccccccccccCcchhhHHhcccccccccccccccccceEEEeeccCCC
Confidence 86 789999999999999965320 13446778999999999999888
Q ss_pred EEE-EEEcCCCCCCCCCCEEEEEECCC
Q 000473 683 YIA-CLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 683 ~L~-sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+++ +-+.| |.+.|=.+..
T Consensus 344 WiFASLSYD--------GRviIs~V~r 362 (370)
T KOG1007|consen 344 WIFASLSYD--------GRVIISSVPR 362 (370)
T ss_pred eeEEEeccC--------ceEEeecCCh
Confidence 765 44555 8888866543
No 174
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=99.13 E-value=3.4e-11 Score=146.29 Aligned_cols=190 Identities=18% Similarity=0.238 Sum_probs=144.0
Q ss_pred cccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecC
Q 000473 502 RDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRM 581 (1471)
Q Consensus 502 ~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd 581 (1471)
+..+.+|...|.|..+-.... ++++|+.|..++||. .++..+...+.||.+.++.++.+..
T Consensus 183 ikrLlgH~naVyca~fDrtg~----~Iitgsdd~lvKiwS---------------~et~~~lAs~rGhs~ditdlavs~~ 243 (1113)
T KOG0644|consen 183 IKRLLGHRNAVYCAIFDRTGR----YIITGSDDRLVKIWS---------------METARCLASCRGHSGDITDLAVSSN 243 (1113)
T ss_pred HHHHHhhhhheeeeeeccccc----eEeecCccceeeeee---------------ccchhhhccCCCCccccchhccchh
Confidence 445679999999988666665 799999999999933 4566788889999999999999875
Q ss_pred CCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC--
Q 000473 582 VGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-- 659 (1471)
Q Consensus 582 ~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-- 659 (1471)
+.+++++|.|+.|++|-+.++.++..+.+|+|.|++++|+|- . +.+.||++++||.+-.
T Consensus 244 ---------n~~iaaaS~D~vIrvWrl~~~~pvsvLrghtgavtaiafsP~------~----sss~dgt~~~wd~r~~~~ 304 (1113)
T KOG0644|consen 244 ---------NTMIAAASNDKVIRVWRLPDGAPVSVLRGHTGAVTAIAFSPR------A----SSSDDGTCRIWDARLEPR 304 (1113)
T ss_pred ---------hhhhhhcccCceEEEEecCCCchHHHHhccccceeeeccCcc------c----cCCCCCceEecccccccc
Confidence 688999999999999999999999999999999999999997 2 7788999999998710
Q ss_pred ------------cEEEEec----------C------CCCCcEEEEEcCCCCEEEEEEcCCC---CCCCCCCEEEEEECCC
Q 000473 660 ------------RVERMFP----------G------HPNYPAKVVWDCPRGYIACLCRDHS---RTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 660 ------------~~l~~~~----------g------h~~~V~~v~~spdg~~L~sgs~D~s---g~~D~~gtV~VWDi~t 708 (1471)
..+..+. + ......+++|...+-.+++++.|.+ -+...+-.+.+|++.+
T Consensus 305 ~y~prp~~~~~~~~~~s~~~~~~~~~f~Tgs~d~ea~n~e~~~l~~~~~~lif~t~ssd~~~~~~~ar~~~~~~vwnl~~ 384 (1113)
T KOG0644|consen 305 IYVPRPLKFTEKDLVDSILFENNGDRFLTGSRDGEARNHEFEQLAWRSNLLIFVTRSSDLSSIVVTARNDHRLCVWNLYT 384 (1113)
T ss_pred ccCCCCCCcccccceeeeeccccccccccccCCcccccchhhHhhhhccceEEEeccccccccceeeeeeeEeeeeeccc
Confidence 1111100 0 0012334555555555555554411 1112226789999999
Q ss_pred CeEEEEEeCCCCCceeeeeee
Q 000473 709 GARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 709 g~~~~~l~gH~~~v~~~~~~~ 729 (1471)
|.+++.+.||...+....++|
T Consensus 385 g~l~H~l~ghsd~~yvLd~Hp 405 (1113)
T KOG0644|consen 385 GQLLHNLMGHSDEVYVLDVHP 405 (1113)
T ss_pred chhhhhhcccccceeeeeecC
Confidence 999999999999888887775
No 175
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=99.12 E-value=1e-08 Score=127.28 Aligned_cols=168 Identities=20% Similarity=0.269 Sum_probs=111.9
Q ss_pred CCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccc
Q 000473 13 TPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSS 92 (1471)
Q Consensus 13 ~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~ 92 (1471)
.-++..++|+++||.++++|+|..||+|.+|.=... ..+-.....|.=|..+|++|+
T Consensus 202 ~~Htf~~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~-~~~~~t~t~lHWH~~~V~~L~---------------------- 258 (792)
T KOG1963|consen 202 VHHTFNITCVALSPNERYLAAGDSDGRILVWRDFGS-SDDSETCTLLHWHHDEVNSLS---------------------- 258 (792)
T ss_pred hhhcccceeEEeccccceEEEeccCCcEEEEecccc-ccccccceEEEecccccceeE----------------------
Confidence 344445799999999999999999999999975410 011223345667999999999
Q ss_pred cccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccc
Q 000473 93 NVMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVE 172 (1471)
Q Consensus 93 ~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~ 172 (1471)
|+++|.+|.||+..|.+.+|.+.+++ ... | |+-|+|..-...++|+.+.+.-+.
T Consensus 259 -----fS~~G~~LlSGG~E~VLv~Wq~~T~~-kqf--L-PRLgs~I~~i~vS~ds~~~sl~~~----------------- 312 (792)
T KOG1963|consen 259 -----FSSDGAYLLSGGREGVLVLWQLETGK-KQF--L-PRLGSPILHIVVSPDSDLYSLVLE----------------- 312 (792)
T ss_pred -----EecCCceEeecccceEEEEEeecCCC-ccc--c-cccCCeeEEEEEcCCCCeEEEEec-----------------
Confidence 78999999999999999999999998 221 3 344777665566677766554432
Q ss_pred ccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccc--------cCCeEEEEEeeecCCCCceeEEEEeCCCcEEE
Q 000473 173 GDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLS--------IGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQL 244 (1471)
Q Consensus 173 ~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s--------~~~i~~~~~~~~~~d~~~~~llvas~dG~V~v 244 (1471)
+..|.+....++++..++.+-... -+-.+.+.+.| +-+.++.-+..|.|++
T Consensus 313 ----------------DNqI~li~~~dl~~k~tIsgi~~~~~~~k~~~~~l~t~~~idp-----r~~~~vln~~~g~vQ~ 371 (792)
T KOG1963|consen 313 ----------------DNQIHLIKASDLEIKSTISGIKPPTPSTKTRPQSLTTGVSIDP-----RTNSLVLNGHPGHVQF 371 (792)
T ss_pred ----------------CceEEEEeccchhhhhhccCccCCCccccccccccceeEEEcC-----CCCceeecCCCceEEE
Confidence 145555555555554444320000 01123444442 1133555578899999
Q ss_pred EECCCC
Q 000473 245 VPISKE 250 (1471)
Q Consensus 245 W~l~~~ 250 (1471)
+|+-+.
T Consensus 372 ydl~td 377 (792)
T KOG1963|consen 372 YDLYTD 377 (792)
T ss_pred Eecccc
Confidence 999876
No 176
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.09 E-value=1.2e-09 Score=122.82 Aligned_cols=190 Identities=14% Similarity=0.139 Sum_probs=135.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++++..+|+|++++|. +++.+..+++|...++.+.|..+. ..+.+.||+.||+|++|
T Consensus 42 ~vav~lSngsv~lyd~~---------------tg~~l~~fk~~~~~~N~vrf~~~d-------s~h~v~s~ssDG~Vr~w 99 (376)
T KOG1188|consen 42 AVAVSLSNGSVRLYDKG---------------TGQLLEEFKGPPATTNGVRFISCD-------SPHGVISCSSDGTVRLW 99 (376)
T ss_pred eEEEEecCCeEEEEecc---------------chhhhheecCCCCcccceEEecCC-------CCCeeEEeccCCeEEEE
Confidence 58899999999996554 467788899999999999997541 27889999999999999
Q ss_pred ECCCCceE--EEEeccC-CCEEEEEECCCCCCCCCCCEEEEEe----CCCcEEEEECCCCcE-EEE-ecCCCCCcEEEEE
Q 000473 607 DLGSGNLI--TVMHHHV-APVRQIILSPPQTEHPWSDCFLSVG----EDFSVALASLETLRV-ERM-FPGHPNYPAKVVW 677 (1471)
Q Consensus 607 Dl~tg~~l--~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~s----~DgsV~lWdl~t~~~-l~~-~~gh~~~V~~v~~ 677 (1471)
|+++.... ..+.+|. .+..+++..-. ++.+++|. .|-.|.+||++..+. ++. ...|...|++++|
T Consensus 100 D~Rs~~e~a~~~~~~~~~~~f~~ld~nck------~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrF 173 (376)
T KOG1188|consen 100 DIRSQAESARISWTQQSGTPFICLDLNCK------KNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHNDDVTQLRF 173 (376)
T ss_pred EeecchhhhheeccCCCCCcceEeeccCc------CCeEEeccccccCceEEEEEEeccccchhhhhhhhccCcceeEEe
Confidence 99976654 3444555 45666666544 67888874 478899999998776 443 4689999999999
Q ss_pred cCCC-CEEEEEEcCCCCCCCCCCEEEEEECCCCeE----EEEEeCCCCCceeeeeeeccccccccceEEcCCccccccce
Q 000473 678 DCPR-GYIACLCRDHSRTSDAVDVLFIWDVKTGAR----ERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLL 752 (1471)
Q Consensus 678 spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~----~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~ 752 (1471)
+|.. +.|++|+.| |-|.|+|++.-.. +.++. |.+.+-++.| ...+--.++
T Consensus 174 HP~~pnlLlSGSvD--------GLvnlfD~~~d~EeDaL~~viN-~~sSI~~igw----------------~~~~ykrI~ 228 (376)
T KOG1188|consen 174 HPSDPNLLLSGSVD--------GLVNLFDTKKDNEEDALLHVIN-HGSSIHLIGW----------------LSKKYKRIM 228 (376)
T ss_pred cCCCCCeEEeeccc--------ceEEeeecCCCcchhhHHHhhc-ccceeeeeee----------------ecCCcceEE
Confidence 9976 477888888 9999999875422 12221 1111111111 111212466
Q ss_pred eeccCCceEeecccccc
Q 000473 753 PIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 753 ~~~~D~tir~w~l~~~~ 769 (1471)
.+..+.++..|+++.-.
T Consensus 229 clTH~Etf~~~ele~~~ 245 (376)
T KOG1188|consen 229 CLTHMETFAIYELEDGS 245 (376)
T ss_pred EEEccCceeEEEccCCC
Confidence 77778899999886533
No 177
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=99.06 E-value=2.2e-09 Score=118.11 Aligned_cols=212 Identities=19% Similarity=0.190 Sum_probs=153.9
Q ss_pred ccCccEEEEEeecccc-ccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE-----ecCCccEEEEEEec
Q 000473 507 HKEKIVSSSMVISESF-YAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF-----LGHTGAVLCLAAHR 580 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~-f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l-----~gH~~~V~~la~sp 580 (1471)
.|.-.++.++++++.. --|+.|++. +..+++|+...-. ... .+...| ..|..+++++.|..
T Consensus 94 d~~YP~tK~~wiPd~~g~~pdlLATs--~D~LRlWri~~ee---~~~--------~~~~~L~~~kns~~~aPlTSFDWne 160 (364)
T KOG0290|consen 94 DHPYPVTKLMWIPDSKGVYPDLLATS--SDFLRLWRIGDEE---SRV--------ELQSVLNNNKNSEFCAPLTSFDWNE 160 (364)
T ss_pred CCCCCccceEecCCccccCcchhhcc--cCeEEEEeccCcC---Cce--------ehhhhhccCcccccCCccccccccc
Confidence 4777888888877763 445566653 4578885543100 000 111111 24567899999976
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCc---eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN---LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE 657 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~---~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~ 657 (1471)
- +.+++.+.|-|-|..+||+++|. ....+-.|..+|..++|.... .+.|+|+|.||+||++|++
T Consensus 161 ~--------dp~~igtSSiDTTCTiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s-----~~~FASvgaDGSvRmFDLR 227 (364)
T KOG0290|consen 161 V--------DPNLIGTSSIDTTCTIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGS-----RDVFASVGADGSVRMFDLR 227 (364)
T ss_pred C--------CcceeEeecccCeEEEEEEeeccccceeeEEEecCcceeEEEeccCc-----cceEEEecCCCcEEEEEec
Confidence 4 37899999999999999999874 356778999999999999863 5699999999999999998
Q ss_pred CCcE---EEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCCCceeeeeeeccc
Q 000473 658 TLRV---ERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTASHSMFDHFCKGIS 732 (1471)
Q Consensus 658 t~~~---l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~~v~~~~~~~~~~ 732 (1471)
..+- +..=+....+...++|++.+ +|+++-..| ...|.|-|++. ...+..|++|++.|..+.+.|.-
T Consensus 228 ~leHSTIIYE~p~~~~pLlRLswnkqDpnymATf~~d-------S~~V~iLDiR~P~tpva~L~~H~a~VNgIaWaPhS- 299 (364)
T KOG0290|consen 228 SLEHSTIIYEDPSPSTPLLRLSWNKQDPNYMATFAMD-------SNKVVILDIRVPCTPVARLRNHQASVNGIAWAPHS- 299 (364)
T ss_pred ccccceEEecCCCCCCcceeeccCcCCchHHhhhhcC-------CceEEEEEecCCCcceehhhcCcccccceEecCCC-
Confidence 7653 22223334578899998865 477776666 26899999987 56789999999999998888532
Q ss_pred cccccceEEcCCccccccceeeccCCceEeecccc
Q 000473 733 MNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 733 ~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~ 767 (1471)
++.+.++..|.....|+++.
T Consensus 300 ---------------~~hictaGDD~qaliWDl~q 319 (364)
T KOG0290|consen 300 ---------------SSHICTAGDDCQALIWDLQQ 319 (364)
T ss_pred ---------------CceeeecCCcceEEEEeccc
Confidence 23445666688899999854
No 178
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=99.04 E-value=9.2e-08 Score=114.22 Aligned_cols=278 Identities=15% Similarity=0.092 Sum_probs=166.9
Q ss_pred CCCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccc
Q 000473 12 GTPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENS 91 (1471)
Q Consensus 12 ~~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~ 91 (1471)
+.+-.|+|.-++|-|||..|+.++ +..+.++|.++ ....+.|.||+..|.|++
T Consensus 8 r~~~~hci~d~afkPDGsqL~lAA-g~rlliyD~nd-----G~llqtLKgHKDtVycVA--------------------- 60 (1081)
T KOG1538|consen 8 RDKAEHCINDIAFKPDGTQLILAA-GSRLLVYDTSD-----GTLLQPLKGHKDTVYCVA--------------------- 60 (1081)
T ss_pred hcccccchheeEECCCCceEEEec-CCEEEEEeCCC-----cccccccccccceEEEEE---------------------
Confidence 346678999999999998887664 45788999985 457788999999999999
Q ss_pred ccccccccCCCCEEEEEeCCCeEEEEEcC-CCeEEEeeeCCCCCCCCcEEEEcCCCC-eEEEEcc-----eecccCCccc
Q 000473 92 SNVMGKSSLDNGALISACTDGVLCVWSRS-SGHCRRRRKLPPWVGSPSVICTLPSNP-RYVCIGC-----CFIDTNQLSD 164 (1471)
Q Consensus 92 ~~~~~~~s~d~~~LaSas~DG~I~VWdv~-~G~ci~~~~l~~~~g~~~~i~~~s~~~-~ll~~G~-----~~id~~~~~~ 164 (1471)
.+.||++++||+.|..+.||+-. .|.+. +.|+....+..|.|-. .+++|.- ...+.+....
T Consensus 61 ------ys~dGkrFASG~aDK~VI~W~~klEG~Lk------YSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K 128 (1081)
T KOG1538|consen 61 ------YAKDGKRFASGSADKSVIIWTSKLEGILK------YSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSK 128 (1081)
T ss_pred ------EccCCceeccCCCceeEEEecccccceee------eccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHh
Confidence 56778889999999999999865 33222 2223333444444333 2222221 1112222222
Q ss_pred ccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEee-cCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEE
Q 000473 165 HHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVF-HGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQ 243 (1471)
Q Consensus 165 ~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~-s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~ 243 (1471)
..+--++-++.-.-|+.+...+-.+++|-+-+....+-+..-. .|.-+ +|..+++.|++..|..+-+.+++.+.++.
T Consensus 129 ~kss~R~~~CsWtnDGqylalG~~nGTIsiRNk~gEek~~I~Rpgg~Ns--piwsi~~~p~sg~G~~di~aV~DW~qTLS 206 (1081)
T KOG1538|consen 129 HKSSSRIICCSWTNDGQYLALGMFNGTISIRNKNGEEKVKIERPGGSNS--PIWSICWNPSSGEGRNDILAVADWGQTLS 206 (1081)
T ss_pred hhhheeEEEeeecCCCcEEEEeccCceEEeecCCCCcceEEeCCCCCCC--CceEEEecCCCCCCccceEEEEeccceeE
Confidence 2222334444444455555556667777776654433333222 23334 48999999876556534455569999999
Q ss_pred EEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEEEEcCCCc-ceeeeeeeccee
Q 000473 244 LVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIFRLLGSGS-TIGEICFVDNLF 322 (1471)
Q Consensus 244 vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~~l~d~~~-~ige~~~~~~~l 322 (1471)
.+.+++.- . +.++.+ + =+...+++.++|.+++.++.+. ++.++.... .+|....
T Consensus 207 Fy~LsG~~-----I-----gk~r~L------~---FdP~CisYf~NGEy~LiGGsdk-~L~~fTR~GvrLGTvg~----- 261 (1081)
T KOG1538|consen 207 FYQLSGKQ-----I-----GKDRAL------N---FDPCCISYFTNGEYILLGGSDK-QLSLFTRDGVRLGTVGE----- 261 (1081)
T ss_pred EEEeccee-----e-----cccccC------C---CCchhheeccCCcEEEEccCCC-ceEEEeecCeEEeeccc-----
Confidence 99998651 0 111111 1 1125678889999999887776 333343322 2222210
Q ss_pred EeecCCCCceeeeeEeecchhhhhhcccccccccccceEEEEcCCCcEEEEEeec
Q 000473 323 CLEGGSTNSYVIGAMFLERVVAEKIENTMGVCTTFYENFAVWDNRGSAIVYAISY 377 (1471)
Q Consensus 323 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~vw~~~G~~~vy~l~~ 377 (1471)
...|+=.....++ ...+.++..||..--|++.+
T Consensus 262 -------~D~WIWtV~~~PN---------------sQ~v~~GCqDGTiACyNl~f 294 (1081)
T KOG1538|consen 262 -------QDSWIWTVQAKPN---------------SQYVVVGCQDGTIACYNLIF 294 (1081)
T ss_pred -------cceeEEEEEEccC---------------CceEEEEEccCeeehhhhHH
Confidence 1233333333333 22778888899888888764
No 179
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=99.02 E-value=2.1e-08 Score=125.36 Aligned_cols=191 Identities=15% Similarity=0.107 Sum_probs=131.8
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+..+.....|++||........ ....+-...||++. ...-.-+.++.|+.||.+++++-. .. .
T Consensus 1179 ~Ll~tGd~r~IRIWDa~~E~~~~---diP~~s~t~vTaLS---~~~~~gn~i~AGfaDGsvRvyD~R--~a--------~ 1242 (1387)
T KOG1517|consen 1179 HLLVTGDVRSIRIWDAHKEQVVA---DIPYGSSTLVTALS---ADLVHGNIIAAGFADGSVRVYDRR--MA--------P 1242 (1387)
T ss_pred eEEecCCeeEEEEEecccceeEe---ecccCCCccceeec---ccccCCceEEEeecCCceEEeecc--cC--------C
Confidence 56666667888999987653321 11122334455532 111223489999999999994433 11 1
Q ss_pred cCCcceEEEEecCCcc--EEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE--EEEeccC--C-CEEEEEE
Q 000473 557 VNSHVSRQYFLGHTGA--VLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI--TVMHHHV--A-PVRQIIL 629 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~--V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l--~~~~~H~--~-~V~~l~f 629 (1471)
.+ ..+...+.|+.. |..+.+.+. | -..|+||+.||.|++||++..... .....|. | ..+++..
T Consensus 1243 ~d--s~v~~~R~h~~~~~Iv~~slq~~-----G---~~elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~V 1312 (1387)
T KOG1517|consen 1243 PD--SLVCVYREHNDVEPIVHLSLQRQ-----G---LGELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTV 1312 (1387)
T ss_pred cc--ccceeecccCCcccceeEEeecC-----C---CcceeeeccCCeEEEEecccCcccccceeeeccccCccceeeee
Confidence 11 346677889887 999988774 1 237999999999999999864222 2233333 4 5899999
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC-------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEE
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH-------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLF 702 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh-------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~ 702 (1471)
++. ...+|||+. +.|+||++. |+.+..+..+ .+.+.|++|+|..-.|++|..| .+|.
T Consensus 1313 H~h------apiiAsGs~-q~ikIy~~~-G~~l~~~k~n~~F~~q~~gs~scL~FHP~~~llAaG~~D--------s~V~ 1376 (1387)
T KOG1517|consen 1313 HEH------APIIASGSA-QLIKIYSLS-GEQLNIIKYNPGFMGQRIGSVSCLAFHPHRLLLAAGSAD--------STVS 1376 (1387)
T ss_pred ccC------CCeeeecCc-ceEEEEecC-hhhhcccccCcccccCcCCCcceeeecchhHhhhhccCC--------ceEE
Confidence 888 789999998 999999987 4443333322 2357999999999999999887 8999
Q ss_pred EEECCCC
Q 000473 703 IWDVKTG 709 (1471)
Q Consensus 703 VWDi~tg 709 (1471)
||....+
T Consensus 1377 iYs~~k~ 1383 (1387)
T KOG1517|consen 1377 IYSCEKP 1383 (1387)
T ss_pred EeecCCc
Confidence 9987654
No 180
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=99.02 E-value=8.6e-10 Score=119.94 Aligned_cols=152 Identities=17% Similarity=0.218 Sum_probs=116.7
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++.|+++|.+.+|+..+ + +...+. .++.+.......|.++|.++.|.+- -..=++|+.+..+..|
T Consensus 167 lllaGyEsghvv~wd~S~--~-~~~~~~--~~~~kv~~~~ash~qpvlsldyas~---------~~rGisgga~dkl~~~ 232 (323)
T KOG0322|consen 167 LLLAGYESGHVVIWDLST--G-DKIIQL--PQSSKVESPNASHKQPVLSLDYASS---------CDRGISGGADDKLVMY 232 (323)
T ss_pred EEEEeccCCeEEEEEccC--C-ceeecc--ccccccccchhhccCcceeeeechh---------hcCCcCCCccccceee
Confidence 588999999999954331 1 100000 0111333445679999999999764 3445788888899999
Q ss_pred ECCC--CceE--EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCC
Q 000473 607 DLGS--GNLI--TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRG 682 (1471)
Q Consensus 607 Dl~t--g~~l--~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~ 682 (1471)
+++. +.+. ...+-....|..+.+.|| ++.++|+|.|+.||+|+.++++++..+.-|.+.|++++|+|+..
T Consensus 233 Sl~~s~gslq~~~e~~lknpGv~gvrIRpD------~KIlATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~ 306 (323)
T KOG0322|consen 233 SLNHSTGSLQIRKEITLKNPGVSGVRIRPD------GKILATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCE 306 (323)
T ss_pred eeccccCcccccceEEecCCCccceEEccC------CcEEeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCc
Confidence 9863 3322 222333456888889998 99999999999999999999999999999999999999999999
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
.++.++.| ++|.+|++
T Consensus 307 lmAaaskD--------~rISLWkL 322 (323)
T KOG0322|consen 307 LMAAASKD--------ARISLWKL 322 (323)
T ss_pred hhhhccCC--------ceEEeeec
Confidence 99999999 99999986
No 181
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.01 E-value=3.2e-09 Score=119.40 Aligned_cols=199 Identities=17% Similarity=0.273 Sum_probs=149.5
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLK 556 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d 556 (1471)
.+++..+++.+++++... ++....+++++..++-+.++++. .|+.+.+++.||+|++|+...
T Consensus 42 ~vav~lSngsv~lyd~~t----g~~l~~fk~~~~~~N~vrf~~~d--s~h~v~s~ssDG~Vr~wD~Rs------------ 103 (376)
T KOG1188|consen 42 AVAVSLSNGSVRLYDKGT----GQLLEEFKGPPATTNGVRFISCD--SPHGVISCSSDGTVRLWDIRS------------ 103 (376)
T ss_pred eEEEEecCCeEEEEeccc----hhhhheecCCCCcccceEEecCC--CCCeeEEeccCCeEEEEEeec------------
Confidence 366777888999998776 56777889999999999988876 577999999999999944331
Q ss_pred cCCcceEEEEecCC-ccEEEEEEecCCCCcccCcCCCEEEEEE----CCCcEEEEECCCCce-EEE-EeccCCCEEEEEE
Q 000473 557 VNSHVSRQYFLGHT-GAVLCLAAHRMVGTAKGWSFNEVLVSGS----MDCSIRIWDLGSGNL-ITV-MHHHVAPVRQIIL 629 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~-~~V~~la~spd~~~~~~~~~~~~L~SGs----~DgtI~lWDl~tg~~-l~~-~~~H~~~V~~l~f 629 (1471)
.....+..+.+|. .+-.|++.... ++.++.|. .|-.|.+||++..+. +.. +..|...|++|.|
T Consensus 104 -~~e~a~~~~~~~~~~~f~~ld~nck---------~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrF 173 (376)
T KOG1188|consen 104 -QAESARISWTQQSGTPFICLDLNCK---------KNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHNDDVTQLRF 173 (376)
T ss_pred -chhhhheeccCCCCCcceEeeccCc---------CCeEEeccccccCceEEEEEEeccccchhhhhhhhccCcceeEEe
Confidence 1113344455665 45667765433 67777774 466899999997765 443 4579999999999
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCc---EEEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLR---VERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~---~l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+|.. .+.|+|||.||-|.|+|++... .+....-|...|-++.|..++ +.|.+-+.+ ++.++|+
T Consensus 174 HP~~-----pnlLlSGSvDGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw~~~~ykrI~clTH~--------Etf~~~e 240 (376)
T KOG1188|consen 174 HPSD-----PNLLLSGSVDGLVNLFDTKKDNEEDALLHVINHGSSIHLIGWLSKKYKRIMCLTHM--------ETFAIYE 240 (376)
T ss_pred cCCC-----CCeEEeecccceEEeeecCCCcchhhHHHhhcccceeeeeeeecCCcceEEEEEcc--------CceeEEE
Confidence 9985 7799999999999999997532 222223467789999998876 457787887 8999999
Q ss_pred CCCCeEEEEEe
Q 000473 706 VKTGARERVLR 716 (1471)
Q Consensus 706 i~tg~~~~~l~ 716 (1471)
++.|..+..+.
T Consensus 241 le~~~~~~~~~ 251 (376)
T KOG1188|consen 241 LEDGSEETWLE 251 (376)
T ss_pred ccCCChhhccc
Confidence 99988655444
No 182
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.98 E-value=2.9e-09 Score=125.11 Aligned_cols=138 Identities=15% Similarity=0.206 Sum_probs=115.7
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC--------------C--------------ceEEEEeccC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS--------------G--------------NLITVMHHHV 621 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t--------------g--------------~~l~~~~~H~ 621 (1471)
+..|+|+.|-|. +...++..-.+|.+.++|..- + .++..+.--.
T Consensus 219 ktsvT~ikWvpg--------~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w~~~~ 290 (636)
T KOG2394|consen 219 KSSVTCIKWVPG--------SDSLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARWHIGE 290 (636)
T ss_pred ccceEEEEEEeC--------CCceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCccceeEecc
Confidence 367999999986 266777788899999997641 1 1222233335
Q ss_pred CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEE
Q 000473 622 APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVL 701 (1471)
Q Consensus 622 ~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV 701 (1471)
+.|...+|+|| |++||+++.||.+||+|..+.+.+..+..--+...||+|+|||+||++|++| .-|
T Consensus 291 g~in~f~FS~D------G~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGED--------DLV 356 (636)
T KOG2394|consen 291 GSINEFAFSPD------GKYLATVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGED--------DLV 356 (636)
T ss_pred ccccceeEcCC------CceEEEEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCc--------ceE
Confidence 67889999999 9999999999999999999988877777666779999999999999999999 899
Q ss_pred EEEECCCCeEEEEEeCCCCCceeeeeee
Q 000473 702 FIWDVKTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 702 ~VWDi~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
.||.+..++.+..-.||.+-|..+.|.+
T Consensus 357 tVwSf~erRVVARGqGHkSWVs~VaFDp 384 (636)
T KOG2394|consen 357 TVWSFEERRVVARGQGHKSWVSVVAFDP 384 (636)
T ss_pred EEEEeccceEEEeccccccceeeEeecc
Confidence 9999999999999999999999988875
No 183
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.98 E-value=5.1e-09 Score=115.11 Aligned_cols=164 Identities=15% Similarity=0.137 Sum_probs=117.9
Q ss_pred cCccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 508 KEKIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 508 h~~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
+-+.|.|+. |.|+ .+++-. |..|.+|+.+ .+. ...-.+.. ..-.+|....++-+|+|.|
T Consensus 122 avg~i~cve------w~Pns~klasm~-dn~i~l~~l~--ess---~~vaev~s----s~s~e~~~~ftsg~WspHH--- 182 (370)
T KOG1007|consen 122 AVGKINCVE------WEPNSDKLASMD-DNNIVLWSLD--ESS---KIVAEVLS----SESAEMRHSFTSGAWSPHH--- 182 (370)
T ss_pred HhCceeeEE------EcCCCCeeEEec-cCceEEEEcc--cCc---chheeecc----cccccccceecccccCCCC---
Confidence 445888888 5555 555544 7788884443 110 00001100 0112466677888999864
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEE-eccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC-CcEEE
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVM-HHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET-LRVER 663 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~-~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t-~~~l~ 663 (1471)
+++.+++. .|+++..||+++.++...+ ..|...|..+.|+|+. ..+|+|+++|+.|++||.+. ..++.
T Consensus 183 ----dgnqv~tt-~d~tl~~~D~RT~~~~~sI~dAHgq~vrdlDfNpnk-----q~~lvt~gDdgyvriWD~R~tk~pv~ 252 (370)
T KOG1007|consen 183 ----DGNQVATT-SDSTLQFWDLRTMKKNNSIEDAHGQRVRDLDFNPNK-----QHILVTCGDDGYVRIWDTRKTKFPVQ 252 (370)
T ss_pred ----ccceEEEe-CCCcEEEEEccchhhhcchhhhhcceeeeccCCCCc-----eEEEEEcCCCccEEEEeccCCCcccc
Confidence 35655554 6899999999988766555 4688999999999994 66899999999999999975 55789
Q ss_pred EecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 664 MFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 664 ~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.+++|...|++|.|+|... +|++|+.| ..|.+|...+
T Consensus 253 el~~HsHWvW~VRfn~~hdqLiLs~~SD--------s~V~Lsca~s 290 (370)
T KOG1007|consen 253 ELPGHSHWVWAVRFNPEHDQLILSGGSD--------SAVNLSCASS 290 (370)
T ss_pred ccCCCceEEEEEEecCccceEEEecCCC--------ceeEEEeccc
Confidence 9999999999999999755 55666666 8999998654
No 184
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.97 E-value=6.9e-09 Score=114.25 Aligned_cols=166 Identities=16% Similarity=0.166 Sum_probs=122.6
Q ss_pred cCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCccc
Q 000473 508 KEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKG 587 (1471)
Q Consensus 508 h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~ 587 (1471)
+...+|+... +...|+++.+.+-|-+..||+.. .+ .++.....|-.|...|..++|..+
T Consensus 149 ~~aPlTSFDW---ne~dp~~igtSSiDTTCTiWdie--~~----------~~~~vkTQLIAHDKEV~DIaf~~~------ 207 (364)
T KOG0290|consen 149 FCAPLTSFDW---NEVDPNLIGTSSIDTTCTIWDIE--TG----------VSGTVKTQLIAHDKEVYDIAFLKG------ 207 (364)
T ss_pred cCCccccccc---ccCCcceeEeecccCeEEEEEEe--ec----------cccceeeEEEecCcceeEEEeccC------
Confidence 3455666321 22356789999999999993332 00 122345567899999999999875
Q ss_pred CcCCCEEEEEECCCcEEEEECCCCceEEEEec--c-CCCEEEEEECCCCCCCCCCCEEEEEeCC-CcEEEEECCC-CcEE
Q 000473 588 WSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH--H-VAPVRQIILSPPQTEHPWSDCFLSVGED-FSVALASLET-LRVE 662 (1471)
Q Consensus 588 ~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~--H-~~~V~~l~fspd~~~~~~~~~l~S~s~D-gsV~lWdl~t-~~~l 662 (1471)
..+.++|.|.||+||++|++..+.-..+.. . ..+...++|++.. -+++++...| ..|.+.|++. ..++
T Consensus 208 --s~~~FASvgaDGSvRmFDLR~leHSTIIYE~p~~~~pLlRLswnkqD-----pnymATf~~dS~~V~iLDiR~P~tpv 280 (364)
T KOG0290|consen 208 --SRDVFASVGADGSVRMFDLRSLEHSTIIYEDPSPSTPLLRLSWNKQD-----PNYMATFAMDSNKVVILDIRVPCTPV 280 (364)
T ss_pred --ccceEEEecCCCcEEEEEecccccceEEecCCCCCCcceeeccCcCC-----chHHhhhhcCCceEEEEEecCCCcce
Confidence 268999999999999999987664333322 1 3567888888763 6688888766 5799999986 4578
Q ss_pred EEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 663 RMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
..+.+|.+.|+.++|.|... .|.+++.| ..+.+||+.+.
T Consensus 281 a~L~~H~a~VNgIaWaPhS~~hictaGDD--------~qaliWDl~q~ 320 (364)
T KOG0290|consen 281 ARLRNHQASVNGIAWAPHSSSHICTAGDD--------CQALIWDLQQM 320 (364)
T ss_pred ehhhcCcccccceEecCCCCceeeecCCc--------ceEEEEecccc
Confidence 89999999999999999865 56666665 89999999753
No 185
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.97 E-value=2.9e-08 Score=123.34 Aligned_cols=258 Identities=14% Similarity=0.120 Sum_probs=169.6
Q ss_pred eeeEEEeecceeeEEeeeeeccccccccccCeeEEEEccccCCCCCcceeEeccC--C--ceEeeccccccccCCCCccc
Q 000473 400 KFSIHFIQMSLYLLRMETVCFHVEETSQWRPYISVWSLSQKHSGPGKQCRMVGEG--F--SFVDWVNNSTFLDENEGSCT 475 (1471)
Q Consensus 400 ~~~i~f~~~~~~L~~v~s~~~~~~~~~~~~P~v~vwsl~~~~~~~~~~~k~l~~g--~--~~~~w~~~~~~~~~~dG~~i 475 (1471)
...+.|++.++.++.... ++ ..+.+|++....+............ + ....|..+. .+.
T Consensus 245 v~~~~f~p~~p~ll~gG~--y~--------GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~------~~~-- 306 (555)
T KOG1587|consen 245 VTCLKFCPFDPNLLAGGC--YN--------GQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNE------HNT-- 306 (555)
T ss_pred eeEEEeccCCcceEEeec--cC--------ceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccC------CCC--
Confidence 456778888877765432 22 2378898655443111100011111 1 125675532 111
Q ss_pred ceeecccccCccccccccCCCCCC--Ccccccc------ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEeccccc
Q 000473 476 GKSDLTFCQDTVPRSEHVDSRQAG--DGRDDFV------HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFER 547 (1471)
Q Consensus 476 ~~l~~s~~~~~v~~Wd~~~~~~~g--~~~~~~~------~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~ 547 (1471)
.+...+.++.+..|++..-..+. .+..... .....++++.+.+ ..|+.++.|+++|.|....+.-+..
T Consensus 307 -~f~s~ssDG~i~~W~~~~l~~P~e~~~~~~~~~~~~~~~~~~~~t~~~F~~---~~p~~FiVGTe~G~v~~~~r~g~~~ 382 (555)
T KOG1587|consen 307 -EFFSLSSDGSICSWDTDMLSLPVEGLLLESKKHKGQQSSKAVGATSLKFEP---TDPNHFIVGTEEGKVYKGCRKGYTP 382 (555)
T ss_pred -ceEEEecCCcEeeeeccccccchhhcccccccccccccccccceeeEeecc---CCCceEEEEcCCcEEEEEeccCCcc
Confidence 35666779999999876543322 1111111 1223466666444 3456899999999999844331111
Q ss_pred CCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC-CCceEEEEeccCCCEEE
Q 000473 548 HNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG-SGNLITVMHHHVAPVRQ 626 (1471)
Q Consensus 548 ~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~-tg~~l~~~~~H~~~V~~ 626 (1471)
+ ....-+.+..+..|.+.|+++.++|- ....++|++ |-+|++|... ...++..+..+...|++
T Consensus 383 ~-------~~~~~~~~~~~~~h~g~v~~v~~nPF--------~~k~fls~g-DW~vriWs~~~~~~Pl~~~~~~~~~v~~ 446 (555)
T KOG1587|consen 383 A-------PEVSYKGHSTFITHIGPVYAVSRNPF--------YPKNFLSVG-DWTVRIWSEDVIASPLLSLDSSPDYVTD 446 (555)
T ss_pred c-------ccccccccccccccCcceEeeecCCC--------ccceeeeec-cceeEeccccCCCCcchhhhhccceeee
Confidence 0 00111335567789999999999996 255666666 9999999988 66788888889999999
Q ss_pred EEECCCCCCCCCCCEEEEEeCCCcEEEEECCC--CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 627 IILSPPQTEHPWSDCFLSVGEDFSVALASLET--LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 627 l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t--~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
++|+|-+ ..+|+++..||.+-+||+.. .+++....-+....+.+.|++.|+.|++|... |++++|
T Consensus 447 vaWSptr-----pavF~~~d~~G~l~iWDLl~~~~~Pv~s~~~~~~~l~~~~~s~~g~~lavGd~~--------G~~~~~ 513 (555)
T KOG1587|consen 447 VAWSPTR-----PAVFATVDGDGNLDIWDLLQDDEEPVLSQKVCSPALTRVRWSPNGKLLAVGDAN--------GTTHIL 513 (555)
T ss_pred eEEcCcC-----ceEEEEEcCCCceehhhhhccccCCcccccccccccceeecCCCCcEEEEecCC--------CcEEEE
Confidence 9999985 45999999999999999964 44555555556667889999999999999988 999999
Q ss_pred ECCC
Q 000473 705 DVKT 708 (1471)
Q Consensus 705 Di~t 708 (1471)
++..
T Consensus 514 ~l~~ 517 (555)
T KOG1587|consen 514 KLSE 517 (555)
T ss_pred EcCc
Confidence 9853
No 186
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=98.96 E-value=1.7e-08 Score=120.00 Aligned_cols=167 Identities=20% Similarity=0.317 Sum_probs=120.8
Q ss_pred eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCE
Q 000473 562 SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 562 ~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~ 641 (1471)
+...+.||++.|.|+...|. +.+|++|+.||+|++|.+.||.|+.++.- .+.|.+|+|+|.. ..+
T Consensus 392 ~~lvyrGHtg~Vr~iSvdp~---------G~wlasGsdDGtvriWEi~TgRcvr~~~~-d~~I~~vaw~P~~-----~~~ 456 (733)
T KOG0650|consen 392 CALVYRGHTGLVRSISVDPS---------GEWLASGSDDGTVRIWEIATGRCVRTVQF-DSEIRSVAWNPLS-----DLC 456 (733)
T ss_pred eeeeEeccCCeEEEEEecCC---------cceeeecCCCCcEEEEEeecceEEEEEee-cceeEEEEecCCC-----Cce
Confidence 44567999999999999985 99999999999999999999999988754 4589999999984 335
Q ss_pred EEEEeCCCcEEEEECCCC-------------------------------------cEEEEecCCCCCcEEEEEcCCCCEE
Q 000473 642 FLSVGEDFSVALASLETL-------------------------------------RVERMFPGHPNYPAKVVWDCPRGYI 684 (1471)
Q Consensus 642 l~S~s~DgsV~lWdl~t~-------------------------------------~~l~~~~gh~~~V~~v~~spdg~~L 684 (1471)
++.++.+..+.|.+..-| +-++....|...|..|.|+..|+||
T Consensus 457 vLAvA~~~~~~ivnp~~G~~~e~~~t~ell~~~~~~~~p~~~~~~W~~~~~~e~~~~v~~~I~~~k~i~~vtWHrkGDYl 536 (733)
T KOG0650|consen 457 VLAVAVGECVLIVNPIFGDRLEVGPTKELLASAPNESEPDAAVVTWSRASLDELEKGVCIVIKHPKSIRQVTWHRKGDYL 536 (733)
T ss_pred eEEEEecCceEEeCccccchhhhcchhhhhhcCCCccCCcccceeechhhhhhhccceEEEEecCCccceeeeecCCceE
Confidence 555555555555443211 1123344577889999999999999
Q ss_pred EEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 685 ACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 685 ~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
++.+.+ +++..|.|.++..+....-+.--.+.+..+.|.+. .+.+-++.-..+|+++
T Consensus 537 atV~~~-----~~~~~VliHQLSK~~sQ~PF~kskG~vq~v~FHPs------------------~p~lfVaTq~~vRiYd 593 (733)
T KOG0650|consen 537 ATVMPD-----SGNKSVLIHQLSKRKSQSPFRKSKGLVQRVKFHPS------------------KPYLFVATQRSVRIYD 593 (733)
T ss_pred EEeccC-----CCcceEEEEecccccccCchhhcCCceeEEEecCC------------------CceEEEEeccceEEEe
Confidence 998887 33478999998766554445445556677767742 1222333456788888
Q ss_pred cc
Q 000473 765 IQ 766 (1471)
Q Consensus 765 l~ 766 (1471)
|.
T Consensus 594 L~ 595 (733)
T KOG0650|consen 594 LS 595 (733)
T ss_pred hh
Confidence 73
No 187
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.95 E-value=2.3e-10 Score=139.17 Aligned_cols=120 Identities=18% Similarity=0.347 Sum_probs=110.7
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
+.++.|.||.++|+|..|... +.++++|+.|..|+||...++.++....||.+.|+.++.+.+ ..
T Consensus 181 k~ikrLlgH~naVyca~fDrt---------g~~Iitgsdd~lvKiwS~et~~~lAs~rGhs~ditdlavs~~------n~ 245 (1113)
T KOG0644|consen 181 KNIKRLLGHRNAVYCAIFDRT---------GRYIITGSDDRLVKIWSMETARCLASCRGHSGDITDLAVSSN------NT 245 (1113)
T ss_pred HHHHHHHhhhhheeeeeeccc---------cceEeecCccceeeeeeccchhhhccCCCCccccchhccchh------hh
Confidence 556678899999999999875 899999999999999999999999999999999999999987 67
Q ss_pred EEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 641 CFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.+++++.|..|++|-+.++.++..+.||.+.|++++|+|-. +.+.| |++++||.+
T Consensus 246 ~iaaaS~D~vIrvWrl~~~~pvsvLrghtgavtaiafsP~~----sss~d--------gt~~~wd~r 300 (1113)
T KOG0644|consen 246 MIAAASNDKVIRVWRLPDGAPVSVLRGHTGAVTAIAFSPRA----SSSDD--------GTCRIWDAR 300 (1113)
T ss_pred hhhhcccCceEEEEecCCCchHHHHhccccceeeeccCccc----cCCCC--------CceEecccc
Confidence 89999999999999999999999999999999999999965 44555 999999987
No 188
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=98.95 E-value=1.7e-09 Score=126.85 Aligned_cols=125 Identities=22% Similarity=0.297 Sum_probs=109.9
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe-ccCCCEEEEEECCCCCCCCCCCE
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH-HHVAPVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~-~H~~~V~~l~fspd~~~~~~~~~ 641 (1471)
.+.|.||++.|+||.|+.+ |.+|+|||.|-.+.|||....++++.+. +|++-|.++.|-|.. ....
T Consensus 43 E~eL~GH~GCVN~LeWn~d---------G~lL~SGSDD~r~ivWd~~~~KllhsI~TgHtaNIFsvKFvP~t----nnri 109 (758)
T KOG1310|consen 43 EAELTGHTGCVNCLEWNAD---------GELLASGSDDTRLIVWDPFEYKLLHSISTGHTANIFSVKFVPYT----NNRI 109 (758)
T ss_pred hhhhccccceecceeecCC---------CCEEeecCCcceEEeecchhcceeeeeecccccceeEEeeeccC----CCeE
Confidence 4678899999999999987 9999999999999999999888888774 799999999999973 1458
Q ss_pred EEEEeCCCcEEEEECCC----------CcEEEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 642 FLSVGEDFSVALASLET----------LRVERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 642 l~S~s~DgsV~lWdl~t----------~~~l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
++||+.|..|+|+|+.. ....+.+..|...|..++--|++ ..+.++++| |+|+-+|++.
T Consensus 110 v~sgAgDk~i~lfdl~~~~~~~~d~~~~~~~~~~~cht~rVKria~~p~~PhtfwsasED--------GtirQyDiRE 179 (758)
T KOG1310|consen 110 VLSGAGDKLIKLFDLDSSKEGGMDHGMEETTRCWSCHTDRVKRIATAPNGPHTFWSASED--------GTIRQYDIRE 179 (758)
T ss_pred EEeccCcceEEEEecccccccccccCccchhhhhhhhhhhhhheecCCCCCceEEEecCC--------cceeeecccC
Confidence 99999999999999984 23456677899999999999999 677888888 9999999986
No 189
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.93 E-value=3.9e-09 Score=121.85 Aligned_cols=190 Identities=21% Similarity=0.215 Sum_probs=137.0
Q ss_pred EEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccccccccc
Q 000473 20 TATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMGKSS 99 (1471)
Q Consensus 20 tava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~~~s 99 (1471)
-+++|+.||..|+||+.||.+++|+... .........|.+.|.+|. |+
T Consensus 148 k~vaf~~~gs~latgg~dg~lRv~~~Ps-----~~t~l~e~~~~~eV~DL~---------------------------FS 195 (398)
T KOG0771|consen 148 KVVAFNGDGSKLATGGTDGTLRVWEWPS-----MLTILEEIAHHAEVKDLD---------------------------FS 195 (398)
T ss_pred eEEEEcCCCCEeeeccccceEEEEecCc-----chhhhhhHhhcCccccce---------------------------eC
Confidence 5799999999999999999999999753 445555668999999998 78
Q ss_pred CCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCC-----eEEEEcc-------eecccCCc-----
Q 000473 100 LDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNP-----RYVCIGC-------CFIDTNQL----- 162 (1471)
Q Consensus 100 ~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~-----~ll~~G~-------~~id~~~~----- 162 (1471)
||+++|+|-+.| ..+||++.+|.++.+..-... ..--.-|+|..++ ++++.-. +.+..|.-
T Consensus 196 ~dgk~lasig~d-~~~VW~~~~g~~~a~~t~~~k-~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~~~l~ 273 (398)
T KOG0771|consen 196 PDGKFLASIGAD-SARVWSVNTGAALARKTPFSK-DEMFSSCRFSVDNAQETLRLAASQFPGGGVRLCDISLWSGSNFLR 273 (398)
T ss_pred CCCcEEEEecCC-ceEEEEeccCchhhhcCCccc-chhhhhceecccCCCceEEEEEecCCCCceeEEEeeeeccccccc
Confidence 999999999999 899999999988876431100 1112233444333 3333211 11111111
Q ss_pred --ccccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCC
Q 000473 163 --SDHHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVG 240 (1471)
Q Consensus 163 --~~~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG 240 (1471)
....-|++|-...++.|+++...+.+++.|.|++..+++.++-+...|.. .|+.+.|+|+ .+ .+...+.+.
T Consensus 274 ~~~~~~~~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~vk~aH~~--~VT~ltF~Pd---sr--~~~svSs~~ 346 (398)
T KOG0771|consen 274 LRKKIKRFKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYVKEAHLG--FVTGLTFSPD---SR--YLASVSSDN 346 (398)
T ss_pred hhhhhhccCcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEeehhhhee--eeeeEEEcCC---cC--cccccccCC
Confidence 00122478888889999999999999999999999999999988865554 4999999954 33 344457788
Q ss_pred cEEEEECCCC
Q 000473 241 RLQLVPISKE 250 (1471)
Q Consensus 241 ~V~vW~l~~~ 250 (1471)
+..|..+.-+
T Consensus 347 ~~~v~~l~vd 356 (398)
T KOG0771|consen 347 EAAVTKLAVD 356 (398)
T ss_pred ceeEEEEeec
Confidence 8888887754
No 190
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.92 E-value=2e-08 Score=118.35 Aligned_cols=152 Identities=20% Similarity=0.201 Sum_probs=116.1
Q ss_pred ccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccC-CCC------CCccc------cCCcceEEEEecCCccEEEE
Q 000473 510 KIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERH-NSP------GASLK------VNSHVSRQYFLGHTGAVLCL 576 (1471)
Q Consensus 510 ~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~-d~~------~~~~d------~~s~~~~~~l~gH~~~V~~l 576 (1471)
..|+|+...+... ..++.+..+|...+++-....+. ..+ +..+. ..+..++..+.--.+.|+..
T Consensus 220 tsvT~ikWvpg~~---~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w~~~~g~in~f 296 (636)
T KOG2394|consen 220 SSVTCIKWVPGSD---SLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARWHIGEGSINEF 296 (636)
T ss_pred cceEEEEEEeCCC---ceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCccceeEeccccccce
Confidence 5678877554332 37788889999998543211111 000 00000 11112333333335578889
Q ss_pred EEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEEC
Q 000473 577 AAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASL 656 (1471)
Q Consensus 577 a~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl 656 (1471)
+|+|| +++|++-|.||.+||+|..+.+++..|+..-+...|++|+|| |++|++|++|--|.||.+
T Consensus 297 ~FS~D---------G~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPD------GKyIvtGGEDDLVtVwSf 361 (636)
T KOG2394|consen 297 AFSPD---------GKYLATVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPD------GKYIVTGGEDDLVTVWSF 361 (636)
T ss_pred eEcCC---------CceEEEEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCC------ccEEEecCCcceEEEEEe
Confidence 99997 999999999999999999999999999888899999999999 999999999999999999
Q ss_pred CCCcEEEEecCCCCCcEEEEEcC
Q 000473 657 ETLRVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 657 ~t~~~l~~~~gh~~~V~~v~~sp 679 (1471)
..++.+..-+||.++|..|+|+|
T Consensus 362 ~erRVVARGqGHkSWVs~VaFDp 384 (636)
T KOG2394|consen 362 EERRVVARGQGHKSWVSVVAFDP 384 (636)
T ss_pred ccceEEEeccccccceeeEeecc
Confidence 99999999999999999999984
No 191
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.91 E-value=5.4e-08 Score=107.17 Aligned_cols=124 Identities=18% Similarity=0.249 Sum_probs=95.3
Q ss_pred CCccEEEEEEecCCCCcccCcCCCEE--EEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEVL--VSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~L--~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
+.+.|.+++|+|+ ++.+ +.|..+..|.+||++ ++.+..+. ...+..+.|+|+ |++++.++
T Consensus 58 ~~~~I~~~~WsP~---------g~~favi~g~~~~~v~lyd~~-~~~i~~~~--~~~~n~i~wsP~------G~~l~~~g 119 (194)
T PF08662_consen 58 KEGPIHDVAWSPN---------GNEFAVIYGSMPAKVTLYDVK-GKKIFSFG--TQPRNTISWSPD------GRFLVLAG 119 (194)
T ss_pred CCCceEEEEECcC---------CCEEEEEEccCCcccEEEcCc-ccEeEeec--CCCceEEEECCC------CCEEEEEE
Confidence 3457999999997 5543 456678899999996 66666664 567889999999 99999987
Q ss_pred CC---CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 647 ED---FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 647 ~D---gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
.+ |.|.+||.++.+.+..+. |. .++.++|+|+|++++++...-. -..|..++||+. +|+++...
T Consensus 120 ~~n~~G~l~~wd~~~~~~i~~~~-~~-~~t~~~WsPdGr~~~ta~t~~r--~~~dng~~Iw~~-~G~~l~~~ 186 (194)
T PF08662_consen 120 FGNLNGDLEFWDVRKKKKISTFE-HS-DATDVEWSPDGRYLATATTSPR--LRVDNGFKIWSF-QGRLLYKK 186 (194)
T ss_pred ccCCCcEEEEEECCCCEEeeccc-cC-cEEEEEEcCCCCEEEEEEeccc--eeccccEEEEEe-cCeEeEec
Confidence 54 669999999988887764 33 4789999999999999875211 111278999998 57766543
No 192
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=98.87 E-value=2.1e-08 Score=126.38 Aligned_cols=143 Identities=22% Similarity=0.255 Sum_probs=122.3
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
++++|+.-|.|.+|.|. .+.+++ .+.||.+.+..+.++.| +.+++|.|+|.++|+|
T Consensus 147 ~i~~gsv~~~iivW~~~--------------~dn~p~-~l~GHeG~iF~i~~s~d---------g~~i~s~SdDRsiRlW 202 (967)
T KOG0974|consen 147 YIASGSVFGEIIVWKPH--------------EDNKPI-RLKGHEGSIFSIVTSLD---------GRYIASVSDDRSIRLW 202 (967)
T ss_pred EEEeccccccEEEEecc--------------ccCCcc-eecccCCceEEEEEccC---------CcEEEEEecCcceeee
Confidence 78999999999996654 112233 68999999999999987 8999999999999999
Q ss_pred ECCCCceEE-EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC-CcEEEEEcCCCCEE
Q 000473 607 DLGSGNLIT-VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN-YPAKVVWDCPRGYI 684 (1471)
Q Consensus 607 Dl~tg~~l~-~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~-~V~~v~~spdg~~L 684 (1471)
++++.+... +-.+|...|+++.|.|. .++|++.|-+.++|+.+ ++.+..+.+|.. .++.++..++...+
T Consensus 203 ~i~s~~~~~~~~fgHsaRvw~~~~~~n--------~i~t~gedctcrvW~~~-~~~l~~y~~h~g~~iw~~~~~~~~~~~ 273 (967)
T KOG0974|consen 203 PIDSREVLGCTGFGHSARVWACCFLPN--------RIITVGEDCTCRVWGVN-GTQLEVYDEHSGKGIWKIAVPIGVIIK 273 (967)
T ss_pred ecccccccCcccccccceeEEEEeccc--------eeEEeccceEEEEEecc-cceehhhhhhhhcceeEEEEcCCceEE
Confidence 999988765 66799999999999986 89999999999999765 555668888865 68999999999999
Q ss_pred EEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 685 ACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 685 ~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
+|++.| +.+++||..+.-
T Consensus 274 vT~g~D--------s~lk~~~l~~r~ 291 (967)
T KOG0974|consen 274 VTGGND--------STLKLWDLNGRG 291 (967)
T ss_pred EeeccC--------cchhhhhhhccc
Confidence 999998 999999987643
No 193
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=98.86 E-value=7.6e-08 Score=111.89 Aligned_cols=175 Identities=10% Similarity=0.109 Sum_probs=140.3
Q ss_pred ccEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCccc
Q 000473 510 KIVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKG 587 (1471)
Q Consensus 510 ~~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~ 587 (1471)
..|+++. |+|. .+++++.||.++++..+ -+....++.+.--.-++.+..|+|+
T Consensus 214 ~~I~sv~------FHp~~plllvaG~d~~lrifqvD-------------Gk~N~~lqS~~l~~fPi~~a~f~p~------ 268 (514)
T KOG2055|consen 214 GGITSVQ------FHPTAPLLLVAGLDGTLRIFQVD-------------GKVNPKLQSIHLEKFPIQKAEFAPN------ 268 (514)
T ss_pred CCceEEE------ecCCCceEEEecCCCcEEEEEec-------------CccChhheeeeeccCccceeeecCC------
Confidence 5688876 7776 78899999999996554 2223445555555678999999997
Q ss_pred CcCCC-EEEEEECCCcEEEEECCCCceE--EEEeccC-CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE
Q 000473 588 WSFNE-VLVSGSMDCSIRIWDLGSGNLI--TVMHHHV-APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER 663 (1471)
Q Consensus 588 ~~~~~-~L~SGs~DgtI~lWDl~tg~~l--~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~ 663 (1471)
|+ .+++++.-.....||+.+.+.. ..+.++. ..+....++|+ +++|+..|..|.|.|....+++.+.
T Consensus 269 ---G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVShd------~~fia~~G~~G~I~lLhakT~eli~ 339 (514)
T KOG2055|consen 269 ---GHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSHD------SNFIAIAGNNGHIHLLHAKTKELIT 339 (514)
T ss_pred ---CceEEEecccceEEEEeeccccccccccCCCCcccchhheeEecCC------CCeEEEcccCceEEeehhhhhhhhh
Confidence 55 9999999999999999987743 3444443 45778889999 8999999999999999999999988
Q ss_pred EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 664 MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 664 ~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
.+. -.+.|..++|+.|++.|++.|.+ |.|++||+++..+++.+.... .+-...+|
T Consensus 340 s~K-ieG~v~~~~fsSdsk~l~~~~~~--------GeV~v~nl~~~~~~~rf~D~G-~v~gts~~ 394 (514)
T KOG2055|consen 340 SFK-IEGVVSDFTFSSDSKELLASGGT--------GEVYVWNLRQNSCLHRFVDDG-SVHGTSLC 394 (514)
T ss_pred eee-eccEEeeEEEecCCcEEEEEcCC--------ceEEEEecCCcceEEEEeecC-ccceeeee
Confidence 876 45669999999999999998888 999999999999999887654 34444466
No 194
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.85 E-value=2.6e-08 Score=115.26 Aligned_cols=169 Identities=14% Similarity=0.098 Sum_probs=124.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++|..||.+|||.|..+ ..+.....|.+.|.+|.|+|| +++|++-+.| ..+||
T Consensus 158 ~latgg~dg~lRv~~~Ps~---------------~t~l~e~~~~~eV~DL~FS~d---------gk~lasig~d-~~~VW 212 (398)
T KOG0771|consen 158 KLATGGTDGTLRVWEWPSM---------------LTILEEIAHHAEVKDLDFSPD---------GKFLASIGAD-SARVW 212 (398)
T ss_pred EeeeccccceEEEEecCcc---------------hhhhhhHhhcCccccceeCCC---------CcEEEEecCC-ceEEE
Confidence 8999999999999887622 334455679999999999998 8999999999 99999
Q ss_pred ECCCCceEEEEec--cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc------EEEEecCCCCCcEEEEEc
Q 000473 607 DLGSGNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR------VERMFPGHPNYPAKVVWD 678 (1471)
Q Consensus 607 Dl~tg~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~------~l~~~~gh~~~V~~v~~s 678 (1471)
|.++|..+..... .......+.|+-+..- +.-.+++....-+.|++|++...+ ..+....+ ..|.+++.+
T Consensus 213 ~~~~g~~~a~~t~~~k~~~~~~cRF~~d~~~-~~l~laa~~~~~~~v~~~~~~~w~~~~~l~~~~~~~~~-~siSsl~VS 290 (398)
T KOG0771|consen 213 SVNTGAALARKTPFSKDEMFSSCRFSVDNAQ-ETLRLAASQFPGGGVRLCDISLWSGSNFLRLRKKIKRF-KSISSLAVS 290 (398)
T ss_pred EeccCchhhhcCCcccchhhhhceecccCCC-ceEEEEEecCCCCceeEEEeeeeccccccchhhhhhcc-CcceeEEEc
Confidence 9999976655542 2234556677766211 111234444455777777764322 11122223 359999999
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE-eCCCCCceeeeeeec
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL-RGTASHSMFDHFCKG 730 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l-~gH~~~v~~~~~~~~ 730 (1471)
++|++++.|+.| |.|-|++..+-++++.. +.|..-|+.+.|+|+
T Consensus 291 ~dGkf~AlGT~d--------GsVai~~~~~lq~~~~vk~aH~~~VT~ltF~Pd 335 (398)
T KOG0771|consen 291 DDGKFLALGTMD--------GSVAIYDAKSLQRLQYVKEAHLGFVTGLTFSPD 335 (398)
T ss_pred CCCcEEEEeccC--------CcEEEEEeceeeeeEeehhhheeeeeeEEEcCC
Confidence 999999999999 99999999988877654 478888899999963
No 195
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=98.84 E-value=6.3e-09 Score=122.12 Aligned_cols=166 Identities=17% Similarity=0.166 Sum_probs=129.2
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE-EecCCccEEEEEEecC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY-FLGHTGAVLCLAAHRM 581 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~-l~gH~~~V~~la~spd 581 (1471)
+.+.+|.+-|+|+.+..+.. .|++|+.|-.+.| |+. -..+++.. -.||++.|.|+.|-|.
T Consensus 44 ~eL~GH~GCVN~LeWn~dG~----lL~SGSDD~r~iv--Wd~-------------~~~KllhsI~TgHtaNIFsvKFvP~ 104 (758)
T KOG1310|consen 44 AELTGHTGCVNCLEWNADGE----LLASGSDDTRLIV--WDP-------------FEYKLLHSISTGHTANIFSVKFVPY 104 (758)
T ss_pred hhhccccceecceeecCCCC----EEeecCCcceEEe--ecc-------------hhcceeeeeecccccceeEEeeecc
Confidence 45789999999999887777 7999999999999 552 22344444 3799999999999986
Q ss_pred CCCcccCcCCCEEEEEECCCcEEEEECCC----------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcE
Q 000473 582 VGTAKGWSFNEVLVSGSMDCSIRIWDLGS----------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSV 651 (1471)
Q Consensus 582 ~~~~~~~~~~~~L~SGs~DgtI~lWDl~t----------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV 651 (1471)
. +++.++||..|..|+++|+.. .+..+.+..|...|..++..|+. .+.|.++++||++
T Consensus 105 t-------nnriv~sgAgDk~i~lfdl~~~~~~~~d~~~~~~~~~~~cht~rVKria~~p~~-----PhtfwsasEDGti 172 (758)
T KOG1310|consen 105 T-------NNRIVLSGAGDKLIKLFDLDSSKEGGMDHGMEETTRCWSCHTDRVKRIATAPNG-----PHTFWSASEDGTI 172 (758)
T ss_pred C-------CCeEEEeccCcceEEEEecccccccccccCccchhhhhhhhhhhhhheecCCCC-----CceEEEecCCcce
Confidence 2 478999999999999999984 23456677899999999999983 4899999999999
Q ss_pred EEEECCCCc-------EEE---EecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 652 ALASLETLR-------VER---MFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 652 ~lWdl~t~~-------~l~---~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
+-+|++... +.. .+....-...++..+|... +|++|+.| --.++||.+
T Consensus 173 rQyDiREph~c~p~~~~~~~l~ny~~~lielk~ltisp~rp~~laVGgsd--------pfarLYD~R 231 (758)
T KOG1310|consen 173 RQYDIREPHVCNPDEDCPSILVNYNPQLIELKCLTISPSRPYYLAVGGSD--------PFARLYDRR 231 (758)
T ss_pred eeecccCCccCCccccccHHHHHhchhhheeeeeeecCCCCceEEecCCC--------chhhhhhhh
Confidence 999998631 111 1111222457899999765 77888887 889999963
No 196
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.74 E-value=3.6e-07 Score=104.22 Aligned_cols=145 Identities=12% Similarity=0.150 Sum_probs=111.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEe---cCCccEEEEEEecCCCCcccCcCCCEEE-EEE-CCC
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFL---GHTGAVLCLAAHRMVGTAKGWSFNEVLV-SGS-MDC 601 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~---gH~~~V~~la~spd~~~~~~~~~~~~L~-SGs-~Dg 601 (1471)
+++..-++ .|.|++.. +.+.++++. .|...+.++.+++. +.+++ -++ .-|
T Consensus 99 RLvV~Lee-~IyIydI~---------------~MklLhTI~t~~~n~~gl~AlS~n~~---------n~ylAyp~s~t~G 153 (391)
T KOG2110|consen 99 RLVVCLEE-SIYIYDIK---------------DMKLLHTIETTPPNPKGLCALSPNNA---------NCYLAYPGSTTSG 153 (391)
T ss_pred eEEEEEcc-cEEEEecc---------------cceeehhhhccCCCccceEeeccCCC---------CceEEecCCCCCc
Confidence 45555444 48885443 334444442 34445666655553 34444 233 357
Q ss_pred cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcE-EEEECCCCcEEEEecCCCC--CcEEEEEc
Q 000473 602 SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSV-ALASLETLRVERMFPGHPN--YPAKVVWD 678 (1471)
Q Consensus 602 tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV-~lWdl~t~~~l~~~~gh~~--~V~~v~~s 678 (1471)
.|.+||+.+-+....+..|.+++-+++|+|+ |..+||+|+.|+| ||+++.+|+.+.+|+.... .|.+++|+
T Consensus 154 dV~l~d~~nl~~v~~I~aH~~~lAalafs~~------G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs 227 (391)
T KOG2110|consen 154 DVVLFDTINLQPVNTINAHKGPLAALAFSPD------GTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFS 227 (391)
T ss_pred eEEEEEcccceeeeEEEecCCceeEEEECCC------CCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEEC
Confidence 8999999999999999999999999999999 9999999999985 9999999999999976544 46899999
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
|++.+|.+.+.. ++|+|+.++...
T Consensus 228 ~ds~~L~~sS~T--------eTVHiFKL~~~~ 251 (391)
T KOG2110|consen 228 PDSQFLAASSNT--------ETVHIFKLEKVS 251 (391)
T ss_pred CCCCeEEEecCC--------CeEEEEEecccc
Confidence 999999988887 999999987644
No 197
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.74 E-value=1.3e-08 Score=117.97 Aligned_cols=202 Identities=16% Similarity=0.218 Sum_probs=144.9
Q ss_pred cEEEEEeeccccccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccC
Q 000473 511 IVSSSMVISESFYAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGW 588 (1471)
Q Consensus 511 ~Vts~~~is~~~f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~ 588 (1471)
.|..+. |-|+ .|++++..|-+.. . |+.+|+.+..+.--.+.+..+.-.|.
T Consensus 211 ~v~rLe------FLPyHfLL~~~~~~G~L~Y---~------------DVS~GklVa~~~t~~G~~~vm~qNP~------- 262 (545)
T KOG1272|consen 211 RVARLE------FLPYHFLLVAASEAGFLKY---Q------------DVSTGKLVASIRTGAGRTDVMKQNPY------- 262 (545)
T ss_pred chhhhc------ccchhheeeecccCCceEE---E------------eechhhhhHHHHccCCccchhhcCCc-------
Confidence 455555 5554 7888888888774 2 56677777777777788888888886
Q ss_pred cCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC
Q 000473 589 SFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH 668 (1471)
Q Consensus 589 ~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh 668 (1471)
+..+-.|...|+|.+|.....+++..+..|.++|.+|++.|. |++++|.|.|+.|+|||++....++++..
T Consensus 263 --NaVih~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~------G~YMaTtG~Dr~~kIWDlR~~~ql~t~~t- 333 (545)
T KOG1272|consen 263 --NAVIHLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRG------GRYMATTGLDRKVKIWDLRNFYQLHTYRT- 333 (545)
T ss_pred --cceEEEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCC------CcEEeecccccceeEeeeccccccceeec-
Confidence 789999999999999999999999999999999999999999 99999999999999999998887777655
Q ss_pred CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE-CCC--CeEEEEEeCCC--CCceeeeeeeccccccccceEEcC
Q 000473 669 PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD-VKT--GARERVLRGTA--SHSMFDHFCKGISMNSISGSVLNG 743 (1471)
Q Consensus 669 ~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD-i~t--g~~~~~l~gH~--~~v~~~~~~~~~~~~~~sg~v~~g 743 (1471)
..+...+++|-.| .|+.+-. ..|.||. .-. +.....+.-|. +.|..+.|||.-+.- |.=.
T Consensus 334 p~~a~~ls~Sqkg-lLA~~~G---------~~v~iw~d~~~~s~~~~~pYm~H~~~~~V~~l~FcP~EDvL---GIGH-- 398 (545)
T KOG1272|consen 334 PHPASNLSLSQKG-LLALSYG---------DHVQIWKDALKGSGHGETPYMNHRCGGPVEDLRFCPYEDVL---GIGH-- 398 (545)
T ss_pred CCCcccccccccc-ceeeecC---------CeeeeehhhhcCCCCCCcchhhhccCcccccceeccHHHee---eccc--
Confidence 5567888898544 5555444 3799994 322 23333333443 366677899732211 1111
Q ss_pred CccccccceeeccCCceEeec
Q 000473 744 NTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 744 ~~~~s~~l~~~~~D~tir~w~ 764 (1471)
....++.++|-+-|-.+-.|.
T Consensus 399 ~~G~tsilVPGsGePN~Ds~e 419 (545)
T KOG1272|consen 399 AGGITSILVPGSGEPNYDSLE 419 (545)
T ss_pred cCCceeEeccCCCCCCcchhc
Confidence 122245556655555444443
No 198
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=98.73 E-value=6.6e-07 Score=114.12 Aligned_cols=237 Identities=15% Similarity=0.096 Sum_probs=157.7
Q ss_pred CCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEE
Q 000473 497 QAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCL 576 (1471)
Q Consensus 497 ~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~l 576 (1471)
+.|.++..+..|...|..++..++.. ..+++|+.||+|++|+...+.+. ..+.++..++.--..++.++
T Consensus 1036 p~G~lVAhL~Ehs~~v~k~a~s~~~~---s~FvsgS~DGtVKvW~~~k~~~~--------~~s~rS~ltys~~~sr~~~v 1104 (1431)
T KOG1240|consen 1036 PRGILVAHLHEHSSAVIKLAVSSEHT---SLFVSGSDDGTVKVWNLRKLEGE--------GGSARSELTYSPEGSRVEKV 1104 (1431)
T ss_pred ccceEeehhhhccccccceeecCCCC---ceEEEecCCceEEEeeehhhhcC--------cceeeeeEEEeccCCceEEE
Confidence 44667777888999888887666652 27999999999999655543331 12234445555456778888
Q ss_pred EEecCCCCcccCcCCCEEEEEECCCcEEEEECCC--Cc-----------------eE--EE-------------------
Q 000473 577 AAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS--GN-----------------LI--TV------------------- 616 (1471)
Q Consensus 577 a~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t--g~-----------------~l--~~------------------- 616 (1471)
...+. ++.++.|+.||.|++.+++. .+ .+ +-
T Consensus 1105 t~~~~---------~~~~Av~t~DG~v~~~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~i 1175 (1431)
T KOG1240|consen 1105 TMCGN---------GDQFAVSTKDGSVRVLRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRI 1175 (1431)
T ss_pred EeccC---------CCeEEEEcCCCeEEEEEccccccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccce
Confidence 88775 78888889999999998864 10 00 00
Q ss_pred ---------------EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEec-CCCCCcEEEEEcCC
Q 000473 617 ---------------MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFP-GHPNYPAKVVWDCP 680 (1471)
Q Consensus 617 ---------------~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~-gh~~~V~~v~~spd 680 (1471)
..-..|.|++++.+|. ++.++.|..-|.+.+||++-+.++.... .+..+++.|..+|.
T Consensus 1176 v~~D~r~~~~~w~lk~~~~hG~vTSi~idp~------~~WlviGts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~~ 1249 (1431)
T KOG1240|consen 1176 VSWDTRMRHDAWRLKNQLRHGLVTSIVIDPW------CNWLVIGTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWLCPT 1249 (1431)
T ss_pred EEecchhhhhHHhhhcCccccceeEEEecCC------ceEEEEecCCceEEEEEeecCceeecccCcccCCcceEEeecc
Confidence 0112356778888777 8899999999999999999887776543 34467888877664
Q ss_pred C---CEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeecc---ccccccceEEcCCccccccceee
Q 000473 681 R---GYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGI---SMNSISGSVLNGNTSVSSLLLPI 754 (1471)
Q Consensus 681 g---~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~---~~~~~sg~v~~g~~~~s~~l~~~ 754 (1471)
. ...++++.. +.+.|.+|++++|.+-.++-.-.....+..+.|.. .+....| +.+|--....-+++.
T Consensus 1250 ~~~~S~~vs~~~~------~~nevs~wn~~~g~~~~vl~~s~~~p~ls~~~Ps~~~~kp~~~~~-~~~~~~~~~~~~ltg 1322 (1431)
T KOG1240|consen 1250 YPQESVSVSAGSS------SNNEVSTWNMETGLRQTVLWASDGAPILSYALPSNDARKPDSLAG-ISCGVCEKNGFLLTG 1322 (1431)
T ss_pred CCCCceEEEeccc------CCCceeeeecccCcceEEEEcCCCCcchhhhcccccCCCCCcccc-eeeecccCCceeeec
Confidence 3 466665552 23789999999998887776554444444444321 0111111 123334445556666
Q ss_pred ccCCceEeeccc
Q 000473 755 HEDGTFRQSQIQ 766 (1471)
Q Consensus 755 ~~D~tir~w~l~ 766 (1471)
..|..||.|+..
T Consensus 1323 gsd~kIR~wD~~ 1334 (1431)
T KOG1240|consen 1323 GSDMKIRKWDPT 1334 (1431)
T ss_pred CCccceeeccCC
Confidence 679999999963
No 199
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=98.71 E-value=6.1e-08 Score=115.70 Aligned_cols=146 Identities=18% Similarity=0.215 Sum_probs=112.4
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTE 635 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~ 635 (1471)
|...|...++++||.+.|+|++|+.| |+.++||+.|..|.+|+-.-... ..+ .|+..|.|+.|+|-
T Consensus 39 D~ndG~llqtLKgHKDtVycVAys~d---------GkrFASG~aDK~VI~W~~klEG~-LkY-SH~D~IQCMsFNP~--- 104 (1081)
T KOG1538|consen 39 DTSDGTLLQPLKGHKDTVYCVAYAKD---------GKRFASGSADKSVIIWTSKLEGI-LKY-SHNDAIQCMSFNPI--- 104 (1081)
T ss_pred eCCCcccccccccccceEEEEEEccC---------CceeccCCCceeEEEecccccce-eee-ccCCeeeEeecCch---
Confidence 35567889999999999999999997 99999999999999998643222 222 59999999999998
Q ss_pred CCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 636 HPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
.+.++|++-. ...+|+........ . .....+.+.+|..||.|++.|-.| |+|.+=+- +|+.--.+
T Consensus 105 ---~h~LasCsLs-dFglWS~~qK~V~K-~-kss~R~~~CsWtnDGqylalG~~n--------GTIsiRNk-~gEek~~I 169 (1081)
T KOG1538|consen 105 ---THQLASCSLS-DFGLWSPEQKSVSK-H-KSSSRIICCSWTNDGQYLALGMFN--------GTISIRNK-NGEEKVKI 169 (1081)
T ss_pred ---HHHhhhcchh-hccccChhhhhHHh-h-hhheeEEEeeecCCCcEEEEeccC--------ceEEeecC-CCCcceEE
Confidence 7888888743 46789876533221 1 123468899999999999999998 99999864 55543222
Q ss_pred ---eCCCCCceeeeeeec
Q 000473 716 ---RGTASHSMFDHFCKG 730 (1471)
Q Consensus 716 ---~gH~~~v~~~~~~~~ 730 (1471)
-|..+.+..+.||+.
T Consensus 170 ~Rpgg~Nspiwsi~~~p~ 187 (1081)
T KOG1538|consen 170 ERPGGSNSPIWSICWNPS 187 (1081)
T ss_pred eCCCCCCCCceEEEecCC
Confidence 357788999988864
No 200
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.71 E-value=3e-07 Score=110.49 Aligned_cols=143 Identities=16% Similarity=0.198 Sum_probs=119.0
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEe--cCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFL--GHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIR 604 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~--gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~ 604 (1471)
.++.|...|.|.+ +. +..++....+. +|.+.|+++.++.+ -..|-|++.|+.+.
T Consensus 72 ~lvlgt~~g~v~~--ys-------------~~~g~it~~~st~~h~~~v~~~~~~~~---------~~ciyS~~ad~~v~ 127 (541)
T KOG4547|consen 72 MLVLGTPQGSVLL--YS-------------VAGGEITAKLSTDKHYGNVNEILDAQR---------LGCIYSVGADLKVV 127 (541)
T ss_pred EEEeecCCccEEE--EE-------------ecCCeEEEEEecCCCCCcceeeecccc---------cCceEecCCceeEE
Confidence 6888999999888 33 23445555554 69999999998875 67899999999999
Q ss_pred EEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCC----
Q 000473 605 IWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCP---- 680 (1471)
Q Consensus 605 lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spd---- 680 (1471)
.|+...++.++.+.+....+.++.++|| +..+++++ +.|++||+++++.+..|.||.++|.++.|--+
T Consensus 128 ~~~~~~~~~~~~~~~~~~~~~sl~is~D------~~~l~~as--~~ik~~~~~~kevv~~ftgh~s~v~t~~f~~~~~g~ 199 (541)
T KOG4547|consen 128 YILEKEKVIIRIWKEQKPLVSSLCISPD------GKILLTAS--RQIKVLDIETKEVVITFTGHGSPVRTLSFTTLIDGI 199 (541)
T ss_pred EEecccceeeeeeccCCCccceEEEcCC------CCEEEecc--ceEEEEEccCceEEEEecCCCcceEEEEEEEecccc
Confidence 9999999999999999999999999999 89999886 68999999999999999999999999999776
Q ss_pred -CCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 681 -RGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 681 -g~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
|.|++++..- +.-+.+|-++.
T Consensus 200 ~G~~vLssa~~-------~r~i~~w~v~~ 221 (541)
T KOG4547|consen 200 IGKYVLSSAAA-------ERGITVWVVEK 221 (541)
T ss_pred ccceeeecccc-------ccceeEEEEEc
Confidence 6676654322 14577776543
No 201
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.71 E-value=2.3e-07 Score=115.57 Aligned_cols=206 Identities=16% Similarity=0.119 Sum_probs=142.4
Q ss_pred eecccccCccccccccCCCC--CCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQ--AGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASL 555 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~--~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~ 555 (1471)
++....+|.|-+||+..... +.........|...++++..+.... +..+++++.||.|..|.-+++... ...-
T Consensus 258 l~gG~y~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~~--~~~f~s~ssDG~i~~W~~~~l~~P---~e~~ 332 (555)
T KOG1587|consen 258 LAGGCYNGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNEH--NTEFFSLSSDGSICSWDTDMLSLP---VEGL 332 (555)
T ss_pred EEeeccCceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccCC--CCceEEEecCCcEeeeeccccccc---hhhc
Confidence 56677889999999987655 3333344467888888887666443 235899999999999765544321 1000
Q ss_pred ccCCcce-EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE---EECCCCc-----eEEEEeccCCCEEE
Q 000473 556 KVNSHVS-RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI---WDLGSGN-----LITVMHHHVAPVRQ 626 (1471)
Q Consensus 556 d~~s~~~-~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l---WDl~tg~-----~l~~~~~H~~~V~~ 626 (1471)
..++... ...+ .-...+++++|.+. +...++.|+.+|.|.- ++...+. .+..+..|.++|.+
T Consensus 333 ~~~~~~~~~~~~-~~~~~~t~~~F~~~--------~p~~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~ 403 (555)
T KOG1587|consen 333 LLESKKHKGQQS-SKAVGATSLKFEPT--------DPNHFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYA 403 (555)
T ss_pred cccccccccccc-ccccceeeEeeccC--------CCceEEEEcCCcEEEEEeccCCcccccccccccccccccCcceEe
Confidence 0000000 0011 12346899999986 3788999999998876 4443332 23466778999999
Q ss_pred EEECCCCCCCCCCCEEEEEeCCCcEEEEECC-CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 627 IILSPPQTEHPWSDCFLSVGEDFSVALASLE-TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 627 l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~-t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+.++|=. -..|++++ |.+|+||+.. ...++..+..+...|++++|||...-++....+ +|.|.+||
T Consensus 404 v~~nPF~-----~k~fls~g-DW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~-------~G~l~iWD 470 (555)
T KOG1587|consen 404 VSRNPFY-----PKNFLSVG-DWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDG-------DGNLDIWD 470 (555)
T ss_pred eecCCCc-----cceeeeec-cceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcC-------CCceehhh
Confidence 9999972 33566666 9999999988 778888888899999999999998755544432 39999999
Q ss_pred CCCCe
Q 000473 706 VKTGA 710 (1471)
Q Consensus 706 i~tg~ 710 (1471)
+....
T Consensus 471 Ll~~~ 475 (555)
T KOG1587|consen 471 LLQDD 475 (555)
T ss_pred hhccc
Confidence 97543
No 202
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.68 E-value=4.9e-07 Score=111.97 Aligned_cols=164 Identities=9% Similarity=0.096 Sum_probs=113.7
Q ss_pred ccCC--EEEEEE-cCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEE-EEE
Q 000473 523 YAPY--AIVYGF-FSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLV-SGS 598 (1471)
Q Consensus 523 f~P~--~lv~Gs-~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~-SGs 598 (1471)
|+|+ .++++. .+|.+.||.++ ..++. ...+.+|...+....|+|| ++.|+ ++.
T Consensus 255 wSPDG~~La~~~~~~g~~~Iy~~d-------------~~~~~-~~~lt~~~~~~~~~~wSpD---------G~~i~f~s~ 311 (429)
T PRK01742 255 FSPDGSRLAFASSKDGVLNIYVMG-------------ANGGT-PSQLTSGAGNNTEPSWSPD---------GQSILFTSD 311 (429)
T ss_pred ECCCCCEEEEEEecCCcEEEEEEE-------------CCCCC-eEeeccCCCCcCCEEECCC---------CCEEEEEEC
Confidence 6665 676654 68888886665 22333 3456677778889999998 66555 555
Q ss_pred CCCcEEEEECCCC-ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 599 MDCSIRIWDLGSG-NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 599 ~DgtI~lWDl~tg-~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
.++...+|++... .....+ .+.+ ....|+|+ |+.++.++.|+ +.+||+.+++.......+ ....+.|
T Consensus 312 ~~g~~~I~~~~~~~~~~~~l-~~~~--~~~~~SpD------G~~ia~~~~~~-i~~~Dl~~g~~~~lt~~~--~~~~~~~ 379 (429)
T PRK01742 312 RSGSPQVYRMSASGGGASLV-GGRG--YSAQISAD------GKTLVMINGDN-VVKQDLTSGSTEVLSSTF--LDESPSI 379 (429)
T ss_pred CCCCceEEEEECCCCCeEEe-cCCC--CCccCCCC------CCEEEEEcCCC-EEEEECCCCCeEEecCCC--CCCCceE
Confidence 6888899887532 222333 4443 45678888 89998888776 455999988765443332 2456889
Q ss_pred cCCCCEEEEEEcCCCCCCCCCCEEEEEEC--CCCeEEEEEeCCCCCceeeeeee
Q 000473 678 DCPRGYIACLCRDHSRTSDAVDVLFIWDV--KTGARERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 678 spdg~~L~sgs~D~sg~~D~~gtV~VWDi--~tg~~~~~l~gH~~~v~~~~~~~ 729 (1471)
+|+|++|+.++.+ +.+++|++ .+|...+.+.+|.+.+....|+|
T Consensus 380 sPdG~~i~~~s~~--------g~~~~l~~~~~~G~~~~~l~~~~g~~~~p~wsp 425 (429)
T PRK01742 380 SPNGIMIIYSSTQ--------GLGKVLQLVSADGRFKARLPGSDGQVKFPAWSP 425 (429)
T ss_pred CCCCCEEEEEEcC--------CCceEEEEEECCCCceEEccCCCCCCCCcccCC
Confidence 9999999999887 66777765 36888999998887665555553
No 203
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=98.68 E-value=2.3e-07 Score=117.22 Aligned_cols=146 Identities=17% Similarity=0.172 Sum_probs=116.5
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE-EecCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER-MFPGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~-~~~gh~ 669 (1471)
.-++++|+.-+.|.+|+..--+.-..+.+|.+.|.++.++-+ |.+++|+|+|.++|+|++++.+... ..-||.
T Consensus 145 ~~~i~~gsv~~~iivW~~~~dn~p~~l~GHeG~iF~i~~s~d------g~~i~s~SdDRsiRlW~i~s~~~~~~~~fgHs 218 (967)
T KOG0974|consen 145 ELYIASGSVFGEIIVWKPHEDNKPIRLKGHEGSIFSIVTSLD------GRYIASVSDDRSIRLWPIDSREVLGCTGFGHS 218 (967)
T ss_pred EEEEEeccccccEEEEeccccCCcceecccCCceEEEEEccC------CcEEEEEecCcceeeeecccccccCccccccc
Confidence 568999999999999999733333468899999999999999 9999999999999999999988876 677999
Q ss_pred CCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccc
Q 000473 670 NYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSS 749 (1471)
Q Consensus 670 ~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~ 749 (1471)
.+|+.++|.|. ++++++.| .+.++|+. .++.+.++.+|..+-+---..+....
T Consensus 219 aRvw~~~~~~n--~i~t~ged--------ctcrvW~~-~~~~l~~y~~h~g~~iw~~~~~~~~~---------------- 271 (967)
T KOG0974|consen 219 ARVWACCFLPN--RIITVGED--------CTCRVWGV-NGTQLEVYDEHSGKGIWKIAVPIGVI---------------- 271 (967)
T ss_pred ceeEEEEeccc--eeEEeccc--------eEEEEEec-ccceehhhhhhhhcceeEEEEcCCce----------------
Confidence 99999999998 89999999 99999975 45666699999875554322221111
Q ss_pred cceeeccCCceEeecccccc
Q 000473 750 LLLPIHEDGTFRQSQIQNDE 769 (1471)
Q Consensus 750 ~l~~~~~D~tir~w~l~~~~ 769 (1471)
..++-..|+++|.|++....
T Consensus 272 ~~vT~g~Ds~lk~~~l~~r~ 291 (967)
T KOG0974|consen 272 IKVTGGNDSTLKLWDLNGRG 291 (967)
T ss_pred EEEeeccCcchhhhhhhccc
Confidence 12333348999999876544
No 204
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=98.67 E-value=7.8e-08 Score=112.93 Aligned_cols=150 Identities=15% Similarity=0.098 Sum_probs=112.4
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
.....|.+.+.|-...++.. .+++.++||.|++ |. .+|-...++.....+|.|++|.|+
T Consensus 98 ~sv~AH~~A~~~gRW~~dGt----gLlt~GEDG~iKi--WS--------------rsGMLRStl~Q~~~~v~c~~W~p~- 156 (737)
T KOG1524|consen 98 RSISAHAAAISSGRWSPDGA----GLLTAGEDGVIKI--WS--------------RSGMLRSTVVQNEESIRCARWAPN- 156 (737)
T ss_pred hhhhhhhhhhhhcccCCCCc----eeeeecCCceEEE--Ee--------------ccchHHHHHhhcCceeEEEEECCC-
Confidence 44678888888765333333 8999999999999 43 123333344445678999999998
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE 662 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l 662 (1471)
++..+.+.+....|+ -+.....+-.++.|.|-|.++.|+|. .+.++|||+|-..++||-. |+.+
T Consensus 157 -------S~~vl~c~g~h~~IK--pL~~n~k~i~WkAHDGiiL~~~W~~~------s~lI~sgGED~kfKvWD~~-G~~L 220 (737)
T KOG1524|consen 157 -------SNSIVFCQGGHISIK--PLAANSKIIRWRAHDGLVLSLSWSTQ------SNIIASGGEDFRFKIWDAQ-GANL 220 (737)
T ss_pred -------CCceEEecCCeEEEe--ecccccceeEEeccCcEEEEeecCcc------ccceeecCCceeEEeeccc-Cccc
Confidence 267777766554444 45444556788999999999999998 8999999999999999976 6666
Q ss_pred EEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 663 RMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 663 ~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
..-..|..+|++|+|.|+ ..++.++.+
T Consensus 221 f~S~~~ey~ITSva~npd-~~~~v~S~n 247 (737)
T KOG1524|consen 221 FTSAAEEYAITSVAFNPE-KDYLLWSYN 247 (737)
T ss_pred ccCChhccceeeeeeccc-cceeeeeee
Confidence 666778889999999998 455566664
No 205
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=98.66 E-value=2.6e-07 Score=103.61 Aligned_cols=159 Identities=11% Similarity=0.036 Sum_probs=127.6
Q ss_pred CccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccC
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGW 588 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~ 588 (1471)
...|+|-++.++.. .++.+..+..+.|+.+. .. +--+..++|..|...|+++.|.|.
T Consensus 10 ~~pitchAwn~drt----~iAv~~~~~evhiy~~~-------~~-----~~w~~~htls~Hd~~vtgvdWap~------- 66 (361)
T KOG1523|consen 10 LEPITCHAWNSDRT----QIAVSPNNHEVHIYSML-------GA-----DLWEPAHTLSEHDKIVTGVDWAPK------- 66 (361)
T ss_pred cCceeeeeecCCCc----eEEeccCCceEEEEEec-------CC-----CCceeceehhhhCcceeEEeecCC-------
Confidence 35788887555555 89999999999996654 11 112567889999999999999997
Q ss_pred cCCCEEEEEECCCcEEEEECCC---CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE---
Q 000473 589 SFNEVLVSGSMDCSIRIWDLGS---GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE--- 662 (1471)
Q Consensus 589 ~~~~~L~SGs~DgtI~lWDl~t---g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l--- 662 (1471)
.+.|++++.|..-.||.... .++.-.+.-|...++++.|+|. ++.|+.||.-+.|.||=++...--
T Consensus 67 --snrIvtcs~drnayVw~~~~~~~WkptlvLlRiNrAAt~V~WsP~------enkFAVgSgar~isVcy~E~ENdWWVs 138 (361)
T KOG1523|consen 67 --SNRIVTCSHDRNAYVWTQPSGGTWKPTLVLLRINRAATCVKWSPK------ENKFAVGSGARLISVCYYEQENDWWVS 138 (361)
T ss_pred --CCceeEccCCCCccccccCCCCeeccceeEEEeccceeeEeecCc------CceEEeccCccEEEEEEEecccceehh
Confidence 78999999999999999833 3455667778899999999999 899999999999999987643221
Q ss_pred -EEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 663 -RMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 663 -~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+.-..+.+.|.++.|+|++-+|++||.| +..||+..
T Consensus 139 KhikkPirStv~sldWhpnnVLlaaGs~D--------~k~rVfSa 175 (361)
T KOG1523|consen 139 KHIKKPIRSTVTSLDWHPNNVLLAAGSTD--------GKCRVFSA 175 (361)
T ss_pred hhhCCccccceeeeeccCCcceecccccC--------cceeEEEE
Confidence 1223356678999999999999999999 99999874
No 206
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=98.62 E-value=1.7e-07 Score=106.34 Aligned_cols=125 Identities=17% Similarity=0.165 Sum_probs=107.4
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC------CCceEEEEec-cCCCEEEEEECCCCCC
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG------SGNLITVMHH-HVAPVRQIILSPPQTE 635 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~------tg~~l~~~~~-H~~~V~~l~fspd~~~ 635 (1471)
.+-+.+|.+.|+++.|+.+ +++|+||+.|..+++|++. +.+++..+.. |...|.+++|...
T Consensus 49 qKD~~~H~GCiNAlqFS~N---------~~~L~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~--- 116 (609)
T KOG4227|consen 49 QKDVREHTGCINALQFSHN---------DRFLASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLE--- 116 (609)
T ss_pred hhhhhhhccccceeeeccC---------CeEEeecCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccC---
Confidence 3456799999999999986 8999999999999999985 4567776654 4588999999887
Q ss_pred CCCCCEEEEEeCCCcEEEEECCCCcEEEEecC--CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 636 HPWSDCFLSVGEDFSVALASLETLRVERMFPG--HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 636 ~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g--h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
...+.+|+.|++|.+-|+++.+.+..+.- ..+.|+.+.-+|.++.|++.+.+ +.|.+||++..+
T Consensus 117 ---N~~~~SG~~~~~VI~HDiEt~qsi~V~~~~~~~~~VY~m~~~P~DN~~~~~t~~--------~~V~~~D~Rd~~ 182 (609)
T KOG4227|consen 117 ---NRFLYSGERWGTVIKHDIETKQSIYVANENNNRGDVYHMDQHPTDNTLIVVTRA--------KLVSFIDNRDRQ 182 (609)
T ss_pred ---CeeEecCCCcceeEeeecccceeeeeecccCcccceeecccCCCCceEEEEecC--------ceEEEEeccCCC
Confidence 67899999999999999999998888752 23489999999999999999998 999999998654
No 207
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=98.62 E-value=1.2e-07 Score=111.47 Aligned_cols=162 Identities=17% Similarity=0.205 Sum_probs=127.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++..+.||.+.+. . +.++..+....|.++|.|-.|+|| |.-|+|.+.||.|++|
T Consensus 77 ~~~i~s~DGkf~il--~--------------k~~rVE~sv~AH~~A~~~gRW~~d---------GtgLlt~GEDG~iKiW 131 (737)
T KOG1524|consen 77 TLLICSNDGRFVIL--N--------------KSARVERSISAHAAAISSGRWSPD---------GAGLLTAGEDGVIKIW 131 (737)
T ss_pred eEEEEcCCceEEEe--c--------------ccchhhhhhhhhhhhhhhcccCCC---------CceeeeecCCceEEEE
Confidence 47778899999982 2 234555677899999999999998 8999999999999999
Q ss_pred ECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEE
Q 000473 607 DLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIAC 686 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~s 686 (1471)
. ++|-+..++.....+|.|++|.|+. ...+.+.+. .+.+=.+.-...+.....|.+-|.++.|+|..+.+++
T Consensus 132 S-rsGMLRStl~Q~~~~v~c~~W~p~S-----~~vl~c~g~--h~~IKpL~~n~k~i~WkAHDGiiL~~~W~~~s~lI~s 203 (737)
T KOG1524|consen 132 S-RSGMLRSTVVQNEESIRCARWAPNS-----NSIVFCQGG--HISIKPLAANSKIIRWRAHDGLVLSLSWSTQSNIIAS 203 (737)
T ss_pred e-ccchHHHHHhhcCceeEEEEECCCC-----CceEEecCC--eEEEeecccccceeEEeccCcEEEEeecCccccceee
Confidence 8 5676666777778899999999993 334444443 3444555555555667889999999999999999999
Q ss_pred EEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeec
Q 000473 687 LCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKG 730 (1471)
Q Consensus 687 gs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~ 730 (1471)
|++| -..+|||-. |+.+-.-..|.-.++.+.|.|+
T Consensus 204 gGED--------~kfKvWD~~-G~~Lf~S~~~ey~ITSva~npd 238 (737)
T KOG1524|consen 204 GGED--------FRFKIWDAQ-GANLFTSAAEEYAITSVAFNPE 238 (737)
T ss_pred cCCc--------eeEEeeccc-CcccccCChhccceeeeeeccc
Confidence 9999 999999964 6666666777777888777754
No 208
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.62 E-value=1.3e-06 Score=109.95 Aligned_cols=197 Identities=12% Similarity=0.067 Sum_probs=126.0
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+.-....+.+||.+.++........ .-....|+.+.++.+.. -..+++|+.||.|+| |+.... .|
T Consensus 1079 i~~ad~r~~i~vwd~e~~~~l~~F~n~-~~~~t~Vs~l~liNe~D--~aLlLtas~dGvIRI--wk~y~~------~~-- 1145 (1387)
T KOG1517|consen 1079 IAAADDRERIRVWDWEKGRLLNGFDNG-AFPDTRVSDLELINEQD--DALLLTASSDGVIRI--WKDYAD------KW-- 1145 (1387)
T ss_pred eEEcCCcceEEEEecccCceeccccCC-CCCCCccceeeeecccc--hhheeeeccCceEEE--eccccc------cc--
Confidence 334444567888988776442211111 11235688877666543 127899999999999 542211 11
Q ss_pred CCcceEEEE---ecC----CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec-cCCCEEEEEE
Q 000473 558 NSHVSRQYF---LGH----TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH-HVAPVRQIIL 629 (1471)
Q Consensus 558 ~s~~~~~~l---~gH----~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~-H~~~V~~l~f 629 (1471)
+..+.+..+ .++ .+.-.-+.|... ..+|+++|.-..|++||.......+.+.. -...|+++.-
T Consensus 1146 ~~~eLVTaw~~Ls~~~~~~r~~~~v~dWqQ~---------~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~ 1216 (1387)
T KOG1517|consen 1146 KKPELVTAWSSLSDQLPGARGTGLVVDWQQQ---------SGHLLVTGDVRSIRIWDAHKEQVVADIPYGSSTLVTALSA 1216 (1387)
T ss_pred CCceeEEeeccccccCccCCCCCeeeehhhh---------CCeEEecCCeeEEEEEecccceeEeecccCCCccceeecc
Confidence 112222222 121 111123456554 33444444588999999998777765543 3445666654
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCc---EEEEecCCCCC--cEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEE
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLR---VERMFPGHPNY--PAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFI 703 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~---~l~~~~gh~~~--V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~V 703 (1471)
+-. .|+.++.|..||+|++||.+... .+.....|... |..+.+.+.|- .|++||.| |.|++
T Consensus 1217 ~~~-----~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~--------G~I~~ 1283 (1387)
T KOG1517|consen 1217 DLV-----HGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQD--------GDIQL 1283 (1387)
T ss_pred ccc-----CCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccC--------CeEEE
Confidence 333 28999999999999999997543 46777888887 99999988665 49999999 99999
Q ss_pred EECCCC
Q 000473 704 WDVKTG 709 (1471)
Q Consensus 704 WDi~tg 709 (1471)
||++..
T Consensus 1284 ~DlR~~ 1289 (1387)
T KOG1517|consen 1284 LDLRMS 1289 (1387)
T ss_pred EecccC
Confidence 999874
No 209
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.60 E-value=5.5e-08 Score=113.04 Aligned_cols=134 Identities=16% Similarity=0.115 Sum_probs=119.3
Q ss_pred cceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCC
Q 000473 560 HVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 560 ~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
|..++.++.| ..|..|.|-|- --+|++++..|-++.-|+.+|+++..+..-.+.+..+.-+|- .
T Consensus 200 GtElHClk~~-~~v~rLeFLPy---------HfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~~vm~qNP~------N 263 (545)
T KOG1272|consen 200 GTELHCLKRH-IRVARLEFLPY---------HFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRTDVMKQNPY------N 263 (545)
T ss_pred CcEEeehhhc-Cchhhhcccch---------hheeeecccCCceEEEeechhhhhHHHHccCCccchhhcCCc------c
Confidence 4556677666 46899999986 578899999999999999999999999888899999999997 6
Q ss_pred CEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 640 DCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
..+..|...|+|.+|+....+++..+..|.++|.+|++.++|.|++|.+.| ..++|||+++...+.++..
T Consensus 264 aVih~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~D--------r~~kIWDlR~~~ql~t~~t 333 (545)
T KOG1272|consen 264 AVIHLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGLD--------RKVKIWDLRNFYQLHTYRT 333 (545)
T ss_pred ceEEEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeecccc--------cceeEeeeccccccceeec
Confidence 789999999999999999999998889999999999999999999999999 9999999998776666554
No 210
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.59 E-value=0.00018 Score=87.24 Aligned_cols=95 Identities=12% Similarity=0.169 Sum_probs=61.8
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEE----ECCCcEEEEECCCCceEEEEeccC-CCEEEEEEC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSG----SMDCSIRIWDLGSGNLITVMHHHV-APVRQIILS 630 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SG----s~DgtI~lWDl~tg~~l~~~~~H~-~~V~~l~fs 630 (1471)
+..+.+.++.+....+. .-+..||+ ++++... ....+|.++|.++.+...++.... ..+..+.|+
T Consensus 255 d~~~wkvv~~I~~~G~g-lFi~thP~---------s~~vwvd~~~~~~~~~v~viD~~tl~~~~~i~~~~~~~~~h~ef~ 324 (369)
T PF02239_consen 255 DDYAWKVVKTIPTQGGG-LFIKTHPD---------SRYVWVDTFLNPDADTVQVIDKKTLKVVKTITPGPGKRVVHMEFN 324 (369)
T ss_dssp TTTBTSEEEEEE-SSSS---EE--TT----------SEEEEE-TT-SSHT-EEEEECCGTEEEE-HHHHHT--EEEEEE-
T ss_pred hhhcCeEEEEEECCCCc-ceeecCCC---------CccEEeeccCCCCCceEEEEECcCcceeEEEeccCCCcEeccEEC
Confidence 34445667777766666 66777997 6666665 556899999999998877775433 358999999
Q ss_pred CCCCCCCCCC-EEEEEeCCC-cEEEEECCCCcEEEEec
Q 000473 631 PPQTEHPWSD-CFLSVGEDF-SVALASLETLRVERMFP 666 (1471)
Q Consensus 631 pd~~~~~~~~-~l~S~s~Dg-sV~lWdl~t~~~l~~~~ 666 (1471)
++ |. ..+|.-.++ .|.++|..+.+.+..++
T Consensus 325 ~d------G~~v~vS~~~~~~~i~v~D~~Tl~~~~~i~ 356 (369)
T PF02239_consen 325 PD------GKEVWVSVWDGNGAIVVYDAKTLKEKKRIP 356 (369)
T ss_dssp TT------SSEEEEEEE--TTEEEEEETTTTEEEEEEE
T ss_pred CC------CCEEEEEEecCCCEEEEEECCCcEEEEEEE
Confidence 99 66 444554444 89999999999998876
No 211
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.56 E-value=2.6e-06 Score=102.61 Aligned_cols=122 Identities=16% Similarity=0.243 Sum_probs=112.5
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEe--ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMH--HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH 668 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~--~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh 668 (1471)
...++-|...|.|-++++..|+....+. .|.+.|.++.++-+ -.||.|++.|..+..|+..+++.++.+.+.
T Consensus 70 t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~------~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~ 143 (541)
T KOG4547|consen 70 TSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQR------LGCIYSVGADLKVVYILEKEKVIIRIWKEQ 143 (541)
T ss_pred ceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccc------cCceEecCCceeEEEEecccceeeeeeccC
Confidence 4578889999999999999999888886 59999999998887 779999999999999999999999999988
Q ss_pred CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 669 PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 669 ~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
...+.+++.+||+..|++++ +.|++||+++++.+++++||.+.|-+..|.
T Consensus 144 ~~~~~sl~is~D~~~l~~as----------~~ik~~~~~~kevv~~ftgh~s~v~t~~f~ 193 (541)
T KOG4547|consen 144 KPLVSSLCISPDGKILLTAS----------RQIKVLDIETKEVVITFTGHGSPVRTLSFT 193 (541)
T ss_pred CCccceEEEcCCCCEEEecc----------ceEEEEEccCceEEEEecCCCcceEEEEEE
Confidence 88999999999999999887 579999999999999999999999888777
No 212
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=98.56 E-value=1.9e-06 Score=96.79 Aligned_cols=192 Identities=12% Similarity=0.038 Sum_probs=143.1
Q ss_pred ecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccC
Q 000473 479 DLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVN 558 (1471)
Q Consensus 479 ~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~ 558 (1471)
+.+..+.-+.+++..+..+. ....++..|...|+.+...+... +|++++.|..-.| |....+ .
T Consensus 26 Av~~~~~evhiy~~~~~~~w-~~~htls~Hd~~vtgvdWap~sn----rIvtcs~drnayV--w~~~~~----------~ 88 (361)
T KOG1523|consen 26 AVSPNNHEVHIYSMLGADLW-EPAHTLSEHDKIVTGVDWAPKSN----RIVTCSHDRNAYV--WTQPSG----------G 88 (361)
T ss_pred EeccCCceEEEEEecCCCCc-eeceehhhhCcceeEEeecCCCC----ceeEccCCCCccc--cccCCC----------C
Confidence 33444445555555554321 34567889999999987444433 8999999999999 553221 1
Q ss_pred CcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE----EEEeccCCCEEEEEECCCCC
Q 000473 559 SHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI----TVMHHHVAPVRQIILSPPQT 634 (1471)
Q Consensus 559 s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l----~~~~~H~~~V~~l~fspd~~ 634 (1471)
+-++...|..|+..++|+.|+|. ++.++.||.-..|.||-++..+-- +.-+.+...|.++.|+|+
T Consensus 89 ~WkptlvLlRiNrAAt~V~WsP~---------enkFAVgSgar~isVcy~E~ENdWWVsKhikkPirStv~sldWhpn-- 157 (361)
T KOG1523|consen 89 TWKPTLVLLRINRAATCVKWSPK---------ENKFAVGSGARLISVCYYEQENDWWVSKHIKKPIRSTVTSLDWHPN-- 157 (361)
T ss_pred eeccceeEEEeccceeeEeecCc---------CceEEeccCccEEEEEEEecccceehhhhhCCccccceeeeeccCC--
Confidence 23456677889999999999997 899999999999999998754421 333456778999999998
Q ss_pred CCCCCCEEEEEeCCCcEEEEEC-----CC-------------CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCC
Q 000473 635 EHPWSDCFLSVGEDFSVALASL-----ET-------------LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSD 696 (1471)
Q Consensus 635 ~~~~~~~l~S~s~DgsV~lWdl-----~t-------------~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D 696 (1471)
.-+++.||.|+.+++++. ++ |..+.++....+.|..+.|+|.|..|+-.+.|
T Consensus 158 ----nVLlaaGs~D~k~rVfSayIK~Vdekpap~pWgsk~PFG~lm~E~~~~ggwvh~v~fs~sG~~lawv~Hd------ 227 (361)
T KOG1523|consen 158 ----NVLLAAGSTDGKCRVFSAYIKGVDEKPAPTPWGSKMPFGQLMSEASSSGGWVHGVLFSPSGNRLAWVGHD------ 227 (361)
T ss_pred ----cceecccccCcceeEEEEeeeccccCCCCCCCccCCcHHHHHHhhccCCCceeeeEeCCCCCEeeEecCC------
Confidence 789999999999999964 21 23344454566789999999999999999998
Q ss_pred CCCEEEEEECCCCe
Q 000473 697 AVDVLFIWDVKTGA 710 (1471)
Q Consensus 697 ~~gtV~VWDi~tg~ 710 (1471)
.+|.+=|.....
T Consensus 228 --s~v~~~da~~p~ 239 (361)
T KOG1523|consen 228 --STVSFVDAAGPS 239 (361)
T ss_pred --CceEEeecCCCc
Confidence 899999876553
No 213
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=98.54 E-value=9.7e-07 Score=100.28 Aligned_cols=173 Identities=12% Similarity=0.042 Sum_probs=129.6
Q ss_pred ccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEec-CCccEEEEEEe
Q 000473 501 GRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLG-HTGAVLCLAAH 579 (1471)
Q Consensus 501 ~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~g-H~~~V~~la~s 579 (1471)
+.+.+.+|.+-|+++.+..... .|++|+.|-.+++|..+.+.- ..+.++++.... |...|.||+|.
T Consensus 48 ~qKD~~~H~GCiNAlqFS~N~~----~L~SGGDD~~~~~W~~de~~~---------~k~~KPI~~~~~~H~SNIF~L~F~ 114 (609)
T KOG4227|consen 48 CQKDVREHTGCINALQFSHNDR----FLASGGDDMHGRVWNVDELMV---------RKTPKPIGVMEHPHRSNIFSLEFD 114 (609)
T ss_pred hhhhhhhhccccceeeeccCCe----EEeecCCcceeeeechHHHHh---------hcCCCCceeccCccccceEEEEEc
Confidence 4455678999999988555544 699999999999955442211 133456655433 44789999998
Q ss_pred cCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec--cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC
Q 000473 580 RMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE 657 (1471)
Q Consensus 580 pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~ 657 (1471)
.. +.++.||+.+++|.+-|+.+.+.+.++.. ..+.|..+..+|. .+.|++.+.|+.|.+||.+
T Consensus 115 ~~---------N~~~~SG~~~~~VI~HDiEt~qsi~V~~~~~~~~~VY~m~~~P~------DN~~~~~t~~~~V~~~D~R 179 (609)
T KOG4227|consen 115 LE---------NRFLYSGERWGTVIKHDIETKQSIYVANENNNRGDVYHMDQHPT------DNTLIVVTRAKLVSFIDNR 179 (609)
T ss_pred cC---------CeeEecCCCcceeEeeecccceeeeeecccCcccceeecccCCC------CceEEEEecCceEEEEecc
Confidence 75 78999999999999999999998888753 3468999999998 8999999999999999998
Q ss_pred CCc-EEEE-ec-CCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 658 TLR-VERM-FP-GHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 658 t~~-~l~~-~~-gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
... .+.. ++ ........+.|+|... +|++.+.. +-+-+||++..
T Consensus 180 d~~~~~~~~~~AN~~~~F~t~~F~P~~P~Li~~~~~~--------~G~~~~D~R~~ 227 (609)
T KOG4227|consen 180 DRQNPISLVLPANSGKNFYTAEFHPETPALILVNSET--------GGPNVFDRRMQ 227 (609)
T ss_pred CCCCCCceeeecCCCccceeeeecCCCceeEEecccc--------CCCCceeeccc
Confidence 654 2211 11 1233467899999876 55666655 78999998754
No 214
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=98.53 E-value=4.3e-06 Score=106.97 Aligned_cols=153 Identities=17% Similarity=0.194 Sum_probs=117.2
Q ss_pred ccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-------ceEEEEeccCCCEEE
Q 000473 554 SLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-------NLITVMHHHVAPVRQ 626 (1471)
Q Consensus 554 ~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-------~~l~~~~~H~~~V~~ 626 (1471)
.|+. .|..+..|..|...|..++.++.+ +.+++|||.||+|++||+..- +...++..-.+.+.+
T Consensus 1033 gW~p-~G~lVAhL~Ehs~~v~k~a~s~~~--------~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~ 1103 (1431)
T KOG1240|consen 1033 GWNP-RGILVAHLHEHSSAVIKLAVSSEH--------TSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEK 1103 (1431)
T ss_pred CCCc-cceEeehhhhccccccceeecCCC--------CceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEE
Confidence 4665 367888999999999999998763 689999999999999998631 123344445678999
Q ss_pred EEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-------------------cEEEEe----------------------
Q 000473 627 IILSPPQTEHPWSDCFLSVGEDFSVALASLETL-------------------RVERMF---------------------- 665 (1471)
Q Consensus 627 l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-------------------~~l~~~---------------------- 665 (1471)
+.+.+. ++.+|.++.||.|++.+++-. ..+.+.
T Consensus 1104 vt~~~~------~~~~Av~t~DG~v~~~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~ 1177 (1431)
T KOG1240|consen 1104 VTMCGN------GDQFAVSTKDGSVRVLRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVS 1177 (1431)
T ss_pred EEeccC------CCeEEEEcCCCeEEEEEccccccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccceEE
Confidence 999988 899999999999999987541 000000
Q ss_pred --------------cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe-CCCCCceeeeeee
Q 000473 666 --------------PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR-GTASHSMFDHFCK 729 (1471)
Q Consensus 666 --------------~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~-gH~~~v~~~~~~~ 729 (1471)
....+.|++++.+|.+.+++.|..- |.+-+||++-+.++.... +|.+.+.-+..|+
T Consensus 1178 ~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~--------G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~ 1248 (1431)
T KOG1240|consen 1178 WDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSR--------GQLVLWDLRFRVPILSWEHPARAPIRHVWLCP 1248 (1431)
T ss_pred ecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCC--------ceEEEEEeecCceeecccCcccCCcceEEeec
Confidence 0112468999999999999999887 999999999998887754 5566776666664
No 215
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.51 E-value=5.9e-06 Score=91.12 Aligned_cols=136 Identities=13% Similarity=0.194 Sum_probs=89.3
Q ss_pred EEEEECCC-CceEEEEec-cCCCEEEEEECCCCCCCCCCCEEEEE--eCCCcEEEEECCCCcEEEEecCCCCCcEEEEEc
Q 000473 603 IRIWDLGS-GNLITVMHH-HVAPVRQIILSPPQTEHPWSDCFLSV--GEDFSVALASLETLRVERMFPGHPNYPAKVVWD 678 (1471)
Q Consensus 603 I~lWDl~t-g~~l~~~~~-H~~~V~~l~fspd~~~~~~~~~l~S~--s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~s 678 (1471)
..+|.++. +.....+.- ..++|.+++|+|+ |+.|+.+ ..+..|.+||++ ++.+..+. ...+..|.|+
T Consensus 39 ~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~------g~~favi~g~~~~~v~lyd~~-~~~i~~~~--~~~~n~i~ws 109 (194)
T PF08662_consen 39 FELFYLNEKNIPVESIELKKEGPIHDVAWSPN------GNEFAVIYGSMPAKVTLYDVK-GKKIFSFG--TQPRNTISWS 109 (194)
T ss_pred EEEEEEecCCCccceeeccCCCceEEEEECcC------CCEEEEEEccCCcccEEEcCc-ccEeEeec--CCCceEEEEC
Confidence 44444422 233444433 4567999999999 7876544 467899999997 67777764 4678899999
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCC
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDG 758 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~ 758 (1471)
|+|++|++++.+ +..|.|.+||+++.+.+...... .+..+.|+|. +-.+++..+... ...|.
T Consensus 110 P~G~~l~~~g~~-----n~~G~l~~wd~~~~~~i~~~~~~--~~t~~~WsPd------Gr~~~ta~t~~r-----~~~dn 171 (194)
T PF08662_consen 110 PDGRFLVLAGFG-----NLNGDLEFWDVRKKKKISTFEHS--DATDVEWSPD------GRYLATATTSPR-----LRVDN 171 (194)
T ss_pred CCCCEEEEEEcc-----CCCcEEEEEECCCCEEeeccccC--cEEEEEEcCC------CCEEEEEEeccc-----eeccc
Confidence 999999998754 22278999999998888776432 3445556642 112222222111 12378
Q ss_pred ceEeecc
Q 000473 759 TFRQSQI 765 (1471)
Q Consensus 759 tir~w~l 765 (1471)
.+++|+.
T Consensus 172 g~~Iw~~ 178 (194)
T PF08662_consen 172 GFKIWSF 178 (194)
T ss_pred cEEEEEe
Confidence 8999986
No 216
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.49 E-value=2.9e-06 Score=96.41 Aligned_cols=182 Identities=18% Similarity=0.226 Sum_probs=119.4
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGAS 554 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~ 554 (1471)
++....+..|++++.....+. .+... ....|+|++ |.|. -++.|+..| |.+|..+.-....+..+
T Consensus 113 fava~nddvVriy~ksst~pt-~Lks~---sQrnvtcla------wRPlsaselavgCr~g-IciW~~s~tln~~r~~~- 180 (445)
T KOG2139|consen 113 FAVATNDDVVRIYDKSSTCPT-KLKSV---SQRNVTCLA------WRPLSASELAVGCRAG-ICIWSDSRTLNANRNIR- 180 (445)
T ss_pred hhhhccCcEEEEeccCCCCCc-eecch---hhcceeEEE------eccCCcceeeeeecce-eEEEEcCcccccccccc-
Confidence 445556667777776553321 11111 124688888 5554 688888765 56643331110000000
Q ss_pred cccCCc-ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE-CCCcEEEEECCCCceEEEEeccCCCEEEEEECCC
Q 000473 555 LKVNSH-VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS-MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPP 632 (1471)
Q Consensus 555 ~d~~s~-~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs-~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd 632 (1471)
-..++ -.+..-.|| ..|+++.|.+| +..+++++ .|..|.+||..+|..+....--.+.+.-+.|+||
T Consensus 181 -~~s~~~~qvl~~pgh-~pVtsmqwn~d---------gt~l~tAS~gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPd 249 (445)
T KOG2139|consen 181 -MMSTHHLQVLQDPGH-NPVTSMQWNED---------GTILVTASFGSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPD 249 (445)
T ss_pred -cccccchhheeCCCC-ceeeEEEEcCC---------CCEEeecccCcceEEEEcCCCCCcccccccCCCceeeEEEcCC
Confidence 00111 112223466 68999999997 78888887 5568999999998876554445577899999999
Q ss_pred CCCCCCCCEEEEEeCCCcEEEEEC-CCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEc
Q 000473 633 QTEHPWSDCFLSVGEDFSVALASL-ETLRVERMFPGHPNYPAKVVWDCPRGYIACLCR 689 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~DgsV~lWdl-~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~ 689 (1471)
++.|..+.-|+..++|+. ++..+.+-..+. +.|...+|+|+|++|+-.+.
T Consensus 250 ------gd~lfaAt~davfrlw~e~q~wt~erw~lgs-grvqtacWspcGsfLLf~~s 300 (445)
T KOG2139|consen 250 ------GDVLFAATCDAVFRLWQENQSWTKERWILGS-GRVQTACWSPCGSFLLFACS 300 (445)
T ss_pred ------CCEEEEecccceeeeehhcccceecceeccC-CceeeeeecCCCCEEEEEEc
Confidence 999999999999999955 445555544443 38999999999999988776
No 217
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.48 E-value=0.00012 Score=84.15 Aligned_cols=95 Identities=20% Similarity=0.311 Sum_probs=81.5
Q ss_pred CCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCc-EEEEECCCCc
Q 000473 534 SGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCS-IRIWDLGSGN 612 (1471)
Q Consensus 534 DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dgt-I~lWDl~tg~ 612 (1471)
.|+|.+ |+ ..+-+++..+..|.+.+-|++|+++ |.+|+|+|..|+ |||+++.+|+
T Consensus 152 ~GdV~l--~d-------------~~nl~~v~~I~aH~~~lAalafs~~---------G~llATASeKGTVIRVf~v~~G~ 207 (391)
T KOG2110|consen 152 SGDVVL--FD-------------TINLQPVNTINAHKGPLAALAFSPD---------GTLLATASEKGTVIRVFSVPEGQ 207 (391)
T ss_pred CceEEE--EE-------------cccceeeeEEEecCCceeEEEECCC---------CCEEEEeccCceEEEEEEcCCcc
Confidence 688888 33 3345778889999999999999998 999999999996 5799999999
Q ss_pred eEEEEeccC--CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC
Q 000473 613 LITVMHHHV--APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET 658 (1471)
Q Consensus 613 ~l~~~~~H~--~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t 658 (1471)
.+..|+--. ..|.+++|+|+ +++|++.|..++|.++.+++
T Consensus 208 kl~eFRRG~~~~~IySL~Fs~d------s~~L~~sS~TeTVHiFKL~~ 249 (391)
T KOG2110|consen 208 KLYEFRRGTYPVSIYSLSFSPD------SQFLAASSNTETVHIFKLEK 249 (391)
T ss_pred EeeeeeCCceeeEEEEEEECCC------CCeEEEecCCCeEEEEEecc
Confidence 999987544 35789999999 89999999999999998864
No 218
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.48 E-value=2.6e-06 Score=96.82 Aligned_cols=153 Identities=16% Similarity=0.198 Sum_probs=111.2
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEec-CCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLG-HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~g-H~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
.++.+..|..|++++-. + .....++. -...|+|++|-|. ....|+.|...| |++
T Consensus 112 ~fava~nddvVriy~ks---------------s-t~pt~Lks~sQrnvtclawRPl--------saselavgCr~g-Ici 166 (445)
T KOG2139|consen 112 AFAVATNDDVVRIYDKS---------------S-TCPTKLKSVSQRNVTCLAWRPL--------SASELAVGCRAG-ICI 166 (445)
T ss_pred hhhhhccCcEEEEeccC---------------C-CCCceecchhhcceeEEEeccC--------Ccceeeeeecce-eEE
Confidence 57888889999993321 1 11222222 2346999999997 256777776655 899
Q ss_pred EECCCC----ceE----------EEEeccCCCEEEEEECCCCCCCCCCCEEEEEe-CCCcEEEEECCCCcEEEEecCCCC
Q 000473 606 WDLGSG----NLI----------TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG-EDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 606 WDl~tg----~~l----------~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s-~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
|..... ..+ ..-.+| .+|+++.|.+| |..+++++ .|..|.|||..++.++.......+
T Consensus 167 W~~s~tln~~r~~~~~s~~~~qvl~~pgh-~pVtsmqwn~d------gt~l~tAS~gsssi~iWdpdtg~~~pL~~~glg 239 (445)
T KOG2139|consen 167 WSDSRTLNANRNIRMMSTHHLQVLQDPGH-NPVTSMQWNED------GTILVTASFGSSSIMIWDPDTGQKIPLIPKGLG 239 (445)
T ss_pred EEcCcccccccccccccccchhheeCCCC-ceeeEEEEcCC------CCEEeecccCcceEEEEcCCCCCcccccccCCC
Confidence 987521 111 122345 58999999999 88899887 478999999999988776655566
Q ss_pred CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE-ECCCCeEEEEEeCCC
Q 000473 671 YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW-DVKTGARERVLRGTA 719 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW-Di~tg~~~~~l~gH~ 719 (1471)
.+.-+.||||+.+|+++.-| +..++| .-++...++-..|..
T Consensus 240 g~slLkwSPdgd~lfaAt~d--------avfrlw~e~q~wt~erw~lgsg 281 (445)
T KOG2139|consen 240 GFSLLKWSPDGDVLFAATCD--------AVFRLWQENQSWTKERWILGSG 281 (445)
T ss_pred ceeeEEEcCCCCEEEEeccc--------ceeeeehhcccceecceeccCC
Confidence 78999999999999999999 999999 445666666665554
No 219
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.48 E-value=5.1e-06 Score=93.48 Aligned_cols=177 Identities=11% Similarity=0.084 Sum_probs=120.5
Q ss_pred cCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceE
Q 000473 484 QDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSR 563 (1471)
Q Consensus 484 ~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~ 563 (1471)
.+.+-+||-....+. .++. -...|.++. +.++++|..- .+.|.||.+.. | .+..
T Consensus 74 pNkviIWDD~k~~~i----~el~-f~~~I~~V~------l~r~riVvvl-~~~I~VytF~~-----------n---~k~l 127 (346)
T KOG2111|consen 74 PNKVIIWDDLKERCI----IELS-FNSEIKAVK------LRRDRIVVVL-ENKIYVYTFPD-----------N---PKLL 127 (346)
T ss_pred CceEEEEecccCcEE----EEEE-eccceeeEE------EcCCeEEEEe-cCeEEEEEcCC-----------C---hhhe
Confidence 345677885543332 2211 234566655 5666677655 57889966541 1 1222
Q ss_pred EEEecC--CccEEEEEEecCCCCcccCcCCCEEE-EEECCCcEEEEECCCCce--EEEEeccCCCEEEEEECCCCCCCCC
Q 000473 564 QYFLGH--TGAVLCLAAHRMVGTAKGWSFNEVLV-SGSMDCSIRIWDLGSGNL--ITVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 564 ~~l~gH--~~~V~~la~spd~~~~~~~~~~~~L~-SGs~DgtI~lWDl~tg~~--l~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
+.+.-- ...+.++ .|.. ...+|+ -|-.-|.|++-|+..-+. -..+.+|...|.+++.+-+
T Consensus 128 ~~~et~~NPkGlC~~--~~~~-------~k~~LafPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~------ 192 (346)
T KOG2111|consen 128 HVIETRSNPKGLCSL--CPTS-------NKSLLAFPGFKTGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQ------ 192 (346)
T ss_pred eeeecccCCCceEee--cCCC-------CceEEEcCCCccceEEEEEhhhcCcCCceEEEcccCceeEEEEcCC------
Confidence 222211 1122222 2321 133333 345568999999986655 3778899999999999998
Q ss_pred CCEEEEEeCCCc-EEEEECCCCcEEEEecCC--CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 639 SDCFLSVGEDFS-VALASLETLRVERMFPGH--PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 639 ~~~l~S~s~Dgs-V~lWdl~t~~~l~~~~gh--~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
|..+||+|..|+ |||||.++|..+++++.. ...|.+++|+|+..+|++++.- |+|+|+.++.-
T Consensus 193 Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdK--------gTlHiF~l~~~ 258 (346)
T KOG2111|consen 193 GTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSSDK--------GTLHIFSLRDT 258 (346)
T ss_pred ccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEcCC--------CeEEEEEeecC
Confidence 999999999998 599999999999998754 3468999999999999999888 99999998753
No 220
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=98.47 E-value=1.3e-06 Score=102.35 Aligned_cols=112 Identities=14% Similarity=0.245 Sum_probs=77.0
Q ss_pred CCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEe-cccccceeEeeeccccccccCccccccccccc
Q 000473 13 TPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAML-CGHSAPIADLSICYPAMVSRDGKAEHWKAENS 91 (1471)
Q Consensus 13 ~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L-~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~ 91 (1471)
.++.++|..|-|...|..|+||+.|-+|.+||... ..++..+ .||...|-.-.| .
T Consensus 139 ~~H~GcVntV~FN~~Gd~l~SgSDD~~vv~WdW~~-----~~~~l~f~SGH~~NvfQaKF-i------------------ 194 (559)
T KOG1334|consen 139 NKHKGCVNTVHFNQRGDVLASGSDDLQVVVWDWVS-----GSPKLSFESGHCNNVFQAKF-I------------------ 194 (559)
T ss_pred cCCCCccceeeecccCceeeccCccceEEeehhhc-----cCcccccccccccchhhhhc-c------------------
Confidence 46778999999999999999999999999999984 2333333 388877755543 0
Q ss_pred ccccccccCCCCEEEEEeCCCeEEEEEcC-CCeEEEeeeCCCCCCCCcEEEEcCCCC--eEEEEcce
Q 000473 92 SNVMGKSSLDNGALISACTDGVLCVWSRS-SGHCRRRRKLPPWVGSPSVICTLPSNP--RYVCIGCC 155 (1471)
Q Consensus 92 ~~~~~~~s~d~~~LaSas~DG~I~VWdv~-~G~ci~~~~l~~~~g~~~~i~~~s~~~--~ll~~G~~ 155 (1471)
.+.+...+++.+.||.+++=.+. +|.|.....+-+ |..|..+.++-++. .++++|.+
T Consensus 195 ------P~s~d~ti~~~s~dgqvr~s~i~~t~~~e~t~rl~~-h~g~vhklav~p~sp~~f~S~geD 254 (559)
T KOG1334|consen 195 ------PFSGDRTIVTSSRDGQVRVSEILETGYVENTKRLAP-HEGPVHKLAVEPDSPKPFLSCGED 254 (559)
T ss_pred ------CCCCCcCceeccccCceeeeeeccccceecceeccc-ccCccceeeecCCCCCcccccccc
Confidence 14556779999999999988766 555554443332 23344544444332 45666665
No 221
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.44 E-value=0.00012 Score=88.79 Aligned_cols=103 Identities=12% Similarity=0.080 Sum_probs=74.3
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE----eCCCcEEEEECCCCcEEEEecCC-CCCcEEE
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV----GEDFSVALASLETLRVERMFPGH-PNYPAKV 675 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~----s~DgsV~lWdl~t~~~l~~~~gh-~~~V~~v 675 (1471)
..+.+||..+++.+.++....++ .-+..+|+ ++++... ....+|.++|.++.+.+..+... ...+..+
T Consensus 249 ~~v~v~d~~~wkvv~~I~~~G~g-lFi~thP~------s~~vwvd~~~~~~~~~v~viD~~tl~~~~~i~~~~~~~~~h~ 321 (369)
T PF02239_consen 249 DPVSVHDDYAWKVVKTIPTQGGG-LFIKTHPD------SRYVWVDTFLNPDADTVQVIDKKTLKVVKTITPGPGKRVVHM 321 (369)
T ss_dssp -TTT-STTTBTSEEEEEE-SSSS---EE--TT-------SEEEEE-TT-SSHT-EEEEECCGTEEEE-HHHHHT--EEEE
T ss_pred CccccchhhcCeEEEEEECCCCc-ceeecCCC------CccEEeeccCCCCCceEEEEECcCcceeEEEeccCCCcEecc
Confidence 35668999999999999988887 77888998 7777766 45589999999999888877532 2358899
Q ss_pred EEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 676 VWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 676 ~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.|+++|+++..+..+ .++.|.|||.+|.++++.+.
T Consensus 322 ef~~dG~~v~vS~~~------~~~~i~v~D~~Tl~~~~~i~ 356 (369)
T PF02239_consen 322 EFNPDGKEVWVSVWD------GNGAIVVYDAKTLKEKKRIP 356 (369)
T ss_dssp EE-TTSSEEEEEEE--------TTEEEEEETTTTEEEEEEE
T ss_pred EECCCCCEEEEEEec------CCCEEEEEECCCcEEEEEEE
Confidence 999999988887766 22589999999999999887
No 222
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.42 E-value=7.8e-06 Score=94.13 Aligned_cols=205 Identities=10% Similarity=0.043 Sum_probs=138.1
Q ss_pred eeecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcC--CcEEEEEecccccCCCCCCc
Q 000473 477 KSDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFS--GEIEVIQFDLFERHNSPGAS 554 (1471)
Q Consensus 477 ~l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~D--G~I~V~~~~~l~~~d~~~~~ 554 (1471)
++.+...++.+.+|....+.........+..+ ..+.. +....-+|+.+++|+.. ..+.||+.. . ..+.
T Consensus 117 ~Litc~~sG~l~~~~~k~~d~hss~l~~la~g-~g~~~---~r~~~~~p~Iva~GGke~~n~lkiwdle--~----~~qi 186 (412)
T KOG3881|consen 117 TLITCVSSGNLQVRHDKSGDLHSSKLIKLATG-PGLYD---VRQTDTDPYIVATGGKENINELKIWDLE--Q----SKQI 186 (412)
T ss_pred EEEEEecCCcEEEEeccCCccccccceeeecC-Cceee---eccCCCCCceEecCchhcccceeeeecc--c----ceee
Confidence 45555566677777766432110011111111 11222 22333456688889888 667773322 1 1112
Q ss_pred cccCCcceEEEEecCCc--cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc-eEEEEeccCCCEEEEEECC
Q 000473 555 LKVNSHVSRQYFLGHTG--AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN-LITVMHHHVAPVRQIILSP 631 (1471)
Q Consensus 555 ~d~~s~~~~~~l~gH~~--~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~-~l~~~~~H~~~V~~l~fsp 631 (1471)
|..+.-+. - ..+-.- .++.+.|-+.. ....|+++..-+.+++||...++ ++..|..-..+|+++...|
T Consensus 187 w~aKNvpn-D-~L~LrVPvW~tdi~Fl~g~-------~~~~fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~p 257 (412)
T KOG3881|consen 187 WSAKNVPN-D-RLGLRVPVWITDIRFLEGS-------PNYKFATITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLTP 257 (412)
T ss_pred eeccCCCC-c-cccceeeeeeccceecCCC-------CCceEEEEecceeEEEecCcccCcceeEeccccCcceeeeecC
Confidence 22111000 0 001111 24567776531 25789999999999999998765 6778877788999999999
Q ss_pred CCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEE-ecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 632 PQTEHPWSDCFLSVGEDFSVALASLETLRVERM-FPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~-~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
. ++.+.+|..-+.+..+|++.++.+.. +.|-.+.|+.+..+|.+++|++++-| ..|||+|++|.+
T Consensus 258 ~------gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GLD--------RyvRIhD~ktrk 323 (412)
T KOG3881|consen 258 S------GNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGLD--------RYVRIHDIKTRK 323 (412)
T ss_pred C------CcEEEEecccchhheecccCceeeccccCCccCCcceEEEcCCCceEEeeccc--------eeEEEeecccch
Confidence 9 99999999999999999999998766 88899999999999999999999999 999999999976
Q ss_pred EEEE
Q 000473 711 RERV 714 (1471)
Q Consensus 711 ~~~~ 714 (1471)
++..
T Consensus 324 ll~k 327 (412)
T KOG3881|consen 324 LLHK 327 (412)
T ss_pred hhhh
Confidence 6543
No 223
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.40 E-value=7.1e-06 Score=101.69 Aligned_cols=172 Identities=15% Similarity=0.126 Sum_probs=109.0
Q ss_pred CccccccccCCCCCCCccccccccCccEEEEEeeccccccCC--EEEEEEcC-C--cEEEEEecccccCCCCCCccccCC
Q 000473 485 DTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPY--AIVYGFFS-G--EIEVIQFDLFERHNSPGASLKVNS 559 (1471)
Q Consensus 485 ~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~--~lv~Gs~D-G--~I~V~~~~~l~~~d~~~~~~d~~s 559 (1471)
+.+.+||..... ...+..|...+.+.. |+|+ .+++.+.+ + .|.+ |+ ..+
T Consensus 184 ~~i~i~d~dg~~-----~~~lt~~~~~v~~p~------wSPDG~~la~~s~~~~~~~i~i--~d-------------l~t 237 (429)
T PRK01742 184 YEVRVADYDGFN-----QFIVNRSSQPLMSPA------WSPDGSKLAYVSFENKKSQLVV--HD-------------LRS 237 (429)
T ss_pred EEEEEECCCCCC-----ceEeccCCCccccce------EcCCCCEEEEEEecCCCcEEEE--Ee-------------CCC
Confidence 456667654321 233445566666666 5555 67776543 3 3555 44 333
Q ss_pred cc--eEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEE-ECCCcEEEE--ECCCCceEEEEeccCCCEEEEEECCCCC
Q 000473 560 HV--SRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSG-SMDCSIRIW--DLGSGNLITVMHHHVAPVRQIILSPPQT 634 (1471)
Q Consensus 560 ~~--~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SG-s~DgtI~lW--Dl~tg~~l~~~~~H~~~V~~l~fspd~~ 634 (1471)
++ .+..+.+|. ..++|+|| ++.|+.+ +.|+.+.+| |+.+++ ...+..+...+....|+|+
T Consensus 238 g~~~~l~~~~g~~---~~~~wSPD---------G~~La~~~~~~g~~~Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpD-- 302 (429)
T PRK01742 238 GARKVVASFRGHN---GAPAFSPD---------GSRLAFASSKDGVLNIYVMGANGGT-PSQLTSGAGNNTEPSWSPD-- 302 (429)
T ss_pred CceEEEecCCCcc---CceeECCC---------CCEEEEEEecCCcEEEEEEECCCCC-eEeeccCCCCcCCEEECCC--
Confidence 32 233344543 46899998 7766655 578876655 666665 4556677778889999999
Q ss_pred CCCCCCEEEEEe-CCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 635 EHPWSDCFLSVG-EDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 635 ~~~~~~~l~S~s-~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
|+.++.++ .++...||+++.. .....+ ++.. ....|+|||++|+..+.+ + +.+||+.+|+..
T Consensus 303 ----G~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~~--~~~~~SpDG~~ia~~~~~--------~-i~~~Dl~~g~~~ 366 (429)
T PRK01742 303 ----GQSILFTSDRSGSPQVYRMSASGGGASLV-GGRG--YSAQISADGKTLVMINGD--------N-VVKQDLTSGSTE 366 (429)
T ss_pred ----CCEEEEEECCCCCceEEEEECCCCCeEEe-cCCC--CCccCCCCCCEEEEEcCC--------C-EEEEECCCCCeE
Confidence 88766555 5788899987532 222333 3443 457899999999887765 4 666999998765
Q ss_pred E
Q 000473 713 R 713 (1471)
Q Consensus 713 ~ 713 (1471)
.
T Consensus 367 ~ 367 (429)
T PRK01742 367 V 367 (429)
T ss_pred E
Confidence 3
No 224
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=98.34 E-value=5.3e-06 Score=98.93 Aligned_cols=160 Identities=18% Similarity=0.241 Sum_probs=119.1
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec------cCC-----CEE
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH------HVA-----PVR 625 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~------H~~-----~V~ 625 (1471)
++.|..+..|.--.+.++++..++. ..+|+.|+.||.|-.||.++...+.++.. |.+ .|+
T Consensus 162 LEqGrfL~P~~~~~~~lN~v~in~~---------hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~~~~~svT 232 (703)
T KOG2321|consen 162 LEQGRFLNPFETDSGELNVVSINEE---------HGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGGDAAPSVT 232 (703)
T ss_pred ccccccccccccccccceeeeecCc---------cceEEecccCceEEEecchhhhhheeeecccccCCCccccccCcce
Confidence 3445666666666789999999987 68999999999999999998776655542 333 499
Q ss_pred EEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe-cCCCCCcEEEEEcCC--CCEEEEEEcCCCCCCCCCCEEE
Q 000473 626 QIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF-PGHPNYPAKVVWDCP--RGYIACLCRDHSRTSDAVDVLF 702 (1471)
Q Consensus 626 ~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~-~gh~~~V~~v~~spd--g~~L~sgs~D~sg~~D~~gtV~ 702 (1471)
++.|+-+ |-.++.|..+|.|.|||+++.+++..- ++..-+|..+.|.+. ++.+++... ..++
T Consensus 233 al~F~d~------gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~~~~q~~v~S~Dk---------~~~k 297 (703)
T KOG2321|consen 233 ALKFRDD------GLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQDTDQQNKVVSMDK---------RILK 297 (703)
T ss_pred EEEecCC------ceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccccCCCceEEecch---------HHhh
Confidence 9999988 889999999999999999998886543 333458899999776 445554332 6899
Q ss_pred EEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccc
Q 000473 703 IWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSV 747 (1471)
Q Consensus 703 VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~ 747 (1471)
|||-.+|+....+..... +..+| ....+|.++.++...
T Consensus 298 iWd~~~Gk~~asiEpt~~---lND~C----~~p~sGm~f~Ane~~ 335 (703)
T KOG2321|consen 298 IWDECTGKPMASIEPTSD---LNDFC----FVPGSGMFFTANESS 335 (703)
T ss_pred hcccccCCceeeccccCC---cCcee----eecCCceEEEecCCC
Confidence 999999998887775543 33455 223466666655543
No 225
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=98.34 E-value=1e-05 Score=91.71 Aligned_cols=194 Identities=17% Similarity=0.122 Sum_probs=135.6
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+...+.+.++++|-..+....-..+ ...-+..+++..+.++.. +|++|..+|++.-+... -|.
T Consensus 39 v~~~s~drtvrv~lkrds~q~wpsI--~~~mP~~~~~~~y~~e~~----~L~vg~~ngtvtefs~s-----------edf 101 (404)
T KOG1409|consen 39 VISVSEDRTVRVWLKRDSGQYWPSI--YHYMPSPCSAMEYVSESR----RLYVGQDNGTVTEFALS-----------EDF 101 (404)
T ss_pred eEEccccceeeeEEeccccccCchh--hhhCCCCceEeeeeccce----EEEEEEecceEEEEEhh-----------hhh
Confidence 3445667788888766543211111 122345677777666665 89999999998874332 122
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCC---------------------------
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS--------------------------- 610 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t--------------------------- 610 (1471)
+.....+....|..+|..+.|+-. .+++++.+.|..+..--.+.
T Consensus 102 nkm~~~r~~~~h~~~v~~~if~~~---------~e~V~s~~~dk~~~~hc~e~~~~lg~Y~~~~~~t~~~~d~~~~fvGd 172 (404)
T KOG1409|consen 102 NKMTFLKDYLAHQARVSAIVFSLT---------HEWVLSTGKDKQFAWHCTESGNRLGGYNFETPASALQFDALYAFVGD 172 (404)
T ss_pred hhcchhhhhhhhhcceeeEEecCC---------ceeEEEeccccceEEEeeccCCcccceEeeccCCCCceeeEEEEecc
Confidence 223344555678888888877654 46666666665443222221
Q ss_pred --------------CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE-EEEecCCCCCcEEE
Q 000473 611 --------------GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV-ERMFPGHPNYPAKV 675 (1471)
Q Consensus 611 --------------g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~-l~~~~gh~~~V~~v 675 (1471)
-.++.++.+|.+++.++.|.|. ...+.||..|..|.+||+-.++- ...+.+|...|..+
T Consensus 173 ~~gqvt~lr~~~~~~~~i~~~~~h~~~~~~l~Wd~~------~~~LfSg~~d~~vi~wdigg~~g~~~el~gh~~kV~~l 246 (404)
T KOG1409|consen 173 HSGQITMLKLEQNGCQLITTFNGHTGEVTCLKWDPG------QRLLFSGASDHSVIMWDIGGRKGTAYELQGHNDKVQAL 246 (404)
T ss_pred cccceEEEEEeecCCceEEEEcCcccceEEEEEcCC------CcEEEeccccCceEEEeccCCcceeeeeccchhhhhhh
Confidence 1245678899999999999998 78999999999999999975543 36778999999999
Q ss_pred EEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 676 VWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 676 ~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
..-+.-+.|++++.| |.|-+||++....
T Consensus 247 ~~~~~t~~l~S~~ed--------g~i~~w~mn~~r~ 274 (404)
T KOG1409|consen 247 SYAQHTRQLISCGED--------GGIVVWNMNVKRV 274 (404)
T ss_pred hhhhhheeeeeccCC--------CeEEEEeccceee
Confidence 888888899998888 9999999976543
No 226
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.28 E-value=8.3e-05 Score=88.78 Aligned_cols=115 Identities=15% Similarity=0.154 Sum_probs=80.2
Q ss_pred cEEEEEEecCCCCcccCcCCCEE-EEEECCCcEEEEECCCCceEE-------EEeccCCCEEEEEECCCCCCCCCCCEEE
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVL-VSGSMDCSIRIWDLGSGNLIT-------VMHHHVAPVRQIILSPPQTEHPWSDCFL 643 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L-~SGs~DgtI~lWDl~tg~~l~-------~~~~H~~~V~~l~fspd~~~~~~~~~l~ 643 (1471)
...+++++|+ ++++ ++...++.|.+||+.+...+. .... ......++|+|+ +++++
T Consensus 127 ~~~~~~~~p~---------g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~-g~~p~~~~~~pd------g~~ly 190 (330)
T PRK11028 127 GCHSANIDPD---------NRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVE-GAGPRHMVFHPN------QQYAY 190 (330)
T ss_pred cccEeEeCCC---------CCEEEEeeCCCCEEEEEEECCCCcccccCCCceecCC-CCCCceEEECCC------CCEEE
Confidence 4567889997 5655 566677999999998633221 1111 234578999999 88887
Q ss_pred EEeC-CCcEEEEECCC--C--cEEEEecCC------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 644 SVGE-DFSVALASLET--L--RVERMFPGH------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 644 S~s~-DgsV~lWdl~t--~--~~l~~~~gh------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
++.. +++|.+|+++. + +.+..+..+ ......+.++|++++|++++.. +++|.+|++.+.
T Consensus 191 v~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~-------~~~I~v~~i~~~ 260 (330)
T PRK11028 191 CVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRT-------ASLISVFSVSED 260 (330)
T ss_pred EEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCC-------CCeEEEEEEeCC
Confidence 7776 99999999973 3 334444322 1234468999999999998663 279999999653
No 227
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=98.25 E-value=0.00012 Score=90.10 Aligned_cols=109 Identities=17% Similarity=0.233 Sum_probs=87.0
Q ss_pred CEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC----------CCcEEEEECCCCcE
Q 000473 592 EVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE----------DFSVALASLETLRV 661 (1471)
Q Consensus 592 ~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~----------DgsV~lWdl~t~~~ 661 (1471)
.+++.|...|+|-+.|+.++.....|..|.+.|.++.|-.. ..|+|.+. -+.+.+-|+++|..
T Consensus 438 pLvAvGT~sGTV~vvdvst~~v~~~fsvht~~VkgleW~g~-------sslvSfsys~~n~~sg~vrN~l~vtdLrtGls 510 (1062)
T KOG1912|consen 438 PLVAVGTNSGTVDVVDVSTNAVAASFSVHTSLVKGLEWLGN-------SSLVSFSYSHVNSASGGVRNDLVVTDLRTGLS 510 (1062)
T ss_pred eeEEeecCCceEEEEEecchhhhhhhcccccceeeeeeccc-------eeEEEeeeccccccccceeeeEEEEEcccccc
Confidence 57889999999999999999999999999999999999765 23444332 34567889999865
Q ss_pred E--EEecC-CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 662 E--RMFPG-HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 662 l--~~~~g-h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
. +.+++ ...+|..+..+..++||+..-.| .-+.+||+++-++++.+
T Consensus 511 k~fR~l~~~despI~~irvS~~~~yLai~Fr~--------~plEiwd~kt~~~lr~m 559 (1062)
T KOG1912|consen 511 KRFRGLQKPDESPIRAIRVSSSGRYLAILFRR--------EPLEIWDLKTLRMLRLM 559 (1062)
T ss_pred cccccCCCCCcCcceeeeecccCceEEEEecc--------cchHHHhhccchHHHHH
Confidence 3 32333 35689999999999999999999 89999999887665443
No 228
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=98.25 E-value=0.00011 Score=88.16 Aligned_cols=155 Identities=13% Similarity=0.102 Sum_probs=107.9
Q ss_pred cCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCc-----cEEEEEEecCC
Q 000473 508 KEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTG-----AVLCLAAHRMV 582 (1471)
Q Consensus 508 h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~-----~V~~la~spd~ 582 (1471)
....++++.+.+... .+++|+.+|.+.. ||..... .+.+.........|.+ .|+++.|..+
T Consensus 174 ~~~~lN~v~in~~hg----Lla~Gt~~g~VEf--wDpR~ks-------rv~~l~~~~~v~s~pg~~~~~svTal~F~d~- 239 (703)
T KOG2321|consen 174 DSGELNVVSINEEHG----LLACGTEDGVVEF--WDPRDKS-------RVGTLDAASSVNSHPGGDAAPSVTALKFRDD- 239 (703)
T ss_pred ccccceeeeecCccc----eEEecccCceEEE--ecchhhh-------hheeeecccccCCCccccccCcceEEEecCC-
Confidence 345677776666666 7999999999999 5421100 0000001111223333 4999999865
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe-ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH-HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV 661 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~-~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~ 661 (1471)
+-.++.|..+|.|.++|+++.+++..-. +..-+|..+.|.+.. .++.++|. ....++|||-.+|+.
T Consensus 240 --------gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~~~----~q~~v~S~-Dk~~~kiWd~~~Gk~ 306 (703)
T KOG2321|consen 240 --------GLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQDTD----QQNKVVSM-DKRILKIWDECTGKP 306 (703)
T ss_pred --------ceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccccC----CCceEEec-chHHhhhcccccCCc
Confidence 7899999999999999999988775432 234578888887751 13355554 457889999999999
Q ss_pred EEEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 662 ERMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
...+.. ...++.+++-|++.+++++-.+
T Consensus 307 ~asiEp-t~~lND~C~~p~sGm~f~Ane~ 334 (703)
T KOG2321|consen 307 MASIEP-TSDLNDFCFVPGSGMFFTANES 334 (703)
T ss_pred eeeccc-cCCcCceeeecCCceEEEecCC
Confidence 877763 4459999999999999988776
No 229
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=98.24 E-value=2.3e-06 Score=100.39 Aligned_cols=232 Identities=18% Similarity=0.095 Sum_probs=146.8
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+++++.+..+-+||.....+. .....+|...|....+++...- ..++..+.||.+++.... ..+
T Consensus 157 l~SgSDD~~vv~WdW~~~~~~---l~f~SGH~~NvfQaKFiP~s~d--~ti~~~s~dgqvr~s~i~--~t~--------- 220 (559)
T KOG1334|consen 157 LASGSDDLQVVVWDWVSGSPK---LSFESGHCNNVFQAKFIPFSGD--RTIVTSSRDGQVRVSEIL--ETG--------- 220 (559)
T ss_pred eeccCccceEEeehhhccCcc---cccccccccchhhhhccCCCCC--cCceeccccCceeeeeec--ccc---------
Confidence 556667788889998775442 2223677777766553332210 168889999999984432 110
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe---ccCC---CEEEEEECC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH---HHVA---PVRQIILSP 631 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~---~H~~---~V~~l~fsp 631 (1471)
--+....+..|.+.|.-++.-|+ ....|.|++.|+.+.-.|++++.+...+. .+.. ....++..|
T Consensus 221 -~~e~t~rl~~h~g~vhklav~p~--------sp~~f~S~geD~~v~~~Dlr~~~pa~~~~cr~~~~~~~v~L~~Ia~~P 291 (559)
T KOG1334|consen 221 -YVENTKRLAPHEGPVHKLAVEPD--------SPKPFLSCGEDAVVFHIDLRQDVPAEKFVCREADEKERVGLYTIAVDP 291 (559)
T ss_pred -ceecceecccccCccceeeecCC--------CCCcccccccccceeeeeeccCCccceeeeeccCCccceeeeeEecCC
Confidence 01223456789999999999997 26789999999999999998765433322 2222 345666666
Q ss_pred CCCCCCCCCEEEEEeCCCcEEEEECCCC----------------------------------------------------
Q 000473 632 PQTEHPWSDCFLSVGEDFSVALASLETL---------------------------------------------------- 659 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~DgsV~lWdl~t~---------------------------------------------------- 659 (1471)
.+ .+.|++++.|-.+++||.+.-
T Consensus 292 ~n-----t~~faVgG~dqf~RvYD~R~~~~e~~n~~~~~f~p~hl~~d~~v~ITgl~Ysh~~sElLaSYnDe~IYLF~~~ 366 (559)
T KOG1334|consen 292 RN-----TNEFAVGGSDQFARVYDQRRIDKEENNGVLDKFCPHHLVEDDPVNITGLVYSHDGSELLASYNDEDIYLFNKS 366 (559)
T ss_pred CC-----ccccccCChhhhhhhhcccchhhccccchhhhcCCccccccCcccceeEEecCCccceeeeecccceEEeccc
Confidence 53 345666666666666655321
Q ss_pred --------------cEEE-EecCCCC--CcEEEEE-cCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCC
Q 000473 660 --------------RVER-MFPGHPN--YPAKVVW-DCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASH 721 (1471)
Q Consensus 660 --------------~~l~-~~~gh~~--~V~~v~~-spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~ 721 (1471)
..+. .+.||.. .|..|-| -|...|+++|+.= |.|.||+-.+++.++.+.|...-
T Consensus 367 ~~~G~~p~~~s~~~~~~k~vYKGHrN~~TVKgVNFfGPrsEyVvSGSDC--------GhIFiW~K~t~eii~~MegDr~V 438 (559)
T KOG1334|consen 367 MGDGSEPDPSSPREQYVKRVYKGHRNSRTVKGVNFFGPRSEYVVSGSDC--------GHIFIWDKKTGEIIRFMEGDRHV 438 (559)
T ss_pred cccCCCCCCCcchhhccchhhcccccccccceeeeccCccceEEecCcc--------ceEEEEecchhHHHHHhhcccce
Confidence 1111 1445543 2444443 5566677766654 99999999999999999998875
Q ss_pred ceeeeeeeccccccccceEEcCCccccccceeeccCCceEeec
Q 000473 722 SMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQ 764 (1471)
Q Consensus 722 v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~ 764 (1471)
|-++.-.|.+.. .+++.+ |..||+|.
T Consensus 439 VNCLEpHP~~Pv------------LAsSGi-----d~DVKIWT 464 (559)
T KOG1334|consen 439 VNCLEPHPHLPV------------LASSGI-----DHDVKIWT 464 (559)
T ss_pred EeccCCCCCCch------------hhccCC-----ccceeeec
Confidence 556555543322 223333 88899996
No 230
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.23 E-value=6.8e-05 Score=92.96 Aligned_cols=168 Identities=12% Similarity=0.067 Sum_probs=108.1
Q ss_pred ccCC--EEEEE-EcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC
Q 000473 523 YAPY--AIVYG-FFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM 599 (1471)
Q Consensus 523 f~P~--~lv~G-s~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~ 599 (1471)
|+|+ .++.. ..+|...|+.|+ .++++.. .+..+...+....|+|| ++.|+..+.
T Consensus 250 ~SPDG~~La~~~~~~g~~~I~~~d-------------~~tg~~~-~lt~~~~~~~~~~wSPD---------G~~I~f~s~ 306 (429)
T PRK03629 250 FSPDGSKLAFALSKTGSLNLYVMD-------------LASGQIR-QVTDGRSNNTEPTWFPD---------SQNLAYTSD 306 (429)
T ss_pred ECCCCCEEEEEEcCCCCcEEEEEE-------------CCCCCEE-EccCCCCCcCceEECCC---------CCEEEEEeC
Confidence 6665 56554 456665554454 3344443 34444556788999998 777766554
Q ss_pred -CCcEEE--EECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC---CcEEEEECCCCcEEEEecCCCCCcE
Q 000473 600 -DCSIRI--WDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED---FSVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 600 -DgtI~l--WDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D---gsV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
++...+ +|+.+++. ..+..+........|+|+ |+.++..+.+ ..|.+||+.+++....... ....
T Consensus 307 ~~g~~~Iy~~d~~~g~~-~~lt~~~~~~~~~~~SpD------G~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~--~~~~ 377 (429)
T PRK03629 307 QAGRPQVYKVNINGGAP-QRITWEGSQNQDADVSSD------GKFMVMVSSNGGQQHIAKQDLATGGVQVLTDT--FLDE 377 (429)
T ss_pred CCCCceEEEEECCCCCe-EEeecCCCCccCEEECCC------CCEEEEEEccCCCceEEEEECCCCCeEEeCCC--CCCC
Confidence 444444 57766654 344445555677889999 8888776654 4588999998875433322 2244
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
...|+|||++|+.++.+ + +...+++|++ +|...+.+.+|.+.+....|.
T Consensus 378 ~p~~SpDG~~i~~~s~~--~---~~~~l~~~~~-~G~~~~~l~~~~~~~~~p~Ws 426 (429)
T PRK03629 378 TPSIAPNGTMVIYSSSQ--G---MGSVLNLVST-DGRFKARLPATDGQVKFPAWS 426 (429)
T ss_pred CceECCCCCEEEEEEcC--C---CceEEEEEEC-CCCCeEECccCCCCcCCcccC
Confidence 67899999999999887 1 1145888888 566677788877655444343
No 231
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=98.19 E-value=2e-05 Score=87.84 Aligned_cols=148 Identities=14% Similarity=0.039 Sum_probs=108.9
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++++-.+|.+.+..... .. -+..+..++|.-..+...|+.. ..+++.+||.|+.+..|
T Consensus 135 ~i~vs~s~G~~~~v~~t~----------~~---le~vq~wk~He~E~Wta~f~~~--------~pnlvytGgDD~~l~~~ 193 (339)
T KOG0280|consen 135 KIFVSDSRGSISGVYETE----------MV---LEKVQTWKVHEFEAWTAKFSDK--------EPNLVYTGGDDGSLSCW 193 (339)
T ss_pred eEEEEcCCCcEEEEecce----------ee---eeecccccccceeeeeeecccC--------CCceEEecCCCceEEEE
Confidence 577777888888644331 01 1345678899999999999765 37899999999999999
Q ss_pred ECC-CCceEEE-EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC-CcEEEEecCCCCCcEEEEEcCCCC-
Q 000473 607 DLG-SGNLITV-MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET-LRVERMFPGHPNYPAKVVWDCPRG- 682 (1471)
Q Consensus 607 Dl~-tg~~l~~-~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t-~~~l~~~~gh~~~V~~v~~spdg~- 682 (1471)
|++ .++.+.. -+.|...|.++.-+|.. +.++++|+.|-.|++||.+. ++++..-.- .+.|+.++++|.-.
T Consensus 194 D~R~p~~~i~~n~kvH~~GV~SI~ss~~~-----~~~I~TGsYDe~i~~~DtRnm~kPl~~~~v-~GGVWRi~~~p~~~~ 267 (339)
T KOG0280|consen 194 DIRIPKTFIWHNSKVHTSGVVSIYSSPPK-----PTYIATGSYDECIRVLDTRNMGKPLFKAKV-GGGVWRIKHHPEIFH 267 (339)
T ss_pred EecCCcceeeecceeeecceEEEecCCCC-----CceEEEeccccceeeeehhcccCccccCcc-ccceEEEEecchhhh
Confidence 999 4555543 56789999999988763 77999999999999999984 566654332 36799999999643
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
.++.+|.- .-.+|-++..+
T Consensus 268 ~lL~~CMh--------~G~ki~~~~~~ 286 (339)
T KOG0280|consen 268 RLLAACMH--------NGAKILDSSDK 286 (339)
T ss_pred HHHHHHHh--------cCceEEEeccc
Confidence 34445554 34667666554
No 232
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.15 E-value=7.3e-05 Score=86.39 Aligned_cols=190 Identities=16% Similarity=0.121 Sum_probs=135.5
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC--CcEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD--CSIR 604 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D--gtI~ 604 (1471)
.|+++-.+|.+.++.-. ..|..+. ....+..| ..+..+.-++. ....+++||.. ..+.
T Consensus 117 ~Litc~~sG~l~~~~~k----------~~d~hss-~l~~la~g-~g~~~~r~~~~--------~p~Iva~GGke~~n~lk 176 (412)
T KOG3881|consen 117 TLITCVSSGNLQVRHDK----------SGDLHSS-KLIKLATG-PGLYDVRQTDT--------DPYIVATGGKENINELK 176 (412)
T ss_pred EEEEEecCCcEEEEecc----------CCccccc-cceeeecC-CceeeeccCCC--------CCceEecCchhccccee
Confidence 68888999999994321 0111121 22233333 34555554543 26788889999 8999
Q ss_pred EEECCCCceEEEEeccC---------CC--EEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCc
Q 000473 605 IWDLGSGNLITVMHHHV---------AP--VRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-RVERMFPGHPNYP 672 (1471)
Q Consensus 605 lWDl~tg~~l~~~~~H~---------~~--V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V 672 (1471)
+||++..+.+ |.+.. -| ++.+.|-|.. ....|+++..-+.|++||.+.+ +++..|.--..++
T Consensus 177 iwdle~~~qi--w~aKNvpnD~L~LrVPvW~tdi~Fl~g~----~~~~fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~i 250 (412)
T KOG3881|consen 177 IWDLEQSKQI--WSAKNVPNDRLGLRVPVWITDIRFLEGS----PNYKFATITRYHQVRLYDTRHQRRPVAQFDFLENPI 250 (412)
T ss_pred eeecccceee--eeccCCCCccccceeeeeeccceecCCC----CCceEEEEecceeEEEecCcccCcceeEeccccCcc
Confidence 9999987443 33321 12 4567776651 1468999999999999999865 5788888888899
Q ss_pred EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE-EeCCCCCceeeeeeeccccccccceEEcCCccccccc
Q 000473 673 AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV-LRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLL 751 (1471)
Q Consensus 673 ~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~-l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l 751 (1471)
+++...|.++++++|..- |.+..+|++++.+..+ +.|-++.+-.++..+. +....+..+
T Consensus 251 s~~~l~p~gn~Iy~gn~~--------g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~------------~~~las~GL 310 (412)
T KOG3881|consen 251 SSTGLTPSGNFIYTGNTK--------GQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPT------------HPVLASCGL 310 (412)
T ss_pred eeeeecCCCcEEEEeccc--------chhheecccCceeeccccCCccCCcceEEEcCC------------CceEEeecc
Confidence 999999999999999887 9999999999999877 8888887777765531 224444555
Q ss_pred eeeccCCceEeecccc
Q 000473 752 LPIHEDGTFRQSQIQN 767 (1471)
Q Consensus 752 ~~~~~D~tir~w~l~~ 767 (1471)
|.-+|++++++
T Consensus 311 -----DRyvRIhD~kt 321 (412)
T KOG3881|consen 311 -----DRYVRIHDIKT 321 (412)
T ss_pred -----ceeEEEeeccc
Confidence 99999988754
No 233
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.15 E-value=0.00013 Score=90.60 Aligned_cols=160 Identities=14% Similarity=0.052 Sum_probs=102.1
Q ss_pred ccCC--EEE-EEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE-
Q 000473 523 YAPY--AIV-YGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS- 598 (1471)
Q Consensus 523 f~P~--~lv-~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs- 598 (1471)
|+|+ .++ +.+.+|.-.|+.|+ ..+++ ...+..|.+......|+|| ++.|+..+
T Consensus 255 ~SpDG~~l~~~~s~~g~~~Iy~~d-------------~~~g~-~~~lt~~~~~~~~~~~spD---------G~~l~f~sd 311 (433)
T PRK04922 255 FSPDGRRLALTLSRDGNPEIYVMD-------------LGSRQ-LTRLTNHFGIDTEPTWAPD---------GKSIYFTSD 311 (433)
T ss_pred ECCCCCEEEEEEeCCCCceEEEEE-------------CCCCC-eEECccCCCCccceEECCC---------CCEEEEEEC
Confidence 6665 454 44566764444354 33333 3445566666678899998 67666555
Q ss_pred CCCc--EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC---cEEEEECCCCcEEEEecCCCCCcE
Q 000473 599 MDCS--IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF---SVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 599 ~Dgt--I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg---sV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
.++. |.++|+.+++.. .+..+........|+|+ |+.++..+.++ .|.+||+.+++..... +.....
T Consensus 312 ~~g~~~iy~~dl~~g~~~-~lt~~g~~~~~~~~SpD------G~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt--~~~~~~ 382 (433)
T PRK04922 312 RGGRPQIYRVAASGGSAE-RLTFQGNYNARASVSPD------GKKIAMVHGSGGQYRIAVMDLSTGSVRTLT--PGSLDE 382 (433)
T ss_pred CCCCceEEEEECCCCCeE-EeecCCCCccCEEECCC------CCEEEEEECCCCceeEEEEECCCCCeEECC--CCCCCC
Confidence 4554 666777777643 33333444557899999 88887765433 6999999888765333 222455
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCC
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTAS 720 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~ 720 (1471)
...|+|||++|+..+.+ .+.+.|++++.. |...+.+..+.+
T Consensus 383 ~p~~spdG~~i~~~s~~-----~g~~~L~~~~~~-g~~~~~l~~~~g 423 (433)
T PRK04922 383 SPSFAPNGSMVLYATRE-----GGRGVLAAVSTD-GRVRQRLVSADG 423 (433)
T ss_pred CceECCCCCEEEEEEec-----CCceEEEEEECC-CCceEEcccCCC
Confidence 67999999999887765 122679999985 445566654443
No 234
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.14 E-value=0.00011 Score=87.73 Aligned_cols=116 Identities=13% Similarity=0.203 Sum_probs=79.7
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEEC-CCcEEEEECCC--Cc--eEEEEeccC------CCEEEEEECCCCCCCCCCCE
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSM-DCSIRIWDLGS--GN--LITVMHHHV------APVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~-DgtI~lWDl~t--g~--~l~~~~~H~------~~V~~l~fspd~~~~~~~~~ 641 (1471)
...++|+|+ ++++++... +++|.+||+.. ++ .+..+..+. .....+.++|+ +++
T Consensus 177 p~~~~~~pd---------g~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pd------g~~ 241 (330)
T PRK11028 177 PRHMVFHPN---------QQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPD------GRH 241 (330)
T ss_pred CceEEECCC---------CCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCC------CCE
Confidence 567899997 788777665 99999999973 33 233333221 12235889998 887
Q ss_pred EEEEe-CCCcEEEEECCCCc----EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC--CCeE
Q 000473 642 FLSVG-EDFSVALASLETLR----VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK--TGAR 711 (1471)
Q Consensus 642 l~S~s-~DgsV~lWdl~t~~----~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~--tg~~ 711 (1471)
++++. .+++|.+|+++... .+..... ...+..+.|+|+|++|++++.. +++|.+|++. +|.+
T Consensus 242 lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~-~~~p~~~~~~~dg~~l~va~~~-------~~~v~v~~~~~~~g~l 310 (330)
T PRK11028 242 LYACDRTASLISVFSVSEDGSVLSFEGHQPT-ETQPRGFNIDHSGKYLIAAGQK-------SHHISVYEIDGETGLL 310 (330)
T ss_pred EEEecCCCCeEEEEEEeCCCCeEEEeEEEec-cccCCceEECCCCCEEEEEEcc-------CCcEEEEEEcCCCCcE
Confidence 77775 47899999996432 2222222 1245689999999999998863 2899999874 5554
No 235
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=98.14 E-value=0.00015 Score=81.93 Aligned_cols=102 Identities=18% Similarity=0.190 Sum_probs=74.4
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECC--CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMD--CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF 649 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~D--gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg 649 (1471)
.+.-++|++| ..+++|-... ..+-+||++..++...+ .+..+|....|.|. ...++.+....
T Consensus 320 g~g~lafs~D---------s~y~aTrnd~~PnalW~Wdlq~l~l~avL-iQk~piraf~WdP~------~prL~vctg~s 383 (447)
T KOG4497|consen 320 GAGKLAFSCD---------STYAATRNDKYPNALWLWDLQNLKLHAVL-IQKHPIRAFEWDPG------RPRLVVCTGKS 383 (447)
T ss_pred ccceeeecCC---------ceEEeeecCCCCceEEEEechhhhhhhhh-hhccceeEEEeCCC------CceEEEEcCCc
Confidence 4677889987 7888887433 47889999876654433 46678999999997 34444444445
Q ss_pred cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 650 SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 650 sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
.+.+|......++. .++....|.+++|.-+|.+++-.+.|
T Consensus 384 rLY~W~psg~~~V~-vP~~GF~i~~l~W~~~g~~i~l~~kD 423 (447)
T KOG4497|consen 384 RLYFWAPSGPRVVG-VPKKGFNIQKLQWLQPGEFIVLCGKD 423 (447)
T ss_pred eEEEEcCCCceEEe-cCCCCceeeeEEecCCCcEEEEEcCC
Confidence 58899887655543 35555789999999999999888877
No 236
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=98.12 E-value=1.7e-05 Score=90.07 Aligned_cols=101 Identities=27% Similarity=0.445 Sum_probs=81.2
Q ss_pred EEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC
Q 000473 530 YGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG 609 (1471)
Q Consensus 530 ~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~ 609 (1471)
.|-..|.|.+.+.. .....++.++.+|.+.++|+.|.+. ..+|.||..|..|.+||+.
T Consensus 170 vGd~~gqvt~lr~~-------------~~~~~~i~~~~~h~~~~~~l~Wd~~---------~~~LfSg~~d~~vi~wdig 227 (404)
T KOG1409|consen 170 VGDHSGQITMLKLE-------------QNGCQLITTFNGHTGEVTCLKWDPG---------QRLLFSGASDHSVIMWDIG 227 (404)
T ss_pred ecccccceEEEEEe-------------ecCCceEEEEcCcccceEEEEEcCC---------CcEEEeccccCceEEEecc
Confidence 45556677764432 2234678889999999999999986 7899999999999999997
Q ss_pred CCc-eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC
Q 000473 610 SGN-LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET 658 (1471)
Q Consensus 610 tg~-~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t 658 (1471)
-.+ ....+.+|...|..+..-+. -+.+.+++.|+.|.+|+++.
T Consensus 228 g~~g~~~el~gh~~kV~~l~~~~~------t~~l~S~~edg~i~~w~mn~ 271 (404)
T KOG1409|consen 228 GRKGTAYELQGHNDKVQALSYAQH------TRQLISCGEDGGIVVWNMNV 271 (404)
T ss_pred CCcceeeeeccchhhhhhhhhhhh------heeeeeccCCCeEEEEeccc
Confidence 544 34677889999999887776 67899999999999999863
No 237
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.12 E-value=0.00016 Score=89.92 Aligned_cols=131 Identities=11% Similarity=0.109 Sum_probs=97.1
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC---CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM---DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHP 637 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~---DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~ 637 (1471)
...+.+..|...+.+..|+|| ++.|+..+. +..|.+||+.+|+. ..+..+.+.+....|+|+
T Consensus 192 ~~~~~lt~~~~~v~~p~wSpD---------G~~lay~s~~~g~~~i~~~dl~~g~~-~~l~~~~g~~~~~~~SPD----- 256 (435)
T PRK05137 192 ANVRYLTDGSSLVLTPRFSPN---------RQEITYMSYANGRPRVYLLDLETGQR-ELVGNFPGMTFAPRFSPD----- 256 (435)
T ss_pred CCcEEEecCCCCeEeeEECCC---------CCEEEEEEecCCCCEEEEEECCCCcE-EEeecCCCcccCcEECCC-----
Confidence 344567788889999999998 777776653 46899999998875 345566777888999999
Q ss_pred CCCEEE-EEeCCCc--EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 638 WSDCFL-SVGEDFS--VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 638 ~~~~l~-S~s~Dgs--V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
|+.++ +.+.|+. |.+||+.+++.. .+..+........|+|||++|+..+.. ++...|++||+.+++..+
T Consensus 257 -G~~la~~~~~~g~~~Iy~~d~~~~~~~-~Lt~~~~~~~~~~~spDG~~i~f~s~~-----~g~~~Iy~~d~~g~~~~~ 328 (435)
T PRK05137 257 -GRKVVMSLSQGGNTDIYTMDLRSGTTT-RLTDSPAIDTSPSYSPDGSQIVFESDR-----SGSPQLYVMNADGSNPRR 328 (435)
T ss_pred -CCEEEEEEecCCCceEEEEECCCCceE-EccCCCCccCceeEcCCCCEEEEEECC-----CCCCeEEEEECCCCCeEE
Confidence 77654 6666665 677798887764 455566667789999999999887643 122579999988776543
No 238
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.11 E-value=0.00016 Score=81.69 Aligned_cols=100 Identities=20% Similarity=0.380 Sum_probs=81.9
Q ss_pred EEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCc-EEEEECC
Q 000473 531 GFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCS-IRIWDLG 609 (1471)
Q Consensus 531 Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dgt-I~lWDl~ 609 (1471)
|..-|.|+|..+.. +.. + +-..+..|...|.|++...+ |.+++|+|..|| ||+||..
T Consensus 155 g~k~GqvQi~dL~~-----------~~~-~-~p~~I~AH~s~Iacv~Ln~~---------Gt~vATaStkGTLIRIFdt~ 212 (346)
T KOG2111|consen 155 GFKTGQVQIVDLAS-----------TKP-N-APSIINAHDSDIACVALNLQ---------GTLVATASTKGTLIRIFDTE 212 (346)
T ss_pred CCccceEEEEEhhh-----------cCc-C-CceEEEcccCceeEEEEcCC---------ccEEEEeccCcEEEEEEEcC
Confidence 55678999965541 111 1 23567899999999999886 899999999996 6799999
Q ss_pred CCceEEEEec--cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC
Q 000473 610 SGNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET 658 (1471)
Q Consensus 610 tg~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t 658 (1471)
+|+++..++- ....|.+++|+|+ ..++|..|+-|++.++.++.
T Consensus 213 ~g~~l~E~RRG~d~A~iy~iaFSp~------~s~LavsSdKgTlHiF~l~~ 257 (346)
T KOG2111|consen 213 DGTLLQELRRGVDRADIYCIAFSPN------SSWLAVSSDKGTLHIFSLRD 257 (346)
T ss_pred CCcEeeeeecCCchheEEEEEeCCC------ccEEEEEcCCCeEEEEEeec
Confidence 9999988864 3357999999999 88999999999999999875
No 239
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.11 E-value=0.00014 Score=90.33 Aligned_cols=162 Identities=10% Similarity=0.043 Sum_probs=104.3
Q ss_pred ccCC--EEE-EEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE-
Q 000473 523 YAPY--AIV-YGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS- 598 (1471)
Q Consensus 523 f~P~--~lv-~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs- 598 (1471)
|+|+ .++ +...+|...|+.++ ..++. .+.+..|.+.+....|+|| ++.|+..+
T Consensus 247 ~SPDG~~la~~~~~~g~~~Iy~~d-------------~~~~~-~~~lt~~~~~~~~~~wSpD---------G~~l~f~s~ 303 (427)
T PRK02889 247 WSPDGRTLAVALSRDGNSQIYTVN-------------ADGSG-LRRLTQSSGIDTEPFFSPD---------GRSIYFTSD 303 (427)
T ss_pred ECCCCCEEEEEEccCCCceEEEEE-------------CCCCC-cEECCCCCCCCcCeEEcCC---------CCEEEEEec
Confidence 6665 565 45678888886555 22222 3445556666677889998 67666444
Q ss_pred CCCcEEEEEC--CCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC---cEEEEECCCCcEEEEecCCCCCcE
Q 000473 599 MDCSIRIWDL--GSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF---SVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 599 ~DgtI~lWDl--~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg---sV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
.++...+|.+ .+++. ..+..+........|+|+ |+.++..+.++ .|.+||+.+++......+ ....
T Consensus 304 ~~g~~~Iy~~~~~~g~~-~~lt~~g~~~~~~~~SpD------G~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~--~~~~ 374 (427)
T PRK02889 304 RGGAPQIYRMPASGGAA-QRVTFTGSYNTSPRISPD------GKLLAYISRVGGAFKLYVQDLATGQVTALTDT--TRDE 374 (427)
T ss_pred CCCCcEEEEEECCCCce-EEEecCCCCcCceEECCC------CCEEEEEEccCCcEEEEEEECCCCCeEEccCC--CCcc
Confidence 4566677755 44443 222223334456789999 89888777654 699999998876544332 2346
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCc
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHS 722 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v 722 (1471)
...|+|||++|+.++.+ .+...+++-+. +|...+.+..+.+.+
T Consensus 375 ~p~~spdg~~l~~~~~~-----~g~~~l~~~~~-~g~~~~~l~~~~g~~ 417 (427)
T PRK02889 375 SPSFAPNGRYILYATQQ-----GGRSVLAAVSS-DGRIKQRLSVQGGDV 417 (427)
T ss_pred CceECCCCCEEEEEEec-----CCCEEEEEEEC-CCCceEEeecCCCCC
Confidence 78999999999988876 11245777777 566666666555433
No 240
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=98.11 E-value=3.5e-06 Score=110.78 Aligned_cols=180 Identities=14% Similarity=0.120 Sum_probs=130.7
Q ss_pred eecccccCccccccccCCCCCCCccccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCcccc
Q 000473 478 SDLTFCQDTVPRSEHVDSRQAGDGRDDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKV 557 (1471)
Q Consensus 478 l~~s~~~~~v~~Wd~~~~~~~g~~~~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~ 557 (1471)
+.+.+.++.++.|......+.- .....+. .+|+.+.+..+.. ....+-.||.+.+ |+ .
T Consensus 2223 Yltgs~dgsv~~~~w~~~~~v~--~~rt~g~-s~vtr~~f~~qGn----k~~i~d~dg~l~l--~q-------------~ 2280 (2439)
T KOG1064|consen 2223 YLTGSQDGSVRMFEWGHGQQVV--CFRTAGN-SRVTRSRFNHQGN----KFGIVDGDGDLSL--WQ-------------A 2280 (2439)
T ss_pred EEecCCCceEEEEeccCCCeEE--EeeccCc-chhhhhhhcccCC----ceeeeccCCceee--cc-------------c
Confidence 4566778899999876643311 1111222 5677766444443 6666777888887 54 1
Q ss_pred CCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEE---ECCCcEEEEECCCC---ceEEEEeccCCCEEEEEECC
Q 000473 558 NSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSG---SMDCSIRIWDLGSG---NLITVMHHHVAPVRQIILSP 631 (1471)
Q Consensus 558 ~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SG---s~DgtI~lWDl~tg---~~l~~~~~H~~~V~~l~fsp 631 (1471)
+.++....+.|......+.|- +..++++ +.++.+++||..-. .+++ ..|.+.++++++-|
T Consensus 2281 -~pk~~~s~qchnk~~~Df~Fi-----------~s~~~tag~s~d~~n~~lwDtl~~~~~s~v~--~~H~~gaT~l~~~P 2346 (2439)
T KOG1064|consen 2281 -SPKPYTSWQCHNKALSDFRFI-----------GSLLATAGRSSDNRNVCLWDTLLPPMNSLVH--TCHDGGATVLAYAP 2346 (2439)
T ss_pred -CCcceeccccCCccccceeee-----------ehhhhccccCCCCCcccchhcccCcccceee--eecCCCceEEEEcC
Confidence 135666677888888888875 3556655 36789999997532 2445 78999999999999
Q ss_pred CCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 632 PQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
. .+.|+|||.+|.|+|||++..+.++.++. ++ ...++++|+.. |.++||++..-.+
T Consensus 2347 ~------~qllisggr~G~v~l~D~rqrql~h~~~~---------~~-~~~~f~~~ss~--------g~ikIw~~s~~~l 2402 (2439)
T KOG1064|consen 2347 K------HQLLISGGRKGEVCLFDIRQRQLRHTFQA---------LD-TREYFVTGSSE--------GNIKIWRLSEFGL 2402 (2439)
T ss_pred c------ceEEEecCCcCcEEEeehHHHHHHHHhhh---------hh-hhheeeccCcc--------cceEEEEccccch
Confidence 8 88999999999999999998887776653 55 67899998887 9999999988877
Q ss_pred EEEEeC
Q 000473 712 ERVLRG 717 (1471)
Q Consensus 712 ~~~l~g 717 (1471)
++++.+
T Consensus 2403 l~~~p~ 2408 (2439)
T KOG1064|consen 2403 LHTFPS 2408 (2439)
T ss_pred hhcCch
Confidence 777654
No 241
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.09 E-value=0.00015 Score=90.20 Aligned_cols=125 Identities=18% Similarity=0.128 Sum_probs=86.6
Q ss_pred EecCCccEEEEEEecCCCCcccCcCCC-EEEEEECCC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 566 FLGHTGAVLCLAAHRMVGTAKGWSFNE-VLVSGSMDC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 566 l~gH~~~V~~la~spd~~~~~~~~~~~-~L~SGs~Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
+..+.+...+..|+|| ++ ++++.+.++ .|.+||+.+++. ..+..+.+......|+|+ |+.+
T Consensus 243 l~~~~g~~~~~~~SpD---------G~~l~~~~s~~g~~~Iy~~d~~~g~~-~~lt~~~~~~~~~~~spD------G~~l 306 (433)
T PRK04922 243 VASFRGINGAPSFSPD---------GRRLALTLSRDGNPEIYVMDLGSRQL-TRLTNHFGIDTEPTWAPD------GKSI 306 (433)
T ss_pred eccCCCCccCceECCC---------CCEEEEEEeCCCCceEEEEECCCCCe-EECccCCCCccceEECCC------CCEE
Confidence 3344455567899998 55 445666666 599999998875 445566666678899999 8888
Q ss_pred EEEeC-CCc--EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 643 LSVGE-DFS--VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 643 ~S~s~-Dgs--V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
+.++. ++. |.++|+.+++..+.. .+.......+|+|||++|+..+.+ ++...|++||+.+++..
T Consensus 307 ~f~sd~~g~~~iy~~dl~~g~~~~lt-~~g~~~~~~~~SpDG~~Ia~~~~~-----~~~~~I~v~d~~~g~~~ 373 (433)
T PRK04922 307 YFTSDRGGRPQIYRVAASGGSAERLT-FQGNYNARASVSPDGKKIAMVHGS-----GGQYRIAVMDLSTGSVR 373 (433)
T ss_pred EEEECCCCCceEEEEECCCCCeEEee-cCCCCccCEEECCCCCEEEEEECC-----CCceeEEEEECCCCCeE
Confidence 77664 444 666677777654332 233445578999999999887654 11247999999888754
No 242
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=98.07 E-value=3.5e-05 Score=89.42 Aligned_cols=109 Identities=17% Similarity=0.229 Sum_probs=87.4
Q ss_pred CEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe-cCCCC
Q 000473 592 EVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF-PGHPN 670 (1471)
Q Consensus 592 ~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~-~gh~~ 670 (1471)
......+.+..+.+|....+.+. .+-+|...++.|+++|| ++.++++..|..|++-....--.+..| .||..
T Consensus 123 ~v~dkagD~~~~di~s~~~~~~~-~~lGhvSml~dVavS~D------~~~IitaDRDEkIRvs~ypa~f~IesfclGH~e 195 (390)
T KOG3914|consen 123 LVADKAGDVYSFDILSADSGRCE-PILGHVSMLLDVAVSPD------DQFIITADRDEKIRVSRYPATFVIESFCLGHKE 195 (390)
T ss_pred EEEeecCCceeeeeecccccCcc-hhhhhhhhhheeeecCC------CCEEEEecCCceEEEEecCcccchhhhccccHh
Confidence 34445566778888887765543 45599999999999999 899999999999999888765555544 57999
Q ss_pred CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 671 YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.|..++.-++. .|++|+.| ++|++||+++|++++++.
T Consensus 196 FVS~isl~~~~-~LlS~sGD--------~tlr~Wd~~sgk~L~t~d 232 (390)
T KOG3914|consen 196 FVSTISLTDNY-LLLSGSGD--------KTLRLWDITSGKLLDTCD 232 (390)
T ss_pred heeeeeeccCc-eeeecCCC--------CcEEEEecccCCcccccc
Confidence 99999998764 47777777 999999999999886654
No 243
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.06 E-value=0.00027 Score=87.96 Aligned_cols=166 Identities=14% Similarity=0.082 Sum_probs=109.0
Q ss_pred ccccCccEEEEEeeccccccCCEEEEEEc-CCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 505 FVHKEKIVSSSMVISESFYAPYAIVYGFF-SGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 505 ~~~h~~~Vts~~~is~~~f~P~~lv~Gs~-DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
+..|...+.+..+.++.. .|++.+. +|.-.|+.|+ ..+++. +.+..+.+.+.+.+|+||
T Consensus 197 lt~~~~~v~~p~wSpDG~----~lay~s~~~g~~~i~~~d-------------l~~g~~-~~l~~~~g~~~~~~~SPD-- 256 (435)
T PRK05137 197 LTDGSSLVLTPRFSPNRQ----EITYMSYANGRPRVYLLD-------------LETGQR-ELVGNFPGMTFAPRFSPD-- 256 (435)
T ss_pred EecCCCCeEeeEECCCCC----EEEEEEecCCCCEEEEEE-------------CCCCcE-EEeecCCCcccCcEECCC--
Confidence 445556677766333333 6776653 3333333344 334433 345566777888999998
Q ss_pred CcccCcCCCE-EEEEECCCc--EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-C--CcEEEEECC
Q 000473 584 TAKGWSFNEV-LVSGSMDCS--IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-D--FSVALASLE 657 (1471)
Q Consensus 584 ~~~~~~~~~~-L~SGs~Dgt--I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-D--gsV~lWdl~ 657 (1471)
++. +++.+.++. |.+||+.+++. ..+..+.+......|+|+ |+.++..+. + ..|.+||+.
T Consensus 257 -------G~~la~~~~~~g~~~Iy~~d~~~~~~-~~Lt~~~~~~~~~~~spD------G~~i~f~s~~~g~~~Iy~~d~~ 322 (435)
T PRK05137 257 -------GRKVVMSLSQGGNTDIYTMDLRSGTT-TRLTDSPAIDTSPSYSPD------GSQIVFESDRSGSPQLYVMNAD 322 (435)
T ss_pred -------CCEEEEEEecCCCceEEEEECCCCce-EEccCCCCccCceeEcCC------CCEEEEEECCCCCCeEEEEECC
Confidence 554 557777765 77789988765 456667666778899999 888877664 3 357888988
Q ss_pred CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 658 TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 658 t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
+++..+.. .+...+....|+|+|++|+....+ .++..|++||..++.
T Consensus 323 g~~~~~lt-~~~~~~~~~~~SpdG~~ia~~~~~-----~~~~~i~~~d~~~~~ 369 (435)
T PRK05137 323 GSNPRRIS-FGGGRYSTPVWSPRGDLIAFTKQG-----GGQFSIGVMKPDGSG 369 (435)
T ss_pred CCCeEEee-cCCCcccCeEECCCCCEEEEEEcC-----CCceEEEEEECCCCc
Confidence 76654433 334556778999999999887754 112579999986654
No 244
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=98.04 E-value=1e-05 Score=91.68 Aligned_cols=126 Identities=15% Similarity=0.167 Sum_probs=103.8
Q ss_pred CCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC-----ceEEEEeccCCCEEEEEECC-CCCCCCCCCEE
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG-----NLITVMHHHVAPVRQIILSP-PQTEHPWSDCF 642 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg-----~~l~~~~~H~~~V~~l~fsp-d~~~~~~~~~l 642 (1471)
-.+.|.++.|+.. +.++..|...|.|...|++.+ .+.+.+ .|...|+++..-. + ++++
T Consensus 251 sksDVfAlQf~~s---------~nLv~~GcRngeI~~iDLR~rnqG~~~~a~rl-yh~Ssvtslq~Lq~s------~q~L 314 (425)
T KOG2695|consen 251 SKSDVFALQFAGS---------DNLVFNGCRNGEIFVIDLRCRNQGNGWCAQRL-YHDSSVTSLQILQFS------QQKL 314 (425)
T ss_pred cchhHHHHHhccc---------CCeeEecccCCcEEEEEeeecccCCCcceEEE-EcCcchhhhhhhccc------cceE
Confidence 4567888999865 799999999999999999864 345555 5888899987655 4 7899
Q ss_pred EEEeCCCcEEEEECCCCcE---EEEecCCCCCc--EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 643 LSVGEDFSVALASLETLRV---ERMFPGHPNYP--AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~~---l~~~~gh~~~V--~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
++.+.+|+|++||++--++ ++++.||...- .-+-..+.+..++++++| ...|||.++.|+++.++.-
T Consensus 315 maS~M~gkikLyD~R~~K~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~GdD--------cytRiWsl~~ghLl~tipf 386 (425)
T KOG2695|consen 315 MASDMTGKIKLYDLRATKCKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVGDD--------CYTRIWSLDSGHLLCTIPF 386 (425)
T ss_pred eeccCcCceeEeeehhhhcccceeeeecccccccccccccccccceEEEccCe--------eEEEEEecccCceeeccCC
Confidence 9999999999999987777 99999996633 334557788899999998 9999999999999988764
Q ss_pred C
Q 000473 718 T 718 (1471)
Q Consensus 718 H 718 (1471)
.
T Consensus 387 ~ 387 (425)
T KOG2695|consen 387 P 387 (425)
T ss_pred C
Confidence 3
No 245
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.04 E-value=0.00018 Score=89.33 Aligned_cols=165 Identities=15% Similarity=0.140 Sum_probs=102.4
Q ss_pred ccCccEEEEEeeccccccCC--EEEEEEc-CCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 507 HKEKIVSSSMVISESFYAPY--AIVYGFF-SGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~--~lv~Gs~-DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
.+...+.+.. |+|+ .+++.+. ++.-.|+.|+ +.+++.. .+....+.+.+.+|+||
T Consensus 193 ~~~~~v~~p~------wSPDG~~la~~s~~~~~~~I~~~d-------------l~~g~~~-~l~~~~g~~~~~~~SPD-- 250 (427)
T PRK02889 193 SSPEPIISPA------WSPDGTKLAYVSFESKKPVVYVHD-------------LATGRRR-VVANFKGSNSAPAWSPD-- 250 (427)
T ss_pred cCCCCcccce------EcCCCCEEEEEEccCCCcEEEEEE-------------CCCCCEE-EeecCCCCccceEECCC--
Confidence 4445565555 5554 6766654 3333343344 3344432 33334455678899998
Q ss_pred CcccCcCCCEE-EEEECCCcEEEEEC--CCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-CCcEEEEEC--C
Q 000473 584 TAKGWSFNEVL-VSGSMDCSIRIWDL--GSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-DFSVALASL--E 657 (1471)
Q Consensus 584 ~~~~~~~~~~L-~SGs~DgtI~lWDl--~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-DgsV~lWdl--~ 657 (1471)
++.| ++.+.|+...+|.+ .++. ...+..+.+.+....|+|| |+.++.++. ++...+|.+ .
T Consensus 251 -------G~~la~~~~~~g~~~Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpD------G~~l~f~s~~~g~~~Iy~~~~~ 316 (427)
T PRK02889 251 -------GRTLAVALSRDGNSQIYTVNADGSG-LRRLTQSSGIDTEPFFSPD------GRSIYFTSDRGGAPQIYRMPAS 316 (427)
T ss_pred -------CCEEEEEEccCCCceEEEEECCCCC-cEECCCCCCCCcCeEEcCC------CCEEEEEecCCCCcEEEEEECC
Confidence 6655 57788887776654 4444 5566666666777889999 887776654 466677755 4
Q ss_pred CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 658 TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 658 t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
+++..+.. .+........|+|||++|+..+.+ .+...|++||+.+++...
T Consensus 317 ~g~~~~lt-~~g~~~~~~~~SpDG~~Ia~~s~~-----~g~~~I~v~d~~~g~~~~ 366 (427)
T PRK02889 317 GGAAQRVT-FTGSYNTSPRISPDGKLLAYISRV-----GGAFKLYVQDLATGQVTA 366 (427)
T ss_pred CCceEEEe-cCCCCcCceEECCCCCEEEEEEcc-----CCcEEEEEEECCCCCeEE
Confidence 55443322 222334568999999999877765 111479999999887543
No 246
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=98.02 E-value=0.0068 Score=73.48 Aligned_cols=124 Identities=17% Similarity=0.241 Sum_probs=91.6
Q ss_pred CCccEEEEEEecCCCCcccCcCCCE-EEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEV-LVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~-L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
..++|.++.|+|. + .++ ++-|-+=-.+.++|++ ++.+..| -.++-.++.|+|. |++++-+|-
T Consensus 269 k~GPVhdv~W~~s-----~---~EF~VvyGfMPAkvtifnlr-~~~v~df--~egpRN~~~fnp~------g~ii~lAGF 331 (566)
T KOG2315|consen 269 KEGPVHDVTWSPS-----G---REFAVVYGFMPAKVTIFNLR-GKPVFDF--PEGPRNTAFFNPH------GNIILLAGF 331 (566)
T ss_pred CCCCceEEEECCC-----C---CEEEEEEecccceEEEEcCC-CCEeEeC--CCCCccceEECCC------CCEEEEeec
Confidence 4689999999997 1 333 3445677799999985 6666555 4677899999998 998888775
Q ss_pred C---CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 648 D---FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 648 D---gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
+ |.|-+||+.+.+++..+..- .-+-..|+|||.|++|+... =-=..|+.++||+. +|.++..
T Consensus 332 GNL~G~mEvwDv~n~K~i~~~~a~--~tt~~eW~PdGe~flTATTa--PRlrvdNg~Kiwhy-tG~~l~~ 396 (566)
T KOG2315|consen 332 GNLPGDMEVWDVPNRKLIAKFKAA--NTTVFEWSPDGEYFLTATTA--PRLRVDNGIKIWHY-TGSLLHE 396 (566)
T ss_pred CCCCCceEEEeccchhhccccccC--CceEEEEcCCCcEEEEEecc--ccEEecCCeEEEEe-cCceeeh
Confidence 4 89999999998888777543 34568999999999998752 00001167999997 6766554
No 247
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.02 E-value=0.0004 Score=86.22 Aligned_cols=122 Identities=11% Similarity=0.038 Sum_probs=84.1
Q ss_pred cEEEEEEecCCCCcccCcCCCEEE-EEECCC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLV-SGSMDC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~-SGs~Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
.+..+.|+|| ++.|+ +.+.++ .|.+||+.+++... +..+...+....|+|+ |+.|+.++.+
T Consensus 244 ~~~~~~~SPD---------G~~La~~~~~~g~~~I~~~d~~tg~~~~-lt~~~~~~~~~~wSPD------G~~I~f~s~~ 307 (429)
T PRK03629 244 HNGAPAFSPD---------GSKLAFALSKTGSLNLYVMDLASGQIRQ-VTDGRSNNTEPTWFPD------SQNLAYTSDQ 307 (429)
T ss_pred CcCCeEECCC---------CCEEEEEEcCCCCcEEEEEECCCCCEEE-ccCCCCCcCceEECCC------CCEEEEEeCC
Confidence 3456899998 66555 445555 58899999887644 4444556788999999 8888777654
Q ss_pred -CcEEEE--ECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 649 -FSVALA--SLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 649 -gsV~lW--dl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
+...|| |+.+++.. .+..+........|+|||++|+..+.+ ++...|++||+.+++. +.+.
T Consensus 308 ~g~~~Iy~~d~~~g~~~-~lt~~~~~~~~~~~SpDG~~Ia~~~~~-----~g~~~I~~~dl~~g~~-~~Lt 371 (429)
T PRK03629 308 AGRPQVYKVNINGGAPQ-RITWEGSQNQDADVSSDGKFMVMVSSN-----GGQQHIAKQDLATGGV-QVLT 371 (429)
T ss_pred CCCceEEEEECCCCCeE-EeecCCCCccCEEECCCCCEEEEEEcc-----CCCceEEEEECCCCCe-EEeC
Confidence 444555 67766553 333344456678999999999887665 1225799999998874 3444
No 248
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.00 E-value=1.2e-05 Score=94.66 Aligned_cols=179 Identities=12% Similarity=0.145 Sum_probs=129.9
Q ss_pred ccccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCC
Q 000473 503 DDFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMV 582 (1471)
Q Consensus 503 ~~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~ 582 (1471)
..+.+|...|.++..+.... .+++++.|.++++|... ..+| ...+..+..++..|+.+|..+.|-.+
T Consensus 729 ~nf~GH~~~iRai~AidNEN----SFiSASkDKTVKLWSik--~EgD------~~~tsaCQfTY~aHkk~i~~igfL~~- 795 (1034)
T KOG4190|consen 729 CNFTGHQEKIRAIAAIDNEN----SFISASKDKTVKLWSIK--PEGD------EIGTSACQFTYQAHKKPIHDIGFLAD- 795 (1034)
T ss_pred ecccCcHHHhHHHHhccccc----ceeeccCCceEEEEEec--cccC------ccccceeeeEhhhccCcccceeeeec-
Confidence 55788999998876665554 68999999999995543 2221 23455788899999999999999876
Q ss_pred CCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec--cCCCEEEEEECCCCCCCCCCCEEE-EEeCCCcEEEEECCCC
Q 000473 583 GTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFL-SVGEDFSVALASLETL 659 (1471)
Q Consensus 583 ~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~-S~s~DgsV~lWdl~t~ 659 (1471)
.+.++| .|+.|.+||..-|+++..+.. ..+.+.-+..-|+. +...+. -++...+|+++|-+.+
T Consensus 796 --------lr~i~S--cD~giHlWDPFigr~Laq~~dapk~~a~~~ikcl~nv----~~~iliAgcsaeSTVKl~DaRsc 861 (1034)
T KOG4190|consen 796 --------LRSIAS--CDGGIHLWDPFIGRLLAQMEDAPKEGAGGNIKCLENV----DRHILIAGCSAESTVKLFDARSC 861 (1034)
T ss_pred --------cceeee--ccCcceeecccccchhHhhhcCcccCCCceeEecccC----cchheeeeccchhhheeeecccc
Confidence 566665 589999999998888765432 12233333333331 133444 4477899999999987
Q ss_pred cEEEEe-----cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 660 RVERMF-----PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 660 ~~l~~~-----~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
+-...+ ++...-+.+++..|.|++++++-.+ |+|.+-|.++|+.+...+
T Consensus 862 e~~~E~kVcna~~Pna~~R~iaVa~~GN~lAa~LSn--------Gci~~LDaR~G~vINswr 915 (1034)
T KOG4190|consen 862 EWTCELKVCNAPGPNALTRAIAVADKGNKLAAALSN--------GCIAILDARNGKVINSWR 915 (1034)
T ss_pred cceeeEEeccCCCCchheeEEEeccCcchhhHHhcC--------CcEEEEecCCCceeccCC
Confidence 654433 4555678999999999999998887 999999999998776544
No 249
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.95 E-value=2.5e-05 Score=62.29 Aligned_cols=38 Identities=34% Similarity=0.563 Sum_probs=36.0
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD 607 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD 607 (1471)
++++++.+|.+.|++++|+|+ +.+|++|+.|++|++||
T Consensus 2 ~~~~~~~~h~~~i~~i~~~~~---------~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 2 KCVRTFRGHSSSINSIAWSPD---------GNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEEEESSSSSEEEEEEETT---------SSEEEEEETTSEEEEEE
T ss_pred eEEEEEcCCCCcEEEEEEecc---------cccceeeCCCCEEEEEC
Confidence 578899999999999999997 89999999999999998
No 250
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.90 E-value=2.7e-05 Score=62.06 Aligned_cols=39 Identities=23% Similarity=0.527 Sum_probs=37.0
Q ss_pred CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 659 LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 659 ~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+++++.+.+|...|++++|+|++.+|++++.| ++|++||
T Consensus 1 g~~~~~~~~h~~~i~~i~~~~~~~~~~s~~~D--------~~i~vwd 39 (39)
T PF00400_consen 1 GKCVRTFRGHSSSINSIAWSPDGNFLASGSSD--------GTIRVWD 39 (39)
T ss_dssp EEEEEEEESSSSSEEEEEEETTSSEEEEEETT--------SEEEEEE
T ss_pred CeEEEEEcCCCCcEEEEEEecccccceeeCCC--------CEEEEEC
Confidence 46789999999999999999999999999999 9999998
No 251
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=97.90 E-value=3.4e-05 Score=101.94 Aligned_cols=147 Identities=10% Similarity=0.135 Sum_probs=116.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEE--ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF--LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIR 604 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l--~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~ 604 (1471)
.-++|+.||.+++|.|. .++.+.++ .|. ..|+.+.|+.. |+.+.-+..||.+.
T Consensus 2222 ~Yltgs~dgsv~~~~w~---------------~~~~v~~~rt~g~-s~vtr~~f~~q---------Gnk~~i~d~dg~l~ 2276 (2439)
T KOG1064|consen 2222 YYLTGSQDGSVRMFEWG---------------HGQQVVCFRTAGN-SRVTRSRFNHQ---------GNKFGIVDGDGDLS 2276 (2439)
T ss_pred eEEecCCCceEEEEecc---------------CCCeEEEeeccCc-chhhhhhhccc---------CCceeeeccCCcee
Confidence 67899999999998876 12333333 243 78999999876 78888889999999
Q ss_pred EEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe---CCCcEEEEECCC---CcEEEEecCCCCCcEEEEEc
Q 000473 605 IWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG---EDFSVALASLET---LRVERMFPGHPNYPAKVVWD 678 (1471)
Q Consensus 605 lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s---~DgsV~lWdl~t---~~~l~~~~gh~~~V~~v~~s 678 (1471)
+|.+. .++....+.|......+.|-.. .+++.+ .++.+.+||..- ..+++ ..|.+.++++++-
T Consensus 2277 l~q~~-pk~~~s~qchnk~~~Df~Fi~s--------~~~tag~s~d~~n~~lwDtl~~~~~s~v~--~~H~~gaT~l~~~ 2345 (2439)
T KOG1064|consen 2277 LWQAS-PKPYTSWQCHNKALSDFRFIGS--------LLATAGRSSDNRNVCLWDTLLPPMNSLVH--TCHDGGATVLAYA 2345 (2439)
T ss_pred ecccC-CcceeccccCCccccceeeeeh--------hhhccccCCCCCcccchhcccCcccceee--eecCCCceEEEEc
Confidence 99987 6677788889988888888543 566654 578999999742 23444 5689999999999
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
|....|++|+.+ |.|++||++..++++++..
T Consensus 2346 P~~qllisggr~--------G~v~l~D~rqrql~h~~~~ 2376 (2439)
T KOG1064|consen 2346 PKHQLLISGGRK--------GEVCLFDIRQRQLRHTFQA 2376 (2439)
T ss_pred CcceEEEecCCc--------CcEEEeehHHHHHHHHhhh
Confidence 999999999999 9999999999888776654
No 252
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.80 E-value=0.0017 Score=80.52 Aligned_cols=134 Identities=13% Similarity=0.082 Sum_probs=89.5
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE-CCCcEEEEE--CCC-CceEEEEeccCCCEEEEEECCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS-MDCSIRIWD--LGS-GNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs-~DgtI~lWD--l~t-g~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
...+...++.+......|+|| ++.|+..+ .++...+|. +.. +.....+..+.+.+....|+||
T Consensus 271 ~~~~lt~~~~~~~~~p~wSPD---------G~~Laf~s~~~g~~~ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPD---- 337 (428)
T PRK01029 271 KPRRLLNEAFGTQGNPSFSPD---------GTRLVFVSNKDGRPRIYIMQIDPEGQSPRLLTKKYRNSSCPAWSPD---- 337 (428)
T ss_pred cceEeecCCCCCcCCeEECCC---------CCEEEEEECCCCCceEEEEECcccccceEEeccCCCCccceeECCC----
Confidence 333444444344567899998 67655544 567666664 432 3334555555567788899999
Q ss_pred CCCCEEEEEeCC---CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 637 PWSDCFLSVGED---FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 637 ~~~~~l~S~s~D---gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
|+.|+..+.+ ..|.+||+.+++...... ....+....|+|||++|+....+ ++...|++||+.+++..+
T Consensus 338 --G~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~-~~~~~~~p~wSpDG~~L~f~~~~-----~g~~~L~~vdl~~g~~~~ 409 (428)
T PRK01029 338 --GKKIAFCSVIKGVRQICVYDLATGRDYQLTT-SPENKESPSWAIDSLHLVYSAGN-----SNESELYLISLITKKTRK 409 (428)
T ss_pred --CCEEEEEEcCCCCcEEEEEECCCCCeEEccC-CCCCccceEECCCCCEEEEEECC-----CCCceEEEEECCCCCEEE
Confidence 8888776543 468999999888754433 33456789999999998865543 223689999998887654
Q ss_pred EE
Q 000473 714 VL 715 (1471)
Q Consensus 714 ~l 715 (1471)
..
T Consensus 410 Lt 411 (428)
T PRK01029 410 IV 411 (428)
T ss_pred ee
Confidence 43
No 253
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.77 E-value=0.0013 Score=81.06 Aligned_cols=124 Identities=18% Similarity=0.133 Sum_probs=87.1
Q ss_pred EecCCccEEEEEEecCCCCcccCcCCC-EEEEEECCC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 566 FLGHTGAVLCLAAHRMVGTAKGWSFNE-VLVSGSMDC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 566 l~gH~~~V~~la~spd~~~~~~~~~~~-~L~SGs~Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
+..+.+.+.+++|+|| ++ ++++.+.++ .|.+||+.+++. ..+..+.+......|+|+ ++.+
T Consensus 229 ~~~~~~~~~~~~~spD---------g~~l~~~~~~~~~~~i~~~d~~~~~~-~~l~~~~~~~~~~~~s~d------g~~l 292 (417)
T TIGR02800 229 VASFPGMNGAPAFSPD---------GSKLAVSLSKDGNPDIYVMDLDGKQL-TRLTNGPGIDTEPSWSPD------GKSI 292 (417)
T ss_pred eecCCCCccceEECCC---------CCEEEEEECCCCCccEEEEECCCCCE-EECCCCCCCCCCEEECCC------CCEE
Confidence 4455666778899998 55 445665554 588999988764 344455555567789998 8877
Q ss_pred EEEeCC-C--cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 643 LSVGED-F--SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 643 ~S~s~D-g--sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
+.++.. + .|.++|+.+++.. .+..+...+....|+|+|++|+.++.+ .+...|++||+.++..
T Consensus 293 ~~~s~~~g~~~iy~~d~~~~~~~-~l~~~~~~~~~~~~spdg~~i~~~~~~-----~~~~~i~~~d~~~~~~ 358 (417)
T TIGR02800 293 AFTSDRGGSPQIYMMDADGGEVR-RLTFRGGYNASPSWSPDGDLIAFVHRE-----GGGFNIAVMDLDGGGE 358 (417)
T ss_pred EEEECCCCCceEEEEECCCCCEE-EeecCCCCccCeEECCCCCEEEEEEcc-----CCceEEEEEeCCCCCe
Confidence 766543 3 5778888877654 334455667788999999999988876 1113899999988654
No 254
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=97.74 E-value=0.00023 Score=80.93 Aligned_cols=134 Identities=14% Similarity=0.230 Sum_probs=95.5
Q ss_pred eEEEE-ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCce---EEEEecc-----CCCEEEEEECCC
Q 000473 562 SRQYF-LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL---ITVMHHH-----VAPVRQIILSPP 632 (1471)
Q Consensus 562 ~~~~l-~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~---l~~~~~H-----~~~V~~l~fspd 632 (1471)
+.+.+ .+|+--|+++.+..| .+.++|+ .|-.|.+|++.--.. +-.++.| +.-|++-.|+|.
T Consensus 155 prRv~aNaHtyhiNSIS~NsD---------~Et~lSA-DdLRINLWnlei~d~sFnIVDIKP~nmEeLteVITsaEFhp~ 224 (433)
T KOG1354|consen 155 PRRVYANAHTYHINSISVNSD---------KETFLSA-DDLRINLWNLEIIDQSFNIVDIKPANMEELTEVITSAEFHPH 224 (433)
T ss_pred eeeeccccceeEeeeeeecCc---------cceEeec-cceeeeeccccccCCceeEEEccccCHHHHHHHHhhhccCHh
Confidence 34444 578889999999887 7788877 688999999874321 1122222 345788899998
Q ss_pred CCCCCCCCEEEEEeCCCcEEEEECCCCcE----EEEecC------------CCCCcEEEEEcCCCCEEEEEEcCCCCCCC
Q 000473 633 QTEHPWSDCFLSVGEDFSVALASLETLRV----ERMFPG------------HPNYPAKVVWDCPRGYIACLCRDHSRTSD 696 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~DgsV~lWdl~t~~~----l~~~~g------------h~~~V~~v~~spdg~~L~sgs~D~sg~~D 696 (1471)
+ .+.|+-.+.-|+|+|-|++.... -..|.. --..|..++|++.|+||++-..
T Consensus 225 ~-----cn~f~YSSSKGtIrLcDmR~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~sGryilsRDy------- 292 (433)
T KOG1354|consen 225 H-----CNVFVYSSSKGTIRLCDMRQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHSGRYILSRDY------- 292 (433)
T ss_pred H-----ccEEEEecCCCcEEEeechhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccCCcEEEEecc-------
Confidence 6 56899999999999999984321 011111 1125789999999999998665
Q ss_pred CCCEEEEEEC-CCCeEEEEEeCCC
Q 000473 697 AVDVLFIWDV-KTGARERVLRGTA 719 (1471)
Q Consensus 697 ~~gtV~VWDi-~tg~~~~~l~gH~ 719 (1471)
-+|++||+ ...+++.++.-|.
T Consensus 293 --ltvk~wD~nme~~pv~t~~vh~ 314 (433)
T KOG1354|consen 293 --LTVKLWDLNMEAKPVETYPVHE 314 (433)
T ss_pred --ceeEEEeccccCCcceEEeehH
Confidence 49999999 4567777777664
No 255
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=97.72 E-value=0.00014 Score=82.78 Aligned_cols=120 Identities=20% Similarity=0.219 Sum_probs=88.6
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+..|+.+|.|.++++.... -..+.+.+.+. |...|+|+..-.. ++++|++.+++|+|++|
T Consensus 266 Lv~~GcRngeI~~iDLR~rn----------qG~~~~a~rly-h~Ssvtslq~Lq~--------s~q~LmaS~M~gkikLy 326 (425)
T KOG2695|consen 266 LVFNGCRNGEIFVIDLRCRN----------QGNGWCAQRLY-HDSSVTSLQILQF--------SQQKLMASDMTGKIKLY 326 (425)
T ss_pred eeEecccCCcEEEEEeeecc----------cCCCcceEEEE-cCcchhhhhhhcc--------ccceEeeccCcCceeEe
Confidence 78999999999996654211 11223344443 8889999987542 27899999999999999
Q ss_pred ECCCCce---EEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC
Q 000473 607 DLGSGNL---ITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP 669 (1471)
Q Consensus 607 Dl~tg~~---l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~ 669 (1471)
|++.-++ +..+.+|...-.-+-+.- ++....++++++|...|+|.++.|..+.+++-..
T Consensus 327 D~R~~K~~~~V~qYeGHvN~~a~l~~~v----~~eeg~I~s~GdDcytRiWsl~~ghLl~tipf~~ 388 (425)
T KOG2695|consen 327 DLRATKCKKSVMQYEGHVNLSAYLPAHV----KEEEGSIFSVGDDCYTRIWSLDSGHLLCTIPFPY 388 (425)
T ss_pred eehhhhcccceeeeeccccccccccccc----ccccceEEEccCeeEEEEEecccCceeeccCCCC
Confidence 9987666 888999965443333322 2225688899999999999999999998877443
No 256
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.72 E-value=0.00026 Score=88.35 Aligned_cols=176 Identities=16% Similarity=0.130 Sum_probs=124.8
Q ss_pred ccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCc
Q 000473 523 YAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCS 602 (1471)
Q Consensus 523 f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dgt 602 (1471)
++-..++.|..+|.+.+...+ +.+ .+...|... .. .|.+++|||.||+
T Consensus 47 v~~~~~~~GtH~g~v~~~~~~----------------~~~-~~~~~~s~~------~~---------~Gey~asCS~DGk 94 (846)
T KOG2066|consen 47 VHDKFFALGTHRGAVYLTTCQ----------------GNP-KTNFDHSSS------IL---------EGEYVASCSDDGK 94 (846)
T ss_pred hhcceeeeccccceEEEEecC----------------Ccc-ccccccccc------cc---------CCceEEEecCCCc
Confidence 444469999999999995544 122 223334433 22 3899999999999
Q ss_pred EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC---CCcEEEEecCCCCCcEEEEEcC
Q 000473 603 IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE---TLRVERMFPGHPNYPAKVVWDC 679 (1471)
Q Consensus 603 I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~---t~~~l~~~~gh~~~V~~v~~sp 679 (1471)
|.+-.+.+.+..+++.- ..++.+++++|+ ..+...+.+++||.-| +.++.-+ ....+ .+..-.++|.++.|.
T Consensus 95 v~I~sl~~~~~~~~~df-~rpiksial~Pd-~~~~~sk~fv~GG~ag-lvL~er~wlgnk~~v-~l~~~eG~I~~i~W~- 169 (846)
T KOG2066|consen 95 VVIGSLFTDDEITQYDF-KRPIKSIALHPD-FSRQQSKQFVSGGMAG-LVLSERNWLGNKDSV-VLSEGEGPIHSIKWR- 169 (846)
T ss_pred EEEeeccCCccceeEec-CCcceeEEeccc-hhhhhhhheeecCcce-EEEehhhhhcCccce-eeecCccceEEEEec-
Confidence 99999999988877754 468999999998 3344467899999888 6665422 12222 344556789999997
Q ss_pred CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCc
Q 000473 680 PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNT 745 (1471)
Q Consensus 680 dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~ 745 (1471)
|++++-+..+ -|+|||+.+++.+..+.-....+-...|-+.......+..|++|..
T Consensus 170 -g~lIAWand~---------Gv~vyd~~~~~~l~~i~~p~~~~R~e~fpphl~W~~~~~LVIGW~d 225 (846)
T KOG2066|consen 170 -GNLIAWANDD---------GVKVYDTPTRQRLTNIPPPSQSVRPELFPPHLHWQDEDRLVIGWGD 225 (846)
T ss_pred -CcEEEEecCC---------CcEEEeccccceeeccCCCCCCCCcccCCCceEecCCCeEEEecCC
Confidence 6788877766 4899999999988887766555555545555555566677777766
No 257
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.71 E-value=0.002 Score=79.43 Aligned_cols=158 Identities=12% Similarity=0.046 Sum_probs=99.2
Q ss_pred ccCC--EEE-EEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC
Q 000473 523 YAPY--AIV-YGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM 599 (1471)
Q Consensus 523 f~P~--~lv-~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~ 599 (1471)
|+|+ .++ +...+|...|+.|+ ..++. ...+..|.+......|+|+ ++.|+..+.
T Consensus 241 ~spDg~~l~~~~~~~~~~~i~~~d-------------~~~~~-~~~l~~~~~~~~~~~~s~d---------g~~l~~~s~ 297 (417)
T TIGR02800 241 FSPDGSKLAVSLSKDGNPDIYVMD-------------LDGKQ-LTRLTNGPGIDTEPSWSPD---------GKSIAFTSD 297 (417)
T ss_pred ECCCCCEEEEEECCCCCccEEEEE-------------CCCCC-EEECCCCCCCCCCEEECCC---------CCEEEEEEC
Confidence 5555 454 44556655554454 22332 2334445555567788987 676665543
Q ss_pred -CC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC---cEEEEECCCCcEEEEecCCCCCcE
Q 000473 600 -DC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF---SVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 600 -Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg---sV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
++ .|.+||+.+++. ..+..+...+....|+|+ ++.++..+.++ .|.+||+.++... .+..+ ....
T Consensus 298 ~~g~~~iy~~d~~~~~~-~~l~~~~~~~~~~~~spd------g~~i~~~~~~~~~~~i~~~d~~~~~~~-~l~~~-~~~~ 368 (417)
T TIGR02800 298 RGGSPQIYMMDADGGEV-RRLTFRGGYNASPSWSPD------GDLIAFVHREGGGFNIAVMDLDGGGER-VLTDT-GLDE 368 (417)
T ss_pred CCCCceEEEEECCCCCE-EEeecCCCCccCeEECCC------CCEEEEEEccCCceEEEEEeCCCCCeE-EccCC-CCCC
Confidence 33 578888887764 344455666788899999 88888888776 7899999886543 33322 2245
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
...|+|++++|+..+.+ ++...+++.+. +|...+.+..+
T Consensus 369 ~p~~spdg~~l~~~~~~-----~~~~~l~~~~~-~g~~~~~~~~~ 407 (417)
T TIGR02800 369 SPSFAPNGRMILYATTR-----GGRGVLGLVST-DGRFRARLPLG 407 (417)
T ss_pred CceECCCCCEEEEEEeC-----CCcEEEEEEEC-CCceeeECCCC
Confidence 56899999999988876 22235666664 35555555433
No 258
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.66 E-value=0.00017 Score=88.32 Aligned_cols=161 Identities=19% Similarity=0.255 Sum_probs=118.5
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCC---ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSG---NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg---~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
.|-.+.|+|..+ ...++++.+...+| +|++... ..-..+.+|+..|+.+.|+|.+ ...+++++.|
T Consensus 69 ~vad~qws~h~a------~~~wiVsts~qkai-iwnlA~ss~~aIef~lhghsraitd~n~~~q~-----pdVlatcsvd 136 (1081)
T KOG0309|consen 69 QVADVQWSPHPA------KPYWIVSTSNQKAI-IWNLAKSSSNAIEFVLHGHSRAITDINFNPQH-----PDVLATCSVD 136 (1081)
T ss_pred hhcceecccCCC------CceeEEecCcchhh-hhhhhcCCccceEEEEecCccceeccccCCCC-----Ccceeecccc
Confidence 366677776521 25788888777655 7988632 3446778999999999999984 5699999999
Q ss_pred CcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEeCCCCCceeee
Q 000473 649 FSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLRGTASHSMFDH 726 (1471)
Q Consensus 649 gsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~gH~~~v~~~~ 726 (1471)
..|..||+++. .++..+..-...-..|+|+--...+.+.+.. ..|+|||.+-| ..+..+.||.+.+..+.
T Consensus 137 t~vh~wd~rSp~~p~ys~~~w~s~asqVkwnyk~p~vlasshg--------~~i~vwd~r~gs~pl~s~K~~vs~vn~~~ 208 (1081)
T KOG0309|consen 137 TYVHAWDMRSPHRPFYSTSSWRSAASQVKWNYKDPNVLASSHG--------NDIFVWDLRKGSTPLCSLKGHVSSVNSID 208 (1081)
T ss_pred ccceeeeccCCCcceeeeecccccCceeeecccCcchhhhccC--------CceEEEeccCCCcceEEecccceeeehHH
Confidence 99999999875 4566666555667889998766666555554 68999999865 67889999998888776
Q ss_pred eeeccccccccceEEcCCccccccceeeccCCceEeeccccc
Q 000473 727 FCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQSQIQND 768 (1471)
Q Consensus 727 ~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~w~l~~~ 768 (1471)
|.... -+.+++.+.|++++.|+..+.
T Consensus 209 fnr~~----------------~s~~~s~~~d~tvkfw~y~kS 234 (1081)
T KOG0309|consen 209 FNRFK----------------YSEIMSSSNDGTVKFWDYSKS 234 (1081)
T ss_pred Hhhhh----------------hhhhcccCCCCceeeeccccc
Confidence 55211 123456667999999987543
No 259
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.62 E-value=0.0022 Score=79.57 Aligned_cols=129 Identities=14% Similarity=0.159 Sum_probs=91.1
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC---CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCC
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD---CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D---gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
.+.+..|...+....|+|| ++.|+..+.+ ..|.+||+.+++... +....+.+....|+|+ |
T Consensus 191 ~~~l~~~~~~~~~p~wSpD---------G~~la~~s~~~~~~~l~~~~l~~g~~~~-l~~~~g~~~~~~~SpD------G 254 (430)
T PRK00178 191 AVTLLQSREPILSPRWSPD---------GKRIAYVSFEQKRPRIFVQNLDTGRREQ-ITNFEGLNGAPAWSPD------G 254 (430)
T ss_pred ceEEecCCCceeeeeECCC---------CCEEEEEEcCCCCCEEEEEECCCCCEEE-ccCCCCCcCCeEECCC------C
Confidence 3556667788999999998 7777665533 368899999887543 3333445567899999 8
Q ss_pred CEEE-EEeCCC--cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 640 DCFL-SVGEDF--SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 640 ~~l~-S~s~Dg--sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
+.++ +.+.++ .|.+||+.+++..+ +..+........|+|||+.|+..+.. ++...|++||+.+|+..+
T Consensus 255 ~~la~~~~~~g~~~Iy~~d~~~~~~~~-lt~~~~~~~~~~~spDg~~i~f~s~~-----~g~~~iy~~d~~~g~~~~ 325 (430)
T PRK00178 255 SKLAFVLSKDGNPEIYVMDLASRQLSR-VTNHPAIDTEPFWGKDGRTLYFTSDR-----GGKPQIYKVNVNGGRAER 325 (430)
T ss_pred CEEEEEEccCCCceEEEEECCCCCeEE-cccCCCCcCCeEECCCCCEEEEEECC-----CCCceEEEEECCCCCEEE
Confidence 7666 555555 57888999877643 44455556778999999988776643 222579999998887544
No 260
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=97.62 E-value=0.038 Score=66.53 Aligned_cols=97 Identities=13% Similarity=0.101 Sum_probs=67.7
Q ss_pred CCccEEEEEEecCCCCcccCcCCCEEEEEECC---CcEEEEECCCCceE-EEEeccCCCEEEEEECCCCCCCCCCCEEEE
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD---CSIRIWDLGSGNLI-TVMHHHVAPVRQIILSPPQTEHPWSDCFLS 644 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~L~SGs~D---gtI~lWDl~tg~~l-~~~~~H~~~V~~l~fspd~~~~~~~~~l~S 644 (1471)
-.+.=+.+.|+|. +++++.++-| |.|-+||....... ..+.+-. ..-+.|+|+ ++++.+
T Consensus 314 Pe~~rNT~~fsp~---------~r~il~agF~nl~gni~i~~~~~rf~~~~~~~~~n--~s~~~wspd------~qF~~~ 376 (561)
T COG5354 314 PEQKRNTIFFSPH---------ERYILFAGFDNLQGNIEIFDPAGRFKVAGAFNGLN--TSYCDWSPD------GQFYDT 376 (561)
T ss_pred CCcccccccccCc---------ccEEEEecCCccccceEEeccCCceEEEEEeecCC--ceEeeccCC------ceEEEe
Confidence 3444467788987 7888887766 57899998654333 3554433 355679999 887776
Q ss_pred Ee------CCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 645 VG------EDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 645 ~s------~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
.- .|..|+|||+...... ..+.+.|.|.+++..+.+.+
T Consensus 377 ~~ts~k~~~Dn~i~l~~v~g~~~f--------el~~~~W~p~~~~~ttsSs~ 420 (561)
T COG5354 377 DTTSEKLRVDNSIKLWDVYGAKVF--------ELTNITWDPSGQYVTTSSSC 420 (561)
T ss_pred cCCCcccccCcceEEEEecCchhh--------hhhhccccCCcccceeeccC
Confidence 53 4889999998743321 46789999999988776655
No 261
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=97.61 E-value=0.00095 Score=74.94 Aligned_cols=117 Identities=15% Similarity=0.054 Sum_probs=92.2
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEE--EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcE
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLIT--VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSV 651 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~--~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV 651 (1471)
.++.|++- +..++++-.+|.+.+-+.....+.. .++.|.-+.+...|+..+ .+.+.+||+|+.+
T Consensus 125 lslD~~~~---------~~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He~E~Wta~f~~~~-----pnlvytGgDD~~l 190 (339)
T KOG0280|consen 125 LSLDISTS---------GTKIFVSDSRGSISGVYETEMVLEKVQTWKVHEFEAWTAKFSDKE-----PNLVYTGGDDGSL 190 (339)
T ss_pred eEEEeecc---------CceEEEEcCCCcEEEEecceeeeeecccccccceeeeeeecccCC-----CceEEecCCCceE
Confidence 46677764 6779999999999866665555443 788999999999998763 5799999999999
Q ss_pred EEEECC-CCcEEEE-ecCCCCCcEEEEEcCC-CCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEE
Q 000473 652 ALASLE-TLRVERM-FPGHPNYPAKVVWDCP-RGYIACLCRDHSRTSDAVDVLFIWDVKT-GARE 712 (1471)
Q Consensus 652 ~lWdl~-t~~~l~~-~~gh~~~V~~v~~spd-g~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~ 712 (1471)
..||++ .++.+.. -.-|...|.+|.-+|. +.+|+||+.| ..|++||.++ |+++
T Consensus 191 ~~~D~R~p~~~i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYD--------e~i~~~DtRnm~kPl 247 (339)
T KOG0280|consen 191 SCWDIRIPKTFIWHNSKVHTSGVVSIYSSPPKPTYIATGSYD--------ECIRVLDTRNMGKPL 247 (339)
T ss_pred EEEEecCCcceeeecceeeecceEEEecCCCCCceEEEeccc--------cceeeeehhcccCcc
Confidence 999999 4444433 4568889999998875 5689999988 8999999984 4543
No 262
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.60 E-value=0.0073 Score=75.03 Aligned_cols=120 Identities=18% Similarity=0.089 Sum_probs=82.0
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEE-EEECCC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLV-SGSMDC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~-SGs~Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
.+.+....|+|| ++.|+ +.+.++ .|.+||+.+++.. .+..+........|+|+ |+.++..+
T Consensus 242 ~g~~~~~~~SpD---------G~~la~~~~~~g~~~Iy~~d~~~~~~~-~lt~~~~~~~~~~~spD------g~~i~f~s 305 (430)
T PRK00178 242 EGLNGAPAWSPD---------GSKLAFVLSKDGNPEIYVMDLASRQLS-RVTNHPAIDTEPFWGKD------GRTLYFTS 305 (430)
T ss_pred CCCcCCeEECCC---------CCEEEEEEccCCCceEEEEECCCCCeE-EcccCCCCcCCeEECCC------CCEEEEEE
Confidence 344557899998 66554 665555 5888899887653 45556666677889999 77766555
Q ss_pred C-CC--cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 647 E-DF--SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 647 ~-Dg--sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
. ++ .|.++|+.+++..+... .........|+|+|++|+....+ ++...|++||+.+++.
T Consensus 306 ~~~g~~~iy~~d~~~g~~~~lt~-~~~~~~~~~~Spdg~~i~~~~~~-----~~~~~l~~~dl~tg~~ 367 (430)
T PRK00178 306 DRGGKPQIYKVNVNGGRAERVTF-VGNYNARPRLSADGKTLVMVHRQ-----DGNFHVAAQDLQRGSV 367 (430)
T ss_pred CCCCCceEEEEECCCCCEEEeec-CCCCccceEECCCCCEEEEEEcc-----CCceEEEEEECCCCCE
Confidence 3 33 57777888887644332 12234467899999999887754 1124699999998864
No 263
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.55 E-value=0.003 Score=78.34 Aligned_cols=125 Identities=16% Similarity=0.161 Sum_probs=79.6
Q ss_pred CCccEEEEEEecCCCCcccCcCCCEEEEEE-C----CCcEEEEECCCC---ceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEVLVSGS-M----DCSIRIWDLGSG---NLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~L~SGs-~----DgtI~lWDl~tg---~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
..+.....+|+|| ++.|+-.+ . |..+..||+..+ +.......+.+......|+|| |+
T Consensus 229 ~~g~~~~p~wSPD---------G~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPD------G~ 293 (428)
T PRK01029 229 LQGNQLMPTFSPR---------KKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPD------GT 293 (428)
T ss_pred CCCCccceEECCC---------CCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCC------CC
Confidence 3444556899998 66555433 2 334555787653 333333333344567899999 88
Q ss_pred EEEEEe-CCCcEEEEE--CCC-CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 641 CFLSVG-EDFSVALAS--LET-LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 641 ~l~S~s-~DgsV~lWd--l~t-~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
.|+..+ .++...+|. +.. +...+.+..+...+....|+|||++|+..+.+ ++...|++||+.+|+..+
T Consensus 294 ~Laf~s~~~g~~~ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~-----~g~~~I~v~dl~~g~~~~ 365 (428)
T PRK01029 294 RLVFVSNKDGRPRIYIMQIDPEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVI-----KGVRQICVYDLATGRDYQ 365 (428)
T ss_pred EEEEEECCCCCceEEEEECcccccceEEeccCCCCccceeECCCCCEEEEEEcC-----CCCcEEEEEECCCCCeEE
Confidence 777665 467666664 432 23344454455567889999999999877654 223579999999987643
No 264
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.55 E-value=0.0059 Score=76.31 Aligned_cols=161 Identities=13% Similarity=0.087 Sum_probs=97.9
Q ss_pred ccCC--EEE-EEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEE-EEE
Q 000473 523 YAPY--AIV-YGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLV-SGS 598 (1471)
Q Consensus 523 f~P~--~lv-~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~-SGs 598 (1471)
|+|+ .++ +...+|...|+.++ .++++. ..+..|...+....|+|| +++|+ +..
T Consensus 269 wSPDG~~La~~~~~~g~~~Iy~~d-------------l~tg~~-~~lt~~~~~~~~p~wSpD---------G~~I~f~s~ 325 (448)
T PRK04792 269 FSPDGKKLALVLSKDGQPEIYVVD-------------IATKAL-TRITRHRAIDTEPSWHPD---------GKSLIFTSE 325 (448)
T ss_pred ECCCCCEEEEEEeCCCCeEEEEEE-------------CCCCCe-EECccCCCCccceEECCC---------CCEEEEEEC
Confidence 5665 454 45678876665555 233333 344555556678899998 66554 443
Q ss_pred CCC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-CC--cEEEEECCCCcEEEEecCCCCCcE
Q 000473 599 MDC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-DF--SVALASLETLRVERMFPGHPNYPA 673 (1471)
Q Consensus 599 ~Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-Dg--sV~lWdl~t~~~l~~~~gh~~~V~ 673 (1471)
.++ .|.++|+.+++... +..+........|+|+ |+.++..+. ++ .|.++|+.+++........ ...
T Consensus 326 ~~g~~~Iy~~dl~~g~~~~-Lt~~g~~~~~~~~SpD------G~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~~~--~d~ 396 (448)
T PRK04792 326 RGGKPQIYRVNLASGKVSR-LTFEGEQNLGGSITPD------GRSMIMVNRTNGKFNIARQDLETGAMQVLTSTR--LDE 396 (448)
T ss_pred CCCCceEEEEECCCCCEEE-EecCCCCCcCeeECCC------CCEEEEEEecCCceEEEEEECCCCCeEEccCCC--CCC
Confidence 444 46667887777543 2222223345689999 887766654 33 4556788887754322221 223
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCC
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASH 721 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~ 721 (1471)
...|+|+|++|+....+ ++...+++++. +|...+.+..+.+.
T Consensus 397 ~ps~spdG~~I~~~~~~-----~g~~~l~~~~~-~G~~~~~l~~~~g~ 438 (448)
T PRK04792 397 SPSVAPNGTMVIYSTTY-----QGKQVLAAVSI-DGRFKARLPAGQGE 438 (448)
T ss_pred CceECCCCCEEEEEEec-----CCceEEEEEEC-CCCceEECcCCCCC
Confidence 45899999999887765 22246888887 56677777665443
No 265
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=97.55 E-value=0.00025 Score=86.45 Aligned_cols=169 Identities=18% Similarity=0.178 Sum_probs=123.0
Q ss_pred ccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccC-CCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccC
Q 000473 510 KIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERH-NSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGW 588 (1471)
Q Consensus 510 ~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~-d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~ 588 (1471)
....|..+..... .+++|+.||.++|...++-... ...+ .-...+...-++|.||.+.|.-+.|...
T Consensus 15 vkL~c~~WNke~g----yIAcgG~dGlLKVlKl~t~t~d~~~~g-laa~snLsmNQtLeGH~~sV~vvTWNe~------- 82 (1189)
T KOG2041|consen 15 VKLHCAEWNKESG----YIACGGADGLLKVLKLGTDTTDLNKSG-LAAASNLSMNQTLEGHNASVMVVTWNEN------- 82 (1189)
T ss_pred ceEEEEEEcccCC----eEEeccccceeEEEEccccCCcccccc-cccccccchhhhhccCcceEEEEEeccc-------
Confidence 4566666665555 6999999999999776532211 0011 1122233445789999999999999876
Q ss_pred cCCCEEEEEECCCcEEEEECCCCceEEEEec--cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEE-EEe
Q 000473 589 SFNEVLVSGSMDCSIRIWDLGSGNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVE-RMF 665 (1471)
Q Consensus 589 ~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l-~~~ 665 (1471)
.+.|-|...+|.|.+|-+..|.....|.. ..+-|.++.|+.+ |..++-+-.||.|.+=.++..+.. ..+
T Consensus 83 --~QKLTtSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~d------G~kIcIvYeDGavIVGsvdGNRIwgKeL 154 (1189)
T KOG2041|consen 83 --NQKLTTSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLD------GTKICIVYEDGAVIVGSVDGNRIWGKEL 154 (1189)
T ss_pred --cccccccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCC------CcEEEEEEccCCEEEEeeccceecchhc
Confidence 78899999999999999998875544432 3456899999999 999999999999988777654432 122
Q ss_pred cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 666 PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 666 ~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.|. ....+.|++|.+.++.+-.+ |.+.++|.+.
T Consensus 155 kg~--~l~hv~ws~D~~~~Lf~~an--------ge~hlydnqg 187 (1189)
T KOG2041|consen 155 KGQ--LLAHVLWSEDLEQALFKKAN--------GETHLYDNQG 187 (1189)
T ss_pred chh--eccceeecccHHHHHhhhcC--------CcEEEecccc
Confidence 221 23478999999988887777 8999999753
No 266
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=97.54 E-value=0.00037 Score=81.15 Aligned_cols=87 Identities=20% Similarity=0.203 Sum_probs=74.4
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEe-ccCCCEEEEEECCCCCCCCCCCEE
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMH-HHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~-~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
..+.||-.-++.++++|| +++++++..|..|++=....-..+..|. ||...|..++.-++ +.|
T Consensus 145 ~~~lGhvSml~dVavS~D---------~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~isl~~~-------~~L 208 (390)
T KOG3914|consen 145 EPILGHVSMLLDVAVSPD---------DQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTISLTDN-------YLL 208 (390)
T ss_pred chhhhhhhhhheeeecCC---------CCEEEEecCCceEEEEecCcccchhhhccccHhheeeeeeccC-------cee
Confidence 445699999999999998 8999999999999987665544555554 69999999999875 568
Q ss_pred EEEeCCCcEEEEECCCCcEEEEec
Q 000473 643 LSVGEDFSVALASLETLRVERMFP 666 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~~l~~~~ 666 (1471)
+|+|.|+++++||+++|++++.+.
T Consensus 209 lS~sGD~tlr~Wd~~sgk~L~t~d 232 (390)
T KOG3914|consen 209 LSGSGDKTLRLWDITSGKLLDTCD 232 (390)
T ss_pred eecCCCCcEEEEecccCCcccccc
Confidence 999999999999999999987765
No 267
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.48 E-value=0.0042 Score=77.58 Aligned_cols=128 Identities=13% Similarity=0.108 Sum_probs=88.6
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC-CC--cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM-DC--SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~-Dg--tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
+.+..+...+.+..|+|| ++.|+-.+. ++ .|.+||+.+++... +....+......|+|+ |+
T Consensus 211 ~~l~~~~~~~~~p~wSPD---------G~~La~~s~~~g~~~L~~~dl~tg~~~~-lt~~~g~~~~~~wSPD------G~ 274 (448)
T PRK04792 211 QMLLRSPEPLMSPAWSPD---------GRKLAYVSFENRKAEIFVQDIYTQVREK-VTSFPGINGAPRFSPD------GK 274 (448)
T ss_pred eEeecCCCcccCceECCC---------CCEEEEEEecCCCcEEEEEECCCCCeEE-ecCCCCCcCCeeECCC------CC
Confidence 455566778899999998 676665543 33 58899998887532 2222334457889999 77
Q ss_pred EEE-EEeCCCc--EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 641 CFL-SVGEDFS--VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 641 ~l~-S~s~Dgs--V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
.|+ +.+.++. |.++|+++++..+ +..+........|+|||++|+..+.. ++...|+++|+.+|+..+
T Consensus 275 ~La~~~~~~g~~~Iy~~dl~tg~~~~-lt~~~~~~~~p~wSpDG~~I~f~s~~-----~g~~~Iy~~dl~~g~~~~ 344 (448)
T PRK04792 275 KLALVLSKDGQPEIYVVDIATKALTR-ITRHRAIDTEPSWHPDGKSLIFTSER-----GGKPQIYRVNLASGKVSR 344 (448)
T ss_pred EEEEEEeCCCCeEEEEEECCCCCeEE-CccCCCCccceEECCCCCEEEEEECC-----CCCceEEEEECCCCCEEE
Confidence 665 4566665 7777888877543 44455566788999999999876643 122578889998887544
No 268
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=97.44 E-value=0.00023 Score=78.48 Aligned_cols=94 Identities=15% Similarity=0.137 Sum_probs=71.1
Q ss_pred cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE-EEEecCCCCCcEEEEEcCC
Q 000473 602 SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV-ERMFPGHPNYPAKVVWDCP 680 (1471)
Q Consensus 602 tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~-l~~~~gh~~~V~~v~~spd 680 (1471)
..+.|+++..+.+..-..-...|.+++-+|.. .+.+++|+.||.+.+||.+.... ...+..|+.+++.|.|+|.
T Consensus 160 ~~~a~~~~p~~t~~~~~~~~~~v~~l~~hp~q-----q~~v~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk 234 (319)
T KOG4714|consen 160 NFYANTLDPIKTLIPSKKALDAVTALCSHPAQ-----QHLVCCGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPK 234 (319)
T ss_pred ceeeecccccccccccccccccchhhhCCccc-----ccEEEEecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCC
Confidence 45667766444332222223448888888874 66888999999999999998753 3456789999999999994
Q ss_pred -CCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 681 -RGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 681 -g~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+..|+++++| |.+.-||..+
T Consensus 235 ~p~~Lft~sed--------Gslw~wdas~ 255 (319)
T KOG4714|consen 235 NPEHLFTCSED--------GSLWHWDAST 255 (319)
T ss_pred CchheeEecCC--------CcEEEEcCCC
Confidence 5689999999 9999999765
No 269
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.42 E-value=0.13 Score=62.74 Aligned_cols=118 Identities=15% Similarity=0.249 Sum_probs=84.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC---CcE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD---CSI 603 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D---gtI 603 (1471)
.++.|+.--.+.|++.+ +.++..| -.++=+++-|+|. +++|+-+|-+ |.|
T Consensus 286 ~VvyGfMPAkvtifnlr----------------~~~v~df--~egpRN~~~fnp~---------g~ii~lAGFGNL~G~m 338 (566)
T KOG2315|consen 286 AVVYGFMPAKVTIFNLR----------------GKPVFDF--PEGPRNTAFFNPH---------GNIILLAGFGNLPGDM 338 (566)
T ss_pred EEEEecccceEEEEcCC----------------CCEeEeC--CCCCccceEECCC---------CCEEEEeecCCCCCce
Confidence 57788888888883322 2333333 3566678899997 7877776644 789
Q ss_pred EEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC------CCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 604 RIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE------DFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 604 ~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~------DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
-+||+.+.+++..+..-.. +-+.|+|| |++|+|+.. |+.++||+.. |+.+..-. ..+....+.|
T Consensus 339 EvwDv~n~K~i~~~~a~~t--t~~eW~Pd------Ge~flTATTaPRlrvdNg~Kiwhyt-G~~l~~~~-f~sEL~qv~W 408 (566)
T KOG2315|consen 339 EVWDVPNRKLIAKFKAANT--TVFEWSPD------GEYFLTATTAPRLRVDNGIKIWHYT-GSLLHEKM-FKSELLQVEW 408 (566)
T ss_pred EEEeccchhhccccccCCc--eEEEEcCC------CcEEEEEeccccEEecCCeEEEEec-Cceeehhh-hhHhHhheee
Confidence 9999999999988876543 55789999 999998864 8999999986 55543321 1115788999
Q ss_pred cCCC
Q 000473 678 DCPR 681 (1471)
Q Consensus 678 spdg 681 (1471)
.|-.
T Consensus 409 ~P~~ 412 (566)
T KOG2315|consen 409 RPFN 412 (566)
T ss_pred eecC
Confidence 8743
No 270
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.34 E-value=0.23 Score=59.92 Aligned_cols=123 Identities=18% Similarity=0.278 Sum_probs=83.9
Q ss_pred ccEEEEEEecCCCCcccCcCCC-EEEEEECCCcEEEEECC--CCce--EEEEecc------CCCEEEEEECCCCCCCCCC
Q 000473 571 GAVLCLAAHRMVGTAKGWSFNE-VLVSGSMDCSIRIWDLG--SGNL--ITVMHHH------VAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 571 ~~V~~la~spd~~~~~~~~~~~-~L~SGs~DgtI~lWDl~--tg~~--l~~~~~H------~~~V~~l~fspd~~~~~~~ 639 (1471)
..-.-++|+|+ ++ ..+..-.+++|.++++. ++.. +..+... ......|+++|+ |
T Consensus 192 ~GPRh~~f~pd---------g~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispd------g 256 (345)
T PF10282_consen 192 SGPRHLAFSPD---------GKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPD------G 256 (345)
T ss_dssp SSEEEEEE-TT---------SSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TT------S
T ss_pred CCCcEEEEcCC---------cCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecC------C
Confidence 34678999997 55 45666788899999998 4532 2222211 125788999999 7
Q ss_pred CEEE-EEeCCCcEEEEECC--CCc--EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC--CCCeEE
Q 000473 640 DCFL-SVGEDFSVALASLE--TLR--VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV--KTGARE 712 (1471)
Q Consensus 640 ~~l~-S~s~DgsV~lWdl~--t~~--~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi--~tg~~~ 712 (1471)
+++. +.-.+.+|.++++. +++ .+..+......++.++++|+|++|++++.+ ++.|.+|++ ++|.+.
T Consensus 257 ~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~-------s~~v~vf~~d~~tG~l~ 329 (345)
T PF10282_consen 257 RFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQD-------SNTVSVFDIDPDTGKLT 329 (345)
T ss_dssp SEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETT-------TTEEEEEEEETTTTEEE
T ss_pred CEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecC-------CCeEEEEEEeCCCCcEE
Confidence 7554 45567899999993 343 344454445568999999999999999987 378999865 688875
Q ss_pred EEE
Q 000473 713 RVL 715 (1471)
Q Consensus 713 ~~l 715 (1471)
..-
T Consensus 330 ~~~ 332 (345)
T PF10282_consen 330 PVG 332 (345)
T ss_dssp EEE
T ss_pred Eec
Confidence 543
No 271
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.33 E-value=0.00052 Score=81.36 Aligned_cols=173 Identities=16% Similarity=0.187 Sum_probs=118.5
Q ss_pred CCCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCc--eeeeEEecccccceeEeeeccccccccCccccccccc
Q 000473 12 GTPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSE--IKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAE 89 (1471)
Q Consensus 12 ~~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~--~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~ 89 (1471)
=+.+...|.+++.-.+..-+++++.|.+|++|.+.++.++. .....+...|+.+|.++.|
T Consensus 731 f~GH~~~iRai~AidNENSFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~~igf------------------ 792 (1034)
T KOG4190|consen 731 FTGHQEKIRAIAAIDNENSFISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIHDIGF------------------ 792 (1034)
T ss_pred ccCcHHHhHHHHhcccccceeeccCCceEEEEEeccccCccccceeeeEhhhccCcccceee------------------
Confidence 34566678888888888889999999999999998643211 1234566799999999984
Q ss_pred ccccccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCC-CCeEEEEcceecccCCccccccc
Q 000473 90 NSSNVMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPS-NPRYVCIGCCFIDTNQLSDHHSF 168 (1471)
Q Consensus 90 ~~~~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~-~~~ll~~G~~~id~~~~~~~h~~ 168 (1471)
-.+..++ ++.||.|.+||---|+.+....-.+.+|....|..++. +..++..||.
T Consensus 793 ---------L~~lr~i--~ScD~giHlWDPFigr~Laq~~dapk~~a~~~ikcl~nv~~~iliAgcs------------- 848 (1034)
T KOG4190|consen 793 ---------LADLRSI--ASCDGGIHLWDPFIGRLLAQMEDAPKEGAGGNIKCLENVDRHILIAGCS------------- 848 (1034)
T ss_pred ---------eecccee--eeccCcceeecccccchhHhhhcCcccCCCceeEecccCcchheeeecc-------------
Confidence 3344445 45699999999888877765433445577778887776 5556666764
Q ss_pred ccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCcc-cc-CCeEEEEEeeecCCCCceeEEEEeCCCcEEEEE
Q 000473 169 ESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNL-SI-GPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVP 246 (1471)
Q Consensus 169 ~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~-s~-~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~ 246 (1471)
...+|.++|.++.+-...+.-++. -| .-+.++++.+ .|+ .+.+|-++|+|.+-|
T Consensus 849 -------------------aeSTVKl~DaRsce~~~E~kVcna~~Pna~~R~iaVa~---~GN--~lAa~LSnGci~~LD 904 (1034)
T KOG4190|consen 849 -------------------AESTVKLFDARSCEWTCELKVCNAPGPNALTRAIAVAD---KGN--KLAAALSNGCIAILD 904 (1034)
T ss_pred -------------------chhhheeeecccccceeeEEeccCCCCchheeEEEecc---Ccc--hhhHHhcCCcEEEEe
Confidence 125677888887665554443111 11 2267777763 343 366777789999999
Q ss_pred CCCC
Q 000473 247 ISKE 250 (1471)
Q Consensus 247 l~~~ 250 (1471)
..++
T Consensus 905 aR~G 908 (1034)
T KOG4190|consen 905 ARNG 908 (1034)
T ss_pred cCCC
Confidence 8776
No 272
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=97.32 E-value=0.015 Score=69.02 Aligned_cols=128 Identities=16% Similarity=0.240 Sum_probs=105.7
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCC-cEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDC-SIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dg-tI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
-+|.+.|.-..+.-+ ++-++-|..|| .+-++|..+++ ..++...-+.|.++..+|+ |..++.+
T Consensus 356 v~~~~~VrY~r~~~~---------~e~~vigt~dgD~l~iyd~~~~e-~kr~e~~lg~I~av~vs~d------GK~~vva 419 (668)
T COG4946 356 VGKKGGVRYRRIQVD---------PEGDVIGTNDGDKLGIYDKDGGE-VKRIEKDLGNIEAVKVSPD------GKKVVVA 419 (668)
T ss_pred cCCCCceEEEEEccC---------CcceEEeccCCceEEEEecCCce-EEEeeCCccceEEEEEcCC------CcEEEEE
Confidence 478888888888765 56888999999 89999998776 4566677789999999999 9988888
Q ss_pred eCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 646 GEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 646 s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
.....+.+.|++++.....=....+-|+...|+|++++++-+--+ |+.- ..|+++|+.+++....
T Consensus 420 Ndr~el~vididngnv~~idkS~~~lItdf~~~~nsr~iAYafP~--gy~t--q~Iklydm~~~Kiy~v 484 (668)
T COG4946 420 NDRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIAYAFPE--GYYT--QSIKLYDMDGGKIYDV 484 (668)
T ss_pred cCceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEEEecCc--ceee--eeEEEEecCCCeEEEe
Confidence 888899999999998755545566779999999999999988765 4444 6899999999886554
No 273
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=97.32 E-value=0.063 Score=66.42 Aligned_cols=119 Identities=13% Similarity=0.139 Sum_probs=86.0
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
..++.|-.++|.+++.+ .++|+.+-..|.|.-+.+..+-+...... ...|.++..+=+ ...+
T Consensus 497 ~kt~~G~~DpICAl~~s-----------dk~l~vareSG~I~rySl~nv~l~n~y~~-n~~~y~~~lNCn------stRl 558 (1189)
T KOG2041|consen 497 TKTLLGSKDPICALCIS-----------DKFLMVARESGGIYRYSLNNVVLTNSYPV-NPSIYSIKLNCN------STRL 558 (1189)
T ss_pred ceeeccCCCcceeeeec-----------ceEEEEEeccCceEEEEecceeeeecccc-CchheeEeeccC------cchh
Confidence 45677889999999876 68999999999999999988776655533 346788887655 5566
Q ss_pred EEEeCCCcEEEEECC---CCcEEE-EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 643 LSVGEDFSVALASLE---TLRVER-MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~---t~~~l~-~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
+...--|.+.+.|+. +|..+. .+......|+.+.|..|...|+.--.- ..++|++-.
T Consensus 559 AiId~~gv~tf~dLd~d~~g~ql~~~~~~errDVWd~~Wa~dNp~llAlmeK--------trmyifrgn 619 (1189)
T KOG2041|consen 559 AIIDLVGVVTFQDLDYDFDGDQLKLIYTSERRDVWDYEWAQDNPNLLALMEK--------TRMYIFRGN 619 (1189)
T ss_pred hhhhhhceeeeeecccccCcceeeeeehhhhhhhhhhhhccCCchHHhhhhh--------ceEEEecCc
Confidence 666667788888885 455544 334455679999998888777655544 567777643
No 274
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=97.26 E-value=0.00086 Score=74.06 Aligned_cols=74 Identities=20% Similarity=0.347 Sum_probs=64.4
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceE-EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCc
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLI-TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFS 650 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l-~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dgs 650 (1471)
.|++++-||.. .+.+++|+.||.+.+||.+..... ..+..|..+++.+-|+|.. +..|.++++||+
T Consensus 181 ~v~~l~~hp~q--------q~~v~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~-----p~~Lft~sedGs 247 (319)
T KOG4714|consen 181 AVTALCSHPAQ--------QHLVCCGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKN-----PEHLFTCSEDGS 247 (319)
T ss_pred cchhhhCCccc--------ccEEEEecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCCC-----chheeEecCCCc
Confidence 49999999972 678999999999999999887543 5577899999999999985 778999999999
Q ss_pred EEEEECCC
Q 000473 651 VALASLET 658 (1471)
Q Consensus 651 V~lWdl~t 658 (1471)
+.-||-.+
T Consensus 248 lw~wdas~ 255 (319)
T KOG4714|consen 248 LWHWDAST 255 (319)
T ss_pred EEEEcCCC
Confidence 99999764
No 275
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=97.12 E-value=0.0025 Score=80.56 Aligned_cols=139 Identities=13% Similarity=0.184 Sum_probs=110.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEE--------
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGS-------- 598 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs-------- 598 (1471)
.+.+|...|+|.+ .|..+.+.++++..|++.|..+..+ |+.|+++|
T Consensus 189 ~lf~G~t~G~V~L---------------rD~~s~~~iht~~aHs~siSDfDv~-----------GNlLitCG~S~R~~~l 242 (1118)
T KOG1275|consen 189 NLFCGDTRGTVFL---------------RDPNSFETIHTFDAHSGSISDFDVQ-----------GNLLITCGYSMRRYNL 242 (1118)
T ss_pred EEEeecccceEEe---------------ecCCcCceeeeeeccccceeeeecc-----------CCeEEEeecccccccc
Confidence 6889999999998 2455668899999999999988765 78899887
Q ss_pred -CCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE---CCCC-cEEEEecCCCCCcE
Q 000473 599 -MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALAS---LETL-RVERMFPGHPNYPA 673 (1471)
Q Consensus 599 -~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd---l~t~-~~l~~~~gh~~~V~ 673 (1471)
.|.-|+|||++.-+.+..+..+.++ .-+.|.|.- ...++.++..|...+-| +.+. .-+.++..-...+.
T Consensus 243 ~~D~FvkVYDLRmmral~PI~~~~~P-~flrf~Psl-----~t~~~V~S~sGq~q~vd~~~lsNP~~~~~~v~p~~s~i~ 316 (1118)
T KOG1275|consen 243 AMDPFVKVYDLRMMRALSPIQFPYGP-QFLRFHPSL-----TTRLAVTSQSGQFQFVDTATLSNPPAGVKMVNPNGSGIS 316 (1118)
T ss_pred cccchhhhhhhhhhhccCCcccccCc-hhhhhcccc-----cceEEEEecccceeeccccccCCCccceeEEccCCCcce
Confidence 4567899999988877777777766 567788874 45788888999999999 3332 22344444455689
Q ss_pred EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 674 KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 674 ~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+..++++++.|+.|..+ |.|.+|-
T Consensus 317 ~fDiSsn~~alafgd~~--------g~v~~wa 340 (1118)
T KOG1275|consen 317 AFDISSNGDALAFGDHE--------GHVNLWA 340 (1118)
T ss_pred eEEecCCCceEEEeccc--------CcEeeec
Confidence 99999999999999888 9999997
No 276
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=97.07 E-value=0.59 Score=56.79 Aligned_cols=124 Identities=16% Similarity=0.161 Sum_probs=85.5
Q ss_pred EEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEE
Q 000473 565 YFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLS 644 (1471)
Q Consensus 565 ~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S 644 (1471)
...+-.++|....|.|+ + ++--+++|-++-++.++|++.. ....+ -.+.=..+.|+|. +++++.
T Consensus 269 V~~~~~~pVhdf~W~p~-----S--~~F~vi~g~~pa~~s~~~lr~N-l~~~~--Pe~~rNT~~fsp~------~r~il~ 332 (561)
T COG5354 269 VEKDLKDPVHDFTWEPL-----S--SRFAVISGYMPASVSVFDLRGN-LRFYF--PEQKRNTIFFSPH------ERYILF 332 (561)
T ss_pred eeccccccceeeeeccc-----C--CceeEEecccccceeecccccc-eEEec--CCcccccccccCc------ccEEEE
Confidence 33355789999999997 1 1224566779999999999754 44333 3444567889998 889998
Q ss_pred EeCC---CcEEEEECCCCcEEE-EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 645 VGED---FSVALASLETLRVER-MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 645 ~s~D---gsV~lWdl~t~~~l~-~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
++-| |.+-+||....-.+. .+.+-. ..-+.|+||+.|+.+.... ---..|..|.|||+..
T Consensus 333 agF~nl~gni~i~~~~~rf~~~~~~~~~n--~s~~~wspd~qF~~~~~ts--~k~~~Dn~i~l~~v~g 396 (561)
T COG5354 333 AGFDNLQGNIEIFDPAGRFKVAGAFNGLN--TSYCDWSPDGQFYDTDTTS--EKLRVDNSIKLWDVYG 396 (561)
T ss_pred ecCCccccceEEeccCCceEEEEEeecCC--ceEeeccCCceEEEecCCC--cccccCcceEEEEecC
Confidence 8766 789999987654433 555433 3457899999999876432 1112347899999853
No 277
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.06 E-value=0.0087 Score=68.09 Aligned_cols=147 Identities=12% Similarity=0.075 Sum_probs=94.3
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+...+.|+.|.+|+. ...+-...+..-...+....|+|| | .+.|.+..-|-.|.+|
T Consensus 63 ilC~~yk~~~vqvwsl---------------~Qpew~ckIdeg~agls~~~WSPd-----g---rhiL~tseF~lriTVW 119 (447)
T KOG4497|consen 63 ILCVAYKDPKVQVWSL---------------VQPEWYCKIDEGQAGLSSISWSPD-----G---RHILLTSEFDLRITVW 119 (447)
T ss_pred eeeeeeccceEEEEEe---------------ecceeEEEeccCCCcceeeeECCC-----c---ceEeeeecceeEEEEE
Confidence 4555678999999432 222334556666778999999998 1 4677788889999999
Q ss_pred ECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC------------------------------------CCc
Q 000473 607 DLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE------------------------------------DFS 650 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~------------------------------------Dgs 650 (1471)
.+.+.+....-... ..+..++|+|+ |++.+-.+. +..
T Consensus 120 SL~t~~~~~~~~pK-~~~kg~~f~~d------g~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPdg~~ 192 (447)
T KOG4497|consen 120 SLNTQKGYLLPHPK-TNVKGYAFHPD------GQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPDGNW 192 (447)
T ss_pred EeccceeEEecccc-cCceeEEECCC------CceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCCCcE
Confidence 99987755433222 23577888887 443332221 233
Q ss_pred EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 651 VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 651 V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
+.+||---.-.+..++. .-.+..+.|+|.+++|++|+.| +.+||-+--|.+..
T Consensus 193 laVwd~~Leykv~aYe~-~lG~k~v~wsP~~qflavGsyD--------~~lrvlnh~tWk~f 245 (447)
T KOG4497|consen 193 LAVWDNVLEYKVYAYER-GLGLKFVEWSPCNQFLAVGSYD--------QMLRVLNHFTWKPF 245 (447)
T ss_pred EEEecchhhheeeeeee-ccceeEEEeccccceEEeeccc--------hhhhhhceeeeeeh
Confidence 45555321111222221 1358889999999999999998 89998776554443
No 278
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.06 E-value=0.031 Score=76.68 Aligned_cols=117 Identities=13% Similarity=0.127 Sum_probs=85.0
Q ss_pred cEEEEEEecCCCCcccCcCCC-EEEEEECCCcEEEEECCCCceEEEEe-------------ccC--------CCEEEEEE
Q 000473 572 AVLCLAAHRMVGTAKGWSFNE-VLVSGSMDCSIRIWDLGSGNLITVMH-------------HHV--------APVRQIIL 629 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~-~L~SGs~DgtI~lWDl~tg~~l~~~~-------------~H~--------~~V~~l~f 629 (1471)
..+.++++|+ +. ++++-+.++.|++||+.++....... .+. ..-..+++
T Consensus 741 ~P~GIavspd---------G~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvav 811 (1057)
T PLN02919 741 QPSGISLSPD---------LKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLC 811 (1057)
T ss_pred CccEEEEeCC---------CCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeE
Confidence 3467899987 44 77777888999999998765321110 000 12357788
Q ss_pred CCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEec-------------CCCCCcEEEEEcCCCCEEEEEEcCCCCCCC
Q 000473 630 SPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFP-------------GHPNYPAKVVWDCPRGYIACLCRDHSRTSD 696 (1471)
Q Consensus 630 spd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~-------------gh~~~V~~v~~spdg~~L~sgs~D~sg~~D 696 (1471)
+|+ |+.+++-..++.|++||..++....... ++-..+..|+++++|+.+++-+.+
T Consensus 812 d~d------G~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~N------ 879 (1057)
T PLN02919 812 AKD------GQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNN------ 879 (1057)
T ss_pred eCC------CcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCC------
Confidence 888 8889999999999999998877654321 112357889999999877766665
Q ss_pred CCCEEEEEECCCCeE
Q 000473 697 AVDVLFIWDVKTGAR 711 (1471)
Q Consensus 697 ~~gtV~VWDi~tg~~ 711 (1471)
++|++||+.+++.
T Consensus 880 --n~Irvid~~~~~~ 892 (1057)
T PLN02919 880 --SLIRYLDLNKGEA 892 (1057)
T ss_pred --CEEEEEECCCCcc
Confidence 8999999999875
No 279
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=97.04 E-value=0.032 Score=62.28 Aligned_cols=148 Identities=9% Similarity=-0.076 Sum_probs=101.1
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCc--cEEEEEEecCCCCcccCcCCCEEEEEECCCcEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTG--AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIR 604 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~--~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~ 604 (1471)
.+..++.|.++++...+. .+.+. .-|.. .++.+++++| ++++++.+....|.
T Consensus 130 ~~~i~sndht~k~~~~~~-------------~s~~~----~~h~~~~~~ns~~~snd---------~~~~~~Vgds~~Vf 183 (344)
T KOG4532|consen 130 PLNIASNDHTGKTMVVSG-------------DSNKF----AVHNQNLTQNSLHYSND---------PSWGSSVGDSRRVF 183 (344)
T ss_pred ceeeccCCcceeEEEEec-------------Ccccc----eeeccccceeeeEEcCC---------CceEEEecCCCcce
Confidence 466778888888854431 11111 11332 2788999997 89999999999999
Q ss_pred EEECCCC-ceE-E-EEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE-----EecCCCCCcEEEE
Q 000473 605 IWDLGSG-NLI-T-VMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER-----MFPGHPNYPAKVV 676 (1471)
Q Consensus 605 lWDl~tg-~~l-~-~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~-----~~~gh~~~V~~v~ 676 (1471)
.+.+... +.+ . ....-...=.+..|+.. ...||++..||++.|||++...... +-+.|.+.+..+.
T Consensus 184 ~y~id~~sey~~~~~~a~t~D~gF~~S~s~~------~~~FAv~~Qdg~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~ 257 (344)
T KOG4532|consen 184 RYAIDDESEYIENIYEAPTSDHGFYNSFSEN------DLQFAVVFQDGTCAIYDVRNMATPMAEISSTRPHHNGAFRVCR 257 (344)
T ss_pred EEEeCCccceeeeeEecccCCCceeeeeccC------cceEEEEecCCcEEEEEecccccchhhhcccCCCCCCceEEEE
Confidence 9998743 322 2 12222333466778776 6799999999999999998754332 2245888999999
Q ss_pred EcCCCC---EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 677 WDCPRG---YIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 677 ~spdg~---~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
|+|-|. ++++-.. +.+.|-|++++...+.+
T Consensus 258 Fsl~g~lDLLf~sEhf---------s~~hv~D~R~~~~~q~I 290 (344)
T KOG4532|consen 258 FSLYGLLDLLFISEHF---------SRVHVVDTRNYVNHQVI 290 (344)
T ss_pred ecCCCcceEEEEecCc---------ceEEEEEcccCceeeEE
Confidence 997543 4444433 68999999998754443
No 280
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=97.00 E-value=0.013 Score=73.09 Aligned_cols=139 Identities=18% Similarity=0.116 Sum_probs=101.9
Q ss_pred ccCC-EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc-cC--cCCCEEEEEE
Q 000473 523 YAPY-AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK-GW--SFNEVLVSGS 598 (1471)
Q Consensus 523 f~P~-~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~-~~--~~~~~L~SGs 598 (1471)
++|. .++.|+.+ .|.|. |..+-+.++.+.-|...|+.+.|.|...-.+ .. ...-+|+++.
T Consensus 23 w~~~GLiAygshs-lV~VV---------------Ds~s~q~iqsie~h~s~V~~VrWap~~~p~~llS~~~~~lliAsaD 86 (1062)
T KOG1912|consen 23 WSPSGLIAYGSHS-LVSVV---------------DSRSLQLIQSIELHQSAVTSVRWAPAPSPRDLLSPSSSQLLIASAD 86 (1062)
T ss_pred cCccceEEEecCc-eEEEE---------------ehhhhhhhhccccCccceeEEEeccCCCchhccCccccceeEEecc
Confidence 4554 56777654 34442 2334466788889999999999988531111 11 1245788999
Q ss_pred CCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCC-CEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 599 MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWS-DCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 599 ~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~-~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
..|.|.+||...+..+..+..|..+|..+.|-|.. ++. +.++....-.++-+|+..+|+.+...........|+.+
T Consensus 87 ~~GrIil~d~~~~s~~~~l~~~~~~~qdl~W~~~r---d~Srd~LlaIh~ss~lvLwntdtG~k~Wk~~ys~~iLs~f~~ 163 (1062)
T KOG1912|consen 87 ISGRIILVDFVLASVINWLSHSNDSVQDLCWVPAR---DDSRDVLLAIHGSSTLVLWNTDTGEKFWKYDYSHEILSCFRV 163 (1062)
T ss_pred ccCcEEEEEehhhhhhhhhcCCCcchhheeeeecc---CcchheeEEecCCcEEEEEEccCCceeeccccCCcceeeeee
Confidence 99999999999999888999999999999998763 334 57888888899999999999988776655555666777
Q ss_pred cCC
Q 000473 678 DCP 680 (1471)
Q Consensus 678 spd 680 (1471)
.|-
T Consensus 164 DPf 166 (1062)
T KOG1912|consen 164 DPF 166 (1062)
T ss_pred CCC
Confidence 663
No 281
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.00 E-value=0.081 Score=72.67 Aligned_cols=117 Identities=15% Similarity=0.085 Sum_probs=82.6
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEecc---------------CCCEEEEEECCCCCCCCC
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHH---------------VAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H---------------~~~V~~l~fspd~~~~~~ 638 (1471)
+.++++|+ ++.++++.+.++.|++||..++... .+.+. -.....++++|+
T Consensus 686 ~gVa~dp~--------~g~LyVad~~~~~I~v~d~~~g~v~-~~~G~G~~~~~~g~~~~~~~~~~P~GIavspd------ 750 (1057)
T PLN02919 686 WDVCFEPV--------NEKVYIAMAGQHQIWEYNISDGVTR-VFSGDGYERNLNGSSGTSTSFAQPSGISLSPD------ 750 (1057)
T ss_pred eEEEEecC--------CCeEEEEECCCCeEEEEECCCCeEE-EEecCCccccCCCCccccccccCccEEEEeCC------
Confidence 57889885 2678888889999999999877542 33221 123456889998
Q ss_pred CC-EEEEEeCCCcEEEEECCCCcEEEEecC---------------------CCCCcEEEEEcCCCCEEEEEEcCCCCCCC
Q 000473 639 SD-CFLSVGEDFSVALASLETLRVERMFPG---------------------HPNYPAKVVWDCPRGYIACLCRDHSRTSD 696 (1471)
Q Consensus 639 ~~-~l~S~s~DgsV~lWdl~t~~~l~~~~g---------------------h~~~V~~v~~spdg~~L~sgs~D~sg~~D 696 (1471)
++ .+++-+.++.|++||++++.......+ ....+..++++++|+.+++-..+
T Consensus 751 G~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N------ 824 (1057)
T PLN02919 751 LKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYN------ 824 (1057)
T ss_pred CCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCC------
Confidence 66 666777789999999987654221110 01135689999999877766665
Q ss_pred CCCEEEEEECCCCeEEE
Q 000473 697 AVDVLFIWDVKTGARER 713 (1471)
Q Consensus 697 ~~gtV~VWDi~tg~~~~ 713 (1471)
++|++||..++....
T Consensus 825 --~rIrviD~~tg~v~t 839 (1057)
T PLN02919 825 --HKIKKLDPATKRVTT 839 (1057)
T ss_pred --CEEEEEECCCCeEEE
Confidence 899999998876543
No 282
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=96.98 E-value=2.4 Score=57.70 Aligned_cols=115 Identities=12% Similarity=0.101 Sum_probs=66.6
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCce--------EEEEe----------ccCCCEEEEEECC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL--------ITVMH----------HHVAPVRQIILSP 631 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~--------l~~~~----------~H~~~V~~l~fsp 631 (1471)
...|.+++|+++ +..++.-..|++|.+|....... ...+. .-...+..+.|..
T Consensus 426 ~~~v~~vaf~~~---------~~~~avl~~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (928)
T PF04762_consen 426 PSPVNDVAFSPS---------NSRFAVLTSDGSLSIYEWDLKNMWSVKPPKLLSSISLDSMDISDSELPLGSLRQLAWLN 496 (928)
T ss_pred CCCcEEEEEeCC---------CCeEEEEECCCCEEEEEecCCCcccccCcchhhhcccccccccccccccccEEEEEEeC
Confidence 467999999986 55588889999999988543321 11111 1234567788776
Q ss_pred CCCCCCCCCEEEEEeCC---CcEEEEECCCCc---EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 632 PQTEHPWSDCFLSVGED---FSVALASLETLR---VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 632 d~~~~~~~~~l~S~s~D---gsV~lWdl~t~~---~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+ ...++....+ ..+.++++...+ .+.....-...+..+...++...++.-..| |.++..+
T Consensus 497 ~------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~--------G~v~~~~ 562 (928)
T PF04762_consen 497 D------DTLLVLSDSDSNQSKIVLVDIDDSENSASVESSTEVDGVVLIISSSPDSGSLYIQTND--------GKVFQLS 562 (928)
T ss_pred C------CEEEEEEecCcccceEEEEEeccCCCceeEEEEeccCceEEEEeeCCCCcEEEEEECC--------CEEEEee
Confidence 6 4444444443 577777774332 222222233445555555555435555555 7777554
Q ss_pred CC
Q 000473 706 VK 707 (1471)
Q Consensus 706 i~ 707 (1471)
..
T Consensus 563 ~~ 564 (928)
T PF04762_consen 563 SD 564 (928)
T ss_pred cC
Confidence 43
No 283
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=96.95 E-value=0.0015 Score=76.87 Aligned_cols=170 Identities=18% Similarity=0.180 Sum_probs=119.6
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcc
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~ 586 (1471)
-|+..|+.+.+.... .++.++.||.++. |.. .+ ..--+.+..+..|-+.+.+++.+.+
T Consensus 7 mhrd~i~hv~~tka~-----fiiqASlDGh~KF--WkK-------s~---isGvEfVKhFraHL~~I~sl~~S~d----- 64 (558)
T KOG0882|consen 7 MHRDVITHVFPTKAK-----FIIQASLDGHKKF--WKK-------SR---ISGVEFVKHFRAHLGVILSLAVSYD----- 64 (558)
T ss_pred cccceeeeEeeehhh-----eEEeeecchhhhh--cCC-------CC---ccceeehhhhHHHHHHHHhhhcccc-----
Confidence 367777776543333 5899999999998 431 00 0111334556677777777777665
Q ss_pred cCcCCCEEEEEEC-CCcEEEEECCCCc------------------------------------------------eEEEE
Q 000473 587 GWSFNEVLVSGSM-DCSIRIWDLGSGN------------------------------------------------LITVM 617 (1471)
Q Consensus 587 ~~~~~~~L~SGs~-DgtI~lWDl~tg~------------------------------------------------~l~~~ 617 (1471)
+.++.|++. |..++++|+.+-. ....-
T Consensus 65 ----g~L~~Sv~d~Dhs~KvfDvEn~DminmiKL~~lPg~a~wv~skGd~~s~IAVs~~~sg~i~VvD~~~d~~q~~~fk 140 (558)
T KOG0882|consen 65 ----GWLFRSVEDPDHSVKVFDVENFDMINMIKLVDLPGFAEWVTSKGDKISLIAVSLFKSGKIFVVDGFGDFCQDGYFK 140 (558)
T ss_pred ----ceeEeeccCcccceeEEEeeccchhhhcccccCCCceEEecCCCCeeeeEEeecccCCCcEEECCcCCcCccceec
Confidence 677777666 8888887775211 00111
Q ss_pred eccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC------CcE---------EEEecCCCCCcEEEEEcCCCC
Q 000473 618 HHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET------LRV---------ERMFPGHPNYPAKVVWDCPRG 682 (1471)
Q Consensus 618 ~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t------~~~---------l~~~~gh~~~V~~v~~spdg~ 682 (1471)
.-|..+|.++.+.|. +++++|....|.|.-|..+. .+. +..+.-....+.++.|+|++.
T Consensus 141 klH~sPV~~i~y~qa------~Ds~vSiD~~gmVEyWs~e~~~qfPr~~l~~~~K~eTdLy~f~K~Kt~pts~Efsp~g~ 214 (558)
T KOG0882|consen 141 KLHFSPVKKIRYNQA------GDSAVSIDISGMVEYWSAEGPFQFPRTNLNFELKHETDLYGFPKAKTEPTSFEFSPDGA 214 (558)
T ss_pred ccccCceEEEEeecc------ccceeeccccceeEeecCCCcccCccccccccccccchhhcccccccCccceEEccccC
Confidence 237889999999998 89999999999999999872 111 112223345678999999999
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.+.+-..| ..|++++.++|++++.+.
T Consensus 215 qistl~~D--------rkVR~F~~KtGklvqeiD 240 (558)
T KOG0882|consen 215 QISTLNPD--------RKVRGFVFKTGKLVQEID 240 (558)
T ss_pred cccccCcc--------cEEEEEEeccchhhhhhh
Confidence 99998888 999999999999888765
No 284
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=96.91 E-value=0.0057 Score=70.03 Aligned_cols=171 Identities=16% Similarity=0.206 Sum_probs=115.1
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
..|+..|+++.+.++.. .++ ...|=.|.+ |.+ .-.|...++.|++ +. .+..-+.-|++-.|||.+
T Consensus 161 NaHtyhiNSIS~NsD~E----t~l-SADdLRINL--Wnl-ei~d~sFnIVDIK---P~-nmEeLteVITsaEFhp~~--- 225 (433)
T KOG1354|consen 161 NAHTYHINSISVNSDKE----TFL-SADDLRINL--WNL-EIIDQSFNIVDIK---PA-NMEELTEVITSAEFHPHH--- 225 (433)
T ss_pred ccceeEeeeeeecCccc----eEe-eccceeeee--ccc-cccCCceeEEEcc---cc-CHHHHHHHHhhhccCHhH---
Confidence 46888899988777776 344 344555555 542 1111122222221 10 011224468888999974
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceE----------------EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLI----------------TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF 649 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l----------------~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg 649 (1471)
..+++=.+..|+|++-|++...+. .-|..-...|..+.|+++ |+++++-.. -
T Consensus 226 -----cn~f~YSSSKGtIrLcDmR~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~s------GryilsRDy-l 293 (433)
T KOG1354|consen 226 -----CNVFVYSSSKGTIRLCDMRQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHS------GRYILSRDY-L 293 (433)
T ss_pred -----ccEEEEecCCCcEEEeechhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccC------CcEEEEecc-c
Confidence 678888899999999999843211 112233457889999998 999988642 6
Q ss_pred cEEEEEC-CCCcEEEEecCCCC------------Cc---EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 650 SVALASL-ETLRVERMFPGHPN------------YP---AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 650 sV~lWdl-~t~~~l~~~~gh~~------------~V---~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
+|++||+ ...+++.+++-|.. .| ..++|+.++.+++||+.. ...++++...|..
T Consensus 294 tvk~wD~nme~~pv~t~~vh~~lr~kLc~lYEnD~IfdKFec~~sg~~~~v~TGsy~--------n~frvf~~~~gsk 363 (433)
T KOG1354|consen 294 TVKLWDLNMEAKPVETYPVHEYLRSKLCSLYENDAIFDKFECSWSGNDSYVMTGSYN--------NVFRVFNLARGSK 363 (433)
T ss_pred eeEEEeccccCCcceEEeehHhHHHHHHHHhhccchhheeEEEEcCCcceEeccccc--------ceEEEecCCCCcc
Confidence 8999999 56778888777743 22 568999999999999988 8999999776653
No 285
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=96.91 E-value=0.49 Score=57.69 Aligned_cols=109 Identities=11% Similarity=-0.013 Sum_probs=78.2
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
+..+..++.++.+..||..+|+.+...... ......+ . ++.+..++.|+.+..+|..+++.+........
T Consensus 241 ~~~vy~~~~~g~l~a~d~~tG~~~W~~~~~--~~~~p~~--~------~~~vyv~~~~G~l~~~d~~tG~~~W~~~~~~~ 310 (377)
T TIGR03300 241 GGQVYAVSYQGRVAALDLRSGRVLWKRDAS--SYQGPAV--D------DNRLYVTDADGVVVALDRRSGSELWKNDELKY 310 (377)
T ss_pred CCEEEEEEcCCEEEEEECCCCcEEEeeccC--CccCceE--e------CCEEEEECCCCeEEEEECCCCcEEEccccccC
Confidence 467777889999999999999987665421 1111111 2 56778888999999999999988766532111
Q ss_pred -CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCC
Q 000473 671 -YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 671 -~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
....... .+.+|++++.+ |.|+++|..+|+.+..+..+.
T Consensus 311 ~~~ssp~i--~g~~l~~~~~~--------G~l~~~d~~tG~~~~~~~~~~ 350 (377)
T TIGR03300 311 RQLTAPAV--VGGYLVVGDFE--------GYLHWLSREDGSFVARLKTDG 350 (377)
T ss_pred CccccCEE--ECCEEEEEeCC--------CEEEEEECCCCCEEEEEEcCC
Confidence 1122222 36788999888 999999999999998887555
No 286
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.88 E-value=0.0022 Score=79.03 Aligned_cols=127 Identities=14% Similarity=0.167 Sum_probs=97.4
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc-eEEEEeccCCCEEEEEECCCCCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN-LITVMHHHVAPVRQIILSPPQTEHPWS 639 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~-~l~~~~~H~~~V~~l~fspd~~~~~~~ 639 (1471)
.....+.||+.+|+.+-|+|.+ ...+++++.|..|..||+++.. ++..+......-.+|+|+... +
T Consensus 105 aIef~lhghsraitd~n~~~q~--------pdVlatcsvdt~vh~wd~rSp~~p~ys~~~w~s~asqVkwnyk~-----p 171 (1081)
T KOG0309|consen 105 AIEFVLHGHSRAITDINFNPQH--------PDVLATCSVDTYVHAWDMRSPHRPFYSTSSWRSAASQVKWNYKD-----P 171 (1081)
T ss_pred ceEEEEecCccceeccccCCCC--------CcceeeccccccceeeeccCCCcceeeeecccccCceeeecccC-----c
Confidence 4456788999999999999873 7899999999999999998764 455565555667889998762 4
Q ss_pred CEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCC-CCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 640 DCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCP-RGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spd-g~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
+.++ .+.-+.|.+||++.| .++..+.+|...|+.+.|..- ...+.+.+.| |+|+.||....
T Consensus 172 ~vla-sshg~~i~vwd~r~gs~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d--------~tvkfw~y~kS 234 (1081)
T KOG0309|consen 172 NVLA-SSHGNDIFVWDLRKGSTPLCSLKGHVSSVNSIDFNRFKYSEIMSSSND--------GTVKFWDYSKS 234 (1081)
T ss_pred chhh-hccCCceEEEeccCCCcceEEecccceeeehHHHhhhhhhhhcccCCC--------Cceeeeccccc
Confidence 4444 455678999999865 567889999998999988542 2345555555 99999998654
No 287
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.83 E-value=0.31 Score=58.82 Aligned_cols=211 Identities=13% Similarity=0.139 Sum_probs=111.2
Q ss_pred eEEEEEEcCCCCeEEEEe-CCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccc
Q 000473 18 RVTATSALTQPPTLYTGG-SDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMG 96 (1471)
Q Consensus 18 ~Vtava~SpDg~~LaTGs-~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~ 96 (1471)
.-..++++||+++|+++. .+|+|.+++++. ++.+.....+.-|.+. . |......+.--|. -
T Consensus 88 ~p~~i~~~~~g~~l~vany~~g~v~v~~l~~--~g~l~~~~~~~~~~g~----g---~~~~rq~~~h~H~---------v 149 (345)
T PF10282_consen 88 SPCHIAVDPDGRFLYVANYGGGSVSVFPLDD--DGSLGEVVQTVRHEGS----G---PNPDRQEGPHPHQ---------V 149 (345)
T ss_dssp CEEEEEECTTSSEEEEEETTTTEEEEEEECT--TSEEEEEEEEEESEEE----E---SSTTTTSSTCEEE---------E
T ss_pred CcEEEEEecCCCEEEEEEccCCeEEEEEccC--CcccceeeeecccCCC----C---Cccccccccccee---------E
Confidence 345789999999999886 589999999985 3333332222222211 0 0000000000000 0
Q ss_pred cccCCCCEEEEEe-CCCeEEEEEcCCCe--EEE--eeeCCCCCCCCcEEEEcCCCCeEEE-EcceecccCCccccccccc
Q 000473 97 KSSLDNGALISAC-TDGVLCVWSRSSGH--CRR--RRKLPPWVGSPSVICTLPSNPRYVC-IGCCFIDTNQLSDHHSFES 170 (1471)
Q Consensus 97 ~~s~d~~~LaSas-~DG~I~VWdv~~G~--ci~--~~~l~~~~g~~~~i~~~s~~~~ll~-~G~~~id~~~~~~~h~~~~ 170 (1471)
.++||+++++... ....|.+++++.+. +.. ..+++.. ..|..+. |+++++++. +...
T Consensus 150 ~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G-~GPRh~~-f~pdg~~~Yv~~e~--------------- 212 (345)
T PF10282_consen 150 VFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPG-SGPRHLA-FSPDGKYAYVVNEL--------------- 212 (345)
T ss_dssp EE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTT-SSEEEEE-E-TTSSEEEEEETT---------------
T ss_pred EECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccC-CCCcEEE-EcCCcCEEEEecCC---------------
Confidence 1588888777653 34579999998654 433 2345532 2355555 445554443 3332
Q ss_pred ccccccccccCCCCCCCCCceEEEEeCc----ceEEEEEeecC--ccccC-CeEEEEEeeecCCCCceeEEEE-eCCCcE
Q 000473 171 VEGDLVSEDKEVPMKNPPKCTLVIVDTY----GLTIVQTVFHG--NLSIG-PWKFMDVVSLGEDMGKHYGLMV-DSVGRL 242 (1471)
Q Consensus 171 i~~~~~~~d~~~~~~~~~~~~I~v~D~~----t~~~l~tl~s~--~~s~~-~i~~~~~~~~~~d~~~~~llva-s~dG~V 242 (1471)
..+|.+++.. ..+.++++... ..... +...+.++ ||++ .++++ ...+.|
T Consensus 213 ------------------s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~is---pdg~--~lyvsnr~~~sI 269 (345)
T PF10282_consen 213 ------------------SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAIS---PDGR--FLYVSNRGSNSI 269 (345)
T ss_dssp ------------------TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE----TTSS--EEEEEECTTTEE
T ss_pred ------------------CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEe---cCCC--EEEEEeccCCEE
Confidence 2566666655 34445544431 12222 46677777 5665 47887 778999
Q ss_pred EEEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCe
Q 000473 243 QLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDH 300 (1471)
Q Consensus 243 ~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~ 300 (1471)
.+++++.... . + +.+..+.+. ...-+.++++|+|+.++....+.
T Consensus 270 ~vf~~d~~~g---~----l----~~~~~~~~~---G~~Pr~~~~s~~g~~l~Va~~~s 313 (345)
T PF10282_consen 270 SVFDLDPATG---T----L----TLVQTVPTG---GKFPRHFAFSPDGRYLYVANQDS 313 (345)
T ss_dssp EEEEECTTTT---T----E----EEEEEEEES---SSSEEEEEE-TTSSEEEEEETTT
T ss_pred EEEEEecCCC---c----e----EEEEEEeCC---CCCccEEEEeCCCCEEEEEecCC
Confidence 9999964310 0 0 011111111 11237889999999998876553
No 288
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.79 E-value=0.3 Score=57.20 Aligned_cols=207 Identities=11% Similarity=0.112 Sum_probs=117.3
Q ss_pred EEEEEcCCCCeEEEEe-CCCcEEEEEccCCCCCceeeeEEecccccceeEe----eeccccccccCcccccccccccccc
Q 000473 20 TATSALTQPPTLYTGG-SDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADL----SICYPAMVSRDGKAEHWKAENSSNV 94 (1471)
Q Consensus 20 tava~SpDg~~LaTGs-~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~L----a~c~~~~~s~dg~~~~~~~~~~~~~ 94 (1471)
+.+++++||++|+++. .-|.|.+.-+.. ++.+.+..-+.-|.++.-.- ..|+.+
T Consensus 92 ~yvsvd~~g~~vf~AnY~~g~v~v~p~~~--dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a------------------- 150 (346)
T COG2706 92 CYVSVDEDGRFVFVANYHSGSVSVYPLQA--DGSLQPVVQVVKHTGSGPHERQESPHVHSA------------------- 150 (346)
T ss_pred eEEEECCCCCEEEEEEccCceEEEEEccc--CCccccceeeeecCCCCCCccccCCcccee-------------------
Confidence 8899999999999985 558899999974 34333333334455442110 001122
Q ss_pred cccccCCCCEEEEEe--CCCeEEEEEcCCCeEEEee--eCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccc
Q 000473 95 MGKSSLDNGALISAC--TDGVLCVWSRSSGHCRRRR--KLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFES 170 (1471)
Q Consensus 95 ~~~~s~d~~~LaSas--~DG~I~VWdv~~G~ci~~~--~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~ 170 (1471)
.+.|++++|++.. .| +|+++++++|++-... .++++ .-|..| .|+++++++.+-+-
T Consensus 151 --~~tP~~~~l~v~DLG~D-ri~~y~~~dg~L~~~~~~~v~~G-~GPRHi-~FHpn~k~aY~v~E--------------- 210 (346)
T COG2706 151 --NFTPDGRYLVVPDLGTD-RIFLYDLDDGKLTPADPAEVKPG-AGPRHI-VFHPNGKYAYLVNE--------------- 210 (346)
T ss_pred --eeCCCCCEEEEeecCCc-eEEEEEcccCccccccccccCCC-CCcceE-EEcCCCcEEEEEec---------------
Confidence 2689999998864 44 6899999999866542 23322 224444 55666766544331
Q ss_pred ccccccccccCCCCCCCCCceEEEEeCcc--e--EEEEEe---ecCccccCCeEEEEEeeecCCCCceeEEEE-eCCCcE
Q 000473 171 VEGDLVSEDKEVPMKNPPKCTLVIVDTYG--L--TIVQTV---FHGNLSIGPWKFMDVVSLGEDMGKHYGLMV-DSVGRL 242 (1471)
Q Consensus 171 i~~~~~~~d~~~~~~~~~~~~I~v~D~~t--~--~~l~tl---~s~~~s~~~i~~~~~~~~~~d~~~~~llva-s~dG~V 242 (1471)
.+.+|.+|.-.. + +.++++ ...-....|...+.++ +||+ .+++. -....|
T Consensus 211 -----------------L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis---~dGr--FLYasNRg~dsI 268 (346)
T COG2706 211 -----------------LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHIS---PDGR--FLYASNRGHDSI 268 (346)
T ss_pred -----------------cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEEC---CCCC--EEEEecCCCCeE
Confidence 136777766544 2 233333 2323334466778877 4555 35555 334488
Q ss_pred EEEECCCCCCcccccCCCcccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEE
Q 000473 243 QLVPISKESHLDREEGNGLCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIF 303 (1471)
Q Consensus 243 ~vW~l~~~~~~~~~~~~~l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~ 303 (1471)
-++.++.... .+..+......-..-+...+++.|++|+....++-.+
T Consensus 269 ~~f~V~~~~g--------------~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i 315 (346)
T COG2706 269 AVFSVDPDGG--------------KLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNI 315 (346)
T ss_pred EEEEEcCCCC--------------EEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcE
Confidence 8888876521 0111111111111125567778898888876665433
No 289
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=96.75 E-value=0.24 Score=59.62 Aligned_cols=76 Identities=12% Similarity=-0.000 Sum_probs=53.4
Q ss_pred EcCCCCeEEEEeC----------CCcEEEEEccCCCCCceeeeEEec-ccc--cce----eEeeeccccccccCcccccc
Q 000473 24 ALTQPPTLYTGGS----------DGSILWWSFSDSSYSEIKPVAMLC-GHS--API----ADLSICYPAMVSRDGKAEHW 86 (1471)
Q Consensus 24 ~SpDg~~LaTGs~----------DG~I~lWdl~~~~~~~~~~~~~L~-GH~--~~V----t~La~c~~~~~s~dg~~~~~ 86 (1471)
+||||+.|+.+.. +..|.+||..+ .+++..+. +-. +.+ ..++
T Consensus 53 ~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t-----~~~~~~i~~p~~p~~~~~~~~~~~~---------------- 111 (352)
T TIGR02658 53 VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQT-----HLPIADIELPEGPRFLVGTYPWMTS---------------- 111 (352)
T ss_pred ECCCCCEEEEEeccccccccCCCCCEEEEEECcc-----CcEEeEEccCCCchhhccCccceEE----------------
Confidence 9999999998755 78899999985 23333332 111 000 1111
Q ss_pred cccccccccccccCCCCEEEEEe-C-CCeEEEEEcCCCeEEEeeeCC
Q 000473 87 KAENSSNVMGKSSLDNGALISAC-T-DGVLCVWSRSSGHCRRRRKLP 131 (1471)
Q Consensus 87 ~~~~~~~~~~~~s~d~~~LaSas-~-DG~I~VWdv~~G~ci~~~~l~ 131 (1471)
+++|+++|.... + ++.|-|.|+.+++.+.....|
T Consensus 112 -----------ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp 147 (352)
T TIGR02658 112 -----------LTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVP 147 (352)
T ss_pred -----------ECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeCC
Confidence 588888787654 3 799999999999999988776
No 290
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=96.73 E-value=0.0096 Score=73.20 Aligned_cols=77 Identities=16% Similarity=0.056 Sum_probs=64.0
Q ss_pred ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCC
Q 000473 619 HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAV 698 (1471)
Q Consensus 619 ~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~ 698 (1471)
.....|.+.+++|+ .+.++.|+.||+|.+||...+-... ....-.++.++|+|+|.++++|+.-
T Consensus 257 pL~s~v~~ca~sp~------E~kLvlGC~DgSiiLyD~~~~~t~~--~ka~~~P~~iaWHp~gai~~V~s~q-------- 320 (545)
T PF11768_consen 257 PLPSQVICCARSPS------EDKLVLGCEDGSIILYDTTRGVTLL--AKAEFIPTLIAWHPDGAIFVVGSEQ-------- 320 (545)
T ss_pred ecCCcceEEecCcc------cceEEEEecCCeEEEEEcCCCeeee--eeecccceEEEEcCCCcEEEEEcCC--------
Confidence 35678999999999 7899999999999999997664332 2345568899999999999999988
Q ss_pred CEEEEEECCCCeE
Q 000473 699 DVLFIWDVKTGAR 711 (1471)
Q Consensus 699 gtV~VWDi~tg~~ 711 (1471)
|.+.+||+.-...
T Consensus 321 GelQ~FD~ALspi 333 (545)
T PF11768_consen 321 GELQCFDMALSPI 333 (545)
T ss_pred ceEEEEEeecCcc
Confidence 9999999865443
No 291
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=96.69 E-value=0.79 Score=56.11 Aligned_cols=104 Identities=11% Similarity=0.206 Sum_probs=68.5
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEE--EEE-CCCcEEEEECCC----CceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLV--SGS-MDCSIRIWDLGS----GNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~--SGs-~DgtI~lWDl~t----g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
...|...+|-|. |..++ +|. .-.++.++-+++ .+++..|.. ...+.+.|+|. |+.+
T Consensus 445 ke~vi~FaWEP~---------gdkF~vi~g~~~k~tvsfY~~e~~~~~~~lVk~~dk--~~~N~vfwsPk------G~fv 507 (698)
T KOG2314|consen 445 KESVIAFAWEPH---------GDKFAVISGNTVKNTVSFYAVETNIKKPSLVKELDK--KFANTVFWSPK------GRFV 507 (698)
T ss_pred chheeeeeeccC---------CCeEEEEEccccccceeEEEeecCCCchhhhhhhcc--cccceEEEcCC------CcEE
Confidence 456788888886 44333 332 235788888773 223334432 45678999998 8877
Q ss_pred EEE---eCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 643 LSV---GEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 643 ~S~---s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
+.+ |.-|.+.++|+.-..+..+-.......+.+.|.|.|+|+++++.-
T Consensus 508 vva~l~s~~g~l~F~D~~~a~~k~~~~~eh~~at~veWDPtGRYvvT~ss~ 558 (698)
T KOG2314|consen 508 VVAALVSRRGDLEFYDTDYADLKDTASPEHFAATEVEWDPTGRYVVTSSSS 558 (698)
T ss_pred EEEEecccccceEEEecchhhhhhccCccccccccceECCCCCEEEEeeeh
Confidence 765 457899999987544433332222346889999999999987754
No 292
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=96.67 E-value=0.006 Score=74.96 Aligned_cols=83 Identities=19% Similarity=0.243 Sum_probs=62.5
Q ss_pred cceeeeEecCCCCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCc
Q 000473 2 KCRSVACIWSGTPPSHRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDG 81 (1471)
Q Consensus 2 ~~~~~~~lw~~~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg 81 (1471)
||-+|--+ |-...|++++++|+...|+.|+.||+|++||... + ...+.-+.-..+.++
T Consensus 249 qrvsvtsi----pL~s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~---~----~t~~~ka~~~P~~ia----------- 306 (545)
T PF11768_consen 249 QRVSVTSI----PLPSQVICCARSPSEDKLVLGCEDGSIILYDTTR---G----VTLLAKAEFIPTLIA----------- 306 (545)
T ss_pred eEEEEEEE----ecCCcceEEecCcccceEEEEecCCeEEEEEcCC---C----eeeeeeecccceEEE-----------
Confidence 44444444 4456799999999999999999999999999873 1 122222334456666
Q ss_pred ccccccccccccccccccCCCCEEEEEeCCCeEEEEEcCCC
Q 000473 82 KAEHWKAENSSNVMGKSSLDNGALISACTDGVLCVWSRSSG 122 (1471)
Q Consensus 82 ~~~~~~~~~~~~~~~~~s~d~~~LaSas~DG~I~VWdv~~G 122 (1471)
+.|++..++.|++-|+|.+||+.-.
T Consensus 307 ----------------WHp~gai~~V~s~qGelQ~FD~ALs 331 (545)
T PF11768_consen 307 ----------------WHPDGAIFVVGSEQGELQCFDMALS 331 (545)
T ss_pred ----------------EcCCCcEEEEEcCCceEEEEEeecC
Confidence 5788999999999999999998643
No 293
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=96.58 E-value=0.1 Score=62.16 Aligned_cols=146 Identities=13% Similarity=0.129 Sum_probs=107.7
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCC-cEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCc
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSG-EIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTA 585 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG-~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~ 585 (1471)
+|.+.|.-.....+. ..++.|..|| .+.|++.+ ++ .+..+.+.-+.|.++..+|+
T Consensus 357 ~~~~~VrY~r~~~~~----e~~vigt~dgD~l~iyd~~---------------~~-e~kr~e~~lg~I~av~vs~d---- 412 (668)
T COG4946 357 GKKGGVRYRRIQVDP----EGDVIGTNDGDKLGIYDKD---------------GG-EVKRIEKDLGNIEAVKVSPD---- 412 (668)
T ss_pred CCCCceEEEEEccCC----cceEEeccCCceEEEEecC---------------Cc-eEEEeeCCccceEEEEEcCC----
Confidence 455556554433333 3789999999 67774333 33 34566777889999999997
Q ss_pred ccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC----CcEEEEECCCCcE
Q 000473 586 KGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED----FSVALASLETLRV 661 (1471)
Q Consensus 586 ~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D----gsV~lWdl~t~~~ 661 (1471)
++.++.+...+.+-+.|+.+|+....=+...+-|+.+.|+|+ ++.+|-+--+ ..|+++|+.+++.
T Consensus 413 -----GK~~vvaNdr~el~vididngnv~~idkS~~~lItdf~~~~n------sr~iAYafP~gy~tq~Iklydm~~~Ki 481 (668)
T COG4946 413 -----GKKVVVANDRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPN------SRWIAYAFPEGYYTQSIKLYDMDGGKI 481 (668)
T ss_pred -----CcEEEEEcCceEEEEEEecCCCeeEecccccceeEEEEEcCC------ceeEEEecCcceeeeeEEEEecCCCeE
Confidence 899999999999999999999876555566788999999999 8888887655 4689999998887
Q ss_pred EEEecCCCCCcEEEEEcCCCCEEEEEE
Q 000473 662 ERMFPGHPNYPAKVVWDCPRGYIACLC 688 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg~~L~sgs 688 (1471)
...-. ..+.=.+-+|.||+++|.--+
T Consensus 482 y~vTT-~ta~DfsPaFD~d~ryLYfLs 507 (668)
T COG4946 482 YDVTT-PTAYDFSPAFDPDGRYLYFLS 507 (668)
T ss_pred EEecC-CcccccCcccCCCCcEEEEEe
Confidence 54432 223334668999999886544
No 294
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=96.56 E-value=0.0018 Score=80.90 Aligned_cols=131 Identities=21% Similarity=0.202 Sum_probs=101.4
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
++.++|..|+...+|++|+-+ .++|+.|+..|.|+++++.+|.......+|..+|+-+.-+.+ |.
T Consensus 1092 r~w~~frd~~~~fTc~afs~~---------~~hL~vG~~~Geik~~nv~sG~~e~s~ncH~SavT~vePs~d------gs 1156 (1516)
T KOG1832|consen 1092 RSWRSFRDETALFTCIAFSGG---------TNHLAVGSHAGEIKIFNVSSGSMEESVNCHQSAVTLVEPSVD------GS 1156 (1516)
T ss_pred ccchhhhccccceeeEEeecC---------CceEEeeeccceEEEEEccCccccccccccccccccccccCC------cc
Confidence 445667889999999999976 799999999999999999999999999999999999987777 66
Q ss_pred EEEEEeCC--CcEEEEECC-CCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE-Ee
Q 000473 641 CFLSVGED--FSVALASLE-TLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV-LR 716 (1471)
Q Consensus 641 ~l~S~s~D--gsV~lWdl~-t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~-l~ 716 (1471)
..++.+.- --..+|++. ++...+.|.+ -.++.|+..-.+-+.|+.. ..+.+||++|+..+.+ ++
T Consensus 1157 ~~Ltsss~S~PlsaLW~~~s~~~~~Hsf~e----d~~vkFsn~~q~r~~gt~~--------d~a~~YDvqT~~~l~tylt 1224 (1516)
T KOG1832|consen 1157 TQLTSSSSSSPLSALWDASSTGGPRHSFDE----DKAVKFSNSLQFRALGTEA--------DDALLYDVQTCSPLQTYLT 1224 (1516)
T ss_pred eeeeeccccCchHHHhccccccCccccccc----cceeehhhhHHHHHhcccc--------cceEEEecccCcHHHHhcC
Confidence 55554433 257899985 3555555543 3478898765555555554 4799999999988776 44
Q ss_pred CC
Q 000473 717 GT 718 (1471)
Q Consensus 717 gH 718 (1471)
+.
T Consensus 1225 ~~ 1226 (1516)
T KOG1832|consen 1225 DT 1226 (1516)
T ss_pred cc
Confidence 43
No 295
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=96.51 E-value=0.014 Score=65.96 Aligned_cols=161 Identities=19% Similarity=0.268 Sum_probs=105.3
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCce---EEEEecc-----CCCEEEEEECCCCCCCCC
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL---ITVMHHH-----VAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~---l~~~~~H-----~~~V~~l~fspd~~~~~~ 638 (1471)
..|.--++++.|..| .+.++|+ .|-.|.+|++..-.- +..++.| +.-|++..|+|..
T Consensus 169 NaH~yhiNSiS~NsD---------~et~lSa-DdLrINLWnl~i~D~sFnIVDiKP~nmeeLteVItSaeFhp~~----- 233 (460)
T COG5170 169 NAHPYHINSISFNSD---------KETLLSA-DDLRINLWNLEIIDGSFNIVDIKPHNMEELTEVITSAEFHPEM----- 233 (460)
T ss_pred ccceeEeeeeeecCc---------hheeeec-cceeeeeccccccCCceEEEeccCccHHHHHHHHhhcccCHhH-----
Confidence 567778999999876 6777776 678899999874331 2223333 3457888999986
Q ss_pred CCEEEEEeCCCcEEEEECCCCcE-E---EE------------ecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEE
Q 000473 639 SDCFLSVGEDFSVALASLETLRV-E---RM------------FPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLF 702 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~-l---~~------------~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~ 702 (1471)
.+.|.-.+..|.|++-|++.... . .. |.+-...|..+.|+++|+|+++-.. -+|+
T Consensus 234 cn~fmYSsSkG~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdy---------ltvk 304 (460)
T COG5170 234 CNVFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDY---------LTVK 304 (460)
T ss_pred cceEEEecCCCcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEecc---------ceEE
Confidence 45788888899999999983211 0 11 1122346788999999999987655 4999
Q ss_pred EEECCC-CeEEEEEeCCCC-----------CceeeeeeeccccccccceEEcCCcccccccee
Q 000473 703 IWDVKT-GARERVLRGTAS-----------HSMFDHFCKGISMNSISGSVLNGNTSVSSLLLP 753 (1471)
Q Consensus 703 VWDi~t-g~~~~~l~gH~~-----------~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~ 753 (1471)
|||+.. ..++.++.-|.. ..+...|. +.+...+..|++|++.-.-.+.|
T Consensus 305 iwDvnm~k~pikTi~~h~~l~~~l~d~YEnDaifdkFe--isfSgd~~~v~sgsy~NNfgiyp 365 (460)
T COG5170 305 IWDVNMAKNPIKTIPMHCDLMDELNDVYENDAIFDKFE--ISFSGDDKHVLSGSYSNNFGIYP 365 (460)
T ss_pred EEecccccCCceeechHHHHHHHHHhhhhccceeeeEE--EEecCCcccccccccccceeeec
Confidence 999975 457777765531 22222222 23333344566666655555555
No 296
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=96.40 E-value=0.0014 Score=81.77 Aligned_cols=162 Identities=15% Similarity=0.174 Sum_probs=110.5
Q ss_pred cccccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCC
Q 000473 504 DFVHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVG 583 (1471)
Q Consensus 504 ~~~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~ 583 (1471)
+|..|...-||+++..... .++.|+..|.|++++ +.+|.......+|..+|+-+.-+-+
T Consensus 1096 ~frd~~~~fTc~afs~~~~----hL~vG~~~Geik~~n---------------v~sG~~e~s~ncH~SavT~vePs~d-- 1154 (1516)
T KOG1832|consen 1096 SFRDETALFTCIAFSGGTN----HLAVGSHAGEIKIFN---------------VSSGSMEESVNCHQSAVTLVEPSVD-- 1154 (1516)
T ss_pred hhhccccceeeEEeecCCc----eEEeeeccceEEEEE---------------ccCccccccccccccccccccccCC--
Confidence 4566777888888666555 699999999999843 4566677778899999998876655
Q ss_pred CcccCcCCC-EEEEEECCC-cEEEEECC-CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCc
Q 000473 584 TAKGWSFNE-VLVSGSMDC-SIRIWDLG-SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLR 660 (1471)
Q Consensus 584 ~~~~~~~~~-~L~SGs~Dg-tI~lWDl~-tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~ 660 (1471)
+. .|.+.+... -..+|++. ++.+.+.|... .++.|+.. .+.-+.|..-..+.+||++++.
T Consensus 1155 -------gs~~Ltsss~S~PlsaLW~~~s~~~~~Hsf~ed----~~vkFsn~------~q~r~~gt~~d~a~~YDvqT~~ 1217 (1516)
T KOG1832|consen 1155 -------GSTQLTSSSSSSPLSALWDASSTGGPRHSFDED----KAVKFSNS------LQFRALGTEADDALLYDVQTCS 1217 (1516)
T ss_pred -------cceeeeeccccCchHHHhccccccCcccccccc----ceeehhhh------HHHHHhcccccceEEEecccCc
Confidence 44 444444444 56799986 46667777543 46777664 2222223333568899999998
Q ss_pred EEEEe-cC---CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 661 VERMF-PG---HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 661 ~l~~~-~g---h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
++.++ .+ ....-+++.|+|++.+++ .| | .+||++..+.++.+.
T Consensus 1218 ~l~tylt~~~~~~y~~n~a~FsP~D~LIl---nd--------G--vLWDvR~~~aIh~FD 1264 (1516)
T KOG1832|consen 1218 PLQTYLTDTVTSSYSNNLAHFSPCDTLIL---ND--------G--VLWDVRIPEAIHRFD 1264 (1516)
T ss_pred HHHHhcCcchhhhhhccccccCCCcceEe---eC--------c--eeeeeccHHHHhhhh
Confidence 87663 22 122336789999999887 22 4 579999877666554
No 297
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=96.28 E-value=0.13 Score=57.95 Aligned_cols=113 Identities=12% Similarity=-0.064 Sum_probs=80.3
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe-cCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF-PGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~-~gh~ 669 (1471)
+..++.++.++.+..||..+|+.+..+............ . +..++.++.|+.+..+|.++|+.+... ....
T Consensus 36 ~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~--~------~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~ 107 (238)
T PF13360_consen 36 GGRVYVASGDGNLYALDAKTGKVLWRFDLPGPISGAPVV--D------GGRVYVGTSDGSLYALDAKTGKVLWSIYLTSS 107 (238)
T ss_dssp TTEEEEEETTSEEEEEETTTSEEEEEEECSSCGGSGEEE--E------TTEEEEEETTSEEEEEETTTSCEEEEEEE-SS
T ss_pred CCEEEEEcCCCEEEEEECCCCCEEEEeeccccccceeee--c------ccccccccceeeeEecccCCcceeeeeccccc
Confidence 577777799999999999999998887752211111111 1 456777778999999999999998874 3321
Q ss_pred C---CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCC
Q 000473 670 N---YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 670 ~---~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
. .........+++.+++++.+ +.|+.+|+++|+.+.....+.
T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~--------g~l~~~d~~tG~~~w~~~~~~ 152 (238)
T PF13360_consen 108 PPAGVRSSSSPAVDGDRLYVGTSS--------GKLVALDPKTGKLLWKYPVGE 152 (238)
T ss_dssp CTCSTB--SEEEEETTEEEEEETC--------SEEEEEETTTTEEEEEEESST
T ss_pred cccccccccCceEecCEEEEEecc--------CcEEEEecCCCcEEEEeecCC
Confidence 1 11222333347888888877 999999999999998887754
No 298
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=96.15 E-value=0.13 Score=61.74 Aligned_cols=103 Identities=11% Similarity=0.027 Sum_probs=80.9
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE-e---------CCCcEEEEECCCCcEEEEecCCCC
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV-G---------EDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~-s---------~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
++|.+.|..+++.+..+..-..+-. + ++|| ++.+..+ + .+..|.+||..+++.+..++....
T Consensus 27 ~~v~ViD~~~~~v~g~i~~G~~P~~-~-~spD------g~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~ 98 (352)
T TIGR02658 27 TQVYTIDGEAGRVLGMTDGGFLPNP-V-VASD------GSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEG 98 (352)
T ss_pred ceEEEEECCCCEEEEEEEccCCCce-e-ECCC------CCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCC
Confidence 8999999999999888875444433 4 8999 6655554 4 589999999999999988874322
Q ss_pred -------CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 671 -------YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 671 -------~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
.....+++|||++|++...+ .+..|.|.|+.+++.+.++.-
T Consensus 99 p~~~~~~~~~~~~ls~dgk~l~V~n~~------p~~~V~VvD~~~~kvv~ei~v 146 (352)
T TIGR02658 99 PRFLVGTYPWMTSLTPDNKTLLFYQFS------PSPAVGVVDLEGKAFVRMMDV 146 (352)
T ss_pred chhhccCccceEEECCCCCEEEEecCC------CCCEEEEEECCCCcEEEEEeC
Confidence 34588999999999987643 238999999999999988765
No 299
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.14 E-value=0.43 Score=59.25 Aligned_cols=122 Identities=11% Similarity=0.077 Sum_probs=81.3
Q ss_pred cEEEEEEecCCCCcccCcCCC-EEEEEEC---CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEE-EEe
Q 000473 572 AVLCLAAHRMVGTAKGWSFNE-VLVSGSM---DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFL-SVG 646 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~-~L~SGs~---DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~-S~s 646 (1471)
......|+|| ++ .++-.+. +..|.++|+.+|+..... ...+.+....|+|| |+.++ +.+
T Consensus 189 ~~~~p~wSpD---------G~~~i~y~s~~~~~~~Iyv~dl~tg~~~~lt-~~~g~~~~~~~SPD------G~~la~~~~ 252 (419)
T PRK04043 189 LNIFPKWANK---------EQTAFYYTSYGERKPTLYKYNLYTGKKEKIA-SSQGMLVVSDVSKD------GSKLLLTMA 252 (419)
T ss_pred CeEeEEECCC---------CCcEEEEEEccCCCCEEEEEECCCCcEEEEe-cCCCcEEeeEECCC------CCEEEEEEc
Confidence 6778899998 55 3543333 357999999988765443 35556677889999 76544 444
Q ss_pred CC--CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 647 ED--FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 647 ~D--gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
.+ ..|.++|+.+++..+ +..+........|+|||+.|+-.+.. .+...|++.|+.+|+..+..
T Consensus 253 ~~g~~~Iy~~dl~~g~~~~-LT~~~~~d~~p~~SPDG~~I~F~Sdr-----~g~~~Iy~~dl~~g~~~rlt 317 (419)
T PRK04043 253 PKGQPDIYLYDTNTKTLTQ-ITNYPGIDVNGNFVEDDKRIVFVSDR-----LGYPNIFMKKLNSGSVEQVV 317 (419)
T ss_pred cCCCcEEEEEECCCCcEEE-cccCCCccCccEECCCCCEEEEEECC-----CCCceEEEEECCCCCeEeCc
Confidence 34 567777888877544 33333323456899999988877653 11247999999988875543
No 300
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.11 E-value=2.4 Score=49.92 Aligned_cols=87 Identities=17% Similarity=0.287 Sum_probs=57.7
Q ss_pred CCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccccccCCCCEEEEEeC---CCe
Q 000473 37 DGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMGKSSLDNGALISACT---DGV 113 (1471)
Q Consensus 37 DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~~~s~d~~~LaSas~---DG~ 113 (1471)
+..|.+|++++. .++..... +..+.+..+-|+ +++++++|-++.+ +|.
T Consensus 15 s~gI~v~~ld~~-~g~l~~~~-~v~~~~nptyl~---------------------------~~~~~~~LY~v~~~~~~gg 65 (346)
T COG2706 15 SQGIYVFNLDTK-TGELSLLQ-LVAELGNPTYLA---------------------------VNPDQRHLYVVNEPGEEGG 65 (346)
T ss_pred CCceEEEEEeCc-ccccchhh-hccccCCCceEE---------------------------ECCCCCEEEEEEecCCcCc
Confidence 567999999842 22233222 346777888888 6788888888755 467
Q ss_pred EEEEEcC--CCeEEEe--eeCCCCCCCCcEEEEcCCCCeEEEEcce
Q 000473 114 LCVWSRS--SGHCRRR--RKLPPWVGSPSVICTLPSNPRYVCIGCC 155 (1471)
Q Consensus 114 I~VWdv~--~G~ci~~--~~l~~~~g~~~~i~~~s~~~~ll~~G~~ 155 (1471)
+..+.++ +|++-.. ..++ |.|.+-..+.++++++.+..|
T Consensus 66 vaay~iD~~~G~Lt~ln~~~~~---g~~p~yvsvd~~g~~vf~AnY 108 (346)
T COG2706 66 VAAYRIDPDDGRLTFLNRQTLP---GSPPCYVSVDEDGRFVFVANY 108 (346)
T ss_pred EEEEEEcCCCCeEEEeeccccC---CCCCeEEEECCCCCEEEEEEc
Confidence 7776665 4764433 3444 667666678888888777765
No 301
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=96.08 E-value=0.54 Score=52.89 Aligned_cols=104 Identities=17% Similarity=0.082 Sum_probs=72.8
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCC----------EEEEEECCCCCCCCCCCEEEEEeCCCc-EEEEECCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP----------VRQIILSPPQTEHPWSDCFLSVGEDFS-VALASLETL 659 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~----------V~~l~fspd~~~~~~~~~l~S~s~Dgs-V~lWdl~t~ 659 (1471)
+..++.+..++.|..+|+.+|+.+..+..+... +..-.+..+ + .+..++.++. +.+ |+.++
T Consensus 122 ~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~------~-~v~~~~~~g~~~~~-d~~tg 193 (238)
T PF13360_consen 122 GDRLYVGTSSGKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVISD------G-RVYVSSGDGRVVAV-DLATG 193 (238)
T ss_dssp TTEEEEEETCSEEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECCT------T-EEEEECCTSSEEEE-ETTTT
T ss_pred cCEEEEEeccCcEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEEC------C-EEEEEcCCCeEEEE-ECCCC
Confidence 677888888999999999999999888765432 112222223 4 6666666774 666 99999
Q ss_pred cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 660 RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
+.+.... ...+.. ...+++..|++++.+ +.|+.||.+||+..-
T Consensus 194 ~~~w~~~--~~~~~~-~~~~~~~~l~~~~~~--------~~l~~~d~~tG~~~W 236 (238)
T PF13360_consen 194 EKLWSKP--ISGIYS-LPSVDGGTLYVTSSD--------GRLYALDLKTGKVVW 236 (238)
T ss_dssp EEEEEEC--SS-ECE-CEECCCTEEEEEETT--------TEEEEEETTTTEEEE
T ss_pred CEEEEec--CCCccC-CceeeCCEEEEEeCC--------CEEEEEECCCCCEEe
Confidence 9765443 222222 145778889888877 999999999998764
No 302
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=96.04 E-value=0.016 Score=43.46 Aligned_cols=39 Identities=21% Similarity=0.401 Sum_probs=34.5
Q ss_pred CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 659 LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 659 ~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
++++..+..|...|.++.|.+++.++++++.| +.+++||
T Consensus 2 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~d--------~~~~~~~ 40 (40)
T smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASASDD--------GTIKLWD 40 (40)
T ss_pred cEEEEEEEecCCceeEEEECCCCCEEEEecCC--------CeEEEcC
Confidence 45667778899999999999999999999998 9999996
No 303
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=96.00 E-value=0.16 Score=56.90 Aligned_cols=121 Identities=13% Similarity=0.040 Sum_probs=83.5
Q ss_pred CEEEEEECCCcEEEEECCCCceEEEEeccCC--CEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-c-EEEEe-c
Q 000473 592 EVLVSGSMDCSIRIWDLGSGNLITVMHHHVA--PVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL-R-VERMF-P 666 (1471)
Q Consensus 592 ~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~--~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-~-~l~~~-~ 666 (1471)
-.+.-++.|.++++.++.-+..... .|.. .+.++.++++ +.+.++++....|-+|.+... + .++.. .
T Consensus 129 ~~~~i~sndht~k~~~~~~~s~~~~--~h~~~~~~ns~~~snd------~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a 200 (344)
T KOG4532|consen 129 FPLNIASNDHTGKTMVVSGDSNKFA--VHNQNLTQNSLHYSND------PSWGSSVGDSRRVFRYAIDDESEYIENIYEA 200 (344)
T ss_pred cceeeccCCcceeEEEEecCcccce--eeccccceeeeEEcCC------CceEEEecCCCcceEEEeCCccceeeeeEec
Confidence 3466778999999999875543222 3433 3788999999 899999999999999998643 2 22322 2
Q ss_pred CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC-eEEEEEe----CCCCCceeeeee
Q 000473 667 GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG-ARERVLR----GTASHSMFDHFC 728 (1471)
Q Consensus 667 gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg-~~~~~l~----gH~~~v~~~~~~ 728 (1471)
...+.=.+..|+.....+|++..| |++.|||++.. .+.+..+ .|.+.+-.++|.
T Consensus 201 ~t~D~gF~~S~s~~~~~FAv~~Qd--------g~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~Fs 259 (344)
T KOG4532|consen 201 PTSDHGFYNSFSENDLQFAVVFQD--------GTCAIYDVRNMATPMAEISSTRPHHNGAFRVCRFS 259 (344)
T ss_pred ccCCCceeeeeccCcceEEEEecC--------CcEEEEEecccccchhhhcccCCCCCCceEEEEec
Confidence 223344688999999999999999 99999999874 3333322 244444444454
No 304
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=95.96 E-value=0.017 Score=43.30 Aligned_cols=38 Identities=34% Similarity=0.478 Sum_probs=33.6
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD 607 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD 607 (1471)
++...+.+|...|+++.|++. +..+++|+.|+.+++||
T Consensus 3 ~~~~~~~~~~~~i~~~~~~~~---------~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 3 ELLKTLKGHTGPVTSVAFSPD---------GKYLASASDDGTIKLWD 40 (40)
T ss_pred EEEEEEEecCCceeEEEECCC---------CCEEEEecCCCeEEEcC
Confidence 456777889999999999986 68999999999999996
No 305
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=95.87 E-value=0.021 Score=66.84 Aligned_cols=94 Identities=13% Similarity=0.153 Sum_probs=74.8
Q ss_pred EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCC
Q 000473 603 IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRG 682 (1471)
Q Consensus 603 I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~ 682 (1471)
|++.+..+-+....+..|+..|..++|+|.+ ...+..++.+..|+|.|+++..++.++..| ..+++++|.-|+.
T Consensus 175 v~~l~~~~fkssq~lp~~g~~IrdlafSp~~-----~GLl~~asl~nkiki~dlet~~~vssy~a~-~~~wSC~wDlde~ 248 (463)
T KOG1645|consen 175 VQKLESHDFKSSQILPGEGSFIRDLAFSPFN-----EGLLGLASLGNKIKIMDLETSCVVSSYIAY-NQIWSCCWDLDER 248 (463)
T ss_pred eEEeccCCcchhhcccccchhhhhhccCccc-----cceeeeeccCceEEEEecccceeeeheecc-CCceeeeeccCCc
Confidence 5555554444455667788899999999983 337889999999999999999999999888 7799999988765
Q ss_pred -EEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 683 -YIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 683 -~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
+|..|-.+ |.|+|||++.-+
T Consensus 249 h~IYaGl~n--------G~VlvyD~R~~~ 269 (463)
T KOG1645|consen 249 HVIYAGLQN--------GMVLVYDMRQPE 269 (463)
T ss_pred ceeEEeccC--------ceEEEEEccCCC
Confidence 55555555 999999998643
No 306
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=95.84 E-value=0.071 Score=63.30 Aligned_cols=208 Identities=16% Similarity=0.122 Sum_probs=127.1
Q ss_pred ceEEEEEEcCCCCeEEEEeCCCcEEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCcccccccccccccccc
Q 000473 17 HRVTATSALTQPPTLYTGGSDGSILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMG 96 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~ 96 (1471)
.-|+.+..+-- +++.+++.||.++.|--....+ ..-+..+..|-+.|..|+
T Consensus 10 d~i~hv~~tka-~fiiqASlDGh~KFWkKs~isG--vEfVKhFraHL~~I~sl~-------------------------- 60 (558)
T KOG0882|consen 10 DVITHVFPTKA-KFIIQASLDGHKKFWKKSRISG--VEFVKHFRAHLGVILSLA-------------------------- 60 (558)
T ss_pred ceeeeEeeehh-heEEeeecchhhhhcCCCCccc--eeehhhhHHHHHHHHhhh--------------------------
Confidence 45777777644 7999999999999998652001 223334557888888886
Q ss_pred cccCCCCEEEEEeC-CCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCC---eEEEEcceecccCCccccccccccc
Q 000473 97 KSSLDNGALISACT-DGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNP---RYVCIGCCFIDTNQLSDHHSFESVE 172 (1471)
Q Consensus 97 ~~s~d~~~LaSas~-DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~---~ll~~G~~~id~~~~~~~h~~~~i~ 172 (1471)
.+-|+.++.|.++ |..+++.|+.+-..+.-.++. .-|..+.-....+ .+++++.. +
T Consensus 61 -~S~dg~L~~Sv~d~Dhs~KvfDvEn~DminmiKL~---~lPg~a~wv~skGd~~s~IAVs~~--~-------------- 120 (558)
T KOG0882|consen 61 -VSYDGWLFRSVEDPDHSVKVFDVENFDMINMIKLV---DLPGFAEWVTSKGDKISLIAVSLF--K-------------- 120 (558)
T ss_pred -ccccceeEeeccCcccceeEEEeeccchhhhcccc---cCCCceEEecCCCCeeeeEEeecc--c--------------
Confidence 5677888999888 999999999877666556665 4455554333222 35555543 0
Q ss_pred ccccccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCC
Q 000473 173 GDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESH 252 (1471)
Q Consensus 173 ~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~ 252 (1471)
.+.+.++|......-...+- ..+..+|..+-..+.+ +..+..+.+|.|+-|..+.. -
T Consensus 121 ----------------sg~i~VvD~~~d~~q~~~fk-klH~sPV~~i~y~qa~-----Ds~vSiD~~gmVEyWs~e~~-~ 177 (558)
T KOG0882|consen 121 ----------------SGKIFVVDGFGDFCQDGYFK-KLHFSPVKKIRYNQAG-----DSAVSIDISGMVEYWSAEGP-F 177 (558)
T ss_pred ----------------CCCcEEECCcCCcCccceec-ccccCceEEEEeeccc-----cceeeccccceeEeecCCCc-c
Confidence 26677777776543222222 3334458877777443 44666688999999998852 0
Q ss_pred cccccCCCc--ccCC-CcccceeccCCcccCceEEEEecCCcEEEEEeCCeE
Q 000473 253 LDREEGNGL--CKSS-SQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHC 301 (1471)
Q Consensus 253 ~~~~~~~~l--~~~e-~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~ 301 (1471)
+ ....+-. +++| ...+... .......+.|+|+|..+.+.+.++-
T Consensus 178 q-fPr~~l~~~~K~eTdLy~f~K----~Kt~pts~Efsp~g~qistl~~Drk 224 (558)
T KOG0882|consen 178 Q-FPRTNLNFELKHETDLYGFPK----AKTEPTSFEFSPDGAQISTLNPDRK 224 (558)
T ss_pred c-Cccccccccccccchhhcccc----cccCccceEEccccCcccccCcccE
Confidence 0 0111000 1222 1111111 1123367889999999999887763
No 307
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=95.79 E-value=0.25 Score=60.22 Aligned_cols=107 Identities=12% Similarity=0.011 Sum_probs=72.9
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
+..++.++.|+.+..+|..+|+.+....... .+.+ +|. ..++.++.++.|+.+..||.++|+.+..+.....
T Consensus 105 ~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~-~~~~---~p~----v~~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~ 176 (377)
T TIGR03300 105 GGLVFVGTEKGEVIALDAEDGKELWRAKLSS-EVLS---PPL----VANGLVVVRTNDGRLTALDAATGERLWTYSRVTP 176 (377)
T ss_pred CCEEEEEcCCCEEEEEECCCCcEeeeeccCc-eeec---CCE----EECCEEEEECCCCeEEEEEcCCCceeeEEccCCC
Confidence 5677788899999999999999887665332 2221 121 0134677778899999999999998877654322
Q ss_pred CcE-----EEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 671 YPA-----KVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 671 ~V~-----~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
... ..... +..++.+..+ |.++.+|.++|+.+...
T Consensus 177 ~~~~~~~~sp~~~--~~~v~~~~~~--------g~v~ald~~tG~~~W~~ 216 (377)
T TIGR03300 177 ALTLRGSASPVIA--DGGVLVGFAG--------GKLVALDLQTGQPLWEQ 216 (377)
T ss_pred ceeecCCCCCEEE--CCEEEEECCC--------CEEEEEEccCCCEeeee
Confidence 110 11111 3467777776 89999999999876543
No 308
>PRK04043 tolB translocation protein TolB; Provisional
Probab=95.78 E-value=0.67 Score=57.53 Aligned_cols=140 Identities=11% Similarity=0.104 Sum_probs=84.5
Q ss_pred cCCcceEEEEecCCccEEEEEEecCCCCcccCcCCC-EEEEEECC--CcEEEEECCCCceEEEEeccCCCEEEEEECCCC
Q 000473 557 VNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNE-VLVSGSMD--CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQ 633 (1471)
Q Consensus 557 ~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~-~L~SGs~D--gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~ 633 (1471)
..+++... +....+.+....|+|| ++ ++++.+.+ ..|.++|+.+++. ..+..+.+......|+||
T Consensus 220 l~tg~~~~-lt~~~g~~~~~~~SPD---------G~~la~~~~~~g~~~Iy~~dl~~g~~-~~LT~~~~~d~~p~~SPD- 287 (419)
T PRK04043 220 LYTGKKEK-IASSQGMLVVSDVSKD---------GSKLLLTMAPKGQPDIYLYDTNTKTL-TQITNYPGIDVNGNFVED- 287 (419)
T ss_pred CCCCcEEE-EecCCCcEEeeEECCC---------CCEEEEEEccCCCcEEEEEECCCCcE-EEcccCCCccCccEECCC-
Confidence 33444433 3334556677889998 54 55555544 4677778887764 445444443345579999
Q ss_pred CCCCCCCEEEEEeC-CC--cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCC-CCCCCEEEEEECCCC
Q 000473 634 TEHPWSDCFLSVGE-DF--SVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRT-SDAVDVLFIWDVKTG 709 (1471)
Q Consensus 634 ~~~~~~~~l~S~s~-Dg--sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~-~D~~gtV~VWDi~tg 709 (1471)
|+.++-.+. .+ .|.+.|+.+++..+.... ... ...|+|||++|+..+..-... +.+...|++.|+.+|
T Consensus 288 -----G~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~-g~~--~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g 359 (419)
T PRK04043 288 -----DKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFH-GKN--NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSD 359 (419)
T ss_pred -----CCEEEEEECCCCCceEEEEECCCCCeEeCccC-CCc--CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCC
Confidence 876665553 23 577788888877544322 111 248999999998877541000 001147899999888
Q ss_pred eEEEEEeC
Q 000473 710 ARERVLRG 717 (1471)
Q Consensus 710 ~~~~~l~g 717 (1471)
+. +.++.
T Consensus 360 ~~-~~LT~ 366 (419)
T PRK04043 360 YI-RRLTA 366 (419)
T ss_pred Ce-EECCC
Confidence 64 44543
No 309
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=95.74 E-value=1.2 Score=50.75 Aligned_cols=132 Identities=17% Similarity=0.151 Sum_probs=88.1
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECC--------CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCE
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD--------CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDC 641 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~D--------gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~ 641 (1471)
....+.+++.|+ +++.++.... +.|..++.. ++.. .+.........++|+|+ ++.
T Consensus 85 ~~~~ND~~vd~~---------G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~-~~~~~~~~pNGi~~s~d------g~~ 147 (246)
T PF08450_consen 85 FNRPNDVAVDPD---------GNLYVTDSGGGGASGIDPGSVYRIDPD-GKVT-VVADGLGFPNGIAFSPD------GKT 147 (246)
T ss_dssp TEEEEEEEE-TT---------S-EEEEEECCBCTTCGGSEEEEEEETT-SEEE-EEEEEESSEEEEEEETT------SSE
T ss_pred cCCCceEEEcCC---------CCEEEEecCCCccccccccceEEECCC-CeEE-EEecCcccccceEECCc------chh
Confidence 456889999987 7777776644 457777776 5543 33334556789999999 775
Q ss_pred E-EEEeCCCcEEEEECCCC-c------EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 642 F-LSVGEDFSVALASLETL-R------VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 642 l-~S~s~DgsV~lWdl~t~-~------~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
| ++-+..+.|..+++... . ....+....+.+-.+++..+|++.++.... +.|.++|.+ |+++.
T Consensus 148 lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~--------~~I~~~~p~-G~~~~ 218 (246)
T PF08450_consen 148 LYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGG--------GRIVVFDPD-GKLLR 218 (246)
T ss_dssp EEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETT--------TEEEEEETT-SCEEE
T ss_pred eeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCC--------CEEEEECCC-ccEEE
Confidence 5 56677888999998532 1 122233333347889999999877765555 899999987 99988
Q ss_pred EEeCCCCCceeeee
Q 000473 714 VLRGTASHSMFDHF 727 (1471)
Q Consensus 714 ~l~gH~~~v~~~~~ 727 (1471)
.+.-.....+.+.|
T Consensus 219 ~i~~p~~~~t~~~f 232 (246)
T PF08450_consen 219 EIELPVPRPTNCAF 232 (246)
T ss_dssp EEE-SSSSEEEEEE
T ss_pred EEcCCCCCEEEEEE
Confidence 88766445555544
No 310
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=95.65 E-value=2.2 Score=48.89 Aligned_cols=129 Identities=13% Similarity=0.111 Sum_probs=86.7
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec-------cCCCEEEEEECCCCCCCCCCCEEEEE
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH-------HVAPVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~-------H~~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
=.-++|+|| +.+|+.+...|+|+++|+.. ..+..+.. -...|..+.|.+......|...|+..
T Consensus 46 WRkl~WSpD---------~tlLa~a~S~G~i~vfdl~g-~~lf~I~p~~~~~~d~~~Aiagl~Fl~~~~s~~ws~ELlvi 115 (282)
T PF15492_consen 46 WRKLAWSPD---------CTLLAYAESTGTIRVFDLMG-SELFVIPPAMSFPGDLSDAIAGLIFLEYKKSAQWSYELLVI 115 (282)
T ss_pred heEEEECCC---------CcEEEEEcCCCeEEEEeccc-ceeEEcCcccccCCccccceeeeEeeccccccccceeEEEE
Confidence 456899998 89999999999999999974 44444332 13567788887766566666678888
Q ss_pred eCCCcEEEEECC-----CCcEEEEec---CCCCCcEEEEEcCCCCEEEEEEcCCCCCCC---CCCEEEEEECCCCeE
Q 000473 646 GEDFSVALASLE-----TLRVERMFP---GHPNYPAKVVWDCPRGYIACLCRDHSRTSD---AVDVLFIWDVKTGAR 711 (1471)
Q Consensus 646 s~DgsV~lWdl~-----t~~~l~~~~---gh~~~V~~v~~spdg~~L~sgs~D~sg~~D---~~gtV~VWDi~tg~~ 711 (1471)
..+|.++=+-+. ..+.-+.|. .+...|.++.++|..+.|++|+......+. ...-+..|.+-++.+
T Consensus 116 ~Y~G~L~Sy~vs~gt~q~y~e~hsfsf~~~yp~Gi~~~vy~p~h~LLlVgG~~~~~~~~s~a~~~GLtaWRiL~~~P 192 (282)
T PF15492_consen 116 NYRGQLRSYLVSVGTNQGYQENHSFSFSSHYPHGINSAVYHPKHRLLLVGGCEQNQDGMSKASSCGLTAWRILSDSP 192 (282)
T ss_pred eccceeeeEEEEcccCCcceeeEEEEecccCCCceeEEEEcCCCCEEEEeccCCCCCccccccccCceEEEEcCCCC
Confidence 888887766542 233334443 246689999999998888777654111000 012477887776654
No 311
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=95.57 E-value=0.074 Score=64.56 Aligned_cols=112 Identities=17% Similarity=0.179 Sum_probs=87.9
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC------
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE------ 647 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~------ 647 (1471)
+-+.|+|. |.+|+|--.-| |.+|--.+...+++| .|. .|.-+.|||. .++|+|=+.
T Consensus 214 tyv~wSP~---------GTYL~t~Hk~G-I~lWGG~~f~r~~RF-~Hp-~Vq~idfSP~------EkYLVT~s~~p~~~~ 275 (698)
T KOG2314|consen 214 TYVRWSPK---------GTYLVTFHKQG-IALWGGESFDRIQRF-YHP-GVQFIDFSPN------EKYLVTYSPEPIIVE 275 (698)
T ss_pred eeEEecCC---------ceEEEEEeccc-eeeecCccHHHHHhc-cCC-CceeeecCCc------cceEEEecCCccccC
Confidence 56789998 89999987766 789987777777887 465 4899999999 788888543
Q ss_pred -----CCcEEEEECCCCcEEEEecCCCC--Cc-EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 648 -----DFSVALASLETLRVERMFPGHPN--YP-AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 648 -----DgsV~lWdl~t~~~l~~~~gh~~--~V-~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
-..+.|||+.+|...+.|+.... .+ .-..||.|++|+|.-..| +|.|++.....++
T Consensus 276 ~~d~e~~~l~IWDI~tG~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~~---------sisIyEtpsf~ll 339 (698)
T KOG2314|consen 276 EDDNEGQQLIIWDIATGLLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTGN---------SISIYETPSFMLL 339 (698)
T ss_pred cccCCCceEEEEEccccchhcceeccCCCccccceEEeccCCceeEEeccc---------eEEEEecCceeee
Confidence 26789999999999988876322 22 356899999999987665 8999998775443
No 312
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=95.48 E-value=0.17 Score=64.90 Aligned_cols=99 Identities=14% Similarity=0.113 Sum_probs=79.6
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC---------CCcEEEEECCCCcE
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE---------DFSVALASLETLRV 661 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~---------DgsV~lWdl~t~~~ 661 (1471)
++++.+|..-|+|.+-|.++-+.+++|..|++.|..+... |+.|+++|. |..|+|||++..+.
T Consensus 187 nr~lf~G~t~G~V~LrD~~s~~~iht~~aHs~siSDfDv~--------GNlLitCG~S~R~~~l~~D~FvkVYDLRmmra 258 (1118)
T KOG1275|consen 187 NRNLFCGDTRGTVFLRDPNSFETIHTFDAHSGSISDFDVQ--------GNLLITCGYSMRRYNLAMDPFVKVYDLRMMRA 258 (1118)
T ss_pred CcEEEeecccceEEeecCCcCceeeeeeccccceeeeecc--------CCeEEEeecccccccccccchhhhhhhhhhhc
Confidence 7999999999999999999999999999999999887764 668888864 77899999999887
Q ss_pred EEEecCCCCCcEEEEEcCCC-CEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 662 ERMFPGHPNYPAKVVWDCPR-GYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg-~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+.-+.-+.++ .-+.|+|.- ..+++.+.. |...+-|.
T Consensus 259 l~PI~~~~~P-~flrf~Psl~t~~~V~S~s--------Gq~q~vd~ 295 (1118)
T KOG1275|consen 259 LSPIQFPYGP-QFLRFHPSLTTRLAVTSQS--------GQFQFVDT 295 (1118)
T ss_pred cCCcccccCc-hhhhhcccccceEEEEecc--------cceeeccc
Confidence 7666555443 557788753 356666665 78888773
No 313
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.40 E-value=0.56 Score=60.17 Aligned_cols=163 Identities=12% Similarity=0.094 Sum_probs=106.4
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCcc-EEEEEEecCCCCcccCcCCCEEEEEECCC----
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGA-VLCLAAHRMVGTAKGWSFNEVLVSGSMDC---- 601 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~-V~~la~spd~~~~~~~~~~~~L~SGs~Dg---- 601 (1471)
.++.|+.+|.|.+ ++ .+.+..+.++.|... |..+ ++-+ ...+|++-+.|.
T Consensus 37 ~vvigt~~G~V~~--Ln--------------~s~~~~~~fqa~~~siv~~L-~~~~--------~~~~L~sv~Ed~~~np 91 (933)
T KOG2114|consen 37 SVVIGTADGRVVI--LN--------------SSFQLIRGFQAYEQSIVQFL-YILN--------KQNFLFSVGEDEQGNP 91 (933)
T ss_pred eEEEeeccccEEE--ec--------------ccceeeehheecchhhhhHh-hccc--------CceEEEEEeecCCCCc
Confidence 7999999999988 33 112334556666666 4443 3332 146888887775
Q ss_pred -cEEEEECCCC------ceE--EEEec-----cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC---C-CcEEE
Q 000473 602 -SIRIWDLGSG------NLI--TVMHH-----HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE---T-LRVER 663 (1471)
Q Consensus 602 -tI~lWDl~tg------~~l--~~~~~-----H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~---t-~~~l~ 663 (1471)
.+++||++.- .++ +.+.. ...++.+++++-+ -.++|.|-.||.|.++.-+ + |....
T Consensus 92 ~llkiw~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~------l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~ 165 (933)
T KOG2114|consen 92 VLLKIWDLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVSED------LKTIVCGFTNGLVICYKGDILRDRGSRQD 165 (933)
T ss_pred eEEEEecccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEEcc------ccEEEEEecCcEEEEEcCcchhcccccee
Confidence 4899999732 344 23333 2457888999888 7899999999999988532 1 11112
Q ss_pred EecCCCCCcEEEEEcCCCCE-EEEEEcCCCCCCCCCCEEEEEECCCCe-EEEEEeCCCCCceeeeeee
Q 000473 664 MFPGHPNYPAKVVWDCPRGY-IACLCRDHSRTSDAVDVLFIWDVKTGA-RERVLRGTASHSMFDHFCK 729 (1471)
Q Consensus 664 ~~~gh~~~V~~v~~spdg~~-L~sgs~D~sg~~D~~gtV~VWDi~tg~-~~~~l~gH~~~v~~~~~~~ 729 (1471)
-......+|+.+++..++.- ++++.. ..|.+|.+.... ....+..|....-+..||+
T Consensus 166 ~~~~~~~pITgL~~~~d~~s~lFv~Tt---------~~V~~y~l~gr~p~~~~ld~~G~~lnCss~~~ 224 (933)
T KOG2114|consen 166 YSHRGKEPITGLALRSDGKSVLFVATT---------EQVMLYSLSGRTPSLKVLDNNGISLNCSSFSD 224 (933)
T ss_pred eeccCCCCceeeEEecCCceeEEEEec---------ceeEEEEecCCCcceeeeccCCccceeeecCC
Confidence 22233568999999988876 444444 479999987444 2455777776666666664
No 314
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=95.39 E-value=0.057 Score=66.76 Aligned_cols=78 Identities=17% Similarity=0.209 Sum_probs=69.1
Q ss_pred CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcE-EEEEcCCCCEEEEEEcCCCCCCCCCCE
Q 000473 622 APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPA-KVVWDCPRGYIACLCRDHSRTSDAVDV 700 (1471)
Q Consensus 622 ~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~-~v~~spdg~~L~sgs~D~sg~~D~~gt 700 (1471)
..|..+.|+|. -..+|.+-.+|.|.+.-+. .+.+.+++-|.-.++ +++|.|||+.|++|-.| |+
T Consensus 21 ~~i~~~ewnP~------~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kd--------G~ 85 (665)
T KOG4640|consen 21 INIKRIEWNPK------MDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKD--------GT 85 (665)
T ss_pred cceEEEEEcCc------cchhheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCCCCEEEEEecC--------Ce
Confidence 35788999998 7899999999999999888 777788887777777 99999999999999999 99
Q ss_pred EEEEECCCCeEEEE
Q 000473 701 LFIWDVKTGARERV 714 (1471)
Q Consensus 701 V~VWDi~tg~~~~~ 714 (1471)
|++-|+++|..+..
T Consensus 86 I~L~Dve~~~~l~~ 99 (665)
T KOG4640|consen 86 IRLHDVEKGGRLVS 99 (665)
T ss_pred EEEEEccCCCceec
Confidence 99999999887655
No 315
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=95.17 E-value=1.7 Score=50.95 Aligned_cols=126 Identities=16% Similarity=0.181 Sum_probs=82.5
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEEC-----CCcEEEEECC-CCceEEEEeccCCCEEEEEECCCCCCC
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM-----DCSIRIWDLG-SGNLITVMHHHVAPVRQIILSPPQTEH 636 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~-----DgtI~lWDl~-tg~~l~~~~~H~~~V~~l~fspd~~~~ 636 (1471)
-+.|+||. .|++| +++|.+.=. .|.|-|||.. +.+.+..|..|.-.-..+.+.||
T Consensus 49 gRHFyGHg------~fs~d---------G~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pD---- 109 (305)
T PF07433_consen 49 GRHFYGHG------VFSPD---------GRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPD---- 109 (305)
T ss_pred CCEEecCE------EEcCC---------CCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCC----
Confidence 35677884 67887 788887643 4789999998 66677788777666667777777
Q ss_pred CCCCEEEEE------------------eCCCcEEEEECCCCcEEEE--e-------------------------------
Q 000473 637 PWSDCFLSV------------------GEDFSVALASLETLRVERM--F------------------------------- 665 (1471)
Q Consensus 637 ~~~~~l~S~------------------s~DgsV~lWdl~t~~~l~~--~------------------------------- 665 (1471)
++.|+.+ ..+.++.+.|..+|+.+.. +
T Consensus 110 --G~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~~~G~V~~a~Q~qg~~~ 187 (305)
T PF07433_consen 110 --GETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLHQLSIRHLAVDGDGTVAFAMQYQGDPG 187 (305)
T ss_pred --CCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCccccccceeeEEecCCCcEEEEEecCCCCC
Confidence 4333332 1122333444444433222 1
Q ss_pred ------------------c-------CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 666 ------------------P-------GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 666 ------------------~-------gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
. .-.+++-+|++++++.++++.|-- .+.+.+||..+|+++....
T Consensus 188 ~~~PLva~~~~g~~~~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPr-------Gg~~~~~d~~tg~~~~~~~ 256 (305)
T PF07433_consen 188 DAPPLVALHRRGGALRLLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSPR-------GGRVAVWDAATGRLLGSVP 256 (305)
T ss_pred ccCCeEEEEcCCCcceeccCChHHHHhhCCceEEEEEeCCCCEEEEECCC-------CCEEEEEECCCCCEeeccc
Confidence 0 112467788999999888777765 3899999999999887644
No 316
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=95.14 E-value=0.029 Score=65.79 Aligned_cols=123 Identities=13% Similarity=0.049 Sum_probs=91.9
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
++.+.+.+|...|..++|+|. ...++..++.+.+|++.|+++......+..| ..+++++|.-++ .+
T Consensus 184 kssq~lp~~g~~IrdlafSp~--------~~GLl~~asl~nkiki~dlet~~~vssy~a~-~~~wSC~wDlde-----~h 249 (463)
T KOG1645|consen 184 KSSQILPGEGSFIRDLAFSPF--------NEGLLGLASLGNKIKIMDLETSCVVSSYIAY-NQIWSCCWDLDE-----RH 249 (463)
T ss_pred chhhcccccchhhhhhccCcc--------ccceeeeeccCceEEEEecccceeeeheecc-CCceeeeeccCC-----cc
Confidence 345567788899999999997 2458999999999999999999998888888 789999999885 67
Q ss_pred EEEEEeCCCcEEEEECCCCcE-EEEecC--CCCCcEEE------EEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 641 CFLSVGEDFSVALASLETLRV-ERMFPG--HPNYPAKV------VWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~t~~~-l~~~~g--h~~~V~~v------~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.+..|-..|.|.++|++..+- +..+.+ -..+|..+ +..+-|.+|+....+ +..|++.
T Consensus 250 ~IYaGl~nG~VlvyD~R~~~~~~~e~~a~~t~~pv~~i~~~~~n~~f~~gglLv~~lt~----------l~f~ei~ 315 (463)
T KOG1645|consen 250 VIYAGLQNGMVLVYDMRQPEGPLMELVANVTINPVHKIAPVQPNKIFTSGGLLVFALTV----------LQFYEIV 315 (463)
T ss_pred eeEEeccCceEEEEEccCCCchHhhhhhhhccCcceeecccCccccccccceEEeeehh----------hhhhhhh
Confidence 899999999999999986432 222222 11223322 345567777766665 6677664
No 317
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=94.80 E-value=2.3 Score=48.83 Aligned_cols=167 Identities=16% Similarity=0.208 Sum_probs=95.1
Q ss_pred ccCC--EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEec-CCccEEEEEEecCCCCcccCcCCCEEEEEEC
Q 000473 523 YAPY--AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLG-HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSM 599 (1471)
Q Consensus 523 f~P~--~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~g-H~~~V~~la~spd~~~~~~~~~~~~L~SGs~ 599 (1471)
|+|+ .++++...|+|++ ++++... ...+.. ...+.+ -..+|..+.|.+-..++ +-...|+.-..
T Consensus 51 WSpD~tlLa~a~S~G~i~v--fdl~g~~-----lf~I~p---~~~~~~d~~~Aiagl~Fl~~~~s~---~ws~ELlvi~Y 117 (282)
T PF15492_consen 51 WSPDCTLLAYAESTGTIRV--FDLMGSE-----LFVIPP---AMSFPGDLSDAIAGLIFLEYKKSA---QWSYELLVINY 117 (282)
T ss_pred ECCCCcEEEEEcCCCeEEE--Eecccce-----eEEcCc---ccccCCccccceeeeEeecccccc---ccceeEEEEec
Confidence 4444 7999999999999 4433211 111110 001111 23567777775532111 11224555556
Q ss_pred CCcEEEEECCC-----CceEEEEec---cCCCEEEEEECCCCCCCCCCCEEEEEe-CCC----------cEEEEECCCCc
Q 000473 600 DCSIRIWDLGS-----GNLITVMHH---HVAPVRQIILSPPQTEHPWSDCFLSVG-EDF----------SVALASLETLR 660 (1471)
Q Consensus 600 DgtI~lWDl~t-----g~~l~~~~~---H~~~V~~l~fspd~~~~~~~~~l~S~s-~Dg----------sV~lWdl~t~~ 660 (1471)
+|.++=+-+.. .+..+.|.- +...|.++.++|. .+.|+.|| ... -+.-|-+-++.
T Consensus 118 ~G~L~Sy~vs~gt~q~y~e~hsfsf~~~yp~Gi~~~vy~p~------h~LLlVgG~~~~~~~~s~a~~~GLtaWRiL~~~ 191 (282)
T PF15492_consen 118 RGQLRSYLVSVGTNQGYQENHSFSFSSHYPHGINSAVYHPK------HRLLLVGGCEQNQDGMSKASSCGLTAWRILSDS 191 (282)
T ss_pred cceeeeEEEEcccCCcceeeEEEEecccCCCceeEEEEcCC------CCEEEEeccCCCCCccccccccCceEEEEcCCC
Confidence 76666555422 233444432 4678999999998 45444443 222 23445432211
Q ss_pred E---------------------EE-----Eec---CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 661 V---------------------ER-----MFP---GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 661 ~---------------------l~-----~~~---gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
+ .+ .+. .....|..|..||||..|++...+ |.|.+|++.+-.+
T Consensus 192 Pyyk~v~~~~~~~~~~~~~~~~~~~~~~~~fs~~~~~~d~i~kmSlSPdg~~La~ih~s--------G~lsLW~iPsL~~ 263 (282)
T PF15492_consen 192 PYYKQVTSSEDDITASSKRRGLLRIPSFKFFSRQGQEQDGIFKMSLSPDGSLLACIHFS--------GSLSLWEIPSLRL 263 (282)
T ss_pred CcEEEccccCccccccccccceeeccceeeeeccccCCCceEEEEECCCCCEEEEEEcC--------CeEEEEecCcchh
Confidence 0 01 111 124578999999999999999998 9999999988766
Q ss_pred EEEEe
Q 000473 712 ERVLR 716 (1471)
Q Consensus 712 ~~~l~ 716 (1471)
.+...
T Consensus 264 ~~~W~ 268 (282)
T PF15492_consen 264 QRSWK 268 (282)
T ss_pred hcccc
Confidence 65544
No 318
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.64 E-value=1.9 Score=55.67 Aligned_cols=127 Identities=14% Similarity=0.131 Sum_probs=88.9
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC----CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEE
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG----SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~----tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
..++.+++++.+ -+.+|.|-.||.|....-+ .|....-...-..+|+.+++..+ +..++-+
T Consensus 125 ~~p~s~l~Vs~~---------l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL~~~~d------~~s~lFv 189 (933)
T KOG2114|consen 125 PSPASSLAVSED---------LKTIVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGLALRSD------GKSVLFV 189 (933)
T ss_pred CCcceEEEEEcc---------ccEEEEEecCcEEEEEcCcchhccccceeeeccCCCCceeeEEecC------CceeEEE
Confidence 346888999986 6899999999999987532 12211222234579999999888 6654555
Q ss_pred eCCCcEEEEECCCCcE-EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe-CCCC
Q 000473 646 GEDFSVALASLETLRV-ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR-GTAS 720 (1471)
Q Consensus 646 s~DgsV~lWdl~t~~~-l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~-gH~~ 720 (1471)
..-..|.+|.+....+ ...+..|...+.|..+++.-..+++++. .-|++||......--.+. ||..
T Consensus 190 ~Tt~~V~~y~l~gr~p~~~~ld~~G~~lnCss~~~~t~qfIca~~---------e~l~fY~sd~~~~cfaf~~g~kk 257 (933)
T KOG2114|consen 190 ATTEQVMLYSLSGRTPSLKVLDNNGISLNCSSFSDGTYQFICAGS---------EFLYFYDSDGRGPCFAFEVGEKK 257 (933)
T ss_pred EecceeEEEEecCCCcceeeeccCCccceeeecCCCCccEEEecC---------ceEEEEcCCCcceeeeecCCCeE
Confidence 5667899999984442 4556777788999999887664666655 379999987544444454 7763
No 319
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=94.47 E-value=1.9 Score=42.97 Aligned_cols=100 Identities=13% Similarity=0.120 Sum_probs=65.5
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEE
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVA 652 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~ 652 (1471)
|+++++..-. .+| ...|+.||.|+.||+|+= .+.+..+..+ ..|+++.-... ..|+.+-.+|+|.
T Consensus 2 V~al~~~d~d--~dg---~~eLlvGs~D~~IRvf~~--~e~~~Ei~e~-~~v~~L~~~~~-------~~F~Y~l~NGTVG 66 (111)
T PF14783_consen 2 VTALCLFDFD--GDG---ENELLVGSDDFEIRVFKG--DEIVAEITET-DKVTSLCSLGG-------GRFAYALANGTVG 66 (111)
T ss_pred eeEEEEEecC--CCC---cceEEEecCCcEEEEEeC--CcEEEEEecc-cceEEEEEcCC-------CEEEEEecCCEEE
Confidence 6677765420 112 478999999999999984 4677777654 46777776553 5799999999999
Q ss_pred EEECCCCcEEEEecCCCCCcEEEEEc-CCC---CEEEEEEcC
Q 000473 653 LASLETLRVERMFPGHPNYPAKVVWD-CPR---GYIACLCRD 690 (1471)
Q Consensus 653 lWdl~t~~~l~~~~gh~~~V~~v~~s-pdg---~~L~sgs~D 690 (1471)
+|+-.. +. ....... .+.++.+. .++ .-|++|-.+
T Consensus 67 vY~~~~-Rl-WRiKSK~-~~~~~~~~D~~gdG~~eLI~Gwsn 105 (111)
T PF14783_consen 67 VYDRSQ-RL-WRIKSKN-QVTSMAFYDINGDGVPELIVGWSN 105 (111)
T ss_pred EEeCcc-ee-eeeccCC-CeEEEEEEcCCCCCceEEEEEecC
Confidence 997532 22 2223222 35566553 332 267888777
No 320
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=93.93 E-value=1 Score=57.45 Aligned_cols=142 Identities=15% Similarity=0.154 Sum_probs=95.0
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++++.||++.|.... +.+...++. -..++.+++++|+- .+...+.+++||.-| +.+.
T Consensus 85 y~asCS~DGkv~I~sl~---------------~~~~~~~~d-f~rpiksial~Pd~----~~~~sk~fv~GG~ag-lvL~ 143 (846)
T KOG2066|consen 85 YVASCSDDGKVVIGSLF---------------TDDEITQYD-FKRPIKSIALHPDF----SRQQSKQFVSGGMAG-LVLS 143 (846)
T ss_pred eEEEecCCCcEEEeecc---------------CCccceeEe-cCCcceeEEeccch----hhhhhhheeecCcce-EEEe
Confidence 79999999999984332 112222222 23578999999973 233467899999998 6665
Q ss_pred ECCC-CceE-EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC------CcEEEEEc
Q 000473 607 DLGS-GNLI-TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN------YPAKVVWD 678 (1471)
Q Consensus 607 Dl~t-g~~l-~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~------~V~~v~~s 678 (1471)
.-+- |... ..+..-.|+|.++.|. |+++|=++.+| |+++|..+++.+..++.... ....+.|.
T Consensus 144 er~wlgnk~~v~l~~~eG~I~~i~W~--------g~lIAWand~G-v~vyd~~~~~~l~~i~~p~~~~R~e~fpphl~W~ 214 (846)
T KOG2066|consen 144 ERNWLGNKDSVVLSEGEGPIHSIKWR--------GNLIAWANDDG-VKVYDTPTRQRLTNIPPPSQSVRPELFPPHLHWQ 214 (846)
T ss_pred hhhhhcCccceeeecCccceEEEEec--------CcEEEEecCCC-cEEEeccccceeeccCCCCCCCCcccCCCceEec
Confidence 5321 1111 1355567899999995 55888877665 89999999887766653323 34678898
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
++.+ |+.|=.| +|+|..++.
T Consensus 215 ~~~~-LVIGW~d---------~v~i~~I~~ 234 (846)
T KOG2066|consen 215 DEDR-LVIGWGD---------SVKICSIKK 234 (846)
T ss_pred CCCe-EEEecCC---------eEEEEEEec
Confidence 7665 4444443 899998873
No 321
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=93.74 E-value=0.27 Score=61.10 Aligned_cols=78 Identities=10% Similarity=0.025 Sum_probs=69.0
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEE-EEEECCCCCCCCCCCEEEEEeCCCc
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVR-QIILSPPQTEHPWSDCFLSVGEDFS 650 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~-~l~fspd~~~~~~~~~l~S~s~Dgs 650 (1471)
.+.-+.|+|. -.++|.+..+|.|.+..+. .+.+.++.-|...++ +++|.|| |+.++.|-.||+
T Consensus 22 ~i~~~ewnP~---------~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~D------GkllaVg~kdG~ 85 (665)
T KOG4640|consen 22 NIKRIEWNPK---------MDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPD------GKLLAVGFKDGT 85 (665)
T ss_pred ceEEEEEcCc---------cchhheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCC------CCEEEEEecCCe
Confidence 4677889996 7899999999999999987 888889987888888 9999999 999999999999
Q ss_pred EEEEECCCCcEEEEe
Q 000473 651 VALASLETLRVERMF 665 (1471)
Q Consensus 651 V~lWdl~t~~~l~~~ 665 (1471)
|++.|.+++..+..+
T Consensus 86 I~L~Dve~~~~l~~~ 100 (665)
T KOG4640|consen 86 IRLHDVEKGGRLVSF 100 (665)
T ss_pred EEEEEccCCCceecc
Confidence 999999998876653
No 322
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=93.52 E-value=9.3 Score=45.00 Aligned_cols=156 Identities=17% Similarity=0.186 Sum_probs=84.6
Q ss_pred CceEEEEEEcC-CCCeEEEEeCCCc-EEEEEccCCCCCceeeeEEecccccceeEeeeccccccccCccccccccccccc
Q 000473 16 SHRVTATSALT-QPPTLYTGGSDGS-ILWWSFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSN 93 (1471)
Q Consensus 16 ~h~Vtava~Sp-Dg~~LaTGs~DG~-I~lWdl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~ 93 (1471)
+.+.-.++++| ++..+|-+=.-|+ ..+||..+ ++ ..+.+.. ++|| |+ +
T Consensus 4 P~RgH~~a~~p~~~~avafaRRPG~~~~v~D~~~---g~--~~~~~~a-----------------~~gR--HF------y 53 (305)
T PF07433_consen 4 PARGHGVAAHPTRPEAVAFARRPGTFALVFDCRT---GQ--LLQRLWA-----------------PPGR--HF------Y 53 (305)
T ss_pred CccccceeeCCCCCeEEEEEeCCCcEEEEEEcCC---Cc--eeeEEcC-----------------CCCC--EE------e
Confidence 34556778888 4555666666666 56788774 22 1111110 0111 00 0
Q ss_pred ccccccCCCCEEEEEe-----CCCeEEEEEcCCC-eEEEeeeCCCCCCC-CcEEEEcCCCCeE--EEEcceecccCCccc
Q 000473 94 VMGKSSLDNGALISAC-----TDGVLCVWSRSSG-HCRRRRKLPPWVGS-PSVICTLPSNPRY--VCIGCCFIDTNQLSD 164 (1471)
Q Consensus 94 ~~~~~s~d~~~LaSas-----~DG~I~VWdv~~G-~ci~~~~l~~~~g~-~~~i~~~s~~~~l--l~~G~~~id~~~~~~ 164 (1471)
.=|.||+|+.+|.+.- ..|.|-|||+..+ +.+...... |. |-.+..+ +|++. ++.|.- .+.
T Consensus 54 GHg~fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~---GIGPHel~l~-pDG~tLvVANGGI--~Th---- 123 (305)
T PF07433_consen 54 GHGVFSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSH---GIGPHELLLM-PDGETLVVANGGI--ETH---- 123 (305)
T ss_pred cCEEEcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCC---CcChhhEEEc-CCCCEEEEEcCCC--ccC----
Confidence 1123788888898854 3588999999833 334434333 32 3344444 45533 344432 111
Q ss_pred ccccccccccccccccCCCCCCCCCceEEEEeCcceEEEEEeec-CccccCCeEEEEEe
Q 000473 165 HHSFESVEGDLVSEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFH-GNLSIGPWKFMDVV 222 (1471)
Q Consensus 165 ~h~~~~i~~~~~~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s-~~~s~~~i~~~~~~ 222 (1471)
.-..++-..+..|...+...|..+++++....- ...+--.|.-+++.
T Consensus 124 -----------pd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~ 171 (305)
T PF07433_consen 124 -----------PDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLHQLSIRHLAVD 171 (305)
T ss_pred -----------cccCceecChhhcCCceEEEecCCCceeeeeecCccccccceeeEEec
Confidence 001122334667889999999999999877431 11222236778766
No 323
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein. ; PDB: 2OAJ_A.
Probab=93.17 E-value=12 Score=46.00 Aligned_cols=225 Identities=15% Similarity=0.146 Sum_probs=115.9
Q ss_pred cEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccC---------C----------CCCCccccCC---------cce
Q 000473 511 IVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERH---------N----------SPGASLKVNS---------HVS 562 (1471)
Q Consensus 511 ~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~---------d----------~~~~~~d~~s---------~~~ 562 (1471)
.|+++.+-.+.. -+++|...|.+.|++|..-... + .++.+-|+.. ..+
T Consensus 3 ~v~~vs~a~~t~----Elav~~~~GeVv~~k~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~l~di~~r~~~~~~~gf~P 78 (395)
T PF08596_consen 3 SVTHVSFAPETL----ELAVGLESGEVVLFKFGKNQNYGNREQPPDLDYNFRRFSLNNSPGKLTDISDRAPPSLKEGFLP 78 (395)
T ss_dssp -EEEEEEETTTT----EEEEEETTS-EEEEEEEE------------------S--GGGSS-SEEE-GGG--TT-SEEEEE
T ss_pred eEEEEEecCCCc----eEEEEccCCcEEEEEcccCCCCCccCCCcccCcccccccccCCCcceEEehhhCCcccccccCc
Confidence 466666444444 7999999999999998531110 0 0122222211 134
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEE--ec------cCCCEEEEEECCCCC
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVM--HH------HVAPVRQIILSPPQT 634 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~--~~------H~~~V~~l~fspd~~ 634 (1471)
...+....+.|++++.+. --+++.|..||.+.|-|++....+..- .. ....|+++.|..- .
T Consensus 79 ~~l~~~~~g~vtal~~S~----------iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm-~ 147 (395)
T PF08596_consen 79 LTLLDAKQGPVTALKNSD----------IGFVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVM-T 147 (395)
T ss_dssp EEEE---S-SEEEEEE-B----------TSEEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEE-E
T ss_pred hhheeccCCcEeEEecCC----------CcEEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEE-e
Confidence 444556689999999863 579999999999999999877766542 12 2235778877631 1
Q ss_pred CCCCC---CEEEEEeCCCcEEEEECC--C-CcE----EEEecCCCCCcEEEE-EcCC---------------------CC
Q 000473 635 EHPWS---DCFLSVGEDFSVALASLE--T-LRV----ERMFPGHPNYPAKVV-WDCP---------------------RG 682 (1471)
Q Consensus 635 ~~~~~---~~l~S~s~DgsV~lWdl~--t-~~~----l~~~~gh~~~V~~v~-~spd---------------------g~ 682 (1471)
..-++ -++..|...|.+.+|.+. . ++- ......+.++|..+. ++.+ ..
T Consensus 148 ~~~D~ySSi~L~vGTn~G~v~~fkIlp~~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g~~i~g 227 (395)
T PF08596_consen 148 LGGDGYSSICLLVGTNSGNVLTFKILPSSNGRFSVQFAGATTNHDSPILSIIPINADTGESALATISAMQGLSKGISIPG 227 (395)
T ss_dssp -TTSSSEEEEEEEEETTSEEEEEEEEE-GGG-EEEEEEEEE--SS----EEEEEETTT--B-B-BHHHHHGGGGT----E
T ss_pred cCCCcccceEEEEEeCCCCEEEEEEecCCCCceEEEEeeccccCCCceEEEEEEECCCCCcccCchhHhhccccCCCcCc
Confidence 11112 378888889999999774 1 221 112224556666554 3221 12
Q ss_pred EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeeeeccccccccceEEcCCccccccceeeccCCceEe
Q 000473 683 YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFCKGISMNSISGSVLNGNTSVSSLLLPIHEDGTFRQ 762 (1471)
Q Consensus 683 ~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~~~~~~~~~sg~v~~g~~~~s~~l~~~~~D~tir~ 762 (1471)
++++.+ + ..+||+...+++..+... .....+...+ . +..-.......++.+..||.++.
T Consensus 228 ~vVvvS-e--------~~irv~~~~~~k~~~K~~--~~~~~~~~~~----v------v~~~~~~~~~~Lv~l~~~G~i~i 286 (395)
T PF08596_consen 228 YVVVVS-E--------SDIRVFKPPKSKGAHKSF--DDPFLCSSAS----V------VPTISRNGGYCLVCLFNNGSIRI 286 (395)
T ss_dssp EEEEE--S--------SEEEEE-TT---EEEEE---SS-EEEEEEE----E------EEEE-EEEEEEEEEEETTSEEEE
T ss_pred EEEEEc-c--------cceEEEeCCCCcccceee--ccccccceEE----E------EeecccCCceEEEEEECCCcEEE
Confidence 554444 3 589999998877655443 2222222112 0 00000223445677888999999
Q ss_pred ecccccccc
Q 000473 763 SQIQNDERG 771 (1471)
Q Consensus 763 w~l~~~~~~ 771 (1471)
+.|..++..
T Consensus 287 ~SLP~Lkei 295 (395)
T PF08596_consen 287 YSLPSLKEI 295 (395)
T ss_dssp EETTT--EE
T ss_pred EECCCchHh
Confidence 988665543
No 324
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=92.96 E-value=0.035 Score=68.35 Aligned_cols=166 Identities=16% Similarity=0.215 Sum_probs=103.5
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEE----ECCCc
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSG----SMDCS 602 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SG----s~Dgt 602 (1471)
.++.|..+|.|.+.....- + . ....+..+|....++++|.+- +.++|+.| -.|..
T Consensus 72 IlavG~atG~I~l~s~r~~-----h------d--Ss~E~tp~~ar~Ct~lAwneL--------Dtn~LAagldkhrnds~ 130 (783)
T KOG1008|consen 72 ILAVGSATGNISLLSVRHP-----H------D--SSAEVTPGYARPCTSLAWNEL--------DTNHLAAGLDKHRNDSS 130 (783)
T ss_pred hhhhccccCceEEeecCCc-----c------c--ccceecccccccccccccccc--------cHHHHHhhhhhhcccCC
Confidence 6889999999998554310 0 1 123445678889999999874 35677777 35678
Q ss_pred EEEEECCCC--ceE--EEEec-cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 603 IRIWDLGSG--NLI--TVMHH-HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 603 I~lWDl~tg--~~l--~~~~~-H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
+.+||+.++ .+. ..|.+ ......+++|..+ ...+.+|...+.++++|++..- .+...-....+..+..
T Consensus 131 ~~Iwdi~s~ltvPke~~~fs~~~l~gqns~cwlrd------~klvlaGm~sr~~~ifdlRqs~-~~~~svnTk~vqG~tV 203 (783)
T KOG1008|consen 131 LKIWDINSLLTVPKESPLFSSSTLDGQNSVCWLRD------TKLVLAGMTSRSVHIFDLRQSL-DSVSSVNTKYVQGITV 203 (783)
T ss_pred ccceecccccCCCccccccccccccCccccccccC------cchhhcccccchhhhhhhhhhh-hhhhhhhhhhccccee
Confidence 999999876 222 22332 2334456666555 6789999999999999998321 1111112334566777
Q ss_pred cC-CCCEEEEEEcCCCCCCCCCCEEEEEEC-CCC-eEEEEEeCCC----CCceeeeeee
Q 000473 678 DC-PRGYIACLCRDHSRTSDAVDVLFIWDV-KTG-ARERVLRGTA----SHSMFDHFCK 729 (1471)
Q Consensus 678 sp-dg~~L~sgs~D~sg~~D~~gtV~VWDi-~tg-~~~~~l~gH~----~~v~~~~~~~ 729 (1471)
+| .++|+++-. | |.|.+||. +.- .+++.+.-.. .+...+.+||
T Consensus 204 dp~~~nY~cs~~-d--------g~iAiwD~~rnienpl~~i~~~~N~~~~~l~~~aycP 253 (783)
T KOG1008|consen 204 DPFSPNYFCSNS-D--------GDIAIWDTYRNIENPLQIILRNENKKPKQLFALAYCP 253 (783)
T ss_pred cCCCCCceeccc-c--------CceeeccchhhhccHHHHHhhCCCCcccceeeEEecc
Confidence 88 677876544 5 89999993 322 2222222111 1355667885
No 325
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=92.74 E-value=0.18 Score=63.78 Aligned_cols=99 Identities=13% Similarity=0.159 Sum_probs=75.5
Q ss_pred CCEEEEEE----CCCcEEEEECCCCceEEE--EeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEE
Q 000473 591 NEVLVSGS----MDCSIRIWDLGSGNLITV--MHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERM 664 (1471)
Q Consensus 591 ~~~L~SGs----~DgtI~lWDl~tg~~l~~--~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~ 664 (1471)
..+++.++ .-|+|.++- ++|++-+. ...| +++++|+|. .-.++.|-.-|.+.+|...+.+....
T Consensus 27 ePlfAVA~fS~er~GSVtIfa-dtGEPqr~Vt~P~h---atSLCWHpe------~~vLa~gwe~g~~~v~~~~~~e~htv 96 (1416)
T KOG3617|consen 27 EPLFAVASFSPERGGSVTIFA-DTGEPQRDVTYPVH---ATSLCWHPE------EFVLAQGWEMGVSDVQKTNTTETHTV 96 (1416)
T ss_pred CceeEEEEecCCCCceEEEEe-cCCCCCccccccee---hhhhccChH------HHHHhhccccceeEEEecCCceeeee
Confidence 45555544 346777774 56764322 2223 456888887 66788888889999999988887777
Q ss_pred ecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 665 FPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 665 ~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
...|..+|.-+.|||+|..|+++..- |.|.+|...
T Consensus 97 ~~th~a~i~~l~wS~~G~~l~t~d~~--------g~v~lwr~d 131 (1416)
T KOG3617|consen 97 VETHPAPIQGLDWSHDGTVLMTLDNP--------GSVHLWRYD 131 (1416)
T ss_pred ccCCCCCceeEEecCCCCeEEEcCCC--------ceeEEEEee
Confidence 77899999999999999999998877 999999775
No 326
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=92.52 E-value=4.1 Score=46.00 Aligned_cols=109 Identities=18% Similarity=0.058 Sum_probs=81.3
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
..+++-||..+.+.--|..+|++...-.- ...|.+-+.- - |+.++-|+..+.+.+.+.++|.....|..-..
T Consensus 23 kT~v~igSHs~~~~avd~~sG~~~We~il-g~RiE~sa~v-v------gdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~~ 94 (354)
T KOG4649|consen 23 KTLVVIGSHSGIVIAVDPQSGNLIWEAIL-GVRIECSAIV-V------GDFVVLGCYSGGLYFLCVKTGSQIWNFVILET 94 (354)
T ss_pred ceEEEEecCCceEEEecCCCCcEEeehhh-CceeeeeeEE-E------CCEEEEEEccCcEEEEEecchhheeeeeehhh
Confidence 67888888899888889998887643211 1223322221 2 77899999999999999999988887764333
Q ss_pred CcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 671 YPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
-=......+++..+.+|+.| ++.|.-|.++..++...
T Consensus 95 vk~~a~~d~~~glIycgshd--------~~~yalD~~~~~cVyks 131 (354)
T KOG4649|consen 95 VKVRAQCDFDGGLIYCGSHD--------GNFYALDPKTYGCVYKS 131 (354)
T ss_pred hccceEEcCCCceEEEecCC--------CcEEEecccccceEEec
Confidence 11344567899999999999 99999999998877653
No 327
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=92.45 E-value=0.23 Score=55.06 Aligned_cols=103 Identities=12% Similarity=0.127 Sum_probs=60.8
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.+++|..+|.|.++.|.+... ..+. ...-...|-|.-..-+ ++.+..+++.|+.|+.|
T Consensus 72 ~~~vG~~dg~v~~~n~n~~g~------~~d~--------~~s~~e~i~~~Ip~~~--------~~~~~c~~~~dg~ir~~ 129 (238)
T KOG2444|consen 72 KLMVGTSDGAVYVFNWNLEGA------HSDR--------VCSGEESIDLGIPNGR--------DSSLGCVGAQDGRIRAC 129 (238)
T ss_pred eEEeecccceEEEecCCccch------HHHh--------hhcccccceecccccc--------ccceeEEeccCCceeee
Confidence 799999999999988872111 0011 1111122333221111 25688999999999999
Q ss_pred ECCCCceEEEEeccC-CCEEEEEECCCCCCCCCCCEEEEE--eCCCcEEEEECC
Q 000473 607 DLGSGNLITVMHHHV-APVRQIILSPPQTEHPWSDCFLSV--GEDFSVALASLE 657 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~--s~DgsV~lWdl~ 657 (1471)
++..++.+...-.|. .++..+..... +..++.. |.|..++.|++.
T Consensus 130 n~~p~k~~g~~g~h~~~~~e~~ivv~s------d~~i~~a~~S~d~~~k~W~ve 177 (238)
T KOG2444|consen 130 NIKPNKVLGYVGQHNFESGEELIVVGS------DEFLKIADTSHDRVLKKWNVE 177 (238)
T ss_pred ccccCceeeeeccccCCCcceeEEecC------CceEEeeccccchhhhhcchh
Confidence 999888777776676 34444443333 3444444 555555555554
No 328
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=92.30 E-value=0.51 Score=59.43 Aligned_cols=104 Identities=12% Similarity=0.057 Sum_probs=80.2
Q ss_pred CCEEEEEECCCcEEEEECCCCceE-EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE---EEEec
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLI-TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV---ERMFP 666 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l-~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~---l~~~~ 666 (1471)
+++++-|+.-|.+.+++-..++.. ....+-.+.+....++++ ..++|.|+..|.|.++-++.+.+ ...-+
T Consensus 45 ~~~l~~GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~~~~vs~~------e~lvAagt~~g~V~v~ql~~~~p~~~~~~t~ 118 (726)
T KOG3621|consen 45 EEYLAMGSSAGSVYLYNRHTGEMRKLKNEGATGITCVRSVSSV------EYLVAAGTASGRVSVFQLNKELPRDLDYVTP 118 (726)
T ss_pred CceEEEecccceEEEEecCchhhhcccccCccceEEEEEecch------hHhhhhhcCCceEEeehhhccCCCcceeecc
Confidence 789999999999999998877654 333344556666778887 77888888899999998875422 12212
Q ss_pred ---CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 667 ---GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 667 ---gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.|...|++++|++++..+++|..- |+|..-.+.+
T Consensus 119 ~d~~~~~rVTal~Ws~~~~k~ysGD~~--------Gkv~~~~L~s 155 (726)
T KOG3621|consen 119 CDKSHKCRVTALEWSKNGMKLYSGDSQ--------GKVVLTELDS 155 (726)
T ss_pred ccccCCceEEEEEecccccEEeecCCC--------ceEEEEEech
Confidence 367789999999999999998776 9998887776
No 329
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function; it possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=92.26 E-value=1.9 Score=56.73 Aligned_cols=131 Identities=15% Similarity=0.060 Sum_probs=90.6
Q ss_pred cccCCcceEEEEecCCcc-EEEEEEecCCCCcccC-cCCCEEEEEECCCcEEEEECCCC-ceEEEEec----cCCCEEEE
Q 000473 555 LKVNSHVSRQYFLGHTGA-VLCLAAHRMVGTAKGW-SFNEVLVSGSMDCSIRIWDLGSG-NLITVMHH----HVAPVRQI 627 (1471)
Q Consensus 555 ~d~~s~~~~~~l~gH~~~-V~~la~spd~~~~~~~-~~~~~L~SGs~DgtI~lWDl~tg-~~l~~~~~----H~~~V~~l 627 (1471)
-|++.|+.+....-|... |..++ |+. +.. -...-.+.|=.+..+..||.+-. ..+..-.. ......|+
T Consensus 509 mDLe~GKVV~eW~~~~~~~v~~~~--p~~---K~aqlt~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~ 583 (794)
T PF08553_consen 509 MDLERGKVVEEWKVHDDIPVVDIA--PDS---KFAQLTNEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCF 583 (794)
T ss_pred EecCCCcEEEEeecCCCcceeEec--ccc---cccccCCCceEEEECCCceEEeccCCCCCceeeccccccccCCCceEE
Confidence 367788888888877654 66554 321 000 01345667778889999999853 22211111 23346677
Q ss_pred EECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 628 ILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 628 ~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+-..+ .+||.|+.+|.|||||--..+....+++-..+|..|..+.||++|++.|.. .+.+++.
T Consensus 584 aTt~~-------G~iavgs~~G~IRLyd~~g~~AKT~lp~lG~pI~~iDvt~DGkwilaTc~t---------yLlLi~t 646 (794)
T PF08553_consen 584 ATTED-------GYIAVGSNKGDIRLYDRLGKRAKTALPGLGDPIIGIDVTADGKWILATCKT---------YLLLIDT 646 (794)
T ss_pred EecCC-------ceEEEEeCCCcEEeecccchhhhhcCCCCCCCeeEEEecCCCcEEEEeecc---------eEEEEEE
Confidence 66654 489999999999999954444556678888999999999999999999985 6777775
No 330
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=92.25 E-value=13 Score=42.26 Aligned_cols=109 Identities=10% Similarity=0.099 Sum_probs=71.6
Q ss_pred EEEEEEe-cCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEec-----cCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 573 VLCLAAH-RMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHH-----HVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 573 V~~la~s-pd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~-----H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
...+++. ++ +.++++ ..++ +.++|..+++....+.. .......+++.|+ |+..++-.
T Consensus 42 ~~G~~~~~~~---------g~l~v~-~~~~-~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~------G~ly~t~~ 104 (246)
T PF08450_consen 42 PNGMAFDRPD---------GRLYVA-DSGG-IAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPD------GNLYVTDS 104 (246)
T ss_dssp EEEEEEECTT---------SEEEEE-ETTC-EEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TT------S-EEEEEE
T ss_pred CceEEEEccC---------CEEEEE-EcCc-eEEEecCCCcEEEEeeccCCCcccCCCceEEEcCC------CCEEEEec
Confidence 6677777 43 444444 4444 55569998865433332 3356789999999 88888766
Q ss_pred CC--------CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 647 ED--------FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 647 ~D--------gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.. +.|..++.. ++...... .....+.++|+|+++.|++.... .+.|+.+++.
T Consensus 105 ~~~~~~~~~~g~v~~~~~~-~~~~~~~~-~~~~pNGi~~s~dg~~lyv~ds~-------~~~i~~~~~~ 164 (246)
T PF08450_consen 105 GGGGASGIDPGSVYRIDPD-GKVTVVAD-GLGFPNGIAFSPDGKTLYVADSF-------NGRIWRFDLD 164 (246)
T ss_dssp CCBCTTCGGSEEEEEEETT-SEEEEEEE-EESSEEEEEEETTSSEEEEEETT-------TTEEEEEEEE
T ss_pred CCCccccccccceEEECCC-CeEEEEec-CcccccceEECCcchheeecccc-------cceeEEEecc
Confidence 54 457777777 55544433 34567899999999988765543 2889999985
No 331
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=91.87 E-value=17 Score=41.29 Aligned_cols=93 Identities=9% Similarity=-0.098 Sum_probs=63.9
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPN 670 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~ 670 (1471)
+++++-|...+.+.+-+.++|.....|..-..-=.+-...++ +..+..++.|++....|.++..++...+-..+
T Consensus 63 gdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~------~glIycgshd~~~yalD~~~~~cVykskcgG~ 136 (354)
T KOG4649|consen 63 GDFVVLGCYSGGLYFLCVKTGSQIWNFVILETVKVRAQCDFD------GGLIYCGSHDGNFYALDPKTYGCVYKSKCGGG 136 (354)
T ss_pred CCEEEEEEccCcEEEEEecchhheeeeeehhhhccceEEcCC------CceEEEecCCCcEEEecccccceEEecccCCc
Confidence 688999999999999999999888777543321122234566 88999999999999999999999877543322
Q ss_pred CcEEEEEcCCCCEEEEEEc
Q 000473 671 YPAKVVWDCPRGYIACLCR 689 (1471)
Q Consensus 671 ~V~~v~~spdg~~L~sgs~ 689 (1471)
--.+-+..|....|..+..
T Consensus 137 ~f~sP~i~~g~~sly~a~t 155 (354)
T KOG4649|consen 137 TFVSPVIAPGDGSLYAAIT 155 (354)
T ss_pred eeccceecCCCceEEEEec
Confidence 2223344453333333333
No 332
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=91.48 E-value=10 Score=51.12 Aligned_cols=158 Identities=11% Similarity=0.076 Sum_probs=94.6
Q ss_pred ccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEec-CCccEEEEEEecCCCCcccC
Q 000473 510 KIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLG-HTGAVLCLAAHRMVGTAKGW 588 (1471)
Q Consensus 510 ~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~g-H~~~V~~la~spd~~~~~~~ 588 (1471)
..|.++.+..+.. .++.+..+|.|.+.+-. +.. ..+-| -.+.|.+..|+||
T Consensus 69 ~~i~s~~fl~d~~----~i~v~~~~G~iilvd~e---------------t~~--~eivg~vd~GI~aaswS~D------- 120 (1265)
T KOG1920|consen 69 DEIVSVQFLADTN----SICVITALGDIILVDPE---------------TLE--LEIVGNVDNGISAASWSPD------- 120 (1265)
T ss_pred cceEEEEEecccc----eEEEEecCCcEEEEccc---------------ccc--eeeeeeccCceEEEeecCC-------
Confidence 4566666544443 78889999999985322 111 11222 2457999999998
Q ss_pred cCCCEEEEEECCCcEEEEEC----CCC-------------------ceEEEEeccCC---------------------CE
Q 000473 589 SFNEVLVSGSMDCSIRIWDL----GSG-------------------NLITVMHHHVA---------------------PV 624 (1471)
Q Consensus 589 ~~~~~L~SGs~DgtI~lWDl----~tg-------------------~~l~~~~~H~~---------------------~V 624 (1471)
+++++-.+.+.++.+-+- -.. +.-..|+|..+ .=
T Consensus 121 --ee~l~liT~~~tll~mT~~f~~i~E~~L~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek~~~~~~~~~~~ 198 (1265)
T KOG1920|consen 121 --EELLALITGRQTLLFMTKDFEPIAEKPLDADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEKALEQIEQDDHK 198 (1265)
T ss_pred --CcEEEEEeCCcEEEEEeccccchhccccccccccccccceecccccceeeecchhhhcccccccccccccchhhccCC
Confidence 888888888877766432 111 11123332211 01
Q ss_pred EEEEECCCCCCCCCCCEEEEE----eCC-CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCC
Q 000473 625 RQIILSPPQTEHPWSDCFLSV----GED-FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVD 699 (1471)
Q Consensus 625 ~~l~fspd~~~~~~~~~l~S~----s~D-gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~g 699 (1471)
++|.|.-| |++|+.. -.+ +.|++||-+ |..-.+-..-.+-=.+++|.|.|..+++-..+ ++| +
T Consensus 199 ~~IsWRgD------g~~fAVs~~~~~~~~RkirV~drE-g~Lns~se~~~~l~~~LsWkPsgs~iA~iq~~---~sd--~ 266 (1265)
T KOG1920|consen 199 TSISWRGD------GEYFAVSFVESETGTRKIRVYDRE-GALNSTSEPVEGLQHSLSWKPSGSLIAAIQCK---TSD--S 266 (1265)
T ss_pred ceEEEccC------CcEEEEEEEeccCCceeEEEeccc-chhhcccCcccccccceeecCCCCeEeeeeec---CCC--C
Confidence 23666666 8888872 224 899999976 43322211112223579999999999886655 233 5
Q ss_pred EEEEEECCCCe
Q 000473 700 VLFIWDVKTGA 710 (1471)
Q Consensus 700 tV~VWDi~tg~ 710 (1471)
.|.++. ++|-
T Consensus 267 ~IvffE-rNGL 276 (1265)
T KOG1920|consen 267 DIVFFE-RNGL 276 (1265)
T ss_pred cEEEEe-cCCc
Confidence 799987 4554
No 333
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=91.48 E-value=11 Score=37.86 Aligned_cols=91 Identities=12% Similarity=0.028 Sum_probs=59.8
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.|+.|++|..|+|++=+ ..+..+.. ++.|++|.... +..++-|-.+|+|.++
T Consensus 17 eLlvGs~D~~IRvf~~~-----------------e~~~Ei~e-~~~v~~L~~~~----------~~~F~Y~l~NGTVGvY 68 (111)
T PF14783_consen 17 ELLVGSDDFEIRVFKGD-----------------EIVAEITE-TDKVTSLCSLG----------GGRFAYALANGTVGVY 68 (111)
T ss_pred eEEEecCCcEEEEEeCC-----------------cEEEEEec-ccceEEEEEcC----------CCEEEEEecCCEEEEE
Confidence 79999999999994322 33444443 45788887665 5779999999999999
Q ss_pred ECCCCceEEEEeccCCCEEEEEE-CCCCCCCCCC-CEEEEEeCCCcEE
Q 000473 607 DLGSGNLITVMHHHVAPVRQIIL-SPPQTEHPWS-DCFLSVGEDFSVA 652 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H~~~V~~l~f-spd~~~~~~~-~~l~S~s~DgsV~ 652 (1471)
+- .+.+.+.+.... +.++.+ ..+. +| ..|++|-.+|.|-
T Consensus 69 ~~--~~RlWRiKSK~~-~~~~~~~D~~g----dG~~eLI~GwsnGkve 109 (111)
T PF14783_consen 69 DR--SQRLWRIKSKNQ-VTSMAFYDING----DGVPELIVGWSNGKVE 109 (111)
T ss_pred eC--cceeeeeccCCC-eEEEEEEcCCC----CCceEEEEEecCCeEE
Confidence 85 334455544443 555554 3331 12 2688888888764
No 334
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=91.36 E-value=0.47 Score=52.67 Aligned_cols=104 Identities=14% Similarity=0.158 Sum_probs=63.6
Q ss_pred CCEEEEEECCCcEEEEECCC-CceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGS-GNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~t-g~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~ 669 (1471)
+.-++.|+.||.|.+|...- |.....+..-..+|.+.. |. ...+.+.++++.|+.|+.|+.+-++.+.....|.
T Consensus 70 ~~~~~vG~~dg~v~~~n~n~~g~~~d~~~s~~e~i~~~I--p~---~~~~~~~c~~~~dg~ir~~n~~p~k~~g~~g~h~ 144 (238)
T KOG2444|consen 70 SAKLMVGTSDGAVYVFNWNLEGAHSDRVCSGEESIDLGI--PN---GRDSSLGCVGAQDGRIRACNIKPNKVLGYVGQHN 144 (238)
T ss_pred CceEEeecccceEEEecCCccchHHHhhhcccccceecc--cc---ccccceeEEeccCCceeeeccccCceeeeecccc
Confidence 56789999999999998762 111111111122233222 22 1125689999999999999999888887777776
Q ss_pred -CCcEEEEEcCCCCEEEEE--EcCCCCCCCCCCEEEEEECC
Q 000473 670 -NYPAKVVWDCPRGYIACL--CRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 670 -~~V~~v~~spdg~~L~sg--s~D~sg~~D~~gtV~VWDi~ 707 (1471)
.++........+++++.+ +.| ..++.|+++
T Consensus 145 ~~~~e~~ivv~sd~~i~~a~~S~d--------~~~k~W~ve 177 (238)
T KOG2444|consen 145 FESGEELIVVGSDEFLKIADTSHD--------RVLKKWNVE 177 (238)
T ss_pred CCCcceeEEecCCceEEeeccccc--------hhhhhcchh
Confidence 344444444445555554 444 555666554
No 335
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=91.30 E-value=75 Score=43.72 Aligned_cols=92 Identities=20% Similarity=0.220 Sum_probs=67.3
Q ss_pred eeEecCCC---CCCceEEEEEEcCCCCeEEEEeCCCcEEEE----EccCCCCCceeeeEEecccccceeEeeeccccccc
Q 000473 6 VACIWSGT---PPSHRVTATSALTQPPTLYTGGSDGSILWW----SFSDSSYSEIKPVAMLCGHSAPIADLSICYPAMVS 78 (1471)
Q Consensus 6 ~~~lw~~~---~p~h~Vtava~SpDg~~LaTGs~DG~I~lW----dl~~~~~~~~~~~~~L~GH~~~Vt~La~c~~~~~s 78 (1471)
+.+=|... .+..+|.++.+.+|...|+.+..+|.|.+. |... ...+.+. -=...|.|.+
T Consensus 62 ~l~s~~~~~~~~~~~~ivs~~yl~d~~~l~~~~~~Gdi~~~~~~~~~~~---~~~E~VG---~vd~GI~a~~-------- 127 (928)
T PF04762_consen 62 VLASWDAPLPDDPNDKIVSFQYLADSESLCIALASGDIILVREDPDPDE---DEIEIVG---SVDSGILAAS-------- 127 (928)
T ss_pred EEEeccccCCcCCCCcEEEEEeccCCCcEEEEECCceEEEEEccCCCCC---ceeEEEE---EEcCcEEEEE--------
Confidence 34447654 556889999999999999999999999999 4442 2222222 2345788887
Q ss_pred cCcccccccccccccccccccCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCC
Q 000473 79 RDGKAEHWKAENSSNVMGKSSLDNGALISACTDGVLCVWSRSSGHCRRRRKLP 131 (1471)
Q Consensus 79 ~dg~~~~~~~~~~~~~~~~~s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~ 131 (1471)
+|||+..|+-+..+|+|.+-+ .+-..+....+.
T Consensus 128 -------------------WSPD~Ella~vT~~~~l~~mt-~~fd~i~E~~l~ 160 (928)
T PF04762_consen 128 -------------------WSPDEELLALVTGEGNLLLMT-RDFDPISEVPLD 160 (928)
T ss_pred -------------------ECCCcCEEEEEeCCCEEEEEe-ccceEEEEeecC
Confidence 799999999999999998887 455555554443
No 336
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=91.29 E-value=0.76 Score=52.48 Aligned_cols=153 Identities=12% Similarity=0.103 Sum_probs=97.7
Q ss_pred cccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecC-----CccEEEEEEec
Q 000473 506 VHKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGH-----TGAVLCLAAHR 580 (1471)
Q Consensus 506 ~~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH-----~~~V~~la~sp 580 (1471)
..|+..++++.+.++.. .++ ...|-.|.+|+.+.. |... .+--++.| +.-|++-.|||
T Consensus 169 NaH~yhiNSiS~NsD~e----t~l-SaDdLrINLWnl~i~---D~sF---------nIVDiKP~nmeeLteVItSaeFhp 231 (460)
T COG5170 169 NAHPYHINSISFNSDKE----TLL-SADDLRINLWNLEII---DGSF---------NIVDIKPHNMEELTEVITSAEFHP 231 (460)
T ss_pred ccceeEeeeeeecCchh----eee-eccceeeeecccccc---CCce---------EEEeccCccHHHHHHHHhhcccCH
Confidence 45778888888666665 333 345555555332211 1111 12223334 34578889999
Q ss_pred CCCCcccCcCCCEEEEEECCCcEEEEECCCCce----------------EEEEeccCCCEEEEEECCCCCCCCCCCEEEE
Q 000473 581 MVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL----------------ITVMHHHVAPVRQIILSPPQTEHPWSDCFLS 644 (1471)
Q Consensus 581 d~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~----------------l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S 644 (1471)
.. ..++.=.+..|.|++-|++...+ ..-|..-...|..+.|+++ |+++++
T Consensus 232 ~~--------cn~fmYSsSkG~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~n------gryIls 297 (460)
T COG5170 232 EM--------CNVFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDN------GRYILS 297 (460)
T ss_pred hH--------cceEEEecCCCcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCC------CcEEEE
Confidence 62 56777778899999999983221 1112334567889999998 889887
Q ss_pred EeCCCcEEEEECCC-CcEEEEecCCCC------------Cc---EEEEEcCCCCEEEEEEcC
Q 000473 645 VGEDFSVALASLET-LRVERMFPGHPN------------YP---AKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 645 ~s~DgsV~lWdl~t-~~~l~~~~gh~~------------~V---~~v~~spdg~~L~sgs~D 690 (1471)
-+. -+|++||++. ..++.+++-|.. .| ..+.|+.|.+.+++|+..
T Consensus 298 Rdy-ltvkiwDvnm~k~pikTi~~h~~l~~~l~d~YEnDaifdkFeisfSgd~~~v~sgsy~ 358 (460)
T COG5170 298 RDY-LTVKIWDVNMAKNPIKTIPMHCDLMDELNDVYENDAIFDKFEISFSGDDKHVLSGSYS 358 (460)
T ss_pred ecc-ceEEEEecccccCCceeechHHHHHHHHHhhhhccceeeeEEEEecCCcccccccccc
Confidence 753 5899999974 457777766642 22 356788888888777665
No 337
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=91.25 E-value=3 Score=51.25 Aligned_cols=112 Identities=11% Similarity=0.031 Sum_probs=70.0
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEE-EEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQ-IILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~-l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~ 669 (1471)
+..++.+..++.+.-+|..+|+.+..+......+.. ..-+|. ..+..++.++.|+.+..+|.++|+.+.......
T Consensus 160 ~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~----v~~~~v~~~~~~g~v~a~d~~~G~~~W~~~~~~ 235 (394)
T PRK11138 160 DGLVLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLRGESAPA----TAFGGAIVGGDNGRVSAVLMEQGQLIWQQRISQ 235 (394)
T ss_pred CCEEEEECCCCEEEEEEccCCCEeeeecCCCCcccccCCCCCE----EECCEEEEEcCCCEEEEEEccCChhhheecccc
Confidence 355666788999999999999998877543211100 001221 013356667789999999999998766543111
Q ss_pred C-------CcEEEEEcC--CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 670 N-------YPAKVVWDC--PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 670 ~-------~V~~v~~sp--dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
. ....+.-+| .+..+++++.+ |.++..|..+|+.+-.
T Consensus 236 ~~~~~~~~~~~~~~~sP~v~~~~vy~~~~~--------g~l~ald~~tG~~~W~ 281 (394)
T PRK11138 236 PTGATEIDRLVDVDTTPVVVGGVVYALAYN--------GNLVALDLRSGQIVWK 281 (394)
T ss_pred CCCccchhcccccCCCcEEECCEEEEEEcC--------CeEEEEECCCCCEEEe
Confidence 0 011111122 25667777777 8999999999987654
No 338
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein. ; PDB: 2OAJ_A.
Probab=90.98 E-value=9 Score=47.21 Aligned_cols=183 Identities=13% Similarity=0.101 Sum_probs=96.9
Q ss_pred ccCccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE--Ee------cCCccEEEEEE
Q 000473 507 HKEKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY--FL------GHTGAVLCLAA 578 (1471)
Q Consensus 507 ~h~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~--l~------gH~~~V~~la~ 578 (1471)
...+.|++++. ++.. .++.|+++|.+.|.+.. ....+.. +. .....|+++.|
T Consensus 84 ~~~g~vtal~~-S~iG----Fvaigy~~G~l~viD~R---------------GPavI~~~~i~~~~~~~~~~~~vt~ieF 143 (395)
T PF08596_consen 84 AKQGPVTALKN-SDIG----FVAIGYESGSLVVIDLR---------------GPAVIYNENIRESFLSKSSSSYVTSIEF 143 (395)
T ss_dssp --S-SEEEEEE--BTS----EEEEEETTSEEEEEETT---------------TTEEEEEEEGGG--T-SS----EEEEEE
T ss_pred ccCCcEeEEec-CCCc----EEEEEecCCcEEEEECC---------------CCeEEeeccccccccccccccCeeEEEE
Confidence 34678999875 5666 49999999999995543 1111111 11 12346888888
Q ss_pred ecCCCCcccCcCCCEEEEEECCCcEEEEECCC---Cce----EEEEeccCCCEEEEE-ECCCCCC--------------C
Q 000473 579 HRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGS---GNL----ITVMHHHVAPVRQII-LSPPQTE--------------H 636 (1471)
Q Consensus 579 spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~t---g~~----l~~~~~H~~~V~~l~-fspd~~~--------------~ 636 (1471)
.-.. -.+..|..-.|+.|...|.+.+|.+.- +.. ......+.++|..+. ++.+... .
T Consensus 144 ~vm~-~~~D~ySSi~L~vGTn~G~v~~fkIlp~~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g 222 (395)
T PF08596_consen 144 SVMT-LGGDGYSSICLLVGTNSGNVLTFKILPSSNGRFSVQFAGATTNHDSPILSIIPINADTGESALATISAMQGLSKG 222 (395)
T ss_dssp EEEE--TTSSSEEEEEEEEETTSEEEEEEEEE-GGG-EEEEEEEEE--SS----EEEEEETTT--B-B-BHHHHHGGGGT
T ss_pred EEEe-cCCCcccceEEEEEeCCCCEEEEEEecCCCCceEEEEeeccccCCCceEEEEEEECCCCCcccCchhHhhccccC
Confidence 6431 111223456899999999999998751 221 222235667777766 4332110 0
Q ss_pred CCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE-----cCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 637 PWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW-----DCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 637 ~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~-----spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
..-+.++.+..+..+|++.+-+.+..+...........+.+ ...+..|++-..| |.|+++.+..-+.
T Consensus 223 ~~i~g~vVvvSe~~irv~~~~~~k~~~K~~~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~--------G~i~i~SLP~Lke 294 (395)
T PF08596_consen 223 ISIPGYVVVVSESDIRVFKPPKSKGAHKSFDDPFLCSSASVVPTISRNGGYCLVCLFNN--------GSIRIYSLPSLKE 294 (395)
T ss_dssp ----EEEEEE-SSEEEEE-TT---EEEEE-SS-EEEEEEEEEEEE-EEEEEEEEEEETT--------SEEEEEETTT--E
T ss_pred CCcCcEEEEEcccceEEEeCCCCcccceeeccccccceEEEEeecccCCceEEEEEECC--------CcEEEEECCCchH
Confidence 11123555566889999999888776554422222334445 2356678888887 9999999998777
Q ss_pred EEEEeCC
Q 000473 712 ERVLRGT 718 (1471)
Q Consensus 712 ~~~l~gH 718 (1471)
+..+.-+
T Consensus 295 i~~~~l~ 301 (395)
T PF08596_consen 295 IKSVSLP 301 (395)
T ss_dssp EEEEE-S
T ss_pred hhcccCC
Confidence 7766644
No 339
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=90.34 E-value=0.47 Score=60.21 Aligned_cols=70 Identities=20% Similarity=0.164 Sum_probs=62.6
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEE
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVA 652 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~ 652 (1471)
+++++|||. .-+|++|=.-|.+.+|...+.+....-..|..+|.-+.|+|+ |.+++|+..-|.|.
T Consensus 62 atSLCWHpe---------~~vLa~gwe~g~~~v~~~~~~e~htv~~th~a~i~~l~wS~~------G~~l~t~d~~g~v~ 126 (1416)
T KOG3617|consen 62 ATSLCWHPE---------EFVLAQGWEMGVSDVQKTNTTETHTVVETHPAPIQGLDWSHD------GTVLMTLDNPGSVH 126 (1416)
T ss_pred hhhhccChH---------HHHHhhccccceeEEEecCCceeeeeccCCCCCceeEEecCC------CCeEEEcCCCceeE
Confidence 677999996 678888988899999999887776677789999999999999 99999999999999
Q ss_pred EEECC
Q 000473 653 LASLE 657 (1471)
Q Consensus 653 lWdl~ 657 (1471)
+|...
T Consensus 127 lwr~d 131 (1416)
T KOG3617|consen 127 LWRYD 131 (1416)
T ss_pred EEEee
Confidence 99875
No 340
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=89.75 E-value=0.84 Score=38.44 Aligned_cols=38 Identities=18% Similarity=0.214 Sum_probs=34.3
Q ss_pred ecCCCCCCceEEEEEEcCCCCeEEEEeCCCcEEEEEcc
Q 000473 9 IWSGTPPSHRVTATSALTQPPTLYTGGSDGSILWWSFS 46 (1471)
Q Consensus 9 lw~~~~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~ 46 (1471)
+++.+.-..+|+++.++|....||.|..||+|.++.++
T Consensus 4 ~~~~k~l~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~ 41 (47)
T PF12894_consen 4 QLGEKNLPSRVSCMSWCPTMDLIALGTEDGEVLVYRLN 41 (47)
T ss_pred eecccCCCCcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence 45667777889999999999999999999999999986
No 341
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=88.99 E-value=11 Score=46.41 Aligned_cols=98 Identities=8% Similarity=-0.052 Sum_probs=66.3
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCC-EEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP-VRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~-V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~ 669 (1471)
+..+..++.|+.+..+|..+|+.+.....-... ..+..+ . +..+..++.||.+..+|.++|+.+....-..
T Consensus 294 ~~~vy~~~~~g~l~ald~~tG~~~W~~~~~~~~~~~sp~v--~------~g~l~v~~~~G~l~~ld~~tG~~~~~~~~~~ 365 (394)
T PRK11138 294 GGRIYLVDQNDRVYALDTRGGVELWSQSDLLHRLLTAPVL--Y------NGYLVVGDSEGYLHWINREDGRFVAQQKVDS 365 (394)
T ss_pred CCEEEEEcCCCeEEEEECCCCcEEEcccccCCCcccCCEE--E------CCEEEEEeCCCEEEEEECCCCCEEEEEEcCC
Confidence 566777889999999999999876544321111 111111 1 4578889999999999999999887765433
Q ss_pred CCcEE-EEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 670 NYPAK-VVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 670 ~~V~~-v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
..+.. ..+ .++.|++++.| |.|+.++.
T Consensus 366 ~~~~s~P~~--~~~~l~v~t~~--------G~l~~~~~ 393 (394)
T PRK11138 366 SGFLSEPVV--ADDKLLIQARD--------GTVYAITR 393 (394)
T ss_pred CcceeCCEE--ECCEEEEEeCC--------ceEEEEeC
Confidence 33322 122 25578888888 99988764
No 342
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.44 E-value=1.5 Score=57.65 Aligned_cols=94 Identities=11% Similarity=0.080 Sum_probs=70.0
Q ss_pred CCEEEEEECCCcEEEEECCCC-ceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSG-NLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHP 669 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg-~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~ 669 (1471)
+.+++-|+..|.|-..|.... .+.+.=..-.++|++++|+-+ |..++.|-.+|.|.+||...++.++.+..|.
T Consensus 99 ~~~ivi~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vtsvafn~d------g~~l~~G~~~G~V~v~D~~~~k~l~~i~e~~ 172 (1206)
T KOG2079|consen 99 VVPIVIGTSHGHVLLSDMTGNLGPLHQNERVQGPVTSVAFNQD------GSLLLAGLGDGHVTVWDMHRAKILKVITEHG 172 (1206)
T ss_pred eeeEEEEcCchhhhhhhhhcccchhhcCCccCCcceeeEecCC------CceeccccCCCcEEEEEccCCcceeeeeecC
Confidence 567888888888888887532 111222223579999999999 9999999999999999999999999888777
Q ss_pred CCcEE---EEEcCCCCEEEEEEcC
Q 000473 670 NYPAK---VVWDCPRGYIACLCRD 690 (1471)
Q Consensus 670 ~~V~~---v~~spdg~~L~sgs~D 690 (1471)
.+.+. +.|..++..++++..-
T Consensus 173 ap~t~vi~v~~t~~nS~llt~D~~ 196 (1206)
T KOG2079|consen 173 APVTGVIFVGRTSQNSKLLTSDTG 196 (1206)
T ss_pred CccceEEEEEEeCCCcEEEEccCC
Confidence 66544 4556666666665544
No 343
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.08 E-value=1.1 Score=59.03 Aligned_cols=81 Identities=10% Similarity=0.006 Sum_probs=62.6
Q ss_pred CEEEEEeCCCcEEEEECCCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 640 DCFLSVGEDFSVALASLETL-RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 640 ~~l~S~s~DgsV~lWdl~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
..++.|++-|.|...|+... ++...-..-.++|++++|+-+|.++..|-.+ |-|.+||+.++..++.+.-|
T Consensus 100 ~~ivi~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~--------G~V~v~D~~~~k~l~~i~e~ 171 (1206)
T KOG2079|consen 100 VPIVIGTSHGHVLLSDMTGNLGPLHQNERVQGPVTSVAFNQDGSLLLAGLGD--------GHVTVWDMHRAKILKVITEH 171 (1206)
T ss_pred eeEEEEcCchhhhhhhhhcccchhhcCCccCCcceeeEecCCCceeccccCC--------CcEEEEEccCCcceeeeeec
Confidence 35677777788888887652 2222222235689999999999999988887 99999999999999999999
Q ss_pred CCCceeeeee
Q 000473 719 ASHSMFDHFC 728 (1471)
Q Consensus 719 ~~~v~~~~~~ 728 (1471)
.+++..+-+.
T Consensus 172 ~ap~t~vi~v 181 (1206)
T KOG2079|consen 172 GAPVTGVIFV 181 (1206)
T ss_pred CCccceEEEE
Confidence 8888776444
No 344
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=86.54 E-value=0.28 Score=60.86 Aligned_cols=113 Identities=19% Similarity=0.243 Sum_probs=76.3
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcE
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSV 651 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV 651 (1471)
.+.+++|-.+ .+++++|.....+.++|++. .+.......+..|..+.+.|-. ++++|+-. |+.|
T Consensus 156 gqns~cwlrd---------~klvlaGm~sr~~~ifdlRq-s~~~~~svnTk~vqG~tVdp~~-----~nY~cs~~-dg~i 219 (783)
T KOG1008|consen 156 GQNSVCWLRD---------TKLVLAGMTSRSVHIFDLRQ-SLDSVSSVNTKYVQGITVDPFS-----PNYFCSNS-DGDI 219 (783)
T ss_pred CccccccccC---------cchhhcccccchhhhhhhhh-hhhhhhhhhhhhcccceecCCC-----CCceeccc-cCce
Confidence 4556777654 78899999999999999872 2333344455567777888832 67888776 9999
Q ss_pred EEEE-CCCC-cEEEEecCCCC----CcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 652 ALAS-LETL-RVERMFPGHPN----YPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 652 ~lWd-l~t~-~~l~~~~gh~~----~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.+|| .+.- .+++.+...+. .+..++|.|... .+++...| .++|+.+|+.
T Consensus 220 AiwD~~rnienpl~~i~~~~N~~~~~l~~~aycPtrtglla~l~Rd-------S~tIrlydi~ 275 (783)
T KOG1008|consen 220 AIWDTYRNIENPLQIILRNENKKPKQLFALAYCPTRTGLLAVLSRD-------SITIRLYDIC 275 (783)
T ss_pred eeccchhhhccHHHHHhhCCCCcccceeeEEeccCCcchhhhhccC-------cceEEEeccc
Confidence 9999 3332 22233222222 488999999765 45555555 2799999985
No 345
>PF04053 Coatomer_WDAD: Coatomer WD associated region; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta'.
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=86.41 E-value=9.4 Score=47.73 Aligned_cols=103 Identities=13% Similarity=0.120 Sum_probs=56.4
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEE
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVA 652 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~ 652 (1471)
-....|.+. ..++.-....+|.++.-...+....+... ..+..+-. |.+|+..+ ++.|.
T Consensus 71 g~~~vw~~~----------n~yAv~~~~~~I~I~kn~~~~~~k~i~~~-~~~~~If~---------G~LL~~~~-~~~i~ 129 (443)
T PF04053_consen 71 GLSFVWSSR----------NRYAVLESSSTIKIYKNFKNEVVKSIKLP-FSVEKIFG---------GNLLGVKS-SDFIC 129 (443)
T ss_dssp -SEEEE-TS----------SEEEEE-TTS-EEEEETTEE-TT-----S-S-EEEEE----------SSSEEEEE-TTEEE
T ss_pred eeEEEEecC----------ccEEEEECCCeEEEEEcCccccceEEcCC-cccceEEc---------CcEEEEEC-CCCEE
Confidence 345677763 44666666888999633222221223221 12444432 44555554 44899
Q ss_pred EEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 653 LASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 653 lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
+||+++++.++.+... +|..|.|++++.+++..+.+ +++|++..
T Consensus 130 ~yDw~~~~~i~~i~v~--~vk~V~Ws~~g~~val~t~~---------~i~il~~~ 173 (443)
T PF04053_consen 130 FYDWETGKLIRRIDVS--AVKYVIWSDDGELVALVTKD---------SIYILKYN 173 (443)
T ss_dssp EE-TTT--EEEEESS---E-EEEEE-TTSSEEEEE-S----------SEEEEEE-
T ss_pred EEEhhHcceeeEEecC--CCcEEEEECCCCEEEEEeCC---------eEEEEEec
Confidence 9999999999998743 38999999999999988876 78888754
No 346
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=85.83 E-value=1.8 Score=36.43 Aligned_cols=34 Identities=18% Similarity=0.266 Sum_probs=29.9
Q ss_pred CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeE
Q 000473 669 PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGAR 711 (1471)
Q Consensus 669 ~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~ 711 (1471)
...|..++|+|..++|+.+..| |.|.++.+ +++.
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~--------g~v~v~Rl-~~qr 44 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTED--------GEVLVYRL-NWQR 44 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECC--------CeEEEEEC-CCcC
Confidence 3468999999999999999999 99999998 6654
No 347
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase (FBPase) into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of FBPase.
Probab=85.53 E-value=9 Score=50.76 Aligned_cols=101 Identities=14% Similarity=0.114 Sum_probs=64.9
Q ss_pred CEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEE---EecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCc
Q 000473 526 YAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQY---FLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCS 602 (1471)
Q Consensus 526 ~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~---l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dgt 602 (1471)
....+|-.+..+.. ||.... ..+.+.. .........|++-.. ..+||.||.+|.
T Consensus 543 e~tflGls~n~lfr--iDpR~~-----------~~k~v~~~~k~Y~~~~~Fs~~aTt~----------~G~iavgs~~G~ 599 (794)
T PF08553_consen 543 EQTFLGLSDNSLFR--IDPRLS-----------GNKLVDSQSKQYSSKNNFSCFATTE----------DGYIAVGSNKGD 599 (794)
T ss_pred CceEEEECCCceEE--eccCCC-----------CCceeeccccccccCCCceEEEecC----------CceEEEEeCCCc
Confidence 35677877777666 553221 1111110 112334566776543 578999999999
Q ss_pred EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEEC
Q 000473 603 IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASL 656 (1471)
Q Consensus 603 I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl 656 (1471)
||++|--..+....|.+-+.||..|..+.| |++++..+ +..+.|++.
T Consensus 600 IRLyd~~g~~AKT~lp~lG~pI~~iDvt~D------GkwilaTc-~tyLlLi~t 646 (794)
T PF08553_consen 600 IRLYDRLGKRAKTALPGLGDPIIGIDVTAD------GKWILATC-KTYLLLIDT 646 (794)
T ss_pred EEeecccchhhhhcCCCCCCCeeEEEecCC------CcEEEEee-cceEEEEEE
Confidence 999994322233445567889999999999 88777665 556777765
No 348
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=85.03 E-value=6.2 Score=50.23 Aligned_cols=102 Identities=15% Similarity=0.139 Sum_probs=72.0
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcce-EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVS-RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRI 605 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~-~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~l 605 (1471)
.++.|+.-|.+.+++-. .++. .....|-.+.+..+.++++ ..+++.|+..|.|.+
T Consensus 47 ~l~~GsS~G~lyl~~R~---------------~~~~~~~~~~~~~~~~~~~~vs~~---------e~lvAagt~~g~V~v 102 (726)
T KOG3621|consen 47 YLAMGSSAGSVYLYNRH---------------TGEMRKLKNEGATGITCVRSVSSV---------EYLVAAGTASGRVSV 102 (726)
T ss_pred eEEEecccceEEEEecC---------------chhhhcccccCccceEEEEEecch---------hHhhhhhcCCceEEe
Confidence 69999999999984322 1111 1112233344445556765 788899999999999
Q ss_pred EECCCCceE-----EEEe-ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC
Q 000473 606 WDLGSGNLI-----TVMH-HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET 658 (1471)
Q Consensus 606 WDl~tg~~l-----~~~~-~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t 658 (1471)
+-+..+..- ..+. .|...|+++.|+++ +..+.+|..-|.|.+-.+.+
T Consensus 103 ~ql~~~~p~~~~~~t~~d~~~~~rVTal~Ws~~------~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 103 FQLNKELPRDLDYVTPCDKSHKCRVTALEWSKN------GMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred ehhhccCCCcceeeccccccCCceEEEEEeccc------ccEEeecCCCceEEEEEech
Confidence 988764321 1121 37789999999999 99999999999999888877
No 349
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=84.24 E-value=21 Score=45.30 Aligned_cols=119 Identities=12% Similarity=0.053 Sum_probs=75.0
Q ss_pred CEEEEEECCCcEEEEECCCCceEEEEeccCCC--EEEEEECCCCCCCCCCCEEEEEe---------CCCcEEEEECCCCc
Q 000473 592 EVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP--VRQIILSPPQTEHPWSDCFLSVG---------EDFSVALASLETLR 660 (1471)
Q Consensus 592 ~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~--V~~l~fspd~~~~~~~~~l~S~s---------~DgsV~lWdl~t~~ 660 (1471)
..++.++.|+.|.-+|.++|+.+.++...... -..+.-+|.- .+..++.++ .++.+.-+|.++|+
T Consensus 111 ~~V~v~~~~g~v~AlD~~TG~~~W~~~~~~~~~~~~~i~ssP~v----~~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~ 186 (488)
T cd00216 111 RKVFFGTFDGRLVALDAETGKQVWKFGNNDQVPPGYTMTGAPTI----VKKLVIIGSSGAEFFACGVRGALRAYDVETGK 186 (488)
T ss_pred CeEEEecCCCeEEEEECCCCCEeeeecCCCCcCcceEecCCCEE----ECCEEEEeccccccccCCCCcEEEEEECCCCc
Confidence 67778889999999999999999877654321 0111112210 023444443 36788999999999
Q ss_pred EEEEecCCCCC--------------------c-EEEEEcCCCCEEEEEEcCCCCC------------CCCCCEEEEEECC
Q 000473 661 VERMFPGHPNY--------------------P-AKVVWDCPRGYIACLCRDHSRT------------SDAVDVLFIWDVK 707 (1471)
Q Consensus 661 ~l~~~~gh~~~--------------------V-~~v~~spdg~~L~sgs~D~sg~------------~D~~gtV~VWDi~ 707 (1471)
.+..+...... | ...+..+.+..++.++.| ++ .+.++.|+-.|.+
T Consensus 187 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~g~~vw~~pa~d~~~g~V~vg~~~--g~~~~~~~~~~~~~~~~~~~l~Ald~~ 264 (488)
T cd00216 187 LLWRFYTTEPDPNAFPTWGPDRQMWGPGGGTSWASPTYDPKTNLVYVGTGN--GSPWNWGGRRTPGDNLYTDSIVALDAD 264 (488)
T ss_pred eeeEeeccCCCcCCCCCCCCCcceecCCCCCccCCeeEeCCCCEEEEECCC--CCCCccCCccCCCCCCceeeEEEEcCC
Confidence 88776542110 1 123455566778887765 11 1112489999999
Q ss_pred CCeEEEEEe
Q 000473 708 TGARERVLR 716 (1471)
Q Consensus 708 tg~~~~~l~ 716 (1471)
||+..-...
T Consensus 265 tG~~~W~~~ 273 (488)
T cd00216 265 TGKVKWFYQ 273 (488)
T ss_pred CCCEEEEee
Confidence 999887654
No 350
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=83.97 E-value=42 Score=42.67 Aligned_cols=103 Identities=13% Similarity=0.142 Sum_probs=64.4
Q ss_pred CCCcEEEEECCCCceEEEEeccCCC--------------------EE-EEEECCCCCCCCCCCEEEEEeCCC--------
Q 000473 599 MDCSIRIWDLGSGNLITVMHHHVAP--------------------VR-QIILSPPQTEHPWSDCFLSVGEDF-------- 649 (1471)
Q Consensus 599 ~DgtI~lWDl~tg~~l~~~~~H~~~--------------------V~-~l~fspd~~~~~~~~~l~S~s~Dg-------- 649 (1471)
.++.+...|..+|+.+..+...... |+ ..++.+. +..+..++.|+
T Consensus 173 ~~g~v~alD~~TG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~g~~vw~~pa~d~~------~g~V~vg~~~g~~~~~~~~ 246 (488)
T cd00216 173 VRGALRAYDVETGKLLWRFYTTEPDPNAFPTWGPDRQMWGPGGGTSWASPTYDPK------TNLVYVGTGNGSPWNWGGR 246 (488)
T ss_pred CCcEEEEEECCCCceeeEeeccCCCcCCCCCCCCCcceecCCCCCccCCeeEeCC------CCEEEEECCCCCCCccCCc
Confidence 4678999999999988777542111 10 1222222 45666666554
Q ss_pred ----------cEEEEECCCCcEEEEecCCCCCc------EEEEEc----CCCC---EEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 650 ----------SVALASLETLRVERMFPGHPNYP------AKVVWD----CPRG---YIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 650 ----------sV~lWdl~t~~~l~~~~gh~~~V------~~v~~s----pdg~---~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
.|.-+|.++|+.+-.+..-.... ....+. -++. .+++++.+ |.++..|.
T Consensus 247 ~~~~~~~~~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~~~--------G~l~ald~ 318 (488)
T cd00216 247 RTPGDNLYTDSIVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAPKN--------GFFYVLDR 318 (488)
T ss_pred cCCCCCCceeeEEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEECCC--------ceEEEEEC
Confidence 78889999999987764211111 001111 1232 57777777 99999999
Q ss_pred CCCeEEEEE
Q 000473 707 KTGARERVL 715 (1471)
Q Consensus 707 ~tg~~~~~l 715 (1471)
++|+.+-..
T Consensus 319 ~tG~~~W~~ 327 (488)
T cd00216 319 TTGKLISAR 327 (488)
T ss_pred CCCcEeeEe
Confidence 999987654
No 351
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=83.13 E-value=0.67 Score=59.48 Aligned_cols=124 Identities=13% Similarity=0.083 Sum_probs=73.4
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEE-----------ECCCCCCCCCCCEEEEEeCCCcEEEEECC--
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQII-----------LSPPQTEHPWSDCFLSVGEDFSVALASLE-- 657 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~-----------fspd~~~~~~~~~l~S~s~DgsV~lWdl~-- 657 (1471)
..++.-+-.+++|++-...+... ..|.+|...+..++ ++|| |..|+..+.||.|++|.+.
T Consensus 195 ~~~ic~~~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpD------Gtv~a~a~~dG~v~f~Qiyi~ 267 (1283)
T KOG1916|consen 195 KVYICYGLKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPD------GTVFAWAISDGSVGFYQIYIT 267 (1283)
T ss_pred cceeeeccCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCC------CcEEEEeecCCccceeeeeee
Confidence 46677777888998866654322 34455766555443 4666 9999999999999999763
Q ss_pred ---CCcEEEEecCCCC-CcEEEEEcCCC---------CEEEEEEcCCCCCCCCCCEEEEEECCCCeEE--------EEEe
Q 000473 658 ---TLRVERMFPGHPN-YPAKVVWDCPR---------GYIACLCRDHSRTSDAVDVLFIWDVKTGARE--------RVLR 716 (1471)
Q Consensus 658 ---t~~~l~~~~gh~~-~V~~v~~spdg---------~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~--------~~l~ 716 (1471)
..+|++.-..|.. +-.|.-++.+. .+++++.. .+..+++|.-...+|+ ....
T Consensus 268 g~~~~rclhewkphd~~p~vC~lc~~~~~~~v~i~~w~~~Itttd-------~nre~k~w~~a~w~Cll~~~~d~v~iV~ 340 (1283)
T KOG1916|consen 268 GKIVHRCLHEWKPHDKHPRVCWLCHKQEILVVSIGKWVLRITTTD-------VNREEKFWAEAPWQCLLDKLIDGVQIVG 340 (1283)
T ss_pred ccccHhhhhccCCCCCCCceeeeeccccccCCccceeEEEEeccc-------CCcceeEeeccchhhhhhhcccceEeec
Confidence 3345555556653 22222222211 24444433 2477999987766654 2333
Q ss_pred CCCCCceeeeee
Q 000473 717 GTASHSMFDHFC 728 (1471)
Q Consensus 717 gH~~~v~~~~~~ 728 (1471)
.|...+.....|
T Consensus 341 p~~~~v~~~~~~ 352 (1283)
T KOG1916|consen 341 PHDGEVTDLSMC 352 (1283)
T ss_pred CCCccccchhhh
Confidence 455455544444
No 352
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities, a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine-rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=82.99 E-value=64 Score=37.22 Aligned_cols=145 Identities=14% Similarity=0.167 Sum_probs=88.2
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
.++.|+++| +.++..+ ......+.. +...|..+...|+ -+.|+.=+ |+.+.++
T Consensus 9 ~L~vGt~~G-l~~~~~~--------------~~~~~~~i~--~~~~I~ql~vl~~---------~~~llvLs-d~~l~~~ 61 (275)
T PF00780_consen 9 RLLVGTEDG-LYVYDLS--------------DPSKPTRIL--KLSSITQLSVLPE---------LNLLLVLS-DGQLYVY 61 (275)
T ss_pred EEEEEECCC-EEEEEec--------------CCccceeEe--ecceEEEEEEecc---------cCEEEEEc-CCccEEE
Confidence 699999999 7764431 011122222 2233999998886 34444443 5999999
Q ss_pred ECCCCceEEE--------------EeccCCCEEEEE-ECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-----cEEEEec
Q 000473 607 DLGSGNLITV--------------MHHHVAPVRQII-LSPPQTEHPWSDCFLSVGEDFSVALASLETL-----RVERMFP 666 (1471)
Q Consensus 607 Dl~tg~~l~~--------------~~~H~~~V~~l~-fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~-----~~l~~~~ 666 (1471)
++..-..... -......+...+ -... .+...+.+...+.|.+|..... +..+.+.
T Consensus 62 ~L~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~-----~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~ 136 (275)
T PF00780_consen 62 DLDSLEPVSTSAPLAFPKSRSLPTKLPETKGVSFFAVNGGH-----EGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEIS 136 (275)
T ss_pred EchhhccccccccccccccccccccccccCCeeEEeecccc-----ccceEEEEEECCEEEEEEEECCcccccceeEEEE
Confidence 9975443321 111223444444 1111 1445556666779999988653 4555554
Q ss_pred CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 667 GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 667 gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
- ...+..++|. ++.++.|..+ ...+.|+.++.....+
T Consensus 137 l-p~~~~~i~~~--~~~i~v~~~~---------~f~~idl~~~~~~~l~ 173 (275)
T PF00780_consen 137 L-PDPPSSIAFL--GNKICVGTSK---------GFYLIDLNTGSPSELL 173 (275)
T ss_pred c-CCCcEEEEEe--CCEEEEEeCC---------ceEEEecCCCCceEEe
Confidence 3 4678999998 6678888765 6899999988765544
No 353
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=81.77 E-value=10 Score=46.97 Aligned_cols=131 Identities=13% Similarity=0.059 Sum_probs=87.0
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCc-CCCEEEEEECCCcEEEEECCCCc--eEEEEeccC----CCEEEEE
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWS-FNEVLVSGSMDCSIRIWDLGSGN--LITVMHHHV----APVRQII 628 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~-~~~~L~SGs~DgtI~lWDl~tg~--~l~~~~~H~----~~V~~l~ 628 (1471)
|++.|+.+....-|.. |+-+.+.|+. ++.+ ....-+.|=.|..|+=||.+-.. .+..-++|. ....|.+
T Consensus 362 DIE~GKIVeEWk~~~d-i~mv~~t~d~---K~~Ql~~e~TlvGLs~n~vfriDpRv~~~~kl~~~q~kqy~~k~nFsc~a 437 (644)
T KOG2395|consen 362 DIERGKIVEEWKFEDD-INMVDITPDF---KFAQLTSEQTLVGLSDNSVFRIDPRVQGKNKLAVVQSKQYSTKNNFSCFA 437 (644)
T ss_pred ecccceeeeEeeccCC-cceeeccCCc---chhcccccccEEeecCCceEEecccccCcceeeeeeccccccccccceee
Confidence 6777887777777766 6777777762 1100 01222445678889999987322 222222332 1234444
Q ss_pred ECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 629 LSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 629 fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
-.. ..+++.||.+|.|||||--..+.-..++|-..+|..|..+.+|++|++.|.. .+.+-++
T Consensus 438 TT~-------sG~IvvgS~~GdIRLYdri~~~AKTAlPgLG~~I~hVdvtadGKwil~Tc~t---------yLlLi~t 499 (644)
T KOG2395|consen 438 TTE-------SGYIVVGSLKGDIRLYDRIGRRAKTALPGLGDAIKHVDVTADGKWILATCKT---------YLLLIDT 499 (644)
T ss_pred ecC-------CceEEEeecCCcEEeehhhhhhhhhcccccCCceeeEEeeccCcEEEEeccc---------EEEEEEE
Confidence 333 3489999999999999974444456788999999999999999999988875 6666654
No 354
>PRK02888 nitrous-oxide reductase; Validated
Probab=81.40 E-value=20 Score=46.19 Aligned_cols=93 Identities=14% Similarity=0.134 Sum_probs=67.3
Q ss_pred CCcEEEEECCC----C-ceEEEEeccCCCEEEEEECCCCCCCCCCCEEE-EEeCCCcEEEEECCCCcE------------
Q 000473 600 DCSIRIWDLGS----G-NLITVMHHHVAPVRQIILSPPQTEHPWSDCFL-SVGEDFSVALASLETLRV------------ 661 (1471)
Q Consensus 600 DgtI~lWDl~t----g-~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~-S~s~DgsV~lWdl~t~~~------------ 661 (1471)
++.|.+.|..+ + +.+..+. -......+.++|| |++++ ++..+.+|.+.|+++.+.
T Consensus 295 gn~V~VID~~t~~~~~~~v~~yIP-VGKsPHGV~vSPD------GkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~v 367 (635)
T PRK02888 295 GSKVPVVDGRKAANAGSALTRYVP-VPKNPHGVNTSPD------GKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAV 367 (635)
T ss_pred CCEEEEEECCccccCCcceEEEEE-CCCCccceEECCC------CCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceE
Confidence 67899999988 3 3444443 3456788999999 77655 455699999999987552
Q ss_pred EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 662 ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 662 l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+.+.+- .......+|+++|....+-.-| ..|-.||+.+
T Consensus 368 vaevev-GlGPLHTaFDg~G~aytslf~d--------sqv~kwn~~~ 405 (635)
T PRK02888 368 VAEPEL-GLGPLHTAFDGRGNAYTTLFLD--------SQIVKWNIEA 405 (635)
T ss_pred EEeecc-CCCcceEEECCCCCEEEeEeec--------ceeEEEehHH
Confidence 333332 2235678999999877777777 8999999976
No 355
>PRK02888 nitrous-oxide reductase; Validated
Probab=79.97 E-value=24 Score=45.50 Aligned_cols=106 Identities=12% Similarity=0.040 Sum_probs=63.6
Q ss_pred EEEEECCCcEEEEEC---CCCceEEEEeccCCCEEEEEECCCC--CCCCCCCEEEEEeCCCcEEEEECCC-----CcEEE
Q 000473 594 LVSGSMDCSIRIWDL---GSGNLITVMHHHVAPVRQIILSPPQ--TEHPWSDCFLSVGEDFSVALASLET-----LRVER 663 (1471)
Q Consensus 594 L~SGs~DgtI~lWDl---~tg~~l~~~~~H~~~V~~l~fspd~--~~~~~~~~l~S~s~DgsV~lWdl~t-----~~~l~ 663 (1471)
.+..+.||..-.... +.+..+..+...... ..+.|++.. ...++|++... .++.|.+.|.++ .+.+.
T Consensus 239 ~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d-~~vvfni~~iea~vkdGK~~~V--~gn~V~VID~~t~~~~~~~v~~ 315 (635)
T PRK02888 239 NVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERD-WVVVFNIARIEEAVKAGKFKTI--GGSKVPVVDGRKAANAGSALTR 315 (635)
T ss_pred cceECCCCCEEEEeccCcccCcceeeeccccCc-eEEEEchHHHHHhhhCCCEEEE--CCCEEEEEECCccccCCcceEE
Confidence 334444555444332 344444444332222 445555431 11233554433 367899999998 45556
Q ss_pred EecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 664 MFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 664 ~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
.++- ...+..+.++|||+++++++.- +.+|.|.|+.+.+
T Consensus 316 yIPV-GKsPHGV~vSPDGkylyVankl-------S~tVSVIDv~k~k 354 (635)
T PRK02888 316 YVPV-PKNPHGVNTSPDGKYFIANGKL-------SPTVTVIDVRKLD 354 (635)
T ss_pred EEEC-CCCccceEECCCCCEEEEeCCC-------CCcEEEEEChhhh
Confidence 5553 3457889999999999987764 2799999998754
No 356
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database, peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes.
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This entry represents the beta-propeller domain found at the N terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=79.55 E-value=85 Score=38.72 Aligned_cols=116 Identities=18% Similarity=0.179 Sum_probs=73.3
Q ss_pred EEEEEEecCCCCcccCcCCCEEE-EEECCC----cEEEEECCCCceEEE-EeccCCCEEEEEECCCCCCCCCCCEEEEEe
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLV-SGSMDC----SIRIWDLGSGNLITV-MHHHVAPVRQIILSPPQTEHPWSDCFLSVG 646 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~-SGs~Dg----tI~lWDl~tg~~l~~-~~~H~~~V~~l~fspd~~~~~~~~~l~S~s 646 (1471)
+....++|+ +++++ +-+..| ++++.|+.+|+.+.. +..- . -..+.|.++ ++.|+...
T Consensus 126 ~~~~~~Spd---------g~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~-~-~~~~~W~~d------~~~~~y~~ 188 (414)
T PF02897_consen 126 LGGFSVSPD---------GKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENP-K-FSSVSWSDD------GKGFFYTR 188 (414)
T ss_dssp EEEEEETTT---------SSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEE-E-SEEEEECTT------SSEEEEEE
T ss_pred eeeeeECCC---------CCEEEEEecCCCCceEEEEEEECCCCcCcCCccccc-c-cceEEEeCC------CCEEEEEE
Confidence 345677887 56544 445444 599999999987643 2221 1 123899998 77766665
Q ss_pred CCC-----------cEEEEECCCCcE--EEEecCCCCC--cEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 647 EDF-----------SVALASLETLRV--ERMFPGHPNY--PAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 647 ~Dg-----------sV~lWdl~t~~~--l~~~~gh~~~--V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
.|. .|.+|.+.+... ...+...... ...+..++|+++|+..+.. +.. ...+++-|...+
T Consensus 189 ~~~~~~~~~~~~~~~v~~~~~gt~~~~d~lvfe~~~~~~~~~~~~~s~d~~~l~i~~~~--~~~--~s~v~~~d~~~~ 262 (414)
T PF02897_consen 189 FDEDQRTSDSGYPRQVYRHKLGTPQSEDELVFEEPDEPFWFVSVSRSKDGRYLFISSSS--GTS--ESEVYLLDLDDG 262 (414)
T ss_dssp CSTTTSS-CCGCCEEEEEEETTS-GGG-EEEEC-TTCTTSEEEEEE-TTSSEEEEEEES--SSS--EEEEEEEECCCT
T ss_pred eCcccccccCCCCcEEEEEECCCChHhCeeEEeecCCCcEEEEEEecCcccEEEEEEEc--ccc--CCeEEEEecccc
Confidence 433 378888876542 3455544433 5688899999999876654 111 157899999875
No 357
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=79.49 E-value=0.71 Score=59.30 Aligned_cols=133 Identities=11% Similarity=0.051 Sum_probs=88.5
Q ss_pred EecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC--CCceE-----EEEeccCCCEEEEEECCCCCCCCC
Q 000473 566 FLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG--SGNLI-----TVMHHHVAPVRQIILSPPQTEHPW 638 (1471)
Q Consensus 566 l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~--tg~~l-----~~~~~H~~~V~~l~fspd~~~~~~ 638 (1471)
++|..|.|..++|.+.. ...+. -.=|...|||+. .|+.. +........+.-|.|.|-.+.
T Consensus 128 ~kgf~G~v~dl~fah~~--------~pk~~--~~vg~lfVy~vd~l~G~iq~~l~v~~~~p~gs~~~~V~wcp~~~~--- 194 (1283)
T KOG1916|consen 128 AKGFPGGVGDLQFAHTK--------CPKGR--RLVGELFVYDVDVLQGEIQPQLEVTPITPYGSDPQLVSWCPIAVN--- 194 (1283)
T ss_pred HhcCCCCcccccccccC--------ChHHH--HHhhhhheeehHhhccccccceEEeecCcCCCCcceeeecccccc---
Confidence 46778889999985531 11121 233568899986 45433 333334556677777775322
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEE-----------EcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVV-----------WDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~-----------~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.-+++.+-.+++|++.+..+... ..|.+|..++..++ .+|||..++.+|.| |.++.|.+.
T Consensus 195 ~~~ic~~~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~d--------G~v~f~Qiy 265 (1283)
T KOG1916|consen 195 KVYICYGLKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISD--------GSVGFYQIY 265 (1283)
T ss_pred cceeeeccCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecC--------Cccceeeee
Confidence 46888888999999998865432 56677887666553 59999999999999 888888764
Q ss_pred -CC----eEEEEEeCCCC
Q 000473 708 -TG----ARERVLRGTAS 720 (1471)
Q Consensus 708 -tg----~~~~~l~gH~~ 720 (1471)
+| +|++....|..
T Consensus 266 i~g~~~~rclhewkphd~ 283 (1283)
T KOG1916|consen 266 ITGKIVHRCLHEWKPHDK 283 (1283)
T ss_pred eeccccHhhhhccCCCCC
Confidence 33 34455566663
No 358
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=78.79 E-value=39 Score=45.99 Aligned_cols=107 Identities=12% Similarity=0.207 Sum_probs=67.1
Q ss_pred CCEEEE-----EECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe---CCCcEEEEECCC---C
Q 000473 591 NEVLVS-----GSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG---EDFSVALASLET---L 659 (1471)
Q Consensus 591 ~~~L~S-----Gs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s---~DgsV~lWdl~t---~ 659 (1471)
|++++. ...-..|++||-+ |.+-.+-....+.=.+++|-|. |..+++.. .|..|.++.-+. |
T Consensus 207 g~~fAVs~~~~~~~~RkirV~drE-g~Lns~se~~~~l~~~LsWkPs------gs~iA~iq~~~sd~~IvffErNGL~hg 279 (1265)
T KOG1920|consen 207 GEYFAVSFVESETGTRKIRVYDRE-GALNSTSEPVEGLQHSLSWKPS------GSLIAAIQCKTSDSDIVFFERNGLRHG 279 (1265)
T ss_pred CcEEEEEEEeccCCceeEEEeccc-chhhcccCcccccccceeecCC------CCeEeeeeecCCCCcEEEEecCCcccc
Confidence 777776 3333899999976 5443322223333457888887 88888764 466788887442 2
Q ss_pred cEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 660 RVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
.-...++....+|..++|+.++..|++--.+ .. ...|++|-+.+.
T Consensus 280 ~f~l~~p~de~~ve~L~Wns~sdiLAv~~~~----~e-~~~v~lwt~~Ny 324 (1265)
T KOG1920|consen 280 EFVLPFPLDEKEVEELAWNSNSDILAVVTSN----LE-NSLVQLWTTGNY 324 (1265)
T ss_pred ccccCCcccccchheeeecCCCCceeeeecc----cc-cceEEEEEecCe
Confidence 2222233344458999999999999873222 00 145999987664
No 359
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins.; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=78.08 E-value=12 Score=48.35 Aligned_cols=31 Identities=29% Similarity=0.527 Sum_probs=27.9
Q ss_pred CCCEEEEEeCCCeEEEEEcCCCeEEEeeeCC
Q 000473 101 DNGALISACTDGVLCVWSRSSGHCRRRRKLP 131 (1471)
Q Consensus 101 d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~ 131 (1471)
+..+|++-+.|++||+||+.+++|+....+.
T Consensus 229 ~~~~l~tl~~D~~LRiW~l~t~~~~~~~~~~ 259 (547)
T PF11715_consen 229 DDTFLFTLSRDHTLRIWSLETGQCLATIDLL 259 (547)
T ss_dssp TTTEEEEEETTSEEEEEETTTTCEEEEEETT
T ss_pred CCCEEEEEeCCCeEEEEECCCCeEEEEeccc
Confidence 6789999999999999999999999887654
No 360
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source, and it catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=78.04 E-value=7.5 Score=46.33 Aligned_cols=18 Identities=11% Similarity=0.349 Sum_probs=7.0
Q ss_pred EEEEEcCCCeEEEeeeCC
Q 000473 114 LCVWSRSSGHCRRRRKLP 131 (1471)
Q Consensus 114 I~VWdv~~G~ci~~~~l~ 131 (1471)
|.+||..+-.....+.+|
T Consensus 69 v~~~D~~TL~~~~EI~iP 86 (342)
T PF06433_consen 69 VEIWDTQTLSPTGEIEIP 86 (342)
T ss_dssp EEEEETTTTEEEEEEEET
T ss_pred EEEEecCcCcccceEecC
Confidence 344444443333333333
No 361
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=76.99 E-value=4.9 Score=49.93 Aligned_cols=69 Identities=20% Similarity=0.294 Sum_probs=53.0
Q ss_pred eecCCCCceEEeecCcCcccCceEEEEECccccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecCCcEE
Q 000473 1316 VSLNDTSTKLAVGDAIGDIKKASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEHGLMI 1395 (1471)
Q Consensus 1316 v~~~~~tqrlavg~~~g~~~~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~~~~~ 1395 (1471)
-+|++.++|||+-.. +. .+-.|.+||+.+.++.+..++-|.- +.=+|||||+.||=-|..++.=
T Consensus 243 P~fspDG~~l~f~~~-rd-g~~~iy~~dl~~~~~~~Lt~~~gi~--------------~~Ps~spdG~~ivf~Sdr~G~p 306 (425)
T COG0823 243 PAFSPDGSKLAFSSS-RD-GSPDIYLMDLDGKNLPRLTNGFGIN--------------TSPSWSPDGSKIVFTSDRGGRP 306 (425)
T ss_pred ccCCCCCCEEEEEEC-CC-CCccEEEEcCCCCcceecccCCccc--------------cCccCCCCCCEEEEEeCCCCCc
Confidence 367889999998543 22 2567899999999888855543222 2557999999999999999999
Q ss_pred EEEec
Q 000473 1396 RWWSL 1400 (1471)
Q Consensus 1396 ~~w~~ 1400 (1471)
.||.+
T Consensus 307 ~I~~~ 311 (425)
T COG0823 307 QIYLY 311 (425)
T ss_pred ceEEE
Confidence 99977
No 362
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=76.48 E-value=22 Score=44.33 Aligned_cols=122 Identities=20% Similarity=0.157 Sum_probs=72.0
Q ss_pred EEEEecCCCCcccCcCCCEEEEEECCCc--EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-CCc-
Q 000473 575 CLAAHRMVGTAKGWSFNEVLVSGSMDCS--IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-DFS- 650 (1471)
Q Consensus 575 ~la~spd~~~~~~~~~~~~L~SGs~Dgt--I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-Dgs- 650 (1471)
.-+|+|| | ..++++...|+. |.+.|+.++. +..+..-.+.-..=.|+|+ |+.++-.++ .|.
T Consensus 242 ~P~fspD-----G---~~l~f~~~rdg~~~iy~~dl~~~~-~~~Lt~~~gi~~~Ps~spd------G~~ivf~Sdr~G~p 306 (425)
T COG0823 242 APAFSPD-----G---SKLAFSSSRDGSPDIYLMDLDGKN-LPRLTNGFGINTSPSWSPD------GSKIVFTSDRGGRP 306 (425)
T ss_pred CccCCCC-----C---CEEEEEECCCCCccEEEEcCCCCc-ceecccCCccccCccCCCC------CCEEEEEeCCCCCc
Confidence 4578887 2 567778888885 4556776665 3334333333334456777 887776653 344
Q ss_pred -EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 651 -VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 651 -V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
|.+.|.+.....+ +.-....-..-.|+|||++|+..+.. ++.-.|.+.|+.++..++.++.
T Consensus 307 ~I~~~~~~g~~~~r-iT~~~~~~~~p~~SpdG~~i~~~~~~-----~g~~~i~~~~~~~~~~~~~lt~ 368 (425)
T COG0823 307 QIYLYDLEGSQVTR-LTFSGGGNSNPVWSPDGDKIVFESSS-----GGQWDIDKNDLASGGKIRILTS 368 (425)
T ss_pred ceEEECCCCCceeE-eeccCCCCcCccCCCCCCEEEEEecc-----CCceeeEEeccCCCCcEEEccc
Confidence 4555666555433 22222222277899999999887643 1112377777777665666554
No 363
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=73.06 E-value=69 Score=41.70 Aligned_cols=102 Identities=22% Similarity=0.300 Sum_probs=69.9
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEe-ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC---------CCc
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMH-HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE---------TLR 660 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~-~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~---------t~~ 660 (1471)
++..+.-+.-..+.+||...+.+...-. ...+.|..+.|... |+++.++++|-.+.|.++.-. +..
T Consensus 41 ~k~a~V~~~~~~LtIWD~~~~~lE~~~~f~~~~~I~dLDWtst----~d~qsiLaVGf~~~v~l~~Q~R~dy~~~~p~w~ 116 (631)
T PF12234_consen 41 KKIAVVDSSRSELTIWDTRSGVLEYEESFSEDDPIRDLDWTST----PDGQSILAVGFPHHVLLYTQLRYDYTNKGPSWA 116 (631)
T ss_pred CcEEEEECCCCEEEEEEcCCcEEEEeeeecCCCceeeceeeec----CCCCEEEEEEcCcEEEEEEccchhhhcCCcccc
Confidence 4444444445589999999887543222 45678999998643 459999999999999999652 122
Q ss_pred EEEEe--cCCC-CCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 661 VERMF--PGHP-NYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 661 ~l~~~--~gh~-~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+++.+ ..|+ .+|.+..|.++|.++ +|+. +.++|+|-
T Consensus 117 ~i~~i~i~~~T~h~Igds~Wl~~G~Lv-V~sG---------Nqlfv~dk 155 (631)
T PF12234_consen 117 PIRKIDISSHTPHPIGDSIWLKDGTLV-VGSG---------NQLFVFDK 155 (631)
T ss_pred eeEEEEeecCCCCCccceeEecCCeEE-EEeC---------CEEEEECC
Confidence 33332 3444 579999999988655 4554 47999874
No 364
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=71.81 E-value=8.6 Score=46.62 Aligned_cols=83 Identities=20% Similarity=0.272 Sum_probs=57.8
Q ss_pred CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEe-cCC----------
Q 000473 600 DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMF-PGH---------- 668 (1471)
Q Consensus 600 DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~-~gh---------- 668 (1471)
.+.+.++|+.+++....... ...+....|+|+ |+.++-+. |+.|.++++.++...+.- .|-
T Consensus 22 ~~~y~i~d~~~~~~~~l~~~-~~~~~~~~~sP~------g~~~~~v~-~~nly~~~~~~~~~~~lT~dg~~~i~nG~~dw 93 (353)
T PF00930_consen 22 KGDYYIYDIETGEITPLTPP-PPKLQDAKWSPD------GKYIAFVR-DNNLYLRDLATGQETQLTTDGEPGIYNGVPDW 93 (353)
T ss_dssp EEEEEEEETTTTEEEESS-E-ETTBSEEEE-SS------STEEEEEE-TTEEEEESSTTSEEEESES--TTTEEESB--H
T ss_pred ceeEEEEecCCCceEECcCC-ccccccceeecC------CCeeEEEe-cCceEEEECCCCCeEEeccccceeEEcCccce
Confidence 35688999999765433333 567889999999 89988886 678999998877554322 220
Q ss_pred ------CCCcEEEEEcCCCCEEEEEEcC
Q 000473 669 ------PNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 669 ------~~~V~~v~~spdg~~L~sgs~D 690 (1471)
-+.-..+-||||+++|+..-.|
T Consensus 94 vyeEEv~~~~~~~~WSpd~~~la~~~~d 121 (353)
T PF00930_consen 94 VYEEEVFDRRSAVWWSPDSKYLAFLRFD 121 (353)
T ss_dssp HHHHHTSSSSBSEEE-TTSSEEEEEEEE
T ss_pred eccccccccccceEECCCCCEEEEEEEC
Confidence 1123568899999999998877
No 365
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=71.81 E-value=77 Score=38.94 Aligned_cols=117 Identities=13% Similarity=0.131 Sum_probs=84.8
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCE-EEEEe--CCCc
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDC-FLSVG--EDFS 650 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~-l~S~s--~Dgs 650 (1471)
..++++++ ..+..+....+..|.+.|..+.+..+....-. .-..++++|+ ++. .++-. .+++
T Consensus 77 ~~i~v~~~--------~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~-~P~~~~~~~~------~~~vYV~n~~~~~~~ 141 (381)
T COG3391 77 AGVAVNPA--------GNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL-GPVGLAVDPD------GKYVYVANAGNGNNT 141 (381)
T ss_pred cceeeCCC--------CCeEEEecCCCCeEEEEcCcccceeeEeeecc-CCceEEECCC------CCEEEEEecccCCce
Confidence 45667765 14577777778999999988777665554322 4578899998 654 44444 3799
Q ss_pred EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEE
Q 000473 651 VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARER 713 (1471)
Q Consensus 651 V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~ 713 (1471)
+.+.|-.+++.......-..+ ..++++|+|.++.+...+ ++.|.+.|..+....+
T Consensus 142 vsvid~~t~~~~~~~~vG~~P-~~~a~~p~g~~vyv~~~~-------~~~v~vi~~~~~~v~~ 196 (381)
T COG3391 142 VSVIDAATNKVTATIPVGNTP-TGVAVDPDGNKVYVTNSD-------DNTVSVIDTSGNSVVR 196 (381)
T ss_pred EEEEeCCCCeEEEEEecCCCc-ceEEECCCCCeEEEEecC-------CCeEEEEeCCCcceec
Confidence 999999999988876544444 899999999988877643 2899999977665554
No 366
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=71.39 E-value=1.1e+02 Score=37.60 Aligned_cols=123 Identities=15% Similarity=0.160 Sum_probs=86.2
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEE--CCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE-EEEeCC
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGS--MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF-LSVGED 648 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs--~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l-~S~s~D 648 (1471)
.-..+++.|+ .+...++-. .++++.+.|-.+++.+.....-..+ ..+++.|+ |+.+ ++-..+
T Consensus 117 ~P~~~~~~~~--------~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~P-~~~a~~p~------g~~vyv~~~~~ 181 (381)
T COG3391 117 GPVGLAVDPD--------GKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNTP-TGVAVDPD------GNKVYVTNSDD 181 (381)
T ss_pred CCceEEECCC--------CCEEEEEecccCCceEEEEeCCCCeEEEEEecCCCc-ceEEECCC------CCeEEEEecCC
Confidence 4457888887 134444444 3789999999998888775554455 88999999 7744 444578
Q ss_pred CcEEEEECCCCcEEE-E---ecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 649 FSVALASLETLRVER-M---FPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 649 gsV~lWdl~t~~~l~-~---~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
+.|.+.|.+.....+ . ...-......+.++|+|.++.+.... ++ ++.+.+-|..++.....
T Consensus 182 ~~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~---~~--~~~v~~id~~~~~v~~~ 246 (381)
T COG3391 182 NTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVANDG---SG--SNNVLKIDTATGNVTAT 246 (381)
T ss_pred CeEEEEeCCCcceeccccccccccCCCCceEEECCCCCEEEEEecc---CC--CceEEEEeCCCceEEEe
Confidence 999999987766553 1 01122345789999999988887765 11 26899999998876654
No 367
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=70.86 E-value=38 Score=40.31 Aligned_cols=112 Identities=14% Similarity=0.174 Sum_probs=72.2
Q ss_pred cCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCc-eEEEEe-ccCCCEEEEEECCCCCCCCCCCEEEEE
Q 000473 568 GHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGN-LITVMH-HHVAPVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 568 gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~-~l~~~~-~H~~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
...++|+++..-. +. |+.+. ...|.+|++...+ +...-. .....+.++... +++++.|
T Consensus 86 ~~~g~V~ai~~~~----------~~-lv~~~-g~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~--------~~~I~vg 145 (321)
T PF03178_consen 86 EVKGPVTAICSFN----------GR-LVVAV-GNKLYVYDLDNSKTLLKKAFYDSPFYITSLSVF--------KNYILVG 145 (321)
T ss_dssp EESS-EEEEEEET----------TE-EEEEE-TTEEEEEEEETTSSEEEEEEE-BSSSEEEEEEE--------TTEEEEE
T ss_pred eecCcceEhhhhC----------CE-EEEee-cCEEEEEEccCcccchhhheecceEEEEEEecc--------ccEEEEE
Confidence 3468899998652 44 44443 4789999998877 443322 223367777654 4589999
Q ss_pred eCCCcEEEEECCC-CcEEEEecC--CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 646 GEDFSVALASLET-LRVERMFPG--HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 646 s~DgsV~lWdl~t-~~~l~~~~g--h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
...+.+.++..+. .+.+..+.. ....++++.|-++++.++++..+ |.+.++...
T Consensus 146 D~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~--------gnl~~l~~~ 202 (321)
T PF03178_consen 146 DAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKD--------GNLFVLRYN 202 (321)
T ss_dssp ESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETT--------SEEEEEEE-
T ss_pred EcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCC--------CeEEEEEEC
Confidence 9999999886543 332333322 34468888898777788888777 999999875
No 368
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=70.51 E-value=15 Score=35.43 Aligned_cols=50 Identities=24% Similarity=0.347 Sum_probs=34.5
Q ss_pred CceEEEEECccccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeE-EEEecCCcEEEEEe
Q 000473 1336 KASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGL-VAFSEHGLMIRWWS 1399 (1471)
Q Consensus 1336 ~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l-~~~s~~~~~~~~w~ 1399 (1471)
+|.+.-||.+|. +..||-. | -.--..|++||||.+| ++=...-...|.|-
T Consensus 36 ~GRll~ydp~t~-~~~vl~~----~---------L~fpNGVals~d~~~vlv~Et~~~Ri~rywl 86 (89)
T PF03088_consen 36 TGRLLRYDPSTK-ETTVLLD----G---------LYFPNGVALSPDESFVLVAETGRYRILRYWL 86 (89)
T ss_dssp -EEEEEEETTTT-EEEEEEE----E---------ESSEEEEEE-TTSSEEEEEEGGGTEEEEEES
T ss_pred CcCEEEEECCCC-eEEEehh----C---------CCccCeEEEcCCCCEEEEEeccCceEEEEEE
Confidence 889999999998 7776662 1 3345889999999954 55555556666763
No 369
>PRK13616 lipoprotein LpqB; Provisional
Probab=69.07 E-value=33 Score=44.66 Aligned_cols=103 Identities=14% Similarity=0.120 Sum_probs=60.3
Q ss_pred ccEEEEEEecCCCCcccCcCCCEEEEEE------CCCc--EEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEE
Q 000473 571 GAVLCLAAHRMVGTAKGWSFNEVLVSGS------MDCS--IRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 571 ~~V~~la~spd~~~~~~~~~~~~L~SGs------~Dgt--I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l 642 (1471)
..+...+++|+ ++.++--- .|.. +.+++. .+.......+ ...+.-.|+|+ |+.+
T Consensus 350 ~~vsspaiSpd---------G~~vA~v~~~~~~~~d~~s~Lwv~~~-gg~~~~lt~g--~~~t~PsWspD------G~~l 411 (591)
T PRK13616 350 GNITSAALSRS---------GRQVAAVVTLGRGAPDPASSLWVGPL-GGVAVQVLEG--HSLTRPSWSLD------ADAV 411 (591)
T ss_pred cCcccceECCC---------CCEEEEEEeecCCCCCcceEEEEEeC-CCcceeeecC--CCCCCceECCC------CCce
Confidence 45778889987 55444333 3444 444454 2322222222 23777889998 7766
Q ss_pred EEEeCC------------CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEE
Q 000473 643 LSVGED------------FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFI 703 (1471)
Q Consensus 643 ~S~s~D------------gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~V 703 (1471)
.+.+.. +.+.+.+++.++... .....|..+.|||||..++... + |.|+|
T Consensus 412 w~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~-~--------g~v~V 472 (591)
T PRK13616 412 WVVVDGNTVVRVIRDPATGQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMII-G--------GKVYL 472 (591)
T ss_pred EEEecCcceEEEeccCCCceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEE-C--------CEEEE
Confidence 666432 233333444443322 2345699999999999988755 3 67777
No 370
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=69.04 E-value=1.1e+02 Score=35.65 Aligned_cols=120 Identities=14% Similarity=0.021 Sum_probs=73.1
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCC--cEEEEECCCCceEEEEeccC-CCEEEEEECCCCCCCCCCCEEEEEeCC
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDC--SIRIWDLGSGNLITVMHHHV-APVRQIILSPPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~Dg--tI~lWDl~tg~~l~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~s~D 648 (1471)
..-.+.|..+ +.++-|.+.-| .|+.+|+.+|+.+....-.. -.-..+.... ++.+.-.-.+
T Consensus 46 FTQGL~~~~~---------g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~-------d~l~qLTWk~ 109 (264)
T PF05096_consen 46 FTQGLEFLDD---------GTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITILG-------DKLYQLTWKE 109 (264)
T ss_dssp EEEEEEEEET---------TEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEEET-------TEEEEEESSS
T ss_pred cCccEEecCC---------CEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEEEC-------CEEEEEEecC
Confidence 3456777554 78888888777 89999999998765433211 1111222222 2233334457
Q ss_pred CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 649 FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 649 gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
+..-+||.++.+.+.+++-. ..=+.++ .|++.|+..... ..++++|.++.+..+.+.-.
T Consensus 110 ~~~f~yd~~tl~~~~~~~y~-~EGWGLt--~dg~~Li~SDGS--------~~L~~~dP~~f~~~~~i~V~ 168 (264)
T PF05096_consen 110 GTGFVYDPNTLKKIGTFPYP-GEGWGLT--SDGKRLIMSDGS--------SRLYFLDPETFKEVRTIQVT 168 (264)
T ss_dssp SEEEEEETTTTEEEEEEE-S-SS--EEE--ECSSCEEEE-SS--------SEEEEE-TTT-SEEEEEE-E
T ss_pred CeEEEEccccceEEEEEecC-CcceEEE--cCCCEEEEECCc--------cceEEECCcccceEEEEEEE
Confidence 78899999999999888643 4456666 345555543333 69999999999888877643
No 371
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=68.92 E-value=38 Score=41.92 Aligned_cols=87 Identities=18% Similarity=0.219 Sum_probs=60.7
Q ss_pred EEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE----cCC----------
Q 000473 615 TVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW----DCP---------- 680 (1471)
Q Consensus 615 ~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~----spd---------- 680 (1471)
..|......+.++..+|. +++.+....=|.|.|+|+.++..+++..|-.+. .+.| .+.
T Consensus 301 ~~l~D~~R~~~~i~~sP~------~~laA~tDslGRV~LiD~~~~~vvrmWKGYRdA--qc~wi~~~~~~~~~~~~~~~~ 372 (415)
T PF14655_consen 301 FGLPDSKREGESICLSPS------GRLAAVTDSLGRVLLIDVARGIVVRMWKGYRDA--QCGWIEVPEEGDRDRSNSNSP 372 (415)
T ss_pred EeeccCCceEEEEEECCC------CCEEEEEcCCCcEEEEECCCChhhhhhccCccc--eEEEEEeeccccccccccccc
Confidence 344455556888999998 787777766699999999999999988876552 1222 111
Q ss_pred ------CCEEEE-EEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 681 ------RGYIAC-LCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 681 ------g~~L~s-gs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
..+|+. +-.- |.|.||++++|..+..++-
T Consensus 373 ~~~~~~~l~LvIyaprR--------g~lEvW~~~~g~Rv~a~~v 408 (415)
T PF14655_consen 373 KSSSRFALFLVIYAPRR--------GILEVWSMRQGPRVAAFNV 408 (415)
T ss_pred CCCCcceEEEEEEeccC--------CeEEEEecCCCCEEEEEEe
Confidence 123332 2223 8999999999998877653
No 372
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=68.71 E-value=88 Score=36.14 Aligned_cols=112 Identities=9% Similarity=-0.005 Sum_probs=69.8
Q ss_pred EEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccC-CCEEEEEECCCCCCCCCCCEE
Q 000473 564 QYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHV-APVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 564 ~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~-~~V~~l~fspd~~~~~~~~~l 642 (1471)
+.+.|-...+..|+|.|+ .+.+++.....+.|.-.|. +|+.++++.-.. +-...|++.-+ +.++
T Consensus 15 ~~l~g~~~e~SGLTy~pd--------~~tLfaV~d~~~~i~els~-~G~vlr~i~l~g~~D~EgI~y~g~------~~~v 79 (248)
T PF06977_consen 15 KPLPGILDELSGLTYNPD--------TGTLFAVQDEPGEIYELSL-DGKVLRRIPLDGFGDYEGITYLGN------GRYV 79 (248)
T ss_dssp EE-TT--S-EEEEEEETT--------TTEEEEEETTTTEEEEEET-T--EEEEEE-SS-SSEEEEEE-ST------TEEE
T ss_pred eECCCccCCccccEEcCC--------CCeEEEEECCCCEEEEEcC-CCCEEEEEeCCCCCCceeEEEECC------CEEE
Confidence 345565666999999997 2667888888888888886 588887775443 56788888766 5555
Q ss_pred EEEeCCCcEEEEECCCCc------EEEEec-----CCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 643 LSVGEDFSVALASLETLR------VERMFP-----GHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~------~l~~~~-----gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
++--.++.+.+.++.... ....+. .+...+..++|+|.++.|+++.+.
T Consensus 80 l~~Er~~~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~ 138 (248)
T PF06977_consen 80 LSEERDQRLYIFTIDDDTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKER 138 (248)
T ss_dssp EEETTTTEEEEEEE----TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEES
T ss_pred EEEcCCCcEEEEEEeccccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCC
Confidence 554458999888883211 112221 234468999999998888888776
No 373
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=68.41 E-value=68 Score=38.28 Aligned_cols=114 Identities=8% Similarity=-0.033 Sum_probs=71.0
Q ss_pred ceEEEEecCCccEEEEEEecCCCCcccCcCCC-EEEEEECCCcEEEEECCC--Cc----e-EEEEeccCCCEEEEEECCC
Q 000473 561 VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNE-VLVSGSMDCSIRIWDLGS--GN----L-ITVMHHHVAPVRQIILSPP 632 (1471)
Q Consensus 561 ~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~-~L~SGs~DgtI~lWDl~t--g~----~-l~~~~~H~~~V~~l~fspd 632 (1471)
...+.+.+|-..-+.|+|+|| ++ +.++=+..+.|.-+++.. +. . ...+..+.+.--.++...+
T Consensus 153 ~~~~l~~~~~~~~NGla~SpD---------g~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDad 223 (307)
T COG3386 153 GVVRLLDDDLTIPNGLAFSPD---------GKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDAD 223 (307)
T ss_pred CEEEeecCcEEecCceEECCC---------CCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCC
Confidence 334444555555678999998 54 555556668888887752 21 1 1112223344444555555
Q ss_pred CCCCCCCCEEEEEeCCC-cEEEEECCCCcEEEEecCCCCCcEEEEEc-CCCCEEEEEEcC
Q 000473 633 QTEHPWSDCFLSVGEDF-SVALASLETLRVERMFPGHPNYPAKVVWD-CPRGYIACLCRD 690 (1471)
Q Consensus 633 ~~~~~~~~~l~S~s~Dg-sV~lWdl~t~~~l~~~~gh~~~V~~v~~s-pdg~~L~sgs~D 690 (1471)
|++.+++-.++ .|..|+.+ |+.+..+..+...+++++|- |+.+.|.+.+..
T Consensus 224 ------G~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP~~~~t~~~FgG~~~~~L~iTs~~ 276 (307)
T COG3386 224 ------GNLWVAAVWGGGRVVRFNPD-GKLLGEIKLPVKRPTNPAFGGPDLNTLYITSAR 276 (307)
T ss_pred ------CCEEEecccCCceEEEECCC-CcEEEEEECCCCCCccceEeCCCcCEEEEEecC
Confidence 66665444444 89999988 99998888777788899994 455555555443
No 374
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=68.35 E-value=10 Score=30.03 Aligned_cols=29 Identities=17% Similarity=0.130 Sum_probs=22.8
Q ss_pred ccceEEEEECCCCCeEEEEecCC--cEEEEE
Q 000473 1370 TTVISALIFSPDGEGLVAFSEHG--LMIRWW 1398 (1471)
Q Consensus 1370 ~~~i~a~~fs~dg~~l~~~s~~~--~~~~~w 1398 (1471)
.+...+-+||||||+|+=.|..+ +..-+|
T Consensus 8 ~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 8 PGDDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SSSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred CccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 45678899999999999988887 666665
No 375
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=67.86 E-value=1.8e+02 Score=36.16 Aligned_cols=52 Identities=12% Similarity=0.080 Sum_probs=37.6
Q ss_pred CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC-CCCceeeeee
Q 000473 669 PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT-ASHSMFDHFC 728 (1471)
Q Consensus 669 ~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH-~~~v~~~~~~ 728 (1471)
.+++..++.||++++++.-..+ |.+.|....-.+.+..+.-. ......+.+|
T Consensus 216 ~~~i~~iavSpng~~iAl~t~~--------g~l~v~ssDf~~~~~e~~~~~~~~p~~~~WC 268 (410)
T PF04841_consen 216 DGPIIKIAVSPNGKFIALFTDS--------GNLWVVSSDFSEKLCEFDTDSKSPPKQMAWC 268 (410)
T ss_pred CCCeEEEEECCCCCEEEEEECC--------CCEEEEECcccceeEEeecCcCCCCcEEEEE
Confidence 3579999999999999888777 89999887656665555544 2233445577
No 376
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=67.12 E-value=96 Score=35.18 Aligned_cols=102 Identities=8% Similarity=0.018 Sum_probs=65.4
Q ss_pred EEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE--CCCCceE-----EEEecc---CC-CEEEEEECCCCCCCCCCCEE
Q 000473 574 LCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD--LGSGNLI-----TVMHHH---VA-PVRQIILSPPQTEHPWSDCF 642 (1471)
Q Consensus 574 ~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD--l~tg~~l-----~~~~~H---~~-~V~~l~fspd~~~~~~~~~l 642 (1471)
+.++|+.++ ..+-..-+.+.+|.-|| ..+|... ..++.. .. .--.+++.-+ |+++
T Consensus 161 Ngl~Wd~d~--------K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~e------G~L~ 226 (310)
T KOG4499|consen 161 NGLAWDSDA--------KKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTE------GNLY 226 (310)
T ss_pred ccccccccC--------cEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccC------CcEE
Confidence 456676552 45666777777887777 5555432 111110 00 0011222223 7888
Q ss_pred EEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcC-CCCEEEEEEc
Q 000473 643 LSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDC-PRGYIACLCR 689 (1471)
Q Consensus 643 ~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~sp-dg~~L~sgs~ 689 (1471)
+++-.-++|...|..+|+.+.++.-....|++++|-. +-..|++.+.
T Consensus 227 Va~~ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~a 274 (310)
T KOG4499|consen 227 VATFNGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTTA 274 (310)
T ss_pred EEEecCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEeh
Confidence 8888899999999999999999988888999999953 3345555443
No 377
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=65.34 E-value=77 Score=35.55 Aligned_cols=143 Identities=15% Similarity=0.166 Sum_probs=69.2
Q ss_pred HHHHHHHHHHHhcCcchhHHHHHHHHHhhHhhcccccccchhhhhhhhhhhhhhccccccccCCCCCCchhhhHHHHHHH
Q 000473 1144 LVVQPLIKLVMATNEKYSSTAAELLAEGMESTWKTCIGFEIPRLIGDIFFQIECVSNSSANLAGQHPAVPASIRETLVGI 1223 (1471)
Q Consensus 1144 ~~~~~l~~ll~~~~~~~~~~ai~l~~~gf~~~w~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~ 1223 (1471)
.+...|...+.+.++.+..+|..++++=+. .=+..+++.+..++.-++..+. . + -...|++ |..
T Consensus 53 ~~~~~i~~~l~d~Rs~v~~~A~~~l~~l~~-~l~~~~~~~~~~~l~~Ll~~~~-----------~--~-~~~i~~~-a~~ 116 (228)
T PF12348_consen 53 QLLDAIIKQLSDLRSKVSKTACQLLSDLAR-QLGSHFEPYADILLPPLLKKLG-----------D--S-KKFIREA-ANN 116 (228)
T ss_dssp ---HHHHH-S-HH---HHHHHHHHHHHHHH-HHGGGGHHHHHHHHHHHHHGGG-----------------HHHHHH-HHH
T ss_pred HhHHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHhHhHHHHHHHHHHHHHHHHc-----------c--c-cHHHHHH-HHH
Confidence 344667777777889999999988876554 3344443222233332332221 0 0 0122332 666
Q ss_pred HhHhHHhcCh--hHH-HHHHHHHHhhcCCCC-ccchhhHHHHHHHHhCCh---hHHHH--hHHHHHHHhhhhcCCCChhh
Q 000473 1224 LLPSLAMADI--LGF-LTVVESQIWSTASDS-PVHLVSIMTIIRVVRGSP---RNVAQ--HLDKVVNFILQTMDPGNSVM 1294 (1471)
Q Consensus 1224 ~l~~ia~~~~--~~f-~~~~~~~i~~~~~~~-~~~~~~~~~l~~~i~~~p---~~~~~--~l~~~~~~~~~~lDp~~~~~ 1294 (1471)
+|..|...-+ +.. ...+.... ...+ .+...++..|..++++.| ..+.. .++.++..+.++|.=.++..
T Consensus 117 ~L~~i~~~~~~~~~~~~~~l~~~~---~~Kn~~vR~~~~~~l~~~l~~~~~~~~~l~~~~~~~~l~~~l~~~l~D~~~~V 193 (228)
T PF12348_consen 117 ALDAIIESCSYSPKILLEILSQGL---KSKNPQVREECAEWLAIILEKWGSDSSVLQKSAFLKQLVKALVKLLSDADPEV 193 (228)
T ss_dssp HHHHHHTTS-H--HHHHHHHHHHT---T-S-HHHHHHHHHHHHHHHTT-----GGG--HHHHHHHHHHHHHHHTSS-HHH
T ss_pred HHHHHHHHCCcHHHHHHHHHHHHH---hCCCHHHHHHHHHHHHHHHHHccchHhhhcccchHHHHHHHHHHHCCCCCHHH
Confidence 7777777766 333 22222211 1222 333445555667888888 33333 46899999999997777777
Q ss_pred hhhhhhHHHHHH
Q 000473 1295 RKTCLHTSMAAL 1306 (1471)
Q Consensus 1295 r~~~l~~~~~~l 1306 (1471)
|+..-. ++..+
T Consensus 194 R~~Ar~-~~~~l 204 (228)
T PF12348_consen 194 REAARE-CLWAL 204 (228)
T ss_dssp HHHHHH-HHHHH
T ss_pred HHHHHH-HHHHH
Confidence 765443 33344
No 378
>PRK13616 lipoprotein LpqB; Provisional
Probab=64.27 E-value=1.2e+02 Score=39.62 Aligned_cols=98 Identities=11% Similarity=0.026 Sum_probs=57.6
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEEEC------------CCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCC
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSGSM------------DCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSD 640 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SGs~------------DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~ 640 (1471)
.+.-.|+|+ +..+.+.+. .+.+.+.++..++... ...+.|..+.|+|| |.
T Consensus 399 ~t~PsWspD---------G~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~---~~~g~Issl~wSpD------G~ 460 (591)
T PRK13616 399 LTRPSWSLD---------ADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS---RVPGPISELQLSRD------GV 460 (591)
T ss_pred CCCceECCC---------CCceEEEecCcceEEEeccCCCceEEEEeccCchhhh---ccCCCcCeEEECCC------CC
Confidence 677788887 555544432 2233333554444322 33567999999999 88
Q ss_pred EEEEEeCCCcEEE---EECCCCcE-E---EEe-cCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 641 CFLSVGEDFSVAL---ASLETLRV-E---RMF-PGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 641 ~l~S~s~DgsV~l---Wdl~t~~~-l---~~~-~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
.++-.. ++.|.+ -....|.. + +.+ .+-...+..+.|.+++.++ ++..+
T Consensus 461 RiA~i~-~g~v~Va~Vvr~~~G~~~l~~~~~l~~~l~~~~~~l~W~~~~~L~-V~~~~ 516 (591)
T PRK13616 461 RAAMII-GGKVYLAVVEQTEDGQYALTNPREVGPGLGDTAVSLDWRTGDSLV-VGRSD 516 (591)
T ss_pred EEEEEE-CCEEEEEEEEeCCCCceeecccEEeecccCCccccceEecCCEEE-EEecC
Confidence 777765 467766 44444541 1 112 2233346889999998854 55544
No 379
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=64.16 E-value=3e+02 Score=37.19 Aligned_cols=61 Identities=15% Similarity=0.160 Sum_probs=40.9
Q ss_pred CCcEEEEECCCCcEEEEecC-CCC--------CcEEEEEcC-CCC---EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 648 DFSVALASLETLRVERMFPG-HPN--------YPAKVVWDC-PRG---YIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 648 DgsV~lWdl~t~~~l~~~~g-h~~--------~V~~v~~sp-dg~---~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
.++|.=.|.++|+..-.++. |.+ ...-+.+.- +|+ .++.+..+ |.+++.|-+||+.+..
T Consensus 413 ~~slvALD~~TGk~~W~~Q~~~hD~WD~D~~~~p~L~d~~~~~G~~~~~v~~~~K~--------G~~~vlDr~tG~~l~~ 484 (764)
T TIGR03074 413 SSSLVALDATTGKERWVFQTVHHDLWDMDVPAQPSLVDLPDADGTTVPALVAPTKQ--------GQIYVLDRRTGEPIVP 484 (764)
T ss_pred cceEEEEeCCCCceEEEecccCCccccccccCCceEEeeecCCCcEeeEEEEECCC--------CEEEEEECCCCCEEee
Confidence 35677788999998877764 221 111122322 453 77788777 9999999999998765
Q ss_pred Ee
Q 000473 715 LR 716 (1471)
Q Consensus 715 l~ 716 (1471)
..
T Consensus 485 ~~ 486 (764)
T TIGR03074 485 VE 486 (764)
T ss_pred ce
Confidence 43
No 380
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=63.69 E-value=2.8e+02 Score=31.93 Aligned_cols=146 Identities=13% Similarity=0.158 Sum_probs=82.2
Q ss_pred EEEEEcCCcEEEEEecccccCCCCCCccccCCc-ceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEE
Q 000473 528 IVYGFFSGEIEVIQFDLFERHNSPGASLKVNSH-VSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIW 606 (1471)
Q Consensus 528 lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~-~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lW 606 (1471)
++++ ....|.++.|.. +.... +..+.+.- .+.+.++.|.. +.++.|..+ ...+-
T Consensus 108 L~va-~kk~i~i~~~~~-----------~~~~f~~~~ke~~l-p~~~~~i~~~~-----------~~i~v~~~~-~f~~i 162 (275)
T PF00780_consen 108 LCVA-VKKKILIYEWND-----------PRNSFSKLLKEISL-PDPPSSIAFLG-----------NKICVGTSK-GFYLI 162 (275)
T ss_pred EEEE-ECCEEEEEEEEC-----------CcccccceeEEEEc-CCCcEEEEEeC-----------CEEEEEeCC-ceEEE
Confidence 4443 344888888762 00111 23333332 36688888873 456666544 47788
Q ss_pred ECCCCceEEEEecc------------CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEE--EecCCCCCc
Q 000473 607 DLGSGNLITVMHHH------------VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVER--MFPGHPNYP 672 (1471)
Q Consensus 607 Dl~tg~~l~~~~~H------------~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~--~~~gh~~~V 672 (1471)
|+.++.....+... ..++..+..+ + +.++++ .|..-.+.|.. |+..+ .+. -...+
T Consensus 163 dl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~------~e~Ll~--~~~~g~fv~~~-G~~~r~~~i~-W~~~p 231 (275)
T PF00780_consen 163 DLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLS-D------NEFLLC--YDNIGVFVNKN-GEPSRKSTIQ-WSSAP 231 (275)
T ss_pred ecCCCCceEEeCccCCcchhhhcccCCCceEEEEeC-C------ceEEEE--ecceEEEEcCC-CCcCcccEEE-cCCch
Confidence 99877654443221 1233333332 2 344443 24444444543 44333 222 23355
Q ss_pred EEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCC
Q 000473 673 AKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTAS 720 (1471)
Q Consensus 673 ~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~ 720 (1471)
..+++. ..||++-+.+ .|.||++.+|++++++.++..
T Consensus 232 ~~~~~~--~pyli~~~~~---------~iEV~~~~~~~lvQ~i~~~~~ 268 (275)
T PF00780_consen 232 QSVAYS--SPYLIAFSSN---------SIEVRSLETGELVQTIPLPNI 268 (275)
T ss_pred hEEEEE--CCEEEEECCC---------EEEEEECcCCcEEEEEECCCE
Confidence 666663 4588876654 799999999999999987763
No 381
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=62.29 E-value=3.4e+02 Score=32.47 Aligned_cols=108 Identities=9% Similarity=0.132 Sum_probs=70.6
Q ss_pred EEEECC-CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC-CCcEEEEECCC--C----c-EEEEecCCCCCcEE
Q 000473 604 RIWDLG-SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE-DFSVALASLET--L----R-VERMFPGHPNYPAK 674 (1471)
Q Consensus 604 ~lWDl~-tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~-DgsV~lWdl~t--~----~-~l~~~~gh~~~V~~ 674 (1471)
.+|-+. .+...+.+..|-..-+.|+|+|| ++.+..+.. .+.|.-+++.. + + ....+..+.+..-.
T Consensus 144 ~lyr~~p~g~~~~l~~~~~~~~NGla~SpD------g~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG 217 (307)
T COG3386 144 SLYRVDPDGGVVRLLDDDLTIPNGLAFSPD------GKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDG 217 (307)
T ss_pred eEEEEcCCCCEEEeecCcEEecCceEECCC------CCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCc
Confidence 355544 46666666666666678999999 766655544 47777777752 1 1 12233334556677
Q ss_pred EEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceee
Q 000473 675 VVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFD 725 (1471)
Q Consensus 675 v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~ 725 (1471)
++...+|.+.+++..+ .+.|.+|+.+ |+++..+.-+...+.+.
T Consensus 218 ~~vDadG~lw~~a~~~-------g~~v~~~~pd-G~l~~~i~lP~~~~t~~ 260 (307)
T COG3386 218 MAVDADGNLWVAAVWG-------GGRVVRFNPD-GKLLGEIKLPVKRPTNP 260 (307)
T ss_pred eEEeCCCCEEEecccC-------CceEEEECCC-CcEEEEEECCCCCCccc
Confidence 8888888887654443 1389999988 99999988775444443
No 382
>KOG2956 consensus CLIP-associating protein [General function prediction only]
Probab=62.20 E-value=39 Score=41.64 Aligned_cols=76 Identities=20% Similarity=0.254 Sum_probs=56.2
Q ss_pred HHHHhHhHHhcChhHHHHHHHHHHhhcCCCCccchhhHHHHHHHHhCC-hhHHHHhHHHHHHHhhhhcCCCChhhhhhh
Q 000473 1221 VGILLPSLAMADILGFLTVVESQIWSTASDSPVHLVSIMTIIRVVRGS-PRNVAQHLDKVVNFILQTMDPGNSVMRKTC 1298 (1471)
Q Consensus 1221 ~~~~l~~ia~~~~~~f~~~~~~~i~~~~~~~~~~~~~~~~l~~~i~~~-p~~~~~~l~~~~~~~~~~lDp~~~~~r~~~ 1298 (1471)
+.-++..+|+..|..=|..|...|.. .+.+.....+..+-+++++- ...+.+.|+.++=.+++..|-.....|+.+
T Consensus 392 eed~~~~las~~P~~~I~~i~~~Ilt--~D~~~~~~~iKm~Tkl~e~l~~EeL~~ll~diaP~~iqay~S~SS~VRKta 468 (516)
T KOG2956|consen 392 EEDCLTTLASHLPLQCIVNISPLILT--ADEPRAVAVIKMLTKLFERLSAEELLNLLPDIAPCVIQAYDSTSSTVRKTA 468 (516)
T ss_pred HHHHHHHHHhhCchhHHHHHhhHHhc--CcchHHHHHHHHHHHHHhhcCHHHHHHhhhhhhhHHHHHhcCchHHhhhhH
Confidence 44556678899999999999988854 45554434444455777554 457888899999999999998877777754
No 383
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=61.41 E-value=26 Score=41.93 Aligned_cols=62 Identities=16% Similarity=0.113 Sum_probs=44.3
Q ss_pred cEEEEECCCCcEEEEecCCCCCcEEEEEcCCCC-EEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCC
Q 000473 650 SVALASLETLRVERMFPGHPNYPAKVVWDCPRG-YIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 650 sV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
.|-++|+.+++.+..++. ..++.+|..+-+.+ +|++.+.. ++.+.|||..||++++++.+-.
T Consensus 270 eVWv~D~~t~krv~Ri~l-~~~~~Si~Vsqd~~P~L~~~~~~-------~~~l~v~D~~tGk~~~~~~~lG 332 (342)
T PF06433_consen 270 EVWVYDLKTHKRVARIPL-EHPIDSIAVSQDDKPLLYALSAG-------DGTLDVYDAATGKLVRSIEQLG 332 (342)
T ss_dssp EEEEEETTTTEEEEEEEE-EEEESEEEEESSSS-EEEEEETT-------TTEEEEEETTT--EEEEE---S
T ss_pred EEEEEECCCCeEEEEEeC-CCccceEEEccCCCcEEEEEcCC-------CCeEEEEeCcCCcEEeehhccC
Confidence 477889999999988874 23577899988876 56555442 2899999999999999988544
No 384
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=61.19 E-value=28 Score=28.94 Aligned_cols=30 Identities=13% Similarity=0.099 Sum_probs=22.5
Q ss_pred CcEEEEEcCCCC---EEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 671 YPAKVVWDCPRG---YIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 671 ~V~~v~~spdg~---~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
.|.+++|+|+.. +|+.+-.- |.|.|+|+++
T Consensus 2 AvR~~kFsP~~~~~DLL~~~E~~--------g~vhi~D~R~ 34 (43)
T PF10313_consen 2 AVRCCKFSPEPGGNDLLAWAEHQ--------GRVHIVDTRS 34 (43)
T ss_pred CeEEEEeCCCCCcccEEEEEccC--------CeEEEEEccc
Confidence 578999998554 56554433 8999999995
No 385
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=60.31 E-value=1.4e+02 Score=38.43 Aligned_cols=60 Identities=12% Similarity=-0.062 Sum_probs=39.8
Q ss_pred CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 649 FSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 649 gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
+.+.=+|+.+++.+-..+......... ..-.+..++.+..| |.++.+|.+||+.+-..+-
T Consensus 441 g~l~AiD~~tGk~~W~~~~~~p~~~~~-l~t~g~lvf~g~~~--------G~l~a~D~~TGe~lw~~~~ 500 (527)
T TIGR03075 441 GSLIAWDPITGKIVWEHKEDFPLWGGV-LATAGDLVFYGTLE--------GYFKAFDAKTGEELWKFKT 500 (527)
T ss_pred eeEEEEeCCCCceeeEecCCCCCCCcc-eEECCcEEEEECCC--------CeEEEEECCCCCEeEEEeC
Confidence 567788999998877665322111111 11244566667667 9999999999999876653
No 386
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear core complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=60.09 E-value=33 Score=44.25 Aligned_cols=40 Identities=23% Similarity=0.344 Sum_probs=31.5
Q ss_pred CcEEEEEcC----CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCC
Q 000473 671 YPAKVVWDC----PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGT 718 (1471)
Q Consensus 671 ~V~~v~~sp----dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH 718 (1471)
.+..+++++ +..+|++-|.| +++|+||+.+++++++..-.
T Consensus 216 ~~~~~~~~~~~~~~~~~l~tl~~D--------~~LRiW~l~t~~~~~~~~~~ 259 (547)
T PF11715_consen 216 VAASLAVSSSEINDDTFLFTLSRD--------HTLRIWSLETGQCLATIDLL 259 (547)
T ss_dssp -EEEEEE-----ETTTEEEEEETT--------SEEEEEETTTTCEEEEEETT
T ss_pred ccceEEEecceeCCCCEEEEEeCC--------CeEEEEECCCCeEEEEeccc
Confidence 345666766 78899999999 99999999999998876544
No 387
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=59.75 E-value=21 Score=29.62 Aligned_cols=34 Identities=24% Similarity=0.320 Sum_probs=28.9
Q ss_pred cceEEEEECCCCC--eEEEEecCCcEEEEEecCcch
Q 000473 1371 TVISALIFSPDGE--GLVAFSEHGLMIRWWSLGSVW 1404 (1471)
Q Consensus 1371 ~~i~a~~fs~dg~--~l~~~s~~~~~~~~w~~~~~~ 1404 (1471)
++|-++.|||+.- .|-+++++-+.+-++.+-++|
T Consensus 1 GAvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~~f 36 (43)
T PF10313_consen 1 GAVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRSNF 36 (43)
T ss_pred CCeEEEEeCCCCCcccEEEEEccCCeEEEEEcccCc
Confidence 4788999998755 899999999999999986544
No 388
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=59.45 E-value=1.2e+02 Score=38.61 Aligned_cols=118 Identities=15% Similarity=0.180 Sum_probs=67.9
Q ss_pred ccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECC-----CCceEEEEeccCC-C--E--EEEEECCCCCCCCCCC
Q 000473 571 GAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLG-----SGNLITVMHHHVA-P--V--RQIILSPPQTEHPWSD 640 (1471)
Q Consensus 571 ~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~-----tg~~l~~~~~H~~-~--V--~~l~fspd~~~~~~~~ 640 (1471)
..|..+.|.|. +..+..-|+.--....|.||.+. +.+.+..-..+-+ + | ..+.|+|. ..
T Consensus 57 EhV~GlsW~P~-----~~~~~paLLAVQHkkhVtVWqL~~s~~e~~K~l~sQtcEi~e~~pvLpQGCVWHPk------~~ 125 (671)
T PF15390_consen 57 EHVHGLSWAPP-----CTADTPALLAVQHKKHVTVWQLCPSTTERNKLLMSQTCEIREPFPVLPQGCVWHPK------KA 125 (671)
T ss_pred ceeeeeeecCc-----ccCCCCceEEEeccceEEEEEeccCccccccceeeeeeeccCCcccCCCcccccCC------Cc
Confidence 34899999885 21223355556677889999986 2333333222211 1 1 23456665 66
Q ss_pred EEEEEeCCCcEEEEECCC--CcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 641 CFLSVGEDFSVALASLET--LRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 641 ~l~S~s~DgsV~lWdl~t--~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
.++.-..+..--+.+++. .+....+ .-.+.|.|.+|.+||+.|+++-.. + =.-||||=.
T Consensus 126 iL~VLT~~dvSV~~sV~~d~srVkaDi-~~~G~IhCACWT~DG~RLVVAvGS----s---LHSyiWd~~ 186 (671)
T PF15390_consen 126 ILTVLTARDVSVLPSVHCDSSRVKADI-KTSGLIHCACWTKDGQRLVVAVGS----S---LHSYIWDSA 186 (671)
T ss_pred eEEEEecCceeEeeeeeeCCceEEEec-cCCceEEEEEecCcCCEEEEEeCC----e---EEEEEecCc
Confidence 555444333333455543 2333334 345669999999999999876543 1 246899854
No 389
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=59.44 E-value=3.7e+02 Score=31.96 Aligned_cols=112 Identities=10% Similarity=0.111 Sum_probs=67.5
Q ss_pred CccEEEEEeeccccccCCEEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccC
Q 000473 509 EKIVSSSMVISESFYAPYAIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGW 588 (1471)
Q Consensus 509 ~~~Vts~~~is~~~f~P~~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~ 588 (1471)
.+.|+++... .. .+++|. ++.|.++.|+. .++-.....+..+ -.|+++...
T Consensus 88 ~g~V~ai~~~--~~----~lv~~~-g~~l~v~~l~~------------~~~l~~~~~~~~~-~~i~sl~~~--------- 138 (321)
T PF03178_consen 88 KGPVTAICSF--NG----RLVVAV-GNKLYVYDLDN------------SKTLLKKAFYDSP-FYITSLSVF--------- 138 (321)
T ss_dssp SS-EEEEEEE--TT----EEEEEE-TTEEEEEEEET------------TSSEEEEEEE-BS-SSEEEEEEE---------
T ss_pred cCcceEhhhh--CC----EEEEee-cCEEEEEEccC------------cccchhhheecce-EEEEEEecc---------
Confidence 4568877644 22 455544 68999977761 0011222223332 367777654
Q ss_pred cCCCEEEEEECCCcEEEEECCC-CceEEEEec--cCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC
Q 000473 589 SFNEVLVSGSMDCSIRIWDLGS-GNLITVMHH--HVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE 657 (1471)
Q Consensus 589 ~~~~~L~SGs~DgtI~lWDl~t-g~~l~~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~ 657 (1471)
+++++.|.....+.++..+. ...+..+.. ....++++.|-++ ++.++.+..+|.+.++...
T Consensus 139 --~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d------~~~~i~~D~~gnl~~l~~~ 202 (321)
T PF03178_consen 139 --KNYILVGDAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVD------EDTIIVGDKDGNLFVLRYN 202 (321)
T ss_dssp --TTEEEEEESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-S------SSEEEEEETTSEEEEEEE-
T ss_pred --ccEEEEEEcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecC------CcEEEEEcCCCeEEEEEEC
Confidence 57899999888888774432 332333322 3445888888877 5699999999999999875
No 390
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=58.87 E-value=3.2e+02 Score=31.18 Aligned_cols=92 Identities=10% Similarity=-0.021 Sum_probs=58.1
Q ss_pred EEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE--CCCCc-----EEEEecC----CCCCcEEEEEcCCCCEEEEEEcCCC
Q 000473 624 VRQIILSPPQTEHPWSDCFLSVGEDFSVALAS--LETLR-----VERMFPG----HPNYPAKVVWDCPRGYIACLCRDHS 692 (1471)
Q Consensus 624 V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd--l~t~~-----~l~~~~g----h~~~V~~v~~spdg~~L~sgs~D~s 692 (1471)
-..++|+.+. ......-+.+.+|.-|| ..+|. .+..+.- .....-.++...+|...++.-..
T Consensus 160 sNgl~Wd~d~-----K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng-- 232 (310)
T KOG4499|consen 160 SNGLAWDSDA-----KKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNG-- 232 (310)
T ss_pred CccccccccC-----cEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecC--
Confidence 3566776652 23445567788898888 55553 2222221 11123445566666655544333
Q ss_pred CCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeeee
Q 000473 693 RTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHFC 728 (1471)
Q Consensus 693 g~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~~ 728 (1471)
++|...|..||+.+.++.-...+++++.|.
T Consensus 233 ------~~V~~~dp~tGK~L~eiklPt~qitsccFg 262 (310)
T KOG4499|consen 233 ------GTVQKVDPTTGKILLEIKLPTPQITSCCFG 262 (310)
T ss_pred ------cEEEEECCCCCcEEEEEEcCCCceEEEEec
Confidence 899999999999999998888777776444
No 391
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=58.01 E-value=1.1e+02 Score=38.28 Aligned_cols=148 Identities=14% Similarity=0.091 Sum_probs=84.0
Q ss_pred CEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCcccccccccccccccccccCC
Q 000473 103 GALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLVSEDKEV 182 (1471)
Q Consensus 103 ~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~~~d~~~ 182 (1471)
-.|.++.+-..|.=-|+..|+.+...+... ...+..++++.... ++ +.+.
T Consensus 347 lil~~~~~~~~l~klDIE~GKIVeEWk~~~----di~mv~~t~d~K~~----------Ql----------------~~e~ 396 (644)
T KOG2395|consen 347 LILMDGGEQDKLYKLDIERGKIVEEWKFED----DINMVDITPDFKFA----------QL----------------TSEQ 396 (644)
T ss_pred eEeeCCCCcCcceeeecccceeeeEeeccC----CcceeeccCCcchh----------cc----------------cccc
Confidence 344566666778888999999999988762 13344444443210 00 0111
Q ss_pred CCCCCCCceEEEEeCcceEE--EEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCCCCcccccCCC
Q 000473 183 PMKNPPKCTLVIVDTYGLTI--VQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKESHLDREEGNG 260 (1471)
Q Consensus 183 ~~~~~~~~~I~v~D~~t~~~--l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~~~~~~~~~~~ 260 (1471)
-..+-.+..|+.|||+-... +....+ +......++-+|... .+ +++++|+.+|-||++|-....+
T Consensus 397 TlvGLs~n~vfriDpRv~~~~kl~~~q~-kqy~~k~nFsc~aTT-~s---G~IvvgS~~GdIRLYdri~~~A-------- 463 (644)
T KOG2395|consen 397 TLVGLSDNSVFRIDPRVQGKNKLAVVQS-KQYSTKNNFSCFATT-ES---GYIVVGSLKGDIRLYDRIGRRA-------- 463 (644)
T ss_pred cEEeecCCceEEecccccCcceeeeeec-cccccccccceeeec-CC---ceEEEeecCCcEEeehhhhhhh--------
Confidence 12233356788888874222 222222 222223333343321 12 5799999999999999643210
Q ss_pred cccCCCcccceeccCCcccCceEEEEecCCcEEEEEeCCeEEE
Q 000473 261 LCKSSSQLDMAILQNGVVEGGHLVSVATCGNIIALVLKDHCIF 303 (1471)
Q Consensus 261 l~~~e~~i~~v~~~~~~~~~~~~vs~s~~g~~l~~~~~~~~~~ 303 (1471)
|.. ..++...+.-|.++.+|+.|+..+.+..++
T Consensus 464 -----KTA-----lPgLG~~I~hVdvtadGKwil~Tc~tyLlL 496 (644)
T KOG2395|consen 464 -----KTA-----LPGLGDAIKHVDVTADGKWILATCKTYLLL 496 (644)
T ss_pred -----hhc-----ccccCCceeeEEeeccCcEEEEecccEEEE
Confidence 011 233445567788889999998888876543
No 392
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=55.27 E-value=1.8e+02 Score=39.25 Aligned_cols=125 Identities=12% Similarity=0.029 Sum_probs=75.5
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCC--------EEEEEECCC--CC--------CCCCCCEEEEEeCCCcEE
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP--------VRQIILSPP--QT--------EHPWSDCFLSVGEDFSVA 652 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~--------V~~l~fspd--~~--------~~~~~~~l~S~s~DgsV~ 652 (1471)
+..+..++.++.|.-.|..+|+.+.++...... ...+.+-.. .. ....+..+..++.|+.+.
T Consensus 194 gg~lYv~t~~~~V~ALDa~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~Dg~Li 273 (764)
T TIGR03074 194 GDTLYLCTPHNKVIALDAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSDARLI 273 (764)
T ss_pred CCEEEEECCCCeEEEEECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCCCeEE
Confidence 466777778889999999999998887654321 122333111 00 001245777888899999
Q ss_pred EEECCCCcEEEEecCCCCCcE-------------EEEEcC--CCCEEEEEEcC--CCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 653 LASLETLRVERMFPGHPNYPA-------------KVVWDC--PRGYIACLCRD--HSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 653 lWdl~t~~~l~~~~gh~~~V~-------------~v~~sp--dg~~L~sgs~D--~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
-.|.++|+.+..|.. .+.|. .+.-.| .++.+++|+.. ..+....+|.|+-+|.+||+++-..
T Consensus 274 ALDA~TGk~~W~fg~-~G~vdl~~~~g~~~~g~~~~ts~P~V~~g~VIvG~~v~d~~~~~~~~G~I~A~Da~TGkl~W~~ 352 (764)
T TIGR03074 274 ALDADTGKLCEDFGN-NGTVDLTAGMGTTPPGYYYPTSPPLVAGTTVVIGGRVADNYSTDEPSGVIRAFDVNTGALVWAW 352 (764)
T ss_pred EEECCCCCEEEEecC-CCceeeecccCcCCCcccccccCCEEECCEEEEEecccccccccCCCcEEEEEECCCCcEeeEE
Confidence 999999999877642 11110 011112 14567777541 0000011389999999999998665
Q ss_pred e
Q 000473 716 R 716 (1471)
Q Consensus 716 ~ 716 (1471)
.
T Consensus 353 ~ 353 (764)
T TIGR03074 353 D 353 (764)
T ss_pred e
Confidence 4
No 393
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=53.50 E-value=1e+02 Score=38.25 Aligned_cols=99 Identities=9% Similarity=0.070 Sum_probs=55.8
Q ss_pred CCcEEEEECCCCceEEEEeccC--CCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEEC-CCCc----EEEEecCC----
Q 000473 600 DCSIRIWDLGSGNLITVMHHHV--APVRQIILSPPQTEHPWSDCFLSVGEDFSVALASL-ETLR----VERMFPGH---- 668 (1471)
Q Consensus 600 DgtI~lWDl~tg~~l~~~~~H~--~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl-~t~~----~l~~~~gh---- 668 (1471)
-.++.+||+.+.+.++++..-. .....|.|..+. ....-|+.+.-..+|-.|-- +.++ .+..++.-
T Consensus 221 G~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P---~~~~gFvg~aLss~i~~~~k~~~g~W~a~kVi~ip~~~v~~ 297 (461)
T PF05694_consen 221 GHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDP---DANYGFVGCALSSSIWRFYKDDDGEWAAEKVIDIPAKKVEG 297 (461)
T ss_dssp --EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SST---T--EEEEEEE--EEEEEEEE-ETTEEEEEEEEEE--EE--S
T ss_pred cCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCC---CccceEEEEeccceEEEEEEcCCCCeeeeEEEECCCcccCc
Confidence 3579999999999999987532 245667776551 11335666666666766644 3332 22222211
Q ss_pred -------------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 669 -------------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 669 -------------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
..-|+.|..|.|++||.+.|.- .|.|+-||+..
T Consensus 298 ~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~-------~GdvrqYDISD 343 (461)
T PF05694_consen 298 WILPEMLKPFGAVPPLITDILISLDDRFLYVSNWL-------HGDVRQYDISD 343 (461)
T ss_dssp S---GGGGGG-EE------EEE-TTS-EEEEEETT-------TTEEEEEE-SS
T ss_pred ccccccccccccCCCceEeEEEccCCCEEEEEccc-------CCcEEEEecCC
Confidence 2347899999999999999985 38999999975
No 394
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=53.50 E-value=1.2e+02 Score=34.04 Aligned_cols=62 Identities=19% Similarity=0.276 Sum_probs=43.8
Q ss_pred ecCCCC-ceEEeecCcCcccCceEEEEECccccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecCCcEE
Q 000473 1317 SLNDTS-TKLAVGDAIGDIKKASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEHGLMI 1395 (1471)
Q Consensus 1317 ~~~~~t-qrlavg~~~g~~~~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~~~~~ 1395 (1471)
.+|... ..|-|++. ...|.+|||. ..+++.+-.= | . -++|..+.++.-|.||||.=.+.+.-
T Consensus 22 ~~c~~g~d~Lfva~~-----g~~Vev~~l~-~~~~~~~~~F-----~-----T-v~~V~~l~y~~~GDYlvTlE~k~~~~ 84 (215)
T PF14761_consen 22 AVCCGGPDALFVAAS-----GCKVEVYDLE-QEECPLLCTF-----S-----T-VGRVLQLVYSEAGDYLVTLEEKNKRS 84 (215)
T ss_pred eeeccCCceEEEEcC-----CCEEEEEEcc-cCCCceeEEE-----c-----c-hhheeEEEeccccceEEEEEeecCCc
Confidence 455555 66766544 5679999999 3355555421 1 2 47899999999999999998776654
No 395
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=52.92 E-value=2.6e+02 Score=33.31 Aligned_cols=120 Identities=10% Similarity=0.095 Sum_probs=62.0
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDF 649 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dg 649 (1471)
.+.+..+...+| +++++.++.-....-||--...-...-..-...|..+.|.|+ +...+ ....+
T Consensus 144 ~gs~~~~~r~~d---------G~~vavs~~G~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~------~~lw~-~~~Gg 207 (302)
T PF14870_consen 144 SGSINDITRSSD---------GRYVAVSSRGNFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPD------GNLWM-LARGG 207 (302)
T ss_dssp ---EEEEEE-TT---------S-EEEEETTSSEEEEE-TT-SS-EEEE--SSS-EEEEEE-TT------S-EEE-EETTT
T ss_pred cceeEeEEECCC---------CcEEEEECcccEEEEecCCCccceEEccCccceehhceecCC------CCEEE-EeCCc
Confidence 466788777776 788887766666678885432222222234578999999998 66655 44888
Q ss_pred cEEEEEC-CCCcEEEE--ecC--CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 650 SVALASL-ETLRVERM--FPG--HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 650 sV~lWdl-~t~~~l~~--~~g--h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
.|++=+. ...+.... .+. -.-.+..++|.+++...++|+. |++++ ....|+--+..
T Consensus 208 ~~~~s~~~~~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~---------G~l~~-S~DgGktW~~~ 268 (302)
T PF14870_consen 208 QIQFSDDPDDGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGS---------GTLLV-STDGGKTWQKD 268 (302)
T ss_dssp EEEEEE-TTEEEEE---B-TTSS--S-EEEEEESSSS-EEEEEST---------T-EEE-ESSTTSS-EE-
T ss_pred EEEEccCCCCccccccccCCcccCceeeEEEEecCCCCEEEEeCC---------ccEEE-eCCCCccceEC
Confidence 8888772 22222211 111 1223789999999888876665 45544 33455544443
No 396
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=52.91 E-value=55 Score=40.38 Aligned_cols=75 Identities=19% Similarity=0.236 Sum_probs=50.5
Q ss_pred eeecCCCCceEEeecCcCcccCceEEEEECccccEEE-EEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecC--
Q 000473 1315 MVSLNDTSTKLAVGDAIGDIKKASIRVYDMQSVTKIK-VLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEH-- 1391 (1471)
Q Consensus 1315 ~v~~~~~tqrlavg~~~g~~~~~~i~~ydl~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~-- 1391 (1471)
..++.+..+|||++-..|-=..-+|.|+|++|++.+. .|+ ...-+.++|++||+.|.--...
T Consensus 128 ~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~---------------~~~~~~~~W~~d~~~~~y~~~~~~ 192 (414)
T PF02897_consen 128 GFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIE---------------NPKFSSVSWSDDGKGFFYTRFDED 192 (414)
T ss_dssp EEEETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEE---------------EEESEEEEECTTSSEEEEEECSTT
T ss_pred eeeECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCccc---------------ccccceEEEeCCCCEEEEEEeCcc
Confidence 5678899999999966543123569999999994433 233 1222349999999988655533
Q ss_pred --------CcEEEEEecCcch
Q 000473 1392 --------GLMIRWWSLGSVW 1404 (1471)
Q Consensus 1392 --------~~~~~~w~~~~~~ 1404 (1471)
...|..|++++..
T Consensus 193 ~~~~~~~~~~~v~~~~~gt~~ 213 (414)
T PF02897_consen 193 QRTSDSGYPRQVYRHKLGTPQ 213 (414)
T ss_dssp TSS-CCGCCEEEEEEETTS-G
T ss_pred cccccCCCCcEEEEEECCCCh
Confidence 4457888886654
No 397
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.85 E-value=2.7e+02 Score=32.68 Aligned_cols=103 Identities=18% Similarity=0.280 Sum_probs=58.8
Q ss_pred CCCcEEEEECCCCceEE--EEe--ccCCCEEEEEECCCCCCCCCCCEEEEEe-----CCCcEEEEECCCCcEEEEecC--
Q 000473 599 MDCSIRIWDLGSGNLIT--VMH--HHVAPVRQIILSPPQTEHPWSDCFLSVG-----EDFSVALASLETLRVERMFPG-- 667 (1471)
Q Consensus 599 ~DgtI~lWDl~tg~~l~--~~~--~H~~~V~~l~fspd~~~~~~~~~l~S~s-----~DgsV~lWdl~t~~~l~~~~g-- 667 (1471)
+.-++.+-|..+|+++. ++. .+.-+|..++.-++ |..++-+- .|.--.+=....++.+.-+..
T Consensus 199 MePSlvlld~atG~liekh~Lp~~l~~lSiRHld~g~d------gtvwfgcQy~G~~~d~ppLvg~~~~g~~l~~~~~pe 272 (366)
T COG3490 199 MEPSLVLLDAATGNLIEKHTLPASLRQLSIRHLDIGRD------GTVWFGCQYRGPRNDLPPLVGHFRKGEPLEFLDLPE 272 (366)
T ss_pred cCccEEEEeccccchhhhccCchhhhhcceeeeeeCCC------CcEEEEEEeeCCCccCCcceeeccCCCcCcccCCCH
Confidence 33456666666776653 333 34556777777776 54433331 122222222233444443322
Q ss_pred -----CCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 668 -----HPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 668 -----h~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
...+|-+|+.+-+..+++..+-. .+...+||..||..+..
T Consensus 273 e~~~~~anYigsiA~n~~~glV~lTSP~-------GN~~vi~da~tG~vv~~ 317 (366)
T COG3490 273 EQTAAFANYIGSIAANRRDGLVALTSPR-------GNRAVIWDAATGAVVSE 317 (366)
T ss_pred HHHHHHHhhhhheeecccCCeEEEecCC-------CCeEEEEEcCCCcEEec
Confidence 23467888988777777666554 26788999999987654
No 398
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=52.40 E-value=1e+02 Score=36.58 Aligned_cols=88 Identities=13% Similarity=0.189 Sum_probs=57.6
Q ss_pred CCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccccccccccc
Q 000473 101 DNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLVSEDK 180 (1471)
Q Consensus 101 d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~~~d~ 180 (1471)
++++-++-+..|++.-+|.++|+......+| |.+..+.++ |+++.+|.+.+.. ...|.-+. .++
T Consensus 212 dgrLwvldsgtGev~~vD~~~G~~e~Va~vp---G~~rGL~f~---G~llvVgmSk~R~-----~~~f~glp-----l~~ 275 (335)
T TIGR03032 212 QGKLWLLNSGRGELGYVDPQAGKFQPVAFLP---GFTRGLAFA---GDFAFVGLSKLRE-----SRVFGGLP-----IEE 275 (335)
T ss_pred CCeEEEEECCCCEEEEEcCCCCcEEEEEECC---CCCccccee---CCEEEEEeccccC-----CCCcCCCc-----hhh
Confidence 4555667777888999998888877777888 778877777 7777777752220 11121111 111
Q ss_pred CCCCCCCCCceEEEEeCcceEEEEEee
Q 000473 181 EVPMKNPPKCTLVIVDTYGLTIVQTVF 207 (1471)
Q Consensus 181 ~~~~~~~~~~~I~v~D~~t~~~l~tl~ 207 (1471)
+ .+...|.|.+.|..|+.++..+.
T Consensus 276 ~---l~~~~CGv~vidl~tG~vv~~l~ 299 (335)
T TIGR03032 276 R---LDALGCGVAVIDLNSGDVVHWLR 299 (335)
T ss_pred h---hhhhcccEEEEECCCCCEEEEEE
Confidence 1 12224999999999999887665
No 399
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is overexpressed in tumour cells [].
Probab=52.36 E-value=2e+02 Score=38.36 Aligned_cols=71 Identities=24% Similarity=0.300 Sum_probs=49.8
Q ss_pred CEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECC----------CCcEE---EEe--------cCCCCCcEEEEEcCC-
Q 000473 623 PVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLE----------TLRVE---RMF--------PGHPNYPAKVVWDCP- 680 (1471)
Q Consensus 623 ~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~----------t~~~l---~~~--------~gh~~~V~~v~~spd- 680 (1471)
.|..|.++|+ |..++-.|..+ |.|..+. .|+.. +.+ ..+...|..+.|+|.
T Consensus 86 ~v~~i~~n~~------g~~lal~G~~~-v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s 158 (717)
T PF10168_consen 86 EVHQISLNPT------GSLLALVGPRG-VVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWS 158 (717)
T ss_pred eEEEEEECCC------CCEEEEEcCCc-EEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCC
Confidence 5788899998 88888888755 4444442 11111 121 123447899999997
Q ss_pred --CCEEEEEEcCCCCCCCCCCEEEEEECCC
Q 000473 681 --RGYIACLCRDHSRTSDAVDVLFIWDVKT 708 (1471)
Q Consensus 681 --g~~L~sgs~D~sg~~D~~gtV~VWDi~t 708 (1471)
+.+|++-..| +++|+||+..
T Consensus 159 ~~~~~l~vLtsd--------n~lR~y~~~~ 180 (717)
T PF10168_consen 159 ESDSHLVVLTSD--------NTLRLYDISD 180 (717)
T ss_pred CCCCeEEEEecC--------CEEEEEecCC
Confidence 5899999998 9999999975
No 400
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=52.09 E-value=99 Score=38.41 Aligned_cols=95 Identities=6% Similarity=-0.063 Sum_probs=62.1
Q ss_pred EEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccCCC-EEEEEECCCCCC------
Q 000473 563 RQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAP-VRQIILSPPQTE------ 635 (1471)
Q Consensus 563 ~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~-V~~l~fspd~~~------ 635 (1471)
...|......+.++..+|. +++.+....-|.|.++|+.++..++.+++.... +.-+....+...
T Consensus 300 r~~l~D~~R~~~~i~~sP~---------~~laA~tDslGRV~LiD~~~~~vvrmWKGYRdAqc~wi~~~~~~~~~~~~~~ 370 (415)
T PF14655_consen 300 RFGLPDSKREGESICLSPS---------GRLAAVTDSLGRVLLIDVARGIVVRMWKGYRDAQCGWIEVPEEGDRDRSNSN 370 (415)
T ss_pred EEeeccCCceEEEEEECCC---------CCEEEEEcCCCcEEEEECCCChhhhhhccCccceEEEEEeeccccccccccc
Confidence 3445555556788999997 788888888899999999999988888876543 222221111000
Q ss_pred ----CCCCC--EEEEEeCCCcEEEEECCCCcEEEEec
Q 000473 636 ----HPWSD--CFLSVGEDFSVALASLETLRVERMFP 666 (1471)
Q Consensus 636 ----~~~~~--~l~S~s~DgsV~lWdl~t~~~l~~~~ 666 (1471)
.+... +++-+-.-|.|-||++++|..+..+.
T Consensus 371 ~~~~~~~~~l~LvIyaprRg~lEvW~~~~g~Rv~a~~ 407 (415)
T PF14655_consen 371 SPKSSSRFALFLVIYAPRRGILEVWSMRQGPRVAAFN 407 (415)
T ss_pred ccCCCCcceEEEEEEeccCCeEEEEecCCCCEEEEEE
Confidence 00011 22334567899999999888776664
No 401
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=50.39 E-value=84 Score=35.54 Aligned_cols=63 Identities=8% Similarity=-0.008 Sum_probs=46.7
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEe-------c-------CCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEE
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMF-------P-------GHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIW 704 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~-------~-------gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VW 704 (1471)
++++++...+|.+.+||+.+++++..- . .....|..+.++.+|.-|++-+ + |..|.|
T Consensus 22 ~~~Ll~iT~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt~~G~PiV~ls-n--------g~~y~y 92 (219)
T PF07569_consen 22 GSYLLAITSSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLTSNGVPIVTLS-N--------GDSYSY 92 (219)
T ss_pred CCEEEEEeCCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEcCCCCEEEEEe-C--------CCEEEe
Confidence 678999999999999999998765332 1 2445678888888887766543 4 678999
Q ss_pred ECCCCe
Q 000473 705 DVKTGA 710 (1471)
Q Consensus 705 Di~tg~ 710 (1471)
|..-+.
T Consensus 93 ~~~L~~ 98 (219)
T PF07569_consen 93 SPDLGC 98 (219)
T ss_pred ccccce
Confidence 875443
No 402
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. 
They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=50.02 E-value=63 Score=31.05 Aligned_cols=49 Identities=22% Similarity=0.318 Sum_probs=37.5
Q ss_pred CceEEEEECccccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecCCcEEEEEec
Q 000473 1336 KASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEHGLMIRWWSL 1400 (1471)
Q Consensus 1336 ~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~~~~~~~w~~ 1400 (1471)
-|.|+.||-+. +++... |.+ .-..+++|||+|+|-.-|.....|.+++.
T Consensus 35 ~~~Vvyyd~~~---~~~va~----g~~---------~aNGI~~s~~~k~lyVa~~~~~~I~vy~~ 83 (86)
T PF01731_consen 35 WGNVVYYDGKE---VKVVAS----GFS---------FANGIAISPDKKYLYVASSLAHSIHVYKR 83 (86)
T ss_pred CceEEEEeCCE---eEEeec----cCC---------CCceEEEcCCCCEEEEEeccCCeEEEEEe
Confidence 45699999764 555441 222 34678999999999999999999999975
No 403
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=49.99 E-value=6e+02 Score=31.61 Aligned_cols=56 Identities=14% Similarity=0.173 Sum_probs=44.1
Q ss_pred CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCC-CCCcEEEEEcCCCC
Q 000473 621 VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPGH-PNYPAKVVWDCPRG 682 (1471)
Q Consensus 621 ~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh-~~~V~~v~~spdg~ 682 (1471)
.+++..+++||+ ++.+|.-..+|.+.+.+..-.+.+..+... ...+..+.|..++.
T Consensus 216 ~~~i~~iavSpn------g~~iAl~t~~g~l~v~ssDf~~~~~e~~~~~~~~p~~~~WCG~da 272 (410)
T PF04841_consen 216 DGPIIKIAVSPN------GKFIALFTDSGNLWVVSSDFSEKLCEFDTDSKSPPKQMAWCGNDA 272 (410)
T ss_pred CCCeEEEEECCC------CCEEEEEECCCCEEEEECcccceeEEeecCcCCCCcEEEEECCCc
Confidence 468999999999 999999999999999987766666666544 34677888877654
No 404
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=48.53 E-value=50 Score=26.15 Aligned_cols=32 Identities=6% Similarity=-0.091 Sum_probs=25.7
Q ss_pred CCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 679 CPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 679 pdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
|++++|++++.. +++|.++|..+++.+..+.-
T Consensus 1 pd~~~lyv~~~~-------~~~v~~id~~~~~~~~~i~v 32 (42)
T TIGR02276 1 PDGTKLYVTNSG-------SNTVSVIDTATNKVIATIPV 32 (42)
T ss_pred CCCCEEEEEeCC-------CCEEEEEECCCCeEEEEEEC
Confidence 678888888765 27999999999988777654
No 405
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.34 E-value=2.9e+02 Score=32.46 Aligned_cols=86 Identities=10% Similarity=0.060 Sum_probs=59.7
Q ss_pred CCEEEEEECCCc-EEEEECCCCceEEE--------EeccCCCEEEEEECCCCCCCCCCCEEEEEeCC-----CcEEEEEC
Q 000473 591 NEVLVSGSMDCS-IRIWDLGSGNLITV--------MHHHVAPVRQIILSPPQTEHPWSDCFLSVGED-----FSVALASL 656 (1471)
Q Consensus 591 ~~~L~SGs~Dgt-I~lWDl~tg~~l~~--------~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D-----gsV~lWdl 656 (1471)
..-++-+-.-|+ ..++|....+...+ |.+|. .|+|| |.+|...-.| |.|-|||.
T Consensus 80 ~ravafARrPGtf~~vfD~~~~~~pv~~~s~~~RHfyGHG------vfs~d------G~~LYATEndfd~~rGViGvYd~ 147 (366)
T COG3490 80 PRAVAFARRPGTFAMVFDPNGAQEPVTLVSQEGRHFYGHG------VFSPD------GRLLYATENDFDPNRGVIGVYDA 147 (366)
T ss_pred cceEEEEecCCceEEEECCCCCcCcEEEecccCceeeccc------ccCCC------CcEEEeecCCCCCCCceEEEEec
Confidence 344444444444 34688877665443 44554 47888 8877765444 78999999
Q ss_pred CCC-cEEEEecCCCCCcEEEEEcCCCCEEEEEE
Q 000473 657 ETL-RVERMFPGHPNYPAKVVWDCPRGYIACLC 688 (1471)
Q Consensus 657 ~t~-~~l~~~~gh~~~V~~v~~spdg~~L~sgs 688 (1471)
+.+ ..+-+++.|.-..-.+.|.+||+.|+.+.
T Consensus 148 r~~fqrvgE~~t~GiGpHev~lm~DGrtlvvan 180 (366)
T COG3490 148 REGFQRVGEFSTHGIGPHEVTLMADGRTLVVAN 180 (366)
T ss_pred ccccceecccccCCcCcceeEEecCCcEEEEeC
Confidence 854 44677888888888999999999998764
No 406
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=47.51 E-value=2.7e+02 Score=32.47 Aligned_cols=105 Identities=19% Similarity=0.066 Sum_probs=63.2
Q ss_pred cCCCCEEEEEeCCC--eEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcCCCCeEEEEcceecccCCccccccccccccccc
Q 000473 99 SLDNGALISACTDG--VLCVWSRSSGHCRRRRKLPPWVGSPSVICTLPSNPRYVCIGCCFIDTNQLSDHHSFESVEGDLV 176 (1471)
Q Consensus 99 s~d~~~LaSas~DG--~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s~~~~ll~~G~~~id~~~~~~~h~~~~i~~~~~ 176 (1471)
..++.++-|.+.-| .|+.+|+.+|+.+...++|+.. -.-.|..+ ++ ++.-.
T Consensus 53 ~~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~-FgEGit~~-~d-~l~qL------------------------ 105 (264)
T PF05096_consen 53 LDDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRY-FGEGITIL-GD-KLYQL------------------------ 105 (264)
T ss_dssp EETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT---EEEEEEE-TT-EEEEE------------------------
T ss_pred cCCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccc-cceeEEEE-CC-EEEEE------------------------
Confidence 36677888998888 6999999999999888887321 01111111 11 11111
Q ss_pred ccccCCCCCCCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECCCC
Q 000473 177 SEDKEVPMKNPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPISKE 250 (1471)
Q Consensus 177 ~~d~~~~~~~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~~~ 250 (1471)
.+ .++...++|..+++.+.++.- ...+| .++ .|++ .+++.+.+.++..+|..+.
T Consensus 106 TW---------k~~~~f~yd~~tl~~~~~~~y--~~EGW--GLt-----~dg~--~Li~SDGS~~L~~~dP~~f 159 (264)
T PF05096_consen 106 TW---------KEGTGFVYDPNTLKKIGTFPY--PGEGW--GLT-----SDGK--RLIMSDGSSRLYFLDPETF 159 (264)
T ss_dssp ES---------SSSEEEEEETTTTEEEEEEE---SSS----EEE-----ECSS--CEEEE-SSSEEEEE-TTT-
T ss_pred Ee---------cCCeEEEEccccceEEEEEec--CCcce--EEE-----cCCC--EEEEECCccceEEECCccc
Confidence 01 137889999999999988764 12334 333 1443 4778888889999887655
No 407
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=47.04 E-value=5.1e+02 Score=32.38 Aligned_cols=54 Identities=11% Similarity=0.039 Sum_probs=38.4
Q ss_pred CCCCceEEEEeCcceEEEEEeecCccccCCeEEEEEeeecCCCCceeEEEEeCCCcEEEEECC
Q 000473 186 NPPKCTLVIVDTYGLTIVQTVFHGNLSIGPWKFMDVVSLGEDMGKHYGLMVDSVGRLQLVPIS 248 (1471)
Q Consensus 186 ~~~~~~I~v~D~~t~~~l~tl~s~~~s~~~i~~~~~~~~~~d~~~~~llvas~dG~V~vW~l~ 248 (1471)
..++|.+.+++-.+......+.. .+-|+|+.++.-. +.+++++++..+.-++..
T Consensus 151 QS~DG~L~~feqe~~~f~~~lp~-~llPgPl~Y~~~t--------Dsfvt~sss~~l~~Yky~ 204 (418)
T PF14727_consen 151 QSMDGSLSFFEQESFAFSRFLPD-FLLPGPLCYCPRT--------DSFVTASSSWTLECYKYQ 204 (418)
T ss_pred EecCceEEEEeCCcEEEEEEcCC-CCCCcCeEEeecC--------CEEEEecCceeEEEecHH
Confidence 34468888888776655555544 7789899887622 567777888888888764
No 408
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=46.53 E-value=2.5e+02 Score=36.14 Aligned_cols=124 Identities=10% Similarity=-0.027 Sum_probs=70.2
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCC-CEEEEEECC--CCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecC
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVA-PVRQIILSP--PQTEHPWSDCFLSVGEDFSVALASLETLRVERMFPG 667 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~-~V~~l~fsp--d~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~g 667 (1471)
+..++.++.++.|.-.|..+|+.+.++..... .+......+ .....-.+..++.++.|+.+.-+|.++|+.+..+..
T Consensus 69 ~g~vyv~s~~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~dg~l~ALDa~TGk~~W~~~~ 148 (527)
T TIGR03075 69 DGVMYVTTSYSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTLDARLVALDAKTGKVVWSKKN 148 (527)
T ss_pred CCEEEEECCCCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcCCCEEEEEECCCCCEEeeccc
Confidence 45666677788899999999998877654221 111100000 000001134667778899999999999999876542
Q ss_pred CCCC-cEEEEEcC--CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 668 HPNY-PAKVVWDC--PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 668 h~~~-V~~v~~sp--dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
.... -..+.-+| .+..++++... +-...+|.|+-+|.+||+.+-...
T Consensus 149 ~~~~~~~~~tssP~v~~g~Vivg~~~--~~~~~~G~v~AlD~~TG~~lW~~~ 198 (527)
T TIGR03075 149 GDYKAGYTITAAPLVVKGKVITGISG--GEFGVRGYVTAYDAKTGKLVWRRY 198 (527)
T ss_pred ccccccccccCCcEEECCEEEEeecc--cccCCCcEEEEEECCCCceeEecc
Confidence 1100 00111112 13456665431 001112899999999999876544
No 409
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=46.39 E-value=2.5e+02 Score=33.15 Aligned_cols=98 Identities=7% Similarity=0.070 Sum_probs=61.9
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe------CCCcEEEEECCCCcEEEEecCC-----C
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG------EDFSVALASLETLRVERMFPGH-----P 669 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s------~DgsV~lWdl~t~~~l~~~~gh-----~ 669 (1471)
..|++||..+.+....-.+-.+.|.++.|..+ .+.++.|. ....+..||+.+..- ..+.+- .
T Consensus 16 ~~lC~yd~~~~qW~~~g~~i~G~V~~l~~~~~------~~Llv~G~ft~~~~~~~~la~yd~~~~~w-~~~~~~~s~~ip 88 (281)
T PF12768_consen 16 PGLCLYDTDNSQWSSPGNGISGTVTDLQWASN------NQLLVGGNFTLNGTNSSNLATYDFKNQTW-SSLGGGSSNSIP 88 (281)
T ss_pred CEEEEEECCCCEeecCCCCceEEEEEEEEecC------CEEEEEEeeEECCCCceeEEEEecCCCee-eecCCcccccCC
Confidence 36999998877655544455678999999755 56666664 467789999987643 333332 3
Q ss_pred CCcEEEEEc-CCCC-EEEEEEcCCCCCCCCCCEEEEEECCCCe
Q 000473 670 NYPAKVVWD-CPRG-YIACLCRDHSRTSDAVDVLFIWDVKTGA 710 (1471)
Q Consensus 670 ~~V~~v~~s-pdg~-~L~sgs~D~sg~~D~~gtV~VWDi~tg~ 710 (1471)
++|..+.+. .|+. +.+.|... +++..|..||-.+..
T Consensus 89 gpv~a~~~~~~d~~~~~~aG~~~-----~g~~~l~~~dGs~W~ 126 (281)
T PF12768_consen 89 GPVTALTFISNDGSNFWVAGRSA-----NGSTFLMKYDGSSWS 126 (281)
T ss_pred CcEEEEEeeccCCceEEEeceec-----CCCceEEEEcCCceE
Confidence 678888773 2333 44444321 223568888755443
No 410
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=46.07 E-value=2e+02 Score=32.37 Aligned_cols=102 Identities=19% Similarity=0.300 Sum_probs=66.9
Q ss_pred CEEEEEECCCcEEEEECC--CCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCC------CcEEEE-ECCC----
Q 000473 592 EVLVSGSMDCSIRIWDLG--SGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGED------FSVALA-SLET---- 658 (1471)
Q Consensus 592 ~~L~SGs~DgtI~lWDl~--tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~D------gsV~lW-dl~t---- 658 (1471)
+.|+.+...+.|.+|++. ..+.+.+|..- +.|..+.++.. |++++|.=.+ ..+|+| |.+.
T Consensus 29 d~Lfva~~g~~Vev~~l~~~~~~~~~~F~Tv-~~V~~l~y~~~------GDYlvTlE~k~~~~~~~fvR~Y~NWr~~~~~ 101 (215)
T PF14761_consen 29 DALFVAASGCKVEVYDLEQEECPLLCTFSTV-GRVLQLVYSEA------GDYLVTLEEKNKRSPVDFVRAYFNWRSQKEE 101 (215)
T ss_pred ceEEEEcCCCEEEEEEcccCCCceeEEEcch-hheeEEEeccc------cceEEEEEeecCCccceEEEEEEEhhhhccc
Confidence 444444666789999998 33456777544 78999999998 9999997432 256665 2221
Q ss_pred CcEEE-EecCC---------------------CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 659 LRVER-MFPGH---------------------PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 659 ~~~l~-~~~gh---------------------~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
..+++ .+-|| ..++.+++-.|...-|++||.+ ++.+|.+..+
T Consensus 102 ~~~v~vRiaG~~v~~~~~~~~~~qleiiElPl~~~p~ciaCC~~tG~LlVg~~~---------~l~lf~l~~~ 165 (215)
T PF14761_consen 102 NSPVRVRIAGHRVTPSFNESSKDQLEIIELPLSEPPLCIACCPVTGNLLVGCGN---------KLVLFTLKYQ 165 (215)
T ss_pred CCcEEEEEcccccccCCCCccccceEEEEecCCCCCCEEEecCCCCCEEEEcCC---------EEEEEEEEEE
Confidence 12221 22232 2256778888877778888875 9999987644
No 411
>PF04053 Coatomer_WDAD: Coatomer WD associated region; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=45.33 E-value=55 Score=41.04 Aligned_cols=80 Identities=19% Similarity=0.197 Sum_probs=46.9
Q ss_pred CCEEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCC-----------
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETL----------- 659 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~----------- 659 (1471)
|.+|...+.+ .|.+||..+++.++.+... +|..+.|+++ ++.++-.+.| ++.+++.+..
T Consensus 117 G~LL~~~~~~-~i~~yDw~~~~~i~~i~v~--~vk~V~Ws~~------g~~val~t~~-~i~il~~~~~~~~~~~~~g~e 186 (443)
T PF04053_consen 117 GNLLGVKSSD-FICFYDWETGKLIRRIDVS--AVKYVIWSDD------GELVALVTKD-SIYILKYNLEAVAAIPEEGVE 186 (443)
T ss_dssp SSSEEEEETT-EEEEE-TTT--EEEEESS---E-EEEEE-TT------SSEEEEE-S--SEEEEEE-HHHHHHBTTTB-G
T ss_pred CcEEEEECCC-CEEEEEhhHcceeeEEecC--CCcEEEEECC------CCEEEEEeCC-eEEEEEecchhcccccccCch
Confidence 4555555555 7999999999999998654 4999999999 8888888754 6777765432
Q ss_pred cEEEEecCCCCCcEEEEEcCC
Q 000473 660 RVERMFPGHPNYPAKVVWDCP 680 (1471)
Q Consensus 660 ~~l~~~~gh~~~V~~v~~spd 680 (1471)
..+..+..-...|.+.+|..+
T Consensus 187 ~~f~~~~E~~~~IkSg~W~~d 207 (443)
T PF04053_consen 187 DAFELIHEISERIKSGCWVED 207 (443)
T ss_dssp GGEEEEEEE-S--SEEEEETT
T ss_pred hceEEEEEecceeEEEEEEcC
Confidence 022333222446778888765
No 412
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=44.29 E-value=3.1e+02 Score=31.66 Aligned_cols=114 Identities=11% Similarity=0.013 Sum_probs=64.9
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEE-CCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCc
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGS-MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFS 650 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs-~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dgs 650 (1471)
.+...+++++ ++.++.-. .++.-.||-...+....... -...+..-.|.++ +........+..
T Consensus 25 ~~~s~AvS~d---------g~~~A~v~~~~~~~~L~~~~~~~~~~~~~-~g~~l~~PS~d~~------g~~W~v~~~~~~ 88 (253)
T PF10647_consen 25 DVTSPAVSPD---------GSRVAAVSEGDGGRSLYVGPAGGPVRPVL-TGGSLTRPSWDPD------GWVWTVDDGSGG 88 (253)
T ss_pred cccceEECCC---------CCeEEEEEEcCCCCEEEEEcCCCcceeec-cCCccccccccCC------CCEEEEEcCCCc
Confidence 5778888887 55444333 33333333333333332221 2236777778887 666666666666
Q ss_pred EEEEE-CCCCcEEE-Ee--cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 651 VALAS-LETLRVER-MF--PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 651 V~lWd-l~t~~~l~-~~--~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
.+++. ..+++... .. ..-...|..+.+||||..++.-..+ .+++.|+|=-+
T Consensus 89 ~~~~~~~~~g~~~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~~~-----~~~~~v~va~V 143 (253)
T PF10647_consen 89 VRVVRDSASGTGEPVEVDWPGLRGRITALRVSPDGTRVAVVVED-----GGGGRVYVAGV 143 (253)
T ss_pred eEEEEecCCCcceeEEecccccCCceEEEEECCCCcEEEEEEec-----CCCCeEEEEEE
Confidence 67663 33333221 12 1112279999999999998877654 11267777654
No 413
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate; these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity.
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=42.59 E-value=52 Score=39.85 Aligned_cols=107 Identities=14% Similarity=0.172 Sum_probs=58.1
Q ss_pred CCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeCCCCCceeeee
Q 000473 648 DFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRGTASHSMFDHF 727 (1471)
Q Consensus 648 DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~gH~~~v~~~~~ 727 (1471)
.+...++|+++++....... ...+....|+|+|++++.... +.|++++..+++. ..++-.....+..-.
T Consensus 22 ~~~y~i~d~~~~~~~~l~~~-~~~~~~~~~sP~g~~~~~v~~---------~nly~~~~~~~~~-~~lT~dg~~~i~nG~ 90 (353)
T PF00930_consen 22 KGDYYIYDIETGEITPLTPP-PPKLQDAKWSPDGKYIAFVRD---------NNLYLRDLATGQE-TQLTTDGEPGIYNGV 90 (353)
T ss_dssp EEEEEEEETTTTEEEESS-E-ETTBSEEEE-SSSTEEEEEET---------TEEEEESSTTSEE-EESES--TTTEEESB
T ss_pred ceeEEEEecCCCceEECcCC-ccccccceeecCCCeeEEEec---------CceEEEECCCCCe-EEeccccceeEEcCc
Confidence 46788999999876554443 567889999999999988765 6899999988844 444432223332211
Q ss_pred eecccccc--ccceEEcCCccccccceeeccCCceEeecc
Q 000473 728 CKGISMNS--ISGSVLNGNTSVSSLLLPIHEDGTFRQSQI 765 (1471)
Q Consensus 728 ~~~~~~~~--~sg~v~~g~~~~s~~l~~~~~D~tir~w~l 765 (1471)
.+=+-... .....+-|+.+..-.++....+..++.+.+
T Consensus 91 ~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~ 130 (353)
T PF00930_consen 91 PDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPL 130 (353)
T ss_dssp --HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEE
T ss_pred cceeccccccccccceEECCCCCEEEEEEECCcCCceEEe
Confidence 11000000 011123345555555555555555555544
No 414
>PRK10115 protease 2; Provisional
Probab=42.26 E-value=6.6e+02 Score=33.57 Aligned_cols=112 Identities=14% Similarity=0.181 Sum_probs=65.8
Q ss_pred EEEEEEecCCCCcccCcCCCEEEEE-E----CCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 573 VLCLAAHRMVGTAKGWSFNEVLVSG-S----MDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 573 V~~la~spd~~~~~~~~~~~~L~SG-s----~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
+..+.++|| +++|+-+ + ....|++-|+.+|+.+..--.... ..+.|.+| ++.|+-...
T Consensus 129 l~~~~~Spd---------g~~la~~~d~~G~E~~~l~v~d~~tg~~l~~~i~~~~--~~~~w~~D------~~~~~y~~~ 191 (686)
T PRK10115 129 LGGMAITPD---------NTIMALAEDFLSRRQYGIRFRNLETGNWYPELLDNVE--PSFVWAND------SWTFYYVRK 191 (686)
T ss_pred EeEEEECCC---------CCEEEEEecCCCcEEEEEEEEECCCCCCCCccccCcc--eEEEEeeC------CCEEEEEEe
Confidence 556677887 5655433 3 334788999988874322212222 45899998 665544433
Q ss_pred C------CcEEEEECCCC--cEEEEecCCCCCcEEEEE-cCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 648 D------FSVALASLETL--RVERMFPGHPNYPAKVVW-DCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 648 D------gsV~lWdl~t~--~~l~~~~gh~~~V~~v~~-spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
| ..|.++++.++ +-...+........-..+ +.++++++..+.. ..++.+.+|+.
T Consensus 192 ~~~~~~~~~v~~h~lgt~~~~d~lv~~e~~~~~~~~~~~s~d~~~l~i~~~~-----~~~~~~~l~~~ 254 (686)
T PRK10115 192 HPVTLLPYQVWRHTIGTPASQDELVYEEKDDTFYVSLHKTTSKHYVVIHLAS-----ATTSEVLLLDA 254 (686)
T ss_pred cCCCCCCCEEEEEECCCChhHCeEEEeeCCCCEEEEEEEcCCCCEEEEEEEC-----CccccEEEEEC
Confidence 2 46788888887 444455443333332334 4488888766554 12267899984
No 415
>PHA02713 hypothetical protein; Provisional
Probab=41.87 E-value=1.1e+02 Score=39.74 Aligned_cols=73 Identities=7% Similarity=0.018 Sum_probs=43.5
Q ss_pred CCEEEEEeCC------CcEEEEECCC-Cc--EEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCC
Q 000473 639 SDCFLSVGED------FSVALASLET-LR--VERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTG 709 (1471)
Q Consensus 639 ~~~l~S~s~D------gsV~lWdl~t-~~--~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg 709 (1471)
+...+.||.+ ..|..||.++ .+ .+..++........+.+ +++..++|+.| +..++..||..|.
T Consensus 464 ~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~~--~~~iyv~Gg~~------~~~~~e~yd~~~~ 535 (557)
T PHA02713 464 DDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILH--DNTIMMLHCYE------SYMLQDTFNVYTY 535 (557)
T ss_pred CEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEEE--CCEEEEEeeec------ceeehhhcCcccc
Confidence 4566667654 2467899886 33 33344433333444444 56777888877 1137889999887
Q ss_pred eEEEEEeCCC
Q 000473 710 ARERVLRGTA 719 (1471)
Q Consensus 710 ~~~~~l~gH~ 719 (1471)
+=...-..|+
T Consensus 536 ~W~~~~~~~~ 545 (557)
T PHA02713 536 EWNHICHQHS 545 (557)
T ss_pred cccchhhhcC
Confidence 6544444454
No 416
>PF13645 YkuD_2: L,D-transpeptidase catalytic domain
Probab=40.92 E-value=74 Score=34.70 Aligned_cols=80 Identities=14% Similarity=0.103 Sum_probs=52.0
Q ss_pred cCCCChhhhhhhhhHHHHHHHHHHcccCeeecCCCCceEEeecCcCcccCceEEEEECccccEEE-EEecCCCCCCCCCC
Q 000473 1287 MDPGNSVMRKTCLHTSMAALKEIVHVFPMVSLNDTSTKLAVGDAIGDIKKASIRVYDMQSVTKIK-VLDASGPPGLPRES 1365 (1471)
Q Consensus 1287 lDp~~~~~r~~~l~~~~~~l~~~~~~~p~v~~~~~tqrlavg~~~g~~~~~~i~~ydl~~~~~~~-~~~~~~~~~~~~~~ 1365 (1471)
|....+.+....++.++..+..+.++ .. ..+.++++-+-.-+-......||||++.+-+. .+-|||
T Consensus 4 l~~~~~~l~~~~~~~a~~~~~~~~~~-~~----~~~~~l~iIDfs~pS~~~R~~v~Dl~~~~~l~~~~VaHG-------- 70 (176)
T PF13645_consen 4 LNLEAPKLSPKAFQKALKAYQCAKKK-KI----YNKDILTIIDFSKPSGEKRFFVIDLKKGKLLYNTLVAHG-------- 70 (176)
T ss_pred hhhhccCCCHHHHHHHHHHHHHHHhc-cC----CCCCeEEEEECCCCCCCCeEEEEECCCCEEEEeeeeecc--------
Confidence 44445566678888888888777755 11 14445555444333335678999999996665 699997
Q ss_pred CcccccceEEEEECCC
Q 000473 1366 DSVATTVISALIFSPD 1381 (1471)
Q Consensus 1366 ~~~~~~~i~a~~fs~d 1381 (1471)
..++.-.|-.||..
T Consensus 71 --~gsg~~~a~~FSN~ 84 (176)
T PF13645_consen 71 --RGSGNLYATSFSNR 84 (176)
T ss_pred --cCCCCCccccCcCC
Confidence 33555567788755
No 417
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=40.53 E-value=5.9e+02 Score=31.38 Aligned_cols=175 Identities=14% Similarity=0.104 Sum_probs=82.2
Q ss_pred EeeccccccCC---EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCC
Q 000473 516 MVISESFYAPY---AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNE 592 (1471)
Q Consensus 516 ~~is~~~f~P~---~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~ 592 (1471)
.|-.+.+|.++ .|..+..||.-.++..+ +++++..+.-.+-........++|+ .+
T Consensus 36 ~YF~~~~ft~dG~kllF~s~~dg~~nly~lD-------------L~t~~i~QLTdg~g~~~~g~~~s~~---------~~ 93 (386)
T PF14583_consen 36 LYFYQNCFTDDGRKLLFASDFDGNRNLYLLD-------------LATGEITQLTDGPGDNTFGGFLSPD---------DR 93 (386)
T ss_dssp --TTS--B-TTS-EEEEEE-TTSS-EEEEEE-------------TTT-EEEE---SS-B-TTT-EE-TT---------SS
T ss_pred eeecCCCcCCCCCEEEEEeccCCCcceEEEE-------------cccCEEEECccCCCCCccceEEecC---------CC
Confidence 44444556655 44555558877775554 4455444333321111223344665 45
Q ss_pred EEEEEECCCcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEe---CC-------------------Cc
Q 000473 593 VLVSGSMDCSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVG---ED-------------------FS 650 (1471)
Q Consensus 593 ~L~SGs~DgtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s---~D-------------------gs 650 (1471)
.++=-.....++--|+.+++....+......+-...|..+. ++..++-.- .| ..
T Consensus 94 ~~~Yv~~~~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v~n~----d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~ 169 (386)
T PF14583_consen 94 ALYYVKNGRSLRRVDLDTLEERVVYEVPDDWKGYGTWVANS----DCTKLVGIEISREDWKPLTKWKGFREFYEARPHCR 169 (386)
T ss_dssp EEEEEETTTEEEEEETTT--EEEEEE--TTEEEEEEEEE-T----TSSEEEEEEEEGGG-----SHHHHHHHHHC---EE
T ss_pred eEEEEECCCeEEEEECCcCcEEEEEECCcccccccceeeCC----CccEEEEEEEeehhccCccccHHHHHHHhhCCCce
Confidence 54434455688888999988766666666555444443221 033332221 11 23
Q ss_pred EEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCC-CeEEEEEeCCCC
Q 000473 651 VALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKT-GARERVLRGTAS 720 (1471)
Q Consensus 651 V~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~t-g~~~~~l~gH~~ 720 (1471)
|.-.|+++|+....+. -...+.-+.|+|....+++=|.. |.+|.. .-|||-+++ |...+.+..|..
T Consensus 170 i~~idl~tG~~~~v~~-~~~wlgH~~fsP~dp~li~fCHE--Gpw~~V-d~RiW~i~~dg~~~~~v~~~~~ 236 (386)
T PF14583_consen 170 IFTIDLKTGERKVVFE-DTDWLGHVQFSPTDPTLIMFCHE--GPWDLV-DQRIWTINTDGSNVKKVHRRME 236 (386)
T ss_dssp EEEEETTT--EEEEEE-ESS-EEEEEEETTEEEEEEEEE---S-TTTS-S-SEEEEETTS---EESS---T
T ss_pred EEEEECCCCceeEEEe-cCccccCcccCCCCCCEEEEecc--CCccee-ceEEEEEEcCCCcceeeecCCC
Confidence 4455778777654444 34567889999999999999977 777741 236776654 555566655544
No 418
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=39.89 E-value=1.4e+02 Score=37.18 Aligned_cols=60 Identities=17% Similarity=0.194 Sum_probs=50.4
Q ss_pred CCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 639 SDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 639 ~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
..+++.+|.-|-|+|||--.-+....+++....|..|..+.+|+++++.|.. .+.+-|++
T Consensus 573 sGyIa~as~kGDirLyDRig~rAKtalP~lG~aIk~idvta~Gk~ilaTCk~---------yllL~d~~ 632 (776)
T COG5167 573 SGYIAAASRKGDIRLYDRIGKRAKTALPGLGDAIKHIDVTANGKHILATCKN---------YLLLTDVP 632 (776)
T ss_pred CceEEEecCCCceeeehhhcchhhhcCcccccceeeeEeecCCcEEEEeecc---------eEEEEecc
Confidence 4589999999999999976555556678888889999999999999999985 77777764
No 419
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=39.67 E-value=2e+02 Score=35.39 Aligned_cols=151 Identities=17% Similarity=0.157 Sum_probs=69.8
Q ss_pred ccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCc--EEEEECCCCceEEEEeccCCCEEEEEECCCC
Q 000473 556 KVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCS--IRIWDLGSGNLITVMHHHVAPVRQIILSPPQ 633 (1471)
Q Consensus 556 d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~Dgt--I~lWDl~tg~~l~~~~~H~~~V~~l~fspd~ 633 (1471)
|..+|..+..|..+.+.-..+-|...-=..+| .++|+.+..|+. +.+-|+.+++..+--.+-........++|+
T Consensus 16 D~~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG---~kllF~s~~dg~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~- 91 (386)
T PF14583_consen 16 DPDTGHRVTRLTPPDGHSHRLYFYQNCFTDDG---RKLLFASDFDGNRNLYLLDLATGEITQLTDGPGDNTFGGFLSPD- 91 (386)
T ss_dssp -TTT--EEEE-S-TTS-EE---TTS--B-TTS----EEEEEE-TTSS-EEEEEETTT-EEEE---SS-B-TTT-EE-TT-
T ss_pred CCCCCceEEEecCCCCcccceeecCCCcCCCC---CEEEEEeccCCCcceEEEEcccCEEEECccCCCCCccceEEecC-
Confidence 45566666666544433333333221000111 456777776774 555677887754433221121223555677
Q ss_pred CCCCCCCEEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEEc--CCCCEEEEEEcCC----CCCC-----------C
Q 000473 634 TEHPWSDCFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVWD--CPRGYIACLCRDH----SRTS-----------D 696 (1471)
Q Consensus 634 ~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~s--pdg~~L~sgs~D~----sg~~-----------D 696 (1471)
++.++-......|+-.|+++++....+.-....+-...|. .|+.. ++|..-. .... .
T Consensus 92 -----~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~p~~~~g~gt~v~n~d~t~-~~g~e~~~~d~~~l~~~~~f~e~~~a~ 165 (386)
T PF14583_consen 92 -----DRALYYVKNGRSLRRVDLDTLEERVVYEVPDDWKGYGTWVANSDCTK-LVGIEISREDWKPLTKWKGFREFYEAR 165 (386)
T ss_dssp -----SSEEEEEETTTEEEEEETTT--EEEEEE--TTEEEEEEEEE-TTSSE-EEEEEEEGGG-----SHHHHHHHHHC-
T ss_pred -----CCeEEEEECCCeEEEEECCcCcEEEEEECCcccccccceeeCCCccE-EEEEEEeehhccCccccHHHHHHHhhC
Confidence 6777777777899999999998776776667766667774 34444 4454310 0000 0
Q ss_pred CCCEEEEEECCCCeEEEEEe
Q 000473 697 AVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 697 ~~gtV~VWDi~tg~~~~~l~ 716 (1471)
....|.--|++||+....+.
T Consensus 166 p~~~i~~idl~tG~~~~v~~ 185 (386)
T PF14583_consen 166 PHCRIFTIDLKTGERKVVFE 185 (386)
T ss_dssp --EEEEEEETTT--EEEEEE
T ss_pred CCceEEEEECCCCceeEEEe
Confidence 11457777889988655544
No 420
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=38.15 E-value=2.8e+02 Score=35.02 Aligned_cols=76 Identities=16% Similarity=0.187 Sum_probs=50.8
Q ss_pred CEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE---------CCCCcEEEEecC-----------CCCCcEEEEEcCCC-
Q 000473 623 PVRQIILSPPQTEHPWSDCFLSVGEDFSVALAS---------LETLRVERMFPG-----------HPNYPAKVVWDCPR- 681 (1471)
Q Consensus 623 ~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd---------l~t~~~l~~~~g-----------h~~~V~~v~~spdg- 681 (1471)
.|..+..++. |..++-.|.||.+.++= +++|+....++. ..-.+..++|+|+.
T Consensus 105 eV~~vl~s~~------GS~VaL~G~~Gi~vMeLp~rwG~~s~~eDgk~~v~CRt~~i~~~~ftss~~ltl~Qa~WHP~S~ 178 (741)
T KOG4460|consen 105 EVYQVLLSPT------GSHVALIGIKGLMVMELPKRWGKNSEFEDGKSTVNCRTTPVAERFFTSSTSLTLKQAAWHPSSI 178 (741)
T ss_pred EEEEEEecCC------CceEEEecCCeeEEEEchhhcCccceecCCCceEEEEeecccceeeccCCceeeeeccccCCcc
Confidence 4677778887 88888888898776543 234543322211 11146778999985
Q ss_pred --CEEEEEEcCCCCCCCCCCEEEEEECCCCeEE
Q 000473 682 --GYIACLCRDHSRTSDAVDVLFIWDVKTGARE 712 (1471)
Q Consensus 682 --~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~ 712 (1471)
..|..-..| .++|+||...-+.+
T Consensus 179 ~D~hL~iL~sd--------nviRiy~lS~~tel 203 (741)
T KOG4460|consen 179 LDPHLVLLTSD--------NVIRIYSLSEPTEL 203 (741)
T ss_pred CCceEEEEecC--------cEEEEEecCCcchh
Confidence 677777777 89999998764433
No 421
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP, is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport.; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=37.44 E-value=4.5e+02 Score=32.89 Aligned_cols=44 Identities=23% Similarity=0.313 Sum_probs=29.1
Q ss_pred CCeEEEEEcCCCeEEEeeeCCCCCCCCcEEEEcC-CCCeEEEEcc
Q 000473 111 DGVLCVWSRSSGHCRRRRKLPPWVGSPSVICTLP-SNPRYVCIGC 154 (1471)
Q Consensus 111 DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~~~s-~~~~ll~~G~ 154 (1471)
-.+|.+||+.+.+.+..+.++...+.|..|++++ ++...-.+|+
T Consensus 221 G~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~ 265 (461)
T PF05694_consen 221 GHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGC 265 (461)
T ss_dssp --EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEE
T ss_pred cCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEE
Confidence 3579999999999999999975444677888777 4444444444
No 422
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=36.47 E-value=1e+02 Score=36.54 Aligned_cols=57 Identities=18% Similarity=0.193 Sum_probs=47.4
Q ss_pred ceEEeecCcCcccCceEEEEECcc--ccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecCCcEEEEEec
Q 000473 1323 TKLAVGDAIGDIKKASIRVYDMQS--VTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEHGLMIRWWSL 1400 (1471)
Q Consensus 1323 qrlavg~~~g~~~~~~i~~ydl~~--~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~~~~~~~w~~ 1400 (1471)
.+|++|+.+| |.+.|+.. ++-.+++. +..|+.+.-.+.=..|++.|.+..+|+++.+
T Consensus 14 ~~lL~GTe~G------ly~~~~~~~~~~~~kl~~---------------~~~v~q~~v~~~~~lLi~Lsgk~~~L~~~~L 72 (302)
T smart00036 14 KWLLVGTEEG------LYVLNISDQPGTLEKLIG---------------RRSVTQIWVLEENNVLLMISGKKPQLYSHPL 72 (302)
T ss_pred cEEEEEeCCc------eEEEEcccCCCCeEEecC---------------cCceEEEEEEhhhCEEEEEeCCcceEEEEEH
Confidence 6899999966 77888765 44556666 7899999999999999999988888988887
No 423
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=36.35 E-value=5.3e+02 Score=27.06 Aligned_cols=108 Identities=13% Similarity=0.155 Sum_probs=60.7
Q ss_pred EEEEEcCCCCeEEEEeCCCcEEEEEccCCCC--C-ceeeeEEecccccceeEeeeccccccccCcccccccccccccccc
Q 000473 20 TATSALTQPPTLYTGGSDGSILWWSFSDSSY--S-EIKPVAMLCGHSAPIADLSICYPAMVSRDGKAEHWKAENSSNVMG 96 (1471)
Q Consensus 20 tava~SpDg~~LaTGs~DG~I~lWdl~~~~~--~-~~~~~~~L~GH~~~Vt~La~c~~~~~s~dg~~~~~~~~~~~~~~~ 96 (1471)
+.-.|-...+-|+.++.-|+|.+.+...... . ....++.|- -...|++|+- |
T Consensus 2 aiGkfDG~~pcL~~aT~~gKV~IH~ph~~~~~~~~~~~~i~~LN-in~~italaa---------G--------------- 56 (136)
T PF14781_consen 2 AIGKFDGVHPCLACATTGGKVFIHNPHERGQRTGRQDSDISFLN-INQEITALAA---------G--------------- 56 (136)
T ss_pred eEEEeCCCceeEEEEecCCEEEEECCCccccccccccCceeEEE-CCCceEEEEE---------E---------------
Confidence 3445655567888888889999998652100 0 111223332 2446888871 2
Q ss_pred cc--cCCCCEEEEEeCCCeEEEEEcCCCeEEEeeeCCCCCCCCcEEE-EcCC-CCeEEEEcce
Q 000473 97 KS--SLDNGALISACTDGVLCVWSRSSGHCRRRRKLPPWVGSPSVIC-TLPS-NPRYVCIGCC 155 (1471)
Q Consensus 97 ~~--s~d~~~LaSas~DG~I~VWdv~~G~ci~~~~l~~~~g~~~~i~-~~s~-~~~ll~~G~~ 155 (1471)
++ ..+...|+-|+.. .|-.+|+....-+-.+.++.++ ...+. .+.. +..++.+|..
T Consensus 57 ~l~~~~~~D~LliGt~t-~llaYDV~~N~d~Fyke~~DGv--n~i~~g~~~~~~~~l~ivGGn 116 (136)
T PF14781_consen 57 RLKPDDGRDCLLIGTQT-SLLAYDVENNSDLFYKEVPDGV--NAIVIGKLGDIPSPLVIVGGN 116 (136)
T ss_pred ecCCCCCcCEEEEeccc-eEEEEEcccCchhhhhhCccce--eEEEEEecCCCCCcEEEECce
Confidence 23 3456777777765 5779999877655555555322 22111 3332 4456666665
No 424
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=35.74 E-value=1.6e+02 Score=40.24 Aligned_cols=100 Identities=16% Similarity=0.094 Sum_probs=64.5
Q ss_pred EEEEEECCCcEEEEECCCCce-----EEEEecc------CCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcE
Q 000473 593 VLVSGSMDCSIRIWDLGSGNL-----ITVMHHH------VAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRV 661 (1471)
Q Consensus 593 ~L~SGs~DgtI~lWDl~tg~~-----l~~~~~H------~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~ 661 (1471)
.++..+.+-.|..+|+.+-.. -.-|+.| .....++.|+|.- ....+....|+.|++..+.....
T Consensus 116 ~v~~tsng~~v~~fD~~~fs~s~~~~~~pl~~s~ts~ek~vf~~~~~wnP~v-----p~n~av~l~dlsl~V~~~~~~~~ 190 (1405)
T KOG3630|consen 116 VVVSTSNGEAVYSFDLEEFSESRYETTVPLKNSATSFEKPVFQLKNVWNPLV-----PLNSAVDLSDLSLRVKSTKQLAQ 190 (1405)
T ss_pred EEEEecCCceEEEEehHhhhhhhhhhccccccccchhccccccccccccCCc-----cchhhhhccccchhhhhhhhhhh
Confidence 344455566888999975321 1222222 2345667788762 33566777899999887754332
Q ss_pred -EEEecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEEC
Q 000473 662 -ERMFPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDV 706 (1471)
Q Consensus 662 -l~~~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi 706 (1471)
+..++ -...+++|+|+|.|+.++.|-.. |++.=|..
T Consensus 191 ~v~s~p-~t~~~Tav~WSprGKQl~iG~nn--------Gt~vQy~P 227 (1405)
T KOG3630|consen 191 NVTSFP-VTNSQTAVLWSPRGKQLFIGRNN--------GTEVQYEP 227 (1405)
T ss_pred hhcccC-cccceeeEEeccccceeeEecCC--------CeEEEeec
Confidence 22222 33457999999999999998887 88877653
No 425
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=33.90 E-value=2.3e+02 Score=32.14 Aligned_cols=24 Identities=17% Similarity=0.434 Sum_probs=21.6
Q ss_pred CCEEEEEECCCcEEEEECCCCceE
Q 000473 591 NEVLVSGSMDCSIRIWDLGSGNLI 614 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~tg~~l 614 (1471)
+++|+.-..+|.+++||+.+++.+
T Consensus 22 ~~~Ll~iT~~G~l~vWnl~~~k~~ 45 (219)
T PF07569_consen 22 GSYLLAITSSGLLYVWNLKKGKAV 45 (219)
T ss_pred CCEEEEEeCCCeEEEEECCCCeec
Confidence 788999999999999999988764
No 426
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 kDa alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1. Arrays of HEAT repeats consist of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A, the HEAT repeats consist of pairs of antiparallel alpha helices.; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=33.88 E-value=42 Score=25.38 Aligned_cols=24 Identities=33% Similarity=0.438 Sum_probs=18.3
Q ss_pred HHHHHHHhcCCCHHHHHHHHHHHH
Q 000473 1006 LQLLVSFWQDESEHVRMAARSLFH 1029 (1471)
Q Consensus 1006 l~~la~~wqd~~~~vr~aar~l~~ 1029 (1471)
+..|.+.-+|++++||++|=.-|.
T Consensus 2 lp~l~~~l~D~~~~VR~~a~~~l~ 25 (31)
T PF02985_consen 2 LPILLQLLNDPSPEVRQAAAECLG 25 (31)
T ss_dssp HHHHHHHHT-SSHHHHHHHHHHHH
T ss_pred HHHHHHHcCCCCHHHHHHHHHHHH
Confidence 456777888999999999876665
No 427
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes. RNR catalyses the rate-limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance.
Probab=33.12 E-value=2.4e+02 Score=37.38 Aligned_cols=102 Identities=11% Similarity=0.024 Sum_probs=66.6
Q ss_pred CCEEEEEECCCcEEEEECCC-------C-------------ceEEEEeccCCCEEEEEEC--CCCCCCCCCCEEEEEeCC
Q 000473 591 NEVLVSGSMDCSIRIWDLGS-------G-------------NLITVMHHHVAPVRQIILS--PPQTEHPWSDCFLSVGED 648 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lWDl~t-------g-------------~~l~~~~~H~~~V~~l~fs--pd~~~~~~~~~l~S~s~D 648 (1471)
.+.|+.+..||.|.+|.+++ . ++...+.. ...++.++++ .. .+++|.++.-
T Consensus 114 ~EVLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~v-~~SaWGLdIh~~~~------~rlIAVSsNs 186 (717)
T PF08728_consen 114 EEVLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLRV-GASAWGLDIHDYKK------SRLIAVSSNS 186 (717)
T ss_pred eeEEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEeec-CCceeEEEEEecCc------ceEEEEecCC
Confidence 67999999999999997631 0 01222332 3478999997 54 6788888888
Q ss_pred CcEEEEECCC--CcE-EEEecCCCCCcEEEEEcCCC-----C-EEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 649 FSVALASLET--LRV-ERMFPGHPNYPAKVVWDCPR-----G-YIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 649 gsV~lWdl~t--~~~-l~~~~gh~~~V~~v~~spdg-----~-~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
..|.|+-... .+. ...-..|..-|-+|.|-++. . ++++++-. |.+.+|++.
T Consensus 187 ~~VTVFaf~l~~~r~~~~~s~~~~hNIP~VSFl~~~~d~~G~v~v~a~dI~--------G~v~~~~I~ 246 (717)
T PF08728_consen 187 QEVTVFAFALVDERFYHVPSHQHSHNIPNVSFLDDDLDPNGHVKVVATDIS--------GEVWTFKIK 246 (717)
T ss_pred ceEEEEEEeccccccccccccccccCCCeeEeecCCCCCccceEEEEEecc--------CcEEEEEEE
Confidence 8888886543 111 11111255568899996654 2 55556655 888888873
No 428
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose-dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=31.84 E-value=3.1e+02 Score=36.00 Aligned_cols=56 Identities=7% Similarity=0.027 Sum_probs=40.7
Q ss_pred EEEEeC-CCcEEEEECCCCcEEEEec-CCCCCcEEEEE--cCCCCEEEEEEcCCCCCCCCCCEEEEEE
Q 000473 642 FLSVGE-DFSVALASLETLRVERMFP-GHPNYPAKVVW--DCPRGYIACLCRDHSRTSDAVDVLFIWD 705 (1471)
Q Consensus 642 l~S~s~-DgsV~lWdl~t~~~l~~~~-gh~~~V~~v~~--spdg~~L~sgs~D~sg~~D~~gtV~VWD 705 (1471)
+|.+.. -..+.|||.+.+.....-. ...+.|..+.| .|+++.+++-|.. +.|.++-
T Consensus 43 ~a~V~~~~~~LtIWD~~~~~lE~~~~f~~~~~I~dLDWtst~d~qsiLaVGf~--------~~v~l~~ 102 (631)
T PF12234_consen 43 IAVVDSSRSELTIWDTRSGVLEYEESFSEDDPIRDLDWTSTPDGQSILAVGFP--------HHVLLYT 102 (631)
T ss_pred EEEEECCCCEEEEEEcCCcEEEEeeeecCCCceeeceeeecCCCCEEEEEEcC--------cEEEEEE
Confidence 444443 4678999999887654322 34678999999 5788888888877 7888884
No 429
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.41 E-value=5.6e+02 Score=30.36 Aligned_cols=118 Identities=14% Similarity=0.099 Sum_probs=74.7
Q ss_pred ecCCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCceEEEEeccC-CCEEEEEECCCCCCCCCCCEEEEE
Q 000473 567 LGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNLITVMHHHV-APVRQIILSPPQTEHPWSDCFLSV 645 (1471)
Q Consensus 567 ~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~l~~~~~H~-~~V~~l~fspd~~~~~~~~~l~S~ 645 (1471)
.|-+..|.++.|+|+ .+.|++-.....-.||=...|+.++++.-.. ..-..+.+.-+ ++++++-
T Consensus 82 ~g~~~nvS~LTynp~---------~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Ieyig~------n~fvi~d 146 (316)
T COG3204 82 LGETANVSSLTYNPD---------TRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEYIGG------NQFVIVD 146 (316)
T ss_pred ccccccccceeeCCC---------cceEEEecCCCceEEEEecCCceEEEecccccCChhHeEEecC------CEEEEEe
Confidence 344555999999998 6777777777777788778899988764321 12234555444 4455454
Q ss_pred eCCCcEEEEECCCCcEEE-----Eec----CC-CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECC
Q 000473 646 GEDFSVALASLETLRVER-----MFP----GH-PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVK 707 (1471)
Q Consensus 646 s~DgsV~lWdl~t~~~l~-----~~~----gh-~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~ 707 (1471)
=.|+.+.+..+.....+. .++ .+ ......++|+|.+..|..+-+- .-+.|+.+.
T Consensus 147 ER~~~l~~~~vd~~t~~~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr--------~P~~I~~~~ 210 (316)
T COG3204 147 ERDRALYLFTVDADTTVISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKER--------NPIGIFEVT 210 (316)
T ss_pred hhcceEEEEEEcCCccEEeccceEEeccccCCCCcCceeeecCCCCceEEEEEcc--------CCcEEEEEe
Confidence 456777666665432111 111 12 3456789999999888887776 556666554
No 430
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=30.63 E-value=7.9e+02 Score=29.91 Aligned_cols=110 Identities=17% Similarity=0.147 Sum_probs=56.3
Q ss_pred cEEEEECCC--Cc--eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCC-----Cc--EE-EEecC--
Q 000473 602 SIRIWDLGS--GN--LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLET-----LR--VE-RMFPG-- 667 (1471)
Q Consensus 602 tI~lWDl~t--g~--~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t-----~~--~l-~~~~g-- 667 (1471)
.|.+++-.. |+ ....|.........+++.++ + ++++ +.+...++.|... ++ .+ ..+..
T Consensus 48 rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~~------G-lyV~-~~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~ 119 (367)
T TIGR02604 48 RILILEDADGDGKYDKSNVFAEELSMVTGLAVAVG------G-VYVA-TPPDILFLRDKDGDDKADGEREVLLSGFGGQI 119 (367)
T ss_pred EEEEEEcCCCCCCcceeEEeecCCCCccceeEecC------C-EEEe-CCCeEEEEeCCCCCCCCCCccEEEEEccCCCC
Confidence 676666443 33 23445444445678888887 7 5554 3444333445431 12 12 22322
Q ss_pred --CCCCcEEEEEcCCCCEEEEEEcCCC------CCCC-----CCCEEEEEECCCCeEEEEEeCCC
Q 000473 668 --HPNYPAKVVWDCPRGYIACLCRDHS------RTSD-----AVDVLFIWDVKTGARERVLRGTA 719 (1471)
Q Consensus 668 --h~~~V~~v~~spdg~~L~sgs~D~s------g~~D-----~~gtV~VWDi~tg~~~~~l~gH~ 719 (1471)
+...+..++|.|||.+.++-+..-. +..+ ..|.|.-+|..+++.+..-.|+.
T Consensus 120 ~~~~~~~~~l~~gpDG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a~G~r 184 (367)
T TIGR02604 120 NNHHHSLNSLAWGPDGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVAHGFQ 184 (367)
T ss_pred CcccccccCceECCCCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEecCcC
Confidence 1344778999999986655442100 0000 01567777777766655445544
No 431
>KOG1242 consensus Protein containing adaptin N-terminal region [Translation, ribosomal structure and biogenesis]
Probab=30.50 E-value=1.3e+03 Score=29.89 Aligned_cols=250 Identities=16% Similarity=0.155 Sum_probs=125.5
Q ss_pred HHHHHhhhhcccccCCCCCCCcHHHHHHHhcCCCHHHHHHHHHHHHHHHHhhCCCCCCCCCCcccccccccccccCCCCc
Q 000473 984 SALAAFYTRNFAENFPDIKPPLLQLLVSFWQDESEHVRMAARSLFHCAASRAIPLPLCSPKGVADAKPVWSLSTTGDDEH 1063 (1471)
Q Consensus 984 s~l~~~~~~~l~~~~~~~~~p~l~~la~~wqd~~~~vr~aar~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 1063 (1471)
+.-...+|.+|......|-.|.|-.+-..-=|..++||+||..-.. ++.+..+. -+ -|.++-++
T Consensus 196 ~~a~~~~~~~Lg~~~EPyiv~~lp~il~~~~d~~~~Vr~Aa~~a~k-ai~~~~~~-----~a--VK~llpsl-------- 259 (569)
T KOG1242|consen 196 LLAFEAAQGNLGPPFEPYIVPILPSILTNFGDKINKVREAAVEAAK-AIMRCLSA-----YA--VKLLLPSL-------- 259 (569)
T ss_pred HHHHHHHHHhcCCCCCchHHhhHHHHHHHhhccchhhhHHHHHHHH-HHHHhcCc-----ch--hhHhhhhh--------
Confidence 3334456767664444465566666666667999999999988888 66676664 22 22222110
Q ss_pred cCcccccccccccccCCCCcccCCCccchhhhhhhhhhcccccceecccCCccccchhhHh-hhhhhhhhcCCCCChhhH
Q 000473 1064 ANSNVEKISANELASDMLPETQGNSLVEESDVLSWLESFEVQDWISCVGGTSQDAMTSHII-VAAALAIWYPSLVKPTLA 1142 (1471)
Q Consensus 1064 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~ 1142 (1471)
+ . ... +. .|. .+.+.+ ..++++-..|..++. ..
T Consensus 260 ------------l----~----~l~----~~--kWr-------------------tK~aslellg~m~~~ap~qLs~-~l 293 (569)
T KOG1242|consen 260 ------------L----G----SLL----EA--KWR-------------------TKMASLELLGAMADCAPKQLSL-CL 293 (569)
T ss_pred ------------H----H----HHH----HH--hhh-------------------hHHHHHHHHHHHHHhchHHHHH-HH
Confidence 0 0 000 00 231 233333 333454443433332 23
Q ss_pred HHHHHHHHHHHHhcCcchhHHHHHHHHHhhHhhcccccccchhhhhhhhhhhhhhccccccccCCCCCCchhhhHHHHHH
Q 000473 1143 MLVVQPLIKLVMATNEKYSSTAAELLAEGMESTWKTCIGFEIPRLIGDIFFQIECVSNSSANLAGQHPAVPASIRETLVG 1222 (1471)
Q Consensus 1143 ~~~~~~l~~ll~~~~~~~~~~ai~l~~~gf~~~w~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~ 1222 (1471)
..+...|...|.+.+...|.+|++-|-+=- ++=+. +||.+.+- .++.+++.|+. -...|-..|.+
T Consensus 294 p~iiP~lsevl~DT~~evr~a~~~~l~~~~-svidN---~dI~~~ip---~Lld~l~dp~~--------~~~e~~~~L~~ 358 (569)
T KOG1242|consen 294 PDLIPVLSEVLWDTKPEVRKAGIETLLKFG-SVIDN---PDIQKIIP---TLLDALADPSC--------YTPECLDSLGA 358 (569)
T ss_pred hHhhHHHHHHHccCCHHHHHHHHHHHHHHH-Hhhcc---HHHHHHHH---HHHHHhcCccc--------chHHHHHhhcc
Confidence 345667778888999999999999887543 34321 15555542 23344432210 11112211111
Q ss_pred HHhHhHHhcChhHHHHHHHHHHhhc-CCCC-ccchhhHHHHH-HHH--hCChhHHHHhHHHHHHHhhhhcCCCChhhhhh
Q 000473 1223 ILLPSLAMADILGFLTVVESQIWST-ASDS-PVHLVSIMTII-RVV--RGSPRNVAQHLDKVVNFILQTMDPGNSVMRKT 1297 (1471)
Q Consensus 1223 ~~l~~ia~~~~~~f~~~~~~~i~~~-~~~~-~~~~~~~~~l~-~~i--~~~p~~~~~~l~~~~~~~~~~lDp~~~~~r~~ 1297 (1471)
..+ ++..++|.. ..|.-.+.|. +.++ ..+ ...-.++ .+. =..|.++.++|+.++.-+=+.++--.|.-|.-
T Consensus 359 ttF--V~~V~~psL-almvpiL~R~l~eRst~~k-r~t~~IidNm~~LveDp~~lapfl~~Llp~lk~~~~d~~PEvR~v 434 (569)
T KOG1242|consen 359 TTF--VAEVDAPSL-ALMVPILKRGLAERSTSIK-RKTAIIIDNMCKLVEDPKDLAPFLPSLLPGLKENLDDAVPEVRAV 434 (569)
T ss_pred eee--eeeecchhH-HHHHHHHHHHHhhccchhh-hhHHHHHHHHHHhhcCHHHHhhhHHHHhhHHHHHhcCCChhHHHH
Confidence 111 344444322 2222222222 1112 121 1111111 111 23688999999999988866655338886765
Q ss_pred hhhHHHHHHHHHHcccCeeec
Q 000473 1298 CLHTSMAALKEIVHVFPMVSL 1318 (1471)
Q Consensus 1298 ~l~~~~~~l~~~~~~~p~v~~ 1318 (1471)
..... ..+.++--++.|
T Consensus 435 aarAL----~~l~e~~g~~~f 451 (569)
T KOG1242|consen 435 AARAL----GALLERLGEVSF 451 (569)
T ss_pred HHHHH----HHHHHHHHhhcc
Confidence 54444 334455556666
No 432
>KOG2973 consensus Uncharacterized conserved protein [Function unknown]
Probab=30.15 E-value=3e+02 Score=32.70 Aligned_cols=32 Identities=16% Similarity=0.277 Sum_probs=26.7
Q ss_pred HHHHHHHhcCcchhHHHH----HHHHHhhHhhccccc
Q 000473 1148 PLIKLVMATNEKYSSTAA----ELLAEGMESTWKTCI 1180 (1471)
Q Consensus 1148 ~l~~ll~~~~~~~~~~ai----~l~~~gf~~~w~~~~ 1180 (1471)
+|..+|...+.++|.+|+ -|.|+||. +|..|=
T Consensus 7 elv~ll~~~sP~v~~~AV~~l~~lt~~~~~-~~~~~~ 42 (353)
T KOG2973|consen 7 ELVELLHSLSPPVRKAAVEHLLGLTGRGLQ-SLSKYS 42 (353)
T ss_pred HHHHHhccCChHHHHHHHHHHhhccccchh-hhccch
Confidence 567788889999999999 56678996 998885
No 433
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the InterPro repeat IPR001680. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=28.62 E-value=1.4e+02 Score=23.51 Aligned_cols=27 Identities=11% Similarity=0.071 Sum_probs=18.3
Q ss_pred ceEEEEEEcCCCCeEEEE-eCC--CcEEEE
Q 000473 17 HRVTATSALTQPPTLYTG-GSD--GSILWW 43 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTG-s~D--G~I~lW 43 (1471)
..-...+|||||++|+=. ..+ |.--||
T Consensus 9 ~~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 9 GDDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred ccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 456788999999888854 444 665555
No 434
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=28.45 E-value=1.8e+02 Score=22.91 Aligned_cols=38 Identities=18% Similarity=0.240 Sum_probs=26.3
Q ss_pred CC-EEEEEeCCCcEEEEECCCCcEEEEecCCCCCcEEEEE
Q 000473 639 SD-CFLSVGEDFSVALASLETLRVERMFPGHPNYPAKVVW 677 (1471)
Q Consensus 639 ~~-~l~S~s~DgsV~lWdl~t~~~l~~~~gh~~~V~~v~~ 677 (1471)
++ .+++.-.+++|.++|..+++.+..+... .....++|
T Consensus 3 ~~~lyv~~~~~~~v~~id~~~~~~~~~i~vg-~~P~~i~~ 41 (42)
T TIGR02276 3 GTKLYVTNSGSNTVSVIDTATNKVIATIPVG-GYPFGVAV 41 (42)
T ss_pred CCEEEEEeCCCCEEEEEECCCCeEEEEEECC-CCCceEEe
Confidence 55 4455556899999999999888877652 33445554
No 435
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=27.17 E-value=1.5e+02 Score=29.05 Aligned_cols=51 Identities=14% Similarity=0.204 Sum_probs=37.5
Q ss_pred hhHHHHHHHHhCChhHHHHhHHHHHHHhhhhcCCCChhhhhhhhhHHHHHH
Q 000473 1256 VSIMTIIRVVRGSPRNVAQHLDKVVNFILQTMDPGNSVMRKTCLHTSMAAL 1306 (1471)
Q Consensus 1256 ~~~~~l~~~i~~~p~~~~~~l~~~~~~~~~~lDp~~~~~r~~~l~~~~~~l 1306 (1471)
-+++.|..+.-.-+..+.+|+++++..||+|++=.+...|-...+....+.
T Consensus 5 ggli~Laa~ai~l~~~~~~~l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~ 55 (97)
T PF12755_consen 5 GGLIGLAAVAIALGKDISKYLDEILPPVLKCFDDQDSRVRYYACEALYNIS 55 (97)
T ss_pred HHHHHHHHHHHHchHhHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHH
Confidence 355666555555566699999999999999999999988876554444433
No 436
>PRK13684 Ycf48-like protein; Provisional
Probab=26.23 E-value=7.3e+02 Score=29.88 Aligned_cols=118 Identities=8% Similarity=0.036 Sum_probs=64.2
Q ss_pred CccEEEEEEecCCCCcccCcCCCEEEEEECCCcEE-EEECCCCc-eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 570 TGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIR-IWDLGSGN-LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 570 ~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~-lWDl~tg~-~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
.+.+.++.+.|+ +.+++.| ..|.+. .+|- .++ -......-...++.+.+.|+ +..++ ++.
T Consensus 172 ~g~~~~i~~~~~---------g~~v~~g-~~G~i~~s~~~-gg~tW~~~~~~~~~~l~~i~~~~~------g~~~~-vg~ 233 (334)
T PRK13684 172 AGVVRNLRRSPD---------GKYVAVS-SRGNFYSTWEP-GQTAWTPHQRNSSRRLQSMGFQPD------GNLWM-LAR 233 (334)
T ss_pred cceEEEEEECCC---------CeEEEEe-CCceEEEEcCC-CCCeEEEeeCCCcccceeeeEcCC------CCEEE-Eec
Confidence 346778888875 5555555 445443 2332 122 11222233457889999987 66554 456
Q ss_pred CCcEEEEECCCCcEEEEecCC----CCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEE
Q 000473 648 DFSVALASLETLRVERMFPGH----PNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVL 715 (1471)
Q Consensus 648 DgsV~lWdl~t~~~l~~~~gh----~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l 715 (1471)
.|.+++=+-+.|..-.....+ ...+..+.+.|++..+++| .+ |.|+ .....|+..+..
T Consensus 234 ~G~~~~~s~d~G~sW~~~~~~~~~~~~~l~~v~~~~~~~~~~~G-~~--------G~v~-~S~d~G~tW~~~ 295 (334)
T PRK13684 234 GGQIRFNDPDDLESWSKPIIPEITNGYGYLDLAYRTPGEIWAGG-GN--------GTLL-VSKDGGKTWEKD 295 (334)
T ss_pred CCEEEEccCCCCCccccccCCccccccceeeEEEcCCCCEEEEc-CC--------CeEE-EeCCCCCCCeEC
Confidence 777654334444433222211 2347889999988766554 44 6555 344555554443
No 437
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=25.76 E-value=5.2e+02 Score=31.52 Aligned_cols=58 Identities=10% Similarity=0.102 Sum_probs=37.0
Q ss_pred CCEEEEEECCCCCCCCCCCEEEEEeCC-------------------CcEEEEECCCCcEEEEecCCCCCcEEEEEcCCCC
Q 000473 622 APVRQIILSPPQTEHPWSDCFLSVGED-------------------FSVALASLETLRVERMFPGHPNYPAKVVWDCPRG 682 (1471)
Q Consensus 622 ~~V~~l~fspd~~~~~~~~~l~S~s~D-------------------gsV~lWdl~t~~~l~~~~gh~~~V~~v~~spdg~ 682 (1471)
.....+.|.|+ |.+.++.+.. +.|.-++..+++......++ .....++|+|+|+
T Consensus 124 ~~~~~l~~gpD------G~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e~~a~G~-rnp~Gl~~d~~G~ 196 (367)
T TIGR02604 124 HSLNSLAWGPD------GWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLRVVAHGF-QNPYGHSVDSWGD 196 (367)
T ss_pred ccccCceECCC------CCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEEEEecCc-CCCccceECCCCC
Confidence 44778999998 7766665521 34555566655543333343 3367899999988
Q ss_pred EEEE
Q 000473 683 YIAC 686 (1471)
Q Consensus 683 ~L~s 686 (1471)
++++
T Consensus 197 l~~t 200 (367)
T TIGR02604 197 VFFC 200 (367)
T ss_pred EEEE
Confidence 8765
No 438
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance.
Probab=25.73 E-value=3.8e+02 Score=35.64 Aligned_cols=120 Identities=8% Similarity=0.030 Sum_probs=71.1
Q ss_pred EEEEEEcCCcEEEEEecccccC-CCC---CCc-cccCCcceEEEEecCCccEEEEEEe--cCCCCcccCcCCCEEEEEEC
Q 000473 527 AIVYGFFSGEIEVIQFDLFERH-NSP---GAS-LKVNSHVSRQYFLGHTGAVLCLAAH--RMVGTAKGWSFNEVLVSGSM 599 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~-d~~---~~~-~d~~s~~~~~~l~gH~~~V~~la~s--pd~~~~~~~~~~~~L~SGs~ 599 (1471)
.|+.++.||.+.+|.-..+... ... ... .....-++...+. -...++.|+++ .. .++||.++.
T Consensus 116 VLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~-v~~SaWGLdIh~~~~---------~rlIAVSsN 185 (717)
T PF08728_consen 116 VLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLR-VGASAWGLDIHDYKK---------SRLIAVSSN 185 (717)
T ss_pred EEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEee-cCCceeEEEEEecCc---------ceEEEEecC
Confidence 6888999999999654322211 000 000 0000112222332 23479999998 43 678888888
Q ss_pred CCcEEEEECCCC--ceEE-EEeccCCCEEEEEECCCCCCCCCCC-EEEEEeCCCcEEEEECC
Q 000473 600 DCSIRIWDLGSG--NLIT-VMHHHVAPVRQIILSPPQTEHPWSD-CFLSVGEDFSVALASLE 657 (1471)
Q Consensus 600 DgtI~lWDl~tg--~~l~-~~~~H~~~V~~l~fspd~~~~~~~~-~l~S~s~DgsV~lWdl~ 657 (1471)
-..|.||-+... +..+ .-..|..-|.+|.|-++. ..+.|. .+++++-.|.+.+|++.
T Consensus 186 s~~VTVFaf~l~~~r~~~~~s~~~~hNIP~VSFl~~~-~d~~G~v~v~a~dI~G~v~~~~I~ 246 (717)
T PF08728_consen 186 SQEVTVFAFALVDERFYHVPSHQHSHNIPNVSFLDDD-LDPNGHVKVVATDISGEVWTFKIK 246 (717)
T ss_pred CceEEEEEEeccccccccccccccccCCCeeEeecCC-CCCccceEEEEEeccCcEEEEEEE
Confidence 888888766532 1111 111255568899998873 222243 78889999999999874
No 439
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.94 E-value=1.3e+02 Score=41.17 Aligned_cols=105 Identities=11% Similarity=0.059 Sum_probs=73.4
Q ss_pred CCccEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEECCCCce-EEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeC
Q 000473 569 HTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWDLGSGNL-ITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGE 647 (1471)
Q Consensus 569 H~~~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWDl~tg~~-l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~ 647 (1471)
....+.|+.|.|.. ....+....|+.|++..+.--.. ...+ .-+..+++++|+|. |..++.|-.
T Consensus 154 k~vf~~~~~wnP~v--------p~n~av~l~dlsl~V~~~~~~~~~v~s~-p~t~~~Tav~WSpr------GKQl~iG~n 218 (1405)
T KOG3630|consen 154 KPVFQLKNVWNPLV--------PLNSAVDLSDLSLRVKSTKQLAQNVTSF-PVTNSQTAVLWSPR------GKQLFIGRN 218 (1405)
T ss_pred cccccccccccCCc--------cchhhhhccccchhhhhhhhhhhhhccc-CcccceeeEEeccc------cceeeEecC
Confidence 34456788898862 45667778899998887653221 1222 23456899999999 999999999
Q ss_pred CCcEEEEECCCCcEEEEecCC----CCCcEEEEEcCCCCEEEEEEc
Q 000473 648 DFSVALASLETLRVERMFPGH----PNYPAKVVWDCPRGYIACLCR 689 (1471)
Q Consensus 648 DgsV~lWdl~t~~~l~~~~gh----~~~V~~v~~spdg~~L~sgs~ 689 (1471)
.|++.-|-.. ++....+++. ..+|.+|+|-....|+++-+.
T Consensus 219 nGt~vQy~P~-leik~~ip~Pp~~e~yrvl~v~Wl~t~eflvvy~n 263 (1405)
T KOG3630|consen 219 NGTEVQYEPS-LEIKSEIPEPPVEENYRVLSVTWLSTQEFLVVYGN 263 (1405)
T ss_pred CCeEEEeecc-cceeecccCCCcCCCcceeEEEEecceeEEEEecc
Confidence 9999887643 4444444433 357899999988888877543
No 440
>PF14500 MMS19_N: Dos2-interacting transcription regulator of RNA-Pol-II
Probab=24.89 E-value=4.5e+02 Score=30.66 Aligned_cols=145 Identities=17% Similarity=0.161 Sum_probs=73.6
Q ss_pred HHHHHHhcCcchhHHHHHHHHHhhHhhccccccc-chhhhhhhhhhhhhhccccccccCCCCCCchhhhHHHHHHHHhHh
Q 000473 1149 LIKLVMATNEKYSSTAAELLAEGMESTWKTCIGF-EIPRLIGDIFFQIECVSNSSANLAGQHPAVPASIRETLVGILLPS 1227 (1471)
Q Consensus 1149 l~~ll~~~~~~~~~~ai~l~~~gf~~~w~~~~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~l~~ 1227 (1471)
|-.+|.......|..|+.+|++-.+.+=...+.. ||.-++. |...... .......+ -.+|..
T Consensus 4 Lg~~Ltsed~~~R~ka~~~Ls~vL~~lp~~~L~~~ev~~L~~---F~~~rl~------------D~~~~~~~--l~gl~~ 66 (262)
T PF14500_consen 4 LGEYLTSEDPIIRAKALELLSEVLERLPPDFLSRQEVQVLLD---FFCSRLD------------DHACVQPA--LKGLLA 66 (262)
T ss_pred hhhhhCCCCHHHHHHHHHHHHHHHHhCCHhhccHHHHHHHHH---HHHHHhc------------cHhhHHHH--HHHHHH
Confidence 4456667777889999999998877332334555 5533332 1122121 00111111 111222
Q ss_pred HH-hcC-hhHHHHHHHHHHh-hcCCC--CccchhhHHHHH-HHHhCChhHHHHhHHHHHHHhhhhcCCC-Chhhhhhhhh
Q 000473 1228 LA-MAD-ILGFLTVVESQIW-STASD--SPVHLVSIMTII-RVVRGSPRNVAQHLDKVVNFILQTMDPG-NSVMRKTCLH 1300 (1471)
Q Consensus 1228 ia-~~~-~~~f~~~~~~~i~-~~~~~--~~~~~~~~~~l~-~~i~~~p~~~~~~l~~~~~~~~~~lDp~-~~~~r~~~l~ 1300 (1471)
+. +.+ ++.-+..+.+.+. +...+ .+..+.....|+ .+++++...+..+=...+..+++.+|-- +| +||.
T Consensus 67 L~~~~~~~~~~~~~i~~~l~~~~~~q~~~q~~R~~~~~ll~~l~~~~~~~l~~~~~~fv~~~i~~~~gEkDP----RnLl 142 (262)
T PF14500_consen 67 LVKMKNFSPESAVKILRSLFQNVDVQSLPQSTRYAVYQLLDSLLENHREALQSMGDDFVYGFIQLIDGEKDP----RNLL 142 (262)
T ss_pred HHhCcCCChhhHHHHHHHHHHhCChhhhhHHHHHHHHHHHHHHHHHhHHHHHhchhHHHHHHHHHhccCCCH----HHHH
Confidence 22 221 1222333333332 11212 222233344443 4777777777666677888888888743 44 3777
Q ss_pred HHHHHHHHHHcccC
Q 000473 1301 TSMAALKEIVHVFP 1314 (1471)
Q Consensus 1301 ~~~~~l~~~~~~~p 1314 (1471)
.+|.+++.+.+.|+
T Consensus 143 ~~F~l~~~i~~~~~ 156 (262)
T PF14500_consen 143 LSFKLLKVILQEFD 156 (262)
T ss_pred HHHHHHHHHHHhcc
Confidence 77777776666666
No 441
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=24.82 E-value=5.6e+02 Score=27.74 Aligned_cols=99 Identities=24% Similarity=0.236 Sum_probs=61.5
Q ss_pred hHhhhhhhhhhcCCCCChhhHHHHHHHHHHHHHhcCcchhHHHHHHHHH----hhHhhcccccccchhhhhhhhhhhhhh
Q 000473 1122 HIIVAAALAIWYPSLVKPTLAMLVVQPLIKLVMATNEKYSSTAAELLAE----GMESTWKTCIGFEIPRLIGDIFFQIEC 1197 (1471)
Q Consensus 1122 ~~~~~~~~~~~~~~~~~~~l~~~~~~~l~~ll~~~~~~~~~~ai~l~~~----gf~~~w~~~~~~~~~~~l~~~~~~~~~ 1197 (1471)
.++..+-|++.||..+++- ...|...|.+.+...|++|+-.+.+ ||. -|+..+- .++ +..+
T Consensus 8 ~i~~l~DL~~r~~~~ve~~-----~~~l~~~L~D~~~~VR~~al~~Ls~Li~~d~i-k~k~~l~---~~~----l~~l-- 72 (178)
T PF12717_consen 8 AIIALGDLCIRYPNLVEPY-----LPNLYKCLRDEDPLVRKTALLVLSHLILEDMI-KVKGQLF---SRI----LKLL-- 72 (178)
T ss_pred HHHHHHHHHHhCcHHHHhH-----HHHHHHHHCCCCHHHHHHHHHHHHHHHHcCce-eehhhhh---HHH----HHHH--
Confidence 4555566989887666644 4445567888999999999977765 443 3333320 111 1111
Q ss_pred ccccccccCCCCCCchhhhHHHHHHHHhHhHHhc-ChhHHHHHHHHHHhhcC
Q 000473 1198 VSNSSANLAGQHPAVPASIRETLVGILLPSLAMA-DILGFLTVVESQIWSTA 1248 (1471)
Q Consensus 1198 ~~~~~~~~~~~~~~~~~~~r~~~~~~~l~~ia~~-~~~~f~~~~~~~i~~~~ 1248 (1471)
..+.+ ..|. .|+.++..++.. +|..|...+-.-|.+.+
T Consensus 73 -----------~D~~~-~Ir~-~A~~~~~e~~~~~~~~~i~~~~~e~i~~l~ 111 (178)
T PF12717_consen 73 -----------VDENP-EIRS-LARSFFSELLKKRNPNIIYNNFPELISSLN 111 (178)
T ss_pred -----------cCCCH-HHHH-HHHHHHHHHHHhccchHHHHHHHHHHHHHh
Confidence 11111 2343 478889999998 88888888777776554
No 442
>PHA03098 kelch-like protein; Provisional
Probab=24.41 E-value=5.9e+02 Score=32.62 Aligned_cols=155 Identities=10% Similarity=-0.017 Sum_probs=0.0
Q ss_pred EEEEEEcCCcEEEEEecccccCCCCCCccccCCcceEEEEecCCccEEEEEEecCCCCcccCcCCCEEEEEECC------
Q 000473 527 AIVYGFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYFLGHTGAVLCLAAHRMVGTAKGWSFNEVLVSGSMD------ 600 (1471)
Q Consensus 527 ~lv~Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l~gH~~~V~~la~spd~~~~~~~~~~~~L~SGs~D------ 600 (1471)
..+.|+.++....-... .++..+.+-...-.-....-...+..-+ +.+.+.||.+
T Consensus 345 lyv~GG~~~~~~~~~v~----------~yd~~~~~W~~~~~lp~~r~~~~~~~~~---------~~iYv~GG~~~~~~~~ 405 (534)
T PHA03098 345 IYVIGGIYNSISLNTVE----------SWKPGESKWREEPPLIFPRYNPCVVNVN---------NLIYVIGGISKNDELL 405 (534)
T ss_pred EEEEeCCCCCEecceEE----------EEcCCCCceeeCCCcCcCCccceEEEEC---------CEEEEECCcCCCCccc
Q ss_pred CcEEEEECCCCceEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCc--------EEEEECCCCcEEEEecCCCCCc
Q 000473 601 CSIRIWDLGSGNLITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFS--------VALASLETLRVERMFPGHPNYP 672 (1471)
Q Consensus 601 gtI~lWDl~tg~~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~Dgs--------V~lWdl~t~~~l~~~~gh~~~V 672 (1471)
..+..||..+.+....-..-...-...+...+ +..++.||.+.. +..||..+.+-...-.......
T Consensus 406 ~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~------~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~ 479 (534)
T PHA03098 406 KTVECFSLNTNKWSKGSPLPISHYGGCAIYHD------GKIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRI 479 (534)
T ss_pred ceEEEEeCCCCeeeecCCCCccccCceEEEEC------CEEEEECCccCCCCCcccceEEEecCCCCceeeCCCCCcccc
Q ss_pred EEEEEcCCCCEEEEEEcCCCCCCCC-CCEEEEEECCCCe
Q 000473 673 AKVVWDCPRGYIACLCRDHSRTSDA-VDVLFIWDVKTGA 710 (1471)
Q Consensus 673 ~~v~~spdg~~L~sgs~D~sg~~D~-~gtV~VWDi~tg~ 710 (1471)
..-...-+++.++.|+.+ ... ...|.+||.++.+
T Consensus 480 ~~~~~~~~~~iyv~GG~~----~~~~~~~v~~yd~~~~~ 514 (534)
T PHA03098 480 NASLCIFNNKIYVVGGDK----YEYYINEIEVYDDKTNT 514 (534)
T ss_pred cceEEEECCEEEEEcCCc----CCcccceeEEEeCCCCE
No 443
>PF10193 Telomere_reg-2: Telomere length regulation protein; InterPro: IPR019337 This entry represents a conserved domain found in a group of proteins called telomere-length regulation, or clock abnormal protein-2, which are conserved from plants to humans. These proteins regulate telomere length and contribute to silencing of sub-telomeric regions. In vitro the protein binds to telomeric DNA repeats.; PDB: 3O4Z_B.
Probab=24.41 E-value=2.3e+02 Score=28.60 Aligned_cols=69 Identities=12% Similarity=0.193 Sum_probs=40.6
Q ss_pred hhHHHHHHHHHHhhcCCCCccchhhHHHHHHHHhCChh---HHHHhHHHHHHHhhhhcC----CCChhhhhhhhhH
Q 000473 1233 ILGFLTVVESQIWSTASDSPVHLVSIMTIIRVVRGSPR---NVAQHLDKVVNFILQTMD----PGNSVMRKTCLHT 1301 (1471)
Q Consensus 1233 ~~~f~~~~~~~i~~~~~~~~~~~~~~~~l~~~i~~~p~---~~~~~l~~~~~~~~~~lD----p~~~~~r~~~l~~ 1301 (1471)
||.||..+.........+......++..+..+||+.|. ++..+-.+++..+++.=| |+.-..|.++|-.
T Consensus 1 ~PvYlrDll~~L~~~~~~~e~~e~aL~~a~~LIR~k~~fg~el~~~a~eL~~~Ll~L~~~f~~~~Fe~~R~~alva 76 (114)
T PF10193_consen 1 RPVYLRDLLEYLRSDDEDYEKFEAALKSAEKLIRRKPDFGTELSEYAEELLKALLHLQNKFDIENFEELRQNALVA 76 (114)
T ss_dssp ----HHHHHHHHT------S-SHHHHHHHHHHHHS-----SSHHHHHHHHHHHHHH---TT--TTTTHHHHHHHHH
T ss_pred CCchHHHHHHHHhcCcCCHHHHHHHHHHHHHHHhcCCCCcchHHHHHHHHHHHHhhccccCCccCHHHHHHHHHHH
Confidence 57788887766542221222235678888889999999 999999999999998755 5555677777753
No 444
>PHA02872 EFc gene family protein; Provisional
Probab=24.30 E-value=2.4e+02 Score=28.10 Aligned_cols=96 Identities=18% Similarity=0.196 Sum_probs=55.6
Q ss_pred HHhhhhcCCCChhhhhhhhhHHHHHHHHHHcccCeeecCCCCceEEeecCcCcccCceEEEEECccccEEEEEecCCC--
Q 000473 1281 NFILQTMDPGNSVMRKTCLHTSMAALKEIVHVFPMVSLNDTSTKLAVGDAIGDIKKASIRVYDMQSVTKIKVLDASGP-- 1358 (1471)
Q Consensus 1281 ~~~~~~lDp~~~~~r~~~l~~~~~~l~~~~~~~p~v~~~~~tqrlavg~~~g~~~~~~i~~ydl~~~~~~~~~~~~~~-- 1358 (1471)
+|+--..||.|... ...-.-.+. ++++.++.||.+++-.|+... |+ |++.|||-||
T Consensus 17 ~ila~a~~plN~~n-~hTgdg~f~-----~~~irNi~f~~~~ylca~~gd---------------tv-kIYflEGkG~LI 74 (124)
T PHA02872 17 SILASASDPLNSEN-DHTGDGIFE-----AITIRNIDFCRPRYLCADAGD---------------TV-KIYFLEGKGGLI 74 (124)
T ss_pred hHHHHhcCcccccc-CccCCceEE-----EEEEeccccccceEeeecCCC---------------eE-EEEEEecCCcEE
Confidence 34445577877651 122222222 458999999999998887433 33 5555555442
Q ss_pred -----CCCCCCC-CcccccceEEEEECCCCCeEEEEecCC--cEEEEE
Q 000473 1359 -----PGLPRES-DSVATTVISALIFSPDGEGLVAFSEHG--LMIRWW 1398 (1471)
Q Consensus 1359 -----~~~~~~~-~~~~~~~i~a~~fs~dg~~l~~~s~~~--~~~~~w 1398 (1471)
-|-|.+- +++--..=.++-|--|=..|+|.+... |++-+|
T Consensus 75 fSv~dv~sp~~eedSgyv~eG~~Vef~t~f~C~iTlacts~~NtvVy~ 122 (124)
T PHA02872 75 FSVSDVGSPDNEEDSGYVNEGECVEFETDFACFITLACTSPINTVVYW 122 (124)
T ss_pred EEEEecCCCCccccccceecccEEEEecCceEEEEEEecCCcceEEEE
Confidence 3455421 133333335788887877788877654 445555
No 445
>KOG2109 consensus WD40 repeat protein [General function prediction only]
Probab=24.23 E-value=76 Score=40.72 Aligned_cols=53 Identities=26% Similarity=0.323 Sum_probs=44.5
Q ss_pred cCceEEEEECccccEEEEEecCCCCCCCCCCCcccccceEEEEECCCCCeEEEEecCCcEEEEEec
Q 000473 1335 KKASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTVISALIFSPDGEGLVAFSEHGLMIRWWSL 1400 (1471)
Q Consensus 1335 ~~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~a~~fs~dg~~l~~~s~~~~~~~~w~~ 1400 (1471)
.+..+++=|+-+-.-+.=|- +|+.+|+||+|.+.|-.||+-+.++..|-++.+
T Consensus 293 g~~~vivkdf~S~a~i~Qfk-------------AhkspiSaLcfdqsgsllViasi~g~nVnvfRi 345 (788)
T KOG2109|consen 293 GNNLVIVKDFDSFADIRQFK-------------AHKSPISALCFDQSGSLLVIASITGRNVNVFRI 345 (788)
T ss_pred ccceEEeecccchhhhhhee-------------eecCcccccccccCceEEEEEeeccceeeeEEe
Confidence 35568888887776666677 569999999999999999999999999888765
No 446
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with the InterPro entry IPR019606, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=23.12 E-value=1.2e+03 Score=26.85 Aligned_cols=104 Identities=13% Similarity=0.010 Sum_probs=63.1
Q ss_pred cEEEEEEecCCCCcccCcCCCEEEEEECCCcEEEEE-CCCCceEE-EEec--cCCCEEEEEECCCCCCCCCCCEEEEEe-
Q 000473 572 AVLCLAAHRMVGTAKGWSFNEVLVSGSMDCSIRIWD-LGSGNLIT-VMHH--HVAPVRQIILSPPQTEHPWSDCFLSVG- 646 (1471)
Q Consensus 572 ~V~~la~spd~~~~~~~~~~~~L~SGs~DgtI~lWD-l~tg~~l~-~~~~--H~~~V~~l~fspd~~~~~~~~~l~S~s- 646 (1471)
.++.-.|.++ +...+....+...+++. ..+++... .... -.+.|..+.++|| |..++-..
T Consensus 67 ~l~~PS~d~~---------g~~W~v~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~I~~l~vSpD------G~RvA~v~~ 131 (253)
T PF10647_consen 67 SLTRPSWDPD---------GWVWTVDDGSGGVRVVRDSASGTGEPVEVDWPGLRGRITALRVSPD------GTRVAVVVE 131 (253)
T ss_pred ccccccccCC---------CCEEEEEcCCCceEEEEecCCCcceeEEecccccCCceEEEEECCC------CcEEEEEEe
Confidence 5666677775 56556555666666663 33343221 1111 1128999999999 88776665
Q ss_pred --CCCcEEEEECC---CC------cEEEEecCCCCCcEEEEEcCCCCEEEEEEcC
Q 000473 647 --EDFSVALASLE---TL------RVERMFPGHPNYPAKVVWDCPRGYIACLCRD 690 (1471)
Q Consensus 647 --~DgsV~lWdl~---t~------~~l~~~~gh~~~V~~v~~spdg~~L~sgs~D 690 (1471)
.++.|.+=-+. .+ ..+.........+..+.|.+++.+++.+...
T Consensus 132 ~~~~~~v~va~V~r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~~~~~L~V~~~~~ 186 (253)
T PF10647_consen 132 DGGGGRVYVAGVVRDGDGVPRRLTGPRRVAPPLLSDVTDVAWSDDSTLVVLGRSA 186 (253)
T ss_pred cCCCCeEEEEEEEeCCCCCcceeccceEecccccCcceeeeecCCCEEEEEeCCC
Confidence 35667666542 23 1223333345678999999999877766554
No 447
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=22.90 E-value=9e+02 Score=25.41 Aligned_cols=112 Identities=12% Similarity=0.041 Sum_probs=60.7
Q ss_pred EEEEEECCCcEEEEECCCCc--------eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEE
Q 000473 593 VLVSGSMDCSIRIWDLGSGN--------LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERM 664 (1471)
Q Consensus 593 ~L~SGs~DgtI~lWDl~tg~--------~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~ 664 (1471)
-|+.++.-+.|.+.+..... .+..+ .-...|++|+--|-.. ....++|+-| ...++..||+....-+..
T Consensus 12 cL~~aT~~gKV~IH~ph~~~~~~~~~~~~i~~L-Nin~~italaaG~l~~-~~~~D~LliG-t~t~llaYDV~~N~d~Fy 88 (136)
T PF14781_consen 12 CLACATTGGKVFIHNPHERGQRTGRQDSDISFL-NINQEITALAAGRLKP-DDGRDCLLIG-TQTSLLAYDVENNSDLFY 88 (136)
T ss_pred eEEEEecCCEEEEECCCccccccccccCceeEE-ECCCceEEEEEEecCC-CCCcCEEEEe-ccceEEEEEcccCchhhh
Confidence 45666666677777654322 22222 2344566665433100 0113355544 566888999987654422
Q ss_pred ecCCCCCcEEEEEc---C-CCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEeC
Q 000473 665 FPGHPNYPAKVVWD---C-PRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLRG 717 (1471)
Q Consensus 665 ~~gh~~~V~~v~~s---p-dg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~g 717 (1471)
-. -.+.|.++.+- . +...+++|+. .+|.-+|.+-.+...+.+|
T Consensus 89 ke-~~DGvn~i~~g~~~~~~~~l~ivGGn---------csi~Gfd~~G~e~fWtVtg 135 (136)
T PF14781_consen 89 KE-VPDGVNAIVIGKLGDIPSPLVIVGGN---------CSIQGFDYEGNEIFWTVTG 135 (136)
T ss_pred hh-CccceeEEEEEecCCCCCcEEEECce---------EEEEEeCCCCcEEEEEecc
Confidence 22 23557777662 2 3456666554 6788888766665555543
No 448
>COG4590 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=22.57 E-value=4.2e+02 Score=32.90 Aligned_cols=33 Identities=15% Similarity=0.129 Sum_probs=26.2
Q ss_pred CCCceEEEEEEcCCCCeEEEEeCCCcEEEEEccC
Q 000473 14 PPSHRVTATSALTQPPTLYTGGSDGSILWWSFSD 47 (1471)
Q Consensus 14 ~p~h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~ 47 (1471)
.|...|..+-..|||++|+.-+. .++.+.++..
T Consensus 218 ~~~~~v~qllL~Pdg~~LYv~~g-~~~~v~~L~~ 250 (733)
T COG4590 218 VPFSDVSQLLLTPDGKTLYVRTG-SELVVALLDK 250 (733)
T ss_pred CCccchHhhEECCCCCEEEEecC-CeEEEEeecc
Confidence 34567889999999999987654 6788999873
No 449
>KOG1062 consensus Vesicle coat complex AP-1, gamma subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=21.82 E-value=8.7e+02 Score=32.53 Aligned_cols=162 Identities=19% Similarity=0.173 Sum_probs=84.3
Q ss_pred CCChhhHHHHHHHHHHHHHhcCc-c---hhHHHHHHHHHhhHhhcccccccchhhhhhhhhhhhhhccccccccCCCCCC
Q 000473 1136 LVKPTLAMLVVQPLIKLVMATNE-K---YSSTAAELLAEGMESTWKTCIGFEIPRLIGDIFFQIECVSNSSANLAGQHPA 1211 (1471)
Q Consensus 1136 ~~~~~l~~~~~~~l~~ll~~~~~-~---~~~~ai~l~~~gf~~~w~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 1211 (1471)
.-.|+|..+.-.+-.++|.+.|. - --....|+|.++-. .-..|- ++..-+..+++++....-++.-.+.+.+.
T Consensus 171 rK~P~l~e~f~~~~~~lL~ek~hGVL~~~l~l~~e~c~~~~~-~l~~fr--~l~~~lV~iLk~l~~~~yspeydv~gi~d 247 (866)
T KOG1062|consen 171 RKVPDLVEHFVIAFRKLLCEKHHGVLIAGLHLITELCKISPD-ALSYFR--DLVPSLVKILKQLTNSGYSPEYDVHGISD 247 (866)
T ss_pred HcCchHHHHhhHHHHHHHhhcCCceeeeHHHHHHHHHhcCHH-HHHHHH--HHHHHHHHHHHHHhcCCCCCccCccCCCc
Confidence 35677777777777777777443 2 22346677776321 111111 11111212233433222233334555555
Q ss_pred chhhhHHHHHHHHhHhHHhcCh--hHHHHHHHHHHhhc-----------------------------------------C
Q 000473 1212 VPASIRETLVGILLPSLAMADI--LGFLTVVESQIWST-----------------------------------------A 1248 (1471)
Q Consensus 1212 ~~~~~r~~~~~~~l~~ia~~~~--~~f~~~~~~~i~~~-----------------------------------------~ 1248 (1471)
..-.+|- -++|-.++.-|+ ..-|..|--+|++. +
T Consensus 248 PFLQi~i---LrlLriLGq~d~daSd~M~DiLaqvatntdsskN~GnAILYE~V~TI~~I~~~~~LrvlainiLgkFL~n 324 (866)
T KOG1062|consen 248 PFLQIRI---LRLLRILGQNDADASDLMNDILAQVATNTDSSKNAGNAILYECVRTIMDIRSNSGLRVLAINILGKFLLN 324 (866)
T ss_pred hHHHHHH---HHHHHHhcCCCccHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHHhccCCchHHHHHHHHHHHHhcC
Confidence 6666663 233444454433 33444444455432 1
Q ss_pred CCCccchhhHHHHHHHHhCChhHHHHhHHHHHHHhhhhcCCCChhhhhhhhhHHHHHHH
Q 000473 1249 SDSPVHLVSIMTIIRVVRGSPRNVAQHLDKVVNFILQTMDPGNSVMRKTCLHTSMAALK 1307 (1471)
Q Consensus 1249 ~~~~~~~~~~~~l~~~i~~~p~~~~~~l~~~~~~~~~~lDp~~~~~r~~~l~~~~~~l~ 1307 (1471)
.+.+..-++|..|.|+|.-.|..+..|=. .|+.||+-.+..+|++-|...+.+++
T Consensus 325 ~d~NirYvaLn~L~r~V~~d~~avqrHr~----tIleCL~DpD~SIkrralELs~~lvn 379 (866)
T KOG1062|consen 325 RDNNIRYVALNMLLRVVQQDPTAVQRHRS----TILECLKDPDVSIKRRALELSYALVN 379 (866)
T ss_pred CccceeeeehhhHHhhhcCCcHHHHHHHH----HHHHHhcCCcHHHHHHHHHHHHHHhc
Confidence 12222234455666777777777766644 56788887778888888887776653
No 450
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=21.53 E-value=1.4e+02 Score=38.71 Aligned_cols=31 Identities=23% Similarity=0.294 Sum_probs=28.0
Q ss_pred ceEEEEEEcCCCCeEEEEeCCCcEEEEEccC
Q 000473 17 HRVTATSALTQPPTLYTGGSDGSILWWSFSD 47 (1471)
Q Consensus 17 h~Vtava~SpDg~~LaTGs~DG~I~lWdl~~ 47 (1471)
.-++++.-+|.|+.++.|..||+|+++|..+
T Consensus 15 e~~~aiqshp~~~s~v~~~~d~si~lfn~~~ 45 (1636)
T KOG3616|consen 15 EFTTAIQSHPGGQSFVLAHQDGSIILFNFIP 45 (1636)
T ss_pred ceeeeeeecCCCceEEEEecCCcEEEEeecc
Confidence 3478899999999999999999999999984
No 451
>KOG1059 consensus Vesicle coat complex AP-3, delta subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=21.36 E-value=4.2e+02 Score=34.88 Aligned_cols=38 Identities=21% Similarity=0.214 Sum_probs=30.8
Q ss_pred CCChhhHHHHHHHHHHHHHhcCcchhHHHHHHHHHhhH
Q 000473 1136 LVKPTLAMLVVQPLIKLVMATNEKYSSTAAELLAEGME 1173 (1471)
Q Consensus 1136 ~~~~~l~~~~~~~l~~ll~~~~~~~~~~ai~l~~~gf~ 1173 (1471)
.+-|+|||.+|.-+..||-....=.|--||-++=|=|.
T Consensus 136 fvTpdLARDLa~Dv~tLL~sskpYvRKkAIl~lykvFL 173 (877)
T KOG1059|consen 136 IVTPDLARDLADDVFTLLNSSKPYVRKKAILLLYKVFL 173 (877)
T ss_pred ccCchhhHHHHHHHHHHHhcCchHHHHHHHHHHHHHHH
Confidence 36899999999999998866666677789999876663
No 452
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content, of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variability. This polymorphism may be useful for genotyping individual bees.; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=20.84 E-value=1.5e+02 Score=34.98 Aligned_cols=39 Identities=31% Similarity=0.464 Sum_probs=0.0
Q ss_pred CceEEEEECccccEEEEEecCCCCCCCCCCCcccccc-----------------eEEEEECC---CCCeE
Q 000473 1336 KASIRVYDMQSVTKIKVLDASGPPGLPRESDSVATTV-----------------ISALIFSP---DGEGL 1385 (1471)
Q Consensus 1336 ~~~i~~ydl~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------i~a~~fs~---dg~~l 1385 (1471)
.+-|+||||++++-|++++.| .+..+ |..++.+| ||+.|
T Consensus 87 ~~glIV~dl~~~~s~Rv~~~~-----------~~~~p~~~~~~i~g~~~~~~dg~~gial~~~~~d~r~L 145 (287)
T PF03022_consen 87 GPGLIVYDLATGKSWRVLHNS-----------FSPDPDAGPFTIGGESFQWPDGIFGIALSPISPDGRWL 145 (287)
T ss_dssp TCEEEEEETTTTEEEEEETCG-----------CTTS-SSEEEEETTEEEEETTSEEEEEE-TTSTTS-EE
T ss_pred cCcEEEEEccCCcEEEEecCC-----------cceeccccceeccCceEecCCCccccccCCCCCCccEE
No 453
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the InterPro repeat IPR001258.; PDB: 3QQZ_A.
Probab=20.48 E-value=1.4e+03 Score=26.54 Aligned_cols=163 Identities=12% Similarity=0.062 Sum_probs=87.9
Q ss_pred ccEEEEEeeccccccCCEEEE-EEcCCcEEEEEecccccCCCCCCccccCCcceEEEE--ecCCccEEEEEEecCCCCcc
Q 000473 510 KIVSSSMVISESFYAPYAIVY-GFFSGEIEVIQFDLFERHNSPGASLKVNSHVSRQYF--LGHTGAVLCLAAHRMVGTAK 586 (1471)
Q Consensus 510 ~~Vts~~~is~~~f~P~~lv~-Gs~DG~I~V~~~~~l~~~d~~~~~~d~~s~~~~~~l--~gH~~~V~~la~spd~~~~~ 586 (1471)
..++.+.+.++.. .|++ ....+.|.. ++ . +|+.++.+ .| .+..-.+++.-+
T Consensus 22 ~e~SGLTy~pd~~----tLfaV~d~~~~i~e--ls-------------~-~G~vlr~i~l~g-~~D~EgI~y~g~----- 75 (248)
T PF06977_consen 22 DELSGLTYNPDTG----TLFAVQDEPGEIYE--LS-------------L-DGKVLRRIPLDG-FGDYEGITYLGN----- 75 (248)
T ss_dssp S-EEEEEEETTTT----EEEEEETTTTEEEE--EE-------------T-T--EEEEEE-SS--SSEEEEEE-ST-----
T ss_pred CCccccEEcCCCC----eEEEEECCCCEEEE--Ec-------------C-CCCEEEEEeCCC-CCCceeEEEECC-----
Confidence 4577777555433 3444 344444433 33 1 23444444 33 345778888654
Q ss_pred cCcCCCEEEEEECCCcEEEEECCCC--ce----EEEEe-----ccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEE
Q 000473 587 GWSFNEVLVSGSMDCSIRIWDLGSG--NL----ITVMH-----HHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALAS 655 (1471)
Q Consensus 587 ~~~~~~~L~SGs~DgtI~lWDl~tg--~~----l~~~~-----~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWd 655 (1471)
+.++++--.++.+.+.++... .. ...+. .+...+..++|.|. ++.|..+-+..-.+|+.
T Consensus 76 ----~~~vl~~Er~~~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~------~~~L~v~kE~~P~~l~~ 145 (248)
T PF06977_consen 76 ----GRYVLSEERDQRLYIFTIDDDTTSLDRADVQKISLGFPNKGNKGFEGLAYDPK------TNRLFVAKERKPKRLYE 145 (248)
T ss_dssp ----TEEEEEETTTTEEEEEEE----TT--EEEEEEEE---S---SS--EEEEEETT------TTEEEEEEESSSEEEEE
T ss_pred ----CEEEEEEcCCCcEEEEEEeccccccchhhceEEecccccCCCcceEEEEEcCC------CCEEEEEeCCCChhhEE
Confidence 667776667899999988422 11 11121 24456899999998 67777787777778887
Q ss_pred CCC---CcEEEE--e------cCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEEEe
Q 000473 656 LET---LRVERM--F------PGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERVLR 716 (1471)
Q Consensus 656 l~t---~~~l~~--~------~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~l~ 716 (1471)
++. ...+.. . ......+..+.++|..+.|..-+.. ...|.++| .+|+.+..+.
T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~e-------s~~l~~~d-~~G~~~~~~~ 209 (248)
T PF06977_consen 146 VNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDE-------SRLLLELD-RQGRVVSSLS 209 (248)
T ss_dssp EESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETT-------TTEEEEE--TT--EEEEEE
T ss_pred EccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECC-------CCeEEEEC-CCCCEEEEEE
Confidence 754 222111 1 1233467899999987666555544 28999999 6788776654
No 454
>COG4590 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=20.07 E-value=3.9e+02 Score=33.12 Aligned_cols=108 Identities=13% Similarity=0.102 Sum_probs=67.9
Q ss_pred CCEEEEEECCCcEEEE-ECCCCc-----eEEEEeccCCCEEEEEECCCCCCCCCCCEEEEEeCCCcEEEEECCCCcEEEE
Q 000473 591 NEVLVSGSMDCSIRIW-DLGSGN-----LITVMHHHVAPVRQIILSPPQTEHPWSDCFLSVGEDFSVALASLETLRVERM 664 (1471)
Q Consensus 591 ~~~L~SGs~DgtI~lW-Dl~tg~-----~l~~~~~H~~~V~~l~fspd~~~~~~~~~l~S~s~DgsV~lWdl~t~~~l~~ 664 (1471)
+.-|+.++.||.|.-| |.+.+. .++.|+-...+|..+. |+.. .+-|++-+..|++.++.-...+.+..
T Consensus 280 g~SLLv~~~dG~vsQWFdvr~~~~p~l~h~R~f~l~pa~~~~l~--pe~~----rkgF~~l~~~G~L~~f~st~~~~lL~ 353 (733)
T COG4590 280 GFSLLVVHEDGLVSQWFDVRRDGQPHLNHIRNFKLAPAEVQFLL--PETN----RKGFYSLYRNGTLQSFYSTSEKLLLF 353 (733)
T ss_pred ceeEEEEcCCCceeeeeeeecCCCCcceeeeccccCcccceeec--cccc----cceEEEEcCCCceeeeecccCcceeh
Confidence 5678888889988765 554322 2233333334444433 4311 45788888999998887655544322
Q ss_pred ecCCCCCcEEEEEcCCCCEEEEEEcCCCCCCCCCCEEEEEECCCCeEEEE
Q 000473 665 FPGHPNYPAKVVWDCPRGYIACLCRDHSRTSDAVDVLFIWDVKTGARERV 714 (1471)
Q Consensus 665 ~~gh~~~V~~v~~spdg~~L~sgs~D~sg~~D~~gtV~VWDi~tg~~~~~ 714 (1471)
..--..+.-+++||.+.+|++-.. |.++++.+++...+-+
T Consensus 354 -~~~~~~~~~~~~Sp~~~~Ll~e~~---------gki~~~~l~Nr~Peis 393 (733)
T COG4590 354 -ERAYQAPQLVAMSPNQAYLLSEDQ---------GKIRLAQLENRNPEIS 393 (733)
T ss_pred -hhhhcCcceeeeCcccchheeecC---------CceEEEEecCCCCCcc
Confidence 222235677999999999986433 7899999887655433
Done!