Query 001814
Match_columns 1010
No_of_seqs 368 out of 2250
Neff 5.2
Searched_HMMs 46136
Date Fri Mar 29 09:57:12 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001814.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001814hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2109 WD40 repeat protein [G 100.0 3.9E-60 8.5E-65 540.7 20.6 629 12-765 1-650 (788)
2 PF12490 BCAS3: Breast carcino 100.0 3.6E-53 7.9E-58 456.1 15.9 239 505-750 1-251 (251)
3 KOG2110 Uncharacterized conser 100.0 2E-46 4.3E-51 408.0 32.7 325 53-596 2-341 (391)
4 KOG2111 Uncharacterized conser 100.0 6.8E-36 1.5E-40 321.2 32.6 316 57-594 6-331 (346)
5 KOG0271 Notchless-like WD40 re 99.9 1.9E-22 4E-27 221.3 23.5 310 51-473 110-451 (480)
6 KOG0263 Transcription initiati 99.9 4.5E-22 9.7E-27 233.6 22.2 230 73-464 388-652 (707)
7 KOG0272 U4/U6 small nuclear ri 99.9 2.7E-21 5.9E-26 214.5 20.2 275 53-459 172-458 (459)
8 KOG0315 G-protein beta subunit 99.9 1.2E-20 2.6E-25 198.5 23.8 266 77-472 13-299 (311)
9 KOG0271 Notchless-like WD40 re 99.9 5.2E-21 1.1E-25 210.0 21.8 291 72-459 166-479 (480)
10 cd00200 WD40 WD40 domain, foun 99.9 7.3E-19 1.6E-23 179.1 33.8 276 52-459 5-289 (289)
11 KOG0273 Beta-transducin family 99.8 1.6E-18 3.5E-23 194.6 27.9 280 57-461 236-523 (524)
12 KOG0319 WD40-repeat-containing 99.8 8.6E-19 1.9E-23 204.6 20.9 113 342-474 427-548 (775)
13 KOG0318 WD40 repeat stress pro 99.8 1.6E-17 3.5E-22 188.4 30.3 341 52-491 186-583 (603)
14 KOG0272 U4/U6 small nuclear ri 99.8 8.7E-19 1.9E-23 194.8 19.7 225 173-475 197-432 (459)
15 KOG0291 WD40-repeat-containing 99.8 1E-17 2.3E-22 195.7 28.9 267 74-465 276-554 (893)
16 KOG0279 G protein beta subunit 99.8 1.2E-17 2.7E-22 178.4 26.5 246 52-465 11-266 (315)
17 KOG0295 WD40 repeat-containing 99.8 7.9E-18 1.7E-22 184.6 21.7 254 52-459 146-404 (406)
18 KOG0286 G-protein beta subunit 99.8 6.8E-17 1.5E-21 173.6 27.5 256 52-474 51-316 (343)
19 KOG0265 U5 snRNP-specific prot 99.8 2.6E-17 5.5E-22 177.3 23.3 259 52-474 43-309 (338)
20 KOG0295 WD40 repeat-containing 99.8 8.9E-18 1.9E-22 184.2 19.0 250 53-472 105-375 (406)
21 PLN00181 protein SPA1-RELATED; 99.8 2E-16 4.4E-21 195.8 32.9 227 75-460 545-792 (793)
22 KOG0266 WD40 repeat-containing 99.8 5.2E-16 1.1E-20 181.2 30.8 239 73-472 169-420 (456)
23 KOG0315 G-protein beta subunit 99.7 1.5E-16 3.3E-21 168.0 22.9 229 171-474 18-259 (311)
24 PLN00181 protein SPA1-RELATED; 99.7 1.8E-15 3.9E-20 187.4 34.5 247 53-462 480-739 (793)
25 KOG0279 G protein beta subunit 99.7 8.8E-16 1.9E-20 164.3 25.7 227 73-462 73-314 (315)
26 KOG0273 Beta-transducin family 99.7 5.6E-16 1.2E-20 174.4 23.0 215 172-465 256-486 (524)
27 KOG0292 Vesicle coat complex C 99.7 5.4E-16 1.2E-20 183.4 23.0 303 52-462 5-322 (1202)
28 KOG0281 Beta-TrCP (transducin 99.7 3.2E-17 6.9E-22 178.6 11.8 229 55-463 196-430 (499)
29 KOG0285 Pleiotropic regulator 99.7 6.5E-16 1.4E-20 169.3 21.0 238 53-462 148-390 (460)
30 cd00200 WD40 WD40 domain, foun 99.7 5.8E-15 1.3E-19 150.5 26.8 240 100-463 4-251 (289)
31 KOG0319 WD40-repeat-containing 99.7 6.1E-16 1.3E-20 181.1 21.9 259 55-474 322-590 (775)
32 KOG0291 WD40-repeat-containing 99.7 8.1E-15 1.7E-19 171.9 30.2 330 53-467 142-514 (893)
33 KOG0284 Polyadenylation factor 99.7 1.3E-16 2.7E-21 177.1 13.6 248 73-466 106-385 (464)
34 KOG0266 WD40 repeat-containing 99.7 4.3E-15 9.3E-20 173.5 27.1 243 53-461 200-454 (456)
35 KOG0278 Serine/threonine kinas 99.7 4.5E-16 9.6E-21 164.4 16.2 198 53-401 97-298 (334)
36 KOG0263 Transcription initiati 99.7 2.1E-15 4.5E-20 178.1 23.3 244 53-449 448-703 (707)
37 KOG0306 WD40-repeat-containing 99.7 1.2E-15 2.7E-20 178.5 20.1 227 73-462 422-665 (888)
38 KOG0310 Conserved WD40 repeat- 99.7 6.1E-15 1.3E-19 166.9 24.2 235 53-459 65-307 (487)
39 KOG0288 WD40 repeat protein Ti 99.7 1.2E-15 2.6E-20 169.7 18.0 235 57-459 220-459 (459)
40 KOG0274 Cdc4 and related F-box 99.7 5.9E-15 1.3E-19 175.1 25.0 240 50-465 243-486 (537)
41 TIGR03866 PQQ_ABC_repeats PQQ- 99.7 1.6E-13 3.5E-18 145.7 32.3 262 77-464 3-282 (300)
42 KOG0282 mRNA splicing factor [ 99.7 4.2E-16 9E-21 176.0 13.0 260 51-474 209-475 (503)
43 KOG0286 G-protein beta subunit 99.7 2.5E-14 5.4E-19 154.0 25.4 224 73-459 107-343 (343)
44 PTZ00421 coronin; Provisional 99.7 7.3E-14 1.6E-18 164.8 31.8 221 84-463 51-292 (493)
45 KOG0268 Sof1-like rRNA process 99.7 1.4E-15 3.1E-20 167.0 15.5 273 51-463 61-347 (433)
46 KOG0643 Translation initiation 99.6 4.4E-14 9.6E-19 150.7 26.0 258 53-462 7-318 (327)
47 KOG0285 Pleiotropic regulator 99.6 6.6E-15 1.4E-19 161.5 18.0 216 95-473 141-360 (460)
48 KOG0274 Cdc4 and related F-box 99.6 1.8E-14 4E-19 171.0 22.8 229 76-471 220-451 (537)
49 KOG0276 Vesicle coat complex C 99.6 1.9E-14 4.2E-19 166.0 22.1 223 74-457 66-295 (794)
50 KOG1407 WD40 repeat protein [F 99.6 3.2E-14 6.9E-19 151.4 21.5 244 52-464 16-264 (313)
51 KOG0310 Conserved WD40 repeat- 99.6 1.4E-14 3E-19 164.0 19.6 245 52-465 22-272 (487)
52 PTZ00421 coronin; Provisional 99.6 3E-13 6.5E-18 159.7 31.3 250 52-463 71-333 (493)
53 PTZ00420 coronin; Provisional 99.6 2.8E-13 6.1E-18 161.7 31.1 218 86-462 56-294 (568)
54 KOG0306 WD40-repeat-containing 99.6 7.8E-14 1.7E-18 163.7 25.4 330 73-473 75-550 (888)
55 KOG0281 Beta-TrCP (transducin 99.6 4.8E-15 1E-19 161.8 13.4 240 52-462 233-478 (499)
56 KOG1446 Histone H3 (Lys4) meth 99.6 6.8E-13 1.5E-17 144.4 29.9 243 54-463 12-264 (311)
57 KOG0305 Anaphase promoting com 99.6 7.5E-14 1.6E-18 162.1 24.1 240 56-463 217-463 (484)
58 KOG0645 WD40 repeat protein [G 99.6 8.5E-13 1.8E-17 141.3 29.6 260 52-461 10-311 (312)
59 KOG0318 WD40 repeat stress pro 99.6 3.7E-13 8E-18 153.5 26.4 107 342-466 163-270 (603)
60 KOG0276 Vesicle coat complex C 99.6 9.9E-14 2.2E-18 160.2 21.5 248 54-469 11-265 (794)
61 KOG0316 Conserved WD40 repeat- 99.6 7.9E-14 1.7E-18 146.7 18.9 243 98-465 10-261 (307)
62 PTZ00420 coronin; Provisional 99.6 1.3E-12 2.9E-17 155.9 31.2 130 52-232 70-210 (568)
63 KOG0265 U5 snRNP-specific prot 99.6 3.4E-13 7.4E-18 145.9 22.1 242 55-458 68-335 (338)
64 KOG0292 Vesicle coat complex C 99.6 8.7E-14 1.9E-18 165.1 18.2 207 173-478 31-255 (1202)
65 KOG1036 Mitotic spindle checkp 99.5 5E-13 1.1E-17 145.1 22.3 229 54-451 52-294 (323)
66 KOG2055 WD40 repeat protein [G 99.5 2.1E-13 4.6E-18 153.7 20.1 277 55-463 212-514 (514)
67 KOG0973 Histone transcription 99.5 8.1E-13 1.7E-17 160.7 26.2 252 53-462 66-356 (942)
68 KOG0647 mRNA export protein (c 99.5 4.9E-13 1.1E-17 144.7 21.4 255 30-450 45-312 (347)
69 KOG0288 WD40 repeat protein Ti 99.5 3.7E-14 8.1E-19 158.0 13.2 244 54-468 173-424 (459)
70 KOG0293 WD40 repeat-containing 99.5 2.1E-13 4.7E-18 151.8 19.0 277 52-463 220-515 (519)
71 KOG0296 Angio-associated migra 99.5 7.2E-12 1.6E-16 138.4 29.9 315 52-461 60-398 (399)
72 KOG0283 WD40 repeat-containing 99.5 4.1E-13 8.8E-18 160.0 21.1 197 151-464 367-579 (712)
73 KOG0645 WD40 repeat protein [G 99.5 3.8E-12 8.2E-17 136.4 25.7 205 97-461 6-225 (312)
74 KOG0308 Conserved WD40 repeat- 99.5 2.9E-13 6.4E-18 157.2 17.0 240 73-465 35-289 (735)
75 KOG0772 Uncharacterized conser 99.5 1.6E-12 3.5E-17 148.0 21.3 112 341-468 283-401 (641)
76 KOG1539 WD repeat protein [Gen 99.5 1.4E-11 3E-16 146.6 29.6 120 345-466 466-611 (910)
77 KOG0313 Microtubule binding pr 99.5 6.2E-12 1.3E-16 139.4 24.2 275 74-463 115-420 (423)
78 KOG1539 WD repeat protein [Gen 99.5 9E-12 1.9E-16 148.1 26.8 97 343-460 550-647 (910)
79 KOG0264 Nucleosome remodeling 99.5 6.5E-12 1.4E-16 141.7 22.8 111 342-461 288-404 (422)
80 KOG0278 Serine/threonine kinas 99.4 6.9E-13 1.5E-17 140.6 13.7 279 53-463 11-299 (334)
81 KOG2096 WD40 repeat protein [G 99.4 1.3E-11 2.9E-16 134.4 22.0 281 53-459 83-400 (420)
82 KOG1408 WD40 repeat protein [F 99.4 4.8E-12 1.1E-16 147.7 18.4 218 173-462 481-714 (1080)
83 KOG1407 WD40 repeat protein [F 99.4 4.1E-11 8.8E-16 128.1 23.7 222 75-461 78-311 (313)
84 KOG0772 Uncharacterized conser 99.4 8.8E-12 1.9E-16 142.0 19.7 105 344-466 334-450 (641)
85 KOG0305 Anaphase promoting com 99.4 1.7E-11 3.7E-16 142.7 22.4 238 74-474 187-432 (484)
86 KOG0277 Peroxisomal targeting 99.4 1.2E-11 2.6E-16 131.6 18.9 248 53-459 46-307 (311)
87 KOG0277 Peroxisomal targeting 99.4 1.3E-11 2.8E-16 131.4 19.0 220 86-464 40-268 (311)
88 KOG0289 mRNA splicing factor [ 99.4 2.2E-11 4.7E-16 136.9 21.7 224 76-459 232-460 (506)
89 KOG2445 Nuclear pore complex c 99.4 1.1E-10 2.5E-15 126.9 26.2 280 52-462 9-319 (361)
90 KOG0301 Phospholipase A2-activ 99.4 5.1E-11 1.1E-15 139.7 24.4 200 172-461 34-249 (745)
91 KOG0284 Polyadenylation factor 99.4 2.5E-12 5.4E-17 143.5 12.3 195 55-399 179-379 (464)
92 KOG0269 WD40 repeat-containing 99.4 2.6E-12 5.6E-17 151.7 13.1 109 346-473 196-309 (839)
93 KOG1332 Vesicle coat complex C 99.4 1.6E-11 3.4E-16 130.2 17.4 263 53-465 8-290 (299)
94 KOG1274 WD40 repeat protein [G 99.4 1E-10 2.2E-15 140.6 26.3 241 53-462 10-263 (933)
95 KOG0643 Translation initiation 99.4 5.9E-11 1.3E-15 127.1 21.6 211 100-469 5-228 (327)
96 KOG0316 Conserved WD40 repeat- 99.4 9.5E-11 2.1E-15 123.8 21.8 273 53-460 14-298 (307)
97 KOG0289 mRNA splicing factor [ 99.3 1.4E-10 3.1E-15 130.4 23.9 241 52-460 257-505 (506)
98 KOG0282 mRNA splicing factor [ 99.3 3.2E-12 6.9E-17 145.0 10.8 208 96-463 205-417 (503)
99 KOG0641 WD40 repeat protein [G 99.3 8.5E-10 1.8E-14 115.9 27.5 255 53-461 86-349 (350)
100 KOG0275 Conserved WD40 repeat- 99.3 6.3E-12 1.4E-16 136.7 12.2 108 343-470 279-387 (508)
101 KOG0640 mRNA cleavage stimulat 99.3 1.3E-11 2.9E-16 133.9 14.5 98 348-463 237-337 (430)
102 KOG0267 Microtubule severing p 99.3 2.8E-12 6E-17 150.5 9.6 215 75-452 40-259 (825)
103 KOG4378 Nuclear protein COP1 [ 99.3 4.7E-11 1E-15 135.6 17.9 109 344-473 182-293 (673)
104 TIGR03866 PQQ_ABC_repeats PQQ- 99.3 6E-10 1.3E-14 118.4 25.3 175 172-463 10-189 (300)
105 KOG0639 Transducin-like enhanc 99.3 9E-11 1.9E-15 133.4 19.8 259 74-461 430-704 (705)
106 KOG0308 Conserved WD40 repeat- 99.3 8.4E-12 1.8E-16 145.3 11.7 187 172-473 46-255 (735)
107 KOG1273 WD40 repeat protein [G 99.3 1.2E-10 2.6E-15 126.9 19.4 240 74-461 34-280 (405)
108 KOG0268 Sof1-like rRNA process 99.3 1.6E-11 3.4E-16 135.6 12.1 215 172-463 88-304 (433)
109 KOG0293 WD40 repeat-containing 99.3 1E-10 2.2E-15 130.9 17.6 240 51-401 264-514 (519)
110 KOG0294 WD40 repeat-containing 99.3 4.1E-10 8.8E-15 123.0 21.8 221 74-401 52-282 (362)
111 KOG0301 Phospholipase A2-activ 99.3 3.1E-10 6.8E-15 133.3 22.1 214 77-463 73-290 (745)
112 KOG0299 U3 snoRNP-associated p 99.3 1.7E-10 3.8E-15 130.5 19.4 219 72-452 211-446 (479)
113 KOG0264 Nucleosome remodeling 99.3 9.9E-11 2.1E-15 132.3 17.1 106 343-467 244-353 (422)
114 KOG2919 Guanine nucleotide-bin 99.3 2.1E-10 4.5E-15 125.5 18.6 122 346-484 227-351 (406)
115 KOG0270 WD40 repeat-containing 99.3 2.1E-10 4.6E-15 129.4 19.0 179 171-463 264-451 (463)
116 KOG0275 Conserved WD40 repeat- 99.3 1.1E-10 2.4E-15 127.1 16.1 255 55-478 212-484 (508)
117 KOG0296 Angio-associated migra 99.2 5.1E-10 1.1E-14 124.0 20.4 239 56-399 148-397 (399)
118 KOG1446 Histone H3 (Lys4) meth 99.2 4.6E-09 9.9E-14 115.0 27.4 244 55-464 55-306 (311)
119 KOG0307 Vesicle coat complex C 99.2 4.9E-11 1.1E-15 146.0 13.2 246 63-463 66-329 (1049)
120 PRK11028 6-phosphogluconolacto 99.2 1E-08 2.2E-13 114.0 30.2 104 346-465 194-308 (330)
121 KOG0640 mRNA cleavage stimulat 99.2 3.6E-10 7.7E-15 123.0 17.7 258 55-460 111-425 (430)
122 KOG0294 WD40 repeat-containing 99.2 2.4E-09 5.3E-14 117.1 24.1 210 172-465 62-285 (362)
123 KOG2109 WD40 repeat protein [G 99.2 1.2E-11 2.6E-16 144.4 6.4 316 74-473 251-588 (788)
124 KOG2096 WD40 repeat protein [G 99.2 4.5E-10 9.8E-15 122.7 17.7 108 342-462 202-309 (420)
125 KOG2106 Uncharacterized conser 99.2 8E-09 1.7E-13 118.2 27.9 96 344-458 423-518 (626)
126 KOG0647 mRNA export protein (c 99.2 5.5E-09 1.2E-13 113.8 25.3 251 55-472 26-292 (347)
127 KOG0283 WD40 repeat-containing 99.2 8.7E-10 1.9E-14 132.0 20.9 192 342-588 383-579 (712)
128 KOG4283 Transcription-coupled 99.2 2.3E-09 4.9E-14 116.3 21.9 131 171-401 122-277 (397)
129 KOG0267 Microtubule severing p 99.2 2.7E-11 5.8E-16 142.4 7.8 113 344-476 87-199 (825)
130 KOG0313 Microtubule binding pr 99.2 1.1E-09 2.5E-14 121.8 19.7 240 52-400 140-418 (423)
131 KOG4283 Transcription-coupled 99.2 6.2E-10 1.3E-14 120.6 17.1 111 345-462 164-277 (397)
132 KOG2106 Uncharacterized conser 99.2 2E-08 4.4E-13 115.0 28.8 292 56-496 200-506 (626)
133 KOG0299 U3 snoRNP-associated p 99.1 2.3E-09 4.9E-14 121.7 20.3 176 172-463 223-412 (479)
134 KOG0300 WD40 repeat-containing 99.1 5.3E-10 1.2E-14 121.8 13.8 104 341-464 286-389 (481)
135 KOG1063 RNA polymerase II elon 99.1 5.4E-09 1.2E-13 123.0 22.5 99 349-463 552-650 (764)
136 KOG1274 WD40 repeat protein [G 99.1 9.6E-09 2.1E-13 124.0 24.0 223 77-458 68-297 (933)
137 KOG0641 WD40 repeat protein [G 99.1 6.8E-08 1.5E-12 101.8 27.0 99 343-461 198-303 (350)
138 KOG0650 WD40 repeat nucleolar 99.1 7.2E-09 1.6E-13 120.4 21.3 315 53-459 397-733 (733)
139 KOG2048 WD40 repeat protein [G 99.1 2.5E-08 5.5E-13 117.4 25.9 246 53-462 69-320 (691)
140 KOG1408 WD40 repeat protein [F 99.1 3E-09 6.4E-14 125.0 18.0 134 331-469 467-634 (1080)
141 KOG1036 Mitotic spindle checkp 99.1 1.8E-08 3.8E-13 110.2 21.7 246 54-466 11-267 (323)
142 COG2319 FOG: WD40 repeat [Gene 99.0 5.7E-07 1.2E-11 95.4 32.2 178 172-464 133-317 (466)
143 KOG0646 WD40 repeat protein [G 99.0 1.2E-08 2.7E-13 116.0 20.0 221 72-442 90-330 (476)
144 KOG0973 Histone transcription 99.0 1.3E-09 2.8E-14 133.4 12.8 140 343-484 29-182 (942)
145 PRK01742 tolB translocation pr 99.0 4.9E-08 1.1E-12 113.5 24.8 90 348-463 313-403 (429)
146 KOG1034 Transcriptional repres 99.0 1.9E-08 4.1E-13 110.7 18.7 101 76-222 107-214 (385)
147 KOG0639 Transducin-like enhanc 99.0 2.7E-09 5.8E-14 121.7 12.5 204 186-461 413-622 (705)
148 KOG0646 WD40 repeat protein [G 99.0 2.1E-08 4.6E-13 114.1 19.4 114 343-465 192-311 (476)
149 KOG0650 WD40 repeat nucleolar 99.0 4.7E-09 1E-13 121.9 13.6 99 346-463 584-682 (733)
150 KOG0321 WD40 repeat-containing 99.0 9.2E-09 2E-13 120.3 15.6 114 343-474 233-361 (720)
151 KOG4328 WD40 protein [Function 98.9 2.7E-08 5.9E-13 113.0 18.4 252 57-466 187-453 (498)
152 KOG2048 WD40 repeat protein [G 98.9 5.2E-08 1.1E-12 114.8 21.0 190 172-475 46-247 (691)
153 KOG1272 WD40-repeat-containing 98.9 1.4E-09 3.1E-14 123.3 7.0 209 173-462 151-363 (545)
154 KOG2055 WD40 repeat protein [G 98.9 1.4E-07 3.1E-12 107.4 22.5 96 345-462 321-418 (514)
155 COG2319 FOG: WD40 repeat [Gene 98.9 2.4E-06 5.1E-11 90.7 29.8 223 81-464 130-362 (466)
156 PRK05137 tolB translocation pr 98.9 1E-06 2.3E-11 102.6 29.5 95 350-466 315-415 (435)
157 KOG4378 Nuclear protein COP1 [ 98.9 5.2E-08 1.1E-12 111.4 17.6 208 74-443 90-305 (673)
158 KOG0322 G-protein beta subunit 98.9 1.5E-08 3.1E-13 109.0 12.3 70 371-460 253-322 (323)
159 PRK03629 tolB translocation pr 98.9 1.2E-06 2.7E-11 102.1 28.7 83 351-455 313-397 (429)
160 KOG4328 WD40 protein [Function 98.9 6.9E-08 1.5E-12 109.8 17.5 100 346-461 298-399 (498)
161 KOG0269 WD40 repeat-containing 98.8 3.4E-08 7.4E-13 117.6 14.8 102 342-462 149-251 (839)
162 KOG0270 WD40 repeat-containing 98.8 1.1E-07 2.4E-12 107.9 17.5 129 52-232 239-374 (463)
163 KOG0302 Ribosome Assembly prot 98.8 2E-08 4.3E-13 112.0 11.2 105 342-464 273-381 (440)
164 KOG1063 RNA polymerase II elon 98.8 1.8E-07 4E-12 110.5 19.4 254 75-465 25-301 (764)
165 KOG0290 Conserved WD40 repeat- 98.8 3.6E-07 7.9E-12 99.5 20.0 175 172-452 172-357 (364)
166 PRK11028 6-phosphogluconolacto 98.8 1.2E-06 2.6E-11 97.6 25.0 110 344-467 143-264 (330)
167 KOG1963 WD40 repeat protein [G 98.8 7.3E-07 1.6E-11 107.9 23.6 286 73-464 26-325 (792)
168 KOG0307 Vesicle coat complex C 98.8 2.4E-08 5.1E-13 123.1 10.7 103 342-462 177-285 (1049)
169 KOG1188 WD40 repeat protein [G 98.8 1.1E-07 2.3E-12 105.4 14.6 102 347-465 141-246 (376)
170 KOG1034 Transcriptional repres 98.7 4.2E-08 9.2E-13 108.0 10.3 101 343-462 109-212 (385)
171 KOG1332 Vesicle coat complex C 98.7 1.9E-07 4.1E-12 99.8 14.4 107 342-466 26-139 (299)
172 KOG2110 Uncharacterized conser 98.7 1.1E-05 2.3E-10 90.6 28.0 197 174-467 107-337 (391)
173 PRK04922 tolB translocation pr 98.7 4.6E-06 1E-10 97.2 26.5 95 350-466 317-414 (433)
174 KOG1587 Cytoplasmic dynein int 98.7 1.4E-06 3E-11 104.4 22.3 97 348-462 419-517 (555)
175 KOG2111 Uncharacterized conser 98.7 1.1E-05 2.4E-10 89.1 27.2 119 343-465 197-326 (346)
176 PRK02889 tolB translocation pr 98.7 9.7E-06 2.1E-10 94.5 28.8 73 372-465 330-405 (427)
177 KOG0321 WD40 repeat-containing 98.7 6.7E-07 1.5E-11 105.1 18.0 104 343-466 288-396 (720)
178 KOG1009 Chromatin assembly com 98.6 9.8E-08 2.1E-12 107.4 10.4 138 342-483 29-175 (434)
179 KOG1273 WD40 repeat protein [G 98.6 1E-07 2.2E-12 104.6 10.1 127 342-469 38-191 (405)
180 KOG0771 Prolactin regulatory e 98.6 4.8E-07 1E-11 102.3 15.7 74 370-462 282-355 (398)
181 KOG0644 Uncharacterized conser 98.6 9.9E-08 2.1E-12 114.2 10.7 281 51-462 185-469 (1113)
182 KOG1963 WD40 repeat protein [G 98.6 2.1E-06 4.5E-11 104.1 21.6 231 173-464 37-284 (792)
183 PF08662 eIF2A: Eukaryotic tra 98.6 2.4E-06 5.1E-11 89.7 19.5 52 347-400 123-179 (194)
184 KOG1445 Tumor-specific antigen 98.6 6.2E-07 1.3E-11 104.9 14.7 101 343-461 694-798 (1012)
185 KOG0302 Ribosome Assembly prot 98.6 1.3E-07 2.9E-12 105.5 8.8 122 343-490 228-356 (440)
186 KOG0303 Actin-binding protein 98.6 2.5E-06 5.3E-11 96.1 18.2 128 52-230 77-214 (472)
187 KOG1007 WD repeat protein TSSC 98.6 5.5E-06 1.2E-10 90.5 20.0 246 74-463 75-363 (370)
188 PF10282 Lactonase: Lactonase, 98.5 0.00013 2.9E-09 82.7 32.1 83 371-468 246-329 (345)
189 KOG1009 Chromatin assembly com 98.5 2E-05 4.4E-10 89.2 23.8 95 349-463 261-374 (434)
190 PF08662 eIF2A: Eukaryotic tra 98.5 1E-06 2.2E-11 92.5 12.9 93 345-461 79-179 (194)
191 KOG1188 WD40 repeat protein [G 98.5 1.4E-06 3E-11 96.7 14.1 245 75-461 40-346 (376)
192 KOG1517 Guanine nucleotide bin 98.5 4.4E-06 9.5E-11 102.6 19.1 101 343-463 1273-1383(1387)
193 KOG0303 Actin-binding protein 98.5 1.6E-06 3.4E-11 97.6 13.9 104 343-466 148-254 (472)
194 KOG0642 Cell-cycle nuclear pro 98.5 4.3E-07 9.3E-12 105.6 9.7 110 342-461 309-426 (577)
195 PRK00178 tolB translocation pr 98.5 0.00011 2.4E-09 85.2 29.8 95 350-466 312-409 (430)
196 KOG0290 Conserved WD40 repeat- 98.4 5.8E-06 1.3E-10 90.4 16.7 106 342-465 212-322 (364)
197 PRK01742 tolB translocation pr 98.4 6.3E-06 1.4E-10 96.1 18.4 89 351-462 274-362 (429)
198 KOG2394 WD40 protein DMR-N9 [G 98.4 1.7E-06 3.7E-11 100.3 13.1 89 329-441 296-384 (636)
199 KOG1524 WD40 repeat-containing 98.4 5.2E-06 1.1E-10 96.2 16.7 208 75-461 75-287 (737)
200 KOG3881 Uncharacterized conser 98.4 1.9E-05 4.2E-10 89.1 20.2 106 342-466 219-325 (412)
201 KOG2445 Nuclear pore complex c 98.4 7.4E-06 1.6E-10 90.1 16.5 40 360-400 279-318 (361)
202 KOG2394 WD40 protein DMR-N9 [G 98.4 3.7E-07 8E-12 105.6 5.7 103 361-483 282-384 (636)
203 PRK05137 tolB translocation pr 98.4 4.9E-05 1.1E-09 88.7 23.4 96 349-465 270-368 (435)
204 TIGR02800 propeller_TolB tol-p 98.4 0.00016 3.5E-09 82.9 26.9 89 350-460 303-393 (417)
205 KOG1517 Guanine nucleotide bin 98.3 3.7E-05 7.9E-10 94.9 22.2 101 345-461 1227-1333(1387)
206 PRK03629 tolB translocation pr 98.3 5.1E-05 1.1E-09 88.7 22.9 93 350-461 268-363 (429)
207 KOG2139 WD40 repeat protein [G 98.3 1.7E-05 3.7E-10 88.8 17.3 100 347-467 216-315 (445)
208 PRK04922 tolB translocation pr 98.3 4.4E-05 9.6E-10 89.1 21.6 94 349-461 272-368 (433)
209 KOG1445 Tumor-specific antigen 98.3 1.1E-06 2.3E-11 103.0 7.5 99 343-461 644-750 (1012)
210 KOG1007 WD repeat protein TSSC 98.3 1.1E-05 2.3E-10 88.3 13.3 102 345-465 189-293 (370)
211 KOG0771 Prolactin regulatory e 98.2 4.1E-06 9E-11 95.0 10.4 122 342-465 159-315 (398)
212 TIGR02658 TTQ_MADH_Hv methylam 98.2 0.0018 3.8E-08 74.5 31.6 98 347-464 213-333 (352)
213 PRK02889 tolB translocation pr 98.2 6.6E-05 1.4E-09 87.6 20.4 93 351-462 266-361 (427)
214 PRK04792 tolB translocation pr 98.2 0.00069 1.5E-08 79.9 29.0 95 350-465 331-427 (448)
215 KOG1538 Uncharacterized conser 98.2 9.2E-06 2E-10 95.8 10.9 143 194-458 14-159 (1081)
216 KOG0642 Cell-cycle nuclear pro 98.1 3.1E-05 6.8E-10 90.6 15.0 56 344-400 506-561 (577)
217 PF00400 WD40: WD domain, G-be 98.1 5E-06 1.1E-10 64.0 5.5 38 360-398 2-39 (39)
218 PRK01029 tolB translocation pr 98.1 0.0011 2.5E-08 77.7 26.6 85 362-466 319-406 (428)
219 PRK04792 tolB translocation pr 98.1 0.00036 7.8E-09 82.2 22.0 98 350-466 287-385 (448)
220 KOG1538 Uncharacterized conser 98.1 0.00018 4E-09 85.3 18.8 262 73-462 22-294 (1081)
221 COG2706 3-carboxymuconate cycl 98.0 0.0078 1.7E-07 68.2 30.8 106 347-468 211-328 (346)
222 TIGR02800 propeller_TolB tol-p 98.0 0.00057 1.2E-08 78.3 22.7 95 349-462 258-355 (417)
223 KOG4227 WD40 repeat protein [G 98.0 0.00084 1.8E-08 76.0 22.8 296 53-460 53-386 (609)
224 KOG0322 G-protein beta subunit 98.0 4.7E-05 1E-09 82.6 12.5 57 342-399 266-322 (323)
225 KOG0649 WD40 repeat protein [G 98.0 0.0015 3.4E-08 70.5 23.3 63 172-234 135-201 (325)
226 KOG0644 Uncharacterized conser 98.0 2E-06 4.4E-11 103.4 1.7 96 342-461 205-300 (1113)
227 KOG2919 Guanine nucleotide-bin 98.0 0.00036 7.9E-09 77.6 18.7 55 342-400 312-367 (406)
228 KOG1310 WD40 repeat protein [G 98.0 0.00024 5.2E-09 83.1 17.6 121 344-467 165-309 (758)
229 KOG0300 WD40 repeat-containing 98.0 2.6E-05 5.7E-10 86.0 9.0 127 343-472 164-313 (481)
230 KOG0649 WD40 repeat protein [G 97.9 0.00025 5.4E-09 76.4 16.0 99 344-464 131-238 (325)
231 PRK00178 tolB translocation pr 97.9 0.0013 2.9E-08 76.3 22.8 96 349-465 267-365 (430)
232 KOG2321 WD40 repeat protein [G 97.9 0.00088 1.9E-08 79.0 19.8 186 171-464 153-346 (703)
233 KOG2315 Predicted translation 97.9 0.0014 3E-08 77.3 21.4 129 172-400 250-390 (566)
234 KOG1272 WD40-repeat-containing 97.8 3.8E-05 8.2E-10 88.2 8.2 182 173-470 231-416 (545)
235 PF02239 Cytochrom_D1: Cytochr 97.8 0.0051 1.1E-07 71.1 25.7 177 173-463 16-204 (369)
236 PRK04043 tolB translocation pr 97.8 0.016 3.6E-07 68.0 30.2 49 174-222 214-268 (419)
237 KOG4227 WD40 repeat protein [G 97.8 0.0005 1.1E-08 77.8 15.1 219 172-462 77-323 (609)
238 KOG1587 Cytoplasmic dynein int 97.7 0.00088 1.9E-08 80.8 17.9 102 343-463 364-474 (555)
239 PF10282 Lactonase: Lactonase, 97.7 0.031 6.7E-07 63.7 29.5 55 347-401 265-323 (345)
240 KOG0974 WD-repeat protein WDR6 97.7 0.00011 2.5E-09 90.8 10.4 99 342-461 148-246 (967)
241 KOG3881 Uncharacterized conser 97.7 0.0012 2.5E-08 75.2 17.3 80 344-444 264-344 (412)
242 PF02239 Cytochrom_D1: Cytochr 97.7 0.00039 8.5E-09 80.2 13.9 99 346-465 13-112 (369)
243 KOG1240 Protein kinase contain 97.6 0.015 3.2E-07 73.9 25.4 102 345-463 1213-1336(1431)
244 KOG1524 WD40 repeat-containing 97.6 0.00097 2.1E-08 78.0 14.0 86 348-456 165-250 (737)
245 KOG1310 WD40 repeat protein [G 97.6 0.00019 4.2E-09 83.8 8.4 83 362-463 43-127 (758)
246 PRK01029 tolB translocation pr 97.5 0.0062 1.3E-07 71.6 20.9 75 371-461 282-359 (428)
247 PLN02919 haloacid dehalogenase 97.5 0.05 1.1E-06 71.0 30.6 72 373-463 807-890 (1057)
248 KOG4497 Uncharacterized conser 97.5 0.0012 2.5E-08 73.8 13.2 92 346-458 111-237 (447)
249 KOG2139 WD40 repeat protein [G 97.5 0.00054 1.2E-08 77.2 10.4 76 366-461 193-268 (445)
250 KOG4547 WD40 repeat-containing 97.4 0.011 2.4E-07 70.3 21.0 53 348-400 163-220 (541)
251 KOG1523 Actin-related protein 97.4 0.0084 1.8E-07 67.1 18.7 88 367-459 144-234 (361)
252 PF00400 WD40: WD domain, G-be 97.4 0.00052 1.1E-08 52.7 6.7 37 421-459 3-39 (39)
253 KOG0974 WD-repeat protein WDR6 97.4 0.0015 3.3E-08 81.2 14.0 103 343-467 191-294 (967)
254 KOG1334 WD40 repeat protein [G 97.4 0.0033 7.1E-08 73.2 15.7 121 51-222 137-266 (559)
255 KOG2321 WD40 repeat protein [G 97.4 0.0032 7E-08 74.4 15.2 124 347-492 153-282 (703)
256 KOG1240 Protein kinase contain 97.3 0.0032 6.8E-08 79.6 15.0 92 359-463 1038-1130(1431)
257 PF04762 IKI3: IKI3 family; I 97.3 0.26 5.6E-06 63.7 32.4 98 348-463 236-335 (928)
258 KOG1523 Actin-related protein 97.2 0.0062 1.4E-07 68.1 14.3 128 342-490 25-155 (361)
259 COG2706 3-carboxymuconate cycl 97.1 0.15 3.3E-06 58.0 24.8 30 373-402 294-323 (346)
260 KOG4497 Uncharacterized conser 97.1 0.0027 5.8E-08 71.1 10.6 88 346-453 68-155 (447)
261 KOG4547 WD40 repeat-containing 97.1 0.027 5.8E-07 67.1 19.3 99 344-464 119-223 (541)
262 KOG2315 Predicted translation 97.1 0.11 2.5E-06 61.8 24.1 99 349-470 251-353 (566)
263 KOG1409 Uncharacterized conser 97.1 0.01 2.2E-07 66.9 14.8 219 172-464 45-273 (404)
264 KOG2314 Translation initiation 97.1 0.043 9.4E-07 65.1 20.3 293 72-462 219-526 (698)
265 KOG1354 Serine/threonine prote 97.0 0.11 2.3E-06 59.1 21.5 78 369-462 272-360 (433)
266 PRK04043 tolB translocation pr 96.9 0.054 1.2E-06 63.8 20.1 98 348-466 256-360 (419)
267 KOG1064 RAVE (regulator of V-A 96.9 0.0035 7.5E-08 81.6 9.7 88 347-466 2313-2403(2439)
268 TIGR02658 TTQ_MADH_Hv methylam 96.8 0.014 3.1E-07 67.2 13.9 98 348-464 26-139 (352)
269 COG5354 Uncharacterized protei 96.8 0.11 2.3E-06 61.5 20.7 284 72-461 41-348 (561)
270 KOG3914 WD repeat protein WDR4 96.8 0.0032 7E-08 71.8 8.1 99 347-466 130-228 (390)
271 smart00320 WD40 WD40 repeats. 96.8 0.0033 7.2E-08 44.3 5.5 38 360-398 3-40 (40)
272 PF11768 DUF3312: Protein of u 96.6 0.013 2.9E-07 69.9 11.6 91 351-463 238-331 (545)
273 KOG1912 WD40 repeat protein [G 96.5 0.31 6.7E-06 60.1 22.3 124 52-218 51-185 (1062)
274 PF07433 DUF1513: Protein of u 96.3 1 2.2E-05 51.2 23.4 104 348-463 137-249 (305)
275 PF13360 PQQ_2: PQQ-like domai 96.2 2.1 4.4E-05 45.0 27.3 94 345-463 128-232 (238)
276 COG4946 Uncharacterized protei 96.2 1.9 4.1E-05 51.1 25.0 119 54-229 357-486 (668)
277 KOG1275 PAB-dependent poly(A) 96.2 0.13 2.9E-06 64.1 16.7 185 172-462 156-343 (1118)
278 PF03178 CPSF_A: CPSF A subuni 96.0 0.34 7.4E-06 54.4 18.4 96 348-463 106-204 (321)
279 COG4946 Uncharacterized protei 95.9 0.071 1.5E-06 62.4 12.4 98 344-462 376-478 (668)
280 KOG1645 RING-finger-containing 95.9 0.024 5.1E-07 65.2 8.1 96 351-467 175-272 (463)
281 TIGR03300 assembly_YfgL outer 95.9 4.6 0.0001 46.2 28.1 58 172-229 114-173 (377)
282 KOG1275 PAB-dependent poly(A) 95.8 0.15 3.2E-06 63.7 14.7 52 173-224 197-259 (1118)
283 KOG1334 WD40 repeat protein [G 95.7 0.03 6.6E-07 65.5 8.5 56 343-399 410-465 (559)
284 PF15492 Nbas_N: Neuroblastoma 95.7 2 4.4E-05 48.0 22.1 31 368-399 228-258 (282)
285 KOG1912 WD40 repeat protein [G 95.7 0.18 3.9E-06 62.1 14.9 101 343-462 441-552 (1062)
286 TIGR03300 assembly_YfgL outer 95.6 5.6 0.00012 45.5 28.7 92 345-459 285-377 (377)
287 COG5170 CDC55 Serine/threonine 95.5 0.77 1.7E-05 51.8 17.7 81 365-463 276-369 (460)
288 COG5354 Uncharacterized protei 95.5 0.91 2E-05 54.0 19.1 78 347-453 338-421 (561)
289 PLN02919 haloacid dehalogenase 95.2 2.6 5.6E-05 55.5 24.0 86 372-462 742-834 (1057)
290 PF15492 Nbas_N: Neuroblastoma 95.1 2.1 4.6E-05 47.9 19.4 118 346-464 116-262 (282)
291 smart00320 WD40 WD40 repeats. 94.8 0.11 2.3E-06 36.3 5.9 29 431-459 12-40 (40)
292 KOG0280 Uncharacterized conser 94.5 0.2 4.4E-06 55.9 10.0 104 344-465 138-245 (339)
293 KOG4532 WD40-like repeat conta 94.0 0.64 1.4E-05 51.6 12.3 100 346-464 135-236 (344)
294 KOG4190 Uncharacterized conser 93.8 0.14 3.1E-06 60.6 7.3 93 346-459 848-946 (1034)
295 KOG1832 HIV-1 Vpr-binding prot 93.8 0.057 1.2E-06 66.9 4.1 107 74-228 1112-1223(1516)
296 KOG4415 Uncharacterized conser 93.6 0.062 1.3E-06 56.2 3.5 38 682-720 20-58 (247)
297 PF03178 CPSF_A: CPSF A subuni 93.6 16 0.00034 41.2 26.6 50 173-222 62-118 (321)
298 KOG4714 Nucleoporin [Nuclear s 93.5 0.17 3.8E-06 55.7 6.8 115 343-459 196-316 (319)
299 PF08553 VID27: VID27 cytoplas 93.4 7.5 0.00016 49.6 21.8 56 343-400 592-647 (794)
300 PF07433 DUF1513: Protein of u 93.4 18 0.00039 41.4 26.0 73 368-465 215-287 (305)
301 KOG3621 WD40 repeat-containing 93.3 0.22 4.7E-06 60.9 8.0 103 344-462 50-155 (726)
302 KOG4190 Uncharacterized conser 93.1 0.095 2E-06 62.0 4.4 89 361-466 727-815 (1034)
303 KOG1064 RAVE (regulator of V-A 93.1 0.21 4.6E-06 66.2 7.8 121 343-468 2224-2373(2439)
304 KOG4532 WD40-like repeat conta 93.0 7.5 0.00016 43.6 18.2 60 342-401 218-283 (344)
305 KOG0309 Conserved WD40 repeat- 93.0 0.25 5.5E-06 60.5 7.7 57 173-229 180-244 (1081)
306 KOG0309 Conserved WD40 repeat- 91.7 1.3 2.9E-05 54.6 11.6 96 347-461 178-277 (1081)
307 PF12894 Apc4_WD40: Anaphase-p 91.0 0.51 1.1E-05 39.2 5.2 31 431-461 11-41 (47)
308 KOG4640 Anaphase-promoting com 90.9 0.86 1.9E-05 55.4 9.0 76 371-468 22-99 (665)
309 KOG0280 Uncharacterized conser 90.9 0.73 1.6E-05 51.7 7.9 103 343-466 182-289 (339)
310 COG0823 TolB Periplasmic compo 90.8 0.88 1.9E-05 54.0 9.0 98 349-467 218-318 (425)
311 PF04762 IKI3: IKI3 family; I 90.8 67 0.0015 42.2 28.7 85 369-461 426-519 (928)
312 KOG2695 WD40 repeat protein [G 90.3 0.71 1.5E-05 52.7 7.2 104 344-463 269-378 (425)
313 KOG2314 Translation initiation 90.3 0.75 1.6E-05 55.2 7.7 95 350-464 232-337 (698)
314 PF08450 SGL: SMP-30/Gluconola 90.1 30 0.00066 37.1 25.4 99 349-461 115-213 (246)
315 PF13360 PQQ_2: PQQ-like domai 90.0 27 0.00059 36.5 23.4 57 173-229 86-150 (238)
316 PF00930 DPPIV_N: Dipeptidyl p 89.6 0.78 1.7E-05 52.5 7.1 103 348-465 22-135 (353)
317 PF11768 DUF3312: Protein of u 89.3 0.91 2E-05 54.8 7.5 55 344-401 276-330 (545)
318 PRK02888 nitrous-oxide reducta 89.3 3.1 6.6E-05 51.5 12.0 116 345-464 211-354 (635)
319 KOG2066 Vacuolar assembly/sort 89.0 1.4 3E-05 54.9 8.8 92 343-464 53-149 (846)
320 KOG2395 Protein involved in va 88.6 19 0.00042 43.7 17.3 55 343-399 445-499 (644)
321 KOG4640 Anaphase-promoting com 88.5 1.3 2.9E-05 53.9 8.1 58 342-401 35-93 (665)
322 KOG2041 WD40 repeat protein [G 88.4 0.8 1.7E-05 56.2 6.2 101 342-460 29-144 (1189)
323 KOG0882 Cyclophilin-related pe 88.1 2.5 5.4E-05 49.9 9.7 112 347-464 120-234 (558)
324 KOG3617 WD40 and TPR repeat-co 87.7 1.2 2.7E-05 55.6 7.2 99 344-466 36-136 (1416)
325 KOG1354 Serine/threonine prote 87.1 1.5 3.3E-05 50.1 7.1 81 370-467 26-122 (433)
326 PF00780 CNH: CNH domain; Int 87.0 14 0.00029 40.3 14.4 55 169-223 110-169 (275)
327 PF12894 Apc4_WD40: Anaphase-p 86.8 1.6 3.4E-05 36.4 5.3 30 369-399 11-40 (47)
328 COG0823 TolB Periplasmic compo 86.0 6.1 0.00013 47.0 11.8 85 349-452 262-346 (425)
329 PF14783 BBS2_Mid: Ciliary BBS 85.8 7.6 0.00017 38.2 10.2 65 372-460 2-70 (111)
330 KOG2066 Vacuolar assembly/sort 84.7 25 0.00054 44.5 16.1 46 172-217 92-144 (846)
331 PF08450 SGL: SMP-30/Gluconola 84.2 64 0.0014 34.6 30.0 60 371-451 185-245 (246)
332 KOG2114 Vacuolar assembly/sort 83.4 1.3E+02 0.0028 38.9 21.4 84 348-454 192-276 (933)
333 KOG3914 WD repeat protein WDR4 83.2 1.6 3.5E-05 50.6 5.1 74 73-193 161-235 (390)
334 PF00780 CNH: CNH domain; Int 82.4 27 0.00059 38.0 14.1 54 175-229 210-265 (275)
335 KOG1920 IkappaB kinase complex 81.7 1.3E+02 0.0027 40.3 20.9 98 349-463 222-324 (1265)
336 KOG2114 Vacuolar assembly/sort 80.9 7.1 0.00015 49.4 9.6 108 342-461 38-155 (933)
337 PRK11138 outer membrane biogen 80.3 1.2E+02 0.0026 35.2 26.4 90 347-460 302-393 (394)
338 PF10168 Nup88: Nuclear pore c 80.3 57 0.0012 41.6 17.5 88 370-462 85-180 (717)
339 KOG2695 WD40 repeat protein [G 79.8 3.9 8.5E-05 47.0 6.5 107 345-469 230-337 (425)
340 PF04841 Vps16_N: Vps16, N-ter 79.3 1.4E+02 0.0031 35.4 25.2 48 172-220 60-110 (410)
341 PF05694 SBP56: 56kDa selenium 79.1 21 0.00044 42.8 12.2 45 172-216 221-274 (461)
342 PF04053 Coatomer_WDAD: Coatom 78.8 1.6E+02 0.0034 35.7 19.7 58 381-462 117-174 (443)
343 PRK13616 lipoprotein LpqB; Pro 78.4 11 0.00024 46.8 10.3 97 349-466 379-482 (591)
344 PF02897 Peptidase_S9_N: Proly 78.3 17 0.00037 42.2 11.5 101 345-463 146-260 (414)
345 KOG1645 RING-finger-containing 78.2 6.5 0.00014 46.0 7.7 52 172-223 215-270 (463)
346 KOG2079 Vacuolar assembly/sort 76.8 2.9 6.3E-05 53.8 4.8 71 379-469 97-168 (1206)
347 COG3391 Uncharacterized conser 76.5 36 0.00078 39.7 13.5 94 347-461 94-190 (381)
348 KOG1832 HIV-1 Vpr-binding prot 76.3 1.5 3.3E-05 55.0 2.2 100 342-463 1116-1216(1516)
349 KOG4714 Nucleoporin [Nuclear s 75.7 3.1 6.7E-05 46.3 4.1 94 349-461 159-254 (319)
350 PRK02888 nitrous-oxide reducta 75.4 21 0.00044 44.6 11.3 106 348-462 295-405 (635)
351 COG3386 Gluconolactonase [Carb 75.1 1.6E+02 0.0035 33.8 20.5 101 348-461 142-243 (307)
352 PRK11138 outer membrane biogen 74.5 76 0.0017 36.8 15.4 51 345-398 341-392 (394)
353 KOG3617 WD40 and TPR repeat-co 73.5 5.3 0.00012 50.3 5.7 59 341-400 73-131 (1416)
354 KOG2079 Vacuolar assembly/sort 73.4 8.3 0.00018 49.9 7.5 57 344-401 104-161 (1206)
355 PF00930 DPPIV_N: Dipeptidyl p 72.4 1.2E+02 0.0027 34.8 16.3 50 173-222 23-74 (353)
356 KOG1409 Uncharacterized conser 72.1 17 0.00038 42.0 8.9 111 361-475 106-242 (404)
357 PF07676 PD40: WD40-like Beta 70.6 8.7 0.00019 29.7 4.5 26 433-458 10-38 (39)
358 COG3386 Gluconolactonase [Carb 68.8 36 0.00078 39.0 10.7 78 370-466 111-198 (307)
359 PF07676 PD40: WD40-like Beta 68.7 15 0.00033 28.3 5.5 30 368-397 7-38 (39)
360 PRK13616 lipoprotein LpqB; Pro 68.5 23 0.00049 44.1 9.7 101 348-465 429-529 (591)
361 PF12429 DUF3676: Protein of u 68.0 6.1 0.00013 41.8 4.0 109 871-987 28-138 (230)
362 PF06433 Me-amine-dh_H: Methyl 67.9 2.1E+02 0.0045 33.6 16.5 51 350-401 270-321 (342)
363 PF06433 Me-amine-dh_H: Methyl 67.8 11 0.00025 43.5 6.5 57 173-229 269-330 (342)
364 PF10313 DUF2415: Uncharacteri 67.6 13 0.00029 30.6 5.0 29 434-462 3-34 (43)
365 KOG2041 WD40 repeat protein [G 66.7 6.7 0.00014 48.7 4.5 97 344-461 88-186 (1189)
366 PF02897 Peptidase_S9_N: Proly 66.0 27 0.00059 40.6 9.3 64 370-453 124-191 (414)
367 PF08596 Lgl_C: Lethal giant l 65.0 90 0.002 37.0 13.4 86 371-460 3-114 (395)
368 PF10647 Gmad1: Lipoprotein Lp 64.9 72 0.0016 35.2 11.9 77 371-463 67-146 (253)
369 COG3391 Uncharacterized conser 64.9 84 0.0018 36.7 13.1 93 347-461 138-239 (381)
370 KOG2444 WD40 repeat protein [G 63.3 16 0.00035 40.1 6.2 105 344-468 75-184 (238)
371 PF08596 Lgl_C: Lethal giant l 62.8 40 0.00086 40.0 9.9 84 361-463 78-175 (395)
372 PF14870 PSII_BNR: Photosynthe 62.3 36 0.00077 39.0 9.1 70 369-459 144-213 (302)
373 COG5170 CDC55 Serine/threonine 61.5 15 0.00032 42.0 5.6 87 370-463 27-119 (460)
374 PF08553 VID27: VID27 cytoplas 61.0 24 0.00051 45.3 8.0 93 347-461 550-647 (794)
375 PF12234 Rav1p_C: RAVE protein 58.0 97 0.0021 39.0 12.3 101 348-460 50-155 (631)
376 PF10313 DUF2415: Uncharacteri 56.7 24 0.00052 29.2 4.7 29 371-400 2-33 (43)
377 KOG1916 Nuclear protein, conta 56.6 8.2 0.00018 49.1 3.0 108 345-467 201-329 (1283)
378 PF14781 BBS2_N: Ciliary BBSom 55.4 1.4E+02 0.003 30.7 10.7 58 171-228 71-134 (136)
379 KOG1897 Damage-specific DNA bi 55.1 6.5E+02 0.014 33.5 24.8 97 347-461 846-942 (1096)
380 PF14583 Pectate_lyase22: Olig 54.0 1.1E+02 0.0023 36.4 11.2 101 345-462 260-382 (386)
381 PF14655 RAB3GAP2_N: Rab3 GTPa 52.9 77 0.0017 37.9 10.1 40 361-401 299-338 (415)
382 PF06977 SdiA-regulated: SdiA- 50.4 1.7E+02 0.0036 32.7 11.6 83 364-466 16-99 (248)
383 PF08728 CRT10: CRT10; InterP 49.7 5.7E+02 0.012 33.0 17.1 74 371-461 165-246 (717)
384 cd00216 PQQ_DH Dehydrogenases 48.8 5.7E+02 0.012 31.0 21.5 57 173-229 71-138 (488)
385 KOG1920 IkappaB kinase complex 48.2 73 0.0016 42.3 9.4 68 370-458 69-136 (1265)
386 PF05694 SBP56: 56kDa selenium 47.7 1.8E+02 0.0039 35.2 11.8 107 346-465 219-346 (461)
387 PF14781 BBS2_N: Ciliary BBSom 47.7 95 0.0021 31.8 8.3 52 173-224 20-86 (136)
388 COG3490 Uncharacterized protei 47.6 1.6E+02 0.0035 33.8 10.8 42 347-388 138-180 (366)
389 PF04841 Vps16_N: Vps16, N-ter 47.1 1.1E+02 0.0024 36.3 10.3 48 393-461 62-109 (410)
390 PF05787 DUF839: Bacterial pro 47.0 40 0.00087 41.4 6.7 76 374-449 440-519 (524)
391 PF12657 TFIIIC_delta: Transcr 46.5 1.6E+02 0.0034 30.6 10.2 30 433-462 87-122 (173)
392 KOG3621 WD40 repeat-containing 45.5 33 0.00071 42.9 5.6 80 368-467 32-111 (726)
393 PF14783 BBS2_Mid: Ciliary BBS 45.5 3.2E+02 0.0069 27.1 11.6 52 343-399 19-70 (111)
394 PF01731 Arylesterase: Arylest 41.0 1.1E+02 0.0025 28.7 7.2 50 348-400 35-84 (86)
395 smart00036 CNH Domain found in 39.5 4.2E+02 0.0091 30.1 13.0 56 174-229 223-280 (302)
396 PF03022 MRJP: Major royal jel 38.9 5.3E+02 0.012 29.2 13.6 106 349-462 34-160 (287)
397 PF14727 PHTB1_N: PTHB1 N-term 38.0 8E+02 0.017 29.6 26.8 52 171-222 95-166 (418)
398 KOG2377 Uncharacterized conser 37.6 92 0.002 37.7 7.3 87 369-473 66-153 (657)
399 TIGR02276 beta_rpt_yvtn 40-res 36.8 1.7E+02 0.0037 22.3 6.7 22 379-400 1-22 (42)
400 KOG1008 Uncharacterized conser 36.7 52 0.0011 41.0 5.3 56 346-401 214-276 (783)
401 cd00216 PQQ_DH Dehydrogenases 34.8 9E+02 0.02 29.3 20.4 58 172-229 119-193 (488)
402 KOG2395 Protein involved in va 34.3 1.4E+02 0.0031 36.7 8.3 102 344-460 351-458 (644)
403 PF04053 Coatomer_WDAD: Coatom 33.9 69 0.0015 38.6 5.8 44 173-217 126-171 (443)
404 PF01731 Arylesterase: Arylest 33.4 1.1E+02 0.0023 28.9 5.8 28 435-462 57-85 (86)
405 PF12234 Rav1p_C: RAVE protein 31.9 4.2E+02 0.0091 33.7 12.1 81 364-463 24-106 (631)
406 KOG4649 PQQ (pyrrolo-quinoline 31.1 8.7E+02 0.019 28.0 17.0 56 173-228 73-132 (354)
407 smart00036 CNH Domain found in 30.4 4E+02 0.0087 30.2 10.9 39 76-116 14-53 (302)
408 PF10647 Gmad1: Lipoprotein Lp 30.3 2.4E+02 0.0052 31.1 8.9 67 371-459 25-93 (253)
409 KOG0882 Cyclophilin-related pe 30.2 27 0.00059 41.7 1.6 58 344-401 25-85 (558)
410 PF07250 Glyoxal_oxid_N: Glyox 27.8 1.5E+02 0.0033 33.0 6.8 90 350-452 47-138 (243)
411 PF11715 Nup160: Nucleoporin N 27.8 2.9E+02 0.0063 33.7 9.9 71 74-190 157-257 (547)
412 PRK13684 Ycf48-like protein; P 27.7 2.4E+02 0.0053 32.4 8.7 65 370-455 173-237 (334)
413 PF05096 Glu_cyclase_2: Glutam 27.4 3E+02 0.0066 31.1 9.0 57 173-229 110-167 (264)
414 PRK10115 protease 2; Provision 26.3 5.1E+02 0.011 33.1 11.8 100 345-462 149-256 (686)
415 PF03088 Str_synth: Strictosid 26.1 2.5E+02 0.0054 26.6 6.9 27 421-449 48-74 (89)
416 PF12657 TFIIIC_delta: Transcr 25.6 7.4E+02 0.016 25.6 11.1 16 569-584 105-120 (173)
417 KOG2444 WD40 repeat protein [G 25.5 70 0.0015 35.4 3.6 57 344-401 119-178 (238)
418 PF07569 Hira: TUP1-like enhan 24.9 2.2E+02 0.0047 31.0 7.2 52 346-399 29-94 (219)
419 COG3490 Uncharacterized protei 24.5 3.3E+02 0.0072 31.5 8.5 75 351-450 93-180 (366)
420 KOG4460 Nuclear pore complex, 24.2 2.8E+02 0.0062 34.3 8.4 85 371-463 105-200 (741)
421 TIGR02604 Piru_Ver_Nterm putat 24.2 1.8E+02 0.0038 33.8 6.8 63 370-450 124-202 (367)
422 KOG1898 Splicing factor 3b, su 24.1 1.9E+03 0.042 29.7 19.3 94 349-462 954-1049(1205)
423 KOG4649 PQQ (pyrrolo-quinoline 24.1 4.4E+02 0.0095 30.2 9.3 56 345-401 69-124 (354)
424 COG3211 PhoX Predicted phospha 24.1 1.7E+02 0.0037 36.4 6.7 64 373-449 503-571 (616)
425 PF03022 MRJP: Major royal jel 24.0 3.6E+02 0.0077 30.6 9.0 59 172-230 33-107 (287)
426 COG2133 Glucose/sorbosone dehy 24.0 2.2E+02 0.0049 34.0 7.6 30 361-390 167-197 (399)
427 PF10395 Utp8: Utp8 family; I 23.7 7.2E+02 0.016 31.8 12.1 49 171-219 249-305 (670)
428 PF14870 PSII_BNR: Photosynthe 23.6 3.7E+02 0.0081 30.9 9.1 92 347-455 163-255 (302)
429 KOG1900 Nuclear pore complex, 23.4 3.9E+02 0.0084 36.4 10.0 35 367-402 240-274 (1311)
430 KOG1008 Uncharacterized conser 22.8 32 0.00069 42.7 0.5 92 347-460 127-224 (783)
431 PF03088 Str_synth: Strictosid 22.5 2.1E+02 0.0046 27.1 5.7 53 344-397 32-85 (89)
432 PRK10115 protease 2; Provision 21.9 2.2E+02 0.0048 36.2 7.5 62 370-452 127-192 (686)
433 PF14761 HPS3_N: Hermansky-Pud 21.6 2.7E+02 0.0059 30.7 7.0 60 374-453 22-81 (215)
434 PF07569 Hira: TUP1-like enhan 21.5 1.4E+02 0.0031 32.4 5.0 38 187-224 7-45 (219)
435 PF10214 Rrn6: RNA polymerase 21.5 1.8E+03 0.04 28.5 16.9 102 341-463 160-278 (765)
436 KOG3630 Nuclear pore complex, 20.9 91 0.002 41.3 3.8 95 347-460 122-227 (1405)
437 PF12341 DUF3639: Protein of u 20.5 1.8E+02 0.004 21.9 3.8 25 433-459 3-27 (27)
438 TIGR03074 PQQ_membr_DH membran 20.4 2E+03 0.043 28.5 18.8 58 172-229 269-354 (764)
No 1
>KOG2109 consensus WD40 repeat protein [General function prediction only]
Probab=100.00 E-value=3.9e-60 Score=540.66 Aligned_cols=629 Identities=31% Similarity=0.366 Sum_probs=473.5
Q ss_pred CCchhhHhhcceeeeccCCcceehhhhhhcccccccccCCCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEc
Q 001814 12 LPNSLKIISSCLKTVSTNASTVASTVRSAGASVAASISNASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDV 91 (1010)
Q Consensus 12 ~~~s~~~~s~~~~~~s~~a~~~~~~~rs~~~s~a~~i~~~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv 91 (1010)
+|+|||+||.|+|+.|+|+.+++ +|++++.+.+.+.++|||+|++||+ .++..++||+++|.+|||+||.
T Consensus 1 ~p~s~~~vs~c~k~~ssg~~~~~-------~s~~~~~ss~~~e~~dqvlw~~fD~---~~~~~~~Vlll~~~~gfqv~d~ 70 (788)
T KOG2109|consen 1 MPPSANSVSGCKKKNSSGHQRPQ-------QSHQQTQSSPLPEEEDQVLWIKFDP---KPEVLEEVLLLNREEGFQVVDE 70 (788)
T ss_pred CCcccchhccchhhcccccccHH-------HHHHhhcCCCChhhhccccccccCC---chhHHHHHHHHhhccCceEEee
Confidence 59999999999999997776665 4566777777899999999999995 2344688999999999999999
Q ss_pred cCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCC-cCCCCCCCCC
Q 001814 92 EDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGM-MDSQSGNCVN 170 (1010)
Q Consensus 92 ~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs-~d~~~~~~~~ 170 (1010)
++...+++..+.|++++.|++|++.|..+...++|++++|++|+|... .+.+ .+..-++|. .+.
T Consensus 71 ~Dsp~vh~~vs~~dd~~~f~sm~~~pl~sg~~~gf~ss~avpavv~~t---~S~p-----~I~~S~~Gse~d~------- 135 (788)
T KOG2109|consen 71 TDSPTVHKEVSISDDLLDFSSMDKSPLSSGPDSGFESSDAVPAVVRTT---TSPP-----TIPPSQTGSEQDS------- 135 (788)
T ss_pred ccCCccceeeeecCCcceecccCCCCccCCCCCccccCCceeeecccc---cCCC-----cCCCCCCcceecc-------
Confidence 999999999999999999999999999988888999999999987522 1110 111224444 221
Q ss_pred CCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccce
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPM 250 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gpl 250 (1010)
.+....++|+++..++|.++|+ +|+|||+.+++..+.+..++.|.+ ..+++|+|+
T Consensus 136 t~an~~v~dl~S~~yah~l~fR-------------------qi~CfDa~tle~d~~~~~n~~p~l------~l~VGYGpl 190 (788)
T KOG2109|consen 136 TQANEMVVDLMSLDYAHALPFR-------------------QIHCFDAPTLEIDSMNTINTKPRL------LLSVGYGPL 190 (788)
T ss_pred cccccceeccccccchhccccc-------------------ccccccCcccCCchhhcccccccc------ceeeccccc
Confidence 2456788999999999999997 899999999998888888877632 245789999
Q ss_pred EEccceEEEccCCeeeccCCccCCCcCCC-CCCCCCcCCCCCceEEEeehhhhhhhhcccc-------eeeccccccccC
Q 001814 251 AVGPRWLAYASNTLLLSNSGRLSPQNLTP-SGVSPSTSPGGSSLVARYAMEHSKQFAAGLS-------KTLSKYCQELLP 322 (1010)
Q Consensus 251 AlgpRwLAyas~~~~iwd~G~vs~Q~lt~-p~vS~stSP~~gslVa~~A~dssk~la~Gi~-------ktls~y~~~l~p 322 (1010)
++++||+||+++... .++.+.++. +.+++++|+.++..++++|++++|++|.|+. +++++||+..++
T Consensus 191 aVg~rWaaya~~~a~-----~vss~~Vt~~~~VspttSs~~~~~va~~A~essk~lA~gl~nlgDkGy~~isglc~g~~~ 265 (788)
T KOG2109|consen 191 AVGRRWAAYAQTLAN-----QVSSHLVTMGMSVSPTTSSQITAEVAEWAQESSKELAGGLVNLGDKGYVLISGLCRGSYQ 265 (788)
T ss_pred cceeeeeeeccCcch-----hhhhccccccccccCCCCCchhHHHHHhhhhhhHHHhhhhcccccchHHHHHHHhhcccC
Confidence 999999999987432 112244555 7888999999899999999999999999965 688999999877
Q ss_pred CCCCCCccCCCccccccccccccCC-CCe--EEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 323 DGSSSPVSPNSVWKVGRHAGADMDN-AGI--VVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 323 ~gs~s~~S~s~~~k~~~~~iasgs~-dG~--V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
.+......-....+.++....++.. -|+ +.+-|+.+...+.+|++|++||++|+|+++|.+|++++..|+.|++|++
T Consensus 266 ~g~gpglgg~~~~~vGrvg~vsaesV~g~~~vivkdf~S~a~i~QfkAhkspiSaLcfdqsgsllViasi~g~nVnvfRi 345 (788)
T KOG2109|consen 266 IGTGPGLGGFEEVLVGRVGPVSAESVLGNNLVIVKDFDSFADIRQFKAHKSPISALCFDQSGSLLVIASITGRNVNVFRI 345 (788)
T ss_pred CCCCCCCCCcCceeccccccccceeecccceEEeecccchhhhhheeeecCcccccccccCceEEEEEeeccceeeeEEe
Confidence 6643332222222233322333333 455 9999999999999999999999999999999999999999999999999
Q ss_pred CCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCC
Q 001814 400 MPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDP 479 (1010)
Q Consensus 400 ~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~ 479 (1010)
++.....+.+..+..|. .||++.+.|++|+|+-+.+|++++|.+|+- |
T Consensus 346 met~~t~~~~~qs~~~s---------~ra~t~aviqdicfs~~s~~r~~gsc~Ge~-----------------------P 393 (788)
T KOG2109|consen 346 METVCTVNVSDQSLVVS---------PRANTAAVIQDICFSEVSTIRTAGSCEGEP-----------------------P 393 (788)
T ss_pred ccccccccccccccccc---------hhcchHHHHHHHhhhhhcceEeecccCCCC-----------------------c
Confidence 98543333332222221 489999999999999999999999966653 3
Q ss_pred ccCCCCCCCcccCCCCCccCccCCCCCceeeeeeeeeeecCCccccccccccccccCccccccceeeeecccCccccccc
Q 001814 480 YLFPVLSLPWWCTSSGISEQQCVLPPPPVTLSVVSRIKYSSFGWLNTVSNASASSMGKVFVPSGAVAAVFHNSIAHSSQH 559 (1010)
Q Consensus 480 ~~~pv~~lpw~~~ss~~~~q~~~p~p~~~~l~~vsrIk~~~~~w~~~v~~a~~~at~~~~~ps~~va~~F~~~~~~~~~~ 559 (1010)
.+.+-..||||-.+++...-+..+.+.+..|...++++-.+. | ++++-.+++-.|...-..+|+.......
T Consensus 394 ~ls~t~~lp~~A~~Sl~~gl~s~g~~aa~gla~~sag~~a~s----~---~asSv~s~s~~pd~ks~gv~~gsv~k~~-- 464 (788)
T KOG2109|consen 394 ALSLTCQLPAYADTSLDLGLQSSGGLAAEGLATSSAGYTAHS----Y---TASSVFSRSVKPDSKSVGVGSGSVTKAN-- 464 (788)
T ss_pred ccccccccchhhchhhhccccccCcccceeeeeccccccccc----c---ccceeeccccccchhhccceeeeccccC--
Confidence 445556789999999887777777777888888777765332 1 2223334445566666777777654321
Q ss_pred cccccCccccEEEEcCCc-cEEEEecccCCCCCCCCC-CCccccccccccCCCc-eeEeecccccceecccCCCcccccc
Q 001814 560 VNSRTNSLEHLLVYTPSG-YVVQHELLPSIGMGPSDD-GSRIRAASLMCLQEDD-LQVRVEPVQWWDVCRRSDWPEREEF 636 (1010)
Q Consensus 560 ~~s~~~~~~~LlV~s~~G-~l~~Y~L~p~~g~e~~~~-~~~~~~~~~~~~~~~~-~~~~vep~~~W~v~r~~~~~e~~~~ 636 (1010)
...+..+..|||+.|.| +++||-|.+.+++.-.+. ....+-.+ +...+++ .++.|+|.+.|+.|++-+|+|++++
T Consensus 465 -q~~~~~l~~llv~~psGd~vvqh~vahs~~gv~~Ef~~~~~l~lS-ad~~e~ef~~f~V~Ph~~wsslaav~hly~l~r 542 (788)
T KOG2109|consen 465 -QGVITVLNLLLVGEPSGDGVVQHYVAHSDPGVYIEFSPDQRLVLS-ADANENEFNIFLVMPHATWSSLAAVQHLYKLNR 542 (788)
T ss_pred -ccchhhhhheeeecCCCCceeEEEeeccCccceeeecccccceec-ccccccccceEEeecccccHHHhhhhhhhhccC
Confidence 13445678999999999 999999999988765443 22221211 3334567 9999999999999999999999997
Q ss_pred ccccccCCCCceeeeeccCCcccCCCcccccCCcceeccccccccCCCccccceeEeeeeeEeeccCCccccccceeEEE
Q 001814 637 ISEATCDGHGAVEIFQNKSDCEDNYGIDFLDINDCIVEKSTFKNCSVKSYERSHWYLSNAEVQMSSGRLPIWQSSKISFF 716 (1010)
Q Consensus 637 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~yls~aEvq~~~~~~piW~~~~i~F~ 716 (1010)
+.|.. +.+...... +..... ++. ....+.-+.+|-|+-+||+++|.. +|||+|.+ .||
T Consensus 543 --G~Tsa-----Kv~~~afs~-dsrw~A-------~~t-----~~~TthVfk~hpYgg~aeqrth~~-lp~vnk~s-rFh 600 (788)
T KOG2109|consen 543 --GSTSA-----KVVSTAFSE-DSRWLA-------ITT-----NHATTHVFKVHPYGGKAEQRTHGD-LPFVNKES-RFH 600 (788)
T ss_pred --CCccc-----eeeeeEeec-chhhhh-------hhh-----cCCceeeeeeccccccccceecCC-chhccchh-hhc
Confidence 44333 222111111 110000 100 113456778999999999999999 99999999 999
Q ss_pred EcCCccc------ccCCCcceEEeeeeceeEEEeccccccccccccccccccccc
Q 001814 717 KMDSPRA------NTHASGEFEIEKVSVHEVEIKRKELLPVFDHFQCIKPSWNNR 765 (1010)
Q Consensus 717 ~m~~~~~------~~~~~~e~eie~~~~~~~~~r~k~l~p~~~~~~~~~~~~~~~ 765 (1010)
-|..+.. +...++|.||+++.++++|+|+||||||++ +...+.|
T Consensus 601 rsagl~~d~~~~~s~ggg~e~ei~~~~~~t~e~r~~dllPvy~-----~tS~rsr 650 (788)
T KOG2109|consen 601 RSAGLTDDADVTASIGGGKEREIADSCSYTKEHRIADLLPVYA-----KTSGRSR 650 (788)
T ss_pred cccCCCccccccccCCCCccceecccccccccccccccCCccc-----ccCcccc
Confidence 9998654 233467999999999999999999999999 5555556
No 2
>PF12490 BCAS3: Breast carcinoma amplified sequence 3 ; InterPro: IPR022175 This domain family is found in eukaryotes, and is typically between 229 and 245 amino acids in length. The proteins in this family have been shown to be proto-oncogenes implicated in the development of breast cancer.
Probab=100.00 E-value=3.6e-53 Score=456.10 Aligned_cols=239 Identities=47% Similarity=0.747 Sum_probs=203.3
Q ss_pred CCceeeeeeeeeeecC-CccccccccccccccC-ccccccceeeeecccCccccccccccc-cCccccEEEEcCCccEEE
Q 001814 505 PPPVTLSVVSRIKYSS-FGWLNTVSNASASSMG-KVFVPSGAVAAVFHNSIAHSSQHVNSR-TNSLEHLLVYTPSGYVVQ 581 (1010)
Q Consensus 505 p~~~~l~~vsrIk~~~-~~w~~~v~~a~~~at~-~~~~ps~~va~~F~~~~~~~~~~~~s~-~~~~~~LlV~s~~G~l~~ 581 (1010)
|+|++|+||+|||+++ +||+|+++++++++++ +...+++++|+.||++..........+ .+.+++|||++|+|+|+|
T Consensus 1 P~Pv~l~~vsrIK~~~~~g~~~tv~~aassa~g~~~~~~sga~a~~f~~~~~~~~~~~~~~~~~~~~~LlV~spsG~Liq 80 (251)
T PF12490_consen 1 PPPVTLSVVSRIKQGNTLGWLNTVSNAASSATGGKPSSVSGAFASSFHNSKGSSSEPSDSSSSKAVESLLVFSPSGHLIQ 80 (251)
T ss_pred CCCEEechHHhhcCCccccccccccccccchhcCCcccceeEEccccccCCCCcccccccccccccceEEEECCCCcEEE
Confidence 6799999999999998 8999999999999998 889999999999999866555554444 788999999999999999
Q ss_pred EecccCCCCCCCCCCCccccccccccCCCceeEeecccccceecccCCCccccc-cccccccCCCCceeeeeccCCcccC
Q 001814 582 HELLPSIGMGPSDDGSRIRAASLMCLQEDDLQVRVEPVQWWDVCRRSDWPEREE-FISEATCDGHGAVEIFQNKSDCEDN 660 (1010)
Q Consensus 582 Y~L~p~~g~e~~~~~~~~~~~~~~~~~~~~~~~~vep~~~W~v~r~~~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 660 (1010)
|+|+|+.+++...++++.+++....++|++|||+|||+|||||||+.+|+||++ .+..+...++..+.++.....+ .
T Consensus 81 y~L~p~~~~~~~~~~~~~~~~~~~~~~~~~l~l~vep~~~Wdl~R~~~w~e~~~d~~~~~~~~~~~~~~~~~~~~~~--~ 158 (251)
T PF12490_consen 81 YELRPSPGSDPTEGGSGNGPPSESQMDDTELRLVVEPVQQWDLCRRPNWPEREEDCVPPLPENNPLDSASKIDPSDC--R 158 (251)
T ss_pred EEEeeccccCcccccccccCccccccccCcceEEeeeccceeEeccccCCccchhccCCCCCCCHhhhhhhcccccc--c
Confidence 999999999988888888887777778899999999999999999999999999 5545555555543333232322 3
Q ss_pred CCcccccCCcceeccccccccCCCccccceeEeeeeeEeeccCC-ccccccceeEEEEcCCccc-----ccCCC--cceE
Q 001814 661 YGIDFLDINDCIVEKSTFKNCSVKSYERSHWYLSNAEVQMSSGR-LPIWQSSKISFFKMDSPRA-----NTHAS--GEFE 732 (1010)
Q Consensus 661 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~yls~aEvq~~~~~-~piW~~~~i~F~~m~~~~~-----~~~~~--~e~e 732 (1010)
++.+..+++...+.+. +++++|++||||||||||||+++ +||||||||+||+|.++.. ++..+ ||||
T Consensus 159 ~~~~~~~~~~~~~~~~-----~~~~~e~~~~wlS~vEi~th~~phrpLW~gpQf~F~~~~~~~~~~~~~s~~~~~~~e~E 233 (251)
T PF12490_consen 159 KGNSVNPSNDSYVSKE-----SDSPEERDHWWLSNVEIQTHSGPHRPLWMGPQFSFKTMSSPSSSELNISSSSGEAGEIE 233 (251)
T ss_pred ccCCcccccccccccc-----CCCcccccCcEEeeeeeEeccCCccccccCCcEEEEEecCCCCccccccccccccCcee
Confidence 3666777766555555 77899999999999999999999 8999999999999998763 34556 9999
Q ss_pred EeeeeceeEEEecccccc
Q 001814 733 IEKVSVHEVEIKRKELLP 750 (1010)
Q Consensus 733 ie~~~~~~~~~r~k~l~p 750 (1010)
|||||+|+||+|||||||
T Consensus 234 IE~~~~~~ve~r~k~l~p 251 (251)
T PF12490_consen 234 IEKIPTREVEIRRKDLLP 251 (251)
T ss_pred eccccccceeeeccccCC
Confidence 999999999999999998
No 3
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=100.00 E-value=2e-46 Score=407.98 Aligned_cols=325 Identities=27% Similarity=0.445 Sum_probs=277.1
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
++.+..+.+.+|++ +..+|.+|..+|+++|.+++.+ +..+...+.+..++|| |+++ |
T Consensus 2 ~~~~~ti~~~~~Nq-------d~~~lsvGs~~Gyk~~~~~~~~---k~~~~~~~~~~IvEmL-----------FSSS--L 58 (391)
T KOG2110|consen 2 NGKKPTINFIGFNQ-------DSTLLSVGSKDGYKIFSCSPFE---KCFSKDTEGVSIVEML-----------FSSS--L 58 (391)
T ss_pred CCCCcceeeeeecc-------ceeEEEccCCCceeEEecCchH---HhhcccCCCeEEEEee-----------cccc--e
Confidence 45677889999997 6789999999999999998744 3555556889999999 8888 9
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEeCCe
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGLATQ 212 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~ld~~ 212 (1010)
||+|..+ -|++++++++|++..||.+.|+++|++|++|+++|+|++.++
T Consensus 59 vaiV~~~-------------------------------qpr~Lkv~~~Kk~~~ICe~~fpt~IL~VrmNr~RLvV~Lee~ 107 (391)
T KOG2110|consen 59 VAIVSIK-------------------------------QPRKLKVVHFKKKTTICEIFFPTSILAVRMNRKRLVVCLEES 107 (391)
T ss_pred eEEEecC-------------------------------CCceEEEEEcccCceEEEEecCCceEEEEEccceEEEEEccc
Confidence 9988531 158999999999999999999999999999999999999999
Q ss_pred EEEEECCCCceeEEEeec-CCccccCCCccccccCccceEEcc----ceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 213 IYCFDALTLENKFSVLTY-PVPQLAGQGAVGINVGYGPMAVGP----RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 213 I~IwD~~Tle~l~tL~t~-p~p~~~~~g~~~vnv~~gplAlgp----RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
|||||+.+|+.+++|.+. |+| .+.+|+++ .||||+++
T Consensus 108 IyIydI~~MklLhTI~t~~~n~-------------~gl~AlS~n~~n~ylAyp~s------------------------- 149 (391)
T KOG2110|consen 108 IYIYDIKDMKLLHTIETTPPNP-------------KGLCALSPNNANCYLAYPGS------------------------- 149 (391)
T ss_pred EEEEecccceeehhhhccCCCc-------------cceEeeccCCCCceEEecCC-------------------------
Confidence 999999999999999986 553 46788876 47777652
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
...|.|+|||+.+.+.+..|.|
T Consensus 150 ----------------------------------------------------------~t~GdV~l~d~~nl~~v~~I~a 171 (391)
T KOG2110|consen 150 ----------------------------------------------------------TTSGDVVLFDTINLQPVNTINA 171 (391)
T ss_pred ----------------------------------------------------------CCCceEEEEEcccceeeeEEEe
Confidence 1257899999999999999999
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
|.++|.||+|||+|++|||||++|++||||.+.. | +++|+||||...+.|++|+|+||+++|+
T Consensus 172 H~~~lAalafs~~G~llATASeKGTVIRVf~v~~-------G----------~kl~eFRRG~~~~~IySL~Fs~ds~~L~ 234 (391)
T KOG2110|consen 172 HKGPLAALAFSPDGTLLATASEKGTVIRVFSVPE-------G----------QKLYEFRRGTYPVSIYSLSFSPDSQFLA 234 (391)
T ss_pred cCCceeEEEECCCCCEEEEeccCceEEEEEEcCC-------c----------cEeeeeeCCceeeEEEEEEECCCCCeEE
Confidence 9999999999999999999999999999999942 5 6999999999988999999999999999
Q ss_pred EEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCCCCCCcccCCCCCccCccCCCCCceeeeeeeeeeecCCcccccc
Q 001814 448 IVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPVLSLPWWCTSSGISEQQCVLPPPPVTLSVVSRIKYSSFGWLNTV 527 (1010)
Q Consensus 448 sgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv~~lpw~~~ss~~~~q~~~p~p~~~~l~~vsrIk~~~~~w~~~v 527 (1010)
++|+.+|||||+|+..... + + -.|. ...+|.+.+
T Consensus 235 ~sS~TeTVHiFKL~~~~~~----------------~---------------~---~~p~------------~~~~~~~~~ 268 (391)
T KOG2110|consen 235 ASSNTETVHIFKLEKVSNN----------------P---------------P---ESPT------------AGTSWFGKV 268 (391)
T ss_pred EecCCCeEEEEEecccccC----------------C---------------C---CCCC------------CCCcccchh
Confidence 9999999999999864210 0 0 0111 135788888
Q ss_pred ccccccccCccccccceeeeecccCcc--ccccc--------cccccCccccEEEEcCCccEEEEecccCCCCCCCCCC
Q 001814 528 SNASASSMGKVFVPSGAVAAVFHNSIA--HSSQH--------VNSRTNSLEHLLVYTPSGYVVQHELLPSIGMGPSDDG 596 (1010)
Q Consensus 528 ~~a~~~at~~~~~ps~~va~~F~~~~~--~~~~~--------~~s~~~~~~~LlV~s~~G~l~~Y~L~p~~g~e~~~~~ 596 (1010)
++++.+ |+|++ |+.++++.|. +.+++ .+..+++.++++|++.|||+|.|+|+|++||||.++.
T Consensus 269 sk~~~s-----ylps~-V~~~~~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~~y~y~l~~~~gGec~lik 341 (391)
T KOG2110|consen 269 SKAATS-----YLPSQ-VSSVLDQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGHLYSYRLPPKEGGECALIK 341 (391)
T ss_pred hhhhhh-----hcchh-hhhhhhhccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCeEEEEEcCCCCCceeEEEE
Confidence 887666 99999 9999999986 33444 4567899999999999999999999999999998864
No 4
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=100.00 E-value=6.8e-36 Score=321.18 Aligned_cols=316 Identities=25% Similarity=0.328 Sum_probs=250.2
Q ss_pred CcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeee--eccCCEEEEEEecCCCCCCCCCCccccCcEEE
Q 001814 57 DQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVS--KRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLL 134 (1010)
Q Consensus 57 d~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS--~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLA 134 (1010)
-+.+-+.|++ +..++++|.++||+||++++ +.+..+ .+++.+..++|| |+.+ +||
T Consensus 6 ~~~lsvs~NQ-------D~ScFava~~~Gfriyn~~P---~ke~~~r~~~~~G~~~veML-----------fR~N--~la 62 (346)
T KOG2111|consen 6 PKTLSVSFNQ-------DHSCFAVATDTGFRIYNCDP---FKESASRQFIDGGFKIVEML-----------FRSN--YLA 62 (346)
T ss_pred CceeEEEEcc-------CCceEEEEecCceEEEecCc---hhhhhhhccccCchhhhhHh-----------hhhc--eEE
Confidence 3455688997 56799999999999999987 444333 345668899999 8886 999
Q ss_pred EEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEeCCeEE
Q 001814 135 VVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGLATQIY 214 (1010)
Q Consensus 135 vVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~ 214 (1010)
+|.|++. +-++|++|.|||.....++.++.|.++|.+|.+.++.|||.++.+|+
T Consensus 63 LVGGg~~--------------------------pky~pNkviIWDD~k~~~i~el~f~~~I~~V~l~r~riVvvl~~~I~ 116 (346)
T KOG2111|consen 63 LVGGGSR--------------------------PKYPPNKVIIWDDLKERCIIELSFNSEIKAVKLRRDRIVVVLENKIY 116 (346)
T ss_pred EecCCCC--------------------------CCCCCceEEEEecccCcEEEEEEeccceeeEEEcCCeEEEEecCeEE
Confidence 9865321 23468999999999999999999999999999999999999999999
Q ss_pred EEECC-CCceeEEEeecCCccccCCCccccccCccceEEcc----ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 215 CFDAL-TLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP----RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 215 IwD~~-Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp----RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
||... +++.++.+.|.++| .|.+++.| .+|||++
T Consensus 117 VytF~~n~k~l~~~et~~NP-------------kGlC~~~~~~~k~~LafPg---------------------------- 155 (346)
T KOG2111|consen 117 VYTFPDNPKLLHVIETRSNP-------------KGLCSLCPTSNKSLLAFPG---------------------------- 155 (346)
T ss_pred EEEcCCChhheeeeecccCC-------------CceEeecCCCCceEEEcCC----------------------------
Confidence 99987 78889999987775 35667654 3344433
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE--EEEecc
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI--ISQFKA 367 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~--v~~~~a 367 (1010)
-..|.|+|-|+...+. ...+.|
T Consensus 156 --------------------------------------------------------~k~GqvQi~dL~~~~~~~p~~I~A 179 (346)
T KOG2111|consen 156 --------------------------------------------------------FKTGQVQIVDLASTKPNAPSIINA 179 (346)
T ss_pred --------------------------------------------------------CccceEEEEEhhhcCcCCceEEEc
Confidence 2347788888876654 468899
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
|.++|++|+++-+|++|||||.+||.|||||... | .++++||||...|.|++|+||||+.|||
T Consensus 180 H~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~-------g----------~~l~E~RRG~d~A~iy~iaFSp~~s~La 242 (346)
T KOG2111|consen 180 HDSDIACVALNLQGTLVATASTKGTLIRIFDTED-------G----------TLLQELRRGVDRADIYCIAFSPNSSWLA 242 (346)
T ss_pred ccCceeEEEEcCCccEEEEeccCcEEEEEEEcCC-------C----------cEeeeeecCCchheEEEEEeCCCccEEE
Confidence 9999999999999999999999999999999864 5 5899999999999999999999999999
Q ss_pred EEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCC-CCCCcccCCCCCccCccCCCCCceeeeeeeeeeecCCccccc
Q 001814 448 IVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPV-LSLPWWCTSSGISEQQCVLPPPPVTLSVVSRIKYSSFGWLNT 526 (1010)
Q Consensus 448 sgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv-~~lpw~~~ss~~~~q~~~p~p~~~~l~~vsrIk~~~~~w~~~ 526 (1010)
++|++||+|||.+.... .+.+.|+ .++.. ..||.|+.|.|+..|+.+|..+
T Consensus 243 vsSdKgTlHiF~l~~~~--~~~~~~S------Sl~~~~~~lpky~~S~wS~~~f~l~~~~-------------------- 294 (346)
T KOG2111|consen 243 VSSDKGTLHIFSLRDTE--NTEDESS------SLSFKRLVLPKYFSSEWSFAKFQLPQGT-------------------- 294 (346)
T ss_pred EEcCCCeEEEEEeecCC--CCccccc------cccccccccchhcccceeEEEEEccCCC--------------------
Confidence 99999999999997632 1222232 12222 3689999999999887665221
Q ss_pred cccccccccCccccccceeeeecccCccccccccccccCccccEEEEcCCccEEEEecccCCCCCCCC
Q 001814 527 VSNASASSMGKVFVPSGAVAAVFHNSIAHSSQHVNSRTNSLEHLLVYTPSGYVVQHELLPSIGMGPSD 594 (1010)
Q Consensus 527 v~~a~~~at~~~~~ps~~va~~F~~~~~~~~~~~~s~~~~~~~LlV~s~~G~l~~Y~L~p~~g~e~~~ 594 (1010)
.++++ |.+.. ..+++...||.-|.|.++|.+||+|..
T Consensus 295 ----------------~~~~~-fg~~~--------------nsvi~i~~Dgsy~k~~f~~~~~g~~~~ 331 (346)
T KOG2111|consen 295 ----------------QCIIA-FGSET--------------NTVIAICADGSYYKFKFDPKNGGESSR 331 (346)
T ss_pred ----------------cEEEE-ecCCC--------------CeEEEEEeCCcEEEEEeccccccchhh
Confidence 21232 32211 258889999999999999999999963
No 5
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.90 E-value=1.9e-22 Score=221.33 Aligned_cols=310 Identities=15% Similarity=0.154 Sum_probs=217.0
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
..++|-+.|+.+.|- +++..|++|..+ ++|+||++ +..-..++.+|..+|-+|.+.|++.
T Consensus 110 S~~GH~e~Vl~~~fs-------p~g~~l~tGsGD~TvR~WD~~-TeTp~~t~KgH~~WVlcvawsPDgk----------- 170 (480)
T KOG0271|consen 110 SIAGHGEAVLSVQFS-------PTGSRLVTGSGDTTVRLWDLD-TETPLFTCKGHKNWVLCVAWSPDGK----------- 170 (480)
T ss_pred ccCCCCCcEEEEEec-------CCCceEEecCCCceEEeeccC-CCCcceeecCCccEEEEEEECCCcc-----------
Confidence 467899999999996 366788888754 69999995 4556788899999999999999872
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEE-EEEeCC-CcEEEEEEcC-----
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYE-HVLRFR-SSVCMVRCSP----- 202 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V-~tL~f~-S~V~sVa~S~----- 202 (1010)
.+|. | . .+++|++||.++|+++ ..|+.| ..|.++++.|
T Consensus 171 --~iAS--G---------------------~----------~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl~p 215 (480)
T KOG0271|consen 171 --KIAS--G---------------------S----------KDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHLVP 215 (480)
T ss_pred --hhhc--c---------------------c----------cCCeEEEecCCCCCcccccccCcccceeEEeecccccCC
Confidence 2442 1 0 2478999999998864 567666 4899999976
Q ss_pred --CeEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc-ceEEEccC--CeeeccC--CccCC
Q 001814 203 --RIVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP-RWLAYASN--TLLLSNS--GRLSP 274 (1010)
Q Consensus 203 --rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp-RwLAyas~--~~~iwd~--G~vs~ 274 (1010)
++||.+. |+.|+|||+.-.++++++..|..+ ...+..|. .||+.++. ++.+|+. |.+ -
T Consensus 216 ~~r~las~skDg~vrIWd~~~~~~~~~lsgHT~~-------------VTCvrwGG~gliySgS~DrtIkvw~a~dG~~-~ 281 (480)
T KOG0271|consen 216 PCRRLASSSKDGSVRIWDTKLGTCVRTLSGHTAS-------------VTCVRWGGEGLIYSGSQDRTIKVWRALDGKL-C 281 (480)
T ss_pred CccceecccCCCCEEEEEccCceEEEEeccCccc-------------eEEEEEcCCceEEecCCCceEEEEEccchhH-H
Confidence 5777655 678999999999999999998874 34566665 44444432 4677863 322 0
Q ss_pred CcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccc---------------eeeccccccccCCCCCCCccCCCcccccc
Q 001814 275 QNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLS---------------KTLSKYCQELLPDGSSSPVSPNSVWKVGR 339 (1010)
Q Consensus 275 Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~---------------ktls~y~~~l~p~gs~s~~S~s~~~k~~~ 339 (1010)
..++. .+..|.+.|...--.|-.|.+ +.+.+|-.. ..++ +
T Consensus 282 r~lkG----------HahwvN~lalsTdy~LRtgaf~~t~~~~~~~se~~~~Al~rY~~~-~~~~--------~------ 336 (480)
T KOG0271|consen 282 RELKG----------HAHWVNHLALSTDYVLRTGAFDHTGRKPKSFSEEQKKALERYEAV-LKDS--------G------ 336 (480)
T ss_pred Hhhcc----------cchheeeeeccchhhhhccccccccccCCChHHHHHHHHHHHHHh-hccC--------c------
Confidence 11110 011111111100000001110 122333111 0000 0
Q ss_pred ccccccCCCCeEEEEECCC-CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 340 HAGADMDNAGIVVVKDFVT-RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 340 ~~iasgs~dG~V~VwDl~s-~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
--+.+++.|+++.+|+-.. .+.+..+.+|..-|+.+.|||||+++|+||. +..||+|+-.+ |
T Consensus 337 erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~lVn~V~fSPd~r~IASaSF-DkSVkLW~g~t-------G--------- 399 (480)
T KOG0271|consen 337 ERLVSGSDDFTLFLWNPFKSKKPITRMTGHQALVNHVSFSPDGRYIASASF-DKSVKLWDGRT-------G--------- 399 (480)
T ss_pred ceeEEecCCceEEEecccccccchhhhhchhhheeeEEECCCccEEEEeec-ccceeeeeCCC-------c---------
Confidence 0135788999999999765 4588889999999999999999999999999 46699999753 5
Q ss_pred ceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccccccc
Q 001814 419 HVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLS 473 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~ 473 (1010)
+.|-.| ||+ -+.|+-++||.|++.|++||.|.|++||++...+-...+-.|.
T Consensus 400 -k~lasf-RGH-v~~VYqvawsaDsRLlVS~SkDsTLKvw~V~tkKl~~DLpGh~ 451 (480)
T KOG0271|consen 400 -KFLASF-RGH-VAAVYQVAWSADSRLLVSGSKDSTLKVWDVRTKKLKQDLPGHA 451 (480)
T ss_pred -chhhhh-hhc-cceeEEEEeccCccEEEEcCCCceEEEEEeeeeeecccCCCCC
Confidence 466666 675 4679999999999999999999999999999887766666663
No 6
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.89 E-value=4.5e-22 Score=233.59 Aligned_cols=230 Identities=20% Similarity=0.246 Sum_probs=171.0
Q ss_pred CCCeEEEEEecCc-EEEEEccCC------------------------------CcceEeeeeccCCEEEEEEecCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDA------------------------------SNFNELVSKRDGPVSFLQMQPFPVKDD 121 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~------------------------------g~v~ellS~hdGpV~~v~~lP~p~~s~ 121 (1010)
.+.+.||.|..+. ++||.+.+. +...+.+-+|.|||..+.|.|+-
T Consensus 388 ddssmlA~Gf~dS~i~~~Sl~p~kl~~lk~~~~l~~~d~~sad~~~~~~D~~~~~~~~~L~GH~GPVyg~sFsPd~---- 463 (707)
T KOG0263|consen 388 DDSSMLACGFVDSSVRVWSLTPKKLKKLKDASDLSNIDTESADVDVDMLDDDSSGTSRTLYGHSGPVYGCSFSPDR---- 463 (707)
T ss_pred CCcchhhccccccEEEEEecchhhhccccchhhhccccccccchhhhhccccCCceeEEeecCCCceeeeeecccc----
Confidence 3567999999876 899999741 11223466788999999988763
Q ss_pred CCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEE
Q 001814 122 GCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRC 200 (1010)
Q Consensus 122 ~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~ 200 (1010)
.+|+.++. |.+||+|++.+..++..++.|. +|++|+|
T Consensus 464 ---------rfLlScSE---------------------------------D~svRLWsl~t~s~~V~y~GH~~PVwdV~F 501 (707)
T KOG0263|consen 464 ---------RFLLSCSE---------------------------------DSSVRLWSLDTWSCLVIYKGHLAPVWDVQF 501 (707)
T ss_pred ---------cceeeccC---------------------------------CcceeeeecccceeEEEecCCCcceeeEEe
Confidence 46664321 4789999999999999998775 9999999
Q ss_pred cCC--eEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcC
Q 001814 201 SPR--IVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNL 277 (1010)
Q Consensus 201 S~r--lLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~l 277 (1010)
+|+ ++|++. |.+-++|.....+.+..+.+|-+ ..++++|.| |
T Consensus 502 ~P~GyYFatas~D~tArLWs~d~~~PlRifaghls-------------DV~cv~FHP-------N--------------- 546 (707)
T KOG0263|consen 502 APRGYYFATASHDQTARLWSTDHNKPLRIFAGHLS-------------DVDCVSFHP-------N--------------- 546 (707)
T ss_pred cCCceEEEecCCCceeeeeecccCCchhhhccccc-------------ccceEEECC-------c---------------
Confidence 994 556554 55789997554333332222211 011122221 1
Q ss_pred CCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECC
Q 001814 278 TPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFV 357 (1010)
Q Consensus 278 t~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~ 357 (1010)
.. .+++|+.|.+|++||+.
T Consensus 547 ---------------------------------------------------------s~----Y~aTGSsD~tVRlWDv~ 565 (707)
T KOG0263|consen 547 ---------------------------------------------------------SN----YVATGSSDRTVRLWDVS 565 (707)
T ss_pred ---------------------------------------------------------cc----ccccCCCCceEEEEEcC
Confidence 11 12356789999999999
Q ss_pred CCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEE
Q 001814 358 TRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDI 437 (1010)
Q Consensus 358 s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sI 437 (1010)
+|..+..|.+|+.||.+|+|||+|.+||+|+++|. |+|||+.. | ..+.++ +|+ .+.|++|
T Consensus 566 ~G~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~-I~iWDl~~-------~----------~~v~~l-~~H-t~ti~Sl 625 (707)
T KOG0263|consen 566 TGNSVRIFTGHKGPVTALAFSPCGRYLASGDEDGL-IKIWDLAN-------G----------SLVKQL-KGH-TGTIYSL 625 (707)
T ss_pred CCcEEEEecCCCCceEEEEEcCCCceEeecccCCc-EEEEEcCC-------C----------cchhhh-hcc-cCceeEE
Confidence 99999999999999999999999999999999765 99999953 3 244444 666 5679999
Q ss_pred EEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 438 CFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 438 AFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
+||.||..||+++.|.+|++|++...-
T Consensus 626 sFS~dg~vLasgg~DnsV~lWD~~~~~ 652 (707)
T KOG0263|consen 626 SFSRDGNVLASGGADNSVRLWDLTKVI 652 (707)
T ss_pred EEecCCCEEEecCCCCeEEEEEchhhc
Confidence 999999999999999999999987543
No 7
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.87 E-value=2.7e-21 Score=214.52 Aligned_cols=275 Identities=19% Similarity=0.167 Sum_probs=205.4
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
-++.-.|..+.|-. +...|++|..+| .+||+++. .+...+|-+|.+.|.++.|-|.- +..
T Consensus 172 ~gd~rPis~~~fS~-------ds~~laT~swsG~~kvW~~~~-~~~~~~l~gH~~~v~~~~fhP~~-----------~~~ 232 (459)
T KOG0272|consen 172 VGDTRPISGCSFSR-------DSKHLATGSWSGLVKVWSVPQ-CNLLQTLRGHTSRVGAAVFHPVD-----------SDL 232 (459)
T ss_pred ccCCCcceeeEeec-------CCCeEEEeecCCceeEeecCC-cceeEEEeccccceeeEEEccCC-----------Ccc
Confidence 34556678888864 677999999888 79999965 47788899999999999999752 001
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEE-
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAV- 207 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV- 207 (1010)
-||.++ .|++|++|++.+.+.+..|+.| ..|-.|+|+| ++|+.
T Consensus 233 ~lat~s---------------------------------~Dgtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~L~Ta 279 (459)
T KOG0272|consen 233 NLATAS---------------------------------ADGTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKFLGTA 279 (459)
T ss_pred ceeeec---------------------------------cCCceeeeccCCCcchhhhhcchhhheeeeecCCCceeeec
Confidence 234322 3588999999998999999877 4899999998 67776
Q ss_pred EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEccC--CeeeccC--CccCCCcCCCCC
Q 001814 208 GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYASN--TLLLSNS--GRLSPQNLTPSG 281 (1010)
Q Consensus 208 ~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~--~~~iwd~--G~vs~Q~lt~p~ 281 (1010)
+.|.+-++||+.|.+.+.-..+|..+ .-.+|+.+ ..+|..+. .-++||. |+.
T Consensus 280 sfD~tWRlWD~~tk~ElL~QEGHs~~-------------v~~iaf~~DGSL~~tGGlD~~~RvWDlRtgr~--------- 337 (459)
T KOG0272|consen 280 SFDSTWRLWDLETKSELLLQEGHSKG-------------VFSIAFQPDGSLAATGGLDSLGRVWDLRTGRC--------- 337 (459)
T ss_pred ccccchhhcccccchhhHhhcccccc-------------cceeEecCCCceeeccCccchhheeecccCcE---------
Confidence 56778999999998887777777552 23466655 44444443 2357773 221
Q ss_pred CCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE
Q 001814 282 VSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI 361 (1010)
Q Consensus 282 vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~ 361 (1010)
+ -.|.++...++. +. |.+++..+++++.|++++|||+...+.
T Consensus 338 ------------i----------------m~L~gH~k~I~~------V~----fsPNGy~lATgs~Dnt~kVWDLR~r~~ 379 (459)
T KOG0272|consen 338 ------------I----------------MFLAGHIKEILS------VA----FSPNGYHLATGSSDNTCKVWDLRMRSE 379 (459)
T ss_pred ------------E----------------EEecccccceee------Ee----ECCCceEEeecCCCCcEEEeeeccccc
Confidence 0 112222222221 12 223445678999999999999999999
Q ss_pred EEEeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEc
Q 001814 362 ISQFKAHTSPISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFS 440 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFS 440 (1010)
+.+|.||++-|+.++|+| .|.+|||||.|.+ ++||.... + .++..| .||. .+|.++..|
T Consensus 380 ly~ipAH~nlVS~Vk~~p~~g~fL~TasyD~t-~kiWs~~~-------~----------~~~ksL-aGHe-~kV~s~Dis 439 (459)
T KOG0272|consen 380 LYTIPAHSNLVSQVKYSPQEGYFLVTASYDNT-VKIWSTRT-------W----------SPLKSL-AGHE-GKVISLDIS 439 (459)
T ss_pred ceecccccchhhheEecccCCeEEEEcccCcc-eeeecCCC-------c----------ccchhh-cCCc-cceEEEEec
Confidence 999999999999999999 8899999999655 99998642 2 355555 5754 579999999
Q ss_pred cCCCEEEEEeCCCeEEEEe
Q 001814 441 HYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~ 459 (1010)
+||++|++++.|.|+++|.
T Consensus 440 ~d~~~i~t~s~DRT~KLW~ 458 (459)
T KOG0272|consen 440 PDSQAIATSSFDRTIKLWR 458 (459)
T ss_pred cCCceEEEeccCceeeecc
Confidence 9999999999999999995
No 8
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.87 E-value=1.2e-20 Score=198.55 Aligned_cols=266 Identities=18% Similarity=0.229 Sum_probs=181.4
Q ss_pred EEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCC-CC-CC-----ccccC--cEEEEEecCCCCCCCCC
Q 001814 77 VLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDD-GC-EG-----FRKLH--PFLLVVAGEDTNTLAPG 147 (1010)
Q Consensus 77 vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~-~~-D~-----F~~sr--pLLAvVsgd~~~~s~~~ 147 (1010)
+...||+.+||+|.. .+|.+..++...|+.|..+++.|+...-+ ++ .. .++.. |+.-+.+
T Consensus 13 LvsA~YDhTIRfWqa-~tG~C~rTiqh~dsqVNrLeiTpdk~~LAaa~~qhvRlyD~~S~np~Pv~t~e~---------- 81 (311)
T KOG0315|consen 13 LVSAGYDHTIRFWQA-LTGICSRTIQHPDSQVNRLEITPDKKDLAAAGNQHVRLYDLNSNNPNPVATFEG---------- 81 (311)
T ss_pred EEeccCcceeeeeeh-hcCeEEEEEecCccceeeEEEcCCcchhhhccCCeeEEEEccCCCCCceeEEec----------
Confidence 445678899999998 46999999999999999999999864421 11 11 11111 2211110
Q ss_pred CCCCCcccc---ccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCC--eEEEE-eCCeEEEEECCCC
Q 001814 148 QNRSHLGGV---RDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPR--IVAVG-LATQIYCFDALTL 221 (1010)
Q Consensus 148 q~~~~~~~v---r~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~r--lLAV~-ld~~I~IwD~~Tl 221 (1010)
+ ...+.++ .+|.|....++ +++|||||++.-.+-+.+++.++|..|..+|+ -|+++ .++.|++||+.+-
T Consensus 82 h-~kNVtaVgF~~dgrWMyTgse----Dgt~kIWdlR~~~~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~ 156 (311)
T KOG0315|consen 82 H-TKNVTAVGFQCDGRWMYTGSE----DGTVKIWDLRSLSCQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGEN 156 (311)
T ss_pred c-CCceEEEEEeecCeEEEecCC----CceEEEEeccCcccchhccCCCCcceEEecCCcceEEeecCCCcEEEEEccCC
Confidence 0 0222333 78888765444 79999999999888888999999999999983 45555 4578999999875
Q ss_pred ceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhh
Q 001814 222 ENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEH 301 (1010)
Q Consensus 222 e~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~ds 301 (1010)
.+-..+. |.+ ......+++. |++..+
T Consensus 157 ~c~~~li--Pe~----------~~~i~sl~v~----------------------------------~dgsml-------- 182 (311)
T KOG0315|consen 157 SCTHELI--PED----------DTSIQSLTVM----------------------------------PDGSML-------- 182 (311)
T ss_pred ccccccC--CCC----------CcceeeEEEc----------------------------------CCCcEE--------
Confidence 4432221 110 0001111111 111111
Q ss_pred hhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC------cEEEEeccCCCCeEEE
Q 001814 302 SKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR------AIISQFKAHTSPISAL 375 (1010)
Q Consensus 302 sk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~------~~v~~~~aHtspIsaL 375 (1010)
+.+...|...||++-.. ..+..|++|..-|...
T Consensus 183 -----------------------------------------~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C 221 (311)
T KOG0315|consen 183 -----------------------------------------AAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRC 221 (311)
T ss_pred -----------------------------------------EEecCCccEEEEEccCCCccccceEhhheecccceEEEE
Confidence 12356788899998654 3677899999999999
Q ss_pred EECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeE
Q 001814 376 CFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTC 455 (1010)
Q Consensus 376 aFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTV 455 (1010)
-||||+++|||+|.| ++++||++... -.+-....|+ ..++|+.+||.||+||++++.|+++
T Consensus 222 ~lSPd~k~lat~ssd-ktv~iwn~~~~-----------------~kle~~l~gh-~rWvWdc~FS~dg~YlvTassd~~~ 282 (311)
T KOG0315|consen 222 LLSPDVKYLATCSSD-KTVKIWNTDDF-----------------FKLELVLTGH-QRWVWDCAFSADGEYLVTASSDHTA 282 (311)
T ss_pred EECCCCcEEEeecCC-ceEEEEecCCc-----------------eeeEEEeecC-CceEEeeeeccCccEEEecCCCCce
Confidence 999999999999995 66999998531 0111111332 3589999999999999999999999
Q ss_pred EEEeCCCCCCccccccc
Q 001814 456 HVFVLSPFGGDSGFQTL 472 (1010)
Q Consensus 456 hIw~I~~~gg~~~~~~H 472 (1010)
|+|++...+.....++|
T Consensus 283 rlW~~~~~k~v~qy~gh 299 (311)
T KOG0315|consen 283 RLWDLSAGKEVRQYQGH 299 (311)
T ss_pred eecccccCceeeecCCc
Confidence 99999876554444455
No 9
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.87 E-value=5.2e-21 Score=210.04 Aligned_cols=291 Identities=15% Similarity=0.140 Sum_probs=199.4
Q ss_pred CCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCC
Q 001814 72 SVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNR 150 (1010)
Q Consensus 72 ~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~ 150 (1010)
+|+++.|+.|..+| |++||-...+.....|.+|.-+|.++++.|-.... ..| +||..+
T Consensus 166 sPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl~p-------~~r-~las~s------------- 224 (480)
T KOG0271|consen 166 SPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHLVP-------PCR-RLASSS------------- 224 (480)
T ss_pred CCCcchhhccccCCeEEEecCCCCCcccccccCcccceeEEeecccccCC-------Ccc-ceeccc-------------
Confidence 47899999999887 99999876677778899999999999999865321 122 445311
Q ss_pred CCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC-CeEEEEe-CCeEEEEECCCCceeEEE
Q 001814 151 SHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP-RIVAVGL-ATQIYCFDALTLENKFSV 227 (1010)
Q Consensus 151 ~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~-rlLAV~l-d~~I~IwD~~Tle~l~tL 227 (1010)
.++.|+|||++.++++..+..|. +|.+|++-. .+|..+. |.+|++|++..+++..+|
T Consensus 225 --------------------kDg~vrIWd~~~~~~~~~lsgHT~~VTCvrwGG~gliySgS~DrtIkvw~a~dG~~~r~l 284 (480)
T KOG0271|consen 225 --------------------KDGSVRIWDTKLGTCVRTLSGHTASVTCVRWGGEGLIYSGSQDRTIKVWRALDGKLCREL 284 (480)
T ss_pred --------------------CCCCEEEEEccCceEEEEeccCccceEEEEEcCCceEEecCCCceEEEEEccchhHHHhh
Confidence 25789999999999999998775 899999986 6777765 568999999999988888
Q ss_pred eecCCccccCCCccccccCccceEEccceE----EEccCCeeeccCCccCC----------CcCCC-CCCC----CCcCC
Q 001814 228 LTYPVPQLAGQGAVGINVGYGPMAVGPRWL----AYASNTLLLSNSGRLSP----------QNLTP-SGVS----PSTSP 288 (1010)
Q Consensus 228 ~t~p~p~~~~~g~~~vnv~~gplAlgpRwL----Ayas~~~~iwd~G~vs~----------Q~lt~-p~vS----~stSP 288 (1010)
..|..+ .+.+|++..|. ||-.. |+... +.+.. -..+ .+.|
T Consensus 285 kGHahw-------------vN~lalsTdy~LRtgaf~~t-------~~~~~~~se~~~~Al~rY~~~~~~~~erlVSgs- 343 (480)
T KOG0271|consen 285 KGHAHW-------------VNHLALSTDYVLRTGAFDHT-------GRKPKSFSEEQKKALERYEAVLKDSGERLVSGS- 343 (480)
T ss_pred cccchh-------------eeeeeccchhhhhccccccc-------cccCCChHHHHHHHHHHHHHhhccCcceeEEec-
Confidence 887653 35677764221 22110 00000 00000 0000 0000
Q ss_pred CCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH 368 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH 368 (1010)
+...+...--.++.|-+ ..+.++... ....+++|+.++ +++++.|..|++||..+|+-++.|++|
T Consensus 344 Dd~tlflW~p~~~kkpi-----~rmtgHq~l------Vn~V~fSPd~r~----IASaSFDkSVkLW~g~tGk~lasfRGH 408 (480)
T KOG0271|consen 344 DDFTLFLWNPFKSKKPI-----TRMTGHQAL------VNHVSFSPDGRY----IASASFDKSVKLWDGRTGKFLASFRGH 408 (480)
T ss_pred CCceEEEecccccccch-----hhhhchhhh------eeeEEECCCccE----EEEeecccceeeeeCCCcchhhhhhhc
Confidence 00111000000000000 112222221 123456666664 458999999999999999999999999
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
-.+|..++||.|.+||+++|. +++++||++..- +..+.| -|+ ...|+++.|||||+.+|+
T Consensus 409 v~~VYqvawsaDsRLlVS~Sk-DsTLKvw~V~tk-----------------Kl~~DL-pGh-~DEVf~vDwspDG~rV~s 468 (480)
T KOG0271|consen 409 VAAVYQVAWSADSRLLVSGSK-DSTLKVWDVRTK-----------------KLKQDL-PGH-ADEVFAVDWSPDGQRVAS 468 (480)
T ss_pred cceeEEEEeccCccEEEEcCC-CceEEEEEeeee-----------------eecccC-CCC-CceEEEEEecCCCceeec
Confidence 999999999999999999999 566999999631 233444 453 347999999999999999
Q ss_pred EeCCCeEEEEe
Q 001814 449 VSSKGTCHVFV 459 (1010)
Q Consensus 449 gS~dGTVhIw~ 459 (1010)
|+.|..+++|.
T Consensus 469 ggkdkv~~lw~ 479 (480)
T KOG0271|consen 469 GGKDKVLRLWR 479 (480)
T ss_pred CCCceEEEeec
Confidence 99999999995
No 10
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.86 E-value=7.3e-19 Score=179.10 Aligned_cols=276 Identities=20% Similarity=0.261 Sum_probs=194.8
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
..+|++.|..+.|.. ++++|++|..+ .+++||+.. +.....+..|..++..+.+.|+.
T Consensus 5 ~~~h~~~i~~~~~~~-------~~~~l~~~~~~g~i~i~~~~~-~~~~~~~~~~~~~i~~~~~~~~~------------- 63 (289)
T cd00200 5 LKGHTGGVTCVAFSP-------DGKLLATGSGDGTIKVWDLET-GELLRTLKGHTGPVRDVAASADG------------- 63 (289)
T ss_pred hcccCCCEEEEEEcC-------CCCEEEEeecCcEEEEEEeeC-CCcEEEEecCCcceeEEEECCCC-------------
Confidence 346889999999974 45677777755 599999964 55666777788899888888654
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAV 207 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV 207 (1010)
.+|++++ .++.|++||+.+++.+..+..+ ..|.++.+++ +++++
T Consensus 64 ~~l~~~~---------------------------------~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 110 (289)
T cd00200 64 TYLASGS---------------------------------SDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSS 110 (289)
T ss_pred CEEEEEc---------------------------------CCCeEEEEEcCcccceEEEeccCCcEEEEEEcCCCCEEEE
Confidence 2455421 1368999999998888888765 4899999988 67777
Q ss_pred Ee-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEcc--CCeeeccCCccCCCcCCCCCC
Q 001814 208 GL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYAS--NTLLLSNSGRLSPQNLTPSGV 282 (1010)
Q Consensus 208 ~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas--~~~~iwd~G~vs~Q~lt~p~v 282 (1010)
+. ++.|++||+.+.+....+..+..+ ...+++.+ ++|+... ..+.+|+....
T Consensus 111 ~~~~~~i~~~~~~~~~~~~~~~~~~~~-------------i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~---------- 167 (289)
T cd00200 111 SSRDKTIKVWDVETGKCLTTLRGHTDW-------------VNSVAFSPDGTFVASSSQDGTIKLWDLRTG---------- 167 (289)
T ss_pred ecCCCeEEEEECCCcEEEEEeccCCCc-------------EEEEEEcCcCCEEEEEcCCCcEEEEEcccc----------
Confidence 77 789999999988877777644431 24466666 6666654 34667763100
Q ss_pred CCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEE
Q 001814 283 SPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAII 362 (1010)
Q Consensus 283 S~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v 362 (1010)
..+ +.+..+ ......+.+++..+ .++.+..+|.|.+||+..++.+
T Consensus 168 ---------~~~----------------~~~~~~------~~~i~~~~~~~~~~----~l~~~~~~~~i~i~d~~~~~~~ 212 (289)
T cd00200 168 ---------KCV----------------ATLTGH------TGEVNSVAFSPDGE----KLLSSSSDGTIKLWDLSTGKCL 212 (289)
T ss_pred ---------ccc----------------eeEecC------ccccceEEECCCcC----EEEEecCCCcEEEEECCCCcee
Confidence 000 000000 00001111122211 1234556999999999999999
Q ss_pred EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccC
Q 001814 363 SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 363 ~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
..+..|..+|.+++|+|++.++++++.+|. |++|++.. + ..+..+. + +...|.+++|+|+
T Consensus 213 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~-i~i~~~~~-------~----------~~~~~~~-~-~~~~i~~~~~~~~ 272 (289)
T cd00200 213 GTLRGHENGVNSVAFSPDGYLLASGSEDGT-IRVWDLRT-------G----------ECVQTLS-G-HTNSVTSLAWSPD 272 (289)
T ss_pred cchhhcCCceEEEEEcCCCcEEEEEcCCCc-EEEEEcCC-------c----------eeEEEcc-c-cCCcEEEEEECCC
Confidence 999999999999999999999999987665 99999853 2 3445554 3 3457999999999
Q ss_pred CCEEEEEeCCCeEEEEe
Q 001814 443 SQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 443 g~~LAsgS~dGTVhIw~ 459 (1010)
+++|++++.||+++||+
T Consensus 273 ~~~l~~~~~d~~i~iw~ 289 (289)
T cd00200 273 GKRLASGSADGTIRIWD 289 (289)
T ss_pred CCEEEEecCCCeEEecC
Confidence 99999999999999995
No 11
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.83 E-value=1.6e-18 Score=194.61 Aligned_cols=280 Identities=19% Similarity=0.261 Sum_probs=204.3
Q ss_pred CcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEE
Q 001814 57 DQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLV 135 (1010)
Q Consensus 57 d~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAv 135 (1010)
-+|+-..++ .++.+|++|..+| ++||+. .|++...|..|.|||..|++...+ .+++.
T Consensus 236 kdVT~L~Wn-------~~G~~LatG~~~G~~riw~~--~G~l~~tl~~HkgPI~slKWnk~G-------------~yilS 293 (524)
T KOG0273|consen 236 KDVTSLDWN-------NDGTLLATGSEDGEARIWNK--DGNLISTLGQHKGPIFSLKWNKKG-------------TYILS 293 (524)
T ss_pred CCcceEEec-------CCCCeEEEeecCcEEEEEec--CchhhhhhhccCCceEEEEEcCCC-------------CEEEe
Confidence 445555554 2578999999999 699997 477888999999999999987443 45553
Q ss_pred EecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcE-EEEEEcC--CeEEEEeCCe
Q 001814 136 VAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSV-CMVRCSP--RIVAVGLATQ 212 (1010)
Q Consensus 136 Vsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V-~sVa~S~--rlLAV~ld~~ 212 (1010)
++ .++++.+||..+|+.-..+.|++.+ ++|.+-. .++..+.++.
T Consensus 294 --~~-------------------------------vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F~ts~td~~ 340 (524)
T KOG0273|consen 294 --GG-------------------------------VDGTTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEFATSSTDGC 340 (524)
T ss_pred --cc-------------------------------CCccEEEEeccCceEEEeeeeccCCccceEEecCceEeecCCCce
Confidence 11 3688999999999999999999866 8888854 3455566778
Q ss_pred EEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEccC--CeeeccCCccCCCcCCCCCCCCCcCC
Q 001814 213 IYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYASN--TLLLSNSGRLSPQNLTPSGVSPSTSP 288 (1010)
Q Consensus 213 I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~--~~~iwd~G~vs~Q~lt~p~vS~stSP 288 (1010)
|+++.+..-+...++..|.++ .+.+-+.| ..||.+++ ++.||+.|.-.-++
T Consensus 341 i~V~kv~~~~P~~t~~GH~g~-------------V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~~------------ 395 (524)
T KOG0273|consen 341 IHVCKVGEDRPVKTFIGHHGE-------------VNALKWNPTGSLLASCSDDGTLKIWSMGQSNSVH------------ 395 (524)
T ss_pred EEEEEecCCCcceeeecccCc-------------eEEEEECCCCceEEEecCCCeeEeeecCCCcchh------------
Confidence 999999888888888887763 34566665 67887775 57889854221000
Q ss_pred CCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH 368 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH 368 (1010)
.| .++.|.+ |...-.|++- ...++ + ....++++..|++|++||+..+.+++.|..|
T Consensus 396 ---~l-------------~~Hskei--~t~~wsp~g~---v~~n~-~--~~~~l~sas~dstV~lwdv~~gv~i~~f~kH 451 (524)
T KOG0273|consen 396 ---DL-------------QAHSKEI--YTIKWSPTGP---VTSNP-N--MNLMLASASFDSTVKLWDVESGVPIHTLMKH 451 (524)
T ss_pred ---hh-------------hhhccce--eeEeecCCCC---ccCCC-c--CCceEEEeecCCeEEEEEccCCceeEeeccC
Confidence 00 0000000 1000112221 11111 0 1124567889999999999999999999999
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
+.||.+|+|||+|.+||+++.+|. ++||++.. | .+|+-.+|. ..|..++|+-+|.+|++
T Consensus 452 ~~pVysvafS~~g~ylAsGs~dg~-V~iws~~~-------~-----------~l~~s~~~~--~~Ifel~Wn~~G~kl~~ 510 (524)
T KOG0273|consen 452 QEPVYSVAFSPNGRYLASGSLDGC-VHIWSTKT-------G-----------KLVKSYQGT--GGIFELCWNAAGDKLGA 510 (524)
T ss_pred CCceEEEEecCCCcEEEecCCCCe-eEeccccc-------h-----------heeEeecCC--CeEEEEEEcCCCCEEEE
Confidence 999999999999999999999775 99999863 3 456655653 46999999999999999
Q ss_pred EeCCCeEEEEeCC
Q 001814 449 VSSKGTCHVFVLS 461 (1010)
Q Consensus 449 gS~dGTVhIw~I~ 461 (1010)
+-.||.+.|-++.
T Consensus 511 ~~sd~~vcvldlr 523 (524)
T KOG0273|consen 511 CASDGSVCVLDLR 523 (524)
T ss_pred EecCCCceEEEec
Confidence 9999999987763
No 12
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.81 E-value=8.6e-19 Score=204.60 Aligned_cols=113 Identities=21% Similarity=0.252 Sum_probs=96.1
Q ss_pred ccccCCCCeEEEEECCCCc-----EE----EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCc
Q 001814 342 GADMDNAGIVVVKDFVTRA-----II----SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHK 412 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~-----~v----~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~ 412 (1010)
++++++|+++++|++...+ .+ .+-++|...|++++.+|+.+++||||.| ++.+||++.. +
T Consensus 427 fvsvS~D~tlK~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndkLiAT~SqD-ktaKiW~le~-------~--- 495 (775)
T KOG0319|consen 427 FVSVSQDCTLKLWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDKLIATGSQD-KTAKIWDLEQ-------L--- 495 (775)
T ss_pred EEEecCCceEEEecCCCcccccccceehhhHHHHhhcccccceEecCCCceEEecccc-cceeeecccC-------c---
Confidence 4688999999999997621 11 2446899999999999999999999994 6699999953 2
Q ss_pred cccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccC
Q 001814 413 YDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSS 474 (1010)
Q Consensus 413 ~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s 474 (1010)
+++..| +||++ .|+++.|+|..+.||++|.|+||+||.|+.+.+..++.+|.+
T Consensus 496 -------~l~~vL-sGH~R-Gvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~eGH~~ 548 (775)
T KOG0319|consen 496 -------RLLGVL-SGHTR-GVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTFEGHTS 548 (775)
T ss_pred -------eEEEEe-eCCcc-ceEEEEeccccceeEeccCCceEEEEEeccceeeeeecCccc
Confidence 466676 67765 599999999999999999999999999999999999999964
No 13
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.81 E-value=1.6e-17 Score=188.41 Aligned_cols=341 Identities=13% Similarity=0.131 Sum_probs=211.6
Q ss_pred CCCCCCcEEEEEEeeccCCCCCC-CeEEEEEecCcEEEEEccCCCcceEeee---eccCCEEEEEEecCCCCCCCCCCcc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVF-KQVLLLGYQNGFQVLDVEDASNFNELVS---KRDGPVSFLQMQPFPVKDDGCEGFR 127 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~-~~vLalGy~~G~qVWDv~~~g~v~ellS---~hdGpV~~v~~lP~p~~s~~~D~F~ 127 (1010)
..+|.-=|..++|.. + .++..+|.++.+.|||-.. ++..-.|+ .|.|.|..|.++|+..
T Consensus 186 ~r~HskFV~~VRysP-------DG~~Fat~gsDgki~iyDGkt-ge~vg~l~~~~aHkGsIfalsWsPDs~--------- 248 (603)
T KOG0318|consen 186 FREHSKFVNCVRYSP-------DGSRFATAGSDGKIYIYDGKT-GEKVGELEDSDAHKGSIFALSWSPDST--------- 248 (603)
T ss_pred ccccccceeeEEECC-------CCCeEEEecCCccEEEEcCCC-ccEEEEecCCCCccccEEEEEECCCCc---------
Confidence 345666688888863 3 3556666666799999754 44444444 6889999999999863
Q ss_pred ccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcE----EEEEE-cC
Q 001814 128 KLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSV----CMVRC-SP 202 (1010)
Q Consensus 128 ~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V----~sVa~-S~ 202 (1010)
-+|.+++ +.++||||+.+++++.++.+.+.| ..+-+ +.
T Consensus 249 ----~~~T~Sa---------------------------------Dkt~KIWdVs~~slv~t~~~~~~v~dqqvG~lWqkd 291 (603)
T KOG0318|consen 249 ----QFLTVSA---------------------------------DKTIKIWDVSTNSLVSTWPMGSTVEDQQVGCLWQKD 291 (603)
T ss_pred ----eEEEecC---------------------------------CceEEEEEeeccceEEEeecCCchhceEEEEEEeCC
Confidence 3465432 478999999999999999987665 23333 45
Q ss_pred CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEccC--CeeeccCCcc-----C
Q 001814 203 RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYASN--TLLLSNSGRL-----S 273 (1010)
Q Consensus 203 rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~--~~~iwd~G~v-----s 273 (1010)
.+|.|++.+.|-.++...++.++++.+|... ...+++++ ++|-.++. .+.-|+.|.- .
T Consensus 292 ~lItVSl~G~in~ln~~d~~~~~~i~GHnK~-------------ITaLtv~~d~~~i~SgsyDG~I~~W~~~~g~~~~~~ 358 (603)
T KOG0318|consen 292 HLITVSLSGTINYLNPSDPSVLKVISGHNKS-------------ITALTVSPDGKTIYSGSYDGHINSWDSGSGTSDRLA 358 (603)
T ss_pred eEEEEEcCcEEEEecccCCChhheecccccc-------------eeEEEEcCCCCEEEeeccCceEEEEecCCccccccc
Confidence 6888999999999999999988888887542 24455555 44433332 2334654311 1
Q ss_pred CCcCCCCCCCCCcCCCCCceEEEeehhhhhhhh---cccc-------------------------------eeecccccc
Q 001814 274 PQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFA---AGLS-------------------------------KTLSKYCQE 319 (1010)
Q Consensus 274 ~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la---~Gi~-------------------------------ktls~y~~~ 319 (1010)
++..+..+.....+.. +.+..--..|..|.+- .|+. --|+.....
T Consensus 359 g~~h~nqI~~~~~~~~-~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~QP~~lav~~d~~~avv~~~~~iv~l~~~~~~ 437 (603)
T KOG0318|consen 359 GKGHTNQIKGMAASES-GELFTIGWDDTLRVISLKDNGYTKSEVVKLGSQPKGLAVLSDGGTAVVACISDIVLLQDQTKV 437 (603)
T ss_pred cccccceEEEEeecCC-CcEEEEecCCeEEEEecccCcccccceeecCCCceeEEEcCCCCEEEEEecCcEEEEecCCcc
Confidence 1222211111111110 1110000000000000 0000 000000000
Q ss_pred -ccCCCC-CCCccCCCccccccccccccCCCCeEEEEECCCCc--EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEE
Q 001814 320 -LLPDGS-SSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA--IISQFKAHTSPISALCFDPSGTLLVTASVYGNNIN 395 (1010)
Q Consensus 320 -l~p~gs-~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~--~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~Ir 395 (1010)
..+.+. .+....+++.. ..+-+++||.|+||.+..+. ....+..|..+|++++|||||++||.++..+. +-
T Consensus 438 ~~~~~~y~~s~vAv~~~~~----~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rk-vv 512 (603)
T KOG0318|consen 438 SSIPIGYESSAVAVSPDGS----EVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRK-VV 512 (603)
T ss_pred eeeccccccceEEEcCCCC----EEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCCc-EE
Confidence 000010 11122223222 23467899999999998765 34567789999999999999999999999654 78
Q ss_pred EEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccc-cccC
Q 001814 396 IFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQ-TLSS 474 (1010)
Q Consensus 396 Vwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~-~H~s 474 (1010)
+|++... ....-+.++|.++|.+|+||||+++||+||.|.+|+||.++.+.....++ +|
T Consensus 513 ~yd~~s~------------------~~~~~~w~FHtakI~~~aWsP~n~~vATGSlDt~Viiysv~kP~~~i~iknAH-- 572 (603)
T KOG0318|consen 513 LYDVASR------------------EVKTNRWAFHTAKINCVAWSPNNKLVATGSLDTNVIIYSVKKPAKHIIIKNAH-- 572 (603)
T ss_pred EEEcccC------------------ceecceeeeeeeeEEEEEeCCCceEEEeccccceEEEEEccChhhheEecccc--
Confidence 9999642 22334467788999999999999999999999999999999887765543 34
Q ss_pred CCCCCccCCCCCCCccc
Q 001814 475 QGGDPYLFPVLSLPWWC 491 (1010)
Q Consensus 475 ~~~~~~~~pv~~lpw~~ 491 (1010)
...++.|.|.-
T Consensus 573 ------~~gVn~v~wld 583 (603)
T KOG0318|consen 573 ------LGGVNSVAWLD 583 (603)
T ss_pred ------ccCceeEEEec
Confidence 22366677643
No 14
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.81 E-value=8.7e-19 Score=194.81 Aligned_cols=225 Identities=16% Similarity=0.140 Sum_probs=168.4
Q ss_pred CEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC----eEEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccC
Q 001814 173 TAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR----IVAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVG 246 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r----lLAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~ 246 (1010)
+.+++|+..+...+++|..| +.|-++.|.|. -||. +.|+.+++|++.+-+.+..|..|...
T Consensus 197 G~~kvW~~~~~~~~~~l~gH~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~e~~l~~l~gH~~R------------- 263 (459)
T KOG0272|consen 197 GLVKVWSVPQCNLLQTLRGHTSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQETPLQDLEGHLAR------------- 263 (459)
T ss_pred CceeEeecCCcceeEEEeccccceeeEEEccCCCccceeeeccCCceeeeccCCCcchhhhhcchhh-------------
Confidence 68999999999999999876 58999999883 4554 56789999999988888888887652
Q ss_pred ccceEEcc--ceEEEccCC--eeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccC
Q 001814 247 YGPMAVGP--RWLAYASNT--LLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLP 322 (1010)
Q Consensus 247 ~gplAlgp--RwLAyas~~--~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p 322 (1010)
...++|.| ++|+.+... -.+||.-. +. -..+..|..+.+
T Consensus 264 Vs~VafHPsG~~L~TasfD~tWRlWD~~t-------------------k~---------ElL~QEGHs~~v--------- 306 (459)
T KOG0272|consen 264 VSRVAFHPSGKFLGTASFDSTWRLWDLET-------------------KS---------ELLLQEGHSKGV--------- 306 (459)
T ss_pred heeeeecCCCceeeecccccchhhccccc-------------------ch---------hhHhhccccccc---------
Confidence 35578877 899887753 35777310 00 001112221111
Q ss_pred CCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCC
Q 001814 323 DGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPS 402 (1010)
Q Consensus 323 ~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~ 402 (1010)
..+++++++. ..++++.|..-+|||+.++..+..|.+|..+|..|+|||+|.+|||+|.|++ +||||+.-.
T Consensus 307 ----~~iaf~~DGS----L~~tGGlD~~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNGy~lATgs~Dnt-~kVWDLR~r 377 (459)
T KOG0272|consen 307 ----FSIAFQPDGS----LAATGGLDSLGRVWDLRTGRCIMFLAGHIKEILSVAFSPNGYHLATGSSDNT-CKVWDLRMR 377 (459)
T ss_pred ----ceeEecCCCc----eeeccCccchhheeecccCcEEEEecccccceeeEeECCCceEEeecCCCCc-EEEeeeccc
Confidence 1122222222 2357888999999999999999999999999999999999999999999765 999999621
Q ss_pred cccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc-CCCEEEEEeCCCeEEEEeCCCCCCccccccccCC
Q 001814 403 CMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH-YSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQ 475 (1010)
Q Consensus 403 ~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp-Dg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~ 475 (1010)
..+|..- + |...|..+.|+| -|++|+++|-|+|++||.-..+.....+.+|...
T Consensus 378 -----------------~~ly~ip-A-H~nlVS~Vk~~p~~g~fL~TasyD~t~kiWs~~~~~~~ksLaGHe~k 432 (459)
T KOG0272|consen 378 -----------------SELYTIP-A-HSNLVSQVKYSPQEGYFLVTASYDNTVKIWSTRTWSPLKSLAGHEGK 432 (459)
T ss_pred -----------------ccceecc-c-ccchhhheEecccCCeEEEEcccCcceeeecCCCcccchhhcCCccc
Confidence 3566653 3 345799999999 7899999999999999998777666677788644
No 15
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.80 E-value=1e-17 Score=195.68 Aligned_cols=267 Identities=20% Similarity=0.278 Sum_probs=179.8
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC-cEEEEEecCCCCCCC-CCCC-
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH-PFLLVVAGEDTNTLA-PGQN- 149 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr-pLLAvVsgd~~~~s~-~~q~- 149 (1010)
.-.+|++|..+| |.++.+.+ =++...++--+-+|..+.+.-.+.- =.|...+ -.|+|+ +....+- ..|+
T Consensus 276 ~t~~lvvgFssG~f~LyelP~-f~lih~LSis~~~I~t~~~N~tGDW----iA~g~~klgQLlVw--eWqsEsYVlKQQg 348 (893)
T KOG0291|consen 276 GTNLLVVGFSSGEFGLYELPD-FNLIHSLSISDQKILTVSFNSTGDW----IAFGCSKLGQLLVW--EWQSESYVLKQQG 348 (893)
T ss_pred CceEEEEEecCCeeEEEecCC-ceEEEEeecccceeeEEEecccCCE----EEEcCCccceEEEE--Eeeccceeeeccc
Confidence 457999999999 68999954 4566777777788888887643310 0122221 012332 1111111 2233
Q ss_pred -CCCcccc---ccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--C-eEEEEeCCeEEEEECCCC
Q 001814 150 -RSHLGGV---RDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--R-IVAVGLATQIYCFDALTL 221 (1010)
Q Consensus 150 -~~~~~~v---r~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--r-lLAV~ld~~I~IwD~~Tl 221 (1010)
...+..+ +||+.-... ..+++|+|||..+|-|+.|+.-| +.|.+|.|.. + +|..++|++|+.||+...
T Consensus 349 H~~~i~~l~YSpDgq~iaTG----~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRY 424 (893)
T KOG0291|consen 349 HSDRITSLAYSPDGQLIATG----AEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRY 424 (893)
T ss_pred cccceeeEEECCCCcEEEec----cCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEeeeeccc
Confidence 1222222 565543221 13799999999999999999654 6899999976 4 455688999999999988
Q ss_pred ceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhh
Q 001814 222 ENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEH 301 (1010)
Q Consensus 222 e~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~ds 301 (1010)
++..|+. .|.|. .+..+|+.| + |-+|..
T Consensus 425 rNfRTft-~P~p~-----------QfscvavD~----------------------------------s-GelV~A----- 452 (893)
T KOG0291|consen 425 RNFRTFT-SPEPI-----------QFSCVAVDP----------------------------------S-GELVCA----- 452 (893)
T ss_pred ceeeeec-CCCce-----------eeeEEEEcC----------------------------------C-CCEEEe-----
Confidence 8877664 34431 122233221 1 112210
Q ss_pred hhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCC
Q 001814 302 SKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSG 381 (1010)
Q Consensus 302 sk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdG 381 (1010)
-+...-.|.||++++|+++-.|.+|.+||.+|+|+|+|
T Consensus 453 ------------------------------------------G~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~ 490 (893)
T KOG0291|consen 453 ------------------------------------------GAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDG 490 (893)
T ss_pred ------------------------------------------eccceEEEEEEEeecCeeeehhcCCCCcceeeEEcccc
Confidence 01123469999999999999999999999999999999
Q ss_pred CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 382 TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 382 tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
++|||+|.| ++||+|++-.. .| .+-++. .+..+.+++|+|||+-||+++.||.|.+|++.
T Consensus 491 ~~LaS~SWD-kTVRiW~if~s-----~~-----------~vEtl~---i~sdvl~vsfrPdG~elaVaTldgqItf~d~~ 550 (893)
T KOG0291|consen 491 SLLASGSWD-KTVRIWDIFSS-----SG-----------TVETLE---IRSDVLAVSFRPDGKELAVATLDGQITFFDIK 550 (893)
T ss_pred CeEEecccc-ceEEEEEeecc-----Cc-----------eeeeEe---eccceeEEEEcCCCCeEEEEEecceEEEEEhh
Confidence 999999994 66999999531 12 222332 12458999999999999999999999999998
Q ss_pred CCCC
Q 001814 462 PFGG 465 (1010)
Q Consensus 462 ~~gg 465 (1010)
....
T Consensus 551 ~~~q 554 (893)
T KOG0291|consen 551 EAVQ 554 (893)
T ss_pred hcee
Confidence 6544
No 16
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.80 E-value=1.2e-17 Score=178.35 Aligned_cols=246 Identities=15% Similarity=0.192 Sum_probs=183.2
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccC----CCcceEeeeeccCCEEEEEEecCCCCCCCCCCc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVED----ASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGF 126 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~----~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F 126 (1010)
.++|++.|.-.+... ..+.+|+.+..+ .+-+|++.. .|.....+.+|.--|..+.+.+++.
T Consensus 11 l~gh~d~Vt~la~~~------~~~~~l~sasrDk~ii~W~L~~dd~~~G~~~r~~~GHsH~v~dv~~s~dg~-------- 76 (315)
T KOG0279|consen 11 LEGHTDWVTALAIKI------KNSDILVSASRDKTIIVWKLTSDDIKYGVPVRRLTGHSHFVSDVVLSSDGN-------- 76 (315)
T ss_pred ecCCCceEEEEEeec------CCCceEEEcccceEEEEEEeccCccccCceeeeeeccceEecceEEccCCc--------
Confidence 478999999887764 135667777665 589999953 2455677778888888888887662
Q ss_pred cccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC--C
Q 001814 127 RKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP--R 203 (1010)
Q Consensus 127 ~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~--r 203 (1010)
+ |+ + ++| ++++|+||+.+|+..+.|..|. -|++|++++ +
T Consensus 77 -----~-al-S---------------------~sw----------D~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~ 118 (315)
T KOG0279|consen 77 -----F-AL-S---------------------ASW----------DGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNR 118 (315)
T ss_pred -----e-EE-e---------------------ccc----------cceEEEEEecCCcEEEEEEecCCceEEEEecCCCc
Confidence 2 33 1 122 5899999999999888888775 799999998 4
Q ss_pred eEEEE-eCCeEEEEECCCCceeEEEeecC-CccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCC
Q 001814 204 IVAVG-LATQIYCFDALTLENKFSVLTYP-VPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSG 281 (1010)
Q Consensus 204 lLAV~-ld~~I~IwD~~Tle~l~tL~t~p-~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~ 281 (1010)
.|+.+ -|.+|.+||... ++.+++.... . + ....+.|+| +.
T Consensus 119 qivSGSrDkTiklwnt~g-~ck~t~~~~~~~-----------~-WVscvrfsP-------~~------------------ 160 (315)
T KOG0279|consen 119 QIVSGSRDKTIKLWNTLG-VCKYTIHEDSHR-----------E-WVSCVRFSP-------NE------------------ 160 (315)
T ss_pred eeecCCCcceeeeeeecc-cEEEEEecCCCc-----------C-cEEEEEEcC-------CC------------------
Confidence 56655 467899999764 5666665432 1 0 011222221 10
Q ss_pred CCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE
Q 001814 282 VSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI 361 (1010)
Q Consensus 282 vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~ 361 (1010)
. ...+++++.|++|+|||+.+.+.
T Consensus 161 ---------~-----------------------------------------------~p~Ivs~s~DktvKvWnl~~~~l 184 (315)
T KOG0279|consen 161 ---------S-----------------------------------------------NPIIVSASWDKTVKVWNLRNCQL 184 (315)
T ss_pred ---------C-----------------------------------------------CcEEEEccCCceEEEEccCCcch
Confidence 0 00234678899999999999999
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
.+.+.+|++.+++++|||||.++|+++.+|. +-+||+.. | +++|.|.. ...|.+++|+|
T Consensus 185 ~~~~~gh~~~v~t~~vSpDGslcasGgkdg~-~~LwdL~~-------~----------k~lysl~a---~~~v~sl~fsp 243 (315)
T KOG0279|consen 185 RTTFIGHSGYVNTVTVSPDGSLCASGGKDGE-AMLWDLNE-------G----------KNLYSLEA---FDIVNSLCFSP 243 (315)
T ss_pred hhccccccccEEEEEECCCCCEEecCCCCce-EEEEEccC-------C----------ceeEeccC---CCeEeeEEecC
Confidence 9999999999999999999999999999765 89999963 4 68998853 35799999999
Q ss_pred CCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 442 YSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 442 Dg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.--|||.+...+ |+||++++...
T Consensus 244 nrywL~~at~~s-IkIwdl~~~~~ 266 (315)
T KOG0279|consen 244 NRYWLCAATATS-IKIWDLESKAV 266 (315)
T ss_pred CceeEeeccCCc-eEEEeccchhh
Confidence 999999998775 99999987543
No 17
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.79 E-value=7.9e-18 Score=184.65 Aligned_cols=254 Identities=15% Similarity=0.151 Sum_probs=191.9
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
..+|++-|.-+.|+- .+++|+.+..+- +++||.+..-.+...+.+|+-.|.++.++|.+
T Consensus 146 LrGHt~sv~di~~~a-------~Gk~l~tcSsDl~~~LWd~~~~~~c~ks~~gh~h~vS~V~f~P~g------------- 205 (406)
T KOG0295|consen 146 LRGHTDSVFDISFDA-------SGKYLATCSSDLSAKLWDFDTFFRCIKSLIGHEHGVSSVFFLPLG------------- 205 (406)
T ss_pred hhccccceeEEEEec-------CccEEEecCCccchhheeHHHHHHHHHHhcCcccceeeEEEEecC-------------
Confidence 456777788888873 468899998887 99999976566778888999999999999875
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC--CeEEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP--RIVAV 207 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~--rlLAV 207 (1010)
..++.++ .+++|+.|++.+|.|+.++.-++ .|+.|+.+. .++|.
T Consensus 206 d~ilS~s---------------------------------rD~tik~We~~tg~cv~t~~~h~ewvr~v~v~~DGti~As 252 (406)
T KOG0295|consen 206 DHILSCS---------------------------------RDNTIKAWECDTGYCVKTFPGHSEWVRMVRVNQDGTIIAS 252 (406)
T ss_pred Ceeeecc---------------------------------cccceeEEecccceeEEeccCchHhEEEEEecCCeeEEEe
Confidence 2445321 25789999999999999998765 899999988 46666
Q ss_pred Ee-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 GL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 ~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
+. +.++++|-+.+++++..++.|.-| ...+++-|- .+|+. ++-++
T Consensus 253 ~s~dqtl~vW~~~t~~~k~~lR~hEh~-------------vEci~wap~-~~~~~--------------------i~~at 298 (406)
T KOG0295|consen 253 CSNDQTLRVWVVATKQCKAELREHEHP-------------VECIAWAPE-SSYPS--------------------ISEAT 298 (406)
T ss_pred cCCCceEEEEEeccchhhhhhhccccc-------------eEEEEeccc-ccCcc--------------------hhhcc
Confidence 55 457999999999888777766543 122333220 01111 00000
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK 366 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~ 366 (1010)
... + +.+ ...+++.|++|++||+.++.++-+|.
T Consensus 299 ~~~------------------------------------------~-~~~----~l~s~SrDktIk~wdv~tg~cL~tL~ 331 (406)
T KOG0295|consen 299 GST------------------------------------------N-GGQ----VLGSGSRDKTIKIWDVSTGMCLFTLV 331 (406)
T ss_pred CCC------------------------------------------C-Ccc----EEEeecccceEEEEeccCCeEEEEEe
Confidence 000 0 011 12357899999999999999999999
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
+|..-|..++|+|-|++|+++..|++ +||||+.. + +++..+. .+...|.++.|..+.-++
T Consensus 332 ghdnwVr~~af~p~Gkyi~ScaDDkt-lrvwdl~~-------~----------~cmk~~~--ah~hfvt~lDfh~~~p~V 391 (406)
T KOG0295|consen 332 GHDNWVRGVAFSPGGKYILSCADDKT-LRVWDLKN-------L----------QCMKTLE--AHEHFVTSLDFHKTAPYV 391 (406)
T ss_pred cccceeeeeEEcCCCeEEEEEecCCc-EEEEEecc-------c----------eeeeccC--CCcceeEEEecCCCCceE
Confidence 99999999999999999999999665 99999963 2 4666664 244569999999999999
Q ss_pred EEEeCCCeEEEEe
Q 001814 447 AIVSSKGTCHVFV 459 (1010)
Q Consensus 447 AsgS~dGTVhIw~ 459 (1010)
++|+-|.|++||.
T Consensus 392 vTGsVdqt~KvwE 404 (406)
T KOG0295|consen 392 VTGSVDQTVKVWE 404 (406)
T ss_pred Eeccccceeeeee
Confidence 9999999999996
No 18
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.78 E-value=6.8e-17 Score=173.55 Aligned_cols=256 Identities=13% Similarity=0.090 Sum_probs=187.4
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
..+|..+|+-..|- .+++.|+.+.++| +-|||.-. .+-...+..+..+|-..++.|.+
T Consensus 51 LkGH~~Ki~~~~ws-------~Dsr~ivSaSqDGklIvWDs~T-tnK~haipl~s~WVMtCA~sPSg------------- 109 (343)
T KOG0286|consen 51 LKGHLNKIYAMDWS-------TDSRRIVSASQDGKLIVWDSFT-TNKVHAIPLPSSWVMTCAYSPSG------------- 109 (343)
T ss_pred ecccccceeeeEec-------CCcCeEEeeccCCeEEEEEccc-ccceeEEecCceeEEEEEECCCC-------------
Confidence 35677777766664 3678888888888 79999854 44456677778899999988865
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCC--e----EEEEEeCC-CcEEEEEEcC-
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSH--C----YEHVLRFR-SSVCMVRCSP- 202 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktg--e----~V~tL~f~-S~V~sVa~S~- 202 (1010)
.++| +| | .++...||++++. + ..++|..| +.+.+.+|-.
T Consensus 110 ~~VA--cG------G-------------------------LdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD 156 (343)
T KOG0286|consen 110 NFVA--CG------G-------------------------LDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDD 156 (343)
T ss_pred CeEE--ec------C-------------------------cCceeEEEecccccccccceeeeeecCccceeEEEEEcCC
Confidence 4555 22 1 2478999999865 2 34456555 5778888865
Q ss_pred -CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCC
Q 001814 203 -RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSG 281 (1010)
Q Consensus 203 -rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~ 281 (1010)
++|..+.|.+..+||+.+++....+.+|..- .|+|+ .+++
T Consensus 157 ~~ilT~SGD~TCalWDie~g~~~~~f~GH~gD---------------V~sls-----l~p~------------------- 197 (343)
T KOG0286|consen 157 NHILTGSGDMTCALWDIETGQQTQVFHGHTGD---------------VMSLS-----LSPS------------------- 197 (343)
T ss_pred CceEecCCCceEEEEEcccceEEEEecCCccc---------------EEEEe-----cCCC-------------------
Confidence 4666667789999999999988877776441 23322 1110
Q ss_pred CCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE
Q 001814 282 VSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI 361 (1010)
Q Consensus 282 vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~ 361 (1010)
+ + .++++++-|+..+|||+..+..
T Consensus 198 -------~-~------------------------------------------------ntFvSg~cD~~aklWD~R~~~c 221 (343)
T KOG0286|consen 198 -------D-G------------------------------------------------NTFVSGGCDKSAKLWDVRSGQC 221 (343)
T ss_pred -------C-C------------------------------------------------CeEEecccccceeeeeccCcce
Confidence 0 0 0234677899999999999999
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
+++|.+|.+.|++++|-|+|.-+||+|.|++ +|+||+... +.+..+..-.....|.+++||.
T Consensus 222 ~qtF~ghesDINsv~ffP~G~afatGSDD~t-cRlyDlRaD-----------------~~~a~ys~~~~~~gitSv~FS~ 283 (343)
T KOG0286|consen 222 VQTFEGHESDINSVRFFPSGDAFATGSDDAT-CRLYDLRAD-----------------QELAVYSHDSIICGITSVAFSK 283 (343)
T ss_pred eEeecccccccceEEEccCCCeeeecCCCce-eEEEeecCC-----------------cEEeeeccCcccCCceeEEEcc
Confidence 9999999999999999999999999999776 999999642 3444443333445799999999
Q ss_pred CCCEEEEEeCCCeEEEEeCCCCCCccccccccC
Q 001814 442 YSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSS 474 (1010)
Q Consensus 442 Dg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s 474 (1010)
-|++|.+|-.|.|++||+.-...-.-.+..|..
T Consensus 284 SGRlLfagy~d~~c~vWDtlk~e~vg~L~GHeN 316 (343)
T KOG0286|consen 284 SGRLLFAGYDDFTCNVWDTLKGERVGVLAGHEN 316 (343)
T ss_pred cccEEEeeecCCceeEeeccccceEEEeeccCC
Confidence 999999999999999999754322234456643
No 19
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.78 E-value=2.6e-17 Score=177.31 Aligned_cols=259 Identities=15% Similarity=0.103 Sum_probs=188.4
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
..+|+..|.-+.|+. ++.+|++|.. .-|-+|++..--+-.-.+.+|.|.|--|.+.++.
T Consensus 43 l~gh~geI~~~~F~P-------~gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~------------- 102 (338)
T KOG0265|consen 43 LPGHKGEIYTIKFHP-------DGSCFASGGSDRAIVLWNVYGDCENFWVLKGHSGAVMELHGMRDG------------- 102 (338)
T ss_pred cCCCcceEEEEEECC-------CCCeEeecCCcceEEEEeccccccceeeeccccceeEeeeeccCC-------------
Confidence 678999999999984 4567777665 5699999742222234567899999999988765
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcE-EEEEEc---CCeEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSV-CMVRCS---PRIVA 206 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V-~sVa~S---~rlLA 206 (1010)
..|+. ++ .|++|+.||.++|+++..++.+..+ .++.-+ +.+|.
T Consensus 103 s~i~S-~g--------------------------------tDk~v~~wD~~tG~~~rk~k~h~~~vNs~~p~rrg~~lv~ 149 (338)
T KOG0265|consen 103 SHILS-CG--------------------------------TDKTVRGWDAETGKRIRKHKGHTSFVNSLDPSRRGPQLVC 149 (338)
T ss_pred CEEEE-ec--------------------------------CCceEEEEecccceeeehhccccceeeecCccccCCeEEE
Confidence 23332 21 3689999999999999999887654 444433 34666
Q ss_pred EEeC-CeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 207 VGLA-TQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 207 V~ld-~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
.+.+ +++++||+++.+.++++. ++ |...|++ |... +
T Consensus 150 SgsdD~t~kl~D~R~k~~~~t~~---~k-------------yqltAv~-----f~d~-------s--------------- 186 (338)
T KOG0265|consen 150 SGSDDGTLKLWDIRKKEAIKTFE---NK-------------YQLTAVG-----FKDT-------S--------------- 186 (338)
T ss_pred ecCCCceEEEEeecccchhhccc---cc-------------eeEEEEE-----eccc-------c---------------
Confidence 6654 589999999766655442 11 1222332 1100 0
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF 365 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~ 365 (1010)
-++.++..|+.|+|||+..+..+.++
T Consensus 187 ------------------------------------------------------~qv~sggIdn~ikvWd~r~~d~~~~l 212 (338)
T KOG0265|consen 187 ------------------------------------------------------DQVISGGIDNDIKVWDLRKNDGLYTL 212 (338)
T ss_pred ------------------------------------------------------cceeeccccCceeeeccccCcceEEe
Confidence 01124567999999999999999999
Q ss_pred ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccc--cEEEEEEccCC
Q 001814 366 KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSA--TIQDICFSHYS 443 (1010)
Q Consensus 366 ~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a--~I~sIAFSpDg 443 (1010)
.+|..+|..|..+|+|..|.+-+-| .++++||+.|. .+..+++..+..+.++. ....++|||++
T Consensus 213 sGh~DtIt~lsls~~gs~llsnsMd-~tvrvwd~rp~-------------~p~~R~v~if~g~~hnfeknlL~cswsp~~ 278 (338)
T KOG0265|consen 213 SGHADTITGLSLSRYGSFLLSNSMD-NTVRVWDVRPF-------------APSQRCVKIFQGHIHNFEKNLLKCSWSPNG 278 (338)
T ss_pred ecccCceeeEEeccCCCcccccccc-ceEEEEEeccc-------------CCCCceEEEeecchhhhhhhcceeeccCCC
Confidence 9999999999999999999999995 56999999873 23335677776655554 35678999999
Q ss_pred CEEEEEeCCCeEEEEeCCCCCCccccccccC
Q 001814 444 QWIAIVSSKGTCHVFVLSPFGGDSGFQTLSS 474 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s 474 (1010)
+++.++|.|..++||+....+-...+..|..
T Consensus 279 ~~i~ags~dr~vyvwd~~~r~~lyklpGh~g 309 (338)
T KOG0265|consen 279 TKITAGSADRFVYVWDTTSRRILYKLPGHYG 309 (338)
T ss_pred CccccccccceEEEeecccccEEEEcCCcce
Confidence 9999999999999999887666556666653
No 20
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.77 E-value=8.9e-18 Score=184.24 Aligned_cols=250 Identities=16% Similarity=0.199 Sum_probs=187.7
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
++++-.|..+.|.. ..-+.+++.++ .+++||.. ++++...|.+|...|..|.|... ..
T Consensus 105 ~g~r~~vt~v~~hp-------~~~~v~~as~d~tikv~D~~-tg~~e~~LrGHt~sv~di~~~a~-------------Gk 163 (406)
T KOG0295|consen 105 AGHRSSVTRVIFHP-------SEALVVSASEDATIKVFDTE-TGELERSLRGHTDSVFDISFDAS-------------GK 163 (406)
T ss_pred hccccceeeeeecc-------CceEEEEecCCceEEEEEcc-chhhhhhhhccccceeEEEEecC-------------cc
Confidence 45677788888853 44567777665 59999995 58888889999888999987632 23
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCC-CeEEEEEeCCC-cEEEEEEcC--CeEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQS-HCYEHVLRFRS-SVCMVRCSP--RIVAV 207 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlkt-ge~V~tL~f~S-~V~sVa~S~--rlLAV 207 (1010)
+||.++ ++-.+++||+.+ .++++.+..+. .|.+|.|-| +.|+.
T Consensus 164 ~l~tcS---------------------------------sDl~~~LWd~~~~~~c~ks~~gh~h~vS~V~f~P~gd~ilS 210 (406)
T KOG0295|consen 164 YLATCS---------------------------------SDLSAKLWDFDTFFRCIKSLIGHEHGVSSVFFLPLGDHILS 210 (406)
T ss_pred EEEecC---------------------------------CccchhheeHHHHHHHHHHhcCcccceeeEEEEecCCeeee
Confidence 555422 123499999986 66777776554 678888876 45554
Q ss_pred -EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 -GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 -~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
+-|..|+.||+.|+.+++++..++-+ + |.++...+
T Consensus 211 ~srD~tik~We~~tg~cv~t~~~h~ew-------------v-------r~v~v~~D------------------------ 246 (406)
T KOG0295|consen 211 CSRDNTIKAWECDTGYCVKTFPGHSEW-------------V-------RMVRVNQD------------------------ 246 (406)
T ss_pred cccccceeEEecccceeEEeccCchHh-------------E-------EEEEecCC------------------------
Confidence 55778999999999999888765442 0 11111110
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK 366 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~ 366 (1010)
|. .+++++.|.+|+||-+.++++.+.|+
T Consensus 247 ----Gt------------------------------------------------i~As~s~dqtl~vW~~~t~~~k~~lR 274 (406)
T KOG0295|consen 247 ----GT------------------------------------------------IIASCSNDQTLRVWVVATKQCKAELR 274 (406)
T ss_pred ----ee------------------------------------------------EEEecCCCceEEEEEeccchhhhhhh
Confidence 11 23467889999999999999999999
Q ss_pred cCCCCeEEEEECCC---------------CCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccccc
Q 001814 367 AHTSPISALCFDPS---------------GTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITS 431 (1010)
Q Consensus 367 aHtspIsaLaFSPd---------------GtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~ 431 (1010)
.|..||-+++|-|. |..|+++|.|+ +||+||+.. | .+|++| .|+.
T Consensus 275 ~hEh~vEci~wap~~~~~~i~~at~~~~~~~~l~s~SrDk-tIk~wdv~t-------g----------~cL~tL-~ghd- 334 (406)
T KOG0295|consen 275 EHEHPVECIAWAPESSYPSISEATGSTNGGQVLGSGSRDK-TIKIWDVST-------G----------MCLFTL-VGHD- 334 (406)
T ss_pred ccccceEEEEecccccCcchhhccCCCCCccEEEeecccc-eEEEEeccC-------C----------eEEEEE-eccc-
Confidence 99999999999872 35899999965 599999964 5 689998 4544
Q ss_pred ccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccc
Q 001814 432 ATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTL 472 (1010)
Q Consensus 432 a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H 472 (1010)
.+|.+++|+|-|+||+++.+|+|+|||++....+..++.+|
T Consensus 335 nwVr~~af~p~Gkyi~ScaDDktlrvwdl~~~~cmk~~~ah 375 (406)
T KOG0295|consen 335 NWVRGVAFSPGGKYILSCADDKTLRVWDLKNLQCMKTLEAH 375 (406)
T ss_pred ceeeeeEEcCCCeEEEEEecCCcEEEEEeccceeeeccCCC
Confidence 57999999999999999999999999999987766555444
No 21
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.77 E-value=2e-16 Score=195.76 Aligned_cols=227 Identities=15% Similarity=0.225 Sum_probs=160.2
Q ss_pred CeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCc
Q 001814 75 KQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHL 153 (1010)
Q Consensus 75 ~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~ 153 (1010)
+..|++|..+ .++|||+. .++....+..|.+.|..+++.|.. ..+|+. ++
T Consensus 545 ~~~las~~~Dg~v~lWd~~-~~~~~~~~~~H~~~V~~l~~~p~~------------~~~L~S-gs--------------- 595 (793)
T PLN00181 545 KSQVASSNFEGVVQVWDVA-RSQLVTEMKEHEKRVWSIDYSSAD------------PTLLAS-GS--------------- 595 (793)
T ss_pred CCEEEEEeCCCeEEEEECC-CCeEEEEecCCCCCEEEEEEcCCC------------CCEEEE-Ec---------------
Confidence 3456666555 59999995 456667778899999999998631 135553 21
Q ss_pred cccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC---CeEEEEe-CCeEEEEECCCCc-eeEEEe
Q 001814 154 GGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP---RIVAVGL-ATQIYCFDALTLE-NKFSVL 228 (1010)
Q Consensus 154 ~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~---rlLAV~l-d~~I~IwD~~Tle-~l~tL~ 228 (1010)
.+++|++||+++++++.++..+..|.++.+++ ..|++|. ++.|++||+.+.+ .+.++.
T Consensus 596 -----------------~Dg~v~iWd~~~~~~~~~~~~~~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~~~~~~~ 658 (793)
T PLN00181 596 -----------------DDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMI 658 (793)
T ss_pred -----------------CCCEEEEEECCCCcEEEEEecCCCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCccceEec
Confidence 14789999999999999999888999999853 5677655 5689999998754 233333
Q ss_pred ecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcc
Q 001814 229 TYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAG 308 (1010)
Q Consensus 229 t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~G 308 (1010)
.|..+ + .+++|... .
T Consensus 659 ~h~~~----------------V----~~v~f~~~----------------------------~----------------- 673 (793)
T PLN00181 659 GHSKT----------------V----SYVRFVDS----------------------------S----------------- 673 (793)
T ss_pred CCCCC----------------E----EEEEEeCC----------------------------C-----------------
Confidence 33221 0 11222110 0
Q ss_pred cceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC------CcEEEEeccCCCCeEEEEECCCCC
Q 001814 309 LSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT------RAIISQFKAHTSPISALCFDPSGT 382 (1010)
Q Consensus 309 i~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s------~~~v~~~~aHtspIsaLaFSPdGt 382 (1010)
.+++++.||+|+|||+.. ...+..|.+|...+.+++|+|+|.
T Consensus 674 --------------------------------~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~ 721 (793)
T PLN00181 674 --------------------------------TLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDG 721 (793)
T ss_pred --------------------------------EEEEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCC
Confidence 122456799999999974 356789999999999999999999
Q ss_pred EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe---------cccccccEEEEEEccCCCEEEEEeCCC
Q 001814 383 LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH---------RGITSATIQDICFSHYSQWIAIVSSKG 453 (1010)
Q Consensus 383 lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~---------RG~t~a~I~sIAFSpDg~~LAsgS~dG 453 (1010)
+||+++.||+ |+||+.... . ....+.+. -..+...|.+++|+||++.|++++.+|
T Consensus 722 ~lasgs~D~~-v~iw~~~~~-------~--------~~~s~~~~~~~~~~~~~~~~~~~~V~~v~ws~~~~~lva~~~dG 785 (793)
T PLN00181 722 YIATGSETNE-VFVYHKAFP-------M--------PVLSYKFKTIDPVSGLEVDDASQFISSVCWRGQSSTLVAANSTG 785 (793)
T ss_pred EEEEEeCCCE-EEEEECCCC-------C--------ceEEEecccCCcccccccCCCCcEEEEEEEcCCCCeEEEecCCC
Confidence 9999999765 999997421 0 00111110 011234699999999999999999999
Q ss_pred eEEEEeC
Q 001814 454 TCHVFVL 460 (1010)
Q Consensus 454 TVhIw~I 460 (1010)
+|+||++
T Consensus 786 ~I~i~~~ 792 (793)
T PLN00181 786 NIKILEM 792 (793)
T ss_pred cEEEEec
Confidence 9999986
No 22
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.75 E-value=5.2e-16 Score=181.24 Aligned_cols=239 Identities=18% Similarity=0.202 Sum_probs=176.2
Q ss_pred CCCeEEEEEecCc-EEEEEccCCC-cceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDAS-NFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNR 150 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g-~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~ 150 (1010)
++++.|+.+..++ +++|+..... ++...+.+|.-.|+.++|.|++. +|+ ++.
T Consensus 169 ~~g~~l~~~~~~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~d~~-------------~l~--s~s----------- 222 (456)
T KOG0266|consen 169 PDGRALAAASSDGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSPDGS-------------YLL--SGS----------- 222 (456)
T ss_pred CCCCeEEEccCCCcEEEeecccccchhhccccccccceeeeEECCCCc-------------EEE--Eec-----------
Confidence 4666777776665 8999995433 14455578999999999998872 333 221
Q ss_pred CCccccccCCcCCCCCCCCCCCCEEEEEeC-CCCeEEEEEeCC-CcEEEEEEcC--CeEEEE-eCCeEEEEECCCCceeE
Q 001814 151 SHLGGVRDGMMDSQSGNCVNSPTAVRFYSF-QSHCYEHVLRFR-SSVCMVRCSP--RIVAVG-LATQIYCFDALTLENKF 225 (1010)
Q Consensus 151 ~~~~~vr~gs~d~~~~~~~~sp~tVrIWDl-ktge~V~tL~f~-S~V~sVa~S~--rlLAV~-ld~~I~IwD~~Tle~l~ 225 (1010)
.+.+|||||+ ..+.++++++.| ..|++++|++ ++|++| .|+.|+|||+.+++++.
T Consensus 223 --------------------~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~ 282 (456)
T KOG0266|consen 223 --------------------DDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVR 282 (456)
T ss_pred --------------------CCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEE
Confidence 2579999999 566899999866 5899999998 466655 56789999999999999
Q ss_pred EEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhh
Q 001814 226 SVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQF 305 (1010)
Q Consensus 226 tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~l 305 (1010)
++..|..+ ...+++ .. ++..
T Consensus 283 ~l~~hs~~-------------is~~~f-------~~---------------------------d~~~------------- 302 (456)
T KOG0266|consen 283 KLKGHSDG-------------ISGLAF-------SP---------------------------DGNL------------- 302 (456)
T ss_pred eeeccCCc-------------eEEEEE-------CC---------------------------CCCE-------------
Confidence 99887652 112222 22 1111
Q ss_pred hcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc--EEEEeccCCCC--eEEEEECCCC
Q 001814 306 AAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA--IISQFKAHTSP--ISALCFDPSG 381 (1010)
Q Consensus 306 a~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~--~v~~~~aHtsp--IsaLaFSPdG 381 (1010)
+++++.||.|+|||+.++. ++..+..|..+ +++++|+|+|
T Consensus 303 ------------------------------------l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~ 346 (456)
T KOG0266|consen 303 ------------------------------------LVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNG 346 (456)
T ss_pred ------------------------------------EEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCC
Confidence 1234679999999999998 67888887666 9999999999
Q ss_pred CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccc--cEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 382 TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSA--TIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 382 tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a--~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
.+|+++..++ .+++|++.. + ..+.. .+|+... .+.+..++++++|+.+|+.|++|++|+
T Consensus 347 ~~ll~~~~d~-~~~~w~l~~-------~----------~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~sg~~d~~v~~~~ 407 (456)
T KOG0266|consen 347 KYLLSASLDR-TLKLWDLRS-------G----------KSVGT-YTGHSNLVRCIFSPTLSTGGKLIYSGSEDGSVYVWD 407 (456)
T ss_pred cEEEEecCCC-eEEEEEccC-------C----------cceee-ecccCCcceeEecccccCCCCeEEEEeCCceEEEEe
Confidence 9999999954 599999953 2 12222 2444443 466677789999999999999999999
Q ss_pred CCCCCCccccccc
Q 001814 460 LSPFGGDSGFQTL 472 (1010)
Q Consensus 460 I~~~gg~~~~~~H 472 (1010)
+........+..|
T Consensus 408 ~~s~~~~~~l~~h 420 (456)
T KOG0266|consen 408 SSSGGILQRLEGH 420 (456)
T ss_pred CCccchhhhhcCC
Confidence 9986666666666
No 23
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.75 E-value=1.5e-16 Score=167.95 Aligned_cols=229 Identities=17% Similarity=0.205 Sum_probs=162.0
Q ss_pred CCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEEeCCeEEEEECCCCce--eEEEeecCCccccCCCcccccc
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVGLATQIYCFDALTLEN--KFSVLTYPVPQLAGQGAVGINV 245 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~--l~tL~t~p~p~~~~~g~~~vnv 245 (1010)
++.|||||.+.+|.|..++++. +.|..+.+.+ +.||++....|++||+.+... +.++..+..
T Consensus 18 YDhTIRfWqa~tG~C~rTiqh~dsqVNrLeiTpdk~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~k------------- 84 (311)
T KOG0315|consen 18 YDHTIRFWQALTGICSRTIQHPDSQVNRLEITPDKKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTK------------- 84 (311)
T ss_pred CcceeeeeehhcCeEEEEEecCccceeeEEEcCCcchhhhccCCeeEEEEccCCCCCceeEEeccCC-------------
Confidence 4789999999999999999997 5898888877 689999999999999998653 444444322
Q ss_pred CccceEEcc----ceEEEccC--CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeecccccc
Q 001814 246 GYGPMAVGP----RWLAYASN--TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQE 319 (1010)
Q Consensus 246 ~~gplAlgp----RwLAyas~--~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~ 319 (1010)
+..++|- ||+..++. .+.|||.-. ++-.+.- ...+|.+..
T Consensus 85 --NVtaVgF~~dgrWMyTgseDgt~kIWdlR~--~~~qR~~---~~~spVn~v--------------------------- 130 (311)
T KOG0315|consen 85 --NVTAVGFQCDGRWMYTGSEDGTVKIWDLRS--LSCQRNY---QHNSPVNTV--------------------------- 130 (311)
T ss_pred --ceEEEEEeecCeEEEecCCCceEEEEeccC--cccchhc---cCCCCcceE---------------------------
Confidence 3344433 99988875 478898411 1111100 000111100
Q ss_pred ccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc-CCCCeEEEEECCCCCEEEEEEcCCCeEEEEe
Q 001814 320 LLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA-HTSPISALCFDPSGTLLVTASVYGNNINIFR 398 (1010)
Q Consensus 320 l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a-HtspIsaLaFSPdGtlLATAS~dGt~IrVwd 398 (1010)
.+..++. -+++++++|.|+|||+.....-.++.. -..+|.+|+..|||++|+.+-.+|+ ..||+
T Consensus 131 --------vlhpnQt------eLis~dqsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~-cyvW~ 195 (311)
T KOG0315|consen 131 --------VLHPNQT------ELISGDQSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGN-CYVWR 195 (311)
T ss_pred --------EecCCcc------eEEeecCCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCcc-EEEEE
Confidence 0011110 235788999999999998866655543 4579999999999999999999887 78999
Q ss_pred CCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC-CCccccccccC
Q 001814 399 IMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF-GGDSGFQTLSS 474 (1010)
Q Consensus 399 i~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~-gg~~~~~~H~s 474 (1010)
+.... . .+..+.+.+|+. +..-|..+-||||+++||++|.|.||+||.++.+ +++..+..|..
T Consensus 196 l~~~~-----~------~s~l~P~~k~~a--h~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~gh~r 259 (311)
T KOG0315|consen 196 LLNHQ-----T------ASELEPVHKFQA--HNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLTGHQR 259 (311)
T ss_pred ccCCC-----c------cccceEhhheec--ccceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEeecCCc
Confidence 86421 1 122345556543 3446888999999999999999999999999987 77777777753
No 24
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.74 E-value=1.8e-15 Score=187.41 Aligned_cols=247 Identities=15% Similarity=0.106 Sum_probs=167.2
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCC---Ccce--Eee-eeccCCEEEEEEecCCCCCCCCCC
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDA---SNFN--ELV-SKRDGPVSFLQMQPFPVKDDGCEG 125 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~---g~v~--ell-S~hdGpV~~v~~lP~p~~s~~~D~ 125 (1010)
..|.+.|.-+.|+. ++++|++|..+| ++|||++.. +... .++ -.+...|..+.+.|..
T Consensus 480 ~~~~~~V~~i~fs~-------dg~~latgg~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~-------- 544 (793)
T PLN00181 480 LNSSNLVCAIGFDR-------DGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYI-------- 544 (793)
T ss_pred cCCCCcEEEEEECC-------CCCEEEEEeCCCEEEEEECCcccccccccccceEEecccCceeeEEeccCC--------
Confidence 34777888888874 456777777655 899998431 0000 001 1123456666665421
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP-- 202 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~-- 202 (1010)
..+||..+ .+++|+|||+.+++.+..++.| ..|++|+|++
T Consensus 545 ----~~~las~~---------------------------------~Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~ 587 (793)
T PLN00181 545 ----KSQVASSN---------------------------------FEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSAD 587 (793)
T ss_pred ----CCEEEEEe---------------------------------CCCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCC
Confidence 13455321 2478999999999999998766 5899999985
Q ss_pred -CeEEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCC
Q 001814 203 -RIVAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPS 280 (1010)
Q Consensus 203 -rlLAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p 280 (1010)
.+|+++ .++.|++||+.+.+.+.++..... .. .+++...
T Consensus 588 ~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~----------------v~-----~v~~~~~------------------ 628 (793)
T PLN00181 588 PTLLASGSDDGSVKLWSINQGVSIGTIKTKAN----------------IC-----CVQFPSE------------------ 628 (793)
T ss_pred CCEEEEEcCCCEEEEEECCCCcEEEEEecCCC----------------eE-----EEEEeCC------------------
Confidence 466665 467899999998877666543211 00 1111110
Q ss_pred CCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc
Q 001814 281 GVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA 360 (1010)
Q Consensus 281 ~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~ 360 (1010)
+ +. .+++++.||.|++||+.+.+
T Consensus 629 --------~-g~------------------------------------------------~latgs~dg~I~iwD~~~~~ 651 (793)
T PLN00181 629 --------S-GR------------------------------------------------SLAFGSADHKVYYYDLRNPK 651 (793)
T ss_pred --------C-CC------------------------------------------------EEEEEeCCCeEEEEECCCCC
Confidence 0 11 12345689999999998765
Q ss_pred -EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE
Q 001814 361 -IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF 439 (1010)
Q Consensus 361 -~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF 439 (1010)
.+..+.+|..+|.+++|. ++.+|+|++.||+ |+|||+.... .+ . ....+..+ .|+. ..|..++|
T Consensus 652 ~~~~~~~~h~~~V~~v~f~-~~~~lvs~s~D~~-ikiWd~~~~~----~~---~----~~~~l~~~-~gh~-~~i~~v~~ 716 (793)
T PLN00181 652 LPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNT-LKLWDLSMSI----SG---I----NETPLHSF-MGHT-NVKNFVGL 716 (793)
T ss_pred ccceEecCCCCCEEEEEEe-CCCEEEEEECCCE-EEEEeCCCCc----cc---c----CCcceEEE-cCCC-CCeeEEEE
Confidence 567889999999999997 7889999999765 9999985310 00 0 01245555 4543 56899999
Q ss_pred ccCCCEEEEEeCCCeEEEEeCCC
Q 001814 440 SHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 440 SpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+|++++||+|+.|++|+||+...
T Consensus 717 s~~~~~lasgs~D~~v~iw~~~~ 739 (793)
T PLN00181 717 SVSDGYIATGSETNEVFVYHKAF 739 (793)
T ss_pred cCCCCEEEEEeCCCEEEEEECCC
Confidence 99999999999999999999653
No 25
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.73 E-value=8.8e-16 Score=164.33 Aligned_cols=227 Identities=15% Similarity=0.204 Sum_probs=160.2
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
++++..+.|..++ +++||++ .++....+.+|.--|-.++|.|+.. . +|+|.
T Consensus 73 ~dg~~alS~swD~~lrlWDl~-~g~~t~~f~GH~~dVlsva~s~dn~------------q---ivSGS------------ 124 (315)
T KOG0279|consen 73 SDGNFALSASWDGTLRLWDLA-TGESTRRFVGHTKDVLSVAFSTDNR------------Q---IVSGS------------ 124 (315)
T ss_pred cCCceEEeccccceEEEEEec-CCcEEEEEEecCCceEEEEecCCCc------------e---eecCC------------
Confidence 4667777776665 8999995 5688889999999999999998751 1 23321
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC--CCcEEEEEEcCC----eEEE-EeCCeEEEEECCCCcee
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF--RSSVCMVRCSPR----IVAV-GLATQIYCFDALTLENK 224 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f--~S~V~sVa~S~r----lLAV-~ld~~I~IwD~~Tle~l 224 (1010)
-++++++||...+......+. +..|.+|+|+|+ +|+. +-|+.+++||+.+.+..
T Consensus 125 -------------------rDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~ 185 (315)
T KOG0279|consen 125 -------------------RDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLR 185 (315)
T ss_pred -------------------CcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchh
Confidence 258999999976544333333 568999999985 4444 45678999999998877
Q ss_pred EEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhh
Q 001814 225 FSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQ 304 (1010)
Q Consensus 225 ~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~ 304 (1010)
.++..+..- .+.++++ |+ |++
T Consensus 186 ~~~~gh~~~-------------v~t~~vS----------------------------------pD-Gsl----------- 206 (315)
T KOG0279|consen 186 TTFIGHSGY-------------VNTVTVS----------------------------------PD-GSL----------- 206 (315)
T ss_pred hcccccccc-------------EEEEEEC----------------------------------CC-CCE-----------
Confidence 666554320 1223332 11 111
Q ss_pred hhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEE
Q 001814 305 FAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLL 384 (1010)
Q Consensus 305 la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlL 384 (1010)
.++|+.||.+.+||+..++.+..|. |..+|.+|+|+|+--.|
T Consensus 207 -------------------------------------casGgkdg~~~LwdL~~~k~lysl~-a~~~v~sl~fspnrywL 248 (315)
T KOG0279|consen 207 -------------------------------------CASGGKDGEAMLWDLNEGKNLYSLE-AFDIVNSLCFSPNRYWL 248 (315)
T ss_pred -------------------------------------EecCCCCceEEEEEccCCceeEecc-CCCeEeeEEecCCceeE
Confidence 1246789999999999999876655 67899999999997777
Q ss_pred EEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe---ccc----ccccEEEEEEccCCCEEEEEeCCCeEEE
Q 001814 385 VTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH---RGI----TSATIQDICFSHYSQWIAIVSSKGTCHV 457 (1010)
Q Consensus 385 ATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~---RG~----t~a~I~sIAFSpDg~~LAsgS~dGTVhI 457 (1010)
+.|- ++.|+||++.+. ..+.+|+ .|. ....-.++|||+||+.|.++-.|+.|++
T Consensus 249 ~~at--~~sIkIwdl~~~-----------------~~v~~l~~d~~g~s~~~~~~~clslaws~dG~tLf~g~td~~irv 309 (315)
T KOG0279|consen 249 CAAT--ATSIKIWDLESK-----------------AVVEELKLDGIGPSSKAGDPICLSLAWSADGQTLFAGYTDNVIRV 309 (315)
T ss_pred eecc--CCceEEEeccch-----------------hhhhhccccccccccccCCcEEEEEEEcCCCcEEEeeecCCcEEE
Confidence 6554 566999999752 1222221 111 0123467899999999999999999999
Q ss_pred EeCCC
Q 001814 458 FVLSP 462 (1010)
Q Consensus 458 w~I~~ 462 (1010)
|.+..
T Consensus 310 ~qv~~ 314 (315)
T KOG0279|consen 310 WQVAK 314 (315)
T ss_pred EEeec
Confidence 99853
No 26
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.72 E-value=5.6e-16 Score=174.42 Aligned_cols=215 Identities=17% Similarity=0.238 Sum_probs=148.1
Q ss_pred CCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC---eEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR---IVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGY 247 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r---lLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~ 247 (1010)
++.+|||+. .|..+.+|.+| ++|.+++++++ +|..+-|+++.+||+.+++..+...-+..|
T Consensus 256 ~G~~riw~~-~G~l~~tl~~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g~~~q~f~~~s~~-------------- 320 (524)
T KOG0273|consen 256 DGEARIWNK-DGNLISTLGQHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTGTVKQQFEFHSAP-------------- 320 (524)
T ss_pred CcEEEEEec-CchhhhhhhccCCceEEEEEcCCCCEEEeccCCccEEEEeccCceEEEeeeeccCC--------------
Confidence 478999998 56778888765 69999999983 444566789999999888776655544332
Q ss_pred cceEEccceEE---EccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCC
Q 001814 248 GPMAVGPRWLA---YASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDG 324 (1010)
Q Consensus 248 gplAlgpRwLA---yas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~g 324 (1010)
++...|.- |+.. ++++.-.|.+...+ +=.+|+.++-..
T Consensus 321 ---~lDVdW~~~~~F~ts------------------------~td~~i~V~kv~~~-------~P~~t~~GH~g~----- 361 (524)
T KOG0273|consen 321 ---ALDVDWQSNDEFATS------------------------STDGCIHVCKVGED-------RPVKTFIGHHGE----- 361 (524)
T ss_pred ---ccceEEecCceEeec------------------------CCCceEEEEEecCC-------CcceeeecccCc-----
Confidence 11112221 1110 00101111111100 001333333221
Q ss_pred CCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCC---------CCEEEEEEcCCCeEE
Q 001814 325 SSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPS---------GTLLVTASVYGNNIN 395 (1010)
Q Consensus 325 s~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPd---------GtlLATAS~dGt~Ir 395 (1010)
.+.+-.++.++ .+++++.|++++||..........|++|...|..+.++|+ |..||+|+.+++ ++
T Consensus 362 -V~alk~n~tg~----LLaS~SdD~TlkiWs~~~~~~~~~l~~Hskei~t~~wsp~g~v~~n~~~~~~l~sas~dst-V~ 435 (524)
T KOG0273|consen 362 -VNALKWNPTGS----LLASCSDDGTLKIWSMGQSNSVHDLQAHSKEIYTIKWSPTGPVTSNPNMNLMLASASFDST-VK 435 (524)
T ss_pred -eEEEEECCCCc----eEEEecCCCeeEeeecCCCcchhhhhhhccceeeEeecCCCCccCCCcCCceEEEeecCCe-EE
Confidence 22222233222 3468899999999999999999999999999999999995 468999999655 99
Q ss_pred EEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 396 IFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 396 Vwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
+||+.. | ++++.|.++ ...|++++|||||++||+|+.||-||||++..+..
T Consensus 436 lwdv~~-------g----------v~i~~f~kH--~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l 486 (524)
T KOG0273|consen 436 LWDVES-------G----------VPIHTLMKH--QEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKL 486 (524)
T ss_pred EEEccC-------C----------ceeEeeccC--CCceEEEEecCCCcEEEecCCCCeeEeccccchhe
Confidence 999964 5 688998764 46899999999999999999999999999987654
No 27
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.71 E-value=5.4e-16 Score=183.43 Aligned_cols=303 Identities=16% Similarity=0.170 Sum_probs=202.7
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
+|....+|.-..|. |.+-.++++..+| ||+||.- -+.+.+-+..|||||+.|.|-| .+
T Consensus 5 fEskSsRvKglsFH-------P~rPwILtslHsG~IQlWDYR-M~tli~rFdeHdGpVRgv~FH~-------------~q 63 (1202)
T KOG0292|consen 5 FESKSSRVKGLSFH-------PKRPWILTSLHSGVIQLWDYR-MGTLIDRFDEHDGPVRGVDFHP-------------TQ 63 (1202)
T ss_pred hhcccccccceecC-------CCCCEEEEeecCceeeeehhh-hhhHHhhhhccCCccceeeecC-------------CC
Confidence 45566677777785 4677899999888 8999984 4667788889999999999665 45
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC--eEEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR--IVAV 207 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r--lLAV 207 (1010)
||. |+|++ +.+|++|+.++.+|+-+|..| ..|+.+.|.+. .+++
T Consensus 64 plF--VSGGD-------------------------------DykIkVWnYk~rrclftL~GHlDYVRt~~FHheyPWIlS 110 (1202)
T KOG0292|consen 64 PLF--VSGGD-------------------------------DYKIKVWNYKTRRCLFTLLGHLDYVRTVFFHHEYPWILS 110 (1202)
T ss_pred CeE--EecCC-------------------------------ccEEEEEecccceehhhhccccceeEEeeccCCCceEEE
Confidence 763 34421 479999999999999999876 68999999873 4555
Q ss_pred E-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEcc--CCeeeccCCccCCCcCCCCCC
Q 001814 208 G-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYAS--NTLLLSNSGRLSPQNLTPSGV 282 (1010)
Q Consensus 208 ~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas--~~~~iwd~G~vs~Q~lt~p~v 282 (1010)
+ .|.+|+||+-.+.+++-++.+|--- .-+..|-| ..++.++ .++++||.+++--.+..| +
T Consensus 111 ASDDQTIrIWNwqsr~~iavltGHnHY-------------VMcAqFhptEDlIVSaSLDQTVRVWDisGLRkk~~~p-g- 175 (1202)
T KOG0292|consen 111 ASDDQTIRIWNWQSRKCIAVLTGHNHY-------------VMCAQFHPTEDLIVSASLDQTVRVWDISGLRKKNKAP-G- 175 (1202)
T ss_pred ccCCCeEEEEeccCCceEEEEecCceE-------------EEeeccCCccceEEEecccceEEEEeecchhccCCCC-C-
Confidence 4 4567999999999999888876331 01122344 4555554 468999964431111111 1
Q ss_pred CCCc----CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC
Q 001814 283 SPST----SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT 358 (1010)
Q Consensus 283 S~st----SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s 358 (1010)
+... .+.++.|-.. +.-..|++..|+ -. ...-.+|.|.- -.+.+|+.|..|++|....
T Consensus 176 ~~e~~~~~~~~~~dLfg~-~DaVVK~VLEGH-------DR------GVNwaAfhpTl----pliVSG~DDRqVKlWrmne 237 (1202)
T KOG0292|consen 176 SLEDQMRGQQGNSDLFGQ-TDAVVKHVLEGH-------DR------GVNWAAFHPTL----PLIVSGADDRQVKLWRMNE 237 (1202)
T ss_pred CchhhhhccccchhhcCC-cCeeeeeeeccc-------cc------ccceEEecCCc----ceEEecCCcceeeEEEecc
Confidence 0000 0000000000 000012222222 11 01111122110 0245888999999999976
Q ss_pred Cc--EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEE
Q 001814 359 RA--IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQD 436 (1010)
Q Consensus 359 ~~--~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~s 436 (1010)
-+ .+-+.++|..+|+++-|+|.-.++++.|+|+ .|||||+... ..+++|||- +.+-|.
T Consensus 238 tKaWEvDtcrgH~nnVssvlfhp~q~lIlSnsEDk-sirVwDm~kR-----------------t~v~tfrre--ndRFW~ 297 (1202)
T KOG0292|consen 238 TKAWEVDTCRGHYNNVSSVLFHPHQDLILSNSEDK-SIRVWDMTKR-----------------TSVQTFRRE--NDRFWI 297 (1202)
T ss_pred ccceeehhhhcccCCcceEEecCccceeEecCCCc-cEEEEecccc-----------------cceeeeecc--CCeEEE
Confidence 54 4678899999999999999999999999965 5999998531 357788885 456799
Q ss_pred EEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 437 ICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 437 IAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
|+-.|..+.+|+|-+.|. -||.++-
T Consensus 298 laahP~lNLfAAgHDsGm-~VFkleR 322 (1202)
T KOG0292|consen 298 LAAHPELNLFAAGHDSGM-IVFKLER 322 (1202)
T ss_pred EEecCCcceeeeecCCce-EEEEEcc
Confidence 999999999999987775 4899873
No 28
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.71 E-value=3.2e-17 Score=178.61 Aligned_cols=229 Identities=18% Similarity=0.216 Sum_probs=167.6
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEE
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFL 133 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLL 133 (1010)
..+.|+.+.+|. ..++.|.. ++++|||..+ -.+..+|.+|.|.|-|+++- . . |
T Consensus 196 ~skgVYClQYDD---------~kiVSGlrDnTikiWD~n~-~~c~~~L~GHtGSVLCLqyd-------------~--r-v 249 (499)
T KOG0281|consen 196 NSKGVYCLQYDD---------EKIVSGLRDNTIKIWDKNS-LECLKILTGHTGSVLCLQYD-------------E--R-V 249 (499)
T ss_pred cCCceEEEEecc---------hhhhcccccCceEEecccc-HHHHHhhhcCCCcEEeeecc-------------c--e-E
Confidence 456677777762 35666664 6799999954 55778899999999999965 1 1 3
Q ss_pred EEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCCeEEEEeC-C
Q 001814 134 LVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPRIVAVGLA-T 211 (1010)
Q Consensus 134 AvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~rlLAV~ld-~ 211 (1010)
+ ++|. ++.||++||..+|++++++-+| ..|+.++|+..+++.+.. .
T Consensus 250 i-isGS-------------------------------SDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDr 297 (499)
T KOG0281|consen 250 I-VSGS-------------------------------SDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDR 297 (499)
T ss_pred E-EecC-------------------------------CCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCc
Confidence 3 3431 3679999999999999998766 589999999988887765 4
Q ss_pred eEEEEECCCCcee---EEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCC
Q 001814 212 QIYCFDALTLENK---FSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSP 288 (1010)
Q Consensus 212 ~I~IwD~~Tle~l---~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP 288 (1010)
.|.+||+.....+ +.|.+|-. ++ +.+.
T Consensus 298 siaVWdm~sps~it~rrVLvGHrA---------aV----NvVd------------------------------------- 327 (499)
T KOG0281|consen 298 SIAVWDMASPTDITLRRVLVGHRA---------AV----NVVD------------------------------------- 327 (499)
T ss_pred eeEEEeccCchHHHHHHHHhhhhh---------he----eeec-------------------------------------
Confidence 7999997643210 00000000 00 0000
Q ss_pred CCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH 368 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH 368 (1010)
+ +.| .+.+++.|.+|++|++.+++.+.++.+|
T Consensus 328 ------------------------------------------f--d~k----yIVsASgDRTikvW~~st~efvRtl~gH 359 (499)
T KOG0281|consen 328 ------------------------------------------F--DDK----YIVSASGDRTIKVWSTSTCEFVRTLNGH 359 (499)
T ss_pred ------------------------------------------c--ccc----eEEEecCCceEEEEeccceeeehhhhcc
Confidence 0 001 1335678999999999999999999999
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
...|.|+.+ .|++++++|.| .+||+||+.. | .+|..| .|+. .-|.+|-|. .+.|++
T Consensus 360 kRGIAClQY--r~rlvVSGSSD-ntIRlwdi~~-------G----------~cLRvL-eGHE-eLvRciRFd--~krIVS 415 (499)
T KOG0281|consen 360 KRGIACLQY--RDRLVVSGSSD-NTIRLWDIEC-------G----------ACLRVL-EGHE-ELVRCIRFD--NKRIVS 415 (499)
T ss_pred cccceehhc--cCeEEEecCCC-ceEEEEeccc-------c----------HHHHHH-hchH-Hhhhheeec--Cceeee
Confidence 999999987 58999999995 5699999963 5 466555 4643 468999984 689999
Q ss_pred EeCCCeEEEEeCCCC
Q 001814 449 VSSKGTCHVFVLSPF 463 (1010)
Q Consensus 449 gS~dGTVhIw~I~~~ 463 (1010)
|.-||+|+||++...
T Consensus 416 GaYDGkikvWdl~aa 430 (499)
T KOG0281|consen 416 GAYDGKIKVWDLQAA 430 (499)
T ss_pred ccccceEEEEecccc
Confidence 999999999999754
No 29
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.71 E-value=6.5e-16 Score=169.28 Aligned_cols=238 Identities=17% Similarity=0.240 Sum_probs=185.7
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.+|...|..+.||. ..+.+++|..++ ++|||+. +|.+...|.+|-..|+.|++++ .+|
T Consensus 148 ~gHlgWVr~vavdP-------~n~wf~tgs~DrtikIwDla-tg~LkltltGhi~~vr~vavS~-------------rHp 206 (460)
T KOG0285|consen 148 SGHLGWVRSVAVDP-------GNEWFATGSADRTIKIWDLA-TGQLKLTLTGHIETVRGVAVSK-------------RHP 206 (460)
T ss_pred hhccceEEEEeeCC-------CceeEEecCCCceeEEEEcc-cCeEEEeecchhheeeeeeecc-------------cCc
Confidence 34566666666763 467999999876 8999995 6889999999999999999774 458
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVG 208 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~ 208 (1010)
+|.. +++ ++.|+-|||..++.|+.+-.| +.|+++++.| ++|+.+
T Consensus 207 YlFs-~ge--------------------------------dk~VKCwDLe~nkvIR~YhGHlS~V~~L~lhPTldvl~t~ 253 (460)
T KOG0285|consen 207 YLFS-AGE--------------------------------DKQVKCWDLEYNKVIRHYHGHLSGVYCLDLHPTLDVLVTG 253 (460)
T ss_pred eEEE-ecC--------------------------------CCeeEEEechhhhhHHHhccccceeEEEeccccceeEEec
Confidence 8764 321 378999999999998887655 7999999998 566654
Q ss_pred -eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 209 -LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 209 -ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
.|..|+|||++|...++++.+|.+|.+ .+.+. +
T Consensus 254 grDst~RvWDiRtr~~V~~l~GH~~~V~-------------~V~~~------~--------------------------- 287 (460)
T KOG0285|consen 254 GRDSTIRVWDIRTRASVHVLSGHTNPVA-------------SVMCQ------P--------------------------- 287 (460)
T ss_pred CCcceEEEeeecccceEEEecCCCCcce-------------eEEee------c---------------------------
Confidence 566899999999999999988877410 00000 0
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
++| ++.+++.|++|++||+..++...++..
T Consensus 288 -------------------------------------------~dp-------qvit~S~D~tvrlWDl~agkt~~tlt~ 317 (460)
T KOG0285|consen 288 -------------------------------------------TDP-------QVITGSHDSTVRLWDLRAGKTMITLTH 317 (460)
T ss_pred -------------------------------------------CCC-------ceEEecCCceEEEeeeccCceeEeeec
Confidence 000 234578899999999999999999999
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
|...|.||+..|.-.++|+||.| +|+-|++ |. | ..+..| .| +++.|..++-..|+ .++
T Consensus 318 hkksvral~lhP~e~~fASas~d--nik~w~~-p~------g----------~f~~nl-sg-h~~iintl~~nsD~-v~~ 375 (460)
T KOG0285|consen 318 HKKSVRALCLHPKENLFASASPD--NIKQWKL-PE------G----------EFLQNL-SG-HNAIINTLSVNSDG-VLV 375 (460)
T ss_pred ccceeeEEecCCchhhhhccCCc--cceeccC-Cc------c----------chhhcc-cc-ccceeeeeeeccCc-eEE
Confidence 99999999999999999999983 4999998 43 4 344444 34 45789999999886 667
Q ss_pred EEeCCCeEEEEeCCC
Q 001814 448 IVSSKGTCHVFVLSP 462 (1010)
Q Consensus 448 sgS~dGTVhIw~I~~ 462 (1010)
+|+++|++..|+...
T Consensus 376 ~G~dng~~~fwdwks 390 (460)
T KOG0285|consen 376 SGGDNGSIMFWDWKS 390 (460)
T ss_pred EcCCceEEEEEecCc
Confidence 999999999999765
No 30
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.70 E-value=5.8e-15 Score=150.48 Aligned_cols=240 Identities=20% Similarity=0.227 Sum_probs=162.6
Q ss_pred eeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEe
Q 001814 100 LVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYS 179 (1010)
Q Consensus 100 llS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWD 179 (1010)
.+..|.++|.++.+.|++ .+|++.. .++.|++||
T Consensus 4 ~~~~h~~~i~~~~~~~~~-------------~~l~~~~---------------------------------~~g~i~i~~ 37 (289)
T cd00200 4 TLKGHTGGVTCVAFSPDG-------------KLLATGS---------------------------------GDGTIKVWD 37 (289)
T ss_pred HhcccCCCEEEEEEcCCC-------------CEEEEee---------------------------------cCcEEEEEE
Confidence 455788999999998764 3555421 136899999
Q ss_pred CCCCeEEEEEeCC-CcEEEEEEcCC--eEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc-
Q 001814 180 FQSHCYEHVLRFR-SSVCMVRCSPR--IVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP- 254 (1010)
Q Consensus 180 lktge~V~tL~f~-S~V~sVa~S~r--lLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp- 254 (1010)
+.+++.+..+..+ ..|..+.+.+. .|+++. ++.|++||+.+.+....+..+..+ ...+++.+
T Consensus 38 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~~-------------i~~~~~~~~ 104 (289)
T cd00200 38 LETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSY-------------VSSVAFSPD 104 (289)
T ss_pred eeCCCcEEEEecCCcceeEEEECCCCCEEEEEcCCCeEEEEEcCcccceEEEeccCCc-------------EEEEEEcCC
Confidence 9999887777665 57779999873 566554 789999999987766666644321 23355655
Q ss_pred -ceEEEcc--CCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccC
Q 001814 255 -RWLAYAS--NTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSP 331 (1010)
Q Consensus 255 -RwLAyas--~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~ 331 (1010)
++++.++ ..+.+|+.... ..+ ..+..+. .....+.+
T Consensus 105 ~~~~~~~~~~~~i~~~~~~~~-------------------~~~----------------~~~~~~~------~~i~~~~~ 143 (289)
T cd00200 105 GRILSSSSRDKTIKVWDVETG-------------------KCL----------------TTLRGHT------DWVNSVAF 143 (289)
T ss_pred CCEEEEecCCCeEEEEECCCc-------------------EEE----------------EEeccCC------CcEEEEEE
Confidence 5666554 24566763100 000 0000000 00001111
Q ss_pred CCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCC
Q 001814 332 NSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNH 411 (1010)
Q Consensus 332 s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~ 411 (1010)
++..+ .++.+..+|.|.+||+.+++.+..+..|..+|.+++|+|+|+.|++++.+|. |++|++.. +
T Consensus 144 ~~~~~----~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~-i~i~d~~~-------~-- 209 (289)
T cd00200 144 SPDGT----FVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT-IKLWDLST-------G-- 209 (289)
T ss_pred cCcCC----EEEEEcCCCcEEEEEccccccceeEecCccccceEEECCCcCEEEEecCCCc-EEEEECCC-------C--
Confidence 12111 1234456999999999999999999999999999999999999999998665 99999853 2
Q ss_pred ccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 412 KYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 412 ~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..+..+. + +...|.+++|+||+.++++++.+|++++|++...
T Consensus 210 --------~~~~~~~-~-~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~ 251 (289)
T cd00200 210 --------KCLGTLR-G-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTG 251 (289)
T ss_pred --------ceecchh-h-cCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCCc
Confidence 2333332 2 2347999999999999999998999999999764
No 31
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.70 E-value=6.1e-16 Score=181.07 Aligned_cols=259 Identities=14% Similarity=0.169 Sum_probs=179.6
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEE
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFL 133 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLL 133 (1010)
-.|+|+.++|= ++.-+.|+++...+ +|++++. .-.+ .++.+|+..|-.+... .+ .-||
T Consensus 322 ~ndEI~Dm~~l------G~e~~~laVATNs~~lr~y~~~-~~~c-~ii~GH~e~vlSL~~~------------~~-g~ll 380 (775)
T KOG0319|consen 322 YNDEILDMKFL------GPEESHLAVATNSPELRLYTLP-TSYC-QIIPGHTEAVLSLDVW------------SS-GDLL 380 (775)
T ss_pred Cchhheeeeec------CCccceEEEEeCCCceEEEecC-CCce-EEEeCchhheeeeeec------------cc-CcEE
Confidence 44455555542 33457899998876 9999984 3445 4999999999888733 11 1355
Q ss_pred EEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCe----EEEEEeCC-CcEEEEEEcC---CeE
Q 001814 134 LVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHC----YEHVLRFR-SSVCMVRCSP---RIV 205 (1010)
Q Consensus 134 AvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge----~V~tL~f~-S~V~sVa~S~---rlL 205 (1010)
+.++ .|++|+||-+..+. ++.....| +.|.+|+++. .++
T Consensus 381 at~s---------------------------------KD~svilWr~~~~~~~~~~~a~~~gH~~svgava~~~~~asff 427 (775)
T KOG0319|consen 381 ATGS---------------------------------KDKSVILWRLNNNCSKSLCVAQANGHTNSVGAVAGSKLGASFF 427 (775)
T ss_pred EEec---------------------------------CCceEEEEEecCCcchhhhhhhhcccccccceeeecccCccEE
Confidence 5321 35899999774333 34444444 5899999976 355
Q ss_pred E-EEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCC
Q 001814 206 A-VGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSP 284 (1010)
Q Consensus 206 A-V~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~ 284 (1010)
+ ++.|..|++|++..-+.. . .++.+-.|+.+-+.+
T Consensus 428 vsvS~D~tlK~W~l~~s~~~-----~-----------------~~~~~~~~~t~~aHd---------------------- 463 (775)
T KOG0319|consen 428 VSVSQDCTLKLWDLPKSKET-----A-----------------FPIVLTCRYTERAHD---------------------- 463 (775)
T ss_pred EEecCCceEEEecCCCcccc-----c-----------------ccceehhhHHHHhhc----------------------
Confidence 5 567778999998752210 0 011111111000000
Q ss_pred CcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEE
Q 001814 285 STSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQ 364 (1010)
Q Consensus 285 stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~ 364 (1010)
|++ ++..++.| . ..++++++|.+.+||++..+....+
T Consensus 464 ------------------KdI-------------------N~Vaia~n--d----kLiAT~SqDktaKiW~le~~~l~~v 500 (775)
T KOG0319|consen 464 ------------------KDI-------------------NCVAIAPN--D----KLIATGSQDKTAKIWDLEQLRLLGV 500 (775)
T ss_pred ------------------ccc-------------------cceEecCC--C----ceEEecccccceeeecccCceEEEE
Confidence 000 00011111 1 1246789999999999999999999
Q ss_pred eccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCC
Q 001814 365 FKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQ 444 (1010)
Q Consensus 365 ~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~ 444 (1010)
|.+|+..|.++.|+|..++|||||.|. +|+||.|.+ . .++.+| .|++ +.|...+|-.+++
T Consensus 501 LsGH~RGvw~V~Fs~~dq~laT~SgD~-TvKIW~is~-------f----------SClkT~-eGH~-~aVlra~F~~~~~ 560 (775)
T KOG0319|consen 501 LSGHTRGVWCVSFSKNDQLLATCSGDK-TVKIWSIST-------F----------SCLKTF-EGHT-SAVLRASFIRNGK 560 (775)
T ss_pred eeCCccceEEEEeccccceeEeccCCc-eEEEEEecc-------c----------eeeeee-cCcc-ceeEeeeeeeCCc
Confidence 999999999999999999999999965 599999964 2 478787 5765 4589999999999
Q ss_pred EEEEEeCCCeEEEEeCCCCCCccccccccC
Q 001814 445 WIAIVSSKGTCHVFVLSPFGGDSGFQTLSS 474 (1010)
Q Consensus 445 ~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s 474 (1010)
.|.++..||-++||++....+..++..|+.
T Consensus 561 qliS~~adGliKlWnikt~eC~~tlD~H~D 590 (775)
T KOG0319|consen 561 QLISAGADGLIKLWNIKTNECEMTLDAHND 590 (775)
T ss_pred EEEeccCCCcEEEEeccchhhhhhhhhccc
Confidence 999999999999999999999999999974
No 32
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.70 E-value=8.1e-15 Score=171.88 Aligned_cols=330 Identities=15% Similarity=0.189 Sum_probs=195.8
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcc-eEeeeeccCCEEEEEEecCCCCCCCCCCcc-cc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNF-NELVSKRDGPVSFLQMQPFPVKDDGCEGFR-KL 129 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v-~ellS~hdGpV~~v~~lP~p~~s~~~D~F~-~s 129 (1010)
-+|-|.|.|+.|-. +.++|++|..+- .+||+++...++ ...+.+|.++|-...|-.+... -|- ++
T Consensus 142 ~g~fddi~si~Ws~-------DSr~l~~gsrD~s~rl~~v~~~k~~~~~~l~gHkd~VvacfF~~~~~~-----l~tvsk 209 (893)
T KOG0291|consen 142 LGHFDDITSIDWSD-------DSRLLVTGSRDLSARLFGVDGNKNLFTYALNGHKDYVVACFFGANSLD-----LYTVSK 209 (893)
T ss_pred cCCccceeEEEecc-------CCceEEeccccceEEEEEeccccccceEeccCCCcceEEEEeccCcce-----EEEEec
Confidence 45667777777753 678999988764 899999865543 4567888888877666533211 000 01
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcC---CCCCCCCCCCCEEEEEeCCCCeEEEEEeCC---CcEEEEEEcC-
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMD---SQSGNCVNSPTAVRFYSFQSHCYEHVLRFR---SSVCMVRCSP- 202 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d---~~~~~~~~sp~tVrIWDlktge~V~tL~f~---S~V~sVa~S~- 202 (1010)
.-+|.++.++.. ++. +... .-. .+|+-| ..++.- .-.++ +|-.. +..-|. +.|.+.+|++
T Consensus 210 dG~l~~W~~~~~-P~~--~~~~-~kd-~eg~~d~~~~~~~Ee--k~~~~-~~~k~-----~k~~ln~~~~kvtaa~fH~~ 276 (893)
T KOG0291|consen 210 DGALFVWTCDLR-PPE--LDKA-EKD-EEGSDDEEMDEDGEE--KTHKI-FWYKT-----KKHYLNQNSSKVTAAAFHKG 276 (893)
T ss_pred CceEEEEEecCC-Ccc--cccc-ccc-ccccccccccccchh--hhcce-EEEEE-----EeeeecccccceeeeeccCC
Confidence 113344444311 000 0000 000 001000 000000 00111 22111 111122 6889999998
Q ss_pred -CeEEEEeCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEc--cceEEEccCC---eeeccCCccCCC
Q 001814 203 -RIVAVGLAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVG--PRWLAYASNT---LLLSNSGRLSPQ 275 (1010)
Q Consensus 203 -rlLAV~ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlg--pRwLAyas~~---~~iwd~G~vs~Q 275 (1010)
++|||+... .+++|++....+++.+.-...+ ...+++. ..|||+.... +.+|..-..
T Consensus 277 t~~lvvgFssG~f~LyelP~f~lih~LSis~~~-------------I~t~~~N~tGDWiA~g~~klgQLlVweWqsE--- 340 (893)
T KOG0291|consen 277 TNLLVVGFSSGEFGLYELPDFNLIHSLSISDQK-------------ILTVSFNSTGDWIAFGCSKLGQLLVWEWQSE--- 340 (893)
T ss_pred ceEEEEEecCCeeEEEecCCceEEEEeecccce-------------eeEEEecccCCEEEEcCCccceEEEEEeecc---
Confidence 688999875 5679999999999888743332 2346665 4899999864 566752100
Q ss_pred cCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEE
Q 001814 276 NLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKD 355 (1010)
Q Consensus 276 ~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwD 355 (1010)
++|- +.+++...+ ..+.-+|+++ .+++|..||.|+|||
T Consensus 341 ----------------sYVl----------------KQQgH~~~i------~~l~YSpDgq----~iaTG~eDgKVKvWn 378 (893)
T KOG0291|consen 341 ----------------SYVL----------------KQQGHSDRI------TSLAYSPDGQ----LIATGAEDGKVKVWN 378 (893)
T ss_pred ----------------ceee----------------eccccccce------eeEEECCCCc----EEEeccCCCcEEEEe
Confidence 0100 011111111 0111122222 245788999999999
Q ss_pred CCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc-----c-----------CCCC------CCc-
Q 001814 356 FVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM-----R-----------SGSG------NHK- 412 (1010)
Q Consensus 356 l~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~-----~-----------~~sG------~~~- 412 (1010)
..++-++.+|.-|++.|++++|+.+|+.|+++|-||+ +|.||+..... . +.+| +..
T Consensus 379 ~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGt-VRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~ 457 (893)
T KOG0291|consen 379 TQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGT-VRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDS 457 (893)
T ss_pred ccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCe-EEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccce
Confidence 9999999999999999999999999999999999887 89999975310 0 0011 000
Q ss_pred ---cccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 413 ---YDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 413 ---~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
.-|+-.--++.....|| .+.|.+++|+|++..||++|=|.||++|++-...+.+
T Consensus 458 F~IfvWS~qTGqllDiLsGH-EgPVs~l~f~~~~~~LaS~SWDkTVRiW~if~s~~~v 514 (893)
T KOG0291|consen 458 FEIFVWSVQTGQLLDILSGH-EGPVSGLSFSPDGSLLASGSWDKTVRIWDIFSSSGTV 514 (893)
T ss_pred EEEEEEEeecCeeeehhcCC-CCcceeeEEccccCeEEeccccceEEEEEeeccCcee
Confidence 00221111222223464 4689999999999999999999999999997665544
No 33
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.69 E-value=1.3e-16 Score=177.14 Aligned_cols=248 Identities=17% Similarity=0.220 Sum_probs=168.1
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
+.++-|++|.+.| |-+|+.. .=+++.++..||.+|+++++++++ ..+| ++|.
T Consensus 106 PeGRRLltgs~SGEFtLWNg~-~fnFEtilQaHDs~Vr~m~ws~~g--------------~wmi-SgD~----------- 158 (464)
T KOG0284|consen 106 PEGRRLLTGSQSGEFTLWNGT-SFNFETILQAHDSPVRTMKWSHNG--------------TWMI-SGDK----------- 158 (464)
T ss_pred CCCceeEeecccccEEEecCc-eeeHHHHhhhhcccceeEEEccCC--------------CEEE-EcCC-----------
Confidence 3556677777766 9999984 445777889999999999999765 2343 4431
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe-CC-CcEEEEEEcC---CeEEEEeCCeEEEEECCCCceeEE
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR-FR-SSVCMVRCSP---RIVAVGLATQIYCFDALTLENKFS 226 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~-f~-S~V~sVa~S~---rlLAV~ld~~I~IwD~~Tle~l~t 226 (1010)
++.|++|+..-. .|+.++ ++ ..|++++|++ +++.++.|+.|+|||....+....
T Consensus 159 --------------------gG~iKyWqpnmn-nVk~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~v 217 (464)
T KOG0284|consen 159 --------------------GGMIKYWQPNMN-NVKIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEERV 217 (464)
T ss_pred --------------------CceEEecccchh-hhHHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCCchhhe
Confidence 368999998543 344443 44 5899999998 466667778999999876665554
Q ss_pred EeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhh
Q 001814 227 VLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFA 306 (1010)
Q Consensus 227 L~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la 306 (1010)
|..|-- - ++.++ |+ |. ..
T Consensus 218 L~GHgw---------------d-----Vksvd--------WH-------------------P~-kg-------------- 235 (464)
T KOG0284|consen 218 LRGHGW---------------D-----VKSVD--------WH-------------------PT-KG-------------- 235 (464)
T ss_pred eccCCC---------------C-----cceec--------cC-------------------Cc-cc--------------
Confidence 543211 0 00000 11 00 01
Q ss_pred cccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEE
Q 001814 307 AGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVT 386 (1010)
Q Consensus 307 ~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLAT 386 (1010)
-+++++.|..|++||-.++.++.++.+|+.-|.++.|+|+|.+|+|
T Consensus 236 ----------------------------------LiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n~N~Llt 281 (464)
T KOG0284|consen 236 ----------------------------------LIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPNGNWLLT 281 (464)
T ss_pred ----------------------------------eeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCCCCeeEE
Confidence 1235667889999999999999999999999999999999999999
Q ss_pred EEcCCCeEEEEeCCCCc---c-c-------CC-------CCCCccccCCcceE--EE------EEecccccccEEEEEEc
Q 001814 387 ASVYGNNINIFRIMPSC---M-R-------SG-------SGNHKYDWNSSHVH--LY------KLHRGITSATIQDICFS 440 (1010)
Q Consensus 387 AS~dGt~IrVwdi~p~~---~-~-------~~-------sG~~~~~~~~s~~~--L~------~L~RG~t~a~I~sIAFS 440 (1010)
+|. +..++|||+.... . . .. +--.+..|+.+..+ +. ...-+ +...|++++|.
T Consensus 282 ~sk-D~~~kv~DiR~mkEl~~~r~Hkkdv~~~~WhP~~~~lftsgg~Dgsvvh~~v~~~~p~~~i~~A-Hd~~iwsl~~h 359 (464)
T KOG0284|consen 282 GSK-DQSCKVFDIRTMKELFTYRGHKKDVTSLTWHPLNESLFTSGGSDGSVVHWVVGLEEPLGEIPPA-HDGEIWSLAYH 359 (464)
T ss_pred ccC-CceEEEEehhHhHHHHHhhcchhhheeeccccccccceeeccCCCceEEEeccccccccCCCcc-cccceeeeecc
Confidence 999 5789999996210 0 0 00 00000112222211 11 11112 23369999999
Q ss_pred cCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 441 HYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
|=|..||+||.|.|++.|.-.-.+..
T Consensus 360 PlGhil~tgsnd~t~rfw~r~rp~d~ 385 (464)
T KOG0284|consen 360 PLGHILATGSNDRTVRFWTRNRPGDK 385 (464)
T ss_pred ccceeEeecCCCcceeeeccCCCCCc
Confidence 99999999999999999987655543
No 34
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.69 E-value=4.3e-15 Score=173.55 Aligned_cols=243 Identities=16% Similarity=0.212 Sum_probs=179.1
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.+|...|.-+.|- +++++|+.|..++ ++|||+...+....++.+|...|.++.|.|.+ .
T Consensus 200 ~~h~~~v~~~~fs-------~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g-------------~ 259 (456)
T KOG0266|consen 200 SGHTRGVSDVAFS-------PDGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDG-------------N 259 (456)
T ss_pred cccccceeeeEEC-------CCCcEEEEecCCceEEEeeccCCCeEEEEecCCCCceEEEEecCCC-------------C
Confidence 4566666666664 3667888888876 99999966667888999999999999999875 2
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC--CeEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP--RIVAVG 208 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~--rlLAV~ 208 (1010)
+++. |+ .+++|||||+++++++..|+.|+ .|.+++|++ ++|+++
T Consensus 260 ~i~S--gs-------------------------------~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~ 306 (456)
T KOG0266|consen 260 LLVS--GS-------------------------------DDGTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSA 306 (456)
T ss_pred EEEE--ec-------------------------------CCCcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEc
Confidence 5553 21 25789999999999999999875 899999988 566665
Q ss_pred e-CCeEEEEECCCCcee--EEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 209 L-ATQIYCFDALTLENK--FSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 209 l-d~~I~IwD~~Tle~l--~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
. ++.|+|||+.+++.. .++.....+ . ++ +.+.|.
T Consensus 307 s~d~~i~vwd~~~~~~~~~~~~~~~~~~-------------~-~~----~~~~fs------------------------- 343 (456)
T KOG0266|consen 307 SYDGTIRVWDLETGSKLCLKLLSGAENS-------------A-PV----TSVQFS------------------------- 343 (456)
T ss_pred CCCccEEEEECCCCceeeeecccCCCCC-------------C-ce----eEEEEC-------------------------
Confidence 4 678999999998843 333322111 0 00 111121
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF 365 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~ 365 (1010)
|.+.. +..+..|+++++||+..+..+..+
T Consensus 344 --p~~~~-------------------------------------------------ll~~~~d~~~~~w~l~~~~~~~~~ 372 (456)
T KOG0266|consen 344 --PNGKY-------------------------------------------------LLSASLDRTLKLWDLRSGKSVGTY 372 (456)
T ss_pred --CCCcE-------------------------------------------------EEEecCCCeEEEEEccCCcceeee
Confidence 11111 123456889999999999999999
Q ss_pred ccCCCC---eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccC
Q 001814 366 KAHTSP---ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 366 ~aHtsp---IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
..|... +.+..+++.|+++++++.++ .|.+|++.+ + ..+.++ .|+..+.|..++|+|.
T Consensus 373 ~~~~~~~~~~~~~~~~~~~~~i~sg~~d~-~v~~~~~~s-------~----------~~~~~l-~~h~~~~~~~~~~~~~ 433 (456)
T KOG0266|consen 373 TGHSNLVRCIFSPTLSTGGKLIYSGSEDG-SVYVWDSSS-------G----------GILQRL-EGHSKAAVSDLSSHPT 433 (456)
T ss_pred cccCCcceeEecccccCCCCeEEEEeCCc-eEEEEeCCc-------c----------chhhhh-cCCCCCceeccccCCC
Confidence 999874 55566789999999999965 599999864 2 244444 3432467999999999
Q ss_pred CCEEEEEe--CCCeEEEEeCC
Q 001814 443 SQWIAIVS--SKGTCHVFVLS 461 (1010)
Q Consensus 443 g~~LAsgS--~dGTVhIw~I~ 461 (1010)
..++++++ .|+.+++|..+
T Consensus 434 ~~~~~s~s~~~d~~~~~w~~~ 454 (456)
T KOG0266|consen 434 ENLIASSSFEGDGLIRLWKYD 454 (456)
T ss_pred cCeeeecCcCCCceEEEecCC
Confidence 99999999 78999999864
No 35
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.69 E-value=4.5e-16 Score=164.41 Aligned_cols=198 Identities=22% Similarity=0.303 Sum_probs=151.9
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
-.||--|.-+.|++ +...|++|.... +||||++....-.+-+++|.+.|+.+.+.-. . .
T Consensus 97 f~hkhivk~~af~~-------ds~~lltgg~ekllrvfdln~p~App~E~~ghtg~Ir~v~wc~e-----------D--~ 156 (334)
T KOG0278|consen 97 FEHKHIVKAVAFSQ-------DSNYLLTGGQEKLLRVFDLNRPKAPPKEISGHTGGIRTVLWCHE-----------D--K 156 (334)
T ss_pred hhhhheeeeEEecc-------cchhhhccchHHHhhhhhccCCCCCchhhcCCCCcceeEEEecc-----------C--c
Confidence 45788899999987 456777777766 6999998666556667889999999887721 1 1
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEe
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGL 209 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~l 209 (1010)
.++. +. .+++||+||.++|+.+++|.|+++|.++..++ ++|.++.
T Consensus 157 ~iLS-Sa--------------------------------dd~tVRLWD~rTgt~v~sL~~~s~VtSlEvs~dG~ilTia~ 203 (334)
T KOG0278|consen 157 CILS-SA--------------------------------DDKTVRLWDHRTGTEVQSLEFNSPVTSLEVSQDGRILTIAY 203 (334)
T ss_pred eEEe-ec--------------------------------cCCceEEEEeccCcEEEEEecCCCCcceeeccCCCEEEEec
Confidence 2231 11 25899999999999999999999999988876 7889999
Q ss_pred CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 210 ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 210 d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
...|.+||+.+++.+.... .| .|+ ...+| +|.
T Consensus 204 gssV~Fwdaksf~~lKs~k---~P---------~nV--~SASL----------------------------------~P~ 235 (334)
T KOG0278|consen 204 GSSVKFWDAKSFGLLKSYK---MP---------CNV--ESASL----------------------------------HPK 235 (334)
T ss_pred CceeEEeccccccceeecc---Cc---------ccc--ccccc----------------------------------cCC
Confidence 9999999999988765443 21 000 00011 122
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe-ccC
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF-KAH 368 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~-~aH 368 (1010)
.+.+ ..++.|+.+..||..++..+..+ ++|
T Consensus 236 k~~f-------------------------------------------------VaGged~~~~kfDy~TgeEi~~~nkgh 266 (334)
T KOG0278|consen 236 KEFF-------------------------------------------------VAGGEDFKVYKFDYNTGEEIGSYNKGH 266 (334)
T ss_pred CceE-------------------------------------------------EecCcceEEEEEeccCCceeeecccCC
Confidence 1222 23578999999999999988876 899
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
-+||-||.|+|||.+-|++|+||+ ||+|++.+
T Consensus 267 ~gpVhcVrFSPdGE~yAsGSEDGT-irlWQt~~ 298 (334)
T KOG0278|consen 267 FGPVHCVRFSPDGELYASGSEDGT-IRLWQTTP 298 (334)
T ss_pred CCceEEEEECCCCceeeccCCCce-EEEEEecC
Confidence 999999999999999999999887 99999865
No 36
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.68 E-value=2.1e-15 Score=178.06 Aligned_cols=244 Identities=17% Similarity=0.220 Sum_probs=171.7
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
-+|.+.|.-++|. |++++|+.+..++ +|+|++.. ..+.-+.++|-.||..|+|.|.+-
T Consensus 448 ~GH~GPVyg~sFs-------Pd~rfLlScSED~svRLWsl~t-~s~~V~y~GH~~PVwdV~F~P~Gy------------- 506 (707)
T KOG0263|consen 448 YGHSGPVYGCSFS-------PDRRFLLSCSEDSSVRLWSLDT-WSCLVIYKGHLAPVWDVQFAPRGY------------- 506 (707)
T ss_pred ecCCCceeeeeec-------ccccceeeccCCcceeeeeccc-ceeEEEecCCCcceeeEEecCCce-------------
Confidence 3467778888885 5788999999865 89999964 456667789999999999998762
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVG 208 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~ 208 (1010)
+.|..+ .|+|-++|+.......+.+-.| +-|-+|+|.| .+++.|
T Consensus 507 YFatas---------------------------------~D~tArLWs~d~~~PlRifaghlsDV~cv~FHPNs~Y~aTG 553 (707)
T KOG0263|consen 507 YFATAS---------------------------------HDQTARLWSTDHNKPLRIFAGHLSDVDCVSFHPNSNYVATG 553 (707)
T ss_pred EEEecC---------------------------------CCceeeeeecccCCchhhhcccccccceEEECCcccccccC
Confidence 344311 1478899999887777777665 6899999988 588876
Q ss_pred eC-CeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 209 LA-TQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 209 ld-~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
.. .++++||+.++..+..+.+|..| ...++++| +
T Consensus 554 SsD~tVRlWDv~~G~~VRiF~GH~~~-------------V~al~~Sp-------~------------------------- 588 (707)
T KOG0263|consen 554 SSDRTVRLWDVSTGNSVRIFTGHKGP-------------VTALAFSP-------C------------------------- 588 (707)
T ss_pred CCCceEEEEEcCCCcEEEEecCCCCc-------------eEEEEEcC-------C-------------------------
Confidence 54 57999999999887777666553 12233322 1
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
| + .+++++.||.|.|||+.+++.+.++++
T Consensus 589 ---G--------------------------------------------r----~LaSg~ed~~I~iWDl~~~~~v~~l~~ 617 (707)
T KOG0263|consen 589 ---G--------------------------------------------R----YLASGDEDGLIKIWDLANGSLVKQLKG 617 (707)
T ss_pred ---C--------------------------------------------c----eEeecccCCcEEEEEcCCCcchhhhhc
Confidence 0 0 123567899999999999999999999
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCC-------ccccCCcceEEEEEecccccccEEEEEEc
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNH-------KYDWNSSHVHLYKLHRGITSATIQDICFS 440 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~-------~~~~~~s~~~L~~L~RG~t~a~I~sIAFS 440 (1010)
|++.|.+|.||.||+.||+++. |+.+++||+.........+.. .....++..++-.+.. +...|..|.|.
T Consensus 618 Ht~ti~SlsFS~dg~vLasgg~-DnsV~lWD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llgs~~t--K~tpv~~l~Ft 694 (707)
T KOG0263|consen 618 HTGTIYSLSFSRDGNVLASGGA-DNSVRLWDLTKVIELLNLGHISTSNSAITQENNASSLLLGSFYT--KNTPVVGLHFT 694 (707)
T ss_pred ccCceeEEEEecCCCEEEecCC-CCeEEEEEchhhcccccccccccccccccccCCCCcceeeeeee--cCceEEEEEEe
Confidence 9999999999999999999999 677999998642110000000 0001122234444432 22368888888
Q ss_pred cCCCEEEEE
Q 001814 441 HYSQWIAIV 449 (1010)
Q Consensus 441 pDg~~LAsg 449 (1010)
.-.-.||+|
T Consensus 695 rrNl~L~~g 703 (707)
T KOG0263|consen 695 RRNLLLAVG 703 (707)
T ss_pred ccceeEEec
Confidence 766555554
No 37
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.68 E-value=1.2e-15 Score=178.50 Aligned_cols=227 Identities=19% Similarity=0.270 Sum_probs=178.5
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
+..+++++|+++| ++|||+.. ..+.|+..-|+|.+..++.+|+... + +..+
T Consensus 422 pgd~~Iv~G~k~Gel~vfdlaS-~~l~Eti~AHdgaIWsi~~~pD~~g------------~-vT~s-------------- 473 (888)
T KOG0306|consen 422 PGDRYIVLGTKNGELQVFDLAS-ASLVETIRAHDGAIWSISLSPDNKG------------F-VTGS-------------- 473 (888)
T ss_pred CCCceEEEeccCCceEEEEeeh-hhhhhhhhccccceeeeeecCCCCc------------e-EEec--------------
Confidence 3567899999998 99999954 5667888899999999999987621 2 2211
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCC-----CeE--------EEEEeCCCcEEEEEEcC--CeEEEEe-CCeEEE
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQS-----HCY--------EHVLRFRSSVCMVRCSP--RIVAVGL-ATQIYC 215 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlkt-----ge~--------V~tL~f~S~V~sVa~S~--rlLAV~l-d~~I~I 215 (1010)
.+++|+|||++- |.. -.+|++...|++|++|| ++|||++ +.+++|
T Consensus 474 -------------------aDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkV 534 (888)
T KOG0306|consen 474 -------------------ADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKV 534 (888)
T ss_pred -------------------CCcEEEEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEE
Confidence 358999999852 221 14677889999999997 7999986 678999
Q ss_pred EECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEE
Q 001814 216 FDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVA 295 (1010)
Q Consensus 216 wD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa 295 (1010)
|=+.|++-..+|.+|..|. -.|.++ |++.
T Consensus 535 yflDtlKFflsLYGHkLPV-------------~smDIS----------------------------------~DSk---- 563 (888)
T KOG0306|consen 535 YFLDTLKFFLSLYGHKLPV-------------LSMDIS----------------------------------PDSK---- 563 (888)
T ss_pred EEecceeeeeeecccccce-------------eEEecc----------------------------------CCcC----
Confidence 9999999888888888762 122322 1111
Q ss_pred EeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEE
Q 001814 296 RYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISAL 375 (1010)
Q Consensus 296 ~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaL 375 (1010)
-+++++.|..|+||-+.=|.+-..|-||+..|.++
T Consensus 564 ---------------------------------------------livTgSADKnVKiWGLdFGDCHKS~fAHdDSvm~V 598 (888)
T KOG0306|consen 564 ---------------------------------------------LIVTGSADKNVKIWGLDFGDCHKSFFAHDDSVMSV 598 (888)
T ss_pred ---------------------------------------------eEEeccCCCceEEeccccchhhhhhhcccCceeEE
Confidence 12356789999999999999999999999999999
Q ss_pred EECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeE
Q 001814 376 CFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTC 455 (1010)
Q Consensus 376 aFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTV 455 (1010)
.|=|+..++.||+.|| .|+-||-.. ..++.+|. |++ ..||+++-+|+|.+++++|.|.+|
T Consensus 599 ~F~P~~~~FFt~gKD~-kvKqWDg~k-----------------Fe~iq~L~-~H~-~ev~cLav~~~G~~vvs~shD~sI 658 (888)
T KOG0306|consen 599 QFLPKTHLFFTCGKDG-KVKQWDGEK-----------------FEEIQKLD-GHH-SEVWCLAVSPNGSFVVSSSHDKSI 658 (888)
T ss_pred EEcccceeEEEecCcc-eEEeechhh-----------------hhhheeec-cch-heeeeeEEcCCCCeEEeccCCcee
Confidence 9999999999999966 599998642 24666774 544 579999999999999999999999
Q ss_pred EEEeCCC
Q 001814 456 HVFVLSP 462 (1010)
Q Consensus 456 hIw~I~~ 462 (1010)
++|.-..
T Consensus 659 RlwE~td 665 (888)
T KOG0306|consen 659 RLWERTD 665 (888)
T ss_pred EeeeccC
Confidence 9998654
No 38
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.67 E-value=6.1e-15 Score=166.86 Aligned_cols=235 Identities=20% Similarity=0.265 Sum_probs=168.1
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.--++.|.-+.|-+ ++++|++|...| +||||. ........+..|+.||..++|.|.. .
T Consensus 65 srFk~~v~s~~fR~-------DG~LlaaGD~sG~V~vfD~-k~r~iLR~~~ah~apv~~~~f~~~d-------------~ 123 (487)
T KOG0310|consen 65 SRFKDVVYSVDFRS-------DGRLLAAGDESGHVKVFDM-KSRVILRQLYAHQAPVHVTKFSPQD-------------N 123 (487)
T ss_pred HhhccceeEEEeec-------CCeEEEccCCcCcEEEecc-ccHHHHHHHhhccCceeEEEecccC-------------C
Confidence 44578888888875 688999999988 899996 4444556677899999999998754 2
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC---eEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR---IVAV 207 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r---lLAV 207 (1010)
.+++ +|++ +.++++||+.+.....++..| ..|++.++++. +++.
T Consensus 124 t~l~-s~sD-------------------------------d~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvt 171 (487)
T KOG0310|consen 124 TMLV-SGSD-------------------------------DKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVT 171 (487)
T ss_pred eEEE-ecCC-------------------------------CceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEe
Confidence 3443 3321 478999999998875567655 58999999883 6665
Q ss_pred -EeCCeEEEEECCCC-ceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 208 -GLATQIYCFDALTL-ENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 208 -~ld~~I~IwD~~Tl-e~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
+.|+.|++||+++. ....++. |..|. -..++++ +
T Consensus 172 GsYDg~vrl~DtR~~~~~v~eln-hg~pV------------e~vl~lp-------s------------------------ 207 (487)
T KOG0310|consen 172 GSYDGKVRLWDTRSLTSRVVELN-HGCPV------------ESVLALP-------S------------------------ 207 (487)
T ss_pred cCCCceEEEEEeccCCceeEEec-CCCce------------eeEEEcC-------C------------------------
Confidence 56789999999876 4444443 22210 0112221 1
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc-EEEE
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA-IISQ 364 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~-~v~~ 364 (1010)
|++++ ++ .-..|+|||+.+|. .+..
T Consensus 208 -----gs~ia------------------------------------------------sA-gGn~vkVWDl~~G~qll~~ 233 (487)
T KOG0310|consen 208 -----GSLIA------------------------------------------------SA-GGNSVKVWDLTTGGQLLTS 233 (487)
T ss_pred -----CCEEE------------------------------------------------Ec-CCCeEEEEEecCCceehhh
Confidence 11111 11 12369999999654 5555
Q ss_pred eccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCC
Q 001814 365 FKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQ 444 (1010)
Q Consensus 365 ~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~ 444 (1010)
+..|...|+||+|.-+++.|.++|.||+ ++||++.. .+.++.+. -++.|.+|+.|||++
T Consensus 234 ~~~H~KtVTcL~l~s~~~rLlS~sLD~~-VKVfd~t~-----------------~Kvv~s~~---~~~pvLsiavs~dd~ 292 (487)
T KOG0310|consen 234 MFNHNKTVTCLRLASDSTRLLSGSLDRH-VKVFDTTN-----------------YKVVHSWK---YPGPVLSIAVSPDDQ 292 (487)
T ss_pred hhcccceEEEEEeecCCceEeecccccc-eEEEEccc-----------------eEEEEeee---cccceeeEEecCCCc
Confidence 5559999999999999999999999887 89999742 13444442 246799999999999
Q ss_pred EEEEEeCCCeEEEEe
Q 001814 445 WIAIVSSKGTCHVFV 459 (1010)
Q Consensus 445 ~LAsgS~dGTVhIw~ 459 (1010)
.+++|-.+|.+-+=+
T Consensus 293 t~viGmsnGlv~~rr 307 (487)
T KOG0310|consen 293 TVVIGMSNGLVSIRR 307 (487)
T ss_pred eEEEecccceeeeeh
Confidence 999999999987653
No 39
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.67 E-value=1.2e-15 Score=169.65 Aligned_cols=235 Identities=16% Similarity=0.231 Sum_probs=173.1
Q ss_pred CcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEE
Q 001814 57 DQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVV 136 (1010)
Q Consensus 57 d~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvV 136 (1010)
..|+-..||. +.+.+|+.++++..++|+++ ...++.+|++|.+.|.++.+.-.. ..||
T Consensus 220 g~it~~d~d~------~~~~~iAas~d~~~r~Wnvd-~~r~~~TLsGHtdkVt~ak~~~~~---------------~~vV 277 (459)
T KOG0288|consen 220 GNITSIDFDS------DNKHVIAASNDKNLRLWNVD-SLRLRHTLSGHTDKVTAAKFKLSH---------------SRVV 277 (459)
T ss_pred CCcceeeecC------CCceEEeecCCCceeeeecc-chhhhhhhcccccceeeehhhccc---------------ccee
Confidence 3455556653 35789999999999999996 467889999999999999976211 2244
Q ss_pred ecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEE-eCCeEEE
Q 001814 137 AGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVG-LATQIYC 215 (1010)
Q Consensus 137 sgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~-ld~~I~I 215 (1010)
++. .++++++||+.+..|.+++-+-+.+.+|.++...++.+ .+++|++
T Consensus 278 sgs-------------------------------~DRtiK~WDl~k~~C~kt~l~~S~cnDI~~~~~~~~SgH~DkkvRf 326 (459)
T KOG0288|consen 278 SGS-------------------------------ADRTIKLWDLQKAYCSKTVLPGSQCNDIVCSISDVISGHFDKKVRF 326 (459)
T ss_pred ecc-------------------------------ccchhhhhhhhhhheeccccccccccceEecceeeeecccccceEE
Confidence 431 36899999999999999998889999999997666665 4678999
Q ss_pred EECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEE
Q 001814 216 FDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVA 295 (1010)
Q Consensus 216 wD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa 295 (1010)
||+++..+..++..... ...+.++ +++-
T Consensus 327 wD~Rs~~~~~sv~~gg~--------------vtSl~ls----------------------------------~~g~---- 354 (459)
T KOG0288|consen 327 WDIRSADKTRSVPLGGR--------------VTSLDLS----------------------------------MDGL---- 354 (459)
T ss_pred EeccCCceeeEeecCcc--------------eeeEeec----------------------------------cCCe----
Confidence 99998776655432110 0111111 0000
Q ss_pred EeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC----CCC
Q 001814 296 RYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH----TSP 371 (1010)
Q Consensus 296 ~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH----tsp 371 (1010)
.+.+...|.++.+.|+.+.++...|.|- .+.
T Consensus 355 ---------------------------------------------~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asD 389 (459)
T KOG0288|consen 355 ---------------------------------------------ELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASD 389 (459)
T ss_pred ---------------------------------------------EEeeecCCCceeeeecccccEEEEeeccccccccc
Confidence 0112356778999999998888887762 234
Q ss_pred eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeC
Q 001814 372 ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSS 451 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~ 451 (1010)
.+.+.|||+|.|+|+||.+|. +.||++.. | ++...+..-..++.|.+++|+|-|..|++++.
T Consensus 390 wtrvvfSpd~~YvaAGS~dgs-v~iW~v~t-------g----------KlE~~l~~s~s~~aI~s~~W~~sG~~Llsadk 451 (459)
T KOG0288|consen 390 WTRVVFSPDGSYVAAGSADGS-VYIWSVFT-------G----------KLEKVLSLSTSNAAITSLSWNPSGSGLLSADK 451 (459)
T ss_pred cceeEECCCCceeeeccCCCc-EEEEEccC-------c----------eEEEEeccCCCCcceEEEEEcCCCchhhcccC
Confidence 889999999999999999887 89999963 4 34455544334446999999999999999999
Q ss_pred CCeEEEEe
Q 001814 452 KGTCHVFV 459 (1010)
Q Consensus 452 dGTVhIw~ 459 (1010)
++.+.+|.
T Consensus 452 ~~~v~lW~ 459 (459)
T KOG0288|consen 452 QKAVTLWT 459 (459)
T ss_pred CcceEecC
Confidence 99999994
No 40
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.67 E-value=5.9e-15 Score=175.09 Aligned_cols=240 Identities=18% Similarity=0.187 Sum_probs=183.1
Q ss_pred CCCCCCCCcEEEEEEeeccCCCCCCCeEEEEEe-cCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccc
Q 001814 50 NASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGY-QNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRK 128 (1010)
Q Consensus 50 ~~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy-~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~ 128 (1010)
...++|++.|--..|.. ...+|+.|. +..++|||+ ..|.+..++.+|.+.|+++.+.+
T Consensus 243 ~~l~GH~g~V~~l~~~~-------~~~~lvsgS~D~t~rvWd~-~sg~C~~~l~gh~stv~~~~~~~------------- 301 (537)
T KOG0274|consen 243 TRLVGHFGGVWGLAFPS-------GGDKLVSGSTDKTERVWDC-STGECTHSLQGHTSSVRCLTIDP------------- 301 (537)
T ss_pred eeccCCCCCceeEEEec-------CCCEEEEEecCCcEEeEec-CCCcEEEEecCCCceEEEEEccC-------------
Confidence 34788888888777764 134677777 567999997 57899999999999999999773
Q ss_pred cCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe-CCCcEEEEEEcCCeEEE
Q 001814 129 LHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR-FRSSVCMVRCSPRIVAV 207 (1010)
Q Consensus 129 srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~-f~S~V~sVa~S~rlLAV 207 (1010)
.+.+ +| + .|++|++|++.++.+++++. +..+|++|.++..+|++
T Consensus 302 ---~~~~-sg---------------------s----------~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~~~~lvs 346 (537)
T KOG0274|consen 302 ---FLLV-SG---------------------S----------RDNTVKVWDVTNGACLNLLRGHTGPVNCVQLDEPLLVS 346 (537)
T ss_pred ---ceEe-ec---------------------c----------CCceEEEEeccCcceEEEeccccccEEEEEecCCEEEE
Confidence 2232 21 1 25899999999999999999 66799999999877765
Q ss_pred -EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 -GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 -~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
+.++.|.+||+.++++++++..|..+ .-.+. +.++.
T Consensus 347 gs~d~~v~VW~~~~~~cl~sl~gH~~~-------------V~sl~-------~~~~~----------------------- 383 (537)
T KOG0274|consen 347 GSYDGTVKVWDPRTGKCLKSLSGHTGR-------------VYSLI-------VDSEN----------------------- 383 (537)
T ss_pred EecCceEEEEEhhhceeeeeecCCcce-------------EEEEE-------ecCcc-----------------------
Confidence 45678999999999999999887552 01111 11100
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC-cEEEEe
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR-AIISQF 365 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~-~~v~~~ 365 (1010)
...+++.|+.|++||+.++ +++.++
T Consensus 384 ------------------------------------------------------~~~Sgs~D~~IkvWdl~~~~~c~~tl 409 (537)
T KOG0274|consen 384 ------------------------------------------------------RLLSGSLDTTIKVWDLRTKRKCIHTL 409 (537)
T ss_pred ------------------------------------------------------eEEeeeeccceEeecCCchhhhhhhh
Confidence 0113567899999999999 999999
Q ss_pred ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 366 KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 366 ~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
..|+.-+..|.+ .+++|++++.||+ |++||+.. + .+++++... +...|..+++. ...
T Consensus 410 ~~h~~~v~~l~~--~~~~Lvs~~aD~~-Ik~WD~~~-------~----------~~~~~~~~~-~~~~v~~l~~~--~~~ 466 (537)
T KOG0274|consen 410 QGHTSLVSSLLL--RDNFLVSSSADGT-IKLWDAEE-------G----------ECLRTLEGR-HVGGVSALALG--KEE 466 (537)
T ss_pred cCCccccccccc--ccceeEecccccc-EEEeeccc-------C----------ceeeeeccC-CcccEEEeecC--cce
Confidence 999999977665 5789999999875 99999864 3 366666432 33568888887 568
Q ss_pred EEEEeCCCeEEEEeCCCCCC
Q 001814 446 IAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 446 LAsgS~dGTVhIw~I~~~gg 465 (1010)
+++++.+|++++|++.....
T Consensus 467 il~s~~~~~~~l~dl~~~~~ 486 (537)
T KOG0274|consen 467 ILCSSDDGSVKLWDLRSGTL 486 (537)
T ss_pred EEEEecCCeeEEEecccCch
Confidence 88999999999999987544
No 41
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.66 E-value=1.6e-13 Score=145.66 Aligned_cols=262 Identities=14% Similarity=0.102 Sum_probs=157.7
Q ss_pred EEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccc
Q 001814 77 VLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGG 155 (1010)
Q Consensus 77 vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~ 155 (1010)
+++++.. +.+.+||+. .+++...+..+.+ +..+.+.|++ ..|+++..
T Consensus 3 ~~~s~~~d~~v~~~d~~-t~~~~~~~~~~~~-~~~l~~~~dg-------------~~l~~~~~----------------- 50 (300)
T TIGR03866 3 AYVSNEKDNTISVIDTA-TLEVTRTFPVGQR-PRGITLSKDG-------------KLLYVCAS----------------- 50 (300)
T ss_pred EEEEecCCCEEEEEECC-CCceEEEEECCCC-CCceEECCCC-------------CEEEEEEC-----------------
Confidence 5555555 459999995 4556666665543 5668887764 23443321
Q ss_pred cccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEE--eCCeEEEEECCCCceeEEEeecC
Q 001814 156 VRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVG--LATQIYCFDALTLENKFSVLTYP 231 (1010)
Q Consensus 156 vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~--ld~~I~IwD~~Tle~l~tL~t~p 231 (1010)
.+++|++||+++++.+..+..+..+..+.+++ +.|+++ .++.|++||+.+.+.+..+....
T Consensus 51 ---------------~~~~v~~~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~ 115 (300)
T TIGR03866 51 ---------------DSDTIQVIDLATGEVIGTLPSGPDPELFALHPNGKILYIANEDDNLVTVIDIETRKVLAEIPVGV 115 (300)
T ss_pred ---------------CCCeEEEEECCCCcEEEeccCCCCccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEeEeeCCC
Confidence 13679999999999988887665567778876 455543 35689999999877665554211
Q ss_pred CccccCCCccccccCccceEEcc--ceEEEccCC---eeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhh
Q 001814 232 VPQLAGQGAVGINVGYGPMAVGP--RWLAYASNT---LLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFA 306 (1010)
Q Consensus 232 ~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~~---~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la 306 (1010)
. ...++++| +++++.... +..|+... +.++..
T Consensus 116 ~--------------~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~-------------------~~~~~~---------- 152 (300)
T TIGR03866 116 E--------------PEGMAVSPDGKIVVNTSETTNMAHFIDTKT-------------------YEIVDN---------- 152 (300)
T ss_pred C--------------cceEEECCCCCEEEEEecCCCeEEEEeCCC-------------------CeEEEE----------
Confidence 1 12356665 566655431 22233100 000000
Q ss_pred cccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCC-----C--CeEEEEECC
Q 001814 307 AGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHT-----S--PISALCFDP 379 (1010)
Q Consensus 307 ~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHt-----s--pIsaLaFSP 379 (1010)
+ ..........++++.+.. .+.+..+|.|.+||+.+++.+..+..+. . ....++|+|
T Consensus 153 ------~-------~~~~~~~~~~~s~dg~~l---~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~ 216 (300)
T TIGR03866 153 ------V-------LVDQRPRFAEFTADGKEL---WVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTK 216 (300)
T ss_pred ------E-------EcCCCccEEEECCCCCEE---EEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECC
Confidence 0 000000011122222211 1234568999999999998777665332 1 123588999
Q ss_pred CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe-CCCeEEEE
Q 001814 380 SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS-SKGTCHVF 458 (1010)
Q Consensus 380 dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS-~dGTVhIw 458 (1010)
+|++++.+......|.|||+.. + +.+..+..| ..+.+++|+|||++|++++ .+|+|+||
T Consensus 217 dg~~~~~~~~~~~~i~v~d~~~-------~----------~~~~~~~~~---~~~~~~~~~~~g~~l~~~~~~~~~i~v~ 276 (300)
T TIGR03866 217 DGKTAFVALGPANRVAVVDAKT-------Y----------EVLDYLLVG---QRVWQLAFTPDEKYLLTTNGVSNDVSVI 276 (300)
T ss_pred CCCEEEEEcCCCCeEEEEECCC-------C----------cEEEEEEeC---CCcceEEECCCCCEEEEEcCCCCeEEEE
Confidence 9998665543344599999853 2 233333222 2588999999999999874 68999999
Q ss_pred eCCCCC
Q 001814 459 VLSPFG 464 (1010)
Q Consensus 459 ~I~~~g 464 (1010)
++....
T Consensus 277 d~~~~~ 282 (300)
T TIGR03866 277 DVAALK 282 (300)
T ss_pred ECCCCc
Confidence 998644
No 42
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.66 E-value=4.2e-16 Score=175.96 Aligned_cols=260 Identities=14% Similarity=0.120 Sum_probs=186.4
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
...+|++.|..+.|-.+ ...+|+.|..++ ++||++=+.+.+..++.+|..+|+.+.+.+.+..
T Consensus 209 ~~~gH~kgvsai~~fp~------~~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s~~g~~---------- 272 (503)
T KOG0282|consen 209 NLSGHTKGVSAIQWFPK------KGHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFNNCGTS---------- 272 (503)
T ss_pred eccCCccccchhhhccc------eeeEEEecCCCceEEEEEEecCcceehhhhcchhhhhhhhccccCCe----------
Confidence 36788999998888642 245666666655 9999997778899999999999999998876521
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC---C-eE
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP---R-IV 205 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~---r-lL 205 (1010)
++.+ +.++.|++||+++|+++..+.....+++|.+.| + +|
T Consensus 273 ---fLS~---------------------------------sfD~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd~~n~fl 316 (503)
T KOG0282|consen 273 ---FLSA---------------------------------SFDRFLKLWDTETGQVLSRFHLDKVPTCVKFHPDNQNIFL 316 (503)
T ss_pred ---eeee---------------------------------ecceeeeeeccccceEEEEEecCCCceeeecCCCCCcEEE
Confidence 2321 146899999999999999888888999999987 3 45
Q ss_pred EEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 206 AVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 206 AV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
|.+++++|+.||+++.+.++....+-.+ + ..+. |
T Consensus 317 ~G~sd~ki~~wDiRs~kvvqeYd~hLg~---------i----~~i~-------F-------------------------- 350 (503)
T KOG0282|consen 317 VGGSDKKIRQWDIRSGKVVQEYDRHLGA---------I----LDIT-------F-------------------------- 350 (503)
T ss_pred EecCCCcEEEEeccchHHHHHHHhhhhh---------e----eeeE-------E--------------------------
Confidence 5567789999999998865544322110 0 0000 0
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF 365 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~ 365 (1010)
.+.+ ..+++...|+.|+||+......+..+
T Consensus 351 -----------------------------------~~~g---------------~rFissSDdks~riWe~~~~v~ik~i 380 (503)
T KOG0282|consen 351 -----------------------------------VDEG---------------RRFISSSDDKSVRIWENRIPVPIKNI 380 (503)
T ss_pred -----------------------------------ccCC---------------ceEeeeccCccEEEEEcCCCccchhh
Confidence 0110 01234467889999999887665444
Q ss_pred c-cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccc-cEEEEEEccCC
Q 001814 366 K-AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSA-TIQDICFSHYS 443 (1010)
Q Consensus 366 ~-aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a-~I~sIAFSpDg 443 (1010)
. .+.....+|+..|+|..++.-|. |..|-||.+.+.+. ...++-.+|+..+ --..+.|||||
T Consensus 381 ~~~~~hsmP~~~~~P~~~~~~aQs~-dN~i~ifs~~~~~r---------------~nkkK~feGh~vaGys~~v~fSpDG 444 (503)
T KOG0282|consen 381 ADPEMHTMPCLTLHPNGKWFAAQSM-DNYIAIFSTVPPFR---------------LNKKKRFEGHSVAGYSCQVDFSPDG 444 (503)
T ss_pred cchhhccCcceecCCCCCeehhhcc-CceEEEEecccccc---------------cCHhhhhcceeccCceeeEEEcCCC
Confidence 3 33445668999999999999898 67799999865321 1112222454433 24568999999
Q ss_pred CEEEEEeCCCeEEEEeCCCCCCccccccccC
Q 001814 444 QWIAIVSSKGTCHVFVLSPFGGDSGFQTLSS 474 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s 474 (1010)
++|++|+.+|.+.+|+..+-+....+++|+.
T Consensus 445 ~~l~SGdsdG~v~~wdwkt~kl~~~lkah~~ 475 (503)
T KOG0282|consen 445 RTLCSGDSDGKVNFWDWKTTKLVSKLKAHDQ 475 (503)
T ss_pred CeEEeecCCccEEEeechhhhhhhccccCCc
Confidence 9999999999999999999887778888864
No 43
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.66 E-value=2.5e-14 Score=154.05 Aligned_cols=224 Identities=14% Similarity=0.141 Sum_probs=170.2
Q ss_pred CCCeEEEEEecC-cEEEEEccCC--C---cceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCC
Q 001814 73 VFKQVLLLGYQN-GFQVLDVEDA--S---NFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAP 146 (1010)
Q Consensus 73 ~~~~vLalGy~~-G~qVWDv~~~--g---~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~ 146 (1010)
|.++.++.|.-+ -.-||++... . .+...|.+|.|-+++.+|+++. .++. + +|
T Consensus 107 PSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD~--------------~ilT--~-----SG- 164 (343)
T KOG0286|consen 107 PSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDDN--------------HILT--G-----SG- 164 (343)
T ss_pred CCCCeEEecCcCceeEEEecccccccccceeeeeecCccceeEEEEEcCCC--------------ceEe--c-----CC-
Confidence 457777777754 5789999633 1 3445578899999999999764 2332 1 12
Q ss_pred CCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC---CeEE-EEeCCeEEEEECCCC
Q 001814 147 GQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP---RIVA-VGLATQIYCFDALTL 221 (1010)
Q Consensus 147 ~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~---rlLA-V~ld~~I~IwD~~Tl 221 (1010)
+.|..+||+++|+.+..+..| .-|.+++++| +.++ .+.|...++||++..
T Consensus 165 -------------------------D~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~ 219 (343)
T KOG0286|consen 165 -------------------------DMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSG 219 (343)
T ss_pred -------------------------CceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCc
Confidence 478999999999999999876 4899999988 4555 466788999999999
Q ss_pred ceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhh
Q 001814 222 ENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEH 301 (1010)
Q Consensus 222 e~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~ds 301 (1010)
.+.+++.+|..- +| .+.| .|++.
T Consensus 220 ~c~qtF~ghesD---------IN----sv~f----------------------------------fP~G~---------- 242 (343)
T KOG0286|consen 220 QCVQTFEGHESD---------IN----SVRF----------------------------------FPSGD---------- 242 (343)
T ss_pred ceeEeecccccc---------cc----eEEE----------------------------------ccCCC----------
Confidence 999988877551 21 2222 11111
Q ss_pred hhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC--CCCeEEEEECC
Q 001814 302 SKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH--TSPISALCFDP 379 (1010)
Q Consensus 302 sk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH--tspIsaLaFSP 379 (1010)
.+++++.|++.++||+...+.++.+..- ..+|++++||.
T Consensus 243 ---------------------------------------afatGSDD~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~ 283 (343)
T KOG0286|consen 243 ---------------------------------------AFATGSDDATCRLYDLRADQELAVYSHDSIICGITSVAFSK 283 (343)
T ss_pred ---------------------------------------eeeecCCCceeEEEeecCCcEEeeeccCcccCCceeEEEcc
Confidence 1346788999999999999999888732 45999999999
Q ss_pred CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 380 SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 380 dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
+|+||..+-.| .+++|||... | .++-.| .||. .+|.+|..+|||.-||+||=|.|++||.
T Consensus 284 SGRlLfagy~d-~~c~vWDtlk-------~----------e~vg~L-~GHe-NRvScl~~s~DG~av~TgSWDs~lriW~ 343 (343)
T KOG0286|consen 284 SGRLLFAGYDD-FTCNVWDTLK-------G----------ERVGVL-AGHE-NRVSCLGVSPDGMAVATGSWDSTLRIWA 343 (343)
T ss_pred cccEEEeeecC-CceeEeeccc-------c----------ceEEEe-eccC-CeeEEEEECCCCcEEEecchhHheeecC
Confidence 99999988664 4599999864 3 355555 4654 5799999999999999999999999994
No 44
>PTZ00421 coronin; Provisional
Probab=99.65 E-value=7.3e-14 Score=164.78 Aligned_cols=221 Identities=14% Similarity=0.133 Sum_probs=150.6
Q ss_pred CcEEEEEccCCCcceE---eeeeccCCEEEEEEec-CCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccC
Q 001814 84 NGFQVLDVEDASNFNE---LVSKRDGPVSFLQMQP-FPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDG 159 (1010)
Q Consensus 84 ~G~qVWDv~~~g~v~e---llS~hdGpV~~v~~lP-~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~g 159 (1010)
+|+.|+.++..|.+.. ++.+|.++|..++|.| ++ .+||..+
T Consensus 51 gg~~v~~~~~~G~~~~~~~~l~GH~~~V~~v~fsP~d~-------------~~LaSgS---------------------- 95 (493)
T PTZ00421 51 GSTAVLKHTDYGKLASNPPILLGQEGPIIDVAFNPFDP-------------QKLFTAS---------------------- 95 (493)
T ss_pred CceEEeeccccccCCCCCceEeCCCCCEEEEEEcCCCC-------------CEEEEEe----------------------
Confidence 3444444444454433 5778999999999997 33 2555421
Q ss_pred CcCCCCCCCCCCCCEEEEEeCCCC-------eEEEEEeCC-CcEEEEEEcC---CeEEEE-eCCeEEEEECCCCceeEEE
Q 001814 160 MMDSQSGNCVNSPTAVRFYSFQSH-------CYEHVLRFR-SSVCMVRCSP---RIVAVG-LATQIYCFDALTLENKFSV 227 (1010)
Q Consensus 160 s~d~~~~~~~~sp~tVrIWDlktg-------e~V~tL~f~-S~V~sVa~S~---rlLAV~-ld~~I~IwD~~Tle~l~tL 227 (1010)
.+++|++||+.++ +.+.+|..| ..|..|+|+| ++|+++ .++.|+|||+.+.+.+.++
T Consensus 96 -----------~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l 164 (493)
T PTZ00421 96 -----------EDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVI 164 (493)
T ss_pred -----------CCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEE
Confidence 2478999999765 346677655 5899999987 367664 5678999999998877766
Q ss_pred eecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhc
Q 001814 228 LTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAA 307 (1010)
Q Consensus 228 ~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~ 307 (1010)
..|..+ ...++ +.. ++.
T Consensus 165 ~~h~~~-------------V~sla-------~sp---------------------------dG~---------------- 181 (493)
T PTZ00421 165 KCHSDQ-------------ITSLE-------WNL---------------------------DGS---------------- 181 (493)
T ss_pred cCCCCc-------------eEEEE-------EEC---------------------------CCC----------------
Confidence 655431 11122 211 101
Q ss_pred ccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCe-EEEEECCCCCEEEE
Q 001814 308 GLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPI-SALCFDPSGTLLVT 386 (1010)
Q Consensus 308 Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspI-saLaFSPdGtlLAT 386 (1010)
.+++++.||+|+|||+.+++.+..+.+|...+ ..+.|.+++.+|+|
T Consensus 182 ---------------------------------lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt 228 (493)
T PTZ00421 182 ---------------------------------LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIIT 228 (493)
T ss_pred ---------------------------------EEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcCCCCeEEE
Confidence 12345689999999999999999999998764 46789999888887
Q ss_pred EEc---CCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe-CCCeEEEEeCCC
Q 001814 387 ASV---YGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS-SKGTCHVFVLSP 462 (1010)
Q Consensus 387 AS~---dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS-~dGTVhIw~I~~ 462 (1010)
++. .++.|+|||+... . ..+... .......+....|++|+++|++++ .|++|++|++..
T Consensus 229 ~G~s~s~Dr~VklWDlr~~------~----------~p~~~~-~~d~~~~~~~~~~d~d~~~L~lggkgDg~Iriwdl~~ 291 (493)
T PTZ00421 229 LGCSKSQQRQIMLWDTRKM------A----------SPYSTV-DLDQSSALFIPFFDEDTNLLYIGSKGEGNIRCFELMN 291 (493)
T ss_pred EecCCCCCCeEEEEeCCCC------C----------CceeEe-ccCCCCceEEEEEcCCCCEEEEEEeCCCeEEEEEeeC
Confidence 653 2356999998531 1 122222 112223566778999999999988 499999999975
Q ss_pred C
Q 001814 463 F 463 (1010)
Q Consensus 463 ~ 463 (1010)
.
T Consensus 292 ~ 292 (493)
T PTZ00421 292 E 292 (493)
T ss_pred C
Confidence 3
No 45
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.65 E-value=1.4e-15 Score=167.00 Aligned_cols=273 Identities=15% Similarity=0.189 Sum_probs=182.4
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEe-cCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCC----
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGY-QNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEG---- 125 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy-~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~---- 125 (1010)
...+|+|.|....-+. ... ..+++|. ++.++|||++ .-++..++.-|+|.|+.|.+.-...+.-+.|.
T Consensus 61 ~L~gHrdGV~~lakhp-----~~l-s~~aSGs~DG~VkiWnls-qR~~~~~f~AH~G~V~Gi~v~~~~~~tvgdDKtvK~ 133 (433)
T KOG0268|consen 61 SLDGHRDGVSCLAKHP-----NKL-STVASGSCDGEVKIWNLS-QRECIRTFKAHEGLVRGICVTQTSFFTVGDDKTVKQ 133 (433)
T ss_pred hccccccccchhhcCc-----chh-hhhhccccCceEEEEehh-hhhhhheeecccCceeeEEecccceEEecCCcceee
Confidence 3578999988665442 211 3455555 4559999995 46678889999999999998753322222222
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCcccc---ccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEc
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGV---RDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCS 201 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~v---r~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S 201 (1010)
+.-..|.+-+..|+ +.+.++ +..+.- +.....|-|||..-...+..+... ..|.+|.||
T Consensus 134 wk~~~~p~~tilg~-----------s~~~gIdh~~~~~~F------aTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfN 196 (433)
T KOG0268|consen 134 WKIDGPPLHTILGK-----------SVYLGIDHHRKNSVF------ATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFN 196 (433)
T ss_pred eeccCCcceeeecc-----------ccccccccccccccc------cccCceeeecccccCCccceeecCCCceeEEecC
Confidence 22111221111111 111122 222111 112356999999998999999876 479999999
Q ss_pred C---CeEEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcC
Q 001814 202 P---RIVAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNL 277 (1010)
Q Consensus 202 ~---rlLAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~l 277 (1010)
| .+|++| .+..|.+||+++...++.+..-.. .+.+++.|
T Consensus 197 pvETsILas~~sDrsIvLyD~R~~~Pl~KVi~~mR--------------TN~IswnP----------------------- 239 (433)
T KOG0268|consen 197 PVETSILASCASDRSIVLYDLRQASPLKKVILTMR--------------TNTICWNP----------------------- 239 (433)
T ss_pred CCcchheeeeccCCceEEEecccCCccceeeeecc--------------ccceecCc-----------------------
Confidence 9 578876 677899999998877665542111 11222221
Q ss_pred CCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECC
Q 001814 278 TPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFV 357 (1010)
Q Consensus 278 t~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~ 357 (1010)
-+..+..++.|..+..||+.
T Consensus 240 ------------------------------------------------------------eafnF~~a~ED~nlY~~DmR 259 (433)
T KOG0268|consen 240 ------------------------------------------------------------EAFNFVAANEDHNLYTYDMR 259 (433)
T ss_pred ------------------------------------------------------------cccceeeccccccceehhhh
Confidence 01122345788889999998
Q ss_pred CCc-EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEE
Q 001814 358 TRA-IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQD 436 (1010)
Q Consensus 358 s~~-~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~s 436 (1010)
... .+...+.|.+.|..+.|||.|+-+||||- +..||||.+.. | ..+-+|--+|= ..|.+
T Consensus 260 ~l~~p~~v~~dhvsAV~dVdfsptG~Efvsgsy-DksIRIf~~~~-------~--------~SRdiYhtkRM---q~V~~ 320 (433)
T KOG0268|consen 260 NLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSY-DKSIRIFPVNH-------G--------HSRDIYHTKRM---QHVFC 320 (433)
T ss_pred hhcccchhhcccceeEEEeccCCCcchhccccc-cceEEEeecCC-------C--------cchhhhhHhhh---heeeE
Confidence 764 66788899999999999999999999999 56699999853 2 11334433331 35899
Q ss_pred EEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 437 ICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 437 IAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.||.|++||.+||+|+.|++|.-...
T Consensus 321 Vk~S~Dskyi~SGSdd~nvRlWka~As 347 (433)
T KOG0268|consen 321 VKYSMDSKYIISGSDDGNVRLWKAKAS 347 (433)
T ss_pred EEEeccccEEEecCCCcceeeeecchh
Confidence 999999999999999999999987643
No 46
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.65 E-value=4.4e-14 Score=150.70 Aligned_cols=258 Identities=14% Similarity=0.121 Sum_probs=173.9
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.+|--.++.++|++ .+-+|..+.. ...-||-- .+|+..-.+.+|.|.|.++.+.-+ ++
T Consensus 7 ~GHERplTqiKyN~-------eGDLlFscaKD~~~~vw~s-~nGerlGty~GHtGavW~~Did~~-----------s~-- 65 (327)
T KOG0643|consen 7 QGHERPLTQIKYNR-------EGDLLFSCAKDSTPTVWYS-LNGERLGTYDGHTGAVWCCDIDWD-----------SK-- 65 (327)
T ss_pred ccCccccceEEecC-------CCcEEEEecCCCCceEEEe-cCCceeeeecCCCceEEEEEecCC-----------cc--
Confidence 45666788888986 3345555555 55899986 357777888999999999998732 22
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEe
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGL 209 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~l 209 (1010)
.|+ +|. .+.++++||.++|+++.++++.++|+.+.|+. .+++++.
T Consensus 66 ~li--TGS-------------------------------AD~t~kLWDv~tGk~la~~k~~~~Vk~~~F~~~gn~~l~~t 112 (327)
T KOG0643|consen 66 HLI--TGS-------------------------------ADQTAKLWDVETGKQLATWKTNSPVKRVDFSFGGNLILAST 112 (327)
T ss_pred eee--ecc-------------------------------ccceeEEEEcCCCcEEEEeecCCeeEEEeeccCCcEEEEEe
Confidence 223 321 25789999999999999999999999999987 5666666
Q ss_pred CCe------EEEEECCCCce-------eEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCc
Q 001814 210 ATQ------IYCFDALTLEN-------KFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQN 276 (1010)
Q Consensus 210 d~~------I~IwD~~Tle~-------l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~ 276 (1010)
+++ |.+||++.... ...+.+.... -+..+|.
T Consensus 113 D~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~sk----------------------------it~a~Wg-------- 156 (327)
T KOG0643|consen 113 DKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSK----------------------------ITSALWG-------- 156 (327)
T ss_pred hhhcCcceEEEEEEccCChhhhcccCceEEecCCccc----------------------------eeeeeec--------
Confidence 543 88998874321 1111111000 0011121
Q ss_pred CCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEEC
Q 001814 277 LTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDF 356 (1010)
Q Consensus 277 lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl 356 (1010)
+..-.++.+..+|.|.+||+
T Consensus 157 ------------------------------------------------------------~l~~~ii~Ghe~G~is~~da 176 (327)
T KOG0643|consen 157 ------------------------------------------------------------PLGETIIAGHEDGSISIYDA 176 (327)
T ss_pred ------------------------------------------------------------ccCCEEEEecCCCcEEEEEc
Confidence 11112456789999999999
Q ss_pred CCCc-EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc---------cC-------------CCCCCcc
Q 001814 357 VTRA-IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM---------RS-------------GSGNHKY 413 (1010)
Q Consensus 357 ~s~~-~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~---------~~-------------~sG~~~~ 413 (1010)
.++. .+...+-|.+.|+.|+|+||.++++|+|. +++-++||..+... .+ +.|...+
T Consensus 177 ~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~-Dttakl~D~~tl~v~Kty~te~PvN~aaisP~~d~VilgGGqeA~ 255 (327)
T KOG0643|consen 177 RTGKELVDSDEEHSSKINDLQFSRDRTYFITGSK-DTTAKLVDVRTLEVLKTYTTERPVNTAAISPLLDHVILGGGQEAM 255 (327)
T ss_pred ccCceeeechhhhccccccccccCCcceEEeccc-CccceeeeccceeeEEEeeecccccceecccccceEEecCCceee
Confidence 9974 56667889999999999999999999999 56689999864210 00 0122222
Q ss_pred ccCCcceEEEEE---------------ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 414 DWNSSHVHLYKL---------------HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 414 ~~~~s~~~L~~L---------------~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+.+......-+| ..| |-..|.++||+|||+-.++|+.||.|+|...++
T Consensus 256 dVTTT~~r~GKFEArFyh~i~eEEigrvkG-HFGPINsvAfhPdGksYsSGGEDG~VR~h~Fd~ 318 (327)
T KOG0643|consen 256 DVTTTSTRAGKFEARFYHLIFEEEIGRVKG-HFGPINSVAFHPDGKSYSSGGEDGYVRLHHFDS 318 (327)
T ss_pred eeeeecccccchhhhHHHHHHHHHhccccc-cccCcceeEECCCCcccccCCCCceEEEEEecc
Confidence 222211111111 024 345799999999999999999999999876553
No 47
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.63 E-value=6.6e-15 Score=161.48 Aligned_cols=216 Identities=16% Similarity=0.213 Sum_probs=158.2
Q ss_pred CcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCE
Q 001814 95 SNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTA 174 (1010)
Q Consensus 95 g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~t 174 (1010)
-++..++++|-|+|++|++-|-. ..++ +| + .+++
T Consensus 141 wKl~rVi~gHlgWVr~vavdP~n-------------~wf~--tg---------------------s----------~Drt 174 (460)
T KOG0285|consen 141 WKLYRVISGHLGWVRSVAVDPGN-------------EWFA--TG---------------------S----------ADRT 174 (460)
T ss_pred ceehhhhhhccceEEEEeeCCCc-------------eeEE--ec---------------------C----------CCce
Confidence 34567888999999999998743 1233 22 1 2589
Q ss_pred EEEEeCCCCeEEEEEeCC-CcEEEEEEcCC---eEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccce
Q 001814 175 VRFYSFQSHCYEHVLRFR-SSVCMVRCSPR---IVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPM 250 (1010)
Q Consensus 175 VrIWDlktge~V~tL~f~-S~V~sVa~S~r---lLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gpl 250 (1010)
++|||+.+|+...+|..+ ..|..|+++++ ++.++.+++|+|||+...+.+....+|-+ +..
T Consensus 175 ikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS---------------~V~ 239 (460)
T KOG0285|consen 175 IKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLS---------------GVY 239 (460)
T ss_pred eEEEEcccCeEEEeecchhheeeeeeecccCceEEEecCCCeeEEEechhhhhHHHhccccc---------------eeE
Confidence 999999999999999855 79999999974 55567778999999988765544443322 001
Q ss_pred EEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCcc
Q 001814 251 AVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVS 330 (1010)
Q Consensus 251 AlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S 330 (1010)
+| .++|.-.
T Consensus 240 ~L------------------------------------------------------------------~lhPTld----- 248 (460)
T KOG0285|consen 240 CL------------------------------------------------------------------DLHPTLD----- 248 (460)
T ss_pred EE------------------------------------------------------------------eccccce-----
Confidence 11 1111100
Q ss_pred CCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCC
Q 001814 331 PNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGN 410 (1010)
Q Consensus 331 ~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~ 410 (1010)
.+.+++.|.+++|||+.++..+..|.+|+.+|..+.|.|-.-.++|+|.|++ ||+||+.. |
T Consensus 249 ----------vl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D~t-vrlWDl~a-------g- 309 (460)
T KOG0285|consen 249 ----------VLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASVMCQPTDPQVITGSHDST-VRLWDLRA-------G- 309 (460)
T ss_pred ----------eEEecCCcceEEEeeecccceEEEecCCCCcceeEEeecCCCceEEecCCce-EEEeeecc-------C-
Confidence 1235678999999999999999999999999999999998888999999765 99999963 4
Q ss_pred CccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccccccc
Q 001814 411 HKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLS 473 (1010)
Q Consensus 411 ~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~ 473 (1010)
+-+..+. ++...|.+++..|.-..+|++|.| .++-|++....-..++..|+
T Consensus 310 ---------kt~~tlt--~hkksvral~lhP~e~~fASas~d-nik~w~~p~g~f~~nlsgh~ 360 (460)
T KOG0285|consen 310 ---------KTMITLT--HHKKSVRALCLHPKENLFASASPD-NIKQWKLPEGEFLQNLSGHN 360 (460)
T ss_pred ---------ceeEeee--cccceeeEEecCCchhhhhccCCc-cceeccCCccchhhcccccc
Confidence 2334442 234469999999999999999887 57889987654444555554
No 48
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.63 E-value=1.8e-14 Score=170.96 Aligned_cols=229 Identities=14% Similarity=0.117 Sum_probs=168.6
Q ss_pred eEEEEEecCcEEEEEccCCCcceEe-eeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCcc
Q 001814 76 QVLLLGYQNGFQVLDVEDASNFNEL-VSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLG 154 (1010)
Q Consensus 76 ~vLalGy~~G~qVWDv~~~g~v~el-lS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~ 154 (1010)
.++...++..+++||.. .+.+... +.+|.|.|..++|.- -.++|.. |.
T Consensus 220 ~~~~~s~~~tl~~~~~~-~~~~i~~~l~GH~g~V~~l~~~~-------------~~~~lvs--gS--------------- 268 (537)
T KOG0274|consen 220 FFKSGSDDSTLHLWDLN-NGYLILTRLVGHFGGVWGLAFPS-------------GGDKLVS--GS--------------- 268 (537)
T ss_pred eEEecCCCceeEEeecc-cceEEEeeccCCCCCceeEEEec-------------CCCEEEE--Ee---------------
Confidence 34444444558899995 4556666 889999999999761 1134443 21
Q ss_pred ccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCCeEEE-EeCCeEEEEECCCCceeEEEeecCC
Q 001814 155 GVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPRIVAV-GLATQIYCFDALTLENKFSVLTYPV 232 (1010)
Q Consensus 155 ~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~ 232 (1010)
.+.++|+||..+|+|++++..+ +.|+.+..-+..++. +.|.+|++||+.++.++.++..|..
T Consensus 269 ----------------~D~t~rvWd~~sg~C~~~l~gh~stv~~~~~~~~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~ 332 (537)
T KOG0274|consen 269 ----------------TDKTERVWDCSTGECTHSLQGHTSSVRCLTIDPFLLVSGSRDNTVKVWDVTNGACLNLLRGHTG 332 (537)
T ss_pred ----------------cCCcEEeEecCCCcEEEEecCCCceEEEEEccCceEeeccCCceEEEEeccCcceEEEeccccc
Confidence 2589999999999999999966 578888887766665 4678999999999998888876543
Q ss_pred ccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhccccee
Q 001814 233 PQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKT 312 (1010)
Q Consensus 233 p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~kt 312 (1010)
+ | +.+.+..+
T Consensus 333 ~---------V-----------~~v~~~~~-------------------------------------------------- 342 (537)
T KOG0274|consen 333 P---------V-----------NCVQLDEP-------------------------------------------------- 342 (537)
T ss_pred c---------E-----------EEEEecCC--------------------------------------------------
Confidence 2 0 11111110
Q ss_pred eccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC
Q 001814 313 LSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN 392 (1010)
Q Consensus 313 ls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt 392 (1010)
.+.++..||+|.|||+.++++++.+.+|+..|.+|.|++. ..+.++|.|+
T Consensus 343 ----------------------------~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~V~sl~~~~~-~~~~Sgs~D~- 392 (537)
T KOG0274|consen 343 ----------------------------LLVSGSYDGTVKVWDPRTGKCLKSLSGHTGRVYSLIVDSE-NRLLSGSLDT- 392 (537)
T ss_pred ----------------------------EEEEEecCceEEEEEhhhceeeeeecCCcceEEEEEecCc-ceEEeeeecc-
Confidence 0124578999999999999999999999999999999876 8999999974
Q ss_pred eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccccc
Q 001814 393 NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQT 471 (1010)
Q Consensus 393 ~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~ 471 (1010)
.|++||+.. .. ++++.| .|+ .+.|.++.+ .+++|.+++.|++|++||++..+....+.+
T Consensus 393 ~IkvWdl~~-------~~---------~c~~tl-~~h-~~~v~~l~~--~~~~Lvs~~aD~~Ik~WD~~~~~~~~~~~~ 451 (537)
T KOG0274|consen 393 TIKVWDLRT-------KR---------KCIHTL-QGH-TSLVSSLLL--RDNFLVSSSADGTIKLWDAEEGECLRTLEG 451 (537)
T ss_pred ceEeecCCc-------hh---------hhhhhh-cCC-ccccccccc--ccceeEeccccccEEEeecccCceeeeecc
Confidence 599999953 10 355555 343 355665554 578999999999999999998776655443
No 49
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.63 E-value=1.9e-14 Score=165.99 Aligned_cols=223 Identities=15% Similarity=0.250 Sum_probs=171.2
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
.++-+++|.++. |+||+.+. .+-...+.-|..-++++++-|.- |+++. +.
T Consensus 66 RknWiv~GsDD~~IrVfnynt-~ekV~~FeAH~DyIR~iavHPt~-------------P~vLt-sS-------------- 116 (794)
T KOG0276|consen 66 RKNWIVTGSDDMQIRVFNYNT-GEKVKTFEAHSDYIRSIAVHPTL-------------PYVLT-SS-------------- 116 (794)
T ss_pred ccceEEEecCCceEEEEeccc-ceeeEEeeccccceeeeeecCCC-------------CeEEe-cC--------------
Confidence 356899999987 89999964 44556778899999999998753 77664 11
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCC-eEEEEEeCCC-cEEEEEEcCC----eEEEEeCCeEEEEECCCCceeEE
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSH-CYEHVLRFRS-SVCMVRCSPR----IVAVGLATQIYCFDALTLENKFS 226 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktg-e~V~tL~f~S-~V~sVa~S~r----lLAV~ld~~I~IwD~~Tle~l~t 226 (1010)
-+-+|++||.+.+ .|..+++.|+ -|..|+|+|+ +...++|.+|+||.+....+.+|
T Consensus 117 ------------------DDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrTVKVWslgs~~~nfT 178 (794)
T KOG0276|consen 117 ------------------DDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASASLDRTVKVWSLGSPHPNFT 178 (794)
T ss_pred ------------------CccEEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeeeeccccEEEEEcCCCCCcee
Confidence 1468999999754 5777888776 7999999993 55567899999999999999999
Q ss_pred EeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhh
Q 001814 227 VLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFA 306 (1010)
Q Consensus 227 L~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la 306 (1010)
|..|.. ++| .+.+ |++..
T Consensus 179 l~gHek---------GVN----~Vdy------y~~gd------------------------------------------- 196 (794)
T KOG0276|consen 179 LEGHEK---------GVN----CVDY------YTGGD------------------------------------------- 196 (794)
T ss_pred eecccc---------Ccc----eEEe------ccCCC-------------------------------------------
Confidence 988765 122 2221 11100
Q ss_pred cccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEE
Q 001814 307 AGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVT 386 (1010)
Q Consensus 307 ~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLAT 386 (1010)
|+ -++++..|.+|+|||.+++.++++|.+|++.|++++|.|.=.+++|
T Consensus 197 -----------------------------kp---ylIsgaDD~tiKvWDyQtk~CV~TLeGHt~Nvs~v~fhp~lpiiis 244 (794)
T KOG0276|consen 197 -----------------------------KP---YLISGADDLTIKVWDYQTKSCVQTLEGHTNNVSFVFFHPELPIIIS 244 (794)
T ss_pred -----------------------------cc---eEEecCCCceEEEeecchHHHHHHhhcccccceEEEecCCCcEEEE
Confidence 00 1235678999999999999999999999999999999999999999
Q ss_pred EEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEE
Q 001814 387 ASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHV 457 (1010)
Q Consensus 387 AS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhI 457 (1010)
+|+||+ +|||+..+. ++...|--|. .+||+|+--+.++.+|+|.+.|.|.|
T Consensus 245 gsEDGT-vriWhs~Ty-----------------~lE~tLn~gl--eRvW~I~~~k~~~~i~vG~Deg~i~v 295 (794)
T KOG0276|consen 245 GSEDGT-VRIWNSKTY-----------------KLEKTLNYGL--ERVWCIAAHKGDGKIAVGFDEGSVTV 295 (794)
T ss_pred ecCCcc-EEEecCcce-----------------ehhhhhhcCC--ceEEEEeecCCCCeEEEeccCCcEEE
Confidence 999887 899987431 2222333333 36999999999999999999998754
No 50
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.62 E-value=3.2e-14 Score=151.41 Aligned_cols=244 Identities=16% Similarity=0.183 Sum_probs=170.7
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEe-eeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNEL-VSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~el-lS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
..++..+|.-+.|+- ++.-|+.|.-++ +.||+++......++ ..+|.+-|.-+...|..
T Consensus 16 ~~~~~~~v~Sv~wn~-------~g~~lasgs~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~~------------ 76 (313)
T KOG1407|consen 16 LQGHVQKVHSVAWNC-------DGTKLASGSFDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPKH------------ 76 (313)
T ss_pred hhhhhhcceEEEEcc-------cCceeeecccCCceEEEEecchhhhhhhcccCCCcchhhheeCCCC------------
Confidence 456788888888873 456788887665 899999754322222 34566778877776542
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEE
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAV 207 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV 207 (1010)
-++++++++ +++|++||++++++++.+.-...=..+.++| ..+++
T Consensus 77 ~d~~atas~---------------------------------dk~ir~wd~r~~k~~~~i~~~~eni~i~wsp~g~~~~~ 123 (313)
T KOG1407|consen 77 PDLFATASG---------------------------------DKTIRIWDIRSGKCTARIETKGENINITWSPDGEYIAV 123 (313)
T ss_pred CcceEEecC---------------------------------CceEEEEEeccCcEEEEeeccCcceEEEEcCCCCEEEE
Confidence 256676432 4789999999999999998877656678887 45555
Q ss_pred -EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 -GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 -~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
.-++.|.++|+++.+...+-. -+ -.+ +-+ .|+.
T Consensus 124 ~~kdD~it~id~r~~~~~~~~~---~~-------~e~----ne~---------------~w~~----------------- 157 (313)
T KOG1407|consen 124 GNKDDRITFIDARTYKIVNEEQ---FK-------FEV----NEI---------------SWNN----------------- 157 (313)
T ss_pred ecCcccEEEEEecccceeehhc---cc-------cee----eee---------------eecC-----------------
Confidence 456789999998866443221 10 000 001 1110
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK 366 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~ 366 (1010)
++..+. -...-|+|.|....+.+.+..|+
T Consensus 158 --~nd~Ff-------------------------------------------------lt~GlG~v~ILsypsLkpv~si~ 186 (313)
T KOG1407|consen 158 --SNDLFF-------------------------------------------------LTNGLGCVEILSYPSLKPVQSIK 186 (313)
T ss_pred --CCCEEE-------------------------------------------------EecCCceEEEEeccccccccccc
Confidence 000110 01245889999999999999999
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
||....-|+.|+|+|++||++|. +..+-+||+... .++..+.| ....|..|+||.||++|
T Consensus 187 AH~snCicI~f~p~GryfA~GsA-DAlvSLWD~~EL-----------------iC~R~isR--ldwpVRTlSFS~dg~~l 246 (313)
T KOG1407|consen 187 AHPSNCICIEFDPDGRYFATGSA-DALVSLWDVDEL-----------------ICERCISR--LDWPVRTLSFSHDGRML 246 (313)
T ss_pred cCCcceEEEEECCCCceEeeccc-cceeeccChhHh-----------------hhheeecc--ccCceEEEEeccCccee
Confidence 99999999999999999999999 466999998531 24444444 23479999999999999
Q ss_pred EEEeCCCeEEEEeCCCCC
Q 001814 447 AIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 447 AsgS~dGTVhIw~I~~~g 464 (1010)
|++|.|.-|-|=.++...
T Consensus 247 ASaSEDh~IDIA~vetGd 264 (313)
T KOG1407|consen 247 ASASEDHFIDIAEVETGD 264 (313)
T ss_pred eccCccceEEeEecccCC
Confidence 999999888776665543
No 51
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.62 E-value=1.4e-14 Score=163.96 Aligned_cols=245 Identities=15% Similarity=0.169 Sum_probs=176.8
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
..++.+.|....|.. ..+.=+++...-++|||+.. .-.+.+++++=+..|+.+.|- +.+.
T Consensus 22 ~~ke~~~vssl~fsp------~~P~d~aVt~S~rvqly~~~-~~~~~k~~srFk~~v~s~~fR-------------~DG~ 81 (487)
T KOG0310|consen 22 VHKEHNSVSSLCFSP------KHPYDFAVTSSVRVQLYSSV-TRSVRKTFSRFKDVVYSVDFR-------------SDGR 81 (487)
T ss_pred cccccCcceeEecCC------CCCCceEEecccEEEEEecc-hhhhhhhHHhhccceeEEEee-------------cCCe
Confidence 455566677777763 12445777778899999984 445667777767788888754 4445
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC---CeEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP---RIVAV 207 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~---rlLAV 207 (1010)
|||. ||. -+.|+|+|.++...+..++-| .+|..+.|++ .+++.
T Consensus 82 Llaa--GD~-------------------------------sG~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~t~l~s 128 (487)
T KOG0310|consen 82 LLAA--GDE-------------------------------SGHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDNTMLVS 128 (487)
T ss_pred EEEc--cCC-------------------------------cCcEEEeccccHHHHHHHhhccCceeEEEecccCCeEEEe
Confidence 7773 432 157999998887777777665 5999999988 36667
Q ss_pred EeCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 GLAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 ~ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
|.|+ .+++||+.+......+.+|..- .|..++.
T Consensus 129 ~sDd~v~k~~d~s~a~v~~~l~~htDY--------------------VR~g~~~-------------------------- 162 (487)
T KOG0310|consen 129 GSDDKVVKYWDLSTAYVQAELSGHTDY--------------------VRCGDIS-------------------------- 162 (487)
T ss_pred cCCCceEEEEEcCCcEEEEEecCCcce--------------------eEeeccc--------------------------
Confidence 7776 5889999987765566655441 0111111
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC-cEEEEe
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR-AIISQF 365 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~-~~v~~~ 365 (1010)
|.++.+ +.+|++||.|++||+... ..+..|
T Consensus 163 -~~~~hi------------------------------------------------vvtGsYDg~vrl~DtR~~~~~v~el 193 (487)
T KOG0310|consen 163 -PANDHI------------------------------------------------VVTGSYDGKVRLWDTRSLTSRVVEL 193 (487)
T ss_pred -cCCCeE------------------------------------------------EEecCCCceEEEEEeccCCceeEEe
Confidence 111111 236789999999999987 455554
Q ss_pred ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 366 KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 366 ~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
.|..||..+.|=|+|+++|||+ |..++|||+.. |. +.++.+ +.|...|+|+.+..|++.
T Consensus 194 -nhg~pVe~vl~lpsgs~iasAg--Gn~vkVWDl~~-------G~---------qll~~~--~~H~KtVTcL~l~s~~~r 252 (487)
T KOG0310|consen 194 -NHGCPVESVLALPSGSLIASAG--GNSVKVWDLTT-------GG---------QLLTSM--FNHNKTVTCLRLASDSTR 252 (487)
T ss_pred -cCCCceeeEEEcCCCCEEEEcC--CCeEEEEEecC-------Cc---------eehhhh--hcccceEEEEEeecCCce
Confidence 4889999999999999999998 78999999953 31 445443 325567999999999999
Q ss_pred EEEEeCCCeEEEEeCCCCCC
Q 001814 446 IAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 446 LAsgS~dGTVhIw~I~~~gg 465 (1010)
|.+++.|+.|+||++..++-
T Consensus 253 LlS~sLD~~VKVfd~t~~Kv 272 (487)
T KOG0310|consen 253 LLSGSLDRHVKVFDTTNYKV 272 (487)
T ss_pred EeecccccceEEEEccceEE
Confidence 99999999999999887754
No 52
>PTZ00421 coronin; Provisional
Probab=99.62 E-value=3e-13 Score=159.67 Aligned_cols=250 Identities=12% Similarity=0.109 Sum_probs=163.9
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCC------cceEeeeeccCCEEEEEEecCCCCCCCCC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDAS------NFNELVSKRDGPVSFLQMQPFPVKDDGCE 124 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g------~v~ellS~hdGpV~~v~~lP~p~~s~~~D 124 (1010)
..+|++.|..+.|... +.++|++|..+| ++|||+...+ .....+.+|...|.+++|.|..
T Consensus 71 l~GH~~~V~~v~fsP~------d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~------- 137 (493)
T PTZ00421 71 LLGQEGPIIDVAFNPF------DPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSA------- 137 (493)
T ss_pred EeCCCCCEEEEEEcCC------CCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCC-------
Confidence 3568899999999741 245677776655 9999996432 2345677899999999998753
Q ss_pred CccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC-
Q 001814 125 GFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP- 202 (1010)
Q Consensus 125 ~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~- 202 (1010)
..+|+..+ .+++|+|||+++++.+..+..+ ..|.+|+|++
T Consensus 138 -----~~iLaSgs---------------------------------~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spd 179 (493)
T PTZ00421 138 -----MNVLASAG---------------------------------ADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLD 179 (493)
T ss_pred -----CCEEEEEe---------------------------------CCCEEEEEECCCCeEEEEEcCCCCceEEEEEECC
Confidence 13555421 2478999999999999998755 5899999987
Q ss_pred -CeEEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCC
Q 001814 203 -RIVAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPS 280 (1010)
Q Consensus 203 -rlLAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p 280 (1010)
++|+++ .++.|+|||+++.+.+.++..|.... ...+. |.
T Consensus 180 G~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~------------~~~~~---------------w~------------ 220 (493)
T PTZ00421 180 GSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAK------------SQRCL---------------WA------------ 220 (493)
T ss_pred CCEEEEecCCCEEEEEECCCCcEEEEEecCCCCc------------ceEEE---------------Ec------------
Confidence 466655 56789999999988877776553310 00011 10
Q ss_pred CCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc
Q 001814 281 GVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA 360 (1010)
Q Consensus 281 ~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~ 360 (1010)
+.++.+++ ...+...++.|+|||+.+..
T Consensus 221 -------~~~~~ivt---------------------------------------------~G~s~s~Dr~VklWDlr~~~ 248 (493)
T PTZ00421 221 -------KRKDLIIT---------------------------------------------LGCSKSQQRQIMLWDTRKMA 248 (493)
T ss_pred -------CCCCeEEE---------------------------------------------EecCCCCCCeEEEEeCCCCC
Confidence 00011110 00023468999999998754
Q ss_pred -EEEEeccCC-CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEE
Q 001814 361 -IISQFKAHT-SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDIC 438 (1010)
Q Consensus 361 -~v~~~~aHt-spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIA 438 (1010)
.+..+..|. ..+....|+++|.+|++++..+..||+|++.. + ..++.+.. .....+..++
T Consensus 249 ~p~~~~~~d~~~~~~~~~~d~d~~~L~lggkgDg~Iriwdl~~-------~----------~~~~~~~~-~s~~~~~g~~ 310 (493)
T PTZ00421 249 SPYSTVDLDQSSALFIPFFDEDTNLLYIGSKGEGNIRCFELMN-------E----------RLTFCSSY-SSVEPHKGLC 310 (493)
T ss_pred CceeEeccCCCCceEEEEEcCCCCEEEEEEeCCCeEEEEEeeC-------C----------ceEEEeec-cCCCCCcceE
Confidence 444444443 45667789999999999986445699999963 2 23332222 2233577888
Q ss_pred EccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 439 FSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 439 FSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
|.| ++-+-...-.-.++|++...
T Consensus 311 ~~p--k~~~dv~~~Ei~r~~~l~~~ 333 (493)
T PTZ00421 311 MMP--KWSLDTRKCEIARFYALTYH 333 (493)
T ss_pred ecc--cccccccceeeeEEEEecCC
Confidence 988 45555555556688888644
No 53
>PTZ00420 coronin; Provisional
Probab=99.61 E-value=2.8e-13 Score=161.69 Aligned_cols=218 Identities=12% Similarity=0.151 Sum_probs=147.1
Q ss_pred EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCC
Q 001814 86 FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQS 165 (1010)
Q Consensus 86 ~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~ 165 (1010)
++||+... ......+.+|.++|..+++.|.. ..+||..+
T Consensus 56 I~L~~~~r-~~~v~~L~gH~~~V~~lafsP~~------------~~lLASgS---------------------------- 94 (568)
T PTZ00420 56 IRLENQMR-KPPVIKLKGHTSSILDLQFNPCF------------SEILASGS---------------------------- 94 (568)
T ss_pred EEeeecCC-CceEEEEcCCCCCEEEEEEcCCC------------CCEEEEEe----------------------------
Confidence 78998754 33455677899999999998752 13555421
Q ss_pred CCCCCCCCEEEEEeCCCCe--------EEEEEeCC-CcEEEEEEcC---CeEEE-EeCCeEEEEECCCCceeEEEeecCC
Q 001814 166 GNCVNSPTAVRFYSFQSHC--------YEHVLRFR-SSVCMVRCSP---RIVAV-GLATQIYCFDALTLENKFSVLTYPV 232 (1010)
Q Consensus 166 ~~~~~sp~tVrIWDlktge--------~V~tL~f~-S~V~sVa~S~---rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~ 232 (1010)
.+++|+|||+.++. .+..+..| ..|.+|+|+| .+|++ +.++.|+|||+.+.+...++. ++.
T Consensus 95 -----~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~-~~~ 168 (568)
T PTZ00420 95 -----EDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQIN-MPK 168 (568)
T ss_pred -----CCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEe-cCC
Confidence 24789999998642 33455544 5899999998 24554 567899999999987666553 211
Q ss_pred ccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhccccee
Q 001814 233 PQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKT 312 (1010)
Q Consensus 233 p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~kt 312 (1010)
. ...++ +. |++. +
T Consensus 169 ~-------------V~Sls-------ws---------------------------pdG~-l------------------- 181 (568)
T PTZ00420 169 K-------------LSSLK-------WN---------------------------IKGN-L------------------- 181 (568)
T ss_pred c-------------EEEEE-------EC---------------------------CCCC-E-------------------
Confidence 0 01111 11 1111 1
Q ss_pred eccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEE-----EEECCCCCEEEEE
Q 001814 313 LSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISA-----LCFDPSGTLLVTA 387 (1010)
Q Consensus 313 ls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsa-----LaFSPdGtlLATA 387 (1010)
++.+..|+.|+|||+.+++.+..+.+|...+.+ ..|++++.+|+|+
T Consensus 182 -----------------------------Lat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTt 232 (568)
T PTZ00420 182 -----------------------------LSGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILST 232 (568)
T ss_pred -----------------------------EEEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEE
Confidence 123456899999999999999999999886543 3467999999998
Q ss_pred EcCC---CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 388 SVYG---NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 388 S~dG---t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+.++ +.|+|||+... + ..+..+.-......+....+.++|.++++|+.|++|++|++..
T Consensus 233 G~d~~~~R~VkLWDlr~~------~----------~pl~~~~ld~~~~~L~p~~D~~tg~l~lsGkGD~tIr~~e~~~ 294 (568)
T PTZ00420 233 GFSKNNMREMKLWDLKNT------T----------SALVTMSIDNASAPLIPHYDESTGLIYLIGKGDGNCRYYQHSL 294 (568)
T ss_pred EcCCCCccEEEEEECCCC------C----------CceEEEEecCCccceEEeeeCCCCCEEEEEECCCeEEEEEccC
Confidence 8765 36999998631 1 2333322111223445555677799999999999999999964
No 54
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.61 E-value=7.8e-14 Score=163.70 Aligned_cols=330 Identities=16% Similarity=0.174 Sum_probs=200.1
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
++...|++||.+| +|||+.++ +...-.+.+|...|+++++...+ -.|| +|+
T Consensus 75 ~d~l~lAVGYaDGsVqif~~~s-~~~~~tfngHK~AVt~l~fd~~G-------------~rla--SGs------------ 126 (888)
T KOG0306|consen 75 DDILLLAVGYADGSVQIFSLES-EEILITFNGHKAAVTTLKFDKIG-------------TRLA--SGS------------ 126 (888)
T ss_pred CCcceEEEEecCceEEeeccCC-CceeeeecccccceEEEEEcccC-------------ceEe--ecC------------
Confidence 3567899999999 89999964 45666788899999999987544 2344 221
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEE-EEeCCeEEEEECCCCceeEEE
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVA-VGLATQIYCFDALTLENKFSV 227 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLA-V~ld~~I~IwD~~Tle~l~tL 227 (1010)
.++.|.+||+-..+-.-.|+.| ..|...-|.+ ++|+ ++-|..|++||+.+..+..|.
T Consensus 127 -------------------kDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~~~~~lvS~sKDs~iK~WdL~tqhCf~Th 187 (888)
T KOG0306|consen 127 -------------------KDTDIIVWDLVGEEGLFRLRGHKDSITQALFLNGDSFLVSVSKDSMIKFWDLETQHCFETH 187 (888)
T ss_pred -------------------CCccEEEEEeccceeeEEeecchHHHhHHhccCCCeEEEEeccCceEEEEecccceeeeEE
Confidence 2577999999877767777765 4777766755 4444 566789999999999998888
Q ss_pred eecCCccccCCCccccccCccceEEcc-ceEEEccC-CeeeccCCccCCCcCCC-----------------CCCCCCcCC
Q 001814 228 LTYPVPQLAGQGAVGINVGYGPMAVGP-RWLAYASN-TLLLSNSGRLSPQNLTP-----------------SGVSPSTSP 288 (1010)
Q Consensus 228 ~t~p~p~~~~~g~~~vnv~~gplAlgp-RwLAyas~-~~~iwd~G~vs~Q~lt~-----------------p~vS~stSP 288 (1010)
..+... ...|++.+ +.++.++. .+.+|..+-......++ -.+...+.+
T Consensus 188 vd~r~E-------------iw~l~~~~~~lvt~~~dse~~v~~L~~~~D~~~~~~~~s~~~~G~~~rqsk~R~i~l~~d~ 254 (888)
T KOG0306|consen 188 VDHRGE-------------IWALVLDEKLLVTAGTDSELKVWELAFEDDEKETNRYISTKLRGTFIRQSKGREINLVTDF 254 (888)
T ss_pred ecccce-------------EEEEEEecceEEEEecCCceEEEEeecccccccccccceeeccceeeeccCCceeEEeecC
Confidence 776551 23466666 55555543 45667532111110000 012233444
Q ss_pred CCCceEEEeehh------------hhhhhhc----------------ccc-------------eeeccccc-cccCCCC-
Q 001814 289 GGSSLVARYAME------------HSKQFAA----------------GLS-------------KTLSKYCQ-ELLPDGS- 325 (1010)
Q Consensus 289 ~~gslVa~~A~d------------ssk~la~----------------Gi~-------------ktls~y~~-~l~p~gs- 325 (1010)
++-.++++-|.+ ..|.+.+ ++- ++..++.. ++.|++.
T Consensus 255 s~r~~~c~g~d~~~e~frI~s~~E~~k~l~Kk~k~~Kkka~t~e~~~~v~~sl~~~i~r~~~ir~~~kiks~dv~~~~~~ 334 (888)
T KOG0306|consen 255 SDRFLVCQGADKVIELFRIRSKEEIAKILSKKLKRAKKKAETEENEDDVEKSLSDEIKRLETIRTSAKIKSFDVTPSGGT 334 (888)
T ss_pred cccEEEEecchhhhhheeecCHHHHHHHHHHHHHHhhhhccccccccchhhhHHHHHHHHHheechhheeEEEEEecCCc
Confidence 444455543211 1111111 110 00000000 0001000
Q ss_pred ----------------------CCC-------------------ccCCCc--------------cc--------------
Q 001814 326 ----------------------SSP-------------------VSPNSV--------------WK-------------- 336 (1010)
Q Consensus 326 ----------------------~s~-------------------~S~s~~--------------~k-------------- 336 (1010)
.+| +.++.+ |.
T Consensus 335 ~~~lv~l~nNtv~~ysl~~s~~~~p~~~~~~~i~~~GHR~dVRsl~vS~d~~~~~Sga~~SikiWn~~t~kciRTi~~~y 414 (888)
T KOG0306|consen 335 ENTLVLLANNTVEWYSLENSGKTSPEADRTSNIEIGGHRSDVRSLCVSSDSILLASGAGESIKIWNRDTLKCIRTITCGY 414 (888)
T ss_pred ceeEEEeecCceEEEEeccCCCCCccccccceeeeccchhheeEEEeecCceeeeecCCCcEEEEEccCcceeEEecccc
Confidence 000 000000 00
Q ss_pred -------cccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCC
Q 001814 337 -------VGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSG 409 (1010)
Q Consensus 337 -------~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG 409 (1010)
++...+..|...|.+.|||+.+...+-+++||+..|..|+.+|||+.++|||. +++|++|+..-...-+ |
T Consensus 415 ~l~~~Fvpgd~~Iv~G~k~Gel~vfdlaS~~l~Eti~AHdgaIWsi~~~pD~~g~vT~sa-DktVkfWdf~l~~~~~--g 491 (888)
T KOG0306|consen 415 ILASKFVPGDRYIVLGTKNGELQVFDLASASLVETIRAHDGAIWSISLSPDNKGFVTGSA-DKTVKFWDFKLVVSVP--G 491 (888)
T ss_pred EEEEEecCCCceEEEeccCCceEEEEeehhhhhhhhhccccceeeeeecCCCCceEEecC-CcEEEEEeEEEEeccC--c
Confidence 00111234556677777777777778889999999999999999999999999 5779999975321111 2
Q ss_pred CCccccCCcceEEEEEe--ccc-ccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccccccc
Q 001814 410 NHKYDWNSSHVHLYKLH--RGI-TSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLS 473 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~L~--RG~-t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~ 473 (1010)
+ ......++ |-. -...|.++++|||+++||++-.|.||+||-++..+-...+.+|-
T Consensus 492 t--------~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKFflsLYGHk 550 (888)
T KOG0306|consen 492 T--------QKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKFFLSLYGHK 550 (888)
T ss_pred c--------cceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEEEecceeeeeeecccc
Confidence 1 11111111 100 12369999999999999999999999999999988878888883
No 55
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.60 E-value=4.8e-15 Score=161.82 Aligned_cols=240 Identities=16% Similarity=0.199 Sum_probs=174.1
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
.-+|++-|+...|| .++|+.|..+ +++|||++ +++..+++-+|-..|-.+.|..
T Consensus 233 L~GHtGSVLCLqyd---------~rviisGSSDsTvrvWDv~-tge~l~tlihHceaVLhlrf~n--------------- 287 (499)
T KOG0281|consen 233 LTGHTGSVLCLQYD---------ERVIVSGSSDSTVRVWDVN-TGEPLNTLIHHCEAVLHLRFSN--------------- 287 (499)
T ss_pred hhcCCCcEEeeecc---------ceEEEecCCCceEEEEecc-CCchhhHHhhhcceeEEEEEeC---------------
Confidence 35677777777776 3688888865 59999995 6888888888988999998762
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEE---EEEe-CCCcEEEEEEcCCeEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYE---HVLR-FRSSVCMVRCSPRIVA 206 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V---~tL~-f~S~V~sVa~S~rlLA 206 (1010)
.+++.++ .++++.+||+.+-..+ +.|. +...|..|.|+.++++
T Consensus 288 g~mvtcS---------------------------------kDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~kyIV 334 (499)
T KOG0281|consen 288 GYMVTCS---------------------------------KDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIV 334 (499)
T ss_pred CEEEEec---------------------------------CCceeEEEeccCchHHHHHHHHhhhhhheeeeccccceEE
Confidence 2444322 2578999999865532 2333 4578999999999888
Q ss_pred EEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 207 VGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 207 V~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
.+. |.+|++|++.|.+.+.+|..|... +| +|-|-
T Consensus 335 sASgDRTikvW~~st~efvRtl~gHkRG----------------IA----ClQYr------------------------- 369 (499)
T KOG0281|consen 335 SASGDRTIKVWSTSTCEFVRTLNGHKRG----------------IA----CLQYR------------------------- 369 (499)
T ss_pred EecCCceEEEEeccceeeehhhhccccc----------------ce----ehhcc-------------------------
Confidence 765 457999999999999888877541 01 01111
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF 365 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~ 365 (1010)
+.|| .+|+.|.+|++||+..|.++..+
T Consensus 370 -----~rlv------------------------------------------------VSGSSDntIRlwdi~~G~cLRvL 396 (499)
T KOG0281|consen 370 -----DRLV------------------------------------------------VSGSSDNTIRLWDIECGACLRVL 396 (499)
T ss_pred -----CeEE------------------------------------------------EecCCCceEEEEeccccHHHHHH
Confidence 1111 24567999999999999999999
Q ss_pred ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 366 KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 366 ~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
++|..-|.++.|+ .+.++++.-||+ |+|||+....... -.+.--++..+-+ +.+.|..+.|. ...
T Consensus 397 eGHEeLvRciRFd--~krIVSGaYDGk-ikvWdl~aaldpr--------a~~~~~Cl~~lv~--hsgRVFrLQFD--~fq 461 (499)
T KOG0281|consen 397 EGHEELVRCIRFD--NKRIVSGAYDGK-IKVWDLQAALDPR--------APASTLCLRTLVE--HSGRVFRLQFD--EFQ 461 (499)
T ss_pred hchHHhhhheeec--Cceeeeccccce-EEEEecccccCCc--------ccccchHHHhhhh--ccceeEEEeec--ceE
Confidence 9999999999996 578999999886 9999986421000 0001123444433 34578999985 578
Q ss_pred EEEEeCCCeEEEEeCCC
Q 001814 446 IAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 446 LAsgS~dGTVhIw~I~~ 462 (1010)
|+++|.|.||.||+...
T Consensus 462 IvsssHddtILiWdFl~ 478 (499)
T KOG0281|consen 462 IISSSHDDTILIWDFLN 478 (499)
T ss_pred EEeccCCCeEEEEEcCC
Confidence 99999999999999864
No 56
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.60 E-value=6.8e-13 Score=144.39 Aligned_cols=243 Identities=13% Similarity=0.167 Sum_probs=159.3
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
+....|.-+.|+. ++.+|+++.+ +.+||||+.+ +....++..+.=.|..+.|.-.+ . -
T Consensus 12 ~~~~~i~sl~fs~-------~G~~litss~dDsl~LYd~~~-g~~~~ti~skkyG~~~~~Fth~~-----------~--~ 70 (311)
T KOG1446|consen 12 ETNGKINSLDFSD-------DGLLLITSSEDDSLRLYDSLS-GKQVKTINSKKYGVDLACFTHHS-----------N--T 70 (311)
T ss_pred cCCCceeEEEecC-------CCCEEEEecCCCeEEEEEcCC-CceeeEeecccccccEEEEecCC-----------c--e
Confidence 3677888888874 5667777655 5899999965 44555555444456666665211 1 1
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC---CeEEEE
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP---RIVAVG 208 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~---rlLAV~ 208 (1010)
++.+ .+ .-+.+||.-++.++++++.+..|. .|.++..+| .+|.++
T Consensus 71 -~i~s--St----------------------------k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~d~FlS~S 119 (311)
T KOG1446|consen 71 -VIHS--ST----------------------------KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKDDTFLSSS 119 (311)
T ss_pred -EEEc--cC----------------------------CCCCceEEEEeecCceEEEcCCCCceEEEEEecCCCCeEEecc
Confidence 2211 10 014789999999999999998775 899999999 477788
Q ss_pred eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCC
Q 001814 209 LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSP 288 (1010)
Q Consensus 209 ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP 288 (1010)
+|++|++||++..++..-+.-... ..+ ||.+
T Consensus 120 ~D~tvrLWDlR~~~cqg~l~~~~~---------------pi~-------AfDp--------------------------- 150 (311)
T KOG1446|consen 120 LDKTVRLWDLRVKKCQGLLNLSGR---------------PIA-------AFDP--------------------------- 150 (311)
T ss_pred cCCeEEeeEecCCCCceEEecCCC---------------cce-------eECC---------------------------
Confidence 999999999997776544432111 112 2222
Q ss_pred CCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC--cEEEEec
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR--AIISQFK 366 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~--~~v~~~~ 366 (1010)
.|-.+| .+...+.|++||+.+- ..-.+|.
T Consensus 151 ~GLifA-------------------------------------------------~~~~~~~IkLyD~Rs~dkgPF~tf~ 181 (311)
T KOG1446|consen 151 EGLIFA-------------------------------------------------LANGSELIKLYDLRSFDKGPFTTFS 181 (311)
T ss_pred CCcEEE-------------------------------------------------EecCCCeEEEEEecccCCCCceeEc
Confidence 111111 1222338899998763 2333443
Q ss_pred ---cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC
Q 001814 367 ---AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS 443 (1010)
Q Consensus 367 ---aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg 443 (1010)
+-....+.|.|||||++|+-... +..|.|.|.-. |. ..+-+.++..... .--+.+|+|||
T Consensus 182 i~~~~~~ew~~l~FS~dGK~iLlsT~-~s~~~~lDAf~-------G~--------~~~tfs~~~~~~~-~~~~a~ftPds 244 (311)
T KOG1446|consen 182 ITDNDEAEWTDLEFSPDGKSILLSTN-ASFIYLLDAFD-------GT--------VKSTFSGYPNAGN-LPLSATFTPDS 244 (311)
T ss_pred cCCCCccceeeeEEcCCCCEEEEEeC-CCcEEEEEccC-------Cc--------EeeeEeeccCCCC-cceeEEECCCC
Confidence 34678899999999998887777 45588988743 41 1222233322111 12578999999
Q ss_pred CEEEEEeCCCeEEEEeCCCC
Q 001814 444 QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~I~~~ 463 (1010)
++|.+|+.||+||||.++..
T Consensus 245 ~Fvl~gs~dg~i~vw~~~tg 264 (311)
T KOG1446|consen 245 KFVLSGSDDGTIHVWNLETG 264 (311)
T ss_pred cEEEEecCCCcEEEEEcCCC
Confidence 99999999999999999654
No 57
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.60 E-value=7.5e-14 Score=162.11 Aligned_cols=240 Identities=18% Similarity=0.195 Sum_probs=174.9
Q ss_pred CCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEE
Q 001814 56 KDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLL 134 (1010)
Q Consensus 56 kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLA 134 (1010)
.+.|.-+.+.. .++.|++|+.+| ++|||+......+.+...|.+-|.++++.. .++
T Consensus 217 ~~~vtSv~ws~-------~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~rvg~laW~~----------------~~l 273 (484)
T KOG0305|consen 217 EELVTSVKWSP-------DGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHASRVGSLAWNS----------------SVL 273 (484)
T ss_pred CCceEEEEECC-------CCCEEEEeecCCeEEEEehhhccccccccCCcCceeEEEeccC----------------ceE
Confidence 56677666653 678999999998 799999876666666666889999999761 223
Q ss_pred EEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEE-EeCC-CcEEEEEEcC--CeEEEEe-
Q 001814 135 VVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHV-LRFR-SSVCMVRCSP--RIVAVGL- 209 (1010)
Q Consensus 135 vVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~t-L~f~-S~V~sVa~S~--rlLAV~l- 209 (1010)
. +|. -++++.++|++..+.+.. +..| ..|+.+++++ +.||++.
T Consensus 274 s-sGs-------------------------------r~~~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgn 321 (484)
T KOG0305|consen 274 S-SGS-------------------------------RDGKILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGN 321 (484)
T ss_pred E-Eec-------------------------------CCCcEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCC
Confidence 2 221 136799999998775554 6655 5899999988 6888754
Q ss_pred CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 210 ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 210 d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
|..++|||..+.+.++++.+|... +-.+|++| |.
T Consensus 322 DN~~~Iwd~~~~~p~~~~~~H~aA-------------VKA~awcP------------~q--------------------- 355 (484)
T KOG0305|consen 322 DNVVFIWDGLSPEPKFTFTEHTAA-------------VKALAWCP------------WQ--------------------- 355 (484)
T ss_pred ccceEeccCCCccccEEEecccee-------------eeEeeeCC------------Cc---------------------
Confidence 668999999888888888887551 23455544 10
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCC
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHT 369 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHt 369 (1010)
..++| . .-|..|+.|++||..++..+..+...
T Consensus 356 -~~lLA-----------s-----------------------------------GGGs~D~~i~fwn~~~g~~i~~vdtg- 387 (484)
T KOG0305|consen 356 -SGLLA-----------T-----------------------------------GGGSADRCIKFWNTNTGARIDSVDTG- 387 (484)
T ss_pred -cCceE-----------E-----------------------------------cCCCcccEEEEEEcCCCcEecccccC-
Confidence 01111 0 01457999999999999888766644
Q ss_pred CCeEEEEECCCCCEEEEE-EcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 370 SPISALCFDPSGTLLVTA-SVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATA-S~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
+.|..|.|++..+-|+++ +.-...|.||+... ...+..+ -|| ..+|..+++||||..|++
T Consensus 388 sQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps-----------------~~~~~~l-~gH-~~RVl~la~SPdg~~i~t 448 (484)
T KOG0305|consen 388 SQVCSLIWSKKYKELLSTHGYSENQITLWKYPS-----------------MKLVAEL-LGH-TSRVLYLALSPDGETIVT 448 (484)
T ss_pred CceeeEEEcCCCCEEEEecCCCCCcEEEEeccc-----------------cceeeee-cCC-cceeEEEEECCCCCEEEE
Confidence 789999999999766655 43344699999832 1344444 454 457999999999999999
Q ss_pred EeCCCeEEEEeCCCC
Q 001814 449 VSSKGTCHVFVLSPF 463 (1010)
Q Consensus 449 gS~dGTVhIw~I~~~ 463 (1010)
++.|+|+++|++-+.
T Consensus 449 ~a~DETlrfw~~f~~ 463 (484)
T KOG0305|consen 449 GAADETLRFWNLFDE 463 (484)
T ss_pred ecccCcEEeccccCC
Confidence 999999999999765
No 58
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.60 E-value=8.5e-13 Score=141.27 Aligned_cols=260 Identities=14% Similarity=0.151 Sum_probs=172.7
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCC--cceEeee-eccCCEEEEEEecCCCCCCCCCCcc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDAS--NFNELVS-KRDGPVSFLQMQPFPVKDDGCEGFR 127 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g--~v~ellS-~hdGpV~~v~~lP~p~~s~~~D~F~ 127 (1010)
.++|+++|-.+.|.. + .+.+|++|..+ .++||++...+ .++.++. .|.-.|+.+++.|-+
T Consensus 10 ~~gh~~r~W~~awhp-----~-~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g---------- 73 (312)
T KOG0645|consen 10 LSGHKDRVWSVAWHP-----G-KGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHG---------- 73 (312)
T ss_pred ecCCCCcEEEEEecc-----C-CceEEEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCC----------
Confidence 578999998888874 1 15688888765 59999986322 3444443 467789999999865
Q ss_pred ccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCC--eEEEEEeCC-CcEEEEEEcC--
Q 001814 128 KLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSH--CYEHVLRFR-SSVCMVRCSP-- 202 (1010)
Q Consensus 128 ~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktg--e~V~tL~f~-S~V~sVa~S~-- 202 (1010)
.+||.- + .+.++.||.-..+ +++.+|+.| +.|.+|++++
T Consensus 74 ---~~La~a-----------------------S----------FD~t~~Iw~k~~~efecv~~lEGHEnEVK~Vaws~sG 117 (312)
T KOG0645|consen 74 ---RYLASA-----------------------S----------FDATVVIWKKEDGEFECVATLEGHENEVKCVAWSASG 117 (312)
T ss_pred ---cEEEEe-----------------------e----------ccceEEEeecCCCceeEEeeeeccccceeEEEEcCCC
Confidence 366641 1 3578999977644 688999877 6999999987
Q ss_pred CeEEEEe-CCeEEEEECCCCc---eeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCC
Q 001814 203 RIVAVGL-ATQIYCFDALTLE---NKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLT 278 (1010)
Q Consensus 203 rlLAV~l-d~~I~IwD~~Tle---~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt 278 (1010)
++||.+. +..|.||.+.... +.-.|..|.-- + + ..+|+
T Consensus 118 ~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqD---------V-----------K--------~V~WH---------- 159 (312)
T KOG0645|consen 118 NYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQD---------V-----------K--------HVIWH---------- 159 (312)
T ss_pred CEEEEeeCCCeEEEEEecCCCcEEEEeeecccccc---------c-----------c--------EEEEc----------
Confidence 7999876 5579999876332 22223222210 0 0 01222
Q ss_pred CCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC
Q 001814 279 PSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT 358 (1010)
Q Consensus 279 ~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s 358 (1010)
|..+ .++++++|.+|++|+-..
T Consensus 160 ---------Pt~d-------------------------------------------------lL~S~SYDnTIk~~~~~~ 181 (312)
T KOG0645|consen 160 ---------PTED-------------------------------------------------LLFSCSYDNTIKVYRDED 181 (312)
T ss_pred ---------CCcc-------------------------------------------------eeEEeccCCeEEEEeecC
Confidence 0000 123568899999999873
Q ss_pred -C--cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc-----------C------CCCC--------
Q 001814 359 -R--AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR-----------S------GSGN-------- 410 (1010)
Q Consensus 359 -~--~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~-----------~------~sG~-------- 410 (1010)
. .++++|.+|...|.+++|+|.|..|++++.|++ ++||.....-.. . +.|.
T Consensus 182 dddW~c~~tl~g~~~TVW~~~F~~~G~rl~s~sdD~t-v~Iw~~~~~~~~~~sr~~Y~v~W~~~~IaS~ggD~~i~lf~~ 260 (312)
T KOG0645|consen 182 DDDWECVQTLDGHENTVWSLAFDNIGSRLVSCSDDGT-VSIWRLYTDLSGMHSRALYDVPWDNGVIASGGGDDAIRLFKE 260 (312)
T ss_pred CCCeeEEEEecCccceEEEEEecCCCceEEEecCCcc-eEeeeeccCcchhcccceEeeeecccceEeccCCCEEEEEEe
Confidence 2 478999999999999999999999999999766 899984321000 0 0010
Q ss_pred CccccCCcceEEEEEecccccccEEEEEEccC-CCEEEEEeCCCeEEEEeCC
Q 001814 411 HKYDWNSSHVHLYKLHRGITSATIQDICFSHY-SQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 411 ~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD-g~~LAsgS~dGTVhIw~I~ 461 (1010)
...+-.+...++++. .+.|.-.|.+++|.|. +.+|+++++||+|++|.+.
T Consensus 261 s~~~d~p~~~l~~~~-~~aHe~dVNsV~w~p~~~~~L~s~~DDG~v~~W~l~ 311 (312)
T KOG0645|consen 261 SDSPDEPSWNLLAKK-EGAHEVDVNSVQWNPKVSNRLASGGDDGIVNFWELE 311 (312)
T ss_pred cCCCCCchHHHHHhh-hcccccccceEEEcCCCCCceeecCCCceEEEEEec
Confidence 000000111122211 2334447999999995 7899999999999999874
No 59
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.58 E-value=3.7e-13 Score=153.48 Aligned_cols=107 Identities=20% Similarity=0.202 Sum_probs=91.5
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+++++.|+.|.+|+--..+-...++-|..-|.++.|||||+++||++.||+ |.|||=.+ | ..
T Consensus 163 i~T~sdDn~v~ffeGPPFKFk~s~r~HskFV~~VRysPDG~~Fat~gsDgk-i~iyDGkt-------g----------e~ 224 (603)
T KOG0318|consen 163 IATGSDDNTVAFFEGPPFKFKSSFREHSKFVNCVRYSPDGSRFATAGSDGK-IYIYDGKT-------G----------EK 224 (603)
T ss_pred EEeccCCCeEEEeeCCCeeeeecccccccceeeEEECCCCCeEEEecCCcc-EEEEcCCC-------c----------cE
Confidence 357789999999998887888899999999999999999999999999887 88998643 4 57
Q ss_pred EEEEec-ccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 422 LYKLHR-GITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 422 L~~L~R-G~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
+++|.- -.|...|+.|+||||++.|+++|.|.|++||+++.....
T Consensus 225 vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaDkt~KIWdVs~~slv 270 (603)
T KOG0318|consen 225 VGELEDSDAHKGSIFALSWSPDSTQFLTVSADKTIKIWDVSTNSLV 270 (603)
T ss_pred EEEecCCCCccccEEEEEECCCCceEEEecCCceEEEEEeeccceE
Confidence 888752 124457999999999999999999999999999986543
No 60
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.58 E-value=9.9e-14 Score=160.21 Aligned_cols=248 Identities=13% Similarity=0.135 Sum_probs=177.5
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEE
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFL 133 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLL 133 (1010)
.+.|+|..+.|..- .+=+|+.=|.+.++|||.+. ...-+-+.-.+-||+..+|++. .-+
T Consensus 11 ~rSdRVKsVd~HPt------ePw~la~LynG~V~IWnyet-qtmVksfeV~~~PvRa~kfiaR--------------knW 69 (794)
T KOG0276|consen 11 SRSDRVKSVDFHPT------EPWILAALYNGDVQIWNYET-QTMVKSFEVSEVPVRAAKFIAR--------------KNW 69 (794)
T ss_pred ccCCceeeeecCCC------CceEEEeeecCeeEEEeccc-ceeeeeeeecccchhhheeeec--------------cce
Confidence 36788888877631 23455555566699999965 3333444555788998888742 123
Q ss_pred EEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcCC--eEEEEeC
Q 001814 134 LVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSPR--IVAVGLA 210 (1010)
Q Consensus 134 AvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~r--lLAV~ld 210 (1010)
++++.| +-.||+|+..|++.|++++.|+ -|++|+..|. ++..+.|
T Consensus 70 iv~GsD--------------------------------D~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~vLtsSD 117 (794)
T KOG0276|consen 70 IVTGSD--------------------------------DMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPYVLTSSD 117 (794)
T ss_pred EEEecC--------------------------------CceEEEEecccceeeEEeeccccceeeeeecCCCCeEEecCC
Confidence 443321 3579999999999999998775 8999999983 4555554
Q ss_pred -CeEEEEECCCC-ceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCC
Q 001814 211 -TQIYCFDALTL-ENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSP 288 (1010)
Q Consensus 211 -~~I~IwD~~Tl-e~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP 288 (1010)
-.|++||-... .+.++..+|.-- .|. +|+-++.
T Consensus 118 Dm~iKlW~we~~wa~~qtfeGH~Hy---------------VMq-----v~fnPkD------------------------- 152 (794)
T KOG0276|consen 118 DMTIKLWDWENEWACEQTFEGHEHY---------------VMQ-----VAFNPKD------------------------- 152 (794)
T ss_pred ccEEEEeeccCceeeeeEEcCcceE---------------EEE-----EEecCCC-------------------------
Confidence 57999986542 344444444220 011 1111110
Q ss_pred CCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH 368 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH 368 (1010)
..++++++-|++|+||.+.+..+.-+|.+|
T Consensus 153 --------------------------------------------------~ntFaS~sLDrTVKVWslgs~~~nfTl~gH 182 (794)
T KOG0276|consen 153 --------------------------------------------------PNTFASASLDRTVKVWSLGSPHPNFTLEGH 182 (794)
T ss_pred --------------------------------------------------ccceeeeeccccEEEEEcCCCCCceeeecc
Confidence 013456788999999999999999999999
Q ss_pred CCCeEEEEECCCC--CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 369 TSPISALCFDPSG--TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 369 tspIsaLaFSPdG--tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
...|+++.|=+-| -+|+||+. +++|+|||..+- .++.+| .||++ .|..+.|.|.=-.|
T Consensus 183 ekGVN~Vdyy~~gdkpylIsgaD-D~tiKvWDyQtk-----------------~CV~TL-eGHt~-Nvs~v~fhp~lpii 242 (794)
T KOG0276|consen 183 EKGVNCVDYYTGGDKPYLISGAD-DLTIKVWDYQTK-----------------SCVQTL-EGHTN-NVSFVFFHPELPII 242 (794)
T ss_pred ccCcceEEeccCCCcceEEecCC-CceEEEeecchH-----------------HHHHHh-hcccc-cceEEEecCCCcEE
Confidence 9999999998876 49999987 678999998641 466666 56654 59999999999999
Q ss_pred EEEeCCCeEEEEeCCCCCCcccc
Q 001814 447 AIVSSKGTCHVFVLSPFGGDSGF 469 (1010)
Q Consensus 447 AsgS~dGTVhIw~I~~~gg~~~~ 469 (1010)
++||.|||++||.-.+|..+..+
T Consensus 243 isgsEDGTvriWhs~Ty~lE~tL 265 (794)
T KOG0276|consen 243 ISGSEDGTVRIWNSKTYKLEKTL 265 (794)
T ss_pred EEecCCccEEEecCcceehhhhh
Confidence 99999999999999888776543
No 61
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.58 E-value=7.9e-14 Score=146.68 Aligned_cols=243 Identities=16% Similarity=0.151 Sum_probs=165.3
Q ss_pred eEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEE
Q 001814 98 NELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRF 177 (1010)
Q Consensus 98 ~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrI 177 (1010)
..++..++|+|+.+++.-++ ++.+. +| ++++||+
T Consensus 10 ~~~l~~~qgaV~avryN~dG-------------nY~lt-cG--------------------------------sdrtvrL 43 (307)
T KOG0316|consen 10 LSILDCAQGAVRAVRYNVDG-------------NYCLT-CG--------------------------------SDRTVRL 43 (307)
T ss_pred ceeecccccceEEEEEccCC-------------CEEEE-cC--------------------------------CCceEEe
Confidence 35678899999999987554 45343 32 2589999
Q ss_pred EeCCCCeEEEEEeCCC-cEEEEEEcC---CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEc
Q 001814 178 YSFQSHCYEHVLRFRS-SVCMVRCSP---RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVG 253 (1010)
Q Consensus 178 WDlktge~V~tL~f~S-~V~sVa~S~---rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlg 253 (1010)
|+...|.+++++..|+ .|++++.+. ++.+.|.|..|++||+.|++....+.+|-.. .+.+.+.
T Consensus 44 WNp~rg~liktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~rgH~aq-------------VNtV~fN 110 (307)
T KOG0316|consen 44 WNPLRGALIKTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDRRFRGHLAQ-------------VNTVRFN 110 (307)
T ss_pred ecccccceeeeecCCCceeeeccccccccccccCCCCceEEEEEcccCeeeeecccccce-------------eeEEEec
Confidence 9999999999999886 899888864 4555677788999999999999999887541 2345553
Q ss_pred c--ceEEEcc--CCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCc
Q 001814 254 P--RWLAYAS--NTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPV 329 (1010)
Q Consensus 254 p--RwLAyas--~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~ 329 (1010)
- .-++.++ ..+.+||.-..+++ |+.-.. .|+ .|+. .+
T Consensus 111 eesSVv~SgsfD~s~r~wDCRS~s~e----PiQild-----------ea~-------D~V~-----------------Si 151 (307)
T KOG0316|consen 111 EESSVVASGSFDSSVRLWDCRSRSFE----PIQILD-----------EAK-------DGVS-----------------SI 151 (307)
T ss_pred CcceEEEeccccceeEEEEcccCCCC----ccchhh-----------hhc-------Ccee-----------------EE
Confidence 2 3344433 35678884211111 100000 000 1110 00
Q ss_pred cCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCC
Q 001814 330 SPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSG 409 (1010)
Q Consensus 330 S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG 409 (1010)
.....-+..++.||+++.||+..++.....-+| ||++++||+||+.+..++.+ .++|+-|-.+ |
T Consensus 152 ------~v~~heIvaGS~DGtvRtydiR~G~l~sDy~g~--pit~vs~s~d~nc~La~~l~-stlrLlDk~t-------G 215 (307)
T KOG0316|consen 152 ------DVAEHEIVAGSVDGTVRTYDIRKGTLSSDYFGH--PITSVSFSKDGNCSLASSLD-STLRLLDKET-------G 215 (307)
T ss_pred ------EecccEEEeeccCCcEEEEEeecceeehhhcCC--cceeEEecCCCCEEEEeecc-ceeeecccch-------h
Confidence 000012346788999999999999876665555 99999999999998888885 4599988643 4
Q ss_pred CCccccCCcceEEEEEecccccc-cEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 410 NHKYDWNSSHVHLYKLHRGITSA-TIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~L~RG~t~a-~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
+.|.. ..|+.+. -=.+.+|+.-...+++||.||.|.+|++.....
T Consensus 216 ----------klL~s-YkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~ 261 (307)
T KOG0316|consen 216 ----------KLLKS-YKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ 261 (307)
T ss_pred ----------HHHHH-hcccccceeeeeeeecccceeEEeccCCceEEEEEecccee
Confidence 23332 2455443 235678888889999999999999999976443
No 62
>PTZ00420 coronin; Provisional
Probab=99.57 E-value=1.3e-12 Score=155.95 Aligned_cols=130 Identities=9% Similarity=0.134 Sum_probs=97.1
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCc-------ceEeeeeccCCEEEEEEecCCCCCCCC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASN-------FNELVSKRDGPVSFLQMQPFPVKDDGC 123 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~-------v~ellS~hdGpV~~v~~lP~p~~s~~~ 123 (1010)
..+|++.|..+.|..- .+.+|++|..+| ++|||+...+. ....+..|.+.|.++++.|+..
T Consensus 70 L~gH~~~V~~lafsP~------~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~----- 138 (568)
T PTZ00420 70 LKGHTSSILDLQFNPC------FSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNY----- 138 (568)
T ss_pred EcCCCCCEEEEEEcCC------CCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCC-----
Confidence 4568889999999631 245787777765 89999964332 2235678899999999998641
Q ss_pred CCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC-
Q 001814 124 EGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP- 202 (1010)
Q Consensus 124 D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~- 202 (1010)
.+|+. ++ .+++|+|||+++++.+..+.++..|.+++|++
T Consensus 139 -------~iLaS-gS--------------------------------~DgtIrIWDl~tg~~~~~i~~~~~V~Slswspd 178 (568)
T PTZ00420 139 -------YIMCS-SG--------------------------------FDSFVNIWDIENEKRAFQINMPKKLSSLKWNIK 178 (568)
T ss_pred -------eEEEE-Ee--------------------------------CCCeEEEEECCCCcEEEEEecCCcEEEEEECCC
Confidence 13343 21 24789999999999988888888999999987
Q ss_pred -CeEEEEe-CCeEEEEECCCCceeEEEeecCC
Q 001814 203 -RIVAVGL-ATQIYCFDALTLENKFSVLTYPV 232 (1010)
Q Consensus 203 -rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~ 232 (1010)
++|+++. +++|+|||+++++.+.++..|..
T Consensus 179 G~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g 210 (568)
T PTZ00420 179 GNLLSGTCVGKHMHIIDPRKQEIASSFHIHDG 210 (568)
T ss_pred CCEEEEEecCCEEEEEECCCCcEEEEEecccC
Confidence 5777654 67899999999988887766543
No 63
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.56 E-value=3.4e-13 Score=145.86 Aligned_cols=242 Identities=16% Similarity=0.186 Sum_probs=173.3
Q ss_pred CCCcEEEEEEeeccC------CCC--------CC-CeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCC
Q 001814 55 LKDQVTWAGFDRLEY------GPS--------VF-KQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVK 119 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~------~~~--------~~-~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~ 119 (1010)
++.-++|--|+.||- +.+ .+ ..+|.+|.+..+++||++ .|++..-+..|..-|+.+. |.
T Consensus 68 Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~-tG~~~rk~k~h~~~vNs~~--p~--- 141 (338)
T KOG0265|consen 68 DRAIVLWNVYGDCENFWVLKGHSGAVMELHGMRDGSHILSCGTDKTVRGWDAE-TGKRIRKHKGHTSFVNSLD--PS--- 141 (338)
T ss_pred cceEEEEeccccccceeeeccccceeEeeeeccCCCEEEEecCCceEEEEecc-cceeeehhccccceeeecC--cc---
Confidence 445577776664432 211 23 456677777789999996 5766666666766777666 22
Q ss_pred CCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEE
Q 001814 120 DDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVR 199 (1010)
Q Consensus 120 s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa 199 (1010)
.|.+.+|+++. -++++|+||+++.+++++++-..++.+|.
T Consensus 142 ---------rrg~~lv~Sgs-------------------------------dD~t~kl~D~R~k~~~~t~~~kyqltAv~ 181 (338)
T KOG0265|consen 142 ---------RRGPQLVCSGS-------------------------------DDGTLKLWDIRKKEAIKTFENKYQLTAVG 181 (338)
T ss_pred ---------ccCCeEEEecC-------------------------------CCceEEEEeecccchhhccccceeEEEEE
Confidence 12233444431 25899999999999999998888999999
Q ss_pred EcC---CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCc
Q 001814 200 CSP---RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQN 276 (1010)
Q Consensus 200 ~S~---rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~ 276 (1010)
|+. +.+..+.++.|++||++..+.++++.+|..+ ...+++++
T Consensus 182 f~d~s~qv~sggIdn~ikvWd~r~~d~~~~lsGh~Dt-------------It~lsls~---------------------- 226 (338)
T KOG0265|consen 182 FKDTSDQVISGGIDNDIKVWDLRKNDGLYTLSGHADT-------------ITGLSLSR---------------------- 226 (338)
T ss_pred ecccccceeeccccCceeeeccccCcceEEeecccCc-------------eeeEEecc----------------------
Confidence 976 5666778899999999999999999887653 11122221
Q ss_pred CCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEEC
Q 001814 277 LTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDF 356 (1010)
Q Consensus 277 lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl 356 (1010)
.|..+ .+-.-|.+|++||+
T Consensus 227 ------------~gs~l-------------------------------------------------lsnsMd~tvrvwd~ 245 (338)
T KOG0265|consen 227 ------------YGSFL-------------------------------------------------LSNSMDNTVRVWDV 245 (338)
T ss_pred ------------CCCcc-------------------------------------------------ccccccceEEEEEe
Confidence 10000 01245789999999
Q ss_pred CCC----cEEEEeccCCC----CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 357 VTR----AIISQFKAHTS----PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 357 ~s~----~~v~~~~aHts----pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
... .++..|.+|.. .....++||+++.+..+|. ++.+.|||... ...+|+| -|
T Consensus 246 rp~~p~~R~v~if~g~~hnfeknlL~cswsp~~~~i~ags~-dr~vyvwd~~~-----------------r~~lykl-pG 306 (338)
T KOG0265|consen 246 RPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPNGTKITAGSA-DRFVYVWDTTS-----------------RRILYKL-PG 306 (338)
T ss_pred cccCCCCceEEEeecchhhhhhhcceeeccCCCCccccccc-cceEEEeeccc-----------------ccEEEEc-CC
Confidence 753 45888888754 4556899999999998888 56799999842 1478888 46
Q ss_pred cccccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
+ ...|..+.|.|.-..|.+++.|.||.+=
T Consensus 307 h-~gsvn~~~Fhp~e~iils~~sdk~i~lg 335 (338)
T KOG0265|consen 307 H-YGSVNEVDFHPTEPIILSCSSDKTIYLG 335 (338)
T ss_pred c-ceeEEEeeecCCCcEEEEeccCceeEee
Confidence 4 4579999999999999999999998763
No 64
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.55 E-value=8.7e-14 Score=165.09 Aligned_cols=207 Identities=14% Similarity=0.180 Sum_probs=148.6
Q ss_pred CEEEEEeCCCCeEEEEEe-CCCcEEEEEEcCC--eEEEEeC-CeEEEEECCCCceeEEEeecCCccccCCCccccccCcc
Q 001814 173 TAVRFYSFQSHCYEHVLR-FRSSVCMVRCSPR--IVAVGLA-TQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYG 248 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~-f~S~V~sVa~S~r--lLAV~ld-~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~g 248 (1010)
++|++||-+=|.+++.+. +.++|++|.|.++ ++++|.| .+|++|+..+-+++++|.+|-.-
T Consensus 31 G~IQlWDYRM~tli~rFdeHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlDY--------------- 95 (1202)
T KOG0292|consen 31 GVIQLWDYRMGTLIDRFDEHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLDY--------------- 95 (1202)
T ss_pred ceeeeehhhhhhHHhhhhccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccce---------------
Confidence 689999999999999885 5579999999983 6666555 48999999999999999876441
Q ss_pred ceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCC
Q 001814 249 PMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSP 328 (1010)
Q Consensus 249 plAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~ 328 (1010)
.|-+.|-..-+
T Consensus 96 -----VRt~~FHheyP---------------------------------------------------------------- 106 (1202)
T KOG0292|consen 96 -----VRTVFFHHEYP---------------------------------------------------------------- 106 (1202)
T ss_pred -----eEEeeccCCCc----------------------------------------------------------------
Confidence 01111111100
Q ss_pred ccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc---
Q 001814 329 VSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR--- 405 (1010)
Q Consensus 329 ~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~--- 405 (1010)
| +.+++.|.+|+||+..++++++.+.+|.+.|.|..|.|...++++||-| .+||||||.-..-+
T Consensus 107 ------W------IlSASDDQTIrIWNwqsr~~iavltGHnHYVMcAqFhptEDlIVSaSLD-QTVRVWDisGLRkk~~~ 173 (1202)
T KOG0292|consen 107 ------W------ILSASDDQTIRIWNWQSRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLD-QTVRVWDISGLRKKNKA 173 (1202)
T ss_pred ------e------EEEccCCCeEEEEeccCCceEEEEecCceEEEeeccCCccceEEEeccc-ceEEEEeecchhccCCC
Confidence 1 2356789999999999999999999999999999999999999999995 56999999531100
Q ss_pred CC------CCCC-ccccCC--cceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC-cc-ccccccC
Q 001814 406 SG------SGNH-KYDWNS--SHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG-DS-GFQTLSS 474 (1010)
Q Consensus 406 ~~------sG~~-~~~~~~--s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg-~~-~~~~H~s 474 (1010)
++ .|.. ..+.-. ...-.+.| .||+++ |..+||.|.--.|++|++|..|++|.++..+- ++ +-+.|..
T Consensus 174 pg~~e~~~~~~~~~~dLfg~~DaVVK~VL-EGHDRG-VNwaAfhpTlpliVSG~DDRqVKlWrmnetKaWEvDtcrgH~n 251 (1202)
T KOG0292|consen 174 PGSLEDQMRGQQGNSDLFGQTDAVVKHVL-EGHDRG-VNWAAFHPTLPLIVSGADDRQVKLWRMNETKAWEVDTCRGHYN 251 (1202)
T ss_pred CCCchhhhhccccchhhcCCcCeeeeeee-cccccc-cceEEecCCcceEEecCCcceeeEEEeccccceeehhhhcccC
Confidence 00 0000 001100 11112233 577654 88899999999999999999999999987654 22 3477765
Q ss_pred CCCC
Q 001814 475 QGGD 478 (1010)
Q Consensus 475 ~~~~ 478 (1010)
.+..
T Consensus 252 nVss 255 (1202)
T KOG0292|consen 252 NVSS 255 (1202)
T ss_pred Ccce
Confidence 4433
No 65
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.55 E-value=5e-13 Score=145.06 Aligned_cols=229 Identities=18% Similarity=0.268 Sum_probs=153.7
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEE
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFL 133 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLL 133 (1010)
.+...|+-+.|.. ..+++..|.++.++++|++ ++ -...+..|+++|++|...+.- ..+
T Consensus 52 ~~~~plL~c~F~d-------~~~~~~G~~dg~vr~~Dln-~~-~~~~igth~~~i~ci~~~~~~-------------~~v 109 (323)
T KOG1036|consen 52 KHGAPLLDCAFAD-------ESTIVTGGLDGQVRRYDLN-TG-NEDQIGTHDEGIRCIEYSYEV-------------GCV 109 (323)
T ss_pred ecCCceeeeeccC-------CceEEEeccCceEEEEEec-CC-cceeeccCCCceEEEEeeccC-------------CeE
Confidence 3666788888852 3466777767779999995 34 346778899999999988421 122
Q ss_pred EEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEe-CCe
Q 001814 134 LVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGL-ATQ 212 (1010)
Q Consensus 134 AvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~l-d~~ 212 (1010)
+ + |+|| ++|+|||.+....+.++.-...|+++..+.+.|+||. +.+
T Consensus 110 I--s---------------------gsWD----------~~ik~wD~R~~~~~~~~d~~kkVy~~~v~g~~LvVg~~~r~ 156 (323)
T KOG1036|consen 110 I--S---------------------GSWD----------KTIKFWDPRNKVVVGTFDQGKKVYCMDVSGNRLVVGTSDRK 156 (323)
T ss_pred E--E---------------------cccC----------ccEEEEeccccccccccccCceEEEEeccCCEEEEeecCce
Confidence 2 2 3564 7899999998777777766679999999998888854 568
Q ss_pred EEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCc
Q 001814 213 IYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSS 292 (1010)
Q Consensus 213 I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gs 292 (1010)
|.+||+++++..++..+.+. -.-.|.++.-++.- |
T Consensus 157 v~iyDLRn~~~~~q~reS~l------------------kyqtR~v~~~pn~e--------------------------G- 191 (323)
T KOG1036|consen 157 VLIYDLRNLDEPFQRRESSL------------------KYQTRCVALVPNGE--------------------------G- 191 (323)
T ss_pred EEEEEcccccchhhhccccc------------------eeEEEEEEEecCCC--------------------------c-
Confidence 99999999987665544332 11223333222110 0
Q ss_pred eEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC----cEEEEeccC
Q 001814 293 LVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR----AIISQFKAH 368 (1010)
Q Consensus 293 lVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~----~~v~~~~aH 368 (1010)
| +.++.+|.|.|=++... +..-.|++|
T Consensus 192 -----------------------y--------------------------~~sSieGRVavE~~d~s~~~~skkyaFkCH 222 (323)
T KOG1036|consen 192 -----------------------Y--------------------------VVSSIEGRVAVEYFDDSEEAQSKKYAFKCH 222 (323)
T ss_pred -----------------------e--------------------------EEEeecceEEEEccCCchHHhhhceeEEee
Confidence 0 00122333333333222 122234444
Q ss_pred C---------CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE
Q 001814 369 T---------SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF 439 (1010)
Q Consensus 369 t---------spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF 439 (1010)
. -||++|+|+|--..||||+.||- |.+||+.+ .+.|++|.+- ...|.+++|
T Consensus 223 r~~~~~~~~~yPVNai~Fhp~~~tfaTgGsDG~-V~~Wd~~~-----------------rKrl~q~~~~--~~SI~slsf 282 (323)
T KOG1036|consen 223 RLSEKDTEIIYPVNAIAFHPIHGTFATGGSDGI-VNIWDLFN-----------------RKRLKQLAKY--ETSISSLSF 282 (323)
T ss_pred ecccCCceEEEEeceeEeccccceEEecCCCce-EEEccCcc-----------------hhhhhhccCC--CCceEEEEe
Confidence 3 29999999999889999999775 89999854 1466777542 235999999
Q ss_pred ccCCCEEEEEeC
Q 001814 440 SHYSQWIAIVSS 451 (1010)
Q Consensus 440 SpDg~~LAsgS~ 451 (1010)
+.||..||++++
T Consensus 283 s~dG~~LAia~s 294 (323)
T KOG1036|consen 283 SMDGSLLAIASS 294 (323)
T ss_pred ccCCCeEEEEec
Confidence 999999999975
No 66
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.55 E-value=2.1e-13 Score=153.66 Aligned_cols=277 Identities=16% Similarity=0.216 Sum_probs=180.1
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEee---eeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELV---SKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ell---S~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
....|.-+.|.+ ....+|+.|+++.++||.++ |++...+ -...-|+.++.|.|++.... -+++-|+
T Consensus 212 s~~~I~sv~FHp------~~plllvaG~d~~lrifqvD--Gk~N~~lqS~~l~~fPi~~a~f~p~G~~~i---~~s~rrk 280 (514)
T KOG2055|consen 212 SHGGITSVQFHP------TAPLLLVAGLDGTLRIFQVD--GKVNPKLQSIHLEKFPIQKAEFAPNGHSVI---FTSGRRK 280 (514)
T ss_pred CcCCceEEEecC------CCceEEEecCCCcEEEEEec--CccChhheeeeeccCccceeeecCCCceEE---Eecccce
Confidence 345677777764 23456666777779999995 4444333 23468999999999874100 0222233
Q ss_pred EEEEEecCCCCCCCCCCCCCCcccc-----------ccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGV-----------RDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRC 200 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~v-----------r~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~ 200 (1010)
++-+.--...+- +......++ .++.+-.-.|+ .+.|.+-..+|++.+.+++..+.|.++.|
T Consensus 281 y~ysyDle~ak~----~k~~~~~g~e~~~~e~FeVShd~~fia~~G~----~G~I~lLhakT~eli~s~KieG~v~~~~f 352 (514)
T KOG2055|consen 281 YLYSYDLETAKV----TKLKPPYGVEEKSMERFEVSHDSNFIAIAGN----NGHIHLLHAKTKELITSFKIEGVVSDFTF 352 (514)
T ss_pred EEEEeecccccc----ccccCCCCcccchhheeEecCCCCeEEEccc----CceEEeehhhhhhhhheeeeccEEeeEEE
Confidence 333321111000 000001111 11111000111 27799999999999999999999999999
Q ss_pred cC---CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEc--cceEEEccCCeeeccCCccCCC
Q 001814 201 SP---RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVG--PRWLAYASNTLLLSNSGRLSPQ 275 (1010)
Q Consensus 201 S~---rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlg--pRwLAyas~~~~iwd~G~vs~Q 275 (1010)
+. .+++++..++|++||+++..+++....... +. ...++++ ++|
T Consensus 353 sSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~----------v~--gts~~~S~ng~y------------------- 401 (514)
T KOG2055|consen 353 SSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGS----------VH--GTSLCISLNGSY------------------- 401 (514)
T ss_pred ecCCcEEEEEcCCceEEEEecCCcceEEEEeecCc----------cc--eeeeeecCCCce-------------------
Confidence 75 477788889999999999988777664211 00 0111211 122
Q ss_pred cCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEE
Q 001814 276 NLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKD 355 (1010)
Q Consensus 276 ~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwD 355 (1010)
+++|+..|.|.|||
T Consensus 402 ------------------------------------------------------------------lA~GS~~GiVNIYd 415 (514)
T KOG2055|consen 402 ------------------------------------------------------------------LATGSDSGIVNIYD 415 (514)
T ss_pred ------------------------------------------------------------------EEeccCcceEEEec
Confidence 23456678888888
Q ss_pred CCC------CcEEEEeccCCCCeEEEEECCCCCEEEEEEcC-CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 356 FVT------RAIISQFKAHTSPISALCFDPSGTLLVTASVY-GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 356 l~s------~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d-Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
..+ .+.+..+..-+..|+.|+|+||+++||.||.. ...+|+-.+ |+ .+.+++|+.....+
T Consensus 416 ~~s~~~s~~PkPik~~dNLtt~Itsl~Fn~d~qiLAiaS~~~knalrLVHv-PS------~TVFsNfP~~n~~v------ 482 (514)
T KOG2055|consen 416 GNSCFASTNPKPIKTVDNLTTAITSLQFNHDAQILAIASRVKKNALRLVHV-PS------CTVFSNFPTSNTKV------ 482 (514)
T ss_pred cchhhccCCCCchhhhhhhheeeeeeeeCcchhhhhhhhhccccceEEEec-cc------eeeeccCCCCCCcc------
Confidence 653 45677777778899999999999999999863 334888877 43 45667777654322
Q ss_pred cccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..|.|++|||.|-++|+|...|.||+|.|..|
T Consensus 483 ---g~vtc~aFSP~sG~lAvGNe~grv~l~kL~hy 514 (514)
T KOG2055|consen 483 ---GHVTCMAFSPNSGYLAVGNEAGRVHLFKLHHY 514 (514)
T ss_pred ---cceEEEEecCCCceEEeecCCCceeeEeeccC
Confidence 35899999999999999999999999999754
No 67
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.54 E-value=8.1e-13 Score=160.71 Aligned_cols=252 Identities=17% Similarity=0.189 Sum_probs=173.6
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccC------CCc-----------ceEeeeeccCCEEEEEEe
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVED------ASN-----------FNELVSKRDGPVSFLQMQ 114 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~------~g~-----------v~ellS~hdGpV~~v~~l 114 (1010)
.+|...|..++|- +++++||.|.++. ++||.-.. -+. +..++.+|++-|..+.+.
T Consensus 66 ~~h~~sv~CVR~S-------~dG~~lAsGSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Ws 138 (942)
T KOG0973|consen 66 DDHDGSVNCVRFS-------PDGSYLASGSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWS 138 (942)
T ss_pred ccccCceeEEEEC-------CCCCeEeeccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccC
Confidence 4466678888885 3789999999998 69999862 111 456677889999999999
Q ss_pred cCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-C
Q 001814 115 PFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-S 193 (1010)
Q Consensus 115 P~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S 193 (1010)
|+. -+|+.|+ .+++|.|||.++.+.+.+++.| +
T Consensus 139 p~~-------------~~lvS~s---------------------------------~DnsViiwn~~tF~~~~vl~~H~s 172 (942)
T KOG0973|consen 139 PDD-------------SLLVSVS---------------------------------LDNSVIIWNAKTFELLKVLRGHQS 172 (942)
T ss_pred CCc-------------cEEEEec---------------------------------ccceEEEEccccceeeeeeecccc
Confidence 875 2566542 2578999999999999999866 6
Q ss_pred cEEEEEEcC--CeEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCC
Q 001814 194 SVCMVRCSP--RIVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSG 270 (1010)
Q Consensus 194 ~V~sVa~S~--rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G 270 (1010)
.|..|.|.| +++|+-. |..|+||+..+....+++..+-.. . +...+-.-
T Consensus 173 ~VKGvs~DP~Gky~ASqsdDrtikvwrt~dw~i~k~It~pf~~-~----------~~~T~f~R----------------- 224 (942)
T KOG0973|consen 173 LVKGVSWDPIGKYFASQSDDRTLKVWRTSDWGIEKSITKPFEE-S----------PLTTFFLR----------------- 224 (942)
T ss_pred cccceEECCccCeeeeecCCceEEEEEcccceeeEeeccchhh-C----------CCcceeee-----------------
Confidence 899999998 8999855 557999998776666666532110 0 00000000
Q ss_pred ccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCe
Q 001814 271 RLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGI 350 (1010)
Q Consensus 271 ~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~ 350 (1010)
++=||+|..+++-.| .-...-+
T Consensus 225 -------------lSWSPDG~~las~nA---------------------------------------------~n~~~~~ 246 (942)
T KOG0973|consen 225 -------------LSWSPDGHHLASPNA---------------------------------------------VNGGKST 246 (942)
T ss_pred -------------cccCCCcCeecchhh---------------------------------------------ccCCcce
Confidence 011344433322111 1112235
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEECC--------CCC---------EEEEEEcCCCeEEEEeCCCCcccCCCCCCcc
Q 001814 351 VVVKDFVTRAIISQFKAHTSPISALCFDP--------SGT---------LLVTASVYGNNINIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~aHtspIsaLaFSP--------dGt---------lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~ 413 (1010)
+.|.+-.+-+.-..|.+|..|+.++.|+| +|+ .+|+||.|+ .|-||.....
T Consensus 247 ~~IieR~tWk~~~~LvGH~~p~evvrFnP~lfe~~~~ng~~~~~~~~y~i~AvgSqDr-SlSVW~T~~~----------- 314 (942)
T KOG0973|consen 247 IAIIERGTWKVDKDLVGHSAPVEVVRFNPKLFERNNKNGTSTQPNCYYCIAAVGSQDR-SLSVWNTALP----------- 314 (942)
T ss_pred eEEEecCCceeeeeeecCCCceEEEEeChHHhccccccCCccCCCcceEEEEEecCCc-cEEEEecCCC-----------
Confidence 66666666666678999999999999998 222 567788855 5999987421
Q ss_pred ccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 414 DWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 414 ~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+.+...+. .....|.|++|||||.-|.++|.||||.++.++.
T Consensus 315 ------RPl~vi~~-lf~~SI~DmsWspdG~~LfacS~DGtV~~i~Fee 356 (942)
T KOG0973|consen 315 ------RPLFVIHN-LFNKSIVDMSWSPDGFSLFACSLDGTVALIHFEE 356 (942)
T ss_pred ------Cchhhhhh-hhcCceeeeeEcCCCCeEEEEecCCeEEEEEcch
Confidence 33333221 2345799999999999999999999999999875
No 68
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.54 E-value=4.9e-13 Score=144.71 Aligned_cols=255 Identities=18% Similarity=0.181 Sum_probs=170.1
Q ss_pred CcceehhhhhhcccccccccC-CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCE
Q 001814 30 ASTVASTVRSAGASVAASISN-ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPV 108 (1010)
Q Consensus 30 a~~~~~~~rs~~~s~a~~i~~-~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV 108 (1010)
|++=..|||---...+|...+ ....+...|+-+.|.. ++ .++++.|.++..++||+.. +.+ ..+..|++||
T Consensus 45 A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~Wsd----dg--skVf~g~~Dk~~k~wDL~S-~Q~-~~v~~Hd~pv 116 (347)
T KOG0647|consen 45 AGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSD----DG--SKVFSGGCDKQAKLWDLAS-GQV-SQVAAHDAPV 116 (347)
T ss_pred ecccCCceEEEEEecCCcccchhhhccCCCeEEEEEcc----CC--ceEEeeccCCceEEEEccC-CCe-eeeeecccce
Confidence 445556666543333333332 2244677888888874 22 4677777777799999964 544 4567899999
Q ss_pred EEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEE
Q 001814 109 SFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHV 188 (1010)
Q Consensus 109 ~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~t 188 (1010)
+.+.+.+.+.. ++|+ +|+|| +||||||.++...+.+
T Consensus 117 kt~~wv~~~~~-----------~cl~-----------------------TGSWD----------KTlKfWD~R~~~pv~t 152 (347)
T KOG0647|consen 117 KTCHWVPGMNY-----------QCLV-----------------------TGSWD----------KTLKFWDTRSSNPVAT 152 (347)
T ss_pred eEEEEecCCCc-----------ceeE-----------------------ecccc----------cceeecccCCCCeeee
Confidence 99999976521 3444 25674 7999999999999999
Q ss_pred EeCCCcEEEEEEcCCeEEEEeCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeec
Q 001814 189 LRFRSSVCMVRCSPRIVAVGLAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLS 267 (1010)
Q Consensus 189 L~f~S~V~sVa~S~rlLAV~ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iw 267 (1010)
+..+.+||++.+-..+++|++.+ .|.+|++......+....+|+ -+-.|.+|...+.-
T Consensus 153 ~~LPeRvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~~SpL------------------k~Q~R~va~f~d~~--- 211 (347)
T KOG0647|consen 153 LQLPERVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRIESPL------------------KWQTRCVACFQDKD--- 211 (347)
T ss_pred eeccceeeehhccCceeEEEecCCcEEEEEcCCCcchhhhhcCcc------------------cceeeEEEEEecCC---
Confidence 99999999999998899998876 599999877554443333332 22334444332210
Q ss_pred cCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCC
Q 001814 268 NSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDN 347 (1010)
Q Consensus 268 d~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~ 347 (1010)
+ .+-|+.
T Consensus 212 -----------------------~--------------------------------------------------~alGsi 218 (347)
T KOG0647|consen 212 -----------------------G--------------------------------------------------FALGSI 218 (347)
T ss_pred -----------------------c--------------------------------------------------eEeeee
Confidence 0 001234
Q ss_pred CCeEEEEECCCC--cEEEEeccCCC---------CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccC
Q 001814 348 AGIVVVKDFVTR--AIISQFKAHTS---------PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 348 dG~V~VwDl~s~--~~v~~~~aHts---------pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
.|.|-|..+..+ +.--+|+.|.. +|+.|+|.|.-..|||++.||+ +..||-..
T Consensus 219 EGrv~iq~id~~~~~~nFtFkCHR~~~~~~~~VYaVNsi~FhP~hgtlvTaGsDGt-f~FWDkda--------------- 282 (347)
T KOG0647|consen 219 EGRVAIQYIDDPNPKDNFTFKCHRSTNSVNDDVYAVNSIAFHPVHGTLVTAGSDGT-FSFWDKDA--------------- 282 (347)
T ss_pred cceEEEEecCCCCccCceeEEEeccCCCCCCceEEecceEeecccceEEEecCCce-EEEecchh---------------
Confidence 455555555543 22345566652 5788999999999999999887 89999631
Q ss_pred CcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 417 SSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 417 ~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
..+|.... .+...|.+.+|+.+|.++|-+.
T Consensus 283 --r~kLk~s~--~~~qpItcc~fn~~G~ifaYA~ 312 (347)
T KOG0647|consen 283 --RTKLKTSE--THPQPITCCSFNRNGSIFAYAL 312 (347)
T ss_pred --hhhhhccC--cCCCccceeEecCCCCEEEEEe
Confidence 12333322 2456899999999998887653
No 69
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.54 E-value=3.7e-14 Score=157.98 Aligned_cols=244 Identities=12% Similarity=0.104 Sum_probs=167.6
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCc--ceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASN--FNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~--v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
-|..+|.-+.|-. .+-.|+.|.. .-|++|++. .+. +..+|.+-.|++..+.+.+.. +
T Consensus 173 ~h~gev~~v~~l~-------~sdtlatgg~Dr~Ik~W~v~-~~k~~~~~tLaGs~g~it~~d~d~~~------------~ 232 (459)
T KOG0288|consen 173 AHEGEVHDVEFLR-------NSDTLATGGSDRIIKLWNVL-GEKSELISTLAGSLGNITSIDFDSDN------------K 232 (459)
T ss_pred ccccccceeEEcc-------Ccchhhhcchhhhhhhhhcc-cchhhhhhhhhccCCCcceeeecCCC------------c
Confidence 3444555555542 2234555555 458999994 333 556677778899999987654 1
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcCC--eEEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSPR--IVAV 207 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~r--lLAV 207 (1010)
..||. ..++.+++|++.+.+..++|..|+ .|.++.|... .++.
T Consensus 233 ~~iAa----------------------------------s~d~~~r~Wnvd~~r~~~TLsGHtdkVt~ak~~~~~~~vVs 278 (459)
T KOG0288|consen 233 HVIAA----------------------------------SNDKNLRLWNVDSLRLRHTLSGHTDKVTAAKFKLSHSRVVS 278 (459)
T ss_pred eEEee----------------------------------cCCCceeeeeccchhhhhhhcccccceeeehhhccccceee
Confidence 23341 124789999999999999998775 8999999762 2444
Q ss_pred Ee-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 GL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 ~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
+. +.+|++||+....+..++..-.. .+-+..+
T Consensus 279 gs~DRtiK~WDl~k~~C~kt~l~~S~--------------cnDI~~~--------------------------------- 311 (459)
T KOG0288|consen 279 GSADRTIKLWDLQKAYCSKTVLPGSQ--------------CNDIVCS--------------------------------- 311 (459)
T ss_pred ccccchhhhhhhhhhheecccccccc--------------ccceEec---------------------------------
Confidence 43 45799999987665443321000 0000000
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK 366 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~ 366 (1010)
...++++..|++|+.||+.+...+....
T Consensus 312 ----------------------------------------------------~~~~~SgH~DkkvRfwD~Rs~~~~~sv~ 339 (459)
T KOG0288|consen 312 ----------------------------------------------------ISDVISGHFDKKVRFWDIRSADKTRSVP 339 (459)
T ss_pred ----------------------------------------------------ceeeeecccccceEEEeccCCceeeEee
Confidence 0012356789999999999999999999
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc-ccccEEEEEEccCCCE
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI-TSATIQDICFSHYSQW 445 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~-t~a~I~sIAFSpDg~~ 445 (1010)
+|. .|++|..+++|..|.+++.|++ +.|+|+... + ..+.|.-. |. +......+.||||+.|
T Consensus 340 ~gg-~vtSl~ls~~g~~lLsssRDdt-l~viDlRt~------e---------I~~~~sA~-g~k~asDwtrvvfSpd~~Y 401 (459)
T KOG0288|consen 340 LGG-RVTSLDLSMDGLELLSSSRDDT-LKVIDLRTK------E---------IRQTFSAE-GFKCASDWTRVVFSPDGSY 401 (459)
T ss_pred cCc-ceeeEeeccCCeEEeeecCCCc-eeeeecccc------c---------EEEEeecc-ccccccccceeEECCCCce
Confidence 886 9999999999999999988665 999998632 1 13333321 22 2235788999999999
Q ss_pred EEEEeCCCeEEEEeCCCCCCccc
Q 001814 446 IAIVSSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 446 LAsgS~dGTVhIw~I~~~gg~~~ 468 (1010)
+|+||.||.|+||++...+.+-.
T Consensus 402 vaAGS~dgsv~iW~v~tgKlE~~ 424 (459)
T KOG0288|consen 402 VAAGSADGSVYIWSVFTGKLEKV 424 (459)
T ss_pred eeeccCCCcEEEEEccCceEEEE
Confidence 99999999999999987655443
No 70
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.54 E-value=2.1e-13 Score=151.81 Aligned_cols=277 Identities=16% Similarity=0.164 Sum_probs=177.7
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCc--ceEeeeeccCCEEEEEEecCCCCCCCCCCccc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASN--FNELVSKRDGPVSFLQMQPFPVKDDGCEGFRK 128 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~--v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~ 128 (1010)
.++|+|+|-++.|- +.++.||++..+. .-+|++..... +..++.+|..+|..+.++|+.
T Consensus 220 l~~htdEVWfl~FS-------~nGkyLAsaSkD~Taiiw~v~~d~~~kl~~tlvgh~~~V~yi~wSPDd----------- 281 (519)
T KOG0293|consen 220 LQDHTDEVWFLQFS-------HNGKYLASASKDSTAIIWIVVYDVHFKLKKTLVGHSQPVSYIMWSPDD----------- 281 (519)
T ss_pred HhhCCCcEEEEEEc-------CCCeeEeeccCCceEEEEEEecCcceeeeeeeecccCceEEEEECCCC-----------
Confidence 46899999999996 3689999999865 78999865443 346677899999999999975
Q ss_pred cCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC--CcEEEEEEcC--Ce
Q 001814 129 LHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR--SSVCMVRCSP--RI 204 (1010)
Q Consensus 129 srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~--S~V~sVa~S~--rl 204 (1010)
| +|+.|.. +..+++||..+|++.+.++.. -.+.++++.| ..
T Consensus 282 -r-yLlaCg~---------------------------------~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~ 326 (519)
T KOG0293|consen 282 -R-YLLACGF---------------------------------DEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFR 326 (519)
T ss_pred -C-eEEecCc---------------------------------hHheeeccCCcchhhhhcccCcCCCcceeEEccCCce
Confidence 2 3343321 245999999999999888654 4677888877 34
Q ss_pred EEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEE-EccCC-eeeccCCccCCCcCCC
Q 001814 205 VAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLA-YASNT-LLLSNSGRLSPQNLTP 279 (1010)
Q Consensus 205 LAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLA-yas~~-~~iwd~G~vs~Q~lt~ 279 (1010)
+++ +.+.+|.-||+.... +.....-..| -...+|+.+ +|+- ...+. +.+.+.
T Consensus 327 ~V~Gs~dr~i~~wdlDgn~-~~~W~gvr~~------------~v~dlait~Dgk~vl~v~~d~~i~l~~~---------- 383 (519)
T KOG0293|consen 327 FVTGSPDRTIIMWDLDGNI-LGNWEGVRDP------------KVHDLAITYDGKYVLLVTVDKKIRLYNR---------- 383 (519)
T ss_pred eEecCCCCcEEEecCCcch-hhcccccccc------------eeEEEEEcCCCcEEEEEecccceeeech----------
Confidence 444 556789999976422 2222211111 012355544 3332 22211 111111
Q ss_pred CCCCCCcCCCCCceEEEeehhhhhhhhcccc---eeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEEC
Q 001814 280 SGVSPSTSPGGSSLVARYAMEHSKQFAAGLS---KTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDF 356 (1010)
Q Consensus 280 p~vS~stSP~~gslVa~~A~dssk~la~Gi~---ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl 356 (1010)
.|+ +..|++ +.+.. ++.+.+++.. ...-.+..|++||+
T Consensus 384 -----------------e~~-----~dr~lise~~~its-------------~~iS~d~k~~----LvnL~~qei~LWDl 424 (519)
T KOG0293|consen 384 -----------------EAR-----VDRGLISEEQPITS-------------FSISKDGKLA----LVNLQDQEIHLWDL 424 (519)
T ss_pred -----------------hhh-----hhhccccccCceeE-------------EEEcCCCcEE----EEEcccCeeEEeec
Confidence 000 000110 00111 1111112211 11235788999999
Q ss_pred CCCcEEEEeccCCC--CeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccccccc
Q 001814 357 VTRAIISQFKAHTS--PISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSAT 433 (1010)
Q Consensus 357 ~s~~~v~~~~aHts--pIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~ 433 (1010)
+....+..+.+|+. -|-.-||-- +.+++|+||+|+. |+||+-.. | .++-+| .|+. ..
T Consensus 425 ~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~k-vyIWhr~s-------g----------kll~~L-sGHs-~~ 484 (519)
T KOG0293|consen 425 EENKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSEDSK-VYIWHRIS-------G----------KLLAVL-SGHS-KT 484 (519)
T ss_pred chhhHHHHhhcccccceEEEeccCCCCcceEEecCCCce-EEEEEccC-------C----------ceeEee-cCCc-ce
Confidence 99888999999975 354557864 4489999999665 99999753 5 466666 5655 45
Q ss_pred EEEEEEccCC-CEEEEEeCCCeEEEEeCCCC
Q 001814 434 IQDICFSHYS-QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 434 I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~ 463 (1010)
|.+++|+|-. .++|++|+||||+||...+.
T Consensus 485 vNcVswNP~~p~m~ASasDDgtIRIWg~~~~ 515 (519)
T KOG0293|consen 485 VNCVSWNPADPEMFASASDDGTIRIWGPSDN 515 (519)
T ss_pred eeEEecCCCCHHHhhccCCCCeEEEecCCcc
Confidence 9999999954 78999999999999998765
No 71
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.53 E-value=7.2e-12 Score=138.43 Aligned_cols=315 Identities=16% Similarity=0.188 Sum_probs=191.0
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
+..|++.|.-+..+ |..+++++|.. +---||++. .+++.-.+-+|...|.++.|+- ..
T Consensus 60 F~~H~~svFavsl~-------P~~~l~aTGGgDD~AflW~~~-~ge~~~eltgHKDSVt~~~Fsh-------------dg 118 (399)
T KOG0296|consen 60 FDKHTDSVFAVSLH-------PNNNLVATGGGDDLAFLWDIS-TGEFAGELTGHKDSVTCCSFSH-------------DG 118 (399)
T ss_pred hhhcCCceEEEEeC-------CCCceEEecCCCceEEEEEcc-CCcceeEecCCCCceEEEEEcc-------------Cc
Confidence 45577777766654 24567777765 456899995 5667777788999999999773 33
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAV 207 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV 207 (1010)
.|||. || .+++|+||...++.....+... +.|-=+.+.| ++|+.
T Consensus 119 tlLAT--Gd-------------------------------msG~v~v~~~stg~~~~~~~~e~~dieWl~WHp~a~illA 165 (399)
T KOG0296|consen 119 TLLAT--GD-------------------------------MSGKVLVFKVSTGGEQWKLDQEVEDIEWLKWHPRAHILLA 165 (399)
T ss_pred eEEEe--cC-------------------------------CCccEEEEEcccCceEEEeecccCceEEEEecccccEEEe
Confidence 57663 32 1478999999999988887543 3455567777 46665
Q ss_pred Ee-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEE-ccce-EEEccCCeeeccC--Ccc-----CCCcC
Q 001814 208 GL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAV-GPRW-LAYASNTLLLSNS--GRL-----SPQNL 277 (1010)
Q Consensus 208 ~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAl-gpRw-LAyas~~~~iwd~--G~v-----s~Q~l 277 (1010)
|. ++.|.+|.+......+.+.++..|-. .|.+.= |-|. -+|....+++|+- |.. +-+++
T Consensus 166 G~~DGsvWmw~ip~~~~~kv~~Gh~~~ct-----------~G~f~pdGKr~~tgy~dgti~~Wn~ktg~p~~~~~~~e~~ 234 (399)
T KOG0296|consen 166 GSTDGSVWMWQIPSQALCKVMSGHNSPCT-----------CGEFIPDGKRILTGYDDGTIIVWNPKTGQPLHKITQAEGL 234 (399)
T ss_pred ecCCCcEEEEECCCcceeeEecCCCCCcc-----------cccccCCCceEEEEecCceEEEEecCCCceeEEecccccC
Confidence 54 56899999988666666666554310 011100 1132 2466667888873 311 00111
Q ss_pred CCCCCCCCcCCCCCceEEEeehhhhhhhhcc-cceeeccc---cccccCC-----CCCCCccCCCccccccccc-cccCC
Q 001814 278 TPSGVSPSTSPGGSSLVARYAMEHSKQFAAG-LSKTLSKY---CQELLPD-----GSSSPVSPNSVWKVGRHAG-ADMDN 347 (1010)
Q Consensus 278 t~p~vS~stSP~~gslVa~~A~dssk~la~G-i~ktls~y---~~~l~p~-----gs~s~~S~s~~~k~~~~~i-asgs~ 347 (1010)
..|.++. + ..+.++....++....+..+ .-|.+.-. ...+.+. .+...+.++. .+++ +.+.-
T Consensus 235 ~~~~~~~--~-~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve~~~~ss-----~lpL~A~G~v 306 (399)
T KOG0296|consen 235 ELPCISL--N-LAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVESIPSSS-----KLPLAACGSV 306 (399)
T ss_pred cCCcccc--c-cccceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhhhccccc-----ccchhhcccc
Confidence 1111111 0 11122221111111111111 00111100 0011110 0000011111 1222 46788
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
||+|.|||+...++ +++-.|..+|..|.|-+ -.+|+||+.+|. ||.||..+ | ++++++ +
T Consensus 307 dG~i~iyD~a~~~~-R~~c~he~~V~~l~w~~-t~~l~t~c~~g~-v~~wDaRt-------G----------~l~~~y-~ 365 (399)
T KOG0296|consen 307 DGTIAIYDLAASTL-RHICEHEDGVTKLKWLN-TDYLLTACANGK-VRQWDART-------G----------QLKFTY-T 365 (399)
T ss_pred cceEEEEecccchh-heeccCCCceEEEEEcC-cchheeeccCce-EEeeeccc-------c----------ceEEEE-e
Confidence 99999999987664 45556999999999999 678888888775 99999864 6 577776 5
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
|| ...|++++.+||.++|+++|.|+|.+||.+.
T Consensus 366 GH-~~~Il~f~ls~~~~~vvT~s~D~~a~VF~v~ 398 (399)
T KOG0296|consen 366 GH-QMGILDFALSPQKRLVVTVSDDNTALVFEVP 398 (399)
T ss_pred cC-chheeEEEEcCCCcEEEEecCCCeEEEEecC
Confidence 75 4679999999999999999999999999875
No 72
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.52 E-value=4.1e-13 Score=159.95 Aligned_cols=197 Identities=17% Similarity=0.153 Sum_probs=127.0
Q ss_pred CCccccccCCcCCCC-CCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC---Ce-EEEEeCCeEEEEECCCCceeE
Q 001814 151 SHLGGVRDGMMDSQS-GNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP---RI-VAVGLATQIYCFDALTLENKF 225 (1010)
Q Consensus 151 ~~~~~vr~gs~d~~~-~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~---rl-LAV~ld~~I~IwD~~Tle~l~ 225 (1010)
+|+..+-|-+|...+ --....|+|||||++...+|++++.+..-|.+|+|+| ++ |..++|++|+||++..-+...
T Consensus 367 GHt~DILDlSWSKn~fLLSSSMDKTVRLWh~~~~~CL~~F~HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~Vv~ 446 (712)
T KOG0283|consen 367 GHTADILDLSWSKNNFLLSSSMDKTVRLWHPGRKECLKVFSHNDFVTCVAFNPVDDRYFISGSLDGKVRLWSISDKKVVD 446 (712)
T ss_pred ccchhheecccccCCeeEeccccccEEeecCCCcceeeEEecCCeeEEEEecccCCCcEeecccccceEEeecCcCeeEe
Confidence 344445555663211 1123358999999999999999999999999999999 44 455789999999987644321
Q ss_pred EEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhh
Q 001814 226 SVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQF 305 (1010)
Q Consensus 226 tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~l 305 (1010)
...... -+.+..++|+|+..
T Consensus 447 ---W~Dl~~---------------------------------------------lITAvcy~PdGk~a------------ 466 (712)
T KOG0283|consen 447 ---WNDLRD---------------------------------------------LITAVCYSPDGKGA------------ 466 (712)
T ss_pred ---ehhhhh---------------------------------------------hheeEEeccCCceE------------
Confidence 111100 00001112222211
Q ss_pred hcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec--cC------CCCeEEEEE
Q 001814 306 AAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK--AH------TSPISALCF 377 (1010)
Q Consensus 306 a~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~--aH------tspIsaLaF 377 (1010)
+-|...|.+++|+....+....+. -| ...|+.+.|
T Consensus 467 -------------------------------------vIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~ 509 (712)
T KOG0283|consen 467 -------------------------------------VIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQF 509 (712)
T ss_pred -------------------------------------EEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEe
Confidence 124567888888888776554332 22 128999999
Q ss_pred CCCCC--EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccc-cEEEEEEccCCCEEEEEeCCCe
Q 001814 378 DPSGT--LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSA-TIQDICFSHYSQWIAIVSSKGT 454 (1010)
Q Consensus 378 SPdGt--lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a-~I~sIAFSpDg~~LAsgS~dGT 454 (1010)
.|... +|+|. .| .-|||||.... ..+.+| .|+.+. .=...+|+.||++|+++|+|.-
T Consensus 510 ~p~~~~~vLVTS-nD-SrIRI~d~~~~-----------------~lv~Kf-KG~~n~~SQ~~Asfs~Dgk~IVs~seDs~ 569 (712)
T KOG0283|consen 510 FPGDPDEVLVTS-ND-SRIRIYDGRDK-----------------DLVHKF-KGFRNTSSQISASFSSDGKHIVSASEDSW 569 (712)
T ss_pred cCCCCCeEEEec-CC-CceEEEeccch-----------------hhhhhh-cccccCCcceeeeEccCCCEEEEeecCce
Confidence 87554 66654 43 45999998431 234444 344443 2345789999999999999999
Q ss_pred EEEEeCCCCC
Q 001814 455 CHVFVLSPFG 464 (1010)
Q Consensus 455 VhIw~I~~~g 464 (1010)
|+||++....
T Consensus 570 VYiW~~~~~~ 579 (712)
T KOG0283|consen 570 VYIWKNDSFN 579 (712)
T ss_pred EEEEeCCCCc
Confidence 9999986654
No 73
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.52 E-value=3.8e-12 Score=136.39 Aligned_cols=205 Identities=20% Similarity=0.257 Sum_probs=150.6
Q ss_pred ceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEE
Q 001814 97 FNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVR 176 (1010)
Q Consensus 97 v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVr 176 (1010)
....+++|.+.|..+++-|.- +.+||.++ .+++||
T Consensus 6 ~~~~~~gh~~r~W~~awhp~~------------g~ilAscg---------------------------------~Dk~vr 40 (312)
T KOG0645|consen 6 LEQKLSGHKDRVWSVAWHPGK------------GVILASCG---------------------------------TDKAVR 40 (312)
T ss_pred eEEeecCCCCcEEEEEeccCC------------ceEEEeec---------------------------------CCceEE
Confidence 346678888999999988641 23667532 148999
Q ss_pred EEeCCC---CeEEEEEe-CC-CcEEEEEEcC--CeEEEE-eCCeEEEEECC--CCceeEEEeecCCccccCCCccccccC
Q 001814 177 FYSFQS---HCYEHVLR-FR-SSVCMVRCSP--RIVAVG-LATQIYCFDAL--TLENKFSVLTYPVPQLAGQGAVGINVG 246 (1010)
Q Consensus 177 IWDlkt---ge~V~tL~-f~-S~V~sVa~S~--rlLAV~-ld~~I~IwD~~--Tle~l~tL~t~p~p~~~~~g~~~vnv~ 246 (1010)
+|++.. -.+...|. +| -.|++|+++| ++||++ +|.++.||.-. +++++.+|.+|.+.
T Consensus 41 iw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~efecv~~lEGHEnE------------- 107 (312)
T KOG0645|consen 41 IWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKEDGEFECVATLEGHENE------------- 107 (312)
T ss_pred EEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCCCceeEEeeeeccccc-------------
Confidence 999984 44555563 44 4899999988 788875 57789999644 45777777777662
Q ss_pred ccceEEcc--ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCC
Q 001814 247 YGPMAVGP--RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDG 324 (1010)
Q Consensus 247 ~gplAlgp--RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~g 324 (1010)
.-.+|++. ++|
T Consensus 108 VK~Vaws~sG~~L------------------------------------------------------------------- 120 (312)
T KOG0645|consen 108 VKCVAWSASGNYL------------------------------------------------------------------- 120 (312)
T ss_pred eeEEEEcCCCCEE-------------------------------------------------------------------
Confidence 12233322 222
Q ss_pred CCCCccCCCccccccccccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 325 SSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR---AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 325 s~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~---~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
++++.|..|=||.+..+ .+++.|+.|+.-|--+.|.|.-.+|+++|-|. +|++|+-.+
T Consensus 121 ------------------ATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDn-TIk~~~~~~ 181 (312)
T KOG0645|consen 121 ------------------ATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDN-TIKVYRDED 181 (312)
T ss_pred ------------------EEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccCC-eEEEEeecC
Confidence 24556777778887643 47899999999999999999999999999954 599998753
Q ss_pred CcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 402 SCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 402 ~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
. .+| .++.+| .|+++ .|++++|.+.|..|+++++|+||.||..-
T Consensus 182 d----------ddW----~c~~tl-~g~~~-TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~ 225 (312)
T KOG0645|consen 182 D----------DDW----ECVQTL-DGHEN-TVWSLAFDNIGSRLVSCSDDGTVSIWRLY 225 (312)
T ss_pred C----------CCe----eEEEEe-cCccc-eEEEEEecCCCceEEEecCCcceEeeeec
Confidence 1 123 356666 45443 79999999999999999999999999954
No 74
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.50 E-value=2.9e-13 Score=157.16 Aligned_cols=240 Identities=18% Similarity=0.155 Sum_probs=162.4
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCc-----ceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASN-----FNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAP 146 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~-----v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~ 146 (1010)
+.++.|.+|..+| +++|++....+ ....++.|..||..+.+--++ ..|..+
T Consensus 35 ~~~ryLfTgGRDg~i~~W~~~~d~~~~s~~~~asme~HsDWVNDiiL~~~~-------------~tlIS~---------- 91 (735)
T KOG0308|consen 35 PNGRYLFTGGRDGIIRLWSVTQDSNEPSTPYIASMEHHSDWVNDIILCGNG-------------KTLISA---------- 91 (735)
T ss_pred CCCceEEecCCCceEEEeccccccCCcccchhhhhhhhHhHHhhHHhhcCC-------------CceEEe----------
Confidence 3566788888877 79999964333 234456677888876654222 122322
Q ss_pred CCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCC--eEEEEEeCC-CcEEEEEE-c--CCeEEE-EeCCeEEEEECC
Q 001814 147 GQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSH--CYEHVLRFR-SSVCMVRC-S--PRIVAV-GLATQIYCFDAL 219 (1010)
Q Consensus 147 ~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktg--e~V~tL~f~-S~V~sVa~-S--~rlLAV-~ld~~I~IwD~~ 219 (1010)
+++++|++|+...+ -|..+|+-| ..|.+++. - ..++|. |++.+|.+||+.
T Consensus 92 -----------------------SsDtTVK~W~~~~~~~~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin 148 (735)
T KOG0308|consen 92 -----------------------SSDTTVKVWNAHKDNTFCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDIN 148 (735)
T ss_pred -----------------------cCCceEEEeecccCcchhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEcc
Confidence 24689999999876 577788654 68888877 2 234443 678899999998
Q ss_pred CCce--eEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEe
Q 001814 220 TLEN--KFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARY 297 (1010)
Q Consensus 220 Tle~--l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~ 297 (1010)
++.. +.+....+. +.+..||+-
T Consensus 149 ~~~~~l~~s~n~~t~---------------~sl~sG~k~----------------------------------------- 172 (735)
T KOG0308|consen 149 TGTATLVASFNNVTV---------------NSLGSGPKD----------------------------------------- 172 (735)
T ss_pred Ccchhhhhhcccccc---------------ccCCCCCcc-----------------------------------------
Confidence 7632 111111100 000001100
Q ss_pred ehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE
Q 001814 298 AMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCF 377 (1010)
Q Consensus 298 A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaF 377 (1010)
+.|.-.+ |+.+ ..++.|+..+.+++||-.+++.+..|++|+..|.+|-.
T Consensus 173 ----------------siYSLA~-----------N~t~----t~ivsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~ 221 (735)
T KOG0308|consen 173 ----------------SIYSLAM-----------NQTG----TIIVSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLV 221 (735)
T ss_pred ----------------ceeeeec-----------CCcc----eEEEecCcccceEEeccccccceeeeeccccceEEEEE
Confidence 0010000 0000 12346788899999999999999999999999999999
Q ss_pred CCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEE
Q 001814 378 DPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHV 457 (1010)
Q Consensus 378 SPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhI 457 (1010)
++||+.++|||.||+ ||+||+.- .+++.++.- +...||++.-+|+=+.+.+|+.||.|..
T Consensus 222 ~dDGt~~ls~sSDgt-IrlWdLgq-----------------QrCl~T~~v--H~e~VWaL~~~~sf~~vYsG~rd~~i~~ 281 (735)
T KOG0308|consen 222 NDDGTRLLSASSDGT-IRLWDLGQ-----------------QRCLATYIV--HKEGVWALQSSPSFTHVYSGGRDGNIYR 281 (735)
T ss_pred cCCCCeEeecCCCce-EEeeeccc-----------------cceeeeEEe--ccCceEEEeeCCCcceEEecCCCCcEEe
Confidence 999999999999886 99999932 156666643 2345999999999999999999999988
Q ss_pred EeCCCCCC
Q 001814 458 FVLSPFGG 465 (1010)
Q Consensus 458 w~I~~~gg 465 (1010)
=++..+..
T Consensus 282 Tdl~n~~~ 289 (735)
T KOG0308|consen 282 TDLRNPAK 289 (735)
T ss_pred cccCCchh
Confidence 88877533
No 75
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.49 E-value=1.6e-12 Score=147.97 Aligned_cols=112 Identities=16% Similarity=0.220 Sum_probs=85.1
Q ss_pred cccccCCCCeEEEEECCCCc-EEEEec-----cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccc
Q 001814 341 AGADMDNAGIVVVKDFVTRA-IISQFK-----AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYD 414 (1010)
Q Consensus 341 ~iasgs~dG~V~VwDl~s~~-~v~~~~-----aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~ 414 (1010)
.+.+++.||+++|||+..-+ .+..|+ +-.-+++..+|+|||+++|+|..||. |.+|+...
T Consensus 283 ~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~iAagc~DGS-IQ~W~~~~------------- 348 (641)
T KOG0772|consen 283 EFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKLIAAGCLDGS-IQIWDKGS------------- 348 (641)
T ss_pred ceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcchhhhcccCCc-eeeeecCC-------------
Confidence 35678999999999998633 233332 23458899999999999999999886 99999721
Q ss_pred cCCcceEEEEEecccccc-cEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccc
Q 001814 415 WNSSHVHLYKLHRGITSA-TIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 415 ~~~s~~~L~~L~RG~t~a-~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~ 468 (1010)
|. ++..+..+..|... .|.+|+||+||++|++=+.|+|++||+|..+.....
T Consensus 349 ~~--v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~ 401 (641)
T KOG0772|consen 349 RT--VRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLN 401 (641)
T ss_pred cc--cccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccccccchh
Confidence 11 12344444444332 799999999999999999999999999998876543
No 76
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.49 E-value=1.4e-11 Score=146.60 Aligned_cols=120 Identities=18% Similarity=0.252 Sum_probs=85.9
Q ss_pred cCCCCeEEEEECCCCcEEEEe---ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccC--CCCCCc----c--
Q 001814 345 MDNAGIVVVKDFVTRAIISQF---KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRS--GSGNHK----Y-- 413 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~---~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~--~sG~~~----~-- 413 (1010)
|...|.|.+|++++|-....| ++|..+|..|+.+--+++++||+.+|. +..|+.....-.. ..|..- +
T Consensus 466 G~S~G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~D~~n~~~vsa~~~Gi-lkfw~f~~k~l~~~l~l~~~~~~iv~hr 544 (910)
T KOG1539|consen 466 GYSKGTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAVDGTNRLLVSAGADGI-LKFWDFKKKVLKKSLRLGSSITGIVYHR 544 (910)
T ss_pred eccCCeEEEEEcccCeeecccccCccccCceeEEEecCCCceEEEccCcce-EEEEecCCcceeeeeccCCCcceeeeee
Confidence 446899999999999888888 699999999999999999999999886 8999985421000 000000 0
Q ss_pred -------ccCCcceEEEE--------EecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 414 -------DWNSSHVHLYK--------LHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 414 -------~~~~s~~~L~~--------L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
..+.-.-.++. -.+|++ .+|++++|||||+||++++.|+||++|||......
T Consensus 545 ~s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~-nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lI 611 (910)
T KOG1539|consen 545 VSDLLAIALDDFSIRVVDVVTRKVVREFWGHG-NRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLI 611 (910)
T ss_pred hhhhhhhhcCceeEEEEEchhhhhhHHhhccc-cceeeeEeCCCCcEEEEeecCCcEEEEeccCccee
Confidence 00000001111 114654 68999999999999999999999999999765443
No 77
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.48 E-value=6.2e-12 Score=139.43 Aligned_cols=275 Identities=14% Similarity=0.159 Sum_probs=168.1
Q ss_pred CCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEE-EecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQ-MQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~-~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
.+.+|..+|++..+|||. .|+...++.+|.+++..+. +.+++.. .+++. ++
T Consensus 115 ~~~IltgsYDg~~riWd~--~Gk~~~~~~Ght~~ik~v~~v~~n~~~-----------~~fvs-as-------------- 166 (423)
T KOG0313|consen 115 SKWILTGSYDGTSRIWDL--KGKSIKTIVGHTGPIKSVAWVIKNSSS-----------CLFVS-AS-------------- 166 (423)
T ss_pred CceEEEeecCCeeEEEec--CCceEEEEecCCcceeeeEEEecCCcc-----------ceEEE-ec--------------
Confidence 356777778888999997 5889999999999999654 4555421 23332 21
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEE----eCC-CcEEEEEEcC--CeEE-EEeCCeEEEEECCCCcee
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVL----RFR-SSVCMVRCSP--RIVA-VGLATQIYCFDALTLENK 224 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL----~f~-S~V~sVa~S~--rlLA-V~ld~~I~IwD~~Tle~l 224 (1010)
.+.++++|-.+.++.+... +.| ..|-+|+.++ .+++ .+-|..|.||+..+ +..
T Consensus 167 ------------------~Dqtl~Lw~~~~~~~~~~~~~~~~GHk~~V~sVsv~~sgtr~~SgS~D~~lkiWs~~~-~~~ 227 (423)
T KOG0313|consen 167 ------------------MDQTLRLWKWNVGENKVKALKVCRGHKRSVDSVSVDSSGTRFCSGSWDTMLKIWSVET-DEE 227 (423)
T ss_pred ------------------CCceEEEEEecCchhhhhHHhHhcccccceeEEEecCCCCeEEeecccceeeecccCC-Ccc
Confidence 2479999999887754332 233 5888999866 3444 45577899999321 111
Q ss_pred EEEeecCCc-----cccCC-C--cc----ccc-cCccceEEccceEEEccC---CeeeccC--CccCCCcCCCCCCCCCc
Q 001814 225 FSVLTYPVP-----QLAGQ-G--AV----GIN-VGYGPMAVGPRWLAYASN---TLLLSNS--GRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 225 ~tL~t~p~p-----~~~~~-g--~~----~vn-v~~gplAlgpRwLAyas~---~~~iwd~--G~vs~Q~lt~p~vS~st 286 (1010)
-++....+. ...++ + .+ ..+ -....+-+++.-.+|+.+ ++..||. |+......+
T Consensus 228 ~~~E~~s~~rrk~~~~~~~~~~r~P~vtl~GHt~~Vs~V~w~d~~v~yS~SwDHTIk~WDletg~~~~~~~~-------- 299 (423)
T KOG0313|consen 228 DELESSSNRRRKKQKREKEGGTRTPLVTLEGHTEPVSSVVWSDATVIYSVSWDHTIKVWDLETGGLKSTLTT-------- 299 (423)
T ss_pred ccccccchhhhhhhhhhhcccccCceEEecccccceeeEEEcCCCceEeecccceEEEEEeecccceeeeec--------
Confidence 111111110 00000 0 00 000 001112333344455543 3455663 211000000
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc---EEE
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA---IIS 363 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~---~v~ 363 (1010)
+ |.| .|-... +. . ..++.++.|..++|||-.++. +..
T Consensus 300 ---~--------------------ksl--~~i~~~-----------~~---~-~Ll~~gssdr~irl~DPR~~~gs~v~~ 339 (423)
T KOG0313|consen 300 ---N--------------------KSL--NCISYS-----------PL---S-KLLASGSSDRHIRLWDPRTGDGSVVSQ 339 (423)
T ss_pred ---C--------------------cce--eEeecc-----------cc---c-ceeeecCCCCceeecCCCCCCCceeEE
Confidence 0 000 000000 00 0 124578899999999998754 557
Q ss_pred EeccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccC
Q 001814 364 QFKAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 364 ~~~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
+|.+|+.-|+++.++|... +|+++|.|++ +++||+... ...||.+.+ +..+|.++.|+ +
T Consensus 340 s~~gH~nwVssvkwsp~~~~~~~S~S~D~t-~klWDvRS~----------------k~plydI~~--h~DKvl~vdW~-~ 399 (423)
T KOG0313|consen 340 SLIGHKNWVSSVKWSPTNEFQLVSGSYDNT-VKLWDVRST----------------KAPLYDIAG--HNDKVLSVDWN-E 399 (423)
T ss_pred eeecchhhhhheecCCCCceEEEEEecCCe-EEEEEeccC----------------CCcceeecc--CCceEEEEecc-C
Confidence 8999999999999999875 6788888665 999999631 127899876 35689999998 7
Q ss_pred CCEEEEEeCCCeEEEEeCCCC
Q 001814 443 SQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 443 g~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+..|++|+.|.+++||.-.+.
T Consensus 400 ~~~IvSGGaD~~l~i~~~~~~ 420 (423)
T KOG0313|consen 400 GGLIVSGGADNKLRIFKGSPI 420 (423)
T ss_pred CceEEeccCcceEEEeccccc
Confidence 889999999999999986553
No 78
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.47 E-value=9e-12 Score=148.12 Aligned_cols=97 Identities=23% Similarity=0.340 Sum_probs=81.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
+.+..|-.|+|+|..+.+++..|.+|+..|++++|||||++|++|+.|++ ||+||+.+ | .++
T Consensus 550 a~~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrWlisasmD~t-Ir~wDlpt-------~----------~lI 611 (910)
T KOG1539|consen 550 AIALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRWLISASMDST-IRTWDLPT-------G----------TLI 611 (910)
T ss_pred hhhcCceeEEEEEchhhhhhHHhhccccceeeeEeCCCCcEEEEeecCCc-EEEEeccC-------c----------cee
Confidence 34566788999999999999999999999999999999999999999776 99999953 3 355
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeCC-CeEEEEeC
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSSK-GTCHVFVL 460 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I 460 (1010)
--+. ......+++|||.|.+||++..| .-|.+|.=
T Consensus 612 D~~~---vd~~~~sls~SPngD~LAT~Hvd~~gIylWsN 647 (910)
T KOG1539|consen 612 DGLL---VDSPCTSLSFSPNGDFLATVHVDQNGIYLWSN 647 (910)
T ss_pred eeEe---cCCcceeeEECCCCCEEEEEEecCceEEEEEc
Confidence 5443 23467899999999999999998 45899973
No 79
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.46 E-value=6.5e-12 Score=141.71 Aligned_cols=111 Identities=17% Similarity=0.337 Sum_probs=84.6
Q ss_pred ccccCCCCeEEEEECCCCc-EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCcc-c--cC
Q 001814 342 GADMDNAGIVVVKDFVTRA-IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKY-D--WN 416 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~-~--~~ 416 (1010)
+|+++.|++|.+||+.+.. .+.+|.+|...|..|.|||.- +.|||++.|++ ++|||+... |.... + -.
T Consensus 288 lAT~S~D~tV~LwDlRnL~~~lh~~e~H~dev~~V~WSPh~etvLASSg~D~r-l~vWDls~i------g~eq~~eda~d 360 (422)
T KOG0264|consen 288 LATGSADKTVALWDLRNLNKPLHTFEGHEDEVFQVEWSPHNETVLASSGTDRR-LNVWDLSRI------GEEQSPEDAED 360 (422)
T ss_pred EEeccCCCcEEEeechhcccCceeccCCCcceEEEEeCCCCCceeEecccCCc-EEEEecccc------ccccChhhhcc
Confidence 4567889999999999854 678999999999999999976 68999999776 899999642 21111 0 00
Q ss_pred CcceEEEEEecccccccEEEEEEccCCCEE-EEEeCCCeEEEEeCC
Q 001814 417 SSHVHLYKLHRGITSATIQDICFSHYSQWI-AIVSSKGTCHVFVLS 461 (1010)
Q Consensus 417 ~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L-AsgS~dGTVhIw~I~ 461 (1010)
....+|+ .++|++ +.|.+++|.|.--|+ |++++|+.+|||+..
T Consensus 361 gppEllF-~HgGH~-~kV~DfsWnp~ePW~I~SvaeDN~LqIW~~s 404 (422)
T KOG0264|consen 361 GPPELLF-IHGGHT-AKVSDFSWNPNEPWTIASVAEDNILQIWQMA 404 (422)
T ss_pred CCcceeE-EecCcc-cccccccCCCCCCeEEEEecCCceEEEeecc
Confidence 1113344 467865 679999999998765 568889999999986
No 80
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.45 E-value=6.9e-13 Score=140.55 Aligned_cols=279 Identities=14% Similarity=0.106 Sum_probs=185.1
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
.+|+-.|....|..+. +++-+|+.+..+|--+..-.++|...-++.+|.|.|....+..+. +
T Consensus 11 ~ghtrpvvdl~~s~it----p~g~flisa~kd~~pmlr~g~tgdwigtfeghkgavw~~~l~~na--------------~ 72 (334)
T KOG0278|consen 11 HGHTRPVVDLAFSPIT----PDGYFLISASKDGKPMLRNGDTGDWIGTFEGHKGAVWSATLNKNA--------------T 72 (334)
T ss_pred cCCCcceeEEeccCCC----CCceEEEEeccCCCchhccCCCCCcEEeeeccCcceeeeecCchh--------------h
Confidence 4677778888886432 456788888888853333346788899999999999988766432 2
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEE-Ee
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAV-GL 209 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV-~l 209 (1010)
.|. + ++.+-+-++||.-+|..++++++.--|.+++|+. +.|+. +.
T Consensus 73 ~aa-s-------------------------------aaadftakvw~a~tgdelhsf~hkhivk~~af~~ds~~lltgg~ 120 (334)
T KOG0278|consen 73 RAA-S-------------------------------AAADFTAKVWDAVTGDELHSFEHKHIVKAVAFSQDSNYLLTGGQ 120 (334)
T ss_pred hhh-h-------------------------------hcccchhhhhhhhhhhhhhhhhhhheeeeEEecccchhhhccch
Confidence 221 0 1235678999999999999999999999999987 34554 55
Q ss_pred CCeEEEEECCCCce-eEEEeecCCccccCCCccccccCccceEEccc-eEEEcc-CCeeeccC--CccCCCcCC-C-CCC
Q 001814 210 ATQIYCFDALTLEN-KFSVLTYPVPQLAGQGAVGINVGYGPMAVGPR-WLAYAS-NTLLLSNS--GRLSPQNLT-P-SGV 282 (1010)
Q Consensus 210 d~~I~IwD~~Tle~-l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpR-wLAyas-~~~~iwd~--G~vs~Q~lt-~-p~v 282 (1010)
+.-++|||+...+- ...+..|+.. + ..-.++.+-+ .|..+. .++++||. |.. .|.+. + ++.
T Consensus 121 ekllrvfdln~p~App~E~~ghtg~---------I--r~v~wc~eD~~iLSSadd~tVRLWD~rTgt~-v~sL~~~s~Vt 188 (334)
T KOG0278|consen 121 EKLLRVFDLNRPKAPPKEISGHTGG---------I--RTVLWCHEDKCILSSADDKTVRLWDHRTGTE-VQSLEFNSPVT 188 (334)
T ss_pred HHHhhhhhccCCCCCchhhcCCCCc---------c--eeEEEeccCceEEeeccCCceEEEEeccCcE-EEEEecCCCCc
Confidence 56689999876542 2233333321 1 0111222223 333232 35788984 211 12211 1 111
Q ss_pred CCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEE
Q 001814 283 SPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAII 362 (1010)
Q Consensus 283 S~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v 362 (1010)
|.-. ++++.+ .+....+.|..||..+..++
T Consensus 189 SlEv---------------------------------------------s~dG~i-----lTia~gssV~Fwdaksf~~l 218 (334)
T KOG0278|consen 189 SLEV---------------------------------------------SQDGRI-----LTIAYGSSVKFWDAKSFGLL 218 (334)
T ss_pred ceee---------------------------------------------ccCCCE-----EEEecCceeEEeccccccce
Confidence 1111 111221 23345678999999999988
Q ss_pred EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccC
Q 001814 363 SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 363 ~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
..++.. ..|.+.+++|+-..++.+++++. ++.||..+ | ..+-.+..|+. ..|.++.||||
T Consensus 219 Ks~k~P-~nV~SASL~P~k~~fVaGged~~-~~kfDy~T-------g----------eEi~~~nkgh~-gpVhcVrFSPd 278 (334)
T KOG0278|consen 219 KSYKMP-CNVESASLHPKKEFFVAGGEDFK-VYKFDYNT-------G----------EEIGSYNKGHF-GPVHCVRFSPD 278 (334)
T ss_pred eeccCc-cccccccccCCCceEEecCcceE-EEEEeccC-------C----------ceeeecccCCC-CceEEEEECCC
Confidence 877754 46888999999999999999765 78888753 4 23333346654 57999999999
Q ss_pred CCEEEEEeCCCeEEEEeCCCC
Q 001814 443 SQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 443 g~~LAsgS~dGTVhIw~I~~~ 463 (1010)
|..-|+||.||||+||...+.
T Consensus 279 GE~yAsGSEDGTirlWQt~~~ 299 (334)
T KOG0278|consen 279 GELYASGSEDGTIRLWQTTPG 299 (334)
T ss_pred CceeeccCCCceEEEEEecCC
Confidence 999999999999999998864
No 81
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.43 E-value=1.3e-11 Score=134.37 Aligned_cols=281 Identities=19% Similarity=0.191 Sum_probs=162.7
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeec----cCCEEEEEEecCCCCCCCCCCcc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKR----DGPVSFLQMQPFPVKDDGCEGFR 127 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~h----dGpV~~v~~lP~p~~s~~~D~F~ 127 (1010)
-+|++.|..+.|-. +++.|++...++ |+||++.+-.+ ++--..| -+--..|.|.|+-.
T Consensus 83 KgH~~~vt~~~FsS-------dGK~lat~~~Dr~Ir~w~~~DF~~-~eHr~~R~nve~dhpT~V~FapDc~--------- 145 (420)
T KOG2096|consen 83 KGHKKEVTDVAFSS-------DGKKLATISGDRSIRLWDVRDFEN-KEHRCIRQNVEYDHPTRVVFAPDCK--------- 145 (420)
T ss_pred hccCCceeeeEEcC-------CCceeEEEeCCceEEEEecchhhh-hhhhHhhccccCCCceEEEECCCcc---------
Confidence 46888899888863 667788877766 89999965211 1100001 02334566666541
Q ss_pred ccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCC---CeEEE------EEeCC----Cc
Q 001814 128 KLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQS---HCYEH------VLRFR----SS 194 (1010)
Q Consensus 128 ~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlkt---ge~V~------tL~f~----S~ 194 (1010)
.+ +|.+. ..+++++|-+.. |..-+ .+.|+ -.
T Consensus 146 ---s~-vv~~~--------------------------------~g~~l~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~ 189 (420)
T KOG2096|consen 146 ---SV-VVSVK--------------------------------RGNKLCVYKLVKKTDGSGSHHFVHIDNLEFERKHQVD 189 (420)
T ss_pred ---eE-EEEEc--------------------------------cCCEEEEEEeeecccCCCCcccccccccccchhcccc
Confidence 12 22111 014677775532 22211 12233 13
Q ss_pred EE--EEEEcCCeEEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEccCC--eeec
Q 001814 195 VC--MVRCSPRIVAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYASNT--LLLS 267 (1010)
Q Consensus 195 V~--sVa~S~rlLAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~~--~~iw 267 (1010)
|. .++=+.++|+.+ ++..|.+||+. ++.+.++.+.... .--.|++| |+||.++-+ +.+|
T Consensus 190 ~i~iGiA~~~k~imsas~dt~i~lw~lk-Gq~L~~idtnq~~-------------n~~aavSP~GRFia~~gFTpDVkVw 255 (420)
T KOG2096|consen 190 IINIGIAGNAKYIMSASLDTKICLWDLK-GQLLQSIDTNQSS-------------NYDAAVSPDGRFIAVSGFTPDVKVW 255 (420)
T ss_pred eEEEeecCCceEEEEecCCCcEEEEecC-Cceeeeecccccc-------------ccceeeCCCCcEEEEecCCCCceEE
Confidence 43 444455777765 56689999998 8899999876541 23467888 999988864 4566
Q ss_pred cCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCC
Q 001814 268 NSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDN 347 (1010)
Q Consensus 268 d~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~ 347 (1010)
.. -|.+|...+=+.-+ -.|.++...++ .+.|+++.. ..++.+.
T Consensus 256 E~--------------------------~f~kdG~fqev~rv-f~LkGH~saV~------~~aFsn~S~----r~vtvSk 298 (420)
T KOG2096|consen 256 EP--------------------------IFTKDGTFQEVKRV-FSLKGHQSAVL------AAAFSNSST----RAVTVSK 298 (420)
T ss_pred EE--------------------------EeccCcchhhhhhh-heeccchhhee------eeeeCCCcc----eeEEEec
Confidence 41 01111110000000 12223322211 122222211 2357789
Q ss_pred CCeEEEEECCC-------CcEEEEe----cc-CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcccc
Q 001814 348 AGIVVVKDFVT-------RAIISQF----KA-HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDW 415 (1010)
Q Consensus 348 dG~V~VwDl~s-------~~~v~~~----~a-HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~ 415 (1010)
||.++|||+.- .+.+..+ .+ -..|+ .|+++|+|+.||.+. |+.|++|...+ |
T Consensus 299 DG~wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~-RL~lsP~g~~lA~s~--gs~l~~~~se~-------g------ 362 (420)
T KOG2096|consen 299 DGKWRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPV-RLELSPSGDSLAVSF--GSDLKVFASED-------G------ 362 (420)
T ss_pred CCcEEEeeccceEecCCCchHhhcCCcchhhcCCCce-EEEeCCCCcEEEeec--CCceEEEEccc-------C------
Confidence 99999999852 2333333 22 23455 899999999998766 68899998764 4
Q ss_pred CCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 416 NSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 416 ~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
..+-++.+- +...|.+|+|++||+++|+++++ -++|+.
T Consensus 363 ----~~~~~~e~~-h~~~Is~is~~~~g~~~atcGdr-~vrv~~ 400 (420)
T KOG2096|consen 363 ----KDYPELEDI-HSTTISSISYSSDGKYIATCGDR-YVRVIR 400 (420)
T ss_pred ----ccchhHHHh-hcCceeeEEecCCCcEEeeecce-eeeeec
Confidence 233344443 44569999999999999999876 567765
No 82
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.41 E-value=4.8e-12 Score=147.73 Aligned_cols=218 Identities=17% Similarity=0.250 Sum_probs=142.3
Q ss_pred CEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC-----CeEEEEeCC-eEEEEECC-CCceeEEEeecCCccccCCCccccc
Q 001814 173 TAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP-----RIVAVGLAT-QIYCFDAL-TLENKFSVLTYPVPQLAGQGAVGIN 244 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~-S~V~sVa~S~-----rlLAV~ld~-~I~IwD~~-Tle~l~tL~t~p~p~~~~~g~~~vn 244 (1010)
+++|+|||..-++.+.++.| +.|+++.++. ++||.+..+ -|+|||+. ....+++|..|....
T Consensus 481 GnlrVy~Lq~l~~~~~~eAHesEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~rny~l~qtld~HSssI---------- 550 (1080)
T KOG1408|consen 481 GNLRVYDLQELEYTCFMEAHESEILCLEYSFPVLTNKLLASASRDRLIHVYDVKRNYDLVQTLDGHSSSI---------- 550 (1080)
T ss_pred CceEEEEehhhhhhhheecccceeEEEeecCchhhhHhhhhccCCceEEEEecccccchhhhhcccccce----------
Confidence 67999999999998888866 6999999975 688887765 59999976 456677777765410
Q ss_pred cCccceEE--cc---ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeecccccc
Q 001814 245 VGYGPMAV--GP---RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQE 319 (1010)
Q Consensus 245 v~~gplAl--gp---RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~ 319 (1010)
..+-| .. +.|....+..+..+.- |. + +.|.+ ...+. +++++..
T Consensus 551 ---TsvKFa~~gln~~MiscGADksimFr~~----qk----------~-~~g~~-----------f~r~t-~t~~ktT-- 598 (1080)
T KOG1408|consen 551 ---TSVKFACNGLNRKMISCGADKSIMFRVN----QK----------A-SSGRL-----------FPRHT-QTLSKTT-- 598 (1080)
T ss_pred ---eEEEEeecCCceEEEeccCchhhheehh----cc----------c-cCcee-----------ccccc-cccccce--
Confidence 11111 00 2222211111111000 00 0 00000 00000 1111110
Q ss_pred ccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc---CCCCeEEEEECCCCCEEEEEEcCCCeEEE
Q 001814 320 LLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA---HTSPISALCFDPSGTLLVTASVYGNNINI 396 (1010)
Q Consensus 320 l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a---HtspIsaLaFSPdGtlLATAS~dGt~IrV 396 (1010)
++ ....+|.|++ .+++.+|..|+|||+.+++.+..|++ |.+.+-.|..+|+|-+|||... ++++.+
T Consensus 599 lY------Dm~Vdp~~k~----v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScs-dktl~~ 667 (1080)
T KOG1408|consen 599 LY------DMAVDPTSKL----VVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCS-DKTLCF 667 (1080)
T ss_pred EE------EeeeCCCcce----EEEEecccceEEEeccccceeeeecccccCCCceEEEEECCCccEEEEeec-CCceEE
Confidence 01 1223344553 34678999999999999999999985 6677888999999999999988 456999
Q ss_pred EeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 397 FRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 397 wdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
||.-. | .++.+. .|+. -.|+.+-|++|.+.|.+.+.||-|.||.+..
T Consensus 668 ~Df~s-------g----------EcvA~m-~GHs-E~VTG~kF~nDCkHlISvsgDgCIFvW~lp~ 714 (1080)
T KOG1408|consen 668 VDFVS-------G----------ECVAQM-TGHS-EAVTGVKFLNDCKHLISVSGDGCIFVWKLPL 714 (1080)
T ss_pred EEecc-------c----------hhhhhh-cCcc-hheeeeeecccchhheeecCCceEEEEECch
Confidence 99753 4 344443 3543 4599999999999999999999999999976
No 83
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.41 E-value=4.1e-11 Score=128.08 Aligned_cols=222 Identities=14% Similarity=0.167 Sum_probs=156.8
Q ss_pred CeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCcc
Q 001814 75 KQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLG 154 (1010)
Q Consensus 75 ~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~ 154 (1010)
..+.++..+..+++||+- .+++...+....+.+ .+.+.|++ ..++++.
T Consensus 78 d~~atas~dk~ir~wd~r-~~k~~~~i~~~~eni-~i~wsp~g-------------~~~~~~~----------------- 125 (313)
T KOG1407|consen 78 DLFATASGDKTIRIWDIR-SGKCTARIETKGENI-NITWSPDG-------------EYIAVGN----------------- 125 (313)
T ss_pred cceEEecCCceEEEEEec-cCcEEEEeeccCcce-EEEEcCCC-------------CEEEEec-----------------
Confidence 344444445679999994 455555555444444 45667665 2455421
Q ss_pred ccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--C-eEEEEeCCeEEEEECCCCceeEEEeecC
Q 001814 155 GVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--R-IVAVGLATQIYCFDALTLENKFSVLTYP 231 (1010)
Q Consensus 155 ~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--r-lLAV~ld~~I~IwD~~Tle~l~tL~t~p 231 (1010)
.+..|.+.|.++.+.++..+|.-.+..+.++- + +++....+.|.|..-..++.+++|..||
T Consensus 126 ----------------kdD~it~id~r~~~~~~~~~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsypsLkpv~si~AH~ 189 (313)
T KOG1407|consen 126 ----------------KDDRITFIDARTYKIVNEEQFKFEVNEISWNNSNDLFFLTNGLGCVEILSYPSLKPVQSIKAHP 189 (313)
T ss_pred ----------------CcccEEEEEecccceeehhcccceeeeeeecCCCCEEEEecCCceEEEEeccccccccccccCC
Confidence 13579999999999999999988887777764 3 3344445788888877888888888887
Q ss_pred CccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccce
Q 001814 232 VPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSK 311 (1010)
Q Consensus 232 ~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~k 311 (1010)
. | .-.+.++| .|..
T Consensus 190 s-----------n--CicI~f~p----------------------------------~Gry------------------- 203 (313)
T KOG1407|consen 190 S-----------N--CICIEFDP----------------------------------DGRY------------------- 203 (313)
T ss_pred c-----------c--eEEEEECC----------------------------------CCce-------------------
Confidence 5 1 11222222 1111
Q ss_pred eeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC
Q 001814 312 TLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG 391 (1010)
Q Consensus 312 tls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG 391 (1010)
++.|+.|-.|.+||+...-++..|.-|.-||..|+||.||++||+||+ +
T Consensus 204 ------------------------------fA~GsADAlvSLWD~~ELiC~R~isRldwpVRTlSFS~dg~~lASaSE-D 252 (313)
T KOG1407|consen 204 ------------------------------FATGSADALVSLWDVDELICERCISRLDWPVRTLSFSHDGRMLASASE-D 252 (313)
T ss_pred ------------------------------EeeccccceeeccChhHhhhheeeccccCceEEEEeccCcceeeccCc-c
Confidence 123556778999999999899999999999999999999999999999 6
Q ss_pred CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCC---------CeEEEEeCC
Q 001814 392 NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSK---------GTCHVFVLS 461 (1010)
Q Consensus 392 t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~d---------GTVhIw~I~ 461 (1010)
+.|-|=++.+ | ..+++.. +.+.-..+||.|..-.||-+.+| |+|+||-++
T Consensus 253 h~IDIA~vet-------G----------d~~~eI~---~~~~t~tVAWHPk~~LLAyA~ddk~~d~~reag~vKiFG~~ 311 (313)
T KOG1407|consen 253 HFIDIAEVET-------G----------DRVWEIP---CEGPTFTVAWHPKRPLLAYACDDKDGDSNREAGTVKIFGLS 311 (313)
T ss_pred ceEEeEeccc-------C----------CeEEEee---ccCCceeEEecCCCceeeEEecCCCCccccccceeEEecCC
Confidence 7787776653 5 3566653 33456889999999999988765 577777654
No 84
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.41 E-value=8.8e-12 Score=142.05 Aligned_cols=105 Identities=20% Similarity=0.241 Sum_probs=76.9
Q ss_pred ccCCCCeEEEEECCCCc---EEEEeccCCC--CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 344 DMDNAGIVVVKDFVTRA---IISQFKAHTS--PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~---~v~~~~aHts--pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
.+..||.|++||..+.. ....=.||.. .|++|+||+||++|++-+.|++ ++|||+...
T Consensus 334 agc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~t-LKvWDLrq~---------------- 396 (641)
T KOG0772|consen 334 AGCLDGSIQIWDKGSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDT-LKVWDLRQF---------------- 396 (641)
T ss_pred hcccCCceeeeecCCcccccceEeeeccCCCCceeEEEeccccchhhhccCCCc-eeeeecccc----------------
Confidence 45679999999986543 2233458987 9999999999999999999776 999999632
Q ss_pred ceEEEEEeccc-ccccEEEEEEccCCCEEEEEeC------CCeEEEEeCCCCCCc
Q 001814 419 HVHLYKLHRGI-TSATIQDICFSHYSQWIAIVSS------KGTCHVFVLSPFGGD 466 (1010)
Q Consensus 419 ~~~L~~L~RG~-t~a~I~sIAFSpDg~~LAsgS~------dGTVhIw~I~~~gg~ 466 (1010)
.++|... .|. +...-.+++||||.+.|++|+. -|++.+|+-..+...
T Consensus 397 kkpL~~~-tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t~d~v 450 (641)
T KOG0772|consen 397 KKPLNVR-TGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRMTLDTV 450 (641)
T ss_pred ccchhhh-cCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEeccceeeE
Confidence 1344433 233 2235678999999999999876 356788876665443
No 85
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.40 E-value=1.7e-11 Score=142.74 Aligned_cols=238 Identities=16% Similarity=0.169 Sum_probs=179.3
Q ss_pred CCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCc
Q 001814 74 FKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHL 153 (1010)
Q Consensus 74 ~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~ 153 (1010)
..++|++|....+-+|+-. .+.+.++...+...|+.|.+.|++ ..|||-.
T Consensus 187 s~n~laValg~~vylW~~~-s~~v~~l~~~~~~~vtSv~ws~~G-------------~~LavG~---------------- 236 (484)
T KOG0305|consen 187 SANVLAVALGQSVYLWSAS-SGSVTELCSFGEELVTSVKWSPDG-------------SHLAVGT---------------- 236 (484)
T ss_pred cCCeEEEEecceEEEEecC-CCceEEeEecCCCceEEEEECCCC-------------CEEEEee----------------
Confidence 4569999999999999974 678888888888899999999776 3567621
Q ss_pred cccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-C-CcEEEEEEcCCeEEEEe-CCeEEEEECCCCceeEE-Eee
Q 001814 154 GGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-R-SSVCMVRCSPRIVAVGL-ATQIYCFDALTLENKFS-VLT 229 (1010)
Q Consensus 154 ~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~-S~V~sVa~S~rlLAV~l-d~~I~IwD~~Tle~l~t-L~t 229 (1010)
..++|.|||.++.+.+.++.. + ..|-+++++...|.+|. +..|..||++..+.... +..
T Consensus 237 -----------------~~g~v~iwD~~~~k~~~~~~~~h~~rvg~laW~~~~lssGsr~~~I~~~dvR~~~~~~~~~~~ 299 (484)
T KOG0305|consen 237 -----------------SDGTVQIWDVKEQKKTRTLRGSHASRVGSLAWNSSVLSSGSRDGKILNHDVRISQHVVSTLQG 299 (484)
T ss_pred -----------------cCCeEEEEehhhccccccccCCcCceeEEEeccCceEEEecCCCcEEEEEEecchhhhhhhhc
Confidence 136899999999999999987 4 58999999998888765 56799999887654332 222
Q ss_pred cCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhccc
Q 001814 230 YPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGL 309 (1010)
Q Consensus 230 ~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi 309 (1010)
|... .+. |++.
T Consensus 300 H~qe---------------VCg-----Lkws------------------------------------------------- 310 (484)
T KOG0305|consen 300 HRQE---------------VCG-----LKWS------------------------------------------------- 310 (484)
T ss_pred ccce---------------eee-----eEEC-------------------------------------------------
Confidence 2110 000 1111
Q ss_pred ceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECC-CCCEEEEEE
Q 001814 310 SKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDP-SGTLLVTAS 388 (1010)
Q Consensus 310 ~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSP-dGtlLATAS 388 (1010)
.....+++++.|+.|.|||......+..|..|+..|-+|+|+| ...+||||+
T Consensus 311 ---------------------------~d~~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGG 363 (484)
T KOG0305|consen 311 ---------------------------PDGNQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGG 363 (484)
T ss_pred ---------------------------CCCCeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcC
Confidence 1111245778999999999988888999999999999999999 457999974
Q ss_pred -cCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE--eCCCeEEEEeCCCCCC
Q 001814 389 -VYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV--SSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 389 -~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg--S~dGTVhIw~I~~~gg 465 (1010)
..++.|++|++.. | .++.... +...|.+|.||+..+-|+++ -.++-|.||+......
T Consensus 364 Gs~D~~i~fwn~~~-------g----------~~i~~vd---tgsQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps~~~ 423 (484)
T KOG0305|consen 364 GSADRCIKFWNTNT-------G----------ARIDSVD---TGSQVCSLIWSKKYKELLSTHGYSENQITLWKYPSMKL 423 (484)
T ss_pred CCcccEEEEEEcCC-------C----------cEecccc---cCCceeeEEEcCCCCEEEEecCCCCCcEEEEeccccce
Confidence 3456799999863 4 3444443 45689999999999777664 4466799999999888
Q ss_pred ccccccccC
Q 001814 466 DSGFQTLSS 474 (1010)
Q Consensus 466 ~~~~~~H~s 474 (1010)
...+.+|+.
T Consensus 424 ~~~l~gH~~ 432 (484)
T KOG0305|consen 424 VAELLGHTS 432 (484)
T ss_pred eeeecCCcc
Confidence 888888864
No 86
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.40 E-value=1.2e-11 Score=131.64 Aligned_cols=248 Identities=13% Similarity=0.096 Sum_probs=168.8
Q ss_pred CCCCCcEEEEEEeeccC------CCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCc
Q 001814 53 EDLKDQVTWAGFDRLEY------GPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGF 126 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~------~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F 126 (1010)
.+.+.-.....||.-++ ..+...++++...++.+||||+.....-...+..|...|..+-..+.
T Consensus 46 ~~~~gi~e~~s~d~~D~LfdV~Wse~~e~~~~~a~GDGSLrl~d~~~~s~Pi~~~kEH~~EV~Svdwn~~---------- 115 (311)
T KOG0277|consen 46 TDPKGIQECQSYDTEDGLFDVAWSENHENQVIAASGDGSLRLFDLTMPSKPIHKFKEHKREVYSVDWNTV---------- 115 (311)
T ss_pred CCCCCeEEEEeeecccceeEeeecCCCcceEEEEecCceEEEeccCCCCcchhHHHhhhhheEEeccccc----------
Confidence 35566666677775222 22334567777777779999998777777778888888888876532
Q ss_pred cccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC---
Q 001814 127 RKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--- 202 (1010)
Q Consensus 127 ~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--- 202 (1010)
+....++ ++| ++|||+||.....-+.|++.+ +.|+...++|
T Consensus 116 ---~r~~~lt----------------------sSW----------D~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~ 160 (311)
T KOG0277|consen 116 ---RRRIFLT----------------------SSW----------DGTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIP 160 (311)
T ss_pred ---cceeEEe----------------------ecc----------CCceEeecCCCCcceEeecCCccEEEEEecCCCCC
Confidence 1222221 134 479999999999999998765 5899999998
Q ss_pred CeEEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCC
Q 001814 203 RIVAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSG 281 (1010)
Q Consensus 203 rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~ 281 (1010)
++++. +.++..++||++..-....+..|.. =+++..|-
T Consensus 161 nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~~-----------------Eil~cdw~------------------------ 199 (311)
T KOG0277|consen 161 NLFASASGDGTLRLWDVRSPGKFMSIEAHNS-----------------EILCCDWS------------------------ 199 (311)
T ss_pred CeEEEccCCceEEEEEecCCCceeEEEeccc-----------------eeEeeccc------------------------
Confidence 56765 4567899999775322222332221 01111111
Q ss_pred CCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc-
Q 001814 282 VSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA- 360 (1010)
Q Consensus 282 vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~- 360 (1010)
+| ....+++++.|+.|++||+.+.+
T Consensus 200 ---------------------------------ky---------------------~~~vl~Tg~vd~~vr~wDir~~r~ 225 (311)
T KOG0277|consen 200 ---------------------------------KY---------------------NHNVLATGGVDNLVRGWDIRNLRT 225 (311)
T ss_pred ---------------------------------cc---------------------CCcEEEecCCCceEEEEehhhccc
Confidence 11 11123567889999999999854
Q ss_pred EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE
Q 001814 361 IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF 439 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF 439 (1010)
.+..|.+|.-.|..++|||.- .+|||||-| -+.|||+.+..- + ....++ +|.--|..+.|
T Consensus 226 pl~eL~gh~~AVRkvk~Sph~~~lLaSasYD-mT~riw~~~~~d-----s---------~~e~~~----~HtEFv~g~Dw 286 (311)
T KOG0277|consen 226 PLFELNGHGLAVRKVKFSPHHASLLASASYD-MTVRIWDPERQD-----S---------AIETVD----HHTEFVCGLDW 286 (311)
T ss_pred cceeecCCceEEEEEecCcchhhHhhhcccc-ceEEecccccch-----h---------hhhhhh----ccceEEecccc
Confidence 678899999999999999976 589999995 569999986310 1 012222 12234777888
Q ss_pred cc-CCCEEEEEeCCCeEEEEe
Q 001814 440 SH-YSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 440 Sp-Dg~~LAsgS~dGTVhIw~ 459 (1010)
|+ +..++|+++=|+++.||+
T Consensus 287 s~~~~~~vAs~gWDe~l~Vw~ 307 (311)
T KOG0277|consen 287 SLFDPGQVASTGWDELLYVWN 307 (311)
T ss_pred ccccCceeeecccccceeeec
Confidence 86 678999999999999997
No 87
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.40 E-value=1.3e-11 Score=131.40 Aligned_cols=220 Identities=16% Similarity=0.151 Sum_probs=151.2
Q ss_pred EEEEEccCCCcceEeeeec-cCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCC
Q 001814 86 FQVLDVEDASNFNELVSKR-DGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQ 164 (1010)
Q Consensus 86 ~qVWDv~~~g~v~ellS~h-dGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~ 164 (1010)
+-|.+++....+.+..+.. ...+..|++.++.. ..+++++|
T Consensus 40 L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse~~e------------~~~~~a~G-------------------------- 81 (311)
T KOG0277|consen 40 LFILEVTDPKGIQECQSYDTEDGLFDVAWSENHE------------NQVIAASG-------------------------- 81 (311)
T ss_pred EEEEecCCCCCeEEEEeeecccceeEeeecCCCc------------ceEEEEec--------------------------
Confidence 6677775455666666543 35677888876531 24455443
Q ss_pred CCCCCCCCCEEEEEeCC-CCeEEEEEeCC-CcEEEEEEcC----CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCC
Q 001814 165 SGNCVNSPTAVRFYSFQ-SHCYEHVLRFR-SSVCMVRCSP----RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQ 238 (1010)
Q Consensus 165 ~~~~~~sp~tVrIWDlk-tge~V~tL~f~-S~V~sVa~S~----rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~ 238 (1010)
++.+||||+. ....++.++-| ..|++|..+. .+|..+=|++|++||..-.+.+.|..++-.
T Consensus 82 -------DGSLrl~d~~~~s~Pi~~~kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~------ 148 (311)
T KOG0277|consen 82 -------DGSLRLFDLTMPSKPIHKFKEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNS------ 148 (311)
T ss_pred -------CceEEEeccCCCCcchhHHHhhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCcc------
Confidence 3579999974 34467777644 5899999986 245556689999999765555555443321
Q ss_pred CccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccc
Q 001814 239 GAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQ 318 (1010)
Q Consensus 239 g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~ 318 (1010)
+.|.. .+||....+
T Consensus 149 ------------------~Iy~a-----------------------~~sp~~~nl------------------------- 162 (311)
T KOG0277|consen 149 ------------------CIYQA-----------------------AFSPHIPNL------------------------- 162 (311)
T ss_pred ------------------EEEEE-----------------------ecCCCCCCe-------------------------
Confidence 11111 011111111
Q ss_pred cccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEE
Q 001814 319 ELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDP-SGTLLVTASVYGNNINIF 397 (1010)
Q Consensus 319 ~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVw 397 (1010)
+++++.||+.++||+........|.+|...|.++-|+. +-.+|||++.|+ .||+|
T Consensus 163 -----------------------fas~Sgd~~l~lwdvr~~gk~~~i~ah~~Eil~cdw~ky~~~vl~Tg~vd~-~vr~w 218 (311)
T KOG0277|consen 163 -----------------------FASASGDGTLRLWDVRSPGKFMSIEAHNSEILCCDWSKYNHNVLATGGVDN-LVRGW 218 (311)
T ss_pred -----------------------EEEccCCceEEEEEecCCCceeEEEeccceeEeecccccCCcEEEecCCCc-eEEEE
Confidence 23567899999999987555556999999999999986 457999999955 59999
Q ss_pred eCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCCC
Q 001814 398 RIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 398 di~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~g 464 (1010)
|+... .+++.+| -|+.- -|..|.|||-. ..||++|-|-|++||+.....
T Consensus 219 Dir~~----------------r~pl~eL-~gh~~-AVRkvk~Sph~~~lLaSasYDmT~riw~~~~~d 268 (311)
T KOG0277|consen 219 DIRNL----------------RTPLFEL-NGHGL-AVRKVKFSPHHASLLASASYDMTVRIWDPERQD 268 (311)
T ss_pred ehhhc----------------cccceee-cCCce-EEEEEecCcchhhHhhhccccceEEecccccch
Confidence 99642 1578888 45544 49999999974 688999999999999987443
No 88
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.40 E-value=2.2e-11 Score=136.88 Aligned_cols=224 Identities=15% Similarity=0.210 Sum_probs=156.1
Q ss_pred eEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCcc
Q 001814 76 QVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLG 154 (1010)
Q Consensus 76 ~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~ 154 (1010)
..+++|..+ ..-++|.+ .+.+..++.+|.-.|..+.+.|+- .+++. .
T Consensus 232 ~~ilTGG~d~~av~~d~~-s~q~l~~~~Gh~kki~~v~~~~~~-------------~~v~~--a---------------- 279 (506)
T KOG0289|consen 232 SKILTGGEDKTAVLFDKP-SNQILATLKGHTKKITSVKFHKDL-------------DTVIT--A---------------- 279 (506)
T ss_pred CcceecCCCCceEEEecc-hhhhhhhccCcceEEEEEEeccch-------------hheee--c----------------
Confidence 345555554 79999984 577888999999889988887653 11121 1
Q ss_pred ccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEEe-CCeEEEEECCCCceeEEEeec
Q 001814 155 GVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVGL-ATQIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 155 ~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~l-d~~I~IwD~~Tle~l~tL~t~ 230 (1010)
.++..++||+.-...+...+..+ .+|..+...+ ++|+.+. ++.+.+.|+.++.++.....-
T Consensus 280 ---------------Sad~~i~vws~~~~s~~~~~~~h~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~ 344 (506)
T KOG0289|consen 280 ---------------SADEIIRVWSVPLSSEPTSSRPHEEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDE 344 (506)
T ss_pred ---------------CCcceEEeeccccccCccccccccccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeec
Confidence 13478999999877766666544 6899888877 5666555 456667788887765444321
Q ss_pred CCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccc
Q 001814 231 PVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLS 310 (1010)
Q Consensus 231 p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ 310 (1010)
.+ ++ .+...+
T Consensus 345 ~s---------~v--~~ts~~----------------------------------------------------------- 354 (506)
T KOG0289|consen 345 TS---------DV--EYTSAA----------------------------------------------------------- 354 (506)
T ss_pred cc---------cc--eeEEee-----------------------------------------------------------
Confidence 11 00 011111
Q ss_pred eeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC
Q 001814 311 KTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY 390 (1010)
Q Consensus 311 ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d 390 (1010)
+.||| ..+..+..||.|+|||+.+...++.|.+|++||.+++|+-+|-+|||+..|
T Consensus 355 ---------fHpDg---------------Lifgtgt~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENGY~Lat~add 410 (506)
T KOG0289|consen 355 ---------FHPDG---------------LIFGTGTPDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENGYWLATAADD 410 (506)
T ss_pred ---------EcCCc---------------eEEeccCCCceEEEEEcCCccccccCCCCCCceeEEEeccCceEEEEEecC
Confidence 12221 112356789999999999999999999999999999999999999999997
Q ss_pred CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 391 GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 391 Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
|. |++||+.... ..+.+.+.. ...|.+++|.+.|++|++++.+=+|++++
T Consensus 411 ~~-V~lwDLRKl~---------------n~kt~~l~~---~~~v~s~~fD~SGt~L~~~g~~l~Vy~~~ 460 (506)
T KOG0289|consen 411 GS-VKLWDLRKLK---------------NFKTIQLDE---KKEVNSLSFDQSGTYLGIAGSDLQVYICK 460 (506)
T ss_pred Ce-EEEEEehhhc---------------ccceeeccc---cccceeEEEcCCCCeEEeecceeEEEEEe
Confidence 76 9999995320 012233322 22589999999999999998887777776
No 89
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.39 E-value=1.1e-10 Score=126.91 Aligned_cols=280 Identities=15% Similarity=0.187 Sum_probs=163.6
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEc-cCCCcceEeee--eccCCEEEEEEecCCCCCCCCCCcc
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDV-EDASNFNELVS--KRDGPVSFLQMQPFPVKDDGCEGFR 127 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv-~~~g~v~ellS--~hdGpV~~v~~lP~p~~s~~~D~F~ 127 (1010)
..+|+|-|..+.||. -++-+++|.. ..++|||. .+.++..-.-+ -|+|.|..|.+.+.- |.
T Consensus 9 ~s~h~DlihdVs~D~-------~GRRmAtCSsDq~vkI~d~~~~s~~W~~Ts~Wrah~~Si~rV~WAhPE--------fG 73 (361)
T KOG2445|consen 9 DSGHKDLIHDVSFDF-------YGRRMATCSSDQTVKIWDSTSDSGTWSCTSSWRAHDGSIWRVVWAHPE--------FG 73 (361)
T ss_pred ccCCcceeeeeeecc-------cCceeeeccCCCcEEEEeccCCCCceEEeeeEEecCCcEEEEEecCcc--------cc
Confidence 467999999999996 2344555555 55999995 34455544433 478888888887432 43
Q ss_pred ccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeC-----CC-C-e--EEEEEe-CCCcEEE
Q 001814 128 KLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSF-----QS-H-C--YEHVLR-FRSSVCM 197 (1010)
Q Consensus 128 ~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDl-----kt-g-e--~V~tL~-f~S~V~s 197 (1010)
..||.|+ .+++|+||.= +. + + ...+|. -++.|++
T Consensus 74 ---qvvA~cS---------------------------------~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~D 117 (361)
T KOG2445|consen 74 ---QVVATCS---------------------------------YDRTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTD 117 (361)
T ss_pred ---ceEEEEe---------------------------------cCCceeeeeecccccccccceeEEEEEeecCCcceeE
Confidence 4678764 3578999964 11 1 2 233443 3579999
Q ss_pred EEEcCC----eEE-EEeCCeEEEEECCCCceeEEEe-ecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCc
Q 001814 198 VRCSPR----IVA-VGLATQIYCFDALTLENKFSVL-TYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGR 271 (1010)
Q Consensus 198 Va~S~r----lLA-V~ld~~I~IwD~~Tle~l~tL~-t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~ 271 (1010)
|.|.|+ .|| ++.++.++||++.+.-.+.... .+...... ..++ + ....+++ .-|..++
T Consensus 118 V~FaP~hlGLklA~~~aDG~lRIYEA~dp~nLs~W~Lq~Ei~~~~--~pp~-~--~~~~~~C-----------vsWn~sr 181 (361)
T KOG2445|consen 118 VKFAPKHLGLKLAAASADGILRIYEAPDPMNLSQWTLQHEIQNVI--DPPG-K--NKQPCFC-----------VSWNPSR 181 (361)
T ss_pred EEecchhcceEEEEeccCcEEEEEecCCccccccchhhhhhhhcc--CCcc-c--ccCcceE-----------Eeecccc
Confidence 999984 344 4567789999877543322111 01000000 0000 0 0000111 1122122
Q ss_pred cCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeE
Q 001814 272 LSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIV 351 (1010)
Q Consensus 272 vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V 351 (1010)
...+ .+|+ |. + .++..-+.+
T Consensus 182 ~~~p----------------~iAv------------gs----------------------~----------e~a~~~~~~ 201 (361)
T KOG2445|consen 182 MHEP----------------LIAV------------GS----------------------D----------EDAPHLNKV 201 (361)
T ss_pred ccCc----------------eEEE------------Ec----------------------c----------cCCccccce
Confidence 1110 0110 00 0 012344678
Q ss_pred EEEECCCCc----EEEEeccCCCCeEEEEECCCC----CEEEEEEcCCCeEEEEeCCCCccc-CCCCCCcccc--CCcce
Q 001814 352 VVKDFVTRA----IISQFKAHTSPISALCFDPSG----TLLVTASVYGNNINIFRIMPSCMR-SGSGNHKYDW--NSSHV 420 (1010)
Q Consensus 352 ~VwDl~s~~----~v~~~~aHtspIsaLaFSPdG----tlLATAS~dGt~IrVwdi~p~~~~-~~sG~~~~~~--~~s~~ 420 (1010)
.||...... .++.+..|+.||..|+|.|+- .+||+|+.|| ||||.+...+.. -.-+....+. .-...
T Consensus 202 ~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~kDg--v~I~~v~~~~s~i~~ee~~~~~~~~~l~v~ 279 (361)
T KOG2445|consen 202 KIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATKDG--VRIFKVKVARSAIEEEEVLAPDLMTDLPVE 279 (361)
T ss_pred EEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeEEEeecCc--EEEEEEeeccchhhhhcccCCCCccccceE
Confidence 888876543 567889999999999999963 5899999988 999999742100 0000000000 00112
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
.+-+| +.|++.||.+.|.--|..|++.++||+|++|+-+-
T Consensus 280 ~vs~~--~~H~~~VWrv~wNmtGtiLsStGdDG~VRLWkany 319 (361)
T KOG2445|consen 280 KVSEL--DDHNGEVWRVRWNMTGTILSSTGDDGCVRLWKANY 319 (361)
T ss_pred Eeeec--cCCCCceEEEEEeeeeeEEeecCCCceeeehhhhh
Confidence 22222 44678999999999999999999999999998753
No 90
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.38 E-value=5.1e-11 Score=139.71 Aligned_cols=200 Identities=17% Similarity=0.200 Sum_probs=136.3
Q ss_pred CCEEEEEeCCCCeEEEEEeCC---CcEEE-EEE---cCCeEEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCcccc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFR---SSVCM-VRC---SPRIVAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGI 243 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~---S~V~s-Va~---S~rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~v 243 (1010)
+++++||+-+.++++.+..|- +-|-. +.+ .+.+|++ +.|..|.+|...+.+.+++|..|..
T Consensus 34 d~t~~vw~~~~~~~l~~~~~~~~~g~i~~~i~y~e~~~~~l~~g~~D~~i~v~~~~~~~P~~~LkgH~s----------- 102 (745)
T KOG0301|consen 34 DGTVKVWAKKGKQYLETHAFEGPKGFIANSICYAESDKGRLVVGGMDTTIIVFKLSQAEPLYTLKGHKS----------- 102 (745)
T ss_pred CCceeeeeccCcccccceecccCcceeeccceeccccCcceEeecccceEEEEecCCCCchhhhhcccc-----------
Confidence 478999999988887655443 22211 333 2234555 5577899999999999999999876
Q ss_pred ccCccceEEcc----ceEEEcc--CCeeeccCCccC--CCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeecc
Q 001814 244 NVGYGPMAVGP----RWLAYAS--NTLLLSNSGRLS--PQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSK 315 (1010)
Q Consensus 244 nv~~gplAlgp----RwLAyas--~~~~iwd~G~vs--~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~ 315 (1010)
+.++++. . |-.++ .+..+|..|.+. .|.++.++.+. +.
T Consensus 103 ----nVC~ls~~~~~~-~iSgSWD~TakvW~~~~l~~~l~gH~asVWAv--------------------------~~--- 148 (745)
T KOG0301|consen 103 ----NVCSLSIGEDGT-LISGSWDSTAKVWRIGELVYSLQGHTASVWAV--------------------------AS--- 148 (745)
T ss_pred ----ceeeeecCCcCc-eEecccccceEEecchhhhcccCCcchheeee--------------------------ee---
Confidence 2344432 2 11121 356677644331 11111100000 00
Q ss_pred ccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEE
Q 001814 316 YCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNIN 395 (1010)
Q Consensus 316 y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~Ir 395 (1010)
+|. . .+.+|+.|.+|++|.- ++.+.+|.+|+.-|..|++=|++. +++|+.||. ||
T Consensus 149 -----l~e-----------~-----~~vTgsaDKtIklWk~--~~~l~tf~gHtD~VRgL~vl~~~~-flScsNDg~-Ir 203 (745)
T KOG0301|consen 149 -----LPE-----------N-----TYVTGSADKTIKLWKG--GTLLKTFSGHTDCVRGLAVLDDSH-FLSCSNDGS-IR 203 (745)
T ss_pred -----cCC-----------C-----cEEeccCcceeeeccC--CchhhhhccchhheeeeEEecCCC-eEeecCCce-EE
Confidence 111 0 2357889999999986 678899999999999999998765 668999775 99
Q ss_pred EEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 396 IFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 396 Vwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.|++. | ..|++.+ |++ .-|++++-.++++.|++++.|+|++||+..
T Consensus 204 ~w~~~--------g----------e~l~~~~-ght-n~vYsis~~~~~~~Ivs~gEDrtlriW~~~ 249 (745)
T KOG0301|consen 204 LWDLD--------G----------EVLLEMH-GHT-NFVYSISMALSDGLIVSTGEDRTLRIWKKD 249 (745)
T ss_pred EEecc--------C----------ceeeeee-ccc-eEEEEEEecCCCCeEEEecCCceEEEeecC
Confidence 99984 4 4677764 554 569999999999999999999999999976
No 91
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.37 E-value=2.5e-12 Score=143.47 Aligned_cols=195 Identities=16% Similarity=0.197 Sum_probs=141.2
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEE
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFL 133 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLL 133 (1010)
|+++|.-+.|-. ....++++.++| ++|||.-. ..-..+|++|.--|+++.+-|.- -||
T Consensus 179 h~eaIRdlafSp-------nDskF~t~SdDg~ikiWdf~~-~kee~vL~GHgwdVksvdWHP~k-------------gLi 237 (464)
T KOG0284|consen 179 HAEAIRDLAFSP-------NDSKFLTCSDDGTIKIWDFRM-PKEERVLRGHGWDVKSVDWHPTK-------------GLI 237 (464)
T ss_pred hhhhhheeccCC-------CCceeEEecCCCeEEEEeccC-CchhheeccCCCCcceeccCCcc-------------cee
Confidence 458888888863 234555555555 89999854 33445668888889999987642 365
Q ss_pred EEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-CCcEEEEEEcC--CeEEEEe-
Q 001814 134 LVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-RSSVCMVRCSP--RIVAVGL- 209 (1010)
Q Consensus 134 AvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~S~V~sVa~S~--rlLAV~l- 209 (1010)
|+ ++ .++.|+|||.++|+|+++|-. ...|.++.|++ .+|+++.
T Consensus 238 as--gs-------------------------------kDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n~N~Llt~sk 284 (464)
T KOG0284|consen 238 AS--GS-------------------------------KDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPNGNWLLTGSK 284 (464)
T ss_pred EE--cc-------------------------------CCceeEeecCCCcchhhhhhhccceEEEEEEcCCCCeeEEccC
Confidence 53 21 247899999999999999964 56999999998 5777655
Q ss_pred CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 210 ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 210 d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
|..+++||+++|+.++++..|... +..++.. |-
T Consensus 285 D~~~kv~DiR~mkEl~~~r~Hkkd-------------v~~~~Wh----------------------------------P~ 317 (464)
T KOG0284|consen 285 DQSCKVFDIRTMKELFTYRGHKKD-------------VTSLTWH----------------------------------PL 317 (464)
T ss_pred CceEEEEehhHhHHHHHhhcchhh-------------heeeccc----------------------------------cc
Confidence 456999999999999888776441 0111111 00
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEe-ccC
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQF-KAH 368 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~-~aH 368 (1010)
+-. .+.+++.||.|..|.+...+.+..+ .||
T Consensus 318 ~~~------------------------------------------------lftsgg~Dgsvvh~~v~~~~p~~~i~~AH 349 (464)
T KOG0284|consen 318 NES------------------------------------------------LFTSGGSDGSVVHWVVGLEEPLGEIPPAH 349 (464)
T ss_pred ccc------------------------------------------------ceeeccCCCceEEEeccccccccCCCccc
Confidence 000 1235678999999999865556555 489
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
...|.+|++.|=|.+|||||. ++++|.|.-
T Consensus 350 d~~iwsl~~hPlGhil~tgsn-d~t~rfw~r 379 (464)
T KOG0284|consen 350 DGEIWSLAYHPLGHILATGSN-DRTVRFWTR 379 (464)
T ss_pred ccceeeeeccccceeEeecCC-Ccceeeecc
Confidence 999999999999999999999 566999964
No 92
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.37 E-value=2.6e-12 Score=151.72 Aligned_cols=109 Identities=17% Similarity=0.253 Sum_probs=79.8
Q ss_pred CCCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 346 DNAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 346 s~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
...|.+++||+... +...+|.||.+||.++.|+|++.+||||+.| ..|+||+.... ....+.+
T Consensus 196 ~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRD-K~vkiWd~t~~---------------~~~~~~t 259 (839)
T KOG0269|consen 196 HDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRD-KMVKIWDMTDS---------------RAKPKHT 259 (839)
T ss_pred cCCceEEEeeccCchhHHHHhhcccCceEEEeecCCCceeeecCCC-ccEEEEeccCC---------------CccceeE
Confidence 45678888888754 4567889999999999999999999999975 55999998531 0123344
Q ss_pred EecccccccEEEEEEccCCCE-EEEEeC--CCeEEEEeCCC-CCCcccccccc
Q 001814 425 LHRGITSATIQDICFSHYSQW-IAIVSS--KGTCHVFVLSP-FGGDSGFQTLS 473 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~-LAsgS~--dGTVhIw~I~~-~gg~~~~~~H~ 473 (1010)
.. |.+.|..+.|.|+.++ ||+++. |-.||||+|.- |=-...+..|.
T Consensus 260 In---Tiapv~rVkWRP~~~~hLAtcsmv~dtsV~VWDvrRPYIP~~t~~eH~ 309 (839)
T KOG0269|consen 260 IN---TIAPVGRVKWRPARSYHLATCSMVVDTSVHVWDVRRPYIPYATFLEHT 309 (839)
T ss_pred Ee---ecceeeeeeeccCccchhhhhhccccceEEEEeeccccccceeeeccC
Confidence 43 4578999999999765 555554 55799999963 33334555664
No 93
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.37 E-value=1.6e-11 Score=130.21 Aligned_cols=263 Identities=14% Similarity=0.209 Sum_probs=169.9
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCC--cceEeeeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDAS--NFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g--~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
..|+|.|..+..|. -++.|+++.. ..++|+.+.+.+ ++...|.+|.|||.-+.+. .|. |.
T Consensus 8 t~H~D~IHda~lDy-------ygkrlATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wa-hPk-------~G-- 70 (299)
T KOG1332|consen 8 TQHEDMIHDAQLDY-------YGKRLATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWA-HPK-------FG-- 70 (299)
T ss_pred hhhhhhhhHhhhhh-------hcceeeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeec-ccc-------cC--
Confidence 45777777666553 2344555554 559999998776 5667788999999999986 221 22
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC---CCcEEEEEEcCC---
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF---RSSVCMVRCSPR--- 203 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f---~S~V~sVa~S~r--- 203 (1010)
.+||.++ ++++|.||.-.+|+.-+...+ .+.|.+|++-|.
T Consensus 71 -~iLAScs---------------------------------YDgkVIiWke~~g~w~k~~e~~~h~~SVNsV~wapheyg 116 (299)
T KOG1332|consen 71 -TILASCS---------------------------------YDGKVIIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYG 116 (299)
T ss_pred -cEeeEee---------------------------------cCceEEEEecCCCchhhhhhhhhhcccceeecccccccc
Confidence 4778643 468999999999865444433 368999999883
Q ss_pred -eEEEE-eCCeEEEEECCCC-ce-eEEE-eecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCC
Q 001814 204 -IVAVG-LATQIYCFDALTL-EN-KFSV-LTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLT 278 (1010)
Q Consensus 204 -lLAV~-ld~~I~IwD~~Tl-e~-l~tL-~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt 278 (1010)
+|+++ .|+.|.|++..+- .- ...+ ..|+. +.+.++..| .
T Consensus 117 l~LacasSDG~vsvl~~~~~g~w~t~ki~~aH~~-------------GvnsVswap-------a---------------- 160 (299)
T KOG1332|consen 117 LLLACASSDGKVSVLTYDSSGGWTTSKIVFAHEI-------------GVNSVSWAP-------A---------------- 160 (299)
T ss_pred eEEEEeeCCCcEEEEEEcCCCCccchhhhhcccc-------------ccceeeecC-------c----------------
Confidence 56664 5778999876643 10 0000 01111 122222221 0
Q ss_pred CCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC
Q 001814 279 PSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT 358 (1010)
Q Consensus 279 ~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s 358 (1010)
+-| |.++.+- .++.+ .-+++++.|..|+||++.+
T Consensus 161 -------~~~--g~~~~~~-------~~~~~------------------------------krlvSgGcDn~VkiW~~~~ 194 (299)
T KOG1332|consen 161 -------SAP--GSLVDQG-------PAAKV------------------------------KRLVSGGCDNLVKIWKFDS 194 (299)
T ss_pred -------CCC--ccccccC-------ccccc------------------------------ceeeccCCccceeeeecCC
Confidence 000 1111000 00000 0134678899999999998
Q ss_pred Cc--EEEEeccCCCCeEEEEECCCC----CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccc
Q 001814 359 RA--IISQFKAHTSPISALCFDPSG----TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSA 432 (1010)
Q Consensus 359 ~~--~v~~~~aHtspIsaLaFSPdG----tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a 432 (1010)
++ .-.+|++|+.-|..+++.|.- ..||+||.||+ +.||..... ...|.. ..|.+| ..
T Consensus 195 ~~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~SqDg~-viIwt~~~e---------~e~wk~--tll~~f-----~~ 257 (299)
T KOG1332|consen 195 DSWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQDGT-VIIWTKDEE---------YEPWKK--TLLEEF-----PD 257 (299)
T ss_pred cchhhhhhhhhcchhhhhhhhccccCCCceeeEEecCCCc-EEEEEecCc---------cCcccc--cccccC-----Cc
Confidence 65 335699999999999999975 57999999888 569987421 123433 223332 34
Q ss_pred cEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 433 TIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 433 ~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.++.++||..|..||++..|..|.+|.=+..|.
T Consensus 258 ~~w~vSWS~sGn~LaVs~GdNkvtlwke~~~Gk 290 (299)
T KOG1332|consen 258 VVWRVSWSLSGNILAVSGGDNKVTLWKENVDGK 290 (299)
T ss_pred ceEEEEEeccccEEEEecCCcEEEEEEeCCCCc
Confidence 699999999999999999999999999665543
No 94
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.37 E-value=1e-10 Score=140.58 Aligned_cols=241 Identities=16% Similarity=0.215 Sum_probs=159.5
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
..|++.-+-+.||. ++..|+++..+| +++|+.....+.-+.+..+...|.+++.-- .
T Consensus 10 yaht~G~t~i~~d~-------~gefi~tcgsdg~ir~~~~~sd~e~P~ti~~~g~~v~~ia~~s---------------~ 67 (933)
T KOG1274|consen 10 YAHTGGLTLICYDP-------DGEFICTCGSDGDIRKWKTNSDEEEPETIDISGELVSSIACYS---------------N 67 (933)
T ss_pred hhccCceEEEEEcC-------CCCEEEEecCCCceEEeecCCcccCCchhhccCceeEEEeecc---------------c
Confidence 45777777777763 334555555555 999998654344556555666777776431 1
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEE-eCCCcEEEEEEc--CCeEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVL-RFRSSVCMVRCS--PRIVAVG 208 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL-~f~S~V~sVa~S--~rlLAV~ 208 (1010)
.+++.+ .+++|.+|.+-+++.-..| +|.-++..++++ +..+|.|
T Consensus 68 ~f~~~s---------------------------------~~~tv~~y~fps~~~~~iL~Rftlp~r~~~v~g~g~~iaag 114 (933)
T KOG1274|consen 68 HFLTGS---------------------------------EQNTVLRYKFPSGEEDTILARFTLPIRDLAVSGSGKMIAAG 114 (933)
T ss_pred ceEEee---------------------------------ccceEEEeeCCCCCccceeeeeeccceEEEEecCCcEEEee
Confidence 334311 1378999999887753222 455555555555 5688887
Q ss_pred eCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 209 LAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 209 ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
.++ .|+|-++.+.....++..|..|. -.+.+.
T Consensus 115 sdD~~vK~~~~~D~s~~~~lrgh~apV-------------l~l~~~---------------------------------- 147 (933)
T KOG1274|consen 115 SDDTAVKLLNLDDSSQEKVLRGHDAPV-------------LQLSYD---------------------------------- 147 (933)
T ss_pred cCceeEEEEeccccchheeecccCCce-------------eeeeEc----------------------------------
Confidence 765 69999988877777777665431 111211
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
|. +.++ +....+|.|+|||+.++.+..++..
T Consensus 148 p~-~~fL------------------------------------------------Avss~dG~v~iw~~~~~~~~~tl~~ 178 (933)
T KOG1274|consen 148 PK-GNFL------------------------------------------------AVSSCDGKVQIWDLQDGILSKTLTG 178 (933)
T ss_pred CC-CCEE------------------------------------------------EEEecCceEEEEEcccchhhhhccc
Confidence 11 1111 1235689999999998876555442
Q ss_pred -------C-CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE
Q 001814 368 -------H-TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF 439 (1010)
Q Consensus 368 -------H-tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF 439 (1010)
- ...+..++|+|+|-.||....++ .|.||+... | .++++|+--.....+.+++|
T Consensus 179 v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~-~Vkvy~r~~-------------w----e~~f~Lr~~~~ss~~~~~~w 240 (933)
T KOG1274|consen 179 VDKDNEFILSRICTRLAWHPKGGTLAVPPVDN-TVKVYSRKG-------------W----ELQFKLRDKLSSSKFSDLQW 240 (933)
T ss_pred CCccccccccceeeeeeecCCCCeEEeeccCC-eEEEEccCC-------------c----eeheeecccccccceEEEEE
Confidence 1 45677899999965566666645 589999752 2 46778765444456999999
Q ss_pred ccCCCEEEEEeCCCeEEEEeCCC
Q 001814 440 SHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 440 SpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
||.|+|||+++.+|-|-||+++.
T Consensus 241 sPnG~YiAAs~~~g~I~vWnv~t 263 (933)
T KOG1274|consen 241 SPNGKYIAASTLDGQILVWNVDT 263 (933)
T ss_pred cCCCcEEeeeccCCcEEEEeccc
Confidence 99999999999999999999983
No 95
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.36 E-value=5.9e-11 Score=127.14 Aligned_cols=211 Identities=14% Similarity=0.128 Sum_probs=151.9
Q ss_pred eeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEe
Q 001814 100 LVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYS 179 (1010)
Q Consensus 100 llS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWD 179 (1010)
++.+|+-|+..|++.-++ .||..|+ .+.++.+|=
T Consensus 5 ~l~GHERplTqiKyN~eG-------------DLlFsca---------------------------------KD~~~~vw~ 38 (327)
T KOG0643|consen 5 LLQGHERPLTQIKYNREG-------------DLLFSCA---------------------------------KDSTPTVWY 38 (327)
T ss_pred ccccCccccceEEecCCC-------------cEEEEec---------------------------------CCCCceEEE
Confidence 466788999999976443 5766543 245788999
Q ss_pred CCCCeEEEEEeCC-CcEEEEEEcC--CeEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEc--
Q 001814 180 FQSHCYEHVLRFR-SSVCMVRCSP--RIVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVG-- 253 (1010)
Q Consensus 180 lktge~V~tL~f~-S~V~sVa~S~--rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlg-- 253 (1010)
..+|+.+-+++.| +.|++++.+. +.|+.+. |..+++||+.+++.+.++.+... ...+.|+
T Consensus 39 s~nGerlGty~GHtGavW~~Did~~s~~liTGSAD~t~kLWDv~tGk~la~~k~~~~--------------Vk~~~F~~~ 104 (327)
T KOG0643|consen 39 SLNGERLGTYDGHTGAVWCCDIDWDSKHLITGSADQTAKLWDVETGKQLATWKTNSP--------------VKRVDFSFG 104 (327)
T ss_pred ecCCceeeeecCCCceEEEEEecCCcceeeeccccceeEEEEcCCCcEEEEeecCCe--------------eEEEeeccC
Confidence 9999999999877 5898887765 5677665 45799999999999998875211 1112221
Q ss_pred cceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCC
Q 001814 254 PRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNS 333 (1010)
Q Consensus 254 pRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~ 333 (1010)
..++.+..+.
T Consensus 105 gn~~l~~tD~---------------------------------------------------------------------- 114 (327)
T KOG0643|consen 105 GNLILASTDK---------------------------------------------------------------------- 114 (327)
T ss_pred CcEEEEEehh----------------------------------------------------------------------
Confidence 1111111100
Q ss_pred ccccccccccccCCCCeEEEEECC-------CCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccC
Q 001814 334 VWKVGRHAGADMDNAGIVVVKDFV-------TRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRS 406 (1010)
Q Consensus 334 ~~k~~~~~iasgs~dG~V~VwDl~-------s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~ 406 (1010)
.-++.+.|.++|+. +.+++..+..|.+.|..+-|+|-|+.|+++.++|. |++||+..
T Consensus 115 ----------~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~ii~Ghe~G~-is~~da~~----- 178 (327)
T KOG0643|consen 115 ----------QMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSALWGPLGETIIAGHEDGS-ISIYDART----- 178 (327)
T ss_pred ----------hcCcceEEEEEEccCChhhhcccCceEEecCCccceeeeeecccCCEEEEecCCCc-EEEEEccc-----
Confidence 01345677777776 44567788889999999999999999999999886 99999964
Q ss_pred CCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccc
Q 001814 407 GSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGF 469 (1010)
Q Consensus 407 ~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~ 469 (1010)
| ..+.+-.+ .+...|.+|.||+|..++.++|.|.|.++|++.......++
T Consensus 179 --g----------~~~v~s~~-~h~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v~Kty 228 (327)
T KOG0643|consen 179 --G----------KELVDSDE-EHSSKINDLQFSRDRTYFITGSKDTTAKLVDVRTLEVLKTY 228 (327)
T ss_pred --C----------ceeeechh-hhccccccccccCCcceEEecccCccceeeeccceeeEEEe
Confidence 4 23333222 24457999999999999999999999999999876554443
No 96
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.35 E-value=9.5e-11 Score=123.80 Aligned_cols=273 Identities=15% Similarity=0.121 Sum_probs=179.1
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
..+++-|.-++|+. + ..-+|+.|.+..+++|+. ..+.+.+..++|.-.|..++..-+. + -
T Consensus 14 ~~~qgaV~avryN~----d--GnY~ltcGsdrtvrLWNp-~rg~liktYsghG~EVlD~~~s~Dn-----------s--k 73 (307)
T KOG0316|consen 14 DCAQGAVRAVRYNV----D--GNYCLTCGSDRTVRLWNP-LRGALIKTYSGHGHEVLDAALSSDN-----------S--K 73 (307)
T ss_pred cccccceEEEEEcc----C--CCEEEEcCCCceEEeecc-cccceeeeecCCCceeeeccccccc-----------c--c
Confidence 55788899999985 1 245788888889999998 4688999999999889888876332 1 2
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEE-E
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAV-G 208 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV-~ 208 (1010)
++. |+ -++.|.+||+.+|+.++.++.| ..|..|+||. .+++. +
T Consensus 74 f~s--~G-------------------------------gDk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~Sgs 120 (307)
T KOG0316|consen 74 FAS--CG-------------------------------GDKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGS 120 (307)
T ss_pred ccc--CC-------------------------------CCceEEEEEcccCeeeeecccccceeeEEEecCcceEEEecc
Confidence 343 21 1478999999999999999877 5899999997 45555 4
Q ss_pred eCCeEEEEECCC--CceeEEEeecCCccccCCCccccccCccceEEcc-ceEEEccC-CeeeccC--CccCCCcCCCCCC
Q 001814 209 LATQIYCFDALT--LENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP-RWLAYASN-TLLLSNS--GRLSPQNLTPSGV 282 (1010)
Q Consensus 209 ld~~I~IwD~~T--le~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp-RwLAyas~-~~~iwd~--G~vs~Q~lt~p~v 282 (1010)
.|..|++||-+. .+.++.+.+... + ...+.+.- -.+|-+.+ +++..+. |.+....+..|+.
T Consensus 121 fD~s~r~wDCRS~s~ePiQildea~D------~-------V~Si~v~~heIvaGS~DGtvRtydiR~G~l~sDy~g~pit 187 (307)
T KOG0316|consen 121 FDSSVRLWDCRSRSFEPIQILDEAKD------G-------VSSIDVAEHEIVAGSVDGTVRTYDIRKGTLSSDYFGHPIT 187 (307)
T ss_pred ccceeEEEEcccCCCCccchhhhhcC------c-------eeEEEecccEEEeeccCCcEEEEEeecceeehhhcCCcce
Confidence 577899999774 455565554322 0 11222222 12221111 2233321 3332222222333
Q ss_pred CCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEE
Q 001814 283 SPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAII 362 (1010)
Q Consensus 283 S~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v 362 (1010)
+.++|+++.-+ ..+.-++++++.|-.+++++
T Consensus 188 ~vs~s~d~nc~-------------------------------------------------La~~l~stlrLlDk~tGklL 218 (307)
T KOG0316|consen 188 SVSFSKDGNCS-------------------------------------------------LASSLDSTLRLLDKETGKLL 218 (307)
T ss_pred eEEecCCCCEE-------------------------------------------------EEeeccceeeecccchhHHH
Confidence 33333322111 12356889999999999999
Q ss_pred EEeccCCCCe--EEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEc
Q 001814 363 SQFKAHTSPI--SALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFS 440 (1010)
Q Consensus 363 ~~~~aHtspI--saLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFS 440 (1010)
+..++|.+.= ...+|+.+.+.++++|+||. +.+||+.. + ..+-+|.-+-+ -.|.+|+|.
T Consensus 219 ~sYkGhkn~eykldc~l~qsdthV~sgSEDG~-Vy~wdLvd-------~----------~~~sk~~~~~~-v~v~dl~~h 279 (307)
T KOG0316|consen 219 KSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGK-VYFWDLVD-------E----------TQISKLSVVST-VIVTDLSCH 279 (307)
T ss_pred HHhcccccceeeeeeeecccceeEEeccCCce-EEEEEecc-------c----------eeeeeeccCCc-eeEEeeecc
Confidence 9999997643 34689999999999999886 88999964 2 35556654322 248999999
Q ss_pred cCCCEEEEEeCCCeEEEEeC
Q 001814 441 HYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~I 460 (1010)
|--..+.++...+ +..|.-
T Consensus 280 p~~~~f~~A~~~~-~~~~~~ 298 (307)
T KOG0316|consen 280 PTMDDFITATGHG-DLFWYQ 298 (307)
T ss_pred cCccceeEecCCc-eeceee
Confidence 9877666665443 445543
No 97
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.34 E-value=1.4e-10 Score=130.37 Aligned_cols=241 Identities=17% Similarity=0.215 Sum_probs=164.2
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
..+|.-.|+|+.|.+ +...++++.. ..|+||.+.. .....++.-|+++|..+..-|++.
T Consensus 257 ~~Gh~kki~~v~~~~-------~~~~v~~aSad~~i~vws~~~-~s~~~~~~~h~~~V~~ls~h~tge------------ 316 (506)
T KOG0289|consen 257 LKGHTKKITSVKFHK-------DLDTVITASADEIIRVWSVPL-SSEPTSSRPHEEPVTGLSLHPTGE------------ 316 (506)
T ss_pred ccCcceEEEEEEecc-------chhheeecCCcceEEeecccc-ccCccccccccccceeeeeccCCc------------
Confidence 466777888998875 2233444444 4599999964 445667778999999999988762
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-C--cEEEEEEcCC--eE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-S--SVCMVRCSPR--IV 205 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S--~V~sVa~S~r--lL 205 (1010)
+|+..+ .+++..|.|.++|.++...... + .+.+.+|.|+ ++
T Consensus 317 -YllsAs---------------------------------~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLif 362 (506)
T KOG0289|consen 317 -YLLSAS---------------------------------NDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIF 362 (506)
T ss_pred -EEEEec---------------------------------CCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEE
Confidence 444211 1468899999999988777654 3 4689999996 33
Q ss_pred EEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCC
Q 001814 206 AVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSP 284 (1010)
Q Consensus 206 AV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~ 284 (1010)
+.+. ++.|+|||+......-.+..|..| .+-|+|+.|
T Consensus 363 gtgt~d~~vkiwdlks~~~~a~Fpght~~--------------------vk~i~FsEN---------------------- 400 (506)
T KOG0289|consen 363 GTGTPDGVVKIWDLKSQTNVAKFPGHTGP--------------------VKAISFSEN---------------------- 400 (506)
T ss_pred eccCCCceEEEEEcCCccccccCCCCCCc--------------------eeEEEeccC----------------------
Confidence 4444 568999998876544333333221 022333332
Q ss_pred CcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEE
Q 001814 285 STSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQ 364 (1010)
Q Consensus 285 stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~ 364 (1010)
|.+ ++.+..|+.|++||+...+...+
T Consensus 401 ------GY~------------------------------------------------Lat~add~~V~lwDLRKl~n~kt 426 (506)
T KOG0289|consen 401 ------GYW------------------------------------------------LATAADDGSVKLWDLRKLKNFKT 426 (506)
T ss_pred ------ceE------------------------------------------------EEEEecCCeEEEEEehhhcccce
Confidence 222 12345788899999998887777
Q ss_pred ecc-CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC
Q 001814 365 FKA-HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS 443 (1010)
Q Consensus 365 ~~a-HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg 443 (1010)
|.- -..+|.+++|+++|++|+.++ ..++||.... +...|+ ++..+. .+.+....+.|-.+.
T Consensus 427 ~~l~~~~~v~s~~fD~SGt~L~~~g---~~l~Vy~~~k---------~~k~W~----~~~~~~--~~sg~st~v~Fg~~a 488 (506)
T KOG0289|consen 427 IQLDEKKEVNSLSFDQSGTYLGIAG---SDLQVYICKK---------KTKSWT----EIKELA--DHSGLSTGVRFGEHA 488 (506)
T ss_pred eeccccccceeEEEcCCCCeEEeec---ceeEEEEEec---------ccccce----eeehhh--hcccccceeeecccc
Confidence 764 335899999999999999985 4577777642 123343 232321 123456789999999
Q ss_pred CEEEEEeCCCeEEEEeC
Q 001814 444 QWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~I 460 (1010)
++|+++|.|...+||.+
T Consensus 489 q~l~s~smd~~l~~~a~ 505 (506)
T KOG0289|consen 489 QYLASTSMDAILRLYAL 505 (506)
T ss_pred eEEeeccchhheEEeec
Confidence 99999999999999876
No 98
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.34 E-value=3.2e-12 Score=145.01 Aligned_cols=208 Identities=13% Similarity=0.121 Sum_probs=156.6
Q ss_pred cceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEE
Q 001814 96 NFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAV 175 (1010)
Q Consensus 96 ~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tV 175 (1010)
....++++|...|+++++.|.. ..||+. + + .+++|
T Consensus 205 k~~~~~~gH~kgvsai~~fp~~------------~hLlLS--~------g-------------------------mD~~v 239 (503)
T KOG0282|consen 205 KLSHNLSGHTKGVSAIQWFPKK------------GHLLLS--G------G-------------------------MDGLV 239 (503)
T ss_pred hheeeccCCccccchhhhccce------------eeEEEe--c------C-------------------------CCceE
Confidence 3456677888899999988742 235553 2 1 25899
Q ss_pred EEEeCCC-CeEEEEEeCCC-cEEEEEEcC---CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccce
Q 001814 176 RFYSFQS-HCYEHVLRFRS-SVCMVRCSP---RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPM 250 (1010)
Q Consensus 176 rIWDlkt-ge~V~tL~f~S-~V~sVa~S~---rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gpl 250 (1010)
+||++.. +.+++++..|+ +|.+++|+. ++|.++.|..|++||+.|++++.++.+.-.| ..+
T Consensus 240 klW~vy~~~~~lrtf~gH~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~~--------------~cv 305 (503)
T KOG0282|consen 240 KLWNVYDDRRCLRTFKGHRKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVLSRFHLDKVP--------------TCV 305 (503)
T ss_pred EEEEEecCcceehhhhcchhhhhhhhccccCCeeeeeecceeeeeeccccceEEEEEecCCCc--------------eee
Confidence 9999986 89999998775 899999997 6899999999999999999998877642221 000
Q ss_pred EEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCcc
Q 001814 251 AVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVS 330 (1010)
Q Consensus 251 AlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S 330 (1010)
- +.|++..
T Consensus 306 k----------------------------------f~pd~~n-------------------------------------- 313 (503)
T KOG0282|consen 306 K----------------------------------FHPDNQN-------------------------------------- 313 (503)
T ss_pred e----------------------------------cCCCCCc--------------------------------------
Confidence 1 1111100
Q ss_pred CCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCC
Q 001814 331 PNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGN 410 (1010)
Q Consensus 331 ~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~ 410 (1010)
.+..|..|+.|+.||+.++++++....|-.+|..+.|=++|+.++|+|.++ .+|||+.... .
T Consensus 314 ----------~fl~G~sd~ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~~g~rFissSDdk-s~riWe~~~~-------v 375 (503)
T KOG0282|consen 314 ----------IFLVGGSDKKIRQWDIRSGKVVQEYDRHLGAILDITFVDEGRRFISSSDDK-SVRIWENRIP-------V 375 (503)
T ss_pred ----------EEEEecCCCcEEEEeccchHHHHHHHhhhhheeeeEEccCCceEeeeccCc-cEEEEEcCCC-------c
Confidence 122356799999999999999999999999999999999999999999966 5999998531 0
Q ss_pred CccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 411 HKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 411 ~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+-+...-.++ ..-+|+-+|.+.|+++=|.|.-|-||.+.+.
T Consensus 376 ---------~ik~i~~~~~h--smP~~~~~P~~~~~~aQs~dN~i~ifs~~~~ 417 (503)
T KOG0282|consen 376 ---------PIKNIADPEMH--TMPCLTLHPNGKWFAAQSMDNYIAIFSTVPP 417 (503)
T ss_pred ---------cchhhcchhhc--cCcceecCCCCCeehhhccCceEEEEecccc
Confidence 12222222222 4678999999999999999999999997653
No 99
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.33 E-value=8.5e-10 Score=115.88 Aligned_cols=255 Identities=15% Similarity=0.113 Sum_probs=172.5
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcc-----eEeeeeccCCEEEEEEecCCCCCCCCCCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNF-----NELVSKRDGPVSFLQMQPFPVKDDGCEGF 126 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v-----~ellS~hdGpV~~v~~lP~p~~s~~~D~F 126 (1010)
--||..|..+.|. +.+.+|++|..+. +++.-.+. +.+ .--++-|||.|+.++|+-+|.
T Consensus 86 khhkgsiyc~~ws-------~~geliatgsndk~ik~l~fn~-dt~~~~g~dle~nmhdgtirdl~fld~~~-------- 149 (350)
T KOG0641|consen 86 KHHKGSIYCTAWS-------PCGELIATGSNDKTIKVLPFNA-DTCNATGHDLEFNMHDGTIRDLAFLDDPE-------- 149 (350)
T ss_pred cccCccEEEEEec-------CccCeEEecCCCceEEEEeccc-ccccccCcceeeeecCCceeeeEEecCCC--------
Confidence 3466777777765 3678999999875 77765532 111 122567999999999996662
Q ss_pred cccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEE-EEcCCe
Q 001814 127 RKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMV-RCSPRI 204 (1010)
Q Consensus 127 ~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sV-a~S~rl 204 (1010)
+...+|+. ++ . -+++|.+-|-.+|+..+.+..+ +-|+++ .++.-+
T Consensus 150 -s~~~il~s--~g----a--------------------------gdc~iy~tdc~~g~~~~a~sghtghilalyswn~~m 196 (350)
T KOG0641|consen 150 -SGGAILAS--AG----A--------------------------GDCKIYITDCGRGQGFHALSGHTGHILALYSWNGAM 196 (350)
T ss_pred -cCceEEEe--cC----C--------------------------CcceEEEeecCCCCcceeecCCcccEEEEEEecCcE
Confidence 22345553 11 0 1478888899999999999876 478776 457777
Q ss_pred EEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCC
Q 001814 205 VAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVS 283 (1010)
Q Consensus 205 LAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS 283 (1010)
++.+. +.+|++||++-..++.++.+.-- +. +...-|
T Consensus 197 ~~sgsqdktirfwdlrv~~~v~~l~~~~~---------~~--glessa-------------------------------- 233 (350)
T KOG0641|consen 197 FASGSQDKTIRFWDLRVNSCVNTLDNDFH---------DG--GLESSA-------------------------------- 233 (350)
T ss_pred EEccCCCceEEEEeeeccceeeeccCccc---------CC--Ccccce--------------------------------
Confidence 77765 56799999987766666653111 00 000000
Q ss_pred CCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEE
Q 001814 284 PSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIIS 363 (1010)
Q Consensus 284 ~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~ 363 (1010)
|+.++ .+|.++ .++++..|....+||+..++.+.
T Consensus 234 ----------vaav~--------------------------------vdpsgr----ll~sg~~dssc~lydirg~r~iq 267 (350)
T KOG0641|consen 234 ----------VAAVA--------------------------------VDPSGR----LLASGHADSSCMLYDIRGGRMIQ 267 (350)
T ss_pred ----------eEEEE--------------------------------ECCCcc----eeeeccCCCceEEEEeeCCceee
Confidence 11000 011111 12356778889999999999999
Q ss_pred EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC
Q 001814 364 QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS 443 (1010)
Q Consensus 364 ~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg 443 (1010)
.|..|+..|.++.|||...+|.|+|- +..|++=|+. |... ..|-..--+.+..++..+-|.|..
T Consensus 268 ~f~phsadir~vrfsp~a~yllt~sy-d~~ikltdlq--------gdla-------~el~~~vv~ehkdk~i~~rwh~~d 331 (350)
T KOG0641|consen 268 RFHPHSADIRCVRFSPGAHYLLTCSY-DMKIKLTDLQ--------GDLA-------HELPIMVVAEHKDKAIQCRWHPQD 331 (350)
T ss_pred eeCCCccceeEEEeCCCceEEEEecc-cceEEEeecc--------cchh-------hcCceEEEEeccCceEEEEecCcc
Confidence 99999999999999999999999999 4669999985 3110 111111123344455568999999
Q ss_pred CEEEEEeCCCeEEEEeCC
Q 001814 444 QWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~I~ 461 (1010)
--+.++|.|.|+.+|.++
T Consensus 332 ~sfisssadkt~tlwa~~ 349 (350)
T KOG0641|consen 332 FSFISSSADKTATLWALN 349 (350)
T ss_pred ceeeeccCcceEEEeccC
Confidence 889999999999999986
No 100
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.33 E-value=6.3e-12 Score=136.65 Aligned_cols=108 Identities=24% Similarity=0.344 Sum_probs=94.3
Q ss_pred cccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFK-AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~-aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+++..||.|+||.+.+|.++..|. ||+..|.||.||.|+..+.++|. +.++||.-+.. | ++
T Consensus 279 AsGsqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sf-D~tvRiHGlKS-------G----------K~ 340 (508)
T KOG0275|consen 279 ASGSQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASF-DQTVRIHGLKS-------G----------KC 340 (508)
T ss_pred hccCcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccc-cceEEEecccc-------c----------hh
Confidence 467899999999999999999996 99999999999999999999999 56799987752 5 57
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccc
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQ 470 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~ 470 (1010)
|.++ ||++ .-|....|++||.+|.++|.||||+||+.....+..+++
T Consensus 341 LKEf-rGHs-Syvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk 387 (508)
T KOG0275|consen 341 LKEF-RGHS-SYVNEATFTDDGHHIISASSDGTVKVWHGKTTECLSTFK 387 (508)
T ss_pred HHHh-cCcc-ccccceEEcCCCCeEEEecCCccEEEecCcchhhhhhcc
Confidence 7777 6765 469999999999999999999999999998877665554
No 101
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.33 E-value=1.3e-11 Score=133.86 Aligned_cols=98 Identities=17% Similarity=0.242 Sum_probs=83.0
Q ss_pred CCeEEEEECCCCcEEEEe---ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 348 AGIVVVKDFVTRAIISQF---KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~---~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
--++++||+.+-++...- ..|+..|.++.+|+.|++-+|||.||. |++||=-. + +++.+
T Consensus 237 Hp~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t~~lYvTaSkDG~-IklwDGVS-------~----------rCv~t 298 (430)
T KOG0640|consen 237 HPTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSSTGSLYVTASKDGA-IKLWDGVS-------N----------RCVRT 298 (430)
T ss_pred CCceeEEeccceeEeeecCcccccccceeEEEecCCccEEEEeccCCc-EEeecccc-------H----------HHHHH
Confidence 447899999988764332 359999999999999999999999887 99999431 2 57777
Q ss_pred EecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 425 LHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.+.+..+.|.+..|+.+|+||.+++.|.+|++|.|...
T Consensus 299 ~~~AH~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~ 337 (430)
T KOG0640|consen 299 IGNAHGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTG 337 (430)
T ss_pred HHhhcCCceeeeEEEccCCeEEeecCCcceeeeeeecCC
Confidence 777777789999999999999999999999999999864
No 102
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.33 E-value=2.8e-12 Score=150.51 Aligned_cols=215 Identities=13% Similarity=0.204 Sum_probs=153.8
Q ss_pred CeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCc
Q 001814 75 KQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHL 153 (1010)
Q Consensus 75 ~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~ 153 (1010)
.++++.|..+- +-+|.+...... .-|-+|+++|.+|.|.+.- -||+. |
T Consensus 40 ~r~~~~Gg~~~k~~L~~i~kp~~i-~S~~~hespIeSl~f~~~E-------------~Llaa--g--------------- 88 (825)
T KOG0267|consen 40 SRSLVTGGEDEKVNLWAIGKPNAI-TSLTGHESPIESLTFDTSE-------------RLLAA--G--------------- 88 (825)
T ss_pred ceeeccCCCceeeccccccCCchh-heeeccCCcceeeecCcch-------------hhhcc--c---------------
Confidence 46777777665 569998654322 2356899999999977432 24442 1
Q ss_pred cccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEE-eCCeEEEEECCCCceeEEEee
Q 001814 154 GGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVG-LATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 154 ~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~-ld~~I~IwD~~Tle~l~tL~t 229 (1010)
..+++|+|||+..+..+++|..+ ..+..|.|+| .+.|.+ ++..+++||.+.+-+.++...
T Consensus 89 ----------------sasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStdtd~~iwD~Rk~Gc~~~~~s 152 (825)
T KOG0267|consen 89 ----------------SASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYGEFFASGSTDTDLKIWDIRKKGCSHTYKS 152 (825)
T ss_pred ----------------ccCCceeeeehhhhhhhhhhhccccCcceeeeccceEEeccccccccceehhhhccCceeeecC
Confidence 02479999999999999999876 6899999999 455554 466799999987777776655
Q ss_pred cCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhccc
Q 001814 230 YPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGL 309 (1010)
Q Consensus 230 ~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi 309 (1010)
++. +...++|.| + |.+
T Consensus 153 ~~~-------------vv~~l~lsP-------~----------------------------Gr~---------------- 168 (825)
T KOG0267|consen 153 HTR-------------VVDVLRLSP-------D----------------------------GRW---------------- 168 (825)
T ss_pred Ccc-------------eeEEEeecC-------C----------------------------Cce----------------
Confidence 433 012233322 1 001
Q ss_pred ceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEc
Q 001814 310 SKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASV 389 (1010)
Q Consensus 310 ~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~ 389 (1010)
.++++.|.+|+|||+..|+.+..|..|..+|.+|-|.|..-+|+++|.
T Consensus 169 --------------------------------v~~g~ed~tvki~d~~agk~~~ef~~~e~~v~sle~hp~e~Lla~Gs~ 216 (825)
T KOG0267|consen 169 --------------------------------VASGGEDNTVKIWDLTAGKLSKEFKSHEGKVQSLEFHPLEVLLAPGSS 216 (825)
T ss_pred --------------------------------eeccCCcceeeeecccccccccccccccccccccccCchhhhhccCCC
Confidence 113456889999999999999999999999999999999999999999
Q ss_pred CCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCC
Q 001814 390 YGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSK 452 (1010)
Q Consensus 390 dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~d 452 (1010)
|++ +++||+.+. ..+- ..+.....|.+++|+||++.+++|-..
T Consensus 217 d~t-v~f~dletf-----------------e~I~--s~~~~~~~v~~~~fn~~~~~~~~G~q~ 259 (825)
T KOG0267|consen 217 DRT-VRFWDLETF-----------------EVIS--SGKPETDGVRSLAFNPDGKIVLSGEQI 259 (825)
T ss_pred Cce-eeeecccee-----------------EEee--ccCCccCCceeeeecCCceeeecCchh
Confidence 655 999998642 1111 112223479999999999999887554
No 103
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.31 E-value=4.7e-11 Score=135.60 Aligned_cols=109 Identities=23% Similarity=0.368 Sum_probs=85.3
Q ss_pred ccCCCCeEEEEECCCCcEEEEe-ccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQF-KAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~-~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
.++.+|.|++||+.....+..+ .+|..|...+||+|... +||+.+. +..|.+||+... +.
T Consensus 182 ~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~-Dkki~~yD~~s~-----------------~s 243 (673)
T KOG4378|consen 182 IASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGY-DKKINIYDIRSQ-----------------AS 243 (673)
T ss_pred eeccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecc-cceEEEeecccc-----------------cc
Confidence 4578999999999988777655 68999999999999874 7788888 456999998421 11
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccc-cccc
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGF-QTLS 473 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~-~~H~ 473 (1010)
...|- ..+....++|+++|.+||+|+.+|.|..|++.-.+.++.+ .+|.
T Consensus 244 ~~~l~---y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~~k~Pv~v~sah~ 293 (673)
T KOG4378|consen 244 TDRLT---YSHPLSTVAFSECGTYLCAGNSKGELIAYDMRSTKAPVAVRSAHD 293 (673)
T ss_pred cceee---ecCCcceeeecCCceEEEeecCCceEEEEecccCCCCceEeeecc
Confidence 11221 1246789999999999999999999999999988877765 4554
No 104
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.31 E-value=6e-10 Score=118.44 Aligned_cols=175 Identities=14% Similarity=0.121 Sum_probs=120.7
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeE-EEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIV-AVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGY 247 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlL-AV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~ 247 (1010)
+++|++||+.+++.+..+..+..+..+++++ +.| +++ .++.|++||+.+++....+.....
T Consensus 10 d~~v~~~d~~t~~~~~~~~~~~~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~--------------- 74 (300)
T TIGR03866 10 DNTISVIDTATLEVTRTFPVGQRPRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPD--------------- 74 (300)
T ss_pred CCEEEEEECCCCceEEEEECCCCCCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCC---------------
Confidence 4789999999999999998777778888887 345 444 356899999998776554432111
Q ss_pred cceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCC
Q 001814 248 GPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSS 327 (1010)
Q Consensus 248 gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s 327 (1010)
+..+++.+ ++..+.
T Consensus 75 ------~~~~~~~~---------------------------~g~~l~--------------------------------- 88 (300)
T TIGR03866 75 ------PELFALHP---------------------------NGKILY--------------------------------- 88 (300)
T ss_pred ------ccEEEECC---------------------------CCCEEE---------------------------------
Confidence 11122221 111110
Q ss_pred CccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCC
Q 001814 328 PVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSG 407 (1010)
Q Consensus 328 ~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~ 407 (1010)
+.+..++.|.+||+.+.+.+..+..+ ..+..++|+|+|.+|++++.++..+.+|+...
T Consensus 89 ---------------~~~~~~~~l~~~d~~~~~~~~~~~~~-~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~------ 146 (300)
T TIGR03866 89 ---------------IANEDDNLVTVIDIETRKVLAEIPVG-VEPEGMAVSPDGKIVVNTSETTNMAHFIDTKT------ 146 (300)
T ss_pred ---------------EEcCCCCeEEEEECCCCeEEeEeeCC-CCcceEEECCCCCEEEEEecCCCeEEEEeCCC------
Confidence 11235789999999998888877644 34678999999999999998777788888742
Q ss_pred CCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe-CCCeEEEEeCCCC
Q 001814 408 SGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS-SKGTCHVFVLSPF 463 (1010)
Q Consensus 408 sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS-~dGTVhIw~I~~~ 463 (1010)
+ ..+..+..+ ..+..++|+||+++|++++ .+++|++|++...
T Consensus 147 -~----------~~~~~~~~~---~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~ 189 (300)
T TIGR03866 147 -Y----------EIVDNVLVD---QRPRFAEFTADGKELWVSSEIGGTVSVIDVATR 189 (300)
T ss_pred -C----------eEEEEEEcC---CCccEEEECCCCCEEEEEcCCCCEEEEEEcCcc
Confidence 2 233222222 2356799999999997655 5899999999754
No 105
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.31 E-value=9e-11 Score=133.44 Aligned_cols=259 Identities=17% Similarity=0.219 Sum_probs=165.1
Q ss_pred CCeEEEEEecCcEEEEEccCCCcce---Eeee-eccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNGFQVLDVEDASNFN---ELVS-KRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQN 149 (1010)
Q Consensus 74 ~~~vLalGy~~G~qVWDv~~~g~v~---ellS-~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~ 149 (1010)
+-+-+-+|..++++|||+...++-. +|-. .++.-++.++++|++ |.| + |.|+
T Consensus 430 ~trhVyTgGkgcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdg------------rtL-i-vGGe---------- 485 (705)
T KOG0639|consen 430 PTRHVYTGGKGCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDG------------RTL-I-VGGE---------- 485 (705)
T ss_pred CcceeEecCCCeEEEeeccCCCCCCccccccccCcccceeeeEecCCC------------ceE-E-eccc----------
Confidence 4455667779999999997544321 1111 246678888888876 233 3 2321
Q ss_pred CCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC---cEEEEEEcCC---eEEEEeCCeEEEEECCCCce
Q 001814 150 RSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS---SVCMVRCSPR---IVAVGLATQIYCFDALTLEN 223 (1010)
Q Consensus 150 ~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S---~V~sVa~S~r---lLAV~ld~~I~IwD~~Tle~ 223 (1010)
..+|.||||.+-+.--..+..+ ..++++.+++ .++++.++.|.|||+.+...
T Consensus 486 ----------------------astlsiWDLAapTprikaeltssapaCyALa~spDakvcFsccsdGnI~vwDLhnq~~ 543 (705)
T KOG0639|consen 486 ----------------------ASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTL 543 (705)
T ss_pred ----------------------cceeeeeeccCCCcchhhhcCCcchhhhhhhcCCccceeeeeccCCcEEEEEccccee
Confidence 2579999998765433333433 4688889884 34567788999999999888
Q ss_pred eEEEeecCCccccCCCccccccCccceEEcc---c-eEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeeh
Q 001814 224 KFSVLTYPVPQLAGQGAVGINVGYGPMAVGP---R-WLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAM 299 (1010)
Q Consensus 224 l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp---R-wLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~ 299 (1010)
+..+.+|+. +...+.++. + |-.-..+.++-||.
T Consensus 544 VrqfqGhtD-------------GascIdis~dGtklWTGGlDntvRcWDl------------------------------ 580 (705)
T KOG0639|consen 544 VRQFQGHTD-------------GASCIDISKDGTKLWTGGLDNTVRCWDL------------------------------ 580 (705)
T ss_pred eecccCCCC-------------CceeEEecCCCceeecCCCccceeehhh------------------------------
Confidence 888888876 234455543 1 11111233444542
Q ss_pred hhhhhhhcccceeeccc--cccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE
Q 001814 300 EHSKQFAAGLSKTLSKY--CQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCF 377 (1010)
Q Consensus 300 dssk~la~Gi~ktls~y--~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaF 377 (1010)
..|. .+..| ..++++-+ ...+.+|- +-+...+.|-|..... ....++.-|.+-|.+|.|
T Consensus 581 ------regr--qlqqhdF~SQIfSLg----~cP~~dWl------avGMens~vevlh~sk-p~kyqlhlheScVLSlKF 641 (705)
T KOG0639|consen 581 ------REGR--QLQQHDFSSQIFSLG----YCPTGDWL------AVGMENSNVEVLHTSK-PEKYQLHLHESCVLSLKF 641 (705)
T ss_pred ------hhhh--hhhhhhhhhhheecc----cCCCccce------eeecccCcEEEEecCC-ccceeecccccEEEEEEe
Confidence 2221 01010 11111111 01122332 2345667777766543 345678889999999999
Q ss_pred CCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEE
Q 001814 378 DPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHV 457 (1010)
Q Consensus 378 SPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhI 457 (1010)
.+-|+++++.+. +..++.|+. |. | ..+++... ...|.++..|-|.++|++||.|....|
T Consensus 642 a~cGkwfvStGk-DnlLnawrt-Py------G----------asiFqskE---~SsVlsCDIS~ddkyIVTGSGdkkATV 700 (705)
T KOG0639|consen 642 AYCGKWFVSTGK-DNLLNAWRT-PY------G----------ASIFQSKE---SSSVLSCDISFDDKYIVTGSGDKKATV 700 (705)
T ss_pred cccCceeeecCc-hhhhhhccC-cc------c----------cceeeccc---cCcceeeeeccCceEEEecCCCcceEE
Confidence 999999999998 567999998 43 5 35666543 346999999999999999999998889
Q ss_pred EeCC
Q 001814 458 FVLS 461 (1010)
Q Consensus 458 w~I~ 461 (1010)
|.|.
T Consensus 701 YeV~ 704 (705)
T KOG0639|consen 701 YEVI 704 (705)
T ss_pred EEEe
Confidence 8763
No 106
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.30 E-value=8.4e-12 Score=145.29 Aligned_cols=187 Identities=16% Similarity=0.192 Sum_probs=137.9
Q ss_pred CCEEEEEeCCCCe------EEEEEeCCC-cEEEEEE--cCC-eEEEEeCCeEEEEECCCC--ceeEEEeecCCccccCCC
Q 001814 172 PTAVRFYSFQSHC------YEHVLRFRS-SVCMVRC--SPR-IVAVGLATQIYCFDALTL--ENKFSVLTYPVPQLAGQG 239 (1010)
Q Consensus 172 p~tVrIWDlktge------~V~tL~f~S-~V~sVa~--S~r-lLAV~ld~~I~IwD~~Tl--e~l~tL~t~p~p~~~~~g 239 (1010)
++.|++|+..... ++..++.|+ .|.++.+ +.+ ++.++.|.+|++|++... -+..+|.+|..-
T Consensus 46 Dg~i~~W~~~~d~~~~s~~~~asme~HsDWVNDiiL~~~~~tlIS~SsDtTVK~W~~~~~~~~c~stir~H~DY------ 119 (735)
T KOG0308|consen 46 DGIIRLWSVTQDSNEPSTPYIASMEHHSDWVNDIILCGNGKTLISASSDTTVKVWNAHKDNTFCMSTIRTHKDY------ 119 (735)
T ss_pred CceEEEeccccccCCcccchhhhhhhhHhHHhhHHhhcCCCceEEecCCceEEEeecccCcchhHhhhhcccch------
Confidence 4789999986432 466777775 7866655 444 445566778999998765 455566666441
Q ss_pred ccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeecccccc
Q 001814 240 AVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQE 319 (1010)
Q Consensus 240 ~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~ 319 (1010)
.+.|||+... .-+
T Consensus 120 --------------Vkcla~~ak~---------------------------~~l-------------------------- 132 (735)
T KOG0308|consen 120 --------------VKCLAYIAKN---------------------------NEL-------------------------- 132 (735)
T ss_pred --------------heeeeecccC---------------------------cee--------------------------
Confidence 1446663210 001
Q ss_pred ccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc--E--------EEEec-cCCCCeEEEEECCCCCEEEEEE
Q 001814 320 LLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA--I--------ISQFK-AHTSPISALCFDPSGTLLVTAS 388 (1010)
Q Consensus 320 l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~--~--------v~~~~-aHtspIsaLaFSPdGtlLATAS 388 (1010)
.++|+-|+.|.|||+.++. . ...+. +|..+|.+|+.++.|+++|+|+
T Consensus 133 ----------------------vaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGg 190 (735)
T KOG0308|consen 133 ----------------------VASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGG 190 (735)
T ss_pred ----------------------EEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecC
Confidence 2356789999999999762 2 22333 8899999999999999999999
Q ss_pred cCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccc
Q 001814 389 VYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 389 ~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~ 468 (1010)
..+ .||+||... + ..+.+|+ ||+ ..|..|-.++||+.+.++|+||||+||+|....+..+
T Consensus 191 tek-~lr~wDprt-------~----------~kimkLr-GHT-dNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T 250 (735)
T KOG0308|consen 191 TEK-DLRLWDPRT-------C----------KKIMKLR-GHT-DNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLAT 250 (735)
T ss_pred ccc-ceEEecccc-------c----------cceeeee-ccc-cceEEEEEcCCCCeEeecCCCceEEeeeccccceeee
Confidence 954 699999753 3 4777885 766 4699999999999999999999999999999888888
Q ss_pred ccccc
Q 001814 469 FQTLS 473 (1010)
Q Consensus 469 ~~~H~ 473 (1010)
++.|.
T Consensus 251 ~~vH~ 255 (735)
T KOG0308|consen 251 YIVHK 255 (735)
T ss_pred EEecc
Confidence 88774
No 107
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.30 E-value=1.2e-10 Score=126.93 Aligned_cols=240 Identities=19% Similarity=0.227 Sum_probs=159.2
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
-+.+|++|..+| +-|||... -..-.+++.|--||.++.+++++ .+|+..+
T Consensus 34 ~G~~lAvGc~nG~vvI~D~~T-~~iar~lsaH~~pi~sl~WS~dg-------------r~LltsS--------------- 84 (405)
T KOG1273|consen 34 WGDYLAVGCANGRVVIYDFDT-FRIARMLSAHVRPITSLCWSRDG-------------RKLLTSS--------------- 84 (405)
T ss_pred CcceeeeeccCCcEEEEEccc-cchhhhhhccccceeEEEecCCC-------------CEeeeec---------------
Confidence 456999999998 89999965 44778999999999999999876 2334211
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCC----eEEEEeCCeEEEEECCCCceeEEEe
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPR----IVAVGLATQIYCFDALTLENKFSVL 228 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~r----lLAV~ld~~I~IwD~~Tle~l~tL~ 228 (1010)
.+..|++||+..|.+++.++|+++|+.+.+.|+ .||.-.+..-++-++.+ ..+++.
T Consensus 85 ------------------~D~si~lwDl~~gs~l~rirf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~--~~h~~L 144 (405)
T KOG1273|consen 85 ------------------RDWSIKLWDLLKGSPLKRIRFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSD--PKHSVL 144 (405)
T ss_pred ------------------CCceeEEEeccCCCceeEEEccCccceeeeccccCCeEEEEEecCCcEEEEecC--Cceeec
Confidence 247899999999999999999999999999872 33333344333333333 112221
Q ss_pred ecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcc
Q 001814 229 TYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAG 308 (1010)
Q Consensus 229 t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~G 308 (1010)
. -... |.. ..+++.+
T Consensus 145 p-----------------------------~d~d-------~dl------------n~sas~~----------------- 159 (405)
T KOG1273|consen 145 P-----------------------------KDDD-------GDL------------NSSASHG----------------- 159 (405)
T ss_pred c-----------------------------CCCc-------ccc------------ccccccc-----------------
Confidence 1 0000 000 0000000
Q ss_pred cceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCC-CCeEEEEECCCCCEEEEE
Q 001814 309 LSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHT-SPISALCFDPSGTLLVTA 387 (1010)
Q Consensus 309 i~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHt-spIsaLaFSPdGtlLATA 387 (1010)
+++..++ .+.+|...|.+.|+|+.+.++++.|+-.+ ..|..+-|+..|..|++-
T Consensus 160 ---------------------~fdr~g~----yIitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g~~liiN 214 (405)
T KOG1273|consen 160 ---------------------VFDRRGK----YIITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSRKGRFLIIN 214 (405)
T ss_pred ---------------------cccCCCC----EEEEecCcceEEEEecchheeeeeeeechheeeeEEEEeccCcEEEEe
Confidence 1111122 13456789999999999999999998776 899999999999999999
Q ss_pred EcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCC-eEEEEeCC
Q 001814 388 SVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKG-TCHVFVLS 461 (1010)
Q Consensus 388 S~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dG-TVhIw~I~ 461 (1010)
+. +++||+|++...-.-+..| .....++++-=.....-.+++||.||.||+++|.+. .+.||.-.
T Consensus 215 ts-DRvIR~ye~~di~~~~r~~--------e~e~~~K~qDvVNk~~Wk~ccfs~dgeYv~a~s~~aHaLYIWE~~ 280 (405)
T KOG1273|consen 215 TS-DRVIRTYEISDIDDEGRDG--------EVEPEHKLQDVVNKLQWKKCCFSGDGEYVCAGSARAHALYIWEKS 280 (405)
T ss_pred cC-CceEEEEehhhhcccCccC--------CcChhHHHHHHHhhhhhhheeecCCccEEEeccccceeEEEEecC
Confidence 88 6889999986321100001 112223332111122346789999999999999764 48899854
No 108
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.29 E-value=1.6e-11 Score=135.64 Aligned_cols=215 Identities=18% Similarity=0.177 Sum_probs=140.9
Q ss_pred CCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcC-CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSP-RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGP 249 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S-~V~sVa~S~-rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gp 249 (1010)
|+.|+|||+.+.+++.+++.|. .|.+|.+.. .++.||.+.+|+.|-+.- ..++++..... +.++. .
T Consensus 88 DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~~~~~tvgdDKtvK~wk~~~-~p~~tilg~s~-------~~gId----h 155 (433)
T KOG0268|consen 88 DGEVKIWNLSQRECIRTFKAHEGLVRGICVTQTSFFTVGDDKTVKQWKIDG-PPLHTILGKSV-------YLGID----H 155 (433)
T ss_pred CceEEEEehhhhhhhheeecccCceeeEEecccceEEecCCcceeeeeccC-Ccceeeecccc-------ccccc----c
Confidence 5789999999999999999874 899999977 677888899999997654 35566554221 00000 0
Q ss_pred eEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCc
Q 001814 250 MAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPV 329 (1010)
Q Consensus 250 lAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~ 329 (1010)
+- ...-.|..|..+.|||.-+..| ..+... | + .+....
T Consensus 156 ~~-~~~~FaTcGe~i~IWD~~R~~P------v~smsw---------------------G-------~-------Dti~sv 193 (433)
T KOG0268|consen 156 HR-KNSVFATCGEQIDIWDEQRDNP------VSSMSW---------------------G-------A-------DSISSV 193 (433)
T ss_pred cc-ccccccccCceeeecccccCCc------cceeec---------------------C-------C-------CceeEE
Confidence 00 0022355667788998543321 111000 0 0 001112
Q ss_pred cCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCC
Q 001814 330 SPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSG 409 (1010)
Q Consensus 330 S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG 409 (1010)
.+||.- ...++++..|+.|.|||+....++..+.- +..-+.|||+|.+--+++|++ ++.+..||+... -
T Consensus 194 kfNpvE---TsILas~~sDrsIvLyD~R~~~Pl~KVi~-~mRTN~IswnPeafnF~~a~E-D~nlY~~DmR~l------~ 262 (433)
T KOG0268|consen 194 KFNPVE---TSILASCASDRSIVLYDLRQASPLKKVIL-TMRTNTICWNPEAFNFVAANE-DHNLYTYDMRNL------S 262 (433)
T ss_pred ecCCCc---chheeeeccCCceEEEecccCCccceeee-eccccceecCccccceeeccc-cccceehhhhhh------c
Confidence 233321 11345778999999999998876654432 233467999997777777777 678999998531 0
Q ss_pred CCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 410 NHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.+ ..++|+.+| |.++.|||.|+-+++||-|.||+||.++..
T Consensus 263 ----------~p~-~v~~dhvsA-V~dVdfsptG~EfvsgsyDksIRIf~~~~~ 304 (433)
T KOG0268|consen 263 ----------RPL-NVHKDHVSA-VMDVDFSPTGQEFVSGSYDKSIRIFPVNHG 304 (433)
T ss_pred ----------ccc-hhhccccee-EEEeccCCCcchhccccccceEEEeecCCC
Confidence 122 234676665 999999999999999999999999998754
No 109
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.27 E-value=1e-10 Score=130.85 Aligned_cols=240 Identities=13% Similarity=0.167 Sum_probs=154.7
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeee-ccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSK-RDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~-hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
..++|...|.++.|-. .++.+|+.|.+.-+.+||++ +|+++.++.. +.-.|.+.++.|++ |+
T Consensus 264 tlvgh~~~V~yi~wSP------DdryLlaCg~~e~~~lwDv~-tgd~~~~y~~~~~~S~~sc~W~pDg--------~~-- 326 (519)
T KOG0293|consen 264 TLVGHSQPVSYIMWSP------DDRYLLACGFDEVLSLWDVD-TGDLRHLYPSGLGFSVSSCAWCPDG--------FR-- 326 (519)
T ss_pred eeecccCceEEEEECC------CCCeEEecCchHheeeccCC-cchhhhhcccCcCCCcceeEEccCC--------ce--
Confidence 3467788888888852 24667777777789999996 5666666653 45678999999987 33
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC--CcEEEEEEcC---Ce
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR--SSVCMVRCSP---RI 204 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~--S~V~sVa~S~---rl 204 (1010)
. |+|. .++++.-||+.. ..+...+.- ..|+++++.. .+
T Consensus 327 ----~-V~Gs-------------------------------~dr~i~~wdlDg-n~~~~W~gvr~~~v~dlait~Dgk~v 369 (519)
T KOG0293|consen 327 ----F-VTGS-------------------------------PDRTIIMWDLDG-NILGNWEGVRDPKVHDLAITYDGKYV 369 (519)
T ss_pred ----e-EecC-------------------------------CCCcEEEecCCc-chhhcccccccceeEEEEEcCCCcEE
Confidence 2 2331 247899999954 334444432 3689999876 47
Q ss_pred EEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ce--EEEccCCeeeccCCccCCCcCCCC
Q 001814 205 VAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RW--LAYASNTLLLSNSGRLSPQNLTPS 280 (1010)
Q Consensus 205 LAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--Rw--LAyas~~~~iwd~G~vs~Q~lt~p 280 (1010)
++|+.+..|++|+..+......+.+... ...+.++- ++ +......+++||...
T Consensus 370 l~v~~d~~i~l~~~e~~~dr~lise~~~--------------its~~iS~d~k~~LvnL~~qei~LWDl~e--------- 426 (519)
T KOG0293|consen 370 LLVTVDKKIRLYNREARVDRGLISEEQP--------------ITSFSISKDGKLALVNLQDQEIHLWDLEE--------- 426 (519)
T ss_pred EEEecccceeeechhhhhhhccccccCc--------------eeEEEEcCCCcEEEEEcccCeeEEeecch---------
Confidence 7788889999999887665543333211 12344433 22 234455678887421
Q ss_pred CCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc
Q 001814 281 GVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA 360 (1010)
Q Consensus 281 ~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~ 360 (1010)
..+|.+| .|. +.+.|.-. ++ | . + ....-+++|+.|+.|.||+..+++
T Consensus 427 ----------~~lv~kY---------~Gh--kq~~fiIr-----SC----F-g-g-~~~~fiaSGSED~kvyIWhr~sgk 473 (519)
T KOG0293|consen 427 ----------NKLVRKY---------FGH--KQGHFIIR-----SC----F-G-G-GNDKFIASGSEDSKVYIWHRISGK 473 (519)
T ss_pred ----------hhHHHHh---------hcc--cccceEEE-----ec----c-C-C-CCcceEEecCCCceEEEEEccCCc
Confidence 0111111 010 11111100 00 0 0 0 000124688999999999999999
Q ss_pred EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCC
Q 001814 361 IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p 401 (1010)
+++.+.+|...|++++++|.. .++|+||.||+ ||||-..+
T Consensus 474 ll~~LsGHs~~vNcVswNP~~p~m~ASasDDgt-IRIWg~~~ 514 (519)
T KOG0293|consen 474 LLAVLSGHSKTVNCVSWNPADPEMFASASDDGT-IRIWGPSD 514 (519)
T ss_pred eeEeecCCcceeeEEecCCCCHHHhhccCCCCe-EEEecCCc
Confidence 999999999999999999977 58999999876 99998753
No 110
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.27 E-value=4.1e-10 Score=123.01 Aligned_cols=221 Identities=14% Similarity=0.194 Sum_probs=142.5
Q ss_pred CCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
.+..++.|..+ .|.|||+....+.. .|..|.|.|++++|.|.- ..+ .||. +.
T Consensus 52 s~~~~aSGssDetI~IYDm~k~~qlg-~ll~HagsitaL~F~~~~---------S~s-hLlS--~s-------------- 104 (362)
T KOG0294|consen 52 SGPYVASGSSDETIHIYDMRKRKQLG-ILLSHAGSITALKFYPPL---------SKS-HLLS--GS-------------- 104 (362)
T ss_pred cceeEeccCCCCcEEEEeccchhhhc-ceeccccceEEEEecCCc---------chh-heee--ec--------------
Confidence 45678888764 69999997655444 445589999999998653 222 2333 11
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--C-eEEEEeCCeEEEEECCCCceeEEEe
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--R-IVAVGLATQIYCFDALTLENKFSVL 228 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--r-lLAV~ld~~I~IwD~~Tle~l~tL~ 228 (1010)
-++.|.+|+...-+++++++-| ..|..|++.| + -|.|+.|+.++.||+-+++.-+.+.
T Consensus 105 ------------------dDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~~ 166 (362)
T KOG0294|consen 105 ------------------DDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVLN 166 (362)
T ss_pred ------------------CCCcEEEEEcCCeEEeeeecccccccceeEecCCCceEEEEcCCceeeeehhhcCccceeec
Confidence 1467999999999999999876 4899999988 3 3557888899999999988766654
Q ss_pred ecCCccccCCCccccccCccceEEcc---ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhh
Q 001814 229 TYPVPQLAGQGAVGINVGYGPMAVGP---RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQF 305 (1010)
Q Consensus 229 t~p~p~~~~~g~~~vnv~~gplAlgp---RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~l 305 (1010)
--..+ ..+.+.| ++.....+.+.+|-.+..+ + . +.+
T Consensus 167 L~~~a--------------t~v~w~~~Gd~F~v~~~~~i~i~q~d~A~-------------------v----~----~~i 205 (362)
T KOG0294|consen 167 LKNKA--------------TLVSWSPQGDHFVVSGRNKIDIYQLDNAS-------------------V----F----REI 205 (362)
T ss_pred cCCcc--------------eeeEEcCCCCEEEEEeccEEEEEecccHh-------------------H----h----hhh
Confidence 21110 0122222 2222222223333211000 0 0 000
Q ss_pred hcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE--CCCCCE
Q 001814 306 AAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCF--DPSGTL 383 (1010)
Q Consensus 306 a~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaF--SPdGtl 383 (1010)
..-. +. .|. ++. .. ..+..|..++.|.+||..++.+...|.||...|-.+.+ +|++.+
T Consensus 206 ~~~~-r~---l~~---------~~l--~~-----~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~~ 265 (362)
T KOG0294|consen 206 ENPK-RI---LCA---------TFL--DG-----SELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHEY 265 (362)
T ss_pred hccc-cc---eee---------eec--CC-----ceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecCCceE
Confidence 0000 00 000 000 00 02345678899999999999999999999999999984 789999
Q ss_pred EEEEEcCCCeEEEEeCCC
Q 001814 384 LVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 384 LATAS~dGt~IrVwdi~p 401 (1010)
|+|||.||- |+|||+..
T Consensus 266 lvTaSSDG~-I~vWd~~~ 282 (362)
T KOG0294|consen 266 LVTASSDGF-IKVWDIDM 282 (362)
T ss_pred EEEeccCce-EEEEEccc
Confidence 999999875 99999953
No 111
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.27 E-value=3.1e-10 Score=133.26 Aligned_cols=214 Identities=14% Similarity=0.123 Sum_probs=151.8
Q ss_pred EEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccc
Q 001814 77 VLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGG 155 (1010)
Q Consensus 77 vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~ 155 (1010)
.|++|..+ .+.||.+. ..+-..+|.+|...|.++...-++ . +++
T Consensus 73 ~l~~g~~D~~i~v~~~~-~~~P~~~LkgH~snVC~ls~~~~~--------------~--~iS------------------ 117 (745)
T KOG0301|consen 73 RLVVGGMDTTIIVFKLS-QAEPLYTLKGHKSNVCSLSIGEDG--------------T--LIS------------------ 117 (745)
T ss_pred ceEeecccceEEEEecC-CCCchhhhhccccceeeeecCCcC--------------c--eEe------------------
Confidence 35555555 47899985 455667889999999999865332 2 223
Q ss_pred cccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC-CeEEE-EeCCeEEEEECCCCceeEEEeecCC
Q 001814 156 VRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP-RIVAV-GLATQIYCFDALTLENKFSVLTYPV 232 (1010)
Q Consensus 156 vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~-rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~ 232 (1010)
|+|| +|+++|-. ++++.+++.| +.|++|..-+ +.+++ +.|..|++|... +.+.++..|..
T Consensus 118 ---gSWD----------~TakvW~~--~~l~~~l~gH~asVWAv~~l~e~~~vTgsaDKtIklWk~~--~~l~tf~gHtD 180 (745)
T KOG0301|consen 118 ---GSWD----------STAKVWRI--GELVYSLQGHTASVWAVASLPENTYVTGSADKTIKLWKGG--TLLKTFSGHTD 180 (745)
T ss_pred ---cccc----------cceEEecc--hhhhcccCCcchheeeeeecCCCcEEeccCcceeeeccCC--chhhhhccchh
Confidence 4564 78999965 5666667766 5899998866 44444 556689999864 45566665543
Q ss_pred ccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhccccee
Q 001814 233 PQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKT 312 (1010)
Q Consensus 233 p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~kt 312 (1010)
- .|-||.-+.
T Consensus 181 ~--------------------VRgL~vl~~-------------------------------------------------- 190 (745)
T KOG0301|consen 181 C--------------------VRGLAVLDD-------------------------------------------------- 190 (745)
T ss_pred h--------------------eeeeEEecC--------------------------------------------------
Confidence 0 122222111
Q ss_pred eccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC
Q 001814 313 LSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN 392 (1010)
Q Consensus 313 ls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt 392 (1010)
+ .+.++++||.|+.||+ +++++....+|++-|.+++..+++.+++|+++|++
T Consensus 191 -----------~----------------~flScsNDg~Ir~w~~-~ge~l~~~~ghtn~vYsis~~~~~~~Ivs~gEDrt 242 (745)
T KOG0301|consen 191 -----------S----------------HFLSCSNDGSIRLWDL-DGEVLLEMHGHTNFVYSISMALSDGLIVSTGEDRT 242 (745)
T ss_pred -----------C----------------CeEeecCCceEEEEec-cCceeeeeeccceEEEEEEecCCCCeEEEecCCce
Confidence 0 1235789999999999 78899999999999999998899999999999765
Q ss_pred eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 393 NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 393 ~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+|||+.. .++..+.- ....||++.+=++|. |++|++||-|+||...+.
T Consensus 243 -lriW~~~-------------------e~~q~I~l--PttsiWsa~~L~NgD-Ivvg~SDG~VrVfT~~k~ 290 (745)
T KOG0301|consen 243 -LRIWKKD-------------------ECVQVITL--PTTSIWSAKVLLNGD-IVVGGSDGRVRVFTVDKD 290 (745)
T ss_pred -EEEeecC-------------------ceEEEEec--CccceEEEEEeeCCC-EEEeccCceEEEEEeccc
Confidence 9999973 23344321 122699999887776 557888999999998753
No 112
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.27 E-value=1.7e-10 Score=130.52 Aligned_cols=219 Identities=17% Similarity=0.216 Sum_probs=151.2
Q ss_pred CCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCC
Q 001814 72 SVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNR 150 (1010)
Q Consensus 72 ~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~ 150 (1010)
++++++|++|..+- |+|||+++ .+-.+.+.+|-+.|..+.|--.+. .| .+.+
T Consensus 211 S~Dgkylatgg~d~~v~Iw~~~t-~ehv~~~~ghr~~V~~L~fr~gt~------------~l-ys~s------------- 263 (479)
T KOG0299|consen 211 SSDGKYLATGGRDRHVQIWDCDT-LEHVKVFKGHRGAVSSLAFRKGTS------------EL-YSAS------------- 263 (479)
T ss_pred cCCCcEEEecCCCceEEEecCcc-cchhhcccccccceeeeeeecCcc------------ce-eeee-------------
Confidence 35788999998765 89999964 566778899999999999763331 12 2211
Q ss_pred CCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe-CCCcEEEEEEcC--CeEEEE-eCCeEEEEECCCCceeEE
Q 001814 151 SHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR-FRSSVCMVRCSP--RIVAVG-LATQIYCFDALTLENKFS 226 (1010)
Q Consensus 151 ~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~-f~S~V~sVa~S~--rlLAV~-ld~~I~IwD~~Tle~l~t 226 (1010)
.+++|++|++....++.++- +++.|.+|.... +.+.|+ -|.++++|++..-.. ..
T Consensus 264 --------------------~Drsvkvw~~~~~s~vetlyGHqd~v~~IdaL~reR~vtVGgrDrT~rlwKi~eesq-li 322 (479)
T KOG0299|consen 264 --------------------ADRSVKVWSIDQLSYVETLYGHQDGVLGIDALSRERCVTVGGRDRTVRLWKIPEESQ-LI 322 (479)
T ss_pred --------------------cCCceEEEehhHhHHHHHHhCCccceeeechhcccceEEeccccceeEEEeccccce-ee
Confidence 35889999999999988874 557999998865 566676 677899999832111 00
Q ss_pred EeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhh
Q 001814 227 VLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFA 306 (1010)
Q Consensus 227 L~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la 306 (1010)
...+. + ++..+||-.+.
T Consensus 323 frg~~-------~-------------sidcv~~In~~------------------------------------------- 339 (479)
T KOG0299|consen 323 FRGGE-------G-------------SIDCVAFINDE------------------------------------------- 339 (479)
T ss_pred eeCCC-------C-------------CeeeEEEeccc-------------------------------------------
Confidence 11000 0 01112222110
Q ss_pred cccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec-cCC-----------CCeEE
Q 001814 307 AGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK-AHT-----------SPISA 374 (1010)
Q Consensus 307 ~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~-aHt-----------spIsa 374 (1010)
-+++|+.+|.|.+|++..++++.+.+ ||. ..|++
T Consensus 340 ----------------------------------HfvsGSdnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Wits 385 (479)
T KOG0299|consen 340 ----------------------------------HFVSGSDNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITS 385 (479)
T ss_pred ----------------------------------ceeeccCCceEEEeeecccCceeEeeccccccCCccccccccceee
Confidence 12356789999999999988776554 441 26899
Q ss_pred EEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCC
Q 001814 375 LCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSK 452 (1010)
Q Consensus 375 LaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~d 452 (1010)
|+.-|...+||++|.+|. +|+|.+.+.. -....++.+. -...|++|+|+++|++|.+|-..
T Consensus 386 la~i~~sdL~asGS~~G~-vrLW~i~~g~-------------r~i~~l~~ls---~~GfVNsl~f~~sgk~ivagiGk 446 (479)
T KOG0299|consen 386 LAVIPGSDLLASGSWSGC-VRLWKIEDGL-------------RAINLLYSLS---LVGFVNSLAFSNSGKRIVAGIGK 446 (479)
T ss_pred eEecccCceEEecCCCCc-eEEEEecCCc-------------cccceeeecc---cccEEEEEEEccCCCEEEEeccc
Confidence 999999999999999776 9999997631 1234667664 13469999999999988888443
No 113
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.26 E-value=9.9e-11 Score=132.31 Aligned_cols=106 Identities=18% Similarity=0.279 Sum_probs=87.0
Q ss_pred cccCCCCeEEEEECC--CCcEEEEeccCCCCeEEEEECCC-CCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 343 ADMDNAGIVVVKDFV--TRAIISQFKAHTSPISALCFDPS-GTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 343 asgs~dG~V~VwDl~--s~~~v~~~~aHtspIsaLaFSPd-GtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
++++.|+.+.|||+. +.+......||+.+|.|++|+|- +.+|||||.|++ +++||+... -
T Consensus 244 ~sv~dd~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D~t-V~LwDlRnL------~---------- 306 (422)
T KOG0264|consen 244 GSVGDDGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGSADKT-VALWDLRNL------N---------- 306 (422)
T ss_pred eeecCCCeEEEEEcCCCCCCCcccccccCCceeEEEeCCCCCceEEeccCCCc-EEEeechhc------c----------
Confidence 456789999999999 55666788999999999999995 578999999776 899999642 1
Q ss_pred eEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 420 VHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 420 ~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
..++.+. |+ ...|..|.|||.- ..||+++.|+.++||++..-|++.
T Consensus 307 ~~lh~~e-~H-~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq 353 (422)
T KOG0264|consen 307 KPLHTFE-GH-EDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQ 353 (422)
T ss_pred cCceecc-CC-CcceEEEEeCCCCCceeEecccCCcEEEEecccccccc
Confidence 3677774 43 4579999999975 688889999999999999887764
No 114
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.26 E-value=2.1e-10 Score=125.50 Aligned_cols=122 Identities=18% Similarity=0.209 Sum_probs=92.0
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
++...+-||.-.....+..+-+|...|+-|+|-++|..|.+++.++..|-+|||... + ..+|+|
T Consensus 227 sY~q~~giy~~~~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~------~----------~pv~~L 290 (406)
T KOG2919|consen 227 SYGQRVGIYNDDGRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYS------R----------DPVYAL 290 (406)
T ss_pred cccceeeeEecCCCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhc------c----------chhhhh
Confidence 344456666666777888888999999999999999999999998888999999642 2 578888
Q ss_pred eccc--ccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccc-ccccCCCCCCccCCC
Q 001814 426 HRGI--TSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGF-QTLSSQGGDPYLFPV 484 (1010)
Q Consensus 426 ~RG~--t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~-~~H~s~~~~~~~~pv 484 (1010)
.|.. |+-+|+ ....|+|+|||+|+.||.|++|++..+|.++.. ..|.--+.+-.+.|+
T Consensus 291 ~rhv~~TNQRI~-FDld~~~~~LasG~tdG~V~vwdlk~~gn~~sv~~~~sd~vNgvslnP~ 351 (406)
T KOG2919|consen 291 ERHVGDTNQRIL-FDLDPKGEILASGDTDGSVRVWDLKDLGNEVSVTGNYSDTVNGVSLNPI 351 (406)
T ss_pred hhhccCccceEE-EecCCCCceeeccCCCccEEEEecCCCCCcccccccccccccceecCcc
Confidence 7643 343454 344689999999999999999999998875533 344333444445554
No 115
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.25 E-value=2.1e-10 Score=129.45 Aligned_cols=179 Identities=16% Similarity=0.180 Sum_probs=122.6
Q ss_pred CCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC---CeEEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCcccccc
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP---RIVAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINV 245 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~---rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv 245 (1010)
.++||++||+.+|++..++.++ ..|..+.+++ .+|+. +.++++.++|.+...+.
T Consensus 264 aD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s--------------------- 322 (463)
T KOG0270|consen 264 ADKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNS--------------------- 322 (463)
T ss_pred CCceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeeccCcccc---------------------
Confidence 4689999999999999999966 4899999987 45655 55789999998752110
Q ss_pred CccceEEccceEEEccC-CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCC
Q 001814 246 GYGPMAVGPRWLAYASN-TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDG 324 (1010)
Q Consensus 246 ~~gplAlgpRwLAyas~-~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~g 324 (1010)
.. .| -+.+. ....|+. +++
T Consensus 323 ---~~----~w-k~~g~VEkv~w~~----------------~se------------------------------------ 342 (463)
T KOG0270|consen 323 ---GK----EW-KFDGEVEKVAWDP----------------HSE------------------------------------ 342 (463)
T ss_pred ---Cc----eE-EeccceEEEEecC----------------CCc------------------------------------
Confidence 00 01 11111 1122320 000
Q ss_pred CCCCccCCCccccccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCC
Q 001814 325 SSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPS 402 (1010)
Q Consensus 325 s~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~ 402 (1010)
. .+..+..||+|+-+|+.+. +++.+++||..+|+.|++++.- .+|+|+|.++ ++++|++...
T Consensus 343 ----~-----------~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~~~p~~l~t~s~d~-~Vklw~~~~~ 406 (463)
T KOG0270|consen 343 ----N-----------SFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNIQTPGLLSTASTDK-VVKLWKFDVD 406 (463)
T ss_pred ----e-----------eEEEecCCceEEeeecCCCCCceeEEEeccCCcceEEecCCCCcceeeccccc-eEEEEeecCC
Confidence 0 0112357999999999875 7999999999999999999876 4889999955 5999998532
Q ss_pred cccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCC
Q 001814 403 CMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 403 ~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~ 463 (1010)
. + ...+.+-+++ | +..|.++.|+- -+||.|+.++-++||++...
T Consensus 407 ~-----~------~~v~~~~~~~--~----rl~c~~~~~~~a~~la~GG~k~~~~vwd~~~~ 451 (463)
T KOG0270|consen 407 S-----P------KSVKEHSFKL--G----RLHCFALDPDVAFTLAFGGEKAVLRVWDIFTN 451 (463)
T ss_pred C-----C------cccccccccc--c----ceeecccCCCcceEEEecCccceEEEeecccC
Confidence 0 1 1112233333 2 25778888875 57888999999999998764
No 116
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.25 E-value=1.1e-10 Score=127.08 Aligned_cols=255 Identities=15% Similarity=0.186 Sum_probs=180.2
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecCcE-EEEEccCCCcceEeee--------eccCCEEEEEEecCCCCCCCCCC
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGF-QVLDVEDASNFNELVS--------KRDGPVSFLQMQPFPVKDDGCEG 125 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~-qVWDv~~~g~v~ellS--------~hdGpV~~v~~lP~p~~s~~~D~ 125 (1010)
.|..+..|.| +|++++|+.|.-+|| .|||.. +|++++-|. -+|.+|.|+.|+.+.
T Consensus 212 ~KSh~EcA~F-------SPDgqyLvsgSvDGFiEVWny~-~GKlrKDLkYQAqd~fMMmd~aVlci~FSRDs-------- 275 (508)
T KOG0275|consen 212 QKSHVECARF-------SPDGQYLVSGSVDGFIEVWNYT-TGKLRKDLKYQAQDNFMMMDDAVLCISFSRDS-------- 275 (508)
T ss_pred cccchhheee-------CCCCceEeeccccceeeeehhc-cchhhhhhhhhhhcceeecccceEEEeecccH--------
Confidence 3444455555 468999999999995 899995 577765443 356788888877443
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe-CC-CcEEEEEEcC-
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR-FR-SSVCMVRCSP- 202 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~-f~-S~V~sVa~S~- 202 (1010)
-+||. | + -+++|++|-+++|.|++.++ .| ..|.++.|++
T Consensus 276 -----EMlAs--G---------------------s----------qDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD 317 (508)
T KOG0275|consen 276 -----EMLAS--G---------------------S----------QDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRD 317 (508)
T ss_pred -----HHhhc--c---------------------C----------cCCcEEEEEEecchHHHHhhhhhccCeeEEEEccC
Confidence 35552 1 0 14789999999999999885 44 4899999987
Q ss_pred --CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCC
Q 001814 203 --RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPS 280 (1010)
Q Consensus 203 --rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p 280 (1010)
++|..+.|..++|.-+..++++..+.+|.+- +| ... |
T Consensus 318 ~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSy---------vn----~a~-------f--------------------- 356 (508)
T KOG0275|consen 318 NSQILSASFDQTVRIHGLKSGKCLKEFRGHSSY---------VN----EAT-------F--------------------- 356 (508)
T ss_pred cchhhcccccceEEEeccccchhHHHhcCcccc---------cc----ceE-------E---------------------
Confidence 5788888999999999999988877766541 00 000 0
Q ss_pred CCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc
Q 001814 281 GVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA 360 (1010)
Q Consensus 281 ~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~ 360 (1010)
.++|+ .+++++.||+|+||+..+.+
T Consensus 357 ----------------------------------------t~dG~---------------~iisaSsDgtvkvW~~Ktte 381 (508)
T KOG0275|consen 357 ----------------------------------------TDDGH---------------HIISASSDGTVKVWHGKTTE 381 (508)
T ss_pred ----------------------------------------cCCCC---------------eEEEecCCccEEEecCcchh
Confidence 11110 13456789999999999999
Q ss_pred EEEEecc--CCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccccc-ccEEE
Q 001814 361 IISQFKA--HTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITS-ATIQD 436 (1010)
Q Consensus 361 ~v~~~~a--HtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~-a~I~s 436 (1010)
++.+|+. ..-+|..+-.=|.. ..++.|-. .+++.|-++. | +.+..+..|... ....+
T Consensus 382 C~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNr-sntv~imn~q--------G----------QvVrsfsSGkREgGdFi~ 442 (508)
T KOG0275|consen 382 CLSTFKPLGTDYPVNSVILLPKNPEHFIVCNR-SNTVYIMNMQ--------G----------QVVRSFSSGKREGGDFIN 442 (508)
T ss_pred hhhhccCCCCcccceeEEEcCCCCceEEEEcC-CCeEEEEecc--------c----------eEEeeeccCCccCCceEE
Confidence 9999985 45688887777765 45555654 4567777774 4 344555444322 23556
Q ss_pred EEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCC
Q 001814 437 ICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGD 478 (1010)
Q Consensus 437 IAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~ 478 (1010)
.+.||-|.|+.+.+.|+...-|.+...+.+.++.-|...+.+
T Consensus 443 ~~lSpkGewiYcigED~vlYCF~~~sG~LE~tl~VhEkdvIG 484 (508)
T KOG0275|consen 443 AILSPKGEWIYCIGEDGVLYCFSVLSGKLERTLPVHEKDVIG 484 (508)
T ss_pred EEecCCCcEEEEEccCcEEEEEEeecCceeeeeecccccccc
Confidence 789999999999999999999999887777766666443333
No 117
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.24 E-value=5.1e-10 Score=124.04 Aligned_cols=239 Identities=14% Similarity=0.138 Sum_probs=146.6
Q ss_pred CCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEE
Q 001814 56 KDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLL 134 (1010)
Q Consensus 56 kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLA 134 (1010)
-+.|.|..+. +...+|+.|..+| +-+|.+.+ +...+++++|..++.+=+|+|++. .++
T Consensus 148 ~~dieWl~WH-------p~a~illAG~~DGsvWmw~ip~-~~~~kv~~Gh~~~ct~G~f~pdGK-------------r~~ 206 (399)
T KOG0296|consen 148 VEDIEWLKWH-------PRAHILLAGSTDGSVWMWQIPS-QALCKVMSGHNSPCTCGEFIPDGK-------------RIL 206 (399)
T ss_pred cCceEEEEec-------ccccEEEeecCCCcEEEEECCC-cceeeEecCCCCCcccccccCCCc-------------eEE
Confidence 3445555554 3678999999988 79999954 467899999999999999999872 233
Q ss_pred EEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC----CcEEEEEEcCCeEEEEeC
Q 001814 135 VVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR----SSVCMVRCSPRIVAVGLA 210 (1010)
Q Consensus 135 vVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~----S~V~sVa~S~rlLAV~ld 210 (1010)
. + +.+++|++||+++|+.++.+.-. .+..++...+..++.|..
T Consensus 207 t--g-------------------------------y~dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~ 253 (399)
T KOG0296|consen 207 T--G-------------------------------YDDGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNS 253 (399)
T ss_pred E--E-------------------------------ecCceEEEEecCCCceeEEecccccCcCCccccccccceeEeccC
Confidence 2 1 12478999999999999998732 233333333445555443
Q ss_pred -CeEEEEECCCCceeEEEee-cCCccccCCCccccccCccceEEcc--ceEE--EccCCeeeccCCccCCCcCCCCCCCC
Q 001814 211 -TQIYCFDALTLENKFSVLT-YPVPQLAGQGAVGINVGYGPMAVGP--RWLA--YASNTLLLSNSGRLSPQNLTPSGVSP 284 (1010)
Q Consensus 211 -~~I~IwD~~Tle~l~tL~t-~p~p~~~~~g~~~vnv~~gplAlgp--RwLA--yas~~~~iwd~G~vs~Q~lt~p~vS~ 284 (1010)
..+++-+..+++.+..... .|.- ...+. ...+ ..-.+.++. ...| +-..++.|||.....+.+.-.
T Consensus 254 e~~~~~~~~~sgKVv~~~n~~~~~l-~~~~e-~~~e-sve~~~~ss~lpL~A~G~vdG~i~iyD~a~~~~R~~c~----- 325 (399)
T KOG0296|consen 254 EGVACGVNNGSGKVVNCNNGTVPEL-KPSQE-ELDE-SVESIPSSSKLPLAACGSVDGTIAIYDLAASTLRHICE----- 325 (399)
T ss_pred CccEEEEccccceEEEecCCCCccc-cccch-hhhh-hhhhcccccccchhhcccccceEEEEecccchhheecc-----
Confidence 4577777777776665552 2210 00000 0000 001111111 1112 223356677642111111000
Q ss_pred CcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEE
Q 001814 285 STSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQ 364 (1010)
Q Consensus 285 stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~ 364 (1010)
.-.|+++ + +| .+ . -.+.++..+|.|+.||..+|+++.+
T Consensus 326 --------------------he~~V~~-l-~w----~~------------t----~~l~t~c~~g~v~~wDaRtG~l~~~ 363 (399)
T KOG0296|consen 326 --------------------HEDGVTK-L-KW----LN------------T----DYLLTACANGKVRQWDARTGQLKFT 363 (399)
T ss_pred --------------------CCCceEE-E-EE----cC------------c----chheeeccCceEEeeeccccceEEE
Confidence 0011110 0 00 00 0 0234567899999999999999999
Q ss_pred eccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 365 FKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 365 ~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
..+|+.+|.+++++|++++++|+|.| ++.+||++
T Consensus 364 y~GH~~~Il~f~ls~~~~~vvT~s~D-~~a~VF~v 397 (399)
T KOG0296|consen 364 YTGHQMGILDFALSPQKRLVVTVSDD-NTALVFEV 397 (399)
T ss_pred EecCchheeEEEEcCCCcEEEEecCC-CeEEEEec
Confidence 99999999999999999999999995 45899987
No 118
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.23 E-value=4.6e-09 Score=114.97 Aligned_cols=244 Identities=16% Similarity=0.232 Sum_probs=160.0
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEe--cCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGY--QNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy--~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
+|=.|.-++|-. ....+|.... ++.||..++.+ +.....+.+|...|..+.+.|.. | .
T Consensus 55 kkyG~~~~~Fth------~~~~~i~sStk~d~tIryLsl~d-NkylRYF~GH~~~V~sL~~sP~~------d-------~ 114 (311)
T KOG1446|consen 55 KKYGVDLACFTH------HSNTVIHSSTKEDDTIRYLSLHD-NKYLRYFPGHKKRVNSLSVSPKD------D-------T 114 (311)
T ss_pred ccccccEEEEec------CCceEEEccCCCCCceEEEEeec-CceEEEcCCCCceEEEEEecCCC------C-------e
Confidence 445566666753 2234555544 46799999965 56778899999999999999854 1 2
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcCCeEEEEeCC
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSPRIVAVGLAT 211 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~V~sVa~S~rlLAV~ld~ 211 (1010)
.+ ++ +.|++||+||++..+|..-+.... +|.+..-..-++|++...
T Consensus 115 Fl--S~-------------------------------S~D~tvrLWDlR~~~cqg~l~~~~~pi~AfDp~GLifA~~~~~ 161 (311)
T KOG1446|consen 115 FL--SS-------------------------------SLDKTVRLWDLRVKKCQGLLNLSGRPIAAFDPEGLIFALANGS 161 (311)
T ss_pred EE--ec-------------------------------ccCCeEEeeEecCCCCceEEecCCCcceeECCCCcEEEEecCC
Confidence 22 21 125899999999999888887665 454433334567777765
Q ss_pred -eEEEEECCCCc-eeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 212 -QIYCFDALTLE-NKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 212 -~I~IwD~~Tle-~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
.|++||++.+. ..++.-..+.+ -. ++ |. -..|||+
T Consensus 162 ~~IkLyD~Rs~dkgPF~tf~i~~~---------~~---------~e-----------w~--------------~l~FS~d 198 (311)
T KOG1446|consen 162 ELIKLYDLRSFDKGPFTTFSITDN---------DE---------AE-----------WT--------------DLEFSPD 198 (311)
T ss_pred CeEEEEEecccCCCCceeEccCCC---------Cc---------cc-----------ee--------------eeEEcCC
Confidence 89999998763 12211111100 00 01 10 1123444
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCC
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHT 369 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHt 369 (1010)
|+.++. ....+.+.|.|.-+|.++..|..|.
T Consensus 199 GK~iLl-------------------------------------------------sT~~s~~~~lDAf~G~~~~tfs~~~ 229 (311)
T KOG1446|consen 199 GKSILL-------------------------------------------------STNASFIYLLDAFDGTVKSTFSGYP 229 (311)
T ss_pred CCEEEE-------------------------------------------------EeCCCcEEEEEccCCcEeeeEeecc
Confidence 433321 2356788899999999888998876
Q ss_pred CCe---EEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 370 SPI---SALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 370 spI---saLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
..- ...+|+|||+++.+++.||+ |+||++.. | .++..+ +|.....+.++-|+|---.+
T Consensus 230 ~~~~~~~~a~ftPds~Fvl~gs~dg~-i~vw~~~t-------g----------~~v~~~-~~~~~~~~~~~~fnP~~~mf 290 (311)
T KOG1446|consen 230 NAGNLPLSATFTPDSKFVLSGSDDGT-IHVWNLET-------G----------KKVAVL-RGPNGGPVSCVRFNPRYAMF 290 (311)
T ss_pred CCCCcceeEEECCCCcEEEEecCCCc-EEEEEcCC-------C----------cEeeEe-cCCCCCCccccccCCceeee
Confidence 533 45689999999999999887 99999964 4 456666 55445578899999965555
Q ss_pred EEEeCCCeEEEEeCCCCC
Q 001814 447 AIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 447 AsgS~dGTVhIw~I~~~g 464 (1010)
|++ +..+-+|-....+
T Consensus 291 ~sa--~s~l~fw~p~~~~ 306 (311)
T KOG1446|consen 291 VSA--SSNLVFWLPDEDA 306 (311)
T ss_pred eec--CceEEEEeccccc
Confidence 555 5568788765443
No 119
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.23 E-value=4.9e-11 Score=146.02 Aligned_cols=246 Identities=19% Similarity=0.245 Sum_probs=166.4
Q ss_pred EEeeccCCCCCC--CeEEEEEecCc-EEEEEccC--CCcceEe---eeeccCCEEEEEEecCCCCCCCCCCccccCcEEE
Q 001814 63 GFDRLEYGPSVF--KQVLLLGYQNG-FQVLDVED--ASNFNEL---VSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLL 134 (1010)
Q Consensus 63 ~Fd~le~~~~~~--~~vLalGy~~G-~qVWDv~~--~g~v~el---lS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLA 134 (1010)
+|++|..+.... .-+|+.|..+| |-+||... .++-.++ .+.|.|+|+.+.|.|. . +.+||
T Consensus 66 rF~kL~W~~~g~~~~GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~----------q--~nlLA 133 (1049)
T KOG0307|consen 66 RFNKLAWGSYGSHSHGLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPF----------Q--GNLLA 133 (1049)
T ss_pred cceeeeecccCCCccceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeecccc----------C--Cceee
Confidence 466665533211 24788888877 99999876 2434444 4568899999998753 2 24777
Q ss_pred EEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEE---eCCCcEEEEEEcC---CeEEEE
Q 001814 135 VVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVL---RFRSSVCMVRCSP---RIVAVG 208 (1010)
Q Consensus 135 vVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL---~f~S~V~sVa~S~---rlLAV~ 208 (1010)
. |+ .++.|.|||+.+-+.-.++ .+.+.|..|++|+ ++||.+
T Consensus 134 S--Ga-------------------------------~~geI~iWDlnn~~tP~~~~~~~~~~eI~~lsWNrkvqhILAS~ 180 (1049)
T KOG0307|consen 134 S--GA-------------------------------DDGEILIWDLNKPETPFTPGSQAPPSEIKCLSWNRKVSHILASG 180 (1049)
T ss_pred c--cC-------------------------------CCCcEEEeccCCcCCCCCCCCCCCcccceEeccchhhhHHhhcc
Confidence 3 21 1367999999874433333 3457899999998 578877
Q ss_pred eCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 209 LAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 209 ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
... ++.|||++.-+.+..+..++.. +.++ .+-|+
T Consensus 181 s~sg~~~iWDlr~~~pii~ls~~~~~----------------~~~S----------~l~Wh------------------- 215 (1049)
T KOG0307|consen 181 SPSGRAVIWDLRKKKPIIKLSDTPGR----------------MHCS----------VLAWH------------------- 215 (1049)
T ss_pred CCCCCceeccccCCCcccccccCCCc----------------ccee----------eeeeC-------------------
Confidence 664 8999999977655555443321 1110 11122
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC-cEEEEec
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR-AIISQFK 366 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~-~~v~~~~ 366 (1010)
|++.+.++ .| ..-+..-.|.+||+..- ..+..++
T Consensus 216 P~~aTql~-~A--------------------------------------------s~dd~~PviqlWDlR~assP~k~~~ 250 (1049)
T KOG0307|consen 216 PDHATQLL-VA--------------------------------------------SGDDSAPVIQLWDLRFASSPLKILE 250 (1049)
T ss_pred CCCceeee-ee--------------------------------------------cCCCCCceeEeecccccCCchhhhc
Confidence 11111100 00 01134557999998753 4667889
Q ss_pred cCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCC-
Q 001814 367 AHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQ- 444 (1010)
Q Consensus 367 aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~- 444 (1010)
.|...|.+|.|.+.+ ++|+|++.|++ |-+|+..+ | ..|++|-+ ...++.++.|.|-.-
T Consensus 251 ~H~~GilslsWc~~D~~lllSsgkD~~-ii~wN~~t-------g----------Evl~~~p~--~~nW~fdv~w~pr~P~ 310 (1049)
T KOG0307|consen 251 GHQRGILSLSWCPQDPRLLLSSGKDNR-IICWNPNT-------G----------EVLGELPA--QGNWCFDVQWCPRNPS 310 (1049)
T ss_pred ccccceeeeccCCCCchhhhcccCCCC-eeEecCCC-------c----------eEeeecCC--CCcceeeeeecCCCcc
Confidence 999999999999988 89999999877 77999753 4 57888754 235799999998664
Q ss_pred EEEEEeCCCeEEEEeCCCC
Q 001814 445 WIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 445 ~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+|++|.+|+|-||.|...
T Consensus 311 ~~A~asfdgkI~I~sl~~~ 329 (1049)
T KOG0307|consen 311 VMAAASFDGKISIYSLQGT 329 (1049)
T ss_pred hhhhheeccceeeeeeecC
Confidence 8999999999999999754
No 120
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.22 E-value=1e-08 Score=113.98 Aligned_cols=104 Identities=12% Similarity=0.137 Sum_probs=69.9
Q ss_pred CCCCeEEEEECCC--C--cEEEEeccC------CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcccc
Q 001814 346 DNAGIVVVKDFVT--R--AIISQFKAH------TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDW 415 (1010)
Q Consensus 346 s~dG~V~VwDl~s--~--~~v~~~~aH------tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~ 415 (1010)
..++.|.+||+.. + +.+..+..+ ......+.|+|||++|+++......|.||++... +.
T Consensus 194 ~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~------~~----- 262 (330)
T PRK11028 194 ELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSED------GS----- 262 (330)
T ss_pred cCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCC------CC-----
Confidence 4589999999973 2 234444322 1123469999999999998776678999998542 10
Q ss_pred CCcceEEEEEecccccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCCCCC
Q 001814 416 NSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSPFGG 465 (1010)
Q Consensus 416 ~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~~gg 465 (1010)
....+.....| .....++|+|||++|+++.. +++|.||+++...+
T Consensus 263 --~~~~~~~~~~~---~~p~~~~~~~dg~~l~va~~~~~~v~v~~~~~~~g 308 (330)
T PRK11028 263 --VLSFEGHQPTE---TQPRGFNIDHSGKYLIAAGQKSHHISVYEIDGETG 308 (330)
T ss_pred --eEEEeEEEecc---ccCCceEECCCCCEEEEEEccCCcEEEEEEcCCCC
Confidence 01112222222 13457899999999999886 88999999976544
No 121
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.22 E-value=3.6e-10 Score=123.02 Aligned_cols=258 Identities=19% Similarity=0.285 Sum_probs=158.9
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccC---CCcceEeee--------------eccCCEEEEEEecC
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVED---ASNFNELVS--------------KRDGPVSFLQMQPF 116 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~---~g~v~ellS--------------~hdGpV~~v~~lP~ 116 (1010)
||..+.-+.|. +++.++++|..+. |+|+|++. ....+++.+ .|-.+|..+.|-|.
T Consensus 111 HK~~cR~aafs-------~DG~lvATGsaD~SIKildvermlaks~~~em~~~~~qa~hPvIRTlYDH~devn~l~FHPr 183 (430)
T KOG0640|consen 111 HKSPCRAAAFS-------PDGSLVATGSADASIKILDVERMLAKSKPKEMISGDTQARHPVIRTLYDHVDEVNDLDFHPR 183 (430)
T ss_pred cccceeeeeeC-------CCCcEEEccCCcceEEEeehhhhhhhcchhhhccCCcccCCceEeehhhccCcccceeecch
Confidence 56666777775 4788999999875 99999971 012223332 33455666665543
Q ss_pred CCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEE---EEeCCC
Q 001814 117 PVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEH---VLRFRS 193 (1010)
Q Consensus 117 p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~---tL~f~S 193 (1010)
..+|+. +. -+++|+|+|+.+-..-+ .+.-..
T Consensus 184 -------------e~ILiS--~s-------------------------------rD~tvKlFDfsK~saKrA~K~~qd~~ 217 (430)
T KOG0640|consen 184 -------------ETILIS--GS-------------------------------RDNTVKLFDFSKTSAKRAFKVFQDTE 217 (430)
T ss_pred -------------hheEEe--cc-------------------------------CCCeEEEEecccHHHHHHHHHhhccc
Confidence 235552 10 25899999996543322 233345
Q ss_pred cEEEEEEcC--CeEEEEeCC-eEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCC
Q 001814 194 SVCMVRCSP--RIVAVGLAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSG 270 (1010)
Q Consensus 194 ~V~sVa~S~--rlLAV~ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G 270 (1010)
+|++|.|.| .+|++|.+. .+++||+.|.++...- .|.. |-.. ++ .-+-|++.
T Consensus 218 ~vrsiSfHPsGefllvgTdHp~~rlYdv~T~Qcfvsa--nPd~----qht~---------ai--~~V~Ys~t-------- 272 (430)
T KOG0640|consen 218 PVRSISFHPSGEFLLVGTDHPTLRLYDVNTYQCFVSA--NPDD----QHTG---------AI--TQVRYSST-------- 272 (430)
T ss_pred eeeeEeecCCCceEEEecCCCceeEEeccceeEeeec--Cccc----cccc---------ce--eEEEecCC--------
Confidence 899999988 799999885 6999999998764211 1210 0000 00 00111110
Q ss_pred ccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCe
Q 001814 271 RLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGI 350 (1010)
Q Consensus 271 ~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~ 350 (1010)
+. ..++++.||.
T Consensus 273 --------------------~~------------------------------------------------lYvTaSkDG~ 284 (430)
T KOG0640|consen 273 --------------------GS------------------------------------------------LYVTASKDGA 284 (430)
T ss_pred --------------------cc------------------------------------------------EEEEeccCCc
Confidence 00 1135789999
Q ss_pred EEEEECCCCcEEEEec-cCC-CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc-----cCC-CCC-------Cc---
Q 001814 351 VVVKDFVTRAIISQFK-AHT-SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM-----RSG-SGN-------HK--- 412 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~-aHt-spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~-----~~~-sG~-------~~--- 412 (1010)
|+|||-.+++++.+|. ||. +.|.+..|..+|+++.+.+. +.++++|.+.+... +.+ +|. .+
T Consensus 285 IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG~-DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNht 363 (430)
T KOG0640|consen 285 IKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSGK-DSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHT 363 (430)
T ss_pred EEeeccccHHHHHHHHhhcCCceeeeEEEccCCeEEeecCC-cceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCc
Confidence 9999999999998885 786 47999999999999999998 56799999965310 110 110 00
Q ss_pred --------------cccCCcc-eEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 413 --------------YDWNSSH-VHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 413 --------------~~~~~s~-~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
-.|++.- ..+..+.-|++ ..+..|.-||.+--++++|+|-.++.|.-
T Consensus 364 EdyVl~pDEas~slcsWdaRtadr~~l~slgHn-~a~R~i~HSP~~p~FmTcsdD~raRFWyr 425 (430)
T KOG0640|consen 364 EDYVLFPDEASNSLCSWDARTADRVALLSLGHN-GAVRWIVHSPVEPAFMTCSDDFRARFWYR 425 (430)
T ss_pred cceEEccccccCceeeccccchhhhhhcccCCC-CCceEEEeCCCCCceeeecccceeeeeee
Confidence 0122211 01111222543 34677777888888888888888888753
No 122
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.22 E-value=2.4e-09 Score=117.05 Aligned_cols=210 Identities=13% Similarity=0.119 Sum_probs=133.2
Q ss_pred CCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC-----CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCcccccc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP-----RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINV 245 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~-S~V~sVa~S~-----rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv 245 (1010)
+-+|+|||+++...+..|-.| +.|.++.|.+ .+|..+.+++|.+||+...+++.++..|...
T Consensus 62 DetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~------------ 129 (362)
T KOG0294|consen 62 DETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQ------------ 129 (362)
T ss_pred CCcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeecccccc------------
Confidence 579999999999888887655 6899999976 3566667789999999999999999877551
Q ss_pred CccceEEcc-ce--EEEccC-CeeeccC--CccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeecccccc
Q 001814 246 GYGPMAVGP-RW--LAYASN-TLLLSNS--GRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQE 319 (1010)
Q Consensus 246 ~~gplAlgp-Rw--LAyas~-~~~iwd~--G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~ 319 (1010)
.+-+++.| .- |+..++ .+..|+. |+.. ++. .|..+..
T Consensus 130 -Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a-------------------~v~----------------~L~~~at- 172 (362)
T KOG0294|consen 130 -VTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVA-------------------FVL----------------NLKNKAT- 172 (362)
T ss_pred -cceeEecCCCceEEEEcCCceeeeehhhcCccc-------------------eee----------------ccCCcce-
Confidence 23456655 22 334444 3567873 4321 110 1111100
Q ss_pred ccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 320 LLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 320 l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
.+.+++.+.+ ++ ......|-||.+.+-.+...+.-. ..+.++.|-.. ..|++|..++ .|++||.
T Consensus 173 --------~v~w~~~Gd~----F~-v~~~~~i~i~q~d~A~v~~~i~~~-~r~l~~~~l~~-~~L~vG~d~~-~i~~~D~ 236 (362)
T KOG0294|consen 173 --------LVSWSPQGDH----FV-VSGRNKIDIYQLDNASVFREIENP-KRILCATFLDG-SELLVGGDNE-WISLKDT 236 (362)
T ss_pred --------eeEEcCCCCE----EE-EEeccEEEEEecccHhHhhhhhcc-ccceeeeecCC-ceEEEecCCc-eEEEecc
Confidence 0111111100 00 112346778888765544333322 44777888754 5566777754 5999997
Q ss_pred CCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE--ccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 400 MPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF--SHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 400 ~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF--SpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.. + ..+..+. + |.++|.+|.| .|++.+|+++|+||.|.||+++....
T Consensus 237 ds-------~----------~~~~~~~-A-H~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~~~~k 285 (362)
T KOG0294|consen 237 DS-------D----------TPLTEFL-A-HENRVKDIASYTNPEHEYLVTASSDGFIKVWDIDMETK 285 (362)
T ss_pred CC-------C----------ccceeee-c-chhheeeeEEEecCCceEEEEeccCceEEEEEcccccc
Confidence 52 2 3555554 3 4578999985 57899999999999999999986533
No 123
>KOG2109 consensus WD40 repeat protein [General function prediction only]
Probab=99.21 E-value=1.2e-11 Score=144.39 Aligned_cols=316 Identities=24% Similarity=0.315 Sum_probs=191.3
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
.+-.+++||=.| .++|-..-.+.+.+.+.++.|+|+...++++. |+.+
T Consensus 251 kGy~~isglc~g~~~~g~gpglgg~~~~~vGrvg~vsaesV~g~~--------------~viv----------------- 299 (788)
T KOG2109|consen 251 KGYVLISGLCRGSYQIGTGPGLGGFEEVLVGRVGPVSAESVLGNN--------------LVIV----------------- 299 (788)
T ss_pred chHHHHHHHhhcccCCCCCCCCCCcCceeccccccccceeecccc--------------eEEe-----------------
Confidence 456788888777 79998877788889999999999999988653 3332
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCceeEEEee-cC
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLENKFSVLT-YP 231 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~tL~t-~p 231 (1010)
..|-++.....++.++++..+++++-+.-+|++++-..+-|++++.|+..-++.. ++
T Consensus 300 ----------------------kdf~S~a~i~QfkAhkspiSaLcfdqsgsllViasi~g~nVnvfRimet~~t~~~~~q 357 (788)
T KOG2109|consen 300 ----------------------KDFDSFADIRQFKAHKSPISALCFDQSGSLLVIASITGRNVNVFRIMETVCTVNVSDQ 357 (788)
T ss_pred ----------------------ecccchhhhhheeeecCcccccccccCceEEEEEeeccceeeeEEecccccccccccc
Confidence 1122334445556666666667777777788888877777777777776555543 22
Q ss_pred CccccCCCccccccCccceEEccceEEEccCC-eeeccC-Ccc--CCCcCCCC-C----CCCCcC---CCCCceEEEeeh
Q 001814 232 VPQLAGQGAVGINVGYGPMAVGPRWLAYASNT-LLLSNS-GRL--SPQNLTPS-G----VSPSTS---PGGSSLVARYAM 299 (1010)
Q Consensus 232 ~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~-~~iwd~-G~v--s~Q~lt~p-~----vS~stS---P~~gslVa~~A~ 299 (1010)
.+ ...+.++.++||++..-. ..-|.. |.. .+-.+.+. . ..++.. .+.+-++
T Consensus 358 s~------------~~s~ra~t~aviqdicfs~~s~~r~~gsc~Ge~P~ls~t~~lp~~A~~Sl~~gl~s~g~~a----- 420 (788)
T KOG2109|consen 358 SL------------VVSPRANTAAVIQDICFSEVSTIRTAGSCEGEPPALSLTCQLPAYADTSLDLGLQSSGGLA----- 420 (788)
T ss_pred cc------------ccchhcchHHHHHHHhhhhhcceEeecccCCCCcccccccccchhhchhhhccccccCccc-----
Confidence 21 123566666666544311 111111 111 00011100 0 000000 0001111
Q ss_pred hhhhhhhcccceeeccccccccCCCCCCCccCCCcccccc-cccc-ccCCCCeEEEEECC-----CC-cEEEEeccCCCC
Q 001814 300 EHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGR-HAGA-DMDNAGIVVVKDFV-----TR-AIISQFKAHTSP 371 (1010)
Q Consensus 300 dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~-~~ia-sgs~dG~V~VwDl~-----s~-~~v~~~~aHtsp 371 (1010)
+.|+.-+-.+|+...+..++....+.-++.+..+ ..+. .-...|.+.+.+.. .+ .+++.+.+|..+
T Consensus 421 ------a~gla~~sag~~a~s~~asSv~s~s~~pd~ks~gv~~gsv~k~~q~~~~~l~~llv~~psGd~vvqh~vahs~~ 494 (788)
T KOG2109|consen 421 ------AEGLATSSAGYTAHSYTASSVFSRSVKPDSKSVGVGSGSVTKANQGVITVLNLLLVGEPSGDGVVQHYVAHSDP 494 (788)
T ss_pred ------ceeeeeccccccccccccceeeccccccchhhccceeeeccccCccchhhhhheeeecCCCCceeEEEeeccCc
Confidence 1111111122332222111110111111111111 0111 11122444444332 23 577888999999
Q ss_pred eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeC
Q 001814 372 ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSS 451 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~ 451 (1010)
+..+.|+|+++++.|++..++.+++|.++|...++.. +...|+|++.||.|.++|..++|+-|++|+|....
T Consensus 495 gv~~Ef~~~~~l~lSad~~e~ef~~f~V~Ph~~wssl--------aav~hly~l~rG~TsaKv~~~afs~dsrw~A~~t~ 566 (788)
T KOG2109|consen 495 GVYIEFSPDQRLVLSADANENEFNIFLVMPHATWSSL--------AAVQHLYKLNRGSTSAKVVSTAFSEDSRWLAITTN 566 (788)
T ss_pred cceeeecccccceecccccccccceEEeecccccHHH--------hhhhhhhhccCCCccceeeeeEeecchhhhhhhhc
Confidence 9999999999999999999998999999987443221 23468999999999999999999999999999999
Q ss_pred CCeEEEEeCCCCCCcccccccc
Q 001814 452 KGTCHVFVLSPFGGDSGFQTLS 473 (1010)
Q Consensus 452 dGTVhIw~I~~~gg~~~~~~H~ 473 (1010)
.+|-|||++.+|++....++|.
T Consensus 567 ~~TthVfk~hpYgg~aeqrth~ 588 (788)
T KOG2109|consen 567 HATTHVFKVHPYGGKAEQRTHG 588 (788)
T ss_pred CCceeeeeeccccccccceecC
Confidence 9999999999999999999885
No 124
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.20 E-value=4.5e-10 Score=122.66 Aligned_cols=108 Identities=12% Similarity=0.234 Sum_probs=85.6
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+.+++.|-.|.|||+. |+.+.++......-...+.||+|++||.++-.- .++||.+- +...++- +.+..
T Consensus 202 imsas~dt~i~lw~lk-Gq~L~~idtnq~~n~~aavSP~GRFia~~gFTp-DVkVwE~~--f~kdG~f-------qev~r 270 (420)
T KOG2096|consen 202 IMSASLDTKICLWDLK-GQLLQSIDTNQSSNYDAAVSPDGRFIAVSGFTP-DVKVWEPI--FTKDGTF-------QEVKR 270 (420)
T ss_pred EEEecCCCcEEEEecC-CceeeeeccccccccceeeCCCCcEEEEecCCC-CceEEEEE--eccCcch-------hhhhh
Confidence 3467788899999998 899999988888888889999999999998854 48999963 2222111 12345
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
++.| .|+. +.|..+|||++++.+++.|.||+.+||+++-
T Consensus 271 vf~L-kGH~-saV~~~aFsn~S~r~vtvSkDG~wriwdtdV 309 (420)
T KOG2096|consen 271 VFSL-KGHQ-SAVLAAAFSNSSTRAVTVSKDGKWRIWDTDV 309 (420)
T ss_pred hhee-ccch-hheeeeeeCCCcceeEEEecCCcEEEeeccc
Confidence 6677 5655 4599999999999999999999999999874
No 125
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.19 E-value=8e-09 Score=118.17 Aligned_cols=96 Identities=19% Similarity=0.295 Sum_probs=74.8
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
-+...|.-.|.|.++...+ +++--..||++|+|+|+|.+||.||.|+. |.||.+..+ | +...
T Consensus 423 ~Gt~~G~w~V~d~e~~~lv-~~~~d~~~ls~v~ysp~G~~lAvgs~d~~-iyiy~Vs~~------g----------~~y~ 484 (626)
T KOG2106|consen 423 VGTATGRWFVLDTETQDLV-TIHTDNEQLSVVRYSPDGAFLAVGSHDNH-IYIYRVSAN------G----------RKYS 484 (626)
T ss_pred EeeccceEEEEecccceeE-EEEecCCceEEEEEcCCCCEEEEecCCCe-EEEEEECCC------C----------cEEE
Confidence 4556788889999985554 44434899999999999999999999665 999999642 3 2322
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
++.+ .+.+.|..+.||+|+++|.+-|.|=-+..|
T Consensus 485 r~~k-~~gs~ithLDwS~Ds~~~~~~S~d~eiLyW 518 (626)
T KOG2106|consen 485 RVGK-CSGSPITHLDWSSDSQFLVSNSGDYEILYW 518 (626)
T ss_pred Eeee-ecCceeEEeeecCCCceEEeccCceEEEEE
Confidence 2222 222689999999999999999999999999
No 126
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.19 E-value=5.5e-09 Score=113.83 Aligned_cols=251 Identities=15% Similarity=0.177 Sum_probs=161.7
Q ss_pred CCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcc-eEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 55 LKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNF-NELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 55 ~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v-~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
..|.|.-..|-. ....+|+.|..+ .+|+|++++.|.. -+....|++||-++.+.-++ ..
T Consensus 26 P~DsIS~l~FSP------~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~Wsddg-------------sk 86 (347)
T KOG0647|consen 26 PEDSISALAFSP------QADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDG-------------SK 86 (347)
T ss_pred cccchheeEecc------ccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCC-------------ce
Confidence 457788888874 124566666655 5999999875433 24445688999999988554 23
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC----CeEEEE
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP----RIVAVG 208 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~----rlLAV~ 208 (1010)
+++-+ .++.+++|||.+++....--+..+|..++|=+ ..|+.|
T Consensus 87 Vf~g~---------------------------------~Dk~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TG 133 (347)
T KOG0647|consen 87 VFSGG---------------------------------CDKQAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTG 133 (347)
T ss_pred EEeec---------------------------------cCCceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEec
Confidence 44311 24789999999997655555667999999854 366776
Q ss_pred e-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcC
Q 001814 209 L-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTS 287 (1010)
Q Consensus 209 l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stS 287 (1010)
. |++|+.||.+....+.++. .|+ |.
T Consensus 134 SWDKTlKfWD~R~~~pv~t~~---LPe--------------------Rv------------------------------- 159 (347)
T KOG0647|consen 134 SWDKTLKFWDTRSSNPVATLQ---LPE--------------------RV------------------------------- 159 (347)
T ss_pred ccccceeecccCCCCeeeeee---ccc--------------------ee-------------------------------
Confidence 5 7889999998766655553 110 00
Q ss_pred CCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 288 PGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 288 P~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
||+|....++ +-+..+..|.||.+.++. ..++.
T Consensus 160 ---------Ya~Dv~~pm~------------------------------------vVata~r~i~vynL~n~~--te~k~ 192 (347)
T KOG0647|consen 160 ---------YAADVLYPMA------------------------------------VVATAERHIAVYNLENPP--TEFKR 192 (347)
T ss_pred ---------eehhccCcee------------------------------------EEEecCCcEEEEEcCCCc--chhhh
Confidence 1111100000 011235567788886553 24444
Q ss_pred CCC----CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc----c-cccEEEEE
Q 001814 368 HTS----PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI----T-SATIQDIC 438 (1010)
Q Consensus 368 Hts----pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~----t-~a~I~sIA 438 (1010)
|.+ .+.+|+.-+|....|-||.+|+ +-|..+.+. .+...-.++.+|-. . -..|.+|+
T Consensus 193 ~~SpLk~Q~R~va~f~d~~~~alGsiEGr-v~iq~id~~-------------~~~~nFtFkCHR~~~~~~~~VYaVNsi~ 258 (347)
T KOG0647|consen 193 IESPLKWQTRCVACFQDKDGFALGSIEGR-VAIQYIDDP-------------NPKDNFTFKCHRSTNSVNDDVYAVNSIA 258 (347)
T ss_pred hcCcccceeeEEEEEecCCceEeeeecce-EEEEecCCC-------------CccCceeEEEeccCCCCCCceEEecceE
Confidence 544 4568888888877799999998 678887541 01112455666621 1 12478899
Q ss_pred EccCCCEEEEEeCCCeEEEEeCCCCCCccccccc
Q 001814 439 FSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTL 472 (1010)
Q Consensus 439 FSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H 472 (1010)
|.|.-..|++++.|||.-.||-+..-...+...|
T Consensus 259 FhP~hgtlvTaGsDGtf~FWDkdar~kLk~s~~~ 292 (347)
T KOG0647|consen 259 FHPVHGTLVTAGSDGTFSFWDKDARTKLKTSETH 292 (347)
T ss_pred eecccceEEEecCCceEEEecchhhhhhhccCcC
Confidence 9999999999999999999997665443333344
No 127
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.19 E-value=8.7e-10 Score=132.02 Aligned_cols=192 Identities=16% Similarity=0.156 Sum_probs=122.9
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
+.+++.|.+|++|++....++..|. |..-|+|++|+| |.+++++||.||. ||||+|... .+.
T Consensus 383 LLSSSMDKTVRLWh~~~~~CL~~F~-HndfVTcVaFnPvDDryFiSGSLD~K-vRiWsI~d~---------------~Vv 445 (712)
T KOG0283|consen 383 LLSSSMDKTVRLWHPGRKECLKVFS-HNDFVTCVAFNPVDDRYFISGSLDGK-VRLWSISDK---------------KVV 445 (712)
T ss_pred eEeccccccEEeecCCCcceeeEEe-cCCeeEEEEecccCCCcEeecccccc-eEEeecCcC---------------eeE
Confidence 3467899999999999999998887 999999999999 6689999999876 999999531 111
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCCCCCCcccCCCCCccCc
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPVLSLPWWCTSSGISEQQ 500 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv~~lpw~~~ss~~~~q~ 500 (1010)
.-+.+ ...|++++|+|||++.++|+.+|.+++|......-....+.|........-.-++.| |+
T Consensus 446 ~W~Dl-----~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~-----------Q~ 509 (712)
T KOG0283|consen 446 DWNDL-----RDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGL-----------QF 509 (712)
T ss_pred eehhh-----hhhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeee-----------Ee
Confidence 22223 246999999999999999999999999998765554444333221111111123332 21
Q ss_pred cCCCCC--ceeeeeeeeeeecCC--ccccccccccccccCccccccceeeeecccCccccccccccccCccccEEEEcCC
Q 001814 501 CVLPPP--PVTLSVVSRIKYSSF--GWLNTVSNASASSMGKVFVPSGAVAAVFHNSIAHSSQHVNSRTNSLEHLLVYTPS 576 (1010)
Q Consensus 501 ~~p~p~--~~~l~~vsrIk~~~~--~w~~~v~~a~~~at~~~~~ps~~va~~F~~~~~~~~~~~~s~~~~~~~LlV~s~~ 576 (1010)
. |.-+ -++-+.=+|||.=.. .=+-..-|.+.+. -+| +.|-|.. .-.+++.++.|
T Consensus 510 ~-p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~------~SQ-~~Asfs~--------------Dgk~IVs~seD 567 (712)
T KOG0283|consen 510 F-PGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNT------SSQ-ISASFSS--------------DGKHIVSASED 567 (712)
T ss_pred c-CCCCCeEEEecCCCceEEEeccchhhhhhhcccccC------Ccc-eeeeEcc--------------CCCEEEEeecC
Confidence 1 1111 233445567775321 0000111112221 112 4444433 12489999999
Q ss_pred ccEEEEecccCC
Q 001814 577 GYVVQHELLPSI 588 (1010)
Q Consensus 577 G~l~~Y~L~p~~ 588 (1010)
-++|++++++..
T Consensus 568 s~VYiW~~~~~~ 579 (712)
T KOG0283|consen 568 SWVYIWKNDSFN 579 (712)
T ss_pred ceEEEEeCCCCc
Confidence 999999997654
No 128
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.19 E-value=2.3e-09 Score=116.34 Aligned_cols=131 Identities=16% Similarity=0.238 Sum_probs=99.7
Q ss_pred CCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC-----CeEEEEeCC-eEEEEECCCCceeEEEeecCCccccCCCccccc
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP-----RIVAVGLAT-QIYCFDALTLENKFSVLTYPVPQLAGQGAVGIN 244 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~-----rlLAV~ld~-~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vn 244 (1010)
.+.+|++||..|-+.+..+++...||+=+++| -++|+|..+ +|++.|+..+..-++|.+|..
T Consensus 122 FDhtlKVWDtnTlQ~a~~F~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~------------ 189 (397)
T KOG4283|consen 122 FDHTLKVWDTNTLQEAVDFKMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFSHTLSGHRD------------ 189 (397)
T ss_pred ccceEEEeecccceeeEEeecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcceeeeccccC------------
Confidence 36899999999999999999999999888887 367788766 899999999999898887754
Q ss_pred cCccceEE--cc--ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccc
Q 001814 245 VGYGPMAV--GP--RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQEL 320 (1010)
Q Consensus 245 v~~gplAl--gp--RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l 320 (1010)
+.||+ +| .|+
T Consensus 190 ---~vlaV~Wsp~~e~v--------------------------------------------------------------- 203 (397)
T KOG4283|consen 190 ---GVLAVEWSPSSEWV--------------------------------------------------------------- 203 (397)
T ss_pred ---ceEEEEeccCceeE---------------------------------------------------------------
Confidence 23443 32 111
Q ss_pred cCCCCCCCccCCCccccccccccccCCCCeEEEEECCCC------------c---EEEEeccCCCCeEEEEECCCCCEEE
Q 001814 321 LPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR------------A---IISQFKAHTSPISALCFDPSGTLLV 385 (1010)
Q Consensus 321 ~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~------------~---~v~~~~aHtspIsaLaFSPdGtlLA 385 (1010)
+++++.||.|++||+..- + .+.+=.+|.+.+..+||+.||.+|+
T Consensus 204 ---------------------LatgsaDg~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tSd~~~l~ 262 (397)
T KOG4283|consen 204 ---------------------LATGSADGAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTSDARYLA 262 (397)
T ss_pred ---------------------EEecCCCceEEEEEeecccceeEEeecccCccCccccccccccceeeeeeecccchhhh
Confidence 234456666777766421 1 2223357999999999999999999
Q ss_pred EEEcCCCeEEEEeCCC
Q 001814 386 TASVYGNNINIFRIMP 401 (1010)
Q Consensus 386 TAS~dGt~IrVwdi~p 401 (1010)
+++.|++ ||+|+...
T Consensus 263 ~~gtd~r-~r~wn~~~ 277 (397)
T KOG4283|consen 263 SCGTDDR-IRVWNMES 277 (397)
T ss_pred hccCccc-eEEeeccc
Confidence 9999776 99999753
No 129
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.18 E-value=2.7e-11 Score=142.40 Aligned_cols=113 Identities=17% Similarity=0.293 Sum_probs=95.9
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
.+..+|+|++||+...+.+.+|.+|..++..|.|+|-|.+.|++|.+ +.+.+||+... | +.+
T Consensus 87 agsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStd-td~~iwD~Rk~------G-----------c~~ 148 (825)
T KOG0267|consen 87 AGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYGEFFASGSTD-TDLKIWDIRKK------G-----------CSH 148 (825)
T ss_pred ccccCCceeeeehhhhhhhhhhhccccCcceeeeccceEEecccccc-ccceehhhhcc------C-----------cee
Confidence 46789999999999999999999999999999999999999999995 56999999632 3 455
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCC
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQG 476 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~ 476 (1010)
.+ +| +...|..+.|+|||+|++.+++|.+++||++...+-...+.+|..++
T Consensus 149 ~~-~s-~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~ef~~~e~~v 199 (825)
T KOG0267|consen 149 TY-KS-HTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGKLSKEFKSHEGKV 199 (825)
T ss_pred ee-cC-CcceeEEEeecCCCceeeccCCcceeeeecccccccccccccccccc
Confidence 44 45 45679999999999999999999999999997766667777776543
No 130
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.18 E-value=1.1e-09 Score=121.76 Aligned_cols=240 Identities=16% Similarity=0.149 Sum_probs=156.6
Q ss_pred CCCCCCcEE---EEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEee---eeccCCEEEEEEecCCCCCCCCCC
Q 001814 52 SEDLKDQVT---WAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELV---SKRDGPVSFLQMQPFPVKDDGCEG 125 (1010)
Q Consensus 52 ~~~~kd~V~---wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ell---S~hdGpV~~v~~lP~p~~s~~~D~ 125 (1010)
..+|.+.|. |+.-|..+ ..++..|-+..+++|-.+......+.+ .+|.++|.+|.+++++.
T Consensus 140 ~~Ght~~ik~v~~v~~n~~~------~~fvsas~Dqtl~Lw~~~~~~~~~~~~~~~~GHk~~V~sVsv~~sgt------- 206 (423)
T KOG0313|consen 140 IVGHTGPIKSVAWVIKNSSS------CLFVSASMDQTLRLWKWNVGENKVKALKVCRGHKRSVDSVSVDSSGT------- 206 (423)
T ss_pred EecCCcceeeeEEEecCCcc------ceEEEecCCceEEEEEecCchhhhhHHhHhcccccceeEEEecCCCC-------
Confidence 455555544 44444322 246666666789999997655444433 47889999999987652
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCC------------------------
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQ------------------------ 181 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlk------------------------ 181 (1010)
-+++ |+| +++|+||+..
T Consensus 207 --------r~~S---------------------gS~----------D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~ 247 (423)
T KOG0313|consen 207 --------RFCS---------------------GSW----------DTMLKIWSVETDEEDELESSSNRRRKKQKREKEG 247 (423)
T ss_pred --------eEEe---------------------ecc----------cceeeecccCCCccccccccchhhhhhhhhhhcc
Confidence 1222 233 4788999821
Q ss_pred -CCeEEEEEeCC-CcEEEEEEcCC--eEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--c
Q 001814 182 -SHCYEHVLRFR-SSVCMVRCSPR--IVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--R 255 (1010)
Q Consensus 182 -tge~V~tL~f~-S~V~sVa~S~r--lLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--R 255 (1010)
++..+-+|..| .+|.+|.+++. ...++-|.+|+.||+.+..+.-++.+... +..+++++ +
T Consensus 248 ~~r~P~vtl~GHt~~Vs~V~w~d~~v~yS~SwDHTIk~WDletg~~~~~~~~~ks--------------l~~i~~~~~~~ 313 (423)
T KOG0313|consen 248 GTRTPLVTLEGHTEPVSSVVWSDATVIYSVSWDHTIKVWDLETGGLKSTLTTNKS--------------LNCISYSPLSK 313 (423)
T ss_pred cccCceEEecccccceeeEEEcCCCceEeecccceEEEEEeecccceeeeecCcc--------------eeEeecccccc
Confidence 23345566555 68999999874 34466788999999999998888876322 34566666 7
Q ss_pred eEEEccCC--eeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCC
Q 001814 256 WLAYASNT--LLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNS 333 (1010)
Q Consensus 256 wLAyas~~--~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~ 333 (1010)
.||.++.. +++||- |. ..+++|. .+|.++.+-+ +....+|
T Consensus 314 Ll~~gssdr~irl~DP-R~----------------~~gs~v~---------------~s~~gH~nwV------ssvkwsp 355 (423)
T KOG0313|consen 314 LLASGSSDRHIRLWDP-RT----------------GDGSVVS---------------QSLIGHKNWV------SSVKWSP 355 (423)
T ss_pred eeeecCCCCceeecCC-CC----------------CCCceeE---------------Eeeecchhhh------hheecCC
Confidence 78777653 567771 10 1122322 2333333211 1111111
Q ss_pred ccccccccccccCCCCeEEEEECCCCc-EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 334 VWKVGRHAGADMDNAGIVVVKDFVTRA-IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 334 ~~k~~~~~iasgs~dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
...+++++++.|+++++||+.+-+ .+-.|.+|...|.++.|+ +|.+++||+.| ..|+||.-.
T Consensus 356 ---~~~~~~~S~S~D~t~klWDvRS~k~plydI~~h~DKvl~vdW~-~~~~IvSGGaD-~~l~i~~~~ 418 (423)
T KOG0313|consen 356 ---TNEFQLVSGSYDNTVKLWDVRSTKAPLYDIAGHNDKVLSVDWN-EGGLIVSGGAD-NKLRIFKGS 418 (423)
T ss_pred ---CCceEEEEEecCCeEEEEEeccCCCcceeeccCCceEEEEecc-CCceEEeccCc-ceEEEeccc
Confidence 112356789999999999999876 889999999999999998 46799999995 569999754
No 131
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.18 E-value=6.2e-10 Score=120.59 Aligned_cols=111 Identities=15% Similarity=0.222 Sum_probs=83.2
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc-eEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH-VHL 422 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~-~~L 422 (1010)
+..+-.|++-|+.+|..-++|.+|+..|.++.|+|... .|||||.||+ ||+|||... +|. +.-.+.+- +..
T Consensus 164 gtr~~~VrLCDi~SGs~sH~LsGHr~~vlaV~Wsp~~e~vLatgsaDg~-irlWDiRra-----sgc-f~~lD~hn~k~~ 236 (397)
T KOG4283|consen 164 GTRDVQVRLCDIASGSFSHTLSGHRDGVLAVEWSPSSEWVLATGSADGA-IRLWDIRRA-----SGC-FRVLDQHNTKRP 236 (397)
T ss_pred ecCCCcEEEEeccCCcceeeeccccCceEEEEeccCceeEEEecCCCce-EEEEEeecc-----cce-eEEeecccCccC
Confidence 34456799999999999999999999999999999887 6799999887 999999642 121 00000000 111
Q ss_pred EEEe-cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 423 YKLH-RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 423 ~~L~-RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
..++ +-.+..+|..+||+.|++++++++.|..+++|....
T Consensus 237 p~~~~n~ah~gkvngla~tSd~~~l~~~gtd~r~r~wn~~~ 277 (397)
T KOG4283|consen 237 PILKTNTAHYGKVNGLAWTSDARYLASCGTDDRIRVWNMES 277 (397)
T ss_pred ccccccccccceeeeeeecccchhhhhccCccceEEeeccc
Confidence 1122 333556899999999999999999999999999764
No 132
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.16 E-value=2e-08 Score=115.00 Aligned_cols=292 Identities=16% Similarity=0.201 Sum_probs=175.4
Q ss_pred CCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcce---Eeeeecc-CCEEEEEEecCCCCCCCCCCccccCc
Q 001814 56 KDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFN---ELVSKRD-GPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 56 kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~---ellS~hd-GpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.|-|.-|.|.. .+++++++...+-|-.|+... +.+. .++.+++ .-|.+|+|++++.
T Consensus 200 ne~v~~a~FHP------td~nliit~Gk~H~~Fw~~~~-~~l~k~~~~fek~ekk~Vl~v~F~engd------------- 259 (626)
T KOG2106|consen 200 NEVVFLATFHP------TDPNLIITCGKGHLYFWTLRG-GSLVKRQGIFEKREKKFVLCVTFLENGD------------- 259 (626)
T ss_pred cceEEEEEecc------CCCcEEEEeCCceEEEEEccC-CceEEEeeccccccceEEEEEEEcCCCC-------------
Confidence 35566666753 256788888888899999853 3333 3445554 4588999998761
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-CCcEEEEEEcC--CeEEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-RSSVCMVRCSP--RIVAVG 208 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~S~V~sVa~S~--rlLAV~ 208 (1010)
|++|| +.+.+.||+..+.+..++... +..|+++..-+ .+|.-+
T Consensus 260 ---viTgD-------------------------------S~G~i~Iw~~~~~~~~k~~~aH~ggv~~L~~lr~GtllSGg 305 (626)
T KOG2106|consen 260 ---VITGD-------------------------------SGGNILIWSKGTNRISKQVHAHDGGVFSLCMLRDGTLLSGG 305 (626)
T ss_pred ---EEeec-------------------------------CCceEEEEeCCCceEEeEeeecCCceEEEEEecCccEeecC
Confidence 33444 236799999988887776654 46899888765 455533
Q ss_pred eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc-ceEEEccCCeeecc------CCccCCCcCCCCC
Q 001814 209 LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP-RWLAYASNTLLLSN------SGRLSPQNLTPSG 281 (1010)
Q Consensus 209 ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp-RwLAyas~~~~iwd------~G~vs~Q~lt~p~ 281 (1010)
-|.+|..|| .+.+.+ .....|+ ++|+ |-+|-....+.+=. .|.+.
T Consensus 306 KDRki~~Wd-~~y~k~---r~~elPe----------------~~G~iRtv~e~~~di~vGTtrN~iL~Gt~~-------- 357 (626)
T KOG2106|consen 306 KDRKIILWD-DNYRKL---RETELPE----------------QFGPIRTVAEGKGDILVGTTRNFILQGTLE-------- 357 (626)
T ss_pred ccceEEecc-cccccc---ccccCch----------------hcCCeeEEecCCCcEEEeeccceEEEeeec--------
Confidence 455799999 333332 2222221 1111 22221111100000 01000
Q ss_pred CCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE
Q 001814 282 VSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI 361 (1010)
Q Consensus 282 vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~ 361 (1010)
.+-.+ +..++..+++. +...++ .-+++++++|+.|++|+ ..++
T Consensus 358 -------~~f~~------------------~v~gh~delwg------la~hps----~~q~~T~gqdk~v~lW~--~~k~ 400 (626)
T KOG2106|consen 358 -------NGFTL------------------TVQGHGDELWG------LATHPS----KNQLLTCGQDKHVRLWN--DHKL 400 (626)
T ss_pred -------CCceE------------------EEEecccceee------EEcCCC----hhheeeccCcceEEEcc--CCce
Confidence 00001 11111111110 000010 01356789999999999 4444
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
+.+.. -..|..|+.|+|.| .||.+...|+ --|.|++. +.+..++. .++.+..++|||
T Consensus 401 ~wt~~-~~d~~~~~~fhpsg-~va~Gt~~G~-w~V~d~e~------------------~~lv~~~~--d~~~ls~v~ysp 457 (626)
T KOG2106|consen 401 EWTKI-IEDPAECADFHPSG-VVAVGTATGR-WFVLDTET------------------QDLVTIHT--DNEQLSVVRYSP 457 (626)
T ss_pred eEEEE-ecCceeEeeccCcc-eEEEeeccce-EEEEeccc------------------ceeEEEEe--cCCceEEEEEcC
Confidence 43322 23688999999999 8898998887 55778753 23444443 257899999999
Q ss_pred CCCEEEEEeCCCeEEEEeCCCCCCcccc-ccccCCCCCCccCCCCCCCcccCCCCC
Q 001814 442 YSQWIAIVSSKGTCHVFVLSPFGGDSGF-QTLSSQGGDPYLFPVLSLPWWCTSSGI 496 (1010)
Q Consensus 442 Dg~~LAsgS~dGTVhIw~I~~~gg~~~~-~~H~s~~~~~~~~pv~~lpw~~~ss~~ 496 (1010)
||.+||+||.|+.|.||.++..|-.... ..|. -+|++.|-|...+.+.
T Consensus 458 ~G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~~-------gs~ithLDwS~Ds~~~ 506 (626)
T KOG2106|consen 458 DGAFLAVGSHDNHIYIYRVSANGRKYSRVGKCS-------GSPITHLDWSSDSQFL 506 (626)
T ss_pred CCCEEEEecCCCeEEEEEECCCCcEEEEeeeec-------CceeEEeeecCCCceE
Confidence 9999999999999999999987765432 2221 2788999987766644
No 133
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.14 E-value=2.3e-09 Score=121.68 Aligned_cols=176 Identities=11% Similarity=0.109 Sum_probs=129.6
Q ss_pred CCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC---CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP---RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGY 247 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~-S~V~sVa~S~---rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~ 247 (1010)
++.|.|||.++.+.+++++.| ..|.+++|.. ++.+.+.|..|++|++..+.-+.++.+|+..
T Consensus 223 d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~~s~vetlyGHqd~-------------- 288 (479)
T KOG0299|consen 223 DRHVQIWDCDTLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQLSYVETLYGHQDG-------------- 288 (479)
T ss_pred CceEEEecCcccchhhcccccccceeeeeeecCccceeeeecCCceEEEehhHhHHHHHHhCCccc--------------
Confidence 478999999999999998765 6899999965 5677788889999999988877777777651
Q ss_pred cceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCC
Q 001814 248 GPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSS 327 (1010)
Q Consensus 248 gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s 327 (1010)
.++++. .+.++ +
T Consensus 289 -v~~Ida------------L~reR---------------------~---------------------------------- 300 (479)
T KOG0299|consen 289 -VLGIDA------------LSRER---------------------C---------------------------------- 300 (479)
T ss_pred -eeeech------------hcccc---------------------e----------------------------------
Confidence 122210 00000 0
Q ss_pred CccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCC
Q 001814 328 PVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSG 407 (1010)
Q Consensus 328 ~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~ 407 (1010)
...++.|.+++||++..- .--.|++|...|-|++|=.+ ..++|+|.+|. |-+|++..-
T Consensus 301 --------------vtVGgrDrT~rlwKi~ee-sqlifrg~~~sidcv~~In~-~HfvsGSdnG~-IaLWs~~KK----- 358 (479)
T KOG0299|consen 301 --------------VTVGGRDRTVRLWKIPEE-SQLIFRGGEGSIDCVAFIND-EHFVSGSDNGS-IALWSLLKK----- 358 (479)
T ss_pred --------------EEeccccceeEEEecccc-ceeeeeCCCCCeeeEEEecc-cceeeccCCce-EEEeeeccc-----
Confidence 012457999999999543 33478999999999999654 67899999876 999998531
Q ss_pred CCCCccccCCcceEEEEEe--cccc--------cccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 408 SGNHKYDWNSSHVHLYKLH--RGIT--------SATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 408 sG~~~~~~~~s~~~L~~L~--RG~t--------~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.|+..+ -|.- +.+|.+|+-.|.+..+|+||-+|.|++|.+++.
T Consensus 359 ------------kplf~~~~AHgv~~~~~~~~~~~Witsla~i~~sdL~asGS~~G~vrLW~i~~g 412 (479)
T KOG0299|consen 359 ------------KPLFTSRLAHGVIPELDPVNGNFWITSLAVIPGSDLLASGSWSGCVRLWKIEDG 412 (479)
T ss_pred ------------CceeEeeccccccCCccccccccceeeeEecccCceEEecCCCCceEEEEecCC
Confidence 1222221 1111 238999999999999999999999999999864
No 134
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.13 E-value=5.3e-10 Score=121.76 Aligned_cols=104 Identities=16% Similarity=0.208 Sum_probs=85.6
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 341 AGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 341 ~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
+..+++-|.+..+||++++.++..|.+|.+.++-++-.|.-+|++|+|. ++++|+||..+. ..
T Consensus 286 Q~vTaSWDRTAnlwDVEtge~v~~LtGHd~ELtHcstHptQrLVvTsSr-DtTFRLWDFRea----------------I~ 348 (481)
T KOG0300|consen 286 QMVTASWDRTANLWDVETGEVVNILTGHDSELTHCSTHPTQRLVVTSSR-DTTFRLWDFREA----------------IQ 348 (481)
T ss_pred eeeeeeccccceeeeeccCceeccccCcchhccccccCCcceEEEEecc-CceeEeccchhh----------------cc
Confidence 3446778899999999999999999999999999999999999999998 677999998642 12
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
-+..| .|++ ..|.++.|.-|. .+++||+|.||+||++....
T Consensus 349 sV~VF-QGHt-dtVTS~vF~~dd-~vVSgSDDrTvKvWdLrNMR 389 (481)
T KOG0300|consen 349 SVAVF-QGHT-DTVTSVVFNTDD-RVVSGSDDRTVKVWDLRNMR 389 (481)
T ss_pred eeeee-cccc-cceeEEEEecCC-ceeecCCCceEEEeeecccc
Confidence 33334 5655 469999999775 56799999999999997643
No 135
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.12 E-value=5.4e-09 Score=123.03 Aligned_cols=99 Identities=13% Similarity=0.296 Sum_probs=79.2
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
-.|+||...+=..+..|..|+-.|+.|+|||||++|+++|. ++++.+|..+..- .+ ..-|.....
T Consensus 552 AvI~lw~t~~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsR-DRt~sl~~~~~~~----~~----------e~~fa~~k~ 616 (764)
T KOG1063|consen 552 AVIRLWNTANWLQVQELEGHSLTVTRLAFSPDGRYLLSVSR-DRTVSLYEVQEDI----KD----------EFRFACLKA 616 (764)
T ss_pred eEEEEEeccchhhhheecccceEEEEEEECCCCcEEEEeec-CceEEeeeeeccc----ch----------hhhhccccc
Confidence 46999999988778889999999999999999999999999 5679999986420 00 011222122
Q ss_pred cccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
|...||+.+|+||++++|++|.|.+|.||.+...
T Consensus 617 -HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~~~ 650 (764)
T KOG1063|consen 617 -HTRIIWDCSWSPDEKYFATASRDKKVKVWEEPDL 650 (764)
T ss_pred -cceEEEEcccCcccceeEEecCCceEEEEeccCc
Confidence 3347999999999999999999999999998753
No 136
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.10 E-value=9.6e-09 Score=124.01 Aligned_cols=223 Identities=16% Similarity=0.214 Sum_probs=153.8
Q ss_pred EEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccc
Q 001814 77 VLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGG 155 (1010)
Q Consensus 77 vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~ 155 (1010)
.|++|.. +.+++|... .++...+|-+=.-|++.+++.- ++.++|. |+
T Consensus 68 ~f~~~s~~~tv~~y~fp-s~~~~~iL~Rftlp~r~~~v~g-------------~g~~iaa--gs---------------- 115 (933)
T KOG1274|consen 68 HFLTGSEQNTVLRYKFP-SGEEDTILARFTLPIRDLAVSG-------------SGKMIAA--GS---------------- 115 (933)
T ss_pred ceEEeeccceEEEeeCC-CCCccceeeeeeccceEEEEec-------------CCcEEEe--ec----------------
Confidence 5556655 569999994 5667778887778899988763 3346553 11
Q ss_pred cccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-CCcEEEEEEcC--CeEEE-EeCCeEEEEECCCCceeEEEeecC
Q 001814 156 VRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-RSSVCMVRCSP--RIVAV-GLATQIYCFDALTLENKFSVLTYP 231 (1010)
Q Consensus 156 vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~S~V~sVa~S~--rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p 231 (1010)
.+..|++-++.....+..++- ..+|++|.++| .+||| ..+++|+|||+.++.+.+++..-+
T Consensus 116 ---------------dD~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~ 180 (933)
T KOG1274|consen 116 ---------------DDTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVD 180 (933)
T ss_pred ---------------CceeEEEEeccccchheeecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCC
Confidence 257899999998888888865 47999999998 58876 458899999999999888876432
Q ss_pred CccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccce
Q 001814 232 VPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSK 311 (1010)
Q Consensus 232 ~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~k 311 (1010)
-. + -+-. .| +. .-++.+|.+|.|++
T Consensus 181 k~----------n----~~~~-s~----------i~--------------~~~aW~Pk~g~la~---------------- 205 (933)
T KOG1274|consen 181 KD----------N----EFIL-SR----------IC--------------TRLAWHPKGGTLAV---------------- 205 (933)
T ss_pred cc----------c----cccc-cc----------ee--------------eeeeecCCCCeEEe----------------
Confidence 10 0 0000 00 00 00112344444432
Q ss_pred eeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc--CCCCeEEEEECCCCCEEEEEEc
Q 001814 312 TLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA--HTSPISALCFDPSGTLLVTASV 389 (1010)
Q Consensus 312 tls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a--HtspIsaLaFSPdGtlLATAS~ 389 (1010)
...++.|++|+..+......++. |.+.++.++|||+|+|||+++.
T Consensus 206 ---------------------------------~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~ 252 (933)
T KOG1274|consen 206 ---------------------------------PPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTL 252 (933)
T ss_pred ---------------------------------eccCCeEEEEccCCceeheeecccccccceEEEEEcCCCcEEeeecc
Confidence 24578899999988877666653 4455999999999999999999
Q ss_pred CCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 390 YGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 390 dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
+|. |-|||+... .+ ++++ ..|.+++|-|++.-|-.-...|+.-+|
T Consensus 253 ~g~-I~vWnv~t~----------------~~--~~~~-----~~Vc~~aw~p~~n~it~~~~~g~~~~~ 297 (933)
T KOG1274|consen 253 DGQ-ILVWNVDTH----------------ER--HEFK-----RAVCCEAWKPNANAITLITALGTLGVS 297 (933)
T ss_pred CCc-EEEEecccc----------------hh--cccc-----ceeEEEecCCCCCeeEEEeeccccccC
Confidence 876 889998532 01 2222 258999999998888777766665444
No 137
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.09 E-value=6.8e-08 Score=101.82 Aligned_cols=99 Identities=21% Similarity=0.313 Sum_probs=81.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEecc--C-----CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcccc
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKA--H-----TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDW 415 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~a--H-----tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~ 415 (1010)
+++++|.+|+.||+.-...+.++.. | .+.|.+++.+|.|++||++-.| ....+|||.- |
T Consensus 198 ~sgsqdktirfwdlrv~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~d-ssc~lydirg-------~------ 263 (350)
T KOG0641|consen 198 ASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHAD-SSCMLYDIRG-------G------ 263 (350)
T ss_pred EccCCCceEEEEeeeccceeeeccCcccCCCcccceeEEEEECCCcceeeeccCC-CceEEEEeeC-------C------
Confidence 4678999999999988877776642 2 3689999999999999999885 5588999962 2
Q ss_pred CCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 416 NSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 416 ~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
+.++.++- +.+.|.++-|||...|+.++|-|..|++=++.
T Consensus 264 ----r~iq~f~p--hsadir~vrfsp~a~yllt~syd~~ikltdlq 303 (350)
T KOG0641|consen 264 ----RMIQRFHP--HSADIRCVRFSPGAHYLLTCSYDMKIKLTDLQ 303 (350)
T ss_pred ----ceeeeeCC--CccceeEEEeCCCceEEEEecccceEEEeecc
Confidence 56777653 56789999999999999999999999998764
No 138
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.08 E-value=7.2e-09 Score=120.42 Aligned_cols=315 Identities=17% Similarity=0.169 Sum_probs=183.4
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
.+|++.|..+..| +.+..|+.|.++| ++||-+. +|.+...+.. |+.|++|++.|.+. .+
T Consensus 397 rGHtg~Vr~iSvd-------p~G~wlasGsdDGtvriWEi~-TgRcvr~~~~-d~~I~~vaw~P~~~-----------~~ 456 (733)
T KOG0650|consen 397 RGHTGLVRSISVD-------PSGEWLASGSDDGTVRIWEIA-TGRCVRTVQF-DSEIRSVAWNPLSD-----------LC 456 (733)
T ss_pred eccCCeEEEEEec-------CCcceeeecCCCCcEEEEEee-cceEEEEEee-cceeEEEEecCCCC-----------ce
Confidence 4677777777766 2567999999988 8999995 4655555543 67899999999873 36
Q ss_pred EEEEEecCCCCCCCCCCCCCCcccc-ccCCcCCCCCCCCCCCCEEEEEeCC---CCe--EEEEEeCCCcEEEEEEcC--C
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGV-RDGMMDSQSGNCVNSPTAVRFYSFQ---SHC--YEHVLRFRSSVCMVRCSP--R 203 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~v-r~gs~d~~~~~~~~sp~tVrIWDlk---tge--~V~tL~f~S~V~sVa~S~--r 203 (1010)
+|||..++...-..+ -..+.+-.. ...-+...+ +....+..|-.|.-. ..+ .-.++++...|..|.+.+ +
T Consensus 457 vLAvA~~~~~~ivnp-~~G~~~e~~~t~ell~~~~-~~~~p~~~~~~W~~~~~~e~~~~v~~~I~~~k~i~~vtWHrkGD 534 (733)
T KOG0650|consen 457 VLAVAVGECVLIVNP-IFGDRLEVGPTKELLASAP-NESEPDAAVVTWSRASLDELEKGVCIVIKHPKSIRQVTWHRKGD 534 (733)
T ss_pred eEEEEecCceEEeCc-cccchhhhcchhhhhhcCC-CccCCcccceeechhhhhhhccceEEEEecCCccceeeeecCCc
Confidence 777644332100000 000000000 000000000 111234678889643 222 223578889999999987 6
Q ss_pred eEEEEeC----CeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc---ceEEEccCCeeeccCCccCCCc
Q 001814 204 IVAVGLA----TQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP---RWLAYASNTLLLSNSGRLSPQN 276 (1010)
Q Consensus 204 lLAV~ld----~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp---RwLAyas~~~~iwd~G~vs~Q~ 276 (1010)
+|++... ..|.|+++...+.. .|-. .. .| ....+.|.| .++......+++.|...
T Consensus 535 YlatV~~~~~~~~VliHQLSK~~sQ-----~PF~-ks-kG------~vq~v~FHPs~p~lfVaTq~~vRiYdL~k----- 596 (733)
T KOG0650|consen 535 YLATVMPDSGNKSVLIHQLSKRKSQ-----SPFR-KS-KG------LVQRVKFHPSKPYLFVATQRSVRIYDLSK----- 596 (733)
T ss_pred eEEEeccCCCcceEEEEeccccccc-----Cchh-hc-CC------ceeEEEecCCCceEEEEeccceEEEehhH-----
Confidence 7876554 46999987753321 1110 00 00 011233433 22222333444554210
Q ss_pred CCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEEC
Q 001814 277 LTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDF 356 (1010)
Q Consensus 277 lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl 356 (1010)
.+..|.+..|. ++. +.++.++.+. .++-++.++.+..||+
T Consensus 597 ----------------------qelvKkL~tg~-----kwi---------S~msihp~GD----nli~gs~d~k~~WfDl 636 (733)
T KOG0650|consen 597 ----------------------QELVKKLLTGS-----KWI---------SSMSIHPNGD----NLILGSYDKKMCWFDL 636 (733)
T ss_pred ----------------------HHHHHHHhcCC-----eee---------eeeeecCCCC----eEEEecCCCeeEEEEc
Confidence 01123333332 111 1222233221 2345678999999999
Q ss_pred CCC-cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcccc--CCcceEEEEEeccccc--
Q 001814 357 VTR-AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDW--NSSHVHLYKLHRGITS-- 431 (1010)
Q Consensus 357 ~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~--~~s~~~L~~L~RG~t~-- 431 (1010)
.-. +...+++-|...|.+++|.+.=-|+|+||.||+ +.||.-+-. .+. +.-.+.|.+| ||+..
T Consensus 637 dlsskPyk~lr~H~~avr~Va~H~ryPLfas~sdDgt-v~Vfhg~VY----------~Dl~qnpliVPlK~L-~gH~~~~ 704 (733)
T KOG0650|consen 637 DLSSKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGT-VIVFHGMVY----------NDLLQNPLIVPLKRL-RGHEKTN 704 (733)
T ss_pred ccCcchhHHhhhhhhhhhhhhhccccceeeeecCCCc-EEEEeeeee----------hhhhcCCceEeeeec-cCceeec
Confidence 754 467889999999999999999999999999887 668864321 011 1123455555 45432
Q ss_pred -ccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 432 -ATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 432 -a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
..|.++.|.|.--||.+++.||||++|.
T Consensus 705 ~~gVLd~~wHP~qpWLfsAGAd~tirlfT 733 (733)
T KOG0650|consen 705 DLGVLDTIWHPRQPWLFSAGADGTIRLFT 733 (733)
T ss_pred ccceEeecccCCCceEEecCCCceEEeeC
Confidence 2488999999999999999999999993
No 139
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.08 E-value=2.5e-08 Score=117.39 Aligned_cols=246 Identities=14% Similarity=0.172 Sum_probs=166.0
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
+.+-|.+.|+ +..+++.+|.++.+.-||+.+ .+-..-+....|++..+++.|.. ..
T Consensus 69 drsIE~L~W~----------e~~RLFS~g~sg~i~EwDl~~-lk~~~~~d~~gg~IWsiai~p~~-------------~~ 124 (691)
T KOG2048|consen 69 DRSIESLAWA----------EGGRLFSSGLSGSITEWDLHT-LKQKYNIDSNGGAIWSIAINPEN-------------TI 124 (691)
T ss_pred CCceeeEEEc----------cCCeEEeecCCceEEEEeccc-CceeEEecCCCcceeEEEeCCcc-------------ce
Confidence 3455667776 135788888888899999964 43334444456889999988654 24
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC---CCcEEEEEEcCC--eEEE
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF---RSSVCMVRCSPR--IVAV 207 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f---~S~V~sVa~S~r--lLAV 207 (1010)
++| +|+ ++.+.+.+...++......| .++|++|.+++. .|+.
T Consensus 125 l~I-gcd--------------------------------dGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~ 171 (691)
T KOG2048|consen 125 LAI-GCD--------------------------------DGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAG 171 (691)
T ss_pred EEe-ecC--------------------------------CceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEe
Confidence 555 443 14566667767666555444 379999999993 3566
Q ss_pred Ee-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCc
Q 001814 208 GL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPST 286 (1010)
Q Consensus 208 ~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~st 286 (1010)
|. |+.|+|||+.+...++.+ +-.. +-++ .+...++|..
T Consensus 172 Gs~Dg~Iriwd~~~~~t~~~~-~~~~-----------------d~l~------k~~~~iVWSv----------------- 210 (691)
T KOG2048|consen 172 GSIDGVIRIWDVKSGQTLHII-TMQL-----------------DRLS------KREPTIVWSV----------------- 210 (691)
T ss_pred cccCceEEEEEcCCCceEEEe-eecc-----------------cccc------cCCceEEEEE-----------------
Confidence 65 456999999988877622 2111 0000 0122334420
Q ss_pred CCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEec
Q 001814 287 SPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFK 366 (1010)
Q Consensus 287 SP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~ 366 (1010)
+ | +.+ .++++++..|+|++||...+.++..+.
T Consensus 211 ------~----------------------~----Lrd----------------~tI~sgDS~G~V~FWd~~~gTLiqS~~ 242 (691)
T KOG2048|consen 211 ------L----------------------F----LRD----------------STIASGDSAGTVTFWDSIFGTLIQSHS 242 (691)
T ss_pred ------E----------------------E----eec----------------CcEEEecCCceEEEEcccCcchhhhhh
Confidence 0 0 000 034577889999999999999999999
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
.|..-|.||+-++++.+|++|+.|+++|++... +. + ++| +...+|-.+...|.+++-.++ .|
T Consensus 243 ~h~adVl~Lav~~~~d~vfsaGvd~~ii~~~~~-~~------~---~~w------v~~~~r~~h~hdvrs~av~~~--~l 304 (691)
T KOG2048|consen 243 CHDADVLALAVADNEDRVFSAGVDPKIIQYSLT-TN------K---SEW------VINSRRDLHAHDVRSMAVIEN--AL 304 (691)
T ss_pred hhhcceeEEEEcCCCCeEEEccCCCceEEEEec-CC------c---cce------eeeccccCCcccceeeeeecc--eE
Confidence 999999999999999999999999997776654 21 1 111 222334445567999999987 88
Q ss_pred EEEeCCCeEEEEeCCC
Q 001814 447 AIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 447 AsgS~dGTVhIw~I~~ 462 (1010)
.+|+.|.|+-|=....
T Consensus 305 ~sgG~d~~l~i~~s~~ 320 (691)
T KOG2048|consen 305 ISGGRDFTLAICSSRE 320 (691)
T ss_pred EecceeeEEEEccccc
Confidence 8999999987755443
No 140
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.08 E-value=3e-09 Score=124.98 Aligned_cols=134 Identities=18% Similarity=0.173 Sum_probs=93.8
Q ss_pred CCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECC---CCCEEEEEEcCCCeEEEEeCCCCccc--
Q 001814 331 PNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDP---SGTLLVTASVYGNNINIFRIMPSCMR-- 405 (1010)
Q Consensus 331 ~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSP---dGtlLATAS~dGt~IrVwdi~p~~~~-- 405 (1010)
.+|.++++ ++|+.-|+++|||++..+....+.||.+.|.||.||. ..+|||+||. |+.|+|||+..++..
T Consensus 467 vSp~gqhL----AsGDr~GnlrVy~Lq~l~~~~~~eAHesEilcLeyS~p~~~~kLLASasr-dRlIHV~Dv~rny~l~q 541 (1080)
T KOG1408|consen 467 VSPDGQHL----ASGDRGGNLRVYDLQELEYTCFMEAHESEILCLEYSFPVLTNKLLASASR-DRLIHVYDVKRNYDLVQ 541 (1080)
T ss_pred ECCCccee----cccCccCceEEEEehhhhhhhheecccceeEEEeecCchhhhHhhhhccC-CceEEEEecccccchhh
Confidence 44555554 4788999999999999999999999999999999984 3479999998 789999999865421
Q ss_pred -----CC---------CC---------C-CccccCCcc--eEEEEEeccccc---ccEEEEEEccCCCEEEEEeCCCeEE
Q 001814 406 -----SG---------SG---------N-HKYDWNSSH--VHLYKLHRGITS---ATIQDICFSHYSQWIAIVSSKGTCH 456 (1010)
Q Consensus 406 -----~~---------sG---------~-~~~~~~~s~--~~L~~L~RG~t~---a~I~sIAFSpDg~~LAsgS~dGTVh 456 (1010)
++ .| + ++.-+.... ---..|.|+++. ..+++++..|..++++++..|..|+
T Consensus 542 tld~HSssITsvKFa~~gln~~MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k~v~t~cQDrnir 621 (1080)
T KOG1408|consen 542 TLDGHSSSITSVKFACNGLNRKMISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSKLVVTVCQDRNIR 621 (1080)
T ss_pred hhcccccceeEEEEeecCCceEEEeccCchhhheehhccccCceeccccccccccceEEEeeeCCCcceEEEEecccceE
Confidence 00 01 0 000000000 000112233321 2599999999999999999999999
Q ss_pred EEeCCCCCCcccc
Q 001814 457 VFVLSPFGGDSGF 469 (1010)
Q Consensus 457 Iw~I~~~gg~~~~ 469 (1010)
||+++..+....+
T Consensus 622 if~i~sgKq~k~F 634 (1080)
T KOG1408|consen 622 IFDIESGKQVKSF 634 (1080)
T ss_pred EEeccccceeeee
Confidence 9999876554444
No 141
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.05 E-value=1.8e-08 Score=110.20 Aligned_cols=246 Identities=16% Similarity=0.111 Sum_probs=157.1
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
..+|.|.-+.|.. ..+.|+++..+| +++||++.. .+ .+.=.|++|+-..+|-+.-
T Consensus 11 pP~d~IS~v~f~~-------~~~~LLvssWDgslrlYdv~~~-~l-~~~~~~~~plL~c~F~d~~--------------- 66 (323)
T KOG1036|consen 11 PPEDGISSVKFSP-------SSSDLLVSSWDGSLRLYDVPAN-SL-KLKFKHGAPLLDCAFADES--------------- 66 (323)
T ss_pred CChhceeeEEEcC-------cCCcEEEEeccCcEEEEeccch-hh-hhheecCCceeeeeccCCc---------------
Confidence 4589999999983 334556666665 999999643 22 2333578899988876421
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC---CeEEEEe
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP---RIVAVGL 209 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~---rlLAV~l 209 (1010)
-++ .|+ .++.|+.+|+.+++......+..+|.+|..++ .+|+.+-
T Consensus 67 ~~~-~G~-------------------------------~dg~vr~~Dln~~~~~~igth~~~i~ci~~~~~~~~vIsgsW 114 (323)
T KOG1036|consen 67 TIV-TGG-------------------------------LDGQVRRYDLNTGNEDQIGTHDEGIRCIEYSYEVGCVISGSW 114 (323)
T ss_pred eEE-Eec-------------------------------cCceEEEEEecCCcceeeccCCCceEEEEeeccCCeEEEccc
Confidence 122 221 25789999999998766667778999999985 3555667
Q ss_pred CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCC
Q 001814 210 ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPG 289 (1010)
Q Consensus 210 d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~ 289 (1010)
|++|++||.++-....++.. +.. ...|.++
T Consensus 115 D~~ik~wD~R~~~~~~~~d~-~kk-------------Vy~~~v~------------------------------------ 144 (323)
T KOG1036|consen 115 DKTIKFWDPRNKVVVGTFDQ-GKK-------------VYCMDVS------------------------------------ 144 (323)
T ss_pred CccEEEEecccccccccccc-Cce-------------EEEEecc------------------------------------
Confidence 89999999886222111110 000 0001110
Q ss_pred CCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc--
Q 001814 290 GSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA-- 367 (1010)
Q Consensus 290 ~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a-- 367 (1010)
+.. +.-+..+..|.+||+.+....-+.+.
T Consensus 145 g~~-------------------------------------------------LvVg~~~r~v~iyDLRn~~~~~q~reS~ 175 (323)
T KOG1036|consen 145 GNR-------------------------------------------------LVVGTSDRKVLIYDLRNLDEPFQRRESS 175 (323)
T ss_pred CCE-------------------------------------------------EEEeecCceEEEEEcccccchhhhcccc
Confidence 001 11235677899999988765444443
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec----ccccc-cEEEEEEccC
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR----GITSA-TIQDICFSHY 442 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R----G~t~a-~I~sIAFSpD 442 (1010)
-.-.+.+|++-|++.=.|.+|.+|++ -|=.+.+.. -.++.+-.++.+| |..-. .|.+|+|+|-
T Consensus 176 lkyqtR~v~~~pn~eGy~~sSieGRV-avE~~d~s~-----------~~~skkyaFkCHr~~~~~~~~~yPVNai~Fhp~ 243 (323)
T KOG1036|consen 176 LKYQTRCVALVPNGEGYVVSSIEGRV-AVEYFDDSE-----------EAQSKKYAFKCHRLSEKDTEIIYPVNAIAFHPI 243 (323)
T ss_pred ceeEEEEEEEecCCCceEEEeecceE-EEEccCCch-----------HHhhhceeEEeeecccCCceEEEEeceeEeccc
Confidence 23578999999999989999999984 443332210 0011122233333 21111 5899999999
Q ss_pred CCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 443 SQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 443 g~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
-+.||+|+.||-|.+|++.+.+..
T Consensus 244 ~~tfaTgGsDG~V~~Wd~~~rKrl 267 (323)
T KOG1036|consen 244 HGTFATGGSDGIVNIWDLFNRKRL 267 (323)
T ss_pred cceEEecCCCceEEEccCcchhhh
Confidence 999999999999999999876543
No 142
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=99.04 E-value=5.7e-07 Score=95.42 Aligned_cols=178 Identities=23% Similarity=0.321 Sum_probs=124.8
Q ss_pred CCEEEEEeCCC-CeEEEEEeCC-CcEEEEEEcC--CeEEEEe--CCeEEEEECCCCceeEEEeecCCccccCCCcccccc
Q 001814 172 PTAVRFYSFQS-HCYEHVLRFR-SSVCMVRCSP--RIVAVGL--ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINV 245 (1010)
Q Consensus 172 p~tVrIWDlkt-ge~V~tL~f~-S~V~sVa~S~--rlLAV~l--d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv 245 (1010)
++++++||... ...+..+..+ ..|..+.+++ +.++++. +..+++|++.+.+.+.++..+..+
T Consensus 133 d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 200 (466)
T COG2319 133 DGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDP------------ 200 (466)
T ss_pred CccEEEEEecCCCeEEEEEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCCCc------------
Confidence 46899999998 7777777766 5888999988 3566554 678999999886655555432221
Q ss_pred CccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCC
Q 001814 246 GYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGS 325 (1010)
Q Consensus 246 ~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs 325 (1010)
. ..+++... +..
T Consensus 201 -v-------~~~~~~~~---------------------------~~~--------------------------------- 212 (466)
T COG2319 201 -V-------SSLAFSPD---------------------------GGL--------------------------------- 212 (466)
T ss_pred -e-------EEEEEcCC---------------------------cce---------------------------------
Confidence 0 01112110 000
Q ss_pred CCCccCCCccccccccccccCCCCeEEEEECCCCcEEE-EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc
Q 001814 326 SSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIIS-QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM 404 (1010)
Q Consensus 326 ~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~-~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~ 404 (1010)
.++++..++.|++||...+..+. .+..|.... ...|+|++.++++++.++. +++|++...
T Consensus 213 ---------------~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~~-~~~~~~~~~-- 273 (466)
T COG2319 213 ---------------LIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDGT-IRLWDLRSS-- 273 (466)
T ss_pred ---------------EEEEecCCCcEEEEECCCCcEEeeecCCCCcce-eEeECCCCCEEEEecCCCc-EEEeeecCC--
Confidence 01123578899999999888887 799998886 4489999999998888665 999998532
Q ss_pred cCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 405 RSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 405 ~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
. ..+..+ .++ ...|.++.|+|++..+++++.|+++++|++....
T Consensus 274 ----~----------~~~~~~-~~~-~~~v~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 317 (466)
T COG2319 274 ----S----------SLLRTL-SGH-SSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGK 317 (466)
T ss_pred ----C----------cEEEEE-ecC-CccEEEEEECCCCCEEEEeeCCCcEEEEEcCCCc
Confidence 1 123333 333 4579999999999999999999999999776543
No 143
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.03 E-value=1.2e-08 Score=115.97 Aligned_cols=221 Identities=14% Similarity=0.199 Sum_probs=141.1
Q ss_pred CCCCeEEEEEec-CcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCC
Q 001814 72 SVFKQVLLLGYQ-NGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNR 150 (1010)
Q Consensus 72 ~~~~~vLalGy~-~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~ 150 (1010)
.+.+.+|+.|.- +.+-+|.+ .+|.+..+++.|=..|++++|.-++. + ++ +++
T Consensus 90 ~n~G~~l~ag~i~g~lYlWel-ssG~LL~v~~aHYQ~ITcL~fs~dgs-------------~-ii-Tgs----------- 142 (476)
T KOG0646|consen 90 SNLGYFLLAGTISGNLYLWEL-SSGILLNVLSAHYQSITCLKFSDDGS-------------H-II-TGS----------- 142 (476)
T ss_pred CCCceEEEeecccCcEEEEEe-ccccHHHHHHhhccceeEEEEeCCCc-------------E-EE-ecC-----------
Confidence 356888888854 55999999 56889999999999999999986551 3 32 221
Q ss_pred CCccccccCCcCCCCCCCCCCCCEEEEEeCC---------CCeEEEEEeCCC-cEEEEEEc-----CCeEEEEeCCeEEE
Q 001814 151 SHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQ---------SHCYEHVLRFRS-SVCMVRCS-----PRIVAVGLATQIYC 215 (1010)
Q Consensus 151 ~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlk---------tge~V~tL~f~S-~V~sVa~S-----~rlLAV~ld~~I~I 215 (1010)
.|+.|.+|++. +-+.++.+..|. +|.++.+. ++++.++.|.+|+|
T Consensus 143 --------------------kDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~ 202 (476)
T KOG0646|consen 143 --------------------KDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKL 202 (476)
T ss_pred --------------------CCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEE
Confidence 13568888652 223455555555 78777764 36777888899999
Q ss_pred EECCCCceeEEEeecCCccccCCCccccccCccceEEcc-ceEEEccCCe-eeccCCccCCCcCCCCCCCCCcCCCCCce
Q 001814 216 FDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP-RWLAYASNTL-LLSNSGRLSPQNLTPSGVSPSTSPGGSSL 293 (1010)
Q Consensus 216 wD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp-RwLAyas~~~-~iwd~G~vs~Q~lt~p~vS~stSP~~gsl 293 (1010)
||+..+..+.++.. |. ....+++.| -..-|.|... .+|-
T Consensus 203 wdlS~g~LLlti~f-p~-------------si~av~lDpae~~~yiGt~~G~I~~------------------------- 243 (476)
T KOG0646|consen 203 WDLSLGVLLLTITF-PS-------------SIKAVALDPAERVVYIGTEEGKIFQ------------------------- 243 (476)
T ss_pred EEeccceeeEEEec-CC-------------cceeEEEcccccEEEecCCcceEEe-------------------------
Confidence 99999988877764 32 134577777 3334554311 1110
Q ss_pred EEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCC--C
Q 001814 294 VARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTS--P 371 (1010)
Q Consensus 294 Va~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHts--p 371 (1010)
. . +.. ++ ++ ..+=.++.++.. +..+..|.+|.. +
T Consensus 244 -~-------------~------~~~--~~-----~~----------------~~~v~~k~~~~~-~t~~~~~~Gh~~~~~ 279 (476)
T KOG0646|consen 244 -N-------------L------LFK--LS-----GQ----------------SAGVNQKGRHEE-NTQINVLVGHENESA 279 (476)
T ss_pred -e-------------e------hhc--CC-----cc----------------cccccccccccc-cceeeeeccccCCcc
Confidence 0 0 000 00 00 000012233322 445678889988 9
Q ss_pred eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccC
Q 001814 372 ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
|+||+.|-||++|++++.||. ++|||+.. .+++.++.. ....|+.+.|.|-
T Consensus 280 ITcLais~DgtlLlSGd~dg~-VcvWdi~S-----------------~Q~iRtl~~--~kgpVtnL~i~~~ 330 (476)
T KOG0646|consen 280 ITCLAISTDGTLLLSGDEDGK-VCVWDIYS-----------------KQCIRTLQT--SKGPVTNLQINPL 330 (476)
T ss_pred eeEEEEecCccEEEeeCCCCC-EEEEecch-----------------HHHHHHHhh--hccccceeEeecc
Confidence 999999999999999999887 89999842 245544421 1235777777664
No 144
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.03 E-value=1.3e-09 Score=133.35 Aligned_cols=140 Identities=15% Similarity=0.126 Sum_probs=100.5
Q ss_pred cccC--CCCeEEEEECCC------------CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCC
Q 001814 343 ADMD--NAGIVVVKDFVT------------RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGS 408 (1010)
Q Consensus 343 asgs--~dG~V~VwDl~s------------~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~s 408 (1010)
++++ .||.++||.... .+.+.+..-|++.|+|+.|+|||++||+||.| +.|-||.-.+.......
T Consensus 29 aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~dG~~lAsGSDD-~~v~iW~~~~~~~~~~f 107 (942)
T KOG0973|consen 29 ATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVNCVRFSPDGSYLASGSDD-RLVMIWERAEIGSGTVF 107 (942)
T ss_pred ecCCccccccceeeccccccchhhhhhcccchhheeeccccCceeEEEECCCCCeEeeccCc-ceEEEeeecccCCcccc
Confidence 4555 688888998753 23567778899999999999999999999995 67999997641000000
Q ss_pred CCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCC
Q 001814 409 GNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPV 484 (1010)
Q Consensus 409 G~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv 484 (1010)
|.....-....+..+...||| ...|.+++||||+.+||++|.|++|+||+...++....++.|.+.++|-...|+
T Consensus 108 gs~g~~~~vE~wk~~~~l~~H-~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~ 182 (942)
T KOG0973|consen 108 GSTGGAKNVESWKVVSILRGH-DSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPI 182 (942)
T ss_pred cccccccccceeeEEEEEecC-CCccceeccCCCccEEEEecccceEEEEccccceeeeeeecccccccceEECCc
Confidence 100000011112233344775 467999999999999999999999999999999777788999888766544444
No 145
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.01 E-value=4.9e-08 Score=113.49 Aligned_cols=90 Identities=17% Similarity=0.090 Sum_probs=56.8
Q ss_pred CCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 348 AGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 348 dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
+|.++||++... .....+ .|.. ..++|+|||++||.++. +. |.+||+.. |. . ..+.
T Consensus 313 ~g~~~I~~~~~~~~~~~~l-~~~~--~~~~~SpDG~~ia~~~~-~~-i~~~Dl~~-------g~----------~-~~lt 369 (429)
T PRK01742 313 SGSPQVYRMSASGGGASLV-GGRG--YSAQISADGKTLVMING-DN-VVKQDLTS-------GS----------T-EVLS 369 (429)
T ss_pred CCCceEEEEECCCCCeEEe-cCCC--CCccCCCCCCEEEEEcC-CC-EEEEECCC-------CC----------e-EEec
Confidence 444566655321 112222 3433 45789999999998876 34 55688753 31 1 1222
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+. ...+++|+|||++|+.++.+|++.+|.+-..
T Consensus 370 ~~~---~~~~~~~sPdG~~i~~~s~~g~~~~l~~~~~ 403 (429)
T PRK01742 370 STF---LDESPSISPNGIMIIYSSTQGLGKVLQLVSA 403 (429)
T ss_pred CCC---CCCCceECCCCCEEEEEEcCCCceEEEEEEC
Confidence 221 2356889999999999999999998887443
No 146
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=98.99 E-value=1.9e-08 Score=110.67 Aligned_cols=101 Identities=20% Similarity=0.294 Sum_probs=76.3
Q ss_pred eEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccc
Q 001814 76 QVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGG 155 (1010)
Q Consensus 76 ~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~ 155 (1010)
-+.+.||.+-|+|.|+. .+.+..-+-+|.+.|+.+++.|. +|-|++. +
T Consensus 107 ~la~~G~~GvIrVid~~-~~~~~~~~~ghG~sINeik~~p~-------------~~qlvls-~----------------- 154 (385)
T KOG1034|consen 107 FLAAGGYLGVIRVIDVV-SGQCSKNYRGHGGSINEIKFHPD-------------RPQLVLS-A----------------- 154 (385)
T ss_pred eEEeecceeEEEEEecc-hhhhccceeccCccchhhhcCCC-------------CCcEEEE-e-----------------
Confidence 34444544558999995 57788888889999999997764 3444442 2
Q ss_pred cccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe----CCCcEEEEEEcC--CeEE-EEeCCeEEEEECCCCc
Q 001814 156 VRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR----FRSSVCMVRCSP--RIVA-VGLATQIYCFDALTLE 222 (1010)
Q Consensus 156 vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~----f~S~V~sVa~S~--rlLA-V~ld~~I~IwD~~Tle 222 (1010)
+.+..||+||++++.||..+. ++..|++|+++. ++|+ .|.|.+|++|++...+
T Consensus 155 --------------SkD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~~~~ 214 (385)
T KOG1034|consen 155 --------------SKDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLNVKE 214 (385)
T ss_pred --------------cCCceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecChhH
Confidence 135789999999999999984 457999999987 4555 5778899999998543
No 147
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=98.99 E-value=2.7e-09 Score=121.73 Aligned_cols=204 Identities=22% Similarity=0.233 Sum_probs=129.7
Q ss_pred EEEEeCCCcEEEEEEcC--CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEcc--ceEEEcc
Q 001814 186 EHVLRFRSSVCMVRCSP--RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYAS 261 (1010)
Q Consensus 186 V~tL~f~S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas 261 (1010)
+++|.+-..|+++..+. +.+..|..+.|+|||+.......-+...+. ...-|. ...+-|.+ |-|-..+
T Consensus 413 ~~tL~HGEvVcAvtIS~~trhVyTgGkgcVKVWdis~pg~k~PvsqLdc-------l~rdny-iRSckL~pdgrtLivGG 484 (705)
T KOG0639|consen 413 INTLAHGEVVCAVTISNPTRHVYTGGKGCVKVWDISQPGNKSPVSQLDC-------LNRDNY-IRSCKLLPDGRTLIVGG 484 (705)
T ss_pred hhhhccCcEEEEEEecCCcceeEecCCCeEEEeeccCCCCCCccccccc-------cCcccc-eeeeEecCCCceEEecc
Confidence 44555556788999986 788889999999999986543221111000 000011 23345545 6665555
Q ss_pred C--CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCcccccc
Q 001814 262 N--TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGR 339 (1010)
Q Consensus 262 ~--~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~ 339 (1010)
. ++.|||...-++. +. ..-+| + .-.+.++..+++.+..
T Consensus 485 eastlsiWDLAapTpr-ik----aelts----s------------------------------apaCyALa~spDakvc- 524 (705)
T KOG0639|consen 485 EASTLSIWDLAAPTPR-IK----AELTS----S------------------------------APACYALAISPDAKVC- 524 (705)
T ss_pred ccceeeeeeccCCCcc-hh----hhcCC----c------------------------------chhhhhhhcCCcccee-
Confidence 3 5678884211100 00 00000 0 0011122233443322
Q ss_pred ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 340 HAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 340 ~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
-++-.||.|.|||+.+..++.+|++|+..++||.+++||+.|=|++-| .++|-||+.. |
T Consensus 525 ---FsccsdGnI~vwDLhnq~~VrqfqGhtDGascIdis~dGtklWTGGlD-ntvRcWDlre-------g---------- 583 (705)
T KOG0639|consen 525 ---FSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISKDGTKLWTGGLD-NTVRCWDLRE-------G---------- 583 (705)
T ss_pred ---eeeccCCcEEEEEcccceeeecccCCCCCceeEEecCCCceeecCCCc-cceeehhhhh-------h----------
Confidence 145679999999999999999999999999999999999999999995 5599999963 3
Q ss_pred eEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 420 VHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 420 ~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
+++.+. .-...|.++..+|.+.|||+|=..+.|-|-...
T Consensus 584 rqlqqh---dF~SQIfSLg~cP~~dWlavGMens~vevlh~s 622 (705)
T KOG0639|consen 584 RQLQQH---DFSSQIFSLGYCPTGDWLAVGMENSNVEVLHTS 622 (705)
T ss_pred hhhhhh---hhhhhheecccCCCccceeeecccCcEEEEecC
Confidence 233221 112469999999999999999998876665443
No 148
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=98.98 E-value=2.1e-08 Score=114.13 Aligned_cols=114 Identities=15% Similarity=0.159 Sum_probs=82.6
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc---
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH--- 419 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~--- 419 (1010)
.+++.|.++++||+..+.++.++.- ..+|.+++.+|-++.+..++.+|. |-++++... .|. +.+.....
T Consensus 192 ~TaS~D~t~k~wdlS~g~LLlti~f-p~si~av~lDpae~~~yiGt~~G~-I~~~~~~~~-----~~~-~~~v~~k~~~~ 263 (476)
T KOG0646|consen 192 YTASEDRTIKLWDLSLGVLLLTITF-PSSIKAVALDPAERVVYIGTEEGK-IFQNLLFKL-----SGQ-SAGVNQKGRHE 263 (476)
T ss_pred EEecCCceEEEEEeccceeeEEEec-CCcceeEEEcccccEEEecCCcce-EEeeehhcC-----Ccc-ccccccccccc
Confidence 4668899999999999988876653 468999999999999999999886 666665321 010 01111111
Q ss_pred --eEEEEEeccccc-ccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 420 --VHLYKLHRGITS-ATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 420 --~~L~~L~RG~t~-a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.....| -|+.+ ..|++++.|-||..|++|+.||+|.||++...-.
T Consensus 264 ~~t~~~~~-~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~ 311 (476)
T KOG0646|consen 264 ENTQINVL-VGHENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQC 311 (476)
T ss_pred ccceeeee-ccccCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHH
Confidence 122233 34444 4799999999999999999999999999976433
No 149
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=98.96 E-value=4.7e-09 Score=121.86 Aligned_cols=99 Identities=20% Similarity=0.247 Sum_probs=78.2
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
.....|+|||+....++..+..-..-|+.|+.+|.|.-|+.++.+++ +.+||+.-. ..-|+-
T Consensus 584 aTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~k-~~WfDldls-----------------skPyk~ 645 (733)
T KOG0650|consen 584 ATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDKK-MCWFDLDLS-----------------SKPYKT 645 (733)
T ss_pred EeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCe-eEEEEcccC-----------------cchhHH
Confidence 45678999999998888888877889999999999999999999665 889998531 022332
Q ss_pred ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.|-+ ...|.+++|.+-=-.+|++|+|||++||.-.-|
T Consensus 646 lr~H-~~avr~Va~H~ryPLfas~sdDgtv~Vfhg~VY 682 (733)
T KOG0650|consen 646 LRLH-EKAVRSVAFHKRYPLFASGSDDGTVIVFHGMVY 682 (733)
T ss_pred hhhh-hhhhhhhhhccccceeeeecCCCcEEEEeeeee
Confidence 2333 345999999998899999999999999975443
No 150
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=98.95 E-value=9.2e-09 Score=120.29 Aligned_cols=114 Identities=15% Similarity=0.149 Sum_probs=73.2
Q ss_pred cccC-CCCeEEEEECCCCcE--------EEEeccC---CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCC
Q 001814 343 ADMD-NAGIVVVKDFVTRAI--------ISQFKAH---TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGN 410 (1010)
Q Consensus 343 asgs-~dG~V~VwDl~s~~~--------v~~~~aH---tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~ 410 (1010)
++++ .|+.|+|||+..... +..+.-| .-.+.+|+.+..|++|.....|++ |..|++... +.
T Consensus 233 aSaga~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~s-Iy~ynm~s~------s~ 305 (720)
T KOG0321|consen 233 ASAGAADSTIKVWDLRKNYTAYRQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNS-IYFYNMRSL------SI 305 (720)
T ss_pred eeccCCCcceEEEeecccccccccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecCCc-EEEEecccc------Cc
Confidence 3444 599999999986432 1223334 336888999999987765555555 999998542 10
Q ss_pred CccccCCcceEEEEEecccccc--cEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccc-ccccC
Q 001814 411 HKYDWNSSHVHLYKLHRGITSA--TIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGF-QTLSS 474 (1010)
Q Consensus 411 ~~~~~~~s~~~L~~L~RG~t~a--~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~-~~H~s 474 (1010)
..+..+ .|.-+. .|. -..|||+++|++||.|+.+.||.++.....+.+ .+|..
T Consensus 306 ---------sP~~~~-sg~~~~sf~vk-s~lSpd~~~l~SgSsd~~ayiw~vs~~e~~~~~l~Ght~ 361 (720)
T KOG0321|consen 306 ---------SPVAEF-SGKLNSSFYVK-SELSPDDCSLLSGSSDEQAYIWVVSSPEAPPALLLGHTR 361 (720)
T ss_pred ---------Cchhhc-cCcccceeeee-eecCCCCceEeccCCCcceeeeeecCccCChhhhhCcce
Confidence 011111 111111 122 347899999999999999999999887665544 45643
No 151
>KOG4328 consensus WD40 protein [Function unknown]
Probab=98.94 E-value=2.7e-08 Score=113.05 Aligned_cols=252 Identities=16% Similarity=0.153 Sum_probs=150.5
Q ss_pred CcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEcc---CCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcE
Q 001814 57 DQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVE---DASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPF 132 (1010)
Q Consensus 57 d~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~---~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpL 132 (1010)
++|.-..|. +...++++++|...| +-+||+. .-..-..++..|.++|..|.|.|... + .
T Consensus 187 ~Rit~l~fH-----Pt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~----------s--~ 249 (498)
T KOG4328|consen 187 RRITSLAFH-----PTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANT----------S--Q 249 (498)
T ss_pred cceEEEEec-----ccCcceEEEEccCCCcEEEEecCCCCCccCceEEeccCCccccceEecCCCh----------h--h
Confidence 344444444 444678999999887 9999993 22222355677899999999998641 1 1
Q ss_pred EEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeE--EEEEeCC-CcEEEEEEcC---CeEE
Q 001814 133 LLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCY--EHVLRFR-SSVCMVRCSP---RIVA 206 (1010)
Q Consensus 133 LAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~--V~tL~f~-S~V~sVa~S~---rlLA 206 (1010)
+.. .++++++|+-|++++.. +.+++-. ....++.++. .+|+
T Consensus 250 i~s---------------------------------sSyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~ 296 (498)
T KOG4328|consen 250 IYS---------------------------------SSYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLF 296 (498)
T ss_pred eee---------------------------------eccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEE
Confidence 121 12468999999987653 3333322 2456666654 3444
Q ss_pred EEeCCeEEEEECCCCceeEEEee-cCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 207 VGLATQIYCFDALTLENKFSVLT-YPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 207 V~ld~~I~IwD~~Tle~l~tL~t-~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
+..-+...+||.++....+.... |... .+.+++.|
T Consensus 297 ~~~~G~f~~iD~R~~~s~~~~~~lh~kK-------------I~sv~~NP------------------------------- 332 (498)
T KOG4328|consen 297 GDNVGNFNVIDLRTDGSEYENLRLHKKK-------------ITSVALNP------------------------------- 332 (498)
T ss_pred eecccceEEEEeecCCccchhhhhhhcc-------------cceeecCC-------------------------------
Confidence 44445788898887553221111 1110 11122211
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc----E
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA----I 361 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~----~ 361 (1010)
.- ..-+++++.|++++|||+.... .
T Consensus 333 ------------------------------------------------~~---p~~laT~s~D~T~kIWD~R~l~~K~sp 361 (498)
T KOG4328|consen 333 ------------------------------------------------VC---PWFLATASLDQTAKIWDLRQLRGKASP 361 (498)
T ss_pred ------------------------------------------------CC---chheeecccCcceeeeehhhhcCCCCc
Confidence 00 0013467889999999997633 2
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
+-...+|+.+|.+..|||+|-.|+|.+.|. .|||||..-. ++.. ......+|-...-|-. .+.-.+|.|
T Consensus 362 ~lst~~HrrsV~sAyFSPs~gtl~TT~~D~-~IRv~dss~~------sa~~-~p~~~I~Hn~~t~Rwl---T~fKA~W~P 430 (498)
T KOG4328|consen 362 FLSTLPHRRSVNSAYFSPSGGTLLTTCQDN-EIRVFDSSCI------SAKD-EPLGTIPHNNRTGRWL---TPFKAAWDP 430 (498)
T ss_pred ceecccccceeeeeEEcCCCCceEeeccCC-ceEEeecccc------cccC-CccceeeccCcccccc---cchhheeCC
Confidence 334567999999999999998899999965 5999997310 0000 0000001111110111 244568999
Q ss_pred CCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 442 YSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 442 Dg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
|..+|+++-.-.-|-||+ ..+++
T Consensus 431 ~~~li~vg~~~r~IDv~~--~~~~q 453 (498)
T KOG4328|consen 431 DYNLIVVGRYPRPIDVFD--GNGGQ 453 (498)
T ss_pred CccEEEEeccCcceeEEc--CCCCE
Confidence 999999999888888877 34444
No 152
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=98.94 E-value=5.2e-08 Score=114.82 Aligned_cols=190 Identities=19% Similarity=0.253 Sum_probs=127.3
Q ss_pred CCEEEEEeCCCCeE-EEEEeCC--CcEEEEEEcC--CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccC
Q 001814 172 PTAVRFYSFQSHCY-EHVLRFR--SSVCMVRCSP--RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVG 246 (1010)
Q Consensus 172 p~tVrIWDlktge~-V~tL~f~--S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~ 246 (1010)
++.|.||+++.+=+ ..+|..+ ..|-++++.+ +++.+++++.|.-||+.+++.++.+..... .
T Consensus 46 ~g~IEiwN~~~~w~~~~vi~g~~drsIE~L~W~e~~RLFS~g~sg~i~EwDl~~lk~~~~~d~~gg-------------~ 112 (691)
T KOG2048|consen 46 DGNIEIWNLSNNWFLEPVIHGPEDRSIESLAWAEGGRLFSSGLSGSITEWDLHTLKQKYNIDSNGG-------------A 112 (691)
T ss_pred CCcEEEEccCCCceeeEEEecCCCCceeeEEEccCCeEEeecCCceEEEEecccCceeEEecCCCc-------------c
Confidence 46799999987643 3334433 5799999974 778889999999999999998877653211 0
Q ss_pred ccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCC
Q 001814 247 YGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSS 326 (1010)
Q Consensus 247 ~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~ 326 (1010)
.-.+|+.| .+..+
T Consensus 113 IWsiai~p----------------------------------~~~~l--------------------------------- 125 (691)
T KOG2048|consen 113 IWSIAINP----------------------------------ENTIL--------------------------------- 125 (691)
T ss_pred eeEEEeCC----------------------------------ccceE---------------------------------
Confidence 11222221 11111
Q ss_pred CCccCCCccccccccccccCCCCeEEEEECCCCcEE--EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc
Q 001814 327 SPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAII--SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM 404 (1010)
Q Consensus 327 s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v--~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~ 404 (1010)
+-+..||.+..++...+.+. ..|.--++.|.+|+|+|+|+.||+|+.|| +||+||+..
T Consensus 126 ----------------~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg-~Iriwd~~~--- 185 (691)
T KOG2048|consen 126 ----------------AIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDG-VIRIWDVKS--- 185 (691)
T ss_pred ----------------EeecCCceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCc-eEEEEEcCC---
Confidence 12346776666666665543 34445678999999999999999999966 599999963
Q ss_pred cCCCCCCccccCCcceE-----EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCC
Q 001814 405 RSGSGNHKYDWNSSHVH-----LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQ 475 (1010)
Q Consensus 405 ~~~sG~~~~~~~~s~~~-----L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~ 475 (1010)
|. -.+ +.++.++ ..+.||++.|=.|+ .||+|-+.|||.+|+-...-....++.|.+.
T Consensus 186 ----~~--------t~~~~~~~~d~l~k~-~~~iVWSv~~Lrd~-tI~sgDS~G~V~FWd~~~gTLiqS~~~h~ad 247 (691)
T KOG2048|consen 186 ----GQ--------TLHIITMQLDRLSKR-EPTIVWSVLFLRDS-TIASGDSAGTVTFWDSIFGTLIQSHSCHDAD 247 (691)
T ss_pred ----Cc--------eEEEeeecccccccC-CceEEEEEEEeecC-cEEEecCCceEEEEcccCcchhhhhhhhhcc
Confidence 21 012 2233333 34569999998776 6779999999999998766555555556544
No 153
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.92 E-value=1.4e-09 Score=123.29 Aligned_cols=209 Identities=15% Similarity=0.189 Sum_probs=135.6
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcEEEEEE--cCCeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccce
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSVCMVRC--SPRIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPM 250 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V~sVa~--S~rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gpl 250 (1010)
+.|--+|.+++...+.+.....|++|.| +.+++||+...-+||||-. +..+++|..+.. ...+
T Consensus 151 GHlAa~Dw~t~~L~~Ei~v~Etv~Dv~~LHneq~~AVAQK~y~yvYD~~-GtElHClk~~~~--------------v~rL 215 (545)
T KOG1272|consen 151 GHLAAFDWVTKKLHFEINVMETVRDVTFLHNEQFFAVAQKKYVYVYDNN-GTELHCLKRHIR--------------VARL 215 (545)
T ss_pred cceeeeecccceeeeeeehhhhhhhhhhhcchHHHHhhhhceEEEecCC-CcEEeehhhcCc--------------hhhh
Confidence 5688899999999999999999999999 4589999999999999954 456777775422 1233
Q ss_pred EEcc--ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCC
Q 001814 251 AVGP--RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSP 328 (1010)
Q Consensus 251 Algp--RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~ 328 (1010)
.|-| -+||.++.. |-+ +...+|. |.+|+.+ ..|. |....
T Consensus 216 eFLPyHfLL~~~~~~------G~L-----~Y~DVS~------GklVa~~--------~t~~--------------G~~~v 256 (545)
T KOG1272|consen 216 EFLPYHFLLVAASEA------GFL-----KYQDVST------GKLVASI--------RTGA--------------GRTDV 256 (545)
T ss_pred cccchhheeeecccC------Cce-----EEEeech------hhhhHHH--------HccC--------------Cccch
Confidence 4444 223333321 111 0001111 2222211 1110 00111
Q ss_pred ccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCC
Q 001814 329 VSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGS 408 (1010)
Q Consensus 329 ~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~s 408 (1010)
...||-. + .+-.|...|+|.+|.-.+.+.+..+..|.+||++|+++++|+++||++. ++.++|||+...
T Consensus 257 m~qNP~N---a-Vih~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~-Dr~~kIWDlR~~------ 325 (545)
T KOG1272|consen 257 MKQNPYN---A-VIHLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGL-DRKVKIWDLRNF------ 325 (545)
T ss_pred hhcCCcc---c-eEEEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeeccc-ccceeEeeeccc------
Confidence 1222211 1 1235788999999999999999999999999999999999999999999 567999999531
Q ss_pred CCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 409 GNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 409 G~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
.++..++. +.....++||.-| .||.+- -..|+||.=..
T Consensus 326 -----------~ql~t~~t---p~~a~~ls~Sqkg-lLA~~~-G~~v~iw~d~~ 363 (545)
T KOG1272|consen 326 -----------YQLHTYRT---PHPASNLSLSQKG-LLALSY-GDHVQIWKDAL 363 (545)
T ss_pred -----------cccceeec---CCCcccccccccc-ceeeec-CCeeeeehhhh
Confidence 34444432 3346789999654 454443 33599997443
No 154
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=98.91 E-value=1.4e-07 Score=107.44 Aligned_cols=96 Identities=17% Similarity=0.322 Sum_probs=71.7
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
.+..|.|.|.-..+++.+..|+- .+.|+.++|+.||+.|..++.+|. |-||++... .++++
T Consensus 321 ~G~~G~I~lLhakT~eli~s~Ki-eG~v~~~~fsSdsk~l~~~~~~Ge-V~v~nl~~~-----------------~~~~r 381 (514)
T KOG2055|consen 321 AGNNGHIHLLHAKTKELITSFKI-EGVVSDFTFSSDSKELLASGGTGE-VYVWNLRQN-----------------SCLHR 381 (514)
T ss_pred cccCceEEeehhhhhhhhheeee-ccEEeeEEEecCCcEEEEEcCCce-EEEEecCCc-----------------ceEEE
Confidence 35566777777777777766663 367899999999998888888785 899999531 24444
Q ss_pred Ee--cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 425 LH--RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 425 L~--RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+. .+.+ =.+++.|++++|||+||+.|-|.||+.+.
T Consensus 382 f~D~G~v~---gts~~~S~ng~ylA~GS~~GiVNIYd~~s 418 (514)
T KOG2055|consen 382 FVDDGSVH---GTSLCISLNGSYLATGSDSGIVNIYDGNS 418 (514)
T ss_pred EeecCccc---eeeeeecCCCceEEeccCcceEEEeccch
Confidence 42 2222 25688899999999999999999999754
No 155
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=98.89 E-value=2.4e-06 Score=90.72 Aligned_cols=223 Identities=17% Similarity=0.227 Sum_probs=147.6
Q ss_pred Ee-cCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCccccccC
Q 001814 81 GY-QNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDG 159 (1010)
Q Consensus 81 Gy-~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~g 159 (1010)
+. ++.+++||+.........+..|...|..+.+.|+. ..++. ...
T Consensus 130 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-------------~~~~~-~~~-------------------- 175 (466)
T COG2319 130 SSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG-------------KLLAS-GSS-------------------- 175 (466)
T ss_pred CCCCccEEEEEecCCCeEEEEEecCcccEEEEEECCCC-------------CEEEe-cCC--------------------
Confidence 44 34589999964245667778888999999998765 13332 110
Q ss_pred CcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC---eEEE-EeCCeEEEEECCCCceeE-EEeecCCc
Q 001814 160 MMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR---IVAV-GLATQIYCFDALTLENKF-SVLTYPVP 233 (1010)
Q Consensus 160 s~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r---lLAV-~ld~~I~IwD~~Tle~l~-tL~t~p~p 233 (1010)
.++++++|++.++..+..+..+ ..|..+++++. +++. +.+..|++||..+.+... .+..+...
T Consensus 176 -----------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~ 244 (466)
T COG2319 176 -----------LDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDS 244 (466)
T ss_pred -----------CCCceEEEEcCCCceEEeeccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEeeecCCCCcc
Confidence 1468999999998999888864 58999999873 3444 456789999877555544 22222210
Q ss_pred cccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceee
Q 001814 234 QLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTL 313 (1010)
Q Consensus 234 ~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktl 313 (1010)
. +. . ++|.+..
T Consensus 245 --------------~-~~------~---------------------------~~~~~~~--------------------- 255 (466)
T COG2319 245 --------------V-VS------S---------------------------FSPDGSL--------------------- 255 (466)
T ss_pred --------------e-eE------e---------------------------ECCCCCE---------------------
Confidence 0 00 0 0111100
Q ss_pred ccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE-EEEeccCCCCeEEEEECCCCCEEEEEEcCCC
Q 001814 314 SKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI-ISQFKAHTSPISALCFDPSGTLLVTASVYGN 392 (1010)
Q Consensus 314 s~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~-v~~~~aHtspIsaLaFSPdGtlLATAS~dGt 392 (1010)
++.+..++.+++||+..... +..+..|..+|.++.|+|++..+++++.+ .
T Consensus 256 ----------------------------~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~d-~ 306 (466)
T COG2319 256 ----------------------------LASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSD-G 306 (466)
T ss_pred ----------------------------EEEecCCCcEEEeeecCCCcEEEEEecCCccEEEEEECCCCCEEEEeeCC-C
Confidence 11346789999999987664 55557899999999999999999998887 4
Q ss_pred eEEEEeCCCCcccCCCCCCccccCCcceEEEEEe-cccccccEEEEEEccCCCEEEEE-eCCCeEEEEeCCCCC
Q 001814 393 NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH-RGITSATIQDICFSHYSQWIAIV-SSKGTCHVFVLSPFG 464 (1010)
Q Consensus 393 ~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~-RG~t~a~I~sIAFSpDg~~LAsg-S~dGTVhIw~I~~~g 464 (1010)
.+++|++... ....... .++. ..|..+.|++++..++.+ ..++++.+|++....
T Consensus 307 ~~~~~~~~~~-----------------~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 362 (466)
T COG2319 307 TVRLWDLETG-----------------KLLSSLTLKGHE-GPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGK 362 (466)
T ss_pred cEEEEEcCCC-----------------ceEEEeeecccC-CceEEEEECCCCCEEEEeecCCCcEEeeecCCCc
Confidence 5999987531 1222221 2322 358999995443566666 678999999987654
No 156
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.89 E-value=1e-06 Score=102.55 Aligned_cols=95 Identities=12% Similarity=0.125 Sum_probs=59.3
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.++|+.+++. ..+..+...+...+|||||++||..+.++. .|.+|++.. + ....+..
T Consensus 315 ~Iy~~d~~g~~~-~~lt~~~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~-------~-----------~~~~lt~ 375 (435)
T PRK05137 315 QLYVMNADGSNP-RRISFGGGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDG-------S-----------GERILTS 375 (435)
T ss_pred eEEEEECCCCCe-EEeecCCCcccCeEECCCCCEEEEEEcCCCceEEEEEECCC-------C-----------ceEeccC
Confidence 577788766544 334334455667899999999998775433 466666531 1 1122322
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCe----EEEEeCCCCCCc
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGT----CHVFVLSPFGGD 466 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGT----VhIw~I~~~gg~ 466 (1010)
+ ..+.+.+|||||++|+..+.++. ..||.++..++.
T Consensus 376 ~---~~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~ 415 (435)
T PRK05137 376 G---FLVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRN 415 (435)
T ss_pred C---CCCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCc
Confidence 2 23678999999999988776432 356666554543
No 157
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=98.88 E-value=5.2e-08 Score=111.39 Aligned_cols=208 Identities=15% Similarity=0.270 Sum_probs=142.2
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
....++.|..++ ++|||+. ..-++..+..|...|.+|.+.-+ -.++|.|+-
T Consensus 90 ~S~y~~sgG~~~~Vkiwdl~-~kl~hr~lkdh~stvt~v~YN~~-------------DeyiAsvs~-------------- 141 (673)
T KOG4378|consen 90 QSLYEISGGQSGCVKIWDLR-AKLIHRFLKDHQSTVTYVDYNNT-------------DEYIASVSD-------------- 141 (673)
T ss_pred cceeeeccCcCceeeehhhH-HHHHhhhccCCcceeEEEEecCC-------------cceeEEecc--------------
Confidence 346777777776 7999996 56677888889999999987632 246776542
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC--cEEEEEEcC--C-eEEE-EeCCeEEEEECCCCceeEE
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS--SVCMVRCSP--R-IVAV-GLATQIYCFDALTLENKFS 226 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S--~V~sVa~S~--r-lLAV-~ld~~I~IwD~~Tle~l~t 226 (1010)
.+-|.|-.++++..-.++...+ .|+-+++++ + +|.. +.++.|++||+..+...+.
T Consensus 142 -------------------gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~~ 202 (673)
T KOG4378|consen 142 -------------------GGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIFH 202 (673)
T ss_pred -------------------CCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCCCcccc
Confidence 1348888999998888887763 567888876 3 4444 4557899999988776554
Q ss_pred Eee-cCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhh
Q 001814 227 VLT-YPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQF 305 (1010)
Q Consensus 227 L~t-~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~l 305 (1010)
... |..| ..-+++ +|++-.+
T Consensus 203 ~~~~HsAP-------------~~gicf----------------------------------spsne~l------------ 223 (673)
T KOG4378|consen 203 ASEAHSAP-------------CRGICF----------------------------------SPSNEAL------------ 223 (673)
T ss_pred hhhhccCC-------------cCccee----------------------------------cCCccce------------
Confidence 432 3222 001122 2222211
Q ss_pred hcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEE
Q 001814 306 AAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLV 385 (1010)
Q Consensus 306 a~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLA 385 (1010)
+++.+.|..|.+||..+.+....|.. ..|+++|+|+++|++|+
T Consensus 224 ------------------------------------~vsVG~Dkki~~yD~~s~~s~~~l~y-~~Plstvaf~~~G~~L~ 266 (673)
T KOG4378|consen 224 ------------------------------------LVSVGYDKKINIYDIRSQASTDRLTY-SHPLSTVAFSECGTYLC 266 (673)
T ss_pred ------------------------------------EEEecccceEEEeecccccccceeee-cCCcceeeecCCceEEE
Confidence 23456899999999998776666654 46999999999999999
Q ss_pred EEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC
Q 001814 386 TASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS 443 (1010)
Q Consensus 386 TAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg 443 (1010)
.++.+|. |.-||+.- . ...+..+. .+.+.|++|+|-|--
T Consensus 267 aG~s~G~-~i~YD~R~--------~--------k~Pv~v~s--ah~~sVt~vafq~s~ 305 (673)
T KOG4378|consen 267 AGNSKGE-LIAYDMRS--------T--------KAPVAVRS--AHDASVTRVAFQPSP 305 (673)
T ss_pred eecCCce-EEEEeccc--------C--------CCCceEee--ecccceeEEEeeecc
Confidence 9999898 66799852 1 12333332 244569999998643
No 158
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.87 E-value=1.5e-08 Score=109.01 Aligned_cols=70 Identities=23% Similarity=0.285 Sum_probs=57.0
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
.|+-+.+-||++.||||+.||+ ||||.-.+ + +.|..|.. |.+.|+++||+||+..+|++|
T Consensus 253 Gv~gvrIRpD~KIlATAGWD~R-iRVyswrt-------l----------~pLAVLky--Hsagvn~vAfspd~~lmAaas 312 (323)
T KOG0322|consen 253 GVSGVRIRPDGKILATAGWDHR-IRVYSWRT-------L----------NPLAVLKY--HSAGVNAVAFSPDCELMAAAS 312 (323)
T ss_pred CccceEEccCCcEEeecccCCc-EEEEEecc-------C----------Cchhhhhh--hhcceeEEEeCCCCchhhhcc
Confidence 4666778889999999999887 99998754 2 24444432 446799999999999999999
Q ss_pred CCCeEEEEeC
Q 001814 451 SKGTCHVFVL 460 (1010)
Q Consensus 451 ~dGTVhIw~I 460 (1010)
.|++|-+|++
T Consensus 313 kD~rISLWkL 322 (323)
T KOG0322|consen 313 KDARISLWKL 322 (323)
T ss_pred CCceEEeeec
Confidence 9999999986
No 159
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.86 E-value=1.2e-06 Score=102.11 Aligned_cols=83 Identities=16% Similarity=0.108 Sum_probs=52.9
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 351 VVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
|.++|+.++.. ..+..+.......+|||||++||.++.++ ..|.+|++.. | .+..|..+
T Consensus 313 Iy~~d~~~g~~-~~lt~~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~-------g-----------~~~~Lt~~ 373 (429)
T PRK03629 313 VYKVNINGGAP-QRITWEGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLAT-------G-----------GVQVLTDT 373 (429)
T ss_pred EEEEECCCCCe-EEeecCCCCccCEEECCCCCEEEEEEccCCCceEEEEECCC-------C-----------CeEEeCCC
Confidence 44456655543 33333444566789999999998876544 3467777742 2 12233322
Q ss_pred cccccEEEEEEccCCCEEEEEeCCCeE
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKGTC 455 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dGTV 455 (1010)
. ...+.+|||||++|+.++.++..
T Consensus 374 ~---~~~~p~~SpDG~~i~~~s~~~~~ 397 (429)
T PRK03629 374 F---LDETPSIAPNGTMVIYSSSQGMG 397 (429)
T ss_pred C---CCCCceECCCCCEEEEEEcCCCc
Confidence 1 23568899999999999988763
No 160
>KOG4328 consensus WD40 protein [Function unknown]
Probab=98.85 E-value=6.9e-08 Score=109.83 Aligned_cols=100 Identities=23% Similarity=0.282 Sum_probs=78.2
Q ss_pred CCCCeEEEEECCCCc-EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 346 DNAGIVVVKDFVTRA-IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
+.=|...+||+.++. ....++-|...|..|+|+|-- .+|||||.|++ .+|||+.... +. ++ -.|+
T Consensus 298 ~~~G~f~~iD~R~~~s~~~~~~lh~kKI~sv~~NP~~p~~laT~s~D~T-~kIWD~R~l~-----~K------~s-p~ls 364 (498)
T KOG4328|consen 298 DNVGNFNVIDLRTDGSEYENLRLHKKKITSVALNPVCPWFLATASLDQT-AKIWDLRQLR-----GK------AS-PFLS 364 (498)
T ss_pred ecccceEEEEeecCCccchhhhhhhcccceeecCCCCchheeecccCcc-eeeeehhhhc-----CC------CC-ccee
Confidence 445688999998765 477888999999999999976 58999999765 8999996421 21 11 1345
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.+. |...|.+..|||++-.|++.+.|.+|+||+..
T Consensus 365 t~~---HrrsV~sAyFSPs~gtl~TT~~D~~IRv~dss 399 (498)
T KOG4328|consen 365 TLP---HRRSVNSAYFSPSGGTLLTTCQDNEIRVFDSS 399 (498)
T ss_pred ccc---ccceeeeeEEcCCCCceEeeccCCceEEeecc
Confidence 442 34469999999999889999999999999985
No 161
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.83 E-value=3.4e-08 Score=117.63 Aligned_cols=102 Identities=11% Similarity=0.229 Sum_probs=85.3
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCC-CCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPS-GTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPd-GtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
+++|++||+|++||+...+-..++++....|..+.|+|. +..+|++.. +.++++||+...
T Consensus 149 liSGSQDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~d-sG~lqlWDlRqp------------------ 209 (839)
T KOG0269|consen 149 LISGSQDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHD-SGYLQLWDLRQP------------------ 209 (839)
T ss_pred EEecCCCceEEEEeeecccccccccccchhhhceeeccCCCceEEEecC-CceEEEeeccCc------------------
Confidence 568999999999999999999999999999999999984 567888877 556999999631
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
..+.++-..|+..|.++.|+|+..|||+|+.|++|+||++..
T Consensus 210 ~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~ 251 (839)
T KOG0269|consen 210 DRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTD 251 (839)
T ss_pred hhHHHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccC
Confidence 222333334667899999999999999999999999999974
No 162
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.82 E-value=1.1e-07 Score=107.91 Aligned_cols=129 Identities=19% Similarity=0.284 Sum_probs=99.5
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecC-cEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQN-GFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLH 130 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~-G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~sr 130 (1010)
.++|+|-|+...+++ ..++||+.|..+ ++++||++ .+++..++..|.+.|.++++-|..
T Consensus 239 ~~gHTdavl~Ls~n~------~~~nVLaSgsaD~TV~lWD~~-~g~p~~s~~~~~k~Vq~l~wh~~~------------- 298 (463)
T KOG0270|consen 239 ASGHTDAVLALSWNR------NFRNVLASGSADKTVKLWDVD-TGKPKSSITHHGKKVQTLEWHPYE------------- 298 (463)
T ss_pred cccchHHHHHHHhcc------ccceeEEecCCCceEEEEEcC-CCCcceehhhcCCceeEEEecCCC-------------
Confidence 467899999888886 247899999975 59999995 688999999999999999988643
Q ss_pred cEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCC-CeEEEEEeCCCcEEEEEEcC---CeEE
Q 001814 131 PFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQS-HCYEHVLRFRSSVCMVRCSP---RIVA 206 (1010)
Q Consensus 131 pLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlkt-ge~V~tL~f~S~V~sVa~S~---rlLA 206 (1010)
|.+++ +| + ++++|++.|.+. +..-..++|.+.|-.|++++ ..+.
T Consensus 299 p~~LL-sG---------------------s----------~D~~V~l~D~R~~~~s~~~wk~~g~VEkv~w~~~se~~f~ 346 (463)
T KOG0270|consen 299 PSVLL-SG---------------------S----------YDGTVALKDCRDPSNSGKEWKFDGEVEKVAWDPHSENSFF 346 (463)
T ss_pred ceEEE-ec---------------------c----------ccceEEeeeccCccccCceEEeccceEEEEecCCCceeEE
Confidence 34443 22 1 368999999994 33445678999999999987 3445
Q ss_pred EEe-CCeEEEEECCCC-ceeEEEeecCC
Q 001814 207 VGL-ATQIYCFDALTL-ENKFSVLTYPV 232 (1010)
Q Consensus 207 V~l-d~~I~IwD~~Tl-e~l~tL~t~p~ 232 (1010)
+++ ++.+|=||++.. ++++++..|..
T Consensus 347 ~~tddG~v~~~D~R~~~~~vwt~~AHd~ 374 (463)
T KOG0270|consen 347 VSTDDGTVYYFDIRNPGKPVWTLKAHDD 374 (463)
T ss_pred EecCCceEEeeecCCCCCceeEEEeccC
Confidence 554 578999999987 77888887754
No 163
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=98.81 E-value=2e-08 Score=111.99 Aligned_cols=105 Identities=21% Similarity=0.237 Sum_probs=84.2
Q ss_pred ccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 342 GADMDNAGIVVVKDFVTR---AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~---~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
+++++.||.|+|||+.++ .++. .+||.+-|+.|.|+.+-.+||+++.+|+ ++|||+.... +|
T Consensus 273 faScS~DgsIrIWDiRs~~~~~~~~-~kAh~sDVNVISWnr~~~lLasG~DdGt-~~iwDLR~~~----~~--------- 337 (440)
T KOG0302|consen 273 FASCSCDGSIRIWDIRSGPKKAAVS-TKAHNSDVNVISWNRREPLLASGGDDGT-LSIWDLRQFK----SG--------- 337 (440)
T ss_pred EEeeecCceEEEEEecCCCccceeE-eeccCCceeeEEccCCcceeeecCCCce-EEEEEhhhcc----CC---------
Confidence 568899999999999987 3443 3899999999999999999999999886 9999996431 12
Q ss_pred ceEEEEEecccccccEEEEEEcc-CCCEEEEEeCCCeEEEEeCCCCC
Q 001814 419 HVHLYKLHRGITSATIQDICFSH-YSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSp-Dg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
+.+..|.+ |.+.|++|.|+| +...||+++.|.+|.||++....
T Consensus 338 -~pVA~fk~--Hk~pItsieW~p~e~s~iaasg~D~QitiWDlsvE~ 381 (440)
T KOG0302|consen 338 -QPVATFKY--HKAPITSIEWHPHEDSVIAASGEDNQITIWDLSVEA 381 (440)
T ss_pred -CcceeEEe--ccCCeeEEEeccccCceEEeccCCCcEEEEEeeccC
Confidence 23444443 456799999997 46788999999999999997643
No 164
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=98.80 E-value=1.8e-07 Score=110.51 Aligned_cols=254 Identities=16% Similarity=0.141 Sum_probs=150.3
Q ss_pred CeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCcc
Q 001814 75 KQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHLG 154 (1010)
Q Consensus 75 ~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~ 154 (1010)
+..++-|....++|||-... .....+.+|.++|.|+.++|.... . +. . |+|+
T Consensus 25 ~~~vafGa~~~Iav~dp~k~-~i~t~l~GH~a~VnC~~~l~~s~~-------~---a~-~-vsG~--------------- 76 (764)
T KOG1063|consen 25 GGLVAFGAGPAIAVADPEKI-LIVTTLDGHVARVNCVHWLPTSEI-------V---AE-M-VSGD--------------- 76 (764)
T ss_pred cceEEecCCceEEEeCcccc-eeEEeccCCccceEEEEEcccccc-------c---ce-E-EEcc---------------
Confidence 45788888888999997543 355678899999999999987521 1 22 2 2332
Q ss_pred ccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEE--EEeCC-CcEEEEEEcCCeEEE-EeCCeEEEEECCCCceeEEEeec
Q 001814 155 GVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEH--VLRFR-SSVCMVRCSPRIVAV-GLATQIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 155 ~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~--tL~f~-S~V~sVa~S~rlLAV-~ld~~I~IwD~~Tle~l~tL~t~ 230 (1010)
+|++|++|-++....++ +++.+ ..+.+|...+....+ +.++.+++||...-+ +..+...
T Consensus 77 ----------------sD~~v~lW~l~~~~~~~i~~~~g~~~~~~cv~a~~~~~~~~~ad~~v~vw~~~~~e-~~~~~~~ 139 (764)
T KOG1063|consen 77 ----------------SDGRVILWKLRDEYLIKIYTIQGHCKECVCVVARSSVMTCKAADGTVSVWDKQQDE-VFLLAVL 139 (764)
T ss_pred ----------------CCCcEEEEEEeehheEEEEeecCcceeEEEEEeeeeEEEeeccCceEEEeecCCCc-eeeehhe
Confidence 36889999998544444 44443 356666665554444 677889999984433 1111110
Q ss_pred CCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccc
Q 001814 231 PVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLS 310 (1010)
Q Consensus 231 p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ 310 (1010)
.- . +. -.-|++ ||... ..+.++
T Consensus 140 rf------~---~k-~~ipLc-----L~~~~---------------------------~~~~~l---------------- 161 (764)
T KOG1063|consen 140 RF------E---IK-EAIPLC-----LAALK---------------------------NNKTFL---------------- 161 (764)
T ss_pred eh------h---hh-hHhhHH-----Hhhhc---------------------------cCCcEE----------------
Confidence 00 0 00 000111 11111 011111
Q ss_pred eeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECC--CCcEEEEeccCCCCeEEEEECCCCC---EEE
Q 001814 311 KTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFV--TRAIISQFKAHTSPISALCFDPSGT---LLV 385 (1010)
Q Consensus 311 ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~--s~~~v~~~~aHtspIsaLaFSPdGt---lLA 385 (1010)
++-++.+..|.|+--. +-+.+..+.+|+.-|..|+|..-|. +||
T Consensus 162 -------------------------------la~Ggs~~~v~~~s~~~d~f~~v~el~GH~DWIrsl~f~~~~~~~~~la 210 (764)
T KOG1063|consen 162 -------------------------------LACGGSKFVVDLYSSSADSFARVAELEGHTDWIRSLAFARLGGDDLLLA 210 (764)
T ss_pred -------------------------------EEecCcceEEEEeccCCcceeEEEEeeccchhhhhhhhhccCCCcEEEE
Confidence 1112233344444333 2346789999999999999997665 778
Q ss_pred EEEcCCCeEEEEeCCCCcccCC-----CCCCccccCCcceEEEE---------EecccccccEEEEEEccCCCEEEEEeC
Q 001814 386 TASVYGNNINIFRIMPSCMRSG-----SGNHKYDWNSSHVHLYK---------LHRGITSATIQDICFSHYSQWIAIVSS 451 (1010)
Q Consensus 386 TAS~dGt~IrVwdi~p~~~~~~-----sG~~~~~~~~s~~~L~~---------L~RG~t~a~I~sIAFSpDg~~LAsgS~ 451 (1010)
|+|. ++.||||.+....+-+. +-+..++ ......+.+ +.-||+ .+|+++-|.|++..|.++|.
T Consensus 211 S~SQ-D~yIRiW~i~~~~~~~~~~~e~~~t~~~~-~~~f~~l~~i~~~is~eall~GHe-DWV~sv~W~p~~~~LLSASa 287 (764)
T KOG1063|consen 211 SSSQ-DRYIRIWRIVLGDDEDSNEREDSLTTLSN-LPVFMILEEIQYRISFEALLMGHE-DWVYSVWWHPEGLDLLSASA 287 (764)
T ss_pred ecCC-ceEEEEEEEEecCCccccccccccccccC-CceeeeeeeEEEEEehhhhhcCcc-cceEEEEEccchhhheeccc
Confidence 8887 68899999865310000 0000011 111222222 224754 57999999999999999999
Q ss_pred CCeEEEEeCCCCCC
Q 001814 452 KGTCHVFVLSPFGG 465 (1010)
Q Consensus 452 dGTVhIw~I~~~gg 465 (1010)
|.|+.||.-....|
T Consensus 288 DksmiiW~pd~~tG 301 (764)
T KOG1063|consen 288 DKSMIIWKPDENTG 301 (764)
T ss_pred CcceEEEecCCccc
Confidence 99999998766533
No 165
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.80 E-value=3.6e-07 Score=99.52 Aligned_cols=175 Identities=18% Similarity=0.309 Sum_probs=114.1
Q ss_pred CCEEEEEeCCCCeE--EEE--EeCCCcEEEEEEcC---CeEE-EEeCCeEEEEECCCCceeEEEeecCCccccCCCcccc
Q 001814 172 PTAVRFYSFQSHCY--EHV--LRFRSSVCMVRCSP---RIVA-VGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGI 243 (1010)
Q Consensus 172 p~tVrIWDlktge~--V~t--L~f~S~V~sVa~S~---rlLA-V~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~v 243 (1010)
++|..|||+++|.. |+| +-+...|++|+|.+ +++| ||.|+.+++||++.++.-..+...|.|.
T Consensus 172 DTTCTiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaDGSvRmFDLR~leHSTIIYE~p~~~--------- 242 (364)
T KOG0290|consen 172 DTTCTIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGADGSVRMFDLRSLEHSTIIYEDPSPS--------- 242 (364)
T ss_pred cCeEEEEEEeeccccceeeEEEecCcceeEEEeccCccceEEEecCCCcEEEEEecccccceEEecCCCCC---------
Confidence 68999999999732 333 35667999999987 4555 6788899999999988655454444420
Q ss_pred ccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCC
Q 001814 244 NVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPD 323 (1010)
Q Consensus 244 nv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~ 323 (1010)
...+-|+ |+. |. | .|.
T Consensus 243 ---~pLlRLs-------------wnk-----qD-----------p--------------------------nym------ 258 (364)
T KOG0290|consen 243 ---TPLLRLS-------------WNK-----QD-----------P--------------------------NYM------ 258 (364)
T ss_pred ---Ccceeec-------------cCc-----CC-----------c--------------------------hHH------
Confidence 0001110 110 00 0 000
Q ss_pred CCCCCccCCCccccccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCC
Q 001814 324 GSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 324 gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p 401 (1010)
+ ..+.....|.|.|+... ..++.|+.|+..|+.++|-|.. ..|+||+. +...-|||+..
T Consensus 259 ---------------A---Tf~~dS~~V~iLDiR~P~tpva~L~~H~a~VNgIaWaPhS~~hictaGD-D~qaliWDl~q 319 (364)
T KOG0290|consen 259 ---------------A---TFAMDSNKVVILDIRVPCTPVARLRNHQASVNGIAWAPHSSSHICTAGD-DCQALIWDLQQ 319 (364)
T ss_pred ---------------h---hhhcCCceEEEEEecCCCcceehhhcCcccccceEecCCCCceeeecCC-cceEEEEeccc
Confidence 0 01234567999999764 5789999999999999999855 68999988 56788999964
Q ss_pred CcccCCCCCCccccCCcceEEEEEecccccccEEEEEEc-cCCCEEEEEeCC
Q 001814 402 SCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFS-HYSQWIAIVSSK 452 (1010)
Q Consensus 402 ~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFS-pDg~~LAsgS~d 452 (1010)
.+..+ + . ..-..|+ . .+.|..|.|+ ....|||++..+
T Consensus 320 ~~~~~--~---~----dPilay~--a---~~EVNqi~Ws~~~~Dwiai~~~k 357 (364)
T KOG0290|consen 320 MPREN--G---E----DPILAYT--A---GGEVNQIQWSSSQPDWIAICFGK 357 (364)
T ss_pred ccccC--C---C----Cchhhhh--c---cceeeeeeecccCCCEEEEEecC
Confidence 21100 1 0 0012333 2 2479999999 467899999866
No 166
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.80 E-value=1.2e-06 Score=97.58 Aligned_cols=110 Identities=9% Similarity=0.086 Sum_probs=72.6
Q ss_pred ccCCCCeEEEEECCCCcEEE-------EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccC
Q 001814 344 DMDNAGIVVVKDFVTRAIIS-------QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~-------~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
....++.|.|||+.+...+. .+... .....++|+|||++|+++...+..|.+|++.+. .|
T Consensus 143 ~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g-~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~-----~~------- 209 (330)
T PRK11028 143 PCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEG-AGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDP-----HG------- 209 (330)
T ss_pred eeCCCCEEEEEEECCCCcccccCCCceecCCC-CCCceEEECCCCCEEEEEecCCCEEEEEEEeCC-----CC-------
Confidence 34567899999998643221 22222 234679999999999999886778999999631 01
Q ss_pred CcceEEEEEecc---c-ccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCCCCCcc
Q 001814 417 SSHVHLYKLHRG---I-TSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 417 ~s~~~L~~L~RG---~-t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~~gg~~ 467 (1010)
....+.++... . .......|.|+||+++|+++.. +++|.||+++..++..
T Consensus 210 -~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~ 264 (330)
T PRK11028 210 -EIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSEDGSVL 264 (330)
T ss_pred -CEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeE
Confidence 11233333210 0 0112346999999999999854 6899999998765543
No 167
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=98.78 E-value=7.3e-07 Score=107.92 Aligned_cols=286 Identities=13% Similarity=0.122 Sum_probs=162.5
Q ss_pred CCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 73 ~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
.+.+.|++.+.+.++||.+ .++++...+..|..++..+.++|.+.. -.++.+++
T Consensus 26 nD~k~l~~~~~~~V~VyS~-~Tg~~i~~l~~~~a~l~s~~~~~~~~~----------~~~~~~~s--------------- 79 (792)
T KOG1963|consen 26 NDAKFLFLCTGNFVKVYST-ATGECITSLEDHTAPLTSVIVLPSSEN----------ANYLIVCS--------------- 79 (792)
T ss_pred cCCcEEEEeeCCEEEEEec-chHhhhhhcccccCccceeeecCCCcc----------ceEEEEEe---------------
Confidence 4778999999999999998 468888899999999999999987621 01333322
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCC-----eEEEEeCCeEEEEECCCCc-----
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPR-----IVAVGLATQIYCFDALTLE----- 222 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~r-----lLAV~ld~~I~IwD~~Tle----- 222 (1010)
.+++|++||...++.++++.-+-+|.+..+.+. .++....+...++.....+
T Consensus 80 ------------------l~G~I~vwd~~~~~Llkt~~~~~~v~~~~~~~~~a~~s~~~~~s~~~~~~~~~~s~~~~~q~ 141 (792)
T KOG1963|consen 80 ------------------LDGTIRVWDWSDGELLKTFDNNLPVHALVYKPAQADISANVYVSVEDYSILTTFSKKLSKQS 141 (792)
T ss_pred ------------------cCccEEEecCCCcEEEEEEecCCceeEEEechhHhCccceeEeecccceeeeecccccccce
Confidence 247899999999999999988878777666442 1221111222222211111
Q ss_pred eeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhh
Q 001814 223 NKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHS 302 (1010)
Q Consensus 223 ~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dss 302 (1010)
..+.+.+.... ..+... .... |+-|++..... ...++|.-.+.+
T Consensus 142 ~~~~~~t~~~~------~~d~~~---~~~~-~~~I~~~~~ge--------------------------~~~i~~~~~~~~ 185 (792)
T KOG1963|consen 142 SRFVLATFDSA------KGDFLK---EHQE-PKSIVDNNSGE--------------------------FKGIVHMCKIHI 185 (792)
T ss_pred eeeEeeecccc------chhhhh---hhcC-CccEEEcCCce--------------------------EEEEEEeeeEEE
Confidence 11111111110 000000 0000 23333332210 000111000000
Q ss_pred hhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC--C--cEEEEeccCCCCeEEEEEC
Q 001814 303 KQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT--R--AIISQFKAHTSPISALCFD 378 (1010)
Q Consensus 303 k~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s--~--~~v~~~~aHtspIsaLaFS 378 (1010)
+.+..+- +....+...+.-.-.......++... .++.++.||.|.||.--. + .....|.=|..+|.+|+||
T Consensus 186 ~~v~~~~-~~~~~~~~~~~Htf~~t~~~~spn~~----~~Aa~d~dGrI~vw~d~~~~~~~~t~t~lHWH~~~V~~L~fS 260 (792)
T KOG1963|consen 186 YFVPKHT-KHTSSRDITVHHTFNITCVALSPNER----YLAAGDSDGRILVWRDFGSSDDSETCTLLHWHHDEVNSLSFS 260 (792)
T ss_pred EEecccc-eeeccchhhhhhcccceeEEeccccc----eEEEeccCCcEEEEeccccccccccceEEEecccccceeEEe
Confidence 0000000 00000000000000000111122222 345788899999997543 2 2456778899999999999
Q ss_pred CCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 379 PSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 379 PdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
+||.+|.||+..| ++-+|++.+ + ..+.|-+| .+.|..+.+|||+...+....|..||+-
T Consensus 261 ~~G~~LlSGG~E~-VLv~Wq~~T-------~--------~kqfLPRL-----gs~I~~i~vS~ds~~~sl~~~DNqI~li 319 (792)
T KOG1963|consen 261 SDGAYLLSGGREG-VLVLWQLET-------G--------KKQFLPRL-----GSPILHIVVSPDSDLYSLVLEDNQIHLI 319 (792)
T ss_pred cCCceEeecccce-EEEEEeecC-------C--------Cccccccc-----CCeeEEEEEcCCCCeEEEEecCceEEEE
Confidence 9999999999966 688999864 2 11233333 3679999999999999999999999998
Q ss_pred eCCCCC
Q 001814 459 VLSPFG 464 (1010)
Q Consensus 459 ~I~~~g 464 (1010)
......
T Consensus 320 ~~~dl~ 325 (792)
T KOG1963|consen 320 KASDLE 325 (792)
T ss_pred eccchh
Confidence 875543
No 168
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.76 E-value=2.4e-08 Score=123.06 Aligned_cols=103 Identities=17% Similarity=0.176 Sum_probs=80.2
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCC--CeEEEEECCCC-CEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccC
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTS--PISALCFDPSG-TLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHts--pIsaLaFSPdG-tlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
+++++.+|.+.|||+..++.+..|.-|.. -++.|+|+||+ +.|++|+.|++ +|.+||+.-.
T Consensus 177 LAS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~a-------------- 242 (1049)
T KOG0307|consen 177 LASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFA-------------- 242 (1049)
T ss_pred hhccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCCceeeeeecCCCCCceeEeeccccc--------------
Confidence 34667889999999999988877776654 47889999998 68888888765 5999998531
Q ss_pred CcceEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCC
Q 001814 417 SSHVHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 417 ~s~~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~ 462 (1010)
. ..+.+| +||.. .|.+|+|++.+ ++|++++.|+.|.+|+.+.
T Consensus 243 s--sP~k~~-~~H~~-GilslsWc~~D~~lllSsgkD~~ii~wN~~t 285 (1049)
T KOG0307|consen 243 S--SPLKIL-EGHQR-GILSLSWCPQDPRLLLSSGKDNRIICWNPNT 285 (1049)
T ss_pred C--Cchhhh-ccccc-ceeeeccCCCCchhhhcccCCCCeeEecCCC
Confidence 0 134444 45443 48999999866 9999999999999999876
No 169
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=98.76 E-value=1.1e-07 Score=105.37 Aligned_cols=102 Identities=21% Similarity=0.278 Sum_probs=76.5
Q ss_pred CCCeEEEEECCCCcE-EE-EeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 347 NAGIVVVKDFVTRAI-IS-QFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~-v~-~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
.+-.|.+||+...+. +. -+..|..-|+.|+|.|+. .+|+|||.|| .++|||+.-.. ..-.++.
T Consensus 141 s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrFHP~~pnlLlSGSvDG-LvnlfD~~~d~-------------EeDaL~~ 206 (376)
T KOG1188|consen 141 SDASVVLWDVRSEQQLLRQLNESHNDDVTQLRFHPSDPNLLLSGSVDG-LVNLFDTKKDN-------------EEDALLH 206 (376)
T ss_pred CceEEEEEEeccccchhhhhhhhccCcceeEEecCCCCCeEEeecccc-eEEeeecCCCc-------------chhhHHH
Confidence 355799999988664 44 346899999999999977 7999999977 59999996320 0012333
Q ss_pred EEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCCCC
Q 001814 424 KLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.+- +.+.|-.+.|..++ +.|.+-+..+|..+|+++....
T Consensus 207 viN---~~sSI~~igw~~~~ykrI~clTH~Etf~~~ele~~~~ 246 (376)
T KOG1188|consen 207 VIN---HGSSIHLIGWLSKKYKRIMCLTHMETFAIYELEDGSE 246 (376)
T ss_pred hhc---ccceeeeeeeecCCcceEEEEEccCceeEEEccCCCh
Confidence 332 22469999999888 5688999999999999987653
No 170
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=98.73 E-value=4.2e-08 Score=107.99 Aligned_cols=101 Identities=20% Similarity=0.327 Sum_probs=86.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+.++.-|.|+|.|+.+++....+++|...|+.|.|.|+. +||++||. ++.||+|+|... .+
T Consensus 109 a~~G~~GvIrVid~~~~~~~~~~~ghG~sINeik~~p~~~qlvls~Sk-D~svRlwnI~~~-----------------~C 170 (385)
T KOG1034|consen 109 AAGGYLGVIRVIDVVSGQCSKNYRGHGGSINEIKFHPDRPQLVLSASK-DHSVRLWNIQTD-----------------VC 170 (385)
T ss_pred EeecceeEEEEEecchhhhccceeccCccchhhhcCCCCCcEEEEecC-CceEEEEeccCC-----------------eE
Confidence 345688999999999999999999999999999999987 68899998 678999999742 45
Q ss_pred EEEEe--cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 422 LYKLH--RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 422 L~~L~--RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+..|- .| |+..|.++.|++||.+||+++.|.++.+|+|+.
T Consensus 171 v~VfGG~eg-HrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~~ 212 (385)
T KOG1034|consen 171 VAVFGGVEG-HRDEVLSVDFSLDGDRIASCGMDHSLKLWRLNV 212 (385)
T ss_pred EEEeccccc-ccCcEEEEEEcCCCCeeeccCCcceEEEEecCh
Confidence 55552 23 345799999999999999999999999999984
No 171
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.72 E-value=1.9e-07 Score=99.77 Aligned_cols=107 Identities=21% Similarity=0.415 Sum_probs=87.5
Q ss_pred ccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEECC--CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccC
Q 001814 342 GADMDNAGIVVVKDFVTR---AIISQFKAHTSPISALCFDP--SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~---~~v~~~~aHtspIsaLaFSP--dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
+++++.|++|+||+..+. ..+.+|.+|.+||..++|-. -|++||+||-||+ +.||.-.. | .|+
T Consensus 26 lATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgk-VIiWke~~-------g----~w~ 93 (299)
T KOG1332|consen 26 LATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGK-VIIWKEEN-------G----RWT 93 (299)
T ss_pred eeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecccccCcEeeEeecCce-EEEEecCC-------C----chh
Confidence 468899999999999874 46889999999999999987 8999999999887 66998642 2 243
Q ss_pred CcceEEEEEecccccccEEEEEEccC--CCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 417 SSHVHLYKLHRGITSATIQDICFSHY--SQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 417 ~s~~~L~~L~RG~t~a~I~sIAFSpD--g~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
+ +|+ +..+.+.|.+|+|.|. |-.||++|+||+|.|+.....|+-
T Consensus 94 k----~~e--~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~w 139 (299)
T KOG1332|consen 94 K----AYE--HAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGW 139 (299)
T ss_pred h----hhh--hhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCCCc
Confidence 3 333 2335678999999986 689999999999999999887554
No 172
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.70 E-value=1.1e-05 Score=90.65 Aligned_cols=197 Identities=17% Similarity=0.266 Sum_probs=125.7
Q ss_pred EEEEEeCCCCeEEEEEeCC----CcEEEEEEcCC--eEEEE---eCCeEEEEECCCCceeEEEeecCCccccCCCccccc
Q 001814 174 AVRFYSFQSHCYEHVLRFR----SSVCMVRCSPR--IVAVG---LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGIN 244 (1010)
Q Consensus 174 tVrIWDlktge~V~tL~f~----S~V~sVa~S~r--lLAV~---ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vn 244 (1010)
.+.|||+++-+.+|+|+-. ..+.++..|.. .||.= ..+.|++||+.+++..-++..|..+
T Consensus 107 ~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~----------- 175 (391)
T KOG2110|consen 107 SIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGP----------- 175 (391)
T ss_pred cEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCc-----------
Confidence 4999999999999999653 24677777764 77752 2457999999999988888877552
Q ss_pred cCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCC
Q 001814 245 VGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDG 324 (1010)
Q Consensus 245 v~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~g 324 (1010)
.-.+| |..+ |.+
T Consensus 176 --lAala-------fs~~----------------------------G~l------------------------------- 187 (391)
T KOG2110|consen 176 --LAALA-------FSPD----------------------------GTL------------------------------- 187 (391)
T ss_pred --eeEEE-------ECCC----------------------------CCE-------------------------------
Confidence 12233 3321 111
Q ss_pred CCCCccCCCccccccccccccCCCCe-EEEEECCCCcEEEEeccCCC--CeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 325 SSSPVSPNSVWKVGRHAGADMDNAGI-VVVKDFVTRAIISQFKAHTS--PISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 325 s~s~~S~s~~~k~~~~~iasgs~dG~-V~VwDl~s~~~v~~~~aHts--pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
+|+++..|+ |+||++.+++.+.+|+--+. .|.+|+|+||+++|+..|..++ |+||.+..
T Consensus 188 -----------------lATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeT-VHiFKL~~ 249 (391)
T KOG2110|consen 188 -----------------LATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTET-VHIFKLEK 249 (391)
T ss_pred -----------------EEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCe-EEEEEecc
Confidence 123445555 89999999999999986554 5779999999999998888666 99999864
Q ss_pred Cccc-C---CCCC-CccccC--------CcceEEEEEeccccc-----ccE-EEEEEc--cCCCEEEEEeCCCeEEEEeC
Q 001814 402 SCMR-S---GSGN-HKYDWN--------SSHVHLYKLHRGITS-----ATI-QDICFS--HYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 402 ~~~~-~---~sG~-~~~~~~--------~s~~~L~~L~RG~t~-----a~I-~sIAFS--pDg~~LAsgS~dGTVhIw~I 460 (1010)
.... . ..+. ....|. +.+.......|-+-. +.. ..++|+ ++...+.+++.||....|.+
T Consensus 250 ~~~~~~~~p~~~~~~~~~~sk~~~sylps~V~~~~~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~~y~y~l 329 (391)
T KOG2110|consen 250 VSNNPPESPTAGTSWFGKVSKAATSYLPSQVSSVLDQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGHLYSYRL 329 (391)
T ss_pred cccCCCCCCCCCCcccchhhhhhhhhcchhhhhhhhhccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCeEEEEEc
Confidence 2110 0 0000 000000 011111111111111 111 234455 57889999999999999999
Q ss_pred CCC-CCcc
Q 001814 461 SPF-GGDS 467 (1010)
Q Consensus 461 ~~~-gg~~ 467 (1010)
++. ||+.
T Consensus 330 ~~~~gGec 337 (391)
T KOG2110|consen 330 PPKEGGEC 337 (391)
T ss_pred CCCCCcee
Confidence 985 5554
No 173
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.69 E-value=4.6e-06 Score=97.24 Aligned_cols=95 Identities=12% Similarity=0.113 Sum_probs=58.0
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.++|+.+++.. .+..+......++|||||++||..+.++ ..|.+|++.. | .+..+..
T Consensus 317 ~iy~~dl~~g~~~-~lt~~g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~-------g-----------~~~~Lt~ 377 (433)
T PRK04922 317 QIYRVAASGGSAE-RLTFQGNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLST-------G-----------SVRTLTP 377 (433)
T ss_pred eEEEEECCCCCeE-EeecCCCCccCEEECCCCCEEEEEECCCCceeEEEEECCC-------C-----------CeEECCC
Confidence 3555666555432 2222223445689999999998876543 2588888742 2 1223333
Q ss_pred ccccccEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCCc
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGGD 466 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg~ 466 (1010)
+. .....+|+|||++|+..+.+ |.-+||.++..|+.
T Consensus 378 ~~---~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g~~ 414 (433)
T PRK04922 378 GS---LDESPSFAPNGSMVLYATREGGRGVLAAVSTDGRV 414 (433)
T ss_pred CC---CCCCceECCCCCEEEEEEecCCceEEEEEECCCCc
Confidence 31 24567999999999888774 45566666655543
No 174
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.69 E-value=1.4e-06 Score=104.39 Aligned_cols=97 Identities=14% Similarity=0.246 Sum_probs=76.0
Q ss_pred CCeEEEEECC-CCcEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 348 AGIVVVKDFV-TRAIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 348 dG~V~VwDl~-s~~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
|-+|+||.-. ....+..+.-+...|.+++|||-- ..+|++..+|+ |.|||+.-.. ...+.+.
T Consensus 419 DW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~-l~iWDLl~~~---------------~~Pv~s~ 482 (555)
T KOG1587|consen 419 DWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGN-LDIWDLLQDD---------------EEPVLSQ 482 (555)
T ss_pred cceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCc-eehhhhhccc---------------cCCcccc
Confidence 9999999998 667778888899999999999976 57888888776 9999996321 0122222
Q ss_pred ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
..+ ......+.|+++|+.||+|...|++|+|+|.+
T Consensus 483 ~~~--~~~l~~~~~s~~g~~lavGd~~G~~~~~~l~~ 517 (555)
T KOG1587|consen 483 KVC--SPALTRVRWSPNGKLLAVGDANGTTHILKLSE 517 (555)
T ss_pred ccc--ccccceeecCCCCcEEEEecCCCcEEEEEcCc
Confidence 222 23467788999999999999999999999964
No 175
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.68 E-value=1.1e-05 Score=89.08 Aligned_cols=119 Identities=18% Similarity=0.266 Sum_probs=79.3
Q ss_pred cccCCCCe-EEEEECCCCcEEEEeccC--CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcc-cC--CCCCCccc-c
Q 001814 343 ADMDNAGI-VVVKDFVTRAIISQFKAH--TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCM-RS--GSGNHKYD-W 415 (1010)
Q Consensus 343 asgs~dG~-V~VwDl~s~~~v~~~~aH--tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~-~~--~sG~~~~~-~ 415 (1010)
|+++..|+ |+|||..+|..+..|+-- ...|.+|+||||+.+||.+|.+|| ++||.+.+... .. ++- +... |
T Consensus 197 ATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdKgT-lHiF~l~~~~~~~~~~SSl-~~~~~~ 274 (346)
T KOG2111|consen 197 ATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSSDKGT-LHIFSLRDTENTEDESSSL-SFKRLV 274 (346)
T ss_pred EEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEcCCCe-EEEEEeecCCCCccccccc-cccccc
Confidence 35666776 999999999999999843 357999999999999999999987 99999975311 00 000 0000 0
Q ss_pred C----CcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 416 N----SSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 416 ~----~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
- .+.+-+.+++ .......-++|-.+.+-|++...||+-+=+.+.+..+
T Consensus 275 lpky~~S~wS~~~f~--l~~~~~~~~~fg~~~nsvi~i~~Dgsy~k~~f~~~~~ 326 (346)
T KOG2111|consen 275 LPKYFSSEWSFAKFQ--LPQGTQCIIAFGSETNTVIAICADGSYYKFKFDPKNG 326 (346)
T ss_pred cchhcccceeEEEEE--ccCCCcEEEEecCCCCeEEEEEeCCcEEEEEeccccc
Confidence 0 0011111221 0122356688998878888888899988888877633
No 176
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.68 E-value=9.7e-06 Score=94.54 Aligned_cols=73 Identities=18% Similarity=0.276 Sum_probs=48.4
Q ss_pred eEEEEECCCCCEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 372 ISALCFDPSGTLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
....+|||||++||.++.++. .|.+|++.. | ....+..+ ......+|+|||++|+.+
T Consensus 330 ~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~-------g-----------~~~~lt~~---~~~~~p~~spdg~~l~~~ 388 (427)
T PRK02889 330 NTSPRISPDGKLLAYISRVGGAFKLYVQDLAT-------G-----------QVTALTDT---TRDESPSFAPNGRYILYA 388 (427)
T ss_pred cCceEECCCCCEEEEEEccCCcEEEEEEECCC-------C-----------CeEEccCC---CCccCceECCCCCEEEEE
Confidence 345789999999998776543 588888743 2 11223222 224678999999999998
Q ss_pred eCCC-eEEEEeCCCCCC
Q 001814 450 SSKG-TCHVFVLSPFGG 465 (1010)
Q Consensus 450 S~dG-TVhIw~I~~~gg 465 (1010)
+.++ .-.||-++..|.
T Consensus 389 ~~~~g~~~l~~~~~~g~ 405 (427)
T PRK02889 389 TQQGGRSVLAAVSSDGR 405 (427)
T ss_pred EecCCCEEEEEEECCCC
Confidence 8654 445666665444
No 177
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=98.66 E-value=6.7e-07 Score=105.13 Aligned_cols=104 Identities=13% Similarity=0.097 Sum_probs=68.4
Q ss_pred cccCCCCeEEEEECCCCc--EEEEeccCCCC--eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 343 ADMDNAGIVVVKDFVTRA--IISQFKAHTSP--ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~--~v~~~~aHtsp--IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
+++ .|+.|..|++.+.. .++.+.+|... -..-..+|||.+|++++.+++ ..+|.+... |.
T Consensus 288 AsC-tD~sIy~ynm~s~s~sP~~~~sg~~~~sf~vks~lSpd~~~l~SgSsd~~-ayiw~vs~~------------e~-- 351 (720)
T KOG0321|consen 288 ASC-TDNSIYFYNMRSLSISPVAEFSGKLNSSFYVKSELSPDDCSLLSGSSDEQ-AYIWVVSSP------------EA-- 351 (720)
T ss_pred EEe-cCCcEEEEeccccCcCchhhccCcccceeeeeeecCCCCceEeccCCCcc-eeeeeecCc------------cC--
Confidence 344 49999999998653 34455554221 112357999999999999776 789998531 00
Q ss_pred ceEEEEEecccccccEEEEEEccC-CCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 419 HVHLYKLHRGITSATIQDICFSHY-SQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSpD-g~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
.. .+.-|++ -.|..++|.|- -.-+|++|+|-++.||+|..+..+
T Consensus 352 --~~-~~l~Ght-~eVt~V~w~pS~~t~v~TcSdD~~~kiW~l~~~l~e 396 (720)
T KOG0321|consen 352 --PP-ALLLGHT-REVTTVRWLPSATTPVATCSDDFRVKIWRLSNGLEE 396 (720)
T ss_pred --Ch-hhhhCcc-eEEEEEeeccccCCCceeeccCcceEEEeccCchhh
Confidence 11 1224554 35888999653 234556699999999999765544
No 178
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=98.65 E-value=9.8e-08 Score=107.42 Aligned_cols=138 Identities=15% Similarity=0.151 Sum_probs=97.5
Q ss_pred ccccCCCCeEEEEECCCC---------cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCc
Q 001814 342 GADMDNAGIVVVKDFVTR---------AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHK 412 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~---------~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~ 412 (1010)
+++++.|..|+||-+... +-+..|..|+..|+.+.|+|+|.+||+|+.+| .|.+|......... ....
T Consensus 29 laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g-~v~lWk~~~~~~~~--~d~e 105 (434)
T KOG1009|consen 29 LATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGG-EVFLWKQGDVRIFD--ADTE 105 (434)
T ss_pred eecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCc-eEEEEEecCcCCcc--ccch
Confidence 457888999999998642 13467788999999999999999999999855 58899865210000 0001
Q ss_pred cccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCC
Q 001814 413 YDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFP 483 (1010)
Q Consensus 413 ~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~p 483 (1010)
.+.....+.+.+..|| +...|++++|+||++++++++.|.++.+|++...........|..-+-+-.+.|
T Consensus 106 ~~~~ke~w~v~k~lr~-h~~diydL~Ws~d~~~l~s~s~dns~~l~Dv~~G~l~~~~~dh~~yvqgvawDp 175 (434)
T KOG1009|consen 106 ADLNKEKWVVKKVLRG-HRDDIYDLAWSPDSNFLVSGSVDNSVRLWDVHAGQLLAILDDHEHYVQGVAWDP 175 (434)
T ss_pred hhhCccceEEEEEecc-cccchhhhhccCCCceeeeeeccceEEEEEeccceeEeeccccccccceeecch
Confidence 1112223455555677 456899999999999999999999999999987655555556654444433444
No 179
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=98.65 E-value=1e-07 Score=104.58 Aligned_cols=127 Identities=21% Similarity=0.273 Sum_probs=82.8
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc--
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH-- 419 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~-- 419 (1010)
++.|..+|.|.|||+.+..+-..|.||..||.+||||+||++|+|+|. +..|.+||+.....--.-.-.++-|....
T Consensus 38 lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~dgr~LltsS~-D~si~lwDl~~gs~l~rirf~spv~~~q~hp 116 (405)
T KOG1273|consen 38 LAVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCWSRDGRKLLTSSR-DWSIKLWDLLKGSPLKRIRFDSPVWGAQWHP 116 (405)
T ss_pred eeeeccCCcEEEEEccccchhhhhhccccceeEEEecCCCCEeeeecC-CceeEEEeccCCCceeEEEccCccceeeecc
Confidence 456788999999999999988999999999999999999999999999 57799999863100000000000111100
Q ss_pred ----eEEE----------EEecccc-----------cccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcccc
Q 001814 420 ----VHLY----------KLHRGIT-----------SATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGF 469 (1010)
Q Consensus 420 ----~~L~----------~L~RG~t-----------~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~ 469 (1010)
..+. .+.-+.+ +..-....|.+-|++|.+|+.+|-++|++.+..+....+
T Consensus 117 ~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGtsKGkllv~~a~t~e~vas~ 191 (405)
T KOG1273|consen 117 RKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTSKGKLLVYDAETLECVASF 191 (405)
T ss_pred ccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCCCEEEEecCcceEEEEecchheeeeee
Confidence 0010 1100000 000112237777999999999999999998887654433
No 180
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.64 E-value=4.8e-07 Score=102.35 Aligned_cols=74 Identities=23% Similarity=0.326 Sum_probs=62.2
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
..|++|+.|+||++||-++.+|. |-|+++.. .+.+|-.++. |...|+++.|+||.++++..
T Consensus 282 ~siSsl~VS~dGkf~AlGT~dGs-Vai~~~~~-----------------lq~~~~vk~a-H~~~VT~ltF~Pdsr~~~sv 342 (398)
T KOG0771|consen 282 KSISSLAVSDDGKFLALGTMDGS-VAIYDAKS-----------------LQRLQYVKEA-HLGFVTGLTFSPDSRYLASV 342 (398)
T ss_pred CcceeEEEcCCCcEEEEeccCCc-EEEEEece-----------------eeeeEeehhh-heeeeeeEEEcCCcCccccc
Confidence 57999999999999999999876 78998753 1455555554 44579999999999999999
Q ss_pred eCCCeEEEEeCCC
Q 001814 450 SSKGTCHVFVLSP 462 (1010)
Q Consensus 450 S~dGTVhIw~I~~ 462 (1010)
|.+.+++|..|.-
T Consensus 343 Ss~~~~~v~~l~v 355 (398)
T KOG0771|consen 343 SSDNEAAVTKLAV 355 (398)
T ss_pred ccCCceeEEEEee
Confidence 9999999999875
No 181
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.64 E-value=9.9e-08 Score=114.25 Aligned_cols=281 Identities=17% Similarity=0.203 Sum_probs=177.0
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
+.-+|..-|+.|.||+ .++.+++||++- ++||..+ ++.+...+.+|.|-++.+++.-+.
T Consensus 185 rLlgH~naVyca~fDr-------tg~~Iitgsdd~lvKiwS~e-t~~~lAs~rGhs~ditdlavs~~n------------ 244 (1113)
T KOG0644|consen 185 RLLGHRNAVYCAIFDR-------TGRYIITGSDDRLVKIWSME-TARCLASCRGHSGDITDLAVSSNN------------ 244 (1113)
T ss_pred HHHhhhhheeeeeecc-------ccceEeecCccceeeeeecc-chhhhccCCCCccccchhccchhh------------
Confidence 4567888999999997 567899999987 7999974 566777888899999988865221
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCCeEEEE
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPRIVAVG 208 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~rlLAV~ 208 (1010)
-++|. ++ .+..||+|-+.++..|..|..+ +.|.+|+|+|+. +.+
T Consensus 245 -~~iaa-----------------------aS----------~D~vIrvWrl~~~~pvsvLrghtgavtaiafsP~~-sss 289 (1113)
T KOG0644|consen 245 -TMIAA-----------------------AS----------NDKVIRVWRLPDGAPVSVLRGHTGAVTAIAFSPRA-SSS 289 (1113)
T ss_pred -hhhhh-----------------------cc----------cCceEEEEecCCCchHHHHhccccceeeeccCccc-cCC
Confidence 12221 11 2588999999999999999766 699999999965 666
Q ss_pred eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCC
Q 001814 209 LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSP 288 (1010)
Q Consensus 209 ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP 288 (1010)
.++++++||.+ ++.+. ..|.|... .+.. -.-.+-+...|-+|....- .+.. +.
T Consensus 290 ~dgt~~~wd~r-~~~~~---y~prp~~~----~~~~-~~~s~~~~~~~~~f~Tgs~----d~ea--~n------------ 342 (1113)
T KOG0644|consen 290 DDGTCRIWDAR-LEPRI---YVPRPLKF----TEKD-LVDSILFENNGDRFLTGSR----DGEA--RN------------ 342 (1113)
T ss_pred CCCceEecccc-ccccc---cCCCCCCc----cccc-ceeeeeccccccccccccC----Cccc--cc------------
Confidence 77899999988 32222 22333100 0000 0011112223333332110 0000 00
Q ss_pred CCCceEEEeehhhhhhhhccccee-eccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEecc
Q 001814 289 GGSSLVARYAMEHSKQFAAGLSKT-LSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKA 367 (1010)
Q Consensus 289 ~~gslVa~~A~dssk~la~Gi~kt-ls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~a 367 (1010)
+++.....+. +-.|.+ .++ + ..+ ...++-.+-.+.+|.+.++.+++.+.+
T Consensus 343 --------------~e~~~l~~~~~~lif~t-----~ss-------d--~~~-~~~~ar~~~~~~vwnl~~g~l~H~l~g 393 (1113)
T KOG0644|consen 343 --------------HEFEQLAWRSNLLIFVT-----RSS-------D--LSS-IVVTARNDHRLCVWNLYTGQLLHNLMG 393 (1113)
T ss_pred --------------chhhHhhhhccceEEEe-----ccc-------c--ccc-cceeeeeeeEeeeeecccchhhhhhcc
Confidence 0000000000 000000 000 0 000 012345677889999999999999999
Q ss_pred CCCCeEEEEECCCCCEEE-EEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEE
Q 001814 368 HTSPISALCFDPSGTLLV-TASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWI 446 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLA-TAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~L 446 (1010)
|..++..|.|.|=...++ +|+.||. +.|||+.. | ...+.|. .| ...+.+-+||+||+.+
T Consensus 394 hsd~~yvLd~Hpfn~ri~msag~dgs-t~iwdi~e-------g--------~pik~y~--~g--h~kl~d~kFSqdgts~ 453 (1113)
T KOG0644|consen 394 HSDEVYVLDVHPFNPRIAMSAGYDGS-TIIWDIWE-------G--------IPIKHYF--IG--HGKLVDGKFSQDGTSI 453 (1113)
T ss_pred cccceeeeeecCCCcHhhhhccCCCc-eEeeeccc-------C--------Ccceeee--cc--cceeeccccCCCCceE
Confidence 999999999999877654 6777776 67999953 3 1123333 45 2468889999999999
Q ss_pred EEEeCCCeEEEEeCCC
Q 001814 447 AIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 447 AsgS~dGTVhIw~I~~ 462 (1010)
+..-+-|.+.|+....
T Consensus 454 ~lsd~hgql~i~g~gq 469 (1113)
T KOG0644|consen 454 ALSDDHGQLYILGTGQ 469 (1113)
T ss_pred ecCCCCCceEEeccCC
Confidence 9998889888876543
No 182
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=98.63 E-value=2.1e-06 Score=104.09 Aligned_cols=231 Identities=16% Similarity=0.097 Sum_probs=131.2
Q ss_pred CEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCC------eEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCcccccc
Q 001814 173 TAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPR------IVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINV 245 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~-S~V~sVa~S~r------lLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv 245 (1010)
++|+||+..||++++.|.++ .++..+.+.++ .++..+++.|++||-...+++.++...--+
T Consensus 37 ~~V~VyS~~Tg~~i~~l~~~~a~l~s~~~~~~~~~~~~~~~~sl~G~I~vwd~~~~~Llkt~~~~~~v------------ 104 (792)
T KOG1963|consen 37 NFVKVYSTATGECITSLEDHTAPLTSVIVLPSSENANYLIVCSLDGTIRVWDWSDGELLKTFDNNLPV------------ 104 (792)
T ss_pred CEEEEEecchHhhhhhcccccCccceeeecCCCccceEEEEEecCccEEEecCCCcEEEEEEecCCce------------
Confidence 68999999999999999877 47888877662 235577899999999999888877642110
Q ss_pred CccceEEcc-----ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccc
Q 001814 246 GYGPMAVGP-----RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQEL 320 (1010)
Q Consensus 246 ~~gplAlgp-----RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l 320 (1010)
..+-+.| +..+|.+... ....+.. + .+..++...+..-++.....++
T Consensus 105 --~~~~~~~~~a~~s~~~~~s~~~------~~~~~~~---------s-----------~~~~~q~~~~~~~t~~~~~~d~ 156 (792)
T KOG1963|consen 105 --HALVYKPAQADISANVYVSVED------YSILTTF---------S-----------KKLSKQSSRFVLATFDSAKGDF 156 (792)
T ss_pred --eEEEechhHhCccceeEeeccc------ceeeeec---------c-----------cccccceeeeEeeeccccchhh
Confidence 0011101 1122221100 0000000 0 0001111111110100000000
Q ss_pred cCCC-CCCCccCCCccccccccccccCCCCeEEEEECCCCcEEE----EeccCCCCeEEEEECCCCCEEEEEEcCCCeEE
Q 001814 321 LPDG-SSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIIS----QFKAHTSPISALCFDPSGTLLVTASVYGNNIN 395 (1010)
Q Consensus 321 ~p~g-s~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~----~~~aHtspIsaLaFSPdGtlLATAS~dGt~Ir 395 (1010)
.-.- -..++..++.+. +...-.+..+.+|+..++.... .=.-|+.++.+.+|||+|.++|++..+|+ |.
T Consensus 157 ~~~~~~~~~I~~~~~ge-----~~~i~~~~~~~~~~v~~~~~~~~~~~~~~~Htf~~t~~~~spn~~~~Aa~d~dGr-I~ 230 (792)
T KOG1963|consen 157 LKEHQEPKSIVDNNSGE-----FKGIVHMCKIHIYFVPKHTKHTSSRDITVHHTFNITCVALSPNERYLAAGDSDGR-IL 230 (792)
T ss_pred hhhhcCCccEEEcCCce-----EEEEEEeeeEEEEEecccceeeccchhhhhhcccceeEEeccccceEEEeccCCc-EE
Confidence 0000 001122222111 1111245567888887754111 11348888999999999999999999998 89
Q ss_pred EEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 396 IFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 396 Vwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
||+-.-. ++. + .....|++ |.+.|.+++||+||.+|.+|+..|-.-+|.+...+
T Consensus 231 vw~d~~~-----~~~-----~---~t~t~lHW--H~~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~ 284 (792)
T KOG1963|consen 231 VWRDFGS-----SDD-----S---ETCTLLHW--HHDEVNSLSFSSDGAYLLSGGREGVLVLWQLETGK 284 (792)
T ss_pred EEecccc-----ccc-----c---ccceEEEe--cccccceeEEecCCceEeecccceEEEEEeecCCC
Confidence 9974210 011 1 11223445 34579999999999999999999999999998866
No 183
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.63 E-value=2.4e-06 Score=89.73 Aligned_cols=52 Identities=23% Similarity=0.333 Sum_probs=42.8
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC-----CCeEEEEeCC
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY-----GNNINIFRIM 400 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d-----Gt~IrVwdi~ 400 (1010)
..|.|.+||+.+.+.+..+... .++.++|||||++|+|++.. +..++||+..
T Consensus 123 ~~G~l~~wd~~~~~~i~~~~~~--~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 123 LNGDLEFWDVRKKKKISTFEHS--DATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred CCcEEEEEECCCCEEeeccccC--cEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 4588999999998888777643 47899999999999999863 4568999984
No 184
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.58 E-value=6.2e-07 Score=104.90 Aligned_cols=101 Identities=14% Similarity=0.246 Sum_probs=75.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
+.+.+|-+|++||+.+.+....|.+|+..|..++|||||+++||...||+ ||||+-.. + -+.+
T Consensus 694 a~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~-~rVy~Prs-------~---------e~pv 756 (1012)
T KOG1445|consen 694 AVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGT-LRVYEPRS-------R---------EQPV 756 (1012)
T ss_pred hhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcceeeeecCce-EEEeCCCC-------C---------CCcc
Confidence 35678999999999999998999999999999999999999999999887 99998542 1 1344
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeCCCe----EEEEeCC
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSSKGT----CHVFVLS 461 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGT----VhIw~I~ 461 (1010)
|+-. |....+=-.|.|--||++|.+.+.|.. |.+|+-.
T Consensus 757 ~Eg~-gpvgtRgARi~wacdgr~viv~Gfdk~SeRQv~~Y~Aq 798 (1012)
T KOG1445|consen 757 YEGK-GPVGTRGARILWACDGRIVIVVGFDKSSERQVQMYDAQ 798 (1012)
T ss_pred ccCC-CCccCcceeEEEEecCcEEEEecccccchhhhhhhhhh
Confidence 4421 111111123668889999999887753 4455443
No 185
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=98.58 E-value=1.3e-07 Score=105.55 Aligned_cols=122 Identities=19% Similarity=0.305 Sum_probs=92.9
Q ss_pred cccCCCCeEEEEECCCCcEE---EEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 343 ADMDNAGIVVVKDFVTRAII---SQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v---~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
.+|+..+.|++|...++.-. .-|.+|+..|-.|+|||.- +.|||||.||+ |||||+.. |. .
T Consensus 228 lsGDc~~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~Dgs-IrIWDiRs-------~~-------~ 292 (440)
T KOG0302|consen 228 LSGDCVKGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGS-IRIWDIRS-------GP-------K 292 (440)
T ss_pred ccCccccceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecCce-EEEEEecC-------CC-------c
Confidence 46778889999999987632 3567899999999999976 58999999776 99999963 20 0
Q ss_pred ceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc---ccccccCCCCCCccCCCCCCCcc
Q 001814 419 HVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS---GFQTLSSQGGDPYLFPVLSLPWW 490 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~---~~~~H~s~~~~~~~~pv~~lpw~ 490 (1010)
...+.. -.++..|.-|+|+.+-.+||+|+++||++||+|..+.... .+..| ..|++++.|-
T Consensus 293 ~~~~~~---kAh~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~~~~~pVA~fk~H--------k~pItsieW~ 356 (440)
T KOG0302|consen 293 KAAVST---KAHNSDVNVISWNRREPLLASGGDDGTLSIWDLRQFKSGQPVATFKYH--------KAPITSIEWH 356 (440)
T ss_pred cceeEe---eccCCceeeEEccCCcceeeecCCCceEEEEEhhhccCCCcceeEEec--------cCCeeEEEec
Confidence 112222 2255689999999998899999999999999998775542 33455 3578888774
No 186
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=98.56 E-value=2.5e-06 Score=96.14 Aligned_cols=128 Identities=11% Similarity=0.049 Sum_probs=93.6
Q ss_pred CCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceE------eeeeccCCEEEEEEecCCCCCCCCC
Q 001814 52 SEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNE------LVSKRDGPVSFLQMQPFPVKDDGCE 124 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~e------llS~hdGpV~~v~~lP~p~~s~~~D 124 (1010)
.-+|+..|+-+.|+.. ..++|+.|.++. ++||++.+.+..+. .|.+|.-.|..|++-|.-
T Consensus 77 v~GHt~~vLDi~w~Pf------nD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA------- 143 (472)
T KOG0303|consen 77 VCGHTAPVLDIDWCPF------NDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTA------- 143 (472)
T ss_pred ccCccccccccccCcc------CCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccc-------
Confidence 4679999887776531 246899999875 99999976554433 345677778888877643
Q ss_pred CccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--
Q 001814 125 GFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP-- 202 (1010)
Q Consensus 125 ~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~-- 202 (1010)
.+.|+. +..+++|.+||+.+|+.+-+|+++..|+++.||.
T Consensus 144 -----~NVLls---------------------------------ag~Dn~v~iWnv~tgeali~l~hpd~i~S~sfn~dG 185 (472)
T KOG0303|consen 144 -----PNVLLS---------------------------------AGSDNTVSIWNVGTGEALITLDHPDMVYSMSFNRDG 185 (472)
T ss_pred -----hhhHhh---------------------------------ccCCceEEEEeccCCceeeecCCCCeEEEEEeccCC
Confidence 123332 1136899999999999999999889999999997
Q ss_pred CeEE-EEeCCeEEEEECCCCceeEEEeec
Q 001814 203 RIVA-VGLATQIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 203 rlLA-V~ld~~I~IwD~~Tle~l~tL~t~ 230 (1010)
.+|+ ++-|.+|+|||.++.+.+..-..|
T Consensus 186 s~l~TtckDKkvRv~dpr~~~~v~e~~~h 214 (472)
T KOG0303|consen 186 SLLCTTCKDKKVRVIDPRRGTVVSEGVAH 214 (472)
T ss_pred ceeeeecccceeEEEcCCCCcEeeecccc
Confidence 3555 566778999999998876554333
No 187
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.55 E-value=5.5e-06 Score=90.46 Aligned_cols=246 Identities=15% Similarity=0.166 Sum_probs=151.0
Q ss_pred CCeEEEEEec--------CcEEEEEccCCCcc-----eEeeee----ccCCEEEEEEecCCCCCCCCCCccccCcEEEEE
Q 001814 74 FKQVLLLGYQ--------NGFQVLDVEDASNF-----NELVSK----RDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVV 136 (1010)
Q Consensus 74 ~~~vLalGy~--------~G~qVWDv~~~g~v-----~ellS~----hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvV 136 (1010)
++++|++.|. .+.-||.+...-+- .|.+.. +-|.|.||.+-|+.. -||.+
T Consensus 75 d~~ilaT~yn~~s~s~vl~~aaiw~ipe~~~~S~~~tlE~v~~Ldteavg~i~cvew~Pns~-------------klasm 141 (370)
T KOG1007|consen 75 DQRILATVYNDTSDSGVLTGAAIWQIPEPLGQSNSSTLECVASLDTEAVGKINCVEWEPNSD-------------KLASM 141 (370)
T ss_pred CCceEEEEEeccCCCcceeeEEEEecccccCccccchhhHhhcCCHHHhCceeeEEEcCCCC-------------eeEEe
Confidence 6789999997 45789999643211 233332 347899999998652 23422
Q ss_pred ecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeE-EEEEeCC------CcEEEEEEcC----CeE
Q 001814 137 AGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCY-EHVLRFR------SSVCMVRCSP----RIV 205 (1010)
Q Consensus 137 sgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~-V~tL~f~------S~V~sVa~S~----rlL 205 (1010)
. ++.|.||++..+.. +..+.-. -.-.+=+++| ..+
T Consensus 142 ~----------------------------------dn~i~l~~l~ess~~vaev~ss~s~e~~~~ftsg~WspHHdgnqv 187 (370)
T KOG1007|consen 142 D----------------------------------DNNIVLWSLDESSKIVAEVLSSESAEMRHSFTSGAWSPHHDGNQV 187 (370)
T ss_pred c----------------------------------cCceEEEEcccCcchheeecccccccccceecccccCCCCccceE
Confidence 1 35799999987765 5554322 1234556666 588
Q ss_pred EEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCC
Q 001814 206 AVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPS 285 (1010)
Q Consensus 206 AV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~s 285 (1010)
++..+..+.+||++|+++...|...-. .+. |=|-|.
T Consensus 188 ~tt~d~tl~~~D~RT~~~~~sI~dAHg----------------q~v---rdlDfN------------------------- 223 (370)
T KOG1007|consen 188 ATTSDSTLQFWDLRTMKKNNSIEDAHG----------------QRV---RDLDFN------------------------- 223 (370)
T ss_pred EEeCCCcEEEEEccchhhhcchhhhhc----------------cee---eeccCC-------------------------
Confidence 999999999999999987776653110 000 001111
Q ss_pred cCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCC-CcEEEE
Q 001814 286 TSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVT-RAIISQ 364 (1010)
Q Consensus 286 tSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s-~~~v~~ 364 (1010)
|..+. .+++++.||.|+|||... ...+..
T Consensus 224 -----------------------------------------------pnkq~---~lvt~gDdgyvriWD~R~tk~pv~e 253 (370)
T KOG1007|consen 224 -----------------------------------------------PNKQH---ILVTCGDDGYVRIWDTRKTKFPVQE 253 (370)
T ss_pred -----------------------------------------------CCceE---EEEEcCCCccEEEEeccCCCccccc
Confidence 11110 234678899999999986 457899
Q ss_pred eccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccC--CCCCCcc-----ccCCcceEE-----EEEeccccc
Q 001814 365 FKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRS--GSGNHKY-----DWNSSHVHL-----YKLHRGITS 431 (1010)
Q Consensus 365 ~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~--~sG~~~~-----~~~~s~~~L-----~~L~RG~t~ 431 (1010)
+.+|++-|.++.|+|.- +||.|++.|- .+.+|......+-. ..+...+ +-...+++| .++. .+.
T Consensus 254 l~~HsHWvW~VRfn~~hdqLiLs~~SDs-~V~Lsca~svSSE~qi~~~~dese~e~~dseer~kpL~dg~l~tyd--ehE 330 (370)
T KOG1007|consen 254 LPGHSHWVWAVRFNPEHDQLILSGGSDS-AVNLSCASSVSSEQQIEFEDDESESEDEDSEERVKPLQDGQLETYD--EHE 330 (370)
T ss_pred cCCCceEEEEEEecCccceEEEecCCCc-eeEEEeccccccccccccccccccCcchhhHHhccccccccccccc--ccc
Confidence 99999999999999965 6888888854 47788763210000 0000000 000011111 1221 133
Q ss_pred ccEEEEEEccCCCEE-EEEeCCCeEEEEeCCCC
Q 001814 432 ATIQDICFSHYSQWI-AIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 432 a~I~sIAFSpDg~~L-AsgS~dGTVhIw~I~~~ 463 (1010)
..|++++||.-.-|+ |+-|-||.+.|=.+.++
T Consensus 331 DSVY~~aWSsadPWiFASLSYDGRviIs~V~r~ 363 (370)
T KOG1007|consen 331 DSVYALAWSSADPWIFASLSYDGRVIISSVPRF 363 (370)
T ss_pred cceEEEeeccCCCeeEEEeccCceEEeecCChh
Confidence 479999999777776 45677999988776543
No 188
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=98.54 E-value=0.00013 Score=82.69 Aligned_cols=83 Identities=19% Similarity=0.383 Sum_probs=58.6
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
....|+++|||++|..+......|-+|++.+. .| ....+..+.-+ ......++|+|||+||+++.
T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~-----~g--------~l~~~~~~~~~--G~~Pr~~~~s~~g~~l~Va~ 310 (345)
T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPA-----TG--------TLTLVQTVPTG--GKFPRHFAFSPDGRYLYVAN 310 (345)
T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTT-----TT--------TEEEEEEEEES--SSSEEEEEE-TTSSEEEEEE
T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecC-----CC--------ceEEEEEEeCC--CCCccEEEEeCCCCEEEEEe
Confidence 57889999999999888876778999999532 12 11233333221 12378999999999999987
Q ss_pred -CCCeEEEEeCCCCCCccc
Q 001814 451 -SKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 451 -~dGTVhIw~I~~~gg~~~ 468 (1010)
.+++|.+|++++..|...
T Consensus 311 ~~s~~v~vf~~d~~tG~l~ 329 (345)
T PF10282_consen 311 QDSNTVSVFDIDPDTGKLT 329 (345)
T ss_dssp TTTTEEEEEEEETTTTEEE
T ss_pred cCCCeEEEEEEeCCCCcEE
Confidence 467999999987766543
No 189
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=98.51 E-value=2e-05 Score=89.25 Aligned_cols=95 Identities=18% Similarity=0.304 Sum_probs=67.2
Q ss_pred CeEEEEECCC-CcEEEEeccCCCCeEEEEECC------------------CCCEEEEEEcCCCeEEEEeCCCCcccCCCC
Q 001814 349 GIVVVKDFVT-RAIISQFKAHTSPISALCFDP------------------SGTLLVTASVYGNNINIFRIMPSCMRSGSG 409 (1010)
Q Consensus 349 G~V~VwDl~s-~~~v~~~~aHtspIsaLaFSP------------------dGtlLATAS~dGt~IrVwdi~p~~~~~~sG 409 (1010)
.+..+|+-.. .+.+..+..-..+..++.|+| -+-.+|.|.. ..+.|||....
T Consensus 261 n~tYvfsrk~l~rP~~~lp~~~k~~lavr~~pVy~elrp~~~~~~~~~lpyrlvfaiAt~--~svyvydtq~~------- 331 (434)
T KOG1009|consen 261 NTSYVFSRKDLKRPAARLPSPKKPALAVRFSPVYYELRPLSSEKFLFVLPYRLVFAIATK--NSVYVYDTQTL------- 331 (434)
T ss_pred ceeEeeccccccCceeecCCCCcceEEEEeeeeEEEeccccccccccccccceEEEEeec--ceEEEeccccc-------
Confidence 3455665544 345677777777777887776 2334566666 34788987531
Q ss_pred CCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 410 NHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..++.+ -++|...|++|+||+||..|+++|.||-+-+-.+++.
T Consensus 332 ----------~P~~~v-~nihy~~iTDiaws~dg~~l~vSS~DGyCS~vtfe~~ 374 (434)
T KOG1009|consen 332 ----------EPLAVV-DNIHYSAITDIAWSDDGSVLLVSSTDGFCSLVTFEPW 374 (434)
T ss_pred ----------cceEEE-eeeeeeeecceeecCCCcEEEEeccCCceEEEEEcch
Confidence 345544 4566778999999999999999999999888777654
No 190
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.51 E-value=1e-06 Score=92.48 Aligned_cols=93 Identities=15% Similarity=0.300 Sum_probs=71.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
+..++.|.+||+. .+.+..|. ..++..|.|||+|++||+|+.++ ..|.+||+.. . ..+
T Consensus 79 g~~~~~v~lyd~~-~~~i~~~~--~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~-------~----------~~i 138 (194)
T PF08662_consen 79 GSMPAKVTLYDVK-GKKIFSFG--TQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRK-------K----------KKI 138 (194)
T ss_pred ccCCcccEEEcCc-ccEeEeec--CCCceEEEECCCCCEEEEEEccCCCcEEEEEECCC-------C----------EEe
Confidence 4456799999997 66666664 56889999999999999998632 2499999853 1 355
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeC------CCeEEEEeCC
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSS------KGTCHVFVLS 461 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~------dGTVhIw~I~ 461 (1010)
.++.. ..+..++|||||++|++++. |..+.||+..
T Consensus 139 ~~~~~----~~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 139 STFEH----SDATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred ecccc----CcEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 55432 24789999999999999875 6778999974
No 191
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=98.50 E-value=1.4e-06 Score=96.70 Aligned_cols=245 Identities=12% Similarity=0.151 Sum_probs=151.6
Q ss_pred CeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCc
Q 001814 75 KQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHL 153 (1010)
Q Consensus 75 ~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~ 153 (1010)
.+.+++|+.+| +++||... ++..+.++.+.+.+..++|+-.. .|-++.+++
T Consensus 40 e~~vav~lSngsv~lyd~~t-g~~l~~fk~~~~~~N~vrf~~~d------------s~h~v~s~s--------------- 91 (376)
T KOG1188|consen 40 ETAVAVSLSNGSVRLYDKGT-GQLLEEFKGPPATTNGVRFISCD------------SPHGVISCS--------------- 91 (376)
T ss_pred ceeEEEEecCCeEEEEeccc-hhhhheecCCCCcccceEEecCC------------CCCeeEEec---------------
Confidence 46899999987 99999965 77888888888888888887321 122233221
Q ss_pred cccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC----cEEEEEE--cCCeEEEEeCC-----eEEEEECCCCc
Q 001814 154 GGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS----SVCMVRC--SPRIVAVGLAT-----QIYCFDALTLE 222 (1010)
Q Consensus 154 ~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S----~V~sVa~--S~rlLAV~ld~-----~I~IwD~~Tle 222 (1010)
++++||+||+++...+..+.+.. +-.+++. +.+++++++.. .+++||++.-+
T Consensus 92 -----------------sDG~Vr~wD~Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~q 154 (376)
T KOG1188|consen 92 -----------------SDGTVRLWDIRSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQ 154 (376)
T ss_pred -----------------cCCeEEEEEeecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEecccc
Confidence 35789999999998888887642 3344444 56788887643 59999998654
Q ss_pred e-eEEEee-cCCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehh
Q 001814 223 N-KFSVLT-YPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAME 300 (1010)
Q Consensus 223 ~-l~tL~t-~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~d 300 (1010)
. +..+.. |..- ...+- |.|.+-.
T Consensus 155 q~l~~~~eSH~DD-------------VT~lr----------------------------------FHP~~pn-------- 179 (376)
T KOG1188|consen 155 QLLRQLNESHNDD-------------VTQLR----------------------------------FHPSDPN-------- 179 (376)
T ss_pred chhhhhhhhccCc-------------ceeEE----------------------------------ecCCCCC--------
Confidence 3 222211 1110 00000 1111000
Q ss_pred hhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCc---EEEEeccCCCCeEEEEE
Q 001814 301 HSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRA---IISQFKAHTSPISALCF 377 (1010)
Q Consensus 301 ssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~---~v~~~~aHtspIsaLaF 377 (1010)
.+.+|+.||-|.|||+.... .+.+.-.|.+.|..+.|
T Consensus 180 ----------------------------------------lLlSGSvDGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw 219 (376)
T KOG1188|consen 180 ----------------------------------------LLLSGSVDGLVNLFDTKKDNEEDALLHVINHGSSIHLIGW 219 (376)
T ss_pred ----------------------------------------eEEeecccceEEeeecCCCcchhhHHHhhcccceeeeeee
Confidence 12467889999999997643 23334457788999999
Q ss_pred CCCC-CEEEEEEcCCCeEEEEeCCCCcc--c-------------------------CC-------CCCCcc---------
Q 001814 378 DPSG-TLLVTASVYGNNINIFRIMPSCM--R-------------------------SG-------SGNHKY--------- 413 (1010)
Q Consensus 378 SPdG-tlLATAS~dGt~IrVwdi~p~~~--~-------------------------~~-------sG~~~~--------- 413 (1010)
..+| +.|.+-+..++ +.+|++..... . .+ .|.-..
T Consensus 220 ~~~~ykrI~clTH~Et-f~~~ele~~~~~~~~~~~~~~~~d~r~~~~~dY~I~~~~~~~~~~~~l~g~~~n~~~~~~~~~ 298 (376)
T KOG1188|consen 220 LSKKYKRIMCLTHMET-FAIYELEDGSEETWLENPDVSADDLRKEDNCDYVINEHSPGDKDTCALAGTDSNKGTIFPLVD 298 (376)
T ss_pred ecCCcceEEEEEccCc-eeEEEccCCChhhcccCccchhhhHHhhhhhhheeecccCCCcceEEEeccccCceeEEEeee
Confidence 9888 34555555454 88898864210 0 00 000000
Q ss_pred -ccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 414 -DWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 414 -~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.-.....++..|++| +...|.++.|..-+..+.+|+.||-+.+|+.+
T Consensus 299 ~~s~~~~~~~a~l~g~-~~eiVR~i~~~~~~~~l~TGGEDG~l~~Wk~~ 346 (376)
T KOG1188|consen 299 TSSGSLLTEPAILQGG-HEEIVRDILFDVKNDVLYTGGEDGLLQAWKVE 346 (376)
T ss_pred cccccccCccccccCC-cHHHHHHHhhhcccceeeccCCCceEEEEecC
Confidence 000011223334444 34578999999999999999999999999973
No 192
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.49 E-value=4.4e-06 Score=102.65 Aligned_cols=101 Identities=16% Similarity=0.233 Sum_probs=75.7
Q ss_pred cccCCCCeEEEEECCCCc--EEEEeccCC--C-CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCC
Q 001814 343 ADMDNAGIVVVKDFVTRA--IISQFKAHT--S-PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNS 417 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~--~v~~~~aHt--s-pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~ 417 (1010)
.+++.+|.|.+||+.... ..-++..|- + .+++|...++..++|+|+. ..|+||++. |
T Consensus 1273 vSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~--q~ikIy~~~--------G-------- 1334 (1387)
T KOG1517|consen 1273 VSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSA--QLIKIYSLS--------G-------- 1334 (1387)
T ss_pred eeeccCCeEEEEecccCcccccceeeeccccCccceeeeeccCCCeeeecCc--ceEEEEecC--------h--------
Confidence 467899999999998732 233455554 3 5999999999999999997 569999985 3
Q ss_pred cceEEEEEe-----cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 418 SHVHLYKLH-----RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 418 s~~~L~~L~-----RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.++..++ -|-....+.+++|.|---.||+|+.|.||-||..++.
T Consensus 1335 --~~l~~~k~n~~F~~q~~gs~scL~FHP~~~llAaG~~Ds~V~iYs~~k~ 1383 (1387)
T KOG1517|consen 1335 --EQLNIIKYNPGFMGQRIGSVSCLAFHPHRLLLAAGSADSTVSIYSCEKP 1383 (1387)
T ss_pred --hhhcccccCcccccCcCCCcceeeecchhHhhhhccCCceEEEeecCCc
Confidence 1221111 1222235789999999999999999999999997654
No 193
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=98.48 E-value=1.6e-06 Score=97.62 Aligned_cols=104 Identities=12% Similarity=0.120 Sum_probs=78.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
.+++.|.+|.||++.+++.+.++. |..-|.+++|+.||.+|+|++. +..|||||... | ..+
T Consensus 148 lsag~Dn~v~iWnv~tgeali~l~-hpd~i~S~sfn~dGs~l~Ttck-DKkvRv~dpr~-------~----------~~v 208 (472)
T KOG0303|consen 148 LSAGSDNTVSIWNVGTGEALITLD-HPDMVYSMSFNRDGSLLCTTCK-DKKVRVIDPRR-------G----------TVV 208 (472)
T ss_pred hhccCCceEEEEeccCCceeeecC-CCCeEEEEEeccCCceeeeecc-cceeEEEcCCC-------C----------cEe
Confidence 356789999999999999888888 9999999999999999999999 56799999753 3 244
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeC---CCeEEEEeCCCCCCc
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSS---KGTCHVFVLSPFGGD 466 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~---dGTVhIw~I~~~gg~ 466 (1010)
.+- .++..++-..+-|=.+|..+.+|-+ +..+-||+-.....+
T Consensus 209 ~e~-~~heG~k~~Raifl~~g~i~tTGfsr~seRq~aLwdp~nl~eP 254 (472)
T KOG0303|consen 209 SEG-VAHEGAKPARAIFLASGKIFTTGFSRMSERQIALWDPNNLEEP 254 (472)
T ss_pred eec-ccccCCCcceeEEeccCceeeeccccccccceeccCcccccCc
Confidence 443 3444455566778889995555443 346888876554443
No 194
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=98.47 E-value=4.3e-07 Score=105.63 Aligned_cols=110 Identities=19% Similarity=0.255 Sum_probs=85.0
Q ss_pred ccccCCCCeEEEEECCC--------CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcc
Q 001814 342 GADMDNAGIVVVKDFVT--------RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s--------~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~ 413 (1010)
+++++.+|++.+|.++. -+.+.+|+||..||.|++.+++|..+.|++.||+ ||.|++.++.+ ...
T Consensus 309 lit~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v~~n~~~~ysgg~Dg~-I~~w~~p~n~d------p~d 381 (577)
T KOG0642|consen 309 LITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVVPSNGEHCYSGGIDGT-IRCWNLPPNQD------PDD 381 (577)
T ss_pred EEEeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEecCCceEEEeeccCce-eeeeccCCCCC------ccc
Confidence 46789999999999932 2478899999999999999999999999999876 99999854311 000
Q ss_pred ccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 414 DWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 414 ~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
..... .....| -|++.+ ||.+++|.....|+++|.|||+++|...
T Consensus 382 s~dp~-vl~~~l-~Ghtda-vw~l~~s~~~~~Llscs~DgTvr~w~~~ 426 (577)
T KOG0642|consen 382 SYDPS-VLSGTL-LGHTDA-VWLLALSSTKDRLLSCSSDGTVRLWEPT 426 (577)
T ss_pred ccCcc-hhccce-eccccc-eeeeeecccccceeeecCCceEEeeccC
Confidence 00110 122222 577655 9999999999999999999999999864
No 195
>PRK00178 tolB translocation protein TolB; Provisional
Probab=98.47 E-value=0.00011 Score=85.22 Aligned_cols=95 Identities=8% Similarity=0.012 Sum_probs=55.5
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.++|+.+++... +..........+|||||++|+..+.++. .|.+||+.. | ....+..
T Consensus 312 ~iy~~d~~~g~~~~-lt~~~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~t-------g-----------~~~~lt~ 372 (430)
T PRK00178 312 QIYKVNVNGGRAER-VTFVGNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQR-------G-----------SVRILTD 372 (430)
T ss_pred eEEEEECCCCCEEE-eecCCCCccceEECCCCCEEEEEEccCCceEEEEEECCC-------C-----------CEEEccC
Confidence 46666776665322 2111122345789999999988875433 466667642 2 1122222
Q ss_pred ccccccEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCCc
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGGD 466 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg~ 466 (1010)
+ ......+|||||++|+..+.+ |.-+||-+...|+.
T Consensus 373 ~---~~~~~p~~spdg~~i~~~~~~~g~~~l~~~~~~g~~ 409 (430)
T PRK00178 373 T---SLDESPSVAPNGTMLIYATRQQGRGVLMLVSINGRV 409 (430)
T ss_pred C---CCCCCceECCCCCEEEEEEecCCceEEEEEECCCCc
Confidence 1 112356899999999988765 45666766655543
No 196
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.44 E-value=5.8e-06 Score=90.36 Aligned_cols=106 Identities=15% Similarity=0.210 Sum_probs=82.8
Q ss_pred ccccCCCCeEEEEECCCCcEEEEec---cCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCC
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFK---AHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNS 417 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~---aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~ 417 (1010)
+++.+.||.|++||+...+--..+- ....|+..|+++++. .+|||-..+...|.|-|+.-. +
T Consensus 212 FASvgaDGSvRmFDLR~leHSTIIYE~p~~~~pLlRLswnkqDpnymATf~~dS~~V~iLDiR~P------~-------- 277 (364)
T KOG0290|consen 212 FASVGADGSVRMFDLRSLEHSTIIYEDPSPSTPLLRLSWNKQDPNYMATFAMDSNKVVILDIRVP------C-------- 277 (364)
T ss_pred EEEecCCCcEEEEEecccccceEEecCCCCCCcceeeccCcCCchHHhhhhcCCceEEEEEecCC------C--------
Confidence 3466789999999999876433332 235799999999866 799999888888999999632 2
Q ss_pred cceEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCCCC
Q 001814 418 SHVHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 418 s~~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
..+.+|+ + |.+.|..|+|.|.+ ..|+++++|..+.||+|.....
T Consensus 278 --tpva~L~-~-H~a~VNgIaWaPhS~~hictaGDD~qaliWDl~q~~~ 322 (364)
T KOG0290|consen 278 --TPVARLR-N-HQASVNGIAWAPHSSSHICTAGDDCQALIWDLQQMPR 322 (364)
T ss_pred --cceehhh-c-CcccccceEecCCCCceeeecCCcceEEEEecccccc
Confidence 2566674 4 45789999999975 6999999999999999987544
No 197
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.44 E-value=6.3e-06 Score=96.06 Aligned_cols=89 Identities=13% Similarity=0.144 Sum_probs=59.0
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccc
Q 001814 351 VVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGIT 430 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t 430 (1010)
|.+||+.++. +..+..|...+...+|+|||+.|+.++..+...+||++... +. ....+ +..
T Consensus 274 Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~------~~----------~~~~l--~~~ 334 (429)
T PRK01742 274 IYVMGANGGT-PSQLTSGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSAS------GG----------GASLV--GGR 334 (429)
T ss_pred EEEEECCCCC-eEeeccCCCCcCCEEECCCCCEEEEEECCCCCceEEEEECC------CC----------CeEEe--cCC
Confidence 4455766655 34566677778899999999988877754445899987432 10 11111 111
Q ss_pred cccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 431 SATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 431 ~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
. ...+|||||++|+.++.++. .+|++..
T Consensus 335 -~--~~~~~SpDG~~ia~~~~~~i-~~~Dl~~ 362 (429)
T PRK01742 335 -G--YSAQISADGKTLVMINGDNV-VKQDLTS 362 (429)
T ss_pred -C--CCccCCCCCCEEEEEcCCCE-EEEECCC
Confidence 1 45789999999999887654 4577754
No 198
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.44 E-value=1.7e-06 Score=100.30 Aligned_cols=89 Identities=15% Similarity=0.273 Sum_probs=68.7
Q ss_pred ccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCC
Q 001814 329 VSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGS 408 (1010)
Q Consensus 329 ~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~s 408 (1010)
+.|++++++++ ..++||.++|||+.+-+++..++.--..+.|++|||||+++||+++|+ .+.||.+..
T Consensus 296 f~FS~DG~~LA----~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDD-LVtVwSf~e------- 363 (636)
T KOG2394|consen 296 FAFSPDGKYLA----TVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDD-LVTVWSFEE------- 363 (636)
T ss_pred eeEcCCCceEE----EEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCcc-eEEEEEecc-------
Confidence 44555666544 678999999999999888888888889999999999999999999965 699999853
Q ss_pred CCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 409 GNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 409 G~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
+ ++..--.| |..+|..++|.|
T Consensus 364 r-----------RVVARGqG-HkSWVs~VaFDp 384 (636)
T KOG2394|consen 364 R-----------RVVARGQG-HKSWVSVVAFDP 384 (636)
T ss_pred c-----------eEEEeccc-cccceeeEeecc
Confidence 1 22221124 345899999983
No 199
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=98.43 E-value=5.2e-06 Score=96.20 Aligned_cols=208 Identities=19% Similarity=0.235 Sum_probs=135.0
Q ss_pred CeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCCc
Q 001814 75 KQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSHL 153 (1010)
Q Consensus 75 ~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~~ 153 (1010)
...|+++..+| |.|.+ ..+.+...++-|.|.|.+-++.|++.. | +. +|+
T Consensus 75 ~d~~~i~s~DGkf~il~--k~~rVE~sv~AH~~A~~~gRW~~dGtg------------L-lt-~GE-------------- 124 (737)
T KOG1524|consen 75 SDTLLICSNDGRFVILN--KSARVERSISAHAAAISSGRWSPDGAG------------L-LT-AGE-------------- 124 (737)
T ss_pred cceEEEEcCCceEEEec--ccchhhhhhhhhhhhhhhcccCCCCce------------e-ee-ecC--------------
Confidence 34666666665 88876 468899999999999999999988721 2 32 221
Q ss_pred cccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEE-eCCCcEEEEEEcC--CeEEEEeCCeEEEEECCCCceeEEEeec
Q 001814 154 GGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVL-RFRSSVCMVRCSP--RIVAVGLATQIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 154 ~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL-~f~S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~l~tL~t~ 230 (1010)
++.|++|+- +|-.-.++ ++..+|+++++.| .-++-+..++|+|=-+.-....
T Consensus 125 ------------------DG~iKiWSr-sGMLRStl~Q~~~~v~c~~W~p~S~~vl~c~g~h~~IKpL~~n~k~------ 179 (737)
T KOG1524|consen 125 ------------------DGVIKIWSR-SGMLRSTVVQNEESIRCARWAPNSNSIVFCQGGHISIKPLAANSKI------ 179 (737)
T ss_pred ------------------CceEEEEec-cchHHHHHhhcCceeEEEEECCCCCceEEecCCeEEEeecccccce------
Confidence 468999985 55444444 5778999999987 4566666677766322111101
Q ss_pred CCccccCCCccccccCccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccc
Q 001814 231 PVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLS 310 (1010)
Q Consensus 231 p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ 310 (1010)
.||=|.-|- ++.
T Consensus 180 -----------------------i~WkAHDGi--iL~------------------------------------------- 191 (737)
T KOG1524|consen 180 -----------------------IRWRAHDGL--VLS------------------------------------------- 191 (737)
T ss_pred -----------------------eEEeccCcE--EEE-------------------------------------------
Confidence 133332220 000
Q ss_pred eeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC
Q 001814 311 KTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY 390 (1010)
Q Consensus 311 ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d 390 (1010)
+ +|..-.-.+++++.|-..+|||.. |..+-+-.+|..||++++|+|+ ++.|.+|.
T Consensus 192 --~--------------------~W~~~s~lI~sgGED~kfKvWD~~-G~~Lf~S~~~ey~ITSva~npd-~~~~v~S~- 246 (737)
T KOG1524|consen 192 --L--------------------SWSTQSNIIASGGEDFRFKIWDAQ-GANLFTSAAEEYAITSVAFNPE-KDYLLWSY- 246 (737)
T ss_pred --e--------------------ecCccccceeecCCceeEEeeccc-CcccccCChhccceeeeeeccc-cceeeeee-
Confidence 0 000000124577899999999986 5556666789999999999999 88887876
Q ss_pred CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeE-EEEeCC
Q 001814 391 GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTC-HVFVLS 461 (1010)
Q Consensus 391 Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTV-hIw~I~ 461 (1010)
++.| |.- | ....|..++||+||..+++|+..|.+ |-+.|+
T Consensus 247 -nt~R-~~~-p----------------------------~~GSifnlsWS~DGTQ~a~gt~~G~v~~A~~ie 287 (737)
T KOG1524|consen 247 -NTAR-FSS-P----------------------------RVGSIFNLSWSADGTQATCGTSTGQLIVAYAIE 287 (737)
T ss_pred -eeee-ecC-C----------------------------CccceEEEEEcCCCceeeccccCceEEEeeeeh
Confidence 3344 211 1 11358899999999999999999874 334443
No 200
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.41 E-value=1.9e-05 Score=89.10 Aligned_cols=106 Identities=21% Similarity=0.186 Sum_probs=87.9
Q ss_pred ccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 342 GADMDNAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
++++..-+.|++||...+ +.+.+|.--..+|++++..|+|+++.+|...|. +..||+.. | +
T Consensus 219 fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g~-l~~FD~r~-------~----------k 280 (412)
T KOG3881|consen 219 FATITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKGQ-LAKFDLRG-------G----------K 280 (412)
T ss_pred EEEEecceeEEEecCcccCcceeEeccccCcceeeeecCCCcEEEEecccch-hheecccC-------c----------e
Confidence 445667789999999875 478899888999999999999999999999886 88999853 3 3
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
++-....|.+ ..|.+|.-.|..++||+++.|.-|+||++...+-.
T Consensus 281 l~g~~~kg~t-Gsirsih~hp~~~~las~GLDRyvRIhD~ktrkll 325 (412)
T KOG3881|consen 281 LLGCGLKGIT-GSIRSIHCHPTHPVLASCGLDRYVRIHDIKTRKLL 325 (412)
T ss_pred eeccccCCcc-CCcceEEEcCCCceEEeeccceeEEEeecccchhh
Confidence 4444456665 45999999999999999999999999999885544
No 201
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=98.40 E-value=7.4e-06 Score=90.13 Aligned_cols=40 Identities=15% Similarity=0.361 Sum_probs=35.6
Q ss_pred cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 360 AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 360 ~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
+.+..|..|..+|..+.|+=.|+.|++.+.||. +|+|...
T Consensus 279 ~~vs~~~~H~~~VWrv~wNmtGtiLsStGdDG~-VRLWkan 318 (361)
T KOG2445|consen 279 EKVSELDDHNGEVWRVRWNMTGTILSSTGDDGC-VRLWKAN 318 (361)
T ss_pred EEeeeccCCCCceEEEEEeeeeeEEeecCCCce-eeehhhh
Confidence 356788899999999999999999999999886 8999863
No 202
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.37 E-value=3.7e-07 Score=105.63 Aligned_cols=103 Identities=17% Similarity=0.205 Sum_probs=72.1
Q ss_pred EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEc
Q 001814 361 IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFS 440 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFS 440 (1010)
.+..+.--..+|...+|||||++||+.|.||. +|||+... ++|.-+.+- .-+...|++||
T Consensus 282 Pv~~w~~~~g~in~f~FS~DG~~LA~VSqDGf-LRvF~fdt------------------~eLlg~mkS-YFGGLLCvcWS 341 (636)
T KOG2394|consen 282 PVARWHIGEGSINEFAFSPDGKYLATVSQDGF-LRIFDFDT------------------QELLGVMKS-YFGGLLCVCWS 341 (636)
T ss_pred ccceeEeccccccceeEcCCCceEEEEecCce-EEEeeccH------------------HHHHHHHHh-hccceEEEEEc
Confidence 33334334558999999999999999999876 99999742 122111111 11247899999
Q ss_pred cCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCC
Q 001814 441 HYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFP 483 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~p 483 (1010)
|||+||++|+.|.-|.||.+....-..--+.|.++|......|
T Consensus 342 PDGKyIvtGGEDDLVtVwSf~erRVVARGqGHkSWVs~VaFDp 384 (636)
T KOG2394|consen 342 PDGKYIVTGGEDDLVTVWSFEERRVVARGQGHKSWVSVVAFDP 384 (636)
T ss_pred CCccEEEecCCcceEEEEEeccceEEEeccccccceeeEeecc
Confidence 9999999999999999999976433222367777665443333
No 203
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.37 E-value=4.9e-05 Score=88.73 Aligned_cols=96 Identities=14% Similarity=0.206 Sum_probs=60.8
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
..|.+||+.++.. ..+..|.......+|+|||+.||.++..+. .|.+|++.. + ....+.
T Consensus 270 ~~Iy~~d~~~~~~-~~Lt~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g-------~-----------~~~~lt 330 (435)
T PRK05137 270 TDIYTMDLRSGTT-TRLTDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADG-------S-----------NPRRIS 330 (435)
T ss_pred ceEEEEECCCCce-EEccCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCC-------C-----------CeEEee
Confidence 4577888887764 456666666777999999999998875432 355556531 2 122232
Q ss_pred cccccccEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGG 465 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg 465 (1010)
.+ ...+...+|||||++|+..+.+ +..+||-++..++
T Consensus 331 ~~--~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~ 368 (435)
T PRK05137 331 FG--GGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGS 368 (435)
T ss_pred cC--CCcccCeEECCCCCEEEEEEcCCCceEEEEEECCCC
Confidence 22 1235668899999999998764 3345554444343
No 204
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=98.35 E-value=0.00016 Score=82.86 Aligned_cols=89 Identities=16% Similarity=0.191 Sum_probs=56.2
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC--eEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN--NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt--~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.++|+.+++. ..+..+...+..++|+|||++|+.++.++. .|.+|++.. + .+..+..
T Consensus 303 ~iy~~d~~~~~~-~~l~~~~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~-------~-----------~~~~l~~ 363 (417)
T TIGR02800 303 QIYMMDADGGEV-RRLTFRGGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDG-------G-----------GERVLTD 363 (417)
T ss_pred eEEEEECCCCCE-EEeecCCCCccCeEECCCCCEEEEEEccCCceEEEEEeCCC-------C-----------CeEEccC
Confidence 567777776653 344445556778899999999998887543 355666532 2 1122222
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
+ ......+|+|||++|+..+.++....+.+
T Consensus 364 ~---~~~~~p~~spdg~~l~~~~~~~~~~~l~~ 393 (417)
T TIGR02800 364 T---GLDESPSFAPNGRMILYATTRGGRGVLGL 393 (417)
T ss_pred C---CCCCCceECCCCCEEEEEEeCCCcEEEEE
Confidence 1 12345689999999999888765443333
No 205
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.35 E-value=3.7e-05 Score=94.89 Aligned_cols=101 Identities=20% Similarity=0.221 Sum_probs=75.2
Q ss_pred cCCCCeEEEEECCCC---cEEEEeccCCCC--eEEEEECCCCCE-EEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 345 MDNAGIVVVKDFVTR---AIISQFKAHTSP--ISALCFDPSGTL-LVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~---~~v~~~~aHtsp--IsaLaFSPdGtl-LATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
|-.||.|++||.... ..+...+.|+.+ |.-+.|.++|-- |++||.+|. |++||+.-... ...
T Consensus 1227 GfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~-I~~~DlR~~~~-----------e~~ 1294 (1387)
T KOG1517|consen 1227 GFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGD-IQLLDLRMSSK-----------ETF 1294 (1387)
T ss_pred eecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCCe-EEEEecccCcc-----------ccc
Confidence 446999999998753 367888999987 999999998865 999999886 99999963200 000
Q ss_pred ceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 419 HVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
....+....|- .++++.-.++...||+|+. +.|.||++.
T Consensus 1295 ~~iv~~~~yGs---~lTal~VH~hapiiAsGs~-q~ikIy~~~ 1333 (1387)
T KOG1517|consen 1295 LTIVAHWEYGS---ALTALTVHEHAPIIASGSA-QLIKIYSLS 1333 (1387)
T ss_pred ceeeeccccCc---cceeeeeccCCCeeeecCc-ceEEEEecC
Confidence 11222222241 3678999999999999999 899999975
No 206
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.34 E-value=5.1e-05 Score=88.70 Aligned_cols=93 Identities=18% Similarity=0.179 Sum_probs=60.3
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI 429 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~ 429 (1010)
.|.+||+.+++.. .+..+...+...+|+|||+.|+.++.++...+||.+... .| ...++...
T Consensus 268 ~I~~~d~~tg~~~-~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~-----~g-----------~~~~lt~~- 329 (429)
T PRK03629 268 NLYVMDLASGQIR-QVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNIN-----GG-----------APQRITWE- 329 (429)
T ss_pred EEEEEECCCCCEE-EccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECC-----CC-----------CeEEeecC-
Confidence 5888999887654 444445567889999999999888876555677765321 12 11222211
Q ss_pred ccccEEEEEEccCCCEEEEEeCC-Ce--EEEEeCC
Q 001814 430 TSATIQDICFSHYSQWIAIVSSK-GT--CHVFVLS 461 (1010)
Q Consensus 430 t~a~I~sIAFSpDg~~LAsgS~d-GT--VhIw~I~ 461 (1010)
...+...+|||||++|+..+.+ +. +.+|++.
T Consensus 330 -~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~ 363 (429)
T PRK03629 330 -GSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLA 363 (429)
T ss_pred -CCCccCEEECCCCCEEEEEEccCCCceEEEEECC
Confidence 1235678999999999887654 33 4445543
No 207
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.33 E-value=1.7e-05 Score=88.80 Aligned_cols=100 Identities=21% Similarity=0.346 Sum_probs=73.0
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
.+..|+|||..++..+-...---..++-|.|||||.+|..|.-|+ +++||+.... |.... ..+
T Consensus 216 gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~da-vfrlw~e~q~------------wt~er---w~l- 278 (445)
T KOG2139|consen 216 GSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDA-VFRLWQENQS------------WTKER---WIL- 278 (445)
T ss_pred CcceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEecccc-eeeeehhccc------------ceecc---eec-
Confidence 356799999999887655545557788999999999999999855 5999976421 22211 122
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
....|+.-+|+|+|++|..+....+ .||.+...+...
T Consensus 279 ---gsgrvqtacWspcGsfLLf~~sgsp-~lysl~f~~~~~ 315 (445)
T KOG2139|consen 279 ---GSGRVQTACWSPCGSFLLFACSGSP-RLYSLTFDGEDS 315 (445)
T ss_pred ---cCCceeeeeecCCCCEEEEEEcCCc-eEEEEeecCCCc
Confidence 1237999999999999988876655 588887655543
No 208
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.32 E-value=4.4e-05 Score=89.09 Aligned_cols=94 Identities=24% Similarity=0.314 Sum_probs=60.6
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
..|.+||+.+++. ..+..|.......+|+|||+.|+.++..+...+||.+... .| ..+.+ .+ .|
T Consensus 272 ~~Iy~~d~~~g~~-~~lt~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~-----~g--------~~~~l-t~-~g 335 (433)
T PRK04922 272 PEIYVMDLGSRQL-TRLTNHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAAS-----GG--------SAERL-TF-QG 335 (433)
T ss_pred ceEEEEECCCCCe-EECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECC-----CC--------CeEEe-ec-CC
Confidence 4689999988764 4566666556778999999999888765434455554211 12 11122 22 22
Q ss_pred cccccEEEEEEccCCCEEEEEeCCC---eEEEEeCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKG---TCHVFVLS 461 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dG---TVhIw~I~ 461 (1010)
.....++|||||++|+..+.++ .|.+|++.
T Consensus 336 ---~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~ 368 (433)
T PRK04922 336 ---NYNARASVSPDGKKIAMVHGSGGQYRIAVMDLS 368 (433)
T ss_pred ---CCccCEEECCCCCEEEEEECCCCceeEEEEECC
Confidence 1234689999999999876543 46777764
No 209
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.30 E-value=1.1e-06 Score=102.98 Aligned_cols=99 Identities=24% Similarity=0.312 Sum_probs=79.5
Q ss_pred cccCCCCeEEEEECCCCc-------EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccc
Q 001814 343 ADMDNAGIVVVKDFVTRA-------IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYD 414 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~-------~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~ 414 (1010)
+-+..||.|+||-+..+. .-..|.+|...|.+|.|.|=- ..||+||.| .+|++||+.. +
T Consensus 644 AVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd-~Ti~lWDl~~-------~----- 710 (1012)
T KOG1445|consen 644 AVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYD-STIELWDLAN-------A----- 710 (1012)
T ss_pred eecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhcc-ceeeeeehhh-------h-----
Confidence 457789999999997642 446788999999999999955 588999995 5699999964 2
Q ss_pred cCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 415 WNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 415 ~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.+|.-..|++ ..|.++||||||+.+|+...||+++||.-.
T Consensus 711 ------~~~~~l~gHt-dqIf~~AWSpdGr~~AtVcKDg~~rVy~Pr 750 (1012)
T KOG1445|consen 711 ------KLYSRLVGHT-DQIFGIAWSPDGRRIATVCKDGTLRVYEPR 750 (1012)
T ss_pred ------hhhheeccCc-CceeEEEECCCCcceeeeecCceEEEeCCC
Confidence 2222224654 579999999999999999999999999743
No 210
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.25 E-value=1.1e-05 Score=88.34 Aligned_cols=102 Identities=20% Similarity=0.225 Sum_probs=80.6
Q ss_pred cCCCCeEEEEECCCCcEEEEe-ccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQF-KAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~-~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
+..++++..||+.+-+....| .||...|..|-|+|+-+ +||||+.||. |||||.... ...+
T Consensus 189 tt~d~tl~~~D~RT~~~~~sI~dAHgq~vrdlDfNpnkq~~lvt~gDdgy-vriWD~R~t----------------k~pv 251 (370)
T KOG1007|consen 189 TTSDSTLQFWDLRTMKKNNSIEDAHGQRVRDLDFNPNKQHILVTCGDDGY-VRIWDTRKT----------------KFPV 251 (370)
T ss_pred EeCCCcEEEEEccchhhhcchhhhhcceeeeccCCCCceEEEEEcCCCcc-EEEEeccCC----------------Cccc
Confidence 357899999999987766556 48999999999999986 7899998776 999998531 1356
Q ss_pred EEEecccccccEEEEEEccC-CCEEEEEeCCCeEEEEeCCCCCC
Q 001814 423 YKLHRGITSATIQDICFSHY-SQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpD-g~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
++|. | +..+||++-|.|- .+.|.+++.|..|.+|....-..
T Consensus 252 ~el~-~-HsHWvW~VRfn~~hdqLiLs~~SDs~V~Lsca~svSS 293 (370)
T KOG1007|consen 252 QELP-G-HSHWVWAVRFNPEHDQLILSGGSDSAVNLSCASSVSS 293 (370)
T ss_pred cccC-C-CceEEEEEEecCccceEEEecCCCceeEEEecccccc
Confidence 6663 3 3468999999985 68899999999999997765443
No 211
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.25 E-value=4.1e-06 Score=94.97 Aligned_cols=122 Identities=17% Similarity=0.260 Sum_probs=85.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCC-C--CC-------C
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSG-S--GN-------H 411 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~-s--G~-------~ 411 (1010)
+++++.||+++||+.-+...+..+.+|...|..|.|||||++||+-+.+ ..+||++........ + ++ .
T Consensus 159 latgg~dg~lRv~~~Ps~~t~l~e~~~~~eV~DL~FS~dgk~lasig~d--~~~VW~~~~g~~~a~~t~~~k~~~~~~cR 236 (398)
T KOG0771|consen 159 LATGGTDGTLRVWEWPSMLTILEEIAHHAEVKDLDFSPDGKFLASIGAD--SARVWSVNTGAALARKTPFSKDEMFSSCR 236 (398)
T ss_pred eeeccccceEEEEecCcchhhhhhHhhcCccccceeCCCCcEEEEecCC--ceEEEEeccCchhhhcCCcccchhhhhce
Confidence 4578899999999998888888999999999999999999999999985 589999975310000 0 00 0
Q ss_pred cc-ccCC------------cceEEEE--Eecc---------cc-cccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 412 KY-DWNS------------SHVHLYK--LHRG---------IT-SATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 412 ~~-~~~~------------s~~~L~~--L~RG---------~t-~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
+. +-.+ -...+++ +..+ .. ...|.+++-|+||+++|.|+.+|.|-|++......
T Consensus 237 F~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~~~l~~~~~~~~~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~ 315 (398)
T KOG0771|consen 237 FSVDNAQETLRLAASQFPGGGVRLCDISLWSGSNFLRLRKKIKRFKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQR 315 (398)
T ss_pred ecccCCCceEEEEEecCCCCceeEEEeeeeccccccchhhhhhccCcceeEEEcCCCcEEEEeccCCcEEEEEeceeee
Confidence 00 0000 0000111 1111 01 12589999999999999999999999999876443
No 212
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.24 E-value=0.0018 Score=74.47 Aligned_cols=98 Identities=14% Similarity=0.160 Sum_probs=66.4
Q ss_pred CCCeEEEEECCCC-----cEEEEecc-------CCCCeEEEEECCCCCEEEEEEcC---------CCeEEEEeCCCCccc
Q 001814 347 NAGIVVVKDFVTR-----AIISQFKA-------HTSPISALCFDPSGTLLVTASVY---------GNNINIFRIMPSCMR 405 (1010)
Q Consensus 347 ~dG~V~VwDl~s~-----~~v~~~~a-------HtspIsaLaFSPdGtlLATAS~d---------Gt~IrVwdi~p~~~~ 405 (1010)
..|.|.+.|+... ..+..+.. .-..+.-++|+|+|+.|..+... |+.|-|+|+..
T Consensus 213 ~eG~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t---- 288 (352)
T TIGR02658 213 YTGKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKT---- 288 (352)
T ss_pred cCCeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCC----
Confidence 4499999996443 23333221 11223349999999988875421 25688888753
Q ss_pred CCCCCCccccCCcceEEEEEecccccccEEEEEEccCCC-EEEEEe-CCCeEEEEeCCCCC
Q 001814 406 SGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQ-WIAIVS-SKGTCHVFVLSPFG 464 (1010)
Q Consensus 406 ~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~-~LAsgS-~dGTVhIw~I~~~g 464 (1010)
+ +.+.++.-| ..++.|+||||++ +|.+.. .+++|+|+|+...+
T Consensus 289 ---~----------kvi~~i~vG---~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k 333 (352)
T TIGR02658 289 ---G----------KRLRKIELG---HEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGK 333 (352)
T ss_pred ---C----------eEEEEEeCC---CceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCe
Confidence 2 456666544 3689999999999 887777 57899999987653
No 213
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.23 E-value=6.6e-05 Score=87.64 Aligned_cols=93 Identities=17% Similarity=0.210 Sum_probs=58.6
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccc
Q 001814 351 VVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGIT 430 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t 430 (1010)
|.++|+.++. +..+..|...+...+|+|||+.|+..+..+....||.+... .+ ..+.+ .+ .|
T Consensus 266 Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~-----~g--------~~~~l-t~-~g-- 327 (427)
T PRK02889 266 IYTVNADGSG-LRRLTQSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPAS-----GG--------AAQRV-TF-TG-- 327 (427)
T ss_pred EEEEECCCCC-cEECCCCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECC-----CC--------ceEEE-ec-CC--
Confidence 4444555444 44555565556678899999999887765555788876321 12 11222 22 22
Q ss_pred cccEEEEEEccCCCEEEEEeCCC---eEEEEeCCC
Q 001814 431 SATIQDICFSHYSQWIAIVSSKG---TCHVFVLSP 462 (1010)
Q Consensus 431 ~a~I~sIAFSpDg~~LAsgS~dG---TVhIw~I~~ 462 (1010)
......+|||||++||..+.++ .|.+|++..
T Consensus 328 -~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~ 361 (427)
T PRK02889 328 -SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLAT 361 (427)
T ss_pred -CCcCceEECCCCCEEEEEEccCCcEEEEEEECCC
Confidence 1234678999999999888765 477877754
No 214
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.23 E-value=0.00069 Score=79.88 Aligned_cols=95 Identities=11% Similarity=0.078 Sum_probs=54.5
Q ss_pred eEEEEECCCCcEEE-EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 350 IVVVKDFVTRAIIS-QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~-~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
.|.++|+.+++... ++.++. ....+|+|||++|+..+.++...+||.+... +| .+..+..+
T Consensus 331 ~Iy~~dl~~g~~~~Lt~~g~~--~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~-----~g-----------~~~~lt~~ 392 (448)
T PRK04792 331 QIYRVNLASGKVSRLTFEGEQ--NLGGSITPDGRSMIMVNRTNGKFNIARQDLE-----TG-----------AMQVLTST 392 (448)
T ss_pred eEEEEECCCCCEEEEecCCCC--CcCeeECCCCCEEEEEEecCCceEEEEEECC-----CC-----------CeEEccCC
Confidence 56667777665432 223332 3346899999999888765554666654321 12 12222222
Q ss_pred cccccEEEEEEccCCCEEEEEeCCC-eEEEEeCCCCCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKG-TCHVFVLSPFGG 465 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dG-TVhIw~I~~~gg 465 (1010)
. .....+|+|||++|+..+.++ .-.||-++..|+
T Consensus 393 ~---~d~~ps~spdG~~I~~~~~~~g~~~l~~~~~~G~ 427 (448)
T PRK04792 393 R---LDESPSVAPNGTMVIYSTTYQGKQVLAAVSIDGR 427 (448)
T ss_pred C---CCCCceECCCCCEEEEEEecCCceEEEEEECCCC
Confidence 1 123458999999998877654 444665554444
No 215
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=98.15 E-value=9.2e-06 Score=95.84 Aligned_cols=143 Identities=15% Similarity=0.170 Sum_probs=107.0
Q ss_pred cEEEEEEcC--CeEEEEeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCccceEEccceEEEccCCeeeccCCc
Q 001814 194 SVCMVRCSP--RIVAVGLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGPMAVGPRWLAYASNTLLLSNSGR 271 (1010)
Q Consensus 194 ~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gplAlgpRwLAyas~~~~iwd~G~ 271 (1010)
.|++|+|.| ..|+++.++++++||..++..+++|..|..- ...+||+-+.
T Consensus 14 ci~d~afkPDGsqL~lAAg~rlliyD~ndG~llqtLKgHKDt--------------------VycVAys~dG-------- 65 (1081)
T KOG1538|consen 14 CINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKDT--------------------VYCVAYAKDG-------- 65 (1081)
T ss_pred chheeEECCCCceEEEecCCEEEEEeCCCcccccccccccce--------------------EEEEEEccCC--------
Confidence 688999988 4788888899999999999999999887651 1234555421
Q ss_pred cCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCccCCCccccccccccccCCCCeE
Q 001814 272 LSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIV 351 (1010)
Q Consensus 272 vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V 351 (1010)
. -+++|+.|.+|
T Consensus 66 -------------------k-------------------------------------------------rFASG~aDK~V 77 (1081)
T KOG1538|consen 66 -------------------K-------------------------------------------------RFASGSADKSV 77 (1081)
T ss_pred -------------------c-------------------------------------------------eeccCCCceeE
Confidence 0 12346678999
Q ss_pred EEEECCCCcEEEEec-cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccc
Q 001814 352 VVKDFVTRAIISQFK-AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGIT 430 (1010)
Q Consensus 352 ~VwDl~s~~~v~~~~-aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t 430 (1010)
.||.-.- -..++ .|+..|.||.|+|-...|||+|-. .+-+|.... +.+.+. + .
T Consensus 78 I~W~~kl---EG~LkYSH~D~IQCMsFNP~~h~LasCsLs--dFglWS~~q------------------K~V~K~-k--s 131 (1081)
T KOG1538|consen 78 IIWTSKL---EGILKYSHNDAIQCMSFNPITHQLASCSLS--DFGLWSPEQ------------------KSVSKH-K--S 131 (1081)
T ss_pred EEecccc---cceeeeccCCeeeEeecCchHHHhhhcchh--hccccChhh------------------hhHHhh-h--h
Confidence 9997542 23344 699999999999999999999983 367887642 112221 1 2
Q ss_pred cccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 431 SATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 431 ~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
.++|.+.+|..||++||.|-.+|||.|=
T Consensus 132 s~R~~~CsWtnDGqylalG~~nGTIsiR 159 (1081)
T KOG1538|consen 132 SSRIICCSWTNDGQYLALGMFNGTISIR 159 (1081)
T ss_pred heeEEEeeecCCCcEEEEeccCceEEee
Confidence 4579999999999999999999999886
No 216
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=98.14 E-value=3.1e-05 Score=90.56 Aligned_cols=56 Identities=21% Similarity=0.408 Sum_probs=52.3
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
.+..+++|+++|..++.+++...+|...+++++|.|+|-+|++++.+|. +++|.+.
T Consensus 506 ~~hed~~Ir~~dn~~~~~l~s~~a~~~svtslai~~ng~~l~s~s~d~s-v~l~kld 561 (577)
T KOG0642|consen 506 TAHEDRSIRFFDNKTGKILHSMVAHKDSVTSLAIDPNGPYLMSGSHDGS-VRLWKLD 561 (577)
T ss_pred ecccCCceecccccccccchheeeccceecceeecCCCceEEeecCCce-eehhhcc
Confidence 5678999999999999999999999999999999999999999999776 8999884
No 217
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=98.12 E-value=5e-06 Score=63.96 Aligned_cols=38 Identities=26% Similarity=0.655 Sum_probs=34.9
Q ss_pred cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEe
Q 001814 360 AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFR 398 (1010)
Q Consensus 360 ~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwd 398 (1010)
+++.+|++|.++|.+|+|+|++.+|||++.|+ .|+|||
T Consensus 2 ~~~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~-~i~vwd 39 (39)
T PF00400_consen 2 KCVRTFRGHSSSINSIAWSPDGNFLASGSSDG-TIRVWD 39 (39)
T ss_dssp EEEEEEESSSSSEEEEEEETTSSEEEEEETTS-EEEEEE
T ss_pred eEEEEEcCCCCcEEEEEEecccccceeeCCCC-EEEEEC
Confidence 56789999999999999999999999999965 599997
No 218
>PRK01029 tolB translocation protein TolB; Provisional
Probab=98.08 E-value=0.0011 Score=77.73 Aligned_cols=85 Identities=13% Similarity=0.228 Sum_probs=53.9
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF 439 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF 439 (1010)
...+..+...+...+|||||++||..+.++ ..|.+|++.. | ....+..+ ...+.+.+|
T Consensus 319 ~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~-------g-----------~~~~Lt~~--~~~~~~p~w 378 (428)
T PRK01029 319 PRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLAT-------G-----------RDYQLTTS--PENKESPSW 378 (428)
T ss_pred eEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCC-------C-----------CeEEccCC--CCCccceEE
Confidence 344445555677889999999998776542 3588888752 2 12233222 124677999
Q ss_pred ccCCCEEEEEeC-CCeEEEEeCCCCCCc
Q 001814 440 SHYSQWIAIVSS-KGTCHVFVLSPFGGD 466 (1010)
Q Consensus 440 SpDg~~LAsgS~-dGTVhIw~I~~~gg~ 466 (1010)
+|||++|+..+. ++.-.||.++..++.
T Consensus 379 SpDG~~L~f~~~~~g~~~L~~vdl~~g~ 406 (428)
T PRK01029 379 AIDSLHLVYSAGNSNESELYLISLITKK 406 (428)
T ss_pred CCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 999999986554 445556655544443
No 219
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.06 E-value=0.00036 Score=82.25 Aligned_cols=98 Identities=19% Similarity=0.272 Sum_probs=62.4
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI 429 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~ 429 (1010)
.|.++|+.+++. ..+..|.......+|+|||+.|+..+..+....||.+... +|. .+.+ .+ .|.
T Consensus 287 ~Iy~~dl~tg~~-~~lt~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~-----~g~--------~~~L-t~-~g~ 350 (448)
T PRK04792 287 EIYVVDIATKAL-TRITRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLA-----SGK--------VSRL-TF-EGE 350 (448)
T ss_pred EEEEEECCCCCe-EECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECC-----CCC--------EEEE-ec-CCC
Confidence 577888887764 4555565566778999999998877765444566654321 121 1122 11 222
Q ss_pred ccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCCCCCc
Q 001814 430 TSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSPFGGD 466 (1010)
Q Consensus 430 t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~~gg~ 466 (1010)
.....+|||||++|+..+. ++..+||.++..++.
T Consensus 351 ---~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~ 385 (448)
T PRK04792 351 ---QNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGA 385 (448)
T ss_pred ---CCcCeeECCCCCEEEEEEecCCceEEEEEECCCCC
Confidence 2345799999999988776 456788877765554
No 220
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=98.05 E-value=0.00018 Score=85.27 Aligned_cols=262 Identities=15% Similarity=0.103 Sum_probs=146.1
Q ss_pred CCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 73 ~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
|++.-|+++..+.+-|||+++ |.+.+.|.+|...|.+|++..++ .+.|. |
T Consensus 22 PDGsqL~lAAg~rlliyD~nd-G~llqtLKgHKDtVycVAys~dG-------------krFAS--G-------------- 71 (1081)
T KOG1538|consen 22 PDGTQLILAAGSRLLVYDTSD-GTLLQPLKGHKDTVYCVAYAKDG-------------KRFAS--G-------------- 71 (1081)
T ss_pred CCCceEEEecCCEEEEEeCCC-cccccccccccceEEEEEEccCC-------------ceecc--C--------------
Confidence 567778888889999999965 66788999999999999988665 23342 1
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEeCCeEEEEECCCCceeEEEeec
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGLATQIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle~l~tL~t~ 230 (1010)
++++.|.+|+-+- +-+-.+.+...|.++.||| +.|+++.-...-+|....... .. +
T Consensus 72 -----------------~aDK~VI~W~~kl-EG~LkYSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V-~K---~ 129 (1081)
T KOG1538|consen 72 -----------------SADKSVIIWTSKL-EGILKYSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSV-SK---H 129 (1081)
T ss_pred -----------------CCceeEEEecccc-cceeeeccCCeeeEeecCchHHHhhhcchhhccccChhhhhH-Hh---h
Confidence 1368899998753 3233344557899999999 677777666677886543211 00 1
Q ss_pred CCccccCCCccccccCccceEEcc--ceEEEccC--Ceeecc-CCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhh
Q 001814 231 PVPQLAGQGAVGINVGYGPMAVGP--RWLAYASN--TLLLSN-SGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQF 305 (1010)
Q Consensus 231 p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~--~~~iwd-~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~l 305 (1010)
.. .. ....+++.. .+||.+-. ++.+=+ .|.....-
T Consensus 130 ks-------s~----R~~~CsWtnDGqylalG~~nGTIsiRNk~gEek~~I----------------------------- 169 (1081)
T KOG1538|consen 130 KS-------SS----RIICCSWTNDGQYLALGMFNGTISIRNKNGEEKVKI----------------------------- 169 (1081)
T ss_pred hh-------he----eEEEeeecCCCcEEEEeccCceEEeecCCCCcceEE-----------------------------
Confidence 00 00 011233322 33443221 111100 01100000
Q ss_pred hcccceeeccccccccCCCCCCCc---cCCCccccccc-cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCC
Q 001814 306 AAGLSKTLSKYCQELLPDGSSSPV---SPNSVWKVGRH-AGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSG 381 (1010)
Q Consensus 306 a~Gi~ktls~y~~~l~p~gs~s~~---S~s~~~k~~~~-~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdG 381 (1010)
-.|+|.++++ .++|....|.. +++-.+-..++..|.+. |+.+..=++-.-.--|+++=|+|
T Consensus 170 --------------~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~Ls-G~~Igk~r~L~FdP~CisYf~NG 234 (1081)
T KOG1538|consen 170 --------------ERPGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLS-GKQIGKDRALNFDPCCISYFTNG 234 (1081)
T ss_pred --------------eCCCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEec-ceeecccccCCCCchhheeccCC
Confidence 0122222211 11111111100 00111222233333332 22222112212222477888999
Q ss_pred CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 382 TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 382 tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.++..++.++. +++|.-. | ..|-++ |....+||.++..|+|+.+++|..|||+--|++-
T Consensus 235 Ey~LiGGsdk~-L~~fTR~--------G----------vrLGTv--g~~D~WIWtV~~~PNsQ~v~~GCqDGTiACyNl~ 293 (1081)
T KOG1538|consen 235 EYILLGGSDKQ-LSLFTRD--------G----------VRLGTV--GEQDSWIWTVQAKPNSQYVVVGCQDGTIACYNLI 293 (1081)
T ss_pred cEEEEccCCCc-eEEEeec--------C----------eEEeec--cccceeEEEEEEccCCceEEEEEccCeeehhhhH
Confidence 99999988665 8888642 4 455554 3234589999999999999999999999998875
Q ss_pred C
Q 001814 462 P 462 (1010)
Q Consensus 462 ~ 462 (1010)
.
T Consensus 294 f 294 (1081)
T KOG1538|consen 294 F 294 (1081)
T ss_pred H
Confidence 3
No 221
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.05 E-value=0.0078 Score=68.16 Aligned_cols=106 Identities=18% Similarity=0.221 Sum_probs=69.9
Q ss_pred CCCeEEEEECCCC-c------EEEEecc---CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccC
Q 001814 347 NAGIVVVKDFVTR-A------IISQFKA---HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 347 ~dG~V~VwDl~s~-~------~v~~~~a---HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
-+++|.||+.... . .+.++++ -+...++|..+|||++|..+-..-..|-+|.+.+. +|
T Consensus 211 L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~-----~g------- 278 (346)
T COG2706 211 LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPD-----GG------- 278 (346)
T ss_pred cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCC-----CC-------
Confidence 3556666666552 2 2222322 23467889999999999866553446899999764 12
Q ss_pred CcceEEEEEecccccc-cEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCCccc
Q 001814 417 SSHVHLYKLHRGITSA-TIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 417 ~s~~~L~~L~RG~t~a-~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg~~~ 468 (1010)
+|--+.+-.+.+ .-.+..|++++++|+++..+ .+++||.+++.-|...
T Consensus 279 ----~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~~TG~L~ 328 (346)
T COG2706 279 ----KLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFERDKETGRLT 328 (346)
T ss_pred ----EEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcCCCceEE
Confidence 232222222222 25789999999999998876 4799999998877543
No 222
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=98.04 E-value=0.00057 Score=78.35 Aligned_cols=95 Identities=17% Similarity=0.214 Sum_probs=59.8
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
..|.+||+.++.. ..+..|.......+|+|||+.|+.++.++...+||.+... .+ ....+..+
T Consensus 258 ~~i~~~d~~~~~~-~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~-----~~-----------~~~~l~~~ 320 (417)
T TIGR02800 258 PDIYVMDLDGKQL-TRLTNGPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDAD-----GG-----------EVRRLTFR 320 (417)
T ss_pred ccEEEEECCCCCE-EECCCCCCCCCCEEECCCCCEEEEEECCCCCceEEEEECC-----CC-----------CEEEeecC
Confidence 4688888887654 3455555555677999999999887765543345543211 12 12222211
Q ss_pred cccccEEEEEEccCCCEEEEEeCCC---eEEEEeCCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKG---TCHVFVLSP 462 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dG---TVhIw~I~~ 462 (1010)
...+..++|||||++|+.++.++ .|.+|++..
T Consensus 321 --~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~ 355 (417)
T TIGR02800 321 --GGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDG 355 (417)
T ss_pred --CCCccCeEECCCCCEEEEEEccCCceEEEEEeCCC
Confidence 12356789999999999998875 566666543
No 223
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=98.03 E-value=0.00084 Score=76.00 Aligned_cols=296 Identities=14% Similarity=0.115 Sum_probs=158.9
Q ss_pred CCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCC-----CcceEeeeecc-CCEEEEEEecCCCCCCCCCC
Q 001814 53 EDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDA-----SNFNELVSKRD-GPVSFLQMQPFPVKDDGCEG 125 (1010)
Q Consensus 53 ~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~-----g~v~ellS~hd-GpV~~v~~lP~p~~s~~~D~ 125 (1010)
-+|.+-|....|.. ..+.|+.|.++- .+||+++.. -+-..+...|. ..|.+++|.-..
T Consensus 53 ~~H~GCiNAlqFS~-------N~~~L~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N-------- 117 (609)
T KOG4227|consen 53 REHTGCINALQFSH-------NDRFLASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLEN-------- 117 (609)
T ss_pred hhhccccceeeecc-------CCeEEeecCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCC--------
Confidence 44566666666642 478999999875 899999642 11122233222 456666654110
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCC---CcEEEEEEcC
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFR---SSVCMVRCSP 202 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~---S~V~sVa~S~ 202 (1010)
..|. +| +. ..+|.+-|+.+.+-+...... +.||.+..+|
T Consensus 118 -----~~~~--SG--------------------~~-----------~~~VI~HDiEt~qsi~V~~~~~~~~~VY~m~~~P 159 (609)
T KOG4227|consen 118 -----RFLY--SG--------------------ER-----------WGTVIKHDIETKQSIYVANENNNRGDVYHMDQHP 159 (609)
T ss_pred -----eeEe--cC--------------------CC-----------cceeEeeecccceeeeeecccCcccceeecccCC
Confidence 1111 11 11 268999999999888777554 4899999998
Q ss_pred --CeEEEE-eCCeEEEEECCCCceeEEEe---ecCCccccCCCccccccCccceEEc---cceEEEccC--CeeeccCCc
Q 001814 203 --RIVAVG-LATQIYCFDALTLENKFSVL---TYPVPQLAGQGAVGINVGYGPMAVG---PRWLAYASN--TLLLSNSGR 271 (1010)
Q Consensus 203 --rlLAV~-ld~~I~IwD~~Tle~l~tL~---t~p~p~~~~~g~~~vnv~~gplAlg---pRwLAyas~--~~~iwd~G~ 271 (1010)
++|++. .++.|-+||++.-.....+. .++. .+-..-|. |++||.+.. .+-+||.-
T Consensus 160 ~DN~~~~~t~~~~V~~~D~Rd~~~~~~~~~~AN~~~-------------~F~t~~F~P~~P~Li~~~~~~~G~~~~D~R- 225 (609)
T KOG4227|consen 160 TDNTLIVVTRAKLVSFIDNRDRQNPISLVLPANSGK-------------NFYTAEFHPETPALILVNSETGGPNVFDRR- 225 (609)
T ss_pred CCceEEEEecCceEEEEeccCCCCCCceeeecCCCc-------------cceeeeecCCCceeEEeccccCCCCceeec-
Confidence 566654 55689999998755322222 2221 12223343 378887765 35567631
Q ss_pred cCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeec----cccccccCCCCCCCccCCCccccccccccccCC
Q 001814 272 LSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLS----KYCQELLPDGSSSPVSPNSVWKVGRHAGADMDN 347 (1010)
Q Consensus 272 vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls----~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~ 347 (1010)
. +| .+ +-+| |..+.|. .|...+ +++. +.++.+-..
T Consensus 226 ~-~~------~~----------~~~~----------~~~~~L~~~~~~~M~~~----------~~~~----G~Q~msiRR 264 (609)
T KOG4227|consen 226 M-QA------RP----------VYQR----------SMFKGLPQENTEWMGSL----------WSPS----GNQFMSIRR 264 (609)
T ss_pred c-cc------ch----------HHhh----------hccccCcccchhhhhee----------eCCC----CCeehhhhc
Confidence 0 00 00 0000 0001110 011111 1111 111111122
Q ss_pred CCeEEEEECCCCcE-EEEeccC-------CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCC---CCCCccccC
Q 001814 348 AGIVVVKDFVTRAI-ISQFKAH-------TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSG---SGNHKYDWN 416 (1010)
Q Consensus 348 dG~V~VwDl~s~~~-v~~~~aH-------tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~---sG~~~~~~~ 416 (1010)
-..-.+||+.+..+ +-++. | ...|.+++|--|-+ +||+|. .-.|++|.+.......+ .|...-..+
T Consensus 265 ~~~P~~~D~~S~R~~V~k~D-~N~~GY~N~~T~KS~~F~~D~~-v~tGSD-~~~i~~WklP~~~ds~G~~~IG~~~~~~~ 341 (609)
T KOG4227|consen 265 GKCPLYFDFISQRCFVLKSD-HNPNGYCNIKTIKSMTFIDDYT-VATGSD-HWGIHIWKLPRANDSYGFTQIGHDEEEMP 341 (609)
T ss_pred cCCCEEeeeecccceeEecc-CCCCcceeeeeeeeeeeeccee-eeccCc-ccceEEEecCCCccccCccccCcchhhCc
Confidence 22345677776443 22222 2 23577899977655 888887 56699999943211111 010000000
Q ss_pred C--cceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 417 S--SHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 417 ~--s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
+ ....-+...||+ +..+..+-|+|-..+|++++-...++||.-
T Consensus 342 ~~~~i~~~~~VLrGH-RSv~NQVRF~~H~~~l~SSGVE~~~KlWS~ 386 (609)
T KOG4227|consen 342 SEIFIEKELTVLRGH-RSVPNQVRFSQHNNLLVSSGVENSFKLWSD 386 (609)
T ss_pred hhheecceeEEEecc-cccccceeecCCcceEeccchhhheecccc
Confidence 0 111223345775 467888999999999999999999999974
No 224
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.03 E-value=4.7e-05 Score=82.62 Aligned_cols=57 Identities=21% Similarity=0.302 Sum_probs=53.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
+++++-||.|+||...+.+.++.++-|...|.+|+|+||-.++|.||.|++ |-+|++
T Consensus 266 lATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~lmAaaskD~r-ISLWkL 322 (323)
T KOG0322|consen 266 LATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCELMAAASKDAR-ISLWKL 322 (323)
T ss_pred EeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCchhhhccCCce-EEeeec
Confidence 457889999999999999999999999999999999999999999999765 999985
No 225
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=98.01 E-value=0.0015 Score=70.51 Aligned_cols=63 Identities=13% Similarity=0.089 Sum_probs=53.7
Q ss_pred CCEEEEEeCCCCeEEEEEeCCC-cEEEEEE---cCCeEEEEeCCeEEEEECCCCceeEEEeecCCcc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRS-SVCMVRC---SPRIVAVGLATQIYCFDALTLENKFSVLTYPVPQ 234 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S-~V~sVa~---S~rlLAV~ld~~I~IwD~~Tle~l~tL~t~p~p~ 234 (1010)
++.+.-||+++|+.-.+++.|+ .|.+|.. ++++|.-+-|+++++||.+|.++..+|..+..|.
T Consensus 135 D~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~v~~ie~yk~~~ 201 (325)
T KOG0649|consen 135 DGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKHVSMIEPYKNPN 201 (325)
T ss_pred CeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEeccccceeEEeccccChh
Confidence 4789999999999999999875 7888877 3478888888999999999999999998776653
No 226
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.00 E-value=2e-06 Score=103.39 Aligned_cols=96 Identities=23% Similarity=0.371 Sum_probs=83.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
++++..|..|+||...+..+++..++|...|+.|+.+.+.+++|+||. +.+||||.+.. | ..
T Consensus 205 Iitgsdd~lvKiwS~et~~~lAs~rGhs~ditdlavs~~n~~iaaaS~-D~vIrvWrl~~-------~----------~p 266 (1113)
T KOG0644|consen 205 IITGSDDRLVKIWSMETARCLASCRGHSGDITDLAVSSNNTMIAAASN-DKVIRVWRLPD-------G----------AP 266 (1113)
T ss_pred EeecCccceeeeeeccchhhhccCCCCccccchhccchhhhhhhhccc-CceEEEEecCC-------C----------ch
Confidence 467888999999999999999999999999999999999999999999 68899999953 3 23
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
+- +.||++.+ |++|+|||-- +++.|||+.+|+..
T Consensus 267 vs-vLrghtga-vtaiafsP~~----sss~dgt~~~wd~r 300 (1113)
T KOG0644|consen 267 VS-VLRGHTGA-VTAIAFSPRA----SSSDDGTCRIWDAR 300 (1113)
T ss_pred HH-HHhccccc-eeeeccCccc----cCCCCCceEecccc
Confidence 33 34787754 9999999965 88999999999876
No 227
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=97.99 E-value=0.00036 Score=77.61 Aligned_cols=55 Identities=25% Similarity=0.311 Sum_probs=47.7
Q ss_pred ccccCCCCeEEEEECCC-CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 342 GADMDNAGIVVVKDFVT-RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s-~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
+++++.+|.|++||+.+ +..+..+..|..-++.++|+|-=.+|||+| |+ |+|...
T Consensus 312 LasG~tdG~V~vwdlk~~gn~~sv~~~~sd~vNgvslnP~mpilatss--Gq--r~f~~~ 367 (406)
T KOG2919|consen 312 LASGDTDGSVRVWDLKDLGNEVSVTGNYSDTVNGVSLNPIMPILATSS--GQ--RIFKYP 367 (406)
T ss_pred eeccCCCccEEEEecCCCCCcccccccccccccceecCcccceeeecc--Cc--eeecCC
Confidence 45788999999999998 666889999999999999999988999998 55 778663
No 228
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=97.97 E-value=0.00024 Score=83.08 Aligned_cols=121 Identities=10% Similarity=0.037 Sum_probs=70.5
Q ss_pred ccCCCCeEEEEECCCCc-------EEEEe---ccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCC---CCcccCCCC
Q 001814 344 DMDNAGIVVVKDFVTRA-------IISQF---KAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIM---PSCMRSGSG 409 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~-------~v~~~---~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~---p~~~~~~sG 409 (1010)
+++.||+++-+|+.... +...+ ....-...+|+.||... +||.++. +-..|+||.. +.+.. .|
T Consensus 165 sasEDGtirQyDiREph~c~p~~~~~~~l~ny~~~lielk~ltisp~rp~~laVGgs-dpfarLYD~Rr~lks~~s--~~ 241 (758)
T KOG1310|consen 165 SASEDGTIRQYDIREPHVCNPDEDCPSILVNYNPQLIELKCLTISPSRPYYLAVGGS-DPFARLYDRRRVLKSFRS--DG 241 (758)
T ss_pred EecCCcceeeecccCCccCCccccccHHHHHhchhhheeeeeeecCCCCceEEecCC-CchhhhhhhhhhccCCCC--Cc
Confidence 46789999999997522 11111 11223567899999875 6677766 6678999942 22111 12
Q ss_pred CCccccCCcceEEEE-------Eeccc-ccc--cEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 410 NHKYDWNSSHVHLYK-------LHRGI-TSA--TIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~-------L~RG~-t~a--~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
.-....+...+++.. ..+|. +.- -++-++|+|+|.-|.+.=...-|.+|+++...++.
T Consensus 242 ~~~~~pp~~~~cv~yf~p~hlkn~~gn~~~~~~~~t~vtfnpNGtElLvs~~gEhVYlfdvn~~~~~~ 309 (758)
T KOG1310|consen 242 TMNTCPPKDCRCVRYFSPGHLKNSQGNLDRYITCCTYVTFNPNGTELLVSWGGEHVYLFDVNEDKSPT 309 (758)
T ss_pred cccCCCCcccchhheecCccccCcccccccceeeeEEEEECCCCcEEEEeeCCeEEEEEeecCCCCce
Confidence 100000111112222 22331 111 25678999999988887777789999998766654
No 229
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=97.95 E-value=2.6e-05 Score=85.95 Aligned_cols=127 Identities=20% Similarity=0.208 Sum_probs=93.7
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC-----CCcc--c--CC------
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM-----PSCM--R--SG------ 407 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~-----p~~~--~--~~------ 407 (1010)
.+++.|.+..||.++++.++.+..+|.+.|+++.|+++|.||+|||-|++ -+||... |... . ++
T Consensus 164 gtASADhTA~iWs~Esg~CL~~Y~GH~GSVNsikfh~s~~L~lTaSGD~t-aHIW~~av~~~vP~~~a~~~hSsEeE~e~ 242 (481)
T KOG0300|consen 164 GTASADHTARIWSLESGACLATYTGHTGSVNSIKFHNSGLLLLTASGDET-AHIWKAAVNWEVPSNNAPSDHSSEEEEEH 242 (481)
T ss_pred eecccccceeEEeeccccceeeecccccceeeEEeccccceEEEccCCcc-hHHHHHhhcCcCCCCCCCCCCCchhhhhc
Confidence 36778999999999999999999999999999999999999999999776 7999832 2110 0 00
Q ss_pred --------CCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccc
Q 001814 408 --------SGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTL 472 (1010)
Q Consensus 408 --------sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H 472 (1010)
.+...++-..-...|.+|. | +.+.|.+..|=.-|+.++++|=|.|..+|+++...-...+..|
T Consensus 243 sDe~~~d~d~~~~sD~~tiRvPl~~lt-g-H~~vV~a~dWL~gg~Q~vTaSWDRTAnlwDVEtge~v~~LtGH 313 (481)
T KOG0300|consen 243 SDEHNRDTDSSEKSDGHTIRVPLMRLT-G-HRAVVSACDWLAGGQQMVTASWDRTANLWDVETGEVVNILTGH 313 (481)
T ss_pred ccccccccccccccCCceeeeeeeeee-c-cccceEehhhhcCcceeeeeeccccceeeeeccCceeccccCc
Confidence 0101111112234566663 4 5678888999999999999999999999999864433344455
No 230
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=97.95 E-value=0.00025 Score=76.40 Aligned_cols=99 Identities=13% Similarity=0.232 Sum_probs=72.3
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE-CCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCF-DPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaF-SPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
.++.|+.+.-||+++|+.-..+++|+..|-+++- +.+|+ +.|+++||+ +||||+.+ + +++
T Consensus 131 ~AgGD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~q-ilsG~EDGt-vRvWd~kt-------~----------k~v 191 (325)
T KOG0649|consen 131 FAGGDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQ-ILSGAEDGT-VRVWDTKT-------Q----------KHV 191 (325)
T ss_pred EecCCeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcc-eeecCCCcc-EEEEeccc-------c----------cee
Confidence 3457999999999999999999999999999998 66665 568999887 89999975 2 222
Q ss_pred EE--------EecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 423 YK--------LHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 423 ~~--------L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
.. +.|-+-..+|-+++ -|..||++|... ..-||.+....
T Consensus 192 ~~ie~yk~~~~lRp~~g~wigala--~~edWlvCGgGp-~lslwhLrsse 238 (325)
T KOG0649|consen 192 SMIEPYKNPNLLRPDWGKWIGALA--VNEDWLVCGGGP-KLSLWHLRSSE 238 (325)
T ss_pred EEeccccChhhcCcccCceeEEEe--ccCceEEecCCC-ceeEEeccCCC
Confidence 22 12322223565555 456699887654 47799887543
No 231
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.91 E-value=0.0013 Score=76.32 Aligned_cols=96 Identities=19% Similarity=0.220 Sum_probs=57.2
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEE--eCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIF--RIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVw--di~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
..|.+||+.+++. ..+..+........|+|||+.|+..+..+...+|| ++.. |. .+.+ .+
T Consensus 267 ~~Iy~~d~~~~~~-~~lt~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~-------g~--------~~~l-t~- 328 (430)
T PRK00178 267 PEIYVMDLASRQL-SRVTNHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNG-------GR--------AERV-TF- 328 (430)
T ss_pred ceEEEEECCCCCe-EEcccCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCC-------CC--------EEEe-ec-
Confidence 3578889887764 34555555566789999999888777644334444 4421 21 1122 11
Q ss_pred cccccccEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGG 465 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg 465 (1010)
.+. .....+|||||++|+..+.+ +..+||-++..++
T Consensus 329 ~~~---~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg 365 (430)
T PRK00178 329 VGN---YNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRG 365 (430)
T ss_pred CCC---CccceEECCCCCEEEEEEccCCceEEEEEECCCC
Confidence 221 23457899999999988764 4334444443333
No 232
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.86 E-value=0.00088 Score=78.96 Aligned_cols=186 Identities=14% Similarity=0.167 Sum_probs=121.6
Q ss_pred CCCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccC
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVG 246 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~ 246 (1010)
+...|.-.+|..|..+..++-. +.+..|..|+ .+|++|. ++.|-+||.++-....+|....+ .. ..++++
T Consensus 153 sg~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~-v~---s~pg~~-- 226 (703)
T KOG2321|consen 153 SGSEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASS-VN---SHPGGD-- 226 (703)
T ss_pred cCcceEEEEccccccccccccccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccc-cC---CCcccc--
Confidence 3456777789999888888776 5788899987 6888887 67899999988766555542111 00 000000
Q ss_pred ccceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCC
Q 001814 247 YGPMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSS 326 (1010)
Q Consensus 247 ~gplAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~ 326 (1010)
-+.++.
T Consensus 227 ---~~~svT----------------------------------------------------------------------- 232 (703)
T KOG2321|consen 227 ---AAPSVT----------------------------------------------------------------------- 232 (703)
T ss_pred ---ccCcce-----------------------------------------------------------------------
Confidence 000000
Q ss_pred CCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC--CCCeEEEEECCC--CCEEEEEEcCCCeEEEEeCCCC
Q 001814 327 SPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH--TSPISALCFDPS--GTLLVTASVYGNNINIFRIMPS 402 (1010)
Q Consensus 327 s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH--tspIsaLaFSPd--GtlLATAS~dGt~IrVwdi~p~ 402 (1010)
++.|+.+ ++.++-|...|.|.|||+.+.+.+ .++.| .-||..|.|.+. +..|+|+.. +.++|||-.+
T Consensus 233 -al~F~d~----gL~~aVGts~G~v~iyDLRa~~pl-~~kdh~~e~pi~~l~~~~~~~q~~v~S~Dk--~~~kiWd~~~- 303 (703)
T KOG2321|consen 233 -ALKFRDD----GLHVAVGTSTGSVLIYDLRASKPL-LVKDHGYELPIKKLDWQDTDQQNKVVSMDK--RILKIWDECT- 303 (703)
T ss_pred -EEEecCC----ceeEEeeccCCcEEEEEcccCCce-eecccCCccceeeecccccCCCceEEecch--HHhhhccccc-
Confidence 1111111 122334667899999999988765 33445 458999999777 456766665 7899999643
Q ss_pred cccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 403 CMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 403 ~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
|. ..... -....|.+++|-|++-++.++-..+-+|.|-|...|
T Consensus 304 ------Gk----------~~asi---Ept~~lND~C~~p~sGm~f~Ane~~~m~~yyiP~LG 346 (703)
T KOG2321|consen 304 ------GK----------PMASI---EPTSDLNDFCFVPGSGMFFTANESSKMHTYYIPSLG 346 (703)
T ss_pred ------CC----------ceeec---cccCCcCceeeecCCceEEEecCCCcceeEEccccC
Confidence 31 11111 123458999999999999999999999999887654
No 233
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.86 E-value=0.0014 Score=77.29 Aligned_cols=129 Identities=22% Similarity=0.273 Sum_probs=89.6
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEeC---CeEEEEECCCCceeEEEeecCCccccCCCccccccC
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGLA---TQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVG 246 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~ld---~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~ 246 (1010)
-.++.+++++..+++..|.-.++|++|.+++ +-++|+.. .++.|||++. ..++.+.+-|
T Consensus 250 Eq~Lyll~t~g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~-~~v~df~egp--------------- 313 (566)
T KOG2315|consen 250 EQTLYLLATQGESVSVPLLKEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRG-KPVFDFPEGP--------------- 313 (566)
T ss_pred cceEEEEEecCceEEEecCCCCCceEEEECCCCCEEEEEEecccceEEEEcCCC-CEeEeCCCCC---------------
Confidence 3799999999777877787789999999998 34555443 4799998763 3333332211
Q ss_pred ccceEEcc--ceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCC
Q 001814 247 YGPMAVGP--RWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDG 324 (1010)
Q Consensus 247 ~gplAlgp--RwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~g 324 (1010)
.+.+-++| ++|..+|
T Consensus 314 RN~~~fnp~g~ii~lAG--------------------------------------------------------------- 330 (566)
T KOG2315|consen 314 RNTAFFNPHGNIILLAG--------------------------------------------------------------- 330 (566)
T ss_pred ccceEECCCCCEEEEee---------------------------------------------------------------
Confidence 22333333 2222221
Q ss_pred CCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC-----CCeEEEEeC
Q 001814 325 SSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY-----GNNINIFRI 399 (1010)
Q Consensus 325 s~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d-----Gt~IrVwdi 399 (1010)
.+.-.|.|-|||+.+.+.+..+.+-.. +...|+|||.+++||..- ++-|+||+.
T Consensus 331 -------------------FGNL~G~mEvwDv~n~K~i~~~~a~~t--t~~eW~PdGe~flTATTaPRlrvdNg~Kiwhy 389 (566)
T KOG2315|consen 331 -------------------FGNLPGDMEVWDVPNRKLIAKFKAANT--TVFEWSPDGEYFLTATTAPRLRVDNGIKIWHY 389 (566)
T ss_pred -------------------cCCCCCceEEEeccchhhccccccCCc--eEEEEcCCCcEEEEEeccccEEecCCeEEEEe
Confidence 134568899999999999999998755 457899999999999863 445999998
Q ss_pred C
Q 001814 400 M 400 (1010)
Q Consensus 400 ~ 400 (1010)
.
T Consensus 390 t 390 (566)
T KOG2315|consen 390 T 390 (566)
T ss_pred c
Confidence 3
No 234
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.84 E-value=3.8e-05 Score=88.25 Aligned_cols=182 Identities=18% Similarity=0.288 Sum_probs=121.5
Q ss_pred CEEEEEeCCCCeEEEEEeCC-CcEEEEEEcCCe--EEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCccccccCcc
Q 001814 173 TAVRFYSFQSHCYEHVLRFR-SSVCMVRCSPRI--VAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYG 248 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~-S~V~sVa~S~rl--LAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~g 248 (1010)
+-++--|+.+|+.|..+.-. +.+..+.-||.- +-+| ..++|-+|.....+.+-.+.+|..| ..
T Consensus 231 G~L~Y~DVS~GklVa~~~t~~G~~~vm~qNP~NaVih~GhsnGtVSlWSP~skePLvKiLcH~g~-------------V~ 297 (545)
T KOG1272|consen 231 GFLKYQDVSTGKLVASIRTGAGRTDVMKQNPYNAVIHLGHSNGTVSLWSPNSKEPLVKILCHRGP-------------VS 297 (545)
T ss_pred CceEEEeechhhhhHHHHccCCccchhhcCCccceEEEcCCCceEEecCCCCcchHHHHHhcCCC-------------cc
Confidence 45788899999999988765 577788888843 3333 3468999988877777777777663 22
Q ss_pred ceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCC
Q 001814 249 PMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSP 328 (1010)
Q Consensus 249 plAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~ 328 (1010)
.+|+-+ . |.+
T Consensus 298 siAv~~-------~----------------------------G~Y----------------------------------- 307 (545)
T KOG1272|consen 298 SIAVDR-------G----------------------------GRY----------------------------------- 307 (545)
T ss_pred eEEECC-------C----------------------------CcE-----------------------------------
Confidence 334321 0 110
Q ss_pred ccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCC
Q 001814 329 VSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGS 408 (1010)
Q Consensus 329 ~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~s 408 (1010)
.++++.|..|+|||+.....+.++.. ..+.+.|+||..|. | .+|. |..+.||.=.-. ++
T Consensus 308 -------------MaTtG~Dr~~kIWDlR~~~ql~t~~t-p~~a~~ls~Sqkgl-L-A~~~-G~~v~iw~d~~~----~s 366 (545)
T KOG1272|consen 308 -------------MATTGLDRKVKIWDLRNFYQLHTYRT-PHPASNLSLSQKGL-L-ALSY-GDHVQIWKDALK----GS 366 (545)
T ss_pred -------------EeecccccceeEeeeccccccceeec-CCCccccccccccc-e-eeec-CCeeeeehhhhc----CC
Confidence 12345688999999988765555544 45888999998874 3 3455 777999964321 11
Q ss_pred CCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccc
Q 001814 409 GNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQ 470 (1010)
Q Consensus 409 G~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~ 470 (1010)
| .....|.-++ ....|.++-|.|-...|.+|...|-.-| |-|..|++++.
T Consensus 367 ~--------~~~~pYm~H~--~~~~V~~l~FcP~EDvLGIGH~~G~tsi--lVPGsGePN~D 416 (545)
T KOG1272|consen 367 G--------HGETPYMNHR--CGGPVEDLRFCPYEDVLGIGHAGGITSI--LVPGSGEPNYD 416 (545)
T ss_pred C--------CCCcchhhhc--cCcccccceeccHHHeeeccccCCceeE--eccCCCCCCcc
Confidence 1 1123333222 3347999999999999999999997666 45777877664
No 235
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.84 E-value=0.0051 Score=71.11 Aligned_cols=177 Identities=15% Similarity=0.108 Sum_probs=108.2
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcE-EEEEEcC--CeEEE-EeCCeEEEEECCCCceeEEEeecCCccccCCCccccccCcc
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSV-CMVRCSP--RIVAV-GLATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYG 248 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V-~sVa~S~--rlLAV-~ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~g 248 (1010)
+.|.|.|..+.+.+.++.....+ ..+.+++ +.+.| +.++.|.++|+.+++.+.++.....| .
T Consensus 16 ~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~--------------~ 81 (369)
T PF02239_consen 16 GSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNP--------------R 81 (369)
T ss_dssp TEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEE--------------E
T ss_pred CEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCc--------------c
Confidence 68999999999999999876555 4566777 55655 44678999999999988887643221 1
Q ss_pred ceEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCC
Q 001814 249 PMAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSP 328 (1010)
Q Consensus 249 plAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~ 328 (1010)
.+++++ ++..+++
T Consensus 82 ~i~~s~----------------------------------DG~~~~v--------------------------------- 94 (369)
T PF02239_consen 82 GIAVSP----------------------------------DGKYVYV--------------------------------- 94 (369)
T ss_dssp EEEE------------------------------------TTTEEEE---------------------------------
T ss_pred eEEEcC----------------------------------CCCEEEE---------------------------------
Confidence 133322 1111110
Q ss_pred ccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccC-------CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 329 VSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAH-------TSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 329 ~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aH-------tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
..-.++.|.|+|..+.+.+.++... .+.+.+|..+|....++.+-.+...|-+-|...
T Consensus 95 ---------------~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d 159 (369)
T PF02239_consen 95 ---------------ANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSD 159 (369)
T ss_dssp ---------------EEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTT
T ss_pred ---------------EecCCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEecc
Confidence 1123678999999999999888754 346889999999997777777655455555432
Q ss_pred CcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE-eCCCeEEEEeCCCC
Q 001814 402 SCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV-SSKGTCHVFVLSPF 463 (1010)
Q Consensus 402 ~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg-S~dGTVhIw~I~~~ 463 (1010)
. ....+..+..| ....+..|+||++++.++ ..+..+-+++....
T Consensus 160 ~---------------~~~~~~~i~~g---~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~ 204 (369)
T PF02239_consen 160 P---------------KNLKVTTIKVG---RFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTG 204 (369)
T ss_dssp S---------------SCEEEEEEE-----TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTT
T ss_pred c---------------cccceeeeccc---ccccccccCcccceeeecccccceeEEEeeccc
Confidence 1 00122233333 246789999999987664 45668888887654
No 236
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.83 E-value=0.016 Score=68.03 Aligned_cols=49 Identities=16% Similarity=0.219 Sum_probs=35.5
Q ss_pred EEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEeC----CeEEEEECCCCc
Q 001814 174 AVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGLA----TQIYCFDALTLE 222 (1010)
Q Consensus 174 tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~ld----~~I~IwD~~Tle 222 (1010)
.|.++|+.+|+......+...+...+++| +.|+.... .+|+++|+.+.+
T Consensus 214 ~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~ 268 (419)
T PRK04043 214 TLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKT 268 (419)
T ss_pred EEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCc
Confidence 68999999998765556777677777887 45665443 369999987765
No 237
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=97.76 E-value=0.0005 Score=77.77 Aligned_cols=219 Identities=18% Similarity=0.156 Sum_probs=121.6
Q ss_pred CCEEEEEeCC------CCeEEEEEe--CCCcEEEEEEcC--CeEEEE-eCCeEEEEECCCCceeEEEeecCCccccCCCc
Q 001814 172 PTAVRFYSFQ------SHCYEHVLR--FRSSVCMVRCSP--RIVAVG-LATQIYCFDALTLENKFSVLTYPVPQLAGQGA 240 (1010)
Q Consensus 172 p~tVrIWDlk------tge~V~tL~--f~S~V~sVa~S~--rlLAV~-ld~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~ 240 (1010)
+..+++|++. +-+.|.... +++.|++++|+. ++|..| -.++|...|+.+.+.++......+. |
T Consensus 77 D~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N~~~~SG~~~~~VI~HDiEt~qsi~V~~~~~~~-----~- 150 (609)
T KOG4227|consen 77 DMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLENRFLYSGERWGTVIKHDIETKQSIYVANENNNR-----G- 150 (609)
T ss_pred cceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCCeeEecCCCcceeEeeecccceeeeeecccCcc-----c-
Confidence 4789999984 234444443 347999999976 455554 4578999999998877665433221 1
Q ss_pred cccccCccceEEcc--ceEEEccC--CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccc
Q 001814 241 VGINVGYGPMAVGP--RWLAYASN--TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKY 316 (1010)
Q Consensus 241 ~~vnv~~gplAlgp--RwLAyas~--~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y 316 (1010)
+ .-.|..+| ..+|..+. .+.+||.- -++.+..+ +-++.+|. .| |
T Consensus 151 -~----VY~m~~~P~DN~~~~~t~~~~V~~~D~R-d~~~~~~~--~~~AN~~~----------------------~F--~ 198 (609)
T KOG4227|consen 151 -D----VYHMDQHPTDNTLIVVTRAKLVSFIDNR-DRQNPISL--VLPANSGK----------------------NF--Y 198 (609)
T ss_pred -c----eeecccCCCCceEEEEecCceEEEEecc-CCCCCCce--eeecCCCc----------------------cc--e
Confidence 0 11244444 45555443 35678741 11101100 00111111 00 1
Q ss_pred cccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcE----EEEeccC---CCCeEEEEECCCCCEEEEEEc
Q 001814 317 CQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAI----ISQFKAH---TSPISALCFDPSGTLLVTASV 389 (1010)
Q Consensus 317 ~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~----v~~~~aH---tspIsaLaFSPdGtlLATAS~ 389 (1010)
...+.|. .. ..++.+...|-+.|||....+. ...+.+- ...-..+-|+|+|+.|.+--.
T Consensus 199 t~~F~P~-----------~P---~Li~~~~~~~G~~~~D~R~~~~~~~~~~~~~~L~~~~~~~M~~~~~~~G~Q~msiRR 264 (609)
T KOG4227|consen 199 TAEFHPE-----------TP---ALILVNSETGGPNVFDRRMQARPVYQRSMFKGLPQENTEWMGSLWSPSGNQFMSIRR 264 (609)
T ss_pred eeeecCC-----------Cc---eeEEeccccCCCCceeeccccchHHhhhccccCcccchhhhheeeCCCCCeehhhhc
Confidence 1111111 00 0134556778899999975431 1122222 223356789999998876654
Q ss_pred CCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec-----ccc-cccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 390 YGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR-----GIT-SATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 390 dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R-----G~t-~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
|..=-+||+.. +.++.|+- |.- .+.|.+++|--| .-|++||++-.+|||.|..
T Consensus 265 -~~~P~~~D~~S------------------~R~~V~k~D~N~~GY~N~~T~KS~~F~~D-~~v~tGSD~~~i~~WklP~ 323 (609)
T KOG4227|consen 265 -GKCPLYFDFIS------------------QRCFVLKSDHNPNGYCNIKTIKSMTFIDD-YTVATGSDHWGIHIWKLPR 323 (609)
T ss_pred -cCCCEEeeeec------------------ccceeEeccCCCCcceeeeeeeeeeeecc-eeeeccCcccceEEEecCC
Confidence 55455677742 13333332 221 236889999855 5599999999999999964
No 238
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=97.74 E-value=0.00088 Score=80.84 Aligned_cols=102 Identities=17% Similarity=0.088 Sum_probs=72.0
Q ss_pred cccCCCCeEEEEECCCC--------cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccc
Q 001814 343 ADMDNAGIVVVKDFVTR--------AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYD 414 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~--------~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~ 414 (1010)
.-|...|.|..=.-... +.+.++..|.++|.++.|+|=+..+.+++- +..+|||..... .
T Consensus 364 iVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~fls~g-DW~vriWs~~~~------~----- 431 (555)
T KOG1587|consen 364 IVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNFLSVG-DWTVRIWSEDVI------A----- 431 (555)
T ss_pred EEEcCCcEEEEEeccCCcccccccccccccccccCcceEeeecCCCccceeeeec-cceeEeccccCC------C-----
Confidence 34566777766222221 234577789999999999999987766666 455999987521 1
Q ss_pred cCCcceEEEEEecccccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCC
Q 001814 415 WNSSHVHLYKLHRGITSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 415 ~~~s~~~L~~L~RG~t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..++.+.+. ...|.+++|||-- ..+|++..||++.||||...
T Consensus 432 -----~Pl~~~~~~--~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~ 474 (555)
T KOG1587|consen 432 -----SPLLSLDSS--PDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQD 474 (555)
T ss_pred -----Ccchhhhhc--cceeeeeEEcCcCceEEEEEcCCCceehhhhhcc
Confidence 355665543 2359999999975 57778888999999999754
No 239
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.73 E-value=0.031 Score=63.66 Aligned_cols=55 Identities=22% Similarity=0.353 Sum_probs=38.7
Q ss_pred CCCeEEEEECC--CC--cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 347 NAGIVVVKDFV--TR--AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 347 ~dG~V~VwDl~--s~--~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
..+.|.+|++. ++ +.+..+.....--..++|+|+|++|+.+..++..|.+|++.+
T Consensus 265 ~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~ 323 (345)
T PF10282_consen 265 GSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDP 323 (345)
T ss_dssp TTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred cCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeC
Confidence 35667777773 22 234444443344588999999999999998888999999853
No 240
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=97.73 E-value=0.00011 Score=90.81 Aligned_cols=99 Identities=21% Similarity=0.210 Sum_probs=78.6
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
++.+..-|.|.+|+....+.-..+.+|.+.|-++.|+-||+++||+|. ++.||+|++.+. ..
T Consensus 148 i~~gsv~~~iivW~~~~dn~p~~l~GHeG~iF~i~~s~dg~~i~s~Sd-DRsiRlW~i~s~-----------------~~ 209 (967)
T KOG0974|consen 148 IASGSVFGEIIVWKPHEDNKPIRLKGHEGSIFSIVTSLDGRYIASVSD-DRSIRLWPIDSR-----------------EV 209 (967)
T ss_pred EEeccccccEEEEeccccCCcceecccCCceEEEEEccCCcEEEEEec-Ccceeeeecccc-----------------cc
Confidence 345677889999999854444468999999999999999999999999 677999999642 01
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
+--.--| |.|+|+.+.|.|. .|++++.|-|+++|..+
T Consensus 210 ~~~~~fg-HsaRvw~~~~~~n--~i~t~gedctcrvW~~~ 246 (967)
T KOG0974|consen 210 LGCTGFG-HSARVWACCFLPN--RIITVGEDCTCRVWGVN 246 (967)
T ss_pred cCccccc-ccceeEEEEeccc--eeEEeccceEEEEEecc
Confidence 1101124 4589999999998 99999999999999543
No 241
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.72 E-value=0.0012 Score=75.15 Aligned_cols=80 Identities=15% Similarity=0.186 Sum_probs=62.4
Q ss_pred ccCCCCeEEEEECCCCcEEEE-eccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQ-FKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~-~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
.+...|.+..||+..++.+.. |.+-++.|++|...|.+.+||+|+. ++.+||||+.+. +.+
T Consensus 264 ~gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GL-DRyvRIhD~ktr-----------------kll 325 (412)
T KOG3881|consen 264 TGNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGL-DRYVRIHDIKTR-----------------KLL 325 (412)
T ss_pred EecccchhheecccCceeeccccCCccCCcceEEEcCCCceEEeecc-ceeEEEeecccc-----------------hhh
Confidence 456778899999999988766 8889999999999999999999999 678999999642 345
Q ss_pred EEEecccccccEEEEEEccCCC
Q 001814 423 YKLHRGITSATIQDICFSHYSQ 444 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~ 444 (1010)
++.+-+ ..+++|-|.++-.
T Consensus 326 ~kvYvK---s~lt~il~~~~~n 344 (412)
T KOG3881|consen 326 HKVYVK---SRLTFILLRDDVN 344 (412)
T ss_pred hhhhhh---ccccEEEecCCcc
Confidence 544322 3467777776543
No 242
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.71 E-value=0.00039 Score=80.20 Aligned_cols=99 Identities=20% Similarity=0.332 Sum_probs=73.6
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
..+|.|.|.|..+.+.+..|..+..+-..++|+|||++|..++.+| .|.++|+.. + +.+.++
T Consensus 13 ~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg-~vsviD~~~-------~----------~~v~~i 74 (369)
T PF02239_consen 13 RGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDG-TVSVIDLAT-------G----------KVVATI 74 (369)
T ss_dssp GGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTS-EEEEEETTS-------S----------SEEEEE
T ss_pred cCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCC-eEEEEECCc-------c----------cEEEEE
Confidence 4578999999999999999997655555688999999999999876 599999964 2 466777
Q ss_pred ecccccccEEEEEEccCCCEEEEEe-CCCeEEEEeCCCCCC
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVS-SKGTCHVFVLSPFGG 465 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS-~dGTVhIw~I~~~gg 465 (1010)
+-|. .-.++++|+||++|+++. ..+++.|++......
T Consensus 75 ~~G~---~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~ 112 (369)
T PF02239_consen 75 KVGG---NPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEP 112 (369)
T ss_dssp E-SS---EEEEEEE--TTTEEEEEEEETTEEEEEETTT--E
T ss_pred ecCC---CcceEEEcCCCCEEEEEecCCCceeEeccccccc
Confidence 6664 357899999999999876 589999999876543
No 243
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.57 E-value=0.015 Score=73.89 Aligned_cols=102 Identities=13% Similarity=0.108 Sum_probs=63.7
Q ss_pred cCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEECC---CCCEEEEEEc-CCCeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 345 MDNAGIVVVKDFVTRAIISQFK-AHTSPISALCFDP---SGTLLVTASV-YGNNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~-aHtspIsaLaFSP---dGtlLATAS~-dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
|...|.+.+||+.=+..+..+. +|..+|..|+..| .....++++. --+-+-+|++....
T Consensus 1213 Gts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~~~~~~S~~vs~~~~~~nevs~wn~~~g~---------------- 1276 (1431)
T KOG1240|consen 1213 GTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWLCPTYPQESVSVSAGSSSNNEVSTWNMETGL---------------- 1276 (1431)
T ss_pred ecCCceEEEEEeecCceeecccCcccCCcceEEeeccCCCCceEEEecccCCCceeeeecccCc----------------
Confidence 3456789999998877776554 3457888877765 3356666655 23458899986421
Q ss_pred eEEEEEecc---------------cccc--cEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 420 VHLYKLHRG---------------ITSA--TIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 420 ~~L~~L~RG---------------~t~a--~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+-+.|..+ .++. .....++..-+.++.+|+.|+.|+.|+....
T Consensus 1277 -~~~vl~~s~~~p~ls~~~Ps~~~~kp~~~~~~~~~~~~~~~~~ltggsd~kIR~wD~~~p 1336 (1431)
T KOG1240|consen 1277 -RQTVLWASDGAPILSYALPSNDARKPDSLAGISCGVCEKNGFLLTGGSDMKIRKWDPTRP 1336 (1431)
T ss_pred -ceEEEEcCCCCcchhhhcccccCCCCCcccceeeecccCCceeeecCCccceeeccCCCc
Confidence 11122111 0011 1223455556779999999999999998654
No 244
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=97.55 E-value=0.00097 Score=78.02 Aligned_cols=86 Identities=15% Similarity=0.293 Sum_probs=65.7
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.+.|--+.....+-.++||++-|.++.|+|...++|||++| -.++|||.. | +.||.-
T Consensus 165 g~h~~IKpL~~n~k~i~WkAHDGiiL~~~W~~~s~lI~sgGED-~kfKvWD~~--------G----------~~Lf~S-- 223 (737)
T KOG1524|consen 165 GGHISIKPLAANSKIIRWRAHDGLVLSLSWSTQSNIIASGGED-FRFKIWDAQ--------G----------ANLFTS-- 223 (737)
T ss_pred CCeEEEeecccccceeEEeccCcEEEEeecCccccceeecCCc-eeEEeeccc--------C----------cccccC--
Confidence 3456666666666677899999999999999999999999995 559999974 4 456553
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCeEE
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGTCH 456 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGTVh 456 (1010)
..+...|++++|.|| +.++++|.. |.+
T Consensus 224 ~~~ey~ITSva~npd-~~~~v~S~n-t~R 250 (737)
T KOG1524|consen 224 AAEEYAITSVAFNPE-KDYLLWSYN-TAR 250 (737)
T ss_pred Chhccceeeeeeccc-cceeeeeee-eee
Confidence 223446999999999 777777754 454
No 245
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=97.55 E-value=0.00019 Score=83.85 Aligned_cols=83 Identities=19% Similarity=0.271 Sum_probs=70.2
Q ss_pred EEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEcc
Q 001814 362 ISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSH 441 (1010)
Q Consensus 362 v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSp 441 (1010)
-+.|.+|++=|++|.|+.||.+||++|.| +-+.|||... +++++..+.|++ +.|.++.|=|
T Consensus 43 E~eL~GH~GCVN~LeWn~dG~lL~SGSDD-~r~ivWd~~~-----------------~KllhsI~TgHt-aNIFsvKFvP 103 (758)
T KOG1310|consen 43 EAELTGHTGCVNCLEWNADGELLASGSDD-TRLIVWDPFE-----------------YKLLHSISTGHT-ANIFSVKFVP 103 (758)
T ss_pred hhhhccccceecceeecCCCCEEeecCCc-ceEEeecchh-----------------cceeeeeecccc-cceeEEeeec
Confidence 36789999999999999999999999984 5589999742 256777778865 6799999988
Q ss_pred C--CCEEEEEeCCCeEEEEeCCCC
Q 001814 442 Y--SQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 442 D--g~~LAsgS~dGTVhIw~I~~~ 463 (1010)
. .+.|++|..|.-||||+++..
T Consensus 104 ~tnnriv~sgAgDk~i~lfdl~~~ 127 (758)
T KOG1310|consen 104 YTNNRIVLSGAGDKLIKLFDLDSS 127 (758)
T ss_pred cCCCeEEEeccCcceEEEEecccc
Confidence 5 578999999999999999853
No 246
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.54 E-value=0.0062 Score=71.62 Aligned_cols=75 Identities=23% Similarity=0.188 Sum_probs=45.6
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
.....+|||||+.||..+..+...+||.+... ..+ .....+..+ ...+...+|||||++|+..+
T Consensus 282 ~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~~----~~g----------~~~~~lt~~--~~~~~~p~wSPDG~~Laf~~ 345 (428)
T PRK01029 282 TQGNPSFSPDGTRLVFVSNKDGRPRIYIMQID----PEG----------QSPRLLTKK--YRNSSCPAWSPDGKKIAFCS 345 (428)
T ss_pred CcCCeEECCCCCEEEEEECCCCCceEEEEECc----ccc----------cceEEeccC--CCCccceeECCCCCEEEEEE
Confidence 34567999999988887754444567764211 001 012222221 12467789999999999877
Q ss_pred CC-C--eEEEEeCC
Q 001814 451 SK-G--TCHVFVLS 461 (1010)
Q Consensus 451 ~d-G--TVhIw~I~ 461 (1010)
.+ | .|++|++.
T Consensus 346 ~~~g~~~I~v~dl~ 359 (428)
T PRK01029 346 VIKGVRQICVYDLA 359 (428)
T ss_pred cCCCCcEEEEEECC
Confidence 64 3 46666654
No 247
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.53 E-value=0.05 Score=71.00 Aligned_cols=72 Identities=10% Similarity=0.098 Sum_probs=51.3
Q ss_pred EEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe-ccc-----------ccccEEEEEEc
Q 001814 373 SALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH-RGI-----------TSATIQDICFS 440 (1010)
Q Consensus 373 saLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~-RG~-----------t~a~I~sIAFS 440 (1010)
..|+|+++|.++++-+. ++.|++||... + .+..+- .|. .-.....|+++
T Consensus 807 ~Gvavd~dG~LYVADs~-N~rIrviD~~t-------g-----------~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd 867 (1057)
T PLN02919 807 LGVLCAKDGQIYVADSY-NHKIKKLDPAT-------K-----------RVTTLAGTGKAGFKDGKALKAQLSEPAGLALG 867 (1057)
T ss_pred ceeeEeCCCcEEEEECC-CCEEEEEECCC-------C-----------eEEEEeccCCcCCCCCcccccccCCceEEEEe
Confidence 47899999997776655 66799999753 1 111110 110 00146789999
Q ss_pred cCCCEEEEEeCCCeEEEEeCCCC
Q 001814 441 HYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+||+.+++-+.+++|++|++...
T Consensus 868 ~dG~lyVaDt~Nn~Irvid~~~~ 890 (1057)
T PLN02919 868 ENGRLFVADTNNSLIRYLDLNKG 890 (1057)
T ss_pred CCCCEEEEECCCCEEEEEECCCC
Confidence 99999999999999999999764
No 248
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.50 E-value=0.0012 Score=73.78 Aligned_cols=92 Identities=17% Similarity=0.259 Sum_probs=63.1
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC-----------------------------------
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY----------------------------------- 390 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d----------------------------------- 390 (1010)
..+-.|+||.+.+.+.. .++--...+.-++|.|||++.|.++..
T Consensus 111 eF~lriTVWSL~t~~~~-~~~~pK~~~kg~~f~~dg~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPd 189 (447)
T KOG4497|consen 111 EFDLRITVWSLNTQKGY-LLPHPKTNVKGYAFHPDGQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPD 189 (447)
T ss_pred cceeEEEEEEeccceeE-EecccccCceeEEECCCCceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCC
Confidence 45667888888876543 333334456778899999888888763
Q ss_pred CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEE
Q 001814 391 GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVF 458 (1010)
Q Consensus 391 Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw 458 (1010)
|..+-|||.-- -..+|..+||. .|..++|||-+++||+|+.|+.++|-
T Consensus 190 g~~laVwd~~L-----------------eykv~aYe~~l---G~k~v~wsP~~qflavGsyD~~lrvl 237 (447)
T KOG4497|consen 190 GNWLAVWDNVL-----------------EYKVYAYERGL---GLKFVEWSPCNQFLAVGSYDQMLRVL 237 (447)
T ss_pred CcEEEEecchh-----------------hheeeeeeecc---ceeEEEeccccceEEeeccchhhhhh
Confidence 33333443210 02445566764 48899999999999999999999884
No 249
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=97.48 E-value=0.00054 Score=77.20 Aligned_cols=76 Identities=20% Similarity=0.355 Sum_probs=61.6
Q ss_pred ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 366 KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 366 ~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
++| .||++|++++||+.|+|||-++..|+|||... |. ...|.. +|. ..+.-+-||||+.+
T Consensus 193 pgh-~pVtsmqwn~dgt~l~tAS~gsssi~iWdpdt-------g~--------~~pL~~--~gl--gg~slLkwSPdgd~ 252 (445)
T KOG2139|consen 193 PGH-NPVTSMQWNEDGTILVTASFGSSSIMIWDPDT-------GQ--------KIPLIP--KGL--GGFSLLKWSPDGDV 252 (445)
T ss_pred CCC-ceeeEEEEcCCCCEEeecccCcceEEEEcCCC-------CC--------cccccc--cCC--CceeeEEEcCCCCE
Confidence 456 79999999999999999999999999999853 31 133332 332 35788999999999
Q ss_pred EEEEeCCCeEEEEeCC
Q 001814 446 IAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 446 LAsgS~dGTVhIw~I~ 461 (1010)
|.+++-|++.+||..+
T Consensus 253 lfaAt~davfrlw~e~ 268 (445)
T KOG2139|consen 253 LFAATCDAVFRLWQEN 268 (445)
T ss_pred EEEecccceeeeehhc
Confidence 9999999999999754
No 250
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.44 E-value=0.011 Score=70.25 Aligned_cols=53 Identities=21% Similarity=0.449 Sum_probs=46.6
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCC-----CCEEEEEEcCCCeEEEEeCC
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPS-----GTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPd-----GtlLATAS~dGt~IrVwdi~ 400 (1010)
.+.|.+||+.+++++.+|.+|.+||++++|--+ |.++.+...-++.|.+|-+.
T Consensus 163 s~~ik~~~~~~kevv~~ftgh~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~ 220 (541)
T KOG4547|consen 163 SRQIKVLDIETKEVVITFTGHGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVE 220 (541)
T ss_pred cceEEEEEccCceEEEEecCCCcceEEEEEEEeccccccceeeeccccccceeEEEEE
Confidence 568999999999999999999999999999887 77777766657789999875
No 251
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=97.43 E-value=0.0084 Score=67.08 Aligned_cols=88 Identities=17% Similarity=0.236 Sum_probs=60.7
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc---eEEEEEecccccccEEEEEEccCC
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH---VHLYKLHRGITSATIQDICFSHYS 443 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~---~~L~~L~RG~t~a~I~sIAFSpDg 443 (1010)
.+.+.|.+|.|.|++-+||++|.|+. .|||..---. ... --...+|.+.. +++.++.+ ....|..+.|||+|
T Consensus 144 PirStv~sldWhpnnVLlaaGs~D~k-~rVfSayIK~-Vde-kpap~pWgsk~PFG~lm~E~~~--~ggwvh~v~fs~sG 218 (361)
T KOG1523|consen 144 PIRSTVTSLDWHPNNVLLAAGSTDGK-CRVFSAYIKG-VDE-KPAPTPWGSKMPFGQLMSEASS--SGGWVHGVLFSPSG 218 (361)
T ss_pred ccccceeeeeccCCcceecccccCcc-eeEEEEeeec-ccc-CCCCCCCccCCcHHHHHHhhcc--CCCceeeeEeCCCC
Confidence 45678999999999999999999776 8999752100 000 00112344432 33444432 34579999999999
Q ss_pred CEEEEEeCCCeEEEEe
Q 001814 444 QWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 444 ~~LAsgS~dGTVhIw~ 459 (1010)
..||-.+.|.++-+=+
T Consensus 219 ~~lawv~Hds~v~~~d 234 (361)
T KOG1523|consen 219 NRLAWVGHDSTVSFVD 234 (361)
T ss_pred CEeeEecCCCceEEee
Confidence 9999999999998744
No 252
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.42 E-value=0.00052 Score=52.67 Aligned_cols=37 Identities=32% Similarity=0.440 Sum_probs=31.5
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
+++++ +|+ ...|++|+|+|++++|++++.|++|+||+
T Consensus 3 ~~~~~-~~h-~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 3 CVRTF-RGH-SSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEE-ESS-SSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EEEEE-cCC-CCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 45566 454 45799999999999999999999999996
No 253
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=97.41 E-value=0.0015 Score=81.24 Aligned_cols=103 Identities=15% Similarity=0.162 Sum_probs=82.7
Q ss_pred cccCCCCeEEEEECCCCcEEE-EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 343 ADMDNAGIVVVKDFVTRAIIS-QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~-~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
++.+.|..+++|++.+.+.+. +.-+|+..|..++|.|+ +|+|+++| -+.|+|+.. | ..
T Consensus 191 ~s~SdDRsiRlW~i~s~~~~~~~~fgHsaRvw~~~~~~n--~i~t~ged-ctcrvW~~~--------~----------~~ 249 (967)
T KOG0974|consen 191 ASVSDDRSIRLWPIDSREVLGCTGFGHSARVWACCFLPN--RIITVGED-CTCRVWGVN--------G----------TQ 249 (967)
T ss_pred EEEecCcceeeeecccccccCcccccccceeEEEEeccc--eeEEeccc-eEEEEEecc--------c----------ce
Confidence 456789999999999988665 77789999999999999 99999995 569999764 3 12
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
|.++ +++-...|+.++-.++.-++.++.+||++++|++...+.+.
T Consensus 250 l~~y-~~h~g~~iw~~~~~~~~~~~vT~g~Ds~lk~~~l~~r~~e~ 294 (967)
T KOG0974|consen 250 LEVY-DEHSGKGIWKIAVPIGVIIKVTGGNDSTLKLWDLNGRGLEG 294 (967)
T ss_pred ehhh-hhhhhcceeEEEEcCCceEEEeeccCcchhhhhhhcccccc
Confidence 2222 33333469999999999999999999999999997665443
No 254
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=97.41 E-value=0.0033 Score=73.22 Aligned_cols=121 Identities=17% Similarity=0.144 Sum_probs=78.8
Q ss_pred CCCCCCCcEEEEEEeeccCCCCCCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCcccc
Q 001814 51 ASEDLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKL 129 (1010)
Q Consensus 51 ~~~~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~s 129 (1010)
..+.|++-|.-+.|++ .+.+|+.|.++- +.|||.....-.....++|...|.-.+|+| |...
T Consensus 137 kL~~H~GcVntV~FN~-------~Gd~l~SgSDD~~vv~WdW~~~~~~l~f~SGH~~NvfQaKFiP----------~s~d 199 (559)
T KOG1334|consen 137 KLNKHKGCVNTVHFNQ-------RGDVLASGSDDLQVVVWDWVSGSPKLSFESGHCNNVFQAKFIP----------FSGD 199 (559)
T ss_pred cccCCCCccceeeecc-------cCceeeccCccceEEeehhhccCcccccccccccchhhhhccC----------CCCC
Confidence 4577899999999986 567999999875 899999765555556677877888888887 4455
Q ss_pred CcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCC-CCeEEEE---EeCCCcEEEEEEcC---
Q 001814 130 HPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQ-SHCYEHV---LRFRSSVCMVRCSP--- 202 (1010)
Q Consensus 130 rpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlk-tge~V~t---L~f~S~V~sVa~S~--- 202 (1010)
+++ +.++ + ++.||+=.+- ++.+..+ -+++++|.-++.-|
T Consensus 200 ~ti-~~~s-------------------~--------------dgqvr~s~i~~t~~~e~t~rl~~h~g~vhklav~p~sp 245 (559)
T KOG1334|consen 200 RTI-VTSS-------------------R--------------DGQVRVSEILETGYVENTKRLAPHEGPVHKLAVEPDSP 245 (559)
T ss_pred cCc-eecc-------------------c--------------cCceeeeeeccccceecceecccccCccceeeecCCCC
Confidence 554 2211 1 1234443332 3333222 24667887777755
Q ss_pred C-eEEEEeCCeEEEEECCCCc
Q 001814 203 R-IVAVGLATQIYCFDALTLE 222 (1010)
Q Consensus 203 r-lLAV~ld~~I~IwD~~Tle 222 (1010)
. ++.+|-+..+.=+|+++..
T Consensus 246 ~~f~S~geD~~v~~~Dlr~~~ 266 (559)
T KOG1334|consen 246 KPFLSCGEDAVVFHIDLRQDV 266 (559)
T ss_pred CcccccccccceeeeeeccCC
Confidence 3 5556666677778877654
No 255
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.36 E-value=0.0032 Score=74.41 Aligned_cols=124 Identities=15% Similarity=0.233 Sum_probs=88.5
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE-
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL- 425 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L- 425 (1010)
....|.=+++..|.-+.-|.-...+|+++..++-..|||++..+|. +-.||-..- +. +..|..-
T Consensus 153 sg~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~-VEfwDpR~k---sr-----------v~~l~~~~ 217 (703)
T KOG2321|consen 153 SGSEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGV-VEFWDPRDK---SR-----------VGTLDAAS 217 (703)
T ss_pred cCcceEEEEccccccccccccccccceeeeecCccceEEecccCce-EEEecchhh---hh-----------heeeeccc
Confidence 3445777888889888888888899999999999999999998775 899996431 00 0111110
Q ss_pred ----eccccc-ccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCCCCCCcccC
Q 001814 426 ----HRGITS-ATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPVLSLPWWCT 492 (1010)
Q Consensus 426 ----~RG~t~-a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv~~lpw~~~ 492 (1010)
+-|... ..|++|+|+.||-.+|+|+..|.|.||+|.... +.-+.-|. .-.|+..|.|..+
T Consensus 218 ~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~-pl~~kdh~------~e~pi~~l~~~~~ 282 (703)
T KOG2321|consen 218 SVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASK-PLLVKDHG------YELPIKKLDWQDT 282 (703)
T ss_pred ccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCC-ceeecccC------Cccceeeeccccc
Confidence 012222 259999999999999999999999999997643 33334553 4567777777443
No 256
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.30 E-value=0.0032 Score=79.64 Aligned_cols=92 Identities=13% Similarity=0.315 Sum_probs=69.9
Q ss_pred CcEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEE
Q 001814 359 RAIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDI 437 (1010)
Q Consensus 359 ~~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sI 437 (1010)
|..+++|.-|...|..++.++.. .+++|||.||+ ||||+...... . .| .....+.|.. ....+..+
T Consensus 1038 G~lVAhL~Ehs~~v~k~a~s~~~~s~FvsgS~DGt-VKvW~~~k~~~-~-~~------s~rS~ltys~----~~sr~~~v 1104 (1431)
T KOG1240|consen 1038 GILVAHLHEHSSAVIKLAVSSEHTSLFVSGSDDGT-VKVWNLRKLEG-E-GG------SARSELTYSP----EGSRVEKV 1104 (1431)
T ss_pred ceEeehhhhccccccceeecCCCCceEEEecCCce-EEEeeehhhhc-C-cc------eeeeeEEEec----cCCceEEE
Confidence 67889999999999998887655 89999999886 99999854210 0 01 0111233332 23468899
Q ss_pred EEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 438 CFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 438 AFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+-+.+..+|+++.||.|++++|+.+
T Consensus 1105 t~~~~~~~~Av~t~DG~v~~~~id~~ 1130 (1431)
T KOG1240|consen 1105 TMCGNGDQFAVSTKDGSVRVLRIDHY 1130 (1431)
T ss_pred EeccCCCeEEEEcCCCeEEEEEcccc
Confidence 99999999999999999999999986
No 257
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=97.29 E-value=0.26 Score=63.69 Aligned_cols=98 Identities=14% Similarity=0.223 Sum_probs=64.7
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
-..++||+-. |.+-.+-..-.+-=.+|+|-|+|.++|++-..+ +.|..|.= + |- .+--+.|
T Consensus 236 ~R~iRVy~Re-G~L~stSE~v~gLe~~l~WrPsG~lIA~~q~~~~~~~VvFfEr--N------GL--------rhgeF~l 298 (928)
T PF04762_consen 236 RRVIRVYSRE-GELQSTSEPVDGLEGALSWRPSGNLIASSQRLPDRHDVVFFER--N------GL--------RHGEFTL 298 (928)
T ss_pred eeEEEEECCC-ceEEeccccCCCccCCccCCCCCCEEEEEEEcCCCcEEEEEec--C------Cc--------EeeeEec
Confidence 4689999976 544333332222234689999999999987632 34555552 1 31 0122455
Q ss_pred ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.......|..|+|++||..||+.-.|. |++|....|
T Consensus 299 ~~~~~~~~v~~l~Wn~ds~iLAv~~~~~-vqLWt~~NY 335 (928)
T PF04762_consen 299 RFDPEEEKVIELAWNSDSEILAVWLEDR-VQLWTRSNY 335 (928)
T ss_pred CCCCCCceeeEEEECCCCCEEEEEecCC-ceEEEeeCC
Confidence 4323345799999999999999988554 999998876
No 258
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=97.19 E-value=0.0062 Score=68.06 Aligned_cols=128 Identities=18% Similarity=0.192 Sum_probs=92.1
Q ss_pred ccccCCCCeEEEEECCCCc---EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 342 GADMDNAGIVVVKDFVTRA---IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~---~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
++.+-+...|.||...... ..++|.-|...|+.+.++|.+..|+|++.| +.-.||...+ | ..|.+.
T Consensus 25 iAv~~~~~evhiy~~~~~~~w~~~htls~Hd~~vtgvdWap~snrIvtcs~d-rnayVw~~~~-------~---~~Wkpt 93 (361)
T KOG1523|consen 25 IAVSPNNHEVHIYSMLGADLWEPAHTLSEHDKIVTGVDWAPKSNRIVTCSHD-RNAYVWTQPS-------G---GTWKPT 93 (361)
T ss_pred EEeccCCceEEEEEecCCCCceeceehhhhCcceeEEeecCCCCceeEccCC-CCccccccCC-------C---Ceeccc
Confidence 3445567789999887654 678999999999999999999999999995 4589998732 2 236554
Q ss_pred ceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccccccccCCCCCCccCCCCCCCcc
Q 001814 419 HVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSGFQTLSSQGGDPYLFPVLSLPWW 490 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~~~~H~s~~~~~~~~pv~~lpw~ 490 (1010)
.+.| ++. ....++.|+|.+..+|+||.-..|-||-.+.. .+. .=+.|...|..+-+++|.|-
T Consensus 94 lvLl-RiN-----rAAt~V~WsP~enkFAVgSgar~isVcy~E~E---NdW-WVsKhikkPirStv~sldWh 155 (361)
T KOG1523|consen 94 LVLL-RIN-----RAATCVKWSPKENKFAVGSGARLISVCYYEQE---NDW-WVSKHIKKPIRSTVTSLDWH 155 (361)
T ss_pred eeEE-Eec-----cceeeEeecCcCceEEeccCccEEEEEEEecc---cce-ehhhhhCCccccceeeeecc
Confidence 4332 232 23688999999999999999999999987642 211 00112345666777777773
No 259
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.14 E-value=0.15 Score=58.03 Aligned_cols=30 Identities=27% Similarity=0.491 Sum_probs=27.0
Q ss_pred EEEEECCCCCEEEEEEcCCCeEEEEeCCCC
Q 001814 373 SALCFDPSGTLLVTASVYGNNINIFRIMPS 402 (1010)
Q Consensus 373 saLaFSPdGtlLATAS~dGt~IrVwdi~p~ 402 (1010)
....|+|+|++|+.|..++..|.||.+.+.
T Consensus 294 R~F~i~~~g~~Liaa~q~sd~i~vf~~d~~ 323 (346)
T COG2706 294 RDFNINPSGRFLIAANQKSDNITVFERDKE 323 (346)
T ss_pred ccceeCCCCCEEEEEccCCCcEEEEEEcCC
Confidence 567899999999999999999999999753
No 260
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.12 E-value=0.0027 Score=71.06 Aligned_cols=88 Identities=17% Similarity=0.291 Sum_probs=69.2
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
-.+++|.+|++...+--..|..-..++++++|||||+.|.+.|+-+-.|.||.+.+. ..+-+
T Consensus 68 yk~~~vqvwsl~Qpew~ckIdeg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~------------------~~~~~ 129 (447)
T KOG4497|consen 68 YKDPKVQVWSLVQPEWYCKIDEGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQ------------------KGYLL 129 (447)
T ss_pred eccceEEEEEeecceeEEEeccCCCcceeeeECCCcceEeeeecceeEEEEEEeccc------------------eeEEe
Confidence 468899999999998888999889999999999999766666665667999998641 12222
Q ss_pred ecccccccEEEEEEccCCCEEEEEeCCC
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVSSKG 453 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS~dG 453 (1010)
.. ..+.+..++|.|||++.|..+.+.
T Consensus 130 ~~--pK~~~kg~~f~~dg~f~ai~sRrD 155 (447)
T KOG4497|consen 130 PH--PKTNVKGYAFHPDGQFCAILSRRD 155 (447)
T ss_pred cc--cccCceeEEECCCCceeeeeeccc
Confidence 11 223478999999999999999874
No 261
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.11 E-value=0.027 Score=67.08 Aligned_cols=99 Identities=20% Similarity=0.347 Sum_probs=78.4
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
+++.|++|..|+...+..+..+++-+..+++++.+|||..|++||. .|++|++.+ + +.+.
T Consensus 119 S~~ad~~v~~~~~~~~~~~~~~~~~~~~~~sl~is~D~~~l~~as~---~ik~~~~~~-------k----------evv~ 178 (541)
T KOG4547|consen 119 SVGADLKVVYILEKEKVIIRIWKEQKPLVSSLCISPDGKILLTASR---QIKVLDIET-------K----------EVVI 178 (541)
T ss_pred ecCCceeEEEEecccceeeeeeccCCCccceEEEcCCCCEEEeccc---eEEEEEccC-------c----------eEEE
Confidence 5678999999999999999999999999999999999999999984 599999974 2 3444
Q ss_pred EEecccccccEEEEEEccC-----CCEEEEEeC-CCeEEEEeCCCCC
Q 001814 424 KLHRGITSATIQDICFSHY-----SQWIAIVSS-KGTCHVFVLSPFG 464 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpD-----g~~LAsgS~-dGTVhIw~I~~~g 464 (1010)
+| .| |...|.+++|--+ |+++.++-. ..-+-+|.+....
T Consensus 179 ~f-tg-h~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~~~~ 223 (541)
T KOG4547|consen 179 TF-TG-HGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVEKED 223 (541)
T ss_pred Ee-cC-CCcceEEEEEEEeccccccceeeeccccccceeEEEEEccc
Confidence 44 56 4568999999888 666655433 3347788876543
No 262
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.10 E-value=0.11 Score=61.77 Aligned_cols=99 Identities=13% Similarity=0.244 Sum_probs=72.1
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEE-EcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTA-SVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATA-S~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.++.+.++....++..+. -.+||-+++|+|+|+-++.+ +---..+-||++. | ..++.|-.
T Consensus 251 q~Lyll~t~g~s~~V~L~-k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr--------~----------~~v~df~e 311 (566)
T KOG2315|consen 251 QTLYLLATQGESVSVPLL-KEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLR--------G----------KPVFDFPE 311 (566)
T ss_pred ceEEEEEecCceEEEecC-CCCCceEEEECCCCCEEEEEEecccceEEEEcCC--------C----------CEeEeCCC
Confidence 357777777444444443 35899999999999877654 4434568899884 4 47888866
Q ss_pred ccccccEEEEEEccCCCEEEEEeC---CCeEEEEeCCCCCCccccc
Q 001814 428 GITSATIQDICFSHYSQWIAIVSS---KGTCHVFVLSPFGGDSGFQ 470 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~---dGTVhIw~I~~~gg~~~~~ 470 (1010)
|.. .++-|+|.|++|+.++. .|.+-|||+..++....+.
T Consensus 312 gpR----N~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n~K~i~~~~ 353 (566)
T KOG2315|consen 312 GPR----NTAFFNPHGNIILLAGFGNLPGDMEVWDVPNRKLIAKFK 353 (566)
T ss_pred CCc----cceEECCCCCEEEEeecCCCCCceEEEeccchhhccccc
Confidence 654 45789999999999876 4789999999877655443
No 263
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=97.09 E-value=0.01 Score=66.95 Aligned_cols=219 Identities=16% Similarity=0.243 Sum_probs=131.6
Q ss_pred CCEEEEE-eCCCCeEEEEEeC--CCcEEEEEEcC--CeEEEEeCC-eEEEEECC-CCceeEEEeecCCccccCCCccccc
Q 001814 172 PTAVRFY-SFQSHCYEHVLRF--RSSVCMVRCSP--RIVAVGLAT-QIYCFDAL-TLENKFSVLTYPVPQLAGQGAVGIN 244 (1010)
Q Consensus 172 p~tVrIW-Dlktge~V~tL~f--~S~V~sVa~S~--rlLAV~ld~-~I~IwD~~-Tle~l~tL~t~p~p~~~~~g~~~vn 244 (1010)
+++|||| ...++++-..+.+ ++++.++..+. ++|+|+++. .+.=|-+. +.+....++.++.- ++.
T Consensus 45 drtvrv~lkrds~q~wpsI~~~mP~~~~~~~y~~e~~~L~vg~~ngtvtefs~sedfnkm~~~r~~~~h----~~~---- 116 (404)
T KOG1409|consen 45 DRTVRVWLKRDSGQYWPSIYHYMPSPCSAMEYVSESRRLYVGQDNGTVTEFALSEDFNKMTFLKDYLAH----QAR---- 116 (404)
T ss_pred cceeeeEEeccccccCchhhhhCCCCceEeeeeccceEEEEEEecceEEEEEhhhhhhhcchhhhhhhh----hcc----
Confidence 6899999 4567888777754 57888888876 688888864 67666543 22222222222210 000
Q ss_pred cCccceEEccceEEEccC-CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCC
Q 001814 245 VGYGPMAVGPRWLAYASN-TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPD 323 (1010)
Q Consensus 245 v~~gplAlgpRwLAyas~-~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~ 323 (1010)
+..-.+++.+.|+-..+. .-..|+.-.. | +.|.+|.-.-.+
T Consensus 117 v~~~if~~~~e~V~s~~~dk~~~~hc~e~------------------~-------------------~~lg~Y~~~~~~- 158 (404)
T KOG1409|consen 117 VSAIVFSLTHEWVLSTGKDKQFAWHCTES------------------G-------------------NRLGGYNFETPA- 158 (404)
T ss_pred eeeEEecCCceeEEEeccccceEEEeecc------------------C-------------------CcccceEeeccC-
Confidence 011224555677766653 2344542100 0 111222111000
Q ss_pred CCCCCccCCCccccccccccccCCCCeEEEEEC--CCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 324 GSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDF--VTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 324 gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl--~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
++..++-.. .-.++..|.|.+-.+ ....++.++.+|..+|.+++|+|...+|.+++. ++.+-+|||-
T Consensus 159 ---t~~~~d~~~------~fvGd~~gqvt~lr~~~~~~~~i~~~~~h~~~~~~l~Wd~~~~~LfSg~~-d~~vi~wdig- 227 (404)
T KOG1409|consen 159 ---SALQFDALY------AFVGDHSGQITMLKLEQNGCQLITTFNGHTGEVTCLKWDPGQRLLFSGAS-DHSVIMWDIG- 227 (404)
T ss_pred ---CCCceeeEE------EEecccccceEEEEEeecCCceEEEEcCcccceEEEEEcCCCcEEEeccc-cCceEEEecc-
Confidence 000000000 013455666655444 445688999999999999999999999999998 5668899993
Q ss_pred CcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 402 SCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 402 ~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
|. .-..|++.+ ++..|+.++--+--+.|.+++.||-|-+|+++-.-
T Consensus 228 -------g~--------~g~~~el~g--h~~kV~~l~~~~~t~~l~S~~edg~i~~w~mn~~r 273 (404)
T KOG1409|consen 228 -------GR--------KGTAYELQG--HNDKVQALSYAQHTRQLISCGEDGGIVVWNMNVKR 273 (404)
T ss_pred -------CC--------cceeeeecc--chhhhhhhhhhhhheeeeeccCCCeEEEEecccee
Confidence 21 024566654 34568888888888999999999999999997543
No 264
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=97.07 E-value=0.043 Score=65.13 Aligned_cols=293 Identities=13% Similarity=0.154 Sum_probs=154.6
Q ss_pred CCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 72 SVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 72 ~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
+|.+++|++=-..||++|--++-+.++ ..-.-.|.++.|+|+- .+|+..+....
T Consensus 219 SP~GTYL~t~Hk~GI~lWGG~~f~r~~---RF~Hp~Vq~idfSP~E-------------kYLVT~s~~p~---------- 272 (698)
T KOG2314|consen 219 SPKGTYLVTFHKQGIALWGGESFDRIQ---RFYHPGVQFIDFSPNE-------------KYLVTYSPEPI---------- 272 (698)
T ss_pred cCCceEEEEEeccceeeecCccHHHHH---hccCCCceeeecCCcc-------------ceEEEecCCcc----------
Confidence 467889988888999999865433332 2223569999988864 35554332110
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-CC-----cEEEEEEcCCeEEEEeCCeEEEEECCCCceeE
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-RS-----SVCMVRCSPRIVAVGLATQIYCFDALTLENKF 225 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~S-----~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~ 225 (1010)
.+..+.. -+..++|||+.+|...+.+.. ++ ++....++.+++|-...+.|.||+...+.++-
T Consensus 273 ---~~~~~d~---------e~~~l~IWDI~tG~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~~sisIyEtpsf~lld 340 (698)
T KOG2314|consen 273 ---IVEEDDN---------EGQQLIIWDIATGLLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTGNSISIYETPSFMLLD 340 (698)
T ss_pred ---ccCcccC---------CCceEEEEEccccchhcceeccCCCccccceEEeccCCceeEEeccceEEEEecCceeeec
Confidence 1111111 136899999999998888764 22 44555566689998778899999977654321
Q ss_pred EEeecCCccccCCCccccccCccceEEcc--ceEEEccC-CeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhh
Q 001814 226 SVLTYPVPQLAGQGAVGINVGYGPMAVGP--RWLAYASN-TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHS 302 (1010)
Q Consensus 226 tL~t~p~p~~~~~g~~~vnv~~gplAlgp--RwLAyas~-~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dss 302 (1010)
.-+.. +. +..-+.++| ..|||=.. .-.+. .++. +.--|....+-. ...
T Consensus 341 ---~Kslk---------i~-gIr~FswsP~~~llAYwtpe~~~~p--arvt----------L~evPs~~~iRt----~nl 391 (698)
T KOG2314|consen 341 ---KKSLK---------IS-GIRDFSWSPTSNLLAYWTPETNNIP--ARVT----------LMEVPSKREIRT----KNL 391 (698)
T ss_pred ---ccccC---------Cc-cccCcccCCCcceEEEEcccccCCc--ceEE----------EEecCccceeee----ccc
Confidence 11110 00 233466777 67887431 10000 0000 000011111100 000
Q ss_pred hhhhcccceeeccccccccCCCCCCCccCCCcccccccccc-ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCC
Q 001814 303 KQFAAGLSKTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGA-DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSG 381 (1010)
Q Consensus 303 k~la~Gi~ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~ia-sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdG 381 (1010)
-+++..- | |++. +.--+. .++.+.+-. ..+.--.+.|+.+..+.+...-..-..+|-+.+|.|.|
T Consensus 392 fnVsDck---L--hWQk-----~gdyLc----vkvdR~tK~~~~g~f~n~eIfrireKdIpve~velke~vi~FaWEP~g 457 (698)
T KOG2314|consen 392 FNVSDCK---L--HWQK-----SGDYLC----VKVDRHTKSKVKGQFSNLEIFRIREKDIPVEVVELKESVIAFAWEPHG 457 (698)
T ss_pred eeeeccE---E--Eecc-----CCcEEE----EEEEeeccccccceEeeEEEEEeeccCCCceeeecchheeeeeeccCC
Confidence 0000000 0 0000 000000 000000000 01122246677776665433333445789999999999
Q ss_pred CEEEEEEcC--CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeC---CCeEE
Q 001814 382 TLLVTASVY--GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSS---KGTCH 456 (1010)
Q Consensus 382 tlLATAS~d--Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~---dGTVh 456 (1010)
..+|+-+.. -.++.+|.+... + ....++.+|.. + .-..+.|||-|+|++++.. .|++.
T Consensus 458 dkF~vi~g~~~k~tvsfY~~e~~------~-------~~~~lVk~~dk--~--~~N~vfwsPkG~fvvva~l~s~~g~l~ 520 (698)
T KOG2314|consen 458 DKFAVISGNTVKNTVSFYAVETN------I-------KKPSLVKELDK--K--FANTVFWSPKGRFVVVAALVSRRGDLE 520 (698)
T ss_pred CeEEEEEccccccceeEEEeecC------C-------Cchhhhhhhcc--c--ccceEEEcCCCcEEEEEEecccccceE
Confidence 999887643 345788887632 1 11234455533 1 2467899999999998654 57888
Q ss_pred EEeCCC
Q 001814 457 VFVLSP 462 (1010)
Q Consensus 457 Iw~I~~ 462 (1010)
.++.+-
T Consensus 521 F~D~~~ 526 (698)
T KOG2314|consen 521 FYDTDY 526 (698)
T ss_pred EEecch
Confidence 888774
No 265
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=96.99 E-value=0.11 Score=59.05 Aligned_cols=78 Identities=19% Similarity=0.272 Sum_probs=54.1
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe---c--------ccccccEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH---R--------GITSATIQDI 437 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~---R--------G~t~a~I~sI 437 (1010)
-+.|+.+.|+++|++++|-+- -+++|||+.-.. + ....|..+ | ...-..=..+
T Consensus 272 IsSISDvKFs~sGryilsRDy--ltvk~wD~nme~-----~---------pv~t~~vh~~lr~kLc~lYEnD~IfdKFec 335 (433)
T KOG1354|consen 272 ISSISDVKFSHSGRYILSRDY--LTVKLWDLNMEA-----K---------PVETYPVHEYLRSKLCSLYENDAIFDKFEC 335 (433)
T ss_pred hhhhhceEEccCCcEEEEecc--ceeEEEeccccC-----C---------cceEEeehHhHHHHHHHHhhccchhheeEE
Confidence 357899999999999999886 459999993210 1 01112211 1 1000112468
Q ss_pred EEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 438 CFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 438 AFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+||-|+.++++||-..-+|||++..
T Consensus 336 ~~sg~~~~v~TGsy~n~frvf~~~~ 360 (433)
T KOG1354|consen 336 SWSGNDSYVMTGSYNNVFRVFNLAR 360 (433)
T ss_pred EEcCCcceEecccccceEEEecCCC
Confidence 9999999999999999999999654
No 266
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.95 E-value=0.054 Score=63.75 Aligned_cols=98 Identities=18% Similarity=0.159 Sum_probs=57.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
+..|.++|+.+++. ..+..+........|+|||+.|+-.+..+..-+||.+... +|. .+.+. + .
T Consensus 256 ~~~Iy~~dl~~g~~-~~LT~~~~~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~-----~g~--------~~rlt-~-~ 319 (419)
T PRK04043 256 QPDIYLYDTNTKTL-TQITNYPGIDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLN-----SGS--------VEQVV-F-H 319 (419)
T ss_pred CcEEEEEECCCCcE-EEcccCCCccCccEECCCCCEEEEEECCCCCceEEEEECC-----CCC--------eEeCc-c-C
Confidence 45678888877653 3443333222345799999988777755444556654321 121 11211 1 2
Q ss_pred ccccccEEEEEEccCCCEEEEEeCC-------CeEEEEeCCCCCCc
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSK-------GTCHVFVLSPFGGD 466 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~d-------GTVhIw~I~~~gg~ 466 (1010)
|. ...+|||||++||..+.. ++.+||-++..++.
T Consensus 320 g~-----~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~ 360 (419)
T PRK04043 320 GK-----NNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDY 360 (419)
T ss_pred CC-----cCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCC
Confidence 32 124899999999998875 33567766655554
No 267
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=96.85 E-value=0.0035 Score=81.64 Aligned_cols=88 Identities=20% Similarity=0.335 Sum_probs=66.8
Q ss_pred CCCeEEEEECCCC---cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 347 NAGIVVVKDFVTR---AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 347 ~dG~V~VwDl~s~---~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
.++.|.+||..-. ..+. .+|.+.++++++-|.-++|.||+.+|. +.|||+... ++.+
T Consensus 2313 d~~n~~lwDtl~~~~~s~v~--~~H~~gaT~l~~~P~~qllisggr~G~-v~l~D~rqr-----------------ql~h 2372 (2439)
T KOG1064|consen 2313 DNRNVCLWDTLLPPMNSLVH--TCHDGGATVLAYAPKHQLLISGGRKGE-VCLFDIRQR-----------------QLRH 2372 (2439)
T ss_pred CCCcccchhcccCcccceee--eecCCCceEEEEcCcceEEEecCCcCc-EEEeehHHH-----------------HHHH
Confidence 4678899997532 2444 899999999999999999999999987 899999521 1222
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
.+ .+ +. .-.++..++.+|+++||+++.++-.
T Consensus 2373 ~~-----~~------~~-~~~~f~~~ss~g~ikIw~~s~~~ll 2403 (2439)
T KOG1064|consen 2373 TF-----QA------LD-TREYFVTGSSEGNIKIWRLSEFGLL 2403 (2439)
T ss_pred Hh-----hh------hh-hhheeeccCcccceEEEEccccchh
Confidence 21 11 22 4568899999999999999987543
No 268
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=96.85 E-value=0.014 Score=67.15 Aligned_cols=98 Identities=8% Similarity=0.006 Sum_probs=74.8
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEc---------CCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASV---------YGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~---------dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
.|.|.|.|..+++.+.++..-..|-- + +||||+.|..|.. +...|.|||+.+ +
T Consensus 26 ~~~v~ViD~~~~~v~g~i~~G~~P~~-~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t-------~--------- 87 (352)
T TIGR02658 26 TTQVYTIDGEAGRVLGMTDGGFLPNP-V-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQT-------H--------- 87 (352)
T ss_pred CceEEEEECCCCEEEEEEEccCCCce-e-ECCCCCEEEEEeccccccccCCCCCEEEEEECcc-------C---------
Confidence 38999999999999999987665554 4 9999999999988 777899999964 2
Q ss_pred ceEEEEEecccc-----cccEEEEEEccCCCEEEEEeC--CCeEEEEeCCCCC
Q 001814 419 HVHLYKLHRGIT-----SATIQDICFSHYSQWIAIVSS--KGTCHVFVLSPFG 464 (1010)
Q Consensus 419 ~~~L~~L~RG~t-----~a~I~sIAFSpDg~~LAsgS~--dGTVhIw~I~~~g 464 (1010)
+.+.++.-|.. ...-..++|||||++|.+... +..|.|.++...+
T Consensus 88 -~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~k 139 (352)
T TIGR02658 88 -LPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKA 139 (352)
T ss_pred -cEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCc
Confidence 34444442211 112357899999999998873 6789999988654
No 269
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=96.83 E-value=0.11 Score=61.47 Aligned_cols=284 Identities=18% Similarity=0.232 Sum_probs=150.5
Q ss_pred CCCCeEEEEEecCcEEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 72 SVFKQVLLLGYQNGFQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 72 ~~~~~vLalGy~~G~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
++.++.|+.....|+++|+-+..+ .++.....-|+.+-|+|+. .+|..+....- +.
T Consensus 41 SP~G~~l~~~~~~~V~~~~g~~~~---~l~~~~~~~V~~~~fSP~~-------------kYL~tw~~~pi------~~-- 96 (561)
T COG5354 41 SPLGTYLFSEHAAGVECWGGPSKA---KLVRFRHPDVKYLDFSPNE-------------KYLVTWSREPI------IE-- 96 (561)
T ss_pred cCcchheehhhccceEEccccchh---heeeeecCCceecccCccc-------------ceeeeeccCCc------cC--
Confidence 357889999999999999985533 4555556679998888765 45555432211 10
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC-c-----EEEEEEcCCeEEEEeCCeEEEEECCCCceeE
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS-S-----VCMVRCSPRIVAVGLATQIYCFDALTLENKF 225 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S-~-----V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~ 225 (1010)
-..+..+ ..+.+.+.+||..+|..+..+...+ + +....++.+++|=.....++|+++ |...
T Consensus 97 -------pe~e~sp---~~~~n~~~vwd~~sg~iv~sf~~~~q~~~~Wp~~k~s~~D~y~ARvv~~sl~i~e~-t~n~-- 163 (561)
T COG5354 97 -------PEIEISP---FTSKNNVFVWDIASGMIVFSFNGISQPYLGWPVLKFSIDDKYVARVVGSSLYIHEI-TDNI-- 163 (561)
T ss_pred -------hhhccCC---ccccCceeEEeccCceeEeeccccCCcccccceeeeeecchhhhhhccCeEEEEec-CCcc--
Confidence 0001100 1124689999999999999887653 2 555566666665445567999997 3211
Q ss_pred EEeecCCccccCCCccccccCccceEEcc----ceEEEcc-C------CeeeccCCccCCCcCCCCCCCCCcCCCCCceE
Q 001814 226 SVLTYPVPQLAGQGAVGINVGYGPMAVGP----RWLAYAS-N------TLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLV 294 (1010)
Q Consensus 226 tL~t~p~p~~~~~g~~~vnv~~gplAlgp----RwLAyas-~------~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslV 294 (1010)
..+|-..+..+ +..-++++| .-|||=. . .+.+|.. |.+..++
T Consensus 164 --~~~p~~~lr~~-------gi~dFsisP~~n~~~la~~tPEk~~kpa~~~i~sI------------------p~~s~l~ 216 (561)
T COG5354 164 --EEHPFKNLRPV-------GILDFSISPEGNHDELAYWTPEKLNKPAMVRILSI------------------PKNSVLV 216 (561)
T ss_pred --ccCchhhcccc-------ceeeEEecCCCCCceEEEEccccCCCCcEEEEEEc------------------cCCCeee
Confidence 01111000001 223355555 1234321 1 1222321 1122222
Q ss_pred EEeehhhhhhhhcccc---eeeccccccccCCCCCCCccCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCC
Q 001814 295 ARYAMEHSKQFAAGLS---KTLSKYCQELLPDGSSSPVSPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSP 371 (1010)
Q Consensus 295 a~~A~dssk~la~Gi~---ktls~y~~~l~p~gs~s~~S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtsp 371 (1010)
.+--.+ ++++. +.+..|.--+.- -..+.+. +-=....+.|+++....+.... .-.+|
T Consensus 217 tk~lfk-----~~~~qLkW~~~g~~ll~l~~----------t~~ksnK----syfgesnLyl~~~~e~~i~V~~-~~~~p 276 (561)
T COG5354 217 TKNLFK-----VSGVQLKWQVLGKYLLVLVM----------THTKSNK----SYFGESNLYLLRITERSIPVEK-DLKDP 276 (561)
T ss_pred eeeeEe-----ecccEEEEecCCceEEEEEE----------Eeeeccc----ceeccceEEEEeecccccceec-ccccc
Confidence 111000 01110 111111100000 0000000 0001346888888755543332 55789
Q ss_pred eEEEEECCCCCEEEEEE-cCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 372 ISALCFDPSGTLLVTAS-VYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS-~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
|-..+|+|.+..+|+.+ .....+.+|++. | ...+.+-.+.. ..+.|||.++|+..++
T Consensus 277 Vhdf~W~p~S~~F~vi~g~~pa~~s~~~lr--------~----------Nl~~~~Pe~~r----NT~~fsp~~r~il~ag 334 (561)
T COG5354 277 VHDFTWEPLSSRFAVISGYMPASVSVFDLR--------G----------NLRFYFPEQKR----NTIFFSPHERYILFAG 334 (561)
T ss_pred ceeeeecccCCceeEEecccccceeecccc--------c----------ceEEecCCccc----ccccccCcccEEEEec
Confidence 99999999999999988 556678899985 2 13333322222 3367888888888877
Q ss_pred CC---CeEEEEeCC
Q 001814 451 SK---GTCHVFVLS 461 (1010)
Q Consensus 451 ~d---GTVhIw~I~ 461 (1010)
.+ |.+-||+..
T Consensus 335 F~nl~gni~i~~~~ 348 (561)
T COG5354 335 FDNLQGNIEIFDPA 348 (561)
T ss_pred CCccccceEEeccC
Confidence 66 457787753
No 270
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=96.81 E-value=0.0032 Score=71.85 Aligned_cols=99 Identities=12% Similarity=0.097 Sum_probs=73.2
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
....+.+|....+ ....+.+|-+-|..|+|+||++++.||+.|++ |||=.. |. . .-+..|-
T Consensus 130 D~~~~di~s~~~~-~~~~~lGhvSml~dVavS~D~~~IitaDRDEk-IRvs~y-pa-------~---------f~Iesfc 190 (390)
T KOG3914|consen 130 DVYSFDILSADSG-RCEPILGHVSMLLDVAVSPDDQFIITADRDEK-IRVSRY-PA-------T---------FVIESFC 190 (390)
T ss_pred Cceeeeeeccccc-CcchhhhhhhhhheeeecCCCCEEEEecCCce-EEEEec-Cc-------c---------cchhhhc
Confidence 3455556665543 33456789999999999999999999999776 898776 21 0 1233444
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
-||+ .-|..|+.- |++.|+++|.|+|+++|++...+..
T Consensus 191 lGH~-eFVS~isl~-~~~~LlS~sGD~tlr~Wd~~sgk~L 228 (390)
T KOG3914|consen 191 LGHK-EFVSTISLT-DNYLLLSGSGDKTLRLWDITSGKLL 228 (390)
T ss_pred cccH-hheeeeeec-cCceeeecCCCCcEEEEecccCCcc
Confidence 5755 458888887 5677999999999999999877654
No 271
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=96.80 E-value=0.0033 Score=44.33 Aligned_cols=38 Identities=32% Similarity=0.708 Sum_probs=33.1
Q ss_pred cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEe
Q 001814 360 AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFR 398 (1010)
Q Consensus 360 ~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwd 398 (1010)
+.+..+.+|..+|.+|+|+|++.++++++.+|. +++|+
T Consensus 3 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~d~~-~~~~~ 40 (40)
T smart00320 3 ELLKTLKGHTGPVTSVAFSPDGKYLASASDDGT-IKLWD 40 (40)
T ss_pred EEEEEEEecCCceeEEEECCCCCEEEEecCCCe-EEEcC
Confidence 456778899999999999999999999999765 89985
No 272
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=96.60 E-value=0.013 Score=69.90 Aligned_cols=91 Identities=15% Similarity=0.139 Sum_probs=68.0
Q ss_pred EEEEECCCCc--EEEEe-ccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 351 VVVKDFVTRA--IISQF-KAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 351 V~VwDl~s~~--~v~~~-~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
-.+|++..++ .++.. -.+.++|.+.++||+.+.|+.|..||. |++||... + +-.+-+
T Consensus 238 ~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~DgS-iiLyD~~~-------~------------~t~~~k 297 (545)
T PF11768_consen 238 SCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDGS-IILYDTTR-------G------------VTLLAK 297 (545)
T ss_pred EEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecCCe-EEEEEcCC-------C------------eeeeee
Confidence 3467776543 22222 257889999999999999999999886 89999853 1 111111
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.......++|.|||..+++|+.+|.+.+||+...
T Consensus 298 --a~~~P~~iaWHp~gai~~V~s~qGelQ~FD~ALs 331 (545)
T PF11768_consen 298 --AEFIPTLIAWHPDGAIFVVGSEQGELQCFDMALS 331 (545)
T ss_pred --ecccceEEEEcCCCcEEEEEcCCceEEEEEeecC
Confidence 1235788999999999999999999999998753
No 273
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=96.54 E-value=0.31 Score=60.13 Aligned_cols=124 Identities=12% Similarity=0.115 Sum_probs=84.5
Q ss_pred CCCCCCcEEEEEEeeccCCC---C-C-CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCC
Q 001814 52 SEDLKDQVTWAGFDRLEYGP---S-V-FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEG 125 (1010)
Q Consensus 52 ~~~~kd~V~wa~Fd~le~~~---~-~-~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~ 125 (1010)
.|.|+..|.-+++-.....- + + -..+|+++.-+| |.|||.- .+.++.-++.+..+|-.+.+.|+-
T Consensus 51 ie~h~s~V~~VrWap~~~p~~llS~~~~~lliAsaD~~GrIil~d~~-~~s~~~~l~~~~~~~qdl~W~~~r-------- 121 (1062)
T KOG1912|consen 51 IELHQSAVTSVRWAPAPSPRDLLSPSSSQLLIASADISGRIILVDFV-LASVINWLSHSNDSVQDLCWVPAR-------- 121 (1062)
T ss_pred cccCccceeEEEeccCCCchhccCccccceeEEeccccCcEEEEEeh-hhhhhhhhcCCCcchhheeeeecc--------
Confidence 56677777777766543321 1 1 234566666667 9999994 566777788888999999998763
Q ss_pred ccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCCcEEE-EEEcC--
Q 001814 126 FRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRSSVCM-VRCSP-- 202 (1010)
Q Consensus 126 F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S~V~s-Va~S~-- 202 (1010)
..+|.+|++.. .+.+|.+|+..+|+.+-...+...|++ +++.|
T Consensus 122 -d~Srd~LlaIh---------------------------------~ss~lvLwntdtG~k~Wk~~ys~~iLs~f~~DPfd 167 (1062)
T KOG1912|consen 122 -DDSRDVLLAIH---------------------------------GSSTLVLWNTDTGEKFWKYDYSHEILSCFRVDPFD 167 (1062)
T ss_pred -CcchheeEEec---------------------------------CCcEEEEEEccCCceeeccccCCcceeeeeeCCCC
Confidence 12455666432 147899999999998887777766654 88877
Q ss_pred --CeEEEEeCCeEEEEEC
Q 001814 203 --RIVAVGLATQIYCFDA 218 (1010)
Q Consensus 203 --rlLAV~ld~~I~IwD~ 218 (1010)
.+.+.++.+.+.+-+.
T Consensus 168 ~rh~~~l~s~g~vl~~~~ 185 (1062)
T KOG1912|consen 168 SRHFCVLGSKGFVLSCKD 185 (1062)
T ss_pred cceEEEEccCceEEEEec
Confidence 3555566776666543
No 274
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=96.29 E-value=1 Score=51.25 Aligned_cols=104 Identities=21% Similarity=0.239 Sum_probs=62.7
Q ss_pred CCeEEEEECCCCcEEEE--ec--cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 348 AGIVVVKDFVTRAIISQ--FK--AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~--~~--aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
+-.+...|..+++++.+ +. -|...|.-|+++++|+.+.-.=.+|- -++..|....-..|. . ..+.
T Consensus 137 ~psL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~~~G~V~~a~Q~qg~---~~~~~PLva~~~~g~-------~-~~~~ 205 (305)
T PF07433_consen 137 QPSLVYLDARSGALLEQVELPPDLHQLSIRHLAVDGDGTVAFAMQYQGD---PGDAPPLVALHRRGG-------A-LRLL 205 (305)
T ss_pred CCceEEEecCCCceeeeeecCccccccceeeEEecCCCcEEEEEecCCC---CCccCCeEEEEcCCC-------c-ceec
Confidence 34566778888988876 53 38889999999999987765544443 112111000000010 0 0011
Q ss_pred EE----ecccccccEEEEEEccCCCEEEEEeCCC-eEEEEeCCCC
Q 001814 424 KL----HRGITSATIQDICFSHYSQWIAIVSSKG-TCHVFVLSPF 463 (1010)
Q Consensus 424 ~L----~RG~t~a~I~sIAFSpDg~~LAsgS~dG-TVhIw~I~~~ 463 (1010)
.. .+.+ ..-|-+|+|++|+.++|++|-+| .+.||+..+.
T Consensus 206 ~~p~~~~~~l-~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg 249 (305)
T PF07433_consen 206 PAPEEQWRRL-NGYIGSIAADRDGRLIAVTSPRGGRVAVWDAATG 249 (305)
T ss_pred cCChHHHHhh-CCceEEEEEeCCCCEEEEECCCCCEEEEEECCCC
Confidence 10 0111 12488999999999998888775 6999987653
No 275
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=96.19 E-value=2.1 Score=45.00 Aligned_cols=94 Identities=18% Similarity=0.170 Sum_probs=61.0
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCC----------e-EEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcc
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSP----------I-SALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtsp----------I-saLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~ 413 (1010)
+..+|.|..+|+.+|+.+.....+..+ + ..+.++ +| .+..++.+|..+.+ |+.. |
T Consensus 128 ~~~~g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~v~~~~~~g~~~~~-d~~t-------g---- 193 (238)
T PF13360_consen 128 GTSSGKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVIS-DG-RVYVSSGDGRVVAV-DLAT-------G---- 193 (238)
T ss_dssp EETCSEEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECC-TT-EEEEECCTSSEEEE-ETTT-------T----
T ss_pred EeccCcEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEE-CC-EEEEEcCCCeEEEE-ECCC-------C----
Confidence 345899999999999998888765433 1 334444 55 55555555776776 8753 4
Q ss_pred ccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 414 DWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 414 ~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..+.+.. . ..+.. .+..++..|++++.+++++.|++...
T Consensus 194 ------~~~w~~~--~--~~~~~-~~~~~~~~l~~~~~~~~l~~~d~~tG 232 (238)
T PF13360_consen 194 ------EKLWSKP--I--SGIYS-LPSVDGGTLYVTSSDGRLYALDLKTG 232 (238)
T ss_dssp ------EEEEEEC--S--S-ECE-CEECCCTEEEEEETTTEEEEEETTTT
T ss_pred ------CEEEEec--C--CCccC-CceeeCCEEEEEeCCCEEEEEECCCC
Confidence 3333221 1 11222 25678889999989999999998753
No 276
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=96.15 E-value=1.9 Score=51.06 Aligned_cols=119 Identities=14% Similarity=0.193 Sum_probs=79.7
Q ss_pred CCCCcEEEEEEeeccCCCCCCCeEEEEEecCc--EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCc
Q 001814 54 DLKDQVTWAGFDRLEYGPSVFKQVLLLGYQNG--FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHP 131 (1010)
Q Consensus 54 ~~kd~V~wa~Fd~le~~~~~~~~vLalGy~~G--~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srp 131 (1010)
+++.+|.+.+|.. +..-++.|..+| +-|+|.+ .+++ +.+...-|.|..|++.|++.
T Consensus 357 ~~~~~VrY~r~~~-------~~e~~vigt~dgD~l~iyd~~-~~e~-kr~e~~lg~I~av~vs~dGK------------- 414 (668)
T COG4946 357 GKKGGVRYRRIQV-------DPEGDVIGTNDGDKLGIYDKD-GGEV-KRIEKDLGNIEAVKVSPDGK------------- 414 (668)
T ss_pred CCCCceEEEEEcc-------CCcceEEeccCCceEEEEecC-CceE-EEeeCCccceEEEEEcCCCc-------------
Confidence 5777888888864 334677787777 8999984 4544 45555568899999998872
Q ss_pred EEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeC-C-CcEEEEEEcC--CeEEE
Q 001814 132 FLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRF-R-SSVCMVRCSP--RIVAV 207 (1010)
Q Consensus 132 LLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f-~-S~V~sVa~S~--rlLAV 207 (1010)
.++ |+.+ .-.+-++|+.+|.. ..+.- + +-|..+.+++ +++|-
T Consensus 415 ~~v-vaNd--------------------------------r~el~vididngnv-~~idkS~~~lItdf~~~~nsr~iAY 460 (668)
T COG4946 415 KVV-VAND--------------------------------RFELWVIDIDNGNV-RLIDKSEYGLITDFDWHPNSRWIAY 460 (668)
T ss_pred EEE-EEcC--------------------------------ceEEEEEEecCCCe-eEecccccceeEEEEEcCCceeEEE
Confidence 223 3321 13466778888874 23322 2 4688888887 78887
Q ss_pred EeCC-----eEEEEECCCCceeEEEee
Q 001814 208 GLAT-----QIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 208 ~ld~-----~I~IwD~~Tle~l~tL~t 229 (1010)
+..+ .|++||+.+.+. +.+.|
T Consensus 461 afP~gy~tq~Iklydm~~~Ki-y~vTT 486 (668)
T COG4946 461 AFPEGYYTQSIKLYDMDGGKI-YDVTT 486 (668)
T ss_pred ecCcceeeeeEEEEecCCCeE-EEecC
Confidence 7643 499999988663 44443
No 277
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=96.15 E-value=0.13 Score=64.08 Aligned_cols=185 Identities=15% Similarity=0.162 Sum_probs=124.5
Q ss_pred CCEEEEEeCCCCeEEEEEeCCC-cEEEEEEcCCeEEEEe-CCeEEEEECCCCceeEEEeecCCccccCCCccccccCccc
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRS-SVCMVRCSPRIVAVGL-ATQIYCFDALTLENKFSVLTYPVPQLAGQGAVGINVGYGP 249 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S-~V~sVa~S~rlLAV~l-d~~I~IwD~~Tle~l~tL~t~p~p~~~~~g~~~vnv~~gp 249 (1010)
...+..+|+++.++.......+ .|.-++-|.+.+.+|. .++|.+-|..+.+.++++.+|... ..-
T Consensus 156 Q~~li~~Dl~~~~e~r~~~v~a~~v~imR~Nnr~lf~G~t~G~V~LrD~~s~~~iht~~aHs~s-------------iSD 222 (1118)
T KOG1275|consen 156 QEKLIHIDLNTEKETRTTNVSASGVTIMRYNNRNLFCGDTRGTVFLRDPNSFETIHTFDAHSGS-------------ISD 222 (1118)
T ss_pred hhheeeeecccceeeeeeeccCCceEEEEecCcEEEeecccceEEeecCCcCceeeeeeccccc-------------eee
Confidence 4678889999999888887765 6888888888887775 478999999999999999988651 122
Q ss_pred eEEccceEEEccCCeeeccCCccCCCcCCCCCCCCCcCCCCCceEEEeehhhhhhhhcccceeeccccccccCCCCCCCc
Q 001814 250 MAVGPRWLAYASNTLLLSNSGRLSPQNLTPSGVSPSTSPGGSSLVARYAMEHSKQFAAGLSKTLSKYCQELLPDGSSSPV 329 (1010)
Q Consensus 250 lAlgpRwLAyas~~~~iwd~G~vs~Q~lt~p~vS~stSP~~gslVa~~A~dssk~la~Gi~ktls~y~~~l~p~gs~s~~ 329 (1010)
|.+..+.|+..|... + .|
T Consensus 223 fDv~GNlLitCG~S~------R-------------------------------------------~~------------- 240 (1118)
T KOG1275|consen 223 FDVQGNLLITCGYSM------R-------------------------------------------RY------------- 240 (1118)
T ss_pred eeccCCeEEEeeccc------c-------------------------------------------cc-------------
Confidence 333334443333110 0 00
Q ss_pred cCCCccccccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCC
Q 001814 330 SPNSVWKVGRHAGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGS 408 (1010)
Q Consensus 330 S~s~~~k~~~~~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~s 408 (1010)
.-..|--|+|||+...+.+.-+.-|..| ..+.|.|.= +.||.+|.-|. +.+-|...-
T Consensus 241 --------------~l~~D~FvkVYDLRmmral~PI~~~~~P-~flrf~Psl~t~~~V~S~sGq-~q~vd~~~l------ 298 (1118)
T KOG1275|consen 241 --------------NLAMDPFVKVYDLRMMRALSPIQFPYGP-QFLRFHPSLTTRLAVTSQSGQ-FQFVDTATL------ 298 (1118)
T ss_pred --------------cccccchhhhhhhhhhhccCCcccccCc-hhhhhcccccceEEEEecccc-eeecccccc------
Confidence 0023667899999988877777777666 668888875 46777777676 566553210
Q ss_pred CCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 409 GNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 409 G~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
++.......+.. ....|..+.||+.++.+|.+-.+|.||+|.=.+
T Consensus 299 ----sNP~~~~~~v~p-----~~s~i~~fDiSsn~~alafgd~~g~v~~wa~~~ 343 (1118)
T KOG1275|consen 299 ----SNPPAGVKMVNP-----NGSGISAFDISSNGDALAFGDHEGHVNLWADRP 343 (1118)
T ss_pred ----CCCccceeEEcc-----CCCcceeEEecCCCceEEEecccCcEeeecCCC
Confidence 011111111111 112488999999999999999999999999443
No 278
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=96.02 E-value=0.34 Score=54.42 Aligned_cols=96 Identities=13% Similarity=0.159 Sum_probs=59.9
Q ss_pred CCeEEEEECCCCc-EEE--EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 348 AGIVVVKDFVTRA-IIS--QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 348 dG~V~VwDl~s~~-~v~--~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
.+.|.||++...+ .+. .+..+ ..|.+|.. .+.+++.++.... +.++..... + ..+..
T Consensus 106 g~~l~v~~l~~~~~l~~~~~~~~~-~~i~sl~~--~~~~I~vgD~~~s-v~~~~~~~~------~----------~~l~~ 165 (321)
T PF03178_consen 106 GNKLYVYDLDNSKTLLKKAFYDSP-FYITSLSV--FKNYILVGDAMKS-VSLLRYDEE------N----------NKLIL 165 (321)
T ss_dssp TTEEEEEEEETTSSEEEEEEE-BS-SSEEEEEE--ETTEEEEEESSSS-EEEEEEETT------T----------E-EEE
T ss_pred cCEEEEEEccCcccchhhheecce-EEEEEEec--cccEEEEEEcccC-EEEEEEEcc------C----------CEEEE
Confidence 3567777777666 332 22222 24444443 4668888887555 666654321 1 34555
Q ss_pred EecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 425 LHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+.|-.....+.++.|-+|++.++++..+|.+++|..++.
T Consensus 166 va~d~~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~~~ 204 (321)
T PF03178_consen 166 VARDYQPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYNPE 204 (321)
T ss_dssp EEEESS-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-SS
T ss_pred EEecCCCccEEEEEEecCCcEEEEEcCCCeEEEEEECCC
Confidence 555545557999999877789999999999999998754
No 279
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=95.94 E-value=0.071 Score=62.39 Aligned_cols=98 Identities=20% Similarity=0.239 Sum_probs=73.2
Q ss_pred ccCCCC-eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 344 DMDNAG-IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 344 sgs~dG-~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
-+..|| .+-|+|..+++. ..+...-+.|-+|..+|||++++.|-.... |-+.|+.. |. .+.+
T Consensus 376 igt~dgD~l~iyd~~~~e~-kr~e~~lg~I~av~vs~dGK~~vvaNdr~e-l~vididn-------gn--------v~~i 438 (668)
T COG4946 376 IGTNDGDKLGIYDKDGGEV-KRIEKDLGNIEAVKVSPDGKKVVVANDRFE-LWVIDIDN-------GN--------VRLI 438 (668)
T ss_pred EeccCCceEEEEecCCceE-EEeeCCccceEEEEEcCCCcEEEEEcCceE-EEEEEecC-------CC--------eeEe
Confidence 356777 799999987764 566777789999999999999988866443 66667743 32 2333
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeCCC----eEEEEeCCC
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSSKG----TCHVFVLSP 462 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dG----TVhIw~I~~ 462 (1010)
-+-+. +-|.+++|+|+++|||-+=-+| .||||++..
T Consensus 439 dkS~~----~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~ 478 (668)
T COG4946 439 DKSEY----GLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDG 478 (668)
T ss_pred ccccc----ceeEEEEEcCCceeEEEecCcceeeeeEEEEecCC
Confidence 33322 4699999999999999987776 589999864
No 280
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=95.86 E-value=0.024 Score=65.17 Aligned_cols=96 Identities=25% Similarity=0.253 Sum_probs=74.6
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc
Q 001814 351 VVVKDFVTRAIISQFKAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI 429 (1010)
Q Consensus 351 V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~ 429 (1010)
|++.+-.+.+.+..+..|..-|..|+|||... ||..||. |.+|+|+|+...+ ....|...
T Consensus 175 v~~l~~~~fkssq~lp~~g~~IrdlafSp~~~GLl~~asl-~nkiki~dlet~~---------------~vssy~a~--- 235 (463)
T KOG1645|consen 175 VQKLESHDFKSSQILPGEGSFIRDLAFSPFNEGLLGLASL-GNKIKIMDLETSC---------------VVSSYIAY--- 235 (463)
T ss_pred eEEeccCCcchhhcccccchhhhhhccCccccceeeeecc-CceEEEEecccce---------------eeeheecc---
Confidence 67777766666667788999999999999998 7888888 8889999997531 12334432
Q ss_pred ccccEEEEEEccCC-CEEEEEeCCCeEEEEeCCCCCCcc
Q 001814 430 TSATIQDICFSHYS-QWIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 430 t~a~I~sIAFSpDg-~~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
..||+++|.-|. .+|.+|-.+|.|.|||+....++.
T Consensus 236 --~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~~~~~~ 272 (463)
T KOG1645|consen 236 --NQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQPEGPL 272 (463)
T ss_pred --CCceeeeeccCCcceeEEeccCceEEEEEccCCCchH
Confidence 459999999876 567778889999999998776653
No 281
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=95.85 E-value=4.6 Score=46.16 Aligned_cols=58 Identities=14% Similarity=-0.022 Sum_probs=40.9
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcEEEE-EE-cCCeEEEEeCCeEEEEECCCCceeEEEee
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSVCMV-RC-SPRIVAVGLATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V~sV-a~-S~rlLAV~ld~~I~IwD~~Tle~l~tL~t 229 (1010)
++.|.-+|.++|+.+-...+...+.+. .. ...+++...++.|++||+.+++.+++...
T Consensus 114 ~g~l~ald~~tG~~~W~~~~~~~~~~~p~v~~~~v~v~~~~g~l~a~d~~tG~~~W~~~~ 173 (377)
T TIGR03300 114 KGEVIALDAEDGKELWRAKLSSEVLSPPLVANGLVVVRTNDGRLTALDAATGERLWTYSR 173 (377)
T ss_pred CCEEEEEECCCCcEeeeeccCceeecCCEEECCEEEEECCCCeEEEEEcCCCceeeEEcc
Confidence 367999999999988877766554321 12 23444445677899999999998877654
No 282
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=95.76 E-value=0.15 Score=63.73 Aligned_cols=52 Identities=12% Similarity=0.028 Sum_probs=42.2
Q ss_pred CEEEEEeCCCCeEEEEEeCCC-cEEEEEEcCCeEEEEe-CC---------eEEEEECCCCcee
Q 001814 173 TAVRFYSFQSHCYEHVLRFRS-SVCMVRCSPRIVAVGL-AT---------QIYCFDALTLENK 224 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S-~V~sVa~S~rlLAV~l-d~---------~I~IwD~~Tle~l 224 (1010)
++|-+-|+++.+.+|++..|+ .|.++....++|+++. .. -|+|||++.++.+
T Consensus 197 G~V~LrD~~s~~~iht~~aHs~siSDfDv~GNlLitCG~S~R~~~l~~D~FvkVYDLRmmral 259 (1118)
T KOG1275|consen 197 GTVFLRDPNSFETIHTFDAHSGSISDFDVQGNLLITCGYSMRRYNLAMDPFVKVYDLRMMRAL 259 (1118)
T ss_pred ceEEeecCCcCceeeeeeccccceeeeeccCCeEEEeecccccccccccchhhhhhhhhhhcc
Confidence 689999999999999998775 8899999998777543 32 2889999887743
No 283
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=95.75 E-value=0.03 Score=65.53 Aligned_cols=56 Identities=16% Similarity=0.268 Sum_probs=50.1
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
.+|+..|.|-|||-.+++++.-+.|-..-|+||.=.|---+|||++. ++.|+||.-
T Consensus 410 vSGSDCGhIFiW~K~t~eii~~MegDr~VVNCLEpHP~~PvLAsSGi-d~DVKIWTP 465 (559)
T KOG1334|consen 410 VSGSDCGHIFIWDKKTGEIIRFMEGDRHVVNCLEPHPHLPVLASSGI-DHDVKIWTP 465 (559)
T ss_pred EecCccceEEEEecchhHHHHHhhcccceEeccCCCCCCchhhccCC-ccceeeecC
Confidence 46788999999999999999888887779999999999999999999 577999975
No 284
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=95.74 E-value=2 Score=48.03 Aligned_cols=31 Identities=23% Similarity=0.522 Sum_probs=28.2
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
-..-|-.|+.||||++||+....|. |-+|++
T Consensus 228 ~~d~i~kmSlSPdg~~La~ih~sG~-lsLW~i 258 (282)
T PF15492_consen 228 EQDGIFKMSLSPDGSLLACIHFSGS-LSLWEI 258 (282)
T ss_pred CCCceEEEEECCCCCEEEEEEcCCe-EEEEec
Confidence 4568999999999999999999886 999999
No 285
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=95.71 E-value=0.18 Score=62.08 Aligned_cols=101 Identities=24% Similarity=0.298 Sum_probs=75.5
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC------CCe---EEEEeCCCCcccCCCCCCcc
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY------GNN---INIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d------Gt~---IrVwdi~p~~~~~~sG~~~~ 413 (1010)
+-|.+.|+|.|+|+.++.+.+.|..|++.|..|.|--. +.|+|.+.. |.+ +.|=|+. +|
T Consensus 441 AvGT~sGTV~vvdvst~~v~~~fsvht~~VkgleW~g~-sslvSfsys~~n~~sg~vrN~l~vtdLr-------tG---- 508 (1062)
T KOG1912|consen 441 AVGTNSGTVDVVDVSTNAVAASFSVHTSLVKGLEWLGN-SSLVSFSYSHVNSASGGVRNDLVVTDLR-------TG---- 508 (1062)
T ss_pred EeecCCceEEEEEecchhhhhhhcccccceeeeeeccc-eeEEEeeeccccccccceeeeEEEEEcc-------cc----
Confidence 45778999999999999999999999999999999755 456666541 111 2233332 13
Q ss_pred ccCCcceEEEEEe--cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 414 DWNSSHVHLYKLH--RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 414 ~~~~s~~~L~~L~--RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
.-..|| ++.....|..|--|.-++|||+.=.|.-+.||++..
T Consensus 509 -------lsk~fR~l~~~despI~~irvS~~~~yLai~Fr~~plEiwd~kt 552 (1062)
T KOG1912|consen 509 -------LSKRFRGLQKPDESPIRAIRVSSSGRYLAILFRREPLEIWDLKT 552 (1062)
T ss_pred -------cccccccCCCCCcCcceeeeecccCceEEEEecccchHHHhhcc
Confidence 112233 566667899999999999999999999999999854
No 286
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=95.63 E-value=5.6 Score=45.51 Aligned_cols=92 Identities=15% Similarity=0.156 Sum_probs=56.6
Q ss_pred cCCCCeEEEEECCCCcEEEEecc-CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKA-HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~a-HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
+..+|.|..+|..+++.+..... .......... .|.+|++++.+|. |.+|+..+ | +.+.
T Consensus 285 ~~~~G~l~~~d~~tG~~~W~~~~~~~~~~ssp~i--~g~~l~~~~~~G~-l~~~d~~t-------G----------~~~~ 344 (377)
T TIGR03300 285 TDADGVVVALDRRSGSELWKNDELKYRQLTAPAV--VGGYLVVGDFEGY-LHWLSRED-------G----------SFVA 344 (377)
T ss_pred ECCCCeEEEEECCCCcEEEccccccCCccccCEE--ECCEEEEEeCCCE-EEEEECCC-------C----------CEEE
Confidence 45789999999999988766532 1112222222 3668888888775 88998753 4 4555
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
++.-+.. ....+.++. |+ .|.+++.||+++.|.
T Consensus 345 ~~~~~~~-~~~~sp~~~-~~-~l~v~~~dG~l~~~~ 377 (377)
T TIGR03300 345 RLKTDGS-GIASPPVVV-GD-GLLVQTRDGDLYAFR 377 (377)
T ss_pred EEEcCCC-ccccCCEEE-CC-EEEEEeCCceEEEeC
Confidence 5543211 112223333 33 588999999998773
No 287
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=95.50 E-value=0.77 Score=51.80 Aligned_cols=81 Identities=15% Similarity=0.204 Sum_probs=55.2
Q ss_pred eccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE------ec----ccccccE
Q 001814 365 FKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL------HR----GITSATI 434 (1010)
Q Consensus 365 ~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L------~R----G~t~a~I 434 (1010)
|..-.+.|+.+.|+++|+++++-+- -+++|||+.-. - .++.+. +. -..+.-|
T Consensus 276 f~eivsSISD~kFs~ngryIlsRdy--ltvkiwDvnm~------k----------~pikTi~~h~~l~~~l~d~YEnDai 337 (460)
T COG5170 276 FEEIVSSISDFKFSDNGRYILSRDY--LTVKIWDVNMA------K----------NPIKTIPMHCDLMDELNDVYENDAI 337 (460)
T ss_pred HHHHhhhhcceEEcCCCcEEEEecc--ceEEEEecccc------c----------CCceeechHHHHHHHHHhhhhccce
Confidence 3444678999999999999998876 46999998531 0 111111 00 0001112
Q ss_pred ---EEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 435 ---QDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 435 ---~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..+.||-|++.+.+||-....-||.....
T Consensus 338 fdkFeisfSgd~~~v~sgsy~NNfgiyp~~ss 369 (460)
T COG5170 338 FDKFEISFSGDDKHVLSGSYSNNFGIYPTDSS 369 (460)
T ss_pred eeeEEEEecCCcccccccccccceeeeccccC
Confidence 45899999999999999999888886543
No 288
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=95.49 E-value=0.91 Score=54.02 Aligned_cols=78 Identities=19% Similarity=0.342 Sum_probs=53.5
Q ss_pred CCCeEEEEECCCCcEEE-EeccCCCCeEEEEECCCCCEEEEEEc--C---CCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 347 NAGIVVVKDFVTRAIIS-QFKAHTSPISALCFDPSGTLLVTASV--Y---GNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~-~~~aHtspIsaLaFSPdGtlLATAS~--d---Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
..|.|.|||...+-.+. .|.+-. .+...|+|||.++-++-. + +..|+||++. | .
T Consensus 338 l~gni~i~~~~~rf~~~~~~~~~n--~s~~~wspd~qF~~~~~ts~k~~~Dn~i~l~~v~--------g----------~ 397 (561)
T COG5354 338 LQGNIEIFDPAGRFKVAGAFNGLN--TSYCDWSPDGQFYDTDTTSEKLRVDNSIKLWDVY--------G----------A 397 (561)
T ss_pred cccceEEeccCCceEEEEEeecCC--ceEeeccCCceEEEecCCCcccccCcceEEEEec--------C----------c
Confidence 35678888888765444 676643 345679999998877643 2 5569999984 3 1
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCC
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKG 453 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dG 453 (1010)
..++ .+.+.|.|-+++..+.|...
T Consensus 398 ~~fe---------l~~~~W~p~~~~~ttsSs~~ 421 (561)
T COG5354 398 KVFE---------LTNITWDPSGQYVTTSSSCP 421 (561)
T ss_pred hhhh---------hhhccccCCcccceeeccCC
Confidence 2222 34477889899998888776
No 289
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=95.16 E-value=2.6 Score=55.55 Aligned_cols=86 Identities=14% Similarity=0.113 Sum_probs=52.1
Q ss_pred eEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccC-CCCCCccccCCcceEEEEEec--cc----ccccEEEEEEccCCC
Q 001814 372 ISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRS-GSGNHKYDWNSSHVHLYKLHR--GI----TSATIQDICFSHYSQ 444 (1010)
Q Consensus 372 IsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~-~sG~~~~~~~~s~~~L~~L~R--G~----t~a~I~sIAFSpDg~ 444 (1010)
...|+|+|+|+.|..+....+.|++||+......- ..|.. .....++.+-. |. .-..-..|+|++||+
T Consensus 742 P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~-----~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~ 816 (1057)
T PLN02919 742 PSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDP-----TFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQ 816 (1057)
T ss_pred ccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEeccc-----ccCcccccccCCCCchhhhhccCCceeeEeCCCc
Confidence 35699999999777777767789999985310000 00000 00001111100 00 001246899999999
Q ss_pred EEEEEeCCCeEEEEeCCC
Q 001814 445 WIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 445 ~LAsgS~dGTVhIw~I~~ 462 (1010)
.+++-+.+++|++|+...
T Consensus 817 LYVADs~N~rIrviD~~t 834 (1057)
T PLN02919 817 IYVADSYNHKIKKLDPAT 834 (1057)
T ss_pred EEEEECCCCEEEEEECCC
Confidence 999999999999999764
No 290
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=95.05 E-value=2.1 Score=47.92 Aligned_cols=118 Identities=12% Similarity=0.206 Sum_probs=67.9
Q ss_pred CCCCeEEEEECC-----CCcEEEEec--c-CCCCeEEEEECCCCCEEEEEEcC-CC---------eEEEEeCCCCcc--c
Q 001814 346 DNAGIVVVKDFV-----TRAIISQFK--A-HTSPISALCFDPSGTLLVTASVY-GN---------NINIFRIMPSCM--R 405 (1010)
Q Consensus 346 s~dG~V~VwDl~-----s~~~v~~~~--a-HtspIsaLaFSPdGtlLATAS~d-Gt---------~IrVwdi~p~~~--~ 405 (1010)
...|.++-|-+. ..+.-+.|. . +...|.++.++|.-++|..|+.. .. -+-.|++..... .
T Consensus 116 ~Y~G~L~Sy~vs~gt~q~y~e~hsfsf~~~yp~Gi~~~vy~p~h~LLlVgG~~~~~~~~s~a~~~GLtaWRiL~~~Pyyk 195 (282)
T PF15492_consen 116 NYRGQLRSYLVSVGTNQGYQENHSFSFSSHYPHGINSAVYHPKHRLLLVGGCEQNQDGMSKASSCGLTAWRILSDSPYYK 195 (282)
T ss_pred eccceeeeEEEEcccCCcceeeEEEEecccCCCceeEEEEcCCCCEEEEeccCCCCCccccccccCceEEEEcCCCCcEE
Confidence 456666655542 223333333 3 46799999999998888766542 11 256777753210 0
Q ss_pred --CCCCC----C---ccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 406 --SGSGN----H---KYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 406 --~~sG~----~---~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
...+. . ..-|+-.-..++. +.+.....|..|+.||||+.||+...+|.+-||+|..-.
T Consensus 196 ~v~~~~~~~~~~~~~~~~~~~~~~~~fs-~~~~~~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~ 262 (282)
T PF15492_consen 196 QVTSSEDDITASSKRRGLLRIPSFKFFS-RQGQEQDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLR 262 (282)
T ss_pred EccccCccccccccccceeeccceeeee-ccccCCCceEEEEECCCCCEEEEEEcCCeEEEEecCcch
Confidence 00000 0 0000000001111 223334469999999999999999999999999997643
No 291
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=94.77 E-value=0.11 Score=36.33 Aligned_cols=29 Identities=21% Similarity=0.442 Sum_probs=26.5
Q ss_pred cccEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 431 SATIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 431 ~a~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
...|.+++|.++++++++++.|+++.+|+
T Consensus 12 ~~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 12 TGPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred CCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 35699999999999999999999999995
No 292
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=94.55 E-value=0.2 Score=55.94 Aligned_cols=104 Identities=15% Similarity=0.126 Sum_probs=72.9
Q ss_pred ccCCCCeEEEEECCCCc--EEEEeccCCCCeEEEEECCCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 344 DMDNAGIVVVKDFVTRA--IISQFKAHTSPISALCFDPSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~--~v~~~~aHtspIsaLaFSPdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
....+|.+.+-+..... .++.+++|.-++....|+... .++.|++.||. +.-||+.-. + .
T Consensus 138 vs~s~G~~~~v~~t~~~le~vq~wk~He~E~Wta~f~~~~pnlvytGgDD~~-l~~~D~R~p------~----------~ 200 (339)
T KOG0280|consen 138 VSDSRGSISGVYETEMVLEKVQTWKVHEFEAWTAKFSDKEPNLVYTGGDDGS-LSCWDIRIP------K----------T 200 (339)
T ss_pred EEcCCCcEEEEecceeeeeecccccccceeeeeeecccCCCceEEecCCCce-EEEEEecCC------c----------c
Confidence 44556666655554443 345889999999999998654 68889998775 999999621 1 1
Q ss_pred EEEEEecccccccEEEEEEc-cCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 421 HLYKLHRGITSATIQDICFS-HYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFS-pDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
.++.-.+- +...|.+|.=| |+..+|++|+-|.++++|+...-+.
T Consensus 201 ~i~~n~kv-H~~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm~k 245 (339)
T KOG0280|consen 201 FIWHNSKV-HTSGVVSIYSSPPKPTYIATGSYDECIRVLDTRNMGK 245 (339)
T ss_pred eeeeccee-eecceEEEecCCCCCceEEEeccccceeeeehhcccC
Confidence 33321121 23457778766 4688999999999999999876554
No 293
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=94.00 E-value=0.64 Score=51.62 Aligned_cols=100 Identities=12% Similarity=0.062 Sum_probs=63.9
Q ss_pred CCCCeEEEEECC--CCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 346 DNAGIVVVKDFV--TRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 346 s~dG~V~VwDl~--s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
+.|.++++.++. ..+...+++. -....++.|+|++++++.+. -..|-.|.|... + ..++.
T Consensus 135 sndht~k~~~~~~~s~~~~~h~~~--~~~ns~~~snd~~~~~~Vgd-s~~Vf~y~id~~------s---------ey~~~ 196 (344)
T KOG4532|consen 135 SNDHTGKTMVVSGDSNKFAVHNQN--LTQNSLHYSNDPSWGSSVGD-SRRVFRYAIDDE------S---------EYIEN 196 (344)
T ss_pred cCCcceeEEEEecCcccceeeccc--cceeeeEEcCCCceEEEecC-CCcceEEEeCCc------c---------ceeee
Confidence 445555555554 3333323222 12788999999999998876 344666777432 2 12222
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
.+...+...=.+.+||..+..+|+++.|||+-||++...+
T Consensus 197 -~~~a~t~D~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~~~ 236 (344)
T KOG4532|consen 197 -IYEAPTSDHGFYNSFSENDLQFAVVFQDGTCAIYDVRNMA 236 (344)
T ss_pred -eEecccCCCceeeeeccCcceEEEEecCCcEEEEEecccc
Confidence 2222232334678999999999999999999999998654
No 294
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=93.80 E-value=0.14 Score=60.62 Aligned_cols=93 Identities=16% Similarity=0.195 Sum_probs=59.1
Q ss_pred CCCCeEEEEECCCCcEEEEec-----cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 346 DNAGIVVVKDFVTRAIISQFK-----AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~-----aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
....+|+++|....+-+..++ +...-+.+++.-|.|+.||.|=..|. |-+-|... |..-..|.+
T Consensus 848 saeSTVKl~DaRsce~~~E~kVcna~~Pna~~R~iaVa~~GN~lAa~LSnGc-i~~LDaR~-------G~vINswrp--- 916 (1034)
T KOG4190|consen 848 SAESTVKLFDARSCEWTCELKVCNAPGPNALTRAIAVADKGNKLAAALSNGC-IAILDARN-------GKVINSWRP--- 916 (1034)
T ss_pred cchhhheeeecccccceeeEEeccCCCCchheeEEEeccCcchhhHHhcCCc-EEEEecCC-------CceeccCCc---
Confidence 345578888888776555554 23445789999999999998866576 77777753 421122222
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEE-Ee
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHV-FV 459 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhI-w~ 459 (1010)
+ .+....++ .|..+.||.+..|.++-| |.
T Consensus 917 --------m-ecdllqla-apsdq~L~~saldHslaVnWh 946 (1034)
T KOG4190|consen 917 --------M-ECDLLQLA-APSDQALAQSALDHSLAVNWH 946 (1034)
T ss_pred --------c-cchhhhhc-CchhHHHHhhcccceeEeeeh
Confidence 1 12122222 366788888888888887 65
No 295
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=93.76 E-value=0.057 Score=66.86 Aligned_cols=107 Identities=15% Similarity=0.134 Sum_probs=68.5
Q ss_pred CCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRSH 152 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~~ 152 (1010)
+-+.|++|.-+| ++++++. +|...+-+..|+.+|..|+-.-++ .+++..+.
T Consensus 1112 ~~~hL~vG~~~Geik~~nv~-sG~~e~s~ncH~SavT~vePs~dg-------------s~~Ltsss-------------- 1163 (1516)
T KOG1832|consen 1112 GTNHLAVGSHAGEIKIFNVS-SGSMEESVNCHQSAVTLVEPSVDG-------------STQLTSSS-------------- 1163 (1516)
T ss_pred CCceEEeeeccceEEEEEcc-CccccccccccccccccccccCCc-------------ceeeeecc--------------
Confidence 346889998877 9999995 577788889999999988844332 24343110
Q ss_pred ccccccCCcCCCCCCCCCCCCEEEEEeCC-CCeEEEEEeCCCcEEEEEEcC--C-eEEEEeCCeEEEEECCCCceeEEEe
Q 001814 153 LGGVRDGMMDSQSGNCVNSPTAVRFYSFQ-SHCYEHVLRFRSSVCMVRCSP--R-IVAVGLATQIYCFDALTLENKFSVL 228 (1010)
Q Consensus 153 ~~~vr~gs~d~~~~~~~~sp~tVrIWDlk-tge~V~tL~f~S~V~sVa~S~--r-lLAV~ld~~I~IwD~~Tle~l~tL~ 228 (1010)
| +.--..+|++. ++..+|++.-.. +|.|+. + .++....+...+||+.|...+.++.
T Consensus 1164 --------~---------S~PlsaLW~~~s~~~~~Hsf~ed~---~vkFsn~~q~r~~gt~~d~a~~YDvqT~~~l~tyl 1223 (1516)
T KOG1832|consen 1164 --------S---------SSPLSALWDASSTGGPRHSFDEDK---AVKFSNSLQFRALGTEADDALLYDVQTCSPLQTYL 1223 (1516)
T ss_pred --------c---------cCchHHHhccccccCccccccccc---eeehhhhHHHHHhcccccceEEEecccCcHHHHhc
Confidence 0 01135578886 456666664333 455655 2 3333334678999999977666544
No 296
>KOG4415 consensus Uncharacterized conserved protein [Function unknown]
Probab=93.59 E-value=0.062 Score=56.24 Aligned_cols=38 Identities=24% Similarity=0.568 Sum_probs=34.5
Q ss_pred CCCccccceeEeeeeeEeeccCC-ccccccceeEEEEcCC
Q 001814 682 SVKSYERSHWYLSNAEVQMSSGR-LPIWQSSKISFFKMDS 720 (1010)
Q Consensus 682 ~~~~~e~~~~yls~aEvq~~~~~-~piW~~~~i~F~~m~~ 720 (1010)
+...+|+..| |+.+|+.+|.++ ..|||.|||.|+.+..
T Consensus 20 dh~GdEDeeW-l~hVEi~Th~gPHRriWmGPQFef~eih~ 58 (247)
T KOG4415|consen 20 DHIGDEDEEW-LPHVEIRTHLGPHRRIWMGPQFEFFEIHE 58 (247)
T ss_pred cccCcccccc-ccceEEEeccCccceeeecCceeEEEecC
Confidence 5567999999 999999999998 6999999999998875
No 297
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=93.56 E-value=16 Score=41.21 Aligned_cols=50 Identities=10% Similarity=0.195 Sum_probs=41.1
Q ss_pred CEEEEEeCCCC-------eEEEEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCc
Q 001814 173 TAVRFYSFQSH-------CYEHVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLE 222 (1010)
Q Consensus 173 ~tVrIWDlktg-------e~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle 222 (1010)
+.|.++++... +.++..+++++|++|..-...|+++...+|++|++...+
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g~~l~v~~l~~~~ 118 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVGNKLYVYDLDNSK 118 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEETTEEEEEEEETTS
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeecCEEEEEEccCcc
Confidence 78999999884 345666788999999888877999999999999988766
No 298
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=93.48 E-value=0.17 Score=55.71 Aligned_cols=115 Identities=18% Similarity=0.170 Sum_probs=71.4
Q ss_pred cccCCCCeEEEEECCCCc-EEEEeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCccc-CCCCCCccccCCcc
Q 001814 343 ADMDNAGIVVVKDFVTRA-IISQFKAHTSPISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMR-SGSGNHKYDWNSSH 419 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~-~~sG~~~~~~~~s~ 419 (1010)
..+..+|.|.|||..... .+..+.+|..+|+-+-|.| ++..|.|+|++|.. --||....+-+ +.....-+.|-+.-
T Consensus 196 ~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedGsl-w~wdas~~~l~i~~~~s~~s~WLsgD 274 (319)
T KOG4714|consen 196 CCGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDGSL-WHWDASTTFLSISNQASVISSWLSGD 274 (319)
T ss_pred EEecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCCCchheeEecCCCcE-EEEcCCCceEEecCccccccccccCC
Confidence 467889999999998875 4467889999999999998 45899999998874 45776521100 00001112232211
Q ss_pred --eEEEEEecccccccEEEE-EEccCCCEEEEEeCCCeEEEEe
Q 001814 420 --VHLYKLHRGITSATIQDI-CFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 420 --~~L~~L~RG~t~a~I~sI-AFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
+-+.++ .+.-+....+| +|.--|..|++|++-+-|.+++
T Consensus 275 ~v~s~i~i-~~ll~~~~~SinsfDV~g~~lVcgtd~eaIyl~~ 316 (319)
T KOG4714|consen 275 PVKSRIEI-TSLLPSRSLSINSFDVLGPCLVCGTDAEAIYLTR 316 (319)
T ss_pred cccceEee-eccccccceeeeeeeccCceEEeccccceEEEec
Confidence 111111 11112222222 3666688999999998888764
No 299
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=93.43 E-value=7.5 Score=49.59 Aligned_cols=56 Identities=13% Similarity=0.188 Sum_probs=44.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
+-|+.+|.|++||-...+....|++-..||..|..+.||++|+.... +.+-++++.
T Consensus 592 avgs~~G~IRLyd~~g~~AKT~lp~lG~pI~~iDvt~DGkwilaTc~--tyLlLi~t~ 647 (794)
T PF08553_consen 592 AVGSNKGDIRLYDRLGKRAKTALPGLGDPIIGIDVTADGKWILATCK--TYLLLIDTL 647 (794)
T ss_pred EEEeCCCcEEeecccchhhhhcCCCCCCCeeEEEecCCCcEEEEeec--ceEEEEEEe
Confidence 45678999999997666666778888899999999999997755443 567788763
No 300
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=93.39 E-value=18 Score=41.42 Aligned_cols=73 Identities=18% Similarity=0.279 Sum_probs=50.1
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
-...|-+|+|+++|.++|..|-.|..+-+||..+ | +.+-.. .-..+..++-.+++ |++
T Consensus 215 l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~t-------g----------~~~~~~----~l~D~cGva~~~~~-f~~ 272 (305)
T PF07433_consen 215 LNGYIGSIAADRDGRLIAVTSPRGGRVAVWDAAT-------G----------RLLGSV----PLPDACGVAPTDDG-FLV 272 (305)
T ss_pred hCCceEEEEEeCCCCEEEEECCCCCEEEEEECCC-------C----------CEeecc----ccCceeeeeecCCc-eEE
Confidence 3468999999999999998888899999999864 4 232221 22357788888777 555
Q ss_pred EEeCCCeEEEEeCCCCCC
Q 001814 448 IVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 448 sgS~dGTVhIw~I~~~gg 465 (1010)
+++ .|- ++.+.+.+.
T Consensus 273 ssG-~G~--~~~~~~~~~ 287 (305)
T PF07433_consen 273 SSG-QGQ--LIRLSPDGP 287 (305)
T ss_pred eCC-Ccc--EEEccCccc
Confidence 543 443 556655443
No 301
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=93.33 E-value=0.22 Score=60.90 Aligned_cols=103 Identities=13% Similarity=0.075 Sum_probs=75.6
Q ss_pred ccCCCCeEEEEECCCCcEE-EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 344 DMDNAGIVVVKDFVTRAII-SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v-~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
-|...|.|.+|.-..+... ....+-+.-+..++.|++..+.|.++..| .|-||.+... +. ..+
T Consensus 50 ~GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~~~~vs~~e~lvAagt~~g-~V~v~ql~~~------~p---------~~~ 113 (726)
T KOG3621|consen 50 MGSSAGSVYLYNRHTGEMRKLKNEGATGITCVRSVSSVEYLVAAGTASG-RVSVFQLNKE------LP---------RDL 113 (726)
T ss_pred EecccceEEEEecCchhhhcccccCccceEEEEEecchhHhhhhhcCCc-eEEeehhhcc------CC---------Ccc
Confidence 4667899999998876543 23333456777899999988888777755 5889988531 10 122
Q ss_pred EEEecccc--cccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 423 YKLHRGIT--SATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 423 ~~L~RG~t--~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
..+.+++. ...|++++||+|++.|.+|-+.|+|+.-.++.
T Consensus 114 ~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 114 DYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred eeeccccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence 23334443 56899999999999999999999999988876
No 302
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=93.11 E-value=0.095 Score=62.02 Aligned_cols=89 Identities=24% Similarity=0.346 Sum_probs=63.8
Q ss_pred EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEc
Q 001814 361 IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFS 440 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFS 440 (1010)
.+..|.+|+..|.+++-=.+.+-+++||.| .++++|.+.+..+ +.| ++ .+.+++.. +...|.+|.|=
T Consensus 727 rL~nf~GH~~~iRai~AidNENSFiSASkD-KTVKLWSik~EgD--~~~------ts--aCQfTY~a--Hkk~i~~igfL 793 (1034)
T KOG4190|consen 727 RLCNFTGHQEKIRAIAAIDNENSFISASKD-KTVKLWSIKPEGD--EIG------TS--ACQFTYQA--HKKPIHDIGFL 793 (1034)
T ss_pred eeecccCcHHHhHHHHhcccccceeeccCC-ceEEEEEeccccC--ccc------cc--eeeeEhhh--ccCcccceeee
Confidence 356788999999987665666778899994 5699999988632 112 11 23333322 34579999999
Q ss_pred cCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 441 HYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 441 pDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
.|-+++|++ ||-+|+|+ |+-|.
T Consensus 794 ~~lr~i~Sc--D~giHlWD--PFigr 815 (1034)
T KOG4190|consen 794 ADLRSIASC--DGGIHLWD--PFIGR 815 (1034)
T ss_pred eccceeeec--cCcceeec--ccccc
Confidence 999999877 89999998 55443
No 303
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=93.10 E-value=0.21 Score=66.17 Aligned_cols=121 Identities=13% Similarity=0.189 Sum_probs=88.4
Q ss_pred cccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc-----C---------C
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFK-AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR-----S---------G 407 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~-aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~-----~---------~ 407 (1010)
.+++.||.|++|.-..++.+..++ +-.+.|+.+.|+.+|.....+..||. |-+|++.|-... + +
T Consensus 2224 ltgs~dgsv~~~~w~~~~~v~~~rt~g~s~vtr~~f~~qGnk~~i~d~dg~-l~l~q~~pk~~~s~qchnk~~~Df~Fi~ 2302 (2439)
T KOG1064|consen 2224 LTGSQDGSVRMFEWGHGQQVVCFRTAGNSRVTRSRFNHQGNKFGIVDGDGD-LSLWQASPKPYTSWQCHNKALSDFRFIG 2302 (2439)
T ss_pred EecCCCceEEEEeccCCCeEEEeeccCcchhhhhhhcccCCceeeeccCCc-eeecccCCcceeccccCCccccceeeee
Confidence 478899999999999999887776 34489999999999999999988775 999999763210 0 0
Q ss_pred -----CC-----CCccccCCcc----eEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCccc
Q 001814 408 -----SG-----NHKYDWNSSH----VHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 408 -----sG-----~~~~~~~~s~----~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~~ 468 (1010)
.| .....|++-. ..+.+. |...++++++-|.-+.|.+|+.+|-|.|||+........
T Consensus 2303 s~~~tag~s~d~~n~~lwDtl~~~~~s~v~~~----H~~gaT~l~~~P~~qllisggr~G~v~l~D~rqrql~h~ 2373 (2439)
T KOG1064|consen 2303 SLLATAGRSSDNRNVCLWDTLLPPMNSLVHTC----HDGGATVLAYAPKHQLLISGGRKGEVCLFDIRQRQLRHT 2373 (2439)
T ss_pred hhhhccccCCCCCcccchhcccCcccceeeee----cCCCceEEEEcCcceEEEecCCcCcEEEeehHHHHHHHH
Confidence 01 1112344321 223322 234578999999999999999999999999987655443
No 304
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=92.97 E-value=7.5 Score=43.59 Aligned_cols=60 Identities=18% Similarity=0.087 Sum_probs=41.1
Q ss_pred ccccCCCCeEEEEECCCCcEE-----EEeccCCCCeEEEEECCCCCE-EEEEEcCCCeEEEEeCCC
Q 001814 342 GADMDNAGIVVVKDFVTRAII-----SQFKAHTSPISALCFDPSGTL-LVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v-----~~~~aHtspIsaLaFSPdGtl-LATAS~dGt~IrVwdi~p 401 (1010)
+|.+.+||.+-|||+...... .+-..|.+.+..+.|+|-|-+ |+--|+.=..++|-|+..
T Consensus 218 FAv~~Qdg~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~~hv~D~R~ 283 (344)
T KOG4532|consen 218 FAVVFQDGTCAIYDVRNMATPMAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSRVHVVDTRN 283 (344)
T ss_pred EEEEecCCcEEEEEecccccchhhhcccCCCCCCceEEEEecCCCcceEEEEecCcceEEEEEccc
Confidence 456778999999999875533 223458899999999987742 223444334478888753
No 305
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=92.95 E-value=0.25 Score=60.52 Aligned_cols=57 Identities=19% Similarity=0.149 Sum_probs=42.9
Q ss_pred CEEEEEeCCCC-eEEEEEeCC-CcEEEEEEcC----CeEEEEeCCeEEEEECC--CCceeEEEee
Q 001814 173 TAVRFYSFQSH-CYEHVLRFR-SSVCMVRCSP----RIVAVGLATQIYCFDAL--TLENKFSVLT 229 (1010)
Q Consensus 173 ~tVrIWDlktg-e~V~tL~f~-S~V~sVa~S~----rlLAV~ld~~I~IwD~~--Tle~l~tL~t 229 (1010)
+-|++||++.| ..+++++-+ +.|..++|++ .++..+.+++|+.||-. |-+..+++.+
T Consensus 180 ~~i~vwd~r~gs~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d~tvkfw~y~kSt~e~~~~vtt 244 (1081)
T KOG0309|consen 180 NDIFVWDLRKGSTPLCSLKGHVSSVNSIDFNRFKYSEIMSSSNDGTVKFWDYSKSTTESKRTVTT 244 (1081)
T ss_pred CceEEEeccCCCcceEEecccceeeehHHHhhhhhhhhcccCCCCceeeecccccccccceeccc
Confidence 57999999866 578889876 5789999998 46667788899999865 3344455543
No 306
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=91.68 E-value=1.3 Score=54.63 Aligned_cols=96 Identities=13% Similarity=0.112 Sum_probs=61.0
Q ss_pred CCCeEEEEECCCCc-EEEEeccCCCCeEEEEECCC-CCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 347 NAGIVVVKDFVTRA-IISQFKAHTSPISALCFDPS-GTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSPd-GtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
....|.|||+..+. .+..+++|.+.|..+.|+.- -+.+.+++.||+ ++.|+-... - . ..-+
T Consensus 178 hg~~i~vwd~r~gs~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d~t-vkfw~y~kS------t---~------e~~~- 240 (1081)
T KOG0309|consen 178 HGNDIFVWDLRKGSTPLCSLKGHVSSVNSIDFNRFKYSEIMSSSNDGT-VKFWDYSKS------T---T------ESKR- 240 (1081)
T ss_pred cCCceEEEeccCCCcceEEecccceeeehHHHhhhhhhhhcccCCCCc-eeeeccccc------c---c------ccce-
Confidence 34569999998765 67889999999999999853 356778888665 899997431 0 0 0111
Q ss_pred EecccccccEEEEEEccCCCEEEEEeC--CCeEEEEeCC
Q 001814 425 LHRGITSATIQDICFSHYSQWIAIVSS--KGTCHVFVLS 461 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~LAsgS~--dGTVhIw~I~ 461 (1010)
+-.+...|+--.|-|-|.=.++--. +..|++++-+
T Consensus 241 --~vtt~~piw~~r~~Pfg~g~~~mp~~G~n~v~~~~c~ 277 (1081)
T KOG0309|consen 241 --TVTTNFPIWRGRYLPFGEGYCIMPMVGGNMVPQLRCE 277 (1081)
T ss_pred --eccccCcceeccccccCceeEeccccCCeeeeecccc
Confidence 1123446777777776553333222 2256666544
No 307
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=91.05 E-value=0.51 Score=39.23 Aligned_cols=31 Identities=16% Similarity=0.298 Sum_probs=28.5
Q ss_pred cccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 431 SATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 431 ~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.+.|..++|+|....||.++.+|.|.||.++
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~ 41 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRLN 41 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence 3569999999999999999999999999984
No 308
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=90.92 E-value=0.86 Score=55.42 Aligned_cols=76 Identities=18% Similarity=0.302 Sum_probs=58.7
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe-cccccccE-EEEEEccCCCEEEE
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH-RGITSATI-QDICFSHYSQWIAI 448 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~-RG~t~a~I-~sIAFSpDg~~LAs 448 (1010)
.|-.+-|+|.=.++|++-.+|. +-+.++.- +++..+- +|. .+ .+++|.|||+.||+
T Consensus 22 ~i~~~ewnP~~dLiA~~t~~ge-lli~R~n~------------------qRlwtip~p~~---~v~~sL~W~~DGkllaV 79 (665)
T KOG4640|consen 22 NIKRIEWNPKMDLIATRTEKGE-LLIHRLNW------------------QRLWTIPIPGE---NVTASLCWRPDGKLLAV 79 (665)
T ss_pred ceEEEEEcCccchhheeccCCc-EEEEEecc------------------ceeEeccCCCC---ccceeeeecCCCCEEEE
Confidence 5667889999999999999887 44666531 3555554 442 23 59999999999999
Q ss_pred EeCCCeEEEEeCCCCCCccc
Q 001814 449 VSSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 449 gS~dGTVhIw~I~~~gg~~~ 468 (1010)
|=.||||.|-|++..++...
T Consensus 80 g~kdG~I~L~Dve~~~~l~~ 99 (665)
T KOG4640|consen 80 GFKDGTIRLHDVEKGGRLVS 99 (665)
T ss_pred EecCCeEEEEEccCCCceec
Confidence 99999999999998766543
No 309
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=90.91 E-value=0.73 Score=51.70 Aligned_cols=103 Identities=18% Similarity=0.169 Sum_probs=69.6
Q ss_pred cccCCCCeEEEEECC-CCcEEE-EeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 343 ADMDNAGIVVVKDFV-TRAIIS-QFKAHTSPISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 343 asgs~dG~V~VwDl~-s~~~v~-~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
-+|+.||.+.-||+. .++.+- ..+-|+..|.++.-|| .+++++|+|-|.+ ||+||.... |
T Consensus 182 ytGgDD~~l~~~D~R~p~~~i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~-i~~~DtRnm------~---------- 244 (339)
T KOG0280|consen 182 YTGGDDGSLSCWDIRIPKTFIWHNSKVHTSGVVSIYSSPPKPTYIATGSYDEC-IRVLDTRNM------G---------- 244 (339)
T ss_pred EecCCCceEEEEEecCCcceeeecceeeecceEEEecCCCCCceEEEeccccc-eeeeehhcc------c----------
Confidence 367899999999998 445443 3678999999988775 6899999999655 999998532 2
Q ss_pred eEEEEEecccccccEEEEEEccCC--CEEEEEeCCCeEEEEeCCCCCCc
Q 001814 420 VHLYKLHRGITSATIQDICFSHYS--QWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 420 ~~L~~L~RG~t~a~I~sIAFSpDg--~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
+.|++-. -...||.|.++|-- +.||++=.+| .+|-+++...++
T Consensus 245 kPl~~~~---v~GGVWRi~~~p~~~~~lL~~CMh~G-~ki~~~~~~~~e 289 (339)
T KOG0280|consen 245 KPLFKAK---VGGGVWRIKHHPEIFHRLLAACMHNG-AKILDSSDKVLE 289 (339)
T ss_pred CccccCc---cccceEEEEecchhhhHHHHHHHhcC-ceEEEecccccc
Confidence 3443321 12458888888753 3444444444 466666554433
No 310
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=90.79 E-value=0.88 Score=53.98 Aligned_cols=98 Identities=20% Similarity=0.292 Sum_probs=65.9
Q ss_pred CeEEEEECCCCc--EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 349 GIVVVKDFVTRA--IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~--~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
..+.++|+.+++ .+..+.+|.. +-+|||||++||-++.++....||-+.-. + ..+.+|.
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g~~~---~P~fspDG~~l~f~~~rdg~~~iy~~dl~------~----------~~~~~Lt 278 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNGNNG---APAFSPDGSKLAFSSSRDGSPDIYLMDLD------G----------KNLPRLT 278 (425)
T ss_pred ceEEEEeccCCccceeeccCCccC---CccCCCCCCEEEEEECCCCCccEEEEcCC------C----------Ccceecc
Confidence 568899998865 4556666654 46899999999888776555666654211 2 1233344
Q ss_pred cccccccEEEEEEccCCCEEEEEeCC-CeEEEEeCCCCCCcc
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSK-GTCHVFVLSPFGGDS 467 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~d-GTVhIw~I~~~gg~~ 467 (1010)
.+... -..=+|||||++|+-.|++ |.-.||-++..++.+
T Consensus 279 ~~~gi--~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~ 318 (425)
T COG0823 279 NGFGI--NTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQV 318 (425)
T ss_pred cCCcc--ccCccCCCCCCEEEEEeCCCCCcceEEECCCCCce
Confidence 43322 2366899999999998886 456788888777655
No 311
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=90.78 E-value=67 Score=42.24 Aligned_cols=85 Identities=13% Similarity=0.254 Sum_probs=50.7
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE------ecccccccEEEEEEccC
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL------HRGITSATIQDICFSHY 442 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L------~RG~t~a~I~sIAFSpD 442 (1010)
.++|.+++|++++..+|.-..+|+ |.+|......... +. .......+.. .-......+..++|-.+
T Consensus 426 ~~~v~~vaf~~~~~~~avl~~d~~-l~~~~~~~~~~~~--~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (928)
T PF04762_consen 426 PSPVNDVAFSPSNSRFAVLTSDGS-LSIYEWDLKNMWS--VK-----PPKLLSSISLDSMDISDSELPLGSLRQLAWLND 497 (928)
T ss_pred CCCcEEEEEeCCCCeEEEEECCCC-EEEEEecCCCccc--cc-----CcchhhhcccccccccccccccccEEEEEEeCC
Confidence 479999999999998888888776 7888853210000 00 0000000000 01112335788999988
Q ss_pred CCEEEEEeCC---CeEEEEeCC
Q 001814 443 SQWIAIVSSK---GTCHVFVLS 461 (1010)
Q Consensus 443 g~~LAsgS~d---GTVhIw~I~ 461 (1010)
+..+++...+ ..+.++++.
T Consensus 498 ~~~~~~~~~~~~~~~i~~~~~~ 519 (928)
T PF04762_consen 498 DTLLVLSDSDSNQSKIVLVDID 519 (928)
T ss_pred CEEEEEEecCcccceEEEEEec
Confidence 8888777765 457777764
No 312
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=90.32 E-value=0.71 Score=52.74 Aligned_cols=104 Identities=15% Similarity=0.147 Sum_probs=70.3
Q ss_pred ccCCCCeEEEEECCCC----cEEEEeccCCCCeEEEEECC-CCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 344 DMDNAGIVVVKDFVTR----AIISQFKAHTSPISALCFDP-SGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~----~~v~~~~aHtspIsaLaFSP-dGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
.|...|.|.++|+..+ ...++---|.+.|++|..=. ++.+|...+.+|+ |++||..-... +
T Consensus 269 ~GcRngeI~~iDLR~rnqG~~~~a~rlyh~Ssvtslq~Lq~s~q~LmaS~M~gk-ikLyD~R~~K~----~--------- 334 (425)
T KOG2695|consen 269 NGCRNGEIFVIDLRCRNQGNGWCAQRLYHDSSVTSLQILQFSQQKLMASDMTGK-IKLYDLRATKC----K--------- 334 (425)
T ss_pred ecccCCcEEEEEeeecccCCCcceEEEEcCcchhhhhhhccccceEeeccCcCc-eeEeeehhhhc----c---------
Confidence 4568899999999875 23344456999999987655 7888888888776 99999852100 1
Q ss_pred ceEEEEEecccccc-cEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 419 HVHLYKLHRGITSA-TIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 419 ~~~L~~L~RG~t~a-~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+-+... .||-+. .-.-+-..+....|+++++|--.+||.+...
T Consensus 335 -~~V~qY-eGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~~g 378 (425)
T KOG2695|consen 335 -KSVMQY-EGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLDSG 378 (425)
T ss_pred -cceeee-ecccccccccccccccccceEEEccCeeEEEEEecccC
Confidence 112222 344322 1112334567889999999999999999753
No 313
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=90.30 E-value=0.75 Score=55.16 Aligned_cols=95 Identities=18% Similarity=0.192 Sum_probs=63.1
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEc----------CCCeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASV----------YGNNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~----------dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
-|.+|--.+-..++.|. |. .|..+.|||..++|+|-|. +|+.|+||||.+ |
T Consensus 232 GI~lWGG~~f~r~~RF~-Hp-~Vq~idfSP~EkYLVT~s~~p~~~~~~d~e~~~l~IWDI~t-------G---------- 292 (698)
T KOG2314|consen 232 GIALWGGESFDRIQRFY-HP-GVQFIDFSPNEKYLVTYSPEPIIVEEDDNEGQQLIIWDIAT-------G---------- 292 (698)
T ss_pred ceeeecCccHHHHHhcc-CC-CceeeecCCccceEEEecCCccccCcccCCCceEEEEEccc-------c----------
Confidence 36677655544444443 53 5889999999999999774 478899999974 4
Q ss_pred eEEEEEecccccccEE-EEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 420 VHLYKLHRGITSATIQ-DICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 420 ~~L~~L~RG~t~a~I~-sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
..+..|.-=.+...+| -.-||.|++|+|--..+ +|+||.-..+.
T Consensus 293 ~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~~-sisIyEtpsf~ 337 (698)
T KOG2314|consen 293 LLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTGN-SISIYETPSFM 337 (698)
T ss_pred chhcceeccCCCccccceEEeccCCceeEEeccc-eEEEEecCcee
Confidence 1222221100111223 24799999999998874 79999876643
No 314
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=90.05 E-value=30 Score=37.11 Aligned_cols=99 Identities=16% Similarity=0.157 Sum_probs=60.3
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
|.|..++.. ++... +...-.--+.|+|+|||+.|..+......|..|++... +. .+ ...+.+..+..+
T Consensus 115 g~v~~~~~~-~~~~~-~~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~------~~---~~-~~~~~~~~~~~~ 182 (246)
T PF08450_consen 115 GSVYRIDPD-GKVTV-VADGLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDAD------GG---EL-SNRRVFIDFPGG 182 (246)
T ss_dssp EEEEEEETT-SEEEE-EEEEESSEEEEEEETTSSEEEEEETTTTEEEEEEEETT------TC---CE-EEEEEEEE-SSS
T ss_pred cceEEECCC-CeEEE-EecCcccccceEECCcchheeecccccceeEEEecccc------cc---ce-eeeeeEEEcCCC
Confidence 667777776 44322 22233455789999999988766665666777877421 10 00 001222333322
Q ss_pred cccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
. .....+++..+|+..++.-..+.|++|+-+
T Consensus 183 ~--g~pDG~~vD~~G~l~va~~~~~~I~~~~p~ 213 (246)
T PF08450_consen 183 P--GYPDGLAVDSDGNLWVADWGGGRIVVFDPD 213 (246)
T ss_dssp S--CEEEEEEEBTTS-EEEEEETTTEEEEEETT
T ss_pred C--cCCCcceEcCCCCEEEEEcCCCEEEEECCC
Confidence 1 247889999999988888888888888743
No 315
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=90.01 E-value=27 Score=36.54 Aligned_cols=57 Identities=11% Similarity=-0.030 Sum_probs=41.6
Q ss_pred CEEEEEeCCCCeEEEEE-eCCC------cEEEEEEcCCeEEEEe-CCeEEEEECCCCceeEEEee
Q 001814 173 TAVRFYSFQSHCYEHVL-RFRS------SVCMVRCSPRIVAVGL-ATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL-~f~S------~V~sVa~S~rlLAV~l-d~~I~IwD~~Tle~l~tL~t 229 (1010)
+.|..+|.++|+.+... .... ......+..+.++++. .+.|+++|+.+++.+.....
T Consensus 86 ~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l~~~d~~tG~~~w~~~~ 150 (238)
T PF13360_consen 86 GSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSGKLVALDPKTGKLLWKYPV 150 (238)
T ss_dssp SEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCSEEEEEETTTTEEEEEEES
T ss_pred eeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccCcEEEEecCCCcEEEEeec
Confidence 57999999999998884 4321 1234455566777766 78999999999998887765
No 316
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=89.60 E-value=0.78 Score=52.51 Aligned_cols=103 Identities=15% Similarity=0.235 Sum_probs=59.9
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc--CCCCCCccccCCcceEEEEE
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR--SGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~--~~sG~~~~~~~~s~~~L~~L 425 (1010)
.+.+.|||+.+++....... ...+....|||+|+.||-... +.|.++++...... ...|. ..++
T Consensus 22 ~~~y~i~d~~~~~~~~l~~~-~~~~~~~~~sP~g~~~~~v~~--~nly~~~~~~~~~~~lT~dg~---------~~i~-- 87 (353)
T PF00930_consen 22 KGDYYIYDIETGEITPLTPP-PPKLQDAKWSPDGKYIAFVRD--NNLYLRDLATGQETQLTTDGE---------PGIY-- 87 (353)
T ss_dssp EEEEEEEETTTTEEEESS-E-ETTBSEEEE-SSSTEEEEEET--TEEEEESSTTSEEEESES--T---------TTEE--
T ss_pred ceeEEEEecCCCceEECcCC-ccccccceeecCCCeeEEEec--CceEEEECCCCCeEEeccccc---------eeEE--
Confidence 46799999999765543333 678899999999999999874 66888876421000 00000 0000
Q ss_pred ecccc--------cccEEEEEEccCCCEEEEEeCCCe-EEEEeCCCCCC
Q 001814 426 HRGIT--------SATIQDICFSHYSQWIAIVSSKGT-CHVFVLSPFGG 465 (1010)
Q Consensus 426 ~RG~t--------~a~I~sIAFSpDg~~LAsgS~dGT-VhIw~I~~~gg 465 (1010)
-|.. -..-..+.|||||++||....|.+ |+.+.+..+..
T Consensus 88 -nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~~~~~ 135 (353)
T PF00930_consen 88 -NGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLPDYSP 135 (353)
T ss_dssp -ESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEEEESS
T ss_pred -cCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEeeccCC
Confidence 0110 001134779999999999776644 77776655443
No 317
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=89.30 E-value=0.91 Score=54.84 Aligned_cols=55 Identities=15% Similarity=0.247 Sum_probs=43.8
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
-|..||.|++||...+. +++.-+.-..+.++|+|+|.+++.|+..|. |.+||+.-
T Consensus 276 lGC~DgSiiLyD~~~~~--t~~~ka~~~P~~iaWHp~gai~~V~s~qGe-lQ~FD~AL 330 (545)
T PF11768_consen 276 LGCEDGSIILYDTTRGV--TLLAKAEFIPTLIAWHPDGAIFVVGSEQGE-LQCFDMAL 330 (545)
T ss_pred EEecCCeEEEEEcCCCe--eeeeeecccceEEEEcCCCcEEEEEcCCce-EEEEEeec
Confidence 46789999999997663 233333455678999999999999999886 99999864
No 318
>PRK02888 nitrous-oxide reductase; Validated
Probab=89.28 E-value=3.1 Score=51.51 Aligned_cols=116 Identities=12% Similarity=0.095 Sum_probs=68.6
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEE---cCCCe-----------EEEEeCCC--CcccCC-
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTAS---VYGNN-----------INIFRIMP--SCMRSG- 407 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS---~dGt~-----------IrVwdi~p--~~~~~~- 407 (1010)
....+++.+.|..+.+++.++.--..| .-++|+|||+++.+.+ +.|.. +.+|++.. ....++
T Consensus 211 ~ey~~~vSvID~etmeV~~qV~Vdgnp-d~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vvfni~~iea~vkdGK 289 (635)
T PRK02888 211 KKYRSLFTAVDAETMEVAWQVMVDGNL-DNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVVFNIARIEEAVKAGK 289 (635)
T ss_pred cceeEEEEEEECccceEEEEEEeCCCc-ccceECCCCCEEEEeccCcccCcceeeeccccCceEEEEchHHHHHhhhCCC
Confidence 356788999999998888887654333 5678999999998775 43333 23333320 000000
Q ss_pred ----CCC--CccccCC----cceEEEEEecccccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCCCC
Q 001814 408 ----SGN--HKYDWNS----SHVHLYKLHRGITSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSPFG 464 (1010)
Q Consensus 408 ----sG~--~~~~~~~----s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~~g 464 (1010)
.+. .-.+... ....++.+--| ...+.|++||||+++.+++. +.||.|.++....
T Consensus 290 ~~~V~gn~V~VID~~t~~~~~~~v~~yIPVG---KsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k 354 (635)
T PRK02888 290 FKTIGGSKVPVVDGRKAANAGSALTRYVPVP---KNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLD 354 (635)
T ss_pred EEEECCCEEEEEECCccccCCcceEEEEECC---CCccceEECCCCCEEEEeCCCCCcEEEEEChhhh
Confidence 000 0000000 01122323222 24678999999999988766 8899999998754
No 319
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.96 E-value=1.4 Score=54.91 Aligned_cols=92 Identities=17% Similarity=0.159 Sum_probs=65.8
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
+-|...|.|.+++....- .+...|... +-+|.++||||.||+ +.|..+-+. .-.+.
T Consensus 53 ~~GtH~g~v~~~~~~~~~--~~~~~~s~~------~~~Gey~asCS~DGk-v~I~sl~~~---------------~~~~~ 108 (846)
T KOG2066|consen 53 ALGTHRGAVYLTTCQGNP--KTNFDHSSS------ILEGEYVASCSDDGK-VVIGSLFTD---------------DEITQ 108 (846)
T ss_pred eeccccceEEEEecCCcc--ccccccccc------ccCCceEEEecCCCc-EEEeeccCC---------------cccee
Confidence 456789999999987442 444455443 789999999999987 667766321 11245
Q ss_pred EEEecccccccEEEEEEccC-----CCEEEEEeCCCeEEEEeCCCCC
Q 001814 423 YKLHRGITSATIQDICFSHY-----SQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpD-----g~~LAsgS~dGTVhIw~I~~~g 464 (1010)
+.+.| .|.+|+|+|| ++.+++|+..| +-++.-+-.|
T Consensus 109 ~df~r-----piksial~Pd~~~~~sk~fv~GG~ag-lvL~er~wlg 149 (846)
T KOG2066|consen 109 YDFKR-----PIKSIALHPDFSRQQSKQFVSGGMAG-LVLSERNWLG 149 (846)
T ss_pred EecCC-----cceeEEeccchhhhhhhheeecCcce-EEEehhhhhc
Confidence 56654 5889999999 78999999999 7777654433
No 320
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.55 E-value=19 Score=43.69 Aligned_cols=55 Identities=9% Similarity=0.093 Sum_probs=43.6
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
+.++.+|.|++||-........|++-..+|..+..+.||++|+.... +.+-+-++
T Consensus 445 vvgS~~GdIRLYdri~~~AKTAlPgLG~~I~hVdvtadGKwil~Tc~--tyLlLi~t 499 (644)
T KOG2395|consen 445 VVGSLKGDIRLYDRIGRRAKTALPGLGDAIKHVDVTADGKWILATCK--TYLLLIDT 499 (644)
T ss_pred EEeecCCcEEeehhhhhhhhhcccccCCceeeEEeeccCcEEEEecc--cEEEEEEE
Confidence 45678999999999666667788999999999999999997654443 45667765
No 321
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=88.49 E-value=1.3 Score=53.89 Aligned_cols=58 Identities=17% Similarity=0.314 Sum_probs=50.1
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeE-EEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPIS-ALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIs-aLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
+|.+..+|.|.+.-+. -+.+.+|.-|..++. +|||.|||++||.|=.+|+ |++-|+..
T Consensus 35 iA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kdG~-I~L~Dve~ 93 (665)
T KOG4640|consen 35 IATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKDGT-IRLHDVEK 93 (665)
T ss_pred hheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCCCCEEEEEecCCe-EEEEEccC
Confidence 4566789999999888 677889998888888 9999999999999999876 99999964
No 322
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=88.38 E-value=0.8 Score=56.17 Aligned_cols=101 Identities=17% Similarity=0.293 Sum_probs=72.2
Q ss_pred ccccCCCCeEEEEECCCC---------------cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccC
Q 001814 342 GADMDNAGIVVVKDFVTR---------------AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRS 406 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~---------------~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~ 406 (1010)
++-++.||.++|..+.+. ..-.++.+|...|.-+.|+.+.+.|-|.+.+| .|.||-+-.
T Consensus 29 IAcgG~dGlLKVlKl~t~t~d~~~~glaa~snLsmNQtLeGH~~sV~vvTWNe~~QKLTtSDt~G-lIiVWmlyk----- 102 (1189)
T KOG2041|consen 29 IACGGADGLLKVLKLGTDTTDLNKSGLAAASNLSMNQTLEGHNASVMVVTWNENNQKLTTSDTSG-LIIVWMLYK----- 102 (1189)
T ss_pred EEeccccceeEEEEccccCCcccccccccccccchhhhhccCcceEEEEEeccccccccccCCCc-eEEEEeeec-----
Confidence 456778899888877541 12357889999999999999999998888855 699997742
Q ss_pred CCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 407 GSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 407 ~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
|. |-. .... .| ....|.+++|..||+.|++.=.||.|.|=.+
T Consensus 103 --gs----W~E---EMiN-nR--nKSvV~SmsWn~dG~kIcIvYeDGavIVGsv 144 (1189)
T KOG2041|consen 103 --GS----WCE---EMIN-NR--NKSVVVSMSWNLDGTKICIVYEDGAVIVGSV 144 (1189)
T ss_pred --cc----HHH---HHhh-Cc--CccEEEEEEEcCCCcEEEEEEccCCEEEEee
Confidence 21 110 0100 12 2346999999999999999999988765443
No 323
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=88.11 E-value=2.5 Score=49.90 Aligned_cols=112 Identities=16% Similarity=0.230 Sum_probs=73.9
Q ss_pred CCCeEEEEECCCCc-EEEEec-cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc-eEEE
Q 001814 347 NAGIVVVKDFVTRA-IISQFK-AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH-VHLY 423 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~-~v~~~~-aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~-~~L~ 423 (1010)
.+|.|.|+|-.... .+..+. -|.+||-.+-++|-|...++... +..|.-|...+.+-... + .-+|.-.+ .-||
T Consensus 120 ~sg~i~VvD~~~d~~q~~~fkklH~sPV~~i~y~qa~Ds~vSiD~-~gmVEyWs~e~~~qfPr-~--~l~~~~K~eTdLy 195 (558)
T KOG0882|consen 120 KSGKIFVVDGFGDFCQDGYFKKLHFSPVKKIRYNQAGDSAVSIDI-SGMVEYWSAEGPFQFPR-T--NLNFELKHETDLY 195 (558)
T ss_pred cCCCcEEECCcCCcCccceecccccCceEEEEeeccccceeeccc-cceeEeecCCCcccCcc-c--cccccccccchhh
Confidence 45667777765443 333443 59999999999999999998888 45699998763111000 0 01122111 1233
Q ss_pred EEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCC
Q 001814 424 KLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFG 464 (1010)
Q Consensus 424 ~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~g 464 (1010)
.+-.. .....++.|||||..+++-+.|.+|++|.+...+
T Consensus 196 ~f~K~--Kt~pts~Efsp~g~qistl~~DrkVR~F~~KtGk 234 (558)
T KOG0882|consen 196 GFPKA--KTEPTSFEFSPDGAQISTLNPDRKVRGFVFKTGK 234 (558)
T ss_pred ccccc--ccCccceEEccccCcccccCcccEEEEEEeccch
Confidence 33221 1257899999999999999999999999986543
No 324
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=87.71 E-value=1.2 Score=55.56 Aligned_cols=99 Identities=17% Similarity=0.273 Sum_probs=69.5
Q ss_pred ccCCCCeEEEEECCCCcEEE--EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 344 DMDNAGIVVVKDFVTRAIIS--QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~--~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+....|.|.||- .+|++-. +.+-| +++|||.|.--.||.+=..| .+.||.... +.
T Consensus 36 S~er~GSVtIfa-dtGEPqr~Vt~P~h---atSLCWHpe~~vLa~gwe~g-~~~v~~~~~------------------~e 92 (1416)
T KOG3617|consen 36 SPERGGSVTIFA-DTGEPQRDVTYPVH---ATSLCWHPEEFVLAQGWEMG-VSDVQKTNT------------------TE 92 (1416)
T ss_pred cCCCCceEEEEe-cCCCCCccccccee---hhhhccChHHHHHhhccccc-eeEEEecCC------------------ce
Confidence 455678888874 3343211 11222 34599999988888887655 589998742 12
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
.++.- -.+.+.|+-+.|||||..|.++..=|.||+|.....|..
T Consensus 93 ~htv~-~th~a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d~~g~~ 136 (1416)
T KOG3617|consen 93 THTVV-ETHPAPIQGLDWSHDGTVLMTLDNPGSVHLWRYDVIGEI 136 (1416)
T ss_pred eeeec-cCCCCCceeEEecCCCCeEEEcCCCceeEEEEeeecccc
Confidence 23332 235688999999999999999999999999999866443
No 325
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=87.08 E-value=1.5 Score=50.12 Aligned_cols=81 Identities=19% Similarity=0.215 Sum_probs=54.9
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEeccc-------------c-cccEE
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGI-------------T-SATIQ 435 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~-------------t-~a~I~ 435 (1010)
.-|+++.|+.+|.+||||..+|+ +-+|.-... . .+.|.++... . .-+|.
T Consensus 26 diis~vef~~~Ge~LatGdkgGR-Vv~f~r~~~------~----------~~ey~~~t~fqshepEFDYLkSleieEKin 88 (433)
T KOG1354|consen 26 DIISAVEFDHYGERLATGDKGGR-VVLFEREKL------Y----------KGEYNFQTEFQSHEPEFDYLKSLEIEEKIN 88 (433)
T ss_pred cceeeEEeecccceEeecCCCCe-EEEeecccc------c----------ccceeeeeeeeccCcccchhhhhhhhhhhh
Confidence 46889999999999999999777 557764321 1 0112211110 0 12578
Q ss_pred EEEEccCCC--EEEEEeCCCeEEEEeCCCCCCcc
Q 001814 436 DICFSHYSQ--WIAIVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 436 sIAFSpDg~--~LAsgS~dGTVhIw~I~~~gg~~ 467 (1010)
.|.|-+++. .+..++.|.||++|++...+...
T Consensus 89 kIrw~~~~n~a~FLlstNdktiKlWKi~er~~k~ 122 (433)
T KOG1354|consen 89 KIRWLDDGNLAEFLLSTNDKTIKLWKIRERGSKK 122 (433)
T ss_pred hceecCCCCccEEEEecCCcceeeeeeecccccc
Confidence 889988764 56678889999999998765543
No 326
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=87.04 E-value=14 Score=40.29 Aligned_cols=55 Identities=13% Similarity=0.149 Sum_probs=45.3
Q ss_pred CCCCCEEEEEeCCCC-----eEEEEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCce
Q 001814 169 VNSPTAVRFYSFQSH-----CYEHVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLEN 223 (1010)
Q Consensus 169 ~~sp~tVrIWDlktg-----e~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~ 223 (1010)
++..++|.+|..... +.++.+..+..+..|.+.++.|+++..+...+.|+.+...
T Consensus 110 va~kk~i~i~~~~~~~~~f~~~~ke~~lp~~~~~i~~~~~~i~v~~~~~f~~idl~~~~~ 169 (275)
T PF00780_consen 110 VAVKKKILIYEWNDPRNSFSKLLKEISLPDPPSSIAFLGNKICVGTSKGFYLIDLNTGSP 169 (275)
T ss_pred EEECCEEEEEEEECCcccccceeEEEEcCCCcEEEEEeCCEEEEEeCCceEEEecCCCCc
Confidence 344579999887642 5778888899999999999999999999999999987653
No 327
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=86.81 E-value=1.6 Score=36.40 Aligned_cols=30 Identities=20% Similarity=0.471 Sum_probs=27.1
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
..+|.+++|+|...+||.++.+|. |.||++
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~-v~v~Rl 40 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGE-VLVYRL 40 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECCCe-EEEEEC
Confidence 357999999999999999999887 889998
No 328
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=86.03 E-value=6.1 Score=47.04 Aligned_cols=85 Identities=16% Similarity=0.175 Sum_probs=52.5
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG 428 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG 428 (1010)
-.|.+.|+.++.... +..-...-..=.|+|||+.|+-+|..+..=+||..... |. ....+++.-|
T Consensus 262 ~~iy~~dl~~~~~~~-Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~------g~--------~~~riT~~~~ 326 (425)
T COG0823 262 PDIYLMDLDGKNLPR-LTNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLE------GS--------QVTRLTFSGG 326 (425)
T ss_pred ccEEEEcCCCCccee-cccCCccccCccCCCCCCEEEEEeCCCCCcceEEECCC------CC--------ceeEeeccCC
Confidence 347778888776333 32221111245799999999988877666688877543 21 1233333322
Q ss_pred cccccEEEEEEccCCCEEEEEeCC
Q 001814 429 ITSATIQDICFSHYSQWIAIVSSK 452 (1010)
Q Consensus 429 ~t~a~I~sIAFSpDg~~LAsgS~d 452 (1010)
.. . .-.|||||++|+..+..
T Consensus 327 ~~--~--~p~~SpdG~~i~~~~~~ 346 (425)
T COG0823 327 GN--S--NPVWSPDGDKIVFESSS 346 (425)
T ss_pred CC--c--CccCCCCCCEEEEEecc
Confidence 21 1 56799999999998854
No 329
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=85.79 E-value=7.6 Score=38.15 Aligned_cols=65 Identities=20% Similarity=0.288 Sum_probs=45.8
Q ss_pred eEEEEEC---CCC-CEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 372 ISALCFD---PSG-TLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 372 IsaLaFS---PdG-tlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
|++|++. -|| ..|+.||. +..||||+-. ..+++... ...|.+++=... ..++
T Consensus 2 V~al~~~d~d~dg~~eLlvGs~-D~~IRvf~~~-------------------e~~~Ei~e---~~~v~~L~~~~~-~~F~ 57 (111)
T PF14783_consen 2 VTALCLFDFDGDGENELLVGSD-DFEIRVFKGD-------------------EIVAEITE---TDKVTSLCSLGG-GRFA 57 (111)
T ss_pred eeEEEEEecCCCCcceEEEecC-CcEEEEEeCC-------------------cEEEEEec---ccceEEEEEcCC-CEEE
Confidence 5566654 454 47777887 5679999853 36667643 246787877665 6688
Q ss_pred EEeCCCeEEEEeC
Q 001814 448 IVSSKGTCHVFVL 460 (1010)
Q Consensus 448 sgS~dGTVhIw~I 460 (1010)
.+..+|||-||+-
T Consensus 58 Y~l~NGTVGvY~~ 70 (111)
T PF14783_consen 58 YALANGTVGVYDR 70 (111)
T ss_pred EEecCCEEEEEeC
Confidence 9999999999864
No 330
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.72 E-value=25 Score=44.51 Aligned_cols=46 Identities=11% Similarity=0.034 Sum_probs=36.0
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcEEEEEEcCC-------eEEEEeCCeEEEEE
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSVCMVRCSPR-------IVAVGLATQIYCFD 217 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V~sVa~S~r-------lLAV~ld~~I~IwD 217 (1010)
|++|.|-.+-+.+..+++.|+-++.+|++.|+ .+++|....+.++.
T Consensus 92 DGkv~I~sl~~~~~~~~~df~rpiksial~Pd~~~~~sk~fv~GG~aglvL~e 144 (846)
T KOG2066|consen 92 DGKVVIGSLFTDDEITQYDFKRPIKSIALHPDFSRQQSKQFVSGGMAGLVLSE 144 (846)
T ss_pred CCcEEEeeccCCccceeEecCCcceeEEeccchhhhhhhheeecCcceEEEeh
Confidence 46899999999999999999999999999884 34444433366664
No 331
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=84.18 E-value=64 Score=34.61 Aligned_cols=60 Identities=18% Similarity=0.238 Sum_probs=40.6
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE-ccCCCEEEEE
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF-SHYSQWIAIV 449 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF-SpDg~~LAsg 449 (1010)
..--|+++.+|.+.++.-. +..|.+|+.. | +.+.++.-.. ..+.+++| -+|.+.|.+.
T Consensus 185 ~pDG~~vD~~G~l~va~~~-~~~I~~~~p~--------G----------~~~~~i~~p~--~~~t~~~fgg~~~~~L~vT 243 (246)
T PF08450_consen 185 YPDGLAVDSDGNLWVADWG-GGRIVVFDPD--------G----------KLLREIELPV--PRPTNCAFGGPDGKTLYVT 243 (246)
T ss_dssp EEEEEEEBTTS-EEEEEET-TTEEEEEETT--------S----------CEEEEEE-SS--SSEEEEEEESTTSSEEEEE
T ss_pred CCCcceEcCCCCEEEEEcC-CCEEEEECCC--------c----------cEEEEEcCCC--CCEEEEEEECCCCCEEEEE
Confidence 3667999999998876554 4557888752 4 3555554331 36899999 4788888877
Q ss_pred eC
Q 001814 450 SS 451 (1010)
Q Consensus 450 S~ 451 (1010)
+.
T Consensus 244 ta 245 (246)
T PF08450_consen 244 TA 245 (246)
T ss_dssp EB
T ss_pred eC
Confidence 64
No 332
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=83.38 E-value=1.3e+02 Score=38.87 Aligned_cols=84 Identities=11% Similarity=0.210 Sum_probs=52.6
Q ss_pred CCeEEEEECCCCc-EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 348 AGIVVVKDFVTRA-IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 348 dG~V~VwDl~s~~-~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
...|.+|.+..+. .+..+..|.-++.|-+|++.-.-+..|+. .-+-+|+.. |. .+.+.|.
T Consensus 192 t~~V~~y~l~gr~p~~~~ld~~G~~lnCss~~~~t~qfIca~~--e~l~fY~sd--------~~---------~~cfaf~ 252 (933)
T KOG2114|consen 192 TEQVMLYSLSGRTPSLKVLDNNGISLNCSSFSDGTYQFICAGS--EFLYFYDSD--------GR---------GPCFAFE 252 (933)
T ss_pred cceeEEEEecCCCcceeeeccCCccceeeecCCCCccEEEecC--ceEEEEcCC--------Cc---------ceeeeec
Confidence 3467788877554 34557888899999999976553555543 348889873 21 3556676
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCe
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGT 454 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGT 454 (1010)
+|.+.- +-|..-|..|++....||
T Consensus 253 ~g~kk~----~~~~~~g~~L~v~~~~~~ 276 (933)
T KOG2114|consen 253 VGEKKE----MLVFSFGLLLCVTTDKGT 276 (933)
T ss_pred CCCeEE----EEEEecCEEEEEEccCCC
Confidence 665421 333334667777666654
No 333
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=83.21 E-value=1.6 Score=50.56 Aligned_cols=74 Identities=14% Similarity=0.113 Sum_probs=53.0
Q ss_pred CCCeEEEEEecCc-EEEEEccCCCcceEeeeeccCCEEEEEEecCCCCCCCCCCccccCcEEEEEecCCCCCCCCCCCCC
Q 001814 73 VFKQVLLLGYQNG-FQVLDVEDASNFNELVSKRDGPVSFLQMQPFPVKDDGCEGFRKLHPFLLVVAGEDTNTLAPGQNRS 151 (1010)
Q Consensus 73 ~~~~vLalGy~~G-~qVWDv~~~g~v~ellS~hdGpV~~v~~lP~p~~s~~~D~F~~srpLLAvVsgd~~~~s~~~q~~~ 151 (1010)
++.+.|+++..++ |+|-.....-.......+|..-|+.+.+.++- +|+. + +|
T Consensus 161 ~D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~isl~~~~--------------~LlS--~-----sG------ 213 (390)
T KOG3914|consen 161 PDDQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTISLTDNY--------------LLLS--G-----SG------ 213 (390)
T ss_pred CCCCEEEEecCCceEEEEecCcccchhhhccccHhheeeeeeccCc--------------eeee--c-----CC------
Confidence 4678888888776 88877754443444556688899999988642 2342 2 11
Q ss_pred CccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEeCCC
Q 001814 152 HLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLRFRS 193 (1010)
Q Consensus 152 ~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~f~S 193 (1010)
|++|++||+++|++++++..++
T Consensus 214 --------------------D~tlr~Wd~~sgk~L~t~dl~s 235 (390)
T KOG3914|consen 214 --------------------DKTLRLWDITSGKLLDTCDLSS 235 (390)
T ss_pred --------------------CCcEEEEecccCCcccccchhH
Confidence 4799999999999999887664
No 334
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=82.44 E-value=27 Score=37.96 Aligned_cols=54 Identities=13% Similarity=-0.084 Sum_probs=40.8
Q ss_pred EEEEeCCCCeEE--EEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCceeEEEee
Q 001814 175 VRFYSFQSHCYE--HVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 175 VrIWDlktge~V--~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~tL~t 229 (1010)
-.|.|. .|+.. .++++.+.+.++.+...+|++..++.|.||++.+++.++++..
T Consensus 210 g~fv~~-~G~~~r~~~i~W~~~p~~~~~~~pyli~~~~~~iEV~~~~~~~lvQ~i~~ 265 (275)
T PF00780_consen 210 GVFVNK-NGEPSRKSTIQWSSAPQSVAYSSPYLIAFSSNSIEVRSLETGELVQTIPL 265 (275)
T ss_pred EEEEcC-CCCcCcccEEEcCCchhEEEEECCEEEEECCCEEEEEECcCCcEEEEEEC
Confidence 334444 44443 3678888888888887778777788899999999999888863
No 335
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=81.68 E-value=1.3e+02 Score=40.29 Aligned_cols=98 Identities=16% Similarity=0.246 Sum_probs=59.1
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC--CCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY--GNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d--Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
..|+|||-. +.+-.+=..-..-=.+|+|=|+|.++|+--.+ +..|.+|.-. |- . +--+.++
T Consensus 222 RkirV~drE-g~Lns~se~~~~l~~~LsWkPsgs~iA~iq~~~sd~~IvffErN--------GL-------~-hg~f~l~ 284 (1265)
T KOG1920|consen 222 RKIRVYDRE-GALNSTSEPVEGLQHSLSWKPSGSLIAAIQCKTSDSDIVFFERN--------GL-------R-HGEFVLP 284 (1265)
T ss_pred eeEEEeccc-chhhcccCcccccccceeecCCCCeEeeeeecCCCCcEEEEecC--------Cc-------c-ccccccC
Confidence 789999987 32211111111222468999999999986432 2247788642 31 0 0112222
Q ss_pred cccccccEEEEEEccCCCEEEEE---eCCCeEEEEeCCCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIV---SSKGTCHVFVLSPF 463 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsg---S~dGTVhIw~I~~~ 463 (1010)
+-.....|..++|+.++..||+. ....-|.+|.+..|
T Consensus 285 ~p~de~~ve~L~Wns~sdiLAv~~~~~e~~~v~lwt~~Ny 324 (1265)
T KOG1920|consen 285 FPLDEKEVEELAWNSNSDILAVVTSNLENSLVQLWTTGNY 324 (1265)
T ss_pred CcccccchheeeecCCCCceeeeecccccceEEEEEecCe
Confidence 22222238999999999999994 44445999998765
No 336
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=80.89 E-value=7.1 Score=49.38 Aligned_cols=108 Identities=15% Similarity=0.111 Sum_probs=76.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCC-eEEEEECCCCCEEEEEEcCCC----eEEEEeCCCCcccCCCCCCccccC
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSP-ISALCFDPSGTLLVTASVYGN----NINIFRIMPSCMRSGSGNHKYDWN 416 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtsp-IsaLaFSPdGtlLATAS~dGt----~IrVwdi~p~~~~~~sG~~~~~~~ 416 (1010)
++-|..+|.|.+.+- +.+.+..|+||... |..|-...+-.+|++-.+++. .++||++.+.- +.
T Consensus 38 vvigt~~G~V~~Ln~-s~~~~~~fqa~~~siv~~L~~~~~~~~L~sv~Ed~~~np~llkiw~lek~~-----~n------ 105 (933)
T KOG2114|consen 38 VVIGTADGRVVILNS-SFQLIRGFQAYEQSIVQFLYILNKQNFLFSVGEDEQGNPVLLKIWDLEKVD-----KN------ 105 (933)
T ss_pred EEEeeccccEEEecc-cceeeehheecchhhhhHhhcccCceEEEEEeecCCCCceEEEEecccccC-----CC------
Confidence 345678888877763 24456889999888 666655555579999999887 79999987520 10
Q ss_pred CcceEEEEE-----ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 417 SSHVHLYKL-----HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 417 ~s~~~L~~L-----~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.+.+++|+. .-+.....+.+|+.|.|=+.+|+|=.+|+|..+.=+
T Consensus 106 ~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv~Gf~nG~V~~~~GD 155 (933)
T KOG2114|consen 106 NSPQCLYEHRIFTIKNPTNPSPASSLAVSEDLKTIVCGFTNGLVICYKGD 155 (933)
T ss_pred CCcceeeeeeeeccCCCCCCCcceEEEEEccccEEEEEecCcEEEEEcCc
Confidence 012344333 223234478999999999999999999999887533
No 337
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=80.34 E-value=1.2e+02 Score=35.15 Aligned_cols=90 Identities=11% Similarity=0.023 Sum_probs=47.2
Q ss_pred CCCeEEEEECCCCcEEEEeccCC-CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHT-SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHt-spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
.+|.|..+|..+++.+.....-. ...++..+ .+.+|..++.+|. +.++|..+ | +.+.+.
T Consensus 302 ~~g~l~ald~~tG~~~W~~~~~~~~~~~sp~v--~~g~l~v~~~~G~-l~~ld~~t-------G----------~~~~~~ 361 (394)
T PRK11138 302 QNDRVYALDTRGGVELWSQSDLLHRLLTAPVL--YNGYLVVGDSEGY-LHWINRED-------G----------RFVAQQ 361 (394)
T ss_pred CCCeEEEEECCCCcEEEcccccCCCcccCCEE--ECCEEEEEeCCCE-EEEEECCC-------C----------CEEEEE
Confidence 45666667776666554332100 01111112 1345566777675 77888743 5 345444
Q ss_pred ecccccccEE-EEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 426 HRGITSATIQ-DICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 426 ~RG~t~a~I~-sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
+-+.. .+. ...+ .+..|.+++.+|+++.|++
T Consensus 362 ~~~~~--~~~s~P~~--~~~~l~v~t~~G~l~~~~~ 393 (394)
T PRK11138 362 KVDSS--GFLSEPVV--ADDKLLIQARDGTVYAITR 393 (394)
T ss_pred EcCCC--cceeCCEE--ECCEEEEEeCCceEEEEeC
Confidence 32211 122 1222 2457889999999988765
No 338
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where the protein binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain. Nup88 is overexpressed in tumour cells.
Probab=80.33 E-value=57 Score=41.64 Aligned_cols=88 Identities=15% Similarity=0.303 Sum_probs=54.4
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE-EEE----EecccccccEEEEEEccC--
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH-LYK----LHRGITSATIQDICFSHY-- 442 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~-L~~----L~RG~t~a~I~sIAFSpD-- 442 (1010)
-.|..|.+||+|++||-++..| |-|-.+ |...+. .|. ..+......+ .+. +.+......|..+.|.|.
T Consensus 85 f~v~~i~~n~~g~~lal~G~~~--v~V~~L-P~r~g~-~~~-~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~ 159 (717)
T PF10168_consen 85 FEVHQISLNPTGSLLALVGPRG--VVVLEL-PRRWGK-NGE-FEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSE 159 (717)
T ss_pred eeEEEEEECCCCCEEEEEcCCc--EEEEEe-ccccCc-ccc-ccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCC
Confidence 4688899999999999999865 555555 321000 000 0000111111 011 112223457999999987
Q ss_pred -CCEEEEEeCCCeEEEEeCCC
Q 001814 443 -SQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 443 -g~~LAsgS~dGTVhIw~I~~ 462 (1010)
+..|++=++|+++++|++..
T Consensus 160 ~~~~l~vLtsdn~lR~y~~~~ 180 (717)
T PF10168_consen 160 SDSHLVVLTSDNTLRLYDISD 180 (717)
T ss_pred CCCeEEEEecCCEEEEEecCC
Confidence 48999999999999999963
No 339
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=79.77 E-value=3.9 Score=47.01 Aligned_cols=107 Identities=15% Similarity=0.179 Sum_probs=73.6
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
.+.+..|-+-|++++.. ..|. ..+.|.++.|.-.+.+|..+...|. |-++|+... . .| .. ++...
T Consensus 230 ~G~sqqv~L~nvetg~~-qsf~-sksDVfAlQf~~s~nLv~~GcRnge-I~~iDLR~r--n--qG---~~-----~~a~r 294 (425)
T KOG2695|consen 230 VGLSQQVLLTNVETGHQ-QSFQ-SKSDVFALQFAGSDNLVFNGCRNGE-IFVIDLRCR--N--QG---NG-----WCAQR 294 (425)
T ss_pred ccccceeEEEEeecccc-cccc-cchhHHHHHhcccCCeeEecccCCc-EEEEEeeec--c--cC---CC-----cceEE
Confidence 35566788888887742 3344 6678999999999999998888676 778898531 0 12 11 34444
Q ss_pred EecccccccEEEEEEcc-CCCEEEEEeCCCeEEEEeCCCCCCcccc
Q 001814 425 LHRGITSATIQDICFSH-YSQWIAIVSSKGTCHVFVLSPFGGDSGF 469 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSp-Dg~~LAsgS~dGTVhIw~I~~~gg~~~~ 469 (1010)
|. +...|+++..=. ++++|++++.+|+|.+||+...+....+
T Consensus 295 ly---h~Ssvtslq~Lq~s~q~LmaS~M~gkikLyD~R~~K~~~~V 337 (425)
T KOG2695|consen 295 LY---HDSSVTSLQILQFSQQKLMASDMTGKIKLYDLRATKCKKSV 337 (425)
T ss_pred EE---cCcchhhhhhhccccceEeeccCcCceeEeeehhhhcccce
Confidence 43 233455555443 5799999999999999999876664433
No 340
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=79.33 E-value=1.4e+02 Score=35.37 Aligned_cols=48 Identities=15% Similarity=0.264 Sum_probs=39.1
Q ss_pred CCEEEEEeCCCCeEEEEEeCC-CcEEEEEEcC--CeEEEEeCCeEEEEECCC
Q 001814 172 PTAVRFYSFQSHCYEHVLRFR-SSVCMVRCSP--RIVAVGLATQIYCFDALT 220 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~-S~V~sVa~S~--rlLAV~ld~~I~IwD~~T 220 (1010)
|..|+||+. +|+.+.++.+. +.|..+.|+. ++|+|..++.+++||+..
T Consensus 60 p~~I~iys~-sG~ll~~i~w~~~~iv~~~wt~~e~LvvV~~dG~v~vy~~~G 110 (410)
T PF04841_consen 60 PNSIQIYSS-SGKLLSSIPWDSGRIVGMGWTDDEELVVVQSDGTVRVYDLFG 110 (410)
T ss_pred CcEEEEECC-CCCEeEEEEECCCCEEEEEECCCCeEEEEEcCCEEEEEeCCC
Confidence 457999998 57788888876 5899999976 577778888999999863
No 341
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown, although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP, is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport.; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=79.07 E-value=21 Score=42.81 Aligned_cols=45 Identities=13% Similarity=0.246 Sum_probs=31.2
Q ss_pred CCEEEEEeCCCCeEEEEEeCCC---cEEEEEEcC------CeEEEEeCCeEEEE
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRS---SVCMVRCSP------RIVAVGLATQIYCF 216 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S---~V~sVa~S~------rlLAV~ld~~I~Iw 216 (1010)
.+++.|||+++++.+.++.... .++.|+|-. -++.+++..+|..|
T Consensus 221 G~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss~i~~~ 274 (461)
T PF05694_consen 221 GHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSSSIWRF 274 (461)
T ss_dssp --EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--EEEEEE
T ss_pred cCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccceEEEE
Confidence 4899999999999999998863 578888853 26667777777665
No 342
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=78.77 E-value=1.6e+02 Score=35.67 Aligned_cols=58 Identities=16% Similarity=0.272 Sum_probs=37.3
Q ss_pred CCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 381 GTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 381 GtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
|.+|+..+. + .|.+||+.. + +.+.+.. -..|..+.||+|+.++|..+.+ ++.|++.
T Consensus 117 G~LL~~~~~-~-~i~~yDw~~-------~----------~~i~~i~----v~~vk~V~Ws~~g~~val~t~~-~i~il~~ 172 (443)
T PF04053_consen 117 GNLLGVKSS-D-FICFYDWET-------G----------KLIRRID----VSAVKYVIWSDDGELVALVTKD-SIYILKY 172 (443)
T ss_dssp SSSEEEEET-T-EEEEE-TTT-------------------EEEEES----S-E-EEEEE-TTSSEEEEE-S--SEEEEEE
T ss_pred CcEEEEECC-C-CEEEEEhhH-------c----------ceeeEEe----cCCCcEEEEECCCCEEEEEeCC-eEEEEEe
Confidence 999988876 3 599999863 3 3454442 2248999999999999999866 6778775
Q ss_pred CC
Q 001814 461 SP 462 (1010)
Q Consensus 461 ~~ 462 (1010)
+.
T Consensus 173 ~~ 174 (443)
T PF04053_consen 173 NL 174 (443)
T ss_dssp -H
T ss_pred cc
Confidence 43
No 343
>PRK13616 lipoprotein LpqB; Provisional
Probab=78.39 E-value=11 Score=46.84 Aligned_cols=97 Identities=11% Similarity=0.154 Sum_probs=56.9
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE--EEe
Q 001814 349 GIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY--KLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~--~L~ 426 (1010)
..+.+++... .....+.+. .++.-.|||||+.|++.+......++.+-.. .+ .++ .+.
T Consensus 379 s~Lwv~~~gg-~~~~lt~g~--~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~------~g-----------ql~~~~vd 438 (591)
T PRK13616 379 SSLWVGPLGG-VAVQVLEGH--SLTRPSWSLDADAVWVVVDGNTVVRVIRDPA------TG-----------QLARTPVD 438 (591)
T ss_pred eEEEEEeCCC-cceeeecCC--CCCCceECCCCCceEEEecCcceEEEeccCC------Cc-----------eEEEEecc
Confidence 3566666532 222222332 3777899999999999987435555544211 11 222 221
Q ss_pred cccc----cccEEEEEEccCCCEEEEEeCCCeEEEEeCCC-CCCc
Q 001814 427 RGIT----SATIQDICFSHYSQWIAIVSSKGTCHVFVLSP-FGGD 466 (1010)
Q Consensus 427 RG~t----~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~-~gg~ 466 (1010)
.|.. ...|.++.|||||++||... +|.|+|=.|.. .+|.
T Consensus 439 ~ge~~~~~~g~Issl~wSpDG~RiA~i~-~g~v~Va~Vvr~~~G~ 482 (591)
T PRK13616 439 ASAVASRVPGPISELQLSRDGVRAAMII-GGKVYLAVVEQTEDGQ 482 (591)
T ss_pred CchhhhccCCCcCeEEECCCCCEEEEEE-CCEEEEEEEEeCCCCc
Confidence 1111 23599999999999999987 57776655544 3444
No 344
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=78.33 E-value=17 Score=42.19 Aligned_cols=101 Identities=14% Similarity=0.147 Sum_probs=59.9
Q ss_pred cCCCCeEEEEECCCCcEEEE-eccCCCCeEEEEECCCCCEEEEEEcCC----------CeEEEEeCCCCcccCCCCCCcc
Q 001814 345 MDNAGIVVVKDFVTRAIISQ-FKAHTSPISALCFDPSGTLLVTASVYG----------NNINIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~-~~aHtspIsaLaFSPdGtlLATAS~dG----------t~IrVwdi~p~~~~~~sG~~~~ 413 (1010)
|+....++|+|+.+++.+.. |..- . -..++|.+||+.|.....+. ..|..|++-. +
T Consensus 146 G~e~~~l~v~Dl~tg~~l~d~i~~~-~-~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt-------~---- 212 (414)
T PF02897_consen 146 GSEWYTLRVFDLETGKFLPDGIENP-K-FSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGT-------P---- 212 (414)
T ss_dssp TSSEEEEEEEETTTTEEEEEEEEEE-E-SEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS------------
T ss_pred CCceEEEEEEECCCCcCcCCccccc-c-cceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCC-------C----
Confidence 44556799999999987643 2321 1 12399999999877665432 2355555522 1
Q ss_pred ccCCcceEEEEEeccccccc-EEEEEEccCCCEEEEEeCCCe--EEEEeCCCC
Q 001814 414 DWNSSHVHLYKLHRGITSAT-IQDICFSHYSQWIAIVSSKGT--CHVFVLSPF 463 (1010)
Q Consensus 414 ~~~~s~~~L~~L~RG~t~a~-I~sIAFSpDg~~LAsgS~dGT--VhIw~I~~~ 463 (1010)
......+|+- ..... ..++..|+|++||.+.+.+++ -.||-+...
T Consensus 213 --~~~d~lvfe~---~~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~ 260 (414)
T PF02897_consen 213 --QSEDELVFEE---PDEPFWFVSVSRSKDGRYLFISSSSGTSESEVYLLDLD 260 (414)
T ss_dssp --GGG-EEEEC----TTCTTSEEEEEE-TTSSEEEEEEESSSSEEEEEEEECC
T ss_pred --hHhCeeEEee---cCCCcEEEEEEecCcccEEEEEEEccccCCeEEEEecc
Confidence 0111345443 22233 678999999999998666554 456655544
No 345
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=78.24 E-value=6.5 Score=46.02 Aligned_cols=52 Identities=10% Similarity=0.213 Sum_probs=44.1
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcEEEEEEcC---CeEEEEeC-CeEEEEECCCCce
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSVCMVRCSP---RIVAVGLA-TQIYCFDALTLEN 223 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V~sVa~S~---rlLAV~ld-~~I~IwD~~Tle~ 223 (1010)
.++++|.|+++..++.++..+..++++.|.. +.|..|+. +.|+|||++..+.
T Consensus 215 ~nkiki~dlet~~~vssy~a~~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~~~~ 270 (463)
T KOG1645|consen 215 GNKIKIMDLETSCVVSSYIAYNQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQPEG 270 (463)
T ss_pred CceEEEEecccceeeeheeccCCceeeeeccCCcceeEEeccCceEEEEEccCCCc
Confidence 4899999999999999998889999999865 57777775 5799999997553
No 346
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=76.78 E-value=2.9 Score=53.75 Aligned_cols=71 Identities=13% Similarity=0.139 Sum_probs=50.9
Q ss_pred CCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccc-cccEEEEEEccCCCEEEEEeCCCeEEE
Q 001814 379 PSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGIT-SATIQDICFSHYSQWIAIVSSKGTCHV 457 (1010)
Q Consensus 379 PdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t-~a~I~sIAFSpDg~~LAsgS~dGTVhI 457 (1010)
-.+..+|.++.+|+ +-.+|.. | .|..+++|.. ...|.+++|+.||+.++.|-.+|-|.+
T Consensus 97 ~~~~~ivi~Ts~gh-vl~~d~~--------~-----------nL~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~G~V~v 156 (1206)
T KOG2079|consen 97 IVVVPIVIGTSHGH-VLLSDMT--------G-----------NLGPLHQNERVQGPVTSVAFNQDGSLLLAGLGDGHVTV 156 (1206)
T ss_pred eeeeeEEEEcCchh-hhhhhhh--------c-----------ccchhhcCCccCCcceeeEecCCCceeccccCCCcEEE
Confidence 45678888888777 5567663 2 1222333332 247999999999999999999999999
Q ss_pred EeCCCCCCcccc
Q 001814 458 FVLSPFGGDSGF 469 (1010)
Q Consensus 458 w~I~~~gg~~~~ 469 (1010)
|++...+-...+
T Consensus 157 ~D~~~~k~l~~i 168 (1206)
T KOG2079|consen 157 WDMHRAKILKVI 168 (1206)
T ss_pred EEccCCcceeee
Confidence 999875444433
No 347
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=76.53 E-value=36 Score=39.73 Aligned_cols=94 Identities=16% Similarity=0.209 Sum_probs=65.7
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEc--CCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASV--YGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK 424 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~--dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~ 424 (1010)
.+..|.+.|..+...+..+.--.. -..++|+|+|+.+..+.. ....+-++|... + ..+.+
T Consensus 94 ~~~~v~vid~~~~~~~~~~~vG~~-P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t-------~----------~~~~~ 155 (381)
T COG3391 94 DSNTVSVIDTATNTVLGSIPVGLG-PVGLAVDPDGKYVYVANAGNGNNTVSVIDAAT-------N----------KVTAT 155 (381)
T ss_pred CCCeEEEEcCcccceeeEeeeccC-CceEEECCCCCEEEEEecccCCceEEEEeCCC-------C----------eEEEE
Confidence 467899999887777665543333 356999999988877766 245577777542 2 24444
Q ss_pred EecccccccEEEEEEccCCCEEEEEe-CCCeEEEEeCC
Q 001814 425 LHRGITSATIQDICFSHYSQWIAIVS-SKGTCHVFVLS 461 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~LAsgS-~dGTVhIw~I~ 461 (1010)
...|... ..++|+|||+.+.+.. .++++.+++..
T Consensus 156 ~~vG~~P---~~~a~~p~g~~vyv~~~~~~~v~vi~~~ 190 (381)
T COG3391 156 IPVGNTP---TGVAVDPDGNKVYVTNSDDNTVSVIDTS 190 (381)
T ss_pred EecCCCc---ceEEECCCCCeEEEEecCCCeEEEEeCC
Confidence 5556433 7899999999777665 78899998844
No 348
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=76.30 E-value=1.5 Score=55.03 Aligned_cols=100 Identities=13% Similarity=0.129 Sum_probs=78.1
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCC-eEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 342 GADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGN-NINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 342 iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt-~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
++-|...|.|++|++.+|.......+|.++|+-|.=+.||.++.|.|.-.. ..-+|++.. .| .
T Consensus 1116 L~vG~~~Geik~~nv~sG~~e~s~ncH~SavT~vePs~dgs~~Ltsss~S~PlsaLW~~~s------~~----------~ 1179 (1516)
T KOG1832|consen 1116 LAVGSHAGEIKIFNVSSGSMEESVNCHQSAVTLVEPSVDGSTQLTSSSSSSPLSALWDASS------TG----------G 1179 (1516)
T ss_pred EEeeeccceEEEEEccCccccccccccccccccccccCCcceeeeeccccCchHHHhcccc------cc----------C
Confidence 345778999999999999999999999999999999999998887765333 456888742 12 2
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCC
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
..+.|+ + -.++.||...++-+.|+...-+|||++.+.
T Consensus 1180 ~~Hsf~-e-----d~~vkFsn~~q~r~~gt~~d~a~~YDvqT~ 1216 (1516)
T KOG1832|consen 1180 PRHSFD-E-----DKAVKFSNSLQFRALGTEADDALLYDVQTC 1216 (1516)
T ss_pred cccccc-c-----cceeehhhhHHHHHhcccccceEEEecccC
Confidence 333442 2 245889988888889999899999999864
No 349
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=75.72 E-value=3.1 Score=46.29 Aligned_cols=94 Identities=18% Similarity=0.216 Sum_probs=61.7
Q ss_pred CeEEEEECCCCcEE-EEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 349 GIVVVKDFVTRAII-SQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 349 G~V~VwDl~s~~~v-~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
+..++|++...+.+ -..++- ..|.+|+-.|.-+.|+.++.++..+-+||... + .++ .-+++
T Consensus 159 d~~~a~~~~p~~t~~~~~~~~-~~v~~l~~hp~qq~~v~cgt~dg~~~l~d~rn-------~--~~p-----~S~l~--- 220 (319)
T KOG4714|consen 159 DNFYANTLDPIKTLIPSKKAL-DAVTALCSHPAQQHLVCCGTDDGIVGLWDARN-------V--AMP-----VSLLK--- 220 (319)
T ss_pred cceeeeccccccccccccccc-ccchhhhCCcccccEEEEecCCCeEEEEEccc-------c--cch-----HHHHH---
Confidence 45566777644321 111222 34999999998776666666677799999853 1 000 11121
Q ss_pred ccccccEEEEEEcc-CCCEEEEEeCCCeEEEEeCC
Q 001814 428 GITSATIQDICFSH-YSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 428 G~t~a~I~sIAFSp-Dg~~LAsgS~dGTVhIw~I~ 461 (1010)
. +.+.|+.+-|.| +...|.+++.||.+--|+-+
T Consensus 221 a-hk~~i~eV~FHpk~p~~Lft~sedGslw~wdas 254 (319)
T KOG4714|consen 221 A-HKAEIWEVHFHPKNPEHLFTCSEDGSLWHWDAS 254 (319)
T ss_pred H-hhhhhhheeccCCCchheeEecCCCcEEEEcCC
Confidence 1 346799999998 57899999999998888765
No 350
>PRK02888 nitrous-oxide reductase; Validated
Probab=75.44 E-value=21 Score=44.59 Aligned_cols=106 Identities=9% Similarity=0.075 Sum_probs=70.2
Q ss_pred CCeEEEEECCC-----CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEE
Q 001814 348 AGIVVVKDFVT-----RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHL 422 (1010)
Q Consensus 348 dG~V~VwDl~s-----~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L 422 (1010)
++.|.|.|..+ .+.+..+.--.+| .-|.+||||++|+.++....++-|+|+......= .+. .+... ...
T Consensus 295 gn~V~VID~~t~~~~~~~v~~yIPVGKsP-HGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~-~~~--~~~~~--~vv 368 (635)
T PRK02888 295 GSKVPVVDGRKAANAGSALTRYVPVPKNP-HGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLF-DGK--IKPRD--AVV 368 (635)
T ss_pred CCEEEEEECCccccCCcceEEEEECCCCc-cceEECCCCCEEEEeCCCCCcEEEEEChhhhhhh-hcc--CCccc--eEE
Confidence 56799999998 4566666543333 5689999999999998877789999986420000 000 00011 112
Q ss_pred EEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 423 YKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 423 ~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
.+..-|.. -...+|.++|.-..+--.|..|-.|++..
T Consensus 369 aevevGlG---PLHTaFDg~G~aytslf~dsqv~kwn~~~ 405 (635)
T PRK02888 369 AEPELGLG---PLHTAFDGRGNAYTTLFLDSQIVKWNIEA 405 (635)
T ss_pred EeeccCCC---cceEEECCCCCEEEeEeecceeEEEehHH
Confidence 22333432 24578999999888888899999999976
No 351
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=75.14 E-value=1.6e+02 Score=33.79 Aligned_cols=101 Identities=14% Similarity=0.152 Sum_probs=56.5
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.|.|..+|. .+..+..+..|-.-=+.|||||||+.|..+......|..|+..+. .|.. ..+..+....
T Consensus 142 ~G~lyr~~p-~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~-----~g~~------~~~~~~~~~~ 209 (307)
T COG3386 142 TGSLYRVDP-DGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPA-----TGPI------GGRRGFVDFD 209 (307)
T ss_pred cceEEEEcC-CCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecCcc-----cCcc------CCcceEEEcc
Confidence 344545554 455666666654445679999999999888887676777776431 0110 0011111111
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCC-eEEEEeCC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKG-TCHVFVLS 461 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dG-TVhIw~I~ 461 (1010)
......-.++...||.+-+++-..| -|++|+-+
T Consensus 210 -~~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd 243 (307)
T COG3386 210 -EEPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD 243 (307)
T ss_pred -CCCCCCCceEEeCCCCEEEecccCCceEEEECCC
Confidence 1122344567777777664433333 78887754
No 352
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=74.48 E-value=76 Score=36.81 Aligned_cols=51 Identities=14% Similarity=0.109 Sum_probs=32.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEE-EEECCCCCEEEEEEcCCCeEEEEe
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISA-LCFDPSGTLLVTASVYGNNINIFR 398 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsa-LaFSPdGtlLATAS~dGt~IrVwd 398 (1010)
++.+|.|.+.|..+++.+...+-+...+.+ ..+ .+| .|..++.+|+ +..|+
T Consensus 341 ~~~~G~l~~ld~~tG~~~~~~~~~~~~~~s~P~~-~~~-~l~v~t~~G~-l~~~~ 392 (394)
T PRK11138 341 GDSEGYLHWINREDGRFVAQQKVDSSGFLSEPVV-ADD-KLLIQARDGT-VYAIT 392 (394)
T ss_pred EeCCCEEEEEECCCCCEEEEEEcCCCcceeCCEE-ECC-EEEEEeCCce-EEEEe
Confidence 467899999999999988777644333332 222 244 5556677676 44444
No 353
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=73.47 E-value=5.3 Score=50.30 Aligned_cols=59 Identities=24% Similarity=0.311 Sum_probs=50.7
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 341 AGADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 341 ~iasgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
.++.+-.-|.+.||...+.+....-..|..||.-|.|||+|+.|+|+..-| .+.+|+..
T Consensus 73 vLa~gwe~g~~~v~~~~~~e~htv~~th~a~i~~l~wS~~G~~l~t~d~~g-~v~lwr~d 131 (1416)
T KOG3617|consen 73 VLAQGWEMGVSDVQKTNTTETHTVVETHPAPIQGLDWSHDGTVLMTLDNPG-SVHLWRYD 131 (1416)
T ss_pred HHhhccccceeEEEecCCceeeeeccCCCCCceeEEecCCCCeEEEcCCCc-eeEEEEee
Confidence 345667789999999998887767778999999999999999999999966 48999875
No 354
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=73.36 E-value=8.3 Score=49.90 Aligned_cols=57 Identities=18% Similarity=0.369 Sum_probs=38.8
Q ss_pred ccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 344 DMDNAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
-+...|.|...|.... ....+=..-+.||++++|+.||++|+.|=.+|. |.|||+..
T Consensus 104 i~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~G~-V~v~D~~~ 161 (1206)
T KOG2079|consen 104 IGTSHGHVLLSDMTGNLGPLHQNERVQGPVTSVAFNQDGSLLLAGLGDGH-VTVWDMHR 161 (1206)
T ss_pred EEcCchhhhhhhhhcccchhhcCCccCCcceeeEecCCCceeccccCCCc-EEEEEccC
Confidence 3445566777776542 111111123579999999999999988877665 89999964
No 355
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=72.38 E-value=1.2e+02 Score=34.77 Aligned_cols=50 Identities=14% Similarity=0.117 Sum_probs=37.9
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEeCCeEEEEECCCCc
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGLATQIYCFDALTLE 222 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~ld~~I~IwD~~Tle 222 (1010)
..+-+||+.+++..........+....++| +.||...+..|+++++.+.+
T Consensus 23 ~~y~i~d~~~~~~~~l~~~~~~~~~~~~sP~g~~~~~v~~~nly~~~~~~~~ 74 (353)
T PF00930_consen 23 GDYYIYDIETGEITPLTPPPPKLQDAKWSPDGKYIAFVRDNNLYLRDLATGQ 74 (353)
T ss_dssp EEEEEEETTTTEEEESS-EETTBSEEEE-SSSTEEEEEETTEEEEESSTTSE
T ss_pred eeEEEEecCCCceEECcCCccccccceeecCCCeeEEEecCceEEEECCCCC
Confidence 679999999987644333345778888887 68888899999999987763
No 356
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=72.11 E-value=17 Score=41.97 Aligned_cols=111 Identities=13% Similarity=0.151 Sum_probs=68.0
Q ss_pred EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc-----------------------CCCCCCc--ccc
Q 001814 361 IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR-----------------------SGSGNHK--YDW 415 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~-----------------------~~sG~~~--~~~ 415 (1010)
.+....+|..+|.++-|+-.-+++++++.|-. + .|........ .-.|... .-.
T Consensus 106 ~~r~~~~h~~~v~~~if~~~~e~V~s~~~dk~-~-~~hc~e~~~~lg~Y~~~~~~t~~~~d~~~~fvGd~~gqvt~lr~~ 183 (404)
T KOG1409|consen 106 FLKDYLAHQARVSAIVFSLTHEWVLSTGKDKQ-F-AWHCTESGNRLGGYNFETPASALQFDALYAFVGDHSGQITMLKLE 183 (404)
T ss_pred hhhhhhhhhcceeeEEecCCceeEEEeccccc-e-EEEeeccCCcccceEeeccCCCCceeeEEEEecccccceEEEEEe
Confidence 44556678888888888888888888877422 2 4443221000 0000000 000
Q ss_pred CCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCCcc-ccccccCC
Q 001814 416 NSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGGDS-GFQTLSSQ 475 (1010)
Q Consensus 416 ~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg~~-~~~~H~s~ 475 (1010)
......++++ .|++ +.|.+++|.+-.+.|.++..|..+.+|+|.-..+.. .++.|+..
T Consensus 184 ~~~~~~i~~~-~~h~-~~~~~l~Wd~~~~~LfSg~~d~~vi~wdigg~~g~~~el~gh~~k 242 (404)
T KOG1409|consen 184 QNGCQLITTF-NGHT-GEVTCLKWDPGQRLLFSGASDHSVIMWDIGGRKGTAYELQGHNDK 242 (404)
T ss_pred ecCCceEEEE-cCcc-cceEEEEEcCCCcEEEeccccCceEEEeccCCcceeeeeccchhh
Confidence 0111234444 4544 469999999999999999999999999998766643 44666543
No 357
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=70.56 E-value=8.7 Score=29.71 Aligned_cols=26 Identities=15% Similarity=0.335 Sum_probs=20.4
Q ss_pred cEEEEEEccCCCEEEEEeCC---CeEEEE
Q 001814 433 TIQDICFSHYSQWIAIVSSK---GTCHVF 458 (1010)
Q Consensus 433 ~I~sIAFSpDg~~LAsgS~d---GTVhIw 458 (1010)
.....+|||||++|+-++.+ |..+||
T Consensus 10 ~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 10 DDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred cccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 46778999999999988876 567776
No 358
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=68.76 E-value=36 Score=38.98 Aligned_cols=78 Identities=19% Similarity=0.342 Sum_probs=48.1
Q ss_pred CCeEEEEECCCCCEEEEEEc---CCC-----eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecc-cccccEEEEEEc
Q 001814 370 SPISALCFDPSGTLLVTASV---YGN-----NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRG-ITSATIQDICFS 440 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~---dGt-----~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG-~t~a~I~sIAFS 440 (1010)
...+.+..+|+|.+-++... .+. .-+||.+.|. | ++.++..+ .. .-..||||
T Consensus 111 ~r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~------g-----------~~~~l~~~~~~--~~NGla~S 171 (307)
T COG3386 111 NRPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD------G-----------GVVRLLDDDLT--IPNGLAFS 171 (307)
T ss_pred CCCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCC------C-----------CEEEeecCcEE--ecCceEEC
Confidence 34567889999998887655 111 1256666542 2 22233233 21 23679999
Q ss_pred cCCCEEEEEeC-CCeEEEEeCCCCCCc
Q 001814 441 HYSQWIAIVSS-KGTCHVFVLSPFGGD 466 (1010)
Q Consensus 441 pDg~~LAsgS~-dGTVhIw~I~~~gg~ 466 (1010)
||++.|..+-. .+.||-|++.+..+.
T Consensus 172 pDg~tly~aDT~~~~i~r~~~d~~~g~ 198 (307)
T COG3386 172 PDGKTLYVADTPANRIHRYDLDPATGP 198 (307)
T ss_pred CCCCEEEEEeCCCCeEEEEecCcccCc
Confidence 99976666544 588999998864443
No 359
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=68.74 E-value=15 Score=28.34 Aligned_cols=30 Identities=13% Similarity=0.298 Sum_probs=21.2
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCC--CeEEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYG--NNINIF 397 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dG--t~IrVw 397 (1010)
....-..-+|||||+.|+-++..+ ....||
T Consensus 7 ~~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 7 SPGDDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SSSSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred CCccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 445667789999999998888765 445555
No 360
>PRK13616 lipoprotein LpqB; Provisional
Probab=68.52 E-value=23 Score=44.13 Aligned_cols=101 Identities=15% Similarity=0.160 Sum_probs=59.8
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.+.+.+.++..+.... .....|..+.|||||+.||-... |+ |.|=-+.+. ..|. .......++..
T Consensus 429 ~gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~~-g~-v~Va~Vvr~----~~G~------~~l~~~~~l~~ 493 (591)
T PRK13616 429 TGQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMIIG-GK-VYLAVVEQT----EDGQ------YALTNPREVGP 493 (591)
T ss_pred CceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEEC-CE-EEEEEEEeC----CCCc------eeecccEEeec
Confidence 4556555665554322 33568999999999999988764 54 555333321 0121 00011122322
Q ss_pred ccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCCCCC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSPFGG 465 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~~gg 465 (1010)
+.. ..+.+++|..|+.. +++..++...||.++..|.
T Consensus 494 ~l~-~~~~~l~W~~~~~L-~V~~~~~~~~v~~v~vDG~ 529 (591)
T PRK13616 494 GLG-DTAVSLDWRTGDSL-VVGRSDPEHPVWYVNLDGS 529 (591)
T ss_pred ccC-CccccceEecCCEE-EEEecCCCCceEEEecCCc
Confidence 322 13578999999995 4667777778898876654
No 361
>PF12429 DUF3676: Protein of unknown function (DUF3676) ; InterPro: IPR022144 This domain family is found in eukaryotes, and is approximately 230 amino acids in length.
Probab=68.01 E-value=6.1 Score=41.84 Aligned_cols=109 Identities=15% Similarity=0.124 Sum_probs=58.1
Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCccCccceeeccCcc--ccccccceeccCcccCCccCCCCcceeeccceeeeeec
Q 001814 871 ENDNPHVNNHIPNGLPSLESNLPSAGRDDTIVAVSMLGAD--YYDSHMGIIMEDRALPLLSCPVNLGVSLREEHCKIVEQ 948 (1010)
Q Consensus 871 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 948 (1010)
+.....+|.+++-..||-...-++.++-.+-..+-.++.- +--.|..-... -+--+.+.+-.-++-.-..+++
T Consensus 28 eqeE~~v~d~vpAA~sST~~aGSSV~EpAiA~esA~nS~~edNaQlsegetsq-----QaT~~e~~~smQrdSdvQ~qd~ 102 (230)
T PF12429_consen 28 EQEEESVDDPVPAASSSTVAAGSSVPEPAIAAESAENSRPEDNAQLSEGETSQ-----QATLNEDNKSMQRDSDVQPQDP 102 (230)
T ss_pred hhcccccccccccCCccccccccccCchhHhhhhhhccCccccccccCCcccc-----cccccccchhhhcccccCccCC
Confidence 3444667777777777766666666666555444444332 11111110000 0111112221112222222222
Q ss_pred CCcccceeeeeccCCCCCCcccccccccccCcccccccc
Q 001814 949 NGLCKSTDVVNDDINGGNSHCESKKLEEDAEDDEMLGGM 987 (1010)
Q Consensus 949 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 987 (1010)
...+|||| +|+=-++-| -+.++|||+|+-+|+-||-
T Consensus 103 -qs~e~Te~-~DvE~sses-~d~e~PeEeg~and~Sg~s 138 (230)
T PF12429_consen 103 -QSEELTEV-TDVEGSSES-NDTEQPEEEGEANDRSGGS 138 (230)
T ss_pred -cchhcccc-ccccccccc-ccccCcchhccccCCCCCc
Confidence 35688887 887444444 4899999999999999985
No 362
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=67.88 E-value=2.1e+02 Score=33.60 Aligned_cols=51 Identities=18% Similarity=0.292 Sum_probs=41.1
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCC-EEEEEEcCCCeEEEEeCCC
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGT-LLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGt-lLATAS~dGt~IrVwdi~p 401 (1010)
.|=++|+.+++.+..+..- .++.+|+.+.|.+ +|.+.+..+..+.|||..+
T Consensus 270 eVWv~D~~t~krv~Ri~l~-~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~t 321 (342)
T PF06433_consen 270 EVWVYDLKTHKRVARIPLE-HPIDSIAVSQDDKPLLYALSAGDGTLDVYDAAT 321 (342)
T ss_dssp EEEEEETTTTEEEEEEEEE-EEESEEEEESSSS-EEEEEETTTTEEEEEETTT
T ss_pred EEEEEECCCCeEEEEEeCC-CccceEEEccCCCcEEEEEcCCCCeEEEEeCcC
Confidence 5888999999999988842 3788999999998 7767766556699999864
No 363
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=67.79 E-value=11 Score=43.53 Aligned_cols=57 Identities=18% Similarity=0.167 Sum_probs=43.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcEEEEEEcC----CeEEEEe-CCeEEEEECCCCceeEEEee
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSVCMVRCSP----RIVAVGL-ATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V~sVa~S~----rlLAV~l-d~~I~IwD~~Tle~l~tL~t 229 (1010)
+.|=+||+++++.+..+....++.+|.++. .++++.. ++.|.+||+.|++.+.++..
T Consensus 269 teVWv~D~~t~krv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk~~~~~~~ 330 (342)
T PF06433_consen 269 TEVWVYDLKTHKRVARIPLEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATGKLVRSIEQ 330 (342)
T ss_dssp EEEEEEETTTTEEEEEEEEEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT--EEEEE--
T ss_pred eEEEEEECCCCeEEEEEeCCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCCcEEeehhc
Confidence 678889999999999999888888888875 2444443 46799999999999888874
No 364
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=67.61 E-value=13 Score=30.64 Aligned_cols=29 Identities=17% Similarity=0.233 Sum_probs=25.2
Q ss_pred EEEEEEccCC---CEEEEEeCCCeEEEEeCCC
Q 001814 434 IQDICFSHYS---QWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 434 I~sIAFSpDg---~~LAsgS~dGTVhIw~I~~ 462 (1010)
|.++.|||+. ..||.+-..|-|||+|+..
T Consensus 3 vR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43)
T PF10313_consen 3 VRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43)
T ss_pred eEEEEeCCCCCcccEEEEEccCCeEEEEEccc
Confidence 7899999865 4999999999999999973
No 365
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=66.74 E-value=6.7 Score=48.68 Aligned_cols=97 Identities=15% Similarity=0.263 Sum_probs=68.0
Q ss_pred ccCCCCeEEEEECCCCcEEEEec--cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFK--AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~--aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
+.+.+|.|.||-+-.+.-.-.+. -..+-|.+|+|+-||+.++.+=+||.+ .|=.+. |. ..|.
T Consensus 88 tSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGav-IVGsvd--------GN--RIwg----- 151 (1189)
T KOG2041|consen 88 TSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAV-IVGSVD--------GN--RIWG----- 151 (1189)
T ss_pred ccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCE-EEEeec--------cc--eecc-----
Confidence 56789999999998776432221 245788999999999999999888874 454442 21 0111
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.+| .|.. ...+.||+|.+.+..+-..|.+|+|+..
T Consensus 152 -KeL-kg~~---l~hv~ws~D~~~~Lf~~ange~hlydnq 186 (1189)
T KOG2041|consen 152 -KEL-KGQL---LAHVLWSEDLEQALFKKANGETHLYDNQ 186 (1189)
T ss_pred -hhc-chhe---ccceeecccHHHHHhhhcCCcEEEeccc
Confidence 112 2322 3457899999999999999999999864
No 366
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=65.96 E-value=27 Score=40.56 Aligned_cols=64 Identities=16% Similarity=0.245 Sum_probs=40.7
Q ss_pred CCeEEEEECCCCCEEEEE-EcCCC---eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 370 SPISALCFDPSGTLLVTA-SVYGN---NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATA-S~dGt---~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
..+...++||||++||-+ +..|. .|+|+|+.. | +.+-.. +....-..++|.+|++.
T Consensus 124 ~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~t-------g----------~~l~d~---i~~~~~~~~~W~~d~~~ 183 (414)
T PF02897_consen 124 VSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLET-------G----------KFLPDG---IENPKFSSVSWSDDGKG 183 (414)
T ss_dssp EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEETTT-------T----------EEEEEE---EEEEESEEEEECTTSSE
T ss_pred EEeeeeeECCCCCEEEEEecCCCCceEEEEEEECCC-------C----------cCcCCc---ccccccceEEEeCCCCE
Confidence 345578999999988855 55443 599999963 4 233221 11122234999999988
Q ss_pred EEEEeCCC
Q 001814 446 IAIVSSKG 453 (1010)
Q Consensus 446 LAsgS~dG 453 (1010)
|.....+.
T Consensus 184 ~~y~~~~~ 191 (414)
T PF02897_consen 184 FFYTRFDE 191 (414)
T ss_dssp EEEEECST
T ss_pred EEEEEeCc
Confidence 77766544
No 367
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=65.05 E-value=90 Score=37.03 Aligned_cols=86 Identities=15% Similarity=0.166 Sum_probs=48.6
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccC---CCCCC-----------------------ccccCCcceEEEE
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRS---GSGNH-----------------------KYDWNSSHVHLYK 424 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~---~sG~~-----------------------~~~~~~s~~~L~~ 424 (1010)
.|..+.|.++-.-||.+-..|. +-||....+.... ..... ......-...++-
T Consensus 3 ~v~~vs~a~~t~Elav~~~~Ge-Vv~~k~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~l~di~~r~~~~~~~gf~P~~l 81 (395)
T PF08596_consen 3 SVTHVSFAPETLELAVGLESGE-VVLFKFGKNQNYGNREQPPDLDYNFRRFSLNNSPGKLTDISDRAPPSLKEGFLPLTL 81 (395)
T ss_dssp -EEEEEEETTTTEEEEEETTS--EEEEEEEE------------------S--GGGSS-SEEE-GGG--TT-SEEEEEEEE
T ss_pred eEEEEEecCCCceEEEEccCCc-EEEEEcccCCCCCccCCCcccCcccccccccCCCcceEEehhhCCcccccccCchhh
Confidence 5789999999888888888787 4477654321000 00000 0000111122222
Q ss_pred EecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 425 LHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 425 L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
++- ..+.|..++.| |=-|+|+|..+|++-|.|+
T Consensus 82 ~~~--~~g~vtal~~S-~iGFvaigy~~G~l~viD~ 114 (395)
T PF08596_consen 82 LDA--KQGPVTALKNS-DIGFVAIGYESGSLVVIDL 114 (395)
T ss_dssp E-----S-SEEEEEE--BTSEEEEEETTSEEEEEET
T ss_pred eec--cCCcEeEEecC-CCcEEEEEecCCcEEEEEC
Confidence 221 24579999998 7779999999999999998
No 368
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=64.89 E-value=72 Score=35.18 Aligned_cols=77 Identities=16% Similarity=0.147 Sum_probs=47.9
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEe
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
.+..-+|+++|.+.+.... ....+++.-... |. .....+........|.++.+||||.++|...
T Consensus 67 ~l~~PS~d~~g~~W~v~~~-~~~~~~~~~~~~------g~---------~~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~ 130 (253)
T PF10647_consen 67 SLTRPSWDPDGWVWTVDDG-SGGVRVVRDSAS------GT---------GEPVEVDWPGLRGRITALRVSPDGTRVAVVV 130 (253)
T ss_pred ccccccccCCCCEEEEEcC-CCceEEEEecCC------Cc---------ceeEEecccccCCceEEEEECCCCcEEEEEE
Confidence 6778899999988876665 344666642110 21 0111221110111799999999999999988
Q ss_pred C---CCeEEEEeCCCC
Q 001814 451 S---KGTCHVFVLSPF 463 (1010)
Q Consensus 451 ~---dGTVhIw~I~~~ 463 (1010)
. ++.|.|=.|...
T Consensus 131 ~~~~~~~v~va~V~r~ 146 (253)
T PF10647_consen 131 EDGGGGRVYVAGVVRD 146 (253)
T ss_pred ecCCCCeEEEEEEEeC
Confidence 3 467777777543
No 369
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=64.88 E-value=84 Score=36.73 Aligned_cols=93 Identities=12% Similarity=0.199 Sum_probs=63.6
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEE--
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYK-- 424 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~-- 424 (1010)
..+.|.+.|..+++++..+..-..| ..++|+|+|+.+..+......|-+++.... .+.+
T Consensus 138 ~~~~vsvid~~t~~~~~~~~vG~~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~------------------~v~~~~ 198 (381)
T COG3391 138 GNNTVSVIDAATNKVTATIPVGNTP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGN------------------SVVRGS 198 (381)
T ss_pred CCceEEEEeCCCCeEEEEEecCCCc-ceEEECCCCCeEEEEecCCCeEEEEeCCCc------------------ceeccc
Confidence 4678999999999988877665567 889999999977777655667889986421 1111
Q ss_pred ----EecccccccEEEEEEccCCCEEEEEeCC---CeEEEEeCC
Q 001814 425 ----LHRGITSATIQDICFSHYSQWIAIVSSK---GTCHVFVLS 461 (1010)
Q Consensus 425 ----L~RG~t~a~I~sIAFSpDg~~LAsgS~d---GTVhIw~I~ 461 (1010)
...+. .-..++++|||+++.+.-.. +++.+.+..
T Consensus 199 ~~~~~~~~~---~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~ 239 (381)
T COG3391 199 VGSLVGVGT---GPAGIAVDPDGNRVYVANDGSGSNNVLKIDTA 239 (381)
T ss_pred cccccccCC---CCceEEECCCCCEEEEEeccCCCceEEEEeCC
Confidence 11111 23678999999977665544 355555544
No 370
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=63.33 E-value=16 Score=40.14 Aligned_cols=105 Identities=13% Similarity=0.091 Sum_probs=65.0
Q ss_pred ccCCCCeEEEEECCCCcEE-EEeccCCCCeEEE-EECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 344 DMDNAGIVVVKDFVTRAII-SQFKAHTSPISAL-CFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v-~~~~aHtspIsaL-aFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
-+..+|.|.+|...--... -.+..-..+|.++ .--.++.+..++..+|. ||-|++.|+ ++
T Consensus 75 vG~~dg~v~~~n~n~~g~~~d~~~s~~e~i~~~Ip~~~~~~~~c~~~~dg~-ir~~n~~p~-----------------k~ 136 (238)
T KOG2444|consen 75 VGTSDGAVYVFNWNLEGAHSDRVCSGEESIDLGIPNGRDSSLGCVGAQDGR-IRACNIKPN-----------------KV 136 (238)
T ss_pred eecccceEEEecCCccchHHHhhhcccccceeccccccccceeEEeccCCc-eeeeccccC-----------------ce
Confidence 4567999999987621111 1111122344442 22245556677777554 999999873 23
Q ss_pred EEEEeccccc-ccEEEEEEccCCCEEEEE--eCCCeEEEEeCCCCCCccc
Q 001814 422 LYKLHRGITS-ATIQDICFSHYSQWIAIV--SSKGTCHVFVLSPFGGDSG 468 (1010)
Q Consensus 422 L~~L~RG~t~-a~I~sIAFSpDg~~LAsg--S~dGTVhIw~I~~~gg~~~ 468 (1010)
+- ++|.++ ..+..+.-+.-+++|+++ |.+.+++.|++.+....+.
T Consensus 137 ~g--~~g~h~~~~~e~~ivv~sd~~i~~a~~S~d~~~k~W~ve~~~d~~~ 184 (238)
T KOG2444|consen 137 LG--YVGQHNFESGEELIVVGSDEFLKIADTSHDRVLKKWNVEKIKDESP 184 (238)
T ss_pred ee--eeccccCCCcceeEEecCCceEEeeccccchhhhhcchhhhhccCc
Confidence 22 234444 556666677778888888 8889999999987655543
No 371
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=62.79 E-value=40 Score=39.95 Aligned_cols=84 Identities=18% Similarity=0.298 Sum_probs=53.7
Q ss_pred EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe------cccccccE
Q 001814 361 IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH------RGITSATI 434 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~------RG~t~a~I 434 (1010)
....+.....+|++|+.|.=| ++|.|.++|+ +-|.|+. |- ..+|.-. .+.....|
T Consensus 78 P~~l~~~~~g~vtal~~S~iG-Fvaigy~~G~-l~viD~R--------GP---------avI~~~~i~~~~~~~~~~~~v 138 (395)
T PF08596_consen 78 PLTLLDAKQGPVTALKNSDIG-FVAIGYESGS-LVVIDLR--------GP---------AVIYNENIRESFLSKSSSSYV 138 (395)
T ss_dssp EEEEE---S-SEEEEEE-BTS-EEEEEETTSE-EEEEETT--------TT---------EEEEEEEGGG--T-SS----E
T ss_pred chhheeccCCcEeEEecCCCc-EEEEEecCCc-EEEEECC--------CC---------eEEeeccccccccccccccCe
Confidence 344556668999999998555 8899998775 8899995 31 2444411 11223468
Q ss_pred EEEEEc-----cCC---CEEEEEeCCCeEEEEeCCCC
Q 001814 435 QDICFS-----HYS---QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 435 ~sIAFS-----pDg---~~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+|.|+ .|+ -.|.+|++.|++.+|+|-|.
T Consensus 139 t~ieF~vm~~~~D~ySSi~L~vGTn~G~v~~fkIlp~ 175 (395)
T PF08596_consen 139 TSIEFSVMTLGGDGYSSICLLVGTNSGNVLTFKILPS 175 (395)
T ss_dssp EEEEEEEEE-TTSSSEEEEEEEEETTSEEEEEEEEE-
T ss_pred eEEEEEEEecCCCcccceEEEEEeCCCCEEEEEEecC
Confidence 888888 233 58899999999999999763
No 372
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=62.26 E-value=36 Score=38.98 Aligned_cols=70 Identities=19% Similarity=0.232 Sum_probs=39.9
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
...|..+..++||+++|.++. |..+.-|+- |. ...+.+. |. +..+|++|.|+||+...++
T Consensus 144 ~gs~~~~~r~~dG~~vavs~~-G~~~~s~~~---------G~-------~~w~~~~--r~-~~~riq~~gf~~~~~lw~~ 203 (302)
T PF14870_consen 144 SGSINDITRSSDGRYVAVSSR-GNFYSSWDP---------GQ-------TTWQPHN--RN-SSRRIQSMGFSPDGNLWML 203 (302)
T ss_dssp ---EEEEEE-TTS-EEEEETT-SSEEEEE-T---------T--------SS-EEEE-----SSS-EEEEEE-TTS-EEEE
T ss_pred cceeEeEEECCCCcEEEEECc-ccEEEEecC---------CC-------ccceEEc--cC-ccceehhceecCCCCEEEE
Confidence 367888899999999987766 987777763 31 1123333 43 3457999999999877665
Q ss_pred EeCCCeEEEEe
Q 001814 449 VSSKGTCHVFV 459 (1010)
Q Consensus 449 gS~dGTVhIw~ 459 (1010)
+ ..|-++.=+
T Consensus 204 ~-~Gg~~~~s~ 213 (302)
T PF14870_consen 204 A-RGGQIQFSD 213 (302)
T ss_dssp E-TTTEEEEEE
T ss_pred e-CCcEEEEcc
Confidence 5 666665543
No 373
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=61.48 E-value=15 Score=42.03 Aligned_cols=87 Identities=18% Similarity=0.264 Sum_probs=50.8
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcc---ccCCcceEEEEEecccc-cccEEEEEEccCCC-
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKY---DWNSSHVHLYKLHRGIT-SATIQDICFSHYSQ- 444 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~---~~~~s~~~L~~L~RG~t-~a~I~sIAFSpDg~- 444 (1010)
.-|+++-|+..|.+|||+...|+ +-+|.-... -|+... .. +++..-+.+..... ...|..|-|-.|++
T Consensus 27 d~ItaVefd~tg~YlatGDkgGR-Vvlfer~~s-----~~ceykf~teF-Qshe~EFDYLkSleieEKin~I~w~~~t~r 99 (460)
T COG5170 27 DKITAVEFDETGLYLATGDKGGR-VVLFEREKS-----YGCEYKFFTEF-QSHELEFDYLKSLEIEEKINAIEWFDDTGR 99 (460)
T ss_pred ceeeEEEeccccceEeecCCCce-EEEeecccc-----cccchhhhhhh-cccccchhhhhhccHHHHhhheeeecCCCc
Confidence 46899999999999999998666 567764321 011000 00 00000000000001 12477788876653
Q ss_pred -EEEEEeCCCeEEEEeCCCC
Q 001814 445 -WIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 445 -~LAsgS~dGTVhIw~I~~~ 463 (1010)
.+..++.|.|++||+|-..
T Consensus 100 ~hFLlstNdktiKlWKiyek 119 (460)
T COG5170 100 NHFLLSTNDKTIKLWKIYEK 119 (460)
T ss_pred ceEEEecCCceeeeeeeecc
Confidence 5667888999999999654
No 374
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=60.96 E-value=24 Score=45.33 Aligned_cols=93 Identities=15% Similarity=0.211 Sum_probs=59.8
Q ss_pred CCCeEEEEECCCC--cEEE-Eec--cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 347 NAGIVVVKDFVTR--AIIS-QFK--AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 347 ~dG~V~VwDl~s~--~~v~-~~~--aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
.+..+--||..-. +++. +.. +.....+|++-+.+| +||.||.+|. ||+||-. |. .
T Consensus 550 s~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~aTt~~G-~iavgs~~G~-IRLyd~~--------g~----------~ 609 (794)
T PF08553_consen 550 SDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCFATTEDG-YIAVGSNKGD-IRLYDRL--------GK----------R 609 (794)
T ss_pred CCCceEEeccCCCCCceeeccccccccCCCceEEEecCCc-eEEEEeCCCc-EEeeccc--------ch----------h
Confidence 3455666776532 2221 111 345678899999988 4778999887 9999842 31 1
Q ss_pred EEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 422 LYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
-.++.-|. ...|..|.-+.||+||++.+.. -+.|++..
T Consensus 610 AKT~lp~l-G~pI~~iDvt~DGkwilaTc~t-yLlLi~t~ 647 (794)
T PF08553_consen 610 AKTALPGL-GDPIIGIDVTADGKWILATCKT-YLLLIDTL 647 (794)
T ss_pred hhhcCCCC-CCCeeEEEecCCCcEEEEeecc-eEEEEEEe
Confidence 12222232 2479999999999999887655 56666654
No 375
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=57.98 E-value=97 Score=39.04 Aligned_cols=101 Identities=14% Similarity=0.226 Sum_probs=66.1
Q ss_pred CCeEEEEECCCCcEE--EEeccCCCCeEEEEE--CCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEE
Q 001814 348 AGIVVVKDFVTRAII--SQFKAHTSPISALCF--DPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLY 423 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v--~~~~aHtspIsaLaF--SPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~ 423 (1010)
.-.++|||...+... ..| ....+|..|.| .|||+.+.+.+- ++.|.+|.-...-..+ ..+.|. .+.
T Consensus 50 ~~~LtIWD~~~~~lE~~~~f-~~~~~I~dLDWtst~d~qsiLaVGf-~~~v~l~~Q~R~dy~~----~~p~w~----~i~ 119 (631)
T PF12234_consen 50 RSELTIWDTRSGVLEYEESF-SEDDPIRDLDWTSTPDGQSILAVGF-PHHVLLYTQLRYDYTN----KGPSWA----PIR 119 (631)
T ss_pred CCEEEEEEcCCcEEEEeeee-cCCCceeeceeeecCCCCEEEEEEc-CcEEEEEEccchhhhc----CCcccc----eeE
Confidence 347999999887643 233 45678998877 589999988888 5668888643210000 112233 333
Q ss_pred EE-ecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 424 KL-HRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 424 ~L-~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
++ -+.+|+..|.+..|-+||..++.++ ..++||+=
T Consensus 120 ~i~i~~~T~h~Igds~Wl~~G~LvV~sG--Nqlfv~dk 155 (631)
T PF12234_consen 120 KIDISSHTPHPIGDSIWLKDGTLVVGSG--NQLFVFDK 155 (631)
T ss_pred EEEeecCCCCCccceeEecCCeEEEEeC--CEEEEECC
Confidence 33 2566777899999999998876663 34778763
No 376
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=56.72 E-value=24 Score=29.21 Aligned_cols=29 Identities=24% Similarity=0.326 Sum_probs=23.0
Q ss_pred CeEEEEECCCCC---EEEEEEcCCCeEEEEeCC
Q 001814 371 PISALCFDPSGT---LLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 371 pIsaLaFSPdGt---lLATAS~dGt~IrVwdi~ 400 (1010)
.|.++.|||+.. +||-+-..|. |+|+|+.
T Consensus 2 AvR~~kFsP~~~~~DLL~~~E~~g~-vhi~D~R 33 (43)
T PF10313_consen 2 AVRCCKFSPEPGGNDLLAWAEHQGR-VHIVDTR 33 (43)
T ss_pred CeEEEEeCCCCCcccEEEEEccCCe-EEEEEcc
Confidence 578999998554 8887766565 9999986
No 377
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=56.58 E-value=8.2 Score=49.12 Aligned_cols=108 Identities=13% Similarity=0.197 Sum_probs=61.9
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEE-----------ECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcc
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALC-----------FDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKY 413 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLa-----------FSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~ 413 (1010)
+..+|+|++.....- ....|+.|...+..++ +||||+.+|+++.||. ++.|++--. |.
T Consensus 201 ~~~~~~i~lL~~~ra-~~~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~-v~f~Qiyi~------g~--- 269 (1283)
T KOG1916|consen 201 GLKGGEIRLLNINRA-LRSLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGS-VGFYQIYIT------GK--- 269 (1283)
T ss_pred ccCCCceeEeeechH-HHHHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCc-cceeeeeee------cc---
Confidence 345677776555432 2255667766544433 6999999999999886 788887421 21
Q ss_pred ccCCcceEEEEEecccccccEEEEEEccC-------CC--EEEEEeCCCe-EEEEeCCCCCCcc
Q 001814 414 DWNSSHVHLYKLHRGITSATIQDICFSHY-------SQ--WIAIVSSKGT-CHVFVLSPFGGDS 467 (1010)
Q Consensus 414 ~~~~s~~~L~~L~RG~t~a~I~sIAFSpD-------g~--~LAsgS~dGT-VhIw~I~~~gg~~ 467 (1010)
-.++++++.+-....-.|+.+ |... +. ++.++++-++ +++|.-.++.+..
T Consensus 270 ---~~~rclhewkphd~~p~vC~l-c~~~~~~~v~i~~w~~~Itttd~nre~k~w~~a~w~Cll 329 (1283)
T KOG1916|consen 270 ---IVHRCLHEWKPHDKHPRVCWL-CHKQEILVVSIGKWVLRITTTDVNREEKFWAEAPWQCLL 329 (1283)
T ss_pred ---ccHhhhhccCCCCCCCceeee-eccccccCCccceeEEEEecccCCcceeEeeccchhhhh
Confidence 122344444322212234444 3221 33 3445555444 8899988887763
No 378
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=55.37 E-value=1.4e+02 Score=30.68 Aligned_cols=58 Identities=19% Similarity=0.145 Sum_probs=44.3
Q ss_pred CCCEEEEEeCCCCeEEEEEeCCCcEEEEEEc------CCeEEEEeCCeEEEEECCCCceeEEEe
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFRSSVCMVRCS------PRIVAVGLATQIYCFDALTLENKFSVL 228 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~S~V~sVa~S------~rlLAV~ld~~I~IwD~~Tle~l~tL~ 228 (1010)
+++.|..||...+..+-.-+.+..|.+|.+- ..+++||..-.|.-||..-.+..+++.
T Consensus 71 t~t~llaYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~Gfd~~G~e~fWtVt 134 (136)
T PF14781_consen 71 TQTSLLAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQGFDYEGNEIFWTVT 134 (136)
T ss_pred ccceEEEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEEeCCCCcEEEEEec
Confidence 3578999999988776555677788877772 357778888889999988777777654
No 379
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=55.08 E-value=6.5e+02 Score=33.51 Aligned_cols=97 Identities=11% Similarity=0.145 Sum_probs=67.9
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 347 NAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
....|++|+..+.+.+..=..|..++.+|...-.|..+|.|+.-+. |.+-.-... -| .++++-
T Consensus 846 In~~vrLye~t~~~eLr~e~~~~~~~~aL~l~v~gdeI~VgDlm~S-itll~y~~~-----eg-----------~f~evA 908 (1096)
T KOG1897|consen 846 INQSVRLYEWTTERELRIECNISNPIIALDLQVKGDEIAVGDLMRS-ITLLQYKGD-----EG-----------NFEEVA 908 (1096)
T ss_pred cCcEEEEEEccccceehhhhcccCCeEEEEEEecCcEEEEeeccce-EEEEEEecc-----CC-----------ceEEee
Confidence 3457999999988777666678899999999999999999988554 444333221 02 356666
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
|-..+.++..+.+=.|..++ .+...|.+.+-..+
T Consensus 909 rD~~p~Wmtaveil~~d~yl-gae~~gNlf~v~~d 942 (1096)
T KOG1897|consen 909 RDYNPNWMTAVEILDDDTYL-GAENSGNLFTVRKD 942 (1096)
T ss_pred hhhCccceeeEEEecCceEE-eecccccEEEEEec
Confidence 76666678888887555554 56667777666555
No 380
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=54.00 E-value=1.1e+02 Score=36.43 Aligned_cols=101 Identities=19% Similarity=0.176 Sum_probs=46.7
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC---------------CCeEEEEeCCCCcccCCCC
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY---------------GNNINIFRIMPSCMRSGSG 409 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d---------------Gt~IrVwdi~p~~~~~~sG 409 (1010)
++.+..|.=+|..+++... + ....+.+-+--|+||+|+|-=+.+ +-.|.||++.. +
T Consensus 260 ~~~~~~i~~~d~~t~~~~~-~-~~~p~~~H~~ss~Dg~L~vGDG~d~p~~v~~~~~~~~~~~p~i~~~~~~~-------~ 330 (386)
T PF14583_consen 260 GGQDFWIAGYDPDTGERRR-L-MEMPWCSHFMSSPDGKLFVGDGGDAPVDVADAGGYKIENDPWIYLFDVEA-------G 330 (386)
T ss_dssp TT--EEEEEE-TTT--EEE-E-EEE-SEEEEEE-TTSSEEEEEE-------------------EEEEEETTT-------T
T ss_pred CCCceEEEeeCCCCCCceE-E-EeCCceeeeEEcCCCCEEEecCCCCCccccccccceecCCcEEEEecccc-------C
Confidence 3445567778888775432 1 122345567789999988743332 11355666542 1
Q ss_pred CCccccCCcceEEEE------EecccccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCC
Q 001814 410 NHKYDWNSSHVHLYK------LHRGITSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSP 462 (1010)
Q Consensus 410 ~~~~~~~~s~~~L~~------L~RG~t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~ 462 (1010)
....|.+ ...|.....=-...||||++||.-.|+ .|..+||.++-
T Consensus 331 --------~~~~l~~h~~sw~v~~~~~q~~hPhp~FSPDgk~VlF~Sd~~G~~~vY~v~i 382 (386)
T PF14583_consen 331 --------RFRKLARHDTSWKVLDGDRQVTHPHPSFSPDGKWVLFRSDMEGPPAVYLVEI 382 (386)
T ss_dssp --------EEEEEEE-------BTTBSSTT----EE-TTSSEEEEEE-TTSS-EEEEEE-
T ss_pred --------ceeeeeeccCcceeecCCCccCCCCCccCCCCCEEEEECCCCCCccEEEEeC
Confidence 0011111 111211111134789999999987766 68899998764
No 381
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=52.92 E-value=77 Score=37.93 Aligned_cols=40 Identities=15% Similarity=0.346 Sum_probs=32.2
Q ss_pred EEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 361 IISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 361 ~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
....|....-.+.+|+.+|+|++.|+.+.-|+ |-++|+..
T Consensus 299 ~r~~l~D~~R~~~~i~~sP~~~laA~tDslGR-V~LiD~~~ 338 (415)
T PF14655_consen 299 MRFGLPDSKREGESICLSPSGRLAAVTDSLGR-VLLIDVAR 338 (415)
T ss_pred eEEeeccCCceEEEEEECCCCCEEEEEcCCCc-EEEEECCC
Confidence 44556666667889999999999999888788 78999864
No 382
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=50.45 E-value=1.7e+02 Score=32.69 Aligned_cols=83 Identities=12% Similarity=0.195 Sum_probs=47.5
Q ss_pred EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe-cccccccEEEEEEccC
Q 001814 364 QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH-RGITSATIQDICFSHY 442 (1010)
Q Consensus 364 ~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~-RG~t~a~I~sIAFSpD 442 (1010)
.+.+-...|+.|+|+|+...|++...+...|-..+.. | ..+.+.. .|. ...-+|++--+
T Consensus 16 ~l~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~els~~--------G----------~vlr~i~l~g~--~D~EgI~y~g~ 75 (248)
T PF06977_consen 16 PLPGILDELSGLTYNPDTGTLFAVQDEPGEIYELSLD--------G----------KVLRRIPLDGF--GDYEGITYLGN 75 (248)
T ss_dssp E-TT--S-EEEEEEETTTTEEEEEETTTTEEEEEETT--------------------EEEEEE-SS---SSEEEEEE-ST
T ss_pred ECCCccCCccccEEcCCCCeEEEEECCCCEEEEEcCC--------C----------CEEEEEeCCCC--CCceeEEEECC
Confidence 4445556799999999876666665555545444431 4 2333321 332 35678999888
Q ss_pred CCEEEEEeCCCeEEEEeCCCCCCc
Q 001814 443 SQWIAIVSSKGTCHVFVLSPFGGD 466 (1010)
Q Consensus 443 g~~LAsgS~dGTVhIw~I~~~gg~ 466 (1010)
++++++.-.++++.++.+...+..
T Consensus 76 ~~~vl~~Er~~~L~~~~~~~~~~~ 99 (248)
T PF06977_consen 76 GRYVLSEERDQRLYIFTIDDDTTS 99 (248)
T ss_dssp TEEEEEETTTTEEEEEEE----TT
T ss_pred CEEEEEEcCCCcEEEEEEeccccc
Confidence 877776666899999999765543
No 383
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance [].
Probab=49.66 E-value=5.7e+02 Score=32.99 Aligned_cols=74 Identities=14% Similarity=0.174 Sum_probs=49.1
Q ss_pred CeEEEEEC--CCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCC-----
Q 001814 371 PISALCFD--PSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYS----- 443 (1010)
Q Consensus 371 pIsaLaFS--PdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg----- 443 (1010)
....|++. ...++||.++. ++.|-||-..... . ...+.... .+...|-+|+|-++.
T Consensus 165 SaWGLdIh~~~~~rlIAVSsN-s~~VTVFaf~l~~-----~--------r~~~~~s~---~~~hNIP~VSFl~~~~d~~G 227 (717)
T PF08728_consen 165 SAWGLDIHDYKKSRLIAVSSN-SQEVTVFAFALVD-----E--------RFYHVPSH---QHSHNIPNVSFLDDDLDPNG 227 (717)
T ss_pred ceeEEEEEecCcceEEEEecC-CceEEEEEEeccc-----c--------cccccccc---ccccCCCeeEeecCCCCCcc
Confidence 67788887 66677776666 6779998764310 0 00011111 123468999997765
Q ss_pred -CEEEEEeCCCeEEEEeCC
Q 001814 444 -QWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 444 -~~LAsgS~dGTVhIw~I~ 461 (1010)
.+|++++-.|.+-+|++.
T Consensus 228 ~v~v~a~dI~G~v~~~~I~ 246 (717)
T PF08728_consen 228 HVKVVATDISGEVWTFKIK 246 (717)
T ss_pred ceEEEEEeccCcEEEEEEE
Confidence 299999999999999884
No 384
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=48.80 E-value=5.7e+02 Score=30.98 Aligned_cols=57 Identities=14% Similarity=-0.037 Sum_probs=36.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCc-------E--EEEEEcC-CeEEE-EeCCeEEEEECCCCceeEEEee
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSS-------V--CMVRCSP-RIVAV-GLATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~-------V--~sVa~S~-rlLAV-~ld~~I~IwD~~Tle~l~tL~t 229 (1010)
+.|.-.|+++|+.+-..+.... + ..+.... ..|.+ ..++.|+.+|+.|++.+.....
T Consensus 71 g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~~g~v~AlD~~TG~~~W~~~~ 138 (488)
T cd00216 71 SALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTFDGRLVALDAETGKQVWKFGN 138 (488)
T ss_pred CcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecCCCeEEEEECCCCCEeeeecC
Confidence 5677778888887766544321 1 1223333 44444 4567899999999998877653
No 385
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=48.22 E-value=73 Score=42.29 Aligned_cols=68 Identities=13% Similarity=0.164 Sum_probs=50.1
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
..|.++.|--++.-+.-+...|..|-+ |... + ..+. -|.....|.+++||||.+++|..
T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G~iilv-d~et-------~------------~~ei-vg~vd~GI~aaswS~Dee~l~li 127 (1265)
T KOG1920|consen 69 DEIVSVQFLADTNSICVITALGDIILV-DPET-------L------------ELEI-VGNVDNGISAASWSPDEELLALI 127 (1265)
T ss_pred cceEEEEEecccceEEEEecCCcEEEE-cccc-------c------------ceee-eeeccCceEEEeecCCCcEEEEE
Confidence 578888888888888888888885544 5421 1 1122 34444569999999999999999
Q ss_pred eCCCeEEEE
Q 001814 450 SSKGTCHVF 458 (1010)
Q Consensus 450 S~dGTVhIw 458 (1010)
+.++|+.+-
T Consensus 128 T~~~tll~m 136 (1265)
T KOG1920|consen 128 TGRQTLLFM 136 (1265)
T ss_pred eCCcEEEEE
Confidence 999998654
No 386
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=47.72 E-value=1.8e+02 Score=35.23 Aligned_cols=107 Identities=15% Similarity=0.230 Sum_probs=54.5
Q ss_pred CCCCeEEEEECCCCcEEEEeccCC-C-CeEEEEE--CCCCCEEEEEEcC-CCeEEEEeCCCCcccCCCCCCccccCCcce
Q 001814 346 DNAGIVVVKDFVTRAIISQFKAHT-S-PISALCF--DPSGTLLVTASVY-GNNINIFRIMPSCMRSGSGNHKYDWNSSHV 420 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~~~aHt-s-pIsaLaF--SPdGtlLATAS~d-Gt~IrVwdi~p~~~~~~sG~~~~~~~~s~~ 420 (1010)
..-..+.|||+.+.+.+++|.--. + -...|.| +|+-++=.++..- .++.++|.... | .|.+.
T Consensus 219 ~yG~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k~~~-------g----~W~a~-- 285 (461)
T PF05694_consen 219 KYGHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSSSIWRFYKDDD-------G----EWAAE-- 285 (461)
T ss_dssp -S--EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--EEEEEEEE-ET-------T----EEEEE--
T ss_pred cccCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccceEEEEEEcCC-------C----Ceeee--
Confidence 345689999999999988886432 2 2335666 5556654444432 33344444221 2 24332
Q ss_pred EEEEEe---------------cccccccEEEEEEccCCCEEEEEeC-CCeEEEEeCCCCCC
Q 001814 421 HLYKLH---------------RGITSATIQDICFSHYSQWIAIVSS-KGTCHVFVLSPFGG 465 (1010)
Q Consensus 421 ~L~~L~---------------RG~t~a~I~sIAFSpDg~~LAsgS~-dGTVhIw~I~~~gg 465 (1010)
.+.++- .|..+..|++|..|.|.|||.++.- +|.|+.|+|+....
T Consensus 286 kVi~ip~~~v~~~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~ 346 (461)
T PF05694_consen 286 KVIDIPAKKVEGWILPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISDPFN 346 (461)
T ss_dssp EEEEE--EE--SS---GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SSTTS
T ss_pred EEEECCCcccCcccccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCCCCC
Confidence 222210 1112456999999999999988764 89999999987543
No 387
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=47.72 E-value=95 Score=31.77 Aligned_cols=52 Identities=10% Similarity=0.119 Sum_probs=39.3
Q ss_pred CEEEEEeCC--------CCeEEEEEeCCCcEEEEEEc-------CCeEEEEeCCeEEEEECCCCcee
Q 001814 173 TAVRFYSFQ--------SHCYEHVLRFRSSVCMVRCS-------PRIVAVGLATQIYCFDALTLENK 224 (1010)
Q Consensus 173 ~tVrIWDlk--------tge~V~tL~f~S~V~sVa~S-------~rlLAV~ld~~I~IwD~~Tle~l 224 (1010)
++|-|++.. ....+..|.+...|.+|+.- ++.|++|....|-+||+..-..+
T Consensus 20 gKV~IH~ph~~~~~~~~~~~~i~~LNin~~italaaG~l~~~~~~D~LliGt~t~llaYDV~~N~d~ 86 (136)
T PF14781_consen 20 GKVFIHNPHERGQRTGRQDSDISFLNINQEITALAAGRLKPDDGRDCLLIGTQTSLLAYDVENNSDL 86 (136)
T ss_pred CEEEEECCCccccccccccCceeEEECCCceEEEEEEecCCCCCcCEEEEeccceEEEEEcccCchh
Confidence 467777654 44567888999888888653 36899999999999998865543
No 388
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.62 E-value=1.6e+02 Score=33.84 Aligned_cols=42 Identities=24% Similarity=0.313 Sum_probs=33.5
Q ss_pred CCCeEEEEECCCC-cEEEEeccCCCCeEEEEECCCCCEEEEEE
Q 001814 347 NAGIVVVKDFVTR-AIISQFKAHTSPISALCFDPSGTLLVTAS 388 (1010)
Q Consensus 347 ~dG~V~VwDl~s~-~~v~~~~aHtspIsaLaFSPdGtlLATAS 388 (1010)
.-|+|-|||...+ +.+..|..|.-.--.|.+.+||++||.+.
T Consensus 138 ~rGViGvYd~r~~fqrvgE~~t~GiGpHev~lm~DGrtlvvan 180 (366)
T COG3490 138 NRGVIGVYDAREGFQRVGEFSTHGIGPHEVTLMADGRTLVVAN 180 (366)
T ss_pred CCceEEEEecccccceecccccCCcCcceeEEecCCcEEEEeC
Confidence 4688999999754 56778888865556788999999999885
No 389
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=47.06 E-value=1.1e+02 Score=36.25 Aligned_cols=48 Identities=25% Similarity=0.357 Sum_probs=33.9
Q ss_pred eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeCC
Q 001814 393 NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLS 461 (1010)
Q Consensus 393 ~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~ 461 (1010)
.|+||+.. | ..+.+...- ...|.++.|+.+ ..|++...||++++|++.
T Consensus 62 ~I~iys~s--------G----------~ll~~i~w~--~~~iv~~~wt~~-e~LvvV~~dG~v~vy~~~ 109 (410)
T PF04841_consen 62 SIQIYSSS--------G----------KLLSSIPWD--SGRIVGMGWTDD-EELVVVQSDGTVRVYDLF 109 (410)
T ss_pred EEEEECCC--------C----------CEeEEEEEC--CCCEEEEEECCC-CeEEEEEcCCEEEEEeCC
Confidence 48888763 5 355554332 257999999975 555577799999999863
No 390
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=47.03 E-value=40 Score=41.42 Aligned_cols=76 Identities=25% Similarity=0.399 Sum_probs=40.7
Q ss_pred EEEECCCCCEEEEEEcCCCeEEEEeCCCCcc---c-CCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 374 ALCFDPSGTLLVTASVYGNNINIFRIMPSCM---R-SGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 374 aLaFSPdGtlLATAS~dGt~IrVwdi~p~~~---~-~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
-|+|+|+|.|++.-+..+...+|+-+.+... . ...|..-.--......+..|..+...+.++.++|+||++.|.+.
T Consensus 440 NL~~d~~G~LwI~eD~~~~~~~l~g~t~~G~~~~~~~~~G~~~~~~~~~~g~~~rf~~~P~gaE~tG~~fspDg~tlFvn 519 (524)
T PF05787_consen 440 NLAFDPDGNLWIQEDGGGSNNNLPGVTPDGEVYDFARNDGNNVWAYDPDTGELKRFLVGPNGAEITGPCFSPDGRTLFVN 519 (524)
T ss_pred ceEECCCCCEEEEeCCCCCCcccccccccCceeeeeecccceeeeccccccceeeeccCCCCcccccceECCCCCEEEEE
Confidence 3899999998876555443332221111000 0 00000000000111345566677777899999999999988763
No 391
>PF12657 TFIIIC_delta: Transcription factor IIIC subunit delta N-term; InterPro: IPR024761 This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 [].
Probab=46.52 E-value=1.6e+02 Score=30.55 Aligned_cols=30 Identities=13% Similarity=0.238 Sum_probs=25.2
Q ss_pred cEEEEEEccCC------CEEEEEeCCCeEEEEeCCC
Q 001814 433 TIQDICFSHYS------QWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 433 ~I~sIAFSpDg------~~LAsgS~dGTVhIw~I~~ 462 (1010)
.|..++|||-| -.||+-+.+|.+.||.-..
T Consensus 87 ~vv~~aWSP~Gl~~~~rClLavLTs~~~l~l~~~~~ 122 (173)
T PF12657_consen 87 QVVSAAWSPSGLGPNGRCLLAVLTSNGRLSLYGPPG 122 (173)
T ss_pred cEEEEEECCCCCCCCCceEEEEEcCCCeEEEEecCC
Confidence 68899999953 4899999999999998543
No 392
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=45.55 E-value=33 Score=42.93 Aligned_cols=80 Identities=18% Similarity=0.250 Sum_probs=58.0
Q ss_pred CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 368 HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 368 HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
|...|..-|++-.+++|+-|+..|. +.+|+-.. |. .++++. .| ....+..++.|++..++|
T Consensus 32 ~~~~v~lTc~dst~~~l~~GsS~G~-lyl~~R~~-------~~---------~~~~~~-~~-~~~~~~~~~vs~~e~lvA 92 (726)
T KOG3621|consen 32 FPARVKLTCVDATEEYLAMGSSAGS-VYLYNRHT-------GE---------MRKLKN-EG-ATGITCVRSVSSVEYLVA 92 (726)
T ss_pred CcceEEEEEeecCCceEEEecccce-EEEEecCc-------hh---------hhcccc-cC-ccceEEEEEecchhHhhh
Confidence 4456667789999999999998775 77887421 20 122333 22 223577789999999999
Q ss_pred EEeCCCeEEEEeCCCCCCcc
Q 001814 448 IVSSKGTCHVFVLSPFGGDS 467 (1010)
Q Consensus 448 sgS~dGTVhIw~I~~~gg~~ 467 (1010)
+|+..|-|-||.+.. ++++
T Consensus 93 agt~~g~V~v~ql~~-~~p~ 111 (726)
T KOG3621|consen 93 AGTASGRVSVFQLNK-ELPR 111 (726)
T ss_pred hhcCCceEEeehhhc-cCCC
Confidence 999999999999987 4443
No 393
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=45.53 E-value=3.2e+02 Score=27.15 Aligned_cols=52 Identities=12% Similarity=0.107 Sum_probs=37.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 343 ADMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 343 asgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
..++.|..|+||+-. .++..+.-+ +.|..|+-... ..+|.|-..|+ |-||+-
T Consensus 19 lvGs~D~~IRvf~~~--e~~~Ei~e~-~~v~~L~~~~~-~~F~Y~l~NGT-VGvY~~ 70 (111)
T PF14783_consen 19 LVGSDDFEIRVFKGD--EIVAEITET-DKVTSLCSLGG-GRFAYALANGT-VGVYDR 70 (111)
T ss_pred EEecCCcEEEEEeCC--cEEEEEecc-cceEEEEEcCC-CEEEEEecCCE-EEEEeC
Confidence 457889999999865 567777655 45666665555 56888888787 889875
No 394
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1, 2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. 
They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=41.01 E-value=1.1e+02 Score=28.72 Aligned_cols=50 Identities=20% Similarity=0.227 Sum_probs=34.3
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCC
Q 001814 348 AGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 348 dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~ 400 (1010)
-|.|.-||-..-+.+ ..+ -..-+-+++||++++|..|+.-++.|+||...
T Consensus 35 ~~~Vvyyd~~~~~~v--a~g-~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~~ 84 (86)
T PF01731_consen 35 WGNVVYYDGKEVKVV--ASG-FSFANGIAISPDKKYLYVASSLAHSIHVYKRH 84 (86)
T ss_pred CceEEEEeCCEeEEe--ecc-CCCCceEEEcCCCCEEEEEeccCCeEEEEEec
Confidence 356777876432221 122 22335689999999999999888889999864
No 395
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=39.51 E-value=4.2e+02 Score=30.09 Aligned_cols=56 Identities=9% Similarity=-0.118 Sum_probs=38.9
Q ss_pred EEEEEeCCC--CeEEEEEeCCCcEEEEEEcCCeEEEEeCCeEEEEECCCCceeEEEee
Q 001814 174 AVRFYSFQS--HCYEHVLRFRSSVCMVRCSPRIVAVGLATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 174 tVrIWDlkt--ge~V~tL~f~S~V~sVa~S~rlLAV~ld~~I~IwD~~Tle~l~tL~t 229 (1010)
...+-|... ...-..+.+...+.++++....|.+-.+..|.|+++.+++.++++..
T Consensus 223 ~~v~Vn~~G~~~~r~~~l~w~~~p~~~~~~~pyll~~~~~~ievr~l~~~~l~q~i~~ 280 (302)
T smart00036 223 FGVFVNLYGKRRSRNPILHWEFMPESFAYHSPYLLAFHDNGIEIRSIKTGELLQELAD 280 (302)
T ss_pred EEEEEeCCCCccccceEEEcCCcccEEEEECCEEEEEcCCcEEEEECCCCceEEEEec
Confidence 344555532 12234567888888888877666666677899999999988887753
No 396
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from high genetic variability. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=38.92 E-value=5.3e+02 Score=29.17 Aligned_cols=106 Identities=11% Similarity=0.176 Sum_probs=59.0
Q ss_pred CeEEEEECCCCcEEEEecc------CCCCeEEEEECCCC-----CEEEEEEcCCCeEEEEeCCCCcccCC-CCCCccccC
Q 001814 349 GIVVVKDFVTRAIISQFKA------HTSPISALCFDPSG-----TLLVTASVYGNNINIFRIMPSCMRSG-SGNHKYDWN 416 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~a------HtspIsaLaFSPdG-----tlLATAS~dGt~IrVwdi~p~~~~~~-sG~~~~~~~ 416 (1010)
-.+.+||+.++++++++.- ..+-+..|.++... .+..-++..+.-|-|||+.......- .+ .....
T Consensus 34 pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s~Rv~~~--~~~~~ 111 (287)
T PF03022_consen 34 PKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKSWRVLHN--SFSPD 111 (287)
T ss_dssp -EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEEEEEETC--GCTTS
T ss_pred cEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcEEEEecC--Cccee
Confidence 3688999999998776642 25678889998732 45556666556788999965211000 00 00000
Q ss_pred Ccce------EEEEEecccccccEEEEEEcc---CCCEEEEEeCCCeEEEEeCCC
Q 001814 417 SSHV------HLYKLHRGITSATIQDICFSH---YSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 417 ~s~~------~L~~L~RG~t~a~I~sIAFSp---Dg~~LAsgS~dGTVhIw~I~~ 462 (1010)
+... ..+.+. ..|..++.+| |++||.-....++ ++|.|+.
T Consensus 112 p~~~~~~i~g~~~~~~-----dg~~gial~~~~~d~r~LYf~~lss~-~ly~v~T 160 (287)
T PF03022_consen 112 PDAGPFTIGGESFQWP-----DGIFGIALSPISPDGRWLYFHPLSSR-KLYRVPT 160 (287)
T ss_dssp -SSEEEEETTEEEEET-----TSEEEEEE-TTSTTS-EEEEEETT-S-EEEEEEH
T ss_pred ccccceeccCceEecC-----CCccccccCCCCCCccEEEEEeCCCC-cEEEEEH
Confidence 0000 111211 1277788866 8888888877665 6777764
No 397
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=38.03 E-value=8e+02 Score=29.63 Aligned_cols=52 Identities=13% Similarity=0.315 Sum_probs=31.0
Q ss_pred CCCEEEEEeCC--CCe------E----EEEEeCCCcEEEEEEc-------CCeEEE-EeCCeEEEEECCCCc
Q 001814 171 SPTAVRFYSFQ--SHC------Y----EHVLRFRSSVCMVRCS-------PRIVAV-GLATQIYCFDALTLE 222 (1010)
Q Consensus 171 sp~tVrIWDlk--tge------~----V~tL~f~S~V~sVa~S-------~rlLAV-~ld~~I~IwD~~Tle 222 (1010)
.|+++.||.+. .|. + +....|....+++.+- +.+|.| +.|+++.+|+-...-
T Consensus 95 hP~kl~vY~v~~~~g~~~~g~~~~L~~~yeh~l~~~a~nm~~G~Fgg~~~~~~IcVQS~DG~L~~feqe~~~ 166 (418)
T PF14727_consen 95 HPRKLSVYSVSLVDGTVEHGNQYQLELIYEHSLQRTAYNMCCGPFGGVKGRDFICVQSMDGSLSFFEQESFA 166 (418)
T ss_pred cCCEEEEEEEEecCCCcccCcEEEEEEEEEEecccceeEEEEEECCCCCCceEEEEEecCceEEEEeCCcEE
Confidence 37899999883 222 1 1122344455555552 245555 789999999866543
No 398
>KOG2377 consensus Uncharacterized conserved protein [Function unknown]
Probab=37.59 E-value=92 Score=37.67 Aligned_cols=87 Identities=14% Similarity=0.274 Sum_probs=58.5
Q ss_pred CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 369 TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 369 tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
.++|.++.||+|.+.||.--. ...|..++..+... .....++.+. .++.|....|+. ++-||.
T Consensus 66 ~G~I~SIkFSlDnkilAVQR~-~~~v~f~nf~~d~~-------------~l~~~~~ck~--k~~~IlGF~W~~-s~e~A~ 128 (657)
T KOG2377|consen 66 KGEIKSIKFSLDNKILAVQRT-SKTVDFCNFIPDNS-------------QLEYTQECKT--KNANILGFCWTS-STEIAF 128 (657)
T ss_pred CCceeEEEeccCcceEEEEec-CceEEEEecCCCch-------------hhHHHHHhcc--CcceeEEEEEec-CeeEEE
Confidence 358999999999999998877 45688888754210 0011122222 345699999994 588999
Q ss_pred EeCCCeEEEEeCCCCCCccc-ccccc
Q 001814 449 VSSKGTCHVFVLSPFGGDSG-FQTLS 473 (1010)
Q Consensus 449 gS~dGTVhIw~I~~~gg~~~-~~~H~ 473 (1010)
.+..| +-+|.+.|...... +.+|+
T Consensus 129 i~~~G-~e~y~v~pekrslRlVks~~ 153 (657)
T KOG2377|consen 129 ITDQG-IEFYQVLPEKRSLRLVKSHN 153 (657)
T ss_pred EecCC-eEEEEEchhhhhhhhhhhcc
Confidence 88887 68999988665432 24443
No 399
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=36.77 E-value=1.7e+02 Score=22.29 Aligned_cols=22 Identities=23% Similarity=0.391 Sum_probs=18.2
Q ss_pred CCCCEEEEEEcCCCeEEEEeCC
Q 001814 379 PSGTLLVTASVYGNNINIFRIM 400 (1010)
Q Consensus 379 PdGtlLATAS~dGt~IrVwdi~ 400 (1010)
|+|+.|..+...+..|-++|..
T Consensus 1 pd~~~lyv~~~~~~~v~~id~~ 22 (42)
T TIGR02276 1 PDGTKLYVTNSGSNTVSVIDTA 22 (42)
T ss_pred CCCCEEEEEeCCCCEEEEEECC
Confidence 6888888887777789999985
No 400
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=36.68 E-value=52 Score=41.02 Aligned_cols=56 Identities=20% Similarity=0.256 Sum_probs=39.2
Q ss_pred CCCCeEEEEEC-CC-CcEEEEeccCCCC----eEEEEECCCCC-EEEEEEcCCCeEEEEeCCC
Q 001814 346 DNAGIVVVKDF-VT-RAIISQFKAHTSP----ISALCFDPSGT-LLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 346 s~dG~V~VwDl-~s-~~~v~~~~aHtsp----IsaLaFSPdGt-lLATAS~dGt~IrVwdi~p 401 (1010)
..+|.|.+||. .+ ...+.++...+.+ +.++++.|--+ +||+.+.+..+|+.+++..
T Consensus 214 ~~dg~iAiwD~~rnienpl~~i~~~~N~~~~~l~~~aycPtrtglla~l~RdS~tIrlydi~~ 276 (783)
T KOG1008|consen 214 NSDGDIAIWDTYRNIENPLQIILRNENKKPKQLFALAYCPTRTGLLAVLSRDSITIRLYDICV 276 (783)
T ss_pred cccCceeeccchhhhccHHHHHhhCCCCcccceeeEEeccCCcchhhhhccCcceEEEecccc
Confidence 45999999994 22 1223333333333 89999999664 7888888888899999864
No 401
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=34.79 E-value=9e+02 Score=29.27 Aligned_cols=58 Identities=10% Similarity=0.052 Sum_probs=39.5
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcE---EEE----EEcCCeEEEEe----------CCeEEEEECCCCceeEEEee
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSV---CMV----RCSPRIVAVGL----------ATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V---~sV----a~S~rlLAV~l----------d~~I~IwD~~Tle~l~tL~t 229 (1010)
++.|.-+|.++|+.+-..+....+ +.+ .+...+++++. ++.|+.+|+.|++.+++...
T Consensus 119 ~g~v~AlD~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~~~W~~~~ 193 (488)
T cd00216 119 DGRLVALDAETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIGSSGAEFFACGVRGALRAYDVETGKLLWRFYT 193 (488)
T ss_pred CCeEEEEECCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEeccccccccCCCCcEEEEEECCCCceeeEeec
Confidence 367888999999988776654431 112 22234555543 45799999999999887765
No 402
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=34.28 E-value=1.4e+02 Score=36.70 Aligned_cols=102 Identities=18% Similarity=0.228 Sum_probs=61.1
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCC--EEEEEE----cCCCeEEEEeCCCCcccCCCCCCccccCC
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGT--LLVTAS----VYGNNINIFRIMPSCMRSGSGNHKYDWNS 417 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGt--lLATAS----~dGt~IrVwdi~p~~~~~~sG~~~~~~~~ 417 (1010)
++.....+.--|+..|+++..+.-|.. |+-+.|.|+++ .|.+.+ .+.. +||++.|.. .|..--.|.
T Consensus 351 ~~~~~~~l~klDIE~GKIVeEWk~~~d-i~mv~~t~d~K~~Ql~~e~TlvGLs~n--~vfriDpRv----~~~~kl~~~- 422 (644)
T KOG2395|consen 351 DGGEQDKLYKLDIERGKIVEEWKFEDD-INMVDITPDFKFAQLTSEQTLVGLSDN--SVFRIDPRV----QGKNKLAVV- 422 (644)
T ss_pred CCCCcCcceeeecccceeeeEeeccCC-cceeeccCCcchhcccccccEEeecCC--ceEEecccc----cCcceeeee-
Confidence 455556678889999999999998877 78888888764 332111 1122 355554421 121000111
Q ss_pred cceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 418 SHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 418 s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
+-+.+.++. .-+|++-.-+ -+||+||.+|-|++|+-
T Consensus 423 ---q~kqy~~k~---nFsc~aTT~s-G~IvvgS~~GdIRLYdr 458 (644)
T KOG2395|consen 423 ---QSKQYSTKN---NFSCFATTES-GYIVVGSLKGDIRLYDR 458 (644)
T ss_pred ---ecccccccc---ccceeeecCC-ceEEEeecCCcEEeehh
Confidence 112233332 3466666554 49999999999999985
No 403
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=33.89 E-value=69 Score=38.59 Aligned_cols=44 Identities=20% Similarity=0.282 Sum_probs=31.1
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcEEEEEEcC--CeEEEEeCCeEEEEE
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSVCMVRCSP--RIVAVGLATQIYCFD 217 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V~sVa~S~--rlLAV~ld~~I~IwD 217 (1010)
..|.|||+.+++.+..++... |..|-+++ .++|...++.|+|++
T Consensus 126 ~~i~~yDw~~~~~i~~i~v~~-vk~V~Ws~~g~~val~t~~~i~il~ 171 (443)
T PF04053_consen 126 DFICFYDWETGKLIRRIDVSA-VKYVIWSDDGELVALVTKDSIYILK 171 (443)
T ss_dssp TEEEEE-TTT--EEEEESS-E--EEEEE-TTSSEEEEE-S-SEEEEE
T ss_pred CCEEEEEhhHcceeeEEecCC-CcEEEEECCCCEEEEEeCCeEEEEE
Confidence 569999999999999998875 89999996 588888888898876
No 404
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1, 2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. 
They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=33.39 E-value=1.1e+02 Score=28.86 Aligned_cols=28 Identities=21% Similarity=0.322 Sum_probs=23.6
Q ss_pred EEEEEccCCCEEEEEeC-CCeEEEEeCCC
Q 001814 435 QDICFSHYSQWIAIVSS-KGTCHVFVLSP 462 (1010)
Q Consensus 435 ~sIAFSpDg~~LAsgS~-dGTVhIw~I~~ 462 (1010)
..|++|||+++|.+++. +++||||++..
T Consensus 57 NGI~~s~~~k~lyVa~~~~~~I~vy~~~~ 85 (86)
T PF01731_consen 57 NGIAISPDKKYLYVASSLAHSIHVYKRHK 85 (86)
T ss_pred ceEEEcCCCCEEEEEeccCCeEEEEEecC
Confidence 67999999999988766 68999998753
No 405
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=31.86 E-value=4.2e+02 Score=33.65 Aligned_cols=81 Identities=15% Similarity=0.141 Sum_probs=53.9
Q ss_pred EeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEE--cc
Q 001814 364 QFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICF--SH 441 (1010)
Q Consensus 364 ~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAF--Sp 441 (1010)
+|..--...+.+.-|.-+ .+|..+.+++.+.|||+.. | +..|+-.- .....|.++.| .|
T Consensus 24 ~~~T~i~~~~li~gss~~-k~a~V~~~~~~LtIWD~~~-------~----------~lE~~~~f-~~~~~I~dLDWtst~ 84 (631)
T PF12234_consen 24 TFETGISNPSLISGSSIK-KIAVVDSSRSELTIWDTRS-------G----------VLEYEESF-SEDDPIRDLDWTSTP 84 (631)
T ss_pred EEecCCCCcceEeecccC-cEEEEECCCCEEEEEEcCC-------c----------EEEEeeee-cCCCceeeceeeecC
Confidence 344334455556666644 4566677799999999852 2 23333211 12346899887 48
Q ss_pred CCCEEEEEeCCCeEEEEeCCCC
Q 001814 442 YSQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 442 Dg~~LAsgS~dGTVhIw~I~~~ 463 (1010)
||+.|.+.+....|.||.--.+
T Consensus 85 d~qsiLaVGf~~~v~l~~Q~R~ 106 (631)
T PF12234_consen 85 DGQSILAVGFPHHVLLYTQLRY 106 (631)
T ss_pred CCCEEEEEEcCcEEEEEEccch
Confidence 9999999999999999976543
No 406
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=31.10 E-value=8.7e+02 Score=27.95 Aligned_cols=56 Identities=11% Similarity=0.123 Sum_probs=40.8
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcE--E-EEEEcCCeEEEEe-CCeEEEEECCCCceeEEEe
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSV--C-MVRCSPRIVAVGL-ATQIYCFDALTLENKFSVL 228 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V--~-sVa~S~rlLAV~l-d~~I~IwD~~Tle~l~tL~ 228 (1010)
+.+.|-+.++|+.+..+.--..| . .++++..++-.+. ++..|..|..+..+++...
T Consensus 73 g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~~glIycgshd~~~yalD~~~~~cVyksk 132 (354)
T KOG4649|consen 73 GGLYFLCVKTGSQIWNFVILETVKVRAQCDFDGGLIYCGSHDGNFYALDPKTYGCVYKSK 132 (354)
T ss_pred CcEEEEEecchhheeeeeehhhhccceEEcCCCceEEEecCCCcEEEecccccceEEecc
Confidence 57889999999766655444443 2 4556667777766 4579999999999988865
No 407
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=30.36 E-value=4e+02 Score=30.24 Aligned_cols=39 Identities=26% Similarity=0.502 Sum_probs=30.0
Q ss_pred eEEEEEecCcEEEEEccCC-CcceEeeeeccCCEEEEEEecC
Q 001814 76 QVLLLGYQNGFQVLDVEDA-SNFNELVSKRDGPVSFLQMQPF 116 (1010)
Q Consensus 76 ~vLalGy~~G~qVWDv~~~-g~v~ellS~hdGpV~~v~~lP~ 116 (1010)
+.|++|.+.|+-+-|++.. +...++++.+ +|..+.+++.
T Consensus 14 ~~lL~GTe~Gly~~~~~~~~~~~~kl~~~~--~v~q~~v~~~ 53 (302)
T smart00036 14 KWLLVGTEEGLYVLNISDQPGTLEKLIGRR--SVTQIWVLEE 53 (302)
T ss_pred cEEEEEeCCceEEEEcccCCCCeEEecCcC--ceEEEEEEhh
Confidence 5799999999988888543 4566777654 7999998853
No 408
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=30.28 E-value=2.4e+02 Score=31.12 Aligned_cols=67 Identities=12% Similarity=0.170 Sum_probs=44.9
Q ss_pred CeEEEEECCCCCEEEEEE--cCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEE
Q 001814 371 PISALCFDPSGTLLVTAS--VYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAI 448 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS--~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAs 448 (1010)
.+.+.++|+||+.+|... .++..+.++... + ....+..| ..+..-+|++|+...++
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~--------~-----------~~~~~~~g---~~l~~PS~d~~g~~W~v 82 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGPAG--------G-----------PVRPVLTG---GSLTRPSWDPDGWVWTV 82 (253)
T ss_pred cccceEECCCCCeEEEEEEcCCCCEEEEEcCC--------C-----------cceeeccC---CccccccccCCCCEEEE
Confidence 678899999999998877 666666666542 1 11122122 24677899999777766
Q ss_pred EeCCCeEEEEe
Q 001814 449 VSSKGTCHVFV 459 (1010)
Q Consensus 449 gS~dGTVhIw~ 459 (1010)
...+....++.
T Consensus 83 ~~~~~~~~~~~ 93 (253)
T PF10647_consen 83 DDGSGGVRVVR 93 (253)
T ss_pred EcCCCceEEEE
Confidence 66666666664
No 409
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=30.24 E-value=27 Score=41.67 Aligned_cols=58 Identities=16% Similarity=0.199 Sum_probs=48.0
Q ss_pred ccCCCCeEEEEECCC---CcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 344 DMDNAGIVVVKDFVT---RAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s---~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
++..||.++.|--.. -+.+..|++|...|..|+.|-+|.++.|.+.-++.+||||+..
T Consensus 25 qASlDGh~KFWkKs~isGvEfVKhFraHL~~I~sl~~S~dg~L~~Sv~d~Dhs~KvfDvEn 85 (558)
T KOG0882|consen 25 QASLDGHKKFWKKSRISGVEFVKHFRAHLGVILSLAVSYDGWLFRSVEDPDHSVKVFDVEN 85 (558)
T ss_pred eeecchhhhhcCCCCccceeehhhhHHHHHHHHhhhccccceeEeeccCcccceeEEEeec
Confidence 567888888887643 2356789999999999999999999999777567899999964
No 410
>PF07250 Glyoxal_oxid_N: Glyoxal oxidase N-terminus; InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=27.79 E-value=1.5e+02 Score=32.99 Aligned_cols=90 Identities=21% Similarity=0.139 Sum_probs=52.5
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCC--CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEec
Q 001814 350 IVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYG--NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHR 427 (1010)
Q Consensus 350 ~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dG--t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~R 427 (1010)
.-.+||+.+++....-.....-.+.=+|-|||++|.+++..+ +.||+|+-... + ...+|......+.. .|
T Consensus 47 ~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~G~~~ir~~~p~~~---~----~~~~w~e~~~~m~~-~R 118 (243)
T PF07250_consen 47 HSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDNDGNKAIRIFTPCTS---D----GTCDWTESPNDMQS-GR 118 (243)
T ss_pred EEEEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCccccceEEEecCCC---C----CCCCceECcccccC-CC
Confidence 356899998875433233444555667889999999997643 45899885320 0 11234332212111 11
Q ss_pred ccccccEEEEEEccCCCEEEEEeCC
Q 001814 428 GITSATIQDICFSHYSQWIAIVSSK 452 (1010)
Q Consensus 428 G~t~a~I~sIAFSpDg~~LAsgS~d 452 (1010)
+=-...-=|||+.|++|+.+
T Consensus 119 -----WYpT~~~L~DG~vlIvGG~~ 138 (243)
T PF07250_consen 119 -----WYPTATTLPDGRVLIVGGSN 138 (243)
T ss_pred -----ccccceECCCCCEEEEeCcC
Confidence 11123334799999999888
No 411
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=27.75 E-value=2.9e+02 Score=33.70 Aligned_cols=71 Identities=14% Similarity=0.185 Sum_probs=0.0
Q ss_pred CCeEEEEEecCc-EEEEEccC---CCcceEeeeeccC----------------------CEEEEEEec----CCCCCCCC
Q 001814 74 FKQVLLLGYQNG-FQVLDVED---ASNFNELVSKRDG----------------------PVSFLQMQP----FPVKDDGC 123 (1010)
Q Consensus 74 ~~~vLalGy~~G-~qVWDv~~---~g~v~ellS~hdG----------------------pV~~v~~lP----~p~~s~~~ 123 (1010)
....|++++.+| +-...... .+...+..-..+. .+..+.+.+ .-
T Consensus 157 ~~~~l~v~~~dG~ll~l~~~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 230 (547)
T PF11715_consen 157 SEANLVVSLQDGGLLRLKRSSGDSDGSVWSEELFNDSSWLRSLSGLFPWSYRGDNSSSSVAASLAVSSSEINDD------ 230 (547)
T ss_dssp SSSBEEEEESSS-EEEEEES----SSS-EE----STHHHHHCCTTTS-TT---SSSS---EEEEEE-----ETT------
T ss_pred CCCEEEEEECCCCeEEEECCcccCCCCeeEEEEeCCCchhhhhhCcCCcccccCCCCCCccceEEEecceeCCC------
Q ss_pred CCccccCcEEEEEecCCCCCCCCCCCCCCccccccCCcCCCCCCCCCCCCEEEEEeCCCCeEEEEEe
Q 001814 124 EGFRKLHPFLLVVAGEDTNTLAPGQNRSHLGGVRDGMMDSQSGNCVNSPTAVRFYSFQSHCYEHVLR 190 (1010)
Q Consensus 124 D~F~~srpLLAvVsgd~~~~s~~~q~~~~~~~vr~gs~d~~~~~~~~sp~tVrIWDlktge~V~tL~ 190 (1010)
.+|++++.| +++|+||+++++++.++.
T Consensus 231 -------~~l~tl~~D---------------------------------~~LRiW~l~t~~~~~~~~ 257 (547)
T PF11715_consen 231 -------TFLFTLSRD---------------------------------HTLRIWSLETGQCLATID 257 (547)
T ss_dssp -------TEEEEEETT---------------------------------SEEEEEETTTTCEEEEEE
T ss_pred -------CEEEEEeCC---------------------------------CeEEEEECCCCeEEEEec
No 412
>PRK13684 Ycf48-like protein; Provisional
Probab=27.67 E-value=2.4e+02 Score=32.42 Aligned_cols=65 Identities=18% Similarity=0.268 Sum_probs=41.3
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 370 SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
..+..+.+.|+|.+++++.. |..++-++- |. +.-....++ +...+++++|.++++.++++
T Consensus 173 g~~~~i~~~~~g~~v~~g~~-G~i~~s~~~---------gg---------~tW~~~~~~-~~~~l~~i~~~~~g~~~~vg 232 (334)
T PRK13684 173 GVVRNLRRSPDGKYVAVSSR-GNFYSTWEP---------GQ---------TAWTPHQRN-SSRRLQSMGFQPDGNLWMLA 232 (334)
T ss_pred ceEEEEEECCCCeEEEEeCC-ceEEEEcCC---------CC---------CeEEEeeCC-CcccceeeeEcCCCCEEEEe
Confidence 46788999999988877765 754433221 10 111122233 33468999999999987775
Q ss_pred eCCCeE
Q 001814 450 SSKGTC 455 (1010)
Q Consensus 450 S~dGTV 455 (1010)
..|.+
T Consensus 233 -~~G~~ 237 (334)
T PRK13684 233 -RGGQI 237 (334)
T ss_pred -cCCEE
Confidence 45765
No 413
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=27.44 E-value=3e+02 Score=31.15 Aligned_cols=57 Identities=7% Similarity=0.059 Sum_probs=43.3
Q ss_pred CEEEEEeCCCCeEEEEEeCCCcEEEEEEcCCeEEEEe-CCeEEEEECCCCceeEEEee
Q 001814 173 TAVRFYSFQSHCYEHVLRFRSSVCMVRCSPRIVAVGL-ATQIYCFDALTLENKFSVLT 229 (1010)
Q Consensus 173 ~tVrIWDlktge~V~tL~f~S~V~sVa~S~rlLAV~l-d~~I~IwD~~Tle~l~tL~t 229 (1010)
++.-+||.++-+.+.++.++..=+.+....+.|+.+. ..+|+++|..+++...++.-
T Consensus 110 ~~~f~yd~~tl~~~~~~~y~~EGWGLt~dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V 167 (264)
T PF05096_consen 110 GTGFVYDPNTLKKIGTFPYPGEGWGLTSDGKRLIMSDGSSRLYFLDPETFKEVRTIQV 167 (264)
T ss_dssp SEEEEEETTTTEEEEEEE-SSS--EEEECSSCEEEE-SSSEEEEE-TTT-SEEEEEE-
T ss_pred CeEEEEccccceEEEEEecCCcceEEEcCCCEEEEECCccceEEECCcccceEEEEEE
Confidence 7899999999999999999988889998887777665 46899999999998877763
No 414
>PRK10115 protease 2; Provisional
Probab=26.26 E-value=5.1e+02 Score=33.08 Aligned_cols=100 Identities=5% Similarity=-0.021 Sum_probs=55.4
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcC-C----CeEEEEeCCCCcccCCCCCCccccCCcc
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVY-G----NNINIFRIMPSCMRSGSGNHKYDWNSSH 419 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~d-G----t~IrVwdi~p~~~~~~sG~~~~~~~~s~ 419 (1010)
++..-.+.|.|+.++..+.....+.. ..++|++||+.|+-...+ + ..|..+++.+ + ...-
T Consensus 149 G~E~~~l~v~d~~tg~~l~~~i~~~~--~~~~w~~D~~~~~y~~~~~~~~~~~~v~~h~lgt-------~------~~~d 213 (686)
T PRK10115 149 SRRQYGIRFRNLETGNWYPELLDNVE--PSFVWANDSWTFYYVRKHPVTLLPYQVWRHTIGT-------P------ASQD 213 (686)
T ss_pred CcEEEEEEEEECCCCCCCCccccCcc--eEEEEeeCCCEEEEEEecCCCCCCCEEEEEECCC-------C------hhHC
Confidence 44455799999998864322222222 459999999866655443 2 2344555532 1 0012
Q ss_pred eEEEEEecccccccEEEEEEccCCCEEEEEeCCC---eEEEEeCCC
Q 001814 420 VHLYKLHRGITSATIQDICFSHYSQWIAIVSSKG---TCHVFVLSP 462 (1010)
Q Consensus 420 ~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dG---TVhIw~I~~ 462 (1010)
+++++- . ....-..+..+.|++++.+.+..+ .+.+|+...
T Consensus 214 ~lv~~e--~-~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~~~ 256 (686)
T PRK10115 214 ELVYEE--K-DDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDAEL 256 (686)
T ss_pred eEEEee--C-CCCEEEEEEEcCCCCEEEEEEECCccccEEEEECcC
Confidence 455542 1 111112445566999988776665 377777543
No 415
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=26.07 E-value=2.5e+02 Score=26.63 Aligned_cols=27 Identities=11% Similarity=0.172 Sum_probs=17.5
Q ss_pred EEEEEecccccccEEEEEEccCCCEEEEE
Q 001814 421 HLYKLHRGITSATIQDICFSHYSQWIAIV 449 (1010)
Q Consensus 421 ~L~~L~RG~t~a~I~sIAFSpDg~~LAsg 449 (1010)
.+..|..|.. --..|++|+|+++|.++
T Consensus 48 ~~~vl~~~L~--fpNGVals~d~~~vlv~ 74 (89)
T PF03088_consen 48 ETTVLLDGLY--FPNGVALSPDESFVLVA 74 (89)
T ss_dssp EEEEEEEEES--SEEEEEE-TTSSEEEEE
T ss_pred eEEEehhCCC--ccCeEEEcCCCCEEEEE
Confidence 3444545543 34889999999987665
No 416
>PF12657 TFIIIC_delta: Transcription factor IIIC subunit delta N-term; InterPro: IPR024761 This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 [].
Probab=25.58 E-value=7.4e+02 Score=25.62 Aligned_cols=16 Identities=25% Similarity=0.144 Sum_probs=13.9
Q ss_pred cEEEEcCCccEEEEec
Q 001814 569 HLLVYTPSGYVVQHEL 584 (1010)
Q Consensus 569 ~LlV~s~~G~l~~Y~L 584 (1010)
=|.|.+.+|.|.+|.-
T Consensus 105 lLavLTs~~~l~l~~~ 120 (173)
T PF12657_consen 105 LLAVLTSNGRLSLYGP 120 (173)
T ss_pred EEEEEcCCCeEEEEec
Confidence 4899999999999984
No 417
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=25.47 E-value=70 Score=35.40 Aligned_cols=57 Identities=12% Similarity=0.091 Sum_probs=46.5
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCC-CCeEEEEECCCCCEEEEE--EcCCCeEEEEeCCC
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHT-SPISALCFDPSGTLLVTA--SVYGNNINIFRIMP 401 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHt-spIsaLaFSPdGtlLATA--S~dGt~IrVwdi~p 401 (1010)
.+..+|.|+.|.+.-.+.+...-.|+ .++..+..+-.+..|+.+ |. +.+++.|++.+
T Consensus 119 ~~~~dg~ir~~n~~p~k~~g~~g~h~~~~~e~~ivv~sd~~i~~a~~S~-d~~~k~W~ve~ 178 (238)
T KOG2444|consen 119 VGAQDGRIRACNIKPNKVLGYVGQHNFESGEELIVVGSDEFLKIADTSH-DRVLKKWNVEK 178 (238)
T ss_pred EeccCCceeeeccccCceeeeeccccCCCcceeEEecCCceEEeecccc-chhhhhcchhh
Confidence 35689999999999988888777888 788888888888888888 76 56688888864
No 418
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=24.91 E-value=2.2e+02 Score=31.05 Aligned_cols=52 Identities=15% Similarity=0.286 Sum_probs=38.5
Q ss_pred CCCCeEEEEECCCCcEEEE-------ec-------cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeC
Q 001814 346 DNAGIVVVKDFVTRAIISQ-------FK-------AHTSPISALCFDPSGTLLVTASVYGNNINIFRI 399 (1010)
Q Consensus 346 s~dG~V~VwDl~s~~~v~~-------~~-------aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi 399 (1010)
..+|.+.|||+.+++.+.. +. .....|..+.++.+|.-|++-+. |. ..+|+.
T Consensus 29 T~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt~~G~PiV~lsn-g~-~y~y~~ 94 (219)
T PF07569_consen 29 TSSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLTSNGVPIVTLSN-GD-SYSYSP 94 (219)
T ss_pred eCCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEcCCCCEEEEEeC-CC-EEEecc
Confidence 4689999999998764321 22 24567888888999998888876 65 567775
No 419
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.48 E-value=3.3e+02 Score=31.49 Aligned_cols=75 Identities=16% Similarity=0.234 Sum_probs=44.7
Q ss_pred EEEEECCCCcEEE--------EeccCCCCeEEEEECCCCCEEEEEEcC----CCeEEEEeCCCCcccCCCCCCccccCCc
Q 001814 351 VVVKDFVTRAIIS--------QFKAHTSPISALCFDPSGTLLVTASVY----GNNINIFRIMPSCMRSGSGNHKYDWNSS 418 (1010)
Q Consensus 351 V~VwDl~s~~~v~--------~~~aHtspIsaLaFSPdGtlLATAS~d----Gt~IrVwdi~p~~~~~~sG~~~~~~~~s 418 (1010)
..|+|....+.+. +|-+|. +|||||.+|...-.| -.+|-|||....
T Consensus 93 ~~vfD~~~~~~pv~~~s~~~RHfyGHG------vfs~dG~~LYATEndfd~~rGViGvYd~r~~---------------- 150 (366)
T COG3490 93 AMVFDPNGAQEPVTLVSQEGRHFYGHG------VFSPDGRLLYATENDFDPNRGVIGVYDAREG---------------- 150 (366)
T ss_pred EEEECCCCCcCcEEEecccCceeeccc------ccCCCCcEEEeecCCCCCCCceEEEEecccc----------------
Confidence 3456665554333 455663 699999998643221 236888887521
Q ss_pred ceEEEEEe-cccccccEEEEEEccCCCEEEEEe
Q 001814 419 HVHLYKLH-RGITSATIQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 419 ~~~L~~L~-RG~t~a~I~sIAFSpDg~~LAsgS 450 (1010)
.+++-++. -|+ .--.+.|.+||+.|+++.
T Consensus 151 fqrvgE~~t~Gi---GpHev~lm~DGrtlvvan 180 (366)
T COG3490 151 FQRVGEFSTHGI---GPHEVTLMADGRTLVVAN 180 (366)
T ss_pred cceecccccCCc---CcceeEEecCCcEEEEeC
Confidence 12333332 122 235688999999999873
No 420
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.16 E-value=2.8e+02 Score=34.30 Aligned_cols=85 Identities=14% Similarity=0.253 Sum_probs=50.6
Q ss_pred CeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcc--------eEEEEEecccccccEEEEEEccC
Q 001814 371 PISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSH--------VHLYKLHRGITSATIQDICFSHY 442 (1010)
Q Consensus 371 pIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~--------~~L~~L~RG~t~a~I~sIAFSpD 442 (1010)
.|..+..|+.|.++|-++.+| |.|..+......++ ..-+-.+.. ..+|.- .+.-.+.-.+|.||
T Consensus 105 eV~~vl~s~~GS~VaL~G~~G--i~vMeLp~rwG~~s---~~eDgk~~v~CRt~~i~~~~fts---s~~ltl~Qa~WHP~ 176 (741)
T KOG4460|consen 105 EVYQVLLSPTGSHVALIGIKG--LMVMELPKRWGKNS---EFEDGKSTVNCRTTPVAERFFTS---STSLTLKQAAWHPS 176 (741)
T ss_pred EEEEEEecCCCceEEEecCCe--eEEEEchhhcCccc---eecCCCceEEEEeecccceeecc---CCceeeeeccccCC
Confidence 466778899999999888877 44555411100000 000001111 122211 12225677899999
Q ss_pred C---CEEEEEeCCCeEEEEeCCCC
Q 001814 443 S---QWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 443 g---~~LAsgS~dGTVhIw~I~~~ 463 (1010)
+ ..|.+-++|.+++||+++..
T Consensus 177 S~~D~hL~iL~sdnviRiy~lS~~ 200 (741)
T KOG4460|consen 177 SILDPHLVLLTSDNVIRIYSLSEP 200 (741)
T ss_pred ccCCceEEEEecCcEEEEEecCCc
Confidence 7 78889999999999999753
No 421
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=24.16 E-value=1.8e+02 Score=33.81 Aligned_cols=63 Identities=16% Similarity=0.229 Sum_probs=37.5
Q ss_pred CCeEEEEECCCCCEEEEEEcCCCe----------------EEEEeCCCCcccCCCCCCccccCCcceEEEEEeccccccc
Q 001814 370 SPISALCFDPSGTLLVTASVYGNN----------------INIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSAT 433 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~dGt~----------------IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~ 433 (1010)
.....|+|.|||.+.++-+..+.. =.||++.|. | ..+..+..|+. .
T Consensus 124 ~~~~~l~~gpDG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pd------g----------~~~e~~a~G~r--n 185 (367)
T TIGR02604 124 HSLNSLAWGPDGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPD------G----------GKLRVVAHGFQ--N 185 (367)
T ss_pred ccccCceECCCCCEEEecccCCCceeccCCCccCcccccCceEEEEecC------C----------CeEEEEecCcC--C
Confidence 456789999999988766632110 024444332 1 12223334432 3
Q ss_pred EEEEEEccCCCEEEEEe
Q 001814 434 IQDICFSHYSQWIAIVS 450 (1010)
Q Consensus 434 I~sIAFSpDg~~LAsgS 450 (1010)
.+.++|+++|+++++-.
T Consensus 186 p~Gl~~d~~G~l~~tdn 202 (367)
T TIGR02604 186 PYGHSVDSWGDVFFCDN 202 (367)
T ss_pred CccceECCCCCEEEEcc
Confidence 58899999999887644
No 422
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=24.14 E-value=1.9e+03 Score=29.71 Aligned_cols=94 Identities=12% Similarity=0.151 Sum_probs=55.0
Q ss_pred CeEEEEECCCCcEEEEec--cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEe
Q 001814 349 GIVVVKDFVTRAIISQFK--AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLH 426 (1010)
Q Consensus 349 G~V~VwDl~s~~~v~~~~--aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~ 426 (1010)
..+++||+..++++...+ .-..-|+.++. .++.++.++.+.. ++.+.-.+. + .+++.+.
T Consensus 954 ~~l~~YdlG~K~lLRk~e~k~~p~~Is~iqt--~~~RI~VgD~qeS-V~~~~y~~~------~----------n~l~~fa 1014 (1205)
T KOG1898|consen 954 RFLRLYDLGKKKLLRKCELKFIPNRISSIQT--YGARIVVGDIQES-VHFVRYRRE------D----------NQLIVFA 1014 (1205)
T ss_pred cEEEEeeCChHHHHhhhhhccCceEEEEEee--cceEEEEeeccce-EEEEEEecC------C----------CeEEEEe
Confidence 357788887766554333 22445666665 4677888877655 444443331 1 3566653
Q ss_pred cccccccEEEEEEccCCCEEEEEeCCCeEEEEeCCC
Q 001814 427 RGITSATIQDICFSHYSQWIAIVSSKGTCHVFVLSP 462 (1010)
Q Consensus 427 RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I~~ 462 (1010)
-...+..|.++.+ -|-..+|.+-.=|.+++-++.+
T Consensus 1015 dD~~pR~Vt~~~~-lD~~tvagaDrfGNi~~vR~P~ 1049 (1205)
T KOG1898|consen 1015 DDPVPRHVTALEL-LDYDTVAGADRFGNIAVVRIPP 1049 (1205)
T ss_pred CCCccceeeEEEE-ecCCceeeccccCcEEEEECCC
Confidence 3222224565544 4666788887778888877765
No 423
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=24.09 E-value=4.4e+02 Score=30.19 Aligned_cols=56 Identities=18% Similarity=0.016 Sum_probs=41.9
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCC
Q 001814 345 MDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMP 401 (1010)
Q Consensus 345 gs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p 401 (1010)
+-..|.+.+.++.++..+..|..-..-=..-..+++|.++-.+|.|++ +..-|...
T Consensus 69 GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~~glIycgshd~~-~yalD~~~ 124 (354)
T KOG4649|consen 69 GCYSGGLYFLCVKTGSQIWNFVILETVKVRAQCDFDGGLIYCGSHDGN-FYALDPKT 124 (354)
T ss_pred EEccCcEEEEEecchhheeeeeehhhhccceEEcCCCceEEEecCCCc-EEEecccc
Confidence 567888999999999888877764332233467899999999999766 67777643
No 424
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=24.09 E-value=1.7e+02 Score=36.38 Aligned_cols=64 Identities=17% Similarity=0.307 Sum_probs=38.8
Q ss_pred EEEEECCCCCEEEEEEcCC-----CeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEE
Q 001814 373 SALCFDPSGTLLVTASVYG-----NNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIA 447 (1010)
Q Consensus 373 saLaFSPdGtlLATAS~dG-----t~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LA 447 (1010)
--|+|+|.|.|.+.-+-.+ +..=+|.+... + +..-.+..|..+-..+.+...+||||++.+.
T Consensus 503 Dnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~------~-------p~~g~~~rf~t~P~g~E~tG~~FspD~~TlF 569 (616)
T COG3211 503 DNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTP------D-------PKTGTIKRFLTGPIGCEFTGPCFSPDGKTLF 569 (616)
T ss_pred CceEECCCCCEEEEecCCCCccCcccccccccccC------C-------CccceeeeeccCCCcceeecceeCCCCceEE
Confidence 3488999998776543222 22334433210 1 1112455566676678999999999998776
Q ss_pred EE
Q 001814 448 IV 449 (1010)
Q Consensus 448 sg 449 (1010)
++
T Consensus 570 V~ 571 (616)
T COG3211 570 VN 571 (616)
T ss_pred EE
Confidence 64
No 425
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from high genetic variability. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=23.99 E-value=3.6e+02 Score=30.56 Aligned_cols=59 Identities=7% Similarity=0.042 Sum_probs=39.7
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCc-------EEEEEEcC-------CeEEEEeCC--eEEEEECCCCceeEEEeec
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSS-------VCMVRCSP-------RIVAVGLAT--QIYCFDALTLENKFSVLTY 230 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~-------V~sVa~S~-------rlLAV~ld~--~I~IwD~~Tle~l~tL~t~ 230 (1010)
|-+|-+||+++++.++++.|+.. ..++++.. .++-++..+ .|.|||+.+.+-...+..+
T Consensus 33 ~pKLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~~s~Rv~~~~ 107 (287)
T PF03022_consen 33 PPKLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATGKSWRVLHNS 107 (287)
T ss_dssp --EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTTEEEEEETCG
T ss_pred CcEEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCCcEEEEecCC
Confidence 45888999999999999998743 34555543 244455543 7999999998876665543
No 426
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=23.96 E-value=2.2e+02 Score=34.01 Aligned_cols=30 Identities=27% Similarity=0.310 Sum_probs=21.1
Q ss_pred EEEEeccC-CCCeEEEEECCCCCEEEEEEcC
Q 001814 361 IISQFKAH-TSPISALCFDPSGTLLVTASVY 390 (1010)
Q Consensus 361 ~v~~~~aH-tspIsaLaFSPdGtlLATAS~d 390 (1010)
++..|+.+ .+.=..|.|+|||+|.+|.+..
T Consensus 167 i~~~lP~~~~H~g~~l~f~pDG~Lyvs~G~~ 197 (399)
T COG2133 167 IFRGIPKGGHHFGGRLVFGPDGKLYVTTGSN 197 (399)
T ss_pred EeecCCCCCCcCcccEEECCCCcEEEEeCCC
Confidence 34455543 2455789999999999988775
No 427
>PF10395 Utp8: Utp8 family; InterPro: IPR018843 Utp8 is an essential component of the nuclear tRNA export machinery in Saccharomyces cerevisiae (Baker's yeast). It is a tRNA binding protein that acts at a step between tRNA maturation/aminoacylation, and translocation of the tRNA across the nuclear pore complex [].
Probab=23.67 E-value=7.2e+02 Score=31.84 Aligned_cols=49 Identities=20% Similarity=0.346 Sum_probs=37.0
Q ss_pred CCCEEEEEeCCCCeEEEEEeCCC-------cEEEEE-EcCCeEEEEeCCeEEEEECC
Q 001814 171 SPTAVRFYSFQSHCYEHVLRFRS-------SVCMVR-CSPRIVAVGLATQIYCFDAL 219 (1010)
Q Consensus 171 sp~tVrIWDlktge~V~tL~f~S-------~V~sVa-~S~rlLAV~ld~~I~IwD~~ 219 (1010)
...++.+|++-+-+..+++..+. .+.++. .+++++..+.+.+|++.|+.
T Consensus 249 ~~~~i~~ysip~f~~~~tI~l~~ii~~~~~~~vSl~~~s~nRvLLs~~nkIyLld~~ 305 (670)
T PF10395_consen 249 SKKTISSYSIPNFQIQKTISLPSIIDKESDDLVSLKPPSPNRVLLSVNNKIYLLDLK 305 (670)
T ss_pred eCCEEEEEEcCCceEEEEEEechhhccccccceEeecCCCCeEEEEcCCEEEEEeeh
Confidence 46899999998888888887652 233333 35688888889999999975
No 428
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=23.63 E-value=3.7e+02 Score=30.91 Aligned_cols=92 Identities=18% Similarity=0.277 Sum_probs=43.1
Q ss_pred CCCeE-EEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEE
Q 001814 347 NAGIV-VVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKL 425 (1010)
Q Consensus 347 ~dG~V-~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L 425 (1010)
..|.+ .-||-....-..+-+.-...|.+|.|+|+|.+.+.+ . |..|+.=+. +. ....|... +...
T Consensus 163 ~~G~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~-~-Gg~~~~s~~-~~--------~~~~w~~~---~~~~ 228 (302)
T PF14870_consen 163 SRGNFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA-R-GGQIQFSDD-PD--------DGETWSEP---IIPI 228 (302)
T ss_dssp TTSSEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE-T-TTEEEEEE--TT--------EEEEE------B-TT
T ss_pred CcccEEEEecCCCccceEEccCccceehhceecCCCCEEEEe-C-CcEEEEccC-CC--------Cccccccc---cCCc
Confidence 34443 345544322333344456799999999999987766 4 665655441 11 01122221 1111
Q ss_pred ecccccccEEEEEEccCCCEEEEEeCCCeE
Q 001814 426 HRGITSATIQDICFSHYSQWIAIVSSKGTC 455 (1010)
Q Consensus 426 ~RG~t~a~I~sIAFSpDg~~LAsgS~dGTV 455 (1010)
. ...-.|.+++|.+++...|+++ .|++
T Consensus 229 ~--~~~~~~ld~a~~~~~~~wa~gg-~G~l 255 (302)
T PF14870_consen 229 K--TNGYGILDLAYRPPNEIWAVGG-SGTL 255 (302)
T ss_dssp S--S--S-EEEEEESSSS-EEEEES-TT-E
T ss_pred c--cCceeeEEEEecCCCCEEEEeC-CccE
Confidence 0 1112489999999988887665 4554
No 429
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=23.43 E-value=3.9e+02 Score=36.38 Aligned_cols=35 Identities=23% Similarity=0.374 Sum_probs=31.0
Q ss_pred cCCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCC
Q 001814 367 AHTSPISALCFDPSGTLLVTASVYGNNINIFRIMPS 402 (1010)
Q Consensus 367 aHtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~ 402 (1010)
.|..||..|+.+.+-.+|.+=+++|+ |++|++.++
T Consensus 240 ~~~dpI~qi~ID~SR~IlY~lsek~~-v~~Y~i~~~ 274 (1311)
T KOG1900|consen 240 SSKDPIRQITIDNSRNILYVLSEKGT-VSAYDIGGN 274 (1311)
T ss_pred CCCCcceeeEeccccceeeeeccCce-EEEEEccCC
Confidence 57789999999999999999999876 999999753
No 430
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=22.80 E-value=32 Score=42.74 Aligned_cols=92 Identities=13% Similarity=0.216 Sum_probs=59.5
Q ss_pred CCCeEEEEECCCC----cEEEEecc-CCCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceE
Q 001814 347 NAGIVVVKDFVTR----AIISQFKA-HTSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVH 421 (1010)
Q Consensus 347 ~dG~V~VwDl~s~----~~v~~~~a-HtspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~ 421 (1010)
.+-.+.|||+.++ +.-..|.+ -....+.+|+..+-++|.+|.. -+.+++||+.-.+. .
T Consensus 127 nds~~~Iwdi~s~ltvPke~~~fs~~~l~gqns~cwlrd~klvlaGm~-sr~~~ifdlRqs~~----------------~ 189 (783)
T KOG1008|consen 127 NDSSLKIWDINSLLTVPKESPLFSSSTLDGQNSVCWLRDTKLVLAGMT-SRSVHIFDLRQSLD----------------S 189 (783)
T ss_pred ccCCccceecccccCCCccccccccccccCccccccccCcchhhcccc-cchhhhhhhhhhhh----------------h
Confidence 4667889999876 22234444 3345668899988888877776 56799999853211 0
Q ss_pred EEEEecccccccEEEEEEcc-CCCEEEEEeCCCeEEEEeC
Q 001814 422 LYKLHRGITSATIQDICFSH-YSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 422 L~~L~RG~t~a~I~sIAFSp-Dg~~LAsgS~dGTVhIw~I 460 (1010)
...+ ....++.+..+| ...++|+-+ ||-|-|||-
T Consensus 190 ~~sv----nTk~vqG~tVdp~~~nY~cs~~-dg~iAiwD~ 224 (783)
T KOG1008|consen 190 VSSV----NTKYVQGITVDPFSPNYFCSNS-DGDIAIWDT 224 (783)
T ss_pred hhhh----hhhhcccceecCCCCCceeccc-cCceeeccc
Confidence 0111 011245566677 667888777 999999994
No 431
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (EC 4.3.3.2), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom.; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=22.47 E-value=2.1e+02 Score=27.08 Aligned_cols=53 Identities=9% Similarity=0.111 Sum_probs=32.9
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEECCCCCEEEEEEc-CCCeEEEE
Q 001814 344 DMDNAGIVVVKDFVTRAIISQFKAHTSPISALCFDPSGTLLVTASV-YGNNINIF 397 (1010)
Q Consensus 344 sgs~dG~V~VwDl~s~~~v~~~~aHtspIsaLaFSPdGtlLATAS~-dGt~IrVw 397 (1010)
.+...|.+.-||..+++....+.+-. --+-++++||++.|+.+-. ..++.|.|
T Consensus 32 e~~~~GRll~ydp~t~~~~vl~~~L~-fpNGVals~d~~~vlv~Et~~~Ri~ryw 85 (89)
T PF03088_consen 32 EGRPTGRLLRYDPSTKETTVLLDGLY-FPNGVALSPDESFVLVAETGRYRILRYW 85 (89)
T ss_dssp HT---EEEEEEETTTTEEEEEEEEES-SEEEEEE-TTSSEEEEEEGGGTEEEEEE
T ss_pred cCCCCcCEEEEECCCCeEEEehhCCC-ccCeEEEcCCCCEEEEEeccCceEEEEE
Confidence 56678999999999987644444322 3377999999997776644 33434444
No 432
>PRK10115 protease 2; Provisional
Probab=21.87 E-value=2.2e+02 Score=36.21 Aligned_cols=62 Identities=13% Similarity=0.231 Sum_probs=38.5
Q ss_pred CCeEEEEECCCCCEEEEEEcC-CC---eEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCE
Q 001814 370 SPISALCFDPSGTLLVTASVY-GN---NINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQW 445 (1010)
Q Consensus 370 spIsaLaFSPdGtlLATAS~d-Gt---~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~ 445 (1010)
-.+..+.+||||++||-+-.. |. .|+|.++.+ | ..+-+. ..... ..++|++|++.
T Consensus 127 ~~l~~~~~Spdg~~la~~~d~~G~E~~~l~v~d~~t-------g----------~~l~~~---i~~~~-~~~~w~~D~~~ 185 (686)
T PRK10115 127 YTLGGMAITPDNTIMALAEDFLSRRQYGIRFRNLET-------G----------NWYPEL---LDNVE-PSFVWANDSWT 185 (686)
T ss_pred EEEeEEEECCCCCEEEEEecCCCcEEEEEEEEECCC-------C----------CCCCcc---ccCcc-eEEEEeeCCCE
Confidence 457778999999988876443 32 366666642 3 111111 11122 45999999998
Q ss_pred EEEEeCC
Q 001814 446 IAIVSSK 452 (1010)
Q Consensus 446 LAsgS~d 452 (1010)
|+.+..+
T Consensus 186 ~~y~~~~ 192 (686)
T PRK10115 186 FYYVRKH 192 (686)
T ss_pred EEEEEec
Confidence 8887764
No 433
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=21.58 E-value=2.7e+02 Score=30.65 Aligned_cols=60 Identities=18% Similarity=0.407 Sum_probs=40.3
Q ss_pred EEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCccccCCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCC
Q 001814 374 ALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKG 453 (1010)
Q Consensus 374 aLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dG 453 (1010)
+++-++.+.+++.++ |..|.+|++... + ...+++|. |-..|..|+++.-|.+|++-=.+.
T Consensus 22 ~~c~~g~d~Lfva~~--g~~Vev~~l~~~------~---------~~~~~~F~---Tv~~V~~l~y~~~GDYlvTlE~k~ 81 (215)
T PF14761_consen 22 AVCCGGPDALFVAAS--GCKVEVYDLEQE------E---------CPLLCTFS---TVGRVLQLVYSEAGDYLVTLEEKN 81 (215)
T ss_pred eeeccCCceEEEEcC--CCEEEEEEcccC------C---------CceeEEEc---chhheeEEEeccccceEEEEEeec
Confidence 344444344544433 678999999732 1 25677774 445799999999999999975543
No 434
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=21.51 E-value=1.4e+02 Score=32.43 Aligned_cols=38 Identities=13% Similarity=0.258 Sum_probs=28.6
Q ss_pred EEEeCCCcEEEEEEcCCe-EEEEeCCeEEEEECCCCcee
Q 001814 187 HVLRFRSSVCMVRCSPRI-VAVGLATQIYCFDALTLENK 224 (1010)
Q Consensus 187 ~tL~f~S~V~sVa~S~rl-LAV~ld~~I~IwD~~Tle~l 224 (1010)
-.|...++|.-+.++.++ +++...+.+++||+.+++..
T Consensus 7 P~i~Lgs~~~~l~~~~~~Ll~iT~~G~l~vWnl~~~k~~ 45 (219)
T PF07569_consen 7 PPIVLGSPVSFLECNGSYLLAITSSGLLYVWNLKKGKAV 45 (219)
T ss_pred CcEecCCceEEEEeCCCEEEEEeCCCeEEEEECCCCeec
Confidence 345667777778888765 45667789999999988754
No 435
>PF10214 Rrn6: RNA polymerase I-specific transcription-initiation factor; InterPro: IPR019350 RNA polymerase I-specific transcription-initiation factor Rrn6 and Rrn7 represent components of a multisubunit transcription factor essential for the initiation of rDNA transcription by Pol I. These proteins are found in fungi.
Probab=21.49 E-value=1.8e+03 Score=28.51 Aligned_cols=102 Identities=16% Similarity=0.092 Sum_probs=58.9
Q ss_pred cccccCCCCeEEEEECCCC-----cEEEEeccC----------CCCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCccc
Q 001814 341 AGADMDNAGIVVVKDFVTR-----AIISQFKAH----------TSPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMR 405 (1010)
Q Consensus 341 ~iasgs~dG~V~VwDl~s~-----~~v~~~~aH----------tspIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~ 405 (1010)
++|..+..|.-.|||+... ..+.....+ .+.-..+.|.++-+.|+.++. +.+.++|+.+.
T Consensus 160 q~AiVD~~G~Wsvw~i~~~~~~~~~~~~~~~~~~gsi~~d~~e~s~w~rI~W~~~~~~lLv~~r--~~l~~~d~~~~--- 234 (765)
T PF10214_consen 160 QFAIVDEKGNWSVWDIKGRPKRKSSNLRLSRNISGSIIFDPEELSNWKRILWVSDSNRLLVCNR--SKLMLIDFESN--- 234 (765)
T ss_pred eEEEEeccCcEEEEEeccccccCCcceeeccCCCccccCCCcccCcceeeEecCCCCEEEEEcC--CceEEEECCCC---
Confidence 3455678899999999211 111111111 133347889888777777876 45889998642
Q ss_pred CCCCCCccccCCcceEEEEEecccccccEEEEEEccC--CCEEEEEeCCCeEEEEeCCCC
Q 001814 406 SGSGNHKYDWNSSHVHLYKLHRGITSATIQDICFSHY--SQWIAIVSSKGTCHVFVLSPF 463 (1010)
Q Consensus 406 ~~sG~~~~~~~~s~~~L~~L~RG~t~a~I~sIAFSpD--g~~LAsgS~dGTVhIw~I~~~ 463 (1010)
|. ..+ +....+..+|.++.=+++ +..++..+ ..|..+++.+.
T Consensus 235 ---------~~---~~~--l~~~~~~~~IlDv~~~~~~~~~~FiLTs--~eiiw~~~~~~ 278 (765)
T PF10214_consen 235 ---------WQ---TEY--LVTAKTWSWILDVKRSPDNPSHVFILTS--KEIIWLDVKSS 278 (765)
T ss_pred ---------Cc---cch--hccCCChhheeeEEecCCccceEEEEec--CeEEEEEccCC
Confidence 00 011 222224457999998888 44444433 35666677664
No 436
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=20.88 E-value=91 Score=41.34 Aligned_cols=95 Identities=11% Similarity=0.024 Sum_probs=56.9
Q ss_pred CCCeEEEEECCCCcE-----EEEeccCC------CCeEEEEECCCCCEEEEEEcCCCeEEEEeCCCCcccCCCCCCcccc
Q 001814 347 NAGIVVVKDFVTRAI-----ISQFKAHT------SPISALCFDPSGTLLVTASVYGNNINIFRIMPSCMRSGSGNHKYDW 415 (1010)
Q Consensus 347 ~dG~V~VwDl~s~~~-----v~~~~aHt------spIsaLaFSPdGtlLATAS~dGt~IrVwdi~p~~~~~~sG~~~~~~ 415 (1010)
++-.|..||+.+... -.-|..|. .-..++.|+|.=-+-..++..+..|+|+.+.-.
T Consensus 122 ng~~v~~fD~~~fs~s~~~~~~pl~~s~ts~ek~vf~~~~~wnP~vp~n~av~l~dlsl~V~~~~~~------------- 188 (1405)
T KOG3630|consen 122 NGEAVYSFDLEEFSESRYETTVPLKNSATSFEKPVFQLKNVWNPLVPLNSAVDLSDLSLRVKSTKQL------------- 188 (1405)
T ss_pred CCceEEEEehHhhhhhhhhhccccccccchhccccccccccccCCccchhhhhccccchhhhhhhhh-------------
Confidence 344688999976321 11222222 123467777765433333333455888887421
Q ss_pred CCcceEEEEEecccccccEEEEEEccCCCEEEEEeCCCeEEEEeC
Q 001814 416 NSSHVHLYKLHRGITSATIQDICFSHYSQWIAIVSSKGTCHVFVL 460 (1010)
Q Consensus 416 ~~s~~~L~~L~RG~t~a~I~sIAFSpDg~~LAsgS~dGTVhIw~I 460 (1010)
...+..+ .-.....+++|||-|+.+++|=..||+.-|.-
T Consensus 189 ---~~~v~s~---p~t~~~Tav~WSprGKQl~iG~nnGt~vQy~P 227 (1405)
T KOG3630|consen 189 ---AQNVTSF---PVTNSQTAVLWSPRGKQLFIGRNNGTEVQYEP 227 (1405)
T ss_pred ---hhhhccc---CcccceeeEEeccccceeeEecCCCeEEEeec
Confidence 0111122 12245899999999999999999999988763
No 437
>PF12341 DUF3639: Protein of unknown function (DUF3639); InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=20.47 E-value=1.8e+02 Score=21.90 Aligned_cols=25 Identities=28% Similarity=0.635 Sum_probs=20.6
Q ss_pred cEEEEEEccCCCEEEEEeCCCeEEEEe
Q 001814 433 TIQDICFSHYSQWIAIVSSKGTCHVFV 459 (1010)
Q Consensus 433 ~I~sIAFSpDg~~LAsgS~dGTVhIw~ 459 (1010)
.|.+|+-++ .|+|++++.+-++||.
T Consensus 3 ~i~aia~g~--~~vavaTS~~~lRifs 27 (27)
T PF12341_consen 3 EIEAIAAGD--SWVAVATSAGYLRIFS 27 (27)
T ss_pred eEEEEEccC--CEEEEEeCCCeEEecC
Confidence 367777775 5999999999999984
No 438
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=20.37 E-value=2e+03 Score=28.46 Aligned_cols=58 Identities=12% Similarity=-0.017 Sum_probs=36.4
Q ss_pred CCEEEEEeCCCCeEEEEEeCCCcE-------------EEEEEc----CCeEEEEe-----------CCeEEEEECCCCce
Q 001814 172 PTAVRFYSFQSHCYEHVLRFRSSV-------------CMVRCS----PRIVAVGL-----------ATQIYCFDALTLEN 223 (1010)
Q Consensus 172 p~tVrIWDlktge~V~tL~f~S~V-------------~sVa~S----~rlLAV~l-----------d~~I~IwD~~Tle~ 223 (1010)
+..|.=.|.++|+.+-.+..++.| +.+... ...|+++. .+.|+-||+.|++.
T Consensus 269 Dg~LiALDA~TGk~~W~fg~~G~vdl~~~~g~~~~g~~~~ts~P~V~~g~VIvG~~v~d~~~~~~~~G~I~A~Da~TGkl 348 (764)
T TIGR03074 269 DARLIALDADTGKLCEDFGNNGTVDLTAGMGTTPPGYYYPTSPPLVAGTTVVIGGRVADNYSTDEPSGVIRAFDVNTGAL 348 (764)
T ss_pred CCeEEEEECCCCCEEEEecCCCceeeecccCcCCCcccccccCCEEECCEEEEEecccccccccCCCcEEEEEECCCCcE
Confidence 456666788888877654333222 111222 24566653 35699999999999
Q ss_pred eEEEee
Q 001814 224 KFSVLT 229 (1010)
Q Consensus 224 l~tL~t 229 (1010)
++....
T Consensus 349 ~W~~~~ 354 (764)
T TIGR03074 349 VWAWDP 354 (764)
T ss_pred eeEEec
Confidence 888764
Done!