Query 003310
Match_columns 832
No_of_seqs 398 out of 2361
Neff 6.1
Searched_HMMs 46136
Date Thu Mar 28 21:04:56 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/003310.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/003310hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF12490 BCAS3: Breast carcino 100.0 2E-59 4.2E-64 495.6 16.6 241 457-698 1-251 (251)
2 KOG2109 WD40 repeat protein [G 100.0 1.4E-52 2.9E-57 469.2 17.3 602 2-712 41-653 (788)
3 KOG2110 Uncharacterized conser 100.0 1.2E-32 2.6E-37 293.5 31.0 230 17-412 16-250 (391)
4 KOG2111 Uncharacterized conser 100.0 2E-29 4.3E-34 264.9 26.1 238 17-412 16-258 (346)
5 KOG0271 Notchless-like WD40 re 99.9 3.4E-23 7.5E-28 221.6 18.3 280 16-426 167-455 (480)
6 KOG0263 Transcription initiati 99.9 4.9E-21 1.1E-25 220.4 21.3 179 116-415 472-654 (707)
7 KOG0315 G-protein beta subunit 99.9 5.1E-20 1.1E-24 188.7 26.0 276 19-424 12-302 (311)
8 KOG0272 U4/U6 small nuclear ri 99.9 5.3E-21 1.1E-25 207.3 18.0 242 18-426 188-434 (459)
9 KOG0271 Notchless-like WD40 re 99.8 8.7E-20 1.9E-24 195.7 18.1 232 117-424 137-411 (480)
10 KOG0318 WD40 repeat stress pro 99.8 6.2E-18 1.3E-22 187.4 30.0 332 18-455 203-589 (603)
11 KOG0291 WD40-repeat-containing 99.8 2.7E-18 5.9E-23 196.2 27.5 278 18-421 277-561 (893)
12 KOG0272 U4/U6 small nuclear ri 99.8 6.4E-19 1.4E-23 191.3 16.8 191 115-426 195-391 (459)
13 cd00200 WD40 WD40 domain, foun 99.8 1.1E-16 2.3E-21 163.2 30.1 221 19-408 65-289 (289)
14 KOG0286 G-protein beta subunit 99.8 3.4E-17 7.5E-22 171.3 26.7 232 117-426 77-319 (343)
15 KOG0279 G protein beta subunit 99.8 2.4E-17 5.3E-22 171.6 24.7 186 114-419 82-271 (315)
16 cd00200 WD40 WD40 domain, foun 99.8 2.6E-16 5.6E-21 160.4 30.8 236 18-422 21-261 (289)
17 KOG0273 Beta-transducin family 99.8 6.7E-17 1.4E-21 177.5 27.1 269 18-410 247-523 (524)
18 PLN00181 protein SPA1-RELATED; 99.8 1.1E-16 2.4E-21 195.9 31.3 236 18-409 546-792 (793)
19 KOG0295 WD40 repeat-containing 99.8 1.3E-17 2.8E-22 178.6 19.0 239 18-409 162-405 (406)
20 KOG0315 G-protein beta subunit 99.8 9E-17 2E-21 165.1 22.1 230 115-422 18-258 (311)
21 KOG0266 WD40 repeat-containing 99.8 1.5E-16 3.3E-21 183.1 26.3 192 115-424 179-378 (456)
22 KOG0266 WD40 repeat-containing 99.8 2.6E-16 5.7E-21 181.2 27.9 236 20-422 174-421 (456)
23 KOG0295 WD40 repeat-containing 99.7 6.1E-17 1.3E-21 173.4 18.8 236 18-422 120-376 (406)
24 KOG0263 Transcription initiati 99.7 8.6E-17 1.9E-21 185.7 19.2 116 294-426 508-623 (707)
25 KOG1446 Histone H3 (Lys4) meth 99.7 7.9E-15 1.7E-19 155.3 29.7 238 17-420 25-272 (311)
26 KOG0286 G-protein beta subunit 99.7 3E-15 6.4E-20 156.9 25.9 222 18-408 110-343 (343)
27 KOG0268 Sof1-like rRNA process 99.7 9.5E-17 2.1E-21 171.8 14.7 255 18-412 80-347 (433)
28 PLN00181 protein SPA1-RELATED; 99.7 8.1E-15 1.8E-19 179.6 32.8 181 116-412 554-740 (793)
29 KOG0265 U5 snRNP-specific prot 99.7 9.9E-16 2.1E-20 161.0 21.0 245 18-425 59-311 (338)
30 KOG0292 Vesicle coat complex C 99.7 6.2E-16 1.3E-20 179.2 21.0 238 16-420 19-290 (1202)
31 KOG0279 G protein beta subunit 99.7 4.8E-15 1E-19 154.7 25.4 224 18-411 76-314 (315)
32 KOG0278 Serine/threonine kinas 99.7 2.2E-16 4.8E-21 162.2 15.1 184 18-354 113-299 (334)
33 KOG0281 Beta-TrCP (transducin 99.7 3.8E-17 8.3E-22 173.6 9.4 213 26-415 216-433 (499)
34 PTZ00421 coronin; Provisional 99.7 2.5E-14 5.5E-19 165.9 32.7 179 117-414 98-294 (493)
35 KOG0274 Cdc4 and related F-box 99.7 1.9E-15 4.1E-20 176.2 23.3 225 19-418 263-490 (537)
36 KOG0319 WD40-repeat-containing 99.7 2.3E-15 5E-20 172.6 23.3 289 17-425 293-592 (775)
37 KOG0288 WD40 repeat protein Ti 99.7 5.1E-16 1.1E-20 168.5 17.0 225 16-408 230-459 (459)
38 KOG0284 Polyadenylation factor 99.7 1.6E-16 3.4E-21 172.1 11.4 224 18-411 109-338 (464)
39 KOG0274 Cdc4 and related F-box 99.7 3.8E-15 8.3E-20 173.6 23.5 231 18-420 219-451 (537)
40 KOG0273 Beta-transducin family 99.7 3.5E-15 7.6E-20 164.1 21.7 189 114-425 254-455 (524)
41 KOG0285 Pleiotropic regulator 99.7 2.2E-15 4.7E-20 161.1 18.7 190 115-426 171-364 (460)
42 KOG0318 WD40 repeat stress pro 99.7 7.2E-14 1.6E-18 155.5 31.0 108 295-418 164-273 (603)
43 KOG0285 Pleiotropic regulator 99.7 4.1E-15 8.8E-20 159.0 19.9 226 16-412 161-391 (460)
44 KOG0319 WD40-repeat-containing 99.7 3.8E-15 8.3E-20 170.8 20.5 227 117-426 303-551 (775)
45 KOG0276 Vesicle coat complex C 99.7 3.7E-15 8.1E-20 168.0 19.6 233 18-418 26-265 (794)
46 KOG0276 Vesicle coat complex C 99.7 2.7E-15 5.8E-20 169.1 18.4 189 117-426 35-231 (794)
47 KOG0643 Translation initiation 99.6 2.7E-14 5.9E-19 148.3 23.7 239 18-411 23-318 (327)
48 KOG0265 U5 snRNP-specific prot 99.6 1.4E-14 3E-19 152.5 21.5 239 1-407 71-335 (338)
49 KOG0306 WD40-repeat-containing 99.6 8.1E-15 1.8E-19 168.1 20.1 226 17-411 423-665 (888)
50 KOG0282 mRNA splicing factor [ 99.6 1.1E-15 2.3E-20 168.8 11.5 243 18-424 228-476 (503)
51 KOG1036 Mitotic spindle checkp 99.6 1.2E-14 2.6E-19 153.4 18.9 219 18-400 66-294 (323)
52 KOG0316 Conserved WD40 repeat- 99.6 2.2E-14 4.8E-19 146.7 19.1 232 18-421 30-268 (307)
53 KOG0281 Beta-TrCP (transducin 99.6 2E-15 4.3E-20 160.7 11.7 227 18-412 247-479 (499)
54 KOG0291 WD40-repeat-containing 99.6 1.1E-13 2.4E-18 159.0 26.6 250 15-419 359-621 (893)
55 PTZ00420 coronin; Provisional 99.6 4.9E-13 1.1E-17 156.8 32.1 103 297-412 185-295 (568)
56 KOG1407 WD40 repeat protein [F 99.6 4.6E-14 1E-18 146.2 20.1 104 299-419 167-270 (313)
57 KOG0310 Conserved WD40 repeat- 99.6 4.5E-14 9.8E-19 156.2 20.8 235 4-409 64-308 (487)
58 KOG0647 mRNA export protein (c 99.6 2.7E-14 5.8E-19 150.4 17.0 224 18-340 85-312 (347)
59 TIGR03866 PQQ_ABC_repeats PQQ- 99.6 3.6E-12 7.8E-17 134.7 33.2 221 117-419 53-288 (300)
60 KOG0288 WD40 repeat protein Ti 99.6 1.4E-14 3E-19 157.5 14.1 235 18-420 188-427 (459)
61 PTZ00420 coronin; Provisional 99.6 4.6E-13 1E-17 157.0 27.0 106 295-417 142-255 (568)
62 KOG0316 Conserved WD40 repeat- 99.6 1E-13 2.2E-18 141.8 18.7 183 117-424 39-227 (307)
63 KOG0296 Angio-associated migra 99.6 3E-13 6.5E-18 145.5 23.0 235 16-410 158-398 (399)
64 KOG0313 Microtubule binding pr 99.6 5.3E-13 1.1E-17 144.1 23.3 267 17-426 115-393 (423)
65 PTZ00421 coronin; Provisional 99.6 7.2E-13 1.6E-17 153.9 26.1 105 295-415 142-250 (493)
66 KOG0305 Anaphase promoting com 99.5 4.5E-13 9.7E-18 152.6 23.4 242 17-412 187-463 (484)
67 KOG0292 Vesicle coat complex C 99.5 1.5E-13 3.2E-18 159.8 18.3 201 117-425 31-253 (1202)
68 KOG0264 Nucleosome remodeling 99.5 2.2E-13 4.8E-18 149.9 18.4 109 294-418 243-355 (422)
69 KOG0277 Peroxisomal targeting 99.5 2.7E-13 5.9E-18 140.1 17.9 223 28-415 39-270 (311)
70 KOG0310 Conserved WD40 repeat- 99.5 2.1E-13 4.4E-18 151.1 18.1 230 20-418 41-276 (487)
71 KOG0973 Histone transcription 99.5 1.9E-13 4E-18 162.9 18.8 241 294-546 28-363 (942)
72 KOG2055 WD40 repeat protein [G 99.5 1.9E-13 4.2E-18 150.3 16.7 178 117-412 325-514 (514)
73 KOG0645 WD40 repeat protein [G 99.5 4.7E-12 1E-16 132.0 25.3 176 117-410 37-225 (312)
74 KOG0275 Conserved WD40 repeat- 99.5 8.8E-14 1.9E-18 147.0 12.4 111 294-421 278-389 (508)
75 KOG0284 Polyadenylation factor 99.5 8.8E-14 1.9E-18 151.1 12.2 243 5-415 128-385 (464)
76 KOG1539 WD repeat protein [Gen 99.5 1.4E-11 3.1E-16 143.3 30.3 120 297-418 466-614 (910)
77 KOG0308 Conserved WD40 repeat- 99.5 1.8E-12 4E-17 147.3 20.9 107 294-417 186-292 (735)
78 KOG0313 Microtubule binding pr 99.5 4.2E-12 9.2E-17 137.2 22.1 99 297-412 318-420 (423)
79 KOG0645 WD40 repeat protein [G 99.5 2.6E-11 5.6E-16 126.6 26.2 239 20-410 30-311 (312)
80 KOG0772 Uncharacterized conser 99.5 1.3E-12 2.8E-17 145.2 16.8 114 292-418 282-402 (641)
81 KOG1539 WD repeat protein [Gen 99.5 7.3E-12 1.6E-16 145.7 23.5 172 116-410 469-648 (910)
82 TIGR03866 PQQ_ABC_repeats PQQ- 99.4 2.1E-10 4.6E-15 121.1 32.9 180 117-418 11-195 (300)
83 KOG0264 Nucleosome remodeling 99.4 8.1E-12 1.7E-16 137.7 21.9 115 294-411 288-405 (422)
84 KOG0282 mRNA splicing factor [ 99.4 3.3E-13 7.2E-18 149.2 10.8 177 115-411 235-416 (503)
85 KOG0277 Peroxisomal targeting 99.4 1.7E-12 3.8E-17 134.2 15.1 176 115-409 125-308 (311)
86 KOG0308 Conserved WD40 repeat- 99.4 5.3E-13 1.1E-17 151.6 11.5 113 295-424 134-257 (735)
87 KOG0289 mRNA splicing factor [ 99.4 1.4E-11 3.1E-16 134.9 21.4 228 18-411 232-463 (506)
88 KOG0306 WD40-repeat-containing 99.4 2.8E-11 6.1E-16 139.5 24.8 233 117-423 394-635 (888)
89 KOG0269 WD40 repeat-containing 99.4 9E-13 2E-17 152.1 12.1 118 294-427 192-314 (839)
90 KOG0275 Conserved WD40 repeat- 99.4 6E-13 1.3E-17 140.8 9.3 209 113-426 231-483 (508)
91 KOG0772 Uncharacterized conser 99.4 7.3E-12 1.6E-16 139.4 18.2 111 294-419 332-454 (641)
92 KOG0640 mRNA cleavage stimulat 99.4 4.8E-12 1E-16 133.7 15.6 105 301-420 238-345 (430)
93 KOG0643 Translation initiation 99.4 4.4E-11 9.5E-16 124.7 22.0 188 117-420 32-230 (327)
94 KOG1408 WD40 repeat protein [F 99.4 1.2E-11 2.5E-16 141.4 18.3 214 117-411 481-714 (1080)
95 KOG0641 WD40 repeat protein [G 99.4 3.1E-10 6.8E-15 115.8 26.3 240 18-410 101-349 (350)
96 KOG0293 WD40 repeat-containing 99.4 7E-12 1.5E-16 136.5 15.3 224 18-412 282-515 (519)
97 KOG0973 Histone transcription 99.4 1.1E-10 2.3E-15 139.7 26.1 289 18-411 26-356 (942)
98 KOG2096 WD40 repeat protein [G 99.4 8E-11 1.7E-15 125.1 21.7 96 312-409 269-401 (420)
99 KOG1407 WD40 repeat protein [F 99.4 1.9E-10 4.2E-15 119.7 23.7 163 117-401 128-293 (313)
100 KOG0267 Microtubule severing p 99.3 1.4E-12 3.1E-17 149.5 8.4 164 117-401 92-259 (825)
101 KOG4283 Transcription-coupled 99.3 6.6E-11 1.4E-15 124.6 19.2 110 299-412 166-278 (397)
102 KOG1446 Histone H3 (Lys4) meth 99.3 4.4E-10 9.5E-15 119.7 25.0 97 299-414 78-174 (311)
103 KOG1274 WD40 repeat protein [G 99.3 2E-10 4.3E-15 135.3 24.3 100 297-411 156-263 (933)
104 KOG0278 Serine/threonine kinas 99.3 4.4E-12 9.5E-17 130.9 9.0 220 113-414 77-301 (334)
105 KOG0305 Anaphase promoting com 99.3 5.7E-11 1.2E-15 135.6 18.5 179 117-418 197-384 (484)
106 KOG0299 U3 snoRNP-associated p 99.3 9.7E-11 2.1E-15 129.4 18.1 213 18-398 215-443 (479)
107 KOG0296 Angio-associated migra 99.3 3.2E-09 6.9E-14 114.9 27.4 183 117-419 86-272 (399)
108 KOG2109 WD40 repeat protein [G 99.3 6.5E-12 1.4E-16 143.4 7.2 321 18-423 252-589 (788)
109 KOG0639 Transducin-like enhanc 99.3 5.6E-11 1.2E-15 131.9 14.0 112 298-413 528-665 (705)
110 KOG1408 WD40 repeat protein [F 99.3 4.9E-10 1.1E-14 128.5 21.8 125 293-421 473-637 (1080)
111 KOG0270 WD40 repeat-containing 99.2 3.4E-10 7.4E-15 124.7 19.9 103 296-413 347-452 (463)
112 KOG0283 WD40 repeat-containing 99.2 9.5E-11 2.1E-15 137.2 16.3 110 292-420 381-491 (712)
113 KOG0267 Microtubule severing p 99.2 8E-12 1.7E-16 143.5 6.8 180 117-417 50-233 (825)
114 KOG0301 Phospholipase A2-activ 99.2 3.1E-10 6.7E-15 130.3 19.6 168 115-412 120-290 (745)
115 KOG0294 WD40 repeat-containing 99.2 1.5E-10 3.3E-15 123.0 15.7 61 293-354 99-159 (362)
116 KOG1273 WD40 repeat protein [G 99.2 5.9E-10 1.3E-14 118.5 19.5 240 18-412 35-282 (405)
117 KOG4378 Nuclear protein COP1 [ 99.2 1.3E-10 2.9E-15 128.9 14.8 188 117-424 101-295 (673)
118 KOG0639 Transducin-like enhanc 99.2 2.8E-10 6E-15 126.5 17.3 261 17-409 431-703 (705)
119 KOG0647 mRNA export protein (c 99.2 2E-09 4.3E-14 114.1 22.9 187 116-422 93-293 (347)
120 KOG0289 mRNA splicing factor [ 99.2 2.9E-10 6.3E-15 124.9 16.9 108 295-419 319-428 (506)
121 KOG4328 WD40 protein [Function 99.2 9.1E-10 2E-14 121.7 20.4 99 299-410 299-399 (498)
122 KOG2106 Uncharacterized conser 99.2 2.2E-08 4.8E-13 111.9 31.2 104 293-418 382-485 (626)
123 KOG0307 Vesicle coat complex C 99.2 6.6E-11 1.4E-15 142.1 12.2 234 19-413 81-330 (1049)
124 KOG0283 WD40 repeat-containing 99.2 3.3E-10 7.2E-15 132.7 17.5 180 112-413 385-579 (712)
125 KOG0301 Phospholipase A2-activ 99.2 7E-10 1.5E-14 127.5 19.1 206 117-410 35-249 (745)
126 KOG0300 WD40 repeat-containing 99.2 3.3E-10 7.2E-15 120.1 15.2 110 294-420 287-397 (481)
127 KOG2919 Guanine nucleotide-bin 99.2 5.6E-10 1.2E-14 119.0 16.8 108 302-423 231-341 (406)
128 KOG0299 U3 snoRNP-associated p 99.2 1.2E-09 2.7E-14 120.8 20.1 177 117-412 224-412 (479)
129 KOG0293 WD40 repeat-containing 99.2 1.6E-10 3.4E-15 126.2 12.5 185 18-353 325-514 (519)
130 KOG2048 WD40 repeat protein [G 99.2 8.4E-09 1.8E-13 118.6 26.8 101 298-415 87-189 (691)
131 KOG0302 Ribosome Assembly prot 99.2 2.5E-10 5.3E-15 123.7 13.5 115 294-423 227-348 (440)
132 KOG1332 Vesicle coat complex C 99.2 2.6E-09 5.7E-14 110.5 19.9 248 18-414 24-290 (299)
133 KOG2445 Nuclear pore complex c 99.2 1.4E-08 2.9E-13 108.1 25.5 107 300-410 198-318 (361)
134 KOG2096 WD40 repeat protein [G 99.2 8.1E-10 1.8E-14 117.6 16.2 107 294-410 202-308 (420)
135 KOG4283 Transcription-coupled 99.1 5.4E-09 1.2E-13 110.4 21.7 136 115-354 122-278 (397)
136 KOG0294 WD40 repeat-containing 99.1 2.2E-09 4.8E-14 114.3 18.6 107 295-418 57-165 (362)
137 KOG1034 Transcriptional repres 99.1 1.4E-09 3.1E-14 116.2 17.1 250 18-409 106-382 (385)
138 KOG0268 Sof1-like rRNA process 99.1 3.9E-10 8.3E-15 121.7 12.7 218 114-413 86-305 (433)
139 KOG0641 WD40 repeat protein [G 99.1 1.1E-08 2.5E-13 104.5 21.7 101 293-410 196-303 (350)
140 KOG1274 WD40 repeat protein [G 99.1 5.8E-09 1.3E-13 123.2 22.4 173 116-407 117-297 (933)
141 KOG0646 WD40 repeat protein [G 99.1 6.9E-10 1.5E-14 122.9 14.0 106 299-412 101-208 (476)
142 KOG1272 WD40-repeat-containing 99.1 1.6E-10 3.5E-15 127.7 8.7 205 117-409 151-361 (545)
143 KOG0640 mRNA cleavage stimulat 99.1 1.6E-09 3.5E-14 114.9 15.6 112 294-409 276-425 (430)
144 KOG1036 Mitotic spindle checkp 99.1 9.8E-09 2.1E-13 109.2 21.5 187 116-418 74-270 (323)
145 KOG0302 Ribosome Assembly prot 99.1 5E-10 1.1E-14 121.4 11.7 106 292-412 271-380 (440)
146 KOG2055 WD40 repeat protein [G 99.1 8.6E-09 1.9E-13 114.2 21.5 98 297-411 321-418 (514)
147 KOG0321 WD40 repeat-containing 99.1 1.1E-09 2.4E-14 124.8 14.6 114 299-426 238-364 (720)
148 PRK11028 6-phosphogluconolacto 99.0 1.3E-07 2.9E-12 103.9 28.8 104 300-416 196-310 (330)
149 KOG1332 Vesicle coat complex C 99.0 2.6E-09 5.7E-14 110.5 14.0 200 294-538 26-241 (299)
150 PRK01742 tolB translocation pr 99.0 6.8E-08 1.5E-12 110.8 27.1 78 325-422 336-415 (429)
151 KOG0646 WD40 repeat protein [G 99.0 7.1E-09 1.5E-13 115.0 18.0 120 294-418 191-315 (476)
152 KOG1188 WD40 repeat protein [G 99.0 3.3E-09 7E-14 114.1 14.8 105 300-418 142-250 (376)
153 KOG2048 WD40 repeat protein [G 99.0 3.8E-08 8.3E-13 113.3 24.0 231 17-409 80-318 (691)
154 PRK11028 6-phosphogluconolacto 99.0 2.2E-07 4.7E-12 102.2 29.5 112 299-421 146-269 (330)
155 KOG0269 WD40 repeat-containing 99.0 6E-10 1.3E-14 129.2 8.4 112 294-421 149-261 (839)
156 KOG0650 WD40 repeat nucleolar 99.0 3.7E-09 8.1E-14 119.9 14.5 95 300-410 586-680 (733)
157 COG2319 FOG: WD40 repeat [Gene 99.0 7.6E-07 1.7E-11 94.6 30.6 184 116-419 133-323 (466)
158 KOG0322 G-protein beta subunit 99.0 5.8E-09 1.3E-13 109.0 13.4 70 323-409 253-322 (323)
159 KOG1007 WD repeat protein TSSC 99.0 2.6E-08 5.6E-13 105.2 18.1 117 294-411 230-362 (370)
160 KOG4328 WD40 protein [Function 99.0 1.5E-08 3.4E-13 112.2 17.1 108 295-410 339-450 (498)
161 KOG1009 Chromatin assembly com 98.9 3.5E-09 7.7E-14 115.8 10.4 129 294-424 29-167 (434)
162 PRK03629 tolB translocation pr 98.9 8.5E-07 1.8E-11 101.9 30.0 101 302-422 312-417 (429)
163 KOG0300 WD40 repeat-containing 98.9 9.1E-08 2E-12 101.9 20.0 97 297-411 332-429 (481)
164 KOG2106 Uncharacterized conser 98.9 4.5E-07 9.9E-12 101.7 26.4 92 299-408 426-519 (626)
165 KOG1445 Tumor-specific antigen 98.9 4.5E-09 9.7E-14 119.4 10.4 101 295-412 644-752 (1012)
166 KOG1587 Cytoplasmic dynein int 98.9 1.3E-07 2.8E-12 110.8 22.5 102 294-411 414-517 (555)
167 KOG0321 WD40 repeat-containing 98.9 3.3E-08 7.2E-13 113.1 16.7 102 298-416 290-397 (720)
168 KOG1273 WD40 repeat protein [G 98.9 6.4E-09 1.4E-13 110.8 9.2 127 294-421 38-194 (405)
169 KOG0650 WD40 repeat nucleolar 98.9 2.1E-07 4.5E-12 106.1 21.7 299 16-407 410-732 (733)
170 COG2319 FOG: WD40 repeat [Gene 98.9 3.3E-06 7.1E-11 89.7 29.6 221 26-413 133-362 (466)
171 KOG0290 Conserved WD40 repeat- 98.8 2E-07 4.3E-12 98.8 18.1 90 300-401 265-357 (364)
172 KOG2110 Uncharacterized conser 98.8 2.2E-06 4.9E-11 93.4 26.3 199 118-421 107-343 (391)
173 KOG1034 Transcriptional repres 98.8 1.6E-08 3.4E-13 108.4 9.5 101 295-411 109-212 (385)
174 PRK05137 tolB translocation pr 98.8 3.7E-06 7.9E-11 96.7 29.4 81 302-401 315-397 (435)
175 KOG4378 Nuclear protein COP1 [ 98.8 2.9E-07 6.4E-12 102.8 19.1 107 295-417 95-202 (673)
176 KOG0303 Actin-binding protein 98.8 5.7E-08 1.2E-12 106.1 12.6 113 294-424 97-217 (472)
177 KOG2111 Uncharacterized conser 98.8 6.4E-06 1.4E-10 88.5 27.2 108 302-412 205-324 (346)
178 KOG3881 Uncharacterized conser 98.7 6.2E-07 1.4E-11 98.2 19.3 107 293-415 218-325 (412)
179 KOG0303 Actin-binding protein 98.7 1.8E-07 3.8E-12 102.4 14.9 117 295-428 148-270 (472)
180 KOG0307 Vesicle coat complex C 98.7 4.1E-08 8.9E-13 118.5 10.4 104 295-413 178-287 (1049)
181 KOG1063 RNA polymerase II elon 98.7 2.8E-08 6.2E-13 114.5 8.6 99 301-412 552-650 (764)
182 PF08662 eIF2A: Eukaryotic tra 98.7 1.5E-07 3.3E-12 96.7 13.3 91 298-410 80-179 (194)
183 KOG1063 RNA polymerase II elon 98.7 6.5E-07 1.4E-11 103.6 19.1 101 312-414 182-301 (764)
184 KOG2445 Nuclear pore complex c 98.7 2.5E-06 5.4E-11 91.2 21.9 251 293-584 27-294 (361)
185 KOG1007 WD repeat protein TSSC 98.7 8.3E-07 1.8E-11 94.1 18.0 103 298-416 190-295 (370)
186 KOG1963 WD40 repeat protein [G 98.7 2.5E-06 5.5E-11 101.1 23.9 108 294-419 220-331 (792)
187 PRK02889 tolB translocation pr 98.7 1.4E-05 3.1E-10 91.8 28.8 70 324-411 330-402 (427)
188 KOG0270 WD40 repeat-containing 98.7 2.2E-07 4.7E-12 103.0 13.0 118 292-426 257-377 (463)
189 PRK04922 tolB translocation pr 98.6 1.9E-05 4.1E-10 90.8 29.5 89 302-409 317-410 (433)
190 KOG1445 Tumor-specific antigen 98.6 2.9E-07 6.3E-12 105.0 13.7 93 295-402 694-786 (1012)
191 KOG0642 Cell-cycle nuclear pro 98.6 1E-07 2.3E-12 108.0 9.6 113 292-411 307-427 (577)
192 KOG1517 Guanine nucleotide bin 98.6 1.8E-06 3.9E-11 103.6 18.4 109 293-413 1271-1384 (1387)
193 PRK01742 tolB translocation pr 98.6 2.4E-06 5.3E-11 98.0 19.1 90 303-413 274-364 (429)
194 KOG0771 Prolactin regulatory e 98.6 6.2E-07 1.4E-11 99.0 13.1 74 322-411 282-355 (398)
195 KOG0290 Conserved WD40 repeat- 98.5 2E-06 4.3E-11 91.4 16.1 107 293-414 211-322 (364)
196 KOG1963 WD40 repeat protein [G 98.5 1.4E-06 3E-11 103.2 16.3 117 299-437 179-299 (792)
197 KOG0771 Prolactin regulatory e 98.5 3.5E-07 7.5E-12 100.9 10.6 127 293-422 158-324 (398)
198 KOG2394 WD40 protein DMR-N9 [G 98.5 1.3E-07 2.8E-12 106.7 7.1 84 312-412 281-364 (636)
199 KOG1538 Uncharacterized conser 98.5 1.6E-06 3.4E-11 99.7 15.5 144 138-408 14-160 (1081)
200 KOG1517 Guanine nucleotide bin 98.5 8.7E-06 1.9E-10 98.0 20.5 103 293-410 1223-1333 (1387)
201 KOG0649 WD40 repeat protein [G 98.5 1E-05 2.3E-10 84.2 18.6 111 297-418 132-243 (325)
202 KOG2394 WD40 protein DMR-N9 [G 98.5 1.4E-06 3.1E-11 98.4 12.8 61 293-354 304-364 (636)
203 TIGR02658 TTQ_MADH_Hv methylam 98.4 0.00058 1.3E-08 76.6 33.2 101 301-418 215-338 (352)
204 PRK04922 tolB translocation pr 98.4 2.6E-05 5.6E-10 89.7 23.2 93 301-412 272-370 (433)
205 PRK03629 tolB translocation pr 98.4 4.5E-05 9.7E-10 87.8 24.9 93 302-412 268-365 (429)
206 PRK05137 tolB translocation pr 98.4 7.1E-05 1.5E-09 86.1 25.5 91 302-410 271-366 (435)
207 TIGR02800 propeller_TolB tol-p 98.4 0.00015 3.3E-09 82.1 27.6 84 302-404 303-388 (417)
208 PF08662 eIF2A: Eukaryotic tra 98.4 1.1E-05 2.4E-10 83.1 16.5 51 300-352 124-179 (194)
209 KOG1009 Chromatin assembly com 98.4 7.2E-05 1.6E-09 82.7 23.4 93 302-411 262-373 (434)
210 PF00400 WD40: WD domain, G-be 98.4 8.2E-07 1.8E-11 66.6 6.0 39 311-350 1-39 (39)
211 PF10282 Lactonase: Lactonase, 98.4 0.0013 2.8E-08 73.6 34.0 85 323-419 246-331 (345)
212 PRK02889 tolB translocation pr 98.4 4.6E-05 1E-09 87.6 22.5 95 302-412 265-362 (427)
213 KOG1524 WD40 repeat-containing 98.3 9.2E-06 2E-10 91.9 16.0 81 294-405 201-281 (737)
214 PRK00178 tolB translocation pr 98.3 0.00045 9.8E-09 79.1 30.3 51 117-167 223-279 (430)
215 KOG2139 WD40 repeat protein [G 98.3 3.3E-05 7.1E-10 84.3 18.6 101 300-418 217-317 (445)
216 KOG1188 WD40 repeat protein [G 98.3 5.8E-06 1.3E-10 89.5 12.1 105 296-414 89-200 (376)
217 PRK04792 tolB translocation pr 98.3 0.00057 1.2E-08 79.3 29.3 51 117-167 242-298 (448)
218 KOG0644 Uncharacterized conser 98.2 8.5E-06 1.8E-10 96.0 13.1 99 296-411 370-469 (1113)
219 PF02239 Cytochrom_D1: Cytochr 98.2 0.00043 9.3E-09 78.3 25.6 183 117-418 16-210 (369)
220 KOG0644 Uncharacterized conser 98.2 5.7E-07 1.2E-11 105.6 1.8 95 295-410 206-300 (1113)
221 KOG0642 Cell-cycle nuclear pro 98.2 2.3E-05 4.9E-10 89.5 14.1 57 296-353 506-562 (577)
222 PF02239 Cytochrom_D1: Cytochr 98.1 2.3E-05 5.1E-10 88.5 13.7 105 297-419 12-117 (369)
223 KOG1310 WD40 repeat protein [G 98.1 5.1E-06 1.1E-10 94.2 8.0 82 314-411 43-126 (758)
224 KOG1587 Cytoplasmic dynein int 98.1 7.8E-05 1.7E-09 87.8 17.8 104 293-412 362-474 (555)
225 PRK01029 tolB translocation pr 98.1 0.0016 3.4E-08 75.2 27.7 92 303-411 307-404 (428)
226 TIGR02800 propeller_TolB tol-p 98.1 0.00044 9.5E-09 78.3 22.8 92 302-411 259-355 (417)
227 KOG0322 G-protein beta subunit 98.1 1.8E-05 3.9E-10 83.4 10.1 57 294-351 266-322 (323)
228 PRK00178 tolB translocation pr 98.0 0.0017 3.7E-08 74.4 24.5 92 302-411 268-364 (430)
229 KOG0974 WD-repeat protein WDR6 97.9 3E-05 6.5E-10 93.7 10.2 96 297-410 151-246 (967)
230 KOG3881 Uncharacterized conser 97.9 0.0007 1.5E-08 74.9 18.0 81 297-395 265-346 (412)
231 KOG4415 Uncharacterized conser 97.8 8.7E-06 1.9E-10 81.3 2.9 32 635-666 28-60 (247)
232 PRK04792 tolB translocation pr 97.8 0.0016 3.5E-08 75.5 21.9 93 302-412 287-382 (448)
233 KOG2321 WD40 repeat protein [G 97.8 0.00099 2.1E-08 76.6 18.5 194 302-546 156-350 (703)
234 KOG2321 WD40 repeat protein [G 97.8 0.00027 5.7E-09 81.1 13.9 189 117-418 155-351 (703)
235 KOG1240 Protein kinase contain 97.8 0.0005 1.1E-08 84.6 16.2 93 310-412 1037-1130(1431)
236 KOG4497 Uncharacterized conser 97.8 0.00033 7.1E-09 75.9 13.1 93 298-408 111-238 (447)
237 KOG2919 Guanine nucleotide-bin 97.8 0.00011 2.4E-09 79.4 9.5 119 294-426 126-254 (406)
238 PRK04043 tolB translocation pr 97.8 0.035 7.7E-07 64.0 30.7 49 118-166 214-268 (419)
239 KOG3914 WD repeat protein WDR4 97.7 8.4E-05 1.8E-09 82.2 8.0 103 298-418 129-231 (390)
240 PF10282 Lactonase: Lactonase, 97.7 0.039 8.4E-07 61.7 29.3 108 301-418 166-283 (345)
241 PF00400 WD40: WD domain, G-be 97.7 0.00015 3.2E-09 54.2 6.7 37 370-408 3-39 (39)
242 KOG2139 WD40 repeat protein [G 97.7 0.00013 2.8E-09 79.8 8.5 77 317-410 192-268 (445)
243 COG2706 3-carboxymuconate cycl 97.6 0.062 1.3E-06 59.5 28.8 85 322-419 244-330 (346)
244 PF13360 PQQ_2: PQQ-like domai 97.6 0.086 1.9E-06 54.5 28.9 94 297-412 128-232 (238)
245 KOG1538 Uncharacterized conser 97.6 6E-05 1.3E-09 87.1 5.6 90 300-409 32-121 (1081)
246 KOG1240 Protein kinase contain 97.6 0.021 4.6E-07 70.9 26.8 113 297-412 1213-1336(1431)
247 KOG0974 WD-repeat protein WDR6 97.6 0.00069 1.5E-08 82.3 14.0 100 297-415 193-293 (967)
248 KOG4547 WD40 repeat-containing 97.6 0.0067 1.5E-07 70.2 21.3 97 296-411 119-221 (541)
249 PLN02919 haloacid dehalogenase 97.6 0.039 8.3E-07 70.7 30.0 82 325-413 807-891 (1057)
250 PRK01029 tolB translocation pr 97.6 0.0062 1.3E-07 70.3 21.1 76 323-412 282-361 (428)
251 KOG4547 WD40 repeat-containing 97.5 0.00068 1.5E-08 78.1 11.9 108 297-423 76-185 (541)
252 TIGR03300 assembly_YfgL outer 97.5 0.064 1.4E-06 60.2 27.8 101 297-419 247-347 (377)
253 KOG1310 WD40 repeat protein [G 97.5 0.00034 7.3E-09 79.8 8.8 111 294-411 65-179 (758)
254 KOG1523 Actin-related protein 97.5 0.0032 6.9E-08 68.3 15.6 112 298-412 119-238 (361)
255 KOG0649 WD40 repeat protein [G 97.4 0.00058 1.2E-08 71.6 9.2 80 323-419 116-195 (325)
256 KOG1272 WD40-repeat-containing 97.4 0.00012 2.5E-09 82.3 4.4 107 294-419 224-332 (545)
257 KOG2315 Predicted translation 97.4 0.1 2.2E-06 60.6 27.3 101 300-420 250-354 (566)
258 KOG1409 Uncharacterized conser 97.4 0.0065 1.4E-07 66.6 16.6 99 298-412 172-272 (404)
259 KOG1064 RAVE (regulator of V-A 97.4 0.00054 1.2E-08 86.8 9.4 91 299-418 2313-2406(2439)
260 COG4946 Uncharacterized protei 97.4 0.0038 8.2E-08 70.6 15.0 105 297-420 377-486 (668)
261 KOG1912 WD40 repeat protein [G 97.3 0.0032 6.9E-08 74.6 14.8 102 295-412 441-553 (1062)
262 KOG4227 WD40 repeat protein [G 97.3 0.0015 3.2E-08 72.1 11.3 105 294-413 71-182 (609)
263 KOG2315 Predicted translation 97.3 0.0068 1.5E-07 69.9 16.3 51 299-352 334-390 (566)
264 KOG4497 Uncharacterized conser 97.3 0.0015 3.3E-08 70.9 10.3 87 298-402 68-155 (447)
265 TIGR03300 assembly_YfgL outer 97.3 0.17 3.7E-06 56.9 27.4 92 297-408 285-377 (377)
266 PF04762 IKI3: IKI3 family; I 97.2 0.49 1.1E-05 59.9 33.7 99 300-413 236-336 (928)
267 KOG1524 WD40 repeat-containing 97.2 0.00055 1.2E-08 78.0 6.3 115 292-409 76-215 (737)
268 smart00320 WD40 WD40 repeats. 97.2 0.001 2.2E-08 46.3 5.5 39 311-350 2-40 (40)
269 PF11768 DUF3312: Protein of u 97.1 0.0029 6.3E-08 73.5 11.3 91 303-412 238-331 (545)
270 TIGR02658 TTQ_MADH_Hv methylam 97.1 0.0068 1.5E-07 68.2 13.5 102 301-418 27-144 (352)
271 KOG1334 WD40 repeat protein [G 97.0 0.011 2.4E-07 67.1 13.8 111 297-412 300-426 (559)
272 COG2706 3-carboxymuconate cycl 96.9 0.1 2.2E-06 57.8 20.7 94 324-430 147-242 (346)
273 KOG1275 PAB-dependent poly(A) 96.9 0.013 2.9E-07 70.7 14.0 184 116-410 156-342 (1118)
274 PF07433 DUF1513: Protein of u 96.7 0.37 7.9E-06 53.2 23.1 102 302-412 139-249 (305)
275 PRK04043 tolB translocation pr 96.7 0.2 4.4E-06 57.8 22.5 81 301-401 257-339 (419)
276 KOG4227 WD40 repeat protein [G 96.7 0.0027 6E-08 70.1 6.5 99 314-423 49-147 (609)
277 KOG0280 Uncharacterized conser 96.7 0.0098 2.1E-07 64.1 9.9 103 298-415 140-246 (339)
278 KOG1523 Actin-related protein 96.6 0.015 3.3E-07 63.3 11.2 114 297-425 28-144 (361)
279 KOG4190 Uncharacterized conser 96.6 0.01 2.3E-07 67.9 10.3 125 293-420 749-916 (1034)
280 KOG1275 PAB-dependent poly(A) 96.5 0.039 8.4E-07 67.0 14.3 51 117-167 197-258 (1118)
281 PF08450 SGL: SMP-30/Gluconola 96.5 0.94 2E-05 47.8 23.6 99 301-410 115-213 (246)
282 PF15492 Nbas_N: Neuroblastoma 96.3 0.79 1.7E-05 49.7 21.7 99 320-418 146-267 (282)
283 COG5354 Uncharacterized protei 96.1 1.1 2.5E-05 51.8 22.7 91 301-411 255-349 (561)
284 PLN02919 haloacid dehalogenase 96.1 1.1 2.4E-05 57.8 25.2 87 324-411 742-834 (1057)
285 KOG2314 Translation initiation 96.0 0.93 2E-05 52.9 21.7 96 302-411 426-526 (698)
286 KOG1645 RING-finger-containing 96.0 0.017 3.6E-07 64.6 7.3 95 303-415 175-271 (463)
287 PF03178 CPSF_A: CPSF A subuni 95.7 0.54 1.2E-05 51.9 18.0 94 301-411 107-203 (321)
288 KOG1354 Serine/threonine prote 95.6 0.038 8.3E-07 60.7 8.1 107 293-412 228-361 (433)
289 KOG4532 WD40-like repeat conta 95.6 0.58 1.3E-05 50.4 16.4 61 293-354 217-284 (344)
290 COG3386 Gluconolactonase [Carb 95.5 1.4 3.1E-05 48.8 20.4 52 300-352 142-193 (307)
291 KOG0309 Conserved WD40 repeat- 95.5 0.09 2E-06 62.6 11.2 114 300-429 179-306 (1081)
292 KOG4640 Anaphase-promoting com 95.3 0.066 1.4E-06 62.9 9.1 77 323-417 22-99 (665)
293 KOG1334 WD40 repeat protein [G 95.1 0.11 2.3E-06 59.6 10.0 58 295-353 410-467 (559)
294 KOG4532 WD40-like repeat conta 95.1 0.22 4.8E-06 53.5 11.7 101 299-415 136-238 (344)
295 PF13360 PQQ_2: PQQ-like domai 95.1 2.7 6E-05 43.3 19.9 55 117-171 46-102 (238)
296 smart00320 WD40 WD40 repeats. 94.7 0.054 1.2E-06 37.2 4.2 28 381-408 13-40 (40)
297 KOG1064 RAVE (regulator of V-A 94.7 0.062 1.4E-06 69.2 7.2 121 294-419 2223-2375(2439)
298 PRK11138 outer membrane biogen 94.6 2.7 5.8E-05 47.8 20.0 57 117-173 130-188 (394)
299 KOG0309 Conserved WD40 repeat- 94.6 0.22 4.7E-06 59.5 11.0 104 295-415 131-237 (1081)
300 KOG1645 RING-finger-containing 94.6 1.1 2.5E-05 50.5 16.0 52 115-166 214-269 (463)
301 KOG4190 Uncharacterized conser 94.6 0.03 6.5E-07 64.3 3.9 85 313-409 727-811 (1034)
302 KOG0280 Uncharacterized conser 94.2 0.14 3E-06 55.5 7.6 104 295-415 182-289 (339)
303 KOG4640 Anaphase-promoting com 94.1 0.14 3.1E-06 60.2 8.1 59 294-354 35-94 (665)
304 PF15492 Nbas_N: Neuroblastoma 94.1 0.27 6E-06 53.1 9.6 72 327-411 3-74 (282)
305 KOG2314 Translation initiation 93.9 0.13 2.8E-06 59.7 7.2 96 324-445 213-319 (698)
306 PF03178 CPSF_A: CPSF A subuni 93.8 13 0.00027 41.1 28.2 50 117-166 62-118 (321)
307 KOG2695 WD40 repeat protein [G 93.8 0.16 3.4E-06 56.2 7.3 103 297-412 270-378 (425)
308 KOG2066 Vacuolar assembly/sort 93.7 0.21 4.6E-06 60.2 8.8 91 294-411 52-147 (846)
309 KOG1912 WD40 repeat protein [G 93.7 0.75 1.6E-05 55.5 13.0 99 295-410 83-186 (1062)
310 COG5354 Uncharacterized protei 93.5 0.13 2.9E-06 59.1 6.4 93 306-418 17-124 (561)
311 KOG1354 Serine/threonine prote 93.4 0.18 3.8E-06 55.7 6.8 79 322-414 26-120 (433)
312 KOG4714 Nucleoporin [Nuclear s 93.2 0.05 1.1E-06 58.0 2.2 58 295-353 196-255 (319)
313 PF08450 SGL: SMP-30/Gluconola 93.2 1.3 2.7E-05 46.8 12.9 97 302-414 61-168 (246)
314 KOG2695 WD40 repeat protein [G 93.1 0.1 2.2E-06 57.5 4.5 113 298-425 231-347 (425)
315 COG0823 TolB Periplasmic compo 93.1 1.7 3.6E-05 50.5 14.7 50 117-166 218-273 (425)
316 KOG3621 WD40 repeat-containing 93.1 0.22 4.8E-06 59.3 7.5 102 297-411 51-155 (726)
317 cd00216 PQQ_DH Dehydrogenases 92.9 11 0.00023 44.5 21.4 57 117-173 71-138 (488)
318 PF12894 Apc4_WD40: Anaphase-p 92.9 0.28 6E-06 39.3 5.6 31 380-410 11-41 (47)
319 PF07433 DUF1513: Protein of u 92.8 11 0.00025 41.7 19.6 70 321-412 216-285 (305)
320 PRK11138 outer membrane biogen 92.6 22 0.00048 40.4 31.0 92 297-409 300-393 (394)
321 KOG4714 Nucleoporin [Nuclear s 92.4 0.14 3.1E-06 54.6 4.2 94 301-411 159-255 (319)
322 KOG1920 IkappaB kinase complex 92.2 15 0.00032 47.1 21.5 97 301-412 222-324 (1265)
323 KOG2041 WD40 repeat protein [G 92.2 0.24 5.3E-06 58.9 6.1 100 294-408 29-143 (1189)
324 PF08553 VID27: VID27 cytoplas 92.1 6.6 0.00014 48.9 18.5 59 292-352 589-647 (794)
325 PRK02888 nitrous-oxide reducta 91.9 1.6 3.4E-05 52.6 12.5 114 297-414 211-355 (635)
326 COG5170 CDC55 Serine/threonine 91.4 0.42 9.2E-06 52.2 6.5 105 293-412 236-369 (460)
327 KOG0882 Cyclophilin-related pe 91.0 0.64 1.4E-05 53.1 7.6 117 298-418 119-239 (558)
328 PF06433 Me-amine-dh_H: Methyl 90.7 0.5 1.1E-05 52.8 6.5 58 117-174 269-331 (342)
329 KOG1832 HIV-1 Vpr-binding prot 90.6 0.13 2.7E-06 62.5 1.8 106 293-417 1115-1221(1516)
330 COG4946 Uncharacterized protei 90.4 4.9 0.00011 46.4 13.8 96 300-414 340-435 (668)
331 PF12894 Apc4_WD40: Anaphase-p 90.3 0.72 1.6E-05 36.9 5.3 29 322-351 12-40 (47)
332 PF11768 DUF3312: Protein of u 89.9 0.59 1.3E-05 55.0 6.5 58 294-354 274-331 (545)
333 KOG2079 Vacuolar assembly/sort 89.6 0.75 1.6E-05 57.3 7.2 98 297-410 105-203 (1206)
334 KOG3914 WD repeat protein WDR4 88.9 0.41 8.9E-06 53.8 4.0 60 293-354 165-225 (390)
335 PF00930 DPPIV_N: Dipeptidyl p 88.9 0.97 2.1E-05 50.7 7.2 101 301-411 23-132 (353)
336 PF04053 Coatomer_WDAD: Coatom 87.7 28 0.0006 40.8 18.1 58 333-411 117-174 (443)
337 PF10313 DUF2415: Uncharacteri 87.0 1.5 3.3E-05 34.5 5.0 32 322-354 1-35 (43)
338 PF04841 Vps16_N: Vps16, N-ter 87.0 66 0.0014 37.2 22.0 47 116-163 60-109 (410)
339 PF00780 CNH: CNH domain; Int 86.9 15 0.00032 39.2 14.5 43 131-173 223-265 (275)
340 TIGR03075 PQQ_enz_alc_DH PQQ-d 85.9 63 0.0014 38.7 20.3 78 300-394 440-517 (527)
341 PF14783 BBS2_Mid: Ciliary BBS 85.8 7 0.00015 37.1 9.7 65 324-409 2-70 (111)
342 COG0823 TolB Periplasmic compo 85.7 2.7 5.9E-05 48.8 8.5 95 301-416 218-318 (425)
343 KOG3617 WD40 and TPR repeat-co 85.2 0.7 1.5E-05 56.2 3.4 76 324-416 62-137 (1416)
344 PRK02888 nitrous-oxide reducta 84.7 6.4 0.00014 47.6 11.0 106 300-411 295-405 (635)
345 PF10168 Nup88: Nuclear pore c 84.3 18 0.0004 44.8 15.1 91 322-415 85-184 (717)
346 cd00216 PQQ_DH Dehydrogenases 84.0 89 0.0019 36.9 20.3 22 391-412 405-426 (488)
347 COG3391 Uncharacterized conser 83.7 16 0.00036 41.6 13.6 95 298-410 93-190 (381)
348 PF14655 RAB3GAP2_N: Rab3 GTPa 83.5 7.2 0.00016 45.2 10.5 91 313-419 299-407 (415)
349 PF06977 SdiA-regulated: SdiA- 83.3 16 0.00034 39.5 12.5 81 317-414 17-98 (248)
350 KOG2114 Vacuolar assembly/sort 82.9 5.2 0.00011 49.2 9.3 107 295-408 39-153 (933)
351 KOG2079 Vacuolar assembly/sort 82.8 1.7 3.7E-05 54.4 5.4 74 331-421 97-171 (1206)
352 PF10313 DUF2415: Uncharacteri 82.6 3.6 7.9E-05 32.4 5.3 29 383-411 3-34 (43)
353 KOG2066 Vacuolar assembly/sort 82.3 9.3 0.0002 46.8 11.0 105 294-418 86-195 (846)
354 PRK13616 lipoprotein LpqB; Pro 82.3 7.6 0.00016 47.1 10.6 100 301-418 379-485 (591)
355 COG5170 CDC55 Serine/threonine 81.6 2.5 5.4E-05 46.4 5.5 86 322-412 27-119 (460)
356 KOG2395 Protein involved in va 80.8 78 0.0017 37.7 17.2 58 293-352 443-500 (644)
357 PF04762 IKI3: IKI3 family; I 80.6 16 0.00036 46.6 13.1 82 313-408 62-148 (928)
358 KOG2041 WD40 repeat protein [G 80.2 2 4.2E-05 51.6 4.5 96 295-410 87-186 (1189)
359 PF06433 Me-amine-dh_H: Methyl 79.8 48 0.001 37.5 14.9 51 302-354 270-322 (342)
360 PF02897 Peptidase_S9_N: Proly 78.3 13 0.00029 42.3 10.5 76 323-415 125-215 (414)
361 KOG1409 Uncharacterized conser 76.6 15 0.00032 41.2 9.5 114 309-426 102-244 (404)
362 PF14783 BBS2_Mid: Ciliary BBS 76.4 63 0.0014 30.8 12.5 88 294-405 18-109 (111)
363 PF00780 CNH: CNH domain; Int 76.0 1.1E+02 0.0024 32.4 24.9 40 18-59 7-46 (275)
364 COG3386 Gluconolactonase [Carb 75.4 1E+02 0.0022 34.4 16.0 99 301-416 47-155 (307)
365 PF12234 Rav1p_C: RAVE protein 75.1 30 0.00065 42.2 12.5 102 300-409 50-155 (631)
366 PRK13616 lipoprotein LpqB; Pro 74.2 16 0.00034 44.4 10.0 99 300-415 429-530 (591)
367 PF08596 Lgl_C: Lethal giant l 74.2 1.8E+02 0.0038 33.7 18.8 96 299-410 233-335 (395)
368 PF14781 BBS2_N: Ciliary BBSom 73.5 19 0.00042 35.3 8.5 46 126-171 37-89 (136)
369 PF08728 CRT10: CRT10; InterP 73.4 1.2E+02 0.0025 37.8 16.9 74 322-409 164-245 (717)
370 PF08553 VID27: VID27 cytoplas 73.3 25 0.00054 44.0 11.5 97 296-408 499-604 (794)
371 KOG2444 WD40 repeat protein [G 72.6 8.4 0.00018 40.9 6.2 106 295-417 74-184 (238)
372 PF08596 Lgl_C: Lethal giant l 72.4 27 0.00059 40.2 11.0 83 313-411 78-174 (395)
373 PF14583 Pectate_lyase22: Olig 72.2 1.9E+02 0.0042 33.3 17.7 41 115-155 166-209 (386)
374 COG3391 Uncharacterized conser 72.0 58 0.0012 37.2 13.5 93 300-411 139-240 (381)
375 PF02897 Peptidase_S9_N: Proly 71.5 27 0.00058 39.8 10.7 99 298-411 147-261 (414)
376 PF07676 PD40: WD40-like Beta 71.1 13 0.00027 27.8 5.4 30 320-349 7-38 (39)
377 PF05694 SBP56: 56kDa selenium 71.1 45 0.00099 38.9 12.1 108 298-415 219-347 (461)
378 PF07676 PD40: WD40-like Beta 69.7 9.5 0.00021 28.4 4.4 26 382-407 10-38 (39)
379 KOG3617 WD40 and TPR repeat-co 69.1 9.3 0.0002 47.1 6.3 56 296-352 76-131 (1416)
380 KOG1832 HIV-1 Vpr-binding prot 68.3 6.4 0.00014 48.6 4.8 91 312-419 1092-1185(1516)
381 TIGR03075 PQQ_enz_alc_DH PQQ-d 66.9 2.8E+02 0.0062 33.2 23.1 56 117-172 79-147 (527)
382 TIGR03074 PQQ_membr_DH membran 64.3 3.8E+02 0.0083 33.8 22.7 57 117-173 204-288 (764)
383 KOG4649 PQQ (pyrrolo-quinoline 64.2 2.3E+02 0.005 31.2 18.1 84 297-397 69-153 (354)
384 PF00930 DPPIV_N: Dipeptidyl p 60.5 2.8E+02 0.0061 31.0 20.6 50 117-166 23-74 (353)
385 smart00036 CNH Domain found in 60.4 1.7E+02 0.0036 32.4 13.8 43 131-173 238-280 (302)
386 COG3490 Uncharacterized protei 59.8 1.1E+02 0.0025 33.9 11.7 53 120-173 52-109 (366)
387 KOG2395 Protein involved in va 59.8 54 0.0012 39.0 9.9 89 302-409 405-499 (644)
388 KOG4460 Nuclear pore complex, 59.3 1.1E+02 0.0024 36.5 12.1 86 322-414 104-202 (741)
389 PF04053 Coatomer_WDAD: Coatom 58.4 25 0.00054 41.2 7.2 51 297-351 122-172 (443)
390 KOG1920 IkappaB kinase complex 57.7 44 0.00095 43.1 9.4 68 322-407 69-136 (1265)
391 PF10647 Gmad1: Lipoprotein Lp 56.7 94 0.002 33.4 10.9 75 323-411 67-145 (253)
392 TIGR02276 beta_rpt_yvtn 40-res 56.0 59 0.0013 24.0 6.7 24 331-354 1-24 (42)
393 KOG2377 Uncharacterized conser 54.7 3.1E+02 0.0067 32.5 14.6 81 320-415 65-145 (657)
394 KOG1916 Nuclear protein, conta 54.0 8.9 0.00019 47.6 2.6 53 298-352 202-265 (1283)
395 PF12234 Rav1p_C: RAVE protein 51.9 1.3E+02 0.0027 37.1 11.8 80 316-411 24-105 (631)
396 PF12657 TFIIIC_delta: Transcr 51.8 97 0.0021 31.2 9.4 30 382-411 87-122 (173)
397 KOG2114 Vacuolar assembly/sort 51.6 1E+02 0.0022 38.7 10.7 97 301-410 92-201 (933)
398 PF14870 PSII_BNR: Photosynthe 50.5 1E+02 0.0022 34.3 10.1 69 322-408 145-213 (302)
399 KOG3621 WD40 repeat-containing 49.4 33 0.00071 41.8 6.2 75 322-412 34-108 (726)
400 KOG0882 Cyclophilin-related pe 46.7 16 0.00034 42.3 3.0 59 295-354 24-86 (558)
401 KOG1008 Uncharacterized conser 46.0 7.4 0.00016 46.6 0.3 97 299-414 127-230 (783)
402 KOG1008 Uncharacterized conser 45.3 7.5 0.00016 46.6 0.2 102 298-410 77-184 (783)
403 PF14781 BBS2_N: Ciliary BBSom 45.0 2.6E+02 0.0057 27.7 10.6 56 117-172 73-134 (136)
404 PF04841 Vps16_N: Vps16, N-ter 44.0 2.5E+02 0.0054 32.5 12.4 31 321-352 216-246 (410)
405 PF06977 SdiA-regulated: SdiA- 43.6 3.4E+02 0.0075 29.3 12.5 107 295-411 38-148 (248)
406 PF05787 DUF839: Bacterial pro 43.4 47 0.001 39.8 6.4 69 326-397 440-518 (524)
407 COG3490 Uncharacterized protei 43.0 1.1E+02 0.0024 34.0 8.5 81 303-399 93-180 (366)
408 PF11715 Nup160: Nucleoporin N 42.8 98 0.0021 36.9 9.1 36 383-418 217-256 (547)
409 PF07569 Hira: TUP1-like enhan 42.5 43 0.00094 35.3 5.4 38 131-168 7-45 (219)
410 PF07569 Hira: TUP1-like enhan 41.6 1.6E+02 0.0035 31.1 9.5 73 329-411 18-96 (219)
411 PF01731 Arylesterase: Arylest 41.4 1.1E+02 0.0025 27.6 7.1 49 301-352 36-84 (86)
412 PF05096 Glu_cyclase_2: Glutam 39.7 1.6E+02 0.0034 32.3 9.1 58 115-172 108-166 (264)
413 PF10214 Rrn6: RNA polymerase 39.0 1.4E+02 0.0029 37.6 9.8 84 322-414 146-236 (765)
414 COG3211 PhoX Predicted phospha 38.2 72 0.0016 38.3 6.7 64 325-398 503-571 (616)
415 PRK10115 protease 2; Provision 34.8 1.6E+02 0.0036 36.4 9.5 73 322-412 127-209 (686)
416 PF13570 PQQ_3: PQQ-like domai 34.5 67 0.0014 24.1 3.9 23 142-164 17-40 (40)
417 PF10214 Rrn6: RNA polymerase 34.4 1E+03 0.023 29.9 17.3 72 323-412 205-278 (765)
418 TIGR03606 non_repeat_PQQ dehyd 33.9 1.4E+02 0.0031 35.2 8.3 68 326-397 150-246 (454)
419 PF14870 PSII_BNR: Photosynthe 33.9 2.8E+02 0.006 31.0 10.2 92 299-404 163-255 (302)
420 PF01011 PQQ: PQQ enzyme repea 32.7 80 0.0017 23.6 4.0 28 148-175 3-30 (38)
421 PF03088 Str_synth: Strictosid 32.4 2.3E+02 0.0049 25.9 7.6 16 383-398 59-74 (89)
422 PF12768 Rax2: Cortical protei 32.2 4.9E+02 0.011 28.7 11.7 55 300-354 15-74 (281)
423 PF10647 Gmad1: Lipoprotein Lp 31.3 2.5E+02 0.0054 30.1 9.2 67 323-408 25-93 (253)
424 PF03022 MRJP: Major royal jel 31.3 7.4E+02 0.016 27.2 13.5 53 302-354 35-98 (287)
425 PF01731 Arylesterase: Arylest 30.9 74 0.0016 28.8 4.2 29 383-411 56-85 (86)
426 KOG1897 Damage-specific DNA bi 30.8 1.3E+03 0.029 30.0 24.4 94 302-410 849-942 (1096)
427 COG5422 ROM1 RhoGEF, Guanine n 30.6 2.8E+02 0.0061 35.2 10.0 34 142-175 1104-1137(1175)
428 TIGR03032 conserved hypothetic 30.4 8.5E+02 0.018 27.6 15.5 171 297-504 24-204 (335)
429 PF12341 DUF3639: Protein of u 29.4 1E+02 0.0022 22.1 3.7 24 383-408 4-27 (27)
430 smart00564 PQQ beta-propeller 29.1 1.3E+02 0.0027 21.2 4.5 25 147-171 8-32 (33)
431 PF05694 SBP56: 56kDa selenium 27.8 1.1E+02 0.0024 35.9 5.8 48 115-162 220-276 (461)
432 PF03088 Str_synth: Strictosid 27.5 1.5E+02 0.0032 27.1 5.5 45 296-341 32-76 (89)
433 KOG0183 20S proteasome, regula 27.5 37 0.00081 35.7 1.9 15 522-536 7-21 (249)
434 KOG2280 Vacuolar assembly/sort 27.1 8.1E+02 0.018 30.8 13.0 47 117-164 64-113 (829)
435 PF14761 HPS3_N: Hermansky-Pud 26.9 2E+02 0.0044 30.5 7.2 44 343-401 37-80 (215)
436 TIGR03118 PEPCTERM_chp_1 conse 26.6 7.2E+02 0.016 28.1 11.5 54 298-354 219-281 (336)
437 COG2133 Glucose/sorbosone dehy 26.0 2.1E+02 0.0046 33.2 7.8 20 323-342 178-197 (399)
438 PF05096 Glu_cyclase_2: Glutam 25.1 9.4E+02 0.02 26.4 21.4 55 117-171 68-126 (264)
439 TIGR02604 Piru_Ver_Nterm putat 24.4 1.9E+02 0.0042 32.7 7.1 17 383-399 186-202 (367)
440 PF07995 GSDH: Glucose / Sorbo 24.2 4.4E+02 0.0095 29.4 9.8 80 312-405 244-330 (331)
441 TIGR02604 Piru_Ver_Nterm putat 23.6 4.3E+02 0.0093 29.9 9.7 62 323-397 15-87 (367)
442 PF14269 Arylsulfotran_2: Aryl 22.4 2.8E+02 0.0061 30.8 7.7 71 324-410 146-220 (299)
443 COG3204 Uncharacterized protei 22.1 3.8E+02 0.0083 30.0 8.4 84 319-418 83-166 (316)
444 KOG1916 Nuclear protein, conta 22.1 44 0.00096 41.9 1.4 98 299-414 151-269 (1283)
445 TIGR02608 delta_60_rpt delta-6 21.0 2.1E+02 0.0045 23.9 4.7 32 383-414 3-39 (55)
446 PF11715 Nup160: Nucleoporin N 21.0 1.7E+02 0.0037 34.8 6.1 31 323-354 216-250 (547)
447 PF01436 NHL: NHL repeat; Int 20.3 1.7E+02 0.0037 20.5 3.6 25 383-407 4-28 (28)
448 PF05935 Arylsulfotrans: Aryls 20.2 4.4E+02 0.0095 31.2 9.2 53 301-354 167-244 (477)
No 1
>PF12490 BCAS3: Breast carcinoma amplified sequence 3 ; InterPro: IPR022175 This domain family is found in eukaryotes, and is typically between 229 and 245 amino acids in length. The proteins in this family have been shown to be proto-oncogenes implicated in the development of breast cancer.
Probab=100.00 E-value=2e-59 Score=495.59 Aligned_cols=241 Identities=46% Similarity=0.728 Sum_probs=211.9
Q ss_pred CCCeeeeeceEEEcCC-CCCCccccccchhccC-cccCCCcceeeeeeccCCCccccccCCcccccccEEEEcCCCcEEE
Q 003310 457 GPPVTLSVVSRIRNGN-NGWRGTVSGAAAAATG-RVSSLSGAIASSFHNCKGNSETYAAGSSLKIKNHLLVFSPSGCMIQ 534 (832)
Q Consensus 457 ~~p~~ls~v~~I~~~~-~~~~~~v~~~~~~a~g-~~~~~~g~~~~~~h~~~~~~~~~~~~~~~~~~~~Llv~s~~G~l~~ 534 (832)
|+|++|++|+|||+++ +||+++|+++|++|+| |.+.++||+|+.||+|.+...........+++||||||+|+|||||
T Consensus 1 P~Pv~l~~vsrIK~~~~~g~~~tv~~aassa~g~~~~~~sga~a~~f~~~~~~~~~~~~~~~~~~~~~LlV~spsG~Liq 80 (251)
T PF12490_consen 1 PPPVTLSVVSRIKQGNTLGWLNTVSNAASSATGGKPSSVSGAFASSFHNSKGSSSEPSDSSSSKAVESLLVFSPSGHLIQ 80 (251)
T ss_pred CCCEEechHHhhcCCccccccccccccccchhcCCcccceeEEccccccCCCCcccccccccccccceEEEECCCCcEEE
Confidence 5799999999999999 8999999999999999 8899999999999999666656666676789999999999999999
Q ss_pred EeeeccCCCCccccCCCCCCcCCCC-CCCCceEEeeeeeeeecccccccccccc-cccccCCCCCcCC-CcccccccccC
Q 003310 535 YALRISTGLDVTMGVPGLGSAYDSV-PEDDPRLVVEAIQKWNICQKQARRERED-NIDIYGDNGTLDS-NKIYPEEVKDG 611 (832)
Q Consensus 535 y~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ve~~~~Wdv~rr~~~~e~~~-~~~i~~~~~~~~~-n~~~~~~~~~~ 611 (832)
|+|+|+.+.++..++++.++++++. +|+++||+|||+||||||||++|+|+++ +..+++.....+. +++.....+++
T Consensus 81 y~L~p~~~~~~~~~~~~~~~~~~~~~~~~~l~l~vep~~~Wdl~R~~~w~e~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (251)
T PF12490_consen 81 YELRPSPGSDPTEGGSGNGPPSESQMDDTELRLVVEPVQQWDLCRRPNWPEREEDCVPPLPENNPLDSASKIDPSDCRKG 160 (251)
T ss_pred EEEeeccccCcccccccccCccccccccCcceEEeeeccceeEeccccCCccchhccCCCCCCCHhhhhhhccccccccc
Confidence 9999999999999999999999999 7799999999999999999999999999 6677787655443 46666666665
Q ss_pred c-cccCCCcccccccCCCcccccceeeeeeEEeecCC-CcccccCCeeEEEEeecCcc-ccCcccccc--ceEEEeeccc
Q 003310 612 N-FASTEANGVIEKTKVSPEDKHHLYISEAELQMHPP-RIPLWAKPQIYFQSMMIKDF-KMGEENFLK--GEIEIERFPT 686 (832)
Q Consensus 612 ~-~~~~~~~~~~~~~~~~~~e~~~~ylS~aEvq~h~~-~~plW~~~~~~F~~m~~~~~-~~~~~~~~~--~~~eie~~~~ 686 (832)
+ +++.+. +...+.+++++|++++||||||||||++ ||||||||||+||+|.+++. ++...+..+ ||||||++|+
T Consensus 161 ~~~~~~~~-~~~~~~~~~~~e~~~~wlS~vEi~th~~phrpLW~gpQf~F~~~~~~~~~~~~~s~~~~~~~e~EIE~~~~ 239 (251)
T PF12490_consen 161 NSVNPSND-SYVSKESDSPEERDHWWLSNVEIQTHSGPHRPLWMGPQFSFKTMSSPSSSELNISSSSGEAGEIEIEKIPT 239 (251)
T ss_pred CCcccccc-ccccccCCCcccccCcEEeeeeeEeccCCccccccCCcEEEEEecCCCCccccccccccccCceeeccccc
Confidence 5 666553 3336678888999999999999999999 69999999999999998774 444556667 9999999999
Q ss_pred cccccccCCccc
Q 003310 687 RMIEARSKDLVP 698 (832)
Q Consensus 687 ~~~~~~~~~l~p 698 (832)
|+||+|+|||||
T Consensus 240 ~~ve~r~k~l~p 251 (251)
T PF12490_consen 240 REVEIRRKDLLP 251 (251)
T ss_pred cceeeeccccCC
Confidence 999999999998
No 2
>KOG2109 consensus WD40 repeat protein [General function prediction only]
Probab=100.00 E-value=1.4e-52 Score=469.20 Aligned_cols=602 Identities=28% Similarity=0.322 Sum_probs=419.5
Q ss_pred eeeeccccccccCCCCCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEe
Q 003310 2 VLWAGFDKLESEAGATRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCA 81 (832)
Q Consensus 2 v~w~~fd~l~~~~~~~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~ 81 (832)
|+|++||+ .++..+.||+++|.+||||||++++..+.+.++.+||++.|.+|++.|..+. ..+.|+.++|++|+|.
T Consensus 41 vlw~~fD~---~~~~~~~Vlll~~~~gfqv~d~~Dsp~vh~~vs~~dd~~~f~sm~~~pl~sg-~~~gf~ss~avpavv~ 116 (788)
T KOG2109|consen 41 VLWIKFDP---KPEVLEEVLLLNREEGFQVVDETDSPTVHKEVSISDDLLDFSSMDKSPLSSG-PDSGFESSDAVPAVVR 116 (788)
T ss_pred ccccccCC---chhHHHHHHHHhhccCceEEeeccCCccceeeeecCCcceecccCCCCccCC-CCCccccCCceeeecc
Confidence 79999994 4456799999999999999999999999999999999999999999987653 4567999999999998
Q ss_pred CCCCccCccccCCcccccCCCCCCCCCCCC-CCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEEeCCEEEEE
Q 003310 82 DGSRSCGTKVQDGLATACNGTSANYHDLGN-GSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAICQAAQVHCF 160 (832)
Q Consensus 82 ~g~~~g~~~~~Dg~~~~~~g~~~~~h~~g~-~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa~~~~I~vw 160 (832)
..... ..+.+ .++|. ....+....++||++..++|.|+|+ +|+||
T Consensus 117 ~t~S~--p~I~~-------------S~~Gse~d~t~an~~v~dl~S~~yah~l~fR-------------------qi~Cf 162 (788)
T KOG2109|consen 117 TTTSP--PTIPP-------------SQTGSEQDSTQANEMVVDLMSLDYAHALPFR-------------------QIHCF 162 (788)
T ss_pred cccCC--CcCCC-------------CCCcceecccccccceeccccccchhccccc-------------------ccccc
Confidence 21110 00000 01111 1123455778999999999999997 89999
Q ss_pred ECCCCceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCccccccccccc--ccCCCcceee
Q 003310 161 DAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSG--FASNGSRVAH 238 (832)
Q Consensus 161 Dl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~--~~s~g~~V~~ 238 (832)
|+.+++..+.+.+.+.+... .+++|+|+++++||+||+...+..+.. +.+++.++.++ ++..+..++.
T Consensus 163 Da~tle~d~~~~~n~~p~l~-----l~VGYGplaVg~rWaaya~~~a~~vss-----~~Vt~~~~VspttSs~~~~~va~ 232 (788)
T KOG2109|consen 163 DAPTLEIDSMNTINTKPRLL-----LSVGYGPLAVGRRWAAYAQTLANQVSS-----HLVTMGMSVSPTTSSQITAEVAE 232 (788)
T ss_pred cCcccCCchhhccccccccc-----eeeccccccceeeeeeeccCcchhhhh-----ccccccccccCCCCCchhHHHHH
Confidence 99999988888877655432 357899999999999999765443322 11111122222 2345667999
Q ss_pred eecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCC-CCe--EEEEECCCCcEEE
Q 003310 239 YAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADN-VGM--VIVRDIVSKNVIA 315 (832)
Q Consensus 239 ~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~-~G~--V~IwDl~s~~~l~ 315 (832)
||++.+|++|.||..+||.||+.+++||.+..+.+.+.-...+...+ .|.+.++ ++.. -|+ +.+-|+.+.+.+.
T Consensus 233 ~A~essk~lA~gl~nlgDkGy~~isglc~g~~~~g~gpglgg~~~~~-vGrvg~v--saesV~g~~~vivkdf~S~a~i~ 309 (788)
T KOG2109|consen 233 WAQESSKELAGGLVNLGDKGYVLISGLCRGSYQIGTGPGLGGFEEVL-VGRVGPV--SAESVLGNNLVIVKDFDSFADIR 309 (788)
T ss_pred hhhhhhHHHhhhhcccccchHHHHHHHhhcccCCCCCCCCCCcCcee-ccccccc--cceeecccceEEeecccchhhhh
Confidence 99999999999999999999999999999977765432211111101 1222111 2222 355 8889999999999
Q ss_pred EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEE
Q 003310 316 QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWI 395 (832)
Q Consensus 316 ~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~L 395 (832)
+|++|+.+|++|||+++|.+|++++..|+.|++|++++...... ..... +.--+||++++.|++|||+-+.+|+
T Consensus 310 QfkAhkspiSaLcfdqsgsllViasi~g~nVnvfRimet~~t~~-~~~qs-----~~~s~ra~t~aviqdicfs~~s~~r 383 (788)
T KOG2109|consen 310 QFKAHKSPISALCFDQSGSLLVIASITGRNVNVFRIMETVCTVN-VSDQS-----LVVSPRANTAAVIQDICFSEVSTIR 383 (788)
T ss_pred heeeecCcccccccccCceEEEEEeeccceeeeEEecccccccc-ccccc-----cccchhcchHHHHHHHhhhhhcceE
Confidence 99999999999999999999999999999999999987522110 01110 0111589999999999999999999
Q ss_pred EEEeCCCcEEEEecCCCCCceeeccCCCCcccccCCcccccccCCCCCCCCCCCCcccccCCCCeeeeeceEEEcCCCCC
Q 003310 396 MISSSRGTSHLFAINPLGGSVNFQPTDANFTTKHGAMAKSGVRWPPNLGLQMPNQQSLCASGPPVTLSVVSRIRNGNNGW 475 (832)
Q Consensus 396 AsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~~~~~~~~~~~~r~~~~s~~~~~~~~~l~~~~~p~~ls~v~~I~~~~~~~ 475 (832)
+.+|.+|+- ..+.. +.-..|+..+.+. ..-+++ .++....+...+++.-.+.-
T Consensus 384 ~~gsc~Ge~-----------P~ls~-------------t~~lp~~A~~Sl~-~gl~s~-g~~aa~gla~~sag~~a~s~- 436 (788)
T KOG2109|consen 384 TAGSCEGEP-----------PALSL-------------TCQLPAYADTSLD-LGLQSS-GGLAAEGLATSSAGYTAHSY- 436 (788)
T ss_pred eecccCCCC-----------ccccc-------------ccccchhhchhhh-cccccc-Ccccceeeeecccccccccc-
Confidence 999976653 11110 0011121111111 011111 22223345555555554311
Q ss_pred CccccccchhccCcccCCCcceeeeeeccCCCccccccCCcccccccEEEEcCCC-cEEEEeeeccCCCCcccc-CCCCC
Q 003310 476 RGTVSGAAAAATGRVSSLSGAIASSFHNCKGNSETYAAGSSLKIKNHLLVFSPSG-CMIQYALRISTGLDVTMG-VPGLG 553 (832)
Q Consensus 476 ~~~v~~~~~~a~g~~~~~~g~~~~~~h~~~~~~~~~~~~~~~~~~~~Llv~s~~G-~l~~y~l~~~~~~~~~~~-~~~~~ 553 (832)
+|+.-.++.-.+++..+..||-...-. .......++.|||+.|+| +|+||.|+|+.+..-.+. ....+
T Consensus 437 ------~asSv~s~s~~pd~ks~gv~~gsv~k~----~q~~~~~l~~llv~~psGd~vvqh~vahs~~gv~~Ef~~~~~l 506 (788)
T KOG2109|consen 437 ------TASSVFSRSVKPDSKSVGVGSGSVTKA----NQGVITVLNLLLVGEPSGDGVVQHYVAHSDPGVYIEFSPDQRL 506 (788)
T ss_pred ------ccceeeccccccchhhccceeeecccc----CccchhhhhheeeecCCCCceeEEEeeccCccceeeecccccc
Confidence 122223344444444455555431111 012345799999999999 999999999988777664 34445
Q ss_pred CcCCCCCCCC-ceEEeeeeeeeecccccccccccccccccCCCCCcCCCcccccccccCccccCCCcccccccCCCcccc
Q 003310 554 SAYDSVPEDD-PRLVVEAIQKWNICQKQARREREDNIDIYGDNGTLDSNKIYPEEVKDGNFASTEANGVIEKTKVSPEDK 632 (832)
Q Consensus 554 ~~~~~~~~~~-~~~~ve~~~~Wdv~rr~~~~e~~~~~~i~~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 632 (832)
.-.+...+++ .++.|+|++.|+.||+..|+|++++ |.. +-|+++..++++... .++..+...+.-++
T Consensus 507 ~lSad~~e~ef~~f~V~Ph~~wsslaav~hly~l~r----G~T----saKv~~~afs~dsrw----~A~~t~~~TthVfk 574 (788)
T KOG2109|consen 507 VLSADANENEFNIFLVMPHATWSSLAAVQHLYKLNR----GST----SAKVVSTAFSEDSRW----LAITTNHATTHVFK 574 (788)
T ss_pred eecccccccccceEEeecccccHHHhhhhhhhhccC----CCc----cceeeeeEeecchhh----hhhhhcCCceeeee
Confidence 5555568888 9999999999999999999999996 222 126666665543211 11113444556789
Q ss_pred cceeeeeeEEeecCCCcccccCCeeEEEEeecCccccCcccc--ccceEEEeeccccccccccCCcccccccccCccccc
Q 003310 633 HHLYISEAELQMHPPRIPLWAKPQIYFQSMMIKDFKMGEENF--LKGEIEIERFPTRMIEARSKDLVPVFDYLQSPKFSQ 710 (832)
Q Consensus 633 ~~~ylS~aEvq~h~~~~plW~~~~~~F~~m~~~~~~~~~~~~--~~~~~eie~~~~~~~~~~~~~l~pv~~~~~~~~~~~ 710 (832)
+|.|+-++|++||.. +|||+|.+ .||.|...+.+.+.++. .++|.||+++.++.+|.|+||||||++ .+|+|-+-
T Consensus 575 ~hpYgg~aeqrth~~-lp~vnk~s-rFhrsagl~~d~~~~~s~ggg~e~ei~~~~~~t~e~r~~dllPvy~-~tS~rsr~ 651 (788)
T KOG2109|consen 575 VHPYGGKAEQRTHGD-LPFVNKES-RFHRSAGLTDDADVTASIGGGKEREIADSCSYTKEHRIADLLPVYA-KTSGRSRV 651 (788)
T ss_pred eccccccccceecCC-chhccchh-hhccccCCCccccccccCCCCccceecccccccccccccccCCccc-ccCccccc
Confidence 999999999999999 99999999 99999987765554443 455999999999999999999999999 67766554
Q ss_pred cc
Q 003310 711 AR 712 (832)
Q Consensus 711 ~~ 712 (832)
.|
T Consensus 652 ~~ 653 (788)
T KOG2109|consen 652 GP 653 (788)
T ss_pred cC
Confidence 33
No 3
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=100.00 E-value=1.2e-32 Score=293.54 Aligned_cols=230 Identities=27% Similarity=0.492 Sum_probs=200.2
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
...+|.+|..+|+++|.+++.++ ..+...+.+.+++||. +..|||+|+..
T Consensus 16 d~~~lsvGs~~Gyk~~~~~~~~k---~~~~~~~~~~IvEmLF--------------SSSLvaiV~~~------------- 65 (391)
T KOG2110|consen 16 DSTLLSVGSKDGYKIFSCSPFEK---CFSKDTEGVSIVEMLF--------------SSSLVAIVSIK------------- 65 (391)
T ss_pred ceeEEEccCCCceeEEecCchHH---hhcccCCCeEEEEeec--------------ccceeEEEecC-------------
Confidence 36789999999999999998543 5566678999999995 23589999852
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECCCCceEEEEecC-C
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAATLEIEYAILTN-P 175 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~t~~~~~tl~t~-~ 175 (832)
.++.+++++.+.+..++.+.|+++|.+|++|+++|+|++.++|+|||+.+++.++++.+. |
T Consensus 66 ------------------qpr~Lkv~~~Kk~~~ICe~~fpt~IL~VrmNr~RLvV~Lee~IyIydI~~MklLhTI~t~~~ 127 (391)
T KOG2110|consen 66 ------------------QPRKLKVVHFKKKTTICEIFFPTSILAVRMNRKRLVVCLEESIYIYDIKDMKLLHTIETTPP 127 (391)
T ss_pred ------------------CCceEEEEEcccCceEEEEecCCceEEEEEccceEEEEEcccEEEEecccceeehhhhccCC
Confidence 237899999999999999999999999999999999999999999999999999999987 4
Q ss_pred CccCCCCCCCCCcccceeeecc----ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 176 IVMGHPSAGGIGIGYGPLAVGP----RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 176 ~~~~~p~~~~~~~~~~p~Alg~----r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
++.+. +|+++ .+|||+++
T Consensus 128 n~~gl------------~AlS~n~~n~ylAyp~s---------------------------------------------- 149 (391)
T KOG2110|consen 128 NPKGL------------CALSPNNANCYLAYPGS---------------------------------------------- 149 (391)
T ss_pred Cccce------------EeeccCCCCceEEecCC----------------------------------------------
Confidence 54432 33333 46666520
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP 331 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP 331 (832)
...|.|.|||+.+.+++..+.+|.++|.||+|+|
T Consensus 150 ----------------------------------------------~t~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~ 183 (391)
T KOG2110|consen 150 ----------------------------------------------TTSGDVVLFDTINLQPVNTINAHKGPLAALAFSP 183 (391)
T ss_pred ----------------------------------------------CCCceEEEEEcccceeeeEEEecCCceeEEEECC
Confidence 1246899999999999999999999999999999
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 332 SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
||++|||||++|++||||.+..+ .++|+||||...+.|++++||||+++|+++|..+|||||.++.
T Consensus 184 ~G~llATASeKGTVIRVf~v~~G--------------~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~ 249 (391)
T KOG2110|consen 184 DGTLLATASEKGTVIRVFSVPEG--------------QKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEK 249 (391)
T ss_pred CCCEEEEeccCceEEEEEEcCCc--------------cEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEecc
Confidence 99999999999999999999877 7899999999988899999999999999999999999999976
Q ss_pred C
Q 003310 412 L 412 (832)
Q Consensus 412 ~ 412 (832)
.
T Consensus 250 ~ 250 (391)
T KOG2110|consen 250 V 250 (391)
T ss_pred c
Confidence 4
No 4
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=99.97 E-value=2e-29 Score=264.92 Aligned_cols=238 Identities=26% Similarity=0.411 Sum_probs=196.7
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeee--cCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSR--YDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~--~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
...++++|.++||+||++++ ..|..++ +++.+..++||- ..++||+|+++.
T Consensus 16 D~ScFava~~~Gfriyn~~P---~ke~~~r~~~~~G~~~veMLf--------------R~N~laLVGGg~---------- 68 (346)
T KOG2111|consen 16 DHSCFAVATDTGFRIYNCDP---FKESASRQFIDGGFKIVEMLF--------------RSNYLALVGGGS---------- 68 (346)
T ss_pred CCceEEEEecCceEEEecCc---hhhhhhhccccCchhhhhHhh--------------hhceEEEecCCC----------
Confidence 37899999999999999998 4444442 466688899984 236899998642
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECC-CCceEEEEec
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAA-TLEIEYAILT 173 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~-t~~~~~tl~t 173 (832)
++.|.|+.|.|||-....++.+|.|.++|.+|++.+..|+|.++++|+||... +.+.++.+.|
T Consensus 69 ----------------~pky~pNkviIWDD~k~~~i~el~f~~~I~~V~l~r~riVvvl~~~I~VytF~~n~k~l~~~et 132 (346)
T KOG2111|consen 69 ----------------RPKYPPNKVIIWDDLKERCIIELSFNSEIKAVKLRRDRIVVVLENKIYVYTFPDNPKLLHVIET 132 (346)
T ss_pred ----------------CCCCCCceEEEEecccCcEEEEEEeccceeeEEEcCCeEEEEecCeEEEEEcCCChhheeeeec
Confidence 24578899999999999999999999999999999999999999999999998 7888999999
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
.+||.|++ ++ . |. . .|++
T Consensus 133 ~~NPkGlC------------~~-------~-------------~~-------------~-----------~k~~------ 150 (346)
T KOG2111|consen 133 RSNPKGLC------------SL-------C-------------PT-------------S-----------NKSL------ 150 (346)
T ss_pred ccCCCceE------------ee-------c-------------CC-------------C-----------CceE------
Confidence 87766542 11 1 00 0 0111
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcE--EEEeccCCCCeEEEEEcC
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNV--IAQFRAHKSPISALCFDP 331 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~--l~~~~aH~~pIs~LaFSP 331 (832)
.++|+. ..|.|+|-|+...+. ...++||.++|.||+.+-
T Consensus 151 -------------------------LafPg~--------------k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~ 191 (346)
T KOG2111|consen 151 -------------------------LAFPGF--------------KTGQVQIVDLASTKPNAPSIINAHDSDIACVALNL 191 (346)
T ss_pred -------------------------EEcCCC--------------ccceEEEEEhhhcCcCCceEEEcccCceeEEEEcC
Confidence 123332 347899999887654 578899999999999999
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 332 SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+|++|||||.+||.|||||..++ ..+++||||...|.|++|+||||++|||++|+.||+|||.+..
T Consensus 192 ~Gt~vATaStkGTLIRIFdt~~g--------------~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdKgTlHiF~l~~ 257 (346)
T KOG2111|consen 192 QGTLVATASTKGTLIRIFDTEDG--------------TLLQELRRGVDRADIYCIAFSPNSSWLAVSSDKGTLHIFSLRD 257 (346)
T ss_pred CccEEEEeccCcEEEEEEEcCCC--------------cEeeeeecCCchheEEEEEeCCCccEEEEEcCCCeEEEEEeec
Confidence 99999999999999999999987 7899999999999999999999999999999999999999976
Q ss_pred C
Q 003310 412 L 412 (832)
Q Consensus 412 ~ 412 (832)
.
T Consensus 258 ~ 258 (346)
T KOG2111|consen 258 T 258 (346)
T ss_pred C
Confidence 3
No 5
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.90 E-value=3.4e-23 Score=221.58 Aligned_cols=280 Identities=16% Similarity=0.256 Sum_probs=205.3
Q ss_pred CCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 16 ATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 16 ~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
+..+.|+.|..+| +.+||-......-.-+.+|...|.++.+.|....+ ..| +||-++
T Consensus 167 PDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl~p--------~~r-~las~s------------- 224 (480)
T KOG0271|consen 167 PDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHLVP--------PCR-RLASSS------------- 224 (480)
T ss_pred CCcchhhccccCCeEEEecCCCCCcccccccCcccceeEEeecccccCC--------Ccc-ceeccc-------------
Confidence 4466666666555 99999877666677778899999999988743211 111 333222
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEc-CCEEEE-EeCCEEEEEECCCCceEEEE
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCS-SRVVAI-CQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S-~r~LAV-a~~~~I~vwDl~t~~~~~tl 171 (832)
.+++|+|||+..++++.++.. ..+|.+|++- ..+|.. +.|++|++|++..+.+.+++
T Consensus 225 --------------------kDg~vrIWd~~~~~~~~~lsgHT~~VTCvrwGG~gliySgS~DrtIkvw~a~dG~~~r~l 284 (480)
T KOG0271|consen 225 --------------------KDGSVRIWDTKLGTCVRTLSGHTASVTCVRWGGEGLIYSGSQDRTIKVWRALDGKLCREL 284 (480)
T ss_pred --------------------CCCCEEEEEccCceEEEEeccCccceEEEEEcCCceEEecCCCceEEEEEccchhHHHhh
Confidence 237899999999999999975 6699999997 466666 67999999999999999999
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccce----EEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccce
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRW----LAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL 247 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~----LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l 247 (832)
.+|. -.+|.+|++..| =||- ++|+..+
T Consensus 285 kGHa------------hwvN~lalsTdy~LRtgaf~-------~t~~~~~------------------------------ 315 (480)
T KOG0271|consen 285 KGHA------------HWVNHLALSTDYVLRTGAFD-------HTGRKPK------------------------------ 315 (480)
T ss_pred cccc------------hheeeeeccchhhhhccccc-------cccccCC------------------------------
Confidence 9884 245566665421 1111 1111100
Q ss_pred eceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECC-CCcEEEEeccCCCCeEE
Q 003310 248 AAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIV-SKNVIAQFRAHKSPISA 326 (832)
Q Consensus 248 asGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~-s~~~l~~~~aH~~pIs~ 326 (832)
.+.|...+.+.+|-.. .+++ | -.++++.+|+++.+|+-. +.+++..+.+|..-|..
T Consensus 316 -----~~se~~~~Al~rY~~~-~~~~--------------~---erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~lVn~ 372 (480)
T KOG0271|consen 316 -----SFSEEQKKALERYEAV-LKDS--------------G---ERLVSGSDDFTLFLWNPFKSKKPITRMTGHQALVNH 372 (480)
T ss_pred -----ChHHHHHHHHHHHHHh-hccC--------------c---ceeEEecCCceEEEecccccccchhhhhchhhheee
Confidence 0011122334444321 1111 1 135677889999999964 56799999999999999
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEE
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHL 406 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhI 406 (832)
+.|||||+++|+||.|.. |++||-.+| ..+..| |||- +.|+.|+||.|+++|+++|.|.|++|
T Consensus 373 V~fSPd~r~IASaSFDkS-VkLW~g~tG--------------k~lasf-RGHv-~~VYqvawsaDsRLlVS~SkDsTLKv 435 (480)
T KOG0271|consen 373 VSFSPDGRYIASASFDKS-VKLWDGRTG--------------KFLASF-RGHV-AAVYQVAWSADSRLLVSGSKDSTLKV 435 (480)
T ss_pred EEECCCccEEEEeecccc-eeeeeCCCc--------------chhhhh-hhcc-ceeEEEEeccCccEEEEcCCCceEEE
Confidence 999999999999999976 999999988 556666 7875 45999999999999999999999999
Q ss_pred EecCCCCCceeeccCCCCcc
Q 003310 407 FAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 407 wdl~~~g~~~~~~~H~~~~~ 426 (832)
|++.+.+-...+-+|.++.-
T Consensus 436 w~V~tkKl~~DLpGh~DEVf 455 (480)
T KOG0271|consen 436 WDVRTKKLKQDLPGHADEVF 455 (480)
T ss_pred EEeeeeeecccCCCCCceEE
Confidence 99999888888999988765
No 6
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.87 E-value=4.9e-21 Score=220.35 Aligned_cols=179 Identities=20% Similarity=0.331 Sum_probs=140.5
Q ss_pred CCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccc
Q 003310 116 PTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
+++||+|+|.|..++...+. ..||+.|+|++ -++|. +.|++-++|.......++.+.+|-.-. +
T Consensus 472 D~svRLWsl~t~s~~V~y~GH~~PVwdV~F~P~GyYFatas~D~tArLWs~d~~~PlRifaghlsDV------------~ 539 (707)
T KOG0263|consen 472 DSSVRLWSLDTWSCLVIYKGHLAPVWDVQFAPRGYYFATASHDQTARLWSTDHNKPLRIFAGHLSDV------------D 539 (707)
T ss_pred CcceeeeecccceeEEEecCCCcceeeEEecCCceEEEecCCCceeeeeecccCCchhhhccccccc------------c
Confidence 48899999999999998885 46999999986 46666 578889999987766666665552111 1
Q ss_pred eeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccC
Q 003310 192 PLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLP 271 (832)
Q Consensus 192 p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p 271 (832)
+ ++|. | +..
T Consensus 540 c-------v~FH-------------P--------------Ns~------------------------------------- 548 (707)
T KOG0263|consen 540 C-------VSFH-------------P--------------NSN------------------------------------- 548 (707)
T ss_pred e-------EEEC-------------C--------------ccc-------------------------------------
Confidence 1 1221 1 000
Q ss_pred CCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 272 DSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 272 ~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
-.++++.|.+|++||+.+|..++.|.+|+++|.+|+|||+|++||+|+.||. |+|||+
T Consensus 549 ---------------------Y~aTGSsD~tVRlWDv~~G~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~-I~iWDl 606 (707)
T KOG0263|consen 549 ---------------------YVATGSSDRTVRLWDVSTGNSVRIFTGHKGPVTALAFSPCGRYLASGDEDGL-IKIWDL 606 (707)
T ss_pred ---------------------ccccCCCCceEEEEEcCCCcEEEEecCCCCceEEEEEcCCCceEeecccCCc-EEEEEc
Confidence 0123467889999999999999999999999999999999999999999997 999999
Q ss_pred CCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 352 IPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 352 ~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
..+ ..+..+ +|| ...|.+|.||.||..||+++.|.+|++||+....+.
T Consensus 607 ~~~--------------~~v~~l-~~H-t~ti~SlsFS~dg~vLasgg~DnsV~lWD~~~~~~~ 654 (707)
T KOG0263|consen 607 ANG--------------SLVKQL-KGH-TGTIYSLSFSRDGNVLASGGADNSVRLWDLTKVIEL 654 (707)
T ss_pred CCC--------------cchhhh-hcc-cCceeEEEEecCCCEEEecCCCCeEEEEEchhhccc
Confidence 876 333344 677 456999999999999999999999999999765443
No 7
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.87 E-value=5.1e-20 Score=188.74 Aligned_cols=276 Identities=20% Similarity=0.292 Sum_probs=188.5
Q ss_pred cEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccc
Q 003310 19 RVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATA 98 (832)
Q Consensus 19 ~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~ 98 (832)
.+...||+..+++|...+ |.+...+...|+.|..+++.|+. ..||..+.
T Consensus 12 iLvsA~YDhTIRfWqa~t-G~C~rTiqh~dsqVNrLeiTpdk--------------~~LAaa~~---------------- 60 (311)
T KOG0315|consen 12 ILVSAGYDHTIRFWQALT-GICSRTIQHPDSQVNRLEITPDK--------------KDLAAAGN---------------- 60 (311)
T ss_pred EEEeccCcceeeeeehhc-CeEEEEEecCccceeeEEEcCCc--------------chhhhccC----------------
Confidence 344459999999999987 78888888789999999998742 13443321
Q ss_pred cCCCCCCCCCCCCCCCCCCEEEEEECCCCc--EEEEEeCC-CCEEEEEE--cCCEEEE-EeCCEEEEEECCCCceEEEEe
Q 003310 99 CNGTSANYHDLGNGSSVPTVVHFYSLRSQS--YVHMLKFR-SPIYSVRC--SSRVVAI-CQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 99 ~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~--~V~tL~f~-s~V~sV~~--S~r~LAV-a~~~~I~vwDl~t~~~~~tl~ 172 (832)
-.||+||+.++. .+.++... ..|.+|.| .+|.+.. +.|++++|||++...+.+.+.
T Consensus 61 ------------------qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseDgt~kIWdlR~~~~qR~~~ 122 (311)
T KOG0315|consen 61 ------------------QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSEDGTVKIWDLRSLSCQRNYQ 122 (311)
T ss_pred ------------------CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCCceEEEEeccCcccchhcc
Confidence 359999999886 57788764 68999988 4788877 678899999999977655442
Q ss_pred cCCCccCCCCCCCCCcccceeeeccc--eEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceece
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPR--WLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAG 250 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r--~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasG 250 (832)
- +. .++.+.+.|. -|-.++ . +|
T Consensus 123 ~-~s------------pVn~vvlhpnQteLis~d---------------------------q----------------sg 146 (311)
T KOG0315|consen 123 H-NS------------PVNTVVLHPNQTELISGD---------------------------Q----------------SG 146 (311)
T ss_pred C-CC------------CcceEEecCCcceEEeec---------------------------C----------------CC
Confidence 2 11 1222332220 000000 0 01
Q ss_pred eeeccCccccccccccc-cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC------cEEEEeccCCCC
Q 003310 251 IVNLGDLGYKKLSQYCS-EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK------NVIAQFRAHKSP 323 (832)
Q Consensus 251 i~~lGd~g~~~ls~y~~-~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~------~~l~~~~aH~~p 323 (832)
-..++|.+. ..|. .+.|+...++++..-.+ +| ..++.+.+.|...+|++-+. .++..|++|.+.
T Consensus 147 ~irvWDl~~----~~c~~~liPe~~~~i~sl~v~~--dg---sml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~ 217 (311)
T KOG0315|consen 147 NIRVWDLGE----NSCTHELIPEDDTSIQSLTVMP--DG---SMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGH 217 (311)
T ss_pred cEEEEEccC----CccccccCCCCCcceeeEEEcC--CC---cEEEEecCCccEEEEEccCCCccccceEhhheecccce
Confidence 122233221 1222 24555544444321111 12 12445678899999999764 578889999999
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCc
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGT 403 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgT 403 (832)
|....||||+++|||+|.|.+ ++||.+... ...-..+ .|+ ...+++++||.||+||+++|.|++
T Consensus 218 il~C~lSPd~k~lat~ssdkt-v~iwn~~~~-------------~kle~~l-~gh-~rWvWdc~FS~dg~YlvTassd~~ 281 (311)
T KOG0315|consen 218 ILRCLLSPDVKYLATCSSDKT-VKIWNTDDF-------------FKLELVL-TGH-QRWVWDCAFSADGEYLVTASSDHT 281 (311)
T ss_pred EEEEEECCCCcEEEeecCCce-EEEEecCCc-------------eeeEEEe-ecC-CceEEeeeeccCccEEEecCCCCc
Confidence 999999999999999999977 899998763 0111222 233 236999999999999999999999
Q ss_pred EEEEecCCCCCceeeccCCCC
Q 003310 404 SHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 404 VhIwdl~~~g~~~~~~~H~~~ 424 (832)
+|+|++...+......+|...
T Consensus 282 ~rlW~~~~~k~v~qy~gh~K~ 302 (311)
T KOG0315|consen 282 ARLWDLSAGKEVRQYQGHHKA 302 (311)
T ss_pred eeecccccCceeeecCCcccc
Confidence 999999998877888888543
No 8
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.86 E-value=5.3e-21 Score=207.27 Aligned_cols=242 Identities=14% Similarity=0.201 Sum_probs=189.1
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..++..+.++..+||.++.- +....+..|.+.|.++.|-|.- +---||-|.
T Consensus 188 ~~laT~swsG~~kvW~~~~~-~~~~~l~gH~~~v~~~~fhP~~------------~~~~lat~s---------------- 238 (459)
T KOG0272|consen 188 KHLATGSWSGLVKVWSVPQC-NLLQTLRGHTSRVGAAVFHPVD------------SDLNLATAS---------------- 238 (459)
T ss_pred CeEEEeecCCceeEeecCCc-ceeEEEeccccceeeEEEccCC------------Cccceeeec----------------
Confidence 45555566677999999974 6667777899999999998731 001133222
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEec
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t 173 (832)
.+++|++|++.+-..+..|.. ...|-.|+|. +++|+. +.|.+=++||++|...+...++
T Consensus 239 -----------------~Dgtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~ElL~QEG 301 (459)
T KOG0272|consen 239 -----------------ADGTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSELLLQEG 301 (459)
T ss_pred -----------------cCCceeeeccCCCcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchhhHhhcc
Confidence 348899999999889998875 6789999995 688887 7899999999999998888888
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
|+... + -|||-. +|+++
T Consensus 302 Hs~~v--------------~-----~iaf~~---------------------------DGSL~----------------- 318 (459)
T KOG0272|consen 302 HSKGV--------------F-----SIAFQP---------------------------DGSLA----------------- 318 (459)
T ss_pred ccccc--------------c-----eeEecC---------------------------CCcee-----------------
Confidence 75311 1 123321 23322
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCC
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSG 333 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG 333 (832)
++++.|..-+|||+++++++..|.+|..+|..|+|||+|
T Consensus 319 -----------------------------------------~tGGlD~~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNG 357 (459)
T KOG0272|consen 319 -----------------------------------------ATGGLDSLGRVWDLRTGRCIMFLAGHIKEILSVAFSPNG 357 (459)
T ss_pred -----------------------------------------eccCccchhheeecccCcEEEEecccccceeeEeECCCc
Confidence 223456678999999999999999999999999999999
Q ss_pred CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCC
Q 003310 334 ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 334 ~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~ 412 (832)
..|||||.|++ ++|||++.. ..+|.+. +|++ .|..|.|+| .|++|+++|-|+|++||.-...
T Consensus 358 y~lATgs~Dnt-~kVWDLR~r--------------~~ly~ip-AH~n-lVS~Vk~~p~~g~fL~TasyD~t~kiWs~~~~ 420 (459)
T KOG0272|consen 358 YHLATGSSDNT-CKVWDLRMR--------------SELYTIP-AHSN-LVSQVKYSPQEGYFLVTASYDNTVKIWSTRTW 420 (459)
T ss_pred eEEeecCCCCc-EEEeeeccc--------------ccceecc-cccc-hhhheEecccCCeEEEEcccCcceeeecCCCc
Confidence 99999999987 899999875 3466663 4432 499999999 7999999999999999999988
Q ss_pred CCceeeccCCCCcc
Q 003310 413 GGSVNFQPTDANFT 426 (832)
Q Consensus 413 g~~~~~~~H~~~~~ 426 (832)
.....+.+|.+++.
T Consensus 421 ~~~ksLaGHe~kV~ 434 (459)
T KOG0272|consen 421 SPLKSLAGHEGKVI 434 (459)
T ss_pred ccchhhcCCccceE
Confidence 88889999987654
No 9
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.83 E-value=8.7e-20 Score=195.70 Aligned_cols=232 Identities=18% Similarity=0.226 Sum_probs=155.7
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCce-EEEEecCCCccCCCCCCCCCcccc
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEI-EYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~-~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
.||||||+.|....++.+- ...|.+|++++ +.||. +.+++|.+||..++++ ...|.+|..... ++...
T Consensus 137 ~TvR~WD~~TeTp~~t~KgH~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It-------~Lawe 209 (480)
T KOG0271|consen 137 TTVRLWDLDTETPLFTCKGHKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWIT-------ALAWE 209 (480)
T ss_pred ceEEeeccCCCCcceeecCCccEEEEEEECCCcchhhccccCCeEEEecCCCCCcccccccCccccee-------EEeec
Confidence 8999999999999999985 78999999984 77887 7899999999999875 456666632110 11222
Q ss_pred eeeecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 192 PLAVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 192 p~Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
|+-+.| |.||.++. +..+.-+-....+.+. ...|+.
T Consensus 210 p~hl~p~~r~las~sk---------------------------Dg~vrIWd~~~~~~~~---~lsgHT------------ 247 (480)
T KOG0271|consen 210 PLHLVPPCRRLASSSK---------------------------DGSVRIWDTKLGTCVR---TLSGHT------------ 247 (480)
T ss_pred ccccCCCccceecccC---------------------------CCCEEEEEccCceEEE---EeccCc------------
Confidence 222222 44444421 1112111111111110 000111
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEc-----------CCCC----
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFD-----------PSGI---- 334 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFS-----------PdG~---- 334 (832)
.++....-+ | -+.+.++..|++|++|+...|+++.+|++|.+.|+.|+.| |.|.
T Consensus 248 -----~~VTCvrwG----G--~gliySgS~DrtIkvw~a~dG~~~r~lkGHahwvN~lalsTdy~LRtgaf~~t~~~~~~ 316 (480)
T KOG0271|consen 248 -----ASVTCVRWG----G--EGLIYSGSQDRTIKVWRALDGKLCRELKGHAHWVNHLALSTDYVLRTGAFDHTGRKPKS 316 (480)
T ss_pred -----cceEEEEEc----C--CceEEecCCCceEEEEEccchhHHHhhcccchheeeeeccchhhhhccccccccccCCC
Confidence 000000001 0 1234567899999999999999999999999999999877 4444
Q ss_pred ---------------------EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCC
Q 003310 335 ---------------------LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN 393 (832)
Q Consensus 335 ---------------------lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~ 393 (832)
.|++||+|++ +.+|+-... ...+.++ .|| .+.|+.+.||||++
T Consensus 317 ~se~~~~Al~rY~~~~~~~~erlVSgsDd~t-lflW~p~~~-------------kkpi~rm-tgH-q~lVn~V~fSPd~r 380 (480)
T KOG0271|consen 317 FSEEQKKALERYEAVLKDSGERLVSGSDDFT-LFLWNPFKS-------------KKPITRM-TGH-QALVNHVSFSPDGR 380 (480)
T ss_pred hHHHHHHHHHHHHHhhccCcceeEEecCCce-EEEeccccc-------------ccchhhh-hch-hhheeeEEECCCcc
Confidence 4999999998 679986543 1222222 243 45699999999999
Q ss_pred EEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 394 WIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 394 ~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
|||++|-|..|++|+-.++.-..+|++|-..
T Consensus 381 ~IASaSFDkSVkLW~g~tGk~lasfRGHv~~ 411 (480)
T KOG0271|consen 381 YIASASFDKSVKLWDGRTGKFLASFRGHVAA 411 (480)
T ss_pred EEEEeecccceeeeeCCCcchhhhhhhccce
Confidence 9999999999999999998888999999543
No 10
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.82 E-value=6.2e-18 Score=187.41 Aligned_cols=332 Identities=12% Similarity=0.214 Sum_probs=207.5
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEee--ecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVS--RYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS--~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
.++...|.++.+.+||=.+...+.++-+ .|.|.|..+.++|+.. .++-++.
T Consensus 203 ~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPDs~--------------~~~T~Sa------------- 255 (603)
T KOG0318|consen 203 SRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPDST--------------QFLTVSA------------- 255 (603)
T ss_pred CeEEEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCCCc--------------eEEEecC-------------
Confidence 5777888888899999877544555543 6899999999998642 2343432
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEE----E-cCCEEEEEeCCEEEEEECCCCceEEE
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVR----C-SSRVVAICQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~----~-S~r~LAVa~~~~I~vwDl~t~~~~~t 170 (832)
+.++||||+.+.+++.++.+.+.|.+.. + +.++|.|++.+.|..++...+..+++
T Consensus 256 --------------------Dkt~KIWdVs~~slv~t~~~~~~v~dqqvG~lWqkd~lItVSl~G~in~ln~~d~~~~~~ 315 (603)
T KOG0318|consen 256 --------------------DKTIKIWDVSTNSLVSTWPMGSTVEDQQVGCLWQKDHLITVSLSGTINYLNPSDPSVLKV 315 (603)
T ss_pred --------------------CceEEEEEeeccceEEEeecCCchhceEEEEEEeCCeEEEEEcCcEEEEecccCCChhhe
Confidence 2789999999999999999876653332 2 46788889999999999999998899
Q ss_pred EecCCCccCCCCCCCCCcccceeeecc--ceEEeeCCC--ceecCCCccCCcccccccccccccCCCcceeeeecccccc
Q 003310 171 ILTNPIVMGHPSAGGIGIGYGPLAVGP--RWLAYSGSP--VVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKH 246 (832)
Q Consensus 171 l~t~~~~~~~p~~~~~~~~~~p~Alg~--r~LAya~~~--~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~ 246 (832)
+.+|.. +...+++++ .+|-.++.. +..|..|.-....+.+ ...+..+...+...+.+
T Consensus 316 i~GHnK------------~ITaLtv~~d~~~i~SgsyDG~I~~W~~~~g~~~~~~g-------~~h~nqI~~~~~~~~~~ 376 (603)
T KOG0318|consen 316 ISGHNK------------SITALTVSPDGKTIYSGSYDGHINSWDSGSGTSDRLAG-------KGHTNQIKGMAASESGE 376 (603)
T ss_pred eccccc------------ceeEEEEcCCCCEEEeeccCceEEEEecCCcccccccc-------ccccceEEEEeecCCCc
Confidence 988843 233455555 333333222 2223322211111110 00111122222111111
Q ss_pred eec------------------------------eeeeccCcccccccccc-----cc-----ccCCCcCccccc-cCCCC
Q 003310 247 LAA------------------------------GIVNLGDLGYKKLSQYC-----SE-----FLPDSQNSLQSA-IPGGK 285 (832)
Q Consensus 247 las------------------------------Gi~~lGd~g~~~ls~y~-----~~-----~~p~~~~si~sa-~~~~~ 285 (832)
+.+ |+..+.+.++--++-+- .+ ..|-+..+...| +++.
T Consensus 377 ~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~QP~~lav~~d~~~avv~~~~~iv~l~~~~~~~~~~~~y~~s~vAv~~~~- 455 (603)
T KOG0318|consen 377 LFTIGWDDTLRVISLKDNGYTKSEVVKLGSQPKGLAVLSDGGTAVVACISDIVLLQDQTKVSSIPIGYESSAVAVSPDG- 455 (603)
T ss_pred EEEEecCCeEEEEecccCcccccceeecCCCceeEEEcCCCCEEEEEecCcEEEEecCCcceeeccccccceEEEcCCC-
Confidence 110 00000000000000000 00 000111111111 1111
Q ss_pred CCCcccccccccCCCCeEEEEECCCCc--EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccC
Q 003310 286 SNGTVNGHFPDADNVGMVIVRDIVSKN--VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACD 363 (832)
Q Consensus 286 ~~g~~~g~~~s~~~~G~V~IwDl~s~~--~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~ 363 (832)
..++-+..||.|+||.+.... ....+..|..+|++++|||||++||+++..+. +-+||+.+.
T Consensus 456 ------~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rk-vv~yd~~s~--------- 519 (603)
T KOG0318|consen 456 ------SEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRK-VVLYDVASR--------- 519 (603)
T ss_pred ------CEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEeccCCc-EEEEEcccC---------
Confidence 133457899999999998754 34456789999999999999999999999987 789999875
Q ss_pred CCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec-cCCCCcccccCCcccccccCCCC
Q 003310 364 AGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ-PTDANFTTKHGAMAKSGVRWPPN 442 (832)
Q Consensus 364 ~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~-~H~~~~~~~~~~~~~~~~r~~~~ 442 (832)
....-+.+++.++|.+++|||++++||+||.|.+|+||.++.....+.++ +| +.. .
T Consensus 520 ------~~~~~~w~FHtakI~~~aWsP~n~~vATGSlDt~Viiysv~kP~~~i~iknAH----------------~~g-V 576 (603)
T KOG0318|consen 520 ------EVKTNRWAFHTAKINCVAWSPNNKLVATGSLDTNVIIYSVKKPAKHIIIKNAH----------------LGG-V 576 (603)
T ss_pred ------ceecceeeeeeeeEEEEEeCCCceEEEeccccceEEEEEccChhhheEecccc----------------ccC-c
Confidence 11333456677899999999999999999999999999999887776664 45 333 4
Q ss_pred CCCCCCCCccccc
Q 003310 443 LGLQMPNQQSLCA 455 (832)
Q Consensus 443 s~~~~~~~~~l~~ 455 (832)
..+.|+++.++..
T Consensus 577 n~v~wlde~tvvS 589 (603)
T KOG0318|consen 577 NSVAWLDESTVVS 589 (603)
T ss_pred eeEEEecCceEEe
Confidence 5567777777643
No 11
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.82 E-value=2.7e-18 Score=196.23 Aligned_cols=278 Identities=18% Similarity=0.236 Sum_probs=181.9
Q ss_pred CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCC-CcccccCCccccc-CCEEEEEeCCCCccCccccCC
Q 003310 18 RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRP-ITSKRSRDKFAEV-RPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 18 ~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p-~~~~~~~d~f~~~-rPLLavv~~g~~~g~~~~~Dg 94 (832)
-.+|++|+.+| |.+|.+.+. ++-..+|--+-+|..+.+-..+ |. .|... --.|.|+-.-.-.=.-..|.+
T Consensus 277 t~~lvvgFssG~f~LyelP~f-~lih~LSis~~~I~t~~~N~tGDWi------A~g~~klgQLlVweWqsEsYVlKQQgH 349 (893)
T KOG0291|consen 277 TNLLVVGFSSGEFGLYELPDF-NLIHSLSISDQKILTVSFNSTGDWI------AFGCSKLGQLLVWEWQSESYVLKQQGH 349 (893)
T ss_pred ceEEEEEecCCeeEEEecCCc-eEEEEeecccceeeEEEecccCCEE------EEcCCccceEEEEEeeccceeeecccc
Confidence 57888999999 679999763 4555666667777777664322 11 11111 112334432110000000111
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEE-EEeCCEEEEEECCCCceEEE
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVA-ICQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LA-Va~~~~I~vwDl~t~~~~~t 170 (832)
.......+-+++.+.......+++||+||..+|-|+.|+. +.+.|.++.|+ ++.|. .++|++|+.||+...++.+|
T Consensus 350 ~~~i~~l~YSpDgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrNfRT 429 (893)
T KOG0291|consen 350 SDRITSLAYSPDGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRNFRT 429 (893)
T ss_pred ccceeeEEECCCCcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEeeeecccceeee
Confidence 1111111112222222233367999999999999999995 67899999996 45554 48999999999999998888
Q ss_pred EecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceece
Q 003310 171 ILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAG 250 (832)
Q Consensus 171 l~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasG 250 (832)
+.. |.+ +..+.+|+.| .|.+| .
T Consensus 430 ft~-P~p----------~QfscvavD~----------------------------------sGelV-----------~-- 451 (893)
T KOG0291|consen 430 FTS-PEP----------IQFSCVAVDP----------------------------------SGELV-----------C-- 451 (893)
T ss_pred ecC-CCc----------eeeeEEEEcC----------------------------------CCCEE-----------E--
Confidence 743 322 1222333211 12221 0
Q ss_pred eeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEc
Q 003310 251 IVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFD 330 (832)
Q Consensus 251 i~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFS 330 (832)
+.+.+.=.|.||++++|+.+..+.+|.+||.+|+|+
T Consensus 452 --------------------------------------------AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~ 487 (893)
T KOG0291|consen 452 --------------------------------------------AGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFS 487 (893)
T ss_pred --------------------------------------------eeccceEEEEEEEeecCeeeehhcCCCCcceeeEEc
Confidence 001122369999999999999999999999999999
Q ss_pred CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 331 PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 331 PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
|+|.+|||+|.|.| ||+||+... .+.++ ++. ....+.+++|+|||+-||+++.||-|-+||+.
T Consensus 488 ~~~~~LaS~SWDkT-VRiW~if~s----------~~~vE---tl~---i~sdvl~vsfrPdG~elaVaTldgqItf~d~~ 550 (893)
T KOG0291|consen 488 PDGSLLASGSWDKT-VRIWDIFSS----------SGTVE---TLE---IRSDVLAVSFRPDGKELAVATLDGQITFFDIK 550 (893)
T ss_pred cccCeEEeccccce-EEEEEeecc----------Cceee---eEe---eccceeEEEEcCCCCeEEEEEecceEEEEEhh
Confidence 99999999999987 999999753 11222 232 12458999999999999999999999999998
Q ss_pred CCCCceeeccC
Q 003310 411 PLGGSVNFQPT 421 (832)
Q Consensus 411 ~~g~~~~~~~H 421 (832)
...+..++.+-
T Consensus 551 ~~~q~~~Idgr 561 (893)
T KOG0291|consen 551 EAVQVGSIDGR 561 (893)
T ss_pred hceeeccccch
Confidence 87766666553
No 12
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.80 E-value=6.4e-19 Score=191.26 Aligned_cols=191 Identities=14% Similarity=0.160 Sum_probs=155.6
Q ss_pred CCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcCC----EEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCc
Q 003310 115 VPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSSR----VVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~r----~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
+.+.+++|+..+...+++|.- .+.|.++.|.+. -||. +.|+++++|++.+-+.+..+.+|...
T Consensus 195 wsG~~kvW~~~~~~~~~~l~gH~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~e~~l~~l~gH~~R----------- 263 (459)
T KOG0272|consen 195 WSGLVKVWSVPQCNLLQTLRGHTSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQETPLQDLEGHLAR----------- 263 (459)
T ss_pred cCCceeEeecCCcceeEEEeccccceeeEEEccCCCccceeeeccCCceeeeccCCCcchhhhhcchhh-----------
Confidence 558899999999999999975 679999999753 4565 68999999999998888888887421
Q ss_pred ccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccc
Q 003310 189 GYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSE 268 (832)
Q Consensus 189 ~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~ 268 (832)
+.. +||-+ +|.
T Consensus 264 -Vs~-------VafHP---------------------------sG~---------------------------------- 274 (459)
T KOG0272|consen 264 -VSR-------VAFHP---------------------------SGK---------------------------------- 274 (459)
T ss_pred -hee-------eeecC---------------------------CCc----------------------------------
Confidence 111 12221 111
Q ss_pred ccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEE
Q 003310 269 FLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINI 348 (832)
Q Consensus 269 ~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~I 348 (832)
.+.++..|.+-++||+.+++.+...++|..+|.+++|.|||.+++||+.|.. -||
T Consensus 275 ------------------------~L~TasfD~tWRlWD~~tk~ElL~QEGHs~~v~~iaf~~DGSL~~tGGlD~~-~Rv 329 (459)
T KOG0272|consen 275 ------------------------FLGTASFDSTWRLWDLETKSELLLQEGHSKGVFSIAFQPDGSLAATGGLDSL-GRV 329 (459)
T ss_pred ------------------------eeeecccccchhhcccccchhhHhhcccccccceeEecCCCceeeccCccch-hhe
Confidence 1223556778999999999999999999999999999999999999999976 799
Q ss_pred EeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 349 FKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 349 wdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
||++++ +++..|. ||. ..|.+|+|||+|..||+||.|+|++|||+.......++-+|++...
T Consensus 330 WDlRtg--------------r~im~L~-gH~-k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~nlVS 391 (459)
T KOG0272|consen 330 WDLRTG--------------RCIMFLA-GHI-KEILSVAFSPNGYHLATGSSDNTCKVWDLRMRSELYTIPAHSNLVS 391 (459)
T ss_pred eecccC--------------cEEEEec-ccc-cceeeEeECCCceEEeecCCCCcEEEeeecccccceecccccchhh
Confidence 999998 5666664 654 4699999999999999999999999999999888889999987655
No 13
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.79 E-value=1.1e-16 Score=163.23 Aligned_cols=221 Identities=17% Similarity=0.310 Sum_probs=162.5
Q ss_pred cEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccc
Q 003310 19 RVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATA 98 (832)
Q Consensus 19 ~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~ 98 (832)
.+++.+.++.++|||+... .....+..+.+.|.++.+.|.. .+++.+..
T Consensus 65 ~l~~~~~~~~i~i~~~~~~-~~~~~~~~~~~~i~~~~~~~~~--------------~~~~~~~~---------------- 113 (289)
T cd00200 65 YLASGSSDKTIRLWDLETG-ECVRTLTGHTSYVSSVAFSPDG--------------RILSSSSR---------------- 113 (289)
T ss_pred EEEEEcCCCeEEEEEcCcc-cceEEEeccCCcEEEEEEcCCC--------------CEEEEecC----------------
Confidence 5555566777999999863 3444455677889988887631 23433321
Q ss_pred cCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEcC--CEEEEEe-CCEEEEEECCCCceEEEEecC
Q 003310 99 CNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS--RVVAICQ-AAQVHCFDAATLEIEYAILTN 174 (832)
Q Consensus 99 ~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~--r~LAVa~-~~~I~vwDl~t~~~~~tl~t~ 174 (832)
++.|++||+++++.+..+. +...|.++.+++ ++++++. ++.|++||+.+++.+..+..+
T Consensus 114 -----------------~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~ 176 (289)
T cd00200 114 -----------------DKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGH 176 (289)
T ss_pred -----------------CCeEEEEECCCcEEEEEeccCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEecC
Confidence 1679999999999998887 467899999975 6777766 899999999988776666543
Q ss_pred CCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeec
Q 003310 175 PIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNL 254 (832)
Q Consensus 175 ~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~l 254 (832)
... + .-+++.. ++.
T Consensus 177 ~~~---------------i----~~~~~~~---------------------------~~~-------------------- 190 (289)
T cd00200 177 TGE---------------V----NSVAFSP---------------------------DGE-------------------- 190 (289)
T ss_pred ccc---------------c----ceEEECC---------------------------CcC--------------------
Confidence 210 0 1112221 000
Q ss_pred cCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC
Q 003310 255 GDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI 334 (832)
Q Consensus 255 Gd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~ 334 (832)
.++.+..+|.|++||+.+++.+..++.|..+|.+++|+|++.
T Consensus 191 --------------------------------------~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~ 232 (289)
T cd00200 191 --------------------------------------KLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGY 232 (289)
T ss_pred --------------------------------------EEEEecCCCcEEEEECCCCceecchhhcCCceEEEEEcCCCc
Confidence 011123378899999999999999999999999999999999
Q ss_pred EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 335 LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 335 lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
++++++.+|. |++||+.++ ..+..+. ++ ...|.+++|+|++++|++++.|++++||+
T Consensus 233 ~~~~~~~~~~-i~i~~~~~~--------------~~~~~~~-~~-~~~i~~~~~~~~~~~l~~~~~d~~i~iw~ 289 (289)
T cd00200 233 LLASGSEDGT-IRVWDLRTG--------------ECVQTLS-GH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289 (289)
T ss_pred EEEEEcCCCc-EEEEEcCCc--------------eeEEEcc-cc-CCcEEEEEECCCCCEEEEecCCCeEEecC
Confidence 9999998887 899999865 3344443 33 34599999999999999999999999996
No 14
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.79 E-value=3.4e-17 Score=171.31 Aligned_cols=232 Identities=14% Similarity=0.162 Sum_probs=166.2
Q ss_pred CEEEEEECCCCcEEEEEeCCC-CEEEEEEc--CCEEEE-EeCCEEEEEECCCC------ceEEEEecCCCccCCCCCCCC
Q 003310 117 TVVHFYSLRSQSYVHMLKFRS-PIYSVRCS--SRVVAI-CQAAQVHCFDAATL------EIEYAILTNPIVMGHPSAGGI 186 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s-~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~------~~~~tl~t~~~~~~~p~~~~~ 186 (832)
+.+.|||.-|...+|-|..++ .|...+|+ +++||. ++++...||++.+. +..+.+.+|.. +
T Consensus 77 GklIvWDs~TtnK~haipl~s~WVMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtg---y------ 147 (343)
T KOG0286|consen 77 GKLIVWDSFTTNKVHAIPLPSSWVMTCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTG---Y------ 147 (343)
T ss_pred CeEEEEEcccccceeEEecCceeEEEEEECCCCCeEEecCcCceeEEEecccccccccceeeeeecCccc---e------
Confidence 789999999999999999965 89999996 578887 68999999999865 22344555521 1
Q ss_pred CcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccc
Q 003310 187 GIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYC 266 (832)
Q Consensus 187 ~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~ 266 (832)
+ + ..-|..+..+ ++ +|...+-+.|-.+.++++.. +.|+.+
T Consensus 148 ------l--S--cC~f~dD~~i-----------lT--------~SGD~TCalWDie~g~~~~~---f~GH~g-------- 187 (343)
T KOG0286|consen 148 ------L--S--CCRFLDDNHI-----------LT--------GSGDMTCALWDIETGQQTQV---FHGHTG-------- 187 (343)
T ss_pred ------e--E--EEEEcCCCce-----------Ee--------cCCCceEEEEEcccceEEEE---ecCCcc--------
Confidence 1 0 0112222111 11 22345566666666665431 122221
Q ss_pred ccccCCCcCcccc-ccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCE
Q 003310 267 SEFLPDSQNSLQS-AIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHN 345 (832)
Q Consensus 267 ~~~~p~~~~si~s-a~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~ 345 (832)
.+.+ ..++. + ..+|+++..|+..+|||++++.++.+|.+|.+.|++|+|-|+|.-+||||+|++
T Consensus 188 ---------DV~slsl~p~--~---~ntFvSg~cD~~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGSDD~t- 252 (343)
T KOG0286|consen 188 ---------DVMSLSLSPS--D---GNTFVSGGCDKSAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGSDDAT- 252 (343)
T ss_pred ---------cEEEEecCCC--C---CCeEEecccccceeeeeccCcceeEeecccccccceEEEccCCCeeeecCCCce-
Confidence 1100 01110 1 136889999999999999999999999999999999999999999999999998
Q ss_pred EEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCc
Q 003310 346 INIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANF 425 (832)
Q Consensus 346 I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~ 425 (832)
.|+||++.. +.+..+..-.....|.+|+||..|++|.+|..|.+++|||.-+....-.+.+|.+.+
T Consensus 253 cRlyDlRaD--------------~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk~e~vg~L~GHeNRv 318 (343)
T KOG0286|consen 253 CRLYDLRAD--------------QELAVYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNVWDTLKGERVGVLAGHENRV 318 (343)
T ss_pred eEEEeecCC--------------cEEeeeccCcccCCceeEEEcccccEEEeeecCCceeEeeccccceEEEeeccCCee
Confidence 899999986 333333222223459999999999999999999999999998876667889997765
Q ss_pred c
Q 003310 426 T 426 (832)
Q Consensus 426 ~ 426 (832)
.
T Consensus 319 S 319 (343)
T KOG0286|consen 319 S 319 (343)
T ss_pred E
Confidence 5
No 15
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.79 E-value=2.4e-17 Score=171.57 Aligned_cols=186 Identities=15% Similarity=0.297 Sum_probs=141.3
Q ss_pred CCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcc
Q 003310 114 SVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 114 ~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
.+++++|+||+.+|+..+.|.. ..-|.+|+|++ +++..+-|++|.+||.- ++|.+++.....
T Consensus 82 swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~-g~ck~t~~~~~~------------- 147 (315)
T KOG0279|consen 82 SWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTL-GVCKYTIHEDSH------------- 147 (315)
T ss_pred cccceEEEEEecCCcEEEEEEecCCceEEEEecCCCceeecCCCcceeeeeeec-ccEEEEEecCCC-------------
Confidence 4679999999999988887765 56899999974 44444679999999976 456777754310
Q ss_pred cceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 190 YGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 190 ~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
..|+..- |++|...
T Consensus 148 -------~~WVscv----------rfsP~~~------------------------------------------------- 161 (315)
T KOG0279|consen 148 -------REWVSCV----------RFSPNES------------------------------------------------- 161 (315)
T ss_pred -------cCcEEEE----------EEcCCCC-------------------------------------------------
Confidence 0111110 0011100
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIF 349 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iw 349 (832)
+..+++++.|++|+|||+.+.+....|.+|++.++.++|||||.++|+|+.||. +.+|
T Consensus 162 ---------------------~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpDGslcasGgkdg~-~~Lw 219 (315)
T KOG0279|consen 162 ---------------------NPIIVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPDGSLCASGGKDGE-AMLW 219 (315)
T ss_pred ---------------------CcEEEEccCCceEEEEccCCcchhhccccccccEEEEEECCCCCEEecCCCCce-EEEE
Confidence 012455678999999999999999999999999999999999999999999998 8999
Q ss_pred eCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 350 KIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 350 di~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
|++.+ +++|.|.- ...|.+++|+|.--||+.+... .|+|||+.+......++
T Consensus 220 dL~~~--------------k~lysl~a---~~~v~sl~fspnrywL~~at~~-sIkIwdl~~~~~v~~l~ 271 (315)
T KOG0279|consen 220 DLNEG--------------KNLYSLEA---FDIVNSLCFSPNRYWLCAATAT-SIKIWDLESKAVVEELK 271 (315)
T ss_pred EccCC--------------ceeEeccC---CCeEeeEEecCCceeEeeccCC-ceEEEeccchhhhhhcc
Confidence 99987 77898842 2359999999998888877654 59999999876665554
No 16
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.78 E-value=2.6e-16 Score=160.40 Aligned_cols=236 Identities=19% Similarity=0.308 Sum_probs=171.1
Q ss_pred CcEEEEEe-cCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGY-RSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy-~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+|+.|. ++.+++||+.. +.....+..|..++..+.+.|.. ..+++++.
T Consensus 21 ~~~l~~~~~~g~i~i~~~~~-~~~~~~~~~~~~~i~~~~~~~~~--------------~~l~~~~~-------------- 71 (289)
T cd00200 21 GKLLATGSGDGTIKVWDLET-GELLRTLKGHTGPVRDVAASADG--------------TYLASGSS-------------- 71 (289)
T ss_pred CCEEEEeecCcEEEEEEeeC-CCcEEEEecCCcceeEEEECCCC--------------CEEEEEcC--------------
Confidence 35555555 66699999986 34555556678888777777632 23444432
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEEEe-CCEEEEEECCCCceEEEEe
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAICQ-AAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAVa~-~~~I~vwDl~t~~~~~tl~ 172 (832)
++.|++||+.+++.+..+.. ...|.++.+++ ++++++. ++.|++||+.+++....+.
T Consensus 72 -------------------~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 132 (289)
T cd00200 72 -------------------DKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR 132 (289)
T ss_pred -------------------CCeEEEEEcCcccceEEEeccCCcEEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEec
Confidence 26799999999888888875 45899999975 6777776 8999999999888777665
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.+.. ++ ..+++.. ++..
T Consensus 133 ~~~~---------------~i----~~~~~~~---------------------------~~~~----------------- 149 (289)
T cd00200 133 GHTD---------------WV----NSVAFSP---------------------------DGTF----------------- 149 (289)
T ss_pred cCCC---------------cE----EEEEEcC---------------------------cCCE-----------------
Confidence 4321 01 1122221 0000
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS 332 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd 332 (832)
++.+..+|.|++||+.+++++..+..|..+|.+++|+|+
T Consensus 150 -----------------------------------------l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~ 188 (289)
T cd00200 150 -----------------------------------------VASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD 188 (289)
T ss_pred -----------------------------------------EEEEcCCCcEEEEEccccccceeEecCccccceEEECCC
Confidence 111234788999999999999999999999999999999
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
|+.|++++.+|. |++||+..+ ..+..+. ++ ...|.+++|+|++.++++++.||++++|++...
T Consensus 189 ~~~l~~~~~~~~-i~i~d~~~~--------------~~~~~~~-~~-~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~ 251 (289)
T cd00200 189 GEKLLSSSSDGT-IKLWDLSTG--------------KCLGTLR-GH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTG 251 (289)
T ss_pred cCEEEEecCCCc-EEEEECCCC--------------ceecchh-hc-CCceEEEEEcCCCcEEEEEcCCCcEEEEEcCCc
Confidence 999999999987 899999865 2222331 22 235999999999999999988999999999876
Q ss_pred CCceeeccCC
Q 003310 413 GGSVNFQPTD 422 (832)
Q Consensus 413 g~~~~~~~H~ 422 (832)
.....+..|.
T Consensus 252 ~~~~~~~~~~ 261 (289)
T cd00200 252 ECVQTLSGHT 261 (289)
T ss_pred eeEEEccccC
Confidence 6555666553
No 17
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.78 E-value=6.7e-17 Score=177.51 Aligned_cols=269 Identities=17% Similarity=0.300 Sum_probs=188.1
Q ss_pred CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+|+.|.-+| ++||+.+ ++....+..|.|||..+++-.. +. .|+ +.+
T Consensus 247 G~~LatG~~~G~~riw~~~--G~l~~tl~~HkgPI~slKWnk~--------G~------yil--S~~------------- 295 (524)
T KOG0273|consen 247 GTLLATGSEDGEARIWNKD--GNLISTLGQHKGPIFSLKWNKK--------GT------YIL--SGG------------- 295 (524)
T ss_pred CCeEEEeecCcEEEEEecC--chhhhhhhccCCceEEEEEcCC--------CC------EEE--ecc-------------
Confidence 77888888777 7999987 4677788899999999998532 12 222 211
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCE-EEEEEc-CCEEEE-EeCCEEEEEECCCCceEEEEec
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPI-YSVRCS-SRVVAI-CQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V-~sV~~S-~r~LAV-a~~~~I~vwDl~t~~~~~tl~t 173 (832)
.++++-+||..+|++...+.|.+.. .+|.+- ..-+|. ..++.|+||-+..-....++.+
T Consensus 296 ------------------vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P~~t~~G 357 (524)
T KOG0273|consen 296 ------------------VDGTTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEFATSSTDGCIHVCKVGEDRPVKTFIG 357 (524)
T ss_pred ------------------CCccEEEEeccCceEEEeeeeccCCccceEEecCceEeecCCCceEEEEEecCCCcceeeec
Confidence 2378899999999999999997765 888883 344444 6788999999988888888888
Q ss_pred CCCccCCCCCCCCCcccceeeecc--ceEEeeCCC--ceecCCCccCCcccccccccccccCCCcceeeeecccccceec
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGP--RWLAYSGSP--VVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAA 249 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~--r~LAya~~~--~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~las 249 (832)
|.++ ++.+-..| ..||.+++. +..|+.|.-...| .+- .-+|++
T Consensus 358 H~g~------------V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~~--------------~l~-----~Hskei-- 404 (524)
T KOG0273|consen 358 HHGE------------VNALKWNPTGSLLASCSDDGTLKIWSMGQSNSVH--------------DLQ-----AHSKEI-- 404 (524)
T ss_pred ccCc------------eEEEEECCCCceEEEecCCCeeEeeecCCCcchh--------------hhh-----hhccce--
Confidence 8653 23343333 456665432 2223222100000 000 001111
Q ss_pred eeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE
Q 003310 250 GIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCF 329 (832)
Q Consensus 250 Gi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaF 329 (832)
|.....|++... .++ ..+..++++..+++|++||+..+.+++.|..|+.||.+|+|
T Consensus 405 ---------------~t~~wsp~g~v~---~n~------~~~~~l~sas~dstV~lwdv~~gv~i~~f~kH~~pVysvaf 460 (524)
T KOG0273|consen 405 ---------------YTIKWSPTGPVT---SNP------NMNLMLASASFDSTVKLWDVESGVPIHTLMKHQEPVYSVAF 460 (524)
T ss_pred ---------------eeEeecCCCCcc---CCC------cCCceEEEeecCCeEEEEEccCCceeEeeccCCCceEEEEe
Confidence 111223332110 011 11234567788999999999999999999999999999999
Q ss_pred cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 330 DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 330 SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
||+|+|||+|+.||. |+||+++++ .+++-.+|. .-|..++|+-+|..|..+-+|+.+.+-|+
T Consensus 461 S~~g~ylAsGs~dg~-V~iws~~~~---------------~l~~s~~~~--~~Ifel~Wn~~G~kl~~~~sd~~vcvldl 522 (524)
T KOG0273|consen 461 SPNGRYLASGSLDGC-VHIWSTKTG---------------KLVKSYQGT--GGIFELCWNAAGDKLGACASDGSVCVLDL 522 (524)
T ss_pred cCCCcEEEecCCCCe-eEeccccch---------------heeEeecCC--CeEEEEEEcCCCCEEEEEecCCCceEEEe
Confidence 999999999999997 899999886 455555553 34999999999999999999999999887
Q ss_pred C
Q 003310 410 N 410 (832)
Q Consensus 410 ~ 410 (832)
.
T Consensus 523 r 523 (524)
T KOG0273|consen 523 R 523 (524)
T ss_pred c
Confidence 4
No 18
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.77 E-value=1.1e-16 Score=195.85 Aligned_cols=236 Identities=14% Similarity=0.193 Sum_probs=162.6
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..++..++++.++|||+.. +.....+..|.+.|..+.+.|.. ..+|+.++.
T Consensus 546 ~~las~~~Dg~v~lWd~~~-~~~~~~~~~H~~~V~~l~~~p~~-------------~~~L~Sgs~--------------- 596 (793)
T PLN00181 546 SQVASSNFEGVVQVWDVAR-SQLVTEMKEHEKRVWSIDYSSAD-------------PTLLASGSD--------------- 596 (793)
T ss_pred CEEEEEeCCCeEEEEECCC-CeEEEEecCCCCCEEEEEEcCCC-------------CCEEEEEcC---------------
Confidence 3456667777899999986 44555566799999999997521 023443331
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEc---CCEEEE-EeCCEEEEEECCCCc-eEEEEe
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS---SRVVAI-CQAAQVHCFDAATLE-IEYAIL 172 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S---~r~LAV-a~~~~I~vwDl~t~~-~~~tl~ 172 (832)
+++|++||+++++++.++.....|.++.|+ +++||+ +.++.|++||+.+.+ .+.++.
T Consensus 597 ------------------Dg~v~iWd~~~~~~~~~~~~~~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~~~~~~~ 658 (793)
T PLN00181 597 ------------------DGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMI 658 (793)
T ss_pred ------------------CCEEEEEECCCCcEEEEEecCCCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCccceEec
Confidence 278999999999999999988899999994 467776 578899999998765 344444
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.|.. ++ .+++|.. +.
T Consensus 659 ~h~~---------------~V----~~v~f~~----------------------------~~------------------ 673 (793)
T PLN00181 659 GHSK---------------TV----SYVRFVD----------------------------SS------------------ 673 (793)
T ss_pred CCCC---------------CE----EEEEEeC----------------------------CC------------------
Confidence 3321 01 1122210 00
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC------CcEEEEeccCCCCeEE
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS------KNVIAQFRAHKSPISA 326 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s------~~~l~~~~aH~~pIs~ 326 (832)
.++++..||+|+|||+.. .+++..|.+|...+.+
T Consensus 674 ----------------------------------------~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~ 713 (793)
T PLN00181 674 ----------------------------------------TLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNF 713 (793)
T ss_pred ----------------------------------------EEEEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeE
Confidence 123345788999999974 3678899999999999
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEE
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHL 406 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhI 406 (832)
++|+|+|.+||+|+.||+ |+||+......-..-... ....+..+.-......|.+++|+|+++.|++++.||+|+|
T Consensus 714 v~~s~~~~~lasgs~D~~-v~iw~~~~~~~~~s~~~~---~~~~~~~~~~~~~~~~V~~v~ws~~~~~lva~~~dG~I~i 789 (793)
T PLN00181 714 VGLSVSDGYIATGSETNE-VFVYHKAFPMPVLSYKFK---TIDPVSGLEVDDASQFISSVCWRGQSSTLVAANSTGNIKI 789 (793)
T ss_pred EEEcCCCCEEEEEeCCCE-EEEEECCCCCceEEEecc---cCCcccccccCCCCcEEEEEEEcCCCCeEEEecCCCcEEE
Confidence 999999999999999997 899997643100000000 0000000000011234999999999999999999999999
Q ss_pred Eec
Q 003310 407 FAI 409 (832)
Q Consensus 407 wdl 409 (832)
|++
T Consensus 790 ~~~ 792 (793)
T PLN00181 790 LEM 792 (793)
T ss_pred Eec
Confidence 986
No 19
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.77 E-value=1.3e-17 Score=178.55 Aligned_cols=239 Identities=12% Similarity=0.191 Sum_probs=178.0
Q ss_pred CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
.++|+.+...- +++||.+..-.+...+.+|+-.|.++.++|.+ ..|+-|+
T Consensus 162 Gk~l~tcSsDl~~~LWd~~~~~~c~ks~~gh~h~vS~V~f~P~g--------------d~ilS~s--------------- 212 (406)
T KOG0295|consen 162 GKYLATCSSDLSAKLWDFDTFFRCIKSLIGHEHGVSSVFFLPLG--------------DHILSCS--------------- 212 (406)
T ss_pred ccEEEecCCccchhheeHHHHHHHHHHhcCcccceeeEEEEecC--------------Ceeeecc---------------
Confidence 56777776665 89999987656777777899999999999852 1233232
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEEE-eCCEEEEEECCCCceEEEEe
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAIC-QAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAVa-~~~~I~vwDl~t~~~~~tl~ 172 (832)
.+.+|+.|++.+|.|++++.- +..|..|+.+ +.++|.+ .+.++++|-+.+++|+..+.
T Consensus 213 ------------------rD~tik~We~~tg~cv~t~~~h~ewvr~v~v~~DGti~As~s~dqtl~vW~~~t~~~k~~lR 274 (406)
T KOG0295|consen 213 ------------------RDNTIKAWECDTGYCVKTFPGHSEWVRMVRVNQDGTIIASCSNDQTLRVWVVATKQCKAELR 274 (406)
T ss_pred ------------------cccceeEEecccceeEEeccCchHhEEEEEecCCeeEEEecCCCceEEEEEeccchhhhhhh
Confidence 237899999999999999986 5689999997 5777774 57799999999998888887
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.|..+. . .+|++.......+.++
T Consensus 275 ~hEh~v------------E-------ci~wap~~~~~~i~~a-------------------------------------- 297 (406)
T KOG0295|consen 275 EHEHPV------------E-------CIAWAPESSYPSISEA-------------------------------------- 297 (406)
T ss_pred ccccce------------E-------EEEecccccCcchhhc--------------------------------------
Confidence 764321 1 2333311000000000
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS 332 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd 332 (832)
.+.+ +|. ..+.++..|++|++||+.++.++.+|.+|...|..++|+|.
T Consensus 298 -----------------t~~~-------------~~~--~~l~s~SrDktIk~wdv~tg~cL~tL~ghdnwVr~~af~p~ 345 (406)
T KOG0295|consen 298 -----------------TGST-------------NGG--QVLGSGSRDKTIKIWDVSTGMCLFTLVGHDNWVRGVAFSPG 345 (406)
T ss_pred -----------------cCCC-------------CCc--cEEEeecccceEEEEeccCCeEEEEEecccceeeeeEEcCC
Confidence 0000 000 01234578999999999999999999999999999999999
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
|+||+++.+|++ +||||+... +++..+. .+...+.++.|..+.-++++|+-|.|+++|..
T Consensus 346 Gkyi~ScaDDkt-lrvwdl~~~--------------~cmk~~~--ah~hfvt~lDfh~~~p~VvTGsVdqt~KvwEc 405 (406)
T KOG0295|consen 346 GKYILSCADDKT-LRVWDLKNL--------------QCMKTLE--AHEHFVTSLDFHKTAPYVVTGSVDQTVKVWEC 405 (406)
T ss_pred CeEEEEEecCCc-EEEEEeccc--------------eeeeccC--CCcceeEEEecCCCCceEEeccccceeeeeec
Confidence 999999999998 899999986 5555554 22345999999999999999999999999964
No 20
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.76 E-value=9e-17 Score=165.09 Aligned_cols=230 Identities=15% Similarity=0.223 Sum_probs=157.3
Q ss_pred CCCEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEEEeCCEEEEEECCCCc--eEEEEecCCCccCCCCCCCCCcc
Q 003310 115 VPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAICQAAQVHCFDAATLE--IEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAVa~~~~I~vwDl~t~~--~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
++.|||||-+.||.|..+|++ .+.|.++.++ ++.||++....|++||+++.. .+.++..+..
T Consensus 18 YDhTIRfWqa~tG~C~rTiqh~dsqVNrLeiTpdk~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~k------------- 84 (311)
T KOG0315|consen 18 YDHTIRFWQALTGICSRTIQHPDSQVNRLEITPDKKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTK------------- 84 (311)
T ss_pred CcceeeeeehhcCeEEEEEecCccceeeEEEcCCcchhhhccCCeeEEEEccCCCCCceeEEeccCC-------------
Confidence 558999999999999999999 5799999996 578999989999999999865 4667766522
Q ss_pred cceeeec----cceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccc
Q 003310 190 YGPLAVG----PRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQY 265 (832)
Q Consensus 190 ~~p~Alg----~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y 265 (832)
|.+++| .||+...++. .++.-+-+.+ -...+.
T Consensus 85 -NVtaVgF~~dgrWMyTgseD---------------------------gt~kIWdlR~----------------~~~qR~ 120 (311)
T KOG0315|consen 85 -NVTAVGFQCDGRWMYTGSED---------------------------GTVKIWDLRS----------------LSCQRN 120 (311)
T ss_pred -ceEEEEEeecCeEEEecCCC---------------------------ceEEEEeccC----------------cccchh
Confidence 223333 4887665422 1111111110 000000
Q ss_pred cccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCC
Q 003310 266 CSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGH 344 (832)
Q Consensus 266 ~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt 344 (832)
+..-.|. +.+ ...|. -+++.+++.+|.|+|||+.+..+...+- ....+|.+|+..|||++|+.+-..|+
T Consensus 121 ~~~~spV--n~v-vlhpn-------QteLis~dqsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~ 190 (311)
T KOG0315|consen 121 YQHNSPV--NTV-VLHPN-------QTELISGDQSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGN 190 (311)
T ss_pred ccCCCCc--ceE-EecCC-------cceEEeecCCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCcc
Confidence 0000000 000 00111 1355678899999999999887766654 44568999999999999999999998
Q ss_pred EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC-CCceeeccCC
Q 003310 345 NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL-GGSVNFQPTD 422 (832)
Q Consensus 345 ~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~-g~~~~~~~H~ 422 (832)
..+|++..+..-+ ....+.+++ .+ ..-|..+-||||+++||++|+|.|++||.++.+ +.+..+++|.
T Consensus 191 -cyvW~l~~~~~~s--------~l~P~~k~~-ah-~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~gh~ 258 (311)
T KOG0315|consen 191 -CYVWRLLNHQTAS--------ELEPVHKFQ-AH-NGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLTGHQ 258 (311)
T ss_pred -EEEEEccCCCccc--------cceEhhhee-cc-cceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEeecCC
Confidence 7899998752111 123333332 12 223889999999999999999999999999988 6777778884
No 21
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.75 E-value=1.5e-16 Score=183.13 Aligned_cols=192 Identities=19% Similarity=0.300 Sum_probs=150.9
Q ss_pred CCCEEEEEECCCCc--EEEEE-eCCCCEEEEEEc--CCEEEE-EeCCEEEEEEC-CCCceEEEEecCCCccCCCCCCCCC
Q 003310 115 VPTVVHFYSLRSQS--YVHML-KFRSPIYSVRCS--SRVVAI-CQAAQVHCFDA-ATLEIEYAILTNPIVMGHPSAGGIG 187 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~--~V~tL-~f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl-~t~~~~~tl~t~~~~~~~p~~~~~~ 187 (832)
.++++++|++.+.+ ..+++ .+...|.+++|+ +++++. +.|.+|+|||+ ..+.+++++.+|+...
T Consensus 179 ~~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v--------- 249 (456)
T KOG0266|consen 179 SDGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYV--------- 249 (456)
T ss_pred CCCcEEEeecccccchhhccccccccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEecCCCCce---------
Confidence 45889999998777 66666 356789999997 456665 67889999999 5568899999885421
Q ss_pred cccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccc
Q 003310 188 IGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCS 267 (832)
Q Consensus 188 ~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~ 267 (832)
. -++|.. + |.
T Consensus 250 -----~-----~~~f~p-------------~--------------g~--------------------------------- 259 (456)
T KOG0266|consen 250 -----T-----SVAFSP-------------D--------------GN--------------------------------- 259 (456)
T ss_pred -----E-----EEEecC-------------C--------------CC---------------------------------
Confidence 1 122321 0 11
Q ss_pred cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEE
Q 003310 268 EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNIN 347 (832)
Q Consensus 268 ~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~ 347 (832)
.++++..|++|+|||+.+++++..|++|..+|++++|++||.+|+++|.||. |+
T Consensus 260 -------------------------~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d~~-i~ 313 (456)
T KOG0266|consen 260 -------------------------LLVSGSDDGTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSASYDGT-IR 313 (456)
T ss_pred -------------------------EEEEecCCCcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEcCCCcc-EE
Confidence 1234567899999999999999999999999999999999999999999987 99
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccc-cEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA-VIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a-~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
|||+.++ ...++..+. +.... .+..++|+|++++|++++.|+++++|++........+.+|...
T Consensus 314 vwd~~~~------------~~~~~~~~~-~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~~~~~~~~~~~~~ 378 (456)
T KOG0266|consen 314 VWDLETG------------SKLCLKLLS-GAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRSGKSVGTYTGHSNL 378 (456)
T ss_pred EEECCCC------------ceeeeeccc-CCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccCCcceeeecccCCc
Confidence 9999987 101233443 33333 6899999999999999999999999999998888899888553
No 22
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.75 E-value=2.6e-16 Score=181.19 Aligned_cols=236 Identities=17% Similarity=0.228 Sum_probs=172.3
Q ss_pred EEEEEecCCeEEEEeccCC-CeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccc
Q 003310 20 VLLLGYRSGFQVWDVEEAD-NVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATA 98 (832)
Q Consensus 20 vLl~Gy~~G~qVWdv~~~~-~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~ 98 (832)
+.....+.-+.+|+..... +....+..|.-.|+.+++.|++. .+++.+
T Consensus 174 l~~~~~~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~d~~----------------~l~s~s--------------- 222 (456)
T KOG0266|consen 174 LAAASSDGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSPDGS----------------YLLSGS--------------- 222 (456)
T ss_pred EEEccCCCcEEEeecccccchhhccccccccceeeeEECCCCc----------------EEEEec---------------
Confidence 3333456668899985533 13333366888889888887531 123321
Q ss_pred cCCCCCCCCCCCCCCCCCCEEEEEEC-CCCcEEEEEeC-CCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEec
Q 003310 99 CNGTSANYHDLGNGSSVPTVVHFYSL-RSQSYVHMLKF-RSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 99 ~~g~~~~~h~~g~~~~~~~tVrlWDL-~Tg~~V~tL~f-~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t 173 (832)
.+.+||+||+ ..+.++++|+- ...|++++|+ +++|+. +.|++|+|||+.+++++.++.+
T Consensus 223 ----------------~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~ 286 (456)
T KOG0266|consen 223 ----------------DDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKG 286 (456)
T ss_pred ----------------CCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeec
Confidence 2389999999 55689999974 7799999997 456665 6789999999999999999988
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
|... +. -+++.. +|.
T Consensus 287 hs~~---------------is----~~~f~~---------------------------d~~------------------- 301 (456)
T KOG0266|consen 287 HSDG---------------IS----GLAFSP---------------------------DGN------------------- 301 (456)
T ss_pred cCCc---------------eE----EEEECC---------------------------CCC-------------------
Confidence 8431 10 122221 111
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCc--EEEEeccCCCC--eEEEEE
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKN--VIAQFRAHKSP--ISALCF 329 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~--~l~~~~aH~~p--Is~LaF 329 (832)
.++++..||.|+|||+.++. ++..+..+..+ +++++|
T Consensus 302 ---------------------------------------~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~f 342 (456)
T KOG0266|consen 302 ---------------------------------------LLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQF 342 (456)
T ss_pred ---------------------------------------EEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEE
Confidence 12234568999999999999 67888877665 999999
Q ss_pred cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc--cEEEEEEccCCCEEEEEeCCCcEEEE
Q 003310 330 DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA--VIQDISFSDDSNWIMISSSRGTSHLF 407 (832)
Q Consensus 330 SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a--~I~~IaFSpDg~~LAsgS~DgTVhIw 407 (832)
+|+|++|+++..|++ +++||+..+ ..+ ...+++... .+.+..++++++++.+++.|++|++|
T Consensus 343 sp~~~~ll~~~~d~~-~~~w~l~~~--------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~sg~~d~~v~~~ 406 (456)
T KOG0266|consen 343 SPNGKYLLSASLDRT-LKLWDLRSG--------------KSV-GTYTGHSNLVRCIFSPTLSTGGKLIYSGSEDGSVYVW 406 (456)
T ss_pred CCCCcEEEEecCCCe-EEEEEccCC--------------cce-eeecccCCcceeEecccccCCCCeEEEEeCCceEEEE
Confidence 999999999999987 899999876 111 112344432 46677779999999999999999999
Q ss_pred ecCCCCCceeeccCC
Q 003310 408 AINPLGGSVNFQPTD 422 (832)
Q Consensus 408 dl~~~g~~~~~~~H~ 422 (832)
++.+......+.+|.
T Consensus 407 ~~~s~~~~~~l~~h~ 421 (456)
T KOG0266|consen 407 DSSSGGILQRLEGHS 421 (456)
T ss_pred eCCccchhhhhcCCC
Confidence 999977777888883
No 23
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.74 E-value=6.1e-17 Score=173.38 Aligned_cols=236 Identities=15% Similarity=0.265 Sum_probs=176.2
Q ss_pred CcEEEEEe-cCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGY-RSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy-~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+++++. +..+++||..+ +++.+.+..|...|..+.+-.. .-+||-|+. |
T Consensus 120 ~~~v~~as~d~tikv~D~~t-g~~e~~LrGHt~sv~di~~~a~--------------Gk~l~tcSs----------D--- 171 (406)
T KOG0295|consen 120 EALVVSASEDATIKVFDTET-GELERSLRGHTDSVFDISFDAS--------------GKYLATCSS----------D--- 171 (406)
T ss_pred ceEEEEecCCceEEEEEccc-hhhhhhhhccccceeEEEEecC--------------ccEEEecCC----------c---
Confidence 44555554 44599999987 6676677777766777766421 124444442 1
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCC-CcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRS-QSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~T-g~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl 171 (832)
-.+++||..+ .+|++++. +...|.+|.|- +..|+. +-|++|+.||..|+.|++++
T Consensus 172 --------------------l~~~LWd~~~~~~c~ks~~gh~h~vS~V~f~P~gd~ilS~srD~tik~We~~tg~cv~t~ 231 (406)
T KOG0295|consen 172 --------------------LSAKLWDFDTFFRCIKSLIGHEHGVSSVFFLPLGDHILSCSRDNTIKAWECDTGYCVKTF 231 (406)
T ss_pred --------------------cchhheeHHHHHHHHHHhcCcccceeeEEEEecCCeeeecccccceeEEecccceeEEec
Confidence 3389999987 56666654 45678888884 466666 56889999999999999998
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
..|+.- +-.++ -.
T Consensus 232 ~~h~ew------------vr~v~-------v~------------------------------------------------ 244 (406)
T KOG0295|consen 232 PGHSEW------------VRMVR-------VN------------------------------------------------ 244 (406)
T ss_pred cCchHh------------EEEEE-------ec------------------------------------------------
Confidence 776420 00011 00
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP 331 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP 331 (832)
.+|++ +++.+.+.+|++|-+.++++...|+.|..+|-|++|-|
T Consensus 245 ----------------------------------~DGti---~As~s~dqtl~vW~~~t~~~k~~lR~hEh~vEci~wap 287 (406)
T KOG0295|consen 245 ----------------------------------QDGTI---IASCSNDQTLRVWVVATKQCKAELREHEHPVECIAWAP 287 (406)
T ss_pred ----------------------------------CCeeE---EEecCCCceEEEEEeccchhhhhhhccccceEEEEecc
Confidence 00111 23456778999999999999999999999999999976
Q ss_pred C---------------CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEE
Q 003310 332 S---------------GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIM 396 (832)
Q Consensus 332 d---------------G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LA 396 (832)
. |.+|+++|.|++ |++||+.++ .+|++| .|+.+ .|.+++|+|-|+||+
T Consensus 288 ~~~~~~i~~at~~~~~~~~l~s~SrDkt-Ik~wdv~tg--------------~cL~tL-~ghdn-wVr~~af~p~Gkyi~ 350 (406)
T KOG0295|consen 288 ESSYPSISEATGSTNGGQVLGSGSRDKT-IKIWDVSTG--------------MCLFTL-VGHDN-WVRGVAFSPGGKYIL 350 (406)
T ss_pred cccCcchhhccCCCCCccEEEeecccce-EEEEeccCC--------------eEEEEE-ecccc-eeeeeEEcCCCeEEE
Confidence 3 358999999987 999999987 788998 46654 599999999999999
Q ss_pred EEeCCCcEEEEecCCCCCceeeccCC
Q 003310 397 ISSSRGTSHLFAINPLGGSVNFQPTD 422 (832)
Q Consensus 397 sgS~DgTVhIwdl~~~g~~~~~~~H~ 422 (832)
++.+|+|++|||++...+..++..|.
T Consensus 351 ScaDDktlrvwdl~~~~cmk~~~ah~ 376 (406)
T KOG0295|consen 351 SCADDKTLRVWDLKNLQCMKTLEAHE 376 (406)
T ss_pred EEecCCcEEEEEeccceeeeccCCCc
Confidence 99999999999999998888887773
No 24
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.73 E-value=8.6e-17 Score=185.70 Aligned_cols=116 Identities=13% Similarity=0.268 Sum_probs=101.5
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
|++++.|++.++|.....++++.|.+|.+.|.|++|.|+..|+||||.|.+ +|+||+.+| ..++++
T Consensus 508 Fatas~D~tArLWs~d~~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~t-VRlWDv~~G------------~~VRiF- 573 (707)
T KOG0263|consen 508 FATASHDQTARLWSTDHNKPLRIFAGHLSDVDCVSFHPNSNYVATGSSDRT-VRLWDVSTG------------NSVRIF- 573 (707)
T ss_pred EEecCCCceeeeeecccCCchhhhcccccccceEEECCcccccccCCCCce-EEEEEcCCC------------cEEEEe-
Confidence 456678899999999999999999999999999999999999999999976 999999987 223333
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
+||+ +.|.+++|||+|+|||+|+.||.|+|||+..+.-...+.+|++...
T Consensus 574 --~GH~-~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti~ 623 (707)
T KOG0263|consen 574 --TGHK-GPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGHTGTIY 623 (707)
T ss_pred --cCCC-CceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcccCcee
Confidence 6764 5799999999999999999999999999988777788899966544
No 25
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.71 E-value=7.9e-15 Score=155.33 Aligned_cols=238 Identities=14% Similarity=0.231 Sum_probs=161.6
Q ss_pred CCcEEEEEecC-CeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 17 TRRVLLLGYRS-GFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 17 ~~~vLl~Gy~~-G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
...+|+.+.++ .+||||+.. +.....+..+.=.|..+++...+ --++.+..
T Consensus 25 ~G~~litss~dDsl~LYd~~~-g~~~~ti~skkyG~~~~~Fth~~---------------~~~i~sSt------------ 76 (311)
T KOG1446|consen 25 DGLLLITSSEDDSLRLYDSLS-GKQVKTINSKKYGVDLACFTHHS---------------NTVIHSST------------ 76 (311)
T ss_pred CCCEEEEecCCCeEEEEEcCC-CceeeEeecccccccEEEEecCC---------------ceEEEccC------------
Confidence 45666665555 799999987 44544554444455566665321 12233311
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEEE
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~tl 171 (832)
-.+.+||.-||.++++++.+.. ...|.+|+.++ .+|.++.|++|++||++..+|...+
T Consensus 77 ------------------k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~d~FlS~S~D~tvrLWDlR~~~cqg~l 138 (311)
T KOG1446|consen 77 ------------------KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKDDTFLSSSLDKTVRLWDLRVKKCQGLL 138 (311)
T ss_pred ------------------CCCCceEEEEeecCceEEEcCCCCceEEEEEecCCCCeEEecccCCeEEeeEecCCCCceEE
Confidence 0237899999999999999975 67899999985 4677789999999999988776555
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
.-.+ .|+ .||.+ + | ++
T Consensus 139 ~~~~---------------~pi------~AfDp-------------~--------------G-Li--------------- 154 (311)
T KOG1446|consen 139 NLSG---------------RPI------AAFDP-------------E--------------G-LI--------------- 154 (311)
T ss_pred ecCC---------------Ccc------eeECC-------------C--------------C-cE---------------
Confidence 3221 122 24541 1 0 00
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC--CcEEEEecc---CCCCeEE
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS--KNVIAQFRA---HKSPISA 326 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s--~~~l~~~~a---H~~pIs~ 326 (832)
|+.+.....|+|||+++ +.+..+|.- -...++.
T Consensus 155 ------------------------------------------fA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~ 192 (311)
T KOG1446|consen 155 ------------------------------------------FALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTD 192 (311)
T ss_pred ------------------------------------------EEEecCCCeEEEEEecccCCCCceeEccCCCCccceee
Confidence 11122233799999986 456666653 3567899
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEE
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHL 406 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhI 406 (832)
|.|||||++|+-+...+. +.|.|...| .+.+-+.+++....- -.+.+|+|||++|.+|+.||+|||
T Consensus 193 l~FS~dGK~iLlsT~~s~-~~~lDAf~G------------~~~~tfs~~~~~~~~-~~~a~ftPds~Fvl~gs~dg~i~v 258 (311)
T KOG1446|consen 193 LEFSPDGKSILLSTNASF-IYLLDAFDG------------TVKSTFSGYPNAGNL-PLSATFTPDSKFVLSGSDDGTIHV 258 (311)
T ss_pred eEEcCCCCEEEEEeCCCc-EEEEEccCC------------cEeeeEeeccCCCCc-ceeEEECCCCcEEEEecCCCcEEE
Confidence 999999999988877765 899999887 233333333322111 257899999999999999999999
Q ss_pred EecCCCCCceeecc
Q 003310 407 FAINPLGGSVNFQP 420 (832)
Q Consensus 407 wdl~~~g~~~~~~~ 420 (832)
|+++++.....+++
T Consensus 259 w~~~tg~~v~~~~~ 272 (311)
T KOG1446|consen 259 WNLETGKKVAVLRG 272 (311)
T ss_pred EEcCCCcEeeEecC
Confidence 99988766667766
No 26
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.71 E-value=3e-15 Score=156.93 Aligned_cols=222 Identities=13% Similarity=0.167 Sum_probs=169.2
Q ss_pred CcEEEEEecCCeEEEEeccC--C---CeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCcccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEA--D---NVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQ 92 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~--~---~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~ 92 (832)
..|-+.|.+|-..||++... . .+...+..|.|=+.+++++++.+ | .+++
T Consensus 110 ~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD~~---------------i--lT~S--------- 163 (343)
T KOG0286|consen 110 NFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDDNH---------------I--LTGS--------- 163 (343)
T ss_pred CeEEecCcCceeEEEecccccccccceeeeeecCccceeEEEEEcCCCc---------------e--EecC---------
Confidence 44555677888999999854 1 23444567888888888886421 1 1211
Q ss_pred CCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCce
Q 003310 93 DGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEI 167 (832)
Q Consensus 93 Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~ 167 (832)
-+.|.-+||+++|+.+..+.- ..-|.++.+++ +.++. +.|+..++||++.+.+
T Consensus 164 ----------------------GD~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c 221 (343)
T KOG0286|consen 164 ----------------------GDMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQC 221 (343)
T ss_pred ----------------------CCceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCcce
Confidence 127899999999999999875 56899999964 66665 6799999999999999
Q ss_pred EEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccce
Q 003310 168 EYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL 247 (832)
Q Consensus 168 ~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l 247 (832)
.+++.+|.. .+|.+ .|-
T Consensus 222 ~qtF~ghes------------DINsv-------~ff-------------------------------------------- 238 (343)
T KOG0286|consen 222 VQTFEGHES------------DINSV-------RFF-------------------------------------------- 238 (343)
T ss_pred eEeeccccc------------ccceE-------EEc--------------------------------------------
Confidence 999988732 11222 121
Q ss_pred eceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccC--CCCeE
Q 003310 248 AAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAH--KSPIS 325 (832)
Q Consensus 248 asGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH--~~pIs 325 (832)
|++ -.|+++.+|++.++||++..+.++.|... ..+|+
T Consensus 239 -----------------------P~G------------------~afatGSDD~tcRlyDlRaD~~~a~ys~~~~~~git 277 (343)
T KOG0286|consen 239 -----------------------PSG------------------DAFATGSDDATCRLYDLRADQELAVYSHDSIICGIT 277 (343)
T ss_pred -----------------------cCC------------------CeeeecCCCceeEEEeecCCcEEeeeccCcccCCce
Confidence 111 02456788999999999999999998733 36899
Q ss_pred EEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEE
Q 003310 326 ALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSH 405 (832)
Q Consensus 326 ~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVh 405 (832)
+++||-+|++|..|-.|.+ ++|||...+ ++.-.|. ||.+ +|.++..+|||.-|++||=|.+++
T Consensus 278 Sv~FS~SGRlLfagy~d~~-c~vWDtlk~--------------e~vg~L~-GHeN-RvScl~~s~DG~av~TgSWDs~lr 340 (343)
T KOG0286|consen 278 SVAFSKSGRLLFAGYDDFT-CNVWDTLKG--------------ERVGVLA-GHEN-RVSCLGVSPDGMAVATGSWDSTLR 340 (343)
T ss_pred eEEEcccccEEEeeecCCc-eeEeecccc--------------ceEEEee-ccCC-eeEEEEECCCCcEEEecchhHhee
Confidence 9999999999999999987 899999876 4555553 6654 699999999999999999999999
Q ss_pred EEe
Q 003310 406 LFA 408 (832)
Q Consensus 406 Iwd 408 (832)
||.
T Consensus 341 iW~ 343 (343)
T KOG0286|consen 341 IWA 343 (343)
T ss_pred ecC
Confidence 994
No 27
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.71 E-value=9.5e-17 Score=171.76 Aligned_cols=255 Identities=16% Similarity=0.238 Sum_probs=174.3
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCc----ccccC-CEEEEEeCCCCccCcccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDK----FAEVR-PLLVFCADGSRSCGTKVQ 92 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~----f~~~r-PLLavv~~g~~~g~~~~~ 92 (832)
..|+..++++.++|||+.. -++...+..|+|.|+-+.+-.+- ....++|. +...- |+=.+.+++...+
T Consensus 80 s~~aSGs~DG~VkiWnlsq-R~~~~~f~AH~G~V~Gi~v~~~~-~~tvgdDKtvK~wk~~~~p~~tilg~s~~~g----- 152 (433)
T KOG0268|consen 80 STVASGSCDGEVKIWNLSQ-RECIRTFKAHEGLVRGICVTQTS-FFTVGDDKTVKQWKIDGPPLHTILGKSVYLG----- 152 (433)
T ss_pred hhhhccccCceEEEEehhh-hhhhheeecccCceeeEEecccc-eEEecCCcceeeeeccCCcceeeeccccccc-----
Confidence 5677788888999999987 46777888899999999875421 11112221 11011 2211222211111
Q ss_pred CCcccccCCCCCCCCCCCCCCCCC--CEEEEEECCCCcEEEEEeCC-CCEEEEEEcC---CEEEEE-eCCEEEEEECCCC
Q 003310 93 DGLATACNGTSANYHDLGNGSSVP--TVVHFYSLRSQSYVHMLKFR-SPIYSVRCSS---RVVAIC-QAAQVHCFDAATL 165 (832)
Q Consensus 93 Dg~~~~~~g~~~~~h~~g~~~~~~--~tVrlWDL~Tg~~V~tL~f~-s~V~sV~~S~---r~LAVa-~~~~I~vwDl~t~ 165 (832)
-+|.-....+.. -.|.|||..-...+.++... ..|.+|.||+ .+||+| .|+.|.+||+++.
T Consensus 153 ------------Idh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~ 220 (433)
T KOG0268|consen 153 ------------IDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCASDRSIVLYDLRQA 220 (433)
T ss_pred ------------cccccccccccccCceeeecccccCCccceeecCCCceeEEecCCCcchheeeeccCCceEEEecccC
Confidence 111111111111 35899999988999999874 5799999985 678885 8899999999998
Q ss_pred ceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeeccccc
Q 003310 166 EIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSK 245 (832)
Q Consensus 166 ~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk 245 (832)
..++.+..- ...|.++++| =||
T Consensus 221 ~Pl~KVi~~-------------mRTN~IswnP--eaf------------------------------------------- 242 (433)
T KOG0268|consen 221 SPLKKVILT-------------MRTNTICWNP--EAF------------------------------------------- 242 (433)
T ss_pred Cccceeeee-------------ccccceecCc--ccc-------------------------------------------
Confidence 877666432 1122232221 011
Q ss_pred ceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCe
Q 003310 246 HLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPI 324 (832)
Q Consensus 246 ~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pI 324 (832)
.|+.++.|-.+..+|+... .++..++.|.+.|
T Consensus 243 -----------------------------------------------nF~~a~ED~nlY~~DmR~l~~p~~v~~dhvsAV 275 (433)
T KOG0268|consen 243 -----------------------------------------------NFVAANEDHNLYTYDMRNLSRPLNVHKDHVSAV 275 (433)
T ss_pred -----------------------------------------------ceeeccccccceehhhhhhcccchhhcccceeE
Confidence 1344567788999998864 5788889999999
Q ss_pred EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcE
Q 003310 325 SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTS 404 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTV 404 (832)
..+.|||-|+-++|||.|.+ ||||.+..+. -+.+|..+|- ..|.++.||.|++|+.+||+|+.|
T Consensus 276 ~dVdfsptG~EfvsgsyDks-IRIf~~~~~~------------SRdiYhtkRM---q~V~~Vk~S~Dskyi~SGSdd~nv 339 (433)
T KOG0268|consen 276 MDVDFSPTGQEFVSGSYDKS-IRIFPVNHGH------------SRDIYHTKRM---QHVFCVKYSMDSKYIISGSDDGNV 339 (433)
T ss_pred EEeccCCCcchhccccccce-EEEeecCCCc------------chhhhhHhhh---heeeEEEEeccccEEEecCCCcce
Confidence 99999999999999999976 9999988761 1334444332 249999999999999999999999
Q ss_pred EEEecCCC
Q 003310 405 HLFAINPL 412 (832)
Q Consensus 405 hIwdl~~~ 412 (832)
++|.-...
T Consensus 340 RlWka~As 347 (433)
T KOG0268|consen 340 RLWKAKAS 347 (433)
T ss_pred eeeecchh
Confidence 99987654
No 28
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.70 E-value=8.1e-15 Score=179.59 Aligned_cols=181 Identities=16% Similarity=0.183 Sum_probs=134.4
Q ss_pred CCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccc
Q 003310 116 PTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGY 190 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~ 190 (832)
+++|+|||+.+++.+..++. ...|++|+|++ .+|++ +.|++|++||+.+++++.++..+..
T Consensus 554 Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~-------------- 619 (793)
T PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKAN-------------- 619 (793)
T ss_pred CCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEecCCC--------------
Confidence 38899999999999998864 67899999973 56666 5688999999999888777754210
Q ss_pred ceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccccc
Q 003310 191 GPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFL 270 (832)
Q Consensus 191 ~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~ 270 (832)
.. .++|... +|.
T Consensus 620 -v~-----~v~~~~~--------------------------~g~------------------------------------ 631 (793)
T PLN00181 620 -IC-----CVQFPSE--------------------------SGR------------------------------------ 631 (793)
T ss_pred -eE-----EEEEeCC--------------------------CCC------------------------------------
Confidence 01 1122100 011
Q ss_pred CCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCc-EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEE
Q 003310 271 PDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKN-VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIF 349 (832)
Q Consensus 271 p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~-~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iw 349 (832)
.++++..+|.|++||+.+.+ ++..+.+|..+|.+++|. ++.+|+|++.||+ |+||
T Consensus 632 ----------------------~latgs~dg~I~iwD~~~~~~~~~~~~~h~~~V~~v~f~-~~~~lvs~s~D~~-ikiW 687 (793)
T PLN00181 632 ----------------------SLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNT-LKLW 687 (793)
T ss_pred ----------------------EEEEEeCCCeEEEEECCCCCccceEecCCCCCEEEEEEe-CCCEEEEEECCCE-EEEE
Confidence 12334578899999998765 677889999999999997 7889999999997 9999
Q ss_pred eCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 350 KIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 350 di~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
|+.....+. ....+..+ .|+. ..+.+++|+|++++||+|+.|++|+||+....
T Consensus 688 d~~~~~~~~--------~~~~l~~~-~gh~-~~i~~v~~s~~~~~lasgs~D~~v~iw~~~~~ 740 (793)
T PLN00181 688 DLSMSISGI--------NETPLHSF-MGHT-NVKNFVGLSVSDGYIATGSETNEVFVYHKAFP 740 (793)
T ss_pred eCCCCcccc--------CCcceEEE-cCCC-CCeeEEEEcCCCCEEEEEeCCCEEEEEECCCC
Confidence 997541000 01234444 4554 35899999999999999999999999997654
No 29
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.70 E-value=9.9e-16 Score=160.99 Aligned_cols=245 Identities=13% Similarity=0.148 Sum_probs=173.4
Q ss_pred CcEE-EEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVL-LLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vL-l~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+| ..|+++-+-+|++..--+-.-.+..|.|.|--+++.++. ..|.-|+
T Consensus 59 gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~--------------s~i~S~g--------------- 109 (338)
T KOG0265|consen 59 GSCFASGGSDRAIVLWNVYGDCENFWVLKGHSGAVMELHGMRDG--------------SHILSCG--------------- 109 (338)
T ss_pred CCeEeecCCcceEEEEeccccccceeeeccccceeEeeeeccCC--------------CEEEEec---------------
Confidence 4444 558888999999865333345566899999988886532 1232222
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCC-EEEEEEc---CCEEEE-EeCCEEEEEECCCCceEEEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSP-IYSVRCS---SRVVAI-CQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~-V~sV~~S---~r~LAV-a~~~~I~vwDl~t~~~~~tl 171 (832)
.+++|+.||.++|++++.++.... |.++.-+ ..+|+. ..|+++++||+++.+.++++
T Consensus 110 ------------------tDk~v~~wD~~tG~~~rk~k~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k~~~~t~ 171 (338)
T KOG0265|consen 110 ------------------TDKTVRGWDAETGKRIRKHKGHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKKEAIKTF 171 (338)
T ss_pred ------------------CCceEEEEecccceeeehhccccceeeecCccccCCeEEEecCCCceEEEEeecccchhhcc
Confidence 238999999999999998886554 4444433 355665 46789999999987766655
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
..- |.-.|+ ++.. .+-+
T Consensus 172 ~~k---------------yqltAv-----~f~d--------------------------------------~s~q----- 188 (338)
T KOG0265|consen 172 ENK---------------YQLTAV-----GFKD--------------------------------------TSDQ----- 188 (338)
T ss_pred ccc---------------eeEEEE-----Eecc--------------------------------------cccc-----
Confidence 321 111121 1110 0000
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP 331 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP 331 (832)
..++.-|+.|++||+.....+.++++|..+|+.|..+|
T Consensus 189 ------------------------------------------v~sggIdn~ikvWd~r~~d~~~~lsGh~DtIt~lsls~ 226 (338)
T KOG0265|consen 189 ------------------------------------------VISGGIDNDIKVWDLRKNDGLYTLSGHADTITGLSLSR 226 (338)
T ss_pred ------------------------------------------eeeccccCceeeeccccCcceEEeecccCceeeEEecc
Confidence 11234567899999999999999999999999999999
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc--EEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 332 SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV--IQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~--I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
+|.+|.+-+-|.+ +++||+++..+. .+++..+..+.+.-. ...++|||+++.+.++|.|+.++|||.
T Consensus 227 ~gs~llsnsMd~t-vrvwd~rp~~p~----------~R~v~if~g~~hnfeknlL~cswsp~~~~i~ags~dr~vyvwd~ 295 (338)
T KOG0265|consen 227 YGSFLLSNSMDNT-VRVWDVRPFAPS----------QRCVKIFQGHIHNFEKNLLKCSWSPNGTKITAGSADRFVYVWDT 295 (338)
T ss_pred CCCccccccccce-EEEEEecccCCC----------CceEEEeecchhhhhhhcceeeccCCCCccccccccceEEEeec
Confidence 9999999999976 999999986322 244444433222222 568999999999999999999999999
Q ss_pred CCCCCceeeccCCCCc
Q 003310 410 NPLGGSVNFQPTDANF 425 (832)
Q Consensus 410 ~~~g~~~~~~~H~~~~ 425 (832)
...+-...+-+|....
T Consensus 296 ~~r~~lyklpGh~gsv 311 (338)
T KOG0265|consen 296 TSRRILYKLPGHYGSV 311 (338)
T ss_pred ccccEEEEcCCcceeE
Confidence 8877788888885543
No 30
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.70 E-value=6.2e-16 Score=179.18 Aligned_cols=238 Identities=16% Similarity=0.263 Sum_probs=172.1
Q ss_pred CCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 16 ATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 16 ~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
+.|..++.+..+| +|+||.-- +-+.+-+..||||||-+.|=|.. |+. |++|. |
T Consensus 19 P~rPwILtslHsG~IQlWDYRM-~tli~rFdeHdGpVRgv~FH~~q--------------plF--VSGGD--------D- 72 (1202)
T KOG0292|consen 19 PKRPWILTSLHSGVIQLWDYRM-GTLIDRFDEHDGPVRGVDFHPTQ--------------PLF--VSGGD--------D- 72 (1202)
T ss_pred CCCCEEEEeecCceeeeehhhh-hhHHhhhhccCCccceeeecCCC--------------CeE--EecCC--------c-
Confidence 5688888888777 89999976 44566667899999999987642 443 44431 2
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEE
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~t 170 (832)
-+|++|+.++.+|+-+|.. -+-|+.+.|.. -+|..+.|.+|+||+-.+.+|+-+
T Consensus 73 ----------------------ykIkVWnYk~rrclftL~GHlDYVRt~~FHheyPWIlSASDDQTIrIWNwqsr~~iav 130 (1202)
T KOG0292|consen 73 ----------------------YKIKVWNYKTRRCLFTLLGHLDYVRTVFFHHEYPWILSASDDQTIRIWNWQSRKCIAV 130 (1202)
T ss_pred ----------------------cEEEEEecccceehhhhccccceeEEeeccCCCceEEEccCCCeEEEEeccCCceEEE
Confidence 6899999999999999875 57899999974 455557788999999999999999
Q ss_pred EecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceece
Q 003310 171 ILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAG 250 (832)
Q Consensus 171 l~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasG 250 (832)
+.+|.-- .|+- .|- |. .+.|
T Consensus 131 ltGHnHY--------------VMcA-----qFh-------------pt--------------EDlI-------------- 150 (1202)
T KOG0292|consen 131 LTGHNHY--------------VMCA-----QFH-------------PT--------------EDLI-------------- 150 (1202)
T ss_pred EecCceE--------------EEee-----ccC-------------Cc--------------cceE--------------
Confidence 9988421 1110 011 10 0000
Q ss_pred eeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC--------------------
Q 003310 251 IVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS-------------------- 310 (832)
Q Consensus 251 i~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s-------------------- 310 (832)
++++-|.+|+|||+..
T Consensus 151 --------------------------------------------VSaSLDQTVRVWDisGLRkk~~~pg~~e~~~~~~~~ 186 (1202)
T KOG0292|consen 151 --------------------------------------------VSASLDQTVRVWDISGLRKKNKAPGSLEDQMRGQQG 186 (1202)
T ss_pred --------------------------------------------EEecccceEEEEeecchhccCCCCCCchhhhhcccc
Confidence 1111222333333211
Q ss_pred --------C-cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc
Q 003310 311 --------K-NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA 381 (832)
Q Consensus 311 --------~-~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a 381 (832)
. -+...+.+|...|+-++|.|.--+|++|++|. .|++|+.... ..+.+-++ |||.+
T Consensus 187 ~~dLfg~~DaVVK~VLEGHDRGVNwaAfhpTlpliVSG~DDR-qVKlWrmnet------------KaWEvDtc-rgH~n- 251 (1202)
T KOG0292|consen 187 NSDLFGQTDAVVKHVLEGHDRGVNWAAFHPTLPLIVSGADDR-QVKLWRMNET------------KAWEVDTC-RGHYN- 251 (1202)
T ss_pred chhhcCCcCeeeeeeecccccccceEEecCCcceEEecCCcc-eeeEEEeccc------------cceeehhh-hcccC-
Confidence 1 13356789999999999999999999999995 5999998754 23444444 78765
Q ss_pred cEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeecc
Q 003310 382 VIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~ 420 (832)
.|.++-|.|.-.+|.+.|.|++|+|||++...+..+|+-
T Consensus 252 nVssvlfhp~q~lIlSnsEDksirVwDm~kRt~v~tfrr 290 (1202)
T KOG0292|consen 252 NVSSVLFHPHQDLILSNSEDKSIRVWDMTKRTSVQTFRR 290 (1202)
T ss_pred CcceEEecCccceeEecCCCccEEEEecccccceeeeec
Confidence 599999999999999999999999999999888777743
No 31
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.70 E-value=4.8e-15 Score=154.65 Aligned_cols=224 Identities=12% Similarity=0.224 Sum_probs=157.8
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
...|..+.+..+++||++. ++....+-.|-.-|-+++|.|+.. - +|+++
T Consensus 76 ~~alS~swD~~lrlWDl~~-g~~t~~f~GH~~dVlsva~s~dn~-------------q---ivSGS-------------- 124 (315)
T KOG0279|consen 76 NFALSASWDGTLRLWDLAT-GESTRRFVGHTKDVLSVAFSTDNR-------------Q---IVSGS-------------- 124 (315)
T ss_pred ceEEeccccceEEEEEecC-CcEEEEEEecCCceEEEEecCCCc-------------e---eecCC--------------
Confidence 4556667777799999997 467777777988999999987421 1 24432
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEE-eC--CCCEEEEEEcCC----EEEE-EeCCEEEEEECCCCceEE
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHML-KF--RSPIYSVRCSSR----VVAI-CQAAQVHCFDAATLEIEY 169 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL-~f--~s~V~sV~~S~r----~LAV-a~~~~I~vwDl~t~~~~~ 169 (832)
-++|+++|+...+. ..++ .. +..|..|+|+++ +|+. +.|+++++||+++.+...
T Consensus 125 -----------------rDkTiklwnt~g~c-k~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~ 186 (315)
T KOG0279|consen 125 -----------------RDKTIKLWNTLGVC-KYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRT 186 (315)
T ss_pred -----------------CcceeeeeeecccE-EEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhh
Confidence 23899999997554 4444 33 678999999864 4444 678999999999998877
Q ss_pred EEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceec
Q 003310 170 AILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAA 249 (832)
Q Consensus 170 tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~las 249 (832)
++.+|.. + .+.++++ | +|..
T Consensus 187 ~~~gh~~---~---------v~t~~vS--------------------p--------------DGsl-------------- 206 (315)
T KOG0279|consen 187 TFIGHSG---Y---------VNTVTVS--------------------P--------------DGSL-------------- 206 (315)
T ss_pred ccccccc---c---------EEEEEEC--------------------C--------------CCCE--------------
Confidence 7766521 1 1112211 1 1111
Q ss_pred eeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE
Q 003310 250 GIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCF 329 (832)
Q Consensus 250 Gi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaF 329 (832)
.++++.+|.+.+||+..++.+..|. |..+|.+|+|
T Consensus 207 --------------------------------------------casGgkdg~~~LwdL~~~k~lysl~-a~~~v~sl~f 241 (315)
T KOG0279|consen 207 --------------------------------------------CASGGKDGEAMLWDLNEGKNLYSLE-AFDIVNSLCF 241 (315)
T ss_pred --------------------------------------------EecCCCCceEEEEEccCCceeEecc-CCCeEeeEEe
Confidence 2345788999999999999977765 5689999999
Q ss_pred cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe---ccC--c--cccEEEEEEccCCCEEEEEeCCC
Q 003310 330 DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ---RGL--T--NAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 330 SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~---rG~--t--~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
+|+-..|+.|-.. .|+|||+.+.. .+.+|+ -|. . ...-.++|||+||+.|.++-.|+
T Consensus 242 spnrywL~~at~~--sIkIwdl~~~~--------------~v~~l~~d~~g~s~~~~~~~clslaws~dG~tLf~g~td~ 305 (315)
T KOG0279|consen 242 SPNRYWLCAATAT--SIKIWDLESKA--------------VVEELKLDGIGPSSKAGDPICLSLAWSADGQTLFAGYTDN 305 (315)
T ss_pred cCCceeEeeccCC--ceEEEeccchh--------------hhhhccccccccccccCCcEEEEEEEcCCCcEEEeeecCC
Confidence 9999888877644 49999998761 111111 011 0 11235689999999999999999
Q ss_pred cEEEEecCC
Q 003310 403 TSHLFAINP 411 (832)
Q Consensus 403 TVhIwdl~~ 411 (832)
.|++|.+..
T Consensus 306 ~irv~qv~~ 314 (315)
T KOG0279|consen 306 VIRVWQVAK 314 (315)
T ss_pred cEEEEEeec
Confidence 999999864
No 32
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.70 E-value=2.2e-16 Score=162.24 Aligned_cols=184 Identities=22% Similarity=0.336 Sum_probs=146.2
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..+|..|.+.-++|||++......+-++.|-|.||.+-++-. |. .+|- ..
T Consensus 113 ~~lltgg~ekllrvfdln~p~App~E~~ghtg~Ir~v~wc~e--------D~-----~iLS-Sa---------------- 162 (334)
T KOG0278|consen 113 NYLLTGGQEKLLRVFDLNRPKAPPKEISGHTGGIRTVLWCHE--------DK-----CILS-SA---------------- 162 (334)
T ss_pred hhhhccchHHHhhhhhccCCCCCchhhcCCCCcceeEEEecc--------Cc-----eEEe-ec----------------
Confidence 677888888889999999877777888889999999888732 11 1221 11
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEeCCEEEEEECCCCceEEEEecCC
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQAAQVHCFDAATLEIEYAILTNP 175 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~ 175 (832)
.+++||+||.+|++.+++|.|+++|.++.++ +++|.++..+.|.+||+.++..+.......
T Consensus 163 -----------------dd~tVRLWD~rTgt~v~sL~~~s~VtSlEvs~dG~ilTia~gssV~Fwdaksf~~lKs~k~P~ 225 (334)
T KOG0278|consen 163 -----------------DDKTVRLWDHRTGTEVQSLEFNSPVTSLEVSQDGRILTIAYGSSVKFWDAKSFGLLKSYKMPC 225 (334)
T ss_pred -----------------cCCceEEEEeccCcEEEEEecCCCCcceeeccCCCEEEEecCceeEEeccccccceeeccCcc
Confidence 2388999999999999999999999999996 689999999999999999998776654321
Q ss_pred CccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeecc
Q 003310 176 IVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLG 255 (832)
Q Consensus 176 ~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lG 255 (832)
++ +.++
T Consensus 226 nV-----------------------~SAS--------------------------------------------------- 231 (334)
T KOG0278|consen 226 NV-----------------------ESAS--------------------------------------------------- 231 (334)
T ss_pred cc-----------------------cccc---------------------------------------------------
Confidence 10 0010
Q ss_pred CccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEe-ccCCCCeEEEEEcCCCC
Q 003310 256 DLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQF-RAHKSPISALCFDPSGI 334 (832)
Q Consensus 256 d~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~-~aH~~pIs~LaFSPdG~ 334 (832)
+.|+. ..|+.++.++.++.||..++..+..+ ++|-+||.||.|+|||.
T Consensus 232 -------------L~P~k------------------~~fVaGged~~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSPdGE 280 (334)
T KOG0278|consen 232 -------------LHPKK------------------EFFVAGGEDFKVYKFDYNTGEEIGSYNKGHFGPVHCVRFSPDGE 280 (334)
T ss_pred -------------ccCCC------------------ceEEecCcceEEEEEeccCCceeeecccCCCCceEEEEECCCCc
Confidence 00110 12455678999999999999998887 89999999999999999
Q ss_pred EEEEEEcCCCEEEEEeCCCC
Q 003310 335 LLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 335 lLATaS~dGt~I~Iwdi~t~ 354 (832)
+.|+||.||+ ||||...++
T Consensus 281 ~yAsGSEDGT-irlWQt~~~ 299 (334)
T KOG0278|consen 281 LYASGSEDGT-IRLWQTTPG 299 (334)
T ss_pred eeeccCCCce-EEEEEecCC
Confidence 9999999998 899998876
No 33
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.69 E-value=3.8e-17 Score=173.57 Aligned_cols=213 Identities=16% Similarity=0.282 Sum_probs=156.9
Q ss_pred cCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccccCCCCCC
Q 003310 26 RSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATACNGTSAN 105 (832)
Q Consensus 26 ~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~~~g~~~~ 105 (832)
++.++|||.++. .+..++..|.|.|-|+++- .| ++++++
T Consensus 216 DnTikiWD~n~~-~c~~~L~GHtGSVLCLqyd---------------~r---viisGS---------------------- 254 (499)
T KOG0281|consen 216 DNTIKIWDKNSL-ECLKILTGHTGSVLCLQYD---------------ER---VIVSGS---------------------- 254 (499)
T ss_pred cCceEEeccccH-HHHHhhhcCCCcEEeeecc---------------ce---EEEecC----------------------
Confidence 456899999874 5778888999999999873 12 234432
Q ss_pred CCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcCCEEEEE-eCCEEEEEECCCCce---EEEEecCCCccCC
Q 003310 106 YHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSSRVVAIC-QAAQVHCFDAATLEI---EYAILTNPIVMGH 180 (832)
Q Consensus 106 ~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~r~LAVa-~~~~I~vwDl~t~~~---~~tl~t~~~~~~~ 180 (832)
.+.||++||..||++++++-+ ...|..++|+..+++.+ -|..|.|||+..... .+.|.+|-.
T Consensus 255 ---------SDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps~it~rrVLvGHrA---- 321 (499)
T KOG0281|consen 255 ---------SDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDRSIAVWDMASPTDITLRRVLVGHRA---- 321 (499)
T ss_pred ---------CCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCceeEEEeccCchHHHHHHHHhhhhh----
Confidence 237999999999999999865 56899999998888774 578999999875431 111222200
Q ss_pred CCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccc
Q 003310 181 PSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYK 260 (832)
Q Consensus 181 p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~ 260 (832)
.+|.+. |.
T Consensus 322 --------aVNvVd-------fd--------------------------------------------------------- 329 (499)
T KOG0281|consen 322 --------AVNVVD-------FD--------------------------------------------------------- 329 (499)
T ss_pred --------heeeec-------cc---------------------------------------------------------
Confidence 000010 00
Q ss_pred ccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEE
Q 003310 261 KLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTAS 340 (832)
Q Consensus 261 ~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS 340 (832)
. ..++++..|.+|++|++.++..+.++.+|...|.|+.+ .|+++++||
T Consensus 330 -----------------------~-------kyIVsASgDRTikvW~~st~efvRtl~gHkRGIAClQY--r~rlvVSGS 377 (499)
T KOG0281|consen 330 -----------------------D-------KYIVSASGDRTIKVWSTSTCEFVRTLNGHKRGIACLQY--RDRLVVSGS 377 (499)
T ss_pred -----------------------c-------ceEEEecCCceEEEEeccceeeehhhhcccccceehhc--cCeEEEecC
Confidence 0 01234567889999999999999999999999999887 689999999
Q ss_pred cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 341 VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 341 ~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
.|.+ |++||+..| .+|..| .|+.. -|.+|-| |.+.+++|.-||+|+|||+...-.+
T Consensus 378 SDnt-IRlwdi~~G--------------~cLRvL-eGHEe-LvRciRF--d~krIVSGaYDGkikvWdl~aaldp 433 (499)
T KOG0281|consen 378 SDNT-IRLWDIECG--------------ACLRVL-EGHEE-LVRCIRF--DNKRIVSGAYDGKIKVWDLQAALDP 433 (499)
T ss_pred CCce-EEEEecccc--------------HHHHHH-hchHH-hhhheee--cCceeeeccccceEEEEecccccCC
Confidence 9976 999999877 344344 35432 3889999 5689999999999999999875433
No 34
>PTZ00421 coronin; Provisional
Probab=99.69 E-value=2.5e-14 Score=165.93 Aligned_cols=179 Identities=15% Similarity=0.118 Sum_probs=126.6
Q ss_pred CEEEEEECCCCc-------EEEEEe-CCCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCC
Q 003310 117 TVVHFYSLRSQS-------YVHMLK-FRSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAG 184 (832)
Q Consensus 117 ~tVrlWDL~Tg~-------~V~tL~-f~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~ 184 (832)
++|++||+.++. .+.+|. +...|..|+|++ ++|++ +.|++|+|||+.+++.+.++..|...
T Consensus 98 gtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~~------- 170 (493)
T PTZ00421 98 GTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQ------- 170 (493)
T ss_pred CEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCCCCc-------
Confidence 789999998753 456665 467899999974 46776 67899999999999888777655320
Q ss_pred CCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccc
Q 003310 185 GIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQ 264 (832)
Q Consensus 185 ~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~ 264 (832)
+ ..|++.. +|.
T Consensus 171 --------V----~sla~sp---------------------------dG~------------------------------ 181 (493)
T PTZ00421 171 --------I----TSLEWNL---------------------------DGS------------------------------ 181 (493)
T ss_pred --------e----EEEEEEC---------------------------CCC------------------------------
Confidence 1 1123321 011
Q ss_pred ccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCC-eEEEEEcCCCCEEEEEE---
Q 003310 265 YCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSP-ISALCFDPSGILLVTAS--- 340 (832)
Q Consensus 265 y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~p-Is~LaFSPdG~lLATaS--- 340 (832)
.++++..||+|+|||+.+++.+.++.+|.+. +..+.|.+++.+|+|++
T Consensus 182 ----------------------------lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~ 233 (493)
T PTZ00421 182 ----------------------------LLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSK 233 (493)
T ss_pred ----------------------------EEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCC
Confidence 1234567899999999999999999999875 45678999988887765
Q ss_pred -cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe-CCCcEEEEecCCCCC
Q 003310 341 -VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAINPLGG 414 (832)
Q Consensus 341 -~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~~~g~ 414 (832)
.|+. |+|||++... .......+ .....+....|++|+++|++++ .|++|++|++.....
T Consensus 234 s~Dr~-VklWDlr~~~-----------~p~~~~~~---d~~~~~~~~~~d~d~~~L~lggkgDg~Iriwdl~~~~~ 294 (493)
T PTZ00421 234 SQQRQ-IMLWDTRKMA-----------SPYSTVDL---DQSSALFIPFFDEDTNLLYIGSKGEGNIRCFELMNERL 294 (493)
T ss_pred CCCCe-EEEEeCCCCC-----------CceeEecc---CCCCceEEEEEcCCCCEEEEEEeCCCeEEEEEeeCCce
Confidence 3565 9999997640 00111111 1223366778999999999988 499999999987543
No 35
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.69 E-value=1.9e-15 Score=176.18 Aligned_cols=225 Identities=16% Similarity=0.224 Sum_probs=169.7
Q ss_pred cEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccc
Q 003310 19 RVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATA 98 (832)
Q Consensus 19 ~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~ 98 (832)
.++...++..++|||+.+ +.+.-++-.|.+.|+++.+.+. + +++++
T Consensus 263 ~lvsgS~D~t~rvWd~~s-g~C~~~l~gh~stv~~~~~~~~-----------------~-~~sgs--------------- 308 (537)
T KOG0274|consen 263 KLVSGSTDKTERVWDCST-GECTHSLQGHTSSVRCLTIDPF-----------------L-LVSGS--------------- 308 (537)
T ss_pred EEEEEecCCcEEeEecCC-CcEEEEecCCCceEEEEEccCc-----------------e-Eeecc---------------
Confidence 334444488899999776 7888888899999999987641 1 12321
Q ss_pred cCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEEEecCCC
Q 003310 99 CNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYAILTNPI 176 (832)
Q Consensus 99 ~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~ 176 (832)
.|++|++|++.++.+++++. +..+|++|.++..++++ ++++.|.|||+.+++++.++.+|..
T Consensus 309 ----------------~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~ 372 (537)
T KOG0274|consen 309 ----------------RDNTVKVWDVTNGACLNLLRGHTGPVNCVQLDEPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTG 372 (537)
T ss_pred ----------------CCceEEEEeccCcceEEEeccccccEEEEEecCCEEEEEecCceEEEEEhhhceeeeeecCCcc
Confidence 23899999999999999999 88999999999766655 7899999999999999999988742
Q ss_pred ccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccC
Q 003310 177 VMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGD 256 (832)
Q Consensus 177 ~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd 256 (832)
.. -+ |++..+ .
T Consensus 373 ~V------------~s-------l~~~~~---------------------------------------~----------- 383 (537)
T KOG0274|consen 373 RV------------YS-------LIVDSE---------------------------------------N----------- 383 (537)
T ss_pred eE------------EE-------EEecCc---------------------------------------c-----------
Confidence 10 01 111100 0
Q ss_pred ccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCCCCE
Q 003310 257 LGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPSGIL 335 (832)
Q Consensus 257 ~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPdG~l 335 (832)
...++..|+.|++||+.++ +++.+++.|..-|..+. ..+++
T Consensus 384 ------------------------------------~~~Sgs~D~~IkvWdl~~~~~c~~tl~~h~~~v~~l~--~~~~~ 425 (537)
T KOG0274|consen 384 ------------------------------------RLLSGSLDTTIKVWDLRTKRKCIHTLQGHTSLVSSLL--LRDNF 425 (537)
T ss_pred ------------------------------------eEEeeeeccceEeecCCchhhhhhhhcCCcccccccc--cccce
Confidence 0112345689999999999 99999999999886655 47799
Q ss_pred EEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 336 LVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 336 LATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
|++++.||+ |++||+.++ ..+..+.-. ....|..+++. -..+++++.||++++||+..+...
T Consensus 426 Lvs~~aD~~-Ik~WD~~~~--------------~~~~~~~~~-~~~~v~~l~~~--~~~il~s~~~~~~~l~dl~~~~~~ 487 (537)
T KOG0274|consen 426 LVSSSADGT-IKLWDAEEG--------------ECLRTLEGR-HVGGVSALALG--KEEILCSSDDGSVKLWDLRSGTLI 487 (537)
T ss_pred eEecccccc-EEEeecccC--------------ceeeeeccC-CcccEEEeecC--cceEEEEecCCeeEEEecccCchh
Confidence 999999997 999999887 344455322 22348888887 357889999999999999887665
Q ss_pred eee
Q 003310 416 VNF 418 (832)
Q Consensus 416 ~~~ 418 (832)
..+
T Consensus 488 ~~l 490 (537)
T KOG0274|consen 488 RTL 490 (537)
T ss_pred hhh
Confidence 544
No 36
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.69 E-value=2.3e-15 Score=172.58 Aligned_cols=289 Identities=16% Similarity=0.222 Sum_probs=192.3
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..++|++-.+..+.+||.+...-+..+++ -.+.|..++++- | .+. .|||++.+
T Consensus 293 ~~~~l~vtaeQnl~l~d~~~l~i~k~ivG-~ndEI~Dm~~lG-~------e~~------~laVATNs------------- 345 (775)
T KOG0319|consen 293 MSQLLLVTAEQNLFLYDEDELTIVKQIVG-YNDEILDMKFLG-P------EES------HLAVATNS------------- 345 (775)
T ss_pred cCceEEEEccceEEEEEccccEEehhhcC-CchhheeeeecC-C------ccc------eEEEEeCC-------------
Confidence 37888888899999999887433333343 456788888873 1 122 47777642
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEE-c-CCEEEEE-eCCEEEEEECCCC----ceEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRC-S-SRVVAIC-QAAQVHCFDAATL----EIEY 169 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~-S-~r~LAVa-~~~~I~vwDl~t~----~~~~ 169 (832)
..+|+|++.+-.|----.+...|.++.. + +-+||.+ -|+++.+|.+... -+..
T Consensus 346 --------------------~~lr~y~~~~~~c~ii~GH~e~vlSL~~~~~g~llat~sKD~svilWr~~~~~~~~~~~a 405 (775)
T KOG0319|consen 346 --------------------PELRLYTLPTSYCQIIPGHTEAVLSLDVWSSGDLLATGSKDKSVILWRLNNNCSKSLCVA 405 (775)
T ss_pred --------------------CceEEEecCCCceEEEeCchhheeeeeecccCcEEEEecCCceEEEEEecCCcchhhhhh
Confidence 4699999998887633345678999983 3 4577774 5789999977432 2333
Q ss_pred EEecCCCccCCCCCCCCCcccceeee---ccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccc
Q 003310 170 AILTNPIVMGHPSAGGIGIGYGPLAV---GPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKH 246 (832)
Q Consensus 170 tl~t~~~~~~~p~~~~~~~~~~p~Al---g~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~ 246 (832)
...+|.+.. +.++. ++.+++..+.. .++.-|-..-+|.
T Consensus 406 ~~~gH~~sv------------gava~~~~~asffvsvS~D---------------------------~tlK~W~l~~s~~ 446 (775)
T KOG0319|consen 406 QANGHTNSV------------GAVAGSKLGASFFVSVSQD---------------------------CTLKLWDLPKSKE 446 (775)
T ss_pred hhccccccc------------ceeeecccCccEEEEecCC---------------------------ceEEEecCCCccc
Confidence 334443321 12222 12344443321 1111111111111
Q ss_pred eeceeeeccCccccccccccccccCCC-cCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeE
Q 003310 247 LAAGIVNLGDLGYKKLSQYCSEFLPDS-QNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPIS 325 (832)
Q Consensus 247 lasGi~~lGd~g~~~ls~y~~~~~p~~-~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs 325 (832)
-+.-+ ..-.+|.. ...++ .+.+. ..|.. ..+++++.|.+.+||++.....+.+|.+|+..|.
T Consensus 447 ~~~~~--------~~~~~~t~-~aHdKdIN~Va-ia~nd-------kLiAT~SqDktaKiW~le~~~l~~vLsGH~RGvw 509 (775)
T KOG0319|consen 447 TAFPI--------VLTCRYTE-RAHDKDINCVA-IAPND-------KLIATGSQDKTAKIWDLEQLRLLGVLSGHTRGVW 509 (775)
T ss_pred ccccc--------eehhhHHH-HhhcccccceE-ecCCC-------ceEEecccccceeeecccCceEEEEeeCCccceE
Confidence 11000 00000100 01111 11111 11211 2467899999999999999999999999999999
Q ss_pred EEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEE
Q 003310 326 ALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSH 405 (832)
Q Consensus 326 ~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVh 405 (832)
|+.|+|..++|||+|.|+| |+||.+.+. .++.+| .||+.+ |..++|-.+|+.|+++++||-++
T Consensus 510 ~V~Fs~~dq~laT~SgD~T-vKIW~is~f--------------SClkT~-eGH~~a-Vlra~F~~~~~qliS~~adGliK 572 (775)
T KOG0319|consen 510 CVSFSKNDQLLATCSGDKT-VKIWSISTF--------------SCLKTF-EGHTSA-VLRASFIRNGKQLISAGADGLIK 572 (775)
T ss_pred EEEeccccceeEeccCCce-EEEEEeccc--------------eeeeee-cCccce-eEeeeeeeCCcEEEeccCCCcEE
Confidence 9999999999999999988 999999987 577788 477765 99999999999999999999999
Q ss_pred EEecCCCCCceeeccCCCCc
Q 003310 406 LFAINPLGGSVNFQPTDANF 425 (832)
Q Consensus 406 Iwdl~~~g~~~~~~~H~~~~ 425 (832)
||++++..+..++..|++..
T Consensus 573 lWnikt~eC~~tlD~H~Drv 592 (775)
T KOG0319|consen 573 LWNIKTNECEMTLDAHNDRV 592 (775)
T ss_pred EEeccchhhhhhhhhcccee
Confidence 99999999999999997754
No 37
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.69 E-value=5.1e-16 Score=168.47 Aligned_cols=225 Identities=15% Similarity=0.235 Sum_probs=168.6
Q ss_pred CCCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 16 ATRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 16 ~~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
+.+++|+.++++..++|+++. .-....++.|.+.|.++++.-. . .+++++.
T Consensus 230 ~~~~~iAas~d~~~r~Wnvd~-~r~~~TLsGHtdkVt~ak~~~~------------~----~~vVsgs------------ 280 (459)
T KOG0288|consen 230 DNKHVIAASNDKNLRLWNVDS-LRLRHTLSGHTDKVTAAKFKLS------------H----SRVVSGS------------ 280 (459)
T ss_pred CCceEEeecCCCceeeeeccc-hhhhhhhcccccceeeehhhcc------------c----cceeecc------------
Confidence 348899999999999999997 4577788889999999987531 1 1234431
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEE-eCCEEEEEECCCCceEEEEecC
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAIC-QAAQVHCFDAATLEIEYAILTN 174 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa-~~~~I~vwDl~t~~~~~tl~t~ 174 (832)
.+.+||+|||....|.+++-+.+.+.+|.++...++.+ .|++|++||+++..+.+++...
T Consensus 281 -------------------~DRtiK~WDl~k~~C~kt~l~~S~cnDI~~~~~~~~SgH~DkkvRfwD~Rs~~~~~sv~~g 341 (459)
T KOG0288|consen 281 -------------------ADRTIKLWDLQKAYCSKTVLPGSQCNDIVCSISDVISGHFDKKVRFWDIRSADKTRSVPLG 341 (459)
T ss_pred -------------------ccchhhhhhhhhhheeccccccccccceEecceeeeecccccceEEEeccCCceeeEeecC
Confidence 34899999999999999999999999999996655554 6889999999998887776443
Q ss_pred CCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeec
Q 003310 175 PIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNL 254 (832)
Q Consensus 175 ~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~l 254 (832)
.. +..+.++ + +|.
T Consensus 342 g~-------------vtSl~ls--------------------~--------------~g~-------------------- 354 (459)
T KOG0288|consen 342 GR-------------VTSLDLS--------------------M--------------DGL-------------------- 354 (459)
T ss_pred cc-------------eeeEeec--------------------c--------------CCe--------------------
Confidence 11 0001100 0 000
Q ss_pred cCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCC----CCeEEEEEc
Q 003310 255 GDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHK----SPISALCFD 330 (832)
Q Consensus 255 Gd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~----~pIs~LaFS 330 (832)
.+.+...|.++.+.|+.+......|.+-. +.++.+.||
T Consensus 355 --------------------------------------~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfS 396 (459)
T KOG0288|consen 355 --------------------------------------ELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFS 396 (459)
T ss_pred --------------------------------------EEeeecCCCceeeeecccccEEEEeeccccccccccceeEEC
Confidence 01112345578999999998888887542 458899999
Q ss_pred CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 331 PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 331 PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
|+|.|+|+||.||. |+||++.++ .....+.-...++.|.+++|+|-|+.|++++.++.+.+|.
T Consensus 397 pd~~YvaAGS~dgs-v~iW~v~tg--------------KlE~~l~~s~s~~aI~s~~W~~sG~~Llsadk~~~v~lW~ 459 (459)
T KOG0288|consen 397 PDGSYVAAGSADGS-VYIWSVFTG--------------KLEKVLSLSTSNAAITSLSWNPSGSGLLSADKQKAVTLWT 459 (459)
T ss_pred CCCceeeeccCCCc-EEEEEccCc--------------eEEEEeccCCCCcceEEEEEcCCCchhhcccCCcceEecC
Confidence 99999999999998 899999987 3333443333344599999999999999999999999993
No 38
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.67 E-value=1.6e-16 Score=172.14 Aligned_cols=224 Identities=17% Similarity=0.326 Sum_probs=165.0
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
|++|..++.+-|.+|+.... +...++-.||.+||+++++++. . -+|.++.
T Consensus 109 RRLltgs~SGEFtLWNg~~f-nFEtilQaHDs~Vr~m~ws~~g--------~-------wmiSgD~-------------- 158 (464)
T KOG0284|consen 109 RRLLTGSQSGEFTLWNGTSF-NFETILQAHDSPVRTMKWSHNG--------T-------WMISGDK-------------- 158 (464)
T ss_pred ceeEeecccccEEEecCcee-eHHHHhhhhcccceeEEEccCC--------C-------EEEEcCC--------------
Confidence 77777777777999998653 4556667899999999998753 1 1233321
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-C-CCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEEEe
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-F-RSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~tl~ 172 (832)
.++||+|+..-.. |+.++ + ...|++++|++ +++..+.|++|+|||....+....|.
T Consensus 159 ------------------gG~iKyWqpnmnn-Vk~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~vL~ 219 (464)
T KOG0284|consen 159 ------------------GGMIKYWQPNMNN-VKIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEERVLR 219 (464)
T ss_pred ------------------CceEEecccchhh-hHHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCCchhheec
Confidence 1789999986443 44444 3 36899999984 56666788999999999887777776
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
+|. +-+ +.+++- |+ |
T Consensus 220 GHg--------------wdV-----ksvdWH-------------P~--------------------------k------- 234 (464)
T KOG0284|consen 220 GHG--------------WDV-----KSVDWH-------------PT--------------------------K------- 234 (464)
T ss_pred cCC--------------CCc-----ceeccC-------------Cc--------------------------c-------
Confidence 652 100 111111 11 0
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS 332 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd 332 (832)
+.+++++.|..|++||.+++.+++++..|...|..+.|+|+
T Consensus 235 ---------------------------------------gLiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n 275 (464)
T KOG0284|consen 235 ---------------------------------------GLIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPN 275 (464)
T ss_pred ---------------------------------------ceeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCC
Confidence 12234556679999999999999999999999999999999
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCC
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~ 411 (832)
|.+|+|+|.|. .+++||+++- ..++.+ ||+.. .|.+++|+| .-.+|.+|+.||.|..|.+..
T Consensus 276 ~N~Llt~skD~-~~kv~DiR~m--------------kEl~~~-r~Hkk-dv~~~~WhP~~~~lftsgg~Dgsvvh~~v~~ 338 (464)
T KOG0284|consen 276 GNWLLTGSKDQ-SCKVFDIRTM--------------KELFTY-RGHKK-DVTSLTWHPLNESLFTSGGSDGSVVHWVVGL 338 (464)
T ss_pred CCeeEEccCCc-eEEEEehhHh--------------HHHHHh-hcchh-hheeeccccccccceeeccCCCceEEEeccc
Confidence 99999999996 5999999853 233444 56643 599999999 456888999999999999863
No 39
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.67 E-value=3.8e-15 Score=173.64 Aligned_cols=231 Identities=13% Similarity=0.158 Sum_probs=171.6
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
+.+....++..+++||....-.+..-+..|.|.|..+++... + ++| ++++
T Consensus 219 ~~~~~~s~~~tl~~~~~~~~~~i~~~l~GH~g~V~~l~~~~~--------~------~~l--vsgS-------------- 268 (537)
T KOG0274|consen 219 GFFKSGSDDSTLHLWDLNNGYLILTRLVGHFGGVWGLAFPSG--------G------DKL--VSGS-------------- 268 (537)
T ss_pred CeEEecCCCceeEEeecccceEEEeeccCCCCCceeEEEecC--------C------CEE--EEEe--------------
Confidence 455555666678899998743333326689999999998521 0 233 3321
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEEEecCC
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYAILTNP 175 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~ 175 (832)
.+.|+|+||+.+|+|+++|.. .+.|+.+..-+.+++. +.|.+|+|||+.++.++.++.+|.
T Consensus 269 -----------------~D~t~rvWd~~sg~C~~~l~gh~stv~~~~~~~~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~ 331 (537)
T KOG0274|consen 269 -----------------TDKTERVWDCSTGECTHSLQGHTSSVRCLTIDPFLLVSGSRDNTVKVWDVTNGACLNLLRGHT 331 (537)
T ss_pred -----------------cCCcEEeEecCCCcEEEEecCCCceEEEEEccCceEeeccCCceEEEEeccCcceEEEecccc
Confidence 238999999999999999994 6788888877766665 689999999999999999987652
Q ss_pred CccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeecc
Q 003310 176 IVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLG 255 (832)
Q Consensus 176 ~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lG 255 (832)
. ++ +-+.+. +.
T Consensus 332 ~---------------~V----~~v~~~-----------------------------~~--------------------- 342 (537)
T KOG0274|consen 332 G---------------PV----NCVQLD-----------------------------EP--------------------- 342 (537)
T ss_pred c---------------cE----EEEEec-----------------------------CC---------------------
Confidence 1 11 001111 00
Q ss_pred CccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCE
Q 003310 256 DLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGIL 335 (832)
Q Consensus 256 d~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~l 335 (832)
.++++..||+|+|||+.++++++++++|+..|.++.|++. ..
T Consensus 343 -------------------------------------~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~V~sl~~~~~-~~ 384 (537)
T KOG0274|consen 343 -------------------------------------LLVSGSYDGTVKVWDPRTGKCLKSLSGHTGRVYSLIVDSE-NR 384 (537)
T ss_pred -------------------------------------EEEEEecCceEEEEEhhhceeeeeecCCcceEEEEEecCc-ce
Confidence 1233467889999999999999999999999999998877 89
Q ss_pred EEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 336 LVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 336 LATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
+.+||.|++ |++||+.+. ..+++.| .|++ +.+.++.+ .+++|.+++.|++|++||++.++..
T Consensus 385 ~~Sgs~D~~-IkvWdl~~~-------------~~c~~tl-~~h~-~~v~~l~~--~~~~Lvs~~aD~~Ik~WD~~~~~~~ 446 (537)
T KOG0274|consen 385 LLSGSLDTT-IKVWDLRTK-------------RKCIHTL-QGHT-SLVSSLLL--RDNFLVSSSADGTIKLWDAEEGECL 446 (537)
T ss_pred EEeeeeccc-eEeecCCch-------------hhhhhhh-cCCc-cccccccc--ccceeEeccccccEEEeecccCcee
Confidence 999999977 999999875 0344455 3443 33555544 5689999999999999999999988
Q ss_pred eeecc
Q 003310 416 VNFQP 420 (832)
Q Consensus 416 ~~~~~ 420 (832)
.++.+
T Consensus 447 ~~~~~ 451 (537)
T KOG0274|consen 447 RTLEG 451 (537)
T ss_pred eeecc
Confidence 88877
No 40
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.67 E-value=3.5e-15 Score=164.14 Aligned_cols=189 Identities=17% Similarity=0.246 Sum_probs=145.3
Q ss_pred CCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEE-EEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcc
Q 003310 114 SVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVA-ICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 114 ~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LA-Va~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
+.++.+|+|+. +|..+.+|.+ +.+|+++++++ .+|+ .+-|+++.+||..+++..+.+.-|..+
T Consensus 254 ~~~G~~riw~~-~G~l~~tl~~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g~~~q~f~~~s~~------------ 320 (524)
T KOG0273|consen 254 SEDGEARIWNK-DGNLISTLGQHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTGTVKQQFEFHSAP------------ 320 (524)
T ss_pred ecCcEEEEEec-CchhhhhhhccCCceEEEEEcCCCCEEEeccCCccEEEEeccCceEEEeeeeccCC------------
Confidence 35699999997 5777888876 78999999985 4455 478999999999999877666443210
Q ss_pred cceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 190 YGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 190 ~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
++.-.|. +
T Consensus 321 ----~lDVdW~------------------------------------------------------~-------------- 328 (524)
T KOG0273|consen 321 ----ALDVDWQ------------------------------------------------------S-------------- 328 (524)
T ss_pred ----ccceEEe------------------------------------------------------c--------------
Confidence 0000111 0
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIF 349 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iw 349 (832)
+..|++...+|.|+|+.+...+|+.+|.+|.++|.+|.|+|.|.+|||||+|+| ++||
T Consensus 329 ---------------------~~~F~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD~T-lkiW 386 (524)
T KOG0273|consen 329 ---------------------NDEFATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALKWNPTGSLLASCSDDGT-LKIW 386 (524)
T ss_pred ---------------------CceEeecCCCceEEEEEecCCCcceeeecccCceEEEEECCCCceEEEecCCCe-eEee
Confidence 012445567899999999999999999999999999999999999999999998 8999
Q ss_pred eCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCC---------EEEEEeCCCcEEEEecCCCCCceeecc
Q 003310 350 KIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN---------WIMISSSRGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 350 di~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~---------~LAsgS~DgTVhIwdl~~~g~~~~~~~ 420 (832)
..... .....| ++|. ..|+.|.|||+|. .+|+++.|+||++||+..+.+..+|..
T Consensus 387 s~~~~--------------~~~~~l-~~Hs-kei~t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~gv~i~~f~k 450 (524)
T KOG0273|consen 387 SMGQS--------------NSVHDL-QAHS-KEIYTIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVESGVPIHTLMK 450 (524)
T ss_pred ecCCC--------------cchhhh-hhhc-cceeeEeecCCCCccCCCcCCceEEEeecCCeEEEEEccCCceeEeecc
Confidence 87654 122233 2333 3599999999754 599999999999999999988889988
Q ss_pred CCCCc
Q 003310 421 TDANF 425 (832)
Q Consensus 421 H~~~~ 425 (832)
|+...
T Consensus 451 H~~pV 455 (524)
T KOG0273|consen 451 HQEPV 455 (524)
T ss_pred CCCce
Confidence 85443
No 41
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.67 E-value=2.2e-15 Score=161.12 Aligned_cols=190 Identities=13% Similarity=0.217 Sum_probs=144.4
Q ss_pred CCCEEEEEECCCCcEEEEEe-CCCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccc
Q 003310 115 VPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGY 190 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~ 190 (832)
.+++++|||+.||++..+|. +-..|..|++|. -++.++.+++|+|||+..-+.++..-+|-.
T Consensus 171 ~DrtikIwDlatg~LkltltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS-------------- 236 (460)
T KOG0285|consen 171 ADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLS-------------- 236 (460)
T ss_pred CCceeEEEEcccCeEEEeecchhheeeeeeecccCceEEEecCCCeeEEEechhhhhHHHhccccc--------------
Confidence 34899999999999999998 678999999985 345567889999999998776665544410
Q ss_pred ceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccccc
Q 003310 191 GPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFL 270 (832)
Q Consensus 191 ~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~ 270 (832)
+. |+ -++.
T Consensus 237 ~V---------~~---------------------------------------------------------------L~lh 244 (460)
T KOG0285|consen 237 GV---------YC---------------------------------------------------------------LDLH 244 (460)
T ss_pred ee---------EE---------------------------------------------------------------Eecc
Confidence 00 11 0000
Q ss_pred CCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 271 PDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 271 p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
|.- ..+++++.|.+++|||++++..+..|.+|+.+|..+.|.|-.-.++|||.|++ |++||
T Consensus 245 PTl------------------dvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D~t-vrlWD 305 (460)
T KOG0285|consen 245 PTL------------------DVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASVMCQPTDPQVITGSHDST-VRLWD 305 (460)
T ss_pred ccc------------------eeEEecCCcceEEEeeecccceEEEecCCCCcceeEEeecCCCceEEecCCce-EEEee
Confidence 100 01234567889999999999999999999999999999997777999999998 99999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
+..+ +.+.++. ++...|.+++..|.-..+|++|.| .|+-|++..+.-...+.+|+....
T Consensus 306 l~ag--------------kt~~tlt--~hkksvral~lhP~e~~fASas~d-nik~w~~p~g~f~~nlsgh~~iin 364 (460)
T KOG0285|consen 306 LRAG--------------KTMITLT--HHKKSVRALCLHPKENLFASASPD-NIKQWKLPEGEFLQNLSGHNAIIN 364 (460)
T ss_pred eccC--------------ceeEeee--cccceeeEEecCCchhhhhccCCc-cceeccCCccchhhccccccceee
Confidence 9877 3333442 223348999999999999999887 588999977666677888876554
No 42
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.66 E-value=7.2e-14 Score=155.53 Aligned_cols=108 Identities=19% Similarity=0.257 Sum_probs=91.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
+++++|+.|.+|+-.--+-..+++.|..-|.|+.|||||.++||++.||+ |.|||-.++ ..+..|
T Consensus 164 ~T~sdDn~v~ffeGPPFKFk~s~r~HskFV~~VRysPDG~~Fat~gsDgk-i~iyDGktg--------------e~vg~l 228 (603)
T KOG0318|consen 164 ATGSDDNTVAFFEGPPFKFKSSFREHSKFVNCVRYSPDGSRFATAGSDGK-IYIYDGKTG--------------EKVGEL 228 (603)
T ss_pred EeccCCCeEEEeeCCCeeeeecccccccceeeEEECCCCCeEEEecCCcc-EEEEcCCCc--------------cEEEEe
Confidence 45678889999987777777888899999999999999999999999998 889998887 566666
Q ss_pred ec--cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 375 QR--GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 375 ~r--G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
.- +| ...|+.|+||||++.|+++|.|.|++|||+++.....++
T Consensus 229 ~~~~aH-kGsIfalsWsPDs~~~~T~SaDkt~KIWdVs~~slv~t~ 273 (603)
T KOG0318|consen 229 EDSDAH-KGSIFALSWSPDSTQFLTVSADKTIKIWDVSTNSLVSTW 273 (603)
T ss_pred cCCCCc-cccEEEEEECCCCceEEEecCCceEEEEEeeccceEEEe
Confidence 42 22 234999999999999999999999999999987555554
No 43
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.66 E-value=4.1e-15 Score=159.05 Aligned_cols=226 Identities=17% Similarity=0.257 Sum_probs=175.1
Q ss_pred CCCcEEEEEecC-CeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 16 ATRRVLLLGYRS-GFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 16 ~~~~vLl~Gy~~-G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
+....|+.|... .++|||+++ +.+.-.+-.|-..||-+++.+ .+|+|.-|+++
T Consensus 161 P~n~wf~tgs~DrtikIwDlat-g~LkltltGhi~~vr~vavS~--------------rHpYlFs~ged----------- 214 (460)
T KOG0285|consen 161 PGNEWFATGSADRTIKIWDLAT-GQLKLTLTGHIETVRGVAVSK--------------RHPYLFSAGED----------- 214 (460)
T ss_pred CCceeEEecCCCceeEEEEccc-CeEEEeecchhheeeeeeecc--------------cCceEEEecCC-----------
Confidence 346777777655 499999997 677777778999999999875 34777655432
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEE
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~t 170 (832)
+.|+-|||...+.++..-. =+.|++++.-+ ++|+. +.|..|+|||++|...+++
T Consensus 215 ----------------------k~VKCwDLe~nkvIR~YhGHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~ 272 (460)
T KOG0285|consen 215 ----------------------KQVKCWDLEYNKVIRHYHGHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHV 272 (460)
T ss_pred ----------------------CeeEEEechhhhhHHHhccccceeEEEeccccceeEEecCCcceEEEeeecccceEEE
Confidence 7899999999998887643 57899999975 55665 5788999999999999999
Q ss_pred EecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceece
Q 003310 171 ILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAG 250 (832)
Q Consensus 171 l~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasG 250 (832)
+.+|.++ ++ +-++-+
T Consensus 273 l~GH~~~---------------V~---~V~~~~----------------------------------------------- 287 (460)
T KOG0285|consen 273 LSGHTNP---------------VA---SVMCQP----------------------------------------------- 287 (460)
T ss_pred ecCCCCc---------------ce---eEEeec-----------------------------------------------
Confidence 9888542 21 000000
Q ss_pred eeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEc
Q 003310 251 IVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFD 330 (832)
Q Consensus 251 i~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFS 330 (832)
+++.+.++..|++|++||+..++...++..|...|.||+..
T Consensus 288 ---------------------------------------~dpqvit~S~D~tvrlWDl~agkt~~tlt~hkksvral~lh 328 (460)
T KOG0285|consen 288 ---------------------------------------TDPQVITGSHDSTVRLWDLRAGKTMITLTHHKKSVRALCLH 328 (460)
T ss_pred ---------------------------------------CCCceEEecCCceEEEeeeccCceeEeeecccceeeEEecC
Confidence 00112345678999999999999999999999999999999
Q ss_pred CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 331 PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 331 PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
|.-.++|+||.| +|+-|++..+ ..+..+ .|+ .+.|++++...|+ .+++|++.|++..||..
T Consensus 329 P~e~~fASas~d--nik~w~~p~g--------------~f~~nl-sgh-~~iintl~~nsD~-v~~~G~dng~~~fwdwk 389 (460)
T KOG0285|consen 329 PKENLFASASPD--NIKQWKLPEG--------------EFLQNL-SGH-NAIINTLSVNSDG-VLVSGGDNGSIMFWDWK 389 (460)
T ss_pred CchhhhhccCCc--cceeccCCcc--------------chhhcc-ccc-cceeeeeeeccCc-eEEEcCCceEEEEEecC
Confidence 999999999998 3899999876 344444 344 3569999999997 56799999999999998
Q ss_pred CC
Q 003310 411 PL 412 (832)
Q Consensus 411 ~~ 412 (832)
.+
T Consensus 390 sg 391 (460)
T KOG0285|consen 390 SG 391 (460)
T ss_pred cC
Confidence 75
No 44
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.66 E-value=3.8e-15 Score=170.78 Aligned_cols=227 Identities=18% Similarity=0.255 Sum_probs=157.6
Q ss_pred CEEEEEECCCCcEEEEEe-CCCCEEEEEEcC---CEEEEEe-CCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccc
Q 003310 117 TVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS---RVVAICQ-AAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~---r~LAVa~-~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
..+.|||.++.+.++.|- ++..|++++|-+ +.|||+. ...+++||..++.|. .+.+|....
T Consensus 303 Qnl~l~d~~~l~i~k~ivG~ndEI~Dm~~lG~e~~~laVATNs~~lr~y~~~~~~c~-ii~GH~e~v------------- 368 (775)
T KOG0319|consen 303 QNLFLYDEDELTIVKQIVGYNDEILDMKFLGPEESHLAVATNSPELRLYTLPTSYCQ-IIPGHTEAV------------- 368 (775)
T ss_pred ceEEEEEccccEEehhhcCCchhheeeeecCCccceEEEEeCCCceEEEecCCCceE-EEeCchhhe-------------
Confidence 568999999999988875 588999999964 7888865 568999999999886 666775321
Q ss_pred eeeec----cceEEeeCCC--ceecCCCccCCcccccccccccccCCCcceeeeecccccce--eceeeeccCccccccc
Q 003310 192 PLAVG----PRWLAYSGSP--VVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL--AAGIVNLGDLGYKKLS 263 (832)
Q Consensus 192 p~Alg----~r~LAya~~~--~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l--asGi~~lGd~g~~~ls 263 (832)
++|. .-|||.+++. ++.| .+-..++|.+ +.+
T Consensus 369 -lSL~~~~~g~llat~sKD~svilW---------------------------r~~~~~~~~~~~a~~------------- 407 (775)
T KOG0319|consen 369 -LSLDVWSSGDLLATGSKDKSVILW---------------------------RLNNNCSKSLCVAQA------------- 407 (775)
T ss_pred -eeeeecccCcEEEEecCCceEEEE---------------------------EecCCcchhhhhhhh-------------
Confidence 2222 2466666432 2222 0000011100 000
Q ss_pred cccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCc-----EEE----EeccCCCCeEEEEEcCCCC
Q 003310 264 QYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKN-----VIA----QFRAHKSPISALCFDPSGI 334 (832)
Q Consensus 264 ~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~-----~l~----~~~aH~~pIs~LaFSPdG~ 334 (832)
.++++...+-...+ .| ..-|++.+.|+++++|++...+ .+. +..+|...|+|++.+|+.+
T Consensus 408 --------~gH~~svgava~~~-~~--asffvsvS~D~tlK~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndk 476 (775)
T KOG0319|consen 408 --------NGHTNSVGAVAGSK-LG--ASFFVSVSQDCTLKLWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDK 476 (775)
T ss_pred --------cccccccceeeecc-cC--ccEEEEecCCceEEEecCCCcccccccceehhhHHHHhhcccccceEecCCCc
Confidence 01100000000000 01 1235678899999999997621 111 3358999999999999999
Q ss_pred EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 335 LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 335 lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
++||||.|.+ .+||++... .++..| +||+. .|+||.|||..+.+|++|.|+||+||.|+++.+
T Consensus 477 LiAT~SqDkt-aKiW~le~~--------------~l~~vL-sGH~R-Gvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSC 539 (775)
T KOG0319|consen 477 LIATGSQDKT-AKIWDLEQL--------------RLLGVL-SGHTR-GVWCVSFSKNDQLLATCSGDKTVKIWSISTFSC 539 (775)
T ss_pred eEEecccccc-eeeecccCc--------------eEEEEe-eCCcc-ceEEEEeccccceeEeccCCceEEEEEecccee
Confidence 9999999987 899999854 566677 67764 499999999999999999999999999999999
Q ss_pred ceeeccCCCCcc
Q 003310 415 SVNFQPTDANFT 426 (832)
Q Consensus 415 ~~~~~~H~~~~~ 426 (832)
.-+|.+|+....
T Consensus 540 lkT~eGH~~aVl 551 (775)
T KOG0319|consen 540 LKTFEGHTSAVL 551 (775)
T ss_pred eeeecCccceeE
Confidence 999999965443
No 45
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.65 E-value=3.7e-15 Score=167.96 Aligned_cols=233 Identities=15% Similarity=0.208 Sum_probs=172.5
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
+=+|+.=|.+.++|||.++..-+ .-+.-.+-|||..+|++.- .-+ +++. +|
T Consensus 26 Pw~la~LynG~V~IWnyetqtmV-ksfeV~~~PvRa~kfiaRk--------------nWi-v~Gs---------DD---- 76 (794)
T KOG0276|consen 26 PWILAALYNGDVQIWNYETQTMV-KSFEVSEVPVRAAKFIARK--------------NWI-VTGS---------DD---- 76 (794)
T ss_pred ceEEEeeecCeeEEEecccceee-eeeeecccchhhheeeecc--------------ceE-EEec---------CC----
Confidence 44566667777999999873222 2222347888988887521 123 2321 12
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEeCCEEEEEECCC-CceEEEEe
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQAAQVHCFDAAT-LEIEYAIL 172 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t-~~~~~tl~ 172 (832)
..||+|+..|++.|+++.- ++-|++|+..+ -+|..+.|-.|++||-.. ..|.+++.
T Consensus 77 -------------------~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~vLtsSDDm~iKlW~we~~wa~~qtfe 137 (794)
T KOG0276|consen 77 -------------------MQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPYVLTSSDDMTIKLWDWENEWACEQTFE 137 (794)
T ss_pred -------------------ceEEEEecccceeeEEeeccccceeeeeecCCCCeEEecCCccEEEEeeccCceeeeeEEc
Confidence 5699999999999999985 67899999974 345556667999999875 46778887
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
+|.-- .|. +|+-++
T Consensus 138 GH~Hy--------------VMq-----v~fnPk----------------------------------------------- 151 (794)
T KOG0276|consen 138 GHEHY--------------VMQ-----VAFNPK----------------------------------------------- 151 (794)
T ss_pred CcceE--------------EEE-----EEecCC-----------------------------------------------
Confidence 77420 121 122110
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS 332 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd 332 (832)
. +.+|+++.-|++|+||.+.+..+..+|++|...|+|+.|=+-
T Consensus 152 --------------------D-----------------~ntFaS~sLDrTVKVWslgs~~~nfTl~gHekGVN~Vdyy~~ 194 (794)
T KOG0276|consen 152 --------------------D-----------------PNTFASASLDRTVKVWSLGSPHPNFTLEGHEKGVNCVDYYTG 194 (794)
T ss_pred --------------------C-----------------ccceeeeeccccEEEEEcCCCCCceeeeccccCcceEEeccC
Confidence 0 124667778999999999999999999999999999999886
Q ss_pred C--CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 333 G--ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 333 G--~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
| -+|+||++|-+ |+|||..+. .++.+| .||++ .|..++|.|.=.++++||.|||++||.-.
T Consensus 195 gdkpylIsgaDD~t-iKvWDyQtk--------------~CV~TL-eGHt~-Nvs~v~fhp~lpiiisgsEDGTvriWhs~ 257 (794)
T KOG0276|consen 195 GDKPYLISGADDLT-IKVWDYQTK--------------SCVQTL-EGHTN-NVSFVFFHPELPIIISGSEDGTVRIWNSK 257 (794)
T ss_pred CCcceEEecCCCce-EEEeecchH--------------HHHHHh-hcccc-cceEEEecCCCcEEEEecCCccEEEecCc
Confidence 6 48999999965 999999886 344466 47664 59999999999999999999999999998
Q ss_pred CCCCceee
Q 003310 411 PLGGSVNF 418 (832)
Q Consensus 411 ~~g~~~~~ 418 (832)
++..+-++
T Consensus 258 Ty~lE~tL 265 (794)
T KOG0276|consen 258 TYKLEKTL 265 (794)
T ss_pred ceehhhhh
Confidence 88776544
No 46
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.65 E-value=2.7e-15 Score=169.13 Aligned_cols=189 Identities=13% Similarity=0.211 Sum_probs=150.8
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
+.|.||+..|...|+++.. .-||++..|= ++.+++ +.|.+|+||+..|++...++..|+.-.
T Consensus 35 G~V~IWnyetqtmVksfeV~~~PvRa~kfiaRknWiv~GsDD~~IrVfnynt~ekV~~FeAH~DyI-------------- 100 (794)
T KOG0276|consen 35 GDVQIWNYETQTMVKSFEVSEVPVRAAKFIARKNWIVTGSDDMQIRVFNYNTGEKVKTFEAHSDYI-------------- 100 (794)
T ss_pred CeeEEEecccceeeeeeeecccchhhheeeeccceEEEecCCceEEEEecccceeeEEeeccccce--------------
Confidence 6799999999999999997 5689988884 566777 456699999999999999999886410
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
|.||.. |+ ++
T Consensus 101 -----R~iavH-------------Pt-~P--------------------------------------------------- 110 (794)
T KOG0276|consen 101 -----RSIAVH-------------PT-LP--------------------------------------------------- 110 (794)
T ss_pred -----eeeeec-------------CC-CC---------------------------------------------------
Confidence 222221 21 00
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEe
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFK 350 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwd 350 (832)
.+.++++|-+|++||.+.. .+..+|.+|++.|.+++|+| |-..+|+||-|+| |+||.
T Consensus 111 --------------------~vLtsSDDm~iKlW~we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrT-VKVWs 169 (794)
T KOG0276|consen 111 --------------------YVLTSSDDMTIKLWDWENEWACEQTFEGHEHYVMQVAFNPKDPNTFASASLDRT-VKVWS 169 (794)
T ss_pred --------------------eEEecCCccEEEEeeccCceeeeeEEcCcceEEEEEEecCCCccceeeeecccc-EEEEE
Confidence 0123456778999999865 78889999999999999999 5679999999988 99999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc--CCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD--DSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp--Dg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
+... ...++|. ||.. -|++|+|=+ |--+|++|++|.|++|||..+..+..++.+|+.+..
T Consensus 170 lgs~--------------~~nfTl~-gHek-GVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQtk~CV~TLeGHt~Nvs 231 (794)
T KOG0276|consen 170 LGSP--------------HPNFTLE-GHEK-GVNCVDYYTGGDKPYLISGADDLTIKVWDYQTKSCVQTLEGHTNNVS 231 (794)
T ss_pred cCCC--------------CCceeee-cccc-CcceEEeccCCCcceEEecCCCceEEEeecchHHHHHHhhcccccce
Confidence 8654 3456775 6654 399999976 446999999999999999999999999999998654
No 47
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.65 E-value=2.7e-14 Score=148.27 Aligned_cols=239 Identities=16% Similarity=0.160 Sum_probs=158.5
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..++.++-+....||--.+ |+..-.+..|.|.|-++.+--. ++ .+++++
T Consensus 23 DLlFscaKD~~~~vw~s~n-GerlGty~GHtGavW~~Did~~-------------s~---~liTGS-------------- 71 (327)
T KOG0643|consen 23 DLLFSCAKDSTPTVWYSLN-GERLGTYDGHTGAVWCCDIDWD-------------SK---HLITGS-------------- 71 (327)
T ss_pred cEEEEecCCCCceEEEecC-CceeeeecCCCceEEEEEecCC-------------cc---eeeecc--------------
Confidence 4566677777889998754 5555566689999999987421 00 123321
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEeC------CEEEEEECCCCc---
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQA------AQVHCFDAATLE--- 166 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~~------~~I~vwDl~t~~--- 166 (832)
.+.+++|||+++|+++.+++++++|..+.|+ +++++++.+ ..|.+||++...
T Consensus 72 -----------------AD~t~kLWDv~tGk~la~~k~~~~Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~ 134 (327)
T KOG0643|consen 72 -----------------ADQTAKLWDVETGKQLATWKTNSPVKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDI 134 (327)
T ss_pred -----------------ccceeEEEEcCCCcEEEEeecCCeeEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChhhh
Confidence 2378999999999999999999999999997 466666544 368899987432
Q ss_pred ----eEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecc
Q 003310 167 ----IEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKE 242 (832)
Q Consensus 167 ----~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ 242 (832)
+...+.++ +
T Consensus 135 ~s~ep~~kI~t~-------------------------------------------~------------------------ 147 (327)
T KOG0643|consen 135 DSEEPYLKIPTP-------------------------------------------D------------------------ 147 (327)
T ss_pred cccCceEEecCC-------------------------------------------c------------------------
Confidence 11111110 0
Q ss_pred cccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCC
Q 003310 243 SSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHK 321 (832)
Q Consensus 243 ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~ 321 (832)
+|...+++..+|+ .++.+..+|.|.+||+.++ +.+..-+.|.
T Consensus 148 -skit~a~Wg~l~~------------------------------------~ii~Ghe~G~is~~da~~g~~~v~s~~~h~ 190 (327)
T KOG0643|consen 148 -SKITSALWGPLGE------------------------------------TIIAGHEDGSISIYDARTGKELVDSDEEHS 190 (327)
T ss_pred -cceeeeeecccCC------------------------------------EEEEecCCCcEEEEEcccCceeeechhhhc
Confidence 0000111111111 2345678999999999997 5566668999
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCC---------CCCCCcc---------CCCCceeE-------------
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGI---------LGTSSAC---------DAGTSYVH------------- 370 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~---------~~~~s~~---------~~~~~~~~------------- 370 (832)
..|+.|+||||.++++|+|.|-+ -++||..+-. +-+..+. ++......
T Consensus 191 ~~Ind~q~s~d~T~FiT~s~Dtt-akl~D~~tl~v~Kty~te~PvN~aaisP~~d~VilgGGqeA~dVTTT~~r~GKFEA 269 (327)
T KOG0643|consen 191 SKINDLQFSRDRTYFITGSKDTT-AKLVDVRTLEVLKTYTTERPVNTAAISPLLDHVILGGGQEAMDVTTTSTRAGKFEA 269 (327)
T ss_pred cccccccccCCcceEEecccCcc-ceeeeccceeeEEEeeecccccceecccccceEEecCCceeeeeeeecccccchhh
Confidence 99999999999999999999976 7999987641 1110000 00000000
Q ss_pred ----------EEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 371 ----------LYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 371 ----------l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+-++ .|| -.+|++|||+|||+-.++|+.||.|+|.....
T Consensus 270 rFyh~i~eEEigrv-kGH-FGPINsvAfhPdGksYsSGGEDG~VR~h~Fd~ 318 (327)
T KOG0643|consen 270 RFYHLIFEEEIGRV-KGH-FGPINSVAFHPDGKSYSSGGEDGYVRLHHFDS 318 (327)
T ss_pred hHHHHHHHHHhccc-ccc-ccCcceeEECCCCcccccCCCCceEEEEEecc
Confidence 0011 233 24699999999999999999999998875543
No 48
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.65 E-value=1.4e-14 Score=152.45 Aligned_cols=239 Identities=17% Similarity=0.278 Sum_probs=171.0
Q ss_pred Ceeeecccc------ccccCC---------CCCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccc
Q 003310 1 MVLWAGFDK------LESEAG---------ATRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKR 65 (832)
Q Consensus 1 ~v~w~~fd~------l~~~~~---------~~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~ 65 (832)
|++|.-|+. +.++.+ ....+|.+|.+..+.+||+++ +.+..-...|.+-|..+. |
T Consensus 71 I~LWnv~gdceN~~~lkgHsgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~t-G~~~rk~k~h~~~vNs~~----p----- 140 (338)
T KOG0265|consen 71 IVLWNVYGDCENFWVLKGHSGAVMELHGMRDGSHILSCGTDKTVRGWDAET-GKRIRKHKGHTSFVNSLD----P----- 140 (338)
T ss_pred EEEEeccccccceeeeccccceeEeeeeccCCCEEEEecCCceEEEEeccc-ceeeehhccccceeeecC----c-----
Confidence 478875544 444444 237888899999999999997 434333333444444443 2
Q ss_pred cCCcccccCCEEEEEeCCCCccCccccCCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEc
Q 003310 66 SRDKFAEVRPLLVFCADGSRSCGTKVQDGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS 145 (832)
Q Consensus 66 ~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S 145 (832)
..|.+..||++. .++|+||||+++.++++++.-+-++.+|.|+
T Consensus 141 ------~rrg~~lv~Sgs-------------------------------dD~t~kl~D~R~k~~~~t~~~kyqltAv~f~ 183 (338)
T KOG0265|consen 141 ------SRRGPQLVCSGS-------------------------------DDGTLKLWDIRKKEAIKTFENKYQLTAVGFK 183 (338)
T ss_pred ------cccCCeEEEecC-------------------------------CCceEEEEeecccchhhccccceeEEEEEec
Confidence 123344456642 1389999999999999999888899999996
Q ss_pred ---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCccccc
Q 003310 146 ---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQ 222 (832)
Q Consensus 146 ---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~ 222 (832)
.+++....++.|++||++..+.++++.+|..+.. -+++++
T Consensus 184 d~s~qv~sggIdn~ikvWd~r~~d~~~~lsGh~DtIt------------~lsls~------------------------- 226 (338)
T KOG0265|consen 184 DTSDQVISGGIDNDIKVWDLRKNDGLYTLSGHADTIT------------GLSLSR------------------------- 226 (338)
T ss_pred ccccceeeccccCceeeeccccCcceEEeecccCcee------------eEEecc-------------------------
Confidence 4677778899999999999999999998854210 011100
Q ss_pred ccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCe
Q 003310 223 SRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGM 302 (832)
Q Consensus 223 s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~ 302 (832)
+|+.+ .+-.-|.+
T Consensus 227 ---------~gs~l----------------------------------------------------------lsnsMd~t 239 (338)
T KOG0265|consen 227 ---------YGSFL----------------------------------------------------------LSNSMDNT 239 (338)
T ss_pred ---------CCCcc----------------------------------------------------------ccccccce
Confidence 11100 01124568
Q ss_pred EEEEECCC----CcEEEEeccCCCC----eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 303 VIVRDIVS----KNVIAQFRAHKSP----ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 303 V~IwDl~s----~~~l~~~~aH~~p----Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
|++||++- .+++..|.+|... ....+|||+++.+..+|.|.. +.+||.... ..+|.|
T Consensus 240 vrvwd~rp~~p~~R~v~if~g~~hnfeknlL~cswsp~~~~i~ags~dr~-vyvwd~~~r--------------~~lykl 304 (338)
T KOG0265|consen 240 VRVWDVRPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPNGTKITAGSADRF-VYVWDTTSR--------------RILYKL 304 (338)
T ss_pred EEEEEecccCCCCceEEEeecchhhhhhhcceeeccCCCCccccccccce-EEEeecccc--------------cEEEEc
Confidence 99999874 4568889887643 345789999999999999965 899998653 678888
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEE
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLF 407 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIw 407 (832)
-|+. ..|.+++|.|.-.+|.++++|.||.+=
T Consensus 305 -pGh~-gsvn~~~Fhp~e~iils~~sdk~i~lg 335 (338)
T KOG0265|consen 305 -PGHY-GSVNEVDFHPTEPIILSCSSDKTIYLG 335 (338)
T ss_pred -CCcc-eeEEEeeecCCCcEEEEeccCceeEee
Confidence 4664 359999999999999999999999863
No 49
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.64 E-value=8.1e-15 Score=168.12 Aligned_cols=226 Identities=16% Similarity=0.276 Sum_probs=175.5
Q ss_pred CCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 17 TRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 17 ~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
-.+.+++|+.+| +||||+..+ ...|.+..|+|.+..++++|+.. .| ++++
T Consensus 423 gd~~Iv~G~k~Gel~vfdlaS~-~l~Eti~AHdgaIWsi~~~pD~~-------g~---------vT~s------------ 473 (888)
T KOG0306|consen 423 GDRYIVLGTKNGELQVFDLASA-SLVETIRAHDGAIWSISLSPDNK-------GF---------VTGS------------ 473 (888)
T ss_pred CCceEEEeccCCceEEEEeehh-hhhhhhhccccceeeeeecCCCC-------ce---------EEec------------
Confidence 367788888888 999999974 56677778999999999988531 11 2221
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCC-----Cc--------EEEEEeCCCCEEEEEEc--CCEEEEE-eCCEEEE
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRS-----QS--------YVHMLKFRSPIYSVRCS--SRVVAIC-QAAQVHC 159 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~T-----g~--------~V~tL~f~s~V~sV~~S--~r~LAVa-~~~~I~v 159 (832)
.++||+|||.+- |. .-++|+++..|.+|++| +++|||+ ++.+++|
T Consensus 474 -------------------aDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkV 534 (888)
T KOG0306|consen 474 -------------------ADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKV 534 (888)
T ss_pred -------------------CCcEEEEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEE
Confidence 238999998741 11 12567889999999998 6899995 7999999
Q ss_pred EECCCCceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeee
Q 003310 160 FDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHY 239 (832)
Q Consensus 160 wDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~ 239 (832)
|=+.|++...+|-+|..|.- .|.+ +|. +
T Consensus 535 yflDtlKFflsLYGHkLPV~------------smDI-----S~D-----------------------------S------ 562 (888)
T KOG0306|consen 535 YFLDTLKFFLSLYGHKLPVL------------SMDI-----SPD-----------------------------S------ 562 (888)
T ss_pred EEecceeeeeeeccccccee------------EEec-----cCC-----------------------------c------
Confidence 99999998888888754321 1211 110 0
Q ss_pred ecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEecc
Q 003310 240 AKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRA 319 (832)
Q Consensus 240 A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~a 319 (832)
| .+++++.|..|+||-+.=|.|-..|-|
T Consensus 563 -----k-----------------------------------------------livTgSADKnVKiWGLdFGDCHKS~fA 590 (888)
T KOG0306|consen 563 -----K-----------------------------------------------LIVTGSADKNVKIWGLDFGDCHKSFFA 590 (888)
T ss_pred -----C-----------------------------------------------eEEeccCCCceEEeccccchhhhhhhc
Confidence 0 123456678999999999999999999
Q ss_pred CCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe
Q 003310 320 HKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS 399 (832)
Q Consensus 320 H~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS 399 (832)
|...|.++.|=|+..++.||+.||. |+=||-... .++.+|. ||+ ..|++++-+|+|.+++++|
T Consensus 591 HdDSvm~V~F~P~~~~FFt~gKD~k-vKqWDg~kF--------------e~iq~L~-~H~-~ev~cLav~~~G~~vvs~s 653 (888)
T KOG0306|consen 591 HDDSVMSVQFLPKTHLFFTCGKDGK-VKQWDGEKF--------------EEIQKLD-GHH-SEVWCLAVSPNGSFVVSSS 653 (888)
T ss_pred ccCceeEEEEcccceeEEEecCcce-EEeechhhh--------------hhheeec-cch-heeeeeEEcCCCCeEEecc
Confidence 9999999999999999999999987 999997764 5666664 554 4799999999999999999
Q ss_pred CCCcEEEEecCC
Q 003310 400 SRGTSHLFAINP 411 (832)
Q Consensus 400 ~DgTVhIwdl~~ 411 (832)
.|.+|++|.-..
T Consensus 654 hD~sIRlwE~td 665 (888)
T KOG0306|consen 654 HDKSIRLWERTD 665 (888)
T ss_pred CCceeEeeeccC
Confidence 999999998754
No 50
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.63 E-value=1.1e-15 Score=168.75 Aligned_cols=243 Identities=12% Similarity=0.171 Sum_probs=174.2
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..+|..|.++-++||++-+.+.+...+..|..+|+.+.+.+.+. . +| +.+
T Consensus 228 hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s~~g~-------~------fL---S~s-------------- 277 (503)
T KOG0282|consen 228 HLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFNNCGT-------S------FL---SAS-------------- 277 (503)
T ss_pred eEEEecCCCceEEEEEEecCcceehhhhcchhhhhhhhccccCC-------e------ee---eee--------------
Confidence 45555566777999999887888889999999999998875321 1 12 111
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCceEEEEec
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~~~tl~t 173 (832)
.+++|++||++||+++..+.....+++|.|.+ +++.+ .++++|..||+++++..+....
T Consensus 278 -----------------fD~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~kvvqeYd~ 340 (503)
T KOG0282|consen 278 -----------------FDRFLKLWDTETGQVLSRFHLDKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGKVVQEYDR 340 (503)
T ss_pred -----------------cceeeeeeccccceEEEEEecCCCceeeecCCCCCcEEEEecCCCcEEEEeccchHHHHHHHh
Confidence 34889999999999999999999999999963 55444 7899999999999986554433
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
|- +++- -|.|-
T Consensus 341 hL---------------g~i~----~i~F~-------------------------------------------------- 351 (503)
T KOG0282|consen 341 HL---------------GAIL----DITFV-------------------------------------------------- 351 (503)
T ss_pred hh---------------hhee----eeEEc--------------------------------------------------
Confidence 31 0000 00010
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCC
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPS 332 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPd 332 (832)
+. +..|+++.+++.|+||+.....++..+. .+.....||+..|+
T Consensus 352 -----------------~~------------------g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~ 396 (503)
T KOG0282|consen 352 -----------------DE------------------GRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPN 396 (503)
T ss_pred -----------------cC------------------CceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCCC
Confidence 00 0124556778899999999877665443 33345678999999
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc-EEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
|..++.=|.|.. |-||.+.+... ...++..+|+..+- -..|.|||||++|++|+.||.+.+||.++
T Consensus 397 ~~~~~aQs~dN~-i~ifs~~~~~r------------~nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsdG~v~~wdwkt 463 (503)
T KOG0282|consen 397 GKWFAAQSMDNY-IAIFSTVPPFR------------LNKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSDGKVNFWDWKT 463 (503)
T ss_pred CCeehhhccCce-EEEEecccccc------------cCHhhhhcceeccCceeeEEEcCCCCeEEeecCCccEEEeechh
Confidence 999999999865 88998766411 11122224543332 45689999999999999999999999999
Q ss_pred CCCceeeccCCCC
Q 003310 412 LGGSVNFQPTDAN 424 (832)
Q Consensus 412 ~g~~~~~~~H~~~ 424 (832)
.+-.-.+++|+..
T Consensus 464 ~kl~~~lkah~~~ 476 (503)
T KOG0282|consen 464 TKLVSKLKAHDQP 476 (503)
T ss_pred hhhhhccccCCcc
Confidence 8888888998543
No 51
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.63 E-value=1.2e-14 Score=153.44 Aligned_cols=219 Identities=19% Similarity=0.284 Sum_probs=143.5
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
.+++..|.++.++++|++. .....+..|+++|+++...+.- . .++.++
T Consensus 66 ~~~~~G~~dg~vr~~Dln~--~~~~~igth~~~i~ci~~~~~~--------------~--~vIsgs-------------- 113 (323)
T KOG1036|consen 66 STIVTGGLDGQVRRYDLNT--GNEDQIGTHDEGIRCIEYSYEV--------------G--CVISGS-------------- 113 (323)
T ss_pred ceEEEeccCceEEEEEecC--CcceeeccCCCceEEEEeeccC--------------C--eEEEcc--------------
Confidence 4556666666667777765 2334555577777777665420 0 123333
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEEEecCCC
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYAILTNPI 176 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~ 176 (832)
|+++|+|||.+...++-++.-..+|+++..++++|+| +.+.+|.+||++++...++..+++.
T Consensus 114 -----------------WD~~ik~wD~R~~~~~~~~d~~kkVy~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~l 176 (323)
T KOG1036|consen 114 -----------------WDKTIKFWDPRNKVVVGTFDQGKKVYCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSL 176 (323)
T ss_pred -----------------cCccEEEEeccccccccccccCceEEEEeccCCEEEEeecCceEEEEEcccccchhhhccccc
Confidence 5589999999997777777777899999999988888 7889999999999987777666543
Q ss_pred ccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccC
Q 003310 177 VMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGD 256 (832)
Q Consensus 177 ~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd 256 (832)
+. .+-.+++-|..=+|+ ..+..||++.+.+.+++-.
T Consensus 177 ky----------qtR~v~~~pn~eGy~----~sSieGRVavE~~d~s~~~------------------------------ 212 (323)
T KOG1036|consen 177 KY----------QTRCVALVPNGEGYV----VSSIEGRVAVEYFDDSEEA------------------------------ 212 (323)
T ss_pred ee----------EEEEEEEecCCCceE----EEeecceEEEEccCCchHH------------------------------
Confidence 21 111222222111121 1234455444333321000
Q ss_pred ccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCC---------CCeEEE
Q 003310 257 LGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHK---------SPISAL 327 (832)
Q Consensus 257 ~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~---------~pIs~L 327 (832)
.+..-.|++|. .||.+|
T Consensus 213 ------------------------------------------------------~skkyaFkCHr~~~~~~~~~yPVNai 238 (323)
T KOG1036|consen 213 ------------------------------------------------------QSKKYAFKCHRLSEKDTEIIYPVNAI 238 (323)
T ss_pred ------------------------------------------------------hhhceeEEeeecccCCceEEEEecee
Confidence 01112233332 489999
Q ss_pred EEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC
Q 003310 328 CFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS 400 (832)
Q Consensus 328 aFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~ 400 (832)
+|+|--..||||+.||. |.+||+.+. ++++.|.+-. ..|..++|+.||..||++++
T Consensus 239 ~Fhp~~~tfaTgGsDG~-V~~Wd~~~r--------------Krl~q~~~~~--~SI~slsfs~dG~~LAia~s 294 (323)
T KOG1036|consen 239 AFHPIHGTFATGGSDGI-VNIWDLFNR--------------KRLKQLAKYE--TSISSLSFSMDGSLLAIASS 294 (323)
T ss_pred EeccccceEEecCCCce-EEEccCcch--------------hhhhhccCCC--CceEEEEeccCCCeEEEEec
Confidence 99999888999999996 899999875 6677776532 24999999999999999975
No 52
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.62 E-value=2.2e-14 Score=146.66 Aligned_cols=232 Identities=15% Similarity=0.212 Sum_probs=167.9
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
+-+|.+|.+..+++|+.-. +.+....+.|--.|..++...+. .. ++-|++
T Consensus 30 nY~ltcGsdrtvrLWNp~r-g~liktYsghG~EVlD~~~s~Dn-------sk-------f~s~Gg--------------- 79 (307)
T KOG0316|consen 30 NYCLTCGSDRTVRLWNPLR-GALIKTYSGHGHEVLDAALSSDN-------SK-------FASCGG--------------- 79 (307)
T ss_pred CEEEEcCCCceEEeecccc-cceeeeecCCCceeeeccccccc-------cc-------cccCCC---------------
Confidence 6789999999999999875 67888888888888877665321 01 222221
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEE-EeCCEEEEEECCC--CceEEEE
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAI-CQAAQVHCFDAAT--LEIEYAI 171 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t--~~~~~tl 171 (832)
++.|.+||+.||+.++.++- ..+|.+|+||. .+|+. +.|..|++||-+. .+.++.+
T Consensus 80 ------------------Dk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~s~ePiQil 141 (307)
T KOG0316|consen 80 ------------------DKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSRSFEPIQIL 141 (307)
T ss_pred ------------------CceEEEEEcccCeeeeecccccceeeEEEecCcceEEEeccccceeEEEEcccCCCCccchh
Confidence 27799999999999999875 57999999984 55665 6899999999875 3444444
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
.+... +.+++ +.+++
T Consensus 142 dea~D--------------~V~Si----------------------------------------------~v~~h----- 156 (307)
T KOG0316|consen 142 DEAKD--------------GVSSI----------------------------------------------DVAEH----- 156 (307)
T ss_pred hhhcC--------------ceeEE----------------------------------------------Eeccc-----
Confidence 33210 00000 00000
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP 331 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP 331 (832)
.++.+..||+++.||++.++.....-+| ||+|++||+
T Consensus 157 -----------------------------------------eIvaGS~DGtvRtydiR~G~l~sDy~g~--pit~vs~s~ 193 (307)
T KOG0316|consen 157 -----------------------------------------EIVAGSVDGTVRTYDIRKGTLSSDYFGH--PITSVSFSK 193 (307)
T ss_pred -----------------------------------------EEEeeccCCcEEEEEeecceeehhhcCC--cceeEEecC
Confidence 1234567899999999999887766665 999999999
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc-EEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 332 SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
||..++.++-|++ +|+.|-.++ +.| .-..|+.+.. =.+++|+....++++||.||.|.+||+.
T Consensus 194 d~nc~La~~l~st-lrLlDk~tG--------------klL-~sYkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLv 257 (307)
T KOG0316|consen 194 DGNCSLASSLDST-LRLLDKETG--------------KLL-KSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLV 257 (307)
T ss_pred CCCEEEEeeccce-eeecccchh--------------HHH-HHhcccccceeeeeeeecccceeEEeccCCceEEEEEec
Confidence 9999999999987 899998887 222 2234554433 4578899888999999999999999998
Q ss_pred CCCCceeeccC
Q 003310 411 PLGGSVNFQPT 421 (832)
Q Consensus 411 ~~g~~~~~~~H 421 (832)
.......+..|
T Consensus 258 d~~~~sk~~~~ 268 (307)
T KOG0316|consen 258 DETQISKLSVV 268 (307)
T ss_pred cceeeeeeccC
Confidence 87666666554
No 53
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.62 E-value=2e-15 Score=160.66 Aligned_cols=227 Identities=17% Similarity=0.276 Sum_probs=160.7
Q ss_pred CcEEEEEec-CCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYR-SGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~-~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
++|++.|.+ ..++|||+++ ++...++-.|-..|-.+++.. .+++.|+.
T Consensus 247 ~rviisGSSDsTvrvWDv~t-ge~l~tlihHceaVLhlrf~n----------------g~mvtcSk-------------- 295 (499)
T KOG0281|consen 247 ERVIVSGSSDSTVRVWDVNT-GEPLNTLIHHCEAVLHLRFSN----------------GYMVTCSK-------------- 295 (499)
T ss_pred ceEEEecCCCceEEEEeccC-CchhhHHhhhcceeEEEEEeC----------------CEEEEecC--------------
Confidence 556666654 4589999997 556555555777777777653 13443432
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEE---EEE-eCCCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYV---HML-KFRSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V---~tL-~f~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~tl 171 (832)
+.++.+||+.+..-+ +.| .+...|..|.|+.++++. +.|.+|++|+..|+++++++
T Consensus 296 -------------------DrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~kyIVsASgDRTikvW~~st~efvRtl 356 (499)
T KOG0281|consen 296 -------------------DRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIVSASGDRTIKVWSTSTCEFVRTL 356 (499)
T ss_pred -------------------CceeEEEeccCchHHHHHHHHhhhhhheeeeccccceEEEecCCceEEEEeccceeeehhh
Confidence 278999999876522 222 247789999999888877 56889999999999999999
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
.+|.. | +| .|.|- |..
T Consensus 357 ~gHkR--G-------------IA----ClQYr-----------------------------~rl---------------- 372 (499)
T KOG0281|consen 357 NGHKR--G-------------IA----CLQYR-----------------------------DRL---------------- 372 (499)
T ss_pred hcccc--c-------------ce----ehhcc-----------------------------CeE----------------
Confidence 88732 1 11 11121 111
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP 331 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP 331 (832)
++++..|.+|+|||+..|+++..+++|..-|.|+.|+
T Consensus 373 ------------------------------------------vVSGSSDntIRlwdi~~G~cLRvLeGHEeLvRciRFd- 409 (499)
T KOG0281|consen 373 ------------------------------------------VVSGSSDNTIRLWDIECGACLRVLEGHEELVRCIRFD- 409 (499)
T ss_pred ------------------------------------------EEecCCCceEEEEeccccHHHHHHhchHHhhhheeec-
Confidence 1234567899999999999999999999999999994
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 332 SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.+.+++|..||+ |+|||+..+.... ..+ ...++..+-+ +..+|..+.| |...++++|.|.||-|||.-.
T Consensus 410 -~krIVSGaYDGk-ikvWdl~aaldpr-a~~----~~~Cl~~lv~--hsgRVFrLQF--D~fqIvsssHddtILiWdFl~ 478 (499)
T KOG0281|consen 410 -NKRIVSGAYDGK-IKVWDLQAALDPR-APA----STLCLRTLVE--HSGRVFRLQF--DEFQIISSSHDDTILIWDFLN 478 (499)
T ss_pred -Cceeeeccccce-EEEEecccccCCc-ccc----cchHHHhhhh--ccceeEEEee--cceEEEeccCCCeEEEEEcCC
Confidence 588999999998 9999998762111 000 1123334433 2346888888 556889999999999999865
Q ss_pred C
Q 003310 412 L 412 (832)
Q Consensus 412 ~ 412 (832)
+
T Consensus 479 ~ 479 (499)
T KOG0281|consen 479 G 479 (499)
T ss_pred C
Confidence 3
No 54
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.62 E-value=1.1e-13 Score=158.99 Aligned_cols=250 Identities=17% Similarity=0.249 Sum_probs=169.8
Q ss_pred CCCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccC
Q 003310 15 GATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQD 93 (832)
Q Consensus 15 ~~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~D 93 (832)
++..++++.|.+.| ++|||... +-|.-.++.|...|..+++.-.. -+.++..
T Consensus 359 SpDgq~iaTG~eDgKVKvWn~~S-gfC~vTFteHts~Vt~v~f~~~g---------------~~llssS----------- 411 (893)
T KOG0291|consen 359 SPDGQLIATGAEDGKVKVWNTQS-GFCFVTFTEHTSGVTAVQFTARG---------------NVLLSSS----------- 411 (893)
T ss_pred CCCCcEEEeccCCCcEEEEeccC-ceEEEEeccCCCceEEEEEEecC---------------CEEEEee-----------
Confidence 35577888877666 99999986 67888899999999999986432 1222221
Q ss_pred CcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEE--EEE--cCCEEEEE-eCC-EEEEEECCCCce
Q 003310 94 GLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYS--VRC--SSRVVAIC-QAA-QVHCFDAATLEI 167 (832)
Q Consensus 94 g~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~s--V~~--S~r~LAVa-~~~-~I~vwDl~t~~~ 167 (832)
.++|||.||++...+.+++..+.++.- |+. ++.+|+++ ++. .|++|+..||+.
T Consensus 412 ---------------------LDGtVRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGql 470 (893)
T KOG0291|consen 412 ---------------------LDGTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQL 470 (893)
T ss_pred ---------------------cCCeEEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCee
Confidence 238999999999999999999887753 333 46666664 333 799999999999
Q ss_pred EEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccce
Q 003310 168 EYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL 247 (832)
Q Consensus 168 ~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l 247 (832)
+-.|.+|.. |+. -|++.. .|+
T Consensus 471 lDiLsGHEg---------------PVs----~l~f~~---------------------------~~~------------- 491 (893)
T KOG0291|consen 471 LDILSGHEG---------------PVS----GLSFSP---------------------------DGS------------- 491 (893)
T ss_pred eehhcCCCC---------------cce----eeEEcc---------------------------ccC-------------
Confidence 999988843 221 122220 111
Q ss_pred eceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEE
Q 003310 248 AAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISA 326 (832)
Q Consensus 248 asGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~ 326 (832)
+++++.-|.+|++||+-.. ..+.++ .+.+.+..
T Consensus 492 ---------------------------------------------~LaS~SWDkTVRiW~if~s~~~vEtl-~i~sdvl~ 525 (893)
T KOG0291|consen 492 ---------------------------------------------LLASGSWDKTVRIWDIFSSSGTVETL-EIRSDVLA 525 (893)
T ss_pred ---------------------------------------------eEEeccccceEEEEEeeccCceeeeE-eeccceeE
Confidence 1234456789999998654 344444 45678999
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCC--CceeE-E--EEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAG--TSYVH-L--YRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~--~~~~~-l--~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
++|+|||+-||.+..||. |.+||+..+..-. +-++.. ...+. . .+-+.......+..|++|+||+.|.+|+..
T Consensus 526 vsfrPdG~elaVaTldgq-Itf~d~~~~~q~~-~IdgrkD~~~gR~~~D~~ta~~sa~~K~Ftti~ySaDG~~IlAgG~s 603 (893)
T KOG0291|consen 526 VSFRPDGKELAVATLDGQ-ITFFDIKEAVQVG-SIDGRKDLSGGRKETDRITAENSAKGKTFTTICYSADGKCILAGGES 603 (893)
T ss_pred EEEcCCCCeEEEEEecce-EEEEEhhhceeec-cccchhhccccccccceeehhhcccCCceEEEEEcCCCCEEEecCCc
Confidence 999999999999999997 9999998762210 000000 00010 0 000000001128899999999999999999
Q ss_pred CcEEEEecCCCCCceeec
Q 003310 402 GTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 402 gTVhIwdl~~~g~~~~~~ 419 (832)
..|.||++...--...|+
T Consensus 604 n~iCiY~v~~~vllkkfq 621 (893)
T KOG0291|consen 604 NSICIYDVPEGVLLKKFQ 621 (893)
T ss_pred ccEEEEECchhheeeeEE
Confidence 999999997754444554
No 55
>PTZ00420 coronin; Provisional
Probab=99.61 E-value=4.9e-13 Score=156.77 Aligned_cols=103 Identities=7% Similarity=0.064 Sum_probs=75.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEE-----EEEcCCCCEEEEEEcCC---CEEEEEeCCCCCCCCCCccCCCCce
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISA-----LCFDPSGILLVTASVQG---HNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~-----LaFSPdG~lLATaS~dG---t~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
+..++.|+|||+.+++++.++.+|.+.+.+ ..|++++.+|+|++.++ +.|+|||++... ..
T Consensus 185 ~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~-----------~p 253 (568)
T PTZ00420 185 TCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTT-----------SA 253 (568)
T ss_pred EecCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCC-----------Cc
Confidence 446789999999999999999999876543 34679999999988775 249999998530 11
Q ss_pred eEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 369 VHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.....+.. ....+...-+.++|.++++|+.|++|++|++...
T Consensus 254 l~~~~ld~--~~~~L~p~~D~~tg~l~lsGkGD~tIr~~e~~~~ 295 (568)
T PTZ00420 254 LVTMSIDN--ASAPLIPHYDESTGLIYLIGKGDGNCRYYQHSLG 295 (568)
T ss_pred eEEEEecC--CccceEEeeeCCCCCEEEEEECCCeEEEEEccCC
Confidence 22223321 1222444445677999999999999999999753
No 56
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.61 E-value=4.6e-14 Score=146.21 Aligned_cols=104 Identities=17% Similarity=0.228 Sum_probs=86.2
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
.-|+|.|....+.+++..+++|....-||+|+|+|++||+||.|-- +-+||+..- .++..+.|-
T Consensus 167 GlG~v~ILsypsLkpv~si~AH~snCicI~f~p~GryfA~GsADAl-vSLWD~~EL--------------iC~R~isRl- 230 (313)
T KOG1407|consen 167 GLGCVEILSYPSLKPVQSIKAHPSNCICIEFDPDGRYFATGSADAL-VSLWDVDEL--------------ICERCISRL- 230 (313)
T ss_pred CCceEEEEeccccccccccccCCcceEEEEECCCCceEeeccccce-eeccChhHh--------------hhheeeccc-
Confidence 4579999999999999999999999999999999999999999964 899999864 333333332
Q ss_pred ccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 379 TNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 379 t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
...|..|+||.||++||++|.|.-|-|=.++++.....++
T Consensus 231 -dwpVRTlSFS~dg~~lASaSEDh~IDIA~vetGd~~~eI~ 270 (313)
T KOG1407|consen 231 -DWPVRTLSFSHDGRMLASASEDHFIDIAEVETGDRVWEIP 270 (313)
T ss_pred -cCceEEEEeccCcceeeccCccceEEeEecccCCeEEEee
Confidence 2359999999999999999999999998887765544443
No 57
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.60 E-value=4.5e-14 Score=156.20 Aligned_cols=235 Identities=19% Similarity=0.292 Sum_probs=161.6
Q ss_pred eeccccccccCC--CCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEE
Q 003310 4 WAGFDKLESEAG--ATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFC 80 (832)
Q Consensus 4 w~~fd~l~~~~~--~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv 80 (832)
.++|++--.+.. ...++|++|-..| +||||..+ -.....+..|..||+.++|.|... -+++.+
T Consensus 64 ~srFk~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~-r~iLR~~~ah~apv~~~~f~~~d~-------------t~l~s~ 129 (487)
T KOG0310|consen 64 FSRFKDVVYSVDFRSDGRLLAAGDESGHVKVFDMKS-RVILRQLYAHQAPVHVTKFSPQDN-------------TMLVSG 129 (487)
T ss_pred HHhhccceeEEEeecCCeEEEccCCcCcEEEecccc-HHHHHHHhhccCceeEEEecccCC-------------eEEEec
Confidence 355666444333 2267888888777 89999554 334455567999999999987421 123333
Q ss_pred eCCCCccCccccCCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEcC---CEEEE-EeCC
Q 003310 81 ADGSRSCGTKVQDGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS---RVVAI-CQAA 155 (832)
Q Consensus 81 ~~g~~~g~~~~~Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~---r~LAV-a~~~ 155 (832)
+| ++++++||+.+......|. +.+-|++.+|++ .+++. +.|+
T Consensus 130 sD---------------------------------d~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg 176 (487)
T KOG0310|consen 130 SD---------------------------------DKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDG 176 (487)
T ss_pred CC---------------------------------CceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCc
Confidence 32 1789999999888644554 467899999964 56666 6899
Q ss_pred EEEEEECCCC-ceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCc
Q 003310 156 QVHCFDAATL-EIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGS 234 (832)
Q Consensus 156 ~I~vwDl~t~-~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~ 234 (832)
+|++||+++. ..+.++. | ++| +. +-||-+ +|+
T Consensus 177 ~vrl~DtR~~~~~v~eln-h----g~p-----------Ve---~vl~lp----------------------------sgs 209 (487)
T KOG0310|consen 177 KVRLWDTRSLTSRVVELN-H----GCP-----------VE---SVLALP----------------------------SGS 209 (487)
T ss_pred eEEEEEeccCCceeEEec-C----CCc-----------ee---eEEEcC----------------------------CCC
Confidence 9999999987 4555552 2 121 10 011111 111
Q ss_pred ceeeeecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC-CcE
Q 003310 235 RVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS-KNV 313 (832)
Q Consensus 235 ~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s-~~~ 313 (832)
.+ +++ ....|+|||+.+ ++.
T Consensus 210 ~i----------------------------------------------------------asA-gGn~vkVWDl~~G~ql 230 (487)
T KOG0310|consen 210 LI----------------------------------------------------------ASA-GGNSVKVWDLTTGGQL 230 (487)
T ss_pred EE----------------------------------------------------------EEc-CCCeEEEEEecCCcee
Confidence 11 111 112699999995 556
Q ss_pred EEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCC
Q 003310 314 IAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN 393 (832)
Q Consensus 314 l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~ 393 (832)
+..+..|...|+||+|..+++.|++|+-||+ ++|||+... ..++.+. + ++.|.+|+.|||++
T Consensus 231 l~~~~~H~KtVTcL~l~s~~~rLlS~sLD~~-VKVfd~t~~--------------Kvv~s~~--~-~~pvLsiavs~dd~ 292 (487)
T KOG0310|consen 231 LTSMFNHNKTVTCLRLASDSTRLLSGSLDRH-VKVFDTTNY--------------KVVHSWK--Y-PGPVLSIAVSPDDQ 292 (487)
T ss_pred hhhhhcccceEEEEEeecCCceEeecccccc-eEEEEccce--------------EEEEeee--c-ccceeeEEecCCCc
Confidence 6666669999999999999999999999998 899997653 3333432 2 45699999999999
Q ss_pred EEEEEeCCCcEEEEec
Q 003310 394 WIMISSSRGTSHLFAI 409 (832)
Q Consensus 394 ~LAsgS~DgTVhIwdl 409 (832)
.+++|-.+|.+-+-+.
T Consensus 293 t~viGmsnGlv~~rr~ 308 (487)
T KOG0310|consen 293 TVVIGMSNGLVSIRRR 308 (487)
T ss_pred eEEEecccceeeeehh
Confidence 9999999999877633
No 58
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.59 E-value=2.7e-14 Score=150.41 Aligned_cols=224 Identities=16% Similarity=0.247 Sum_probs=154.7
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
.+|++.|.++..++||+.. + ....+..|++||+++++.+.+. .++| ++++
T Consensus 85 skVf~g~~Dk~~k~wDL~S-~-Q~~~v~~Hd~pvkt~~wv~~~~------------~~cl--~TGS-------------- 134 (347)
T KOG0647|consen 85 SKVFSGGCDKQAKLWDLAS-G-QVSQVAAHDAPVKTCHWVPGMN------------YQCL--VTGS-------------- 134 (347)
T ss_pred ceEEeeccCCceEEEEccC-C-CeeeeeecccceeEEEEecCCC------------ccee--Eecc--------------
Confidence 7899999999999999997 3 4556677999999999997542 1233 4543
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEEEecCCC
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYAILTNPI 176 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~ 176 (832)
+++|||+||+|+...+.++..+..||++.+-..+++| ..++.|.+|+|+....++....+|.
T Consensus 135 -----------------WDKTlKfWD~R~~~pv~t~~LPeRvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~~SpL 197 (347)
T KOG0647|consen 135 -----------------WDKTLKFWDTRSSNPVATLQLPERVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRIESPL 197 (347)
T ss_pred -----------------cccceeecccCCCCeeeeeeccceeeehhccCceeEEEecCCcEEEEEcCCCcchhhhhcCcc
Confidence 4599999999999999999999999999998777777 4677899999998776666655553
Q ss_pred ccCCCCCCCCCcccceeeeccceEEeeCC---CceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 177 VMGHPSAGGIGIGYGPLAVGPRWLAYSGS---PVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 177 ~~~~p~~~~~~~~~~p~Alg~r~LAya~~---~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
... -|.+|...+ .++-+..||+.-+++... .....+...+.+.-..
T Consensus 198 k~Q-----------------~R~va~f~d~~~~alGsiEGrv~iq~id~~----------~~~~nFtFkCHR~~~~---- 246 (347)
T KOG0647|consen 198 KWQ-----------------TRCVACFQDKDGFALGSIEGRVAIQYIDDP----------NPKDNFTFKCHRSTNS---- 246 (347)
T ss_pred cce-----------------eeEEEEEecCCceEeeeecceEEEEecCCC----------CccCceeEEEeccCCC----
Confidence 221 133443221 134456777665555420 0011122222221000
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCC
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSG 333 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG 333 (832)
....+++++.-... -+.|+|++++.||++..||-.....+.+.+.|..||+|.+|+.+|
T Consensus 247 -------------------~~~~VYaVNsi~Fh--P~hgtlvTaGsDGtf~FWDkdar~kLk~s~~~~qpItcc~fn~~G 305 (347)
T KOG0647|consen 247 -------------------VNDDVYAVNSIAFH--PVHGTLVTAGSDGTFSFWDKDARTKLKTSETHPQPITCCSFNRNG 305 (347)
T ss_pred -------------------CCCceEEecceEee--cccceEEEecCCceEEEecchhhhhhhccCcCCCccceeEecCCC
Confidence 00011221110000 034688999999999999999999899999999999999999999
Q ss_pred CEEEEEE
Q 003310 334 ILLVTAS 340 (832)
Q Consensus 334 ~lLATaS 340 (832)
.++|-|.
T Consensus 306 ~ifaYA~ 312 (347)
T KOG0647|consen 306 SIFAYAL 312 (347)
T ss_pred CEEEEEe
Confidence 9999875
No 59
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.59 E-value=3.6e-12 Score=134.67 Aligned_cols=221 Identities=12% Similarity=0.138 Sum_probs=129.5
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEE--eCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAIC--QAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa--~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++|++||+.+++.++.+.....+..+.++ ++.++++ .+++|++||+.+.+.+..+..... ...
T Consensus 53 ~~v~~~d~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~~-------------~~~ 119 (300)
T TIGR03866 53 DTIQVIDLATGEVIGTLPSGPDPELFALHPNGKILYIANEDDNLVTVIDIETRKVLAEIPVGVE-------------PEG 119 (300)
T ss_pred CeEEEEECCCCcEEEeccCCCCccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEeEeeCCCC-------------cce
Confidence 67999999999998888765556677776 4555543 468999999999877666642211 112
Q ss_pred eeecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccccc
Q 003310 193 LAVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFL 270 (832)
Q Consensus 193 ~Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~ 270 (832)
++++| ++|+++... +..+..+-.+..+.+.. + ..+..
T Consensus 120 ~~~~~dg~~l~~~~~~--------------------------~~~~~~~d~~~~~~~~~-~-~~~~~------------- 158 (300)
T TIGR03866 120 MAVSPDGKIVVNTSET--------------------------TNMAHFIDTKTYEIVDN-V-LVDQR------------- 158 (300)
T ss_pred EEECCCCCEEEEEecC--------------------------CCeEEEEeCCCCeEEEE-E-EcCCC-------------
Confidence 34433 455554211 00010000000000000 0 00000
Q ss_pred CCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCC-----C--CeEEEEEcCCCCEEEEEE-cC
Q 003310 271 PDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHK-----S--PISALCFDPSGILLVTAS-VQ 342 (832)
Q Consensus 271 p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~-----~--pIs~LaFSPdG~lLATaS-~d 342 (832)
+ ..+ ...++ |.. .++....+|.|++||+.+++.+..+..+. . ....++|+|||++++.+. .+
T Consensus 159 ~---~~~-~~s~d----g~~--l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~ 228 (300)
T TIGR03866 159 P---RFA-EFTAD----GKE--LWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPA 228 (300)
T ss_pred c---cEE-EECCC----CCE--EEEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCC
Confidence 0 000 00111 110 12344567899999999998877765332 1 124689999999865543 34
Q ss_pred CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe-CCCcEEEEecCCCCCceeec
Q 003310 343 GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 343 Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~~~g~~~~~~ 419 (832)
++ |.+||+.++ ..+..+..+ ..+.+++|+|||++|++++ .+++|+|||+........++
T Consensus 229 ~~-i~v~d~~~~--------------~~~~~~~~~---~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~~~~~~~ 288 (300)
T TIGR03866 229 NR-VAVVDAKTY--------------EVLDYLLVG---QRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALKVIKSIK 288 (300)
T ss_pred Ce-EEEEECCCC--------------cEEEEEEeC---CCcceEEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEE
Confidence 44 899999765 222233222 2488999999999998874 58999999999866555554
No 60
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.58 E-value=1.4e-14 Score=157.47 Aligned_cols=235 Identities=11% Similarity=0.132 Sum_probs=164.9
Q ss_pred CcEEEEEecCCeEEEEeccCC-CeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSGFQVWDVEEAD-NVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~-~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+...|.+.-+++|++.... +....++...|++..+.+-+.. +-.|| ..
T Consensus 188 dtlatgg~Dr~Ik~W~v~~~k~~~~~tLaGs~g~it~~d~d~~~-------------~~~iA--as-------------- 238 (459)
T KOG0288|consen 188 DTLATGGSDRIIKLWNVLGEKSELISTLAGSLGNITSIDFDSDN-------------KHVIA--AS-------------- 238 (459)
T ss_pred chhhhcchhhhhhhhhcccchhhhhhhhhccCCCcceeeecCCC-------------ceEEe--ec--------------
Confidence 456667788889999997532 1233334556667666653211 01121 11
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEeCCEEEEEECCCCceEEEEe
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~~~~tl~ 172 (832)
.++.+++|++.+.+..++|.. ..+|.++.|.. ++|..+.|.+|+.||+....|..++.
T Consensus 239 ------------------~d~~~r~Wnvd~~r~~~TLsGHtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~kt~l 300 (459)
T KOG0288|consen 239 ------------------NDKNLRLWNVDSLRLRHTLSGHTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSKTVL 300 (459)
T ss_pred ------------------CCCceeeeeccchhhhhhhcccccceeeehhhccccceeeccccchhhhhhhhhhheecccc
Confidence 237799999999999999985 67999999953 32333678899999999877665543
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.-+. .+.+.++
T Consensus 301 ~~S~-------------cnDI~~~-------------------------------------------------------- 311 (459)
T KOG0288|consen 301 PGSQ-------------CNDIVCS-------------------------------------------------------- 311 (459)
T ss_pred cccc-------------ccceEec--------------------------------------------------------
Confidence 2110 0011100
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS 332 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd 332 (832)
. ..+.++..|++|+.||+.+..+....+.|. .|++|..+++
T Consensus 312 ---~-----------------------------------~~~~SgH~DkkvRfwD~Rs~~~~~sv~~gg-~vtSl~ls~~ 352 (459)
T KOG0288|consen 312 ---I-----------------------------------SDVISGHFDKKVRFWDIRSADKTRSVPLGG-RVTSLDLSMD 352 (459)
T ss_pred ---c-----------------------------------eeeeecccccceEEEeccCCceeeEeecCc-ceeeEeeccC
Confidence 0 012234568889999999999999998885 8999999999
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
|..|.+++.|.+ ++++|+++. ...+.|+-.--.+.+.+..+.||||+.|+|+||.||.|+||++.++
T Consensus 353 g~~lLsssRDdt-l~viDlRt~------------eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAGS~dgsv~iW~v~tg 419 (459)
T KOG0288|consen 353 GLELLSSSRDDT-LKVIDLRTK------------EIRQTFSAEGFKCASDWTRVVFSPDGSYVAAGSADGSVYIWSVFTG 419 (459)
T ss_pred CeEEeeecCCCc-eeeeecccc------------cEEEEeeccccccccccceeEECCCCceeeeccCCCcEEEEEccCc
Confidence 999999999976 899999886 3455554422222345889999999999999999999999999987
Q ss_pred CCceeecc
Q 003310 413 GGSVNFQP 420 (832)
Q Consensus 413 g~~~~~~~ 420 (832)
+.+-.+..
T Consensus 420 KlE~~l~~ 427 (459)
T KOG0288|consen 420 KLEKVLSL 427 (459)
T ss_pred eEEEEecc
Confidence 77666543
No 61
>PTZ00420 coronin; Provisional
Probab=99.57 E-value=4.6e-13 Score=156.98 Aligned_cols=106 Identities=16% Similarity=0.133 Sum_probs=82.0
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
++++.||+|+|||+.+++.+..+. |...|.+++|+|+|.+||+++.|++ |+|||++++ ..+.++
T Consensus 142 aSgS~DgtIrIWDl~tg~~~~~i~-~~~~V~SlswspdG~lLat~s~D~~-IrIwD~Rsg--------------~~i~tl 205 (568)
T PTZ00420 142 CSSGFDSFVNIWDIENEKRAFQIN-MPKKLSSLKWNIKGNLLSGTCVGKH-MHIIDPRKQ--------------EIASSF 205 (568)
T ss_pred EEEeCCCeEEEEECCCCcEEEEEe-cCCcEEEEEECCCCCEEEEEecCCE-EEEEECCCC--------------cEEEEE
Confidence 345678999999999998887776 5678999999999999999999987 999999876 233344
Q ss_pred eccCccc----cEEEEEEccCCCEEEEEeCCC----cEEEEecCCCCCcee
Q 003310 375 QRGLTNA----VIQDISFSDDSNWIMISSSRG----TSHLFAINPLGGSVN 417 (832)
Q Consensus 375 ~rG~t~a----~I~~IaFSpDg~~LAsgS~Dg----TVhIwdl~~~g~~~~ 417 (832)
.+|... .++...|++|+++|++++.|+ +|+|||+...+....
T Consensus 206 -~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~ 255 (568)
T PTZ00420 206 -HIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALV 255 (568)
T ss_pred -ecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceE
Confidence 233322 145567889999999988774 799999987555443
No 62
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.57 E-value=1e-13 Score=141.83 Aligned_cols=183 Identities=16% Similarity=0.156 Sum_probs=140.3
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++||+|+...|.++++..- ...|.+++.+ .++.+.+.|+.|++||..|++.++.+.+|.. .+|.
T Consensus 39 rtvrLWNp~rg~liktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~rgH~a------------qVNt 106 (307)
T KOG0316|consen 39 RTVRLWNPLRGALIKTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDRRFRGHLA------------QVNT 106 (307)
T ss_pred ceEEeecccccceeeeecCCCceeeeccccccccccccCCCCceEEEEEcccCeeeeecccccc------------eeeE
Confidence 8999999999999999986 5589988885 3444446889999999999999999988831 2222
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
+ +|..+ +.
T Consensus 107 V-------~fNee----------------------------sS------------------------------------- 114 (307)
T KOG0316|consen 107 V-------RFNEE----------------------------SS------------------------------------- 114 (307)
T ss_pred E-------EecCc----------------------------ce-------------------------------------
Confidence 2 23210 00
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC--cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK--NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~--~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
.++++.-|..+++||-++. +|+..|..-...|.++.. .+..+++||.||+ +|.||
T Consensus 115 --------------------Vv~SgsfD~s~r~wDCRS~s~ePiQildea~D~V~Si~v--~~heIvaGS~DGt-vRtyd 171 (307)
T KOG0316|consen 115 --------------------VVASGSFDSSVRLWDCRSRSFEPIQILDEAKDGVSSIDV--AEHEIVAGSVDGT-VRTYD 171 (307)
T ss_pred --------------------EEEeccccceeEEEEcccCCCCccchhhhhcCceeEEEe--cccEEEeeccCCc-EEEEE
Confidence 0123455778999999864 688888877788888776 5689999999998 89999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
++.+ .. ..=.-| .+|++++||+||+.+.+++.|+|+|+.|-++++-...+.+|.+.
T Consensus 172 iR~G--------------~l-~sDy~g---~pit~vs~s~d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn~ 227 (307)
T KOG0316|consen 172 IRKG--------------TL-SSDYFG---HPITSVSFSKDGNCSLASSLDSTLRLLDKETGKLLKSYKGHKNM 227 (307)
T ss_pred eecc--------------ee-ehhhcC---CcceeEEecCCCCEEEEeeccceeeecccchhHHHHHhcccccc
Confidence 9987 11 111123 46999999999999999999999999999887666778888543
No 63
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.57 E-value=3e-13 Score=145.46 Aligned_cols=235 Identities=12% Similarity=0.178 Sum_probs=160.5
Q ss_pred CCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 16 ATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 16 ~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
+..++|+.|...| +.+|.+.. ....++++.|..++.+-+++|.+. .+ ++..
T Consensus 158 p~a~illAG~~DGsvWmw~ip~-~~~~kv~~Gh~~~ct~G~f~pdGK--------------r~-~tgy------------ 209 (399)
T KOG0296|consen 158 PRAHILLAGSTDGSVWMWQIPS-QALCKVMSGHNSPCTCGEFIPDGK--------------RI-LTGY------------ 209 (399)
T ss_pred ccccEEEeecCCCcEEEEECCC-cceeeEecCCCCCcccccccCCCc--------------eE-EEEe------------
Confidence 3577888887776 89999986 467788899999999999998531 12 1221
Q ss_pred cccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCC--CCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEE
Q 003310 95 LATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFR--SPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEY 169 (832)
Q Consensus 95 ~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~--s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~ 169 (832)
.+++|++||++|++.++.+.-. ..-..+.++ +..+.. ..+..+++-+..+++.+.
T Consensus 210 --------------------~dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~ 269 (399)
T KOG0296|consen 210 --------------------DDGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVN 269 (399)
T ss_pred --------------------cCceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEE
Confidence 1278999999999999999731 122234443 444444 456678888888887665
Q ss_pred EEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceec
Q 003310 170 AILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAA 249 (832)
Q Consensus 170 tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~las 249 (832)
...... | . +. +. .+++..+..+
T Consensus 270 ~~n~~~-----~-----------~-l~----~~--------------~e~~~esve~----------------------- 291 (399)
T KOG0296|consen 270 CNNGTV-----P-----------E-LK----PS--------------QEELDESVES----------------------- 291 (399)
T ss_pred ecCCCC-----c-----------c-cc----cc--------------chhhhhhhhh-----------------------
Confidence 553210 0 0 00 00 0000000000
Q ss_pred eeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE
Q 003310 250 GIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCF 329 (832)
Q Consensus 250 Gi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaF 329 (832)
...|. .-..++.+.-||+|.|||+.+.++ ++.-.|..+|..|.|
T Consensus 292 -------------~~~ss----------------------~lpL~A~G~vdG~i~iyD~a~~~~-R~~c~he~~V~~l~w 335 (399)
T KOG0296|consen 292 -------------IPSSS----------------------KLPLAACGSVDGTIAIYDLAASTL-RHICEHEDGVTKLKW 335 (399)
T ss_pred -------------ccccc----------------------ccchhhcccccceEEEEecccchh-heeccCCCceEEEEE
Confidence 00000 001234566789999999998765 455578899999999
Q ss_pred cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 330 DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 330 SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
-+ -.+|+||+.+|+ |++||.++| ..++++ +||. ..|++++.+||.++++++|.|+|.+||++
T Consensus 336 ~~-t~~l~t~c~~g~-v~~wDaRtG--------------~l~~~y-~GH~-~~Il~f~ls~~~~~vvT~s~D~~a~VF~v 397 (399)
T KOG0296|consen 336 LN-TDYLLTACANGK-VRQWDARTG--------------QLKFTY-TGHQ-MGILDFALSPQKRLVVTVSDDNTALVFEV 397 (399)
T ss_pred cC-cchheeeccCce-EEeeecccc--------------ceEEEE-ecCc-hheeEEEEcCCCcEEEEecCCCeEEEEec
Confidence 99 788999999997 999999998 455554 6875 45999999999999999999999999987
Q ss_pred C
Q 003310 410 N 410 (832)
Q Consensus 410 ~ 410 (832)
.
T Consensus 398 ~ 398 (399)
T KOG0296|consen 398 P 398 (399)
T ss_pred C
Confidence 4
No 64
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.56 E-value=5.3e-13 Score=144.11 Aligned_cols=267 Identities=14% Similarity=0.212 Sum_probs=171.9
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEE-ecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQM-LPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~i-lp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
.+.+|..+|++..+|||.. |+...++..|-++++.+.. .+++.. .++ +..+
T Consensus 115 ~~~IltgsYDg~~riWd~~--Gk~~~~~~Ght~~ik~v~~v~~n~~~------------~~f--vsas------------ 166 (423)
T KOG0313|consen 115 SKWILTGSYDGTSRIWDLK--GKSIKTIVGHTGPIKSVAWVIKNSSS------------CLF--VSAS------------ 166 (423)
T ss_pred CceEEEeecCCeeEEEecC--CceEEEEecCCcceeeeEEEecCCcc------------ceE--EEec------------
Confidence 5899999999999999986 6899999999999996655 444311 122 2221
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEE-----eCCCCEEEEEEc--C-CEEEEEeCCEEEEEECCCCce
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHML-----KFRSPIYSVRCS--S-RVVAICQAAQVHCFDAATLEI 167 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL-----~f~s~V~sV~~S--~-r~LAVa~~~~I~vwDl~t~~~ 167 (832)
.+.++++|-..-++.+-.. .+...|.+|+.+ + +++..+.|.+|.||+.. .+.
T Consensus 167 -------------------~Dqtl~Lw~~~~~~~~~~~~~~~~GHk~~V~sVsv~~sgtr~~SgS~D~~lkiWs~~-~~~ 226 (423)
T KOG0313|consen 167 -------------------MDQTLRLWKWNVGENKVKALKVCRGHKRSVDSVSVDSSGTRFCSGSWDTMLKIWSVE-TDE 226 (423)
T ss_pred -------------------CCceEEEEEecCchhhhhHHhHhcccccceeEEEecCCCCeEEeecccceeeecccC-CCc
Confidence 2378999988877654332 246789999985 3 44444789999999932 222
Q ss_pred EEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccce
Q 003310 168 EYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL 247 (832)
Q Consensus 168 ~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l 247 (832)
.-+++...+.. |-.+-.. -.+++..|
T Consensus 227 ~~~~E~~s~~r-------------------rk~~~~~-----~~~~~r~P------------------------------ 252 (423)
T KOG0313|consen 227 EDELESSSNRR-------------------RKKQKRE-----KEGGTRTP------------------------------ 252 (423)
T ss_pred cccccccchhh-------------------hhhhhhh-----hcccccCc------------------------------
Confidence 22333222100 0000000 00000000
Q ss_pred eceeeec-cCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEE
Q 003310 248 AAGIVNL-GDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISA 326 (832)
Q Consensus 248 asGi~~l-Gd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~ 326 (832)
+..+ |+.. .++ ++ ..++ ..+.-+...|.+|+.||+.+++.+.++.+. .++.|
T Consensus 253 ---~vtl~GHt~--~Vs------------~V--~w~d-------~~v~yS~SwDHTIk~WDletg~~~~~~~~~-ksl~~ 305 (423)
T KOG0313|consen 253 ---LVTLEGHTE--PVS------------SV--VWSD-------ATVIYSVSWDHTIKVWDLETGGLKSTLTTN-KSLNC 305 (423)
T ss_pred ---eEEeccccc--cee------------eE--EEcC-------CCceEeecccceEEEEEeecccceeeeecC-cceeE
Confidence 0000 1110 000 00 0010 012234567889999999999998888764 57999
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCC-CEEEEEeCCCcEE
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDS-NWIMISSSRGTSH 405 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg-~~LAsgS~DgTVh 405 (832)
+..+|...+||+||.|.+ |++||-+++ .+ +.. ...| -||.+ .|..+.|+|.. ..|+++|.|+|++
T Consensus 306 i~~~~~~~Ll~~gssdr~-irl~DPR~~-~g---------s~v-~~s~-~gH~n-wVssvkwsp~~~~~~~S~S~D~t~k 371 (423)
T KOG0313|consen 306 ISYSPLSKLLASGSSDRH-IRLWDPRTG-DG---------SVV-SQSL-IGHKN-WVSSVKWSPTNEFQLVSGSYDNTVK 371 (423)
T ss_pred eecccccceeeecCCCCc-eeecCCCCC-CC---------cee-EEee-ecchh-hhhheecCCCCceEEEEEecCCeEE
Confidence 999999999999999976 999999886 11 122 2344 36654 79999999965 4588999999999
Q ss_pred EEecCCCC-CceeeccCCCCcc
Q 003310 406 LFAINPLG-GSVNFQPTDANFT 426 (832)
Q Consensus 406 Iwdl~~~g-~~~~~~~H~~~~~ 426 (832)
+||+.... ....+.+|.+..-
T Consensus 372 lWDvRS~k~plydI~~h~DKvl 393 (423)
T KOG0313|consen 372 LWDVRSTKAPLYDIAGHNDKVL 393 (423)
T ss_pred EEEeccCCCcceeeccCCceEE
Confidence 99999887 5579999977654
No 65
>PTZ00421 coronin; Provisional
Probab=99.55 E-value=7.2e-13 Score=153.89 Aligned_cols=105 Identities=18% Similarity=0.248 Sum_probs=86.3
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
+++..||+|+|||+.+++.+..+.+|...|.+|+|+|+|.+|||++.||+ |+|||++++ ..+.++
T Consensus 142 aSgs~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~Dg~-IrIwD~rsg--------------~~v~tl 206 (493)
T PTZ00421 142 ASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKK-LNIIDPRDG--------------TIVSSV 206 (493)
T ss_pred EEEeCCCEEEEEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecCCCE-EEEEECCCC--------------cEEEEE
Confidence 44567899999999999999999999999999999999999999999997 999999876 344455
Q ss_pred eccCccccEEEEEEccCCCEEEEEe----CCCcEEEEecCCCCCc
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISS----SRGTSHLFAINPLGGS 415 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS----~DgTVhIwdl~~~g~~ 415 (832)
. ++....+..+.|.+++..|++++ .|++|+|||+......
T Consensus 207 ~-~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p 250 (493)
T PTZ00421 207 E-AHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASP 250 (493)
T ss_pred e-cCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCc
Confidence 3 44444566788999988877654 4799999999865543
No 66
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.55 E-value=4.5e-13 Score=152.57 Aligned_cols=242 Identities=18% Similarity=0.226 Sum_probs=177.5
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
...+|++|....+.+|+-.. +.+.++++.+...|..+.+.+.+ ..|||...
T Consensus 187 s~n~laValg~~vylW~~~s-~~v~~l~~~~~~~vtSv~ws~~G--------------~~LavG~~-------------- 237 (484)
T KOG0305|consen 187 SANVLAVALGQSVYLWSASS-GSVTELCSFGEELVTSVKWSPDG--------------SHLAVGTS-------------- 237 (484)
T ss_pred cCCeEEEEecceEEEEecCC-CceEEeEecCCCceEEEEECCCC--------------CEEEEeec--------------
Confidence 36799999999999999987 67889998778899999987643 14555442
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC--CCCEEEEEEcCCEEEE-EeCCEEEEEECCCCceEEE-Ee
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF--RSPIYSVRCSSRVVAI-CQAAQVHCFDAATLEIEYA-IL 172 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f--~s~V~sV~~S~r~LAV-a~~~~I~vwDl~t~~~~~t-l~ 172 (832)
.++|.|||.++.+.+.++.. ...|-+++++..++.. ..++.|..+|++..+.... +.
T Consensus 238 -------------------~g~v~iwD~~~~k~~~~~~~~h~~rvg~laW~~~~lssGsr~~~I~~~dvR~~~~~~~~~~ 298 (484)
T KOG0305|consen 238 -------------------DGTVQIWDVKEQKKTRTLRGSHASRVGSLAWNSSVLSSGSRDGKILNHDVRISQHVVSTLQ 298 (484)
T ss_pred -------------------CCeEEEEehhhccccccccCCcCceeEEEeccCceEEEecCCCcEEEEEEecchhhhhhhh
Confidence 17899999999999999987 6789999999887776 5788999999998764333 22
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.|... +-||.
T Consensus 299 ~H~qe----------------------------------------------------------------------VCgLk 308 (484)
T KOG0305|consen 299 GHRQE----------------------------------------------------------------------VCGLK 308 (484)
T ss_pred cccce----------------------------------------------------------------------eeeeE
Confidence 22110 00111
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC-
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP- 331 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP- 331 (832)
+--+. ..+++++.|..+.|||.....++.+|..|+..|-+|+|+|
T Consensus 309 ws~d~----------------------------------~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~ 354 (484)
T KOG0305|consen 309 WSPDG----------------------------------NQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPW 354 (484)
T ss_pred ECCCC----------------------------------CeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCC
Confidence 10000 1244567889999999999999999999999999999999
Q ss_pred CCCEEEEE--EcCCCEEEEEeCCCCCC----CCCC----------------ccCCCCceeEEEEE--------eccCccc
Q 003310 332 SGILLVTA--SVQGHNINIFKIIPGIL----GTSS----------------ACDAGTSYVHLYRL--------QRGLTNA 381 (832)
Q Consensus 332 dG~lLATa--S~dGt~I~Iwdi~t~~~----~~~s----------------~~~~~~~~~~l~~l--------~rG~t~a 381 (832)
...+|||| +.|++ |++|++.++.. .+.+ .-|.......+|++ .-||+ .
T Consensus 355 q~~lLAsGGGs~D~~-i~fwn~~~g~~i~~vdtgsQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps~~~~~~l~gH~-~ 432 (484)
T KOG0305|consen 355 QSGLLATGGGSADRC-IKFWNTNTGARIDSVDTGSQVCSLIWSKKYKELLSTHGYSENQITLWKYPSMKLVAELLGHT-S 432 (484)
T ss_pred ccCceEEcCCCcccE-EEEEEcCCCcEecccccCCceeeEEEcCCCCEEEEecCCCCCcEEEEeccccceeeeecCCc-c
Confidence 57789986 46766 99999998732 1100 01111122233332 23544 4
Q ss_pred cEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 382 VIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
+|..+++||||..|++|+.|.|+++|.+=+.
T Consensus 433 RVl~la~SPdg~~i~t~a~DETlrfw~~f~~ 463 (484)
T KOG0305|consen 433 RVLYLALSPDGETIVTGAADETLRFWNLFDE 463 (484)
T ss_pred eeEEEEECCCCCEEEEecccCcEEeccccCC
Confidence 6999999999999999999999999999665
No 67
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.54 E-value=1.5e-13 Score=159.81 Aligned_cols=201 Identities=15% Similarity=0.191 Sum_probs=147.2
Q ss_pred CEEEEEECCCCcEEEEEe-CCCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++|++||.+=+.+++.+. +.++|+.|.|.+ .+++. +.|-+|+||+..+-+|+++|.+|-. |
T Consensus 31 G~IQlWDYRM~tli~rFdeHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlD-------------Y-- 95 (1202)
T KOG0292|consen 31 GVIQLWDYRMGTLIDRFDEHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLD-------------Y-- 95 (1202)
T ss_pred ceeeeehhhhhhHHhhhhccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccc-------------e--
Confidence 789999999999999874 688999999964 45555 4566999999999999999988732 0
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
+ |-+.|. ++
T Consensus 96 V----Rt~~FH-------------he------------------------------------------------------ 104 (1202)
T KOG0292|consen 96 V----RTVFFH-------------HE------------------------------------------------------ 104 (1202)
T ss_pred e----EEeecc-------------CC------------------------------------------------------
Confidence 0 112221 00
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
-+ =+.++++|.+|+||+..++++++.+.+|.+.|.|.+|.|.-.++++||-|.+ |||||+.
T Consensus 105 ---------yP---------WIlSASDDQTIrIWNwqsr~~iavltGHnHYVMcAqFhptEDlIVSaSLDQT-VRVWDis 165 (1202)
T KOG0292|consen 105 ---------YP---------WILSASDDQTIRIWNWQSRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLDQT-VRVWDIS 165 (1202)
T ss_pred ---------Cc---------eEEEccCCCeEEEEeccCCceEEEEecCceEEEeeccCCccceEEEecccce-EEEEeec
Confidence 00 0234567889999999999999999999999999999999999999999987 9999985
Q ss_pred CCCC---CCCC-------------ccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC--
Q 003310 353 PGIL---GTSS-------------ACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG-- 414 (832)
Q Consensus 353 t~~~---~~~s-------------~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~-- 414 (832)
--.. .+.+ .++. ....--+.| .||+. -|+-+||.|.--.|++|++|+-|++|..+..+.
T Consensus 166 GLRkk~~~pg~~e~~~~~~~~~~dLfg~-~DaVVK~VL-EGHDR-GVNwaAfhpTlpliVSG~DDRqVKlWrmnetKaWE 242 (1202)
T KOG0292|consen 166 GLRKKNKAPGSLEDQMRGQQGNSDLFGQ-TDAVVKHVL-EGHDR-GVNWAAFHPTLPLIVSGADDRQVKLWRMNETKAWE 242 (1202)
T ss_pred chhccCCCCCCchhhhhccccchhhcCC-cCeeeeeee-ccccc-ccceEEecCCcceEEecCCcceeeEEEecccccee
Confidence 3211 0111 1111 122222334 46654 389999999999999999999999999976554
Q ss_pred ceeeccCCCCc
Q 003310 415 SVNFQPTDANF 425 (832)
Q Consensus 415 ~~~~~~H~~~~ 425 (832)
.-+.++|.++.
T Consensus 243 vDtcrgH~nnV 253 (1202)
T KOG0292|consen 243 VDTCRGHYNNV 253 (1202)
T ss_pred ehhhhcccCCc
Confidence 12446666543
No 68
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.53 E-value=2.2e-13 Score=149.85 Aligned_cols=109 Identities=14% Similarity=0.295 Sum_probs=91.0
Q ss_pred ccccCCCCeEEEEECC--CCcEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 294 FPDADNVGMVIVRDIV--SKNVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~--s~~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
|.+.+.++.+.|||++ +.++....++|+.+|.|++|+| ++.+|||||.|++ +.+||++.- ...
T Consensus 243 F~sv~dd~~L~iwD~R~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D~t-V~LwDlRnL-------------~~~ 308 (422)
T KOG0264|consen 243 FGSVGDDGKLMIWDTRSNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGSADKT-VALWDLRNL-------------NKP 308 (422)
T ss_pred heeecCCCeEEEEEcCCCCCCCcccccccCCceeEEEeCCCCCceEEeccCCCc-EEEeechhc-------------ccC
Confidence 4456788999999999 5677778889999999999999 6778999999998 899999875 235
Q ss_pred EEEEeccCccccEEEEEEccC-CCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 371 LYRLQRGLTNAVIQDISFSDD-SNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSpD-g~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
++++. |+. ..|..|.|||+ ...||+++.|+.++|||+..-+++...
T Consensus 309 lh~~e-~H~-dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~ 355 (422)
T KOG0264|consen 309 LHTFE-GHE-DEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSP 355 (422)
T ss_pred ceecc-CCC-cceEEEEeCCCCCceeEecccCCcEEEEeccccccccCh
Confidence 66763 443 35999999995 567899999999999999998887663
No 69
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.53 E-value=2.7e-13 Score=140.13 Aligned_cols=223 Identities=15% Similarity=0.223 Sum_probs=154.1
Q ss_pred CeEEEEeccCCCeeEEeee-cCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccccCCCCCCC
Q 003310 28 GFQVWDVEEADNVHDLVSR-YDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATACNGTSANY 106 (832)
Q Consensus 28 G~qVWdv~~~~~~~ellS~-~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~~~g~~~~~ 106 (832)
.+-|.+++....++|..+- -...+..+++.++. ...++.|.+ |
T Consensus 39 ~L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse~~-------------e~~~~~a~G----------D------------- 82 (311)
T KOG0277|consen 39 RLFILEVTDPKGIQECQSYDTEDGLFDVAWSENH-------------ENQVIAASG----------D------------- 82 (311)
T ss_pred eEEEEecCCCCCeEEEEeeecccceeEeeecCCC-------------cceEEEEec----------C-------------
Confidence 3567777655667777763 24566777776531 123444442 1
Q ss_pred CCCCCCCCCCCEEEEEECCC-CcEEEEEe-CCCCEEEEEEc---CC-EEEEEeCCEEEEEECCCCceEEEEecCCCccCC
Q 003310 107 HDLGNGSSVPTVVHFYSLRS-QSYVHMLK-FRSPIYSVRCS---SR-VVAICQAAQVHCFDAATLEIEYAILTNPIVMGH 180 (832)
Q Consensus 107 h~~g~~~~~~~tVrlWDL~T-g~~V~tL~-f~s~V~sV~~S---~r-~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~ 180 (832)
+++||||+.- -..++.++ +...|++|.++ ++ +|..+-|++|++||..-.+.+.|..+|..
T Consensus 83 ----------GSLrl~d~~~~s~Pi~~~kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~---- 148 (311)
T KOG0277|consen 83 ----------GSLRLFDLTMPSKPIHKFKEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNS---- 148 (311)
T ss_pred ----------ceEEEeccCCCCcchhHHHhhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCcc----
Confidence 6799999742 34566554 57799999997 23 34447799999999888877777766522
Q ss_pred CCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccc
Q 003310 181 PSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYK 260 (832)
Q Consensus 181 p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~ 260 (832)
+-|.. . .+|.
T Consensus 149 -------------------~Iy~a----~-----~sp~------------------------------------------ 158 (311)
T KOG0277|consen 149 -------------------CIYQA----A-----FSPH------------------------------------------ 158 (311)
T ss_pred -------------------EEEEE----e-----cCCC------------------------------------------
Confidence 11220 0 0000
Q ss_pred ccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC-CCCEEEEE
Q 003310 261 KLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP-SGILLVTA 339 (832)
Q Consensus 261 ~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP-dG~lLATa 339 (832)
+...|+++..||+.+|||++.......|++|...|.|+.|+. +-.+||||
T Consensus 159 -----------------------------~~nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~~Eil~cdw~ky~~~vl~Tg 209 (311)
T KOG0277|consen 159 -----------------------------IPNLFASASGDGTLRLWDVRSPGKFMSIEAHNSEILCCDWSKYNHNVLATG 209 (311)
T ss_pred -----------------------------CCCeEEEccCCceEEEEEecCCCceeEEEeccceeEeecccccCCcEEEec
Confidence 001244566789999999986433334999999999999998 66789999
Q ss_pred EcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccC-CCEEEEEeCCCcEEEEecCCCCCc
Q 003310 340 SVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDD-SNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 340 S~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpD-g~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
+.|+. |++||++.- ..++..| -|+.-| |..|.|||. ...||++|-|.|++||+......-
T Consensus 210 ~vd~~-vr~wDir~~-------------r~pl~eL-~gh~~A-VRkvk~Sph~~~lLaSasYDmT~riw~~~~~ds~ 270 (311)
T KOG0277|consen 210 GVDNL-VRGWDIRNL-------------RTPLFEL-NGHGLA-VRKVKFSPHHASLLASASYDMTVRIWDPERQDSA 270 (311)
T ss_pred CCCce-EEEEehhhc-------------cccceee-cCCceE-EEEEecCcchhhHhhhccccceEEecccccchhh
Confidence 99976 999999875 2456777 466544 999999995 568999999999999999865443
No 70
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.53 E-value=2.1e-13 Score=151.08 Aligned_cols=230 Identities=17% Similarity=0.235 Sum_probs=162.4
Q ss_pred EEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccccc
Q 003310 20 VLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATAC 99 (832)
Q Consensus 20 vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~~ 99 (832)
=+++..+-++|||+.... .+...+++-+..|+.+.|-.++ -||| +++.+
T Consensus 41 d~aVt~S~rvqly~~~~~-~~~k~~srFk~~v~s~~fR~DG--------------~Lla-aGD~s--------------- 89 (487)
T KOG0310|consen 41 DFAVTSSVRVQLYSSVTR-SVRKTFSRFKDVVYSVDFRSDG--------------RLLA-AGDES--------------- 89 (487)
T ss_pred ceEEecccEEEEEecchh-hhhhhHHhhccceeEEEeecCC--------------eEEE-ccCCc---------------
Confidence 355556778999998763 4666677667778877764322 2454 33321
Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEEEe-CCEEEEEECCCCceEEEEecC
Q 003310 100 NGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAICQ-AAQVHCFDAATLEIEYAILTN 174 (832)
Q Consensus 100 ~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAVa~-~~~I~vwDl~t~~~~~tl~t~ 174 (832)
+.|++||+++...++.+.- ..+|..+.|++ .+++.+. +..+++||+.+......+.+|
T Consensus 90 -----------------G~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~h 152 (487)
T KOG0310|consen 90 -----------------GHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQAELSGH 152 (487)
T ss_pred -----------------CcEEEeccccHHHHHHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEEEecCC
Confidence 6799999988777777764 67999999974 4555554 457899999998765566665
Q ss_pred CCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeec
Q 003310 175 PIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNL 254 (832)
Q Consensus 175 ~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~l 254 (832)
.+- + |..+.. |.
T Consensus 153 tDY---------------V----R~g~~~-------------~~------------------------------------ 164 (487)
T KOG0310|consen 153 TDY---------------V----RCGDIS-------------PA------------------------------------ 164 (487)
T ss_pred cce---------------e----Eeeccc-------------cC------------------------------------
Confidence 321 0 111111 00
Q ss_pred cCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCCC
Q 003310 255 GDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPSG 333 (832)
Q Consensus 255 Gd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPdG 333 (832)
++. .+++++.||+|++||+.+. ..+.+| .|..||..+.|=|+|
T Consensus 165 --------------------------------~~h---ivvtGsYDg~vrl~DtR~~~~~v~el-nhg~pVe~vl~lpsg 208 (487)
T KOG0310|consen 165 --------------------------------NDH---IVVTGSYDGKVRLWDTRSLTSRVVEL-NHGCPVESVLALPSG 208 (487)
T ss_pred --------------------------------CCe---EEEecCCCceEEEEEeccCCceeEEe-cCCCceeeEEEcCCC
Confidence 000 1346789999999999987 566665 588899999999999
Q ss_pred CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 334 ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 334 ~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
.++|||+ |..|+|||+.+| ...+..+ ..+.-.|+|+++..|++.|.++|.|+.|+|||+..++
T Consensus 209 s~iasAg--Gn~vkVWDl~~G-------------~qll~~~--~~H~KtVTcL~l~s~~~rLlS~sLD~~VKVfd~t~~K 271 (487)
T KOG0310|consen 209 SLIASAG--GNSVKVWDLTTG-------------GQLLTSM--FNHNKTVTCLRLASDSTRLLSGSLDRHVKVFDTTNYK 271 (487)
T ss_pred CEEEEcC--CCeEEEEEecCC-------------ceehhhh--hcccceEEEEEeecCCceEeecccccceEEEEccceE
Confidence 9999988 667999999876 1333332 2233459999999999999999999999999988876
Q ss_pred Cceee
Q 003310 414 GSVNF 418 (832)
Q Consensus 414 ~~~~~ 418 (832)
-...+
T Consensus 272 vv~s~ 276 (487)
T KOG0310|consen 272 VVHSW 276 (487)
T ss_pred EEEee
Confidence 55444
No 71
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.53 E-value=1.9e-13 Score=162.86 Aligned_cols=241 Identities=12% Similarity=0.109 Sum_probs=154.7
Q ss_pred ccccC--CCCeEEEEECCC------------CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCC
Q 003310 294 FPDAD--NVGMVIVRDIVS------------KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTS 359 (832)
Q Consensus 294 ~~s~~--~~G~V~IwDl~s------------~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~ 359 (832)
|++++ .||.++||.... .+.+.++..|.+.|+|+.|||||++||+||+|. .|.||+..+. +..
T Consensus 28 ~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~dG~~lAsGSDD~-~v~iW~~~~~--~~~ 104 (942)
T KOG0973|consen 28 FATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVNCVRFSPDGSYLASGSDDR-LVMIWERAEI--GSG 104 (942)
T ss_pred EecCCccccccceeeccccccchhhhhhcccchhheeeccccCceeEEEECCCCCeEeeccCcc-eEEEeeeccc--CCc
Confidence 34444 677788998753 346788889999999999999999999999996 4899998752 111
Q ss_pred Ccc-----CCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCccc-------
Q 003310 360 SAC-----DAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFTT------- 427 (832)
Q Consensus 360 s~~-----~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~~------- 427 (832)
..+ ..+-...+.+...|||. ..|.+++||||+.+||++|.|++|+||+..+++....+++|...+-.
T Consensus 105 ~~fgs~g~~~~vE~wk~~~~l~~H~-~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~G 183 (942)
T KOG0973|consen 105 TVFGSTGGAKNVESWKVVSILRGHD-SDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPIG 183 (942)
T ss_pred ccccccccccccceeeEEEEEecCC-CccceeccCCCccEEEEecccceEEEEccccceeeeeeecccccccceEECCcc
Confidence 111 11112223333447864 57999999999999999999999999999999777888898864211
Q ss_pred -----------------------------ccCCcccccccCCCCCCCCCCCCcccccCCC---CeeeeeceEEEcCCCCC
Q 003310 428 -----------------------------KHGAMAKSGVRWPPNLGLQMPNQQSLCASGP---PVTLSVVSRIRNGNNGW 475 (832)
Q Consensus 428 -----------------------------~~~~~~~~~~r~~~~s~~~~~~~~~l~~~~~---p~~ls~v~~I~~~~~~~ 475 (832)
|...+..+.+++..|| |+.+.|+.+++ | -+.++.|.++ +|
T Consensus 184 ky~ASqsdDrtikvwrt~dw~i~k~It~pf~~~~~~T~f~RlSWS----PDG~~las~nA~n~~--~~~~~IieR~--tW 255 (942)
T KOG0973|consen 184 KYFASQSDDRTLKVWRTSDWGIEKSITKPFEESPLTTFFLRLSWS----PDGHHLASPNAVNGG--KSTIAIIERG--TW 255 (942)
T ss_pred CeeeeecCCceEEEEEcccceeeEeeccchhhCCCcceeeecccC----CCcCeecchhhccCC--cceeEEEecC--Cc
Confidence 1123444555555555 89999977653 4 5566677774 49
Q ss_pred Ccccc---ccchhc--c------------CcccCCC---cceeeeeeccCCCccccc------------cCC-----ccc
Q 003310 476 RGTVS---GAAAAA--T------------GRVSSLS---GAIASSFHNCKGNSETYA------------AGS-----SLK 518 (832)
Q Consensus 476 ~~~v~---~~~~~a--~------------g~~~~~~---g~~~~~~h~~~~~~~~~~------------~~~-----~~~ 518 (832)
..++. ..++.- + |.+..+. .++|..-+.-....|+.. ..+ =.+
T Consensus 256 k~~~~LvGH~~p~evvrFnP~lfe~~~~ng~~~~~~~~y~i~AvgSqDrSlSVW~T~~~RPl~vi~~lf~~SI~DmsWsp 335 (942)
T KOG0973|consen 256 KVDKDLVGHSAPVEVVRFNPKLFERNNKNGTSTQPNCYYCIAAVGSQDRSLSVWNTALPRPLFVIHNLFNKSIVDMSWSP 335 (942)
T ss_pred eeeeeeecCCCceEEEEeChHHhccccccCCccCCCcceEEEEEecCCccEEEEecCCCCchhhhhhhhcCceeeeeEcC
Confidence 76532 221110 0 1110010 122222121111112221 000 034
Q ss_pred ccccEEEEcCCCcEEEEeeeccCCCCcc
Q 003310 519 IKNHLLVFSPSGCMIQYALRISTGLDVT 546 (832)
Q Consensus 519 ~~~~Llv~s~~G~l~~y~l~~~~~~~~~ 546 (832)
..-.||++|-||.+.+..+....-++..
T Consensus 336 dG~~LfacS~DGtV~~i~Fee~ElG~~l 363 (942)
T KOG0973|consen 336 DGFSLFACSLDGTVALIHFEEKELGVAL 363 (942)
T ss_pred CCCeEEEEecCCeEEEEEcchHHhCccc
Confidence 6778999999999999999998766554
No 72
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.52 E-value=1.9e-13 Score=150.29 Aligned_cols=178 Identities=15% Similarity=0.248 Sum_probs=128.2
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccccee
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPL 193 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~ 193 (832)
+.|.|-..+|++.+++++..+.|.++.|+ +++++++.+++|++||+++..+++....-....+ ..+
T Consensus 325 G~I~lLhakT~eli~s~KieG~v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~v~g-----------ts~ 393 (514)
T KOG2055|consen 325 GHIHLLHAKTKELITSFKIEGVVSDFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGSVHG-----------TSL 393 (514)
T ss_pred ceEEeehhhhhhhhheeeeccEEeeEEEecCCcEEEEEcCCceEEEEecCCcceEEEEeecCccce-----------eee
Confidence 66899999999999999999999999996 4677778899999999999988888765321110 011
Q ss_pred e--eccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccC
Q 003310 194 A--VGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLP 271 (832)
Q Consensus 194 A--lg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p 271 (832)
+ +.++|
T Consensus 394 ~~S~ng~y------------------------------------------------------------------------ 401 (514)
T KOG2055|consen 394 CISLNGSY------------------------------------------------------------------------ 401 (514)
T ss_pred eecCCCce------------------------------------------------------------------------
Confidence 1 11111
Q ss_pred CCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC------CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC-C
Q 003310 272 DSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS------KNVIAQFRAHKSPISALCFDPSGILLVTASVQG-H 344 (832)
Q Consensus 272 ~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s------~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG-t 344 (832)
++++...|.|.|||..+ .+++..+..-+..|+.|+|+||+++||.||..- .
T Consensus 402 ----------------------lA~GS~~GiVNIYd~~s~~~s~~PkPik~~dNLtt~Itsl~Fn~d~qiLAiaS~~~kn 459 (514)
T KOG2055|consen 402 ----------------------LATGSDSGIVNIYDGNSCFASTNPKPIKTVDNLTTAITSLQFNHDAQILAIASRVKKN 459 (514)
T ss_pred ----------------------EEeccCcceEEEeccchhhccCCCCchhhhhhhheeeeeeeeCcchhhhhhhhhcccc
Confidence 23345667888888653 568888888888999999999999999998642 2
Q ss_pred EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 345 NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 345 ~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+|+--+.... +..-+-. ++..-..|.|++|||.|-+||.|..+|.+++|.|+.|
T Consensus 460 alrLVHvPS~T------------VFsNfP~-~n~~vg~vtc~aFSP~sG~lAvGNe~grv~l~kL~hy 514 (514)
T KOG2055|consen 460 ALRLVHVPSCT------------VFSNFPT-SNTKVGHVTCMAFSPNSGYLAVGNEAGRVHLFKLHHY 514 (514)
T ss_pred ceEEEecccee------------eeccCCC-CCCcccceEEEEecCCCceEEeecCCCceeeEeeccC
Confidence 35665543320 0000000 1122234899999999999999999999999999754
No 73
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.51 E-value=4.7e-12 Score=132.05 Aligned_cols=176 Identities=18% Similarity=0.275 Sum_probs=133.7
Q ss_pred CEEEEEECCCC---cEEEEEe--CCCCEEEEEEc--CCEEEE-EeCCEEEEEECC--CCceEEEEecCCCccCCCCCCCC
Q 003310 117 TVVHFYSLRSQ---SYVHMLK--FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAA--TLEIEYAILTNPIVMGHPSAGGI 186 (832)
Q Consensus 117 ~tVrlWDL~Tg---~~V~tL~--f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~--t~~~~~tl~t~~~~~~~p~~~~~ 186 (832)
++||+|++..+ .|...|. +...|++|+++ +++||+ ++|.++.||--. ++++..+|++|.+..-
T Consensus 37 k~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~efecv~~lEGHEnEVK------- 109 (312)
T KOG0645|consen 37 KAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKEDGEFECVATLEGHENEVK------- 109 (312)
T ss_pred ceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCCCceeEEeeeecccccee-------
Confidence 78999999853 4554553 46789999996 589988 689999999765 5688999999865321
Q ss_pred CcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccc
Q 003310 187 GIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYC 266 (832)
Q Consensus 187 ~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~ 266 (832)
.+|++. +|.+
T Consensus 110 ------------~Vaws~---------------------------sG~~------------------------------- 119 (312)
T KOG0645|consen 110 ------------CVAWSA---------------------------SGNY------------------------------- 119 (312)
T ss_pred ------------EEEEcC---------------------------CCCE-------------------------------
Confidence 123321 1111
Q ss_pred ccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC
Q 003310 267 SEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK---NVIAQFRAHKSPISALCFDPSGILLVTASVQG 343 (832)
Q Consensus 267 ~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~---~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG 343 (832)
+++...|..|-||.+..+ .+++.|+.|++.|--+.|.|.-.+|+|+|.|.
T Consensus 120 ---------------------------LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDn 172 (312)
T KOG0645|consen 120 ---------------------------LATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDN 172 (312)
T ss_pred ---------------------------EEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccCC
Confidence 122334567889988743 58899999999999999999999999999997
Q ss_pred CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 344 HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 344 t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
+ |++|+-.++ + ...++.+|. |++. .|++++|.+.|..|++++.|+|++||.+-
T Consensus 173 T-Ik~~~~~~d-------d----dW~c~~tl~-g~~~-TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~ 225 (312)
T KOG0645|consen 173 T-IKVYRDEDD-------D----DWECVQTLD-GHEN-TVWSLAFDNIGSRLVSCSDDGTVSIWRLY 225 (312)
T ss_pred e-EEEEeecCC-------C----CeeEEEEec-Cccc-eEEEEEecCCCceEEEecCCcceEeeeec
Confidence 7 999987643 1 235666773 5543 69999999999999999999999999965
No 74
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.51 E-value=8.8e-14 Score=147.01 Aligned_cols=111 Identities=23% Similarity=0.401 Sum_probs=99.2
Q ss_pred ccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
++++..||.|+||.+.+|.++..|. +|+..|+||.||.|+..+.++|.|-+ +||.-++.| ..|.
T Consensus 278 lAsGsqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~t-vRiHGlKSG--------------K~LK 342 (508)
T KOG0275|consen 278 LASGSQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQT-VRIHGLKSG--------------KCLK 342 (508)
T ss_pred hhccCcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccce-EEEeccccc--------------hhHH
Confidence 3456789999999999999999998 99999999999999999999999966 899988876 5666
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccC
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPT 421 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H 421 (832)
++ ||++. -|+...|++||.++.++|+||||+||+..+..+..+|+.-
T Consensus 343 Ef-rGHsS-yvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~ 389 (508)
T KOG0275|consen 343 EF-RGHSS-YVNEATFTDDGHHIISASSDGTVKVWHGKTTECLSTFKPL 389 (508)
T ss_pred Hh-cCccc-cccceEEcCCCCeEEEecCCccEEEecCcchhhhhhccCC
Confidence 66 78765 4999999999999999999999999999999988888753
No 75
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.50 E-value=8.8e-14 Score=151.10 Aligned_cols=243 Identities=16% Similarity=0.263 Sum_probs=175.3
Q ss_pred ecccc-ccccCCCC--------CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccC
Q 003310 5 AGFDK-LESEAGAT--------RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVR 74 (832)
Q Consensus 5 ~~fd~-l~~~~~~~--------~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~r 74 (832)
.-|+. |..|++++ ..-++.|-.+| +++|+.+- ++++.+-..|...||++++.|+- ..|
T Consensus 128 fnFEtilQaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpnm-nnVk~~~ahh~eaIRdlafSpnD-------skF---- 195 (464)
T KOG0284|consen 128 FNFETILQAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPNM-NNVKIIQAHHAEAIRDLAFSPND-------SKF---- 195 (464)
T ss_pred eeHHHHhhhhcccceeEEEccCCCEEEEcCCCceEEecccch-hhhHHhhHhhhhhhheeccCCCC-------cee----
Confidence 34555 34466543 56777887777 89999886 45655555566999999999841 233
Q ss_pred CEEEEEeCCCCccCccccCCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCC-CEEEEEEc--CCEEEE
Q 003310 75 PLLVFCADGSRSCGTKVQDGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRS-PIYSVRCS--SRVVAI 151 (832)
Q Consensus 75 PLLavv~~g~~~g~~~~~Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s-~V~sV~~S--~r~LAV 151 (832)
+-|++ +++|+|||..-.+.-..|..+. -|.++++. +.+||+
T Consensus 196 ---~t~Sd---------------------------------Dg~ikiWdf~~~kee~vL~GHgwdVksvdWHP~kgLias 239 (464)
T KOG0284|consen 196 ---LTCSD---------------------------------DGTIKIWDFRMPKEERVLRGHGWDVKSVDWHPTKGLIAS 239 (464)
T ss_pred ---EEecC---------------------------------CCeEEEEeccCCchhheeccCCCCcceeccCCccceeEE
Confidence 33443 2789999999888777776544 68999996 467777
Q ss_pred E-eCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCccccccccccccc
Q 003310 152 C-QAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFA 230 (832)
Q Consensus 152 a-~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~ 230 (832)
+ -|+.|++||.++++|+-++..|.+. .+++ -|..
T Consensus 240 gskDnlVKlWDprSg~cl~tlh~HKnt--------------Vl~~-----~f~~-------------------------- 274 (464)
T KOG0284|consen 240 GSKDNLVKLWDPRSGSCLATLHGHKNT--------------VLAV-----KFNP-------------------------- 274 (464)
T ss_pred ccCCceeEeecCCCcchhhhhhhccce--------------EEEE-----EEcC--------------------------
Confidence 4 5678999999999999888877541 1221 1110
Q ss_pred CCCcceeeeecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC
Q 003310 231 SNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS 310 (832)
Q Consensus 231 s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s 310 (832)
++. -+++++.|..++++|+.+
T Consensus 275 -n~N----------------------------------------------------------~Llt~skD~~~kv~DiR~ 295 (464)
T KOG0284|consen 275 -NGN----------------------------------------------------------WLLTGSKDQSCKVFDIRT 295 (464)
T ss_pred -CCC----------------------------------------------------------eeEEccCCceEEEEehhH
Confidence 010 123345677899999999
Q ss_pred CcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEc
Q 003310 311 KNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFS 389 (832)
Q Consensus 311 ~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFS 389 (832)
.+.+..+++|...|.+++|+|= -.+|.+++.||. |..|.+... ..+-.+.-++. ..|++++|.
T Consensus 296 mkEl~~~r~Hkkdv~~~~WhP~~~~lftsgg~Dgs-vvh~~v~~~--------------~p~~~i~~AHd-~~iwsl~~h 359 (464)
T KOG0284|consen 296 MKELFTYRGHKKDVTSLTWHPLNESLFTSGGSDGS-VVHWVVGLE--------------EPLGEIPPAHD-GEIWSLAYH 359 (464)
T ss_pred hHHHHHhhcchhhheeeccccccccceeeccCCCc-eEEEecccc--------------ccccCCCcccc-cceeeeecc
Confidence 9999999999999999999994 558899999998 788988632 11222222232 249999999
Q ss_pred cCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 390 DDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 390 pDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
|=|.+||+||.|.|++.|.-...+..
T Consensus 360 PlGhil~tgsnd~t~rfw~r~rp~d~ 385 (464)
T KOG0284|consen 360 PLGHILATGSNDRTVRFWTRNRPGDK 385 (464)
T ss_pred ccceeEeecCCCcceeeeccCCCCCc
Confidence 99999999999999999988765543
No 76
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.50 E-value=1.4e-11 Score=143.29 Aligned_cols=120 Identities=21% Similarity=0.277 Sum_probs=89.7
Q ss_pred cCCCCeEEEEECCCCcEEEEe---ccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC------CCCCc------
Q 003310 297 ADNVGMVIVRDIVSKNVIAQF---RAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL------GTSSA------ 361 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~---~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~------~~~s~------ 361 (832)
+...|.|.+|++++|-....| ++|.++|+.|+.+--+++++||+.+|. ++.||...... +..-.
T Consensus 466 G~S~G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~D~~n~~~vsa~~~Gi-lkfw~f~~k~l~~~l~l~~~~~~iv~hr 544 (910)
T KOG1539|consen 466 GYSKGTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAVDGTNRLLVSAGADGI-LKFWDFKKKVLKKSLRLGSSITGIVYHR 544 (910)
T ss_pred eccCCeEEEEEcccCeeecccccCccccCceeEEEecCCCceEEEccCcce-EEEEecCCcceeeeeccCCCcceeeeee
Confidence 446789999999999888888 699999999999999999999999997 89999887531 11000
Q ss_pred -----c-CCCCceeEEEE--------EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 362 -----C-DAGTSYVHLYR--------LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 362 -----~-~~~~~~~~l~~--------l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+ .-..-...++. --+||+ .+|++++|||||+||++++.|+||++||+-+....-.+
T Consensus 545 ~s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~-nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~ 614 (910)
T KOG1539|consen 545 VSDLLAIALDDFSIRVVDVVTRKVVREFWGHG-NRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLIDGL 614 (910)
T ss_pred hhhhhhhhcCceeEEEEEchhhhhhHHhhccc-cceeeeEeCCCCcEEEEeecCCcEEEEeccCcceeeeE
Confidence 0 00000111111 125665 47999999999999999999999999999776544333
No 77
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.48 E-value=1.8e-12 Score=147.33 Aligned_cols=107 Identities=14% Similarity=0.109 Sum_probs=90.9
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
++.++..+.+++||-.+++.+..+++|+..|.+|-.++||+.++|||.||+ ||+||+.-. +++.+
T Consensus 186 ivsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~~dDGt~~ls~sSDgt-IrlWdLgqQ--------------rCl~T 250 (735)
T KOG0308|consen 186 IVSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDDGTRLLSASSDGT-IRLWDLGQQ--------------RCLAT 250 (735)
T ss_pred EEecCcccceEEeccccccceeeeeccccceEEEEEcCCCCeEeecCCCce-EEeeecccc--------------ceeee
Confidence 445677889999999999999999999999999999999999999999998 999999754 56666
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCcee
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVN 417 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~ 417 (832)
+. -+.. -||++.-+|+=+.+.+|+.||.|..=|+..+...+.
T Consensus 251 ~~-vH~e-~VWaL~~~~sf~~vYsG~rd~~i~~Tdl~n~~~~tl 292 (735)
T KOG0308|consen 251 YI-VHKE-GVWALQSSPSFTHVYSGGRDGNIYRTDLRNPAKSTL 292 (735)
T ss_pred EE-eccC-ceEEEeeCCCcceEEecCCCCcEEecccCCchhheE
Confidence 53 2332 299999999999999999999999999988644433
No 78
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.47 E-value=4.2e-12 Score=137.22 Aligned_cols=99 Identities=15% Similarity=0.283 Sum_probs=80.8
Q ss_pred cCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 297 ADNVGMVIVRDIVSK---NVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~---~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
+..|..+++||-+++ -+..+|-+|+.-|+++.++|... +|+++|.|++ +++||++.. ...||
T Consensus 318 gssdr~irl~DPR~~~gs~v~~s~~gH~nwVssvkwsp~~~~~~~S~S~D~t-~klWDvRS~-------------k~ply 383 (423)
T KOG0313|consen 318 GSSDRHIRLWDPRTGDGSVVSQSLIGHKNWVSSVKWSPTNEFQLVSGSYDNT-VKLWDVRST-------------KAPLY 383 (423)
T ss_pred cCCCCceeecCCCCCCCceeEEeeecchhhhhheecCCCCceEEEEEecCCe-EEEEEeccC-------------CCcce
Confidence 455678999998865 36678899999999999999766 6899999998 899999875 12578
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+.+. .-+|.++.|+ ++..|++|+.|.+++||.-.+.
T Consensus 384 dI~~h--~DKvl~vdW~-~~~~IvSGGaD~~l~i~~~~~~ 420 (423)
T KOG0313|consen 384 DIAGH--NDKVLSVDWN-EGGLIVSGGADNKLRIFKGSPI 420 (423)
T ss_pred eeccC--CceEEEEecc-CCceEEeccCcceEEEeccccc
Confidence 88653 3469999998 5678999999999999987654
No 79
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.46 E-value=2.6e-11 Score=126.62 Aligned_cols=239 Identities=15% Similarity=0.262 Sum_probs=157.2
Q ss_pred EEEEEecCCeEEEEeccCC--CeeEEee-ecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 20 VLLLGYRSGFQVWDVEEAD--NVHDLVS-RYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 20 vLl~Gy~~G~qVWdv~~~~--~~~ellS-~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
+...|.++.++||+..... .++.+++ .|...||.+++.|..+ +||..+
T Consensus 30 lAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~--------------~La~aS--------------- 80 (312)
T KOG0645|consen 30 LASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGR--------------YLASAS--------------- 80 (312)
T ss_pred EEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCc--------------EEEEee---------------
Confidence 3344556679999998411 3455554 5788999999988431 454333
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCC--cEEEEEeC-CCCEEEEEEc--CCEEEEE-eCCEEEEEECCCC---ce
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQ--SYVHMLKF-RSPIYSVRCS--SRVVAIC-QAAQVHCFDAATL---EI 167 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg--~~V~tL~f-~s~V~sV~~S--~r~LAVa-~~~~I~vwDl~t~---~~ 167 (832)
-+.|+-||.-..+ +|+.+|+. .+.|.+|+|| +++||.| -++.|.||.+... ++
T Consensus 81 ------------------FD~t~~Iw~k~~~efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddEfec 142 (312)
T KOG0645|consen 81 ------------------FDATVVIWKKEDGEFECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDEFEC 142 (312)
T ss_pred ------------------ccceEEEeecCCCceeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEE
Confidence 1277889976544 68888975 6799999996 6999995 5789999998743 34
Q ss_pred EEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccce
Q 003310 168 EYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHL 247 (832)
Q Consensus 168 ~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~l 247 (832)
+-.|..|.. -|.++...-++
T Consensus 143 ~aVL~~Htq----------------------------------------------------------DVK~V~WHPt~-- 162 (312)
T KOG0645|consen 143 IAVLQEHTQ----------------------------------------------------------DVKHVIWHPTE-- 162 (312)
T ss_pred Eeeeccccc----------------------------------------------------------cccEEEEcCCc--
Confidence 444444321 01111100000
Q ss_pred eceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECC---CCcEEEEeccCCCCe
Q 003310 248 AAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIV---SKNVIAQFRAHKSPI 324 (832)
Q Consensus 248 asGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~---s~~~l~~~~aH~~pI 324 (832)
..+++.+.|.+|++|+-. .-.++++|.+|...|
T Consensus 163 --------------------------------------------dlL~S~SYDnTIk~~~~~~dddW~c~~tl~g~~~TV 198 (312)
T KOG0645|consen 163 --------------------------------------------DLLFSCSYDNTIKVYRDEDDDDWECVQTLDGHENTV 198 (312)
T ss_pred --------------------------------------------ceeEEeccCCeEEEEeecCCCCeeEEEEecCccceE
Confidence 012345678899999766 346899999999999
Q ss_pred EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCC-------------ccCCCCceeEEEEEec--------------c
Q 003310 325 SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSS-------------ACDAGTSYVHLYRLQR--------------G 377 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s-------------~~~~~~~~~~l~~l~r--------------G 377 (832)
.+++|+|.|..|++++.|++ ++||...+..++..+ +.+.......+|+-.- +
T Consensus 199 W~~~F~~~G~rl~s~sdD~t-v~Iw~~~~~~~~~~sr~~Y~v~W~~~~IaS~ggD~~i~lf~~s~~~d~p~~~l~~~~~~ 277 (312)
T KOG0645|consen 199 WSLAFDNIGSRLVSCSDDGT-VSIWRLYTDLSGMHSRALYDVPWDNGVIASGGGDDAIRLFKESDSPDEPSWNLLAKKEG 277 (312)
T ss_pred EEEEecCCCceEEEecCCcc-eEeeeeccCcchhcccceEeeeecccceEeccCCCEEEEEEecCCCCCchHHHHHhhhc
Confidence 99999999999999999998 899985543211110 0011111122222211 1
Q ss_pred CccccEEEEEEccC-CCEEEEEeCCCcEEEEecC
Q 003310 378 LTNAVIQDISFSDD-SNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 378 ~t~a~I~~IaFSpD-g~~LAsgS~DgTVhIwdl~ 410 (832)
.+.-.|++|+|.|. ..+|+++++||+|++|.+.
T Consensus 278 aHe~dVNsV~w~p~~~~~L~s~~DDG~v~~W~l~ 311 (312)
T KOG0645|consen 278 AHEVDVNSVQWNPKVSNRLASGGDDGIVNFWELE 311 (312)
T ss_pred ccccccceEEEcCCCCCceeecCCCceEEEEEec
Confidence 11225899999994 7899999999999999874
No 80
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.45 E-value=1.3e-12 Score=145.24 Aligned_cols=114 Identities=16% Similarity=0.324 Sum_probs=86.3
Q ss_pred ccccccCCCCeEEEEECCCC-cEEEEec-----cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCC
Q 003310 292 GHFPDADNVGMVIVRDIVSK-NVIAQFR-----AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAG 365 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~-~~l~~~~-----aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~ 365 (832)
..|.++..||+++|||+..- +.+..|+ +-.-+++..+|+|||+++|+|..||. |.+|+....
T Consensus 282 ~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~iAagc~DGS-IQ~W~~~~~----------- 349 (641)
T KOG0772|consen 282 EEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKLIAAGCLDGS-IQIWDKGSR----------- 349 (641)
T ss_pred cceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcchhhhcccCCc-eeeeecCCc-----------
Confidence 35778899999999999753 3333333 22347889999999999999999998 999997443
Q ss_pred CceeEEEEEeccCcc-ccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 366 TSYVHLYRLQRGLTN-AVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 366 ~~~~~l~~l~rG~t~-a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
.++..+..+..|.. ..|.||+||+||++|++=+.|+|++||||..+......
T Consensus 350 -~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~ 402 (641)
T KOG0772|consen 350 -TVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNV 402 (641)
T ss_pred -ccccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccccccchhh
Confidence 12223333333333 24999999999999999999999999999988776543
No 81
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.45 E-value=7.3e-12 Score=145.69 Aligned_cols=172 Identities=15% Similarity=0.280 Sum_probs=130.2
Q ss_pred CCEEEEEECCCCcEEEEE----eCCCCEEEEEEc--CC-EEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCc
Q 003310 116 PTVVHFYSLRSQSYVHML----KFRSPIYSVRCS--SR-VVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL----~f~s~V~sV~~S--~r-~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
.++|-+|++++|-..+++ -+..+|.+|++. .+ +++.++++-+.+||..+...+.++.-. ++
T Consensus 469 ~G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~D~~n~~~vsa~~~Gilkfw~f~~k~l~~~l~l~-----~~------- 536 (910)
T KOG1539|consen 469 KGTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAVDGTNRLLVSAGADGILKFWDFKKKVLKKSLRLG-----SS------- 536 (910)
T ss_pred CCeEEEEEcccCeeecccccCccccCceeEEEecCCCceEEEccCcceEEEEecCCcceeeeeccC-----CC-------
Confidence 489999999999999988 357899999995 23 444578999999999988766665321 00
Q ss_pred ccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccc
Q 003310 189 GYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSE 268 (832)
Q Consensus 189 ~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~ 268 (832)
+ ..+.|. ..
T Consensus 537 ----~----~~iv~h---------------------------------------------------------r~------ 545 (910)
T KOG1539|consen 537 ----I----TGIVYH---------------------------------------------------------RV------ 545 (910)
T ss_pred ----c----ceeeee---------------------------------------------------------eh------
Confidence 0 001111 00
Q ss_pred ccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEE
Q 003310 269 FLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINI 348 (832)
Q Consensus 269 ~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~I 348 (832)
.+.++.+..+-.|+|+|..+.++++.|.+|+..|++++|||||+.|++|+.|++ ||+
T Consensus 546 ----------------------s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrWlisasmD~t-Ir~ 602 (910)
T KOG1539|consen 546 ----------------------SDLLAIALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRWLISASMDST-IRT 602 (910)
T ss_pred ----------------------hhhhhhhcCceeEEEEEchhhhhhHHhhccccceeeeEeCCCCcEEEEeecCCc-EEE
Confidence 001222345567999999999999999999999999999999999999999998 999
Q ss_pred EeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC-CcEEEEecC
Q 003310 349 FKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR-GTSHLFAIN 410 (832)
Q Consensus 349 wdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D-gTVhIwdl~ 410 (832)
||+.++ ..+--+. .......+.|||+|.+||++..| .-|.+|.=.
T Consensus 603 wDlpt~--------------~lID~~~---vd~~~~sls~SPngD~LAT~Hvd~~gIylWsNk 648 (910)
T KOG1539|consen 603 WDLPTG--------------TLIDGLL---VDSPCTSLSFSPNGDFLATVHVDQNGIYLWSNK 648 (910)
T ss_pred EeccCc--------------ceeeeEe---cCCcceeeEECCCCCEEEEEEecCceEEEEEch
Confidence 999987 3443442 23457889999999999999998 569999653
No 82
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.45 E-value=2.1e-10 Score=121.14 Aligned_cols=180 Identities=14% Similarity=0.184 Sum_probs=124.5
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEE-EEE-eCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVV-AIC-QAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~L-AVa-~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++|++||+.++++++++.....+..+.++ ++.+ +++ .++.|++||+.+++.+..+..+..
T Consensus 11 ~~v~~~d~~t~~~~~~~~~~~~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~---------------- 74 (300)
T TIGR03866 11 NTISVIDTATLEVTRTFPVGQRPRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPD---------------- 74 (300)
T ss_pred CEEEEEECCCCceEEEEECCCCCCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCC----------------
Confidence 78999999999999999876667788886 3455 443 567999999999877655532210
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
+..+++.. ++..+
T Consensus 75 ----~~~~~~~~---------------------------~g~~l------------------------------------ 87 (300)
T TIGR03866 75 ----PELFALHP---------------------------NGKIL------------------------------------ 87 (300)
T ss_pred ----ccEEEECC---------------------------CCCEE------------------------------------
Confidence 01222321 11110
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
+++...++.|++||+.+.+.+..++.+ ..+.+++|+|||.+|++++.++..+.+||..
T Consensus 88 ---------------------~~~~~~~~~l~~~d~~~~~~~~~~~~~-~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~ 145 (300)
T TIGR03866 88 ---------------------YIANEDDNLVTVIDIETRKVLAEIPVG-VEPEGMAVSPDGKIVVNTSETTNMAHFIDTK 145 (300)
T ss_pred ---------------------EEEcCCCCeEEEEECCCCeEEeEeeCC-CCcceEEECCCCCEEEEEecCCCeEEEEeCC
Confidence 111234578999999998888888743 3468899999999999999887767888987
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe-CCCcEEEEecCCCCCceee
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~~~g~~~~~ 418 (832)
++ ..+..+..+ ..+..++|+|||++|++++ .+++|++||+.+......+
T Consensus 146 ~~--------------~~~~~~~~~---~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~~~~ 195 (300)
T TIGR03866 146 TY--------------EIVDNVLVD---QRPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVIKKI 195 (300)
T ss_pred CC--------------eEEEEEEcC---CCccEEEECCCCCEEEEEcCCCCEEEEEEcCcceeeeee
Confidence 65 222222222 2356799999999986554 5999999999876443433
No 83
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.44 E-value=8.1e-12 Score=137.67 Aligned_cols=115 Identities=14% Similarity=0.268 Sum_probs=87.7
Q ss_pred ccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 294 FPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
+++++.|++|.+||+++. +++.+|..|...|.+|.|||. -+.|||++.|++ ++|||+..-.......+...+....+
T Consensus 288 lAT~S~D~tV~LwDlRnL~~~lh~~e~H~dev~~V~WSPh~etvLASSg~D~r-l~vWDls~ig~eq~~eda~dgppEll 366 (422)
T KOG0264|consen 288 LATGSADKTVALWDLRNLNKPLHTFEGHEDEVFQVEWSPHNETVLASSGTDRR-LNVWDLSRIGEEQSPEDAEDGPPELL 366 (422)
T ss_pred EEeccCCCcEEEeechhcccCceeccCCCcceEEEEeCCCCCceeEecccCCc-EEEEeccccccccChhhhccCCccee
Confidence 345677899999999974 688999999999999999996 568999999998 89999975311111122222233444
Q ss_pred EEEeccCccccEEEEEEccCCCE-EEEEeCCCcEEEEecCC
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNW-IMISSSRGTSHLFAINP 411 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~-LAsgS~DgTVhIwdl~~ 411 (832)
+ .++||+ +.|.+++|.|+-.| |++.+.|+.++||+...
T Consensus 367 F-~HgGH~-~kV~DfsWnp~ePW~I~SvaeDN~LqIW~~s~ 405 (422)
T KOG0264|consen 367 F-IHGGHT-AKVSDFSWNPNEPWTIASVAEDNILQIWQMAE 405 (422)
T ss_pred E-EecCcc-cccccccCCCCCCeEEEEecCCceEEEeeccc
Confidence 4 457876 57999999998877 56788899999999863
No 84
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.44 E-value=3.3e-13 Score=149.24 Aligned_cols=177 Identities=13% Similarity=0.207 Sum_probs=138.2
Q ss_pred CCCEEEEEECCC-CcEEEEEeC-CCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcc
Q 003310 115 VPTVVHFYSLRS-QSYVHMLKF-RSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 115 ~~~tVrlWDL~T-g~~V~tL~f-~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
.++.|+||++.. +.|++++.. ..+|.+++|| .++|.++.|..|++||.+||+++..+.+-..+
T Consensus 235 mD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~~------------ 302 (503)
T KOG0282|consen 235 MDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVLSRFHLDKVP------------ 302 (503)
T ss_pred CCceEEEEEEecCcceehhhhcchhhhhhhhccccCCeeeeeecceeeeeeccccceEEEEEecCCCc------------
Confidence 348999999987 899999975 6699999998 47888899999999999999999888653210
Q ss_pred cceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 190 YGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 190 ~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
.. +-|
T Consensus 303 ---~c-----vkf------------------------------------------------------------------- 307 (503)
T KOG0282|consen 303 ---TC-----VKF------------------------------------------------------------------- 307 (503)
T ss_pred ---ee-----eec-------------------------------------------------------------------
Confidence 00 001
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIF 349 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iw 349 (832)
-|+. ...|..+..++.|+.||+++++++..+..|-++|..+.|=++|++++|+|+|++ ++||
T Consensus 308 ~pd~-----------------~n~fl~G~sd~ki~~wDiRs~kvvqeYd~hLg~i~~i~F~~~g~rFissSDdks-~riW 369 (503)
T KOG0282|consen 308 HPDN-----------------QNIFLVGGSDKKIRQWDIRSGKVVQEYDRHLGAILDITFVDEGRRFISSSDDKS-VRIW 369 (503)
T ss_pred CCCC-----------------CcEEEEecCCCcEEEEeccchHHHHHHHhhhhheeeeEEccCCceEeeeccCcc-EEEE
Confidence 0110 002344567889999999999999999999999999999999999999999987 8999
Q ss_pred eCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 350 KIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 350 di~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+.... +.+-+...-. ....-+|+.+|.++|++.-|.|..|-||.+.+
T Consensus 370 e~~~~-------------v~ik~i~~~~--~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~ 416 (503)
T KOG0282|consen 370 ENRIP-------------VPIKNIADPE--MHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVP 416 (503)
T ss_pred EcCCC-------------ccchhhcchh--hccCcceecCCCCCeehhhccCceEEEEeccc
Confidence 99875 1121222211 22367899999999999999999999999765
No 85
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.43 E-value=1.7e-12 Score=134.23 Aligned_cols=176 Identities=14% Similarity=0.135 Sum_probs=126.2
Q ss_pred CCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcc
Q 003310 115 VPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
+++||||||..-++-++|++- .+.||...|++ +++|. +.|+..++||++..-....+..|...
T Consensus 125 WD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~~i~ah~~E------------ 192 (311)
T KOG0277|consen 125 WDGTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPGKFMSIEAHNSE------------ 192 (311)
T ss_pred cCCceEeecCCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCCCceeEEEeccce------------
Confidence 679999999999999999875 67999999984 67776 57889999998754333334333100
Q ss_pred cceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 190 YGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 190 ~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
++.-.|=-|- .
T Consensus 193 ----il~cdw~ky~------------------------------~----------------------------------- 203 (311)
T KOG0277|consen 193 ----ILCCDWSKYN------------------------------H----------------------------------- 203 (311)
T ss_pred ----eEeecccccC------------------------------C-----------------------------------
Confidence 0000010010 0
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNIN 347 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~ 347 (832)
..+++++.|+.|++||+++. .++..+.+|.-.|..|+|||. ..+|||||.|=+ +|
T Consensus 204 ----------------------~vl~Tg~vd~~vr~wDir~~r~pl~eL~gh~~AVRkvk~Sph~~~lLaSasYDmT-~r 260 (311)
T KOG0277|consen 204 ----------------------NVLATGGVDNLVRGWDIRNLRTPLFELNGHGLAVRKVKFSPHHASLLASASYDMT-VR 260 (311)
T ss_pred ----------------------cEEEecCCCceEEEEehhhccccceeecCCceEEEEEecCcchhhHhhhccccce-EE
Confidence 01345678899999999974 588999999999999999996 558999999976 89
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEec
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl 409 (832)
|||..... + ....+.. | .--+..+.||+ +..++|+++=|+++.||+-
T Consensus 261 iw~~~~~d----s-------~~e~~~~---H-tEFv~g~Dws~~~~~~vAs~gWDe~l~Vw~p 308 (311)
T KOG0277|consen 261 IWDPERQD----S-------AIETVDH---H-TEFVCGLDWSLFDPGQVASTGWDELLYVWNP 308 (311)
T ss_pred ecccccch----h-------hhhhhhc---c-ceEEeccccccccCceeeecccccceeeecc
Confidence 99987540 0 1111221 1 12377888987 7889999999999999973
No 86
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.43 E-value=5.3e-13 Score=151.64 Aligned_cols=113 Identities=16% Similarity=0.306 Sum_probs=96.3
Q ss_pred cccCCCCeEEEEECCCC--cEEEE--------ec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccC
Q 003310 295 PDADNVGMVIVRDIVSK--NVIAQ--------FR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACD 363 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~--~~l~~--------~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~ 363 (832)
++++-|+.|.|||+.++ +.++. +. +|..+|.+|+-++.|+.|++|+..+- |++||.++.
T Consensus 134 aSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgtek~-lr~wDprt~--------- 203 (735)
T KOG0308|consen 134 ASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTEKD-LRLWDPRTC--------- 203 (735)
T ss_pred EecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCcccc-eEEeccccc---------
Confidence 34567889999999977 33333 33 78889999999999999999999975 999999886
Q ss_pred CCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 364 AGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 364 ~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
.++.+|+ ||+. .|.++-.++||+.+.++|+||||++||+.-..+..++..|...
T Consensus 204 -----~kimkLr-GHTd-NVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T~~vH~e~ 257 (735)
T KOG0308|consen 204 -----KKIMKLR-GHTD-NVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLATYIVHKEG 257 (735)
T ss_pred -----cceeeee-cccc-ceEEEEEcCCCCeEeecCCCceEEeeeccccceeeeEEeccCc
Confidence 6677884 8875 4999999999999999999999999999999999999999764
No 87
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.42 E-value=1.4e-11 Score=134.93 Aligned_cols=228 Identities=14% Similarity=0.229 Sum_probs=154.8
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
..+|.+|-+...-++|... +....++.+|.-.|..+.+-|.. .++..+
T Consensus 232 ~~ilTGG~d~~av~~d~~s-~q~l~~~~Gh~kki~~v~~~~~~----------------~~v~~a--------------- 279 (506)
T KOG0289|consen 232 SKILTGGEDKTAVLFDKPS-NQILATLKGHTKKITSVKFHKDL----------------DTVITA--------------- 279 (506)
T ss_pred CcceecCCCCceEEEecch-hhhhhhccCcceEEEEEEeccch----------------hheeec---------------
Confidence 4566666666788888775 33445555666666655554421 111121
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEec
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t 173 (832)
+.+..|++|+.-...+...+. +..+|..+... +.++.. +.++.+.+.|++++.++-....
T Consensus 280 ----------------Sad~~i~vws~~~~s~~~~~~~h~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~ 343 (506)
T KOG0289|consen 280 ----------------SADEIIRVWSVPLSSEPTSSRPHEEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSD 343 (506)
T ss_pred ----------------CCcceEEeeccccccCccccccccccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEee
Confidence 123789999998777666665 46789888884 566655 5677888889999987655533
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
-.. .. +|.+ +.++|+
T Consensus 344 ~~s----------~v------------~~ts--------~~fHpD----------------------------------- 358 (506)
T KOG0289|consen 344 ETS----------DV------------EYTS--------AAFHPD----------------------------------- 358 (506)
T ss_pred ccc----------cc------------eeEE--------eeEcCC-----------------------------------
Confidence 110 00 1110 001121
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCC
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSG 333 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG 333 (832)
|. .|.++..+|.|+|||+.++..++.|.+|+++|.+|+|+-+|
T Consensus 359 ----------------------------------gL---ifgtgt~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENG 401 (506)
T KOG0289|consen 359 ----------------------------------GL---IFGTGTPDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENG 401 (506)
T ss_pred ----------------------------------ce---EEeccCCCceEEEEEcCCccccccCCCCCCceeEEEeccCc
Confidence 11 13345678999999999999999999999999999999999
Q ss_pred CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 334 ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 334 ~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.+|||+.+||. |++||++... ..+.+.+.. ...|.+++|.+.|++|++++.|=+|++++-.+
T Consensus 402 Y~Lat~add~~-V~lwDLRKl~------------n~kt~~l~~---~~~v~s~~fD~SGt~L~~~g~~l~Vy~~~k~~ 463 (506)
T KOG0289|consen 402 YWLATAADDGS-VKLWDLRKLK------------NFKTIQLDE---KKEVNSLSFDQSGTYLGIAGSDLQVYICKKKT 463 (506)
T ss_pred eEEEEEecCCe-EEEEEehhhc------------ccceeeccc---cccceeEEEcCCCCeEEeecceeEEEEEeccc
Confidence 99999999987 8999998750 122233322 12489999999999999998888888877443
No 88
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.42 E-value=2.8e-11 Score=139.50 Aligned_cols=233 Identities=13% Similarity=0.194 Sum_probs=150.1
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEE-eCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccccee
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAIC-QAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPL 193 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa-~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~ 193 (832)
+.++||+..|++|++++... -+.+..|- .+.++++ -.+++.+||+.....+-++..|.. ...++
T Consensus 394 ~SikiWn~~t~kciRTi~~~-y~l~~~Fvpgd~~Iv~G~k~Gel~vfdlaS~~l~Eti~AHdg------------aIWsi 460 (888)
T KOG0306|consen 394 ESIKIWNRDTLKCIRTITCG-YILASKFVPGDRYIVLGTKNGELQVFDLASASLVETIRAHDG------------AIWSI 460 (888)
T ss_pred CcEEEEEccCcceeEEeccc-cEEEEEecCCCceEEEeccCCceEEEEeehhhhhhhhhcccc------------ceeee
Confidence 56999999999999999876 44555552 3555554 567999999998887777765531 12345
Q ss_pred eecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee----eeccCccccccccccc
Q 003310 194 AVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI----VNLGDLGYKKLSQYCS 267 (832)
Q Consensus 194 Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi----~~lGd~g~~~ls~y~~ 267 (832)
+++| +.++-++ .+.+|.-|....-.. ..|- ..+-+.. +|
T Consensus 461 ~~~pD~~g~vT~s---------------------------aDktVkfWdf~l~~~-~~gt~~k~lsl~~~r--tL----- 505 (888)
T KOG0306|consen 461 SLSPDNKGFVTGS---------------------------ADKTVKFWDFKLVVS-VPGTQKKVLSLKHTR--TL----- 505 (888)
T ss_pred eecCCCCceEEec---------------------------CCcEEEEEeEEEEec-cCcccceeeeeccce--EE-----
Confidence 5554 1111111 112222111111000 0000 0000000 00
Q ss_pred cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEE
Q 003310 268 EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNIN 347 (832)
Q Consensus 268 ~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~ 347 (832)
.+++..-. .+++|+.+ .++-+--+.+|+||-+.+.+-.-.+.+|.-||.||..|||+++|+|||.|.+ |+
T Consensus 506 -el~ddvL~-v~~Spdgk-------~LaVsLLdnTVkVyflDtlKFflsLYGHkLPV~smDIS~DSklivTgSADKn-VK 575 (888)
T KOG0306|consen 506 -ELEDDVLC-VSVSPDGK-------LLAVSLLDNTVKVYFLDTLKFFLSLYGHKLPVLSMDISPDSKLIVTGSADKN-VK 575 (888)
T ss_pred -eccccEEE-EEEcCCCc-------EEEEEeccCeEEEEEecceeeeeeecccccceeEEeccCCcCeEEeccCCCc-eE
Confidence 01111000 11233322 1233445779999999999999999999999999999999999999999976 99
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCC
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDA 423 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~ 423 (832)
||-+.-| .+++ .+. +|.. .|.++.|-|+...+.+++.|+.|+-||-..+.....+.+|..
T Consensus 576 iWGLdFG------------DCHK--S~f-AHdD-Svm~V~F~P~~~~FFt~gKD~kvKqWDg~kFe~iq~L~~H~~ 635 (888)
T KOG0306|consen 576 IWGLDFG------------DCHK--SFF-AHDD-SVMSVQFLPKTHLFFTCGKDGKVKQWDGEKFEEIQKLDGHHS 635 (888)
T ss_pred Eeccccc------------hhhh--hhh-cccC-ceeEEEEcccceeEEEecCcceEEeechhhhhhheeeccchh
Confidence 9988766 2232 221 2222 389999999999999999999999999999999999999954
No 89
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.41 E-value=9e-13 Score=152.14 Aligned_cols=118 Identities=15% Similarity=0.271 Sum_probs=92.7
Q ss_pred ccccCCCCeEEEEECCC-CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 294 FPDADNVGMVIVRDIVS-KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s-~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
|+++.+.|.+++||++. .++...|.||.+||.|+.|+|++.+||||+.|+. |+|||...+. ...+.
T Consensus 192 F~s~~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~-vkiWd~t~~~------------~~~~~ 258 (839)
T KOG0269|consen 192 FASIHDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKM-VKIWDMTDSR------------AKPKH 258 (839)
T ss_pred EEEecCCceEEEeeccCchhHHHHhhcccCceEEEeecCCCceeeecCCCcc-EEEEeccCCC------------cccee
Confidence 55567789999999985 5677888999999999999999999999999987 9999987541 12233
Q ss_pred EEeccCccccEEEEEEccCCCE-EEEEeC--CCcEEEEecC-CCCCceeeccCCCCccc
Q 003310 373 RLQRGLTNAVIQDISFSDDSNW-IMISSS--RGTSHLFAIN-PLGGSVNFQPTDANFTT 427 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~-LAsgS~--DgTVhIwdl~-~~g~~~~~~~H~~~~~~ 427 (832)
+.. |.+.|..|.|-|+-++ ||+++. |-.|||||+. ||-.-.+|..|++....
T Consensus 259 tIn---Tiapv~rVkWRP~~~~hLAtcsmv~dtsV~VWDvrRPYIP~~t~~eH~~~vt~ 314 (839)
T KOG0269|consen 259 TIN---TIAPVGRVKWRPARSYHLATCSMVVDTSVHVWDVRRPYIPYATFLEHTDSVTG 314 (839)
T ss_pred EEe---ecceeeeeeeccCccchhhhhhccccceEEEEeeccccccceeeeccCccccc
Confidence 442 4577999999997654 666665 6789999996 55555788889876653
No 90
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.40 E-value=6e-13 Score=140.79 Aligned_cols=209 Identities=14% Similarity=0.206 Sum_probs=149.8
Q ss_pred CCCCCEEEEEECCCCcEEEEEeC---------CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEEEe-cCCCccC
Q 003310 113 SSVPTVVHFYSLRSQSYVHMLKF---------RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYAIL-TNPIVMG 179 (832)
Q Consensus 113 ~~~~~tVrlWDL~Tg~~V~tL~f---------~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~tl~-t~~~~~~ 179 (832)
.+.++-|.+||..+|+..+.|++ ..+|.+|.|++ ..||. +.|++|+||.+.|+.|++.+. .|..
T Consensus 231 gSvDGFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~FSRDsEMlAsGsqDGkIKvWri~tG~ClRrFdrAHtk--- 307 (508)
T KOG0275|consen 231 GSVDGFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCISFSRDSEMLASGSQDGKIKVWRIETGQCLRRFDRAHTK--- 307 (508)
T ss_pred ccccceeeeehhccchhhhhhhhhhhcceeecccceEEEeecccHHHhhccCcCCcEEEEEEecchHHHHhhhhhcc---
Confidence 34679999999999999888863 57899999985 67887 689999999999999988775 2311
Q ss_pred CCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccc
Q 003310 180 HPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGY 259 (832)
Q Consensus 180 ~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~ 259 (832)
++ . .|.|+- +.
T Consensus 308 ---------Gv--t-----~l~FSr-------------------------------------D~---------------- 318 (508)
T KOG0275|consen 308 ---------GV--T-----CLSFSR-------------------------------------DN---------------- 318 (508)
T ss_pred ---------Ce--e-----EEEEcc-------------------------------------Cc----------------
Confidence 11 1 122220 00
Q ss_pred cccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEE
Q 003310 260 KKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTA 339 (832)
Q Consensus 260 ~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATa 339 (832)
.++.+++.|.+|+|--+.+|+++..|++|++.|+...|++||..+.+|
T Consensus 319 --------------------------------SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyvn~a~ft~dG~~iisa 366 (508)
T KOG0275|consen 319 --------------------------------SQILSASFDQTVRIHGLKSGKCLKEFRGHSSYVNEATFTDDGHHIISA 366 (508)
T ss_pred --------------------------------chhhcccccceEEEeccccchhHHHhcCccccccceEEcCCCCeEEEe
Confidence 012345667899999999999999999999999999999999999999
Q ss_pred EcCCCEEEEEeCCCCCCCCC----CccCC-------CCceeEEE-------------------EEeccCc-cccEEEEEE
Q 003310 340 SVQGHNINIFKIIPGILGTS----SACDA-------GTSYVHLY-------------------RLQRGLT-NAVIQDISF 388 (832)
Q Consensus 340 S~dGt~I~Iwdi~t~~~~~~----s~~~~-------~~~~~~l~-------------------~l~rG~t-~a~I~~IaF 388 (832)
|.||+ |+||+.++..+-.+ +.+-+ ..+..|.. .+..|.. .....+++.
T Consensus 367 SsDgt-vkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsntv~imn~qGQvVrsfsSGkREgGdFi~~~l 445 (508)
T KOG0275|consen 367 SSDGT-VKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVRSFSSGKREGGDFINAIL 445 (508)
T ss_pred cCCcc-EEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCCeEEEEeccceEEeeeccCCccCCceEEEEe
Confidence 99998 89999887632111 00000 00011111 1111111 112456789
Q ss_pred ccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 389 SDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 389 SpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
||-|.|+.+.+.|+.+.-|.+..++-+.++.-|...+.
T Consensus 446 SpkGewiYcigED~vlYCF~~~sG~LE~tl~VhEkdvI 483 (508)
T KOG0275|consen 446 SPKGEWIYCIGEDGVLYCFSVLSGKLERTLPVHEKDVI 483 (508)
T ss_pred cCCCcEEEEEccCcEEEEEEeecCceeeeeeccccccc
Confidence 99999999999999999999998888888877754433
No 91
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.40 E-value=7.3e-12 Score=139.36 Aligned_cols=111 Identities=20% Similarity=0.245 Sum_probs=82.6
Q ss_pred ccccCCCCeEEEEECCCC--cEEEEe-ccCCC--CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 294 FPDADNVGMVIVRDIVSK--NVIAQF-RAHKS--PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~--~~l~~~-~aH~~--pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
++.+..||.|.+||..+. .+...+ +||.. .|+||+||+||++|++-+.|++ +++||++.. .
T Consensus 332 iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~t-LKvWDLrq~-------------k 397 (641)
T KOG0772|consen 332 IAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDT-LKVWDLRQF-------------K 397 (641)
T ss_pred hhhcccCCceeeeecCCcccccceEeeeccCCCCceeEEEeccccchhhhccCCCc-eeeeecccc-------------c
Confidence 345678899999998653 233333 49987 8999999999999999999988 899999865 1
Q ss_pred eEEEEEeccCc-cccEEEEEEccCCCEEEEEeC------CCcEEEEecCCCCCceeec
Q 003310 369 VHLYRLQRGLT-NAVIQDISFSDDSNWIMISSS------RGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 369 ~~l~~l~rG~t-~a~I~~IaFSpDg~~LAsgS~------DgTVhIwdl~~~g~~~~~~ 419 (832)
..|... -|.. ...-.++|||||.++|++|++ -|++.+||-.++.....+.
T Consensus 398 kpL~~~-tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t~d~v~ki~ 454 (641)
T KOG0772|consen 398 KPLNVR-TGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRMTLDTVYKID 454 (641)
T ss_pred cchhhh-cCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEeccceeeEEEec
Confidence 222222 2222 223578999999999999876 4678899988887666654
No 92
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.40 E-value=4.8e-12 Score=133.71 Aligned_cols=105 Identities=16% Similarity=0.263 Sum_probs=89.6
Q ss_pred CeEEEEECCCCcEEEEec---cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 301 GMVIVRDIVSKNVIAQFR---AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~---aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
-++++||+.+.++...-. .|++.|+++.+|+.|++.+|||.||. |+|||--.. +++.++.+.
T Consensus 238 p~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t~~lYvTaSkDG~-IklwDGVS~--------------rCv~t~~~A 302 (430)
T KOG0640|consen 238 PTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSSTGSLYVTASKDGA-IKLWDGVSN--------------RCVRTIGNA 302 (430)
T ss_pred CceeEEeccceeEeeecCcccccccceeEEEecCCccEEEEeccCCc-EEeeccccH--------------HHHHHHHhh
Confidence 369999999988755433 78999999999999999999999998 999996554 556666666
Q ss_pred CccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeecc
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~ 420 (832)
+..+.|.+..|+.+|+||.+++.|.++++|.|.++.......+
T Consensus 303 H~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~l~~YtG 345 (430)
T KOG0640|consen 303 HGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTGRMLKEYTG 345 (430)
T ss_pred cCCceeeeEEEccCCeEEeecCCcceeeeeeecCCceEEEEec
Confidence 7777899999999999999999999999999999877666644
No 93
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.39 E-value=4.4e-11 Score=124.71 Aligned_cols=188 Identities=16% Similarity=0.191 Sum_probs=136.4
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEE-EEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVA-ICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LA-Va~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
.+..+|=-..|+.+-++.. ...|+++.++ .+.|+ -+.|.++++||..+++++.++.+..-+..+ -
T Consensus 32 ~~~~vw~s~nGerlGty~GHtGavW~~Did~~s~~liTGSAD~t~kLWDv~tGk~la~~k~~~~Vk~~-----------~ 100 (327)
T KOG0643|consen 32 STPTVWYSLNGERLGTYDGHTGAVWCCDIDWDSKHLITGSADQTAKLWDVETGKQLATWKTNSPVKRV-----------D 100 (327)
T ss_pred CCceEEEecCCceeeeecCCCceEEEEEecCCcceeeeccccceeEEEEcCCCcEEEEeecCCeeEEE-----------e
Confidence 4567888888999999975 6789998885 34444 478899999999999999998763211000 0
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
|..+...+.++.++
T Consensus 101 F~~~gn~~l~~tD~------------------------------------------------------------------ 114 (327)
T KOG0643|consen 101 FSFGGNLILASTDK------------------------------------------------------------------ 114 (327)
T ss_pred eccCCcEEEEEehh------------------------------------------------------------------
Confidence 11111111111000
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECC-------CCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCE
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIV-------SKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHN 345 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~-------s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~ 345 (832)
..+..+.|.++|++ +..++..+..+.+.|+.+-|+|-|+.|++|..||.
T Consensus 115 -----------------------~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~ii~Ghe~G~- 170 (327)
T KOG0643|consen 115 -----------------------QMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSALWGPLGETIIAGHEDGS- 170 (327)
T ss_pred -----------------------hcCcceEEEEEEccCChhhhcccCceEEecCCccceeeeeecccCCEEEEecCCCc-
Confidence 01234567777776 56678888889999999999999999999999998
Q ss_pred EEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeecc
Q 003310 346 INIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 346 I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~ 420 (832)
|.+||++++ ..+..-.+-+ .+.|++|.||+|..+++++|.|.|.++||..+....-++.+
T Consensus 171 is~~da~~g--------------~~~v~s~~~h-~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v~Kty~t 230 (327)
T KOG0643|consen 171 ISIYDARTG--------------KELVDSDEEH-SSKINDLQFSRDRTYFITGSKDTTAKLVDVRTLEVLKTYTT 230 (327)
T ss_pred EEEEEcccC--------------ceeeechhhh-ccccccccccCCcceEEecccCccceeeeccceeeEEEeee
Confidence 999999987 2222222222 34699999999999999999999999999998766666654
No 94
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.38 E-value=1.2e-11 Score=141.44 Aligned_cols=214 Identities=17% Similarity=0.332 Sum_probs=144.5
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc-----CCEEEEE-eCCEEEEEECC-CCceEEEEecCCCccCCCCCCCCCc
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS-----SRVVAIC-QAAQVHCFDAA-TLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S-----~r~LAVa-~~~~I~vwDl~-t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
+++|+|||..-++...++- .+.|.++.++ .++||.+ -+.-|+|||+. ...++++|.+|+...+
T Consensus 481 GnlrVy~Lq~l~~~~~~eAHesEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~rny~l~qtld~HSssIT--------- 551 (1080)
T KOG1408|consen 481 GNLRVYDLQELEYTCFMEAHESEILCLEYSFPVLTNKLLASASRDRLIHVYDVKRNYDLVQTLDGHSSSIT--------- 551 (1080)
T ss_pred CceEEEEehhhhhhhheecccceeEEEeecCchhhhHhhhhccCCceEEEEecccccchhhhhccccccee---------
Confidence 7899999998888877764 7899999997 3678875 57789999986 4667788888753111
Q ss_pred ccceeeecc---ceEEeeCCCcee------cCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccc
Q 003310 189 GYGPLAVGP---RWLAYSGSPVVV------SNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGY 259 (832)
Q Consensus 189 ~~~p~Alg~---r~LAya~~~~~~------s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~ 259 (832)
.-.||-+. +.|....++.+. ...|++.|.+ ..++ +|+ .|-|+.
T Consensus 552 -svKFa~~gln~~MiscGADksimFr~~qk~~~g~~f~r~-------------t~t~-------~kt------TlYDm~- 603 (1080)
T KOG1408|consen 552 -SVKFACNGLNRKMISCGADKSIMFRVNQKASSGRLFPRH-------------TQTL-------SKT------TLYDMA- 603 (1080)
T ss_pred -EEEEeecCCceEEEeccCchhhheehhccccCceecccc-------------cccc-------ccc------eEEEee-
Confidence 00121111 223322222111 1112211110 0000 000 001110
Q ss_pred cccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec---cCCCCeEEEEEcCCCCEE
Q 003310 260 KKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR---AHKSPISALCFDPSGILL 336 (832)
Q Consensus 260 ~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~---aH~~pIs~LaFSPdG~lL 336 (832)
+-|.+ ++.+++..|..|+|||+.+++.++.|+ .|.+..-.+..+|+|-||
T Consensus 604 --------------------Vdp~~-------k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~ 656 (1080)
T KOG1408|consen 604 --------------------VDPTS-------KLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYL 656 (1080)
T ss_pred --------------------eCCCc-------ceEEEEecccceEEEeccccceeeeecccccCCCceEEEEECCCccEE
Confidence 11222 245677899999999999999999998 466778889999999999
Q ss_pred EEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 337 VTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 337 ATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
||...|.+ +.+||..++ +++.++ .||.. -|..+-|++|.++|++.+.||.|.||.+..
T Consensus 657 atScsdkt-l~~~Df~sg--------------EcvA~m-~GHsE-~VTG~kF~nDCkHlISvsgDgCIFvW~lp~ 714 (1080)
T KOG1408|consen 657 ATSCSDKT-LCFVDFVSG--------------ECVAQM-TGHSE-AVTGVKFLNDCKHLISVSGDGCIFVWKLPL 714 (1080)
T ss_pred EEeecCCc-eEEEEeccc--------------hhhhhh-cCcch-heeeeeecccchhheeecCCceEEEEECch
Confidence 99999977 899999887 444444 35543 399999999999999999999999999954
No 95
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.37 E-value=3.1e-10 Score=115.77 Aligned_cols=240 Identities=16% Similarity=0.174 Sum_probs=157.7
Q ss_pred CcEEEEEecC-CeEEEEecc--CC--CeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCcccc
Q 003310 18 RRVLLLGYRS-GFQVWDVEE--AD--NVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQ 92 (832)
Q Consensus 18 ~~vLl~Gy~~-G~qVWdv~~--~~--~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~ 92 (832)
..+++.|... .+++.-.+. .+ .-.--++-|||.||.+.|+..|.. ..-+|+-.+.
T Consensus 101 geliatgsndk~ik~l~fn~dt~~~~g~dle~nmhdgtirdl~fld~~~s----------~~~il~s~ga---------- 160 (350)
T KOG0641|consen 101 GELIATGSNDKTIKVLPFNADTCNATGHDLEFNMHDGTIRDLAFLDDPES----------GGAILASAGA---------- 160 (350)
T ss_pred cCeEEecCCCceEEEEecccccccccCcceeeeecCCceeeeEEecCCCc----------CceEEEecCC----------
Confidence 4556666543 366654432 11 112234568999999999976521 1123332221
Q ss_pred CCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEE-EcCCEEEE-EeCCEEEEEECCCCceEE
Q 003310 93 DGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVR-CSSRVVAI-CQAAQVHCFDAATLEIEY 169 (832)
Q Consensus 93 Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~-~S~r~LAV-a~~~~I~vwDl~t~~~~~ 169 (832)
-++.|.+-|-.+|+..+.+.. ...|.++- ++.-++|. +++.+|++||++-..+..
T Consensus 161 ----------------------gdc~iy~tdc~~g~~~~a~sghtghilalyswn~~m~~sgsqdktirfwdlrv~~~v~ 218 (350)
T KOG0641|consen 161 ----------------------GDCKIYITDCGRGQGFHALSGHTGHILALYSWNGAMFASGSQDKTIRFWDLRVNSCVN 218 (350)
T ss_pred ----------------------CcceEEEeecCCCCcceeecCCcccEEEEEEecCcEEEccCCCceEEEEeeeccceee
Confidence 127888999999999998875 56777764 47777777 678899999999877777
Q ss_pred EEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecc-ccccee
Q 003310 170 AILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKE-SSKHLA 248 (832)
Q Consensus 170 tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~-ssk~la 248 (832)
++.+. +.+. | +.. +.|+.+|.+ +++
T Consensus 219 ~l~~~-----~~~~-g--les-------------------------------------------savaav~vdpsgr--- 244 (350)
T KOG0641|consen 219 TLDND-----FHDG-G--LES-------------------------------------------SAVAAVAVDPSGR--- 244 (350)
T ss_pred eccCc-----ccCC-C--ccc-------------------------------------------ceeEEEEECCCcc---
Confidence 77442 1000 0 000 011111111 111
Q ss_pred ceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEE
Q 003310 249 AGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALC 328 (832)
Q Consensus 249 sGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~La 328 (832)
.++++..|....+||++.++++..|..|+..|.|+.
T Consensus 245 --------------------------------------------ll~sg~~dssc~lydirg~r~iq~f~phsadir~vr 280 (350)
T KOG0641|consen 245 --------------------------------------------LLASGHADSSCMLYDIRGGRMIQRFHPHSADIRCVR 280 (350)
T ss_pred --------------------------------------------eeeeccCCCceEEEEeeCCceeeeeCCCccceeEEE
Confidence 123344566789999999999999999999999999
Q ss_pred EcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 329 FDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 329 FSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
|||...||+|+|.|-. |++-|+.-.. .+.|-.+--+.+.-++..+-|.|..--+.+.|.|.|+.+|.
T Consensus 281 fsp~a~yllt~syd~~-ikltdlqgdl------------a~el~~~vv~ehkdk~i~~rwh~~d~sfisssadkt~tlwa 347 (350)
T KOG0641|consen 281 FSPGAHYLLTCSYDMK-IKLTDLQGDL------------AHELPIMVVAEHKDKAIQCRWHPQDFSFISSSADKTATLWA 347 (350)
T ss_pred eCCCceEEEEecccce-EEEeecccch------------hhcCceEEEEeccCceEEEEecCccceeeeccCcceEEEec
Confidence 9999999999999965 9999886430 11222222233333466688999888889999999999998
Q ss_pred cC
Q 003310 409 IN 410 (832)
Q Consensus 409 l~ 410 (832)
++
T Consensus 348 ~~ 349 (350)
T KOG0641|consen 348 LN 349 (350)
T ss_pred cC
Confidence 85
No 96
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.37 E-value=7e-12 Score=136.53 Aligned_cols=224 Identities=15% Similarity=0.260 Sum_probs=154.6
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEee-ecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVS-RYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS-~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
+.+|++|++.-+.+||+++ |+.+.+.. .+.-.|.++.+.|+. |+ ++.++
T Consensus 282 ryLlaCg~~e~~~lwDv~t-gd~~~~y~~~~~~S~~sc~W~pDg---------~~-------~V~Gs------------- 331 (519)
T KOG0293|consen 282 RYLLACGFDEVLSLWDVDT-GDLRHLYPSGLGFSVSSCAWCPDG---------FR-------FVTGS------------- 331 (519)
T ss_pred CeEEecCchHheeeccCCc-chhhhhcccCcCCCcceeEEccCC---------ce-------eEecC-------------
Confidence 8889999999999999997 66666665 334567777888743 21 23432
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC--CCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF--RSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f--~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl 171 (832)
+++++..||+... .+...+. .-.|++++++ +.+++|+.+..|++|+..+..+...+
T Consensus 332 ------------------~dr~i~~wdlDgn-~~~~W~gvr~~~v~dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~li 392 (519)
T KOG0293|consen 332 ------------------PDRTIIMWDLDGN-ILGNWEGVRDPKVHDLAITYDGKYVLLVTVDKKIRLYNREARVDRGLI 392 (519)
T ss_pred ------------------CCCcEEEecCCcc-hhhcccccccceeEEEEEcCCCcEEEEEecccceeeechhhhhhhccc
Confidence 2277899998643 3444433 2368999986 35677889999999999887654333
Q ss_pred ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceecee
Q 003310 172 LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGI 251 (832)
Q Consensus 172 ~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi 251 (832)
.++.. +..+.++ .++..+ +
T Consensus 393 se~~~-------------its~~iS----------------------------------~d~k~~--------------L 411 (519)
T KOG0293|consen 393 SEEQP-------------ITSFSIS----------------------------------KDGKLA--------------L 411 (519)
T ss_pred cccCc-------------eeEEEEc----------------------------------CCCcEE--------------E
Confidence 33210 0011110 011110 0
Q ss_pred eeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCC--eEEEEE
Q 003310 252 VNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSP--ISALCF 329 (832)
Q Consensus 252 ~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~p--Is~LaF 329 (832)
..-.+..|++||++..+.+..+.+|+.. |-.-||
T Consensus 412 --------------------------------------------vnL~~qei~LWDl~e~~lv~kY~Ghkq~~fiIrSCF 447 (519)
T KOG0293|consen 412 --------------------------------------------VNLQDQEIHLWDLEENKLVRKYFGHKQGHFIIRSCF 447 (519)
T ss_pred --------------------------------------------EEcccCeeEEeecchhhHHHHhhcccccceEEEecc
Confidence 0012347999999999999999999854 445578
Q ss_pred cC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEE
Q 003310 330 DP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLF 407 (832)
Q Consensus 330 SP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIw 407 (832)
-- +.+++|+||.|+. |.||+-..+ ..+..| .||.. .|++|+|+| |-.++|++|+||||+||
T Consensus 448 gg~~~~fiaSGSED~k-vyIWhr~sg--------------kll~~L-sGHs~-~vNcVswNP~~p~m~ASasDDgtIRIW 510 (519)
T KOG0293|consen 448 GGGNDKFIASGSEDSK-VYIWHRISG--------------KLLAVL-SGHSK-TVNCVSWNPADPEMFASASDDGTIRIW 510 (519)
T ss_pred CCCCcceEEecCCCce-EEEEEccCC--------------ceeEee-cCCcc-eeeEEecCCCCHHHhhccCCCCeEEEe
Confidence 55 5589999999987 999998876 455566 57654 499999999 56679999999999999
Q ss_pred ecCCC
Q 003310 408 AINPL 412 (832)
Q Consensus 408 dl~~~ 412 (832)
...+.
T Consensus 511 g~~~~ 515 (519)
T KOG0293|consen 511 GPSDN 515 (519)
T ss_pred cCCcc
Confidence 98765
No 97
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.36 E-value=1.1e-10 Score=139.70 Aligned_cols=289 Identities=17% Similarity=0.276 Sum_probs=169.3
Q ss_pred CcEEEEE--ecCCeEEEEeccCC----C----eeEEe---eecCCCEEEEEEecCCCccc-ccCCcccccCCEEEEEeCC
Q 003310 18 RRVLLLG--YRSGFQVWDVEEAD----N----VHDLV---SRYDGPVSFMQMLPRPITSK-RSRDKFAEVRPLLVFCADG 83 (832)
Q Consensus 18 ~~vLl~G--y~~G~qVWdv~~~~----~----~~ell---S~~dG~Vr~v~ilp~p~~~~-~~~d~f~~~rPLLavv~~g 83 (832)
.++-..| ++.|..||+.+.-. + +.+.+ ..|+|+|.|++|.|++.... ++.|. ++-++.-.
T Consensus 26 ~~~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~dG~~lAsGSDD~------~v~iW~~~ 99 (942)
T KOG0973|consen 26 VKFATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVNCVRFSPDGSYLASGSDDR------LVMIWERA 99 (942)
T ss_pred eeEecCCccccccceeeccccccchhhhhhcccchhheeeccccCceeEEEECCCCCeEeeccCcc------eEEEeeec
Confidence 4555666 78888899987532 1 12222 36899999999999864311 11111 22222211
Q ss_pred CCccCccccCCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEE
Q 003310 84 SRSCGTKVQDGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHC 159 (832)
Q Consensus 84 ~~~g~~~~~Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~v 159 (832)
. .+ .+..+. ..|.. ..| +.=+++..|. +++.|.+|.++ ..+||. +.|++|.|
T Consensus 100 ~-~~-------~~~~fg-------s~g~~----~~v-----E~wk~~~~l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsVii 155 (942)
T KOG0973|consen 100 E-IG-------SGTVFG-------STGGA----KNV-----ESWKVVSILRGHDSDVLDVNWSPDDSLLVSVSLDNSVII 155 (942)
T ss_pred c-cC-------Cccccc-------ccccc----ccc-----ceeeEEEEEecCCCccceeccCCCccEEEEecccceEEE
Confidence 0 00 000000 00000 112 2223566665 47899999998 467776 78999999
Q ss_pred EECCCCceEEEEecCCCccCCCCCCCCCcccceeeecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCccee
Q 003310 160 FDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVA 237 (832)
Q Consensus 160 wDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~ 237 (832)
||+.|++++..+.+|.... .-+++.| +|||.-+ +++++.
T Consensus 156 wn~~tF~~~~vl~~H~s~V------------KGvs~DP~Gky~ASqs---------------------------dDrtik 196 (942)
T KOG0973|consen 156 WNAKTFELLKVLRGHQSLV------------KGVSWDPIGKYFASQS---------------------------DDRTLK 196 (942)
T ss_pred Eccccceeeeeeecccccc------------cceEECCccCeeeeec---------------------------CCceEE
Confidence 9999999999999985321 1144444 7777653 233333
Q ss_pred eeecc---cccceeceeeeccCccccccccccc--cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCc
Q 003310 238 HYAKE---SSKHLAAGIVNLGDLGYKKLSQYCS--EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKN 312 (832)
Q Consensus 238 ~~A~~---ssk~lasGi~~lGd~g~~~ls~y~~--~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~ 312 (832)
-|... ..|.+.. .+.+. .++.|.. +..|||+- +.+++.. -+..-++.|.+-.+.+
T Consensus 197 vwrt~dw~i~k~It~---pf~~~---~~~T~f~RlSWSPDG~~-las~nA~-------------n~~~~~~~IieR~tWk 256 (942)
T KOG0973|consen 197 VWRTSDWGIEKSITK---PFEES---PLTTFFLRLSWSPDGHH-LASPNAV-------------NGGKSTIAIIERGTWK 256 (942)
T ss_pred EEEcccceeeEeecc---chhhC---CCcceeeecccCCCcCe-ecchhhc-------------cCCcceeEEEecCCce
Confidence 33311 1111111 01111 1111221 23455421 1111100 1122368888888888
Q ss_pred EEEEeccCCCCeEEEEEcCC--------CC---------EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 313 VIAQFRAHKSPISALCFDPS--------GI---------LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 313 ~l~~~~aH~~pIs~LaFSPd--------G~---------lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
.-..|-+|..|+.+++|+|. |+ .+|+||.|++ |-||..... +.+...+
T Consensus 257 ~~~~LvGH~~p~evvrFnP~lfe~~~~ng~~~~~~~~y~i~AvgSqDrS-lSVW~T~~~--------------RPl~vi~ 321 (942)
T KOG0973|consen 257 VDKDLVGHSAPVEVVRFNPKLFERNNKNGTSTQPNCYYCIAAVGSQDRS-LSVWNTALP--------------RPLFVIH 321 (942)
T ss_pred eeeeeecCCCceEEEEeChHHhccccccCCccCCCcceEEEEEecCCcc-EEEEecCCC--------------Cchhhhh
Confidence 88889999999999999982 11 5789999987 899987543 2222221
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
- .....|.|++|||||.-|.++|.||||.++.++.
T Consensus 322 ~-lf~~SI~DmsWspdG~~LfacS~DGtV~~i~Fee 356 (942)
T KOG0973|consen 322 N-LFNKSIVDMSWSPDGFSLFACSLDGTVALIHFEE 356 (942)
T ss_pred h-hhcCceeeeeEcCCCCeEEEEecCCeEEEEEcch
Confidence 1 1234599999999999999999999999998864
No 98
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.36 E-value=8e-11 Score=125.07 Aligned_cols=96 Identities=20% Similarity=0.273 Sum_probs=65.6
Q ss_pred cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC----------CCCCc--cC----------------
Q 003310 312 NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL----------GTSSA--CD---------------- 363 (832)
Q Consensus 312 ~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~----------~~~s~--~~---------------- 363 (832)
+.+.+|++|.+.|.++||||+.+.++|+|.||+ +||||+.-.+. ++... ++
T Consensus 269 ~rvf~LkGH~saV~~~aFsn~S~r~vtvSkDG~-wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~RL~lsP~g~~lA 347 (420)
T KOG2096|consen 269 KRVFSLKGHQSAVLAAAFSNSSTRAVTVSKDGK-WRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPVRLELSPSGDSLA 347 (420)
T ss_pred hhhheeccchhheeeeeeCCCcceeEEEecCCc-EEEeeccceEecCCCchHhhcCCcchhhcCCCceEEEeCCCCcEEE
Confidence 345678999999999999999999999999998 89999764321 11000 00
Q ss_pred -CCCceeEEEEEeccCc--------cccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 364 -AGTSYVHLYRLQRGLT--------NAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 364 -~~~~~~~l~~l~rG~t--------~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
..++-.+++.-++|.. ...|.+|+|++||+++|+++ |+-++|+.-
T Consensus 348 ~s~gs~l~~~~se~g~~~~~~e~~h~~~Is~is~~~~g~~~atcG-dr~vrv~~n 401 (420)
T KOG2096|consen 348 VSFGSDLKVFASEDGKDYPELEDIHSTTISSISYSSDGKYIATCG-DRYVRVIRN 401 (420)
T ss_pred eecCCceEEEEcccCccchhHHHhhcCceeeEEecCCCcEEeeec-ceeeeeecC
Confidence 0112233444444422 23489999999999998887 556777653
No 99
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.35 E-value=1.9e-10 Score=119.67 Aligned_cols=163 Identities=14% Similarity=0.240 Sum_probs=125.6
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccccee
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPL 193 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~ 193 (832)
..|-+.|.++.+.+++.+|+-.+..+.++ ..++.. ..-++|.|.....++..++|..||. +.+
T Consensus 128 D~it~id~r~~~~~~~~~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsypsLkpv~si~AH~s--------------nCi 193 (313)
T KOG1407|consen 128 DRITFIDARTYKIVNEEQFKFEVNEISWNNSNDLFFLTNGLGCVEILSYPSLKPVQSIKAHPS--------------NCI 193 (313)
T ss_pred ccEEEEEecccceeehhcccceeeeeeecCCCCEEEEecCCceEEEEeccccccccccccCCc--------------ceE
Confidence 56889999999999999999888888885 344433 4457999998888999999988852 122
Q ss_pred eeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCCC
Q 003310 194 AVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDS 273 (832)
Q Consensus 194 Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~ 273 (832)
+ |++.+ .|+
T Consensus 194 c-----I~f~p---------------------------~Gr--------------------------------------- 202 (313)
T KOG1407|consen 194 C-----IEFDP---------------------------DGR--------------------------------------- 202 (313)
T ss_pred E-----EEECC---------------------------CCc---------------------------------------
Confidence 2 12210 111
Q ss_pred cCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCC
Q 003310 274 QNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIP 353 (832)
Q Consensus 274 ~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t 353 (832)
.|+.+..|..|.+||+...-++..|..|.-||..|.||.||++||+||.| +.|-|=++.+
T Consensus 203 -------------------yfA~GsADAlvSLWD~~ELiC~R~isRldwpVRTlSFS~dg~~lASaSED-h~IDIA~vet 262 (313)
T KOG1407|consen 203 -------------------YFATGSADALVSLWDVDELICERCISRLDWPVRTLSFSHDGRMLASASED-HFIDIAEVET 262 (313)
T ss_pred -------------------eEeeccccceeeccChhHhhhheeeccccCceEEEEeccCcceeeccCcc-ceEEeEeccc
Confidence 13334566789999999999999999999999999999999999999999 5588888877
Q ss_pred CCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 354 GILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 354 ~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
| .++++.+ +.+.-..|||.|...+||-+.+|
T Consensus 263 G--------------d~~~eI~---~~~~t~tVAWHPk~~LLAyA~dd 293 (313)
T KOG1407|consen 263 G--------------DRVWEIP---CEGPTFTVAWHPKRPLLAYACDD 293 (313)
T ss_pred C--------------CeEEEee---ccCCceeEEecCCCceeeEEecC
Confidence 6 4556653 23457899999999999988876
No 100
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.34 E-value=1.4e-12 Score=149.51 Aligned_cols=164 Identities=15% Similarity=0.202 Sum_probs=127.0
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++||||||..++.+++|-. ...+.+|.|++ .+.|. +.+..+.+||++..-|.++...|+- .+
T Consensus 92 gtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStdtd~~iwD~Rk~Gc~~~~~s~~~------------vv-- 157 (825)
T KOG0267|consen 92 GTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYGEFFASGSTDTDLKIWDIRKKGCSHTYKSHTR------------VV-- 157 (825)
T ss_pred CceeeeehhhhhhhhhhhccccCcceeeeccceEEeccccccccceehhhhccCceeeecCCcc------------ee--
Confidence 8999999999999999975 67899999985 56665 6788999999998888888876532 11
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
+.|++.+ +|.++
T Consensus 158 -----~~l~lsP---------------------------~Gr~v------------------------------------ 169 (825)
T KOG0267|consen 158 -----DVLRLSP---------------------------DGRWV------------------------------------ 169 (825)
T ss_pred -----EEEeecC---------------------------CCcee------------------------------------
Confidence 2233321 22221
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
++++.|..|+|||+..++.+..|+.|...|.++.|.|.--+|++||.|++ +++||+.
T Consensus 170 ----------------------~~g~ed~tvki~d~~agk~~~ef~~~e~~v~sle~hp~e~Lla~Gs~d~t-v~f~dle 226 (825)
T KOG0267|consen 170 ----------------------ASGGEDNTVKIWDLTAGKLSKEFKSHEGKVQSLEFHPLEVLLAPGSSDRT-VRFWDLE 226 (825)
T ss_pred ----------------------eccCCcceeeeecccccccccccccccccccccccCchhhhhccCCCCce-eeeeccc
Confidence 12345779999999999999999999999999999999999999999987 8999998
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
+. ..+-.. .+. ...|.+.+|+||++.+++|-..
T Consensus 227 tf--------------e~I~s~-~~~-~~~v~~~~fn~~~~~~~~G~q~ 259 (825)
T KOG0267|consen 227 TF--------------EVISSG-KPE-TDGVRSLAFNPDGKIVLSGEQI 259 (825)
T ss_pred ee--------------EEeecc-CCc-cCCceeeeecCCceeeecCchh
Confidence 75 222222 111 3459999999999998887554
No 101
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.33 E-value=6.6e-11 Score=124.60 Aligned_cols=110 Identities=16% Similarity=0.236 Sum_probs=83.1
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCc-cCCCCceeEEEEEe-
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSA-CDAGTSYVHLYRLQ- 375 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~-~~~~~~~~~l~~l~- 375 (832)
.+-.|++.|+.+|..-++|.+|...|.++.|+|... .|||||.||. ||+||++.. .+.-.. +.-++ .....++
T Consensus 166 r~~~VrLCDi~SGs~sH~LsGHr~~vlaV~Wsp~~e~vLatgsaDg~-irlWDiRra-sgcf~~lD~hn~--k~~p~~~~ 241 (397)
T KOG4283|consen 166 RDVQVRLCDIASGSFSHTLSGHRDGVLAVEWSPSSEWVLATGSADGA-IRLWDIRRA-SGCFRVLDQHNT--KRPPILKT 241 (397)
T ss_pred CCCcEEEEeccCCcceeeeccccCceEEEEeccCceeEEEecCCCce-EEEEEeecc-cceeEEeecccC--ccCccccc
Confidence 345799999999999999999999999999999887 5899999998 999999864 111000 00000 1111111
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
+-.+..+|..+||+.|+.++++.+.|..+++|....+
T Consensus 242 n~ah~gkvngla~tSd~~~l~~~gtd~r~r~wn~~~G 278 (397)
T KOG4283|consen 242 NTAHYGKVNGLAWTSDARYLASCGTDDRIRVWNMESG 278 (397)
T ss_pred cccccceeeeeeecccchhhhhccCccceEEeecccC
Confidence 2223456999999999999999999999999998764
No 102
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.32 E-value=4.4e-10 Score=119.70 Aligned_cols=97 Identities=11% Similarity=0.246 Sum_probs=81.6
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
.|-+|+..++.+.+.++.|.+|...|+.|+.+|-+..++++|.|++ ||+||++.. .+..+..+. +
T Consensus 78 ~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~d~FlS~S~D~t-vrLWDlR~~------------~cqg~l~~~-~- 142 (311)
T KOG1446|consen 78 EDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKDDTFLSSSLDKT-VRLWDLRVK------------KCQGLLNLS-G- 142 (311)
T ss_pred CCCceEEEEeecCceEEEcCCCCceEEEEEecCCCCeEEecccCCe-EEeeEecCC------------CCceEEecC-C-
Confidence 4568999999999999999999999999999999999999999987 999999964 123333332 1
Q ss_pred ccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 379 TNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 379 t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
. ..+||.|+|-++|++.....|+|||+..++.
T Consensus 143 ~----pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dk 174 (311)
T KOG1446|consen 143 R----PIAAFDPEGLIFALANGSELIKLYDLRSFDK 174 (311)
T ss_pred C----cceeECCCCcEEEEecCCCeEEEEEecccCC
Confidence 1 2489999999999999999999999998843
No 103
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.32 E-value=2e-10 Score=135.29 Aligned_cols=100 Identities=15% Similarity=0.281 Sum_probs=81.2
Q ss_pred cCCCCeEEEEECCCCcEEEEecc-------C-CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRA-------H-KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~a-------H-~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
...+|.|+|||+.++.+..++.. - ...+..++|+|+|..||..+.|+. |++|+....
T Consensus 156 ss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~-Vkvy~r~~w-------------- 220 (933)
T KOG1274|consen 156 SSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNT-VKVYSRKGW-------------- 220 (933)
T ss_pred EecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCe-EEEEccCCc--------------
Confidence 34678999999999877666542 1 356778999999777777777876 899998775
Q ss_pred eEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 369 VHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.++++|+-.+....+.+++|||+|+|||+++.||-|-|||+.+
T Consensus 221 e~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~I~vWnv~t 263 (933)
T KOG1274|consen 221 ELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQILVWNVDT 263 (933)
T ss_pred eeheeecccccccceEEEEEcCCCcEEeeeccCCcEEEEeccc
Confidence 6778886555555699999999999999999999999999984
No 104
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.31 E-value=4.4e-12 Score=130.91 Aligned_cols=220 Identities=15% Similarity=0.206 Sum_probs=143.6
Q ss_pred CCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC---CEEEEEeCCEEEEEECCCCc-eEEEEecCCCccCCCCCCCCCc
Q 003310 113 SSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS---RVVAICQAAQVHCFDAATLE-IEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 113 ~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~---r~LAVa~~~~I~vwDl~t~~-~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
+..+-+-++||.-||..+|+|+++.-|.+++|+. ++|.-+.++-++|||+...+ ....+.+|+. ++
T Consensus 77 aaadftakvw~a~tgdelhsf~hkhivk~~af~~ds~~lltgg~ekllrvfdln~p~App~E~~ghtg----------~I 146 (334)
T KOG0278|consen 77 AAADFTAKVWDAVTGDELHSFEHKHIVKAVAFSQDSNYLLTGGQEKLLRVFDLNRPKAPPKEISGHTG----------GI 146 (334)
T ss_pred hcccchhhhhhhhhhhhhhhhhhhheeeeEEecccchhhhccchHHHhhhhhccCCCCCchhhcCCCC----------cc
Confidence 3456788999999999999999999999999973 44455778889999998754 2223333321 01
Q ss_pred ccceeeeccceEEee-CCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccc
Q 003310 189 GYGPLAVGPRWLAYS-GSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCS 267 (832)
Q Consensus 189 ~~~p~Alg~r~LAya-~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~ 267 (832)
|-+-+. .++.+.+ +++..+|.-|-..+.+.+-+ |.
T Consensus 147 ---------r~v~wc~eD~~iLS-------------------Sadd~tVRLWD~rTgt~v~s------------L~---- 182 (334)
T KOG0278|consen 147 ---------RTVLWCHEDKCILS-------------------SADDKTVRLWDHRTGTEVQS------------LE---- 182 (334)
T ss_pred ---------eeEEEeccCceEEe-------------------eccCCceEEEEeccCcEEEE------------Ee----
Confidence 111122 1111110 12334443333333222110 00
Q ss_pred cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEE
Q 003310 268 EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNIN 347 (832)
Q Consensus 268 ~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~ 347 (832)
++....+... ++ .|.+++....+.|+.||..+-.+++.++.. -.|.+..++|+-..++.|..|+. ++
T Consensus 183 --~~s~VtSlEv-s~--------dG~ilTia~gssV~Fwdaksf~~lKs~k~P-~nV~SASL~P~k~~fVaGged~~-~~ 249 (334)
T KOG0278|consen 183 --FNSPVTSLEV-SQ--------DGRILTIAYGSSVKFWDAKSFGLLKSYKMP-CNVESASLHPKKEFFVAGGEDFK-VY 249 (334)
T ss_pred --cCCCCcceee-cc--------CCCEEEEecCceeEEeccccccceeeccCc-cccccccccCCCceEEecCcceE-EE
Confidence 0000000000 01 123455566788999999999999888754 36889999999999999999987 78
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
.||..++ ..+-.+..|+. ..|.|+.|||||...|+||.||||+||...+...
T Consensus 250 kfDy~Tg--------------eEi~~~nkgh~-gpVhcVrFSPdGE~yAsGSEDGTirlWQt~~~~~ 301 (334)
T KOG0278|consen 250 KFDYNTG--------------EEIGSYNKGHF-GPVHCVRFSPDGELYASGSEDGTIRLWQTTPGKT 301 (334)
T ss_pred EEeccCC--------------ceeeecccCCC-CceEEEEECCCCceeeccCCCceEEEEEecCCCc
Confidence 9999987 22222335654 4699999999999999999999999999977543
No 105
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.31 E-value=5.7e-11 Score=135.56 Aligned_cols=179 Identities=15% Similarity=0.248 Sum_probs=133.3
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEec-CCCccCCCCCCCCCcccc
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILT-NPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t-~~~~~~~p~~~~~~~~~~ 191 (832)
..|.+|+-.+++......| ...|++|.++ ++.||| ..++.|.|||..+.+.+.++.. |.. -++
T Consensus 197 ~~vylW~~~s~~v~~l~~~~~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~------------rvg 264 (484)
T KOG0305|consen 197 QSVYLWSASSGSVTELCSFGEELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHAS------------RVG 264 (484)
T ss_pred ceEEEEecCCCceEEeEecCCCceEEEEECCCCCEEEEeecCCeEEEEehhhccccccccCCcCc------------eeE
Confidence 5799999999998777778 7899999997 689998 5788999999998887777655 210 001
Q ss_pred eeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccC
Q 003310 192 PLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLP 271 (832)
Q Consensus 192 p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p 271 (832)
.+| ...
T Consensus 265 ~la-------W~~------------------------------------------------------------------- 270 (484)
T KOG0305|consen 265 SLA-------WNS------------------------------------------------------------------- 270 (484)
T ss_pred EEe-------ccC-------------------------------------------------------------------
Confidence 111 100
Q ss_pred CCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEE-eccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 272 DSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQ-FRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 272 ~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~-~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
..+.++..+|.|.++|+...+.+.. +.+|...|..|+|++||.+||+++.|.. +.|||
T Consensus 271 --------------------~~lssGsr~~~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnDN~-~~Iwd 329 (484)
T KOG0305|consen 271 --------------------SVLSSGSRDGKILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGNDNV-VFIWD 329 (484)
T ss_pred --------------------ceEEEecCCCcEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCCccc-eEecc
Confidence 0012345678999999998765555 8899999999999999999999999976 89999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc-CCCEEEEE--eCCCcEEEEecCCCCCceee
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD-DSNWIMIS--SSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsg--S~DgTVhIwdl~~~g~~~~~ 418 (832)
.... ..++++. .|+ |.|..++|+| ....||+| +.|++||+|++..+.....+
T Consensus 330 ~~~~--------------~p~~~~~-~H~-aAVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~g~~i~~v 384 (484)
T KOG0305|consen 330 GLSP--------------EPKFTFT-EHT-AAVKALAWCPWQSGLLATGGGSADRCIKFWNTNTGARIDSV 384 (484)
T ss_pred CCCc--------------cccEEEe-ccc-eeeeEeeeCCCccCceEEcCCCcccEEEEEEcCCCcEeccc
Confidence 8432 2233332 333 4599999999 66788885 56999999999865443333
No 106
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.29 E-value=9.7e-11 Score=129.39 Aligned_cols=213 Identities=20% Similarity=0.321 Sum_probs=148.5
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
+-++..|.++-|+|||+++ .+....+..|.+.|..+.|--.+. .|.. ++
T Consensus 215 kylatgg~d~~v~Iw~~~t-~ehv~~~~ghr~~V~~L~fr~gt~-------------~lys-~s---------------- 263 (479)
T KOG0299|consen 215 KYLATGGRDRHVQIWDCDT-LEHVKVFKGHRGAVSSLAFRKGTS-------------ELYS-AS---------------- 263 (479)
T ss_pred cEEEecCCCceEEEecCcc-cchhhcccccccceeeeeeecCcc-------------ceee-ee----------------
Confidence 4444456677799999998 456667888999999988853221 1221 11
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEE-eCCCCEEEEEEc--CCEEEEE-eCCEEEEEECCCCceEEEEec
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHML-KFRSPIYSVRCS--SRVVAIC-QAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL-~f~s~V~sV~~S--~r~LAVa-~~~~I~vwDl~t~~~~~tl~t 173 (832)
.+.+|++|++..-.++.++ .+.+.|.+|... .+.+.|+ -|.++++|++..-.. ....+
T Consensus 264 -----------------~Drsvkvw~~~~~s~vetlyGHqd~v~~IdaL~reR~vtVGgrDrT~rlwKi~eesq-lifrg 325 (479)
T KOG0299|consen 264 -----------------ADRSVKVWSIDQLSYVETLYGHQDGVLGIDALSRERCVTVGGRDRTVRLWKIPEESQ-LIFRG 325 (479)
T ss_pred -----------------cCCceEEEehhHhHHHHHHhCCccceeeechhcccceEEeccccceeEEEeccccce-eeeeC
Confidence 2378999999998888876 457899999995 4666664 799999999832211 11111
Q ss_pred CCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeee
Q 003310 174 NPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVN 253 (832)
Q Consensus 174 ~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~ 253 (832)
+. ++ ....||-
T Consensus 326 ~~---~s----------------idcv~~I-------------------------------------------------- 336 (479)
T KOG0299|consen 326 GE---GS----------------IDCVAFI-------------------------------------------------- 336 (479)
T ss_pred CC---CC----------------eeeEEEe--------------------------------------------------
Confidence 10 00 0001111
Q ss_pred ccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec-cCC---C-------
Q 003310 254 LGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR-AHK---S------- 322 (832)
Q Consensus 254 lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~-aH~---~------- 322 (832)
. .-||+++..+|.|.+|++..++++.+.+ +|. .
T Consensus 337 -n-----------------------------------~~HfvsGSdnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~ 380 (479)
T KOG0299|consen 337 -N-----------------------------------DEHFVSGSDNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGN 380 (479)
T ss_pred -c-----------------------------------ccceeeccCCceEEEeeecccCceeEeeccccccCCccccccc
Confidence 0 0256778899999999999999998887 663 2
Q ss_pred -CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE
Q 003310 323 -PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS 398 (832)
Q Consensus 323 -pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg 398 (832)
.|++|+.-|...++|++|.+|. +|+|.+.++. .....++.+. . ...|++|+|+++|++|.+|
T Consensus 381 ~Witsla~i~~sdL~asGS~~G~-vrLW~i~~g~----------r~i~~l~~ls--~-~GfVNsl~f~~sgk~ivag 443 (479)
T KOG0299|consen 381 FWITSLAVIPGSDLLASGSWSGC-VRLWKIEDGL----------RAINLLYSLS--L-VGFVNSLAFSNSGKRIVAG 443 (479)
T ss_pred cceeeeEecccCceEEecCCCCc-eEEEEecCCc----------cccceeeecc--c-ccEEEEEEEccCCCEEEEe
Confidence 6899999999999999999998 8999998861 1234455553 1 2359999999999988776
No 107
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.26 E-value=3.2e-09 Score=114.87 Aligned_cols=183 Identities=15% Similarity=0.196 Sum_probs=140.4
Q ss_pred CEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
..-.||++.+|+...++. ++..|.++.|| +.+||. .++++|.||...++.....+...- +-
T Consensus 86 D~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~---------------~d 150 (399)
T KOG0296|consen 86 DLAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEV---------------ED 150 (399)
T ss_pred ceEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCceEEEeeccc---------------Cc
Confidence 456799999999888885 58899999997 678888 589999999999998777775210 01
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
| .||-+-. . + .
T Consensus 151 i----eWl~WHp-------------~--------------a-----------~--------------------------- 161 (399)
T KOG0296|consen 151 I----EWLKWHP-------------R--------------A-----------H--------------------------- 161 (399)
T ss_pred e----EEEEecc-------------c--------------c-----------c---------------------------
Confidence 1 2444431 0 0 0
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
.++.+..||.|-+|.+.++...+.|.+|..+++|=.|.|||++++++..||+ |++|+.+
T Consensus 162 --------------------illAG~~DGsvWmw~ip~~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dgt-i~~Wn~k 220 (399)
T KOG0296|consen 162 --------------------ILLAGSTDGSVWMWQIPSQALCKVMSGHNSPCTCGEFIPDGKRILTGYDDGT-IIVWNPK 220 (399)
T ss_pred --------------------EEEeecCCCcEEEEECCCcceeeEecCCCCCcccccccCCCceEEEEecCce-EEEEecC
Confidence 1223467899999999999899999999999999999999999999999998 8999999
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
++ ..++++.. .......+++++.++..+..|+.++.+++-...+++-...+.
T Consensus 221 tg--------------~p~~~~~~-~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n 272 (399)
T KOG0296|consen 221 TG--------------QPLHKITQ-AEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNN 272 (399)
T ss_pred CC--------------ceeEEecc-cccCcCCccccccccceeEeccCCccEEEEccccceEEEecC
Confidence 87 34444421 112236789999999999999999999998776654444444
No 108
>KOG2109 consensus WD40 repeat protein [General function prediction only]
Probab=99.25 E-value=6.5e-12 Score=143.35 Aligned_cols=321 Identities=25% Similarity=0.349 Sum_probs=189.8
Q ss_pred CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+++.||-.| .++|-....+.+.+++..+.|+|+...++++- +.++.+
T Consensus 252 Gy~~isglc~g~~~~g~gpglgg~~~~~vGrvg~vsaesV~g~~----------------~vivkd-------------- 301 (788)
T KOG2109|consen 252 GYVLISGLCRGSYQIGTGPGLGGFEEVLVGRVGPVSAESVLGNN----------------LVIVKD-------------- 301 (788)
T ss_pred hHHHHHHHhhcccCCCCCCCCCCcCceeccccccccceeecccc----------------eEEeec--------------
Confidence 45566666666 78888888788888888899999988877542 222221
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECCCCceEEEEecCCC
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAATLEIEYAILTNPI 176 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~ 176 (832)
|-+..+...++..+++..+.+++-++.+|++++-..+.|++++.++..-++...
T Consensus 302 ------------------------f~S~a~i~QfkAhkspiSaLcfdqsgsllViasi~g~nVnvfRimet~~t~~~~-- 355 (788)
T KOG2109|consen 302 ------------------------FDSFADIRQFKAHKSPISALCFDQSGSLLVIASITGRNVNVFRIMETVCTVNVS-- 355 (788)
T ss_pred ------------------------ccchhhhhheeeecCcccccccccCceEEEEEeeccceeeeEEecccccccccc--
Confidence 112333444555555544444444666777776666666666666654444322
Q ss_pred ccCCCCCCCCCcccceeeeccceEEeeCCCce-e-----cCCCccCCcccccccccccccCCCcceeeeecccccceece
Q 003310 177 VMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVV-V-----SNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAG 250 (832)
Q Consensus 177 ~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~-~-----s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasG 250 (832)
.+. +..++.++.++||+|.--... . +..|. +..+++.. .+. -|| ++-|-.|
T Consensus 356 -~qs-------~~~s~ra~t~aviqdicfs~~s~~r~~gsc~Ge--~P~ls~t~---------~lp-~~A---~~Sl~~g 412 (788)
T KOG2109|consen 356 -DQS-------LVVSPRANTAAVIQDICFSEVSTIRTAGSCEGE--PPALSLTC---------QLP-AYA---DTSLDLG 412 (788)
T ss_pred -ccc-------cccchhcchHHHHHHHhhhhhcceEeecccCCC--Cccccccc---------ccc-hhh---chhhhcc
Confidence 111 123456667777766421110 0 11111 10111000 000 011 1111112
Q ss_pred eeeccCcccccc----ccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECC-----CC-cEEEEeccC
Q 003310 251 IVNLGDLGYKKL----SQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIV-----SK-NVIAQFRAH 320 (832)
Q Consensus 251 i~~lGd~g~~~l----s~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~-----s~-~~l~~~~aH 320 (832)
+...|......+ ..||....-.++-...+..|+.|..|...+..+.+ ..|.+.+.+.. ++ .+++++-+|
T Consensus 413 l~s~g~~aa~gla~~sag~~a~s~~asSv~s~s~~pd~ks~gv~~gsv~k~-~q~~~~~l~~llv~~psGd~vvqh~vah 491 (788)
T KOG2109|consen 413 LQSSGGLAAEGLATSSAGYTAHSYTASSVFSRSVKPDSKSVGVGSGSVTKA-NQGVITVLNLLLVGEPSGDGVVQHYVAH 491 (788)
T ss_pred ccccCcccceeeeeccccccccccccceeeccccccchhhccceeeecccc-CccchhhhhheeeecCCCCceeEEEeec
Confidence 222222211111 12232221000000012234445445443333332 23444444432 33 567788899
Q ss_pred CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC
Q 003310 321 KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS 400 (832)
Q Consensus 321 ~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~ 400 (832)
..++..+.|+|+++++.+++..++.+++|.+++...++.-+ .+.|+|+++||.|.++|..++|+-|++|+|....
T Consensus 492 s~~gv~~Ef~~~~~l~lSad~~e~ef~~f~V~Ph~~wssla-----av~hly~l~rG~TsaKv~~~afs~dsrw~A~~t~ 566 (788)
T KOG2109|consen 492 SDPGVYIEFSPDQRLVLSADANENEFNIFLVMPHATWSSLA-----AVQHLYKLNRGSTSAKVVSTAFSEDSRWLAITTN 566 (788)
T ss_pred cCccceeeecccccceecccccccccceEEeecccccHHHh-----hhhhhhhccCCCccceeeeeEeecchhhhhhhhc
Confidence 99999999999999999999999988999999875554332 3578999999999999999999999999999999
Q ss_pred CCcEEEEecCCCCCceeeccCCC
Q 003310 401 RGTSHLFAINPLGGSVNFQPTDA 423 (832)
Q Consensus 401 DgTVhIwdl~~~g~~~~~~~H~~ 423 (832)
.+|-|||.+++|++....++|++
T Consensus 567 ~~TthVfk~hpYgg~aeqrth~~ 589 (788)
T KOG2109|consen 567 HATTHVFKVHPYGGKAEQRTHGD 589 (788)
T ss_pred CCceeeeeeccccccccceecCC
Confidence 99999999999999999999977
No 109
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.25 E-value=5.6e-11 Score=131.85 Aligned_cols=112 Identities=18% Similarity=0.345 Sum_probs=87.0
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC-------------CCCCccCC
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL-------------GTSSACDA 364 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~-------------~~~s~~~~ 364 (832)
..||.|.|||+.+...+.+|++|+..+.||..++||+.|=||+-|.+ +|.||++++.. +-....+|
T Consensus 528 csdGnI~vwDLhnq~~VrqfqGhtDGascIdis~dGtklWTGGlDnt-vRcWDlregrqlqqhdF~SQIfSLg~cP~~dW 606 (705)
T KOG0639|consen 528 CSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISKDGTKLWTGGLDNT-VRCWDLREGRQLQQHDFSSQIFSLGYCPTGDW 606 (705)
T ss_pred ccCCcEEEEEcccceeeecccCCCCCceeEEecCCCceeecCCCccc-eeehhhhhhhhhhhhhhhhhheecccCCCccc
Confidence 46789999999999999999999999999999999999999999977 89999998721 11111222
Q ss_pred ------CCceeE-------EEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 365 ------GTSYVH-------LYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 365 ------~~~~~~-------l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
+..++. .|.|+ .+...|.++.|++.|+|+++.+.|.-+-.|.. ++|
T Consensus 607 lavGMens~vevlh~skp~kyqlh--lheScVLSlKFa~cGkwfvStGkDnlLnawrt-PyG 665 (705)
T KOG0639|consen 607 LAVGMENSNVEVLHTSKPEKYQLH--LHESCVLSLKFAYCGKWFVSTGKDNLLNAWRT-PYG 665 (705)
T ss_pred eeeecccCcEEEEecCCccceeec--ccccEEEEEEecccCceeeecCchhhhhhccC-ccc
Confidence 112222 23332 23345999999999999999999999999988 444
No 110
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.25 E-value=4.9e-10 Score=128.51 Aligned_cols=125 Identities=20% Similarity=0.255 Sum_probs=94.9
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC---CCCEEEEEEcCCCEEEEEeCCCCCC------C-CCCc-
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP---SGILLVTASVQGHNINIFKIIPGIL------G-TSSA- 361 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP---dG~lLATaS~dGt~I~Iwdi~t~~~------~-~~s~- 361 (832)
++++++.-|+++|||+.+.+....+.+|.+.|.||.||. .-++||+||.| +.|+|||+...+. + +++.
T Consensus 473 hLAsGDr~GnlrVy~Lq~l~~~~~~eAHesEilcLeyS~p~~~~kLLASasrd-RlIHV~Dv~rny~l~qtld~HSssIT 551 (1080)
T KOG1408|consen 473 HLASGDRGGNLRVYDLQELEYTCFMEAHESEILCLEYSFPVLTNKLLASASRD-RLIHVYDVKRNYDLVQTLDGHSSSIT 551 (1080)
T ss_pred eecccCccCceEEEEehhhhhhhheecccceeEEEeecCchhhhHhhhhccCC-ceEEEEecccccchhhhhccccccee
Confidence 677888899999999999999999999999999999986 35689999998 6799999876521 1 1110
Q ss_pred ------cC-----------C---------CCceeEEEEEeccCc---cccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 362 ------CD-----------A---------GTSYVHLYRLQRGLT---NAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 362 ------~~-----------~---------~~~~~~l~~l~rG~t---~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+ + .++++ .+.|++. ...+++++.-|..+++++++.|+.|+||+++.+
T Consensus 552 svKFa~~gln~~MiscGADksimFr~~qk~~~g~---~f~r~t~t~~ktTlYDm~Vdp~~k~v~t~cQDrnirif~i~sg 628 (1080)
T KOG1408|consen 552 SVKFACNGLNRKMISCGADKSIMFRVNQKASSGR---LFPRHTQTLSKTTLYDMAVDPTSKLVVTVCQDRNIRIFDIESG 628 (1080)
T ss_pred EEEEeecCCceEEEeccCchhhheehhccccCce---eccccccccccceEEEeeeCCCcceEEEEecccceEEEecccc
Confidence 00 0 00111 1123322 123899999999999999999999999999998
Q ss_pred CCceeeccC
Q 003310 413 GGSVNFQPT 421 (832)
Q Consensus 413 g~~~~~~~H 421 (832)
+..-.|++.
T Consensus 629 Kq~k~FKgs 637 (1080)
T KOG1408|consen 629 KQVKSFKGS 637 (1080)
T ss_pred ceeeeeccc
Confidence 888888653
No 111
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.25 E-value=3.4e-10 Score=124.69 Aligned_cols=103 Identities=18% Similarity=0.192 Sum_probs=77.6
Q ss_pred ccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 296 DADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
.+..+|+|+-+|+++. +++.++++|..+|++|++++.-. +|+|+|.|+. +++|++....+ ...+-..
T Consensus 347 ~~tddG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~~~p~~l~t~s~d~~-Vklw~~~~~~~----------~~v~~~~ 415 (463)
T KOG0270|consen 347 VSTDDGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNIQTPGLLSTASTDKV-VKLWKFDVDSP----------KSVKEHS 415 (463)
T ss_pred EecCCceEEeeecCCCCCceeEEEeccCCcceEEecCCCCcceeeccccce-EEEEeecCCCC----------ccccccc
Confidence 3457899999999875 89999999999999999998644 7899999986 89999865411 1122222
Q ss_pred EeccCccccEEEEEEccCC-CEEEEEeCCCcEEEEecCCCC
Q 003310 374 LQRGLTNAVIQDISFSDDS-NWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg-~~LAsgS~DgTVhIwdl~~~g 413 (832)
+.-| +..|.++.|+- -++|.|+..+.++|||+.+..
T Consensus 416 ~~~~----rl~c~~~~~~~a~~la~GG~k~~~~vwd~~~~~ 452 (463)
T KOG0270|consen 416 FKLG----RLHCFALDPDVAFTLAFGGEKAVLRVWDIFTNS 452 (463)
T ss_pred cccc----ceeecccCCCcceEEEecCccceEEEeecccCh
Confidence 2222 25677777764 467888889999999997653
No 112
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.24 E-value=9.5e-11 Score=137.15 Aligned_cols=110 Identities=16% Similarity=0.305 Sum_probs=91.2
Q ss_pred ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 292 GHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
+.+.++..|.+|+||++....+++.|. |..-|+|++|+| |.+|+++||.||+ ||||+|... .+..
T Consensus 381 ~fLLSSSMDKTVRLWh~~~~~CL~~F~-HndfVTcVaFnPvDDryFiSGSLD~K-vRiWsI~d~------------~Vv~ 446 (712)
T KOG0283|consen 381 NFLLSSSMDKTVRLWHPGRKECLKVFS-HNDFVTCVAFNPVDDRYFISGSLDGK-VRLWSISDK------------KVVD 446 (712)
T ss_pred CeeEeccccccEEeecCCCcceeeEEe-cCCeeEEEEecccCCCcEeecccccc-eEEeecCcC------------eeEe
Confidence 355678899999999999999999994 899999999999 8899999999998 999999765 2233
Q ss_pred EEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeecc
Q 003310 371 LYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~ 420 (832)
-+.++ --|..+||+|||++.++|+.+|.+++|+.....-...++.
T Consensus 447 W~Dl~-----~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I 491 (712)
T KOG0283|consen 447 WNDLR-----DLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHI 491 (712)
T ss_pred ehhhh-----hhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeE
Confidence 23332 2399999999999999999999999999976554445443
No 113
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.24 E-value=8e-12 Score=143.52 Aligned_cols=180 Identities=14% Similarity=0.258 Sum_probs=133.4
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
..+-||..-.-.++..|.. .++|.+|.|+ ..+|+. +.+++|++||+...+..++|.+|-.. +
T Consensus 50 ~k~~L~~i~kp~~i~S~~~hespIeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~--------------~ 115 (825)
T KOG0267|consen 50 EKVNLWAIGKPNAITSLTGHESPIESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLN--------------I 115 (825)
T ss_pred eeeccccccCCchhheeeccCCcceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccC--------------c
Confidence 5566777665556666654 6799999997 466666 56789999999999988888776321 1
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
.. ++|.+ |+.
T Consensus 116 ~s-----v~f~P------------------------------------------------------------~~~----- 125 (825)
T KOG0267|consen 116 TS-----VDFHP------------------------------------------------------------YGE----- 125 (825)
T ss_pred ce-----eeecc------------------------------------------------------------ceE-----
Confidence 11 11210 000
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
-++.+..++.+++||+....|...+++|...|.++.|+|+|++++.+.+|.+ ++|||+.
T Consensus 126 --------------------~~a~gStdtd~~iwD~Rk~Gc~~~~~s~~~vv~~l~lsP~Gr~v~~g~ed~t-vki~d~~ 184 (825)
T KOG0267|consen 126 --------------------FFASGSTDTDLKIWDIRKKGCSHTYKSHTRVVDVLRLSPDGRWVASGGEDNT-VKIWDLT 184 (825)
T ss_pred --------------------EeccccccccceehhhhccCceeeecCCcceeEEEeecCCCceeeccCCcce-eeeeccc
Confidence 0122345678999999999999999999999999999999999999999865 9999997
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCcee
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVN 417 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~ 417 (832)
.+ ..+.+|. ++ ...|+.+.|.|-.-.+++||.|+|+++||++++.-...
T Consensus 185 ag--------------k~~~ef~-~~-e~~v~sle~hp~e~Lla~Gs~d~tv~f~dletfe~I~s 233 (825)
T KOG0267|consen 185 AG--------------KLSKEFK-SH-EGKVQSLEFHPLEVLLAPGSSDRTVRFWDLETFEVISS 233 (825)
T ss_pred cc--------------ccccccc-cc-cccccccccCchhhhhccCCCCceeeeeccceeEEeec
Confidence 66 3333442 22 24688999999999999999999999999997643333
No 114
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.24 E-value=3.1e-10 Score=130.31 Aligned_cols=168 Identities=15% Similarity=0.224 Sum_probs=126.3
Q ss_pred CCCEEEEEECCCCcEEEEEe-CCCCEEEEEEcC--CEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccc
Q 003310 115 VPTVVHFYSLRSQSYVHMLK-FRSPIYSVRCSS--RVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S~--r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
+++|+++|-. ++++.+++ +...|++|.+-+ .+|..+.|++|++|.. ++++.++.+|..+.
T Consensus 120 WD~TakvW~~--~~l~~~l~gH~asVWAv~~l~e~~~vTgsaDKtIklWk~--~~~l~tf~gHtD~V------------- 182 (745)
T KOG0301|consen 120 WDSTAKVWRI--GELVYSLQGHTASVWAVASLPENTYVTGSADKTIKLWKG--GTLLKTFSGHTDCV------------- 182 (745)
T ss_pred cccceEEecc--hhhhcccCCcchheeeeeecCCCcEEeccCcceeeeccC--Cchhhhhccchhhe-------------
Confidence 5699999975 56677676 478999998843 4455578999999987 55667777764310
Q ss_pred eeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccC
Q 003310 192 PLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLP 271 (832)
Q Consensus 192 p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p 271 (832)
|-||.- +
T Consensus 183 ------RgL~vl-------------------------------------------------------------------~ 189 (745)
T KOG0301|consen 183 ------RGLAVL-------------------------------------------------------------------D 189 (745)
T ss_pred ------eeeEEe-------------------------------------------------------------------c
Confidence 111111 0
Q ss_pred CCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 272 DSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 272 ~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
..+|++.++||.|++||+ ++.++..+.+|+.-|.+++..+++.+++|++.|++ ++||+.
T Consensus 190 -------------------~~~flScsNDg~Ir~w~~-~ge~l~~~~ghtn~vYsis~~~~~~~Ivs~gEDrt-lriW~~ 248 (745)
T KOG0301|consen 190 -------------------DSHFLSCSNDGSIRLWDL-DGEVLLEMHGHTNFVYSISMALSDGLIVSTGEDRT-LRIWKK 248 (745)
T ss_pred -------------------CCCeEeecCCceEEEEec-cCceeeeeeccceEEEEEEecCCCCeEEEecCCce-EEEeec
Confidence 014667789999999999 78899999999999999999999999999999998 899987
Q ss_pred CCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 352 IPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 352 ~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.. +....++. ...||++++=++|.. ++|++||.|+||...+.
T Consensus 249 ~e--------------~~q~I~lP----ttsiWsa~~L~NgDI-vvg~SDG~VrVfT~~k~ 290 (745)
T KOG0301|consen 249 DE--------------CVQVITLP----TTSIWSAKVLLNGDI-VVGGSDGRVRVFTVDKD 290 (745)
T ss_pred Cc--------------eEEEEecC----ccceEEEEEeeCCCE-EEeccCceEEEEEeccc
Confidence 63 23333431 124999999888765 68889999999998753
No 115
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.24 E-value=1.5e-10 Score=123.01 Aligned_cols=61 Identities=21% Similarity=0.461 Sum_probs=56.0
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
++.++..||.|.|||...-.++.+|++|.+.|+.|+..|+|++-++-+.|+. ++.|++..+
T Consensus 99 hLlS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~-lr~WNLV~G 159 (362)
T KOG0294|consen 99 HLLSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQV-LRTWNLVRG 159 (362)
T ss_pred heeeecCCCcEEEEEcCCeEEeeeecccccccceeEecCCCceEEEEcCCce-eeeehhhcC
Confidence 3455678999999999999999999999999999999999999999999976 999999887
No 116
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.22 E-value=5.9e-10 Score=118.49 Aligned_cols=240 Identities=18% Similarity=0.247 Sum_probs=156.9
Q ss_pred CcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 18 RRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 18 ~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
+..|++|+.+| +-|||..+.+ ...+++.|--||.++.+.+.++ -|| ..+
T Consensus 35 G~~lAvGc~nG~vvI~D~~T~~-iar~lsaH~~pi~sl~WS~dgr-------------~Ll-tsS--------------- 84 (405)
T KOG1273|consen 35 GDYLAVGCANGRVVIYDFDTFR-IARMLSAHVRPITSLCWSRDGR-------------KLL-TSS--------------- 84 (405)
T ss_pred cceeeeeccCCcEEEEEccccc-hhhhhhccccceeEEEecCCCC-------------Eee-eec---------------
Confidence 67899999888 9999999864 7788999999999999987542 133 111
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC---CE-EEEEeCCEEEEEECCCCceEEEEe
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS---RV-VAICQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~---r~-LAVa~~~~I~vwDl~t~~~~~tl~ 172 (832)
.+..|++||+..|.+++.++|+++|+...+.+ +. ||.-.+..-.+-+.... ++++.
T Consensus 85 ------------------~D~si~lwDl~~gs~l~rirf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~--~h~~L 144 (405)
T KOG1273|consen 85 ------------------RDWSIKLWDLLKGSPLKRIRFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDP--KHSVL 144 (405)
T ss_pred ------------------CCceeEEEeccCCCceeEEEccCccceeeeccccCCeEEEEEecCCcEEEEecCC--ceeec
Confidence 12679999999999999999999999999964 22 33334444444443331 12221
Q ss_pred cCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceee
Q 003310 173 TNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIV 252 (832)
Q Consensus 173 t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~ 252 (832)
.- .+. | .+..++ +.+
T Consensus 145 p~----------------------------d~d-------~-----dln~sa------s~~------------------- 159 (405)
T KOG1273|consen 145 PK----------------------------DDD-------G-----DLNSSA------SHG------------------- 159 (405)
T ss_pred cC----------------------------CCc-------c-----cccccc------ccc-------------------
Confidence 10 000 0 000000 000
Q ss_pred eccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCC-CCeEEEEEcC
Q 003310 253 NLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHK-SPISALCFDP 331 (832)
Q Consensus 253 ~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~-~pIs~LaFSP 331 (832)
.| ...| ..+.++...|.+.|+|..+.++++.|+-.+ ..|..+-|+.
T Consensus 160 -----~f-------------------------dr~g---~yIitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~ 206 (405)
T KOG1273|consen 160 -----VF-------------------------DRRG---KYIITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSR 206 (405)
T ss_pred -----cc-------------------------cCCC---CEEEEecCcceEEEEecchheeeeeeeechheeeeEEEEec
Confidence 00 0000 123345677999999999999999999766 8899999999
Q ss_pred CCCEEEEEEcCCCEEEEEeCCCCC-CCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC-CcEEEEec
Q 003310 332 SGILLVTASVQGHNINIFKIIPGI-LGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR-GTSHLFAI 409 (832)
Q Consensus 332 dG~lLATaS~dGt~I~Iwdi~t~~-~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D-gTVhIwdl 409 (832)
.|..|++-+.| ++||.|++..-. .+. .+.....++++--.....-.+++||.||.|++++|.. ..+.||.-
T Consensus 207 ~g~~liiNtsD-RvIR~ye~~di~~~~r------~~e~e~~~K~qDvVNk~~Wk~ccfs~dgeYv~a~s~~aHaLYIWE~ 279 (405)
T KOG1273|consen 207 KGRFLIINTSD-RVIRTYEISDIDDEGR------DGEVEPEHKLQDVVNKLQWKKCCFSGDGEYVCAGSARAHALYIWEK 279 (405)
T ss_pred cCcEEEEecCC-ceEEEEehhhhcccCc------cCCcChhHHHHHHHhhhhhhheeecCCccEEEeccccceeEEEEec
Confidence 99999999998 469999987421 111 1122222333221222335789999999999888764 46899988
Q ss_pred CCC
Q 003310 410 NPL 412 (832)
Q Consensus 410 ~~~ 412 (832)
..+
T Consensus 280 ~~G 282 (405)
T KOG1273|consen 280 SIG 282 (405)
T ss_pred CCc
Confidence 664
No 117
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.22 E-value=1.3e-10 Score=128.89 Aligned_cols=188 Identities=23% Similarity=0.341 Sum_probs=135.9
Q ss_pred CEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++|+||||+..-+.+.|+ +.+.|..|.+| ..+||. ...+.|.|-.+.|...-.++. +++ +-
T Consensus 101 ~~Vkiwdl~~kl~hr~lkdh~stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~~~tt~f~-~~s--------------gq 165 (673)
T KOG4378|consen 101 GCVKIWDLRAKLIHRFLKDHQSTVTYVDYNNTDEYIASVSDGGDIIIHGTKTKQKTTTFT-IDS--------------GQ 165 (673)
T ss_pred ceeeehhhHHHHHhhhccCCcceeEEEEecCCcceeEEeccCCcEEEEecccCcccccee-cCC--------------CC
Confidence 799999999766666666 46899999997 456665 677889998888876433332 110 00
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
. -|.|-|+..+ |.
T Consensus 166 s---vRll~ys~sk--------------------------------------r~-------------------------- 178 (673)
T KOG4378|consen 166 S---VRLLRYSPSK--------------------------------------RF-------------------------- 178 (673)
T ss_pred e---EEEeeccccc--------------------------------------ce--------------------------
Confidence 0 1566665210 00
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEe
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGI-LLVTASVQGHNINIFK 350 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwd 350 (832)
++..+.++|.|.+||+....++..+. +|..|...|||+|... +||+.+.|.+ |.+||
T Consensus 179 --------------------lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkk-i~~yD 237 (673)
T KOG4378|consen 179 --------------------LLSIASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKK-INIYD 237 (673)
T ss_pred --------------------eeEeeccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecccce-EEEee
Confidence 12235678999999999988887764 9999999999999755 7789999976 89999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee-ccCCCC
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF-QPTDAN 424 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~-~~H~~~ 424 (832)
+... .....|- ..++...++|++||.+|++|++.|.|..||+...+.++.. ..|...
T Consensus 238 ~~s~--------------~s~~~l~---y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~~k~Pv~v~sah~~s 295 (673)
T KOG4378|consen 238 IRSQ--------------ASTDRLT---YSHPLSTVAFSECGTYLCAGNSKGELIAYDMRSTKAPVAVRSAHDAS 295 (673)
T ss_pred cccc--------------cccceee---ecCCcceeeecCCceEEEeecCCceEEEEecccCCCCceEeeecccc
Confidence 9753 1111221 1245889999999999999999999999999988888755 566544
No 118
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.22 E-value=2.8e-10 Score=126.47 Aligned_cols=261 Identities=16% Similarity=0.246 Sum_probs=160.5
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEee----ecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCcccc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVS----RYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQ 92 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS----~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~ 92 (832)
.++|+..| .++++|||+...++-..+-. -+|.-+|.++++|+.. -|| +++.
T Consensus 431 trhVyTgG-kgcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgr-------------tLi--vGGe--------- 485 (705)
T KOG0639|consen 431 TRHVYTGG-KGCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGR-------------TLI--VGGE--------- 485 (705)
T ss_pred cceeEecC-CCeEEEeeccCCCCCCccccccccCcccceeeeEecCCCc-------------eEE--eccc---------
Confidence 37777665 68899999987543221111 2466677777777542 232 3321
Q ss_pred CCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCC---CEEEEEEcC--CE-EEEEeCCEEEEEECCCCc
Q 003310 93 DGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRS---PIYSVRCSS--RV-VAICQAAQVHCFDAATLE 166 (832)
Q Consensus 93 Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s---~V~sV~~S~--r~-LAVa~~~~I~vwDl~t~~ 166 (832)
-.+|.||||.+-...-..+.++ ..|++++|+ ++ ++.+.++.|.|||+.+-.
T Consensus 486 -----------------------astlsiWDLAapTprikaeltssapaCyALa~spDakvcFsccsdGnI~vwDLhnq~ 542 (705)
T KOG0639|consen 486 -----------------------ASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQT 542 (705)
T ss_pred -----------------------cceeeeeeccCCCcchhhhcCCcchhhhhhhcCCccceeeeeccCCcEEEEEcccce
Confidence 1579999998776544445543 568888885 44 344789999999999999
Q ss_pred eEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccc
Q 003310 167 IEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKH 246 (832)
Q Consensus 167 ~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~ 246 (832)
..+++.+|+. +...|.++. +....|..|. +.+|..|-....++
T Consensus 543 ~VrqfqGhtD------------GascIdis~-------dGtklWTGGl------------------DntvRcWDlregrq 585 (705)
T KOG0639|consen 543 LVRQFQGHTD------------GASCIDISK-------DGTKLWTGGL------------------DNTVRCWDLREGRQ 585 (705)
T ss_pred eeecccCCCC------------CceeEEecC-------CCceeecCCC------------------ccceeehhhhhhhh
Confidence 9999988864 122333321 1112232221 12343333333333
Q ss_pred eeceeeeccCccccccccccccc--cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCe
Q 003310 247 LAAGIVNLGDLGYKKLSQYCSEF--LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPI 324 (832)
Q Consensus 247 lasGi~~lGd~g~~~ls~y~~~~--~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pI 324 (832)
+.. . .++....++ .|+ .+| ++-+-..+.|-|.... +....++.-|.+-|
T Consensus 586 lqq-------h---dF~SQIfSLg~cP~---------~dW---------lavGMens~vevlh~s-kp~kyqlhlheScV 636 (705)
T KOG0639|consen 586 LQQ-------H---DFSSQIFSLGYCPT---------GDW---------LAVGMENSNVEVLHTS-KPEKYQLHLHESCV 636 (705)
T ss_pred hhh-------h---hhhhhheecccCCC---------ccc---------eeeecccCcEEEEecC-CccceeecccccEE
Confidence 321 0 011111000 111 111 1223344556665543 33446677888999
Q ss_pred EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcE
Q 003310 325 SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTS 404 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTV 404 (832)
.+|+|++-|+++++.+.|. .++.|+..-| ..++.... ...|.++.+|-|.++|++||.|...
T Consensus 637 LSlKFa~cGkwfvStGkDn-lLnawrtPyG--------------asiFqskE---~SsVlsCDIS~ddkyIVTGSGdkkA 698 (705)
T KOG0639|consen 637 LSLKFAYCGKWFVSTGKDN-LLNAWRTPYG--------------ASIFQSKE---SSSVLSCDISFDDKYIVTGSGDKKA 698 (705)
T ss_pred EEEEecccCceeeecCchh-hhhhccCccc--------------cceeeccc---cCcceeeeeccCceEEEecCCCcce
Confidence 9999999999999999995 5899988655 34444432 2359999999999999999999998
Q ss_pred EEEec
Q 003310 405 HLFAI 409 (832)
Q Consensus 405 hIwdl 409 (832)
.||.+
T Consensus 699 TVYeV 703 (705)
T KOG0639|consen 699 TVYEV 703 (705)
T ss_pred EEEEE
Confidence 88876
No 119
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.22 E-value=2e-09 Score=114.12 Aligned_cols=187 Identities=11% Similarity=0.122 Sum_probs=125.9
Q ss_pred CCEEEEEECCCCcEEEEEeCCCCEEEEEEc----CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccc
Q 003310 116 PTVVHFYSLRSQSYVHMLKFRSPIYSVRCS----SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGY 190 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S----~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~ 190 (832)
++.+++|||.+++....--+..+|..++|= -..|+. +-|++|+.||.+.-..+.++.- |+.
T Consensus 93 Dk~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~pv~t~~L-------PeR------- 158 (347)
T KOG0647|consen 93 DKQAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNPVATLQL-------PER------- 158 (347)
T ss_pred CCceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEecccccceeecccCCCCeeeeeec-------cce-------
Confidence 478999999999866666678899999993 245666 6799999999998776666632 110
Q ss_pred ceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccccc
Q 003310 191 GPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFL 270 (832)
Q Consensus 191 ~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~ 270 (832)
.|+ ++.-+.++
T Consensus 159 ----------vYa-------------------------------------~Dv~~pm~---------------------- 169 (347)
T KOG0647|consen 159 ----------VYA-------------------------------------ADVLYPMA---------------------- 169 (347)
T ss_pred ----------eee-------------------------------------hhccCcee----------------------
Confidence 122 11100000
Q ss_pred CCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCC----CeEEEEEcCCCCEEEEEEcCCCEE
Q 003310 271 PDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKS----PISALCFDPSGILLVTASVQGHNI 346 (832)
Q Consensus 271 p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~----pIs~LaFSPdG~lLATaS~dGt~I 346 (832)
+-+..+..|.+|.++++. .+|+.+.+ .+.||+.-+|....|-||..|+ +
T Consensus 170 ------------------------vVata~r~i~vynL~n~~--te~k~~~SpLk~Q~R~va~f~d~~~~alGsiEGr-v 222 (347)
T KOG0647|consen 170 ------------------------VVATAERHIAVYNLENPP--TEFKRIESPLKWQTRCVACFQDKDGFALGSIEGR-V 222 (347)
T ss_pred ------------------------EEEecCCcEEEEEcCCCc--chhhhhcCcccceeeEEEEEecCCceEeeeecce-E
Confidence 011223457888887643 34555554 4688988888888899999998 7
Q ss_pred EEEeCCCCCCCCCCccCCCCceeEEEEEeccCc-----cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccC
Q 003310 347 NIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT-----NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPT 421 (832)
Q Consensus 347 ~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t-----~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H 421 (832)
-|..+..+.+ .....++.+|... -..|++|+|.|.-..|+++++|||.-.||-......-+...|
T Consensus 223 ~iq~id~~~~----------~~nFtFkCHR~~~~~~~~VYaVNsi~FhP~hgtlvTaGsDGtf~FWDkdar~kLk~s~~~ 292 (347)
T KOG0647|consen 223 AIQYIDDPNP----------KDNFTFKCHRSTNSVNDDVYAVNSIAFHPVHGTLVTAGSDGTFSFWDKDARTKLKTSETH 292 (347)
T ss_pred EEEecCCCCc----------cCceeEEEeccCCCCCCceEEecceEeecccceEEEecCCceEEEecchhhhhhhccCcC
Confidence 8888876411 1234456666311 123789999999999999999999999998665444444555
Q ss_pred C
Q 003310 422 D 422 (832)
Q Consensus 422 ~ 422 (832)
.
T Consensus 293 ~ 293 (347)
T KOG0647|consen 293 P 293 (347)
T ss_pred C
Confidence 3
No 120
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.21 E-value=2.9e-10 Score=124.86 Aligned_cols=108 Identities=20% Similarity=0.408 Sum_probs=87.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCC--CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHK--SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~--~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
.+++++|+.-..|+.++..+......+ -.+++++|.|||.+|+|+..||- ++|||+... ..+.
T Consensus 319 lsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLifgtgt~d~~-vkiwdlks~--------------~~~a 383 (506)
T KOG0289|consen 319 LSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIFGTGTPDGV-VKIWDLKSQ--------------TNVA 383 (506)
T ss_pred EEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEEeccCCCce-EEEEEcCCc--------------cccc
Confidence 446778899999999998877665432 25899999999999999999986 999999875 1233
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
+| -|+ .+.|..|+||.+|-|||++++|+.|++||+.+.....+|.
T Consensus 384 ~F-pgh-t~~vk~i~FsENGY~Lat~add~~V~lwDLRKl~n~kt~~ 428 (506)
T KOG0289|consen 384 KF-PGH-TGPVKAISFSENGYWLATAADDGSVKLWDLRKLKNFKTIQ 428 (506)
T ss_pred cC-CCC-CCceeEEEeccCceEEEEEecCCeEEEEEehhhcccceee
Confidence 45 354 4679999999999999999999999999999876554543
No 121
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.20 E-value=9.1e-10 Score=121.74 Aligned_cols=99 Identities=22% Similarity=0.319 Sum_probs=76.6
Q ss_pred CCCeEEEEECCCCc-EEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 299 NVGMVIVRDIVSKN-VIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 299 ~~G~V~IwDl~s~~-~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
.-|...+||+++++ ....+.-|...|..|+|+|- -.+|||||.|++ .+|||++.-. +.. + -.|+.+.
T Consensus 299 ~~G~f~~iD~R~~~s~~~~~~lh~kKI~sv~~NP~~p~~laT~s~D~T-~kIWD~R~l~-~K~-------s-p~lst~~- 367 (498)
T KOG4328|consen 299 NVGNFNVIDLRTDGSEYENLRLHKKKITSVALNPVCPWFLATASLDQT-AKIWDLRQLR-GKA-------S-PFLSTLP- 367 (498)
T ss_pred cccceEEEEeecCCccchhhhhhhcccceeecCCCCchheeecccCcc-eeeeehhhhc-CCC-------C-cceeccc-
Confidence 34578899998765 47778889999999999995 568999999998 8999998641 110 0 1244442
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
| ...|.+..|||++-.|++.+.|.+|+|||..
T Consensus 368 -H-rrsV~sAyFSPs~gtl~TT~~D~~IRv~dss 399 (498)
T KOG4328|consen 368 -H-RRSVNSAYFSPSGGTLLTTCQDNEIRVFDSS 399 (498)
T ss_pred -c-cceeeeeEEcCCCCceEeeccCCceEEeecc
Confidence 1 2349999999988889999999999999995
No 122
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.20 E-value=2.2e-08 Score=111.90 Aligned_cols=104 Identities=13% Similarity=0.225 Sum_probs=83.0
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
.+++.+.|+.|+||+ ..+++.+.. -..|+.|+.|.|.| .||.|...|+ --|.|+.+. .+.
T Consensus 382 q~~T~gqdk~v~lW~--~~k~~wt~~-~~d~~~~~~fhpsg-~va~Gt~~G~-w~V~d~e~~---------------~lv 441 (626)
T KOG2106|consen 382 QLLTCGQDKHVRLWN--DHKLEWTKI-IEDPAECADFHPSG-VVAVGTATGR-WFVLDTETQ---------------DLV 441 (626)
T ss_pred heeeccCcceEEEcc--CCceeEEEE-ecCceeEeeccCcc-eEEEeeccce-EEEEecccc---------------eeE
Confidence 456788999999999 445544433 24689999999999 8999999998 458888763 344
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+++.- ++.|..++|||||.+||+||.|+.|.||.++..|..+..
T Consensus 442 ~~~~d--~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~g~~y~r 485 (626)
T KOG2106|consen 442 TIHTD--NEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSANGRKYSR 485 (626)
T ss_pred EEEec--CCceEEEEEcCCCCEEEEecCCCeEEEEEECCCCcEEEE
Confidence 55433 567999999999999999999999999999988777654
No 123
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.20 E-value=6.6e-11 Score=142.05 Aligned_cols=234 Identities=17% Similarity=0.224 Sum_probs=163.3
Q ss_pred cEEEEEecCC-eEEEEecc--CCCeeEEee---ecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCcccc
Q 003310 19 RVLLLGYRSG-FQVWDVEE--ADNVHDLVS---RYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQ 92 (832)
Q Consensus 19 ~vLl~Gy~~G-~qVWdv~~--~~~~~ellS---~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~ 92 (832)
-|++.|.++| +-+||.+. .++..+++. .|.|+|+.+.|-+. ++++||-+.+
T Consensus 81 GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~-------------q~nlLASGa~---------- 137 (1049)
T KOG0307|consen 81 GLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPF-------------QGNLLASGAD---------- 137 (1049)
T ss_pred ceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeecccc-------------CCceeeccCC----------
Confidence 4788888887 99999987 355556664 68999999988642 2346652221
Q ss_pred CCcccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEE---eCCCCEEEEEEcC---CEEEEE-eCCEEEEEECCCC
Q 003310 93 DGLATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHML---KFRSPIYSVRCSS---RVVAIC-QAAQVHCFDAATL 165 (832)
Q Consensus 93 Dg~~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL---~f~s~V~sV~~S~---r~LAVa-~~~~I~vwDl~t~ 165 (832)
++.|.||||..-+.-.++ .+.+.|..|++|+ ++||.+ ..+++.|||++.-
T Consensus 138 -----------------------~geI~iWDlnn~~tP~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg~~~iWDlr~~ 194 (1049)
T KOG0307|consen 138 -----------------------DGEILIWDLNKPETPFTPGSQAPPSEIKCLSWNRKVSHILASGSPSGRAVIWDLRKK 194 (1049)
T ss_pred -----------------------CCcEEEeccCCcCCCCCCCCCCCcccceEeccchhhhHHhhccCCCCCceeccccCC
Confidence 267999999875544433 3578899999985 678875 4569999999987
Q ss_pred ceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeeccccc
Q 003310 166 EIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSK 245 (832)
Q Consensus 166 ~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk 245 (832)
+.+..+..++.-. ..+. |++. |.+.
T Consensus 195 ~pii~ls~~~~~~----------~~S~-------l~Wh-------------P~~a------------------------- 219 (1049)
T KOG0307|consen 195 KPIIKLSDTPGRM----------HCSV-------LAWH-------------PDHA------------------------- 219 (1049)
T ss_pred CcccccccCCCcc----------ceee-------eeeC-------------CCCc-------------------------
Confidence 6555554432100 0001 1221 1110
Q ss_pred ceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC-CcEEEEeccCCCCe
Q 003310 246 HLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS-KNVIAQFRAHKSPI 324 (832)
Q Consensus 246 ~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s-~~~l~~~~aH~~pI 324 (832)
+.+..++ +.+..-.|.+||++. -.++.+++.|...|
T Consensus 220 -----------------------------Tql~~As--------------~dd~~PviqlWDlR~assP~k~~~~H~~Gi 256 (1049)
T KOG0307|consen 220 -----------------------------TQLLVAS--------------GDDSAPVIQLWDLRFASSPLKILEGHQRGI 256 (1049)
T ss_pred -----------------------------eeeeeec--------------CCCCCceeEeecccccCCchhhhcccccce
Confidence 0010111 012234799999875 46888899999999
Q ss_pred EEEEEcCCC-CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCC-EEEEEeCCC
Q 003310 325 SALCFDPSG-ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN-WIMISSSRG 402 (832)
Q Consensus 325 s~LaFSPdG-~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~-~LAsgS~Dg 402 (832)
.+|.|.+.+ ++|+|++.|++ |.+|+..++ +.++.+-++ ...+.++.|+|-.- .+|++|-||
T Consensus 257 lslsWc~~D~~lllSsgkD~~-ii~wN~~tg--------------Evl~~~p~~--~nW~fdv~w~pr~P~~~A~asfdg 319 (1049)
T KOG0307|consen 257 LSLSWCPQDPRLLLSSGKDNR-IICWNPNTG--------------EVLGELPAQ--GNWCFDVQWCPRNPSVMAAASFDG 319 (1049)
T ss_pred eeeccCCCCchhhhcccCCCC-eeEecCCCc--------------eEeeecCCC--CcceeeeeecCCCcchhhhheecc
Confidence 999999976 89999999998 789999887 677887543 33599999999554 899999999
Q ss_pred cEEEEecCCCC
Q 003310 403 TSHLFAINPLG 413 (832)
Q Consensus 403 TVhIwdl~~~g 413 (832)
+|-||.+....
T Consensus 320 kI~I~sl~~~~ 330 (1049)
T KOG0307|consen 320 KISIYSLQGTD 330 (1049)
T ss_pred ceeeeeeecCC
Confidence 99999997643
No 124
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.20 E-value=3.3e-10 Score=132.70 Aligned_cols=180 Identities=17% Similarity=0.222 Sum_probs=121.2
Q ss_pred CCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC---CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCC
Q 003310 112 GSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS---RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIG 187 (832)
Q Consensus 112 ~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~---r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~ 187 (832)
.++.++|||||++...+|+++|.+++-|.+|+|++ ++++. ++|++|+||++..-+....-...-.
T Consensus 385 SSSMDKTVRLWh~~~~~CL~~F~HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~l----------- 453 (712)
T KOG0283|consen 385 SSSMDKTVRLWHPGRKECLKVFSHNDFVTCVAFNPVDDRYFISGSLDGKVRLWSISDKKVVDWNDLRDL----------- 453 (712)
T ss_pred eccccccEEeecCCCcceeeEEecCCeeEEEEecccCCCcEeecccccceEEeecCcCeeEeehhhhhh-----------
Confidence 45688999999999999999999999999999985 55555 8999999999987653322211100
Q ss_pred cccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccc
Q 003310 188 IGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCS 267 (832)
Q Consensus 188 ~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~ 267 (832)
+. | +.|.
T Consensus 454 --IT--A-----vcy~---------------------------------------------------------------- 460 (712)
T KOG0283|consen 454 --IT--A-----VCYS---------------------------------------------------------------- 460 (712)
T ss_pred --he--e-----EEec----------------------------------------------------------------
Confidence 00 0 1122
Q ss_pred cccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec---------cCCCCeEEEEEcCCCC-EEE
Q 003310 268 EFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR---------AHKSPISALCFDPSGI-LLV 337 (832)
Q Consensus 268 ~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~---------aH~~pIs~LaFSPdG~-lLA 337 (832)
|+|.+ .+ -+...|.+++|+....+....+. .|. .|+.+.|.|.-. .++
T Consensus 461 ---PdGk~-------------av-----IGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~-rITG~Q~~p~~~~~vL 518 (712)
T KOG0283|consen 461 ---PDGKG-------------AV-----IGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGK-RITGLQFFPGDPDEVL 518 (712)
T ss_pred ---cCCce-------------EE-----EEEeccEEEEEEccCCeEEEeeeEeeccCccccCc-eeeeeEecCCCCCeEE
Confidence 11111 11 12345778888887776655543 233 799999998433 344
Q ss_pred EEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc-EEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 338 TASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 338 TaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
..|.|.+ |||||.++. ..+.+| +|+.+.. =....|+.||++|+++|.|.-|+||++....
T Consensus 519 VTSnDSr-IRI~d~~~~--------------~lv~Kf-KG~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYiW~~~~~~ 579 (712)
T KOG0283|consen 519 VTSNDSR-IRIYDGRDK--------------DLVHKF-KGFRNTSSQISASFSSDGKHIVSASEDSWVYIWKNDSFN 579 (712)
T ss_pred EecCCCc-eEEEeccch--------------hhhhhh-cccccCCcceeeeEccCCCEEEEeecCceEEEEeCCCCc
Confidence 4567766 999999654 122233 2333222 3457899999999999999999999996543
No 125
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.19 E-value=7e-10 Score=127.45 Aligned_cols=206 Identities=17% Similarity=0.262 Sum_probs=135.9
Q ss_pred CEEEEEECCCCcEEEEEeCCCC---EEE-EEE--c-C-CEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCc
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSP---IYS-VRC--S-S-RVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~---V~s-V~~--S-~-r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
++++||+-+.++++.+..|..+ |-. +++ + + ++++.+.|..|.+|...+.+.+++|.+|..-.++-. +
T Consensus 35 ~t~~vw~~~~~~~l~~~~~~~~~g~i~~~i~y~e~~~~~l~~g~~D~~i~v~~~~~~~P~~~LkgH~snVC~ls-----~ 109 (745)
T KOG0301|consen 35 GTVKVWAKKGKQYLETHAFEGPKGFIANSICYAESDKGRLVVGGMDTTIIVFKLSQAEPLYTLKGHKSNVCSLS-----I 109 (745)
T ss_pred CceeeeeccCcccccceecccCcceeeccceeccccCcceEeecccceEEEEecCCCCchhhhhccccceeeee-----c
Confidence 7899999999998886665332 222 333 2 2 455557899999999999999999999854211100 0
Q ss_pred ccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccc
Q 003310 189 GYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSE 268 (832)
Q Consensus 189 ~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~ 268 (832)
+-.+.-++ +|=+.++..|. .|+..+.
T Consensus 110 ~~~~~~iS---------------------------------gSWD~TakvW~-------------~~~l~~~-------- 135 (745)
T KOG0301|consen 110 GEDGTLIS---------------------------------GSWDSTAKVWR-------------IGELVYS-------- 135 (745)
T ss_pred CCcCceEe---------------------------------cccccceEEec-------------chhhhcc--------
Confidence 00000011 01112221111 1222211
Q ss_pred ccCCCc-CccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEE
Q 003310 269 FLPDSQ-NSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNIN 347 (832)
Q Consensus 269 ~~p~~~-~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~ 347 (832)
++ ++ .+++.+-.. .-.++++++.|.+|++|.- ++.+.+|.+|+.-|..|++=+++. +++|+.||. |+
T Consensus 136 -l~-gH~asVWAv~~l------~e~~~vTgsaDKtIklWk~--~~~l~tf~gHtD~VRgL~vl~~~~-flScsNDg~-Ir 203 (745)
T KOG0301|consen 136 -LQ-GHTASVWAVASL------PENTYVTGSADKTIKLWKG--GTLLKTFSGHTDCVRGLAVLDDSH-FLSCSNDGS-IR 203 (745)
T ss_pred -cC-Ccchheeeeeec------CCCcEEeccCcceeeeccC--CchhhhhccchhheeeeEEecCCC-eEeecCCce-EE
Confidence 01 11 112211110 0125778899999999975 788999999999999999988876 679999997 99
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
.|++. + ..+++++ ||++ -|++|+..+++..+++++.|+|++||+..
T Consensus 204 ~w~~~-g--------------e~l~~~~-ghtn-~vYsis~~~~~~~Ivs~gEDrtlriW~~~ 249 (745)
T KOG0301|consen 204 LWDLD-G--------------EVLLEMH-GHTN-FVYSISMALSDGLIVSTGEDRTLRIWKKD 249 (745)
T ss_pred EEecc-C--------------ceeeeee-ccce-EEEEEEecCCCCeEEEecCCceEEEeecC
Confidence 99993 3 4666664 6654 49999988899999999999999999987
No 126
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.18 E-value=3.3e-10 Score=120.14 Aligned_cols=110 Identities=15% Similarity=0.258 Sum_probs=90.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
.+++.-|.+..+||++++.++..|.+|.+..+.++-.|.-++++|+|.|-+ +++||+++. ...+..
T Consensus 287 ~vTaSWDRTAnlwDVEtge~v~~LtGHd~ELtHcstHptQrLVvTsSrDtT-FRLWDFRea-------------I~sV~V 352 (481)
T KOG0300|consen 287 MVTASWDRTANLWDVETGEVVNILTGHDSELTHCSTHPTQRLVVTSSRDTT-FRLWDFREA-------------IQSVAV 352 (481)
T ss_pred eeeeeccccceeeeeccCceeccccCcchhccccccCCcceEEEEeccCce-eEeccchhh-------------cceeee
Confidence 456677889999999999999999999999999999999999999999965 999999875 122333
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc-eeecc
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS-VNFQP 420 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~-~~~~~ 420 (832)
| .|++. .|.++.|..|. .+++||+|.||+|||+..-..+ .+++.
T Consensus 353 F-QGHtd-tVTS~vF~~dd-~vVSgSDDrTvKvWdLrNMRsplATIRt 397 (481)
T KOG0300|consen 353 F-QGHTD-TVTSVVFNTDD-RVVSGSDDRTVKVWDLRNMRSPLATIRT 397 (481)
T ss_pred e-ccccc-ceeEEEEecCC-ceeecCCCceEEEeeeccccCcceeeec
Confidence 4 57764 49999999886 4679999999999999765444 35543
No 127
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.18 E-value=5.6e-10 Score=119.03 Aligned_cols=108 Identities=16% Similarity=0.223 Sum_probs=84.5
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC--c
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL--T 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~--t 379 (832)
.+-||.-....++..+-+|.+.|+.|+|-+||..|.+|+...-.|..||++.. ...+|.|.|.. |
T Consensus 231 ~~giy~~~~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~-------------~~pv~~L~rhv~~T 297 (406)
T KOG2919|consen 231 RVGIYNDDGRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYS-------------RDPVYALERHVGDT 297 (406)
T ss_pred eeeeEecCCCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhc-------------cchhhhhhhhccCc
Confidence 45666666788999999999999999999999999999887667999999874 34567776543 3
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCcee-eccCCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVN-FQPTDA 423 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~-~~~H~~ 423 (832)
+-+|+ ....|+|+|||+|+.||.|++||+..++..+. +..|.+
T Consensus 298 NQRI~-FDld~~~~~LasG~tdG~V~vwdlk~~gn~~sv~~~~sd 341 (406)
T KOG2919|consen 298 NQRIL-FDLDPKGEILASGDTDGSVRVWDLKDLGNEVSVTGNYSD 341 (406)
T ss_pred cceEE-EecCCCCceeeccCCCccEEEEecCCCCCcccccccccc
Confidence 33344 34478999999999999999999999887543 344433
No 128
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.18 E-value=1.2e-09 Score=120.75 Aligned_cols=177 Identities=17% Similarity=0.190 Sum_probs=129.6
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
+.|.|||.+|.+.++.++. +..|.+++|- .++++.+.|..|++|++..+..+-++-+|+.. .
T Consensus 224 ~~v~Iw~~~t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~~s~vetlyGHqd~--------------v 289 (479)
T KOG0299|consen 224 RHVQIWDCDTLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQLSYVETLYGHQDG--------------V 289 (479)
T ss_pred ceEEEecCcccchhhcccccccceeeeeeecCccceeeeecCCceEEEehhHhHHHHHHhCCccc--------------e
Confidence 7899999999999999875 7899999994 57888899999999999988877777666431 1
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
++++. .+ -+..
T Consensus 290 ~~Ida----L~----------------------------reR~------------------------------------- 300 (479)
T KOG0299|consen 290 LGIDA----LS----------------------------RERC------------------------------------- 300 (479)
T ss_pred eeech----hc----------------------------ccce-------------------------------------
Confidence 11100 00 0000
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
+..++.|.+++||++... ..-.|.+|.+.|-|++|=.+ ..++|||++|. |-+|++.
T Consensus 301 ---------------------vtVGgrDrT~rlwKi~ee-sqlifrg~~~sidcv~~In~-~HfvsGSdnG~-IaLWs~~ 356 (479)
T KOG0299|consen 301 ---------------------VTVGGRDRTVRLWKIPEE-SQLIFRGGEGSIDCVAFIND-EHFVSGSDNGS-IALWSLL 356 (479)
T ss_pred ---------------------EEeccccceeEEEecccc-ceeeeeCCCCCeeeEEEecc-cceeeccCCce-EEEeeec
Confidence 001346889999999544 33578899999999999554 56899999998 8999987
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCc--------cccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLT--------NAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t--------~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.-.+ .+++.+-.|.. +..|.+++-.|.+.++|+||-+|.|++|.++++
T Consensus 357 KKkp------------lf~~~~AHgv~~~~~~~~~~~Witsla~i~~sdL~asGS~~G~vrLW~i~~g 412 (479)
T KOG0299|consen 357 KKKP------------LFTSRLAHGVIPELDPVNGNFWITSLAVIPGSDLLASGSWSGCVRLWKIEDG 412 (479)
T ss_pred ccCc------------eeEeeccccccCCccccccccceeeeEecccCceEEecCCCCceEEEEecCC
Confidence 6411 11111111111 126999999999999999999999999999875
No 129
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.17 E-value=1.6e-10 Score=126.17 Aligned_cols=185 Identities=16% Similarity=0.241 Sum_probs=126.8
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
.++++.+.+.++..||++.. .....-.-++-.|.+++|.+++. -+++++.+
T Consensus 325 ~~~V~Gs~dr~i~~wdlDgn-~~~~W~gvr~~~v~dlait~Dgk-------------~vl~v~~d--------------- 375 (519)
T KOG0293|consen 325 FRFVTGSPDRTIIMWDLDGN-ILGNWEGVRDPKVHDLAITYDGK-------------YVLLVTVD--------------- 375 (519)
T ss_pred ceeEecCCCCcEEEecCCcc-hhhcccccccceeEEEEEcCCCc-------------EEEEEecc---------------
Confidence 45666666777899998752 11112223445577777776431 13444332
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecC
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTN 174 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~ 174 (832)
..+++|+..+..++..+.-..+|.++.+| ++++.| ..+..|++||+..-..++...+|
T Consensus 376 -------------------~~i~l~~~e~~~dr~lise~~~its~~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~Gh 436 (519)
T KOG0293|consen 376 -------------------KKIRLYNREARVDRGLISEEQPITSFSISKDGKLALVNLQDQEIHLWDLEENKLVRKYFGH 436 (519)
T ss_pred -------------------cceeeechhhhhhhccccccCceeEEEEcCCCcEEEEEcccCeeEEeecchhhHHHHhhcc
Confidence 56999999998888877888899999997 466666 56789999999865544444443
Q ss_pred CCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeec
Q 003310 175 PIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNL 254 (832)
Q Consensus 175 ~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~l 254 (832)
.. + -+.+ .+..| |
T Consensus 437 kq--~------------~fiI-------------rSCFg-------------------g--------------------- 449 (519)
T KOG0293|consen 437 KQ--G------------HFII-------------RSCFG-------------------G--------------------- 449 (519)
T ss_pred cc--c------------ceEE-------------EeccC-------------------C---------------------
Confidence 21 0 0100 00000 0
Q ss_pred cCccccccccccccccCCCcCccccccCCCCCCCccc-ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC-C
Q 003310 255 GDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVN-GHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP-S 332 (832)
Q Consensus 255 Gd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~-g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP-d 332 (832)
.+ .-++++..|+.|+||+..++++++.+.+|...|+|++++| |
T Consensus 450 -----------------------------------~~~~fiaSGSED~kvyIWhr~sgkll~~LsGHs~~vNcVswNP~~ 494 (519)
T KOG0293|consen 450 -----------------------------------GNDKFIASGSEDSKVYIWHRISGKLLAVLSGHSKTVNCVSWNPAD 494 (519)
T ss_pred -----------------------------------CCcceEEecCCCceEEEEEccCCceeEeecCCcceeeEEecCCCC
Confidence 00 0124567899999999999999999999999999999999 5
Q ss_pred CCEEEEEEcCCCEEEEEeCCC
Q 003310 333 GILLVTASVQGHNINIFKIIP 353 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t 353 (832)
-.+||+||+||+ ||||-..+
T Consensus 495 p~m~ASasDDgt-IRIWg~~~ 514 (519)
T KOG0293|consen 495 PEMFASASDDGT-IRIWGPSD 514 (519)
T ss_pred HHHhhccCCCCe-EEEecCCc
Confidence 568999999998 99997654
No 130
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.17 E-value=8.4e-09 Score=118.58 Aligned_cols=101 Identities=16% Similarity=0.217 Sum_probs=85.3
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE--EEe
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY--RLQ 375 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~--~l~ 375 (832)
+-+|.|.-||+.+++++..+....++|.+++.+|.++.|+.+++|| .+..+++.++ ...| .|.
T Consensus 87 g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~~~~l~IgcddG-vl~~~s~~p~--------------~I~~~r~l~ 151 (691)
T KOG2048|consen 87 GLSGSITEWDLHTLKQKYNIDSNGGAIWSIAINPENTILAIGCDDG-VLYDFSIGPD--------------KITYKRSLM 151 (691)
T ss_pred cCCceEEEEecccCceeEEecCCCcceeEEEeCCccceEEeecCCc-eEEEEecCCc--------------eEEEEeecc
Confidence 4568899999999999999999999999999999999999999999 4788888776 2222 233
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
| ..++|.+++|+|++..||+||.||.|++||+......
T Consensus 152 r--q~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~ 189 (691)
T KOG2048|consen 152 R--QKSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTL 189 (691)
T ss_pred c--ccceEEEEEecCCccEEEecccCceEEEEEcCCCceE
Confidence 3 2467999999999999999999999999999875433
No 131
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.17 E-value=2.5e-10 Score=123.69 Aligned_cols=115 Identities=17% Similarity=0.279 Sum_probs=87.5
Q ss_pred ccccCCCCeEEEEECCCCcE---EEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCcee
Q 003310 294 FPDADNVGMVIVRDIVSKNV---IAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~---l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
+.+++..+.|++|...++.- ...|.+|+..|-.|+|||. ...|||||.||+ |+|||++.++.. .
T Consensus 227 LlsGDc~~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~Dgs-IrIWDiRs~~~~-----------~ 294 (440)
T KOG0302|consen 227 LLSGDCVKGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGS-IRIWDIRSGPKK-----------A 294 (440)
T ss_pred cccCccccceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecCce-EEEEEecCCCcc-----------c
Confidence 45567778899999887642 2346689999999999996 558999999998 999999986211 1
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc---eeeccCCC
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS---VNFQPTDA 423 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~---~~~~~H~~ 423 (832)
.+.+ . -+.+.|+-|+|+.+-.+||+|++|||++||||..+... ..|+.|..
T Consensus 295 ~~~~--k-Ah~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~~~~~pVA~fk~Hk~ 348 (440)
T KOG0302|consen 295 AVST--K-AHNSDVNVISWNRREPLLASGGDDGTLSIWDLRQFKSGQPVATFKYHKA 348 (440)
T ss_pred eeEe--e-ccCCceeeEEccCCcceeeecCCCceEEEEEhhhccCCCcceeEEeccC
Confidence 2222 1 23457999999999889999999999999999875443 35666643
No 132
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.16 E-value=2.6e-09 Score=110.47 Aligned_cols=248 Identities=17% Similarity=0.234 Sum_probs=160.3
Q ss_pred CcEEEEEecCCeEEEEeccCC--CeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCc
Q 003310 18 RRVLLLGYRSGFQVWDVEEAD--NVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGL 95 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~--~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~ 95 (832)
+++..++.+..++|+.+...+ .....|..|.|||--+.+.. | .| -.+||-|+
T Consensus 24 krlATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wah-P--------k~---G~iLAScs-------------- 77 (299)
T KOG1332|consen 24 KRLATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAH-P--------KF---GTILASCS-------------- 77 (299)
T ss_pred ceeeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecc-c--------cc---CcEeeEee--------------
Confidence 566666777779999999866 34444567999999888862 2 11 12566554
Q ss_pred ccccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC---CCCEEEEEEcC----CEEEE-EeCCEEEEEECCCC-c
Q 003310 96 ATACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF---RSPIYSVRCSS----RVVAI-CQAAQVHCFDAATL-E 166 (832)
Q Consensus 96 ~~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f---~s~V~sV~~S~----r~LAV-a~~~~I~vwDl~t~-~ 166 (832)
+++.|.||.-..|+.-+...+ .+.|.+|++-+ -+||. +.|+.|.|++..+- .
T Consensus 78 -------------------YDgkVIiWke~~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~ 138 (299)
T KOG1332|consen 78 -------------------YDGKVIIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGG 138 (299)
T ss_pred -------------------cCceEEEEecCCCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCCC
Confidence 347899999988865444443 67899999964 45665 68999999987753 2
Q ss_pred -eEEEE-ecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccc
Q 003310 167 -IEYAI-LTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESS 244 (832)
Q Consensus 167 -~~~tl-~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ss 244 (832)
....+ ..|+ ++.+.+.. + |... .|..+
T Consensus 139 w~t~ki~~aH~------------~GvnsVsw-------a-------------pa~~-----------~g~~~-------- 167 (299)
T KOG1332|consen 139 WTTSKIVFAHE------------IGVNSVSW-------A-------------PASA-----------PGSLV-------- 167 (299)
T ss_pred ccchhhhhccc------------cccceeee-------c-------------CcCC-----------Ccccc--------
Confidence 11111 1121 23333321 1 1000 01111
Q ss_pred cceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCc--EEEEeccCCC
Q 003310 245 KHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKN--VIAQFRAHKS 322 (832)
Q Consensus 245 k~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~--~l~~~~aH~~ 322 (832)
+.+ .+.+ ...|++++.|..|+||+..+++ .-.+|++|+.
T Consensus 168 -----------~~~--------------------~~~~--------~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~d 208 (299)
T KOG1332|consen 168 -----------DQG--------------------PAAK--------VKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKD 208 (299)
T ss_pred -----------ccC--------------------cccc--------cceeeccCCccceeeeecCCcchhhhhhhhhcch
Confidence 000 0000 0135667889999999998853 3345899999
Q ss_pred CeEEEEEcCCC----CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE
Q 003310 323 PISALCFDPSG----ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS 398 (832)
Q Consensus 323 pIs~LaFSPdG----~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg 398 (832)
-|..+++.|.- .+||++|.||+ +.||...... ..+ ....|.+| +..++.++||..|+.||.+
T Consensus 209 wVRDVAwaP~~gl~~s~iAS~SqDg~-viIwt~~~e~------e~w--k~tll~~f-----~~~~w~vSWS~sGn~LaVs 274 (299)
T KOG1332|consen 209 WVRDVAWAPSVGLPKSTIASCSQDGT-VIIWTKDEEY------EPW--KKTLLEEF-----PDVVWRVSWSLSGNILAVS 274 (299)
T ss_pred hhhhhhhccccCCCceeeEEecCCCc-EEEEEecCcc------Ccc--cccccccC-----CcceEEEEEeccccEEEEe
Confidence 99999999964 47999999999 6799776320 001 01122222 2349999999999999999
Q ss_pred eCCCcEEEEecCCCCC
Q 003310 399 SSRGTSHLFAINPLGG 414 (832)
Q Consensus 399 S~DgTVhIwdl~~~g~ 414 (832)
..|..|.+|.=+..|.
T Consensus 275 ~GdNkvtlwke~~~Gk 290 (299)
T KOG1332|consen 275 GGDNKVTLWKENVDGK 290 (299)
T ss_pred cCCcEEEEEEeCCCCc
Confidence 9999999998765544
No 133
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.15 E-value=1.4e-08 Score=108.12 Aligned_cols=107 Identities=18% Similarity=0.278 Sum_probs=77.2
Q ss_pred CCeEEEEECCCC----cEEEEeccCCCCeEEEEEcCC-C---CEEEEEEcCCCEEEEEeCCCCCC-----CCC-CccCCC
Q 003310 300 VGMVIVRDIVSK----NVIAQFRAHKSPISALCFDPS-G---ILLVTASVQGHNINIFKIIPGIL-----GTS-SACDAG 365 (832)
Q Consensus 300 ~G~V~IwDl~s~----~~l~~~~aH~~pIs~LaFSPd-G---~lLATaS~dGt~I~Iwdi~t~~~-----~~~-s~~~~~ 365 (832)
-+.++||..... ..++++..|+.||..|+|.|+ | .+||+|+.|| |+||.+..... +.. +..-..
T Consensus 198 ~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~kDg--v~I~~v~~~~s~i~~ee~~~~~~~~~ 275 (361)
T KOG2445|consen 198 LNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATKDG--VRIFKVKVARSAIEEEEVLAPDLMTD 275 (361)
T ss_pred ccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeEEEeecCc--EEEEEEeeccchhhhhcccCCCCccc
Confidence 456888876543 367788899999999999996 4 3799999998 89999985311 000 000111
Q ss_pred CceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 366 TSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 366 ~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
..++.+..+ +.++..|+.+.|.--|..|++.++||+|++|..+
T Consensus 276 l~v~~vs~~--~~H~~~VWrv~wNmtGtiLsStGdDG~VRLWkan 318 (361)
T KOG2445|consen 276 LPVEKVSEL--DDHNGEVWRVRWNMTGTILSSTGDDGCVRLWKAN 318 (361)
T ss_pred cceEEeeec--cCCCCceEEEEEeeeeeEEeecCCCceeeehhhh
Confidence 233333333 3345679999999999999999999999999864
No 134
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.15 E-value=8.1e-10 Score=117.55 Aligned_cols=107 Identities=13% Similarity=0.220 Sum_probs=86.6
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
++++..+-.|.|||+. |+.+.++......-...+.||+|++||+++..-. ++||.+--.. +|.-..+.+.+.
T Consensus 202 imsas~dt~i~lw~lk-Gq~L~~idtnq~~n~~aavSP~GRFia~~gFTpD-VkVwE~~f~k------dG~fqev~rvf~ 273 (420)
T KOG2096|consen 202 IMSASLDTKICLWDLK-GQLLQSIDTNQSSNYDAAVSPDGRFIAVSGFTPD-VKVWEPIFTK------DGTFQEVKRVFS 273 (420)
T ss_pred EEEecCCCcEEEEecC-CceeeeeccccccccceeeCCCCcEEEEecCCCC-ceEEEEEecc------Ccchhhhhhhhe
Confidence 3566778899999999 8899999888777788899999999999998765 8999874321 111234566777
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
| .|+..+ |..+|||++++.+++.|.||+++|||++
T Consensus 274 L-kGH~sa-V~~~aFsn~S~r~vtvSkDG~wriwdtd 308 (420)
T KOG2096|consen 274 L-KGHQSA-VLAAAFSNSSTRAVTVSKDGKWRIWDTD 308 (420)
T ss_pred e-ccchhh-eeeeeeCCCcceeEEEecCCcEEEeecc
Confidence 7 577554 9999999999999999999999999985
No 135
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.14 E-value=5.4e-09 Score=110.43 Aligned_cols=136 Identities=18% Similarity=0.258 Sum_probs=101.5
Q ss_pred CCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC-----CEEEEEe-CCEEEEEECCCCceEEEEecCCCccCCCCCCCCCc
Q 003310 115 VPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS-----RVVAICQ-AAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~-----r~LAVa~-~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
-+.++|+||..|-+.+..++++..||+-++++ -+||++. +-+|++.|+..+.+-++|.+|-.
T Consensus 122 FDhtlKVWDtnTlQ~a~~F~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~------------ 189 (397)
T KOG4283|consen 122 FDHTLKVWDTNTLQEAVDFKMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFSHTLSGHRD------------ 189 (397)
T ss_pred ccceEEEeecccceeeEEeecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcceeeeccccC------------
Confidence 35899999999999999999999999998874 3566654 45999999999999999988732
Q ss_pred ccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccc
Q 003310 189 GYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSE 268 (832)
Q Consensus 189 ~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~ 268 (832)
+.+|+. ++ |. ...
T Consensus 190 --~vlaV~-----Ws-------------p~--------------~e~--------------------------------- 202 (397)
T KOG4283|consen 190 --GVLAVE-----WS-------------PS--------------SEW--------------------------------- 202 (397)
T ss_pred --ceEEEE-----ec-------------cC--------------cee---------------------------------
Confidence 234431 11 00 000
Q ss_pred ccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCC-cE--------------EEEeccCCCCeEEEEEcCCC
Q 003310 269 FLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSK-NV--------------IAQFRAHKSPISALCFDPSG 333 (832)
Q Consensus 269 ~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~-~~--------------l~~~~aH~~pIs~LaFSPdG 333 (832)
.++++..||.|++||++.. .+ +.+-.+|.+.|..+||+.||
T Consensus 203 ------------------------vLatgsaDg~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tSd~ 258 (397)
T KOG4283|consen 203 ------------------------VLATGSADGAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTSDA 258 (397)
T ss_pred ------------------------EEEecCCCceEEEEEeecccceeEEeecccCccCccccccccccceeeeeeecccc
Confidence 1234566788888888642 11 12234788999999999999
Q ss_pred CEEEEEEcCCCEEEEEeCCCC
Q 003310 334 ILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 334 ~lLATaS~dGt~I~Iwdi~t~ 354 (832)
.+|+++..|.+ +++|+...|
T Consensus 259 ~~l~~~gtd~r-~r~wn~~~G 278 (397)
T KOG4283|consen 259 RYLASCGTDDR-IRVWNMESG 278 (397)
T ss_pred hhhhhccCccc-eEEeecccC
Confidence 99999999977 899998776
No 136
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.14 E-value=2.2e-09 Score=114.34 Aligned_cols=107 Identities=16% Similarity=0.202 Sum_probs=91.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC--EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI--LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~--lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
++++.|-+|+|||+.+...+..+-.|.+.|++|.|.+.-. .|++|++||+ |.|||..+. ..+.
T Consensus 57 aSGssDetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdDG~-i~iw~~~~W--------------~~~~ 121 (362)
T KOG0294|consen 57 ASGSSDETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDDGH-IIIWRVGSW--------------ELLK 121 (362)
T ss_pred eccCCCCcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCCCc-EEEEEcCCe--------------EEee
Confidence 4556788999999999999999999999999999999876 8999999998 889999765 5566
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
.+ +++.. .|++|+..|.|++..+.+.|++++.|++-.+..-...
T Consensus 122 sl-K~H~~-~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~ 165 (362)
T KOG0294|consen 122 SL-KAHKG-QVTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVL 165 (362)
T ss_pred ee-ccccc-ccceeEecCCCceEEEEcCCceeeeehhhcCccceee
Confidence 66 56654 4999999999999999999999999999876554443
No 137
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.13 E-value=1.4e-09 Score=116.24 Aligned_cols=250 Identities=16% Similarity=0.228 Sum_probs=150.2
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCccc
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLAT 97 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~ 97 (832)
+-+.+.||.+-++|.|+.. +.+..-+-.|.+.|..+++.|. +|.|+++..
T Consensus 106 p~la~~G~~GvIrVid~~~-~~~~~~~~ghG~sINeik~~p~--------------~~qlvls~S--------------- 155 (385)
T KOG1034|consen 106 PFLAAGGYLGVIRVIDVVS-GQCSKNYRGHGGSINEIKFHPD--------------RPQLVLSAS--------------- 155 (385)
T ss_pred eeEEeecceeEEEEEecch-hhhccceeccCccchhhhcCCC--------------CCcEEEEec---------------
Confidence 3445556666688888876 4455555567888888887763 345555542
Q ss_pred ccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEe----CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEE
Q 003310 98 ACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLK----FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 98 ~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~----f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~t 170 (832)
.+.+||+|+++++.||..|. ++..|.+|.|+ .++++. +.|-+|.+|++...+....
T Consensus 156 -----------------kD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~~~~f~~~ 218 (385)
T KOG1034|consen 156 -----------------KDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLNVKEFKNK 218 (385)
T ss_pred -----------------CCceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecChhHHhhh
Confidence 23789999999999999986 47899999997 466666 6899999999985432222
Q ss_pred EecC----CCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccc
Q 003310 171 ILTN----PIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKH 246 (832)
Q Consensus 171 l~t~----~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~ 246 (832)
++.. +++.. .|| + .....-+.|+ .++..+.
T Consensus 219 lE~s~~~~~~~t~-----------~pf---------p--------------t~~~~fp~fs------------t~diHrn 252 (385)
T KOG1034|consen 219 LELSITYSPNKTT-----------RPF---------P--------------TPKTHFPDFS------------TTDIHRN 252 (385)
T ss_pred hhhhcccCCCCcc-----------CcC---------C--------------cccccccccc------------ccccccc
Confidence 2211 11100 011 0 0000000010 0011111
Q ss_pred eeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC--------------Cc
Q 003310 247 LAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS--------------KN 312 (832)
Q Consensus 247 lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s--------------~~ 312 (832)
-+..++++|+.. .+.+.++.|..|.... -.
T Consensus 253 yVDCvrw~gd~i------------------------------------lSkscenaI~~w~pgkl~e~~~~vkp~es~~T 296 (385)
T KOG1034|consen 253 YVDCVRWFGDFI------------------------------------LSKSCENAIVCWKPGKLEESIHNVKPPESATT 296 (385)
T ss_pred hHHHHHHHhhhe------------------------------------eecccCceEEEEecchhhhhhhccCCCcccee
Confidence 122334445432 1122333455554410 12
Q ss_pred EEEEeccCCCCeEEEE--EcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc
Q 003310 313 VIAQFRAHKSPISALC--FDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD 390 (832)
Q Consensus 313 ~l~~~~aH~~pIs~La--FSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp 390 (832)
.+.+|+-....|.-|. |+|-+++||.+...|. +.+||+....+. +.-++......+.|+..+||.
T Consensus 297 i~~~~~~~~c~iWfirf~~d~~~~~la~gnq~g~-v~vwdL~~~ep~------------~~ttl~~s~~~~tVRQ~sfS~ 363 (385)
T KOG1034|consen 297 ILGEFDYPMCDIWFIRFAFDPWQKMLALGNQSGK-VYVWDLDNNEPP------------KCTTLTHSKSGSTVRQTSFSR 363 (385)
T ss_pred eeeEeccCccceEEEEEeecHHHHHHhhccCCCc-EEEEECCCCCCc------------cCceEEeccccceeeeeeecc
Confidence 4555655555666665 4666999999999998 899999875211 111111111234699999999
Q ss_pred CCCEEEEEeCCCcEEEEec
Q 003310 391 DSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 391 Dg~~LAsgS~DgTVhIwdl 409 (832)
||.+|+...+|+||--||.
T Consensus 364 dgs~lv~vcdd~~Vwrwdr 382 (385)
T KOG1034|consen 364 DGSILVLVCDDGTVWRWDR 382 (385)
T ss_pred cCcEEEEEeCCCcEEEEEe
Confidence 9999999999999998885
No 138
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.13 E-value=3.9e-10 Score=121.68 Aligned_cols=218 Identities=13% Similarity=0.219 Sum_probs=137.6
Q ss_pred CCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEc-CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccc
Q 003310 114 SVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCS-SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 114 ~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S-~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
+.++.|+||||.+.+|..+|+. .+.|..|++. ..++.++.|++|+.|-+.- ..++++.+-....++- .+ -..+
T Consensus 86 s~DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~~~~~tvgdDKtvK~wk~~~-~p~~tilg~s~~~gId-h~---~~~~ 160 (433)
T KOG0268|consen 86 SCDGEVKIWNLSQRECIRTFKAHEGLVRGICVTQTSFFTVGDDKTVKQWKIDG-PPLHTILGKSVYLGID-HH---RKNS 160 (433)
T ss_pred ccCceEEEEehhhhhhhheeecccCceeeEEecccceEEecCCcceeeeeccC-Ccceeeeccccccccc-cc---cccc
Confidence 3679999999999999999987 5699999996 5677788999999998764 3566665432211110 00 0000
Q ss_pred eeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccC
Q 003310 192 PLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLP 271 (832)
Q Consensus 192 p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p 271 (832)
. .|.++...-.|..-|-.|- .. + .+|+.++..
T Consensus 161 ~-------FaTcGe~i~IWD~~R~~Pv------------------~s------------m----swG~Dti~s------- 192 (433)
T KOG0268|consen 161 V-------FATCGEQIDIWDEQRDNPV------------------SS------------M----SWGADSISS------- 192 (433)
T ss_pred c-------ccccCceeeecccccCCcc------------------ce------------e----ecCCCceeE-------
Confidence 1 1122212222221111110 00 0 001111100
Q ss_pred CCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 272 DSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 272 ~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
+ ..+|.- ...++++..|+.|.|||+...+++..+-- +..-+.|||+|.+--+++|+.|-. +..||+
T Consensus 193 -----v-kfNpvE------TsILas~~sDrsIvLyD~R~~~Pl~KVi~-~mRTN~IswnPeafnF~~a~ED~n-lY~~Dm 258 (433)
T KOG0268|consen 193 -----V-KFNPVE------TSILASCASDRSIVLYDLRQASPLKKVIL-TMRTNTICWNPEAFNFVAANEDHN-LYTYDM 258 (433)
T ss_pred -----E-ecCCCc------chheeeeccCCceEEEecccCCccceeee-eccccceecCccccceeecccccc-ceehhh
Confidence 0 011110 11234567899999999999887765432 223467999998888999999954 889998
Q ss_pred CCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 352 IPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 352 ~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
+.- .+-...++++..| |.+|+|||.|+-+++||-|.||+||.++...
T Consensus 259 R~l--------------~~p~~v~~dhvsA-V~dVdfsptG~EfvsgsyDksIRIf~~~~~~ 305 (433)
T KOG0268|consen 259 RNL--------------SRPLNVHKDHVSA-VMDVDFSPTGQEFVSGSYDKSIRIFPVNHGH 305 (433)
T ss_pred hhh--------------cccchhhccccee-EEEeccCCCcchhccccccceEEEeecCCCc
Confidence 754 1212345787766 9999999999999999999999999998653
No 139
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.11 E-value=1.1e-08 Score=104.54 Aligned_cols=101 Identities=20% Similarity=0.334 Sum_probs=85.0
Q ss_pred cccccCCCCeEEEEECCCCcEEEEec--cC-----CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFR--AH-----KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAG 365 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~--aH-----~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~ 365 (832)
.|+++.+|.+|+.||++-..++.++. -| .+.|.+++.+|.|++||++-.|.. ..+||++-+
T Consensus 196 m~~sgsqdktirfwdlrv~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~dss-c~lydirg~----------- 263 (350)
T KOG0641|consen 196 MFASGSQDKTIRFWDLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHADSS-CMLYDIRGG----------- 263 (350)
T ss_pred EEEccCCCceEEEEeeeccceeeeccCcccCCCcccceeEEEEECCCcceeeeccCCCc-eEEEEeeCC-----------
Confidence 35567788999999999888888775 22 267999999999999999999976 789999876
Q ss_pred CceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 366 TSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 366 ~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
+.+.+++- +.+.|.++-|||...|+.++|-|..|++=|+.
T Consensus 264 ---r~iq~f~p--hsadir~vrfsp~a~yllt~syd~~ikltdlq 303 (350)
T KOG0641|consen 264 ---RMIQRFHP--HSADIRCVRFSPGAHYLLTCSYDMKIKLTDLQ 303 (350)
T ss_pred ---ceeeeeCC--CccceeEEEeCCCceEEEEecccceEEEeecc
Confidence 56666642 35679999999999999999999999999885
No 140
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.11 E-value=5.8e-09 Score=123.18 Aligned_cols=173 Identities=16% Similarity=0.234 Sum_probs=120.8
Q ss_pred CCEEEEEECCCCcEEEEEe-CCCCEEEEEEc--CCEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccc
Q 003310 116 PTVVHFYSLRSQSYVHMLK-FRSPIYSVRCS--SRVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYG 191 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~-f~s~V~sV~~S--~r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~ 191 (832)
+..|++-++.+......++ +..+|..|.++ .++||| +.+++|+|||+.++.+.+++..-.-. +
T Consensus 117 D~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~-------------n 183 (933)
T KOG1274|consen 117 DTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKD-------------N 183 (933)
T ss_pred ceeEEEEeccccchheeecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCcc-------------c
Confidence 3789999999888777776 57899999997 478888 57999999999999988887543110 0
Q ss_pred eeeecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 192 PLAVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 192 p~Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
-+-.+. .-+|+
T Consensus 184 ~~~~s~i~~~~aW------------------------------------------------------------------- 196 (933)
T KOG1274|consen 184 EFILSRICTRLAW------------------------------------------------------------------- 196 (933)
T ss_pred cccccceeeeeee-------------------------------------------------------------------
Confidence 000000 00011
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEec--cCCCCeEEEEEcCCCCEEEEEEcCCCEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFR--AHKSPISALCFDPSGILLVTASVQGHNIN 347 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~--aH~~pIs~LaFSPdG~lLATaS~dGt~I~ 347 (832)
.|+ +|+++-...++.|++|+..+......++ -|.+.+++++|||.|+|||+++.||. |-
T Consensus 197 ~Pk------------------~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~-I~ 257 (933)
T KOG1274|consen 197 HPK------------------GGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQ-IL 257 (933)
T ss_pred cCC------------------CCeEEeeccCCeEEEEccCCceeheeecccccccceEEEEEcCCCcEEeeeccCCc-EE
Confidence 111 1234445567899999999988877776 34445999999999999999999998 88
Q ss_pred EEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEE
Q 003310 348 IFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLF 407 (832)
Q Consensus 348 Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIw 407 (832)
|||+.+. .+| .++ ..|.+++|-|++.-+-.-..-|+.-+|
T Consensus 258 vWnv~t~-------------~~~--~~~-----~~Vc~~aw~p~~n~it~~~~~g~~~~~ 297 (933)
T KOG1274|consen 258 VWNVDTH-------------ERH--EFK-----RAVCCEAWKPNANAITLITALGTLGVS 297 (933)
T ss_pred EEecccc-------------hhc--ccc-----ceeEEEecCCCCCeeEEEeeccccccC
Confidence 9999874 011 221 238899999988876555555544433
No 141
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.11 E-value=6.9e-10 Score=122.93 Aligned_cols=106 Identities=19% Similarity=0.247 Sum_probs=83.7
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
..|.+++|.+.+|..+..+.+|-.+|+||.|+-||.+|+|||.||. |.+|++..-.. ++...+...++.+. +|
T Consensus 101 i~g~lYlWelssG~LL~v~~aHYQ~ITcL~fs~dgs~iiTgskDg~-V~vW~l~~lv~-----a~~~~~~~p~~~f~-~H 173 (476)
T KOG0646|consen 101 ISGNLYLWELSSGILLNVLSAHYQSITCLKFSDDGSHIITGSKDGA-VLVWLLTDLVS-----ADNDHSVKPLHIFS-DH 173 (476)
T ss_pred ccCcEEEEEeccccHHHHHHhhccceeEEEEeCCCcEEEecCCCcc-EEEEEEEeecc-----cccCCCccceeeec-cC
Confidence 5678999999999999999999999999999999999999999998 89998765310 11112344555553 33
Q ss_pred ccccEEEEEEcc--CCCEEEEEeCCCcEEEEecCCC
Q 003310 379 TNAVIQDISFSD--DSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 379 t~a~I~~IaFSp--Dg~~LAsgS~DgTVhIwdl~~~ 412 (832)
+ -.|.|+-..+ -..+|+++|.|.||++||++.+
T Consensus 174 t-lsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS~g 208 (476)
T KOG0646|consen 174 T-LSITDLQIGSGGTNARLYTASEDRTIKLWDLSLG 208 (476)
T ss_pred c-ceeEEEEecCCCccceEEEecCCceEEEEEeccc
Confidence 3 3488877654 4568999999999999999875
No 142
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.10 E-value=1.6e-10 Score=127.74 Aligned_cols=205 Identities=16% Similarity=0.217 Sum_probs=136.0
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEE--cCCEEEEEeCCEEEEEECCCCceEEEEecCCCccC---CCCCCCCCcccc
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRC--SSRVVAICQAAQVHCFDAATLEIEYAILTNPIVMG---HPSAGGIGIGYG 191 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~--S~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~---~p~~~~~~~~~~ 191 (832)
+.|-.+|..|+++...|.....|++|.| +.+++||++.+-++|||- .+..+++|..+..+.- +| +.+-
T Consensus 151 GHlAa~Dw~t~~L~~Ei~v~Etv~Dv~~LHneq~~AVAQK~y~yvYD~-~GtElHClk~~~~v~rLeFLP------yHfL 223 (545)
T KOG1272|consen 151 GHLAAFDWVTKKLHFEINVMETVRDVTFLHNEQFFAVAQKKYVYVYDN-NGTELHCLKRHIRVARLEFLP------YHFL 223 (545)
T ss_pred cceeeeecccceeeeeeehhhhhhhhhhhcchHHHHhhhhceEEEecC-CCcEEeehhhcCchhhhcccc------hhhe
Confidence 6788999999999999999999999999 689999999999999994 5777888877632110 01 0100
Q ss_pred eeeecc-ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccccc
Q 003310 192 PLAVGP-RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFL 270 (832)
Q Consensus 192 p~Alg~-r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~ 270 (832)
-++.+. -||-|-+ .|.|..|+...... |+.. .+
T Consensus 224 L~~~~~~G~L~Y~D-------------------------VS~GklVa~~~t~~-----------G~~~---------vm- 257 (545)
T KOG1272|consen 224 LVAASEAGFLKYQD-------------------------VSTGKLVASIRTGA-----------GRTD---------VM- 257 (545)
T ss_pred eeecccCCceEEEe-------------------------echhhhhHHHHccC-----------Cccc---------hh-
Confidence 011110 2222321 12233333222111 1100 00
Q ss_pred CCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 271 PDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 271 p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
.-+| .|+++ -.+...|+|.+|.-.+.+++..+-+|.++|++|+++|+|+|+||++.|.. ++|||
T Consensus 258 --------~qNP---~NaVi----h~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~-~kIWD 321 (545)
T KOG1272|consen 258 --------KQNP---YNAVI----HLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGLDRK-VKIWD 321 (545)
T ss_pred --------hcCC---ccceE----EEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeecccccc-eeEee
Confidence 0011 11222 23567799999999999999999999999999999999999999999965 99999
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
++.. .++.+++. +.....++||.-| .||. |--.-|+||.=
T Consensus 322 lR~~--------------~ql~t~~t---p~~a~~ls~Sqkg-lLA~-~~G~~v~iw~d 361 (545)
T KOG1272|consen 322 LRNF--------------YQLHTYRT---PHPASNLSLSQKG-LLAL-SYGDHVQIWKD 361 (545)
T ss_pred eccc--------------cccceeec---CCCcccccccccc-ceee-ecCCeeeeehh
Confidence 9876 44555432 3346789999877 3333 33446888853
No 143
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.10 E-value=1.6e-09 Score=114.87 Aligned_cols=112 Identities=18% Similarity=0.288 Sum_probs=79.9
Q ss_pred ccccCCCCeEEEEECCCCcEEEEec-cCCC-CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC-----CC--CC----
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFR-AHKS-PISALCFDPSGILLVTASVQGHNINIFKIIPGIL-----GT--SS---- 360 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~-pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~-----~~--~s---- 360 (832)
++++..||.|+|||-.+++++.+|. +|.+ .|.++.|..+|++|++.+.|.. +++|.+.++.. |. +.
T Consensus 276 YvTaSkDG~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG~DS~-vkLWEi~t~R~l~~YtGAg~tgrq~~ 354 (430)
T KOG0640|consen 276 YVTASKDGAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSGKDST-VKLWEISTGRMLKEYTGAGTTGRQKH 354 (430)
T ss_pred EEEeccCCcEEeeccccHHHHHHHHhhcCCceeeeEEEccCCeEEeecCCcce-eeeeeecCCceEEEEecCCcccchhh
Confidence 3567789999999999999999997 8875 6999999999999999999975 99999988721 11 00
Q ss_pred ----------------------ccCC---CCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 361 ----------------------ACDA---GTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 361 ----------------------~~~~---~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
...| ......++ .-|++. .+..|.-||.+.-++++|+|..++.|--
T Consensus 355 rtqAvFNhtEdyVl~pDEas~slcsWdaRtadr~~l~--slgHn~-a~R~i~HSP~~p~FmTcsdD~raRFWyr 425 (430)
T KOG0640|consen 355 RTQAVFNHTEDYVLFPDEASNSLCSWDARTADRVALL--SLGHNG-AVRWIVHSPVEPAFMTCSDDFRARFWYR 425 (430)
T ss_pred hhhhhhcCccceEEccccccCceeeccccchhhhhhc--ccCCCC-CceEEEeCCCCCceeeecccceeeeeee
Confidence 0000 01111111 224443 3677777888888888888888888853
No 144
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.10 E-value=9.8e-09 Score=109.24 Aligned_cols=187 Identities=13% Similarity=0.100 Sum_probs=122.6
Q ss_pred CCEEEEEECCCCcEEEEEeCCCCEEEEEEc---CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 116 PTVVHFYSLRSQSYVHMLKFRSPIYSVRCS---SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S---~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
++.|+.+|+.+++...-..+..+|.+|..+ ..+|+.+-|++|++||.++-....++... .
T Consensus 74 dg~vr~~Dln~~~~~~igth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~~~~~~~~d~~-k---------------- 136 (323)
T KOG1036|consen 74 DGQVRRYDLNTGNEDQIGTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRNKVVVGTFDQG-K---------------- 136 (323)
T ss_pred CceEEEEEecCCcceeeccCCCceEEEEeeccCCeEEEcccCccEEEEeccccccccccccC-c----------------
Confidence 388999999999987777788899999997 35666688999999999872222111110 0
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
. .|+ ++... +
T Consensus 137 -k------Vy~-------------------------------------~~v~g----------~---------------- 146 (323)
T KOG1036|consen 137 -K------VYC-------------------------------------MDVSG----------N---------------- 146 (323)
T ss_pred -e------EEE-------------------------------------EeccC----------C----------------
Confidence 0 011 11000 0
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEecc--CCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRA--HKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~a--H~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
.++-+..+..|.+||+++.....+.+. -.-.+.||++-|++.=.|.+|.||+ |-+=.
T Consensus 147 --------------------~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSieGR-VavE~ 205 (323)
T KOG1036|consen 147 --------------------RLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIEGR-VAVEY 205 (323)
T ss_pred --------------------EEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeecce-EEEEc
Confidence 011234566899999998765443332 2347899999999999999999998 55533
Q ss_pred CCCCCCCCCCccCCCCceeEEEEEeccCc-----cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 351 IIPGILGTSSACDAGTSYVHLYRLQRGLT-----NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 351 i~t~~~~~~s~~~~~~~~~~l~~l~rG~t-----~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+.+... .+ .....++.||-.. ...|++|+|+|--+.||+|+.||-|-+||+.+.+....|
T Consensus 206 ~d~s~~-~~-------skkyaFkCHr~~~~~~~~~yPVNai~Fhp~~~tfaTgGsDG~V~~Wd~~~rKrl~q~ 270 (323)
T KOG1036|consen 206 FDDSEE-AQ-------SKKYAFKCHRLSEKDTEIIYPVNAIAFHPIHGTFATGGSDGIVNIWDLFNRKRLKQL 270 (323)
T ss_pred cCCchH-Hh-------hhceeEEeeecccCCceEEEEeceeEeccccceEEecCCCceEEEccCcchhhhhhc
Confidence 333200 00 1122233333221 124899999999999999999999999999876655444
No 145
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.09 E-value=5e-10 Score=121.36 Aligned_cols=106 Identities=19% Similarity=0.328 Sum_probs=86.2
Q ss_pred ccccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 292 GHFPDADNVGMVIVRDIVSK---NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~---~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
+.|++++.||.|+|||++++ .++.+ ++|.+.|+.|.|+..-.+||+|++||+ ++|||++....+ ..
T Consensus 271 ~vfaScS~DgsIrIWDiRs~~~~~~~~~-kAh~sDVNVISWnr~~~lLasG~DdGt-~~iwDLR~~~~~---------~p 339 (440)
T KOG0302|consen 271 GVFASCSCDGSIRIWDIRSGPKKAAVST-KAHNSDVNVISWNRREPLLASGGDDGT-LSIWDLRQFKSG---------QP 339 (440)
T ss_pred ceEEeeecCceEEEEEecCCCccceeEe-eccCCceeeEEccCCcceeeecCCCce-EEEEEhhhccCC---------Cc
Confidence 45788899999999999987 34443 899999999999999889999999998 899999976222 12
Q ss_pred eEEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCC
Q 003310 369 VHLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~ 412 (832)
...++.+ .+.|++|.|+| +...||+++.|..|.|||+...
T Consensus 340 VA~fk~H----k~pItsieW~p~e~s~iaasg~D~QitiWDlsvE 380 (440)
T KOG0302|consen 340 VATFKYH----KAPITSIEWHPHEDSVIAASGEDNQITIWDLSVE 380 (440)
T ss_pred ceeEEec----cCCeeEEEeccccCceEEeccCCCcEEEEEeecc
Confidence 3344443 35699999998 5678899999999999999754
No 146
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.09 E-value=8.6e-09 Score=114.23 Aligned_cols=98 Identities=16% Similarity=0.307 Sum_probs=77.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
.+..|.|.|....++..+.+|+-. +.|+.++|+.||+.|..++.+|. |.+||+... .+.+.+.-..
T Consensus 321 ~G~~G~I~lLhakT~eli~s~Kie-G~v~~~~fsSdsk~l~~~~~~Ge-V~v~nl~~~------------~~~~rf~D~G 386 (514)
T KOG2055|consen 321 AGNNGHIHLLHAKTKELITSFKIE-GVVSDFTFSSDSKELLASGGTGE-VYVWNLRQN------------SCLHRFVDDG 386 (514)
T ss_pred cccCceEEeehhhhhhhhheeeec-cEEeeEEEecCCcEEEEEcCCce-EEEEecCCc------------ceEEEEeecC
Confidence 456678888888888888888753 67999999999999999999996 899999875 2233333222
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+. .=.++|.|+++.|||+||..|-|-|||.+.
T Consensus 387 ~v---~gts~~~S~ng~ylA~GS~~GiVNIYd~~s 418 (514)
T KOG2055|consen 387 SV---HGTSLCISLNGSYLATGSDSGIVNIYDGNS 418 (514)
T ss_pred cc---ceeeeeecCCCceEEeccCcceEEEeccch
Confidence 21 135788999999999999999999999765
No 147
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.08 E-value=1.1e-09 Score=124.81 Aligned_cols=114 Identities=15% Similarity=0.166 Sum_probs=76.2
Q ss_pred CCCeEEEEECCCCc------EEEEec--cC---CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCc
Q 003310 299 NVGMVIVRDIVSKN------VIAQFR--AH---KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTS 367 (832)
Q Consensus 299 ~~G~V~IwDl~s~~------~l~~~~--aH---~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~ 367 (832)
.|+.|+|||+.... ++...+ -| ...+++|..+..|++|...+.|++ |..|++.......
T Consensus 238 ~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~s-Iy~ynm~s~s~sP--------- 307 (720)
T KOG0321|consen 238 ADSTIKVWDLRKNYTAYRQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNS-IYFYNMRSLSISP--------- 307 (720)
T ss_pred CCcceEEEeecccccccccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecCCc-EEEEeccccCcCc---------
Confidence 58999999998643 222222 33 236889999999998877777887 8899987651111
Q ss_pred eeEEEEEeccCcccc-EEEEEEccCCCEEEEEeCCCcEEEEecCCCCCce-eeccCCCCcc
Q 003310 368 YVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSSRGTSHLFAINPLGGSV-NFQPTDANFT 426 (832)
Q Consensus 368 ~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~-~~~~H~~~~~ 426 (832)
..++ .|.-... -..-..|||+.+|++|+.|.-..||.+....... .+.+|+....
T Consensus 308 -~~~~---sg~~~~sf~vks~lSpd~~~l~SgSsd~~ayiw~vs~~e~~~~~l~Ght~eVt 364 (720)
T KOG0321|consen 308 -VAEF---SGKLNSSFYVKSELSPDDCSLLSGSSDEQAYIWVVSSPEAPPALLLGHTREVT 364 (720)
T ss_pred -hhhc---cCcccceeeeeeecCCCCceEeccCCCcceeeeeecCccCChhhhhCcceEEE
Confidence 0001 1111111 1223579999999999999999999999876664 4578865443
No 148
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.05 E-value=1.3e-07 Score=103.93 Aligned_cols=104 Identities=9% Similarity=0.113 Sum_probs=67.4
Q ss_pred CCeEEEEECCC--C--cEEEEeccC------CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCcee
Q 003310 300 VGMVIVRDIVS--K--NVIAQFRAH------KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 300 ~G~V~IwDl~s--~--~~l~~~~aH------~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
++.|.+||+.. + +.+..+..+ ......+.|+|||++|+++......|.+|++.... ....
T Consensus 196 ~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~----------~~~~ 265 (330)
T PRK11028 196 NSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSEDG----------SVLS 265 (330)
T ss_pred CCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCC----------CeEE
Confidence 56788888863 2 233443322 11234689999999999987655569999996530 0111
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeC-CCcEEEEecCCCCCce
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSS-RGTSHLFAINPLGGSV 416 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~~g~~~ 416 (832)
.+....-|. ....++|+|||++|+++.. +++|.||+++...+..
T Consensus 266 ~~~~~~~~~---~p~~~~~~~dg~~l~va~~~~~~v~v~~~~~~~g~l 310 (330)
T PRK11028 266 FEGHQPTET---QPRGFNIDHSGKYLIAAGQKSHHISVYEIDGETGLL 310 (330)
T ss_pred EeEEEeccc---cCCceEECCCCCEEEEEEccCCcEEEEEEcCCCCcE
Confidence 122222221 2457899999999998886 8999999997554433
No 149
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.04 E-value=2.6e-09 Score=110.49 Aligned_cols=200 Identities=17% Similarity=0.267 Sum_probs=126.5
Q ss_pred ccccCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEEcC--CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 294 FPDADNVGMVIVRDIVSK---NVIAQFRAHKSPISALCFDP--SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~---~~l~~~~aH~~pIs~LaFSP--dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
++++..|++|+|+++.+. +++.+|.+|.+||..++|-. -|++||++|.||+ |.||.-..+ .+
T Consensus 26 lATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgk-VIiWke~~g-~w----------- 92 (299)
T KOG1332|consen 26 LATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGK-VIIWKEENG-RW----------- 92 (299)
T ss_pred eeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecccccCcEeeEeecCce-EEEEecCCC-ch-----------
Confidence 567788999999999864 57899999999999999976 7999999999998 679987665 11
Q ss_pred eEEEEEeccCccccEEEEEEccC--CCEEEEEeCCCcEEEEecCCCCCce---eeccCCCCcccccCCcccccccCCCCC
Q 003310 369 VHLYRLQRGLTNAVIQDISFSDD--SNWIMISSSRGTSHLFAINPLGGSV---NFQPTDANFTTKHGAMAKSGVRWPPNL 443 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSpD--g~~LAsgS~DgTVhIwdl~~~g~~~---~~~~H~~~~~~~~~~~~~~~~r~~~~s 443 (832)
.+++.. ..+.+.|++|+|.|. |-.||++|+||+|.|+++...++-. ....|..... .+.|.|.+
T Consensus 93 ~k~~e~--~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~w~t~ki~~aH~~Gvn---------sVswapa~ 161 (299)
T KOG1332|consen 93 TKAYEH--AAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGWTTSKIVFAHEIGVN---------SVSWAPAS 161 (299)
T ss_pred hhhhhh--hhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCCCccchhhhhccccccc---------eeeecCcC
Confidence 122222 223456999999985 6789999999999999998875532 2355643332 23555554
Q ss_pred CCCCCCCcccccCCCCeeeeeceEEEcC-CCCCCccccccchhccCcc-cCCCc----ceeeeeeccCCCccccccCCcc
Q 003310 444 GLQMPNQQSLCASGPPVTLSVVSRIRNG-NNGWRGTVSGAAAAATGRV-SSLSG----AIASSFHNCKGNSETYAAGSSL 517 (832)
Q Consensus 444 ~~~~~~~~~l~~~~~p~~ls~v~~I~~~-~~~~~~~v~~~~~~a~g~~-~~~~g----~~~~~~h~~~~~~~~~~~~~~~ 517 (832)
... +.+--+..+++|+= +.|=+++| |+ -..++ ..++.-|.--.|...-.+....
T Consensus 162 ~~g-----------~~~~~~~~~~~krlvSgGcDn~V---------kiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~gl 221 (299)
T KOG1332|consen 162 APG-----------SLVDQGPAAKVKRLVSGGCDNLV---------KIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVGL 221 (299)
T ss_pred CCc-----------cccccCcccccceeeccCCccce---------eeeecCCcchhhhhhhhhcchhhhhhhhccccCC
Confidence 332 11111122222221 11223333 11 11111 2235556554554222223222
Q ss_pred cccccEEEEcCCCcEEEEeee
Q 003310 518 KIKNHLLVFSPSGCMIQYALR 538 (832)
Q Consensus 518 ~~~~~Llv~s~~G~l~~y~l~ 538 (832)
.+..+--.+-||++|.|--+
T Consensus 222 -~~s~iAS~SqDg~viIwt~~ 241 (299)
T KOG1332|consen 222 -PKSTIASCSQDGTVIIWTKD 241 (299)
T ss_pred -CceeeEEecCCCcEEEEEec
Confidence 55667788899999999876
No 150
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.04 E-value=6.8e-08 Score=110.77 Aligned_cols=78 Identities=17% Similarity=0.150 Sum_probs=54.9
Q ss_pred EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcE
Q 003310 325 SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTS 404 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTV 404 (832)
.+++|||||++|+.++.++ +.+||+.++ .... +..+. ....++|||||++|+.++.++.+
T Consensus 336 ~~~~~SpDG~~ia~~~~~~--i~~~Dl~~g------------~~~~---lt~~~---~~~~~~~sPdG~~i~~~s~~g~~ 395 (429)
T PRK01742 336 YSAQISADGKTLVMINGDN--VVKQDLTSG------------STEV---LSSTF---LDESPSISPNGIMIIYSSTQGLG 395 (429)
T ss_pred CCccCCCCCCEEEEEcCCC--EEEEECCCC------------CeEE---ecCCC---CCCCceECCCCCEEEEEEcCCCc
Confidence 4578999999999888764 556998775 1111 21111 24568899999999999999999
Q ss_pred EEEecCC--CCCceeeccCC
Q 003310 405 HLFAINP--LGGSVNFQPTD 422 (832)
Q Consensus 405 hIwdl~~--~g~~~~~~~H~ 422 (832)
.+|.+.. ++....+.+|.
T Consensus 396 ~~l~~~~~~G~~~~~l~~~~ 415 (429)
T PRK01742 396 KVLQLVSADGRFKARLPGSD 415 (429)
T ss_pred eEEEEEECCCCceEEccCCC
Confidence 9988743 33445666664
No 151
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.04 E-value=7.1e-09 Score=115.05 Aligned_cols=120 Identities=17% Similarity=0.193 Sum_probs=86.4
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCcc----CCCCcee
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSAC----DAGTSYV 369 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~----~~~~~~~ 369 (832)
+.+++.|.++++||+..+.++.++-- ..+|.|++.+|-++.+..|..+|. |-+.++.... +. +.+ +......
T Consensus 191 l~TaS~D~t~k~wdlS~g~LLlti~f-p~si~av~lDpae~~~yiGt~~G~-I~~~~~~~~~-~~-~~~v~~k~~~~~~t 266 (476)
T KOG0646|consen 191 LYTASEDRTIKLWDLSLGVLLLTITF-PSSIKAVALDPAERVVYIGTEEGK-IFQNLLFKLS-GQ-SAGVNQKGRHEENT 266 (476)
T ss_pred EEEecCCceEEEEEeccceeeEEEec-CCcceeEEEcccccEEEecCCcce-EEeeehhcCC-cc-cccccccccccccc
Confidence 45667889999999999998887753 358999999999999999999997 6677665431 10 100 0000112
Q ss_pred EEEEEeccCcc-ccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 370 HLYRLQRGLTN-AVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 370 ~l~~l~rG~t~-a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
.+..+ -|+.. ..|.|++.|-||..|++|+.||+|.|||+.......++
T Consensus 267 ~~~~~-~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~iRtl 315 (476)
T KOG0646|consen 267 QINVL-VGHENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQCIRTL 315 (476)
T ss_pred eeeee-ccccCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHHHHHH
Confidence 22333 34443 35999999999999999999999999999765444333
No 152
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.03 E-value=3.3e-09 Score=114.06 Aligned_cols=105 Identities=22% Similarity=0.312 Sum_probs=78.5
Q ss_pred CCeEEEEECCCCcE-EEEe-ccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 300 VGMVIVRDIVSKNV-IAQF-RAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 300 ~G~V~IwDl~s~~~-l~~~-~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
+-.|.+||++..+. +..+ ..|...|++|+|.|+ -.+|+|||.||- |+|||+..... .-..+..+..
T Consensus 142 ~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrFHP~~pnlLlSGSvDGL-vnlfD~~~d~E----------eDaL~~viN~ 210 (376)
T KOG1188|consen 142 DASVVLWDVRSEQQLLRQLNESHNDDVTQLRFHPSDPNLLLSGSVDGL-VNLFDTKKDNE----------EDALLHVINH 210 (376)
T ss_pred ceEEEEEEeccccchhhhhhhhccCcceeEEecCCCCCeEEeecccce-EEeeecCCCcc----------hhhHHHhhcc
Confidence 44799999997654 5554 489999999999995 669999999996 99999976410 0012222322
Q ss_pred cCccccEEEEEEccCC-CEEEEEeCCCcEEEEecCCCCCceee
Q 003310 377 GLTNAVIQDISFSDDS-NWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg-~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
| +.|-.|.|..++ +.|.+-+..+|..+|+++...+...+
T Consensus 211 ~---sSI~~igw~~~~ykrI~clTH~Etf~~~ele~~~~~~~~ 250 (376)
T KOG1188|consen 211 G---SSIHLIGWLSKKYKRIMCLTHMETFAIYELEDGSEETWL 250 (376)
T ss_pred c---ceeeeeeeecCCcceEEEEEccCceeEEEccCCChhhcc
Confidence 2 348999999887 45888899999999999988755554
No 153
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.03 E-value=3.8e-08 Score=113.29 Aligned_cols=231 Identities=14% Similarity=0.172 Sum_probs=153.9
Q ss_pred CCcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcc
Q 003310 17 TRRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLA 96 (832)
Q Consensus 17 ~~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~ 96 (832)
..+++.+|.++.+.-||+.+ .+.......--|++-.+++-|.. -.++|.+++
T Consensus 80 ~~RLFS~g~sg~i~EwDl~~-lk~~~~~d~~gg~IWsiai~p~~--------------~~l~Igcdd------------- 131 (691)
T KOG2048|consen 80 GGRLFSSGLSGSITEWDLHT-LKQKYNIDSNGGAIWSIAINPEN--------------TILAIGCDD------------- 131 (691)
T ss_pred CCeEEeecCCceEEEEeccc-CceeEEecCCCcceeEEEeCCcc--------------ceEEeecCC-------------
Confidence 36777888888888888876 33444444445666666665421 134444331
Q ss_pred cccCCCCCCCCCCCCCCCCCCEEEEEECCCCcEEEEEeC---CCCEEEEEEcC--CEEEE-EeCCEEEEEECCCCceEEE
Q 003310 97 TACNGTSANYHDLGNGSSVPTVVHFYSLRSQSYVHMLKF---RSPIYSVRCSS--RVVAI-CQAAQVHCFDAATLEIEYA 170 (832)
Q Consensus 97 ~~~~g~~~~~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f---~s~V~sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~t 170 (832)
+.+.+.+...++......| .+.|.+|.|++ ..||. +.|+.|+|||+.++..++.
T Consensus 132 --------------------Gvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~ 191 (691)
T KOG2048|consen 132 --------------------GVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTLHI 191 (691)
T ss_pred --------------------ceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCceEEEEEcCCCceEEE
Confidence 4566777777776666555 57899999985 33454 7788899999999887662
Q ss_pred EecCCCccCCCCCCCCCcccceeeecc--ceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeeccccccee
Q 003310 171 ILTNPIVMGHPSAGGIGIGYGPLAVGP--RWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLA 248 (832)
Q Consensus 171 l~t~~~~~~~p~~~~~~~~~~p~Alg~--r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~la 248 (832)
+ +-. + +-++- ..|+++
T Consensus 192 ~-~~~----~------------d~l~k~~~~iVWS--------------------------------------------- 209 (691)
T KOG2048|consen 192 I-TMQ----L------------DRLSKREPTIVWS--------------------------------------------- 209 (691)
T ss_pred e-eec----c------------cccccCCceEEEE---------------------------------------------
Confidence 2 210 0 00000 001111
Q ss_pred ceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEE
Q 003310 249 AGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALC 328 (832)
Q Consensus 249 sGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~La 328 (832)
+..| . .+++++++..|+|++||...+..+..++.|...|.||+
T Consensus 210 --v~~L----------------r-------------------d~tI~sgDS~G~V~FWd~~~gTLiqS~~~h~adVl~La 252 (691)
T KOG2048|consen 210 --VLFL----------------R-------------------DSTIASGDSAGTVTFWDSIFGTLIQSHSCHDADVLALA 252 (691)
T ss_pred --EEEe----------------e-------------------cCcEEEecCCceEEEEcccCcchhhhhhhhhcceeEEE
Confidence 0000 0 02456678889999999999999999999999999999
Q ss_pred EcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 329 FDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 329 FSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
-++++.++.+|+.|+++|++...... . .-+...+|..+.+.|.++|..++ .|.+|+.|.++.|=.
T Consensus 253 v~~~~d~vfsaGvd~~ii~~~~~~~~------------~-~wv~~~~r~~h~hdvrs~av~~~--~l~sgG~d~~l~i~~ 317 (691)
T KOG2048|consen 253 VADNEDRVFSAGVDPKIIQYSLTTNK------------S-EWVINSRRDLHAHDVRSMAVIEN--ALISGGRDFTLAICS 317 (691)
T ss_pred EcCCCCeEEEccCCCceEEEEecCCc------------c-ceeeeccccCCcccceeeeeecc--eEEecceeeEEEEcc
Confidence 99999999999999996655443322 0 11223345555677999999988 888999999987644
Q ss_pred c
Q 003310 409 I 409 (832)
Q Consensus 409 l 409 (832)
.
T Consensus 318 s 318 (691)
T KOG2048|consen 318 S 318 (691)
T ss_pred c
Confidence 4
No 154
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.03 E-value=2.2e-07 Score=102.23 Aligned_cols=112 Identities=8% Similarity=0.134 Sum_probs=72.2
Q ss_pred CCCeEEEEECCCCcEEE-------EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 299 NVGMVIVRDIVSKNVIA-------QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~-------~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
.++.|.|||+.+...+. .+... .....++|+|||++|++++.....|.+||+... . +....+
T Consensus 146 ~~~~v~v~d~~~~g~l~~~~~~~~~~~~g-~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~-~---------~~~~~~ 214 (330)
T PRK11028 146 KEDRIRLFTLSDDGHLVAQEPAEVTTVEG-AGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDP-H---------GEIECV 214 (330)
T ss_pred CCCEEEEEEECCCCcccccCCCceecCCC-CCCceEEECCCCCEEEEEecCCCEEEEEEEeCC-C---------CCEEEE
Confidence 45789999997643221 12222 234679999999999988874455999999742 0 012223
Q ss_pred EEEecc---C-ccccEEEEEEccCCCEEEEEeC-CCcEEEEecCCCCCceeeccC
Q 003310 372 YRLQRG---L-TNAVIQDISFSDDSNWIMISSS-RGTSHLFAINPLGGSVNFQPT 421 (832)
Q Consensus 372 ~~l~rG---~-t~a~I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~~g~~~~~~~H 421 (832)
.++... . .......|+|+||+++|+++.. +++|.+|++...++...+..|
T Consensus 215 ~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~ 269 (330)
T PRK11028 215 QTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGH 269 (330)
T ss_pred EEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEE
Confidence 333210 0 0111346999999999999854 789999999877665555444
No 155
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.00 E-value=6e-10 Score=129.22 Aligned_cols=112 Identities=13% Similarity=0.237 Sum_probs=89.5
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
++++++||+|++||++..+-..+|.+....|..++|+| .+.++|++.+.|. +.+||++.. ..+
T Consensus 149 liSGSQDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~-lqlWDlRqp---------------~r~ 212 (839)
T KOG0269|consen 149 LISGSQDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGY-LQLWDLRQP---------------DRC 212 (839)
T ss_pred EEecCCCceEEEEeeecccccccccccchhhhceeeccCCCceEEEecCCce-EEEeeccCc---------------hhH
Confidence 45788999999999999999999999889999999999 5888999998887 899999864 111
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccC
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPT 421 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H 421 (832)
.++-.-+...|.++.|+|+..|||+|+.|++|+||+............|
T Consensus 213 ~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~~~~tIn 261 (839)
T KOG0269|consen 213 EKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSRAKPKHTIN 261 (839)
T ss_pred HHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCCccceeEEe
Confidence 1111222346999999999999999999999999999764443333333
No 156
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.00 E-value=3.7e-09 Score=119.87 Aligned_cols=95 Identities=17% Similarity=0.223 Sum_probs=76.3
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
...|+|||+....++..+..-...|+.|+.+|.|.-|+.++.|++ +.+||+.-. ..-|+.-|-+.
T Consensus 586 q~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~k-~~WfDldls--------------skPyk~lr~H~ 650 (733)
T KOG0650|consen 586 QRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDKK-MCWFDLDLS--------------SKPYKTLRLHE 650 (733)
T ss_pred ccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCe-eEEEEcccC--------------cchhHHhhhhh
Confidence 457999999998888888777778999999999999999999987 778898754 11222223333
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
..|.+|+|.+-=.++|+||.|||++||--.
T Consensus 651 -~avr~Va~H~ryPLfas~sdDgtv~Vfhg~ 680 (733)
T KOG0650|consen 651 -KAVRSVAFHKRYPLFASGSDDGTVIVFHGM 680 (733)
T ss_pred -hhhhhhhhccccceeeeecCCCcEEEEeee
Confidence 349999999998999999999999998643
No 157
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=98.98 E-value=7.6e-07 Score=94.55 Aligned_cols=184 Identities=18% Similarity=0.298 Sum_probs=129.6
Q ss_pred CCEEEEEECCC-CcEEEEEeC-CCCEEEEEEcC--CEEEEEe--CCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcc
Q 003310 116 PTVVHFYSLRS-QSYVHMLKF-RSPIYSVRCSS--RVVAICQ--AAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIG 189 (832)
Q Consensus 116 ~~tVrlWDL~T-g~~V~tL~f-~s~V~sV~~S~--r~LAVa~--~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~ 189 (832)
++.+++||+.+ ...+..+.. ...|..+.+++ +.++++. ++.+++||+.+...+.++..|...
T Consensus 133 d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 200 (466)
T COG2319 133 DGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDP------------ 200 (466)
T ss_pred CccEEEEEecCCCeEEEEEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCCCc------------
Confidence 47899999998 677776665 57888899964 4566543 889999999987766666553110
Q ss_pred cceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccc
Q 003310 190 YGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEF 269 (832)
Q Consensus 190 ~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~ 269 (832)
+ ..++|.. .+..
T Consensus 201 ---v----~~~~~~~---------------------------~~~~---------------------------------- 212 (466)
T COG2319 201 ---V----SSLAFSP---------------------------DGGL---------------------------------- 212 (466)
T ss_pred ---e----EEEEEcC---------------------------Ccce----------------------------------
Confidence 0 1122220 0000
Q ss_pred cCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEE-EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEE
Q 003310 270 LPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIA-QFRAHKSPISALCFDPSGILLVTASVQGHNINI 348 (832)
Q Consensus 270 ~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~-~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~I 348 (832)
.+++...++.|++||...+..+. .+..|...+ ...|+|++.++++++.|+. +++
T Consensus 213 -----------------------~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~~-~~~ 267 (466)
T COG2319 213 -----------------------LIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDGT-IRL 267 (466)
T ss_pred -----------------------EEEEecCCCcEEEEECCCCcEEeeecCCCCcce-eEeECCCCCEEEEecCCCc-EEE
Confidence 01112467789999999888888 689998875 4489999999999999987 899
Q ss_pred EeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 349 FKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 349 wdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
|++.... ..+..+ .++ ...|.+++|+|++..+++++.|+++++|++..........
T Consensus 268 ~~~~~~~-------------~~~~~~-~~~-~~~v~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 323 (466)
T COG2319 268 WDLRSSS-------------SLLRTL-SGH-SSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLT 323 (466)
T ss_pred eeecCCC-------------cEEEEE-ecC-CccEEEEEECCCCCEEEEeeCCCcEEEEEcCCCceEEEee
Confidence 9998650 122333 333 3459999999999999999999999999887765444443
No 158
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.97 E-value=5.8e-09 Score=108.96 Aligned_cols=70 Identities=21% Similarity=0.317 Sum_probs=57.9
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCC
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
.|..+..-||++.||||+.||+ ||||..++. ..|..|. ++.+.|+++|||||...+|++|.|+
T Consensus 253 Gv~gvrIRpD~KIlATAGWD~R-iRVyswrtl--------------~pLAVLk--yHsagvn~vAfspd~~lmAaaskD~ 315 (323)
T KOG0322|consen 253 GVSGVRIRPDGKILATAGWDHR-IRVYSWRTL--------------NPLAVLK--YHSAGVNAVAFSPDCELMAAASKDA 315 (323)
T ss_pred CccceEEccCCcEEeecccCCc-EEEEEeccC--------------Cchhhhh--hhhcceeEEEeCCCCchhhhccCCc
Confidence 4677888899999999999998 999998876 2222332 1235699999999999999999999
Q ss_pred cEEEEec
Q 003310 403 TSHLFAI 409 (832)
Q Consensus 403 TVhIwdl 409 (832)
+|-+|++
T Consensus 316 rISLWkL 322 (323)
T KOG0322|consen 316 RISLWKL 322 (323)
T ss_pred eEEeeec
Confidence 9999987
No 159
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.97 E-value=2.6e-08 Score=105.22 Aligned_cols=117 Identities=16% Similarity=0.228 Sum_probs=75.8
Q ss_pred ccccCCCCeEEEEECCC-CcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCC-------CCCCccCC
Q 003310 294 FPDADNVGMVIVRDIVS-KNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGIL-------GTSSACDA 364 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s-~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~-------~~~s~~~~ 364 (832)
++++++||.|+|||.+. +.++.++.+|++.|.++.|+|. .++|+|++.|.. +.+|-...-.. ...+....
T Consensus 230 lvt~gDdgyvriWD~R~tk~pv~el~~HsHWvW~VRfn~~hdqLiLs~~SDs~-V~Lsca~svSSE~qi~~~~dese~e~ 308 (370)
T KOG1007|consen 230 LVTCGDDGYVRIWDTRKTKFPVQELPGHSHWVWAVRFNPEHDQLILSGGSDSA-VNLSCASSVSSEQQIEFEDDESESED 308 (370)
T ss_pred EEEcCCCccEEEEeccCCCccccccCCCceEEEEEEecCccceEEEecCCCce-eEEEeccccccccccccccccccCcc
Confidence 35567899999999985 5689999999999999999996 668899999976 77886543210 00000000
Q ss_pred CCceeEEEEEecc----Cc--cccEEEEEEccCCCE-EEEEeCCCcEEEEecCC
Q 003310 365 GTSYVHLYRLQRG----LT--NAVIQDISFSDDSNW-IMISSSRGTSHLFAINP 411 (832)
Q Consensus 365 ~~~~~~l~~l~rG----~t--~a~I~~IaFSpDg~~-LAsgS~DgTVhIwdl~~ 411 (832)
....++..-|.-| ++ --.|++++||.-.-| +|+-|-||.+.|=.+.+
T Consensus 309 ~dseer~kpL~dg~l~tydehEDSVY~~aWSsadPWiFASLSYDGRviIs~V~r 362 (370)
T KOG1007|consen 309 EDSEERVKPLQDGQLETYDEHEDSVYALAWSSADPWIFASLSYDGRVIISSVPR 362 (370)
T ss_pred hhhHHhcccccccccccccccccceEEEeeccCCCeeEEEeccCceEEeecCCh
Confidence 0001110011111 11 123999999875555 67788899998866644
No 160
>KOG4328 consensus WD40 protein [Function unknown]
Probab=98.97 E-value=1.5e-08 Score=112.17 Aligned_cols=108 Identities=19% Similarity=0.175 Sum_probs=73.6
Q ss_pred cccCCCCeEEEEECCCCc----EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 295 PDADNVGMVIVRDIVSKN----VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~----~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
++++.|++++|||++... ++-..-.|+.+|.+..|||+|-.|+|.+.|.+ |+|||..-- + ..+-+.....|
T Consensus 339 aT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~~D~~-IRv~dss~~--s--a~~~p~~~I~H 413 (498)
T KOG4328|consen 339 ATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTCQDNE-IRVFDSSCI--S--AKDEPLGTIPH 413 (498)
T ss_pred eecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEeeccCCc-eEEeecccc--c--ccCCccceeec
Confidence 456678899999997532 33444589999999999998888999999976 999998411 0 00000011122
Q ss_pred EEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 371 LYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
-...-|-. .+...+|.||-.+|++|-.-..|-||+-+
T Consensus 414 n~~t~Rwl---T~fKA~W~P~~~li~vg~~~r~IDv~~~~ 450 (498)
T KOG4328|consen 414 NNRTGRWL---TPFKAAWDPDYNLIVVGRYPRPIDVFDGN 450 (498)
T ss_pred cCcccccc---cchhheeCCCccEEEEeccCcceeEEcCC
Confidence 11111111 15567899999999999998899998764
No 161
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=98.93 E-value=3.5e-09 Score=115.79 Aligned_cols=129 Identities=16% Similarity=0.245 Sum_probs=96.4
Q ss_pred ccccCCCCeEEEEECCCC---------cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC-CCCCccC
Q 003310 294 FPDADNVGMVIVRDIVSK---------NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL-GTSSACD 363 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~---------~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~-~~~s~~~ 363 (832)
+++++.|..|+||-+... +.+..|..|+..|+++.|+|+|.+||||+++|. |.+|....-.. .......
T Consensus 29 laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g~-v~lWk~~~~~~~~~d~e~~ 107 (434)
T KOG1009|consen 29 LATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGGE-VFLWKQGDVRIFDADTEAD 107 (434)
T ss_pred eecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCce-EEEEEecCcCCccccchhh
Confidence 566778889999988642 245677899999999999999999999999987 88997652100 0000111
Q ss_pred CCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 364 AGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 364 ~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
.....+.+.+.-||+. ..|++++|+||+.++++++.|.++++||++.+.-...+..|..-
T Consensus 108 ~~ke~w~v~k~lr~h~-~diydL~Ws~d~~~l~s~s~dns~~l~Dv~~G~l~~~~~dh~~y 167 (434)
T KOG1009|consen 108 LNKEKWVVKKVLRGHR-DDIYDLAWSPDSNFLVSGSVDNSVRLWDVHAGQLLAILDDHEHY 167 (434)
T ss_pred hCccceEEEEEecccc-cchhhhhccCCCceeeeeeccceEEEEEeccceeEeeccccccc
Confidence 1223355556567753 56999999999999999999999999999987666677777443
No 162
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.92 E-value=8.5e-07 Score=101.90 Aligned_cols=101 Identities=18% Similarity=0.163 Sum_probs=63.4
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC--CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQG--HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG--t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.++|+.+++. ..+..+...+.+.+|||||++|+.++.++ ..|.+||+.++ ... .+..+.
T Consensus 312 ~Iy~~d~~~g~~-~~lt~~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g------------~~~---~Lt~~~- 374 (429)
T PRK03629 312 QVYKVNINGGAP-QRITWEGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATG------------GVQ---VLTDTF- 374 (429)
T ss_pred eEEEEECCCCCe-EEeecCCCCccCEEECCCCCEEEEEEccCCCceEEEEECCCC------------CeE---EeCCCC-
Confidence 355557766543 33333444566789999999998876543 24677888765 112 232221
Q ss_pred cccEEEEEEccCCCEEEEEeCCCc---EEEEecCCCCCceeeccCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGT---SHLFAINPLGGSVNFQPTD 422 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgT---VhIwdl~~~g~~~~~~~H~ 422 (832)
.....+|||||++|+.++.++. +.++++ .++....+.+|.
T Consensus 375 --~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~-~G~~~~~l~~~~ 417 (429)
T PRK03629 375 --LDETPSIAPNGTMVIYSSSQGMGSVLNLVST-DGRFKARLPATD 417 (429)
T ss_pred --CCCCceECCCCCEEEEEEcCCCceEEEEEEC-CCCCeEECccCC
Confidence 2346789999999999998876 455555 334455566553
No 163
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.92 E-value=9.1e-08 Score=101.94 Aligned_cols=97 Identities=15% Similarity=0.266 Sum_probs=78.3
Q ss_pred cCCCCeEEEEECCC-CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVS-KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s-~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+..|-+.++||+.. -..+..|++|+..|+++.|..|.+ ++++|+|.+ |+|||++.- ...|.+++
T Consensus 332 sSrDtTFRLWDFReaI~sV~VFQGHtdtVTS~vF~~dd~-vVSgSDDrT-vKvWdLrNM-------------RsplATIR 396 (481)
T KOG0300|consen 332 SSRDTTFRLWDFREAIQSVAVFQGHTDTVTSVVFNTDDR-VVSGSDDRT-VKVWDLRNM-------------RSPLATIR 396 (481)
T ss_pred eccCceeEeccchhhcceeeeecccccceeEEEEecCCc-eeecCCCce-EEEeeeccc-------------cCcceeee
Confidence 44567899999974 356889999999999999988766 789999976 999999864 12344553
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+...++.|+.|.-+..||.--+.+-|+|||++-
T Consensus 397 ---tdS~~NRvavs~g~~iIAiPhDNRqvRlfDlnG 429 (481)
T KOG0300|consen 397 ---TDSPANRVAVSKGHPIIAIPHDNRQVRLFDLNG 429 (481)
T ss_pred ---cCCccceeEeecCCceEEeccCCceEEEEecCC
Confidence 234588999999999999999999999999974
No 164
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=98.92 E-value=4.5e-07 Score=101.66 Aligned_cols=92 Identities=18% Similarity=0.345 Sum_probs=72.2
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE--EEEec
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL--YRLQR 376 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l--~~l~r 376 (832)
..|.-.|.|.++...+ +++-...++++++|+|||.+||.||.|+. |.||.+... .+.. +...+
T Consensus 426 ~~G~w~V~d~e~~~lv-~~~~d~~~ls~v~ysp~G~~lAvgs~d~~-iyiy~Vs~~-------------g~~y~r~~k~~ 490 (626)
T KOG2106|consen 426 ATGRWFVLDTETQDLV-TIHTDNEQLSVVRYSPDGAFLAVGSHDNH-IYIYRVSAN-------------GRKYSRVGKCS 490 (626)
T ss_pred ccceEEEEecccceeE-EEEecCCceEEEEEcCCCCEEEEecCCCe-EEEEEECCC-------------CcEEEEeeeec
Confidence 3466777888885544 44433889999999999999999999987 899998764 1222 22233
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
| +.|..+.||+|+++|.+-|.|-.|-.|.
T Consensus 491 g---s~ithLDwS~Ds~~~~~~S~d~eiLyW~ 519 (626)
T KOG2106|consen 491 G---SPITHLDWSSDSQFLVSNSGDYEILYWK 519 (626)
T ss_pred C---ceeEEeeecCCCceEEeccCceEEEEEc
Confidence 4 6799999999999999999999999993
No 165
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.91 E-value=4.5e-09 Score=119.40 Aligned_cols=101 Identities=22% Similarity=0.289 Sum_probs=82.8
Q ss_pred cccCCCCeEEEEECCCC-------cEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCC
Q 003310 295 PDADNVGMVIVRDIVSK-------NVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGT 366 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~-------~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~ 366 (832)
+-+.++|.|+||.+..+ .+-..+.+|...|++|.|.|= ...||+++.|-+ |+|||+.+.
T Consensus 644 AVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~T-i~lWDl~~~------------ 710 (1012)
T KOG1445|consen 644 AVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDST-IELWDLANA------------ 710 (1012)
T ss_pred eecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccce-eeeeehhhh------------
Confidence 34578999999999754 456778899999999999994 668999999987 999999886
Q ss_pred ceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 367 SYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 367 ~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
....+| -||+. .|.++||||||+.+|+.+.||+++||.-...
T Consensus 711 --~~~~~l-~gHtd-qIf~~AWSpdGr~~AtVcKDg~~rVy~Prs~ 752 (1012)
T KOG1445|consen 711 --KLYSRL-VGHTD-QIFGIAWSPDGRRIATVCKDGTLRVYEPRSR 752 (1012)
T ss_pred --hhhhee-ccCcC-ceeEEEECCCCcceeeeecCceEEEeCCCCC
Confidence 111234 46653 5999999999999999999999999987654
No 166
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.90 E-value=1.3e-07 Score=110.81 Aligned_cols=102 Identities=10% Similarity=0.257 Sum_probs=80.2
Q ss_pred ccccCCCCeEEEEECC-CCcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 294 FPDADNVGMVIVRDIV-SKNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~-s~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
|.+.+ |-+|+||.-. ...++..+..+...|++++|||- -..+|+++.||+ |.|||+.....+. +
T Consensus 414 fls~g-DW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~-l~iWDLl~~~~~P------------v 479 (555)
T KOG1587|consen 414 FLSVG-DWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGN-LDIWDLLQDDEEP------------V 479 (555)
T ss_pred eeeec-cceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCc-eehhhhhccccCC------------c
Confidence 33443 8899999988 77888899999999999999996 468899999998 8999997652221 1
Q ss_pred EEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.+...+ ......+.|+++|+.||+|...|++|+|++..
T Consensus 480 ~s~~~~--~~~l~~~~~s~~g~~lavGd~~G~~~~~~l~~ 517 (555)
T KOG1587|consen 480 LSQKVC--SPALTRVRWSPNGKLLAVGDANGTTHILKLSE 517 (555)
T ss_pred cccccc--ccccceeecCCCCcEEEEecCCCcEEEEEcCc
Confidence 122222 23367788999999999999999999999964
No 167
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=98.89 E-value=3.3e-08 Score=113.10 Aligned_cols=102 Identities=11% Similarity=0.093 Sum_probs=72.0
Q ss_pred CCCCeEEEEECCCC--cEEEEeccCCCCe--EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 298 DNVGMVIVRDIVSK--NVIAQFRAHKSPI--SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 298 ~~~G~V~IwDl~s~--~~l~~~~aH~~pI--s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
..|+.|..|++.+. .+++.|.+|...- ..-..||||.+|++|+.|++ ..||.+..... --.
T Consensus 290 CtD~sIy~ynm~s~s~sP~~~~sg~~~~sf~vks~lSpd~~~l~SgSsd~~-ayiw~vs~~e~--------------~~~ 354 (720)
T KOG0321|consen 290 CTDNSIYFYNMRSLSISPVAEFSGKLNSSFYVKSELSPDDCSLLSGSSDEQ-AYIWVVSSPEA--------------PPA 354 (720)
T ss_pred ecCCcEEEEeccccCcCchhhccCcccceeeeeeecCCCCceEeccCCCcc-eeeeeecCccC--------------Chh
Confidence 34889999999863 4666776664221 22357999999999999988 78999876411 011
Q ss_pred EeccCccccEEEEEEcc--CCCEEEEEeCCCcEEEEecCCCCCce
Q 003310 374 LQRGLTNAVIQDISFSD--DSNWIMISSSRGTSHLFAINPLGGSV 416 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSp--Dg~~LAsgS~DgTVhIwdl~~~g~~~ 416 (832)
+..|++. .|.+++|.| ++. +|++|+|-+++||++..+.+..
T Consensus 355 ~l~Ght~-eVt~V~w~pS~~t~-v~TcSdD~~~kiW~l~~~l~e~ 397 (720)
T KOG0321|consen 355 LLLGHTR-EVTTVRWLPSATTP-VATCSDDFRVKIWRLSNGLEEI 397 (720)
T ss_pred hhhCcce-EEEEEeeccccCCC-ceeeccCcceEEEeccCchhhc
Confidence 2235543 488999976 444 5677999999999997765543
No 168
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=98.86 E-value=6.4e-09 Score=110.77 Aligned_cols=127 Identities=19% Similarity=0.296 Sum_probs=88.3
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCC-----CCc--c--CC
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGT-----SSA--C--DA 364 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~-----~s~--~--~~ 364 (832)
++.+..+|.|.|||+.+..+-..|.+|..||++||||+||++|+|+|.|-. |.+||+..+.+.. +.. + .+
T Consensus 38 lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~dgr~LltsS~D~s-i~lwDl~~gs~l~rirf~spv~~~q~hp 116 (405)
T KOG1273|consen 38 LAVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCWSRDGRKLLTSSRDWS-IKLWDLLKGSPLKRIRFDSPVWGAQWHP 116 (405)
T ss_pred eeeeccCCcEEEEEccccchhhhhhccccceeEEEecCCCCEeeeecCCce-eEEEeccCCCceeEEEccCccceeeecc
Confidence 456788999999999999988899999999999999999999999999965 9999998872200 000 0 00
Q ss_pred CCceeEE----------EEEeccC-----------ccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccC
Q 003310 365 GTSYVHL----------YRLQRGL-----------TNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPT 421 (832)
Q Consensus 365 ~~~~~~l----------~~l~rG~-----------t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H 421 (832)
......+ ..+.-+. -+..-.+..|.+.|+++.+|++.|.++|++.++......++..
T Consensus 117 ~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGtsKGkllv~~a~t~e~vas~rit 194 (405)
T KOG1273|consen 117 RKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTSKGKLLVYDAETLECVASFRIT 194 (405)
T ss_pred ccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCCCEEEEecCcceEEEEecchheeeeeeeec
Confidence 0000000 0110000 0000012347788999999999999999999998877777643
No 169
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=98.86 E-value=2.1e-07 Score=106.05 Aligned_cols=299 Identities=17% Similarity=0.185 Sum_probs=171.7
Q ss_pred CCCcEEEEEecCC-eEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCC
Q 003310 16 ATRRVLLLGYRSG-FQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDG 94 (832)
Q Consensus 16 ~~~~vLl~Gy~~G-~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg 94 (832)
+...-|+.|.+.| ++||-|.+ +-+...+. -|+.|+++++-|.+ ..++||++.+.... .+.+.
T Consensus 410 p~G~wlasGsdDGtvriWEi~T-gRcvr~~~-~d~~I~~vaw~P~~------------~~~vLAvA~~~~~~---ivnp~ 472 (733)
T KOG0650|consen 410 PSGEWLASGSDDGTVRIWEIAT-GRCVRTVQ-FDSEIRSVAWNPLS------------DLCVLAVAVGECVL---IVNPI 472 (733)
T ss_pred CCcceeeecCCCCcEEEEEeec-ceEEEEEe-ecceeEEEEecCCC------------CceeEEEEecCceE---EeCcc
Confidence 3466677777776 99999998 33433443 48899999998864 34678777754310 11111
Q ss_pred cccc-cCCC-CCCCCCCCCCCCCCCEEEEEECCC---Cc--EEEEEeCCCCEEEEEEc--CCEEEEEeC----CEEEEEE
Q 003310 95 LATA-CNGT-SANYHDLGNGSSVPTVVHFYSLRS---QS--YVHMLKFRSPIYSVRCS--SRVVAICQA----AQVHCFD 161 (832)
Q Consensus 95 ~~~~-~~g~-~~~~h~~g~~~~~~~tVrlWDL~T---g~--~V~tL~f~s~V~sV~~S--~r~LAVa~~----~~I~vwD 161 (832)
.+.. -.+. .-..+...+....+..|-.|.-.. ++ .-.+|++..+|..|.+. +++||+.+. ..|.|++
T Consensus 473 ~G~~~e~~~t~ell~~~~~~~~p~~~~~~W~~~~~~e~~~~v~~~I~~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQ 552 (733)
T KOG0650|consen 473 FGDRLEVGPTKELLASAPNESEPDAAVVTWSRASLDELEKGVCIVIKHPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQ 552 (733)
T ss_pred ccchhhhcchhhhhhcCCCccCCcccceeechhhhhhhccceEEEEecCCccceeeeecCCceEEEeccCCCcceEEEEe
Confidence 1100 0000 000111123344557888996542 22 22457888999999996 578887543 6899999
Q ss_pred CCCCceE--E-EEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceee
Q 003310 162 AATLEIE--Y-AILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAH 238 (832)
Q Consensus 162 l~t~~~~--~-tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~ 238 (832)
+...... + ...+.+ ++ ..|-+..+..- | +.-..|..
T Consensus 553 LSK~~sQ~PF~kskG~v-----------------q~-----v~FHPs~p~lf----V---------------aTq~~vRi 591 (733)
T KOG0650|consen 553 LSKRKSQSPFRKSKGLV-----------------QR-----VKFHPSKPYLF----V---------------ATQRSVRI 591 (733)
T ss_pred cccccccCchhhcCCce-----------------eE-----EEecCCCceEE----E---------------EeccceEE
Confidence 8754321 1 111111 11 11111100000 0 00011211
Q ss_pred eec---ccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCC-CcEE
Q 003310 239 YAK---ESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVS-KNVI 314 (832)
Q Consensus 239 ~A~---~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s-~~~l 314 (832)
|-. +.-|.|..|..++..+. ..|. | ..++-++.++.+..+|+.- .++.
T Consensus 592 YdL~kqelvKkL~tg~kwiS~ms----------ihp~---------------G---Dnli~gs~d~k~~WfDldlsskPy 643 (733)
T KOG0650|consen 592 YDLSKQELVKKLLTGSKWISSMS----------IHPN---------------G---DNLILGSYDKKMCWFDLDLSSKPY 643 (733)
T ss_pred EehhHHHHHHHHhcCCeeeeeee----------ecCC---------------C---CeEEEecCCCeeEEEEcccCcchh
Confidence 211 23344444444332221 0111 1 1133456788999999974 4688
Q ss_pred EEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc---cEEEEEEccC
Q 003310 315 AQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA---VIQDISFSDD 391 (832)
Q Consensus 315 ~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a---~I~~IaFSpD 391 (832)
.+++-|...|.+++|.+.=-++|+||.||+ +.||--+-.. .. -.+.-...+..| ||+... -|.++.|.|.
T Consensus 644 k~lr~H~~avr~Va~H~ryPLfas~sdDgt-v~Vfhg~VY~---Dl--~qnpliVPlK~L-~gH~~~~~~gVLd~~wHP~ 716 (733)
T KOG0650|consen 644 KTLRLHEKAVRSVAFHKRYPLFASGSDDGT-VIVFHGMVYN---DL--LQNPLIVPLKRL-RGHEKTNDLGVLDTIWHPR 716 (733)
T ss_pred HHhhhhhhhhhhhhhccccceeeeecCCCc-EEEEeeeeeh---hh--hcCCceEeeeec-cCceeecccceEeecccCC
Confidence 899999999999999999999999999998 6677432210 00 001123455555 455432 2889999999
Q ss_pred CCEEEEEeCCCcEEEE
Q 003310 392 SNWIMISSSRGTSHLF 407 (832)
Q Consensus 392 g~~LAsgS~DgTVhIw 407 (832)
--||.+++.||||++|
T Consensus 717 qpWLfsAGAd~tirlf 732 (733)
T KOG0650|consen 717 QPWLFSAGADGTIRLF 732 (733)
T ss_pred CceEEecCCCceEEee
Confidence 9999999999999999
No 170
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=98.85 E-value=3.3e-06 Score=89.70 Aligned_cols=221 Identities=19% Similarity=0.360 Sum_probs=146.0
Q ss_pred cCCeEEEEeccCCCeeEEeeecCCCEEEEEEecCCCcccccCCcccccCCEEEEEeCCCCccCccccCCcccccCCCCCC
Q 003310 26 RSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPRPITSKRSRDKFAEVRPLLVFCADGSRSCGTKVQDGLATACNGTSAN 105 (832)
Q Consensus 26 ~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~p~~~~~~~d~f~~~rPLLavv~~g~~~g~~~~~Dg~~~~~~g~~~~ 105 (832)
++.+.+||+.........+..|...|+.+.+.|... .++.+..
T Consensus 133 d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~--------------~~~~~~~----------------------- 175 (466)
T COG2319 133 DGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGK--------------LLASGSS----------------------- 175 (466)
T ss_pred CccEEEEEecCCCeEEEEEecCcccEEEEEECCCCC--------------EEEecCC-----------------------
Confidence 556888888752344555566778888777776421 2222210
Q ss_pred CCCCCCCCCCCCEEEEEECCCCcEEEEEeC-CCCEEEEEEcC--C-EEEE-EeCCEEEEEECCCCceEE-EEecCCCccC
Q 003310 106 YHDLGNGSSVPTVVHFYSLRSQSYVHMLKF-RSPIYSVRCSS--R-VVAI-CQAAQVHCFDAATLEIEY-AILTNPIVMG 179 (832)
Q Consensus 106 ~h~~g~~~~~~~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~--r-~LAV-a~~~~I~vwDl~t~~~~~-tl~t~~~~~~ 179 (832)
.++++++|++.++..+..+.. ...|..+++++ + +++. +.++.|++||..+++.+. .+..+...
T Consensus 176 ---------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~-- 244 (466)
T COG2319 176 ---------LDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDS-- 244 (466)
T ss_pred ---------CCCceEEEEcCCCceEEeeccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEeeecCCCCcc--
Confidence 127899999999999998886 67899999974 3 4444 578899999988776665 34333110
Q ss_pred CCCCCCCCcccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccc
Q 003310 180 HPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGY 259 (832)
Q Consensus 180 ~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~ 259 (832)
. +. .+. | ++.
T Consensus 245 -----------~-~~------~~~-------------~--------------~~~------------------------- 254 (466)
T COG2319 245 -----------V-VS------SFS-------------P--------------DGS------------------------- 254 (466)
T ss_pred -----------e-eE------eEC-------------C--------------CCC-------------------------
Confidence 0 00 011 0 000
Q ss_pred cccccccccccCCCcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcE-EEEeccCCCCeEEEEEcCCCCEEEE
Q 003310 260 KKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNV-IAQFRAHKSPISALCFDPSGILLVT 338 (832)
Q Consensus 260 ~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~-l~~~~aH~~pIs~LaFSPdG~lLAT 338 (832)
.++.+..++.+++||+..... +..+..|..+|.++.|+|++..+++
T Consensus 255 ---------------------------------~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~ 301 (466)
T COG2319 255 ---------------------------------LLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLAS 301 (466)
T ss_pred ---------------------------------EEEEecCCCcEEEeeecCCCcEEEEEecCCccEEEEEECCCCCEEEE
Confidence 011235678899999987664 5555788999999999999999999
Q ss_pred EEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe-ccCccccEEEEEEccCCCEEEEE-eCCCcEEEEecCCCC
Q 003310 339 ASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ-RGLTNAVIQDISFSDDSNWIMIS-SSRGTSHLFAINPLG 413 (832)
Q Consensus 339 aS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~-rG~t~a~I~~IaFSpDg~~LAsg-S~DgTVhIwdl~~~g 413 (832)
++.|+. +++||+... ....... .++.. .|..++|++++..++.+ ..|+.+.+|++....
T Consensus 302 ~~~d~~-~~~~~~~~~--------------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 362 (466)
T COG2319 302 GSSDGT-VRLWDLETG--------------KLLSSLTLKGHEG-PVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGK 362 (466)
T ss_pred eeCCCc-EEEEEcCCC--------------ceEEEeeecccCC-ceEEEEECCCCCEEEEeecCCCcEEeeecCCCc
Confidence 999965 999988765 1111221 23322 58899995443566666 688999999998765
No 171
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.81 E-value=2e-07 Score=98.78 Aligned_cols=90 Identities=19% Similarity=0.333 Sum_probs=66.2
Q ss_pred CCeEEEEECCC-CcEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 300 VGMVIVRDIVS-KNVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 300 ~G~V~IwDl~s-~~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
...|.|.|++. ..+++.|+.|.+.|+.|+|.| +...|+||++|-. .-|||+..-.... .. .....|+.
T Consensus 265 S~~V~iLDiR~P~tpva~L~~H~a~VNgIaWaPhS~~hictaGDD~q-aliWDl~q~~~~~--~~----dPilay~a--- 334 (364)
T KOG0290|consen 265 SNKVVILDIRVPCTPVARLRNHQASVNGIAWAPHSSSHICTAGDDCQ-ALIWDLQQMPREN--GE----DPILAYTA--- 334 (364)
T ss_pred CceEEEEEecCCCcceehhhcCcccccceEecCCCCceeeecCCcce-EEEEecccccccC--CC----Cchhhhhc---
Confidence 34689999986 568999999999999999999 5678999999965 7899997641100 00 11222332
Q ss_pred CccccEEEEEEcc-CCCEEEEEeCC
Q 003310 378 LTNAVIQDISFSD-DSNWIMISSSR 401 (832)
Q Consensus 378 ~t~a~I~~IaFSp-Dg~~LAsgS~D 401 (832)
.+.|..|.|++ .+.|||++...
T Consensus 335 --~~EVNqi~Ws~~~~Dwiai~~~k 357 (364)
T KOG0290|consen 335 --GGEVNQIQWSSSQPDWIAICFGK 357 (364)
T ss_pred --cceeeeeeecccCCCEEEEEecC
Confidence 24699999995 78899998754
No 172
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.81 E-value=2.2e-06 Score=93.41 Aligned_cols=199 Identities=19% Similarity=0.278 Sum_probs=129.6
Q ss_pred EEEEEECCCCcEEEEEeCC----CCEEEEEEcC--CEEEE---EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCc
Q 003310 118 VVHFYSLRSQSYVHMLKFR----SPIYSVRCSS--RVVAI---CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGI 188 (832)
Q Consensus 118 tVrlWDL~Tg~~V~tL~f~----s~V~sV~~S~--r~LAV---a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~ 188 (832)
.|.|||+++-+.+|+|.-. ..+.++.+|. -+||. ...+.|++||+.+.+..-++..|..
T Consensus 107 ~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~------------ 174 (391)
T KOG2110|consen 107 SIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKG------------ 174 (391)
T ss_pred cEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCC------------
Confidence 4899999999999999752 3466666664 37776 2457999999999998888877632
Q ss_pred ccceeeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCcccccccccccc
Q 003310 189 GYGPLAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSE 268 (832)
Q Consensus 189 ~~~p~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~ 268 (832)
++| -||+.. +|..+
T Consensus 175 ---~lA----alafs~---------------------------~G~ll-------------------------------- 188 (391)
T KOG2110|consen 175 ---PLA----ALAFSP---------------------------DGTLL-------------------------------- 188 (391)
T ss_pred ---cee----EEEECC---------------------------CCCEE--------------------------------
Confidence 233 245542 22222
Q ss_pred ccCCCcCccccccCCCCCCCcccccccccCCCC-eEEEEECCCCcEEEEeccCCC--CeEEEEEcCCCCEEEEEEcCCCE
Q 003310 269 FLPDSQNSLQSAIPGGKSNGTVNGHFPDADNVG-MVIVRDIVSKNVIAQFRAHKS--PISALCFDPSGILLVTASVQGHN 345 (832)
Q Consensus 269 ~~p~~~~si~sa~~~~~~~g~~~g~~~s~~~~G-~V~IwDl~s~~~l~~~~aH~~--pIs~LaFSPdG~lLATaS~dGt~ 345 (832)
+++...| .|||+++.+++.+.+|+.-.. .|.+|+|+||+++|+..|..++
T Consensus 189 --------------------------ATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeT- 241 (391)
T KOG2110|consen 189 --------------------------ATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTET- 241 (391)
T ss_pred --------------------------EEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCe-
Confidence 1122223 489999999999999996554 4789999999999999999888
Q ss_pred EEEEeCCCCCCCCCC----ccCCC------------CceeEEEEEeccCccccE------EEEEEc--cCCCEEEEEeCC
Q 003310 346 INIFKIIPGILGTSS----ACDAG------------TSYVHLYRLQRGLTNAVI------QDISFS--DDSNWIMISSSR 401 (832)
Q Consensus 346 I~Iwdi~t~~~~~~s----~~~~~------------~~~~~l~~l~rG~t~a~I------~~IaFS--pDg~~LAsgS~D 401 (832)
|+||.+......... ..++. ..+...+...|-+-.++| ..++|+ +...++.+++.|
T Consensus 242 VHiFKL~~~~~~~~~~p~~~~~~~~~~sk~~~sylps~V~~~~~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~d 321 (391)
T KOG2110|consen 242 VHIFKLEKVSNNPPESPTAGTSWFGKVSKAATSYLPSQVSSVLDQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYD 321 (391)
T ss_pred EEEEEecccccCCCCCCCCCCcccchhhhhhhhhcchhhhhhhhhccceeEEEccCCCccceEEeeccCCCCEEEEEEcC
Confidence 999988764210000 00110 011111122222211111 345566 477899999999
Q ss_pred CcEEEEecCCC-CCce-eeccC
Q 003310 402 GTSHLFAINPL-GGSV-NFQPT 421 (832)
Q Consensus 402 gTVhIwdl~~~-g~~~-~~~~H 421 (832)
|.+..|.+.+. ||+. .+..|
T Consensus 322 G~~y~y~l~~~~gGec~lik~h 343 (391)
T KOG2110|consen 322 GHLYSYRLPPKEGGECALIKRH 343 (391)
T ss_pred CeEEEEEcCCCCCceeEEEEee
Confidence 99999999874 5554 44445
No 173
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=98.80 E-value=1.6e-08 Score=108.39 Aligned_cols=101 Identities=22% Similarity=0.369 Sum_probs=85.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCC-CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSG-ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG-~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
+.++.-|.|+|.|+.++++...+.+|...|+.|+|.|+- +||++||.|- .||+|++.+. .++..
T Consensus 109 a~~G~~GvIrVid~~~~~~~~~~~ghG~sINeik~~p~~~qlvls~SkD~-svRlwnI~~~--------------~Cv~V 173 (385)
T KOG1034|consen 109 AAGGYLGVIRVIDVVSGQCSKNYRGHGGSINEIKFHPDRPQLVLSASKDH-SVRLWNIQTD--------------VCVAV 173 (385)
T ss_pred EeecceeEEEEEecchhhhccceeccCccchhhhcCCCCCcEEEEecCCc-eEEEEeccCC--------------eEEEE
Confidence 344577999999999999999999999999999999974 7899999995 5999999987 34434
Q ss_pred E--eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 374 L--QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 374 l--~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+ ..||. ..|.+|.|++||.+||+++.|-++++|+++.
T Consensus 174 fGG~egHr-deVLSvD~~~~gd~i~ScGmDhslk~W~l~~ 212 (385)
T KOG1034|consen 174 FGGVEGHR-DEVLSVDFSLDGDRIASCGMDHSLKLWRLNV 212 (385)
T ss_pred eccccccc-CcEEEEEEcCCCCeeeccCCcceEEEEecCh
Confidence 3 12322 2499999999999999999999999999984
No 174
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.79 E-value=3.7e-06 Score=96.66 Aligned_cols=81 Identities=16% Similarity=0.187 Sum_probs=53.2
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC--CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQG--HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG--t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.++|+.+++. ..+..+...+...+|||||++||..+.++ ..|.+||+..+ ... .+..+
T Consensus 315 ~Iy~~d~~g~~~-~~lt~~~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~-------------~~~--~lt~~-- 376 (435)
T PRK05137 315 QLYVMNADGSNP-RRISFGGGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGS-------------GER--ILTSG-- 376 (435)
T ss_pred eEEEEECCCCCe-EEeecCCCcccCeEECCCCCEEEEEEcCCCceEEEEEECCCC-------------ceE--eccCC--
Confidence 578888876654 33333445567789999999998876543 34667776443 122 22222
Q ss_pred cccEEEEEEccCCCEEEEEeCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~D 401 (832)
..+.+.+|||||++|+..+.+
T Consensus 377 -~~~~~p~~spDG~~i~~~~~~ 397 (435)
T PRK05137 377 -FLVEGPTWAPNGRVIMFFRQT 397 (435)
T ss_pred -CCCCCCeECCCCCEEEEEEcc
Confidence 136788999999999887664
No 175
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=98.78 E-value=2.9e-07 Score=102.85 Aligned_cols=107 Identities=14% Similarity=0.233 Sum_probs=81.9
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
.+++..|+|+|||++.+.+...++.|++.|+++.++-...+||+++..|. |.|..+.++... -.|
T Consensus 95 ~sgG~~~~Vkiwdl~~kl~hr~lkdh~stvt~v~YN~~DeyiAsvs~gGd-iiih~~~t~~~t--------------t~f 159 (673)
T KOG4378|consen 95 ISGGQSGCVKIWDLRAKLIHRFLKDHQSTVTYVDYNNTDEYIASVSDGGD-IIIHGTKTKQKT--------------TTF 159 (673)
T ss_pred eccCcCceeeehhhHHHHHhhhccCCcceeEEEEecCCcceeEEeccCCc-EEEEecccCccc--------------cce
Confidence 45677899999999988888899999999999999999999999999887 778888876211 122
Q ss_pred eccCccccEEEEEEccCCCE-EEEEeCCCcEEEEecCCCCCcee
Q 003310 375 QRGLTNAVIQDISFSDDSNW-IMISSSRGTSHLFAINPLGGSVN 417 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~-LAsgS~DgTVhIwdl~~~g~~~~ 417 (832)
.-+. .-.|.-+.|||-.++ |.++|++|+|++||+........
T Consensus 160 ~~~s-gqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~~ 202 (673)
T KOG4378|consen 160 TIDS-GQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIFH 202 (673)
T ss_pred ecCC-CCeEEEeecccccceeeEeeccCCeEEEEeccCCCcccc
Confidence 1111 112556778887776 55789999999999976544433
No 176
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=98.76 E-value=5.7e-08 Score=106.13 Aligned_cols=113 Identities=12% Similarity=0.224 Sum_probs=92.8
Q ss_pred ccccCCCCeEEEEECCCC-------cEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCC
Q 003310 294 FPDADNVGMVIVRDIVSK-------NVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAG 365 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~-------~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~ 365 (832)
+++++.|-+|+||++..+ +++..|.+|+..|.-++|.|... .|+|++.|.+ |.||++.++
T Consensus 97 IASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~-v~iWnv~tg----------- 164 (472)
T KOG0303|consen 97 IASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNT-VSIWNVGTG----------- 164 (472)
T ss_pred eecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCce-EEEEeccCC-----------
Confidence 467788999999999653 57889999999999999999755 7889999865 899999987
Q ss_pred CceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCC
Q 003310 366 TSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDAN 424 (832)
Q Consensus 366 ~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~ 424 (832)
..+.++. ++-.|++++|+.||.+|++++.|..|+|||..+..-...-.+|.+.
T Consensus 165 ---eali~l~---hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e~~~heG~ 217 (472)
T KOG0303|consen 165 ---EALITLD---HPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRRGTVVSEGVAHEGA 217 (472)
T ss_pred ---ceeeecC---CCCeEEEEEeccCCceeeeecccceeEEEcCCCCcEeeecccccCC
Confidence 5556664 3446999999999999999999999999999875444333577654
No 177
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.75 E-value=6.4e-06 Score=88.53 Aligned_cols=108 Identities=18% Similarity=0.276 Sum_probs=73.0
Q ss_pred eEEEEECCCCcEEEEeccC--CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCC---CccCCC-------Ccee
Q 003310 302 MVIVRDIVSKNVIAQFRAH--KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTS---SACDAG-------TSYV 369 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH--~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~---s~~~~~-------~~~~ 369 (832)
.|||||..+|+.+..|+.- ...|.||+||||+.+||.+|++|| ++||.++....... |..-+. .+.+
T Consensus 205 LIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdKgT-lHiF~l~~~~~~~~~~SSl~~~~~~lpky~~S~w 283 (346)
T KOG2111|consen 205 LIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSSDKGT-LHIFSLRDTENTEDESSSLSFKRLVLPKYFSSEW 283 (346)
T ss_pred EEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEcCCCe-EEEEEeecCCCCccccccccccccccchhcccce
Confidence 4999999999999999843 346999999999999999999998 89999876422111 111000 0112
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
-+.+++- ......-++|-.+.+-+++...||+-+-+.+.+.
T Consensus 284 S~~~f~l--~~~~~~~~~fg~~~nsvi~i~~Dgsy~k~~f~~~ 324 (346)
T KOG2111|consen 284 SFAKFQL--PQGTQCIIAFGSETNTVIAICADGSYYKFKFDPK 324 (346)
T ss_pred eEEEEEc--cCCCcEEEEecCCCCeEEEEEeCCcEEEEEeccc
Confidence 2222211 1122456789888677777788899876666543
No 178
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.73 E-value=6.2e-07 Score=98.21 Aligned_cols=107 Identities=21% Similarity=0.239 Sum_probs=89.1
Q ss_pred cccccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 293 HFPDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
+|++...-+.|++||...+ +++.+|.--..+|+++...|+|+++.+|...|. +..||++.+ .++
T Consensus 218 ~fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g~-l~~FD~r~~--------------kl~ 282 (412)
T KOG3881|consen 218 KFATITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKGQ-LAKFDLRGG--------------KLL 282 (412)
T ss_pred eEEEEecceeEEEecCcccCcceeEeccccCcceeeeecCCCcEEEEecccch-hheecccCc--------------eee
Confidence 4555666789999999865 689999888889999999999999999999997 899999886 343
Q ss_pred EEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
-....|.+. .|.+|.-.|..++||+++.|+-++|||+.+.+-.
T Consensus 283 g~~~kg~tG-sirsih~hp~~~~las~GLDRyvRIhD~ktrkll 325 (412)
T KOG3881|consen 283 GCGLKGITG-SIRSIHCHPTHPVLASCGLDRYVRIHDIKTRKLL 325 (412)
T ss_pred ccccCCccC-CcceEEEcCCCceEEeeccceeEEEeecccchhh
Confidence 343456554 4999999999999999999999999999885433
No 179
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=98.73 E-value=1.8e-07 Score=102.39 Aligned_cols=117 Identities=13% Similarity=0.141 Sum_probs=87.1
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
++++.|.+|.||++.+++.+-++. |..-|.+++|+.||.+|+|++.|.. |||||.+++ ..+.+-
T Consensus 148 lsag~Dn~v~iWnv~tgeali~l~-hpd~i~S~sfn~dGs~l~TtckDKk-vRv~dpr~~--------------~~v~e~ 211 (472)
T KOG0303|consen 148 LSAGSDNTVSIWNVGTGEALITLD-HPDMVYSMSFNRDGSLLCTTCKDKK-VRVIDPRRG--------------TVVSEG 211 (472)
T ss_pred hhccCCceEEEEeccCCceeeecC-CCCeEEEEEeccCCceeeeecccce-eEEEcCCCC--------------cEeeec
Confidence 456678899999999999988988 9999999999999999999999976 999999987 222332
Q ss_pred eccCccccEEEEEEccCCCEEEEEeC---CCcEEEEecCCCCCc---eeeccCCCCcccc
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSS---RGTSHLFAINPLGGS---VNFQPTDANFTTK 428 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~---DgTVhIwdl~~~g~~---~~~~~H~~~~~~~ 428 (832)
.+|..++-..+-|=.+|+++-+|-+ ++.+-|||-.....+ ..+.+.++.+-+|
T Consensus 212 -~~heG~k~~Raifl~~g~i~tTGfsr~seRq~aLwdp~nl~eP~~~~elDtSnGvl~PF 270 (472)
T KOG0303|consen 212 -VAHEGAKPARAIFLASGKIFTTGFSRMSERQIALWDPNNLEEPIALQELDTSNGVLLPF 270 (472)
T ss_pred -ccccCCCcceeEEeccCceeeeccccccccceeccCcccccCcceeEEeccCCceEEee
Confidence 4555556666778889996655544 567889986554443 4445544444433
No 180
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.71 E-value=4.1e-08 Score=118.53 Aligned_cols=104 Identities=17% Similarity=0.218 Sum_probs=78.8
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCC--CeEEEEEcCCCC-EEEEEEcCCC--EEEEEeCCCCCCCCCCccCCCCcee
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKS--PISALCFDPSGI-LLVTASVQGH--NINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~--pIs~LaFSPdG~-lLATaS~dGt--~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
+++...|.+.|||++.++++-.|.-|.. .++.|+|+||+. .|++|+.|.+ +|.+||++-. + .
T Consensus 178 AS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~a---s----------s 244 (1049)
T KOG0307|consen 178 ASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFA---S----------S 244 (1049)
T ss_pred hccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCCceeeeeecCCCCCceeEeeccccc---C----------C
Confidence 4456778999999999999888887754 478899999855 6777777643 6999998754 1 1
Q ss_pred EEEEEeccCccccEEEEEEccCC-CEEEEEeCCCcEEEEecCCCC
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDS-NWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg-~~LAsgS~DgTVhIwdl~~~g 413 (832)
.+..+ ++|.. -|.++.|++.+ ++|++++.|+.|.+|+.+++.
T Consensus 245 P~k~~-~~H~~-GilslsWc~~D~~lllSsgkD~~ii~wN~~tgE 287 (1049)
T KOG0307|consen 245 PLKIL-EGHQR-GILSLSWCPQDPRLLLSSGKDNRIICWNPNTGE 287 (1049)
T ss_pred chhhh-ccccc-ceeeeccCCCCchhhhcccCCCCeeEecCCCce
Confidence 12222 34433 38999999854 999999999999999998843
No 181
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=98.71 E-value=2.8e-08 Score=114.48 Aligned_cols=99 Identities=15% Similarity=0.306 Sum_probs=79.4
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTN 380 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~ 380 (832)
..|+||...+-..+..++.|.-.|+.|+|||||++|+++|.|.+ +.+|........ ..-+..-.-| .
T Consensus 552 AvI~lw~t~~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsRDRt-~sl~~~~~~~~~-----------e~~fa~~k~H-t 618 (764)
T KOG1063|consen 552 AVIRLWNTANWLQVQELEGHSLTVTRLAFSPDGRYLLSVSRDRT-VSLYEVQEDIKD-----------EFRFACLKAH-T 618 (764)
T ss_pred eEEEEEeccchhhhheecccceEEEEEEECCCCcEEEEeecCce-EEeeeeecccch-----------hhhhcccccc-c
Confidence 46999999998888899999999999999999999999999976 899988654110 0001111122 1
Q ss_pred ccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 381 AVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 381 a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.-|+++.|+||++++||+|.|.+|+||.....
T Consensus 619 RIIWdcsW~pde~~FaTaSRDK~VkVW~~~~~ 650 (764)
T KOG1063|consen 619 RIIWDCSWSPDEKYFATASRDKKVKVWEEPDL 650 (764)
T ss_pred eEEEEcccCcccceeEEecCCceEEEEeccCc
Confidence 23999999999999999999999999999765
No 182
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.71 E-value=1.5e-07 Score=96.73 Aligned_cols=91 Identities=15% Similarity=0.357 Sum_probs=71.8
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC---CCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ---GHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d---Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
..+..|.|||+. .+.+.+|. ..++..|+|||+|++||+|+.+ |. |.+||+... ..+.++
T Consensus 80 ~~~~~v~lyd~~-~~~i~~~~--~~~~n~i~wsP~G~~l~~~g~~n~~G~-l~~wd~~~~--------------~~i~~~ 141 (194)
T PF08662_consen 80 SMPAKVTLYDVK-GKKIFSFG--TQPRNTISWSPDGRFLVLAGFGNLNGD-LEFWDVRKK--------------KKISTF 141 (194)
T ss_pred cCCcccEEEcCc-ccEeEeec--CCCceEEEECCCCCEEEEEEccCCCcE-EEEEECCCC--------------EEeecc
Confidence 445689999997 66667775 4678999999999999999854 44 899999865 444444
Q ss_pred eccCccccEEEEEEccCCCEEEEEeC------CCcEEEEecC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSS------RGTSHLFAIN 410 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~------DgTVhIwdl~ 410 (832)
.. ..+.+++|||||++|++++. |..++||++.
T Consensus 142 ~~----~~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 142 EH----SDATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred cc----CcEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 32 23789999999999999875 7889999984
No 183
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=98.70 E-value=6.5e-07 Score=103.56 Aligned_cols=101 Identities=18% Similarity=0.223 Sum_probs=70.7
Q ss_pred cEEEEeccCCCCeEEEEEcCCCC---EEEEEEcCCCEEEEEeCCCCCC-CCCCc--cC---CCCce-eEEE---------
Q 003310 312 NVIAQFRAHKSPISALCFDPSGI---LLVTASVQGHNINIFKIIPGIL-GTSSA--CD---AGTSY-VHLY--------- 372 (832)
Q Consensus 312 ~~l~~~~aH~~pIs~LaFSPdG~---lLATaS~dGt~I~Iwdi~t~~~-~~~s~--~~---~~~~~-~~l~--------- 372 (832)
+.+..+++|+..|.+|+|..-|. +|||+|.|. .||||.+..... ++-.. .. .+... ..+.
T Consensus 182 ~~v~el~GH~DWIrsl~f~~~~~~~~~laS~SQD~-yIRiW~i~~~~~~~~~~~e~~~t~~~~~~~f~~l~~i~~~is~e 260 (764)
T KOG1063|consen 182 ARVAELEGHTDWIRSLAFARLGGDDLLLASSSQDR-YIRIWRIVLGDDEDSNEREDSLTTLSNLPVFMILEEIQYRISFE 260 (764)
T ss_pred eEEEEeeccchhhhhhhhhccCCCcEEEEecCCce-EEEEEEEEecCCccccccccccccccCCceeeeeeeEEEEEehh
Confidence 57789999999999999988655 889999995 599999876520 00000 00 00000 1111
Q ss_pred EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 373 RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
.+--||.. .|+++-|+|++..|.++|.|.|+.||.-....|
T Consensus 261 all~GHeD-WV~sv~W~p~~~~LLSASaDksmiiW~pd~~tG 301 (764)
T KOG1063|consen 261 ALLMGHED-WVYSVWWHPEGLDLLSASADKSMIIWKPDENTG 301 (764)
T ss_pred hhhcCccc-ceEEEEEccchhhheecccCcceEEEecCCccc
Confidence 22246654 499999999999999999999999998876655
No 184
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=98.69 E-value=2.5e-06 Score=91.20 Aligned_cols=251 Identities=15% Similarity=0.197 Sum_probs=146.4
Q ss_pred cccccCCCCeEEEEECCCC----cEEEEeccCCCCeEEEEEcCC--CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSK----NVIAQFRAHKSPISALCFDPS--GILLVTASVQGHNINIFKIIPGILGTSSACDAGT 366 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~----~~l~~~~aH~~pIs~LaFSPd--G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~ 366 (832)
..++...|++|+|||..+. .+-...++|.+.|..+.|-+- |+.+|++|.|++ +.||.-..... .....
T Consensus 27 RmAtCSsDq~vkI~d~~~~s~~W~~Ts~Wrah~~Si~rV~WAhPEfGqvvA~cS~Drt-v~iWEE~~~~~-----~~~~~ 100 (361)
T KOG2445|consen 27 RMATCSSDQTVKIWDSTSDSGTWSCTSSWRAHDGSIWRVVWAHPEFGQVVATCSYDRT-VSIWEEQEKSE-----EAHGR 100 (361)
T ss_pred eeeeccCCCcEEEEeccCCCCceEEeeeEEecCCcEEEEEecCccccceEEEEecCCc-eeeeeeccccc-----ccccc
Confidence 3567788999999997543 577889999999999999663 999999999998 89997532200 00001
Q ss_pred ceeEEEEEeccCccccEEEEEEcc--CCCEEEEEeCCCcEEEEecCCCCCceeec-cCCCC-c--ccccCCcccccccCC
Q 003310 367 SYVHLYRLQRGLTNAVIQDISFSD--DSNWIMISSSRGTSHLFAINPLGGSVNFQ-PTDAN-F--TTKHGAMAKSGVRWP 440 (832)
Q Consensus 367 ~~~~l~~l~rG~t~a~I~~IaFSp--Dg~~LAsgS~DgTVhIwdl~~~g~~~~~~-~H~~~-~--~~~~~~~~~~~~r~~ 440 (832)
...+..+|.- ....|++|+|.| -|-.||+++.||+++||+.-.......+. .|.-. . ++........|+.|.
T Consensus 101 ~Wv~~ttl~D--srssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~dp~nLs~W~Lq~Ei~~~~~pp~~~~~~~~CvsWn 178 (361)
T KOG2445|consen 101 RWVRRTTLVD--SRSSVTDVKFAPKHLGLKLAAASADGILRIYEAPDPMNLSQWTLQHEIQNVIDPPGKNKQPCFCVSWN 178 (361)
T ss_pred eeEEEEEeec--CCcceeEEEecchhcceEEEEeccCcEEEEEecCCccccccchhhhhhhhccCCcccccCcceEEeec
Confidence 1223333321 123499999998 48889999999999999986654433331 22111 0 100111222344555
Q ss_pred CCCCCCCCCCccccc--CCCCeeeeeceEEEcCCCC--CCccccccchhccCcccCCCcceeeeeeccCCCccccccCCc
Q 003310 441 PNLGLQMPNQQSLCA--SGPPVTLSVVSRIRNGNNG--WRGTVSGAAAAATGRVSSLSGAIASSFHNCKGNSETYAAGSS 516 (832)
Q Consensus 441 ~~s~~~~~~~~~l~~--~~~p~~ls~v~~I~~~~~~--~~~~v~~~~~~a~g~~~~~~g~~~~~~h~~~~~~~~~~~~~~ 516 (832)
+.- + -.+-+++ ...+-.+..+-+-.+..++ |.- + -.+.-|-++.+..++++..
T Consensus 179 ~sr-~---~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~k------------v------a~L~d~~dpI~di~wAPn~- 235 (361)
T KOG2445|consen 179 PSR-M---HEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLK------------V------AELPDHTDPIRDISWAPNI- 235 (361)
T ss_pred ccc-c---cCceEEEEcccCCccccceEEEEecCCcceeee------------e------hhcCCCCCcceeeeecccc-
Confidence 222 1 1222221 1111124444333333322 310 0 0122577888887787664
Q ss_pred ccccccEEEEcCCCcEEEEeeeccCCCCccccCCCCCCcCCCC-CCCCceEEeeeeeeeeccccccccc
Q 003310 517 LKIKNHLLVFSPSGCMIQYALRISTGLDVTMGVPGLGSAYDSV-PEDDPRLVVEAIQKWNICQKQARRE 584 (832)
Q Consensus 517 ~~~~~~Llv~s~~G~l~~y~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ve~~~~Wdv~rr~~~~e 584 (832)
...-+-|-|.+-|| +-.|.+.+...+-. ..|.. ++.+.++.||-+.+-|=.+-+-|+-
T Consensus 236 Gr~y~~lAvA~kDg-v~I~~v~~~~s~i~---------~ee~~~~~~~~~l~v~~vs~~~~H~~~VWrv 294 (361)
T KOG2445|consen 236 GRSYHLLAVATKDG-VRIFKVKVARSAIE---------EEEVLAPDLMTDLPVEKVSELDDHNGEVWRV 294 (361)
T ss_pred CCceeeEEEeecCc-EEEEEEeeccchhh---------hhcccCCCCccccceEEeeeccCCCCceEEE
Confidence 33555677888899 88999987632111 11222 5666777777777666555455554
No 185
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.69 E-value=8.3e-07 Score=94.08 Aligned_cols=103 Identities=17% Similarity=0.241 Sum_probs=82.7
Q ss_pred CCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 298 DNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
..++++..||+++.+....++ +|...|..|.|+|+-+ +||||++||. |||||.+.. ...+.+|.
T Consensus 190 t~d~tl~~~D~RT~~~~~sI~dAHgq~vrdlDfNpnkq~~lvt~gDdgy-vriWD~R~t-------------k~pv~el~ 255 (370)
T KOG1007|consen 190 TSDSTLQFWDLRTMKKNNSIEDAHGQRVRDLDFNPNKQHILVTCGDDGY-VRIWDTRKT-------------KFPVQELP 255 (370)
T ss_pred eCCCcEEEEEccchhhhcchhhhhcceeeeccCCCCceEEEEEcCCCcc-EEEEeccCC-------------CccccccC
Confidence 467899999999987776666 9999999999999877 6899999998 999999864 12344553
Q ss_pred ccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCCce
Q 003310 376 RGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGGSV 416 (832)
Q Consensus 376 rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~~~ 416 (832)
++ .+.|++|-|.| ..++|.++++|..|.+|....-..+.
T Consensus 256 -~H-sHWvW~VRfn~~hdqLiLs~~SDs~V~Lsca~svSSE~ 295 (370)
T KOG1007|consen 256 -GH-SHWVWAVRFNPEHDQLILSGGSDSAVNLSCASSVSSEQ 295 (370)
T ss_pred -CC-ceEEEEEEecCccceEEEecCCCceeEEEecccccccc
Confidence 44 45799999998 46889999999999999876554443
No 186
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=98.68 E-value=2.5e-06 Score=101.11 Aligned_cols=108 Identities=19% Similarity=0.302 Sum_probs=81.7
Q ss_pred ccccCCCCeEEEEECCC--C--cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCcee
Q 003310 294 FPDADNVGMVIVRDIVS--K--NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s--~--~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
++.+..+|.|.||.--. + .-...|.=|..+|.+|+||+||.+|.||+..|- +-+|.+.++ ...
T Consensus 220 ~Aa~d~dGrI~vw~d~~~~~~~~t~t~lHWH~~~V~~L~fS~~G~~LlSGG~E~V-Lv~Wq~~T~------------~kq 286 (792)
T KOG1963|consen 220 LAAGDSDGRILVWRDFGSSDDSETCTLLHWHHDEVNSLSFSSDGAYLLSGGREGV-LVLWQLETG------------KKQ 286 (792)
T ss_pred EEEeccCCcEEEEeccccccccccceEEEecccccceeEEecCCceEeecccceE-EEEEeecCC------------Ccc
Confidence 34567789999995322 1 233556668889999999999999999999985 789999886 112
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
.|-+| .+.|..+.+|||+...+....|..||+-....-....++.
T Consensus 287 fLPRL-----gs~I~~i~vS~ds~~~sl~~~DNqI~li~~~dl~~k~tIs 331 (792)
T KOG1963|consen 287 FLPRL-----GSPILHIVVSPDSDLYSLVLEDNQIHLIKASDLEIKSTIS 331 (792)
T ss_pred ccccc-----CCeeEEEEEcCCCCeEEEEecCceEEEEeccchhhhhhcc
Confidence 23333 3569999999999999999999999998876554444443
No 187
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.66 E-value=1.4e-05 Score=91.75 Aligned_cols=70 Identities=21% Similarity=0.326 Sum_probs=46.2
Q ss_pred eEEEEEcCCCCEEEEEEcCCC--EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 324 ISALCFDPSGILLVTASVQGH--NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt--~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
....+|||||++||.++.++. .|.+||+.++ ... .+..+ ......+|+|||++|+.++.+
T Consensus 330 ~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g------------~~~---~lt~~---~~~~~p~~spdg~~l~~~~~~ 391 (427)
T PRK02889 330 NTSPRISPDGKLLAYISRVGGAFKLYVQDLATG------------QVT---ALTDT---TRDESPSFAPNGRYILYATQQ 391 (427)
T ss_pred cCceEECCCCCEEEEEEccCCcEEEEEEECCCC------------CeE---EccCC---CCccCceECCCCCEEEEEEec
Confidence 345789999999998776542 4889998765 112 22122 124678999999999988865
Q ss_pred C-cEEEEecCC
Q 003310 402 G-TSHLFAINP 411 (832)
Q Consensus 402 g-TVhIwdl~~ 411 (832)
+ .-.+|-+..
T Consensus 392 ~g~~~l~~~~~ 402 (427)
T PRK02889 392 GGRSVLAAVSS 402 (427)
T ss_pred CCCEEEEEEEC
Confidence 4 334444443
No 188
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.65 E-value=2.2e-07 Score=103.00 Aligned_cols=118 Identities=14% Similarity=0.248 Sum_probs=94.2
Q ss_pred ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 292 GHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
..+++++.|.+|++||+.++++..++..|..+|.+|+|.|. .+.|++||.||+ ++++|.+.. .. ...
T Consensus 257 nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~-V~l~D~R~~--~~---------s~~ 324 (463)
T KOG0270|consen 257 NVLASGSADKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGT-VALKDCRDP--SN---------SGK 324 (463)
T ss_pred eeEEecCCCceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccce-EEeeeccCc--cc---------cCc
Confidence 34677889999999999999999999999999999999994 889999999998 899999852 11 023
Q ss_pred EEEEeccCccccEEEEEEccCCCE-EEEEeCCCcEEEEecCCCCCc-eeeccCCCCcc
Q 003310 371 LYRLQRGLTNAVIQDISFSDDSNW-IMISSSRGTSHLFAINPLGGS-VNFQPTDANFT 426 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSpDg~~-LAsgS~DgTVhIwdl~~~g~~-~~~~~H~~~~~ 426 (832)
-+++ .+.|-.++|.|.+.. +.+++.||+++=||++..+.. .++++|.+.+.
T Consensus 325 ~wk~-----~g~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~IS 377 (463)
T KOG0270|consen 325 EWKF-----DGEVEKVAWDPHSENSFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEIS 377 (463)
T ss_pred eEEe-----ccceEEEEecCCCceeEEEecCCceEEeeecCCCCCceeEEEeccCCcc
Confidence 3444 246899999987654 567778999999999987654 47788866444
No 189
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.65 E-value=1.9e-05 Score=90.78 Aligned_cols=89 Identities=12% Similarity=0.178 Sum_probs=55.0
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC--CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQG--HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG--t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.++|+.+++.. .+..+......++|||||++||..+.++ ..|.+||+.++ ... .+..+.
T Consensus 317 ~iy~~dl~~g~~~-~lt~~g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g------------~~~---~Lt~~~- 379 (433)
T PRK04922 317 QIYRVAASGGSAE-RLTFQGNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTG------------SVR---TLTPGS- 379 (433)
T ss_pred eEEEEECCCCCeE-EeecCCCCccCEEECCCCCEEEEEECCCCceeEEEEECCCC------------CeE---ECCCCC-
Confidence 4666677665532 2222223445789999999998766543 24889998765 111 232221
Q ss_pred cccEEEEEEccCCCEEEEEeCC-Cc--EEEEec
Q 003310 380 NAVIQDISFSDDSNWIMISSSR-GT--SHLFAI 409 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~D-gT--VhIwdl 409 (832)
.....+|||||++|+..+.+ +. +.++++
T Consensus 380 --~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~ 410 (433)
T PRK04922 380 --LDESPSFAPNGSMVLYATREGGRGVLAAVST 410 (433)
T ss_pred --CCCCceECCCCCEEEEEEecCCceEEEEEEC
Confidence 24467899999998887764 34 445555
No 190
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.64 E-value=2.9e-07 Score=105.04 Aligned_cols=93 Identities=13% Similarity=0.236 Sum_probs=69.6
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
+.+..+-+|+|||+.+.+....|.+|+..|..++|||||+++||.+.||+ |+||.-+... ..+|+-
T Consensus 694 a~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~-~rVy~Prs~e-------------~pv~Eg 759 (1012)
T KOG1445|consen 694 AVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGT-LRVYEPRSRE-------------QPVYEG 759 (1012)
T ss_pred hhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcceeeeecCce-EEEeCCCCCC-------------CccccC
Confidence 34567789999999999999999999999999999999999999999998 8999876541 112211
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
.|....+--.|.|.-||++|++.+.|.
T Consensus 760 -~gpvgtRgARi~wacdgr~viv~Gfdk 786 (1012)
T KOG1445|consen 760 -KGPVGTRGARILWACDGRIVIVVGFDK 786 (1012)
T ss_pred -CCCccCcceeEEEEecCcEEEEecccc
Confidence 111111112355778999998877664
No 191
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=98.62 E-value=1e-07 Score=107.98 Aligned_cols=113 Identities=20% Similarity=0.297 Sum_probs=87.9
Q ss_pred ccccccCCCCeEEEEECCC--------CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccC
Q 003310 292 GHFPDADNVGMVIVRDIVS--------KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACD 363 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s--------~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~ 363 (832)
+.++++..+|++++|.++. -+++.+|++|.+||.|++.++.|..+.||+.||+ |+.|.+.+.. .. . +
T Consensus 307 p~lit~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v~~n~~~~ysgg~Dg~-I~~w~~p~n~-dp--~-d 381 (577)
T KOG0642|consen 307 PVLITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVVPSNGEHCYSGGIDGT-IRCWNLPPNQ-DP--D-D 381 (577)
T ss_pred CeEEEeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEecCCceEEEeeccCce-eeeeccCCCC-Cc--c-c
Confidence 3567889999999999932 3589999999999999999999999999999998 9999886431 00 0 0
Q ss_pred CCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 364 AGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 364 ~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.... ..+....-|++.+ ||.+++|+....|+++|.|||+++|....
T Consensus 382 s~dp-~vl~~~l~Ghtda-vw~l~~s~~~~~Llscs~DgTvr~w~~~~ 427 (577)
T KOG0642|consen 382 SYDP-SVLSGTLLGHTDA-VWLLALSSTKDRLLSCSSDGTVRLWEPTE 427 (577)
T ss_pred ccCc-chhccceeccccc-eeeeeecccccceeeecCCceEEeeccCC
Confidence 0000 1122222577764 99999999999999999999999999854
No 192
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.57 E-value=1.8e-06 Score=103.61 Aligned_cols=109 Identities=13% Similarity=0.163 Sum_probs=75.7
Q ss_pred cccccCCCCeEEEEECCCCcE--EEEeccCC--C-CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCc
Q 003310 293 HFPDADNVGMVIVRDIVSKNV--IAQFRAHK--S-PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTS 367 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~--l~~~~aH~--~-pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~ 367 (832)
.++++..+|.|++||++.... .-++..|. + .+++|...++...+|+|+. + .|+||++.-.....
T Consensus 1271 elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~-q-~ikIy~~~G~~l~~--------- 1339 (1387)
T KOG1517|consen 1271 ELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSA-Q-LIKIYSLSGEQLNI--------- 1339 (1387)
T ss_pred ceeeeccCCeEEEEecccCcccccceeeeccccCccceeeeeccCCCeeeecCc-c-eEEEEecChhhhcc---------
Confidence 345677899999999987422 22333554 4 5999999999999999998 4 49999986431000
Q ss_pred eeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 368 YVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 368 ~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
.+-+...-|.....+.|++|.|.--.+|+|+.|.+|-||...+.+
T Consensus 1340 -~k~n~~F~~q~~gs~scL~FHP~~~llAaG~~Ds~V~iYs~~k~~ 1384 (1387)
T KOG1517|consen 1340 -IKYNPGFMGQRIGSVSCLAFHPHRLLLAAGSADSTVSIYSCEKPR 1384 (1387)
T ss_pred -cccCcccccCcCCCcceeeecchhHhhhhccCCceEEEeecCCcC
Confidence 000000011112237899999999999999999999999887653
No 193
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.56 E-value=2.4e-06 Score=98.04 Aligned_cols=90 Identities=10% Similarity=0.113 Sum_probs=60.8
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc-CCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc
Q 003310 303 VIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV-QGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA 381 (832)
Q Consensus 303 V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~-dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a 381 (832)
|.+||+.+++. ..+..|...+.+.+|+|||+.|+.++. +|. .+||++.... ..... + +.. .
T Consensus 274 Iy~~d~~~~~~-~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~-~~I~~~~~~~----------~~~~~---l--~~~-~ 335 (429)
T PRK01742 274 IYVMGANGGTP-SQLTSGAGNNTEPSWSPDGQSILFTSDRSGS-PQVYRMSASG----------GGASL---V--GGR-G 335 (429)
T ss_pred EEEEECCCCCe-EeeccCCCCcCCEEECCCCCEEEEEECCCCC-ceEEEEECCC----------CCeEE---e--cCC-C
Confidence 55667776654 556667777889999999998876664 565 7899875430 01111 1 111 1
Q ss_pred cEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 382 VIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
..++|||||++|+.++.++ +.+||+.++.
T Consensus 336 --~~~~~SpDG~~ia~~~~~~-i~~~Dl~~g~ 364 (429)
T PRK01742 336 --YSAQISADGKTLVMINGDN-VVKQDLTSGS 364 (429)
T ss_pred --CCccCCCCCCEEEEEcCCC-EEEEECCCCC
Confidence 3578999999999988765 4558987643
No 194
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.55 E-value=6.2e-07 Score=98.97 Aligned_cols=74 Identities=19% Similarity=0.320 Sum_probs=62.4
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
..|++|+.|+||+++|-|+.||. |-|++...- +.++-.++-|. ..|.++.|+||.+++++.|.|
T Consensus 282 ~siSsl~VS~dGkf~AlGT~dGs-Vai~~~~~l--------------q~~~~vk~aH~-~~VT~ltF~Pdsr~~~svSs~ 345 (398)
T KOG0771|consen 282 KSISSLAVSDDGKFLALGTMDGS-VAIYDAKSL--------------QRLQYVKEAHL-GFVTGLTFSPDSRYLASVSSD 345 (398)
T ss_pred CcceeEEEcCCCcEEEEeccCCc-EEEEEecee--------------eeeEeehhhhe-eeeeeEEEcCCcCcccccccC
Confidence 47999999999999999999997 889998764 45555544443 359999999999999999999
Q ss_pred CcEEEEecCC
Q 003310 402 GTSHLFAINP 411 (832)
Q Consensus 402 gTVhIwdl~~ 411 (832)
.+++|..+.-
T Consensus 346 ~~~~v~~l~v 355 (398)
T KOG0771|consen 346 NEAAVTKLAV 355 (398)
T ss_pred CceeEEEEee
Confidence 9999998853
No 195
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.55 E-value=2e-06 Score=91.38 Aligned_cols=107 Identities=16% Similarity=0.242 Sum_probs=81.0
Q ss_pred cccccCCCCeEEEEECCCCcEEEEe---ccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQF---RAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~---~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
.|++-+.||.||++|++...--..+ .....|...|++++ |-.+|||-..|...|.|-|++.. +
T Consensus 211 ~FASvgaDGSvRmFDLR~leHSTIIYE~p~~~~pLlRLswnkqDpnymATf~~dS~~V~iLDiR~P-------------~ 277 (364)
T KOG0290|consen 211 VFASVGADGSVRMFDLRSLEHSTIIYEDPSPSTPLLRLSWNKQDPNYMATFAMDSNKVVILDIRVP-------------C 277 (364)
T ss_pred eEEEecCCCcEEEEEecccccceEEecCCCCCCcceeeccCcCCchHHhhhhcCCceEEEEEecCC-------------C
Confidence 4566678999999999986432222 22246888999988 56689998888888999999864 1
Q ss_pred eEEEEEeccCccccEEEEEEccC-CCEEEEEeCCCcEEEEecCCCCC
Q 003310 369 VHLYRLQRGLTNAVIQDISFSDD-SNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSpD-g~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
..+.+| |+| .+.|+.|+|.|. +..|+++++|.-+-|||+.....
T Consensus 278 tpva~L-~~H-~a~VNgIaWaPhS~~hictaGDD~qaliWDl~q~~~ 322 (364)
T KOG0290|consen 278 TPVARL-RNH-QASVNGIAWAPHSSSHICTAGDDCQALIWDLQQMPR 322 (364)
T ss_pred cceehh-hcC-cccccceEecCCCCceeeecCCcceEEEEecccccc
Confidence 344566 454 467999999996 56899999999999999976543
No 196
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=98.54 E-value=1.4e-06 Score=103.23 Aligned_cols=117 Identities=15% Similarity=0.190 Sum_probs=82.1
Q ss_pred CCCeEEEEECCCCcE-EEEe---ccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 299 NVGMVIVRDIVSKNV-IAQF---RAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~-l~~~---~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
..-.+.+|+..++.. .... .-|+-+++|.+|||.++++|+|..||+ |.||.-... + + .......|
T Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~Htf~~t~~~~spn~~~~Aa~d~dGr-I~vw~d~~~-----~-~----~~~t~t~l 247 (792)
T KOG1963|consen 179 HMCKIHIYFVPKHTKHTSSRDITVHHTFNITCVALSPNERYLAAGDSDGR-ILVWRDFGS-----S-D----DSETCTLL 247 (792)
T ss_pred EeeeEEEEEecccceeeccchhhhhhcccceeEEeccccceEEEeccCCc-EEEEecccc-----c-c----ccccceEE
Confidence 344678888887541 1111 257778999999999999999999999 899964321 0 0 00111245
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcccccCCcccccc
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFTTKHGAMAKSGV 437 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~~~~~~~~~~~~ 437 (832)
++.+ +.|.+++||+||.+|.+|+..+.+-+|.+.+.+.. +.++.|+++.+++
T Consensus 248 HWH~--~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~kq---------fLPRLgs~I~~i~ 299 (792)
T KOG1963|consen 248 HWHH--DEVNSLSFSSDGAYLLSGGREGVLVLWQLETGKKQ---------FLPRLGSPILHIV 299 (792)
T ss_pred Eecc--cccceeEEecCCceEeecccceEEEEEeecCCCcc---------cccccCCeeEEEE
Confidence 5543 46999999999999999999999999999987532 2233456666653
No 197
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.54 E-value=3.5e-07 Score=100.94 Aligned_cols=127 Identities=18% Similarity=0.262 Sum_probs=91.5
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCC--C-----------
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGT--S----------- 359 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~--~----------- 359 (832)
.+++++.||++|||+..+...+..+.+|...|.+|.|||||++||+-+.| . .+||++.++.+.. +
T Consensus 158 ~latgg~dg~lRv~~~Ps~~t~l~e~~~~~eV~DL~FS~dgk~lasig~d-~-~~VW~~~~g~~~a~~t~~~k~~~~~~c 235 (398)
T KOG0771|consen 158 KLATGGTDGTLRVWEWPSMLTILEEIAHHAEVKDLDFSPDGKFLASIGAD-S-ARVWSVNTGAALARKTPFSKDEMFSSC 235 (398)
T ss_pred EeeeccccceEEEEecCcchhhhhhHhhcCccccceeCCCCcEEEEecCC-c-eEEEEeccCchhhhcCCcccchhhhhc
Confidence 45778899999999999988888999999999999999999999999999 3 7999999872200 0
Q ss_pred --CccCCCCceeEEE--------------EEecc------C---c-cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 360 --SACDAGTSYVHLY--------------RLQRG------L---T-NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 360 --s~~~~~~~~~~l~--------------~l~rG------~---t-~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
+.++.. ....+. .+.++ . . .-.|.+++.|+||+++|.|+.||.|-|++..+-.
T Consensus 236 RF~~d~~~-~~l~laa~~~~~~~v~~~~~~~w~~~~~l~~~~~~~~~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq 314 (398)
T KOG0771|consen 236 RFSVDNAQ-ETLRLAASQFPGGGVRLCDISLWSGSNFLRLRKKIKRFKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQ 314 (398)
T ss_pred eecccCCC-ceEEEEEecCCCCceeEEEeeeeccccccchhhhhhccCcceeEEEcCCCcEEEEeccCCcEEEEEeceee
Confidence 000100 011111 11112 0 0 1138899999999999999999999999998765
Q ss_pred Cceee-ccCC
Q 003310 414 GSVNF-QPTD 422 (832)
Q Consensus 414 ~~~~~-~~H~ 422 (832)
...-+ +.|.
T Consensus 315 ~~~~vk~aH~ 324 (398)
T KOG0771|consen 315 RLQYVKEAHL 324 (398)
T ss_pred eeEeehhhhe
Confidence 44333 4563
No 198
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.53 E-value=1.3e-07 Score=106.67 Aligned_cols=84 Identities=19% Similarity=0.370 Sum_probs=65.6
Q ss_pred cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccC
Q 003310 312 NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDD 391 (832)
Q Consensus 312 ~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpD 391 (832)
.++..+.--.++|...+|||||++||+.|.||. +||||..+. +|.-+.+-+ -+-..|||||||
T Consensus 281 NPv~~w~~~~g~in~f~FS~DG~~LA~VSqDGf-LRvF~fdt~---------------eLlg~mkSY-FGGLLCvcWSPD 343 (636)
T KOG2394|consen 281 NPVARWHIGEGSINEFAFSPDGKYLATVSQDGF-LRIFDFDTQ---------------ELLGVMKSY-FGGLLCVCWSPD 343 (636)
T ss_pred CccceeEeccccccceeEcCCCceEEEEecCce-EEEeeccHH---------------HHHHHHHhh-ccceEEEEEcCC
Confidence 455555544568999999999999999999998 899998764 211111111 123889999999
Q ss_pred CCEEEEEeCCCcEEEEecCCC
Q 003310 392 SNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 392 g~~LAsgS~DgTVhIwdl~~~ 412 (832)
|+||++|+.|.-|.||.+...
T Consensus 344 GKyIvtGGEDDLVtVwSf~er 364 (636)
T KOG2394|consen 344 GKYIVTGGEDDLVTVWSFEER 364 (636)
T ss_pred ccEEEecCCcceEEEEEeccc
Confidence 999999999999999999764
No 199
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=98.52 E-value=1.6e-06 Score=99.68 Aligned_cols=144 Identities=14% Similarity=0.248 Sum_probs=106.7
Q ss_pred CEEEEEEc--CCEEEEEeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccceeeeccceEEeeCCCceecCCCcc
Q 003310 138 PIYSVRCS--SRVVAICQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPLAVGPRWLAYSGSPVVVSNDGRV 215 (832)
Q Consensus 138 ~V~sV~~S--~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~Alg~r~LAya~~~~~~s~~Grv 215 (832)
.|++|+|- +.-|+++.+.++++||...+..++++.+|.... ..+||+-
T Consensus 14 ci~d~afkPDGsqL~lAAg~rlliyD~ndG~llqtLKgHKDtV-------------------ycVAys~----------- 63 (1081)
T KOG1538|consen 14 CINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKDTV-------------------YCVAYAK----------- 63 (1081)
T ss_pred chheeEECCCCceEEEecCCEEEEEeCCCcccccccccccceE-------------------EEEEEcc-----------
Confidence 67888885 456778889999999999999999999985411 1245652
Q ss_pred CCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCCCcCccccccCCCCCCCccccccc
Q 003310 216 NPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDSQNSLQSAIPGGKSNGTVNGHFP 295 (832)
Q Consensus 216 sp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~~~si~sa~~~~~~~g~~~g~~~ 295 (832)
+|. .|+
T Consensus 64 ----------------dGk----------------------------------------------------------rFA 69 (1081)
T KOG1538|consen 64 ----------------DGK----------------------------------------------------------RFA 69 (1081)
T ss_pred ----------------CCc----------------------------------------------------------eec
Confidence 111 244
Q ss_pred ccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
++..|..|.||.-.-. ..++ .|+..|.||.|+|-...|||+|-.. +-+|..... ++.+ +
T Consensus 70 SG~aDK~VI~W~~klE---G~LkYSH~D~IQCMsFNP~~h~LasCsLsd--FglWS~~qK------------~V~K-~-- 129 (1081)
T KOG1538|consen 70 SGSADKSVIIWTSKLE---GILKYSHNDAIQCMSFNPITHQLASCSLSD--FGLWSPEQK------------SVSK-H-- 129 (1081)
T ss_pred cCCCceeEEEeccccc---ceeeeccCCeeeEeecCchHHHhhhcchhh--ccccChhhh------------hHHh-h--
Confidence 5567889999975432 3344 7999999999999999999999763 678876442 1111 1
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
| ..++|.+++|..||++||.|-.+|||.|-.
T Consensus 130 -k--ss~R~~~CsWtnDGqylalG~~nGTIsiRN 160 (1081)
T KOG1538|consen 130 -K--SSSRIICCSWTNDGQYLALGMFNGTISIRN 160 (1081)
T ss_pred -h--hheeEEEeeecCCCcEEEEeccCceEEeec
Confidence 1 235699999999999999999999999863
No 200
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.47 E-value=8.7e-06 Score=97.99 Aligned_cols=103 Identities=17% Similarity=0.246 Sum_probs=79.5
Q ss_pred cccccCCCCeEEEEECCCC---cEEEEeccCCCC--eEEEEEcCCCCE-EEEEEcCCCEEEEEeCCCCCCCCCCccCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSK---NVIAQFRAHKSP--ISALCFDPSGIL-LVTASVQGHNINIFKIIPGILGTSSACDAGT 366 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~---~~l~~~~aH~~p--Is~LaFSPdG~l-LATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~ 366 (832)
.++.+-.||.|++||.+.. ..+...+.|+.. |..+.|.+.|.- |++||.+|. |.+||++.....+
T Consensus 1223 ~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~-I~~~DlR~~~~e~-------- 1293 (1387)
T KOG1517|consen 1223 IIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGD-IQLLDLRMSSKET-------- 1293 (1387)
T ss_pred eEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCCe-EEEEecccCcccc--------
Confidence 3566778999999998753 467888999877 999999998876 999999998 9999998731100
Q ss_pred ceeEEEEEec--cCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 367 SYVHLYRLQR--GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 367 ~~~~l~~l~r--G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
..-...++ |. ...++...++...+|+||. +.|+||++.
T Consensus 1294 --~~~iv~~~~yGs---~lTal~VH~hapiiAsGs~-q~ikIy~~~ 1333 (1387)
T KOG1517|consen 1294 --FLTIVAHWEYGS---ALTALTVHEHAPIIASGSA-QLIKIYSLS 1333 (1387)
T ss_pred --cceeeeccccCc---cceeeeeccCCCeeeecCc-ceEEEEecC
Confidence 01111122 42 3678889999999999999 999999995
No 201
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=98.47 E-value=1e-05 Score=84.20 Aligned_cols=111 Identities=13% Similarity=0.272 Sum_probs=77.3
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEE-cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCF-DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaF-SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+..|+.++-||+++++...+|++|+..|.++.- +.+|+ +.||+.||+ +||||++++.+-..-. ...+ ..+-
T Consensus 132 AgGD~~~y~~dlE~G~i~r~~rGHtDYvH~vv~R~~~~q-ilsG~EDGt-vRvWd~kt~k~v~~ie-----~yk~-~~~l 203 (325)
T KOG0649|consen 132 AGGDGVIYQVDLEDGRIQREYRGHTDYVHSVVGRNANGQ-ILSGAEDGT-VRVWDTKTQKHVSMIE-----PYKN-PNLL 203 (325)
T ss_pred ecCCeEEEEEEecCCEEEEEEcCCcceeeeeeecccCcc-eeecCCCcc-EEEEeccccceeEEec-----cccC-hhhc
Confidence 457899999999999999999999999999998 55555 779999998 8999999872110000 0000 1122
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
|-+....|-+++-+. .||++|.. ..+-||.+........|
T Consensus 204 Rp~~g~wigala~~e--dWlvCGgG-p~lslwhLrsse~t~vf 243 (325)
T KOG0649|consen 204 RPDWGKWIGALAVNE--DWLVCGGG-PKLSLWHLRSSESTCVF 243 (325)
T ss_pred CcccCceeEEEeccC--ceEEecCC-CceeEEeccCCCceEEE
Confidence 333333466666554 59877754 46789999877665544
No 202
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.45 E-value=1.4e-06 Score=98.44 Aligned_cols=61 Identities=11% Similarity=0.324 Sum_probs=54.7
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
+++.-..||.++|+|+.+.+++..++..-+...|++|||||+|||||+.|. .+.||.+...
T Consensus 304 ~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDD-LVtVwSf~er 364 (636)
T KOG2394|consen 304 YLATVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDD-LVTVWSFEER 364 (636)
T ss_pred eEEEEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCcc-eEEEEEeccc
Confidence 456668899999999999999999988888999999999999999999995 6999998764
No 203
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.44 E-value=0.00058 Score=76.61 Aligned_cols=101 Identities=15% Similarity=0.173 Sum_probs=70.3
Q ss_pred CeEEEEECCC-----CcEEEEecc-------CCCCeEEEEEcCCCCEEEEEEc---------CCCEEEEEeCCCCCCCCC
Q 003310 301 GMVIVRDIVS-----KNVIAQFRA-------HKSPISALCFDPSGILLVTASV---------QGHNINIFKIIPGILGTS 359 (832)
Q Consensus 301 G~V~IwDl~s-----~~~l~~~~a-------H~~pIs~LaFSPdG~lLATaS~---------dGt~I~Iwdi~t~~~~~~ 359 (832)
|.|.+.|+.. .+.+..+.. ..+.+.-++|+|||++|..+.. .++.|.++|..++
T Consensus 215 G~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~----- 289 (352)
T TIGR02658 215 GKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTG----- 289 (352)
T ss_pred CeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCC-----
Confidence 9999999644 333333321 1233445999999998887542 1245888898876
Q ss_pred CccCCCCceeEEEEEeccCccccEEEEEEccCCC-EEEEEe-CCCcEEEEecCCCCCceee
Q 003310 360 SACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN-WIMISS-SRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 360 s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~-~LAsgS-~DgTVhIwdl~~~g~~~~~ 418 (832)
+.+.++.-| ..++.|+||||++ +|.+.+ .+++|.|+|+.+.+...++
T Consensus 290 ---------kvi~~i~vG---~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~~i 338 (352)
T TIGR02658 290 ---------KRLRKIELG---HEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELSSV 338 (352)
T ss_pred ---------eEEEEEeCC---CceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEeee
Confidence 555555444 3589999999999 887776 5899999999876555444
No 204
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.44 E-value=2.6e-05 Score=89.68 Aligned_cols=93 Identities=20% Similarity=0.245 Sum_probs=62.0
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-CCEEEEE--eCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ-GHNINIF--KIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-Gt~I~Iw--di~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
..|.+||+.+++. ..+..|.......+|+|||+.|+.++.. |. ..|| |+.++ ....+ .+. +
T Consensus 272 ~~Iy~~d~~~g~~-~~lt~~~~~~~~~~~spDG~~l~f~sd~~g~-~~iy~~dl~~g------------~~~~l-t~~-g 335 (433)
T PRK04922 272 PEIYVMDLGSRQL-TRLTNHFGIDTEPTWAPDGKSIYFTSDRGGR-PQIYRVAASGG------------SAERL-TFQ-G 335 (433)
T ss_pred ceEEEEECCCCCe-EECccCCCCccceEECCCCCEEEEEECCCCC-ceEEEEECCCC------------CeEEe-ecC-C
Confidence 3689999988865 4566666556788999999999887754 43 3455 54443 11222 221 2
Q ss_pred CccccEEEEEEccCCCEEEEEeCCC---cEEEEecCCC
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRG---TSHLFAINPL 412 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~Dg---TVhIwdl~~~ 412 (832)
.....++|||||++|+..+.++ .|.+|++...
T Consensus 336 ---~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g 370 (433)
T PRK04922 336 ---NYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTG 370 (433)
T ss_pred ---CCccCEEECCCCCEEEEEECCCCceeEEEEECCCC
Confidence 1244689999999999876543 5888888653
No 205
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.43 E-value=4.5e-05 Score=87.78 Aligned_cols=93 Identities=20% Similarity=0.226 Sum_probs=61.3
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC--CCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI--IPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi--~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.+||+.+++.. .+..+...+...+|+|||+.|+.++.++...+||.+ ..+ ....+ .. .+
T Consensus 268 ~I~~~d~~tg~~~-~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g------------~~~~l-t~-~~-- 330 (429)
T PRK03629 268 NLYVMDLASGQIR-QVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGG------------APQRI-TW-EG-- 330 (429)
T ss_pred EEEEEECCCCCEE-EccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCC------------CeEEe-ec-CC--
Confidence 4889999888764 444444567889999999999887765433567754 332 12222 11 11
Q ss_pred cccEEEEEEccCCCEEEEEeCCC---cEEEEecCCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRG---TSHLFAINPL 412 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~Dg---TVhIwdl~~~ 412 (832)
....+.+|||||++|+..+.++ .|.+||+...
T Consensus 331 -~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g 365 (429)
T PRK03629 331 -SQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATG 365 (429)
T ss_pred -CCccCEEECCCCCEEEEEEccCCCceEEEEECCCC
Confidence 1256789999999998876543 4667777553
No 206
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.40 E-value=7.1e-05 Score=86.09 Aligned_cols=91 Identities=12% Similarity=0.176 Sum_probs=60.3
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-CC-EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ-GH-NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-Gt-~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.+||+.+++. ..+..|.......+|+|||+.|+.++.. |. .|.+||+..+ ..+.+. .+
T Consensus 271 ~Iy~~d~~~~~~-~~Lt~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~------------~~~~lt---~~-- 332 (435)
T PRK05137 271 DIYTMDLRSGTT-TRLTDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGS------------NPRRIS---FG-- 332 (435)
T ss_pred eEEEEECCCCce-EEccCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCC------------CeEEee---cC--
Confidence 588889988765 4566666667789999999999887753 32 3566676543 122221 11
Q ss_pred cccEEEEEEccCCCEEEEEeCCC---cEEEEecC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRG---TSHLFAIN 410 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~Dg---TVhIwdl~ 410 (832)
...+...+|||||++||..+.++ .|.+|++.
T Consensus 333 ~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~ 366 (435)
T PRK05137 333 GGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPD 366 (435)
T ss_pred CCcccCeEECCCCCEEEEEEcCCCceEEEEEECC
Confidence 12356688999999999877643 45666653
No 207
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism is unclear.
Probab=98.39 E-value=0.00015 Score=82.06 Aligned_cols=84 Identities=18% Similarity=0.283 Sum_probs=55.4
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCC--EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGH--NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt--~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.++|+.+++. ..+..+...+..++|+|||++|+.++.++. .|.+||+.++ . .. .+..+
T Consensus 303 ~iy~~d~~~~~~-~~l~~~~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~------------~-~~--~l~~~-- 364 (417)
T TIGR02800 303 QIYMMDADGGEV-RRLTFRGGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGG------------G-ER--VLTDT-- 364 (417)
T ss_pred eEEEEECCCCCE-EEeecCCCCccCeEECCCCCEEEEEEccCCceEEEEEeCCCC------------C-eE--EccCC--
Confidence 577788877654 334444556778899999999998887651 3667777654 1 11 12111
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcE
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTS 404 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTV 404 (832)
......+|+|||++|+..+.++..
T Consensus 365 -~~~~~p~~spdg~~l~~~~~~~~~ 388 (417)
T TIGR02800 365 -GLDESPSFAPNGRMILYATTRGGR 388 (417)
T ss_pred -CCCCCceECCCCCEEEEEEeCCCc
Confidence 123456899999999988887643
No 208
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.38 E-value=1.1e-05 Score=83.08 Aligned_cols=51 Identities=22% Similarity=0.358 Sum_probs=41.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-----CCEEEEEeCC
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ-----GHNINIFKII 352 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-----Gt~I~Iwdi~ 352 (832)
.|.|.+||+.+.+.+.+++.. .++.++|||||++|+|+... .+-++||+..
T Consensus 124 ~G~l~~wd~~~~~~i~~~~~~--~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 124 NGDLEFWDVRKKKKISTFEHS--DATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred CcEEEEEECCCCEEeeccccC--cEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 478999999999998887643 47899999999999998753 1337899874
No 209
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=98.38 E-value=7.2e-05 Score=82.69 Aligned_cols=93 Identities=23% Similarity=0.343 Sum_probs=63.6
Q ss_pred eEEEEECC-CCcEEEEeccCCCCeEEEEEcC------------------CCCEEEEEEcCCCEEEEEeCCCCCCCCCCcc
Q 003310 302 MVIVRDIV-SKNVIAQFRAHKSPISALCFDP------------------SGILLVTASVQGHNINIFKIIPGILGTSSAC 362 (832)
Q Consensus 302 ~V~IwDl~-s~~~l~~~~aH~~pIs~LaFSP------------------dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~ 362 (832)
+.++++-. .+++++.+..-..++.++.|+| -+..+|.|..+ .+.|||..+.
T Consensus 262 ~tYvfsrk~l~rP~~~lp~~~k~~lavr~~pVy~elrp~~~~~~~~~lpyrlvfaiAt~~--svyvydtq~~-------- 331 (434)
T KOG1009|consen 262 TSYVFSRKDLKRPAARLPSPKKPALAVRFSPVYYELRPLSSEKFLFVLPYRLVFAIATKN--SVYVYDTQTL-------- 331 (434)
T ss_pred eeEeeccccccCceeecCCCCcceEEEEeeeeEEEeccccccccccccccceEEEEeecc--eEEEeccccc--------
Confidence 34455433 2467777777777777777765 33456777776 3789998764
Q ss_pred CCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 363 DAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 363 ~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
..++.+ -+.+-+.|.++|||+||..|+.+|.||-+-+-.+++
T Consensus 332 ------~P~~~v-~nihy~~iTDiaws~dg~~l~vSS~DGyCS~vtfe~ 373 (434)
T KOG1009|consen 332 ------EPLAVV-DNIHYSAITDIAWSDDGSVLLVSSTDGFCSLVTFEP 373 (434)
T ss_pred ------cceEEE-eeeeeeeecceeecCCCcEEEEeccCCceEEEEEcc
Confidence 233333 133445699999999999999999999887766654
No 210
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=98.37 E-value=8.2e-07 Score=66.63 Aligned_cols=39 Identities=28% Similarity=0.653 Sum_probs=36.5
Q ss_pred CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 311 KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 311 ~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
++++.+|++|.++|.+|+|+|++.+|||++.|++ |+|||
T Consensus 1 g~~~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~-i~vwd 39 (39)
T PF00400_consen 1 GKCVRTFRGHSSSINSIAWSPDGNFLASGSSDGT-IRVWD 39 (39)
T ss_dssp EEEEEEEESSSSSEEEEEEETTSSEEEEEETTSE-EEEEE
T ss_pred CeEEEEEcCCCCcEEEEEEecccccceeeCCCCE-EEEEC
Confidence 3578999999999999999999999999999987 99997
No 211
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL; EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=98.37 E-value=0.0013 Score=73.56 Aligned_cols=85 Identities=18% Similarity=0.395 Sum_probs=59.8
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe-CC
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS-SR 401 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS-~D 401 (832)
....|+++|||++|..+......|-+|++... .+....+..+.-+ ......++|+|||++|+++. .+
T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~----------~g~l~~~~~~~~~--G~~Pr~~~~s~~g~~l~Va~~~s 313 (345)
T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPA----------TGTLTLVQTVPTG--GKFPRHFAFSPDGRYLYVANQDS 313 (345)
T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTT----------TTTEEEEEEEEES--SSSEEEEEE-TTSSEEEEEETTT
T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecC----------CCceEEEEEEeCC--CCCccEEEEeCCCCEEEEEecCC
Confidence 57889999999999888877777999999542 0122333333321 11268999999999999887 46
Q ss_pred CcEEEEecCCCCCceeec
Q 003310 402 GTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 402 gTVhIwdl~~~g~~~~~~ 419 (832)
++|.+|++....|.....
T Consensus 314 ~~v~vf~~d~~tG~l~~~ 331 (345)
T PF10282_consen 314 NTVSVFDIDPDTGKLTPV 331 (345)
T ss_dssp TEEEEEEEETTTTEEEEE
T ss_pred CeEEEEEEeCCCCcEEEe
Confidence 799999998766765554
No 212
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.35 E-value=4.6e-05 Score=87.56 Aligned_cols=95 Identities=13% Similarity=0.152 Sum_probs=60.5
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA 381 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a 381 (832)
.|.++|+.++. +..+..|...+...+|+|||+.|+..+..+....||.+... .+..+.+ .+. +.
T Consensus 265 ~Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~----------~g~~~~l-t~~-g~--- 328 (427)
T PRK02889 265 QIYTVNADGSG-LRRLTQSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPAS----------GGAAQRV-TFT-GS--- 328 (427)
T ss_pred eEEEEECCCCC-cEECCCCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECC----------CCceEEE-ecC-CC---
Confidence 35555666554 45555565556778999999998877764333678876432 0012222 222 21
Q ss_pred cEEEEEEccCCCEEEEEeCCC---cEEEEecCCC
Q 003310 382 VIQDISFSDDSNWIMISSSRG---TSHLFAINPL 412 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~Dg---TVhIwdl~~~ 412 (832)
.....+|||||++||..+.++ .|.+|++...
T Consensus 329 ~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g 362 (427)
T PRK02889 329 YNTSPRISPDGKLLAYISRVGGAFKLYVQDLATG 362 (427)
T ss_pred CcCceEECCCCCEEEEEEccCCcEEEEEEECCCC
Confidence 134578999999999888765 5889998664
No 213
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=98.35 E-value=9.2e-06 Score=91.93 Aligned_cols=81 Identities=15% Similarity=0.101 Sum_probs=61.3
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
+++++.|-..+|||-. |+.+..-.+|..||++++|+|| +++|.+|.. + .| +..
T Consensus 201 I~sgGED~kfKvWD~~-G~~Lf~S~~~ey~ITSva~npd-~~~~v~S~n-t-~R---~~~-------------------- 253 (737)
T KOG1524|consen 201 IASGGEDFRFKIWDAQ-GANLFTSAAEEYAITSVAFNPE-KDYLLWSYN-T-AR---FSS-------------------- 253 (737)
T ss_pred eeecCCceeEEeeccc-CcccccCChhccceeeeeeccc-cceeeeeee-e-ee---ecC--------------------
Confidence 4566788899999976 4566666799999999999999 888877754 2 33 111
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEE
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSH 405 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVh 405 (832)
. ....|..++||+||..++.|+..|-+.
T Consensus 254 p----~~GSifnlsWS~DGTQ~a~gt~~G~v~ 281 (737)
T KOG1524|consen 254 P----RVGSIFNLSWSADGTQATCGTSTGQLI 281 (737)
T ss_pred C----CccceEEEEEcCCCceeeccccCceEE
Confidence 1 112488999999999999999988653
No 214
>PRK00178 tolB translocation protein TolB; Provisional
Probab=98.34 E-value=0.00045 Score=79.10 Aligned_cols=51 Identities=12% Similarity=0.123 Sum_probs=36.7
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEe--C--CEEEEEECCCCce
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQ--A--AQVHCFDAATLEI 167 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~--~--~~I~vwDl~t~~~ 167 (832)
..|.+||+.+++......++..+.+.+|+ ++.|+.+. + ..|++||+.+++.
T Consensus 223 ~~l~~~~l~~g~~~~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~ 279 (430)
T PRK00178 223 PRIFVQNLDTGRREQITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQL 279 (430)
T ss_pred CEEEEEECCCCCEEEccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCCe
Confidence 46999999999865544566666678887 46676533 2 2799999998764
No 215
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.31 E-value=3.3e-05 Score=84.35 Aligned_cols=101 Identities=16% Similarity=0.238 Sum_probs=72.3
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
+..|+|||..++..+....--.+.++-|+|||||.+|..|..|+. +++|+.... +. ...+.+-
T Consensus 217 sssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~dav-frlw~e~q~--wt----------~erw~lg---- 279 (445)
T KOG2139|consen 217 SSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAV-FRLWQENQS--WT----------KERWILG---- 279 (445)
T ss_pred cceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEecccce-eeeehhccc--ce----------ecceecc----
Confidence 457999999998765554444567999999999999999999986 999965432 00 1112332
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
...|+..+|+|+|++|..+... .-.||.+.-.+....+
T Consensus 280 sgrvqtacWspcGsfLLf~~sg-sp~lysl~f~~~~~~~ 317 (445)
T KOG2139|consen 280 SGRVQTACWSPCGSFLLFACSG-SPRLYSLTFDGEDSVF 317 (445)
T ss_pred CCceeeeeecCCCCEEEEEEcC-CceEEEEeecCCCccc
Confidence 2369999999999998877654 4568888655554444
No 216
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=98.28 E-value=5.8e-06 Score=89.48 Aligned_cols=105 Identities=17% Similarity=0.250 Sum_probs=75.2
Q ss_pred ccCCCCeEEEEECCCCcEEEEec--cCC-CCeEEEEEcCCCCEEEEEEcCC---CEEEEEeCCCCCCCCCCccCCCCcee
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFR--AHK-SPISALCFDPSGILLVTASVQG---HNINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~--aH~-~pIs~LaFSPdG~lLATaS~dG---t~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
++..||+|++||+++...++.+. +|. .+..|++.+-.+..++++...- -.+.+||++... .
T Consensus 89 s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~q-------------q 155 (376)
T KOG1188|consen 89 SCSSDGTVRLWDIRSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQ-------------Q 155 (376)
T ss_pred EeccCCeEEEEEeecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEecccc-------------c
Confidence 35678999999999876555544 665 4677777777889999886532 247899998751 1
Q ss_pred EEEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCC
Q 003310 370 HLYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
.+..+... +.-.|++|.|.| |-..|++||.||-|-|||++....
T Consensus 156 ~l~~~~eS-H~DDVT~lrFHP~~pnlLlSGSvDGLvnlfD~~~d~E 200 (376)
T KOG1188|consen 156 LLRQLNES-HNDDVTQLRFHPSDPNLLLSGSVDGLVNLFDTKKDNE 200 (376)
T ss_pred hhhhhhhh-ccCcceeEEecCCCCCeEEeecccceEEeeecCCCcc
Confidence 11122111 234599999999 678999999999999999976533
No 217
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.28 E-value=0.00057 Score=79.25 Aligned_cols=51 Identities=16% Similarity=0.158 Sum_probs=34.7
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEe--CC--EEEEEECCCCce
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQ--AA--QVHCFDAATLEI 167 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~--~~--~I~vwDl~t~~~ 167 (832)
..|.+||+.+++......++....+..++ ++.|+.+. ++ +|+++|+.+++.
T Consensus 242 ~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~ 298 (448)
T PRK04792 242 AEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKAL 298 (448)
T ss_pred cEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCe
Confidence 46999999998864433455555677887 46666532 33 599999988763
No 218
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.24 E-value=8.5e-06 Score=96.02 Aligned_cols=99 Identities=20% Similarity=0.296 Sum_probs=79.0
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
++..+-.+.+|.+.++..++.+.+|..++..|.|.|=.. ...+|+.||. +.|||+-.+ ...+.|.
T Consensus 370 ~ar~~~~~~vwnl~~g~l~H~l~ghsd~~yvLd~Hpfn~ri~msag~dgs-t~iwdi~eg------------~pik~y~- 435 (1113)
T KOG0644|consen 370 TARNDHRLCVWNLYTGQLLHNLMGHSDEVYVLDVHPFNPRIAMSAGYDGS-TIIWDIWEG------------IPIKHYF- 435 (1113)
T ss_pred eeeeeeEeeeeecccchhhhhhcccccceeeeeecCCCcHhhhhccCCCc-eEeeecccC------------Ccceeee-
Confidence 345566788999999999999999999999999999555 4558999998 569999876 1234444
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.| ...+.+.+||+||+.++..-+.|-+.|+....
T Consensus 436 -~g--h~kl~d~kFSqdgts~~lsd~hgql~i~g~gq 469 (1113)
T KOG0644|consen 436 -IG--HGKLVDGKFSQDGTSIALSDDHGQLYILGTGQ 469 (1113)
T ss_pred -cc--cceeeccccCCCCceEecCCCCCceEEeccCC
Confidence 34 34688999999999999998889998876543
No 219
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.20 E-value=0.00043 Score=78.34 Aligned_cols=183 Identities=15% Similarity=0.130 Sum_probs=114.0
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEE-EEEEcC--CEEEE-EeCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIY-SVRCSS--RVVAI-CQAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~-sV~~S~--r~LAV-a~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
+.|.+.|..+.+.+.++.....+. .+.+++ +++.| ..++.|.++|+.+++.+.++.....
T Consensus 16 ~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~---------------- 79 (369)
T PF02239_consen 16 GSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGN---------------- 79 (369)
T ss_dssp TEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSE----------------
T ss_pred CEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCC----------------
Confidence 789999999999999999765554 456663 66655 5688999999999999888865321
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
++-+|++. +|+.+ +
T Consensus 80 ----~~~i~~s~---------------------------DG~~~-----------~------------------------ 93 (369)
T PF02239_consen 80 ----PRGIAVSP---------------------------DGKYV-----------Y------------------------ 93 (369)
T ss_dssp ----EEEEEE-----------------------------TTTEE-----------E------------------------
T ss_pred ----cceEEEcC---------------------------CCCEE-----------E------------------------
Confidence 12233331 22222 0
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccC-------CCCeEEEEEcCCCCEEEEEEcCCCE
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAH-------KSPISALCFDPSGILLVTASVQGHN 345 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH-------~~pIs~LaFSPdG~lLATaS~dGt~ 345 (832)
++...++.|.|+|..+.+++.++... ...+.+|..+|....++.+-.|...
T Consensus 94 ----------------------v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~ 151 (369)
T PF02239_consen 94 ----------------------VANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGE 151 (369)
T ss_dssp ----------------------EEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTE
T ss_pred ----------------------EEecCCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCe
Confidence 01123468999999999999988743 3468899999999988877777544
Q ss_pred EEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE-eCCCcEEEEecCCCCCceee
Q 003310 346 INIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS-SSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 346 I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg-S~DgTVhIwdl~~~g~~~~~ 418 (832)
|-+.|.... .......+..|. ...+..|+||+++++++ ..+..+-++|..+.+....+
T Consensus 152 I~vVdy~d~------------~~~~~~~i~~g~---~~~D~~~dpdgry~~va~~~sn~i~viD~~~~k~v~~i 210 (369)
T PF02239_consen 152 IWVVDYSDP------------KNLKVTTIKVGR---FPHDGGFDPDGRYFLVAANGSNKIAVIDTKTGKLVALI 210 (369)
T ss_dssp EEEEETTTS------------SCEEEEEEE--T---TEEEEEE-TTSSEEEEEEGGGTEEEEEETTTTEEEEEE
T ss_pred EEEEEeccc------------cccceeeecccc---cccccccCcccceeeecccccceeEEEeeccceEEEEe
Confidence 555565543 112222333332 36789999999987664 55778999998875443333
No 220
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.18 E-value=5.7e-07 Score=105.57 Aligned_cols=95 Identities=22% Similarity=0.383 Sum_probs=83.0
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
.++.+|-.|+||..++..+++..++|.+.|+.++.+....++|+||.| .+|++|.+..+ ..+ .+
T Consensus 206 itgsdd~lvKiwS~et~~~lAs~rGhs~ditdlavs~~n~~iaaaS~D-~vIrvWrl~~~--------------~pv-sv 269 (1113)
T KOG0644|consen 206 ITGSDDRLVKIWSMETARCLASCRGHSGDITDLAVSSNNTMIAAASND-KVIRVWRLPDG--------------APV-SV 269 (1113)
T ss_pred eecCccceeeeeeccchhhhccCCCCccccchhccchhhhhhhhcccC-ceEEEEecCCC--------------chH-HH
Confidence 456788899999999999999999999999999999999999999999 46999999886 222 33
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
-||++.+ |+.|+|||- ++++.|||+++||..
T Consensus 270 Lrghtga-vtaiafsP~----~sss~dgt~~~wd~r 300 (1113)
T KOG0644|consen 270 LRGHTGA-VTAIAFSPR----ASSSDDGTCRIWDAR 300 (1113)
T ss_pred Hhccccc-eeeeccCcc----ccCCCCCceEecccc
Confidence 4788765 999999995 488999999999986
No 221
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=98.17 E-value=2.3e-05 Score=89.51 Aligned_cols=57 Identities=23% Similarity=0.402 Sum_probs=53.1
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCC
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIP 353 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t 353 (832)
++..+++|+++|..++++++...+|...+++++|.|+|-+|++++.||. +++|.+..
T Consensus 506 ~~hed~~Ir~~dn~~~~~l~s~~a~~~svtslai~~ng~~l~s~s~d~s-v~l~kld~ 562 (577)
T KOG0642|consen 506 TAHEDRSIRFFDNKTGKILHSMVAHKDSVTSLAIDPNGPYLMSGSHDGS-VRLWKLDV 562 (577)
T ss_pred ecccCCceecccccccccchheeeccceecceeecCCCceEEeecCCce-eehhhccc
Confidence 4678899999999999999999999999999999999999999999998 89998754
No 222
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.14 E-value=2.3e-05 Score=88.51 Aligned_cols=105 Identities=21% Similarity=0.358 Sum_probs=78.5
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
...+|.|.|.|..+.+++..++....+-..++|+|||++|.+++.||. |.++|+.+. +.+.+++-
T Consensus 12 ~~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~-vsviD~~~~--------------~~v~~i~~ 76 (369)
T PF02239_consen 12 ERGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGT-VSVIDLATG--------------KVVATIKV 76 (369)
T ss_dssp EGGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSE-EEEEETTSS--------------SEEEEEE-
T ss_pred ecCCCEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCe-EEEEECCcc--------------cEEEEEec
Confidence 346789999999999999999976555556889999999999999986 899999886 45566665
Q ss_pred cCccccEEEEEEccCCCEEEEEe-CCCcEEEEecCCCCCceeec
Q 003310 377 GLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~~~g~~~~~~ 419 (832)
|.. -.++++|+||++++++. .++++.|+|.++......+.
T Consensus 77 G~~---~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~ 117 (369)
T PF02239_consen 77 GGN---PRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIP 117 (369)
T ss_dssp SSE---EEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE
T ss_pred CCC---cceEEEcCCCCEEEEEecCCCceeEeccccccceeecc
Confidence 542 57899999999999776 58999999998876555554
No 223
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=98.13 E-value=5.1e-06 Score=94.17 Aligned_cols=82 Identities=18% Similarity=0.296 Sum_probs=71.2
Q ss_pred EEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEcc--C
Q 003310 314 IAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSD--D 391 (832)
Q Consensus 314 l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSp--D 391 (832)
-+.|.+|++-|.||.|+.||.+||+||+|-+ +.|||.... +.++.++.||+ +.|.++.|=| .
T Consensus 43 E~eL~GH~GCVN~LeWn~dG~lL~SGSDD~r-~ivWd~~~~--------------KllhsI~TgHt-aNIFsvKFvP~tn 106 (758)
T KOG1310|consen 43 EAELTGHTGCVNCLEWNADGELLASGSDDTR-LIVWDPFEY--------------KLLHSISTGHT-ANIFSVKFVPYTN 106 (758)
T ss_pred hhhhccccceecceeecCCCCEEeecCCcce-EEeecchhc--------------ceeeeeecccc-cceeEEeeeccCC
Confidence 3678899999999999999999999999966 889998754 56667777876 5799999988 5
Q ss_pred CCEEEEEeCCCcEEEEecCC
Q 003310 392 SNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 392 g~~LAsgS~DgTVhIwdl~~ 411 (832)
.+.|++|..|..|||||+..
T Consensus 107 nriv~sgAgDk~i~lfdl~~ 126 (758)
T KOG1310|consen 107 NRIVLSGAGDKLIKLFDLDS 126 (758)
T ss_pred CeEEEeccCcceEEEEeccc
Confidence 67899999999999999985
No 224
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.11 E-value=7.8e-05 Score=87.84 Aligned_cols=104 Identities=14% Similarity=0.098 Sum_probs=73.0
Q ss_pred cccccCCCCeEEEEECC---CC-----cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCC
Q 003310 293 HFPDADNVGMVIVRDIV---SK-----NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDA 364 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~---s~-----~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~ 364 (832)
+|.-+...|.|.--.-. .. +.+..+..|.++|.++.|+|=+..+..++.|.+ ++||.-...
T Consensus 362 ~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~fls~gDW~-vriWs~~~~---------- 430 (555)
T KOG1587|consen 362 HFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNFLSVGDWT-VRIWSEDVI---------- 430 (555)
T ss_pred eEEEEcCCcEEEEEeccCCcccccccccccccccccCcceEeeecCCCccceeeeeccce-eEeccccCC----------
Confidence 45556677887762222 11 335577789999999999998887766666877 899986522
Q ss_pred CCceeEEEEEeccCccccEEEEEEccC-CCEEEEEeCCCcEEEEecCCC
Q 003310 365 GTSYVHLYRLQRGLTNAVIQDISFSDD-SNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 365 ~~~~~~l~~l~rG~t~a~I~~IaFSpD-g~~LAsgS~DgTVhIwdl~~~ 412 (832)
...++.+.+. ...|.+++|||- ...||++..||.+.|||+...
T Consensus 431 ---~~Pl~~~~~~--~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~ 474 (555)
T KOG1587|consen 431 ---ASPLLSLDSS--PDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQD 474 (555)
T ss_pred ---CCcchhhhhc--cceeeeeEEcCcCceEEEEEcCCCceehhhhhcc
Confidence 1233444332 233999999995 457888999999999999653
No 225
>PRK01029 tolB translocation protein TolB; Provisional
Probab=98.09 E-value=0.0016 Score=75.17 Aligned_cols=92 Identities=16% Similarity=0.207 Sum_probs=56.1
Q ss_pred EEEEECCC-CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-C-CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 303 VIVRDIVS-KNVIAQFRAHKSPISALCFDPSGILLVTASVQ-G-HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 303 V~IwDl~s-~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-G-t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
|.++++.. +.....+..+...+...+|||||++||..+.+ | ..|.+||+.++ ..+ .+..+
T Consensus 307 ly~~~~~~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g------------~~~---~Lt~~-- 369 (428)
T PRK01029 307 IYIMQIDPEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATG------------RDY---QLTTS-- 369 (428)
T ss_pred EEEEECcccccceEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCC------------CeE---EccCC--
Confidence 44444432 22234444444567788999999999876654 2 35888898765 112 22222
Q ss_pred cccEEEEEEccCCCEEEEEeC-CC--cEEEEecCC
Q 003310 380 NAVIQDISFSDDSNWIMISSS-RG--TSHLFAINP 411 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~-Dg--TVhIwdl~~ 411 (832)
...+.+.+|+|||++|+..+. ++ .+.++++..
T Consensus 370 ~~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~ 404 (428)
T PRK01029 370 PENKESPSWAIDSLHLVYSAGNSNESELYLISLIT 404 (428)
T ss_pred CCCccceEECCCCCEEEEEECCCCCceEEEEECCC
Confidence 123677999999999886544 33 456666654
No 226
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=98.09 E-value=0.00044 Score=78.33 Aligned_cols=92 Identities=17% Similarity=0.229 Sum_probs=61.0
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCC-C-EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQG-H-NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dG-t-~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.+||+.+++. ..+..|.......+|+|||+.|+.++.++ . .|.++|+..+ ... .+..+
T Consensus 259 ~i~~~d~~~~~~-~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~------------~~~---~l~~~-- 320 (417)
T TIGR02800 259 DIYVMDLDGKQL-TRLTNGPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGG------------EVR---RLTFR-- 320 (417)
T ss_pred cEEEEECCCCCE-EECCCCCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCC------------CEE---EeecC--
Confidence 588899887754 44455555556789999999988776543 2 3555565543 112 22111
Q ss_pred cccEEEEEEccCCCEEEEEeCCC---cEEEEecCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRG---TSHLFAINP 411 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~Dg---TVhIwdl~~ 411 (832)
...+..++|||||++|+.++.++ .|.+|++..
T Consensus 321 ~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~ 355 (417)
T TIGR02800 321 GGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDG 355 (417)
T ss_pred CCCccCeEECCCCCEEEEEEccCCceEEEEEeCCC
Confidence 12366789999999999998876 677777765
No 227
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.07 E-value=1.8e-05 Score=83.35 Aligned_cols=57 Identities=23% Similarity=0.333 Sum_probs=54.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
+++++-||.||||..++.++++.++-|...|.+|+|+||..++|.||.|++ |-+|++
T Consensus 266 lATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~lmAaaskD~r-ISLWkL 322 (323)
T KOG0322|consen 266 LATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCELMAAASKDAR-ISLWKL 322 (323)
T ss_pred EeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCchhhhccCCce-EEeeec
Confidence 567888999999999999999999999999999999999999999999988 899986
No 228
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.95 E-value=0.0017 Score=74.36 Aligned_cols=92 Identities=15% Similarity=0.189 Sum_probs=58.5
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-CC-EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ-GH-NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-Gt-~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.+||+.+++. ..+..+........|+|||+.|+..+.. |. .|.++|+.++ ....+ .+ .+.
T Consensus 268 ~Iy~~d~~~~~~-~~lt~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g------------~~~~l-t~-~~~- 331 (430)
T PRK00178 268 EIYVMDLASRQL-SRVTNHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGG------------RAERV-TF-VGN- 331 (430)
T ss_pred eEEEEECCCCCe-EEcccCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCC------------CEEEe-ec-CCC-
Confidence 588889988765 3455555556678999999988776653 33 3555566544 12222 22 121
Q ss_pred cccEEEEEEccCCCEEEEEeCC-C--cEEEEecCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSR-G--TSHLFAINP 411 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~D-g--TVhIwdl~~ 411 (832)
.....+|||||++|+..+.+ + .|.+||+.+
T Consensus 332 --~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~t 364 (430)
T PRK00178 332 --YNARPRLSADGKTLVMVHRQDGNFHVAAQDLQR 364 (430)
T ss_pred --CccceEECCCCCEEEEEEccCCceEEEEEECCC
Confidence 13457899999999988754 3 366677765
No 229
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=97.95 E-value=3e-05 Score=93.68 Aligned_cols=96 Identities=17% Similarity=0.196 Sum_probs=76.3
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
+..-|.|.+|+....+....+.+|.+.|.++.|+-||+++||+|+|-+ ||+|++.+..... ... -
T Consensus 151 gsv~~~iivW~~~~dn~p~~l~GHeG~iF~i~~s~dg~~i~s~SdDRs-iRlW~i~s~~~~~-------------~~~-f 215 (967)
T KOG0974|consen 151 GSVFGEIIVWKPHEDNKPIRLKGHEGSIFSIVTSLDGRYIASVSDDRS-IRLWPIDSREVLG-------------CTG-F 215 (967)
T ss_pred ccccccEEEEeccccCCcceecccCCceEEEEEccCCcEEEEEecCcc-eeeeecccccccC-------------ccc-c
Confidence 344578999998854444468899999999999999999999999965 9999998862111 011 2
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
|| .|+|+.++|.|. .+++++.|-|+++|+.+
T Consensus 216 gH-saRvw~~~~~~n--~i~t~gedctcrvW~~~ 246 (967)
T KOG0974|consen 216 GH-SARVWACCFLPN--RIITVGEDCTCRVWGVN 246 (967)
T ss_pred cc-cceeEEEEeccc--eeEEeccceEEEEEecc
Confidence 54 478999999998 89999999999999543
No 230
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.86 E-value=0.0007 Score=74.86 Aligned_cols=81 Identities=15% Similarity=0.223 Sum_probs=62.8
Q ss_pred cCCCCeEEEEECCCCcEEEE-eccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIAQ-FRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~-~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
++.-|.+..+|+..++.+.. |++-++.|++|-..|.+.+||+++-|.. +||||+.+. ..++..+
T Consensus 265 gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GLDRy-vRIhD~ktr--------------kll~kvY 329 (412)
T KOG3881|consen 265 GNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGLDRY-VRIHDIKTR--------------KLLHKVY 329 (412)
T ss_pred ecccchhheecccCceeeccccCCccCCcceEEEcCCCceEEeecccee-EEEeecccc--------------hhhhhhh
Confidence 45567889999999988777 8899999999999999999999999955 999999874 2333332
Q ss_pred ccCccccEEEEEEccCCCEE
Q 003310 376 RGLTNAVIQDISFSDDSNWI 395 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~L 395 (832)
-.+.+++|-|.++-++.
T Consensus 330 ---vKs~lt~il~~~~~n~e 346 (412)
T KOG3881|consen 330 ---VKSRLTFILLRDDVNIE 346 (412)
T ss_pred ---hhccccEEEecCCcccc
Confidence 22347778787765443
No 231
>KOG4415 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.85 E-value=8.7e-06 Score=81.33 Aligned_cols=32 Identities=22% Similarity=0.457 Sum_probs=30.1
Q ss_pred eeeeeeEEeecCC-CcccccCCeeEEEEeecCc
Q 003310 635 LYISEAELQMHPP-RIPLWAKPQIYFQSMMIKD 666 (832)
Q Consensus 635 ~ylS~aEvq~h~~-~~plW~~~~~~F~~m~~~~ 666 (832)
.||++|||.||.+ ||+|||+|||.|+.+...+
T Consensus 28 eWl~hVEi~Th~gPHRriWmGPQFef~eih~d~ 60 (247)
T KOG4415|consen 28 EWLPHVEIRTHLGPHRRIWMGPQFEFFEIHEDD 60 (247)
T ss_pred ccccceEEEeccCccceeeecCceeEEEecCCC
Confidence 7999999999999 9999999999999998754
No 232
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.84 E-value=0.0016 Score=75.46 Aligned_cols=93 Identities=18% Similarity=0.284 Sum_probs=58.1
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEE--eCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIF--KIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iw--di~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.|.++|+.+++. ..+..+.......+|+|||+.|+..+..+....|| |+.++ ....+ ++ .+.
T Consensus 287 ~Iy~~dl~tg~~-~~lt~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g------------~~~~L-t~-~g~- 350 (448)
T PRK04792 287 EIYVVDIATKAL-TRITRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASG------------KVSRL-TF-EGE- 350 (448)
T ss_pred EEEEEECCCCCe-EECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCC------------CEEEE-ec-CCC-
Confidence 588889887764 44555555667889999999888766542224455 55443 12222 22 222
Q ss_pred cccEEEEEEccCCCEEEEEeC-CCcEEEEecCCC
Q 003310 380 NAVIQDISFSDDSNWIMISSS-RGTSHLFAINPL 412 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~~ 412 (832)
.....+|||||++|+..+. ++..+||-+...
T Consensus 351 --~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~ 382 (448)
T PRK04792 351 --QNLGGSITPDGRSMIMVNRTNGKFNIARQDLE 382 (448)
T ss_pred --CCcCeeECCCCCEEEEEEecCCceEEEEEECC
Confidence 1345789999999988766 455667655433
No 233
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.80 E-value=0.00099 Score=76.62 Aligned_cols=194 Identities=14% Similarity=0.182 Sum_probs=120.2
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcc-
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTN- 380 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~- 380 (832)
.|.=.++..|+-+..|+...+++.++..++--.+||+|..+|. +..||-+.......... ...+-.. -|...
T Consensus 156 evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~-VEfwDpR~ksrv~~l~~-----~~~v~s~-pg~~~~ 228 (703)
T KOG2321|consen 156 EVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGV-VEFWDPRDKSRVGTLDA-----ASSVNSH-PGGDAA 228 (703)
T ss_pred ceEEEEccccccccccccccccceeeeecCccceEEecccCce-EEEecchhhhhheeeec-----ccccCCC-cccccc
Confidence 5777889999999999988899999999999999999999997 89999876411000000 0000001 12222
Q ss_pred ccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcccccCCcccccccCCCCCCCCCCCCcccccCCCCe
Q 003310 381 AVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFTTKHGAMAKSGVRWPPNLGLQMPNQQSLCASGPPV 460 (832)
Q Consensus 381 a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~~~~~~~~~~~~r~~~~s~~~~~~~~~l~~~~~p~ 460 (832)
..|.+|.|+.||-.+|+|++.|.|.||||... .+.-+.-|...++ . ..+.|.+. ..-+.
T Consensus 229 ~svTal~F~d~gL~~aVGts~G~v~iyDLRa~-~pl~~kdh~~e~p------i---------~~l~~~~~-----~~q~~ 287 (703)
T KOG2321|consen 229 PSVTALKFRDDGLHVAVGTSTGSVLIYDLRAS-KPLLVKDHGYELP------I---------KKLDWQDT-----DQQNK 287 (703)
T ss_pred CcceEEEecCCceeEEeeccCCcEEEEEcccC-CceeecccCCccc------e---------eeeccccc-----CCCce
Confidence 23999999999999999999999999999875 4555667755443 2 22233221 01121
Q ss_pred eeeeceEEEcCCCCCCccccccchhccCcccCCCcceeeeeeccCCCccccccCCcccccccEEEEcCCCcEEEEeeecc
Q 003310 461 TLSVVSRIRNGNNGWRGTVSGAAAAATGRVSSLSGAIASSFHNCKGNSETYAAGSSLKIKNHLLVFSPSGCMIQYALRIS 540 (832)
Q Consensus 461 ~ls~v~~I~~~~~~~~~~v~~~~~~a~g~~~~~~g~~~~~~h~~~~~~~~~~~~~~~~~~~~Llv~s~~G~l~~y~l~~~ 540 (832)
.++.=.+|-. .|+-. +||. .|+.=-.|....-|+.+++-+ +|++..++-|-+|- .|+
T Consensus 288 v~S~Dk~~~k---iWd~~--------~Gk~------~asiEpt~~lND~C~~p~sGm-----~f~Ane~~~m~~yy-iP~ 344 (703)
T KOG2321|consen 288 VVSMDKRILK---IWDEC--------TGKP------MASIEPTSDLNDFCFVPGSGM-----FFTANESSKMHTYY-IPS 344 (703)
T ss_pred EEecchHHhh---hcccc--------cCCc------eeeccccCCcCceeeecCCce-----EEEecCCCcceeEE-ccc
Confidence 1222223322 24322 2332 121111233445566667666 88888889998885 455
Q ss_pred CCCCcc
Q 003310 541 TGLDVT 546 (832)
Q Consensus 541 ~~~~~~ 546 (832)
.|+-|.
T Consensus 345 LGPaPr 350 (703)
T KOG2321|consen 345 LGPAPR 350 (703)
T ss_pred cCCCch
Confidence 555553
No 234
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.79 E-value=0.00027 Score=81.13 Aligned_cols=189 Identities=16% Similarity=0.248 Sum_probs=124.9
Q ss_pred CEEEEEECCCCcEEEEEeCC-CCEEEEEEcC--CEEEEEe-CCEEEEEECCCCceEEEEecCCCccCCCCCCCCCcccce
Q 003310 117 TVVHFYSLRSQSYVHMLKFR-SPIYSVRCSS--RVVAICQ-AAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGP 192 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~-s~V~sV~~S~--r~LAVa~-~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p 192 (832)
..|.-.+|..|..+..|... ..+..|.+|. .+||++. ++.|.+||.++-....+|....++...|+. +.
T Consensus 155 ~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~-------~~ 227 (703)
T KOG2321|consen 155 SEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGG-------DA 227 (703)
T ss_pred cceEEEEccccccccccccccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccccCCCccc-------cc
Confidence 34666789999998888875 6888999984 6788854 889999999987766666432221111110 00
Q ss_pred eeeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCC
Q 003310 193 LAVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPD 272 (832)
Q Consensus 193 ~Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~ 272 (832)
+.+..-|+|..
T Consensus 228 -~~svTal~F~d-------------------------------------------------------------------- 238 (703)
T KOG2321|consen 228 -APSVTALKFRD-------------------------------------------------------------------- 238 (703)
T ss_pred -cCcceEEEecC--------------------------------------------------------------------
Confidence 00000111110
Q ss_pred CcCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCC--CCeEEEEEcCCC--CEEEEEEcCCCEEEE
Q 003310 273 SQNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHK--SPISALCFDPSG--ILLVTASVQGHNINI 348 (832)
Q Consensus 273 ~~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~--~pIs~LaFSPdG--~lLATaS~dGt~I~I 348 (832)
+|. +++-+...|.|.|||+.+.+++.. +.|. .||..|.|-+.+ ..|+| .|.++++|
T Consensus 239 --------------~gL---~~aVGts~G~v~iyDLRa~~pl~~-kdh~~e~pi~~l~~~~~~~q~~v~S--~Dk~~~ki 298 (703)
T KOG2321|consen 239 --------------DGL---HVAVGTSTGSVLIYDLRASKPLLV-KDHGYELPIKKLDWQDTDQQNKVVS--MDKRILKI 298 (703)
T ss_pred --------------Cce---eEEeeccCCcEEEEEcccCCceee-cccCCccceeeecccccCCCceEEe--cchHHhhh
Confidence 010 122345678999999999988643 5554 589999998764 45555 45567999
Q ss_pred EeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 349 FKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 349 wdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
||-.++. ....+ .....|.++||=|++-++.++-..+.++.|=+..-|..+.+
T Consensus 299 Wd~~~Gk--------------~~asi---Ept~~lND~C~~p~sGm~f~Ane~~~m~~yyiP~LGPaPrW 351 (703)
T KOG2321|consen 299 WDECTGK--------------PMASI---EPTSDLNDFCFVPGSGMFFTANESSKMHTYYIPSLGPAPRW 351 (703)
T ss_pred cccccCC--------------ceeec---cccCCcCceeeecCCceEEEecCCCcceeEEccccCCCchh
Confidence 9987761 11111 12345999999999999999999999999988777665554
No 235
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.76 E-value=0.0005 Score=84.58 Aligned_cols=93 Identities=14% Similarity=0.279 Sum_probs=71.3
Q ss_pred CCcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEE
Q 003310 310 SKNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISF 388 (832)
Q Consensus 310 s~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaF 388 (832)
.|..++++..|...|..++.++. +.+++|||.||+ ||||++..-. +.. +..+...+..+ ...++.++.+
T Consensus 1037 ~G~lVAhL~Ehs~~v~k~a~s~~~~s~FvsgS~DGt-VKvW~~~k~~-~~~------~s~rS~ltys~--~~sr~~~vt~ 1106 (1431)
T KOG1240|consen 1037 RGILVAHLHEHSSAVIKLAVSSEHTSLFVSGSDDGT-VKVWNLRKLE-GEG------GSARSELTYSP--EGSRVEKVTM 1106 (1431)
T ss_pred cceEeehhhhccccccceeecCCCCceEEEecCCce-EEEeeehhhh-cCc------ceeeeeEEEec--cCCceEEEEe
Confidence 36789999999999999998775 589999999998 9999987641 110 11222222221 2345889999
Q ss_pred ccCCCEEEEEeCCCcEEEEecCCC
Q 003310 389 SDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 389 SpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+.|..+|.++.||.|+++++..+
T Consensus 1107 ~~~~~~~Av~t~DG~v~~~~id~~ 1130 (1431)
T KOG1240|consen 1107 CGNGDQFAVSTKDGSVRVLRIDHY 1130 (1431)
T ss_pred ccCCCeEEEEcCCCeEEEEEcccc
Confidence 999999999999999999999886
No 236
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.76 E-value=0.00033 Score=75.91 Aligned_cols=93 Identities=14% Similarity=0.264 Sum_probs=65.1
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc-----------------------------------C
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV-----------------------------------Q 342 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~-----------------------------------d 342 (832)
.-+-.|.||.+.+.+.. .++-....+..++|.|||++.|.++. |
T Consensus 111 eF~lriTVWSL~t~~~~-~~~~pK~~~kg~~f~~dg~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPd 189 (447)
T KOG4497|consen 111 EFDLRITVWSLNTQKGY-LLPHPKTNVKGYAFHPDGQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPD 189 (447)
T ss_pred cceeEEEEEEeccceeE-EecccccCceeEEECCCCceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCC
Confidence 34457888998887653 33323345677899999998888764 3
Q ss_pred CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 343 GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 343 Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
|..+-|||.--. ..+|..+||.. |..++|||.+++||+||.|+.++|.+
T Consensus 190 g~~laVwd~~Le--------------ykv~aYe~~lG---~k~v~wsP~~qflavGsyD~~lrvln 238 (447)
T KOG4497|consen 190 GNWLAVWDNVLE--------------YKVYAYERGLG---LKFVEWSPCNQFLAVGSYDQMLRVLN 238 (447)
T ss_pred CcEEEEecchhh--------------heeeeeeeccc---eeEEEeccccceEEeeccchhhhhhc
Confidence 445555554322 34455667642 88999999999999999999999843
No 237
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=97.75 E-value=0.00011 Score=79.35 Aligned_cols=119 Identities=19% Similarity=0.232 Sum_probs=85.8
Q ss_pred ccccCCCCeEEEEECCCCcEEEEecc--CCC---CeEEEEEcCCCCEEEEEEcCCCEEEEEeC-CCCCCCCCCccCCCCc
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRA--HKS---PISALCFDPSGILLVTASVQGHNINIFKI-IPGILGTSSACDAGTS 367 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~a--H~~---pIs~LaFSPdG~lLATaS~dGt~I~Iwdi-~t~~~~~~s~~~~~~~ 367 (832)
++....+.-|++||.-+++..+.+++ |.. ...+|+|+|||+.|..|-.. .|||||+ +++..-
T Consensus 126 ~a~ssr~~PIh~wdaftG~lraSy~~ydh~de~taAhsL~Fs~DGeqlfaGykr--cirvFdt~RpGr~c---------- 193 (406)
T KOG2919|consen 126 FAVSSRDQPIHLWDAFTGKLRASYRAYDHQDEYTAAHSLQFSPDGEQLFAGYKR--CIRVFDTSRPGRDC---------- 193 (406)
T ss_pred eeeccccCceeeeeccccccccchhhhhhHHhhhhheeEEecCCCCeEeecccc--eEEEeeccCCCCCC----------
Confidence 34455667799999999999888874 433 34579999999999987654 5999999 665210
Q ss_pred eeEEEEEe---ccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcc
Q 003310 368 YVHLYRLQ---RGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFT 426 (832)
Q Consensus 368 ~~~l~~l~---rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~ 426 (832)
...-++. -|. ...|.+++||| |.+.+|.+|--.++-||.-...+....+.+|.+.++
T Consensus 194 -~vy~t~~~~k~gq-~giisc~a~sP~~~~~~a~gsY~q~~giy~~~~~~pl~llggh~gGvT 254 (406)
T KOG2919|consen 194 -PVYTTVTKGKFGQ-KGIISCFAFSPMDSKTLAVGSYGQRVGIYNDDGRRPLQLLGGHGGGVT 254 (406)
T ss_pred -cchhhhhcccccc-cceeeeeeccCCCCcceeeecccceeeeEecCCCCceeeecccCCCee
Confidence 1101111 121 12388999999 677999999999999998887777778888866554
No 238
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.75 E-value=0.035 Score=64.01 Aligned_cols=49 Identities=12% Similarity=0.124 Sum_probs=36.4
Q ss_pred EEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEeC----CEEEEEECCCCc
Q 003310 118 VVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQA----AQVHCFDAATLE 166 (832)
Q Consensus 118 tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~~----~~I~vwDl~t~~ 166 (832)
.|.++|+.+|+...-..++..+....++ ++.|+...+ .+|+++|+.+++
T Consensus 214 ~Iyv~dl~tg~~~~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~ 268 (419)
T PRK04043 214 TLYKYNLYTGKKEKIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKT 268 (419)
T ss_pred EEEEEECCCCcEEEEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCc
Confidence 6899999999876655677777777787 456665432 479999998876
No 239
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=97.71 E-value=8.4e-05 Score=82.17 Aligned_cols=103 Identities=16% Similarity=0.151 Sum_probs=77.5
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
++...+.+|....+.+ ..+-+|-+-++.|+||||++.|.||+.|++ |||=..... .-+..+.-|
T Consensus 129 gD~~~~di~s~~~~~~-~~~lGhvSml~dVavS~D~~~IitaDRDEk-IRvs~ypa~--------------f~IesfclG 192 (390)
T KOG3914|consen 129 GDVYSFDILSADSGRC-EPILGHVSMLLDVAVSPDDQFIITADRDEK-IRVSRYPAT--------------FVIESFCLG 192 (390)
T ss_pred CCceeeeeecccccCc-chhhhhhhhhheeeecCCCCEEEEecCCce-EEEEecCcc--------------cchhhhccc
Confidence 4456677777766433 455699999999999999999999999998 787655321 233345557
Q ss_pred CccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
|+. -|..++.-++ +.|+++|.|+|+++||+..++...++
T Consensus 193 H~e-FVS~isl~~~-~~LlS~sGD~tlr~Wd~~sgk~L~t~ 231 (390)
T KOG3914|consen 193 HKE-FVSTISLTDN-YLLLSGSGDKTLRLWDITSGKLLDTC 231 (390)
T ss_pred cHh-heeeeeeccC-ceeeecCCCCcEEEEecccCCccccc
Confidence 653 4888888755 67899999999999999998776444
No 240
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.69 E-value=0.039 Score=61.73 Aligned_cols=108 Identities=19% Similarity=0.379 Sum_probs=67.2
Q ss_pred CeEEEEECCCCc--E--EEEeccC-CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE-
Q 003310 301 GMVIVRDIVSKN--V--IAQFRAH-KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL- 374 (832)
Q Consensus 301 G~V~IwDl~s~~--~--l~~~~aH-~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l- 374 (832)
..|.+|++.... + ...++.. ...-..|+|+|||+++.........|.+|++.+. . +....+...
T Consensus 166 D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~-~---------g~~~~~~~~~ 235 (345)
T PF10282_consen 166 DRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPS-D---------GSLTEIQTIS 235 (345)
T ss_dssp TEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETT-T---------TEEEEEEEEE
T ss_pred CEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeeccc-C---------CceeEEEEee
Confidence 368888886543 2 2333322 2334789999999999888887777999998732 0 112222222
Q ss_pred --eccCccc-cEEEEEEccCCCEEEEEeC-CCcEEEEecCCCCCceee
Q 003310 375 --QRGLTNA-VIQDISFSDDSNWIMISSS-RGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 375 --~rG~t~a-~I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~~g~~~~~ 418 (832)
..+.... ....|++||||++|.++.. ..+|.+|++.+..+...+
T Consensus 236 ~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~ 283 (345)
T PF10282_consen 236 TLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTL 283 (345)
T ss_dssp SCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEE
T ss_pred eccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEE
Confidence 1222211 3678999999999887764 678999999665555443
No 241
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.68 E-value=0.00015 Score=54.22 Aligned_cols=37 Identities=30% Similarity=0.516 Sum_probs=30.9
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
.+.++ +++. ..|.+|+|+|++++||+++.|++|+|||
T Consensus 3 ~~~~~-~~h~-~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 3 CVRTF-RGHS-SSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEE-ESSS-SSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EEEEE-cCCC-CcEEEEEEecccccceeeCCCCEEEEEC
Confidence 34455 4554 4599999999999999999999999997
No 242
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=97.67 E-value=0.00013 Score=79.81 Aligned_cols=77 Identities=23% Similarity=0.398 Sum_probs=62.7
Q ss_pred eccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEE
Q 003310 317 FRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIM 396 (832)
Q Consensus 317 ~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LA 396 (832)
-++| .+|++|++.+||+.|+|||.+...|.|||..++ ....|. .+|. .-+.-+.|||||.+|.
T Consensus 192 ~pgh-~pVtsmqwn~dgt~l~tAS~gsssi~iWdpdtg------------~~~pL~--~~gl--gg~slLkwSPdgd~lf 254 (445)
T KOG2139|consen 192 DPGH-NPVTSMQWNEDGTILVTASFGSSSIMIWDPDTG------------QKIPLI--PKGL--GGFSLLKWSPDGDVLF 254 (445)
T ss_pred CCCC-ceeeEEEEcCCCCEEeecccCcceEEEEcCCCC------------Cccccc--ccCC--CceeeEEEcCCCCEEE
Confidence 3466 699999999999999999999888999999887 122332 2332 2377899999999999
Q ss_pred EEeCCCcEEEEecC
Q 003310 397 ISSSRGTSHLFAIN 410 (832)
Q Consensus 397 sgS~DgTVhIwdl~ 410 (832)
+++-|++.+||..+
T Consensus 255 aAt~davfrlw~e~ 268 (445)
T KOG2139|consen 255 AATCDAVFRLWQEN 268 (445)
T ss_pred Eecccceeeeehhc
Confidence 99999999999654
No 243
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.65 E-value=0.062 Score=59.48 Aligned_cols=85 Identities=15% Similarity=0.285 Sum_probs=59.9
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc-EEEEEEccCCCEEEEEeC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSS 400 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~ 400 (832)
....+|..+|||++|..+...-..|-+|.+.+. + .+|--+.+-.+... -.+..|+++|++|+++..
T Consensus 244 ~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~--~-----------g~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q 310 (346)
T COG2706 244 NWAAAIHISPDGRFLYASNRGHDSIAVFSVDPD--G-----------GKLELVGITPTEGQFPRDFNINPSGRFLIAANQ 310 (346)
T ss_pred CceeEEEECCCCCEEEEecCCCCeEEEEEEcCC--C-----------CEEEEEEEeccCCcCCccceeCCCCCEEEEEcc
Confidence 457889999999999887665556888888764 0 11222212222222 468899999999999887
Q ss_pred C-CcEEEEecCCCCCceeec
Q 003310 401 R-GTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 401 D-gTVhIwdl~~~g~~~~~~ 419 (832)
+ .+|+||.+++..|..+..
T Consensus 311 ~sd~i~vf~~d~~TG~L~~~ 330 (346)
T COG2706 311 KSDNITVFERDKETGRLTLL 330 (346)
T ss_pred CCCcEEEEEEcCCCceEEec
Confidence 5 479999999887776653
No 244
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.63 E-value=0.086 Score=54.53 Aligned_cols=94 Identities=15% Similarity=0.229 Sum_probs=63.6
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCC----------e-EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCC
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSP----------I-SALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAG 365 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~p----------I-s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~ 365 (832)
+...|.|..+|+.+|+.+..++.+..+ + ..+.++ +| .+..++.+|..+.+ |+.++
T Consensus 128 ~~~~g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~v~~~~~~g~~~~~-d~~tg----------- 193 (238)
T PF13360_consen 128 GTSSGKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVIS-DG-RVYVSSGDGRVVAV-DLATG----------- 193 (238)
T ss_dssp EETCSEEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECC-TT-EEEEECCTSSEEEE-ETTTT-----------
T ss_pred EeccCcEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEE-CC-EEEEEcCCCeEEEE-ECCCC-----------
Confidence 344789999999999999888765432 1 233333 55 66666777775666 99887
Q ss_pred CceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 366 TSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 366 ~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
..+++.. .. .+.. ....++..|.+++.++++..||+.++
T Consensus 194 ---~~~w~~~--~~--~~~~-~~~~~~~~l~~~~~~~~l~~~d~~tG 232 (238)
T PF13360_consen 194 ---EKLWSKP--IS--GIYS-LPSVDGGTLYVTSSDGRLYALDLKTG 232 (238)
T ss_dssp ---EEEEEEC--SS---ECE-CEECCCTEEEEEETTTEEEEEETTTT
T ss_pred ---CEEEEec--CC--CccC-CceeeCCEEEEEeCCCEEEEEECCCC
Confidence 4445332 11 1222 24577888888889999999999875
No 245
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=97.63 E-value=6e-05 Score=87.13 Aligned_cols=90 Identities=16% Similarity=0.194 Sum_probs=70.9
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
+..+.|||...|..+.++++|...|.|++++.||+++|+|+.|.- +.||.-.-. -.|+..|+
T Consensus 32 g~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~-VI~W~~klE-----------------G~LkYSH~ 93 (1081)
T KOG1538|consen 32 GSRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKS-VIIWTSKLE-----------------GILKYSHN 93 (1081)
T ss_pred CCEEEEEeCCCcccccccccccceEEEEEEccCCceeccCCCcee-EEEeccccc-----------------ceeeeccC
Confidence 347999999999999999999999999999999999999999965 779975432 12333333
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
-.|+|+.|.|-+..||++|... .-+|..
T Consensus 94 -D~IQCMsFNP~~h~LasCsLsd-FglWS~ 121 (1081)
T KOG1538|consen 94 -DAIQCMSFNPITHQLASCSLSD-FGLWSP 121 (1081)
T ss_pred -CeeeEeecCchHHHhhhcchhh-ccccCh
Confidence 3499999999999999887542 334543
No 246
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.60 E-value=0.021 Score=70.93 Aligned_cols=113 Identities=15% Similarity=0.221 Sum_probs=65.6
Q ss_pred cCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcC---CCCEEEEEEc-CCCEEEEEeCCCCCCCCC-CccCCCCceeE
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDP---SGILLVTASV-QGHNINIFKIIPGILGTS-SACDAGTSYVH 370 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSP---dG~lLATaS~-dGt~I~Iwdi~t~~~~~~-s~~~~~~~~~~ 370 (832)
+...|.+.+||++=+.++...+ +|..+|..|+..| .....++++. -.+-+-+|++.++..... .+++.. ...
T Consensus 1213 Gts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~~~~~~S~~vs~~~~~~nevs~wn~~~g~~~~vl~~s~~~--p~l 1290 (1431)
T KOG1240|consen 1213 GTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWLCPTYPQESVSVSAGSSSNNEVSTWNMETGLRQTVLWASDGA--PIL 1290 (1431)
T ss_pred ecCCceEEEEEeecCceeecccCcccCCcceEEeeccCCCCceEEEecccCCCceeeeecccCcceEEEEcCCCC--cch
Confidence 3456889999999888887765 5557888777665 3356665555 334589999988721000 000000 000
Q ss_pred EEEEe----c-cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 371 LYRLQ----R-GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 371 l~~l~----r-G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
-+.+. | -..-+ ....++..-+.++.+|+.|+.|+.||....
T Consensus 1291 s~~~Ps~~~~kp~~~~-~~~~~~~~~~~~~ltggsd~kIR~wD~~~p 1336 (1431)
T KOG1240|consen 1291 SYALPSNDARKPDSLA-GISCGVCEKNGFLLTGGSDMKIRKWDPTRP 1336 (1431)
T ss_pred hhhcccccCCCCCccc-ceeeecccCCceeeecCCccceeeccCCCc
Confidence 00000 0 00001 223445555678999999999999999654
No 247
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=97.59 E-value=0.00069 Score=82.28 Aligned_cols=100 Identities=17% Similarity=0.210 Sum_probs=79.3
Q ss_pred cCCCCeEEEEECCCCcEEE-EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIA-QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~-~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
...|-.+++|++++.+.+. +.=+|+..|..++|.|+ +|+|++.|-+ .++|+..-. ....|+-+
T Consensus 193 ~SdDRsiRlW~i~s~~~~~~~~fgHsaRvw~~~~~~n--~i~t~gedct-crvW~~~~~-------------~l~~y~~h 256 (967)
T KOG0974|consen 193 VSDDRSIRLWPIDSREVLGCTGFGHSARVWACCFLPN--RIITVGEDCT-CRVWGVNGT-------------QLEVYDEH 256 (967)
T ss_pred EecCcceeeeecccccccCcccccccceeEEEEeccc--eeEEeccceE-EEEEecccc-------------eehhhhhh
Confidence 4567789999999987665 55689999999999999 9999999976 899965432 11133333
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
+| .-|+.++..++.-++.++..|+++++|++..-+..
T Consensus 257 ~g---~~iw~~~~~~~~~~~vT~g~Ds~lk~~~l~~r~~e 293 (967)
T KOG0974|consen 257 SG---KGIWKIAVPIGVIIKVTGGNDSTLKLWDLNGRGLE 293 (967)
T ss_pred hh---cceeEEEEcCCceEEEeeccCcchhhhhhhccccc
Confidence 33 23999999999999999999999999999765443
No 248
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.59 E-value=0.0067 Score=70.22 Aligned_cols=97 Identities=20% Similarity=0.339 Sum_probs=77.3
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+.+.+++|..|+...++.++.+++-...+.+++.+|||+.|++||. .|++||+.+. +.+.+|
T Consensus 119 S~~ad~~v~~~~~~~~~~~~~~~~~~~~~~sl~is~D~~~l~~as~---~ik~~~~~~k--------------evv~~f- 180 (541)
T KOG4547|consen 119 SVGADLKVVYILEKEKVIIRIWKEQKPLVSSLCISPDGKILLTASR---QIKVLDIETK--------------EVVITF- 180 (541)
T ss_pred ecCCceeEEEEecccceeeeeeccCCCccceEEEcCCCCEEEeccc---eEEEEEccCc--------------eEEEEe-
Confidence 4467889999999999999999999999999999999999999984 3999999986 444455
Q ss_pred ccCccccEEEEEEccC-----CCEEEEE-eCCCcEEEEecCC
Q 003310 376 RGLTNAVIQDISFSDD-----SNWIMIS-SSRGTSHLFAINP 411 (832)
Q Consensus 376 rG~t~a~I~~IaFSpD-----g~~LAsg-S~DgTVhIwdl~~ 411 (832)
.|| ...|.+++|--+ |+++.++ -.+.-+-+|-+..
T Consensus 181 tgh-~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~~ 221 (541)
T KOG4547|consen 181 TGH-GSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVEK 221 (541)
T ss_pred cCC-CcceEEEEEEEeccccccceeeeccccccceeEEEEEc
Confidence 465 457999999887 6666553 3345567777754
No 249
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.57 E-value=0.039 Score=70.68 Aligned_cols=82 Identities=7% Similarity=0.068 Sum_probs=52.8
Q ss_pred EEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCC---CCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 325 SALCFDPSGILLVTASVQGHNINIFKIIPGILG---TSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~---~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
..|+|+++|.++++-+.+++ |++||..++... .....+.. .... + .+. -.....|++++||+++++-+.+
T Consensus 807 ~Gvavd~dG~LYVADs~N~r-IrviD~~tg~v~tiaG~G~~G~~-dG~~---~-~a~-l~~P~GIavd~dG~lyVaDt~N 879 (1057)
T PLN02919 807 LGVLCAKDGQIYVADSYNHK-IKKLDPATKRVTTLAGTGKAGFK-DGKA---L-KAQ-LSEPAGLALGENGRLFVADTNN 879 (1057)
T ss_pred ceeeEeCCCcEEEEECCCCE-EEEEECCCCeEEEEeccCCcCCC-CCcc---c-ccc-cCCceEEEEeCCCCEEEEECCC
Confidence 47999999997777666654 999998765100 00000000 0000 0 000 0136789999999999999999
Q ss_pred CcEEEEecCCCC
Q 003310 402 GTSHLFAINPLG 413 (832)
Q Consensus 402 gTVhIwdl~~~g 413 (832)
++|++||+.+..
T Consensus 880 n~Irvid~~~~~ 891 (1057)
T PLN02919 880 SLIRYLDLNKGE 891 (1057)
T ss_pred CEEEEEECCCCc
Confidence 999999998754
No 250
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.57 E-value=0.0062 Score=70.35 Aligned_cols=76 Identities=20% Similarity=0.170 Sum_probs=46.7
Q ss_pred CeEEEEEcCCCCEEEEEEc-CCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 323 PISALCFDPSGILLVTASV-QGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~-dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
.....+|||||+.||..+. +|. .+||.+.....+ +. .. .+..+ ...+...+|||||++||..+.+
T Consensus 282 ~~~~p~wSPDG~~Laf~s~~~g~-~~ly~~~~~~~g--------~~-~~--~lt~~--~~~~~~p~wSPDG~~Laf~~~~ 347 (428)
T PRK01029 282 TQGNPSFSPDGTRLVFVSNKDGR-PRIYIMQIDPEG--------QS-PR--LLTKK--YRNSSCPAWSPDGKKIAFCSVI 347 (428)
T ss_pred CcCCeEECCCCCEEEEEECCCCC-ceEEEEECcccc--------cc-eE--EeccC--CCCccceeECCCCCEEEEEEcC
Confidence 3466799999998887764 554 566654321000 01 11 22111 1236678999999999987764
Q ss_pred ---CcEEEEecCCC
Q 003310 402 ---GTSHLFAINPL 412 (832)
Q Consensus 402 ---gTVhIwdl~~~ 412 (832)
..|++||+..+
T Consensus 348 ~g~~~I~v~dl~~g 361 (428)
T PRK01029 348 KGVRQICVYDLATG 361 (428)
T ss_pred CCCcEEEEEECCCC
Confidence 35788888654
No 251
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.51 E-value=0.00068 Score=78.15 Aligned_cols=108 Identities=9% Similarity=0.126 Sum_probs=89.7
Q ss_pred cCCCCeEEEEECCCCcEEEEec--cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFR--AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~--aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
+...|.|.+|++..++.-..|. .|.++|.++..+.+-.-|-|++.|++ +-.|+.... +++...
T Consensus 76 gt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~-v~~~~~~~~--------------~~~~~~ 140 (541)
T KOG4547|consen 76 GTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLK-VVYILEKEK--------------VIIRIW 140 (541)
T ss_pred ecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCcee-EEEEecccc--------------eeeeee
Confidence 4567899999999999988887 89999999999999999999999987 789998775 333333
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDA 423 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~ 423 (832)
.+.. -.+.+++.+|||+.++++| ++|++|++++..-..+|.+|.+
T Consensus 141 -~~~~-~~~~sl~is~D~~~l~~as--~~ik~~~~~~kevv~~ftgh~s 185 (541)
T KOG4547|consen 141 -KEQK-PLVSSLCISPDGKILLTAS--RQIKVLDIETKEVVITFTGHGS 185 (541)
T ss_pred -ccCC-CccceEEEcCCCCEEEecc--ceEEEEEccCceEEEEecCCCc
Confidence 2322 3489999999999999886 6899999999888889999944
No 252
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=97.50 E-value=0.064 Score=60.25 Aligned_cols=101 Identities=13% Similarity=0.135 Sum_probs=63.6
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
...+|.+..+|..+++.+...+.. ....+. .++..|..++.+|. +..+|..++ ..+++...
T Consensus 247 ~~~~g~l~a~d~~tG~~~W~~~~~--~~~~p~--~~~~~vyv~~~~G~-l~~~d~~tG--------------~~~W~~~~ 307 (377)
T TIGR03300 247 VSYQGRVAALDLRSGRVLWKRDAS--SYQGPA--VDDNRLYVTDADGV-VVALDRRSG--------------SELWKNDE 307 (377)
T ss_pred EEcCCEEEEEECCCCcEEEeeccC--CccCce--EeCCEEEEECCCCe-EEEEECCCC--------------cEEEcccc
Confidence 456789999999999887766521 112222 35667777788887 889999876 34444421
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
.........+. .+..|.+++.+|.+++||..++...-.+.
T Consensus 308 -~~~~~~ssp~i--~g~~l~~~~~~G~l~~~d~~tG~~~~~~~ 347 (377)
T TIGR03300 308 -LKYRQLTAPAV--VGGYLVVGDFEGYLHWLSREDGSFVARLK 347 (377)
T ss_pred -ccCCccccCEE--ECCEEEEEeCCCEEEEEECCCCCEEEEEE
Confidence 11111111112 46688899999999999987754433443
No 253
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=97.48 E-value=0.00034 Score=79.84 Aligned_cols=111 Identities=14% Similarity=0.204 Sum_probs=82.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcC--CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDP--SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSP--dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
++++.+|-.+.|||.-..+++..+. +|+..|.+++|=| +.++++||+.|.. |++||+.....+. .++.-....+
T Consensus 65 L~SGSDD~r~ivWd~~~~KllhsI~TgHtaNIFsvKFvP~tnnriv~sgAgDk~-i~lfdl~~~~~~~--~d~~~~~~~~ 141 (758)
T KOG1310|consen 65 LASGSDDTRLIVWDPFEYKLLHSISTGHTANIFSVKFVPYTNNRIVLSGAGDKL-IKLFDLDSSKEGG--MDHGMEETTR 141 (758)
T ss_pred EeecCCcceEEeecchhcceeeeeecccccceeEEeeeccCCCeEEEeccCcce-EEEEecccccccc--cccCccchhh
Confidence 4677888899999999999988887 9999999999999 5678899999964 9999997531100 1110001111
Q ss_pred EEEEeccCccccEEEEEEccCC-CEEEEEeCCCcEEEEecCC
Q 003310 371 LYRLQRGLTNAVIQDISFSDDS-NWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSpDg-~~LAsgS~DgTVhIwdl~~ 411 (832)
+|. -+..+|-.|+-.|++ ..+-++|.|||++-+|+..
T Consensus 142 ~~~----cht~rVKria~~p~~PhtfwsasEDGtirQyDiRE 179 (758)
T KOG1310|consen 142 CWS----CHTDRVKRIATAPNGPHTFWSASEDGTIRQYDIRE 179 (758)
T ss_pred hhh----hhhhhhhheecCCCCCceEEEecCCcceeeecccC
Confidence 122 222347788899998 7888999999999999965
No 254
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=97.47 E-value=0.0032 Score=68.34 Aligned_cols=112 Identities=13% Similarity=0.150 Sum_probs=70.4
Q ss_pred CCCCeEEEEECCCCc---EEEEe-ccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCC----CCCCCccCCCCcee
Q 003310 298 DNVGMVIVRDIVSKN---VIAQF-RAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGI----LGTSSACDAGTSYV 369 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~---~l~~~-~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~----~~~~s~~~~~~~~~ 369 (832)
+....|.||-++... +-+++ +.+.+.|++|.|.|++-+||+||.|+. .|||...-.. +....-+..-+=..
T Consensus 119 Sgar~isVcy~E~ENdWWVsKhikkPirStv~sldWhpnnVLlaaGs~D~k-~rVfSayIK~Vdekpap~pWgsk~PFG~ 197 (361)
T KOG1523|consen 119 SGARLISVCYYEQENDWWVSKHIKKPIRSTVTSLDWHPNNVLLAAGSTDGK-CRVFSAYIKGVDEKPAPTPWGSKMPFGQ 197 (361)
T ss_pred cCccEEEEEEEecccceehhhhhCCccccceeeeeccCCcceecccccCcc-eeEEEEeeeccccCCCCCCCccCCcHHH
Confidence 334456666554322 11122 367789999999999999999999998 7999643210 10000000000112
Q ss_pred EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 370 HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 370 ~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+.++. .....|..+.|||+|..||=.+.|.++.+=|....
T Consensus 198 lm~E~~--~~ggwvh~v~fs~sG~~lawv~Hds~v~~~da~~p 238 (361)
T KOG1523|consen 198 LMSEAS--SSGGWVHGVLFSPSGNRLAWVGHDSTVSFVDAAGP 238 (361)
T ss_pred HHHhhc--cCCCceeeeEeCCCCCEeeEecCCCceEEeecCCC
Confidence 223332 22345999999999999999999999988776544
No 255
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=97.43 E-value=0.00058 Score=71.55 Aligned_cols=80 Identities=20% Similarity=0.272 Sum_probs=63.9
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCC
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
.|++|-.+|+..-|.+|..|+. |.-||+.++ .+.+..|||+.. |.+++--.-+..+.+|+.||
T Consensus 116 eINam~ldP~enSi~~AgGD~~-~y~~dlE~G---------------~i~r~~rGHtDY-vH~vv~R~~~~qilsG~EDG 178 (325)
T KOG0649|consen 116 EINAMWLDPSENSILFAGGDGV-IYQVDLEDG---------------RIQREYRGHTDY-VHSVVGRNANGQILSGAEDG 178 (325)
T ss_pred ccceeEeccCCCcEEEecCCeE-EEEEEecCC---------------EEEEEEcCCcce-eeeeeecccCcceeecCCCc
Confidence 5899999998887888888886 899999987 344555898764 88888844445567999999
Q ss_pred cEEEEecCCCCCceeec
Q 003310 403 TSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 403 TVhIwdl~~~g~~~~~~ 419 (832)
|++|||+.+.+....+.
T Consensus 179 tvRvWd~kt~k~v~~ie 195 (325)
T KOG0649|consen 179 TVRVWDTKTQKHVSMIE 195 (325)
T ss_pred cEEEEeccccceeEEec
Confidence 99999999877666654
No 256
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.43 E-value=0.00012 Score=82.26 Aligned_cols=107 Identities=10% Similarity=0.271 Sum_probs=89.0
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
+++++..|.++.-|+.+|+.++.|..-.+.+..|+-+|=...+-+|-..|+ +-+|.-... ..|.+
T Consensus 224 L~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~~vm~qNP~NaVih~GhsnGt-VSlWSP~sk--------------ePLvK 288 (545)
T KOG1272|consen 224 LVAASEAGFLKYQDVSTGKLVASIRTGAGRTDVMKQNPYNAVIHLGHSNGT-VSLWSPNSK--------------EPLVK 288 (545)
T ss_pred eeecccCCceEEEeechhhhhHHHHccCCccchhhcCCccceEEEcCCCce-EEecCCCCc--------------chHHH
Confidence 356778899999999999999999988899999999999999999999998 899976543 12212
Q ss_pred E--eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 374 L--QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 374 l--~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
+ +|| .|.+||+.++|+|+|+++.|..++|||+..+....++.
T Consensus 289 iLcH~g----~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~~~ql~t~~ 332 (545)
T KOG1272|consen 289 ILCHRG----PVSSIAVDRGGRYMATTGLDRKVKIWDLRNFYQLHTYR 332 (545)
T ss_pred HHhcCC----CcceEEECCCCcEEeecccccceeEeeeccccccceee
Confidence 1 233 49999999999999999999999999999987655543
No 257
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.40 E-value=0.1 Score=60.56 Aligned_cols=101 Identities=13% Similarity=0.243 Sum_probs=73.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEE-cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTAS-VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS-~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
..++++.+++...+...+.. .+||.+++|+|+|+-++.+- .-=-.+-|||++- ..++.|-.|.
T Consensus 250 Eq~Lyll~t~g~s~~V~L~k-~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~---------------~~v~df~egp 313 (566)
T KOG2315|consen 250 EQTLYLLATQGESVSVPLLK-EGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRG---------------KPVFDFPEGP 313 (566)
T ss_pred cceEEEEEecCceEEEecCC-CCCceEEEECCCCCEEEEEEecccceEEEEcCCC---------------CEeEeCCCCC
Confidence 34788888885555545443 58999999999999887643 2112367888753 3556665443
Q ss_pred ccccEEEEEEccCCCEEEEEeC---CCcEEEEecCCCCCceeecc
Q 003310 379 TNAVIQDISFSDDSNWIMISSS---RGTSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 379 t~a~I~~IaFSpDg~~LAsgS~---DgTVhIwdl~~~g~~~~~~~ 420 (832)
=+++-|+|.|++|+.++- .|.+-|||+...+....+..
T Consensus 314 ----RN~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n~K~i~~~~a 354 (566)
T KOG2315|consen 314 ----RNTAFFNPHGNIILLAGFGNLPGDMEVWDVPNRKLIAKFKA 354 (566)
T ss_pred ----ccceEECCCCCEEEEeecCCCCCceEEEeccchhhcccccc
Confidence 356789999999998776 58999999999887777754
No 258
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=97.37 E-value=0.0065 Score=66.56 Aligned_cols=99 Identities=20% Similarity=0.361 Sum_probs=75.8
Q ss_pred CCCCeEEEE--ECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 298 DNVGMVIVR--DIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 298 ~~~G~V~Iw--Dl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
...|.|.+- +.....++.++++|..+|.+++|+|.-.+|.+++.|-. +.+||+--. ....+.+.
T Consensus 172 d~~gqvt~lr~~~~~~~~i~~~~~h~~~~~~l~Wd~~~~~LfSg~~d~~-vi~wdigg~-------------~g~~~el~ 237 (404)
T KOG1409|consen 172 DHSGQITMLKLEQNGCQLITTFNGHTGEVTCLKWDPGQRLLFSGASDHS-VIMWDIGGR-------------KGTAYELQ 237 (404)
T ss_pred ccccceEEEEEeecCCceEEEEcCcccceEEEEEcCCCcEEEeccccCc-eEEEeccCC-------------cceeeeec
Confidence 445555553 34456789999999999999999999999999999955 889998543 12334554
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
|+ ...|+.++.-+--+.+.+++.||.|-+|+.+-.
T Consensus 238 -gh-~~kV~~l~~~~~t~~l~S~~edg~i~~w~mn~~ 272 (404)
T KOG1409|consen 238 -GH-NDKVQALSYAQHTRQLISCGEDGGIVVWNMNVK 272 (404)
T ss_pred -cc-hhhhhhhhhhhhheeeeeccCCCeEEEEeccce
Confidence 33 235788888788889999999999999999754
No 259
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=97.36 E-value=0.00054 Score=86.80 Aligned_cols=91 Identities=19% Similarity=0.387 Sum_probs=68.5
Q ss_pred CCCeEEEEECCC---CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 299 NVGMVIVRDIVS---KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 299 ~~G~V~IwDl~s---~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
.++.+.+||.-- ..++. .+|.+.+++++|-|.-++|.||+.+|. |.|||++.. ..+| +
T Consensus 2313 d~~n~~lwDtl~~~~~s~v~--~~H~~gaT~l~~~P~~qllisggr~G~-v~l~D~rqr------------ql~h--~-- 2373 (2439)
T KOG1064|consen 2313 DNRNVCLWDTLLPPMNSLVH--TCHDGGATVLAYAPKHQLLISGGRKGE-VCLFDIRQR------------QLRH--T-- 2373 (2439)
T ss_pred CCCcccchhcccCcccceee--eecCCCceEEEEcCcceEEEecCCcCc-EEEeehHHH------------HHHH--H--
Confidence 456788898642 23444 799999999999999999999999998 899999754 1111 1
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
++. +. .-.++.+++++|.++||+++.++-..+|
T Consensus 2374 -------~~~--~~-~~~~f~~~ss~g~ikIw~~s~~~ll~~~ 2406 (2439)
T KOG1064|consen 2374 -------FQA--LD-TREYFVTGSSEGNIKIWRLSEFGLLHTF 2406 (2439)
T ss_pred -------hhh--hh-hhheeeccCcccceEEEEccccchhhcC
Confidence 111 22 3457899999999999999988555555
No 260
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=97.35 E-value=0.0038 Score=70.58 Aligned_cols=105 Identities=19% Similarity=0.297 Sum_probs=79.2
Q ss_pred cCCCC-eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVG-MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G-~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+..+| .+-|+|..+++. ..+...-+.|-+|+.+|||++++.|-.... |.+.|+.++ .++.+-+-+
T Consensus 377 gt~dgD~l~iyd~~~~e~-kr~e~~lg~I~av~vs~dGK~~vvaNdr~e-l~vididng------------nv~~idkS~ 442 (668)
T COG4946 377 GTNDGDKLGIYDKDGGEV-KRIEKDLGNIEAVKVSPDGKKVVVANDRFE-LWVIDIDNG------------NVRLIDKSE 442 (668)
T ss_pred eccCCceEEEEecCCceE-EEeeCCccceEEEEEcCCCcEEEEEcCceE-EEEEEecCC------------CeeEecccc
Confidence 44566 899999998865 566677789999999999999999887765 677788887 233333322
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCC----cEEEEecCCCCCceeecc
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRG----TSHLFAINPLGGSVNFQP 420 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~Dg----TVhIwdl~~~g~~~~~~~ 420 (832)
. +-|.+++|+|+++|+|-+--+| .|||||+.. +....+.+
T Consensus 443 ~----~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~-~Kiy~vTT 486 (668)
T COG4946 443 Y----GLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDG-GKIYDVTT 486 (668)
T ss_pred c----ceeEEEEEcCCceeEEEecCcceeeeeEEEEecCC-CeEEEecC
Confidence 2 3499999999999999887766 699999965 34555543
No 261
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=97.34 E-value=0.0032 Score=74.65 Aligned_cols=102 Identities=23% Similarity=0.342 Sum_probs=77.1
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC------C---CEEEEEeCCCCCCCCCCccCCC
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ------G---HNINIFKIIPGILGTSSACDAG 365 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d------G---t~I~Iwdi~t~~~~~~s~~~~~ 365 (832)
+-+.+.|+|.|+|+.++.+-+.|..|++.|.+|.|--... |+|.+.. | +.+.|-|+++|.
T Consensus 441 AvGT~sGTV~vvdvst~~v~~~fsvht~~VkgleW~g~ss-lvSfsys~~n~~sg~vrN~l~vtdLrtGl---------- 509 (1062)
T KOG1912|consen 441 AVGTNSGTVDVVDVSTNAVAASFSVHTSLVKGLEWLGNSS-LVSFSYSHVNSASGGVRNDLVVTDLRTGL---------- 509 (1062)
T ss_pred EeecCCceEEEEEecchhhhhhhcccccceeeeeecccee-EEEeeeccccccccceeeeEEEEEccccc----------
Confidence 3466889999999999999999999999999999966554 4444321 1 124466787761
Q ss_pred CceeEEEEEe--ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 366 TSYVHLYRLQ--RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 366 ~~~~~l~~l~--rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
. ..+| ++...+.|..|--|.-++|||+.-.|.-+-|||+.+.
T Consensus 510 ---s--k~fR~l~~~despI~~irvS~~~~yLai~Fr~~plEiwd~kt~ 553 (1062)
T KOG1912|consen 510 ---S--KRFRGLQKPDESPIRAIRVSSSGRYLAILFRREPLEIWDLKTL 553 (1062)
T ss_pred ---c--cccccCCCCCcCcceeeeecccCceEEEEecccchHHHhhccc
Confidence 1 1232 4555566999999999999999999999999999653
No 262
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=97.33 E-value=0.0015 Score=72.06 Aligned_cols=105 Identities=12% Similarity=0.115 Sum_probs=83.4
Q ss_pred ccccCCCCeEEEEECC------CCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCC
Q 003310 294 FPDADNVGMVIVRDIV------SKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGT 366 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~------s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~ 366 (832)
+++++.|-.++||.+. +.++|.... .|.+.|.||+|+-..+.|.+|..+|+ +..-|+.+.
T Consensus 71 L~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N~~~~SG~~~~~-VI~HDiEt~------------ 137 (609)
T KOG4227|consen 71 LASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLENRFLYSGERWGT-VIKHDIETK------------ 137 (609)
T ss_pred EeecCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCCeeEecCCCcce-eEeeecccc------------
Confidence 4567788899999985 345665554 55689999999999999999999998 557899875
Q ss_pred ceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 367 SYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 367 ~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
..+|.+..-.....|+.+.-+|-.+.||+.+.++.|.+||+....
T Consensus 138 --qsi~V~~~~~~~~~VY~m~~~P~DN~~~~~t~~~~V~~~D~Rd~~ 182 (609)
T KOG4227|consen 138 --QSIYVANENNNRGDVYHMDQHPTDNTLIVVTRAKLVSFIDNRDRQ 182 (609)
T ss_pred --eeeeeecccCcccceeecccCCCCceEEEEecCceEEEEeccCCC
Confidence 455555333223359999999999999999999999999997654
No 263
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.28 E-value=0.0068 Score=69.88 Aligned_cols=51 Identities=25% Similarity=0.470 Sum_probs=41.5
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc------CCCEEEEEeCC
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV------QGHNINIFKII 352 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~------dGt~I~Iwdi~ 352 (832)
-.|.|-|||+.+.+++..+++-.. +-..|+|||++|+||.. |.. |+||+..
T Consensus 334 L~G~mEvwDv~n~K~i~~~~a~~t--t~~eW~PdGe~flTATTaPRlrvdNg-~Kiwhyt 390 (566)
T KOG2315|consen 334 LPGDMEVWDVPNRKLIAKFKAANT--TVFEWSPDGEYFLTATTAPRLRVDNG-IKIWHYT 390 (566)
T ss_pred CCCceEEEeccchhhccccccCCc--eEEEEcCCCcEEEEEeccccEEecCC-eEEEEec
Confidence 457899999999999999987644 55789999999999875 333 8999874
No 264
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.26 E-value=0.0015 Score=70.90 Aligned_cols=87 Identities=17% Similarity=0.265 Sum_probs=68.2
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
..+++|.+|++....--..+..-..++++++|||||+ +|.|...|-+ |.||.+.+. .+.+ +.
T Consensus 68 yk~~~vqvwsl~Qpew~ckIdeg~agls~~~WSPdgrhiL~tseF~lr-iTVWSL~t~------------~~~~---~~- 130 (447)
T KOG4497|consen 68 YKDPKVQVWSLVQPEWYCKIDEGQAGLSSISWSPDGRHILLTSEFDLR-ITVWSLNTQ------------KGYL---LP- 130 (447)
T ss_pred eccceEEEEEeecceeEEEeccCCCcceeeeECCCcceEeeeecceeE-EEEEEeccc------------eeEE---ec-
Confidence 4678999999998887788888888999999999996 5566677765 999999875 1222 22
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCC
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
+..+.+..++|.|||++.|..+.+.
T Consensus 131 -~pK~~~kg~~f~~dg~f~ai~sRrD 155 (447)
T KOG4497|consen 131 -HPKTNVKGYAFHPDGQFCAILSRRD 155 (447)
T ss_pred -ccccCceeEEECCCCceeeeeeccc
Confidence 2233478999999999999998763
No 265
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=97.26 E-value=0.17 Score=56.87 Aligned_cols=92 Identities=17% Similarity=0.207 Sum_probs=58.2
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCC-CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHK-SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~-~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
...+|.|..+|..+++.+..+..-. ..+.+... .|.+|++++.+|. +.+||..++ +.++++.
T Consensus 285 ~~~~G~l~~~d~~tG~~~W~~~~~~~~~~ssp~i--~g~~l~~~~~~G~-l~~~d~~tG--------------~~~~~~~ 347 (377)
T TIGR03300 285 TDADGVVVALDRRSGSELWKNDELKYRQLTAPAV--VGGYLVVGDFEGY-LHWLSREDG--------------SFVARLK 347 (377)
T ss_pred ECCCCeEEEEECCCCcEEEccccccCCccccCEE--ECCEEEEEeCCCE-EEEEECCCC--------------CEEEEEE
Confidence 4568999999999998877663211 12222222 4678888999997 889998876 4555554
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
-+... ....-++. ++ .|..++.||+|+.|.
T Consensus 348 ~~~~~-~~~sp~~~-~~-~l~v~~~dG~l~~~~ 377 (377)
T TIGR03300 348 TDGSG-IASPPVVV-GD-GLLVQTRDGDLYAFR 377 (377)
T ss_pred cCCCc-cccCCEEE-CC-EEEEEeCCceEEEeC
Confidence 22211 01122222 43 477899999998773
No 266
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=97.24 E-value=0.49 Score=59.93 Aligned_cols=99 Identities=14% Similarity=0.191 Sum_probs=66.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC--CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ--GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d--Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
...++||+-+ |.+..+-+.-.+-=.+|+|-|+|.+||+.... .+.|..|.- .| -.+--+.++..
T Consensus 236 ~R~iRVy~Re-G~L~stSE~v~gLe~~l~WrPsG~lIA~~q~~~~~~~VvFfEr-NG------------LrhgeF~l~~~ 301 (928)
T PF04762_consen 236 RRVIRVYSRE-GELQSTSEPVDGLEGALSWRPSGNLIASSQRLPDRHDVVFFER-NG------------LRHGEFTLRFD 301 (928)
T ss_pred eeEEEEECCC-ceEEeccccCCCccCCccCCCCCCEEEEEEEcCCCcEEEEEec-CC------------cEeeeEecCCC
Confidence 3589999976 55444433222333578999999999998762 234555542 22 22334566432
Q ss_pred CccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCC
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLG 413 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g 413 (832)
.....|..++|++||..||+...|. |.+|-...|-
T Consensus 302 ~~~~~v~~l~Wn~ds~iLAv~~~~~-vqLWt~~NYH 336 (928)
T PF04762_consen 302 PEEEKVIELAWNSDSEILAVWLEDR-VQLWTRSNYH 336 (928)
T ss_pred CCCceeeEEEECCCCCEEEEEecCC-ceEEEeeCCE
Confidence 3344699999999999999988665 9999987763
No 267
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=97.19 E-value=0.00055 Score=78.02 Aligned_cols=115 Identities=22% Similarity=0.251 Sum_probs=79.5
Q ss_pred ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC---------------
Q 003310 292 GHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL--------------- 356 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~--------------- 356 (832)
.+|+-...||.+.|..- ++++-..+.+|.+.|.|-.|+|||+-|+|++.||- |+||.- +|..
T Consensus 76 d~~~i~s~DGkf~il~k-~~rVE~sv~AH~~A~~~gRW~~dGtgLlt~GEDG~-iKiWSr-sGMLRStl~Q~~~~v~c~~ 152 (737)
T KOG1524|consen 76 DTLLICSNDGRFVILNK-SARVERSISAHAAAISSGRWSPDGAGLLTAGEDGV-IKIWSR-SGMLRSTVVQNEESIRCAR 152 (737)
T ss_pred ceEEEEcCCceEEEecc-cchhhhhhhhhhhhhhhcccCCCCceeeeecCCce-EEEEec-cchHHHHHhhcCceeEEEE
Confidence 34566678899888763 45666778899999999999999999999999996 999963 2210
Q ss_pred -CCCCccCCCCceeEEE---------EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 357 -GTSSACDAGTSYVHLY---------RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 357 -~~~s~~~~~~~~~~l~---------~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
++.|..--.....|++ -+++..+..-|.++.|++.+..+++|+.|-..+|||-
T Consensus 153 W~p~S~~vl~c~g~h~~IKpL~~n~k~i~WkAHDGiiL~~~W~~~s~lI~sgGED~kfKvWD~ 215 (737)
T KOG1524|consen 153 WAPNSNSIVFCQGGHISIKPLAANSKIIRWRAHDGLVLSLSWSTQSNIIASGGEDFRFKIWDA 215 (737)
T ss_pred ECCCCCceEEecCCeEEEeecccccceeEEeccCcEEEEeecCccccceeecCCceeEEeecc
Confidence 1111100000001111 1112222234899999999999999999999999996
No 268
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=97.18 E-value=0.001 Score=46.26 Aligned_cols=39 Identities=26% Similarity=0.663 Sum_probs=34.7
Q ss_pred CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEe
Q 003310 311 KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFK 350 (832)
Q Consensus 311 ~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwd 350 (832)
++++..+.+|...|.+++|+|++.++++++.||+ +++|+
T Consensus 2 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~d~~-~~~~~ 40 (40)
T smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASASDDGT-IKLWD 40 (40)
T ss_pred cEEEEEEEecCCceeEEEECCCCCEEEEecCCCe-EEEcC
Confidence 3567788899999999999999999999999987 89996
No 269
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=97.10 E-value=0.0029 Score=73.50 Aligned_cols=91 Identities=15% Similarity=0.191 Sum_probs=68.3
Q ss_pred EEEEECCCCc--EEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 303 VIVRDIVSKN--VIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 303 V~IwDl~s~~--~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
-.+|++...+ .++... ...+.|.|.+++|+.+.|+.|+.||. |.+||...+ ..+..+.
T Consensus 238 ~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~DgS-iiLyD~~~~-------------~t~~~ka----- 298 (545)
T PF11768_consen 238 SCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDGS-IILYDTTRG-------------VTLLAKA----- 298 (545)
T ss_pred EEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecCCe-EEEEEcCCC-------------eeeeeee-----
Confidence 3456765543 222222 66788999999999999999999998 899998775 1111111
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.-....++|+|||..|++|+..|.+.+||+.-.
T Consensus 299 ~~~P~~iaWHp~gai~~V~s~qGelQ~FD~ALs 331 (545)
T PF11768_consen 299 EFIPTLIAWHPDGAIFVVGSEQGELQCFDMALS 331 (545)
T ss_pred cccceEEEEcCCCcEEEEEcCCceEEEEEeecC
Confidence 123678999999999999999999999999654
No 270
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=97.07 E-value=0.0068 Score=68.18 Aligned_cols=102 Identities=10% Similarity=0.036 Sum_probs=75.6
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc---------CCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV---------QGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~---------dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
|.|.|.|..+.+.+.++..-..|-. + +||||+.|..|.. +...|.|||+.+. ..+
T Consensus 27 ~~v~ViD~~~~~v~g~i~~G~~P~~-~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~--------------~~~ 90 (352)
T TIGR02658 27 TQVYTIDGEAGRVLGMTDGGFLPNP-V-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTH--------------LPI 90 (352)
T ss_pred ceEEEEECCCCEEEEEEEccCCCce-e-ECCCCCEEEEEeccccccccCCCCCEEEEEECccC--------------cEE
Confidence 7899999999999999985555543 4 9999999988887 5556999999986 333
Q ss_pred EEEeccCc-----cccEEEEEEccCCCEEEEEe-C-CCcEEEEecCCCCCceee
Q 003310 372 YRLQRGLT-----NAVIQDISFSDDSNWIMISS-S-RGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 372 ~~l~rG~t-----~a~I~~IaFSpDg~~LAsgS-~-DgTVhIwdl~~~g~~~~~ 418 (832)
.++.-+.. ...-..+++||||++|.+.. + +.+|-|.|+...+-...+
T Consensus 91 ~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei 144 (352)
T TIGR02658 91 ADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMM 144 (352)
T ss_pred eEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEE
Confidence 34432211 11245789999999999877 3 689999999876443333
No 271
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=96.96 E-value=0.011 Score=67.15 Aligned_cols=111 Identities=18% Similarity=0.296 Sum_probs=63.3
Q ss_pred cCCCCeEEEEECCCC------cEEEEeccC------CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCC--cc
Q 003310 297 ADNVGMVIVRDIVSK------NVIAQFRAH------KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSS--AC 362 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~------~~l~~~~aH------~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s--~~ 362 (832)
++.+-.++|||...- .++.+|..| .-.|+|++|+.+|.-|.++-.|-. |.+|.-..+ .++.. .+
T Consensus 300 gG~dqf~RvYD~R~~~~e~~n~~~~~f~p~hl~~d~~v~ITgl~Ysh~~sElLaSYnDe~-IYLF~~~~~-~G~~p~~~s 377 (559)
T KOG1334|consen 300 GGSDQFARVYDQRRIDKEENNGVLDKFCPHHLVEDDPVNITGLVYSHDGSELLASYNDED-IYLFNKSMG-DGSEPDPSS 377 (559)
T ss_pred CChhhhhhhhcccchhhccccchhhhcCCccccccCcccceeEEecCCccceeeeecccc-eEEeccccc-cCCCCCCCc
Confidence 344555666665431 133444333 235899999987664444444433 778843222 11110 00
Q ss_pred CCCCceeEEEEEeccCcccc-EEEEEE-ccCCCEEEEEeCCCcEEEEecCCC
Q 003310 363 DAGTSYVHLYRLQRGLTNAV-IQDISF-SDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 363 ~~~~~~~~l~~l~rG~t~a~-I~~IaF-SpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.......++| .||.+.. |-.+-| -|.+.|+++||+=|-|.||+-++.
T Consensus 378 ~~~~~~k~vY---KGHrN~~TVKgVNFfGPrsEyVvSGSDCGhIFiW~K~t~ 426 (559)
T KOG1334|consen 378 PREQYVKRVY---KGHRNSRTVKGVNFFGPRSEYVVSGSDCGHIFIWDKKTG 426 (559)
T ss_pred chhhccchhh---cccccccccceeeeccCccceEEecCccceEEEEecchh
Confidence 0001122224 4555544 777765 799999999999999999998764
No 272
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.94 E-value=0.1 Score=57.81 Aligned_cols=94 Identities=19% Similarity=0.305 Sum_probs=66.9
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC-CC
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS-RG 402 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~-Dg 402 (832)
+-+..|+|+|++|++.+-.--.|.+|++..+... ....+.+..|.. -.-|+|+|++++.-..+. ++
T Consensus 147 ~H~a~~tP~~~~l~v~DLG~Dri~~y~~~dg~L~----------~~~~~~v~~G~G---PRHi~FHpn~k~aY~v~EL~s 213 (346)
T COG2706 147 VHSANFTPDGRYLVVPDLGTDRIFLYDLDDGKLT----------PADPAEVKPGAG---PRHIVFHPNGKYAYLVNELNS 213 (346)
T ss_pred cceeeeCCCCCEEEEeecCCceEEEEEcccCccc----------cccccccCCCCC---cceEEEcCCCcEEEEEeccCC
Confidence 6678899999999998865556899999866210 111223344432 456899999999877665 99
Q ss_pred cEEEEecCCCCCc-eeeccCCCCcccccC
Q 003310 403 TSHLFAINPLGGS-VNFQPTDANFTTKHG 430 (832)
Q Consensus 403 TVhIwdl~~~g~~-~~~~~H~~~~~~~~~ 430 (832)
||-+|.+++..+. ..++.+..-+..|.|
T Consensus 214 tV~v~~y~~~~g~~~~lQ~i~tlP~dF~g 242 (346)
T COG2706 214 TVDVLEYNPAVGKFEELQTIDTLPEDFTG 242 (346)
T ss_pred EEEEEEEcCCCceEEEeeeeccCccccCC
Confidence 9999999997554 567777666665543
No 273
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=96.85 E-value=0.013 Score=70.73 Aligned_cols=184 Identities=15% Similarity=0.142 Sum_probs=124.7
Q ss_pred CCEEEEEECCCCcEEEEEeCCC-CEEEEEEcCCEEEEE-eCCEEEEEECCCCceEEEEecCCCccCCCCCCCCCccccee
Q 003310 116 PTVVHFYSLRSQSYVHMLKFRS-PIYSVRCSSRVVAIC-QAAQVHCFDAATLEIEYAILTNPIVMGHPSAGGIGIGYGPL 193 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f~s-~V~sV~~S~r~LAVa-~~~~I~vwDl~t~~~~~tl~t~~~~~~~p~~~~~~~~~~p~ 193 (832)
+..+-.+|+++.+..+.....+ .|.=++-|.+++..+ ..++|.+-|..+.+.++++..|... ...|
T Consensus 156 Q~~li~~Dl~~~~e~r~~~v~a~~v~imR~Nnr~lf~G~t~G~V~LrD~~s~~~iht~~aHs~s------------iSDf 223 (1118)
T KOG1275|consen 156 QEKLIHIDLNTEKETRTTNVSASGVTIMRYNNRNLFCGDTRGTVFLRDPNSFETIHTFDAHSGS------------ISDF 223 (1118)
T ss_pred hhheeeeecccceeeeeeeccCCceEEEEecCcEEEeecccceEEeecCCcCceeeeeeccccc------------eeee
Confidence 3557778999999888888754 688888888877764 6789999999999999999988531 1223
Q ss_pred eeccceEEeeCCCceecCCCccCCcccccccccccccCCCcceeeeecccccceeceeeeccCccccccccccccccCCC
Q 003310 194 AVGPRWLAYSGSPVVVSNDGRVNPQHLMQSRSFSGFASNGSRVAHYAKESSKHLAAGIVNLGDLGYKKLSQYCSEFLPDS 273 (832)
Q Consensus 194 Alg~r~LAya~~~~~~s~~Grvsp~~l~~s~~~s~~~s~g~~V~~~A~~ssk~lasGi~~lGd~g~~~ls~y~~~~~p~~ 273 (832)
.+....|+.++.. +| .|
T Consensus 224 Dv~GNlLitCG~S------~R-------------------------------------------------~~-------- 240 (1118)
T KOG1275|consen 224 DVQGNLLITCGYS------MR-------------------------------------------------RY-------- 240 (1118)
T ss_pred eccCCeEEEeecc------cc-------------------------------------------------cc--------
Confidence 3322222222100 00 00
Q ss_pred cCccccccCCCCCCCcccccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCC
Q 003310 274 QNSLQSAIPGGKSNGTVNGHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKII 352 (832)
Q Consensus 274 ~~si~sa~~~~~~~g~~~g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~ 352 (832)
.-..|--|+|||++..+.+.-+.-|.+| .-+.|.|. -+.||.+|.-|. +.+-|..
T Consensus 241 ----------------------~l~~D~FvkVYDLRmmral~PI~~~~~P-~flrf~Psl~t~~~V~S~sGq-~q~vd~~ 296 (1118)
T KOG1275|consen 241 ----------------------NLAMDPFVKVYDLRMMRALSPIQFPYGP-QFLRFHPSLTTRLAVTSQSGQ-FQFVDTA 296 (1118)
T ss_pred ----------------------cccccchhhhhhhhhhhccCCcccccCc-hhhhhcccccceEEEEecccc-eeecccc
Confidence 0123456999999998888777777776 66889997 457888888887 6777744
Q ss_pred CCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 353 PGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 353 t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
+-.. +. ..++.+. ....-|..++||+.|+.+|.+-.+|.||+|.=.
T Consensus 297 ~lsN---------P~-~~~~~v~--p~~s~i~~fDiSsn~~alafgd~~g~v~~wa~~ 342 (1118)
T KOG1275|consen 297 TLSN---------PP-AGVKMVN--PNGSGISAFDISSNGDALAFGDHEGHVNLWADR 342 (1118)
T ss_pred ccCC---------Cc-cceeEEc--cCCCcceeEEecCCCceEEEecccCcEeeecCC
Confidence 3200 00 0111111 111238899999999999999999999999843
No 274
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=96.75 E-value=0.37 Score=53.21 Aligned_cols=102 Identities=21% Similarity=0.235 Sum_probs=63.5
Q ss_pred eEEEEECCCCcEEEE--ec--cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE---
Q 003310 302 MVIVRDIVSKNVIAQ--FR--AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL--- 374 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~--~~--aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l--- 374 (832)
.+.+.|..+++++.+ +. -|...|..|+++++|+.+...-.+|- -++..+- -+... .+....+..+
T Consensus 139 sL~~ld~~sG~ll~q~~Lp~~~~~lSiRHLa~~~~G~V~~a~Q~qg~---~~~~~PL-va~~~----~g~~~~~~~~p~~ 210 (305)
T PF07433_consen 139 SLVYLDARSGALLEQVELPPDLHQLSIRHLAVDGDGTVAFAMQYQGD---PGDAPPL-VALHR----RGGALRLLPAPEE 210 (305)
T ss_pred ceEEEecCCCceeeeeecCccccccceeeEEecCCCcEEEEEecCCC---CCccCCe-EEEEc----CCCcceeccCChH
Confidence 466678889988887 52 37789999999999998877766654 1222111 00000 0000011111
Q ss_pred -eccCccccEEEEEEccCCCEEEEEeCCC-cEEEEecCCC
Q 003310 375 -QRGLTNAVIQDISFSDDSNWIMISSSRG-TSHLFAINPL 412 (832)
Q Consensus 375 -~rG~t~a~I~~IaFSpDg~~LAsgS~Dg-TVhIwdl~~~ 412 (832)
.+.+.. -|-+|||++|+.++|++|-+| .+.+||..+.
T Consensus 211 ~~~~l~~-Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg 249 (305)
T PF07433_consen 211 QWRRLNG-YIGSIAADRDGRLIAVTSPRGGRVAVWDAATG 249 (305)
T ss_pred HHHhhCC-ceEEEEEeCCCCEEEEECCCCCEEEEEECCCC
Confidence 011111 278999999999998888875 7899999775
No 275
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.75 E-value=0.2 Score=57.81 Aligned_cols=81 Identities=23% Similarity=0.187 Sum_probs=48.5
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC-CC-EEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ-GH-NINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d-Gt-~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
..|.++|+.+++. ..+..+........|+|||+.|+-.+.. |. .|.+.|+.++ ..+.+ .. .|.
T Consensus 257 ~~Iy~~dl~~g~~-~~LT~~~~~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g------------~~~rl-t~-~g~ 321 (419)
T PRK04043 257 PDIYLYDTNTKTL-TQITNYPGIDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSG------------SVEQV-VF-HGK 321 (419)
T ss_pred cEEEEEECCCCcE-EEcccCCCccCccEECCCCCEEEEEECCCCCceEEEEECCCC------------CeEeC-cc-CCC
Confidence 4688888877754 3444333323445899999987766643 33 3445565544 12222 11 122
Q ss_pred ccccEEEEEEccCCCEEEEEeCC
Q 003310 379 TNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 379 t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
...+|||||++||..+..
T Consensus 322 -----~~~~~SPDG~~Ia~~~~~ 339 (419)
T PRK04043 322 -----NNSSVSTYKNYIVYSSRE 339 (419)
T ss_pred -----cCceECCCCCEEEEEEcC
Confidence 124899999999988765
No 276
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=96.72 E-value=0.0027 Score=70.06 Aligned_cols=99 Identities=13% Similarity=0.126 Sum_probs=71.5
Q ss_pred EEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCC
Q 003310 314 IAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSN 393 (832)
Q Consensus 314 l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~ 393 (832)
.+.+.+|.+-|.+|.||.+|++||+|++|-+ ++||.+....... +.+-..+...++...|.|++|.-..+
T Consensus 49 qKD~~~H~GCiNAlqFS~N~~~L~SGGDD~~-~~~W~~de~~~~k---------~~KPI~~~~~~H~SNIF~L~F~~~N~ 118 (609)
T KOG4227|consen 49 QKDVREHTGCINALQFSHNDRFLASGGDDMH-GRVWNVDELMVRK---------TPKPIGVMEHPHRSNIFSLEFDLENR 118 (609)
T ss_pred hhhhhhhccccceeeeccCCeEEeecCCcce-eeeechHHHHhhc---------CCCCceeccCccccceEEEEEccCCe
Confidence 3456789999999999999999999999977 8999875431100 00111222122224599999999999
Q ss_pred EEEEEeCCCcEEEEecCCCCCceeeccCCC
Q 003310 394 WIMISSSRGTSHLFAINPLGGSVNFQPTDA 423 (832)
Q Consensus 394 ~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~ 423 (832)
+|.+|..++||.+-|+++...... ..|.+
T Consensus 119 ~~~SG~~~~~VI~HDiEt~qsi~V-~~~~~ 147 (609)
T KOG4227|consen 119 FLYSGERWGTVIKHDIETKQSIYV-ANENN 147 (609)
T ss_pred eEecCCCcceeEeeecccceeeee-ecccC
Confidence 999999999999999988644433 34544
No 277
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=96.66 E-value=0.0098 Score=64.09 Aligned_cols=103 Identities=14% Similarity=0.141 Sum_probs=73.1
Q ss_pred CCCCeEEEEECCCCcE--EEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 298 DNVGMVIVRDIVSKNV--IAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~--l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
...|.+.+-+.....+ ++..++|.-++...+|+.. -.++.||++||. +.-||++.. ...++.-
T Consensus 140 ~s~G~~~~v~~t~~~le~vq~wk~He~E~Wta~f~~~~pnlvytGgDD~~-l~~~D~R~p-------------~~~i~~n 205 (339)
T KOG0280|consen 140 DSRGSISGVYETEMVLEKVQTWKVHEFEAWTAKFSDKEPNLVYTGGDDGS-LSCWDIRIP-------------KTFIWHN 205 (339)
T ss_pred cCCCcEEEEecceeeeeecccccccceeeeeeecccCCCceEEecCCCce-EEEEEecCC-------------cceeeec
Confidence 3445555544444333 3478999999999999864 568899999997 899999832 1334432
Q ss_pred eccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 375 QRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
.+-++ +-|.+|.=|| +..+||+|+-|.+|++||...-+.+
T Consensus 206 ~kvH~-~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm~kP 246 (339)
T KOG0280|consen 206 SKVHT-SGVVSIYSSPPKPTYIATGSYDECIRVLDTRNMGKP 246 (339)
T ss_pred ceeee-cceEEEecCCCCCceEEEeccccceeeeehhcccCc
Confidence 23333 3477787775 7889999999999999999865443
No 278
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=96.63 E-value=0.015 Score=63.27 Aligned_cols=114 Identities=12% Similarity=0.083 Sum_probs=81.9
Q ss_pred cCCCCeEEEEECCCC---cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 297 ADNVGMVIVRDIVSK---NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~---~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
+.+...|.||..... +..++|+.|...|+.|+|+|.+..|+|++.|.+ -.||....+. .......
T Consensus 28 ~~~~~evhiy~~~~~~~w~~~htls~Hd~~vtgvdWap~snrIvtcs~drn-ayVw~~~~~~-----------~Wkptlv 95 (361)
T KOG1523|consen 28 SPNNHEVHIYSMLGADLWEPAHTLSEHDKIVTGVDWAPKSNRIVTCSHDRN-AYVWTQPSGG-----------TWKPTLV 95 (361)
T ss_pred ccCCceEEEEEecCCCCceeceehhhhCcceeEEeecCCCCceeEccCCCC-ccccccCCCC-----------eecccee
Confidence 344557888887654 478899999999999999999999999999966 6899875441 1122222
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCc
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANF 425 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~ 425 (832)
|.|- +....+|.|||.++.||+||.-..|-||=++.. ..-.+.-|..++
T Consensus 96 LlRi--NrAAt~V~WsP~enkFAVgSgar~isVcy~E~E-NdWWVsKhikkP 144 (361)
T KOG1523|consen 96 LLRI--NRAATCVKWSPKENKFAVGSGARLISVCYYEQE-NDWWVSKHIKKP 144 (361)
T ss_pred EEEe--ccceeeEeecCcCceEEeccCccEEEEEEEecc-cceehhhhhCCc
Confidence 2221 123789999999999999999999999877542 222234454444
No 279
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.62 E-value=0.01 Score=67.91 Aligned_cols=125 Identities=21% Similarity=0.264 Sum_probs=84.2
Q ss_pred cccccCCCCeEEEEECCC-------CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCC-------
Q 003310 293 HFPDADNVGMVIVRDIVS-------KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGT------- 358 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s-------~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~------- 358 (832)
.|+++..|.+|++|.++. ..+..++.+|+.+|..+.|-.|-+++|++ ||. |++||-.-+..-.
T Consensus 749 SFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~~igfL~~lr~i~Sc--D~g-iHlWDPFigr~Laq~~dapk 825 (1034)
T KOG4190|consen 749 SFISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIHDIGFLADLRSIASC--DGG-IHLWDPFIGRLLAQMEDAPK 825 (1034)
T ss_pred ceeeccCCceEEEEEeccccCccccceeeeEhhhccCcccceeeeeccceeeec--cCc-ceeecccccchhHhhhcCcc
Confidence 366788999999999864 34777888999999999999999998875 455 8999965542100
Q ss_pred CCc----------------cC-CCCceeEEE-----------EEec-cCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 359 SSA----------------CD-AGTSYVHLY-----------RLQR-GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 359 ~s~----------------~~-~~~~~~~l~-----------~l~r-G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
.++ ++ .-.+..+++ +... ...++.+.+++..+.|+|+|++-+.|+|-+.|.
T Consensus 826 ~~a~~~ikcl~nv~~~iliAgcsaeSTVKl~DaRsce~~~E~kVcna~~Pna~~R~iaVa~~GN~lAa~LSnGci~~LDa 905 (1034)
T KOG4190|consen 826 EGAGGNIKCLENVDRHILIAGCSAESTVKLFDARSCEWTCELKVCNAPGPNALTRAIAVADKGNKLAAALSNGCIAILDA 905 (1034)
T ss_pred cCCCceeEecccCcchheeeeccchhhheeeecccccceeeEEeccCCCCchheeEEEeccCcchhhHHhcCCcEEEEec
Confidence 000 00 000011111 1110 011244789999999999999999999999999
Q ss_pred CCCCCceeecc
Q 003310 410 NPLGGSVNFQP 420 (832)
Q Consensus 410 ~~~g~~~~~~~ 420 (832)
..+.-.-.+++
T Consensus 906 R~G~vINswrp 916 (1034)
T KOG4190|consen 906 RNGKVINSWRP 916 (1034)
T ss_pred CCCceeccCCc
Confidence 87654444443
No 280
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=96.48 E-value=0.039 Score=66.97 Aligned_cols=51 Identities=14% Similarity=0.232 Sum_probs=42.6
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEcCCEEEEE----------eCCEEEEEECCCCce
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCSSRVVAIC----------QAAQVHCFDAATLEI 167 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S~r~LAVa----------~~~~I~vwDl~t~~~ 167 (832)
++|-+-|+++.+.+|++.. .+.|.++...+++|+.| .|.-|.|||+++++.
T Consensus 197 G~V~LrD~~s~~~iht~~aHs~siSDfDv~GNlLitCG~S~R~~~l~~D~FvkVYDLRmmra 258 (1118)
T KOG1275|consen 197 GTVFLRDPNSFETIHTFDAHSGSISDFDVQGNLLITCGYSMRRYNLAMDPFVKVYDLRMMRA 258 (1118)
T ss_pred ceEEeecCCcCceeeeeeccccceeeeeccCCeEEEeecccccccccccchhhhhhhhhhhc
Confidence 7899999999999999985 66889999998888775 234588999998764
No 281
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=96.46 E-value=0.94 Score=47.76 Aligned_cols=99 Identities=16% Similarity=0.097 Sum_probs=62.0
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTN 380 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~ 380 (832)
|.|..++.. ++....+. .-..-+.|+|+|||+.|..+......|..|++..... . -...+.+..+..+.
T Consensus 115 g~v~~~~~~-~~~~~~~~-~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~-~------~~~~~~~~~~~~~~-- 183 (246)
T PF08450_consen 115 GSVYRIDPD-GKVTVVAD-GLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGG-E------LSNRRVFIDFPGGP-- 183 (246)
T ss_dssp EEEEEEETT-SEEEEEEE-EESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTC-C------EEEEEEEEE-SSSS--
T ss_pred cceEEECCC-CeEEEEec-CcccccceEECCcchheeecccccceeEEEecccccc-c------eeeeeeEEEcCCCC--
Confidence 677777777 44333222 2344588999999998876666555577777753200 0 00112333443221
Q ss_pred ccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 381 AVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 381 a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
...-.+++..+|++.++....+.|.+|+-.
T Consensus 184 g~pDG~~vD~~G~l~va~~~~~~I~~~~p~ 213 (246)
T PF08450_consen 184 GYPDGLAVDSDGNLWVADWGGGRIVVFDPD 213 (246)
T ss_dssp CEEEEEEEBTTS-EEEEEETTTEEEEEETT
T ss_pred cCCCcceEcCCCCEEEEEcCCCEEEEECCC
Confidence 236789999999998888888999998865
No 282
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=96.31 E-value=0.79 Score=49.68 Aligned_cols=99 Identities=18% Similarity=0.256 Sum_probs=60.7
Q ss_pred CCCCeEEEEEcCCCCEEEEEEcC-C---C------EEEEEeCCCCCC----CCCCccCCC-----CceeEEEEE----ec
Q 003310 320 HKSPISALCFDPSGILLVTASVQ-G---H------NINIFKIIPGIL----GTSSACDAG-----TSYVHLYRL----QR 376 (832)
Q Consensus 320 H~~pIs~LaFSPdG~lLATaS~d-G---t------~I~Iwdi~t~~~----~~~s~~~~~-----~~~~~l~~l----~r 376 (832)
+...|.++.|+|.-++|..|+.. . . -+.-|++..+.+ -.....+-. .....+..+ ++
T Consensus 146 yp~Gi~~~vy~p~h~LLlVgG~~~~~~~~s~a~~~GLtaWRiL~~~Pyyk~v~~~~~~~~~~~~~~~~~~~~~~~~fs~~ 225 (282)
T PF15492_consen 146 YPHGINSAVYHPKHRLLLVGGCEQNQDGMSKASSCGLTAWRILSDSPYYKQVTSSEDDITASSKRRGLLRIPSFKFFSRQ 225 (282)
T ss_pred CCCceeEEEEcCCCCEEEEeccCCCCCccccccccCceEEEEcCCCCcEEEccccCccccccccccceeeccceeeeecc
Confidence 35689999999998887766532 1 1 156787766532 011111100 011111111 12
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+...-.|..|+.||||+.||+...+|++-||++..-.....+
T Consensus 226 ~~~~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~~~~~W 267 (282)
T PF15492_consen 226 GQEQDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLRLQRSW 267 (282)
T ss_pred ccCCCceEEEEECCCCCEEEEEEcCCeEEEEecCcchhhccc
Confidence 222334999999999999999999999999999765554444
No 283
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=96.10 E-value=1.1 Score=51.77 Aligned_cols=91 Identities=13% Similarity=0.277 Sum_probs=63.5
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEE-cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTAS-VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS-~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
..+.|+++....+... ..-.+||...+|+|.+..+++.+ ..-..+.+||++.. ..+.+-.+
T Consensus 255 snLyl~~~~e~~i~V~-~~~~~pVhdf~W~p~S~~F~vi~g~~pa~~s~~~lr~N---------------l~~~~Pe~-- 316 (561)
T COG5354 255 SNLYLLRITERSIPVE-KDLKDPVHDFTWEPLSSRFAVISGYMPASVSVFDLRGN---------------LRFYFPEQ-- 316 (561)
T ss_pred ceEEEEeeccccccee-ccccccceeeeecccCCceeEEecccccceeecccccc---------------eEEecCCc--
Confidence 4788999885544333 25578999999999999999887 33334788888753 22222211
Q ss_pred cccEEEEEEccCCCEEEEEeCC---CcEEEEecCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSR---GTSHLFAINP 411 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~D---gTVhIwdl~~ 411 (832)
.=..+.|||.++|++.++-| |.|-|||...
T Consensus 317 --~rNT~~fsp~~r~il~agF~nl~gni~i~~~~~ 349 (561)
T COG5354 317 --KRNTIFFSPHERYILFAGFDNLQGNIEIFDPAG 349 (561)
T ss_pred --ccccccccCcccEEEEecCCccccceEEeccCC
Confidence 23457799999999998776 5688888743
No 284
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=96.06 E-value=1.1 Score=57.76 Aligned_cols=87 Identities=14% Similarity=0.079 Sum_probs=52.5
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec--cC----ccccEEEEEEccCCCEEEE
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR--GL----TNAVIQDISFSDDSNWIMI 397 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r--G~----t~a~I~~IaFSpDg~~LAs 397 (832)
...|+|+|||+.|..++.+.+.|++||+.++.. .....+.......++.+-. |. .-..-..|+|++||+.+++
T Consensus 742 P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~-~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVA 820 (1057)
T PLN02919 742 PSGISLSPDLKELYIADSESSSIRALDLKTGGS-RLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVA 820 (1057)
T ss_pred ccEEEEeCCCCEEEEEECCCCeEEEEECCCCcE-EEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEE
Confidence 356999999997777766666799999876410 0000000000000111100 00 0011458999999999999
Q ss_pred EeCCCcEEEEecCC
Q 003310 398 SSSRGTSHLFAINP 411 (832)
Q Consensus 398 gS~DgTVhIwdl~~ 411 (832)
-+.+++|++||...
T Consensus 821 Ds~N~rIrviD~~t 834 (1057)
T PLN02919 821 DSYNHKIKKLDPAT 834 (1057)
T ss_pred ECCCCEEEEEECCC
Confidence 99999999999865
No 285
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=96.04 E-value=0.93 Score=52.92 Aligned_cols=96 Identities=13% Similarity=0.275 Sum_probs=61.2
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC--CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ--GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d--Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
.+.|+.+..+.+...--.-..+|-+.+|.|.|..+++-+.. .+.+..|-+.+.. ....++.+|..-
T Consensus 426 n~eIfrireKdIpve~velke~vi~FaWEP~gdkF~vi~g~~~k~tvsfY~~e~~~----------~~~~lVk~~dk~-- 493 (698)
T KOG2314|consen 426 NLEIFRIREKDIPVEVVELKESVIAFAWEPHGDKFAVISGNTVKNTVSFYAVETNI----------KKPSLVKELDKK-- 493 (698)
T ss_pred eEEEEEeeccCCCceeeecchheeeeeeccCCCeEEEEEccccccceeEEEeecCC----------Cchhhhhhhccc--
Confidence 46677766654322222334689999999999998876543 3447777766420 112334444331
Q ss_pred cccEEEEEEccCCCEEEEEe---CCCcEEEEecCC
Q 003310 380 NAVIQDISFSDDSNWIMISS---SRGTSHLFAINP 411 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS---~DgTVhIwdl~~ 411 (832)
.-+.|.|||.|+|++.+. ..|.+..+|..-
T Consensus 494 --~~N~vfwsPkG~fvvva~l~s~~g~l~F~D~~~ 526 (698)
T KOG2314|consen 494 --FANTVFWSPKGRFVVVAALVSRRGDLEFYDTDY 526 (698)
T ss_pred --ccceEEEcCCCcEEEEEEecccccceEEEecch
Confidence 256789999999988764 467888888763
No 286
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=95.96 E-value=0.017 Score=64.59 Aligned_cols=95 Identities=20% Similarity=0.180 Sum_probs=73.7
Q ss_pred EEEEECCCCcEEEEeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc
Q 003310 303 VIVRDIVSKNVIAQFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA 381 (832)
Q Consensus 303 V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a 381 (832)
|++.+-.+-+.+.-+..|..-|..|+|||.-. +|..+|.+. .|+|+|+.+. .+...|..+ .
T Consensus 175 v~~l~~~~fkssq~lp~~g~~IrdlafSp~~~GLl~~asl~n-kiki~dlet~------------~~vssy~a~-----~ 236 (463)
T KOG1645|consen 175 VQKLESHDFKSSQILPGEGSFIRDLAFSPFNEGLLGLASLGN-KIKIMDLETS------------CVVSSYIAY-----N 236 (463)
T ss_pred eEEeccCCcchhhcccccchhhhhhccCccccceeeeeccCc-eEEEEecccc------------eeeeheecc-----C
Confidence 77777777777777788999999999999888 777788775 5999999886 233444442 3
Q ss_pred cEEEEEEccCCC-EEEEEeCCCcEEEEecCCCCCc
Q 003310 382 VIQDISFSDDSN-WIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 382 ~I~~IaFSpDg~-~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
.||++||.-|.. +|..|-..|.|.|||+....++
T Consensus 237 ~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~~~~~ 271 (463)
T KOG1645|consen 237 QIWSCCWDLDERHVIYAGLQNGMVLVYDMRQPEGP 271 (463)
T ss_pred CceeeeeccCCcceeEEeccCceEEEEEccCCCch
Confidence 599999987654 5667777999999999876554
No 287
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=95.69 E-value=0.54 Score=51.88 Aligned_cols=94 Identities=13% Similarity=0.205 Sum_probs=60.2
Q ss_pred CeEEEEECCCCc-EEE--EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 301 GMVIVRDIVSKN-VIA--QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 301 G~V~IwDl~s~~-~l~--~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
..|.+|++...+ .+. .+..+ ..|.+|..- +.+|+.|+.... +.++..... ...+..+.|.
T Consensus 107 ~~l~v~~l~~~~~l~~~~~~~~~-~~i~sl~~~--~~~I~vgD~~~s-v~~~~~~~~-------------~~~l~~va~d 169 (321)
T PF03178_consen 107 NKLYVYDLDNSKTLLKKAFYDSP-FYITSLSVF--KNYILVGDAMKS-VSLLRYDEE-------------NNKLILVARD 169 (321)
T ss_dssp TEEEEEEEETTSSEEEEEEE-BS-SSEEEEEEE--TTEEEEEESSSS-EEEEEEETT-------------TE-EEEEEEE
T ss_pred CEEEEEEccCcccchhhheecce-EEEEEEecc--ccEEEEEEcccC-EEEEEEEcc-------------CCEEEEEEec
Confidence 468888887766 332 22222 255555543 568888887755 677754432 1334455555
Q ss_pred CccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
..+..+.+++|-+|++.++.+..+|.++++...+
T Consensus 170 ~~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~~ 203 (321)
T PF03178_consen 170 YQPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYNP 203 (321)
T ss_dssp SS-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-S
T ss_pred CCCccEEEEEEecCCcEEEEEcCCCeEEEEEECC
Confidence 5555689999987778999999999999999965
No 288
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=95.59 E-value=0.038 Score=60.73 Aligned_cols=107 Identities=16% Similarity=0.207 Sum_probs=70.9
Q ss_pred cccccCCCCeEEEEECCCCcEE----EE------------eccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVI----AQ------------FRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL 356 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l----~~------------~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~ 356 (832)
.|+-....|.|+|-|++...+. .. |..--..|+.++||++|+||+|-+.- .++|||+.....
T Consensus 228 ~f~YSSSKGtIrLcDmR~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~sGryilsRDyl--tvk~wD~nme~~ 305 (433)
T KOG1354|consen 228 VFVYSSSKGTIRLCDMRQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHSGRYILSRDYL--TVKLWDLNMEAK 305 (433)
T ss_pred EEEEecCCCcEEEeechhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccCCcEEEEeccc--eeEEEeccccCC
Confidence 4565677899999999843210 11 12223678999999999999997654 499999954300
Q ss_pred CCCCccCCCCceeEEEEEec-----------cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 357 GTSSACDAGTSYVHLYRLQR-----------GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 357 ~~~s~~~~~~~~~~l~~l~r-----------G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
....|.++- ...-..-..++||-++.++++||-..-.|||++..+
T Consensus 306 -----------pv~t~~vh~~lr~kLc~lYEnD~IfdKFec~~sg~~~~v~TGsy~n~frvf~~~~g 361 (433)
T KOG1354|consen 306 -----------PVETYPVHEYLRSKLCSLYENDAIFDKFECSWSGNDSYVMTGSYNNVFRVFNLARG 361 (433)
T ss_pred -----------cceEEeehHhHHHHHHHHhhccchhheeEEEEcCCcceEecccccceEEEecCCCC
Confidence 011222210 000001246899999999999999999999997653
No 289
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=95.57 E-value=0.58 Score=50.39 Aligned_cols=61 Identities=13% Similarity=0.134 Sum_probs=44.1
Q ss_pred cccccCCCCeEEEEECCCCcEEEEe-----ccCCCCeEEEEEcCCCC--EEEEEEcCCCEEEEEeCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQF-----RAHKSPISALCFDPSGI--LLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~-----~aH~~pIs~LaFSPdG~--lLATaS~dGt~I~Iwdi~t~ 354 (832)
.|+.+.+||++-|||++.......+ ..|.+.|..+.|+|-|. +|.-.-.-+. ++|-|++++
T Consensus 217 ~FAv~~Qdg~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~-~hv~D~R~~ 284 (344)
T KOG4532|consen 217 QFAVVFQDGTCAIYDVRNMATPMAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSR-VHVVDTRNY 284 (344)
T ss_pred eEEEEecCCcEEEEEecccccchhhhcccCCCCCCceEEEEecCCCcceEEEEecCcce-EEEEEcccC
Confidence 5677889999999999875433222 36889999999998776 3433333344 789999886
No 290
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=95.53 E-value=1.4 Score=48.85 Aligned_cols=52 Identities=19% Similarity=0.285 Sum_probs=36.5
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
-|.|+.+|. .++.+..+..|-.--+.|+|||||+.|..+......|.-|++.
T Consensus 142 ~G~lyr~~p-~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d 193 (307)
T COG3386 142 TGSLYRVDP-DGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLD 193 (307)
T ss_pred cceEEEEcC-CCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecC
Confidence 355555554 4566666666544457799999999999998876656677665
No 291
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.51 E-value=0.09 Score=62.58 Aligned_cols=114 Identities=12% Similarity=0.070 Sum_probs=75.7
Q ss_pred CCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCC-CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 300 VGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPS-GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 300 ~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPd-G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
...|.|||+..+ .++..+++|...|..+.|..- -..+.+++.||+ ++.||...... ...+ +-
T Consensus 179 g~~i~vwd~r~gs~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d~t-vkfw~y~kSt~----------e~~~-----~v 242 (1081)
T KOG0309|consen 179 GNDIFVWDLRKGSTPLCSLKGHVSSVNSIDFNRFKYSEIMSSSNDGT-VKFWDYSKSTT----------ESKR-----TV 242 (1081)
T ss_pred CCceEEEeccCCCcceEEecccceeeehHHHhhhhhhhhcccCCCCc-eeeeccccccc----------ccce-----ec
Confidence 346999999865 689999999999999999874 446788899988 89999865410 0011 11
Q ss_pred CccccEEEEEEcc--CCCEEEEEeCCCcEEEEecCC----------CCCceeeccCCCCccccc
Q 003310 378 LTNAVIQDISFSD--DSNWIMISSSRGTSHLFAINP----------LGGSVNFQPTDANFTTKH 429 (832)
Q Consensus 378 ~t~a~I~~IaFSp--Dg~~LAsgS~DgTVhIwdl~~----------~g~~~~~~~H~~~~~~~~ 429 (832)
.+...|+--.|-| +|.++.-.-.+..+++++.+. .....+|.+|++...+|.
T Consensus 243 tt~~piw~~r~~Pfg~g~~~mp~~G~n~v~~~~c~n~d~e~n~~~~~~pVh~F~GH~D~V~eFl 306 (1081)
T KOG0309|consen 243 TTNFPIWRGRYLPFGEGYCIMPMVGGNMVPQLRCENSDLEWNVFDLNTPVHTFVGHDDVVLEFL 306 (1081)
T ss_pred cccCcceeccccccCceeEeccccCCeeeeeccccchhhhhccccCCcceeeecCcchHHHHHh
Confidence 2334577667777 344444333344666665432 223468899999877664
No 292
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=95.26 E-value=0.066 Score=62.91 Aligned_cols=77 Identities=18% Similarity=0.305 Sum_probs=62.8
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe-ccCccccEEEEEEccCCCEEEEEeCC
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ-RGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~-rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
.|--+-|+|.=.+||++-.+|. +-|.++. . ++++++. +|... ..++||.|||+.||+|-.|
T Consensus 22 ~i~~~ewnP~~dLiA~~t~~ge-lli~R~n-~--------------qRlwtip~p~~~v--~~sL~W~~DGkllaVg~kd 83 (665)
T KOG4640|consen 22 NIKRIEWNPKMDLIATRTEKGE-LLIHRLN-W--------------QRLWTIPIPGENV--TASLCWRPDGKLLAVGFKD 83 (665)
T ss_pred ceEEEEEcCccchhheeccCCc-EEEEEec-c--------------ceeEeccCCCCcc--ceeeeecCCCCEEEEEecC
Confidence 4667889999999999999998 5677775 2 5677775 45321 2599999999999999999
Q ss_pred CcEEEEecCCCCCcee
Q 003310 402 GTSHLFAINPLGGSVN 417 (832)
Q Consensus 402 gTVhIwdl~~~g~~~~ 417 (832)
|||+|-|+++++....
T Consensus 84 G~I~L~Dve~~~~l~~ 99 (665)
T KOG4640|consen 84 GTIRLHDVEKGGRLVS 99 (665)
T ss_pred CeEEEEEccCCCceec
Confidence 9999999999877666
No 293
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=95.14 E-value=0.11 Score=59.57 Aligned_cols=58 Identities=12% Similarity=0.282 Sum_probs=51.1
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCC
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIP 353 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t 353 (832)
+++...|.|-|||-.+++.|.-+++...-|+||.=.|-=-+|||++.|-. |+||--.+
T Consensus 410 vSGSDCGhIFiW~K~t~eii~~MegDr~VVNCLEpHP~~PvLAsSGid~D-VKIWTP~~ 467 (559)
T KOG1334|consen 410 VSGSDCGHIFIWDKKTGEIIRFMEGDRHVVNCLEPHPHLPVLASSGIDHD-VKIWTPLT 467 (559)
T ss_pred EecCccceEEEEecchhHHHHHhhcccceEeccCCCCCCchhhccCCccc-eeeecCCc
Confidence 46778899999999999999999988889999999998889999999955 99997543
No 294
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=95.13 E-value=0.22 Score=53.47 Aligned_cols=101 Identities=11% Similarity=0.007 Sum_probs=64.7
Q ss_pred CCCeEEEEEC--CCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 299 NVGMVIVRDI--VSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 299 ~~G~V~IwDl--~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
.|.++++.++ .+.+-..+++. -.+.+++.|+|++++++.++--. |-.|.+... ...++ +.+-
T Consensus 136 ndht~k~~~~~~~s~~~~~h~~~--~~~ns~~~snd~~~~~~Vgds~~-Vf~y~id~~------------sey~~-~~~~ 199 (344)
T KOG4532|consen 136 NDHTGKTMVVSGDSNKFAVHNQN--LTQNSLHYSNDPSWGSSVGDSRR-VFRYAIDDE------------SEYIE-NIYE 199 (344)
T ss_pred CCcceeEEEEecCcccceeeccc--cceeeeEEcCCCceEEEecCCCc-ceEEEeCCc------------cceee-eeEe
Confidence 4444555544 44443333332 13789999999999999876644 667777653 11222 2111
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
..+.-.=.+.+||.....+|+++.||++-|||+..-+-+
T Consensus 200 a~t~D~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~~~tp 238 (344)
T KOG4532|consen 200 APTSDHGFYNSFSENDLQFAVVFQDGTCAIYDVRNMATP 238 (344)
T ss_pred cccCCCceeeeeccCcceEEEEecCCcEEEEEecccccc
Confidence 122222467899999999999999999999999765433
No 295
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=95.07 E-value=2.7 Score=43.29 Aligned_cols=55 Identities=15% Similarity=0.173 Sum_probs=42.8
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEE-EEc-CCEEEEEeCCEEEEEECCCCceEEEE
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSV-RCS-SRVVAICQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV-~~S-~r~LAVa~~~~I~vwDl~t~~~~~tl 171 (832)
+.|..||..+|+.+.++.++..+... ... .++++...++.|+++|+.+++.+++.
T Consensus 46 ~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~ 102 (238)
T PF13360_consen 46 GNLYALDAKTGKVLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYALDAKTGKVLWSI 102 (238)
T ss_dssp SEEEEEETTTSEEEEEEECSSCGGSGEEEETTEEEEEETTSEEEEEETTTSCEEEEE
T ss_pred CEEEEEECCCCCEEEEeeccccccceeeecccccccccceeeeEecccCCcceeeee
Confidence 67999999999999999986553322 333 45555566789999999999999984
No 296
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=94.70 E-value=0.054 Score=37.17 Aligned_cols=28 Identities=25% Similarity=0.525 Sum_probs=25.9
Q ss_pred ccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 381 AVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 381 a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
..|.+++|.++++++++++.|+++++|+
T Consensus 13 ~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 13 GPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred CceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 3599999999999999999999999996
No 297
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=94.67 E-value=0.062 Score=69.23 Aligned_cols=121 Identities=14% Similarity=0.185 Sum_probs=87.2
Q ss_pred ccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC-----CCC--------
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL-----GTS-------- 359 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~-----~~~-------- 359 (832)
+.++..||.|++|....++.+..++ +-...|+.+.|+.+|..+..+..||. +-+|.+.+... +.-
T Consensus 2223 Yltgs~dgsv~~~~w~~~~~v~~~rt~g~s~vtr~~f~~qGnk~~i~d~dg~-l~l~q~~pk~~~s~qchnk~~~Df~Fi 2301 (2439)
T KOG1064|consen 2223 YLTGSQDGSVRMFEWGHGQQVVCFRTAGNSRVTRSRFNHQGNKFGIVDGDGD-LSLWQASPKPYTSWQCHNKALSDFRFI 2301 (2439)
T ss_pred EEecCCCceEEEEeccCCCeEEEeeccCcchhhhhhhcccCCceeeeccCCc-eeecccCCcceeccccCCccccceeee
Confidence 3577899999999999999888887 33378999999999999999999997 89999876421 110
Q ss_pred ---------CccC-----CC----CceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeec
Q 003310 360 ---------SACD-----AG----TSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 360 ---------s~~~-----~~----~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~ 419 (832)
+.++ |+ +....+.+.+ ..-+.++++-|.-+.|.+|+.+|-|.|||+....-...|+
T Consensus 2302 ~s~~~tag~s~d~~n~~lwDtl~~~~~s~v~~~H----~~gaT~l~~~P~~qllisggr~G~v~l~D~rqrql~h~~~ 2375 (2439)
T KOG1064|consen 2302 GSLLATAGRSSDNRNVCLWDTLLPPMNSLVHTCH----DGGATVLAYAPKHQLLISGGRKGEVCLFDIRQRQLRHTFQ 2375 (2439)
T ss_pred ehhhhccccCCCCCcccchhcccCcccceeeeec----CCCceEEEEcCcceEEEecCCcCcEEEeehHHHHHHHHhh
Confidence 0011 10 1111122221 1237899999999999999999999999997654444443
No 298
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=94.63 E-value=2.7 Score=47.79 Aligned_cols=57 Identities=7% Similarity=0.032 Sum_probs=42.8
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEE-EEE-cCCEEEEEeCCEEEEEECCCCceEEEEec
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYS-VRC-SSRVVAICQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~s-V~~-S~r~LAVa~~~~I~vwDl~t~~~~~tl~t 173 (832)
+.|.-+|.+||+.+.+.+....+.+ ..+ ..++++...++.|+.+|+.|++.+++...
T Consensus 130 g~l~ald~~tG~~~W~~~~~~~~~ssP~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~ 188 (394)
T PRK11138 130 GQVYALNAEDGEVAWQTKVAGEALSRPVVSDGLVLVHTSNGMLQALNESDGAVKWTVNL 188 (394)
T ss_pred CEEEEEECCCCCCcccccCCCceecCCEEECCEEEEECCCCEEEEEEccCCCEeeeecC
Confidence 6788999999999998887665543 122 34444446678999999999999888754
No 299
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.63 E-value=0.22 Score=59.51 Aligned_cols=104 Identities=18% Similarity=0.226 Sum_probs=73.8
Q ss_pred cccCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 295 PDADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
++..-+-.|..||+.+. .++..+..-....+.++|+- ++..||+ ..|+.|+|||++.+ + ..+.
T Consensus 131 atcsvdt~vh~wd~rSp~~p~ys~~~w~s~asqVkwnyk~p~vlas--shg~~i~vwd~r~g---s----------~pl~ 195 (1081)
T KOG0309|consen 131 ATCSVDTYVHAWDMRSPHRPFYSTSSWRSAASQVKWNYKDPNVLAS--SHGNDIFVWDLRKG---S----------TPLC 195 (1081)
T ss_pred eeccccccceeeeccCCCcceeeeecccccCceeeecccCcchhhh--ccCCceEEEeccCC---C----------cceE
Confidence 34456678999999985 45555554445567789986 6777765 56778999999876 1 2345
Q ss_pred EEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 373 RLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 373 ~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
.+++ + .+.|+.++|.. .-..+.+++.||||+.|+.+....+
T Consensus 196 s~K~-~-vs~vn~~~fnr~~~s~~~s~~~d~tvkfw~y~kSt~e 237 (1081)
T KOG0309|consen 196 SLKG-H-VSSVNSIDFNRFKYSEIMSSSNDGTVKFWDYSKSTTE 237 (1081)
T ss_pred Eecc-c-ceeeehHHHhhhhhhhhcccCCCCceeeecccccccc
Confidence 6654 3 35699999965 3345789999999999999875443
No 300
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=94.60 E-value=1.1 Score=50.49 Aligned_cols=52 Identities=19% Similarity=0.191 Sum_probs=42.8
Q ss_pred CCCEEEEEECCCCcEEEEEeCCCCEEEEEEc---CCEE-EEEeCCEEEEEECCCCc
Q 003310 115 VPTVVHFYSLRSQSYVHMLKFRSPIYSVRCS---SRVV-AICQAAQVHCFDAATLE 166 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S---~r~L-AVa~~~~I~vwDl~t~~ 166 (832)
..++|+|.|++|..++.+......+++.+|. .++| |--..+.|+|||++.-+
T Consensus 214 l~nkiki~dlet~~~vssy~a~~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~~~ 269 (463)
T KOG1645|consen 214 LGNKIKIMDLETSCVVSSYIAYNQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQPE 269 (463)
T ss_pred cCceEEEEecccceeeeheeccCCceeeeeccCCcceeEEeccCceEEEEEccCCC
Confidence 3489999999999999999989999999994 3444 44567899999999754
No 301
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=94.59 E-value=0.03 Score=64.30 Aligned_cols=85 Identities=21% Similarity=0.253 Sum_probs=60.6
Q ss_pred EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCC
Q 003310 313 VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDS 392 (832)
Q Consensus 313 ~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg 392 (832)
.+..|.+|+..|.+++-=.+-..+++||.|.+ +++|.+++... ..+...+.++.. .+ ...|.++.|-.|-
T Consensus 727 rL~nf~GH~~~iRai~AidNENSFiSASkDKT-VKLWSik~EgD-------~~~tsaCQfTY~-aH-kk~i~~igfL~~l 796 (1034)
T KOG4190|consen 727 RLCNFTGHQEKIRAIAAIDNENSFISASKDKT-VKLWSIKPEGD-------EIGTSACQFTYQ-AH-KKPIHDIGFLADL 796 (1034)
T ss_pred eeecccCcHHHhHHHHhcccccceeeccCCce-EEEEEeccccC-------ccccceeeeEhh-hc-cCcccceeeeecc
Confidence 35677899999988876566677899999987 89999987511 111112333332 12 2359999999999
Q ss_pred CEEEEEeCCCcEEEEec
Q 003310 393 NWIMISSSRGTSHLFAI 409 (832)
Q Consensus 393 ~~LAsgS~DgTVhIwdl 409 (832)
+++|++ ||-||+||-
T Consensus 797 r~i~Sc--D~giHlWDP 811 (1034)
T KOG4190|consen 797 RSIASC--DGGIHLWDP 811 (1034)
T ss_pred ceeeec--cCcceeecc
Confidence 888764 999999984
No 302
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=94.20 E-value=0.14 Score=55.51 Aligned_cols=104 Identities=13% Similarity=0.125 Sum_probs=70.0
Q ss_pred cccCCCCeEEEEECC-CCcEEEE-eccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 295 PDADNVGMVIVRDIV-SKNVIAQ-FRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~-s~~~l~~-~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
-++++||.+.-||++ .++.+.+ -+-|+..|.+|.=|| .+++||||+.|.+ |++||.+.- .+.+
T Consensus 182 ytGgDD~~l~~~D~R~p~~~i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~-i~~~DtRnm-------------~kPl 247 (339)
T KOG0280|consen 182 YTGGDDGSLSCWDIRIPKTFIWHNSKVHTSGVVSIYSSPPKPTYIATGSYDEC-IRVLDTRNM-------------GKPL 247 (339)
T ss_pred EecCCCceEEEEEecCCcceeeecceeeecceEEEecCCCCCceEEEeccccc-eeeeehhcc-------------cCcc
Confidence 367899999999999 4455544 668999999998876 6999999999987 999999843 0122
Q ss_pred EEEeccCccccEEEEEEccCCC-EEEEEeCCCcEEEEecCCCCCc
Q 003310 372 YRLQRGLTNAVIQDISFSDDSN-WIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~-~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
+.-. ...-||.|.++|--. .|..+....-.+|-+++....+
T Consensus 248 ~~~~---v~GGVWRi~~~p~~~~~lL~~CMh~G~ki~~~~~~~~e 289 (339)
T KOG0280|consen 248 FKAK---VGGGVWRIKHHPEIFHRLLAACMHNGAKILDSSDKVLE 289 (339)
T ss_pred ccCc---cccceEEEEecchhhhHHHHHHHhcCceEEEecccccc
Confidence 2211 112377888887422 2334445555666666654443
No 303
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=94.09 E-value=0.14 Score=60.21 Aligned_cols=59 Identities=17% Similarity=0.375 Sum_probs=50.9
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeE-EEEEcCCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPIS-ALCFDPSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs-~LaFSPdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
++....+|.|.|.-+. -+.+.+|.-|..+++ ++||.|||++||.|=.||+ |+|-|+..+
T Consensus 35 iA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kdG~-I~L~Dve~~ 94 (665)
T KOG4640|consen 35 IATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKDGT-IRLHDVEKG 94 (665)
T ss_pred hheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCCCCEEEEEecCCe-EEEEEccCC
Confidence 4455678888888887 667788988888888 9999999999999999998 999999886
No 304
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=94.09 E-value=0.27 Score=53.13 Aligned_cols=72 Identities=19% Similarity=0.288 Sum_probs=48.4
Q ss_pred EEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEE
Q 003310 327 LCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHL 406 (832)
Q Consensus 327 LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhI 406 (832)
++.+.||++||.--+ +.|.|=..+.. . .+........+.. ...-..++||||+.+||.+.+.|+|+|
T Consensus 3 ~~~~~~Gk~lAi~qd--~~iEiRsa~Dd-f---------~si~~kcqVpkD~-~PQWRkl~WSpD~tlLa~a~S~G~i~v 69 (282)
T PF15492_consen 3 LALSSDGKLLAILQD--QCIEIRSAKDD-F---------SSIIGKCQVPKDP-NPQWRKLAWSPDCTLLAYAESTGTIRV 69 (282)
T ss_pred eeecCCCcEEEEEec--cEEEEEeccCC-c---------hheeEEEecCCCC-CchheEEEECCCCcEEEEEcCCCeEEE
Confidence 677899999988643 34666555543 1 1112212222221 223678999999999999999999999
Q ss_pred EecCC
Q 003310 407 FAINP 411 (832)
Q Consensus 407 wdl~~ 411 (832)
||+..
T Consensus 70 fdl~g 74 (282)
T PF15492_consen 70 FDLMG 74 (282)
T ss_pred Eeccc
Confidence 99963
No 305
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=93.93 E-value=0.13 Score=59.69 Aligned_cols=96 Identities=20% Similarity=0.202 Sum_probs=67.6
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC--
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR-- 401 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D-- 401 (832)
=+.+.|||-|+||+|--..| |.+|--... .++.++ .++ .|+-|.|||+-+||++=|.-
T Consensus 213 etyv~wSP~GTYL~t~Hk~G--I~lWGG~~f--------------~r~~RF---~Hp-~Vq~idfSP~EkYLVT~s~~p~ 272 (698)
T KOG2314|consen 213 ETYVRWSPKGTYLVTFHKQG--IALWGGESF--------------DRIQRF---YHP-GVQFIDFSPNEKYLVTYSPEPI 272 (698)
T ss_pred eeeEEecCCceEEEEEeccc--eeeecCccH--------------HHHHhc---cCC-CceeeecCCccceEEEecCCcc
Confidence 46799999999999988887 679943221 222232 222 49999999999999987652
Q ss_pred ---------CcEEEEecCCCCCceeeccCCCCcccccCCcccccccCCCCCCC
Q 003310 402 ---------GTSHLFAINPLGGSVNFQPTDANFTTKHGAMAKSGVRWPPNLGL 445 (832)
Q Consensus 402 ---------gTVhIwdl~~~g~~~~~~~H~~~~~~~~~~~~~~~~r~~~~s~~ 445 (832)
..+.||||.++.-..+|..- ..+.+.-+++|||..-..
T Consensus 273 ~~~~~d~e~~~l~IWDI~tG~lkrsF~~~------~~~~~~WP~frWS~DdKy 319 (698)
T KOG2314|consen 273 IVEEDDNEGQQLIIWDIATGLLKRSFPVI------KSPYLKWPIFRWSHDDKY 319 (698)
T ss_pred ccCcccCCCceEEEEEccccchhcceecc------CCCccccceEEeccCCce
Confidence 46899999987666666432 124556678899877544
No 306
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=93.84 E-value=13 Score=41.07 Aligned_cols=50 Identities=6% Similarity=0.246 Sum_probs=41.2
Q ss_pred CEEEEEECCCC-------cEEEEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECCCCc
Q 003310 117 TVVHFYSLRSQ-------SYVHMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAATLE 166 (832)
Q Consensus 117 ~tVrlWDL~Tg-------~~V~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~t~~ 166 (832)
+.|.++++... +.++...++++|++|..-...|+++...+|++|++...+
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g~~l~v~~l~~~~ 118 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVGNKLYVYDLDNSK 118 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEETTEEEEEEEETTS
T ss_pred cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeecCEEEEEEccCcc
Confidence 88999999985 455666789999999986666888889999999998776
No 307
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=93.82 E-value=0.16 Score=56.15 Aligned_cols=103 Identities=17% Similarity=0.212 Sum_probs=69.9
Q ss_pred cCCCCeEEEEECCCC----cEEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 297 ADNVGMVIVRDIVSK----NVIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~----~~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
+...|.|.++|++.+ .-.++---|.+.|++|..=. ++++|.+.+-+|+ |++||.+--.++- ....
T Consensus 270 GcRngeI~~iDLR~rnqG~~~~a~rlyh~Ssvtslq~Lq~s~q~LmaS~M~gk-ikLyD~R~~K~~~---------~V~q 339 (425)
T KOG2695|consen 270 GCRNGEIFVIDLRCRNQGNGWCAQRLYHDSSVTSLQILQFSQQKLMASDMTGK-IKLYDLRATKCKK---------SVMQ 339 (425)
T ss_pred cccCCcEEEEEeeecccCCCcceEEEEcCcchhhhhhhccccceEeeccCcCc-eeEeeehhhhccc---------ceee
Confidence 456789999999865 22233335889999988655 8899999999998 9999997641110 0222
Q ss_pred EEEeccCcccc-EEEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 372 YRLQRGLTNAV-IQDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 372 ~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
| .||.+.. -.-+-..+....++++++|--.+||.+..+
T Consensus 340 Y---eGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~~g 378 (425)
T KOG2695|consen 340 Y---EGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLDSG 378 (425)
T ss_pred e---ecccccccccccccccccceEEEccCeeEEEEEecccC
Confidence 2 2443211 112334566778888999999999999764
No 308
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=93.75 E-value=0.21 Score=60.17 Aligned_cols=91 Identities=20% Similarity=0.202 Sum_probs=66.7
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
|+-+...|.|++.+.... + .+...|... +-+|.++||||.||+ +.|..+.+. .-.+.+.
T Consensus 52 ~~~GtH~g~v~~~~~~~~-~-~~~~~~s~~------~~~Gey~asCS~DGk-v~I~sl~~~------------~~~~~~d 110 (846)
T KOG2066|consen 52 FALGTHRGAVYLTTCQGN-P-KTNFDHSSS------ILEGEYVASCSDDGK-VVIGSLFTD------------DEITQYD 110 (846)
T ss_pred eeeccccceEEEEecCCc-c-ccccccccc------ccCCceEEEecCCCc-EEEeeccCC------------ccceeEe
Confidence 344667889999987643 2 444556544 789999999999998 678877665 1234466
Q ss_pred EeccCccccEEEEEEccC-----CCEEEEEeCCCcEEEEecCC
Q 003310 374 LQRGLTNAVIQDISFSDD-----SNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpD-----g~~LAsgS~DgTVhIwdl~~ 411 (832)
++| .|.+|+++|| ++.+++|+..| +.++.-+=
T Consensus 111 f~r-----piksial~Pd~~~~~sk~fv~GG~ag-lvL~er~w 147 (846)
T KOG2066|consen 111 FKR-----PIKSIALHPDFSRQQSKQFVSGGMAG-LVLSERNW 147 (846)
T ss_pred cCC-----cceeEEeccchhhhhhhheeecCcce-EEEehhhh
Confidence 654 4889999999 78899999998 77765433
No 309
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=93.73 E-value=0.75 Score=55.49 Aligned_cols=99 Identities=17% Similarity=0.278 Sum_probs=73.3
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcC---CC-CEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDP---SG-ILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVH 370 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSP---dG-~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~ 370 (832)
+.++..|.|.|||+..+..+..|..|..+|..|+|=| +. .+|+.-..-. .|-+|+..+| .+
T Consensus 83 AsaD~~GrIil~d~~~~s~~~~l~~~~~~~qdl~W~~~rd~Srd~LlaIh~ss-~lvLwntdtG--------------~k 147 (1062)
T KOG1912|consen 83 ASADISGRIILVDFVLASVINWLSHSNDSVQDLCWVPARDDSRDVLLAIHGSS-TLVLWNTDTG--------------EK 147 (1062)
T ss_pred EeccccCcEEEEEehhhhhhhhhcCCCcchhheeeeeccCcchheeEEecCCc-EEEEEEccCC--------------ce
Confidence 4456678999999999999999999999999999855 44 3444433333 4889999887 55
Q ss_pred EEEEeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEecC
Q 003310 371 LYRLQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 371 l~~l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~ 410 (832)
+|+...++ -...|+.|.| |.+.+..-+..|-+-+-+.-
T Consensus 148 ~Wk~~ys~--~iLs~f~~DPfd~rh~~~l~s~g~vl~~~~l 186 (1062)
T KOG1912|consen 148 FWKYDYSH--EILSCFRVDPFDSRHFCVLGSKGFVLSCKDL 186 (1062)
T ss_pred eeccccCC--cceeeeeeCCCCcceEEEEccCceEEEEecc
Confidence 66653332 1256688888 88899988899988777653
No 310
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=93.54 E-value=0.13 Score=59.09 Aligned_cols=93 Identities=15% Similarity=0.259 Sum_probs=66.7
Q ss_pred EECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEE
Q 003310 306 RDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQD 385 (832)
Q Consensus 306 wDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~ 385 (832)
|.-.+...-..+.+-.-|+..++|||-|++|++....| |.+|+-... ..+.+++ +..|..
T Consensus 17 ~~~~s~~~~~~~~~~~~p~~~~~~SP~G~~l~~~~~~~--V~~~~g~~~--------------~~l~~~~----~~~V~~ 76 (561)
T COG5354 17 WNSQSEVIHTRFESENWPVAYVSESPLGTYLFSEHAAG--VECWGGPSK--------------AKLVRFR----HPDVKY 76 (561)
T ss_pred ecCccccccccccccCcchhheeecCcchheehhhccc--eEEccccch--------------hheeeee----cCCcee
Confidence 55455444455555667999999999999999987775 789976543 2333332 356999
Q ss_pred EEEccCCCEEEEEeCCCc---------------EEEEecCCCCCceee
Q 003310 386 ISFSDDSNWIMISSSRGT---------------SHLFAINPLGGSVNF 418 (832)
Q Consensus 386 IaFSpDg~~LAsgS~DgT---------------VhIwdl~~~g~~~~~ 418 (832)
+.|||.++||.+=+..+. +.|||+..+.-...|
T Consensus 77 ~~fSP~~kYL~tw~~~pi~~pe~e~sp~~~~n~~~vwd~~sg~iv~sf 124 (561)
T COG5354 77 LDFSPNEKYLVTWSREPIIEPEIEISPFTSKNNVFVWDIASGMIVFSF 124 (561)
T ss_pred cccCcccceeeeeccCCccChhhccCCccccCceeEEeccCceeEeec
Confidence 999999999999887665 888998764333344
No 311
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=93.44 E-value=0.18 Score=55.74 Aligned_cols=79 Identities=28% Similarity=0.348 Sum_probs=54.2
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC---c-----------cccEEEEE
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL---T-----------NAVIQDIS 387 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~---t-----------~a~I~~Ia 387 (832)
.-|+++.|+.+|.+||||+.+|+ +-+|.-..... +.|.++... . .-+|..|.
T Consensus 26 diis~vef~~~Ge~LatGdkgGR-Vv~f~r~~~~~-------------~ey~~~t~fqshepEFDYLkSleieEKinkIr 91 (433)
T KOG1354|consen 26 DIISAVEFDHYGERLATGDKGGR-VVLFEREKLYK-------------GEYNFQTEFQSHEPEFDYLKSLEIEEKINKIR 91 (433)
T ss_pred cceeeEEeecccceEeecCCCCe-EEEeecccccc-------------cceeeeeeeeccCcccchhhhhhhhhhhhhce
Confidence 46899999999999999999998 66775332200 111111111 0 12378899
Q ss_pred EccCCC--EEEEEeCCCcEEEEecCCCCC
Q 003310 388 FSDDSN--WIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 388 FSpDg~--~LAsgS~DgTVhIwdl~~~g~ 414 (832)
|-++++ .+...+.|.||++|.+...+.
T Consensus 92 w~~~~n~a~FLlstNdktiKlWKi~er~~ 120 (433)
T KOG1354|consen 92 WLDDGNLAEFLLSTNDKTIKLWKIRERGS 120 (433)
T ss_pred ecCCCCccEEEEecCCcceeeeeeecccc
Confidence 998765 467788999999999976543
No 312
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=93.23 E-value=0.05 Score=57.99 Aligned_cols=58 Identities=22% Similarity=0.257 Sum_probs=49.4
Q ss_pred cccCCCCeEEEEECCCCc-EEEEeccCCCCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCC
Q 003310 295 PDADNVGMVIVRDIVSKN-VIAQFRAHKSPISALCFDP-SGILLVTASVQGHNINIFKIIP 353 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~-~l~~~~aH~~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t 353 (832)
..+..+|.|.|||.+... ++..+++|..+|..+-|.| ++..|.|+|.||. +--||..+
T Consensus 196 ~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedGs-lw~wdas~ 255 (319)
T KOG4714|consen 196 CCGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDGS-LWHWDAST 255 (319)
T ss_pred EEecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCCCchheeEecCCCc-EEEEcCCC
Confidence 356789999999999875 4566789999999999999 6889999999998 56788764
No 313
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=93.20 E-value=1.3 Score=46.79 Aligned_cols=97 Identities=25% Similarity=0.296 Sum_probs=58.2
Q ss_pred eEEEEECCCCcEEEEecc-----CCCCeEEEEEcCCCCEEEEEEcCCCE-----EEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 302 MVIVRDIVSKNVIAQFRA-----HKSPISALCFDPSGILLVTASVQGHN-----INIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~a-----H~~pIs~LaFSPdG~lLATaS~dGt~-----I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
.+.++|+.+++....+.. .....+.++++|+|.+.+|-+..... =+||.+... +....+
T Consensus 61 ~~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-----------~~~~~~ 129 (246)
T PF08450_consen 61 GIAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-----------GKVTVV 129 (246)
T ss_dssp CEEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-----------SEEEEE
T ss_pred ceEEEecCCCcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-----------CeEEEE
Confidence 355669888754333332 23457889999999988887654210 134544432 011222
Q ss_pred EEEeccCccccEEEEEEccCCCEEE-EEeCCCcEEEEecCCCCC
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNWIM-ISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~LA-sgS~DgTVhIwdl~~~g~ 414 (832)
..+. ...+.|+|+||++.|. +-+..+.|..|++...+.
T Consensus 130 ---~~~~--~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~ 168 (246)
T PF08450_consen 130 ---ADGL--GFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGG 168 (246)
T ss_dssp ---EEEE--SSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTC
T ss_pred ---ecCc--ccccceEECCcchheeecccccceeEEEecccccc
Confidence 1222 2368999999999775 567788899999976555
No 314
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=93.14 E-value=0.1 Score=57.53 Aligned_cols=113 Identities=16% Similarity=0.191 Sum_probs=80.3
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
+.+-.|-|-|++++.- ..|. ..+.|.++.|.-.+.++..|...|. |-..|++..-.| ..++...|..+
T Consensus 231 G~sqqv~L~nvetg~~-qsf~-sksDVfAlQf~~s~nLv~~GcRnge-I~~iDLR~rnqG---------~~~~a~rlyh~ 298 (425)
T KOG2695|consen 231 GLSQQVLLTNVETGHQ-QSFQ-SKSDVFALQFAGSDNLVFNGCRNGE-IFVIDLRCRNQG---------NGWCAQRLYHD 298 (425)
T ss_pred cccceeEEEEeecccc-cccc-cchhHHHHHhcccCCeeEecccCCc-EEEEEeeecccC---------CCcceEEEEcC
Confidence 3445678888888743 3444 5678999999999999999999997 789999875222 22333444322
Q ss_pred CccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCCCCC---ceeeccCCCCc
Q 003310 378 LTNAVIQDISFSD-DSNWIMISSSRGTSHLFAINPLGG---SVNFQPTDANF 425 (832)
Q Consensus 378 ~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~~g~---~~~~~~H~~~~ 425 (832)
..|.++..=. ++++|++++.+|+|++||+.-.++ .....+|.+..
T Consensus 299 ---Ssvtslq~Lq~s~q~LmaS~M~gkikLyD~R~~K~~~~V~qYeGHvN~~ 347 (425)
T KOG2695|consen 299 ---SSVTSLQILQFSQQKLMASDMTGKIKLYDLRATKCKKSVMQYEGHVNLS 347 (425)
T ss_pred ---cchhhhhhhccccceEeeccCcCceeEeeehhhhcccceeeeecccccc
Confidence 2355544433 567999999999999999987766 66778886643
No 315
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=93.14 E-value=1.7 Score=50.49 Aligned_cols=50 Identities=12% Similarity=0.126 Sum_probs=40.6
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEcC--CEEEEEeCC----EEEEEECCCCc
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCSS--RVVAICQAA----QVHCFDAATLE 166 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~--r~LAVa~~~----~I~vwDl~t~~ 166 (832)
..+.++|+.+++....+.|...-..-+|++ +.||.+..+ .|+++|+.+.+
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~ 273 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKN 273 (425)
T ss_pred ceEEEEeccCCccceeeccCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCc
Confidence 468999999999888888887777788874 778776543 79999999877
No 316
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=93.13 E-value=0.22 Score=59.31 Aligned_cols=102 Identities=9% Similarity=0.129 Sum_probs=73.8
Q ss_pred cCCCCeEEEEECCCCcEEE-EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIA-QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~-~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+...|.|.+|.-..+.... ...+-.+.+..++.|++..+.|.|+..|. |-||.+....+. ... -+.
T Consensus 51 GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~~~~vs~~e~lvAagt~~g~-V~v~ql~~~~p~-----------~~~-~~t 117 (726)
T KOG3621|consen 51 GSSAGSVYLYNRHTGEMRKLKNEGATGITCVRSVSSVEYLVAAGTASGR-VSVFQLNKELPR-----------DLD-YVT 117 (726)
T ss_pred ecccceEEEEecCchhhhcccccCccceEEEEEecchhHhhhhhcCCce-EEeehhhccCCC-----------cce-eec
Confidence 4567899999877765432 22233456777889999999998888887 889988764221 121 222
Q ss_pred ccCc--cccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 376 RGLT--NAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 376 rG~t--~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+++. ..+|.+++||+|++.|.+|-+.|+|+.-.++.
T Consensus 118 ~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 118 PCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred cccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence 3333 45699999999999999999999999888866
No 317
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=92.93 E-value=11 Score=44.50 Aligned_cols=57 Identities=11% Similarity=0.069 Sum_probs=39.9
Q ss_pred CEEEEEECCCCcEEEEEeCCCC-------E--EEEEE-c-CCEEEEEeCCEEEEEECCCCceEEEEec
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSP-------I--YSVRC-S-SRVVAICQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~-------V--~sV~~-S-~r~LAVa~~~~I~vwDl~t~~~~~tl~t 173 (832)
+.|.-.|++||+.+.+.+.... + ..+.+ . .++++...++.|+++|+.|++.+.....
T Consensus 71 g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~~g~v~AlD~~TG~~~W~~~~ 138 (488)
T cd00216 71 SALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTFDGRLVALDAETGKQVWKFGN 138 (488)
T ss_pred CcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecCCCeEEEEECCCCCEeeeecC
Confidence 5677789999998887765332 1 11223 2 3455556789999999999999888754
No 318
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=92.90 E-value=0.28 Score=39.26 Aligned_cols=31 Identities=13% Similarity=0.329 Sum_probs=28.4
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
.+.|..++|||...+||.++.||.|.||+++
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~ 41 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRLN 41 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence 3569999999999999999999999999993
No 319
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=92.77 E-value=11 Score=41.75 Aligned_cols=70 Identities=20% Similarity=0.347 Sum_probs=48.3
Q ss_pred CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC
Q 003310 321 KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS 400 (832)
Q Consensus 321 ~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~ 400 (832)
.+.|-+|+|+++|.++|..|-.|..+.+||..++ ..+-. ..-.++..++-.+++ |++++ -
T Consensus 216 ~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg--------------~~~~~----~~l~D~cGva~~~~~-f~~ss-G 275 (305)
T PF07433_consen 216 NGYIGSIAADRDGRLIAVTSPRGGRVAVWDAATG--------------RLLGS----VPLPDACGVAPTDDG-FLVSS-G 275 (305)
T ss_pred CCceEEEEEeCCCCEEEEECCCCCEEEEEECCCC--------------CEeec----cccCceeeeeecCCc-eEEeC-C
Confidence 4689999999999999988888888999999887 22211 122346777777777 55443 3
Q ss_pred CCcEEEEecCCC
Q 003310 401 RGTSHLFAINPL 412 (832)
Q Consensus 401 DgTVhIwdl~~~ 412 (832)
.|. ++.+...
T Consensus 276 ~G~--~~~~~~~ 285 (305)
T PF07433_consen 276 QGQ--LIRLSPD 285 (305)
T ss_pred Ccc--EEEccCc
Confidence 443 4455443
No 320
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=92.61 E-value=22 Score=40.38 Aligned_cols=92 Identities=12% Similarity=0.050 Sum_probs=55.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCC-CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKS-PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~-pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
...+|.|..+|..+++.+-....-.. ...+... .+.+|..++.+|. +.++|..++ +.+++.+
T Consensus 300 ~~~~g~l~ald~~tG~~~W~~~~~~~~~~~sp~v--~~g~l~v~~~~G~-l~~ld~~tG--------------~~~~~~~ 362 (394)
T PRK11138 300 VDQNDRVYALDTRGGVELWSQSDLLHRLLTAPVL--YNGYLVVGDSEGY-LHWINREDG--------------RFVAQQK 362 (394)
T ss_pred EcCCCeEEEEECCCCcEEEcccccCCCcccCCEE--ECCEEEEEeCCCE-EEEEECCCC--------------CEEEEEE
Confidence 34678999999999987665432111 1111112 2455677888987 788898877 4445543
Q ss_pred ccCccccEE-EEEEccCCCEEEEEeCCCcEEEEec
Q 003310 376 RGLTNAVIQ-DISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 376 rG~t~a~I~-~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
-+.. .+. ...+ .+..|.+++.||+++.|++
T Consensus 363 ~~~~--~~~s~P~~--~~~~l~v~t~~G~l~~~~~ 393 (394)
T PRK11138 363 VDSS--GFLSEPVV--ADDKLLIQARDGTVYAITR 393 (394)
T ss_pred cCCC--cceeCCEE--ECCEEEEEeCCceEEEEeC
Confidence 2211 121 1122 2347888899999988765
No 321
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=92.36 E-value=0.14 Score=54.62 Aligned_cols=94 Identities=15% Similarity=0.264 Sum_probs=62.4
Q ss_pred CeEEEEECCCCcEEE-EeccCCCCeEEEEEcCCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC
Q 003310 301 GMVIVRDIVSKNVIA-QFRAHKSPISALCFDPSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL 378 (832)
Q Consensus 301 G~V~IwDl~s~~~l~-~~~aH~~pIs~LaFSPdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~ 378 (832)
+..++|+++-.+.+. ..++- ..|+++|-.|--+ ++++|+.||- +-|||.+... -. .-++. .
T Consensus 159 d~~~a~~~~p~~t~~~~~~~~-~~v~~l~~hp~qq~~v~cgt~dg~-~~l~d~rn~~-~p----------~S~l~---a- 221 (319)
T KOG4714|consen 159 DNFYANTLDPIKTLIPSKKAL-DAVTALCSHPAQQHLVCCGTDDGI-VGLWDARNVA-MP----------VSLLK---A- 221 (319)
T ss_pred cceeeeccccccccccccccc-ccchhhhCCcccccEEEEecCCCe-EEEEEccccc-ch----------HHHHH---H-
Confidence 456667766433211 11122 3499999999655 5666777765 9999998651 00 11111 1
Q ss_pred ccccEEEEEEcc-CCCEEEEEeCCCcEEEEecCC
Q 003310 379 TNAVIQDISFSD-DSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 379 t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwdl~~ 411 (832)
+.+.|+.|-|.| ++..|.++|.||.+--||-++
T Consensus 222 hk~~i~eV~FHpk~p~~Lft~sedGslw~wdas~ 255 (319)
T KOG4714|consen 222 HKAEIWEVHFHPKNPEHLFTCSEDGSLWHWDAST 255 (319)
T ss_pred hhhhhhheeccCCCchheeEecCCCcEEEEcCCC
Confidence 235699999998 788999999999999999875
No 322
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=92.21 E-value=15 Score=47.08 Aligned_cols=97 Identities=14% Similarity=0.214 Sum_probs=59.2
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc---CCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV---QGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~---dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
..|+|||-+ +.+-..=..-.+-=.+|+|-|+|.++|+--. |+. |.+|.-. | -.+--+.+++-
T Consensus 222 RkirV~drE-g~Lns~se~~~~l~~~LsWkPsgs~iA~iq~~~sd~~-IvffErN-G------------L~hg~f~l~~p 286 (1265)
T KOG1920|consen 222 RKIRVYDRE-GALNSTSEPVEGLQHSLSWKPSGSLIAAIQCKTSDSD-IVFFERN-G------------LRHGEFVLPFP 286 (1265)
T ss_pred eeEEEeccc-chhhcccCcccccccceeecCCCCeEeeeeecCCCCc-EEEEecC-C------------ccccccccCCc
Confidence 689999987 4321111112222357999999999998643 333 6677532 2 01112333333
Q ss_pred CccccEEEEEEccCCCEEEEE---eCCCcEEEEecCCC
Q 003310 378 LTNAVIQDISFSDDSNWIMIS---SSRGTSHLFAINPL 412 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsg---S~DgTVhIwdl~~~ 412 (832)
.....|..++|+.++..||+- .....|.+|-+..|
T Consensus 287 ~de~~ve~L~Wns~sdiLAv~~~~~e~~~v~lwt~~Ny 324 (1265)
T KOG1920|consen 287 LDEKEVEELAWNSNSDILAVVTSNLENSLVQLWTTGNY 324 (1265)
T ss_pred ccccchheeeecCCCCceeeeecccccceEEEEEecCe
Confidence 332238899999999999983 33444999988654
No 323
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=92.15 E-value=0.24 Score=58.87 Aligned_cols=100 Identities=15% Similarity=0.243 Sum_probs=73.0
Q ss_pred ccccCCCCeEEEEECCCC---------------cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCC
Q 003310 294 FPDADNVGMVIVRDIVSK---------------NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGT 358 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~---------------~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~ 358 (832)
++.++.+|.++|..+.+. ..-.++.+|...|.-+.|+-.-+.|-|.+.+|- |.||=+..+ .+.
T Consensus 29 IAcgG~dGlLKVlKl~t~t~d~~~~glaa~snLsmNQtLeGH~~sV~vvTWNe~~QKLTtSDt~Gl-IiVWmlykg-sW~ 106 (1189)
T KOG2041|consen 29 IACGGADGLLKVLKLGTDTTDLNKSGLAAASNLSMNQTLEGHNASVMVVTWNENNQKLTTSDTSGL-IIVWMLYKG-SWC 106 (1189)
T ss_pred EEeccccceeEEEEccccCCcccccccccccccchhhhhccCcceEEEEEeccccccccccCCCce-EEEEeeecc-cHH
Confidence 345677888888877542 123467899999999999999999999999985 889988765 110
Q ss_pred CCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 359 SSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 359 ~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
+.+.. .| ....|.+++|..||+.+++.-.||.|.|=.
T Consensus 107 ----------EEMiN-nR--nKSvV~SmsWn~dG~kIcIvYeDGavIVGs 143 (1189)
T KOG2041|consen 107 ----------EEMIN-NR--NKSVVVSMSWNLDGTKICIVYEDGAVIVGS 143 (1189)
T ss_pred ----------HHHhh-Cc--CccEEEEEEEcCCCcEEEEEEccCCEEEEe
Confidence 11111 12 134589999999999999999998876533
No 324
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase (FBPase) into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of FBPase.
Probab=92.07 E-value=6.6 Score=48.86 Aligned_cols=59 Identities=14% Similarity=0.223 Sum_probs=45.2
Q ss_pred ccccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 292 GHFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 292 g~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
|+++-++.+|.||+||-...+....|++-..||..|..+.||++|+..+. +.+.|+++.
T Consensus 589 G~iavgs~~G~IRLyd~~g~~AKT~lp~lG~pI~~iDvt~DGkwilaTc~--tyLlLi~t~ 647 (794)
T PF08553_consen 589 GYIAVGSNKGDIRLYDRLGKRAKTALPGLGDPIIGIDVTADGKWILATCK--TYLLLIDTL 647 (794)
T ss_pred ceEEEEeCCCcEEeecccchhhhhcCCCCCCCeeEEEecCCCcEEEEeec--ceEEEEEEe
Confidence 34566778999999997666666677887899999999999997654443 347788764
No 325
>PRK02888 nitrous-oxide reductase; Validated
Probab=91.87 E-value=1.6 Score=52.56 Aligned_cols=114 Identities=12% Similarity=0.122 Sum_probs=67.4
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEE---cCCCEE-----------EEEeCCCCC----CCC
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTAS---VQGHNI-----------NIFKIIPGI----LGT 358 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS---~dGt~I-----------~Iwdi~t~~----~~~ 358 (832)
....+++.+.|..+.++..++.--.. -..++|+|||+++.+.+ ..|..+ .+|++.... .+.
T Consensus 211 ~ey~~~vSvID~etmeV~~qV~Vdgn-pd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d~~vvfni~~iea~vkdGK 289 (635)
T PRK02888 211 KKYRSLFTAVDAETMEVAWQVMVDGN-LDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERDWVVVFNIARIEEAVKAGK 289 (635)
T ss_pred cceeEEEEEEECccceEEEEEEeCCC-cccceECCCCCEEEEeccCcccCcceeeeccccCceEEEEchHHHHHhhhCCC
Confidence 45668899999999888888775443 35679999999998775 333222 223222100 000
Q ss_pred --------CCccCCCC----ceeEEEEEeccCccccEEEEEEccCCCEEEEEeC-CCcEEEEecCCCCC
Q 003310 359 --------SSACDAGT----SYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS-RGTSHLFAINPLGG 414 (832)
Q Consensus 359 --------~s~~~~~~----~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~~g~ 414 (832)
...-+... ....++.+.-| .....|++||||+++.+++. +.||.|.|+.+...
T Consensus 290 ~~~V~gn~V~VID~~t~~~~~~~v~~yIPVG---KsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~ 355 (635)
T PRK02888 290 FKTIGGSKVPVVDGRKAANAGSALTRYVPVP---KNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDD 355 (635)
T ss_pred EEEECCCEEEEEECCccccCCcceEEEEECC---CCccceEECCCCCEEEEeCCCCCcEEEEEChhhhh
Confidence 00000000 00122222222 22568999999999877665 89999999988553
No 326
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=91.39 E-value=0.42 Score=52.17 Aligned_cols=105 Identities=17% Similarity=0.256 Sum_probs=69.4
Q ss_pred cccccCCCCeEEEEECCCCc----------------EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKN----------------VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGIL 356 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~----------------~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~ 356 (832)
+|.-....|.|+|-|++... -+.-|..-.+.|+.++|+++|+|+++-+.- .++|||+....
T Consensus 236 ~fmYSsSkG~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdyl--tvkiwDvnm~k- 312 (460)
T COG5170 236 VFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDYL--TVKIWDVNMAK- 312 (460)
T ss_pred eEEEecCCCcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEeccc--eEEEEeccccc-
Confidence 34445678999999997321 111223335789999999999999987654 48999997641
Q ss_pred CCCCccCCCCceeEEEEE--e---c-----cCccccE---EEEEEccCCCEEEEEeCCCcEEEEecCCC
Q 003310 357 GTSSACDAGTSYVHLYRL--Q---R-----GLTNAVI---QDISFSDDSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 357 ~~~s~~~~~~~~~~l~~l--~---r-----G~t~a~I---~~IaFSpDg~~LAsgS~DgTVhIwdl~~~ 412 (832)
..+.+. + + -+..-.| ..|.||-|.+.+.+||-....-||...+.
T Consensus 313 ------------~pikTi~~h~~l~~~l~d~YEnDaifdkFeisfSgd~~~v~sgsy~NNfgiyp~~ss 369 (460)
T COG5170 313 ------------NPIKTIPMHCDLMDELNDVYENDAIFDKFEISFSGDDKHVLSGSYSNNFGIYPTDSS 369 (460)
T ss_pred ------------CCceeechHHHHHHHHHhhhhccceeeeEEEEecCCcccccccccccceeeeccccC
Confidence 111111 0 0 0000012 35889999999999999999888885543
No 327
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=90.99 E-value=0.64 Score=53.06 Aligned_cols=117 Identities=15% Similarity=0.157 Sum_probs=78.1
Q ss_pred CCCCeEEEEECCCCc-EEEEe-ccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCC-CCCCCCccCCCC-ceeEEEE
Q 003310 298 DNVGMVIVRDIVSKN-VIAQF-RAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPG-ILGTSSACDAGT-SYVHLYR 373 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~-~l~~~-~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~-~~~~~s~~~~~~-~~~~l~~ 373 (832)
..+|.|.|+|-.... ++..| +-|.+||.++.++|-|....+....| .|.-|..... +.-.. .-.++. .-.-||.
T Consensus 119 ~~sg~i~VvD~~~d~~q~~~fkklH~sPV~~i~y~qa~Ds~vSiD~~g-mVEyWs~e~~~qfPr~-~l~~~~K~eTdLy~ 196 (558)
T KOG0882|consen 119 FKSGKIFVVDGFGDFCQDGYFKKLHFSPVKKIRYNQAGDSAVSIDISG-MVEYWSAEGPFQFPRT-NLNFELKHETDLYG 196 (558)
T ss_pred ccCCCcEEECCcCCcCccceecccccCceEEEEeeccccceeeccccc-eeEeecCCCcccCccc-cccccccccchhhc
Confidence 456889999976543 34444 47999999999999999999988877 4999987631 10000 000000 0011222
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+..- .....++.|||+|..+++-+.|.+|++|++.+++-...+
T Consensus 197 f~K~--Kt~pts~Efsp~g~qistl~~DrkVR~F~~KtGklvqei 239 (558)
T KOG0882|consen 197 FPKA--KTEPTSFEFSPDGAQISTLNPDRKVRGFVFKTGKLVQEI 239 (558)
T ss_pred cccc--ccCccceEEccccCcccccCcccEEEEEEeccchhhhhh
Confidence 2211 123789999999999999999999999999876554443
No 328
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source, and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=90.69 E-value=0.5 Score=52.83 Aligned_cols=58 Identities=26% Similarity=0.340 Sum_probs=45.6
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc----CCEEEEEe-CCEEEEEECCCCceEEEEecC
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS----SRVVAICQ-AAQVHCFDAATLEIEYAILTN 174 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S----~r~LAVa~-~~~I~vwDl~t~~~~~tl~t~ 174 (832)
+.|-+||++|++.+..+....++.+|..+ +.++++.. ++.+.|||+.|++.++++..-
T Consensus 269 teVWv~D~~t~krv~Ri~l~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk~~~~~~~l 331 (342)
T PF06433_consen 269 TEVWVYDLKTHKRVARIPLEHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATGKLVRSIEQL 331 (342)
T ss_dssp EEEEEEETTTTEEEEEEEEEEEESEEEEESSSS-EEEEEETTTTEEEEEETTT--EEEEE---
T ss_pred eEEEEEECCCCeEEEEEeCCCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCCcEEeehhcc
Confidence 78999999999999999998889899997 35556654 679999999999999998753
No 329
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=90.65 E-value=0.13 Score=62.46 Aligned_cols=106 Identities=11% Similarity=0.179 Sum_probs=81.0
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCC-EEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGH-NINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt-~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
+++-+...|.|+++++.+|.......+|.++|+-|.=|.||.+++|.|.-.. ..-+|++... .+ .+|
T Consensus 1115 hL~vG~~~Geik~~nv~sG~~e~s~ncH~SavT~vePs~dgs~~Ltsss~S~PlsaLW~~~s~-------~~----~~H- 1182 (1516)
T KOG1832|consen 1115 HLAVGSHAGEIKIFNVSSGSMEESVNCHQSAVTLVEPSVDGSTQLTSSSSSSPLSALWDASST-------GG----PRH- 1182 (1516)
T ss_pred eEEeeeccceEEEEEccCccccccccccccccccccccCCcceeeeeccccCchHHHhccccc-------cC----ccc-
Confidence 4556678899999999999999999999999999999999998887665433 4568987642 11 123
Q ss_pred EEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCcee
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVN 417 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~ 417 (832)
.+ ++ =.++.||...++-+.|+.....+|||+.+.....+
T Consensus 1183 -sf-~e-----d~~vkFsn~~q~r~~gt~~d~a~~YDvqT~~~l~t 1221 (1516)
T KOG1832|consen 1183 -SF-DE-----DKAVKFSNSLQFRALGTEADDALLYDVQTCSPLQT 1221 (1516)
T ss_pred -cc-cc-----cceeehhhhHHHHHhcccccceEEEecccCcHHHH
Confidence 33 12 23678998888888888888999999998655444
No 330
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=90.37 E-value=4.9 Score=46.44 Aligned_cols=96 Identities=15% Similarity=0.154 Sum_probs=71.7
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
-|++-|.+.-.+-.+. -+|.+.|....+.-++.-++.|..||..+-|+|..++ ...++..+.
T Consensus 340 RGkaFi~~~~~~~~iq--v~~~~~VrY~r~~~~~e~~vigt~dgD~l~iyd~~~~---------------e~kr~e~~l- 401 (668)
T COG4946 340 RGKAFIMRPWDGYSIQ--VGKKGGVRYRRIQVDPEGDVIGTNDGDKLGIYDKDGG---------------EVKRIEKDL- 401 (668)
T ss_pred cCcEEEECCCCCeeEE--cCCCCceEEEEEccCCcceEEeccCCceEEEEecCCc---------------eEEEeeCCc-
Confidence 3455554443333221 2577789999999999999999999988999999876 223343343
Q ss_pred cccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 380 NAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
..|..+..||||++++.+..+..+-+.|+.++..
T Consensus 402 -g~I~av~vs~dGK~~vvaNdr~el~vididngnv 435 (668)
T COG4946 402 -GNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNV 435 (668)
T ss_pred -cceEEEEEcCCCcEEEEEcCceEEEEEEecCCCe
Confidence 3499999999999999999999999999987543
No 331
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=90.29 E-value=0.72 Score=36.91 Aligned_cols=29 Identities=17% Similarity=0.456 Sum_probs=27.0
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
.+|.+++|+|...+||.++.+|. |.|+++
T Consensus 12 ~~v~~~~w~P~mdLiA~~t~~g~-v~v~Rl 40 (47)
T PF12894_consen 12 SRVSCMSWCPTMDLIALGTEDGE-VLVYRL 40 (47)
T ss_pred CcEEEEEECCCCCEEEEEECCCe-EEEEEC
Confidence 57999999999999999999998 889988
No 332
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=89.94 E-value=0.59 Score=54.98 Aligned_cols=58 Identities=17% Similarity=0.270 Sum_probs=46.6
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
++-+..||.|++||...+. .++..+.-..+.++|+|+|.+++.|+..|. |.+||+.-.
T Consensus 274 LvlGC~DgSiiLyD~~~~~--t~~~ka~~~P~~iaWHp~gai~~V~s~qGe-lQ~FD~ALs 331 (545)
T PF11768_consen 274 LVLGCEDGSIILYDTTRGV--TLLAKAEFIPTLIAWHPDGAIFVVGSEQGE-LQCFDMALS 331 (545)
T ss_pred EEEEecCCeEEEEEcCCCe--eeeeeecccceEEEEcCCCcEEEEEcCCce-EEEEEeecC
Confidence 4567899999999988763 233333445688999999999999999997 899998765
No 333
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.57 E-value=0.75 Score=57.35 Aligned_cols=98 Identities=14% Similarity=0.350 Sum_probs=59.3
Q ss_pred cCCCCeEEEEECCCC-cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIVSK-NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~-~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+...|.|...|.... .+...=+.-++||++++|+.||++|+.|=.+|. |.+||+..+ ...++.+..
T Consensus 105 ~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~G~-V~v~D~~~~------------k~l~~i~e~ 171 (1206)
T KOG2079|consen 105 GTSHGHVLLSDMTGNLGPLHQNERVQGPVTSVAFNQDGSLLLAGLGDGH-VTVWDMHRA------------KILKVITEH 171 (1206)
T ss_pred EcCchhhhhhhhhcccchhhcCCccCCcceeeEecCCCceeccccCCCc-EEEEEccCC------------cceeeeeec
Confidence 344566777676542 222222233579999999999999999999998 899999875 122333332
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
|....-|.-+-+..++..+.++-..|. +|.+.
T Consensus 172 -~ap~t~vi~v~~t~~nS~llt~D~~Gs--f~~lv 203 (1206)
T KOG2079|consen 172 -GAPVTGVIFVGRTSQNSKLLTSDTGGS--FWKLV 203 (1206)
T ss_pred -CCccceEEEEEEeCCCcEEEEccCCCc--eEEEE
Confidence 211111333444445556666666665 66653
No 334
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=88.92 E-value=0.41 Score=53.75 Aligned_cols=60 Identities=23% Similarity=0.327 Sum_probs=50.3
Q ss_pred cccccCCCCeEEEEECCCCcEEEEec-cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFR-AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
++.+++.|+.|+|-....--.+..|. +|+.-|+.|+.-+ +.+|++||.|++ +++||+..+
T Consensus 165 ~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~isl~~-~~~LlS~sGD~t-lr~Wd~~sg 225 (390)
T KOG3914|consen 165 FIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTISLTD-NYLLLSGSGDKT-LRLWDITSG 225 (390)
T ss_pred EEEEecCCceEEEEecCcccchhhhccccHhheeeeeecc-CceeeecCCCCc-EEEEecccC
Confidence 45678899999997777666777776 7999999999865 455999999998 899999987
No 335
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=88.87 E-value=0.97 Score=50.75 Aligned_cols=101 Identities=12% Similarity=0.165 Sum_probs=57.2
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc-
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT- 379 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t- 379 (832)
+.+.|||+.+++....... ...+....|||||+.||-... +.|.++++.++.......++ ... . + -|..
T Consensus 23 ~~y~i~d~~~~~~~~l~~~-~~~~~~~~~sP~g~~~~~v~~--~nly~~~~~~~~~~~lT~dg-~~~--i-~---nG~~d 92 (353)
T PF00930_consen 23 GDYYIYDIETGEITPLTPP-PPKLQDAKWSPDGKYIAFVRD--NNLYLRDLATGQETQLTTDG-EPG--I-Y---NGVPD 92 (353)
T ss_dssp EEEEEEETTTTEEEESS-E-ETTBSEEEE-SSSTEEEEEET--TEEEEESSTTSEEEESES---TTT--E-E---ESB--
T ss_pred eeEEEEecCCCceEECcCC-ccccccceeecCCCeeEEEec--CceEEEECCCCCeEEecccc-cee--E-E---cCccc
Confidence 5799999999865433333 567889999999999998864 35888887654110000000 000 0 0 0110
Q ss_pred -------cccEEEEEEccCCCEEEEEeCC-CcEEEEecCC
Q 003310 380 -------NAVIQDISFSDDSNWIMISSSR-GTSHLFAINP 411 (832)
Q Consensus 380 -------~a~I~~IaFSpDg~~LAsgS~D-gTVhIwdl~~ 411 (832)
-..-..+-|||||++||....| ..|+.+.+-.
T Consensus 93 wvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~~ 132 (353)
T PF00930_consen 93 WVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLPD 132 (353)
T ss_dssp HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEEE
T ss_pred eeccccccccccceEECCCCCEEEEEEECCcCCceEEeec
Confidence 0011347799999999987665 4456555543
No 336
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in the coatomer alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=87.67 E-value=28 Score=40.83 Aligned_cols=58 Identities=19% Similarity=0.289 Sum_probs=37.5
Q ss_pred CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 333 GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 333 G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
|.+|+..+.+ .|.+||..++ ..+.++. ...|..|.||+||+++|..+.+ ++.|++.+.
T Consensus 117 G~LL~~~~~~--~i~~yDw~~~--------------~~i~~i~----v~~vk~V~Ws~~g~~val~t~~-~i~il~~~~ 174 (443)
T PF04053_consen 117 GNLLGVKSSD--FICFYDWETG--------------KLIRRID----VSAVKYVIWSDDGELVALVTKD-SIYILKYNL 174 (443)
T ss_dssp SSSEEEEETT--EEEEE-TTT----------------EEEEES----S-E-EEEEE-TTSSEEEEE-S--SEEEEEE-H
T ss_pred CcEEEEECCC--CEEEEEhhHc--------------ceeeEEe----cCCCcEEEEECCCCEEEEEeCC-eEEEEEecc
Confidence 9988888766 3899999876 3444442 2238899999999999999855 777876643
No 337
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=87.01 E-value=1.5 Score=34.49 Aligned_cols=32 Identities=28% Similarity=0.415 Sum_probs=26.1
Q ss_pred CCeEEEEEcCC-C--CEEEEEEcCCCEEEEEeCCCC
Q 003310 322 SPISALCFDPS-G--ILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 322 ~pIs~LaFSPd-G--~lLATaS~dGt~I~Iwdi~t~ 354 (832)
+.|.+++|||+ + .+||-+-..|. |+|+|++..
T Consensus 1 GAvR~~kFsP~~~~~DLL~~~E~~g~-vhi~D~R~~ 35 (43)
T PF10313_consen 1 GAVRCCKFSPEPGGNDLLAWAEHQGR-VHIVDTRSN 35 (43)
T ss_pred CCeEEEEeCCCCCcccEEEEEccCCe-EEEEEcccC
Confidence 46899999985 4 48998888887 899999853
No 338
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=86.95 E-value=66 Score=37.22 Aligned_cols=47 Identities=17% Similarity=0.345 Sum_probs=39.5
Q ss_pred CCEEEEEECCCCcEEEEEeCC-CCEEEEEEc--CCEEEEEeCCEEEEEECC
Q 003310 116 PTVVHFYSLRSQSYVHMLKFR-SPIYSVRCS--SRVVAICQAAQVHCFDAA 163 (832)
Q Consensus 116 ~~tVrlWDL~Tg~~V~tL~f~-s~V~sV~~S--~r~LAVa~~~~I~vwDl~ 163 (832)
|..|+||+. +|+.+.++.++ +.|..+.|+ .++|+|..++.+++||+.
T Consensus 60 p~~I~iys~-sG~ll~~i~w~~~~iv~~~wt~~e~LvvV~~dG~v~vy~~~ 109 (410)
T PF04841_consen 60 PNSIQIYSS-SGKLLSSIPWDSGRIVGMGWTDDEELVVVQSDGTVRVYDLF 109 (410)
T ss_pred CcEEEEECC-CCCEeEEEEECCCCEEEEEECCCCeEEEEEcCCEEEEEeCC
Confidence 347999997 67889999884 689999996 477888899999999986
No 339
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=86.92 E-value=15 Score=39.17 Aligned_cols=43 Identities=19% Similarity=0.111 Sum_probs=36.6
Q ss_pred EEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECCCCceEEEEec
Q 003310 131 HMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 131 ~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~t~~~~~tl~t 173 (832)
.+|++.+.+.++.+...+|.+..++.|.||++.++++++++..
T Consensus 223 ~~i~W~~~p~~~~~~~pyli~~~~~~iEV~~~~~~~lvQ~i~~ 265 (275)
T PF00780_consen 223 STIQWSSAPQSVAYSSPYLIAFSSNSIEVRSLETGELVQTIPL 265 (275)
T ss_pred cEEEcCCchhEEEEECCEEEEECCCEEEEEECcCCcEEEEEEC
Confidence 3778888888999987777777778899999999999999865
No 340
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of the coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=85.91 E-value=63 Score=38.69 Aligned_cols=78 Identities=15% Similarity=0.342 Sum_probs=49.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT 379 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t 379 (832)
-|.+.=+|+.+++.+-.++......... +.-.|.+++.++.+|. ++.+|..++ +.|++.+-|..
T Consensus 440 ~g~l~AiD~~tGk~~W~~~~~~p~~~~~-l~t~g~lvf~g~~~G~-l~a~D~~TG--------------e~lw~~~~g~~ 503 (527)
T TIGR03075 440 MGSLIAWDPITGKIVWEHKEDFPLWGGV-LATAGDLVFYGTLEGY-FKAFDAKTG--------------EELWKFKTGSG 503 (527)
T ss_pred ceeEEEEeCCCCceeeEecCCCCCCCcc-eEECCcEEEEECCCCe-EEEEECCCC--------------CEeEEEeCCCC
Confidence 4788889999999988776432222221 2225557777888987 899999998 66777654421
Q ss_pred cccEEEEEEccCCCE
Q 003310 380 NAVIQDISFSDDSNW 394 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~ 394 (832)
..-.=+.|.-||+.
T Consensus 504 -~~a~P~ty~~~G~q 517 (527)
T TIGR03075 504 -IVGPPVTYEQDGKQ 517 (527)
T ss_pred -ceecCEEEEeCCEE
Confidence 11122444456654
No 341
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=85.79 E-value=7 Score=37.09 Aligned_cols=65 Identities=17% Similarity=0.215 Sum_probs=44.6
Q ss_pred eEEEEEc---CCCC-EEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe
Q 003310 324 ISALCFD---PSGI-LLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS 399 (832)
Q Consensus 324 Is~LaFS---PdG~-lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS 399 (832)
|++|++. -||. .|+.||.|.. ||||+-.. .++++.- ...|..++=... ..+|.+.
T Consensus 2 V~al~~~d~d~dg~~eLlvGs~D~~-IRvf~~~e----------------~~~Ei~e---~~~v~~L~~~~~-~~F~Y~l 60 (111)
T PF14783_consen 2 VTALCLFDFDGDGENELLVGSDDFE-IRVFKGDE----------------IVAEITE---TDKVTSLCSLGG-GRFAYAL 60 (111)
T ss_pred eeEEEEEecCCCCcceEEEecCCcE-EEEEeCCc----------------EEEEEec---ccceEEEEEcCC-CEEEEEe
Confidence 5666654 3544 7889999965 99998543 3445432 235777777666 5688999
Q ss_pred CCCcEEEEec
Q 003310 400 SRGTSHLFAI 409 (832)
Q Consensus 400 ~DgTVhIwdl 409 (832)
..|||-||+-
T Consensus 61 ~NGTVGvY~~ 70 (111)
T PF14783_consen 61 ANGTVGVYDR 70 (111)
T ss_pred cCCEEEEEeC
Confidence 9999888754
No 342
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=85.65 E-value=2.7 Score=48.77 Aligned_cols=95 Identities=26% Similarity=0.395 Sum_probs=59.2
Q ss_pred CeEEEEECCCCc--EEEEeccCCCCeEEEEEcCCCCEEEEE-EcCCCEEEEE--eCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 301 GMVIVRDIVSKN--VIAQFRAHKSPISALCFDPSGILLVTA-SVQGHNINIF--KIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 301 G~V~IwDl~s~~--~l~~~~aH~~pIs~LaFSPdG~lLATa-S~dGt~I~Iw--di~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
..+.++|+.+++ .+..+.++.. .-+|||||++||-+ ..||. ..|| |+... .+.+|.
T Consensus 218 ~~i~~~~l~~g~~~~i~~~~g~~~---~P~fspDG~~l~f~~~rdg~-~~iy~~dl~~~---------------~~~~Lt 278 (425)
T COG0823 218 PRIYYLDLNTGKRPVILNFNGNNG---APAFSPDGSKLAFSSSRDGS-PDIYLMDLDGK---------------NLPRLT 278 (425)
T ss_pred ceEEEEeccCCccceeeccCCccC---CccCCCCCCEEEEEECCCCC-ccEEEEcCCCC---------------cceecc
Confidence 468999998865 4445555543 46899999988754 44554 4555 55443 122233
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCC-CcEEEEecCCCCCce
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSR-GTSHLFAINPLGGSV 416 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~D-gTVhIwdl~~~g~~~ 416 (832)
.+... -..=.|||||++|+-.|++ |.-.||-+...++.+
T Consensus 279 ~~~gi--~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~ 318 (425)
T COG0823 279 NGFGI--NTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQV 318 (425)
T ss_pred cCCcc--ccCccCCCCCCEEEEEeCCCCCcceEEECCCCCce
Confidence 33221 1255799999999987775 456777776665543
No 343
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=85.24 E-value=0.7 Score=56.15 Aligned_cols=76 Identities=21% Similarity=0.346 Sum_probs=60.0
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCc
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGT 403 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgT 403 (832)
+++|||.|.--.||.+=..|- +.+|...+. ..++..- .+++.|+.+.|||||..++++-.-|.
T Consensus 62 atSLCWHpe~~vLa~gwe~g~-~~v~~~~~~---------------e~htv~~-th~a~i~~l~wS~~G~~l~t~d~~g~ 124 (1416)
T KOG3617|consen 62 ATSLCWHPEEFVLAQGWEMGV-SDVQKTNTT---------------ETHTVVE-THPAPIQGLDWSHDGTVLMTLDNPGS 124 (1416)
T ss_pred hhhhccChHHHHHhhccccce-eEEEecCCc---------------eeeeecc-CCCCCceeEEecCCCCeEEEcCCCce
Confidence 456999999999999888886 899987653 2233322 23577999999999999999999999
Q ss_pred EEEEecCCCCCce
Q 003310 404 SHLFAINPLGGSV 416 (832)
Q Consensus 404 VhIwdl~~~g~~~ 416 (832)
+|+|.+.--|...
T Consensus 125 v~lwr~d~~g~~q 137 (1416)
T KOG3617|consen 125 VHLWRYDVIGEIQ 137 (1416)
T ss_pred eEEEEeeeccccc
Confidence 9999997665443
No 344
>PRK02888 nitrous-oxide reductase; Validated
Probab=84.71 E-value=6.4 Score=47.59 Aligned_cols=106 Identities=10% Similarity=0.009 Sum_probs=70.0
Q ss_pred CCeEEEEECCC-----CcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEE
Q 003310 300 VGMVIVRDIVS-----KNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRL 374 (832)
Q Consensus 300 ~G~V~IwDl~s-----~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l 374 (832)
++.|.|.|..+ .+.+..+.--.. ...|++||||++|+.++.-...+.|+|+......-...-...... ..+.
T Consensus 295 gn~V~VID~~t~~~~~~~v~~yIPVGKs-PHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~v--vaev 371 (635)
T PRK02888 295 GSKVPVVDGRKAANAGSALTRYVPVPKN-PHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAV--VAEP 371 (635)
T ss_pred CCEEEEEECCccccCCcceEEEEECCCC-ccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceE--EEee
Confidence 46799999998 466666664433 367899999999988877655699999977410000000000011 1222
Q ss_pred eccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 375 QRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 375 ~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
.-|.. -...+|.++|+-..+-..|..|-.|++.+
T Consensus 372 evGlG---PLHTaFDg~G~aytslf~dsqv~kwn~~~ 405 (635)
T PRK02888 372 ELGLG---PLHTAFDGRGNAYTTLFLDSQIVKWNIEA 405 (635)
T ss_pred ccCCC---cceEEECCCCCEEEeEeecceeEEEehHH
Confidence 22433 24588999999888888999999999976
No 345
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is overexpressed in tumour cells [].
Probab=84.26 E-value=18 Score=44.81 Aligned_cols=91 Identities=13% Similarity=0.240 Sum_probs=57.5
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCcc-CCCCceeE-EEE----EeccCccccEEEEEEccC---C
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSAC-DAGTSYVH-LYR----LQRGLTNAVIQDISFSDD---S 392 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~-~~~~~~~~-l~~----l~rG~t~a~I~~IaFSpD---g 392 (832)
..|..|.+||+|++||-++..| |-|-.+... .|..+.. ++.....+ .+. +.+......|..+.|.|. +
T Consensus 85 f~v~~i~~n~~g~~lal~G~~~--v~V~~LP~r-~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~ 161 (717)
T PF10168_consen 85 FEVHQISLNPTGSLLALVGPRG--VVVLELPRR-WGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESD 161 (717)
T ss_pred eeEEEEEECCCCCEEEEEcCCc--EEEEEeccc-cCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCC
Confidence 4688899999999999999886 456666421 1111110 11111111 111 112223345899999987 5
Q ss_pred CEEEEEeCCCcEEEEecCCCCCc
Q 003310 393 NWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 393 ~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
.+|++=++|+++++||+.....+
T Consensus 162 ~~l~vLtsdn~lR~y~~~~~~~p 184 (717)
T PF10168_consen 162 SHLVVLTSDNTLRLYDISDPQHP 184 (717)
T ss_pred CeEEEEecCCEEEEEecCCCCCC
Confidence 89999999999999999765443
No 346
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=83.99 E-value=89 Score=36.90 Aligned_cols=22 Identities=14% Similarity=0.196 Sum_probs=18.2
Q ss_pred CCCEEEEEeCCCcEEEEecCCC
Q 003310 391 DSNWIMISSSRGTSHLFAINPL 412 (832)
Q Consensus 391 Dg~~LAsgS~DgTVhIwdl~~~ 412 (832)
.+..|.+++.||.++.+|..++
T Consensus 405 ~g~~v~~g~~dG~l~ald~~tG 426 (488)
T cd00216 405 AGNLVFAGAADGYFRAFDATTG 426 (488)
T ss_pred cCCeEEEECCCCeEEEEECCCC
Confidence 4567888899999999998774
No 347
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=83.72 E-value=16 Score=41.57 Aligned_cols=95 Identities=15% Similarity=0.231 Sum_probs=67.2
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcC--CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQ--GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~d--Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
.....|.+.|..+.+.+..+..-. .-..++|+|+|+.+..+... ...+.+.|..+. ..+.+..
T Consensus 93 ~~~~~v~vid~~~~~~~~~~~vG~-~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~--------------~~~~~~~ 157 (381)
T COG3391 93 GDSNTVSVIDTATNTVLGSIPVGL-GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATN--------------KVTATIP 157 (381)
T ss_pred CCCCeEEEEcCcccceeeEeeecc-CCceEEECCCCCEEEEEecccCCceEEEEeCCCC--------------eEEEEEe
Confidence 346789999988888777665333 33579999999888777662 344777777665 3333455
Q ss_pred ccCccccEEEEEEccCCCEEEEEe-CCCcEEEEecC
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAIN 410 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~ 410 (832)
.|..+ ..++|+|+|+.+.... .++++.+++.+
T Consensus 158 vG~~P---~~~a~~p~g~~vyv~~~~~~~v~vi~~~ 190 (381)
T COG3391 158 VGNTP---TGVAVDPDGNKVYVTNSDDNTVSVIDTS 190 (381)
T ss_pred cCCCc---ceEEECCCCCeEEEEecCCCeEEEEeCC
Confidence 56533 7899999999666555 78999999954
No 348
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=83.45 E-value=7.2 Score=45.16 Aligned_cols=91 Identities=18% Similarity=0.325 Sum_probs=59.6
Q ss_pred EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEE-EccC
Q 003310 313 VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDIS-FSDD 391 (832)
Q Consensus 313 ~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~Ia-FSpD 391 (832)
+...|....-.+.+|+.+|+|++.|+.+.-|+ |.++|+..+ .+.++.+|+..|...-+. ....
T Consensus 299 ~r~~l~D~~R~~~~i~~sP~~~laA~tDslGR-V~LiD~~~~---------------~vvrmWKGYRdAqc~wi~~~~~~ 362 (415)
T PF14655_consen 299 MRFGLPDSKREGESICLSPSGRLAAVTDSLGR-VLLIDVARG---------------IVVRMWKGYRDAQCGWIEVPEEG 362 (415)
T ss_pred eEEeeccCCceEEEEEECCCCCEEEEEcCCCc-EEEEECCCC---------------hhhhhhccCccceEEEEEeeccc
Confidence 34455555567899999999999999888898 789999886 222333455555422111 1111
Q ss_pred ----------------CCEEEE-EeCCCcEEEEecCCCCCceeec
Q 003310 392 ----------------SNWIMI-SSSRGTSHLFAINPLGGSVNFQ 419 (832)
Q Consensus 392 ----------------g~~LAs-gS~DgTVhIwdl~~~g~~~~~~ 419 (832)
..+|++ +-.+|.+-||.+..+.....++
T Consensus 363 ~~~~~~~~~~~~~~~~~l~LvIyaprRg~lEvW~~~~g~Rv~a~~ 407 (415)
T PF14655_consen 363 DRDRSNSNSPKSSSRFALFLVIYAPRRGILEVWSMRQGPRVAAFN 407 (415)
T ss_pred ccccccccccCCCCcceEEEEEEeccCCeEEEEecCCCCEEEEEE
Confidence 234444 6668999999998865555553
No 349
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=83.27 E-value=16 Score=39.48 Aligned_cols=81 Identities=15% Similarity=0.245 Sum_probs=45.4
Q ss_pred eccCCCCeEEEEEcCCCCEE-EEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEE
Q 003310 317 FRAHKSPISALCFDPSGILL-VTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWI 395 (832)
Q Consensus 317 ~~aH~~pIs~LaFSPdG~lL-ATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~L 395 (832)
+.+-...++.|+|+|+...| |.....+. |...+.. + .....+.| .| ....-+|++.-+++++
T Consensus 17 l~g~~~e~SGLTy~pd~~tLfaV~d~~~~-i~els~~-G------------~vlr~i~l-~g--~~D~EgI~y~g~~~~v 79 (248)
T PF06977_consen 17 LPGILDELSGLTYNPDTGTLFAVQDEPGE-IYELSLD-G------------KVLRRIPL-DG--FGDYEGITYLGNGRYV 79 (248)
T ss_dssp -TT--S-EEEEEEETTTTEEEEEETTTTE-EEEEETT---------------EEEEEE--SS---SSEEEEEE-STTEEE
T ss_pred CCCccCCccccEEcCCCCeEEEEECCCCE-EEEEcCC-C------------CEEEEEeC-CC--CCCceeEEEECCCEEE
Confidence 34444569999999985545 55555443 5444542 2 22333344 23 2458899999888777
Q ss_pred EEEeCCCcEEEEecCCCCC
Q 003310 396 MISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 396 AsgS~DgTVhIwdl~~~g~ 414 (832)
++.-.++++.++++...+.
T Consensus 80 l~~Er~~~L~~~~~~~~~~ 98 (248)
T PF06977_consen 80 LSEERDQRLYIFTIDDDTT 98 (248)
T ss_dssp EEETTTTEEEEEEE----T
T ss_pred EEEcCCCcEEEEEEecccc
Confidence 6665689999999965443
No 350
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.91 E-value=5.2 Score=49.24 Aligned_cols=107 Identities=13% Similarity=0.169 Sum_probs=72.2
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccCCCC-eEEEEEcCCCCEEEEEEcCCC----EEEEEeCCCCCCCCCCccCCCCcee
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAHKSP-ISALCFDPSGILLVTASVQGH----NINIFKIIPGILGTSSACDAGTSYV 369 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH~~p-Is~LaFSPdG~lLATaS~dGt----~I~Iwdi~t~~~~~~s~~~~~~~~~ 369 (832)
+-+..+|.|.+.+- +-+.+..|+++... |..|-...+-.+|++-..|+. .++||++......+ + +.+.
T Consensus 39 vigt~~G~V~~Ln~-s~~~~~~fqa~~~siv~~L~~~~~~~~L~sv~Ed~~~np~llkiw~lek~~~n~-s-----P~c~ 111 (933)
T KOG2114|consen 39 VIGTADGRVVILNS-SFQLIRGFQAYEQSIVQFLYILNKQNFLFSVGEDEQGNPVLLKIWDLEKVDKNN-S-----PQCL 111 (933)
T ss_pred EEeeccccEEEecc-cceeeehheecchhhhhHhhcccCceEEEEEeecCCCCceEEEEecccccCCCC-C-----ccee
Confidence 34566777766652 34456889999888 666655555578998888877 79999997642111 1 1222
Q ss_pred ---EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 370 ---HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 370 ---~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
+++...-+..+.++.+++.|.|-+.+|+|-.+|+|..+.
T Consensus 112 ~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv~Gf~nG~V~~~~ 153 (933)
T KOG2114|consen 112 YEHRIFTIKNPTNPSPASSLAVSEDLKTIVCGFTNGLVICYK 153 (933)
T ss_pred eeeeeeccCCCCCCCcceEEEEEccccEEEEEecCcEEEEEc
Confidence 223322232344588999999999999999999998875
No 351
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.75 E-value=1.7 Score=54.35 Aligned_cols=74 Identities=11% Similarity=0.199 Sum_probs=53.4
Q ss_pred CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccC-ccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 331 PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGL-TNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 331 PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~-t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
--+..+|.++..|+ +-.+|.... +.-+++|. ....|.++||+.||++++.|-.+|-|.+||+
T Consensus 97 ~~~~~ivi~Ts~gh-vl~~d~~~n----------------L~~~~~ne~v~~~Vtsvafn~dg~~l~~G~~~G~V~v~D~ 159 (1206)
T KOG2079|consen 97 IVVVPIVIGTSHGH-VLLSDMTGN----------------LGPLHQNERVQGPVTSVAFNQDGSLLLAGLGDGHVTVWDM 159 (1206)
T ss_pred eeeeeEEEEcCchh-hhhhhhhcc----------------cchhhcCCccCCcceeeEecCCCceeccccCCCcEEEEEc
Confidence 35677899988888 677776542 11122222 2345999999999999999999999999999
Q ss_pred CCCCCceeeccC
Q 003310 410 NPLGGSVNFQPT 421 (832)
Q Consensus 410 ~~~g~~~~~~~H 421 (832)
+..+....+.-|
T Consensus 160 ~~~k~l~~i~e~ 171 (1206)
T KOG2079|consen 160 HRAKILKVITEH 171 (1206)
T ss_pred cCCcceeeeeec
Confidence 886665555433
No 352
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=82.58 E-value=3.6 Score=32.44 Aligned_cols=29 Identities=14% Similarity=0.331 Sum_probs=25.1
Q ss_pred EEEEEEccCCC---EEEEEeCCCcEEEEecCC
Q 003310 383 IQDISFSDDSN---WIMISSSRGTSHLFAINP 411 (832)
Q Consensus 383 I~~IaFSpDg~---~LAsgS~DgTVhIwdl~~ 411 (832)
|.++.|||+.. +||.+-..|-|||+|+..
T Consensus 3 vR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43)
T PF10313_consen 3 VRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43)
T ss_pred eEEEEeCCCCCcccEEEEEccCCeEEEEEccc
Confidence 78999998554 899988899999999983
No 353
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.32 E-value=9.3 Score=46.79 Aligned_cols=105 Identities=17% Similarity=0.310 Sum_probs=70.4
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCC-----CCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPS-----GILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPd-----G~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
+++...||+|.|-.+-+.+...++.-+ -|+.+++|+|| .+.+++|+..| +-++.-.-- +. .
T Consensus 86 ~asCS~DGkv~I~sl~~~~~~~~~df~-rpiksial~Pd~~~~~sk~fv~GG~ag--lvL~er~wl--gn---------k 151 (846)
T KOG2066|consen 86 VASCSDDGKVVIGSLFTDDEITQYDFK-RPIKSIALHPDFSRQQSKQFVSGGMAG--LVLSERNWL--GN---------K 151 (846)
T ss_pred EEEecCCCcEEEeeccCCccceeEecC-CcceeEEeccchhhhhhhheeecCcce--EEEehhhhh--cC---------c
Confidence 456778999999998888776665543 58999999998 67889998887 345532211 10 0
Q ss_pred eEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 369 VHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
.. ..++-+ ...|.+|+|. |.++|=++++| |+|||+........+
T Consensus 152 ~~-v~l~~~--eG~I~~i~W~--g~lIAWand~G-v~vyd~~~~~~l~~i 195 (846)
T KOG2066|consen 152 DS-VVLSEG--EGPIHSIKWR--GNLIAWANDDG-VKVYDTPTRQRLTNI 195 (846)
T ss_pred cc-eeeecC--ccceEEEEec--CcEEEEecCCC-cEEEeccccceeecc
Confidence 01 123222 2349999995 66888888776 689999775444343
No 354
>PRK13616 lipoprotein LpqB; Provisional
Probab=82.26 E-value=7.6 Score=47.07 Aligned_cols=100 Identities=9% Similarity=0.091 Sum_probs=55.8
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE--EEecc-
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY--RLQRG- 377 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~--~l~rG- 377 (832)
..+.+++.... ....+.+. ..++-.|||||+.|++.+......++.+-... ..++ .+.-|
T Consensus 379 s~Lwv~~~gg~-~~~lt~g~--~~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~--------------gql~~~~vd~ge 441 (591)
T PRK13616 379 SSLWVGPLGGV-AVQVLEGH--SLTRPSWSLDADAVWVVVDGNTVVRVIRDPAT--------------GQLARTPVDASA 441 (591)
T ss_pred eEEEEEeCCCc-ceeeecCC--CCCCceECCCCCceEEEecCcceEEEeccCCC--------------ceEEEEeccCch
Confidence 35666665332 22222332 37788999999999998764344444332111 0112 11111
Q ss_pred ---CccccEEEEEEccCCCEEEEEeCCCcEEEEecCC-CCCceee
Q 003310 378 ---LTNAVIQDISFSDDSNWIMISSSRGTSHLFAINP-LGGSVNF 418 (832)
Q Consensus 378 ---~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~-~g~~~~~ 418 (832)
.-...|.++.|||||++||... +|.|+|=-+.. .++...+
T Consensus 442 ~~~~~~g~Issl~wSpDG~RiA~i~-~g~v~Va~Vvr~~~G~~~l 485 (591)
T PRK13616 442 VASRVPGPISELQLSRDGVRAAMII-GGKVYLAVVEQTEDGQYAL 485 (591)
T ss_pred hhhccCCCcCeEEECCCCCEEEEEE-CCEEEEEEEEeCCCCceee
Confidence 0012499999999999999877 57666643333 3444444
No 355
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=81.63 E-value=2.5 Score=46.42 Aligned_cols=86 Identities=23% Similarity=0.344 Sum_probs=52.0
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCC-----CCceeEEEEEeccCccccEEEEEEccCCC--E
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDA-----GTSYVHLYRLQRGLTNAVIQDISFSDDSN--W 394 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~-----~~~~~~l~~l~rG~t~a~I~~IaFSpDg~--~ 394 (832)
.-|+++-|+..|.|||||+..|+ +-+|.-.... +....+=+ +.....|..+. -.-+|..|.|-.++. .
T Consensus 27 d~ItaVefd~tg~YlatGDkgGR-Vvlfer~~s~-~ceykf~teFQshe~EFDYLkSle---ieEKin~I~w~~~t~r~h 101 (460)
T COG5170 27 DKITAVEFDETGLYLATGDKGGR-VVLFEREKSY-GCEYKFFTEFQSHELEFDYLKSLE---IEEKINAIEWFDDTGRNH 101 (460)
T ss_pred ceeeEEEeccccceEeecCCCce-EEEeeccccc-ccchhhhhhhcccccchhhhhhcc---HHHHhhheeeecCCCcce
Confidence 46899999999999999999888 6677543321 11000000 00000011110 012378888877654 4
Q ss_pred EEEEeCCCcEEEEecCCC
Q 003310 395 IMISSSRGTSHLFAINPL 412 (832)
Q Consensus 395 LAsgS~DgTVhIwdl~~~ 412 (832)
+..++.|.||+||.+-..
T Consensus 102 FLlstNdktiKlWKiyek 119 (460)
T COG5170 102 FLLSTNDKTIKLWKIYEK 119 (460)
T ss_pred EEEecCCceeeeeeeecc
Confidence 677788999999999543
No 356
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=80.77 E-value=78 Score=37.69 Aligned_cols=58 Identities=7% Similarity=0.098 Sum_probs=43.2
Q ss_pred cccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 293 HFPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 293 ~~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
+++.++.+|.|++||-...+....|++-..+|..|..+.||+.|+..+. +.+.+-++.
T Consensus 443 ~IvvgS~~GdIRLYdri~~~AKTAlPgLG~~I~hVdvtadGKwil~Tc~--tyLlLi~t~ 500 (644)
T KOG2395|consen 443 YIVVGSLKGDIRLYDRIGRRAKTALPGLGDAIKHVDVTADGKWILATCK--TYLLLIDTL 500 (644)
T ss_pred eEEEeecCCcEEeehhhhhhhhhcccccCCceeeEEeeccCcEEEEecc--cEEEEEEEe
Confidence 4455678899999998655666678888899999999999997654433 235566653
No 357
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=80.55 E-value=16 Score=46.59 Aligned_cols=82 Identities=18% Similarity=0.209 Sum_probs=56.0
Q ss_pred EEEEecc-----CCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEE
Q 003310 313 VIAQFRA-----HKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDIS 387 (832)
Q Consensus 313 ~l~~~~a-----H~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~Ia 387 (832)
.+..+.+ ....|.++.|-++...|..+..+|. |-++....... ......+-.+ ..-|.+++
T Consensus 62 ~l~s~~~~~~~~~~~~ivs~~yl~d~~~l~~~~~~Gd-i~~~~~~~~~~--------~~~~E~VG~v-----d~GI~a~~ 127 (928)
T PF04762_consen 62 VLASWDAPLPDDPNDKIVSFQYLADSESLCIALASGD-IILVREDPDPD--------EDEIEIVGSV-----DSGILAAS 127 (928)
T ss_pred EEEeccccCCcCCCCcEEEEEeccCCCcEEEEECCce-EEEEEccCCCC--------CceeEEEEEE-----cCcEEEEE
Confidence 4555542 3467999999999998888888998 44552211100 0112332233 33499999
Q ss_pred EccCCCEEEEEeCCCcEEEEe
Q 003310 388 FSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 388 FSpDg~~LAsgS~DgTVhIwd 408 (832)
||||...||..+.++++.+..
T Consensus 128 WSPD~Ella~vT~~~~l~~mt 148 (928)
T PF04762_consen 128 WSPDEELLALVTGEGNLLLMT 148 (928)
T ss_pred ECCCcCEEEEEeCCCEEEEEe
Confidence 999999999999999998864
No 358
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=80.25 E-value=2 Score=51.64 Aligned_cols=96 Identities=16% Similarity=0.308 Sum_probs=67.0
Q ss_pred cccCCCCeEEEEECCCCcEEEEec--cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFR--AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~--aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
.+.+.+|.|.||=+-.+.-...+- ...+-|.+|+|+-||+.++..-.||.+ .|=.+.- .+++
T Consensus 87 TtSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGav-IVGsvdG---------------NRIw 150 (1189)
T KOG2041|consen 87 TTSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAV-IVGSVDG---------------NRIW 150 (1189)
T ss_pred cccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCE-EEEeecc---------------ceec
Confidence 345778999999987775433332 345678999999999999988888873 3322221 1111
Q ss_pred --EEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 373 --RLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 373 --~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
.| .|.. ...+.||+|.+.+..+-..|.+|+||-.
T Consensus 151 gKeL-kg~~---l~hv~ws~D~~~~Lf~~ange~hlydnq 186 (1189)
T KOG2041|consen 151 GKEL-KGQL---LAHVLWSEDLEQALFKKANGETHLYDNQ 186 (1189)
T ss_pred chhc-chhe---ccceeecccHHHHHhhhcCCcEEEeccc
Confidence 22 2221 3467899999999999999999999974
No 359
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer comprising two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=79.78 E-value=48 Score=37.48 Aligned_cols=51 Identities=22% Similarity=0.419 Sum_probs=40.7
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCC-EEEEEEc-CCCEEEEEeCCCC
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGI-LLVTASV-QGHNINIFKIIPG 354 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~-lLATaS~-dGt~I~Iwdi~t~ 354 (832)
.|-++|+.+++.+..+..- .++.+|+.+.|.+ +|.+.+. +|. +.|||..++
T Consensus 270 eVWv~D~~t~krv~Ri~l~-~~~~Si~Vsqd~~P~L~~~~~~~~~-l~v~D~~tG 322 (342)
T PF06433_consen 270 EVWVYDLKTHKRVARIPLE-HPIDSIAVSQDDKPLLYALSAGDGT-LDVYDAATG 322 (342)
T ss_dssp EEEEEETTTTEEEEEEEEE-EEESEEEEESSSS-EEEEEETTTTE-EEEEETTT-
T ss_pred EEEEEECCCCeEEEEEeCC-CccceEEEccCCCcEEEEEcCCCCe-EEEEeCcCC
Confidence 6888999999999998842 3688999999988 6666555 555 899999987
No 360
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=78.31 E-value=13 Score=42.31 Aligned_cols=76 Identities=17% Similarity=0.252 Sum_probs=47.6
Q ss_pred CeEEEEEcCCCCEEEEE-EcCC---CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE
Q 003310 323 PISALCFDPSGILLVTA-SVQG---HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS 398 (832)
Q Consensus 323 pIs~LaFSPdG~lLATa-S~dG---t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg 398 (832)
.+...++||||++||-+ +..| ..|+|+|+.++ ..+-.. ........++|++|++.|.-.
T Consensus 125 ~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg--------------~~l~d~---i~~~~~~~~~W~~d~~~~~y~ 187 (414)
T PF02897_consen 125 SLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETG--------------KFLPDG---IENPKFSSVSWSDDGKGFFYT 187 (414)
T ss_dssp EEEEEEETTTSSEEEEEEEETTSSEEEEEEEETTTT--------------EEEEEE---EEEEESEEEEECTTSSEEEEE
T ss_pred EeeeeeECCCCCEEEEEecCCCCceEEEEEEECCCC--------------cCcCCc---ccccccceEEEeCCCCEEEEE
Confidence 34568999999998855 4444 35899999887 222211 111123349999999887776
Q ss_pred eCCCc-----------EEEEecCCCCCc
Q 003310 399 SSRGT-----------SHLFAINPLGGS 415 (832)
Q Consensus 399 S~DgT-----------VhIwdl~~~g~~ 415 (832)
..+.. |..|.+.+....
T Consensus 188 ~~~~~~~~~~~~~~~~v~~~~~gt~~~~ 215 (414)
T PF02897_consen 188 RFDEDQRTSDSGYPRQVYRHKLGTPQSE 215 (414)
T ss_dssp ECSTTTSS-CCGCCEEEEEEETTS-GGG
T ss_pred EeCcccccccCCCCcEEEEEECCCChHh
Confidence 65442 666666554333
No 361
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=76.60 E-value=15 Score=41.24 Aligned_cols=114 Identities=11% Similarity=0.208 Sum_probs=73.2
Q ss_pred CCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCC-CCC---------CCCC--------ccCCCCc---
Q 003310 309 VSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIP-GIL---------GTSS--------ACDAGTS--- 367 (832)
Q Consensus 309 ~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t-~~~---------~~~s--------~~~~~~~--- 367 (832)
..-..++...+|..+|..+-|+-.-+++++++.|.. + .|-... +.. .+.. .++..+.
T Consensus 102 nkm~~~r~~~~h~~~v~~~if~~~~e~V~s~~~dk~-~-~~hc~e~~~~lg~Y~~~~~~t~~~~d~~~~fvGd~~gqvt~ 179 (404)
T KOG1409|consen 102 NKMTFLKDYLAHQARVSAIVFSLTHEWVLSTGKDKQ-F-AWHCTESGNRLGGYNFETPASALQFDALYAFVGDHSGQITM 179 (404)
T ss_pred hhcchhhhhhhhhcceeeEEecCCceeEEEeccccc-e-EEEeeccCCcccceEeeccCCCCceeeEEEEecccccceEE
Confidence 334455666788889999999888888888888744 2 443221 100 0000 0111111
Q ss_pred -------eeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc-eeeccCCCCcc
Q 003310 368 -------YVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS-VNFQPTDANFT 426 (832)
Q Consensus 368 -------~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~-~~~~~H~~~~~ 426 (832)
...++++ +|++ ..|.+++|.+-.+.|.++..|..+.+|||--..+. ..+++|.+...
T Consensus 180 lr~~~~~~~~i~~~-~~h~-~~~~~l~Wd~~~~~LfSg~~d~~vi~wdigg~~g~~~el~gh~~kV~ 244 (404)
T KOG1409|consen 180 LKLEQNGCQLITTF-NGHT-GEVTCLKWDPGQRLLFSGASDHSVIMWDIGGRKGTAYELQGHNDKVQ 244 (404)
T ss_pred EEEeecCCceEEEE-cCcc-cceEEEEEcCCCcEEEeccccCceEEEeccCCcceeeeeccchhhhh
Confidence 1123333 4554 35999999999999999999999999999766665 46688876544
No 362
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=76.44 E-value=63 Score=30.76 Aligned_cols=88 Identities=15% Similarity=0.238 Sum_probs=59.6
Q ss_pred ccccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 294 FPDADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 294 ~~s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
++.+..|..|+||+-. ..+.++..+ ..|++|+-... ..++.|-..|+ |-+|+-. .++|+
T Consensus 18 LlvGs~D~~IRvf~~~--e~~~Ei~e~-~~v~~L~~~~~-~~F~Y~l~NGT-VGvY~~~----------------~RlWR 76 (111)
T PF14783_consen 18 LLVGSDDFEIRVFKGD--EIVAEITET-DKVTSLCSLGG-GRFAYALANGT-VGVYDRS----------------QRLWR 76 (111)
T ss_pred EEEecCCcEEEEEeCC--cEEEEEecc-cceEEEEEcCC-CEEEEEecCCE-EEEEeCc----------------ceeee
Confidence 4456788999999854 577777754 46777776666 56899999998 8999753 34555
Q ss_pred EeccCccccEEEEEEcc---CCC-EEEEEeCCCcEE
Q 003310 374 LQRGLTNAVIQDISFSD---DSN-WIMISSSRGTSH 405 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSp---Dg~-~LAsgS~DgTVh 405 (832)
.+. ...+.++++.. ||. -|++|-++|.|-
T Consensus 77 iKS---K~~~~~~~~~D~~gdG~~eLI~GwsnGkve 109 (111)
T PF14783_consen 77 IKS---KNQVTSMAFYDINGDGVPELIVGWSNGKVE 109 (111)
T ss_pred ecc---CCCeEEEEEEcCCCCCceEEEEEecCCeEE
Confidence 542 22366666544 332 577888888764
No 363
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=76.03 E-value=1.1e+02 Score=32.41 Aligned_cols=40 Identities=20% Similarity=0.458 Sum_probs=28.6
Q ss_pred CcEEEEEecCCeEEEEeccCCCeeEEeeecCCCEEEEEEecC
Q 003310 18 RRVLLLGYRSGFQVWDVEEADNVHDLVSRYDGPVSFMQMLPR 59 (832)
Q Consensus 18 ~~vLl~Gy~~G~qVWdv~~~~~~~ellS~~dG~Vr~v~ilp~ 59 (832)
+..|++|.+.|+-++++.......++++... |.-+.++|.
T Consensus 7 ~~~L~vGt~~Gl~~~~~~~~~~~~~i~~~~~--I~ql~vl~~ 46 (275)
T PF00780_consen 7 GDRLLVGTEDGLYVYDLSDPSKPTRILKLSS--ITQLSVLPE 46 (275)
T ss_pred CCEEEEEECCCEEEEEecCCccceeEeecce--EEEEEEecc
Confidence 5678888999999999944455556665333 887777763
No 364
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=75.42 E-value=1e+02 Score=34.43 Aligned_cols=99 Identities=21% Similarity=0.232 Sum_probs=54.6
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce-eEEEEEeccCc
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY-VHLYRLQRGLT 379 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~-~~l~~l~rG~t 379 (832)
+.|.-||..+++. ..|......-..+.++.+|.++++ .+| +.+++..++ .. ..+.....+..
T Consensus 47 ~~i~r~~~~~g~~-~~~~~p~~~~~~~~~d~~g~Lv~~--~~g--~~~~~~~~~------------~~~t~~~~~~~~~~ 109 (307)
T COG3386 47 GRIHRLDPETGKK-RVFPSPGGFSSGALIDAGGRLIAC--EHG--VRLLDPDTG------------GKITLLAEPEDGLP 109 (307)
T ss_pred CeEEEecCCcCce-EEEECCCCcccceeecCCCeEEEE--ccc--cEEEeccCC------------ceeEEeccccCCCC
Confidence 5677777776543 333333333334556666665543 333 467777544 11 23344444544
Q ss_pred cccEEEEEEccCCCEEEEEeC---------CCcEEEEecCCCCCce
Q 003310 380 NAVIQDISFSDDSNWIMISSS---------RGTSHLFAINPLGGSV 416 (832)
Q Consensus 380 ~a~I~~IaFSpDg~~LAsgS~---------DgTVhIwdl~~~g~~~ 416 (832)
....+++...|||.+-++... ..+=.||.+.+.+...
T Consensus 110 ~~r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~ 155 (307)
T COG3386 110 LNRPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVV 155 (307)
T ss_pred cCCCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCCCCEE
Confidence 455789999999988766544 1223566666543433
No 365
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=75.10 E-value=30 Score=42.22 Aligned_cols=102 Identities=14% Similarity=0.177 Sum_probs=64.6
Q ss_pred CCeEEEEECCCCcEEEEec-cCCCCeEEEEE--cCCCCEEEEEEcCCCEEEEEeCCC-CCCCCCCccCCCCceeEEEEEe
Q 003310 300 VGMVIVRDIVSKNVIAQFR-AHKSPISALCF--DPSGILLVTASVQGHNINIFKIIP-GILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~-aH~~pIs~LaF--SPdG~lLATaS~dGt~I~Iwdi~t-~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
...+.|||...+.....-. ....+|..|.| .|||+.+.+-+...+ |.++.-.. .+..... .+ .......+
T Consensus 50 ~~~LtIWD~~~~~lE~~~~f~~~~~I~dLDWtst~d~qsiLaVGf~~~-v~l~~Q~R~dy~~~~p--~w--~~i~~i~i- 123 (631)
T PF12234_consen 50 RSELTIWDTRSGVLEYEESFSEDDPIRDLDWTSTPDGQSILAVGFPHH-VLLYTQLRYDYTNKGP--SW--APIRKIDI- 123 (631)
T ss_pred CCEEEEEEcCCcEEEEeeeecCCCceeeceeeecCCCCEEEEEEcCcE-EEEEEccchhhhcCCc--cc--ceeEEEEe-
Confidence 3479999999876332211 34678999886 579999888888866 77774321 1110000 00 11122233
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
+.+|++.|.+.+|-+||.+++.+ ..-+.||+-
T Consensus 124 ~~~T~h~Igds~Wl~~G~LvV~s--GNqlfv~dk 155 (631)
T PF12234_consen 124 SSHTPHPIGDSIWLKDGTLVVGS--GNQLFVFDK 155 (631)
T ss_pred ecCCCCCccceeEecCCeEEEEe--CCEEEEECC
Confidence 56787889999999999877654 445777765
No 366
>PRK13616 lipoprotein LpqB; Provisional
Probab=74.22 E-value=16 Score=44.41 Aligned_cols=99 Identities=15% Similarity=0.228 Sum_probs=58.1
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEE---EEEec
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHL---YRLQR 376 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l---~~l~r 376 (832)
.+.+.+.++..+.... .....|..+.|||||+.||-.. +|+ |.|=-+.....| . ..+ ..+.-
T Consensus 429 ~gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~-~g~-v~Va~Vvr~~~G---------~-~~l~~~~~l~~ 493 (591)
T PRK13616 429 TGQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMII-GGK-VYLAVVEQTEDG---------Q-YALTNPREVGP 493 (591)
T ss_pred CceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEE-CCE-EEEEEEEeCCCC---------c-eeecccEEeec
Confidence 4566666776655432 3346799999999999988766 454 555222221000 1 122 22222
Q ss_pred cCccccEEEEEEccCCCEEEEEeCCCcEEEEecCCCCCc
Q 003310 377 GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAINPLGGS 415 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~ 415 (832)
+... .+.+++|..|++++ ++..++...+|.++-.|..
T Consensus 494 ~l~~-~~~~l~W~~~~~L~-V~~~~~~~~v~~v~vDG~~ 530 (591)
T PRK13616 494 GLGD-TAVSLDWRTGDSLV-VGRSDPEHPVWYVNLDGSN 530 (591)
T ss_pred ccCC-ccccceEecCCEEE-EEecCCCCceEEEecCCcc
Confidence 2221 25789999999855 6666666677877655443
No 367
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=74.17 E-value=1.8e+02 Score=33.74 Aligned_cols=96 Identities=14% Similarity=0.085 Sum_probs=50.4
Q ss_pred CCCeEEEEECCCCcEEEEeccCCCCeEEEEE-----cCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 299 NVGMVIVRDIVSKNVIAQFRAHKSPISALCF-----DPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 299 ~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaF-----SPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
.+..|+|+...+.+..+......-....++| ...+..|++-..+|. |++|.+..- +.+.+
T Consensus 233 Se~~irv~~~~~~k~~~K~~~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~G~-i~i~SLP~L--------------kei~~ 297 (395)
T PF08596_consen 233 SESDIRVFKPPKSKGAHKSFDDPFLCSSASVVPTISRNGGYCLVCLFNNGS-IRIYSLPSL--------------KEIKS 297 (395)
T ss_dssp -SSEEEEE-TT---EEEEE-SS-EEEEEEEEEEEE-EEEEEEEEEEETTSE-EEEEETTT----------------EEEE
T ss_pred cccceEEEeCCCCcccceeeccccccceEEEEeecccCCceEEEEEECCCc-EEEEECCCc--------------hHhhc
Confidence 3457899988877665544422223334556 346888998899998 899998653 44444
Q ss_pred Eec--cCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 374 LQR--GLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 374 l~r--G~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
+.. ......+...+|+++|..+.-.+. ..+.+|.+-
T Consensus 298 ~~l~~~~d~~~~~~ssis~~Gdi~~~~gp-sE~~l~sv~ 335 (395)
T PF08596_consen 298 VSLPPPLDSRRLSSSSISRNGDIFYWTGP-SEIQLFSVW 335 (395)
T ss_dssp EE-SS---HHHHTT-EE-TTS-EEEE-SS-SEEEEEEEE
T ss_pred ccCCCccccccccccEECCCCCEEEEeCc-ccEEEEEEE
Confidence 432 111122557889999998766654 355555543
No 368
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=73.54 E-value=19 Score=35.32 Aligned_cols=46 Identities=11% Similarity=0.154 Sum_probs=36.3
Q ss_pred CCcEEEEEeCCCCEEEEEEc-------CCEEEEEeCCEEEEEECCCCceEEEE
Q 003310 126 SQSYVHMLKFRSPIYSVRCS-------SRVVAICQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 126 Tg~~V~tL~f~s~V~sV~~S-------~r~LAVa~~~~I~vwDl~t~~~~~tl 171 (832)
....+..|++...|.+|+.- ...|.++....+.+||+..-..++..
T Consensus 37 ~~~~i~~LNin~~italaaG~l~~~~~~D~LliGt~t~llaYDV~~N~d~Fyk 89 (136)
T PF14781_consen 37 QDSDISFLNINQEITALAAGRLKPDDGRDCLLIGTQTSLLAYDVENNSDLFYK 89 (136)
T ss_pred ccCceeEEECCCceEEEEEEecCCCCCcCEEEEeccceEEEEEcccCchhhhh
Confidence 44578889999999998773 46899999999999999876554443
No 369
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance [].
Probab=73.44 E-value=1.2e+02 Score=37.77 Aligned_cols=74 Identities=19% Similarity=0.253 Sum_probs=47.6
Q ss_pred CCeEEEEEc--CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCC---C---
Q 003310 322 SPISALCFD--PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDS---N--- 393 (832)
Q Consensus 322 ~pIs~LaFS--PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg---~--- 393 (832)
..+..|++. ...++||.++.. +.|.||-...... . ..+.... ...+.|-+|+|-++. .
T Consensus 164 ~SaWGLdIh~~~~~rlIAVSsNs-~~VTVFaf~l~~~-r---------~~~~~s~---~~~hNIP~VSFl~~~~d~~G~v 229 (717)
T PF08728_consen 164 ASAWGLDIHDYKKSRLIAVSSNS-QEVTVFAFALVDE-R---------FYHVPSH---QHSHNIPNVSFLDDDLDPNGHV 229 (717)
T ss_pred CceeEEEEEecCcceEEEEecCC-ceEEEEEEecccc-c---------ccccccc---ccccCCCeeEeecCCCCCccce
Confidence 367889987 666777666555 5588987654200 0 0010011 123358999997654 2
Q ss_pred EEEEEeCCCcEEEEec
Q 003310 394 WIMISSSRGTSHLFAI 409 (832)
Q Consensus 394 ~LAsgS~DgTVhIwdl 409 (832)
+|++++-.|.+-+|++
T Consensus 230 ~v~a~dI~G~v~~~~I 245 (717)
T PF08728_consen 230 KVVATDISGEVWTFKI 245 (717)
T ss_pred EEEEEeccCcEEEEEE
Confidence 8999999999999888
No 370
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=73.34 E-value=25 Score=43.96 Aligned_cols=97 Identities=14% Similarity=0.260 Sum_probs=60.5
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCC-CeEEEEEc-------CCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCc
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKS-PISALCFD-------PSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTS 367 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~-pIs~LaFS-------PdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~ 367 (832)
+......|+-.|+..|+++..++.|.. +|..++=+ +..++|.. .+ +.+..||.+-. ++ .-
T Consensus 499 ~~~~~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p~~K~aqlt~e~tflGl--s~-n~lfriDpR~~--~~-------k~ 566 (794)
T PF08553_consen 499 DPNNPNKLYKMDLERGKVVEEWKVHDDIPVVDIAPDSKFAQLTNEQTFLGL--SD-NSLFRIDPRLS--GN-------KL 566 (794)
T ss_pred cCCCCCceEEEecCCCcEEEEeecCCCcceeEecccccccccCCCceEEEE--CC-CceEEeccCCC--CC-------ce
Confidence 334567788899999999999998874 36555321 33333332 23 23556776543 11 00
Q ss_pred ee-EEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 368 YV-HLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 368 ~~-~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
+. ..+...+ ....+|+|=+.+| +||+||.+|-|++||
T Consensus 567 v~~~~k~Y~~---~~~Fs~~aTt~~G-~iavgs~~G~IRLyd 604 (794)
T PF08553_consen 567 VDSQSKQYSS---KNNFSCFATTEDG-YIAVGSNKGDIRLYD 604 (794)
T ss_pred eecccccccc---CCCceEEEecCCc-eEEEEeCCCcEEeec
Confidence 00 1111112 2348889888887 789999999999998
No 371
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=72.57 E-value=8.4 Score=40.93 Aligned_cols=106 Identities=13% Similarity=0.085 Sum_probs=65.6
Q ss_pred cccCCCCeEEEEECCCCcEE-EEeccCCCCeEEE-EEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVI-AQFRAHKSPISAL-CFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l-~~~~aH~~pIs~L-aFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
+.+..+|.|.+|...-.... ..+..-..+|.++ .--.++.+..++..+|. |+-|.+.++ +++-
T Consensus 74 ~vG~~dg~v~~~n~n~~g~~~d~~~s~~e~i~~~Ip~~~~~~~~c~~~~dg~-ir~~n~~p~--------------k~~g 138 (238)
T KOG2444|consen 74 MVGTSDGAVYVFNWNLEGAHSDRVCSGEESIDLGIPNGRDSSLGCVGAQDGR-IRACNIKPN--------------KVLG 138 (238)
T ss_pred EeecccceEEEecCCccchHHHhhhcccccceeccccccccceeEEeccCCc-eeeeccccC--------------ceee
Confidence 44567899999887621111 1111112344443 23345667888999987 999999886 2221
Q ss_pred EEeccCcc-ccEEEEEEccCCCEEEEE--eCCCcEEEEecCCCCCcee
Q 003310 373 RLQRGLTN-AVIQDISFSDDSNWIMIS--SSRGTSHLFAINPLGGSVN 417 (832)
Q Consensus 373 ~l~rG~t~-a~I~~IaFSpDg~~LAsg--S~DgTVhIwdl~~~g~~~~ 417 (832)
. +|.+. ..+....-+..+++|+++ |.|.+++.|++.+......
T Consensus 139 ~--~g~h~~~~~e~~ivv~sd~~i~~a~~S~d~~~k~W~ve~~~d~~~ 184 (238)
T KOG2444|consen 139 Y--VGQHNFESGEELIVVGSDEFLKIADTSHDRVLKKWNVEKIKDESP 184 (238)
T ss_pred e--eccccCCCcceeEEecCCceEEeeccccchhhhhcchhhhhccCc
Confidence 1 23333 456666667777888888 8888999999877655433
No 372
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=72.42 E-value=27 Score=40.24 Aligned_cols=83 Identities=18% Similarity=0.339 Sum_probs=54.6
Q ss_pred EEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe--c----cCccccEEEE
Q 003310 313 VIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ--R----GLTNAVIQDI 386 (832)
Q Consensus 313 ~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~--r----G~t~a~I~~I 386 (832)
++.-+....++|++|+.|.=| ++|.|..+|+ +.|.|++-. .-+|+-. . ......|.++
T Consensus 78 P~~l~~~~~g~vtal~~S~iG-Fvaigy~~G~-l~viD~RGP--------------avI~~~~i~~~~~~~~~~~~vt~i 141 (395)
T PF08596_consen 78 PLTLLDAKQGPVTALKNSDIG-FVAIGYESGS-LVVIDLRGP--------------AVIYNENIRESFLSKSSSSYVTSI 141 (395)
T ss_dssp EEEEE---S-SEEEEEE-BTS-EEEEEETTSE-EEEEETTTT--------------EEEEEEEGGG--T-SS----EEEE
T ss_pred chhheeccCCcEeEEecCCCc-EEEEEecCCc-EEEEECCCC--------------eEEeeccccccccccccccCeeEE
Confidence 556666678999999998545 7999999997 789999653 3334321 1 1112237788
Q ss_pred EEcc-----CC---CEEEEEeCCCcEEEEecCC
Q 003310 387 SFSD-----DS---NWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 387 aFSp-----Dg---~~LAsgS~DgTVhIwdl~~ 411 (832)
.|+. |+ -.|.+|+..|++.+|.|-+
T Consensus 142 eF~vm~~~~D~ySSi~L~vGTn~G~v~~fkIlp 174 (395)
T PF08596_consen 142 EFSVMTLGGDGYSSICLLVGTNSGNVLTFKILP 174 (395)
T ss_dssp EEEEEE-TTSSSEEEEEEEEETTSEEEEEEEEE
T ss_pred EEEEEecCCCcccceEEEEEeCCCCEEEEEEec
Confidence 8873 43 4688999999999999975
No 373
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=72.15 E-value=1.9e+02 Score=33.35 Aligned_cols=41 Identities=10% Similarity=0.134 Sum_probs=27.0
Q ss_pred CCCEEEEEECCCCcEEEEEeCCCCEEEEEEcC---CEEEEEeCC
Q 003310 115 VPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSS---RVVAICQAA 155 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~---r~LAVa~~~ 155 (832)
.++.|.-.|++||+.-..+.-+..+--+.|++ .+|+-|.++
T Consensus 166 p~~~i~~idl~tG~~~~v~~~~~wlgH~~fsP~dp~li~fCHEG 209 (386)
T PF14583_consen 166 PHCRIFTIDLKTGERKVVFEDTDWLGHVQFSPTDPTLIMFCHEG 209 (386)
T ss_dssp --EEEEEEETTT--EEEEEEESS-EEEEEEETTEEEEEEEEE-S
T ss_pred CCceEEEEECCCCceeEEEecCccccCcccCCCCCCEEEEeccC
Confidence 45788888999999766666677788888874 677778654
No 374
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=71.99 E-value=58 Score=37.18 Aligned_cols=93 Identities=15% Similarity=0.253 Sum_probs=62.6
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE------
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR------ 373 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~------ 373 (832)
.+.|.+.|-.+.+.+.....-..| ..++|+|+|+.+..+..+...|.++|.... .+.+
T Consensus 139 ~~~vsvid~~t~~~~~~~~vG~~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~---------------~v~~~~~~~~ 202 (381)
T COG3391 139 NNTVSVIDAATNKVTATIPVGNTP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGN---------------SVVRGSVGSL 202 (381)
T ss_pred CceEEEEeCCCCeEEEEEecCCCc-ceEEECCCCCeEEEEecCCCeEEEEeCCCc---------------ceeccccccc
Confidence 568899999999888886654456 889999999977766644455889996543 1111
Q ss_pred EeccCccccEEEEEEccCCCEEEEEeCC---CcEEEEecCC
Q 003310 374 LQRGLTNAVIQDISFSDDSNWIMISSSR---GTSHLFAINP 411 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSpDg~~LAsgS~D---gTVhIwdl~~ 411 (832)
...+.. -..++++|||+++-+.-.. +++-+.|..+
T Consensus 203 ~~~~~~---P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~ 240 (381)
T COG3391 203 VGVGTG---PAGIAVDPDGNRVYVANDGSGSNNVLKIDTAT 240 (381)
T ss_pred cccCCC---CceEEECCCCCEEEEEeccCCCceEEEEeCCC
Confidence 111111 3578999999976555443 4666666654
No 375
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. 
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=71.54 E-value=27 Score=39.81 Aligned_cols=99 Identities=15% Similarity=0.202 Sum_probs=58.0
Q ss_pred CCCCeEEEEECCCCcEEEE-eccCCCCeEEEEEcCCCCEEEEEEcCC----------CEEEEEeCCCCCCCCCCccCCCC
Q 003310 298 DNVGMVIVRDIVSKNVIAQ-FRAHKSPISALCFDPSGILLVTASVQG----------HNINIFKIIPGILGTSSACDAGT 366 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~-~~aH~~pIs~LaFSPdG~lLATaS~dG----------t~I~Iwdi~t~~~~~~s~~~~~~ 366 (832)
+..-.++|+|+.+++.+.. |..- .-..++|.+||+.|.....+. +.|..|++.+.. .
T Consensus 147 ~e~~~l~v~Dl~tg~~l~d~i~~~--~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~~gt~~----------~ 214 (414)
T PF02897_consen 147 SEWYTLRVFDLETGKFLPDGIENP--KFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHKLGTPQ----------S 214 (414)
T ss_dssp SSEEEEEEEETTTTEEEEEEEEEE--ESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEETTS-G----------G
T ss_pred CceEEEEEEECCCCcCcCCccccc--ccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEECCCCh----------H
Confidence 4446899999999987653 2321 123399999999876655332 335566665431 0
Q ss_pred ceeEEEEEeccCcccc-EEEEEEccCCCEEEEEeCCCc----EEEEecCC
Q 003310 367 SYVHLYRLQRGLTNAV-IQDISFSDDSNWIMISSSRGT----SHLFAINP 411 (832)
Q Consensus 367 ~~~~l~~l~rG~t~a~-I~~IaFSpDg~~LAsgS~DgT----VhIwdl~~ 411 (832)
.-..++... .... ...+..|+|+++|.+.+..++ +++.++..
T Consensus 215 ~d~lvfe~~---~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~ 261 (414)
T PF02897_consen 215 EDELVFEEP---DEPFWFVSVSRSKDGRYLFISSSSGTSESEVYLLDLDD 261 (414)
T ss_dssp G-EEEEC-T---TCTTSEEEEEE-TTSSEEEEEEESSSSEEEEEEEECCC
T ss_pred hCeeEEeec---CCCcEEEEEEecCcccEEEEEEEccccCCeEEEEeccc
Confidence 113444432 2223 678999999999887555443 45555544
No 376
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=71.11 E-value=13 Score=27.75 Aligned_cols=30 Identities=13% Similarity=0.283 Sum_probs=19.8
Q ss_pred CCCCeEEEEEcCCCCEEEEEEcCC--CEEEEE
Q 003310 320 HKSPISALCFDPSGILLVTASVQG--HNINIF 349 (832)
Q Consensus 320 H~~pIs~LaFSPdG~lLATaS~dG--t~I~Iw 349 (832)
....-.+.+|||||+.|+-++... ....||
T Consensus 7 ~~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 7 SPGDDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SSSSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred CCccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 344667889999999988766553 224555
No 377
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=71.06 E-value=45 Score=38.88 Aligned_cols=108 Identities=13% Similarity=0.111 Sum_probs=54.4
Q ss_pred CCCCeEEEEECCCCcEEEEeccC-CC-CeEEEEEcC--CCCEEEEEEc-CCCEEEEEeCCCCCCCCCCccCCCCceeEEE
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAH-KS-PISALCFDP--SGILLVTASV-QGHNINIFKIIPGILGTSSACDAGTSYVHLY 372 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH-~~-pIs~LaFSP--dG~lLATaS~-dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~ 372 (832)
..-..+.+||+.+.+++.++.-- .+ -...|.|-. +-.+=.+++. .+++.++|....+ . | ..+.+.
T Consensus 219 ~yG~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k~~~g-~-------W--~a~kVi 288 (461)
T PF05694_consen 219 KYGHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSSSIWRFYKDDDG-E-------W--AAEKVI 288 (461)
T ss_dssp -S--EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--EEEEEEEE-ETT-E-------E--EEEEEE
T ss_pred cccCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccceEEEEEEcCCC-C-------e--eeeEEE
Confidence 34568999999999999998732 22 345678854 4555444433 3343444433322 0 1 112222
Q ss_pred EEe-------------c--cCccccEEEEEEccCCCEEEEEe-CCCcEEEEecCCCCCc
Q 003310 373 RLQ-------------R--GLTNAVIQDISFSDDSNWIMISS-SRGTSHLFAINPLGGS 415 (832)
Q Consensus 373 ~l~-------------r--G~t~a~I~~IaFSpDg~~LAsgS-~DgTVhIwdl~~~g~~ 415 (832)
++. + +..++-|.+|..|.|.++|-.+. .+|.|+.|||+....+
T Consensus 289 ~ip~~~v~~~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISDP~~P 347 (461)
T PF05694_consen 289 DIPAKKVEGWILPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISDPFNP 347 (461)
T ss_dssp EE--EE--SS---GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SSTTS-
T ss_pred ECCCcccCcccccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCCCCCC
Confidence 221 0 11123489999999999987655 6999999999876443
No 378
>PF07676 PD40: WD40-like Beta Propeller Repeat; InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=69.71 E-value=9.5 Score=28.42 Aligned_cols=26 Identities=23% Similarity=0.438 Sum_probs=19.6
Q ss_pred cEEEEEEccCCCEEEEEeCC---CcEEEE
Q 003310 382 VIQDISFSDDSNWIMISSSR---GTSHLF 407 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~D---gTVhIw 407 (832)
.....+|||||++|+-++.. |...||
T Consensus 10 ~~~~p~~SpDGk~i~f~s~~~~~g~~diy 38 (39)
T PF07676_consen 10 DDGSPAWSPDGKYIYFTSNRNDRGSFDIY 38 (39)
T ss_dssp SEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred cccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence 46778999999999987765 566666
No 379
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=69.09 E-value=9.3 Score=47.06 Aligned_cols=56 Identities=18% Similarity=0.337 Sum_probs=46.4
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
.+-.-|.+.+|...+.+.-..-..|..+|.-|.|||||+.|.|+..-|. +.+|...
T Consensus 76 ~gwe~g~~~v~~~~~~e~htv~~th~a~i~~l~wS~~G~~l~t~d~~g~-v~lwr~d 131 (1416)
T KOG3617|consen 76 QGWEMGVSDVQKTNTTETHTVVETHPAPIQGLDWSHDGTVLMTLDNPGS-VHLWRYD 131 (1416)
T ss_pred hccccceeEEEecCCceeeeeccCCCCCceeEEecCCCCeEEEcCCCce-eEEEEee
Confidence 3445688999998877654444589999999999999999999999986 8999776
No 380
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=68.27 E-value=6.4 Score=48.60 Aligned_cols=91 Identities=22% Similarity=0.295 Sum_probs=66.5
Q ss_pred cEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccC
Q 003310 312 NVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDD 391 (832)
Q Consensus 312 ~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpD 391 (832)
+....|+.|+...+|++||-+-..|+.|+..|. |+||++.+|.- ..-+ -+| .+.|.-|.=|-|
T Consensus 1092 r~w~~frd~~~~fTc~afs~~~~hL~vG~~~Ge-ik~~nv~sG~~------------e~s~---ncH-~SavT~vePs~d 1154 (1516)
T KOG1832|consen 1092 RSWRSFRDETALFTCIAFSGGTNHLAVGSHAGE-IKIFNVSSGSM------------EESV---NCH-QSAVTLVEPSVD 1154 (1516)
T ss_pred ccchhhhccccceeeEEeecCCceEEeeeccce-EEEEEccCccc------------cccc---ccc-ccccccccccCC
Confidence 466788899999999999999999999999998 99999988711 0001 122 234777778889
Q ss_pred CCEEEEEeCCC--cEEEEecCCCCCc-eeec
Q 003310 392 SNWIMISSSRG--TSHLFAINPLGGS-VNFQ 419 (832)
Q Consensus 392 g~~LAsgS~Dg--TVhIwdl~~~g~~-~~~~ 419 (832)
|..+.+.|.-. -.-+|++...++. .+|+
T Consensus 1155 gs~~Ltsss~S~PlsaLW~~~s~~~~~Hsf~ 1185 (1516)
T KOG1832|consen 1155 GSTQLTSSSSSSPLSALWDASSTGGPRHSFD 1185 (1516)
T ss_pred cceeeeeccccCchHHHhccccccCcccccc
Confidence 98877766543 4789999765543 3443
No 381
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=66.90 E-value=2.8e+02 Score=33.23 Aligned_cols=56 Identities=13% Similarity=0.053 Sum_probs=38.9
Q ss_pred CEEEEEECCCCcEEEEEeCCC--CEE----------EEEEc-CCEEEEEeCCEEEEEECCCCceEEEEe
Q 003310 117 TVVHFYSLRSQSYVHMLKFRS--PIY----------SVRCS-SRVVAICQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s--~V~----------sV~~S-~r~LAVa~~~~I~vwDl~t~~~~~tl~ 172 (832)
+.|.=.|.+||+.+.+..... .+. .+.+. .++++...++.++++|+.|++.+....
T Consensus 79 g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~dg~l~ALDa~TGk~~W~~~ 147 (527)
T TIGR03075 79 SRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTLDARLVALDAKTGKVVWSKK 147 (527)
T ss_pred CcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcCCCEEEEEECCCCCEEeecc
Confidence 457777999999988876532 121 12333 345555678999999999999887764
No 382
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=64.28 E-value=3.8e+02 Score=33.81 Aligned_cols=57 Identities=14% Similarity=0.086 Sum_probs=39.7
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEE---------EEE-------------------EcCCEEEEEeCCEEEEEECCCCceE
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIY---------SVR-------------------CSSRVVAICQAAQVHCFDAATLEIE 168 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~---------sV~-------------------~S~r~LAVa~~~~I~vwDl~t~~~~ 168 (832)
+.|.=.|.+||+.+..+.....+. .+. |..+++..+.|++++..|+.|++..
T Consensus 204 ~~V~ALDa~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~Dg~LiALDA~TGk~~ 283 (764)
T TIGR03074 204 NKVIALDAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSDARLIALDADTGKLC 283 (764)
T ss_pred CeEEEEECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCCCeEEEEECCCCCEE
Confidence 567777999999988877643321 111 2235566678999999999999988
Q ss_pred EEEec
Q 003310 169 YAILT 173 (832)
Q Consensus 169 ~tl~t 173 (832)
..+..
T Consensus 284 W~fg~ 288 (764)
T TIGR03074 284 EDFGN 288 (764)
T ss_pred EEecC
Confidence 76643
No 383
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=64.24 E-value=2.3e+02 Score=31.20 Aligned_cols=84 Identities=13% Similarity=0.038 Sum_probs=55.6
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
+...|.+++.++.++..+..|..-..-=.....++||.++-.++.||+ +..-|.++. ..+|+.+-
T Consensus 69 GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~~d~~~glIycgshd~~-~yalD~~~~--------------~cVykskc 133 (354)
T KOG4649|consen 69 GCYSGGLYFLCVKTGSQIWNFVILETVKVRAQCDFDGGLIYCGSHDGN-FYALDPKTY--------------GCVYKSKC 133 (354)
T ss_pred EEccCcEEEEEecchhheeeeeehhhhccceEEcCCCceEEEecCCCc-EEEeccccc--------------ceEEeccc
Confidence 456788999999999877777644321123457899999999999998 778787764 46677654
Q ss_pred cCccccEEEEEEcc-CCCEEEE
Q 003310 377 GLTNAVIQDISFSD-DSNWIMI 397 (832)
Q Consensus 377 G~t~a~I~~IaFSp-Dg~~LAs 397 (832)
|-+ ...+=+..| |+.+.++
T Consensus 134 gG~--~f~sP~i~~g~~sly~a 153 (354)
T KOG4649|consen 134 GGG--TFVSPVIAPGDGSLYAA 153 (354)
T ss_pred CCc--eeccceecCCCceEEEE
Confidence 422 122334455 5544443
No 384
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. 
Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=60.54 E-value=2.8e+02 Score=31.03 Aligned_cols=50 Identities=12% Similarity=0.111 Sum_probs=38.4
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc--CCEEEEEeCCEEEEEECCCCc
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS--SRVVAICQAAQVHCFDAATLE 166 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S--~r~LAVa~~~~I~vwDl~t~~ 166 (832)
+.+.+||+.+++..........+....++ ++.||...++.|+++++.+.+
T Consensus 23 ~~y~i~d~~~~~~~~l~~~~~~~~~~~~sP~g~~~~~v~~~nly~~~~~~~~ 74 (353)
T PF00930_consen 23 GDYYIYDIETGEITPLTPPPPKLQDAKWSPDGKYIAFVRDNNLYLRDLATGQ 74 (353)
T ss_dssp EEEEEEETTTTEEEESS-EETTBSEEEE-SSSTEEEEEETTEEEEESSTTSE
T ss_pred eeEEEEecCCCceEECcCCccccccceeecCCCeeEEEecCceEEEECCCCC
Confidence 67899999998765433334677788887 588999999999999998873
No 385
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=60.37 E-value=1.7e+02 Score=32.41 Aligned_cols=43 Identities=12% Similarity=-0.002 Sum_probs=34.3
Q ss_pred EEEeCCCCEEEEEEcCCEEEEEeCCEEEEEECCCCceEEEEec
Q 003310 131 HMLKFRSPIYSVRCSSRVVAICQAAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 131 ~tL~f~s~V~sV~~S~r~LAVa~~~~I~vwDl~t~~~~~tl~t 173 (832)
..|.+.....++++....|.+..+.-|.|+++.+++.++++..
T Consensus 238 ~~l~w~~~p~~~~~~~pyll~~~~~~ievr~l~~~~l~q~i~~ 280 (302)
T smart00036 238 PILHWEFMPESFAYHSPYLLAFHDNGIEIRSIKTGELLQELAD 280 (302)
T ss_pred eEEEcCCcccEEEEECCEEEEEcCCcEEEEECCCCceEEEEec
Confidence 3567777788888876666666678899999999998888853
No 386
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=59.83 E-value=1.1e+02 Score=33.88 Aligned_cols=53 Identities=13% Similarity=0.116 Sum_probs=38.7
Q ss_pred EEEECCCCcEEEEEeCCCCEEEEEEc---CCEEEEEe--CCEEEEEECCCCceEEEEec
Q 003310 120 HFYSLRSQSYVHMLKFRSPIYSVRCS---SRVVAICQ--AAQVHCFDAATLEIEYAILT 173 (832)
Q Consensus 120 rlWDL~Tg~~V~tL~f~s~V~sV~~S---~r~LAVa~--~~~I~vwDl~t~~~~~tl~t 173 (832)
-.++. .|+++..+..+..-..|.++ ++-|+++- ..--.+||..+.+.+.++..
T Consensus 52 a~~~e-aGk~v~~~~lpaR~Hgi~~~p~~~ravafARrPGtf~~vfD~~~~~~pv~~~s 109 (366)
T COG3490 52 ATLSE-AGKIVFATALPARGHGIAFHPALPRAVAFARRPGTFAMVFDPNGAQEPVTLVS 109 (366)
T ss_pred EEEcc-CCceeeeeecccccCCeecCCCCcceEEEEecCCceEEEECCCCCcCcEEEec
Confidence 34443 68999999988888889996 45566552 34688999999887777743
No 387
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=59.81 E-value=54 Score=38.96 Aligned_cols=89 Identities=22% Similarity=0.362 Sum_probs=54.5
Q ss_pred eEEEEECCC--CcEEEEeccCC----CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 302 MVIVRDIVS--KNVIAQFRAHK----SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 302 ~V~IwDl~s--~~~l~~~~aH~----~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
.|-=||.+- ...+..-+.|. ...+|.+-.-+| ++|.||.+|. ||+||- .+ +.-.+.-
T Consensus 405 ~vfriDpRv~~~~kl~~~q~kqy~~k~nFsc~aTT~sG-~IvvgS~~Gd-IRLYdr-i~--------------~~AKTAl 467 (644)
T KOG2395|consen 405 SVFRIDPRVQGKNKLAVVQSKQYSTKNNFSCFATTESG-YIVVGSLKGD-IRLYDR-IG--------------RRAKTAL 467 (644)
T ss_pred ceEEecccccCcceeeeeeccccccccccceeeecCCc-eEEEeecCCc-EEeehh-hh--------------hhhhhcc
Confidence 455566542 22333334443 346676666666 5899999998 999985 32 1112222
Q ss_pred ccCccccEEEEEEccCCCEEEEEeCCCcEEEEec
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAI 409 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl 409 (832)
-|.. ..|..|.-+.||+||+..+ +.++.+-++
T Consensus 468 PgLG-~~I~hVdvtadGKwil~Tc-~tyLlLi~t 499 (644)
T KOG2395|consen 468 PGLG-DAIKHVDVTADGKWILATC-KTYLLLIDT 499 (644)
T ss_pred cccC-CceeeEEeeccCcEEEEec-ccEEEEEEE
Confidence 3433 3488899999999986554 666666555
No 388
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=59.29 E-value=1.1e+02 Score=36.53 Aligned_cols=86 Identities=9% Similarity=0.245 Sum_probs=54.1
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCc--cCCCCce--------eEEEEEeccCccccEEEEEEccC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSA--CDAGTSY--------VHLYRLQRGLTNAVIQDISFSDD 391 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~--~~~~~~~--------~~l~~l~rG~t~a~I~~IaFSpD 391 (832)
-.|..+..|+.|...|-++.+|- .|-.+... .|..+. +++ ..+ ..+|+ ..+.-.+..++|.|+
T Consensus 104 feV~~vl~s~~GS~VaL~G~~Gi--~vMeLp~r-wG~~s~~eDgk-~~v~CRt~~i~~~~ft---ss~~ltl~Qa~WHP~ 176 (741)
T KOG4460|consen 104 FEVYQVLLSPTGSHVALIGIKGL--MVMELPKR-WGKNSEFEDGK-STVNCRTTPVAERFFT---SSTSLTLKQAAWHPS 176 (741)
T ss_pred EEEEEEEecCCCceEEEecCCee--EEEEchhh-cCccceecCCC-ceEEEEeecccceeec---cCCceeeeeccccCC
Confidence 35677889999999999999984 45544221 122221 111 111 11121 112223678999998
Q ss_pred C---CEEEEEeCCCcEEEEecCCCCC
Q 003310 392 S---NWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 392 g---~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
+ ..|..-++|.+|+||+++....
T Consensus 177 S~~D~hL~iL~sdnviRiy~lS~~te 202 (741)
T KOG4460|consen 177 SILDPHLVLLTSDNVIRIYSLSEPTE 202 (741)
T ss_pred ccCCceEEEEecCcEEEEEecCCcch
Confidence 7 6788899999999999987543
No 389
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta'. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=58.45 E-value=25 Score=41.21 Aligned_cols=51 Identities=14% Similarity=0.255 Sum_probs=35.1
Q ss_pred cCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeC
Q 003310 297 ADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKI 351 (832)
Q Consensus 297 ~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi 351 (832)
...++.|.+||+.+++.++.+... +|..+.||++|+++|-.+.+. |.|++.
T Consensus 122 ~~~~~~i~~yDw~~~~~i~~i~v~--~vk~V~Ws~~g~~val~t~~~--i~il~~ 172 (443)
T PF04053_consen 122 VKSSDFICFYDWETGKLIRRIDVS--AVKYVIWSDDGELVALVTKDS--IYILKY 172 (443)
T ss_dssp EEETTEEEEE-TTT--EEEEESS---E-EEEEE-TTSSEEEEE-S-S--EEEEEE
T ss_pred EECCCCEEEEEhhHcceeeEEecC--CCcEEEEECCCCEEEEEeCCe--EEEEEe
Confidence 344568999999999999999754 589999999999999998874 566654
No 390
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=57.75 E-value=44 Score=43.09 Aligned_cols=68 Identities=16% Similarity=0.176 Sum_probs=51.5
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
..|.++.|--++.-|..+..+|. |.+-|..+. . ... -|....-|.+++||||.+++|..+.+
T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G~-iilvd~et~---------------~-~ei-vg~vd~GI~aaswS~Dee~l~liT~~ 130 (1265)
T KOG1920|consen 69 DEIVSVQFLADTNSICVITALGD-IILVDPETL---------------E-LEI-VGNVDNGISAASWSPDEELLALITGR 130 (1265)
T ss_pred cceEEEEEecccceEEEEecCCc-EEEEccccc---------------c-eee-eeeccCceEEEeecCCCcEEEEEeCC
Confidence 47899999999999988889998 455565553 1 111 23333349999999999999999999
Q ss_pred CcEEEE
Q 003310 402 GTSHLF 407 (832)
Q Consensus 402 gTVhIw 407 (832)
+|+.+-
T Consensus 131 ~tll~m 136 (1265)
T KOG1920|consen 131 QTLLFM 136 (1265)
T ss_pred cEEEEE
Confidence 998764
No 391
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=56.74 E-value=94 Score=33.39 Aligned_cols=75 Identities=12% Similarity=0.199 Sum_probs=46.8
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEe-CCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC-
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFK-IIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS- 400 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwd-i~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~- 400 (832)
.++.-+|+++|.+.+....+.. .+++. ...+ .... ..+........|..+.+||||..+|....
T Consensus 67 ~l~~PS~d~~g~~W~v~~~~~~-~~~~~~~~~g------------~~~~-~~v~~~~~~~~I~~l~vSpDG~RvA~v~~~ 132 (253)
T PF10647_consen 67 SLTRPSWDPDGWVWTVDDGSGG-VRVVRDSASG------------TGEP-VEVDWPGLRGRITALRVSPDGTRVAVVVED 132 (253)
T ss_pred ccccccccCCCCEEEEEcCCCc-eEEEEecCCC------------ccee-EEecccccCCceEEEEECCCCcEEEEEEec
Confidence 6788899999998877665544 56664 2222 0011 11111111116999999999999998873
Q ss_pred --CCcEEEEecCC
Q 003310 401 --RGTSHLFAINP 411 (832)
Q Consensus 401 --DgTVhIwdl~~ 411 (832)
++.|.|=.+..
T Consensus 133 ~~~~~v~va~V~r 145 (253)
T PF10647_consen 133 GGGGRVYVAGVVR 145 (253)
T ss_pred CCCCeEEEEEEEe
Confidence 46677766643
No 392
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=56.03 E-value=59 Score=24.04 Aligned_cols=24 Identities=13% Similarity=0.337 Sum_probs=19.0
Q ss_pred CCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 331 PSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 331 PdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
|||+.|..+..+...|.++|..+.
T Consensus 1 pd~~~lyv~~~~~~~v~~id~~~~ 24 (42)
T TIGR02276 1 PDGTKLYVTNSGSNTVSVIDTATN 24 (42)
T ss_pred CCCCEEEEEeCCCCEEEEEECCCC
Confidence 688988887776667999998765
No 393
>KOG2377 consensus Uncharacterized conserved protein [Function unknown]
Probab=54.70 E-value=3.1e+02 Score=32.46 Aligned_cols=81 Identities=16% Similarity=0.238 Sum_probs=53.3
Q ss_pred CCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEe
Q 003310 320 HKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISS 399 (832)
Q Consensus 320 H~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS 399 (832)
..++|.+++||+|.+.||.--.+.+ |..+...+... ......+.+-+ .+.|...+|+.. +-+|..+
T Consensus 65 d~G~I~SIkFSlDnkilAVQR~~~~-v~f~nf~~d~~----------~l~~~~~ck~k--~~~IlGF~W~~s-~e~A~i~ 130 (657)
T KOG2377|consen 65 DKGEIKSIKFSLDNKILAVQRTSKT-VDFCNFIPDNS----------QLEYTQECKTK--NANILGFCWTSS-TEIAFIT 130 (657)
T ss_pred CCCceeEEEeccCcceEEEEecCce-EEEEecCCCch----------hhHHHHHhccC--cceeEEEEEecC-eeEEEEe
Confidence 3468999999999999999888765 88887754310 00111122212 345999999865 6777777
Q ss_pred CCCcEEEEecCCCCCc
Q 003310 400 SRGTSHLFAINPLGGS 415 (832)
Q Consensus 400 ~DgTVhIwdl~~~g~~ 415 (832)
..| +-+|.+.+....
T Consensus 131 ~~G-~e~y~v~pekrs 145 (657)
T KOG2377|consen 131 DQG-IEFYQVLPEKRS 145 (657)
T ss_pred cCC-eEEEEEchhhhh
Confidence 655 678887665443
No 394
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=54.03 E-value=8.9 Score=47.57 Aligned_cols=53 Identities=19% Similarity=0.381 Sum_probs=35.7
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCeEEEE-----------EcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPISALC-----------FDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pIs~La-----------FSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
..+|.|++..+..... ..|+.|...+..++ .||||+.||+++.||. ++.|.+.
T Consensus 202 ~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~-v~f~Qiy 265 (1283)
T KOG1916|consen 202 LKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGS-VGFYQIY 265 (1283)
T ss_pred cCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCc-cceeeee
Confidence 4566777755443211 44566765554433 6999999999999997 7888764
No 395
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=51.88 E-value=1.3e+02 Score=37.06 Aligned_cols=80 Identities=15% Similarity=0.152 Sum_probs=53.1
Q ss_pred EeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEE--ccCCC
Q 003310 316 QFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISF--SDDSN 393 (832)
Q Consensus 316 ~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaF--SpDg~ 393 (832)
+|..-....+.+.=|.-++ +|..+.+++.+.|||.+.+ ...|+-.- .....|.++.| .|||+
T Consensus 24 ~~~T~i~~~~li~gss~~k-~a~V~~~~~~LtIWD~~~~--------------~lE~~~~f-~~~~~I~dLDWtst~d~q 87 (631)
T PF12234_consen 24 TFETGISNPSLISGSSIKK-IAVVDSSRSELTIWDTRSG--------------VLEYEESF-SEDDPIRDLDWTSTPDGQ 87 (631)
T ss_pred EEecCCCCcceEeecccCc-EEEEECCCCEEEEEEcCCc--------------EEEEeeee-cCCCceeeceeeecCCCC
Confidence 3333334555565656444 5556777888999999876 22232211 11235999988 57999
Q ss_pred EEEEEeCCCcEEEEecCC
Q 003310 394 WIMISSSRGTSHLFAINP 411 (832)
Q Consensus 394 ~LAsgS~DgTVhIwdl~~ 411 (832)
.+.+.+-...|.||.--.
T Consensus 88 siLaVGf~~~v~l~~Q~R 105 (631)
T PF12234_consen 88 SILAVGFPHHVLLYTQLR 105 (631)
T ss_pred EEEEEEcCcEEEEEEccc
Confidence 999999999999987643
No 396
>PF12657 TFIIIC_delta: Transcription factor IIIC subunit delta N-term; InterPro: IPR024761 This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8.
Probab=51.76 E-value=97 Score=31.24 Aligned_cols=30 Identities=13% Similarity=0.225 Sum_probs=25.2
Q ss_pred cEEEEEEccCC------CEEEEEeCCCcEEEEecCC
Q 003310 382 VIQDISFSDDS------NWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 382 ~I~~IaFSpDg------~~LAsgS~DgTVhIwdl~~ 411 (832)
.|..++|||-| -.||+-+.++.+.||.-..
T Consensus 87 ~vv~~aWSP~Gl~~~~rClLavLTs~~~l~l~~~~~ 122 (173)
T PF12657_consen 87 QVVSAAWSPSGLGPNGRCLLAVLTSNGRLSLYGPPG 122 (173)
T ss_pred cEEEEEECCCCCCCCCceEEEEEcCCCeEEEEecCC
Confidence 58899999943 4699999999999998764
No 397
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=51.55 E-value=1e+02 Score=38.70 Aligned_cols=97 Identities=15% Similarity=0.221 Sum_probs=61.5
Q ss_pred CeEEEEECCCC------cEE---EEec----cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCc
Q 003310 301 GMVIVRDIVSK------NVI---AQFR----AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTS 367 (832)
Q Consensus 301 G~V~IwDl~s~------~~l---~~~~----aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~ 367 (832)
-.|+|||+... .++ ..+. ....|+++|+.|-|=+.+|.|=.+|. |..+.-... ... .
T Consensus 92 ~llkiw~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv~Gf~nG~-V~~~~GDi~-RDr--------g 161 (933)
T KOG2114|consen 92 VLLKIWDLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVSEDLKTIVCGFTNGL-VICYKGDIL-RDR--------G 161 (933)
T ss_pred eEEEEecccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEEccccEEEEEecCcE-EEEEcCcch-hcc--------c
Confidence 37999998642 233 1111 23578999999999999999999998 444422111 000 0
Q ss_pred eeEEEEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 368 YVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 368 ~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
.+..+. ++|- ..|+.++|-.|++-+.-+..-..|++|.+.
T Consensus 162 sr~~~~-~~~~--~pITgL~~~~d~~s~lFv~Tt~~V~~y~l~ 201 (933)
T KOG2114|consen 162 SRQDYS-HRGK--EPITGLALRSDGKSVLFVATTEQVMLYSLS 201 (933)
T ss_pred cceeee-ccCC--CCceeeEEecCCceeEEEEecceeEEEEec
Confidence 122233 3443 349999999999884333345579999998
No 398
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=50.53 E-value=1e+02 Score=34.31 Aligned_cols=69 Identities=17% Similarity=0.240 Sum_probs=38.2
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
+.+..+..++||+++|.+ ..|..+.-||--. +..+.+. |. +..+|+.|.|+||+...+++ ..
T Consensus 145 gs~~~~~r~~dG~~vavs-~~G~~~~s~~~G~-------------~~w~~~~--r~-~~~riq~~gf~~~~~lw~~~-~G 206 (302)
T PF14870_consen 145 GSINDITRSSDGRYVAVS-SRGNFYSSWDPGQ-------------TTWQPHN--RN-SSRRIQSMGFSPDGNLWMLA-RG 206 (302)
T ss_dssp --EEEEEE-TTS-EEEEE-TTSSEEEEE-TT--------------SS-EEEE-----SSS-EEEEEE-TTS-EEEEE-TT
T ss_pred ceeEeEEECCCCcEEEEE-CcccEEEEecCCC-------------ccceEEc--cC-ccceehhceecCCCCEEEEe-CC
Confidence 678889999999988876 5677666665321 1133222 33 34569999999998776654 56
Q ss_pred CcEEEEe
Q 003310 402 GTSHLFA 408 (832)
Q Consensus 402 gTVhIwd 408 (832)
|-++.=+
T Consensus 207 g~~~~s~ 213 (302)
T PF14870_consen 207 GQIQFSD 213 (302)
T ss_dssp TEEEEEE
T ss_pred cEEEEcc
Confidence 6555433
No 399
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=49.40 E-value=33 Score=41.78 Aligned_cols=75 Identities=16% Similarity=0.281 Sum_probs=55.7
Q ss_pred CCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 322 SPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
..|.--||+-.+++|+-|+.-|- +.+|.-..+ ...+++.. | ....+..++.|++..++|+|+..
T Consensus 34 ~~v~lTc~dst~~~l~~GsS~G~-lyl~~R~~~-------------~~~~~~~~-~-~~~~~~~~~vs~~e~lvAagt~~ 97 (726)
T KOG3621|consen 34 ARVKLTCVDATEEYLAMGSSAGS-VYLYNRHTG-------------EMRKLKNE-G-ATGITCVRSVSSVEYLVAAGTAS 97 (726)
T ss_pred ceEEEEEeecCCceEEEecccce-EEEEecCch-------------hhhccccc-C-ccceEEEEEecchhHhhhhhcCC
Confidence 35666788999999999999986 788875543 12223321 2 22336678899999999999999
Q ss_pred CcEEEEecCCC
Q 003310 402 GTSHLFAINPL 412 (832)
Q Consensus 402 gTVhIwdl~~~ 412 (832)
|.|-||.++..
T Consensus 98 g~V~v~ql~~~ 108 (726)
T KOG3621|consen 98 GRVSVFQLNKE 108 (726)
T ss_pred ceEEeehhhcc
Confidence 99999999883
No 400
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=46.66 E-value=16 Score=42.30 Aligned_cols=59 Identities=20% Similarity=0.240 Sum_probs=47.3
Q ss_pred cccCCCCeEEEEECCC---CcEEEEeccCCCCeEEEEEcCCCCEEEEEEc-CCCEEEEEeCCCC
Q 003310 295 PDADNVGMVIVRDIVS---KNVIAQFRAHKSPISALCFDPSGILLVTASV-QGHNINIFKIIPG 354 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s---~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~-dGt~I~Iwdi~t~ 354 (832)
.++.-+|.++.|--.. -+.+..|.+|...|.+|+.|-||.++.|.+. | +.+++||+...
T Consensus 24 iqASlDGh~KFWkKs~isGvEfVKhFraHL~~I~sl~~S~dg~L~~Sv~d~D-hs~KvfDvEn~ 86 (558)
T KOG0882|consen 24 IQASLDGHKKFWKKSRISGVEFVKHFRAHLGVILSLAVSYDGWLFRSVEDPD-HSVKVFDVENF 86 (558)
T ss_pred EeeecchhhhhcCCCCccceeehhhhHHHHHHHHhhhccccceeEeeccCcc-cceeEEEeecc
Confidence 4566788888886432 2467788999999999999999999999777 7 45899998754
No 401
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=45.99 E-value=7.4 Score=46.60 Aligned_cols=97 Identities=15% Similarity=0.248 Sum_probs=62.5
Q ss_pred CCCeEEEEECCCC----cEEEEecc-CCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 299 NVGMVIVRDIVSK----NVIAQFRA-HKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 299 ~~G~V~IwDl~s~----~~l~~~~a-H~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
.+..+.|||+.++ +.-..|.+ -.....++|+..|-+++.+|.... .++|||++-... ....
T Consensus 127 nds~~~Iwdi~s~ltvPke~~~fs~~~l~gqns~cwlrd~klvlaGm~sr-~~~ifdlRqs~~-------------~~~s 192 (783)
T KOG1008|consen 127 NDSSLKIWDINSLLTVPKESPLFSSSTLDGQNSVCWLRDTKLVLAGMTSR-SVHIFDLRQSLD-------------SVSS 192 (783)
T ss_pred ccCCccceecccccCCCccccccccccccCccccccccCcchhhcccccc-hhhhhhhhhhhh-------------hhhh
Confidence 4557999999876 22233443 334566899999999988888774 489999974310 0001
Q ss_pred EeccCccccEEEEEEcc-CCCEEEEEeCCCcEEEEe-cCCCCC
Q 003310 374 LQRGLTNAVIQDISFSD-DSNWIMISSSRGTSHLFA-INPLGG 414 (832)
Q Consensus 374 l~rG~t~a~I~~IaFSp-Dg~~LAsgS~DgTVhIwd-l~~~g~ 414 (832)
+ .| .-++.+..+| .+.++++-+ ||-|-||| ......
T Consensus 193 v---nT-k~vqG~tVdp~~~nY~cs~~-dg~iAiwD~~rnien 230 (783)
T KOG1008|consen 193 V---NT-KYVQGITVDPFSPNYFCSNS-DGDIAIWDTYRNIEN 230 (783)
T ss_pred h---hh-hhcccceecCCCCCceeccc-cCceeeccchhhhcc
Confidence 1 01 1256667777 677887766 99999999 433333
No 402
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=45.26 E-value=7.5 Score=46.55 Aligned_cols=102 Identities=10% Similarity=0.144 Sum_probs=65.8
Q ss_pred CCCCeEEEEECCCCcE--EEEeccCCCCeEEEEEcC-CCCEEEEEEcC---CCEEEEEeCCCCCCCCCCccCCCCceeEE
Q 003310 298 DNVGMVIVRDIVSKNV--IAQFRAHKSPISALCFDP-SGILLVTASVQ---GHNINIFKIIPGILGTSSACDAGTSYVHL 371 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~--l~~~~aH~~pIs~LaFSP-dG~lLATaS~d---Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l 371 (832)
...|.|.+-.+....- -...++|..+.++++|++ |...||.|=++ ...+.|||+.+....+ ..-
T Consensus 77 ~atG~I~l~s~r~~hdSs~E~tp~~ar~Ct~lAwneLDtn~LAagldkhrnds~~~Iwdi~s~ltvP----------ke~ 146 (783)
T KOG1008|consen 77 SATGNISLLSVRHPHDSSAEVTPGYARPCTSLAWNELDTNHLAAGLDKHRNDSSLKIWDINSLLTVP----------KES 146 (783)
T ss_pred cccCceEEeecCCcccccceecccccccccccccccccHHHHHhhhhhhcccCCccceecccccCCC----------ccc
Confidence 4457777766654321 234467888999999998 55667766321 1348899998762100 000
Q ss_pred EEEeccCccccEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 372 YRLQRGLTNAVIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 372 ~~l~rG~t~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
..|.-+ +--...++||-.|-+.|.+|...+.+||||+.
T Consensus 147 ~~fs~~-~l~gqns~cwlrd~klvlaGm~sr~~~ifdlR 184 (783)
T KOG1008|consen 147 PLFSSS-TLDGQNSVCWLRDTKLVLAGMTSRSVHIFDLR 184 (783)
T ss_pred cccccc-cccCccccccccCcchhhcccccchhhhhhhh
Confidence 111111 11125589999999999999999999999997
No 403
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=45.01 E-value=2.6e+02 Score=27.65 Aligned_cols=56 Identities=16% Similarity=0.220 Sum_probs=42.9
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEEEEEc------CCEEEEEeCCEEEEEECCCCceEEEEe
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYSVRCS------SRVVAICQAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S------~r~LAVa~~~~I~vwDl~t~~~~~tl~ 172 (832)
+.+..||...+.-+-.-.++..|.+|.+- ..++.|+..-.|.-||..--+..+++.
T Consensus 73 t~llaYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~Gfd~~G~e~fWtVt 134 (136)
T PF14781_consen 73 TSLLAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQGFDYEGNEIFWTVT 134 (136)
T ss_pred ceEEEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEEeCCCCcEEEEEec
Confidence 77999999888776666678899999882 456666777789999987666666653
No 404
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=44.04 E-value=2.5e+02 Score=32.48 Aligned_cols=31 Identities=16% Similarity=0.345 Sum_probs=25.8
Q ss_pred CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 321 KSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 321 ~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
.++|..|++||+|++||--..+|+ +.|....
T Consensus 216 ~~~i~~iavSpng~~iAl~t~~g~-l~v~ssD 246 (410)
T PF04841_consen 216 DGPIIKIAVSPNGKFIALFTDSGN-LWVVSSD 246 (410)
T ss_pred CCCeEEEEECCCCCEEEEEECCCC-EEEEECc
Confidence 368999999999999999888998 5666543
No 405
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=43.65 E-value=3.4e+02 Score=29.33 Aligned_cols=107 Identities=12% Similarity=0.257 Sum_probs=62.5
Q ss_pred cccCCCCeEEEEECCCCcEEEEeccC-CCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEE
Q 003310 295 PDADNVGMVIVRDIVSKNVIAQFRAH-KSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYR 373 (832)
Q Consensus 295 ~s~~~~G~V~IwDl~s~~~l~~~~aH-~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~ 373 (832)
+..+.++.|..+|.. ++++..++-. .+-.-.|++--+|.++++.-.+++ +.++++..... ........+
T Consensus 38 aV~d~~~~i~els~~-G~vlr~i~l~g~~D~EgI~y~g~~~~vl~~Er~~~-L~~~~~~~~~~--------~~~~~~~~~ 107 (248)
T PF06977_consen 38 AVQDEPGEIYELSLD-GKVLRRIPLDGFGDYEGITYLGNGRYVLSEERDQR-LYIFTIDDDTT--------SLDRADVQK 107 (248)
T ss_dssp EEETTTTEEEEEETT---EEEEEE-SS-SSEEEEEE-STTEEEEEETTTTE-EEEEEE----T--------T--EEEEEE
T ss_pred EEECCCCEEEEEcCC-CCEEEEEeCCCCCCceeEEEECCCEEEEEEcCCCc-EEEEEEecccc--------ccchhhceE
Confidence 345677889888974 7788887633 256788899888877766544665 77888844310 001122222
Q ss_pred EeccCc---cccEEEEEEccCCCEEEEEeCCCcEEEEecCC
Q 003310 374 LQRGLT---NAVIQDISFSDDSNWIMISSSRGTSHLFAINP 411 (832)
Q Consensus 374 l~rG~t---~a~I~~IaFSpDg~~LAsgS~DgTVhIwdl~~ 411 (832)
+.-+.. +..+-.|||.|.++.|.++-.+....||.+..
T Consensus 108 ~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~ 148 (248)
T PF06977_consen 108 ISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNG 148 (248)
T ss_dssp EE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEES
T ss_pred EecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEcc
Confidence 322322 22389999999888888887777777877754
No 406
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=43.38 E-value=47 Score=39.82 Aligned_cols=69 Identities=28% Similarity=0.372 Sum_probs=38.2
Q ss_pred EEEEcCCCCEEEEEEcCCCEEE---------EEeCCCCCCCC-CCccCCCCceeEEEEEeccCccccEEEEEEccCCCEE
Q 003310 326 ALCFDPSGILLVTASVQGHNIN---------IFKIIPGILGT-SSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWI 395 (832)
Q Consensus 326 ~LaFSPdG~lLATaS~dGt~I~---------Iwdi~t~~~~~-~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~L 395 (832)
.|+|+|+|.+++.-+..+...+ +|++... .+. .... ......+.+|-.+...+.+..++|+||++.|
T Consensus 440 NL~~d~~G~LwI~eD~~~~~~~l~g~t~~G~~~~~~~~-~G~~~~~~--~~~~g~~~rf~~~P~gaE~tG~~fspDg~tl 516 (524)
T PF05787_consen 440 NLAFDPDGNLWIQEDGGGSNNNLPGVTPDGEVYDFARN-DGNNVWAY--DPDTGELKRFLVGPNGAEITGPCFSPDGRTL 516 (524)
T ss_pred ceEECCCCCEEEEeCCCCCCcccccccccCceeeeeec-ccceeeec--cccccceeeeccCCCCcccccceECCCCCEE
Confidence 4899999998776554443222 1222100 000 0000 0111234455556667789999999999988
Q ss_pred EE
Q 003310 396 MI 397 (832)
Q Consensus 396 As 397 (832)
.+
T Consensus 517 Fv 518 (524)
T PF05787_consen 517 FV 518 (524)
T ss_pred EE
Confidence 65
No 407
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.02 E-value=1.1e+02 Score=33.95 Aligned_cols=81 Identities=14% Similarity=0.272 Sum_probs=46.9
Q ss_pred EEEEECCCCcEEEEeccCCCC--eEEEEEcCCCCEEEEEEcC----CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe-
Q 003310 303 VIVRDIVSKNVIAQFRAHKSP--ISALCFDPSGILLVTASVQ----GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ- 375 (832)
Q Consensus 303 V~IwDl~s~~~l~~~~aH~~p--Is~LaFSPdG~lLATaS~d----Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~- 375 (832)
..++|....+.+.++...++. --.=+|||||.+|...-.| --+|-|||.+.+ ..++-++.
T Consensus 93 ~~vfD~~~~~~pv~~~s~~~RHfyGHGvfs~dG~~LYATEndfd~~rGViGvYd~r~~-------------fqrvgE~~t 159 (366)
T COG3490 93 AMVFDPNGAQEPVTLVSQEGRHFYGHGVFSPDGRLLYATENDFDPNRGVIGVYDAREG-------------FQRVGEFST 159 (366)
T ss_pred EEEECCCCCcCcEEEecccCceeecccccCCCCcEEEeecCCCCCCCceEEEEecccc-------------cceeccccc
Confidence 456787776554444322111 1123699999998654332 125889998765 12222221
Q ss_pred ccCccccEEEEEEccCCCEEEEEe
Q 003310 376 RGLTNAVIQDISFSDDSNWIMISS 399 (832)
Q Consensus 376 rG~t~a~I~~IaFSpDg~~LAsgS 399 (832)
-|.. -..+.|.+||+.|+.+.
T Consensus 160 ~GiG---pHev~lm~DGrtlvvan 180 (366)
T COG3490 160 HGIG---PHEVTLMADGRTLVVAN 180 (366)
T ss_pred CCcC---cceeEEecCCcEEEEeC
Confidence 1222 24678999999998874
No 408
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins.; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=42.76 E-value=98 Score=36.88 Aligned_cols=36 Identities=11% Similarity=0.205 Sum_probs=27.7
Q ss_pred EEEEEEcc----CCCEEEEEeCCCcEEEEecCCCCCceee
Q 003310 383 IQDISFSD----DSNWIMISSSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 383 I~~IaFSp----Dg~~LAsgS~DgTVhIwdl~~~g~~~~~ 418 (832)
+.+++++. +..+|++.+.|++++||++.+.....+.
T Consensus 217 ~~~~~~~~~~~~~~~~l~tl~~D~~LRiW~l~t~~~~~~~ 256 (547)
T PF11715_consen 217 AASLAVSSSEINDDTFLFTLSRDHTLRIWSLETGQCLATI 256 (547)
T ss_dssp EEEEEE-----ETTTEEEEEETTSEEEEEETTTTCEEEEE
T ss_pred cceEEEecceeCCCCEEEEEeCCCeEEEEECCCCeEEEEe
Confidence 55566666 7889999999999999999987764444
No 409
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=42.45 E-value=43 Score=35.35 Aligned_cols=38 Identities=13% Similarity=0.233 Sum_probs=30.0
Q ss_pred EEEeCCCCEEEEEEcCCE-EEEEeCCEEEEEECCCCceE
Q 003310 131 HMLKFRSPIYSVRCSSRV-VAICQAAQVHCFDAATLEIE 168 (832)
Q Consensus 131 ~tL~f~s~V~sV~~S~r~-LAVa~~~~I~vwDl~t~~~~ 168 (832)
-.|...++|.-+.++++. +|+...+.+++||+.+++..
T Consensus 7 P~i~Lgs~~~~l~~~~~~Ll~iT~~G~l~vWnl~~~k~~ 45 (219)
T PF07569_consen 7 PPIVLGSPVSFLECNGSYLLAITSSGLLYVWNLKKGKAV 45 (219)
T ss_pred CcEecCCceEEEEeCCCEEEEEeCCCeEEEEECCCCeec
Confidence 456678888889998665 45688999999999998753
No 410
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=41.56 E-value=1.6e+02 Score=31.11 Aligned_cols=73 Identities=16% Similarity=0.235 Sum_probs=46.0
Q ss_pred EcCCCCEEEEEEcCCCEEEEEeCCCCCC--CCCCccCCCCceeEEEEE----eccCccccEEEEEEccCCCEEEEEeCCC
Q 003310 329 FDPSGILLVTASVQGHNINIFKIIPGIL--GTSSACDAGTSYVHLYRL----QRGLTNAVIQDISFSDDSNWIMISSSRG 402 (832)
Q Consensus 329 FSPdG~lLATaS~dGt~I~Iwdi~t~~~--~~~s~~~~~~~~~~l~~l----~rG~t~a~I~~IaFSpDg~~LAsgS~Dg 402 (832)
+..+|.+|+.-..+|. +.+||+..... ...|. ..+..- .+ .....|..+.++.+|.-|++-+ +|
T Consensus 18 l~~~~~~Ll~iT~~G~-l~vWnl~~~k~~~~~~Si-------~pll~~~~~~~~-~~~~~i~~~~lt~~G~PiV~ls-ng 87 (219)
T PF07569_consen 18 LECNGSYLLAITSSGL-LYVWNLKKGKAVLPPVSI-------APLLNSSPVSDK-SSSPNITSCSLTSNGVPIVTLS-NG 87 (219)
T ss_pred EEeCCCEEEEEeCCCe-EEEEECCCCeeccCCccH-------HHHhcccccccC-CCCCcEEEEEEcCCCCEEEEEe-CC
Confidence 5567888888888997 89999988621 11010 000000 00 1123488899999998887665 57
Q ss_pred cEEEEecCC
Q 003310 403 TSHLFAINP 411 (832)
Q Consensus 403 TVhIwdl~~ 411 (832)
....|+..-
T Consensus 88 ~~y~y~~~L 96 (219)
T PF07569_consen 88 DSYSYSPDL 96 (219)
T ss_pred CEEEecccc
Confidence 788888754
No 411
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity. Mammals have 3 distinct paraoxonase types, termed PON1-3. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL). Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1, 2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation.; GO: 0004064 arylesterase activity
Probab=41.37 E-value=1.1e+02 Score=27.64 Aligned_cols=49 Identities=24% Similarity=0.274 Sum_probs=33.8
Q ss_pred CeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCC
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKII 352 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~ 352 (832)
|.|..||-...+.+ ..+ -..-+.|.+||++++|..++.-++.|++|...
T Consensus 36 ~~Vvyyd~~~~~~v--a~g-~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~~ 84 (86)
T PF01731_consen 36 GNVVYYDGKEVKVV--ASG-FSFANGIAISPDKKYLYVASSLAHSIHVYKRH 84 (86)
T ss_pred ceEEEEeCCEeEEe--ecc-CCCCceEEEcCCCCEEEEEeccCCeEEEEEec
Confidence 46777886543222 222 12346799999999999988887779998764
No 412
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=39.71 E-value=1.6e+02 Score=32.30 Aligned_cols=58 Identities=7% Similarity=0.128 Sum_probs=44.1
Q ss_pred CCCEEEEEECCCCcEEEEEeCCCCEEEEEEcCCEEEEE-eCCEEEEEECCCCceEEEEe
Q 003310 115 VPTVVHFYSLRSQSYVHMLKFRSPIYSVRCSSRVVAIC-QAAQVHCFDAATLEIEYAIL 172 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f~s~V~sV~~S~r~LAVa-~~~~I~vwDl~t~~~~~tl~ 172 (832)
..++..+||..+-+.+.++.++..=+.++..++.|.++ ...+|+++|..+++...++.
T Consensus 108 k~~~~f~yd~~tl~~~~~~~y~~EGWGLt~dg~~Li~SDGS~~L~~~dP~~f~~~~~i~ 166 (264)
T PF05096_consen 108 KEGTGFVYDPNTLKKIGTFPYPGEGWGLTSDGKRLIMSDGSSRLYFLDPETFKEVRTIQ 166 (264)
T ss_dssp SSSEEEEEETTTTEEEEEEE-SSS--EEEECSSCEEEE-SSSEEEEE-TTT-SEEEEEE
T ss_pred cCCeEEEEccccceEEEEEecCCcceEEEcCCCEEEEECCccceEEECCcccceEEEEE
Confidence 34889999999999999999998888999876655554 56799999999999887774
No 413
>PF10214 Rrn6: RNA polymerase I-specific transcription-initiation factor; InterPro: IPR019350 RNA polymerase I-specific transcription-initiation factor Rrn6 and Rrn7 represent components of a multisubunit transcription factor essential for the initiation of rDNA transcription by Pol I. These proteins are found in fungi.
Probab=38.99 E-value=1.4e+02 Score=37.63 Aligned_cols=84 Identities=19% Similarity=0.233 Sum_probs=52.2
Q ss_pred CCeEEEEEcC-CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCc------cccEEEEEEccCCCE
Q 003310 322 SPISALCFDP-SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLT------NAVIQDISFSDDSNW 394 (832)
Q Consensus 322 ~pIs~LaFSP-dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t------~a~I~~IaFSpDg~~ 394 (832)
.+...++|+| +-+.||..+..|+ ..|||+........ ....+.....|.- ......|.|.++-+.
T Consensus 146 ~~~aDv~FnP~~~~q~AiVD~~G~-Wsvw~i~~~~~~~~-------~~~~~~~~~~gsi~~d~~e~s~w~rI~W~~~~~~ 217 (765)
T PF10214_consen 146 FPHADVAFNPWDQRQFAIVDEKGN-WSVWDIKGRPKRKS-------SNLRLSRNISGSIIFDPEELSNWKRILWVSDSNR 217 (765)
T ss_pred CccceEEeccCccceEEEEeccCc-EEEEEeccccccCC-------cceeeccCCCccccCCCcccCcceeeEecCCCCE
Confidence 4788999999 4668999999998 89999932211000 0011110011210 112457899999888
Q ss_pred EEEEeCCCcEEEEecCCCCC
Q 003310 395 IMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 395 LAsgS~DgTVhIwdl~~~g~ 414 (832)
|++++ +..+.++|+.+...
T Consensus 218 lLv~~-r~~l~~~d~~~~~~ 236 (765)
T PF10214_consen 218 LLVCN-RSKLMLIDFESNWQ 236 (765)
T ss_pred EEEEc-CCceEEEECCCCCc
Confidence 87765 66788999976544
No 414
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=38.18 E-value=72 Score=38.31 Aligned_cols=64 Identities=19% Similarity=0.307 Sum_probs=38.3
Q ss_pred EEEEEcCCCCEEEEEEcCC-----CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE
Q 003310 325 SALCFDPSGILLVTASVQG-----HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS 398 (832)
Q Consensus 325 s~LaFSPdG~lLATaS~dG-----t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg 398 (832)
-.|+|+|.|.+.+.-+..+ +..-+|.+.+. ++....+..+.++-..+.+...|||||++.|.++
T Consensus 503 Dnl~fD~~GrLWi~TDg~~s~~~~~~~G~~~m~~~----------~p~~g~~~rf~t~P~g~E~tG~~FspD~~TlFV~ 571 (616)
T COG3211 503 DNLAFDPWGRLWIQTDGSGSTLRNRFRGVTQMLTP----------DPKTGTIKRFLTGPIGCEFTGPCFSPDGKTLFVN 571 (616)
T ss_pred CceEECCCCCEEEEecCCCCccCcccccccccccC----------CCccceeeeeccCCCcceeecceeCCCCceEEEE
Confidence 4589999999876533221 12223322221 1122344555566556779999999999887654
No 415
>PRK10115 protease 2; Provisional
Probab=34.84 E-value=1.6e+02 Score=36.44 Aligned_cols=73 Identities=12% Similarity=0.110 Sum_probs=43.7
Q ss_pred CCeEEEEEcCCCCEEEEEEcC-C---CEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEE
Q 003310 322 SPISALCFDPSGILLVTASVQ-G---HNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMI 397 (832)
Q Consensus 322 ~pIs~LaFSPdG~lLATaS~d-G---t~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAs 397 (832)
-.+..+.+||||++||-+-+. | ..|+|.|+.++ ..+.+ ...... ..++|++|++.|+.
T Consensus 127 ~~l~~~~~Spdg~~la~~~d~~G~E~~~l~v~d~~tg--------------~~l~~---~i~~~~-~~~~w~~D~~~~~y 188 (686)
T PRK10115 127 YTLGGMAITPDNTIMALAEDFLSRRQYGIRFRNLETG--------------NWYPE---LLDNVE-PSFVWANDSWTFYY 188 (686)
T ss_pred EEEeEEEECCCCCEEEEEecCCCcEEEEEEEEECCCC--------------CCCCc---cccCcc-eEEEEeeCCCEEEE
Confidence 357778999999988865433 3 23667777655 11111 111112 45999999998877
Q ss_pred EeCCC------cEEEEecCCC
Q 003310 398 SSSRG------TSHLFAINPL 412 (832)
Q Consensus 398 gS~Dg------TVhIwdl~~~ 412 (832)
+..+. .|..+++.+.
T Consensus 189 ~~~~~~~~~~~~v~~h~lgt~ 209 (686)
T PRK10115 189 VRKHPVTLLPYQVWRHTIGTP 209 (686)
T ss_pred EEecCCCCCCCEEEEEECCCC
Confidence 76542 3445555544
No 416
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=34.53 E-value=67 Score=24.10 Aligned_cols=23 Identities=26% Similarity=0.222 Sum_probs=14.7
Q ss_pred EEEc-CCEEEEEeCCEEEEEECCC
Q 003310 142 VRCS-SRVVAICQAAQVHCFDAAT 164 (832)
Q Consensus 142 V~~S-~r~LAVa~~~~I~vwDl~t 164 (832)
+.+. .++++.+.+++++++|++|
T Consensus 17 ~~v~~g~vyv~~~dg~l~ald~~t 40 (40)
T PF13570_consen 17 PAVAGGRVYVGTGDGNLYALDAAT 40 (40)
T ss_dssp -EECTSEEEEE-TTSEEEEEETT-
T ss_pred CEEECCEEEEEcCCCEEEEEeCCC
Confidence 3444 4555667789999999875
No 417
>PF10214 Rrn6: RNA polymerase I-specific transcription-initiation factor; InterPro: IPR019350 RNA polymerase I-specific transcription-initiation factor Rrn6 and Rrn7 represent components of a multisubunit transcription factor essential for the initiation of rDNA transcription by Pol I []. These proteins are found in fungi.
Probab=34.39 E-value=1e+03 Score=29.90 Aligned_cols=72 Identities=13% Similarity=0.105 Sum_probs=45.3
Q ss_pred CeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccC--CCEEEEEeC
Q 003310 323 PISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDD--SNWIMISSS 400 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpD--g~~LAsgS~ 400 (832)
....|.|.++-..|+.++.. + +.++|+.+... ..+ +-...+...|.++.=+|+ +..++-.+
T Consensus 205 ~w~rI~W~~~~~~lLv~~r~-~-l~~~d~~~~~~------------~~~--l~~~~~~~~IlDv~~~~~~~~~~FiLTs- 267 (765)
T PF10214_consen 205 NWKRILWVSDSNRLLVCNRS-K-LMLIDFESNWQ------------TEY--LVTAKTWSWILDVKRSPDNPSHVFILTS- 267 (765)
T ss_pred cceeeEecCCCCEEEEEcCC-c-eEEEECCCCCc------------cch--hccCCChhheeeEEecCCccceEEEEec-
Confidence 34578999988888887765 3 78999987511 111 222223345999999987 44443332
Q ss_pred CCcEEEEecCCC
Q 003310 401 RGTSHLFAINPL 412 (832)
Q Consensus 401 DgTVhIwdl~~~ 412 (832)
..|..+++.+.
T Consensus 268 -~eiiw~~~~~~ 278 (765)
T PF10214_consen 268 -KEIIWLDVKSS 278 (765)
T ss_pred -CeEEEEEccCC
Confidence 56777777764
No 418
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family is unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=33.95 E-value=1.4e+02 Score=35.16 Aligned_cols=68 Identities=21% Similarity=0.300 Sum_probs=0.0
Q ss_pred EEEEcCCCCEEEEEEcCCCEE-----------------------------EEEeCCCCCCCCCCccCCCCceeEEEEEec
Q 003310 326 ALCFDPSGILLVTASVQGHNI-----------------------------NIFKIIPGILGTSSACDAGTSYVHLYRLQR 376 (832)
Q Consensus 326 ~LaFSPdG~lLATaS~dGt~I-----------------------------~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~r 376 (832)
.|+|.|||+++++.++.|... +|+++.....-.....-..+....+|..
T Consensus 150 rI~FgPDG~LYVs~GD~g~~~~~n~~~~~~aQ~~~~~~~~~~~d~~~~~GkILRin~DGsiP~dNPf~~g~~~eIyA~-- 227 (454)
T TIGR03606 150 RLVFGPDGKIYYTIGEQGRNQGANFFLPNQAQHTPTQQELNGKDYHAYMGKVLRLNLDGSIPKDNPSINGVVSHIFTY-- 227 (454)
T ss_pred eEEECCCCcEEEEECCCCCCCcccccCcchhccccccccccccCcccCceEEEEEcCCCCCCCCCCccCCCcceEEEE--
Q ss_pred cCccccEEEEEEccCCCEEEE
Q 003310 377 GLTNAVIQDISFSDDSNWIMI 397 (832)
Q Consensus 377 G~t~a~I~~IaFSpDg~~LAs 397 (832)
|+.+ .+.++|.|+|++.++
T Consensus 228 G~RN--p~Gla~dp~G~Lw~~ 246 (454)
T TIGR03606 228 GHRN--PQGLAFTPDGTLYAS 246 (454)
T ss_pred eccc--cceeEECCCCCEEEE
No 419
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=33.88 E-value=2.8e+02 Score=30.99 Aligned_cols=92 Identities=16% Similarity=0.116 Sum_probs=40.5
Q ss_pred CCCeEE-EEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEecc
Q 003310 299 NVGMVI-VRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRG 377 (832)
Q Consensus 299 ~~G~V~-IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG 377 (832)
..|.+. -||-....-..+-+.-...|.+|.|+|||.+.+.+ ..|. |+.=+.... ............
T Consensus 163 ~~G~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~-~Gg~-~~~s~~~~~-----------~~~w~~~~~~~~ 229 (302)
T PF14870_consen 163 SRGNFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA-RGGQ-IQFSDDPDD-----------GETWSEPIIPIK 229 (302)
T ss_dssp TTSSEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE-TTTE-EEEEE-TTE-----------EEEE---B-TTS
T ss_pred CcccEEEEecCCCccceEEccCccceehhceecCCCCEEEEe-CCcE-EEEccCCCC-----------ccccccccCCcc
Confidence 334433 35543322222333445789999999999987765 4443 544331111 000000000000
Q ss_pred CccccEEEEEEccCCCEEEEEeCCCcE
Q 003310 378 LTNAVIQDISFSDDSNWIMISSSRGTS 404 (832)
Q Consensus 378 ~t~a~I~~IaFSpDg~~LAsgS~DgTV 404 (832)
....-|.+++|.+++...|+|+ .|++
T Consensus 230 ~~~~~~ld~a~~~~~~~wa~gg-~G~l 255 (302)
T PF14870_consen 230 TNGYGILDLAYRPPNEIWAVGG-SGTL 255 (302)
T ss_dssp S--S-EEEEEESSSS-EEEEES-TT-E
T ss_pred cCceeeEEEEecCCCCEEEEeC-CccE
Confidence 1112389999999988887665 4444
No 420
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=32.73 E-value=80 Score=23.59 Aligned_cols=28 Identities=21% Similarity=0.212 Sum_probs=22.4
Q ss_pred EEEEEeCCEEEEEECCCCceEEEEecCC
Q 003310 148 VVAICQAAQVHCFDAATLEIEYAILTNP 175 (832)
Q Consensus 148 ~LAVa~~~~I~vwDl~t~~~~~tl~t~~ 175 (832)
+++...++.|+.+|+.|++.+......+
T Consensus 3 v~~~~~~g~l~AlD~~TG~~~W~~~~~~ 30 (38)
T PF01011_consen 3 VYVGTPDGYLYALDAKTGKVLWKFQTGP 30 (38)
T ss_dssp EEEETTTSEEEEEETTTTSEEEEEESSS
T ss_pred EEEeCCCCEEEEEECCCCCEEEeeeCCC
Confidence 3444678899999999999999887643
No 421
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=32.40 E-value=2.3e+02 Score=25.88 Aligned_cols=16 Identities=13% Similarity=0.530 Sum_probs=12.5
Q ss_pred EEEEEEccCCCEEEEE
Q 003310 383 IQDISFSDDSNWIMIS 398 (832)
Q Consensus 383 I~~IaFSpDg~~LAsg 398 (832)
-+.|++|+|+++|+.+
T Consensus 59 pNGVals~d~~~vlv~ 74 (89)
T PF03088_consen 59 PNGVALSPDESFVLVA 74 (89)
T ss_dssp EEEEEE-TTSSEEEEE
T ss_pred cCeEEEcCCCCEEEEE
Confidence 5789999999987665
No 422
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=32.22 E-value=4.9e+02 Score=28.71 Aligned_cols=55 Identities=11% Similarity=0.117 Sum_probs=39.8
Q ss_pred CCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEE-----cCCCEEEEEeCCCC
Q 003310 300 VGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTAS-----VQGHNINIFKIIPG 354 (832)
Q Consensus 300 ~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS-----~dGt~I~Iwdi~t~ 354 (832)
.-.|-+||..+.+-..--..-.+.|++|.|.-+.++|+.|. .....+..||+...
T Consensus 15 C~~lC~yd~~~~qW~~~g~~i~G~V~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~ 74 (281)
T PF12768_consen 15 CPGLCLYDTDNSQWSSPGNGISGTVTDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQ 74 (281)
T ss_pred CCEEEEEECCCCEeecCCCCceEEEEEEEEecCCEEEEEEeeEECCCCceeEEEEecCCC
Confidence 44689999988765443344567899999998888888876 23345788888764
No 423
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=31.27 E-value=2.5e+02 Score=30.12 Aligned_cols=67 Identities=12% Similarity=0.127 Sum_probs=44.1
Q ss_pred CeEEEEEcCCCCEEEEEE--cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeC
Q 003310 323 PISALCFDPSGILLVTAS--VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSS 400 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS--~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~ 400 (832)
.+.+.+.|+||+.+|... .++..+.++..... ...+. .|. .+..-+|++++...+....
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~~~-------------~~~~~---~g~---~l~~PS~d~~g~~W~v~~~ 85 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGPAGGP-------------VRPVL---TGG---SLTRPSWDPDGWVWTVDDG 85 (253)
T ss_pred cccceEECCCCCeEEEEEEcCCCCEEEEEcCCCc-------------ceeec---cCC---ccccccccCCCCEEEEEcC
Confidence 688999999999988877 66665555543321 11111 221 3667889999777666666
Q ss_pred CCcEEEEe
Q 003310 401 RGTSHLFA 408 (832)
Q Consensus 401 DgTVhIwd 408 (832)
+....++.
T Consensus 86 ~~~~~~~~ 93 (253)
T PF10647_consen 86 SGGVRVVR 93 (253)
T ss_pred CCceEEEE
Confidence 66666664
No 424
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variability. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=31.27 E-value=7.4e+02 Score=27.20 Aligned_cols=53 Identities=15% Similarity=0.312 Sum_probs=38.0
Q ss_pred eEEEEECCCCcEEEEeccC------CCCeEEEEEcCC-C----CEEEEEEcCCCEEEEEeCCCC
Q 003310 302 MVIVRDIVSKNVIAQFRAH------KSPISALCFDPS-G----ILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH------~~pIs~LaFSPd-G----~lLATaS~dGt~I~Iwdi~t~ 354 (832)
++.+||+.+.++++++.-. .+-+..|.++.. + .+...++..+.-|-|+|+.++
T Consensus 35 KLv~~Dl~t~~li~~~~~p~~~~~~~s~lndl~VD~~~~~~~~~~aYItD~~~~glIV~dl~~~ 98 (287)
T PF03022_consen 35 KLVAFDLKTNQLIRRYPFPPDIAPPDSFLNDLVVDVRDGNCDDGFAYITDSGGPGLIVYDLATG 98 (287)
T ss_dssp EEEEEETTTTCEEEEEE--CCCS-TCGGEEEEEEECTTTTS-SEEEEEEETTTCEEEEEETTTT
T ss_pred EEEEEECCCCcEEEEEECChHHcccccccceEEEEccCCCCcceEEEEeCCCcCcEEEEEccCC
Confidence 7889999999998877622 356888998873 2 344455555456889999987
No 425
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood.
They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=30.91 E-value=74 Score=28.83 Aligned_cols=29 Identities=21% Similarity=0.425 Sum_probs=23.7
Q ss_pred EEEEEEccCCCEEEEEeC-CCcEEEEecCC
Q 003310 383 IQDISFSDDSNWIMISSS-RGTSHLFAINP 411 (832)
Q Consensus 383 I~~IaFSpDg~~LAsgS~-DgTVhIwdl~~ 411 (832)
-+.|++|||+++|.+++. +++||||..++
T Consensus 56 aNGI~~s~~~k~lyVa~~~~~~I~vy~~~~ 85 (86)
T PF01731_consen 56 ANGIAISPDKKYLYVASSLAHSIHVYKRHK 85 (86)
T ss_pred CceEEEcCCCCEEEEEeccCCeEEEEEecC
Confidence 468999999999877665 68999998764
No 426
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=30.80 E-value=1.3e+03 Score=29.98 Aligned_cols=94 Identities=12% Similarity=0.140 Sum_probs=67.3
Q ss_pred eEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccc
Q 003310 302 MVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNA 381 (832)
Q Consensus 302 ~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a 381 (832)
.|+||++.+++.++.=..|..++.+|...-.|..+|.|+.-+. |.+-..+.. . -.++..-|-..+-
T Consensus 849 ~vrLye~t~~~eLr~e~~~~~~~~aL~l~v~gdeI~VgDlm~S-itll~y~~~-e------------g~f~evArD~~p~ 914 (1096)
T KOG1897|consen 849 SVRLYEWTTERELRIECNISNPIIALDLQVKGDEIAVGDLMRS-ITLLQYKGD-E------------GNFEEVARDYNPN 914 (1096)
T ss_pred EEEEEEccccceehhhhcccCCeEEEEEEecCcEEEEeeccce-EEEEEEecc-C------------CceEEeehhhCcc
Confidence 6999999999877777788899999999999999999998865 666554432 0 1245555666655
Q ss_pred cEEEEEEccCCCEEEEEeCCCcEEEEecC
Q 003310 382 VIQDISFSDDSNWIMISSSRGTSHLFAIN 410 (832)
Q Consensus 382 ~I~~IaFSpDg~~LAsgS~DgTVhIwdl~ 410 (832)
++..+.+=.|..++ .+-.+|.+.+-...
T Consensus 915 Wmtaveil~~d~yl-gae~~gNlf~v~~d 942 (1096)
T KOG1897|consen 915 WMTAVEILDDDTYL-GAENSGNLFTVRKD 942 (1096)
T ss_pred ceeeEEEecCceEE-eecccccEEEEEec
Confidence 67777776665554 55567776665554
No 427
>COG5422 ROM1 RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms]
Probab=30.62 E-value=2.8e+02 Score=35.18 Aligned_cols=34 Identities=15% Similarity=0.024 Sum_probs=25.1
Q ss_pred EEEcCCEEEEEeCCEEEEEECCCCceEEEEecCC
Q 003310 142 VRCSSRVVAICQAAQVHCFDAATLEIEYAILTNP 175 (832)
Q Consensus 142 V~~S~r~LAVa~~~~I~vwDl~t~~~~~tl~t~~ 175 (832)
.+++-.+|.+-..+-|.|++++|+++++++.++.
T Consensus 1104 FalsypYIlaf~~~fIeIr~ieTgeLI~~ilg~~ 1137 (1175)
T COG5422 1104 FALSYPYILAFEPNFIEIRHIETGELIRCILGHN 1137 (1175)
T ss_pred eeeecceEEEecCceEEEEecccceeeeeeccCc
Confidence 3445444444445679999999999999998873
No 428
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=30.41 E-value=8.5e+02 Score=27.64 Aligned_cols=171 Identities=15% Similarity=0.204 Sum_probs=83.0
Q ss_pred cCCCCeEEEEECC-CCcEEEEeccCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEe
Q 003310 297 ADNVGMVIVRDIV-SKNVIAQFRAHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQ 375 (832)
Q Consensus 297 ~~~~G~V~IwDl~-s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~ 375 (832)
+-+.|.+.+--+. +++ +..+..+-....-|+-++ ..|+-++.. .||++..... -............+|--+
T Consensus 24 TYQagkL~~ig~~~~g~-l~~~~r~F~r~MGl~~~~--~~l~~~t~~----qiw~f~~~~n-~l~~~~~~~~~D~~yvPr 95 (335)
T TIGR03032 24 TYQAGKLFFIGLQPNGE-LDVFERTFPRPMGLAVSP--QSLTLGTRY----QLWRFANVDN-LLPAGQTHPGYDRLYVPR 95 (335)
T ss_pred eeecceEEEEEeCCCCc-EEEEeeccCccceeeeeC--CeEEEEEcc----eeEEcccccc-cccccccCCCCCeEEeee
Confidence 3455666666555 333 444443333334455544 445555432 4788732200 000011111234455555
Q ss_pred ccCcccc--EEEEEEccCCCEEEEEeCCCcEEEEecCCCCCceeeccCCCCcccccCCcccccccCC--CCCCCCCCCCc
Q 003310 376 RGLTNAV--IQDISFSDDSNWIMISSSRGTSHLFAINPLGGSVNFQPTDANFTTKHGAMAKSGVRWP--PNLGLQMPNQQ 451 (832)
Q Consensus 376 rG~t~a~--I~~IaFSpDg~~LAsgS~DgTVhIwdl~~~g~~~~~~~H~~~~~~~~~~~~~~~~r~~--~~s~~~~~~~~ 451 (832)
..++... |.+|+| .++..+.+-+ .+.+..++....+=.+ +|- -++.+.--|.=
T Consensus 96 ~~~~TGdidiHdia~-~~~~l~fVNT-----------~fSCLatl~~~~SF~P-----------~WkPpFIs~la~eDRC 152 (335)
T TIGR03032 96 ASYVTGDIDAHDLAL-GAGRLLFVNT-----------LFSCLATVSPDYSFVP-----------LWKPPFISKLAPEDRC 152 (335)
T ss_pred eeeeccCcchhheee-cCCcEEEEEC-----------cceeEEEECCCCcccc-----------ccCCccccccCccCce
Confidence 5555444 778889 4554444433 2445555543211111 122 12322211222
Q ss_pred ccc----cCCCCeeeeeceEEEcCCCCCCccccccchhccCcc-cCCCcceeeeeecc
Q 003310 452 SLC----ASGPPVTLSVVSRIRNGNNGWRGTVSGAAAAATGRV-SSLSGAIASSFHNC 504 (832)
Q Consensus 452 ~l~----~~~~p~~ls~v~~I~~~~~~~~~~v~~~~~~a~g~~-~~~~g~~~~~~h~~ 504 (832)
+|+ ..+.|.-++++++=..+. ||+..-.+. |-+ ...++.+-+....+
T Consensus 153 HLNGlA~~~g~p~yVTa~~~sD~~~-gWR~~~~~g-----G~vidv~s~evl~~GLsm 204 (335)
T TIGR03032 153 HLNGMALDDGEPRYVTALSQSDVAD-GWREGRRDG-----GCVIDIPSGEVVASGLSM 204 (335)
T ss_pred eecceeeeCCeEEEEEEeeccCCcc-cccccccCC-----eEEEEeCCCCEEEcCccC
Confidence 332 345677788888766655 998876432 344 77777777777766
No 429
>PF12341 DUF3639: Protein of unknown function (DUF3639) ; InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=29.43 E-value=1e+02 Score=22.07 Aligned_cols=24 Identities=29% Similarity=0.725 Sum_probs=20.3
Q ss_pred EEEEEEccCCCEEEEEeCCCcEEEEe
Q 003310 383 IQDISFSDDSNWIMISSSRGTSHLFA 408 (832)
Q Consensus 383 I~~IaFSpDg~~LAsgS~DgTVhIwd 408 (832)
|.+|+-++ +|+|++++.+-++||.
T Consensus 4 i~aia~g~--~~vavaTS~~~lRifs 27 (27)
T PF12341_consen 4 IEAIAAGD--SWVAVATSAGYLRIFS 27 (27)
T ss_pred EEEEEccC--CEEEEEeCCCeEEecC
Confidence 77788776 4999999999999983
No 430
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=29.14 E-value=1.3e+02 Score=21.15 Aligned_cols=25 Identities=24% Similarity=0.184 Sum_probs=19.5
Q ss_pred CEEEEEeCCEEEEEECCCCceEEEE
Q 003310 147 RVVAICQAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 147 r~LAVa~~~~I~vwDl~t~~~~~tl 171 (832)
.+++...++.++.+|+.+++.+++.
T Consensus 8 ~v~~~~~~g~l~a~d~~~G~~~W~~ 32 (33)
T smart00564 8 TVYVGSTDGTLYALDAKTGEILWTY 32 (33)
T ss_pred EEEEEcCCCEEEEEEcccCcEEEEc
Confidence 4555567899999999999877653
No 431
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=27.79 E-value=1.1e+02 Score=35.86 Aligned_cols=48 Identities=10% Similarity=0.179 Sum_probs=33.5
Q ss_pred CCCEEEEEECCCCcEEEEEeCCC---CEEEEEEc--C----CEEEEEeCCEEEEEEC
Q 003310 115 VPTVVHFYSLRSQSYVHMLKFRS---PIYSVRCS--S----RVVAICQAAQVHCFDA 162 (832)
Q Consensus 115 ~~~tVrlWDL~Tg~~V~tL~f~s---~V~sV~~S--~----r~LAVa~~~~I~vwDl 162 (832)
.-+++.|||+++++.+++|.+.. -+.-|+|. + -++.+++..+|..|--
T Consensus 220 yG~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k 276 (461)
T PF05694_consen 220 YGHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSSSIWRFYK 276 (461)
T ss_dssp S--EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--EEEEEEEE
T ss_pred ccCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccceEEEEEE
Confidence 34789999999999999999963 46789993 2 3566678888887754
No 432
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=27.53 E-value=1.5e+02 Score=27.12 Aligned_cols=45 Identities=11% Similarity=0.181 Sum_probs=29.6
Q ss_pred ccCCCCeEEEEECCCCcEEEEeccCCCCeEEEEEcCCCCEEEEEEc
Q 003310 296 DADNVGMVIVRDIVSKNVIAQFRAHKSPISALCFDPSGILLVTASV 341 (832)
Q Consensus 296 s~~~~G~V~IwDl~s~~~l~~~~aH~~pIs~LaFSPdG~lLATaS~ 341 (832)
.+...|.+.-||..+++.-..+.+- .--+.|++++|++.|+.+-.
T Consensus 32 e~~~~GRll~ydp~t~~~~vl~~~L-~fpNGVals~d~~~vlv~Et 76 (89)
T PF03088_consen 32 EGRPTGRLLRYDPSTKETTVLLDGL-YFPNGVALSPDESFVLVAET 76 (89)
T ss_dssp HT---EEEEEEETTTTEEEEEEEEE-SSEEEEEE-TTSSEEEEEEG
T ss_pred cCCCCcCEEEEECCCCeEEEehhCC-CccCeEEEcCCCCEEEEEec
Confidence 4567789999999998764444432 23478999999997766644
No 433
>KOG0183 consensus 20S proteasome, regulatory subunit alpha type PSMA7/PRE6 [Posttranslational modification, protein turnover, chaperones]
Probab=27.50 E-value=37 Score=35.69 Aligned_cols=15 Identities=47% Similarity=0.740 Sum_probs=12.6
Q ss_pred cEEEEcCCCcEEEEe
Q 003310 522 HLLVFSPSGCMIQYA 536 (832)
Q Consensus 522 ~Llv~s~~G~l~~y~ 536 (832)
-|-||||||||+|-+
T Consensus 7 altvFSPDGhL~QVE 21 (249)
T KOG0183|consen 7 ALTVFSPDGHLFQVE 21 (249)
T ss_pred ceEEECCCCCEEeeH
Confidence 388999999999844
No 434
>KOG2280 consensus Vacuolar assembly/sorting protein VPS16 [Intracellular trafficking, secretion, and vesicular transport]
Probab=27.12 E-value=8.1e+02 Score=30.80 Aligned_cols=47 Identities=9% Similarity=0.294 Sum_probs=38.2
Q ss_pred CEEEEEECCCCcEEEEEeC-CCCEEEEEEc--CCEEEEEeCCEEEEEECCC
Q 003310 117 TVVHFYSLRSQSYVHMLKF-RSPIYSVRCS--SRVVAICQAAQVHCFDAAT 164 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f-~s~V~sV~~S--~r~LAVa~~~~I~vwDl~t 164 (832)
-.|+||++ +|+.+.++.. +..+..+.|+ ..+|+|.-+++++||++.-
T Consensus 64 ~~I~If~~-sG~lL~~~~w~~~~lI~mgWs~~eeLI~v~k~g~v~Vy~~~g 113 (829)
T KOG2280|consen 64 PYIRIFNI-SGQLLGRILWKHGELIGMGWSDDEELICVQKDGTVHVYGLLG 113 (829)
T ss_pred eeEEEEec-cccchHHHHhcCCCeeeecccCCceEEEEeccceEEEeecch
Confidence 46999998 6887777765 5588889997 4788899999999999863
No 435
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=26.94 E-value=2e+02 Score=30.52 Aligned_cols=44 Identities=9% Similarity=0.332 Sum_probs=31.8
Q ss_pred CCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEEeCC
Q 003310 343 GHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMISSSR 401 (832)
Q Consensus 343 Gt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsgS~D 401 (832)
|..|.+|++... ....+.+|. |-..|..++++.-|.+|++-=.+
T Consensus 37 g~~Vev~~l~~~------------~~~~~~~F~---Tv~~V~~l~y~~~GDYlvTlE~k 80 (215)
T PF14761_consen 37 GCKVEVYDLEQE------------ECPLLCTFS---TVGRVLQLVYSEAGDYLVTLEEK 80 (215)
T ss_pred CCEEEEEEcccC------------CCceeEEEc---chhheeEEEeccccceEEEEEee
Confidence 556999999843 134566663 33569999999999999986443
No 436
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=26.56 E-value=7.2e+02 Score=28.09 Aligned_cols=54 Identities=24% Similarity=0.315 Sum_probs=37.7
Q ss_pred CCCCeEEEEECCCCcEEEEeccCCCCe---EEEEEc------CCCCEEEEEEcCCCEEEEEeCCCC
Q 003310 298 DNVGMVIVRDIVSKNVIAQFRAHKSPI---SALCFD------PSGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 298 ~~~G~V~IwDl~s~~~l~~~~aH~~pI---s~LaFS------PdG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
..-|.|-++|+. ++.++.|. +.++. ..|+.. .+|.+|+-=--||+ |++||..++
T Consensus 219 ~G~G~VdvFd~~-G~l~~r~a-s~g~LNaPWG~a~APa~FG~~sg~lLVGNFGDG~-InaFD~~sG 281 (336)
T TIGR03118 219 AGLGYVNVFTLN-GQLLRRVA-SSGRLNAPWGLAIAPESFGSLSGALLVGNFGDGT-INAYDPQSG 281 (336)
T ss_pred CCcceEEEEcCC-CcEEEEec-cCCcccCCceeeeChhhhCCCCCCeEEeecCCce-eEEecCCCC
Confidence 345799999975 55667763 33322 335553 47888887777998 999998876
No 437
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=26.03 E-value=2.1e+02 Score=33.17 Aligned_cols=20 Identities=35% Similarity=0.431 Sum_probs=16.0
Q ss_pred CeEEEEEcCCCCEEEEEEcC
Q 003310 323 PISALCFDPSGILLVTASVQ 342 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS~d 342 (832)
.=..|.|+|||+|+++.+..
T Consensus 178 ~g~~l~f~pDG~Lyvs~G~~ 197 (399)
T COG2133 178 FGGRLVFGPDGKLYVTTGSN 197 (399)
T ss_pred CcccEEECCCCcEEEEeCCC
Confidence 44689999999988886654
No 438
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=25.07 E-value=9.4e+02 Score=26.42 Aligned_cols=55 Identities=7% Similarity=0.158 Sum_probs=40.5
Q ss_pred CEEEEEECCCCcEEEEEeCCCCEEE--EEEc-CCEEEEE-eCCEEEEEECCCCceEEEE
Q 003310 117 TVVHFYSLRSQSYVHMLKFRSPIYS--VRCS-SRVVAIC-QAAQVHCFDAATLEIEYAI 171 (832)
Q Consensus 117 ~tVrlWDL~Tg~~V~tL~f~s~V~s--V~~S-~r~LAVa-~~~~I~vwDl~t~~~~~tl 171 (832)
..|+.+|+.||+.+....++...++ |..- .++.... .++...+||..|++.+.++
T Consensus 68 S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLTWk~~~~f~yd~~tl~~~~~~ 126 (264)
T PF05096_consen 68 SSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLTWKEGTGFVYDPNTLKKIGTF 126 (264)
T ss_dssp EEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEESSSSEEEEEETTTTEEEEEE
T ss_pred EEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEEecCCeEEEEccccceEEEEE
Confidence 6899999999999999998877653 3333 3444444 5779999999999987776
No 439
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=24.42 E-value=1.9e+02 Score=32.65 Aligned_cols=17 Identities=6% Similarity=-0.103 Sum_probs=14.2
Q ss_pred EEEEEEccCCCEEEEEe
Q 003310 383 IQDISFSDDSNWIMISS 399 (832)
Q Consensus 383 I~~IaFSpDg~~LAsgS 399 (832)
.+.++|+|+|+++++-.
T Consensus 186 p~Gl~~d~~G~l~~tdn 202 (367)
T TIGR02604 186 PYGHSVDSWGDVFFCDN 202 (367)
T ss_pred CccceECCCCCEEEEcc
Confidence 67899999999987644
No 440
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=24.16 E-value=4.4e+02 Score=29.45 Aligned_cols=80 Identities=16% Similarity=0.157 Sum_probs=43.6
Q ss_pred cEEEEeccCCCCeEEEEEcC-------CCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEE
Q 003310 312 NVIAQFRAHKSPISALCFDP-------SGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQ 384 (832)
Q Consensus 312 ~~l~~~~aH~~pIs~LaFSP-------dG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~ 384 (832)
.++..+.+|. .++.+.|-+ .|.+|++ ...+..|...++.+. ........+- +....++.
T Consensus 244 ~P~~~~~~~~-ap~G~~~y~g~~fp~~~g~~~~~-~~~~~~i~~~~~~~~-----------~~~~~~~~~~-~~~~~r~~ 309 (331)
T PF07995_consen 244 PPVFAYPPHS-APTGIIFYRGSAFPEYRGDLFVA-DYGGGRIWRLDLDED-----------GSVTEEEEFL-GGFGGRPR 309 (331)
T ss_dssp --SEEETTT---EEEEEEE-SSSSGGGTTEEEEE-ETTTTEEEEEEEETT-----------EEEEEEEEEC-TTSSS-EE
T ss_pred ccceeecCcc-ccCceEEECCccCccccCcEEEe-cCCCCEEEEEeeecC-----------CCccceEEcc-ccCCCCce
Confidence 4666777774 456677764 3445554 444443544455432 0122223332 22233699
Q ss_pred EEEEccCCCEEEEEeCCCcEE
Q 003310 385 DISFSDDSNWIMISSSRGTSH 405 (832)
Q Consensus 385 ~IaFSpDg~~LAsgS~DgTVh 405 (832)
+|++.|||.+.++...+|.|.
T Consensus 310 ~v~~~pDG~Lyv~~d~~G~iy 330 (331)
T PF07995_consen 310 DVAQGPDGALYVSDDSDGKIY 330 (331)
T ss_dssp EEEEETTSEEEEEE-TTTTEE
T ss_pred EEEEcCCCeEEEEECCCCeEe
Confidence 999999999998888888763
No 441
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=23.57 E-value=4.3e+02 Score=29.85 Aligned_cols=62 Identities=21% Similarity=0.312 Sum_probs=33.8
Q ss_pred CeEEEEEcCCCCEEEEEE-----------cCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccC
Q 003310 323 PISALCFDPSGILLVTAS-----------VQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDD 391 (832)
Q Consensus 323 pIs~LaFSPdG~lLATaS-----------~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpD 391 (832)
....|+|+++|+++++-. ..|..|.+++-..+ .+.......+..+.. ....|+|.++
T Consensus 15 ~P~~ia~d~~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dg----------dG~~d~~~vfa~~l~--~p~Gi~~~~~ 82 (367)
T TIGR02604 15 NPIAVCFDERGRLWVAEGITYSRPAGRQGPLGDRILILEDADG----------DGKYDKSNVFAEELS--MVTGLAVAVG 82 (367)
T ss_pred CCceeeECCCCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCC----------CCCcceeEEeecCCC--CccceeEecC
Confidence 346789999999887753 22323566654332 011112222322322 2578999999
Q ss_pred CCEEEE
Q 003310 392 SNWIMI 397 (832)
Q Consensus 392 g~~LAs 397 (832)
| .+++
T Consensus 83 G-lyV~ 87 (367)
T TIGR02604 83 G-VYVA 87 (367)
T ss_pred C-EEEe
Confidence 9 4443
No 442
>PF14269 Arylsulfotran_2: Arylsulfotransferase (ASST)
Probab=22.42 E-value=2.8e+02 Score=30.76 Aligned_cols=71 Identities=24% Similarity=0.342 Sum_probs=45.6
Q ss_pred eEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCcccc----EEEEEEccCCCEEEEEe
Q 003310 324 ISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAV----IQDISFSDDSNWIMISS 399 (832)
Q Consensus 324 Is~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~----I~~IaFSpDg~~LAsgS 399 (832)
|++|...++|.+|+|+-.-. .|-+.|-.++ .-++.| .|.+... -...+|-+|-+++-.+.
T Consensus 146 iNsV~~~~~G~yLiS~R~~~-~i~~I~~~tG--------------~I~W~l-gG~~~~df~~~~~~f~~QHdar~~~~~~ 209 (299)
T PF14269_consen 146 INSVDKDDDGDYLISSRNTS-TIYKIDPSTG--------------KIIWRL-GGKRNSDFTLPATNFSWQHDARFLNESN 209 (299)
T ss_pred eeeeeecCCccEEEEecccC-EEEEEECCCC--------------cEEEEe-CCCCCCcccccCCcEeeccCCEEeccCC
Confidence 56668888999998875543 3555555554 455665 2321111 11356667888887778
Q ss_pred CCCcEEEEecC
Q 003310 400 SRGTSHLFAIN 410 (832)
Q Consensus 400 ~DgTVhIwdl~ 410 (832)
.+++|.|||=.
T Consensus 210 ~~~~IslFDN~ 220 (299)
T PF14269_consen 210 DDGTISLFDNA 220 (299)
T ss_pred CCCEEEEEcCC
Confidence 89999999973
No 443
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.09 E-value=3.8e+02 Score=29.95 Aligned_cols=84 Identities=13% Similarity=0.275 Sum_probs=56.9
Q ss_pred cCCCCeEEEEEcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCceeEEEEEeccCccccEEEEEEccCCCEEEEE
Q 003310 319 AHKSPISALCFDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSYVHLYRLQRGLTNAVIQDISFSDDSNWIMIS 398 (832)
Q Consensus 319 aH~~pIs~LaFSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~~~l~~l~rG~t~a~I~~IaFSpDg~~LAsg 398 (832)
+-+..|++|.|+|+.+.|.+.-.....| ||=...| +......| .|. +.--.|++.-+|+++++-
T Consensus 83 g~~~nvS~LTynp~~rtLFav~n~p~~i-VElt~~G------------dlirtiPL-~g~--~DpE~Ieyig~n~fvi~d 146 (316)
T COG3204 83 GETANVSSLTYNPDTRTLFAVTNKPAAI-VELTKEG------------DLIRTIPL-TGF--SDPETIEYIGGNQFVIVD 146 (316)
T ss_pred cccccccceeeCCCcceEEEecCCCceE-EEEecCC------------ceEEEecc-ccc--CChhHeEEecCCEEEEEe
Confidence 3345599999999988887776666633 4433333 22222233 233 235678899999999888
Q ss_pred eCCCcEEEEecCCCCCceee
Q 003310 399 SSRGTSHLFAINPLGGSVNF 418 (832)
Q Consensus 399 S~DgTVhIwdl~~~g~~~~~ 418 (832)
=.++++.++.+.+......+
T Consensus 147 ER~~~l~~~~vd~~t~~~~~ 166 (316)
T COG3204 147 ERDRALYLFTVDADTTVISA 166 (316)
T ss_pred hhcceEEEEEEcCCccEEec
Confidence 88999999999887655444
No 444
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=22.08 E-value=44 Score=41.91 Aligned_cols=98 Identities=20% Similarity=0.260 Sum_probs=55.4
Q ss_pred CCCeEEEEECC--CCcEEEEecc-----CCCCeEEEE---EcCCCCEEEEEEcCCCEEEEEeCCCCCCCCCCccCCCCce
Q 003310 299 NVGMVIVRDIV--SKNVIAQFRA-----HKSPISALC---FDPSGILLVTASVQGHNINIFKIIPGILGTSSACDAGTSY 368 (832)
Q Consensus 299 ~~G~V~IwDl~--s~~~l~~~~a-----H~~pIs~La---FSPdG~lLATaS~dGt~I~Iwdi~t~~~~~~s~~~~~~~~ 368 (832)
..|..-|||+. .|+....+.- -..++.-+. |-++.-++..+..+|+ |++-.+.+.
T Consensus 151 ~vg~lfVy~vd~l~G~iq~~l~v~~~~p~gs~~~~V~wcp~~~~~~~ic~~~~~~~-i~lL~~~ra-------------- 215 (1283)
T KOG1916|consen 151 LVGELFVYDVDVLQGEIQPQLEVTPITPYGSDPQLVSWCPIAVNKVYICYGLKGGE-IRLLNINRA-------------- 215 (1283)
T ss_pred HhhhhheeehHhhccccccceEEeecCcCCCCcceeeecccccccceeeeccCCCc-eeEeeechH--------------
Confidence 45778888875 3443333322 223333333 3345555555555565 677655443
Q ss_pred eEEEEEeccCccccEEEE-----------EEccCCCEEEEEeCCCcEEEEecCCCCC
Q 003310 369 VHLYRLQRGLTNAVIQDI-----------SFSDDSNWIMISSSRGTSHLFAINPLGG 414 (832)
Q Consensus 369 ~~l~~l~rG~t~a~I~~I-----------aFSpDg~~LAsgS~DgTVhIwdl~~~g~ 414 (832)
+..+-|+|.. .+.++ ..||||+.||.++.||.++.|.+--+|.
T Consensus 216 --~~~l~rsHs~-~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~v~f~Qiyi~g~ 269 (1283)
T KOG1916|consen 216 --LRSLFRSHSQ-RVTDMAFFAEGVLKLASLSPDGTVFAWAISDGSVGFYQIYITGK 269 (1283)
T ss_pred --HHHHHHhcCC-CcccHHHHhhchhhheeeCCCCcEEEEeecCCccceeeeeeecc
Confidence 0123344322 12222 2799999999999999999888754433
No 445
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=20.99 E-value=2.1e+02 Score=23.85 Aligned_cols=32 Identities=13% Similarity=0.151 Sum_probs=24.2
Q ss_pred EEEEEEccCCCEEEEEeC-----CCcEEEEecCCCCC
Q 003310 383 IQDISFSDDSNWIMISSS-----RGTSHLFAINPLGG 414 (832)
Q Consensus 383 I~~IaFSpDg~~LAsgS~-----DgTVhIwdl~~~g~ 414 (832)
+++++.-||||+|++|.. +....|+.+++-|.
T Consensus 3 ~~~~~~q~DGkIlv~G~~~~~~~~~~~~l~Rln~DGs 39 (55)
T TIGR02608 3 AYAVAVQSDGKILVAGYVDNSSGNNDFVLARLNADGS 39 (55)
T ss_pred eEEEEECCCCcEEEEEEeecCCCcccEEEEEECCCCC
Confidence 578899999999999964 33566777776554
No 446
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins.; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=20.97 E-value=1.7e+02 Score=34.83 Aligned_cols=31 Identities=19% Similarity=0.319 Sum_probs=23.3
Q ss_pred CeEEEEEcC----CCCEEEEEEcCCCEEEEEeCCCC
Q 003310 323 PISALCFDP----SGILLVTASVQGHNINIFKIIPG 354 (832)
Q Consensus 323 pIs~LaFSP----dG~lLATaS~dGt~I~Iwdi~t~ 354 (832)
.+.++++++ +-.+|+|.+.|++ +||||+.++
T Consensus 216 ~~~~~~~~~~~~~~~~~l~tl~~D~~-LRiW~l~t~ 250 (547)
T PF11715_consen 216 VAASLAVSSSEINDDTFLFTLSRDHT-LRIWSLETG 250 (547)
T ss_dssp -EEEEEE-----ETTTEEEEEETTSE-EEEEETTTT
T ss_pred ccceEEEecceeCCCCEEEEEeCCCe-EEEEECCCC
Confidence 445566666 6779999999987 999999987
No 447
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller. The NHL repeats are also found in serine/threonine protein kinases (STPK) in a diverse range of pathogenic bacteria. These STPK are transmembrane receptors with an intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed beta-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=20.28 E-value=1.7e+02 Score=20.46 Aligned_cols=25 Identities=16% Similarity=0.295 Sum_probs=20.1
Q ss_pred EEEEEEccCCCEEEEEeCCCcEEEE
Q 003310 383 IQDISFSDDSNWIMISSSRGTSHLF 407 (832)
Q Consensus 383 I~~IaFSpDg~~LAsgS~DgTVhIw 407 (832)
..+|+++++|+.+++=+....|.+|
T Consensus 4 P~gvav~~~g~i~VaD~~n~rV~vf 28 (28)
T PF01436_consen 4 PHGVAVDSDGNIYVADSGNHRVQVF 28 (28)
T ss_dssp EEEEEEETTSEEEEEECCCTEEEEE
T ss_pred CcEEEEeCCCCEEEEECCCCEEEEC
Confidence 4678899999988888777777775
No 448
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=20.20 E-value=4.4e+02 Score=31.18 Aligned_cols=53 Identities=19% Similarity=0.179 Sum_probs=27.2
Q ss_pred CeEEEEECCCCcEEEEeccCCCC---eEEEEEcCCCCEEEEEE----------------------cCCCEEEEEeCCCC
Q 003310 301 GMVIVRDIVSKNVIAQFRAHKSP---ISALCFDPSGILLVTAS----------------------VQGHNINIFKIIPG 354 (832)
Q Consensus 301 G~V~IwDl~s~~~l~~~~aH~~p---Is~LaFSPdG~lLATaS----------------------~dGt~I~Iwdi~t~ 354 (832)
..+..+|+. |+++..+..-... =..+.+-|+|.+|+.+. .+|.++..||+...
T Consensus 167 ~~~~e~D~~-G~v~~~~~l~~~~~~~HHD~~~l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd~tG~vv~~wd~~d~ 244 (477)
T PF05935_consen 167 NRLYEIDLL-GKVIWEYDLPGGYYDFHHDIDELPNGNLLILASETKYVDEDKDVDTVEDVIVEVDPTGEVVWEWDFFDH 244 (477)
T ss_dssp TEEEEE-TT---EEEEEE--TTEE-B-S-EEE-TTS-EEEEEEETTEE-TS-EE---S-EEEEE-TTS-EEEEEEGGGT
T ss_pred CceEEEcCC-CCEEEeeecCCcccccccccEECCCCCEEEEEeecccccCCCCccEecCEEEEECCCCCEEEEEehHHh
Confidence 567777775 4444444422211 13467889999988877 35666777776553
Done!