Query 000425
Match_columns 1525
No_of_seqs 525 out of 2767
Neff 5.5
Searched_HMMs 46136
Date Fri Mar 29 08:11:57 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/000425.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/000425hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0292 Vesicle coat complex C 100.0 6E-166 1E-170 1475.0 68.8 1137 9-1517 12-1197(1202)
2 KOG0276 Vesicle coat complex C 100.0 8E-107 2E-111 945.2 47.6 695 6-903 13-773 (794)
3 PF04053 Coatomer_WDAD: Coatom 100.0 9.1E-71 2E-75 661.9 30.5 408 375-893 1-443 (443)
4 PF06957 COPI_C: Coatomer (COP 100.0 9.3E-71 2E-75 647.8 13.1 233 1268-1520 173-415 (422)
5 KOG0276 Vesicle coat complex C 100.0 2.7E-33 5.8E-38 329.6 36.0 628 6-848 97-750 (794)
6 KOG0292 Vesicle coat complex C 99.9 6E-22 1.3E-26 240.7 38.6 244 8-314 53-328 (1202)
7 KOG0271 Notchless-like WD40 re 99.9 8.1E-24 1.8E-28 239.8 20.1 269 6-309 115-441 (480)
8 KOG0272 U4/U6 small nuclear ri 99.9 1.1E-22 2.4E-27 233.5 13.0 214 8-266 219-435 (459)
9 KOG0279 G protein beta subunit 99.9 4.1E-21 8.8E-26 212.0 22.4 246 7-315 16-270 (315)
10 KOG0271 Notchless-like WD40 re 99.9 2.9E-21 6.3E-26 219.2 21.2 173 49-264 113-291 (480)
11 KOG0272 U4/U6 small nuclear ri 99.9 6.9E-21 1.5E-25 218.9 19.2 206 50-314 174-382 (459)
12 KOG0645 WD40 repeat protein [G 99.9 5.7E-20 1.2E-24 202.3 24.0 183 49-267 12-201 (312)
13 KOG0263 Transcription initiati 99.8 2.2E-20 4.7E-25 227.9 17.6 246 4-309 376-651 (707)
14 KOG0291 WD40-repeat-containing 99.8 2.3E-19 4.9E-24 216.6 26.0 160 4-170 305-507 (893)
15 KOG0266 WD40 repeat-containing 99.8 7.8E-19 1.7E-23 215.0 25.0 254 6-314 159-416 (456)
16 KOG0645 WD40 repeat protein [G 99.8 8.2E-19 1.8E-23 193.3 22.4 205 7-247 15-225 (312)
17 KOG0285 Pleiotropic regulator 99.8 1E-18 2.3E-23 197.4 21.3 176 46-264 146-322 (460)
18 KOG0263 Transcription initiati 99.8 5.8E-19 1.3E-23 215.6 17.5 197 7-247 452-649 (707)
19 KOG0266 WD40 repeat-containing 99.8 5.8E-18 1.3E-22 207.4 23.9 157 8-171 205-364 (456)
20 KOG0279 G protein beta subunit 99.8 5E-18 1.1E-22 187.8 19.9 203 11-262 68-275 (315)
21 KOG0285 Pleiotropic regulator 99.8 2E-18 4.3E-23 195.2 17.0 210 8-264 153-363 (460)
22 KOG0286 G-protein beta subunit 99.8 1E-17 2.3E-22 186.1 20.8 214 10-266 101-320 (343)
23 KOG0318 WD40 repeat stress pro 99.8 3.7E-15 8.1E-20 175.5 42.4 156 9-172 106-266 (603)
24 KOG0291 WD40-repeat-containing 99.8 7.5E-17 1.6E-21 195.2 25.8 236 10-306 269-507 (893)
25 KOG0284 Polyadenylation factor 99.8 2.5E-18 5.3E-23 197.3 12.4 196 8-246 182-379 (464)
26 KOG0282 mRNA splicing factor [ 99.8 5.6E-18 1.2E-22 197.8 14.9 240 7-264 215-477 (503)
27 PTZ00421 coronin; Provisional 99.7 1.5E-16 3.2E-21 196.3 25.5 217 45-314 70-297 (493)
28 KOG0315 G-protein beta subunit 99.7 1E-16 2.3E-21 175.0 21.0 265 16-314 7-295 (311)
29 KOG0275 Conserved WD40 repeat- 99.7 5.9E-18 1.3E-22 188.7 9.1 153 10-171 217-378 (508)
30 KOG0284 Polyadenylation factor 99.7 4.9E-18 1.1E-22 194.9 8.2 205 50-314 95-301 (464)
31 KOG0315 G-protein beta subunit 99.7 1.3E-16 2.8E-21 174.3 18.6 210 11-263 45-302 (311)
32 KOG0295 WD40 repeat-containing 99.7 1.4E-16 2.9E-21 181.3 18.7 195 7-245 194-404 (406)
33 KOG0265 U5 snRNP-specific prot 99.7 3.8E-16 8.3E-21 174.4 21.3 251 8-314 48-303 (338)
34 PTZ00421 coronin; Provisional 99.7 7.6E-16 1.7E-20 190.0 25.5 205 7-250 76-293 (493)
35 KOG0286 G-protein beta subunit 99.7 9.3E-16 2E-20 170.8 18.7 188 12-244 151-342 (343)
36 KOG0295 WD40 repeat-containing 99.7 2.4E-16 5.1E-21 179.4 13.9 215 7-266 151-381 (406)
37 KOG0296 Angio-associated migra 99.7 3.8E-15 8.2E-20 169.9 23.2 156 6-171 64-220 (399)
38 cd00200 WD40 WD40 domain, foun 99.7 2.2E-14 4.8E-19 155.4 28.4 234 7-264 10-264 (289)
39 PLN00181 protein SPA1-RELATED; 99.7 5.7E-15 1.2E-19 192.4 27.4 204 9-249 535-740 (793)
40 KOG0277 Peroxisomal targeting 99.7 9.5E-16 2.1E-20 168.1 16.7 201 6-247 60-265 (311)
41 KOG0772 Uncharacterized conser 99.7 7.8E-16 1.7E-20 180.5 16.8 212 9-227 217-473 (641)
42 KOG0640 mRNA cleavage stimulat 99.7 1.6E-15 3.4E-20 169.4 18.4 155 10-170 116-290 (430)
43 KOG0318 WD40 repeat stress pro 99.7 1.4E-14 3.1E-19 170.7 27.0 153 9-171 150-350 (603)
44 KOG0289 mRNA splicing factor [ 99.7 2.3E-15 5.1E-20 174.1 20.0 211 8-264 263-479 (506)
45 KOG0319 WD40-repeat-containing 99.7 1.1E-15 2.4E-20 185.4 17.5 198 8-246 413-618 (775)
46 PTZ00420 coronin; Provisional 99.6 7.7E-15 1.7E-19 182.7 23.6 121 43-172 67-198 (568)
47 KOG0273 Beta-transducin family 99.6 3.7E-15 8E-20 173.9 18.6 142 22-172 332-483 (524)
48 PTZ00420 coronin; Provisional 99.6 1.7E-14 3.8E-19 179.6 25.8 158 7-172 75-249 (568)
49 PLN00181 protein SPA1-RELATED; 99.6 2E-14 4.3E-19 187.5 27.5 248 6-314 483-745 (793)
50 KOG0265 U5 snRNP-specific prot 99.6 4.7E-15 1E-19 165.8 17.7 216 11-242 95-333 (338)
51 KOG0302 Ribosome Assembly prot 99.6 2.5E-15 5.5E-20 171.6 14.9 205 6-227 211-424 (440)
52 KOG0310 Conserved WD40 repeat- 99.6 3E-15 6.6E-20 175.6 14.9 246 7-314 27-275 (487)
53 cd00200 WD40 WD40 domain, foun 99.6 7.3E-14 1.6E-18 151.3 24.7 152 10-171 55-207 (289)
54 KOG0264 Nucleosome remodeling 99.6 5.8E-15 1.2E-19 172.4 16.2 253 3-309 120-406 (422)
55 KOG0274 Cdc4 and related F-box 99.6 1.1E-14 2.5E-19 180.5 19.9 226 23-314 222-448 (537)
56 KOG0293 WD40 repeat-containing 99.6 9.3E-15 2E-19 168.1 17.5 237 9-266 227-487 (519)
57 KOG0283 WD40 repeat-containing 99.6 8.8E-15 1.9E-19 180.7 18.7 248 7-306 268-575 (712)
58 KOG0316 Conserved WD40 repeat- 99.6 1.7E-14 3.6E-19 156.9 18.1 243 8-312 19-262 (307)
59 KOG0278 Serine/threonine kinas 99.6 1.4E-15 3E-20 166.1 9.8 264 8-307 16-297 (334)
60 KOG0273 Beta-transducin family 99.6 2.9E-14 6.2E-19 166.6 20.4 254 5-315 234-490 (524)
61 KOG0281 Beta-TrCP (transducin 99.6 1.4E-15 3E-20 171.5 8.9 194 23-267 211-406 (499)
62 KOG0264 Nucleosome remodeling 99.6 1.5E-14 3.2E-19 169.0 16.8 215 9-248 180-405 (422)
63 KOG0269 WD40 repeat-containing 99.6 3.5E-15 7.5E-20 181.8 11.3 208 19-264 100-312 (839)
64 KOG0300 WD40 repeat-containing 99.6 1.3E-14 2.7E-19 162.3 14.1 212 6-262 148-400 (481)
65 KOG0640 mRNA cleavage stimulat 99.6 2.2E-14 4.9E-19 160.3 15.1 219 7-247 173-426 (430)
66 KOG0313 Microtubule binding pr 99.6 7.3E-14 1.6E-18 160.0 18.8 229 18-306 115-375 (423)
67 KOG0319 WD40-repeat-containing 99.6 2.1E-14 4.4E-19 174.6 15.1 151 10-170 467-618 (775)
68 KOG0313 Microtubule binding pr 99.6 9E-14 2E-18 159.3 19.0 161 7-174 194-379 (423)
69 KOG0643 Translation initiation 99.6 1.7E-13 3.6E-18 151.5 20.3 222 12-243 16-313 (327)
70 KOG1407 WD40 repeat protein [F 99.6 2.1E-13 4.5E-18 150.4 20.3 208 7-227 65-288 (313)
71 KOG0973 Histone transcription 99.5 8.8E-14 1.9E-18 175.5 18.6 156 8-171 15-201 (942)
72 KOG0306 WD40-repeat-containing 99.5 7.4E-14 1.6E-18 169.6 15.5 155 7-171 509-664 (888)
73 KOG1538 Uncharacterized conser 99.5 5.1E-12 1.1E-16 151.4 30.1 140 52-232 13-153 (1081)
74 KOG0647 mRNA export protein (c 99.5 3E-13 6.6E-18 151.6 17.8 119 5-128 26-146 (347)
75 KOG0267 Microtubule severing p 99.5 2.1E-14 4.5E-19 173.9 8.0 167 39-247 59-226 (825)
76 KOG1332 Vesicle coat complex C 99.5 1.5E-13 3.3E-18 150.3 13.7 205 14-254 65-294 (299)
77 KOG0647 mRNA export protein (c 99.5 2.8E-13 6.1E-18 151.8 15.7 114 50-171 26-145 (347)
78 KOG0289 mRNA splicing factor [ 99.5 4.3E-13 9.3E-18 155.6 17.4 211 8-262 221-432 (506)
79 KOG0281 Beta-TrCP (transducin 99.5 4.4E-14 9.6E-19 159.5 8.9 195 50-312 196-393 (499)
80 KOG0302 Ribosome Assembly prot 99.5 4.1E-13 8.9E-18 153.8 16.6 195 13-245 158-376 (440)
81 KOG0310 Conserved WD40 repeat- 99.5 4.1E-13 8.8E-18 158.0 16.8 153 10-171 114-268 (487)
82 KOG0282 mRNA splicing factor [ 99.5 1.8E-13 3.8E-18 160.7 13.7 193 9-244 261-502 (503)
83 KOG0973 Histone transcription 99.5 1.3E-12 2.9E-17 165.0 22.5 174 50-264 12-216 (942)
84 KOG0267 Microtubule severing p 99.5 3E-14 6.6E-19 172.5 6.9 155 8-172 72-227 (825)
85 KOG0277 Peroxisomal targeting 99.5 3.5E-13 7.5E-18 148.2 13.1 155 9-171 107-265 (311)
86 KOG0274 Cdc4 and related F-box 99.5 1.6E-12 3.5E-17 161.6 20.7 151 8-173 251-402 (537)
87 KOG0300 WD40 repeat-containing 99.5 3.7E-13 7.9E-18 150.7 13.4 210 48-314 145-394 (481)
88 KOG0772 Uncharacterized conser 99.5 3.9E-13 8.5E-18 158.2 13.9 217 44-308 161-395 (641)
89 KOG4328 WD40 protein [Function 99.5 1.4E-12 3E-17 152.4 17.8 162 4-171 184-399 (498)
90 KOG0296 Angio-associated migra 99.4 1.2E-11 2.6E-16 141.7 24.5 135 49-190 62-202 (399)
91 KOG0269 WD40 repeat-containing 99.4 4.4E-13 9.6E-18 163.8 13.8 155 9-171 136-296 (839)
92 KOG0275 Conserved WD40 repeat- 99.4 3.1E-13 6.8E-18 151.4 11.4 171 46-259 208-388 (508)
93 KOG1332 Vesicle coat complex C 99.4 1.6E-12 3.5E-17 142.5 16.5 190 45-264 5-210 (299)
94 KOG2048 WD40 repeat protein [G 99.4 7.3E-10 1.6E-14 134.9 40.4 506 8-616 27-557 (691)
95 KOG0308 Conserved WD40 repeat- 99.4 1.1E-12 2.3E-17 158.1 15.5 144 21-172 87-244 (735)
96 KOG0294 WD40 repeat-containing 99.4 2.1E-12 4.5E-17 145.6 16.8 114 49-171 41-157 (362)
97 KOG1036 Mitotic spindle checkp 99.4 5E-12 1.1E-16 142.5 19.6 224 2-249 9-264 (323)
98 KOG0306 WD40-repeat-containing 99.4 3.6E-12 7.7E-17 155.3 19.4 194 9-246 457-663 (888)
99 KOG1274 WD40 repeat protein [G 99.4 3.6E-12 7.8E-17 158.4 19.8 160 10-172 100-263 (933)
100 KOG0293 WD40 repeat-containing 99.4 1.9E-12 4E-17 149.5 15.6 220 6-248 269-514 (519)
101 KOG0301 Phospholipase A2-activ 99.4 1.6E-12 3.4E-17 157.6 14.6 241 23-344 29-273 (745)
102 KOG0308 Conserved WD40 repeat- 99.4 5.5E-13 1.2E-17 160.5 10.2 220 50-315 20-251 (735)
103 KOG0278 Serine/threonine kinas 99.4 1.7E-12 3.7E-17 142.3 12.5 154 9-171 103-297 (334)
104 KOG0305 Anaphase promoting com 99.4 6.1E-12 1.3E-16 152.6 17.2 188 18-249 188-378 (484)
105 KOG0299 U3 snoRNP-associated p 99.4 7.2E-12 1.6E-16 146.8 16.5 227 8-250 144-413 (479)
106 KOG1446 Histone H3 (Lys4) meth 99.4 1E-10 2.2E-15 132.7 24.9 251 7-314 15-269 (311)
107 KOG0268 Sof1-like rRNA process 99.4 1.5E-12 3.3E-17 148.6 9.8 157 6-171 187-345 (433)
108 KOG0299 U3 snoRNP-associated p 99.4 1.7E-11 3.7E-16 143.7 18.6 215 44-321 136-369 (479)
109 KOG1446 Histone H3 (Lys4) meth 99.3 2.7E-10 5.8E-15 129.3 26.8 117 49-172 12-131 (311)
110 KOG0639 Transducin-like enhanc 99.3 4.5E-12 9.8E-17 148.6 13.0 161 6-175 417-585 (705)
111 KOG0283 WD40 repeat-containing 99.3 1.5E-11 3.3E-16 152.5 17.9 143 20-171 381-532 (712)
112 KOG0316 Conserved WD40 repeat- 99.3 7.6E-12 1.6E-16 136.5 13.0 120 45-172 12-132 (307)
113 KOG0305 Anaphase promoting com 99.3 2.4E-11 5.2E-16 147.4 18.1 155 6-171 217-376 (484)
114 KOG1538 Uncharacterized conser 99.3 1.2E-09 2.6E-14 131.6 30.4 155 10-171 16-211 (1081)
115 KOG4283 Transcription-coupled 99.3 4.5E-11 9.7E-16 133.7 17.3 162 3-171 40-219 (397)
116 KOG0301 Phospholipase A2-activ 99.3 1.9E-11 4E-16 148.6 15.4 134 22-171 115-249 (745)
117 KOG2048 WD40 repeat protein [G 99.3 2.8E-09 6.1E-14 129.9 33.4 163 4-171 108-275 (691)
118 KOG2919 Guanine nucleotide-bin 99.3 3.5E-11 7.6E-16 136.0 15.7 184 9-227 161-356 (406)
119 KOG1539 WD repeat protein [Gen 99.3 2.6E-11 5.7E-16 149.4 15.3 155 8-171 450-606 (910)
120 PF08662 eIF2A: Eukaryotic tra 99.3 2.9E-10 6.2E-15 125.0 21.6 154 7-171 6-179 (194)
121 KOG0294 WD40 repeat-containing 99.3 5.2E-11 1.1E-15 134.6 15.2 149 17-173 52-283 (362)
122 KOG0268 Sof1-like rRNA process 99.3 2.6E-11 5.7E-16 138.7 12.5 222 7-248 67-346 (433)
123 KOG1524 WD40 repeat-containing 99.2 1.3E-09 2.9E-14 129.2 26.2 132 27-171 83-216 (737)
124 KOG0642 Cell-cycle nuclear pro 99.2 4.9E-11 1.1E-15 142.7 13.8 118 49-171 292-426 (577)
125 KOG2096 WD40 repeat protein [G 99.2 4.4E-10 9.5E-15 127.0 20.3 161 4-172 130-309 (420)
126 KOG0303 Actin-binding protein 99.2 4.2E-11 9.1E-16 138.1 11.2 117 48-172 78-204 (472)
127 KOG0650 WD40 repeat nucleolar 99.2 3.6E-11 7.9E-16 143.9 10.4 155 5-168 565-732 (733)
128 KOG0641 WD40 repeat protein [G 99.2 1E-09 2.2E-14 119.2 20.1 160 8-172 138-304 (350)
129 KOG1274 WD40 repeat protein [G 99.2 6.6E-10 1.4E-14 138.8 21.2 153 8-171 15-168 (933)
130 KOG2445 Nuclear pore complex c 99.2 9.1E-10 2E-14 124.4 19.6 204 46-270 8-232 (361)
131 KOG1407 WD40 repeat protein [F 99.2 3.4E-10 7.3E-15 125.5 15.7 154 7-171 107-261 (313)
132 KOG0646 WD40 repeat protein [G 99.2 2.7E-10 5.8E-15 134.1 15.1 117 50-171 80-206 (476)
133 KOG4283 Transcription-coupled 99.2 5.8E-10 1.3E-14 125.0 16.9 154 10-171 105-276 (397)
134 KOG2096 WD40 repeat protein [G 99.2 9.1E-10 2E-14 124.5 18.5 124 45-171 81-217 (420)
135 KOG0288 WD40 repeat protein Ti 99.2 2.7E-10 5.9E-15 132.3 14.5 210 8-262 221-430 (459)
136 KOG1063 RNA polymerase II elon 99.2 1.2E-10 2.6E-15 141.6 11.9 120 45-171 520-648 (764)
137 KOG0288 WD40 repeat protein Ti 99.2 4.4E-10 9.5E-15 130.6 15.6 140 19-168 312-458 (459)
138 TIGR03866 PQQ_ABC_repeats PQQ- 99.1 6.7E-09 1.4E-13 117.0 24.8 153 9-172 33-188 (300)
139 KOG1273 WD40 repeat protein [G 99.1 1.3E-09 2.8E-14 123.2 18.4 163 9-176 26-231 (405)
140 KOG0321 WD40 repeat-containing 99.1 6.3E-10 1.4E-14 134.5 15.7 161 9-172 103-302 (720)
141 KOG1539 WD repeat protein [Gen 99.1 1.2E-08 2.6E-13 126.6 26.8 153 10-171 162-318 (910)
142 KOG0321 WD40 repeat-containing 99.1 1E-09 2.3E-14 132.6 17.3 153 15-172 60-249 (720)
143 KOG2106 Uncharacterized conser 99.1 3.1E-09 6.6E-14 125.8 20.7 188 4-232 198-427 (626)
144 KOG0646 WD40 repeat protein [G 99.1 2.1E-09 4.5E-14 126.7 19.1 157 9-171 126-307 (476)
145 KOG1408 WD40 repeat protein [F 99.1 8.5E-10 1.8E-14 133.8 16.0 281 9-323 327-685 (1080)
146 KOG1034 Transcriptional repres 99.1 5.9E-10 1.3E-14 126.7 13.6 147 17-171 53-211 (385)
147 KOG0270 WD40 repeat-containing 99.1 8E-10 1.7E-14 129.5 14.9 152 12-171 249-404 (463)
148 KOG1009 Chromatin assembly com 99.1 2.8E-09 6E-14 124.0 19.2 156 9-171 16-195 (434)
149 KOG1445 Tumor-specific antigen 99.1 2.5E-10 5.4E-15 136.7 10.4 115 51-172 627-751 (1012)
150 KOG1036 Mitotic spindle checkp 99.1 5.4E-10 1.2E-14 126.4 12.2 114 50-172 12-125 (323)
151 KOG1063 RNA polymerase II elon 99.1 1.1E-07 2.5E-12 116.3 32.1 197 12-245 151-389 (764)
152 KOG0641 WD40 repeat protein [G 99.1 3.7E-09 8.1E-14 114.8 17.2 158 5-170 180-348 (350)
153 KOG1007 WD repeat protein TSSC 99.1 9.6E-10 2.1E-14 123.1 13.0 155 8-171 125-289 (370)
154 KOG0303 Actin-binding protein 99.1 2.6E-09 5.6E-14 123.7 16.2 157 10-173 85-251 (472)
155 KOG2445 Nuclear pore complex c 99.0 7.8E-09 1.7E-13 117.0 18.6 236 6-262 59-334 (361)
156 KOG1034 Transcriptional repres 99.0 6.7E-10 1.5E-14 126.2 10.1 144 18-171 105-279 (385)
157 TIGR03866 PQQ_ABC_repeats PQQ- 99.0 8E-09 1.7E-13 116.4 18.8 156 10-171 118-279 (300)
158 KOG2106 Uncharacterized conser 99.0 8.9E-09 1.9E-13 122.0 19.0 152 6-170 368-520 (626)
159 KOG0290 Conserved WD40 repeat- 99.0 6.7E-09 1.4E-13 116.7 16.5 190 13-232 157-357 (364)
160 KOG1188 WD40 repeat protein [G 99.0 2.2E-09 4.7E-14 122.8 12.9 145 19-171 41-196 (376)
161 KOG0643 Translation initiation 99.0 2.7E-08 5.8E-13 111.0 20.8 181 45-262 5-191 (327)
162 KOG2055 WD40 repeat protein [G 99.0 9.9E-09 2.1E-13 120.9 18.2 157 10-174 261-420 (514)
163 KOG0644 Uncharacterized conser 99.0 4.6E-10 1E-14 138.2 7.5 112 49-171 188-300 (1113)
164 KOG1007 WD repeat protein TSSC 99.0 7E-09 1.5E-13 116.4 15.9 161 7-175 64-249 (370)
165 KOG0650 WD40 repeat nucleolar 99.0 2.5E-09 5.3E-14 128.6 12.6 205 11-244 526-732 (733)
166 KOG0307 Vesicle coat complex C 99.0 1.4E-09 3E-14 138.8 10.2 203 8-249 66-286 (1049)
167 KOG2394 WD40 protein DMR-N9 [G 99.0 4.5E-09 9.8E-14 125.3 13.4 137 7-151 220-384 (636)
168 COG2319 FOG: WD40 repeat [Gene 99.0 3E-08 6.4E-13 112.1 19.6 154 10-172 113-272 (466)
169 KOG0649 WD40 repeat protein [G 98.9 1.9E-08 4.2E-13 110.8 16.9 154 8-172 12-187 (325)
170 KOG0642 Cell-cycle nuclear pro 98.9 6.3E-09 1.4E-13 125.0 13.9 198 7-246 295-560 (577)
171 COG2319 FOG: WD40 repeat [Gene 98.9 5.1E-08 1.1E-12 110.1 20.0 153 10-171 159-314 (466)
172 KOG0639 Transducin-like enhanc 98.9 2.3E-09 4.9E-14 126.5 8.3 150 11-171 514-663 (705)
173 KOG4378 Nuclear protein COP1 [ 98.9 8.8E-09 1.9E-13 121.6 12.8 159 4-171 77-239 (673)
174 KOG1009 Chromatin assembly com 98.9 1.1E-08 2.4E-13 119.2 12.1 117 51-172 13-154 (434)
175 KOG2394 WD40 protein DMR-N9 [G 98.9 2E-08 4.2E-13 120.0 14.3 114 51-171 219-362 (636)
176 KOG1523 Actin-related protein 98.9 7.1E-08 1.5E-12 110.0 18.0 160 6-171 10-176 (361)
177 KOG4378 Nuclear protein COP1 [ 98.9 2.5E-08 5.4E-13 117.9 14.5 145 6-158 164-309 (673)
178 KOG0270 WD40 repeat-containing 98.9 5.3E-08 1.2E-12 114.6 17.0 160 5-171 285-449 (463)
179 KOG1273 WD40 repeat protein [G 98.8 3.3E-08 7.2E-13 112.1 14.7 108 54-169 26-135 (405)
180 KOG0322 G-protein beta subunit 98.8 9.2E-09 2E-13 114.5 9.6 143 20-170 166-322 (323)
181 KOG1408 WD40 repeat protein [F 98.8 3.7E-08 8E-13 120.0 15.2 113 52-171 597-713 (1080)
182 KOG1310 WD40 repeat protein [G 98.8 1.1E-08 2.5E-13 121.8 9.5 119 49-172 48-179 (758)
183 KOG4227 WD40 repeat protein [G 98.8 2.3E-07 4.9E-12 107.2 19.3 155 11-172 61-226 (609)
184 PRK11028 6-phosphogluconolacto 98.8 2.5E-07 5.4E-12 108.7 20.0 157 8-171 36-205 (330)
185 KOG1445 Tumor-specific antigen 98.8 1.5E-08 3.3E-13 121.8 9.7 153 10-171 631-798 (1012)
186 KOG2919 Guanine nucleotide-bin 98.8 3.2E-08 7E-13 112.5 11.8 148 6-162 207-361 (406)
187 KOG4328 WD40 protein [Function 98.8 4.2E-08 9.2E-13 115.7 12.5 152 9-171 282-450 (498)
188 KOG2139 WD40 repeat protein [G 98.8 1.6E-07 3.5E-12 108.3 16.7 152 9-171 101-268 (445)
189 KOG0307 Vesicle coat complex C 98.8 9.8E-09 2.1E-13 131.2 7.7 159 8-172 118-285 (1049)
190 PRK11028 6-phosphogluconolacto 98.8 3.8E-07 8.2E-12 107.2 20.2 159 8-171 81-258 (330)
191 KOG4227 WD40 repeat protein [G 98.7 1.3E-07 2.9E-12 109.1 13.7 118 49-174 54-182 (609)
192 KOG0290 Conserved WD40 repeat- 98.7 1.3E-07 2.9E-12 106.5 13.2 194 12-246 102-317 (364)
193 PRK01742 tolB translocation pr 98.7 6.9E-07 1.5E-11 109.5 20.2 150 8-171 205-361 (429)
194 KOG2110 Uncharacterized conser 98.6 9.2E-07 2E-11 102.7 18.5 110 51-170 87-203 (391)
195 KOG1587 Cytoplasmic dynein int 98.6 5E-07 1.1E-11 112.9 17.7 159 7-172 243-473 (555)
196 KOG2055 WD40 repeat protein [G 98.6 3.5E-07 7.6E-12 108.2 15.1 157 5-172 212-375 (514)
197 KOG1188 WD40 repeat protein [G 98.6 5.5E-07 1.2E-11 103.6 15.8 152 11-171 75-242 (376)
198 KOG0771 Prolactin regulatory e 98.6 8E-07 1.7E-11 104.6 15.9 161 7-172 145-355 (398)
199 KOG2110 Uncharacterized conser 98.6 1.5E-06 3.3E-11 100.9 17.4 118 50-173 128-250 (391)
200 KOG0771 Prolactin regulatory e 98.6 4.7E-07 1E-11 106.5 13.0 69 55-129 148-217 (398)
201 PRK01742 tolB translocation pr 98.5 1.1E-06 2.4E-11 107.7 16.6 150 9-171 250-401 (429)
202 KOG1523 Actin-related protein 98.5 7.9E-07 1.7E-11 101.6 13.5 157 50-244 9-173 (361)
203 KOG1517 Guanine nucleotide bin 98.5 2E-06 4.4E-11 109.2 17.0 190 10-230 1169-1370(1387)
204 KOG1587 Cytoplasmic dynein int 98.5 1E-06 2.2E-11 110.2 14.0 157 9-171 350-516 (555)
205 KOG0974 WD-repeat protein WDR6 98.5 8.6E-07 1.9E-11 113.1 13.4 143 16-171 143-288 (967)
206 PRK03629 tolB translocation pr 98.5 1.1E-05 2.4E-10 99.1 22.4 154 7-171 199-363 (429)
207 KOG1310 WD40 repeat protein [G 98.4 2.1E-06 4.5E-11 103.1 14.1 157 10-174 54-234 (758)
208 KOG1272 WD40-repeat-containing 98.4 2.3E-07 5E-12 109.7 6.1 118 52-176 210-328 (545)
209 KOG1963 WD40 repeat protein [G 98.4 3.2E-06 7E-11 106.5 15.8 151 10-170 164-321 (792)
210 PF04762 IKI3: IKI3 family; I 98.4 0.0039 8.5E-08 83.6 45.3 223 6-262 75-348 (928)
211 KOG2041 WD40 repeat protein [G 98.4 7.1E-05 1.5E-09 92.2 26.4 112 52-170 15-144 (1189)
212 PF08662 eIF2A: Eukaryotic tra 98.4 4.1E-06 8.8E-11 92.4 14.7 109 8-127 61-179 (194)
213 KOG3881 Uncharacterized conser 98.4 4.1E-06 8.9E-11 98.0 15.1 159 6-172 148-321 (412)
214 KOG0322 G-protein beta subunit 98.4 1.5E-06 3.2E-11 97.3 11.0 156 51-245 150-321 (323)
215 KOG2111 Uncharacterized conser 98.4 2E-05 4.4E-10 90.5 19.0 105 18-128 148-257 (346)
216 KOG0649 WD40 repeat protein [G 98.3 1.2E-05 2.7E-10 89.2 16.2 155 11-174 119-277 (325)
217 KOG1517 Guanine nucleotide bin 98.3 1.1E-05 2.3E-10 102.9 17.3 190 21-245 1125-1331(1387)
218 KOG2139 WD40 repeat protein [G 98.3 1E-05 2.2E-10 93.9 15.7 202 8-230 142-364 (445)
219 KOG0974 WD-repeat protein WDR6 98.3 5.3E-06 1.1E-10 106.1 14.1 149 57-247 139-288 (967)
220 KOG0644 Uncharacterized conser 98.3 1.3E-06 2.8E-11 108.8 8.4 149 9-171 193-346 (1113)
221 PRK05137 tolB translocation pr 98.3 2.8E-05 6.1E-10 95.5 19.6 152 7-170 202-365 (435)
222 PRK05137 tolB translocation pr 98.2 8.2E-05 1.8E-09 91.5 21.9 130 29-170 182-321 (435)
223 PRK04922 tolB translocation pr 98.2 3.3E-05 7.1E-10 95.0 18.2 152 8-171 205-368 (433)
224 PRK02889 tolB translocation pr 98.2 3E-05 6.4E-10 95.3 17.5 153 7-171 196-360 (427)
225 PRK03629 tolB translocation pr 98.2 5.3E-05 1.1E-09 93.2 19.3 145 9-166 245-398 (429)
226 KOG0280 Uncharacterized conser 98.2 2.1E-05 4.6E-10 89.5 13.8 116 54-175 124-245 (339)
227 PRK02889 tolB translocation pr 98.2 5.8E-05 1.2E-09 92.8 18.9 146 8-164 241-393 (427)
228 TIGR02800 propeller_TolB tol-p 98.2 6.4E-05 1.4E-09 91.0 18.9 153 8-171 191-354 (417)
229 KOG3914 WD repeat protein WDR4 98.1 1.3E-05 2.9E-10 94.2 11.7 155 12-175 68-227 (390)
230 KOG1272 WD40-repeat-containing 98.1 3.5E-06 7.6E-11 100.0 6.7 146 12-170 215-361 (545)
231 PRK04922 tolB translocation pr 98.1 6.6E-05 1.4E-09 92.3 18.3 148 10-170 251-410 (433)
232 KOG1064 RAVE (regulator of V-A 98.1 1.1E-05 2.4E-10 106.9 10.9 154 7-172 2209-2367(2439)
233 KOG2111 Uncharacterized conser 98.1 9.6E-05 2.1E-09 85.1 17.0 99 69-174 155-259 (346)
234 KOG2314 Translation initiation 98.0 0.0002 4.4E-09 86.9 19.0 140 53-229 212-364 (698)
235 KOG1334 WD40 repeat protein [G 98.0 3.4E-05 7.3E-10 92.4 11.5 121 45-171 137-263 (559)
236 TIGR02800 propeller_TolB tol-p 98.0 0.00021 4.6E-09 86.4 18.5 147 9-168 236-391 (417)
237 KOG4547 WD40 repeat-containing 98.0 0.00013 2.9E-09 89.4 16.1 115 48-171 99-220 (541)
238 KOG1240 Protein kinase contain 97.9 0.00033 7.2E-09 91.2 19.5 156 11-172 1053-1226(1431)
239 KOG1524 WD40 repeat-containing 97.9 7.5E-05 1.6E-09 90.0 12.9 136 7-167 146-282 (737)
240 KOG2041 WD40 repeat protein [G 97.9 0.0068 1.5E-07 75.6 29.4 154 8-168 73-244 (1189)
241 KOG2321 WD40 repeat protein [G 97.9 0.00014 3E-09 88.5 15.1 131 30-171 156-302 (703)
242 KOG0309 Conserved WD40 repeat- 97.9 2.1E-05 4.5E-10 97.2 7.7 155 9-171 72-232 (1081)
243 KOG4497 Uncharacterized conser 97.9 0.00016 3.4E-09 83.3 13.8 152 8-169 50-238 (447)
244 PRK00178 tolB translocation pr 97.9 0.00049 1.1E-08 84.3 19.3 153 7-171 199-363 (430)
245 KOG2315 Predicted translation 97.8 0.00032 7E-09 85.6 16.6 111 50-171 269-390 (566)
246 COG2706 3-carboxymuconate cycl 97.8 0.0062 1.3E-07 71.8 26.3 146 20-171 4-175 (346)
247 PF00400 WD40: WD domain, G-be 97.8 3.8E-05 8.2E-10 62.1 6.0 36 134-169 4-39 (39)
248 PF10282 Lactonase: Lactonase, 97.8 0.0016 3.4E-08 78.1 21.9 161 11-171 148-322 (345)
249 PF02239 Cytochrom_D1: Cytochr 97.8 0.0008 1.7E-08 81.6 19.5 141 20-169 7-156 (369)
250 KOG1334 WD40 repeat protein [G 97.8 6E-05 1.3E-09 90.3 9.4 202 11-247 237-466 (559)
251 KOG1963 WD40 repeat protein [G 97.8 0.00027 5.8E-09 89.9 15.5 154 11-171 210-375 (792)
252 KOG2321 WD40 repeat protein [G 97.8 0.00037 8.1E-09 85.0 16.1 154 11-172 56-259 (703)
253 PRK04792 tolB translocation pr 97.8 0.00073 1.6E-08 83.8 18.7 152 9-171 220-380 (448)
254 PRK01029 tolB translocation pr 97.8 0.00074 1.6E-08 83.3 18.4 155 9-171 233-403 (428)
255 KOG1240 Protein kinase contain 97.7 0.00028 6.1E-09 91.9 14.6 120 49-173 1046-1183(1431)
256 PF11768 DUF3312: Protein of u 97.7 0.00084 1.8E-08 83.0 17.6 153 7-171 146-329 (545)
257 PF00400 WD40: WD domain, G-be 97.7 8.5E-05 1.8E-09 60.0 6.2 37 84-125 2-39 (39)
258 KOG4714 Nucleoporin [Nuclear s 97.7 0.0001 2.2E-09 83.0 8.8 69 102-171 184-254 (319)
259 PRK01029 tolB translocation pr 97.7 0.001 2.3E-08 82.0 18.4 154 9-171 187-359 (428)
260 KOG1409 Uncharacterized conser 97.7 0.00025 5.5E-09 82.3 11.8 80 42-127 189-270 (404)
261 KOG3881 Uncharacterized conser 97.7 0.00027 5.8E-09 83.3 11.8 104 19-128 216-321 (412)
262 KOG1354 Serine/threonine prote 97.7 0.0002 4.2E-09 83.1 10.5 152 11-171 169-359 (433)
263 PF10282 Lactonase: Lactonase, 97.7 0.0064 1.4E-07 72.9 23.8 161 6-171 86-275 (345)
264 PF02239 Cytochrom_D1: Cytochr 97.6 0.0024 5.2E-08 77.5 19.7 159 3-171 31-202 (369)
265 PRK00178 tolB translocation pr 97.6 0.0018 3.9E-08 79.4 18.5 147 10-168 246-401 (430)
266 KOG4497 Uncharacterized conser 97.6 0.00016 3.5E-09 83.3 8.6 135 57-228 14-150 (447)
267 KOG0280 Uncharacterized conser 97.6 0.00082 1.8E-08 77.0 13.6 154 9-171 124-284 (339)
268 PF04762 IKI3: IKI3 family; I 97.6 0.64 1.4E-05 63.1 42.8 69 93-169 76-148 (928)
269 KOG2695 WD40 repeat protein [G 97.6 0.00018 3.8E-09 83.6 8.1 116 51-171 252-376 (425)
270 PRK04792 tolB translocation pr 97.5 0.003 6.5E-08 78.5 18.8 141 10-163 265-414 (448)
271 PLN02919 haloacid dehalogenase 97.5 0.0022 4.7E-08 87.2 18.7 161 10-172 686-889 (1057)
272 KOG1354 Serine/threonine prote 97.5 0.00049 1.1E-08 80.0 10.6 153 12-172 90-302 (433)
273 PLN02919 haloacid dehalogenase 97.5 0.0083 1.8E-07 81.7 24.1 159 10-171 627-833 (1057)
274 KOG4190 Uncharacterized conser 97.4 0.00021 4.6E-09 85.9 6.5 117 48-172 732-860 (1034)
275 KOG2695 WD40 repeat protein [G 97.4 0.00026 5.6E-09 82.3 6.9 114 9-128 255-377 (425)
276 KOG0309 Conserved WD40 repeat- 97.4 0.00028 6.1E-09 87.7 6.9 115 8-128 116-233 (1081)
277 PF11768 DUF3312: Protein of u 97.4 0.002 4.4E-08 79.8 14.1 74 49-128 257-330 (545)
278 KOG2066 Vacuolar assembly/sort 97.3 0.0034 7.4E-08 79.6 16.1 101 60-171 80-187 (846)
279 KOG1064 RAVE (regulator of V-A 97.3 0.0019 4E-08 87.0 14.0 153 50-245 2207-2364(2439)
280 COG2706 3-carboxymuconate cycl 97.3 0.012 2.6E-07 69.5 18.7 161 4-171 86-274 (346)
281 KOG3914 WD repeat protein WDR4 97.2 0.00067 1.4E-08 80.4 7.9 75 49-129 149-225 (390)
282 COG4946 Uncharacterized protei 97.2 0.011 2.4E-07 71.3 17.4 120 45-171 353-477 (668)
283 COG5354 Uncharacterized protei 97.2 0.21 4.5E-06 61.4 27.7 111 50-170 31-159 (561)
284 KOG1409 Uncharacterized conser 97.1 0.0052 1.1E-07 71.8 13.6 156 10-173 72-272 (404)
285 PF15492 Nbas_N: Neuroblastoma 97.1 0.029 6.2E-07 64.8 18.8 162 11-172 2-260 (282)
286 KOG4532 WD40-like repeat conta 97.0 0.014 3E-07 66.6 14.9 107 65-176 130-238 (344)
287 KOG3621 WD40 repeat-containing 97.0 0.025 5.5E-07 71.3 18.5 118 50-172 32-155 (726)
288 KOG4714 Nucleoporin [Nuclear s 97.0 0.0019 4.1E-08 73.1 7.7 72 52-128 180-255 (319)
289 KOG4547 WD40 repeat-containing 96.9 0.019 4.1E-07 71.1 16.3 99 64-171 71-172 (541)
290 PRK02888 nitrous-oxide reducta 96.9 0.019 4.1E-07 73.0 16.6 162 2-172 188-405 (635)
291 KOG4532 WD40-like repeat conta 96.7 0.047 1E-06 62.4 16.5 117 54-175 161-286 (344)
292 KOG2314 Translation initiation 96.7 0.0097 2.1E-07 72.9 11.4 136 8-158 212-363 (698)
293 COG5170 CDC55 Serine/threonine 96.6 0.0094 2E-07 68.7 10.1 121 49-176 170-314 (460)
294 COG5354 Uncharacterized protei 96.6 0.054 1.2E-06 66.3 16.9 150 9-171 225-395 (561)
295 KOG2315 Predicted translation 96.6 0.016 3.4E-07 71.5 12.3 91 51-153 311-412 (566)
296 PRK04043 tolB translocation pr 96.6 0.071 1.5E-06 66.0 18.2 149 8-170 234-397 (419)
297 PRK04043 tolB translocation pr 96.5 0.14 3.1E-06 63.4 20.1 140 8-162 189-339 (419)
298 KOG3617 WD40 and TPR repeat-co 96.5 0.0039 8.5E-08 78.7 6.2 107 57-172 21-132 (1416)
299 KOG4190 Uncharacterized conser 96.2 0.012 2.6E-07 71.5 8.3 143 21-172 749-907 (1034)
300 smart00320 WD40 WD40 repeats. 96.1 0.012 2.5E-07 43.9 5.1 34 136-169 7-40 (40)
301 PF04053 Coatomer_WDAD: Coatom 96.1 0.41 8.9E-06 59.8 21.2 329 406-847 71-430 (443)
302 KOG0882 Cyclophilin-related pe 96.0 0.027 5.8E-07 68.0 9.7 205 16-250 17-234 (558)
303 KOG1275 PAB-dependent poly(A) 96.0 0.03 6.6E-07 72.1 10.6 140 20-170 188-341 (1118)
304 smart00320 WD40 WD40 repeats. 95.9 0.02 4.4E-07 42.6 5.6 38 83-125 2-40 (40)
305 KOG1924 RhoA GTPase effector D 95.9 0.024 5.2E-07 71.7 8.7 46 1346-1397 714-759 (1102)
306 TIGR02658 TTQ_MADH_Hv methylam 95.7 0.62 1.3E-05 56.5 19.5 106 62-171 205-330 (352)
307 TIGR02658 TTQ_MADH_Hv methylam 95.6 0.35 7.6E-06 58.6 17.1 108 17-128 11-137 (352)
308 KOG1832 HIV-1 Vpr-binding prot 95.6 0.037 8E-07 70.8 8.8 77 47-128 1097-1176(1516)
309 KOG3617 WD40 and TPR repeat-co 95.4 0.023 5E-07 72.2 6.5 108 11-127 20-131 (1416)
310 KOG1275 PAB-dependent poly(A) 95.4 0.14 3E-06 66.4 13.3 105 62-176 186-301 (1118)
311 KOG1924 RhoA GTPase effector D 95.1 0.079 1.7E-06 67.3 9.5 15 819-833 378-392 (1102)
312 PF08450 SGL: SMP-30/Gluconola 95.1 1.6 3.5E-05 49.5 19.6 114 52-171 86-213 (246)
313 KOG1920 IkappaB kinase complex 95.0 0.36 7.8E-06 64.6 15.6 159 6-171 109-322 (1265)
314 PF08450 SGL: SMP-30/Gluconola 94.9 1.1 2.5E-05 50.7 17.8 146 11-171 4-164 (246)
315 COG4946 Uncharacterized protei 94.8 0.98 2.1E-05 55.3 17.2 103 50-160 400-507 (668)
316 KOG1912 WD40 repeat protein [G 94.8 0.35 7.5E-06 61.9 13.9 148 11-170 20-185 (1062)
317 COG5170 CDC55 Serine/threonine 94.6 0.081 1.7E-06 61.4 7.4 116 50-171 25-201 (460)
318 KOG1645 RING-finger-containing 94.5 0.13 2.9E-06 61.6 9.0 92 75-174 175-269 (463)
319 PF06977 SdiA-regulated: SdiA- 94.4 1.3 2.9E-05 51.3 16.8 117 50-172 20-148 (248)
320 KOG1920 IkappaB kinase complex 94.3 2.7 5.8E-05 56.8 20.8 199 52-260 69-335 (1265)
321 KOG1008 Uncharacterized conser 94.2 0.018 3.8E-07 71.9 1.2 156 11-175 107-279 (783)
322 PF14783 BBS2_Mid: Ciliary BBS 94.2 1.6 3.4E-05 44.8 14.7 99 54-165 2-108 (111)
323 KOG1008 Uncharacterized conser 93.7 0.017 3.6E-07 72.1 -0.4 112 50-170 101-224 (783)
324 KOG1645 RING-finger-containing 93.6 0.18 3.9E-06 60.6 7.8 74 50-128 192-267 (463)
325 KOG4640 Anaphase-promoting com 93.5 0.29 6.2E-06 61.7 9.5 74 50-128 19-93 (665)
326 PRK02888 nitrous-oxide reducta 93.2 0.79 1.7E-05 58.9 12.9 104 51-160 320-450 (635)
327 PF07433 DUF1513: Protein of u 92.6 4.4 9.6E-05 48.3 17.2 143 11-161 9-182 (305)
328 PF07433 DUF1513: Protein of u 92.5 4 8.7E-05 48.6 16.8 155 11-171 55-247 (305)
329 KOG2066 Vacuolar assembly/sort 92.0 2.4 5.2E-05 55.1 15.0 136 18-171 83-233 (846)
330 KOG2444 WD40 repeat protein [G 91.9 0.13 2.9E-06 58.0 3.5 157 9-172 9-178 (238)
331 KOG1912 WD40 repeat protein [G 91.6 0.45 9.8E-06 60.9 8.0 112 51-171 15-143 (1062)
332 COG0823 TolB Periplasmic compo 91.1 2.5 5.5E-05 52.7 13.9 149 12-171 198-353 (425)
333 PF08553 VID27: VID27 cytoplas 90.9 4.2 9E-05 54.2 15.9 126 37-171 511-647 (794)
334 PF03178 CPSF_A: CPSF A subuni 90.7 7.3 0.00016 46.3 16.8 146 18-171 42-202 (321)
335 KOG4849 mRNA cleavage factor I 90.5 16 0.00034 43.7 18.2 52 1293-1344 344-407 (498)
336 COG0823 TolB Periplasmic compo 90.4 2.1 4.5E-05 53.5 12.2 146 6-161 237-389 (425)
337 COG3391 Uncharacterized conser 90.4 12 0.00025 46.1 18.5 151 10-171 77-239 (381)
338 COG3490 Uncharacterized protei 90.0 3.9 8.4E-05 48.0 12.8 141 18-168 35-186 (366)
339 KOG0882 Cyclophilin-related pe 89.5 2.8 6.1E-05 51.4 11.7 146 18-171 64-231 (558)
340 COG3391 Uncharacterized conser 89.2 14 0.00031 45.4 17.9 153 8-171 117-283 (381)
341 COG3204 Uncharacterized protei 89.1 18 0.00038 43.1 17.3 117 50-171 84-210 (316)
342 PF02897 Peptidase_S9_N: Proly 88.4 8 0.00017 47.4 15.2 102 55-163 127-248 (414)
343 PF15492 Nbas_N: Neuroblastoma 87.8 24 0.00052 41.6 17.3 66 57-128 3-74 (282)
344 PRK11788 tetratricopeptide rep 87.8 2 4.4E-05 51.5 9.4 25 1314-1338 184-208 (389)
345 PRK13616 lipoprotein LpqB; Pro 87.8 10 0.00022 49.5 16.0 152 8-171 351-525 (591)
346 PF13360 PQQ_2: PQQ-like domai 87.4 8.3 0.00018 42.8 13.3 104 62-171 35-140 (238)
347 KOG4640 Anaphase-promoting com 87.3 1.5 3.3E-05 55.6 7.8 67 102-171 25-92 (665)
348 PF15390 DUF4613: Domain of un 87.1 15 0.00032 47.0 16.0 154 11-171 16-186 (671)
349 KOG2079 Vacuolar assembly/sort 87.0 1.6 3.4E-05 58.2 8.0 73 51-127 130-203 (1206)
350 PF08553 VID27: VID27 cytoplas 86.9 4 8.8E-05 54.3 11.8 99 20-126 544-646 (794)
351 KOG2079 Vacuolar assembly/sort 86.8 1.7 3.7E-05 57.9 8.2 101 62-171 98-203 (1206)
352 TIGR03300 assembly_YfgL outer 86.3 12 0.00025 45.3 14.8 96 62-168 278-376 (377)
353 PF12894 Apc4_WD40: Anaphase-p 86.0 1.6 3.4E-05 38.1 5.1 34 50-84 10-43 (47)
354 KOG3621 WD40 repeat-containing 85.6 2.5 5.3E-05 54.4 8.5 109 12-128 39-155 (726)
355 PF12234 Rav1p_C: RAVE protein 84.9 8.2 0.00018 50.3 12.9 107 57-170 35-155 (631)
356 PF13360 PQQ_2: PQQ-like domai 84.8 31 0.00067 38.3 16.2 144 18-171 76-230 (238)
357 KOG2114 Vacuolar assembly/sort 84.7 16 0.00035 48.3 15.1 121 50-171 63-201 (933)
358 TIGR03300 assembly_YfgL outer 84.7 20 0.00043 43.4 15.6 138 18-171 65-208 (377)
359 PF13763 DUF4167: Domain of un 84.3 1.8 3.8E-05 41.8 5.0 38 803-840 36-73 (80)
360 KOG1832 HIV-1 Vpr-binding prot 83.9 1.4 3E-05 57.4 5.3 86 83-175 1091-1179(1516)
361 KOG4649 PQQ (pyrrolo-quinoline 83.4 25 0.00054 41.1 14.3 103 18-129 23-125 (354)
362 PF12894 Apc4_WD40: Anaphase-p 83.3 2.1 4.6E-05 37.3 4.7 31 141-171 11-41 (47)
363 KOG2395 Protein involved in va 82.1 15 0.00034 46.3 13.0 124 37-171 363-500 (644)
364 KOG3630 Nuclear pore complex, 81.7 12 0.00026 50.7 12.5 153 11-171 46-228 (1405)
365 PF13432 TPR_16: Tetratricopep 81.4 5.6 0.00012 35.6 7.0 56 770-834 4-59 (65)
366 PF10647 Gmad1: Lipoprotein Lp 80.4 59 0.0013 37.8 16.7 146 8-163 25-187 (253)
367 KOG3616 Selective LIM binding 80.4 6 0.00013 50.9 8.8 79 750-844 437-515 (1636)
368 PF14559 TPR_19: Tetratricopep 80.2 2.6 5.7E-05 37.8 4.5 54 773-835 1-54 (68)
369 TIGR02795 tol_pal_ybgF tol-pal 79.6 26 0.00057 34.1 11.7 87 772-864 11-108 (119)
370 PF02897 Peptidase_S9_N: Proly 79.1 45 0.00097 41.0 16.0 109 11-128 128-261 (414)
371 PF05191 ADK_lid: Adenylate ki 78.3 1.3 2.8E-05 36.5 1.7 29 1486-1514 3-36 (36)
372 PF00930 DPPIV_N: Dipeptidyl p 77.1 8.2 0.00018 46.7 8.7 104 60-171 1-131 (353)
373 PF14783 BBS2_Mid: Ciliary BBS 77.1 11 0.00023 38.9 8.1 63 103-171 8-71 (111)
374 TIGR02604 Piru_Ver_Nterm putat 76.9 43 0.00093 40.9 14.8 142 8-160 15-202 (367)
375 PRK15359 type III secretion sy 76.7 19 0.00041 38.0 10.3 89 771-876 32-120 (144)
376 PRK13616 lipoprotein LpqB; Pro 76.4 17 0.00037 47.4 11.7 107 52-165 350-470 (591)
377 PF04841 Vps16_N: Vps16, N-ter 75.8 1.1E+02 0.0024 38.3 18.1 70 9-82 31-110 (410)
378 PF06977 SdiA-regulated: SdiA- 75.2 96 0.0021 36.3 16.3 156 9-171 24-201 (248)
379 TIGR02795 tol_pal_ybgF tol-pal 74.3 32 0.0007 33.4 10.7 107 1312-1447 4-117 (119)
380 KOG2114 Vacuolar assembly/sort 74.1 82 0.0018 42.2 16.4 114 51-171 125-243 (933)
381 PLN03088 SGT1, suppressor of 73.9 34 0.00074 41.8 12.9 98 1312-1438 4-101 (356)
382 COG3490 Uncharacterized protei 73.9 70 0.0015 38.1 14.3 100 12-116 73-180 (366)
383 PF08728 CRT10: CRT10; InterP 72.9 29 0.00062 46.0 12.4 113 53-171 102-246 (717)
384 PLN03081 pentatricopeptide (PP 72.5 13 0.00028 49.1 9.5 110 740-862 433-558 (697)
385 KOG2444 WD40 repeat protein [G 72.4 9 0.0002 43.9 6.8 62 62-128 113-178 (238)
386 TIGR02552 LcrH_SycD type III s 72.2 39 0.00085 34.2 11.0 83 771-862 25-115 (135)
387 COG5010 TadD Flp pilus assembl 72.0 27 0.00058 40.9 10.5 87 771-866 108-202 (257)
388 PF14761 HPS3_N: Hermansky-Pud 71.2 69 0.0015 36.7 13.4 106 60-172 24-164 (215)
389 PF14853 Fis1_TPR_C: Fis1 C-te 71.0 6.6 0.00014 35.2 4.3 33 1417-1449 11-44 (53)
390 PF14870 PSII_BNR: Photosynthe 70.5 1.1E+02 0.0024 36.9 15.7 124 33-165 126-255 (302)
391 KOG2395 Protein involved in va 70.4 21 0.00047 45.1 9.8 102 62-170 344-458 (644)
392 PF14583 Pectate_lyase22: Olig 70.4 1.3E+02 0.0029 37.3 16.5 154 10-170 191-380 (386)
393 PF07569 Hira: TUP1-like enhan 70.3 15 0.00033 41.9 8.2 68 103-171 16-95 (219)
394 KOG1155 Anaphase-promoting com 70.0 42 0.00092 42.1 12.1 39 1416-1456 516-554 (559)
395 TIGR02552 LcrH_SycD type III s 70.0 82 0.0018 31.9 12.8 99 1311-1438 18-116 (135)
396 PF07719 TPR_2: Tetratricopept 69.9 6.4 0.00014 30.4 3.6 27 809-835 4-30 (34)
397 PF03178 CPSF_A: CPSF A subuni 69.8 2.4E+02 0.0053 33.5 19.6 148 12-168 92-262 (321)
398 PF13414 TPR_11: TPR repeat; P 69.7 11 0.00025 33.8 5.8 60 808-875 5-65 (69)
399 KOG3630 Nuclear pore complex, 69.3 20 0.00043 48.8 9.8 102 53-159 157-261 (1405)
400 PRK10803 tol-pal system protei 68.3 93 0.002 36.7 14.3 117 1310-1450 142-261 (263)
401 PF07995 GSDH: Glucose / Sorbo 67.9 41 0.00088 40.7 11.6 104 54-164 4-136 (331)
402 KOG1923 Rac1 GTPase effector F 66.2 40 0.00086 44.5 11.2 72 1231-1307 313-387 (830)
403 PF00780 CNH: CNH domain; Int 65.7 55 0.0012 37.6 11.8 104 62-172 6-123 (275)
404 KOG1916 Nuclear protein, conta 65.6 4.2 9.2E-05 53.5 2.8 119 52-171 181-323 (1283)
405 PF06433 Me-amine-dh_H: Methyl 65.0 1.6E+02 0.0035 36.1 15.5 112 54-171 38-165 (342)
406 COG3386 Gluconolactonase [Carb 64.3 68 0.0015 38.6 12.4 99 12-116 168-274 (307)
407 PF13176 TPR_7: Tetratricopept 63.8 9.7 0.00021 30.8 3.6 24 809-832 2-25 (36)
408 PF08596 Lgl_C: Lethal giant l 63.8 89 0.0019 39.0 13.6 111 53-171 3-173 (395)
409 PRK11788 tetratricopeptide rep 63.7 82 0.0018 37.9 13.2 83 774-865 191-282 (389)
410 COG4391 Uncharacterized protei 62.9 4.4 9.5E-05 37.2 1.6 20 1476-1495 40-59 (62)
411 PF13429 TPR_15: Tetratricopep 62.8 28 0.00061 40.3 8.7 111 750-867 92-216 (280)
412 PRK10803 tol-pal system protei 62.7 66 0.0014 37.9 11.6 102 766-873 142-258 (263)
413 PRK11138 outer membrane biogen 62.2 1.9E+02 0.0041 35.5 16.1 98 62-170 293-393 (394)
414 cd00189 TPR Tetratricopeptide 61.8 1E+02 0.0022 26.9 10.5 26 1313-1338 3-28 (100)
415 KOG1916 Nuclear protein, conta 60.9 9.6 0.00021 50.4 4.6 112 50-171 131-265 (1283)
416 KOG0553 TPR repeat-containing 60.8 12 0.00027 44.3 5.2 89 749-859 102-193 (304)
417 PF09976 TPR_21: Tetratricopep 60.5 19 0.00042 37.7 6.2 49 774-832 96-144 (145)
418 PF00515 TPR_1: Tetratricopept 60.1 13 0.00027 29.1 3.6 27 809-835 4-30 (34)
419 COG2956 Predicted N-acetylgluc 59.9 1.4E+02 0.003 36.4 13.4 143 721-875 26-188 (389)
420 PF13181 TPR_8: Tetratricopept 59.2 16 0.00034 28.4 4.0 28 809-836 4-31 (34)
421 PF13429 TPR_15: Tetratricopep 59.0 3.8 8.2E-05 47.5 0.7 87 752-858 9-103 (280)
422 PRK12496 hypothetical protein; 58.6 5.3 0.00012 43.7 1.7 24 1485-1508 128-152 (164)
423 PRK11447 cellulose synthase su 58.3 45 0.00097 47.0 10.9 100 765-876 302-413 (1157)
424 PLN03088 SGT1, suppressor of 58.0 70 0.0015 39.1 11.3 83 769-860 8-98 (356)
425 PF00780 CNH: CNH domain; Int 58.0 2.9E+02 0.0063 31.7 15.9 141 16-171 5-165 (275)
426 PF00930 DPPIV_N: Dipeptidyl p 56.4 32 0.00069 41.7 8.0 108 15-127 1-131 (353)
427 cd00189 TPR Tetratricopeptide 56.3 1.1E+02 0.0023 26.8 9.6 54 772-834 9-62 (100)
428 PRK13684 Ycf48-like protein; P 56.1 4E+02 0.0086 32.4 17.1 148 8-169 174-329 (334)
429 KOG1925 Rac1 GTPase effector F 55.7 14 0.00031 45.8 4.7 26 1219-1246 226-251 (817)
430 PF13371 TPR_9: Tetratricopept 55.6 33 0.00072 31.1 6.2 57 771-836 3-59 (73)
431 TIGR02521 type_IV_pilW type IV 55.1 81 0.0018 33.7 10.1 82 772-860 108-197 (234)
432 PF10313 DUF2415: Uncharacteri 54.2 39 0.00084 29.3 5.7 31 52-82 1-34 (43)
433 PF13432 TPR_16: Tetratricopep 53.8 30 0.00064 30.8 5.5 58 811-876 2-59 (65)
434 PF10647 Gmad1: Lipoprotein Lp 53.7 1.7E+02 0.0037 34.1 13.0 103 53-160 25-130 (253)
435 TIGR02521 type_IV_pilW type IV 53.3 1.3E+02 0.0029 32.0 11.4 83 770-861 72-164 (234)
436 PF08271 TF_Zn_Ribbon: TFIIB z 52.6 4.8 0.0001 34.2 0.1 22 1486-1507 2-27 (43)
437 KOG2377 Uncharacterized conser 52.6 1.7E+02 0.0038 36.9 12.9 115 49-171 64-185 (657)
438 PF08596 Lgl_C: Lethal giant l 52.1 67 0.0015 40.0 9.8 72 50-127 85-173 (395)
439 PLN00033 photosystem II stabil 51.9 5.3E+02 0.012 32.4 17.5 147 9-168 241-396 (398)
440 PF10602 RPN7: 26S proteasome 51.8 64 0.0014 35.7 8.7 82 769-860 9-101 (177)
441 PRK10370 formate-dependent nit 51.6 1E+02 0.0022 34.6 10.3 29 806-834 73-101 (198)
442 PF11715 Nup160: Nucleoporin N 51.6 66 0.0014 41.4 10.1 39 52-90 215-257 (547)
443 PRK10115 protease 2; Provision 51.6 1.7E+02 0.0036 39.2 13.9 110 53-170 128-254 (686)
444 COG3386 Gluconolactonase [Carb 50.8 2.1E+02 0.0045 34.7 13.3 111 51-171 65-193 (307)
445 KOG4849 mRNA cleavage factor I 50.7 62 0.0013 39.0 8.5 25 945-975 80-105 (498)
446 KOG4648 Uncharacterized conser 49.8 26 0.00057 42.2 5.5 64 1310-1374 96-177 (536)
447 cd00350 rubredoxin_like Rubred 49.6 12 0.00026 30.3 1.9 23 1486-1508 3-26 (33)
448 TIGR02604 Piru_Ver_Nterm putat 49.5 1.2E+02 0.0026 37.1 11.4 102 53-161 15-143 (367)
449 KOG4649 PQQ (pyrrolo-quinoline 49.1 2E+02 0.0044 34.1 12.1 101 63-172 23-124 (354)
450 KOG0985 Vesicle coat protein c 48.7 86 0.0019 42.8 10.1 58 809-866 1107-1167(1666)
451 TIGR03606 non_repeat_PQQ dehyd 48.6 2.2E+02 0.0047 36.4 13.5 103 53-162 31-166 (454)
452 PF05694 SBP56: 56kDa selenium 48.6 1.7E+02 0.0036 37.1 12.1 117 55-176 184-347 (461)
453 PLN03077 Protein ECB2; Provisi 47.9 2.1E+02 0.0046 38.9 14.5 113 1318-1438 59-183 (857)
454 PF07719 TPR_2: Tetratricopept 47.6 26 0.00057 26.9 3.6 26 1313-1338 4-29 (34)
455 KOG0985 Vesicle coat protein c 47.6 28 0.0006 47.0 5.6 88 774-875 1231-1322(1666)
456 PF13240 zinc_ribbon_2: zinc-r 47.6 9.3 0.0002 28.6 0.9 21 1486-1507 1-21 (23)
457 PF07569 Hira: TUP1-like enhan 47.5 78 0.0017 36.2 8.8 35 52-88 13-47 (219)
458 PF10276 zf-CHCC: Zinc-finger 47.5 11 0.00024 32.1 1.4 20 1476-1495 20-40 (40)
459 PF15390 DUF4613: Domain of un 47.5 1.2E+02 0.0026 39.4 10.8 111 11-127 61-186 (671)
460 PF14583 Pectate_lyase22: Olig 47.4 3.9E+02 0.0083 33.5 14.9 150 13-171 42-222 (386)
461 PRK13684 Ycf48-like protein; P 47.2 5.8E+02 0.013 31.0 16.6 124 33-165 154-282 (334)
462 PF10168 Nup88: Nuclear pore c 47.1 2.1E+02 0.0047 38.5 13.8 69 102-171 89-179 (717)
463 PF13428 TPR_14: Tetratricopep 46.9 26 0.00056 29.4 3.7 25 809-833 4-28 (44)
464 CHL00033 ycf3 photosystem I as 46.6 1.6E+02 0.0035 31.5 10.6 57 772-834 44-100 (168)
465 PF04212 MIT: MIT (microtubule 46.1 1.1E+02 0.0024 28.3 8.1 56 1311-1366 6-64 (69)
466 COG3063 PilF Tfp pilus assembl 46.1 1E+02 0.0022 35.9 9.1 139 720-875 55-200 (250)
467 TIGR02917 PEP_TPR_lipo putativ 45.6 2.4E+02 0.0052 37.1 14.1 121 1311-1438 23-156 (899)
468 PF14655 RAB3GAP2_N: Rab3 GTPa 45.1 1.1E+02 0.0024 38.4 10.2 41 50-90 306-346 (415)
469 PF14870 PSII_BNR: Photosynthe 44.5 6.5E+02 0.014 30.5 21.6 126 33-169 83-213 (302)
470 PRK10049 pgaA outer membrane p 44.5 2.1E+02 0.0045 38.7 13.4 102 768-878 364-473 (765)
471 PLN03218 maturation of RBCL 1; 43.8 1.5E+02 0.0032 41.7 12.0 88 772-867 658-754 (1060)
472 COG1996 RPC10 DNA-directed RNA 43.6 12 0.00026 33.2 1.1 27 1472-1498 12-38 (49)
473 PRK10747 putative protoheme IX 43.5 2.2E+02 0.0047 35.3 12.5 74 774-856 305-385 (398)
474 PLN03218 maturation of RBCL 1; 43.0 1.2E+02 0.0026 42.6 10.9 112 748-867 662-789 (1060)
475 PF00515 TPR_1: Tetratricopept 42.5 35 0.00076 26.6 3.6 26 1313-1338 4-29 (34)
476 PF00301 Rubredoxin: Rubredoxi 42.3 14 0.00029 32.6 1.3 24 1486-1509 3-44 (47)
477 PF13414 TPR_11: TPR repeat; P 42.3 40 0.00087 30.2 4.5 57 769-834 9-66 (69)
478 PF09538 FYDLN_acid: Protein o 42.2 14 0.0003 37.9 1.5 31 1467-1497 7-39 (108)
479 TIGR02300 FYDLN_acid conserved 41.9 14 0.00031 38.7 1.6 32 1467-1498 7-40 (129)
480 cd00729 rubredoxin_SM Rubredox 41.5 18 0.00039 29.5 1.8 24 1485-1508 3-27 (34)
481 PF10313 DUF2415: Uncharacteri 41.5 49 0.0011 28.8 4.4 30 143-172 2-34 (43)
482 PRK15359 type III secretion sy 41.5 65 0.0014 34.0 6.5 61 765-834 57-120 (144)
483 COG2956 Predicted N-acetylgluc 41.2 1.4E+02 0.0031 36.3 9.6 77 775-860 192-277 (389)
484 cd02677 MIT_SNX15 MIT: domain 41.1 95 0.0021 29.8 6.9 52 1311-1362 7-61 (75)
485 PLN03077 Protein ECB2; Provisi 40.6 1.2E+02 0.0026 41.2 10.5 81 771-862 633-721 (857)
486 KOG1129 TPR repeat-containing 40.5 27 0.00059 42.0 3.7 50 802-862 394-443 (478)
487 PF10168 Nup88: Nuclear pore c 40.3 1.8E+02 0.0039 39.2 11.5 74 9-82 87-180 (717)
488 cd02683 MIT_1 MIT: domain cont 39.8 1.1E+02 0.0023 29.7 7.0 56 1310-1365 6-64 (77)
489 TIGR03302 OM_YfiO outer membra 39.3 4.4E+02 0.0095 29.4 13.2 23 1416-1438 175-197 (235)
490 PRK15331 chaperone protein Sic 38.7 4.6E+02 0.0099 29.2 12.4 121 1311-1460 38-159 (165)
491 cd02684 MIT_2 MIT: domain cont 38.4 1.2E+02 0.0025 29.2 7.0 52 1311-1362 7-61 (75)
492 cd02682 MIT_AAA_Arch MIT: doma 38.0 1.2E+02 0.0027 29.3 7.0 56 1311-1366 7-65 (75)
493 smart00745 MIT Microtubule Int 37.6 1.3E+02 0.0029 28.3 7.3 54 1311-1364 9-65 (77)
494 TIGR00990 3a0801s09 mitochondr 37.5 1.9E+02 0.0041 37.8 11.2 48 811-858 404-459 (615)
495 PRK11138 outer membrane biogen 37.2 2.6E+02 0.0055 34.4 11.7 58 63-126 335-393 (394)
496 PF14781 BBS2_N: Ciliary BBSom 37.0 4.1E+02 0.0089 28.7 11.3 104 57-170 4-124 (136)
497 PF12234 Rav1p_C: RAVE protein 37.0 2.6E+02 0.0056 37.1 11.9 93 27-127 49-156 (631)
498 PRK10115 protease 2; Provision 36.9 1.2E+03 0.026 31.4 19.7 108 12-127 132-255 (686)
499 PF08728 CRT10: CRT10; InterP 36.7 2E+02 0.0044 38.5 11.0 101 20-127 115-246 (717)
500 PF13424 TPR_12: Tetratricopep 36.6 59 0.0013 30.0 4.7 53 769-832 20-72 (78)
No 1
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=6.1e-166 Score=1474.96 Aligned_cols=1137 Identities=18% Similarity=0.256 Sum_probs=875.1
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t 88 (1525)
.=-|+|||.+|||+++.| +..|.+|+. .++....++.. |.+||++|+|||++++|+||++|.+||+|++++.+|+++
T Consensus 12 vKglsFHP~rPwILtslH-sG~IQlWDY-RM~tli~rFde-HdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclft 88 (1202)
T KOG0292|consen 12 VKGLSFHPKRPWILTSLH-SGVIQLWDY-RMGTLIDRFDE-HDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFT 88 (1202)
T ss_pred ccceecCCCCCEEEEeec-Cceeeeehh-hhhhHHhhhhc-cCCccceeeecCCCCeEEecCCccEEEEEecccceehhh
Confidence 345899999999999998 688889984 45555555554 999999999999999999999999999999999999999
Q ss_pred Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 89 LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 89 L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
+. |.+.|+ .+.||+..+||+|+|+|.||||||++ .++++.+++||+++|+|..|||...+++|+|.|.||||
T Consensus 89 L~GHlDYVR-----t~~FHheyPWIlSASDDQTIrIWNwq--sr~~iavltGHnHYVMcAqFhptEDlIVSaSLDQTVRV 161 (1202)
T KOG0292|consen 89 LLGHLDYVR-----TVFFHHEYPWILSASDDQTIRIWNWQ--SRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLDQTVRV 161 (1202)
T ss_pred hccccceeE-----EeeccCCCceEEEccCCCeEEEEecc--CCceEEEEecCceEEEeeccCCccceEEEecccceEEE
Confidence 99 999999 99999999999999999999999999 47788889999999999999999999999999999999
Q ss_pred EEeeeecCCCCCCCc-----------cccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccC
Q 000425 168 WKTRVIINPNRPPMQ-----------ANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGG 236 (1525)
Q Consensus 168 WDlr~~~~p~~~~~~-----------a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~ 236 (1525)
||+......+..+.. ...|... ...+..++.++. -+|+|++|||+++++. +|+
T Consensus 162 WDisGLRkk~~~pg~~e~~~~~~~~~~dLfg~~---DaVVK~VLEGHD-------RGVNwaAfhpTlpliV------SG~ 225 (1202)
T KOG0292|consen 162 WDISGLRKKNKAPGSLEDQMRGQQGNSDLFGQT---DAVVKHVLEGHD-------RGVNWAAFHPTLPLIV------SGA 225 (1202)
T ss_pred EeecchhccCCCCCCchhhhhccccchhhcCCc---Ceeeeeeecccc-------cccceEEecCCcceEE------ecC
Confidence 999865544433221 1111100 011333444443 3699999999999998 999
Q ss_pred CcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhc--cCCCceeeehhhhhceeee
Q 000425 237 DNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHL--KGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 237 Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~--~~~~~~~I~di~r~~~i~s 314 (1525)
||..+|.|++.|.++|+++|+|||.+++.|+++. ++| +.++ .+|+.++||||.||+.+++
T Consensus 226 DDRqVKlWrmnetKaWEvDtcrgH~nnVssvlfh------------p~q------~lIlSnsEDksirVwDm~kRt~v~t 287 (1202)
T KOG0292|consen 226 DDRQVKLWRMNETKAWEVDTCRGHYNNVSSVLFH------------PHQ------DLILSNSEDKSIRVWDMTKRTSVQT 287 (1202)
T ss_pred CcceeeEEEeccccceeehhhhcccCCcceEEec------------Ccc------ceeEecCCCccEEEEecccccceee
Confidence 9999999999999999999999999999666544 343 2222 3799999999999999999
Q ss_pred eecccccccCccccCcceeeeccccccCCCcccccccccccccccccccccccceeeEEeCceEEEeecCCCcccccccc
Q 000425 315 HFMEGHAKSAPISRLPLITIFDSKHQLKDIPVCQPFHLELNFFNRENRVLHYPVRAFYVDGINLVAYNLCSGADSIYRKL 394 (1525)
Q Consensus 315 ~~~d~~~k~~~~~~~p~~~i~~~~~~~~~~~vv~~~~~e~~~f~~~~~v~~~~~~~~y~~~~~l~~~~l~~g~~~~~~~L 394 (1525)
+.+|.+++ |.+..+|..++.+++||.++ +|+++..|+..|+- ..+.+||++++.|+.||+.+..+..+.+
T Consensus 288 frrendRF-W~laahP~lNLfAAgHDsGm--~VFkleRErpa~~v------~~n~LfYvkd~~i~~~d~~t~~d~~v~~- 357 (1202)
T KOG0292|consen 288 FRRENDRF-WILAAHPELNLFAAGHDSGM--IVFKLERERPAYAV------NGNGLFYVKDRFIRSYDLRTQKDTAVAS- 357 (1202)
T ss_pred eeccCCeE-EEEEecCCcceeeeecCCce--EEEEEcccCceEEE------cCCEEEEEccceEEeeeccccccceeEe-
Confidence 65566665 78888888888888888877 57777777777655 5567899999999999999966544433
Q ss_pred ccccCCCeeeccccccccCCccEEEEEeeccCCcceEEEEEeccccc---ccccCCCCcccceeEEeCCCCCcEEEEeCC
Q 000425 395 YSTIPGTVEYYPKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDTQ---LADSKSSTVKGRDAAFIGPNEDQFAILDDD 471 (1525)
Q Consensus 395 ~~~~~G~~e~~pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l~---~rnk~G~~~~g~~~~fig~~~n~yAvle~~ 471 (1525)
.+-+|+.+..|.+|+|||....|++|.+.+++.+|+++|-...+.. ...++| .|.+++|+++ |+|||++.+
T Consensus 358 -lr~~g~~~~~~~smsYNpae~~vlics~~~n~~y~L~~ipk~~~~~~~~~~~~k~---tG~~a~fvar--Nrfavl~k~ 431 (1202)
T KOG0292|consen 358 -LRRPGTLWQPPRSLSYNPAENAVLICSNLDNGEYELVQIPKDSDGVSDGKDVKKG---TGEGALFVAR--NRFAVLDKS 431 (1202)
T ss_pred -ccCCCcccCCcceeeeccccCeEEEEeccCCCeEEEEEecCcccccCCchhhhcC---CCCceEEEEe--cceEEEEec
Confidence 3467999999999999999999999998888888888775432222 223347 7899999998 589999998
Q ss_pred CCeEEEEEcCCchhhhhhcccCCcccCCCccccccccccCCcccccccccceeeecCCcceeEEeecCCeEEEEeecccc
Q 000425 472 KTGLALYILKGVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESEVDRIFSTPIESTLMFACDGDQIGMAKLVQGY 551 (1525)
Q Consensus 472 ~~~v~I~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~If~~pl~~~l~~~~~~~~i~~~~~~~~y 551 (1525)
.+++.|.++.++.++ .++ .+..++.||-+.-+. |+ ..++..|.+
T Consensus 432 ~~~v~ik~l~N~vtk---------------------kl~------~~~~~~~IF~ag~g~-ll-l~~~~~v~l------- 475 (1202)
T KOG0292|consen 432 NEQVVIKNLKNKVTK---------------------KLL------LPESTDDIFYAGTGN-LL-LRSPDSVTL------- 475 (1202)
T ss_pred CcceEEecccchhhh---------------------ccc------CcccccceeeccCcc-EE-EEcCCeEEE-------
Confidence 888998888766653 333 224589999543222 22 222233333
Q ss_pred ccccccccccccccccceee-EeeccceEEEEEEcccCCccEEEEEeeeeEEEEccchhhhhccccccCCCCCcceeeee
Q 000425 552 RLSARAGHYLQTKSEGKKSI-KLKVTEVMLKVAWQETQRGYVAGVLTTQRVLIVSADLDILASSSTKFDKGLPSFRSLLW 630 (1525)
Q Consensus 552 d~~~~~~~~i~~~~e~~r~i-~~~~~e~v~~V~Ws~~~~g~~vaI~t~~~i~i~~~~l~~l~~~~~~~~~~~~~IkSg~W 630 (1525)
++|+++ +.. .++.. .|++|+||.+ ++.||+++||+|+|++++|+.+|++||++ |||||.|
T Consensus 476 -------fdvQq~----~~~~si~~s-~vkyvvws~d--m~~vAll~Kh~i~i~~kkL~l~~sihEti-----riksgaw 536 (1202)
T KOG0292|consen 476 -------FDVQQK----KKVGSIKVS-KVKYVVWSND--MSRVALLSKHTITIADKKLELLCSIHETI-----RIKSGAW 536 (1202)
T ss_pred -------EEeecc----eEEEEEecC-ceeEEEEcCc--cchhhhcccceEEEEecchhheecchhee-----Eeeecee
Confidence 333333 112 33333 4899999999 89999999999999999999999999997 9999999
Q ss_pred ecC-EEEEEcccccEEEeeCCceeeEEEecCCceEEEEEecCcEEEecCCCCCcccCCceeEEeeeccchhhHHHhhhhh
Q 000425 631 VGP-ALLFSTATAISVLGWDGKVRNILSISMPNAVLVGALNDRLLLANPTEINPRQKKGIEIKSCLVGLLEPLLIGFATM 709 (1525)
Q Consensus 631 vg~-~fIYtT~t~l~Yl~~~G~~~~I~~l~~~~~yLlgyl~~Rl~l~~~~~~~p~~dk~~~I~s~~l~llE~ll~g~a~~ 709 (1525)
+.+ +|||||.+||+|.+.+|++|+|.|||. |+|++. +.+.+++|++|++.| .+..++-+| +.|++|++
T Consensus 537 de~gVfiYtT~nHikYal~~GD~GIikTLd~-~iyitk-v~gn~V~cl~rd~~~--------~~~~IDptE-y~FKlALi 605 (1202)
T KOG0292|consen 537 DEDGVFIYTTLNHIKYALENGDSGIIKTLDK-PIYITK-VKGNKVFCLNRDGEI--------ECLTIDPTE-YRFKLALL 605 (1202)
T ss_pred ccCceEEEEehhhhhhhhccCCcceEEeccc-ceEEEE-eeCCEEEEEecCCCe--------EEEeechHH-HHHHHHHH
Confidence 776 999999999999999999999999998 699998 565555554555444 354567799 59999999
Q ss_pred hHHHHHhhhHHHHHHHHHhccCCCCCChhHHHHHHcCCCCcccchhhccccCCCCCcchhhhHHhhcCCHHHHHHHHHHh
Q 000425 710 QQYFEQKLDLSEILYQITSRFDSLRITPRSLDILAKGPPVCGDLAVSLSQAGPQFTQVLRGIYAIKALRFSTALSVLKDE 789 (1525)
Q Consensus 710 ~~~~~~t~~lr~~l~~~~~~~~s~~i~~~~l~~L~~~~~~~~elAl~ls~~~~~~d~~~rF~LAl~lg~l~~A~~ia~~~ 789 (1525)
.+.|++ +-.+++ .++.++|.++.||+++| |+|+||++++ |+..||+|||++||||+|++.|++.
T Consensus 606 ~k~yde-------Vl~lI~--ns~LvGqaiIaYLqKkg--ypeiAL~FVk-----D~~tRF~LaLe~gnle~ale~akkl 669 (1202)
T KOG0292|consen 606 NKKYDE-------VLHLIK--NSNLVGQAIIAYLQKKG--YPEIALHFVK-----DERTRFELALECGNLEVALEAAKKL 669 (1202)
T ss_pred hhhhHH-------HHHHHH--hcCcccHHHHHHHHhcC--Ccceeeeeec-----CcchheeeehhcCCHHHHHHHHHhc
Confidence 999975 444444 77889999999999955 9999999998 9999999999999999999999986
Q ss_pred hcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhHHH
Q 000425 790 FLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSAMRRLAQRLEEEGANPELRR 869 (1525)
Q Consensus 790 ~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~~~ 869 (1525)
+ .| ++ |+.||.+||++||+++||+|||+.+.|++|+|||++|||.|||+||+++||.+. |....+
T Consensus 670 d-----d~------d~---w~rLge~Al~qgn~~IaEm~yQ~~knfekLsfLYliTgn~eKL~Km~~iae~r~-D~~~~~ 734 (1202)
T KOG0292|consen 670 D-----DK------DV---WERLGEEALRQGNHQIAEMCYQRTKNFEKLSFLYLITGNLEKLSKMMKIAEIRN-DATGQF 734 (1202)
T ss_pred C-----cH------HH---HHHHHHHHHHhcchHHHHHHHHHhhhhhheeEEEEEeCCHHHHHHHHHHHHhhh-hhHHHH
Confidence 4 22 44 556999999999999999999999999999999999999999999999998765 655544
Q ss_pred --------HHHHHHhhccCCcccchhhhhhccccCCCC-CCCCCCcceeecCCCCCCcccccccccccccccCCCCCCCc
Q 000425 870 --------YCERILRVRSTGWTQGIFANFAAESMVPKG-PEWGGGNWEIKTPTNLKSIPQWELATEVVPYMRTDDGPIPS 940 (1525)
Q Consensus 870 --------~c~~il~~~~~~~~~~~~a~~~a~s~~~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 940 (1525)
+-+|+-+.++-|.++ +||.+|.+ +| .|-+ .++++..+. |-+-++|++|.-.....| ++
T Consensus 735 qnalYl~dv~ervkIl~n~g~~~--laylta~~---~G~~~~a---e~l~ee~~~----~~~~lP~~~~~a~ll~pP-~p 801 (1202)
T KOG0292|consen 735 QNALYLGDVKERVKILENGGQLP--LAYLTAAA---HGLEDQA---EKLGEELEK----QVPSLPEVDPNASLLQPP-VP 801 (1202)
T ss_pred HHHHHhccHHHHHHHHHhcCccc--HHHHHHhh---cCcHHHH---HHHHHhhcc----ccCCCCCCCCcccccCCC-Cc
Confidence 678888888889988 99999999 88 3333 445544333 222334445554444433 33
Q ss_pred eecccceeeeccccccceEEEEehhhhhhhccccCCCCCCCCCCCCccccccccCCCCCCCCccccccccchhhhhccCC
Q 000425 941 IISDHVGIYLGSIKGRGTIVEVTEKSLVKDFIPAGADNKPNGVHSSSVKSTYNKSKGASDVDSKVGSLMGLETLTIQNTS 1020 (1525)
Q Consensus 941 ~~~~~~~~ylg~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1020 (1525)
|. +|++||-......+||.+|...-.....+..+.+ .-...+|++ +-+.|.
T Consensus 802 ~~------~l~nwPLl~~s~~~fe~~~~~~~~~~av~~~~e~---------~~g~e~~~~-----e~~l~e--------- 852 (1202)
T KOG0292|consen 802 IM------PLENWPLLSVSKGTFEGALLSRSSSLAVDRDDEG---------DWGEEGWDV-----ELMLGE--------- 852 (1202)
T ss_pred cc------cccCCchhhhhhhhhhHhhhhhcccCcccccccc---------cccccchhh-----hhcccc---------
Confidence 32 4578999999989999999866532222211110 001112211 000000
Q ss_pred CcchhHHHHHHHHHHhhhcccCCCCCCCCcccccccccccccccccc---ccccccchhhHHHHhhhccccCCCCCCc-c
Q 000425 1021 SAADDEQAKAEEEFKKTMYGAAADGSSSDEEGTSKTKKLQIRIRDKP---IASSAVDVNKIKEATKQFKLGEGLGPPM-R 1096 (1525)
Q Consensus 1021 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 1096 (1525)
++.....++|+.++|| ..+| ++++.+-+ ..+..+|.+.+.- -..+..+.. .
T Consensus 853 ---------------d~~~~~d~~g~~~~dE--~~gW--dv~d~~l~pe~~~~~~~~~~~~~~------p~~~~~~~~W~ 907 (1202)
T KOG0292|consen 853 ---------------DGILFNDGAGEVGEDE--GGGW--DVGDLDLPPEEDTPKGADDGEFVV------PAQGMSVSIWS 907 (1202)
T ss_pred ---------------ccccccccccccCccc--ccCc--CcccccCCccccccccccccceec------CCCCCcchhcc
Confidence 0000111111222222 1366 44443332 1111122211110 001111111 2
Q ss_pred ccc------cCCCCcccccccCCCCCCCCCCCCCCCCCCCCCCcccccccccCCccCCCCCCCCccCccCCCCCchhhcc
Q 000425 1097 TKS------LIPGSQDLGQLSSQPSAAGGDGNITAPASSAPGDLFGTESWVQPASVSKPASAGSSVGAQGQPIPEDFFQN 1170 (1525)
Q Consensus 1097 ~~s------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1170 (1525)
++| +++|+|+.+ |+.|.+| +|+
T Consensus 908 ~nS~L~adhvaAGsf~tA----------------------------~~lL~dq------------vgv------------ 935 (1202)
T KOG0292|consen 908 NNSPLAADHVAAGSFETA----------------------------MRLLHDQ------------VGV------------ 935 (1202)
T ss_pred ccCcchhhhhhcCchHHH----------------------------HHHHHhh------------hcc------------
Confidence 333 677798988 9999999 666
Q ss_pred cccchhhcccCCCCccccccccccccccccCccCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCcccccCCCCCCCCCCCC
Q 000425 1171 TIPSLQVAASLPPPGTYLSKYDQVSQGVASGKVAPNQANAPAADSGLP-DGGVPPQIAPQPAIPVESIGLPDGGVPPQSS 1249 (1525)
Q Consensus 1171 ~~~~~~~~~~~p~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~ 1249 (1525)
.+|.|++..|+..|. . +++++| .+ |- ++
T Consensus 936 -------v~f~p~Kt~fl~iy~------------~-------sR~~l~~~~---~~-------------------~~--- 964 (1202)
T KOG0292|consen 936 -------VNFGPLKTHFLKIYA------------G-------SRTYLRATP---CL-------------------PV--- 964 (1202)
T ss_pred -------eecccHHhhhhhhcc------------c-------cceeccCCC---Cc-------------------cc---
Confidence 789999999999998 3 556665 22 11 00
Q ss_pred CCCCCCCCCCCCCCCCCCCCCccccccccCCCCCCCCCCCCCCCCCCCCCCCCccCCchhh-HHHHHHHHHHhhcCChHH
Q 000425 1250 GQTPFPYQSQVLPAQVPPSTQPLDLSALGVPNSGDSGKSPANPASPPTSVRPGQVPRGAAA-SVCFKTGLAHLEQNQLPD 1328 (1525)
Q Consensus 1250 ~~~~~~~~~~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~-~~~lk~gy~~~t~gkf~e 1328 (1525)
+.-| .|...+..+ ...+| +|+++++. ..+|++||++||.|||.|
T Consensus 965 ------------------~~~~-----~R~~se~~~-----------~~~~P-~v~~~l~~l~~kl~~gy~ltt~gKf~e 1009 (1202)
T KOG0292|consen 965 ------------------SLYP-----VRNWSETSS-----------KQGLP-AVGFKLSQLNKKLQKGYKLTTEGKFGE 1009 (1202)
T ss_pred ------------------cccc-----ccccccchh-----------hccCC-cccccHHHHHHHHHHHHhhhccCcHHH
Confidence 1112 122222111 22334 67888876 559999999999999999
Q ss_pred HHHHHHHHHH----HHhcCCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHHHHHHhhcCCCCchh
Q 000425 1329 ALSCFDEAFL----ALAKDHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMARLSRHLGSLPLQTK 1404 (1525)
Q Consensus 1329 Al~~Fr~iL~----~~v~~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~lELaaYfT~c~Lqp~ 1404 (1525)
|+++||+||+ .+|++++|+++++++|+||||||+||+| |++||+ ++ ++ +.+|+||||||||||+|||.
T Consensus 1010 Aie~Frsii~~i~l~vvd~~~e~aea~~li~i~~eYi~gL~~--E~~Rr~-l~-~~----~~~~~~ElAaYFt~~~Lqp~ 1081 (1202)
T KOG0292|consen 1010 AIEKFRSIIYSIPLLVVDSKEEEAEADELIKICREYIVGLSV--ELERRK-LK-KP----NLEQQLELAAYFTHCKLQPM 1081 (1202)
T ss_pred HHHHHHHHHhheeEEEecchhhHHHHHHHHHHHHHHHhhhee--eeeecc-cC-Cc----hHHHHHHHHHHhhcCCCCcH
Confidence 9999999999 7899999999999999999999999999 999999 73 32 34599999999999999999
Q ss_pred HHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC-hhhHHHHHHHHHHHHhcCCCCcccC--CCCCCceeeeccceeccC
Q 000425 1405 HRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP-ASKQDELRSLIDMCVQRGLSNKSID--PLEDPSQFCAATLSRLST 1481 (1525)
Q Consensus 1405 H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p-~~~~~qarkil~~ce~~~~d~~~i~--~~e~pf~iCa~tltpIy~ 1481 (1525)
|++||||+||+.+||+|||+||++||+|||+++| +++++||||++++||++|+|+++|+ ++ |||+||++||+|||+
T Consensus 1082 H~ilalrtA~n~ffK~kN~ktAs~fa~rLlel~~~~~~A~q~rki~~a~eknp~Da~~l~yd~~-n~f~iC~~t~~Piy~ 1160 (1202)
T KOG0292|consen 1082 HRILALRTAMNVFFKLKNLKTAAEFARRLLELAPSPPVAEQARKIKQAAEKNPTDAYELNYDPH-NPFVICGATYVPIYR 1160 (1202)
T ss_pred HHHHHHHHHHHHHHHhccHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhhcCcccccccCcccC-CCeeEecccceeeec
Confidence 9999999999999999999999999999999999 9999999999999999999999999 88 999999999999999
Q ss_pred -CCccccCCCCceecCCC-CCCCccccceeecCCCCCC
Q 000425 1482 -IGYDVCDLCGAKFSALS-APGCIICGMGSIKRSDALA 1517 (1525)
Q Consensus 1482 -~~~v~Cp~cgA~y~~~~-g~~C~iC~l~~IG~~~~~~ 1517 (1525)
.|.++||||||.|++.+ |++|+||++++||+ |+.+
T Consensus 1161 g~p~~~cp~cga~y~~~~~g~iCtvc~V~~ig~-~~~G 1197 (1202)
T KOG0292|consen 1161 GRPDVSCPYCGACFVPSSKGNICTVCDVGVIGA-DASG 1197 (1202)
T ss_pred CCCCcCCCcccceeccccCCceeeeeeeeeecC-cccc
Confidence 99999999999999999 99999999999999 5543
No 2
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=8.1e-107 Score=945.23 Aligned_cols=695 Identities=20% Similarity=0.270 Sum_probs=572.2
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
+|+.=++.|||+.||++++.. ..++.+|+.... ..+..+.- -.-||++..|-+..+++++|++|+.||||++.+++.
T Consensus 13 SdRVKsVd~HPtePw~la~Ly-nG~V~IWnyetq-tmVksfeV-~~~PvRa~kfiaRknWiv~GsDD~~IrVfnynt~ek 89 (794)
T KOG0276|consen 13 SDRVKSVDFHPTEPWILAALY-NGDVQIWNYETQ-TMVKSFEV-SEVPVRAAKFIARKNWIVTGSDDMQIRVFNYNTGEK 89 (794)
T ss_pred CCceeeeecCCCCceEEEeee-cCeeEEEecccc-eeeeeeee-cccchhhheeeeccceEEEecCCceEEEEeccccee
Confidence 567778999999999999987 688889986332 22222222 467999999999999999999999999999999999
Q ss_pred EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcC-CCCEEEEEECCC
Q 000425 86 HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP-MLRLLVTLCRDG 163 (1525)
Q Consensus 86 l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP-dg~~LaSgS~Dg 163 (1525)
+++|+ |.+.++ +++.||..++++|+|+|.+||+||++.. -.+.++++||++.|++++|+| |...++|||.|+
T Consensus 90 V~~FeAH~DyIR-----~iavHPt~P~vLtsSDDm~iKlW~we~~-wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDr 163 (794)
T KOG0276|consen 90 VKTFEAHSDYIR-----SIAVHPTLPYVLTSSDDMTIKLWDWENE-WACEQTFEGHEHYVMQVAFNPKDPNTFASASLDR 163 (794)
T ss_pred eEEeecccccee-----eeeecCCCCeEEecCCccEEEEeeccCc-eeeeeEEcCcceEEEEEEecCCCccceeeeeccc
Confidence 99999 999999 9999999999999999999999999953 556677899999999999999 567899999999
Q ss_pred CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecC----------------CCeeEEEEecCCCeEE
Q 000425 164 SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPL----------------PRVRALEVHPRLNLAV 227 (1525)
Q Consensus 164 tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l----------------~~V~~v~~HP~~Nlia 227 (1525)
|||||.+.... ++..-. -+..||+++| .+.+|++.|.+ .+|.+++.| +.|+.+
T Consensus 164 TVKVWslgs~~----~nfTl~-gHekGVN~Vd-----yy~~gdkpylIsgaDD~tiKvWDyQtk~CV~TLeGH-t~Nvs~ 232 (794)
T KOG0276|consen 164 TVKVWSLGSPH----PNFTLE-GHEKGVNCVD-----YYTGGDKPYLISGADDLTIKVWDYQTKSCVQTLEGH-TNNVSF 232 (794)
T ss_pred cEEEEEcCCCC----Cceeee-ccccCcceEE-----eccCCCcceEEecCCCceEEEeecchHHHHHHhhcc-cccceE
Confidence 99999986211 000000 1345677666 34444444432 234444444 444445
Q ss_pred EEeccc-----ccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCC---cc-cchhhHHHHHHHhhhccCC
Q 000425 228 LLFANF-----TGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGS---SG-ILADHQLQAQLQEHHLKGH 298 (1525)
Q Consensus 228 l~F~~~-----s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs---~g-~l~~~ql~~~~~~~~~~~~ 298 (1525)
+.|||. ++++|..+|+|.. .|++.+...+++. +|.|+.+. ++ ++.+| |
T Consensus 233 v~fhp~lpiiisgsEDGTvriWhs--------~Ty~lE~tLn~gl--eRvW~I~~~k~~~~i~vG~-------------D 289 (794)
T KOG0276|consen 233 VFFHPELPIIISGSEDGTVRIWNS--------KTYKLEKTLNYGL--ERVWCIAAHKGDGKIAVGF-------------D 289 (794)
T ss_pred EEecCCCcEEEEecCCccEEEecC--------cceehhhhhhcCC--ceEEEEeecCCCCeEEEec-------------c
Confidence 555554 8999999999983 5788888887764 77777764 22 23444 4
Q ss_pred CceeeehhhhhceeeeeecccccccCccccCcceeeeccccccCCCcccccccccccccccccccccccceeeEEeCceE
Q 000425 299 SHLTISDIARKAFLYSHFMEGHAKSAPISRLPLITIFDSKHQLKDIPVCQPFHLELNFFNRENRVLHYPVRAFYVDGINL 378 (1525)
Q Consensus 299 ~~~~I~di~r~~~i~s~~~d~~~k~~~~~~~p~~~i~~~~~~~~~~~vv~~~~~e~~~f~~~~~v~~~~~~~~y~~~~~l 378 (1525)
++..+-.++|..|..| ||.++|+ +|..+.+|
T Consensus 290 eg~i~v~lgreeP~vs--Md~~gKI-----------------------------------------------iwa~~~ei 320 (794)
T KOG0276|consen 290 EGSVTVKLGREEPAVS--MDSNGKI-----------------------------------------------IWAVHSEI 320 (794)
T ss_pred CCcEEEEccCCCCcee--ecCCccE-----------------------------------------------EEEcCcee
Confidence 4444445555555555 5555542 24555566
Q ss_pred EEeecCCCccc------cccccccccCCCeeeccccccccCCccEEEEEeeccCCcceEEEEEecccccccccC-CCCcc
Q 000425 379 VAYNLCSGADS------IYRKLYSTIPGTVEYYPKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDTQLADSK-SSTVK 451 (1525)
Q Consensus 379 ~~~~l~~g~~~------~~~~L~~~~~G~~e~~pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l~~rnk~-G~~~~ 451 (1525)
.+.++++-.+. +..+|++|++|++|+|||+|.||||||||+||| +|||||||+ ++||||. | +
T Consensus 321 ~~~~~ks~~~~~ev~DgErL~LsvKeLgs~eiyPq~L~hsPNGrfV~Vcg-----dGEyiIyTa---la~RnK~fG---~ 389 (794)
T KOG0276|consen 321 QAVNLKSVGAQKEVTDGERLPLSVKELGSVEIYPQTLAHSPNGRFVVVCG-----DGEYIIYTA---LALRNKAFG---S 389 (794)
T ss_pred eeeeceeccCcccccCCccccchhhhccccccchHHhccCCCCcEEEEec-----CccEEEEEe---eehhhcccc---c
Confidence 66665553222 689999999999999999999999999999999 599999995 9999995 9 9
Q ss_pred cceeEEeCCCCCcEEEEeCCCCeEEEEEcCCchhhhhhcccCCcccCCCccccccccccCCcccccccccceeeecCCcc
Q 000425 452 GRDAAFIGPNEDQFAILDDDKTGLALYILKGVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESEVDRIFSTPIES 531 (1525)
Q Consensus 452 g~~~~fig~~~n~yAvle~~~~~v~I~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~If~~pl~~ 531 (1525)
|++|+| +.++|.|||||++ +.|+||+|| ||+ ++++ .+..+++||+++
T Consensus 390 ~~eFvw-~~dsne~avRes~-~~vki~knf--------ke~--------------ksi~------~~~~~e~i~gg~--- 436 (794)
T KOG0276|consen 390 GLEFVW-AADSNEFAVRESN-GNVKIFKNF--------KEH--------------KSIR------PDMSAEGIFGGP--- 436 (794)
T ss_pred ceeEEE-cCCCCeEEEEecC-CceEEEecc--------eec--------------cccc------cccceeeecCCc---
Confidence 999999 5577899999886 489999999 665 4566 334599999986
Q ss_pred eeEEeecCCeEEEEeeccccccccccccccccccccceeeEeeccceEEEEEEcccCCccEEEEEeeeeEEEEccchhhh
Q 000425 532 TLMFACDGDQIGMAKLVQGYRLSARAGHYLQTKSEGKKSIKLKVTEVMLKVAWQETQRGYVAGVLTTQRVLIVSADLDIL 611 (1525)
Q Consensus 532 ~l~~~~~~~~i~~~~~~~~yd~~~~~~~~i~~~~e~~r~i~~~~~e~v~~V~Ws~~~~g~~vaI~t~~~i~i~~~~l~~l 611 (1525)
|+.+++.+++|| |||++. +++|||++.+. .|+|+++ |++|||.+++++||+.+|.+++
T Consensus 437 -Llg~~ss~~~~f------ydW~~~---------~lVrrI~v~~k----~v~w~d~--g~lVai~~d~Sfyil~~n~d~v 494 (794)
T KOG0276|consen 437 -LLGVRSSDFLCF------YDWESG---------ELVRRIEVTSK----HVYWSDN--GELVAIAGDDSFYILKFNADAV 494 (794)
T ss_pred -eEEEEeCCeEEE------EEcccc---------eEEEEEeeccc----eeEEecC--CCEEEEEecCceeEEEecHHHH
Confidence 555899999999 799988 67899999888 8999999 9999999999999999999987
Q ss_pred hccccc----cCCC-----------CCcceeeeeecCEEEEEcccc-cEEEeeCCceeeEEEecCCceEEEEEec--CcE
Q 000425 612 ASSSTK----FDKG-----------LPSFRSLLWVGPALLFSTATA-ISVLGWDGKVRNILSISMPNAVLVGALN--DRL 673 (1525)
Q Consensus 612 ~~~~~~----~~~~-----------~~~IkSg~Wvg~~fIYtT~t~-l~Yl~~~G~~~~I~~l~~~~~yLlgyl~--~Rl 673 (1525)
.+..++ .|+| +|+|++|.|+||||||||++| +||++ +|++.+|+|++. +|||+||++ +|+
T Consensus 495 ~~a~e~g~~v~eeGiedAfevLgE~sE~v~tg~WvgD~fiytts~nrlnY~v-gGe~~~v~h~~~-~mylLgy~~~~~rv 572 (794)
T KOG0276|consen 495 ANAVEQGIEVTEEGIEDAFEVLGEVSESVKTGKWVGDCFIYTTSNNRLNYLV-GGETYTVAHLDR-IMYLLGYVANDNRV 572 (794)
T ss_pred HHHHhcCCCCcchhHHHHHHHHhhhhhheeeceeeeeEEEEeecccceeEEc-CCceEEEEEecc-chhheeeeecCCEE
Confidence 666653 3333 799999999999999999999 99999 779999999998 699999999 799
Q ss_pred EEecCCCCCcccCCceeEEeee--ccchhhHHHhhhhhhHHHHHhhhHHHHHHHHHhccCCCCCChhHHHHHHc--CCCC
Q 000425 674 LLANPTEINPRQKKGIEIKSCL--VGLLEPLLIGFATMQQYFEQKLDLSEILYQITSRFDSLRITPRSLDILAK--GPPV 749 (1525)
Q Consensus 674 ~l~~~~~~~p~~dk~~~I~s~~--l~llE~ll~g~a~~~~~~~~t~~lr~~l~~~~~~~~s~~i~~~~l~~L~~--~~~~ 749 (1525)
||. ||+++|.++. +.++|| ||++||+|++.+...++. |+..++.+++. ++||
T Consensus 573 YL~---------Dke~nVi~y~l~l~vley-------------qt~vmrrd~~~a~~vLp~--I~k~~rt~va~Fle~~g 628 (794)
T KOG0276|consen 573 YLH---------DKELNVISYKILLEVLEY-------------QTLVLRRDLEVADGVLPT--IPKEIRTKVAHFLESQG 628 (794)
T ss_pred EEe---------ecccceEeEeeehHHHHH-------------HHHhhhcccccccccccc--CchhhhhhHHhHhhhcc
Confidence 999 9999988865 799995 499999999999998988 55555555555 5899
Q ss_pred cccchhhccccCCCCCcchhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHH
Q 000425 750 CGDLAVSLSQAGPQFTQVLRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETF 829 (1525)
Q Consensus 750 ~~elAl~ls~~~~~~d~~~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~ 829 (1525)
|+|-||++++ |+++|||||||+|+|++|+++|.+++ .+.|||+||++||..|++.+|+|||
T Consensus 629 ~~e~AL~~s~-----D~d~rFelal~lgrl~iA~~la~e~~--------------s~~Kw~~Lg~~al~~~~l~lA~EC~ 689 (794)
T KOG0276|consen 629 MKEQALELST-----DPDQRFELALKLGRLDIAFDLAVEAN--------------SEVKWRQLGDAALSAGELPLASECF 689 (794)
T ss_pred chHhhhhcCC-----ChhhhhhhhhhcCcHHHHHHHHHhhc--------------chHHHHHHHHHHhhcccchhHHHHH
Confidence 9999999999 89999999999999999999999875 5889999999999999999999999
Q ss_pred HHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhH---------HHHHHHHHhhccCCcccchhhhhhccccCCCC-
Q 000425 830 EVIADYESILDLFICHLNPSAMRRLAQRLEEEGANPEL---------RRYCERILRVRSTGWTQGIFANFAAESMVPKG- 899 (1525)
Q Consensus 830 q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~---------~~~c~~il~~~~~~~~~~~~a~~~a~s~~~~~- 899 (1525)
++++||++|||||+++||.++|.+|++.|+++|.+|.+ ...|..+|..++|-.++++||++|.+|.|..-
T Consensus 690 ~~a~d~~~LlLl~t~~g~~~~l~~la~~~~~~g~~N~AF~~~~l~g~~~~C~~lLi~t~r~peAal~ArtYlps~vs~iv 769 (794)
T KOG0276|consen 690 LRARDLGSLLLLYTSSGNAEGLAVLASLAKKQGKNNLAFLAYFLSGDYEECLELLISTQRLPEAALFARTYLPSQVSRIV 769 (794)
T ss_pred HhhcchhhhhhhhhhcCChhHHHHHHHHHHhhcccchHHHHHHHcCCHHHHHHHHHhcCcCcHHHHHHhhhChHHHHHHH
Confidence 99999999999999999999999999999999998887 33899999999999999999999999988777
Q ss_pred CCCC
Q 000425 900 PEWG 903 (1525)
Q Consensus 900 ~e~~ 903 (1525)
++|-
T Consensus 770 ~~wk 773 (794)
T KOG0276|consen 770 ELWK 773 (794)
T ss_pred HHHH
Confidence 7776
No 3
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=100.00 E-value=9.1e-71 Score=661.88 Aligned_cols=408 Identities=19% Similarity=0.252 Sum_probs=298.9
Q ss_pred CceEEEeecCCC---ccccccccccccCCCeeeccccccccCCccEEEEEeeccCCcceEEEEEecccccccccC-CCCc
Q 000425 375 GINLVAYNLCSG---ADSIYRKLYSTIPGTVEYYPKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDTQLADSK-SSTV 450 (1525)
Q Consensus 375 ~~~l~~~~l~~g---~~~~~~~L~~~~~G~~e~~pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l~~rnk~-G~~~ 450 (1525)
+++|+++|++++ .+.....++.|++|++|++|++|+||||||||+||| +|||+||+ +++|||+. |
T Consensus 1 ~~~i~~~~~~~~~~~~dg~~~~l~~k~lg~~~~~p~~ls~npngr~v~V~g-----~geY~iyt---~~~~r~k~~G--- 69 (443)
T PF04053_consen 1 HNEIRTANLKNISEIKDGERLPLSVKELGSCEIYPQSLSHNPNGRFVLVCG-----DGEYEIYT---ALAWRNKAFG--- 69 (443)
T ss_dssp -TEEEEEE--S-----TTS-B----EEEEE-SS--SEEEE-TTSSEEEEEE-----TTEEEEEE---TTTTEEEEEE---
T ss_pred CCceEEEECcCCCccCCCceeeEEeccCCCCCcCCeeEEECCCCCEEEEEc-----CCEEEEEE---ccCCcccccC---
Confidence 468899999988 333578889999999999999999999999999999 69999999 48999995 8
Q ss_pred ccceeEEeCCCCCcEEEEeCCCCeEEEEEcCCchhhhhhcccCCcccCCCccccccccccCCcccccccccceeeecCCc
Q 000425 451 KGRDAAFIGPNEDQFAILDDDKTGLALYILKGVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESEVDRIFSTPIE 530 (1525)
Q Consensus 451 ~g~~~~fig~~~n~yAvle~~~~~v~I~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~If~~pl~ 530 (1525)
+|.+|||+++ |+|||+|+ +.+|+||+++ ++... ++++ ++..+++||+|
T Consensus 70 ~g~~~vw~~~--n~yAv~~~-~~~I~I~kn~--------~~~~~------------k~i~------~~~~~~~If~G--- 117 (443)
T PF04053_consen 70 SGLSFVWSSR--NRYAVLES-SSTIKIYKNF--------KNEVV------------KSIK------LPFSVEKIFGG--- 117 (443)
T ss_dssp E-SEEEE-TS--SEEEEE-T-TS-EEEEETT--------EE-TT-----------------------SS-EEEEE-S---
T ss_pred ceeEEEEecC--ccEEEEEC-CCeEEEEEcC--------ccccc------------eEEc------CCcccceEEcC---
Confidence 9999999874 68999998 4689999998 44432 3455 44669999984
Q ss_pred ceeEEeecCCeEEEEeeccccccccccccccccccccceeeEeeccceEEEEEEcccCCccEEEEEeeeeEEEEccchh-
Q 000425 531 STLMFACDGDQIGMAKLVQGYRLSARAGHYLQTKSEGKKSIKLKVTEVMLKVAWQETQRGYVAGVLTTQRVLIVSADLD- 609 (1525)
Q Consensus 531 ~~l~~~~~~~~i~~~~~~~~yd~~~~~~~~i~~~~e~~r~i~~~~~e~v~~V~Ws~~~~g~~vaI~t~~~i~i~~~~l~- 609 (1525)
.|++++++++|+| |||+++. ..|+|++.+ |++|+|+++ |++|||+|++.|||++++++
T Consensus 118 -~LL~~~~~~~i~~------yDw~~~~---------~i~~i~v~~---vk~V~Ws~~--g~~val~t~~~i~il~~~~~~ 176 (443)
T PF04053_consen 118 -NLLGVKSSDFICF------YDWETGK---------LIRRIDVSA---VKYVIWSDD--GELVALVTKDSIYILKYNLEA 176 (443)
T ss_dssp -SSEEEEETTEEEE------E-TTT-----------EEEEESS-E----EEEEE-TT--SSEEEEE-S-SEEEEEE-HHH
T ss_pred -cEEEEECCCCEEE------EEhhHcc---------eeeEEecCC---CcEEEEECC--CCEEEEEeCCeEEEEEecchh
Confidence 5666888889999 7999883 346776653 779999999 99999999999999999999
Q ss_pred -----------hhhccccccCCCCCcceeeeeecCEEEEEcccccEEEeeCCceeeEEEecCCceEEEEEec--CcEEEe
Q 000425 610 -----------ILASSSTKFDKGLPSFRSLLWVGPALLFSTATAISVLGWDGKVRNILSISMPNAVLVGALN--DRLLLA 676 (1525)
Q Consensus 610 -----------~l~~~~~~~~~~~~~IkSg~Wvg~~fIYtT~t~l~Yl~~~G~~~~I~~l~~~~~yLlgyl~--~Rl~l~ 676 (1525)
+++..+|. .++||||+|+++||||||.+||||++ ||++++|+||+. ++||+||++ +|+|++
T Consensus 177 ~~~~~~~g~e~~f~~~~E~----~~~IkSg~W~~d~fiYtT~~~lkYl~-~Ge~~~i~~ld~-~~yllgy~~~~~~ly~~ 250 (443)
T PF04053_consen 177 VAAIPEEGVEDAFELIHEI----SERIKSGCWVEDCFIYTTSNHLKYLV-NGETGIIAHLDK-PLYLLGYLPKENRLYLI 250 (443)
T ss_dssp HHHBTTTB-GGGEEEEEEE-----S--SEEEEETTEEEEE-TTEEEEEE-TTEEEEEEE-SS---EEEEEETTTTEEEEE
T ss_pred cccccccCchhceEEEEEe----cceeEEEEEEcCEEEEEcCCeEEEEE-cCCcceEEEcCC-ceEEEEEEccCCEEEEE
Confidence 55555552 36999999999999999999999955 999999999998 699999999 999999
Q ss_pred cCCCCCcccCCceeEEeeeccchhhHHHhhhhhhHHHHHhhhHHHHHHHH------HhccCCCCC--ChhHHHHHHcCCC
Q 000425 677 NPTEINPRQKKGIEIKSCLVGLLEPLLIGFATMQQYFEQKLDLSEILYQI------TSRFDSLRI--TPRSLDILAKGPP 748 (1525)
Q Consensus 677 ~~~~~~p~~dk~~~I~s~~l~llE~ll~g~a~~~~~~~~t~~lr~~l~~~------~~~~~s~~i--~~~~l~~L~~~~~ 748 (1525)
|++.+|.++.|++.|+ . | +++++|+|++.+ .+.+++.+. .++...||.+ +
T Consensus 251 ---------Dr~~~v~~~~ld~~~~-~---------f-k~av~~~d~~~v~~~i~~~~ll~~i~~~~~~~i~~fL~~--~ 308 (443)
T PF04053_consen 251 ---------DRDGNVISYELDLSEL-E---------F-KTAVLRGDFEEVLRMIAASNLLPNIPKDQGQSIARFLEK--K 308 (443)
T ss_dssp ----------TT--EEEEE--HHHH-H---------H-HHHHHTT-HHH-----HHHHTGGG--HHHHHHHHHHHHH--T
T ss_pred ---------ECCCCEEEEEECHHHH-H---------H-HHHHHcCChhhhhhhhhhhhhcccCChhHHHHHHHHHHH--C
Confidence 8999999988877773 3 3 378888888873 444544111 3555677777 5
Q ss_pred CcccchhhccccCCCCCcchhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHH
Q 000425 749 VCGDLAVSLSQAGPQFTQVLRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKET 828 (1525)
Q Consensus 749 ~~~elAl~ls~~~~~~d~~~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec 828 (1525)
||+|+||++++ |+++||||||++|||++|+++|++.+ ..++||+||++||++||+++||+|
T Consensus 309 G~~e~AL~~~~-----D~~~rFeLAl~lg~L~~A~~~a~~~~--------------~~~~W~~Lg~~AL~~g~~~lAe~c 369 (443)
T PF04053_consen 309 GYPELALQFVT-----DPDHRFELALQLGNLDIALEIAKELD--------------DPEKWKQLGDEALRQGNIELAEEC 369 (443)
T ss_dssp T-HHHHHHHSS------HHHHHHHHHHCT-HHHHHHHCCCCS--------------THHHHHHHHHHHHHTTBHHHHHHH
T ss_pred CCHHHHHhhcC-----ChHHHhHHHHhcCCHHHHHHHHHhcC--------------cHHHHHHHHHHHHHcCCHHHHHHH
Confidence 69999999999 99999999999999999999998753 467999999999999999999999
Q ss_pred HHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhH---------HHHHHHHHhhccCCcccchhhhhhcc
Q 000425 829 FEVIADYESILDLFICHLNPSAMRRLAQRLEEEGANPEL---------RRYCERILRVRSTGWTQGIFANFAAE 893 (1525)
Q Consensus 829 ~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~---------~~~c~~il~~~~~~~~~~~~a~~~a~ 893 (1525)
|++++||++|+|||+++||+++|+||+++|+++|..+.+ ...|..||...++-.++.+||++|++
T Consensus 370 ~~k~~d~~~L~lLy~~~g~~~~L~kl~~~a~~~~~~n~af~~~~~lgd~~~cv~lL~~~~~~~~A~~~A~ty~~ 443 (443)
T PF04053_consen 370 YQKAKDFSGLLLLYSSTGDREKLSKLAKIAEERGDINIAFQAALLLGDVEECVDLLIETGRLPEAALFARTYGP 443 (443)
T ss_dssp HHHCT-HHHHHHHHHHCT-HHHHHHHHHHHHHTT-HHHHHHHHHHHT-HHHHHHHHHHTT-HHHHHHHHHHTT-
T ss_pred HHhhcCccccHHHHHHhCCHHHHHHHHHHHHHccCHHHHHHHHHHcCCHHHHHHHHHHcCCchHHHHHHHhcCC
Confidence 999999999999999999999999999999999955443 22788888877777777777777754
No 4
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=100.00 E-value=9.3e-71 Score=647.76 Aligned_cols=233 Identities=20% Similarity=0.377 Sum_probs=184.5
Q ss_pred CCCccccccccCCCCCCCCCCCCCCCCCCCCCCCCccCCchhh-HHHHHHHHHHhhcCChHHHHHHHHHHHH----HHhc
Q 000425 1268 STQPLDLSALGVPNSGDSGKSPANPASPPTSVRPGQVPRGAAA-SVCFKTGLAHLEQNQLPDALSCFDEAFL----ALAK 1342 (1525)
Q Consensus 1268 ~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~-~~~lk~gy~~~t~gkf~eAl~~Fr~iL~----~~v~ 1342 (1525)
+.+||..++++++.+. .++...| .++++++. .++||.|||+||+|||+||+..||+||| ++|+
T Consensus 173 ~~p~l~~~~~r~~~~~-----------~~~~~lP-~i~~~l~~L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~ 240 (422)
T PF06957_consen 173 SLPPLPSYIRRNWDES-----------NPKNGLP-AIPLSLSSLEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVE 240 (422)
T ss_dssp TTS-EEEEEBCTTTTS-----------SSCCG-B-B----HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BS
T ss_pred CCCCccccccCCcccc-----------ccccCCC-cCcCCHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeec
Confidence 4457788888888773 2334566 56777776 7799999999999999999999999999 7899
Q ss_pred CCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHHHHHHhhcCCCCchhHHHHHHHHHHHHHHHhcc
Q 000425 1343 DHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMARLSRHLGSLPLQTKHRINCIRTAIKRNMEVQN 1422 (1525)
Q Consensus 1343 ~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~lELaaYfT~c~Lqp~H~~L~Lr~Am~~~~K~kN 1422 (1525)
+++|++|++++|+||||||+||+| |++||+ +++ ++ +.++||+||||||||||+|||.|++||||+||+++||+||
T Consensus 241 ~~~E~~e~~eli~icrEYilgl~i--El~Rr~-l~~-~~-~~~~kR~lELAAYFThc~LQp~H~~LaLr~AM~~~~K~KN 315 (422)
T PF06957_consen 241 SREEEDEAKELIEICREYILGLSI--ELERRE-LPK-DP-VEDQKRNLELAAYFTHCKLQPSHLILALRSAMSQAFKLKN 315 (422)
T ss_dssp SCHHHHHHHHHHHHHHHHHHHHHH--HHHHCT-S-T-TT-HHHHHHHHHHHHHHCCS---HHHHHHHHHHHHHHCCCTTB
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH--HHHHHh-ccc-cc-hhhHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHHHhcc
Confidence 999999999999999999999999 999999 743 33 4567799999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHccCC-hhhHHHHHHHHHHHHhcCCCCcccC--CCCCCceeeeccceeccC-CCccccCCCCceecCCC
Q 000425 1423 YAYAKQMLELLLSKAP-ASKQDELRSLIDMCVQRGLSNKSID--PLEDPSQFCAATLSRLST-IGYDVCDLCGAKFSALS 1498 (1525)
Q Consensus 1423 y~tAa~fA~rLL~l~p-~~~~~qarkil~~ce~~~~d~~~i~--~~e~pf~iCa~tltpIy~-~~~v~Cp~cgA~y~~~~ 1498 (1525)
|+||++||||||+++| +++++||||||++|||+++|+++|+ ++ +||+||++||||||+ +++|+||||||+|+++|
T Consensus 316 f~tAa~FArRLLel~p~~~~a~qArKil~~~e~~~tDa~~i~yD~~-npF~ICa~s~tPIY~G~~~v~CP~cgA~y~~~~ 394 (422)
T PF06957_consen 316 FITAASFARRLLELNPSPEVAEQARKILQACERNPTDAHEIDYDER-NPFDICAASYTPIYRGSPSVKCPYCGAKYHPEY 394 (422)
T ss_dssp HHHHHHHHHHHHCT--SCHHHHHHHHHHHHHCCS--BSS--S--TT-S-EEEBTTT--EEETTS-EEE-TTT--EEEGGG
T ss_pred HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhcCCCCceecCCCCC-CCceeeecccccccCCCCCeeCCCCCCccChhh
Confidence 9999999999999999 9999999999999999999999998 88 999999999999999 99999999999999999
Q ss_pred -CCCCccccceeecCCCCCCCCC
Q 000425 1499 -APGCIICGMGSIKRSDALAGPV 1520 (1525)
Q Consensus 1499 -g~~C~iC~l~~IG~~~~~~~~~ 1520 (1525)
|++|+||+|++||+ |+ .|-+
T Consensus 395 kG~lC~vC~l~~IG~-~a-~GLr 415 (422)
T PF06957_consen 395 KGQLCPVCELSEIGA-DA-SGLR 415 (422)
T ss_dssp TTSB-TTTTTBBTT-----S---
T ss_pred CCCCCCCCcceeeCC-cc-eeeE
Confidence 99999999999999 55 3443
No 5
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=2.7e-33 Score=329.62 Aligned_cols=628 Identities=18% Similarity=0.201 Sum_probs=367.7
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k 84 (1525)
+|-..+|++||++|+++.++ +||.|++|+ +++.+.......+|++.|++++|+|.++ .|+|+|.|+|||+|.+.+..
T Consensus 97 ~DyIR~iavHPt~P~vLtsS-DDm~iKlW~-we~~wa~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrTVKVWslgs~~ 174 (794)
T KOG0276|consen 97 SDYIRSIAVHPTLPYVLTSS-DDMTIKLWD-WENEWACEQTFEGHEHYVMQVAFNPKDPNTFASASLDRTVKVWSLGSPH 174 (794)
T ss_pred ccceeeeeecCCCCeEEecC-CccEEEEee-ccCceeeeeEEcCcceEEEEEEecCCCccceeeeeccccEEEEEcCCCC
Confidence 46678999999999998877 699999999 6888888888888999999999999875 79999999999999999999
Q ss_pred EEEEEe-cCCceeecCceeEEEeC--CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHP--TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSP--dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
+.++|+ |...|+ ||+|-+ |.++|+||++|.+|+|||++ +++|++++.||++.|+++.|||..++|+|||.
T Consensus 175 ~nfTl~gHekGVN-----~Vdyy~~gdkpylIsgaDD~tiKvWDyQ--tk~CV~TLeGHt~Nvs~v~fhp~lpiiisgsE 247 (794)
T KOG0276|consen 175 PNFTLEGHEKGVN-----CVDYYTGGDKPYLISGADDLTIKVWDYQ--TKSCVQTLEGHTNNVSFVFFHPELPIIISGSE 247 (794)
T ss_pred CceeeeccccCcc-----eEEeccCCCcceEEecCCCceEEEeecc--hHHHHHHhhcccccceEEEecCCCcEEEEecC
Confidence 999999 999999 999987 55899999999999999999 58888899999999999999999999999999
Q ss_pred CCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccc
Q 000425 162 DGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKN 241 (1525)
Q Consensus 162 DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~ 241 (1525)
|||+|||+..++.. ...| .|.+.|+|+++.|++.+.+++.|+.
T Consensus 248 DGTvriWhs~Ty~l---------------------E~tL-------n~gleRvW~I~~~k~~~~i~vG~De--------- 290 (794)
T KOG0276|consen 248 DGTVRIWNSKTYKL---------------------EKTL-------NYGLERVWCIAAHKGDGKIAVGFDE--------- 290 (794)
T ss_pred CccEEEecCcceeh---------------------hhhh-------hcCCceEEEEeecCCCCeEEEeccC---------
Confidence 99999998765542 1122 3666799999999999999988876
Q ss_pred cccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeeeeeccccc
Q 000425 242 RAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSHFMEGHA 321 (1525)
Q Consensus 242 k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~~~d~~~ 321 (1525)
+..+++..++ ++..+|+++|+|. +++...|++. ++
T Consensus 291 -----------g~i~v~lgre-------eP~vsMd~~gKIi-----------------------wa~~~ei~~~----~~ 325 (794)
T KOG0276|consen 291 -----------GSVTVKLGRE-------EPAVSMDSNGKII-----------------------WAVHSEIQAV----NL 325 (794)
T ss_pred -----------CcEEEEccCC-------CCceeecCCccEE-----------------------EEcCceeeee----ec
Confidence 2457777777 8999999999773 1122223221 11
Q ss_pred cc-------CccccCcceeeeccccccCCCcccccccccccccccccccccccceeeEE---eCceEEEeecCCCccccc
Q 000425 322 KS-------APISRLPLITIFDSKHQLKDIPVCQPFHLELNFFNRENRVLHYPVRAFYV---DGINLVAYNLCSGADSIY 391 (1525)
Q Consensus 322 k~-------~~~~~~p~~~i~~~~~~~~~~~vv~~~~~e~~~f~~~~~v~~~~~~~~y~---~~~~l~~~~l~~g~~~~~ 391 (1525)
|+ .-..++|+.+= +++. | -+|. ..+-|.|+--|.+ +|..|...-|.
T Consensus 326 ks~~~~~ev~DgErL~LsvK-----eLgs---~-------eiyP--q~L~hsPNGrfV~VcgdGEyiIyTala------- 381 (794)
T KOG0276|consen 326 KSVGAQKEVTDGERLPLSVK-----ELGS---V-------EIYP--QTLAHSPNGRFVVVCGDGEYIIYTALA------- 381 (794)
T ss_pred eeccCcccccCCccccchhh-----hccc---c-------ccch--HHhccCCCCcEEEEecCccEEEEEeee-------
Confidence 11 11233333321 1111 0 0111 1111222222232 45554422222
Q ss_pred cccccccCCCeeeccccccccCCccEEEEEeeccCCcceEEEEEecccc-cccccCCCCcccceeEEeCCCCCcEEEEeC
Q 000425 392 RKLYSTIPGTVEYYPKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDT-QLADSKSSTVKGRDAAFIGPNEDQFAILDD 470 (1525)
Q Consensus 392 ~~L~~~~~G~~e~~pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l-~~rnk~G~~~~g~~~~fig~~~n~yAvle~ 470 (1525)
+-.|..|+. +-+.+++.-.-++|-- .++.+.||+-.... .+|.+.+ .+++|.|. -.++..+
T Consensus 382 --~RnK~fG~~----~eFvw~~dsne~avRe----s~~~vki~knfke~ksi~~~~~-----~e~i~gg~---Llg~~ss 443 (794)
T KOG0276|consen 382 --LRNKAFGSG----LEFVWAADSNEFAVRE----SNGNVKIFKNFKEHKSIRPDMS-----AEGIFGGP---LLGVRSS 443 (794)
T ss_pred --hhhcccccc----eeEEEcCCCCeEEEEe----cCCceEEEecceeccccccccc-----eeeecCCc---eEEEEeC
Confidence 334455653 2334455422233332 14566676532222 2333332 67888543 4555544
Q ss_pred CCCeEEEEEcCCchhhhhhcccCCcccCCCccccccccccCCcccccccccceeeecCCcceeEEeecCCeEEEEeeccc
Q 000425 471 DKTGLALYILKGVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESEVDRIFSTPIESTLMFACDGDQIGMAKLVQG 550 (1525)
Q Consensus 471 ~~~~v~I~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~If~~pl~~~l~~~~~~~~i~~~~~~~~ 550 (1525)
..+.+|.-..... ++ +++...+.|||...+..|++++..+++.+..|.+.
T Consensus 444 --~~~~fydW~~~~l-----------------------Vr-----rI~v~~k~v~w~d~g~lVai~~d~Sfyil~~n~d~ 493 (794)
T KOG0276|consen 444 --DFLCFYDWESGEL-----------------------VR-----RIEVTSKHVYWSDNGELVAIAGDDSFYILKFNADA 493 (794)
T ss_pred --CeEEEEEcccceE-----------------------EE-----EEeeccceeEEecCCCEEEEEecCceeEEEecHHH
Confidence 3688886431111 11 25567899999988889999999999999877666
Q ss_pred cccccccccccccccccceeeEeeccceEEEEEEcccCC-ccEEEEEeeeeEEEEccchhhhhccccccCCCCCcceeee
Q 000425 551 YRLSARAGHYLQTKSEGKKSIKLKVTEVMLKVAWQETQR-GYVAGVLTTQRVLIVSADLDILASSSTKFDKGLPSFRSLL 629 (1525)
Q Consensus 551 yd~~~~~~~~i~~~~e~~r~i~~~~~e~v~~V~Ws~~~~-g~~vaI~t~~~i~i~~~~l~~l~~~~~~~~~~~~~IkSg~ 629 (1525)
+.-....+..+...+.. .++|..-++ +|.++ |.-| .+.-||..+ + .. --.
T Consensus 494 v~~a~e~g~~v~eeGie------dAfevLgE~--sE~v~tg~Wv---gD~fiytts-~-nr----------------lnY 544 (794)
T KOG0276|consen 494 VANAVEQGIEVTEEGIE------DAFEVLGEV--SESVKTGKWV---GDCFIYTTS-N-NR----------------LNY 544 (794)
T ss_pred HHHHHhcCCCCcchhHH------HHHHHHhhh--hhheeeceee---eeEEEEeec-c-cc----------------eeE
Confidence 64444433333222111 233332222 22211 1112 222222211 1 11 111
Q ss_pred eecCEEEEEc-ccccEEEeeCCceee-EEEecCCceEEEEEecCcEEEecCCCCCcccCCceeEEeeec-cchhhHHHhh
Q 000425 630 WVGPALLFST-ATAISVLGWDGKVRN-ILSISMPNAVLVGALNDRLLLANPTEINPRQKKGIEIKSCLV-GLLEPLLIGF 706 (1525)
Q Consensus 630 Wvg~~fIYtT-~t~l~Yl~~~G~~~~-I~~l~~~~~yLlgyl~~Rl~l~~~~~~~p~~dk~~~I~s~~l-~llE~ll~g~ 706 (1525)
.+|+=-.+.- ....+||++.=.... ++-.|. .+-..+|--+-=++-+++.. .+.+++.-...| .+-+...-
T Consensus 545 ~vgGe~~~v~h~~~~mylLgy~~~~~rvYL~Dk-e~nVi~y~l~l~vleyqt~v---mrrd~~~a~~vLp~I~k~~rt-- 618 (794)
T KOG0276|consen 545 LVGGETYTVAHLDRIMYLLGYVANDNRVYLHDK-ELNVISYKILLEVLEYQTLV---LRRDLEVADGVLPTIPKEIRT-- 618 (794)
T ss_pred EcCCceEEEEEeccchhheeeeecCCEEEEeec-ccceEeEeeehHHHHHHHHh---hhccccccccccccCchhhhh--
Confidence 2332221111 112556552200000 111111 11112221000000011000 011111100000 00000000
Q ss_pred hhhhHHHHHhhhHHHHHHHHHhccCCCCCChhHHHHHHcCCCCcccchhhccccCCCCCcchh----hhHHhhcCCHHHH
Q 000425 707 ATMQQYFEQKLDLSEILYQITSRFDSLRITPRSLDILAKGPPVCGDLAVSLSQAGPQFTQVLR----GIYAIKALRFSTA 782 (1525)
Q Consensus 707 a~~~~~~~~t~~lr~~l~~~~~~~~s~~i~~~~l~~L~~~~~~~~elAl~ls~~~~~~d~~~r----F~LAl~lg~l~~A 782 (1525)
.+..+|+ + +|.-+++++.- .++..+=-|+.+ .|..++|.++++. .+.+.| -++|++.|+|..|
T Consensus 619 -~va~Fle-~---~g~~e~AL~~s----~D~d~rFelal~-lgrl~iA~~la~e---~~s~~Kw~~Lg~~al~~~~l~lA 685 (794)
T KOG0276|consen 619 -KVAHFLE-S---QGMKEQALELS----TDPDQRFELALK-LGRLDIAFDLAVE---ANSEVKWRQLGDAALSAGELPLA 685 (794)
T ss_pred -hHHhHhh-h---ccchHhhhhcC----CChhhhhhhhhh-cCcHHHHHHHHHh---hcchHHHHHHHHHHhhcccchhH
Confidence 0111121 1 12223343311 223332233332 3478999998873 244444 5899999999999
Q ss_pred HHHHHHhhcccCCCCCCC--CC-hhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCH
Q 000425 783 LSVLKDEFLRSRDYPKCP--PT-SQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNP 848 (1525)
Q Consensus 783 ~~ia~~~~~~~~d~~~~~--~~-~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~ 848 (1525)
.|.+.. ++|+-..= .+ +...+--..||..|-++|.+++|--||-..+|++.-+-|.++|+..
T Consensus 686 ~EC~~~----a~d~~~LlLl~t~~g~~~~l~~la~~~~~~g~~N~AF~~~~l~g~~~~C~~lLi~t~r~ 750 (794)
T KOG0276|consen 686 SECFLR----ARDLGSLLLLYTSSGNAEGLAVLASLAKKQGKNNLAFLAYFLSGDYEECLELLISTQRL 750 (794)
T ss_pred HHHHHh----hcchhhhhhhhhhcCChhHHHHHHHHHHhhcccchHHHHHHHcCCHHHHHHHHHhcCcC
Confidence 998754 34543100 00 1112224569999999999999999999999999999999888753
No 6
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.92 E-value=6e-22 Score=240.67 Aligned_cols=244 Identities=18% Similarity=0.268 Sum_probs=192.5
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
...-|+|||+||+. ++..+|-.|+||+. ...++.+.+.+ |-+.|+.+.||+.-++++|+|+|.|||||+..+.+++.
T Consensus 53 pVRgv~FH~~qplF-VSGGDDykIkVWnY-k~rrclftL~G-HlDYVRt~~FHheyPWIlSASDDQTIrIWNwqsr~~ia 129 (1202)
T KOG0292|consen 53 PVRGVDFHPTQPLF-VSGGDDYKIKVWNY-KTRRCLFTLLG-HLDYVRTVFFHHEYPWILSASDDQTIRIWNWQSRKCIA 129 (1202)
T ss_pred ccceeeecCCCCeE-EecCCccEEEEEec-ccceehhhhcc-ccceeEEeeccCCCceEEEccCCCeEEEEeccCCceEE
Confidence 45679999999974 55567999999995 55677778887 99999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC-----c----------------------eeeEeecc
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER-----P----------------------SMIGIIQV 139 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-----~----------------------s~i~il~g 139 (1525)
.+. |...|. |-+|||....++|+|-|.||||||+...+ + -+..++.|
T Consensus 130 vltGHnHYVM-----cAqFhptEDlIVSaSLDQTVRVWDisGLRkk~~~pg~~e~~~~~~~~~~dLfg~~DaVVK~VLEG 204 (1202)
T KOG0292|consen 130 VLTGHNHYVM-----CAQFHPTEDLIVSASLDQTVRVWDISGLRKKNKAPGSLEDQMRGQQGNSDLFGQTDAVVKHVLEG 204 (1202)
T ss_pred EEecCceEEE-----eeccCCccceEEEecccceEEEEeecchhccCCCCCCchhhhhccccchhhcCCcCeeeeeeecc
Confidence 999 999999 99999999999999999999999986310 0 01135789
Q ss_pred cCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEE
Q 000425 140 GSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEV 219 (1525)
Q Consensus 140 H~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~ 219 (1525)
|...|+-++|||..++|+||++|..||+|..+.+.. |+ +| +.-++ ..+|.++.|
T Consensus 205 HDRGVNwaAfhpTlpliVSG~DDRqVKlWrmnetKa----------WE------vD---tcrgH-------~nnVssvlf 258 (1202)
T KOG0292|consen 205 HDRGVNWAAFHPTLPLIVSGADDRQVKLWRMNETKA----------WE------VD---TCRGH-------YNNVSSVLF 258 (1202)
T ss_pred cccccceEEecCCcceEEecCCcceeeEEEeccccc----------ee------eh---hhhcc-------cCCcceEEe
Confidence 999999999999999999999999999998863221 11 11 11111 136889999
Q ss_pred ecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCc--ccc--hhhHHHHHHHhhhc
Q 000425 220 HPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSS--GIL--ADHQLQAQLQEHHL 295 (1525)
Q Consensus 220 HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~--g~l--~~~ql~~~~~~~~~ 295 (1525)
||..|++. +.++|+.+++|.+... ..+.++|-.+ +|||.+... +.+ ++|
T Consensus 259 hp~q~lIl------SnsEDksirVwDm~kR--t~v~tfrren--------dRFW~laahP~lNLfAAgH----------- 311 (1202)
T KOG0292|consen 259 HPHQDLIL------SNSEDKSIRVWDMTKR--TSVQTFRREN--------DRFWILAAHPELNLFAAGH----------- 311 (1202)
T ss_pred cCccceeE------ecCCCccEEEEecccc--cceeeeeccC--------CeEEEEEecCCcceeeeec-----------
Confidence 99999988 9999999999987543 2233444322 577877764 222 445
Q ss_pred cCCCceeeehhhhhceeee
Q 000425 296 KGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 296 ~~~~~~~I~di~r~~~i~s 314 (1525)
|+++.|.++-|..++|-
T Consensus 312 --DsGm~VFkleRErpa~~ 328 (1202)
T KOG0292|consen 312 --DSGMIVFKLERERPAYA 328 (1202)
T ss_pred --CCceEEEEEcccCceEE
Confidence 77777777777766654
No 7
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.91 E-value=8.1e-24 Score=239.79 Aligned_cols=269 Identities=16% Similarity=0.202 Sum_probs=202.2
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
++..++++|+|++..+..|+ .|.++++|+. +...+....++ |++.|.|++|+|||+.+|+||.||+|++||.+++++
T Consensus 115 ~e~Vl~~~fsp~g~~l~tGs-GD~TvR~WD~-~TeTp~~t~Kg-H~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~~ 191 (480)
T KOG0271|consen 115 GEAVLSVQFSPTGSRLVTGS-GDTTVRLWDL-DTETPLFTCKG-HKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQQ 191 (480)
T ss_pred CCcEEEEEecCCCceEEecC-CCceEEeecc-CCCCcceeecC-CccEEEEEEECCCcchhhccccCCeEEEecCCCCCc
Confidence 35778999999999987776 6899999995 55566777777 999999999999999999999999999999998876
Q ss_pred E-EEEe-cCCceeecCceeEEEeC-----CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 86 H-YTLQ-LDNTIKLLGAGAFAFHP-----TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 86 l-~tL~-H~~~V~~~ga~sVaFSP-----dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
+ ..|. |.+.|+ +++|.| ..+.++++|.||+|+|||+. ...++-.+.+|+..|+|+.|--+| +|.|
T Consensus 192 ~g~~l~gH~K~It-----~Lawep~hl~p~~r~las~skDg~vrIWd~~--~~~~~~~lsgHT~~VTCvrwGG~g-liyS 263 (480)
T KOG0271|consen 192 IGRALRGHKKWIT-----ALAWEPLHLVPPCRRLASSSKDGSVRIWDTK--LGTCVRTLSGHTASVTCVRWGGEG-LIYS 263 (480)
T ss_pred ccccccCccccee-----EEeecccccCCCccceecccCCCCEEEEEcc--CceEEEEeccCccceEEEEEcCCc-eEEe
Confidence 5 6677 999999 999976 56789999999999999998 355555688999999999997654 8999
Q ss_pred EECCCCeeEEEeee-----ecCCC-----------CCCCccccccCCCcc-------------------ccccccccccc
Q 000425 159 LCRDGSLQVWKTRV-----IINPN-----------RPPMQANFFEPASIE-------------------SIDIPRILSQQ 203 (1525)
Q Consensus 159 gS~DgtIkVWDlr~-----~~~p~-----------~~~~~a~~f~p~gVe-------------------sldi~~iLs~~ 203 (1525)
||.|+|||+|+..- .+..| +.......|++.+-. .-...+.++.+
T Consensus 264 gS~DrtIkvw~a~dG~~~r~lkGHahwvN~lalsTdy~LRtgaf~~t~~~~~~~se~~~~Al~rY~~~~~~~~erlVSgs 343 (480)
T KOG0271|consen 264 GSQDRTIKVWRALDGKLCRELKGHAHWVNHLALSTDYVLRTGAFDHTGRKPKSFSEEQKKALERYEAVLKDSGERLVSGS 343 (480)
T ss_pred cCCCceEEEEEccchhHHHhhcccchheeeeeccchhhhhccccccccccCCChHHHHHHHHHHHHHhhccCcceeEEec
Confidence 99999999999761 01111 111122233332211 00112233444
Q ss_pred CCcceec---------C-------CCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccc
Q 000425 204 GGEAVYP---------L-------PRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASV 267 (1525)
Q Consensus 204 gG~~vy~---------l-------~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~ 267 (1525)
.+...+- + .-|+.+.|.|+.-++| +++=|..+|.|..++| +.+.++|||-+.+|-+
T Consensus 344 Dd~tlflW~p~~~kkpi~rmtgHq~lVn~V~fSPd~r~IA------SaSFDkSVkLW~g~tG--k~lasfRGHv~~VYqv 415 (480)
T KOG0271|consen 344 DDFTLFLWNPFKSKKPITRMTGHQALVNHVSFSPDGRYIA------SASFDKSVKLWDGRTG--KFLASFRGHVAAVYQV 415 (480)
T ss_pred CCceEEEecccccccchhhhhchhhheeeEEECCCccEEE------EeecccceeeeeCCCc--chhhhhhhccceeEEE
Confidence 3333222 1 2488999999999988 8888999999999988 4677999999999875
Q ss_pred hhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhh
Q 000425 268 LKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARK 309 (1525)
Q Consensus 268 ~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~ 309 (1525)
.|+-++--++... .|.++.|||+..+
T Consensus 416 ----awsaDsRLlVS~S------------kDsTLKvw~V~tk 441 (480)
T KOG0271|consen 416 ----AWSADSRLLVSGS------------KDSTLKVWDVRTK 441 (480)
T ss_pred ----EeccCccEEEEcC------------CCceEEEEEeeee
Confidence 3444542233322 6889999996555
No 8
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.88 E-value=1.1e-22 Score=233.52 Aligned_cols=214 Identities=18% Similarity=0.164 Sum_probs=174.3
Q ss_pred cEEEEEEcCC-CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEVHLALTPL-QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~-qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
....+.|||. ...-+.++.-|.++++|.. ++..+...+.+ |...|..++|||+|++|+++|.|.+-|+||++++..+
T Consensus 219 ~v~~~~fhP~~~~~~lat~s~Dgtvklw~~-~~e~~l~~l~g-H~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~El 296 (459)
T KOG0272|consen 219 RVGAAVFHPVDSDLNLATASADGTVKLWKL-SQETPLQDLEG-HLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSEL 296 (459)
T ss_pred ceeeEEEccCCCccceeeeccCCceeeecc-CCCcchhhhhc-chhhheeeeecCCCceeeecccccchhhcccccchhh
Confidence 4567899999 4555566667999999985 44455555555 9999999999999999999999999999999999888
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
...+ |...|. +++|+|||..++|||.|..-||||+++...-+ ++.+|...|.+|+|+|+|-.|+|||.|+++
T Consensus 297 L~QEGHs~~v~-----~iaf~~DGSL~~tGGlD~~~RvWDlRtgr~im--~L~gH~k~I~~V~fsPNGy~lATgs~Dnt~ 369 (459)
T KOG0272|consen 297 LLQEGHSKGVF-----SIAFQPDGSLAATGGLDSLGRVWDLRTGRCIM--FLAGHIKEILSVAFSPNGYHLATGSSDNTC 369 (459)
T ss_pred Hhhcccccccc-----eeEecCCCceeeccCccchhheeecccCcEEE--EecccccceeeEeECCCceEEeecCCCCcE
Confidence 8888 999999 99999999999999999999999999764444 489999999999999999999999999999
Q ss_pred eEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEec-CCCeEEEEecccccCCccccccc
Q 000425 166 QVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHP-RLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 166 kVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP-~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
+|||+|....-+ .+ | .+. .-|..|.|.| .+.+++ +++=|.+.++|
T Consensus 370 kVWDLR~r~~ly--~i------p-------------AH~-------nlVS~Vk~~p~~g~fL~------TasyD~t~kiW 415 (459)
T KOG0272|consen 370 KVWDLRMRSELY--TI------P-------------AHS-------NLVSQVKYSPQEGYFLV------TASYDNTVKIW 415 (459)
T ss_pred EEeeecccccce--ec------c-------------ccc-------chhhheEecccCCeEEE------EcccCcceeee
Confidence 999998433210 01 1 000 1255678887 456665 78888888999
Q ss_pred ccccCceeeeeeeccccCCccc
Q 000425 245 YTREGRKQLFAVLQSARGSSAS 266 (1525)
Q Consensus 245 ~~~eg~~q~~~tlq~~~~~~~s 266 (1525)
..++ ++++.+|.||.+-+.|
T Consensus 416 s~~~--~~~~ksLaGHe~kV~s 435 (459)
T KOG0272|consen 416 STRT--WSPLKSLAGHEGKVIS 435 (459)
T ss_pred cCCC--cccchhhcCCccceEE
Confidence 8765 5888999999997644
No 9
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.87 E-value=4.1e-21 Score=212.03 Aligned_cols=246 Identities=16% Similarity=0.175 Sum_probs=192.4
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecC----CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG----GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT 82 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg----gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t 82 (1525)
|-...|+..+.+|-++.++++|+++.+|..... |.+...+++ |++.|..++++++++++++++.|+++|+||+.+
T Consensus 16 d~Vt~la~~~~~~~~l~sasrDk~ii~W~L~~dd~~~G~~~r~~~G-HsH~v~dv~~s~dg~~alS~swD~~lrlWDl~~ 94 (315)
T KOG0279|consen 16 DWVTALAIKIKNSDILVSASRDKTIIVWKLTSDDIKYGVPVRRLTG-HSHFVSDVVLSSDGNFALSASWDGTLRLWDLAT 94 (315)
T ss_pred ceEEEEEeecCCCceEEEcccceEEEEEEeccCccccCceeeeeec-cceEecceEEccCCceEEeccccceEEEEEecC
Confidence 445678889999999999999999999987443 556667777 999999999999999999999999999999999
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC--CCEEEEE
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM--LRLLVTL 159 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd--g~~LaSg 159 (1525)
++..+.|. |...|. +|+|++|.+.|++||.|++|++||+-.. ......-..|...|+|+.|+|+ .++|+++
T Consensus 95 g~~t~~f~GH~~dVl-----sva~s~dn~qivSGSrDkTiklwnt~g~-ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~ 168 (315)
T KOG0279|consen 95 GESTRRFVGHTKDVL-----SVAFSTDNRQIVSGSRDKTIKLWNTLGV-CKYTIHEDSHREWVSCVRFSPNESNPIIVSA 168 (315)
T ss_pred CcEEEEEEecCCceE-----EEEecCCCceeecCCCcceeeeeeeccc-EEEEEecCCCcCcEEEEEEcCCCCCcEEEEc
Confidence 99999999 999999 9999999999999999999999998853 2222112234788999999998 7899999
Q ss_pred ECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcc
Q 000425 160 CRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNI 239 (1525)
Q Consensus 160 S~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~ 239 (1525)
|.|++||+||++.....+ + ..+ ... -++++++.|++.+.+ +++.|.
T Consensus 169 s~DktvKvWnl~~~~l~~------~---------------~~g---h~~----~v~t~~vSpDGslca------sGgkdg 214 (315)
T KOG0279|consen 169 SWDKTVKVWNLRNCQLRT------T---------------FIG---HSG----YVNTVTVSPDGSLCA------SGGKDG 214 (315)
T ss_pred cCCceEEEEccCCcchhh------c---------------ccc---ccc----cEEEEEECCCCCEEe------cCCCCc
Confidence 999999999998432210 1 111 112 278999999999998 899999
Q ss_pred cccccccccCceeeeeeeccccCCcccchhhhcccCCCcc--cchhhHHHHHHHhhhccCCCceeeehhhhhceeeee
Q 000425 240 KNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSG--ILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSH 315 (1525)
Q Consensus 240 ~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g--~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~ 315 (1525)
+...|+.++|+. .+++ .|...+.++. ++++- ..+. -+..+.|||+-.+..|.+.
T Consensus 215 ~~~LwdL~~~k~--lysl-~a~~~v~sl~------fspnrywL~~a-------------t~~sIkIwdl~~~~~v~~l 270 (315)
T KOG0279|consen 215 EAMLWDLNEGKN--LYSL-EAFDIVNSLC------FSPNRYWLCAA-------------TATSIKIWDLESKAVVEEL 270 (315)
T ss_pred eEEEEEccCCce--eEec-cCCCeEeeEE------ecCCceeEeec-------------cCCceEEEeccchhhhhhc
Confidence 999999999976 2233 4454443321 11111 0111 3667999998888777764
No 10
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=99.87 E-value=2.9e-21 Score=219.25 Aligned_cols=173 Identities=20% Similarity=0.301 Sum_probs=143.6
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
+|.++|.|++|+|++..+++|+.|.++|+||+.+..++++.+ |.+.|. ||+|+|||++|++|+.||+|++||-.
T Consensus 113 GH~e~Vl~~~fsp~g~~l~tGsGD~TvR~WD~~TeTp~~t~KgH~~WVl-----cvawsPDgk~iASG~~dg~I~lwdpk 187 (480)
T KOG0271|consen 113 GHGEAVLSVQFSPTGSRLVTGSGDTTVRLWDLDTETPLFTCKGHKNWVL-----CVAWSPDGKKIASGSKDGSIRLWDPK 187 (480)
T ss_pred CCCCcEEEEEecCCCceEEecCCCceEEeeccCCCCcceeecCCccEEE-----EEEECCCcchhhccccCCeEEEecCC
Confidence 499999999999999999999999999999999999999999 999999 99999999999999999999999988
Q ss_pred CCCceeeEeecccCcCeEEEEEcC-----CCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccc
Q 000425 128 IERPSMIGIIQVGSQPITSVAWLP-----MLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQ 202 (1525)
Q Consensus 128 t~~~s~i~il~gH~~~VtsVafsP-----dg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~ 202 (1525)
+. ......+.+|...|++++|+| ..++|+|+|.||+|+|||+..... -..+++
T Consensus 188 tg-~~~g~~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd~~~~~~---------------------~~~lsg 245 (480)
T KOG0271|consen 188 TG-QQIGRALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWDTKLGTC---------------------VRTLSG 245 (480)
T ss_pred CC-CcccccccCcccceeEEeecccccCCCccceecccCCCCEEEEEccCceE---------------------EEEecc
Confidence 64 333345669999999999988 467999999999999999863110 001222
Q ss_pred cCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 203 QGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 203 ~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
+. ..|+|+.|- ..+++. +++.|..+|+|+..+|+ +..+|+||..-+
T Consensus 246 HT-------~~VTCvrwG-G~gliy------SgS~DrtIkvw~a~dG~--~~r~lkGHahwv 291 (480)
T KOG0271|consen 246 HT-------ASVTCVRWG-GEGLIY------SGSQDRTIKVWRALDGK--LCRELKGHAHWV 291 (480)
T ss_pred Cc-------cceEEEEEc-CCceEE------ecCCCceEEEEEccchh--HHHhhcccchhe
Confidence 21 246777775 346666 89999999999988874 667889988754
No 11
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=99.86 E-value=6.9e-21 Score=218.91 Aligned_cols=206 Identities=22% Similarity=0.273 Sum_probs=173.7
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCC--CCEEEEEeCCCcEEEEeC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPT--LEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPd--g~~L~SgS~DgtVrVWDl 126 (1525)
-+.||..+.|+.++..|++||.+|.+++|+..+...+.+|. |...|. ++.|||. +..++||+.||+|++|++
T Consensus 174 d~rPis~~~fS~ds~~laT~swsG~~kvW~~~~~~~~~~l~gH~~~v~-----~~~fhP~~~~~~lat~s~Dgtvklw~~ 248 (459)
T KOG0272|consen 174 DTRPISGCSFSRDSKHLATGSWSGLVKVWSVPQCNLLQTLRGHTSRVG-----AAVFHPVDSDLNLATASADGTVKLWKL 248 (459)
T ss_pred CCCcceeeEeecCCCeEEEeecCCceeEeecCCcceeEEEecccccee-----eEEEccCCCccceeeeccCCceeeecc
Confidence 46899999999999999999999999999999999999999 999999 9999996 568999999999999999
Q ss_pred CCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCc
Q 000425 127 SIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGE 206 (1525)
Q Consensus 127 ~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~ 206 (1525)
.+ ...++.+.+|...|..++|||+|++|+|+|-|.+-++||+++... .+.+ .|.
T Consensus 249 ~~--e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~E-----------------------lL~Q-EGH 302 (459)
T KOG0272|consen 249 SQ--ETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSE-----------------------LLLQ-EGH 302 (459)
T ss_pred CC--CcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchh-----------------------hHhh-ccc
Confidence 95 344556779999999999999999999999999999999985332 1111 121
Q ss_pred ceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHH
Q 000425 207 AVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQL 286 (1525)
Q Consensus 207 ~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql 286 (1525)
. ..|-.++|||++.+++ +|+-|..-++|..|+|++.+ +|.+|..-++++ .+.+-|. +
T Consensus 303 s----~~v~~iaf~~DGSL~~------tGGlD~~~RvWDlRtgr~im--~L~gH~k~I~~V---~fsPNGy-------~- 359 (459)
T KOG0272|consen 303 S----KGVFSIAFQPDGSLAA------TGGLDSLGRVWDLRTGRCIM--FLAGHIKEILSV---AFSPNGY-------H- 359 (459)
T ss_pred c----cccceeEecCCCceee------ccCccchhheeecccCcEEE--EecccccceeeE---eECCCce-------E-
Confidence 1 2567899999999999 99999999999999998777 599999988774 3444343 2
Q ss_pred HHHHHhhhccCCCceeeehhhhhceeee
Q 000425 287 QAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 287 ~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
.+.-++|++.+|||+-++.++|+
T Consensus 360 -----lATgs~Dnt~kVWDLR~r~~ly~ 382 (459)
T KOG0272|consen 360 -----LATGSSDNTCKVWDLRMRSELYT 382 (459)
T ss_pred -----EeecCCCCcEEEeeeccccccee
Confidence 12234899999999999999988
No 12
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.85 E-value=5.7e-20 Score=202.32 Aligned_cols=183 Identities=16% Similarity=0.263 Sum_probs=154.4
Q ss_pred CCCCCeEEEEEcCC-CCEEEEEECCCcEEEEEcCC---CcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEE
Q 000425 49 DLKKPIVNLACHPR-LPVLYVAYADGLIRAYNIHT---YAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLL 122 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pd-g~~LaSGs~DgtIrLWDl~t---~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVr 122 (1525)
+|+++|.+++|||- |..||+|+.|+.||+|+... ..+...+. |...|+ +|+|+|.|++|+++|.|.++.
T Consensus 12 gh~~r~W~~awhp~~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVR-----svAwsp~g~~La~aSFD~t~~ 86 (312)
T KOG0645|consen 12 GHKDRVWSVAWHPGKGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVR-----SVAWSPHGRYLASASFDATVV 86 (312)
T ss_pred CCCCcEEEEEeccCCceEEEeecCCceEEEEecCCCCcEEEEEeccccchheee-----eeeecCCCcEEEEeeccceEE
Confidence 39999999999998 88999999999999999874 56777776 999999 999999999999999999999
Q ss_pred EEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccc
Q 000425 123 AWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQ 202 (1525)
Q Consensus 123 VWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~ 202 (1525)
||.-......++.++.||...|.|++|+++|.||+|||.|++|.||.+. +.+.. +...+|..
T Consensus 87 Iw~k~~~efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~d---------------eddEf---ec~aVL~~ 148 (312)
T KOG0645|consen 87 IWKKEDGEFECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEID---------------EDDEF---ECIAVLQE 148 (312)
T ss_pred EeecCCCceeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEec---------------CCCcE---EEEeeecc
Confidence 9966544577888899999999999999999999999999999999875 11111 22223333
Q ss_pred cCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccc-cCceeeeeeeccccCCcccc
Q 000425 203 QGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR-EGRKQLFAVLQSARGSSASV 267 (1525)
Q Consensus 203 ~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~-eg~~q~~~tlq~~~~~~~s~ 267 (1525)
+. ..|+.+.|||+..+++ +++=|..+|.|+.. ++.+++.++|.+|.+.+.++
T Consensus 149 Ht-------qDVK~V~WHPt~dlL~------S~SYDnTIk~~~~~~dddW~c~~tl~g~~~TVW~~ 201 (312)
T KOG0645|consen 149 HT-------QDVKHVIWHPTEDLLF------SCSYDNTIKVYRDEDDDDWECVQTLDGHENTVWSL 201 (312)
T ss_pred cc-------ccccEEEEcCCcceeE------EeccCCeEEEEeecCCCCeeEEEEecCccceEEEE
Confidence 22 4688999999999887 77878888888877 78999999999999977554
No 13
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.84 E-value=2.2e-20 Score=227.94 Aligned_cols=246 Identities=14% Similarity=0.176 Sum_probs=193.8
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecC-----------------------------CccceEEccCCCCCe
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-----------------------------GRAPTKIKTDLKKPI 54 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-----------------------------gk~~tklk~~H~~~V 54 (1525)
+...+..+..|+++...+.+|+. +..|++|..... .........+|.+||
T Consensus 376 nt~~~v~ca~fSddssmlA~Gf~-dS~i~~~Sl~p~kl~~lk~~~~l~~~d~~sad~~~~~~D~~~~~~~~~L~GH~GPV 454 (707)
T KOG0263|consen 376 NTYQGVTCAEFSDDSSMLACGFV-DSSVRVWSLTPKKLKKLKDASDLSNIDTESADVDVDMLDDDSSGTSRTLYGHSGPV 454 (707)
T ss_pred EcCCcceeEeecCCcchhhcccc-ccEEEEEecchhhhccccchhhhccccccccchhhhhccccCCceeEEeecCCCce
Confidence 34566778899999998888885 788888865410 001112223599999
Q ss_pred EEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee
Q 000425 55 VNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM 133 (1525)
Q Consensus 55 ~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~ 133 (1525)
.++.|+|+.++|+++|.|++||+|++.+..++..++ |..+|. +|+|+|.|-|++|+|.|++-++|..... ..
T Consensus 455 yg~sFsPd~rfLlScSED~svRLWsl~t~s~~V~y~GH~~PVw-----dV~F~P~GyYFatas~D~tArLWs~d~~--~P 527 (707)
T KOG0263|consen 455 YGCSFSPDRRFLLSCSEDSSVRLWSLDTWSCLVIYKGHLAPVW-----DVQFAPRGYYFATASHDQTARLWSTDHN--KP 527 (707)
T ss_pred eeeeecccccceeeccCCcceeeeecccceeEEEecCCCccee-----eEEecCCceEEEecCCCceeeeeecccC--Cc
Confidence 999999999999999999999999999999999999 999999 9999999999999999999999988742 22
Q ss_pred eEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCC
Q 000425 134 IGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPR 213 (1525)
Q Consensus 134 i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~ 213 (1525)
..++.+|.+.|.|+.|||+..|++|||.|.|||+||+..... . ++..++. .+
T Consensus 528 lRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~------V---------------RiF~GH~-------~~ 579 (707)
T KOG0263|consen 528 LRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNS------V---------------RIFTGHK-------GP 579 (707)
T ss_pred hhhhcccccccceEEECCcccccccCCCCceEEEEEcCCCcE------E---------------EEecCCC-------Cc
Confidence 234669999999999999999999999999999999863210 0 1122222 25
Q ss_pred eeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhh
Q 000425 214 VRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEH 293 (1525)
Q Consensus 214 V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~ 293 (1525)
|..++|.|.+-.++ ++++|..+++|+...|+ ++..+++|.+..+|+.+ +.+.+..+.+.
T Consensus 580 V~al~~Sp~Gr~La------Sg~ed~~I~iWDl~~~~--~v~~l~~Ht~ti~SlsF----S~dg~vLasgg--------- 638 (707)
T KOG0263|consen 580 VTALAFSPCGRYLA------SGDEDGLIKIWDLANGS--LVKQLKGHTGTIYSLSF----SRDGNVLASGG--------- 638 (707)
T ss_pred eEEEEEcCCCceEe------ecccCCcEEEEEcCCCc--chhhhhcccCceeEEEE----ecCCCEEEecC---------
Confidence 78889999888887 99999999999988774 56689999998887532 22322233555
Q ss_pred hccCCCceeeehhhhh
Q 000425 294 HLKGHSHLTISDIARK 309 (1525)
Q Consensus 294 ~~~~~~~~~I~di~r~ 309 (1525)
.|+++++||+-+.
T Consensus 639 ---~DnsV~lWD~~~~ 651 (707)
T KOG0263|consen 639 ---ADNSVRLWDLTKV 651 (707)
T ss_pred ---CCCeEEEEEchhh
Confidence 6999999996655
No 14
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.84 E-value=2.3e-19 Score=216.65 Aligned_cols=160 Identities=20% Similarity=0.289 Sum_probs=134.3
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~ 83 (1525)
||.-..-.++|+-++.|+.+|+.+-..+.||... ...-..+..+ |...+.+++++|||+++++|+.||+|||||..++
T Consensus 305 is~~~I~t~~~N~tGDWiA~g~~klgQLlVweWq-sEsYVlKQQg-H~~~i~~l~YSpDgq~iaTG~eDgKVKvWn~~Sg 382 (893)
T KOG0291|consen 305 ISDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQ-SESYVLKQQG-HSDRITSLAYSPDGQLIATGAEDGKVKVWNTQSG 382 (893)
T ss_pred cccceeeEEEecccCCEEEEcCCccceEEEEEee-ccceeeeccc-cccceeeEEECCCCcEEEeccCCCcEEEEeccCc
Confidence 3444556688999999999999876666676642 2222333333 9999999999999999999999999999999999
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC-------c-----eee----------------
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER-------P-----SMI---------------- 134 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-------~-----s~i---------------- 134 (1525)
-|+.+|. |...|. .+.|+.+|+.++|+|-||+||.||+..++ + .++
T Consensus 383 fC~vTFteHts~Vt-----~v~f~~~g~~llssSLDGtVRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~ 457 (893)
T KOG0291|consen 383 FCFVTFTEHTSGVT-----AVQFTARGNVLLSSSLDGTVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDS 457 (893)
T ss_pred eEEEEeccCCCceE-----EEEEEecCCEEEEeecCCeEEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccce
Confidence 9999999 999999 99999999999999999999999986530 0 011
Q ss_pred --------------EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 135 --------------GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 135 --------------~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
-++.||.++|.+++|+|++..|+|+|+|+|||+||+
T Consensus 458 F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkTVRiW~i 507 (893)
T KOG0291|consen 458 FEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKTVRIWDI 507 (893)
T ss_pred EEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccceEEEEEe
Confidence 135789999999999999999999999999999997
No 15
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.82 E-value=7.8e-19 Score=215.03 Aligned_cols=254 Identities=17% Similarity=0.206 Sum_probs=188.9
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCc--cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc-CC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR--APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI-HT 82 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk--~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl-~t 82 (1525)
......+.|||++.++.. ...++.+.+|+. +..+ ....+ .+|...|++++|+|++.++++|+.|++|||||+ ..
T Consensus 159 ~~sv~~~~fs~~g~~l~~-~~~~~~i~~~~~-~~~~~~~~~~l-~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~ 235 (456)
T KOG0266|consen 159 CPSVTCVDFSPDGRALAA-ASSDGLIRIWKL-EGIKSNLLREL-SGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDD 235 (456)
T ss_pred cCceEEEEEcCCCCeEEE-ccCCCcEEEeec-ccccchhhccc-cccccceeeeEECCCCcEEEEecCCceEEEeeccCC
Confidence 456677999999999544 445888888886 3333 33333 459999999999999999999999999999999 55
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
+..+++++ |...|+ +++|+|+++.+++|+.|++|+|||+++ ..+...+.+|...|++++|++++.+|+++|.
T Consensus 236 ~~~~~~l~gH~~~v~-----~~~f~p~g~~i~Sgs~D~tvriWd~~~--~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~ 308 (456)
T KOG0266|consen 236 GRNLKTLKGHSTYVT-----SVAFSPDGNLLVSGSDDGTVRIWDVRT--GECVRKLKGHSDGISGLAFSPDGNLLVSASY 308 (456)
T ss_pred CeEEEEecCCCCceE-----EEEecCCCCEEEEecCCCcEEEEeccC--CeEEEeeeccCCceEEEEECCCCCEEEEcCC
Confidence 68999999 999999 999999999999999999999999995 5666678899999999999999999999999
Q ss_pred CCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccc
Q 000425 162 DGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKN 241 (1525)
Q Consensus 162 DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~ 241 (1525)
|++|+|||+.....- ....+.... .+- .++.+.|||++..+. ++.+|...
T Consensus 309 d~~i~vwd~~~~~~~-------------------~~~~~~~~~----~~~-~~~~~~fsp~~~~ll------~~~~d~~~ 358 (456)
T KOG0266|consen 309 DGTIRVWDLETGSKL-------------------CLKLLSGAE----NSA-PVTSVQFSPNGKYLL------SASLDRTL 358 (456)
T ss_pred CccEEEEECCCCcee-------------------eeecccCCC----CCC-ceeEEEECCCCcEEE------EecCCCeE
Confidence 999999998743210 000111111 111 468899999999887 78889999
Q ss_pred cccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 242 RAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 242 k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
+.|..+.++ ...++.+|.+..++. ......-+...++.++ ++..+.+||+.....++.
T Consensus 359 ~~w~l~~~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~i~sg~------------~d~~v~~~~~~s~~~~~~ 416 (456)
T KOG0266|consen 359 KLWDLRSGK--SVGTYTGHSNLVRCI-FSPTLSTGGKLIYSGS------------EDGSVYVWDSSSGGILQR 416 (456)
T ss_pred EEEEccCCc--ceeeecccCCcceeE-ecccccCCCCeEEEEe------------CCceEEEEeCCccchhhh
Confidence 999877663 445677888753222 1111111222223333 567777777665444433
No 16
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.82 E-value=8.2e-19 Score=193.33 Aligned_cols=205 Identities=18% Similarity=0.242 Sum_probs=167.3
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCC--ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC--C
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG--RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH--T 82 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg--k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~--t 82 (1525)
|++-.+|+||-.-.+|.+...++.+++|++.++. ...+.+...|+..|++++|+|.|++|++||.|.++.||.-. .
T Consensus 15 ~r~W~~awhp~~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~e 94 (312)
T KOG0645|consen 15 DRVWSVAWHPGKGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKEDGE 94 (312)
T ss_pred CcEEEEEeccCCceEEEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCCCc
Confidence 3567899999966678888889999999986532 12344556799999999999999999999999999999744 4
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-CceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-RPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
++++.+++ |++.|. +++|+++|.+|+++|+|++|-||..... ...+.++++.|++.|..+.|||...+|+|||
T Consensus 95 fecv~~lEGHEnEVK-----~Vaws~sG~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~S 169 (312)
T KOG0645|consen 95 FECVATLEGHENEVK-----CVAWSASGNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCS 169 (312)
T ss_pred eeEEeeeecccccee-----EEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEec
Confidence 58999999 999999 9999999999999999999999988742 5677888999999999999999999999999
Q ss_pred CCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccc
Q 000425 161 RDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIK 240 (1525)
Q Consensus 161 ~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~ 240 (1525)
.|.|||+|+-.- .+ ..+....+..+.+ .|+.++|||.++-.+ +++||..
T Consensus 170 YDnTIk~~~~~~---------------dd---dW~c~~tl~g~~~-------TVW~~~F~~~G~rl~------s~sdD~t 218 (312)
T KOG0645|consen 170 YDNTIKVYRDED---------------DD---DWECVQTLDGHEN-------TVWSLAFDNIGSRLV------SCSDDGT 218 (312)
T ss_pred cCCeEEEEeecC---------------CC---CeeEEEEecCccc-------eEEEEEecCCCceEE------EecCCcc
Confidence 999999997421 00 0111112222221 478999999987766 8899999
Q ss_pred ccccccc
Q 000425 241 NRAAYTR 247 (1525)
Q Consensus 241 ~k~~~~~ 247 (1525)
+++|+.+
T Consensus 219 v~Iw~~~ 225 (312)
T KOG0645|consen 219 VSIWRLY 225 (312)
T ss_pred eEeeeec
Confidence 9999855
No 17
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.81 E-value=1e-18 Score=197.36 Aligned_cols=176 Identities=20% Similarity=0.278 Sum_probs=145.2
Q ss_pred EccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEE
Q 000425 46 IKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAW 124 (1525)
Q Consensus 46 lk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVW 124 (1525)
+..+|.+.|+|+++.|.+.+|++|+.|++|+|||+.++++..++. |-..|+ .|++|+..+|++|++.|+.|+.|
T Consensus 146 Vi~gHlgWVr~vavdP~n~wf~tgs~DrtikIwDlatg~LkltltGhi~~vr-----~vavS~rHpYlFs~gedk~VKCw 220 (460)
T KOG0285|consen 146 VISGHLGWVRSVAVDPGNEWFATGSADRTIKIWDLATGQLKLTLTGHIETVR-----GVAVSKRHPYLFSAGEDKQVKCW 220 (460)
T ss_pred hhhhccceEEEEeeCCCceeEEecCCCceeEEEEcccCeEEEeecchhheee-----eeeecccCceEEEecCCCeeEEE
Confidence 334599999999999999999999999999999999999999999 999999 99999999999999999999999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccC
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQG 204 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~g 204 (1525)
|+. ...++..+.||.+.|.|++.||..+.|+||+.|.++||||+|+..+- .++.++.
T Consensus 221 DLe--~nkvIR~YhGHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V---------------------~~l~GH~ 277 (460)
T KOG0285|consen 221 DLE--YNKVIRHYHGHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASV---------------------HVLSGHT 277 (460)
T ss_pred ech--hhhhHHHhccccceeEEEeccccceeEEecCCcceEEEeeecccceE---------------------EEecCCC
Confidence 999 46666677799999999999999999999999999999999954320 0122221
Q ss_pred CcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 205 GEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 205 G~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
..|..|.+.|...-+. +++-|..++.|..+.|+.-. ++--|.-++
T Consensus 278 -------~~V~~V~~~~~dpqvi------t~S~D~tvrlWDl~agkt~~--tlt~hkksv 322 (460)
T KOG0285|consen 278 -------NPVASVMCQPTDPQVI------TGSHDSTVRLWDLRAGKTMI--TLTHHKKSV 322 (460)
T ss_pred -------CcceeEEeecCCCceE------EecCCceEEEeeeccCceeE--eeeccccee
Confidence 1345566666555444 88999999999999986433 555555544
No 18
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.80 E-value=5.8e-19 Score=215.59 Aligned_cols=197 Identities=17% Similarity=0.196 Sum_probs=162.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+..+-.+|+|+..+++.++ .|+++++|.... -......+ +|..||..+.|+|.|-+||||+.|++-++|......++
T Consensus 452 GPVyg~sFsPd~rfLlScS-ED~svRLWsl~t-~s~~V~y~-GH~~PVwdV~F~P~GyYFatas~D~tArLWs~d~~~Pl 528 (707)
T KOG0263|consen 452 GPVYGCSFSPDRRFLLSCS-EDSSVRLWSLDT-WSCLVIYK-GHLAPVWDVQFAPRGYYFATASHDQTARLWSTDHNKPL 528 (707)
T ss_pred Cceeeeeecccccceeecc-CCcceeeeeccc-ceeEEEec-CCCcceeeEEecCCceEEEecCCCceeeeeecccCCch
Confidence 3566789999999987665 689999998533 23333344 59999999999999999999999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.|. |...|. |++|||+..|+++||.|++||+||+.+. ..+.++.||.++|++++|+|+|++|+||+.|+.|
T Consensus 529 RifaghlsDV~-----cv~FHPNs~Y~aTGSsD~tVRlWDv~~G--~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~I 601 (707)
T KOG0263|consen 529 RIFAGHLSDVD-----CVSFHPNSNYVATGSSDRTVRLWDVSTG--NSVRIFTGHKGPVTALAFSPCGRYLASGDEDGLI 601 (707)
T ss_pred hhhcccccccc-----eEEECCcccccccCCCCceEEEEEcCCC--cEEEEecCCCCceEEEEEcCCCceEeecccCCcE
Confidence 9999 999999 9999999999999999999999999964 4456688999999999999999999999999999
Q ss_pred eEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 166 QVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 166 kVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
++||+..... ...+ -.+. ..|..++|...+++++ +++-|..++.|.
T Consensus 602 ~iWDl~~~~~------v~~l---------------~~Ht-------~ti~SlsFS~dg~vLa------sgg~DnsV~lWD 647 (707)
T KOG0263|consen 602 KIWDLANGSL------VKQL---------------KGHT-------GTIYSLSFSRDGNVLA------SGGADNSVRLWD 647 (707)
T ss_pred EEEEcCCCcc------hhhh---------------hccc-------CceeEEEEecCCCEEE------ecCCCCeEEEEE
Confidence 9999863110 0001 1111 1356788999999988 788888888886
Q ss_pred cc
Q 000425 246 TR 247 (1525)
Q Consensus 246 ~~ 247 (1525)
..
T Consensus 648 ~~ 649 (707)
T KOG0263|consen 648 LT 649 (707)
T ss_pred ch
Confidence 43
No 19
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.79 E-value=5.8e-18 Score=207.39 Aligned_cols=157 Identities=20% Similarity=0.347 Sum_probs=138.4
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
....++|+|++.+++.++ .|+++++|+..+.++....+++ |...|++++|+|+++++++|+.|++|||||+++++++.
T Consensus 205 ~v~~~~fs~d~~~l~s~s-~D~tiriwd~~~~~~~~~~l~g-H~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~ 282 (456)
T KOG0266|consen 205 GVSDVAFSPDGSYLLSGS-DDKTLRIWDLKDDGRNLKTLKG-HSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVR 282 (456)
T ss_pred ceeeeEECCCCcEEEEec-CCceEEEeeccCCCeEEEEecC-CCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEE
Confidence 567899999999765555 7999999997455677777776 99999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcC--eEEEEEcCCCCEEEEEECCCC
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQP--ITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~--VtsVafsPdg~~LaSgS~Dgt 164 (1525)
.+. |...|. +++|++++.+|++++.|+.|+|||+.++...+...+.+|... ++++.|||++.++++++.|++
T Consensus 283 ~l~~hs~~is-----~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~ 357 (456)
T KOG0266|consen 283 KLKGHSDGIS-----GLAFSPDGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRT 357 (456)
T ss_pred eeeccCCceE-----EEEECCCCCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCe
Confidence 999 999999 999999999999999999999999997532133334466555 999999999999999999999
Q ss_pred eeEEEee
Q 000425 165 LQVWKTR 171 (1525)
Q Consensus 165 IkVWDlr 171 (1525)
+++||++
T Consensus 358 ~~~w~l~ 364 (456)
T KOG0266|consen 358 LKLWDLR 364 (456)
T ss_pred EEEEEcc
Confidence 9999998
No 20
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.78 E-value=5e-18 Score=187.85 Aligned_cols=203 Identities=13% Similarity=0.126 Sum_probs=155.1
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
-+++++++.+.|.++ .|..+++|+...+ .+...+. +|..-|.+++|+|+.+.+++|+.|.+|++||+.. .+.+++.
T Consensus 68 dv~~s~dg~~alS~s-wD~~lrlWDl~~g-~~t~~f~-GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g-~ck~t~~ 143 (315)
T KOG0279|consen 68 DVVLSSDGNFALSAS-WDGTLRLWDLATG-ESTRRFV-GHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLG-VCKYTIH 143 (315)
T ss_pred ceEEccCCceEEecc-ccceEEEEEecCC-cEEEEEE-ecCCceEEEEecCCCceeecCCCcceeeeeeecc-cEEEEEe
Confidence 478889999887666 6899999997544 4444444 4999999999999999999999999999999864 5666666
Q ss_pred ---cCCceeecCceeEEEeCC--CCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 91 ---LDNTIKLLGAGAFAFHPT--LEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 91 ---H~~~V~~~ga~sVaFSPd--g~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
|.+.|. ||+|+|. .++|+++|.|++||+||+++ .+....+-+|+.+|+.+.+||||.+.++|+.||.+
T Consensus 144 ~~~~~~WVs-----cvrfsP~~~~p~Ivs~s~DktvKvWnl~~--~~l~~~~~gh~~~v~t~~vSpDGslcasGgkdg~~ 216 (315)
T KOG0279|consen 144 EDSHREWVS-----CVRFSPNESNPIIVSASWDKTVKVWNLRN--CQLRTTFIGHSGYVNTVTVSPDGSLCASGGKDGEA 216 (315)
T ss_pred cCCCcCcEE-----EEEEcCCCCCcEEEEccCCceEEEEccCC--cchhhccccccccEEEEEECCCCCEEecCCCCceE
Confidence 478999 9999997 78999999999999999995 33333345999999999999999999999999999
Q ss_pred eEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 166 QVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 166 kVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
.+||++...+ +.+++ + ...|+.+.|.|+.-.+.. +-+..+|+|.
T Consensus 217 ~LwdL~~~k~---------------lysl~---------a-----~~~v~sl~fspnrywL~~-------at~~sIkIwd 260 (315)
T KOG0279|consen 217 MLWDLNEGKN---------------LYSLE---------A-----FDIVNSLCFSPNRYWLCA-------ATATSIKIWD 260 (315)
T ss_pred EEEEccCCce---------------eEecc---------C-----CCeEeeEEecCCceeEee-------ccCCceEEEe
Confidence 9999974322 00011 0 124677888887655442 2223488888
Q ss_pred cccCceeeeeeeccccC
Q 000425 246 TREGRKQLFAVLQSARG 262 (1525)
Q Consensus 246 ~~eg~~q~~~tlq~~~~ 262 (1525)
.+.++ ++..|+....
T Consensus 261 l~~~~--~v~~l~~d~~ 275 (315)
T KOG0279|consen 261 LESKA--VVEELKLDGI 275 (315)
T ss_pred ccchh--hhhhcccccc
Confidence 87764 3445655444
No 21
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.78 E-value=2e-18 Score=195.19 Aligned_cols=210 Identities=20% Similarity=0.289 Sum_probs=166.5
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
=..+|||-|...|...|+ .|.++.+|+.. .+.-...+.+ |...|+.+++++..+++++++.|+.||.||+...+.++
T Consensus 153 WVr~vavdP~n~wf~tgs-~DrtikIwDla-tg~LkltltG-hi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR 229 (460)
T KOG0285|consen 153 WVRSVAVDPGNEWFATGS-ADRTIKIWDLA-TGQLKLTLTG-HIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIR 229 (460)
T ss_pred eEEEEeeCCCceeEEecC-CCceeEEEEcc-cCeEEEeecc-hhheeeeeeecccCceEEEecCCCeeEEEechhhhhHH
Confidence 357899999988876666 58999999954 4444445555 99999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
.+. |...|. |++.||.-..|++|+.|.++||||+++. .-+..+.||...|..+.+.|-.+.++|||.|++|+
T Consensus 230 ~YhGHlS~V~-----~L~lhPTldvl~t~grDst~RvWDiRtr--~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D~tvr 302 (460)
T KOG0285|consen 230 HYHGHLSGVY-----CLDLHPTLDVLVTGGRDSTIRVWDIRTR--ASVHVLSGHTNPVASVMCQPTDPQVITGSHDSTVR 302 (460)
T ss_pred HhccccceeE-----EEeccccceeEEecCCcceEEEeeeccc--ceEEEecCCCCcceeEEeecCCCceEEecCCceEE
Confidence 999 999999 9999999999999999999999999963 33345789999999999999999999999999999
Q ss_pred EEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccc
Q 000425 167 VWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 167 VWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~ 246 (1525)
+||++.... +.+.+ . ....+.++..||+.|+.| +++.| .++.|..
T Consensus 303 lWDl~agkt-----~~tlt-~----------------------hkksvral~lhP~e~~fA------Sas~d-nik~w~~ 347 (460)
T KOG0285|consen 303 LWDLRAGKT-----MITLT-H----------------------HKKSVRALCLHPKENLFA------SASPD-NIKQWKL 347 (460)
T ss_pred EeeeccCce-----eEeee-c----------------------ccceeeEEecCCchhhhh------ccCCc-cceeccC
Confidence 999983211 11000 0 112478899999999876 33333 3456777
Q ss_pred ccCceeeeeeeccccCCc
Q 000425 247 REGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 247 ~eg~~q~~~tlq~~~~~~ 264 (1525)
.+|. ....+.+|+...
T Consensus 348 p~g~--f~~nlsgh~~ii 363 (460)
T KOG0285|consen 348 PEGE--FLQNLSGHNAII 363 (460)
T ss_pred Cccc--hhhcccccccee
Confidence 7772 333466666643
No 22
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.78 E-value=1e-17 Score=186.11 Aligned_cols=214 Identities=14% Similarity=0.123 Sum_probs=170.7
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEee----cCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTV----EGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~ti----eggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
..-||+|.+.+|.+|. -+..+.++... ++.........+|++.+.++.|-+ +..+++++.|.+.-+||+++++.
T Consensus 101 MtCA~sPSg~~VAcGG-LdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~d-D~~ilT~SGD~TCalWDie~g~~ 178 (343)
T KOG0286|consen 101 MTCAYSPSGNFVACGG-LDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLD-DNHILTGSGDMTCALWDIETGQQ 178 (343)
T ss_pred EEEEECCCCCeEEecC-cCceeEEEecccccccccceeeeeecCccceeEEEEEcC-CCceEecCCCceEEEEEcccceE
Confidence 4568999999998887 46677777754 233333333445999999999987 56789999999999999999999
Q ss_pred EEEEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 86 HYTLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 86 l~tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
+..|. |...|. +++++| +++++++|+.|++.++||++. ..+.+.+.+|...|++|.|+|+|.-|+|||+|+
T Consensus 179 ~~~f~GH~gDV~-----slsl~p~~~ntFvSg~cD~~aklWD~R~--~~c~qtF~ghesDINsv~ffP~G~afatGSDD~ 251 (343)
T KOG0286|consen 179 TQVFHGHTGDVM-----SLSLSPSDGNTFVSGGCDKSAKLWDVRS--GQCVQTFEGHESDINSVRFFPSGDAFATGSDDA 251 (343)
T ss_pred EEEecCCcccEE-----EEecCCCCCCeEEecccccceeeeeccC--cceeEeecccccccceEEEccCCCeeeecCCCc
Confidence 99999 999999 999999 999999999999999999995 456667789999999999999999999999999
Q ss_pred CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccc
Q 000425 164 SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRA 243 (1525)
Q Consensus 164 tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~ 243 (1525)
++|+||+|.-.. . ..|++.. ..-+|+.++|.-++-+.. .+.+|....+
T Consensus 252 tcRlyDlRaD~~------~-a~ys~~~-------------------~~~gitSv~FS~SGRlLf------agy~d~~c~v 299 (343)
T KOG0286|consen 252 TCRLYDLRADQE------L-AVYSHDS-------------------IICGITSVAFSKSGRLLF------AGYDDFTCNV 299 (343)
T ss_pred eeEEEeecCCcE------E-eeeccCc-------------------ccCCceeEEEcccccEEE------eeecCCceeE
Confidence 999999994221 0 0111100 012578889988888766 5667777778
Q ss_pred cccccCceeeeeeeccccCCccc
Q 000425 244 AYTREGRKQLFAVLQSARGSSAS 266 (1525)
Q Consensus 244 ~~~~eg~~q~~~tlq~~~~~~~s 266 (1525)
|..-.|+.. .+|-||.+-+.+
T Consensus 300 WDtlk~e~v--g~L~GHeNRvSc 320 (343)
T KOG0286|consen 300 WDTLKGERV--GVLAGHENRVSC 320 (343)
T ss_pred eeccccceE--EEeeccCCeeEE
Confidence 888777544 478899997755
No 23
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.77 E-value=3.7e-15 Score=175.51 Aligned_cols=156 Identities=12% Similarity=0.206 Sum_probs=133.2
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCE-EEEEECCCcEEEEEcCCCcEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPV-LYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~-LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
.--|++.+++-.+.+....+.....+-.++.+...-.+.+ |...|.+++|-|..++ +++|+.|++|-+|+=.-++...
T Consensus 106 I~Di~Wd~ds~RI~avGEGrerfg~~F~~DSG~SvGei~G-hSr~ins~~~KpsRPfRi~T~sdDn~v~ffeGPPFKFk~ 184 (603)
T KOG0318|consen 106 IKDISWDFDSKRIAAVGEGRERFGHVFLWDSGNSVGEITG-HSRRINSVDFKPSRPFRIATGSDDNTVAFFEGPPFKFKS 184 (603)
T ss_pred cccceeCCCCcEEEEEecCccceeEEEEecCCCccceeec-cceeEeeeeccCCCceEEEeccCCCeEEEeeCCCeeeee
Confidence 3357788887777777766666555555677777777777 9999999999999985 9999999999999877778888
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeec---ccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQ---VGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~---gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
+++ |.+.|+ |++|+|||..+++.+.||+|.+||-.+. ..++.+. +|.+.|.+++|+||+..|+|+|.|+
T Consensus 185 s~r~HskFV~-----~VRysPDG~~Fat~gsDgki~iyDGktg--e~vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaDk 257 (603)
T KOG0318|consen 185 SFREHSKFVN-----CVRYSPDGSRFATAGSDGKIYIYDGKTG--EKVGELEDSDAHKGSIFALSWSPDSTQFLTVSADK 257 (603)
T ss_pred ccccccccee-----eEEECCCCCeEEEecCCccEEEEcCCCc--cEEEEecCCCCccccEEEEEECCCCceEEEecCCc
Confidence 888 999999 9999999999999999999999998863 3334454 8999999999999999999999999
Q ss_pred CeeEEEeee
Q 000425 164 SLQVWKTRV 172 (1525)
Q Consensus 164 tIkVWDlr~ 172 (1525)
++||||+..
T Consensus 258 t~KIWdVs~ 266 (603)
T KOG0318|consen 258 TIKIWDVST 266 (603)
T ss_pred eEEEEEeec
Confidence 999999873
No 24
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.76 E-value=7.5e-17 Score=195.22 Aligned_cols=236 Identities=16% Similarity=0.242 Sum_probs=170.7
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECC-CcEEEEEcCCCcEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD-GLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D-gtIrLWDl~t~k~l~t 88 (1525)
-+.+|||.--.+.+++.+ .-..++.. ..-.....+.- -..+|..++|+.+|.++|.||.. |.+-||+.++..-+..
T Consensus 269 taa~fH~~t~~lvvgFss-G~f~Lyel-P~f~lih~LSi-s~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsYVlK 345 (893)
T KOG0291|consen 269 TAAAFHKGTNLLVVGFSS-GEFGLYEL-PDFNLIHSLSI-SDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESYVLK 345 (893)
T ss_pred eeeeccCCceEEEEEecC-CeeEEEec-CCceEEEEeec-ccceeeEEEecccCCEEEEcCCccceEEEEEeeccceeee
Confidence 345677777777676643 22223221 11111111111 24789999999999999998875 7899999998877777
Q ss_pred Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 89 LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 89 L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
.+ |...+. +++|+|||+++++|++||.|||||.++ .-|+.++..|++.|+.+.|+-.++.++|.|.||||+.
T Consensus 346 QQgH~~~i~-----~l~YSpDgq~iaTG~eDgKVKvWn~~S--gfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRA 418 (893)
T KOG0291|consen 346 QQGHSDRIT-----SLAYSPDGQLIATGAEDGKVKVWNTQS--GFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRA 418 (893)
T ss_pred cccccccee-----eEEECCCCcEEEeccCCCcEEEEeccC--ceEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEe
Confidence 77 999999 999999999999999999999999995 4455568899999999999999999999999999999
Q ss_pred EEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccc
Q 000425 168 WKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 168 WDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
||+.-+.+ ...|..|..+ -..+++..|.+-++.. .+-|.=.+++|...
T Consensus 419 wDlkRYrN------fRTft~P~p~---------------------QfscvavD~sGelV~A-----G~~d~F~IfvWS~q 466 (893)
T KOG0291|consen 419 WDLKRYRN------FRTFTSPEPI---------------------QFSCVAVDPSGELVCA-----GAQDSFEIFVWSVQ 466 (893)
T ss_pred eeecccce------eeeecCCCce---------------------eeeEEEEcCCCCEEEe-----eccceEEEEEEEee
Confidence 99975544 1112222110 1357888888877651 33344467789888
Q ss_pred cCceeeeeeeccccCCcccchhhhcccCCCcccc-hhhHHHHHHHhhhccCCCceeeehh
Q 000425 248 EGRKQLFAVLQSARGSSASVLKEKLSSMGSSGIL-ADHQLQAQLQEHHLKGHSHLTISDI 306 (1525)
Q Consensus 248 eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l-~~~ql~~~~~~~~~~~~~~~~I~di 306 (1525)
+| ++.++|.||.+.+.++ .+.++|+. .+ ..| |++++|||+
T Consensus 467 TG--qllDiLsGHEgPVs~l---~f~~~~~~-LaS~SW-------------DkTVRiW~i 507 (893)
T KOG0291|consen 467 TG--QLLDILSGHEGPVSGL---SFSPDGSL-LASGSW-------------DKTVRIWDI 507 (893)
T ss_pred cC--eeeehhcCCCCcceee---EEccccCe-EEeccc-------------cceEEEEEe
Confidence 87 6899999999998653 24444541 11 223 999999993
No 25
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.76 E-value=2.5e-18 Score=197.29 Aligned_cols=196 Identities=18% Similarity=0.280 Sum_probs=160.1
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
....+||+|+ ...|+++++|.++++|++. ..+...++.+ |.--|++++|||...++++|+.|..||+||.+++.|+.
T Consensus 182 aIRdlafSpn-DskF~t~SdDg~ikiWdf~-~~kee~vL~G-HgwdVksvdWHP~kgLiasgskDnlVKlWDprSg~cl~ 258 (464)
T KOG0284|consen 182 AIRDLAFSPN-DSKFLTCSDDGTIKIWDFR-MPKEERVLRG-HGWDVKSVDWHPTKGLIASGSKDNLVKLWDPRSGSCLA 258 (464)
T ss_pred hhheeccCCC-CceeEEecCCCeEEEEecc-CCchhheecc-CCCCcceeccCCccceeEEccCCceeEeecCCCcchhh
Confidence 4567999994 5567788889999999964 4566667766 99999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcC-CCCEEEEEECCCCe
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP-MLRLLVTLCRDGSL 165 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP-dg~~LaSgS~DgtI 165 (1525)
++. |.+.|. .+.|+|+++||+++|.|.+++++|+++. +. ...+.+|...|+++.||| ...+|.||+.||.|
T Consensus 259 tlh~HKntVl-----~~~f~~n~N~Llt~skD~~~kv~DiR~m-kE-l~~~r~Hkkdv~~~~WhP~~~~lftsgg~Dgsv 331 (464)
T KOG0284|consen 259 TLHGHKNTVL-----AVKFNPNGNWLLTGSKDQSCKVFDIRTM-KE-LFTYRGHKKDVTSLTWHPLNESLFTSGGSDGSV 331 (464)
T ss_pred hhhhccceEE-----EEEEcCCCCeeEEccCCceEEEEehhHh-HH-HHHhhcchhhheeeccccccccceeeccCCCce
Confidence 999 999999 9999999999999999999999999953 11 123569999999999999 45689999999999
Q ss_pred eEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 166 QVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 166 kVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
..|.+.... | +..+-..+. ..||.++|||-+-+.+ +++.|..+|.|.
T Consensus 332 vh~~v~~~~-------------p-------~~~i~~AHd-------~~iwsl~~hPlGhil~------tgsnd~t~rfw~ 378 (464)
T KOG0284|consen 332 VHWVVGLEE-------------P-------LGEIPPAHD-------GEIWSLAYHPLGHILA------TGSNDRTVRFWT 378 (464)
T ss_pred EEEeccccc-------------c-------ccCCCcccc-------cceeeeeccccceeEe------ecCCCcceeeec
Confidence 999876211 1 000001111 2589999999988877 788888888775
Q ss_pred c
Q 000425 246 T 246 (1525)
Q Consensus 246 ~ 246 (1525)
+
T Consensus 379 r 379 (464)
T KOG0284|consen 379 R 379 (464)
T ss_pred c
Confidence 4
No 26
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.75 E-value=5.6e-18 Score=197.83 Aligned_cols=240 Identities=15% Similarity=0.210 Sum_probs=180.2
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
++...|.|-|..--+|.++..+..|.+|+....++++..+.+ |..+|+.++|+.+|..|++++.|++||+||+++++++
T Consensus 215 kgvsai~~fp~~~hLlLS~gmD~~vklW~vy~~~~~lrtf~g-H~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~ 293 (503)
T KOG0282|consen 215 KGVSAIQWFPKKGHLLLSGGMDGLVKLWNVYDDRRCLRTFKG-HRKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVL 293 (503)
T ss_pred cccchhhhccceeeEEEecCCCceEEEEEEecCcceehhhhc-chhhhhhhhccccCCeeeeeecceeeeeeccccceEE
Confidence 456677888877777788888999999998887777777777 9999999999999999999999999999999999999
Q ss_pred EEEecCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQLDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
..|.....+. ||.|+||+ +.+++|+.|+.|+.||+++ ..+++.+..|-+.|..+.|-+++++++|.|+|+++
T Consensus 294 ~~f~~~~~~~-----cvkf~pd~~n~fl~G~sd~ki~~wDiRs--~kvvqeYd~hLg~i~~i~F~~~g~rFissSDdks~ 366 (503)
T KOG0282|consen 294 SRFHLDKVPT-----CVKFHPDNQNIFLVGGSDKKIRQWDIRS--GKVVQEYDRHLGAILDITFVDEGRRFISSSDDKSV 366 (503)
T ss_pred EEEecCCCce-----eeecCCCCCcEEEEecCCCcEEEEeccc--hHHHHHHHhhhhheeeeEEccCCceEeeeccCccE
Confidence 9998777888 99999998 6788999999999999995 45666677899999999999999999999999999
Q ss_pred eEEEeeee------cCCCCCCCccccccCCC----cccccccccccccCCcceecC------CC------eeEEEEecCC
Q 000425 166 QVWKTRVI------INPNRPPMQANFFEPAS----IESIDIPRILSQQGGEAVYPL------PR------VRALEVHPRL 223 (1525)
Q Consensus 166 kVWDlr~~------~~p~~~~~~a~~f~p~g----Vesldi~~iLs~~gG~~vy~l------~~------V~~v~~HP~~ 223 (1525)
+||+.+.. ..++.+.|.+....|.+ -+++|.. ++... -...|.+ .+ -..|.|.|.+
T Consensus 367 riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~-i~ifs-~~~~~r~nkkK~feGh~vaGys~~v~fSpDG 444 (503)
T KOG0282|consen 367 RIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNY-IAIFS-TVPPFRLNKKKRFEGHSVAGYSCQVDFSPDG 444 (503)
T ss_pred EEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCce-EEEEe-cccccccCHhhhhcceeccCceeeEEEcCCC
Confidence 99999822 22233333332223322 1122211 00000 0001111 11 2358888888
Q ss_pred CeEEEEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 224 NLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 224 Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
-.+. +|..+.+...|..++. .++..++.|.+..
T Consensus 445 ~~l~------SGdsdG~v~~wdwkt~--kl~~~lkah~~~c 477 (503)
T KOG0282|consen 445 RTLC------SGDSDGKVNFWDWKTT--KLVSKLKAHDQPC 477 (503)
T ss_pred CeEE------eecCCccEEEeechhh--hhhhccccCCcce
Confidence 8776 7888888887765553 3666788886643
No 27
>PTZ00421 coronin; Provisional
Probab=99.74 E-value=1.5e-16 Score=196.29 Aligned_cols=217 Identities=18% Similarity=0.234 Sum_probs=148.4
Q ss_pred EEccCCCCCeEEEEEcC-CCCEEEEEECCCcEEEEEcCCC-------cEEEEEe-cCCceeecCceeEEEeCCC-CEEEE
Q 000425 45 KIKTDLKKPIVNLACHP-RLPVLYVAYADGLIRAYNIHTY-------AVHYTLQ-LDNTIKLLGAGAFAFHPTL-EWLFV 114 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~P-dg~~LaSGs~DgtIrLWDl~t~-------k~l~tL~-H~~~V~~~ga~sVaFSPdg-~~L~S 114 (1525)
.+.+ |.++|++++|+| ++.+|++|+.|++|++||+.+. +++..+. |...|. +++|+|++ .+|++
T Consensus 70 ~l~G-H~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~-----~l~f~P~~~~iLaS 143 (493)
T PTZ00421 70 ILLG-QEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVG-----IVSFHPSAMNVLAS 143 (493)
T ss_pred eEeC-CCCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEE-----EEEeCcCCCCEEEE
Confidence 3444 999999999999 7889999999999999998764 3567788 999999 99999986 68999
Q ss_pred EeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccc
Q 000425 115 GDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESI 194 (1525)
Q Consensus 115 gS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesl 194 (1525)
|+.|++|+|||+++. .....+.+|...|++++|+|++.+|++++.|++|++||+|.... ..
T Consensus 144 gs~DgtVrIWDl~tg--~~~~~l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~------v~----------- 204 (493)
T PTZ00421 144 AGADMVVNVWDVERG--KAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTI------VS----------- 204 (493)
T ss_pred EeCCCEEEEEECCCC--eEEEEEcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcE------EE-----------
Confidence 999999999999953 33445678999999999999999999999999999999873110 00
Q ss_pred cccccccccCCcceecCCCeeEEEEecCCCeEE-EEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcc
Q 000425 195 DIPRILSQQGGEAVYPLPRVRALEVHPRLNLAV-LLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLS 273 (1525)
Q Consensus 195 di~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlia-l~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~ 273 (1525)
.+..+.+ .....+.|+|..+.+. ..| +.+.|..+++|..+..... +.+...+.... +...++
T Consensus 205 ----tl~~H~~------~~~~~~~w~~~~~~ivt~G~---s~s~Dr~VklWDlr~~~~p-~~~~~~d~~~~---~~~~~~ 267 (493)
T PTZ00421 205 ----SVEAHAS------AKSQRCLWAKRKDLIITLGC---SKSQQRQIMLWDTRKMASP-YSTVDLDQSSA---LFIPFF 267 (493)
T ss_pred ----EEecCCC------CcceEEEEcCCCCeEEEEec---CCCCCCeEEEEeCCCCCCc-eeEeccCCCCc---eEEEEE
Confidence 0111111 1123466777766543 222 3356788899987764322 22333332211 111222
Q ss_pred cCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 274 SMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 274 slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
+-+.+-++.. -++|+.+++||+.....+++
T Consensus 268 d~d~~~L~lg-----------gkgDg~Iriwdl~~~~~~~~ 297 (493)
T PTZ00421 268 DEDTNLLYIG-----------SKGEGNIRCFELMNERLTFC 297 (493)
T ss_pred cCCCCEEEEE-----------EeCCCeEEEEEeeCCceEEE
Confidence 2222111110 13688899999877766554
No 28
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.74 E-value=1e-16 Score=174.99 Aligned_cols=265 Identities=15% Similarity=0.180 Sum_probs=179.5
Q ss_pred CCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc--EEEEEe-cC
Q 000425 16 PLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA--VHYTLQ-LD 92 (1525)
Q Consensus 16 P~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k--~l~tL~-H~ 92 (1525)
++.|.+++++.-|.+|++|.. ..|++...+.. -...|+.+...|++..||+|+. ..||+||+.+++ ++.+|+ |.
T Consensus 7 ~d~~viLvsA~YDhTIRfWqa-~tG~C~rTiqh-~dsqVNrLeiTpdk~~LAaa~~-qhvRlyD~~S~np~Pv~t~e~h~ 83 (311)
T KOG0315|consen 7 TDDPVILVSAGYDHTIRFWQA-LTGICSRTIQH-PDSQVNRLEITPDKKDLAAAGN-QHVRLYDLNSNNPNPVATFEGHT 83 (311)
T ss_pred CCCceEEEeccCcceeeeeeh-hcCeEEEEEec-CccceeeEEEcCCcchhhhccC-CeeEEEEccCCCCCceeEEeccC
Confidence 346788999989999999984 55666666654 4578999999999999999886 579999999875 688999 99
Q ss_pred CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 93 NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 93 ~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+.|. +|.|+-||+|+++||.||++||||++. ..+...+ .|..+|+++..||+..-|++|..+|.|++||++.
T Consensus 84 kNVt-----aVgF~~dgrWMyTgseDgt~kIWdlR~--~~~qR~~-~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~ 155 (311)
T KOG0315|consen 84 KNVT-----AVGFQCDGRWMYTGSEDGTVKIWDLRS--LSCQRNY-QHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGE 155 (311)
T ss_pred CceE-----EEEEeecCeEEEecCCCceEEEEeccC--cccchhc-cCCCCcceEEecCCcceEEeecCCCcEEEEEccC
Confidence 9999 999999999999999999999999995 3332222 5789999999999999999999999999999983
Q ss_pred ecC-----CCC-CCCccccccCCCc-----ccccccccccccCCcceecCC----------CeeEEEEecCCCeEEEEec
Q 000425 173 IIN-----PNR-PPMQANFFEPASI-----ESIDIPRILSQQGGEAVYPLP----------RVRALEVHPRLNLAVLLFA 231 (1525)
Q Consensus 173 ~~~-----p~~-~~~~a~~f~p~gV-----esldi~~iLs~~gG~~vy~l~----------~V~~v~~HP~~Nlial~F~ 231 (1525)
... |.. -.++.....++|- +.---..+.....+...-++. -+..+.+.|..-+++
T Consensus 156 ~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~la---- 231 (311)
T KOG0315|consen 156 NSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLA---- 231 (311)
T ss_pred CccccccCCCCCcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcEEE----
Confidence 211 111 0111111111110 000000000001111111111 144567777777777
Q ss_pred ccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhce
Q 000425 232 NFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAF 311 (1525)
Q Consensus 232 ~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~ 311 (1525)
+++.|...|+|...+= -.+--++.+|..-+.. -.|++.|-. ++ .-+.|+..++||+...+.
T Consensus 232 --t~ssdktv~iwn~~~~-~kle~~l~gh~rWvWd---c~FS~dg~Y-lv------------Tassd~~~rlW~~~~~k~ 292 (311)
T KOG0315|consen 232 --TCSSDKTVKIWNTDDF-FKLELVLTGHQRWVWD---CAFSADGEY-LV------------TASSDHTARLWDLSAGKE 292 (311)
T ss_pred --eecCCceEEEEecCCc-eeeEEEeecCCceEEe---eeeccCccE-EE------------ecCCCCceeecccccCce
Confidence 8889999999987654 3344578888876633 222222221 11 112578899999877765
Q ss_pred eee
Q 000425 312 LYS 314 (1525)
Q Consensus 312 i~s 314 (1525)
|-.
T Consensus 293 v~q 295 (311)
T KOG0315|consen 293 VRQ 295 (311)
T ss_pred eee
Confidence 544
No 29
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.73 E-value=5.9e-18 Score=188.67 Aligned_cols=153 Identities=12% Similarity=0.165 Sum_probs=133.1
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCC-cc------ceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGG-RA------PTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT 82 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg-k~------~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t 82 (1525)
.+..|+|++.++..|+ .|.-+.+|+...+. +. ...+.. |..+|.|+.|+.|..++|+|+.||.|++|.+.+
T Consensus 217 EcA~FSPDgqyLvsgS-vDGFiEVWny~~GKlrKDLkYQAqd~fMM-md~aVlci~FSRDsEMlAsGsqDGkIKvWri~t 294 (508)
T KOG0275|consen 217 ECARFSPDGQYLVSGS-VDGFIEVWNYTTGKLRKDLKYQAQDNFMM-MDDAVLCISFSRDSEMLASGSQDGKIKVWRIET 294 (508)
T ss_pred hheeeCCCCceEeecc-ccceeeeehhccchhhhhhhhhhhcceee-cccceEEEeecccHHHhhccCcCCcEEEEEEec
Confidence 4568999999987776 58889999976652 11 112222 789999999999999999999999999999999
Q ss_pred CcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 83 YAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 83 ~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
+.|++.|. |...|+ |+.||.|+..++++|.|.++||.-+.+. .+...+.||++.|+.+.|.+||.+++|+|
T Consensus 295 G~ClRrFdrAHtkGvt-----~l~FSrD~SqiLS~sfD~tvRiHGlKSG--K~LKEfrGHsSyvn~a~ft~dG~~iisaS 367 (508)
T KOG0275|consen 295 GQCLRRFDRAHTKGVT-----CLSFSRDNSQILSASFDQTVRIHGLKSG--KCLKEFRGHSSYVNEATFTDDGHHIISAS 367 (508)
T ss_pred chHHHHhhhhhccCee-----EEEEccCcchhhcccccceEEEeccccc--hhHHHhcCccccccceEEcCCCCeEEEec
Confidence 99999999 999999 9999999999999999999999998853 34444569999999999999999999999
Q ss_pred CCCCeeEEEee
Q 000425 161 RDGSLQVWKTR 171 (1525)
Q Consensus 161 ~DgtIkVWDlr 171 (1525)
.||+|+||+.+
T Consensus 368 sDgtvkvW~~K 378 (508)
T KOG0275|consen 368 SDGTVKVWHGK 378 (508)
T ss_pred CCccEEEecCc
Confidence 99999999987
No 30
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.72 E-value=4.9e-18 Score=194.88 Aligned_cols=205 Identities=20% Similarity=0.362 Sum_probs=162.1
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
-+.+|..|.|.|+|+.|++|+..|.+.+|+..+......++ |+..|+ ++.|++++.|++||+.+|.||+|+.+-
T Consensus 95 vkc~V~~v~WtPeGRRLltgs~SGEFtLWNg~~fnFEtilQaHDs~Vr-----~m~ws~~g~wmiSgD~gG~iKyWqpnm 169 (464)
T KOG0284|consen 95 VKCPVNVVRWTPEGRRLLTGSQSGEFTLWNGTSFNFETILQAHDSPVR-----TMKWSHNGTWMISGDKGGMIKYWQPNM 169 (464)
T ss_pred cccceeeEEEcCCCceeEeecccccEEEecCceeeHHHHhhhhcccce-----eEEEccCCCEEEEcCCCceEEecccch
Confidence 35689999999999999999999999999985544444445 999999 999999999999999999999999873
Q ss_pred CCceeeEeeccc-CcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcc
Q 000425 129 ERPSMIGIIQVG-SQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEA 207 (1525)
Q Consensus 129 ~~~s~i~il~gH-~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~ 207 (1525)
. .+...++| ...|++++|+|+...|+|||+|++|+|||.+...+ .++|.++|
T Consensus 170 n---nVk~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~ke---------------------e~vL~GHg--- 222 (464)
T KOG0284|consen 170 N---NVKIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKE---------------------ERVLRGHG--- 222 (464)
T ss_pred h---hhHHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCCch---------------------hheeccCC---
Confidence 2 22223344 48999999999999999999999999999874322 12345554
Q ss_pred eecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHH
Q 000425 208 VYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQ 287 (1525)
Q Consensus 208 vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~ 287 (1525)
-.|+++.|||+..+++ ++++|..+|.|..+.|. |+.++.+|.+.+-. .++-.-+. |
T Consensus 223 ----wdVksvdWHP~kgLia------sgskDnlVKlWDprSg~--cl~tlh~HKntVl~---~~f~~n~N------~--- 278 (464)
T KOG0284|consen 223 ----WDVKSVDWHPTKGLIA------SGSKDNLVKLWDPRSGS--CLATLHGHKNTVLA---VKFNPNGN------W--- 278 (464)
T ss_pred ----CCcceeccCCccceeE------EccCCceeEeecCCCcc--hhhhhhhccceEEE---EEEcCCCC------e---
Confidence 2578999999999998 88889999999999984 66689999997633 33333332 2
Q ss_pred HHHHhhhccCCCceeeehhhhhceeee
Q 000425 288 AQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 288 ~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
-...+.|..+++.||.-.+++++
T Consensus 279 ----Llt~skD~~~kv~DiR~mkEl~~ 301 (464)
T KOG0284|consen 279 ----LLTGSKDQSCKVFDIRTMKELFT 301 (464)
T ss_pred ----eEEccCCceEEEEehhHhHHHHH
Confidence 22345788899999987777877
No 31
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.72 E-value=1.3e-16 Score=174.29 Aligned_cols=210 Identities=16% Similarity=0.258 Sum_probs=166.1
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCc-cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGR-APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk-~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
.|.++|++..+.++. ...|++++...+.. +...+. +|++.|+++.|+.+|+++++|+.||++||||++...+.+.+
T Consensus 45 rLeiTpdk~~LAaa~--~qhvRlyD~~S~np~Pv~t~e-~h~kNVtaVgF~~dgrWMyTgseDgt~kIWdlR~~~~qR~~ 121 (311)
T KOG0315|consen 45 RLEITPDKKDLAAAG--NQHVRLYDLNSNNPNPVATFE-GHTKNVTAVGFQCDGRWMYTGSEDGTVKIWDLRSLSCQRNY 121 (311)
T ss_pred eEEEcCCcchhhhcc--CCeeEEEEccCCCCCceeEEe-ccCCceEEEEEeecCeEEEecCCCceEEEEeccCcccchhc
Confidence 588999999887776 46788888655543 344444 49999999999999999999999999999999999999999
Q ss_pred ecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC--c-----------e--------ee--------------
Q 000425 90 QLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER--P-----------S--------MI-------------- 134 (1525)
Q Consensus 90 ~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~--~-----------s--------~i-------------- 134 (1525)
+|..+|+ +|..||....|++|..+|.|+|||+.... . + +.
T Consensus 122 ~~~spVn-----~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l 196 (311)
T KOG0315|consen 122 QHNSPVN-----TVVLHPNQTELISGDQSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKGNCYVWRL 196 (311)
T ss_pred cCCCCcc-----eEEecCCcceEEeecCCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCccEEEEEc
Confidence 9889999 99999999999999999999999987420 0 0 00
Q ss_pred ------------EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccc
Q 000425 135 ------------GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQ 202 (1525)
Q Consensus 135 ------------~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~ 202 (1525)
..+++|+..+..+.+|||++||++||.|++++||++...+. +...+..
T Consensus 197 ~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~k--------------------le~~l~g 256 (311)
T KOG0315|consen 197 LNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFK--------------------LELVLTG 256 (311)
T ss_pred cCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEEEecCCcee--------------------eEEEeec
Confidence 02457899999999999999999999999999999864311 1111221
Q ss_pred cCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCC
Q 000425 203 QGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGS 263 (1525)
Q Consensus 203 ~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~ 263 (1525)
+ ..+ +|.++|.-.+-.+. +++.|...|+|....|+ .+..+++|.-.
T Consensus 257 h---~rW----vWdc~FS~dg~Ylv------Tassd~~~rlW~~~~~k--~v~qy~gh~K~ 302 (311)
T KOG0315|consen 257 H---QRW----VWDCAFSADGEYLV------TASSDHTARLWDLSAGK--EVRQYQGHHKA 302 (311)
T ss_pred C---Cce----EEeeeeccCccEEE------ecCCCCceeecccccCc--eeeecCCcccc
Confidence 1 112 68899998888877 88888999999988875 44578888764
No 32
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.72 E-value=1.4e-16 Score=181.33 Aligned_cols=195 Identities=16% Similarity=0.199 Sum_probs=158.9
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
.+...++|-|.+..|+.++ ++.+|+.|+ ...+.+...+.+ |...|+.++.+-||.++++++.|.++++|-+.++++.
T Consensus 194 h~vS~V~f~P~gd~ilS~s-rD~tik~We-~~tg~cv~t~~~-h~ewvr~v~v~~DGti~As~s~dqtl~vW~~~t~~~k 270 (406)
T KOG0295|consen 194 HGVSSVFFLPLGDHILSCS-RDNTIKAWE-CDTGYCVKTFPG-HSEWVRMVRVNQDGTIIASCSNDQTLRVWVVATKQCK 270 (406)
T ss_pred cceeeEEEEecCCeeeecc-cccceeEEe-cccceeEEeccC-chHhEEEEEecCCeeEEEecCCCceEEEEEeccchhh
Confidence 3667899999999887665 799999998 566777777766 9999999999999999999999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCC---------------CCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEc
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPT---------------LEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWL 150 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPd---------------g~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafs 150 (1525)
..++ |+..|. +++|-|. +++++++|.|++||+||+.+. .+.-++.+|...|..++||
T Consensus 271 ~~lR~hEh~vE-----ci~wap~~~~~~i~~at~~~~~~~~l~s~SrDktIk~wdv~tg--~cL~tL~ghdnwVr~~af~ 343 (406)
T KOG0295|consen 271 AELREHEHPVE-----CIAWAPESSYPSISEATGSTNGGQVLGSGSRDKTIKIWDVSTG--MCLFTLVGHDNWVRGVAFS 343 (406)
T ss_pred hhhhccccceE-----EEEecccccCcchhhccCCCCCccEEEeecccceEEEEeccCC--eEEEEEecccceeeeeEEc
Confidence 9999 999999 9998653 258999999999999999964 4444577999999999999
Q ss_pred CCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEe
Q 000425 151 PMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLF 230 (1525)
Q Consensus 151 Pdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F 230 (1525)
|.|+||+||.+|+++++||++....-+..+ + ..+ =++++.||.+...+.
T Consensus 344 p~Gkyi~ScaDDktlrvwdl~~~~cmk~~~-------a------------h~h---------fvt~lDfh~~~p~Vv--- 392 (406)
T KOG0295|consen 344 PGGKYILSCADDKTLRVWDLKNLQCMKTLE-------A------------HEH---------FVTSLDFHKTAPYVV--- 392 (406)
T ss_pred CCCeEEEEEecCCcEEEEEeccceeeeccC-------C------------Ccc---------eeEEEecCCCCceEE---
Confidence 999999999999999999998432100000 0 000 156788888877766
Q ss_pred cccccCCcccccccc
Q 000425 231 ANFTGGDNIKNRAAY 245 (1525)
Q Consensus 231 ~~~s~~Dd~~~k~~~ 245 (1525)
+|+-|.+.|.|-
T Consensus 393 ---TGsVdqt~KvwE 404 (406)
T KOG0295|consen 393 ---TGSVDQTVKVWE 404 (406)
T ss_pred ---eccccceeeeee
Confidence 777777777664
No 33
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.71 E-value=3.8e-16 Score=174.42 Aligned_cols=251 Identities=14% Similarity=0.119 Sum_probs=171.7
Q ss_pred cEE-EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEV-HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~-~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+|+ .+.|||+++.+..|.. |.-|.+|+......-..++++ |++.|+.+.|.+++..+++++.|.+|+.||.++++++
T Consensus 48 geI~~~~F~P~gs~~aSgG~-Dr~I~LWnv~gdceN~~~lkg-HsgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~~ 125 (338)
T KOG0265|consen 48 GEIYTIKFHPDGSCFASGGS-DRAIVLWNVYGDCENFWVLKG-HSGAVMELHGMRDGSHILSCGTDKTVRGWDAETGKRI 125 (338)
T ss_pred ceEEEEEECCCCCeEeecCC-cceEEEEeccccccceeeecc-ccceeEeeeeccCCCEEEEecCCceEEEEecccceee
Confidence 444 5789999998877775 778889987666666677776 9999999999999999999999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCE-EEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEW-LFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~-L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
+.++ |.+-|+ +++-+..|.. +.|+++|+++|+||+++ +..+.++ .....++++.|..++.-+.+|+-|+.
T Consensus 126 rk~k~h~~~vN-----s~~p~rrg~~lv~SgsdD~t~kl~D~R~--k~~~~t~-~~kyqltAv~f~d~s~qv~sggIdn~ 197 (338)
T KOG0265|consen 126 RKHKGHTSFVN-----SLDPSRRGPQLVCSGSDDGTLKLWDIRK--KEAIKTF-ENKYQLTAVGFKDTSDQVISGGIDND 197 (338)
T ss_pred ehhccccceee-----ecCccccCCeEEEecCCCceEEEEeecc--cchhhcc-ccceeEEEEEecccccceeeccccCc
Confidence 9999 999999 8875555654 45778999999999994 3333222 12456899999999999999999999
Q ss_pred eeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccc
Q 000425 165 LQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 165 IkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
|++||+|.. +...+++++. ..|+.+.-.|.++.+. +-+=|.+.+.|
T Consensus 198 ikvWd~r~~---------------------d~~~~lsGh~-------DtIt~lsls~~gs~ll------snsMd~tvrvw 243 (338)
T KOG0265|consen 198 IKVWDLRKN---------------------DGLYTLSGHA-------DTITGLSLSRYGSFLL------SNSMDNTVRVW 243 (338)
T ss_pred eeeeccccC---------------------cceEEeeccc-------CceeeEEeccCCCccc------cccccceEEEE
Confidence 999999721 0111111111 1233333333333322 22333333444
Q ss_pred cccc--CceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 245 YTRE--GRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 245 ~~~e--g~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
..+. -...++.++++|.-+--.-+..--|+-+..++-++. .|+...|||-.-+..+|-
T Consensus 244 d~rp~~p~~R~v~if~g~~hnfeknlL~cswsp~~~~i~ags------------~dr~vyvwd~~~r~~lyk 303 (338)
T KOG0265|consen 244 DVRPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPNGTKITAGS------------ADRFVYVWDTTSRRILYK 303 (338)
T ss_pred EecccCCCCceEEEeecchhhhhhhcceeeccCCCCcccccc------------ccceEEEeecccccEEEE
Confidence 3332 122334566666554322122333444445555555 799999999888888865
No 34
>PTZ00421 coronin; Provisional
Probab=99.71 E-value=7.6e-16 Score=189.98 Aligned_cols=205 Identities=16% Similarity=0.230 Sum_probs=152.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCC------ccceEEccCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG------RAPTKIKTDLKKPIVNLACHPRL-PVLYVAYADGLIRAYN 79 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg------k~~tklk~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWD 79 (1525)
+....++|+|....++++...+.+|++|+..+++ .....+.+ |...|.+++|||++ .+|++|+.|++|+|||
T Consensus 76 ~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~g-H~~~V~~l~f~P~~~~iLaSgs~DgtVrIWD 154 (493)
T PTZ00421 76 GPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQG-HTKKVGIVSFHPSAMNVLASAGADMVVNVWD 154 (493)
T ss_pred CCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecC-CCCcEEEEEeCcCCCCEEEEEeCCCEEEEEE
Confidence 4567899999544444555579999999865432 22334554 99999999999986 6899999999999999
Q ss_pred cCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcC-eEEEEEcCCCCEEE
Q 000425 80 IHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQP-ITSVAWLPMLRLLV 157 (1525)
Q Consensus 80 l~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~-VtsVafsPdg~~La 157 (1525)
+.+++....+. |...|. +++|+|+|..|++++.|++|++||.++. ..+..+.+|... +..+.|++++..++
T Consensus 155 l~tg~~~~~l~~h~~~V~-----sla~spdG~lLatgs~Dg~IrIwD~rsg--~~v~tl~~H~~~~~~~~~w~~~~~~iv 227 (493)
T PTZ00421 155 VERGKAVEVIKCHSDQIT-----SLEWNLDGSLLCTTSKDKKLNIIDPRDG--TIVSSVEAHASAKSQRCLWAKRKDLII 227 (493)
T ss_pred CCCCeEEEEEcCCCCceE-----EEEEECCCCEEEEecCCCEEEEEECCCC--cEEEEEecCCCCcceEEEEcCCCCeEE
Confidence 99999999999 999999 9999999999999999999999999953 334445678765 35688999988887
Q ss_pred EEE----CCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEeccc
Q 000425 158 TLC----RDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANF 233 (1525)
Q Consensus 158 SgS----~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~ 233 (1525)
+++ .|++|++||+|....+. . . .++ ... ..+..+.|+|..+++++
T Consensus 228 t~G~s~s~Dr~VklWDlr~~~~p~----~--~--------~~~----d~~--------~~~~~~~~d~d~~~L~l----- 276 (493)
T PTZ00421 228 TLGCSKSQQRQIMLWDTRKMASPY----S--T--------VDL----DQS--------SALFIPFFDEDTNLLYI----- 276 (493)
T ss_pred EEecCCCCCCeEEEEeCCCCCCce----e--E--------ecc----CCC--------CceEEEEEcCCCCEEEE-----
Confidence 764 47899999997432210 0 0 000 000 12345678888887662
Q ss_pred ccCCcccccccccccCc
Q 000425 234 TGGDNIKNRAAYTREGR 250 (1525)
Q Consensus 234 s~~Dd~~~k~~~~~eg~ 250 (1525)
.+..|..++.|..+++.
T Consensus 277 ggkgDg~Iriwdl~~~~ 293 (493)
T PTZ00421 277 GSKGEGNIRCFELMNER 293 (493)
T ss_pred EEeCCCeEEEEEeeCCc
Confidence 23357788888877664
No 35
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.68 E-value=9.3e-16 Score=170.77 Aligned_cols=188 Identities=19% Similarity=0.205 Sum_probs=152.0
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcC-CCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHP-RLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~P-dg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
..|-+ ...+++++ .|+++.+|+ +|.+.....+.+ |.+-|.+++++| +++.|++|+-|++.+|||++.+.+.++|.
T Consensus 151 C~f~d-D~~ilT~S-GD~TCalWD-ie~g~~~~~f~G-H~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~ 226 (343)
T KOG0286|consen 151 CRFLD-DNHILTGS-GDMTCALWD-IETGQQTQVFHG-HTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFE 226 (343)
T ss_pred EEEcC-CCceEecC-CCceEEEEE-cccceEEEEecC-CcccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeec
Confidence 34445 33455555 699999998 788888777776 999999999999 88999999999999999999999999999
Q ss_pred -cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeec--ccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 91 -LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQ--VGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 91 -H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~--gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
|+..|+ +|+|+|+|.-+++||+|+++|+||++.. +++. .+. .-...|++++||-.|++|..|-.|.++.+
T Consensus 227 ghesDIN-----sv~ffP~G~afatGSDD~tcRlyDlRaD-~~~a-~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~v 299 (343)
T KOG0286|consen 227 GHESDIN-----SVRFFPSGDAFATGSDDATCRLYDLRAD-QELA-VYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNV 299 (343)
T ss_pred ccccccc-----eEEEccCCCeeeecCCCceeEEEeecCC-cEEe-eeccCcccCCceeEEEcccccEEEeeecCCceeE
Confidence 999999 9999999999999999999999999963 2221 222 23457899999999999999999999999
Q ss_pred EEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccc
Q 000425 168 WKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 168 WDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
||.-.. +...+|.+++ .||.++...|.+..++ +++=|...|+|
T Consensus 300 WDtlk~---------------------e~vg~L~GHe-------NRvScl~~s~DG~av~------TgSWDs~lriW 342 (343)
T KOG0286|consen 300 WDTLKG---------------------ERVGVLAGHE-------NRVSCLGVSPDGMAVA------TGSWDSTLRIW 342 (343)
T ss_pred eecccc---------------------ceEEEeeccC-------CeeEEEEECCCCcEEE------ecchhHheeec
Confidence 996210 0111344444 3789999999998777 77777777766
No 36
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.68 E-value=2.4e-16 Score=179.37 Aligned_cols=215 Identities=14% Similarity=0.133 Sum_probs=173.2
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
|....|+|+-.+-++.. ...|+++.+|+..+--.+...+.+ |+..|.++.|-|.|.++++++.|.+|+.|++.++-++
T Consensus 151 ~sv~di~~~a~Gk~l~t-cSsDl~~~LWd~~~~~~c~ks~~g-h~h~vS~V~f~P~gd~ilS~srD~tik~We~~tg~cv 228 (406)
T KOG0295|consen 151 DSVFDISFDASGKYLAT-CSSDLSAKLWDFDTFFRCIKSLIG-HEHGVSSVFFLPLGDHILSCSRDNTIKAWECDTGYCV 228 (406)
T ss_pred cceeEEEEecCccEEEe-cCCccchhheeHHHHHHHHHHhcC-cccceeeEEEEecCCeeeecccccceeEEecccceeE
Confidence 45678999988866654 446888999986443333333444 9999999999999999999999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-------------
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM------------- 152 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd------------- 152 (1525)
++|. |...|+ .++.+.||..++++|.|.+|++|-+.+ .++...+..|..+|.|++|-|.
T Consensus 229 ~t~~~h~ewvr-----~v~v~~DGti~As~s~dqtl~vW~~~t--~~~k~~lR~hEh~vEci~wap~~~~~~i~~at~~~ 301 (406)
T KOG0295|consen 229 KTFPGHSEWVR-----MVRVNQDGTIIASCSNDQTLRVWVVAT--KQCKAELREHEHPVECIAWAPESSYPSISEATGST 301 (406)
T ss_pred EeccCchHhEE-----EEEecCCeeEEEecCCCceEEEEEecc--chhhhhhhccccceEEEEecccccCcchhhccCCC
Confidence 9999 999999 999999999999999999999999884 4455557799999999999773
Q ss_pred --CCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEe
Q 000425 153 --LRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLF 230 (1525)
Q Consensus 153 --g~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F 230 (1525)
+++++++|.|++||+||+.+... - ..|.+++ .-|..++|||.+-+++
T Consensus 302 ~~~~~l~s~SrDktIk~wdv~tg~c--------L-------------~tL~ghd-------nwVr~~af~p~Gkyi~--- 350 (406)
T KOG0295|consen 302 NGGQVLGSGSRDKTIKIWDVSTGMC--------L-------------FTLVGHD-------NWVRGVAFSPGGKYIL--- 350 (406)
T ss_pred CCccEEEeecccceEEEEeccCCeE--------E-------------EEEeccc-------ceeeeeEEcCCCeEEE---
Confidence 25999999999999999874321 0 0122221 2377899999999988
Q ss_pred cccccCCcccccccccccCceeeeeeeccccCCccc
Q 000425 231 ANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSAS 266 (1525)
Q Consensus 231 ~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s 266 (1525)
+..||+..++|....+ ++..++..|...+.+
T Consensus 351 ---ScaDDktlrvwdl~~~--~cmk~~~ah~hfvt~ 381 (406)
T KOG0295|consen 351 ---SCADDKTLRVWDLKNL--QCMKTLEAHEHFVTS 381 (406)
T ss_pred ---EEecCCcEEEEEeccc--eeeeccCCCcceeEE
Confidence 8999999999998776 456677777776543
No 37
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.68 E-value=3.8e-15 Score=169.85 Aligned_cols=156 Identities=17% Similarity=0.312 Sum_probs=134.5
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.+..+.+++||+.+++.+|..+++ -.+|....+. ....+. +|++.|+++.|+.+|.+||+|..+|.|+||...++..
T Consensus 64 ~~svFavsl~P~~~l~aTGGgDD~-AflW~~~~ge-~~~elt-gHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~ 140 (399)
T KOG0296|consen 64 TDSVFAVSLHPNNNLVATGGGDDL-AFLWDISTGE-FAGELT-GHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGE 140 (399)
T ss_pred CCceEEEEeCCCCceEEecCCCce-EEEEEccCCc-ceeEec-CCCCceEEEEEccCceEEEecCCCccEEEEEcccCce
Confidence 357899999999999999886554 4577765554 333344 5999999999999999999999999999999999988
Q ss_pred EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 86 HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 86 l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
...+. .-..+. -+.|||.+..|+.|+.||.+.+|.+.+. ...+.+.||+.++++=.|.|+|+.++++..|++
T Consensus 141 ~~~~~~e~~die-----Wl~WHp~a~illAG~~DGsvWmw~ip~~--~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dgt 213 (399)
T KOG0296|consen 141 QWKLDQEVEDIE-----WLKWHPRAHILLAGSTDGSVWMWQIPSQ--ALCKVMSGHNSPCTCGEFIPDGKRILTGYDDGT 213 (399)
T ss_pred EEEeecccCceE-----EEEecccccEEEeecCCCcEEEEECCCc--ceeeEecCCCCCcccccccCCCceEEEEecCce
Confidence 88886 556677 8999999999999999999999999852 444567799999999999999999999999999
Q ss_pred eeEEEee
Q 000425 165 LQVWKTR 171 (1525)
Q Consensus 165 IkVWDlr 171 (1525)
|++||..
T Consensus 214 i~~Wn~k 220 (399)
T KOG0296|consen 214 IIVWNPK 220 (399)
T ss_pred EEEEecC
Confidence 9999987
No 38
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.68 E-value=2.2e-14 Score=155.37 Aligned_cols=234 Identities=20% Similarity=0.264 Sum_probs=167.1
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+...+++++|++.+++++.. +..+.+|+... +.....+.. |...+..+.|+|+++.+++++.|+.|++||+.+++..
T Consensus 10 ~~i~~~~~~~~~~~l~~~~~-~g~i~i~~~~~-~~~~~~~~~-~~~~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~~~ 86 (289)
T cd00200 10 GGVTCVAFSPDGKLLATGSG-DGTIKVWDLET-GELLRTLKG-HTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECV 86 (289)
T ss_pred CCEEEEEEcCCCCEEEEeec-CcEEEEEEeeC-CCcEEEEec-CCcceeEEEECCCCCEEEEEcCCCeEEEEEcCcccce
Confidence 56779999999999888774 78888988543 333444444 8999999999999999999999999999999998888
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
..+. |...+. ++.|+++++++++++.|+.|++||+.+. .....+..|...|+++.|+|++.++++++.|+.|
T Consensus 87 ~~~~~~~~~i~-----~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i 159 (289)
T cd00200 87 RTLTGHTSYVS-----SVAFSPDGRILSSSSRDKTIKVWDVETG--KCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159 (289)
T ss_pred EEEeccCCcEE-----EEEEcCCCCEEEEecCCCeEEEEECCCc--EEEEEeccCCCcEEEEEEcCcCCEEEEEcCCCcE
Confidence 8888 888999 9999999888888888999999999843 3333455899999999999999999999889999
Q ss_pred eEEEeeee-----cCCCCCCCccccccCCC-----------cccccccccccccCCcceec----CCCeeEEEEecCCCe
Q 000425 166 QVWKTRVI-----INPNRPPMQANFFEPAS-----------IESIDIPRILSQQGGEAVYP----LPRVRALEVHPRLNL 225 (1525)
Q Consensus 166 kVWDlr~~-----~~p~~~~~~a~~f~p~g-----------Vesldi~~iLs~~gG~~vy~----l~~V~~v~~HP~~Nl 225 (1525)
++||++.. ...+........+.+++ +..+++. .+..... -..+..+.|+|..++
T Consensus 160 ~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~------~~~~~~~~~~~~~~i~~~~~~~~~~~ 233 (289)
T cd00200 160 KLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS------TGKCLGTLRGHENGVNSVAFSPDGYL 233 (289)
T ss_pred EEEEccccccceeEecCccccceEEECCCcCEEEEecCCCcEEEEECC------CCceecchhhcCCceEEEEEcCCCcE
Confidence 99999721 11111122222223322 1111110 0111111 125778888888666
Q ss_pred EEEEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 226 AVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 226 ial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
++ +++.+..+++|....++ ....++.|.+..
T Consensus 234 ~~------~~~~~~~i~i~~~~~~~--~~~~~~~~~~~i 264 (289)
T cd00200 234 LA------SGSEDGTIRVWDLRTGE--CVQTLSGHTNSV 264 (289)
T ss_pred EE------EEcCCCcEEEEEcCCce--eEEEccccCCcE
Confidence 65 45557788888766542 344566665543
No 39
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.67 E-value=5.7e-15 Score=192.45 Aligned_cols=204 Identities=13% Similarity=0.103 Sum_probs=153.0
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcC-CCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHP-RLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~P-dg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
...++++|.....++++..+.++.+|+. +.+.....+.. |.+.|++++|+| ++.+|++|+.|++|++||+.++.++.
T Consensus 535 v~~l~~~~~~~~~las~~~Dg~v~lWd~-~~~~~~~~~~~-H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~ 612 (793)
T PLN00181 535 LSGICWNSYIKSQVASSNFEGVVQVWDV-ARSQLVTEMKE-HEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIG 612 (793)
T ss_pred eeeEEeccCCCCEEEEEeCCCeEEEEEC-CCCeEEEEecC-CCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEE
Confidence 3467888764434455557999999985 44445555555 999999999997 67899999999999999999999998
Q ss_pred EEecCCceeecCceeEEEe-CCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQLDNTIKLLGAGAFAFH-PTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFS-Pdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
++.+...+. ++.|+ +++.++++|+.|++|++||+++.. .....+.+|...|+++.|+ ++.+|+|+|.|++|+
T Consensus 613 ~~~~~~~v~-----~v~~~~~~g~~latgs~dg~I~iwD~~~~~-~~~~~~~~h~~~V~~v~f~-~~~~lvs~s~D~~ik 685 (793)
T PLN00181 613 TIKTKANIC-----CVQFPSESGRSLAFGSADHKVYYYDLRNPK-LPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLK 685 (793)
T ss_pred EEecCCCeE-----EEEEeCCCCCEEEEEeCCCeEEEEECCCCC-ccceEecCCCCCEEEEEEe-CCCEEEEEECCCEEE
Confidence 888556788 99995 478999999999999999998532 1223456899999999997 678999999999999
Q ss_pred EEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccc
Q 000425 167 VWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 167 VWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~ 246 (1525)
+||++........ .+ + ..+..+. ..++.+.|+|..++++ ++++|..+++|..
T Consensus 686 iWd~~~~~~~~~~-------~~--l------~~~~gh~-------~~i~~v~~s~~~~~la------sgs~D~~v~iw~~ 737 (793)
T PLN00181 686 LWDLSMSISGINE-------TP--L------HSFMGHT-------NVKNFVGLSVSDGYIA------TGSETNEVFVYHK 737 (793)
T ss_pred EEeCCCCccccCC-------cc--e------EEEcCCC-------CCeeEEEEcCCCCEEE------EEeCCCEEEEEEC
Confidence 9998632110000 00 0 0111111 2357789999999888 8888999998876
Q ss_pred ccC
Q 000425 247 REG 249 (1525)
Q Consensus 247 ~eg 249 (1525)
..+
T Consensus 738 ~~~ 740 (793)
T PLN00181 738 AFP 740 (793)
T ss_pred CCC
Confidence 543
No 40
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.67 E-value=9.5e-16 Score=168.07 Aligned_cols=201 Identities=19% Similarity=0.241 Sum_probs=163.8
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k 84 (1525)
+|+-+-+++++...-++.++..|.+++++++.....+..+++. |+..|.++.|++..+ .++++|.|++||+||..-.+
T Consensus 60 ~D~LfdV~Wse~~e~~~~~a~GDGSLrl~d~~~~s~Pi~~~kE-H~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~ 138 (311)
T KOG0277|consen 60 EDGLFDVAWSENHENQVIAASGDGSLRLFDLTMPSKPIHKFKE-HKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPN 138 (311)
T ss_pred ccceeEeeecCCCcceEEEEecCceEEEeccCCCCcchhHHHh-hhhheEEeccccccceeEEeeccCCceEeecCCCCc
Confidence 4677889999998877788888999999997776777777776 999999999999765 68889999999999998889
Q ss_pred EEEEEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcC-CCCEEEEEEC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP-MLRLLVTLCR 161 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP-dg~~LaSgS~ 161 (1525)
.+.+|. |..-|. ...||| ..+.++++|.|+++++||++...+.+. +..|...|.|+.|+. +.+.++||+.
T Consensus 139 Sv~Tf~gh~~~Iy-----~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~~--i~ah~~Eil~cdw~ky~~~vl~Tg~v 211 (311)
T KOG0277|consen 139 SVQTFNGHNSCIY-----QAAFSPHIPNLFASASGDGTLRLWDVRSPGKFMS--IEAHNSEILCCDWSKYNHNVLATGGV 211 (311)
T ss_pred ceEeecCCccEEE-----EEecCCCCCCeEEEccCCceEEEEEecCCCceeE--EEeccceeEeecccccCCcEEEecCC
Confidence 999999 999999 999999 567888999999999999996433443 568999999999997 6678999999
Q ss_pred CCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecC-CCeEEEEecccccCCccc
Q 000425 162 DGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPR-LNLAVLLFANFTGGDNIK 240 (1525)
Q Consensus 162 DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~-~Nlial~F~~~s~~Dd~~ 240 (1525)
|+.|++||+|....| .| .|..++ | -|+.+.|.|. .++++ +++=|..
T Consensus 212 d~~vr~wDir~~r~p--------l~------------eL~gh~----~---AVRkvk~Sph~~~lLa------SasYDmT 258 (311)
T KOG0277|consen 212 DNLVRGWDIRNLRTP--------LF------------ELNGHG----L---AVRKVKFSPHHASLLA------SASYDMT 258 (311)
T ss_pred CceEEEEehhhcccc--------ce------------eecCCc----e---EEEEEecCcchhhHhh------hccccce
Confidence 999999999965532 00 122221 1 3677888876 46666 7888888
Q ss_pred ccccccc
Q 000425 241 NRAAYTR 247 (1525)
Q Consensus 241 ~k~~~~~ 247 (1525)
.++|...
T Consensus 259 ~riw~~~ 265 (311)
T KOG0277|consen 259 VRIWDPE 265 (311)
T ss_pred EEecccc
Confidence 8888765
No 41
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.67 E-value=7.8e-16 Score=180.48 Aligned_cols=212 Identities=16% Similarity=0.189 Sum_probs=143.6
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEe--------ecC---CccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGT--------VEG---GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIR 76 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~t--------ieg---gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIr 76 (1525)
.-+|+++|++..+++... ..-.++++. ..| .+.....++ |...++|.+|||+.+ .|++++.||++|
T Consensus 217 i~sl~ys~Tg~~iLvvsg-~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKG-Hia~lt~g~whP~~k~~FlT~s~DgtlR 294 (641)
T KOG0772|consen 217 INSLQYSVTGDQILVVSG-SAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKG-HIAELTCGCWHPDNKEEFLTCSYDGTLR 294 (641)
T ss_pred cceeeecCCCCeEEEEec-CcceeEEccCCceeeeeeccchhhhhhhccCC-ceeeeeccccccCcccceEEecCCCcEE
Confidence 457899999888877664 333445442 111 122333344 999999999999875 799999999999
Q ss_pred EEEcCCC-cEEEEEec------CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-CceeeEeecccCc--CeEE
Q 000425 77 AYNIHTY-AVHYTLQL------DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-RPSMIGIIQVGSQ--PITS 146 (1525)
Q Consensus 77 LWDl~t~-k~l~tL~H------~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH~~--~Vts 146 (1525)
|||+... +.+..|+| .-++. .++|+|||++|++|+.||+|.+||..+. .+....+.++|.. .|+|
T Consensus 295 iWdv~~~k~q~qVik~k~~~g~Rv~~t-----sC~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g~~Its 369 (641)
T KOG0772|consen 295 IWDVNNTKSQLQVIKTKPAGGKRVPVT-----SCAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLPGQDITS 369 (641)
T ss_pred EEecCCchhheeEEeeccCCCcccCce-----eeecCCCcchhhhcccCCceeeeecCCcccccceEeeeccCCCCceeE
Confidence 9997654 34445552 23455 8999999999999999999999998763 2222234678887 8999
Q ss_pred EEEcCCCCEEEEEECCCCeeEEEeeeecCC--------CCCCCccccccCCCccccccccccccc-----------CCcc
Q 000425 147 VAWLPMLRLLVTLCRDGSLQVWKTRVIINP--------NRPPMQANFFEPASIESIDIPRILSQQ-----------GGEA 207 (1525)
Q Consensus 147 VafsPdg~~LaSgS~DgtIkVWDlr~~~~p--------~~~~~~a~~f~p~gVesldi~~iLs~~-----------gG~~ 207 (1525)
|+||+||++|+|-|.|.++++||+|....| +...+...+|.|+.--.+--..+-..+ .=+.
T Consensus 370 i~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t~d~ 449 (641)
T KOG0772|consen 370 ISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRMTLDT 449 (641)
T ss_pred EEeccccchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEeccceee
Confidence 999999999999999999999999944322 222333345566431111000000000 0012
Q ss_pred eecC----CCeeEEEEecCCCeEE
Q 000425 208 VYPL----PRVRALEVHPRLNLAV 227 (1525)
Q Consensus 208 vy~l----~~V~~v~~HP~~Nlia 227 (1525)
+|.+ ..|..+.|||++|-+.
T Consensus 450 v~ki~i~~aSvv~~~WhpkLNQi~ 473 (641)
T KOG0772|consen 450 VYKIDISTASVVRCLWHPKLNQIF 473 (641)
T ss_pred EEEecCCCceEEEEeecchhhhee
Confidence 3332 4577899999999876
No 42
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.67 E-value=1.6e-15 Score=169.36 Aligned_cols=155 Identities=21% Similarity=0.347 Sum_probs=126.3
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeec------------C----CccceEEccCCCCCeEEEEEcCCCCEEEEEECCC
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVE------------G----GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADG 73 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tie------------g----gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~Dg 73 (1525)
..-+|+|++.++.+|+. |-+|++.+.-. + ..++.....+|.+.|+.+.|||....|++|+.|+
T Consensus 116 R~aafs~DG~lvATGsa-D~SIKildvermlaks~~~em~~~~~qa~hPvIRTlYDH~devn~l~FHPre~ILiS~srD~ 194 (430)
T KOG0640|consen 116 RAAAFSPDGSLVATGSA-DASIKILDVERMLAKSKPKEMISGDTQARHPVIRTLYDHVDEVNDLDFHPRETILISGSRDN 194 (430)
T ss_pred eeeeeCCCCcEEEccCC-cceEEEeehhhhhhhcchhhhccCCcccCCceEeehhhccCcccceeecchhheEEeccCCC
Confidence 46789999999988885 78898887531 1 1233344457999999999999999999999999
Q ss_pred cEEEEEcCCCcEEEEEe---cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE-eecccCcCeEEEEE
Q 000425 74 LIRAYNIHTYAVHYTLQ---LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG-IIQVGSQPITSVAW 149 (1525)
Q Consensus 74 tIrLWDl~t~k~l~tL~---H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~-il~gH~~~VtsVaf 149 (1525)
+||+||+......+.++ ....|+ ++.|||.|.+|+.|.+-.++++||+++...-... --+.|+..|+++.+
T Consensus 195 tvKlFDfsK~saKrA~K~~qd~~~vr-----siSfHPsGefllvgTdHp~~rlYdv~T~QcfvsanPd~qht~ai~~V~Y 269 (430)
T KOG0640|consen 195 TVKLFDFSKTSAKRAFKVFQDTEPVR-----SISFHPSGEFLLVGTDHPTLRLYDVNTYQCFVSANPDDQHTGAITQVRY 269 (430)
T ss_pred eEEEEecccHHHHHHHHHhhccceee-----eEeecCCCceEEEecCCCceeEEeccceeEeeecCcccccccceeEEEe
Confidence 99999987654444444 567899 9999999999999999999999999974111110 13579999999999
Q ss_pred cCCCCEEEEEECCCCeeEEEe
Q 000425 150 LPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 150 sPdg~~LaSgS~DgtIkVWDl 170 (1525)
++.+++.+|+|.||.||+||-
T Consensus 270 s~t~~lYvTaSkDG~IklwDG 290 (430)
T KOG0640|consen 270 SSTGSLYVTASKDGAIKLWDG 290 (430)
T ss_pred cCCccEEEEeccCCcEEeecc
Confidence 999999999999999999994
No 43
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.66 E-value=1.4e-14 Score=170.72 Aligned_cols=153 Identities=21% Similarity=0.261 Sum_probs=131.1
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEc---cCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIK---TDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk---~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.-.++|-|.+|+-..+..+|.++.++ ++ ++++++ ++|++.|+++.|+|||..|++++.||+|.+||=++++.
T Consensus 150 ins~~~KpsRPfRi~T~sdDn~v~ff---eG--PPFKFk~s~r~HskFV~~VRysPDG~~Fat~gsDgki~iyDGktge~ 224 (603)
T KOG0318|consen 150 INSVDFKPSRPFRIATGSDDNTVAFF---EG--PPFKFKSSFREHSKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEK 224 (603)
T ss_pred EeeeeccCCCceEEEeccCCCeEEEe---eC--CCeeeeecccccccceeeEEECCCCCeEEEecCCccEEEEcCCCccE
Confidence 34567779999998888889888654 44 344443 46999999999999999999999999999999999999
Q ss_pred EEEEe----cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC--C----------ceee---------------
Q 000425 86 HYTLQ----LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE--R----------PSMI--------------- 134 (1525)
Q Consensus 86 l~tL~----H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~--~----------~s~i--------------- 134 (1525)
+..|. |...|. +++|+||+..++++|.|.++||||+.+. . .+.+
T Consensus 225 vg~l~~~~aHkGsIf-----alsWsPDs~~~~T~SaDkt~KIWdVs~~slv~t~~~~~~v~dqqvG~lWqkd~lItVSl~ 299 (603)
T KOG0318|consen 225 VGELEDSDAHKGSIF-----ALSWSPDSTQFLTVSADKTIKIWDVSTNSLVSTWPMGSTVEDQQVGCLWQKDHLITVSLS 299 (603)
T ss_pred EEEecCCCCccccEE-----EEEECCCCceEEEecCCceEEEEEeeccceEEEeecCCchhceEEEEEEeCCeEEEEEcC
Confidence 99987 899999 9999999999999999999999998752 0 0000
Q ss_pred --------------EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 135 --------------GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 135 --------------~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+..+|...|+++..+|++.+|.|||.||.|.-||..
T Consensus 300 G~in~ln~~d~~~~~~i~GHnK~ITaLtv~~d~~~i~SgsyDG~I~~W~~~ 350 (603)
T KOG0318|consen 300 GTINYLNPSDPSVLKVISGHNKSITALTVSPDGKTIYSGSYDGHINSWDSG 350 (603)
T ss_pred cEEEEecccCCChhheecccccceeEEEEcCCCCEEEeeccCceEEEEecC
Confidence 1346899999999999999999999999999999987
No 44
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.66 E-value=2.3e-15 Score=174.07 Aligned_cols=211 Identities=18% Similarity=0.203 Sum_probs=169.4
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
..-.+-+||....++.++ .++.+++|..- ....++.+.. |..+|+.+..||+|.||++++.|+++-..|++++.++.
T Consensus 263 ki~~v~~~~~~~~v~~aS-ad~~i~vws~~-~~s~~~~~~~-h~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt 339 (506)
T KOG0289|consen 263 KITSVKFHKDLDTVITAS-ADEIIRVWSVP-LSSEPTSSRP-HEEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLT 339 (506)
T ss_pred EEEEEEeccchhheeecC-CcceEEeeccc-cccCcccccc-ccccceeeeeccCCcEEEEecCCceEEEEEccCCcEEE
Confidence 445678999999988777 48899988743 3335555555 99999999999999999999999999999999999887
Q ss_pred EEecC---CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 88 TLQLD---NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 88 tL~H~---~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
...++ -.+. +.+|||||-.+.+|..|+.|+|||+.+. ...+-+.+|+++|..|+|+-+|-||+++++|+.
T Consensus 340 ~vs~~~s~v~~t-----s~~fHpDgLifgtgt~d~~vkiwdlks~--~~~a~Fpght~~vk~i~FsENGY~Lat~add~~ 412 (506)
T KOG0289|consen 340 VVSDETSDVEYT-----SAAFHPDGLIFGTGTPDGVVKIWDLKSQ--TNVAKFPGHTGPVKAISFSENGYWLATAADDGS 412 (506)
T ss_pred EEeeccccceeE-----EeeEcCCceEEeccCCCceEEEEEcCCc--cccccCCCCCCceeEEEeccCceEEEEEecCCe
Confidence 77753 3477 9999999999999999999999999963 233345689999999999999999999999999
Q ss_pred eeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecC---CCeeEEEEecCCCeEEEEecccccCCcccc
Q 000425 165 LQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPL---PRVRALEVHPRLNLAVLLFANFTGGDNIKN 241 (1525)
Q Consensus 165 IkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l---~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~ 241 (1525)
|++||+|...+-. .|++ ..++.+.|.+++.+.+ .++.+-++
T Consensus 413 V~lwDLRKl~n~k------------------------------t~~l~~~~~v~s~~fD~SGt~L~------~~g~~l~V 456 (506)
T KOG0289|consen 413 VKLWDLRKLKNFK------------------------------TIQLDEKKEVNSLSFDQSGTYLG------IAGSDLQV 456 (506)
T ss_pred EEEEEehhhcccc------------------------------eeeccccccceeEEEcCCCCeEE------eecceeEE
Confidence 9999999554310 0111 1478889999887776 45778888
Q ss_pred cccccccCceeeeeeeccccCCc
Q 000425 242 RAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 242 k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
++.......|.++..+..|.+.+
T Consensus 457 y~~~k~~k~W~~~~~~~~~sg~s 479 (506)
T KOG0289|consen 457 YICKKKTKSWTEIKELADHSGLS 479 (506)
T ss_pred EEEecccccceeeehhhhccccc
Confidence 87776667788887777777655
No 45
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.66 E-value=1.1e-15 Score=185.42 Aligned_cols=198 Identities=16% Similarity=0.203 Sum_probs=163.6
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecC--CccceEE-----ccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG--GRAPTKI-----KTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI 80 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg--gk~~tkl-----k~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl 80 (1525)
+.-.+|++-.++-.|++.+.|.++++|+.... ...+..+ ...|.+.|+|++++|+..++|+||.|.+.|||++
T Consensus 413 svgava~~~~~asffvsvS~D~tlK~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndkLiAT~SqDktaKiW~l 492 (775)
T KOG0319|consen 413 SVGAVAGSKLGASFFVSVSQDCTLKLWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDKLIATGSQDKTAKIWDL 492 (775)
T ss_pred ccceeeecccCccEEEEecCCceEEEecCCCcccccccceehhhHHHHhhcccccceEecCCCceEEecccccceeeecc
Confidence 44567788888888899889999999987541 1112222 2249999999999999999999999999999999
Q ss_pred CCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE
Q 000425 81 HTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 81 ~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
.......+|. |...|. +|.|+|..+.++++|.|+|||||.++ ..++.+++.||+..|..++|-.++..|+||
T Consensus 493 e~~~l~~vLsGH~RGvw-----~V~Fs~~dq~laT~SgD~TvKIW~is--~fSClkT~eGH~~aVlra~F~~~~~qliS~ 565 (775)
T KOG0319|consen 493 EQLRLLGVLSGHTRGVW-----CVSFSKNDQLLATCSGDKTVKIWSIS--TFSCLKTFEGHTSAVLRASFIRNGKQLISA 565 (775)
T ss_pred cCceEEEEeeCCccceE-----EEEeccccceeEeccCCceEEEEEec--cceeeeeecCccceeEeeeeeeCCcEEEec
Confidence 9999999999 999999 99999999999999999999999999 578888899999999999999999999999
Q ss_pred ECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcc
Q 000425 160 CRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNI 239 (1525)
Q Consensus 160 S~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~ 239 (1525)
+.||-||+|++++... +. .+..+. .+||.++..|..+.+. +++-|.
T Consensus 566 ~adGliKlWnikt~eC---------------~~------tlD~H~-------DrvWaL~~~~~~~~~~------tgg~Dg 611 (775)
T KOG0319|consen 566 GADGLIKLWNIKTNEC---------------EM------TLDAHN-------DRVWALSVSPLLDMFV------TGGGDG 611 (775)
T ss_pred cCCCcEEEEeccchhh---------------hh------hhhhcc-------ceeEEEeecCccceeE------ecCCCe
Confidence 9999999999873211 11 122332 3789999999888766 677777
Q ss_pred ccccccc
Q 000425 240 KNRAAYT 246 (1525)
Q Consensus 240 ~~k~~~~ 246 (1525)
.+..|..
T Consensus 612 ~i~~wkD 618 (775)
T KOG0319|consen 612 RIIFWKD 618 (775)
T ss_pred EEEEeec
Confidence 7666653
No 46
>PTZ00420 coronin; Provisional
Probab=99.65 E-value=7.7e-15 Score=182.73 Aligned_cols=121 Identities=19% Similarity=0.268 Sum_probs=100.9
Q ss_pred ceEEccCCCCCeEEEEEcCC-CCEEEEEECCCcEEEEEcCCCc--------EEEEEe-cCCceeecCceeEEEeCCCCEE
Q 000425 43 PTKIKTDLKKPIVNLACHPR-LPVLYVAYADGLIRAYNIHTYA--------VHYTLQ-LDNTIKLLGAGAFAFHPTLEWL 112 (1525)
Q Consensus 43 ~tklk~~H~~~V~sLaf~Pd-g~~LaSGs~DgtIrLWDl~t~k--------~l~tL~-H~~~V~~~ga~sVaFSPdg~~L 112 (1525)
...+.+ |.++|.+++|+|+ +.+|++|+.|++|+|||+.+.. ++..+. |...|. +++|||++..+
T Consensus 67 v~~L~g-H~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~-----sVaf~P~g~~i 140 (568)
T PTZ00420 67 VIKLKG-HTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS-----IIDWNPMNYYI 140 (568)
T ss_pred EEEEcC-CCCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEE-----EEEECCCCCeE
Confidence 344554 9999999999997 6899999999999999997542 345677 999999 99999999865
Q ss_pred -EEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 113 -FVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 113 -~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
++++.|++|+|||+++... ...+ .|...|.+++|+|+|.+|++++.|++|+|||+|.
T Consensus 141 LaSgS~DgtIrIWDl~tg~~--~~~i-~~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rs 198 (568)
T PTZ00420 141 MCSSGFDSFVNIWDIENEKR--AFQI-NMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRK 198 (568)
T ss_pred EEEEeCCCeEEEEECCCCcE--EEEE-ecCCcEEEEEECCCCCEEEEEecCCEEEEEECCC
Confidence 6889999999999996422 2122 3667899999999999999999999999999973
No 47
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.64 E-value=3.7e-15 Score=173.92 Aligned_cols=142 Identities=22% Similarity=0.280 Sum_probs=123.0
Q ss_pred EEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCc
Q 000425 22 FFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGA 100 (1525)
Q Consensus 22 f~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga 100 (1525)
|+++..++.+.|+. +...++..++.+ |.++|.++.|+|++.+|+++++|+|+|||+.....+...|. |.+.|.
T Consensus 332 F~ts~td~~i~V~k-v~~~~P~~t~~G-H~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~~~l~~Hskei~---- 405 (524)
T KOG0273|consen 332 FATSSTDGCIHVCK-VGEDRPVKTFIG-HHGEVNALKWNPTGSLLASCSDDGTLKIWSMGQSNSVHDLQAHSKEIY---- 405 (524)
T ss_pred EeecCCCceEEEEE-ecCCCcceeeec-ccCceEEEEECCCCceEEEecCCCeeEeeecCCCcchhhhhhhcccee----
Confidence 45555677777766 444456666666 99999999999999999999999999999999999999999 999999
Q ss_pred eeEEEeCCCC---------EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 101 GAFAFHPTLE---------WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 101 ~sVaFSPdg~---------~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+.|+|+|+ .+++++.|++|++||+.. ..++..+..|..+|.+++|+|+|+|+++||.||.|.+|+.+
T Consensus 406 -t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~--gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~ 482 (524)
T KOG0273|consen 406 -TIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVES--GVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTK 482 (524)
T ss_pred -eEeecCCCCccCCCcCCceEEEeecCCeEEEEEccC--CceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEecccc
Confidence 999999763 488999999999999995 44555577999999999999999999999999999999987
Q ss_pred e
Q 000425 172 V 172 (1525)
Q Consensus 172 ~ 172 (1525)
.
T Consensus 483 ~ 483 (524)
T KOG0273|consen 483 T 483 (524)
T ss_pred c
Confidence 4
No 48
>PTZ00420 coronin; Provisional
Probab=99.64 E-value=1.7e-14 Score=179.58 Aligned_cols=158 Identities=8% Similarity=0.096 Sum_probs=126.8
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCc-------cceEEccCCCCCeEEEEEcCCCCE-EEEEECCCcEEEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR-------APTKIKTDLKKPIVNLACHPRLPV-LYVAYADGLIRAY 78 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk-------~~tklk~~H~~~V~sLaf~Pdg~~-LaSGs~DgtIrLW 78 (1525)
+....++|||..+.++++...+.+|++|+..+++. +...+. .|...|.+++|||++.. +++|+.|++|+||
T Consensus 75 ~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~-gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIW 153 (568)
T PTZ00420 75 SSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILK-GHKKKISIIDWNPMNYYIMCSSGFDSFVNIW 153 (568)
T ss_pred CCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEee-cCCCcEEEEEECCCCCeEEEEEeCCCeEEEE
Confidence 46788999998665666666799999999654322 122344 49999999999999875 5789999999999
Q ss_pred EcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEE-----EEEcCCC
Q 000425 79 NIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITS-----VAWLPML 153 (1525)
Q Consensus 79 Dl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~Vts-----VafsPdg 153 (1525)
|+++++....+.|...|. +++|+|+|.+|++++.|++|+|||.++. ..+..+.+|.+.+.+ ..|++++
T Consensus 154 Dl~tg~~~~~i~~~~~V~-----SlswspdG~lLat~s~D~~IrIwD~Rsg--~~i~tl~gH~g~~~s~~v~~~~fs~d~ 226 (568)
T PTZ00420 154 DIENEKRAFQINMPKKLS-----SLKWNIKGNLLSGTCVGKHMHIIDPRKQ--EIASSFHIHDGGKNTKNIWIDGLGGDD 226 (568)
T ss_pred ECCCCcEEEEEecCCcEE-----EEEECCCCCEEEEEecCCEEEEEECCCC--cEEEEEecccCCceeEEEEeeeEcCCC
Confidence 999998888777777899 9999999999999999999999999963 344456788876543 3456899
Q ss_pred CEEEEEECCC----CeeEEEeee
Q 000425 154 RLLVTLCRDG----SLQVWKTRV 172 (1525)
Q Consensus 154 ~~LaSgS~Dg----tIkVWDlr~ 172 (1525)
.+|+|++.|+ +|+|||++.
T Consensus 227 ~~IlTtG~d~~~~R~VkLWDlr~ 249 (568)
T PTZ00420 227 NYILSTGFSKNNMREMKLWDLKN 249 (568)
T ss_pred CEEEEEEcCCCCccEEEEEECCC
Confidence 9999988775 799999984
No 49
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.64 E-value=2e-14 Score=187.46 Aligned_cols=248 Identities=10% Similarity=0.093 Sum_probs=172.0
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecC---Cc----cceEEccCCCCCeEEEEEcCC-CCEEEEEECCCcEEE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG---GR----APTKIKTDLKKPIVNLACHPR-LPVLYVAYADGLIRA 77 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg---gk----~~tklk~~H~~~V~sLaf~Pd-g~~LaSGs~DgtIrL 77 (1525)
.+....++|+|++.+++++. .+..|++|+.... +. +...+ .+...|.+++|+|. +.+|++++.||+|++
T Consensus 483 ~~~V~~i~fs~dg~~latgg-~D~~I~iwd~~~~~~~~~~~~~~~~~~--~~~~~v~~l~~~~~~~~~las~~~Dg~v~l 559 (793)
T PLN00181 483 SNLVCAIGFDRDGEFFATAG-VNKKIKIFECESIIKDGRDIHYPVVEL--ASRSKLSGICWNSYIKSQVASSNFEGVVQV 559 (793)
T ss_pred CCcEEEEEECCCCCEEEEEe-CCCEEEEEECCcccccccccccceEEe--cccCceeeEEeccCCCCEEEEEeCCCeEEE
Confidence 34567799999999887776 5889999985321 11 11122 24578999999985 678999999999999
Q ss_pred EEcCCCcEEEEEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEc-CCCC
Q 000425 78 YNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWL-PMLR 154 (1525)
Q Consensus 78 WDl~t~k~l~tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafs-Pdg~ 154 (1525)
||+.+++.+..+. |...|. +++|+| ++.+|++|+.|++|++||+++. .....+..| ..|.++.|+ +++.
T Consensus 560 Wd~~~~~~~~~~~~H~~~V~-----~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~--~~~~~~~~~-~~v~~v~~~~~~g~ 631 (793)
T PLN00181 560 WDVARSQLVTEMKEHEKRVW-----SIDYSSADPTLLASGSDDGSVKLWSINQG--VSIGTIKTK-ANICCVQFPSESGR 631 (793)
T ss_pred EECCCCeEEEEecCCCCCEE-----EEEEcCCCCCEEEEEcCCCEEEEEECCCC--cEEEEEecC-CCeEEEEEeCCCCC
Confidence 9999999999998 999999 999997 7889999999999999999853 333334444 678999996 5799
Q ss_pred EEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccc
Q 000425 155 LLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFT 234 (1525)
Q Consensus 155 ~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s 234 (1525)
+|++|+.|++|++||++....+ . . .+..+. ..|.++.|++ .+.++ +
T Consensus 632 ~latgs~dg~I~iwD~~~~~~~----~-~---------------~~~~h~-------~~V~~v~f~~-~~~lv------s 677 (793)
T PLN00181 632 SLAFGSADHKVYYYDLRNPKLP----L-C---------------TMIGHS-------KTVSYVRFVD-SSTLV------S 677 (793)
T ss_pred EEEEEeCCCeEEEEECCCCCcc----c-e---------------EecCCC-------CCEEEEEEeC-CCEEE------E
Confidence 9999999999999998732110 0 0 011111 1467788874 44444 5
Q ss_pred cCCcccccccccccC----ceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhc
Q 000425 235 GGDNIKNRAAYTREG----RKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKA 310 (1525)
Q Consensus 235 ~~Dd~~~k~~~~~eg----~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~ 310 (1525)
++.|..+++|..+.+ ....+.++.+|.+....+ .+.+.+. -++.+ ..++.+.|||.....
T Consensus 678 ~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v---~~s~~~~-~lasg------------s~D~~v~iw~~~~~~ 741 (793)
T PLN00181 678 SSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFV---GLSVSDG-YIATG------------SETNEVFVYHKAFPM 741 (793)
T ss_pred EECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEE---EEcCCCC-EEEEE------------eCCCEEEEEECCCCC
Confidence 677888999987643 124456788887654221 1111111 11122 257888888865554
Q ss_pred eeee
Q 000425 311 FLYS 314 (1525)
Q Consensus 311 ~i~s 314 (1525)
++.+
T Consensus 742 ~~~s 745 (793)
T PLN00181 742 PVLS 745 (793)
T ss_pred ceEE
Confidence 5544
No 50
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.63 E-value=4.7e-15 Score=165.78 Aligned_cols=216 Identities=17% Similarity=0.227 Sum_probs=161.6
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
-|.+++++..|+.+. .|+++..|+ .+.++...+.++ |.+-|+++.-...|. ++.+|+.|+++||||+++..+++++
T Consensus 95 ~l~~~~d~s~i~S~g-tDk~v~~wD-~~tG~~~rk~k~-h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k~~~~t~ 171 (338)
T KOG0265|consen 95 ELHGMRDGSHILSCG-TDKTVRGWD-AETGKRIRKHKG-HTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKKEAIKTF 171 (338)
T ss_pred eeeeccCCCEEEEec-CCceEEEEe-cccceeeehhcc-ccceeeecCccccCCeEEEecCCCceEEEEeecccchhhcc
Confidence 456788888887666 689999998 577788888888 999999998555565 6789999999999999999999999
Q ss_pred ecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEE
Q 000425 90 QLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 90 ~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWD 169 (1525)
...-.++ ++.|.-++..+++|+-|+.|++||++. ....-++.||.+.|+++..+|+|.++.|-+.|.++++||
T Consensus 172 ~~kyqlt-----Av~f~d~s~qv~sggIdn~ikvWd~r~--~d~~~~lsGh~DtIt~lsls~~gs~llsnsMd~tvrvwd 244 (338)
T KOG0265|consen 172 ENKYQLT-----AVGFKDTSDQVISGGIDNDIKVWDLRK--NDGLYTLSGHADTITGLSLSRYGSFLLSNSMDNTVRVWD 244 (338)
T ss_pred ccceeEE-----EEEecccccceeeccccCceeeecccc--CcceEEeecccCceeeEEeccCCCccccccccceEEEEE
Confidence 8667788 999999999999999999999999984 333344789999999999999999999999999999999
Q ss_pred eeeecCCCCC-------------CCccccccCCC--cc--ccc-ccccccccCCcceecCCC----eeEEEEecCCCeEE
Q 000425 170 TRVIINPNRP-------------PMQANFFEPAS--IE--SID-IPRILSQQGGEAVYPLPR----VRALEVHPRLNLAV 227 (1525)
Q Consensus 170 lr~~~~p~~~-------------~~~a~~f~p~g--Ve--sld-i~~iLs~~gG~~vy~l~~----V~~v~~HP~~Nlia 227 (1525)
+|.....++. +.....+.|++ +. +.| ...++...+...+|-+|+ |+++.|||+..++.
T Consensus 245 ~rp~~p~~R~v~if~g~~hnfeknlL~cswsp~~~~i~ags~dr~vyvwd~~~r~~lyklpGh~gsvn~~~Fhp~e~iil 324 (338)
T KOG0265|consen 245 VRPFAPSQRCVKIFQGHIHNFEKNLLKCSWSPNGTKITAGSADRFVYVWDTTSRRILYKLPGHYGSVNEVDFHPTEPIIL 324 (338)
T ss_pred ecccCCCCceEEEeecchhhhhhhcceeeccCCCCccccccccceEEEeecccccEEEEcCCcceeEEEeeecCCCcEEE
Confidence 9944332221 00011112211 11 111 111223333345666664 88999999999887
Q ss_pred EEecccccCCccccc
Q 000425 228 LLFANFTGGDNIKNR 242 (1525)
Q Consensus 228 l~F~~~s~~Dd~~~k 242 (1525)
+++.|+.++
T Consensus 325 ------s~~sdk~i~ 333 (338)
T KOG0265|consen 325 ------SCSSDKTIY 333 (338)
T ss_pred ------EeccCceeE
Confidence 555565543
No 51
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.63 E-value=2.5e-15 Score=171.55 Aligned_cols=205 Identities=19% Similarity=0.279 Sum_probs=151.3
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEE-ccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKI-KTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tkl-k~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~ 83 (1525)
.+--..|+++|...-.+.+.+....|.+|...++.+..... ..+|++.|-.++|+|+.. .|+|||.||+|||||++.+
T Consensus 211 k~EGy~LdWSp~~~g~LlsGDc~~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~DgsIrIWDiRs~ 290 (440)
T KOG0302|consen 211 KGEGYGLDWSPIKTGRLLSGDCVKGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGSIRIWDIRSG 290 (440)
T ss_pred CccceeeecccccccccccCccccceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecCceEEEEEecCC
Confidence 34456799999777666777788889998877776544322 224999999999999875 7999999999999999988
Q ss_pred ---cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-CceeeEeecccCcCeEEEEEcCC-CCEEE
Q 000425 84 ---AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-RPSMIGIIQVGSQPITSVAWLPM-LRLLV 157 (1525)
Q Consensus 84 ---k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH~~~VtsVafsPd-g~~La 157 (1525)
.++.+ + |...|+ .|.|+.+-..|++|++||+++|||+++. ..+.++.++-|..+|+||.|||. ...|+
T Consensus 291 ~~~~~~~~-kAh~sDVN-----VISWnr~~~lLasG~DdGt~~iwDLR~~~~~~pVA~fk~Hk~pItsieW~p~e~s~ia 364 (440)
T KOG0302|consen 291 PKKAAVST-KAHNSDVN-----VISWNRREPLLASGGDDGTLSIWDLRQFKSGQPVATFKYHKAPITSIEWHPHEDSVIA 364 (440)
T ss_pred CccceeEe-eccCCcee-----eEEccCCcceeeecCCCceEEEEEhhhccCCCcceeEEeccCCeeEEEeccccCceEE
Confidence 34444 5 999999 9999999889999999999999999974 33455677899999999999995 45788
Q ss_pred EEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccc-cccCCcceecCCCeeEEEEecCCCeEE
Q 000425 158 TLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRIL-SQQGGEAVYPLPRVRALEVHPRLNLAV 227 (1525)
Q Consensus 158 SgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iL-s~~gG~~vy~l~~V~~v~~HP~~Nlia 227 (1525)
+++.|.+|.+||+.+-.........+. .+ -.|++..| -.+-|.+ .++.+.||+..|-..
T Consensus 365 asg~D~QitiWDlsvE~D~ee~~~~a~----~~--L~dlPpQLLFVHqGQk-----e~KevhWH~QiPG~l 424 (440)
T KOG0302|consen 365 ASGEDNQITIWDLSVEADEEEIDQEAA----EG--LQDLPPQLLFVHQGQK-----EVKEVHWHRQIPGLL 424 (440)
T ss_pred eccCCCcEEEEEeeccCChhhhccccc----cc--hhcCCceeEEEecchh-----HhhhheeccCCCCeE
Confidence 888999999999985443211111100 00 12343322 2232322 467899999877654
No 52
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.62 E-value=3e-15 Score=175.60 Aligned_cols=246 Identities=16% Similarity=0.250 Sum_probs=172.1
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+..-+|+|+|..|+=|+-.+ ...+.+++.... ...+.....++.|.+++|-.||.+++.|...|.|+|||+++...+
T Consensus 27 ~~vssl~fsp~~P~d~aVt~-S~rvqly~~~~~--~~~k~~srFk~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~r~iL 103 (487)
T KOG0310|consen 27 NSVSSLCFSPKHPYDFAVTS-SVRVQLYSSVTR--SVRKTFSRFKDVVYSVDFRSDGRLLAAGDESGHVKVFDMKSRVIL 103 (487)
T ss_pred CcceeEecCCCCCCceEEec-ccEEEEEecchh--hhhhhHHhhccceeEEEeecCCeEEEccCCcCcEEEeccccHHHH
Confidence 46789999999998776664 566667765432 222222236789999999999999999999999999998887788
Q ss_pred EEEe-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEEECCC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTLCRDG 163 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSgS~Dg 163 (1525)
+.+. |..+++ .+.|+|++ ..+++|++|+.+++||+.+ ......+.+|+++|+|.+|+|- ...++|||-||
T Consensus 104 R~~~ah~apv~-----~~~f~~~d~t~l~s~sDd~v~k~~d~s~--a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg 176 (487)
T KOG0310|consen 104 RQLYAHQAPVH-----VTKFSPQDNTMLVSGSDDKVVKYWDLST--AYVQAELSGHTDYVRCGDISPANDHIVVTGSYDG 176 (487)
T ss_pred HHHhhccCcee-----EEEecccCCeEEEecCCCceEEEEEcCC--cEEEEEecCCcceeEeeccccCCCeEEEecCCCc
Confidence 8888 999999 99999955 5677888999999999995 3344468899999999999996 45899999999
Q ss_pred CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccc
Q 000425 164 SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRA 243 (1525)
Q Consensus 164 tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~ 243 (1525)
+||+||+|...+ .+++...| .+|-.+.+-|.+.+++ +++- ..+|+
T Consensus 177 ~vrl~DtR~~~~----------------------~v~elnhg------~pVe~vl~lpsgs~ia------sAgG-n~vkV 221 (487)
T KOG0310|consen 177 KVRLWDTRSLTS----------------------RVVELNHG------CPVESVLALPSGSLIA------SAGG-NSVKV 221 (487)
T ss_pred eEEEEEeccCCc----------------------eeEEecCC------CceeeEEEcCCCCEEE------EcCC-CeEEE
Confidence 999999995321 01111111 2456677777777776 3322 34678
Q ss_pred cccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 244 AYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 244 ~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
|....|+..+ ...-.|.-.+.|+ ++.+.+. ..+..- =|+...|.|+.+..-+||
T Consensus 222 WDl~~G~qll-~~~~~H~KtVTcL---~l~s~~~-rLlS~s------------LD~~VKVfd~t~~Kvv~s 275 (487)
T KOG0310|consen 222 WDLTTGGQLL-TSMFNHNKTVTCL---RLASDST-RLLSGS------------LDRHVKVFDTTNYKVVHS 275 (487)
T ss_pred EEecCCceeh-hhhhcccceEEEE---EeecCCc-eEeecc------------cccceEEEEccceEEEEe
Confidence 8888775333 2333366665553 3333222 122111 155566666666655655
No 53
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.62 E-value=7.3e-14 Score=151.32 Aligned_cols=152 Identities=17% Similarity=0.276 Sum_probs=130.0
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
..+.++|.+..++++. .+..+.+|+.. .+.....+.. |...|.++.|+|++.++++++.|+.|++||+.+++....+
T Consensus 55 ~~~~~~~~~~~l~~~~-~~~~i~i~~~~-~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 131 (289)
T cd00200 55 RDVAASADGTYLASGS-SDKTIRLWDLE-TGECVRTLTG-HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTL 131 (289)
T ss_pred eEEEECCCCCEEEEEc-CCCeEEEEEcC-cccceEEEec-cCCcEEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEe
Confidence 4899999998877776 47889898854 3344555554 8889999999999888888888999999999999999999
Q ss_pred e-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
. |...+. +++|+|++.++++++.|+.|++||+++. .....+..|...|.++.|+|++..+++++.|+.|++|
T Consensus 132 ~~~~~~i~-----~~~~~~~~~~l~~~~~~~~i~i~d~~~~--~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~ 204 (289)
T cd00200 132 RGHTDWVN-----SVAFSPDGTFVASSSQDGTIKLWDLRTG--KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204 (289)
T ss_pred ccCCCcEE-----EEEEcCcCCEEEEEcCCCcEEEEEcccc--ccceeEecCccccceEEECCCcCEEEEecCCCcEEEE
Confidence 8 999999 9999999999988888999999999843 3333455888899999999999999999999999999
Q ss_pred Eee
Q 000425 169 KTR 171 (1525)
Q Consensus 169 Dlr 171 (1525)
|++
T Consensus 205 d~~ 207 (289)
T cd00200 205 DLS 207 (289)
T ss_pred ECC
Confidence 997
No 54
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.61 E-value=5.8e-15 Score=172.35 Aligned_cols=253 Identities=18% Similarity=0.236 Sum_probs=176.8
Q ss_pred cccCCcEEE-EEEcCCCCEEEEEEeccceEEEEEeecC---------CccceEEccCCCCCeEEEEEcCCCC-EEEEEEC
Q 000425 3 SISVDTEVH-LALTPLQPVVFFGFHRRMSVTVVGTVEG---------GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYA 71 (1525)
Q Consensus 3 ~~~~~~e~~-LA~hP~qp~Vf~s~~~d~sI~vv~tieg---------gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~ 71 (1525)
+|.-++||. +...|+.|.+..+......+.|++..+- .++...+++ |++.-.+++|++... .+++|+.
T Consensus 120 ~i~h~gEVnRaRymPQnp~iVAt~t~~~dv~Vfd~tk~~s~~~~~~~~~Pdl~L~g-H~~eg~glsWn~~~~g~Lls~~~ 198 (422)
T KOG0264|consen 120 KINHDGEVNRARYMPQNPNIVATKTSSGDVYVFDYTKHPSKPKASGECRPDLRLKG-HEKEGYGLSWNRQQEGTLLSGSD 198 (422)
T ss_pred eccCCccchhhhhCCCCCcEEEecCCCCCEEEEEeccCCCcccccccCCCceEEEe-ecccccccccccccceeEeeccC
Confidence 356667775 4456999999998888889999887432 133346777 988889999999865 7999999
Q ss_pred CCcEEEEEcCCC-------cEEEEEe-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCc
Q 000425 72 DGLIRAYNIHTY-------AVHYTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQ 142 (1525)
Q Consensus 72 DgtIrLWDl~t~-------k~l~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~ 142 (1525)
|++|++||+... .+...+. |...|. .|+||+.. ..+.++++|+.+.|||+++..........+|+.
T Consensus 199 d~~i~lwdi~~~~~~~~~~~p~~~~~~h~~~Ve-----DV~~h~~h~~lF~sv~dd~~L~iwD~R~~~~~~~~~~~ah~~ 273 (422)
T KOG0264|consen 199 DHTICLWDINAESKEDKVVDPKTIFSGHEDVVE-----DVAWHPLHEDLFGSVGDDGKLMIWDTRSNTSKPSHSVKAHSA 273 (422)
T ss_pred CCcEEEEeccccccCCccccceEEeecCCccee-----hhhccccchhhheeecCCCeEEEEEcCCCCCCCcccccccCC
Confidence 999999998643 2345566 999999 99999955 567788999999999999531222233569999
Q ss_pred CeEEEEEcCC-CCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEec
Q 000425 143 PITSVAWLPM-LRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHP 221 (1525)
Q Consensus 143 ~VtsVafsPd-g~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP 221 (1525)
.|+|++|+|. +.+|+|||.|++|++||+|....+ . ..+..+. ..|..|+|.|
T Consensus 274 ~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~~----l----------------h~~e~H~-------dev~~V~WSP 326 (422)
T KOG0264|consen 274 EVNCVAFNPFNEFILATGSADKTVALWDLRNLNKP----L----------------HTFEGHE-------DEVFQVEWSP 326 (422)
T ss_pred ceeEEEeCCCCCceEEeccCCCcEEEeechhcccC----c----------------eeccCCC-------cceEEEEeCC
Confidence 9999999994 678999999999999999954331 0 0111222 2467899999
Q ss_pred CC-CeEEEEecccccCCcccccccc-cccCcee-----------eeeeeccccCCcccchhhhcccCCCcccchhhHHHH
Q 000425 222 RL-NLAVLLFANFTGGDNIKNRAAY-TREGRKQ-----------LFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQA 288 (1525)
Q Consensus 222 ~~-Nlial~F~~~s~~Dd~~~k~~~-~~eg~~q-----------~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~ 288 (1525)
.. ++.| +++.|....+|. .+-|..| ++-+..||.+.+.- .+... ..
T Consensus 327 h~etvLA------SSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~D------fsWnp---------~e 385 (422)
T KOG0264|consen 327 HNETVLA------SSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSD------FSWNP---------NE 385 (422)
T ss_pred CCCceeE------ecccCCcEEEEeccccccccChhhhccCCcceeEEecCccccccc------ccCCC---------CC
Confidence 84 5555 555666666564 3445444 23367777775411 11111 12
Q ss_pred HHHhhhccCCCceeeehhhhh
Q 000425 289 QLQEHHLKGHSHLTISDIARK 309 (1525)
Q Consensus 289 ~~~~~~~~~~~~~~I~di~r~ 309 (1525)
-|......+|+.+-||++++.
T Consensus 386 PW~I~SvaeDN~LqIW~~s~~ 406 (422)
T KOG0264|consen 386 PWTIASVAEDNILQIWQMAEN 406 (422)
T ss_pred CeEEEEecCCceEEEeecccc
Confidence 233334558999999998876
No 55
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.61 E-value=1.1e-14 Score=180.52 Aligned_cols=226 Identities=20% Similarity=0.218 Sum_probs=170.0
Q ss_pred EEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCce
Q 000425 23 FGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAG 101 (1525)
Q Consensus 23 ~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~ 101 (1525)
.+...++++.+|+...+-.....+.+ |.+.|+++++..-+..+++|+.|.++|+||..+++|..++. |...+.
T Consensus 222 ~~~s~~~tl~~~~~~~~~~i~~~l~G-H~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~stv~----- 295 (537)
T KOG0274|consen 222 KSGSDDSTLHLWDLNNGYLILTRLVG-HFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTSSVR----- 295 (537)
T ss_pred EecCCCceeEEeecccceEEEeeccC-CCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCceEE-----
Confidence 34446788888887665444444555 99999999998878899999999999999999999999999 999999
Q ss_pred eEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCC
Q 000425 102 AFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPM 181 (1525)
Q Consensus 102 sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~ 181 (1525)
++.. ...++++||.|.+|+|||++ ...+..++.+|...|+++..+ ++++++||.|++|+|||++...-
T Consensus 296 ~~~~--~~~~~~sgs~D~tVkVW~v~--n~~~l~l~~~h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~~~~c------ 363 (537)
T KOG0274|consen 296 CLTI--DPFLLVSGSRDNTVKVWDVT--NGACLNLLRGHTGPVNCVQLD--EPLLVSGSYDGTVKVWDPRTGKC------ 363 (537)
T ss_pred EEEc--cCceEeeccCCceEEEEecc--CcceEEEeccccccEEEEEec--CCEEEEEecCceEEEEEhhhcee------
Confidence 7654 44577888999999999999 456666677899999999997 88999999999999999872211
Q ss_pred ccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeecccc
Q 000425 182 QANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSAR 261 (1525)
Q Consensus 182 ~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~ 261 (1525)
-..+..+.+ +|.++.+.+. +.+. +++-|+.++.|..+..+ +++.+|++|.
T Consensus 364 ---------------l~sl~gH~~-------~V~sl~~~~~-~~~~------Sgs~D~~IkvWdl~~~~-~c~~tl~~h~ 413 (537)
T KOG0274|consen 364 ---------------LKSLSGHTG-------RVYSLIVDSE-NRLL------SGSLDTTIKVWDLRTKR-KCIHTLQGHT 413 (537)
T ss_pred ---------------eeeecCCcc-------eEEEEEecCc-ceEE------eeeeccceEeecCCchh-hhhhhhcCCc
Confidence 012222221 4555555544 4443 67778899999988775 7888999999
Q ss_pred CCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 262 GSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 262 ~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
+.+.+++.- +. .+ .+ -.-|+.+++||+.....+.+
T Consensus 414 ~~v~~l~~~-----~~--~L-----vs------~~aD~~Ik~WD~~~~~~~~~ 448 (537)
T KOG0274|consen 414 SLVSSLLLR-----DN--FL-----VS------SSADGTIKLWDAEEGECLRT 448 (537)
T ss_pred ccccccccc-----cc--ee-----Ee------ccccccEEEeecccCceeee
Confidence 987433211 00 11 01 11478899999888888877
No 56
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.61 E-value=9.3e-15 Score=168.12 Aligned_cols=237 Identities=12% Similarity=0.161 Sum_probs=162.9
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCc--cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR--APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk--~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
.--+.|+|++.++..++ ++.+..+|....... -...+. +|..+|.-|.|+||++++++++.|..+++||..++.+.
T Consensus 227 VWfl~FS~nGkyLAsaS-kD~Taiiw~v~~d~~~kl~~tlv-gh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~ 304 (519)
T KOG0293|consen 227 VWFLQFSHNGKYLASAS-KDSTAIIWIVVYDVHFKLKKTLV-GHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLR 304 (519)
T ss_pred EEEEEEcCCCeeEeecc-CCceEEEEEEecCcceeeeeeee-cccCceEEEEECCCCCeEEecCchHheeeccCCcchhh
Confidence 44689999999987665 578887887765543 122233 49999999999999999999999999999999999999
Q ss_pred EEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc-cCcCeEEEEEcCCCCEEEEEECCC
Q 000425 87 YTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV-GSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 87 ~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g-H~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
..+. |..++. +++|.|||..+++|+.|+++..||+... ....+++ ....|.+++.++||+++++...|.
T Consensus 305 ~~y~~~~~~S~~-----sc~W~pDg~~~V~Gs~dr~i~~wdlDgn---~~~~W~gvr~~~v~dlait~Dgk~vl~v~~d~ 376 (519)
T KOG0293|consen 305 HLYPSGLGFSVS-----SCAWCPDGFRFVTGSPDRTIIMWDLDGN---ILGNWEGVRDPKVHDLAITYDGKYVLLVTVDK 376 (519)
T ss_pred hhcccCcCCCcc-----eeEEccCCceeEecCCCCcEEEecCCcc---hhhcccccccceeEEEEEcCCCcEEEEEeccc
Confidence 9888 568888 9999999999999999999999999843 1111222 235688999999999999999999
Q ss_pred CeeEEEeeeecCC----CCCCCcccc-----------ccCCCcccccccc--cccccCCcc--eecCCCeeEEEEecCCC
Q 000425 164 SLQVWKTRVIINP----NRPPMQANF-----------FEPASIESIDIPR--ILSQQGGEA--VYPLPRVRALEVHPRLN 224 (1525)
Q Consensus 164 tIkVWDlr~~~~p----~~~~~~a~~-----------f~p~gVesldi~~--iLs~~gG~~--vy~l~~V~~v~~HP~~N 224 (1525)
.|++++....... ...+...-. ..+.++.-+|+.. +.....|.+ .|.+ -.|+..+ +..
T Consensus 377 ~i~l~~~e~~~dr~lise~~~its~~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~Ghkq~~fiI--rSCFgg~-~~~ 453 (519)
T KOG0293|consen 377 KIRLYNREARVDRGLISEEQPITSFSISKDGKLALVNLQDQEIHLWDLEENKLVRKYFGHKQGHFII--RSCFGGG-NDK 453 (519)
T ss_pred ceeeechhhhhhhccccccCceeEEEEcCCCcEEEEEcccCeeEEeecchhhHHHHhhcccccceEE--EeccCCC-Ccc
Confidence 9999987632221 111111000 0112222222221 111111111 1100 0122222 234
Q ss_pred eEEEEecccccCCcccccccccccCceeeeeeeccccCCccc
Q 000425 225 LAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSAS 266 (1525)
Q Consensus 225 lial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s 266 (1525)
+++ +|++|.++|+|.++.|+ ++.+|.||...+-+
T Consensus 454 fia------SGSED~kvyIWhr~sgk--ll~~LsGHs~~vNc 487 (519)
T KOG0293|consen 454 FIA------SGSEDSKVYIWHRISGK--LLAVLSGHSKTVNC 487 (519)
T ss_pred eEE------ecCCCceEEEEEccCCc--eeEeecCCcceeeE
Confidence 455 99999999999999884 77899999987643
No 57
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.61 E-value=8.8e-15 Score=180.71 Aligned_cols=248 Identities=15% Similarity=0.141 Sum_probs=178.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeec--------------------------------C---------------
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVE--------------------------------G--------------- 39 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tie--------------------------------g--------------- 39 (1525)
+..-.+-|+|++-++..|. .|.-|+||..++ .
T Consensus 268 gaIw~mKFS~DGKyLAsaG-eD~virVWkVie~e~~~~~~~~~~~~~~~~~~~s~~~p~~s~~~~~~~~~s~~~~~~~s~ 346 (712)
T KOG0283|consen 268 GAIWAMKFSHDGKYLASAG-EDGVIRVWKVIESERMRVAEGDSSCMYFEYNANSQIEPSTSSEEKISSRTSSSRKGSQSP 346 (712)
T ss_pred CcEEEEEeCCCCceeeecC-CCceEEEEEEeccchhcccccccchhhhhhhhccccCccccccccccccccccccccCCc
Confidence 4677899999999997776 588999998877 1
Q ss_pred ------------CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeC
Q 000425 40 ------------GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHP 107 (1525)
Q Consensus 40 ------------gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSP 107 (1525)
.++...+ .+|.+.|..|.|+. ..+|++++.|.|||||++...+|+.+|.|.+.|+ ||+|+|
T Consensus 347 ~~~~p~~~f~f~ekP~~ef-~GHt~DILDlSWSK-n~fLLSSSMDKTVRLWh~~~~~CL~~F~HndfVT-----cVaFnP 419 (712)
T KOG0283|consen 347 CVLLPLKAFVFSEKPFCEF-KGHTADILDLSWSK-NNFLLSSSMDKTVRLWHPGRKECLKVFSHNDFVT-----CVAFNP 419 (712)
T ss_pred cccCCCccccccccchhhh-hccchhheeccccc-CCeeEeccccccEEeecCCCcceeeEEecCCeeE-----EEEecc
Confidence 0011112 25999999999995 4689999999999999999999999999999999 999999
Q ss_pred -CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccc
Q 000425 108 -TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFF 186 (1525)
Q Consensus 108 -dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f 186 (1525)
|.+|+++||-|+.||||++. ..++. .+..-...|++++|.|||.+.+.|+-+|.+++++++-..-
T Consensus 420 vDDryFiSGSLD~KvRiWsI~--d~~Vv-~W~Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~----------- 485 (712)
T KOG0283|consen 420 VDDRYFISGSLDGKVRLWSIS--DKKVV-DWNDLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKL----------- 485 (712)
T ss_pred cCCCcEeecccccceEEeecC--cCeeE-eehhhhhhheeEEeccCCceEEEEEeccEEEEEEccCCeE-----------
Confidence 78999999999999999998 34444 2333347899999999999999999999999999862210
Q ss_pred cCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCccc
Q 000425 187 EPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSAS 266 (1525)
Q Consensus 187 ~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s 266 (1525)
.. ..-+....+.++..- +|+.+.|+|..+--.+ ..+.|.++|+...++. .++..|+|+++.. +
T Consensus 486 ~~--------~~~I~~~~~Kk~~~~-rITG~Q~~p~~~~~vL-----VTSnDSrIRI~d~~~~--~lv~KfKG~~n~~-S 548 (712)
T KOG0283|consen 486 VS--------DFHIRLHNKKKKQGK-RITGLQFFPGDPDEVL-----VTSNDSRIRIYDGRDK--DLVHKFKGFRNTS-S 548 (712)
T ss_pred EE--------eeeEeeccCccccCc-eeeeeEecCCCCCeEE-----EecCCCceEEEeccch--hhhhhhcccccCC-c
Confidence 00 001111222222222 7899999988762112 2355667776665443 4566788887765 2
Q ss_pred chhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehh
Q 000425 267 VLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDI 306 (1525)
Q Consensus 267 ~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di 306 (1525)
-+.+.|.+.|. -++.. .+|+.+.||++
T Consensus 549 Q~~Asfs~Dgk-~IVs~------------seDs~VYiW~~ 575 (712)
T KOG0283|consen 549 QISASFSSDGK-HIVSA------------SEDSWVYIWKN 575 (712)
T ss_pred ceeeeEccCCC-EEEEe------------ecCceEEEEeC
Confidence 23344444332 23332 27899999984
No 58
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.61 E-value=1.7e-14 Score=156.88 Aligned_cols=243 Identities=10% Similarity=0.092 Sum_probs=186.3
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
-...+.++-++-+.+++. +++++++|+...+ ....... +|...|..++.+.|...|++|+.|..|.+||+.++++++
T Consensus 19 aV~avryN~dGnY~ltcG-sdrtvrLWNp~rg-~liktYs-ghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~R 95 (307)
T KOG0316|consen 19 AVRAVRYNVDGNYCLTCG-SDRTVRLWNPLRG-ALIKTYS-GHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDR 95 (307)
T ss_pred ceEEEEEccCCCEEEEcC-CCceEEeeccccc-ceeeeec-CCCceeeeccccccccccccCCCCceEEEEEcccCeeee
Confidence 345677888899988776 5899999996544 3333344 499999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
.|. |...|+ .|+|+-+...+++||.|.++++||.++.....++++..-...|.+|... +..|++||.||++|
T Consensus 96 r~rgH~aqVN-----tV~fNeesSVv~SgsfD~s~r~wDCRS~s~ePiQildea~D~V~Si~v~--~heIvaGS~DGtvR 168 (307)
T KOG0316|consen 96 RFRGHLAQVN-----TVRFNEESSVVASGSFDSSVRLWDCRSRSFEPIQILDEAKDGVSSIDVA--EHEIVAGSVDGTVR 168 (307)
T ss_pred ecccccceee-----EEEecCcceEEEeccccceeEEEEcccCCCCccchhhhhcCceeEEEec--ccEEEeeccCCcEE
Confidence 999 999999 9999999999999999999999999986666677788888899998875 56899999999999
Q ss_pred EEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccc
Q 000425 167 VWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 167 VWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~ 246 (1525)
.||+|.-.. ... +.+ ..|+++.|.|.+|..- .+.-+...+....
T Consensus 169 tydiR~G~l------~sD-----------------y~g-------~pit~vs~s~d~nc~L------a~~l~stlrLlDk 212 (307)
T KOG0316|consen 169 TYDIRKGTL------SSD-----------------YFG-------HPITSVSFSKDGNCSL------ASSLDSTLRLLDK 212 (307)
T ss_pred EEEeeccee------ehh-----------------hcC-------CcceeEEecCCCCEEE------Eeeccceeeeccc
Confidence 999984221 111 111 2589999999999876 4455666777776
Q ss_pred ccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhcee
Q 000425 247 REGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFL 312 (1525)
Q Consensus 247 ~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i 312 (1525)
.+| ++...|+||.+-.|.+- +++. |+.-....=++|.....||+-..+.+
T Consensus 213 ~tG--klL~sYkGhkn~eykld----------c~l~----qsdthV~sgSEDG~Vy~wdLvd~~~~ 262 (307)
T KOG0316|consen 213 ETG--KLLKSYKGHKNMEYKLD----------CCLN----QSDTHVFSGSEDGKVYFWDLVDETQI 262 (307)
T ss_pred chh--HHHHHhcccccceeeee----------eeec----ccceeEEeccCCceEEEEEeccceee
Confidence 666 47789999999877641 1110 00000111237888888987665444
No 59
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.61 E-value=1.4e-15 Score=166.13 Aligned_cols=264 Identities=16% Similarity=0.205 Sum_probs=187.3
Q ss_pred cEEEEEEcCCCC--EEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 8 TEVHLALTPLQP--VVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 8 ~e~~LA~hP~qp--~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
-.|+|+|+|--| +.+.+..+|..-.+-+ .+.+...-.+.+ |++.|.+...+.+...-++++.|-+-|+||.-++..
T Consensus 16 pvvdl~~s~itp~g~flisa~kd~~pmlr~-g~tgdwigtfeg-hkgavw~~~l~~na~~aasaaadftakvw~a~tgde 93 (334)
T KOG0278|consen 16 PVVDLAFSPITPDGYFLISASKDGKPMLRN-GDTGDWIGTFEG-HKGAVWSATLNKNATRAASAAADFTAKVWDAVTGDE 93 (334)
T ss_pred ceeEEeccCCCCCceEEEEeccCCCchhcc-CCCCCcEEeeec-cCcceeeeecCchhhhhhhhcccchhhhhhhhhhhh
Confidence 468999998755 4445555554443322 233444445555 999999999998888899999999999999999999
Q ss_pred EEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 86 HYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 86 l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.+|+|..-|. +++|+.|.++|++|+.+..+||||++.... ....+.+|+..|+.+.|....+.|+|.++|++|
T Consensus 94 lhsf~hkhivk-----~~af~~ds~~lltgg~ekllrvfdln~p~A-pp~E~~ghtg~Ir~v~wc~eD~~iLSSadd~tV 167 (334)
T KOG0278|consen 94 LHSFEHKHIVK-----AVAFSQDSNYLLTGGQEKLLRVFDLNRPKA-PPKEISGHTGGIRTVLWCHEDKCILSSADDKTV 167 (334)
T ss_pred hhhhhhhheee-----eEEecccchhhhccchHHHhhhhhccCCCC-CchhhcCCCCcceeEEEeccCceEEeeccCCce
Confidence 99999999999 999999999999999999999999996411 112345999999999999988999999999999
Q ss_pred eEEEeeeecC-----CCCCCCccccccCCC----------cccccccccccccCCcceecCC-CeeEEEEecCCCeEEEE
Q 000425 166 QVWKTRVIIN-----PNRPPMQANFFEPAS----------IESIDIPRILSQQGGEAVYPLP-RVRALEVHPRLNLAVLL 229 (1525)
Q Consensus 166 kVWDlr~~~~-----p~~~~~~a~~f~p~g----------Vesldi~~iLs~~gG~~vy~l~-~V~~v~~HP~~Nlial~ 229 (1525)
|+||.|+... -+.+...... .+++ |..+|.... +--+-|-+| .|...+.||+.+...
T Consensus 168 RLWD~rTgt~v~sL~~~s~VtSlEv-s~dG~ilTia~gssV~Fwdaksf----~~lKs~k~P~nV~SASL~P~k~~fV-- 240 (334)
T KOG0278|consen 168 RLWDHRTGTEVQSLEFNSPVTSLEV-SQDGRILTIAYGSSVKFWDAKSF----GLLKSYKMPCNVESASLHPKKEFFV-- 240 (334)
T ss_pred EEEEeccCcEEEEEecCCCCcceee-ccCCCEEEEecCceeEEeccccc----cceeeccCccccccccccCCCceEE--
Confidence 9999994321 1111111111 1111 111110000 000112233 478889999988766
Q ss_pred ecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhh
Q 000425 230 FANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIA 307 (1525)
Q Consensus 230 F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~ 307 (1525)
.+++|.+.++....+|..... --++|++.+.+ .+|++.|-. | ..=++|.+++||.++
T Consensus 241 ----aGged~~~~kfDy~TgeEi~~-~nkgh~gpVhc---VrFSPdGE~-----y--------AsGSEDGTirlWQt~ 297 (334)
T KOG0278|consen 241 ----AGGEDFKVYKFDYNTGEEIGS-YNKGHFGPVHC---VRFSPDGEL-----Y--------ASGSEDGTIRLWQTT 297 (334)
T ss_pred ----ecCcceEEEEEeccCCceeee-cccCCCCceEE---EEECCCCce-----e--------eccCCCceEEEEEec
Confidence 789999999998888854443 13999999866 577777762 1 122379999999754
No 60
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.60 E-value=2.9e-14 Score=166.58 Aligned_cols=254 Identities=13% Similarity=0.170 Sum_probs=181.2
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
++...-.|+++-++..+.+|+. +..+++|+. .+ .....+. .|++||.++.|+.+|+++++++.|+++-+||..++.
T Consensus 234 ~nkdVT~L~Wn~~G~~LatG~~-~G~~riw~~-~G-~l~~tl~-~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g~ 309 (524)
T KOG0273|consen 234 SNKDVTSLDWNNDGTLLATGSE-DGEARIWNK-DG-NLISTLG-QHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTGT 309 (524)
T ss_pred ccCCcceEEecCCCCeEEEeec-CcEEEEEec-Cc-hhhhhhh-ccCCceEEEEEcCCCCEEEeccCCccEEEEeccCce
Confidence 4456678999999999999985 788888874 33 2223333 499999999999999999999999999999999999
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
+...|. |..+-. .|.|-.+ .-+++++.|+.|+|+-+.. ...+.++.+|...|+++.|+|.+.+|+|||+|+
T Consensus 310 ~~q~f~~~s~~~l-----DVdW~~~-~~F~ts~td~~i~V~kv~~--~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD~ 381 (524)
T KOG0273|consen 310 VKQQFEFHSAPAL-----DVDWQSN-DEFATSSTDGCIHVCKVGE--DRPVKTFIGHHGEVNALKWNPTGSLLASCSDDG 381 (524)
T ss_pred EEEeeeeccCCcc-----ceEEecC-ceEeecCCCceEEEEEecC--CCcceeeecccCceEEEEECCCCceEEEecCCC
Confidence 999999 887766 7888664 4588999999999998873 233334558999999999999999999999999
Q ss_pred CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecC--CCeeEEEEecCCCeEEEEecccccCCcccc
Q 000425 164 SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPL--PRVRALEVHPRLNLAVLLFANFTGGDNIKN 241 (1525)
Q Consensus 164 tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l--~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~ 241 (1525)
|++||....... ++ .++ .+. ..+|.+ .+...+.+||..|+.-+ +++-|..+
T Consensus 382 TlkiWs~~~~~~----------~~-----~l~------~Hs-kei~t~~wsp~g~v~~n~~~~~~l~-----sas~dstV 434 (524)
T KOG0273|consen 382 TLKIWSMGQSNS----------VH-----DLQ------AHS-KEIYTIKWSPTGPVTSNPNMNLMLA-----SASFDSTV 434 (524)
T ss_pred eeEeeecCCCcc----------hh-----hhh------hhc-cceeeEeecCCCCccCCCcCCceEE-----EeecCCeE
Confidence 999998652111 00 010 000 001111 12334566777766543 56677888
Q ss_pred cccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeeee
Q 000425 242 RAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSH 315 (1525)
Q Consensus 242 k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~ 315 (1525)
+.|.-..| +.+.+|-.|..++|++-+. + +|....+ =+-|..+-||++--.+-++++
T Consensus 435 ~lwdv~~g--v~i~~f~kH~~pVysvafS---~---~g~ylAs----------Gs~dg~V~iws~~~~~l~~s~ 490 (524)
T KOG0273|consen 435 KLWDVESG--VPIHTLMKHQEPVYSVAFS---P---NGRYLAS----------GSLDGCVHIWSTKTGKLVKSY 490 (524)
T ss_pred EEEEccCC--ceeEeeccCCCceEEEEec---C---CCcEEEe----------cCCCCeeEeccccchheeEee
Confidence 99988777 4556899999999986422 2 2222111 012566777877666666663
No 61
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.60 E-value=1.4e-15 Score=171.49 Aligned_cols=194 Identities=14% Similarity=0.216 Sum_probs=143.8
Q ss_pred EEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCce
Q 000425 23 FGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAG 101 (1525)
Q Consensus 23 ~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~ 101 (1525)
++..++.+|++|+. .......++.+ |++.|.|+.|. .+.+++|+.|.+|++||..+++++.++- |...|.
T Consensus 211 VSGlrDnTikiWD~-n~~~c~~~L~G-HtGSVLCLqyd--~rviisGSSDsTvrvWDv~tge~l~tlihHceaVL----- 281 (499)
T KOG0281|consen 211 VSGLRDNTIKIWDK-NSLECLKILTG-HTGSVLCLQYD--ERVIVSGSSDSTVRVWDVNTGEPLNTLIHHCEAVL----- 281 (499)
T ss_pred hcccccCceEEecc-ccHHHHHhhhc-CCCcEEeeecc--ceEEEecCCCceEEEEeccCCchhhHHhhhcceeE-----
Confidence 44457899999984 33333344444 99999999997 4599999999999999999999999998 889999
Q ss_pred eEEEeCCCCEEEEEeCCCcEEEEeCCCCC-ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCC
Q 000425 102 AFAFHPTLEWLFVGDRRGTLLAWDVSIER-PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPP 180 (1525)
Q Consensus 102 sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~ 180 (1525)
.+.|+. .+++++|.|.++.|||+.+.. -.+...+-||...|+.+.|+. +|++|+|.|.||++|++.+..
T Consensus 282 hlrf~n--g~mvtcSkDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~--kyIVsASgDRTikvW~~st~e------ 351 (499)
T KOG0281|consen 282 HLRFSN--GYMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDD--KYIVSASGDRTIKVWSTSTCE------ 351 (499)
T ss_pred EEEEeC--CEEEEecCCceeEEEeccCchHHHHHHHHhhhhhheeeecccc--ceEEEecCCceEEEEecccee------
Confidence 999975 689999999999999998631 111133569999999999954 599999999999999986321
Q ss_pred CccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccc
Q 000425 181 MQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSA 260 (1525)
Q Consensus 181 ~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~ 260 (1525)
.-+++.++. -+|-++.+ .+-++. +|+.|..+++|.-..|. +..+|+||
T Consensus 352 ---------------fvRtl~gHk-------RGIAClQY--r~rlvV------SGSSDntIRlwdi~~G~--cLRvLeGH 399 (499)
T KOG0281|consen 352 ---------------FVRTLNGHK-------RGIACLQY--RDRLVV------SGSSDNTIRLWDIECGA--CLRVLEGH 399 (499)
T ss_pred ---------------eehhhhccc-------ccceehhc--cCeEEE------ecCCCceEEEEeccccH--HHHHHhch
Confidence 011222221 01222222 233444 78888888999877773 67789999
Q ss_pred cCCcccc
Q 000425 261 RGSSASV 267 (1525)
Q Consensus 261 ~~~~~s~ 267 (1525)
...+-++
T Consensus 400 EeLvRci 406 (499)
T KOG0281|consen 400 EELVRCI 406 (499)
T ss_pred HHhhhhe
Confidence 9987443
No 62
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.59 E-value=1.5e-14 Score=168.98 Aligned_cols=215 Identities=16% Similarity=0.203 Sum_probs=161.6
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCC-----ccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcC-
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG-----RAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIH- 81 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg-----k~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~- 81 (1525)
..-|++++++.--+.+.+.+.++.+|+.-..+ ..+..+..+|+..|..++|||... +|++++.|+.+.|||++
T Consensus 180 g~glsWn~~~~g~Lls~~~d~~i~lwdi~~~~~~~~~~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~dd~~L~iwD~R~ 259 (422)
T KOG0264|consen 180 GYGLSWNRQQEGTLLSGSDDHTICLWDINAESKEDKVVDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGDDGKLMIWDTRS 259 (422)
T ss_pred ccccccccccceeEeeccCCCcEEEEeccccccCCccccceEEeecCCcceehhhccccchhhheeecCCCeEEEEEcCC
Confidence 45589999999999999999999999863332 334455556999999999999764 78999999999999999
Q ss_pred -CCcEEEEEe-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEE
Q 000425 82 -TYAVHYTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLV 157 (1525)
Q Consensus 82 -t~k~l~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~La 157 (1525)
+.++..... |...|+ |++|+|-+ ..|++||.|++|++||++.... ....+.+|...|.+|.|+|+ ...|+
T Consensus 260 ~~~~~~~~~~ah~~~vn-----~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~-~lh~~e~H~dev~~V~WSPh~etvLA 333 (422)
T KOG0264|consen 260 NTSKPSHSVKAHSAEVN-----CVAFNPFNEFILATGSADKTVALWDLRNLNK-PLHTFEGHEDEVFQVEWSPHNETVLA 333 (422)
T ss_pred CCCCCcccccccCCcee-----EEEeCCCCCceEEeccCCCcEEEeechhccc-CceeccCCCcceEEEEeCCCCCceeE
Confidence 556667777 999999 99999954 5677899999999999997533 33457799999999999996 46899
Q ss_pred EEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCC
Q 000425 158 TLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGD 237 (1525)
Q Consensus 158 SgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~D 237 (1525)
|++.|+.+.|||+........ +.. ..+ -.+..+-.++|.+ .+|.++.|+|..+.+.. +.+|
T Consensus 334 SSg~D~rl~vWDls~ig~eq~----~ed-a~d-----gppEllF~HgGH~----~kV~DfsWnp~ePW~I~-----Svae 394 (422)
T KOG0264|consen 334 SSGTDRRLNVWDLSRIGEEQS----PED-AED-----GPPELLFIHGGHT----AKVSDFSWNPNEPWTIA-----SVAE 394 (422)
T ss_pred ecccCCcEEEEeccccccccC----hhh-hcc-----CCcceeEEecCcc----cccccccCCCCCCeEEE-----EecC
Confidence 999999999999975433111 000 000 0122333445433 35899999999887542 5667
Q ss_pred ccccccccccc
Q 000425 238 NIKNRAAYTRE 248 (1525)
Q Consensus 238 d~~~k~~~~~e 248 (1525)
|...-+|.+.+
T Consensus 395 DN~LqIW~~s~ 405 (422)
T KOG0264|consen 395 DNILQIWQMAE 405 (422)
T ss_pred CceEEEeeccc
Confidence 76666777554
No 63
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.59 E-value=3.5e-15 Score=181.78 Aligned_cols=208 Identities=15% Similarity=0.194 Sum_probs=150.9
Q ss_pred CEEEEEEeccceEEEEEeecCCc-cceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEEe-cCCce
Q 000425 19 PVVFFGFHRRMSVTVVGTVEGGR-APTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTI 95 (1525)
Q Consensus 19 p~Vf~s~~~d~sI~vv~tieggk-~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V 95 (1525)
..+++.......|.+|+....++ +...+..+|+..|.+++||+..+ +|++|+.||+||+||++..+-..++. ...+|
T Consensus 100 ~NlIAT~s~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~~~S~~t~~~nSESi 179 (839)
T KOG0269|consen 100 SNLIATCSTNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQDGTVKCWDLRSKKSKSTFRSNSESI 179 (839)
T ss_pred hhhheeecCCCcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCCceEEEEeeecccccccccccchhh
Confidence 33444555677788888644222 22233445999999999999987 68899999999999999998888888 88899
Q ss_pred eecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeec
Q 000425 96 KLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 96 ~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~ 174 (1525)
+ .|+|+| .+.+++++.+.|.++.||++.. ..+..-+.+|.++|.|+.|||++.+|+||+.|++|+|||+....
T Consensus 180 R-----DV~fsp~~~~~F~s~~dsG~lqlWDlRqp-~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~ 253 (839)
T KOG0269|consen 180 R-----DVKFSPGYGNKFASIHDSGYLQLWDLRQP-DRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSR 253 (839)
T ss_pred h-----ceeeccCCCceEEEecCCceEEEeeccCc-hhHHHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCC
Confidence 9 999999 5678889999999999999963 22222356999999999999999999999999999999986311
Q ss_pred CCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCC-eEEEEecccccCCcccccccccccCceee
Q 000425 175 NPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLN-LAVLLFANFTGGDNIKNRAAYTREGRKQL 253 (1525)
Q Consensus 175 ~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~N-lial~F~~~s~~Dd~~~k~~~~~eg~~q~ 253 (1525)
. ..+. -| ++++ .|..|.|-|... .+| .++..+|..+++|.-+-- .-=
T Consensus 254 ~---~~~~------------tI---------nTia---pv~rVkWRP~~~~hLA----tcsmv~dtsV~VWDvrRP-YIP 301 (839)
T KOG0269|consen 254 A---KPKH------------TI---------NTIA---PVGRVKWRPARSYHLA----TCSMVVDTSVHVWDVRRP-YIP 301 (839)
T ss_pred c---ccee------------EE---------eecc---eeeeeeeccCccchhh----hhhccccceEEEEeeccc-ccc
Confidence 1 0000 00 0122 457788888754 222 336677888888864321 222
Q ss_pred eeeeccccCCc
Q 000425 254 FAVLQSARGSS 264 (1525)
Q Consensus 254 ~~tlq~~~~~~ 264 (1525)
+.++..|.+.+
T Consensus 302 ~~t~~eH~~~v 312 (839)
T KOG0269|consen 302 YATFLEHTDSV 312 (839)
T ss_pred ceeeeccCccc
Confidence 45777787765
No 64
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.58 E-value=1.3e-14 Score=162.25 Aligned_cols=212 Identities=18% Similarity=0.218 Sum_probs=162.4
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc-----
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI----- 80 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl----- 80 (1525)
-|+.-|+|...++|.+ -+..-+.+..+|. +|.+++..+..+ |.+.|++|.||+.+.++++++.|++-+||..
T Consensus 148 kDGiW~Vaa~~tqpi~-gtASADhTA~iWs-~Esg~CL~~Y~G-H~GSVNsikfh~s~~L~lTaSGD~taHIW~~av~~~ 224 (481)
T KOG0300|consen 148 KDGIWHVAADSTQPIC-GTASADHTARIWS-LESGACLATYTG-HTGSVNSIKFHNSGLLLLTASGDETAHIWKAAVNWE 224 (481)
T ss_pred ccceeeehhhcCCcce-eecccccceeEEe-eccccceeeecc-cccceeeEEeccccceEEEccCCcchHHHHHhhcCc
Confidence 3688899999999954 4444578888887 788888888877 9999999999999999999999999999951
Q ss_pred -CC----------------------------------CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEE
Q 000425 81 -HT----------------------------------YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAW 124 (1525)
Q Consensus 81 -~t----------------------------------~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVW 124 (1525)
.. ..++..|. |...|. +..|-..|+.++++|+|++..+|
T Consensus 225 vP~~~a~~~hSsEeE~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltgH~~vV~-----a~dWL~gg~Q~vTaSWDRTAnlw 299 (481)
T KOG0300|consen 225 VPSNNAPSDHSSEEEEEHSDEHNRDTDSSEKSDGHTIRVPLMRLTGHRAVVS-----ACDWLAGGQQMVTASWDRTANLW 299 (481)
T ss_pred CCCCCCCCCCCchhhhhcccccccccccccccCCceeeeeeeeeeccccceE-----ehhhhcCcceeeeeeccccceee
Confidence 00 01334455 777788 88999999999999999999999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccC
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQG 204 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~g 204 (1525)
|+++ ...+..+.||.+..+.++-||..++++|.|.|.|.++||.|.... ..+.| . ++.
T Consensus 300 DVEt--ge~v~~LtGHd~ELtHcstHptQrLVvTsSrDtTFRLWDFReaI~-----sV~VF-Q--------------GHt 357 (481)
T KOG0300|consen 300 DVET--GEVVNILTGHDSELTHCSTHPTQRLVVTSSRDTTFRLWDFREAIQ-----SVAVF-Q--------------GHT 357 (481)
T ss_pred eecc--CceeccccCcchhccccccCCcceEEEEeccCceeEeccchhhcc-----eeeee-c--------------ccc
Confidence 9995 445566889999999999999999999999999999999983221 00011 1 110
Q ss_pred CcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccC
Q 000425 205 GEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARG 262 (1525)
Q Consensus 205 G~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~ 262 (1525)
..|+.+.|.-..+++ +++||..+|+|..+.=|.-+ .|+|.-..
T Consensus 358 -------dtVTS~vF~~dd~vV-------SgSDDrTvKvWdLrNMRspl-ATIRtdS~ 400 (481)
T KOG0300|consen 358 -------DTVTSVVFNTDDRVV-------SGSDDRTVKVWDLRNMRSPL-ATIRTDSP 400 (481)
T ss_pred -------cceeEEEEecCCcee-------ecCCCceEEEeeeccccCcc-eeeecCCc
Confidence 124455555444443 79999999999987766555 47766444
No 65
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.57 E-value=2.2e-14 Score=160.27 Aligned_cols=219 Identities=18% Similarity=0.280 Sum_probs=159.6
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecC-CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
|-.-.|+|||....+. +..+|.++++++.... .+...++.. ...+|++|.|||.|.++++|...-++|+||+.+.+|
T Consensus 173 devn~l~FHPre~ILi-S~srD~tvKlFDfsK~saKrA~K~~q-d~~~vrsiSfHPsGefllvgTdHp~~rlYdv~T~Qc 250 (430)
T KOG0640|consen 173 DEVNDLDFHPRETILI-SGSRDNTVKLFDFSKTSAKRAFKVFQ-DTEPVRSISFHPSGEFLLVGTDHPTLRLYDVNTYQC 250 (430)
T ss_pred CcccceeecchhheEE-eccCCCeEEEEecccHHHHHHHHHhh-ccceeeeEeecCCCceEEEecCCCceeEEeccceeE
Confidence 3445799999988765 5557999999986443 344455554 578999999999999999999999999999999998
Q ss_pred EEEEe----cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceee-EeecccC-cCeEEEEEcCCCCEEEEE
Q 000425 86 HYTLQ----LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMI-GIIQVGS-QPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 86 l~tL~----H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i-~il~gH~-~~VtsVafsPdg~~LaSg 159 (1525)
...-. |...|+ +|.|++.|+..+++|.||.||+||--+. +|+ .+-.+|. ..|.+..|..+++|++|.
T Consensus 251 fvsanPd~qht~ai~-----~V~Ys~t~~lYvTaSkDG~IklwDGVS~--rCv~t~~~AH~gsevcSa~Ftkn~kyiLsS 323 (430)
T KOG0640|consen 251 FVSANPDDQHTGAIT-----QVRYSSTGSLYVTASKDGAIKLWDGVSN--RCVRTIGNAHGGSEVCSAVFTKNGKYILSS 323 (430)
T ss_pred eeecCccccccccee-----EEEecCCccEEEEeccCCcEEeeccccH--HHHHHHHhhcCCceeeeEEEccCCeEEeec
Confidence 77655 888999 9999999999999999999999997742 333 2234665 468889999999999999
Q ss_pred ECCCCeeEEEeeeec-----------CCCCCCCccccccCC------------CcccccccccccccCCc--ceecC---
Q 000425 160 CRDGSLQVWKTRVII-----------NPNRPPMQANFFEPA------------SIESIDIPRILSQQGGE--AVYPL--- 211 (1525)
Q Consensus 160 S~DgtIkVWDlr~~~-----------~p~~~~~~a~~f~p~------------gVesldi~~iLs~~gG~--~vy~l--- 211 (1525)
+.|.+|++|.+.+.. ...++..++.| +.. .+.++|-. .++ +.+++
T Consensus 324 G~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvF-NhtEdyVl~pDEas~slcsWdaR------tadr~~l~slgHn 396 (430)
T KOG0640|consen 324 GKDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVF-NHTEDYVLFPDEASNSLCSWDAR------TADRVALLSLGHN 396 (430)
T ss_pred CCcceeeeeeecCCceEEEEecCCcccchhhhhhhhh-cCccceEEccccccCceeecccc------chhhhhhcccCCC
Confidence 999999999986211 01112222322 111 12222211 111 11222
Q ss_pred CCeeEEEEecCCCeEEEEecccccCCcccccccccc
Q 000425 212 PRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 212 ~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
..++|+...|.++... +++||-..|.|+++
T Consensus 397 ~a~R~i~HSP~~p~Fm------TcsdD~raRFWyrr 426 (430)
T KOG0640|consen 397 GAVRWIVHSPVEPAFM------TCSDDFRARFWYRR 426 (430)
T ss_pred CCceEEEeCCCCCcee------eecccceeeeeeec
Confidence 2467888888877655 89999999999876
No 66
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.56 E-value=7.3e-14 Score=160.02 Aligned_cols=229 Identities=14% Similarity=0.189 Sum_probs=168.4
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC---EEEEEECCCcEEEEEcCCCcE----EEEEe
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP---VLYVAYADGLIRAYNIHTYAV----HYTLQ 90 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~---~LaSGs~DgtIrLWDl~t~k~----l~tL~ 90 (1525)
..+||.|+. +..+++|+. .|+....+.+ |.++|.+++|.-..+ .|++++.|.++++|-++.+.. +..-.
T Consensus 115 ~~~IltgsY-Dg~~riWd~--~Gk~~~~~~G-ht~~ik~v~~v~~n~~~~~fvsas~Dqtl~Lw~~~~~~~~~~~~~~~~ 190 (423)
T KOG0313|consen 115 SKWILTGSY-DGTSRIWDL--KGKSIKTIVG-HTGPIKSVAWVIKNSSSCLFVSASMDQTLRLWKWNVGENKVKALKVCR 190 (423)
T ss_pred CceEEEeec-CCeeEEEec--CCceEEEEec-CCcceeeeEEEecCCccceEEEecCCceEEEEEecCchhhhhHHhHhc
Confidence 357888875 788888873 4555555555 999999999875544 599999999999998876643 23334
Q ss_pred -cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-----------------------CceeeEeecccCcCeEE
Q 000425 91 -LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-----------------------RPSMIGIIQVGSQPITS 146 (1525)
Q Consensus 91 -H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-----------------------~~s~i~il~gH~~~Vts 146 (1525)
|...|- +|+..++|..+++||.|.+++||+..+. .+.....+.+|+..|.+
T Consensus 191 GHk~~V~-----sVsv~~sgtr~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r~P~vtl~GHt~~Vs~ 265 (423)
T KOG0313|consen 191 GHKRSVD-----SVSVDSSGTRFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQKREKEGGTRTPLVTLEGHTEPVSS 265 (423)
T ss_pred cccccee-----EEEecCCCCeEEeecccceeeecccCCCccccccccchhhhhhhhhhhcccccCceEEecccccceee
Confidence 999999 9999999999999999999999993221 11112357899999999
Q ss_pred EEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeE
Q 000425 147 VAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLA 226 (1525)
Q Consensus 147 VafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nli 226 (1525)
|.|++ ...+.|+|.|.||+.||+.+..... .+ ..-..+++++.||..+++
T Consensus 266 V~w~d-~~v~yS~SwDHTIk~WDletg~~~~---------------------~~--------~~~ksl~~i~~~~~~~Ll 315 (423)
T KOG0313|consen 266 VVWSD-ATVIYSVSWDHTIKVWDLETGGLKS---------------------TL--------TTNKSLNCISYSPLSKLL 315 (423)
T ss_pred EEEcC-CCceEeecccceEEEEEeeccccee---------------------ee--------ecCcceeEeeccccccee
Confidence 99998 6689999999999999997433200 00 001246889999999999
Q ss_pred EEEecccccCCcccccccccccCcee-eeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeeh
Q 000425 227 VLLFANFTGGDNIKNRAAYTREGRKQ-LFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISD 305 (1525)
Q Consensus 227 al~F~~~s~~Dd~~~k~~~~~eg~~q-~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~d 305 (1525)
+ +++.|..++.|..|.|... -..+|-||.+.+.++ +.++....-.+. -+-|++..+||
T Consensus 316 ~------~gssdr~irl~DPR~~~gs~v~~s~~gH~nwVssv---kwsp~~~~~~~S------------~S~D~t~klWD 374 (423)
T KOG0313|consen 316 A------SGSSDRHIRLWDPRTGDGSVVSQSLIGHKNWVSSV---KWSPTNEFQLVS------------GSYDNTVKLWD 374 (423)
T ss_pred e------ecCCCCceeecCCCCCCCceeEEeeecchhhhhhe---ecCCCCceEEEE------------EecCCeEEEEE
Confidence 9 8999999999998876433 335688999987543 333333321221 12488999999
Q ss_pred h
Q 000425 306 I 306 (1525)
Q Consensus 306 i 306 (1525)
.
T Consensus 375 v 375 (423)
T KOG0313|consen 375 V 375 (423)
T ss_pred e
Confidence 4
No 67
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.56 E-value=2.1e-14 Score=174.63 Aligned_cols=151 Identities=15% Similarity=0.229 Sum_probs=138.0
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
=+++++|....+.+++ .|++.++|+. +...-.-.+.+ |+..|.++.|+|.++.++++|.|+|||||.+.++.|+++|
T Consensus 467 N~Vaia~ndkLiAT~S-qDktaKiW~l-e~~~l~~vLsG-H~RGvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~ 543 (775)
T KOG0319|consen 467 NCVAIAPNDKLIATGS-QDKTAKIWDL-EQLRLLGVLSG-HTRGVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTF 543 (775)
T ss_pred cceEecCCCceEEecc-cccceeeecc-cCceEEEEeeC-CccceEEEEeccccceeEeccCCceEEEEEeccceeeeee
Confidence 4689999999988777 5899999985 56666666666 9999999999999999999999999999999999999999
Q ss_pred e-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
+ |...|. .+.|-.+|+.|+|++.||.+|+||+.+ ..|...+++|++.|.+++-+|....++||+.|+.|.+|
T Consensus 544 eGH~~aVl-----ra~F~~~~~qliS~~adGliKlWnikt--~eC~~tlD~H~DrvWaL~~~~~~~~~~tgg~Dg~i~~w 616 (775)
T KOG0319|consen 544 EGHTSAVL-----RASFIRNGKQLISAGADGLIKLWNIKT--NECEMTLDAHNDRVWALSVSPLLDMFVTGGGDGRIIFW 616 (775)
T ss_pred cCccceeE-----eeeeeeCCcEEEeccCCCcEEEEeccc--hhhhhhhhhccceeEEEeecCccceeEecCCCeEEEEe
Confidence 9 999999 999999999999999999999999995 55666689999999999999999999999999999999
Q ss_pred Ee
Q 000425 169 KT 170 (1525)
Q Consensus 169 Dl 170 (1525)
.=
T Consensus 617 kD 618 (775)
T KOG0319|consen 617 KD 618 (775)
T ss_pred ec
Confidence 64
No 68
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.56 E-value=9e-14 Score=159.29 Aligned_cols=161 Identities=17% Similarity=0.276 Sum_probs=132.2
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEe-------ecC----------------CccceEEccCCCCCeEEEEEcCCC
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGT-------VEG----------------GRAPTKIKTDLKKPIVNLACHPRL 63 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~t-------ieg----------------gk~~tklk~~H~~~V~sLaf~Pdg 63 (1525)
....++++.+++..+..|+ -|.++.+|+. .|. .+.+.....+|..+|.++.|++ .
T Consensus 194 ~~V~sVsv~~sgtr~~SgS-~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r~P~vtl~GHt~~Vs~V~w~d-~ 271 (423)
T KOG0313|consen 194 RSVDSVSVDSSGTRFCSGS-WDTMLKIWSVETDEEDELESSSNRRRKKQKREKEGGTRTPLVTLEGHTEPVSSVVWSD-A 271 (423)
T ss_pred cceeEEEecCCCCeEEeec-ccceeeecccCCCccccccccchhhhhhhhhhhcccccCceEEecccccceeeEEEcC-C
Confidence 3567888889988765555 5888999981 111 1223333445999999999998 6
Q ss_pred CEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee-eEeecccCc
Q 000425 64 PVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM-IGIIQVGSQ 142 (1525)
Q Consensus 64 ~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~-i~il~gH~~ 142 (1525)
..+++++.|.+||.||+.++....++.....++ +++++|....|++||.|..+++||-++...++ ...+.+|..
T Consensus 272 ~v~yS~SwDHTIk~WDletg~~~~~~~~~ksl~-----~i~~~~~~~Ll~~gssdr~irl~DPR~~~gs~v~~s~~gH~n 346 (423)
T KOG0313|consen 272 TVIYSVSWDHTIKVWDLETGGLKSTLTTNKSLN-----CISYSPLSKLLASGSSDRHIRLWDPRTGDGSVVSQSLIGHKN 346 (423)
T ss_pred CceEeecccceEEEEEeecccceeeeecCccee-----EeecccccceeeecCCCCceeecCCCCCCCceeEEeeecchh
Confidence 789999999999999999999999998888999 99999999999999999999999999852222 245669999
Q ss_pred CeEEEEEcCCCC-EEEEEECCCCeeEEEeeeec
Q 000425 143 PITSVAWLPMLR-LLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 143 ~VtsVafsPdg~-~LaSgS~DgtIkVWDlr~~~ 174 (1525)
.|.++.|||... .|+|+|.|+++++||+|...
T Consensus 347 wVssvkwsp~~~~~~~S~S~D~t~klWDvRS~k 379 (423)
T KOG0313|consen 347 WVSSVKWSPTNEFQLVSGSYDNTVKLWDVRSTK 379 (423)
T ss_pred hhhheecCCCCceEEEEEecCCeEEEEEeccCC
Confidence 999999999655 68999999999999999654
No 69
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.56 E-value=1.7e-13 Score=151.53 Aligned_cols=222 Identities=14% Similarity=0.168 Sum_probs=154.1
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEec
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQL 91 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H 91 (1525)
|.++-.+.++|. +.+|.+..||....+.+ .-...+ |++.|.|++.+-+.+.+++|+.|.++++||.++++++.+++.
T Consensus 16 iKyN~eGDLlFs-caKD~~~~vw~s~nGer-lGty~G-HtGavW~~Did~~s~~liTGSAD~t~kLWDv~tGk~la~~k~ 92 (327)
T KOG0643|consen 16 IKYNREGDLLFS-CAKDSTPTVWYSLNGER-LGTYDG-HTGAVWCCDIDWDSKHLITGSADQTAKLWDVETGKQLATWKT 92 (327)
T ss_pred EEecCCCcEEEE-ecCCCCceEEEecCCce-eeeecC-CCceEEEEEecCCcceeeeccccceeEEEEcCCCcEEEEeec
Confidence 456667777654 45789999998665543 334444 999999999999999999999999999999999988776652
Q ss_pred CCcee---e----------------------------------------------cCceeEEEeCCCCEEEEEeCCCcEE
Q 000425 92 DNTIK---L----------------------------------------------LGAGAFAFHPTLEWLFVGDRRGTLL 122 (1525)
Q Consensus 92 ~~~V~---~----------------------------------------------~ga~sVaFSPdg~~L~SgS~DgtVr 122 (1525)
..+|. + +.....-|.|-+++|++|..||.|.
T Consensus 93 ~~~Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~ii~Ghe~G~is 172 (327)
T KOG0643|consen 93 NSPVKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSALWGPLGETIIAGHEDGSIS 172 (327)
T ss_pred CCeeEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEecCCccceeeeeecccCCEEEEecCCCcEE
Confidence 22221 0 1113455788888889999999999
Q ss_pred EEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCC----CCCCcccccc--------CCC
Q 000425 123 AWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPN----RPPMQANFFE--------PAS 190 (1525)
Q Consensus 123 VWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~----~~~~~a~~f~--------p~g 190 (1525)
+||.++. ...+..-+.|+..|+.+++++|..+++|+|.|.+-++||.+....-+ ..++.+..+. ..|
T Consensus 173 ~~da~~g-~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v~Kty~te~PvN~aaisP~~d~VilgGG 251 (327)
T KOG0643|consen 173 IYDARTG-KELVDSDEEHSSKINDLQFSRDRTYFITGSKDTTAKLVDVRTLEVLKTYTTERPVNTAAISPLLDHVILGGG 251 (327)
T ss_pred EEEcccC-ceeeechhhhccccccccccCCcceEEecccCccceeeeccceeeEEEeeecccccceecccccceEEecCC
Confidence 9999964 34444456799999999999999999999999999999998321100 1111111111 234
Q ss_pred cccccccccccccC---------------CcceecCCCeeEEEEecCCCeEEEEecccccCCcccccc
Q 000425 191 IESIDIPRILSQQG---------------GEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRA 243 (1525)
Q Consensus 191 Vesldi~~iLs~~g---------------G~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~ 243 (1525)
++..|+...-...| |..+-.+-.|+.++|||++-..+ ++++|.-+|+
T Consensus 252 qeA~dVTTT~~r~GKFEArFyh~i~eEEigrvkGHFGPINsvAfhPdGksYs------SGGEDG~VR~ 313 (327)
T KOG0643|consen 252 QEAMDVTTTSTRAGKFEARFYHLIFEEEIGRVKGHFGPINSVAFHPDGKSYS------SGGEDGYVRL 313 (327)
T ss_pred ceeeeeeeecccccchhhhHHHHHHHHHhccccccccCcceeEECCCCcccc------cCCCCceEEE
Confidence 44444443322111 11111234689999999998877 8999987763
No 70
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.55 E-value=2.1e-13 Score=150.44 Aligned_cols=208 Identities=18% Similarity=0.278 Sum_probs=157.3
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+...-|+++|.+|-+|++...+++|++|+. ..+++...+.. ++.-..+.|+|+|.+++++..|..|...|.++.+..
T Consensus 65 ~svdql~w~~~~~d~~atas~dk~ir~wd~-r~~k~~~~i~~--~~eni~i~wsp~g~~~~~~~kdD~it~id~r~~~~~ 141 (313)
T KOG1407|consen 65 DSVDQLCWDPKHPDLFATASGDKTIRIWDI-RSGKCTARIET--KGENINITWSPDGEYIAVGNKDDRITFIDARTYKIV 141 (313)
T ss_pred cchhhheeCCCCCcceEEecCCceEEEEEe-ccCcEEEEeec--cCcceEEEEcCCCCEEEEecCcccEEEEEeccccee
Confidence 456779999999999999999999999995 44455444443 333445799999999999999999999999999888
Q ss_pred EEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 87 YTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 87 ~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
...+....++ -+.|+-++..++....-|+|.|-.+-.. ..+..+++|+....||.|+|+|+||++||.|..+.
T Consensus 142 ~~~~~~~e~n-----e~~w~~~nd~Fflt~GlG~v~ILsypsL--kpv~si~AH~snCicI~f~p~GryfA~GsADAlvS 214 (313)
T KOG1407|consen 142 NEEQFKFEVN-----EISWNNSNDLFFLTNGLGCVEILSYPSL--KPVQSIKAHPSNCICIEFDPDGRYFATGSADALVS 214 (313)
T ss_pred ehhcccceee-----eeeecCCCCEEEEecCCceEEEEecccc--ccccccccCCcceEEEEECCCCceEeeccccceee
Confidence 8877666777 8999988888888888899999887743 22234679999999999999999999999999999
Q ss_pred EEEeeee-----cCCCCCCCccccccCCCc------c--cccccccccccCCcceecCC---CeeEEEEecCCCeEE
Q 000425 167 VWKTRVI-----INPNRPPMQANFFEPASI------E--SIDIPRILSQQGGEAVYPLP---RVRALEVHPRLNLAV 227 (1525)
Q Consensus 167 VWDlr~~-----~~p~~~~~~a~~f~p~gV------e--sldi~~iLs~~gG~~vy~l~---~V~~v~~HP~~Nlia 227 (1525)
+||+.-. ....+.++.+..|..++- + .+| |.....|++++-++ ...+|+|||+.+|+|
T Consensus 215 LWD~~ELiC~R~isRldwpVRTlSFS~dg~~lASaSEDh~ID---IA~vetGd~~~eI~~~~~t~tVAWHPk~~LLA 288 (313)
T KOG1407|consen 215 LWDVDELICERCISRLDWPVRTLSFSHDGRMLASASEDHFID---IAEVETGDRVWEIPCEGPTFTVAWHPKRPLLA 288 (313)
T ss_pred ccChhHhhhheeeccccCceEEEEeccCcceeeccCccceEE---eEecccCCeEEEeeccCCceeEEecCCCceee
Confidence 9998721 111222333322333220 0 123 34455677776664 467899999999988
No 71
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.54 E-value=8.8e-14 Score=175.48 Aligned_cols=156 Identities=13% Similarity=0.231 Sum_probs=130.1
Q ss_pred cEEEEEEcCCCCEEEEEEec-cceEEEEEeecC-----------CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcE
Q 000425 8 TEVHLALTPLQPVVFFGFHR-RMSVTVVGTVEG-----------GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLI 75 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~-d~sI~vv~tieg-----------gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtI 75 (1525)
..+.|++||++-.+.+|.+. |.++++|....- .+....+. +|.+.|.|+.|+|+|.+||+|++|+.|
T Consensus 15 ~IfSIdv~pdg~~~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~-~h~~sv~CVR~S~dG~~lAsGSDD~~v 93 (942)
T KOG0973|consen 15 SIFSIDVHPDGVKFATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMD-DHDGSVNCVRFSPDGSYLASGSDDRLV 93 (942)
T ss_pred eEEEEEecCCceeEecCCccccccceeeccccccchhhhhhcccchhheeec-cccCceeEEEECCCCCeEeeccCcceE
Confidence 46799999999988777632 445556654110 12233344 499999999999999999999999999
Q ss_pred EEEEcCC------------------CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEe
Q 000425 76 RAYNIHT------------------YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGI 136 (1525)
Q Consensus 76 rLWDl~t------------------~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~i 136 (1525)
-||+... +++...+. |...|. .|+|+|++.+++++|-|++|.|||..+. ..+..
T Consensus 94 ~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~-----Dv~Wsp~~~~lvS~s~DnsViiwn~~tF--~~~~v 166 (942)
T KOG0973|consen 94 MIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVL-----DVNWSPDDSLLVSVSLDNSVIIWNAKTF--ELLKV 166 (942)
T ss_pred EEeeecccCCcccccccccccccceeeEEEEEecCCCccc-----eeccCCCccEEEEecccceEEEEccccc--eeeee
Confidence 9998761 24677888 999999 9999999999999999999999999975 55567
Q ss_pred ecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 137 IQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 137 l~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+.+|...|..++|.|-|+||+|-|+|++|+||++.
T Consensus 167 l~~H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt~ 201 (942)
T KOG0973|consen 167 LRGHQSLVKGVSWDPIGKYFASQSDDRTLKVWRTS 201 (942)
T ss_pred eecccccccceEECCccCeeeeecCCceEEEEEcc
Confidence 88999999999999999999999999999999965
No 72
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.53 E-value=7.4e-14 Score=169.65 Aligned_cols=155 Identities=15% Similarity=0.261 Sum_probs=138.8
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
|...+++++|++.++.++. -+.++.++.. +.-+-...+.+ |+=||.|++.+||+.++++||.|.+||||-+.=+.|.
T Consensus 509 ddvL~v~~Spdgk~LaVsL-LdnTVkVyfl-DtlKFflsLYG-HkLPV~smDIS~DSklivTgSADKnVKiWGLdFGDCH 585 (888)
T KOG0306|consen 509 DDVLCVSVSPDGKLLAVSL-LDNTVKVYFL-DTLKFFLSLYG-HKLPVLSMDISPDSKLIVTGSADKNVKIWGLDFGDCH 585 (888)
T ss_pred ccEEEEEEcCCCcEEEEEe-ccCeEEEEEe-cceeeeeeecc-cccceeEEeccCCcCeEEeccCCCceEEeccccchhh
Confidence 5678999999999999998 5888888763 44444445555 9999999999999999999999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.|- |+++|. +|.|-|+...++++|.|+.|+-||-. ...+++.+.+|...|+|++.+|+|.+++|+|.|.+|
T Consensus 586 KS~fAHdDSvm-----~V~F~P~~~~FFt~gKD~kvKqWDg~--kFe~iq~L~~H~~ev~cLav~~~G~~vvs~shD~sI 658 (888)
T KOG0306|consen 586 KSFFAHDDSVM-----SVQFLPKTHLFFTCGKDGKVKQWDGE--KFEEIQKLDGHHSEVWCLAVSPNGSFVVSSSHDKSI 658 (888)
T ss_pred hhhhcccCcee-----EEEEcccceeEEEecCcceEEeechh--hhhhheeeccchheeeeeEEcCCCCeEEeccCCcee
Confidence 9999 999999 99999999999999999999999998 456667788999999999999999999999999999
Q ss_pred eEEEee
Q 000425 166 QVWKTR 171 (1525)
Q Consensus 166 kVWDlr 171 (1525)
++|.-.
T Consensus 659 RlwE~t 664 (888)
T KOG0306|consen 659 RLWERT 664 (888)
T ss_pred Eeeecc
Confidence 999754
No 73
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=99.52 E-value=5.1e-12 Score=151.37 Aligned_cols=140 Identities=14% Similarity=0.203 Sum_probs=109.2
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER 130 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~ 130 (1525)
+.|..++|-|||..++.+..| .+.+||...+..+.+++ |.+.|. +|+|+.||+.++||+.|..|.+|+-.-+
T Consensus 13 hci~d~afkPDGsqL~lAAg~-rlliyD~ndG~llqtLKgHKDtVy-----cVAys~dGkrFASG~aDK~VI~W~~klE- 85 (1081)
T KOG1538|consen 13 HCINDIAFKPDGTQLILAAGS-RLLVYDTSDGTLLQPLKGHKDTVY-----CVAYAKDGKRFASGSADKSVIIWTSKLE- 85 (1081)
T ss_pred cchheeEECCCCceEEEecCC-EEEEEeCCCcccccccccccceEE-----EEEEccCCceeccCCCceeEEEeccccc-
Confidence 378999999999988887654 68899999999999999 999999 9999999999999999999999986632
Q ss_pred ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceec
Q 000425 131 PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYP 210 (1525)
Q Consensus 131 ~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~ 210 (1525)
. .. .-.|++.|.|++|+|-...|+|||.. ..-+|...... +..+ .+
T Consensus 86 G-~L--kYSH~D~IQCMsFNP~~h~LasCsLs-dFglWS~~qK~------------------------V~K~------ks 131 (1081)
T KOG1538|consen 86 G-IL--KYSHNDAIQCMSFNPITHQLASCSLS-DFGLWSPEQKS------------------------VSKH------KS 131 (1081)
T ss_pred c-ee--eeccCCeeeEeecCchHHHhhhcchh-hccccChhhhh------------------------HHhh------hh
Confidence 1 11 12699999999999999999999875 35567543100 0000 11
Q ss_pred CCCeeEEEEecCCCeEEEEecc
Q 000425 211 LPRVRALEVHPRLNLAVLLFAN 232 (1525)
Q Consensus 211 l~~V~~v~~HP~~Nlial~F~~ 232 (1525)
..|+.+++|..++...++...+
T Consensus 132 s~R~~~CsWtnDGqylalG~~n 153 (1081)
T KOG1538|consen 132 SSRIICCSWTNDGQYLALGMFN 153 (1081)
T ss_pred heeEEEeeecCCCcEEEEeccC
Confidence 2378889999888888866544
No 74
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.51 E-value=3e-13 Score=151.58 Aligned_cols=119 Identities=16% Similarity=0.231 Sum_probs=100.5
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
+.|+...|+|+|..-.++++.+.|.+|++|+.-..+...-+....|.+||.+++|+.+|..+++|+.|+.+++||+.+++
T Consensus 26 P~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q 105 (347)
T KOG0647|consen 26 PEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLASGQ 105 (347)
T ss_pred cccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeeccCCceEEEEccCCC
Confidence 57899999999977777777778999999997554444334444599999999999999999999999999999999995
Q ss_pred EEEEEecCCceeecCceeEEEeCCCC--EEEEEeCCCcEEEEeCCC
Q 000425 85 VHYTLQLDNTIKLLGAGAFAFHPTLE--WLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 85 ~l~tL~H~~~V~~~ga~sVaFSPdg~--~L~SgS~DgtVrVWDl~t 128 (1525)
....-.|..+|. ++.|-+... .|++||+|++||.||.+.
T Consensus 106 ~~~v~~Hd~pvk-----t~~wv~~~~~~cl~TGSWDKTlKfWD~R~ 146 (347)
T KOG0647|consen 106 VSQVAAHDAPVK-----TCHWVPGMNYQCLVTGSWDKTLKFWDTRS 146 (347)
T ss_pred eeeeeeccccee-----EEEEecCCCcceeEecccccceeecccCC
Confidence 544333999999 999987655 799999999999999874
No 75
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.50 E-value=2.1e-14 Score=173.93 Aligned_cols=167 Identities=20% Similarity=0.292 Sum_probs=137.9
Q ss_pred CCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeC
Q 000425 39 GGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDR 117 (1525)
Q Consensus 39 ggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~ 117 (1525)
+.+..+.+.+ |+.+|.||.|++...+++.|+.||+||+||+...+.+++|. |...+. +|.|||-+.+.++|+.
T Consensus 59 kp~~i~S~~~-hespIeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~-----sv~f~P~~~~~a~gSt 132 (825)
T KOG0267|consen 59 KPNAITSLTG-HESPIESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNIT-----SVDFHPYGEFFASGST 132 (825)
T ss_pred CCchhheeec-cCCcceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccCcc-----eeeeccceEEeccccc
Confidence 3444555555 99999999999999999999999999999999999999999 999999 9999999999999999
Q ss_pred CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCccccccc
Q 000425 118 RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIP 197 (1525)
Q Consensus 118 DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~ 197 (1525)
|..+++||.+. .-+..++.+|...|.++.|+|+|++++++++|.+++|||.+... +...|
T Consensus 133 dtd~~iwD~Rk--~Gc~~~~~s~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk------~~~ef------------ 192 (825)
T KOG0267|consen 133 DTDLKIWDIRK--KGCSHTYKSHTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGK------LSKEF------------ 192 (825)
T ss_pred cccceehhhhc--cCceeeecCCcceeEEEeecCCCceeeccCCcceeeeecccccc------ccccc------------
Confidence 99999999993 44555577899999999999999999999999999999986211 11111
Q ss_pred ccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccc
Q 000425 198 RILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 198 ~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
..+.| ++..++|||..-+.+ +++.|..++.|..+
T Consensus 193 ---~~~e~-------~v~sle~hp~e~Lla------~Gs~d~tv~f~dle 226 (825)
T KOG0267|consen 193 ---KSHEG-------KVQSLEFHPLEVLLA------PGSSDRTVRFWDLE 226 (825)
T ss_pred ---ccccc-------cccccccCchhhhhc------cCCCCceeeeeccc
Confidence 11111 355678999877766 88889998988865
No 76
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.50 E-value=1.5e-13 Score=150.30 Aligned_cols=205 Identities=13% Similarity=0.187 Sum_probs=150.6
Q ss_pred EcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC--CCEEEEEECCCcEEEEEcCCC--cEE-EE
Q 000425 14 LTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR--LPVLYVAYADGLIRAYNIHTY--AVH-YT 88 (1525)
Q Consensus 14 ~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd--g~~LaSGs~DgtIrLWDl~t~--k~l-~t 88 (1525)
.||..-.++.+++-+..|.+|..-.+.+........|+..|++|+|.|. |-.|+++|.||+|+|.++++. -.. +.
T Consensus 65 ahPk~G~iLAScsYDgkVIiWke~~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~w~t~ki 144 (299)
T KOG1332|consen 65 AHPKFGTILASCSYDGKVIIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGWTTSKI 144 (299)
T ss_pred cccccCcEeeEeecCceEEEEecCCCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCCCccchhh
Confidence 4788888888888899998887544444444445569999999999887 458999999999999997754 222 23
Q ss_pred Ee-cCCceeecCceeEEEeCC---C-----------CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC
Q 000425 89 LQ-LDNTIKLLGAGAFAFHPT---L-----------EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML 153 (1525)
Q Consensus 89 L~-H~~~V~~~ga~sVaFSPd---g-----------~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg 153 (1525)
.. |.-.|+ +++|.|- | +.|++|+.|..|+||+.....-.+..++.+|++.|+.++|.|.-
T Consensus 145 ~~aH~~Gvn-----sVswapa~~~g~~~~~~~~~~~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~ 219 (299)
T KOG1332|consen 145 VFAHEIGVN-----SVSWAPASAPGSLVDQGPAAKVKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSV 219 (299)
T ss_pred hhccccccc-----eeeecCcCCCccccccCcccccceeeccCCccceeeeecCCcchhhhhhhhhcchhhhhhhhcccc
Confidence 33 999999 9999885 4 55999999999999998863222223467999999999999963
Q ss_pred ----CEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEE
Q 000425 154 ----RLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLL 229 (1525)
Q Consensus 154 ----~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~ 229 (1525)
.+|+|||.||+|.||..+.-.++-+...... ++ ..++.++|.+++|+++
T Consensus 220 gl~~s~iAS~SqDg~viIwt~~~e~e~wk~tll~~------------------------f~-~~~w~vSWS~sGn~La-- 272 (299)
T KOG1332|consen 220 GLPKSTIASCSQDGTVIIWTKDEEYEPWKKTLLEE------------------------FP-DVVWRVSWSLSGNILA-- 272 (299)
T ss_pred CCCceeeEEecCCCcEEEEEecCccCccccccccc------------------------CC-cceEEEEEeccccEEE--
Confidence 5799999999999998762222111111100 00 1367899999999999
Q ss_pred ecccccCCccccccccc-ccCceeee
Q 000425 230 FANFTGGDNIKNRAAYT-REGRKQLF 254 (1525)
Q Consensus 230 F~~~s~~Dd~~~k~~~~-~eg~~q~~ 254 (1525)
.++.|.++..|+. .+|+++.+
T Consensus 273 ----Vs~GdNkvtlwke~~~Gkw~~v 294 (299)
T KOG1332|consen 273 ----VSGGDNKVTLWKENVDGKWEEV 294 (299)
T ss_pred ----EecCCcEEEEEEeCCCCcEEEc
Confidence 5666777777764 45666664
No 77
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.50 E-value=2.8e-13 Score=151.82 Aligned_cols=114 Identities=23% Similarity=0.379 Sum_probs=96.3
Q ss_pred CCCCeEEEEEcCC-CCEEEEEECCCcEEEEEcCC-CcEEEE-Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe
Q 000425 50 LKKPIVNLACHPR-LPVLYVAYADGLIRAYNIHT-YAVHYT-LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD 125 (1525)
Q Consensus 50 H~~~V~sLaf~Pd-g~~LaSGs~DgtIrLWDl~t-~k~l~t-L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD 125 (1525)
-.+.|.+|+|+|. ..+++.||+|++||+|+++. +...-+ .. |..+|. +++|+.||..+++|+.|+++++||
T Consensus 26 P~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL-----~v~WsddgskVf~g~~Dk~~k~wD 100 (347)
T KOG0647|consen 26 PEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVL-----DVCWSDDGSKVFSGGCDKQAKLWD 100 (347)
T ss_pred cccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCCCeE-----EEEEccCCceEEeeccCCceEEEE
Confidence 4678999999994 45677999999999999876 343333 23 899999 999999999999999999999999
Q ss_pred CCCCCceeeEeecccCcCeEEEEEcCCCC--EEEEEECCCCeeEEEee
Q 000425 126 VSIERPSMIGIIQVGSQPITSVAWLPMLR--LLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 126 l~t~~~s~i~il~gH~~~VtsVafsPdg~--~LaSgS~DgtIkVWDlr 171 (1525)
+.+..... ...|..+|.++.|-+... .|+|||.|+|||.||.|
T Consensus 101 L~S~Q~~~---v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R 145 (347)
T KOG0647|consen 101 LASGQVSQ---VAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTR 145 (347)
T ss_pred ccCCCeee---eeecccceeEEEEecCCCcceeEecccccceeecccC
Confidence 99752222 348999999999988665 89999999999999999
No 78
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.49 E-value=4.3e-13 Score=155.63 Aligned_cols=211 Identities=19% Similarity=0.224 Sum_probs=159.5
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+.-.+...|..+.+++|. .+.++.+++ ...+.....+++ |...|+++.|||+...+++++.|..|+||.........
T Consensus 221 gi~ald~~~s~~~ilTGG-~d~~av~~d-~~s~q~l~~~~G-h~kki~~v~~~~~~~~v~~aSad~~i~vws~~~~s~~~ 297 (506)
T KOG0289|consen 221 GITALDIIPSSSKILTGG-EDKTAVLFD-KPSNQILATLKG-HTKKITSVKFHKDLDTVITASADEIIRVWSVPLSSEPT 297 (506)
T ss_pred CeeEEeecCCCCcceecC-CCCceEEEe-cchhhhhhhccC-cceEEEEEEeccchhheeecCCcceEEeeccccccCcc
Confidence 344556667778888877 477777766 455555556666 99999999999999999999999999999988777766
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
... |+..|+ .+..||.|.||+++|.|++..+.|+++...-....-....-.+++.+|||||-+|.+|+.|+.|+
T Consensus 298 ~~~~h~~~V~-----~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHpDgLifgtgt~d~~vk 372 (506)
T KOG0289|consen 298 SSRPHEEPVT-----GLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHPDGLIFGTGTPDGVVK 372 (506)
T ss_pred ccccccccce-----eeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcCCceEEeccCCCceEE
Confidence 677 999999 99999999999999999999999999742222111111223489999999999999999999999
Q ss_pred EEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccc
Q 000425 167 VWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 167 VWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~ 246 (1525)
|||+..-.+ .+.| . .+. ..|+.+.|..++-..+ ++.||..++.|..
T Consensus 373 iwdlks~~~------~a~F-p--------------ght-------~~vk~i~FsENGY~La------t~add~~V~lwDL 418 (506)
T KOG0289|consen 373 IWDLKSQTN------VAKF-P--------------GHT-------GPVKAISFSENGYWLA------TAADDGSVKLWDL 418 (506)
T ss_pred EEEcCCccc------cccC-C--------------CCC-------CceeEEEeccCceEEE------EEecCCeEEEEEe
Confidence 999873221 1122 1 111 1578888888777777 7888888999998
Q ss_pred ccCceeeeeeeccccC
Q 000425 247 REGRKQLFAVLQSARG 262 (1525)
Q Consensus 247 ~eg~~q~~~tlq~~~~ 262 (1525)
|.-+ .+.+++....
T Consensus 419 RKl~--n~kt~~l~~~ 432 (506)
T KOG0289|consen 419 RKLK--NFKTIQLDEK 432 (506)
T ss_pred hhhc--ccceeecccc
Confidence 7654 5666666553
No 79
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.49 E-value=4.4e-14 Score=159.52 Aligned_cols=195 Identities=15% Similarity=0.239 Sum_probs=146.1
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
..+.|.|+.+. +..+++|..|.+|+|||..+..+++.+. |.++|. |+.|.. +.|++||+|.||+|||+++
T Consensus 196 ~skgVYClQYD--D~kiVSGlrDnTikiWD~n~~~c~~~L~GHtGSVL-----CLqyd~--rviisGSSDsTvrvWDv~t 266 (499)
T KOG0281|consen 196 NSKGVYCLQYD--DEKIVSGLRDNTIKIWDKNSLECLKILTGHTGSVL-----CLQYDE--RVIVSGSSDSTVRVWDVNT 266 (499)
T ss_pred cCCceEEEEec--chhhhcccccCceEEeccccHHHHHhhhcCCCcEE-----eeeccc--eEEEecCCCceEEEEeccC
Confidence 45789999985 5679999999999999999999999999 999999 998854 5999999999999999996
Q ss_pred CCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcce
Q 000425 129 ERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAV 208 (1525)
Q Consensus 129 ~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~v 208 (1525)
. .++.++-+|...|..+.|+. .+++|||.|.+|.|||+..... +-+.++|-++
T Consensus 267 g--e~l~tlihHceaVLhlrf~n--g~mvtcSkDrsiaVWdm~sps~------------------it~rrVLvGH----- 319 (499)
T KOG0281|consen 267 G--EPLNTLIHHCEAVLHLRFSN--GYMVTCSKDRSIAVWDMASPTD------------------ITLRRVLVGH----- 319 (499)
T ss_pred C--chhhHHhhhcceeEEEEEeC--CEEEEecCCceeEEEeccCchH------------------HHHHHHHhhh-----
Confidence 3 34444558999999999964 5999999999999999862211 0112233222
Q ss_pred ecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCCcccc--hhhHH
Q 000425 209 YPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGIL--ADHQL 286 (1525)
Q Consensus 209 y~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l--~~~ql 286 (1525)
...|+.|.|..+ ++. +++.|.++|.|...++ .++.++.+|+.-.+++ -..|.+ .+.
T Consensus 320 --rAaVNvVdfd~k--yIV------sASgDRTikvW~~st~--efvRtl~gHkRGIACl--------QYr~rlvVSGS-- 377 (499)
T KOG0281|consen 320 --RAAVNVVDFDDK--YIV------SASGDRTIKVWSTSTC--EFVRTLNGHKRGIACL--------QYRDRLVVSGS-- 377 (499)
T ss_pred --hhheeeeccccc--eEE------EecCCceEEEEeccce--eeehhhhcccccceeh--------hccCeEEEecC--
Confidence 124666777655 444 7788899999988765 5778999998866543 223322 222
Q ss_pred HHHHHhhhccCCCceeeehhhhhcee
Q 000425 287 QAQLQEHHLKGHSHLTISDIARKAFL 312 (1525)
Q Consensus 287 ~~~~~~~~~~~~~~~~I~di~r~~~i 312 (1525)
+|+++++||+.--+++
T Consensus 378 ----------SDntIRlwdi~~G~cL 393 (499)
T KOG0281|consen 378 ----------SDNTIRLWDIECGACL 393 (499)
T ss_pred ----------CCceEEEEeccccHHH
Confidence 6999999996554444
No 80
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.49 E-value=4.1e-13 Score=153.79 Aligned_cols=195 Identities=16% Similarity=0.206 Sum_probs=144.2
Q ss_pred EEcCC-CCEEEEEEeccceEEEEEeecC---------------CccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcE
Q 000425 13 ALTPL-QPVVFFGFHRRMSVTVVGTVEG---------------GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLI 75 (1525)
Q Consensus 13 A~hP~-qp~Vf~s~~~d~sI~vv~tieg---------------gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtI 75 (1525)
...|. +..+..+.+....|.||+..+. .++..++.+ |++.-++|+|+|-.. .+++|..-+.|
T Consensus 158 r~~~~~~~~~~aswse~G~V~Vw~l~~~l~~l~~~~~~~~~s~~~Pl~t~~g-hk~EGy~LdWSp~~~g~LlsGDc~~~I 236 (440)
T KOG0302|consen 158 RVSRLGNEVLCASWSENGRVQVWDLAPHLNALSEPGLEVKDSEFRPLFTFNG-HKGEGYGLDWSPIKTGRLLSGDCVKGI 236 (440)
T ss_pred eecccCCcceeeeecccCcEEEEEchhhhhhhcCccccccccccCceEEecc-cCccceeeecccccccccccCccccce
Confidence 33444 4566777778888888875321 345566666 999999999999543 48888878899
Q ss_pred EEEEcCCCcEE---EEEe-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCC-CceeeEeecccCcCeEEEEE
Q 000425 76 RAYNIHTYAVH---YTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIE-RPSMIGIIQVGSQPITSVAW 149 (1525)
Q Consensus 76 rLWDl~t~k~l---~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH~~~VtsVaf 149 (1525)
++|...++.=. ..|. |..+|. .++|||.. ..|+|||.|++|+|||+++. ...++ ..++|...|+.|+|
T Consensus 237 ~lw~~~~g~W~vd~~Pf~gH~~SVE-----DLqWSptE~~vfaScS~DgsIrIWDiRs~~~~~~~-~~kAh~sDVNVISW 310 (440)
T KOG0302|consen 237 HLWEPSTGSWKVDQRPFTGHTKSVE-----DLQWSPTEDGVFASCSCDGSIRIWDIRSGPKKAAV-STKAHNSDVNVISW 310 (440)
T ss_pred EeeeeccCceeecCccccccccchh-----hhccCCccCceEEeeecCceEEEEEecCCCcccee-EeeccCCceeeEEc
Confidence 99998775322 2344 999999 99999965 57889999999999999975 23333 34799999999999
Q ss_pred cCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCC-CeEEE
Q 000425 150 LPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRL-NLAVL 228 (1525)
Q Consensus 150 sPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~-Nlial 228 (1525)
+-+-++|+||++||+++|||+|..... ...+.| +|....|++++|||.. ..++
T Consensus 311 nr~~~lLasG~DdGt~~iwDLR~~~~~---~pVA~f----------------------k~Hk~pItsieW~p~e~s~ia- 364 (440)
T KOG0302|consen 311 NRREPLLASGGDDGTLSIWDLRQFKSG---QPVATF----------------------KYHKAPITSIEWHPHEDSVIA- 364 (440)
T ss_pred cCCcceeeecCCCceEEEEEhhhccCC---CcceeE----------------------EeccCCeeEEEeccccCceEE-
Confidence 999899999999999999999965431 111222 3444578999999974 3333
Q ss_pred EecccccCCcccccccc
Q 000425 229 LFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 229 ~F~~~s~~Dd~~~k~~~ 245 (1525)
.+++|.++-+|.
T Consensus 365 -----asg~D~QitiWD 376 (440)
T KOG0302|consen 365 -----ASGEDNQITIWD 376 (440)
T ss_pred -----eccCCCcEEEEE
Confidence 556666666676
No 81
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.49 E-value=4.1e-13 Score=158.00 Aligned_cols=153 Identities=10% Similarity=0.114 Sum_probs=128.2
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC-cEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY-AVHY 87 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~-k~l~ 87 (1525)
-.+-|+|..-.+|.+.++++.+.+|+..+.- ....+.+ |++.|+|.+|+|-.+ +++||++||+||+||+++. ..+.
T Consensus 114 ~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~-v~~~l~~-htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~~~v~ 191 (487)
T KOG0310|consen 114 HVTKFSPQDNTMLVSGSDDKVVKYWDLSTAY-VQAELSG-HTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLTSRVV 191 (487)
T ss_pred eEEEecccCCeEEEecCCCceEEEEEcCCcE-EEEEecC-CcceeEeeccccCCCeEEEecCCCceEEEEEeccCCceeE
Confidence 3467899999999999989888888854433 3334555 999999999999866 7899999999999999887 7778
Q ss_pred EEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 88 TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
.+.|..+|. .+.|-|.|..|++++. ..|||||+.++ .+.+.....|...|+|+.+..++..|+|+|.|+.|++
T Consensus 192 elnhg~pVe-----~vl~lpsgs~iasAgG-n~vkVWDl~~G-~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD~~VKV 264 (487)
T KOG0310|consen 192 ELNHGCPVE-----SVLALPSGSLIASAGG-NSVKVWDLTTG-GQLLTSMFNHNKTVTCLRLASDSTRLLSGSLDRHVKV 264 (487)
T ss_pred EecCCCcee-----eEEEcCCCCEEEEcCC-CeEEEEEecCC-ceehhhhhcccceEEEEEeecCCceEeecccccceEE
Confidence 888999999 9999999999988874 58999999975 2233233359999999999999999999999999999
Q ss_pred EEee
Q 000425 168 WKTR 171 (1525)
Q Consensus 168 WDlr 171 (1525)
||+.
T Consensus 265 fd~t 268 (487)
T KOG0310|consen 265 FDTT 268 (487)
T ss_pred EEcc
Confidence 9965
No 82
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.49 E-value=1.8e-13 Score=160.71 Aligned_cols=193 Identities=16% Similarity=0.126 Sum_probs=146.7
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCCcEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
...++|++++...+.++. |..+++|++ |.|+....+.. ...+.|+.|||++ +.|++|+.|+.|+.||+++++.++
T Consensus 261 Vrd~~~s~~g~~fLS~sf-D~~lKlwDt-ETG~~~~~f~~--~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~kvvq 336 (503)
T KOG0282|consen 261 VRDASFNNCGTSFLSASF-DRFLKLWDT-ETGQVLSRFHL--DKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGKVVQ 336 (503)
T ss_pred hhhhhccccCCeeeeeec-ceeeeeecc-ccceEEEEEec--CCCceeeecCCCCCcEEEEecCCCcEEEEeccchHHHH
Confidence 345788888887766664 889999995 77777777655 5678999999999 689999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC----------ceee----------------------
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER----------PSMI---------------------- 134 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~----------~s~i---------------------- 134 (1525)
.+. |...|. .+.|-++|+.++++++|++++||+.+... .++.
T Consensus 337 eYd~hLg~i~-----~i~F~~~g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~i 411 (503)
T KOG0282|consen 337 EYDRHLGAIL-----DITFVDEGRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAI 411 (503)
T ss_pred HHHhhhhhee-----eeEEccCCceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEE
Confidence 998 999999 99999999999999999999999987520 0000
Q ss_pred ------------EeecccCc--CeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccc
Q 000425 135 ------------GIIQVGSQ--PITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRIL 200 (1525)
Q Consensus 135 ------------~il~gH~~--~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iL 200 (1525)
..+.+|.. +-..|.|||||.+|+||+.||.+.+||.++...-. .+
T Consensus 412 fs~~~~~r~nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsdG~v~~wdwkt~kl~~---------------------~l 470 (503)
T KOG0282|consen 412 FSTVPPFRLNKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSDGKVNFWDWKTTKLVS---------------------KL 470 (503)
T ss_pred EecccccccCHhhhhcceeccCceeeEEEcCCCCeEEeecCCccEEEeechhhhhhh---------------------cc
Confidence 01234542 33458899999999999999999999998543210 01
Q ss_pred cccCCcceecCCCeeEEEEecCC-CeEEEEecccccCCccccccc
Q 000425 201 SQQGGEAVYPLPRVRALEVHPRL-NLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 201 s~~gG~~vy~l~~V~~v~~HP~~-Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
..+. ..+..+.|||-. +.++ +++=+..+|+|
T Consensus 471 kah~-------~~ci~v~wHP~e~Skva------t~~w~G~Ikiw 502 (503)
T KOG0282|consen 471 KAHD-------QPCIGVDWHPVEPSKVA------TCGWDGLIKIW 502 (503)
T ss_pred ccCC-------cceEEEEecCCCcceeE------ecccCceeEec
Confidence 1111 135678999974 3444 56667777766
No 83
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.49 E-value=1.3e-12 Score=164.98 Aligned_cols=174 Identities=17% Similarity=0.256 Sum_probs=138.1
Q ss_pred CCCCeEEEEEcCCCCEEEEEE--CCCcEEEEEcCC------------CcEEEEEe-cCCceeecCceeEEEeCCCCEEEE
Q 000425 50 LKKPIVNLACHPRLPVLYVAY--ADGLIRAYNIHT------------YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFV 114 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs--~DgtIrLWDl~t------------~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~S 114 (1525)
+.+.|.+|++||++..|++|+ .|+.++||+... .+.+.++. |...|+ ||.|+|||++|++
T Consensus 12 ~~~~IfSIdv~pdg~~~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~-----CVR~S~dG~~lAs 86 (942)
T KOG0973|consen 12 NEKSIFSIDVHPDGVKFATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVN-----CVRFSPDGSYLAS 86 (942)
T ss_pred CCeeEEEEEecCCceeEecCCccccccceeeccccccchhhhhhcccchhheeeccccCcee-----EEEECCCCCeEee
Confidence 567799999999999999999 999999998642 23455666 999999 9999999999999
Q ss_pred EeCCCcEEEEeCCC----C------------CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCC
Q 000425 115 GDRRGTLLAWDVSI----E------------RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNR 178 (1525)
Q Consensus 115 gS~DgtVrVWDl~t----~------------~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~ 178 (1525)
||+|+.|.||+... . .-+...++.+|...|..++|+|++.+|+|+|.|++|.|||.++...
T Consensus 87 GSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~--- 163 (942)
T KOG0973|consen 87 GSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFEL--- 163 (942)
T ss_pred ccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCCccEEEEecccceEEEEcccccee---
Confidence 99999999999872 0 1123356789999999999999999999999999999999875411
Q ss_pred CCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeec
Q 000425 179 PPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQ 258 (1525)
Q Consensus 179 ~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq 258 (1525)
..++..+.+ -|+.+.|.|.+-++| +-+||.++|+|+..+ +++..++.
T Consensus 164 ------------------~~vl~~H~s-------~VKGvs~DP~Gky~A------SqsdDrtikvwrt~d--w~i~k~It 210 (942)
T KOG0973|consen 164 ------------------LKVLRGHQS-------LVKGVSWDPIGKYFA------SQSDDRTLKVWRTSD--WGIEKSIT 210 (942)
T ss_pred ------------------eeeeecccc-------cccceEECCccCeee------eecCCceEEEEEccc--ceeeEeec
Confidence 111222211 367799999999998 899999999998544 55666666
Q ss_pred cccCCc
Q 000425 259 SARGSS 264 (1525)
Q Consensus 259 ~~~~~~ 264 (1525)
.++..+
T Consensus 211 ~pf~~~ 216 (942)
T KOG0973|consen 211 KPFEES 216 (942)
T ss_pred cchhhC
Confidence 666643
No 84
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.48 E-value=3e-14 Score=172.54 Aligned_cols=155 Identities=14% Similarity=0.247 Sum_probs=139.0
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
...+|.|++.+-.|..|. .+.+|++||. +..+.+..+.+ |...+.+++|||-+.++++|+.|+.+++||++..-|.+
T Consensus 72 pIeSl~f~~~E~Llaags-asgtiK~wDl-eeAk~vrtLtg-h~~~~~sv~f~P~~~~~a~gStdtd~~iwD~Rk~Gc~~ 148 (825)
T KOG0267|consen 72 PIESLTFDTSERLLAAGS-ASGTIKVWDL-EEAKIVRTLTG-HLLNITSVDFHPYGEFFASGSTDTDLKIWDIRKKGCSH 148 (825)
T ss_pred cceeeecCcchhhhcccc-cCCceeeeeh-hhhhhhhhhhc-cccCcceeeeccceEEeccccccccceehhhhccCcee
Confidence 456899999998887777 4889999995 55566666666 99999999999999999999999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
++. |...|+ +++|+|+|+|+++|++|.+++|||.. ..++...+..|...|..+.|||..-++++||.|++|+
T Consensus 149 ~~~s~~~vv~-----~l~lsP~Gr~v~~g~ed~tvki~d~~--agk~~~ef~~~e~~v~sle~hp~e~Lla~Gs~d~tv~ 221 (825)
T KOG0267|consen 149 TYKSHTRVVD-----VLRLSPDGRWVASGGEDNTVKIWDLT--AGKLSKEFKSHEGKVQSLEFHPLEVLLAPGSSDRTVR 221 (825)
T ss_pred eecCCcceeE-----EEeecCCCceeeccCCcceeeeeccc--ccccccccccccccccccccCchhhhhccCCCCceee
Confidence 999 999999 99999999999999999999999998 3555556678999999999999999999999999999
Q ss_pred EEEeee
Q 000425 167 VWKTRV 172 (1525)
Q Consensus 167 VWDlr~ 172 (1525)
+||+++
T Consensus 222 f~dlet 227 (825)
T KOG0267|consen 222 FWDLET 227 (825)
T ss_pred eeccce
Confidence 999984
No 85
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.46 E-value=3.5e-13 Score=148.23 Aligned_cols=155 Identities=12% Similarity=0.214 Sum_probs=129.4
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCCcEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
.+++.+.+..-..|+.++-|.+|++|+.. .......+++ |..-|...+|||.. ++|+++|.|++.++||++..-...
T Consensus 107 V~Svdwn~~~r~~~ltsSWD~TiKLW~~~-r~~Sv~Tf~g-h~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~ 184 (311)
T KOG0277|consen 107 VYSVDWNTVRRRIFLTSSWDGTIKLWDPN-RPNSVQTFNG-HNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPGKFM 184 (311)
T ss_pred eEEeccccccceeEEeeccCCceEeecCC-CCcceEeecC-CccEEEEEecCCCCCCeEEEccCCceEEEEEecCCCcee
Confidence 46788899988888888899999999853 3445555666 99999999999975 589999999999999988654444
Q ss_pred EEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC-CEEEEEECCCC
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML-RLLVTLCRDGS 164 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg-~~LaSgS~Dgt 164 (1525)
.+. |...+. ++.|+. +...+++|+.|+.||+||++.. +..+..+.+|.-.|+.+.|||.. .+|+|||-|.|
T Consensus 185 ~i~ah~~Eil-----~cdw~ky~~~vl~Tg~vd~~vr~wDir~~-r~pl~eL~gh~~AVRkvk~Sph~~~lLaSasYDmT 258 (311)
T KOG0277|consen 185 SIEAHNSEIL-----CCDWSKYNHNVLATGGVDNLVRGWDIRNL-RTPLFELNGHGLAVRKVKFSPHHASLLASASYDMT 258 (311)
T ss_pred EEEeccceeE-----eecccccCCcEEEecCCCceEEEEehhhc-cccceeecCCceEEEEEecCcchhhHhhhccccce
Confidence 477 998999 999988 5567889999999999999975 33344578999999999999964 68999999999
Q ss_pred eeEEEee
Q 000425 165 LQVWKTR 171 (1525)
Q Consensus 165 IkVWDlr 171 (1525)
++|||..
T Consensus 259 ~riw~~~ 265 (311)
T KOG0277|consen 259 VRIWDPE 265 (311)
T ss_pred EEecccc
Confidence 9999975
No 86
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.46 E-value=1.6e-12 Score=161.61 Aligned_cols=151 Identities=14% Similarity=0.155 Sum_probs=128.3
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+.-.+++.--+..+ ++...|.++++|+ ...+.+...+.+ |...|+++... ..++++|+.|.+|++|++.++.++.
T Consensus 251 ~V~~l~~~~~~~~l-vsgS~D~t~rvWd-~~sg~C~~~l~g-h~stv~~~~~~--~~~~~sgs~D~tVkVW~v~n~~~l~ 325 (537)
T KOG0274|consen 251 GVWGLAFPSGGDKL-VSGSTDKTERVWD-CSTGECTHSLQG-HTSSVRCLTID--PFLLVSGSRDNTVKVWDVTNGACLN 325 (537)
T ss_pred CceeEEEecCCCEE-EEEecCCcEEeEe-cCCCcEEEEecC-CCceEEEEEcc--CceEeeccCCceEEEEeccCcceEE
Confidence 34456665545554 4555799999999 566777777776 99999999876 4578889999999999999999999
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
++. |..+|+ ++..+ +..+++|+.|++|+|||+. ..++...+.+|+..|.++.+.+. .++++||.|++|+
T Consensus 326 l~~~h~~~V~-----~v~~~--~~~lvsgs~d~~v~VW~~~--~~~cl~sl~gH~~~V~sl~~~~~-~~~~Sgs~D~~Ik 395 (537)
T KOG0274|consen 326 LLRGHTGPVN-----CVQLD--EPLLVSGSYDGTVKVWDPR--TGKCLKSLSGHTGRVYSLIVDSE-NRLLSGSLDTTIK 395 (537)
T ss_pred EeccccccEE-----EEEec--CCEEEEEecCceEEEEEhh--hceeeeeecCCcceEEEEEecCc-ceEEeeeeccceE
Confidence 999 999999 99887 8899999999999999999 46677778899999999988776 8999999999999
Q ss_pred EEEeeee
Q 000425 167 VWKTRVI 173 (1525)
Q Consensus 167 VWDlr~~ 173 (1525)
+||++..
T Consensus 396 vWdl~~~ 402 (537)
T KOG0274|consen 396 VWDLRTK 402 (537)
T ss_pred eecCCch
Confidence 9999854
No 87
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.46 E-value=3.7e-13 Score=150.72 Aligned_cols=210 Identities=19% Similarity=0.227 Sum_probs=165.3
Q ss_pred cCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 48 TDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 48 ~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
.+|++.|..++.....+.+.+++.|.+-+||.+++++|+.++. |.++|+ +++||+.+..++++|.|++-.||.+
T Consensus 145 ~GHkDGiW~Vaa~~tqpi~gtASADhTA~iWs~Esg~CL~~Y~GH~GSVN-----sikfh~s~~L~lTaSGD~taHIW~~ 219 (481)
T KOG0300|consen 145 EGHKDGIWHVAADSTQPICGTASADHTARIWSLESGACLATYTGHTGSVN-----SIKFHNSGLLLLTASGDETAHIWKA 219 (481)
T ss_pred cccccceeeehhhcCCcceeecccccceeEEeeccccceeeeccccccee-----eEEeccccceEEEccCCcchHHHHH
Confidence 3599999999999889999999999999999999999999999 999999 9999999999999999999999973
Q ss_pred C------CC--------------------------------CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 127 S------IE--------------------------------RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 127 ~------t~--------------------------------~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
. +. .+...-.+.+|...|.++.|-..|..++|+|+|.|..+|
T Consensus 220 av~~~vP~~~a~~~hSsEeE~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltgH~~vV~a~dWL~gg~Q~vTaSWDRTAnlw 299 (481)
T KOG0300|consen 220 AVNWEVPSNNAPSDHSSEEEEEHSDEHNRDTDSSEKSDGHTIRVPLMRLTGHRAVVSACDWLAGGQQMVTASWDRTANLW 299 (481)
T ss_pred hhcCcCCCCCCCCCCCchhhhhcccccccccccccccCCceeeeeeeeeeccccceEehhhhcCcceeeeeeccccceee
Confidence 1 00 000012357899999999999999999999999999999
Q ss_pred EeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccccc
Q 000425 169 KTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTRE 248 (1525)
Q Consensus 169 Dlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~e 248 (1525)
|+.+..- -.+|.++. ..++.+..||+.-|+. +++-|+.-+.|..++
T Consensus 300 DVEtge~---------------------v~~LtGHd-------~ELtHcstHptQrLVv------TsSrDtTFRLWDFRe 345 (481)
T KOG0300|consen 300 DVETGEV---------------------VNILTGHD-------SELTHCSTHPTQRLVV------TSSRDTTFRLWDFRE 345 (481)
T ss_pred eeccCce---------------------eccccCcc-------hhccccccCCcceEEE------EeccCceeEeccchh
Confidence 9873211 11233333 2467899999999987 888899999999997
Q ss_pred CceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhh-hceeee
Q 000425 249 GRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIAR-KAFLYS 314 (1525)
Q Consensus 249 g~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r-~~~i~s 314 (1525)
.. +.+.++|||...+.|+.+.. +. .++.+. .|+++.|||+.+ |.++-|
T Consensus 346 aI-~sV~VFQGHtdtVTS~vF~~----dd-~vVSgS------------DDrTvKvWdLrNMRsplAT 394 (481)
T KOG0300|consen 346 AI-QSVAVFQGHTDTVTSVVFNT----DD-RVVSGS------------DDRTVKVWDLRNMRSPLAT 394 (481)
T ss_pred hc-ceeeeecccccceeEEEEec----CC-ceeecC------------CCceEEEeeeccccCccee
Confidence 64 55679999999987754331 21 233333 799999999755 444443
No 88
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.46 E-value=3.9e-13 Score=158.16 Aligned_cols=217 Identities=16% Similarity=0.134 Sum_probs=150.6
Q ss_pred eEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC----cEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeC
Q 000425 44 TKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY----AVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDR 117 (1525)
Q Consensus 44 tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~----k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~ 117 (1525)
..+++ |++.|.++++.|.|..|++|+.|.+|++||+... +..+.+. ..+.|+ +++|++.|..|++.+.
T Consensus 161 i~l~h-gtk~Vsal~~Dp~GaR~~sGs~Dy~v~~wDf~gMdas~~~fr~l~P~E~h~i~-----sl~ys~Tg~~iLvvsg 234 (641)
T KOG0772|consen 161 IQLKH-GTKIVSALAVDPSGARFVSGSLDYTVKFWDFQGMDASMRSFRQLQPCETHQIN-----SLQYSVTGDQILVVSG 234 (641)
T ss_pred EeccC-CceEEEEeeecCCCceeeeccccceEEEEecccccccchhhhccCcccccccc-----eeeecCCCCeEEEEec
Confidence 34555 8999999999999999999999999999998754 3334444 457889 9999999999999999
Q ss_pred CCcEEEEeCCCC-Ccee------e---EeecccCcCeEEEEEcCCC-CEEEEEECCCCeeEEEeeeecCCCCCCCccccc
Q 000425 118 RGTLLAWDVSIE-RPSM------I---GIIQVGSQPITSVAWLPML-RLLVTLCRDGSLQVWKTRVIINPNRPPMQANFF 186 (1525)
Q Consensus 118 DgtVrVWDl~t~-~~s~------i---~il~gH~~~VtsVafsPdg-~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f 186 (1525)
....+|+|-... ...+ + ....||...++|.+|||+. ..|+|||.|+++|+||+..... +...|
T Consensus 235 ~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~------q~qVi 308 (641)
T KOG0772|consen 235 SAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKS------QLQVI 308 (641)
T ss_pred CcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchh------heeEE
Confidence 999999997742 1111 1 2356999999999999974 5799999999999999873321 00010
Q ss_pred cCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccccc-CceeeeeeeccccCCcc
Q 000425 187 EPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTRE-GRKQLFAVLQSARGSSA 265 (1525)
Q Consensus 187 ~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~e-g~~q~~~tlq~~~~~~~ 265 (1525)
-+ . ..+|..+ .++.++|.|.++++| .+..|..+.+|..+. +..--+.+=+.|..- .
T Consensus 309 k~----------k--~~~g~Rv----~~tsC~~nrdg~~iA------agc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g-~ 365 (641)
T KOG0772|consen 309 KT----------K--PAGGKRV----PVTSCAWNRDGKLIA------AGCLDGSIQIWDKGSRTVRPVMKVKDAHLPG-Q 365 (641)
T ss_pred ee----------c--cCCCccc----CceeeecCCCcchhh------hcccCCceeeeecCCcccccceEeeeccCCC-C
Confidence 11 0 1122221 367899999999998 788888888887532 222223344445441 1
Q ss_pred cchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhh
Q 000425 266 SVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIAR 308 (1525)
Q Consensus 266 s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r 308 (1525)
.+.+-.|+++|. +|.+.. .|..+.+|||-+
T Consensus 366 ~Itsi~FS~dg~-------~LlSRg------~D~tLKvWDLrq 395 (641)
T KOG0772|consen 366 DITSISFSYDGN-------YLLSRG------FDDTLKVWDLRQ 395 (641)
T ss_pred ceeEEEeccccc-------hhhhcc------CCCceeeeeccc
Confidence 223355666665 122222 699999999644
No 89
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.45 E-value=1.4e-12 Score=152.42 Aligned_cols=162 Identities=19% Similarity=0.250 Sum_probs=124.6
Q ss_pred ccCCcEEEEEEcCCCC-EEEEEEeccceEEEEEee--cCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEE
Q 000425 4 ISVDTEVHLALTPLQP-VVFFGFHRRMSVTVVGTV--EGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYN 79 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp-~Vf~s~~~d~sI~vv~ti--eggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWD 79 (1525)
+-....++++|||+.+ -++.+.+....|-+|+.. +....-..+...|..+|.++.|+|... .+++.|+||+||+-|
T Consensus 184 v~~~Rit~l~fHPt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~~D 263 (498)
T KOG4328|consen 184 VTDRRITSLAFHPTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSYDGTIRLQD 263 (498)
T ss_pred ecccceEEEEecccCcceEEEEccCCCcEEEEecCCCCCccCceEEeccCCccccceEecCCChhheeeeccCceeeeee
Confidence 3345788999999998 666677788899999973 334444556667999999999999875 799999999999988
Q ss_pred cCCCc-------------------------EE--------------------EEEe-cCCceeecCceeEEEeCCCC-EE
Q 000425 80 IHTYA-------------------------VH--------------------YTLQ-LDNTIKLLGAGAFAFHPTLE-WL 112 (1525)
Q Consensus 80 l~t~k-------------------------~l--------------------~tL~-H~~~V~~~ga~sVaFSPdg~-~L 112 (1525)
+++.. ++ ..+. |...|+ +|+++|..+ +|
T Consensus 264 ~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~s~~~~~~lh~kKI~-----sv~~NP~~p~~l 338 (498)
T KOG4328|consen 264 FEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRTDGSEYENLRLHKKKIT-----SVALNPVCPWFL 338 (498)
T ss_pred ecchhhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeecCCccchhhhhhhcccc-----eeecCCCCchhe
Confidence 75320 00 0111 334455 999999776 57
Q ss_pred EEEeCCCcEEEEeCCCC--Cce-eeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 113 FVGDRRGTLLAWDVSIE--RPS-MIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 113 ~SgS~DgtVrVWDl~t~--~~s-~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+++|.|++.+|||++.. ..+ ++ ..-.|...|+++.|||++-.|+|.+.|.+|+|||..
T Consensus 339 aT~s~D~T~kIWD~R~l~~K~sp~l-st~~HrrsV~sAyFSPs~gtl~TT~~D~~IRv~dss 399 (498)
T KOG4328|consen 339 ATASLDQTAKIWDLRQLRGKASPFL-STLPHRRSVNSAYFSPSGGTLLTTCQDNEIRVFDSS 399 (498)
T ss_pred eecccCcceeeeehhhhcCCCCcce-ecccccceeeeeEEcCCCCceEeeccCCceEEeecc
Confidence 78999999999999973 222 22 234799999999999988889999999999999995
No 90
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.45 E-value=1.2e-11 Score=141.73 Aligned_cols=135 Identities=19% Similarity=0.332 Sum_probs=111.6
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
.|+..|.+++.||+.+++++|+.|-.-.||++.++.....+. |.++|. ++.|+.||.+|++|+.+|.|+||+..
T Consensus 62 ~H~~svFavsl~P~~~l~aTGGgDD~AflW~~~~ge~~~eltgHKDSVt-----~~~FshdgtlLATGdmsG~v~v~~~s 136 (399)
T KOG0296|consen 62 KHTDSVFAVSLHPNNNLVATGGGDDLAFLWDISTGEFAGELTGHKDSVT-----CCSFSHDGTLLATGDMSGKVLVFKVS 136 (399)
T ss_pred hcCCceEEEEeCCCCceEEecCCCceEEEEEccCCcceeEecCCCCceE-----EEEEccCceEEEecCCCccEEEEEcc
Confidence 499999999999999999999999999999999999988998 999999 99999999999999999999999999
Q ss_pred CCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee-----ecCCCCCCCccccccCCC
Q 000425 128 IERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV-----IINPNRPPMQANFFEPAS 190 (1525)
Q Consensus 128 t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~-----~~~p~~~~~~a~~f~p~g 190 (1525)
+....+. +..-...+.=+.|||.++.|+.|+.||.+.+|.+.. ....+..+.....|.|+|
T Consensus 137 tg~~~~~--~~~e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~~~~kv~~Gh~~~ct~G~f~pdG 202 (399)
T KOG0296|consen 137 TGGEQWK--LDQEVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQALCKVMSGHNSPCTCGEFIPDG 202 (399)
T ss_pred cCceEEE--eecccCceEEEEecccccEEEeecCCCcEEEEECCCcceeeEecCCCCCcccccccCCC
Confidence 7533332 223345667789999999999999999999999863 223333344444455644
No 91
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.45 E-value=4.4e-13 Score=163.79 Aligned_cols=155 Identities=19% Similarity=0.331 Sum_probs=125.2
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC-CCEEEEEECCCcEEEEEcCCC-cEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR-LPVLYVAYADGLIRAYNIHTY-AVH 86 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd-g~~LaSGs~DgtIrLWDl~t~-k~l 86 (1525)
.=.+.||++.|.++.+...|..|++|+..+..... ..++ -...|+.|.|+|. +..|+++.+.|.+++||++.. ++.
T Consensus 136 ~~~ldfh~tep~iliSGSQDg~vK~~DlR~~~S~~-t~~~-nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp~r~~ 213 (839)
T KOG0269|consen 136 ANKLDFHSTEPNILISGSQDGTVKCWDLRSKKSKS-TFRS-NSESIRDVKFSPGYGNKFASIHDSGYLQLWDLRQPDRCE 213 (839)
T ss_pred eeeeeeccCCccEEEecCCCceEEEEeeecccccc-cccc-cchhhhceeeccCCCceEEEecCCceEEEeeccCchhHH
Confidence 34689999999999999999999999976553322 2233 6789999999996 457999999999999999865 566
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCC-EEEEEEC--C
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLR-LLVTLCR--D 162 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS~--D 162 (1525)
+.|. |.++|. |+.|||++.||++||+|++|+|||+.+.......+. .-..+|.+|.|-|..+ .|+|||. |
T Consensus 214 ~k~~AH~GpV~-----c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~~~~tI-nTiapv~rVkWRP~~~~hLAtcsmv~d 287 (839)
T KOG0269|consen 214 KKLTAHNGPVL-----CLNWHPNREWLATGGRDKMVKIWDMTDSRAKPKHTI-NTIAPVGRVKWRPARSYHLATCSMVVD 287 (839)
T ss_pred HHhhcccCceE-----EEeecCCCceeeecCCCccEEEEeccCCCccceeEE-eecceeeeeeeccCccchhhhhhcccc
Confidence 7777 999999 999999999999999999999999996422221111 2256889999999866 4777764 7
Q ss_pred CCeeEEEee
Q 000425 163 GSLQVWKTR 171 (1525)
Q Consensus 163 gtIkVWDlr 171 (1525)
..|+|||+|
T Consensus 288 tsV~VWDvr 296 (839)
T KOG0269|consen 288 TSVHVWDVR 296 (839)
T ss_pred ceEEEEeec
Confidence 799999998
No 92
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.45 E-value=3.1e-13 Score=151.42 Aligned_cols=171 Identities=14% Similarity=0.210 Sum_probs=134.0
Q ss_pred EccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE--------e-cCCceeecCceeEEEeCCCCEEEEEe
Q 000425 46 IKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL--------Q-LDNTIKLLGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 46 lk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL--------~-H~~~V~~~ga~sVaFSPdg~~L~SgS 116 (1525)
++.+.+..+.|..|+|||+++++|+.||.|.+||+.+++.-+.+ - ++..|. |++||.|...+++|+
T Consensus 208 IKFg~KSh~EcA~FSPDgqyLvsgSvDGFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVl-----ci~FSRDsEMlAsGs 282 (508)
T KOG0275|consen 208 IKFGQKSHVECARFSPDGQYLVSGSVDGFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVL-----CISFSRDSEMLASGS 282 (508)
T ss_pred eecccccchhheeeCCCCceEeeccccceeeeehhccchhhhhhhhhhhcceeecccceE-----EEeecccHHHhhccC
Confidence 44456778899999999999999999999999999988765443 3 677888 999999999999999
Q ss_pred CCCcEEEEeCCCCCceeeEee-cccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCccccc
Q 000425 117 RRGTLLAWDVSIERPSMIGII-QVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESID 195 (1525)
Q Consensus 117 ~DgtVrVWDl~t~~~s~i~il-~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesld 195 (1525)
.||.|+||.+++. .|..-+ .+|+..|+|+.|+.|+..++|+|-|.++++.-++....
T Consensus 283 qDGkIKvWri~tG--~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~-------------------- 340 (508)
T KOG0275|consen 283 QDGKIKVWRIETG--QCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKC-------------------- 340 (508)
T ss_pred cCCcEEEEEEecc--hHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchh--------------------
Confidence 9999999999964 333222 38999999999999999999999999999987762211
Q ss_pred ccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeecc
Q 000425 196 IPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQS 259 (1525)
Q Consensus 196 i~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~ 259 (1525)
+....|..-| |+.+.|.+.++-+. +++.|..+|+|...++. |..|++.
T Consensus 341 ----LKEfrGHsSy----vn~a~ft~dG~~ii------saSsDgtvkvW~~Ktte--C~~Tfk~ 388 (508)
T KOG0275|consen 341 ----LKEFRGHSSY----VNEATFTDDGHHII------SASSDGTVKVWHGKTTE--CLSTFKP 388 (508)
T ss_pred ----HHHhcCcccc----ccceEEcCCCCeEE------EecCCccEEEecCcchh--hhhhccC
Confidence 1111111122 67788888888877 88889999999977663 4455554
No 93
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.44 E-value=1.6e-12 Score=142.47 Aligned_cols=190 Identities=17% Similarity=0.123 Sum_probs=137.6
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC---cEEEEEe-cCCceeecCceeEEE-eC-CCCEEEEEeCC
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY---AVHYTLQ-LDNTIKLLGAGAFAF-HP-TLEWLFVGDRR 118 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~---k~l~tL~-H~~~V~~~ga~sVaF-SP-dg~~L~SgS~D 118 (1525)
++.+.|++.|-.+..+--|..|+++|.|++|||+..+.. +++.+|. |..+|. .++| || -|..|+|++.|
T Consensus 5 ~idt~H~D~IHda~lDyygkrlATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVw-----qv~wahPk~G~iLAScsYD 79 (299)
T KOG1332|consen 5 TIDTQHEDMIHDAQLDYYGKRLATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVW-----KVAWAHPKFGTILASCSYD 79 (299)
T ss_pred ehhhhhhhhhhHhhhhhhcceeeeecCCccEEEEEEcCCCCceeeeEecCCCCCee-----EEeecccccCcEeeEeecC
Confidence 344569998888877778999999999999999998765 4677888 999999 9999 55 68999999999
Q ss_pred CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC--CCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccc
Q 000425 119 GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM--LRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDI 196 (1525)
Q Consensus 119 gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd--g~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi 196 (1525)
|.|.||.....+-+-...+..|...|++|+|-|+ |-.|+++|.||+|.|.+.+.- ..........+ ++-
T Consensus 80 gkVIiWke~~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~-g~w~t~ki~~a-H~~------- 150 (299)
T KOG1332|consen 80 GKVIIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSS-GGWTTSKIVFA-HEI------- 150 (299)
T ss_pred ceEEEEecCCCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCC-CCccchhhhhc-ccc-------
Confidence 9999999886422222336689999999999996 568999999999999998733 10000000000 111
Q ss_pred cccccccCCcceecCCCeeEEEEecC---CCeEE-----EEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 197 PRILSQQGGEAVYPLPRVRALEVHPR---LNLAV-----LLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 197 ~~iLs~~gG~~vy~l~~V~~v~~HP~---~Nlia-----l~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
+++.++|-|. +.++. -+.---+++-|..+|+|+..++.+.+-.+|++|.+.+
T Consensus 151 ----------------GvnsVswapa~~~g~~~~~~~~~~~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~dwV 210 (299)
T KOG1332|consen 151 ----------------GVNSVSWAPASAPGSLVDQGPAAKVKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKDWV 210 (299)
T ss_pred ----------------ccceeeecCcCCCccccccCcccccceeeccCCccceeeeecCCcchhhhhhhhhcchhh
Confidence 3444555544 11110 0000017888888999999988888888899999976
No 94
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.44 E-value=7.3e-10 Score=134.90 Aligned_cols=506 Identities=14% Similarity=0.151 Sum_probs=269.7
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCcc-ceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRA-PTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~-~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
..+++|++-....+.++- .+.+|.+|+... ++- ...+.++-...|.+++|.+. ..|++.+.+|+|.-||+.+.++.
T Consensus 27 ~I~slA~s~kS~~lAvsR-t~g~IEiwN~~~-~w~~~~vi~g~~drsIE~L~W~e~-~RLFS~g~sg~i~EwDl~~lk~~ 103 (691)
T KOG2048|consen 27 EIVSLAYSHKSNQLAVSR-TDGNIEIWNLSN-NWFLEPVIHGPEDRSIESLAWAEG-GRLFSSGLSGSITEWDLHTLKQK 103 (691)
T ss_pred ceEEEEEeccCCceeeec-cCCcEEEEccCC-CceeeEEEecCCCCceeeEEEccC-CeEEeecCCceEEEEecccCcee
Confidence 567889988777765554 578888888543 443 34455544567999999954 45677788999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.+. ....+. +++.+|.+..++.|++||.+..++.....-.....+....++|.+++|+|++..|++||.||.|
T Consensus 104 ~~~d~~gg~IW-----siai~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg~I 178 (691)
T KOG2048|consen 104 YNIDSNGGAIW-----SIAINPENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDGVI 178 (691)
T ss_pred EEecCCCccee-----EEEeCCccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCceE
Confidence 9999 778888 9999999999999999997777777743222223445567899999999999999999999999
Q ss_pred eEEEeeeecCCCCCCCccccccCCCcccccccccccccC-CcceecCCCeeEEEEecCCCeEEEEecccccCCccccccc
Q 000425 166 QVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQG-GEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 166 kVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~g-G~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
++||......-+...| -+...+ ++.. -||.+.|- ..+.++ ++.....++.|
T Consensus 179 riwd~~~~~t~~~~~~-----------------~~d~l~k~~~~----iVWSv~~L-rd~tI~------sgDS~G~V~FW 230 (691)
T KOG2048|consen 179 RIWDVKSGQTLHIITM-----------------QLDRLSKREPT----IVWSVLFL-RDSTIA------SGDSAGTVTFW 230 (691)
T ss_pred EEEEcCCCceEEEeee-----------------cccccccCCce----EEEEEEEe-ecCcEE------EecCCceEEEE
Confidence 9999873211000000 011111 0111 25666666 334444 45556678888
Q ss_pred ccccCceeeeeeeccccCCcccch----hhhcccCCCcccchhhHHHHH---HHhhhccC--CCceeeehhhhhceeeee
Q 000425 245 YTREGRKQLFAVLQSARGSSASVL----KEKLSSMGSSGILADHQLQAQ---LQEHHLKG--HSHLTISDIARKAFLYSH 315 (1525)
Q Consensus 245 ~~~eg~~q~~~tlq~~~~~~~s~~----~e~~~slgs~g~l~~~ql~~~---~~~~~~~~--~~~~~I~di~r~~~i~s~ 315 (1525)
....|- +.+.++.|...+-++- -.++.+-|-.+.+..|++... |..-.... .++++-++.- ...++|+
T Consensus 231 d~~~gT--LiqS~~~h~adVl~Lav~~~~d~vfsaGvd~~ii~~~~~~~~~~wv~~~~r~~h~hdvrs~av~-~~~l~sg 307 (691)
T KOG2048|consen 231 DSIFGT--LIQSHSCHDADVLALAVADNEDRVFSAGVDPKIIQYSLTTNKSEWVINSRRDLHAHDVRSMAVI-ENALISG 307 (691)
T ss_pred cccCcc--hhhhhhhhhcceeEEEEcCCCCeEEEccCCCceEEEEecCCccceeeeccccCCcccceeeeee-cceEEec
Confidence 877661 2233333333321110 133344455555554433211 21110000 0111111000 0133333
Q ss_pred ecccccccCccccCcceeeeccccccCCCcccccccccccccccccccccccceeeEE-e-CceEEEeecCCCc------
Q 000425 316 FMEGHAKSAPISRLPLITIFDSKHQLKDIPVCQPFHLELNFFNRENRVLHYPVRAFYV-D-GINLVAYNLCSGA------ 387 (1525)
Q Consensus 316 ~~d~~~k~~~~~~~p~~~i~~~~~~~~~~~vv~~~~~e~~~f~~~~~v~~~~~~~~y~-~-~~~l~~~~l~~g~------ 387 (1525)
=+|.+..+ . -+++|+ .+.|...+.|....++..-+.+.+|+ + ...+.-|-+.+..
T Consensus 308 G~d~~l~i-----------~----~s~~~~--~~~h~~~~~~p~~~~v~~a~~~~L~~~w~~h~v~lwrlGS~~~~g~~~ 370 (691)
T KOG2048|consen 308 GRDFTLAI-----------C----SSREFK--NMDHRQKNLFPASDRVSVAPENRLLVLWKAHGVDLWRLGSVILQGEYN 370 (691)
T ss_pred ceeeEEEE-----------c----cccccC--chhhhccccccccceeecCccceEEEEeccccccceeccCcccccccC
Confidence 22322221 0 122222 34566556677766666555543333 3 3444444444441
Q ss_pred cccccccccccCCCeeeccccccccCCccEEEEEeeccCCcceEEEEEecccc--c--cccc-CCCCcccceeEEeCCCC
Q 000425 388 DSIYRKLYSTIPGTVEYYPKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDT--Q--LADS-KSSTVKGRDAAFIGPNE 462 (1525)
Q Consensus 388 ~~~~~~L~~~~~G~~e~~pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l--~--~rnk-~G~~~~g~~~~fig~~~ 462 (1525)
+.-+.+|.+++.-+.- .-.-||.|..+++-- -.++-||+.+-+- . .+.+ .-....+.-..| .-++
T Consensus 371 ~~~Llkl~~k~~~nIs----~~aiSPdg~~Ia~st-----~~~~~iy~L~~~~~vk~~~v~~~~~~~~~a~~i~f-tid~ 440 (691)
T KOG2048|consen 371 YIHLLKLFTKEKENIS----CAAISPDGNLIAIST-----VSRTKIYRLQPDPNVKVINVDDVPLALLDASAISF-TIDK 440 (691)
T ss_pred hhhheeeecCCcccee----eeccCCCCCEEEEee-----ccceEEEEeccCcceeEEEeccchhhhccceeeEE-EecC
Confidence 1112222222211100 113356666554432 2456677643211 0 0111 001122333344 2233
Q ss_pred CcEEEEeCCCCeEEEEEcCCchhhhhhcccCCcccCCCccccccccccCCcccccccccceeeecCCcceeEEeecCCeE
Q 000425 463 DQFAILDDDKTGLALYILKGVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESEVDRIFSTPIESTLMFACDGDQI 542 (1525)
Q Consensus 463 n~yAvle~~~~~v~I~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~If~~pl~~~l~~~~~~~~i 542 (1525)
|...+.-.+-..+.++.++....+++ ++.+. ..+ ...+.+|+..|-+.-+...++.+.|
T Consensus 441 ~k~~~~s~~~~~le~~el~~ps~kel-~~~~~-------------~~~-------~~~I~~l~~SsdG~yiaa~~t~g~I 499 (691)
T KOG2048|consen 441 NKLFLVSKNIFSLEEFELETPSFKEL-KSIQS-------------QAK-------CPSISRLVVSSDGNYIAAISTRGQI 499 (691)
T ss_pred ceEEEEecccceeEEEEecCcchhhh-hcccc-------------ccC-------CCcceeEEEcCCCCEEEEEeccceE
Confidence 44433333445788888887777665 33221 001 1347888888777755544443444
Q ss_pred EEEeeccccccccccccccccccccceeeEeeccceEEEEEEcccCCccEEEEEeeeeEEEEccchhhhhcccc
Q 000425 543 GMAKLVQGYRLSARAGHYLQTKSEGKKSIKLKVTEVMLKVAWQETQRGYVAGVLTTQRVLIVSADLDILASSST 616 (1525)
Q Consensus 543 ~~~~~~~~yd~~~~~~~~i~~~~e~~r~i~~~~~e~v~~V~Ws~~~~g~~vaI~t~~~i~i~~~~l~~l~~~~~ 616 (1525)
.. |++++...+ -+....+-.|+-..-+...++.+|..-+++.+|-++-+...|...+.
T Consensus 500 ~v------~nl~~~~~~----------~l~~rln~~vTa~~~~~~~~~~lvvats~nQv~efdi~~~~l~~ws~ 557 (691)
T KOG2048|consen 500 FV------YNLETLESH----------LLKVRLNIDVTAAAFSPFVRNRLVVATSNNQVFEFDIEARNLTRWSK 557 (691)
T ss_pred EE------EEcccceee----------cchhccCcceeeeeccccccCcEEEEecCCeEEEEecchhhhhhhhh
Confidence 43 455555222 11222222244444343345666766777788877776665554443
No 95
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.43 E-value=1.1e-12 Score=158.05 Aligned_cols=144 Identities=15% Similarity=0.173 Sum_probs=119.0
Q ss_pred EEEEEeccceEEEEEeecCC-ccceEEccCCCCCeEEEEE-cCCCCEEEEEECCCcEEEEEcCCC--cEEEEEe------
Q 000425 21 VFFGFHRRMSVTVVGTVEGG-RAPTKIKTDLKKPIVNLAC-HPRLPVLYVAYADGLIRAYNIHTY--AVHYTLQ------ 90 (1525)
Q Consensus 21 Vf~s~~~d~sI~vv~tiegg-k~~tklk~~H~~~V~sLaf-~Pdg~~LaSGs~DgtIrLWDl~t~--k~l~tL~------ 90 (1525)
.|.+.+.|.+|++|+-..++ .+...+. .|++.|.|+++ -++..++|+|+.|+.|.+||+.++ +.+.++.
T Consensus 87 tlIS~SsDtTVK~W~~~~~~~~c~stir-~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~s 165 (735)
T KOG0308|consen 87 TLISASSDTTVKVWNAHKDNTFCMSTIR-THKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTATLVASFNNVTVNS 165 (735)
T ss_pred ceEEecCCceEEEeecccCcchhHhhhh-cccchheeeeecccCceeEEecCCCccEEEEEccCcchhhhhhcccccccc
Confidence 45566679999999976665 4444455 49999999999 778889999999999999999977 3333333
Q ss_pred ----cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 91 ----LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 91 ----H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
+..+|. +++.++.|..|++|+..+-+++||.++..+.+. +.||+..|+++..++||..++|+|.||+|+
T Consensus 166 l~sG~k~siY-----SLA~N~t~t~ivsGgtek~lr~wDprt~~kimk--LrGHTdNVr~ll~~dDGt~~ls~sSDgtIr 238 (735)
T KOG0308|consen 166 LGSGPKDSIY-----SLAMNQTGTIIVSGGTEKDLRLWDPRTCKKIMK--LRGHTDNVRVLLVNDDGTRLLSASSDGTIR 238 (735)
T ss_pred CCCCCcccee-----eeecCCcceEEEecCcccceEEeccccccceee--eeccccceEEEEEcCCCCeEeecCCCceEE
Confidence 334555 999999999999999999999999997544443 669999999999999999999999999999
Q ss_pred EEEeee
Q 000425 167 VWKTRV 172 (1525)
Q Consensus 167 VWDlr~ 172 (1525)
+||++.
T Consensus 239 lWdLgq 244 (735)
T KOG0308|consen 239 LWDLGQ 244 (735)
T ss_pred eeeccc
Confidence 999973
No 96
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.43 E-value=2.1e-12 Score=145.62 Aligned_cols=114 Identities=21% Similarity=0.339 Sum_probs=105.3
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCC--EEEEEeCCCcEEEEe
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLE--WLFVGDRRGTLLAWD 125 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~--~L~SgS~DgtVrVWD 125 (1525)
.|.++|+++|++ ++++|+|+.|.+|+|||+++.+.+..+- |...++ ++.|.++.. +|++|++||.|.+|+
T Consensus 41 aH~~sitavAVs--~~~~aSGssDetI~IYDm~k~~qlg~ll~Hagsit-----aL~F~~~~S~shLlS~sdDG~i~iw~ 113 (362)
T KOG0294|consen 41 AHAGSITALAVS--GPYVASGSSDETIHIYDMRKRKQLGILLSHAGSIT-----ALKFYPPLSKSHLLSGSDDGHIIIWR 113 (362)
T ss_pred ccccceeEEEec--ceeEeccCCCCcEEEEeccchhhhcceeccccceE-----EEEecCCcchhheeeecCCCcEEEEE
Confidence 499999999997 8999999999999999999999999988 999999 999998765 999999999999999
Q ss_pred CCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 126 VSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 126 l~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
... ..+...+.+|...|+.++.||.+++-+|.+.|+.+++||+-
T Consensus 114 ~~~--W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV 157 (362)
T KOG0294|consen 114 VGS--WELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQVLRTWNLV 157 (362)
T ss_pred cCC--eEEeeeecccccccceeEecCCCceEEEEcCCceeeeehhh
Confidence 995 35556678999999999999999999999999999999984
No 97
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.43 E-value=5e-12 Score=142.54 Aligned_cols=224 Identities=18% Similarity=0.219 Sum_probs=159.9
Q ss_pred ccccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC
Q 000425 2 ESISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 2 ~~~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~ 81 (1525)
+..+.|+...+.|+|....+++++ .|.++++++.... ..+++..|..++.+++|.+ ...+++|+.||.||.+|+.
T Consensus 9 ~npP~d~IS~v~f~~~~~~LLvss-WDgslrlYdv~~~---~l~~~~~~~~plL~c~F~d-~~~~~~G~~dg~vr~~Dln 83 (323)
T KOG1036|consen 9 ENPPEDGISSVKFSPSSSDLLVSS-WDGSLRLYDVPAN---SLKLKFKHGAPLLDCAFAD-ESTIVTGGLDGQVRRYDLN 83 (323)
T ss_pred CCCChhceeeEEEcCcCCcEEEEe-ccCcEEEEeccch---hhhhheecCCceeeeeccC-CceEEEeccCceEEEEEec
Confidence 356788999999999999999888 6899999885332 2333445999999999997 5678999999999999999
Q ss_pred CCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 82 TYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 82 t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
+++......|...++ |+.+++....+++||+|++|++||.+. ....+.++. ...|.|++. .+..|+.|+.
T Consensus 84 ~~~~~~igth~~~i~-----ci~~~~~~~~vIsgsWD~~ik~wD~R~--~~~~~~~d~-~kkVy~~~v--~g~~LvVg~~ 153 (323)
T KOG1036|consen 84 TGNEDQIGTHDEGIR-----CIEYSYEVGCVISGSWDKTIKFWDPRN--KVVVGTFDQ-GKKVYCMDV--SGNRLVVGTS 153 (323)
T ss_pred CCcceeeccCCCceE-----EEEeeccCCeEEEcccCccEEEEeccc--ccccccccc-CceEEEEec--cCCEEEEeec
Confidence 988776666999999 999999888999999999999999994 333322222 346777665 4678899999
Q ss_pred CCCeeEEEeeeecCCCC-------CCCccccccCC--C-----------ccccccccc------------ccccCCccee
Q 000425 162 DGSLQVWKTRVIINPNR-------PPMQANFFEPA--S-----------IESIDIPRI------------LSQQGGEAVY 209 (1525)
Q Consensus 162 DgtIkVWDlr~~~~p~~-------~~~~a~~f~p~--g-----------Vesldi~~i------------Ls~~gG~~vy 209 (1525)
|..|.+||+|....+.. ..+....+.|. | ++.+|-..- ....+-+.+|
T Consensus 154 ~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSieGRVavE~~d~s~~~~skkyaFkCHr~~~~~~~~~y 233 (323)
T KOG1036|consen 154 DRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIEGRVAVEYFDDSEEAQSKKYAFKCHRLSEKDTEIIY 233 (323)
T ss_pred CceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeecceEEEEccCCchHHhhhceeEEeeecccCCceEEE
Confidence 99999999995543320 01111111221 1 222222200 0111223355
Q ss_pred cCCCeeEEEEecCCCeEEEEecccccCCcccccccccccC
Q 000425 210 PLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREG 249 (1525)
Q Consensus 210 ~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg 249 (1525)
| |++++|||-....+ +++.|..+-.|.....
T Consensus 234 P---VNai~Fhp~~~tfa------TgGsDG~V~~Wd~~~r 264 (323)
T KOG1036|consen 234 P---VNAIAFHPIHGTFA------TGGSDGIVNIWDLFNR 264 (323)
T ss_pred E---eceeEeccccceEE------ecCCCceEEEccCcch
Confidence 4 69999999877777 8888888888875543
No 98
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.43 E-value=3.6e-12 Score=155.34 Aligned_cols=194 Identities=18% Similarity=0.281 Sum_probs=154.5
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeec----CCccceEEcc------CCCCCeEEEEEcCCCCEEEEEECCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVE----GGRAPTKIKT------DLKKPIVNLACHPRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tie----ggk~~tklk~------~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLW 78 (1525)
.-.|+..|++....+++ .|.+|++|+..- .+.....++- +.++.|.|+.++||+.+++++-.|.+|+||
T Consensus 457 IWsi~~~pD~~g~vT~s-aDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVy 535 (888)
T KOG0306|consen 457 IWSISLSPDNKGFVTGS-ADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVY 535 (888)
T ss_pred eeeeeecCCCCceEEec-CCcEEEEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEE
Confidence 45789999999887777 489999998621 1211111221 245789999999999999999999999999
Q ss_pred EcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC--CceeeEeecccCcCeEEEEEcCCCCE
Q 000425 79 NIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE--RPSMIGIIQVGSQPITSVAWLPMLRL 155 (1525)
Q Consensus 79 Dl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~--~~s~i~il~gH~~~VtsVafsPdg~~ 155 (1525)
-+.+.+...++- |.-+|. |++.|||++.+++||.|++|+||-++-+ .+++. +|...|+++.|-|+...
T Consensus 536 flDtlKFflsLYGHkLPV~-----smDIS~DSklivTgSADKnVKiWGLdFGDCHKS~f----AHdDSvm~V~F~P~~~~ 606 (888)
T KOG0306|consen 536 FLDTLKFFLSLYGHKLPVL-----SMDISPDSKLIVTGSADKNVKIWGLDFGDCHKSFF----AHDDSVMSVQFLPKTHL 606 (888)
T ss_pred Eecceeeeeeeccccccee-----EEeccCCcCeEEeccCCCceEEeccccchhhhhhh----cccCceeEEEEccccee
Confidence 999998888888 999999 9999999999999999999999977743 44555 99999999999999999
Q ss_pred EEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEeccccc
Q 000425 156 LVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTG 235 (1525)
Q Consensus 156 LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~ 235 (1525)
+.|||.|+.||-||-..... | ..+..+. ..||+++..|++.++. ++
T Consensus 607 FFt~gKD~kvKqWDg~kFe~---------------i------q~L~~H~-------~ev~cLav~~~G~~vv------s~ 652 (888)
T KOG0306|consen 607 FFTCGKDGKVKQWDGEKFEE---------------I------QKLDGHH-------SEVWCLAVSPNGSFVV------SS 652 (888)
T ss_pred EEEecCcceEEeechhhhhh---------------h------eeeccch-------heeeeeEEcCCCCeEE------ec
Confidence 99999999999999652211 0 0111111 2589999999998877 88
Q ss_pred CCccccccccc
Q 000425 236 GDNIKNRAAYT 246 (1525)
Q Consensus 236 ~Dd~~~k~~~~ 246 (1525)
+.|..++.|-+
T Consensus 653 shD~sIRlwE~ 663 (888)
T KOG0306|consen 653 SHDKSIRLWER 663 (888)
T ss_pred cCCceeEeeec
Confidence 88888887754
No 99
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.43 E-value=3.6e-12 Score=158.44 Aligned_cols=160 Identities=18% Similarity=0.234 Sum_probs=127.4
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
.+++|+-.+.++.+|+ ++..|++++..+. ...+.+++ |+.+|.+|.|+|.+.+||+.+.||+|++||+.++.+.+++
T Consensus 100 r~~~v~g~g~~iaags-dD~~vK~~~~~D~-s~~~~lrg-h~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl 176 (933)
T KOG1274|consen 100 RDLAVSGSGKMIAAGS-DDTAVKLLNLDDS-SQEKVLRG-HDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTL 176 (933)
T ss_pred eEEEEecCCcEEEeec-CceeEEEEecccc-chheeecc-cCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhc
Confidence 5789999999998887 5888999886544 44556666 9999999999999999999999999999999999888777
Q ss_pred e---cC-CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 90 Q---LD-NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 90 ~---H~-~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
. .. ..+.-...+.++|||+|..++..+.|++|++|+..++...+.=..+.|...+.+++|+|+|.||++++.||.|
T Consensus 177 ~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~I 256 (933)
T KOG1274|consen 177 TGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQI 256 (933)
T ss_pred ccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEccCCceeheeecccccccceEEEEEcCCCcEEeeeccCCcE
Confidence 6 11 1110001138899999888899999999999998875333321123455568999999999999999999999
Q ss_pred eEEEeee
Q 000425 166 QVWKTRV 172 (1525)
Q Consensus 166 kVWDlr~ 172 (1525)
.|||+.+
T Consensus 257 ~vWnv~t 263 (933)
T KOG1274|consen 257 LVWNVDT 263 (933)
T ss_pred EEEeccc
Confidence 9999875
No 100
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.42 E-value=1.9e-12 Score=149.48 Aligned_cols=220 Identities=12% Similarity=0.072 Sum_probs=157.4
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.++..+|+++|+..+++++.. +..+.+|+. ..+.........|...+.+++|.|||..|++|+.|++|..||+... .
T Consensus 269 ~~~V~yi~wSPDdryLlaCg~-~e~~~lwDv-~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~i~~wdlDgn-~ 345 (519)
T KOG0293|consen 269 SQPVSYIMWSPDDRYLLACGF-DEVLSLWDV-DTGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRTIIMWDLDGN-I 345 (519)
T ss_pred cCceEEEEECCCCCeEEecCc-hHheeeccC-CcchhhhhcccCcCCCcceeEEccCCceeEecCCCCcEEEecCCcc-h
Confidence 357789999999998876664 556778884 4444444454457789999999999999999999999999998743 3
Q ss_pred EEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC-ceeeEeecccCcCeEEEEEcCCCCEEEEEECC
Q 000425 86 HYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER-PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRD 162 (1525)
Q Consensus 86 l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-~s~i~il~gH~~~VtsVafsPdg~~LaSgS~D 162 (1525)
+...+ ....|. +++.++||+++++...|..|++++..+.. ...+ .-.++|++++.|.+++++++.=.+
T Consensus 346 ~~~W~gvr~~~v~-----dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~li----se~~~its~~iS~d~k~~LvnL~~ 416 (519)
T KOG0293|consen 346 LGNWEGVRDPKVH-----DLAITYDGKYVLLVTVDKKIRLYNREARVDRGLI----SEEQPITSFSISKDGKLALVNLQD 416 (519)
T ss_pred hhcccccccceeE-----EEEEcCCCcEEEEEecccceeeechhhhhhhccc----cccCceeEEEEcCCCcEEEEEccc
Confidence 33333 234466 99999999999999999999999988531 1133 335789999999999999999999
Q ss_pred CCeeEEEeeeecC-----CC--CCCCccccccCCCcccccccccccccCCcceecC---------------CCeeEEEEe
Q 000425 163 GSLQVWKTRVIIN-----PN--RPPMQANFFEPASIESIDIPRILSQQGGEAVYPL---------------PRVRALEVH 220 (1525)
Q Consensus 163 gtIkVWDlr~~~~-----p~--~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l---------------~~V~~v~~H 220 (1525)
.+|++||+.-... .+ ..-+...+|.. .+-..++++..+.++|.. ..|++|+|.
T Consensus 417 qei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg-----~~~~fiaSGSED~kvyIWhr~sgkll~~LsGHs~~vNcVswN 491 (519)
T KOG0293|consen 417 QEIHLWDLEENKLVRKYFGHKQGHFIIRSCFGG-----GNDKFIASGSEDSKVYIWHRISGKLLAVLSGHSKTVNCVSWN 491 (519)
T ss_pred CeeEEeecchhhHHHHhhcccccceEEEeccCC-----CCcceEEecCCCceEEEEEccCCceeEeecCCcceeeEEecC
Confidence 9999999972111 11 11122223332 122344555544455442 358889998
Q ss_pred cCCCe-EEEEecccccCCccccccccccc
Q 000425 221 PRLNL-AVLLFANFTGGDNIKNRAAYTRE 248 (1525)
Q Consensus 221 P~~Nl-ial~F~~~s~~Dd~~~k~~~~~e 248 (1525)
|...- .| +++||.++|+|....
T Consensus 492 P~~p~m~A------SasDDgtIRIWg~~~ 514 (519)
T KOG0293|consen 492 PADPEMFA------SASDDGTIRIWGPSD 514 (519)
T ss_pred CCCHHHhh------ccCCCCeEEEecCCc
Confidence 87542 23 899999999998643
No 101
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.41 E-value=1.6e-12 Score=157.64 Aligned_cols=241 Identities=20% Similarity=0.179 Sum_probs=178.1
Q ss_pred EEEeccceEEEEEeecCCccceEEccCCCCCeEE-EEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecC
Q 000425 23 FGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVN-LACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLG 99 (1525)
Q Consensus 23 ~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~s-Laf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~g 99 (1525)
.+..++.++++|....+....+....+|.+-|.+ +++.+..+ .+++|+.|.+|.+|...+..++.+|. |...|+
T Consensus 29 ~s~sRd~t~~vw~~~~~~~l~~~~~~~~~g~i~~~i~y~e~~~~~l~~g~~D~~i~v~~~~~~~P~~~LkgH~snVC--- 105 (745)
T KOG0301|consen 29 ISGSRDGTVKVWAKKGKQYLETHAFEGPKGFIANSICYAESDKGRLVVGGMDTTIIVFKLSQAEPLYTLKGHKSNVC--- 105 (745)
T ss_pred eecCCCCceeeeeccCcccccceecccCcceeeccceeccccCcceEeecccceEEEEecCCCCchhhhhcccccee---
Confidence 4555788899999877776665555558888877 88887444 59999999999999999999999999 999999
Q ss_pred ceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCC
Q 000425 100 AGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRP 179 (1525)
Q Consensus 100 a~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~ 179 (1525)
+++-..++. ++|||+|.|++||-....... +.+|+..|.++.+-|++ .++|||.|++||+|.-.....
T Consensus 106 --~ls~~~~~~-~iSgSWD~TakvW~~~~l~~~----l~gH~asVWAv~~l~e~-~~vTgsaDKtIklWk~~~~l~---- 173 (745)
T KOG0301|consen 106 --SLSIGEDGT-LISGSWDSTAKVWRIGELVYS----LQGHTASVWAVASLPEN-TYVTGSADKTIKLWKGGTLLK---- 173 (745)
T ss_pred --eeecCCcCc-eEecccccceEEecchhhhcc----cCCcchheeeeeecCCC-cEEeccCcceeeeccCCchhh----
Confidence 888777776 999999999999976632222 56999999999999998 789999999999997543222
Q ss_pred CCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeecc
Q 000425 180 PMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQS 259 (1525)
Q Consensus 180 ~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~ 259 (1525)
.++++. .+|+.+++-|+.-++ +++.|..+|.|.. +| ..+..+-|
T Consensus 174 -------------------tf~gHt-------D~VRgL~vl~~~~fl-------ScsNDg~Ir~w~~-~g--e~l~~~~g 217 (745)
T KOG0301|consen 174 -------------------TFSGHT-------DCVRGLAVLDDSHFL-------SCSNDGSIRLWDL-DG--EVLLEMHG 217 (745)
T ss_pred -------------------hhccch-------hheeeeEEecCCCeE-------eecCCceEEEEec-cC--ceeeeeec
Confidence 122221 356777777765554 5788889998987 33 24557889
Q ss_pred ccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeeeeecccccccCccccCcceeeecccc
Q 000425 260 ARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSHFMEGHAKSAPISRLPLITIFDSKH 339 (1525)
Q Consensus 260 ~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~~~d~~~k~~~~~~~p~~~i~~~~~ 339 (1525)
|.+.+||+. ...+++.+.-- -+|++++||+.. .+++. ..+|..+|.+.+.
T Consensus 218 htn~vYsis-----~~~~~~~Ivs~-----------gEDrtlriW~~~--e~~q~------------I~lPttsiWsa~~ 267 (745)
T KOG0301|consen 218 HTNFVYSIS-----MALSDGLIVST-----------GEDRTLRIWKKD--ECVQV------------ITLPTTSIWSAKV 267 (745)
T ss_pred cceEEEEEE-----ecCCCCeEEEe-----------cCCceEEEeecC--ceEEE------------EecCccceEEEEE
Confidence 999999862 22334444322 389999999744 44444 4688888875554
Q ss_pred c-cCCC
Q 000425 340 Q-LKDI 344 (1525)
Q Consensus 340 ~-~~~~ 344 (1525)
. ++|+
T Consensus 268 L~NgDI 273 (745)
T KOG0301|consen 268 LLNGDI 273 (745)
T ss_pred eeCCCE
Confidence 4 5554
No 102
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.40 E-value=5.5e-13 Score=160.48 Aligned_cols=220 Identities=15% Similarity=0.243 Sum_probs=154.6
Q ss_pred CCCCeE---EEEEcC-CCCEEEEEECCCcEEEEEcCCCc------EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC
Q 000425 50 LKKPIV---NLACHP-RLPVLYVAYADGLIRAYNIHTYA------VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR 118 (1525)
Q Consensus 50 H~~~V~---sLaf~P-dg~~LaSGs~DgtIrLWDl~t~k------~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D 118 (1525)
|...|. .+..++ .+++|++|+.||.|++|++.... .+.+++ |.+.|+ .+....+++.++|+|.|
T Consensus 20 n~~~v~~~~~Lq~da~~~ryLfTgGRDg~i~~W~~~~d~~~~s~~~~asme~HsDWVN-----DiiL~~~~~tlIS~SsD 94 (735)
T KOG0308|consen 20 NRNGVNITKALQLDAPNGRYLFTGGRDGIIRLWSVTQDSNEPSTPYIASMEHHSDWVN-----DIILCGNGKTLISASSD 94 (735)
T ss_pred ccccccchhhccccCCCCceEEecCCCceEEEeccccccCCcccchhhhhhhhHhHHh-----hHHhhcCCCceEEecCC
Confidence 444444 444443 45679999999999999975432 367788 999999 99999999999999999
Q ss_pred CcEEEEeCCCCCceeeEeecccCcCeEEEEE-cCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCccccccc
Q 000425 119 GTLLAWDVSIERPSMIGIIQVGSQPITSVAW-LPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIP 197 (1525)
Q Consensus 119 gtVrVWDl~t~~~s~i~il~gH~~~VtsVaf-sPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~ 197 (1525)
-+|++|+.......+..++..|..+|.|+++ -++..+++||+.|+.|.+||+++... . ..+. |+...++++
T Consensus 95 tTVK~W~~~~~~~~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~--~--l~~s-~n~~t~~sl--- 166 (735)
T KOG0308|consen 95 TTVKVWNAHKDNTFCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTA--T--LVAS-FNNVTVNSL--- 166 (735)
T ss_pred ceEEEeecccCcchhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEccCcch--h--hhhh-ccccccccC---
Confidence 9999999986433444556789999999999 88899999999999999999984421 0 0000 111111111
Q ss_pred ccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCcccchhhhcccCCC
Q 000425 198 RILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGS 277 (1525)
Q Consensus 198 ~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs 277 (1525)
.+ ..-+.+| .++-.+++.+++ +|+-..-.+.|..+++++.+ -||||..++-.++ +...|.
T Consensus 167 --~s-G~k~siY------SLA~N~t~t~iv------sGgtek~lr~wDprt~~kim--kLrGHTdNVr~ll---~~dDGt 226 (735)
T KOG0308|consen 167 --GS-GPKDSIY------SLAMNQTGTIIV------SGGTEKDLRLWDPRTCKKIM--KLRGHTDNVRVLL---VNDDGT 226 (735)
T ss_pred --CC-CCcccee------eeecCCcceEEE------ecCcccceEEecccccccee--eeeccccceEEEE---EcCCCC
Confidence 11 1112344 456667665655 77778888899998876554 6999999973321 111121
Q ss_pred cccchhhHHHHHHHhhhccCCCceeeehhhhhceeeee
Q 000425 278 SGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSH 315 (1525)
Q Consensus 278 ~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~ 315 (1525)
-+|.. ++|.++++||++.+.+++|+
T Consensus 227 -~~ls~------------sSDgtIrlWdLgqQrCl~T~ 251 (735)
T KOG0308|consen 227 -RLLSA------------SSDGTIRLWDLGQQRCLATY 251 (735)
T ss_pred -eEeec------------CCCceEEeeeccccceeeeE
Confidence 13322 27999999999999999994
No 103
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.39 E-value=1.7e-12 Score=142.31 Aligned_cols=154 Identities=15% Similarity=0.202 Sum_probs=113.1
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t 88 (1525)
...+||+-+.-++++|.. .+-+++++......++..+.+ |++.|+.+-|...++.|+++++|++||+||.++++.+.+
T Consensus 103 vk~~af~~ds~~lltgg~-ekllrvfdln~p~App~E~~g-htg~Ir~v~wc~eD~~iLSSadd~tVRLWD~rTgt~v~s 180 (334)
T KOG0278|consen 103 VKAVAFSQDSNYLLTGGQ-EKLLRVFDLNRPKAPPKEISG-HTGGIRTVLWCHEDKCILSSADDKTVRLWDHRTGTEVQS 180 (334)
T ss_pred eeeEEecccchhhhccch-HHHhhhhhccCCCCCchhhcC-CCCcceeEEEeccCceEEeeccCCceEEEEeccCcEEEE
Confidence 346788888888887774 455666664333344445555 999999999999889999999999999999999998888
Q ss_pred EecCCceeecCceeEEEeCCC----------------------------------------CEEEEEeCCCcEEEEeCCC
Q 000425 89 LQLDNTIKLLGAGAFAFHPTL----------------------------------------EWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 89 L~H~~~V~~~ga~sVaFSPdg----------------------------------------~~L~SgS~DgtVrVWDl~t 128 (1525)
+.....|+ ++.+|+|| ..+++|+.|..+..||+.+
T Consensus 181 L~~~s~Vt-----SlEvs~dG~ilTia~gssV~Fwdaksf~~lKs~k~P~nV~SASL~P~k~~fVaGged~~~~kfDy~T 255 (334)
T KOG0278|consen 181 LEFNSPVT-----SLEVSQDGRILTIAYGSSVKFWDAKSFGLLKSYKMPCNVESASLHPKKEFFVAGGEDFKVYKFDYNT 255 (334)
T ss_pred EecCCCCc-----ceeeccCCCEEEEecCceeEEeccccccceeeccCccccccccccCCCceEEecCcceEEEEEeccC
Confidence 87444444 44444444 4455555555555555554
Q ss_pred CCceeeEe-ecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 129 ERPSMIGI-IQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 129 ~~~s~i~i-l~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
. .-++. -.+|.++|.|+.|+|+|...++||.||||++|-..
T Consensus 256 g--eEi~~~nkgh~gpVhcVrFSPdGE~yAsGSEDGTirlWQt~ 297 (334)
T KOG0278|consen 256 G--EEIGSYNKGHFGPVHCVRFSPDGELYASGSEDGTIRLWQTT 297 (334)
T ss_pred C--ceeeecccCCCCceEEEEECCCCceeeccCCCceEEEEEec
Confidence 2 11122 26899999999999999999999999999999875
No 104
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.38 E-value=6.1e-12 Score=152.55 Aligned_cols=188 Identities=19% Similarity=0.237 Sum_probs=142.7
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCce
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTI 95 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V 95 (1525)
.-+|.++. ...+.+|+...+. .+.+...+...|+++.|+++|.+|++|..+|.|.|||..+.+.+.++. |...|
T Consensus 188 ~n~laVal--g~~vylW~~~s~~--v~~l~~~~~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~rv 263 (484)
T KOG0305|consen 188 ANVLAVAL--GQSVYLWSASSGS--VTELCSFGEELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHASRV 263 (484)
T ss_pred CCeEEEEe--cceEEEEecCCCc--eEEeEecCCCceEEEEECCCCCEEEEeecCCeEEEEehhhccccccccCCcCcee
Confidence 33555555 3456677654442 333333358999999999999999999999999999999999888888 77888
Q ss_pred eecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecC
Q 000425 96 KLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIIN 175 (1525)
Q Consensus 96 ~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~ 175 (1525)
. +++|+ +..+.+|+.|+.|.++|++.. ......+.+|.+.|..+.|++|+.+++||+.|+.+.|||.+....
T Consensus 264 g-----~laW~--~~~lssGsr~~~I~~~dvR~~-~~~~~~~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~~~~~p 335 (484)
T KOG0305|consen 264 G-----SLAWN--SSVLSSGSRDGKILNHDVRIS-QHVVSTLQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDGLSPEP 335 (484)
T ss_pred E-----EEecc--CceEEEecCCCcEEEEEEecc-hhhhhhhhcccceeeeeEECCCCCeeccCCCccceEeccCCCccc
Confidence 8 99998 567889999999999999964 222224779999999999999999999999999999999831111
Q ss_pred CCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecC-CCeEEEEecccccCCcccccccccccC
Q 000425 176 PNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPR-LNLAVLLFANFTGGDNIKNRAAYTREG 249 (1525)
Q Consensus 176 p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~-~Nlial~F~~~s~~Dd~~~k~~~~~eg 249 (1525)
+. .+..+. ..|++++|+|- .+++|. ..|.-|..++.|....|
T Consensus 336 -----~~----------------~~~~H~-------aAVKA~awcP~q~~lLAs----GGGs~D~~i~fwn~~~g 378 (484)
T KOG0305|consen 336 -----KF----------------TFTEHT-------AAVKALAWCPWQSGLLAT----GGGSADRCIKFWNTNTG 378 (484)
T ss_pred -----cE----------------EEeccc-------eeeeEeeeCCCccCceEE----cCCCcccEEEEEEcCCC
Confidence 00 011111 24788999996 577763 26778888999987776
No 105
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.37 E-value=7.2e-12 Score=146.77 Aligned_cols=227 Identities=17% Similarity=0.221 Sum_probs=157.2
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCcc---------------ceEEc-cCCCCCeEEEEEcCCCCEEEEEEC
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRA---------------PTKIK-TDLKKPIVNLACHPRLPVLYVAYA 71 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~---------------~tklk-~~H~~~V~sLaf~Pdg~~LaSGs~ 71 (1525)
+.+.++++|+..++|.++ ++.+|.-|....+-+. +.+.. ..|.+.|.+++.+||+++|++|+.
T Consensus 144 s~~~vals~d~~~~fsas-k~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r~~h~keil~~avS~Dgkylatgg~ 222 (479)
T KOG0299|consen 144 SVTSVALSPDDKRVFSAS-KDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESRKGHVKEILTLAVSSDGKYLATGGR 222 (479)
T ss_pred cceEEEeeccccceeecC-CCcceeeeehhcCcccccccccchhhhhccCCCCcccccccceeEEEEEcCCCcEEEecCC
Confidence 467899999999997666 5667755543222100 11111 258889999999999999999999
Q ss_pred CCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEc
Q 000425 72 DGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWL 150 (1525)
Q Consensus 72 DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafs 150 (1525)
|..|.|||.++.+.++++. |.+.|. +++|--.-.-++++|.|++|++|+++. ...+.++-+|.+.|.+|.-.
T Consensus 223 d~~v~Iw~~~t~ehv~~~~ghr~~V~-----~L~fr~gt~~lys~s~Drsvkvw~~~~--~s~vetlyGHqd~v~~IdaL 295 (479)
T KOG0299|consen 223 DRHVQIWDCDTLEHVKVFKGHRGAVS-----SLAFRKGTSELYSASADRSVKVWSIDQ--LSYVETLYGHQDGVLGIDAL 295 (479)
T ss_pred CceEEEecCcccchhhccccccccee-----eeeeecCccceeeeecCCceEEEehhH--hHHHHHHhCCccceeeechh
Confidence 9999999999999999999 999999 999988777899999999999999984 33333445899999888888
Q ss_pred CCCCEEEEEECCCCeeEEEee----eecCCCCCCCccccccCC---------C-cccccccc---cc--cccCCcceecC
Q 000425 151 PMLRLLVTLCRDGSLQVWKTR----VIINPNRPPMQANFFEPA---------S-IESIDIPR---IL--SQQGGEAVYPL 211 (1525)
Q Consensus 151 Pdg~~LaSgS~DgtIkVWDlr----~~~~p~~~~~~a~~f~p~---------g-Vesldi~~---iL--s~~gG~~vy~l 211 (1525)
...+.+..|+.|+|+++|++. .+..++.....+..|... | |.-+++.. +. ....| +.+.
T Consensus 296 ~reR~vtVGgrDrT~rlwKi~eesqlifrg~~~sidcv~~In~~HfvsGSdnG~IaLWs~~KKkplf~~~~AHg--v~~~ 373 (479)
T KOG0299|consen 296 SRERCVTVGGRDRTVRLWKIPEESQLIFRGGEGSIDCVAFINDEHFVSGSDNGSIALWSLLKKKPLFTSRLAHG--VIPE 373 (479)
T ss_pred cccceEEeccccceeEEEeccccceeeeeCCCCCeeeEEEecccceeeccCCceEEEeeecccCceeEeecccc--ccCC
Confidence 778877777889999999874 333333322222111110 0 10010000 00 00000 0000
Q ss_pred ----C---CeeEEEEecCCCeEEEEecccccCCcccccccccccCc
Q 000425 212 ----P---RVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGR 250 (1525)
Q Consensus 212 ----~---~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~ 250 (1525)
. =|+.++.-|..||++ +++-+..+|.|...+|-
T Consensus 374 ~~~~~~~~Witsla~i~~sdL~a------sGS~~G~vrLW~i~~g~ 413 (479)
T KOG0299|consen 374 LDPVNGNFWITSLAVIPGSDLLA------SGSWSGCVRLWKIEDGL 413 (479)
T ss_pred ccccccccceeeeEecccCceEE------ecCCCCceEEEEecCCc
Confidence 0 156688888888888 88888999999988883
No 106
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.37 E-value=1e-10 Score=132.67 Aligned_cols=251 Identities=10% Similarity=0.109 Sum_probs=170.4
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC--CCcEEEEEcCCCc
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA--DGLIRAYNIHTYA 84 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~--DgtIrLWDl~t~k 84 (1525)
+..-.|.||+++-.++++.+ +-++.+++. ..++....+.. ++-.|-.+.|-.....++.++. |.+||..++.+.+
T Consensus 15 ~~i~sl~fs~~G~~litss~-dDsl~LYd~-~~g~~~~ti~s-kkyG~~~~~Fth~~~~~i~sStk~d~tIryLsl~dNk 91 (311)
T KOG1446|consen 15 GKINSLDFSDDGLLLITSSE-DDSLRLYDS-LSGKQVKTINS-KKYGVDLACFTHHSNTVIHSSTKEDDTIRYLSLHDNK 91 (311)
T ss_pred CceeEEEecCCCCEEEEecC-CCeEEEEEc-CCCceeeEeec-ccccccEEEEecCCceEEEccCCCCCceEEEEeecCc
Confidence 34568999999999988664 667888885 44555556655 5556777778666666665555 8999999999999
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
.++.|. |.+.|+ +++.+|-+..++|+|.|++||+||+++ ..+.+.+. ...-..++|+|+|-++|+++...
T Consensus 92 ylRYF~GH~~~V~-----sL~~sP~~d~FlS~S~D~tvrLWDlR~--~~cqg~l~--~~~~pi~AfDp~GLifA~~~~~~ 162 (311)
T KOG1446|consen 92 YLRYFPGHKKRVN-----SLSVSPKDDTFLSSSLDKTVRLWDLRV--KKCQGLLN--LSGRPIAAFDPEGLIFALANGSE 162 (311)
T ss_pred eEEEcCCCCceEE-----EEEecCCCCeEEecccCCeEEeeEecC--CCCceEEe--cCCCcceeECCCCcEEEEecCCC
Confidence 999999 999999 999999999999999999999999994 44443332 33345689999999999999888
Q ss_pred CeeEEEeeeecC-CCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccc
Q 000425 164 SLQVWKTRVIIN-PNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNR 242 (1525)
Q Consensus 164 tIkVWDlr~~~~-p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k 242 (1525)
.|+++|+|.+-. |-. .| - +.. -...+++.++|.|.+-.+-+ +.+....+
T Consensus 163 ~IkLyD~Rs~dkgPF~-----tf---------~----i~~------~~~~ew~~l~FS~dGK~iLl------sT~~s~~~ 212 (311)
T KOG1446|consen 163 LIKLYDLRSFDKGPFT-----TF---------S----ITD------NDEAEWTDLEFSPDGKSILL------STNASFIY 212 (311)
T ss_pred eEEEEEecccCCCCce-----eE---------c----cCC------CCccceeeeEEcCCCCEEEE------EeCCCcEE
Confidence 999999996532 110 00 0 000 01235778999998877653 33444555
Q ss_pred ccccccCceeeeeeeccccCCcccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeee
Q 000425 243 AAYTREGRKQLFAVLQSARGSSASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYS 314 (1525)
Q Consensus 243 ~~~~~eg~~q~~~tlq~~~~~~~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s 314 (1525)
+....+|. ...++.++.+...-.+...+.+.+. =++.+. +|+.+-||++..-..+..
T Consensus 213 ~lDAf~G~--~~~tfs~~~~~~~~~~~a~ftPds~-Fvl~gs------------~dg~i~vw~~~tg~~v~~ 269 (311)
T KOG1446|consen 213 LLDAFDGT--VKSTFSGYPNAGNLPLSATFTPDSK-FVLSGS------------DDGTIHVWNLETGKKVAV 269 (311)
T ss_pred EEEccCCc--EeeeEeeccCCCCcceeEEECCCCc-EEEEec------------CCCcEEEEEcCCCcEeeE
Confidence 55555553 2335555555431112222222222 133333 678899998755544544
No 107
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.36 E-value=1.5e-12 Score=148.57 Aligned_cols=157 Identities=13% Similarity=0.192 Sum_probs=129.6
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC-c
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY-A 84 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~-k 84 (1525)
.|+..++.|+|..+.|+.++..+.++.+++... +.+..++.. +-.-+.|+|+|..-.|.+|+.|..+..||++.. .
T Consensus 187 ~Dti~svkfNpvETsILas~~sDrsIvLyD~R~-~~Pl~KVi~--~mRTN~IswnPeafnF~~a~ED~nlY~~DmR~l~~ 263 (433)
T KOG0268|consen 187 ADSISSVKFNPVETSILASCASDRSIVLYDLRQ-ASPLKKVIL--TMRTNTICWNPEAFNFVAANEDHNLYTYDMRNLSR 263 (433)
T ss_pred CCceeEEecCCCcchheeeeccCCceEEEeccc-CCccceeee--eccccceecCccccceeeccccccceehhhhhhcc
Confidence 578899999999999999999999999999654 444444433 334467999997778999999999999998865 4
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
++.... |.+.|. +|+|||.|+-+++||-|.+||||.++.....-+ ..+...+.|.|+.||-|.+|++|||+|+
T Consensus 264 p~~v~~dhvsAV~-----dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdi-YhtkRMq~V~~Vk~S~Dskyi~SGSdd~ 337 (433)
T KOG0268|consen 264 PLNVHKDHVSAVM-----DVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDI-YHTKRMQHVFCVKYSMDSKYIISGSDDG 337 (433)
T ss_pred cchhhcccceeEE-----EeccCCCcchhccccccceEEEeecCCCcchhh-hhHhhhheeeEEEEeccccEEEecCCCc
Confidence 566666 999999 999999999999999999999999985421111 1234467899999999999999999999
Q ss_pred CeeEEEee
Q 000425 164 SLQVWKTR 171 (1525)
Q Consensus 164 tIkVWDlr 171 (1525)
.|++|..+
T Consensus 338 nvRlWka~ 345 (433)
T KOG0268|consen 338 NVRLWKAK 345 (433)
T ss_pred ceeeeecc
Confidence 99999876
No 108
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.36 E-value=1.7e-11 Score=143.69 Aligned_cols=215 Identities=19% Similarity=0.261 Sum_probs=147.5
Q ss_pred eEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE----EE---E-----------e-cCCceeecCceeEE
Q 000425 44 TKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH----YT---L-----------Q-LDNTIKLLGAGAFA 104 (1525)
Q Consensus 44 tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l----~t---L-----------~-H~~~V~~~ga~sVa 104 (1525)
..+. .|..+|.+++++|++.+.++++.|++|-=|++.+++.. .+ + + |.+.+. +++
T Consensus 136 ~~~~-~H~~s~~~vals~d~~~~fsask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r~~h~keil-----~~a 209 (479)
T KOG0299|consen 136 RVIG-KHQLSVTSVALSPDDKRVFSASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESRKGHVKEIL-----TLA 209 (479)
T ss_pred eeec-cccCcceEEEeeccccceeecCCCcceeeeehhcCcccccccccchhhhhccCCCCcccccccceeE-----EEE
Confidence 3444 49999999999999999999999999999998877633 11 1 1 444566 999
Q ss_pred EeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccc
Q 000425 105 FHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQAN 184 (1525)
Q Consensus 105 FSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~ 184 (1525)
.|+||+||++|+.|..|.|||.++. ..+..+.+|...|.+++|--.-.-+.++|.|.+|++|++....- +-..
T Consensus 210 vS~Dgkylatgg~d~~v~Iw~~~t~--ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~~s~-----vetl 282 (479)
T KOG0299|consen 210 VSSDGKYLATGGRDRHVQIWDCDTL--EHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQLSY-----VETL 282 (479)
T ss_pred EcCCCcEEEecCCCceEEEecCccc--chhhcccccccceeeeeeecCccceeeeecCCceEEEehhHhHH-----HHHH
Confidence 9999999999999999999999974 22233669999999999998888899999999999999873210 1112
Q ss_pred cccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccCCc
Q 000425 185 FFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARGSS 264 (1525)
Q Consensus 185 ~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~~~ 264 (1525)
|-++++|..+| .|+ ..-++. .++-|...++|+-.+- .|+ ++++|.++.
T Consensus 283 yGHqd~v~~Id---aL~--------------------reR~vt------VGgrDrT~rlwKi~ee-sql--ifrg~~~si 330 (479)
T KOG0299|consen 283 YGHQDGVLGID---ALS--------------------RERCVT------VGGRDRTVRLWKIPEE-SQL--IFRGGEGSI 330 (479)
T ss_pred hCCccceeeec---hhc--------------------ccceEE------eccccceeEEEecccc-cee--eeeCCCCCe
Confidence 22344444444 111 111112 5566777777776432 344 578887654
Q ss_pred ccchhhhcccCCCcccchhhHHHHHHHhhhccCCCceeeehhhhhceeeeeeccccc
Q 000425 265 ASVLKEKLSSMGSSGILADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSHFMEGHA 321 (1525)
Q Consensus 265 ~s~~~e~~~slgs~g~l~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~~~d~~~ 321 (1525)
-++ +-+.....+.+. .+..+.+|++.|+.++++ +...|+
T Consensus 331 dcv-----~~In~~HfvsGS------------dnG~IaLWs~~KKkplf~-~~~AHg 369 (479)
T KOG0299|consen 331 DCV-----AFINDEHFVSGS------------DNGSIALWSLLKKKPLFT-SRLAHG 369 (479)
T ss_pred eeE-----EEecccceeecc------------CCceEEEeeecccCceeE-eecccc
Confidence 221 222222222222 477889999999999999 334444
No 109
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.35 E-value=2.7e-10 Score=129.27 Aligned_cols=117 Identities=13% Similarity=0.136 Sum_probs=104.4
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe--CCCcEEEEe
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD--RRGTLLAWD 125 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS--~DgtVrVWD 125 (1525)
+..+.|.++.|+++|.++++++.|-+++|||..++++++++. +.-.+. .++|-.....++.++ .|.+||.-+
T Consensus 12 ~~~~~i~sl~fs~~G~~litss~dDsl~LYd~~~g~~~~ti~skkyG~~-----~~~Fth~~~~~i~sStk~d~tIryLs 86 (311)
T KOG1446|consen 12 ETNGKINSLDFSDDGLLLITSSEDDSLRLYDSLSGKQVKTINSKKYGVD-----LACFTHHSNTVIHSSTKEDDTIRYLS 86 (311)
T ss_pred cCCCceeEEEecCCCCEEEEecCCCeEEEEEcCCCceeeEeeccccccc-----EEEEecCCceEEEccCCCCCceEEEE
Confidence 367899999999999999999999999999999999999999 667777 889977666666666 499999999
Q ss_pred CCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 126 VSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 126 l~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+.+ .+.+..+.||...|++++.+|-+..++|+|.|++|++||+|+
T Consensus 87 l~d--NkylRYF~GH~~~V~sL~~sP~~d~FlS~S~D~tvrLWDlR~ 131 (311)
T KOG1446|consen 87 LHD--NKYLRYFPGHKKRVNSLSVSPKDDTFLSSSLDKTVRLWDLRV 131 (311)
T ss_pred eec--CceEEEcCCCCceEEEEEecCCCCeEEecccCCeEEeeEecC
Confidence 984 566777889999999999999999999999999999999994
No 110
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.35 E-value=4.5e-12 Score=148.61 Aligned_cols=161 Identities=17% Similarity=0.175 Sum_probs=127.6
Q ss_pred CCcEEEEEEcCCCC--EEEEEEeccceEEEEEeecCCc--cceEEcc-CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc
Q 000425 6 VDTEVHLALTPLQP--VVFFGFHRRMSVTVVGTVEGGR--APTKIKT-DLKKPIVNLACHPRLPVLYVAYADGLIRAYNI 80 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp--~Vf~s~~~d~sI~vv~tieggk--~~tklk~-~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl 80 (1525)
+-+||+.|++-..| .||+|. +..|+||+....+. +...+.. .....|+++..+|+|+.|++|+.-.++.|||+
T Consensus 417 ~HGEvVcAvtIS~~trhVyTgG--kgcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgrtLivGGeastlsiWDL 494 (705)
T KOG0639|consen 417 AHGEVVCAVTISNPTRHVYTGG--KGCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLSIWDL 494 (705)
T ss_pred ccCcEEEEEEecCCcceeEecC--CCeEEEeeccCCCCCCccccccccCcccceeeeEecCCCceEEeccccceeeeeec
Confidence 45788888875544 566664 77899998654432 1111211 13578999999999999999999999999999
Q ss_pred CCCcEEEEEe-c--CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 81 HTYAVHYTLQ-L--DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 81 ~t~k~l~tL~-H--~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
....+....+ . .-... +++.+||.+..+++..||.|.|||+. .+.+++.+++|++.+.||..++||..|-
T Consensus 495 AapTprikaeltssapaCy-----ALa~spDakvcFsccsdGnI~vwDLh--nq~~VrqfqGhtDGascIdis~dGtklW 567 (705)
T KOG0639|consen 495 AAPTPRIKAELTSSAPACY-----ALAISPDAKVCFSCCSDGNIAVWDLH--NQTLVRQFQGHTDGASCIDISKDGTKLW 567 (705)
T ss_pred cCCCcchhhhcCCcchhhh-----hhhcCCccceeeeeccCCcEEEEEcc--cceeeecccCCCCCceeEEecCCCceee
Confidence 8765543333 1 12344 88999999999999999999999999 5777778889999999999999999999
Q ss_pred EEECCCCeeEEEeeeecC
Q 000425 158 TLCRDGSLQVWKTRVIIN 175 (1525)
Q Consensus 158 SgS~DgtIkVWDlr~~~~ 175 (1525)
||+.|.+||.||+|....
T Consensus 568 TGGlDntvRcWDlregrq 585 (705)
T KOG0639|consen 568 TGGLDNTVRCWDLREGRQ 585 (705)
T ss_pred cCCCccceeehhhhhhhh
Confidence 999999999999996654
No 111
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.34 E-value=1.5e-11 Score=152.50 Aligned_cols=143 Identities=17% Similarity=0.200 Sum_probs=112.8
Q ss_pred EEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC-CCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeec
Q 000425 20 VVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR-LPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLL 98 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd-g~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ 98 (1525)
..+.++.-|++|++|.. +.+...+++. |.+.|+||+|||. +++|++||.||+||||++...++..-.....-|+
T Consensus 381 ~fLLSSSMDKTVRLWh~--~~~~CL~~F~-HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~lIT-- 455 (712)
T KOG0283|consen 381 NFLLSSSMDKTVRLWHP--GRKECLKVFS-HNDFVTCVAFNPVDDRYFISGSLDGKVRLWSISDKKVVDWNDLRDLIT-- 455 (712)
T ss_pred CeeEeccccccEEeecC--CCcceeeEEe-cCCeeEEEEecccCCCcEeecccccceEEeecCcCeeEeehhhhhhhe--
Confidence 34556668999999985 3556666776 9999999999996 5699999999999999999888877766458899
Q ss_pred CceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee---eEe----ecccCcCeEEEEEcCCCC-EEEEEECCCCeeEEEe
Q 000425 99 GAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM---IGI----IQVGSQPITSVAWLPMLR-LLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 99 ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~---i~i----l~gH~~~VtsVafsPdg~-~LaSgS~DgtIkVWDl 170 (1525)
+++|.|||++.+.|+.+|.+++|+........ +.. -..|. .|+++.|.|... .++..|.|..|||+|.
T Consensus 456 ---Avcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~-rITG~Q~~p~~~~~vLVTSnDSrIRI~d~ 531 (712)
T KOG0283|consen 456 ---AVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGK-RITGLQFFPGDPDEVLVTSNDSRIRIYDG 531 (712)
T ss_pred ---eEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCc-eeeeeEecCCCCCeEEEecCCCceEEEec
Confidence 99999999999999999999999988521100 100 01233 899999998544 3555588999999999
Q ss_pred e
Q 000425 171 R 171 (1525)
Q Consensus 171 r 171 (1525)
+
T Consensus 532 ~ 532 (712)
T KOG0283|consen 532 R 532 (712)
T ss_pred c
Confidence 5
No 112
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.34 E-value=7.6e-12 Score=136.53 Aligned_cols=120 Identities=12% Similarity=0.102 Sum_probs=110.5
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEE
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLA 123 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrV 123 (1525)
.+.. |+++|+.+.|+-+|++.++++.|.+||+|+...+.+++++. |...|. .++.+.|...+++|+.|+.|.+
T Consensus 12 ~l~~-~qgaV~avryN~dGnY~ltcGsdrtvrLWNp~rg~liktYsghG~EVl-----D~~~s~Dnskf~s~GgDk~v~v 85 (307)
T KOG0316|consen 12 ILDC-AQGAVRAVRYNVDGNYCLTCGSDRTVRLWNPLRGALIKTYSGHGHEVL-----DAALSSDNSKFASCGGDKAVQV 85 (307)
T ss_pred eecc-cccceEEEEEccCCCEEEEcCCCceEEeecccccceeeeecCCCceee-----eccccccccccccCCCCceEEE
Confidence 3444 89999999999999999999999999999999999999999 999999 9999999999999999999999
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
||+++. +...-+.+|...|+.+.|+.+...++|||-|.++++||-|.
T Consensus 86 wDV~TG--kv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS 132 (307)
T KOG0316|consen 86 WDVNTG--KVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRS 132 (307)
T ss_pred EEcccC--eeeeecccccceeeEEEecCcceEEEeccccceeEEEEccc
Confidence 999974 33444669999999999999999999999999999999884
No 113
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.33 E-value=2.4e-11 Score=147.44 Aligned_cols=155 Identities=17% Similarity=0.229 Sum_probs=129.7
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.+..-.+.++|.+..+.+|.. +..+.+|+. +..+....+...|...|.+++|+ +..+.+|+.|+.|..+|++..+.
T Consensus 217 ~~~vtSv~ws~~G~~LavG~~-~g~v~iwD~-~~~k~~~~~~~~h~~rvg~laW~--~~~lssGsr~~~I~~~dvR~~~~ 292 (484)
T KOG0305|consen 217 EELVTSVKWSPDGSHLAVGTS-DGTVQIWDV-KEQKKTRTLRGSHASRVGSLAWN--SSVLSSGSRDGKILNHDVRISQH 292 (484)
T ss_pred CCceEEEEECCCCCEEEEeec-CCeEEEEeh-hhccccccccCCcCceeEEEecc--CceEEEecCCCcEEEEEEecchh
Confidence 466778999999999999985 788989984 44444555555599999999998 67899999999999999998766
Q ss_pred EEE-Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEE--E
Q 000425 86 HYT-LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTL--C 160 (1525)
Q Consensus 86 l~t-L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSg--S 160 (1525)
... +. |...|. .++|++|+.++++|+.|+.+.|||.... .....+..|...|..++|+|- ..+||+| +
T Consensus 293 ~~~~~~~H~qeVC-----gLkws~d~~~lASGgnDN~~~Iwd~~~~--~p~~~~~~H~aAVKA~awcP~q~~lLAsGGGs 365 (484)
T KOG0305|consen 293 VVSTLQGHRQEVC-----GLKWSPDGNQLASGGNDNVVFIWDGLSP--EPKFTFTEHTAAVKALAWCPWQSGLLATGGGS 365 (484)
T ss_pred hhhhhhcccceee-----eeEECCCCCeeccCCCccceEeccCCCc--cccEEEeccceeeeEeeeCCCccCceEEcCCC
Confidence 555 66 999999 9999999999999999999999999643 222235699999999999995 5688886 5
Q ss_pred CCCCeeEEEee
Q 000425 161 RDGSLQVWKTR 171 (1525)
Q Consensus 161 ~DgtIkVWDlr 171 (1525)
.|++|++||..
T Consensus 366 ~D~~i~fwn~~ 376 (484)
T KOG0305|consen 366 ADRCIKFWNTN 376 (484)
T ss_pred cccEEEEEEcC
Confidence 79999999997
No 114
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=99.31 E-value=1.2e-09 Score=131.57 Aligned_cols=155 Identities=17% Similarity=0.169 Sum_probs=110.9
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
--|||.|++..++.+..+ .+.++++.+| .....+++ |++.|.+++|+.+|..|+||+.|..|-+|+-+- -.++
T Consensus 16 ~d~afkPDGsqL~lAAg~--rlliyD~ndG-~llqtLKg-HKDtVycVAys~dGkrFASG~aDK~VI~W~~kl---EG~L 88 (1081)
T KOG1538|consen 16 NDIAFKPDGTQLILAAGS--RLLVYDTSDG-TLLQPLKG-HKDTVYCVAYAKDGKRFASGSADKSVIIWTSKL---EGIL 88 (1081)
T ss_pred heeEECCCCceEEEecCC--EEEEEeCCCc-cccccccc-ccceEEEEEEccCCceeccCCCceeEEEecccc---ccee
Confidence 348999999999988754 4567776554 44556777 999999999999999999999999999997431 1111
Q ss_pred e--cCCcee---------------e------------------cC-ceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee
Q 000425 90 Q--LDNTIK---------------L------------------LG-AGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM 133 (1525)
Q Consensus 90 ~--H~~~V~---------------~------------------~g-a~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~ 133 (1525)
+ |.+.|. + +. .++++|..||++++.|-.||||.+-+-..+..-.
T Consensus 89 kYSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~CsWtnDGqylalG~~nGTIsiRNk~gEek~~ 168 (1081)
T KOG1538|consen 89 KYSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICCSWTNDGQYLALGMFNGTISIRNKNGEEKVK 168 (1081)
T ss_pred eeccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEeeecCCCcEEEEeccCceEEeecCCCCcceE
Confidence 1 211110 0 00 1478899999999999999999998765432222
Q ss_pred eEeecccCcCeEEEEEcCCC-----CEEEEEECCCCeeEEEee
Q 000425 134 IGIIQVGSQPITSVAWLPML-----RLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 134 i~il~gH~~~VtsVafsPdg-----~~LaSgS~DgtIkVWDlr 171 (1525)
+.-.-|.+++|.+++|+|.. ..++...+..|+..+.+.
T Consensus 169 I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~Ls 211 (1081)
T KOG1538|consen 169 IERPGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLS 211 (1081)
T ss_pred EeCCCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEec
Confidence 22223567899999999963 357777777777777766
No 115
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.31 E-value=4.5e-11 Score=133.72 Aligned_cols=162 Identities=14% Similarity=0.193 Sum_probs=127.0
Q ss_pred cccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCC---------ccceEE----ccCCCCCeEEEEEcCCCC-EEEE
Q 000425 3 SISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG---------RAPTKI----KTDLKKPIVNLACHPRLP-VLYV 68 (1525)
Q Consensus 3 ~~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg---------k~~tkl----k~~H~~~V~sLaf~Pdg~-~LaS 68 (1525)
++-+.+.=.|.+-|+.-..+.+...+.++.+|+.-... +....+ ...|+-.|.++.|-|-++ +|.+
T Consensus 40 r~HgGsvNsL~id~tegrymlSGgadgsi~v~Dl~n~t~~e~s~li~k~~c~v~~~h~~~Hky~iss~~WyP~DtGmFts 119 (397)
T KOG4283|consen 40 RPHGGSVNSLQIDLTEGRYMLSGGADGSIAVFDLQNATDYEASGLIAKHKCIVAKQHENGHKYAISSAIWYPIDTGMFTS 119 (397)
T ss_pred ccCCCccceeeeccccceEEeecCCCccEEEEEeccccchhhccceeheeeeccccCCccceeeeeeeEEeeecCceeec
Confidence 34455666788888877666777779999999863221 111111 234777899999999654 8999
Q ss_pred EECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCC---CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeE
Q 000425 69 AYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTL---EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPIT 145 (1525)
Q Consensus 69 Gs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg---~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~Vt 145 (1525)
++.|.++|+||..+.+....|+-...|. +-++||-. -.|++|.+|-.|++.|+.+. .+..++.||.+.|.
T Consensus 120 sSFDhtlKVWDtnTlQ~a~~F~me~~VY-----shamSp~a~sHcLiA~gtr~~~VrLCDi~SG--s~sH~LsGHr~~vl 192 (397)
T KOG4283|consen 120 SSFDHTLKVWDTNTLQEAVDFKMEGKVY-----SHAMSPMAMSHCLIAAGTRDVQVRLCDIASG--SFSHTLSGHRDGVL 192 (397)
T ss_pred ccccceEEEeecccceeeEEeecCceee-----hhhcChhhhcceEEEEecCCCcEEEEeccCC--cceeeeccccCceE
Confidence 9999999999999999998898666677 77888843 35677779999999999964 44455889999999
Q ss_pred EEEEcCCCCE-EEEEECCCCeeEEEee
Q 000425 146 SVAWLPMLRL-LVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 146 sVafsPdg~~-LaSgS~DgtIkVWDlr 171 (1525)
++.|+|...+ |+|||.||.||+||+|
T Consensus 193 aV~Wsp~~e~vLatgsaDg~irlWDiR 219 (397)
T KOG4283|consen 193 AVEWSPSSEWVLATGSADGAIRLWDIR 219 (397)
T ss_pred EEEeccCceeEEEecCCCceEEEEEee
Confidence 9999998775 7899999999999998
No 116
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.31 E-value=1.9e-11 Score=148.56 Aligned_cols=134 Identities=14% Similarity=0.167 Sum_probs=109.8
Q ss_pred EEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCc
Q 000425 22 FFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGA 100 (1525)
Q Consensus 22 f~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga 100 (1525)
+.+.+.|.++++|-..+- ...+.+ |+..|.++++-|.+ .++|||.|.+||+|.- ++++++|. |.+.|+
T Consensus 115 ~iSgSWD~TakvW~~~~l---~~~l~g-H~asVWAv~~l~e~-~~vTgsaDKtIklWk~--~~~l~tf~gHtD~VR---- 183 (745)
T KOG0301|consen 115 LISGSWDSTAKVWRIGEL---VYSLQG-HTASVWAVASLPEN-TYVTGSADKTIKLWKG--GTLLKTFSGHTDCVR---- 183 (745)
T ss_pred eEecccccceEEecchhh---hcccCC-cchheeeeeecCCC-cEEeccCcceeeeccC--Cchhhhhccchhhee----
Confidence 344556777777753222 222444 99999999999988 8999999999999954 78899999 999999
Q ss_pred eeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 101 GAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 101 ~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+++-+++ .++|++.||.|+.||+.. .++-.+.+|+..|.+++..+++..++||+.|++++||+..
T Consensus 184 -gL~vl~~~-~flScsNDg~Ir~w~~~g---e~l~~~~ghtn~vYsis~~~~~~~Ivs~gEDrtlriW~~~ 249 (745)
T KOG0301|consen 184 -GLAVLDDS-HFLSCSNDGSIRLWDLDG---EVLLEMHGHTNFVYSISMALSDGLIVSTGEDRTLRIWKKD 249 (745)
T ss_pred -eeEEecCC-CeEeecCCceEEEEeccC---ceeeeeeccceEEEEEEecCCCCeEEEecCCceEEEeecC
Confidence 99998865 588999999999999963 3334466999999999977788899999999999999864
No 117
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.30 E-value=2.8e-09 Score=129.92 Aligned_cols=163 Identities=13% Similarity=0.126 Sum_probs=118.6
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccce-EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPT-KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT 82 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~t-klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t 82 (1525)
..+...-.+|.+|.+..+..|+++ . +...-.++.+.... ......+++|.+++|+|++..+++||.||.||+||..+
T Consensus 108 ~~gg~IWsiai~p~~~~l~Igcdd-G-vl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~ 185 (691)
T KOG2048|consen 108 SNGGAIWSIAINPENTILAIGCDD-G-VLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKS 185 (691)
T ss_pred CCCcceeEEEeCCccceEEeecCC-c-eEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCceEEEEEcCC
Confidence 345677789999999999999864 3 22221122222211 22222568999999999999999999999999999999
Q ss_pred CcEEEEEe-cCCceee---cCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 83 YAVHYTLQ-LDNTIKL---LGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~---~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
+..++... ....+.. .-+++|.|-.| ..|++|.+-|+|++||... .+.++.+..|...|.|++..+++.++++
T Consensus 186 ~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd-~tI~sgDS~G~V~FWd~~~--gTLiqS~~~h~adVl~Lav~~~~d~vfs 262 (691)
T KOG2048|consen 186 GQTLHIITMQLDRLSKREPTIVWSVLFLRD-STIASGDSAGTVTFWDSIF--GTLIQSHSCHDADVLALAVADNEDRVFS 262 (691)
T ss_pred CceEEEeeecccccccCCceEEEEEEEeec-CcEEEecCCceEEEEcccC--cchhhhhhhhhcceeEEEEcCCCCeEEE
Confidence 88776332 1111110 01247777655 4689999999999999884 4444455689999999999999999999
Q ss_pred EECCCCeeEEEee
Q 000425 159 LCRDGSLQVWKTR 171 (1525)
Q Consensus 159 gS~DgtIkVWDlr 171 (1525)
++.|+.|.-+...
T Consensus 263 aGvd~~ii~~~~~ 275 (691)
T KOG2048|consen 263 AGVDPKIIQYSLT 275 (691)
T ss_pred ccCCCceEEEEec
Confidence 9999998777654
No 118
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.30 E-value=3.5e-11 Score=136.02 Aligned_cols=184 Identities=19% Similarity=0.250 Sum_probs=139.5
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceE----E--ccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcC
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTK----I--KTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tk----l--k~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~ 81 (1525)
-.+|+|+|++..+|+|. ...|+++++...++.... . +.+..+-|.+++|||... .+++|+.-.++-||.-.
T Consensus 161 AhsL~Fs~DGeqlfaGy--krcirvFdt~RpGr~c~vy~t~~~~k~gq~giisc~a~sP~~~~~~a~gsY~q~~giy~~~ 238 (406)
T KOG2919|consen 161 AHSLQFSPDGEQLFAGY--KRCIRVFDTSRPGRDCPVYTTVTKGKFGQKGIISCFAFSPMDSKTLAVGSYGQRVGIYNDD 238 (406)
T ss_pred heeEEecCCCCeEeecc--cceEEEeeccCCCCCCcchhhhhcccccccceeeeeeccCCCCcceeeecccceeeeEecC
Confidence 36899999999999997 457888998665543221 1 223467789999999876 89999999999999999
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCe-EE--EEEcCCCCEE
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPI-TS--VAWLPMLRLL 156 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~V-ts--VafsPdg~~L 156 (1525)
.+.+++.+- |.+.|+ .++|++||+.|++|+ .|-.|.+||++.. ..++-.+..|.... .. +...|++++|
T Consensus 239 ~~~pl~llggh~gGvT-----hL~~~edGn~lfsGaRk~dkIl~WDiR~~-~~pv~~L~rhv~~TNQRI~FDld~~~~~L 312 (406)
T KOG2919|consen 239 GRRPLQLLGGHGGGVT-----HLQWCEDGNKLFSGARKDDKILCWDIRYS-RDPVYALERHVGDTNQRILFDLDPKGEIL 312 (406)
T ss_pred CCCceeeecccCCCee-----eEEeccCcCeecccccCCCeEEEEeehhc-cchhhhhhhhccCccceEEEecCCCCcee
Confidence 999999999 999999 999999999999999 6889999999953 22221244554422 22 4456889999
Q ss_pred EEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEE
Q 000425 157 VTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAV 227 (1525)
Q Consensus 157 aSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlia 227 (1525)
++|+.||.|++||++...++-. +.. .|. .+++.|++||.+++.+
T Consensus 313 asG~tdG~V~vwdlk~~gn~~s--------------------v~~------~~s-d~vNgvslnP~mpila 356 (406)
T KOG2919|consen 313 ASGDTDGSVRVWDLKDLGNEVS--------------------VTG------NYS-DTVNGVSLNPIMPILA 356 (406)
T ss_pred eccCCCccEEEEecCCCCCccc--------------------ccc------ccc-ccccceecCcccceee
Confidence 9999999999999985544100 000 011 2678899999988877
No 119
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.29 E-value=2.6e-11 Score=149.45 Aligned_cols=155 Identities=17% Similarity=0.273 Sum_probs=132.4
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEE--ccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKI--KTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tkl--k~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
..-++++++++-++|.|.+ ..+|.+++.-.|... ..+ ...|+++|++++..-.++.+++++.||.++.||......
T Consensus 450 ~~~av~vs~CGNF~~IG~S-~G~Id~fNmQSGi~r-~sf~~~~ah~~~V~gla~D~~n~~~vsa~~~Gilkfw~f~~k~l 527 (910)
T KOG1539|consen 450 NATAVCVSFCGNFVFIGYS-KGTIDRFNMQSGIHR-KSFGDSPAHKGEVTGLAVDGTNRLLVSAGADGILKFWDFKKKVL 527 (910)
T ss_pred ceEEEEEeccCceEEEecc-CCeEEEEEcccCeee-cccccCccccCceeEEEecCCCceEEEccCcceEEEEecCCcce
Confidence 3457899999999999885 788888886554221 111 125999999999999999999999999999999999888
Q ss_pred EEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 86 HYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 86 l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+..++....+. ++.+|.....++.+.+|-.|+++|..+ +.++..+.+|+..|++++|||||++|++++.|++|
T Consensus 528 ~~~l~l~~~~~-----~iv~hr~s~l~a~~~ddf~I~vvD~~t--~kvvR~f~gh~nritd~~FS~DgrWlisasmD~tI 600 (910)
T KOG1539|consen 528 KKSLRLGSSIT-----GIVYHRVSDLLAIALDDFSIRVVDVVT--RKVVREFWGHGNRITDMTFSPDGRWLISASMDSTI 600 (910)
T ss_pred eeeeccCCCcc-----eeeeeehhhhhhhhcCceeEEEEEchh--hhhhHHhhccccceeeeEeCCCCcEEEEeecCCcE
Confidence 88888667777 889999999999999999999999995 55555667999999999999999999999999999
Q ss_pred eEEEee
Q 000425 166 QVWKTR 171 (1525)
Q Consensus 166 kVWDlr 171 (1525)
++||+-
T Consensus 601 r~wDlp 606 (910)
T KOG1539|consen 601 RTWDLP 606 (910)
T ss_pred EEEecc
Confidence 999985
No 120
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=99.28 E-value=2.9e-10 Score=124.95 Aligned_cols=154 Identities=19% Similarity=0.284 Sum_probs=113.3
Q ss_pred CcEEEEEEcCCCCEEEEEEecc--ce-------EEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEE--ECCCcE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRR--MS-------VTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVA--YADGLI 75 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d--~s-------I~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSG--s~DgtI 75 (1525)
-.++.+.+||.+..|++-.+.+ .+ ..++...+...+...+.....++|.+++|+|++..|++. ..+..|
T Consensus 6 ~~~~~~~W~~~G~~l~~~~~~~~~~~~ks~~~~~~l~~~~~~~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~~~v 85 (194)
T PF08662_consen 6 VDDAKLHWQPSGDYLLVKVQTRVDKSGKSYYGEFELFYLNEKNIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSMPAKV 85 (194)
T ss_pred cceEEEEecccCCEEEEEEEEeeccCcceEEeeEEEEEEecCCCccceeeccCCCceEEEEECcCCCEEEEEEccCCccc
Confidence 3578899999999998887732 11 223333334445555555456789999999999987654 467899
Q ss_pred EEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC---CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC
Q 000425 76 RAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR---RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM 152 (1525)
Q Consensus 76 rLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~---DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd 152 (1525)
.+||++ .+.+.++ +...++ .+.|+|+|+++++++. .|.|.+||.++ ...+.. .+...++.++|+||
T Consensus 86 ~lyd~~-~~~i~~~-~~~~~n-----~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~--~~~i~~--~~~~~~t~~~WsPd 154 (194)
T PF08662_consen 86 TLYDVK-GKKIFSF-GTQPRN-----TISWSPDGRFLVLAGFGNLNGDLEFWDVRK--KKKIST--FEHSDATDVEWSPD 154 (194)
T ss_pred EEEcCc-ccEeEee-cCCCce-----EEEECCCCCEEEEEEccCCCcEEEEEECCC--CEEeec--cccCcEEEEEEcCC
Confidence 999997 5555555 456778 9999999999999874 46799999994 333322 22345789999999
Q ss_pred CCEEEEEEC------CCCeeEEEee
Q 000425 153 LRLLVTLCR------DGSLQVWKTR 171 (1525)
Q Consensus 153 g~~LaSgS~------DgtIkVWDlr 171 (1525)
|++|+|++. |..++||+..
T Consensus 155 Gr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 155 GRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred CCEEEEEEeccceeccccEEEEEec
Confidence 999999864 7899999975
No 121
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.27 E-value=5.2e-11 Score=134.58 Aligned_cols=149 Identities=17% Similarity=0.265 Sum_probs=120.5
Q ss_pred CCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC--EEEEEECCCcEEEEEcCCCcEEEEEe-cCC
Q 000425 17 LQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP--VLYVAYADGLIRAYNIHTYAVHYTLQ-LDN 93 (1525)
Q Consensus 17 ~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~--~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~ 93 (1525)
++|++..|+ .|.+|.+++.. ..+..-.+.. |.+.|+++.|.+.-. +|++|+.||.|.+||...+.++.+++ |..
T Consensus 52 s~~~~aSGs-sDetI~IYDm~-k~~qlg~ll~-HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~ 128 (362)
T KOG0294|consen 52 SGPYVASGS-SDETIHIYDMR-KRKQLGILLS-HAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKG 128 (362)
T ss_pred cceeEeccC-CCCcEEEEecc-chhhhcceec-cccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeeccccc
Confidence 578887666 58899999853 3333334444 999999999998765 89999999999999999999999999 999
Q ss_pred ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCc----------e----------e--------------------
Q 000425 94 TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERP----------S----------M-------------------- 133 (1525)
Q Consensus 94 ~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~----------s----------~-------------------- 133 (1525)
.|+ .++.||.++.-++.+.|+.++.||+-+.+. . +
T Consensus 129 ~Vt-----~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~~at~v~w~~~Gd~F~v~~~~~i~i~q~d~A~v~~ 203 (362)
T KOG0294|consen 129 QVT-----DLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVLNLKNKATLVSWSPQGDHFVVSGRNKIDIYQLDNASVFR 203 (362)
T ss_pred ccc-----eeEecCCCceEEEEcCCceeeeehhhcCccceeeccCCcceeeEEcCCCCEEEEEeccEEEEEecccHhHhh
Confidence 999 999999999999999999999999743200 0 0
Q ss_pred --------------------------------------eEeecccCcCeEEEEE--cCCCCEEEEEECCCCeeEEEeeee
Q 000425 134 --------------------------------------IGIIQVGSQPITSVAW--LPMLRLLVTLCRDGSLQVWKTRVI 173 (1525)
Q Consensus 134 --------------------------------------i~il~gH~~~VtsVaf--sPdg~~LaSgS~DgtIkVWDlr~~ 173 (1525)
...+.+|..+|.++.+ .|++.||+|+|.||.|+|||++..
T Consensus 204 ~i~~~~r~l~~~~l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~~~ 283 (362)
T KOG0294|consen 204 EIENPKRILCATFLDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHEYLVTASSDGFIKVWDIDME 283 (362)
T ss_pred hhhccccceeeeecCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecCCceEEEEeccCceEEEEEcccc
Confidence 0123468888888874 367889999999999999999854
No 122
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.26 E-value=2.6e-11 Score=138.65 Aligned_cols=222 Identities=14% Similarity=0.195 Sum_probs=130.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC----
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT---- 82 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t---- 82 (1525)
|+.-+||=||.++-.+++...|+.|.+|+. +...+...++. |.+.|++|+++- ..++++++|.+||.|-+..
T Consensus 67 dGV~~lakhp~~ls~~aSGs~DG~VkiWnl-sqR~~~~~f~A-H~G~V~Gi~v~~--~~~~tvgdDKtvK~wk~~~~p~~ 142 (433)
T KOG0268|consen 67 DGVSCLAKHPNKLSTVASGSCDGEVKIWNL-SQRECIRTFKA-HEGLVRGICVTQ--TSFFTVGDDKTVKQWKIDGPPLH 142 (433)
T ss_pred cccchhhcCcchhhhhhccccCceEEEEeh-hhhhhhheeec-ccCceeeEEecc--cceEEecCCcceeeeeccCCcce
Confidence 678899999999877778888999999995 33344445555 999999999986 6788888999999886321
Q ss_pred ----------------------------------CcEEEEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeC
Q 000425 83 ----------------------------------YAVHYTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 83 ----------------------------------~k~l~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl 126 (1525)
..++.++. ..+.+. ++.|+|... .|++|+.|++|.+||+
T Consensus 143 tilg~s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~-----svkfNpvETsILas~~sDrsIvLyD~ 217 (433)
T KOG0268|consen 143 TILGKSVYLGIDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSIS-----SVKFNPVETSILASCASDRSIVLYDL 217 (433)
T ss_pred eeeccccccccccccccccccccCceeeecccccCCccceeecCCCcee-----EEecCCCcchheeeeccCCceEEEec
Confidence 12333333 334455 889999665 5667779999999999
Q ss_pred CCC--CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCC---CC---CCccccccCCCccccccc-
Q 000425 127 SIE--RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPN---RP---PMQANFFEPASIESIDIP- 197 (1525)
Q Consensus 127 ~t~--~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~---~~---~~~a~~f~p~gVesldi~- 197 (1525)
++. .+.++ + ...-+.|+|+|.+--|++|+.|..+..+|+|....|- .. .....-|.|.|.+.+.-.
T Consensus 218 R~~~Pl~KVi--~---~mRTN~IswnPeafnF~~a~ED~nlY~~DmR~l~~p~~v~~dhvsAV~dVdfsptG~Efvsgsy 292 (433)
T KOG0268|consen 218 RQASPLKKVI--L---TMRTNTICWNPEAFNFVAANEDHNLYTYDMRNLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSY 292 (433)
T ss_pred ccCCccceee--e---eccccceecCccccceeeccccccceehhhhhhcccchhhcccceeEEEeccCCCcchhccccc
Confidence 863 22222 0 1122334444444444444444444444444222110 00 011111233222211100
Q ss_pred ----ccccccCCc--ceec---CCCeeEEEEecCCCeEEEEecccccCCccccccccccc
Q 000425 198 ----RILSQQGGE--AVYP---LPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTRE 248 (1525)
Q Consensus 198 ----~iLs~~gG~--~vy~---l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~e 248 (1525)
++....+|. -+|- +..|.+|.|.-+.-.+. +|+||..++.|+...
T Consensus 293 DksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~------SGSdd~nvRlWka~A 346 (433)
T KOG0268|consen 293 DKSIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMDSKYII------SGSDDGNVRLWKAKA 346 (433)
T ss_pred cceEEEeecCCCcchhhhhHhhhheeeEEEEeccccEEE------ecCCCcceeeeecch
Confidence 000000000 0121 34577888887777666 999999999998554
No 123
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=99.25 E-value=1.3e-09 Score=129.19 Aligned_cols=132 Identities=13% Similarity=0.215 Sum_probs=96.1
Q ss_pred ccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEE
Q 000425 27 RRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAF 105 (1525)
Q Consensus 27 ~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaF 105 (1525)
+|..+.+.. ..++....+.. |.+.|.+-.|+|+|.-|++++.||.||+|+- ++-.-.++. ...+|. |++|
T Consensus 83 ~DGkf~il~--k~~rVE~sv~A-H~~A~~~gRW~~dGtgLlt~GEDG~iKiWSr-sGMLRStl~Q~~~~v~-----c~~W 153 (737)
T KOG1524|consen 83 NDGRFVILN--KSARVERSISA-HAAAISSGRWSPDGAGLLTAGEDGVIKIWSR-SGMLRSTVVQNEESIR-----CARW 153 (737)
T ss_pred CCceEEEec--ccchhhhhhhh-hhhhhhhcccCCCCceeeeecCCceEEEEec-cchHHHHHhhcCceeE-----EEEE
Confidence 345444433 22333444444 9999999999999999999999999999974 343333444 667899 9999
Q ss_pred eCCCCEEE-EEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 106 HPTLEWLF-VGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 106 SPdg~~L~-SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
-|+.+.++ +.+. .+.|=.+.-. ...+ .+.+|.+-|.|++|+|...+++||+.|-..+|||..
T Consensus 154 ~p~S~~vl~c~g~--h~~IKpL~~n-~k~i-~WkAHDGiiL~~~W~~~s~lI~sgGED~kfKvWD~~ 216 (737)
T KOG1524|consen 154 APNSNSIVFCQGG--HISIKPLAAN-SKII-RWRAHDGLVLSLSWSTQSNIIASGGEDFRFKIWDAQ 216 (737)
T ss_pred CCCCCceEEecCC--eEEEeecccc-ccee-EEeccCcEEEEeecCccccceeecCCceeEEeeccc
Confidence 99887555 4443 3444334321 2222 367999999999999999999999999999999974
No 124
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=99.24 E-value=4.9e-11 Score=142.67 Aligned_cols=118 Identities=20% Similarity=0.366 Sum_probs=104.6
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC--------CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH--------TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG 119 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~--------t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg 119 (1525)
.|.+-|+.++|||..+.|++++.|+++++|+++ ..+++.+|. |..+|. ||++.+.+.++++|+-||
T Consensus 292 s~~d~ir~l~~~~sep~lit~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl-----~v~v~~n~~~~ysgg~Dg 366 (577)
T KOG0642|consen 292 SHDDCIRALAFHPSEPVLITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVL-----CVVVPSNGEHCYSGGIDG 366 (577)
T ss_pred cchhhhhhhhcCCCCCeEEEeccccchhhhhhcccCCccccceeeeEEEecccCceE-----EEEecCCceEEEeeccCc
Confidence 388899999999999999999999999999983 236889999 999999 999999999999999999
Q ss_pred cEEEEeCCCC--------CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 120 TLLAWDVSIE--------RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 120 tVrVWDl~t~--------~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+|+.|++-.. .....+.++||+..|+.+++|+....|++||.|||++.|+..
T Consensus 367 ~I~~w~~p~n~dp~ds~dp~vl~~~l~Ghtdavw~l~~s~~~~~Llscs~DgTvr~w~~~ 426 (577)
T KOG0642|consen 367 TIRCWNLPPNQDPDDSYDPSVLSGTLLGHTDAVWLLALSSTKDRLLSCSSDGTVRLWEPT 426 (577)
T ss_pred eeeeeccCCCCCcccccCcchhccceeccccceeeeeecccccceeeecCCceEEeeccC
Confidence 9999965421 123346788999999999999999899999999999999976
No 125
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.24 E-value=4.4e-10 Score=127.03 Aligned_cols=161 Identities=14% Similarity=0.205 Sum_probs=126.7
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecC---CccceE-------EccCCCCCeEEEEEcCCCCEEEEEECCC
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG---GRAPTK-------IKTDLKKPIVNLACHPRLPVLYVAYADG 73 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg---gk~~tk-------lk~~H~~~V~sLaf~Pdg~~LaSGs~Dg 73 (1525)
+..|.--.++|+|+-.-+.++.++..++.++..... +-+..- +...|+-.|-++.....+.++++++.|.
T Consensus 130 ve~dhpT~V~FapDc~s~vv~~~~g~~l~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~~k~imsas~dt 209 (420)
T KOG2096|consen 130 VEYDHPTRVVFAPDCKSVVVSVKRGNKLCVYKLVKKTDGSGSHHFVHIDNLEFERKHQVDIINIGIAGNAKYIMSASLDT 209 (420)
T ss_pred ccCCCceEEEECCCcceEEEEEccCCEEEEEEeeecccCCCCcccccccccccchhcccceEEEeecCCceEEEEecCCC
Confidence 345566678999998888888888888888876432 222221 2233666788888888889999999999
Q ss_pred cEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC---C-----CceeeEeecccCcCe
Q 000425 74 LIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI---E-----RPSMIGIIQVGSQPI 144 (1525)
Q Consensus 74 tIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t---~-----~~s~i~il~gH~~~V 144 (1525)
+|.+||++ ++.+.++. ....-. ..+.||+|+++++++..--|+||..-- . .+.+ .+.||.+.|
T Consensus 210 ~i~lw~lk-Gq~L~~idtnq~~n~-----~aavSP~GRFia~~gFTpDVkVwE~~f~kdG~fqev~rvf--~LkGH~saV 281 (420)
T KOG2096|consen 210 KICLWDLK-GQLLQSIDTNQSSNY-----DAAVSPDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVKRVF--SLKGHQSAV 281 (420)
T ss_pred cEEEEecC-Cceeeeecccccccc-----ceeeCCCCcEEEEecCCCCceEEEEEeccCcchhhhhhhh--eeccchhhe
Confidence 99999999 88888887 444445 678899999999999999999998541 1 1122 367999999
Q ss_pred EEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 145 TSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 145 tsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+.++|+++...++|.|.||++||||+++
T Consensus 282 ~~~aFsn~S~r~vtvSkDG~wriwdtdV 309 (420)
T KOG2096|consen 282 LAAAFSNSSTRAVTVSKDGKWRIWDTDV 309 (420)
T ss_pred eeeeeCCCcceeEEEecCCcEEEeeccc
Confidence 9999999999999999999999999883
No 126
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.22 E-value=4.2e-11 Score=138.12 Aligned_cols=117 Identities=19% Similarity=0.281 Sum_probs=101.2
Q ss_pred cCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCC-------cEEEEEe-cCCceeecCceeEEEeCCC-CEEEEEeC
Q 000425 48 TDLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTY-------AVHYTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDR 117 (1525)
Q Consensus 48 ~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~-------k~l~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~ 117 (1525)
.+|+.+|..++|+|-. ..+||||.|.+|+||.+... +++..|. |...|. .|.|||.. +.|+|++.
T Consensus 78 ~GHt~~vLDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg-----~V~wHPtA~NVLlsag~ 152 (472)
T KOG0303|consen 78 CGHTAPVLDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVG-----LVQWHPTAPNVLLSAGS 152 (472)
T ss_pred cCccccccccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEE-----EEeecccchhhHhhccC
Confidence 3599999999999965 47999999999999997643 5567778 999999 99999976 57889999
Q ss_pred CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 118 RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 118 DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
|.+|.+||+.+. ..++. + .|...|.+++|+-||.+|+|.+.|+.|||||.|.
T Consensus 153 Dn~v~iWnv~tg-eali~-l-~hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~ 204 (472)
T KOG0303|consen 153 DNTVSIWNVGTG-EALIT-L-DHPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRR 204 (472)
T ss_pred CceEEEEeccCC-ceeee-c-CCCCeEEEEEeccCCceeeeecccceeEEEcCCC
Confidence 999999999975 33332 3 3999999999999999999999999999999883
No 127
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.21 E-value=3.6e-11 Score=143.87 Aligned_cols=155 Identities=18% Similarity=0.294 Sum_probs=127.3
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC-C
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT-Y 83 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t-~ 83 (1525)
+......+.|||..|++|++. ..+|++++.... ..+.++.. -...|.++++||.|..|+.|+.|+.+..+|+.- .
T Consensus 565 skG~vq~v~FHPs~p~lfVaT--q~~vRiYdL~kq-elvKkL~t-g~kwiS~msihp~GDnli~gs~d~k~~WfDldlss 640 (733)
T KOG0650|consen 565 SKGLVQRVKFHPSKPYLFVAT--QRSVRIYDLSKQ-ELVKKLLT-GSKWISSMSIHPNGDNLILGSYDKKMCWFDLDLSS 640 (733)
T ss_pred cCCceeEEEecCCCceEEEEe--ccceEEEehhHH-HHHHHHhc-CCeeeeeeeecCCCCeEEEecCCCeeEEEEcccCc
Confidence 456788999999999999987 567888887542 23333333 467799999999999999999999999999874 4
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-----Ccee--eEeecccCcC----eEEEEEcC
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-----RPSM--IGIIQVGSQP----ITSVAWLP 151 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-----~~s~--i~il~gH~~~----VtsVafsP 151 (1525)
++.+++. |.+.++ +|+||+.-+.+++|++|+++.|+--.-+ ++-+ +..+.+|... |..+.|||
T Consensus 641 kPyk~lr~H~~avr-----~Va~H~ryPLfas~sdDgtv~Vfhg~VY~Dl~qnpliVPlK~L~gH~~~~~~gVLd~~wHP 715 (733)
T KOG0650|consen 641 KPYKTLRLHEKAVR-----SVAFHKRYPLFASGSDDGTVIVFHGMVYNDLLQNPLIVPLKRLRGHEKTNDLGVLDTIWHP 715 (733)
T ss_pred chhHHhhhhhhhhh-----hhhhccccceeeeecCCCcEEEEeeeeehhhhcCCceEeeeeccCceeecccceEeecccC
Confidence 6788888 999999 9999999999999999999999854322 1111 1456688766 88999999
Q ss_pred CCCEEEEEECCCCeeEE
Q 000425 152 MLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 152 dg~~LaSgS~DgtIkVW 168 (1525)
..++|+|++.|++|++|
T Consensus 716 ~qpWLfsAGAd~tirlf 732 (733)
T KOG0650|consen 716 RQPWLFSAGADGTIRLF 732 (733)
T ss_pred CCceEEecCCCceEEee
Confidence 99999999999999999
No 128
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.20 E-value=1e-09 Score=119.15 Aligned_cols=160 Identities=18% Similarity=0.144 Sum_probs=119.4
Q ss_pred cEEEEEEcCC---CCEEEEEEeccceEEEEEeecC-CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC
Q 000425 8 TEVHLALTPL---QPVVFFGFHRRMSVTVVGTVEG-GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 8 ~e~~LA~hP~---qp~Vf~s~~~d~sI~vv~tieg-gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~ 83 (1525)
+..-+||-.. +..++.+. .-..++++-+.-+ +.....+.+ |++.|.++ ++=++-+|++|+.|.+||.||++-.
T Consensus 138 tirdl~fld~~~s~~~il~s~-gagdc~iy~tdc~~g~~~~a~sg-htghilal-yswn~~m~~sgsqdktirfwdlrv~ 214 (350)
T KOG0641|consen 138 TIRDLAFLDDPESGGAILASA-GAGDCKIYITDCGRGQGFHALSG-HTGHILAL-YSWNGAMFASGSQDKTIRFWDLRVN 214 (350)
T ss_pred ceeeeEEecCCCcCceEEEec-CCCcceEEEeecCCCCcceeecC-CcccEEEE-EEecCcEEEccCCCceEEEEeeecc
Confidence 4455666422 33333333 3445555544333 334444444 99999886 2334789999999999999999999
Q ss_pred cEEEEEe---cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 84 AVHYTLQ---LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 84 k~l~tL~---H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
.++.++. |...+.-+...+|+..|.|+.|++|-.|.++.+||++. ..+++.+..|+..|+|+.|+|...|++|||
T Consensus 215 ~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~dssc~lydirg--~r~iq~f~phsadir~vrfsp~a~yllt~s 292 (350)
T KOG0641|consen 215 SCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLYDIRG--GRMIQRFHPHSADIRCVRFSPGAHYLLTCS 292 (350)
T ss_pred ceeeeccCcccCCCcccceeEEEEECCCcceeeeccCCCceEEEEeeC--CceeeeeCCCccceeEEEeCCCceEEEEec
Confidence 9998887 54432212223999999999999999999999999993 566666778999999999999999999999
Q ss_pred CCCCeeEEEeee
Q 000425 161 RDGSLQVWKTRV 172 (1525)
Q Consensus 161 ~DgtIkVWDlr~ 172 (1525)
.|..|++=|+..
T Consensus 293 yd~~ikltdlqg 304 (350)
T KOG0641|consen 293 YDMKIKLTDLQG 304 (350)
T ss_pred ccceEEEeeccc
Confidence 999999999873
No 129
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.20 E-value=6.6e-10 Score=138.75 Aligned_cols=153 Identities=12% Similarity=0.167 Sum_probs=126.5
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+---|++.|.+-+|+++. .+..|+.|......-.+..+.. |...|.+++.. +.+|++|+.+++|.+|.+.+++.-.
T Consensus 15 G~t~i~~d~~gefi~tcg-sdg~ir~~~~~sd~e~P~ti~~-~g~~v~~ia~~--s~~f~~~s~~~tv~~y~fps~~~~~ 90 (933)
T KOG1274|consen 15 GLTLICYDPDGEFICTCG-SDGDIRKWKTNSDEEEPETIDI-SGELVSSIACY--SNHFLTGSEQNTVLRYKFPSGEEDT 90 (933)
T ss_pred ceEEEEEcCCCCEEEEec-CCCceEEeecCCcccCCchhhc-cCceeEEEeec--ccceEEeeccceEEEeeCCCCCccc
Confidence 445689999999887666 5888999887555233333333 78899999876 4599999999999999999887766
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
.+. ..-+++ +++|+.+|++++.||+|-.|++-+........ .+.+|..+|.++.|+|++.+|++.+-||+|+
T Consensus 91 iL~Rftlp~r-----~~~v~g~g~~iaagsdD~~vK~~~~~D~s~~~--~lrgh~apVl~l~~~p~~~fLAvss~dG~v~ 163 (933)
T KOG1274|consen 91 ILARFTLPIR-----DLAVSGSGKMIAAGSDDTAVKLLNLDDSSQEK--VLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQ 163 (933)
T ss_pred eeeeeeccce-----EEEEecCCcEEEeecCceeEEEEeccccchhe--eecccCCceeeeeEcCCCCEEEEEecCceEE
Confidence 666 677888 99999999999999999999999988642222 3669999999999999999999999999999
Q ss_pred EEEee
Q 000425 167 VWKTR 171 (1525)
Q Consensus 167 VWDlr 171 (1525)
+||+.
T Consensus 164 iw~~~ 168 (933)
T KOG1274|consen 164 IWDLQ 168 (933)
T ss_pred EEEcc
Confidence 99997
No 130
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.18 E-value=9.1e-10 Score=124.35 Aligned_cols=204 Identities=14% Similarity=0.143 Sum_probs=139.3
Q ss_pred EccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC----CcEEEEEe-cCCceeecCceeEEE-eC-CCCEEEEEeCC
Q 000425 46 IKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT----YAVHYTLQ-LDNTIKLLGAGAFAF-HP-TLEWLFVGDRR 118 (1525)
Q Consensus 46 lk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t----~k~l~tL~-H~~~V~~~ga~sVaF-SP-dg~~L~SgS~D 118 (1525)
+..+|++-|.++.|++.|+.+|+|+.|++|+|||.++ ..+....+ |..+|. .|.| || -|+.++++|.|
T Consensus 8 i~s~h~DlihdVs~D~~GRRmAtCSsDq~vkI~d~~~~s~~W~~Ts~Wrah~~Si~-----rV~WAhPEfGqvvA~cS~D 82 (361)
T KOG2445|consen 8 IDSGHKDLIHDVSFDFYGRRMATCSSDQTVKIWDSTSDSGTWSCTSSWRAHDGSIW-----RVVWAHPEFGQVVATCSYD 82 (361)
T ss_pred cccCCcceeeeeeecccCceeeeccCCCcEEEEeccCCCCceEEeeeEEecCCcEE-----EEEecCccccceEEEEecC
Confidence 3446999999999999999999999999999999543 35666667 999999 9999 55 48999999999
Q ss_pred CcEEEEeCCC--C---Cceee--EeecccCcCeEEEEEcCC--CCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCC
Q 000425 119 GTLLAWDVSI--E---RPSMI--GIIQVGSQPITSVAWLPM--LRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPA 189 (1525)
Q Consensus 119 gtVrVWDl~t--~---~~s~i--~il~gH~~~VtsVafsPd--g~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~ 189 (1525)
++++||.-.. . ..+.+ .++......|++|.|.|. |-.+++++.||++|||+.--..+-..-..+.++
T Consensus 83 rtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~dp~nLs~W~Lq~Ei---- 158 (361)
T KOG2445|consen 83 RTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASADGILRIYEAPDPMNLSQWTLQHEI---- 158 (361)
T ss_pred CceeeeeecccccccccceeEEEEEeecCCcceeEEEecchhcceEEEEeccCcEEEEEecCCccccccchhhhhh----
Confidence 9999997532 1 11111 234456778999999995 668999999999999987533322111111111
Q ss_pred CcccccccccccccCCcceecCCCeeEEEEecC---CCeEEEEecccccCCccccccccccc-C-ceeeeeeeccccCCc
Q 000425 190 SIESIDIPRILSQQGGEAVYPLPRVRALEVHPR---LNLAVLLFANFTGGDNIKNRAAYTRE-G-RKQLFAVLQSARGSS 264 (1525)
Q Consensus 190 gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~---~Nlial~F~~~s~~Dd~~~k~~~~~e-g-~~q~~~tlq~~~~~~ 264 (1525)
.. ... .-|.... +--++.|.|+ ..++|+.+.. .+..-.+.+++...+ + +++-+++|.+|.+.+
T Consensus 159 --~~-----~~~-pp~~~~~---~~~CvsWn~sr~~~p~iAvgs~e-~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dpI 226 (361)
T KOG2445|consen 159 --QN-----VID-PPGKNKQ---PCFCVSWNPSRMHEPLIAVGSDE-DAPHLNKVKIYEYNENGRKWLKVAELPDHTDPI 226 (361)
T ss_pred --hh-----ccC-CcccccC---cceEEeeccccccCceEEEEccc-CCccccceEEEEecCCcceeeeehhcCCCCCcc
Confidence 00 000 1111111 2345666655 4778877766 444445566665544 4 677788899999998
Q ss_pred ccchhh
Q 000425 265 ASVLKE 270 (1525)
Q Consensus 265 ~s~~~e 270 (1525)
+.+..+
T Consensus 227 ~di~wA 232 (361)
T KOG2445|consen 227 RDISWA 232 (361)
T ss_pred eeeeec
Confidence 775433
No 131
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.18 E-value=3.4e-10 Score=125.53 Aligned_cols=154 Identities=10% Similarity=0.172 Sum_probs=124.1
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
..++.++++|++.++.++..+ ..+..++..+. +.. ...+.+..+.-++|+-++.+|+....-|+|.|..+...+++
T Consensus 107 ~eni~i~wsp~g~~~~~~~kd-D~it~id~r~~-~~~--~~~~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsypsLkpv 182 (313)
T KOG1407|consen 107 GENINITWSPDGEYIAVGNKD-DRITFIDARTY-KIV--NEEQFKFEVNEISWNNSNDLFFLTNGLGCVEILSYPSLKPV 182 (313)
T ss_pred CcceEEEEcCCCCEEEEecCc-ccEEEEEeccc-cee--ehhcccceeeeeeecCCCCEEEEecCCceEEEEeccccccc
Confidence 467899999999999888754 44555554322 111 11224566888999988888887777899999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
.+++ |..... ||.|+|+|+|+++|+.|..|.+||++.. -+...+..|..+|+.++|+.+|++|+|+|.|..|
T Consensus 183 ~si~AH~snCi-----cI~f~p~GryfA~GsADAlvSLWD~~EL--iC~R~isRldwpVRTlSFS~dg~~lASaSEDh~I 255 (313)
T KOG1407|consen 183 QSIKAHPSNCI-----CIEFDPDGRYFATGSADALVSLWDVDEL--ICERCISRLDWPVRTLSFSHDGRMLASASEDHFI 255 (313)
T ss_pred cccccCCcceE-----EEEECCCCceEeeccccceeeccChhHh--hhheeeccccCceEEEEeccCcceeeccCccceE
Confidence 9999 999999 9999999999999999999999999943 3333455899999999999999999999999988
Q ss_pred eEEEee
Q 000425 166 QVWKTR 171 (1525)
Q Consensus 166 kVWDlr 171 (1525)
-|=++.
T Consensus 256 DIA~ve 261 (313)
T KOG1407|consen 256 DIAEVE 261 (313)
T ss_pred EeEecc
Confidence 655443
No 132
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.17 E-value=2.7e-10 Score=134.09 Aligned_cols=117 Identities=17% Similarity=0.225 Sum_probs=102.9
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
..++|.+++-+|.|.+++.|.-.|.|.+|.+.++..+..+. |-..|+ |+.|+-||.+|+|||.||.|.+|++.+
T Consensus 80 ~Pg~v~al~s~n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~IT-----cL~fs~dgs~iiTgskDg~V~vW~l~~ 154 (476)
T KOG0646|consen 80 LPGPVHALASSNLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQSIT-----CLKFSDDGSHIITGSKDGAVLVWLLTD 154 (476)
T ss_pred cccceeeeecCCCceEEEeecccCcEEEEEeccccHHHHHHhhcccee-----EEEEeCCCcEEEecCCCccEEEEEEEe
Confidence 46789999999999999988899999999999999999999 999999 999999999999999999999998764
Q ss_pred C-------CceeeEeecccCcCeEEEEEcCC--CCEEEEEECCCCeeEEEee
Q 000425 129 E-------RPSMIGIIQVGSQPITSVAWLPM--LRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 129 ~-------~~s~i~il~gH~~~VtsVafsPd--g~~LaSgS~DgtIkVWDlr 171 (1525)
. .......+..|+-.|+++...+. ..+++|+|.|.+||+||+.
T Consensus 155 lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS 206 (476)
T KOG0646|consen 155 LVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLS 206 (476)
T ss_pred ecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEec
Confidence 2 12223457899999999877664 4689999999999999997
No 133
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.17 E-value=5.8e-10 Score=124.99 Aligned_cols=154 Identities=18% Similarity=0.325 Sum_probs=125.3
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC---CCEEEEEECCCcEEEEEcCCCcEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR---LPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd---g~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
..+-+-|-..-+|.+++=|.+++||++.. ......++ .++.|.+-+++|- ..++|+|..|-.|++.|+.++.+.
T Consensus 105 ss~~WyP~DtGmFtssSFDhtlKVWDtnT-lQ~a~~F~--me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~s 181 (397)
T KOG4283|consen 105 SSAIWYPIDTGMFTSSSFDHTLKVWDTNT-LQEAVDFK--MEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFS 181 (397)
T ss_pred eeeEEeeecCceeecccccceEEEeeccc-ceeeEEee--cCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcce
Confidence 44566788888888888899999999743 22222232 5788999999985 357899999999999999999999
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEE-EEeCCCcEEEEeCCCCC-------------ceeeEeecccCcCeEEEEEcC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLF-VGDRRGTLLAWDVSIER-------------PSMIGIIQVGSQPITSVAWLP 151 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~-SgS~DgtVrVWDl~t~~-------------~s~i~il~gH~~~VtsVafsP 151 (1525)
+++. |...|. +|.|+|...|++ +||.||.||+||++.-. ......-.+|.+.|++++|+.
T Consensus 182 H~LsGHr~~vl-----aV~Wsp~~e~vLatgsaDg~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tS 256 (397)
T KOG4283|consen 182 HTLSGHRDGVL-----AVEWSPSSEWVLATGSADGAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTS 256 (397)
T ss_pred eeeccccCceE-----EEEeccCceeEEEecCCCceEEEEEeecccceeEEeecccCccCccccccccccceeeeeeecc
Confidence 9999 999999 999999999755 78899999999998520 011112347889999999999
Q ss_pred CCCEEEEEECCCCeeEEEee
Q 000425 152 MLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 152 dg~~LaSgS~DgtIkVWDlr 171 (1525)
++.++++++.|..+++|+..
T Consensus 257 d~~~l~~~gtd~r~r~wn~~ 276 (397)
T KOG4283|consen 257 DARYLASCGTDDRIRVWNME 276 (397)
T ss_pred cchhhhhccCccceEEeecc
Confidence 99999999999999999976
No 134
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.17 E-value=9.1e-10 Score=124.49 Aligned_cols=124 Identities=14% Similarity=0.233 Sum_probs=90.0
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE--EEEEecCCceeecCceeEEEeCCCCEEEE-EeCCCcE
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV--HYTLQLDNTIKLLGAGAFAFHPTLEWLFV-GDRRGTL 121 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~--l~tL~H~~~V~~~ga~sVaFSPdg~~L~S-gS~DgtV 121 (1525)
.+++ |++.|++++|+.+|+.|++++.|++||+||++.+.. .+.++ ..|..-....|+|.||.+.++. .-...++
T Consensus 81 ~LKg-H~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~R--~nve~dhpT~V~FapDc~s~vv~~~~g~~l 157 (420)
T KOG2096|consen 81 VLKG-HKKEVTDVAFSSDGKKLATISGDRSIRLWDVRDFENKEHRCIR--QNVEYDHPTRVVFAPDCKSVVVSVKRGNKL 157 (420)
T ss_pred hhhc-cCCceeeeEEcCCCceeEEEeCCceEEEEecchhhhhhhhHhh--ccccCCCceEEEECCCcceEEEEEccCCEE
Confidence 4555 999999999999999999999999999999987422 11221 1111112338999999876554 4477889
Q ss_pred EEEeCCCC-----Cceee-----EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 122 LAWDVSIE-----RPSMI-----GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 122 rVWDl~t~-----~~s~i-----~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+||-+... ....+ .+...|...|-.+-.-..+.+|.|||.|.+|.+||++
T Consensus 158 ~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~~k~imsas~dt~i~lw~lk 217 (420)
T KOG2096|consen 158 CVYKLVKKTDGSGSHHFVHIDNLEFERKHQVDIINIGIAGNAKYIMSASLDTKICLWDLK 217 (420)
T ss_pred EEEEeeecccCCCCcccccccccccchhcccceEEEeecCCceEEEEecCCCcEEEEecC
Confidence 99976531 11111 1234566777778788889999999999999999998
No 135
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.16 E-value=2.7e-10 Score=132.27 Aligned_cols=210 Identities=12% Similarity=0.093 Sum_probs=155.6
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
..-.++|.+.+-.++.+.+ +..+.+|+. ...+....+.+ |++.|+++.|.-....+++|+.|.+||+||++...|.+
T Consensus 221 ~it~~d~d~~~~~~iAas~-d~~~r~Wnv-d~~r~~~TLsG-HtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~k 297 (459)
T KOG0288|consen 221 NITSIDFDSDNKHVIAASN-DKNLRLWNV-DSLRLRHTLSG-HTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSK 297 (459)
T ss_pred CcceeeecCCCceEEeecC-CCceeeeec-cchhhhhhhcc-cccceeeehhhccccceeeccccchhhhhhhhhhheec
Confidence 3557888888888877765 888999984 44555555666 99999999998776679999999999999999988888
Q ss_pred EEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 88 TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
++-....++ .|..+ ...++||-.|++||.||.++.. .......|. .|+++..++++..+.+++.|.++++
T Consensus 298 t~l~~S~cn-----DI~~~--~~~~~SgH~DkkvRfwD~Rs~~--~~~sv~~gg-~vtSl~ls~~g~~lLsssRDdtl~v 367 (459)
T KOG0288|consen 298 TVLPGSQCN-----DIVCS--ISDVISGHFDKKVRFWDIRSAD--KTRSVPLGG-RVTSLDLSMDGLELLSSSRDDTLKV 367 (459)
T ss_pred ccccccccc-----ceEec--ceeeeecccccceEEEeccCCc--eeeEeecCc-ceeeEeeccCCeEEeeecCCCceee
Confidence 877555556 56554 4557778899999999998642 222233544 8999999999999999999999999
Q ss_pred EEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccc
Q 000425 168 WKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 168 WDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
.|+|+.-- . . .++..+ ...-..++.+.|.|...+++ .|+-|..+++|...
T Consensus 368 iDlRt~eI------~-~--------------~~sA~g---~k~asDwtrvvfSpd~~Yva------AGS~dgsv~iW~v~ 417 (459)
T KOG0288|consen 368 IDLRTKEI------R-Q--------------TFSAEG---FKCASDWTRVVFSPDGSYVA------AGSADGSVYIWSVF 417 (459)
T ss_pred eecccccE------E-E--------------Eeeccc---cccccccceeEECCCCceee------eccCCCcEEEEEcc
Confidence 99984321 0 0 000000 00012367789999999998 89999999999999
Q ss_pred cCceeeeeeeccccC
Q 000425 248 EGRKQLFAVLQSARG 262 (1525)
Q Consensus 248 eg~~q~~~tlq~~~~ 262 (1525)
.|+... ++++-..
T Consensus 418 tgKlE~--~l~~s~s 430 (459)
T KOG0288|consen 418 TGKLEK--VLSLSTS 430 (459)
T ss_pred CceEEE--EeccCCC
Confidence 887555 3444333
No 136
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.16 E-value=1.2e-10 Score=141.62 Aligned_cols=120 Identities=21% Similarity=0.331 Sum_probs=105.5
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECC-----CcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYAD-----GLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR 118 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~D-----gtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D 118 (1525)
|+.+ |.-.|.+++.+|+++++||+|.. -.|++|+..+...++.+. |.-.|+ .++|||||++|++.|.|
T Consensus 520 KLYG-HGyEv~~l~~s~~gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~HsLTVT-----~l~FSpdg~~LLsvsRD 593 (764)
T KOG1063|consen 520 KLYG-HGYEVYALAISPTGNLIASACKSSLKEHAVIRLWNTANWLQVQELEGHSLTVT-----RLAFSPDGRYLLSVSRD 593 (764)
T ss_pred Hhcc-CceeEEEEEecCCCCEEeehhhhCCccceEEEEEeccchhhhheecccceEEE-----EEEECCCCcEEEEeecC
Confidence 4555 99999999999999999999975 458999999999999899 999999 99999999999999999
Q ss_pred CcEEEEeCCCC---CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 119 GTLLAWDVSIE---RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 119 gtVrVWDl~t~---~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+++.+|..... ..++ .....|+..|.+++|+|+..+|+|+|.|++|+||...
T Consensus 594 Rt~sl~~~~~~~~~e~~f-a~~k~HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~ 648 (764)
T KOG1063|consen 594 RTVSLYEVQEDIKDEFRF-ACLKAHTRIIWDCSWSPDEKYFATASRDKKVKVWEEP 648 (764)
T ss_pred ceEEeeeeecccchhhhh-ccccccceEEEEcccCcccceeEEecCCceEEEEecc
Confidence 99999987642 1121 2356899999999999999999999999999999864
No 137
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.15 E-value=4.4e-10 Score=130.62 Aligned_cols=140 Identities=15% Similarity=0.239 Sum_probs=114.8
Q ss_pred CEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-c----CC
Q 000425 19 PVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-L----DN 93 (1525)
Q Consensus 19 p~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H----~~ 93 (1525)
+..+++.|-+..+++|+. ....+...+.. .+.|+++..++++..+++++.|.++++.|+++.....++. . ..
T Consensus 312 ~~~~~SgH~DkkvRfwD~-Rs~~~~~sv~~--gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~as 388 (459)
T KOG0288|consen 312 ISDVISGHFDKKVRFWDI-RSADKTRSVPL--GGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCAS 388 (459)
T ss_pred ceeeeecccccceEEEec-cCCceeeEeec--CcceeeEeeccCCeEEeeecCCCceeeeecccccEEEEeecccccccc
Confidence 667889999999999983 44444444443 4599999999999999999999999999999999888887 1 22
Q ss_pred ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc-cCc-CeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 94 TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV-GSQ-PITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 94 ~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g-H~~-~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
..+ .+.|||++.|+++||.||.|+||++.+.....+ ++. |.. .|++++|+|.|.++++++.++.+.+|
T Consensus 389 Dwt-----rvvfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~--l~~s~s~~aI~s~~W~~sG~~Llsadk~~~v~lW 458 (459)
T KOG0288|consen 389 DWT-----RVVFSPDGSYVAAGSADGSVYIWSVFTGKLEKV--LSLSTSNAAITSLSWNPSGSGLLSADKQKAVTLW 458 (459)
T ss_pred ccc-----eeEECCCCceeeeccCCCcEEEEEccCceEEEE--eccCCCCcceEEEEEcCCCchhhcccCCcceEec
Confidence 355 899999999999999999999999997522222 322 233 69999999999999999999999999
No 138
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.15 E-value=6.7e-09 Score=117.03 Aligned_cols=153 Identities=13% Similarity=0.113 Sum_probs=110.9
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEE-EECCCcEEEEEcCCCcEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYV-AYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaS-Gs~DgtIrLWDl~t~k~l~ 87 (1525)
...++++|++..++++...+..+.+|+... +.....+.. |. .+..++|+|+++.+++ ++.|+.|++||+++.+.+.
T Consensus 33 ~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~-~~~~~~~~~-~~-~~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~ 109 (300)
T TIGR03866 33 PRGITLSKDGKLLYVCASDSDTIQVIDLAT-GEVIGTLPS-GP-DPELFALHPNGKILYIANEDDNLVTVIDIETRKVLA 109 (300)
T ss_pred CCceEECCCCCEEEEEECCCCeEEEEECCC-CcEEEeccC-CC-CccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEe
Confidence 356899999998877766678888888533 332223333 33 3567899999997655 4568999999999988887
Q ss_pred EEecCCceeecCceeEEEeCCCCEEEEEeCCC-cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEE-EEECCCCe
Q 000425 88 TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRG-TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLV-TLCRDGSL 165 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg-tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~La-SgS~DgtI 165 (1525)
.+.+...+. +++|+|++.++++++.++ .+.+||..+. ...... .+...+.+++|+|++.+|+ +++.|+.|
T Consensus 110 ~~~~~~~~~-----~~~~~~dg~~l~~~~~~~~~~~~~d~~~~--~~~~~~-~~~~~~~~~~~s~dg~~l~~~~~~~~~v 181 (300)
T TIGR03866 110 EIPVGVEPE-----GMAVSPDGKIVVNTSETTNMAHFIDTKTY--EIVDNV-LVDQRPRFAEFTADGKELWVSSEIGGTV 181 (300)
T ss_pred EeeCCCCcc-----eEEECCCCCEEEEEecCCCeEEEEeCCCC--eEEEEE-EcCCCccEEEECCCCCEEEEEcCCCCEE
Confidence 777444567 899999999999888764 5677898753 222111 1233457899999999875 55569999
Q ss_pred eEEEeee
Q 000425 166 QVWKTRV 172 (1525)
Q Consensus 166 kVWDlr~ 172 (1525)
++||++.
T Consensus 182 ~i~d~~~ 188 (300)
T TIGR03866 182 SVIDVAT 188 (300)
T ss_pred EEEEcCc
Confidence 9999873
No 139
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.15 E-value=1.3e-09 Score=123.24 Aligned_cols=163 Identities=14% Similarity=0.163 Sum_probs=122.8
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t 88 (1525)
--++.|++.+.++.+|+. +..+.+|+...- +.. .+.+.|..+|.+++|+++|+.|+++|.|-.|++||+..+.+++.
T Consensus 26 a~~~~Fs~~G~~lAvGc~-nG~vvI~D~~T~-~ia-r~lsaH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~l~r 102 (405)
T KOG1273|consen 26 AECCQFSRWGDYLAVGCA-NGRVVIYDFDTF-RIA-RMLSAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSPLKR 102 (405)
T ss_pred cceEEeccCcceeeeecc-CCcEEEEEcccc-chh-hhhhccccceeEEEecCCCCEeeeecCCceeEEEeccCCCceeE
Confidence 357889999999999985 788888886432 233 33445999999999999999999999999999999988877765
Q ss_pred EecCCcee----------e--------------------------------cCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 89 LQLDNTIK----------L--------------------------------LGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 89 L~H~~~V~----------~--------------------------------~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
+....+|. . +.+.+..|++.|++|++|..-|.+.|+|.
T Consensus 103 irf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGtsKGkllv~~a 182 (405)
T KOG1273|consen 103 IRFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTSKGKLLVYDA 182 (405)
T ss_pred EEccCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCCCEEEEecCcceEEEEec
Confidence 54111111 0 00122347778999999999999999999
Q ss_pred CCCCceeeEeeccc-CcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCC
Q 000425 127 SIERPSMIGIIQVG-SQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINP 176 (1525)
Q Consensus 127 ~t~~~s~i~il~gH-~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p 176 (1525)
.+. .+++.+.-- ...|..+.|+-.|+.|+.-+.|..||.++++-+.++
T Consensus 183 ~t~--e~vas~rits~~~IK~I~~s~~g~~liiNtsDRvIR~ye~~di~~~ 231 (405)
T KOG1273|consen 183 ETL--ECVASFRITSVQAIKQIIVSRKGRFLIINTSDRVIRTYEISDIDDE 231 (405)
T ss_pred chh--eeeeeeeechheeeeEEEEeccCcEEEEecCCceEEEEehhhhccc
Confidence 863 333222222 367888999999999999999999999999844443
No 140
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.12 E-value=6.3e-10 Score=134.45 Aligned_cols=161 Identities=15% Similarity=0.149 Sum_probs=110.5
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc---
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA--- 84 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k--- 84 (1525)
.+-|.+.| +...|++...|.++++|+..+....-..+..+|...|.+++|+|+++ .|++|+.||.|.|||++-..
T Consensus 103 ifDl~wap-ge~~lVsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~n~~d~ 181 (720)
T KOG0321|consen 103 IFDLKWAP-GESLLVSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRCNGVDA 181 (720)
T ss_pred eEeeccCC-CceeEEEccCCceeeeeeeccceeecceeecccccccchhhhccCCCcceeeccCCCcEEEEEEeccchhh
Confidence 45677788 55556677789999999975543333334445999999999999987 68999999999999976321
Q ss_pred -----------------EEEEE-------e-cCCceeecCceeEEEeCCCCEEEEEeC-CCcEEEEeCCCC----Cceee
Q 000425 85 -----------------VHYTL-------Q-LDNTIKLLGAGAFAFHPTLEWLFVGDR-RGTLLAWDVSIE----RPSMI 134 (1525)
Q Consensus 85 -----------------~l~tL-------~-H~~~V~~~ga~sVaFSPdg~~L~SgS~-DgtVrVWDl~t~----~~s~i 134 (1525)
+...+ . +.+.+. +...+.+.-|...|+++|. |+.|||||+++. .....
T Consensus 182 ~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~--ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk~~~~~r~ep~ 259 (720)
T KOG0321|consen 182 LEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIF--SSVTVVLFKDESTLASAGAADSTIKVWDLRKNYTAYRQEPR 259 (720)
T ss_pred HHHHhhhhhccccCCCCCCchhhccccccccccCcee--eeeEEEEEeccceeeeccCCCcceEEEeecccccccccCCC
Confidence 11111 1 223333 1112667788899999885 999999999963 11111
Q ss_pred Eeec--cc---CcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 135 GIIQ--VG---SQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 135 ~il~--gH---~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+... .| ...+.++.....|.+|...+.|++|..||++.
T Consensus 260 ~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~sIy~ynm~s 302 (720)
T KOG0321|consen 260 GSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNSIYFYNMRS 302 (720)
T ss_pred cccCccCcccceeeeEEEEecCCCCeEEEEecCCcEEEEeccc
Confidence 1111 22 34567788888888877666799999999983
No 141
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.12 E-value=1.2e-08 Score=126.57 Aligned_cols=153 Identities=19% Similarity=0.308 Sum_probs=124.3
Q ss_pred EEEEEcCC--CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 10 VHLALTPL--QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 10 ~~LA~hP~--qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+.--+||. ---|+.|+ ....+.+|+ +..++....+.+ +...|+++.=+|--..+++|..+|+|.|++++..+.+.
T Consensus 162 Ital~HP~TYLNKIvvGs-~~G~lql~N-vrt~K~v~~f~~-~~s~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~ 238 (910)
T KOG1539|consen 162 ITALLHPSTYLNKIVVGS-SQGRLQLWN-VRTGKVVYTFQE-FFSRITAIEQSPALDVVAIGLENGTVIIFNLKFDKILM 238 (910)
T ss_pred eeeEecchhheeeEEEee-cCCcEEEEE-eccCcEEEEecc-cccceeEeccCCcceEEEEeccCceEEEEEcccCcEEE
Confidence 44446665 33455665 367777887 566677777766 88999999999999999999999999999999999999
Q ss_pred EEecC-CceeecCceeEEEeCCCCEE-EEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 88 TLQLD-NTIKLLGAGAFAFHPTLEWL-FVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 88 tL~H~-~~V~~~ga~sVaFSPdg~~L-~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+|+|. ..|. .++|.-||+.+ ++|+..|.+-+||++.. +....+.+.|...|....|-|..+.++|++.|..+
T Consensus 239 sFk~d~g~Vt-----slSFrtDG~p~las~~~~G~m~~wDLe~k-kl~~v~~nah~~sv~~~~fl~~epVl~ta~~DnSl 312 (910)
T KOG1539|consen 239 SFKQDWGRVT-----SLSFRTDGNPLLASGRSNGDMAFWDLEKK-KLINVTRNAHYGSVTGATFLPGEPVLVTAGADNSL 312 (910)
T ss_pred EEEcccccee-----EEEeccCCCeeEEeccCCceEEEEEcCCC-eeeeeeeccccCCcccceecCCCceEeeccCCCce
Confidence 99975 8999 99999999755 45667799999999942 22223467899999999999999999999999999
Q ss_pred eEEEee
Q 000425 166 QVWKTR 171 (1525)
Q Consensus 166 kVWDlr 171 (1525)
++|=.+
T Consensus 313 k~~vfD 318 (910)
T KOG1539|consen 313 KVWVFD 318 (910)
T ss_pred eEEEee
Confidence 999544
No 142
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.12 E-value=1e-09 Score=132.63 Aligned_cols=153 Identities=22% Similarity=0.250 Sum_probs=115.7
Q ss_pred cCCCCEEEEEEeccceEEEEEeecCCc----cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE--
Q 000425 15 TPLQPVVFFGFHRRMSVTVVGTVEGGR----APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT-- 88 (1525)
Q Consensus 15 hP~qp~Vf~s~~~d~sI~vv~tieggk----~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t-- 88 (1525)
+|.+--+|+-+..+..|.++++....- ..++....|...|..+.|-|....|++++.|.++|+||+++.++...
T Consensus 60 ~~n~eHiLavadE~G~i~l~dt~~~~fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~s~l~G~~~ 139 (720)
T KOG0321|consen 60 APNKEHILAVADEDGGIILFDTKSIVFRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKTSRLVGGRL 139 (720)
T ss_pred CCCccceEEEecCCCceeeecchhhhcchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeeeeeccceeeccee
Confidence 466777888888889888887632211 11122234999999999999667899999999999999999988776
Q ss_pred Ee-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCC-------cee------------------eEeecccC
Q 000425 89 LQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIER-------PSM------------------IGIIQVGS 141 (1525)
Q Consensus 89 L~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~-------~s~------------------i~il~gH~ 141 (1525)
+. |...|. +++|+|+.+ .+++|+.||.|.|||++-.. ..+ +....+|.
T Consensus 140 ~~GH~~Svk-----S~cf~~~n~~vF~tGgRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s 214 (720)
T KOG0321|consen 140 NLGHTGSVK-----SECFMPTNPAVFCTGGRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAAS 214 (720)
T ss_pred ecccccccc-----hhhhccCCCcceeeccCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhcccccccccc
Confidence 55 999999 999999765 67788899999999987310 000 01123455
Q ss_pred cCeEE---EEEcCCCCEEEEEEC-CCCeeEEEeee
Q 000425 142 QPITS---VAWLPMLRLLVTLCR-DGSLQVWKTRV 172 (1525)
Q Consensus 142 ~~Vts---VafsPdg~~LaSgS~-DgtIkVWDlr~ 172 (1525)
..|.+ +.+.-|...|+|+|. |++|||||+|.
T Consensus 215 ~ti~ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk 249 (720)
T KOG0321|consen 215 NTIFSSVTVVLFKDESTLASAGAADSTIKVWDLRK 249 (720)
T ss_pred CceeeeeEEEEEeccceeeeccCCCcceEEEeecc
Confidence 55665 667778889999988 99999999994
No 143
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.12 E-value=3.1e-09 Score=125.82 Aligned_cols=188 Identities=14% Similarity=0.186 Sum_probs=140.1
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCc---cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR---APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI 80 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk---~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl 80 (1525)
-|+|..+..+|||+.|.+.+.+. +..+.+|. .+++. ..-.++..-++.|.|++|.++|. +++|..+|.|.||+.
T Consensus 198 ~sne~v~~a~FHPtd~nliit~G-k~H~~Fw~-~~~~~l~k~~~~fek~ekk~Vl~v~F~engd-viTgDS~G~i~Iw~~ 274 (626)
T KOG2106|consen 198 TSNEVVFLATFHPTDPNLIITCG-KGHLYFWT-LRGGSLVKRQGIFEKREKKFVLCVTFLENGD-VITGDSGGNILIWSK 274 (626)
T ss_pred eccceEEEEEeccCCCcEEEEeC-CceEEEEE-ccCCceEEEeeccccccceEEEEEEEcCCCC-EEeecCCceEEEEeC
Confidence 35777888999999999988874 67787874 44432 22233333457899999999887 678999999999999
Q ss_pred CCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC--CC-----------Cce--------------
Q 000425 81 HTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS--IE-----------RPS-------------- 132 (1525)
Q Consensus 81 ~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~--t~-----------~~s-------------- 132 (1525)
.+.+..+... |...|. +++.-.+|. |+||+.|+.|..||-. +. .+.
T Consensus 275 ~~~~~~k~~~aH~ggv~-----~L~~lr~Gt-llSGgKDRki~~Wd~~y~k~r~~elPe~~G~iRtv~e~~~di~vGTtr 348 (626)
T KOG2106|consen 275 GTNRISKQVHAHDGGVF-----SLCMLRDGT-LLSGGKDRKIILWDDNYRKLRETELPEQFGPIRTVAEGKGDILVGTTR 348 (626)
T ss_pred CCceEEeEeeecCCceE-----EEEEecCcc-EeecCccceEEeccccccccccccCchhcCCeeEEecCCCcEEEeecc
Confidence 9888888877 999999 999999987 4569999999999832 10 000
Q ss_pred -----------eeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCccccccccccc
Q 000425 133 -----------MIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILS 201 (1525)
Q Consensus 133 -----------~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs 201 (1525)
+..+..+|......++-||+.+.++||+.|+++++|+-. +.. ...+..
T Consensus 349 N~iL~Gt~~~~f~~~v~gh~delwgla~hps~~q~~T~gqdk~v~lW~~~------k~~---------------wt~~~~ 407 (626)
T KOG2106|consen 349 NFILQGTLENGFTLTVQGHGDELWGLATHPSKNQLLTCGQDKHVRLWNDH------KLE---------------WTKIIE 407 (626)
T ss_pred ceEEEeeecCCceEEEEecccceeeEEcCCChhheeeccCcceEEEccCC------cee---------------EEEEec
Confidence 001235788888999999999999999999999999821 110 011111
Q ss_pred ccCCcceecCCCeeEEEEecCCCeEEEEecc
Q 000425 202 QQGGEAVYPLPRVRALEVHPRLNLAVLLFAN 232 (1525)
Q Consensus 202 ~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~ 232 (1525)
....++.|||.+ .+++.++.
T Consensus 408 ----------d~~~~~~fhpsg-~va~Gt~~ 427 (626)
T KOG2106|consen 408 ----------DPAECADFHPSG-VVAVGTAT 427 (626)
T ss_pred ----------CceeEeeccCcc-eEEEeecc
Confidence 235789999998 88888765
No 144
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.12 E-value=2.1e-09 Score=126.75 Aligned_cols=157 Identities=20% Similarity=0.228 Sum_probs=129.2
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecC-------CccceEEccCCCCCeEEEEEcCCC--CEEEEEECCCcEEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-------GRAPTKIKTDLKKPIVNLACHPRL--PVLYVAYADGLIRAYN 79 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-------gk~~tklk~~H~~~V~sLaf~Pdg--~~LaSGs~DgtIrLWD 79 (1525)
.-+|.|+-++..++.|+ +|..|.+|....- ...+.....+|+-+|+.+.+.+.+ .+++++|.|.++|+||
T Consensus 126 ITcL~fs~dgs~iiTgs-kDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wd 204 (476)
T KOG0646|consen 126 ITCLKFSDDGSHIITGS-KDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWD 204 (476)
T ss_pred eeEEEEeCCCcEEEecC-CCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEE
Confidence 34788888888877666 6899988876432 233444555699999999887754 5899999999999999
Q ss_pred cCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC--------------ceeeEeecccCc--C
Q 000425 80 IHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER--------------PSMIGIIQVGSQ--P 143 (1525)
Q Consensus 80 l~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~--------------~s~i~il~gH~~--~ 143 (1525)
+..+.++.++.....+. +++..|-...++.|+.+|.|.+.++.... ..-+..+.||.. .
T Consensus 205 lS~g~LLlti~fp~si~-----av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ 279 (476)
T KOG0646|consen 205 LSLGVLLLTITFPSSIK-----AVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESA 279 (476)
T ss_pred eccceeeEEEecCCcce-----eEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeeccccCCcc
Confidence 99999999998778889 99999999999999999999988876421 111234668988 9
Q ss_pred eEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 144 ITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 144 VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
|+|++.+-||.+|+||+.||.|.|||+.
T Consensus 280 ITcLais~DgtlLlSGd~dg~VcvWdi~ 307 (476)
T KOG0646|consen 280 ITCLAISTDGTLLLSGDEDGKVCVWDIY 307 (476)
T ss_pred eeEEEEecCccEEEeeCCCCCEEEEecc
Confidence 9999999999999999999999999986
No 145
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.11 E-value=8.5e-10 Score=133.76 Aligned_cols=281 Identities=16% Similarity=0.229 Sum_probs=178.2
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCc--cceEEccCCCCCeEEEEEcC------C-----CCEEEEEECCCcE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR--APTKIKTDLKKPIVNLACHP------R-----LPVLYVAYADGLI 75 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk--~~tklk~~H~~~V~sLaf~P------d-----g~~LaSGs~DgtI 75 (1525)
-+.+.|.|+..++-+-. .+.++.+|+..+-.+ ....+.. |..-|+++..-| . ...|.+++.|++|
T Consensus 327 ~IA~~Fdet~~klscVY-ndhSlYvWDvrD~~kvgk~~s~ly-HS~ciW~Ve~~p~nv~~~~~aclp~~cF~TCSsD~TI 404 (1080)
T KOG1408|consen 327 AIACQFDETTDKLSCVY-NDHSLYVWDVRDVNKVGKCSSMLY-HSACIWDVENLPCNVHSPTAACLPRGCFTTCSSDGTI 404 (1080)
T ss_pred eeEEEecCCCceEEEEE-cCceEEEEeccccccccceeeeee-ccceeeeeccccccccCcccccCCccceeEecCCCcE
Confidence 36778899999987666 588999999654322 1222333 777777665443 1 1258999999999
Q ss_pred EEEEcCCCcEEEEE--------------------------------------ecCCceeecCceeEEEeCCCCEEEEEeC
Q 000425 76 RAYNIHTYAVHYTL--------------------------------------QLDNTIKLLGAGAFAFHPTLEWLFVGDR 117 (1525)
Q Consensus 76 rLWDl~t~k~l~tL--------------------------------------~H~~~V~~~ga~sVaFSPdg~~L~SgS~ 117 (1525)
|+||+..+....++ .....++ +++.+|+|++|++|..
T Consensus 405 RlW~l~~ctnn~vyrRNils~~l~ki~y~d~~~q~~~d~~~~~fdka~~s~~d~r~G~R-----~~~vSp~gqhLAsGDr 479 (1080)
T KOG1408|consen 405 RLWDLAFCTNNQVYRRNILSANLSKIPYEDSTQQIMHDASAGIFDKALVSTCDSRFGFR-----ALAVSPDGQHLASGDR 479 (1080)
T ss_pred EEeecccccccceeecccchhhhhcCccccCchhhhhhccCCcccccchhhcCcccceE-----EEEECCCcceecccCc
Confidence 99998752111110 0123355 8999999999999999
Q ss_pred CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcC---CCCEEEEEECCCCeeEEEeee------ecCCCCCCCccccccC
Q 000425 118 RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP---MLRLLVTLCRDGSLQVWKTRV------IINPNRPPMQANFFEP 188 (1525)
Q Consensus 118 DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP---dg~~LaSgS~DgtIkVWDlr~------~~~p~~~~~~a~~f~p 188 (1525)
-|+++|||+... .....+++|...|.|+.|+- ..++|+|+|.|..|+|+|+.. .+..|...+.+.-|..
T Consensus 480 ~GnlrVy~Lq~l--~~~~~~eAHesEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~rny~l~qtld~HSssITsvKFa~ 557 (1080)
T KOG1408|consen 480 GGNLRVYDLQEL--EYTCFMEAHESEILCLEYSFPVLTNKLLASASRDRLIHVYDVKRNYDLVQTLDGHSSSITSVKFAC 557 (1080)
T ss_pred cCceEEEEehhh--hhhhheecccceeEEEeecCchhhhHhhhhccCCceEEEEecccccchhhhhcccccceeEEEEee
Confidence 999999999953 22223669999999999985 357899999999999999871 2222333332222222
Q ss_pred CCccc------cc---ccccccccCCccee-------cCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCcee
Q 000425 189 ASIES------ID---IPRILSQQGGEAVY-------PLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQ 252 (1525)
Q Consensus 189 ~gVes------ld---i~~iLs~~gG~~vy-------~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q 252 (1525)
++.+. .| .-++..-.+....+ ..-.+-+++..|+.-+++ ++..|.++++.....|+
T Consensus 558 ~gln~~MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k~v~------t~cQDrnirif~i~sgK-- 629 (1080)
T KOG1408|consen 558 NGLNRKMISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSKLVV------TVCQDRNIRIFDIESGK-- 629 (1080)
T ss_pred cCCceEEEeccCchhhheehhccccCceeccccccccccceEEEeeeCCCcceEE------EEecccceEEEeccccc--
Confidence 22100 00 00000000000111 111245678888888887 78889999988877774
Q ss_pred eeeeeccccCCcccchhhhcccCCCcccc--hhhHHHHHHHhhhccCCCceeeehhhhhceeeeeeccccccc
Q 000425 253 LFAVLQSARGSSASVLKEKLSSMGSSGIL--ADHQLQAQLQEHHLKGHSHLTISDIARKAFLYSHFMEGHAKS 323 (1525)
Q Consensus 253 ~~~tlq~~~~~~~s~~~e~~~slgs~g~l--~~~ql~~~~~~~~~~~~~~~~I~di~r~~~i~s~~~d~~~k~ 323 (1525)
...++++-++.-..+++ ..++++|+. -.+ .|+++.+.|+...+++-+ |-||...
T Consensus 630 q~k~FKgs~~~eG~lIK---v~lDPSgiY~atSc------------sdktl~~~Df~sgEcvA~--m~GHsE~ 685 (1080)
T KOG1408|consen 630 QVKSFKGSRDHEGDLIK---VILDPSGIYLATSC------------SDKTLCFVDFVSGECVAQ--MTGHSEA 685 (1080)
T ss_pred eeeeecccccCCCceEE---EEECCCccEEEEee------------cCCceEEEEeccchhhhh--hcCcchh
Confidence 23345555444333332 245667765 233 799999999988888866 6676654
No 146
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.11 E-value=5.9e-10 Score=126.66 Aligned_cols=147 Identities=17% Similarity=0.248 Sum_probs=114.9
Q ss_pred CCCEEEEEEeccceEEEEEeecCC--ccce-EEccCCCCCeEEEEEcCC----CCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 17 LQPVVFFGFHRRMSVTVVGTVEGG--RAPT-KIKTDLKKPIVNLACHPR----LPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 17 ~qp~Vf~s~~~d~sI~vv~tiegg--k~~t-klk~~H~~~V~sLaf~Pd----g~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
.+|.||+... ...+.++.....+ +... ....+|...-..++|.-+ .+++|+|+.-|.|||.|+.++++...+
T Consensus 53 ~e~~vfatvG-~~rvtiy~c~~d~~ir~lq~y~D~d~~Esfytcsw~yd~~~~~p~la~~G~~GvIrVid~~~~~~~~~~ 131 (385)
T KOG1034|consen 53 DEPQVFATVG-GNRVTIYECPGDGGIRLLQSYADEDHDESFYTCSWSYDSNTGNPFLAAGGYLGVIRVIDVVSGQCSKNY 131 (385)
T ss_pred CCCceEEEeC-CcEEEEEEECCccceeeeeeccCCCCCcceEEEEEEecCCCCCeeEEeecceeEEEEEecchhhhccce
Confidence 3566665543 3445555432222 1111 122347777777888643 568999999999999999999999999
Q ss_pred e-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceee---EeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMI---GIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i---~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
. |...|+ .+.|+|+. ++++++|.|.+||+||+++. .++ +-..+|...|.++.||++|.+|+||+.|.+
T Consensus 132 ~ghG~sIN-----eik~~p~~~qlvls~SkD~svRlwnI~~~--~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhs 204 (385)
T KOG1034|consen 132 RGHGGSIN-----EIKFHPDRPQLVLSASKDHSVRLWNIQTD--VCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHS 204 (385)
T ss_pred eccCccch-----hhhcCCCCCcEEEEecCCceEEEEeccCC--eEEEEecccccccCcEEEEEEcCCCCeeeccCCcce
Confidence 9 999999 99999975 68889999999999999964 333 345789999999999999999999999999
Q ss_pred eeEEEee
Q 000425 165 LQVWKTR 171 (1525)
Q Consensus 165 IkVWDlr 171 (1525)
+++|++.
T Consensus 205 lk~W~l~ 211 (385)
T KOG1034|consen 205 LKLWRLN 211 (385)
T ss_pred EEEEecC
Confidence 9999987
No 147
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.10 E-value=8e-10 Score=129.53 Aligned_cols=152 Identities=20% Similarity=0.269 Sum_probs=120.8
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcE-EEEE
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAV-HYTL 89 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~-l~tL 89 (1525)
|++.-.-.-|+++..-+.+|++|+ .+.+++...+.. |.++|.+++|||..+ ++++||.|++|++.|.+.... -...
T Consensus 249 Ls~n~~~~nVLaSgsaD~TV~lWD-~~~g~p~~s~~~-~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s~~~w 326 (463)
T KOG0270|consen 249 LSWNRNFRNVLASGSADKTVKLWD-VDTGKPKSSITH-HGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNSGKEW 326 (463)
T ss_pred HHhccccceeEEecCCCceEEEEE-cCCCCcceehhh-cCCceeEEEecCCCceEEEeccccceEEeeeccCccccCceE
Confidence 334444556778888899999999 466677777775 999999999999876 789999999999999985322 2233
Q ss_pred ecCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCC-EEEEEECCCCeeE
Q 000425 90 QLDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLR-LLVTLCRDGSLQV 167 (1525)
Q Consensus 90 ~H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS~DgtIkV 167 (1525)
+-...|- .++|+|... .++++.+||+|+-+|.+... ..+-++++|...|.++++++.-+ +++|+|.|++|++
T Consensus 327 k~~g~VE-----kv~w~~~se~~f~~~tddG~v~~~D~R~~~-~~vwt~~AHd~~ISgl~~n~~~p~~l~t~s~d~~Vkl 400 (463)
T KOG0270|consen 327 KFDGEVE-----KVAWDPHSENSFFVSTDDGTVYYFDIRNPG-KPVWTLKAHDDEISGLSVNIQTPGLLSTASTDKVVKL 400 (463)
T ss_pred EeccceE-----EEEecCCCceeEEEecCCceEEeeecCCCC-CceeEEEeccCCcceEEecCCCCcceeeccccceEEE
Confidence 3567788 999999765 56777899999999999742 33334779999999999998754 6889999999999
Q ss_pred EEee
Q 000425 168 WKTR 171 (1525)
Q Consensus 168 WDlr 171 (1525)
|++.
T Consensus 401 w~~~ 404 (463)
T KOG0270|consen 401 WKFD 404 (463)
T ss_pred Eeec
Confidence 9986
No 148
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=99.10 E-value=2.8e-09 Score=124.02 Aligned_cols=156 Identities=13% Similarity=0.203 Sum_probs=123.1
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecC--Cc-----cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEG--GR-----APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg--gk-----~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~ 81 (1525)
...+.||+..--.+++...+..|++|..... +. ........|...|+++.|+|+|.++|||+++|.|.+|-..
T Consensus 16 v~s~dfq~n~~~~laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk~~ 95 (434)
T KOG1009|consen 16 VYSVDFQKNSLNKLATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWKQG 95 (434)
T ss_pred eEEEEeccCcccceecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCceEEEEEec
Confidence 3456666665534555556888988875432 11 1111223599999999999999999999999999999654
Q ss_pred --------C--------CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCe
Q 000425 82 --------T--------YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPI 144 (1525)
Q Consensus 82 --------t--------~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~V 144 (1525)
+ +...+.+. |.+.+. .++|+||+.++++++.|+++++||+.. .....++.+|..+|
T Consensus 96 ~~~~~~~d~e~~~~ke~w~v~k~lr~h~~diy-----dL~Ws~d~~~l~s~s~dns~~l~Dv~~--G~l~~~~~dh~~yv 168 (434)
T KOG1009|consen 96 DVRIFDADTEADLNKEKWVVKKVLRGHRDDIY-----DLAWSPDSNFLVSGSVDNSVRLWDVHA--GQLLAILDDHEHYV 168 (434)
T ss_pred CcCCccccchhhhCccceEEEEEecccccchh-----hhhccCCCceeeeeeccceEEEEEecc--ceeEeecccccccc
Confidence 2 34455666 888899 999999999999999999999999995 45556688999999
Q ss_pred EEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 145 TSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 145 tsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
..++|.|..+++++-|.|...++.+++
T Consensus 169 qgvawDpl~qyv~s~s~dr~~~~~~~~ 195 (434)
T KOG1009|consen 169 QGVAWDPLNQYVASKSSDRHPEGFSAK 195 (434)
T ss_pred ceeecchhhhhhhhhccCcccceeeee
Confidence 999999999999999999977777665
No 149
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=99.09 E-value=2.5e-10 Score=136.69 Aligned_cols=115 Identities=26% Similarity=0.419 Sum_probs=98.0
Q ss_pred CCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCC-------cEEEEEe-cCCceeecCceeEEEeCC-CCEEEEEeCCCc
Q 000425 51 KKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTY-------AVHYTLQ-LDNTIKLLGAGAFAFHPT-LEWLFVGDRRGT 120 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~-------k~l~tL~-H~~~V~~~ga~sVaFSPd-g~~L~SgS~Dgt 120 (1525)
...|+.+.|+|-+ ..||++++||.|++|.+..+ .+-..+. |...|. +++|||- ...|++++.|.+
T Consensus 627 gt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~-----slRfHPLAadvLa~asyd~T 701 (1012)
T KOG1445|consen 627 GTLVTDLHWDPFDDERLAVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKIT-----SLRFHPLAADVLAVASYDST 701 (1012)
T ss_pred CceeeecccCCCChHHeeecccCceEEEEEeccCCCCcccCCcceeeecccceEE-----EEEecchhhhHhhhhhccce
Confidence 3578999999965 57999999999999998643 4556777 999999 9999994 568889999999
Q ss_pred EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 121 LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 121 VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
|++||+.+....+. +.+|++.|.+++|||+|+.++|.+.||+|+|++.|.
T Consensus 702 i~lWDl~~~~~~~~--l~gHtdqIf~~AWSpdGr~~AtVcKDg~~rVy~Prs 751 (1012)
T KOG1445|consen 702 IELWDLANAKLYSR--LVGHTDQIFGIAWSPDGRRIATVCKDGTLRVYEPRS 751 (1012)
T ss_pred eeeeehhhhhhhhe--eccCcCceeEEEECCCCcceeeeecCceEEEeCCCC
Confidence 99999997422222 559999999999999999999999999999998873
No 150
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.09 E-value=5.4e-10 Score=126.40 Aligned_cols=114 Identities=16% Similarity=0.290 Sum_probs=101.7
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
-++.|+++.|+|.+..|+++++||++++||.........++|..++. +++|.++ ..+++|+.||.|+.+|+++.
T Consensus 12 P~d~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~l~~~~~~~~plL-----~c~F~d~-~~~~~G~~dg~vr~~Dln~~ 85 (323)
T KOG1036|consen 12 PEDGISSVKFSPSSSDLLVSSWDGSLRLYDVPANSLKLKFKHGAPLL-----DCAFADE-STIVTGGLDGQVRRYDLNTG 85 (323)
T ss_pred ChhceeeEEEcCcCCcEEEEeccCcEEEEeccchhhhhheecCCcee-----eeeccCC-ceEEEeccCceEEEEEecCC
Confidence 35779999999999999999999999999999888888888999999 9999884 56899999999999999975
Q ss_pred CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 130 RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 130 ~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
....+ ..|...|+|+.+++....++|||+|++|++||.|.
T Consensus 86 ~~~~i---gth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~ 125 (323)
T KOG1036|consen 86 NEDQI---GTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRN 125 (323)
T ss_pred cceee---ccCCCceEEEEeeccCCeEEEcccCccEEEEeccc
Confidence 33333 47999999999999888999999999999999984
No 151
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.08 E-value=1.1e-07 Score=116.34 Aligned_cols=197 Identities=15% Similarity=0.215 Sum_probs=127.3
Q ss_pred EEEcC-CCCEEEEEEeccceEEEEEeec-CCccceEEccCCCCCeEEEEEcCCCC---EEEEEECCCcEEEEEcCCCc--
Q 000425 12 LALTP-LQPVVFFGFHRRMSVTVVGTVE-GGRAPTKIKTDLKKPIVNLACHPRLP---VLYVAYADGLIRAYNIHTYA-- 84 (1525)
Q Consensus 12 LA~hP-~qp~Vf~s~~~d~sI~vv~tie-ggk~~tklk~~H~~~V~sLaf~Pdg~---~LaSGs~DgtIrLWDl~t~k-- 84 (1525)
|++.| .+.+++.....+..+.++.... .-+....+.+ |.+.|++|+|...+. +++|+|.|.+||||.+.-..
T Consensus 151 L~~~~~~~~~lla~Ggs~~~v~~~s~~~d~f~~v~el~G-H~DWIrsl~f~~~~~~~~~laS~SQD~yIRiW~i~~~~~~ 229 (764)
T KOG1063|consen 151 LAALKNNKTFLLACGGSKFVVDLYSSSADSFARVAELEG-HTDWIRSLAFARLGGDDLLLASSSQDRYIRIWRIVLGDDE 229 (764)
T ss_pred HhhhccCCcEEEEecCcceEEEEeccCCcceeEEEEeec-cchhhhhhhhhccCCCcEEEEecCCceEEEEEEEEecCCc
Confidence 44455 3444444444444444444321 1233444555 999999999987654 78999999999999853211
Q ss_pred -----------------------EEE-----E-Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC----
Q 000425 85 -----------------------VHY-----T-LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER---- 130 (1525)
Q Consensus 85 -----------------------~l~-----t-L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~---- 130 (1525)
..+ + +. |++.|. ++.|+|++..|+++|.|.++.+|--.+..
T Consensus 230 ~~~~~e~~~t~~~~~~~f~~l~~i~~~is~eall~GHeDWV~-----sv~W~p~~~~LLSASaDksmiiW~pd~~tGiWv 304 (764)
T KOG1063|consen 230 DSNEREDSLTTLSNLPVFMILEEIQYRISFEALLMGHEDWVY-----SVWWHPEGLDLLSASADKSMIIWKPDENTGIWV 304 (764)
T ss_pred cccccccccccccCCceeeeeeeEEEEEehhhhhcCcccceE-----EEEEccchhhheecccCcceEEEecCCccceEE
Confidence 111 1 12 889999 99999999999999999999999655421
Q ss_pred -ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCccee
Q 000425 131 -PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVY 209 (1525)
Q Consensus 131 -~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy 209 (1525)
.-..+...+....-.++.|+|++..+++-+.-|..++|... ++ .++. .....+++
T Consensus 305 ~~vRlGe~gg~a~GF~g~lw~~n~~~ii~~g~~Gg~hlWkt~---d~-------~~w~--------~~~~iSGH------ 360 (764)
T KOG1063|consen 305 DVVRLGEVGGSAGGFWGGLWSPNSNVIIAHGRTGGFHLWKTK---DK-------TFWT--------QEPVISGH------ 360 (764)
T ss_pred EEEEeecccccccceeeEEEcCCCCEEEEecccCcEEEEecc---Cc-------ccee--------eccccccc------
Confidence 11123333334457789999999999999999999999722 10 0001 11122332
Q ss_pred cCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 210 PLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 210 ~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
+..|+.+.|||++-++- +.+-|.+.|+..
T Consensus 361 -~~~V~dv~W~psGeflL------svs~DQTTRlFa 389 (764)
T KOG1063|consen 361 -VDGVKDVDWDPSGEFLL------SVSLDQTTRLFA 389 (764)
T ss_pred -cccceeeeecCCCCEEE------Eeccccceeeec
Confidence 24689999999988655 445555555443
No 152
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.07 E-value=3.7e-09 Score=114.81 Aligned_cols=158 Identities=16% Similarity=0.197 Sum_probs=128.9
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccC-C-----CCCeEEEEEcCCCCEEEEEECCCcEEEE
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTD-L-----KKPIVNLACHPRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~-H-----~~~V~sLaf~Pdg~~LaSGs~DgtIrLW 78 (1525)
|+-+---|++-.=.-++|++.+.+++|++|+..- ..++..+..+ | +..|.+++..|.|+++++|..|...-+|
T Consensus 180 sghtghilalyswn~~m~~sgsqdktirfwdlrv-~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~dssc~ly 258 (350)
T KOG0641|consen 180 SGHTGHILALYSWNGAMFASGSQDKTIRFWDLRV-NSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLY 258 (350)
T ss_pred cCCcccEEEEEEecCcEEEccCCCceEEEEeeec-cceeeeccCcccCCCcccceeEEEEECCCcceeeeccCCCceEEE
Confidence 3444445666666678889999999999998632 2222222211 2 3579999999999999999999999999
Q ss_pred EcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC----CceeeEeecccCcCeEEEEEcCCC
Q 000425 79 NIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE----RPSMIGIIQVGSQPITSVAWLPML 153 (1525)
Q Consensus 79 Dl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~----~~s~i~il~gH~~~VtsVafsPdg 153 (1525)
|++.+..+..|. |...|+ ||+|+|..-|++++|-|..|++-|++.. ...++ ...|...+..+.|||+.
T Consensus 259 dirg~r~iq~f~phsadir-----~vrfsp~a~yllt~syd~~ikltdlqgdla~el~~~v--v~ehkdk~i~~rwh~~d 331 (350)
T KOG0641|consen 259 DIRGGRMIQRFHPHSADIR-----CVRFSPGAHYLLTCSYDMKIKLTDLQGDLAHELPIMV--VAEHKDKAIQCRWHPQD 331 (350)
T ss_pred EeeCCceeeeeCCCcccee-----EEEeCCCceEEEEecccceEEEeecccchhhcCceEE--EEeccCceEEEEecCcc
Confidence 999999999999 999999 9999999999999999999999999852 22232 45788888999999998
Q ss_pred CEEEEEECCCCeeEEEe
Q 000425 154 RLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 154 ~~LaSgS~DgtIkVWDl 170 (1525)
--++|.|.|+|+.+|-+
T Consensus 332 ~sfisssadkt~tlwa~ 348 (350)
T KOG0641|consen 332 FSFISSSADKTATLWAL 348 (350)
T ss_pred ceeeeccCcceEEEecc
Confidence 88999999999999965
No 153
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=99.07 E-value=9.6e-10 Score=123.13 Aligned_cols=155 Identities=11% Similarity=0.164 Sum_probs=124.2
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEE----ccCCCCCeEEEEEcC--CCCEEEEEECCCcEEEEEcC
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKI----KTDLKKPIVNLACHP--RLPVLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tkl----k~~H~~~V~sLaf~P--dg~~LaSGs~DgtIrLWDl~ 81 (1525)
..-+|-++|++..+.+. .+..|.+|+..++.+..-.+ ..+|....++-+|+| +++.+++. .|++++.||++
T Consensus 125 ~i~cvew~Pns~klasm--~dn~i~l~~l~ess~~vaev~ss~s~e~~~~ftsg~WspHHdgnqv~tt-~d~tl~~~D~R 201 (370)
T KOG1007|consen 125 KINCVEWEPNSDKLASM--DDNNIVLWSLDESSKIVAEVLSSESAEMRHSFTSGAWSPHHDGNQVATT-SDSTLQFWDLR 201 (370)
T ss_pred ceeeEEEcCCCCeeEEe--ccCceEEEEcccCcchheeecccccccccceecccccCCCCccceEEEe-CCCcEEEEEcc
Confidence 34678899988876544 37889899877765422222 224667788889988 56677775 58999999999
Q ss_pred CCcEEEEEe--cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEE
Q 000425 82 TYAVHYTLQ--LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLV 157 (1525)
Q Consensus 82 t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~La 157 (1525)
+.++...++ |...|+ .++|+|..+ +|++|++||.|++||.+.. +..+..+.+|...|.+|.|+|. ..+|+
T Consensus 202 T~~~~~sI~dAHgq~vr-----dlDfNpnkq~~lvt~gDdgyvriWD~R~t-k~pv~el~~HsHWvW~VRfn~~hdqLiL 275 (370)
T KOG1007|consen 202 TMKKNNSIEDAHGQRVR-----DLDFNPNKQHILVTCGDDGYVRIWDTRKT-KFPVQELPGHSHWVWAVRFNPEHDQLIL 275 (370)
T ss_pred chhhhcchhhhhcceee-----eccCCCCceEEEEEcCCCccEEEEeccCC-CccccccCCCceEEEEEEecCccceEEE
Confidence 999999998 999999 999999887 5678899999999999953 3334567799999999999995 46889
Q ss_pred EEECCCCeeEEEee
Q 000425 158 TLCRDGSLQVWKTR 171 (1525)
Q Consensus 158 SgS~DgtIkVWDlr 171 (1525)
|+|.|..|.+|...
T Consensus 276 s~~SDs~V~Lsca~ 289 (370)
T KOG1007|consen 276 SGGSDSAVNLSCAS 289 (370)
T ss_pred ecCCCceeEEEecc
Confidence 99999999999875
No 154
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.06 E-value=2.6e-09 Score=123.67 Aligned_cols=157 Identities=15% Similarity=0.186 Sum_probs=125.6
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCc-----cceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR-----APTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk-----~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~ 83 (1525)
.-+++.|....+..+.+.|-+|.+|..-+.+. .+.+...+|...|--++|||.-. .|++++.|.+|.+||+.++
T Consensus 85 LDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~v~iWnv~tg 164 (472)
T KOG0303|consen 85 LDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNTVSIWNVGTG 164 (472)
T ss_pred cccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCceEEEEeccCC
Confidence 34788999999999999999999998766532 22333334999999999999865 6899999999999999999
Q ss_pred cEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcC-eEEEEEcCCCCEEEEE---
Q 000425 84 AVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQP-ITSVAWLPMLRLLVTL--- 159 (1525)
Q Consensus 84 k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~-VtsVafsPdg~~LaSg--- 159 (1525)
+.+.++.|.+-|. ++.|+.||..+++++.|+.|||||.++. .++..-.+|.+. -..+.|--+|.++.||
T Consensus 165 eali~l~hpd~i~-----S~sfn~dGs~l~TtckDKkvRv~dpr~~--~~v~e~~~heG~k~~Raifl~~g~i~tTGfsr 237 (472)
T KOG0303|consen 165 EALITLDHPDMVY-----SMSFNRDGSLLCTTCKDKKVRVIDPRRG--TVVSEGVAHEGAKPARAIFLASGKIFTTGFSR 237 (472)
T ss_pred ceeeecCCCCeEE-----EEEeccCCceeeeecccceeEEEcCCCC--cEeeecccccCCCcceeEEeccCceeeecccc
Confidence 9999999999999 9999999999999999999999999953 343333456543 3457788888866555
Q ss_pred ECCCCeeEEEeeee
Q 000425 160 CRDGSLQVWKTRVI 173 (1525)
Q Consensus 160 S~DgtIkVWDlr~~ 173 (1525)
..+..+-+||....
T Consensus 238 ~seRq~aLwdp~nl 251 (472)
T KOG0303|consen 238 MSERQIALWDPNNL 251 (472)
T ss_pred ccccceeccCcccc
Confidence 24678999997643
No 155
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.04 E-value=7.8e-09 Score=117.00 Aligned_cols=236 Identities=16% Similarity=0.248 Sum_probs=149.8
Q ss_pred CCcEEEEEE-cCCCCEEEEEEeccceEEEEEeecC-----C--c-cceEEccCCCCCeEEEEEcCC--CCEEEEEECCCc
Q 000425 6 VDTEVHLAL-TPLQPVVFFGFHRRMSVTVVGTVEG-----G--R-APTKIKTDLKKPIVNLACHPR--LPVLYVAYADGL 74 (1525)
Q Consensus 6 ~~~e~~LA~-hP~qp~Vf~s~~~d~sI~vv~tieg-----g--k-~~tklk~~H~~~V~sLaf~Pd--g~~LaSGs~Dgt 74 (1525)
....+.+.+ ||.--.|+.+..-|+++.+|...+. + + ..+++. +-...|+.|.|.|. |-.+|+++.||+
T Consensus 59 ~~Si~rV~WAhPEfGqvvA~cS~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~-DsrssV~DV~FaP~hlGLklA~~~aDG~ 137 (361)
T KOG2445|consen 59 DGSIWRVVWAHPEFGQVVATCSYDRTVSIWEEQEKSEEAHGRRWVRRTTLV-DSRSSVTDVKFAPKHLGLKLAAASADGI 137 (361)
T ss_pred CCcEEEEEecCccccceEEEEecCCceeeeeecccccccccceeEEEEEee-cCCcceeEEEecchhcceEEEEeccCcE
Confidence 344555555 5665555566667899999976332 1 1 122333 36788999999997 457999999999
Q ss_pred EEEEEcCC------CcEEEEEe--------cCCceeecCceeEEEeCC---CCEEEEEeCC-----CcEEEEeCCCCCc-
Q 000425 75 IRAYNIHT------YAVHYTLQ--------LDNTIKLLGAGAFAFHPT---LEWLFVGDRR-----GTLLAWDVSIERP- 131 (1525)
Q Consensus 75 IrLWDl~t------~k~l~tL~--------H~~~V~~~ga~sVaFSPd---g~~L~SgS~D-----gtVrVWDl~t~~~- 131 (1525)
+|||+.-. ..+...++ +..... ||.|+|. .++|+.|+++ +.++||......+
T Consensus 138 lRIYEA~dp~nLs~W~Lq~Ei~~~~~pp~~~~~~~~-----CvsWn~sr~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rK 212 (361)
T KOG2445|consen 138 LRIYEAPDPMNLSQWTLQHEIQNVIDPPGKNKQPCF-----CVSWNPSRMHEPLIAVGSDEDAPHLNKVKIYEYNENGRK 212 (361)
T ss_pred EEEEecCCccccccchhhhhhhhccCCcccccCcce-----EEeeccccccCceEEEEcccCCccccceEEEEecCCcce
Confidence 99998543 22222222 334455 9999874 4678888866 5889998775322
Q ss_pred -eeeEeecccCcCeEEEEEcCCC----CEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCc
Q 000425 132 -SMIGIIQVGSQPITSVAWLPML----RLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGE 206 (1525)
Q Consensus 132 -s~i~il~gH~~~VtsVafsPdg----~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~ 206 (1525)
..+..+.+|+.+|++++|.|+- .+|++++.|| |+||.++....+-. ......+.....+.+..+ +..++.
T Consensus 213 w~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~kDg-v~I~~v~~~~s~i~---~ee~~~~~~~~~l~v~~v-s~~~~H 287 (361)
T KOG2445|consen 213 WLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATKDG-VRIFKVKVARSAIE---EEEVLAPDLMTDLPVEKV-SELDDH 287 (361)
T ss_pred eeeehhcCCCCCcceeeeeccccCCceeeEEEeecCc-EEEEEEeeccchhh---hhcccCCCCccccceEEe-eeccCC
Confidence 2234566999999999999963 4799999999 99999986543110 000011111111222221 111111
Q ss_pred ceecCCCeeEEEEecCCCeEEEEecccccCCccccccccccc-CceeeeeeeccccC
Q 000425 207 AVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTRE-GRKQLFAVLQSARG 262 (1525)
Q Consensus 207 ~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~e-g~~q~~~tlq~~~~ 262 (1525)
+ -.||.+.|.-++..++ +.|||..+|.|+..- ..+.+..++.....
T Consensus 288 ~----~~VWrv~wNmtGtiLs------StGdDG~VRLWkany~n~~kC~sv~~~e~~ 334 (361)
T KOG2445|consen 288 N----GEVWRVRWNMTGTILS------STGDDGCVRLWKANYNNLWKCTSVLKAEGS 334 (361)
T ss_pred C----CceEEEEEeeeeeEEe------ecCCCceeeehhhhhhhhheeeeEEeccCC
Confidence 1 1478899988888777 899999999998643 34556555555433
No 156
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.03 E-value=6.7e-10 Score=126.24 Aligned_cols=144 Identities=15% Similarity=0.291 Sum_probs=115.3
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEEe----cC
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTLQ----LD 92 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL~----H~ 92 (1525)
.|++.++. .+.-|++++... +.....+.+ |.+.|+.|.|||+.+ +++++|.|.+||+||+++..|+..|- |.
T Consensus 105 ~p~la~~G-~~GvIrVid~~~-~~~~~~~~g-hG~sINeik~~p~~~qlvls~SkD~svRlwnI~~~~Cv~VfGG~egHr 181 (385)
T KOG1034|consen 105 NPFLAAGG-YLGVIRVIDVVS-GQCSKNYRG-HGGSINEIKFHPDRPQLVLSASKDHSVRLWNIQTDVCVAVFGGVEGHR 181 (385)
T ss_pred CeeEEeec-ceeEEEEEecch-hhhccceec-cCccchhhhcCCCCCcEEEEecCCceEEEEeccCCeEEEEeccccccc
Confidence 66766666 567777877543 333444454 999999999999875 78999999999999999999998886 99
Q ss_pred CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC--------------Cce---ee---------EeecccCcCeEE
Q 000425 93 NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE--------------RPS---MI---------GIIQVGSQPITS 146 (1525)
Q Consensus 93 ~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~--------------~~s---~i---------~il~gH~~~Vts 146 (1525)
+.|. +++|+++|.+|+|++.|.++++|++... .++ +. .+..-|..+|-|
T Consensus 182 deVL-----SvD~~~~gd~i~ScGmDhslk~W~l~~~~f~~~lE~s~~~~~~~t~~pfpt~~~~fp~fst~diHrnyVDC 256 (385)
T KOG1034|consen 182 DEVL-----SVDFSLDGDRIASCGMDHSLKLWRLNVKEFKNKLELSITYSPNKTTRPFPTPKTHFPDFSTTDIHRNYVDC 256 (385)
T ss_pred CcEE-----EEEEcCCCCeeeccCCcceEEEEecChhHHhhhhhhhcccCCCCccCcCCccccccccccccccccchHHH
Confidence 9999 9999999999999999999999999831 000 00 133468888888
Q ss_pred EEEcCCCCEEEEEECCCCeeEEEee
Q 000425 147 VAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 147 VafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+.|- |.+++|=|-++.|..|...
T Consensus 257 vrw~--gd~ilSkscenaI~~w~pg 279 (385)
T KOG1034|consen 257 VRWF--GDFILSKSCENAIVCWKPG 279 (385)
T ss_pred HHHH--hhheeecccCceEEEEecc
Confidence 8885 5789999999999999873
No 157
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.03 E-value=8e-09 Score=116.42 Aligned_cols=156 Identities=12% Similarity=0.066 Sum_probs=108.1
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEE-EECCCcEEEEEcCCCcEEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYV-AYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaS-Gs~DgtIrLWDl~t~k~l~t 88 (1525)
..++++|++.+++++......+.+++. +.+.....+. +.....+++|+|++.++++ +..|+.|++||+.+++.+..
T Consensus 118 ~~~~~~~dg~~l~~~~~~~~~~~~~d~-~~~~~~~~~~--~~~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~~~ 194 (300)
T TIGR03866 118 EGMAVSPDGKIVVNTSETTNMAHFIDT-KTYEIVDNVL--VDQRPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVIKK 194 (300)
T ss_pred ceEEECCCCCEEEEEecCCCeEEEEeC-CCCeEEEEEE--cCCCccEEEECCCCCEEEEEcCCCCEEEEEEcCcceeeee
Confidence 468999999999888764444545553 3332222222 2345678999999998754 45699999999999988877
Q ss_pred Ee-cCCceeec--CceeEEEeCCCCEEEEE-eCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE-ECCC
Q 000425 89 LQ-LDNTIKLL--GAGAFAFHPTLEWLFVG-DRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL-CRDG 163 (1525)
Q Consensus 89 L~-H~~~V~~~--ga~sVaFSPdg~~L~Sg-S~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg-S~Dg 163 (1525)
+. +...+... ....++|+|++++++.+ +.++.+.+||.++. ...... .+...+.+++|+|++.+|+++ +.++
T Consensus 195 ~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~--~~~~~~-~~~~~~~~~~~~~~g~~l~~~~~~~~ 271 (300)
T TIGR03866 195 ITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTY--EVLDYL-LVGQRVWQLAFTPDEKYLLTTNGVSN 271 (300)
T ss_pred eeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCC--cEEEEE-EeCCCcceEEECCCCCEEEEEcCCCC
Confidence 76 32211100 01268899999986554 56778999999853 222212 345578999999999998876 4689
Q ss_pred CeeEEEee
Q 000425 164 SLQVWKTR 171 (1525)
Q Consensus 164 tIkVWDlr 171 (1525)
+|++||++
T Consensus 272 ~i~v~d~~ 279 (300)
T TIGR03866 272 DVSVIDVA 279 (300)
T ss_pred eEEEEECC
Confidence 99999987
No 158
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.02 E-value=8.9e-09 Score=121.98 Aligned_cols=152 Identities=19% Similarity=0.226 Sum_probs=122.0
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.|.---||.||..+.++++.+ ++.+.+|+ +.+..+... ...++.|++|||.| .++.|...|...+.|.++ +.
T Consensus 368 ~delwgla~hps~~q~~T~gq-dk~v~lW~---~~k~~wt~~--~~d~~~~~~fhpsg-~va~Gt~~G~w~V~d~e~-~~ 439 (626)
T KOG2106|consen 368 GDELWGLATHPSKNQLLTCGQ-DKHVRLWN---DHKLEWTKI--IEDPAECADFHPSG-VVAVGTATGRWFVLDTET-QD 439 (626)
T ss_pred ccceeeEEcCCChhheeeccC-cceEEEcc---CCceeEEEE--ecCceeEeeccCcc-eEEEeeccceEEEEeccc-ce
Confidence 344456899999999888876 78899998 223333222 35899999999999 999999999999999998 44
Q ss_pred EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 86 HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 86 l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
+.++. ...+++ +++|+|+|.+|+.||.|+.|.+|-+..........-..|.++|+.+.|++|+++|.+-|.|-.
T Consensus 440 lv~~~~d~~~ls-----~v~ysp~G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~~gs~ithLDwS~Ds~~~~~~S~d~e 514 (626)
T KOG2106|consen 440 LVTIHTDNEQLS-----VVRYSPDGAFLAVGSHDNHIYIYRVSANGRKYSRVGKCSGSPITHLDWSSDSQFLVSNSGDYE 514 (626)
T ss_pred eEEEEecCCceE-----EEEEcCCCCEEEEecCCCeEEEEEECCCCcEEEEeeeecCceeEEeeecCCCceEEeccCceE
Confidence 44555 677899 999999999999999999999998875433333223345599999999999999999999999
Q ss_pred eeEEEe
Q 000425 165 LQVWKT 170 (1525)
Q Consensus 165 IkVWDl 170 (1525)
|-.|..
T Consensus 515 iLyW~~ 520 (626)
T KOG2106|consen 515 ILYWKP 520 (626)
T ss_pred EEEEcc
Confidence 999943
No 159
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=99.01 E-value=6.7e-09 Score=116.69 Aligned_cols=190 Identities=13% Similarity=0.233 Sum_probs=135.7
Q ss_pred EEcCCCCEEEEEEeccceEEEEEeecC--CccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 13 ALTPLQPVVFFGFHRRMSVTVVGTVEG--GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 13 A~hP~qp~Vf~s~~~d~sI~vv~tieg--gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
.+.-..|...-.++-|-++.+|+...+ +...+.+.. |.+.|..|+|...+. .||+.+.||.||+||++..+.-..+
T Consensus 157 DWne~dp~~igtSSiDTTCTiWdie~~~~~~vkTQLIA-HDKEV~DIaf~~~s~~~FASvgaDGSvRmFDLR~leHSTII 235 (364)
T KOG0290|consen 157 DWNEVDPNLIGTSSIDTTCTIWDIETGVSGTVKTQLIA-HDKEVYDIAFLKGSRDVFASVGADGSVRMFDLRSLEHSTII 235 (364)
T ss_pred ccccCCcceeEeecccCeEEEEEEeeccccceeeEEEe-cCcceeEEEeccCccceEEEecCCCcEEEEEecccccceEE
Confidence 344556777777777899999996543 223556665 999999999998664 7999999999999999876543332
Q ss_pred -e---cCCceeecCceeEEEeC-CCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEEECC
Q 000425 90 -Q---LDNTIKLLGAGAFAFHP-TLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTLCRD 162 (1525)
Q Consensus 90 -~---H~~~V~~~ga~sVaFSP-dg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSgS~D 162 (1525)
+ ...+.. .++|++ |.+++++-. +...|.|-|++.. ...+..+.+|...|++|+|+|. ...|+||++|
T Consensus 236 YE~p~~~~pLl-----RLswnkqDpnymATf~~dS~~V~iLDiR~P-~tpva~L~~H~a~VNgIaWaPhS~~hictaGDD 309 (364)
T KOG0290|consen 236 YEDPSPSTPLL-----RLSWNKQDPNYMATFAMDSNKVVILDIRVP-CTPVARLRNHQASVNGIAWAPHSSSHICTAGDD 309 (364)
T ss_pred ecCCCCCCcce-----eeccCcCCchHHhhhhcCCceEEEEEecCC-CcceehhhcCcccccceEecCCCCceeeecCCc
Confidence 2 244566 788988 556777665 6678999999953 3444568899999999999996 5689999999
Q ss_pred CCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecC-CCeEEEEecc
Q 000425 163 GSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPR-LNLAVLLFAN 232 (1525)
Q Consensus 163 gtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~-~Nlial~F~~ 232 (1525)
..+-+||+.....++. ...++.+..| ..|+.+.|.|. .+.++++|..
T Consensus 310 ~qaliWDl~q~~~~~~-----------------~dPilay~a~------~EVNqi~Ws~~~~Dwiai~~~k 357 (364)
T KOG0290|consen 310 CQALIWDLQQMPRENG-----------------EDPILAYTAG------GEVNQIQWSSSQPDWIAICFGK 357 (364)
T ss_pred ceEEEEecccccccCC-----------------CCchhhhhcc------ceeeeeeecccCCCEEEEEecC
Confidence 9999999985443110 0112222211 14677888865 4678888765
No 160
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.01 E-value=2.2e-09 Score=122.81 Aligned_cols=145 Identities=15% Similarity=0.220 Sum_probs=112.0
Q ss_pred CEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC--CCEEEEEECCCcEEEEEcCCCcEEEEEe---cC-
Q 000425 19 PVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR--LPVLYVAYADGLIRAYNIHTYAVHYTLQ---LD- 92 (1525)
Q Consensus 19 p~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd--g~~LaSGs~DgtIrLWDl~t~k~l~tL~---H~- 92 (1525)
..|+++. +..++++++... +.....+++ |...++.+.|... ...+.+|+.||+||+||+++......+. +.
T Consensus 41 ~~vav~l-Sngsv~lyd~~t-g~~l~~fk~-~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~ 117 (376)
T KOG1188|consen 41 TAVAVSL-SNGSVRLYDKGT-GQLLEEFKG-PPATTNGVRFISCDSPHGVISCSSDGTVRLWDIRSQAESARISWTQQSG 117 (376)
T ss_pred eeEEEEe-cCCeEEEEeccc-hhhhheecC-CCCcccceEEecCCCCCeeEEeccCCeEEEEEeecchhhhheeccCCCC
Confidence 3466666 478899988644 566667776 8888888888764 3468999999999999999886655555 43
Q ss_pred CceeecCceeEEEeCCCCEEEEEe----CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC-CEEEEEECCCCeeE
Q 000425 93 NTIKLLGAGAFAFHPTLEWLFVGD----RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML-RLLVTLCRDGSLQV 167 (1525)
Q Consensus 93 ~~V~~~ga~sVaFSPdg~~L~SgS----~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg-~~LaSgS~DgtIkV 167 (1525)
.+.. +++-.-.++.+++|. .|-.|.+||++...+.+-.....|...|+++.|||.. .+|+|||.||-|.+
T Consensus 118 ~~f~-----~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lrFHP~~pnlLlSGSvDGLvnl 192 (376)
T KOG1188|consen 118 TPFI-----CLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHNDDVTQLRFHPSDPNLLLSGSVDGLVNL 192 (376)
T ss_pred Ccce-----EeeccCcCCeEEeccccccCceEEEEEEeccccchhhhhhhhccCcceeEEecCCCCCeEEeecccceEEe
Confidence 4556 677666777777775 5788999999975332444567899999999999964 58999999999999
Q ss_pred EEee
Q 000425 168 WKTR 171 (1525)
Q Consensus 168 WDlr 171 (1525)
+|++
T Consensus 193 fD~~ 196 (376)
T KOG1188|consen 193 FDTK 196 (376)
T ss_pred eecC
Confidence 9986
No 161
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.00 E-value=2.7e-08 Score=111.03 Aligned_cols=181 Identities=13% Similarity=0.130 Sum_probs=135.0
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEE
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLA 123 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrV 123 (1525)
.+.+ |+.|++.|.++.+|.+|++++.|.+..+|-..+++.+.++. |...|. |++...+.+++++|+.|.++++
T Consensus 5 ~l~G-HERplTqiKyN~eGDLlFscaKD~~~~vw~s~nGerlGty~GHtGavW-----~~Did~~s~~liTGSAD~t~kL 78 (327)
T KOG0643|consen 5 LLQG-HERPLTQIKYNREGDLLFSCAKDSTPTVWYSLNGERLGTYDGHTGAVW-----CCDIDWDSKHLITGSADQTAKL 78 (327)
T ss_pred cccc-CccccceEEecCCCcEEEEecCCCCceEEEecCCceeeeecCCCceEE-----EEEecCCcceeeeccccceeEE
Confidence 3444 99999999999999999999999999999988999999999 999999 9999999999999999999999
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC-----CeeEEEeeeecCCCCCCCccccccCCCcccccccc
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG-----SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPR 198 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg-----tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~ 198 (1525)
||+.+. +....+. -...|..+.|+++|++++.++++. .|.++|++.... ... .-+| +-.
T Consensus 79 WDv~tG--k~la~~k-~~~~Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~-----~~~-s~ep-------~~k 142 (327)
T KOG0643|consen 79 WDVETG--KQLATWK-TNSPVKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSS-----DID-SEEP-------YLK 142 (327)
T ss_pred EEcCCC--cEEEEee-cCCeeEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChh-----hhc-ccCc-------eEE
Confidence 999964 2332333 367899999999999988876543 688888872110 000 0000 000
Q ss_pred cccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccC
Q 000425 199 ILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARG 262 (1525)
Q Consensus 199 iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~ 262 (1525)
+.. +-..++...|.|..-.+. .+.++.++..|..+.|. ++++.-+.|..
T Consensus 143 I~t--------~~skit~a~Wg~l~~~ii------~Ghe~G~is~~da~~g~-~~v~s~~~h~~ 191 (327)
T KOG0643|consen 143 IPT--------PDSKITSALWGPLGETII------AGHEDGSISIYDARTGK-ELVDSDEEHSS 191 (327)
T ss_pred ecC--------CccceeeeeecccCCEEE------EecCCCcEEEEEcccCc-eeeechhhhcc
Confidence 000 112466778888877766 77888888888877762 44445455544
No 162
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.00 E-value=9.9e-09 Score=120.88 Aligned_cols=157 Identities=15% Similarity=0.168 Sum_probs=120.4
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCc-cceEEccCCC-CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR-APTKIKTDLK-KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk-~~tklk~~H~-~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
...+|+|++..+.+.+.++.-...|+.-+.-. +...+.+ |. ..+....++|++++++..+..|.|.|....|++.+.
T Consensus 261 ~~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g-~e~~~~e~FeVShd~~fia~~G~~G~I~lLhakT~eli~ 339 (514)
T KOG2055|consen 261 QKAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYG-VEEKSMERFEVSHDSNFIAIAGNNGHIHLLHAKTKELIT 339 (514)
T ss_pred ceeeecCCCceEEEecccceEEEEeeccccccccccCCCC-cccchhheeEecCCCCeEEEcccCceEEeehhhhhhhhh
Confidence 45789999994444444577777787633211 1111222 43 457788999999999999999999999999999999
Q ss_pred EEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEee-cccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGII-QVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il-~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
+++....|. .++|+.|++.|++++.+|.|.+||+++. .+...+ ....-.-++++.++++.||++||+.|-|.
T Consensus 340 s~KieG~v~-----~~~fsSdsk~l~~~~~~GeV~v~nl~~~--~~~~rf~D~G~v~gts~~~S~ng~ylA~GS~~GiVN 412 (514)
T KOG2055|consen 340 SFKIEGVVS-----DFTFSSDSKELLASGGTGEVYVWNLRQN--SCLHRFVDDGSVHGTSLCISLNGSYLATGSDSGIVN 412 (514)
T ss_pred eeeeccEEe-----eEEEecCCcEEEEEcCCceEEEEecCCc--ceEEEEeecCccceeeeeecCCCceEEeccCcceEE
Confidence 999888899 9999999999999999999999999953 443222 22222336788889999999999999999
Q ss_pred EEEeeeec
Q 000425 167 VWKTRVII 174 (1525)
Q Consensus 167 VWDlr~~~ 174 (1525)
|||.+...
T Consensus 413 IYd~~s~~ 420 (514)
T KOG2055|consen 413 IYDGNSCF 420 (514)
T ss_pred Eeccchhh
Confidence 99987443
No 163
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=99.00 E-value=4.6e-10 Score=138.15 Aligned_cols=112 Identities=12% Similarity=0.258 Sum_probs=102.8
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
+|...|+|+.|.-+|.++++|++|..||||..++..++.++. |...++ .++.+...-.++++|.|..|+||-+.
T Consensus 188 gH~naVyca~fDrtg~~Iitgsdd~lvKiwS~et~~~lAs~rGhs~dit-----dlavs~~n~~iaaaS~D~vIrvWrl~ 262 (1113)
T KOG0644|consen 188 GHRNAVYCAIFDRTGRYIITGSDDRLVKIWSMETARCLASCRGHSGDIT-----DLAVSSNNTMIAAASNDKVIRVWRLP 262 (1113)
T ss_pred hhhhheeeeeeccccceEeecCccceeeeeeccchhhhccCCCCccccc-----hhccchhhhhhhhcccCceEEEEecC
Confidence 499999999999999999999999999999999999999999 999999 99998888899999999999999998
Q ss_pred CCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 128 IERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 128 t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.. ..+..+.+|++.|++++|+|-. +.+.|||+++||.|
T Consensus 263 ~~--~pvsvLrghtgavtaiafsP~~----sss~dgt~~~wd~r 300 (1113)
T KOG0644|consen 263 DG--APVSVLRGHTGAVTAIAFSPRA----SSSDDGTCRIWDAR 300 (1113)
T ss_pred CC--chHHHHhccccceeeeccCccc----cCCCCCceEecccc
Confidence 53 3334577999999999999976 77899999999999
No 164
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=98.99 E-value=7e-09 Score=116.37 Aligned_cols=161 Identities=17% Similarity=0.188 Sum_probs=113.9
Q ss_pred CcEEEEEEcCCCCEEEEEEeccce-------EEEEEeecCCcc--------ceEEccCCCCCeEEEEEcCCCCEEEEEEC
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMS-------VTVVGTVEGGRA--------PTKIKTDLKKPIVNLACHPRLPVLYVAYA 71 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~s-------I~vv~tieggk~--------~tklk~~H~~~V~sLaf~Pdg~~LaSGs~ 71 (1525)
+-.-+||-||..+.||.+...+.+ ..+|..-+.-.. +..+.+.|-+.|.|+.|+|++..+++-.
T Consensus 64 gEvw~las~P~d~~ilaT~yn~~s~s~vl~~aaiw~ipe~~~~S~~~tlE~v~~Ldteavg~i~cvew~Pns~klasm~- 142 (370)
T KOG1007|consen 64 GEVWDLASSPFDQRILATVYNDTSDSGVLTGAAIWQIPEPLGQSNSSTLECVASLDTEAVGKINCVEWEPNSDKLASMD- 142 (370)
T ss_pred cceehhhcCCCCCceEEEEEeccCCCcceeeEEEEecccccCccccchhhHhhcCCHHHhCceeeEEEcCCCCeeEEec-
Confidence 345589999999999887665322 234443222111 1223334567999999999998887754
Q ss_pred CCcEEEEEcCCCcE-EEEEe------cCCceeecCceeEEEeC--CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCc
Q 000425 72 DGLIRAYNIHTYAV-HYTLQ------LDNTIKLLGAGAFAFHP--TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQ 142 (1525)
Q Consensus 72 DgtIrLWDl~t~k~-l~tL~------H~~~V~~~ga~sVaFSP--dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~ 142 (1525)
|..|-+|++..... ...+. |....+ +-+|+| +|..+++. .|++++.||+++. .....+-.+|.+
T Consensus 143 dn~i~l~~l~ess~~vaev~ss~s~e~~~~ft-----sg~WspHHdgnqv~tt-~d~tl~~~D~RT~-~~~~sI~dAHgq 215 (370)
T KOG1007|consen 143 DNNIVLWSLDESSKIVAEVLSSESAEMRHSFT-----SGAWSPHHDGNQVATT-SDSTLQFWDLRTM-KKNNSIEDAHGQ 215 (370)
T ss_pred cCceEEEEcccCcchheeecccccccccceec-----ccccCCCCccceEEEe-CCCcEEEEEccch-hhhcchhhhhcc
Confidence 88999999887654 33332 345556 778888 56655544 6899999999975 222335679999
Q ss_pred CeEEEEEcCCCC-EEEEEECCCCeeEEEeeeecC
Q 000425 143 PITSVAWLPMLR-LLVTLCRDGSLQVWKTRVIIN 175 (1525)
Q Consensus 143 ~VtsVafsPdg~-~LaSgS~DgtIkVWDlr~~~~ 175 (1525)
.|+.+.|+|+.. +|+||++||.|++||.|....
T Consensus 216 ~vrdlDfNpnkq~~lvt~gDdgyvriWD~R~tk~ 249 (370)
T KOG1007|consen 216 RVRDLDFNPNKQHILVTCGDDGYVRIWDTRKTKF 249 (370)
T ss_pred eeeeccCCCCceEEEEEcCCCccEEEEeccCCCc
Confidence 999999999864 689999999999999995543
No 165
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=98.98 E-value=2.5e-09 Score=128.57 Aligned_cols=205 Identities=16% Similarity=0.206 Sum_probs=137.8
Q ss_pred EEEEcCCCCEEEEEEeccceEE-EEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVT-VVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTL 89 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~-vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL 89 (1525)
.+.+|-.+.++.+......+-. ++....+.+...-+.. -++.|.++.|||..++|+++.. ..|||||+.....++.+
T Consensus 526 ~vtWHrkGDYlatV~~~~~~~~VliHQLSK~~sQ~PF~k-skG~vq~v~FHPs~p~lfVaTq-~~vRiYdL~kqelvKkL 603 (733)
T KOG0650|consen 526 QVTWHRKGDYLATVMPDSGNKSVLIHQLSKRKSQSPFRK-SKGLVQRVKFHPSKPYLFVATQ-RSVRIYDLSKQELVKKL 603 (733)
T ss_pred eeeeecCCceEEEeccCCCcceEEEEecccccccCchhh-cCCceeEEEecCCCceEEEEec-cceEEEehhHHHHHHHH
Confidence 4556666666554443222211 2222333333333322 4567899999999999998876 67999999888777777
Q ss_pred e-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
. ....|. .++.||.|..|+.|+.|+.+..+|+.-. ......+.-|...|++|+||+..++|+|||+|+++.|+
T Consensus 604 ~tg~kwiS-----~msihp~GDnli~gs~d~k~~WfDldls-skPyk~lr~H~~avr~Va~H~ryPLfas~sdDgtv~Vf 677 (733)
T KOG0650|consen 604 LTGSKWIS-----SMSIHPNGDNLILGSYDKKMCWFDLDLS-SKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGTVIVF 677 (733)
T ss_pred hcCCeeee-----eeeecCCCCeEEEecCCCeeEEEEcccC-cchhHHhhhhhhhhhhhhhccccceeeeecCCCcEEEE
Confidence 7 667888 9999999999999999999999998843 12223455799999999999999999999999999999
Q ss_pred EeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccc
Q 000425 169 KTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 169 Dlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
.-++...-...++. .| -..|-++ .+..--+|-.+.|||....+. +++-|..++.|
T Consensus 678 hg~VY~Dl~qnpli----VP--------lK~L~gH---~~~~~~gVLd~~wHP~qpWLf------sAGAd~tirlf 732 (733)
T KOG0650|consen 678 HGMVYNDLLQNPLI----VP--------LKRLRGH---EKTNDLGVLDTIWHPRQPWLF------SAGADGTIRLF 732 (733)
T ss_pred eeeeehhhhcCCce----Ee--------eeeccCc---eeecccceEeecccCCCceEE------ecCCCceEEee
Confidence 87755331111111 11 0111111 111112577888999998766 66666666544
No 166
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.96 E-value=1.4e-09 Score=138.77 Aligned_cols=203 Identities=18% Similarity=0.267 Sum_probs=139.3
Q ss_pred cEEEEEEcCCCCE---EEEEEeccceEEEEEeec--CCcc--ceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEE
Q 000425 8 TEVHLALTPLQPV---VFFGFHRRMSVTVVGTVE--GGRA--PTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYN 79 (1525)
Q Consensus 8 ~e~~LA~hP~qp~---Vf~s~~~d~sI~vv~tie--ggk~--~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWD 79 (1525)
+.=-|++.+.++- |..|.-.+++|.+|+... .+.. ...-+..|+++|++++|++.+. ++|+|+.||.|.|||
T Consensus 66 rF~kL~W~~~g~~~~GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~q~nlLASGa~~geI~iWD 145 (1049)
T KOG0307|consen 66 RFNKLAWGSYGSHSHGLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPFQGNLLASGADDGEILIWD 145 (1049)
T ss_pred cceeeeecccCCCccceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeeccccCCceeeccCCCCcEEEec
Confidence 4456888888777 567777899999998754 1221 2222346999999999999877 899999999999999
Q ss_pred cCCCcEEEEEe---cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccC--cCeEEEEEcCCC
Q 000425 80 IHTYAVHYTLQ---LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGS--QPITSVAWLPML 153 (1525)
Q Consensus 80 l~t~k~l~tL~---H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~--~~VtsVafsPdg 153 (1525)
+...+.-+++. ....|. +|+|+..-. .|++++.+|.+.|||++.. +.++ .+..|. ..+..++|||+.
T Consensus 146 lnn~~tP~~~~~~~~~~eI~-----~lsWNrkvqhILAS~s~sg~~~iWDlr~~-~pii-~ls~~~~~~~~S~l~WhP~~ 218 (1049)
T KOG0307|consen 146 LNKPETPFTPGSQAPPSEIK-----CLSWNRKVSHILASGSPSGRAVIWDLRKK-KPII-KLSDTPGRMHCSVLAWHPDH 218 (1049)
T ss_pred cCCcCCCCCCCCCCCcccce-----EeccchhhhHHhhccCCCCCceeccccCC-Cccc-ccccCCCccceeeeeeCCCC
Confidence 98765544442 456788 999998655 4667789999999999953 2233 122333 346789999986
Q ss_pred C-EEEEEECCC---CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEE
Q 000425 154 R-LLVTLCRDG---SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLL 229 (1525)
Q Consensus 154 ~-~LaSgS~Dg---tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~ 229 (1525)
- .|+++|+|. .|++||+|....|-+ ++..+. .+|..++|.+..+-..+
T Consensus 219 aTql~~As~dd~~PviqlWDlR~assP~k--------------------~~~~H~-------~GilslsWc~~D~~lll- 270 (1049)
T KOG0307|consen 219 ATQLLVASGDDSAPVIQLWDLRFASSPLK--------------------ILEGHQ-------RGILSLSWCPQDPRLLL- 270 (1049)
T ss_pred ceeeeeecCCCCCceeEeecccccCCchh--------------------hhcccc-------cceeeeccCCCCchhhh-
Confidence 3 566777665 799999995443211 111111 24667778777643221
Q ss_pred ecccccCCcccccccccccC
Q 000425 230 FANFTGGDNIKNRAAYTREG 249 (1525)
Q Consensus 230 F~~~s~~Dd~~~k~~~~~eg 249 (1525)
+++.|..+..|...+|
T Consensus 271 ----SsgkD~~ii~wN~~tg 286 (1049)
T KOG0307|consen 271 ----SSGKDNRIICWNPNTG 286 (1049)
T ss_pred ----cccCCCCeeEecCCCc
Confidence 6677777777776655
No 167
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.96 E-value=4.5e-09 Score=125.31 Aligned_cols=137 Identities=14% Similarity=0.203 Sum_probs=110.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecC-------------C--------------ccceEEccCCCCCeEEEEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-------------G--------------RAPTKIKTDLKKPIVNLAC 59 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-------------g--------------k~~tklk~~H~~~V~sLaf 59 (1525)
.+.-+|.+-|-.+-+|+.++.+....+++..-. + .+..+... -++.|...+|
T Consensus 220 tsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w~~-~~g~in~f~F 298 (636)
T KOG2394|consen 220 SSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARWHI-GEGSINEFAF 298 (636)
T ss_pred cceEEEEEEeCCCceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCccceeEe-ccccccceeE
Confidence 456678888888888888888888877743110 0 01111111 1346888999
Q ss_pred cCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeec
Q 000425 60 HPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQ 138 (1525)
Q Consensus 60 ~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~ 138 (1525)
+|||.+||+-+.||.+||+|..+.+++-.++ .-.... ||+|||||+||++|+.|-.|.||.+. .+++++--+
T Consensus 299 S~DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLL-----CvcWSPDGKyIvtGGEDDLVtVwSf~--erRVVARGq 371 (636)
T KOG2394|consen 299 SPDGKYLATVSQDGFLRIFDFDTQELLGVMKSYFGGLL-----CVCWSPDGKYIVTGGEDDLVTVWSFE--ERRVVARGQ 371 (636)
T ss_pred cCCCceEEEEecCceEEEeeccHHHHHHHHHhhccceE-----EEEEcCCccEEEecCCcceEEEEEec--cceEEEecc
Confidence 9999999999999999999999988888887 667788 99999999999999999999999998 467777778
Q ss_pred ccCcCeEEEEEcC
Q 000425 139 VGSQPITSVAWLP 151 (1525)
Q Consensus 139 gH~~~VtsVafsP 151 (1525)
+|.+.|+.|+|.|
T Consensus 372 GHkSWVs~VaFDp 384 (636)
T KOG2394|consen 372 GHKSWVSVVAFDP 384 (636)
T ss_pred ccccceeeEeecc
Confidence 9999999999984
No 168
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=98.96 E-value=3e-08 Score=112.05 Aligned_cols=154 Identities=21% Similarity=0.349 Sum_probs=120.6
Q ss_pred EEEEE-cCCCCEEEEE-EeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CCcEEEEEcCCCcEE
Q 000425 10 VHLAL-TPLQPVVFFG-FHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DGLIRAYNIHTYAVH 86 (1525)
Q Consensus 10 ~~LA~-hP~qp~Vf~s-~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-DgtIrLWDl~t~k~l 86 (1525)
..+.+ +++...++.. ...+..+.+|+..........+.. |...|+.++|+|++..+++++. |+.+++|++.+...+
T Consensus 113 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (466)
T COG2319 113 SKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEG-HSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPL 191 (466)
T ss_pred eeEEEECCCcceEEeccCCCCccEEEEEecCCCeEEEEEec-CcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceE
Confidence 33444 7787744444 333778888875331444555555 9999999999999998888885 999999999998899
Q ss_pred EEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeE-eecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIG-IIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~-il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
..+. |...+. +++|+|++. .+++++.|++|++||... ..... .+.+|...+ ...|+|++.++++++.|+
T Consensus 192 ~~~~~~~~~v~-----~~~~~~~~~~~~~~~~~d~~i~~wd~~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~ 263 (466)
T COG2319 192 STLAGHTDPVS-----SLAFSPDGGLLIASGSSDGTIRLWDLST--GKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDG 263 (466)
T ss_pred EeeccCCCceE-----EEEEcCCcceEEEEecCCCcEEEEECCC--CcEEeeecCCCCcce-eEeECCCCCEEEEecCCC
Confidence 9999 999999 999999998 555558999999998883 33333 466788775 448999999999999999
Q ss_pred CeeEEEeee
Q 000425 164 SLQVWKTRV 172 (1525)
Q Consensus 164 tIkVWDlr~ 172 (1525)
.+++||.+.
T Consensus 264 ~~~~~~~~~ 272 (466)
T COG2319 264 TIRLWDLRS 272 (466)
T ss_pred cEEEeeecC
Confidence 999999873
No 169
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=98.95 E-value=1.9e-08 Score=110.78 Aligned_cols=154 Identities=18% Similarity=0.227 Sum_probs=111.4
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeec--------CCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVE--------GGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYN 79 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tie--------ggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWD 79 (1525)
+.+.-|++|...+++++. ...++.+....+ .++....-...|.++|+.++|+ ..+|++|+ ||.|+-|.
T Consensus 12 tvf~qa~sp~~~~l~agn-~~G~iav~sl~sl~s~sa~~~gk~~iv~eqahdgpiy~~~f~--d~~Lls~g-dG~V~gw~ 87 (325)
T KOG0649|consen 12 TVFAQAISPSKQYLFAGN-LFGDIAVLSLKSLDSGSAEPPGKLKIVPEQAHDGPIYYLAFH--DDFLLSGG-DGLVYGWE 87 (325)
T ss_pred HHHHHhhCCcceEEEEec-CCCeEEEEEehhhhccccCCCCCcceeeccccCCCeeeeeee--hhheeecc-CceEEEee
Confidence 455668899998876665 577776654321 1333333335699999999999 45667665 69999997
Q ss_pred cCCCc------EEEEEe---cC-----CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeE
Q 000425 80 IHTYA------VHYTLQ---LD-----NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPIT 145 (1525)
Q Consensus 80 l~t~k------~l~tL~---H~-----~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~Vt 145 (1525)
..... .+-.+. |. -.|+ ++-..|..+.++.++.|+.+..||+++. ++...+.+|+++|+
T Consensus 88 W~E~~es~~~K~lwe~~~P~~~~~~evPeIN-----am~ldP~enSi~~AgGD~~~y~~dlE~G--~i~r~~rGHtDYvH 160 (325)
T KOG0649|consen 88 WNEEEESLATKRLWEVKIPMQVDAVEVPEIN-----AMWLDPSENSILFAGGDGVIYQVDLEDG--RIQREYRGHTDYVH 160 (325)
T ss_pred ehhhhhhccchhhhhhcCccccCcccCCccc-----eeEeccCCCcEEEecCCeEEEEEEecCC--EEEEEEcCCcceee
Confidence 54321 111111 32 2355 7888888888888889999999999964 44455889999999
Q ss_pred EEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 146 SVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 146 sVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+++--.....++||+.||++|+||.++
T Consensus 161 ~vv~R~~~~qilsG~EDGtvRvWd~kt 187 (325)
T KOG0649|consen 161 SVVGRNANGQILSGAEDGTVRVWDTKT 187 (325)
T ss_pred eeeecccCcceeecCCCccEEEEeccc
Confidence 999866666789999999999999983
No 170
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=98.94 E-value=6.3e-09 Score=125.01 Aligned_cols=198 Identities=20% Similarity=0.253 Sum_probs=142.1
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecC--------CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG--------GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg--------gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLW 78 (1525)
|...+++|||..|.++.+. -+..+.+|.. ++ ..+...+.. |++||.|+++.+.+.++++|+.||+|+.|
T Consensus 295 d~ir~l~~~~sep~lit~s-ed~~lk~WnL-qk~~~s~~~~~epi~tfra-H~gPVl~v~v~~n~~~~ysgg~Dg~I~~w 371 (577)
T KOG0642|consen 295 DCIRALAFHPSEPVLITAS-EDGTLKLWNL-QKAKKSAEKDVEPILTFRA-HEGPVLCVVVPSNGEHCYSGGIDGTIRCW 371 (577)
T ss_pred hhhhhhhcCCCCCeEEEec-cccchhhhhh-cccCCccccceeeeEEEec-ccCceEEEEecCCceEEEeeccCceeeee
Confidence 5678899999999988776 4788888886 22 123344555 99999999999999999999999999999
Q ss_pred EcC-------CC---cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE------------
Q 000425 79 NIH-------TY---AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG------------ 135 (1525)
Q Consensus 79 Dl~-------t~---k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~------------ 135 (1525)
++. +. ....++. |.+.|. .+++|.....|+++|.||||+.|+.........+
T Consensus 372 ~~p~n~dp~ds~dp~vl~~~l~Ghtdavw-----~l~~s~~~~~Llscs~DgTvr~w~~~~~~~~~f~~~~e~g~Plsvd 446 (577)
T KOG0642|consen 372 NLPPNQDPDDSYDPSVLSGTLLGHTDAVW-----LLALSSTKDRLLSCSSDGTVRLWEPTEESPCTFGEPKEHGYPLSVD 446 (577)
T ss_pred ccCCCCCcccccCcchhccceecccccee-----eeeecccccceeeecCCceEEeeccCCcCccccCCccccCCcceEe
Confidence 543 12 2345566 999999 9999999999999999999999987642110000
Q ss_pred ------------------------------eec-------ccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCC
Q 000425 136 ------------------------------IIQ-------VGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNR 178 (1525)
Q Consensus 136 ------------------------------il~-------gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~ 178 (1525)
++. .-...++-+.+||.+++.+++..|++|+++|..+..--
T Consensus 447 ~~ss~~a~~~~s~~~~~~~~~~~ev~s~~~~~~s~~~~~~~~~~~in~vVs~~~~~~~~~~hed~~Ir~~dn~~~~~l-- 524 (577)
T KOG0642|consen 447 RTSSRPAHSLASFRFGYTSIDDMEVVSDLLIFESSASPGPRRYPQINKVVSHPTADITFTAHEDRSIRFFDNKTGKIL-- 524 (577)
T ss_pred eccchhHhhhhhcccccccchhhhhhhheeeccccCCCcccccCccceEEecCCCCeeEecccCCceecccccccccc--
Confidence 000 01134567889999999999999999999987633210
Q ss_pred CCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCccccccccc
Q 000425 179 PPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 179 ~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~ 246 (1525)
+.+ +-.+..+..+++.|++..+. +++-|..++.|+.
T Consensus 525 ~s~--------------------------~a~~~svtslai~~ng~~l~------s~s~d~sv~l~kl 560 (577)
T KOG0642|consen 525 HSM--------------------------VAHKDSVTSLAIDPNGPYLM------SGSHDGSVRLWKL 560 (577)
T ss_pred hhe--------------------------eeccceecceeecCCCceEE------eecCCceeehhhc
Confidence 000 11112456788888888776 6666777776664
No 171
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=98.92 E-value=5.1e-08 Score=110.13 Aligned_cols=153 Identities=22% Similarity=0.361 Sum_probs=120.8
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEE-
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHY- 87 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~- 87 (1525)
-.++++|....++.+...+..+.+|+... ......+.. |...|.+++|+|+++ .+++++.|++|++||...+....
T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~ 236 (466)
T COG2319 159 TSLAFSPDGKLLASGSSLDGTIKLWDLRT-GKPLSTLAG-HTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRS 236 (466)
T ss_pred EEEEECCCCCEEEecCCCCCceEEEEcCC-CceEEeecc-CCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEee
Confidence 36999999997666643378888887543 444455555 999999999999998 56666999999999998888877
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
.+. |..... . .|+|++..+++++.|+.+++||......... ...+|...|.++.|+|++..+++++.|+.++
T Consensus 237 ~~~~~~~~~~-----~-~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~~~~~~~~~~d~~~~ 309 (466)
T COG2319 237 TLSGHSDSVV-----S-SFSPDGSLLASGSSDGTIRLWDLRSSSSLLR-TLSGHSSSVLSVAFSPDGKLLASGSSDGTVR 309 (466)
T ss_pred ecCCCCccee-----E-eECCCCCEEEEecCCCcEEEeeecCCCcEEE-EEecCCccEEEEEECCCCCEEEEeeCCCcEE
Confidence 566 665422 2 7899998888999999999999986433122 2368899999999999988888899999999
Q ss_pred EEEee
Q 000425 167 VWKTR 171 (1525)
Q Consensus 167 VWDlr 171 (1525)
+||.+
T Consensus 310 ~~~~~ 314 (466)
T COG2319 310 LWDLE 314 (466)
T ss_pred EEEcC
Confidence 99876
No 172
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=98.90 E-value=2.3e-09 Score=126.47 Aligned_cols=150 Identities=15% Similarity=0.208 Sum_probs=124.7
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
.||++|+.-+.|.++ .+..|.||+..+. ..+..+.+ |++.+.||.++++|..|-+|+.|.+||-||++++..+....
T Consensus 514 ALa~spDakvcFscc-sdGnI~vwDLhnq-~~VrqfqG-htDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqhd 590 (705)
T KOG0639|consen 514 ALAISPDAKVCFSCC-SDGNIAVWDLHNQ-TLVRQFQG-HTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQHD 590 (705)
T ss_pred hhhcCCccceeeeec-cCCcEEEEEcccc-eeeecccC-CCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhhh
Confidence 588999998776666 6899999997654 33444555 99999999999999999999999999999999887655444
Q ss_pred cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 91 LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 91 H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
....|. ++.++|++.|++.|-..+.|.|-..+...+.. +.-|.+-|.++.|.+.|++++|.+.|.-+..|..
T Consensus 591 F~SQIf-----SLg~cP~~dWlavGMens~vevlh~skp~kyq---lhlheScVLSlKFa~cGkwfvStGkDnlLnawrt 662 (705)
T KOG0639|consen 591 FSSQIF-----SLGYCPTGDWLAVGMENSNVEVLHTSKPEKYQ---LHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRT 662 (705)
T ss_pred hhhhhe-----ecccCCCccceeeecccCcEEEEecCCcccee---ecccccEEEEEEecccCceeeecCchhhhhhccC
Confidence 457788 99999999999999999998887766432222 3468999999999999999999999999999976
Q ss_pred e
Q 000425 171 R 171 (1525)
Q Consensus 171 r 171 (1525)
-
T Consensus 663 P 663 (705)
T KOG0639|consen 663 P 663 (705)
T ss_pred c
Confidence 4
No 173
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=98.90 E-value=8.8e-09 Score=121.59 Aligned_cols=159 Identities=16% Similarity=0.235 Sum_probs=125.7
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~ 83 (1525)
.-+|-+++++..-+.-++..+. ....|++|+... ...-..++ +|+..|+++.++-++.++|+++..|-|.|..+.++
T Consensus 77 ~~gd~~~Cv~~~s~S~y~~sgG-~~~~Vkiwdl~~-kl~hr~lk-dh~stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~ 153 (673)
T KOG4378|consen 77 LTGDNAFCVACASQSLYEISGG-QSGCVKIWDLRA-KLIHRFLK-DHQSTVTYVDYNNTDEYIASVSDGGDIIIHGTKTK 153 (673)
T ss_pred cccchHHHHhhhhcceeeeccC-cCceeeehhhHH-HHHhhhcc-CCcceeEEEEecCCcceeEEeccCCcEEEEecccC
Confidence 3456677777776665554444 577888888642 22223344 59999999999999999999999999999999999
Q ss_pred cEEEEEec--CCceeecCceeEEEeCCCCEEE-EEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEE
Q 000425 84 AVHYTLQL--DNTIKLLGAGAFAFHPTLEWLF-VGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTL 159 (1525)
Q Consensus 84 k~l~tL~H--~~~V~~~ga~sVaFSPdg~~L~-SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSg 159 (1525)
....+|.| ...|+ .++|||..++++ ++|++|.|.+||++...+.+ .....|..+...|+|+|. ..+|+|.
T Consensus 154 ~~tt~f~~~sgqsvR-----ll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~-~~~~~HsAP~~gicfspsne~l~vsV 227 (673)
T KOG4378|consen 154 QKTTTFTIDSGQSVR-----LLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIF-HASEAHSAPCRGICFSPSNEALLVSV 227 (673)
T ss_pred ccccceecCCCCeEE-----EeecccccceeeEeeccCCeEEEEeccCCCccc-chhhhccCCcCcceecCCccceEEEe
Confidence 88888883 46788 999999887654 78899999999999643322 345689999999999996 4578999
Q ss_pred ECCCCeeEEEee
Q 000425 160 CRDGSLQVWKTR 171 (1525)
Q Consensus 160 S~DgtIkVWDlr 171 (1525)
+.|+.|.+||++
T Consensus 228 G~Dkki~~yD~~ 239 (673)
T KOG4378|consen 228 GYDKKINIYDIR 239 (673)
T ss_pred cccceEEEeecc
Confidence 999999999998
No 174
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=98.87 E-value=1.1e-08 Score=119.16 Aligned_cols=117 Identities=21% Similarity=0.351 Sum_probs=97.3
Q ss_pred CCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC-------c--EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC
Q 000425 51 KKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY-------A--VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG 119 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~-------k--~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg 119 (1525)
..+|.++.|++.+. .+++|+.|..||+|-+... + -+..|. |...|+ +|.|+|+|..++||+++|
T Consensus 13 ~~pv~s~dfq~n~~~~laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN-----~vRf~p~gelLASg~D~g 87 (434)
T KOG1009|consen 13 HEPVYSVDFQKNSLNKLATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVN-----VVRFSPDGELLASGGDGG 87 (434)
T ss_pred CCceEEEEeccCcccceecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeE-----EEEEcCCcCeeeecCCCc
Confidence 46899999999876 8999999999999986432 1 234455 999999 999999999999999999
Q ss_pred cEEEEeCC--------CC------CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 120 TLLAWDVS--------IE------RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 120 tVrVWDl~--------t~------~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
.|.+|-.. +. .......+.+|...|..++|+|++.+++++|.|.++++||++.
T Consensus 88 ~v~lWk~~~~~~~~~d~e~~~~ke~w~v~k~lr~h~~diydL~Ws~d~~~l~s~s~dns~~l~Dv~~ 154 (434)
T KOG1009|consen 88 EVFLWKQGDVRIFDADTEADLNKEKWVVKKVLRGHRDDIYDLAWSPDSNFLVSGSVDNSVRLWDVHA 154 (434)
T ss_pred eEEEEEecCcCCccccchhhhCccceEEEEEecccccchhhhhccCCCceeeeeeccceEEEEEecc
Confidence 99999655 20 1122234568999999999999999999999999999999973
No 175
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=98.87 E-value=2e-08 Score=120.02 Aligned_cols=114 Identities=22% Similarity=0.340 Sum_probs=87.4
Q ss_pred CCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCC--------------CcE--------------EEEEe-cCCceeecCc
Q 000425 51 KKPIVNLACHPRLP-VLYVAYADGLIRAYNIHT--------------YAV--------------HYTLQ-LDNTIKLLGA 100 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t--------------~k~--------------l~tL~-H~~~V~~~ga 100 (1525)
+..|+++.|-|.++ .|+++..+|.+.+||... +.. +.... ....++
T Consensus 219 ktsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w~~~~g~in---- 294 (636)
T KOG2394|consen 219 KSSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARWHIGEGSIN---- 294 (636)
T ss_pred ccceEEEEEEeCCCceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCccceeEecccccc----
Confidence 46799999998765 688889999999997431 111 11111 223455
Q ss_pred eeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 101 GAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 101 ~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.++|+|||++|++.|.||.+||+|+.+ ..+.+.+...-....|++|+|||+||++|++|.-|.||.+.
T Consensus 295 -~f~FS~DG~~LA~VSqDGfLRvF~fdt--~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDDLVtVwSf~ 362 (636)
T KOG2394|consen 295 -EFAFSPDGKYLATVSQDGFLRIFDFDT--QELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSFE 362 (636)
T ss_pred -ceeEcCCCceEEEEecCceEEEeeccH--HHHHHHHHhhccceEEEEEcCCccEEEecCCcceEEEEEec
Confidence 889999999999999999999999995 33333344445678999999999999999999999999864
No 176
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=98.86 E-value=7.1e-08 Score=109.98 Aligned_cols=160 Identities=13% Similarity=0.140 Sum_probs=123.5
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCC-ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc-CCC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG-RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI-HTY 83 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg-k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl-~t~ 83 (1525)
.+..-+-|+++++..+.++.. ...+.++...... +...-...+|...|++|+|+|..+.+++++.|+.-.+|.. ..+
T Consensus 10 ~~pitchAwn~drt~iAv~~~-~~evhiy~~~~~~~w~~~htls~Hd~~vtgvdWap~snrIvtcs~drnayVw~~~~~~ 88 (361)
T KOG1523|consen 10 LEPITCHAWNSDRTQIAVSPN-NHEVHIYSMLGADLWEPAHTLSEHDKIVTGVDWAPKSNRIVTCSHDRNAYVWTQPSGG 88 (361)
T ss_pred cCceeeeeecCCCceEEeccC-CceEEEEEecCCCCceeceehhhhCcceeEEeecCCCCceeEccCCCCccccccCCCC
Confidence 344556788888888877764 5567666643333 3333333459999999999999999999999999999997 333
Q ss_pred --cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceee--EeecccCcCeEEEEEcCCCCEEEE
Q 000425 84 --AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMI--GIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 84 --k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i--~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
++...+. ++.... +|.|+|..+.|++||.-+.|.||=++.+..-.+ .+...+.+.|++++|||+.-+|+.
T Consensus 89 ~WkptlvLlRiNrAAt-----~V~WsP~enkFAVgSgar~isVcy~E~ENdWWVsKhikkPirStv~sldWhpnnVLlaa 163 (361)
T KOG1523|consen 89 TWKPTLVLLRINRAAT-----CVKWSPKENKFAVGSGARLISVCYYEQENDWWVSKHIKKPIRSTVTSLDWHPNNVLLAA 163 (361)
T ss_pred eeccceeEEEecccee-----eEeecCcCceEEeccCccEEEEEEEecccceehhhhhCCccccceeeeeccCCcceecc
Confidence 4444444 888999 999999999999999999999998886421111 123457788999999999999999
Q ss_pred EECCCCeeEEEee
Q 000425 159 LCRDGSLQVWKTR 171 (1525)
Q Consensus 159 gS~DgtIkVWDlr 171 (1525)
||.|+.++|+..=
T Consensus 164 Gs~D~k~rVfSay 176 (361)
T KOG1523|consen 164 GSTDGKCRVFSAY 176 (361)
T ss_pred cccCcceeEEEEe
Confidence 9999999999753
No 177
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=98.85 E-value=2.5e-08 Score=117.89 Aligned_cols=145 Identities=19% Similarity=0.247 Sum_probs=124.5
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k 84 (1525)
++...||.+||....++....++..|.+|++ .+..+.......|+.|.+.|+|+|... +|++-+.|..|.+||+.+.+
T Consensus 164 gqsvRll~ys~skr~lL~~asd~G~VtlwDv-~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~ 242 (673)
T KOG4378|consen 164 GQSVRLLRYSPSKRFLLSIASDKGAVTLWDV-QGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQA 242 (673)
T ss_pred CCeEEEeecccccceeeEeeccCCeEEEEec-cCCCcccchhhhccCCcCcceecCCccceEEEecccceEEEeeccccc
Confidence 5566799999999999999999999999994 555555556667999999999999876 68889999999999999888
Q ss_pred EEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 85 VHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 85 ~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
...++....+.. .|+|+++|.+|+.|+..|.|..||++. .+..+.++.+|...|++++|-|.. .+++
T Consensus 243 s~~~l~y~~Pls-----tvaf~~~G~~L~aG~s~G~~i~YD~R~-~k~Pv~v~sah~~sVt~vafq~s~-tvlt 309 (673)
T KOG4378|consen 243 STDRLTYSHPLS-----TVAFSECGTYLCAGNSKGELIAYDMRS-TKAPVAVRSAHDASVTRVAFQPSP-TVLT 309 (673)
T ss_pred ccceeeecCCcc-----eeeecCCceEEEeecCCceEEEEeccc-CCCCceEeeecccceeEEEeeecc-eeee
Confidence 888888777888 999999999999999999999999996 355556778999999999998875 4444
No 178
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.85 E-value=5.3e-08 Score=114.58 Aligned_cols=160 Identities=16% Similarity=0.233 Sum_probs=123.3
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~ 83 (1525)
.++-.-.++|||..|.++.+...++++.+.+..+........+ ..+.|-.++|+|... .|+++..||+|+-+|++..
T Consensus 285 ~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk--~~g~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~ 362 (463)
T KOG0270|consen 285 HGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNSGKEWK--FDGEVEKVAWDPHSENSFFVSTDDGTVYYFDIRNP 362 (463)
T ss_pred cCCceeEEEecCCCceEEEeccccceEEeeeccCccccCceEE--eccceEEEEecCCCceeEEEecCCceEEeeecCCC
Confidence 3456678999999999999999999999988764333333333 357899999999876 5888999999999999876
Q ss_pred -cEEEEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCC-EEEEE
Q 000425 84 -AVHYTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLR-LLVTL 159 (1525)
Q Consensus 84 -k~l~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~-~LaSg 159 (1525)
+++.+++ |...|. +|++++.-+ .++++|.|++|++|++.......+....-..+...|.++.|+-. +++.|
T Consensus 363 ~~~vwt~~AHd~~IS-----gl~~n~~~p~~l~t~s~d~~Vklw~~~~~~~~~v~~~~~~~~rl~c~~~~~~~a~~la~G 437 (463)
T KOG0270|consen 363 GKPVWTLKAHDDEIS-----GLSVNIQTPGLLSTASTDKVVKLWKFDVDSPKSVKEHSFKLGRLHCFALDPDVAFTLAFG 437 (463)
T ss_pred CCceeEEEeccCCcc-----eEEecCCCCcceeeccccceEEEEeecCCCCcccccccccccceeecccCCCcceEEEec
Confidence 8999999 999999 999998654 57788899999999987533333321111233466788888766 46667
Q ss_pred ECCCCeeEEEee
Q 000425 160 CRDGSLQVWKTR 171 (1525)
Q Consensus 160 S~DgtIkVWDlr 171 (1525)
+..+.++|||+-
T Consensus 438 G~k~~~~vwd~~ 449 (463)
T KOG0270|consen 438 GEKAVLRVWDIF 449 (463)
T ss_pred CccceEEEeecc
Confidence 777789999985
No 179
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=98.85 E-value=3.3e-08 Score=112.09 Aligned_cols=108 Identities=19% Similarity=0.195 Sum_probs=87.5
Q ss_pred eEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCce
Q 000425 54 IVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPS 132 (1525)
Q Consensus 54 V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s 132 (1525)
..|+.|++.|.++|+||.||.|-|||+.|...-+.+. |-.+|. +++||+||+.|+++|.|..|++||+... +
T Consensus 26 a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~-----sl~WS~dgr~LltsS~D~si~lwDl~~g--s 98 (405)
T KOG1273|consen 26 AECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARMLSAHVRPIT-----SLCWSRDGRKLLTSSRDWSIKLWDLLKG--S 98 (405)
T ss_pred cceEEeccCcceeeeeccCCcEEEEEccccchhhhhhcccccee-----EEEecCCCCEeeeecCCceeEEEeccCC--C
Confidence 7889999999999999999999999999999888899 999999 9999999999999999999999999964 2
Q ss_pred eeEeecccCcCeEEEEEcCCC-CEEEEEECCCCeeEEE
Q 000425 133 MIGIIQVGSQPITSVAWLPML-RLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 133 ~i~il~gH~~~VtsVafsPdg-~~LaSgS~DgtIkVWD 169 (1525)
+.... .-.++|..+.|||.. +..+.+=.+..-.+-+
T Consensus 99 ~l~ri-rf~spv~~~q~hp~k~n~~va~~~~~sp~vi~ 135 (405)
T KOG1273|consen 99 PLKRI-RFDSPVWGAQWHPRKRNKCVATIMEESPVVID 135 (405)
T ss_pred ceeEE-EccCccceeeeccccCCeEEEEEecCCcEEEE
Confidence 22111 247899999999954 3444443444333333
No 180
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.84 E-value=9.2e-09 Score=114.47 Aligned_cols=143 Identities=13% Similarity=0.208 Sum_probs=110.1
Q ss_pred EEEEEEeccceEEEEEeecCC---------ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC--CcEE--
Q 000425 20 VVFFGFHRRMSVTVVGTVEGG---------RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT--YAVH-- 86 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tiegg---------k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t--~k~l-- 86 (1525)
+++.+...+..+..|+..++. +....+. .|+.+|.++.|.+.-..=++|+.+..+-.|++.. +.+.
T Consensus 166 ~lllaGyEsghvv~wd~S~~~~~~~~~~~~kv~~~~a-sh~qpvlsldyas~~~rGisgga~dkl~~~Sl~~s~gslq~~ 244 (323)
T KOG0322|consen 166 FLLLAGYESGHVVIWDLSTGDKIIQLPQSSKVESPNA-SHKQPVLSLDYASSCDRGISGGADDKLVMYSLNHSTGSLQIR 244 (323)
T ss_pred EEEEEeccCCeEEEEEccCCceeeccccccccccchh-hccCcceeeeechhhcCCcCCCccccceeeeeccccCccccc
Confidence 445556678899888865541 1122233 4999999999998766667888888899998753 2222
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
..+. .+-.|. .+...||++.++++++|+.||||++++. ...+++.-|...|++++|||+.++++++|.|++|
T Consensus 245 ~e~~lknpGv~-----gvrIRpD~KIlATAGWD~RiRVyswrtl--~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rI 317 (323)
T KOG0322|consen 245 KEITLKNPGVS-----GVRIRPDGKILATAGWDHRIRVYSWRTL--NPLAVLKYHSAGVNAVAFSPDCELMAAASKDARI 317 (323)
T ss_pred ceEEecCCCcc-----ceEEccCCcEEeecccCCcEEEEEeccC--CchhhhhhhhcceeEEEeCCCCchhhhccCCceE
Confidence 1222 334566 8889999999999999999999999964 3334578899999999999999999999999999
Q ss_pred eEEEe
Q 000425 166 QVWKT 170 (1525)
Q Consensus 166 kVWDl 170 (1525)
.+|++
T Consensus 318 SLWkL 322 (323)
T KOG0322|consen 318 SLWKL 322 (323)
T ss_pred Eeeec
Confidence 99985
No 181
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=98.83 E-value=3.7e-08 Score=119.96 Aligned_cols=113 Identities=19% Similarity=0.281 Sum_probs=102.8
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe----cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ----LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~----H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
..++.++..|+.++++++|.|+.|||||+++++..++|+ |++... .|...|.|.|+++...|+++.++|+.
T Consensus 597 tTlYDm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lI-----Kv~lDPSgiY~atScsdktl~~~Df~ 671 (1080)
T KOG1408|consen 597 TTLYDMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLI-----KVILDPSGIYLATSCSDKTLCFVDFV 671 (1080)
T ss_pred ceEEEeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCCceE-----EEEECCCccEEEEeecCCceEEEEec
Confidence 468889999999999999999999999999999999998 556666 78889999999999999999999999
Q ss_pred CCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 128 IERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 128 t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+ ..+.+.+.||...|+.+.|.+|.+.|+|.|.||.|.||.+.
T Consensus 672 s--gEcvA~m~GHsE~VTG~kF~nDCkHlISvsgDgCIFvW~lp 713 (1080)
T KOG1408|consen 672 S--GECVAQMTGHSEAVTGVKFLNDCKHLISVSGDGCIFVWKLP 713 (1080)
T ss_pred c--chhhhhhcCcchheeeeeecccchhheeecCCceEEEEECc
Confidence 5 44555566999999999999999999999999999999886
No 182
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=98.80 E-value=1.1e-08 Score=121.78 Aligned_cols=119 Identities=12% Similarity=0.144 Sum_probs=104.0
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEEeC--CCCEEEEEeCCCcEEEE
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHP--TLEWLFVGDRRGTLLAW 124 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaFSP--dg~~L~SgS~DgtVrVW 124 (1525)
+|++-|.+++|+.+|.+|++|++|-.+.|||.-..++++.+. |...|. ++.|-| ..+.+++|+.|..|+++
T Consensus 48 GH~GCVN~LeWn~dG~lL~SGSDD~r~ivWd~~~~KllhsI~TgHtaNIF-----svKFvP~tnnriv~sgAgDk~i~lf 122 (758)
T KOG1310|consen 48 GHTGCVNCLEWNADGELLASGSDDTRLIVWDPFEYKLLHSISTGHTANIF-----SVKFVPYTNNRIVLSGAGDKLIKLF 122 (758)
T ss_pred cccceecceeecCCCCEEeecCCcceEEeecchhcceeeeeeccccccee-----EEeeeccCCCeEEEeccCcceEEEE
Confidence 599999999999999999999999999999998999999998 999999 999998 34678899999999999
Q ss_pred eCCCC--------CceeeEeecccCcCeEEEEEcCCC-CEEEEEECCCCeeEEEeee
Q 000425 125 DVSIE--------RPSMIGIIQVGSQPITSVAWLPML-RLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 125 Dl~t~--------~~s~i~il~gH~~~VtsVafsPdg-~~LaSgS~DgtIkVWDlr~ 172 (1525)
|+.+. .......+..|...|..++-.|++ ..+-++|.||+|+-+|+|-
T Consensus 123 dl~~~~~~~~d~~~~~~~~~~~cht~rVKria~~p~~PhtfwsasEDGtirQyDiRE 179 (758)
T KOG1310|consen 123 DLDSSKEGGMDHGMEETTRCWSCHTDRVKRIATAPNGPHTFWSASEDGTIRQYDIRE 179 (758)
T ss_pred ecccccccccccCccchhhhhhhhhhhhhheecCCCCCceEEEecCCcceeeecccC
Confidence 99851 111112356899999999999998 5788999999999999983
No 183
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=98.80 E-value=2.3e-07 Score=107.23 Aligned_cols=155 Identities=18% Similarity=0.149 Sum_probs=120.6
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecC----CccceEEcc-CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEG----GRAPTKIKT-DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieg----gk~~tklk~-~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~ 85 (1525)
.|-|+..+.|+..|. +++.+.+|...+- .-++..+++ .|...|.|++|+-..+.+++|..+++|.+-|+.+.+.
T Consensus 61 AlqFS~N~~~L~SGG-DD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N~~~~SG~~~~~VI~HDiEt~qs 139 (609)
T KOG4227|consen 61 ALQFSHNDRFLASGG-DDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLENRFLYSGERWGTVIKHDIETKQS 139 (609)
T ss_pred eeeeccCCeEEeecC-CcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCCeeEecCCCcceeEeeeccccee
Confidence 577888888876665 6899999986332 213334444 3568999999999889999999999999999999999
Q ss_pred EEEEecCC---ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE--eecccCcCeEEEEEcCCC-CEEEEE
Q 000425 86 HYTLQLDN---TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG--IIQVGSQPITSVAWLPML-RLLVTL 159 (1525)
Q Consensus 86 l~tL~H~~---~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~--il~gH~~~VtsVafsPdg-~~LaSg 159 (1525)
+..+.|.+ .|. .+..||..+.+++.++++.|.+||.+.. +..+. .+.........+.|||.. .+|++.
T Consensus 140 i~V~~~~~~~~~VY-----~m~~~P~DN~~~~~t~~~~V~~~D~Rd~-~~~~~~~~~AN~~~~F~t~~F~P~~P~Li~~~ 213 (609)
T KOG4227|consen 140 IYVANENNNRGDVY-----HMDQHPTDNTLIVVTRAKLVSFIDNRDR-QNPISLVLPANSGKNFYTAEFHPETPALILVN 213 (609)
T ss_pred eeeecccCccccee-----ecccCCCCceEEEEecCceEEEEeccCC-CCCCceeeecCCCccceeeeecCCCceeEEec
Confidence 99998655 787 9999999999999999999999999853 11110 112234556789999975 478889
Q ss_pred ECCCCeeEEEeee
Q 000425 160 CRDGSLQVWKTRV 172 (1525)
Q Consensus 160 S~DgtIkVWDlr~ 172 (1525)
+..+.+.+||.|.
T Consensus 214 ~~~~G~~~~D~R~ 226 (609)
T KOG4227|consen 214 SETGGPNVFDRRM 226 (609)
T ss_pred cccCCCCceeecc
Confidence 9999999999983
No 184
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.78 E-value=2.5e-07 Score=108.67 Aligned_cols=157 Identities=11% Similarity=0.166 Sum_probs=108.8
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEc-cCCCCCeEEEEEcCCCCEEEEEEC-CCcEEEEEcCCC--
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIK-TDLKKPIVNLACHPRLPVLYVAYA-DGLIRAYNIHTY-- 83 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk-~~H~~~V~sLaf~Pdg~~LaSGs~-DgtIrLWDl~t~-- 83 (1525)
.-..|+++|++.+++++...+..+.+|+..+.++. ..+. ..+.+....++|+|+++++++++. ++.|.+||+.+.
T Consensus 36 ~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l-~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~ 114 (330)
T PRK11028 36 QVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGAL-TFAAESPLPGSPTHISTDHQGRFLFSASYNANCVSVSPLDKDGI 114 (330)
T ss_pred CCccEEECCCCCEEEEEECCCCcEEEEEECCCCce-EEeeeecCCCCceEEEECCCCCEEEEEEcCCCeEEEEEECCCCC
Confidence 45679999999999998877888877775433321 1111 113345678999999998877764 889999998642
Q ss_pred --cEEEEEecCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceee-----EeecccCcCeEEEEEcCCCCE
Q 000425 84 --AVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMI-----GIIQVGSQPITSVAWLPMLRL 155 (1525)
Q Consensus 84 --k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i-----~il~gH~~~VtsVafsPdg~~ 155 (1525)
+.+..+.+..... .++|+|+++++++++ .+++|.+||+.+. ..+. ..........+++.|+|++++
T Consensus 115 ~~~~~~~~~~~~~~~-----~~~~~p~g~~l~v~~~~~~~v~v~d~~~~-g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~ 188 (330)
T PRK11028 115 PVAPIQIIEGLEGCH-----SANIDPDNRTLWVPCLKEDRIRLFTLSDD-GHLVAQEPAEVTTVEGAGPRHMVFHPNQQY 188 (330)
T ss_pred CCCceeeccCCCccc-----EeEeCCCCCEEEEeeCCCCEEEEEEECCC-CcccccCCCceecCCCCCCceEEECCCCCE
Confidence 2233333333455 889999999987666 7799999999752 1110 000011234578999999999
Q ss_pred EEEEEC-CCCeeEEEee
Q 000425 156 LVTLCR-DGSLQVWKTR 171 (1525)
Q Consensus 156 LaSgS~-DgtIkVWDlr 171 (1525)
+++++. +++|.+||++
T Consensus 189 lyv~~~~~~~v~v~~~~ 205 (330)
T PRK11028 189 AYCVNELNSSVDVWQLK 205 (330)
T ss_pred EEEEecCCCEEEEEEEe
Confidence 877766 8999999986
No 185
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=98.78 E-value=1.5e-08 Score=121.78 Aligned_cols=153 Identities=18% Similarity=0.211 Sum_probs=117.5
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCC-----ccceEEccCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCC
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGG-----RAPTKIKTDLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg-----k~~tklk~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~ 83 (1525)
--+++.|..+.-+.-..++..|++|..-.++ ..+.++.+.|...|+++.|||-. ..|+++++|.+|++||+.++
T Consensus 631 tDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl~~~ 710 (1012)
T KOG1445|consen 631 TDLHWDPFDDERLAVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDSTIELWDLANA 710 (1012)
T ss_pred eecccCCCChHHeeecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccceeeeeehhhh
Confidence 3467778877666666678999999764332 22333444599999999999964 58999999999999999999
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCc----CeEEEEEcCCCCEEEE
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQ----PITSVAWLPMLRLLVT 158 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~----~VtsVafsPdg~~LaS 158 (1525)
+....|. |.+.|. .++|||||+.+++.+.||+|+||+-++....+. .+ .. .--.|.|--+|++++.
T Consensus 711 ~~~~~l~gHtdqIf-----~~AWSpdGr~~AtVcKDg~~rVy~Prs~e~pv~---Eg-~gpvgtRgARi~wacdgr~viv 781 (1012)
T KOG1445|consen 711 KLYSRLVGHTDQIF-----GIAWSPDGRRIATVCKDGTLRVYEPRSREQPVY---EG-KGPVGTRGARILWACDGRIVIV 781 (1012)
T ss_pred hhhheeccCcCcee-----EEEECCCCcceeeeecCceEEEeCCCCCCCccc---cC-CCCccCcceeEEEEecCcEEEE
Confidence 9888898 999999 999999999999999999999999886311111 11 11 1235778889999888
Q ss_pred EECCC----CeeEEEee
Q 000425 159 LCRDG----SLQVWKTR 171 (1525)
Q Consensus 159 gS~Dg----tIkVWDlr 171 (1525)
.+-|+ .|.+||..
T Consensus 782 ~Gfdk~SeRQv~~Y~Aq 798 (1012)
T KOG1445|consen 782 VGFDKSSERQVQMYDAQ 798 (1012)
T ss_pred ecccccchhhhhhhhhh
Confidence 77665 57777765
No 186
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=98.78 E-value=3.2e-08 Score=112.53 Aligned_cols=148 Identities=18% Similarity=0.228 Sum_probs=111.0
Q ss_pred CCcEEEEEEcCCCCEE-EEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CCcEEEEEcCCC
Q 000425 6 VDTEVHLALTPLQPVV-FFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DGLIRAYNIHTY 83 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~V-f~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-DgtIrLWDl~t~ 83 (1525)
......+||||..+-. .++.. ..++-++. ..+..+...+. +|.+.|+-+.|+++|+.|++|+. |-.|-.||++..
T Consensus 207 ~giisc~a~sP~~~~~~a~gsY-~q~~giy~-~~~~~pl~llg-gh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~ 283 (406)
T KOG2919|consen 207 KGIISCFAFSPMDSKTLAVGSY-GQRVGIYN-DDGRRPLQLLG-GHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYS 283 (406)
T ss_pred cceeeeeeccCCCCcceeeecc-cceeeeEe-cCCCCceeeec-ccCCCeeeEEeccCcCeecccccCCCeEEEEeehhc
Confidence 3456789999999944 44443 44554554 33444555555 59999999999999999999875 778999999864
Q ss_pred -cEEEEEe-cCC---ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 84 -AVHYTLQ-LDN---TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 84 -k~l~tL~-H~~---~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
.++..+. |.. ... .++..|++++|++|+.||.|++||+..... .+..+..|...|+.+++||-.++++|
T Consensus 284 ~~pv~~L~rhv~~TNQRI-----~FDld~~~~~LasG~tdG~V~vwdlk~~gn-~~sv~~~~sd~vNgvslnP~mpilat 357 (406)
T KOG2919|consen 284 RDPVYALERHVGDTNQRI-----LFDLDPKGEILASGDTDGSVRVWDLKDLGN-EVSVTGNYSDTVNGVSLNPIMPILAT 357 (406)
T ss_pred cchhhhhhhhccCccceE-----EEecCCCCceeeccCCCccEEEEecCCCCC-cccccccccccccceecCcccceeee
Confidence 5677777 543 223 456679999999999999999999997311 22235578999999999999999999
Q ss_pred EECC
Q 000425 159 LCRD 162 (1525)
Q Consensus 159 gS~D 162 (1525)
+|..
T Consensus 358 ssGq 361 (406)
T KOG2919|consen 358 SSGQ 361 (406)
T ss_pred ccCc
Confidence 8864
No 187
>KOG4328 consensus WD40 protein [Function unknown]
Probab=98.77 E-value=4.2e-08 Score=115.66 Aligned_cols=152 Identities=19% Similarity=0.237 Sum_probs=118.0
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc---
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA--- 84 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k--- 84 (1525)
...+.|+-..+.|+++-+-. ...+|++..++.....+.. |+..|+++++||..+ +|++++.|++.+|||++...
T Consensus 282 fs~~d~~~e~~~vl~~~~~G-~f~~iD~R~~~s~~~~~~l-h~kKI~sv~~NP~~p~~laT~s~D~T~kIWD~R~l~~K~ 359 (498)
T KOG4328|consen 282 FSSLDFSAESRSVLFGDNVG-NFNVIDLRTDGSEYENLRL-HKKKITSVALNPVCPWFLATASLDQTAKIWDLRQLRGKA 359 (498)
T ss_pred eeeccccCCCccEEEeeccc-ceEEEEeecCCccchhhhh-hhcccceeecCCCCchheeecccCcceeeeehhhhcCCC
Confidence 44566777778888887654 6778888777777777777 999999999999987 58999999999999987542
Q ss_pred --EEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC---C--CceeeEeecccCcC------eEEEEEcC
Q 000425 85 --VHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI---E--RPSMIGIIQVGSQP------ITSVAWLP 151 (1525)
Q Consensus 85 --~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t---~--~~s~i~il~gH~~~------VtsVafsP 151 (1525)
.+.++.|...|+ +..|||++-.|++.+.|..|||||..- . ....+ .|+.. ..-..|+|
T Consensus 360 sp~lst~~HrrsV~-----sAyFSPs~gtl~TT~~D~~IRv~dss~~sa~~~p~~~I----~Hn~~t~RwlT~fKA~W~P 430 (498)
T KOG4328|consen 360 SPFLSTLPHRRSVN-----SAYFSPSGGTLLTTCQDNEIRVFDSSCISAKDEPLGTI----PHNNRTGRWLTPFKAAWDP 430 (498)
T ss_pred Ccceecccccceee-----eeEEcCCCCceEeeccCCceEEeecccccccCCcccee----eccCcccccccchhheeCC
Confidence 245555999999 999999888899999999999999841 1 12222 23221 23478999
Q ss_pred CCCEEEEEECCCCeeEEEee
Q 000425 152 MLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 152 dg~~LaSgS~DgtIkVWDlr 171 (1525)
+..+|+.|-.-..|-|+|-.
T Consensus 431 ~~~li~vg~~~r~IDv~~~~ 450 (498)
T KOG4328|consen 431 DYNLIVVGRYPRPIDVFDGN 450 (498)
T ss_pred CccEEEEeccCcceeEEcCC
Confidence 99999999888888888754
No 188
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.76 E-value=1.6e-07 Score=108.32 Aligned_cols=152 Identities=18% Similarity=0.247 Sum_probs=112.8
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCC----
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTY---- 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~---- 83 (1525)
-..+|+||. -..|.....+..|++++. ..+.+++++......|++++|-|-+ ..|++||..| |.+|.....
T Consensus 101 lr~~aWhqH-~~~fava~nddvVriy~k--sst~pt~Lks~sQrnvtclawRPlsaselavgCr~g-IciW~~s~tln~~ 176 (445)
T KOG2139|consen 101 LRGVAWHQH-IIAFAVATNDDVVRIYDK--SSTCPTKLKSVSQRNVTCLAWRPLSASELAVGCRAG-ICIWSDSRTLNAN 176 (445)
T ss_pred eeeEeechh-hhhhhhhccCcEEEEecc--CCCCCceecchhhcceeEEEeccCCcceeeeeecce-eEEEEcCcccccc
Confidence 357889983 333444445677777764 3378889988788999999999975 4789999866 899975421
Q ss_pred c--------EEEEEe--cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC
Q 000425 84 A--------VHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM 152 (1525)
Q Consensus 84 k--------~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd 152 (1525)
. +...++ ...+|. +++|.+||..++++| .|..|.|||..+.....+ .-.....++-+.||||
T Consensus 177 r~~~~~s~~~~qvl~~pgh~pVt-----smqwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL--~~~glgg~slLkwSPd 249 (445)
T KOG2139|consen 177 RNIRMMSTHHLQVLQDPGHNPVT-----SMQWNEDGTILVTASFGSSSIMIWDPDTGQKIPL--IPKGLGGFSLLKWSPD 249 (445)
T ss_pred cccccccccchhheeCCCCceee-----EEEEcCCCCEEeecccCcceEEEEcCCCCCcccc--cccCCCceeeEEEcCC
Confidence 1 122333 236788 999999999999999 899999999997522222 1123567788999999
Q ss_pred CCEEEEEECCCCeeEEEee
Q 000425 153 LRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 153 g~~LaSgS~DgtIkVWDlr 171 (1525)
+.+|+.++-|++.++|...
T Consensus 250 gd~lfaAt~davfrlw~e~ 268 (445)
T KOG2139|consen 250 GDVLFAATCDAVFRLWQEN 268 (445)
T ss_pred CCEEEEecccceeeeehhc
Confidence 9999999999999999544
No 189
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.76 E-value=9.8e-09 Score=131.23 Aligned_cols=159 Identities=17% Similarity=0.259 Sum_probs=122.2
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
...-|.|+|.++.++.+..++..|.+||...-..+.+--...-.+.|.+++|+-.-. .|++++.+|++-|||++..+++
T Consensus 118 ~V~gLDfN~~q~nlLASGa~~geI~iWDlnn~~tP~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg~~~iWDlr~~~pi 197 (1049)
T KOG0307|consen 118 PVLGLDFNPFQGNLLASGADDGEILIWDLNKPETPFTPGSQAPPSEIKCLSWNRKVSHILASGSPSGRAVIWDLRKKKPI 197 (1049)
T ss_pred ceeeeeccccCCceeeccCCCCcEEEeccCCcCCCCCCCCCCCcccceEeccchhhhHHhhccCCCCCceeccccCCCcc
Confidence 345689999999888888899999999964422222211112457899999997654 6899999999999999999888
Q ss_pred EEEe-cCC--ceeecCceeEEEeCCCC-EEEEEe-CC--CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC-CEEEE
Q 000425 87 YTLQ-LDN--TIKLLGAGAFAFHPTLE-WLFVGD-RR--GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML-RLLVT 158 (1525)
Q Consensus 87 ~tL~-H~~--~V~~~ga~sVaFSPdg~-~L~SgS-~D--gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg-~~LaS 158 (1525)
-.+. |.. .+. .|.|||+.. .+++++ +| -.|.+||++.- .+-..++++|...|.+++|++.. ++++|
T Consensus 198 i~ls~~~~~~~~S-----~l~WhP~~aTql~~As~dd~~PviqlWDlR~a-ssP~k~~~~H~~GilslsWc~~D~~lllS 271 (1049)
T KOG0307|consen 198 IKLSDTPGRMHCS-----VLAWHPDHATQLLVASGDDSAPVIQLWDLRFA-SSPLKILEGHQRGILSLSWCPQDPRLLLS 271 (1049)
T ss_pred cccccCCCcccee-----eeeeCCCCceeeeeecCCCCCceeEeeccccc-CCchhhhcccccceeeeccCCCCchhhhc
Confidence 8887 543 366 899999873 455555 33 46999999842 33334567999999999999966 89999
Q ss_pred EECCCCeeEEEeee
Q 000425 159 LCRDGSLQVWKTRV 172 (1525)
Q Consensus 159 gS~DgtIkVWDlr~ 172 (1525)
|+.|+.|.+|+.++
T Consensus 272 sgkD~~ii~wN~~t 285 (1049)
T KOG0307|consen 272 SGKDNRIICWNPNT 285 (1049)
T ss_pred ccCCCCeeEecCCC
Confidence 99999999998753
No 190
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=98.75 E-value=3.8e-07 Score=107.17 Aligned_cols=159 Identities=12% Similarity=0.166 Sum_probs=108.9
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEcc-CCCCCeEEEEEcCCCCEEEE-EECCCcEEEEEcCCCcE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKT-DLKKPIVNLACHPRLPVLYV-AYADGLIRAYNIHTYAV 85 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~-~H~~~V~sLaf~Pdg~~LaS-Gs~DgtIrLWDl~t~k~ 85 (1525)
.-.||+++|++.+++++...+..+.+++..+.+.....+.. .+.....+++++|+++++++ ...+++|++||+.+...
T Consensus 81 ~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~ 160 (330)
T PRK11028 81 SPTHISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGH 160 (330)
T ss_pred CceEEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCc
Confidence 45799999999999999887888888886543322211111 13334577899999998755 45579999999976432
Q ss_pred EE-----EEe--cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCC--CceeeEeecc------cCcCeEEEEE
Q 000425 86 HY-----TLQ--LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIE--RPSMIGIIQV------GSQPITSVAW 149 (1525)
Q Consensus 86 l~-----tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~--~~s~i~il~g------H~~~VtsVaf 149 (1525)
+. ... .....+ .++|+|+|++++++. .+++|.+||+... ....+..+.. +......+.|
T Consensus 161 l~~~~~~~~~~~~g~~p~-----~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~ 235 (330)
T PRK11028 161 LVAQEPAEVTTVEGAGPR-----HMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHI 235 (330)
T ss_pred ccccCCCceecCCCCCCc-----eEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEE
Confidence 21 112 233455 899999999998887 4999999999731 1122222211 1223346899
Q ss_pred cCCCCEEEEEE-CCCCeeEEEee
Q 000425 150 LPMLRLLVTLC-RDGSLQVWKTR 171 (1525)
Q Consensus 150 sPdg~~LaSgS-~DgtIkVWDlr 171 (1525)
||++++++++. .+++|.+|++.
T Consensus 236 ~pdg~~lyv~~~~~~~I~v~~i~ 258 (330)
T PRK11028 236 TPDGRHLYACDRTASLISVFSVS 258 (330)
T ss_pred CCCCCEEEEecCCCCeEEEEEEe
Confidence 99999888875 47899999985
No 191
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=98.70 E-value=1.3e-07 Score=109.05 Aligned_cols=118 Identities=17% Similarity=0.178 Sum_probs=100.8
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC------CCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCc
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH------TYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGT 120 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~------t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~Dgt 120 (1525)
+|.+-|..|.|+.++++|++|+.|..+++|++. +.+++.... |...|. |++|.-....+++|+.+++
T Consensus 54 ~H~GCiNAlqFS~N~~~L~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF-----~L~F~~~N~~~~SG~~~~~ 128 (609)
T KOG4227|consen 54 EHTGCINALQFSHNDRFLASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIF-----SLEFDLENRFLYSGERWGT 128 (609)
T ss_pred hhccccceeeeccCCeEEeecCCcceeeeechHHHHhhcCCCCceeccCccccceE-----EEEEccCCeeEecCCCcce
Confidence 499999999999999999999999999999964 457777777 668899 9999999999999999999
Q ss_pred EEEEeCCCCCceeeEeecccCc---CeEEEEEcCCCCEEEEEECCCCeeEEEeeeec
Q 000425 121 LLAWDVSIERPSMIGIIQVGSQ---PITSVAWLPMLRLLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 121 VrVWDl~t~~~s~i~il~gH~~---~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~ 174 (1525)
|...|+.+. .++- ...|.. .|..+..||..+.|++.+.|+.|.+||.|-..
T Consensus 129 VI~HDiEt~-qsi~--V~~~~~~~~~VY~m~~~P~DN~~~~~t~~~~V~~~D~Rd~~ 182 (609)
T KOG4227|consen 129 VIKHDIETK-QSIY--VANENNNRGDVYHMDQHPTDNTLIVVTRAKLVSFIDNRDRQ 182 (609)
T ss_pred eEeeecccc-eeee--eecccCcccceeecccCCCCceEEEEecCceEEEEeccCCC
Confidence 999999963 2221 124544 89999999999999999999999999998543
No 192
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=98.70 E-value=1.3e-07 Score=106.53 Aligned_cols=194 Identities=17% Similarity=0.198 Sum_probs=131.6
Q ss_pred EEEcCCC----CEEEEEEeccceEEEEEee--cCCcc-----ceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEE
Q 000425 12 LALTPLQ----PVVFFGFHRRMSVTVVGTV--EGGRA-----PTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYN 79 (1525)
Q Consensus 12 LA~hP~q----p~Vf~s~~~d~sI~vv~ti--eggk~-----~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWD 79 (1525)
|-|.|++ |-++.++.+ .+++|... +.... .+....+|..++++.+|+.-++ ++.++|-|-|..|||
T Consensus 102 ~~wiPd~~g~~pdlLATs~D--~LRlWri~~ee~~~~~~~~L~~~kns~~~aPlTSFDWne~dp~~igtSSiDTTCTiWd 179 (364)
T KOG0290|consen 102 LMWIPDSKGVYPDLLATSSD--FLRLWRIGDEESRVELQSVLNNNKNSEFCAPLTSFDWNEVDPNLIGTSSIDTTCTIWD 179 (364)
T ss_pred eEecCCccccCcchhhcccC--eEEEEeccCcCCceehhhhhccCcccccCCcccccccccCCcceeEeecccCeEEEEE
Confidence 4455654 556555543 44556543 22111 1112345788999999998765 688999999999999
Q ss_pred cCCCcE---EEEEe-cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecc-c-CcCeEEEEEcCC
Q 000425 80 IHTYAV---HYTLQ-LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQV-G-SQPITSVAWLPM 152 (1525)
Q Consensus 80 l~t~k~---l~tL~-H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~g-H-~~~VtsVafsPd 152 (1525)
++++.. ...+- |++.|. .|+|...+ +.++|.|.||+||+||++...++.+ +++. . ..+...++|+++
T Consensus 180 ie~~~~~~vkTQLIAHDKEV~-----DIaf~~~s~~~FASvgaDGSvRmFDLR~leHSTI-IYE~p~~~~pLlRLswnkq 253 (364)
T KOG0290|consen 180 IETGVSGTVKTQLIAHDKEVY-----DIAFLKGSRDVFASVGADGSVRMFDLRSLEHSTI-IYEDPSPSTPLLRLSWNKQ 253 (364)
T ss_pred EeeccccceeeEEEecCccee-----EEEeccCccceEEEecCCCcEEEEEecccccceE-EecCCCCCCcceeeccCcC
Confidence 998633 34444 999999 99999866 5788999999999999997533333 1322 1 346678999985
Q ss_pred -CCEEEEEECCC-CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCC-CeEEEE
Q 000425 153 -LRLLVTLCRDG-SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRL-NLAVLL 229 (1525)
Q Consensus 153 -g~~LaSgS~Dg-tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~-Nlial~ 229 (1525)
-+|++|-..|. .|.|-|+|....|- + -|..+. ..|+.++|+|.. +.+.
T Consensus 254 DpnymATf~~dS~~V~iLDiR~P~tpv-----a---------------~L~~H~-------a~VNgIaWaPhS~~hic-- 304 (364)
T KOG0290|consen 254 DPNYMATFAMDSNKVVILDIRVPCTPV-----A---------------RLRNHQ-------ASVNGIAWAPHSSSHIC-- 304 (364)
T ss_pred CchHHhhhhcCCceEEEEEecCCCcce-----e---------------hhhcCc-------ccccceEecCCCCceee--
Confidence 45777766555 69999998543310 0 112222 258889999975 4444
Q ss_pred ecccccCCccccccccc
Q 000425 230 FANFTGGDNIKNRAAYT 246 (1525)
Q Consensus 230 F~~~s~~Dd~~~k~~~~ 246 (1525)
++|||.+.-+|..
T Consensus 305 ----taGDD~qaliWDl 317 (364)
T KOG0290|consen 305 ----TAGDDCQALIWDL 317 (364)
T ss_pred ----ecCCcceEEEEec
Confidence 8999999988874
No 193
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.68 E-value=6.9e-07 Score=109.49 Aligned_cols=150 Identities=14% Similarity=0.096 Sum_probs=100.0
Q ss_pred cEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEE-CCCcEEEE--EcCC
Q 000425 8 TEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAY-ADGLIRAY--NIHT 82 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs-~DgtIrLW--Dl~t 82 (1525)
....++|+|++..+++....+ ..+.+++.. ++.. ..+.. ..+....++|+|+|+.|+.++ .||.+.|| |+.+
T Consensus 205 ~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~-tg~~-~~l~~-~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~ 281 (429)
T PRK01742 205 PLMSPAWSPDGSKLAYVSFENKKSQLVVHDLR-SGAR-KVVAS-FRGHNGAPAFSPDGSRLAFASSKDGVLNIYVMGANG 281 (429)
T ss_pred ccccceEcCCCCEEEEEEecCCCcEEEEEeCC-CCce-EEEec-CCCccCceeECCCCCEEEEEEecCCcEEEEEEECCC
Confidence 467899999999888776443 357777753 3321 12221 233345689999999887764 68877666 5666
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
++. ..+. +...+. ...|+|||++|+.++ .++...||++........ .+ .|.. .+..|+|++++|+..+
T Consensus 282 ~~~-~~lt~~~~~~~-----~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~-~l-~~~~--~~~~~SpDG~~ia~~~ 351 (429)
T PRK01742 282 GTP-SQLTSGAGNNT-----EPSWSPDGQSILFTSDRSGSPQVYRMSASGGGAS-LV-GGRG--YSAQISADGKTLVMIN 351 (429)
T ss_pred CCe-EeeccCCCCcC-----CEEECCCCCEEEEEECCCCCceEEEEECCCCCeE-Ee-cCCC--CCccCCCCCCEEEEEc
Confidence 554 4455 666677 899999999877665 678899998764211111 11 3433 4578999999998888
Q ss_pred CCCCeeEEEee
Q 000425 161 RDGSLQVWKTR 171 (1525)
Q Consensus 161 ~DgtIkVWDlr 171 (1525)
.++ +.+||+.
T Consensus 352 ~~~-i~~~Dl~ 361 (429)
T PRK01742 352 GDN-VVKQDLT 361 (429)
T ss_pred CCC-EEEEECC
Confidence 765 4458864
No 194
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.65 E-value=9.2e-07 Score=102.67 Aligned_cols=110 Identities=18% Similarity=0.256 Sum_probs=83.2
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-c-C--CceeecCceeEEEeCCCCEEEEEe--CCCcEEEE
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-L-D--NTIKLLGAGAFAFHPTLEWLFVGD--RRGTLLAW 124 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H-~--~~V~~~ga~sVaFSPdg~~L~SgS--~DgtVrVW 124 (1525)
..+|.+|.++ .+.++++-. ..|.|||+++++.++++. . . ..+. ++.+++.+.|++--+ ..|.|.+|
T Consensus 87 pt~IL~VrmN--r~RLvV~Le-e~IyIydI~~MklLhTI~t~~~n~~gl~-----AlS~n~~n~ylAyp~s~t~GdV~l~ 158 (391)
T KOG2110|consen 87 PTSILAVRMN--RKRLVVCLE-ESIYIYDIKDMKLLHTIETTPPNPKGLC-----ALSPNNANCYLAYPGSTTSGDVVLF 158 (391)
T ss_pred CCceEEEEEc--cceEEEEEc-ccEEEEecccceeehhhhccCCCccceE-----eeccCCCCceEEecCCCCCceEEEE
Confidence 3456666665 345666554 459999999999999998 2 2 3344 666666667888544 68999999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC-eeEEEe
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS-LQVWKT 170 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt-IkVWDl 170 (1525)
|..+. +....+.+|.+.+-|++||++|.+|+|+|+.|| |||+++
T Consensus 159 d~~nl--~~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v 203 (391)
T KOG2110|consen 159 DTINL--QPVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSV 203 (391)
T ss_pred Ecccc--eeeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEc
Confidence 99853 222346699999999999999999999999997 677765
No 195
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.65 E-value=5e-07 Score=112.90 Aligned_cols=159 Identities=18% Similarity=0.307 Sum_probs=117.1
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCcc----ceEEccCCCCCeEEEEEcCCCCE--EEEEECCCcEEEEEc
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRA----PTKIKTDLKKPIVNLACHPRLPV--LYVAYADGLIRAYNI 80 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~----~tklk~~H~~~V~sLaf~Pdg~~--LaSGs~DgtIrLWDl 80 (1525)
....+++|+|..|.+++|....+.|.+|++..+... .......|..+|..+.|-.+... |+++|.||.|..|++
T Consensus 243 s~v~~~~f~p~~p~ll~gG~y~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~~~~~f~s~ssDG~i~~W~~ 322 (555)
T KOG1587|consen 243 SEVTCLKFCPFDPNLLAGGCYNGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNEHNTEFFSLSSDGSICSWDT 322 (555)
T ss_pred CceeEEEeccCCcceEEeeccCceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccCCCCceEEEecCCcEeeeec
Confidence 356789999999999999999999999998666442 22233458899999999765544 999999999999986
Q ss_pred CCCc---------------------------------------------E------------------EEEEe-cCCcee
Q 000425 81 HTYA---------------------------------------------V------------------HYTLQ-LDNTIK 96 (1525)
Q Consensus 81 ~t~k---------------------------------------------~------------------l~tL~-H~~~V~ 96 (1525)
+... + +..+. |...|+
T Consensus 323 ~~l~~P~e~~~~~~~~~~~~~~~~~~~~t~~~F~~~~p~~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~ 402 (555)
T KOG1587|consen 323 DMLSLPVEGLLLESKKHKGQQSSKAVGATSLKFEPTDPNHFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVY 402 (555)
T ss_pred cccccchhhcccccccccccccccccceeeEeeccCCCceEEEEcCCcEEEEEeccCCcccccccccccccccccCcceE
Confidence 4210 0 00111 334445
Q ss_pred ecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC-CCceeeEeecccCcCeEEEEEcCCCC-EEEEEECCCCeeEEEeee
Q 000425 97 LLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI-ERPSMIGIIQVGSQPITSVAWLPMLR-LLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 97 ~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t-~~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS~DgtIkVWDlr~ 172 (1525)
++.++|=+..++..+.|.+|++|.... ..+.+ .++.+...|++++|||..+ .|+++..||.|.+||+..
T Consensus 403 -----~v~~nPF~~k~fls~gDW~vriWs~~~~~~Pl~--~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~ 473 (555)
T KOG1587|consen 403 -----AVSRNPFYPKNFLSVGDWTVRIWSEDVIASPLL--SLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQ 473 (555)
T ss_pred -----eeecCCCccceeeeeccceeEeccccCCCCcch--hhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhhc
Confidence 788888776666555599999998762 22222 2567888899999999876 677788899999999973
No 196
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=98.64 E-value=3.5e-07 Score=108.16 Aligned_cols=157 Identities=13% Similarity=0.231 Sum_probs=122.1
Q ss_pred cCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecC--CccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcC
Q 000425 5 SVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEG--GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 5 ~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg--gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~ 81 (1525)
+..+.-++.|||+.|.++++.- +.+++++.. ++ ......+.. -+.||.+..|+|+|. .+++++.-.....||+.
T Consensus 212 s~~~I~sv~FHp~~plllvaG~-d~~lrifqv-DGk~N~~lqS~~l-~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle 288 (514)
T KOG2055|consen 212 SHGGITSVQFHPTAPLLLVAGL-DGTLRIFQV-DGKVNPKLQSIHL-EKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLE 288 (514)
T ss_pred CcCCceEEEecCCCceEEEecC-CCcEEEEEe-cCccChhheeeee-ccCccceeeecCCCceEEEecccceEEEEeecc
Confidence 4456788999999999998874 778888763 43 223334443 468999999999998 89999999999999998
Q ss_pred CCcEEEEEe---c-CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 82 TYAVHYTLQ---L-DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 82 t~k~l~tL~---H-~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
+.+..+.-. + ...+. ....+|++++|+..+..|.|.+-...| ..+++.+. -.+.|..++|+.|++.|+
T Consensus 289 ~ak~~k~~~~~g~e~~~~e-----~FeVShd~~fia~~G~~G~I~lLhakT--~eli~s~K-ieG~v~~~~fsSdsk~l~ 360 (514)
T KOG2055|consen 289 TAKVTKLKPPYGVEEKSME-----RFEVSHDSNFIAIAGNNGHIHLLHAKT--KELITSFK-IEGVVSDFTFSSDSKELL 360 (514)
T ss_pred ccccccccCCCCcccchhh-----eeEecCCCCeEEEcccCceEEeehhhh--hhhhheee-eccEEeeEEEecCCcEEE
Confidence 876543322 2 34677 788899999999999999999987765 23332221 356789999999999999
Q ss_pred EEECCCCeeEEEeee
Q 000425 158 TLCRDGSLQVWKTRV 172 (1525)
Q Consensus 158 SgS~DgtIkVWDlr~ 172 (1525)
.++.+|.|.+||++.
T Consensus 361 ~~~~~GeV~v~nl~~ 375 (514)
T KOG2055|consen 361 ASGGTGEVYVWNLRQ 375 (514)
T ss_pred EEcCCceEEEEecCC
Confidence 999999999999984
No 197
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=98.63 E-value=5.5e-07 Score=103.60 Aligned_cols=152 Identities=13% Similarity=0.229 Sum_probs=119.4
Q ss_pred EEEEcCC-CCEEEEEEeccceEEEEEeecCCccceEEccCCC-CCeEEEEEcCCCCEEEEEE----CCCcEEEEEcCCCc
Q 000425 11 HLALTPL-QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLK-KPIVNLACHPRLPVLYVAY----ADGLIRAYNIHTYA 84 (1525)
Q Consensus 11 ~LA~hP~-qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~-~~V~sLaf~Pdg~~LaSGs----~DgtIrLWDl~t~k 84 (1525)
-++|... +|-.+.++..+.+|++|+............++|. .+-.+++.+.++..+++|. .|-.|.+||++..+
T Consensus 75 ~vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~q 154 (376)
T KOG1188|consen 75 GVRFISCDSPHGVISCSSDGTVRLWDIRSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQ 154 (376)
T ss_pred ceEEecCCCCCeeEEeccCCeEEEEEeecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEecccc
Confidence 3555543 7777777778999999998766555554455566 5677888887888888875 36789999999887
Q ss_pred E-EEEEe--cCCceeecCceeEEEeCC-CCEEEEEeCCCcEEEEeCCCC-----CceeeEeecccCcCeEEEEEcCCC-C
Q 000425 85 V-HYTLQ--LDNTIKLLGAGAFAFHPT-LEWLFVGDRRGTLLAWDVSIE-----RPSMIGIIQVGSQPITSVAWLPML-R 154 (1525)
Q Consensus 85 ~-l~tL~--H~~~V~~~ga~sVaFSPd-g~~L~SgS~DgtVrVWDl~t~-----~~s~i~il~gH~~~VtsVafsPdg-~ 154 (1525)
. +..+. |.+.|+ +|+|||. .+.|+|||.||.|.++|+... ..+.+ .|...|.++.|+.++ +
T Consensus 155 q~l~~~~eSH~DDVT-----~lrFHP~~pnlLlSGSvDGLvnlfD~~~d~EeDaL~~vi----N~~sSI~~igw~~~~yk 225 (376)
T KOG1188|consen 155 QLLRQLNESHNDDVT-----QLRFHPSDPNLLLSGSVDGLVNLFDTKKDNEEDALLHVI----NHGSSIHLIGWLSKKYK 225 (376)
T ss_pred chhhhhhhhccCcce-----eEEecCCCCCeEEeecccceEEeeecCCCcchhhHHHhh----cccceeeeeeeecCCcc
Confidence 6 77776 999999 9999995 568899999999999999853 11223 577889999999887 3
Q ss_pred EEEEEECCCCeeEEEee
Q 000425 155 LLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 155 ~LaSgS~DgtIkVWDlr 171 (1525)
.|.+-+-+.+..+|+++
T Consensus 226 rI~clTH~Etf~~~ele 242 (376)
T KOG1188|consen 226 RIMCLTHMETFAIYELE 242 (376)
T ss_pred eEEEEEccCceeEEEcc
Confidence 47777899999999987
No 198
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.58 E-value=8e-07 Score=104.61 Aligned_cols=161 Identities=16% Similarity=0.098 Sum_probs=121.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE-
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV- 85 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~- 85 (1525)
+.+-.+||.+++..+.++. .|..+++|+.-.. ....... .|.+.|.+++|+||+..|++-+.| ..+|||..++.+
T Consensus 145 g~~k~vaf~~~gs~latgg-~dg~lRv~~~Ps~-~t~l~e~-~~~~eV~DL~FS~dgk~lasig~d-~~~VW~~~~g~~~ 220 (398)
T KOG0771|consen 145 GQQKVVAFNGDGSKLATGG-TDGTLRVWEWPSM-LTILEEI-AHHAEVKDLDFSPDGKFLASIGAD-SARVWSVNTGAAL 220 (398)
T ss_pred CcceEEEEcCCCCEeeecc-ccceEEEEecCcc-hhhhhhH-hhcCccccceeCCCCcEEEEecCC-ceEEEEeccCchh
Confidence 3557899999999998887 4888989874322 2222222 488999999999999999999999 899999877611
Q ss_pred -------------------------EEEEe---cCCceee---------------------cCceeEEEeCCCCEEEEEe
Q 000425 86 -------------------------HYTLQ---LDNTIKL---------------------LGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 86 -------------------------l~tL~---H~~~V~~---------------------~ga~sVaFSPdg~~L~SgS 116 (1525)
++.+. ....|+. ++..+++.++||++++.|+
T Consensus 221 a~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~~~l~~~~~~~~~~siSsl~VS~dGkf~AlGT 300 (398)
T KOG0771|consen 221 ARKTPFSKDEMFSSCRFSVDNAQETLRLAASQFPGGGVRLCDISLWSGSNFLRLRKKIKRFKSISSLAVSDDGKFLALGT 300 (398)
T ss_pred hhcCCcccchhhhhceecccCCCceEEEEEecCCCCceeEEEeeeeccccccchhhhhhccCcceeEEEcCCCcEEEEec
Confidence 11111 1111111 2335899999999999999
Q ss_pred CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 117 RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 117 ~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
.||.|.|++..+. +...-+...|...|+.+.|+|+.+++++-|.|.++.|-.+..
T Consensus 301 ~dGsVai~~~~~l-q~~~~vk~aH~~~VT~ltF~Pdsr~~~svSs~~~~~v~~l~v 355 (398)
T KOG0771|consen 301 MDGSVAIYDAKSL-QRLQYVKEAHLGFVTGLTFSPDSRYLASVSSDNEAAVTKLAV 355 (398)
T ss_pred cCCcEEEEEecee-eeeEeehhhheeeeeeEEEcCCcCcccccccCCceeEEEEee
Confidence 9999999998853 111123568999999999999999999999999999887763
No 199
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.57 E-value=1.5e-06 Score=100.91 Aligned_cols=118 Identities=15% Similarity=0.182 Sum_probs=93.8
Q ss_pred CCCCeEEEEEcCCCCEEEEEE--CCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCc-EEEEe
Q 000425 50 LKKPIVNLACHPRLPVLYVAY--ADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGT-LLAWD 125 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs--~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dgt-VrVWD 125 (1525)
|...+..+++|+.+.+++.-+ ..|.|.|||..+-+++.++. |+..+- +++|+++|.+|+++|+-|+ |||+.
T Consensus 128 n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~lA-----alafs~~G~llATASeKGTVIRVf~ 202 (391)
T KOG2110|consen 128 NPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGPLA-----ALAFSPDGTLLATASEKGTVIRVFS 202 (391)
T ss_pred CccceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCcee-----EEEECCCCCEEEEeccCceEEEEEE
Confidence 455577777777777887632 35889999999999999999 999999 9999999999999998887 58999
Q ss_pred CCCCCceeeEeecc-cCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeee
Q 000425 126 VSIERPSMIGIIQV-GSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVI 173 (1525)
Q Consensus 126 l~t~~~s~i~il~g-H~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~ 173 (1525)
+.+. ..+..+..| -...|.+++|||++.+|++.|..+||+|+.+...
T Consensus 203 v~~G-~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~ 250 (391)
T KOG2110|consen 203 VPEG-QKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKV 250 (391)
T ss_pred cCCc-cEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEeccc
Confidence 8864 222211111 1345789999999999999999999999998743
No 200
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.55 E-value=4.7e-07 Score=106.46 Aligned_cols=69 Identities=16% Similarity=0.179 Sum_probs=63.8
Q ss_pred EEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC
Q 000425 55 VNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 55 ~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
.+++|+++|..+++|+.||++|+|+..+...+.... |...|. +++|+|||+.|++-+.| ..+|||.++.
T Consensus 148 k~vaf~~~gs~latgg~dg~lRv~~~Ps~~t~l~e~~~~~eV~-----DL~FS~dgk~lasig~d-~~~VW~~~~g 217 (398)
T KOG0771|consen 148 KVVAFNGDGSKLATGGTDGTLRVWEWPSMLTILEEIAHHAEVK-----DLDFSPDGKFLASIGAD-SARVWSVNTG 217 (398)
T ss_pred eEEEEcCCCCEeeeccccceEEEEecCcchhhhhhHhhcCccc-----cceeCCCCcEEEEecCC-ceEEEEeccC
Confidence 679999999999999999999999988887777777 999999 99999999999999999 9999998864
No 201
>PRK01742 tolB translocation protein TolB; Provisional
Probab=98.54 E-value=1.1e-06 Score=107.66 Aligned_cols=150 Identities=9% Similarity=0.024 Sum_probs=102.1
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEE-CCCcEEEEEcCCC-cEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAY-ADGLIRAYNIHTY-AVH 86 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs-~DgtIrLWDl~t~-k~l 86 (1525)
...++|+|++..++++...+....+|...-.+.....+.. |...+.+.+|+|+|+.++.++ .++..+||++... ...
T Consensus 250 ~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~ 328 (429)
T PRK01742 250 NGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTS-GAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGA 328 (429)
T ss_pred cCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeecc-CCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCe
Confidence 3468999999999888765655555432222233444444 667788999999999766544 6788899986432 222
Q ss_pred EEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 87 YTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 87 ~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
..+.+.. . ..+|+|||++|+.++.++.+ +||+.+...... ...+ ...+++|+|++++|++++.++.++
T Consensus 329 ~~l~~~~--~-----~~~~SpDG~~ia~~~~~~i~-~~Dl~~g~~~~l--t~~~--~~~~~~~sPdG~~i~~~s~~g~~~ 396 (429)
T PRK01742 329 SLVGGRG--Y-----SAQISADGKTLVMINGDNVV-KQDLTSGSTEVL--SSTF--LDESPSISPNGIMIIYSSTQGLGK 396 (429)
T ss_pred EEecCCC--C-----CccCCCCCCEEEEEcCCCEE-EEECCCCCeEEe--cCCC--CCCCceECCCCCEEEEEEcCCCce
Confidence 2223332 4 67899999999888876654 599886422222 1122 235688999999999999999999
Q ss_pred EEEee
Q 000425 167 VWKTR 171 (1525)
Q Consensus 167 VWDlr 171 (1525)
+|++.
T Consensus 397 ~l~~~ 401 (429)
T PRK01742 397 VLQLV 401 (429)
T ss_pred EEEEE
Confidence 99874
No 202
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=98.53 E-value=7.9e-07 Score=101.63 Aligned_cols=157 Identities=17% Similarity=0.239 Sum_probs=122.5
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC---CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT---YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD 125 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t---~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD 125 (1525)
...+|+|.+|++|...+|++.....|.||.... .+..++++ |+..|+ .|+|+|..+.|++++.|+.-.||.
T Consensus 9 ~~~pitchAwn~drt~iAv~~~~~evhiy~~~~~~~w~~~htls~Hd~~vt-----gvdWap~snrIvtcs~drnayVw~ 83 (361)
T KOG1523|consen 9 LLEPITCHAWNSDRTQIAVSPNNHEVHIYSMLGADLWEPAHTLSEHDKIVT-----GVDWAPKSNRIVTCSHDRNAYVWT 83 (361)
T ss_pred ccCceeeeeecCCCceEEeccCCceEEEEEecCCCCceeceehhhhCccee-----EEeecCCCCceeEccCCCCccccc
Confidence 468999999999999999999999999998654 45788898 999999 999999999999999999999998
Q ss_pred CCCC---CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccc
Q 000425 126 VSIE---RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQ 202 (1525)
Q Consensus 126 l~t~---~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~ 202 (1525)
.... .+..+ +-.|+...+||.|+|..+.|++||....|.||-+..-- + +.+ +-
T Consensus 84 ~~~~~~Wkptlv--LlRiNrAAt~V~WsP~enkFAVgSgar~isVcy~E~EN---------d---------WWV----sK 139 (361)
T KOG1523|consen 84 QPSGGTWKPTLV--LLRINRAATCVKWSPKENKFAVGSGARLISVCYYEQEN---------D---------WWV----SK 139 (361)
T ss_pred cCCCCeecccee--EEEeccceeeEeecCcCceEEeccCccEEEEEEEeccc---------c---------eeh----hh
Confidence 8432 33333 55789999999999999999999999999999764210 0 000 00
Q ss_pred cCCcceecC-CCeeEEEEecCCCeEEEEecccccCCccccccc
Q 000425 203 QGGEAVYPL-PRVRALEVHPRLNLAVLLFANFTGGDNIKNRAA 244 (1525)
Q Consensus 203 ~gG~~vy~l-~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~ 244 (1525)
+ .+.|+ ..|.++.|||..-+.+ .++-|.+.|+.
T Consensus 140 h---ikkPirStv~sldWhpnnVLla------aGs~D~k~rVf 173 (361)
T KOG1523|consen 140 H---IKKPIRSTVTSLDWHPNNVLLA------AGSTDGKCRVF 173 (361)
T ss_pred h---hCCccccceeeeeccCCcceec------ccccCcceeEE
Confidence 0 01222 3578999999877777 66666666543
No 203
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.50 E-value=2e-06 Score=109.24 Aligned_cols=190 Identities=11% Similarity=0.128 Sum_probs=125.3
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcC-CCCEEEEEECCCcEEEEEcCCCc---E
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHP-RLPVLYVAYADGLIRAYNIHTYA---V 85 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~P-dg~~LaSGs~DgtIrLWDl~t~k---~ 85 (1525)
..+++.-+.-.++++.. ...|++|+.... .....+..+-..-|+++..+- .|+.++.|..||.||+||.+... .
T Consensus 1169 ~v~dWqQ~~G~Ll~tGd-~r~IRIWDa~~E-~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~ 1246 (1387)
T KOG1517|consen 1169 LVVDWQQQSGHLLVTGD-VRSIRIWDAHKE-QVVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAPPDSL 1246 (1387)
T ss_pred eeeehhhhCCeEEecCC-eeEEEEEecccc-eeEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCCcccc
Confidence 45556655556666664 678888886432 222222333445567665543 36899999999999999977543 3
Q ss_pred EEEEe-cCCc--eeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCC-CceeeEeeccc---CcCeEEEEEcCCCCEEE
Q 000425 86 HYTLQ-LDNT--IKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIE-RPSMIGIIQVG---SQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 86 l~tL~-H~~~--V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH---~~~VtsVafsPdg~~La 157 (1525)
+...+ |... |. .+.+.+.|- .|++||.||.|++||++.. ...+.. ...| .+.++++..|++.+.++
T Consensus 1247 v~~~R~h~~~~~Iv-----~~slq~~G~~elvSgs~~G~I~~~DlR~~~~e~~~~-iv~~~~yGs~lTal~VH~hapiiA 1320 (1387)
T KOG1517|consen 1247 VCVYREHNDVEPIV-----HLSLQRQGLGELVSGSQDGDIQLLDLRMSSKETFLT-IVAHWEYGSALTALTVHEHAPIIA 1320 (1387)
T ss_pred ceeecccCCcccce-----eEEeecCCCcceeeeccCCeEEEEecccCcccccce-eeeccccCccceeeeeccCCCeee
Confidence 45555 8766 88 899988664 5999999999999999962 222221 1133 23589999999999999
Q ss_pred EEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEe
Q 000425 158 TLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLF 230 (1525)
Q Consensus 158 SgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F 230 (1525)
|||. +.|+||++..-... . + - ...|-..+....+.+++|||-.-++|+.+
T Consensus 1321 sGs~-q~ikIy~~~G~~l~----~---~-k--------------~n~~F~~q~~gs~scL~FHP~~~llAaG~ 1370 (1387)
T KOG1517|consen 1321 SGSA-QLIKIYSLSGEQLN----I---I-K--------------YNPGFMGQRIGSVSCLAFHPHRLLLAAGS 1370 (1387)
T ss_pred ecCc-ceEEEEecChhhhc----c---c-c--------------cCcccccCcCCCcceeeecchhHhhhhcc
Confidence 9998 99999998621110 0 0 0 00111112233468899999988888443
No 204
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=98.48 E-value=1e-06 Score=110.25 Aligned_cols=157 Identities=20% Similarity=0.256 Sum_probs=115.7
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccce-------EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPT-------KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH 81 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~t-------klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~ 81 (1525)
--.++|+|..|..|+-....+.| ..+...+..... ....-|.++|+++.++|-...++..+.|-+||||+..
T Consensus 350 ~t~~~F~~~~p~~FiVGTe~G~v-~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~fls~gDW~vriWs~~ 428 (555)
T KOG1587|consen 350 ATSLKFEPTDPNHFIVGTEEGKV-YKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNFLSVGDWTVRIWSED 428 (555)
T ss_pred eeeEeeccCCCceEEEEcCCcEE-EEEeccCCcccccccccccccccccCcceEeeecCCCccceeeeeccceeEecccc
Confidence 34689999999998887654444 333333322222 1222388999999999988755444459999999987
Q ss_pred -CCcEEEEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 82 -TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 82 -t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
...++..+. +.+.|. +++|||..+ .++++..||.+-+||+.......+.....+....+.+.|+++|+.|+.
T Consensus 429 ~~~~Pl~~~~~~~~~v~-----~vaWSptrpavF~~~d~~G~l~iWDLl~~~~~Pv~s~~~~~~~l~~~~~s~~g~~lav 503 (555)
T KOG1587|consen 429 VIASPLLSLDSSPDYVT-----DVAWSPTRPAVFATVDGDGNLDIWDLLQDDEEPVLSQKVCSPALTRVRWSPNGKLLAV 503 (555)
T ss_pred CCCCcchhhhhccceee-----eeEEcCcCceEEEEEcCCCceehhhhhccccCCcccccccccccceeecCCCCcEEEE
Confidence 667888888 888899 999999776 555667899999999985322111112234666778899999999999
Q ss_pred EECCCCeeEEEee
Q 000425 159 LCRDGSLQVWKTR 171 (1525)
Q Consensus 159 gS~DgtIkVWDlr 171 (1525)
|...|++++|++.
T Consensus 504 Gd~~G~~~~~~l~ 516 (555)
T KOG1587|consen 504 GDANGTTHILKLS 516 (555)
T ss_pred ecCCCcEEEEEcC
Confidence 9999999999996
No 205
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=98.48 E-value=8.6e-07 Score=113.08 Aligned_cols=143 Identities=17% Similarity=0.141 Sum_probs=112.8
Q ss_pred CCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE--EEecCC
Q 000425 16 PLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY--TLQLDN 93 (1525)
Q Consensus 16 P~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~--tL~H~~ 93 (1525)
+..=++.+|.. -..+.+|...+..++. .+.+ |++.|.++.|+-+|.++++.|+|+++|+|++.+.+... .|.|..
T Consensus 143 ~~~~~i~~gsv-~~~iivW~~~~dn~p~-~l~G-HeG~iF~i~~s~dg~~i~s~SdDRsiRlW~i~s~~~~~~~~fgHsa 219 (967)
T KOG0974|consen 143 AEELYIASGSV-FGEIIVWKPHEDNKPI-RLKG-HEGSIFSIVTSLDGRYIASVSDDRSIRLWPIDSREVLGCTGFGHSA 219 (967)
T ss_pred CcEEEEEeccc-cccEEEEeccccCCcc-eecc-cCCceEEEEEccCCcEEEEEecCcceeeeecccccccCcccccccc
Confidence 33334444443 4455666666443333 4555 99999999999999999999999999999999988776 233999
Q ss_pred ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccC-cCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 94 TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGS-QPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 94 ~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~-~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.|. .+.|+|. .+++++.|.+.++|+.+- ... ..+.+|. ..|..++.++....++|++.|+++++||+.
T Consensus 220 Rvw-----~~~~~~n--~i~t~gedctcrvW~~~~--~~l-~~y~~h~g~~iw~~~~~~~~~~~vT~g~Ds~lk~~~l~ 288 (967)
T KOG0974|consen 220 RVW-----ACCFLPN--RIITVGEDCTCRVWGVNG--TQL-EVYDEHSGKGIWKIAVPIGVIIKVTGGNDSTLKLWDLN 288 (967)
T ss_pred eeE-----EEEeccc--eeEEeccceEEEEEeccc--cee-hhhhhhhhcceeEEEEcCCceEEEeeccCcchhhhhhh
Confidence 999 9999998 899999999999997773 222 2455664 458899999999999999999999999986
No 206
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.47 E-value=1.1e-05 Score=99.10 Aligned_cols=154 Identities=10% Similarity=0.084 Sum_probs=99.4
Q ss_pred CcEEEEEEcCCCCEEEEEEec--cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEE-ECCC--cEEEEEcC
Q 000425 7 DTEVHLALTPLQPVVFFGFHR--RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVA-YADG--LIRAYNIH 81 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~--d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSG-s~Dg--tIrLWDl~ 81 (1525)
+.....+|+|++..+++.... +..+.+++. .++. ...+.. +.+.+..++|+|+|+.++.. +.++ .|.+||+.
T Consensus 199 ~~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl-~~G~-~~~l~~-~~~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~ 275 (429)
T PRK03629 199 QPLMSPAWSPDGSKLAYVTFESGRSALVIQTL-ANGA-VRQVAS-FPRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLA 275 (429)
T ss_pred CceeeeEEcCCCCEEEEEEecCCCcEEEEEEC-CCCC-eEEccC-CCCCcCCeEECCCCCEEEEEEcCCCCcEEEEEECC
Confidence 356789999999988776532 344555553 3332 222222 34455678999999987654 4455 48889998
Q ss_pred CCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC-CCcEEEE--eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 82 TYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR-RGTLLAW--DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 82 t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~-DgtVrVW--Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
+++......+...+. ...|+|||++|+..++ ++...+| |..+..... +..+...+.+..|+|||++|+.
T Consensus 276 tg~~~~lt~~~~~~~-----~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~---lt~~~~~~~~~~~SpDG~~Ia~ 347 (429)
T PRK03629 276 SGQIRQVTDGRSNNT-----EPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQR---ITWEGSQNQDADVSSDGKFMVM 347 (429)
T ss_pred CCCEEEccCCCCCcC-----ceEECCCCCEEEEEeCCCCCceEEEEECCCCCeEE---eecCCCCccCEEECCCCCEEEE
Confidence 887654333545566 8999999999887764 4555566 555321111 1233445667899999999887
Q ss_pred EECCC---CeeEEEee
Q 000425 159 LCRDG---SLQVWKTR 171 (1525)
Q Consensus 159 gS~Dg---tIkVWDlr 171 (1525)
.+.++ .|.+||+.
T Consensus 348 ~~~~~g~~~I~~~dl~ 363 (429)
T PRK03629 348 VSSNGGQQHIAKQDLA 363 (429)
T ss_pred EEccCCCceEEEEECC
Confidence 66543 46667754
No 207
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=98.43 E-value=2.1e-06 Score=103.11 Aligned_cols=157 Identities=13% Similarity=0.129 Sum_probs=120.3
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC--CCEEEEEECCCcEEEEEcCCC----
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR--LPVLYVAYADGLIRAYNIHTY---- 83 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd--g~~LaSGs~DgtIrLWDl~t~---- 83 (1525)
=+|+++-++.++..|+ +|..+.||+..+ -+....+.++|...|.++.|-|. ...+++|+.|..|+++|+...
T Consensus 54 N~LeWn~dG~lL~SGS-DD~r~ivWd~~~-~KllhsI~TgHtaNIFsvKFvP~tnnriv~sgAgDk~i~lfdl~~~~~~~ 131 (758)
T KOG1310|consen 54 NCLEWNADGELLASGS-DDTRLIVWDPFE-YKLLHSISTGHTANIFSVKFVPYTNNRIVLSGAGDKLIKLFDLDSSKEGG 131 (758)
T ss_pred cceeecCCCCEEeecC-CcceEEeecchh-cceeeeeecccccceeEEeeeccCCCeEEEeccCcceEEEEecccccccc
Confidence 3689999999987777 578888999774 46667788889999999999985 457999999999999998741
Q ss_pred ------cEEEEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEee-------cccC--cCeEE
Q 000425 84 ------AVHYTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGII-------QVGS--QPITS 146 (1525)
Q Consensus 84 ------k~l~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il-------~gH~--~~Vts 146 (1525)
...+.+. |.+.|. .++-.|+++ .+.+++.||+++-+|++.. ..+.... .-|. -...|
T Consensus 132 ~d~~~~~~~~~~~cht~rVK-----ria~~p~~PhtfwsasEDGtirQyDiREp-h~c~p~~~~~~~l~ny~~~lielk~ 205 (758)
T KOG1310|consen 132 MDHGMEETTRCWSCHTDRVK-----RIATAPNGPHTFWSASEDGTIRQYDIREP-HVCNPDEDCPSILVNYNPQLIELKC 205 (758)
T ss_pred cccCccchhhhhhhhhhhhh-----heecCCCCCceEEEecCCcceeeecccCC-ccCCccccccHHHHHhchhhheeee
Confidence 3344555 889999 899999984 5679999999999999952 1111000 1111 23468
Q ss_pred EEEcCCCC-EEEEEECCCCeeEEEeeeec
Q 000425 147 VAWLPMLR-LLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 147 VafsPdg~-~LaSgS~DgtIkVWDlr~~~ 174 (1525)
+..+|..+ +|+.|+.|--.+++|.|..+
T Consensus 206 ltisp~rp~~laVGgsdpfarLYD~Rr~l 234 (758)
T KOG1310|consen 206 LTISPSRPYYLAVGGSDPFARLYDRRRVL 234 (758)
T ss_pred eeecCCCCceEEecCCCchhhhhhhhhhc
Confidence 99999765 78899999999999977443
No 208
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.43 E-value=2.3e-07 Score=109.67 Aligned_cols=118 Identities=18% Similarity=0.258 Sum_probs=100.3
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER 130 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~ 130 (1525)
..|..+.|-|-.-+|++++..|.++--|+.+++.+..+. ....+. .++-+|-+-.+-.|...|+|.+|.-.+..
T Consensus 210 ~~v~rLeFLPyHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~~-----vm~qNP~NaVih~GhsnGtVSlWSP~ske 284 (545)
T KOG1272|consen 210 IRVARLEFLPYHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRTD-----VMKQNPYNAVIHLGHSNGTVSLWSPNSKE 284 (545)
T ss_pred CchhhhcccchhheeeecccCCceEEEeechhhhhHHHHccCCccc-----hhhcCCccceEEEcCCCceEEecCCCCcc
Confidence 467788999988889999999999999999999998888 667777 77888988888899999999999877531
Q ss_pred ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCC
Q 000425 131 PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINP 176 (1525)
Q Consensus 131 ~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p 176 (1525)
+-+ -+-.|.++|++|+++++|+|++|.+.|..|+|||+|.+..+
T Consensus 285 PLv--KiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~~~ql 328 (545)
T KOG1272|consen 285 PLV--KILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRNFYQL 328 (545)
T ss_pred hHH--HHHhcCCCcceEEECCCCcEEeecccccceeEeeecccccc
Confidence 211 13379999999999999999999999999999999976543
No 209
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=98.41 E-value=3.2e-06 Score=106.52 Aligned_cols=151 Identities=15% Similarity=0.193 Sum_probs=111.4
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCC--ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEE-cC--C-C
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGG--RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYN-IH--T-Y 83 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg--k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWD-l~--t-~ 83 (1525)
..|...|.+-+..... ...+.+|...++. .....+...|...+++.++||++.++|+|..||.|.+|. +. . .
T Consensus 164 ~~I~~~~~ge~~~i~~--~~~~~~~~v~~~~~~~~~~~~~~~Htf~~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~~~~~ 241 (792)
T KOG1963|consen 164 KSIVDNNSGEFKGIVH--MCKIHIYFVPKHTKHTSSRDITVHHTFNITCVALSPNERYLAAGDSDGRILVWRDFGSSDDS 241 (792)
T ss_pred ccEEEcCCceEEEEEE--eeeEEEEEecccceeeccchhhhhhcccceeEEeccccceEEEeccCCcEEEEecccccccc
Confidence 3455555555433332 2333344433321 122223335888899999999999999999999999997 33 1 1
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECC
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRD 162 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~D 162 (1525)
.....+. |...|+ ++.|+++|.+|+||+..+.+-+|.+.+..+++. -.-.+.|..+.++||+.+.+....|
T Consensus 242 ~t~t~lHWH~~~V~-----~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~kqfL---PRLgs~I~~i~vS~ds~~~sl~~~D 313 (792)
T KOG1963|consen 242 ETCTLLHWHHDEVN-----SLSFSSDGAYLLSGGREGVLVLWQLETGKKQFL---PRLGSPILHIVVSPDSDLYSLVLED 313 (792)
T ss_pred ccceEEEecccccc-----eeEEecCCceEeecccceEEEEEeecCCCcccc---cccCCeeEEEEEcCCCCeEEEEecC
Confidence 2345666 889999 999999999999999999999999997644433 3456789999999999999999999
Q ss_pred CCeeEEEe
Q 000425 163 GSLQVWKT 170 (1525)
Q Consensus 163 gtIkVWDl 170 (1525)
..|.+-..
T Consensus 314 NqI~li~~ 321 (792)
T KOG1963|consen 314 NQIHLIKA 321 (792)
T ss_pred ceEEEEec
Confidence 99988865
No 210
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=98.41 E-value=0.0039 Score=83.58 Aligned_cols=223 Identities=10% Similarity=0.078 Sum_probs=129.7
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEee-cCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTV-EGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~ti-eggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
.+..+.+.+-++.-.+++++. .+.|.++... +.......+-+.-...|.+++|+||+.+++....++++-+.+ +++.
T Consensus 75 ~~~ivs~~yl~d~~~l~~~~~-~Gdi~~~~~~~~~~~~~~E~VG~vd~GI~a~~WSPD~Ella~vT~~~~l~~mt-~~fd 152 (928)
T PF04762_consen 75 NDKIVSFQYLADSESLCIALA-SGDIILVREDPDPDEDEIEIVGSVDSGILAASWSPDEELLALVTGEGNLLLMT-RDFD 152 (928)
T ss_pred CCcEEEEEeccCCCcEEEEEC-CceEEEEEccCCCCCceeEEEEEEcCcEEEEEECCCcCEEEEEeCCCEEEEEe-ccce
Confidence 456677777777777766664 5666555211 112222222233567899999999999999988888887764 2222
Q ss_pred EEEEEe-----------------------cC---C-----------------ce-eecCceeEEEeCCCCEEEEEeC---
Q 000425 85 VHYTLQ-----------------------LD---N-----------------TI-KLLGAGAFAFHPTLEWLFVGDR--- 117 (1525)
Q Consensus 85 ~l~tL~-----------------------H~---~-----------------~V-~~~ga~sVaFSPdg~~L~SgS~--- 117 (1525)
++.... |. + .+ ...+...|+|..||.|+++.+-
T Consensus 153 ~i~E~~l~~~~~~~~~~VsVGWGkKeTQF~Gs~gK~aa~~~~~p~~~~~d~~~~s~dd~~~~ISWRGDG~yFAVss~~~~ 232 (928)
T PF04762_consen 153 PISEVPLDSDDFGESKHVSVGWGKKETQFHGSAGKAAARQLRDPTVPKVDEGKLSWDDGRVRISWRGDGEYFAVSSVEPE 232 (928)
T ss_pred EEEEeecCccccCCCceeeeccCcccCccCcchhhhhhhhccCCCCCccccCccccCCCceEEEECCCCcEEEEEEEEcC
Confidence 221111 00 0 00 1113457899999999998773
Q ss_pred -C--CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccc
Q 000425 118 -R--GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESI 194 (1525)
Q Consensus 118 -D--gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesl 194 (1525)
. +.+||||-+.. ...+.+.-.+--.+++|.|.|.+|++...-..- +|+ .||+.+|...-
T Consensus 233 ~~~~R~iRVy~ReG~---L~stSE~v~gLe~~l~WrPsG~lIA~~q~~~~~--~~V-------------vFfErNGLrhg 294 (928)
T PF04762_consen 233 TGSRRVIRVYSREGE---LQSTSEPVDGLEGALSWRPSGNLIASSQRLPDR--HDV-------------VFFERNGLRHG 294 (928)
T ss_pred CCceeEEEEECCCce---EEeccccCCCccCCccCCCCCCEEEEEEEcCCC--cEE-------------EEEecCCcEee
Confidence 2 68999997721 111111111122468999999999987641100 111 24444443221
Q ss_pred cccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccccccCceeeeeeeccccC
Q 000425 195 DIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSARG 262 (1525)
Q Consensus 195 di~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~~~ 262 (1525)
++.--.. ..+ ..+..+.|.++..+.|+++.. .+..|.+..-+|.+-+.++...+
T Consensus 295 eF~l~~~-~~~------~~v~~l~Wn~ds~iLAv~~~~-------~vqLWt~~NYHWYLKqei~~~~~ 348 (928)
T PF04762_consen 295 EFTLRFD-PEE------EKVIELAWNSDSEILAVWLED-------RVQLWTRSNYHWYLKQEIRFSSS 348 (928)
T ss_pred eEecCCC-CCC------ceeeEEEECCCCCEEEEEecC-------CceEEEeeCCEEEEEEEEEccCC
Confidence 1110000 000 247889999999999988744 16778777777777766665444
No 211
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=98.41 E-value=7.1e-05 Score=92.18 Aligned_cols=112 Identities=16% Similarity=0.230 Sum_probs=88.4
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC---------------cEEEEEe-cCCceeecCceeEEEeCCCCEEEEE
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTY---------------AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVG 115 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~---------------k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~Sg 115 (1525)
....+++|+....++++|+.||.+|+.-+.+. ....+++ |..+|. -+.|+-..+.|-++
T Consensus 15 vkL~c~~WNke~gyIAcgG~dGlLKVlKl~t~t~d~~~~glaa~snLsmNQtLeGH~~sV~-----vvTWNe~~QKLTtS 89 (1189)
T KOG2041|consen 15 VKLHCAEWNKESGYIACGGADGLLKVLKLGTDTTDLNKSGLAAASNLSMNQTLEGHNASVM-----VVTWNENNQKLTTS 89 (1189)
T ss_pred ceEEEEEEcccCCeEEeccccceeEEEEccccCCcccccccccccccchhhhhccCcceEE-----EEEecccccccccc
Confidence 45678999999999999999999999976542 2245677 999999 99999988888888
Q ss_pred eCCCcEEEEeCCCC--CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 116 DRRGTLLAWDVSIE--RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 116 S~DgtVrVWDl~t~--~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
..+|.|.||=+-.+ ...++ -+...+.|.+++|+.||..|+..-.||.|.|=.+
T Consensus 90 Dt~GlIiVWmlykgsW~EEMi--NnRnKSvV~SmsWn~dG~kIcIvYeDGavIVGsv 144 (1189)
T KOG2041|consen 90 DTSGLIIVWMLYKGSWCEEMI--NNRNKSVVVSMSWNLDGTKICIVYEDGAVIVGSV 144 (1189)
T ss_pred CCCceEEEEeeecccHHHHHh--hCcCccEEEEEEEcCCCcEEEEEEccCCEEEEee
Confidence 89999999976532 12232 1234566889999999999988888888866544
No 212
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=98.40 E-value=4.1e-06 Score=92.40 Aligned_cols=109 Identities=13% Similarity=0.166 Sum_probs=84.3
Q ss_pred cEEEEEEcCCCCEEEEEE-eccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECC---CcEEEEEcCCC
Q 000425 8 TEVHLALTPLQPVVFFGF-HRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD---GLIRAYNIHTY 83 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~-~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D---gtIrLWDl~t~ 83 (1525)
....++++|++..+++.. .....+.+++. + +.....+ +..++..|.|+|+|+++++|+.+ |.|.+||.++.
T Consensus 61 ~I~~~~WsP~g~~favi~g~~~~~v~lyd~-~-~~~i~~~---~~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~ 135 (194)
T PF08662_consen 61 PIHDVAWSPNGNEFAVIYGSMPAKVTLYDV-K-GKKIFSF---GTQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKK 135 (194)
T ss_pred ceEEEEECcCCCEEEEEEccCCcccEEEcC-c-ccEeEee---cCCCceEEEECCCCCEEEEEEccCCCcEEEEEECCCC
Confidence 367899999988765443 23457777775 2 3333322 45788899999999999998754 66999999999
Q ss_pred cEEEEEecCCceeecCceeEEEeCCCCEEEEEeC------CCcEEEEeCC
Q 000425 84 AVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR------RGTLLAWDVS 127 (1525)
Q Consensus 84 k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~------DgtVrVWDl~ 127 (1525)
+.+.+++|.. +. .++|+|||++++++.. |+.++||++.
T Consensus 136 ~~i~~~~~~~-~t-----~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 136 KKISTFEHSD-AT-----DVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred EEeeccccCc-EE-----EEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 9888887654 56 8999999999998873 7889999987
No 213
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.39 E-value=4.1e-06 Score=97.97 Aligned_cols=159 Identities=13% Similarity=0.160 Sum_probs=126.0
Q ss_pred CCcEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEcc------CCC--CCeEEEEEcCC--CCEEEEEECCC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKT------DLK--KPIVNLACHPR--LPVLYVAYADG 73 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~------~H~--~~V~sLaf~Pd--g~~LaSGs~Dg 73 (1525)
+.++.++.-+|..|.+++..... ..+.+|+... .+..+.-+. +.. -.++++.|-|. ...|+++..-+
T Consensus 148 g~g~~~~r~~~~~p~Iva~GGke~~n~lkiwdle~-~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~h 226 (412)
T KOG3881|consen 148 GPGLYDVRQTDTDPYIVATGGKENINELKIWDLEQ-SKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYH 226 (412)
T ss_pred CCceeeeccCCCCCceEecCchhcccceeeeeccc-ceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecce
Confidence 46788889999999988755555 5667777533 233332222 112 23567889887 66899999999
Q ss_pred cEEEEEcCCC-cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE-eecccCcCeEEEEEc
Q 000425 74 LIRAYNIHTY-AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG-IIQVGSQPITSVAWL 150 (1525)
Q Consensus 74 tIrLWDl~t~-k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~-il~gH~~~VtsVafs 150 (1525)
.+|+||.+.+ +++..|. ..+++. ++..-|+++.|++|..-+.+..+|.++ ....+ .+.+-.+.|++|.-|
T Consensus 227 qvR~YDt~~qRRPV~~fd~~E~~is-----~~~l~p~gn~Iy~gn~~g~l~~FD~r~--~kl~g~~~kg~tGsirsih~h 299 (412)
T KOG3881|consen 227 QVRLYDTRHQRRPVAQFDFLENPIS-----STGLTPSGNFIYTGNTKGQLAKFDLRG--GKLLGCGLKGITGSIRSIHCH 299 (412)
T ss_pred eEEEecCcccCcceeEeccccCcce-----eeeecCCCcEEEEecccchhheecccC--ceeeccccCCccCCcceEEEc
Confidence 9999998865 6888888 888999 999999999999999999999999995 33332 256778899999999
Q ss_pred CCCCEEEEEECCCCeeEEEeee
Q 000425 151 PMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 151 Pdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
|.++++++|+.|..|||+|+.+
T Consensus 300 p~~~~las~GLDRyvRIhD~kt 321 (412)
T KOG3881|consen 300 PTHPVLASCGLDRYVRIHDIKT 321 (412)
T ss_pred CCCceEEeeccceeEEEeeccc
Confidence 9999999999999999999974
No 214
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=98.39 E-value=1.5e-06 Score=97.30 Aligned_cols=156 Identities=13% Similarity=0.177 Sum_probs=112.6
Q ss_pred CCCeEEEEEcCC-CC--EEEEEECCCcEEEEEcCCCcE----------EEEEe-cCCceeecCceeEEEeCCCCEEEEEe
Q 000425 51 KKPIVNLACHPR-LP--VLYVAYADGLIRAYNIHTYAV----------HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 51 ~~~V~sLaf~Pd-g~--~LaSGs~DgtIrLWDl~t~k~----------l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS 116 (1525)
.+.|++..+..+ +. +++.|..+|.|-+||+.++.. ..... |..+|. ++.|-+.-..=++|+
T Consensus 150 lgsvmc~~~~~~c~s~~lllaGyEsghvv~wd~S~~~~~~~~~~~~kv~~~~ash~qpvl-----sldyas~~~rGisgg 224 (323)
T KOG0322|consen 150 LGSVMCQDKDHACGSTFLLLAGYESGHVVIWDLSTGDKIIQLPQSSKVESPNASHKQPVL-----SLDYASSCDRGISGG 224 (323)
T ss_pred cCceeeeeccccccceEEEEEeccCCeEEEEEccCCceeeccccccccccchhhccCcce-----eeeechhhcCCcCCC
Confidence 356777775433 33 467888999999999998733 33333 788888 999987555556788
Q ss_pred CCCcEEEEeCCCCCcee--eEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccc
Q 000425 117 RRGTLLAWDVSIERPSM--IGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESI 194 (1525)
Q Consensus 117 ~DgtVrVWDl~t~~~s~--i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesl 194 (1525)
.+..+.+|+++....++ ...++-.+-.|..+..-||++.++|+++|+.||||..|+... +
T Consensus 225 a~dkl~~~Sl~~s~gslq~~~e~~lknpGv~gvrIRpD~KIlATAGWD~RiRVyswrtl~p------------------L 286 (323)
T KOG0322|consen 225 ADDKLVMYSLNHSTGSLQIRKEITLKNPGVSGVRIRPDGKILATAGWDHRIRVYSWRTLNP------------------L 286 (323)
T ss_pred ccccceeeeeccccCcccccceEEecCCCccceEEccCCcEEeecccCCcEEEEEeccCCc------------------h
Confidence 78888899887421111 122233345688899999999999999999999999985421 0
Q ss_pred cccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 195 DIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 195 di~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
-+|.++. ..|++++|||..|++| .++.|..+-.|+
T Consensus 287 ---AVLkyHs-------agvn~vAfspd~~lmA------aaskD~rISLWk 321 (323)
T KOG0322|consen 287 ---AVLKYHS-------AGVNAVAFSPDCELMA------AASKDARISLWK 321 (323)
T ss_pred ---hhhhhhh-------cceeEEEeCCCCchhh------hccCCceEEeee
Confidence 1333333 4689999999999998 888888888776
No 215
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.35 E-value=2e-05 Score=90.47 Aligned_cols=105 Identities=22% Similarity=0.303 Sum_probs=86.5
Q ss_pred CCEEEEEEeccceEEEEEeecCCc-cceEEccCCCCCeEEEEEcCCCCEEEEEECCCc-EEEEEcCCCcEEEEEe---cC
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGR-APTKIKTDLKKPIVNLACHPRLPVLYVAYADGL-IRAYNIHTYAVHYTLQ---LD 92 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk-~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~Dgt-IrLWDl~t~k~l~tL~---H~ 92 (1525)
..++.|-......+.+.+...... ++..+.. |...|.|++.+-+|.++|++|..|| |||||..+++.+..++ ..
T Consensus 148 k~~LafPg~k~GqvQi~dL~~~~~~~p~~I~A-H~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~ 226 (346)
T KOG2111|consen 148 KSLLAFPGFKTGQVQIVDLASTKPNAPSIINA-HDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDR 226 (346)
T ss_pred ceEEEcCCCccceEEEEEhhhcCcCCceEEEc-ccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCch
Confidence 334445555567777777644322 4566666 9999999999999999999999998 8999999999999998 34
Q ss_pred CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 93 NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 93 ~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
..+. +++|||+..||+++|+-||+.|+.++.
T Consensus 227 A~iy-----~iaFSp~~s~LavsSdKgTlHiF~l~~ 257 (346)
T KOG2111|consen 227 ADIY-----CIAFSPNSSWLAVSSDKGTLHIFSLRD 257 (346)
T ss_pred heEE-----EEEeCCCccEEEEEcCCCeEEEEEeec
Confidence 5677 999999999999999999999998774
No 216
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=98.33 E-value=1.2e-05 Score=89.17 Aligned_cols=155 Identities=14% Similarity=0.184 Sum_probs=110.4
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
.+-+-|..--++++. .|.-+..++ .|.++....+++ |++.|-++.--.....+++|+.||++|+||.++.+++..+.
T Consensus 119 am~ldP~enSi~~Ag-GD~~~y~~d-lE~G~i~r~~rG-HtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~v~~ie 195 (325)
T KOG0649|consen 119 AMWLDPSENSILFAG-GDGVIYQVD-LEDGRIQREYRG-HTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKHVSMIE 195 (325)
T ss_pred eeEeccCCCcEEEec-CCeEEEEEE-ecCCEEEEEEcC-CcceeeeeeecccCcceeecCCCccEEEEeccccceeEEec
Confidence 456778877777776 566666776 577777777777 99999999986677789999999999999999999999988
Q ss_pred ---cCCcee-ecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 91 ---LDNTIK-LLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 91 ---H~~~V~-~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
|.+-.+ ..|.+--+.--+..|+++|+. ..+.+|++++....++- .-..+|..+.|.. ..+++++....|.
T Consensus 196 ~yk~~~~lRp~~g~wigala~~edWlvCGgG-p~lslwhLrsse~t~vf---pipa~v~~v~F~~--d~vl~~G~g~~v~ 269 (325)
T KOG0649|consen 196 PYKNPNLLRPDWGKWIGALAVNEDWLVCGGG-PKLSLWHLRSSESTCVF---PIPARVHLVDFVD--DCVLIGGEGNHVQ 269 (325)
T ss_pred cccChhhcCcccCceeEEEeccCceEEecCC-CceeEEeccCCCceEEE---ecccceeEeeeec--ceEEEecccccee
Confidence 222111 012211233345679888764 46789999975333331 2345567777854 4788888888999
Q ss_pred EEEeeeec
Q 000425 167 VWKTRVII 174 (1525)
Q Consensus 167 VWDlr~~~ 174 (1525)
-|.+...+
T Consensus 270 ~~~l~Gvl 277 (325)
T KOG0649|consen 270 SYTLNGVL 277 (325)
T ss_pred eeeeccEE
Confidence 99887433
No 217
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=98.32 E-value=1.1e-05 Score=102.94 Aligned_cols=190 Identities=15% Similarity=0.161 Sum_probs=124.1
Q ss_pred EEEEEeccceEEEEEeecCC-ccceEEcc-----CC----CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 21 VFFGFHRRMSVTVVGTVEGG-RAPTKIKT-----DL----KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 21 Vf~s~~~d~sI~vv~tiegg-k~~tklk~-----~H----~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
++..++.+..|++|...... +.+..+.. ++ .+.-.-++|.-...+|++++.-..|||||.+...+...+-
T Consensus 1125 LlLtas~dGvIRIwk~y~~~~~~~eLVTaw~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~r~IRIWDa~~E~~~~diP 1204 (1387)
T KOG1517|consen 1125 LLLTASSDGVIRIWKDYADKWKKPELVTAWSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDVRSIRIWDAHKEQVVADIP 1204 (1387)
T ss_pred heeeeccCceEEEecccccccCCceeEEeeccccccCccCCCCCeeeehhhhCCeEEecCCeeEEEEEecccceeEeecc
Confidence 33444568899998765443 22211110 11 1112346787776777777778999999999888777776
Q ss_pred --cCCceeecCceeEEE-eCCCCEEEEEeCCCcEEEEeCCCC-CceeeEeecccCcC--eEEEEEcCCCC-EEEEEECCC
Q 000425 91 --LDNTIKLLGAGAFAF-HPTLEWLFVGDRRGTLLAWDVSIE-RPSMIGIIQVGSQP--ITSVAWLPMLR-LLVTLCRDG 163 (1525)
Q Consensus 91 --H~~~V~~~ga~sVaF-SPdg~~L~SgS~DgtVrVWDl~t~-~~s~i~il~gH~~~--VtsVafsPdg~-~LaSgS~Dg 163 (1525)
....++ ++.= .+.|+.|+.|-.||+||+||.+.. ....+..+..|+.. |..+.+.+.|- .|+|||.||
T Consensus 1205 ~~s~t~vT-----aLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G 1279 (1387)
T KOG1517|consen 1205 YGSSTLVT-----ALSADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDG 1279 (1387)
T ss_pred cCCCccce-----eecccccCCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCC
Confidence 344555 4432 234788999999999999999863 33445556789888 99999998764 499999999
Q ss_pred CeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecCCCeEEEEecccccCCcccccc
Q 000425 164 SLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRA 243 (1525)
Q Consensus 164 tIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~ 243 (1525)
.|++||+|.... . ++ +++..- +..|+ .++++..|+....+| +|+- ..+++
T Consensus 1280 ~I~~~DlR~~~~--e-----~~--------~~iv~~-~~yGs-------~lTal~VH~hapiiA------sGs~-q~ikI 1329 (1387)
T KOG1517|consen 1280 DIQLLDLRMSSK--E-----TF--------LTIVAH-WEYGS-------ALTALTVHEHAPIIA------SGSA-QLIKI 1329 (1387)
T ss_pred eEEEEecccCcc--c-----cc--------ceeeec-cccCc-------cceeeeeccCCCeee------ecCc-ceEEE
Confidence 999999995311 0 11 111111 11121 367899999999988 5554 55554
Q ss_pred cc
Q 000425 244 AY 245 (1525)
Q Consensus 244 ~~ 245 (1525)
|.
T Consensus 1330 y~ 1331 (1387)
T KOG1517|consen 1330 YS 1331 (1387)
T ss_pred Ee
Confidence 44
No 218
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.32 E-value=1e-05 Score=93.90 Aligned_cols=202 Identities=15% Similarity=0.212 Sum_probs=124.6
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCC---cc--------ceEEcc-CCCCCeEEEEEcCCCCEEEEEEC-CCc
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGG---RA--------PTKIKT-DLKKPIVNLACHPRLPVLYVAYA-DGL 74 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tiegg---k~--------~tklk~-~H~~~V~sLaf~Pdg~~LaSGs~-Dgt 74 (1525)
-.-+||+-|...-.+.-. .+..|-+|...... +. ...++. +| .+|++++|++||..+++++. |..
T Consensus 142 nvtclawRPlsaselavg-Cr~gIciW~~s~tln~~r~~~~~s~~~~qvl~~pgh-~pVtsmqwn~dgt~l~tAS~gsss 219 (445)
T KOG2139|consen 142 NVTCLAWRPLSASELAVG-CRAGICIWSDSRTLNANRNIRMMSTHHLQVLQDPGH-NPVTSMQWNEDGTILVTASFGSSS 219 (445)
T ss_pred ceeEEEeccCCcceeeee-ecceeEEEEcCcccccccccccccccchhheeCCCC-ceeeEEEEcCCCCEEeecccCcce
Confidence 345789999865544433 36667666542221 11 111111 13 68999999999999998875 678
Q ss_pred EEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC--CceeeEeecccCcCeEEEEEcC
Q 000425 75 IRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE--RPSMIGIIQVGSQPITSVAWLP 151 (1525)
Q Consensus 75 IrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~--~~s~i~il~gH~~~VtsVafsP 151 (1525)
|.|||..++..+.... ....+. -+.|||||.+|+++.-|+..++|+.... ....+ -..++|...+|+|
T Consensus 220 i~iWdpdtg~~~pL~~~glgg~s-----lLkwSPdgd~lfaAt~davfrlw~e~q~wt~erw~----lgsgrvqtacWsp 290 (445)
T KOG2139|consen 220 IMIWDPDTGQKIPLIPKGLGGFS-----LLKWSPDGDVLFAATCDAVFRLWQENQSWTKERWI----LGSGRVQTACWSP 290 (445)
T ss_pred EEEEcCCCCCcccccccCCCcee-----eEEEcCCCCEEEEecccceeeeehhcccceeccee----ccCCceeeeeecC
Confidence 9999999987766554 556677 8999999999999999999999965532 11222 2355899999999
Q ss_pred CCCEEE-EEECCCCeeEEEeeeecCCCCCCCccccccCCCcc----cccccccccccCCcceecCCCeeEEEEecCCCeE
Q 000425 152 MLRLLV-TLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIE----SIDIPRILSQQGGEAVYPLPRVRALEVHPRLNLA 226 (1525)
Q Consensus 152 dg~~La-SgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVe----sldi~~iLs~~gG~~vy~l~~V~~v~~HP~~Nli 226 (1525)
+|+.|+ +++. .=+++.+.....+. . +..+.++. ..|++.+-...+ .... --...+++|+|.++..
T Consensus 291 cGsfLLf~~sg--sp~lysl~f~~~~~---~---~~~~~~~k~~lliaDL~e~ti~ag-~~l~-cgeaq~lawDpsGeyL 360 (445)
T KOG2139|consen 291 CGSFLLFACSG--SPRLYSLTFDGEDS---V---FLRPQSIKRVLLIADLQEVTICAG-QRLC-CGEAQCLAWDPSGEYL 360 (445)
T ss_pred CCCEEEEEEcC--CceEEEEeecCCCc---c---ccCcccceeeeeeccchhhhhhcC-cccc-cCccceeeECCCCCEE
Confidence 998655 4443 34456554221110 0 00011111 123332222221 0000 0135789999999998
Q ss_pred EEEe
Q 000425 227 VLLF 230 (1525)
Q Consensus 227 al~F 230 (1525)
|+.|
T Consensus 361 av~f 364 (445)
T KOG2139|consen 361 AVIF 364 (445)
T ss_pred EEEE
Confidence 8666
No 219
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=98.30 E-value=5.3e-06 Score=106.13 Aligned_cols=149 Identities=13% Similarity=0.132 Sum_probs=112.1
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG 135 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~ 135 (1525)
+.++++.-++++|+.-+.|.+|+....+.-..+. |++.+. ++.|+-||.++++.|+|+++|+|++.+. .....
T Consensus 139 ~g~s~~~~~i~~gsv~~~iivW~~~~dn~p~~l~GHeG~iF-----~i~~s~dg~~i~s~SdDRsiRlW~i~s~-~~~~~ 212 (967)
T KOG0974|consen 139 IGDSAEELYIASGSVFGEIIVWKPHEDNKPIRLKGHEGSIF-----SIVTSLDGRYIASVSDDRSIRLWPIDSR-EVLGC 212 (967)
T ss_pred EeccCcEEEEEeccccccEEEEeccccCCcceecccCCceE-----EEEEccCCcEEEEEecCcceeeeecccc-cccCc
Confidence 4456667789999999999999976332222455 999999 9999999999999999999999999973 11112
Q ss_pred eecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCee
Q 000425 136 IIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVR 215 (1525)
Q Consensus 136 il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~ 215 (1525)
..-+|+.+|..++|+|+ +++|++.|.|.++|+.+...- . ...++.| ..+|
T Consensus 213 ~~fgHsaRvw~~~~~~n--~i~t~gedctcrvW~~~~~~l---------------------~-~y~~h~g------~~iw 262 (967)
T KOG0974|consen 213 TGFGHSARVWACCFLPN--RIITVGEDCTCRVWGVNGTQL---------------------E-VYDEHSG------KGIW 262 (967)
T ss_pred ccccccceeEEEEeccc--eeEEeccceEEEEEeccccee---------------------h-hhhhhhh------ccee
Confidence 34489999999999999 999999999999997652211 0 1111111 3477
Q ss_pred EEEEecCCCeEEEEecccccCCcccccccccc
Q 000425 216 ALEVHPRLNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 216 ~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
.++.++...... ++++|...|.|...
T Consensus 263 ~~~~~~~~~~~v------T~g~Ds~lk~~~l~ 288 (967)
T KOG0974|consen 263 KIAVPIGVIIKV------TGGNDSTLKLWDLN 288 (967)
T ss_pred EEEEcCCceEEE------eeccCcchhhhhhh
Confidence 888887665554 88999999988753
No 220
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=98.30 E-value=1.3e-06 Score=108.77 Aligned_cols=149 Identities=19% Similarity=0.224 Sum_probs=121.7
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t 88 (1525)
.-+.+|.-++-+|+.++ +++-+++|. .+...+.-...+ |++.|+.++.+....++++++.|..|++|-+.++.++..
T Consensus 193 Vyca~fDrtg~~Iitgs-dd~lvKiwS-~et~~~lAs~rG-hs~ditdlavs~~n~~iaaaS~D~vIrvWrl~~~~pvsv 269 (1113)
T KOG0644|consen 193 VYCAIFDRTGRYIITGS-DDRLVKIWS-METARCLASCRG-HSGDITDLAVSSNNTMIAAASNDKVIRVWRLPDGAPVSV 269 (1113)
T ss_pred eeeeeeccccceEeecC-ccceeeeee-ccchhhhccCCC-CccccchhccchhhhhhhhcccCceEEEEecCCCchHHH
Confidence 45677888899998888 478899998 566666666676 999999999998888999999999999999999999999
Q ss_pred Ee-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC----CceeeEeecccCcCeEEEEEcCCCCEEEEEECCC
Q 000425 89 LQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE----RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 89 L~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~----~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg 163 (1525)
+. |.+.|+ +++|+|.. +.++||++++||++-. .+.... . ....-+-++.|...+.-+.|++.|+
T Consensus 270 Lrghtgavt-----aiafsP~~----sss~dgt~~~wd~r~~~~~y~prp~~-~-~~~~~~~s~~~~~~~~~f~Tgs~d~ 338 (1113)
T KOG0644|consen 270 LRGHTGAVT-----AIAFSPRA----SSSDDGTCRIWDARLEPRIYVPRPLK-F-TEKDLVDSILFENNGDRFLTGSRDG 338 (1113)
T ss_pred Hhcccccee-----eeccCccc----cCCCCCceEeccccccccccCCCCCC-c-ccccceeeeeccccccccccccCCc
Confidence 99 999999 99999964 7789999999999821 111110 1 1234566788888888999999999
Q ss_pred CeeEEEee
Q 000425 164 SLQVWKTR 171 (1525)
Q Consensus 164 tIkVWDlr 171 (1525)
+-+.|...
T Consensus 339 ea~n~e~~ 346 (1113)
T KOG0644|consen 339 EARNHEFE 346 (1113)
T ss_pred ccccchhh
Confidence 99999765
No 221
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.28 E-value=2.8e-05 Score=95.54 Aligned_cols=152 Identities=15% Similarity=0.177 Sum_probs=102.5
Q ss_pred CcEEEEEEcCCCCEEEEEEec--cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCCc--EEEEEcC
Q 000425 7 DTEVHLALTPLQPVVFFGFHR--RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADGL--IRAYNIH 81 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~--d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~Dgt--IrLWDl~ 81 (1525)
+...+.+|+|++..+++.... +..+.+++. +++.. ..+.. +.+.+.+.+|+|+|+.++ +.+.++. |.+||+.
T Consensus 202 ~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl-~~g~~-~~l~~-~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~ 278 (435)
T PRK05137 202 SLVLTPRFSPNRQEITYMSYANGRPRVYLLDL-ETGQR-ELVGN-FPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLR 278 (435)
T ss_pred CCeEeeEECCCCCEEEEEEecCCCCEEEEEEC-CCCcE-EEeec-CCCcccCcEECCCCCEEEEEEecCCCceEEEEECC
Confidence 457889999999988777643 456667764 33322 23333 667788899999998765 5566665 6677888
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCC--cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRG--TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dg--tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
+++.. .+. +..... ..+|+|||++|+..+ .++ .|.+||..+.....+ ..+...+....|+|+|++|+
T Consensus 279 ~~~~~-~Lt~~~~~~~-----~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~l---t~~~~~~~~~~~SpdG~~ia 349 (435)
T PRK05137 279 SGTTT-RLTDSPAIDT-----SPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRI---SFGGGRYSTPVWSPRGDLIA 349 (435)
T ss_pred CCceE-EccCCCCccC-----ceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEe---ecCCCcccCeEECCCCCEEE
Confidence 77653 344 555566 789999999988776 333 577778775322222 23345567789999999998
Q ss_pred EEECCC---CeeEEEe
Q 000425 158 TLCRDG---SLQVWKT 170 (1525)
Q Consensus 158 SgS~Dg---tIkVWDl 170 (1525)
..+.++ .|.+||+
T Consensus 350 ~~~~~~~~~~i~~~d~ 365 (435)
T PRK05137 350 FTKQGGGQFSIGVMKP 365 (435)
T ss_pred EEEcCCCceEEEEEEC
Confidence 876543 4555654
No 222
>PRK05137 tolB translocation protein TolB; Provisional
Probab=98.22 E-value=8.2e-05 Score=91.48 Aligned_cols=130 Identities=12% Similarity=0.072 Sum_probs=88.8
Q ss_pred ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC---CCcEEEEEcCCCcEEEEEe-cCCceeecCceeEE
Q 000425 29 MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA---DGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFA 104 (1525)
Q Consensus 29 ~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~---DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVa 104 (1525)
..+.+++. .+.....+.. |...+.+.+|+|+|+.|+..+. +..|.+||+.+++.. .+. +...+. ..+
T Consensus 182 ~~l~~~d~--dg~~~~~lt~-~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~-~l~~~~g~~~-----~~~ 252 (435)
T PRK05137 182 KRLAIMDQ--DGANVRYLTD-GSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRE-LVGNFPGMTF-----APR 252 (435)
T ss_pred eEEEEECC--CCCCcEEEec-CCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEE-EeecCCCccc-----CcE
Confidence 34555543 2333444444 7889999999999998877654 468999999888664 344 556667 889
Q ss_pred EeCCCCEEE-EEeCCCc--EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC-CCC--eeEEEe
Q 000425 105 FHPTLEWLF-VGDRRGT--LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR-DGS--LQVWKT 170 (1525)
Q Consensus 105 FSPdg~~L~-SgS~Dgt--VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~-Dgt--IkVWDl 170 (1525)
|+|||+.|+ +.+.++. |.+||+.+. ... .+..|.....+.+|+|||+.|+..+. ++. |.+||+
T Consensus 253 ~SPDG~~la~~~~~~g~~~Iy~~d~~~~--~~~-~Lt~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~ 321 (435)
T PRK05137 253 FSPDGRKVVMSLSQGGNTDIYTMDLRSG--TTT-RLTDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNA 321 (435)
T ss_pred ECCCCCEEEEEEecCCCceEEEEECCCC--ceE-EccCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEEC
Confidence 999999875 5556655 666687753 221 23355666778999999998887774 444 444443
No 223
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.21 E-value=3.3e-05 Score=94.98 Aligned_cols=152 Identities=12% Similarity=0.076 Sum_probs=99.1
Q ss_pred cEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCC--cEEEEEcCC
Q 000425 8 TEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADG--LIRAYNIHT 82 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~Dg--tIrLWDl~t 82 (1525)
.....+|+|++..+++..... ..+.+++. .++.. ..+.. +.+...+.+|+|+|+.++ +.+.++ .|.+||+.+
T Consensus 205 ~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl-~~g~~-~~l~~-~~g~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~ 281 (433)
T PRK04922 205 PILSPAWSPDGKKLAYVSFERGRSAIYVQDL-ATGQR-ELVAS-FRGINGAPSFSPDGRRLALTLSRDGNPEIYVMDLGS 281 (433)
T ss_pred ccccccCCCCCCEEEEEecCCCCcEEEEEEC-CCCCE-EEecc-CCCCccCceECCCCCEEEEEEeCCCCceEEEEECCC
Confidence 356788999999888776433 34666664 33322 22322 455566789999998765 555565 599999988
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCc--EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGT--LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dgt--VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
++.. .+. +..... ..+|+|||++|+.++ .++. |.++|+.+.....+ ..+.......+|+|+|++|+.
T Consensus 282 g~~~-~lt~~~~~~~-----~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~l---t~~g~~~~~~~~SpDG~~Ia~ 352 (433)
T PRK04922 282 RQLT-RLTNHFGIDT-----EPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERL---TFQGNYNARASVSPDGKKIAM 352 (433)
T ss_pred CCeE-ECccCCCCcc-----ceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe---ecCCCCccCEEECCCCCEEEE
Confidence 7754 344 544455 789999999988776 4555 55556654322211 123344557899999999887
Q ss_pred EECCC---CeeEEEee
Q 000425 159 LCRDG---SLQVWKTR 171 (1525)
Q Consensus 159 gS~Dg---tIkVWDlr 171 (1525)
.+.++ .|.+||+.
T Consensus 353 ~~~~~~~~~I~v~d~~ 368 (433)
T PRK04922 353 VHGSGGQYRIAVMDLS 368 (433)
T ss_pred EECCCCceeEEEEECC
Confidence 66543 58888864
No 224
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.20 E-value=3e-05 Score=95.26 Aligned_cols=153 Identities=18% Similarity=0.119 Sum_probs=100.3
Q ss_pred CcEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCCcEEEEEc--C
Q 000425 7 DTEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADGLIRAYNI--H 81 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~DgtIrLWDl--~ 81 (1525)
+.....+|+|++..+++....+ ..+.+++. .++.. ..+.. ..+.+.+.+|+|+|+.++ +.+.|+..+||.+ .
T Consensus 196 ~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl-~~g~~-~~l~~-~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~ 272 (427)
T PRK02889 196 EPIISPAWSPDGTKLAYVSFESKKPVVYVHDL-ATGRR-RVVAN-FKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNAD 272 (427)
T ss_pred CCcccceEcCCCCEEEEEEccCCCcEEEEEEC-CCCCE-EEeec-CCCCccceEECCCCCEEEEEEccCCCceEEEEECC
Confidence 4456789999999888876543 34666664 33322 23332 445667899999999776 5677888777764 4
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCC--CCCceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVS--IERPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~--t~~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
++. ...+. +..... ...|+|||++|+..+ .++...+|.+. +.....+ ..+.......+|+|||++|+
T Consensus 273 ~~~-~~~lt~~~~~~~-----~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~l---t~~g~~~~~~~~SpDG~~Ia 343 (427)
T PRK02889 273 GSG-LRRLTQSSGIDT-----EPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRV---TFTGSYNTSPRISPDGKLLA 343 (427)
T ss_pred CCC-cEECCCCCCCCc-----CeEEcCCCCEEEEEecCCCCcEEEEEECCCCceEEE---ecCCCCcCceEECCCCCEEE
Confidence 443 44454 554556 789999999888666 45677777654 2211111 12233344678999999998
Q ss_pred EEECCC---CeeEEEee
Q 000425 158 TLCRDG---SLQVWKTR 171 (1525)
Q Consensus 158 SgS~Dg---tIkVWDlr 171 (1525)
..+.++ .|.+||+.
T Consensus 344 ~~s~~~g~~~I~v~d~~ 360 (427)
T PRK02889 344 YISRVGGAFKLYVQDLA 360 (427)
T ss_pred EEEccCCcEEEEEEECC
Confidence 877655 58888864
No 225
>PRK03629 tolB translocation protein TolB; Provisional
Probab=98.19 E-value=5.3e-05 Score=93.20 Aligned_cols=145 Identities=10% Similarity=0.077 Sum_probs=95.7
Q ss_pred EEEEEEcCCCCEEEEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECC-CcEEEE--EcCCC
Q 000425 9 EVHLALTPLQPVVFFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD-GLIRAY--NIHTY 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D-gtIrLW--Dl~t~ 83 (1525)
...++|+|++..+++....+. .+.+++. +++ ....+.. +...+....|+|+|+.++..+.+ +..+|| |+.++
T Consensus 245 ~~~~~~SPDG~~La~~~~~~g~~~I~~~d~-~tg-~~~~lt~-~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g 321 (429)
T PRK03629 245 NGAPAFSPDGSKLAFALSKTGSLNLYVMDL-ASG-QIRQVTD-GRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGG 321 (429)
T ss_pred cCCeEECCCCCEEEEEEcCCCCcEEEEEEC-CCC-CEEEccC-CCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCC
Confidence 346799999999988765443 4666664 332 2333333 44567889999999988766654 455555 66666
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC---CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR---GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D---gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
+. ..+. +..... ...|+|||++|+..+.+ ..|.+||+.+.....+ ... ......+|+|||++|+.+
T Consensus 322 ~~-~~lt~~~~~~~-----~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~L--t~~--~~~~~p~~SpDG~~i~~~ 391 (429)
T PRK03629 322 AP-QRITWEGSQNQ-----DADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVL--TDT--FLDETPSIAPNGTMVIYS 391 (429)
T ss_pred Ce-EEeecCCCCcc-----CEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEe--CCC--CCCCCceECCCCCEEEEE
Confidence 54 3344 444556 78999999998876633 4588899886432222 112 223467899999999999
Q ss_pred ECCCCee
Q 000425 160 CRDGSLQ 166 (1525)
Q Consensus 160 S~DgtIk 166 (1525)
+.|+...
T Consensus 392 s~~~~~~ 398 (429)
T PRK03629 392 SSQGMGS 398 (429)
T ss_pred EcCCCce
Confidence 9888643
No 226
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=98.17 E-value=2.1e-05 Score=89.50 Aligned_cols=116 Identities=18% Similarity=0.238 Sum_probs=89.9
Q ss_pred eEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE--EEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCC
Q 000425 54 IVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY--TLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 54 V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~--tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
.-++.|++.+.-++++..+|.+.+-+........ +.+ |+-... ...|+- +.+.+++||+|+.+..||++..
T Consensus 124 ~lslD~~~~~~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He~E~W-----ta~f~~~~pnlvytGgDD~~l~~~D~R~p 198 (339)
T KOG0280|consen 124 ALSLDISTSGTKIFVSDSRGSISGVYETEMVLEKVQTWKVHEFEAW-----TAKFSDKEPNLVYTGGDDGSLSCWDIRIP 198 (339)
T ss_pred eeEEEeeccCceEEEEcCCCcEEEEecceeeeeecccccccceeee-----eeecccCCCceEEecCCCceEEEEEecCC
Confidence 3568899999999999999999976665554433 566 887777 788876 4467889999999999999943
Q ss_pred Cceee-EeecccCcCeEEEEEcC-CCCEEEEEECCCCeeEEEeeeecC
Q 000425 130 RPSMI-GIIQVGSQPITSVAWLP-MLRLLVTLCRDGSLQVWKTRVIIN 175 (1525)
Q Consensus 130 ~~s~i-~il~gH~~~VtsVafsP-dg~~LaSgS~DgtIkVWDlr~~~~ 175 (1525)
.+++ .....|...|.+|.=+| ...+|+|||-|-.|++||.|.+..
T Consensus 199 -~~~i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm~k 245 (339)
T KOG0280|consen 199 -KTFIWHNSKVHTSGVVSIYSSPPKPTYIATGSYDECIRVLDTRNMGK 245 (339)
T ss_pred -cceeeecceeeecceEEEecCCCCCceEEEeccccceeeeehhcccC
Confidence 2222 22346889999998886 577999999999999999996554
No 227
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.16 E-value=5.8e-05 Score=92.76 Aligned_cols=146 Identities=13% Similarity=0.112 Sum_probs=95.7
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CCcEEEEEc--CCCc
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DGLIRAYNI--HTYA 84 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-DgtIrLWDl--~t~k 84 (1525)
...+.+|+|++..++++...+....+|.....+.....+.. |...+.+..|+|+|+.++..+. ++...||.+ .+++
T Consensus 241 ~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~-~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~ 319 (427)
T PRK02889 241 SNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQ-SSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGA 319 (427)
T ss_pred CccceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCC-CCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCc
Confidence 34578999999999887766665555543222333444433 5556677899999998775554 566777764 4443
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC---cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG---TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg---tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
. ..+. +..... ..+|||||++|+..+.++ .|.+||+.+.....+ . . .......+|+||++.|+.++
T Consensus 320 ~-~~lt~~g~~~~-----~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~l--t-~-~~~~~~p~~spdg~~l~~~~ 389 (427)
T PRK02889 320 A-QRVTFTGSYNT-----SPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTAL--T-D-TTRDESPSFAPNGRYILYAT 389 (427)
T ss_pred e-EEEecCCCCcC-----ceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEc--c-C-CCCccCceECCCCCEEEEEE
Confidence 3 2233 333344 678999999998777543 699999986422222 1 2 22346789999999988887
Q ss_pred CCCC
Q 000425 161 RDGS 164 (1525)
Q Consensus 161 ~Dgt 164 (1525)
.++.
T Consensus 390 ~~~g 393 (427)
T PRK02889 390 QQGG 393 (427)
T ss_pred ecCC
Confidence 6654
No 228
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=98.16 E-value=6.4e-05 Score=90.96 Aligned_cols=153 Identities=14% Similarity=0.098 Sum_probs=102.2
Q ss_pred cEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEE-EECCC--cEEEEEcCC
Q 000425 8 TEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYV-AYADG--LIRAYNIHT 82 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaS-Gs~Dg--tIrLWDl~t 82 (1525)
.....+++|++.+++++.... ..+.+++. +.+. ...+.. +...+.+++|+|+++.|+. .+.++ .|.+||+.+
T Consensus 191 ~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~-~~g~-~~~~~~-~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~ 267 (417)
T TIGR02800 191 PILSPAWSPDGQKLAYVSFESGKPEIYVQDL-ATGQ-REKVAS-FPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDG 267 (417)
T ss_pred ceecccCCCCCCEEEEEEcCCCCcEEEEEEC-CCCC-EEEeec-CCCCccceEECCCCCEEEEEECCCCCccEEEEECCC
Confidence 356788999999998876543 45666664 3332 222333 5667778999999987654 44444 588889887
Q ss_pred CcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC-CC--cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE
Q 000425 83 YAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR-RG--TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 83 ~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~-Dg--tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
++......+..... ..+|+|||++|+.++. ++ .|.+||..+.... .+..+...+....|+|++++|+.+
T Consensus 268 ~~~~~l~~~~~~~~-----~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~---~l~~~~~~~~~~~~spdg~~i~~~ 339 (417)
T TIGR02800 268 KQLTRLTNGPGIDT-----EPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVR---RLTFRGGYNASPSWSPDGDLIAFV 339 (417)
T ss_pred CCEEECCCCCCCCC-----CEEECCCCCEEEEEECCCCCceEEEEECCCCCEE---EeecCCCCccCeEECCCCCEEEEE
Confidence 75433333544455 7799999999886663 33 5677777643211 122455667789999999999988
Q ss_pred ECCC---CeeEEEee
Q 000425 160 CRDG---SLQVWKTR 171 (1525)
Q Consensus 160 S~Dg---tIkVWDlr 171 (1525)
+.++ .|.+||+.
T Consensus 340 ~~~~~~~~i~~~d~~ 354 (417)
T TIGR02800 340 HREGGGFNIAVMDLD 354 (417)
T ss_pred EccCCceEEEEEeCC
Confidence 8776 67777764
No 229
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=98.13 E-value=1.3e-05 Score=94.18 Aligned_cols=155 Identities=10% Similarity=0.012 Sum_probs=102.4
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCc--cceEEccCCCCCeEEEEEcCCCCE---EEEEECCCcEEEEEcCCCcEE
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGR--APTKIKTDLKKPIVNLACHPRLPV---LYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk--~~tklk~~H~~~V~sLaf~Pdg~~---LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+..||.+..+.++... +...++...+..+ ....... ....-+++.|-.+... +-.+++++.+.+|....+.+.
T Consensus 68 ~~~s~~~~llAv~~~~-K~~~~f~~~~~~~~~kl~~~~~-v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~~~~~~ 145 (390)
T KOG3914|consen 68 VLTSDSGRLVAVATSS-KQRAVFDYRENPKGAKLLDVSC-VPKRPTAISFIREDTSVLVADKAGDVYSFDILSADSGRCE 145 (390)
T ss_pred cccCCCceEEEEEeCC-CceEEEEEecCCCcceeeeEee-cccCcceeeeeeccceEEEEeecCCceeeeeecccccCcc
Confidence 4456777777776654 4444444444332 2211111 1222233444333333 333455666777776666666
Q ss_pred EEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 87 YTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 87 ~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
-.+-|..-+. .|+|+||+++|+++..|..|||-.+-. ...+-+.+-||...|..++.-++. .|+|+|.|++|+
T Consensus 146 ~~lGhvSml~-----dVavS~D~~~IitaDRDEkIRvs~ypa-~f~IesfclGH~eFVS~isl~~~~-~LlS~sGD~tlr 218 (390)
T KOG3914|consen 146 PILGHVSMLL-----DVAVSPDDQFIITADRDEKIRVSRYPA-TFVIESFCLGHKEFVSTISLTDNY-LLLSGSGDKTLR 218 (390)
T ss_pred hhhhhhhhhh-----eeeecCCCCEEEEecCCceEEEEecCc-ccchhhhccccHhheeeeeeccCc-eeeecCCCCcEE
Confidence 6666888899 999999999999999999999977663 122223345899999999998775 489999999999
Q ss_pred EEEeeeecC
Q 000425 167 VWKTRVIIN 175 (1525)
Q Consensus 167 VWDlr~~~~ 175 (1525)
+||++....
T Consensus 219 ~Wd~~sgk~ 227 (390)
T KOG3914|consen 219 LWDITSGKL 227 (390)
T ss_pred EEecccCCc
Confidence 999985544
No 230
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=98.12 E-value=3.5e-06 Score=100.00 Aligned_cols=146 Identities=12% Similarity=0.185 Sum_probs=107.0
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ- 90 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~- 90 (1525)
|.|=|-. +++++.....-... ..+..|+.+..+.+ -.+.+..+.-+|-...+-+|...|+|.+|+..+.+++..+-
T Consensus 215 LeFLPyH-fLL~~~~~~G~L~Y-~DVS~GklVa~~~t-~~G~~~vm~qNP~NaVih~GhsnGtVSlWSP~skePLvKiLc 291 (545)
T KOG1272|consen 215 LEFLPYH-FLLVAASEAGFLKY-QDVSTGKLVASIRT-GAGRTDVMKQNPYNAVIHLGHSNGTVSLWSPNSKEPLVKILC 291 (545)
T ss_pred hcccchh-heeeecccCCceEE-EeechhhhhHHHHc-cCCccchhhcCCccceEEEcCCCceEEecCCCCcchHHHHHh
Confidence 4454444 44555544444444 33566665555555 56788889999999999999999999999999888887777
Q ss_pred cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 91 LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 91 H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
|..+|+ +|+++++|.|+++++.|+.|+|||++... ....... ......++||..| ++.+|....|.||.-
T Consensus 292 H~g~V~-----siAv~~~G~YMaTtG~Dr~~kIWDlR~~~-ql~t~~t--p~~a~~ls~Sqkg--lLA~~~G~~v~iw~d 361 (545)
T KOG1272|consen 292 HRGPVS-----SIAVDRGGRYMATTGLDRKVKIWDLRNFY-QLHTYRT--PHPASNLSLSQKG--LLALSYGDHVQIWKD 361 (545)
T ss_pred cCCCcc-----eEEECCCCcEEeecccccceeEeeecccc-ccceeec--CCCcccccccccc--ceeeecCCeeeeehh
Confidence 999999 99999999999999999999999999642 2221112 2345667787655 445566668999953
No 231
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.12 E-value=6.6e-05 Score=92.29 Aligned_cols=148 Identities=13% Similarity=0.053 Sum_probs=95.5
Q ss_pred EEEEEcCCCCEEEEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CCc--EEEEEcCCCc
Q 000425 10 VHLALTPLQPVVFFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DGL--IRAYNIHTYA 84 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-Dgt--IrLWDl~t~k 84 (1525)
..++|+|++..++++...+. .+.+++.. ++ ....+.. |.......+|+|+++.++.++. ++. |.++|+.+++
T Consensus 251 ~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~-~g-~~~~lt~-~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~ 327 (433)
T PRK04922 251 GAPSFSPDGRRLALTLSRDGNPEIYVMDLG-SR-QLTRLTN-HFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGS 327 (433)
T ss_pred cCceECCCCCEEEEEEeCCCCceEEEEECC-CC-CeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 46899999998887765444 46666643 22 2233333 5555677899999998877664 455 5666776665
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC---CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR---GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D---gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
.. .+. +..... ..+|||||++|+..+.+ ..|.+||..+.....+ .+.......+|+|||++|+..+
T Consensus 328 ~~-~lt~~g~~~~-----~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~L----t~~~~~~~p~~spdG~~i~~~s 397 (433)
T PRK04922 328 AE-RLTFQGNYNA-----RASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTL----TPGSLDESPSFAPNGSMVLYAT 397 (433)
T ss_pred eE-EeecCCCCcc-----CEEECCCCCEEEEEECCCCceeEEEEECCCCCeEEC----CCCCCCCCceECCCCCEEEEEE
Confidence 43 333 444455 78999999998876533 2699999876432222 2333445679999999887766
Q ss_pred CC-C--CeeEEEe
Q 000425 161 RD-G--SLQVWKT 170 (1525)
Q Consensus 161 ~D-g--tIkVWDl 170 (1525)
.+ + .|.+||+
T Consensus 398 ~~~g~~~L~~~~~ 410 (433)
T PRK04922 398 REGGRGVLAAVST 410 (433)
T ss_pred ecCCceEEEEEEC
Confidence 54 3 3555554
No 232
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=98.09 E-value=1.1e-05 Score=106.91 Aligned_cols=154 Identities=16% Similarity=0.196 Sum_probs=121.0
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+....+.-||..|+-++|+ .+.++++|... .+..+...+..-...|+.+.|+..|+.+..+..||.+.+|.+. .++.
T Consensus 2209 ~~v~r~~sHp~~~~Yltgs-~dgsv~~~~w~-~~~~v~~~rt~g~s~vtr~~f~~qGnk~~i~d~dg~l~l~q~~-pk~~ 2285 (2439)
T KOG1064|consen 2209 ENVRRMTSHPSDPYYLTGS-QDGSVRMFEWG-HGQQVVCFRTAGNSRVTRSRFNHQGNKFGIVDGDGDLSLWQAS-PKPY 2285 (2439)
T ss_pred CceeeecCCCCCceEEecC-CCceEEEEecc-CCCeEEEeeccCcchhhhhhhcccCCceeeeccCCceeecccC-Ccce
Confidence 4456677899999988777 58888776642 2333333343334778889999999999999999999999887 5666
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEe---CCCcEEEEeCCCC-CceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD---RRGTLLAWDVSIE-RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS---~DgtVrVWDl~t~-~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
...+ |+.... +++|-. ..+++++ +++.+.+||.... ...++ ...|.+.++++++-|...+|+||+.
T Consensus 2286 ~s~qchnk~~~-----Df~Fi~--s~~~tag~s~d~~n~~lwDtl~~~~~s~v--~~~H~~gaT~l~~~P~~qllisggr 2356 (2439)
T KOG1064|consen 2286 TSWQCHNKALS-----DFRFIG--SLLATAGRSSDNRNVCLWDTLLPPMNSLV--HTCHDGGATVLAYAPKHQLLISGGR 2356 (2439)
T ss_pred eccccCCcccc-----ceeeee--hhhhccccCCCCCcccchhcccCccccee--eeecCCCceEEEEcCcceEEEecCC
Confidence 6677 999888 888876 4556554 7899999997753 33444 4789999999999999999999999
Q ss_pred CCCeeEEEeee
Q 000425 162 DGSLQVWKTRV 172 (1525)
Q Consensus 162 DgtIkVWDlr~ 172 (1525)
+|.|++||+|.
T Consensus 2357 ~G~v~l~D~rq 2367 (2439)
T KOG1064|consen 2357 KGEVCLFDIRQ 2367 (2439)
T ss_pred cCcEEEeehHH
Confidence 99999999984
No 233
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=98.08 E-value=9.6e-05 Score=85.09 Aligned_cols=99 Identities=14% Similarity=0.240 Sum_probs=79.7
Q ss_pred EECCCcEEEEEcCCCcE--EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCc-EEEEeCCCCCceeeEeec-c-cCc
Q 000425 69 AYADGLIRAYNIHTYAV--HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGT-LLAWDVSIERPSMIGIIQ-V-GSQ 142 (1525)
Q Consensus 69 Gs~DgtIrLWDl~t~k~--l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dgt-VrVWDl~t~~~s~i~il~-g-H~~ 142 (1525)
|-.-|.|.|-|+...+. -..+. |+..|. |++.+-+|..++|+|..|| |||||..+. ..+..+. | ...
T Consensus 155 g~k~GqvQi~dL~~~~~~~p~~I~AH~s~Ia-----cv~Ln~~Gt~vATaStkGTLIRIFdt~~g--~~l~E~RRG~d~A 227 (346)
T KOG2111|consen 155 GFKTGQVQIVDLASTKPNAPSIINAHDSDIA-----CVALNLQGTLVATASTKGTLIRIFDTEDG--TLLQELRRGVDRA 227 (346)
T ss_pred CCccceEEEEEhhhcCcCCceEEEcccCcee-----EEEEcCCccEEEEeccCcEEEEEEEcCCC--cEeeeeecCCchh
Confidence 55679999999876655 35666 999999 9999999999999998877 799999964 2221121 1 234
Q ss_pred CeEEEEEcCCCCEEEEEECCCCeeEEEeeeec
Q 000425 143 PITSVAWLPMLRLLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 143 ~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~ 174 (1525)
.|.|++|||+..+|+.+|+.|||+|+.++...
T Consensus 228 ~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~~ 259 (346)
T KOG2111|consen 228 DIYCIAFSPNSSWLAVSSDKGTLHIFSLRDTE 259 (346)
T ss_pred eEEEEEeCCCccEEEEEcCCCeEEEEEeecCC
Confidence 68999999999999999999999999998544
No 234
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=98.03 E-value=0.0002 Score=86.92 Aligned_cols=140 Identities=14% Similarity=0.254 Sum_probs=95.7
Q ss_pred CeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC-----------CCcE
Q 000425 53 PIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR-----------RGTL 121 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~-----------DgtV 121 (1525)
.-+-+.|+|.|.+|++-..-| |-+|-=.+...++.|-|.+ |. .++|||..+||++=|. -..+
T Consensus 212 Tetyv~wSP~GTYL~t~Hk~G-I~lWGG~~f~r~~RF~Hp~-Vq-----~idfSP~EkYLVT~s~~p~~~~~~d~e~~~l 284 (698)
T KOG2314|consen 212 TETYVRWSPKGTYLVTFHKQG-IALWGGESFDRIQRFYHPG-VQ-----FIDFSPNEKYLVTYSPEPIIVEEDDNEGQQL 284 (698)
T ss_pred eeeeEEecCCceEEEEEeccc-eeeecCccHHHHHhccCCC-ce-----eeecCCccceEEEecCCccccCcccCCCceE
Confidence 345689999999999988776 8899666666666666654 56 7999999999998662 2579
Q ss_pred EEEeCCCC--CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCccccccccc
Q 000425 122 LAWDVSIE--RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRI 199 (1525)
Q Consensus 122 rVWDl~t~--~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~i 199 (1525)
+|||+.++ .+.+. ........-.-+.||.|++|++--.. .+|.|++...... +|
T Consensus 285 ~IWDI~tG~lkrsF~-~~~~~~~~WP~frWS~DdKy~Arm~~-~sisIyEtpsf~l------------------ld---- 340 (698)
T KOG2314|consen 285 IIWDIATGLLKRSFP-VIKSPYLKWPIFRWSHDDKYFARMTG-NSISIYETPSFML------------------LD---- 340 (698)
T ss_pred EEEEccccchhccee-ccCCCccccceEEeccCCceeEEecc-ceEEEEecCceee------------------ec----
Confidence 99999985 22222 11121111234789999999987766 4788887642110 01
Q ss_pred ccccCCcceecCCCeeEEEEecCCCeEEEE
Q 000425 200 LSQQGGEAVYPLPRVRALEVHPRLNLAVLL 229 (1525)
Q Consensus 200 Ls~~gG~~vy~l~~V~~v~~HP~~Nlial~ 229 (1525)
.+...+++|+.++|.|+.|++|-.
T Consensus 341 ------~Kslki~gIr~FswsP~~~llAYw 364 (698)
T KOG2314|consen 341 ------KKSLKISGIRDFSWSPTSNLLAYW 364 (698)
T ss_pred ------ccccCCccccCcccCCCcceEEEE
Confidence 011223578999999999999843
No 235
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=97.99 E-value=3.4e-05 Score=92.40 Aligned_cols=121 Identities=13% Similarity=0.083 Sum_probs=99.7
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEEeC--CCCEEEEEeCCCc
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHP--TLEWLFVGDRRGT 120 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaFSP--dg~~L~SgS~Dgt 120 (1525)
++.. |.+-|..|.|+..|..+++|+.|..|.+||..+.++...|. |.+.|. .-.|-| +...+++++.||.
T Consensus 137 kL~~-H~GcVntV~FN~~Gd~l~SgSDD~~vv~WdW~~~~~~l~f~SGH~~Nvf-----QaKFiP~s~d~ti~~~s~dgq 210 (559)
T KOG1334|consen 137 KLNK-HKGCVNTVHFNQRGDVLASGSDDLQVVVWDWVSGSPKLSFESGHCNNVF-----QAKFIPFSGDRTIVTSSRDGQ 210 (559)
T ss_pred cccC-CCCccceeeecccCceeeccCccceEEeehhhccCcccccccccccchh-----hhhccCCCCCcCceeccccCc
Confidence 4444 99999999999999999999999999999999999988888 999888 778887 3467999999999
Q ss_pred EEEEeCCCC-CceeeEeecccCcCeEEEEEcCCCC-EEEEEECCCCeeEEEee
Q 000425 121 LLAWDVSIE-RPSMIGIIQVGSQPITSVAWLPMLR-LLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 121 VrVWDl~t~-~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS~DgtIkVWDlr 171 (1525)
|++=.+... ...-...+..|...|+-++.-|+.+ -|.||+.|+.++-.|+|
T Consensus 211 vr~s~i~~t~~~e~t~rl~~h~g~vhklav~p~sp~~f~S~geD~~v~~~Dlr 263 (559)
T KOG1334|consen 211 VRVSEILETGYVENTKRLAPHEGPVHKLAVEPDSPKPFLSCGEDAVVFHIDLR 263 (559)
T ss_pred eeeeeeccccceecceecccccCccceeeecCCCCCcccccccccceeeeeec
Confidence 998655421 1110122447899999999999866 48899999999999998
No 236
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=97.97 E-value=0.00021 Score=86.45 Aligned_cols=147 Identities=13% Similarity=0.108 Sum_probs=96.3
Q ss_pred EEEEEEcCCCCEEEEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CC--cEEEEEcCCC
Q 000425 9 EVHLALTPLQPVVFFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DG--LIRAYNIHTY 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-Dg--tIrLWDl~t~ 83 (1525)
....+|+|++..++++...+. .+.+++. .+ .....+.. |.......+|+|+++.++..+. ++ .|.+||+.++
T Consensus 236 ~~~~~~spDg~~l~~~~~~~~~~~i~~~d~-~~-~~~~~l~~-~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~ 312 (417)
T TIGR02800 236 NGAPAFSPDGSKLAVSLSKDGNPDIYVMDL-DG-KQLTRLTN-GPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGG 312 (417)
T ss_pred ccceEECCCCCEEEEEECCCCCccEEEEEC-CC-CCEEECCC-CCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCC
Confidence 346889999998887665443 3555543 22 22333333 5555567899999998776554 33 5777788776
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC---cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG---TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg---tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
+.. .+. +..... ...|+|+|++++.++.++ .|.+||+.+.....+ ..+ ......+|+|++++|+..
T Consensus 313 ~~~-~l~~~~~~~~-----~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l---~~~-~~~~~p~~spdg~~l~~~ 382 (417)
T TIGR02800 313 EVR-RLTFRGGYNA-----SPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGERVL---TDT-GLDESPSFAPNGRMILYA 382 (417)
T ss_pred CEE-EeecCCCCcc-----CeEECCCCCEEEEEEccCCceEEEEEeCCCCCeEEc---cCC-CCCCCceECCCCCEEEEE
Confidence 643 344 556666 889999999999888765 788889875322222 122 234557899999988887
Q ss_pred ECCCCeeEE
Q 000425 160 CRDGSLQVW 168 (1525)
Q Consensus 160 S~DgtIkVW 168 (1525)
+.++....|
T Consensus 383 ~~~~~~~~l 391 (417)
T TIGR02800 383 TTRGGRGVL 391 (417)
T ss_pred EeCCCcEEE
Confidence 776644433
No 237
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.96 E-value=0.00013 Score=89.36 Aligned_cols=115 Identities=10% Similarity=0.165 Sum_probs=98.5
Q ss_pred cCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 48 TDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 48 ~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
..|.++|.++.++.+-..|.+++.|+.+..|+.++.+.+...+ ....+. +++++|||..++++| ++|++||+
T Consensus 99 ~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~~~~~~-----sl~is~D~~~l~~as--~~ik~~~~ 171 (541)
T KOG4547|consen 99 DKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQKPLVS-----SLCISPDGKILLTAS--RQIKVLDI 171 (541)
T ss_pred CCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccCCCccc-----eEEEcCCCCEEEecc--ceEEEEEc
Confidence 4699999999999999999999999999999999999888888 777788 999999999998887 58999999
Q ss_pred CCCCceeeEeecccCcCeEEEEEcCC-----CCEEEEE-ECCCCeeEEEee
Q 000425 127 SIERPSMIGIIQVGSQPITSVAWLPM-----LRLLVTL-CRDGSLQVWKTR 171 (1525)
Q Consensus 127 ~t~~~s~i~il~gH~~~VtsVafsPd-----g~~LaSg-S~DgtIkVWDlr 171 (1525)
++ ..++..+.||.+.|++++|--+ |.+++++ ..+.-+.+|-++
T Consensus 172 ~~--kevv~~ftgh~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~ 220 (541)
T KOG4547|consen 172 ET--KEVVITFTGHGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVE 220 (541)
T ss_pred cC--ceEEEEecCCCcceEEEEEEEeccccccceeeeccccccceeEEEEE
Confidence 94 5666678899999999999877 6666654 445578888765
No 238
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.93 E-value=0.00033 Score=91.21 Aligned_cols=156 Identities=17% Similarity=0.178 Sum_probs=107.1
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCcc------ceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRA------PTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~------~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
.+|+++...-+|++..+|++|++|+..+.... ...... -..++.++...+.+..||+|+.||.|++.++...+
T Consensus 1053 k~a~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~-~~sr~~~vt~~~~~~~~Av~t~DG~v~~~~id~~~ 1131 (1431)
T KOG1240|consen 1053 KLAVSSEHTSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSP-EGSRVEKVTMCGNGDQFAVSTKDGSVRVLRIDHYN 1131 (1431)
T ss_pred ceeecCCCCceEEEecCCceEEEeeehhhhcCcceeeeeEEEec-cCCceEEEEeccCCCeEEEEcCCCeEEEEEccccc
Confidence 67888877556677778999999998553211 111111 35789999999999999999999999999987531
Q ss_pred -------EEEEEe-cCC--ceeecCceeEEEeC-CCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC
Q 000425 85 -------VHYTLQ-LDN--TIKLLGAGAFAFHP-TLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM 152 (1525)
Q Consensus 85 -------~l~tL~-H~~--~V~~~ga~sVaFSP-dg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd 152 (1525)
+.+... +.. .|. .-+|.. ++. .++.+..-+.|-.||.++...-..-..+-..+.|++++.+|.
T Consensus 1132 ~~~~~~~~~ri~n~~~~g~vv~-----m~a~~~~~~S~~lvy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~ 1206 (1431)
T KOG1240|consen 1132 VSKRVATQVRIPNLKKDGVVVS-----MHAFTAIVQSHVLVYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPW 1206 (1431)
T ss_pred cccceeeeeecccccCCCceEE-----eecccccccceeEEEEEeccceEEecchhhhhHHhhhcCccccceeEEEecCC
Confidence 122222 222 222 233332 233 566777888899999996311111001223468999999999
Q ss_pred CCEEEEEECCCCeeEEEeee
Q 000425 153 LRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 153 g~~LaSgS~DgtIkVWDlr~ 172 (1525)
+.++++|+..|.+-+||+|.
T Consensus 1207 ~~WlviGts~G~l~lWDLRF 1226 (1431)
T KOG1240|consen 1207 CNWLVIGTSRGQLVLWDLRF 1226 (1431)
T ss_pred ceEEEEecCCceEEEEEeec
Confidence 99999999999999999993
No 239
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=97.93 E-value=7.5e-05 Score=90.01 Aligned_cols=136 Identities=13% Similarity=0.124 Sum_probs=101.8
Q ss_pred CcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
....++++.|+..-|+++..+...|+ .......+.+.+. |.+-|.+++|++..+++++|+.|...|+||-. +..+
T Consensus 146 ~~v~c~~W~p~S~~vl~c~g~h~~IK---pL~~n~k~i~WkA-HDGiiL~~~W~~~s~lI~sgGED~kfKvWD~~-G~~L 220 (737)
T KOG1524|consen 146 ESIRCARWAPNSNSIVFCQGGHISIK---PLAANSKIIRWRA-HDGLVLSLSWSTQSNIIASGGEDFRFKIWDAQ-GANL 220 (737)
T ss_pred ceeEEEEECCCCCceEEecCCeEEEe---ecccccceeEEec-cCcEEEEeecCccccceeecCCceeEEeeccc-Cccc
Confidence 45678999999888887765544332 2233444555565 99999999999999999999999999999975 5666
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
++-. |+.+++ +|+|.|| +.++.+|. ++.|+ + ....+.|..++||+||..++.|+..|.+
T Consensus 221 f~S~~~ey~IT-----Sva~npd-~~~~v~S~-nt~R~---~----------~p~~GSifnlsWS~DGTQ~a~gt~~G~v 280 (737)
T KOG1524|consen 221 FTSAAEEYAIT-----SVAFNPE-KDYLLWSY-NTARF---S----------SPRVGSIFNLSWSADGTQATCGTSTGQL 280 (737)
T ss_pred ccCChhcccee-----eeeeccc-cceeeeee-eeeee---c----------CCCccceEEEEEcCCCceeeccccCceE
Confidence 6666 999999 9999999 54444432 23331 1 1235678999999999999999888866
Q ss_pred eE
Q 000425 166 QV 167 (1525)
Q Consensus 166 kV 167 (1525)
.+
T Consensus 281 ~~ 282 (737)
T KOG1524|consen 281 IV 282 (737)
T ss_pred EE
Confidence 43
No 240
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=97.92 E-value=0.0068 Score=75.58 Aligned_cols=154 Identities=14% Similarity=0.161 Sum_probs=104.6
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+...+.++..+.- +++++.+.-|.||....+.|....+....+..|.+++|+.+|..++....||.|-+=.+.......
T Consensus 73 sV~vvTWNe~~QK-LTtSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGavIVGsvdGNRIwg 151 (1189)
T KOG2041|consen 73 SVMVVTWNENNQK-LTTSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAVIVGSVDGNRIWG 151 (1189)
T ss_pred eEEEEEecccccc-ccccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCEEEEeeccceecc
Confidence 4455556655443 477777888889998888888777777677889999999999999999999998887665443221
Q ss_pred -EEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-----Ccee----eEeecccCcCeEEEEEc-------
Q 000425 88 -TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-----RPSM----IGIIQVGSQPITSVAWL------- 150 (1525)
Q Consensus 88 -tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-----~~s~----i~il~gH~~~VtsVafs------- 150 (1525)
.++ ..-.. .+.|++|.+.++-+-..|.+.++|.+.. ...+ .+........|..+.|.
T Consensus 152 KeLk-g~~l~-----hv~ws~D~~~~Lf~~ange~hlydnqgnF~~Kl~~~c~Vn~tg~~s~~~~kia~i~w~~g~~~~v 225 (1189)
T KOG2041|consen 152 KELK-GQLLA-----HVLWSEDLEQALFKKANGETHLYDNQGNFERKLEKDCEVNGTGIFSNFPTKIAEIEWNTGPYQPV 225 (1189)
T ss_pred hhcc-hhecc-----ceeecccHHHHHhhhcCCcEEEecccccHHHhhhhceEEeeeeeecCCCccccceeeccCccccC
Confidence 122 11233 6789999988888888899999987742 1111 12222222335555553
Q ss_pred -CCCCEEEEEECCCCeeEE
Q 000425 151 -PMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 151 -Pdg~~LaSgS~DgtIkVW 168 (1525)
|+.+.|+.|-..|.++|-
T Consensus 226 ~pdrP~lavcy~nGr~QiM 244 (1189)
T KOG2041|consen 226 PPDRPRLAVCYANGRMQIM 244 (1189)
T ss_pred CCCCCEEEEEEcCceehhh
Confidence 467788888777766654
No 241
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.92 E-value=0.00014 Score=88.54 Aligned_cols=131 Identities=18% Similarity=0.317 Sum_probs=99.9
Q ss_pred eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe---c----C-----Cceee
Q 000425 30 SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ---L----D-----NTIKL 97 (1525)
Q Consensus 30 sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~---H----~-----~~V~~ 97 (1525)
.|.-++ .+.|+-...+.. ....+++|..++-..+|++|..||.|-.||.++.....++. + . ..|+
T Consensus 156 evYRlN-LEqGrfL~P~~~-~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~~~~~svT- 232 (703)
T KOG2321|consen 156 EVYRLN-LEQGRFLNPFET-DSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGGDAAPSVT- 232 (703)
T ss_pred ceEEEE-cccccccccccc-ccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccccCCCccccccCcce-
Confidence 343444 355655555555 55889999999999999999999999999999887777776 1 1 1266
Q ss_pred cCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeeccc--CcCeEEEEEcCC--CCEEEEEECCCCeeEEEee
Q 000425 98 LGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVG--SQPITSVAWLPM--LRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 98 ~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH--~~~VtsVafsPd--g~~LaSgS~DgtIkVWDlr 171 (1525)
++.|+.+|-.+++|..+|.|.|||+++..+.++ ..| ..+|..+.|.+. ++.++|+ +...++|||-.
T Consensus 233 ----al~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~---kdh~~e~pi~~l~~~~~~~q~~v~S~-Dk~~~kiWd~~ 302 (703)
T KOG2321|consen 233 ----ALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLV---KDHGYELPIKKLDWQDTDQQNKVVSM-DKRILKIWDEC 302 (703)
T ss_pred ----EEEecCCceeEEeeccCCcEEEEEcccCCceee---cccCCccceeeecccccCCCceEEec-chHHhhhcccc
Confidence 999999999999999999999999997544443 234 457888999875 5667665 45689999965
No 242
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.89 E-value=2.1e-05 Score=97.20 Aligned_cols=155 Identities=11% Similarity=0.279 Sum_probs=116.9
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecC-Cc-cceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC-c
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-GR-APTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY-A 84 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-gk-~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~-k 84 (1525)
.+-...||..|+=.++....+++ +|+.... .. ..+.+. +|...|+.+.|+|..+ .+++++.|..|..||+++. .
T Consensus 72 d~qws~h~a~~~wiVsts~qkai-iwnlA~ss~~aIef~lh-ghsraitd~n~~~q~pdVlatcsvdt~vh~wd~rSp~~ 149 (1081)
T KOG0309|consen 72 DVQWSPHPAKPYWIVSTSNQKAI-IWNLAKSSSNAIEFVLH-GHSRAITDINFNPQHPDVLATCSVDTYVHAWDMRSPHR 149 (1081)
T ss_pred ceecccCCCCceeEEecCcchhh-hhhhhcCCccceEEEEe-cCccceeccccCCCCCcceeeccccccceeeeccCCCc
Confidence 45566788888777777666655 5555332 23 334444 4999999999999887 6899999999999999875 4
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEEECC
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTLCRD 162 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSgS~D 162 (1525)
+++.+. ...... .|+|+-..+.+++.+..+.|.|||.+.... ....+.+|...|+.+.|..- ...+.+++.|
T Consensus 150 p~ys~~~w~s~as-----qVkwnyk~p~vlasshg~~i~vwd~r~gs~-pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d 223 (1081)
T KOG0309|consen 150 PFYSTSSWRSAAS-----QVKWNYKDPNVLASSHGNDIFVWDLRKGST-PLCSLKGHVSSVNSIDFNRFKYSEIMSSSND 223 (1081)
T ss_pred ceeeeecccccCc-----eeeecccCcchhhhccCCceEEEeccCCCc-ceEEecccceeeehHHHhhhhhhhhcccCCC
Confidence 666666 555555 899998777788888889999999996422 22235679999999998763 3468899999
Q ss_pred CCeeEEEee
Q 000425 163 GSLQVWKTR 171 (1525)
Q Consensus 163 gtIkVWDlr 171 (1525)
++|+.||..
T Consensus 224 ~tvkfw~y~ 232 (1081)
T KOG0309|consen 224 GTVKFWDYS 232 (1081)
T ss_pred Cceeeeccc
Confidence 999999987
No 243
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.87 E-value=0.00016 Score=83.35 Aligned_cols=152 Identities=18% Similarity=0.176 Sum_probs=104.0
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCE-EEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPV-LYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~-LaSGs~DgtIrLWDl~t~k~l 86 (1525)
..+||-+.-+.-.++++...+..|.+|+.....+. -++.. -..++.++.|+|+|+. +.+..-|-.|.+|++.+.+..
T Consensus 50 ki~yieW~ads~~ilC~~yk~~~vqvwsl~Qpew~-ckIde-g~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~~ 127 (447)
T KOG4497|consen 50 KIVYIEWKADSCHILCVAYKDPKVQVWSLVQPEWY-CKIDE-GQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKGY 127 (447)
T ss_pred HhhheeeeccceeeeeeeeccceEEEEEeecceeE-EEecc-CCCcceeeeECCCcceEeeeecceeEEEEEEeccceeE
Confidence 34677777777888888888888888886443322 23333 4678999999999965 667778999999999887654
Q ss_pred EEEecCCceeecCceeEEEeCCCCEEEEEeC------------------------------------CCcEEEEeCCCCC
Q 000425 87 YTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR------------------------------------RGTLLAWDVSIER 130 (1525)
Q Consensus 87 ~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~------------------------------------DgtVrVWDl~t~~ 130 (1525)
..-.....+. .++|+|||++.+..++ +..+-|||.--..
T Consensus 128 ~~~~pK~~~k-----g~~f~~dg~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPdg~~laVwd~~Ley 202 (447)
T KOG4497|consen 128 LLPHPKTNVK-----GYAFHPDGQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPDGNWLAVWDNVLEY 202 (447)
T ss_pred EecccccCce-----eEEECCCCceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCCCcEEEEecchhhh
Confidence 3332334456 7899999998776653 2334455433111
Q ss_pred ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEE
Q 000425 131 PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 131 ~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWD 169 (1525)
+-.. + ...-.+..++|+|.+++|+.||.|+.+||-+
T Consensus 203 kv~a--Y-e~~lG~k~v~wsP~~qflavGsyD~~lrvln 238 (447)
T KOG4497|consen 203 KVYA--Y-ERGLGLKFVEWSPCNQFLAVGSYDQMLRVLN 238 (447)
T ss_pred eeee--e-eeccceeEEEeccccceEEeeccchhhhhhc
Confidence 1011 1 1134678899999999999999999888754
No 244
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.87 E-value=0.00049 Score=84.33 Aligned_cols=153 Identities=14% Similarity=0.084 Sum_probs=95.2
Q ss_pred CcEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCC--cEEEEEcC
Q 000425 7 DTEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADG--LIRAYNIH 81 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~Dg--tIrLWDl~ 81 (1525)
+...+.+|+|++..+++....+ ..+.+++. +++.. ..+.. ..+.+...+|+|+|+.++ +.+.++ .|.+||+.
T Consensus 199 ~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l-~~g~~-~~l~~-~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~ 275 (430)
T PRK00178 199 EPILSPRWSPDGKRIAYVSFEQKRPRIFVQNL-DTGRR-EQITN-FEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLA 275 (430)
T ss_pred CceeeeeECCCCCEEEEEEcCCCCCEEEEEEC-CCCCE-EEccC-CCCCcCCeEECCCCCEEEEEEccCCCceEEEEECC
Confidence 3457789999999887766443 34555554 33321 22222 344556789999999776 444555 57888998
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCC--cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRG--TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dg--tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
+++... +. +..... ...|+|||+.|+..+ .++ .|.+||+.+.....+ . ..........|+|++++|+
T Consensus 276 ~~~~~~-lt~~~~~~~-----~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~l--t-~~~~~~~~~~~Spdg~~i~ 346 (430)
T PRK00178 276 SRQLSR-VTNHPAIDT-----EPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERV--T-FVGNYNARPRLSADGKTLV 346 (430)
T ss_pred CCCeEE-cccCCCCcC-----CeEECCCCCEEEEEECCCCCceEEEEECCCCCEEEe--e-cCCCCccceEECCCCCEEE
Confidence 876543 44 544455 789999999887665 344 466667764322211 1 1122334578999999988
Q ss_pred EEECC-C--CeeEEEee
Q 000425 158 TLCRD-G--SLQVWKTR 171 (1525)
Q Consensus 158 SgS~D-g--tIkVWDlr 171 (1525)
..+.+ + .|.+||+.
T Consensus 347 ~~~~~~~~~~l~~~dl~ 363 (430)
T PRK00178 347 MVHRQDGNFHVAAQDLQ 363 (430)
T ss_pred EEEccCCceEEEEEECC
Confidence 77654 3 35556653
No 245
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.85 E-value=0.00032 Score=85.63 Aligned_cols=111 Identities=15% Similarity=0.347 Sum_probs=84.3
Q ss_pred CCCCeEEEEEcCCCCEEEE--EECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEe---CCCcEEEE
Q 000425 50 LKKPIVNLACHPRLPVLYV--AYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD---RRGTLLAW 124 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaS--Gs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS---~DgtVrVW 124 (1525)
-.+||.++.|+|+++-|++ |-.--++-|+|++ +++++.| ...+.+ ++-|+|.|++|+.+| --|.|-||
T Consensus 269 k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr-~~~v~df-~egpRN-----~~~fnp~g~ii~lAGFGNL~G~mEvw 341 (566)
T KOG2315|consen 269 KEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLR-GKPVFDF-PEGPRN-----TAFFNPHGNIILLAGFGNLPGDMEVW 341 (566)
T ss_pred CCCCceEEEECCCCCEEEEEEecccceEEEEcCC-CCEeEeC-CCCCcc-----ceEECCCCCEEEEeecCCCCCceEEE
Confidence 4799999999999987665 5556789999886 3444443 556677 889999999999887 45889999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC------CCCeeEEEee
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR------DGSLQVWKTR 171 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~------DgtIkVWDlr 171 (1525)
|+.+ ...++...+ ..-+-+.|+|||.+|+|++. |+.++||+..
T Consensus 342 Dv~n--~K~i~~~~a--~~tt~~eW~PdGe~flTATTaPRlrvdNg~Kiwhyt 390 (566)
T KOG2315|consen 342 DVPN--RKLIAKFKA--ANTTVFEWSPDGEYFLTATTAPRLRVDNGIKIWHYT 390 (566)
T ss_pred eccc--hhhcccccc--CCceEEEEcCCCcEEEEEeccccEEecCCeEEEEec
Confidence 9995 333322222 23355899999999999863 7899999986
No 246
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.84 E-value=0.0062 Score=71.81 Aligned_cols=146 Identities=15% Similarity=0.184 Sum_probs=96.5
Q ss_pred EEEEEEec---cceEEEEEeecC-CccceEEccCCCCCeEEEEEcCCCCEEEEEECC---CcEEEEEcCC--CcEEEEEe
Q 000425 20 VVFFGFHR---RMSVTVVGTVEG-GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD---GLIRAYNIHT--YAVHYTLQ 90 (1525)
Q Consensus 20 ~Vf~s~~~---d~sI~vv~tieg-gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D---gtIrLWDl~t--~k~l~tL~ 90 (1525)
.+++|... +..|.+|...+. +.........+....+-|+|+|++++|.++-.+ |.|-.|.+.. ++....=+
T Consensus 4 ~~YiGtyT~~~s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~ 83 (346)
T COG2706 4 TVYIGTYTKRESQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNR 83 (346)
T ss_pred EEEEeeecccCCCceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeec
Confidence 45666666 777888876522 221111122366778889999999988877654 7787777764 43322111
Q ss_pred ---cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCC--CceeeEeecccCcC----------eEEEEEcCCCC
Q 000425 91 ---LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIE--RPSMIGIIQVGSQP----------ITSVAWLPMLR 154 (1525)
Q Consensus 91 ---H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~--~~s~i~il~gH~~~----------VtsVafsPdg~ 154 (1525)
..++-+ .|+++++|++++++. ..|.|.|+-++.. ....++ ...|... +.+..|.|+++
T Consensus 84 ~~~~g~~p~-----yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~-~~~h~g~~p~~rQ~~~h~H~a~~tP~~~ 157 (346)
T COG2706 84 QTLPGSPPC-----YVSVDEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQ-VVKHTGSGPHERQESPHVHSANFTPDGR 157 (346)
T ss_pred cccCCCCCe-----EEEECCCCCEEEEEEccCceEEEEEcccCCcccccee-eeecCCCCCCccccCCccceeeeCCCCC
Confidence 223334 899999999999999 7899999988752 111211 1134444 78899999999
Q ss_pred EEEEEECCC-CeeEEEee
Q 000425 155 LLVTLCRDG-SLQVWKTR 171 (1525)
Q Consensus 155 ~LaSgS~Dg-tIkVWDlr 171 (1525)
+|+++...- .|.+|++.
T Consensus 158 ~l~v~DLG~Dri~~y~~~ 175 (346)
T COG2706 158 YLVVPDLGTDRIFLYDLD 175 (346)
T ss_pred EEEEeecCCceEEEEEcc
Confidence 999886422 57788875
No 247
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.84 E-value=3.8e-05 Score=62.08 Aligned_cols=36 Identities=31% Similarity=0.604 Sum_probs=33.5
Q ss_pred eEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEE
Q 000425 134 IGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 134 i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWD 169 (1525)
...+.+|...|++++|+|++.+|+|||.|++|++||
T Consensus 4 ~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 4 VRTFRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EEEEcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 345779999999999999999999999999999998
No 248
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.82 E-value=0.0016 Score=78.06 Aligned_cols=161 Identities=15% Similarity=0.170 Sum_probs=103.4
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccc---eEEccCCCCCeEEEEEcCCCCEEEE-EECCCcEEEEEcC--CCc
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAP---TKIKTDLKKPIVNLACHPRLPVLYV-AYADGLIRAYNIH--TYA 84 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~---tklk~~H~~~V~sLaf~Pdg~~LaS-Gs~DgtIrLWDl~--t~k 84 (1525)
++.++|++.++|+.--....|.++......... ..+.......-+.++|||+++++++ .-.+++|.++++. ++.
T Consensus 148 ~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~ 227 (345)
T PF10282_consen 148 QVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGS 227 (345)
T ss_dssp EEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTE
T ss_pred eEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCc
Confidence 589999999999877666677777754433122 1222223455688999999997655 4567889999987 442
Q ss_pred --EEEEEe-cCCceee-cCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCC--CceeeEeecccCcCeEEEEEcCCCCEEE
Q 000425 85 --VHYTLQ-LDNTIKL-LGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIE--RPSMIGIIQVGSQPITSVAWLPMLRLLV 157 (1525)
Q Consensus 85 --~l~tL~-H~~~V~~-~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~--~~s~i~il~gH~~~VtsVafsPdg~~La 157 (1525)
.+.++. ......- .....++++|||++|+++. .+.+|.+|++... .-+.++........-+.++++|+|++|+
T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~ 307 (345)
T PF10282_consen 228 LTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLY 307 (345)
T ss_dssp EEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEE
T ss_pred eeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEE
Confidence 333333 2111110 0223899999999988877 7789999999532 2333333333345578999999999988
Q ss_pred EEE-CCCCeeEEEee
Q 000425 158 TLC-RDGSLQVWKTR 171 (1525)
Q Consensus 158 SgS-~DgtIkVWDlr 171 (1525)
.+. .+++|.+|+++
T Consensus 308 Va~~~s~~v~vf~~d 322 (345)
T PF10282_consen 308 VANQDSNTVSVFDID 322 (345)
T ss_dssp EEETTTTEEEEEEEE
T ss_pred EEecCCCeEEEEEEe
Confidence 776 56689999875
No 249
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.81 E-value=0.0008 Score=81.56 Aligned_cols=141 Identities=21% Similarity=0.256 Sum_probs=93.9
Q ss_pred EEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecC
Q 000425 20 VVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLG 99 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~g 99 (1525)
++++....+.++.+++. +..+....+.. ....-..+.|+||++++++++.|+.|.++|+.+.+.+.++.......
T Consensus 7 l~~V~~~~~~~v~viD~-~t~~~~~~i~~-~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~--- 81 (369)
T PF02239_consen 7 LFYVVERGSGSVAVIDG-ATNKVVARIPT-GGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPR--- 81 (369)
T ss_dssp EEEEEEGGGTEEEEEET-TT-SEEEEEE--STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEE---
T ss_pred EEEEEecCCCEEEEEEC-CCCeEEEEEcC-CCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcc---
Confidence 45566667788888874 33444555554 22333457899999999999999999999999999999999555567
Q ss_pred ceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeec-------ccCcCeEEEEEcCCCCEEEEEECC-CCeeEEE
Q 000425 100 AGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQ-------VGSQPITSVAWLPMLRLLVTLCRD-GSLQVWK 169 (1525)
Q Consensus 100 a~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~-------gH~~~VtsVafsPdg~~LaSgS~D-gtIkVWD 169 (1525)
.+++|+||++++++. .++++.|+|.++. +.+.... .....+..+..+|..+.++..-.| +.|.+-|
T Consensus 82 --~i~~s~DG~~~~v~n~~~~~v~v~D~~tl--e~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVd 156 (369)
T PF02239_consen 82 --GIAVSPDGKYVYVANYEPGTVSVIDAETL--EPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVD 156 (369)
T ss_dssp --EEEE--TTTEEEEEEEETTEEEEEETTT----EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEE
T ss_pred --eEEEcCCCCEEEEEecCCCceeEeccccc--cceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEE
Confidence 899999999999887 8999999999863 2221111 124567888889998877766665 4443334
No 250
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=97.81 E-value=6e-05 Score=90.30 Aligned_cols=202 Identities=16% Similarity=0.189 Sum_probs=136.0
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEE--ccCCCC---CeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC-
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKI--KTDLKK---PIVNLACHPRLP-VLYVAYADGLIRAYNIHTY- 83 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tkl--k~~H~~---~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~- 83 (1525)
-||+-|..|..|.++..+..+.-++.... .+..++ +..++. ....|+.+|..+ .|++|+.|..+|+||.+..
T Consensus 237 klav~p~sp~~f~S~geD~~v~~~Dlr~~-~pa~~~~cr~~~~~~~v~L~~Ia~~P~nt~~faVgG~dqf~RvYD~R~~~ 315 (559)
T KOG1334|consen 237 KLAVEPDSPKPFLSCGEDAVVFHIDLRQD-VPAEKFVCREADEKERVGLYTIAVDPRNTNEFAVGGSDQFARVYDQRRID 315 (559)
T ss_pred eeeecCCCCCcccccccccceeeeeeccC-CccceeeeeccCCccceeeeeEecCCCCccccccCChhhhhhhhcccchh
Confidence 37888999999999888877765554322 222222 121333 457899999987 7999999999999997643
Q ss_pred -----cEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC-----------ceeeEeecccCc--C
Q 000425 84 -----AVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER-----------PSMIGIIQVGSQ--P 143 (1525)
Q Consensus 84 -----k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-----------~s~i~il~gH~~--~ 143 (1525)
.++.+|. |...-..++..+++|+.++.-++++-.|-.|.++.-.-.. ......+.||.. .
T Consensus 316 ~e~~n~~~~~f~p~hl~~d~~v~ITgl~Ysh~~sElLaSYnDe~IYLF~~~~~~G~~p~~~s~~~~~~k~vYKGHrN~~T 395 (559)
T KOG1334|consen 316 KEENNGVLDKFCPHHLVEDDPVNITGLVYSHDGSELLASYNDEDIYLFNKSMGDGSEPDPSSPREQYVKRVYKGHRNSRT 395 (559)
T ss_pred hccccchhhhcCCccccccCcccceeEEecCCccceeeeecccceEEeccccccCCCCCCCcchhhccchhhcccccccc
Confidence 2234454 3221111123399999988877777788888888433110 111223668854 3
Q ss_pred eEEEEE-cCCCCEEEEEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCeeEEEEecC
Q 000425 144 ITSVAW-LPMLRLLVTLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRVRALEVHPR 222 (1525)
Q Consensus 144 VtsVaf-sPdg~~LaSgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V~~v~~HP~ 222 (1525)
|.++-| -|...|++|||+-|.|.|||-.... ++..+.|+.. -|+|++=||-
T Consensus 396 VKgVNFfGPrsEyVvSGSDCGhIFiW~K~t~e------------------------ii~~MegDr~----VVNCLEpHP~ 447 (559)
T KOG1334|consen 396 VKGVNFFGPRSEYVVSGSDCGHIFIWDKKTGE------------------------IIRFMEGDRH----VVNCLEPHPH 447 (559)
T ss_pred cceeeeccCccceEEecCccceEEEEecchhH------------------------HHHHhhcccc----eEeccCCCCC
Confidence 666665 4788999999999999999854211 2223333332 2799999999
Q ss_pred CCeEEEEecccccCCcccccccccc
Q 000425 223 LNLAVLLFANFTGGDNIKNRAAYTR 247 (1525)
Q Consensus 223 ~Nlial~F~~~s~~Dd~~~k~~~~~ 247 (1525)
++++| ++|=|.-+|+|...
T Consensus 448 ~PvLA------sSGid~DVKIWTP~ 466 (559)
T KOG1334|consen 448 LPVLA------SSGIDHDVKIWTPL 466 (559)
T ss_pred Cchhh------ccCCccceeeecCC
Confidence 99999 88888889999863
No 251
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=97.81 E-value=0.00027 Score=89.86 Aligned_cols=154 Identities=15% Similarity=0.101 Sum_probs=117.6
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeec--C-CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVE--G-GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tie--g-gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
+.++||++.++..+- .++.|.+|.... + .-..+.++- |...|.+++|+++|.+|++|+..+.+-+|.+.+++..+
T Consensus 210 ~~~~spn~~~~Aa~d-~dGrI~vw~d~~~~~~~~t~t~lHW-H~~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~kqf 287 (792)
T KOG1963|consen 210 CVALSPNERYLAAGD-SDGRILVWRDFGSSDDSETCTLLHW-HHDEVNSLSFSSDGAYLLSGGREGVLVLWQLETGKKQF 287 (792)
T ss_pred eEEeccccceEEEec-cCCcEEEEeccccccccccceEEEe-cccccceeEEecCCceEeecccceEEEEEeecCCCccc
Confidence 578999999886554 577798887655 2 224556666 88999999999999999999999999999999988322
Q ss_pred EEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-Cc-eeeEe-------ecccCcCeEEEEEcCCCCEEEE
Q 000425 88 TLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-RP-SMIGI-------IQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 88 tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-~~-s~i~i-------l~gH~~~VtsVafsPdg~~LaS 158 (1525)
--+-..+|. .+.+|||+.+......|+.|.+-...+. .. .+.++ .....+-.+.++++|.-+.++-
T Consensus 288 LPRLgs~I~-----~i~vS~ds~~~sl~~~DNqI~li~~~dl~~k~tIsgi~~~~~~~k~~~~~l~t~~~idpr~~~~vl 362 (792)
T KOG1963|consen 288 LPRLGSPIL-----HIVVSPDSDLYSLVLEDNQIHLIKASDLEIKSTISGIKPPTPSTKTRPQSLTTGVSIDPRTNSLVL 362 (792)
T ss_pred ccccCCeeE-----EEEEcCCCCeEEEEecCceEEEEeccchhhhhhccCccCCCccccccccccceeEEEcCCCCceee
Confidence 222778899 9999999999888889999998876542 11 11111 1122344677899997778888
Q ss_pred EECCCCeeEEEee
Q 000425 159 LCRDGSLQVWKTR 171 (1525)
Q Consensus 159 gS~DgtIkVWDlr 171 (1525)
++..+.|++||+-
T Consensus 363 n~~~g~vQ~ydl~ 375 (792)
T KOG1963|consen 363 NGHPGHVQFYDLY 375 (792)
T ss_pred cCCCceEEEEecc
Confidence 8999999999975
No 252
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=97.80 E-value=0.00037 Score=85.00 Aligned_cols=154 Identities=12% Similarity=0.115 Sum_probs=106.2
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCc-----------cceEEccC---------------C-----------CCC
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGR-----------APTKIKTD---------------L-----------KKP 53 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk-----------~~tklk~~---------------H-----------~~~ 53 (1525)
.|.++|++.+++.+..-.-.|+|++..+-.. ....+..+ | ...
T Consensus 56 ~ik~s~DGqY~lAtG~YKP~ikvydlanLSLKFERhlDae~V~feiLsDD~SK~v~L~~DR~IefHak~G~hy~~RIP~~ 135 (703)
T KOG2321|consen 56 RIKVSPDGQYLLATGTYKPQIKVYDLANLSLKFERHLDAEVVDFEILSDDYSKSVFLQNDRTIEFHAKYGRHYRTRIPKF 135 (703)
T ss_pred eeEecCCCcEEEEecccCCceEEEEcccceeeeeecccccceeEEEeccchhhheEeecCceeeehhhcCeeeeeecCcC
Confidence 4678899999888877777777776533210 01111111 1 112
Q ss_pred eEEEEEcCCC-CEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCc
Q 000425 54 IVNLACHPRL-PVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERP 131 (1525)
Q Consensus 54 V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~ 131 (1525)
-+.+++|.-. .++++|+. ..|.=+++..+.-+..|. ....++ +|..++....|++|+.||.|-.||.++ +
T Consensus 136 GRDm~y~~~scDly~~gsg-~evYRlNLEqGrfL~P~~~~~~~lN-----~v~in~~hgLla~Gt~~g~VEfwDpR~--k 207 (703)
T KOG2321|consen 136 GRDMKYHKPSCDLYLVGSG-SEVYRLNLEQGRFLNPFETDSGELN-----VVSINEEHGLLACGTEDGVVEFWDPRD--K 207 (703)
T ss_pred CccccccCCCccEEEeecC-cceEEEEccccccccccccccccce-----eeeecCccceEEecccCceEEEecchh--h
Confidence 2334444323 34444444 456667889998888888 668889 999999999999999999999999985 2
Q ss_pred eeeEee------cccCc-----CeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 132 SMIGII------QVGSQ-----PITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 132 s~i~il------~gH~~-----~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
+.++.+ ..|.. .|+++.|+.+|-.++.|+.+|.|.|+|+|.
T Consensus 208 srv~~l~~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa 259 (703)
T KOG2321|consen 208 SRVGTLDAASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRA 259 (703)
T ss_pred hhheeeecccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEccc
Confidence 222222 23333 399999999999999999999999999994
No 253
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.78 E-value=0.00073 Score=83.81 Aligned_cols=152 Identities=15% Similarity=0.109 Sum_probs=93.3
Q ss_pred EEEEEEcCCCCEEEEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCCc--EEEEEcCCC
Q 000425 9 EVHLALTPLQPVVFFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADGL--IRAYNIHTY 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~Dgt--IrLWDl~t~ 83 (1525)
..+.+|+|++..+++....+. .+.+++. .++. ...+.. ..+.....+|+|+|+.|+ +.+.++. |.+||+.++
T Consensus 220 ~~~p~wSPDG~~La~~s~~~g~~~L~~~dl-~tg~-~~~lt~-~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg 296 (448)
T PRK04792 220 LMSPAWSPDGRKLAYVSFENRKAEIFVQDI-YTQV-REKVTS-FPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATK 296 (448)
T ss_pred ccCceECCCCCEEEEEEecCCCcEEEEEEC-CCCC-eEEecC-CCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCC
Confidence 356899999998888765443 3555553 3332 222322 233445789999999765 4566775 777788877
Q ss_pred cEEEEEecCCceeecCceeEEEeCCCCEEEEEe-CCCc--EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 84 AVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGT--LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 84 k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dgt--VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
+......+..... ..+|+|||++|+..+ .++. |.++|..+.....+ ..+.......+|+|||++|+..+
T Consensus 297 ~~~~lt~~~~~~~-----~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~L---t~~g~~~~~~~~SpDG~~l~~~~ 368 (448)
T PRK04792 297 ALTRITRHRAIDT-----EPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRL---TFEGEQNLGGSITPDGRSMIMVN 368 (448)
T ss_pred CeEECccCCCCcc-----ceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEE---ecCCCCCcCeeECCCCCEEEEEE
Confidence 6533222545556 789999999887665 3444 55557664322211 11222334579999999887765
Q ss_pred C-CCCeeEEEee
Q 000425 161 R-DGSLQVWKTR 171 (1525)
Q Consensus 161 ~-DgtIkVWDlr 171 (1525)
. ++..+||-++
T Consensus 369 ~~~g~~~I~~~d 380 (448)
T PRK04792 369 RTNGKFNIARQD 380 (448)
T ss_pred ecCCceEEEEEE
Confidence 5 4455665443
No 254
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.76 E-value=0.00074 Score=83.31 Aligned_cols=155 Identities=15% Similarity=0.102 Sum_probs=93.8
Q ss_pred EEEEEEcCCCCEEEEEEeccc--eEEE--EEeecC-CccceEEccCCCCCeEEEEEcCCCCEEEEEE-CCCcEEEEE--c
Q 000425 9 EVHLALTPLQPVVFFGFHRRM--SVTV--VGTVEG-GRAPTKIKTDLKKPIVNLACHPRLPVLYVAY-ADGLIRAYN--I 80 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~--sI~v--v~tieg-gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs-~DgtIrLWD--l 80 (1525)
....+|+|++..|++...... .+.+ ++...+ +.....+...+.......+|+|||+.|+..+ .+|...||. +
T Consensus 233 ~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~ 312 (428)
T PRK01029 233 QLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIMQI 312 (428)
T ss_pred ccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCCCCEEEEEECCCCCceEEEEEC
Confidence 456789999999888774322 3322 222211 1223333332333446789999999776655 467666665 3
Q ss_pred CC-CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC---CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCE
Q 000425 81 HT-YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR---GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRL 155 (1525)
Q Consensus 81 ~t-~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D---gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~ 155 (1525)
.. +.....+. +...+. ...|||||++|+..+.+ ..|.+||+.+.....+ ..+...+.+..|+|||+.
T Consensus 313 ~~~g~~~~~lt~~~~~~~-----~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~L---t~~~~~~~~p~wSpDG~~ 384 (428)
T PRK01029 313 DPEGQSPRLLTKKYRNSS-----CPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQL---TTSPENKESPSWAIDSLH 384 (428)
T ss_pred cccccceEEeccCCCCcc-----ceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEc---cCCCCCccceEECCCCCE
Confidence 32 23344454 445566 88999999998876633 4689999986422222 222345678999999998
Q ss_pred EEEEE-CCC--CeeEEEee
Q 000425 156 LVTLC-RDG--SLQVWKTR 171 (1525)
Q Consensus 156 LaSgS-~Dg--tIkVWDlr 171 (1525)
|+..+ .++ .|.+||+.
T Consensus 385 L~f~~~~~g~~~L~~vdl~ 403 (428)
T PRK01029 385 LVYSAGNSNESELYLISLI 403 (428)
T ss_pred EEEEECCCCCceEEEEECC
Confidence 77443 333 45556653
No 255
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=97.75 E-value=0.00028 Score=91.86 Aligned_cols=120 Identities=13% Similarity=0.227 Sum_probs=84.7
Q ss_pred CCCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcCCC-------cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC
Q 000425 49 DLKKPIVNLACHPRL-PVLYVAYADGLIRAYNIHTY-------AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG 119 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~t~-------k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg 119 (1525)
+|+..|..++.++.. .+|++||.||+||+||.+.. +...++. ....+. ++.+.+.+..+++++.||
T Consensus 1046 Ehs~~v~k~a~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~-----~vt~~~~~~~~Av~t~DG 1120 (1431)
T KOG1240|consen 1046 EHSSAVIKLAVSSEHTSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVE-----KVTMCGNGDQFAVSTKDG 1120 (1431)
T ss_pred hccccccceeecCCCCceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceE-----EEEeccCCCeEEEEcCCC
Confidence 488888888888766 89999999999999997632 2333444 456788 999999999999999999
Q ss_pred cEEEEeCCCC-----CceeeEeecccCc--CeEEEEEcC--CCCEEEEEECCCCeeEEEeeee
Q 000425 120 TLLAWDVSIE-----RPSMIGIIQVGSQ--PITSVAWLP--MLRLLVTLCRDGSLQVWKTRVI 173 (1525)
Q Consensus 120 tVrVWDl~t~-----~~s~i~il~gH~~--~VtsVafsP--dg~~LaSgS~DgtIkVWDlr~~ 173 (1525)
.|++.++... ......+.+.|.. .|..-+|.. ....++-+..-+.|..||+|..
T Consensus 1121 ~v~~~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~~D~r~~ 1183 (1431)
T KOG1240|consen 1121 SVRVLRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVSWDTRMR 1183 (1431)
T ss_pred eEEEEEccccccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccceEEecchhh
Confidence 9999988752 1111112222322 233333332 2347888888899999999854
No 256
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=97.72 E-value=0.00084 Score=83.04 Aligned_cols=153 Identities=18% Similarity=0.269 Sum_probs=95.6
Q ss_pred CcEEEEEEcCCCCEEEEEEec----------------cceEEEEEeecCCc--cceEEccCCCCCeEEEEEcCCCC-EEE
Q 000425 7 DTEVHLALTPLQPVVFFGFHR----------------RMSVTVVGTVEGGR--APTKIKTDLKKPIVNLACHPRLP-VLY 67 (1525)
Q Consensus 7 ~~e~~LA~hP~qp~Vf~s~~~----------------d~sI~vv~tieggk--~~tklk~~H~~~V~sLaf~Pdg~-~La 67 (1525)
..+.||++++.+.++.+=... |-++.++.. .+++ ....+.+ +....++.|+-..+ .+.
T Consensus 146 r~~r~l~~N~~qDl~ivWW~~~~~e~~PWsP~~~e~draNl~L~~~-~~~klEvL~yirT--E~dPl~~~Fs~~~~~qi~ 222 (545)
T PF11768_consen 146 RLERHLSVNSSQDLVIVWWPSAQDEVWPWSPISSEKDRANLHLLSC-SGGKLEVLSYIRT--ENDPLDVEFSLNQPYQIH 222 (545)
T ss_pred ccceeEeeccCCCEEEEEecCCCCCcCCCCCccccchhccEEEEEe-cCCcEEEEEEEEe--cCCcEEEEccCCCCcEEE
Confidence 368899999888877543211 223333332 2211 1111222 22234566765443 233
Q ss_pred E----EECCCcE----EEEEcCCCcE--EEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE
Q 000425 68 V----AYADGLI----RAYNIHTYAV--HYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG 135 (1525)
Q Consensus 68 S----Gs~DgtI----rLWDl~t~k~--l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~ 135 (1525)
+ .+.+|.+ .+|++...+. +.... ....+. +++++|+..+++.|+.||+|.+||..+......
T Consensus 223 tVE~s~s~~g~~~~d~ciYE~~r~klqrvsvtsipL~s~v~-----~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~- 296 (545)
T PF11768_consen 223 TVEQSISVKGEPSADSCIYECSRNKLQRVSVTSIPLPSQVI-----CCARSPSEDKLVLGCEDGSIILYDTTRGVTLLA- 296 (545)
T ss_pred EEEEecCCCCCceeEEEEEEeecCceeEEEEEEEecCCcce-----EEecCcccceEEEEecCCeEEEEEcCCCeeeee-
Confidence 2 2234433 3455443332 22222 566777 999999999999999999999999985422222
Q ss_pred eecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 136 IIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 136 il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+.-..+.++|||+|.+++.|+.-|.+++||+-
T Consensus 297 ---ka~~~P~~iaWHp~gai~~V~s~qGelQ~FD~A 329 (545)
T PF11768_consen 297 ---KAEFIPTLIAWHPDGAIFVVGSEQGELQCFDMA 329 (545)
T ss_pred ---eecccceEEEEcCCCcEEEEEcCCceEEEEEee
Confidence 345567899999999999999999999999975
No 257
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=97.72 E-value=8.5e-05 Score=60.01 Aligned_cols=37 Identities=27% Similarity=0.534 Sum_probs=35.5
Q ss_pred cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe
Q 000425 84 AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD 125 (1525)
Q Consensus 84 k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD 125 (1525)
+++.++. |...|+ +++|+|++.+|++++.|++|++||
T Consensus 2 ~~~~~~~~h~~~i~-----~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 2 KCVRTFRGHSSSIN-----SIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEEEESSSSSEE-----EEEEETTSSEEEEEETTSEEEEEE
T ss_pred eEEEEEcCCCCcEE-----EEEEecccccceeeCCCCEEEEEC
Confidence 6788999 999999 999999999999999999999998
No 258
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=97.71 E-value=0.0001 Score=82.99 Aligned_cols=69 Identities=25% Similarity=0.412 Sum_probs=56.4
Q ss_pred eEEEeCCCCE-EEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC-CCEEEEEECCCCeeEEEee
Q 000425 102 AFAFHPTLEW-LFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM-LRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 102 sVaFSPdg~~-L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd-g~~LaSgS~DgtIkVWDlr 171 (1525)
+++-||..+. +++|++||.+-+||.+.. ......+.+|...|+-+-|||. +..|++||.||.+..||..
T Consensus 184 ~l~~hp~qq~~v~cgt~dg~~~l~d~rn~-~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedGslw~wdas 254 (319)
T KOG4714|consen 184 ALCSHPAQQHLVCCGTDDGIVGLWDARNV-AMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDGSLWHWDAS 254 (319)
T ss_pred hhhCCcccccEEEEecCCCeEEEEEcccc-cchHHHHHHhhhhhhheeccCCCchheeEecCCCcEEEEcCC
Confidence 7888886655 556779999999999953 2223456799999999999995 5689999999999999987
No 259
>PRK01029 tolB translocation protein TolB; Provisional
Probab=97.70 E-value=0.001 Score=82.01 Aligned_cols=154 Identities=14% Similarity=0.121 Sum_probs=90.1
Q ss_pred EEEEEEcCCCCE---EEEEEec-cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-----CCcEEEEE
Q 000425 9 EVHLALTPLQPV---VFFGFHR-RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-----DGLIRAYN 79 (1525)
Q Consensus 9 e~~LA~hP~qp~---Vf~s~~~-d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-----DgtIrLWD 79 (1525)
...-+|+|++.. +|++... +..|.+.+ ..++. ...+.. ..+.....+|+|||+.|+..+. |..+.+||
T Consensus 187 ~~sP~wSPDG~~~~~~y~S~~~g~~~I~~~~-l~~g~-~~~lt~-~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~ 263 (428)
T PRK01029 187 SITPTWMHIGSGFPYLYVSYKLGVPKIFLGS-LENPA-GKKILA-LQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFS 263 (428)
T ss_pred cccceEccCCCceEEEEEEccCCCceEEEEE-CCCCC-ceEeec-CCCCccceEECCCCCEEEEEECCCCCcceeEEEee
Confidence 355689999764 3344433 23344444 33332 223322 4455667899999998776542 33455577
Q ss_pred cCCC---cEEEEEec-CCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeC--CCCCceeeEeecccCcCeEEEEEcCC
Q 000425 80 IHTY---AVHYTLQL-DNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDV--SIERPSMIGIIQVGSQPITSVAWLPM 152 (1525)
Q Consensus 80 l~t~---k~l~tL~H-~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl--~t~~~s~i~il~gH~~~VtsVafsPd 152 (1525)
+.++ +......+ ..... ..+|+|||++|+..+ .++...+|.+ ...... ...+..+...+....||||
T Consensus 264 ~~~g~~g~~~~lt~~~~~~~~-----~p~wSPDG~~Laf~s~~~g~~~ly~~~~~~~g~~-~~~lt~~~~~~~~p~wSPD 337 (428)
T PRK01029 264 LETGAIGKPRRLLNEAFGTQG-----NPSFSPDGTRLVFVSNKDGRPRIYIMQIDPEGQS-PRLLTKKYRNSSCPAWSPD 337 (428)
T ss_pred cccCCCCcceEeecCCCCCcC-----CeEECCCCCEEEEEECCCCCceEEEEECcccccc-eEEeccCCCCccceeECCC
Confidence 7653 33222222 23345 789999999888666 5676666644 311111 1122344556778999999
Q ss_pred CCEEEEEECCC---CeeEEEee
Q 000425 153 LRLLVTLCRDG---SLQVWKTR 171 (1525)
Q Consensus 153 g~~LaSgS~Dg---tIkVWDlr 171 (1525)
|++|+..+.++ .|.+||+.
T Consensus 338 G~~Laf~~~~~g~~~I~v~dl~ 359 (428)
T PRK01029 338 GKKIAFCSVIKGVRQICVYDLA 359 (428)
T ss_pred CCEEEEEEcCCCCcEEEEEECC
Confidence 99988776542 57777764
No 260
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=97.69 E-value=0.00025 Score=82.31 Aligned_cols=80 Identities=16% Similarity=0.314 Sum_probs=67.6
Q ss_pred cceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE-EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC
Q 000425 42 APTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV-HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG 119 (1525)
Q Consensus 42 ~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~-l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg 119 (1525)
..+.+++ |.+.|++++|.|....+++|+.|..|.+||+..++- ...+. |.+.|. .+.+-+.-+.+++++.||
T Consensus 189 ~i~~~~~-h~~~~~~l~Wd~~~~~LfSg~~d~~vi~wdigg~~g~~~el~gh~~kV~-----~l~~~~~t~~l~S~~edg 262 (404)
T KOG1409|consen 189 LITTFNG-HTGEVTCLKWDPGQRLLFSGASDHSVIMWDIGGRKGTAYELQGHNDKVQ-----ALSYAQHTRQLISCGEDG 262 (404)
T ss_pred eEEEEcC-cccceEEEEEcCCCcEEEeccccCceEEEeccCCcceeeeeccchhhhh-----hhhhhhhheeeeeccCCC
Confidence 3344444 999999999999999999999999999999876643 34444 888899 888878788899999999
Q ss_pred cEEEEeCC
Q 000425 120 TLLAWDVS 127 (1525)
Q Consensus 120 tVrVWDl~ 127 (1525)
.|-+||++
T Consensus 263 ~i~~w~mn 270 (404)
T KOG1409|consen 263 GIVVWNMN 270 (404)
T ss_pred eEEEEecc
Confidence 99999986
No 261
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.67 E-value=0.00027 Score=83.26 Aligned_cols=104 Identities=17% Similarity=0.234 Sum_probs=94.3
Q ss_pred CEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE-Ee-cCCcee
Q 000425 19 PVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT-LQ-LDNTIK 96 (1525)
Q Consensus 19 p~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t-L~-H~~~V~ 96 (1525)
++.|+.......+++|++..+.+++..+.. .+.+|+++...|+++.+++|..-+.+..+|+++++.... +. ..++++
T Consensus 216 ~~~fat~T~~hqvR~YDt~~qRRPV~~fd~-~E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsir 294 (412)
T KOG3881|consen 216 NYKFATITRYHQVRLYDTRHQRRPVAQFDF-LENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSIR 294 (412)
T ss_pred CceEEEEecceeEEEecCcccCcceeEecc-ccCcceeeeecCCCcEEEEecccchhheecccCceeeccccCCccCCcc
Confidence 566777777889999999888888888777 789999999999999999999999999999999998877 66 788999
Q ss_pred ecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 97 LLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 97 ~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
+|.-||.+++++++|-|+.|||+|+.+
T Consensus 295 -----sih~hp~~~~las~GLDRyvRIhD~kt 321 (412)
T KOG3881|consen 295 -----SIHCHPTHPVLASCGLDRYVRIHDIKT 321 (412)
T ss_pred -----eEEEcCCCceEEeeccceeEEEeeccc
Confidence 999999999999999999999999995
No 262
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=97.67 E-value=0.0002 Score=83.12 Aligned_cols=152 Identities=14% Similarity=0.215 Sum_probs=99.8
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEcc------CCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCC
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKT------DLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~------~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~ 83 (1525)
+|.++.++-.. . +.++..|.+|...-.......+.- +.+.-|++..|||... +|+..+..|+||+.|++..
T Consensus 169 SIS~NsD~Et~-l-SADdLRINLWnlei~d~sFnIVDIKP~nmEeLteVITsaEFhp~~cn~f~YSSSKGtIrLcDmR~~ 246 (433)
T KOG1354|consen 169 SISVNSDKETF-L-SADDLRINLWNLEIIDQSFNIVDIKPANMEELTEVITSAEFHPHHCNVFVYSSSKGTIRLCDMRQS 246 (433)
T ss_pred eeeecCccceE-e-eccceeeeeccccccCCceeEEEccccCHHHHHHHHhhhccCHhHccEEEEecCCCcEEEeechhh
Confidence 35566665543 2 336888988875322222222211 1345688899999875 6788889999999999843
Q ss_pred c----EEEEEe-cC------------CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccC-----
Q 000425 84 A----VHYTLQ-LD------------NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGS----- 141 (1525)
Q Consensus 84 k----~l~tL~-H~------------~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~----- 141 (1525)
. ..+.|+ .. .+|. .|.|++.|+|+++-+ =-+|++||++-..+.+. +.+.|.
T Consensus 247 aLCd~hsKlfEepedp~~rsffseiIsSIS-----DvKFs~sGryilsRD-yltvk~wD~nme~~pv~-t~~vh~~lr~k 319 (433)
T KOG1354|consen 247 ALCDAHSKLFEEPEDPSSRSFFSEIISSIS-----DVKFSHSGRYILSRD-YLTVKLWDLNMEAKPVE-TYPVHEYLRSK 319 (433)
T ss_pred hhhcchhhhhccccCCcchhhHHHHhhhhh-----ceEEccCCcEEEEec-cceeEEEeccccCCcce-EEeehHhHHHH
Confidence 2 112222 11 2345 899999999998653 26899999964322222 233332
Q ss_pred -------cCe---EEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 142 -------QPI---TSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 142 -------~~V---tsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
..| ..++|+-++.++.|||-...+++++..
T Consensus 320 Lc~lYEnD~IfdKFec~~sg~~~~v~TGsy~n~frvf~~~ 359 (433)
T KOG1354|consen 320 LCSLYENDAIFDKFECSWSGNDSYVMTGSYNNVFRVFNLA 359 (433)
T ss_pred HHHHhhccchhheeEEEEcCCcceEecccccceEEEecCC
Confidence 222 358999999999999999999999943
No 263
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.66 E-value=0.0064 Score=72.87 Aligned_cols=161 Identities=16% Similarity=0.182 Sum_probs=108.1
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceE--Ec----------cCCCCCeEEEEEcCCCCEEEEEEC-C
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTK--IK----------TDLKKPIVNLACHPRLPVLYVAYA-D 72 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tk--lk----------~~H~~~V~sLaf~Pdg~~LaSGs~-D 72 (1525)
+..-.||+++|++.+++++.....++.++...+.+..... +. .+-....-++.|+|+++++++... .
T Consensus 86 g~~p~~i~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~ 165 (345)
T PF10282_consen 86 GSSPCHIAVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGA 165 (345)
T ss_dssp SSCEEEEEECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTT
T ss_pred CCCcEEEEEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCC
Confidence 4566899999999999999999999988887654432222 11 111234678999999998777543 4
Q ss_pred CcEEEEEcCCCc--E--EEEEe--cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCC--CCceeeE---ee-cc
Q 000425 73 GLIRAYNIHTYA--V--HYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSI--ERPSMIG---II-QV 139 (1525)
Q Consensus 73 gtIrLWDl~t~k--~--l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t--~~~s~i~---il-~g 139 (1525)
..|++|++.... . ...+. ....-+ .+.|+|+++++++.. .+++|.++++.. .....++ .+ .+
T Consensus 166 D~v~~~~~~~~~~~l~~~~~~~~~~G~GPR-----h~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~ 240 (345)
T PF10282_consen 166 DRVYVYDIDDDTGKLTPVDSIKVPPGSGPR-----HLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEG 240 (345)
T ss_dssp TEEEEEEE-TTS-TEEEEEEEECSTTSSEE-----EEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETT
T ss_pred CEEEEEEEeCCCceEEEeeccccccCCCCc-----EEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeecccc
Confidence 579999987654 3 23344 445667 899999999887666 899999999883 2111111 11 01
Q ss_pred --cCcCeEEEEEcCCCCEEEEEE-CCCCeeEEEee
Q 000425 140 --GSQPITSVAWLPMLRLLVTLC-RDGSLQVWKTR 171 (1525)
Q Consensus 140 --H~~~VtsVafsPdg~~LaSgS-~DgtIkVWDlr 171 (1525)
.......|.++|||++|..+. .+.+|.+|+++
T Consensus 241 ~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d 275 (345)
T PF10282_consen 241 FTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLD 275 (345)
T ss_dssp SCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEEC
T ss_pred ccccCCceeEEEecCCCEEEEEeccCCEEEEEEEe
Confidence 122567899999999866654 56689999984
No 264
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.63 E-value=0.0024 Score=77.48 Aligned_cols=159 Identities=18% Similarity=0.171 Sum_probs=102.0
Q ss_pred cccCCcEEE--EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEE-CCCcEEEEE
Q 000425 3 SISVDTEVH--LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAY-ADGLIRAYN 79 (1525)
Q Consensus 3 ~~~~~~e~~--LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs-~DgtIrLWD 79 (1525)
+|......| ++++|++.++|++. ++..+.+++.... +....++. ...-++++++++|+++++++ .++.+.++|
T Consensus 31 ~i~~~~~~h~~~~~s~Dgr~~yv~~-rdg~vsviD~~~~-~~v~~i~~--G~~~~~i~~s~DG~~~~v~n~~~~~v~v~D 106 (369)
T PF02239_consen 31 RIPTGGAPHAGLKFSPDGRYLYVAN-RDGTVSVIDLATG-KVVATIKV--GGNPRGIAVSPDGKYVYVANYEPGTVSVID 106 (369)
T ss_dssp EEE-STTEEEEEE-TT-SSEEEEEE-TTSEEEEEETTSS-SEEEEEE---SSEEEEEEE--TTTEEEEEEEETTEEEEEE
T ss_pred EEcCCCCceeEEEecCCCCEEEEEc-CCCeEEEEECCcc-cEEEEEec--CCCcceEEEcCCCCEEEEEecCCCceeEec
Confidence 455444555 57899999999986 6788989886443 34444544 34568899999999988765 689999999
Q ss_pred cCCCcEEEEEe-cC-------CceeecCceeEEEeCCCCEEE-EEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEc
Q 000425 80 IHTYAVHYTLQ-LD-------NTIKLLGAGAFAFHPTLEWLF-VGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWL 150 (1525)
Q Consensus 80 l~t~k~l~tL~-H~-------~~V~~~ga~sVaFSPdg~~L~-SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafs 150 (1525)
.++.++++++. .. ..+. .+..+|..+.++ +--+.+.|.+-|+........... ..........|+
T Consensus 107 ~~tle~v~~I~~~~~~~~~~~~Rv~-----aIv~s~~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i-~~g~~~~D~~~d 180 (369)
T PF02239_consen 107 AETLEPVKTIPTGGMPVDGPESRVA-----AIVASPGRPEFVVNLKDTGEIWVVDYSDPKNLKVTTI-KVGRFPHDGGFD 180 (369)
T ss_dssp TTT--EEEEEE--EE-TTTS---EE-----EEEE-SSSSEEEEEETTTTEEEEEETTTSSCEEEEEE-E--TTEEEEEE-
T ss_pred cccccceeecccccccccccCCCce-----eEEecCCCCEEEEEEccCCeEEEEEeccccccceeee-cccccccccccC
Confidence 99999999887 32 2345 777788887444 444668887779875322222122 234456789999
Q ss_pred CCCCEEEEE-ECCCCeeEEEee
Q 000425 151 PMLRLLVTL-CRDGSLQVWKTR 171 (1525)
Q Consensus 151 Pdg~~LaSg-S~DgtIkVWDlr 171 (1525)
|+++|++.+ ..+..|-++|..
T Consensus 181 pdgry~~va~~~sn~i~viD~~ 202 (369)
T PF02239_consen 181 PDGRYFLVAANGSNKIAVIDTK 202 (369)
T ss_dssp TTSSEEEEEEGGGTEEEEEETT
T ss_pred cccceeeecccccceeEEEeec
Confidence 999987664 556688889875
No 265
>PRK00178 tolB translocation protein TolB; Provisional
Probab=97.61 E-value=0.0018 Score=79.41 Aligned_cols=147 Identities=14% Similarity=0.052 Sum_probs=90.6
Q ss_pred EEEEEcCCCCEEEEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CC--cEEEEEcCCCc
Q 000425 10 VHLALTPLQPVVFFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DG--LIRAYNIHTYA 84 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-Dg--tIrLWDl~t~k 84 (1525)
...+|+|++..++++...+. .+.+++.. ++ ....+.. +........|+|+++.++..+. ++ .|.+||+.+++
T Consensus 246 ~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~-~~-~~~~lt~-~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~ 322 (430)
T PRK00178 246 GAPAWSPDGSKLAFVLSKDGNPEIYVMDLA-SR-QLSRVTN-HPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGR 322 (430)
T ss_pred CCeEECCCCCEEEEEEccCCCceEEEEECC-CC-CeEEccc-CCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 35899999999888765544 45555542 22 2333333 5555677899999997765553 33 57777887776
Q ss_pred EEEEEecCCceeecCceeEEEeCCCCEEEEEeC-CC--cEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 85 VHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR-RG--TLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 85 ~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~-Dg--tVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
............ ...|+|||++|+..+. ++ .|.+||+.+.....+ . +.......+|+|+|+.++..+.
T Consensus 323 ~~~lt~~~~~~~-----~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~l--t--~~~~~~~p~~spdg~~i~~~~~ 393 (430)
T PRK00178 323 AERVTFVGNYNA-----RPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRIL--T--DTSLDESPSVAPNGTMLIYATR 393 (430)
T ss_pred EEEeecCCCCcc-----ceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEc--c--CCCCCCCceECCCCCEEEEEEe
Confidence 433221222333 5689999999887763 33 577889886422222 1 1222235689999998877665
Q ss_pred C-CCeeEE
Q 000425 162 D-GSLQVW 168 (1525)
Q Consensus 162 D-gtIkVW 168 (1525)
+ +.-++|
T Consensus 394 ~~g~~~l~ 401 (430)
T PRK00178 394 QQGRGVLM 401 (430)
T ss_pred cCCceEEE
Confidence 4 333344
No 266
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=97.61 E-value=0.00016 Score=83.26 Aligned_cols=135 Identities=15% Similarity=0.206 Sum_probs=97.9
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeE
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIG 135 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~ 135 (1525)
..|+|+|+++|+.+.- .+-|-|..+.+.++.|.--+.|. .+.|..|.-+++++- .|+.|.+|++.. +....
T Consensus 14 c~fSp~g~yiAs~~~y-rlviRd~~tlq~~qlf~cldki~-----yieW~ads~~ilC~~yk~~~vqvwsl~Q--pew~c 85 (447)
T KOG4497|consen 14 CSFSPCGNYIASLSRY-RLVIRDSETLQLHQLFLCLDKIV-----YIEWKADSCHILCVAYKDPKVQVWSLVQ--PEWYC 85 (447)
T ss_pred eeECCCCCeeeeeeee-EEEEeccchhhHHHHHHHHHHhh-----heeeeccceeeeeeeeccceEEEEEeec--ceeEE
Confidence 5799999999998754 67677777665554444234566 789999999988776 889999999984 33333
Q ss_pred eecccCcCeEEEEEcCCCCEEE-EEECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccCCcceecCCCe
Q 000425 136 IIQVGSQPITSVAWLPMLRLLV-TLCRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLPRV 214 (1525)
Q Consensus 136 il~gH~~~VtsVafsPdg~~La-SgS~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~~V 214 (1525)
..+.....+.+++|||||+.|+ +.+-|-.|.||.+.+... . ...||..++
T Consensus 86 kIdeg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~---------~--------------------~~~~pK~~~ 136 (447)
T KOG4497|consen 86 KIDEGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKG---------Y--------------------LLPHPKTNV 136 (447)
T ss_pred EeccCCCcceeeeECCCcceEeeeecceeEEEEEEecccee---------E--------------------EecccccCc
Confidence 3556788899999999997655 555688899998763211 0 013455567
Q ss_pred eEEEEecCCCeEEE
Q 000425 215 RALEVHPRLNLAVL 228 (1525)
Q Consensus 215 ~~v~~HP~~Nlial 228 (1525)
+-++|||++-..++
T Consensus 137 kg~~f~~dg~f~ai 150 (447)
T KOG4497|consen 137 KGYAFHPDGQFCAI 150 (447)
T ss_pred eeEEECCCCceeee
Confidence 77899998887764
No 267
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=97.58 E-value=0.00082 Score=76.97 Aligned_cols=154 Identities=12% Similarity=0.138 Sum_probs=103.0
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecC-CccceEEccCCCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCC-CcE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEG-GRAPTKIKTDLKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHT-YAV 85 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg-gk~~tklk~~H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t-~k~ 85 (1525)
-..|.+++.+.-++++. .+.++..+...+. .......+. |.-......|+-..+ ++.+|++|+.+..||++. .+.
T Consensus 124 ~lslD~~~~~~~i~vs~-s~G~~~~v~~t~~~le~vq~wk~-He~E~Wta~f~~~~pnlvytGgDD~~l~~~D~R~p~~~ 201 (339)
T KOG0280|consen 124 ALSLDISTSGTKIFVSD-SRGSISGVYETEMVLEKVQTWKV-HEFEAWTAKFSDKEPNLVYTGGDDGSLSCWDIRIPKTF 201 (339)
T ss_pred eeEEEeeccCceEEEEc-CCCcEEEEecceeeeeecccccc-cceeeeeeecccCCCceEEecCCCceEEEEEecCCcce
Confidence 45678888888887776 4677764443222 112223444 988999999987665 788999999999999993 333
Q ss_pred EEE-Ee-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCC-EEEEEEC
Q 000425 86 HYT-LQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLR-LLVTLCR 161 (1525)
Q Consensus 86 l~t-L~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS~ 161 (1525)
+.. -+ |...|. ++.-+| ++.+|++|+-|-.|++||.+...+.+. -..-.+.|..+.+||... .++.++.
T Consensus 202 i~~n~kvH~~GV~-----SI~ss~~~~~~I~TGsYDe~i~~~DtRnm~kPl~--~~~v~GGVWRi~~~p~~~~~lL~~CM 274 (339)
T KOG0280|consen 202 IWHNSKVHTSGVV-----SIYSSPPKPTYIATGSYDECIRVLDTRNMGKPLF--KAKVGGGVWRIKHHPEIFHRLLAACM 274 (339)
T ss_pred eeecceeeecceE-----EEecCCCCCceEEEeccccceeeeehhcccCccc--cCccccceEEEEecchhhhHHHHHHH
Confidence 333 33 777777 888776 567999999999999999996422222 112246799999999532 2333344
Q ss_pred CCCeeEEEee
Q 000425 162 DGSLQVWKTR 171 (1525)
Q Consensus 162 DgtIkVWDlr 171 (1525)
-...+|-+..
T Consensus 275 h~G~ki~~~~ 284 (339)
T KOG0280|consen 275 HNGAKILDSS 284 (339)
T ss_pred hcCceEEEec
Confidence 4455555543
No 268
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=97.58 E-value=0.64 Score=63.06 Aligned_cols=69 Identities=20% Similarity=0.380 Sum_probs=55.3
Q ss_pred CceeecCceeEEEeCCCCEEEEEeCCCcEEEE----eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEE
Q 000425 93 NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAW----DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 93 ~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVW----Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVW 168 (1525)
..|. ++.|-++..-++.+..+|.|.+. |..+....+++.. ...|.+++||||+.+|+..+.++++-+-
T Consensus 76 ~~iv-----s~~yl~d~~~l~~~~~~Gdi~~~~~~~~~~~~~~E~VG~v---d~GI~a~~WSPD~Ella~vT~~~~l~~m 147 (928)
T PF04762_consen 76 DKIV-----SFQYLADSESLCIALASGDIILVREDPDPDEDEIEIVGSV---DSGILAASWSPDEELLALVTGEGNLLLM 147 (928)
T ss_pred CcEE-----EEEeccCCCcEEEEECCceEEEEEccCCCCCceeEEEEEE---cCcEEEEEECCCcCEEEEEeCCCEEEEE
Confidence 4677 89999999999999999999999 5554344555433 4579999999999999999999988775
Q ss_pred E
Q 000425 169 K 169 (1525)
Q Consensus 169 D 169 (1525)
+
T Consensus 148 t 148 (928)
T PF04762_consen 148 T 148 (928)
T ss_pred e
Confidence 4
No 269
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=97.56 E-value=0.00018 Score=83.55 Aligned_cols=116 Identities=10% Similarity=0.112 Sum_probs=90.9
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC-----cEEEEEecCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEE
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY-----AVHYTLQLDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAW 124 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~-----k~l~tL~H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVW 124 (1525)
+..|.++.|+-.++++..||..|.|..+|++.. .+...+-|...|+ ++..-. ++++|++.+.+|+|++|
T Consensus 252 ksDVfAlQf~~s~nLv~~GcRngeI~~iDLR~rnqG~~~~a~rlyh~Ssvt-----slq~Lq~s~q~LmaS~M~gkikLy 326 (425)
T KOG2695|consen 252 KSDVFALQFAGSDNLVFNGCRNGEIFVIDLRCRNQGNGWCAQRLYHDSSVT-----SLQILQFSQQKLMASDMTGKIKLY 326 (425)
T ss_pred chhHHHHHhcccCCeeEecccCCcEEEEEeeecccCCCcceEEEEcCcchh-----hhhhhccccceEeeccCcCceeEe
Confidence 456888999999999999999999999999864 4555666999999 887766 78899999999999999
Q ss_pred eCCCC-CceeeEeecccCcCeEEE--EEcCCCCEEEEEECCCCeeEEEee
Q 000425 125 DVSIE-RPSMIGIIQVGSQPITSV--AWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 125 Dl~t~-~~s~i~il~gH~~~VtsV--afsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
|.+-. ...-+..+.||...-.-+ -.++....++++++|...|||.++
T Consensus 327 D~R~~K~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~ 376 (425)
T KOG2695|consen 327 DLRATKCKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLD 376 (425)
T ss_pred eehhhhcccceeeeecccccccccccccccccceEEEccCeeEEEEEecc
Confidence 99842 111244567775543333 345667789999999999999987
No 270
>PRK04792 tolB translocation protein TolB; Provisional
Probab=97.52 E-value=0.003 Score=78.45 Aligned_cols=141 Identities=13% Similarity=0.054 Sum_probs=85.5
Q ss_pred EEEEEcCCCCEEEEEEeccce--EEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CC--cEEEEEcCCCc
Q 000425 10 VHLALTPLQPVVFFGFHRRMS--VTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DG--LIRAYNIHTYA 84 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~s--I~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-Dg--tIrLWDl~t~k 84 (1525)
...+|+|++..++++...+.. +.+++. +++ ....+.. |.......+|+|+++.++..+. ++ .|.++|+.+++
T Consensus 265 ~~~~wSPDG~~La~~~~~~g~~~Iy~~dl-~tg-~~~~lt~-~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~ 341 (448)
T PRK04792 265 GAPRFSPDGKKLALVLSKDGQPEIYVVDI-ATK-ALTRITR-HRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGK 341 (448)
T ss_pred CCeeECCCCCEEEEEEeCCCCeEEEEEEC-CCC-CeEECcc-CCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC
Confidence 468999999988887665554 444443 222 3333333 5556678899999997765543 44 46666777776
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCcE--EEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTL--LAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtV--rVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
... +. +..... ..+|+|||++|+..+ .++.. .++|+.+.....+ ..+. .....+|+|+|+.|+..+
T Consensus 342 ~~~-Lt~~g~~~~-----~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~l---t~~~-~d~~ps~spdG~~I~~~~ 411 (448)
T PRK04792 342 VSR-LTFEGEQNL-----GGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQVL---TSTR-LDESPSVAPNGTMVIYST 411 (448)
T ss_pred EEE-EecCCCCCc-----CeeECCCCCEEEEEEecCCceEEEEEECCCCCeEEc---cCCC-CCCCceECCCCCEEEEEE
Confidence 433 32 333334 578999999988765 44544 4457765322222 1221 223458999999877655
Q ss_pred CCC
Q 000425 161 RDG 163 (1525)
Q Consensus 161 ~Dg 163 (1525)
.++
T Consensus 412 ~~~ 414 (448)
T PRK04792 412 TYQ 414 (448)
T ss_pred ecC
Confidence 443
No 271
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.50 E-value=0.0022 Score=87.24 Aligned_cols=161 Identities=11% Similarity=0.089 Sum_probs=104.4
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEcc--------C------CCCCeEEEEEcCCCCE-EEEEECCCc
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKT--------D------LKKPIVNLACHPRLPV-LYVAYADGL 74 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~--------~------H~~~V~sLaf~Pdg~~-LaSGs~Dgt 74 (1525)
..|+|+|..-.++++-..+..|.+++...+.. ..+.+ + .-....+|+|+|++.. +++.+.++.
T Consensus 686 ~gVa~dp~~g~LyVad~~~~~I~v~d~~~g~v--~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~ 763 (1057)
T PLN02919 686 WDVCFEPVNEKVYIAMAGQHQIWEYNISDGVT--RVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSS 763 (1057)
T ss_pred eEEEEecCCCeEEEEECCCCeEEEEECCCCeE--EEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCe
Confidence 35889997777888776677777776433211 01100 0 1123467999999984 556666799
Q ss_pred EEEEEcCCCcEEEEE-------------e-cCC---ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC-ceeeEe
Q 000425 75 IRAYNIHTYAVHYTL-------------Q-LDN---TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER-PSMIGI 136 (1525)
Q Consensus 75 IrLWDl~t~k~l~tL-------------~-H~~---~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~-~s~i~i 136 (1525)
|++||+.++...... . +.. ...+.....++|+++|..+++-+.++.|++||..+.. ..+.+.
T Consensus 764 Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~ 843 (1057)
T PLN02919 764 IRALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGT 843 (1057)
T ss_pred EEEEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEecc
Confidence 999999876532111 0 000 0011122488999999988888899999999987531 111110
Q ss_pred ----------ecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 137 ----------IQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 137 ----------l~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
..++.....+++++++|+++++.+.+++|++||++.
T Consensus 844 G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~ 889 (1057)
T PLN02919 844 GKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889 (1057)
T ss_pred CCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCC
Confidence 112233567899999999999999999999999863
No 272
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=97.50 E-value=0.00049 Score=79.95 Aligned_cols=153 Identities=14% Similarity=0.211 Sum_probs=103.1
Q ss_pred EEEcC-CCCEEEEEEeccceEEEEEeecCCccc----------------------------------eEEccCCCCCeEE
Q 000425 12 LALTP-LQPVVFFGFHRRMSVTVVGTVEGGRAP----------------------------------TKIKTDLKKPIVN 56 (1525)
Q Consensus 12 LA~hP-~qp~Vf~s~~~d~sI~vv~tieggk~~----------------------------------tklk~~H~~~V~s 56 (1525)
|.+.+ .+...|.-+..++++++|-..+.+... ......|+--|.+
T Consensus 90 Irw~~~~n~a~FLlstNdktiKlWKi~er~~k~~~~~~~~~~~~~~~~~lr~p~~~~~~~~vea~prRv~aNaHtyhiNS 169 (433)
T KOG1354|consen 90 IRWLDDGNLAEFLLSTNDKTIKLWKIRERGSKKEGYNLPEEGPPGTITSLRLPVEGRHDLEVEASPRRVYANAHTYHINS 169 (433)
T ss_pred ceecCCCCccEEEEecCCcceeeeeeeccccccccccccccCCCCccceeeceeeccccceeeeeeeeeccccceeEeee
Confidence 44554 355556666678999988765543222 1112247777999
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe----cC-----CceeecCceeEEEeCC-CCEEEEEeCCCcEEEEeC
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ----LD-----NTIKLLGAGAFAFHPT-LEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~----H~-----~~V~~~ga~sVaFSPd-g~~L~SgS~DgtVrVWDl 126 (1525)
|.++.|+..++++ +|-.|.+|++.--..-+++- +. .-|+ +-.|||. .++++-+|+-|+|++.|+
T Consensus 170 IS~NsD~Et~lSA-DdLRINLWnlei~d~sFnIVDIKP~nmEeLteVIT-----saEFhp~~cn~f~YSSSKGtIrLcDm 243 (433)
T KOG1354|consen 170 ISVNSDKETFLSA-DDLRINLWNLEIIDQSFNIVDIKPANMEELTEVIT-----SAEFHPHHCNVFVYSSSKGTIRLCDM 243 (433)
T ss_pred eeecCccceEeec-cceeeeeccccccCCceeEEEccccCHHHHHHHHh-----hhccCHhHccEEEEecCCCcEEEeec
Confidence 9999999988886 67889999976443333322 21 3355 8899995 577888889999999999
Q ss_pred CCC-----Cceee---------EeecccCcCeEEEEEcCCCCEEEEEECC-CCeeEEEeee
Q 000425 127 SIE-----RPSMI---------GIIQVGSQPITSVAWLPMLRLLVTLCRD-GSLQVWKTRV 172 (1525)
Q Consensus 127 ~t~-----~~s~i---------~il~gH~~~VtsVafsPdg~~LaSgS~D-gtIkVWDlr~ 172 (1525)
+.. ..++. ..+.+--..|..+.|+++|+|++| .| .+|++||++.
T Consensus 244 R~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~sGryils--RDyltvk~wD~nm 302 (433)
T KOG1354|consen 244 RQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHSGRYILS--RDYLTVKLWDLNM 302 (433)
T ss_pred hhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccCCcEEEE--eccceeEEEeccc
Confidence 842 00000 112233346788999999999987 45 5999999963
No 273
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.50 E-value=0.0083 Score=81.73 Aligned_cols=159 Identities=11% Similarity=0.117 Sum_probs=96.8
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEcc-CC---------------CCCeEEEEEcCC-CCEEEEEECC
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKT-DL---------------KKPIVNLACHPR-LPVLYVAYAD 72 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~-~H---------------~~~V~sLaf~Pd-g~~LaSGs~D 72 (1525)
..|++.|++-.++++-.....|.+++...+ . ...+.+ +. -..-..++|+|+ +.++++.+.+
T Consensus 627 ~GIavd~~gn~LYVaDt~n~~Ir~id~~~~-~-V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~ 704 (1057)
T PLN02919 627 QGLAYNAKKNLLYVADTENHALREIDFVNE-T-VRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQ 704 (1057)
T ss_pred cEEEEeCCCCEEEEEeCCCceEEEEecCCC-E-EEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCC
Confidence 567888887777776554455655553211 1 111110 00 011257899994 5566777788
Q ss_pred CcEEEEEcCCCcEEEEEe-cC------C----ceeecCceeEEEeCCCCEEEEE-eCCCcEEEEeCCCCCceeeEe----
Q 000425 73 GLIRAYNIHTYAVHYTLQ-LD------N----TIKLLGAGAFAFHPTLEWLFVG-DRRGTLLAWDVSIERPSMIGI---- 136 (1525)
Q Consensus 73 gtIrLWDl~t~k~l~tL~-H~------~----~V~~~ga~sVaFSPdg~~L~Sg-S~DgtVrVWDl~t~~~s~i~i---- 136 (1525)
+.|++||..++... ++. .. . ...+.....++|+|++++|+.+ +.++.|++||+.+.......-
T Consensus 705 ~~I~v~d~~~g~v~-~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~ 783 (1057)
T PLN02919 705 HQIWEYNISDGVTR-VFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPT 783 (1057)
T ss_pred CeEEEEECCCCeEE-EEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccc
Confidence 99999998876543 222 10 0 0011122479999999866544 478999999988532111100
Q ss_pred -------e--------cccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 137 -------I--------QVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 137 -------l--------~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+ ..+...-.+++|+++|.++++.+.+++|++||..
T Consensus 784 ~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~ 833 (1057)
T PLN02919 784 FSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPA 833 (1057)
T ss_pred cCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECC
Confidence 0 0011123589999999999999999999999985
No 274
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=97.41 E-value=0.00021 Score=85.87 Aligned_cols=117 Identities=17% Similarity=0.219 Sum_probs=83.5
Q ss_pred cCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcC-------CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCC
Q 000425 48 TDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIH-------TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRG 119 (1525)
Q Consensus 48 ~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~-------t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg 119 (1525)
.+|...|+.++--.+.+-|++++.|.+||+|.++ +..|.++++ |.++|+ ++.|-.+.++++++ |+
T Consensus 732 ~GH~~~iRai~AidNENSFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~-----~igfL~~lr~i~Sc--D~ 804 (1034)
T KOG4190|consen 732 TGHQEKIRAIAAIDNENSFISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIH-----DIGFLADLRSIASC--DG 804 (1034)
T ss_pred cCcHHHhHHHHhcccccceeeccCCceEEEEEeccccCccccceeeeEhhhccCccc-----ceeeeeccceeeec--cC
Confidence 3588889988776677889999999999999976 345888999 999999 99999999888654 78
Q ss_pred cEEEEeCCCC--Cceee-EeecccCcCeEEEEEcCCCCE-EEEEECCCCeeEEEeee
Q 000425 120 TLLAWDVSIE--RPSMI-GIIQVGSQPITSVAWLPMLRL-LVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 120 tVrVWDl~t~--~~s~i-~il~gH~~~VtsVafsPdg~~-LaSgS~DgtIkVWDlr~ 172 (1525)
-|.+||--.. ..++. ....+....|.|+- +-+.++ ++-|+...|||++|.|.
T Consensus 805 giHlWDPFigr~Laq~~dapk~~a~~~ikcl~-nv~~~iliAgcsaeSTVKl~DaRs 860 (1034)
T KOG4190|consen 805 GIHLWDPFIGRLLAQMEDAPKEGAGGNIKCLE-NVDRHILIAGCSAESTVKLFDARS 860 (1034)
T ss_pred cceeecccccchhHhhhcCcccCCCceeEecc-cCcchheeeeccchhhheeeeccc
Confidence 9999995421 11111 01112233455542 113444 44558899999999984
No 275
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=97.41 E-value=0.00026 Score=82.25 Aligned_cols=114 Identities=13% Similarity=0.119 Sum_probs=88.6
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecC----CccceEEccCCCCCeEEEEEcC-CCCEEEEEECCCcEEEEEcCCC
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEG----GRAPTKIKTDLKKPIVNLACHP-RLPVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieg----gk~~tklk~~H~~~V~sLaf~P-dg~~LaSGs~DgtIrLWDl~t~ 83 (1525)
.+.+.||-..|.||.|. +...|.++|.... +++...+. |...|+++..-. ++++|++.+.+|+|++||.+--
T Consensus 255 VfAlQf~~s~nLv~~Gc-RngeI~~iDLR~rnqG~~~~a~rly--h~Ssvtslq~Lq~s~q~LmaS~M~gkikLyD~R~~ 331 (425)
T KOG2695|consen 255 VFALQFAGSDNLVFNGC-RNGEIFVIDLRCRNQGNGWCAQRLY--HDSSVTSLQILQFSQQKLMASDMTGKIKLYDLRAT 331 (425)
T ss_pred HHHHHhcccCCeeEecc-cCCcEEEEEeeecccCCCcceEEEE--cCcchhhhhhhccccceEeeccCcCceeEeeehhh
Confidence 45677888899999888 5889999998766 55555553 889999988766 6788999999999999998876
Q ss_pred cE---EEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 84 AV---HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 84 k~---l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
++ +.+++ |-+.-. ..-+..++....++++++|...|||.+.+
T Consensus 332 K~~~~V~qYeGHvN~~a---~l~~~v~~eeg~I~s~GdDcytRiWsl~~ 377 (425)
T KOG2695|consen 332 KCKKSVMQYEGHVNLSA---YLPAHVKEEEGSIFSVGDDCYTRIWSLDS 377 (425)
T ss_pred hcccceeeeeccccccc---ccccccccccceEEEccCeeEEEEEeccc
Confidence 66 77777 755433 00233456677899999999999999985
No 276
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.36 E-value=0.00028 Score=87.67 Aligned_cols=115 Identities=12% Similarity=0.156 Sum_probs=90.7
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC-cEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY-AVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~-k~l 86 (1525)
.+.-+.|||+.|.|+....-+..+.+|++....++...... -......|+|+-..+.+++.+....|++||.+.+ .++
T Consensus 116 aitd~n~~~q~pdVlatcsvdt~vh~wd~rSp~~p~ys~~~-w~s~asqVkwnyk~p~vlasshg~~i~vwd~r~gs~pl 194 (1081)
T KOG0309|consen 116 AITDINFNPQHPDVLATCSVDTYVHAWDMRSPHRPFYSTSS-WRSAASQVKWNYKDPNVLASSHGNDIFVWDLRKGSTPL 194 (1081)
T ss_pred ceeccccCCCCCcceeeccccccceeeeccCCCcceeeeec-ccccCceeeecccCcchhhhccCCceEEEeccCCCcce
Confidence 45667899999999999888999999987655444444333 3345567899988888777788889999998765 678
Q ss_pred EEEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCCC
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~t 128 (1525)
.+++ |...|+ .++|.. ....+.+++.|++|+.||+..
T Consensus 195 ~s~K~~vs~vn-----~~~fnr~~~s~~~s~~~d~tvkfw~y~k 233 (1081)
T KOG0309|consen 195 CSLKGHVSSVN-----SIDFNRFKYSEIMSSSNDGTVKFWDYSK 233 (1081)
T ss_pred EEecccceeee-----hHHHhhhhhhhhcccCCCCceeeecccc
Confidence 8888 888899 888876 335678899999999999874
No 277
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=97.35 E-value=0.002 Score=79.80 Aligned_cols=74 Identities=22% Similarity=0.332 Sum_probs=63.0
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
.+...|.+++++|+...|+.||.||+|.+||...+....+ +..-... .++|||+|..+++|++-|.+.+||+.-
T Consensus 257 pL~s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~-ka~~~P~-----~iaWHp~gai~~V~s~qGelQ~FD~AL 330 (545)
T PF11768_consen 257 PLPSQVICCARSPSEDKLVLGCEDGSIILYDTTRGVTLLA-KAEFIPT-----LIAWHPDGAIFVVGSEQGELQCFDMAL 330 (545)
T ss_pred ecCCcceEEecCcccceEEEEecCCeEEEEEcCCCeeeee-eecccce-----EEEEcCCCcEEEEEcCCceEEEEEeec
Confidence 4788999999999999999999999999999876533322 3445566 899999999999999999999999874
No 278
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.34 E-value=0.0034 Score=79.60 Aligned_cols=101 Identities=21% Similarity=0.295 Sum_probs=77.3
Q ss_pred cCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCC-----CCEEEEEeCCCcEEEEeCCC--CCce
Q 000425 60 HPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPT-----LEWLFVGDRRGTLLAWDVSI--ERPS 132 (1525)
Q Consensus 60 ~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPd-----g~~L~SgS~DgtVrVWDl~t--~~~s 132 (1525)
+-+|.+++||+.||+|-|-.+-+.+...++.-..++. +|+++|+ .+.+++||.-| +.++.-+- ...+
T Consensus 80 ~~~Gey~asCS~DGkv~I~sl~~~~~~~~~df~rpik-----sial~Pd~~~~~sk~fv~GG~ag-lvL~er~wlgnk~~ 153 (846)
T KOG2066|consen 80 ILEGEYVASCSDDGKVVIGSLFTDDEITQYDFKRPIK-----SIALHPDFSRQQSKQFVSGGMAG-LVLSERNWLGNKDS 153 (846)
T ss_pred ccCCceEEEecCCCcEEEeeccCCccceeEecCCcce-----eEEeccchhhhhhhheeecCcce-EEEehhhhhcCccc
Confidence 4568999999999999999988888777777677888 9999998 46789999888 55553321 1222
Q ss_pred eeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 133 MIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 133 ~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
. .++...++|.+|.|+ |.+|+=++++ .|+|+|+.
T Consensus 154 v--~l~~~eG~I~~i~W~--g~lIAWand~-Gv~vyd~~ 187 (846)
T KOG2066|consen 154 V--VLSEGEGPIHSIKWR--GNLIAWANDD-GVKVYDTP 187 (846)
T ss_pred e--eeecCccceEEEEec--CcEEEEecCC-CcEEEecc
Confidence 2 255667899999995 5677777655 59999986
No 279
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=97.32 E-value=0.0019 Score=87.05 Aligned_cols=153 Identities=16% Similarity=0.237 Sum_probs=114.3
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
|-..|+++.-||..++.++|+.||.|++|....+..+..++ ....|. .+.|+..|+.+..+..||.+.+|...
T Consensus 2207 ~v~~v~r~~sHp~~~~Yltgs~dgsv~~~~w~~~~~v~~~rt~g~s~vt-----r~~f~~qGnk~~i~d~dg~l~l~q~~ 2281 (2439)
T KOG1064|consen 2207 PVENVRRMTSHPSDPYYLTGSQDGSVRMFEWGHGQQVVCFRTAGNSRVT-----RSRFNHQGNKFGIVDGDGDLSLWQAS 2281 (2439)
T ss_pred ccCceeeecCCCCCceEEecCCCceEEEEeccCCCeEEEeeccCcchhh-----hhhhcccCCceeeeccCCceeecccC
Confidence 56788999999999999999999999999999999999998 446777 89999999999999999999999988
Q ss_pred CCCceeeEeecccCcCeEEEEEcCCCCEEEEE---ECCCCeeEEEeeeecCCCCCCCccccccCCCcccccccccccccC
Q 000425 128 IERPSMIGIIQVGSQPITSVAWLPMLRLLVTL---CRDGSLQVWKTRVIINPNRPPMQANFFEPASIESIDIPRILSQQG 204 (1525)
Q Consensus 128 t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg---S~DgtIkVWDlr~~~~p~~~~~~a~~f~p~gVesldi~~iLs~~g 204 (1525)
. +.+. ..+.|+...+.+.|-. ..++++ ++++.+.+||... ++....+. ..+.
T Consensus 2282 p--k~~~-s~qchnk~~~Df~Fi~--s~~~tag~s~d~~n~~lwDtl~--~~~~s~v~------------------~~H~ 2336 (2439)
T KOG1064|consen 2282 P--KPYT-SWQCHNKALSDFRFIG--SLLATAGRSSDNRNVCLWDTLL--PPMNSLVH------------------TCHD 2336 (2439)
T ss_pred C--ccee-ccccCCccccceeeee--hhhhccccCCCCCcccchhccc--Ccccceee------------------eecC
Confidence 2 3332 4667888888888865 556665 4677999999742 21111110 1111
Q ss_pred CcceecCCCeeEEEEecCCCeEEEEecccccCCcccccccc
Q 000425 205 GEAVYPLPRVRALEVHPRLNLAVLLFANFTGGDNIKNRAAY 245 (1525)
Q Consensus 205 G~~vy~l~~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k~~~ 245 (1525)
.+++++++-|+.-++. +|+-+..+++|.
T Consensus 2337 -------~gaT~l~~~P~~qlli------sggr~G~v~l~D 2364 (2439)
T KOG1064|consen 2337 -------GGATVLAYAPKHQLLI------SGGRKGEVCLFD 2364 (2439)
T ss_pred -------CCceEEEEcCcceEEE------ecCCcCcEEEee
Confidence 2467788888877765 666666666554
No 280
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.28 E-value=0.012 Score=69.47 Aligned_cols=161 Identities=13% Similarity=0.109 Sum_probs=110.4
Q ss_pred ccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCc--cceEEccCCCCC----------eEEEEEcCCCCEEEEEEC
Q 000425 4 ISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGR--APTKIKTDLKKP----------IVNLACHPRLPVLYVAYA 71 (1525)
Q Consensus 4 ~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk--~~tklk~~H~~~----------V~sLaf~Pdg~~LaSGs~ 71 (1525)
+++..--|+++.+++.+||++.+....|.++...+.|. ....+. .|.++ +-+..+.|+++++++.+.
T Consensus 86 ~~g~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~-~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DL 164 (346)
T COG2706 86 LPGSPPCYVSVDEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVV-KHTGSGPHERQESPHVHSANFTPDGRYLVVPDL 164 (346)
T ss_pred cCCCCCeEEEECCCCCEEEEEEccCceEEEEEcccCCccccceeee-ecCCCCCCccccCCccceeeeCCCCCEEEEeec
Confidence 44555589999999999999999999998877644332 221222 25555 778899999998887654
Q ss_pred -CCcEEEEEcCCCcEEEEEe----cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCcee--eEe-------
Q 000425 72 -DGLIRAYNIHTYAVHYTLQ----LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSM--IGI------- 136 (1525)
Q Consensus 72 -DgtIrLWDl~t~k~l~tL~----H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~--i~i------- 136 (1525)
--.|.+|++..++....-. ....-+ .+.|||++++..+.. -+++|-+|.++.....+ ++.
T Consensus 165 G~Dri~~y~~~dg~L~~~~~~~v~~G~GPR-----Hi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~d 239 (346)
T COG2706 165 GTDRIFLYDLDDGKLTPADPAEVKPGAGPR-----HIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPED 239 (346)
T ss_pred CCceEEEEEcccCccccccccccCCCCCcc-----eEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccc
Confidence 2358999998775543222 334556 899999999988776 89999999998631111 111
Q ss_pred ecccCcCeEEEEEcCCCCEEEEEECC-CCeeEEEee
Q 000425 137 IQVGSQPITSVAWLPMLRLLVTLCRD-GSLQVWKTR 171 (1525)
Q Consensus 137 l~gH~~~VtsVafsPdg~~LaSgS~D-gtIkVWDlr 171 (1525)
+.+ ......|..+|||++|..+-.- ..|.++.+.
T Consensus 240 F~g-~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~ 274 (346)
T COG2706 240 FTG-TNWAAAIHISPDGRFLYASNRGHDSIAVFSVD 274 (346)
T ss_pred cCC-CCceeEEEECCCCCEEEEecCCCCeEEEEEEc
Confidence 111 3345678899999999887542 255555543
No 281
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=97.23 E-value=0.00067 Score=80.36 Aligned_cols=75 Identities=17% Similarity=0.213 Sum_probs=65.3
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
+|-.-+..|+|+||+++++++..|..||+-.+..-..+.+|- |...|. .++.-++ ..|+|||.|+++++||+
T Consensus 149 GhvSml~dVavS~D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS-----~isl~~~-~~LlS~sGD~tlr~Wd~ 222 (390)
T KOG3914|consen 149 GHVSMLLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVS-----TISLTDN-YLLLSGSGDKTLRLWDI 222 (390)
T ss_pred hhhhhhheeeecCCCCEEEEecCCceEEEEecCcccchhhhccccHhhee-----eeeeccC-ceeeecCCCCcEEEEec
Confidence 488899999999999999999999999998877666677766 999999 8887654 55899999999999999
Q ss_pred CCC
Q 000425 127 SIE 129 (1525)
Q Consensus 127 ~t~ 129 (1525)
.+.
T Consensus 223 ~sg 225 (390)
T KOG3914|consen 223 TSG 225 (390)
T ss_pred ccC
Confidence 964
No 282
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=97.20 E-value=0.011 Score=71.27 Aligned_cols=120 Identities=18% Similarity=0.166 Sum_probs=98.3
Q ss_pred EEccCCCCCeEEEEEcCCCCEEEEEECCC-cEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEE
Q 000425 45 KIKTDLKKPIVNLACHPRLPVLYVAYADG-LIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLA 123 (1525)
Q Consensus 45 klk~~H~~~V~sLaf~Pdg~~LaSGs~Dg-tIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrV 123 (1525)
.+.-.|.+.|+=..+.-+..-++.|..|| .+-|+|..+++..+.-..-..|. ++..+|+|++++.+.+...+.+
T Consensus 353 ~iqv~~~~~VrY~r~~~~~e~~vigt~dgD~l~iyd~~~~e~kr~e~~lg~I~-----av~vs~dGK~~vvaNdr~el~v 427 (668)
T COG4946 353 SIQVGKKGGVRYRRIQVDPEGDVIGTNDGDKLGIYDKDGGEVKRIEKDLGNIE-----AVKVSPDGKKVVVANDRFELWV 427 (668)
T ss_pred eEEcCCCCceEEEEEccCCcceEEeccCCceEEEEecCCceEEEeeCCccceE-----EEEEcCCCcEEEEEcCceEEEE
Confidence 34445888899888888888899999999 89999999887655544777888 9999999999999999999999
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCC----CeeEEEee
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDG----SLQVWKTR 171 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dg----tIkVWDlr 171 (1525)
.|+.+...+++ -+....-|+.+.|||++++|+-+--+| .|+++|+.
T Consensus 428 ididngnv~~i--dkS~~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~ 477 (668)
T COG4946 428 IDIDNGNVRLI--DKSEYGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMD 477 (668)
T ss_pred EEecCCCeeEe--cccccceeEEEEEcCCceeEEEecCcceeeeeEEEEecC
Confidence 99998655555 223456799999999999999887776 68888875
No 283
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=97.16 E-value=0.21 Score=61.45 Aligned_cols=111 Identities=16% Similarity=0.221 Sum_probs=74.4
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCC----------
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRG---------- 119 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg---------- 119 (1525)
-..++..+.|+|.|.+|++....+ |.+|+-..+..+..+.|. .|. .+.|||.++||++-++..
T Consensus 31 ~~~p~~~~~~SP~G~~l~~~~~~~-V~~~~g~~~~~l~~~~~~-~V~-----~~~fSP~~kYL~tw~~~pi~~pe~e~sp 103 (561)
T COG5354 31 ENWPVAYVSESPLGTYLFSEHAAG-VECWGGPSKAKLVRFRHP-DVK-----YLDFSPNEKYLVTWSREPIIEPEIEISP 103 (561)
T ss_pred cCcchhheeecCcchheehhhccc-eEEccccchhheeeeecC-Cce-----ecccCcccceeeeeccCCccChhhccCC
Confidence 457899999999999999876654 899988777755555554 467 899999999999876433
Q ss_pred -----cEEEEeCCCCCceeeEeecccCcC--eE-EEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 120 -----TLLAWDVSIERPSMIGIIQVGSQP--IT-SVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 120 -----tVrVWDl~t~~~s~i~il~gH~~~--Vt-sVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
.+.|||..++ .++..+.+-..+ .. -+.|+-+..|.+=-. ...++|+++
T Consensus 104 ~~~~n~~~vwd~~sg--~iv~sf~~~~q~~~~Wp~~k~s~~D~y~ARvv-~~sl~i~e~ 159 (561)
T COG5354 104 FTSKNNVFVWDIASG--MIVFSFNGISQPYLGWPVLKFSIDDKYVARVV-GSSLYIHEI 159 (561)
T ss_pred ccccCceeEEeccCc--eeEeeccccCCcccccceeeeeecchhhhhhc-cCeEEEEec
Confidence 4889998864 122112222222 22 466777666554332 235777775
No 284
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=97.14 E-value=0.0052 Score=71.82 Aligned_cols=156 Identities=13% Similarity=0.196 Sum_probs=105.3
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccc--eEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEc-------
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAP--TKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNI------- 80 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~--tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl------- 80 (1525)
..+++++.-..+++++. ..++.-+...+.-... .+....|..+|..+-|+-...++++.+.|..+- |.-
T Consensus 72 ~~~~y~~e~~~L~vg~~-ngtvtefs~sedfnkm~~~r~~~~h~~~v~~~if~~~~e~V~s~~~dk~~~-~hc~e~~~~l 149 (404)
T KOG1409|consen 72 SAMEYVSESRRLYVGQD-NGTVTEFALSEDFNKMTFLKDYLAHQARVSAIVFSLTHEWVLSTGKDKQFA-WHCTESGNRL 149 (404)
T ss_pred eEeeeeccceEEEEEEe-cceEEEEEhhhhhhhcchhhhhhhhhcceeeEEecCCceeEEEeccccceE-EEeeccCCcc
Confidence 34555555555666653 4444333222221111 112224788888888877777777777664332 210
Q ss_pred -----------------------------------CCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEE
Q 000425 81 -----------------------------------HTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAW 124 (1525)
Q Consensus 81 -----------------------------------~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVW 124 (1525)
....++.++. |...+. +++|.+....+++|..|..|.+|
T Consensus 150 g~Y~~~~~~t~~~~d~~~~fvGd~~gqvt~lr~~~~~~~~i~~~~~h~~~~~-----~l~Wd~~~~~LfSg~~d~~vi~w 224 (404)
T KOG1409|consen 150 GGYNFETPASALQFDALYAFVGDHSGQITMLKLEQNGCQLITTFNGHTGEVT-----CLKWDPGQRLLFSGASDHSVIMW 224 (404)
T ss_pred cceEeeccCCCCceeeEEEEecccccceEEEEEeecCCceEEEEcCcccceE-----EEEEcCCCcEEEeccccCceEEE
Confidence 1123445666 778888 99999999999999999999999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeee
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVI 173 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~ 173 (1525)
|+--. +-..-.+.+|...|..+++-+.-+.++|++.||.|-+||.+..
T Consensus 225 digg~-~g~~~el~gh~~kV~~l~~~~~t~~l~S~~edg~i~~w~mn~~ 272 (404)
T KOG1409|consen 225 DIGGR-KGTAYELQGHNDKVQALSYAQHTRQLISCGEDGGIVVWNMNVK 272 (404)
T ss_pred eccCC-cceeeeeccchhhhhhhhhhhhheeeeeccCCCeEEEEeccce
Confidence 99842 2222346799999999999888899999999999999999843
No 285
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=97.09 E-value=0.029 Score=64.81 Aligned_cols=162 Identities=20% Similarity=0.303 Sum_probs=93.2
Q ss_pred EEEEcCCCCEEEEEEeccceEEEE-EeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE----
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVV-GTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV---- 85 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv-~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~---- 85 (1525)
|+++++++..+..--+....++-. |..........+..+-.-.-+-++|+||+.+||.+...|+|+++|+-...+
T Consensus 2 ~~~~~~~Gk~lAi~qd~~iEiRsa~Ddf~si~~kcqVpkD~~PQWRkl~WSpD~tlLa~a~S~G~i~vfdl~g~~lf~I~ 81 (282)
T PF15492_consen 2 HLALSSDGKLLAILQDQCIEIRSAKDDFSSIIGKCQVPKDPNPQWRKLAWSPDCTLLAYAESTGTIRVFDLMGSELFVIP 81 (282)
T ss_pred ceeecCCCcEEEEEeccEEEEEeccCCchheeEEEecCCCCCchheEEEECCCCcEEEEEcCCCeEEEEecccceeEEcC
Confidence 678888888776543322222111 001111111122222233467899999999999999999999999742100
Q ss_pred ------------E------------------EEEecC---C----------------ceee-----cCceeEEEeCCCCE
Q 000425 86 ------------H------------------YTLQLD---N----------------TIKL-----LGAGAFAFHPTLEW 111 (1525)
Q Consensus 86 ------------l------------------~tL~H~---~----------------~V~~-----~ga~sVaFSPdg~~ 111 (1525)
+ ..+.+. . .+.+ .|.+++.|+|..+.
T Consensus 82 p~~~~~~d~~~Aiagl~Fl~~~~s~~ws~ELlvi~Y~G~L~Sy~vs~gt~q~y~e~hsfsf~~~yp~Gi~~~vy~p~h~L 161 (282)
T PF15492_consen 82 PAMSFPGDLSDAIAGLIFLEYKKSAQWSYELLVINYRGQLRSYLVSVGTNQGYQENHSFSFSSHYPHGINSAVYHPKHRL 161 (282)
T ss_pred cccccCCccccceeeeEeeccccccccceeEEEEeccceeeeEEEEcccCCcceeeEEEEecccCCCceeEEEEcCCCCE
Confidence 0 000000 0 0000 13358889999888
Q ss_pred EEEEeC----CC-------cEEEEeCCCCCcee----------------------e--Eee---cccCcCeEEEEEcCCC
Q 000425 112 LFVGDR----RG-------TLLAWDVSIERPSM----------------------I--GII---QVGSQPITSVAWLPML 153 (1525)
Q Consensus 112 L~SgS~----Dg-------tVrVWDl~t~~~s~----------------------i--~il---~gH~~~VtsVafsPdg 153 (1525)
|+.||. |+ -+..|-+-+..+.. . ..+ ..-...|..++.||||
T Consensus 162 LlVgG~~~~~~~~s~a~~~GLtaWRiL~~~Pyyk~v~~~~~~~~~~~~~~~~~~~~~~~~fs~~~~~~d~i~kmSlSPdg 241 (282)
T PF15492_consen 162 LLVGGCEQNQDGMSKASSCGLTAWRILSDSPYYKQVTSSEDDITASSKRRGLLRIPSFKFFSRQGQEQDGIFKMSLSPDG 241 (282)
T ss_pred EEEeccCCCCCccccccccCceEEEEcCCCCcEEEccccCccccccccccceeeccceeeeeccccCCCceEEEEECCCC
Confidence 887762 11 25566543310000 0 011 1124568889999999
Q ss_pred CEEEEEECCCCeeEEEeee
Q 000425 154 RLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 154 ~~LaSgS~DgtIkVWDlr~ 172 (1525)
.+|++.+.+|.|.+|++-.
T Consensus 242 ~~La~ih~sG~lsLW~iPs 260 (282)
T PF15492_consen 242 SLLACIHFSGSLSLWEIPS 260 (282)
T ss_pred CEEEEEEcCCeEEEEecCc
Confidence 9999999999999999863
No 286
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=97.00 E-value=0.014 Score=66.60 Aligned_cols=107 Identities=10% Similarity=0.110 Sum_probs=77.4
Q ss_pred EEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE-eecccCc
Q 000425 65 VLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG-IIQVGSQ 142 (1525)
Q Consensus 65 ~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~-il~gH~~ 142 (1525)
-+..++.|.++++.++..+..-+.+. ..-.++ ++++++|++|+++.++...|-.|.+......... ....-+.
T Consensus 130 ~~~i~sndht~k~~~~~~~s~~~~~h~~~~~~n-----s~~~snd~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a~t~D 204 (344)
T KOG4532|consen 130 PLNIASNDHTGKTMVVSGDSNKFAVHNQNLTQN-----SLHYSNDPSWGSSVGDSRRVFRYAIDDESEYIENIYEAPTSD 204 (344)
T ss_pred ceeeccCCcceeEEEEecCcccceeecccccee-----eeEEcCCCceEEEecCCCcceEEEeCCccceeeeeEecccCC
Confidence 36677889999999887554333333 223377 9999999999999999999999977653111111 1112233
Q ss_pred CeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCC
Q 000425 143 PITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINP 176 (1525)
Q Consensus 143 ~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p 176 (1525)
.-.+.+|+.....+++++.||++.|||+|....|
T Consensus 205 ~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~~~tp 238 (344)
T KOG4532|consen 205 HGFYNSFSENDLQFAVVFQDGTCAIYDVRNMATP 238 (344)
T ss_pred CceeeeeccCcceEEEEecCCcEEEEEecccccc
Confidence 4468899999999999999999999999965543
No 287
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.99 E-value=0.025 Score=71.33 Aligned_cols=118 Identities=12% Similarity=0.059 Sum_probs=85.5
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE-EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY-TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~-tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
|...|.=-+++.++.+++.|+.-|.+.+|+-.+++... +.. ....+. .+..|++..+++.|+..|.|.|+-++
T Consensus 32 ~~~~v~lTc~dst~~~l~~GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~-----~~~vs~~e~lvAagt~~g~V~v~ql~ 106 (726)
T KOG3621|consen 32 FPARVKLTCVDATEEYLAMGSSAGSVYLYNRHTGEMRKLKNEGATGITC-----VRSVSSVEYLVAAGTASGRVSVFQLN 106 (726)
T ss_pred CcceEEEEEeecCCceEEEecccceEEEEecCchhhhcccccCccceEE-----EEEecchhHhhhhhcCCceEEeehhh
Confidence 44555545667778999999999999999977665432 222 223333 56778888888888899999999877
Q ss_pred CC--Cceee--EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 128 IE--RPSMI--GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 128 t~--~~s~i--~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
.. ...+. ..-..|...|+|++|++++..+.+|..-|+|..-.+..
T Consensus 107 ~~~p~~~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 107 KELPRDLDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred ccCCCcceeeccccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence 53 11111 11234889999999999999999999999988776653
No 288
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=96.96 E-value=0.0019 Score=73.11 Aligned_cols=72 Identities=26% Similarity=0.476 Sum_probs=61.1
Q ss_pred CCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc-EEEEEe-cCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEeCC
Q 000425 52 KPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA-VHYTLQ-LDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k-~l~tL~-H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWDl~ 127 (1525)
+.|.+++-||..+ .+++|+.||.+-+||.+... +...+. |+..+. -|-||| ++..|++++.||.+..||..
T Consensus 180 ~~v~~l~~hp~qq~~v~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~-----eV~FHpk~p~~Lft~sedGslw~wdas 254 (319)
T KOG4714|consen 180 DAVTALCSHPAQQHLVCCGTDDGIVGLWDARNVAMPVSLLKAHKAEIW-----EVHFHPKNPEHLFTCSEDGSLWHWDAS 254 (319)
T ss_pred ccchhhhCCcccccEEEEecCCCeEEEEEcccccchHHHHHHhhhhhh-----heeccCCCchheeEecCCCcEEEEcCC
Confidence 4599999999865 57889999999999998764 334455 999999 999999 56799999999999999988
Q ss_pred C
Q 000425 128 I 128 (1525)
Q Consensus 128 t 128 (1525)
+
T Consensus 255 ~ 255 (319)
T KOG4714|consen 255 T 255 (319)
T ss_pred C
Confidence 4
No 289
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.91 E-value=0.019 Score=71.12 Aligned_cols=99 Identities=12% Similarity=0.118 Sum_probs=87.9
Q ss_pred CEEEEEECCCcEEEEEcCCCcEEEEEe---cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeeccc
Q 000425 64 PVLYVAYADGLIRAYNIHTYAVHYTLQ---LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVG 140 (1525)
Q Consensus 64 ~~LaSGs~DgtIrLWDl~t~k~l~tL~---H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH 140 (1525)
..++-|-.-|.|-+|+...++....+. |.+.|+ ++.++.+...|.|++.|..+..|+... ...+.++.+.
T Consensus 71 ~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~-----~~~~~~~~~ciyS~~ad~~v~~~~~~~--~~~~~~~~~~ 143 (541)
T KOG4547|consen 71 SMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVN-----EILDAQRLGCIYSVGADLKVVYILEKE--KVIIRIWKEQ 143 (541)
T ss_pred eEEEeecCCccEEEEEecCCeEEEEEecCCCCCcce-----eeecccccCceEecCCceeEEEEeccc--ceeeeeeccC
Confidence 367778889999999999999998888 999999 999999999999999999999999994 4555567788
Q ss_pred CcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 141 SQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 141 ~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
...+.+++.+||+..++++| ++|++||+.
T Consensus 144 ~~~~~sl~is~D~~~l~~as--~~ik~~~~~ 172 (541)
T KOG4547|consen 144 KPLVSSLCISPDGKILLTAS--RQIKVLDIE 172 (541)
T ss_pred CCccceEEEcCCCCEEEecc--ceEEEEEcc
Confidence 88899999999999999887 489999997
No 290
>PRK02888 nitrous-oxide reductase; Validated
Probab=96.90 E-value=0.019 Score=72.99 Aligned_cols=162 Identities=11% Similarity=0.051 Sum_probs=108.3
Q ss_pred ccccCCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC----------
Q 000425 2 ESISVDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA---------- 71 (1525)
Q Consensus 2 ~~~~~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~---------- 71 (1525)
+-+=..+|+.+=+.|++..+.....-+..+.+++..+. .....+.- .+.-..++++|++.++++.+.
T Consensus 188 ~yv~~~~e~~~PlpnDGk~l~~~~ey~~~vSvID~etm-eV~~qV~V--dgnpd~v~~spdGk~afvTsyNsE~G~tl~e 264 (635)
T PRK02888 188 GYVFCNGEFRIPLPNDGKDLDDPKKYRSLFTAVDAETM-EVAWQVMV--DGNLDNVDTDYDGKYAFSTCYNSEEGVTLAE 264 (635)
T ss_pred cEEEeCcccccccCCCCCEeecccceeEEEEEEECccc-eEEEEEEe--CCCcccceECCCCCEEEEeccCcccCcceee
Confidence 44556788888899998777655544555556654322 22222222 234456788888888777652
Q ss_pred ------------------------------CCcEEEEEcCC-----CcEEEEEecCCceeecCceeEEEeCCCCEEEEEe
Q 000425 72 ------------------------------DGLIRAYNIHT-----YAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 72 ------------------------------DgtIrLWDl~t-----~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS 116 (1525)
+++|.+.|..+ .+.+..+.-.+... .++++|||+++++++
T Consensus 265 m~a~e~d~~vvfni~~iea~vkdGK~~~V~gn~V~VID~~t~~~~~~~v~~yIPVGKsPH-----GV~vSPDGkylyVan 339 (635)
T PRK02888 265 MMAAERDWVVVFNIARIEEAVKAGKFKTIGGSKVPVVDGRKAANAGSALTRYVPVPKNPH-----GVNTSPDGKYFIANG 339 (635)
T ss_pred eccccCceEEEEchHHHHHhhhCCCEEEECCCEEEEEECCccccCCcceEEEEECCCCcc-----ceEECCCCCEEEEeC
Confidence 24577777766 34555555455556 899999999988777
Q ss_pred -CCCcEEEEeCCCCC----------ceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 117 -RRGTLLAWDVSIER----------PSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 117 -~DgtVrVWDl~t~~----------~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
.+.+|.|.|+.+.. ..+.+...-...+ ...+|.++|....|...|..|-.||+..
T Consensus 340 klS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlGP-LHTaFDg~G~aytslf~dsqv~kwn~~~ 405 (635)
T PRK02888 340 KLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLGP-LHTAFDGRGNAYTTLFLDSQIVKWNIEA 405 (635)
T ss_pred CCCCcEEEEEChhhhhhhhccCCccceEEEeeccCCCc-ceEEECCCCCEEEeEeecceeEEEehHH
Confidence 79999999998631 1122222222233 4578999999999999999999999874
No 291
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=96.75 E-value=0.047 Score=62.45 Aligned_cols=117 Identities=13% Similarity=-0.041 Sum_probs=85.8
Q ss_pred eEEEEEcCCCCEEEEEECCCcEEEEEcCCC--cEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC
Q 000425 54 IVNLACHPRLPVLYVAYADGLIRAYNIHTY--AVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 54 V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~--k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
+.++++++++.+.++-++...|..|.+... ....+.. ..+.-. +..|+.....++++..||++.|||++..
T Consensus 161 ~ns~~~snd~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a~t~D~gF-----~~S~s~~~~~FAv~~Qdg~~~I~DVR~~ 235 (344)
T KOG4532|consen 161 QNSLHYSNDPSWGSSVGDSRRVFRYAIDDESEYIENIYEAPTSDHGF-----YNSFSENDLQFAVVFQDGTCAIYDVRNM 235 (344)
T ss_pred eeeeEEcCCCceEEEecCCCcceEEEeCCccceeeeeEecccCCCce-----eeeeccCcceEEEEecCCcEEEEEeccc
Confidence 788999999999999999999999987643 2333222 444445 8899999999999999999999999963
Q ss_pred Cc---eeeEeecccCcCeEEEEEcCCCC--EEEEEECCCCeeEEEeeeecC
Q 000425 130 RP---SMIGIIQVGSQPITSVAWLPMLR--LLVTLCRDGSLQVWKTRVIIN 175 (1525)
Q Consensus 130 ~~---s~i~il~gH~~~VtsVafsPdg~--~LaSgS~DgtIkVWDlr~~~~ 175 (1525)
.. ........|.+.++.+.|++-|. +|+-.-.=+.+.+-|+|...+
T Consensus 236 ~tpm~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~~hv~D~R~~~~ 286 (344)
T KOG4532|consen 236 ATPMAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSRVHVVDTRNYVN 286 (344)
T ss_pred ccchhhhcccCCCCCCceEEEEecCCCcceEEEEecCcceEEEEEcccCce
Confidence 11 11123346899999999998664 333333345788899985443
No 292
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=96.70 E-value=0.0097 Score=72.94 Aligned_cols=136 Identities=13% Similarity=0.173 Sum_probs=94.1
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-----------CCcEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-----------DGLIR 76 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-----------DgtIr 76 (1525)
||-++.++|.+.++ +++|. ..|.+|+--.-.+. . +.-|. .|.-+.|+|..++|++=+. -..++
T Consensus 212 Tetyv~wSP~GTYL-~t~Hk-~GI~lWGG~~f~r~-~--RF~Hp-~Vq~idfSP~EkYLVT~s~~p~~~~~~d~e~~~l~ 285 (698)
T KOG2314|consen 212 TETYVRWSPKGTYL-VTFHK-QGIALWGGESFDRI-Q--RFYHP-GVQFIDFSPNEKYLVTYSPEPIIVEEDDNEGQQLI 285 (698)
T ss_pred eeeeEEecCCceEE-EEEec-cceeeecCccHHHH-H--hccCC-CceeeecCCccceEEEecCCccccCcccCCCceEE
Confidence 78899999999997 68885 56778873211111 1 11243 4888999999999998553 25789
Q ss_pred EEEcCCCcEEEEEe-c-CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc---cCcCeEEEEEcC
Q 000425 77 AYNIHTYAVHYTLQ-L-DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV---GSQPITSVAWLP 151 (1525)
Q Consensus 77 LWDl~t~k~l~tL~-H-~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g---H~~~VtsVafsP 151 (1525)
|||+.+|...++|. . ...+. +--+.||.|++|++.-.. .+|.|++..+. - .+++ .-..|...+|+|
T Consensus 286 IWDI~tG~lkrsF~~~~~~~~~---WP~frWS~DdKy~Arm~~-~sisIyEtpsf--~---lld~Kslki~gIr~FswsP 356 (698)
T KOG2314|consen 286 IWDIATGLLKRSFPVIKSPYLK---WPIFRWSHDDKYFARMTG-NSISIYETPSF--M---LLDKKSLKISGIRDFSWSP 356 (698)
T ss_pred EEEccccchhcceeccCCCccc---cceEEeccCCceeEEecc-ceEEEEecCce--e---eecccccCCccccCcccCC
Confidence 99999999888887 2 22111 124699999999987665 57889987752 0 1111 245678899999
Q ss_pred CCCEEEE
Q 000425 152 MLRLLVT 158 (1525)
Q Consensus 152 dg~~LaS 158 (1525)
.+.+|+-
T Consensus 357 ~~~llAY 363 (698)
T KOG2314|consen 357 TSNLLAY 363 (698)
T ss_pred CcceEEE
Confidence 9666553
No 293
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=96.62 E-value=0.0094 Score=68.73 Aligned_cols=121 Identities=14% Similarity=0.254 Sum_probs=83.7
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE---Ee-cC-----CceeecCceeEEEeCCC-CEEEEEeCC
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT---LQ-LD-----NTIKLLGAGAFAFHPTL-EWLFVGDRR 118 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t---L~-H~-----~~V~~~ga~sVaFSPdg-~~L~SgS~D 118 (1525)
.|.--|.++.|+.+...++++ +|-.|.+|++.....-++ ++ |. .-|+ +-.|||.. ..+.-+++.
T Consensus 170 aH~yhiNSiS~NsD~et~lSa-DdLrINLWnl~i~D~sFnIVDiKP~nmeeLteVIt-----SaeFhp~~cn~fmYSsSk 243 (460)
T COG5170 170 AHPYHINSISFNSDKETLLSA-DDLRINLWNLEIIDGSFNIVDIKPHNMEELTEVIT-----SAEFHPEMCNVFMYSSSK 243 (460)
T ss_pred cceeEeeeeeecCchheeeec-cceeeeeccccccCCceEEEeccCccHHHHHHHHh-----hcccCHhHcceEEEecCC
Confidence 366668899999988887775 678899999765433232 22 32 2355 88999954 566667789
Q ss_pred CcEEEEeCCCC-----Cceee---------EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeeeecCC
Q 000425 119 GTLLAWDVSIE-----RPSMI---------GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRVIINP 176 (1525)
Q Consensus 119 gtVrVWDl~t~-----~~s~i---------~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~~~~p 176 (1525)
|+|++-|++.. ..... ..+.+-.+.|..+.|+++|+|+++-+. -+|++||++...+|
T Consensus 244 G~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdy-ltvkiwDvnm~k~p 314 (460)
T COG5170 244 GEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVNMAKNP 314 (460)
T ss_pred CcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEecc-ceEEEEecccccCC
Confidence 99999999842 11110 112233456788999999999987543 48999999966554
No 294
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=96.62 E-value=0.054 Score=66.32 Aligned_cols=150 Identities=16% Similarity=0.283 Sum_probs=101.0
Q ss_pred EEEEEEcCCCCEEEEEEeccc----------eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEE--EECCCcEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRM----------SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYV--AYADGLIR 76 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~----------sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaS--Gs~DgtIr 76 (1525)
.+.|-+++.+.++++-...+- ++.+++..+. .....+ +.+++|-..+|.|+++.|++ |..+-++.
T Consensus 225 ~~qLkW~~~g~~ll~l~~t~~ksnKsyfgesnLyl~~~~e~--~i~V~~-~~~~pVhdf~W~p~S~~F~vi~g~~pa~~s 301 (561)
T COG5354 225 GVQLKWQVLGKYLLVLVMTHTKSNKSYFGESNLYLLRITER--SIPVEK-DLKDPVHDFTWEPLSSRFAVISGYMPASVS 301 (561)
T ss_pred ccEEEEecCCceEEEEEEEeeecccceeccceEEEEeeccc--ccceec-cccccceeeeecccCCceeEEeccccccee
Confidence 356788888888876543322 1223332221 111222 46899999999999987665 45788999
Q ss_pred EEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC---CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC
Q 000425 77 AYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR---RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML 153 (1525)
Q Consensus 77 LWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~---DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg 153 (1525)
++|++.. +.+.+ ....-+ .+-|+|.++|++.++- .|.|-+||... +..+++.+.+ ...+-+.|+||+
T Consensus 302 ~~~lr~N-l~~~~-Pe~~rN-----T~~fsp~~r~il~agF~nl~gni~i~~~~~-rf~~~~~~~~--~n~s~~~wspd~ 371 (561)
T COG5354 302 VFDLRGN-LRFYF-PEQKRN-----TIFFSPHERYILFAGFDNLQGNIEIFDPAG-RFKVAGAFNG--LNTSYCDWSPDG 371 (561)
T ss_pred ecccccc-eEEec-CCcccc-----cccccCcccEEEEecCCccccceEEeccCC-ceEEEEEeec--CCceEeeccCCc
Confidence 9999876 33333 333344 6789999999999874 47799999884 3445433433 233457899999
Q ss_pred CEEEEEE------CCCCeeEEEee
Q 000425 154 RLLVTLC------RDGSLQVWKTR 171 (1525)
Q Consensus 154 ~~LaSgS------~DgtIkVWDlr 171 (1525)
.++.+.. .|..|+|||+-
T Consensus 372 qF~~~~~ts~k~~~Dn~i~l~~v~ 395 (561)
T COG5354 372 QFYDTDTTSEKLRVDNSIKLWDVY 395 (561)
T ss_pred eEEEecCCCcccccCcceEEEEec
Confidence 9887763 47799999985
No 295
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=96.59 E-value=0.016 Score=71.48 Aligned_cols=91 Identities=14% Similarity=0.317 Sum_probs=68.1
Q ss_pred CCCeEEEEEcCCCCEEEEEECC---CcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeC------CCc
Q 000425 51 KKPIVNLACHPRLPVLYVAYAD---GLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDR------RGT 120 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~D---gtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~------Dgt 120 (1525)
+++=+++-|+|.|++++.++-+ |.|-+||..+.+++..+. ....+ ++|+|||+||+++.. |+.
T Consensus 311 egpRN~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n~K~i~~~~a~~tt~-------~eW~PdGe~flTATTaPRlrvdNg 383 (566)
T KOG2315|consen 311 EGPRNTAFFNPHGNIILLAGFGNLPGDMEVWDVPNRKLIAKFKAANTTV-------FEWSPDGEYFLTATTAPRLRVDNG 383 (566)
T ss_pred CCCccceEECCCCCEEEEeecCCCCCceEEEeccchhhccccccCCceE-------EEEcCCCcEEEEEeccccEEecCC
Confidence 4666778999999998887764 889999999999998888 44443 599999999998873 889
Q ss_pred EEEEeCCCCCceeeEeecccCc-CeEEEEEcCCC
Q 000425 121 LLAWDVSIERPSMIGIIQVGSQ-PITSVAWLPML 153 (1525)
Q Consensus 121 VrVWDl~t~~~s~i~il~gH~~-~VtsVafsPdg 153 (1525)
++||++.. .+. ...|-. ....+.|-|..
T Consensus 384 ~KiwhytG---~~l--~~~~f~sEL~qv~W~P~~ 412 (566)
T KOG2315|consen 384 IKIWHYTG---SLL--HEKMFKSELLQVEWRPFN 412 (566)
T ss_pred eEEEEecC---cee--ehhhhhHhHhheeeeecC
Confidence 99999983 222 111111 35568887743
No 296
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.57 E-value=0.071 Score=65.99 Aligned_cols=149 Identities=7% Similarity=-0.070 Sum_probs=87.2
Q ss_pred cEEEEEEcCCCCEEEEEEecc--ceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CC--cEEEEEcCC
Q 000425 8 TEVHLALTPLQPVVFFGFHRR--MSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DG--LIRAYNIHT 82 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d--~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-Dg--tIrLWDl~t 82 (1525)
.....+|+|++..+++..... ..+.+++. .++ ....+.. +........|+|||+.++-.+. .+ .|.++|+.+
T Consensus 234 ~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl-~~g-~~~~LT~-~~~~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~ 310 (419)
T PRK04043 234 MLVVSDVSKDGSKLLLTMAPKGQPDIYLYDT-NTK-TLTQITN-YPGIDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNS 310 (419)
T ss_pred cEEeeEECCCCCEEEEEEccCCCcEEEEEEC-CCC-cEEEccc-CCCccCccEECCCCCEEEEEECCCCCceEEEEECCC
Confidence 355688999999888877554 34444443 232 2333332 3333345689999987655443 33 577778877
Q ss_pred CcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCC---------CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC
Q 000425 83 YAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRR---------GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML 153 (1525)
Q Consensus 83 ~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~D---------gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg 153 (1525)
++......... . ...|||||++|+..+.. ..|.+.|+.+.....+ ..+. ......|+|||
T Consensus 311 g~~~rlt~~g~--~-----~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~L---T~~~-~~~~p~~SPDG 379 (419)
T PRK04043 311 GSVEQVVFHGK--N-----NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRL---TANG-VNQFPRFSSDG 379 (419)
T ss_pred CCeEeCccCCC--c-----CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEEC---CCCC-CcCCeEECCCC
Confidence 76533222211 2 34899999998866633 2567778775422222 2222 22358899999
Q ss_pred CEEEEEECC-CCeeEEEe
Q 000425 154 RLLVTLCRD-GSLQVWKT 170 (1525)
Q Consensus 154 ~~LaSgS~D-gtIkVWDl 170 (1525)
+.|+-.+.+ +.-.+|-+
T Consensus 380 ~~I~f~~~~~~~~~L~~~ 397 (419)
T PRK04043 380 GSIMFIKYLGNQSALGII 397 (419)
T ss_pred CEEEEEEccCCcEEEEEE
Confidence 987766544 34344443
No 297
>PRK04043 tolB translocation protein TolB; Provisional
Probab=96.49 E-value=0.14 Score=63.37 Aligned_cols=140 Identities=9% Similarity=0.049 Sum_probs=85.4
Q ss_pred cEEEEEEcCCCCE-EEEEEec--cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEE-EECC--CcEEEEEcC
Q 000425 8 TEVHLALTPLQPV-VFFGFHR--RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYV-AYAD--GLIRAYNIH 81 (1525)
Q Consensus 8 ~e~~LA~hP~qp~-Vf~s~~~--d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaS-Gs~D--gtIrLWDl~ 81 (1525)
.....+++|++.. +++.... +..+.+++. .++. ...+.. ..+.+...+|+|||+.++. .+.+ ..|.++|+.
T Consensus 189 ~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl-~tg~-~~~lt~-~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~ 265 (419)
T PRK04043 189 LNIFPKWANKEQTAFYYTSYGERKPTLYKYNL-YTGK-KEKIAS-SQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTN 265 (419)
T ss_pred CeEeEEECCCCCcEEEEEEccCCCCEEEEEEC-CCCc-EEEEec-CCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECC
Confidence 4567889999874 6664443 455666664 3332 233333 4566777889999986554 3334 457777887
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCC--cEEEEeCCCCCc-eeeEeecccCcCeEEEEEcCCCCEE
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRG--TLLAWDVSIERP-SMIGIIQVGSQPITSVAWLPMLRLL 156 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dg--tVrVWDl~t~~~-s~i~il~gH~~~VtsVafsPdg~~L 156 (1525)
+++. ..+. +..... .-.|+|||+.|+-.+ ..+ .|.++|+.+... .+. ..+. . ...|||||++|
T Consensus 266 ~g~~-~~LT~~~~~d~-----~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt--~~g~--~--~~~~SPDG~~I 333 (419)
T PRK04043 266 TKTL-TQITNYPGIDV-----NGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVV--FHGK--N--NSSVSTYKNYI 333 (419)
T ss_pred CCcE-EEcccCCCccC-----ccEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCc--cCCC--c--CceECCCCCEE
Confidence 7764 3344 433233 568999999777655 334 566678775422 222 1121 1 24899999988
Q ss_pred EEEECC
Q 000425 157 VTLCRD 162 (1525)
Q Consensus 157 aSgS~D 162 (1525)
+-.+..
T Consensus 334 a~~~~~ 339 (419)
T PRK04043 334 VYSSRE 339 (419)
T ss_pred EEEEcC
Confidence 877654
No 298
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=96.46 E-value=0.0039 Score=78.74 Aligned_cols=107 Identities=21% Similarity=0.311 Sum_probs=84.1
Q ss_pred EEEcCCCCEEEEEEC----CCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC-Cc
Q 000425 57 LACHPRLPVLYVAYA----DGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE-RP 131 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~----DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~-~~ 131 (1525)
..|||..++|++++. .|.|-|| ..++++.......-.+. +++|||..-.|++|=.-|.+.||..++. ..
T Consensus 21 ~SWHPsePlfAVA~fS~er~GSVtIf-adtGEPqr~Vt~P~hat-----SLCWHpe~~vLa~gwe~g~~~v~~~~~~e~h 94 (1416)
T KOG3617|consen 21 SSWHPSEPLFAVASFSPERGGSVTIF-ADTGEPQRDVTYPVHAT-----SLCWHPEEFVLAQGWEMGVSDVQKTNTTETH 94 (1416)
T ss_pred cccCCCCceeEEEEecCCCCceEEEE-ecCCCCCcccccceehh-----hhccChHHHHHhhccccceeEEEecCCceee
Confidence 579999999999875 4778887 55666554443222234 8999998877888888899999998853 22
Q ss_pred eeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 132 SMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 132 s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
++ ...|..+|..+.|||+|..++|+..=|.|.+|...+
T Consensus 95 tv---~~th~a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d~ 132 (1416)
T KOG3617|consen 95 TV---VETHPAPIQGLDWSHDGTVLMTLDNPGSVHLWRYDV 132 (1416)
T ss_pred ee---ccCCCCCceeEEecCCCCeEEEcCCCceeEEEEeee
Confidence 32 357999999999999999999999999999998873
No 299
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.23 E-value=0.012 Score=71.48 Aligned_cols=143 Identities=13% Similarity=0.128 Sum_probs=97.7
Q ss_pred EEEEEeccceEEEEEeecCC------ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe---c
Q 000425 21 VFFGFHRRMSVTVVGTVEGG------RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ---L 91 (1525)
Q Consensus 21 Vf~s~~~d~sI~vv~tiegg------k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~---H 91 (1525)
-|.+..+|+++++|.....+ .+.+... .|+++|.++.|-.+.++++++ ||-|++||.--+.++..+. .
T Consensus 749 SFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~-aHkk~i~~igfL~~lr~i~Sc--D~giHlWDPFigr~Laq~~dapk 825 (1034)
T KOG4190|consen 749 SFISASKDKTVKLWSIKPEGDEIGTSACQFTYQ-AHKKPIHDIGFLADLRSIASC--DGGIHLWDPFIGRLLAQMEDAPK 825 (1034)
T ss_pred ceeeccCCceEEEEEeccccCccccceeeeEhh-hccCcccceeeeeccceeeec--cCcceeecccccchhHhhhcCcc
Confidence 35677789999999863221 1334444 499999999999888877754 7899999976666665444 1
Q ss_pred CCceeecCceeEEEeC--CCCEEEEE-eCCCcEEEEeCCCC--CceeeEee--cccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 92 DNTIKLLGAGAFAFHP--TLEWLFVG-DRRGTLLAWDVSIE--RPSMIGII--QVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 92 ~~~V~~~ga~sVaFSP--dg~~L~Sg-S~DgtVrVWDl~t~--~~s~i~il--~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
...+. -+.--+ +...+..| +...+|+++|.++. ...+. .. .+-+..+++++.-|.|++++.+-..|+
T Consensus 826 ~~a~~-----~ikcl~nv~~~iliAgcsaeSTVKl~DaRsce~~~E~k-Vcna~~Pna~~R~iaVa~~GN~lAa~LSnGc 899 (1034)
T KOG4190|consen 826 EGAGG-----NIKCLENVDRHILIAGCSAESTVKLFDARSCEWTCELK-VCNAPGPNALTRAIAVADKGNKLAAALSNGC 899 (1034)
T ss_pred cCCCc-----eeEecccCcchheeeeccchhhheeeecccccceeeEE-eccCCCCchheeEEEeccCcchhhHHhcCCc
Confidence 12222 222222 44444444 68899999999963 11111 11 234466899999999999999999999
Q ss_pred eeEEEeee
Q 000425 165 LQVWKTRV 172 (1525)
Q Consensus 165 IkVWDlr~ 172 (1525)
|.+-|.|.
T Consensus 900 i~~LDaR~ 907 (1034)
T KOG4190|consen 900 IAILDARN 907 (1034)
T ss_pred EEEEecCC
Confidence 99999984
No 300
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=96.12 E-value=0.012 Score=43.93 Aligned_cols=34 Identities=29% Similarity=0.617 Sum_probs=31.0
Q ss_pred eecccCcCeEEEEEcCCCCEEEEEECCCCeeEEE
Q 000425 136 IIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 136 il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWD 169 (1525)
.+..|...|+++.|++++.++++++.|+.+++||
T Consensus 7 ~~~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 7 TLKGHTGPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred EEEecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 3457889999999999999999999999999996
No 301
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=96.11 E-value=0.41 Score=59.81 Aligned_cols=329 Identities=13% Similarity=0.127 Sum_probs=157.1
Q ss_pred cccccccCCccEEEEEeeccCCcceEEEEEecccccccc--cC--CCCcccceeEEeCCCCCcEEEEeCCCCeEEEEEcC
Q 000425 406 PKHMVYSKRQQLFLVVYEFSGTTNEVVLYRENVDTQLAD--SK--SSTVKGRDAAFIGPNEDQFAILDDDKTGLALYILK 481 (1525)
Q Consensus 406 pq~l~hspngr~v~V~g~~~~~~~E~iiYt~~~~l~~rn--k~--G~~~~g~~~~fig~~~n~yAvle~~~~~v~I~~~~ 481 (1525)
.....++++++|.++-. ...+.||.. ..... +. . ...+-+|.|. -.++..+ ..|.+|...
T Consensus 71 g~~~vw~~~n~yAv~~~-----~~~I~I~kn---~~~~~~k~i~~~---~~~~~If~G~---LL~~~~~--~~i~~yDw~ 134 (443)
T PF04053_consen 71 GLSFVWSSRNRYAVLES-----SSTIKIYKN---FKNEVVKSIKLP---FSVEKIFGGN---LLGVKSS--DFICFYDWE 134 (443)
T ss_dssp -SEEEE-TSSEEEEE-T-----TS-EEEEET---TEE-TT-----S---S-EEEEE-SS---SEEEEET--TEEEEE-TT
T ss_pred eeEEEEecCccEEEEEC-----CCeEEEEEc---CccccceEEcCC---cccceEEcCc---EEEEECC--CCEEEEEhh
Confidence 34557777888555543 456778763 33333 21 2 1246788432 4555544 269999765
Q ss_pred CchhhhhhcccCCcccCCCccccccccccCCcccccccc-cceeeecCCcceeEEeecCCeEEEEeeccccccccccccc
Q 000425 482 GVTLQEAADENNGVVDHNQSTDTNVGSVQGPLQLMFESE-VDRIFSTPIESTLMFACDGDQIGMAKLVQGYRLSARAGHY 560 (1525)
Q Consensus 482 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v~~If~~pl~~~l~~~~~~~~i~~~~~~~~yd~~~~~~~~ 560 (1525)
.... +.. ++.. |..|+|.+.+..|.+++..+++++.++.+.+.-...++.
T Consensus 135 ~~~~--------------------i~~--------i~v~~vk~V~Ws~~g~~val~t~~~i~il~~~~~~~~~~~~~g~- 185 (443)
T PF04053_consen 135 TGKL--------------------IRR--------IDVSAVKYVIWSDDGELVALVTKDSIYILKYNLEAVAAIPEEGV- 185 (443)
T ss_dssp T--E--------------------EEE--------ESS-E-EEEEE-TTSSEEEEE-S-SEEEEEE-HHHHHHBTTTB--
T ss_pred Hcce--------------------eeE--------EecCCCcEEEEECCCCEEEEEeCCeEEEEEecchhcccccccCc-
Confidence 1100 112 2233 799999999888888988888888866554422111222
Q ss_pred cccccccceeeEe-e-ccceEEEEEEcccCCccEEEEEeeeeEEEEccchhhhhccccccCCCCCcceeeeeecCEEEEE
Q 000425 561 LQTKSEGKKSIKL-K-VTEVMLKVAWQETQRGYVAGVLTTQRVLIVSADLDILASSSTKFDKGLPSFRSLLWVGPALLFS 638 (1525)
Q Consensus 561 i~~~~e~~r~i~~-~-~~e~v~~V~Ws~~~~g~~vaI~t~~~i~i~~~~l~~l~~~~~~~~~~~~~IkSg~Wvg~~fIYt 638 (1525)
. ..+++ . ..+.|+...|-++ ++|++- ..+++= +-.|++-+..
T Consensus 186 ------e-~~f~~~~E~~~~IkSg~W~~d-------------~fiYtT---------------~~~lkY-l~~Ge~~~i~ 229 (443)
T PF04053_consen 186 ------E-DAFELIHEISERIKSGCWVED-------------CFIYTT---------------SNHLKY-LVNGETGIIA 229 (443)
T ss_dssp ------G-GGEEEEEEE-S--SEEEEETT-------------EEEEE----------------TTEEEE-EETTEEEEEE
T ss_pred ------h-hceEEEEEecceeEEEEEEcC-------------EEEEEc---------------CCeEEE-EEcCCcceEE
Confidence 1 12222 2 2567888999665 233321 014444 3347888888
Q ss_pred cccccEEEeeC-CceeeEEEecCCceEEEEE-ecC-----cEEEecCCCCCcccCCceeEEeeeccchhhH----HH---
Q 000425 639 TATAISVLGWD-GKVRNILSISMPNAVLVGA-LND-----RLLLANPTEINPRQKKGIEIKSCLVGLLEPL----LI--- 704 (1525)
Q Consensus 639 T~t~l~Yl~~~-G~~~~I~~l~~~~~yLlgy-l~~-----Rl~l~~~~~~~p~~dk~~~I~s~~l~llE~l----l~--- 704 (1525)
+...-.|+++= ++-+.+..+|.. .-+..| ++- +..++ +++.+ .+++.+ ++
T Consensus 230 ~ld~~~yllgy~~~~~~ly~~Dr~-~~v~~~~ld~~~~~fk~av~---------~~d~~------~v~~~i~~~~ll~~i 293 (443)
T PF04053_consen 230 HLDKPLYLLGYLPKENRLYLIDRD-GNVISYELDLSELEFKTAVL---------RGDFE------EVLRMIAASNLLPNI 293 (443)
T ss_dssp E-SS--EEEEEETTTTEEEEE-TT---EEEEE--HHHHHHHHHHH---------TT-HH------H-----HHHHTGGG-
T ss_pred EcCCceEEEEEEccCCEEEEEECC-CCEEEEEECHHHHHHHHHHH---------cCChh------hhhhhhhhhhhcccC
Confidence 87777777721 222456666643 111221 210 11111 11110 001100 00
Q ss_pred --h-hhhhhHHHHHhhhHHHHHHHHHhccCCCCCChhHHHHHHcCCCCcccchhhccccCCCCCc----chhhhHHhhcC
Q 000425 705 --G-FATMQQYFEQKLDLSEILYQITSRFDSLRITPRSLDILAKGPPVCGDLAVSLSQAGPQFTQ----VLRGIYAIKAL 777 (1525)
Q Consensus 705 --g-~a~~~~~~~~t~~lr~~l~~~~~~~~s~~i~~~~l~~L~~~~~~~~elAl~ls~~~~~~d~----~~rF~LAl~lg 777 (1525)
. -..+.++|++ . |-.+.+.+.- -+++.+=-|+.+ -|-.++|++++.. .+. +.-.++|++.|
T Consensus 294 ~~~~~~~i~~fL~~-~---G~~e~AL~~~----~D~~~rFeLAl~-lg~L~~A~~~a~~---~~~~~~W~~Lg~~AL~~g 361 (443)
T PF04053_consen 294 PKDQGQSIARFLEK-K---GYPELALQFV----TDPDHRFELALQ-LGNLDIALEIAKE---LDDPEKWKQLGDEALRQG 361 (443)
T ss_dssp -HHHHHHHHHHHHH-T---T-HHHHHHHS----S-HHHHHHHHHH-CT-HHHHHHHCCC---CSTHHHHHHHHHHHHHTT
T ss_pred ChhHHHHHHHHHHH-C---CCHHHHHhhc----CChHHHhHHHHh-cCCHHHHHHHHHh---cCcHHHHHHHHHHHHHcC
Confidence 0 0001111211 1 1112222211 223333334443 3478999999973 222 23478999999
Q ss_pred CHHHHHHHHHHhhcccCCCCCCC---CChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCC
Q 000425 778 RFSTALSVLKDEFLRSRDYPKCP---PTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLN 847 (1525)
Q Consensus 778 ~l~~A~~ia~~~~~~~~d~~~~~---~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd 847 (1525)
|++.|.+.+.... |+...- -.+....+.+.|++.|...|++.+|-.|+-.+||.+...-|...+|.
T Consensus 362 ~~~lAe~c~~k~~----d~~~L~lLy~~~g~~~~L~kl~~~a~~~~~~n~af~~~~~lgd~~~cv~lL~~~~~ 430 (443)
T PF04053_consen 362 NIELAEECYQKAK----DFSGLLLLYSSTGDREKLSKLAKIAEERGDINIAFQAALLLGDVEECVDLLIETGR 430 (443)
T ss_dssp BHHHHHHHHHHCT-----HHHHHHHHHHCT-HHHHHHHHHHHHHTT-HHHHHHHHHHHT-HHHHHHHHHHTT-
T ss_pred CHHHHHHHHHhhc----CccccHHHHHHhCCHHHHHHHHHHHHHccCHHHHHHHHHHcCCHHHHHHHHHHcCC
Confidence 9999999987642 211000 00112467788999999999999999999999999888888877775
No 302
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=96.01 E-value=0.027 Score=67.95 Aligned_cols=205 Identities=13% Similarity=0.103 Sum_probs=130.4
Q ss_pred CCCCEEEEEEeccceEEEEEeec-CCccceEEccCCCCCeEEEEEcCCCCEEEEEEC-CCcEEEEEcCCCcEEEEEe---
Q 000425 16 PLQPVVFFGFHRRMSVTVVGTVE-GGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA-DGLIRAYNIHTYAVHYTLQ--- 90 (1525)
Q Consensus 16 P~qp~Vf~s~~~d~sI~vv~tie-ggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~-DgtIrLWDl~t~k~l~tL~--- 90 (1525)
|+..-+.....-+.+++.|.-.. .+....+-...|-..|.+++.+-++-++.|.+. |..+|++|+.+.....-++
T Consensus 17 ~tka~fiiqASlDGh~KFWkKs~isGvEfVKhFraHL~~I~sl~~S~dg~L~~Sv~d~Dhs~KvfDvEn~DminmiKL~~ 96 (558)
T KOG0882|consen 17 PTKAKFIIQASLDGHKKFWKKSRISGVEFVKHFRAHLGVILSLAVSYDGWLFRSVEDPDHSVKVFDVENFDMINMIKLVD 96 (558)
T ss_pred eehhheEEeeecchhhhhcCCCCccceeehhhhHHHHHHHHhhhccccceeEeeccCcccceeEEEeeccchhhhccccc
Confidence 34333344444577777776543 233333334457788899999999999999888 9999999998776554444
Q ss_pred cCCceeecCceeEEEeCC-CC-EEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeE
Q 000425 91 LDNTIKLLGAGAFAFHPT-LE-WLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQV 167 (1525)
Q Consensus 91 H~~~V~~~ga~sVaFSPd-g~-~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkV 167 (1525)
....+ ++..++. .. .|+.+. .++.+-|+|-........-.-.-|.++|.++-+.+-+..++|....|.|.-
T Consensus 97 lPg~a------~wv~skGd~~s~IAVs~~~sg~i~VvD~~~d~~q~~~fkklH~sPV~~i~y~qa~Ds~vSiD~~gmVEy 170 (558)
T KOG0882|consen 97 LPGFA------EWVTSKGDKISLIAVSLFKSGKIFVVDGFGDFCQDGYFKKLHFSPVKKIRYNQAGDSAVSIDISGMVEY 170 (558)
T ss_pred CCCce------EEecCCCCeeeeEEeecccCCCcEEECCcCCcCccceecccccCceEEEEeeccccceeeccccceeEe
Confidence 12222 3333442 22 344333 889999999774322222234469999999999999999999988999999
Q ss_pred EEeee-ecCCCCCCCccccccCCCcccccccccccccCCcceecCC----CeeEEEEecCCCeEEEEecccccCCccccc
Q 000425 168 WKTRV-IINPNRPPMQANFFEPASIESIDIPRILSQQGGEAVYPLP----RVRALEVHPRLNLAVLLFANFTGGDNIKNR 242 (1525)
Q Consensus 168 WDlr~-~~~p~~~~~~a~~f~p~gVesldi~~iLs~~gG~~vy~l~----~V~~v~~HP~~Nlial~F~~~s~~Dd~~~k 242 (1525)
|.... ...|... .++ +.. .....|.++ ...+++|.|.++-++ +-++|.+++
T Consensus 171 Ws~e~~~qfPr~~---l~~-~~K--------------~eTdLy~f~K~Kt~pts~Efsp~g~qis------tl~~DrkVR 226 (558)
T KOG0882|consen 171 WSAEGPFQFPRTN---LNF-ELK--------------HETDLYGFPKAKTEPTSFEFSPDGAQIS------TLNPDRKVR 226 (558)
T ss_pred ecCCCcccCcccc---ccc-ccc--------------ccchhhcccccccCccceEEccccCccc------ccCcccEEE
Confidence 98762 1111110 000 110 000112211 235789999998887 777999999
Q ss_pred ccccccCc
Q 000425 243 AAYTREGR 250 (1525)
Q Consensus 243 ~~~~~eg~ 250 (1525)
.+....|+
T Consensus 227 ~F~~KtGk 234 (558)
T KOG0882|consen 227 GFVFKTGK 234 (558)
T ss_pred EEEeccch
Confidence 88877774
No 303
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=95.99 E-value=0.03 Score=72.05 Aligned_cols=140 Identities=15% Similarity=0.193 Sum_probs=100.9
Q ss_pred EEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC---------CCcEEEEEcCCCcEEEEEe
Q 000425 20 VVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA---------DGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~---------DgtIrLWDl~t~k~l~tL~ 90 (1525)
.++|+.+.+++|.+-+ ....+....+.. |++.|..++.+ |+.|++++. |..|+|||++..+.+.-+.
T Consensus 188 r~lf~G~t~G~V~LrD-~~s~~~iht~~a-Hs~siSDfDv~--GNlLitCG~S~R~~~l~~D~FvkVYDLRmmral~PI~ 263 (1118)
T KOG1275|consen 188 RNLFCGDTRGTVFLRD-PNSFETIHTFDA-HSGSISDFDVQ--GNLLITCGYSMRRYNLAMDPFVKVYDLRMMRALSPIQ 263 (1118)
T ss_pred cEEEeecccceEEeec-CCcCceeeeeec-cccceeeeecc--CCeEEEeecccccccccccchhhhhhhhhhhccCCcc
Confidence 4455556677776655 334455556666 99999877665 888888775 7789999999888877777
Q ss_pred -cCCceeecCceeEEEeCCC-CEEEEEeCCCcEEEEeCCCC-Cc--eeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 91 -LDNTIKLLGAGAFAFHPTL-EWLFVGDRRGTLLAWDVSIE-RP--SMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 91 -H~~~V~~~ga~sVaFSPdg-~~L~SgS~DgtVrVWDl~t~-~~--s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+-.+ . .++|+|.- ..++++|..|.+.+-|..+. .+ .+. -...-...+..+++++++..++-|..+|.|
T Consensus 264 ~~~~P-~-----flrf~Psl~t~~~V~S~sGq~q~vd~~~lsNP~~~~~-~v~p~~s~i~~fDiSsn~~alafgd~~g~v 336 (1118)
T KOG1275|consen 264 FPYGP-Q-----FLRFHPSLTTRLAVTSQSGQFQFVDTATLSNPPAGVK-MVNPNGSGISAFDISSNGDALAFGDHEGHV 336 (1118)
T ss_pred cccCc-h-----hhhhcccccceEEEEecccceeeccccccCCCcccee-EEccCCCcceeEEecCCCceEEEecccCcE
Confidence 5444 3 68999975 46778888999999884432 11 111 112234558999999999999999999999
Q ss_pred eEEEe
Q 000425 166 QVWKT 170 (1525)
Q Consensus 166 kVWDl 170 (1525)
.+|--
T Consensus 337 ~~wa~ 341 (1118)
T KOG1275|consen 337 NLWAD 341 (1118)
T ss_pred eeecC
Confidence 99973
No 304
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=95.91 E-value=0.02 Score=42.59 Aligned_cols=38 Identities=26% Similarity=0.449 Sum_probs=33.4
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD 125 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD 125 (1525)
+++...+. |...+. +++|++++..+++++.|+.+++||
T Consensus 2 ~~~~~~~~~~~~~i~-----~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 2 GELLKTLKGHTGPVT-----SVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred cEEEEEEEecCCcee-----EEEECCCCCEEEEecCCCeEEEcC
Confidence 35566777 888999 999999999999999999999996
No 305
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=95.85 E-value=0.024 Score=71.71 Aligned_cols=46 Identities=7% Similarity=0.117 Sum_probs=30.3
Q ss_pred chHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHHHHHHhhc
Q 000425 1346 RGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMARLSRHLG 1397 (1525)
Q Consensus 1346 e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~lELaaYfT 1397 (1525)
--+|++-.|--|-|=+++.+|||-| ++. ++.+ +-.++..|+-+-|.
T Consensus 714 pyeeik~~ILevne~vLse~~iqnL-ik~-lPe~----E~l~~L~e~Kaeye 759 (1102)
T KOG1924|consen 714 PYEEIKNVILEVNEDVLSESMIQNL-IKH-LPEQ----EQLNKLSELKAEYE 759 (1102)
T ss_pred CHHHHHHHHhhccHHHHHHHHHHHH-HHh-CCCH----HHHHHHHHHHHhcc
Confidence 4457888888899999999998877 333 5422 33346666655443
No 306
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=95.65 E-value=0.62 Score=56.54 Aligned_cols=106 Identities=15% Similarity=0.208 Sum_probs=70.3
Q ss_pred CCCEEEEEECCCcEEEEEcCCC-----cEEEEEe-c--CCceeecCceeEEEeCCCCEEEEEe----------CCCcEEE
Q 000425 62 RLPVLYVAYADGLIRAYNIHTY-----AVHYTLQ-L--DNTIKLLGAGAFAFHPTLEWLFVGD----------RRGTLLA 123 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~-----k~l~tL~-H--~~~V~~~ga~sVaFSPdg~~L~SgS----------~DgtVrV 123 (1525)
++.++.+... |+|.+.|+... +.+..+. . ....+-.|..-++++|+++.++... ..+.|.+
T Consensus 205 dg~~~~vs~e-G~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~V 283 (352)
T TIGR02658 205 SGRLVWPTYT-GKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFV 283 (352)
T ss_pred CCcEEEEecC-CeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEE
Confidence 6777766665 99999995443 3333333 1 1122323344699999999888743 2257999
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcCCCC-EEEEEE-CCCCeeEEEee
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLPMLR-LLVTLC-RDGSLQVWKTR 171 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsPdg~-~LaSgS-~DgtIkVWDlr 171 (1525)
+|..+. +.+..+.. ...+..++++||++ +|++.+ .+++|.+.|..
T Consensus 284 iD~~t~--kvi~~i~v-G~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~ 330 (352)
T TIGR02658 284 VDAKTG--KRLRKIEL-GHEIDSINVSQDAKPLLYALSTGDKTLYIFDAE 330 (352)
T ss_pred EECCCC--eEEEEEeC-CCceeeEEECCCCCeEEEEeCCCCCcEEEEECc
Confidence 999853 33322222 34788999999999 777665 67889999985
No 307
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=95.59 E-value=0.35 Score=58.62 Aligned_cols=108 Identities=10% Similarity=-0.010 Sum_probs=72.9
Q ss_pred CCCEEEEEEec----cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC----------CCcEEEEEcCC
Q 000425 17 LQPVVFFGFHR----RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA----------DGLIRAYNIHT 82 (1525)
Q Consensus 17 ~qp~Vf~s~~~----d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~----------DgtIrLWDl~t 82 (1525)
+..++|+.-.. ..++.+++.. ..+....+..+... +.+ ++|+++.+++++. +..|.+||..+
T Consensus 11 ~~~~v~V~d~~~~~~~~~v~ViD~~-~~~v~g~i~~G~~P--~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t 86 (352)
T TIGR02658 11 DARRVYVLDPGHFAATTQVYTIDGE-AGRVLGMTDGGFLP--NPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQT 86 (352)
T ss_pred CCCEEEEECCcccccCceEEEEECC-CCEEEEEEEccCCC--cee-ECCCCCEEEEEeccccccccCCCCCEEEEEECcc
Confidence 35567766543 2567777743 23333334332222 234 9999998777665 78999999999
Q ss_pred CcEEEEEe-cCCceeecC--ceeEEEeCCCCEEEEEe-C-CCcEEEEeCCC
Q 000425 83 YAVHYTLQ-LDNTIKLLG--AGAFAFHPTLEWLFVGD-R-RGTLLAWDVSI 128 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~g--a~sVaFSPdg~~L~SgS-~-DgtVrVWDl~t 128 (1525)
.+.+..+. ..++-.+.+ ...+++||||++++... + +..|.|.|+.+
T Consensus 87 ~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~ 137 (352)
T TIGR02658 87 HLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEG 137 (352)
T ss_pred CcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCC
Confidence 99999888 333221111 23678999999999877 4 89999999995
No 308
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=95.56 E-value=0.037 Score=70.77 Aligned_cols=77 Identities=13% Similarity=0.100 Sum_probs=62.1
Q ss_pred ccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCC-cEEE
Q 000425 47 KTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRG-TLLA 123 (1525)
Q Consensus 47 k~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~Dg-tVrV 123 (1525)
..+|...-+|++|+-+.+++++|+..|.|++|+..++....... |...|+ .|.=+.||..+++.+ +.. ..-+
T Consensus 1097 frd~~~~fTc~afs~~~~hL~vG~~~Geik~~nv~sG~~e~s~ncH~SavT-----~vePs~dgs~~Ltsss~S~PlsaL 1171 (1516)
T KOG1832|consen 1097 FRDETALFTCIAFSGGTNHLAVGSHAGEIKIFNVSSGSMEESVNCHQSAVT-----LVEPSVDGSTQLTSSSSSSPLSAL 1171 (1516)
T ss_pred hhccccceeeEEeecCCceEEeeeccceEEEEEccCccccccccccccccc-----cccccCCcceeeeeccccCchHHH
Confidence 44688889999999999999999999999999999998888888 999988 777777887766555 222 4556
Q ss_pred EeCCC
Q 000425 124 WDVSI 128 (1525)
Q Consensus 124 WDl~t 128 (1525)
|+..+
T Consensus 1172 W~~~s 1176 (1516)
T KOG1832|consen 1172 WDASS 1176 (1516)
T ss_pred hcccc
Confidence 76653
No 309
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=95.44 E-value=0.023 Score=72.20 Aligned_cols=108 Identities=18% Similarity=0.248 Sum_probs=80.0
Q ss_pred EEEEcCCCCEEEEEEec---cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEE
Q 000425 11 HLALTPLQPVVFFGFHR---RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHY 87 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~---d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~ 87 (1525)
-+.+||..|++.+++-+ ..+|.++- +.|-+.+-+. ..-.+.+++|||..-.|++|-.-|.+.+|...+.+...
T Consensus 20 i~SWHPsePlfAVA~fS~er~GSVtIfa--dtGEPqr~Vt--~P~hatSLCWHpe~~vLa~gwe~g~~~v~~~~~~e~ht 95 (1416)
T KOG3617|consen 20 ISSWHPSEPLFAVASFSPERGGSVTIFA--DTGEPQRDVT--YPVHATSLCWHPEEFVLAQGWEMGVSDVQKTNTTETHT 95 (1416)
T ss_pred ccccCCCCceeEEEEecCCCCceEEEEe--cCCCCCcccc--cceehhhhccChHHHHHhhccccceeEEEecCCceeee
Confidence 35799999987665433 44554432 2332322221 12335679999998899999999999999887765544
Q ss_pred EEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 88 TLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 88 tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
... |..++. .++|||+|..++++..-|.|..|.+.
T Consensus 96 v~~th~a~i~-----~l~wS~~G~~l~t~d~~g~v~lwr~d 131 (1416)
T KOG3617|consen 96 VVETHPAPIQ-----GLDWSHDGTVLMTLDNPGSVHLWRYD 131 (1416)
T ss_pred eccCCCCCce-----eEEecCCCCeEEEcCCCceeEEEEee
Confidence 444 999999 99999999999999999999999765
No 310
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=95.43 E-value=0.14 Score=66.40 Aligned_cols=105 Identities=18% Similarity=0.241 Sum_probs=80.9
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeC---------CCcEEEEeCCCCCc
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDR---------RGTLLAWDVSIERP 131 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~---------DgtVrVWDl~t~~~ 131 (1525)
+++++.+|...|+|.+-|..+.+.+++|. |.+.+. +|.-.|+.|+++|- |.-|+|||++..+.
T Consensus 186 Nnr~lf~G~t~G~V~LrD~~s~~~iht~~aHs~siS-------DfDv~GNlLitCG~S~R~~~l~~D~FvkVYDLRmmra 258 (1118)
T KOG1275|consen 186 NNRNLFCGDTRGTVFLRDPNSFETIHTFDAHSGSIS-------DFDVQGNLLITCGYSMRRYNLAMDPFVKVYDLRMMRA 258 (1118)
T ss_pred cCcEEEeecccceEEeecCCcCceeeeeecccccee-------eeeccCCeEEEeecccccccccccchhhhhhhhhhhc
Confidence 46889999999999999999999999999 999998 77788999998873 67799999996411
Q ss_pred eeeEeecccCcCeEEEEEcCCCC-EEEEEECCCCeeEEEeeeecCC
Q 000425 132 SMIGIIQVGSQPITSVAWLPMLR-LLVTLCRDGSLQVWKTRVIINP 176 (1525)
Q Consensus 132 s~i~il~gH~~~VtsVafsPdg~-~LaSgS~DgtIkVWDlr~~~~p 176 (1525)
+. -+.-|-.+ .-+.|||... .++.+|..|.+.+-|.-...+|
T Consensus 259 -l~-PI~~~~~P-~flrf~Psl~t~~~V~S~sGq~q~vd~~~lsNP 301 (1118)
T KOG1275|consen 259 -LS-PIQFPYGP-QFLRFHPSLTTRLAVTSQSGQFQFVDTATLSNP 301 (1118)
T ss_pred -cC-CcccccCc-hhhhhcccccceEEEEecccceeeccccccCCC
Confidence 11 01122222 4578999754 6778888999999886655553
No 311
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=95.08 E-value=0.079 Score=67.31 Aligned_cols=15 Identities=40% Similarity=0.702 Sum_probs=10.2
Q ss_pred cCChHHHHHHHHHHh
Q 000425 819 YGQFDSAKETFEVIA 833 (1525)
Q Consensus 819 ~G~~~lAeec~q~~~ 833 (1525)
.+++|-|-+||+.+.
T Consensus 378 r~emDd~~~~f~lL~ 392 (1102)
T KOG1924|consen 378 RAEMDDANEVFELLA 392 (1102)
T ss_pred hhhhccHHHHHHHHH
Confidence 466777777877654
No 312
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=95.07 E-value=1.6 Score=49.55 Aligned_cols=114 Identities=15% Similarity=0.096 Sum_probs=74.9
Q ss_pred CCeEEEEEcCCCCEEEEEECC--------CcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEE-EeCCCcEE
Q 000425 52 KPIVNLACHPRLPVLYVAYAD--------GLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFV-GDRRGTLL 122 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~D--------gtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~S-gS~DgtVr 122 (1525)
...+.++++|+|.+.++.... |.|..++.. ++...........+ .++|+||++.|+. -+..+.|.
T Consensus 86 ~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~pN-----Gi~~s~dg~~lyv~ds~~~~i~ 159 (246)
T PF08450_consen 86 NRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGFPN-----GIAFSPDGKTLYVADSFNGRIW 159 (246)
T ss_dssp EEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESSEE-----EEEEETTSSEEEEEETTTTEEE
T ss_pred CCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCccccc-----ceEECCcchheeecccccceeE
Confidence 456789999999977776654 446666666 45444444555567 9999999997764 45788888
Q ss_pred EEeCCCCCc-----eeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 123 AWDVSIERP-----SMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 123 VWDl~t~~~-----s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+|+..... .....+.......-.+++..+|++.++....+.|.++|.+
T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~ 213 (246)
T PF08450_consen 160 RFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD 213 (246)
T ss_dssp EEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETT
T ss_pred EEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCC
Confidence 888864211 1111111212236689999999999988888889888754
No 313
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=95.04 E-value=0.36 Score=64.56 Aligned_cols=159 Identities=13% Similarity=0.200 Sum_probs=97.7
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEe----------------------ecCCccceEEccC-----------CCC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGT----------------------VEGGRAPTKIKTD-----------LKK 52 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~t----------------------ieggk~~tklk~~-----------H~~ 52 (1525)
.++.+..+++|++.++.+... +.++.+... +..|+..+.+++. |.+
T Consensus 109 d~GI~aaswS~Dee~l~liT~-~~tll~mT~~f~~i~E~~L~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek 187 (1265)
T KOG1920|consen 109 DNGISAASWSPDEELLALITG-RQTLLFMTKDFEPIAEKPLDADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEK 187 (1265)
T ss_pred cCceEEEeecCCCcEEEEEeC-CcEEEEEeccccchhccccccccccccccceecccccceeeecchhhhcccccccccc
Confidence 358889999999998876653 344422111 1124444555420 111
Q ss_pred ---------CeEEEEEcCCCCEEEEEE-----CCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-
Q 000425 53 ---------PIVNLACHPRLPVLYVAY-----ADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD- 116 (1525)
Q Consensus 53 ---------~V~sLaf~Pdg~~LaSGs-----~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS- 116 (1525)
.=.+|.|--||.+||+.. ..++||+||-. +....+-+ ...--. +++|-|.|.++++-.
T Consensus 188 ~~~~~~~~~~~~~IsWRgDg~~fAVs~~~~~~~~RkirV~drE-g~Lns~se~~~~l~~-----~LsWkPsgs~iA~iq~ 261 (1265)
T KOG1920|consen 188 ALEQIEQDDHKTSISWRGDGEYFAVSFVESETGTRKIRVYDRE-GALNSTSEPVEGLQH-----SLSWKPSGSLIAAIQC 261 (1265)
T ss_pred cccchhhccCCceEEEccCCcEEEEEEEeccCCceeEEEeccc-chhhcccCccccccc-----ceeecCCCCeEeeeee
Confidence 123589999999999832 23899999976 43332222 222223 899999999998754
Q ss_pred --CCCcEEEEeCCCCC-ceeeEeecccCcCeEEEEEcCCCCEEEE---EECCCCeeEEEee
Q 000425 117 --RRGTLLAWDVSIER-PSMIGIIQVGSQPITSVAWLPMLRLLVT---LCRDGSLQVWKTR 171 (1525)
Q Consensus 117 --~DgtVrVWDl~t~~-~s~i~il~gH~~~VtsVafsPdg~~LaS---gS~DgtIkVWDlr 171 (1525)
+|+.|.++.-+... ..+.--+......|..++|+.++..|+. +.....|++|-+.
T Consensus 262 ~~sd~~IvffErNGL~hg~f~l~~p~de~~ve~L~Wns~sdiLAv~~~~~e~~~v~lwt~~ 322 (1265)
T KOG1920|consen 262 KTSDSDIVFFERNGLRHGEFVLPFPLDEKEVEELAWNSNSDILAVVTSNLENSLVQLWTTG 322 (1265)
T ss_pred cCCCCcEEEEecCCccccccccCCcccccchheeeecCCCCceeeeecccccceEEEEEec
Confidence 66789998766421 1111001112233889999999998887 4444459999875
No 314
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=94.90 E-value=1.1 Score=50.72 Aligned_cols=146 Identities=16% Similarity=0.090 Sum_probs=89.1
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
-++++|..-.++++-.....+..++... + ....+.. .. ..+++++..+..++++..++ +.++|..+++....+.
T Consensus 4 gp~~d~~~g~l~~~D~~~~~i~~~~~~~-~-~~~~~~~--~~-~~G~~~~~~~g~l~v~~~~~-~~~~d~~~g~~~~~~~ 77 (246)
T PF08450_consen 4 GPVWDPRDGRLYWVDIPGGRIYRVDPDT-G-EVEVIDL--PG-PNGMAFDRPDGRLYVADSGG-IAVVDPDTGKVTVLAD 77 (246)
T ss_dssp EEEEETTTTEEEEEETTTTEEEEEETTT-T-EEEEEES--SS-EEEEEEECTTSEEEEEETTC-EEEEETTTTEEEEEEE
T ss_pred ceEEECCCCEEEEEEcCCCEEEEEECCC-C-eEEEEec--CC-CceEEEEccCCEEEEEEcCc-eEEEecCCCcEEEEee
Confidence 4678886666777766677776665322 2 2222322 22 67788883335556666554 5666999886554444
Q ss_pred c------CCceeecCceeEEEeCCCCEEEEEeCC--------CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEE
Q 000425 91 L------DNTIKLLGAGAFAFHPTLEWLFVGDRR--------GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLL 156 (1525)
Q Consensus 91 H------~~~V~~~ga~sVaFSPdg~~L~SgS~D--------gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~L 156 (1525)
. ....+ .+++.|+|+..++.... +.|..++.. . .... ........+.++|+|+++.|
T Consensus 78 ~~~~~~~~~~~N-----D~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~--~~~~-~~~~~~~pNGi~~s~dg~~l 148 (246)
T PF08450_consen 78 LPDGGVPFNRPN-----DVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-G--KVTV-VADGLGFPNGIAFSPDGKTL 148 (246)
T ss_dssp EETTCSCTEEEE-----EEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-S--EEEE-EEEEESSEEEEEEETTSSEE
T ss_pred ccCCCcccCCCc-----eEEEcCCCCEEEEecCCCccccccccceEEECCC-C--eEEE-EecCcccccceEECCcchhe
Confidence 1 24566 99999999977766543 446666666 2 1111 11334567899999999865
Q ss_pred E-EEECCCCeeEEEee
Q 000425 157 V-TLCRDGSLQVWKTR 171 (1525)
Q Consensus 157 a-SgS~DgtIkVWDlr 171 (1525)
. +-+..+.|..+|+.
T Consensus 149 yv~ds~~~~i~~~~~~ 164 (246)
T PF08450_consen 149 YVADSFNGRIWRFDLD 164 (246)
T ss_dssp EEEETTTTEEEEEEEE
T ss_pred eecccccceeEEEecc
Confidence 4 66677777777764
No 315
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=94.83 E-value=0.98 Score=55.34 Aligned_cols=103 Identities=17% Similarity=0.126 Sum_probs=76.9
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCC----CcEEEE
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRR----GTLLAW 124 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~D----gtVrVW 124 (1525)
.-+.|.++..+|+|+.++++.....+-+.|+.++++...=+ ...-|. .++|||+++|++-+--+ ..|+++
T Consensus 400 ~lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idkS~~~lIt-----df~~~~nsr~iAYafP~gy~tq~Ikly 474 (668)
T COG4946 400 DLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDKSEYGLIT-----DFDWHPNSRWIAYAFPEGYYTQSIKLY 474 (668)
T ss_pred CccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecccccceeE-----EEEEcCCceeEEEecCcceeeeeEEEE
Confidence 45789999999999999999988999999999998754444 556788 99999999999987654 478999
Q ss_pred eCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 125 DVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 125 Dl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
|+... .+....+ .+..=.+=+|.|++++|.--|
T Consensus 475 dm~~~--Kiy~vTT-~ta~DfsPaFD~d~ryLYfLs 507 (668)
T COG4946 475 DMDGG--KIYDVTT-PTAYDFSPAFDPDGRYLYFLS 507 (668)
T ss_pred ecCCC--eEEEecC-CcccccCcccCCCCcEEEEEe
Confidence 99853 2221111 122224567899999876654
No 316
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=94.78 E-value=0.35 Score=61.92 Aligned_cols=148 Identities=16% Similarity=0.187 Sum_probs=99.0
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC------------CCEEEEEECCCcEEEE
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR------------LPVLYVAYADGLIRAY 78 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd------------g~~LaSGs~DgtIrLW 78 (1525)
.+.++|.+ .+.+|+| ..|.+++... -+....+.. |+..|+++.|.|- ...+|++...|.|-+|
T Consensus 20 A~Dw~~~G-LiAygsh--slV~VVDs~s-~q~iqsie~-h~s~V~~VrWap~~~p~~llS~~~~~lliAsaD~~GrIil~ 94 (1062)
T KOG1912|consen 20 AADWSPSG-LIAYGSH--SLVSVVDSRS-LQLIQSIEL-HQSAVTSVRWAPAPSPRDLLSPSSSQLLIASADISGRIILV 94 (1062)
T ss_pred ccccCccc-eEEEecC--ceEEEEehhh-hhhhhcccc-CccceeEEEeccCCCchhccCccccceeEEeccccCcEEEE
Confidence 45566665 3445554 4566666432 223334444 8999999999762 2257788888999999
Q ss_pred EcCCCcEEEEEe-cCCceeecCceeEEEeC---CC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcC-C
Q 000425 79 NIHTYAVHYTLQ-LDNTIKLLGAGAFAFHP---TL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP-M 152 (1525)
Q Consensus 79 Dl~t~k~l~tL~-H~~~V~~~ga~sVaFSP---dg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP-d 152 (1525)
|......+..++ |.+++. .++|-+ +. ..++.-....++-+||..++..-+. ++.......|+.+.| |
T Consensus 95 d~~~~s~~~~l~~~~~~~q-----dl~W~~~rd~Srd~LlaIh~ss~lvLwntdtG~k~Wk--~~ys~~iLs~f~~DPfd 167 (1062)
T KOG1912|consen 95 DFVLASVINWLSHSNDSVQ-----DLCWVPARDDSRDVLLAIHGSSTLVLWNTDTGEKFWK--YDYSHEILSCFRVDPFD 167 (1062)
T ss_pred EehhhhhhhhhcCCCcchh-----heeeeeccCcchheeEEecCCcEEEEEEccCCceeec--cccCCcceeeeeeCCCC
Confidence 999998888888 788888 888865 44 4566667888999999987522222 222234566788888 5
Q ss_pred CCEEEEEECCCCeeEEEe
Q 000425 153 LRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 153 g~~LaSgS~DgtIkVWDl 170 (1525)
.+.+...+..|.+-+-+.
T Consensus 168 ~rh~~~l~s~g~vl~~~~ 185 (1062)
T KOG1912|consen 168 SRHFCVLGSKGFVLSCKD 185 (1062)
T ss_pred cceEEEEccCceEEEEec
Confidence 555555555666655554
No 317
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=94.63 E-value=0.081 Score=61.40 Aligned_cols=116 Identities=14% Similarity=0.257 Sum_probs=81.7
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC-CcE----EEEEe-cC------------CceeecCceeEEEeCCC--
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT-YAV----HYTLQ-LD------------NTIKLLGAGAFAFHPTL-- 109 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t-~k~----l~tL~-H~------------~~V~~~ga~sVaFSPdg-- 109 (1525)
..+.|+++.|..+|.++++|...|.|.+|.-.. .-| ...|+ |. ..|+ .+.|..++
T Consensus 25 ead~ItaVefd~tg~YlatGDkgGRVvlfer~~s~~ceykf~teFQshe~EFDYLkSleieEKin-----~I~w~~~t~r 99 (460)
T COG5170 25 EADKITAVEFDETGLYLATGDKGGRVVLFEREKSYGCEYKFFTEFQSHELEFDYLKSLEIEEKIN-----AIEWFDDTGR 99 (460)
T ss_pred ccceeeEEEeccccceEeecCCCceEEEeecccccccchhhhhhhcccccchhhhhhccHHHHhh-----heeeecCCCc
Confidence 356799999999999999999999999997432 212 22233 33 2345 77776543
Q ss_pred CEEEEEeCCCcEEEEeCCCCC-----------------------------c------eee-----Ee-ecccCcCeEEEE
Q 000425 110 EWLFVGDRRGTLLAWDVSIER-----------------------------P------SMI-----GI-IQVGSQPITSVA 148 (1525)
Q Consensus 110 ~~L~SgS~DgtVrVWDl~t~~-----------------------------~------s~i-----~i-l~gH~~~VtsVa 148 (1525)
.+++..+.|++|++|-+.... + .++ .+ -.+|...|++++
T Consensus 100 ~hFLlstNdktiKlWKiyeknlk~va~nnls~~~~~~~~g~~~s~~~l~lprls~hd~iiaa~p~rvyaNaH~yhiNSiS 179 (460)
T COG5170 100 NHFLLSTNDKTIKLWKIYEKNLKVVAENNLSDSFHSPMGGPLTSTKELLLPRLSEHDEIIAAKPCRVYANAHPYHINSIS 179 (460)
T ss_pred ceEEEecCCceeeeeeeecccchhhhccccccccccccCCCcCCHHHhhcccccccceEEEeccceeccccceeEeeeee
Confidence 467778899999999764210 0 000 01 156888899999
Q ss_pred EcCCCCEEEEEECCCCeeEEEee
Q 000425 149 WLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 149 fsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
|+.|...++|+ +|-.|.+|++.
T Consensus 180 ~NsD~et~lSa-DdLrINLWnl~ 201 (460)
T COG5170 180 FNSDKETLLSA-DDLRINLWNLE 201 (460)
T ss_pred ecCchheeeec-cceeeeecccc
Confidence 99999888874 68899999876
No 318
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=94.51 E-value=0.13 Score=61.64 Aligned_cols=92 Identities=16% Similarity=0.195 Sum_probs=71.6
Q ss_pred EEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC
Q 000425 75 IRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM 152 (1525)
Q Consensus 75 IrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd 152 (1525)
|++.+..+.+...-+. |...|+ .++|||..+ .+..++.+.+|+|.|+++ ..++..+..| ..+.+++|.-|
T Consensus 175 v~~l~~~~fkssq~lp~~g~~Ir-----dlafSp~~~GLl~~asl~nkiki~dlet--~~~vssy~a~-~~~wSC~wDld 246 (463)
T KOG1645|consen 175 VQKLESHDFKSSQILPGEGSFIR-----DLAFSPFNEGLLGLASLGNKIKIMDLET--SCVVSSYIAY-NQIWSCCWDLD 246 (463)
T ss_pred eEEeccCCcchhhcccccchhhh-----hhccCccccceeeeeccCceEEEEeccc--ceeeeheecc-CCceeeeeccC
Confidence 6666666666655444 778899 999999887 678899999999999995 3344445566 78999999976
Q ss_pred C-CEEEEEECCCCeeEEEeeeec
Q 000425 153 L-RLLVTLCRDGSLQVWKTRVII 174 (1525)
Q Consensus 153 g-~~LaSgS~DgtIkVWDlr~~~ 174 (1525)
. ++|..|-..|.|.|+|+|...
T Consensus 247 e~h~IYaGl~nG~VlvyD~R~~~ 269 (463)
T KOG1645|consen 247 ERHVIYAGLQNGMVLVYDMRQPE 269 (463)
T ss_pred CcceeEEeccCceEEEEEccCCC
Confidence 4 577788889999999998543
No 319
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=94.40 E-value=1.3 Score=51.27 Aligned_cols=117 Identities=12% Similarity=0.183 Sum_probs=77.0
Q ss_pred CCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEEe-c-CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 50 LKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTLQ-L-DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL~-H-~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
..+.+++|+|+|+.. ++++....+.|..+|. +++.+..+. . ..-.- .|++.-++.++++.-.++.+.++++
T Consensus 20 ~~~e~SGLTy~pd~~tLfaV~d~~~~i~els~-~G~vlr~i~l~g~~D~E-----gI~y~g~~~~vl~~Er~~~L~~~~~ 93 (248)
T PF06977_consen 20 ILDELSGLTYNPDTGTLFAVQDEPGEIYELSL-DGKVLRRIPLDGFGDYE-----GITYLGNGRYVLSEERDQRLYIFTI 93 (248)
T ss_dssp --S-EEEEEEETTTTEEEEEETTTTEEEEEET-T--EEEEEE-SS-SSEE-----EEEE-STTEEEEEETTTTEEEEEEE
T ss_pred ccCCccccEEcCCCCeEEEEECCCCEEEEEcC-CCCEEEEEeCCCCCCce-----eEEEECCCEEEEEEcCCCcEEEEEE
Confidence 445599999999754 7788888888888886 478888887 3 35567 8899888887777768999999988
Q ss_pred CCC-----Cceee----EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 127 SIE-----RPSMI----GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 127 ~t~-----~~s~i----~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
... ..... +....++..+-.++|+|.+..|+.+-...-.+++.++.
T Consensus 94 ~~~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~ 148 (248)
T PF06977_consen 94 DDDTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNG 148 (248)
T ss_dssp ----TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEES
T ss_pred eccccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEcc
Confidence 431 11111 11223566789999999877777777777788888764
No 320
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=94.29 E-value=2.7 Score=56.81 Aligned_cols=199 Identities=21% Similarity=0.213 Sum_probs=117.3
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC--
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI-- 128 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t-- 128 (1525)
+.|.++.|.-+..-++.+..+|.|.+-|..+.... ... -++.|. +.+||||.++++....++++.+-+..-
T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G~iilvd~et~~~e-ivg~vd~GI~-----aaswS~Dee~l~liT~~~tll~mT~~f~~ 142 (1265)
T KOG1920|consen 69 DEIVSVQFLADTNSICVITALGDIILVDPETLELE-IVGNVDNGIS-----AASWSPDEELLALITGRQTLLFMTKDFEP 142 (1265)
T ss_pred cceEEEEEecccceEEEEecCCcEEEEccccccee-eeeeccCceE-----EEeecCCCcEEEEEeCCcEEEEEeccccc
Confidence 68999999999998888999999999987765332 222 577888 999999999999999888887765421
Q ss_pred --CC---------ceee--E------eec----------------------ccCcCeEEEEEcCCCCEEEEE-----ECC
Q 000425 129 --ER---------PSMI--G------IIQ----------------------VGSQPITSVAWLPMLRLLVTL-----CRD 162 (1525)
Q Consensus 129 --~~---------~s~i--~------il~----------------------gH~~~VtsVafsPdg~~LaSg-----S~D 162 (1525)
.. ..++ + .+. -|+ .=+.|+|--||.++++. ...
T Consensus 143 i~E~~L~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek~~~~~~~~~-~~~~IsWRgDg~~fAVs~~~~~~~~ 221 (1265)
T KOG1920|consen 143 IAEKPLDADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEKALEQIEQDD-HKTSISWRGDGEYFAVSFVESETGT 221 (1265)
T ss_pred hhccccccccccccccceecccccceeeecchhhhcccccccccccccchhhcc-CCceEEEccCCcEEEEEEEeccCCc
Confidence 00 0000 0 000 111 12459999999999883 333
Q ss_pred CCeeEEEeeeecCC----CCCCCccccccCCC--ccccc----cccc-cc----ccCCcce--ecCC--CeeEEEEecCC
Q 000425 163 GSLQVWKTRVIINP----NRPPMQANFFEPAS--IESID----IPRI-LS----QQGGEAV--YPLP--RVRALEVHPRL 223 (1525)
Q Consensus 163 gtIkVWDlr~~~~p----~~~~~~a~~f~p~g--Vesld----i~~i-Ls----~~gG~~v--y~l~--~V~~v~~HP~~ 223 (1525)
+.|+|||-...++. .......-.+-|.| +.++. -.++ .- ...|+-+ +|+. .+..++|.-+.
T Consensus 222 RkirV~drEg~Lns~se~~~~l~~~LsWkPsgs~iA~iq~~~sd~~IvffErNGL~hg~f~l~~p~de~~ve~L~Wns~s 301 (1265)
T KOG1920|consen 222 RKIRVYDREGALNSTSEPVEGLQHSLSWKPSGSLIAAIQCKTSDSDIVFFERNGLRHGEFVLPFPLDEKEVEELAWNSNS 301 (1265)
T ss_pred eeEEEecccchhhcccCcccccccceeecCCCCeEeeeeecCCCCcEEEEecCCccccccccCCcccccchheeeecCCC
Confidence 79999998722221 11000000111111 00000 0000 00 0112222 2221 26789999998
Q ss_pred CeEEEEecccccCCcccccccccccCceeeeeeeccc
Q 000425 224 NLAVLLFANFTGGDNIKNRAAYTREGRKQLFAVLQSA 260 (1525)
Q Consensus 224 Nlial~F~~~s~~Dd~~~k~~~~~eg~~q~~~tlq~~ 260 (1525)
.+.|++- +..+...++.|+.-.-+|.+-+.++-.
T Consensus 302 diLAv~~---~~~e~~~v~lwt~~NyhWYLKq~l~~~ 335 (1265)
T KOG1920|consen 302 DILAVVT---SNLENSLVQLWTTGNYHWYLKQELQFS 335 (1265)
T ss_pred Cceeeee---cccccceEEEEEecCeEEEEEEEEecc
Confidence 8888654 346666677887766666666555543
No 321
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=94.24 E-value=0.018 Score=71.90 Aligned_cols=156 Identities=12% Similarity=0.179 Sum_probs=91.8
Q ss_pred EEEEcCCCCEEEEEEec----cceEEEEEeecC---CccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC
Q 000425 11 HLALTPLQPVVFFGFHR----RMSVTVVGTVEG---GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY 83 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~----d~sI~vv~tieg---gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~ 83 (1525)
.+|+.+...-++++.++ +..+.+|+.-.+ .+....+.++....+.+++|--+.+++.+|...+.++++|++..
T Consensus 107 ~lAwneLDtn~LAagldkhrnds~~~Iwdi~s~ltvPke~~~fs~~~l~gqns~cwlrd~klvlaGm~sr~~~ifdlRqs 186 (783)
T KOG1008|consen 107 SLAWNELDTNHLAAGLDKHRNDSSLKIWDINSLLTVPKESPLFSSSTLDGQNSVCWLRDTKLVLAGMTSRSVHIFDLRQS 186 (783)
T ss_pred ccccccccHHHHHhhhhhhcccCCccceecccccCCCccccccccccccCccccccccCcchhhcccccchhhhhhhhhh
Confidence 35666654444444333 555667774332 11112222223345568888877889999999999999998732
Q ss_pred cE-EEEEecCCceeecCceeEEEeC-CCCEEEEEeCCCcEEEEe-CCCCCceeeEeecccC-----cCeEEEEEcCCCC-
Q 000425 84 AV-HYTLQLDNTIKLLGAGAFAFHP-TLEWLFVGDRRGTLLAWD-VSIERPSMIGIIQVGS-----QPITSVAWLPMLR- 154 (1525)
Q Consensus 84 k~-l~tL~H~~~V~~~ga~sVaFSP-dg~~L~SgS~DgtVrVWD-l~t~~~s~i~il~gH~-----~~VtsVafsPdg~- 154 (1525)
.. ...+ ..+.+. .+...| .+.|+.+- .|+.|-+|| .+.. .....+. .|. ..+..++|.|..+
T Consensus 187 ~~~~~sv-nTk~vq-----G~tVdp~~~nY~cs~-~dg~iAiwD~~rni-enpl~~i-~~~~N~~~~~l~~~aycPtrtg 257 (783)
T KOG1008|consen 187 LDSVSSV-NTKYVQ-----GITVDPFSPNYFCSN-SDGDIAIWDTYRNI-ENPLQII-LRNENKKPKQLFALAYCPTRTG 257 (783)
T ss_pred hhhhhhh-hhhhcc-----cceecCCCCCceecc-ccCceeeccchhhh-ccHHHHH-hhCCCCcccceeeEEeccCCcc
Confidence 11 1111 333445 566677 56666554 599999999 4431 1111111 122 2488999999643
Q ss_pred EEEEEECC-CCeeEEEeeeecC
Q 000425 155 LLVTLCRD-GSLQVWKTRVIIN 175 (1525)
Q Consensus 155 ~LaSgS~D-gtIkVWDlr~~~~ 175 (1525)
++++++.| ++|+.+|++....
T Consensus 258 lla~l~RdS~tIrlydi~~v~t 279 (783)
T KOG1008|consen 258 LLAVLSRDSITIRLYDICVVGT 279 (783)
T ss_pred hhhhhccCcceEEEecccccCC
Confidence 56677765 4899999985543
No 322
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=94.22 E-value=1.6 Score=44.77 Aligned_cols=99 Identities=16% Similarity=0.117 Sum_probs=65.7
Q ss_pred eEEEEEcC---CC-CEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC
Q 000425 54 IVNLACHP---RL-PVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 54 V~sLaf~P---dg-~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
|+++++.. +| +.|++||.|..||+|+-. ..+..+.....|. ++.-... ..++.+-.+|||-+|+-..
T Consensus 2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~~--e~~~Ei~e~~~v~-----~L~~~~~-~~F~Y~l~NGTVGvY~~~~- 72 (111)
T PF14783_consen 2 VTALCLFDFDGDGENELLVGSDDFEIRVFKGD--EIVAEITETDKVT-----SLCSLGG-GRFAYALANGTVGVYDRSQ- 72 (111)
T ss_pred eeEEEEEecCCCCcceEEEecCCcEEEEEeCC--cEEEEEecccceE-----EEEEcCC-CEEEEEecCCEEEEEeCcc-
Confidence 55655543 44 469999999999999644 6777777555666 6665554 5688899999999998763
Q ss_pred CceeeEeecccCcCeEEEEEcC-CC---CEEEEEECCCCe
Q 000425 130 RPSMIGIIQVGSQPITSVAWLP-ML---RLLVTLCRDGSL 165 (1525)
Q Consensus 130 ~~s~i~il~gH~~~VtsVafsP-dg---~~LaSgS~DgtI 165 (1525)
+...+...| .++++.+.. ++ +-|++|-.+|.|
T Consensus 73 --RlWRiKSK~--~~~~~~~~D~~gdG~~eLI~GwsnGkv 108 (111)
T PF14783_consen 73 --RLWRIKSKN--QVTSMAFYDINGDGVPELIVGWSNGKV 108 (111)
T ss_pred --eeeeeccCC--CeEEEEEEcCCCCCceEEEEEecCCeE
Confidence 222222232 356655543 32 368888778765
No 323
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=93.70 E-value=0.017 Score=72.13 Aligned_cols=112 Identities=13% Similarity=0.241 Sum_probs=80.2
Q ss_pred CCCCeEEEEEcCCCC-EEEEEE----CCCcEEEEEcCCC--cEEEEEe----cCCceeecCceeEEEeCCCCEEEEEeCC
Q 000425 50 LKKPIVNLACHPRLP-VLYVAY----ADGLIRAYNIHTY--AVHYTLQ----LDNTIKLLGAGAFAFHPTLEWLFVGDRR 118 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~-~LaSGs----~DgtIrLWDl~t~--k~l~tL~----H~~~V~~~ga~sVaFSPdg~~L~SgS~D 118 (1525)
|..+-++++|++-+. +||.|- .|..++|||+.+. .+..+-. ...... +++|-.+.+.+++|...
T Consensus 101 ~ar~Ct~lAwneLDtn~LAagldkhrnds~~~Iwdi~s~ltvPke~~~fs~~~l~gqn-----s~cwlrd~klvlaGm~s 175 (783)
T KOG1008|consen 101 YARPCTSLAWNELDTNHLAAGLDKHRNDSSLKIWDINSLLTVPKESPLFSSSTLDGQN-----SVCWLRDTKLVLAGMTS 175 (783)
T ss_pred ccccccccccccccHHHHHhhhhhhcccCCccceecccccCCCccccccccccccCcc-----ccccccCcchhhccccc
Confidence 667889999999764 555553 2678999999876 2222222 223344 88888888899999999
Q ss_pred CcEEEEeCCCCCceeeEeecccCcCeEEEEEcC-CCCEEEEEECCCCeeEEEe
Q 000425 119 GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLP-MLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 119 gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsP-dg~~LaSgS~DgtIkVWDl 170 (1525)
+.+.++|++.. +.......+..|..+..+| .+.|+++-+ |+.|.+||.
T Consensus 176 r~~~ifdlRqs---~~~~~svnTk~vqG~tVdp~~~nY~cs~~-dg~iAiwD~ 224 (783)
T KOG1008|consen 176 RSVHIFDLRQS---LDSVSSVNTKYVQGITVDPFSPNYFCSNS-DGDIAIWDT 224 (783)
T ss_pred chhhhhhhhhh---hhhhhhhhhhhcccceecCCCCCceeccc-cCceeeccc
Confidence 99999999832 1111112455677888999 788888866 999999993
No 324
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=93.64 E-value=0.18 Score=60.59 Aligned_cols=74 Identities=20% Similarity=0.209 Sum_probs=64.5
Q ss_pred CCCCeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCC
Q 000425 50 LKKPIVNLACHPRLP-VLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~ 127 (1525)
|...|+.++|+|... ++..++.+.+|+|.|+++..++.++.-.+.+. +++|.-|.. +|..|-..|.|.|||++
T Consensus 192 ~g~~IrdlafSp~~~GLl~~asl~nkiki~dlet~~~vssy~a~~~~w-----SC~wDlde~h~IYaGl~nG~VlvyD~R 266 (463)
T KOG1645|consen 192 EGSFIRDLAFSPFNEGLLGLASLGNKIKIMDLETSCVVSSYIAYNQIW-----SCCWDLDERHVIYAGLQNGMVLVYDMR 266 (463)
T ss_pred cchhhhhhccCccccceeeeeccCceEEEEecccceeeeheeccCCce-----eeeeccCCcceeEEeccCceEEEEEcc
Confidence 778899999999987 78899999999999999999999888337788 999988765 56677799999999999
Q ss_pred C
Q 000425 128 I 128 (1525)
Q Consensus 128 t 128 (1525)
.
T Consensus 267 ~ 267 (463)
T KOG1645|consen 267 Q 267 (463)
T ss_pred C
Confidence 6
No 325
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=93.46 E-value=0.29 Score=61.70 Aligned_cols=74 Identities=18% Similarity=0.324 Sum_probs=63.2
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
....|.-+.|+|.-+++|.+..+|.|-+..+. ++.+.++. |...++ .+++|.|||+.|+.|-.||+|++.|+.+
T Consensus 19 l~~~i~~~ewnP~~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~----~sL~W~~DGkllaVg~kdG~I~L~Dve~ 93 (665)
T KOG4640|consen 19 LPINIKRIEWNPKMDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVT----ASLCWRPDGKLLAVGFKDGTIRLHDVEK 93 (665)
T ss_pred cccceEEEEEcCccchhheeccCCcEEEEEec-cceeEeccCCCCccc----eeeeecCCCCEEEEEecCCeEEEEEccC
Confidence 34567779999999999999999999998887 77788888 654433 2899999999999999999999999995
No 326
>PRK02888 nitrous-oxide reductase; Validated
Probab=93.16 E-value=0.79 Score=58.86 Aligned_cols=104 Identities=8% Similarity=-0.008 Sum_probs=68.0
Q ss_pred CCCeEEEEEcCCCCEEEE-EECCCcEEEEEcCCCcE------------EEEEe-cCCceeecCceeEEEeCCCCEEEEEe
Q 000425 51 KKPIVNLACHPRLPVLYV-AYADGLIRAYNIHTYAV------------HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaS-Gs~DgtIrLWDl~t~k~------------l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS 116 (1525)
.+....++++|||+++++ +..+.+|.|.|+.+.+. ....+ -..++ ..+|.++|....|-.
T Consensus 320 GKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlGPL------HTaFDg~G~aytslf 393 (635)
T PRK02888 320 PKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLGPL------HTAFDGRGNAYTTLF 393 (635)
T ss_pred CCCccceEECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEeeccCCCcc------eEEECCCCCEEEeEe
Confidence 345567899999998665 55599999999987653 23333 22332 579999998777778
Q ss_pred CCCcEEEEeCCCC--------CceeeEeec-----ccCcCeEEEEEcCCCCEEEEEE
Q 000425 117 RRGTLLAWDVSIE--------RPSMIGIIQ-----VGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 117 ~DgtVrVWDl~t~--------~~s~i~il~-----gH~~~VtsVafsPdg~~LaSgS 160 (1525)
-|..|-.||+.+. ...++.-+. +|.....+=.-.|+|+||++..
T Consensus 394 ~dsqv~kwn~~~a~~~~~g~~~~~v~~k~dV~y~pgh~~~~~g~t~~~dgk~l~~~n 450 (635)
T PRK02888 394 LDSQIVKWNIEAAIRAYKGEKVDPIVQKLDVHYQPGHNHASMGETKEADGKWLVSLN 450 (635)
T ss_pred ecceeEEEehHHHHHHhccccCCcceecccCCCccceeeecCCCcCCCCCCEEEEcc
Confidence 9999999998851 112222222 3332222333467888888864
No 327
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=92.62 E-value=4.4 Score=48.33 Aligned_cols=143 Identities=15% Similarity=0.130 Sum_probs=82.9
Q ss_pred EEEEcCCCCEEEEEEeccce-EEEEEeecCCccceEEccCCCCC--eEEEEEcCCCCEEEEEEC-----CCcEEEEEcC-
Q 000425 11 HLALTPLQPVVFFGFHRRMS-VTVVGTVEGGRAPTKIKTDLKKP--IVNLACHPRLPVLYVAYA-----DGLIRAYNIH- 81 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~s-I~vv~tieggk~~tklk~~H~~~--V~sLaf~Pdg~~LaSGs~-----DgtIrLWDl~- 81 (1525)
-++.||+.+.+.+-..+-.+ ..+++. ..+.....+.. -.+. ----.|+|+|++|++.=. .|.|-|||..
T Consensus 9 ~~a~~p~~~~avafaRRPG~~~~v~D~-~~g~~~~~~~a-~~gRHFyGHg~fs~dG~~LytTEnd~~~g~G~IgVyd~~~ 86 (305)
T PF07433_consen 9 GVAAHPTRPEAVAFARRPGTFALVFDC-RTGQLLQRLWA-PPGRHFYGHGVFSPDGRLLYTTENDYETGRGVIGVYDAAR 86 (305)
T ss_pred ceeeCCCCCeEEEEEeCCCcEEEEEEc-CCCceeeEEcC-CCCCEEecCEEEcCCCCEEEEeccccCCCcEEEEEEECcC
Confidence 37899977655433333233 234443 33332222221 0110 011689999999888533 5899999988
Q ss_pred CCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe-------CCCcEEEEeCCCCCceee----------E--ee--cc
Q 000425 82 TYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-------RRGTLLAWDVSIERPSMI----------G--II--QV 139 (1525)
Q Consensus 82 t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-------~DgtVrVWDl~t~~~s~i----------~--il--~g 139 (1525)
+.+.+..+. |.-.-. .+.+.|||+.|+.+. +.|..++ |+.+...++. . .+ .-
T Consensus 87 ~~~ri~E~~s~GIGPH-----el~l~pDG~tLvVANGGI~Thpd~GR~kL-Nl~tM~psL~~ld~~sG~ll~q~~Lp~~~ 160 (305)
T PF07433_consen 87 GYRRIGEFPSHGIGPH-----ELLLMPDGETLVVANGGIETHPDSGRAKL-NLDTMQPSLVYLDARSGALLEQVELPPDL 160 (305)
T ss_pred CcEEEeEecCCCcChh-----hEEEcCCCCEEEEEcCCCccCcccCceec-ChhhcCCceEEEecCCCceeeeeecCccc
Confidence 777888887 665566 888999998777653 1233333 3333211111 1 12 33
Q ss_pred cCcCeEEEEEcCCCCEEEEEEC
Q 000425 140 GSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 140 H~~~VtsVafsPdg~~LaSgS~ 161 (1525)
|...|+.++++++|..++.+-.
T Consensus 161 ~~lSiRHLa~~~~G~V~~a~Q~ 182 (305)
T PF07433_consen 161 HQLSIRHLAVDGDGTVAFAMQY 182 (305)
T ss_pred cccceeeEEecCCCcEEEEEec
Confidence 6667888888888876655443
No 328
>PF07433 DUF1513: Protein of unknown function (DUF1513); InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=92.53 E-value=4 Score=48.65 Aligned_cols=155 Identities=17% Similarity=0.081 Sum_probs=93.3
Q ss_pred EEEEcCCCCEEEEEEec----cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC---------------
Q 000425 11 HLALTPLQPVVFFGFHR----RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA--------------- 71 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~----d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~--------------- 71 (1525)
|-+|+|++.++|++=.+ +.-|-||+...+-...-.+.+ |.--=--+.+.|+++.|+++-.
T Consensus 55 Hg~fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s-~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl 133 (305)
T PF07433_consen 55 HGVFSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPS-HGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNL 133 (305)
T ss_pred CEEEcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecC-CCcChhhEEEcCCCCEEEEEcCCCccCcccCceecCh
Confidence 78999999999987544 344556664323222222222 2111123668889877776532
Q ss_pred ---CCcEEEEEcCCCcEEEE--E--e-cCCceeecCceeEEEeCCCCEEEEEeCCCc-------EEEEeCCCCCceee--
Q 000425 72 ---DGLIRAYNIHTYAVHYT--L--Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGT-------LLAWDVSIERPSMI-- 134 (1525)
Q Consensus 72 ---DgtIrLWDl~t~k~l~t--L--~-H~~~V~~~ga~sVaFSPdg~~L~SgS~Dgt-------VrVWDl~t~~~s~i-- 134 (1525)
+-.+-..|..+++.+.. + . |..+++ .+++.++|..++..-..+. |-+|+.......+.
T Consensus 134 ~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lSiR-----HLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~~~~~~~~p 208 (305)
T PF07433_consen 134 DTMQPSLVYLDARSGALLEQVELPPDLHQLSIR-----HLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGGALRLLPAP 208 (305)
T ss_pred hhcCCceEEEecCCCceeeeeecCcccccccee-----eEEecCCCcEEEEEecCCCCCccCCeEEEEcCCCcceeccCC
Confidence 23444556677777655 4 3 778899 9999999987666554332 33444332100000
Q ss_pred -EeecccCcCeEEEEEcCCCCEEEE-EECCCCeeEEEee
Q 000425 135 -GIIQVGSQPITSVAWLPMLRLLVT-LCRDGSLQVWKTR 171 (1525)
Q Consensus 135 -~il~gH~~~VtsVafsPdg~~LaS-gS~DgtIkVWDlr 171 (1525)
..+..-.++|-+|++++++.+++. +=.-+.+.+||..
T Consensus 209 ~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~ 247 (305)
T PF07433_consen 209 EEQWRRLNGYIGSIAADRDGRLIAVTSPRGGRVAVWDAA 247 (305)
T ss_pred hHHHHhhCCceEEEEEeCCCCEEEEECCCCCEEEEEECC
Confidence 011233568899999999987755 4456789999876
No 329
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=92.04 E-value=2.4 Score=55.08 Aligned_cols=136 Identities=15% Similarity=0.199 Sum_probs=84.0
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCC-----CCEEEEEECCCcEEEEE---cCCCcEEEEE
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPR-----LPVLYVAYADGLIRAYN---IHTYAVHYTL 89 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pd-----g~~LaSGs~DgtIrLWD---l~t~k~l~tL 89 (1525)
+.++..++ .|+++.+.+.... ...+.+ +.+.++.+|++||+ .+.|++|+.-| +-++. +.....+ .+
T Consensus 83 Gey~asCS-~DGkv~I~sl~~~-~~~~~~--df~rpiksial~Pd~~~~~sk~fv~GG~ag-lvL~er~wlgnk~~v-~l 156 (846)
T KOG2066|consen 83 GEYVASCS-DDGKVVIGSLFTD-DEITQY--DFKRPIKSIALHPDFSRQQSKQFVSGGMAG-LVLSERNWLGNKDSV-VL 156 (846)
T ss_pred CceEEEec-CCCcEEEeeccCC-ccceeE--ecCCcceeEEeccchhhhhhhheeecCcce-EEEehhhhhcCccce-ee
Confidence 45554444 5777766665433 222222 24689999999998 46799999888 66664 2222222 34
Q ss_pred e-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCe------EEEEEcCCCCEEEEEECC
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPI------TSVAWLPMLRLLVTLCRD 162 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~V------tsVafsPdg~~LaSgS~D 162 (1525)
. ..++|. ++.|+ |.+|+=++++| |+|||+.+. +.+..+-..| ..+ ..+.|.++.++++ | +.
T Consensus 157 ~~~eG~I~-----~i~W~--g~lIAWand~G-v~vyd~~~~-~~l~~i~~p~-~~~R~e~fpphl~W~~~~~LVI-G-W~ 224 (846)
T KOG2066|consen 157 SEGEGPIH-----SIKWR--GNLIAWANDDG-VKVYDTPTR-QRLTNIPPPS-QSVRPELFPPHLHWQDEDRLVI-G-WG 224 (846)
T ss_pred ecCccceE-----EEEec--CcEEEEecCCC-cEEEecccc-ceeeccCCCC-CCCCcccCCCceEecCCCeEEE-e-cC
Confidence 4 678899 99886 56777777666 799999853 2221111112 222 3578887766544 3 45
Q ss_pred CCeeEEEee
Q 000425 163 GSLQVWKTR 171 (1525)
Q Consensus 163 gtIkVWDlr 171 (1525)
.+|+|..++
T Consensus 225 d~v~i~~I~ 233 (846)
T KOG2066|consen 225 DSVKICSIK 233 (846)
T ss_pred CeEEEEEEe
Confidence 589999887
No 330
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=91.90 E-value=0.13 Score=58.02 Aligned_cols=157 Identities=14% Similarity=0.146 Sum_probs=85.3
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEee-cCCccceEEccCCCCCeEEEEEcCC-------CCEEEEEECCCcEEEEEc
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTV-EGGRAPTKIKTDLKKPIVNLACHPR-------LPVLYVAYADGLIRAYNI 80 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~ti-eggk~~tklk~~H~~~V~sLaf~Pd-------g~~LaSGs~DgtIrLWDl 80 (1525)
-+++.+||.+..+..+......-.+..+. .+......+.. |+-.+++..|+-. +..+++|..||.|.+|+.
T Consensus 9 ~~~~dk~~~~d~i~~~~t~~~~~~l~~~sa~~~v~~~~~~k-~k~s~rse~~~~e~~~v~~~~~~~~vG~~dg~v~~~n~ 87 (238)
T KOG2444|consen 9 NFGIDKHPSTDLIAAGLTDGHLHLLRATSADGLVRERKVRK-HKESCRSERFIDEGQRVVTASAKLMVGTSDGAVYVFNW 87 (238)
T ss_pred ccceeccchhhhhhhcchhhhhhhhccccCCcccccchhhh-hhhhhhhhhhhhcceeecccCceEEeecccceEEEecC
Confidence 46788888888776555332111122221 22222223332 5444444444332 356888999999999987
Q ss_pred CCCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccC-cCeEEEEEcCCCCEEE
Q 000425 81 HTYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGS-QPITSVAWLPMLRLLV 157 (1525)
Q Consensus 81 ~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~-~~VtsVafsPdg~~La 157 (1525)
...-.+.... ....+. +.|.--.++.+..+++.|+.||.|+... ....+....|+ ..+.....+..++.++
T Consensus 88 n~~g~~~d~~~s~~e~i~----~~Ip~~~~~~~~c~~~~dg~ir~~n~~p--~k~~g~~g~h~~~~~e~~ivv~sd~~i~ 161 (238)
T KOG2444|consen 88 NLEGAHSDRVCSGEESID----LGIPNGRDSSLGCVGAQDGRIRACNIKP--NKVLGYVGQHNFESGEELIVVGSDEFLK 161 (238)
T ss_pred CccchHHHhhhcccccce----eccccccccceeEEeccCCceeeecccc--CceeeeeccccCCCcceeEEecCCceEE
Confidence 6221111111 112222 1222222444667788999999999983 33333344565 4455555555566666
Q ss_pred EE--ECCCCeeEEEeee
Q 000425 158 TL--CRDGSLQVWKTRV 172 (1525)
Q Consensus 158 Sg--S~DgtIkVWDlr~ 172 (1525)
++ |.|..++.|++..
T Consensus 162 ~a~~S~d~~~k~W~ve~ 178 (238)
T KOG2444|consen 162 IADTSHDRVLKKWNVEK 178 (238)
T ss_pred eeccccchhhhhcchhh
Confidence 66 7777777777653
No 331
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=91.64 E-value=0.45 Score=60.95 Aligned_cols=112 Identities=13% Similarity=0.312 Sum_probs=85.3
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCC------------CEEEEEeC
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTL------------EWLFVGDR 117 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg------------~~L~SgS~ 117 (1525)
...-.+++|+|.| +||.||. ..|.+-|..+.+.+.+++ |...|+ .|+|.|-. -.|+++..
T Consensus 15 ~sN~~A~Dw~~~G-LiAygsh-slV~VVDs~s~q~iqsie~h~s~V~-----~VrWap~~~p~~llS~~~~~lliAsaD~ 87 (1062)
T KOG1912|consen 15 RSNRNAADWSPSG-LIAYGSH-SLVSVVDSRSLQLIQSIELHQSAVT-----SVRWAPAPSPRDLLSPSSSQLLIASADI 87 (1062)
T ss_pred cccccccccCccc-eEEEecC-ceEEEEehhhhhhhhccccCcccee-----EEEeccCCCchhccCccccceeEEeccc
Confidence 3445678999876 7777775 468899999999999999 999999 99997631 23666778
Q ss_pred CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCC---C-CEEEEEECCCCeeEEEee
Q 000425 118 RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPM---L-RLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 118 DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPd---g-~~LaSgS~DgtIkVWDlr 171 (1525)
.|.|.+||.... ..+..++.|...|..++|-+. . .+|+.-..-.+|.+|+..
T Consensus 88 ~GrIil~d~~~~--s~~~~l~~~~~~~qdl~W~~~rd~Srd~LlaIh~ss~lvLwntd 143 (1062)
T KOG1912|consen 88 SGRIILVDFVLA--SVINWLSHSNDSVQDLCWVPARDDSRDVLLAIHGSSTLVLWNTD 143 (1062)
T ss_pred cCcEEEEEehhh--hhhhhhcCCCcchhheeeeeccCcchheeEEecCCcEEEEEEcc
Confidence 899999999853 222235678889999999773 3 456666777899999876
No 332
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=91.15 E-value=2.5 Score=52.70 Aligned_cols=149 Identities=14% Similarity=0.171 Sum_probs=79.8
Q ss_pred EEEcCCCCEE-EEEEeccc--eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEE-EEEECCCcEEE--EEcCCCcE
Q 000425 12 LALTPLQPVV-FFGFHRRM--SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVL-YVAYADGLIRA--YNIHTYAV 85 (1525)
Q Consensus 12 LA~hP~qp~V-f~s~~~d~--sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~L-aSGs~DgtIrL--WDl~t~k~ 85 (1525)
-+++|++..+ +..+.... .+.+++ .+.+.....+.. .+.-..-+|+|||+.+ ++...||...| .|+.+..
T Consensus 198 p~ws~~~~~~~y~~f~~~~~~~i~~~~-l~~g~~~~i~~~--~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~- 273 (425)
T COG0823 198 PAWSPDGKKLAYVSFELGGCPRIYYLD-LNTGKRPVILNF--NGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKN- 273 (425)
T ss_pred cccCcCCCceEEEEEecCCCceEEEEe-ccCCccceeecc--CCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCc-
Confidence 3556664443 33333333 233333 344443333331 2333446899999864 56666776555 5666655
Q ss_pred EEEEecCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 86 HYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 86 l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
...+.+...+. +.=.|+|||++|+-.+ ..|.-.||-+......... +..+...-.+-.|+|||++|+..+.++.
T Consensus 274 ~~~Lt~~~gi~----~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~r-iT~~~~~~~~p~~SpdG~~i~~~~~~~g 348 (425)
T COG0823 274 LPRLTNGFGIN----TSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTR-LTFSGGGNSNPVWSPDGDKIVFESSSGG 348 (425)
T ss_pred ceecccCCccc----cCccCCCCCCEEEEEeCCCCCcceEEECCCCCceeE-eeccCCCCcCccCCCCCCEEEEEeccCC
Confidence 33355443333 1447899999888555 6676666644432111111 1122222226789999999998875422
Q ss_pred eeEEEee
Q 000425 165 LQVWKTR 171 (1525)
Q Consensus 165 IkVWDlr 171 (1525)
- |++.
T Consensus 349 ~--~~i~ 353 (425)
T COG0823 349 Q--WDID 353 (425)
T ss_pred c--eeeE
Confidence 2 5554
No 333
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=90.85 E-value=4.2 Score=54.15 Aligned_cols=126 Identities=15% Similarity=0.189 Sum_probs=80.4
Q ss_pred ecCCccceEEccCCCCCeEEEEEcC-----CCCEEEEEECCCcEEEEEcCCC--cEEEE-Ee---cCCceeecCceeEEE
Q 000425 37 VEGGRAPTKIKTDLKKPIVNLACHP-----RLPVLYVAYADGLIRAYNIHTY--AVHYT-LQ---LDNTIKLLGAGAFAF 105 (1525)
Q Consensus 37 ieggk~~tklk~~H~~~V~sLaf~P-----dg~~LaSGs~DgtIrLWDl~t~--k~l~t-L~---H~~~V~~~ga~sVaF 105 (1525)
.+.|+.+...+.....+|..++-.. +...-++|-.+..+..||.+-. +++.. .. ...... |++-
T Consensus 511 Le~GKVV~eW~~~~~~~v~~~~p~~K~aqlt~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs-----~~aT 585 (794)
T PF08553_consen 511 LERGKVVEEWKVHDDIPVVDIAPDSKFAQLTNEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFS-----CFAT 585 (794)
T ss_pred cCCCcEEEEeecCCCcceeEecccccccccCCCceEEEECCCceEEeccCCCCCceeeccccccccCCCce-----EEEe
Confidence 4556666655553334466654321 1234567888889999997643 33321 11 233445 6665
Q ss_pred eCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 106 HPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 106 SPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+.+| +|+.||.+|.||+||-... +-...+.+-..+|.+|..+.||++|+..+ +..+.++|..
T Consensus 586 t~~G-~iavgs~~G~IRLyd~~g~--~AKT~lp~lG~pI~~iDvt~DGkwilaTc-~tyLlLi~t~ 647 (794)
T PF08553_consen 586 TEDG-YIAVGSNKGDIRLYDRLGK--RAKTALPGLGDPIIGIDVTADGKWILATC-KTYLLLIDTL 647 (794)
T ss_pred cCCc-eEEEEeCCCcEEeecccch--hhhhcCCCCCCCeeEEEecCCCcEEEEee-cceEEEEEEe
Confidence 5554 7999999999999995421 11122346678999999999999987766 4577788764
No 334
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=90.73 E-value=7.3 Score=46.32 Aligned_cols=146 Identities=15% Similarity=0.162 Sum_probs=87.7
Q ss_pred CCEEEEEEec--------c-ceEEEEEeecCC----ccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 18 QPVVFFGFHR--------R-MSVTVVGTVEGG----RAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 18 qp~Vf~s~~~--------d-~sI~vv~tiegg----k~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
.+++.+|... + ..+.++...+.+ +-.......++++|++++-- +..++++. ...|.+|++...+
T Consensus 42 ~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~--~~~lv~~~-g~~l~v~~l~~~~ 118 (321)
T PF03178_consen 42 KEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHSTEVKGPVTAICSF--NGRLVVAV-GNKLYVYDLDNSK 118 (321)
T ss_dssp SEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEE--TTEEEEEE-TTEEEEEEEETTS
T ss_pred cCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEEEeecCcceEhhhh--CCEEEEee-cCEEEEEEccCcc
Confidence 5666666652 2 556666654421 11222223467899998766 44455554 4789999998877
Q ss_pred -EEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEe-ecccCcCeEEEEEcCCCCEEEEEECC
Q 000425 85 -VHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGI-IQVGSQPITSVAWLPMLRLLVTLCRD 162 (1525)
Q Consensus 85 -~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~i-l~gH~~~VtsVafsPdg~~LaSgS~D 162 (1525)
....-.+...+. .++....+++|+.|.....+.++.++....++.-. .+.....++++.|-+++..++.+..+
T Consensus 119 ~l~~~~~~~~~~~-----i~sl~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~ 193 (321)
T PF03178_consen 119 TLLKKAFYDSPFY-----ITSLSVFKNYILVGDAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKD 193 (321)
T ss_dssp SEEEEEEE-BSSS-----EEEEEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETT
T ss_pred cchhhheecceEE-----EEEEeccccEEEEEEcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCC
Confidence 444433333223 33444446799999988888888655322223211 12335568889998776799999999
Q ss_pred CCeeEEEee
Q 000425 163 GSLQVWKTR 171 (1525)
Q Consensus 163 gtIkVWDlr 171 (1525)
|.+.++...
T Consensus 194 gnl~~l~~~ 202 (321)
T PF03178_consen 194 GNLFVLRYN 202 (321)
T ss_dssp SEEEEEEE-
T ss_pred CeEEEEEEC
Confidence 999999875
No 335
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=90.49 E-value=16 Score=43.72 Aligned_cols=52 Identities=17% Similarity=0.291 Sum_probs=33.9
Q ss_pred CCCCCCCCCCccCCchhh------------HHHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCC
Q 000425 1293 ASPPTSVRPGQVPRGAAA------------SVCFKTGLAHLEQNQLPDALSCFDEAFLALAKDH 1344 (1525)
Q Consensus 1293 ~~~~~~~~p~~~~~~~~~------------~~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~ 1344 (1525)
+-+++.+-|--.|-+.+- .+.+..|..--.+|.+.+|++.+..+|+++-.++
T Consensus 344 ~~~~r~~~p~~~plSeAEFEdiM~RNraiSSSAIsrAvsdASaGDy~~AiETllTAI~lIKqSr 407 (498)
T KOG4849|consen 344 GLPPRHVNPQMFPLSEAEFEDIMTRNRAISSSAISRAVSDASAGDYKGAIETLLTAIQLIKQSR 407 (498)
T ss_pred CCCcccCCCCCccchHHHHHHHHhhcchhhHHHHHHHhcccccccchhHHHHHHHHHHHHHhhc
Confidence 455667776555533221 2235556666779999999999999998544443
No 336
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=90.40 E-value=2.1 Score=53.46 Aligned_cols=146 Identities=14% Similarity=0.106 Sum_probs=85.3
Q ss_pred CCcEEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEE-EEECCCc--EEEEEcCC
Q 000425 6 VDTEVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLY-VAYADGL--IRAYNIHT 82 (1525)
Q Consensus 6 ~~~e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~La-SGs~Dgt--IrLWDl~t 82 (1525)
...+.+.+|+|++..|+|+..++.+..++-....++....+.. -.+.-..=.|+|+|+.++ +.+..|. |.+.|...
T Consensus 237 ~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~Lt~-~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g 315 (425)
T COG0823 237 NGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPRLTN-GFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEG 315 (425)
T ss_pred CCccCCccCCCCCCEEEEEECCCCCccEEEEcCCCCcceeccc-CCccccCccCCCCCCEEEEEeCCCCCcceEEECCCC
Confidence 3456788999999999999999888777655444444444332 122222457899999755 4444555 44445555
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeC-CCc--EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEE
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDR-RGT--LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVT 158 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~-Dgt--VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaS 158 (1525)
... ..+. ....-. .-.|+|||++|+..+. +|. |-+.|..+... ...+ .+......-+|.|+++.+..
T Consensus 316 ~~~-~riT~~~~~~~-----~p~~SpdG~~i~~~~~~~g~~~i~~~~~~~~~~--~~~l-t~~~~~e~ps~~~ng~~i~~ 386 (425)
T COG0823 316 SQV-TRLTFSGGGNS-----NPVWSPDGDKIVFESSSGGQWDIDKNDLASGGK--IRIL-TSTYLNESPSWAPNGRMIMF 386 (425)
T ss_pred Cce-eEeeccCCCCc-----CccCCCCCCEEEEEeccCCceeeEEeccCCCCc--EEEc-cccccCCCCCcCCCCceEEE
Confidence 544 3333 322222 4578999999887663 344 44555543211 1112 23333445667778777665
Q ss_pred EEC
Q 000425 159 LCR 161 (1525)
Q Consensus 159 gS~ 161 (1525)
.+.
T Consensus 387 ~s~ 389 (425)
T COG0823 387 SSG 389 (425)
T ss_pred ecc
Confidence 543
No 337
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=90.38 E-value=12 Score=46.14 Aligned_cols=151 Identities=19% Similarity=0.163 Sum_probs=100.0
Q ss_pred EEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEEC---CCcEEEEEcCCCcEE
Q 000425 10 VHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYA---DGLIRAYNIHTYAVH 86 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~---DgtIrLWDl~t~k~l 86 (1525)
-.+++.|.+..++....++..+.++++... +....+..+. .-..++++|++..+.++.. +++|.+.|..+.+.+
T Consensus 77 ~~i~v~~~~~~vyv~~~~~~~v~vid~~~~-~~~~~~~vG~--~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~ 153 (381)
T COG3391 77 AGVAVNPAGNKVYVTTGDSNTVSVIDTATN-TVLGSIPVGL--GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVT 153 (381)
T ss_pred cceeeCCCCCeEEEecCCCCeEEEEcCccc-ceeeEeeecc--CCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEE
Confidence 357899999999988877788888883222 1111111112 4467999999987666554 789999999999988
Q ss_pred EEEe-cCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeE-e---ecccCcCeEEEEEcCCCCEEEEEE
Q 000425 87 YTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIG-I---IQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 87 ~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~-i---l~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
.+.. ...+ . .+++.|+|..++... .++.|.+.|.++. .... . ...-...-..+.++|++.++...-
T Consensus 154 ~~~~vG~~P-~-----~~a~~p~g~~vyv~~~~~~~v~vi~~~~~--~v~~~~~~~~~~~~~~P~~i~v~~~g~~~yV~~ 225 (381)
T COG3391 154 ATIPVGNTP-T-----GVAVDPDGNKVYVTNSDDNTVSVIDTSGN--SVVRGSVGSLVGVGTGPAGIAVDPDGNRVYVAN 225 (381)
T ss_pred EEEecCCCc-c-----eEEECCCCCeEEEEecCCCeEEEEeCCCc--ceeccccccccccCCCCceEEECCCCCEEEEEe
Confidence 8855 3344 6 899999999766665 8999999997742 2210 0 001122235689999998655443
Q ss_pred CC---CCeeEEEee
Q 000425 161 RD---GSLQVWKTR 171 (1525)
Q Consensus 161 ~D---gtIkVWDlr 171 (1525)
.. +++..-|..
T Consensus 226 ~~~~~~~v~~id~~ 239 (381)
T COG3391 226 DGSGSNNVLKIDTA 239 (381)
T ss_pred ccCCCceEEEEeCC
Confidence 32 466666654
No 338
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=90.03 E-value=3.9 Score=48.01 Aligned_cols=141 Identities=19% Similarity=0.187 Sum_probs=91.9
Q ss_pred CCEEEEEEec-cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCE-EEEEECCCc-EEEEEcCCCcEEEEEe-cC-
Q 000425 18 QPVVFFGFHR-RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPV-LYVAYADGL-IRAYNIHTYAVHYTLQ-LD- 92 (1525)
Q Consensus 18 qp~Vf~s~~~-d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~-LaSGs~Dgt-IrLWDl~t~k~l~tL~-H~- 92 (1525)
+|.+..++.. |.+..+....+.++....+. ....--.++|||..+. ++.+-.-|+ ..++|..+.+...++. .+
T Consensus 35 qavfasaf~~~dgs~g~a~~~eaGk~v~~~~--lpaR~Hgi~~~p~~~ravafARrPGtf~~vfD~~~~~~pv~~~s~~~ 112 (366)
T COG3490 35 QAVFASAFDARDGSFGAATLSEAGKIVFATA--LPARGHGIAFHPALPRAVAFARRPGTFAMVFDPNGAQEPVTLVSQEG 112 (366)
T ss_pred cchheeeeeccCCceeEEEEccCCceeeeee--cccccCCeecCCCCcceEEEEecCCceEEEECCCCCcCcEEEecccC
Confidence 3444444432 44444444455566555443 2344456899998774 666666665 5578988887766665 21
Q ss_pred -CceeecCceeEEEeCCCCEEEEEeCC-----CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 93 -NTIKLLGAGAFAFHPTLEWLFVGDRR-----GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 93 -~~V~~~ga~sVaFSPdg~~L~SgS~D-----gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
...- .=.|||||.||...-.| |.|-|||.+. ..+.++.+..|.-.-..+.|.+||+.++.+ +|-|.
T Consensus 113 RHfyG-----HGvfs~dG~~LYATEndfd~~rGViGvYd~r~-~fqrvgE~~t~GiGpHev~lm~DGrtlvva--nGGIe 184 (366)
T COG3490 113 RHFYG-----HGVFSPDGRLLYATENDFDPNRGVIGVYDARE-GFQRVGEFSTHGIGPHEVTLMADGRTLVVA--NGGIE 184 (366)
T ss_pred ceeec-----ccccCCCCcEEEeecCCCCCCCceEEEEeccc-ccceecccccCCcCcceeEEecCCcEEEEe--CCcee
Confidence 2222 44789999999987643 6799999984 344556677888888899999999998875 34454
Q ss_pred EE
Q 000425 167 VW 168 (1525)
Q Consensus 167 VW 168 (1525)
.-
T Consensus 185 th 186 (366)
T COG3490 185 TH 186 (366)
T ss_pred cc
Confidence 43
No 339
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=89.51 E-value=2.8 Score=51.40 Aligned_cols=146 Identities=16% Similarity=0.093 Sum_probs=94.1
Q ss_pred CCEEEEEEec-cceEEEEEeecCCccceEEcc-CCCCCeEEEEEcCCCC--EEEE-EECCCcEEEEEcCCCc-EEEEEe-
Q 000425 18 QPVVFFGFHR-RMSVTVVGTVEGGRAPTKIKT-DLKKPIVNLACHPRLP--VLYV-AYADGLIRAYNIHTYA-VHYTLQ- 90 (1525)
Q Consensus 18 qp~Vf~s~~~-d~sI~vv~tieggk~~tklk~-~H~~~V~sLaf~Pdg~--~LaS-Gs~DgtIrLWDl~t~k-~l~tL~- 90 (1525)
.-|++++-.. +.++++++. +.-.....++. ...+.+..+ .+|... .+++ .-.++.|.++|-.... +..-+.
T Consensus 64 dg~L~~Sv~d~Dhs~KvfDv-En~DminmiKL~~lPg~a~wv-~skGd~~s~IAVs~~~sg~i~VvD~~~d~~q~~~fkk 141 (558)
T KOG0882|consen 64 DGWLFRSVEDPDHSVKVFDV-ENFDMINMIKLVDLPGFAEWV-TSKGDKISLIAVSLFKSGKIFVVDGFGDFCQDGYFKK 141 (558)
T ss_pred cceeEeeccCcccceeEEEe-eccchhhhcccccCCCceEEe-cCCCCeeeeEEeecccCCCcEEECCcCCcCccceecc
Confidence 3466666444 788887664 33222222222 133433332 222221 3444 3457899999955432 333343
Q ss_pred -cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC-C---Cc----------eeeEeecccCcCeEEEEEcCCCCE
Q 000425 91 -LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI-E---RP----------SMIGIIQVGSQPITSVAWLPMLRL 155 (1525)
Q Consensus 91 -H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t-~---~~----------s~i~il~gH~~~VtsVafsPdg~~ 155 (1525)
|..+|. ++.+++.+..++|....|.|.-|.... . .. .+.++.. ......++.|+|++..
T Consensus 142 lH~sPV~-----~i~y~qa~Ds~vSiD~~gmVEyWs~e~~~qfPr~~l~~~~K~eTdLy~f~K-~Kt~pts~Efsp~g~q 215 (558)
T KOG0882|consen 142 LHFSPVK-----KIRYNQAGDSAVSIDISGMVEYWSAEGPFQFPRTNLNFELKHETDLYGFPK-AKTEPTSFEFSPDGAQ 215 (558)
T ss_pred cccCceE-----EEEeeccccceeeccccceeEeecCCCcccCccccccccccccchhhcccc-cccCccceEEccccCc
Confidence 999999 999999999999999999999998772 1 00 0111111 2345678999999999
Q ss_pred EEEEECCCCeeEEEee
Q 000425 156 LVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 156 LaSgS~DgtIkVWDlr 171 (1525)
+.+-+.|..|++++.+
T Consensus 216 istl~~DrkVR~F~~K 231 (558)
T KOG0882|consen 216 ISTLNPDRKVRGFVFK 231 (558)
T ss_pred ccccCcccEEEEEEec
Confidence 9999999999999987
No 340
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=89.19 E-value=14 Score=45.37 Aligned_cols=153 Identities=18% Similarity=0.232 Sum_probs=100.7
Q ss_pred cEEEEEEcCCCCEEEEEEec--cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEE-CCCcEEEEEcCCCc
Q 000425 8 TEVHLALTPLQPVVFFGFHR--RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAY-ADGLIRAYNIHTYA 84 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~--d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs-~DgtIrLWDl~t~k 84 (1525)
....++++|.+..++++-.. ..++.+++........+ +..+ ..+ ..+++.|++..++++. .+++|.+.|..+..
T Consensus 117 ~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~-~~vG-~~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~ 193 (381)
T COG3391 117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTAT-IPVG-NTP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGNS 193 (381)
T ss_pred CCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEE-EecC-CCc-ceEEECCCCCeEEEEecCCCeEEEEeCCCcc
Confidence 45679999999999999883 57777777544433333 3332 234 7899999999665554 78999999977766
Q ss_pred EEE-E----EecCCceeecCceeEEEeCCCCEEEEEeCC---CcEEEEeCCCCCceeeEe--ecccCcCeEEEEEcCCCC
Q 000425 85 VHY-T----LQLDNTIKLLGAGAFAFHPTLEWLFVGDRR---GTLLAWDVSIERPSMIGI--IQVGSQPITSVAWLPMLR 154 (1525)
Q Consensus 85 ~l~-t----L~H~~~V~~~ga~sVaFSPdg~~L~SgS~D---gtVrVWDl~t~~~s~i~i--l~gH~~~VtsVafsPdg~ 154 (1525)
..+ + +.... +.+.+.++|+|.++...... +.+.+.|..+. ..... .-+-. .-..+.++|++.
T Consensus 194 v~~~~~~~~~~~~~-----~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~--~v~~~~~~~~~~-~~~~v~~~p~g~ 265 (381)
T COG3391 194 VVRGSVGSLVGVGT-----GPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATG--NVTATDLPVGSG-APRGVAVDPAGK 265 (381)
T ss_pred eeccccccccccCC-----CCceEEECCCCCEEEEEeccCCCceEEEEeCCCc--eEEEeccccccC-CCCceeECCCCC
Confidence 654 1 21111 12378999999976665533 58888888853 22210 11111 345689999999
Q ss_pred EEEEE-ECCCCeeEEEee
Q 000425 155 LLVTL-CRDGSLQVWKTR 171 (1525)
Q Consensus 155 ~LaSg-S~DgtIkVWDlr 171 (1525)
.+... +..+++.+-|.+
T Consensus 266 ~~yv~~~~~~~V~vid~~ 283 (381)
T COG3391 266 AAYVANSQGGTVSVIDGA 283 (381)
T ss_pred EEEEEecCCCeEEEEeCC
Confidence 88777 444678777765
No 341
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=89.05 E-value=18 Score=43.07 Aligned_cols=117 Identities=11% Similarity=0.166 Sum_probs=80.8
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
.+..|++++|+|+.+.|++-.....--||=...++.+.++. .-.... .+.|.-+|++.++--.++.+.++.+.
T Consensus 84 ~~~nvS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE-----~Ieyig~n~fvi~dER~~~l~~~~vd 158 (316)
T COG3204 84 ETANVSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPE-----TIEYIGGNQFVIVDERDRALYLFTVD 158 (316)
T ss_pred ccccccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccccCChh-----HeEEecCCEEEEEehhcceEEEEEEc
Confidence 34459999999999877766666666677667789998887 223344 67888888888887799999888665
Q ss_pred CCCc-------eee-EeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 128 IERP-------SMI-GIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 128 t~~~-------s~i-~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.... .+. +.....+..--.++|+|....|..+=.-.-++|+.+.
T Consensus 159 ~~t~~~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~ 210 (316)
T COG3204 159 ADTTVISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVT 210 (316)
T ss_pred CCccEEeccceEEeccccCCCCcCceeeecCCCCceEEEEEccCCcEEEEEe
Confidence 3211 111 1111114556789999988877777676677777665
No 342
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=88.42 E-value=8 Score=47.44 Aligned_cols=102 Identities=11% Similarity=0.091 Sum_probs=63.8
Q ss_pred EEEEEcCCCCEEEEE-ECC----CcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCC-----------
Q 000425 55 VNLACHPRLPVLYVA-YAD----GLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRR----------- 118 (1525)
Q Consensus 55 ~sLaf~Pdg~~LaSG-s~D----gtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~D----------- 118 (1525)
....++|+++++|.+ +.. .+++++|+.+++.+...-...... .+.|.+|++.|+....|
T Consensus 127 ~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~~~~-----~~~W~~d~~~~~y~~~~~~~~~~~~~~~ 201 (414)
T PF02897_consen 127 GGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENPKFS-----SVSWSDDGKGFFYTRFDEDQRTSDSGYP 201 (414)
T ss_dssp EEEEETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEEESE-----EEEECTTSSEEEEEECSTTTSS-CCGCC
T ss_pred eeeeECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCcccccccc-----eEEEeCCCCEEEEEEeCcccccccCCCC
Confidence 357899999988765 333 459999999996654321111122 48999999887766533
Q ss_pred CcEEEEeCCCCCce--eeEeecccCcC--eEEEEEcCCCCEEEEEECCC
Q 000425 119 GTLLAWDVSIERPS--MIGIIQVGSQP--ITSVAWLPMLRLLVTLCRDG 163 (1525)
Q Consensus 119 gtVrVWDl~t~~~s--~i~il~gH~~~--VtsVafsPdg~~LaSgS~Dg 163 (1525)
..|+.|.+.+.... ++ +...... ...+..++|+++|+..+..+
T Consensus 202 ~~v~~~~~gt~~~~d~lv--fe~~~~~~~~~~~~~s~d~~~l~i~~~~~ 248 (414)
T PF02897_consen 202 RQVYRHKLGTPQSEDELV--FEEPDEPFWFVSVSRSKDGRYLFISSSSG 248 (414)
T ss_dssp EEEEEEETTS-GGG-EEE--EC-TTCTTSEEEEEE-TTSSEEEEEEESS
T ss_pred cEEEEEECCCChHhCeeE--EeecCCCcEEEEEEecCcccEEEEEEEcc
Confidence 23778888764222 33 3333322 56788999999877554443
No 343
>PF15492 Nbas_N: Neuroblastoma-amplified sequence, N terminal
Probab=87.80 E-value=24 Score=41.62 Aligned_cols=66 Identities=18% Similarity=0.177 Sum_probs=47.0
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCC--cEEEEEe-cC---CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTY--AVHYTLQ-LD---NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~--k~l~tL~-H~---~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
++.+.+|+.||+- .|..|.|=..++. .++.+.+ +. -.-+ .++||||+..|+.+...|+|+++|+..
T Consensus 3 ~~~~~~Gk~lAi~-qd~~iEiRsa~Ddf~si~~kcqVpkD~~PQWR-----kl~WSpD~tlLa~a~S~G~i~vfdl~g 74 (282)
T PF15492_consen 3 LALSSDGKLLAIL-QDQCIEIRSAKDDFSSIIGKCQVPKDPNPQWR-----KLAWSPDCTLLAYAESTGTIRVFDLMG 74 (282)
T ss_pred eeecCCCcEEEEE-eccEEEEEeccCCchheeEEEecCCCCCchhe-----EEEECCCCcEEEEEcCCCeEEEEeccc
Confidence 4567788877764 5677777654432 3334434 22 2335 799999999999999999999999984
No 344
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=87.78 E-value=2 Score=51.50 Aligned_cols=25 Identities=16% Similarity=0.182 Sum_probs=19.1
Q ss_pred HHHHHHHhhcCChHHHHHHHHHHHH
Q 000425 1314 FKTGLAHLEQNQLPDALSCFDEAFL 1338 (1525)
Q Consensus 1314 lk~gy~~~t~gkf~eAl~~Fr~iL~ 1338 (1525)
+..|.-+...|++.+|+..|+.++.
T Consensus 184 ~~la~~~~~~~~~~~A~~~~~~al~ 208 (389)
T PRK11788 184 CELAQQALARGDLDAARALLKKALA 208 (389)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHh
Confidence 3455566778889999988888876
No 345
>PRK13616 lipoprotein LpqB; Provisional
Probab=87.78 E-value=10 Score=49.49 Aligned_cols=152 Identities=12% Similarity=0.021 Sum_probs=84.0
Q ss_pred cEEEEEEcCCCCEEEEEEe-------ccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECC-CcEEEEE
Q 000425 8 TEVHLALTPLQPVVFFGFH-------RRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD-GLIRAYN 79 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~-------~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D-gtIrLWD 79 (1525)
...+.+++|++..+.+... .+..+.++.. ++.. ..+.. ......-.|+|+|..+++.+.. ..+++.+
T Consensus 351 ~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~--gg~~-~~lt~--g~~~t~PsWspDG~~lw~v~dg~~~~~v~~ 425 (591)
T PRK13616 351 NITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPL--GGVA-VQVLE--GHSLTRPSWSLDADAVWVVVDGNTVVRVIR 425 (591)
T ss_pred CcccceECCCCCEEEEEEeecCCCCCcceEEEEEeC--CCcc-eeeec--CCCCCCceECCCCCceEEEecCcceEEEec
Confidence 3457889999998877762 1234444332 2222 22222 1236778999998877666533 2333333
Q ss_pred cCCCcEEEEEe---------cCCceeecCceeEEEeCCCCEEEEEeCCCcEEE---EeCCCCCceee---EeecccCcCe
Q 000425 80 IHTYAVHYTLQ---------LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLA---WDVSIERPSMI---GIIQVGSQPI 144 (1525)
Q Consensus 80 l~t~k~l~tL~---------H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrV---WDl~t~~~s~i---~il~gH~~~V 144 (1525)
-.....+..+. ....|. .++|||||..++... ++.|.| -........+. ....+-...+
T Consensus 426 ~~~~gql~~~~vd~ge~~~~~~g~Is-----sl~wSpDG~RiA~i~-~g~v~Va~Vvr~~~G~~~l~~~~~l~~~l~~~~ 499 (591)
T PRK13616 426 DPATGQLARTPVDASAVASRVPGPIS-----ELQLSRDGVRAAMII-GGKVYLAVVEQTEDGQYALTNPREVGPGLGDTA 499 (591)
T ss_pred cCCCceEEEEeccCchhhhccCCCcC-----eEEECCCCCEEEEEE-CCEEEEEEEEeCCCCceeecccEEeecccCCcc
Confidence 22222222221 134577 999999999888665 466665 22221111221 0111222335
Q ss_pred EEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 145 TSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 145 tsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.++.|.+++.++ .+..++...+|.+.
T Consensus 500 ~~l~W~~~~~L~-V~~~~~~~~v~~v~ 525 (591)
T PRK13616 500 VSLDWRTGDSLV-VGRSDPEHPVWYVN 525 (591)
T ss_pred ccceEecCCEEE-EEecCCCCceEEEe
Confidence 789999999865 55556666778765
No 346
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=87.43 E-value=8.3 Score=42.84 Aligned_cols=104 Identities=19% Similarity=0.125 Sum_probs=65.5
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc-c
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV-G 140 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g-H 140 (1525)
++..+++++.++.+..||..+++.+..+.....+. ..- ..++..+++++.|+.+..+|..+........... .
T Consensus 35 ~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~-----~~~-~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~ 108 (238)
T PF13360_consen 35 DGGRVYVASGDGNLYALDAKTGKVLWRFDLPGPIS-----GAP-VVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSP 108 (238)
T ss_dssp ETTEEEEEETTSEEEEEETTTSEEEEEEECSSCGG-----SGE-EEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSC
T ss_pred eCCEEEEEcCCCEEEEEECCCCCEEEEeecccccc-----cee-eecccccccccceeeeEecccCCcceeeeecccccc
Confidence 56778888899999999999999988887333322 111 2345566777788899999988642222100101 0
Q ss_pred Cc-CeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 141 SQ-PITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 141 ~~-~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.. .........++..++.+..++.|..+|.+
T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~g~l~~~d~~ 140 (238)
T PF13360_consen 109 PAGVRSSSSPAVDGDRLYVGTSSGKLVALDPK 140 (238)
T ss_dssp TCSTB--SEEEEETTEEEEEETCSEEEEEETT
T ss_pred ccccccccCceEecCEEEEEeccCcEEEEecC
Confidence 00 11222233337788888888999988865
No 347
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=87.29 E-value=1.5 Score=55.62 Aligned_cols=67 Identities=19% Similarity=0.449 Sum_probs=55.7
Q ss_pred eEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeE-EEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 102 AFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPIT-SVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 102 sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~Vt-sVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+.|+|.-..|+.+..+|.|.+..++ .+++. +..-|...++ +++|.|||++|+.|=.||+|++-|+.
T Consensus 25 ~~ewnP~~dLiA~~t~~gelli~R~n--~qRlw-tip~p~~~v~~sL~W~~DGkllaVg~kdG~I~L~Dve 92 (665)
T KOG4640|consen 25 RIEWNPKMDLIATRTEKGELLIHRLN--WQRLW-TIPIPGENVTASLCWRPDGKLLAVGFKDGTIRLHDVE 92 (665)
T ss_pred EEEEcCccchhheeccCCcEEEEEec--cceeE-eccCCCCccceeeeecCCCCEEEEEecCCeEEEEEcc
Confidence 78999999999999999999888777 23333 2334666776 99999999999999999999999985
No 348
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=87.05 E-value=15 Score=47.02 Aligned_cols=154 Identities=15% Similarity=0.206 Sum_probs=84.1
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEe--ecCCc--cceEEccCCCCCeEEEEEcCC----CCEEEEEECCCcEEEEEcC-
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGT--VEGGR--APTKIKTDLKKPIVNLACHPR----LPVLYVAYADGLIRAYNIH- 81 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~t--ieggk--~~tklk~~H~~~V~sLaf~Pd----g~~LaSGs~DgtIrLWDl~- 81 (1525)
|=|+||..-.+++ +...|.+.+. +.+.. -..++.+ .-..|.++.|.|- -+.++.-.....|.||-+.
T Consensus 16 ~qAiHPvhGlaWT---DGkqVvLT~L~l~~gE~kfGds~viG-qFEhV~GlsW~P~~~~~~paLLAVQHkkhVtVWqL~~ 91 (671)
T PF15390_consen 16 HQAIHPVHGLAWT---DGKQVVLTDLQLHNGEPKFGDSKVIG-QFEHVHGLSWAPPCTADTPALLAVQHKKHVTVWQLCP 91 (671)
T ss_pred HhhhccccceEec---CCCEEEEEeeeeeCCccccCCccEee-ccceeeeeeecCcccCCCCceEEEeccceEEEEEecc
Confidence 4477888776554 2333333222 22210 0112222 3456899999985 3334444456789999865
Q ss_pred ----CCcEEEEE--ecCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCC
Q 000425 82 ----TYAVHYTL--QLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLR 154 (1525)
Q Consensus 82 ----t~k~l~tL--~H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~ 154 (1525)
..+.+..- +...+.- ...+.+.|||....|..-. .|-+| +++++.....+..-+. ..+.|.|.+|..||+
T Consensus 92 s~~e~~K~l~sQtcEi~e~~p-vLpQGCVWHPk~~iL~VLT~~dvSV-~~sV~~d~srVkaDi~-~~G~IhCACWT~DG~ 168 (671)
T PF15390_consen 92 STTERNKLLMSQTCEIREPFP-VLPQGCVWHPKKAILTVLTARDVSV-LPSVHCDSSRVKADIK-TSGLIHCACWTKDGQ 168 (671)
T ss_pred CccccccceeeeeeeccCCcc-cCCCcccccCCCceEEEEecCceeE-eeeeeeCCceEEEecc-CCceEEEEEecCcCC
Confidence 22333222 2222211 1135678999988766544 44333 4566643333433333 356899999999999
Q ss_pred EEEEEECC-CCeeEEEee
Q 000425 155 LLVTLCRD-GSLQVWKTR 171 (1525)
Q Consensus 155 ~LaSgS~D-gtIkVWDlr 171 (1525)
.|+.+-.. =.-.|||-.
T Consensus 169 RLVVAvGSsLHSyiWd~~ 186 (671)
T PF15390_consen 169 RLVVAVGSSLHSYIWDSA 186 (671)
T ss_pred EEEEEeCCeEEEEEecCc
Confidence 87765433 246678854
No 349
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=86.96 E-value=1.6 Score=58.23 Aligned_cols=73 Identities=15% Similarity=0.316 Sum_probs=57.5
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
.++|++++|+-+|..++.|-.+|.|.+||++.++.++.+. |..+.. |...+.+..++..++++..-|. +|.+.
T Consensus 130 ~~~Vtsvafn~dg~~l~~G~~~G~V~v~D~~~~k~l~~i~e~~ap~t--~vi~v~~t~~nS~llt~D~~Gs--f~~lv 203 (1206)
T KOG2079|consen 130 QGPVTSVAFNQDGSLLLAGLGDGHVTVWDMHRAKILKVITEHGAPVT--GVIFVGRTSQNSKLLTSDTGGS--FWKLV 203 (1206)
T ss_pred CCcceeeEecCCCceeccccCCCcEEEEEccCCcceeeeeecCCccc--eEEEEEEeCCCcEEEEccCCCc--eEEEE
Confidence 5799999999999999999999999999999999998888 655544 1115555566667777777776 67654
No 350
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=86.93 E-value=4 Score=54.26 Aligned_cols=99 Identities=19% Similarity=0.210 Sum_probs=61.6
Q ss_pred EEEEEEeccceEEEEEeecCCccceE--Ec-cCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCce
Q 000425 20 VVFFGFHRRMSVTVVGTVEGGRAPTK--IK-TDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTI 95 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tieggk~~tk--lk-~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V 95 (1525)
..|+|.+ +..+..|+..-.+..... .+ ...+..-.|++-..+| +||+|+.+|.||+||--..+....|. ...+|
T Consensus 544 ~tflGls-~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~aTt~~G-~iavgs~~G~IRLyd~~g~~AKT~lp~lG~pI 621 (794)
T PF08553_consen 544 QTFLGLS-DNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCFATTEDG-YIAVGSNKGDIRLYDRLGKRAKTALPGLGDPI 621 (794)
T ss_pred ceEEEEC-CCceEEeccCCCCCceeeccccccccCCCceEEEecCCc-eEEEEeCCCcEEeecccchhhhhcCCCCCCCe
Confidence 4567774 566666665433321110 00 0123445666666554 79999999999999943222222333 57899
Q ss_pred eecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 96 KLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 96 ~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
. .|+.+.||+||++.+. ..+.++|.
T Consensus 622 ~-----~iDvt~DGkwilaTc~-tyLlLi~t 646 (794)
T PF08553_consen 622 I-----GIDVTADGKWILATCK-TYLLLIDT 646 (794)
T ss_pred e-----EEEecCCCcEEEEeec-ceEEEEEE
Confidence 9 9999999999987764 45556664
No 351
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=86.75 E-value=1.7 Score=57.93 Aligned_cols=101 Identities=19% Similarity=0.315 Sum_probs=69.0
Q ss_pred CCCEEEEEECCCcEEEEEcCCC-cE-EEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc
Q 000425 62 RLPVLYVAYADGLIRAYNIHTY-AV-HYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV 139 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~-k~-l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g 139 (1525)
.+..++.|+..|.|-..|..+. .+ ....+...+|. +++|+.+|..++.|-.+|.|.+||..+ ......++.
T Consensus 98 ~~~~ivi~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vt-----svafn~dg~~l~~G~~~G~V~v~D~~~--~k~l~~i~e 170 (1206)
T KOG2079|consen 98 VVVPIVIGTSHGHVLLSDMTGNLGPLHQNERVQGPVT-----SVAFNQDGSLLLAGLGDGHVTVWDMHR--AKILKVITE 170 (1206)
T ss_pred eeeeEEEEcCchhhhhhhhhcccchhhcCCccCCcce-----eeEecCCCceeccccCCCcEEEEEccC--Ccceeeeee
Confidence 4566888888888888886653 11 11111346777 999999999999999999999999994 344444556
Q ss_pred cCcCeEEEE---EcCCCCEEEEEECCCCeeEEEee
Q 000425 140 GSQPITSVA---WLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 140 H~~~VtsVa---fsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
|..+++++- |..++..++++..-|. +|...
T Consensus 171 ~~ap~t~vi~v~~t~~nS~llt~D~~Gs--f~~lv 203 (1206)
T KOG2079|consen 171 HGAPVTGVIFVGRTSQNSKLLTSDTGGS--FWKLV 203 (1206)
T ss_pred cCCccceEEEEEEeCCCcEEEEccCCCc--eEEEE
Confidence 666655544 4455666777655555 68765
No 352
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=86.33 E-value=12 Score=45.35 Aligned_cols=96 Identities=25% Similarity=0.250 Sum_probs=61.8
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCC-ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDN-TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV 139 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~-~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g 139 (1525)
.+..+++++.||.+..+|..+++.+.... ... ... +... .+..+++++.||.|.+||..+. +....+..
T Consensus 278 ~~~~vyv~~~~G~l~~~d~~tG~~~W~~~~~~~~~~s-----sp~i--~g~~l~~~~~~G~l~~~d~~tG--~~~~~~~~ 348 (377)
T TIGR03300 278 DDNRLYVTDADGVVVALDRRSGSELWKNDELKYRQLT-----APAV--VGGYLVVGDFEGYLHWLSREDG--SFVARLKT 348 (377)
T ss_pred eCCEEEEECCCCeEEEEECCCCcEEEccccccCCccc-----cCEE--ECCEEEEEeCCCEEEEEECCCC--CEEEEEEc
Confidence 46778888899999999999998776553 111 112 1111 4678899999999999998864 22222333
Q ss_pred cCcCeEE-EEEcCCCCEEEEEECCCCeeEE
Q 000425 140 GSQPITS-VAWLPMLRLLVTLCRDGSLQVW 168 (1525)
Q Consensus 140 H~~~Vts-VafsPdg~~LaSgS~DgtIkVW 168 (1525)
|...+.. -.+. + ..|+.++.||.|..|
T Consensus 349 ~~~~~~~sp~~~-~-~~l~v~~~dG~l~~~ 376 (377)
T TIGR03300 349 DGSGIASPPVVV-G-DGLLVQTRDGDLYAF 376 (377)
T ss_pred CCCccccCCEEE-C-CEEEEEeCCceEEEe
Confidence 4333222 1222 3 357788899988765
No 353
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=86.00 E-value=1.6 Score=38.09 Aligned_cols=34 Identities=15% Similarity=0.409 Sum_probs=30.1
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA 84 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k 84 (1525)
....|.+++|+|+.+++|.|..||.|.++.+ +++
T Consensus 10 l~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl-~~q 43 (47)
T PF12894_consen 10 LPSRVSCMSWCPTMDLIALGTEDGEVLVYRL-NWQ 43 (47)
T ss_pred CCCcEEEEEECCCCCEEEEEECCCeEEEEEC-CCc
Confidence 4567999999999999999999999999998 443
No 354
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=85.63 E-value=2.5 Score=54.36 Aligned_cols=109 Identities=12% Similarity=0.128 Sum_probs=75.2
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccC-CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCc-----E
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTD-LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYA-----V 85 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~-H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k-----~ 85 (1525)
-++.-+..++.+|.+ .+.+.++....+ ...+++.+ -.+.+..+..|++..++|.|+..|.|.++-+.... .
T Consensus 39 Tc~dst~~~l~~GsS-~G~lyl~~R~~~--~~~~~~~~~~~~~~~~~~vs~~e~lvAagt~~g~V~v~ql~~~~p~~~~~ 115 (726)
T KOG3621|consen 39 TCVDATEEYLAMGSS-AGSVYLYNRHTG--EMRKLKNEGATGITCVRSVSSVEYLVAAGTASGRVSVFQLNKELPRDLDY 115 (726)
T ss_pred EEeecCCceEEEecc-cceEEEEecCch--hhhcccccCccceEEEEEecchhHhhhhhcCCceEEeehhhccCCCccee
Confidence 355566677777764 566666554322 22222221 23445667789999999999999999999766532 1
Q ss_pred EEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC
Q 000425 86 HYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI 128 (1525)
Q Consensus 86 l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t 128 (1525)
+..+. |...|+ +++|++++..+++|.+-|+|..--+++
T Consensus 116 ~t~~d~~~~~rVT-----al~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 116 VTPCDKSHKCRVT-----ALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred eccccccCCceEE-----EEEecccccEEeecCCCceEEEEEech
Confidence 22222 678899 999999999999999999998876663
No 355
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=84.89 E-value=8.2 Score=50.28 Aligned_cols=107 Identities=16% Similarity=0.217 Sum_probs=71.7
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe--cCCceeecCceeEEE--eCCCCEEEEEeCCCcEEEEeCCC----
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ--LDNTIKLLGAGAFAF--HPTLEWLFVGDRRGTLLAWDVSI---- 128 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~--H~~~V~~~ga~sVaF--SPdg~~L~SgS~DgtVrVWDl~t---- 128 (1525)
+..+..++.-++.+.-.++.|||.+.+...+.-. ..+.|. +++| .|||+.+++.|..+.|.++--..
T Consensus 35 i~gss~~k~a~V~~~~~~LtIWD~~~~~lE~~~~f~~~~~I~-----dLDWtst~d~qsiLaVGf~~~v~l~~Q~R~dy~ 109 (631)
T PF12234_consen 35 ISGSSIKKIAVVDSSRSELTIWDTRSGVLEYEESFSEDDPIR-----DLDWTSTPDGQSILAVGFPHHVLLYTQLRYDYT 109 (631)
T ss_pred EeecccCcEEEEECCCCEEEEEEcCCcEEEEeeeecCCCcee-----eceeeecCCCCEEEEEEcCcEEEEEEccchhhh
Confidence 4444455555555556679999999887544433 467888 9998 46899999999999999985321
Q ss_pred -CCceee--E--eecccC-cCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 129 -ERPSMI--G--IIQVGS-QPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 129 -~~~s~i--~--il~gH~-~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
..+... . .+..|+ ++|.+.+|.++|.+++.+ ++.+.|+|-
T Consensus 110 ~~~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~s--GNqlfv~dk 155 (631)
T PF12234_consen 110 NKGPSWAPIRKIDISSHTPHPIGDSIWLKDGTLVVGS--GNQLFVFDK 155 (631)
T ss_pred cCCcccceeEEEEeecCCCCCccceeEecCCeEEEEe--CCEEEEECC
Confidence 111111 0 123444 679999999999876654 346777764
No 356
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=84.80 E-value=31 Score=38.31 Aligned_cols=144 Identities=15% Similarity=0.059 Sum_probs=79.6
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEE-ccCC--CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCC
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKI-KTDL--KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDN 93 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tkl-k~~H--~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~ 93 (1525)
...+++.. .+..+..++ .+.+...... .... ............+..++++..++.|..+|.++++.+.... +..
T Consensus 76 ~~~v~v~~-~~~~l~~~d-~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~~ 153 (238)
T PF13360_consen 76 GGRVYVGT-SDGSLYALD-AKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSGKLVALDPKTGKLLWKYPVGEP 153 (238)
T ss_dssp TTEEEEEE-TTSEEEEEE-TTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCSEEEEEETTTTEEEEEEESSTT
T ss_pred cccccccc-ceeeeEecc-cCCcceeeeeccccccccccccccCceEecCEEEEEeccCcEEEEecCCCcEEEEeecCCC
Confidence 34455555 334666666 3445545542 2210 0111222333337888888889999999999999988887 332
Q ss_pred cee----ecC--ceeEEEeCCCCEEEEEeCCCc-EEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCee
Q 000425 94 TIK----LLG--AGAFAFHPTLEWLFVGDRRGT-LLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQ 166 (1525)
Q Consensus 94 ~V~----~~g--a~sVaFSPdg~~L~SgS~Dgt-VrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIk 166 (1525)
... ... ...+.+. ++ .++.++.++. +.+ |..+...... .....+.. ....++..++.++.|+.|.
T Consensus 154 ~~~~~~~~~~~~~~~~~~~-~~-~v~~~~~~g~~~~~-d~~tg~~~w~----~~~~~~~~-~~~~~~~~l~~~~~~~~l~ 225 (238)
T PF13360_consen 154 RGSSPISSFSDINGSPVIS-DG-RVYVSSGDGRVVAV-DLATGEKLWS----KPISGIYS-LPSVDGGTLYVTSSDGRLY 225 (238)
T ss_dssp -SS--EEEETTEEEEEECC-TT-EEEEECCTSSEEEE-ETTTTEEEEE----ECSS-ECE-CEECCCTEEEEEETTTEEE
T ss_pred CCCcceeeecccccceEEE-CC-EEEEEcCCCeEEEE-ECCCCCEEEE----ecCCCccC-CceeeCCEEEEEeCCCEEE
Confidence 210 000 0122222 34 6777777775 555 9986432221 11112222 1346777888888999999
Q ss_pred EEEee
Q 000425 167 VWKTR 171 (1525)
Q Consensus 167 VWDlr 171 (1525)
.||++
T Consensus 226 ~~d~~ 230 (238)
T PF13360_consen 226 ALDLK 230 (238)
T ss_dssp EEETT
T ss_pred EEECC
Confidence 99986
No 357
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.70 E-value=16 Score=48.32 Aligned_cols=121 Identities=15% Similarity=0.221 Sum_probs=72.3
Q ss_pred CCCC-eEEEEEcCCC-CEEEEEECCC-----cEEEEEcCC------CcEEEEEe---cCCceeecCceeEEEeCCCCEEE
Q 000425 50 LKKP-IVNLACHPRL-PVLYVAYADG-----LIRAYNIHT------YAVHYTLQ---LDNTIKLLGAGAFAFHPTLEWLF 113 (1525)
Q Consensus 50 H~~~-V~sLaf~Pdg-~~LaSGs~Dg-----tIrLWDl~t------~k~l~tL~---H~~~V~~~ga~sVaFSPdg~~L~ 113 (1525)
|... |..+ ++-++ .+|++-+.|+ .++|||+.. ..+++... |..+.+-+.+++++.|-+-+.++
T Consensus 63 ~~~siv~~L-~~~~~~~~L~sv~Ed~~~np~llkiw~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv 141 (933)
T KOG2114|consen 63 YEQSIVQFL-YILNKQNFLFSVGEDEQGNPVLLKIWDLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVSEDLKTIV 141 (933)
T ss_pred cchhhhhHh-hcccCceEEEEEeecCCCCceEEEEecccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEEccccEEE
Confidence 4444 3333 33334 4666655543 599999753 23443233 44443322345999999999999
Q ss_pred EEeCCCcEEEEeCC--CCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 114 VGDRRGTLLAWDVS--IERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 114 SgS~DgtVrVWDl~--t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+|-.||.|..+.-. ..+..-.........+|+++.+..++..++-...-..|.++.+.
T Consensus 142 ~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL~~~~d~~s~lFv~Tt~~V~~y~l~ 201 (933)
T KOG2114|consen 142 CGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGLALRSDGKSVLFVATTEQVMLYSLS 201 (933)
T ss_pred EEecCcEEEEEcCcchhccccceeeeccCCCCceeeEEecCCceeEEEEecceeEEEEec
Confidence 99999999988433 21111111223346789999999888763333334568888764
No 358
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=84.66 E-value=20 Score=43.44 Aligned_cols=138 Identities=12% Similarity=0.081 Sum_probs=76.5
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCe-EEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCcee
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPI-VNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIK 96 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V-~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~ 96 (1525)
+..++++.. +..+..++. +.++....... ...+ .+... ++..+++++.|+.+..||..+++.+........+.
T Consensus 65 ~~~v~v~~~-~g~v~a~d~-~tG~~~W~~~~--~~~~~~~p~v--~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~ 138 (377)
T TIGR03300 65 GGKVYAADA-DGTVVALDA-ETGKRLWRVDL--DERLSGGVGA--DGGLVFVGTEKGEVIALDAEDGKELWRAKLSSEVL 138 (377)
T ss_pred CCEEEEECC-CCeEEEEEc-cCCcEeeeecC--CCCcccceEE--cCCEEEEEcCCCEEEEEECCCCcEeeeeccCceee
Confidence 345566553 455666653 33444443332 1222 22222 46788889999999999999999877665222222
Q ss_pred ecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCe-----EEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 97 LLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPI-----TSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 97 ~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~V-----tsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
..... .+..++.++.|+.+..||..+...... .......+ .+.... +..++.++.++.+..+|.+
T Consensus 139 ----~~p~v--~~~~v~v~~~~g~l~a~d~~tG~~~W~--~~~~~~~~~~~~~~sp~~~--~~~v~~~~~~g~v~ald~~ 208 (377)
T TIGR03300 139 ----SPPLV--ANGLVVVRTNDGRLTALDAATGERLWT--YSRVTPALTLRGSASPVIA--DGGVLVGFAGGKLVALDLQ 208 (377)
T ss_pred ----cCCEE--ECCEEEEECCCCeEEEEEcCCCceeeE--EccCCCceeecCCCCCEEE--CCEEEEECCCCEEEEEEcc
Confidence 01111 245677788899999999986422111 11111100 011111 2466777788888888764
No 359
>PF13763 DUF4167: Domain of unknown function (DUF4167)
Probab=84.35 E-value=1.8 Score=41.84 Aligned_cols=38 Identities=24% Similarity=0.290 Sum_probs=34.4
Q ss_pred hhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHH
Q 000425 803 SQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILD 840 (1525)
Q Consensus 803 ~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~Lll 840 (1525)
.++.+||++|++.|+..|+--+||-+||++--|-.++-
T Consensus 36 ~ql~EKY~~LArDA~ssGDrV~aEny~QHAeHY~Ril~ 73 (80)
T PF13763_consen 36 QQLIEKYNQLARDAQSSGDRVLAENYLQHAEHYFRILA 73 (80)
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHH
Confidence 46789999999999999999999999999998877763
No 360
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=83.90 E-value=1.4 Score=57.36 Aligned_cols=86 Identities=16% Similarity=0.151 Sum_probs=68.7
Q ss_pred CcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEEC
Q 000425 83 YAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 83 ~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~ 161 (1525)
++.+++|. |....+ |++|+-+.++|+.|+..|.|+++++.++ .+.+..++|.+.|+.+.=+-+|..+++.|.
T Consensus 1091 Fr~w~~frd~~~~fT-----c~afs~~~~hL~vG~~~Geik~~nv~sG--~~e~s~ncH~SavT~vePs~dgs~~Ltsss 1163 (1516)
T KOG1832|consen 1091 FRSWRSFRDETALFT-----CIAFSGGTNHLAVGSHAGEIKIFNVSSG--SMEESVNCHQSAVTLVEPSVDGSTQLTSSS 1163 (1516)
T ss_pred cccchhhhcccccee-----eEEeecCCceEEeeeccceEEEEEccCc--cccccccccccccccccccCCcceeeeecc
Confidence 45667787 888888 9999999999999999999999999963 344456699999999998889987776554
Q ss_pred CC--CeeEEEeeeecC
Q 000425 162 DG--SLQVWKTRVIIN 175 (1525)
Q Consensus 162 Dg--tIkVWDlr~~~~ 175 (1525)
-. -..+|+......
T Consensus 1164 ~S~PlsaLW~~~s~~~ 1179 (1516)
T KOG1832|consen 1164 SSSPLSALWDASSTGG 1179 (1516)
T ss_pred ccCchHHHhccccccC
Confidence 33 477899875443
No 361
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=83.41 E-value=25 Score=41.11 Aligned_cols=103 Identities=17% Similarity=0.115 Sum_probs=67.4
Q ss_pred CCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceee
Q 000425 18 QPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKL 97 (1525)
Q Consensus 18 qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~ 97 (1525)
..+|+.++|.+. +..++.-.+ +..+ ......++.+-++- -|+.++.||..|.+.+.+.+++.....|.....|.
T Consensus 23 kT~v~igSHs~~-~~avd~~sG-~~~W--e~ilg~RiE~sa~v-vgdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk- 96 (354)
T KOG4649|consen 23 KTLVVIGSHSGI-VIAVDPQSG-NLIW--EAILGVRIECSAIV-VGDFVVLGCYSGGLYFLCVKTGSQIWNFVILETVK- 96 (354)
T ss_pred ceEEEEecCCce-EEEecCCCC-cEEe--ehhhCceeeeeeEE-ECCEEEEEEccCcEEEEEecchhheeeeeehhhhc-
Confidence 567778887543 334443222 1111 11123444443332 47789999999999999999998877777333332
Q ss_pred cCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC
Q 000425 98 LGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE 129 (1525)
Q Consensus 98 ~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~ 129 (1525)
..-...+++..|.+|+.|++....|..++
T Consensus 97 ---~~a~~d~~~glIycgshd~~~yalD~~~~ 125 (354)
T KOG4649|consen 97 ---VRAQCDFDGGLIYCGSHDGNFYALDPKTY 125 (354)
T ss_pred ---cceEEcCCCceEEEecCCCcEEEeccccc
Confidence 02234678999999999999999999964
No 362
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=83.32 E-value=2.1 Score=37.30 Aligned_cols=31 Identities=19% Similarity=0.552 Sum_probs=28.0
Q ss_pred CcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 141 SQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 141 ~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
...|.+++|+|..++|+.++.||.|.++.++
T Consensus 11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~ 41 (47)
T PF12894_consen 11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRLN 41 (47)
T ss_pred CCcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence 4568999999999999999999999999863
No 363
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.15 E-value=15 Score=46.27 Aligned_cols=124 Identities=17% Similarity=0.183 Sum_probs=79.8
Q ss_pred ecCCccceEEccCCCCCeEEEEEcCCCC-------EEEEEECCCcEEEEEcCCCc--EEEEEe-cC----CceeecCcee
Q 000425 37 VEGGRAPTKIKTDLKKPIVNLACHPRLP-------VLYVAYADGLIRAYNIHTYA--VHYTLQ-LD----NTIKLLGAGA 102 (1525)
Q Consensus 37 ieggk~~tklk~~H~~~V~sLaf~Pdg~-------~LaSGs~DgtIrLWDl~t~k--~l~tL~-H~----~~V~~~ga~s 102 (1525)
+|.|+.+...+. |.+ |.-+.+.|+.+ .=++|-.|..|.=||.+-.. .+..-+ |. .... |
T Consensus 363 IE~GKIVeEWk~-~~d-i~mv~~t~d~K~~Ql~~e~TlvGLs~n~vfriDpRv~~~~kl~~~q~kqy~~k~nFs-----c 435 (644)
T KOG2395|consen 363 IERGKIVEEWKF-EDD-INMVDITPDFKFAQLTSEQTLVGLSDNSVFRIDPRVQGKNKLAVVQSKQYSTKNNFS-----C 435 (644)
T ss_pred cccceeeeEeec-cCC-cceeeccCCcchhcccccccEEeecCCceEEecccccCcceeeeeeccccccccccc-----e
Confidence 556666655555 444 77788888743 23567788999999976322 222222 21 2233 3
Q ss_pred EEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 103 FAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 103 VaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
.+-. ...+|+.||.+|.||+||-. ... -...+.+-..+|..|..+-+|++|+..+ +..+.+-|+.
T Consensus 436 ~aTT-~sG~IvvgS~~GdIRLYdri-~~~-AKTAlPgLG~~I~hVdvtadGKwil~Tc-~tyLlLi~t~ 500 (644)
T KOG2395|consen 436 FATT-ESGYIVVGSLKGDIRLYDRI-GRR-AKTALPGLGDAIKHVDVTADGKWILATC-KTYLLLIDTL 500 (644)
T ss_pred eeec-CCceEEEeecCCcEEeehhh-hhh-hhhcccccCCceeeEEeeccCcEEEEec-ccEEEEEEEe
Confidence 3322 33589999999999999974 211 1112346788999999999999887655 5577777765
No 364
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=81.73 E-value=12 Score=50.67 Aligned_cols=153 Identities=14% Similarity=0.160 Sum_probs=91.6
Q ss_pred EEEEcCCCCEEEEEEeccceEEEEEeec-------CCccc--------eEEccCCCCCeEEEEEcCCCCE--EEEEECCC
Q 000425 11 HLALTPLQPVVFFGFHRRMSVTVVGTVE-------GGRAP--------TKIKTDLKKPIVNLACHPRLPV--LYVAYADG 73 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~sI~vv~tie-------ggk~~--------tklk~~H~~~V~sLaf~Pdg~~--LaSGs~Dg 73 (1525)
.+|.++....+|++.. .+++.|+++-. ..... ..++-...-++..+.+.+|+.. ++..+.+.
T Consensus 46 ~la~sn~ysl~Fa~~n-sk~L~vfgtknlLi~~it~D~~n~~Vd~~~~~t~~v~k~~pi~~~v~~~D~t~s~v~~tsng~ 124 (1405)
T KOG3630|consen 46 NLAISNSYSLFFAASN-SKSLAVFGTKNLLIDHITSDSTNSLVDADENLTFKVEKEIPIVIFVCFHDATDSVVVSTSNGE 124 (1405)
T ss_pred hhhcccccceEEEecC-CcceeeeccccceeecccccccccccccccccceeeeccccceEEEeccCCceEEEEEecCCc
Confidence 4677888887776664 55454554311 10011 1122212345666677778664 33444556
Q ss_pred cEEEEEcCCCcE-----EEEEe-cC------CceeecCceeEEEeCCCC-EEEEEeCCCcEEEEeCCCCCceeeEeeccc
Q 000425 74 LIRAYNIHTYAV-----HYTLQ-LD------NTIKLLGAGAFAFHPTLE-WLFVGDRRGTLLAWDVSIERPSMIGIIQVG 140 (1525)
Q Consensus 74 tIrLWDl~t~k~-----l~tL~-H~------~~V~~~ga~sVaFSPdg~-~L~SgS~DgtVrVWDl~t~~~s~i~il~gH 140 (1525)
.|..||+++... ..-|. |. .... ++.|+|.-. ..+.+..|+.|+|.-+....+.+. .+ .-
T Consensus 125 ~v~~fD~~~fs~s~~~~~~pl~~s~ts~ek~vf~~-----~~~wnP~vp~n~av~l~dlsl~V~~~~~~~~~v~-s~-p~ 197 (1405)
T KOG3630|consen 125 AVYSFDLEEFSESRYETTVPLKNSATSFEKPVFQL-----KNVWNPLVPLNSAVDLSDLSLRVKSTKQLAQNVT-SF-PV 197 (1405)
T ss_pred eEEEEehHhhhhhhhhhccccccccchhccccccc-----cccccCCccchhhhhccccchhhhhhhhhhhhhc-cc-Cc
Confidence 788999876422 12222 21 2233 788988654 355666999999987663222222 01 22
Q ss_pred CcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 141 SQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 141 ~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
...++|++|+|.|+.++.|-..|++.-|...
T Consensus 198 t~~~Tav~WSprGKQl~iG~nnGt~vQy~P~ 228 (1405)
T KOG3630|consen 198 TNSQTAVLWSPRGKQLFIGRNNGTEVQYEPS 228 (1405)
T ss_pred ccceeeEEeccccceeeEecCCCeEEEeecc
Confidence 4567899999999999999999999877654
No 365
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=81.36 E-value=5.6 Score=35.56 Aligned_cols=56 Identities=14% Similarity=0.215 Sum_probs=43.0
Q ss_pred hhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh
Q 000425 770 GIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD 834 (1525)
Q Consensus 770 F~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d 834 (1525)
-...++.|+++.|.+++++.-. ..| +...-|..||+..+.+|+++.|.+.|+++-+
T Consensus 4 a~~~~~~g~~~~A~~~~~~~l~---~~P------~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~ 59 (65)
T PF13432_consen 4 ARALYQQGDYDEAIAAFEQALK---QDP------DNPEAWYLLGRILYQQGRYDEALAYYERALE 59 (65)
T ss_dssp HHHHHHCTHHHHHHHHHHHHHC---CST------THHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHcCCHHHHHHHHHHHHH---HCC------CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3567899999999999988752 222 2344467899999999999999999998743
No 366
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=80.37 E-value=59 Score=37.79 Aligned_cols=146 Identities=18% Similarity=0.069 Sum_probs=88.1
Q ss_pred cEEEEEEcCCCCEEEEEE--eccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEE-cCCCc
Q 000425 8 TEVHLALTPLQPVVFFGF--HRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYN-IHTYA 84 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~--~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWD-l~t~k 84 (1525)
.-.+.|++|++..+.+-. .....+.+.. .+.....+.. ...+..-.|++++...++...+...+++. ..++.
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~---~~~~~~~~~~--g~~l~~PS~d~~g~~W~v~~~~~~~~~~~~~~~g~ 99 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGP---AGGPVRPVLT--GGSLTRPSWDPDGWVWTVDDGSGGVRVVRDSASGT 99 (253)
T ss_pred cccceEECCCCCeEEEEEEcCCCCEEEEEc---CCCcceeecc--CCccccccccCCCCEEEEEcCCCceEEEEecCCCc
Confidence 457889999998777666 3444443332 2222222212 23677789999988777777777777774 33332
Q ss_pred E-EEEEe--cCC-ceeecCceeEEEeCCCCEEEEEe---CCCcEEEEeCCCC--C-ceee----EeecccCcCeEEEEEc
Q 000425 85 V-HYTLQ--LDN-TIKLLGAGAFAFHPTLEWLFVGD---RRGTLLAWDVSIE--R-PSMI----GIIQVGSQPITSVAWL 150 (1525)
Q Consensus 85 ~-l~tL~--H~~-~V~~~ga~sVaFSPdg~~L~SgS---~DgtVrVWDl~t~--~-~s~i----~il~gH~~~VtsVafs 150 (1525)
. ...+. ... .|. .+++||||..++... .++.|.|=-+... . .... .........++.++|.
T Consensus 100 ~~~~~v~~~~~~~~I~-----~l~vSpDG~RvA~v~~~~~~~~v~va~V~r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~ 174 (253)
T PF10647_consen 100 GEPVEVDWPGLRGRIT-----ALRVSPDGTRVAVVVEDGGGGRVYVAGVVRDGDGVPRRLTGPRRVAPPLLSDVTDVAWS 174 (253)
T ss_pred ceeEEecccccCCceE-----EEEECCCCcEEEEEEecCCCCeEEEEEEEeCCCCCcceeccceEecccccCcceeeeec
Confidence 2 22233 222 788 999999998776554 4566666544321 1 1111 1122335678999999
Q ss_pred CCCCEEEEEECCC
Q 000425 151 PMLRLLVTLCRDG 163 (1525)
Q Consensus 151 Pdg~~LaSgS~Dg 163 (1525)
++..+++.+...+
T Consensus 175 ~~~~L~V~~~~~~ 187 (253)
T PF10647_consen 175 DDSTLVVLGRSAG 187 (253)
T ss_pred CCCEEEEEeCCCC
Confidence 9998877765544
No 367
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=80.35 E-value=6 Score=50.89 Aligned_cols=79 Identities=18% Similarity=0.094 Sum_probs=61.6
Q ss_pred cccchhhccccCCCCCcchhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHH
Q 000425 750 CGDLAVSLSQAGPQFTQVLRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETF 829 (1525)
Q Consensus 750 ~~elAl~ls~~~~~~d~~~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~ 829 (1525)
-..+|+++-. -...|--||.-|+++-|...+.... -+.+.+..|+.|+..||..||.-+|+.||
T Consensus 437 takvayeld~------~~iefgaaid~~df~ra~afles~~----------~~~da~amw~~laelale~~nl~iaercf 500 (1636)
T KOG3616|consen 437 TAKVAYELDE------GLIEFGAAIDDGDFDRATAFLESLE----------MGPDAEAMWIRLAELALEAGNLFIAERCF 500 (1636)
T ss_pred ceeEEEEecC------cceecccccccCchHHHHHHHHhhc----------cCccHHHHHHHHHHHHHHhccchHHHHHH
Confidence 3445555542 2446777999999999988876532 13466888999999999999999999999
Q ss_pred HHHhhhhHHHHHHHH
Q 000425 830 EVIADYESILDLFIC 844 (1525)
Q Consensus 830 q~~~d~~~LllLy~~ 844 (1525)
..++|..+--+|+-.
T Consensus 501 aai~dvak~r~lhd~ 515 (1636)
T KOG3616|consen 501 AAIGDVAKARFLHDI 515 (1636)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999999988888743
No 368
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=80.22 E-value=2.6 Score=37.81 Aligned_cols=54 Identities=24% Similarity=0.353 Sum_probs=41.2
Q ss_pred HhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhh
Q 000425 773 AIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADY 835 (1525)
Q Consensus 773 Al~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~ 835 (1525)
+++.|+++.|++++++...+.++. .+. |-.||...++.|+++.|++.++++-..
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~~------~~~---~~~la~~~~~~g~~~~A~~~l~~~~~~ 54 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPDN------PEA---RLLLAQCYLKQGQYDEAEELLERLLKQ 54 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTTS------HHH---HHHHHHHHHHTT-HHHHHHHHHCCHGG
T ss_pred ChhccCHHHHHHHHHHHHHHCCCC------HHH---HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 478899999999998876433332 244 446999999999999999999987643
No 369
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=79.55 E-value=26 Score=34.11 Aligned_cols=87 Identities=15% Similarity=0.172 Sum_probs=55.3
Q ss_pred HHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh-----------hhHHHH
Q 000425 772 YAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD-----------YESILD 840 (1525)
Q Consensus 772 LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d-----------~~~Lll 840 (1525)
...+.|+++.|.+.++..-.. .|+.+..... +-.+|...++.|+++.|..+|+.+-. +..+-.
T Consensus 11 ~~~~~~~~~~A~~~~~~~~~~---~~~~~~~~~~---~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~ 84 (119)
T TIGR02795 11 LVLKAGDYADAIQAFQAFLKK---YPKSTYAPNA---HYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPKAPDALLKLGM 84 (119)
T ss_pred HHHHcCCHHHHHHHHHHHHHH---CCCccccHHH---HHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCcccHHHHHHHH
Confidence 356789999999998776422 2222212122 34599999999999999999998753 223334
Q ss_pred HHHHhCCHHHHHHHHHHHHHhcCC
Q 000425 841 LFICHLNPSAMRRLAQRLEEEGAN 864 (1525)
Q Consensus 841 Ly~~~gd~e~L~kLa~~ae~~g~~ 864 (1525)
+|...|+.+.-.+..+.+.+..-+
T Consensus 85 ~~~~~~~~~~A~~~~~~~~~~~p~ 108 (119)
T TIGR02795 85 SLQELGDKEKAKATLQQVIKRYPG 108 (119)
T ss_pred HHHHhCChHHHHHHHHHHHHHCcC
Confidence 555566666655555555444433
No 370
>PF02897 Peptidase_S9_N: Prolyl oligopeptidase, N-terminal beta-propeller domain; InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs. Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=79.09 E-value=45 Score=41.00 Aligned_cols=109 Identities=11% Similarity=0.124 Sum_probs=65.2
Q ss_pred EEEEcCCCCEEEEEEeccce----EEEEEeecCCccce-EEccCCCCCeEEEEEcCCCCEEEEEECCC-----------c
Q 000425 11 HLALTPLQPVVFFGFHRRMS----VTVVGTVEGGRAPT-KIKTDLKKPIVNLACHPRLPVLYVAYADG-----------L 74 (1525)
Q Consensus 11 ~LA~hP~qp~Vf~s~~~d~s----I~vv~tieggk~~t-klk~~H~~~V~sLaf~Pdg~~LaSGs~Dg-----------t 74 (1525)
..+++|++.+++++.+..++ +.+.+. +.++... .+.. ..-.++.|.+++..|+....|. .
T Consensus 128 ~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl-~tg~~l~d~i~~---~~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~ 203 (414)
T PF02897_consen 128 GFSVSPDGKRLAYSLSDGGSEWYTLRVFDL-ETGKFLPDGIEN---PKFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQ 203 (414)
T ss_dssp EEEETTTSSEEEEEEEETTSSEEEEEEEET-TTTEEEEEEEEE---EESEEEEECTTSSEEEEEECSTTTSS-CCGCCEE
T ss_pred eeeECCCCCEEEEEecCCCCceEEEEEEEC-CCCcCcCCcccc---cccceEEEeCCCCEEEEEEeCcccccccCCCCcE
Confidence 45789999999999876444 555553 3332221 1111 1112399999988776665443 3
Q ss_pred EEEEEcCCCcE--EEEEe-cC-Cc-eeecCceeEEEeCCCCEEEEEe-C--C-CcEEEEeCCC
Q 000425 75 IRAYNIHTYAV--HYTLQ-LD-NT-IKLLGAGAFAFHPTLEWLFVGD-R--R-GTLLAWDVSI 128 (1525)
Q Consensus 75 IrLWDl~t~k~--l~tL~-H~-~~-V~~~ga~sVaFSPdg~~L~SgS-~--D-gtVrVWDl~t 128 (1525)
|+.|.+.+... ...++ .. .. .. .+..++|+++++..+ . + ..+.+-|...
T Consensus 204 v~~~~~gt~~~~d~lvfe~~~~~~~~~-----~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~ 261 (414)
T PF02897_consen 204 VYRHKLGTPQSEDELVFEEPDEPFWFV-----SVSRSKDGRYLFISSSSGTSESEVYLLDLDD 261 (414)
T ss_dssp EEEEETTS-GGG-EEEEC-TTCTTSEE-----EEEE-TTSSEEEEEEESSSSEEEEEEEECCC
T ss_pred EEEEECCCChHhCeeEEeecCCCcEEE-----EEEecCcccEEEEEEEccccCCeEEEEeccc
Confidence 77888877632 34454 22 22 45 788899999987544 2 3 3466667774
No 371
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=78.29 E-value=1.3 Score=36.55 Aligned_cols=29 Identities=28% Similarity=0.592 Sum_probs=24.6
Q ss_pred ccCCCCceecCCC-----CCCCccccceeecCCC
Q 000425 1486 VCDLCGAKFSALS-----APGCIICGMGSIKRSD 1514 (1525)
Q Consensus 1486 ~Cp~cgA~y~~~~-----g~~C~iC~l~~IG~~~ 1514 (1525)
.||-||+.|+..| ..+|.+|+-..|-++|
T Consensus 3 ~C~~Cg~~Yh~~~~pP~~~~~Cd~cg~~L~qR~D 36 (36)
T PF05191_consen 3 ICPKCGRIYHIEFNPPKVEGVCDNCGGELVQRKD 36 (36)
T ss_dssp EETTTTEEEETTTB--SSTTBCTTTTEBEBEEGG
T ss_pred CcCCCCCccccccCCCCCCCccCCCCCeeEeCCC
Confidence 6999999999877 6899999987776665
No 372
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=77.14 E-value=8.2 Score=46.71 Aligned_cols=104 Identities=13% Similarity=0.122 Sum_probs=61.1
Q ss_pred cCCCCEEEEEE---------CCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC
Q 000425 60 HPRLPVLYVAY---------ADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER 130 (1525)
Q Consensus 60 ~Pdg~~LaSGs---------~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~ 130 (1525)
|||+++++... ..+.+.|||+.+++..........+. ...|||+|++++-.. ++.|.+++..+.
T Consensus 1 S~d~~~~l~~~~~~~~~r~s~~~~y~i~d~~~~~~~~l~~~~~~~~-----~~~~sP~g~~~~~v~-~~nly~~~~~~~- 73 (353)
T PF00930_consen 1 SPDGKFVLFATNYTKQWRHSFKGDYYIYDIETGEITPLTPPPPKLQ-----DAKWSPDGKYIAFVR-DNNLYLRDLATG- 73 (353)
T ss_dssp -TTSSEEEEEEEEEEESSSEEEEEEEEEETTTTEEEESS-EETTBS-----EEEE-SSSTEEEEEE-TTEEEEESSTTS-
T ss_pred CCCCCeEEEEECcEEeeeeccceeEEEEecCCCceEECcCCccccc-----cceeecCCCeeEEEe-cCceEEEECCCC-
Confidence 46666655522 24678889998865433222445566 889999999988775 578999987643
Q ss_pred ceeeEe-ecc-------c---------CcCeEEEEEcCCCCEEEEEEC-CCCeeEEEee
Q 000425 131 PSMIGI-IQV-------G---------SQPITSVAWLPMLRLLVTLCR-DGSLQVWKTR 171 (1525)
Q Consensus 131 ~s~i~i-l~g-------H---------~~~VtsVafsPdg~~LaSgS~-DgtIkVWDlr 171 (1525)
...++ ..+ - -..-..+-||||+++|+.... +..|+.+.+-
T Consensus 74 -~~~~lT~dg~~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~ 131 (353)
T PF00930_consen 74 -QETQLTTDGEPGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLP 131 (353)
T ss_dssp -EEEESES--TTTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEE
T ss_pred -CeEEeccccceeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEee
Confidence 11111 111 0 112246889999999887654 4467777654
No 373
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=77.10 E-value=11 Score=38.89 Aligned_cols=63 Identities=16% Similarity=0.162 Sum_probs=44.5
Q ss_pred EEEeCCC-CEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 103 FAFHPTL-EWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 103 VaFSPdg-~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+.|.-|| +.|+.||+|..||||+-......+. . ...|++++-... ..|+.+..+|||-+++-.
T Consensus 8 ~d~d~dg~~eLlvGs~D~~IRvf~~~e~~~Ei~----e-~~~v~~L~~~~~-~~F~Y~l~NGTVGvY~~~ 71 (111)
T PF14783_consen 8 FDFDGDGENELLVGSDDFEIRVFKGDEIVAEIT----E-TDKVTSLCSLGG-GRFAYALANGTVGVYDRS 71 (111)
T ss_pred EecCCCCcceEEEecCCcEEEEEeCCcEEEEEe----c-ccceEEEEEcCC-CEEEEEecCCEEEEEeCc
Confidence 3444455 5799999999999998663222222 2 456777776666 568889999999988753
No 374
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=76.88 E-value=43 Score=40.94 Aligned_cols=142 Identities=10% Similarity=0.045 Sum_probs=73.1
Q ss_pred cEEEEEEcCCCCEEEEEEe----------c-cceEEEEEeecC-Ccc--ceEEccCCCCCeEEEEEcCCCCEEEEEECCC
Q 000425 8 TEVHLALTPLQPVVFFGFH----------R-RMSVTVVGTVEG-GRA--PTKIKTDLKKPIVNLACHPRLPVLYVAYADG 73 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~----------~-d~sI~vv~tieg-gk~--~tklk~~H~~~V~sLaf~Pdg~~LaSGs~Dg 73 (1525)
.-+.|++.+.+-...+... . ...|.++...++ ++. .+.+.. .-...+++++.+++ |++++.+.
T Consensus 15 ~P~~ia~d~~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~-~l~~p~Gi~~~~~G--lyV~~~~~ 91 (367)
T TIGR02604 15 NPIAVCFDERGRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAE-ELSMVTGLAVAVGG--VYVATPPD 91 (367)
T ss_pred CCceeeECCCCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeec-CCCCccceeEecCC--EEEeCCCe
Confidence 3467788888775433221 1 114544443222 332 223322 33456889999988 44455555
Q ss_pred cEEEEEcCCC-----c--EE-EEEe-c----CCceeecCceeEEEeCCCCEEEEEeCC-------------------CcE
Q 000425 74 LIRAYNIHTY-----A--VH-YTLQ-L----DNTIKLLGAGAFAFHPTLEWLFVGDRR-------------------GTL 121 (1525)
Q Consensus 74 tIrLWDl~t~-----k--~l-~tL~-H----~~~V~~~ga~sVaFSPdg~~L~SgS~D-------------------gtV 121 (1525)
..++.|.... + .+ ..+. . ....+ .++|.|||...++-+.. +.|
T Consensus 92 i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~-----~l~~gpDG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i 166 (367)
T TIGR02604 92 ILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLN-----SLAWGPDGWLYFNHGNTLASKVTRPGTSDESRQGLGGGL 166 (367)
T ss_pred EEEEeCCCCCCCCCCccEEEEEccCCCCCccccccc-----CceECCCCCEEEecccCCCceeccCCCccCcccccCceE
Confidence 4445454321 2 12 1122 1 23466 89999999866654421 233
Q ss_pred EEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE
Q 000425 122 LAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 122 rVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
..+|..+....+. ..+ ......++|+|+|.++++-.
T Consensus 167 ~r~~pdg~~~e~~--a~G-~rnp~Gl~~d~~G~l~~tdn 202 (367)
T TIGR02604 167 FRYNPDGGKLRVV--AHG-FQNPYGHSVDSWGDVFFCDN 202 (367)
T ss_pred EEEecCCCeEEEE--ecC-cCCCccceECCCCCEEEEcc
Confidence 3344442212222 123 34456899999998887644
No 375
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=76.71 E-value=19 Score=38.02 Aligned_cols=89 Identities=10% Similarity=0.050 Sum_probs=59.5
Q ss_pred hHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHH
Q 000425 771 IYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSA 850 (1525)
Q Consensus 771 ~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~ 850 (1525)
....+.|+++.|.+..+..-. ..| ....-|..+|..+++.|+++-|.++|+++-.. .-+|.+.
T Consensus 32 ~~~~~~g~~~~A~~~~~~al~---~~P------~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l--------~p~~~~a 94 (144)
T PRK15359 32 YASWQEGDYSRAVIDFSWLVM---AQP------WSWRAHIALAGTWMMLKEYTTAINFYGHALML--------DASHPEP 94 (144)
T ss_pred HHHHHcCCHHHHHHHHHHHHH---cCC------CcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc--------CCCCcHH
Confidence 345788899999888776421 111 12444678999999999999999999998653 3455555
Q ss_pred HHHHHHHHHHhcCChhHHHHHHHHHh
Q 000425 851 MRRLAQRLEEEGANPELRRYCERILR 876 (1525)
Q Consensus 851 L~kLa~~ae~~g~~~~~~~~c~~il~ 876 (1525)
+-.++......|...+++..+++.+.
T Consensus 95 ~~~lg~~l~~~g~~~eAi~~~~~Al~ 120 (144)
T PRK15359 95 VYQTGVCLKMMGEPGLAREAFQTAIK 120 (144)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 55555555555555555555555443
No 376
>PRK13616 lipoprotein LpqB; Provisional
Probab=76.38 E-value=17 Score=47.43 Aligned_cols=107 Identities=7% Similarity=-0.048 Sum_probs=60.5
Q ss_pred CCeEEEEEcCCCCEEEEEE------CCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCC-CcEEEE
Q 000425 52 KPIVNLACHPRLPVLYVAY------ADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRR-GTLLAW 124 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs------~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~D-gtVrVW 124 (1525)
..+.+.+++|+|..++... .|+.-+||=...+.....+....... .-.|+|||.++.+.++. ..+++.
T Consensus 350 ~~vsspaiSpdG~~vA~v~~~~~~~~d~~s~Lwv~~~gg~~~~lt~g~~~t-----~PsWspDG~~lw~v~dg~~~~~v~ 424 (591)
T PRK13616 350 GNITSAALSRSGRQVAAVVTLGRGAPDPASSLWVGPLGGVAVQVLEGHSLT-----RPSWSLDADAVWVVVDGNTVVRVI 424 (591)
T ss_pred cCcccceECCCCCEEEEEEeecCCCCCcceEEEEEeCCCcceeeecCCCCC-----CceECCCCCceEEEecCcceEEEe
Confidence 4678899999999876554 35444455322222223333222356 78999999988777643 223333
Q ss_pred eCCCCCceee------Eeec-ccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 125 DVSIERPSMI------GIIQ-VGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 125 Dl~t~~~s~i------~il~-gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.... ..+. +... .....|..+.|||||..++... ++.|
T Consensus 425 ~~~~~-gql~~~~vd~ge~~~~~~g~Issl~wSpDG~RiA~i~-~g~v 470 (591)
T PRK13616 425 RDPAT-GQLARTPVDASAVASRVPGPISELQLSRDGVRAAMII-GGKV 470 (591)
T ss_pred ccCCC-ceEEEEeccCchhhhccCCCcCeEEECCCCCEEEEEE-CCEE
Confidence 22110 0110 0000 2245699999999999887765 3533
No 377
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=75.80 E-value=1.1e+02 Score=38.26 Aligned_cols=70 Identities=17% Similarity=0.267 Sum_probs=45.0
Q ss_pred EEEEEEcCCCCEEEEEEeccc----------eEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRM----------SVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~----------sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLW 78 (1525)
...+|+.|.+-.+......++ .|.++. ..|+....+.-.+ +.|.++.|..+..+++ -..||++++|
T Consensus 31 ~~~va~a~~gGpIAi~~d~~k~~~~~~~~p~~I~iys--~sG~ll~~i~w~~-~~iv~~~wt~~e~Lvv-V~~dG~v~vy 106 (410)
T PF04841_consen 31 DYIVAVAPYGGPIAIIRDESKLVPVGSAKPNSIQIYS--SSGKLLSSIPWDS-GRIVGMGWTDDEELVV-VQSDGTVRVY 106 (410)
T ss_pred ceeEEEcCCCceEEEEecCcccccccCCCCcEEEEEC--CCCCEeEEEEECC-CCEEEEEECCCCeEEE-EEcCCEEEEE
Confidence 446788887666655544422 233333 2244455555545 8999999998665554 4589999999
Q ss_pred EcCC
Q 000425 79 NIHT 82 (1525)
Q Consensus 79 Dl~t 82 (1525)
|+..
T Consensus 107 ~~~G 110 (410)
T PF04841_consen 107 DLFG 110 (410)
T ss_pred eCCC
Confidence 9863
No 378
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=75.24 E-value=96 Score=36.34 Aligned_cols=156 Identities=13% Similarity=0.103 Sum_probs=88.5
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCC-----
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTY----- 83 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~----- 83 (1525)
..-|+++|+.-.+|+-.+....+..+++ .++....+...-.+-.-+|++--++.++++.-.++.+.++++...
T Consensus 24 ~SGLTy~pd~~tLfaV~d~~~~i~els~--~G~vlr~i~l~g~~D~EgI~y~g~~~~vl~~Er~~~L~~~~~~~~~~~~~ 101 (248)
T PF06977_consen 24 LSGLTYNPDTGTLFAVQDEPGEIYELSL--DGKVLRRIPLDGFGDYEGITYLGNGRYVLSEERDQRLYIFTIDDDTTSLD 101 (248)
T ss_dssp EEEEEEETTTTEEEEEETTTTEEEEEET--T--EEEEEE-SS-SSEEEEEE-STTEEEEEETTTTEEEEEEE----TT--
T ss_pred ccccEEcCCCCeEEEEECCCCEEEEEcC--CCCEEEEEeCCCCCCceeEEEECCCEEEEEEcCCCcEEEEEEeccccccc
Confidence 5679999998888888776666644442 344444444322356778888877766665545888988887321
Q ss_pred -cEEEEEe------cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCC--CCcee-eEe------ecccCcCeEEE
Q 000425 84 -AVHYTLQ------LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSI--ERPSM-IGI------IQVGSQPITSV 147 (1525)
Q Consensus 84 -k~l~tL~------H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t--~~~s~-i~i------l~gH~~~VtsV 147 (1525)
.....+. ++..+- .|+|.|.++.|+.+-...-.++|.+.. ..... ... ...+...+.++
T Consensus 102 ~~~~~~~~l~~~~~~N~G~E-----Gla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l 176 (248)
T PF06977_consen 102 RADVQKISLGFPNKGNKGFE-----GLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGL 176 (248)
T ss_dssp EEEEEEEE---S---SS--E-----EEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EE
T ss_pred hhhceEEecccccCCCcceE-----EEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccce
Confidence 1112222 334577 999999888777777666666776553 10111 100 11234567899
Q ss_pred EEcCCC-CEEEEEECCCCeeEEEee
Q 000425 148 AWLPML-RLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 148 afsPdg-~~LaSgS~DgtIkVWDlr 171 (1525)
++||.. .+++-...+..|..+|..
T Consensus 177 ~~~p~t~~lliLS~es~~l~~~d~~ 201 (248)
T PF06977_consen 177 SYDPRTGHLLILSDESRLLLELDRQ 201 (248)
T ss_dssp EEETTTTEEEEEETTTTEEEEE-TT
T ss_pred EEcCCCCeEEEEECCCCeEEEECCC
Confidence 999964 556666667778777754
No 379
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=74.34 E-value=32 Score=33.44 Aligned_cols=107 Identities=14% Similarity=0.169 Sum_probs=61.5
Q ss_pred HHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCCcchHHH----HHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchH
Q 000425 1312 VCFKTGLAHLEQNQLPDALSCFDEAFLALAKDHSRGADV----KAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKD 1387 (1525)
Q Consensus 1312 ~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~----~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~ 1387 (1525)
..+..|..++..|++.+|+..|+.++..--++.. ..++ -.+..--.+|--|+.+++++.+.
T Consensus 4 ~~~~~~~~~~~~~~~~~A~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~-------------- 68 (119)
T TIGR02795 4 AYYDAALLVLKAGDYADAIQAFQAFLKKYPKSTY-APNAHYWLGEAYYAQGKYADAAKAFLAVVKK-------------- 68 (119)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccc-cHHHHHHHHHHHHhhccHHHHHHHHHHHHHH--------------
Confidence 4678899999999999999999999763221111 1111 11111112233333332222221
Q ss_pred HHHHHHHhhcCCCCchhHH--HHHHHHHHHHHHHhccHHHHHHHHHHHHccCC-hhhHHHHHH
Q 000425 1388 EMARLSRHLGSLPLQTKHR--INCIRTAIKRNMEVQNYAYAKQMLELLLSKAP-ASKQDELRS 1447 (1525)
Q Consensus 1388 R~lELaaYfT~c~Lqp~H~--~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p-~~~~~qark 1447 (1525)
.|.|. ..++-..-....+.+++..|....+++++..| ...+.+|++
T Consensus 69 --------------~p~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~ 117 (119)
T TIGR02795 69 --------------YPKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYPGSSAAKLAQK 117 (119)
T ss_pred --------------CCCCCcccHHHHHHHHHHHHhCChHHHHHHHHHHHHHCcCChhHHHHHh
Confidence 12221 11222222334678999999999999999998 555666654
No 380
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=74.08 E-value=82 Score=42.20 Aligned_cols=114 Identities=10% Similarity=0.184 Sum_probs=79.2
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEE---cCCCcEEEEEe--cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYN---IHTYAVHYTLQ--LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD 125 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWD---l~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD 125 (1525)
..|+.++++|.+-..+|+|-.||.|..+. ++..-.-..+. ...+|+ .+.|..++..++-......|.+|.
T Consensus 125 ~~p~s~l~Vs~~l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pIT-----gL~~~~d~~s~lFv~Tt~~V~~y~ 199 (933)
T KOG2114|consen 125 PSPASSLAVSEDLKTIVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPIT-----GLALRSDGKSVLFVATTEQVMLYS 199 (933)
T ss_pred CCcceEEEEEccccEEEEEecCcEEEEEcCcchhccccceeeeccCCCCce-----eeEEecCCceeEEEEecceeEEEE
Confidence 46789999999999999999999999884 22211122223 457899 999999988744344456788998
Q ss_pred CCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEEee
Q 000425 126 VSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 126 l~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+.... .....++.|...++|.+|++...-|+.|+ +..+.+||..
T Consensus 200 l~gr~-p~~~~ld~~G~~lnCss~~~~t~qfIca~-~e~l~fY~sd 243 (933)
T KOG2114|consen 200 LSGRT-PSLKVLDNNGISLNCSSFSDGTYQFICAG-SEFLYFYDSD 243 (933)
T ss_pred ecCCC-cceeeeccCCccceeeecCCCCccEEEec-CceEEEEcCC
Confidence 88322 12234678888999999998766454443 3467777754
No 381
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=73.92 E-value=34 Score=41.77 Aligned_cols=98 Identities=16% Similarity=0.136 Sum_probs=60.3
Q ss_pred HHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHHH
Q 000425 1312 VCFKTGLAHLEQNQLPDALSCFDEAFLALAKDHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMAR 1391 (1525)
Q Consensus 1312 ~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~lE 1391 (1525)
..+++|-++|..|+|.+|+..|+++|..--. . ..-+. .+|..+ ..+.+-+ +. +...+|.++
T Consensus 4 ~l~~~a~~a~~~~~~~~Ai~~~~~Al~~~P~---~---~~a~~------~~a~~~-~~~g~~~-----eA-l~~~~~Al~ 64 (356)
T PLN03088 4 DLEDKAKEAFVDDDFALAVDLYTQAIDLDPN---N---AELYA------DRAQAN-IKLGNFT-----EA-VADANKAIE 64 (356)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHhCCC---C---HHHHH------HHHHHH-HHcCCHH-----HH-HHHHHHHHH
Confidence 3578899999999999999999999872111 1 11122 222222 0111111 00 111223333
Q ss_pred HHHhhcCCCCchhHHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC
Q 000425 1392 LSRHLGSLPLQTKHRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP 1438 (1525)
Q Consensus 1392 LaaYfT~c~Lqp~H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p 1438 (1525)
|.|.|...-+|.| ...++.|+|..|....++.|++.|
T Consensus 65 ---------l~P~~~~a~~~lg-~~~~~lg~~~eA~~~~~~al~l~P 101 (356)
T PLN03088 65 ---------LDPSLAKAYLRKG-TACMKLEEYQTAKAALEKGASLAP 101 (356)
T ss_pred ---------hCcCCHHHHHHHH-HHHHHhCCHHHHHHHHHHHHHhCC
Confidence 4555555555555 355789999999999999999998
No 382
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=73.86 E-value=70 Score=38.14 Aligned_cols=100 Identities=16% Similarity=0.143 Sum_probs=57.4
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccC-CCCCeEEEEEcCCCCEEEEEECC-----CcEEEEEcCCC-c
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTD-LKKPIVNLACHPRLPVLYVAYAD-----GLIRAYNIHTY-A 84 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~-H~~~V~sLaf~Pdg~~LaSGs~D-----gtIrLWDl~t~-k 84 (1525)
+++||..|..++=..+-.+...+-...+...+..+... -.+.---=.|||+|.+|...-.| |.|-|||.+.+ +
T Consensus 73 i~~~p~~~ravafARrPGtf~~vfD~~~~~~pv~~~s~~~RHfyGHGvfs~dG~~LYATEndfd~~rGViGvYd~r~~fq 152 (366)
T COG3490 73 IAFHPALPRAVAFARRPGTFAMVFDPNGAQEPVTLVSQEGRHFYGHGVFSPDGRLLYATENDFDPNRGVIGVYDAREGFQ 152 (366)
T ss_pred eecCCCCcceEEEEecCCceEEEECCCCCcCcEEEecccCceeecccccCCCCcEEEeecCCCCCCCceEEEEecccccc
Confidence 68899988654333222333222223333332222210 00111113699999998765444 78999997632 4
Q ss_pred EEEEEe-cCCceeecCceeEEEeCCCCEEEEEe
Q 000425 85 VHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD 116 (1525)
Q Consensus 85 ~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS 116 (1525)
.+..+. |.-.-. .+.|.+||+.++.+.
T Consensus 153 rvgE~~t~GiGpH-----ev~lm~DGrtlvvan 180 (366)
T COG3490 153 RVGEFSTHGIGPH-----EVTLMADGRTLVVAN 180 (366)
T ss_pred eecccccCCcCcc-----eeEEecCCcEEEEeC
Confidence 455555 665556 889999999888664
No 383
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance [].
Probab=72.93 E-value=29 Score=45.99 Aligned_cols=113 Identities=17% Similarity=0.206 Sum_probs=74.9
Q ss_pred CeEEEEEcC--CCCEEEEEECCCcEEEEEcCC-------C-------------cEEEEEecCCceeecCceeEEEe--CC
Q 000425 53 PIVNLACHP--RLPVLYVAYADGLIRAYNIHT-------Y-------------AVHYTLQLDNTIKLLGAGAFAFH--PT 108 (1525)
Q Consensus 53 ~V~sLaf~P--dg~~LaSGs~DgtIrLWDl~t-------~-------------k~l~tL~H~~~V~~~ga~sVaFS--Pd 108 (1525)
.|+-+.+.- ....|+.+++||.|.+|.+++ . ++.+.+....++ |.+++| ..
T Consensus 102 tIN~i~v~~lg~~EVLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~v~~Sa-----WGLdIh~~~~ 176 (717)
T PF08728_consen 102 TINFIKVGDLGGEEVLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLRVGASA-----WGLDIHDYKK 176 (717)
T ss_pred eeeEEEecccCCeeEEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEeecCCce-----eEEEEEecCc
Confidence 344444432 345789999999999997531 1 112222223344 499999 77
Q ss_pred CCEEEEEeCCCcEEEEeCCCC--CceeeEeecccCcCeEEEEEcCCC---C---EEEEEECCCCeeEEEee
Q 000425 109 LEWLFVGDRRGTLLAWDVSIE--RPSMIGIIQVGSQPITSVAWLPML---R---LLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 109 g~~L~SgS~DgtVrVWDl~t~--~~s~i~il~gH~~~VtsVafsPdg---~---~LaSgS~DgtIkVWDlr 171 (1525)
.+.|++++....|.||-.... ...... -..|.+.|.+|+|-++. . ++++++-.|.+.+|++.
T Consensus 177 ~rlIAVSsNs~~VTVFaf~l~~~r~~~~~-s~~~~hNIP~VSFl~~~~d~~G~v~v~a~dI~G~v~~~~I~ 246 (717)
T PF08728_consen 177 SRLIAVSSNSQEVTVFAFALVDERFYHVP-SHQHSHNIPNVSFLDDDLDPNGHVKVVATDISGEVWTFKIK 246 (717)
T ss_pred ceEEEEecCCceEEEEEEecccccccccc-ccccccCCCeeEeecCCCCCccceEEEEEeccCcEEEEEEE
Confidence 888999998899998854431 111100 11367889999998753 2 78889999999999985
No 384
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=72.53 E-value=13 Score=49.15 Aligned_cols=110 Identities=13% Similarity=0.112 Sum_probs=79.9
Q ss_pred HHHHHcCCCCcccchhhccccC-----CCCCcch---hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHH
Q 000425 740 LDILAKGPPVCGDLAVSLSQAG-----PQFTQVL---RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQ 811 (1525)
Q Consensus 740 l~~L~~~~~~~~elAl~ls~~~-----~~~d~~~---rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~ 811 (1525)
+....+ .|..|.|+++-+.. -..|..+ -.++-.+.|+++.|+++.++.. .. |+... |..
T Consensus 433 l~a~~~--~g~~~~a~~~f~~m~~~~g~~p~~~~y~~li~~l~r~G~~~eA~~~~~~~~----~~----p~~~~---~~~ 499 (697)
T PLN03081 433 LSACRY--SGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAP----FK----PTVNM---WAA 499 (697)
T ss_pred HHHHhc--CCcHHHHHHHHHHHHHhcCCCCCccchHhHHHHHHhcCCHHHHHHHHHHCC----CC----CCHHH---HHH
Confidence 344444 44777787775421 1111111 3456678999999999987631 11 23333 778
Q ss_pred HHHHHHHcCChHHHHHHHHHH--------hhhhHHHHHHHHhCCHHHHHHHHHHHHHhc
Q 000425 812 LGYACIKYGQFDSAKETFEVI--------ADYESILDLFICHLNPSAMRRLAQRLEEEG 862 (1525)
Q Consensus 812 Lg~~AL~~G~~~lAeec~q~~--------~d~~~LllLy~~~gd~e~L~kLa~~ae~~g 862 (1525)
|-.+|-.+|++++|+++++++ ..|..|+-+|...|+.+.-.++.+..+++|
T Consensus 500 Ll~a~~~~g~~~~a~~~~~~l~~~~p~~~~~y~~L~~~y~~~G~~~~A~~v~~~m~~~g 558 (697)
T PLN03081 500 LLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKG 558 (697)
T ss_pred HHHHHHHcCCcHHHHHHHHHHhCCCCCCCcchHHHHHHHHhCCCHHHHHHHHHHHHHcC
Confidence 999999999999999999974 578889999999999999999999887777
No 385
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=72.43 E-value=9 Score=43.89 Aligned_cols=62 Identities=23% Similarity=0.241 Sum_probs=45.1
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEe-cC-CceeecCceeEEEeCCCCEEEEE--eCCCcEEEEeCCC
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LD-NTIKLLGAGAFAFHPTLEWLFVG--DRRGTLLAWDVSI 128 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~-~~V~~~ga~sVaFSPdg~~L~Sg--S~DgtVrVWDl~t 128 (1525)
++.+..+++.|+.||.|++.-++.+-... |. .++. ....+..++.+..+ |.|..++.||+..
T Consensus 113 ~~~~~c~~~~dg~ir~~n~~p~k~~g~~g~h~~~~~e-----~~ivv~sd~~i~~a~~S~d~~~k~W~ve~ 178 (238)
T KOG2444|consen 113 DSSLGCVGAQDGRIRACNIKPNKVLGYVGQHNFESGE-----ELIVVGSDEFLKIADTSHDRVLKKWNVEK 178 (238)
T ss_pred ccceeEEeccCCceeeeccccCceeeeeccccCCCcc-----eeEEecCCceEEeeccccchhhhhcchhh
Confidence 34578899999999999998777766665 65 4555 55555666666666 7777777777763
No 386
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=72.17 E-value=39 Score=34.22 Aligned_cols=83 Identities=16% Similarity=0.209 Sum_probs=58.2
Q ss_pred hHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh-----h---hhHHHHHH
Q 000425 771 IYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA-----D---YESILDLF 842 (1525)
Q Consensus 771 ~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~-----d---~~~LllLy 842 (1525)
...++.|+++.|.+.++..-... |..... |..+|...+..|+++-|.++|+++- + +-.+-.+|
T Consensus 25 ~~~~~~~~~~~A~~~~~~~~~~~------p~~~~~---~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~ 95 (135)
T TIGR02552 25 YNLYQQGRYDEALKLFQLLAAYD------PYNSRY---WLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECL 95 (135)
T ss_pred HHHHHcccHHHHHHHHHHHHHhC------CCcHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHH
Confidence 44677899999999987754211 122233 5679999999999999999999884 2 33356677
Q ss_pred HHhCCHHHHHHHHHHHHHhc
Q 000425 843 ICHLNPSAMRRLAQRLEEEG 862 (1525)
Q Consensus 843 ~~~gd~e~L~kLa~~ae~~g 862 (1525)
...|+.++-.+..+.+.+..
T Consensus 96 ~~~g~~~~A~~~~~~al~~~ 115 (135)
T TIGR02552 96 LALGEPESALKALDLAIEIC 115 (135)
T ss_pred HHcCCHHHHHHHHHHHHHhc
Confidence 88888776666666555443
No 387
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=72.00 E-value=27 Score=40.86 Aligned_cols=87 Identities=17% Similarity=0.156 Sum_probs=72.5
Q ss_pred hHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhh--------hHHHHHH
Q 000425 771 IYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADY--------ESILDLF 842 (1525)
Q Consensus 771 ~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~--------~~LllLy 842 (1525)
-.++..||+..|.+.+++.- .++| ..|.-|--||-.+.+.|+++.|.-.|-++-++ .-|..+|
T Consensus 108 k~~~~~g~~~~A~~~~rkA~-----~l~p----~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~ 178 (257)
T COG5010 108 KNQIRNGNFGEAVSVLRKAA-----RLAP----TDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSL 178 (257)
T ss_pred HHHHHhcchHHHHHHHHHHh-----ccCC----CChhhhhHHHHHHHHccChhHHHHHHHHHHHhccCCchhhhhHHHHH
Confidence 34688999999999998865 2222 34777889999999999999999999998765 6799999
Q ss_pred HHhCCHHHHHHHHHHHHHhcCChh
Q 000425 843 ICHLNPSAMRRLAQRLEEEGANPE 866 (1525)
Q Consensus 843 ~~~gd~e~L~kLa~~ae~~g~~~~ 866 (1525)
...||.++=+++...|...+....
T Consensus 179 ~L~gd~~~A~~lll~a~l~~~ad~ 202 (257)
T COG5010 179 LLRGDLEDAETLLLPAYLSPAADS 202 (257)
T ss_pred HHcCCHHHHHHHHHHHHhCCCCch
Confidence 999999999999999988776333
No 388
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=71.21 E-value=69 Score=36.73 Aligned_cols=106 Identities=16% Similarity=0.217 Sum_probs=66.4
Q ss_pred cCCC-CEEEEEECCCcEEEEEcC--CCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC------CCcEEEE-eCCCC
Q 000425 60 HPRL-PVLYVAYADGLIRAYNIH--TYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR------RGTLLAW-DVSIE 129 (1525)
Q Consensus 60 ~Pdg-~~LaSGs~DgtIrLWDl~--t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~------DgtVrVW-Dl~t~ 129 (1525)
++.+ ..|+.+...++|.+|++. ..++..+|..-+.|. .+.|+.-|+||++--. +..+|+| |++..
T Consensus 24 c~~g~d~Lfva~~g~~Vev~~l~~~~~~~~~~F~Tv~~V~-----~l~y~~~GDYlvTlE~k~~~~~~~fvR~Y~NWr~~ 98 (215)
T PF14761_consen 24 CCGGPDALFVAASGCKVEVYDLEQEECPLLCTFSTVGRVL-----QLVYSEAGDYLVTLEEKNKRSPVDFVRAYFNWRSQ 98 (215)
T ss_pred eccCCceEEEEcCCCEEEEEEcccCCCceeEEEcchhhee-----EEEeccccceEEEEEeecCCccceEEEEEEEhhhh
Confidence 3334 455554667889999987 345677777558888 9999999999998652 2267887 66542
Q ss_pred ---CceeeEeeccc---------------------CcCeEEEEEcC-CCCEEEEEECCCCeeEEEeee
Q 000425 130 ---RPSMIGIIQVG---------------------SQPITSVAWLP-MLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 130 ---~~s~i~il~gH---------------------~~~VtsVafsP-dg~~LaSgS~DgtIkVWDlr~ 172 (1525)
...+...+-|| ...+.|++=.| .|++++. + ++++.+|.++.
T Consensus 99 ~~~~~~v~vRiaG~~v~~~~~~~~~~qleiiElPl~~~p~ciaCC~~tG~LlVg-~-~~~l~lf~l~~ 164 (215)
T PF14761_consen 99 KEENSPVRVRIAGHRVTPSFNESSKDQLEIIELPLSEPPLCIACCPVTGNLLVG-C-GNKLVLFTLKY 164 (215)
T ss_pred cccCCcEEEEEcccccccCCCCccccceEEEEecCCCCCCEEEecCCCCCEEEE-c-CCEEEEEEEEE
Confidence 11111011122 22445666666 4665554 3 56899999874
No 389
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=70.96 E-value=6.6 Score=35.20 Aligned_cols=33 Identities=24% Similarity=0.471 Sum_probs=24.8
Q ss_pred HHHhccHHHHHHHHHHHHccCC-hhhHHHHHHHH
Q 000425 1417 NMEVQNYAYAKQMLELLLSKAP-ASKQDELRSLI 1449 (1525)
Q Consensus 1417 ~~K~kNy~tAa~fA~rLL~l~p-~~~~~qarkil 1449 (1525)
.||.|||..|..++++||++.| ...+.+.++++
T Consensus 11 ~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i 44 (53)
T PF14853_consen 11 HYKLGEYEKARRYCDALLEIEPDNRQAQSLKELI 44 (53)
T ss_dssp HHHTT-HHHHHHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred HHHhhhHHHHHHHHHHHHhhCCCcHHHHHHHHHH
Confidence 5999999999999999999998 44444444444
No 390
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=70.53 E-value=1.1e+02 Score=36.86 Aligned_cols=124 Identities=15% Similarity=0.237 Sum_probs=64.6
Q ss_pred EEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC--CcEEEEEecCCceeecCceeEEEeCCCC
Q 000425 33 VVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT--YAVHYTLQLDNTIKLLGAGAFAFHPTLE 110 (1525)
Q Consensus 33 vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t--~k~l~tL~H~~~V~~~ga~sVaFSPdg~ 110 (1525)
++-+.++++.=..+..+-...++.+...+++.++++++.-....-||-.. ..+. .......|. .+.|.|++.
T Consensus 126 iy~T~DgG~tW~~~~~~~~gs~~~~~r~~dG~~vavs~~G~~~~s~~~G~~~w~~~-~r~~~~riq-----~~gf~~~~~ 199 (302)
T PF14870_consen 126 IYRTTDGGKTWQAVVSETSGSINDITRSSDGRYVAVSSRGNFYSSWDPGQTTWQPH-NRNSSRRIQ-----SMGFSPDGN 199 (302)
T ss_dssp EEEESSTTSSEEEEE-S----EEEEEE-TTS-EEEEETTSSEEEEE-TT-SS-EEE-E--SSS-EE-----EEEE-TTS-
T ss_pred EEEeCCCCCCeeEcccCCcceeEeEEECCCCcEEEEECcccEEEEecCCCccceEE-ccCccceeh-----hceecCCCC
Confidence 45555665443333344567899999999999999987666666787442 2222 111457788 999999987
Q ss_pred EEEEEeCCCcEEEEeCCCC----CceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 111 WLFVGDRRGTLLAWDVSIE----RPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 111 ~L~SgS~DgtVrVWDl~t~----~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
..+. ...+.|+.=|.... ...... .......+..++|.++....++|+. |++
T Consensus 200 lw~~-~~Gg~~~~s~~~~~~~~w~~~~~~-~~~~~~~~ld~a~~~~~~~wa~gg~-G~l 255 (302)
T PF14870_consen 200 LWML-ARGGQIQFSDDPDDGETWSEPIIP-IKTNGYGILDLAYRPPNEIWAVGGS-GTL 255 (302)
T ss_dssp EEEE-ETTTEEEEEE-TTEEEEE---B-T-TSS--S-EEEEEESSSS-EEEEEST-T-E
T ss_pred EEEE-eCCcEEEEccCCCCccccccccCC-cccCceeeEEEEecCCCCEEEEeCC-ccE
Confidence 5553 37888887662211 111110 1122344789999999888887654 444
No 391
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=70.44 E-value=21 Score=45.08 Aligned_cols=102 Identities=12% Similarity=0.197 Sum_probs=68.1
Q ss_pred CCCE-EEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCE-------EEEEeCCCcEEEEeCCCCCce
Q 000425 62 RLPV-LYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEW-------LFVGDRRGTLLAWDVSIERPS 132 (1525)
Q Consensus 62 dg~~-LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~-------L~SgS~DgtVrVWDl~t~~~s 132 (1525)
+.++ |.++..-..++=.|++.++.+...+ |.+ |+ -+.+.|+.+. -+.|-+|..|.-||.+-....
T Consensus 344 dsnlil~~~~~~~~l~klDIE~GKIVeEWk~~~d-i~-----mv~~t~d~K~~Ql~~e~TlvGLs~n~vfriDpRv~~~~ 417 (644)
T KOG2395|consen 344 DSNLILMDGGEQDKLYKLDIERGKIVEEWKFEDD-IN-----MVDITPDFKFAQLTSEQTLVGLSDNSVFRIDPRVQGKN 417 (644)
T ss_pred ccceEeeCCCCcCcceeeecccceeeeEeeccCC-cc-----eeeccCCcchhcccccccEEeecCCceEEecccccCcc
Confidence 3344 4456666677778999999999998 665 77 6788887532 244667888999998853222
Q ss_pred eeEeecccC----cCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 133 MIGIIQVGS----QPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 133 ~i~il~gH~----~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
.+...++|. ..-.|++-..+ .+|+.||.+|-||+||-
T Consensus 418 kl~~~q~kqy~~k~nFsc~aTT~s-G~IvvgS~~GdIRLYdr 458 (644)
T KOG2395|consen 418 KLAVVQSKQYSTKNNFSCFATTES-GYIVVGSLKGDIRLYDR 458 (644)
T ss_pred eeeeeeccccccccccceeeecCC-ceEEEeecCCcEEeehh
Confidence 222233432 22344444444 48999999999999985
No 392
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=70.37 E-value=1.3e+02 Score=37.32 Aligned_cols=154 Identities=14% Similarity=0.162 Sum_probs=69.0
Q ss_pred EEEEEcCCCCEEEEEEec----cceEEEEEeecCCccceEEccCCCCC--eEEEEEcCCCCEEEE-EE----CCCcEEEE
Q 000425 10 VHLALTPLQPVVFFGFHR----RMSVTVVGTVEGGRAPTKIKTDLKKP--IVNLACHPRLPVLYV-AY----ADGLIRAY 78 (1525)
Q Consensus 10 ~~LA~hP~qp~Vf~s~~~----d~sI~vv~tieggk~~tklk~~H~~~--V~sLaf~Pdg~~LaS-Gs----~DgtIrLW 78 (1525)
=|+-++|+.|.++.=+|. .-+-++|-....+....+++. |... +.-=-|.|+|..+.- +. .+..|.-+
T Consensus 191 gH~~fsP~dp~li~fCHEGpw~~Vd~RiW~i~~dg~~~~~v~~-~~~~e~~gHEfw~~DG~~i~y~~~~~~~~~~~i~~~ 269 (386)
T PF14583_consen 191 GHVQFSPTDPTLIMFCHEGPWDLVDQRIWTINTDGSNVKKVHR-RMEGESVGHEFWVPDGSTIWYDSYTPGGQDFWIAGY 269 (386)
T ss_dssp EEEEEETTEEEEEEEEE-S-TTTSS-SEEEEETTS---EESS----TTEEEEEEEE-TTSS-EEEEEEETTT--EEEEEE
T ss_pred cCcccCCCCCCEEEEeccCCcceeceEEEEEEcCCCcceeeec-CCCCcccccccccCCCCEEEEEeecCCCCceEEEee
Confidence 489999999987654444 112234544455555555554 3322 122247889885443 22 24566667
Q ss_pred EcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCC----------------cEEEEeCCCCCceee-------E
Q 000425 79 NIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRG----------------TLLAWDVSIERPSMI-------G 135 (1525)
Q Consensus 79 Dl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~Dg----------------tVrVWDl~t~~~s~i-------~ 135 (1525)
|..+++......- .... .+--++||++++-=+.|. .|.+++......... .
T Consensus 270 d~~t~~~~~~~~~-p~~~-----H~~ss~Dg~L~vGDG~d~p~~v~~~~~~~~~~~p~i~~~~~~~~~~~~l~~h~~sw~ 343 (386)
T PF14583_consen 270 DPDTGERRRLMEM-PWCS-----HFMSSPDGKLFVGDGGDAPVDVADAGGYKIENDPWIYLFDVEAGRFRKLARHDTSWK 343 (386)
T ss_dssp -TTT--EEEEEEE--SEE-----EEEE-TTSSEEEEEE-------------------EEEEEETTTTEEEEEEE------
T ss_pred CCCCCCceEEEeC-Ccee-----eeEEcCCCCEEEecCCCCCccccccccceecCCcEEEEeccccCceeeeeeccCcce
Confidence 8887755332221 1233 444578998776433332 455677764311111 1
Q ss_pred eecccCc-CeEEEEEcCCCCEEEE-EECCCCeeEEEe
Q 000425 136 IIQVGSQ-PITSVAWLPMLRLLVT-LCRDGSLQVWKT 170 (1525)
Q Consensus 136 il~gH~~-~VtsVafsPdg~~LaS-gS~DgtIkVWDl 170 (1525)
.+.+|.. .-...+|+||+++++- +...|...|+=+
T Consensus 344 v~~~~~q~~hPhp~FSPDgk~VlF~Sd~~G~~~vY~v 380 (386)
T PF14583_consen 344 VLDGDRQVTHPHPSFSPDGKWVLFRSDMEGPPAVYLV 380 (386)
T ss_dssp -BTTBSSTT----EE-TTSSEEEEEE-TTSS-EEEEE
T ss_pred eecCCCccCCCCCccCCCCCEEEEECCCCCCccEEEE
Confidence 1222222 1246899999997664 445676666643
No 393
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=70.26 E-value=15 Score=41.89 Aligned_cols=68 Identities=21% Similarity=0.289 Sum_probs=50.8
Q ss_pred EEEeCCCCEEEEEeCCCcEEEEeCCCCC----c-eeeEeec-------ccCcCeEEEEEcCCCCEEEEEECCCCeeEEEe
Q 000425 103 FAFHPTLEWLFVGDRRGTLLAWDVSIER----P-SMIGIIQ-------VGSQPITSVAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 103 VaFSPdg~~L~SgS~DgtVrVWDl~t~~----~-s~i~il~-------gH~~~VtsVafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
+.+..++.|+++-..+|.+++||+.+.. . ++..++. .....|..+..+.+|.-|++-+ +|....||.
T Consensus 16 ~~l~~~~~~Ll~iT~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt~~G~PiV~ls-ng~~y~y~~ 94 (219)
T PF07569_consen 16 SFLECNGSYLLAITSSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLTSNGVPIVTLS-NGDSYSYSP 94 (219)
T ss_pred EEEEeCCCEEEEEeCCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEcCCCCEEEEEe-CCCEEEecc
Confidence 3456789999999999999999999641 1 1111222 3567788999999998888877 477888987
Q ss_pred e
Q 000425 171 R 171 (1525)
Q Consensus 171 r 171 (1525)
.
T Consensus 95 ~ 95 (219)
T PF07569_consen 95 D 95 (219)
T ss_pred c
Confidence 6
No 394
>KOG1155 consensus Anaphase-promoting complex (APC), Cdc23 subunit [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=70.03 E-value=42 Score=42.14 Aligned_cols=39 Identities=13% Similarity=0.165 Sum_probs=31.4
Q ss_pred HHHHhccHHHHHHHHHHHHccCChhhHHHHHHHHHHHHhcC
Q 000425 1416 RNMEVQNYAYAKQMLELLLSKAPASKQDELRSLIDMCVQRG 1456 (1525)
Q Consensus 1416 ~~~K~kNy~tAa~fA~rLL~l~p~~~~~qarkil~~ce~~~ 1456 (1525)
.++|.|||..|...|.+.+.- ....|+||.++..|.+..
T Consensus 516 ~f~k~~~~~~As~Ya~~~~~~--~~e~eeak~LlReir~~~ 554 (559)
T KOG1155|consen 516 YFKKMKDFDEASYYATLVLKG--ETECEEAKALLREIRKIQ 554 (559)
T ss_pred HHHhhcchHHHHHHHHHHhcC--CchHHHHHHHHHHHHHhc
Confidence 468999999999999999987 456677888887776543
No 395
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=69.96 E-value=82 Score=31.88 Aligned_cols=99 Identities=12% Similarity=0.062 Sum_probs=57.0
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHH
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFLALAKDHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMA 1390 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~l 1390 (1525)
...+..|+.++..|++.+|+..|+.++..--.+..-...+-.......+|--|..++++..
T Consensus 18 ~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~------------------- 78 (135)
T TIGR02552 18 EQIYALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAA------------------- 78 (135)
T ss_pred HHHHHHHHHHHHcccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------
Confidence 3367889999999999999999999866321111100011111111122333333222211
Q ss_pred HHHHhhcCCCCchhHHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC
Q 000425 1391 RLSRHLGSLPLQTKHRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP 1438 (1525)
Q Consensus 1391 ELaaYfT~c~Lqp~H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p 1438 (1525)
.+.|..-..-...|. ..++.|++..|..+.++.+++.|
T Consensus 79 ---------~~~p~~~~~~~~la~-~~~~~g~~~~A~~~~~~al~~~p 116 (135)
T TIGR02552 79 ---------ALDPDDPRPYFHAAE-CLLALGEPESALKALDLAIEICG 116 (135)
T ss_pred ---------hcCCCChHHHHHHHH-HHHHcCCHHHHHHHHHHHHHhcc
Confidence 122333233333444 56788999999999999999998
No 396
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=69.88 E-value=6.4 Score=30.44 Aligned_cols=27 Identities=30% Similarity=0.477 Sum_probs=22.9
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhh
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVIADY 835 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~~d~ 835 (1525)
|..+|...+..|+++.|.+||+++=.+
T Consensus 4 ~~~lg~~~~~~~~~~~A~~~~~~al~l 30 (34)
T PF07719_consen 4 WYYLGQAYYQLGNYEEAIEYFEKALEL 30 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHH
Confidence 567999999999999999999987543
No 397
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=69.80 E-value=2.4e+02 Score=33.53 Aligned_cols=148 Identities=16% Similarity=0.095 Sum_probs=81.1
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCC-CcEEEEEe
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHT-YAVHYTLQ 90 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t-~k~l~tL~ 90 (1525)
.++.+-+..++++.. ..+.++......+-...-..+..-.|.++... ++++++|.....+.++.++. ...+..+.
T Consensus 92 ~ai~~~~~~lv~~~g--~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~--~~~I~vgD~~~sv~~~~~~~~~~~l~~va 167 (321)
T PF03178_consen 92 TAICSFNGRLVVAVG--NKLYVYDLDNSKTLLKKAFYDSPFYITSLSVF--KNYILVGDAMKSVSLLRYDEENNKLILVA 167 (321)
T ss_dssp EEEEEETTEEEEEET--TEEEEEEEETTSSEEEEEEE-BSSSEEEEEEE--TTEEEEEESSSSEEEEEEETTTE-EEEEE
T ss_pred eEhhhhCCEEEEeec--CEEEEEEccCcccchhhheecceEEEEEEecc--ccEEEEEEcccCEEEEEEEccCCEEEEEE
Confidence 344455566555553 45555554333311111111123366666665 66999999888898885543 23233333
Q ss_pred ---cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCC------Cc-eeeEeecc-cCcCeEEE---EEcCC--C-
Q 000425 91 ---LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIE------RP-SMIGIIQV-GSQPITSV---AWLPM--L- 153 (1525)
Q Consensus 91 ---H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~------~~-s~i~il~g-H~~~VtsV---afsPd--g- 153 (1525)
....+. ++.|-++++.++.+..+|.+.++.+... .. .+.....- ....|+++ ++.|. .
T Consensus 168 ~d~~~~~v~-----~~~~l~d~~~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~~~f~lg~~v~~~~~~~l~~~~~~~ 242 (321)
T PF03178_consen 168 RDYQPRWVT-----AAEFLVDEDTIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERISSFHLGDIVNSFRRGSLIPRSGSS 242 (321)
T ss_dssp EESS-BEEE-----EEEEE-SSSEEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEEEEEE-SS-EEEEEE--SS--SSSS
T ss_pred ecCCCccEE-----EEEEecCCcEEEEEcCCCeEEEEEECCCCcccccccccceeEEEEECCCccceEEEEEeeecCCCC
Confidence 445677 8888877788999999999999977631 11 22211112 24557776 44452 2
Q ss_pred -----CEEEEEECCCCeeEE
Q 000425 154 -----RLLVTLCRDGSLQVW 168 (1525)
Q Consensus 154 -----~~LaSgS~DgtIkVW 168 (1525)
+.++-++.+|.|-+-
T Consensus 243 ~~~~~~~i~~~T~~G~Ig~l 262 (321)
T PF03178_consen 243 ESPNRPQILYGTVDGSIGVL 262 (321)
T ss_dssp -TTEEEEEEEEETTS-EEEE
T ss_pred cccccceEEEEecCCEEEEE
Confidence 247778889988743
No 398
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=69.68 E-value=11 Score=33.84 Aligned_cols=60 Identities=23% Similarity=0.284 Sum_probs=42.8
Q ss_pred HHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhc-CChhHHHHHHHHH
Q 000425 808 RFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSAMRRLAQRLEEEG-ANPELRRYCERIL 875 (1525)
Q Consensus 808 rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g-~~~~~~~~c~~il 875 (1525)
-|..+|...+..|+++.|+++|+++=+++ -.+.+.+-.|+......| ....+...+++.+
T Consensus 5 ~~~~~g~~~~~~~~~~~A~~~~~~ai~~~--------p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al 65 (69)
T PF13414_consen 5 AWYNLGQIYFQQGDYEEAIEYFEKAIELD--------PNNAEAYYNLGLAYMKLGKDYEEAIEDFEKAL 65 (69)
T ss_dssp HHHHHHHHHHHTTHHHHHHHHHHHHHHHS--------TTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHcC--------CCCHHHHHHHHHHHHHhCccHHHHHHHHHHHH
Confidence 37789999999999999999999987764 445556666666666665 3444454454443
No 399
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=69.31 E-value=20 Score=48.79 Aligned_cols=102 Identities=24% Similarity=0.290 Sum_probs=70.2
Q ss_pred CeEEEEEcCCCC-EEEEEECCCcEEEEEcCCCc-EEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCC
Q 000425 53 PIVNLACHPRLP-VLYVAYADGLIRAYNIHTYA-VHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIER 130 (1525)
Q Consensus 53 ~V~sLaf~Pdg~-~LaSGs~DgtIrLWDl~t~k-~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~ 130 (1525)
.+.++.|+|.=+ .+++...|+.|++.-+.... ...++....... +++|+|.|+.++.|-..|++.-|--.-..
T Consensus 157 f~~~~~wnP~vp~n~av~l~dlsl~V~~~~~~~~~v~s~p~t~~~T-----av~WSprGKQl~iG~nnGt~vQy~P~lei 231 (1405)
T KOG3630|consen 157 FQLKNVWNPLVPLNSAVDLSDLSLRVKSTKQLAQNVTSFPVTNSQT-----AVLWSPRGKQLFIGRNNGTEVQYEPSLEI 231 (1405)
T ss_pred ccccccccCCccchhhhhccccchhhhhhhhhhhhhcccCccccee-----eEEeccccceeeEecCCCeEEEeecccce
Confidence 456788999865 57888899999998765332 223334556677 99999999999999999999887654221
Q ss_pred ceee-EeecccCcCeEEEEEcCCCCEEEEE
Q 000425 131 PSMI-GIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 131 ~s~i-~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
...+ .--..-...|.||+|--...+++.-
T Consensus 232 k~~ip~Pp~~e~yrvl~v~Wl~t~eflvvy 261 (1405)
T KOG3630|consen 232 KSEIPEPPVEENYRVLSVTWLSTQEFLVVY 261 (1405)
T ss_pred eecccCCCcCCCcceeEEEEecceeEEEEe
Confidence 1111 0001125789999999887777664
No 400
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=68.32 E-value=93 Score=36.66 Aligned_cols=117 Identities=12% Similarity=0.151 Sum_probs=68.2
Q ss_pred hHHHHHHHHHH-hhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHH
Q 000425 1310 ASVCFKTGLAH-LEQNQLPDALSCFDEAFLALAKDHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDE 1388 (1525)
Q Consensus 1310 ~~~~lk~gy~~-~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R 1388 (1525)
....+..|+.+ +..|+|.+|+..|+.++..-=++. ...++ .|-+|-.. .- .++ . + .
T Consensus 142 e~~~Y~~A~~l~~~~~~y~~Ai~af~~fl~~yP~s~-~a~~A--------~y~LG~~y--~~-~g~-~-------~---~ 198 (263)
T PRK10803 142 ANTDYNAAIALVQDKSRQDDAIVAFQNFVKKYPDST-YQPNA--------NYWLGQLN--YN-KGK-K-------D---D 198 (263)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCCc-chHHH--------HHHHHHHH--HH-cCC-H-------H---H
Confidence 35578899987 678999999999999987222211 11111 35555544 11 111 0 0 1
Q ss_pred HHH-HHHhhcCCCCchhHHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC-hhhHHHHHHHHH
Q 000425 1389 MAR-LSRHLGSLPLQTKHRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP-ASKQDELRSLID 1450 (1525)
Q Consensus 1389 ~lE-LaaYfT~c~Lqp~H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p-~~~~~qarkil~ 1450 (1525)
.++ +..+...-+-.+. ..-+|...-...++.|++..|..+.+++++.-| ...+.+|++.|.
T Consensus 199 A~~~f~~vv~~yP~s~~-~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s~~a~~A~~rL~ 261 (263)
T PRK10803 199 AAYYFASVVKNYPKSPK-AADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGAKQAQKRLN 261 (263)
T ss_pred HHHHHHHHHHHCCCCcc-hhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHHHHHh
Confidence 111 1111111122121 222333233345589999999999999999999 777888888764
No 401
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=67.91 E-value=41 Score=40.65 Aligned_cols=104 Identities=18% Similarity=0.225 Sum_probs=55.7
Q ss_pred eEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE---EEEEe-----cCCceeecCceeEEEeCC---CCEEEEEeC-C---
Q 000425 54 IVNLACHPRLPVLYVAYADGLIRAYNIHTYAV---HYTLQ-----LDNTIKLLGAGAFAFHPT---LEWLFVGDR-R--- 118 (1525)
Q Consensus 54 V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~---l~tL~-----H~~~V~~~ga~sVaFSPd---g~~L~SgS~-D--- 118 (1525)
-++|+|.|++.+|++ -..|.|++++ ..+.. +..+. ....+. .++|+|+ ..+|..... .
T Consensus 4 P~~~a~~pdG~l~v~-e~~G~i~~~~-~~g~~~~~v~~~~~v~~~~~~gll-----gia~~p~f~~n~~lYv~~t~~~~~ 76 (331)
T PF07995_consen 4 PRSMAFLPDGRLLVA-ERSGRIWVVD-KDGSLKTPVADLPEVFADGERGLL-----GIAFHPDFASNGYLYVYYTNADED 76 (331)
T ss_dssp EEEEEEETTSCEEEE-ETTTEEEEEE-TTTEECEEEEE-TTTBTSTTBSEE-----EEEE-TTCCCC-EEEEEEEEE-TS
T ss_pred ceEEEEeCCCcEEEE-eCCceEEEEe-CCCcCcceecccccccccccCCcc-----cceeccccCCCCEEEEEEEcccCC
Confidence 478999999887665 5699999999 44443 22231 223455 8999995 344443332 1
Q ss_pred -----CcEEEEeCCCC------CceeeEeec---ccCcCeEEEEEcCCCCEEEEEECCCC
Q 000425 119 -----GTLLAWDVSIE------RPSMIGIIQ---VGSQPITSVAWLPMLRLLVTLCRDGS 164 (1525)
Q Consensus 119 -----gtVrVWDl~t~------~~s~i~il~---gH~~~VtsVafsPdg~~LaSgS~Dgt 164 (1525)
..|.-|.+... ...++..+. ...+.-..+.|.|||.+.++.++.+.
T Consensus 77 ~~~~~~~v~r~~~~~~~~~~~~~~~l~~~~p~~~~~~H~g~~l~fgpDG~LYvs~G~~~~ 136 (331)
T PF07995_consen 77 GGDNDNRVVRFTLSDGDGDLSSEEVLVTGLPDTSSGNHNGGGLAFGPDGKLYVSVGDGGN 136 (331)
T ss_dssp SSSEEEEEEEEEEETTSCEEEEEEEEEEEEES-CSSSS-EEEEEE-TTSEEEEEEB-TTT
T ss_pred CCCcceeeEEEeccCCccccccceEEEEEeCCCCCCCCCCccccCCCCCcEEEEeCCCCC
Confidence 23433433321 111111111 23345567999999987777665543
No 402
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=66.21 E-value=40 Score=44.46 Aligned_cols=72 Identities=19% Similarity=0.155 Sum_probs=43.0
Q ss_pred CCcccccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCccccccccCCCCCCCCCCCCCCCCCCCC---CCCCccCCc
Q 000425 1231 AIPVESIGLPDGGVPPQSSGQTPFPYQSQVLPAQVPPSTQPLDLSALGVPNSGDSGKSPANPASPPTS---VRPGQVPRG 1307 (1525)
Q Consensus 1231 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~~~~~ 1307 (1525)
.-|+..+.-+-++.+||..+.++++.++++++|+++...+|.-+.+.+ -+.+.|+.-.+|... .+|++|-+.
T Consensus 313 ~ppv~~ln~~~s~~~p~pp~~p~l~~~~espvpp~~~~~~~a~~~a~~-----ik~~~~tk~~~p~lnW~alKP~qv~~t 387 (830)
T KOG1923|consen 313 FPPVGPLNSNLSSGAPQPPGVPFLLTFPESPVPPPQRLMIPAELFAAT-----IKKPTPTKTKSPSLNWLALKPIQVKGT 387 (830)
T ss_pred CCCCCCCCCCCcCCCCCCCCCCcccCCCCCCCCCCcccccchHhHHHh-----hcCCCcccccCCCccccccCccccccc
Confidence 334455555556677776666777777777777777777887776642 222333333333433 457777653
No 403
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=65.67 E-value=55 Score=37.63 Aligned_cols=104 Identities=14% Similarity=0.125 Sum_probs=62.8
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee-----e--
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM-----I-- 134 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~-----i-- 134 (1525)
.++.++.|..+| +.++++........+.+...|. .+...++-+.+++-+ |+++.++|+....... .
T Consensus 6 ~~~~L~vGt~~G-l~~~~~~~~~~~~~i~~~~~I~-----ql~vl~~~~~llvLs-d~~l~~~~L~~l~~~~~~~~~~~~ 78 (275)
T PF00780_consen 6 WGDRLLVGTEDG-LYVYDLSDPSKPTRILKLSSIT-----QLSVLPELNLLLVLS-DGQLYVYDLDSLEPVSTSAPLAFP 78 (275)
T ss_pred CCCEEEEEECCC-EEEEEecCCccceeEeecceEE-----EEEEecccCEEEEEc-CCccEEEEchhhcccccccccccc
Confidence 467899999999 8999984333333333444488 888889888777665 4999999988631111 0
Q ss_pred -----EeecccCcCeEEEE--EcCCCCEEEEEECCCCeeEEEeee
Q 000425 135 -----GIIQVGSQPITSVA--WLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 135 -----~il~gH~~~VtsVa--fsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
...-.....+..++ -...+...+.....++|.+|.+..
T Consensus 79 ~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk~i~i~~~~~ 123 (275)
T PF00780_consen 79 KSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKKKILIYEWND 123 (275)
T ss_pred ccccccccccccCCeeEEeeccccccceEEEEEECCEEEEEEEEC
Confidence 00011233444444 112333344444566999998763
No 404
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=65.58 E-value=4.2 Score=53.46 Aligned_cols=119 Identities=11% Similarity=0.058 Sum_probs=68.8
Q ss_pred CCeEEEEEcC---CCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecC-----ce-eEEEeCCCCEEEEEeCCCcE
Q 000425 52 KPIVNLACHP---RLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLG-----AG-AFAFHPTLEWLFVGDRRGTL 121 (1525)
Q Consensus 52 ~~V~sLaf~P---dg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~g-----a~-sVaFSPdg~~L~SgS~DgtV 121 (1525)
.++.-|.|.| ..-++..+-.+++|++.++.+... ..|. |...+.... .. --..||||..++..+.||.+
T Consensus 181 s~~~~V~wcp~~~~~~~ic~~~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~v 259 (1283)
T KOG1916|consen 181 SDPQLVSWCPIAVNKVYICYGLKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGSV 259 (1283)
T ss_pred CCcceeeecccccccceeeeccCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCcc
Confidence 3445555554 456777888899999987654322 2222 554443110 00 11268999999999999999
Q ss_pred EEEeCCCC---CceeeEeecccCc-CeEEEEEcCCC---------CEEEEEEC-CCCeeEEEee
Q 000425 122 LAWDVSIE---RPSMIGIIQVGSQ-PITSVAWLPML---------RLLVTLCR-DGSLQVWKTR 171 (1525)
Q Consensus 122 rVWDl~t~---~~s~i~il~gH~~-~VtsVafsPdg---------~~LaSgS~-DgtIkVWDlr 171 (1525)
+.|...-. ...+...++.|.. +-.|.-+|.+. .+++++++ ...+++|.--
T Consensus 260 ~f~Qiyi~g~~~~rclhewkphd~~p~vC~lc~~~~~~~v~i~~w~~~Itttd~nre~k~w~~a 323 (1283)
T KOG1916|consen 260 GFYQIYITGKIVHRCLHEWKPHDKHPRVCWLCHKQEILVVSIGKWVLRITTTDVNREEKFWAEA 323 (1283)
T ss_pred ceeeeeeeccccHhhhhccCCCCCCCceeeeeccccccCCccceeEEEEecccCCcceeEeecc
Confidence 99976532 3344444556663 22232233321 24566655 4468888754
No 405
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=65.00 E-value=1.6e+02 Score=36.07 Aligned_cols=112 Identities=20% Similarity=0.276 Sum_probs=70.5
Q ss_pred eEEEEEcCCCCEEEEEE----------CCCcEEEEEcCCCcEEEEEe-cCC-cee-ecCceeEEEeCCCCEEEEEe--CC
Q 000425 54 IVNLACHPRLPVLYVAY----------ADGLIRAYNIHTYAVHYTLQ-LDN-TIK-LLGAGAFAFHPTLEWLFVGD--RR 118 (1525)
Q Consensus 54 V~sLaf~Pdg~~LaSGs----------~DgtIrLWDl~t~k~l~tL~-H~~-~V~-~~ga~sVaFSPdg~~L~SgS--~D 118 (1525)
.-.+..+|+++.++++. .+-.|.+||..+.++...+. ..+ ... ..-.+.++++.|+++++... --
T Consensus 38 ~~~~~~spdgk~~y~a~T~~sR~~rG~RtDvv~~~D~~TL~~~~EI~iP~k~R~~~~~~~~~~~ls~dgk~~~V~N~TPa 117 (342)
T PF06433_consen 38 LGNVALSPDGKTIYVAETFYSRGTRGERTDVVEIWDTQTLSPTGEIEIPPKPRAQVVPYKNMFALSADGKFLYVQNFTPA 117 (342)
T ss_dssp SEEEEE-TTSSEEEEEEEEEEETTEEEEEEEEEEEETTTTEEEEEEEETTS-B--BS--GGGEEE-TTSSEEEEEEESSS
T ss_pred CCceeECCCCCEEEEEEEEEeccccccceeEEEEEecCcCcccceEecCCcchheecccccceEEccCCcEEEEEccCCC
Confidence 34467899999877643 24569999999999988877 332 111 01134678899999888765 55
Q ss_pred CcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCC-CEEEEEECCCCeeEEEee
Q 000425 119 GTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPML-RLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 119 gtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg-~~LaSgS~DgtIkVWDlr 171 (1525)
.+|.|-|+.. +++.+.... . . |....|.+ +-|.+-|.||++.-..+.
T Consensus 118 ~SVtVVDl~~--~kvv~ei~~-P-G--C~~iyP~~~~~F~~lC~DGsl~~v~Ld 165 (342)
T PF06433_consen 118 TSVTVVDLAA--KKVVGEIDT-P-G--CWLIYPSGNRGFSMLCGDGSLLTVTLD 165 (342)
T ss_dssp EEEEEEETTT--TEEEEEEEG-T-S--EEEEEEEETTEEEEEETTSCEEEEEET
T ss_pred CeEEEEECCC--CceeeeecC-C-C--EEEEEecCCCceEEEecCCceEEEEEC
Confidence 6788889883 455432221 1 1 33333432 457788899988777665
No 406
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=64.32 E-value=68 Score=38.65 Aligned_cols=99 Identities=14% Similarity=0.060 Sum_probs=66.5
Q ss_pred EEEcCCCCEEEEEEeccceEEEEEee-cC----CccceEEccCCCCCeEEEEEcCCCCEEEEEECCC-cEEEEEcCCCcE
Q 000425 12 LALTPLQPVVFFGFHRRMSVTVVGTV-EG----GRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADG-LIRAYNIHTYAV 85 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~sI~vv~ti-eg----gk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~Dg-tIrLWDl~t~k~ 85 (1525)
|||||++..++++=.....+..++.. .. ++........+.+.-=+++...+|.+.+++..+| .|.+|+.+ ++.
T Consensus 168 la~SpDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l 246 (307)
T COG3386 168 LAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKL 246 (307)
T ss_pred eEECCCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccCCceEEEECCC-CcE
Confidence 89999999999887666666666543 11 1222222222345555677788888776666654 89999988 899
Q ss_pred EEEEe-cCCceeecCceeEEE-eCCCCEEEEEe
Q 000425 86 HYTLQ-LDNTIKLLGAGAFAF-HPTLEWLFVGD 116 (1525)
Q Consensus 86 l~tL~-H~~~V~~~ga~sVaF-SPdg~~L~SgS 116 (1525)
+..+. ....+. +++| .|+.+.|+..+
T Consensus 247 ~~~i~lP~~~~t-----~~~FgG~~~~~L~iTs 274 (307)
T COG3386 247 LGEIKLPVKRPT-----NPAFGGPDLNTLYITS 274 (307)
T ss_pred EEEEECCCCCCc-----cceEeCCCcCEEEEEe
Confidence 98888 656777 7777 45666666554
No 407
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=63.78 E-value=9.7 Score=30.77 Aligned_cols=24 Identities=25% Similarity=0.441 Sum_probs=20.9
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHH
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVI 832 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~ 832 (1525)
|..||+.....|+++.|.++|+++
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~a 25 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQA 25 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHH
Confidence 457999999999999999999983
No 408
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=63.76 E-value=89 Score=39.01 Aligned_cols=111 Identities=17% Similarity=0.222 Sum_probs=66.9
Q ss_pred CeEEEEEcCCCCEEEEEECCCcEEEEEcCCC-------------------------------------------cEEEEE
Q 000425 53 PIVNLACHPRLPVLYVAYADGLIRAYNIHTY-------------------------------------------AVHYTL 89 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~-------------------------------------------k~l~tL 89 (1525)
.|..+.|.+...-|++|...|.|-||..... .+...+
T Consensus 3 ~v~~vs~a~~t~Elav~~~~GeVv~~k~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~l~di~~r~~~~~~~gf~P~~l~ 82 (395)
T PF08596_consen 3 SVTHVSFAPETLELAVGLESGEVVLFKFGKNQNYGNREQPPDLDYNFRRFSLNNSPGKLTDISDRAPPSLKEGFLPLTLL 82 (395)
T ss_dssp -EEEEEEETTTTEEEEEETTS-EEEEEEEE------------------S--GGGSS-SEEE-GGG--TT-SEEEEEEEEE
T ss_pred eEEEEEecCCCceEEEEccCCcEEEEEcccCCCCCccCCCcccCcccccccccCCCcceEEehhhCCcccccccCchhhe
Confidence 3666777777677777777777777653221 111122
Q ss_pred e-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeE--ee------cccCcCeEEEEEcC-----CC--
Q 000425 90 Q-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIG--II------QVGSQPITSVAWLP-----ML-- 153 (1525)
Q Consensus 90 ~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~--il------~gH~~~VtsVafsP-----dg-- 153 (1525)
+ ....|. +++.| |=.+++.|..||++.|.|++. +.++- .. ......|+++.|.. |+
T Consensus 83 ~~~~g~vt-----al~~S-~iGFvaigy~~G~l~viD~RG--PavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~yS 154 (395)
T PF08596_consen 83 DAKQGPVT-----ALKNS-DIGFVAIGYESGSLVVIDLRG--PAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYS 154 (395)
T ss_dssp ---S-SEE-----EEEE--BTSEEEEEETTSEEEEEETTT--TEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSE
T ss_pred eccCCcEe-----EEecC-CCcEEEEEecCCcEEEEECCC--CeEEeeccccccccccccccCeeEEEEEEEecCCCccc
Confidence 2 334555 77776 445788899999999999984 33330 01 12345688888863 33
Q ss_pred -CEEEEEECCCCeeEEEee
Q 000425 154 -RLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 154 -~~LaSgS~DgtIkVWDlr 171 (1525)
.+++.|...|.+.+|.+-
T Consensus 155 Si~L~vGTn~G~v~~fkIl 173 (395)
T PF08596_consen 155 SICLLVGTNSGNVLTFKIL 173 (395)
T ss_dssp EEEEEEEETTSEEEEEEEE
T ss_pred ceEEEEEeCCCCEEEEEEe
Confidence 478888889999999873
No 409
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=63.67 E-value=82 Score=37.87 Aligned_cols=83 Identities=16% Similarity=0.201 Sum_probs=59.4
Q ss_pred hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhh---------hHHHHHHHH
Q 000425 774 IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADY---------ESILDLFIC 844 (1525)
Q Consensus 774 l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~---------~~LllLy~~ 844 (1525)
.+.|+++.|.+.+++.... .| . ...-|..||+..+..|+++.|.+.|+++-+. ..|..+|..
T Consensus 191 ~~~~~~~~A~~~~~~al~~---~p---~---~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~~l~~~~~~ 261 (389)
T PRK11788 191 LARGDLDAARALLKKALAA---DP---Q---CVRASILLGDLALAQGDYAAAIEALERVEEQDPEYLSEVLPKLMECYQA 261 (389)
T ss_pred HhCCCHHHHHHHHHHHHhH---Cc---C---CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHChhhHHHHHHHHHHHHHH
Confidence 5779999999988875421 11 1 2334567999999999999999999987742 345567788
Q ss_pred hCCHHHHHHHHHHHHHhcCCh
Q 000425 845 HLNPSAMRRLAQRLEEEGANP 865 (1525)
Q Consensus 845 ~gd~e~L~kLa~~ae~~g~~~ 865 (1525)
.|+.+.-.++.+.+.+..-++
T Consensus 262 ~g~~~~A~~~l~~~~~~~p~~ 282 (389)
T PRK11788 262 LGDEAEGLEFLRRALEEYPGA 282 (389)
T ss_pred cCCHHHHHHHHHHHHHhCCCc
Confidence 888888888777776654443
No 410
>COG4391 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=62.94 E-value=4.4 Score=37.22 Aligned_cols=20 Identities=20% Similarity=0.590 Sum_probs=15.7
Q ss_pred ceeccCCCccccCCCCceec
Q 000425 1476 LSRLSTIGYDVCDLCGAKFS 1495 (1525)
Q Consensus 1476 ltpIy~~~~v~Cp~cgA~y~ 1495 (1525)
|-+|-.+..|.||||+.+|.
T Consensus 40 ~L~mg~~gev~CPYC~t~y~ 59 (62)
T COG4391 40 FLDMGDEGEVVCPYCSTRYR 59 (62)
T ss_pred EEEcCCCCcEecCccccEEE
Confidence 44552389999999999996
No 411
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=62.84 E-value=28 Score=40.31 Aligned_cols=111 Identities=15% Similarity=0.093 Sum_probs=63.6
Q ss_pred cccchhhccccCCCCCcc-----hhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHH
Q 000425 750 CGDLAVSLSQAGPQFTQV-----LRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDS 824 (1525)
Q Consensus 750 ~~elAl~ls~~~~~~d~~-----~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~l 824 (1525)
.++-|++++..+-..+++ .-..++...++++.+.+.++.... .+..+. ...-|..+|...++.|+.+.
T Consensus 92 ~~~~A~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~l~~~~~----~~~~~~---~~~~~~~~a~~~~~~G~~~~ 164 (280)
T PF13429_consen 92 DPEEALKLAEKAYERDGDPRYLLSALQLYYRLGDYDEAEELLEKLEE----LPAAPD---SARFWLALAEIYEQLGDPDK 164 (280)
T ss_dssp --------------------------H-HHHTT-HHHHHHHHHHHHH-----T---T----HHHHHHHHHHHHHCCHHHH
T ss_pred cccccccccccccccccccchhhHHHHHHHHHhHHHHHHHHHHHHHh----ccCCCC---CHHHHHHHHHHHHHcCCHHH
Confidence 566666665421111111 233667788999999999887431 121222 23447789999999999999
Q ss_pred HHHHHHHHhh--------hhHHHHHHHHhCCHHHHHHHHHHHHHh-cCChhH
Q 000425 825 AKETFEVIAD--------YESILDLFICHLNPSAMRRLAQRLEEE-GANPEL 867 (1525)
Q Consensus 825 Aeec~q~~~d--------~~~LllLy~~~gd~e~L~kLa~~ae~~-g~~~~~ 867 (1525)
|+++|+++=. ...|+.+++-+|+.+.++.+.+..... ..++.+
T Consensus 165 A~~~~~~al~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~~~~~ 216 (280)
T PF13429_consen 165 ALRDYRKALELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKAAPDDPDL 216 (280)
T ss_dssp HHHHHHHHHHH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH-HTSCCH
T ss_pred HHHHHHHHHHcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHHCcCHHHH
Confidence 9999998853 446778888899999888877777655 344444
No 412
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=62.70 E-value=66 Score=37.88 Aligned_cols=102 Identities=14% Similarity=0.144 Sum_probs=63.9
Q ss_pred cchhhhHHhh----cCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh-hh-----
Q 000425 766 QVLRGIYAIK----ALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA-DY----- 835 (1525)
Q Consensus 766 ~~~rF~LAl~----lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~-d~----- 835 (1525)
...-|+.|+. .|+++.|.+.+++.- ..+|+.+-....+ -.||..-+..|+++.|.+.|+++= .|
T Consensus 142 e~~~Y~~A~~l~~~~~~y~~Ai~af~~fl---~~yP~s~~a~~A~---y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~ 215 (263)
T PRK10803 142 ANTDYNAAIALVQDKSRQDDAIVAFQNFV---KKYPDSTYQPNAN---YWLGQLNYNKGKKDDAAYYFASVVKNYPKSPK 215 (263)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHH---HHCcCCcchHHHH---HHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcc
Confidence 3445555554 489999999987754 4566543222332 359999999999999999998874 22
Q ss_pred --hH---HHHHHHHhCCHHHHHHHHHHHHHhcCChhHHHHHHH
Q 000425 836 --ES---ILDLFICHLNPSAMRRLAQRLEEEGANPELRRYCER 873 (1525)
Q Consensus 836 --~~---LllLy~~~gd~e~L~kLa~~ae~~g~~~~~~~~c~~ 873 (1525)
+. |-.+|...||.++-++..+...+.--+......+++
T Consensus 216 ~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s~~a~~A~~ 258 (263)
T PRK10803 216 AADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGAKQAQK 258 (263)
T ss_pred hhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHHH
Confidence 22 334555666666666666655544444444444444
No 413
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=62.22 E-value=1.9e+02 Score=35.54 Aligned_cols=98 Identities=17% Similarity=0.132 Sum_probs=59.7
Q ss_pred CCCEEEEEECCCcEEEEEcCCCcEEEEEe-cCC-ceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecc
Q 000425 62 RLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDN-TIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQV 139 (1525)
Q Consensus 62 dg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~-~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~g 139 (1525)
.+..++.++.|+.+..+|..+++.+-... ... ... +... .+..++.++.||.+.+.|..+. +.......
T Consensus 293 ~~~~vy~~~~~g~l~ald~~tG~~~W~~~~~~~~~~~-----sp~v--~~g~l~v~~~~G~l~~ld~~tG--~~~~~~~~ 363 (394)
T PRK11138 293 DGGRIYLVDQNDRVYALDTRGGVELWSQSDLLHRLLT-----APVL--YNGYLVVGDSEGYLHWINREDG--RFVAQQKV 363 (394)
T ss_pred ECCEEEEEcCCCeEEEEECCCCcEEEcccccCCCccc-----CCEE--ECCEEEEEeCCCEEEEEECCCC--CEEEEEEc
Confidence 35677788899999999999987654433 111 111 1111 2567888999999999998864 22211111
Q ss_pred cCcCeEE-EEEcCCCCEEEEEECCCCeeEEEe
Q 000425 140 GSQPITS-VAWLPMLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 140 H~~~Vts-VafsPdg~~LaSgS~DgtIkVWDl 170 (1525)
+...+.+ -.+ .+..|+.++.||.|..++.
T Consensus 364 ~~~~~~s~P~~--~~~~l~v~t~~G~l~~~~~ 393 (394)
T PRK11138 364 DSSGFLSEPVV--ADDKLLIQARDGTVYAITR 393 (394)
T ss_pred CCCcceeCCEE--ECCEEEEEeCCceEEEEeC
Confidence 2222221 112 2446778899999887753
No 414
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=61.84 E-value=1e+02 Score=26.91 Aligned_cols=26 Identities=15% Similarity=0.255 Sum_probs=22.3
Q ss_pred HHHHHHHHhhcCChHHHHHHHHHHHH
Q 000425 1313 CFKTGLAHLEQNQLPDALSCFDEAFL 1338 (1525)
Q Consensus 1313 ~lk~gy~~~t~gkf~eAl~~Fr~iL~ 1338 (1525)
.+..|..++..|++.+|+..|+.++.
T Consensus 3 ~~~~a~~~~~~~~~~~A~~~~~~~~~ 28 (100)
T cd00189 3 LLNLGNLYYKLGDYDEALEYYEKALE 28 (100)
T ss_pred HHHHHHHHHHHhcHHHHHHHHHHHHh
Confidence 35677788889999999999999976
No 415
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=60.86 E-value=9.6 Score=50.42 Aligned_cols=112 Identities=15% Similarity=0.184 Sum_probs=70.1
Q ss_pred CCCCeEEEEEcCCC-CEEEEEECCCcEEEEEcC--CCcEE----EEEe--cCCceeecCceeEEEeC---CCCEEEEEeC
Q 000425 50 LKKPIVNLACHPRL-PVLYVAYADGLIRAYNIH--TYAVH----YTLQ--LDNTIKLLGAGAFAFHP---TLEWLFVGDR 117 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg-~~LaSGs~DgtIrLWDl~--t~k~l----~tL~--H~~~V~~~ga~sVaFSP---dg~~L~SgS~ 117 (1525)
..+.|-.++|.... +.++ -.=|...|||+. .++.. .+.. ....+. -|.|.| ..-++-.+-+
T Consensus 131 f~G~v~dl~fah~~~pk~~--~~vg~lfVy~vd~l~G~iq~~l~v~~~~p~gs~~~-----~V~wcp~~~~~~~ic~~~~ 203 (1283)
T KOG1916|consen 131 FPGGVGDLQFAHTKCPKGR--RLVGELFVYDVDVLQGEIQPQLEVTPITPYGSDPQ-----LVSWCPIAVNKVYICYGLK 203 (1283)
T ss_pred CCCCcccccccccCChHHH--HHhhhhheeehHhhccccccceEEeecCcCCCCcc-----eeeecccccccceeeeccC
Confidence 55667777774322 2222 223667889865 23322 2222 344556 566665 4456666668
Q ss_pred CCcEEEEeCCCCCceeeEeecccCcCeEEEE-----------EcCCCCEEEEEECCCCeeEEEee
Q 000425 118 RGTLLAWDVSIERPSMIGIIQVGSQPITSVA-----------WLPMLRLLVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 118 DgtVrVWDl~t~~~s~i~il~gH~~~VtsVa-----------fsPdg~~LaSgS~DgtIkVWDlr 171 (1525)
+++|++.++++....++ .+|+..+..++ .+|||..|+.++.||.++.|-+-
T Consensus 204 ~~~i~lL~~~ra~~~l~---rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~v~f~Qiy 265 (1283)
T KOG1916|consen 204 GGEIRLLNINRALRSLF---RSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGSVGFYQIY 265 (1283)
T ss_pred CCceeEeeechHHHHHH---HhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCccceeeee
Confidence 99999887775323333 35777665543 68999999999999999999775
No 416
>KOG0553 consensus TPR repeat-containing protein [General function prediction only]
Probab=60.84 E-value=12 Score=44.32 Aligned_cols=89 Identities=15% Similarity=0.206 Sum_probs=62.2
Q ss_pred CcccchhhccccCCCCCcch---hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHH
Q 000425 749 VCGDLAVSLSQAGPQFTQVL---RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSA 825 (1525)
Q Consensus 749 ~~~elAl~ls~~~~~~d~~~---rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lA 825 (1525)
++..-|++|.. .|+.. |-.--.+||+++.|.+.++..- .+ +.+.|.-|-+||.+-+.+|++..|
T Consensus 102 ~kY~~AI~l~P----~nAVyycNRAAAy~~Lg~~~~AVkDce~Al----~i-----Dp~yskay~RLG~A~~~~gk~~~A 168 (304)
T KOG0553|consen 102 DKYTEAIELDP----TNAVYYCNRAAAYSKLGEYEDAVKDCESAL----SI-----DPHYSKAYGRLGLAYLALGKYEEA 168 (304)
T ss_pred HHHHHHHhcCC----CcchHHHHHHHHHHHhcchHHHHHHHHHHH----hc-----ChHHHHHHHHHHHHHHccCcHHHH
Confidence 35666777774 24433 4444578899999998887642 22 235677788899999999999999
Q ss_pred HHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHH
Q 000425 826 KETFEVIADYESILDLFICHLNPSAMRRLAQRLE 859 (1525)
Q Consensus 826 eec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae 859 (1525)
+++|+|+=+++- |-+-.++=.++|+
T Consensus 169 ~~aykKaLeldP---------~Ne~~K~nL~~Ae 193 (304)
T KOG0553|consen 169 IEAYKKALELDP---------DNESYKSNLKIAE 193 (304)
T ss_pred HHHHHhhhccCC---------CcHHHHHHHHHHH
Confidence 999999977663 4444454455454
No 417
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=60.48 E-value=19 Score=37.74 Aligned_cols=49 Identities=16% Similarity=0.237 Sum_probs=30.9
Q ss_pred hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHH
Q 000425 774 IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVI 832 (1525)
Q Consensus 774 l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~ 832 (1525)
++.|+++.|+.++.... ..+-. -..|..+|+..++.|+.+-|..+|+++
T Consensus 96 ~~~~~~d~Al~~L~~~~-------~~~~~---~~~~~~~Gdi~~~~g~~~~A~~~y~~A 144 (145)
T PF09976_consen 96 LQQGQYDEALATLQQIP-------DEAFK---ALAAELLGDIYLAQGDYDEARAAYQKA 144 (145)
T ss_pred HHcCCHHHHHHHHHhcc-------CcchH---HHHHHHHHHHHHHCCCHHHHHHHHHHh
Confidence 56778888888875421 11111 123446788888888888888888764
No 418
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=60.10 E-value=13 Score=29.08 Aligned_cols=27 Identities=22% Similarity=0.502 Sum_probs=22.6
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhh
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVIADY 835 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~~d~ 835 (1525)
|-.+|...+..|+++.|++||+++=.+
T Consensus 4 ~~~~g~~~~~~~~~~~A~~~~~~al~~ 30 (34)
T PF00515_consen 4 YYNLGNAYFQLGDYEEALEYYQRALEL 30 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCchHHHHHHHHHHHH
Confidence 556999999999999999999997543
No 419
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=59.90 E-value=1.4e+02 Score=36.38 Aligned_cols=143 Identities=20% Similarity=0.259 Sum_probs=91.2
Q ss_pred HHHHHHHhccCCCCCChhHHHHHHcCCCCc-ccchhhccccCCCCCcchhhhHHhhcCCH-------HHHHHHHHHhhcc
Q 000425 721 EILYQITSRFDSLRITPRSLDILAKGPPVC-GDLAVSLSQAGPQFTQVLRGIYAIKALRF-------STALSVLKDEFLR 792 (1525)
Q Consensus 721 ~~l~~~~~~~~s~~i~~~~l~~L~~~~~~~-~elAl~ls~~~~~~d~~~rF~LAl~lg~l-------~~A~~ia~~~~~~ 792 (1525)
.+......++++..+ +-+++|-...++- -++=+++++ ....-||+-+.+||| |-|++|=... .+
T Consensus 26 q~~~~qa~~lsr~Yv--~GlNfLLs~Q~dKAvdlF~e~l~-----~d~~t~e~~ltLGnLfRsRGEvDRAIRiHQ~L-~~ 97 (389)
T COG2956 26 QDKQDQANRLSRDYV--KGLNFLLSNQPDKAVDLFLEMLQ-----EDPETFEAHLTLGNLFRSRGEVDRAIRIHQTL-LE 97 (389)
T ss_pred hhHHHHHhhccHHHH--hHHHHHhhcCcchHHHHHHHHHh-----cCchhhHHHHHHHHHHHhcchHHHHHHHHHHH-hc
Confidence 344444444443222 2466665543222 344456666 333468888888775 7888885443 23
Q ss_pred cCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh--------hhHHHHHHHHhCCHHHHHHHHHHHHHhcCC
Q 000425 793 SRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD--------YESILDLFICHLNPSAMRRLAQRLEEEGAN 864 (1525)
Q Consensus 793 ~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d--------~~~LllLy~~~gd~e~L~kLa~~ae~~g~~ 864 (1525)
++|++ -..... --.+||+--|+.|-+|-||+-|..+-| ..-|+-||-.+-+-+|--..|.....-|..
T Consensus 98 spdlT---~~qr~l-Al~qL~~Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~KAId~A~~L~k~~~q 173 (389)
T COG2956 98 SPDLT---FEQRLL-ALQQLGRDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQ 173 (389)
T ss_pred CCCCc---hHHHHH-HHHHHHHHHHHhhhhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHHHHHHHHHcCCc
Confidence 44443 222222 245899999999999999999999887 778999999999988877777755444433
Q ss_pred hh----HHHHHHHHH
Q 000425 865 PE----LRRYCERIL 875 (1525)
Q Consensus 865 ~~----~~~~c~~il 875 (1525)
.+ +.|+||--+
T Consensus 174 ~~~~eIAqfyCELAq 188 (389)
T COG2956 174 TYRVEIAQFYCELAQ 188 (389)
T ss_pred cchhHHHHHHHHHHH
Confidence 32 377887644
No 420
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=59.18 E-value=16 Score=28.42 Aligned_cols=28 Identities=25% Similarity=0.453 Sum_probs=25.2
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhhh
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVIADYE 836 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~~d~~ 836 (1525)
|-.+|......|+++.|.++|+++-+++
T Consensus 4 ~~~lg~~y~~~~~~~~A~~~~~~a~~~~ 31 (34)
T PF13181_consen 4 YYNLGKIYEQLGDYEEALEYFEKALELN 31 (34)
T ss_dssp HHHHHHHHHHTTSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHhhC
Confidence 5579999999999999999999998765
No 421
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=59.01 E-value=3.8 Score=47.47 Aligned_cols=87 Identities=25% Similarity=0.387 Sum_probs=0.0
Q ss_pred cchhhccccCCCCCcchhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHH
Q 000425 752 DLAVSLSQAGPQFTQVLRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEV 831 (1525)
Q Consensus 752 elAl~ls~~~~~~d~~~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~ 831 (1525)
+.+++++. +-.+.|+++.|++++++.-. .. .+|.. ..-|+.+|+.|...++++.|+++|++
T Consensus 9 ~~~l~~A~------------~~~~~~~~~~Al~~L~~~~~---~~-~~~~~---~~~~~~~a~La~~~~~~~~A~~ay~~ 69 (280)
T PF13429_consen 9 EEALRLAR------------LLYQRGDYEKALEVLKKAAQ---KI-APPDD---PEYWRLLADLAWSLGDYDEAIEAYEK 69 (280)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred cccccccc------------cccccccccccccccccccc---cc-ccccc---cccccccccccccccccccccccccc
Confidence 66676664 35688999999999864311 11 01112 34588899999999999999999987
Q ss_pred Hh--------hhhHHHHHHHHhCCHHHHHHHHHHH
Q 000425 832 IA--------DYESILDLFICHLNPSAMRRLAQRL 858 (1525)
Q Consensus 832 ~~--------d~~~LllLy~~~gd~e~L~kLa~~a 858 (1525)
+= ++..|..| ...++.++-.+++..+
T Consensus 70 l~~~~~~~~~~~~~l~~l-~~~~~~~~A~~~~~~~ 103 (280)
T PF13429_consen 70 LLASDKANPQDYERLIQL-LQDGDPEEALKLAEKA 103 (280)
T ss_dssp -----------------------------------
T ss_pred cccccccccccccccccc-cccccccccccccccc
Confidence 64 33444444 3455555544444433
No 422
>PRK12496 hypothetical protein; Provisional
Probab=58.58 E-value=5.3 Score=43.66 Aligned_cols=24 Identities=33% Similarity=0.666 Sum_probs=20.8
Q ss_pred cccCCCCceecCCC-CCCCccccce
Q 000425 1485 DVCDLCGAKFSALS-APGCIICGMG 1508 (1525)
Q Consensus 1485 v~Cp~cgA~y~~~~-g~~C~iC~l~ 1508 (1525)
..|+-||..|...+ +..|++||--
T Consensus 128 ~~C~gC~~~~~~~~~~~~C~~CG~~ 152 (164)
T PRK12496 128 KVCKGCKKKYPEDYPDDVCEICGSP 152 (164)
T ss_pred EECCCCCccccCCCCCCcCCCCCCh
Confidence 46999999998877 8999999954
No 423
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=58.34 E-value=45 Score=47.02 Aligned_cols=100 Identities=13% Similarity=0.146 Sum_probs=64.1
Q ss_pred CcchhhhHH---hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHH-------HH--HHHHHHHHHHcCChHHHHHHHHHH
Q 000425 765 TQVLRGIYA---IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLF-------HR--FRQLGYACIKYGQFDSAKETFEVI 832 (1525)
Q Consensus 765 d~~~rF~LA---l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~-------~r--wk~Lg~~AL~~G~~~lAeec~q~~ 832 (1525)
|.+..+.|+ .+.|+++.|.+.+++.-...++.+ . ...| .- ...+|+.+++.|+++.|+++|+++
T Consensus 302 ~~~a~~~Lg~~~~~~g~~~eA~~~l~~Al~~~p~~~---~-~~~~~~ll~~~~~~~~~~~g~~~~~~g~~~eA~~~~~~A 377 (1157)
T PRK11447 302 DSEALGALGQAYSQQGDRARAVAQFEKALALDPHSS---N-RDKWESLLKVNRYWLLIQQGDAALKANNLAQAERLYQQA 377 (1157)
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCcc---c-hhHHHHHHHhhhHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 445555544 578999999999887642222221 1 0111 11 134599999999999999999998
Q ss_pred hhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhHHHHHHHHHh
Q 000425 833 ADYESILDLFICHLNPSAMRRLAQRLEEEGANPELRRYCERILR 876 (1525)
Q Consensus 833 ~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~~~~c~~il~ 876 (1525)
-..+. .+...+..|+......|....+...+++.|.
T Consensus 378 l~~~P--------~~~~a~~~Lg~~~~~~g~~~eA~~~y~~aL~ 413 (1157)
T PRK11447 378 RQVDN--------TDSYAVLGLGDVAMARKDYAAAERYYQQALR 413 (1157)
T ss_pred HHhCC--------CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 65432 4555666666666666666666666666554
No 424
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=58.00 E-value=70 Score=39.10 Aligned_cols=83 Identities=12% Similarity=0.062 Sum_probs=52.3
Q ss_pred hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh--------hhHHHH
Q 000425 769 RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD--------YESILD 840 (1525)
Q Consensus 769 rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d--------~~~Lll 840 (1525)
+.+-|...|+++.|++.+++.-...++ . ..-|-.+|...+..|+++.|..+|+++=+ |-.|-.
T Consensus 8 ~a~~a~~~~~~~~Ai~~~~~Al~~~P~------~---~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~lg~ 78 (356)
T PLN03088 8 KAKEAFVDDDFALAVDLYTQAIDLDPN------N---AELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGT 78 (356)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHhCCC------C---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHHHHH
Confidence 455677778888888877765321111 1 22355688888888999888888888743 333455
Q ss_pred HHHHhCCHHHHHHHHHHHHH
Q 000425 841 LFICHLNPSAMRRLAQRLEE 860 (1525)
Q Consensus 841 Ly~~~gd~e~L~kLa~~ae~ 860 (1525)
+|...|+.+.-.+..+.|.+
T Consensus 79 ~~~~lg~~~eA~~~~~~al~ 98 (356)
T PLN03088 79 ACMKLEEYQTAKAALEKGAS 98 (356)
T ss_pred HHHHhCCHHHHHHHHHHHHH
Confidence 66667776544444444443
No 425
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=57.98 E-value=2.9e+02 Score=31.73 Aligned_cols=141 Identities=17% Similarity=0.180 Sum_probs=81.0
Q ss_pred CCCCEEEEEEeccceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE-------
Q 000425 16 PLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT------- 88 (1525)
Q Consensus 16 P~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t------- 88 (1525)
..+.++++|.. + .+.++.. ........+. +..+|..+..-|.-+.+++=+ |+++.++|+........
T Consensus 5 ~~~~~L~vGt~-~-Gl~~~~~-~~~~~~~~i~--~~~~I~ql~vl~~~~~llvLs-d~~l~~~~L~~l~~~~~~~~~~~~ 78 (275)
T PF00780_consen 5 SWGDRLLVGTE-D-GLYVYDL-SDPSKPTRIL--KLSSITQLSVLPELNLLLVLS-DGQLYVYDLDSLEPVSTSAPLAFP 78 (275)
T ss_pred cCCCEEEEEEC-C-CEEEEEe-cCCccceeEe--ecceEEEEEEecccCEEEEEc-CCccEEEEchhhcccccccccccc
Confidence 34678888854 3 3656554 2223333332 345599999999877766544 59999999876544331
Q ss_pred --------EecCCceeecCceeEE--EeCCCCEEEEEeCCCcEEEEeCCCC--Cc-eeeEeecccCcCeEEEEEcCCCCE
Q 000425 89 --------LQLDNTIKLLGAGAFA--FHPTLEWLFVGDRRGTLLAWDVSIE--RP-SMIGIIQVGSQPITSVAWLPMLRL 155 (1525)
Q Consensus 89 --------L~H~~~V~~~ga~sVa--FSPdg~~L~SgS~DgtVrVWDl~t~--~~-s~i~il~gH~~~VtsVafsPdg~~ 155 (1525)
+.....+. .++ -...+...++....++|.+|.+... .. .....+. -...+.+++|. ++.
T Consensus 79 ~~~~~~~~~~~~~~v~-----~f~~~~~~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~-lp~~~~~i~~~--~~~ 150 (275)
T PF00780_consen 79 KSRSLPTKLPETKGVS-----FFAVNGGHEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEIS-LPDPPSSIAFL--GNK 150 (275)
T ss_pred ccccccccccccCCee-----EEeeccccccceEEEEEECCEEEEEEEECCcccccceeEEEE-cCCCcEEEEEe--CCE
Confidence 11223333 333 1134444555555669999987752 11 2222222 24678889998 456
Q ss_pred EEEEECCCCeeEEEee
Q 000425 156 LVTLCRDGSLQVWKTR 171 (1525)
Q Consensus 156 LaSgS~DgtIkVWDlr 171 (1525)
|+.|..+ ...+.|+.
T Consensus 151 i~v~~~~-~f~~idl~ 165 (275)
T PF00780_consen 151 ICVGTSK-GFYLIDLN 165 (275)
T ss_pred EEEEeCC-ceEEEecC
Confidence 6666644 35566654
No 426
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=56.43 E-value=32 Score=41.69 Aligned_cols=108 Identities=14% Similarity=0.126 Sum_probs=60.0
Q ss_pred cCCCCEEEEEEec--------cceEEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 15 TPLQPVVFFGFHR--------RMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 15 hP~qp~Vf~s~~~--------d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
+|++.++++.... ...+.+++. +.+ ....+.. ....+....|+|+|..++... ++.|.+++..++..
T Consensus 1 S~d~~~~l~~~~~~~~~r~s~~~~y~i~d~-~~~-~~~~l~~-~~~~~~~~~~sP~g~~~~~v~-~~nly~~~~~~~~~- 75 (353)
T PF00930_consen 1 SPDGKFVLFATNYTKQWRHSFKGDYYIYDI-ETG-EITPLTP-PPPKLQDAKWSPDGKYIAFVR-DNNLYLRDLATGQE- 75 (353)
T ss_dssp -TTSSEEEEEEEEEEESSSEEEEEEEEEET-TTT-EEEESS--EETTBSEEEE-SSSTEEEEEE-TTEEEEESSTTSEE-
T ss_pred CCCCCeEEEEECcEEeeeeccceeEEEEec-CCC-ceEECcC-CccccccceeecCCCeeEEEe-cCceEEEECCCCCe-
Confidence 5777777774432 123334443 221 1222222 145788899999999988775 57899999877733
Q ss_pred EEEecCC--ce----e--------ecCceeEEEeCCCCEEEEEe-CCCcEEEEeCC
Q 000425 87 YTLQLDN--TI----K--------LLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVS 127 (1525)
Q Consensus 87 ~tL~H~~--~V----~--------~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~ 127 (1525)
..+.++. .+ . +.....+-||||+++|+... ++..|+.+.+-
T Consensus 76 ~~lT~dg~~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~ 131 (353)
T PF00930_consen 76 TQLTTDGEPGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLP 131 (353)
T ss_dssp EESES--TTTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEE
T ss_pred EEeccccceeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEee
Confidence 2233111 10 0 11124788999999998776 66777766544
No 427
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=56.33 E-value=1.1e+02 Score=26.75 Aligned_cols=54 Identities=20% Similarity=0.266 Sum_probs=38.6
Q ss_pred HHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh
Q 000425 772 YAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD 834 (1525)
Q Consensus 772 LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d 834 (1525)
...+.|+++.|.+.+++.-. ..| .. ..-|..+|......|+++.|.++|+.+-+
T Consensus 9 ~~~~~~~~~~A~~~~~~~~~---~~~---~~---~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 62 (100)
T cd00189 9 LYYKLGDYDEALEYYEKALE---LDP---DN---ADAYYNLAAAYYKLGKYEEALEDYEKALE 62 (100)
T ss_pred HHHHHhcHHHHHHHHHHHHh---cCC---cc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34567889999888876531 111 11 13356799999999999999999998754
No 428
>PRK13684 Ycf48-like protein; Provisional
Probab=56.11 E-value=4e+02 Score=32.38 Aligned_cols=148 Identities=16% Similarity=0.082 Sum_probs=81.2
Q ss_pred cEEEEEEcCCCCEEEEEEeccceEEEEEee-cCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE
Q 000425 8 TEVHLALTPLQPVVFFGFHRRMSVTVVGTV-EGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH 86 (1525)
Q Consensus 8 ~e~~LA~hP~qp~Vf~s~~~d~sI~vv~ti-eggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l 86 (1525)
....+++.|.+.+++++. +..+ +.+. .++..-..+...-...++++.+.++++.+++| ..|.+++=+...+...
T Consensus 174 ~~~~i~~~~~g~~v~~g~--~G~i--~~s~~~gg~tW~~~~~~~~~~l~~i~~~~~g~~~~vg-~~G~~~~~s~d~G~sW 248 (334)
T PRK13684 174 VVRNLRRSPDGKYVAVSS--RGNF--YSTWEPGQTAWTPHQRNSSRRLQSMGFQPDGNLWMLA-RGGQIRFNDPDDLESW 248 (334)
T ss_pred eEEEEEECCCCeEEEEeC--CceE--EEEcCCCCCeEEEeeCCCcccceeeeEcCCCCEEEEe-cCCEEEEccCCCCCcc
Confidence 345677888887766554 2322 2222 23322223333345678999999999887776 4576654233444333
Q ss_pred EEEe-c----CCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEee--cccCcCeEEEEEcCCCCEEEEE
Q 000425 87 YTLQ-L----DNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGII--QVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 87 ~tL~-H----~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il--~gH~~~VtsVafsPdg~~LaSg 159 (1525)
.... + ...+. .+.|++++..++ ++.+|.+.. ..... .+..... ..-......+.|..++..+++
T Consensus 249 ~~~~~~~~~~~~~l~-----~v~~~~~~~~~~-~G~~G~v~~-S~d~G-~tW~~~~~~~~~~~~~~~~~~~~~~~~~~~- 319 (334)
T PRK13684 249 SKPIIPEITNGYGYL-----DLAYRTPGEIWA-GGGNGTLLV-SKDGG-KTWEKDPVGEEVPSNFYKIVFLDPEKGFVL- 319 (334)
T ss_pred ccccCCcccccccee-----eEEEcCCCCEEE-EcCCCeEEE-eCCCC-CCCeECCcCCCCCcceEEEEEeCCCceEEE-
Confidence 3222 2 13356 889999877554 445676543 22211 1111100 111235677888877777665
Q ss_pred ECCCCeeEEE
Q 000425 160 CRDGSLQVWK 169 (1525)
Q Consensus 160 S~DgtIkVWD 169 (1525)
+..|.|-.|+
T Consensus 320 G~~G~il~~~ 329 (334)
T PRK13684 320 GQRGVLLRYV 329 (334)
T ss_pred CCCceEEEec
Confidence 4678887665
No 429
>KOG1925 consensus Rac1 GTPase effector FHOS [Signal transduction mechanisms; Cytoskeleton]
Probab=55.74 E-value=14 Score=45.83 Aligned_cols=26 Identities=27% Similarity=0.391 Sum_probs=12.3
Q ss_pred CCCCCCCCCCCCCCcccccCCCCCCCCC
Q 000425 1219 DGGVPPQIAPQPAIPVESIGLPDGGVPP 1246 (1525)
Q Consensus 1219 ~~~~~~~~~~~~~~~~~~~~l~~~g~~~ 1246 (1525)
..|||+..+....+|....- -|||||
T Consensus 226 ~~~~P~~P~~P~~~P~~~~L--~~GvPP 251 (817)
T KOG1925|consen 226 EPKEPLIPASPKELPTRDFL--LSGVPP 251 (817)
T ss_pred CCCCCCCCCChhccCCchhh--hcCCCC
Confidence 55666663333333332221 267776
No 430
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=55.62 E-value=33 Score=31.08 Aligned_cols=57 Identities=16% Similarity=0.182 Sum_probs=42.6
Q ss_pred hHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhh
Q 000425 771 IYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYE 836 (1525)
Q Consensus 771 ~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~ 836 (1525)
+++++.++++.|++.+...-.-.+ +...-|..+|..-...|+++.|.++|+++=+.+
T Consensus 3 ~~~~~~~~~~~A~~~~~~~l~~~p---------~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~ 59 (73)
T PF13371_consen 3 QIYLQQEDYEEALEVLERALELDP---------DDPELWLQRARCLFQLGRYEEALEDLERALELS 59 (73)
T ss_pred HHHHhCCCHHHHHHHHHHHHHhCc---------ccchhhHHHHHHHHHhccHHHHHHHHHHHHHHC
Confidence 567888899999988876532111 123447789999999999999999999876443
No 431
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=55.12 E-value=81 Score=33.69 Aligned_cols=82 Identities=16% Similarity=0.289 Sum_probs=51.1
Q ss_pred HHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh--------hhHHHHHHH
Q 000425 772 YAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD--------YESILDLFI 843 (1525)
Q Consensus 772 LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d--------~~~LllLy~ 843 (1525)
+..+.|+++.|.+..++.-. .+.-+ ....-|..+|...+..|+++.|+++|+++-. +..|..+|.
T Consensus 108 ~~~~~g~~~~A~~~~~~~~~----~~~~~---~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~ 180 (234)
T TIGR02521 108 FLCQQGKYEQAMQQFEQAIE----DPLYP---QPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYY 180 (234)
T ss_pred HHHHcccHHHHHHHHHHHHh----ccccc---cchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHH
Confidence 34577899999888877531 11111 1233466789999999999999999988653 334444555
Q ss_pred HhCCHHHHHHHHHHHHH
Q 000425 844 CHLNPSAMRRLAQRLEE 860 (1525)
Q Consensus 844 ~~gd~e~L~kLa~~ae~ 860 (1525)
..|+.+.-.++.+.+.+
T Consensus 181 ~~~~~~~A~~~~~~~~~ 197 (234)
T TIGR02521 181 LRGQYKDARAYLERYQQ 197 (234)
T ss_pred HcCCHHHHHHHHHHHHH
Confidence 56665554444444443
No 432
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=54.22 E-value=39 Score=29.35 Aligned_cols=31 Identities=13% Similarity=0.124 Sum_probs=25.6
Q ss_pred CCeEEEEEcCCC---CEEEEEECCCcEEEEEcCC
Q 000425 52 KPIVNLACHPRL---PVLYVAYADGLIRAYNIHT 82 (1525)
Q Consensus 52 ~~V~sLaf~Pdg---~~LaSGs~DgtIrLWDl~t 82 (1525)
+.|+++.|+|.. .+|+.+-.-+.|.|+|+++
T Consensus 1 GAvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43)
T PF10313_consen 1 GAVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43)
T ss_pred CCeEEEEeCCCCCcccEEEEEccCCeEEEEEccc
Confidence 468999999854 4777777789999999984
No 433
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=53.83 E-value=30 Score=30.84 Aligned_cols=58 Identities=31% Similarity=0.371 Sum_probs=45.9
Q ss_pred HHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhHHHHHHHHHh
Q 000425 811 QLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSAMRRLAQRLEEEGANPELRRYCERILR 876 (1525)
Q Consensus 811 ~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~~~~c~~il~ 876 (1525)
.+|...+..|+++.|+++|+.+=... -+|.+.+..++......|....++...++++.
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~--------P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~ 59 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQD--------PDNPEAWYLLGRILYQQGRYDEALAYYERALE 59 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCS--------TTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHC--------CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 47899999999999999999886554 44778888888888888877777777766554
No 434
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=53.68 E-value=1.7e+02 Score=34.07 Aligned_cols=103 Identities=12% Similarity=-0.006 Sum_probs=59.3
Q ss_pred CeEEEEEcCCCCEEEEEE-CCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEe-CCCCC
Q 000425 53 PIVNLACHPRLPVLYVAY-ADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWD-VSIER 130 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs-~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWD-l~t~~ 130 (1525)
.+.+.++++++..++.-. .++.-.||-...+.....+.....+. .-.|++++...+....+...+++. ..+..
T Consensus 25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~~~~~~~~~g~~l~-----~PS~d~~g~~W~v~~~~~~~~~~~~~~~g~ 99 (253)
T PF10647_consen 25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGPAGGPVRPVLTGGSLT-----RPSWDPDGWVWTVDDGSGGVRVVRDSASGT 99 (253)
T ss_pred cccceEECCCCCeEEEEEEcCCCCEEEEEcCCCcceeeccCCccc-----cccccCCCCEEEEEcCCCceEEEEecCCCc
Confidence 688899999998765544 33344444444333333332223566 778999976555555666667763 22211
Q ss_pred ceeeEe-ecccCcCeEEEEEcCCCCEEEEEE
Q 000425 131 PSMIGI-IQVGSQPITSVAWLPMLRLLVTLC 160 (1525)
Q Consensus 131 ~s~i~i-l~gH~~~VtsVafsPdg~~LaSgS 160 (1525)
...... ...-...|+.+.+||||..++.-.
T Consensus 100 ~~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~ 130 (253)
T PF10647_consen 100 GEPVEVDWPGLRGRITALRVSPDGTRVAVVV 130 (253)
T ss_pred ceeEEecccccCCceEEEEECCCCcEEEEEE
Confidence 111100 011111899999999998776655
No 435
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=53.29 E-value=1.3e+02 Score=32.03 Aligned_cols=83 Identities=12% Similarity=0.119 Sum_probs=56.2
Q ss_pred hhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhh----------hHHH
Q 000425 770 GIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADY----------ESIL 839 (1525)
Q Consensus 770 F~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~----------~~Ll 839 (1525)
-.+....|+++.|.+.+++.-... |... .-|..+|...+..|+++.|++.|+++-+. ..|-
T Consensus 72 a~~~~~~~~~~~A~~~~~~al~~~------~~~~---~~~~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~ 142 (234)
T TIGR02521 72 ALYYQQLGELEKAEDSFRRALTLN------PNNG---DVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAG 142 (234)
T ss_pred HHHHHHcCCHHHHHHHHHHHHhhC------CCCH---HHHHHHHHHHHHcccHHHHHHHHHHHHhccccccchHHHHHHH
Confidence 345677899999999887764211 1122 23567999999999999999999998532 2344
Q ss_pred HHHHHhCCHHHHHHHHHHHHHh
Q 000425 840 DLFICHLNPSAMRRLAQRLEEE 861 (1525)
Q Consensus 840 lLy~~~gd~e~L~kLa~~ae~~ 861 (1525)
.+|...|+.+.-.++.+.+...
T Consensus 143 ~~~~~~g~~~~A~~~~~~~~~~ 164 (234)
T TIGR02521 143 LCALKAGDFDKAEKYLTRALQI 164 (234)
T ss_pred HHHHHcCCHHHHHHHHHHHHHh
Confidence 5566667766666665555443
No 436
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=52.59 E-value=4.8 Score=34.23 Aligned_cols=22 Identities=32% Similarity=0.606 Sum_probs=14.9
Q ss_pred ccCCCCceec-CC--CCC-CCccccc
Q 000425 1486 VCDLCGAKFS-AL--SAP-GCIICGM 1507 (1525)
Q Consensus 1486 ~Cp~cgA~y~-~~--~g~-~C~iC~l 1507 (1525)
+||.||++.. -+ .|. +|+.||+
T Consensus 2 ~Cp~Cg~~~~~~D~~~g~~vC~~CG~ 27 (43)
T PF08271_consen 2 KCPNCGSKEIVFDPERGELVCPNCGL 27 (43)
T ss_dssp SBTTTSSSEEEEETTTTEEEETTT-B
T ss_pred CCcCCcCCceEEcCCCCeEECCCCCC
Confidence 6999999862 22 244 8999964
No 437
>KOG2377 consensus Uncharacterized conserved protein [Function unknown]
Probab=52.56 E-value=1.7e+02 Score=36.89 Aligned_cols=115 Identities=12% Similarity=0.125 Sum_probs=73.8
Q ss_pred CCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-----cCCceeecCceeEEEeCCCCEEEEEeCCCcEEE
Q 000425 49 DLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-----LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLA 123 (1525)
Q Consensus 49 ~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-----H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrV 123 (1525)
..+++|++|.|++|.+.+|+--.|.+|.+++....+....+. ....|. ...|+... -++-.++. -+-+
T Consensus 64 ~d~G~I~SIkFSlDnkilAVQR~~~~v~f~nf~~d~~~l~~~~~ck~k~~~Il-----GF~W~~s~-e~A~i~~~-G~e~ 136 (657)
T KOG2377|consen 64 DDKGEIKSIKFSLDNKILAVQRTSKTVDFCNFIPDNSQLEYTQECKTKNANIL-----GFCWTSST-EIAFITDQ-GIEF 136 (657)
T ss_pred cCCCceeEEEeccCcceEEEEecCceEEEEecCCCchhhHHHHHhccCcceeE-----EEEEecCe-eEEEEecC-CeEE
Confidence 456899999999999999999999999999874333322222 233466 77787653 33433333 3566
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEE--CCCCeeEEEee
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLC--RDGSLQVWKTR 171 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS--~DgtIkVWDlr 171 (1525)
|-+....+++ .....|...|+-..|.++-+.++-|+ ..+++.=+.++
T Consensus 137 y~v~pekrsl-RlVks~~~nvnWy~yc~et~v~LL~t~~~~n~lnpf~~~ 185 (657)
T KOG2377|consen 137 YQVLPEKRSL-RLVKSHNLNVNWYMYCPETAVILLSTTVLENVLNPFHFR 185 (657)
T ss_pred EEEchhhhhh-hhhhhcccCccEEEEccccceEeeeccccccccccEEEe
Confidence 6655432222 23457888999999999987644433 34455444444
No 438
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=52.13 E-value=67 Score=40.04 Aligned_cols=72 Identities=18% Similarity=0.349 Sum_probs=52.1
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-cC--------CceeecCceeEEEeC-----CC---CEE
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ-LD--------NTIKLLGAGAFAFHP-----TL---EWL 112 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~--------~~V~~~ga~sVaFSP-----dg---~~L 112 (1525)
..++|++++.+ +=-++|+|+.||.+.|.|++....++.-. .+ ..+. ++.|.. |+ ..+
T Consensus 85 ~~g~vtal~~S-~iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt-----~ieF~vm~~~~D~ySSi~L 158 (395)
T PF08596_consen 85 KQGPVTALKNS-DIGFVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVT-----SIEFSVMTLGGDGYSSICL 158 (395)
T ss_dssp -S-SEEEEEE--BTSEEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EE-----EEEEEEEE-TTSSSEEEEE
T ss_pred cCCcEeEEecC-CCcEEEEEecCCcEEEEECCCCeEEeeccccccccccccccCee-----EEEEEEEecCCCcccceEE
Confidence 47899999987 44589999999999999999999888854 21 2455 677752 33 457
Q ss_pred EEEeCCCcEEEEeCC
Q 000425 113 FVGDRRGTLLAWDVS 127 (1525)
Q Consensus 113 ~SgS~DgtVrVWDl~ 127 (1525)
++|...|.+.+|.+.
T Consensus 159 ~vGTn~G~v~~fkIl 173 (395)
T PF08596_consen 159 LVGTNSGNVLTFKIL 173 (395)
T ss_dssp EEEETTSEEEEEEEE
T ss_pred EEEeCCCCEEEEEEe
Confidence 778888888888665
No 439
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=51.95 E-value=5.3e+02 Score=32.40 Aligned_cols=147 Identities=14% Similarity=0.021 Sum_probs=80.8
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEeecCCccc-eEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEE-
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGTVEGGRAP-TKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVH- 86 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~tieggk~~-tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l- 86 (1525)
.+-+...+++.+++++.. .. ++-..+.+... ..+...-...+.++.|.+++.+++++ .+|.+ +|....++..
T Consensus 241 f~~v~~~~dG~~~~vg~~--G~--~~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g-~~G~l-~~S~d~G~~~~ 314 (398)
T PLN00033 241 FSTVNRSPDGDYVAVSSR--GN--FYLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLT-RGGGL-YVSKGTGLTEE 314 (398)
T ss_pred eeeEEEcCCCCEEEEECC--cc--EEEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEe-CCceE-EEecCCCCccc
Confidence 344566677777776653 22 23333444332 22333345678999999999887766 55654 3444444322
Q ss_pred -EEEe---cC---CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE
Q 000425 87 -YTLQ---LD---NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL 159 (1525)
Q Consensus 87 -~tL~---H~---~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg 159 (1525)
..|. .. ..+. ++.|.+++..+ .++.+|.+....-.-..-+....-..-......+.|.++++.+++|
T Consensus 315 ~~~f~~~~~~~~~~~l~-----~v~~~~d~~~~-a~G~~G~v~~s~D~G~tW~~~~~~~~~~~~ly~v~f~~~~~g~~~G 388 (398)
T PLN00033 315 DFDFEEADIKSRGFGIL-----DVGYRSKKEAW-AAGGSGILLRSTDGGKSWKRDKGADNIAANLYSVKFFDDKKGFVLG 388 (398)
T ss_pred ccceeecccCCCCcceE-----EEEEcCCCcEE-EEECCCcEEEeCCCCcceeEccccCCCCcceeEEEEcCCCceEEEe
Confidence 1232 11 2466 88998887755 4556777766532211111110001223456789998887777765
Q ss_pred ECCCCeeEE
Q 000425 160 CRDGSLQVW 168 (1525)
Q Consensus 160 S~DgtIkVW 168 (1525)
.+|.|--|
T Consensus 389 -~~G~il~~ 396 (398)
T PLN00033 389 -NDGVLLRY 396 (398)
T ss_pred -CCcEEEEe
Confidence 56766444
No 440
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=51.83 E-value=64 Score=35.69 Aligned_cols=82 Identities=15% Similarity=0.196 Sum_probs=61.7
Q ss_pred hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhh-----------H
Q 000425 769 RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYE-----------S 837 (1525)
Q Consensus 769 rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~-----------~ 837 (1525)
+-.-+-++.+|+.-++-+++... ......-|..||+...+-|+++.|.+||.++.|+- .
T Consensus 9 ~~~~~~~~~~Le~elk~~~~n~~----------kesir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~ 78 (177)
T PF10602_consen 9 KAKNAEELEKLEAELKDAKSNLG----------KESIRMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLN 78 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHhccc----------hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHH
Confidence 33445666678888887765321 22345557789999999999999999999999875 4
Q ss_pred HHHHHHHhCCHHHHHHHHHHHHH
Q 000425 838 ILDLFICHLNPSAMRRLAQRLEE 860 (1525)
Q Consensus 838 LllLy~~~gd~e~L~kLa~~ae~ 860 (1525)
++-+.+..||.+.+.+...+|+.
T Consensus 79 ~irv~i~~~d~~~v~~~i~ka~~ 101 (177)
T PF10602_consen 79 VIRVAIFFGDWSHVEKYIEKAES 101 (177)
T ss_pred HHHHHHHhCCHHHHHHHHHHHHH
Confidence 55677788999988888877754
No 441
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=51.64 E-value=1e+02 Score=34.58 Aligned_cols=29 Identities=14% Similarity=0.188 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHcCChHHHHHHHHHHhh
Q 000425 806 FHRFRQLGYACIKYGQFDSAKETFEVIAD 834 (1525)
Q Consensus 806 ~~rwk~Lg~~AL~~G~~~lAeec~q~~~d 834 (1525)
..-|..||+.++..|+++.|.+||+++-.
T Consensus 73 ~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~ 101 (198)
T PRK10370 73 SEQWALLGEYYLWRNDYDNALLAYRQALQ 101 (198)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 44577899999999999999999999874
No 442
>PF11715 Nup160: Nucleoporin Nup120/160; InterPro: IPR021717 Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear core complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins. most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup 133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=51.59 E-value=66 Score=41.36 Aligned_cols=39 Identities=18% Similarity=0.204 Sum_probs=31.3
Q ss_pred CCeEEEEEcC----CCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 52 KPIVNLACHP----RLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 52 ~~V~sLaf~P----dg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
..+.++++++ +..++++-+.|+++|+||+.+++++.++.
T Consensus 215 ~~~~~~~~~~~~~~~~~~l~tl~~D~~LRiW~l~t~~~~~~~~ 257 (547)
T PF11715_consen 215 SVAASLAVSSSEINDDTFLFTLSRDHTLRIWSLETGQCLATID 257 (547)
T ss_dssp --EEEEEE-----ETTTEEEEEETTSEEEEEETTTTCEEEEEE
T ss_pred CccceEEEecceeCCCCEEEEEeCCCeEEEEECCCCeEEEEec
Confidence 4566777877 67899999999999999999999988886
No 443
>PRK10115 protease 2; Provisional
Probab=51.56 E-value=1.7e+02 Score=39.22 Aligned_cols=110 Identities=11% Similarity=0.058 Sum_probs=62.6
Q ss_pred CeEEEEEcCCCCEEEEEEC-----CCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeC-C-----CcE
Q 000425 53 PIVNLACHPRLPVLYVAYA-----DGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDR-R-----GTL 121 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs~-----DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~-D-----gtV 121 (1525)
.+..+.|+|++++|+.+.+ ...|++.|+.+++.+...- ...-. .++|++|++.|+-+.. + ..|
T Consensus 128 ~l~~~~~Spdg~~la~~~d~~G~E~~~l~v~d~~tg~~l~~~i-~~~~~-----~~~w~~D~~~~~y~~~~~~~~~~~~v 201 (686)
T PRK10115 128 TLGGMAITPDNTIMALAEDFLSRRQYGIRFRNLETGNWYPELL-DNVEP-----SFVWANDSWTFYYVRKHPVTLLPYQV 201 (686)
T ss_pred EEeEEEECCCCCEEEEEecCCCcEEEEEEEEECCCCCCCCccc-cCcce-----EEEEeeCCCEEEEEEecCCCCCCCEE
Confidence 3667899999998876543 2458888888775321111 11114 6899999987776543 2 356
Q ss_pred EEEeCCCC--CceeeEeecccCcCeEEEEEcC-CCCEEEEEECC---CCeeEEEe
Q 000425 122 LAWDVSIE--RPSMIGIIQVGSQPITSVAWLP-MLRLLVTLCRD---GSLQVWKT 170 (1525)
Q Consensus 122 rVWDl~t~--~~s~i~il~gH~~~VtsVafsP-dg~~LaSgS~D---gtIkVWDl 170 (1525)
..|++.+. ...++ +........-..+.+ ++++++..+.+ +.+.+|+.
T Consensus 202 ~~h~lgt~~~~d~lv--~~e~~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~ 254 (686)
T PRK10115 202 WRHTIGTPASQDELV--YEEKDDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDA 254 (686)
T ss_pred EEEECCCChhHCeEE--EeeCCCCEEEEEEEcCCCCEEEEEEECCccccEEEEEC
Confidence 66777764 22233 332222233223333 77765544333 35666764
No 444
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=50.78 E-value=2.1e+02 Score=34.65 Aligned_cols=111 Identities=14% Similarity=0.035 Sum_probs=59.1
Q ss_pred CCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe------cCCceeecCceeEEEeCCCCEEEEEeC-------
Q 000425 51 KKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ------LDNTIKLLGAGAFAFHPTLEWLFVGDR------- 117 (1525)
Q Consensus 51 ~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~------H~~~V~~~ga~sVaFSPdg~~L~SgS~------- 117 (1525)
.+.+.++..--.+..|++ +.++ +.+++.+++..+..+. .....+ .+...|+|.+-++...
T Consensus 65 p~~~~~~~~~d~~g~Lv~-~~~g-~~~~~~~~~~~~t~~~~~~~~~~~~r~N-----D~~v~pdG~~wfgt~~~~~~~~~ 137 (307)
T COG3386 65 PGGFSSGALIDAGGRLIA-CEHG-VRLLDPDTGGKITLLAEPEDGLPLNRPN-----DGVVDPDGRIWFGDMGYFDLGKS 137 (307)
T ss_pred CCCcccceeecCCCeEEE-Eccc-cEEEeccCCceeEEeccccCCCCcCCCC-----ceeEcCCCCEEEeCCCccccCcc
Confidence 344444443333334443 3333 6677876655533322 224566 8888999987664433
Q ss_pred ----CCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEE-ECCCCeeEEEee
Q 000425 118 ----RGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTL-CRDGSLQVWKTR 171 (1525)
Q Consensus 118 ----DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSg-S~DgtIkVWDlr 171 (1525)
-|.|...|.. .........|-..-+.++||||++.|..+ +..+.|.-|++.
T Consensus 138 ~~~~~G~lyr~~p~---g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d 193 (307)
T COG3386 138 EERPTGSLYRVDPD---GGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLD 193 (307)
T ss_pred ccCCcceEEEEcCC---CCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecC
Confidence 1223333322 11222233444555789999999765544 445667666654
No 445
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=50.72 E-value=62 Score=39.02 Aligned_cols=25 Identities=24% Similarity=0.460 Sum_probs=14.8
Q ss_pred cceeeeccccccceEEEEe-hhhhhhhccccC
Q 000425 945 HVGIYLGSIKGRGTIVEVT-EKSLVKDFIPAG 975 (1525)
Q Consensus 945 ~~~~ylg~~~~~~~~~~~~-~~~l~~~~~~~~ 975 (1525)
-+..|+|+ +.-.+ +..|.+++-.+|
T Consensus 80 k~~~YvGN------L~W~TTD~DL~~A~~S~G 105 (498)
T KOG4849|consen 80 KYCCYVGN------LLWYTTDADLLKALQSTG 105 (498)
T ss_pred eEEEEecc------eeEEeccHHHHHHHHhhh
Confidence 34567664 33344 777888775555
No 446
>KOG4648 consensus Uncharacterized conserved protein, contains LRR repeats [Function unknown]
Probab=49.78 E-value=26 Score=42.25 Aligned_cols=64 Identities=17% Similarity=0.246 Sum_probs=43.8
Q ss_pred hHHHHH-HHHHHhhcCChHHHHHHHHHHHH-----HHhc-C-----------CcchHHHHHHHHHHHHHHHHHchHHHHH
Q 000425 1310 ASVCFK-TGLAHLEQNQLPDALSCFDEAFL-----ALAK-D-----------HSRGADVKAQATICAQYKIAVTLLQEIL 1371 (1525)
Q Consensus 1310 ~~~~lk-~gy~~~t~gkf~eAl~~Fr~iL~-----~~v~-~-----------~~e~~e~~~li~icreYilal~ll~El~ 1371 (1525)
..++|| .|-++|++|||.||++||-..+- .|.. + ...+.++...|.+-+.|+-|-+- |-.+
T Consensus 96 ~~SEiKE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSR-R~~A 174 (536)
T KOG4648|consen 96 KASEIKERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSR-RMQA 174 (536)
T ss_pred hhHHHHHhhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHH-HHHH
Confidence 344554 59999999999999999988775 1111 0 12344678888888999888765 3444
Q ss_pred Hhh
Q 000425 1372 RLQ 1374 (1525)
Q Consensus 1372 R~~ 1374 (1525)
|.+
T Consensus 175 R~~ 177 (536)
T KOG4648|consen 175 RES 177 (536)
T ss_pred HHH
Confidence 544
No 447
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=49.55 E-value=12 Score=30.25 Aligned_cols=23 Identities=30% Similarity=0.769 Sum_probs=19.8
Q ss_pred ccCCCCceecCCC-CCCCccccce
Q 000425 1486 VCDLCGAKFSALS-APGCIICGMG 1508 (1525)
Q Consensus 1486 ~Cp~cgA~y~~~~-g~~C~iC~l~ 1508 (1525)
.|+.||-.|.+.. ...||+|+..
T Consensus 3 ~C~~CGy~y~~~~~~~~CP~Cg~~ 26 (33)
T cd00350 3 VCPVCGYIYDGEEAPWVCPVCGAP 26 (33)
T ss_pred ECCCCCCEECCCcCCCcCcCCCCc
Confidence 6999999999887 8899999874
No 448
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=49.50 E-value=1.2e+02 Score=37.10 Aligned_cols=102 Identities=12% Similarity=0.045 Sum_probs=57.9
Q ss_pred CeEEEEEcCCCCEEEEEE-----------CCC-cEEEEEcCC--CcE--EEEEe-cCCceeecCceeEEEeCCCCEEEEE
Q 000425 53 PIVNLACHPRLPVLYVAY-----------ADG-LIRAYNIHT--YAV--HYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVG 115 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs-----------~Dg-tIrLWDl~t--~k~--l~tL~-H~~~V~~~ga~sVaFSPdg~~L~Sg 115 (1525)
..+.|+|.++|+++++-. ..+ .|.+++-.+ ++. ...|. .....+ .+++.++| |+++
T Consensus 15 ~P~~ia~d~~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~-----Gi~~~~~G--lyV~ 87 (367)
T TIGR02604 15 NPIAVCFDERGRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVT-----GLAVAVGG--VYVA 87 (367)
T ss_pred CCceeeECCCCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCcc-----ceeEecCC--EEEe
Confidence 346689999999776643 223 677776432 332 23443 334456 88999988 4445
Q ss_pred eCCCcEEEEeCCCC-----Cce-eeEeecc----cCcCeEEEEEcCCCCEEEEEEC
Q 000425 116 DRRGTLLAWDVSIE-----RPS-MIGIIQV----GSQPITSVAWLPMLRLLVTLCR 161 (1525)
Q Consensus 116 S~DgtVrVWDl~t~-----~~s-~i~il~g----H~~~VtsVafsPdg~~LaSgS~ 161 (1525)
+.+...++.|.... ..+ +..-+.. +....+.+.|.|||.+.++.+.
T Consensus 88 ~~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpDG~LYv~~G~ 143 (367)
T TIGR02604 88 TPPDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPDGWLYFNHGN 143 (367)
T ss_pred CCCeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCCCCEEEeccc
Confidence 55544444455421 111 2111111 2345778999999988776553
No 449
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=49.09 E-value=2e+02 Score=34.06 Aligned_cols=101 Identities=13% Similarity=0.117 Sum_probs=66.0
Q ss_pred CCEEEEEECCCcEEEEEcCCCcEEEEEecCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCCCCCcee-eEeecccC
Q 000425 63 LPVLYVAYADGLIRAYNIHTYAVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVSIERPSM-IGIIQVGS 141 (1525)
Q Consensus 63 g~~LaSGs~DgtIrLWDl~t~k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~t~~~s~-i~il~gH~ 141 (1525)
.++++.|+..+.++.-|.++++..-.-.....+. +-+.- -|+.++.|...+.+.+.+..+...-. ..++. .
T Consensus 23 kT~v~igSHs~~~~avd~~sG~~~We~ilg~RiE-----~sa~v-vgdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~--~ 94 (354)
T KOG4649|consen 23 KTLVVIGSHSGIVIAVDPQSGNLIWEAILGVRIE-----CSAIV-VGDFVVLGCYSGGLYFLCVKTGSQIWNFVILE--T 94 (354)
T ss_pred ceEEEEecCCceEEEecCCCCcEEeehhhCceee-----eeeEE-ECCEEEEEEccCcEEEEEecchhheeeeeehh--h
Confidence 4577888888888888888887654433222222 11111 47779999999999999988741111 11111 1
Q ss_pred cCeEEEEEcCCCCEEEEEECCCCeeEEEeee
Q 000425 142 QPITSVAWLPMLRLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 142 ~~VtsVafsPdg~~LaSgS~DgtIkVWDlr~ 172 (1525)
-.+ .....+++.++..||.|++...-|.+.
T Consensus 95 vk~-~a~~d~~~glIycgshd~~~yalD~~~ 124 (354)
T KOG4649|consen 95 VKV-RAQCDFDGGLIYCGSHDGNFYALDPKT 124 (354)
T ss_pred hcc-ceEEcCCCceEEEecCCCcEEEecccc
Confidence 111 234567899999999999999999874
No 450
>KOG0985 consensus Vesicle coat protein clathrin, heavy chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=48.72 E-value=86 Score=42.77 Aligned_cols=58 Identities=17% Similarity=0.182 Sum_probs=49.3
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHHhh---hhHHHHHHHHhCCHHHHHHHHHHHHHhcCChh
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVIAD---YESILDLFICHLNPSAMRRLAQRLEEEGANPE 866 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~~d---~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~ 866 (1525)
|.|||.+-|..|-+.-|.+.|-|+.| |.....+.+-+|+-|.|-|....|..+...+-
T Consensus 1107 WsqlakAQL~~~~v~dAieSyikadDps~y~eVi~~a~~~~~~edLv~yL~MaRkk~~E~~ 1167 (1666)
T KOG0985|consen 1107 WSQLAKAQLQGGLVKDAIESYIKADDPSNYLEVIDVASRTGKYEDLVKYLLMARKKVREPY 1167 (1666)
T ss_pred HHHHHHHHHhcCchHHHHHHHHhcCCcHHHHHHHHHHHhcCcHHHHHHHHHHHHHhhcCcc
Confidence 66899999999999999999999986 55666788889999999998888877765543
No 451
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=48.62 E-value=2.2e+02 Score=36.41 Aligned_cols=103 Identities=16% Similarity=0.202 Sum_probs=58.7
Q ss_pred CeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcE--EEEEe------cCCceeecCceeEEEeCCC------CEEEEEe--
Q 000425 53 PIVNLACHPRLPVLYVAYADGLIRAYNIHTYAV--HYTLQ------LDNTIKLLGAGAFAFHPTL------EWLFVGD-- 116 (1525)
Q Consensus 53 ~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~--l~tL~------H~~~V~~~ga~sVaFSPdg------~~L~SgS-- 116 (1525)
.-+.|+|.|++.+|++--..|.|++++-.+... +..+. ....+. .|+|||+- .+|...-
T Consensus 31 ~Pw~maflPDG~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLl-----glal~PdF~~~~~n~~lYvsyt~ 105 (454)
T TIGR03606 31 KPWALLWGPDNQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLL-----GLALHPDFMQEKGNPYVYISYTY 105 (454)
T ss_pred CceEEEEcCCCeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCcee-----eEEECCCccccCCCcEEEEEEec
Confidence 346789999998777765579999998655432 22221 123445 99999874 3444432
Q ss_pred --------CCCcEEEEeCCCC------CceeeEee---cccCcCeEEEEEcCCCCEEEEEECC
Q 000425 117 --------RRGTLLAWDVSIE------RPSMIGII---QVGSQPITSVAWLPMLRLLVTLCRD 162 (1525)
Q Consensus 117 --------~DgtVrVWDl~t~------~~s~i~il---~gH~~~VtsVafsPdg~~LaSgS~D 162 (1525)
....|.-|.+... ...++..+ ..|. -..+.|.|||.++++.++.
T Consensus 106 ~~~~~~~~~~~~I~R~~l~~~~~~l~~~~~Il~~lP~~~~H~--GgrI~FgPDG~LYVs~GD~ 166 (454)
T TIGR03606 106 KNGDKELPNHTKIVRYTYDKSTQTLEKPVDLLAGLPAGNDHN--GGRLVFGPDGKIYYTIGEQ 166 (454)
T ss_pred cCCCCCccCCcEEEEEEecCCCCccccceEEEecCCCCCCcC--CceEEECCCCcEEEEECCC
Confidence 1223444544311 11111111 1233 3468899999987776654
No 452
>PF05694 SBP56: 56kDa selenium binding protein (SBP56); InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=48.60 E-value=1.7e+02 Score=37.14 Aligned_cols=117 Identities=16% Similarity=0.169 Sum_probs=0.0
Q ss_pred EEEEEcCCCCEEEEEECC--------------------CcEEEEEcCCCcEEEEEe---cCCceeecCceeEEE--eCCC
Q 000425 55 VNLACHPRLPVLYVAYAD--------------------GLIRAYNIHTYAVHYTLQ---LDNTIKLLGAGAFAF--HPTL 109 (1525)
Q Consensus 55 ~sLaf~Pdg~~LaSGs~D--------------------gtIrLWDl~t~k~l~tL~---H~~~V~~~ga~sVaF--SPdg 109 (1525)
..+-|+|..+.++|...- .++.+||+++.+.++++. ...-.. .|+| .|+.
T Consensus 184 YDfw~qpr~nvMiSSeWg~P~~~~~Gf~~~d~~~~~yG~~l~vWD~~~r~~~Q~idLg~~g~~pL-----EvRflH~P~~ 258 (461)
T PF05694_consen 184 YDFWYQPRHNVMISSEWGAPSMFEKGFNPEDLEAGKYGHSLHVWDWSTRKLLQTIDLGEEGQMPL-----EVRFLHDPDA 258 (461)
T ss_dssp --EEEETTTTEEEE-B---HHHHTT---TTTHHHH-S--EEEEEETTTTEEEEEEES-TTEEEEE-----EEEE-SSTT-
T ss_pred CCeEEcCCCCEEEEeccCChhhcccCCChhHhhcccccCeEEEEECCCCcEeeEEecCCCCCceE-----EEEecCCCCc
Q ss_pred CEEEEEe-CCCcEEEEeCCCC----CceeeEeeccc----------------CcCeEEEEEcCCCCEEEEEEC-CCCeeE
Q 000425 110 EWLFVGD-RRGTLLAWDVSIE----RPSMIGIIQVG----------------SQPITSVAWLPMLRLLVTLCR-DGSLQV 167 (1525)
Q Consensus 110 ~~L~SgS-~DgtVrVWDl~t~----~~s~i~il~gH----------------~~~VtsVafsPdg~~LaSgS~-DgtIkV 167 (1525)
.+=++|+ -..+|..|--... ...++.+-..- ..-|+.|..|.|.++|..++. +|.|+.
T Consensus 259 ~~gFvg~aLss~i~~~~k~~~g~W~a~kVi~ip~~~v~~~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~Gdvrq 338 (461)
T PF05694_consen 259 NYGFVGCALSSSIWRFYKDDDGEWAAEKVIDIPAKKVEGWILPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQ 338 (461)
T ss_dssp -EEEEEEE--EEEEEEEE-ETTEEEEEEEEEE--EE--SS---GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEE
T ss_pred cceEEEEeccceEEEEEEcCCCCeeeeEEEECCCcccCcccccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEE
Q ss_pred EEeeeecCC
Q 000425 168 WKTRVIINP 176 (1525)
Q Consensus 168 WDlr~~~~p 176 (1525)
||+.-..+|
T Consensus 339 YDISDP~~P 347 (461)
T PF05694_consen 339 YDISDPFNP 347 (461)
T ss_dssp EE-SSTTS-
T ss_pred EecCCCCCC
No 453
>PLN03077 Protein ECB2; Provisional
Probab=47.91 E-value=2.1e+02 Score=38.92 Aligned_cols=113 Identities=12% Similarity=-0.007 Sum_probs=65.4
Q ss_pred HHHhhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHHHH---HHHHHHHchHHHHHHhhccCCCCccccchHHHHHHHH
Q 000425 1318 LAHLEQNQLPDALSCFDEAFLALAKDHSRGADVKAQATIC---AQYKIAVTLLQEILRLQKVQGPSAAISAKDEMARLSR 1394 (1525)
Q Consensus 1318 y~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~ic---reYilal~ll~El~R~~~l~~~~~~~~~~~R~lELaa 1394 (1525)
-.+...|++.+|+..|++....-+. ........++..| +.+..|..++..+.+.. . ..+. . --+.-++.
T Consensus 59 ~~l~~~g~~~~A~~l~~~m~~~g~~--~~~~~~~~ll~~~~~~~~~~~a~~~~~~~~~~~-~-~~~~--~--~~n~li~~ 130 (857)
T PLN03077 59 RALCSHGQLEQALKLLESMQELRVP--VDEDAYVALFRLCEWKRAVEEGSRVCSRALSSH-P-SLGV--R--LGNAMLSM 130 (857)
T ss_pred HHHHhCCCHHHHHHHHHHHHhcCCC--CChhHHHHHHHHHhhCCCHHHHHHHHHHHHHcC-C-CCCc--h--HHHHHHHH
Confidence 3567899999999999988662111 1112233444444 33555566644443332 1 1111 1 12444555
Q ss_pred hhcCCCCchhHHH---------HHHHHHHHHHHHhccHHHHHHHHHHHHccCC
Q 000425 1395 HLGSLPLQTKHRI---------NCIRTAIKRNMEVQNYAYAKQMLELLLSKAP 1438 (1525)
Q Consensus 1395 YfT~c~Lqp~H~~---------L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p 1438 (1525)
|-.+=++....+. .+..+-|....|.+++..|..+-++++..+-
T Consensus 131 ~~~~g~~~~A~~~f~~m~~~d~~~~n~li~~~~~~g~~~~A~~~f~~M~~~g~ 183 (857)
T PLN03077 131 FVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGV 183 (857)
T ss_pred HHhCCChHHHHHHHhcCCCCCeeEHHHHHHHHHhCCCHHHHHHHHHHHHHcCC
Confidence 5444334333332 3566778888999999999999999886543
No 454
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=47.65 E-value=26 Score=26.94 Aligned_cols=26 Identities=19% Similarity=0.344 Sum_probs=22.4
Q ss_pred HHHHHHHHhhcCChHHHHHHHHHHHH
Q 000425 1313 CFKTGLAHLEQNQLPDALSCFDEAFL 1338 (1525)
Q Consensus 1313 ~lk~gy~~~t~gkf~eAl~~Fr~iL~ 1338 (1525)
....|.-++..|+|++|+.+|+.++.
T Consensus 4 ~~~lg~~~~~~~~~~~A~~~~~~al~ 29 (34)
T PF07719_consen 4 WYYLGQAYYQLGNYEEAIEYFEKALE 29 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 46789999999999999999999875
No 455
>KOG0985 consensus Vesicle coat protein clathrin, heavy chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=47.64 E-value=28 Score=47.02 Aligned_cols=88 Identities=10% Similarity=0.033 Sum_probs=68.3
Q ss_pred hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHH----HHhhhhHHHHHHHHhCCHH
Q 000425 774 IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFE----VIADYESILDLFICHLNPS 849 (1525)
Q Consensus 774 l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q----~~~d~~~LllLy~~~gd~e 849 (1525)
+.+|+++-|.+.|++.. +-.-||+++.+|+..+.|.+|..|=- ++.+++.|.=.|-..|=.|
T Consensus 1231 V~LgeyQ~AVD~aRKAn--------------s~ktWK~VcfaCvd~~EFrlAQiCGL~iivhadeLeeli~~Yq~rGyFe 1296 (1666)
T KOG0985|consen 1231 VYLGEYQGAVDAARKAN--------------STKTWKEVCFACVDKEEFRLAQICGLNIIVHADELEELIEYYQDRGYFE 1296 (1666)
T ss_pred HHHHHHHHHHHHhhhcc--------------chhHHHHHHHHHhchhhhhHHHhcCceEEEehHhHHHHHHHHHhcCcHH
Confidence 56899999999998865 23459999999999999999999954 4679999999999999999
Q ss_pred HHHHHHHHHHHhcCChhHHHHHHHHH
Q 000425 850 AMRRLAQRLEEEGANPELRRYCERIL 875 (1525)
Q Consensus 850 ~L~kLa~~ae~~g~~~~~~~~c~~il 875 (1525)
.|-.|.+.+---..-+|.-|-.--||
T Consensus 1297 ElIsl~Ea~LGLERAHMgmfTELaiL 1322 (1666)
T KOG0985|consen 1297 ELISLLEAGLGLERAHMGMFTELAIL 1322 (1666)
T ss_pred HHHHHHHhhhchhHHHHHHHHHHHHH
Confidence 99999886643222334444443333
No 456
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=47.60 E-value=9.3 Score=28.61 Aligned_cols=21 Identities=38% Similarity=0.722 Sum_probs=17.2
Q ss_pred ccCCCCceecCCCCCCCccccc
Q 000425 1486 VCDLCGAKFSALSAPGCIICGM 1507 (1525)
Q Consensus 1486 ~Cp~cgA~y~~~~g~~C~iC~l 1507 (1525)
.||.||+..... .+.|+.|+-
T Consensus 1 ~Cp~CG~~~~~~-~~fC~~CG~ 21 (23)
T PF13240_consen 1 YCPNCGAEIEDD-AKFCPNCGT 21 (23)
T ss_pred CCcccCCCCCCc-CcchhhhCC
Confidence 499999998765 888999874
No 457
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=47.49 E-value=78 Score=36.23 Aligned_cols=35 Identities=17% Similarity=0.218 Sum_probs=26.6
Q ss_pred CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEE
Q 000425 52 KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYT 88 (1525)
Q Consensus 52 ~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~t 88 (1525)
.++..+. .++.++++-..+|.+++||+.+++++..
T Consensus 13 s~~~~l~--~~~~~Ll~iT~~G~l~vWnl~~~k~~~~ 47 (219)
T PF07569_consen 13 SPVSFLE--CNGSYLLAITSSGLLYVWNLKKGKAVLP 47 (219)
T ss_pred CceEEEE--eCCCEEEEEeCCCeEEEEECCCCeeccC
Confidence 4454444 4578888888999999999999876543
No 458
>PF10276 zf-CHCC: Zinc-finger domain; InterPro: IPR019401 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a short conserved zinc-finger domain. It contains the sequence motif Cx8Hx14Cx2C. ; PDB: 2JVM_A 2JRR_A 2JZ8_A.
Probab=47.49 E-value=11 Score=32.09 Aligned_cols=20 Identities=30% Similarity=0.673 Sum_probs=13.8
Q ss_pred ceeccC-CCccccCCCCceec
Q 000425 1476 LSRLST-IGYDVCDLCGAKFS 1495 (1525)
Q Consensus 1476 ltpIy~-~~~v~Cp~cgA~y~ 1495 (1525)
|-+|-+ ...+.|||||.+|.
T Consensus 20 yl~l~~~~~~~~CpYCg~~yv 40 (40)
T PF10276_consen 20 YLNLDDEPGPVVCPYCGTRYV 40 (40)
T ss_dssp EEE-TTTTCEEEETTTTEEEE
T ss_pred EEecCCCCCeEECCCCCCEEC
Confidence 334444 35699999999984
No 459
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=47.47 E-value=1.2e+02 Score=39.41 Aligned_cols=111 Identities=19% Similarity=0.240 Sum_probs=65.3
Q ss_pred EEEEcCC----CCEEEEEEeccceEEEEEee----cCCccceEEccCCC--CCe--EEEEEcCCCCEEEEEECCCcEEEE
Q 000425 11 HLALTPL----QPVVFFGFHRRMSVTVVGTV----EGGRAPTKIKTDLK--KPI--VNLACHPRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 11 ~LA~hP~----qp~Vf~s~~~d~sI~vv~ti----eggk~~tklk~~H~--~~V--~sLaf~Pdg~~LaSGs~DgtIrLW 78 (1525)
-|+|.|. -|.+++--| .+.|.||..- |.++....-..+.. -+| .++-|||....|++=..+..--++
T Consensus 61 GlsW~P~~~~~~paLLAVQH-kkhVtVWqL~~s~~e~~K~l~sQtcEi~e~~pvLpQGCVWHPk~~iL~VLT~~dvSV~~ 139 (671)
T PF15390_consen 61 GLSWAPPCTADTPALLAVQH-KKHVTVWQLCPSTTERNKLLMSQTCEIREPFPVLPQGCVWHPKKAILTVLTARDVSVLP 139 (671)
T ss_pred eeeecCcccCCCCceEEEec-cceEEEEEeccCccccccceeeeeeeccCCcccCCCcccccCCCceEEEEecCceeEee
Confidence 3667776 566666666 5778888753 22222211111121 222 347899999987764444333455
Q ss_pred EcCCC--cEEEEEecCCceeecCceeEEEeCCCCEEEEEe-CCCcEEEEeCC
Q 000425 79 NIHTY--AVHYTLQLDNTIKLLGAGAFAFHPTLEWLFVGD-RRGTLLAWDVS 127 (1525)
Q Consensus 79 Dl~t~--k~l~tL~H~~~V~~~ga~sVaFSPdg~~L~SgS-~DgtVrVWDl~ 127 (1525)
+++.. .....++..+-|. |-+|.+||+.|+.+- ..=.-.|||-.
T Consensus 140 sV~~d~srVkaDi~~~G~Ih-----CACWT~DG~RLVVAvGSsLHSyiWd~~ 186 (671)
T PF15390_consen 140 SVHCDSSRVKADIKTSGLIH-----CACWTKDGQRLVVAVGSSLHSYIWDSA 186 (671)
T ss_pred eeeeCCceEEEeccCCceEE-----EEEecCcCCEEEEEeCCeEEEEEecCc
Confidence 65533 3333344566788 999999998776543 44456789865
No 460
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=47.39 E-value=3.9e+02 Score=33.51 Aligned_cols=150 Identities=11% Similarity=0.119 Sum_probs=73.3
Q ss_pred EEcCCCCEEEEEEeccceEEEEEeecCCccceEEccCCC-CCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe-
Q 000425 13 ALTPLQPVVFFGFHRRMSVTVVGTVEGGRAPTKIKTDLK-KPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ- 90 (1525)
Q Consensus 13 A~hP~qp~Vf~s~~~d~sI~vv~tieggk~~tklk~~H~-~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~- 90 (1525)
+|++++..++|+++.+..-.++...-.....+.+.. +. ....+..++|+++.++--.....++-.|+.+.+....+.
T Consensus 42 ~ft~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTd-g~g~~~~g~~~s~~~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~ 120 (386)
T PF14583_consen 42 CFTDDGRKLLFASDFDGNRNLYLLDLATGEITQLTD-GPGDNTFGGFLSPDDRALYYVKNGRSLRRVDLDTLEERVVYEV 120 (386)
T ss_dssp -B-TTS-EEEEEE-TTSS-EEEEEETTT-EEEE----SS-B-TTT-EE-TTSSEEEEEETTTEEEEEETTT--EEEEEE-
T ss_pred CcCCCCCEEEEEeccCCCcceEEEEcccCEEEECcc-CCCCCccceEEecCCCeEEEEECCCeEEEEECCcCcEEEEEEC
Confidence 567888888888876555555443222223333322 22 112245677888877665566788888999987766666
Q ss_pred cCCceeecCceeEEE--eCCCCEEEEEe---CC-------------------CcEEEEeCCCCCceeeEeecccCcCeEE
Q 000425 91 LDNTIKLLGAGAFAF--HPTLEWLFVGD---RR-------------------GTLLAWDVSIERPSMIGIIQVGSQPITS 146 (1525)
Q Consensus 91 H~~~V~~~ga~sVaF--SPdg~~L~SgS---~D-------------------gtVrVWDl~t~~~s~i~il~gH~~~Vts 146 (1525)
....+- ...| +.|+..++-.- .| ..|..-|+.+.....+ +. -+..+..
T Consensus 121 p~~~~g-----~gt~v~n~d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v--~~-~~~wlgH 192 (386)
T PF14583_consen 121 PDDWKG-----YGTWVANSDCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVV--FE-DTDWLGH 192 (386)
T ss_dssp -TTEEE-----EEEEEE-TTSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEE--EE-ESS-EEE
T ss_pred Cccccc-----ccceeeCCCccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEE--Ee-cCccccC
Confidence 555554 4455 34566544321 11 1122224443222222 22 2456678
Q ss_pred EEEcCCCCEEEEEECCC-----CeeEEEee
Q 000425 147 VAWLPMLRLLVTLCRDG-----SLQVWKTR 171 (1525)
Q Consensus 147 VafsPdg~~LaSgS~Dg-----tIkVWDlr 171 (1525)
+.|+|..+-+++=|-.| .-|||-++
T Consensus 193 ~~fsP~dp~li~fCHEGpw~~Vd~RiW~i~ 222 (386)
T PF14583_consen 193 VQFSPTDPTLIMFCHEGPWDLVDQRIWTIN 222 (386)
T ss_dssp EEEETTEEEEEEEEE-S-TTTSS-SEEEEE
T ss_pred cccCCCCCCEEEEeccCCcceeceEEEEEE
Confidence 99999777666544333 34889877
No 461
>PRK13684 Ycf48-like protein; Provisional
Probab=47.15 E-value=5.8e+02 Score=30.97 Aligned_cols=124 Identities=16% Similarity=0.134 Sum_probs=67.5
Q ss_pred EEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECCCcEEE-EEcCCCcEEEEEe--cCCceeecCceeEEEeCCC
Q 000425 33 VVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYADGLIRA-YNIHTYAVHYTLQ--LDNTIKLLGAGAFAFHPTL 109 (1525)
Q Consensus 33 vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~DgtIrL-WDl~t~k~l~tL~--H~~~V~~~ga~sVaFSPdg 109 (1525)
++-+.++++.=..+..+-...+.++.+.|++.+++++. .|.+.. +| ..++....+. ....+. ++.+.+++
T Consensus 154 i~~S~DgG~tW~~~~~~~~g~~~~i~~~~~g~~v~~g~-~G~i~~s~~-~gg~tW~~~~~~~~~~l~-----~i~~~~~g 226 (334)
T PRK13684 154 IYRTTDGGKNWEALVEDAAGVVRNLRRSPDGKYVAVSS-RGNFYSTWE-PGQTAWTPHQRNSSRRLQ-----SMGFQPDG 226 (334)
T ss_pred EEEECCCCCCceeCcCCCcceEEEEEECCCCeEEEEeC-CceEEEEcC-CCCCeEEEeeCCCcccce-----eeeEcCCC
Confidence 34444555444444443456789999999887776664 454432 22 1223333333 346677 89999999
Q ss_pred CEEEEEeCCCcEEEEeCCCC-CceeeEee-cccCcCeEEEEEcCCCCEEEEEECCCCe
Q 000425 110 EWLFVGDRRGTLLAWDVSIE-RPSMIGII-QVGSQPITSVAWLPMLRLLVTLCRDGSL 165 (1525)
Q Consensus 110 ~~L~SgS~DgtVrVWDl~t~-~~s~i~il-~gH~~~VtsVafsPdg~~LaSgS~DgtI 165 (1525)
+.++++ ..|.+++=..... .-+..... ..-...+.++.|+|++..++. +.+|.|
T Consensus 227 ~~~~vg-~~G~~~~~s~d~G~sW~~~~~~~~~~~~~l~~v~~~~~~~~~~~-G~~G~v 282 (334)
T PRK13684 227 NLWMLA-RGGQIRFNDPDDLESWSKPIIPEITNGYGYLDLAYRTPGEIWAG-GGNGTL 282 (334)
T ss_pred CEEEEe-cCCEEEEccCCCCCccccccCCccccccceeeEEEcCCCCEEEE-cCCCeE
Confidence 876665 4576653212211 11111000 001235778999998876655 456655
No 462
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=47.15 E-value=2.1e+02 Score=38.49 Aligned_cols=69 Identities=22% Similarity=0.355 Sum_probs=47.3
Q ss_pred eEEEeCCCCEEEEEeCCCcEEEEeCCC----------CCceee--E-------eecccCcCeEEEEEcCC---CCEEEEE
Q 000425 102 AFAFHPTLEWLFVGDRRGTLLAWDVSI----------ERPSMI--G-------IIQVGSQPITSVAWLPM---LRLLVTL 159 (1525)
Q Consensus 102 sVaFSPdg~~L~SgS~DgtVrVWDl~t----------~~~s~i--~-------il~gH~~~VtsVafsPd---g~~LaSg 159 (1525)
.|..+|+|.+++..|..+.+-+ .+-. ....+. . ....+...|..+.|||. +..|+.-
T Consensus 89 ~i~~n~~g~~lal~G~~~v~V~-~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~~~l~vL 167 (717)
T PF10168_consen 89 QISLNPTGSLLALVGPRGVVVL-ELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESDSHLVVL 167 (717)
T ss_pred EEEECCCCCEEEEEcCCcEEEE-EeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCCCeEEEE
Confidence 7888999999998887654433 3211 001111 0 11244567889999996 4788888
Q ss_pred ECCCCeeEEEee
Q 000425 160 CRDGSLQVWKTR 171 (1525)
Q Consensus 160 S~DgtIkVWDlr 171 (1525)
+.|+++|+||+.
T Consensus 168 tsdn~lR~y~~~ 179 (717)
T PF10168_consen 168 TSDNTLRLYDIS 179 (717)
T ss_pred ecCCEEEEEecC
Confidence 999999999985
No 463
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=46.94 E-value=26 Score=29.43 Aligned_cols=25 Identities=24% Similarity=0.236 Sum_probs=22.6
Q ss_pred HHHHHHHHHHcCChHHHHHHHHHHh
Q 000425 809 FRQLGYACIKYGQFDSAKETFEVIA 833 (1525)
Q Consensus 809 wk~Lg~~AL~~G~~~lAeec~q~~~ 833 (1525)
|-.||+.-++.|+++-|+++|+++=
T Consensus 4 ~~~la~~~~~~G~~~~A~~~~~~~l 28 (44)
T PF13428_consen 4 WLALARAYRRLGQPDEAERLLRRAL 28 (44)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 5679999999999999999999863
No 464
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=46.56 E-value=1.6e+02 Score=31.47 Aligned_cols=57 Identities=14% Similarity=0.028 Sum_probs=38.5
Q ss_pred HHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh
Q 000425 772 YAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD 834 (1525)
Q Consensus 772 LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d 834 (1525)
+..+.|+++.|+..++..-. ..++++.... -|..||......|+++.|.++|+++-.
T Consensus 44 ~~~~~g~~~~A~~~~~~al~---l~~~~~~~~~---~~~~lg~~~~~~g~~~eA~~~~~~Al~ 100 (168)
T CHL00033 44 SAQSEGEYAEALQNYYEAMR---LEIDPYDRSY---ILYNIGLIHTSNGEHTKALEYYFQALE 100 (168)
T ss_pred HHHHcCCHHHHHHHHHHHHh---ccccchhhHH---HHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 45566888888887766421 0111111112 356799999999999999999999754
No 465
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=46.12 E-value=1.1e+02 Score=28.30 Aligned_cols=56 Identities=18% Similarity=0.113 Sum_probs=40.7
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHHHHch
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKIAVTL 1366 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYilal~l 1366 (1525)
...++.|.+.=+.|++++|+..+++.+. ..+....+...-..+-..+.+|+-=..-
T Consensus 6 ~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~~~~~~~~~~~~~~l~~k~~~yl~RAE~ 64 (69)
T PF04212_consen 6 IELIKKAVEADEAGNYEEALELYKEAIEYLMQALKSESNPERRQALRQKMKEYLERAEK 64 (69)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHHHH
Confidence 5678899999999999999999999999 3333333333334488888888764433
No 466
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=46.06 E-value=1e+02 Score=35.90 Aligned_cols=139 Identities=21% Similarity=0.200 Sum_probs=87.1
Q ss_pred HHHHHHHHhccCCCCCChhHHHHHHcC-C-----CCcccchhhcccc-CCCCCcchhhhHHhhcCCHHHHHHHHHHhhcc
Q 000425 720 SEILYQITSRFDSLRITPRSLDILAKG-P-----PVCGDLAVSLSQA-GPQFTQVLRGIYAIKALRFSTALSVLKDEFLR 792 (1525)
Q Consensus 720 r~~l~~~~~~~~s~~i~~~~l~~L~~~-~-----~~~~elAl~ls~~-~~~~d~~~rF~LAl~lg~l~~A~~ia~~~~~~ 792 (1525)
+..|+++++.-|+..-....+..++++ | ....+-||++..+ +-..+.= .-.=-+.|++++|+.-.+..- .
T Consensus 55 ~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNY--G~FLC~qg~~~eA~q~F~~Al-~ 131 (250)
T COG3063 55 KKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNY--GAFLCAQGRPEEAMQQFERAL-A 131 (250)
T ss_pred HHHHHHHHHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCccchhhhh--hHHHHhCCChHHHHHHHHHHH-h
Confidence 345666666556655555555555542 2 1233445555541 1001110 011123467888887776542 2
Q ss_pred cCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhHHHHHH
Q 000425 793 SRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSAMRRLAQRLEEEGANPELRRYCE 872 (1525)
Q Consensus 793 ~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~~~~c~ 872 (1525)
.+.|+-++ +.|+ -+|--||++|+++.|++-|+++ |+.-..+...+.+|++.--++|.+..++-..+
T Consensus 132 ~P~Y~~~s---~t~e---N~G~Cal~~gq~~~A~~~l~ra--------L~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~ 197 (250)
T COG3063 132 DPAYGEPS---DTLE---NLGLCALKAGQFDQAEEYLKRA--------LELDPQFPPALLELARLHYKAGDYAPARLYLE 197 (250)
T ss_pred CCCCCCcc---hhhh---hhHHHHhhcCCchhHHHHHHHH--------HHhCcCCChHHHHHHHHHHhcccchHHHHHHH
Confidence 45665443 3444 5999999999999999999998 45567888899999998888998888876666
Q ss_pred HHH
Q 000425 873 RIL 875 (1525)
Q Consensus 873 ~il 875 (1525)
++.
T Consensus 198 ~~~ 200 (250)
T COG3063 198 RYQ 200 (250)
T ss_pred HHH
Confidence 644
No 467
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=45.56 E-value=2.4e+02 Score=37.09 Aligned_cols=121 Identities=14% Similarity=0.147 Sum_probs=68.8
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHH----HHHHHHHHHchHHHHHHhhccCCCCcccc--
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFLALAKDHSRGADVKAQAT----ICAQYKIAVTLLQEILRLQKVQGPSAAIS-- 1384 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~----icreYilal~ll~El~R~~~l~~~~~~~~-- 1384 (1525)
...+..|..++..|++.+|+..|++++...-.+. ++.-.+. .-.+|--|+..|++..+.. .++.+..
T Consensus 23 ~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~p~~~----~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~---~~~~~~~~~ 95 (899)
T TIGR02917 23 ESLIEAAKSYLQKNKYKAAIIQLKNALQKDPNDA----EARFLLGKIYLALGDYAAAEKELRKALSLG---YPKNQVLPL 95 (899)
T ss_pred HHHHHHHHHHHHcCChHhHHHHHHHHHHhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHHcC---CChhhhHHH
Confidence 3478899999999999999999999987211111 1111111 1234556666655554433 1111000
Q ss_pred ------chHHHHHHHHhhcCCCCc-hhHHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC
Q 000425 1385 ------AKDEMARLSRHLGSLPLQ-TKHRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP 1438 (1525)
Q Consensus 1385 ------~~~R~lELaaYfT~c~Lq-p~H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p 1438 (1525)
...+.-+...+|+...+. +....-++...-...++.++|..|..+.+++++..|
T Consensus 96 ~a~~~~~~g~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~a~~~~~ 156 (899)
T TIGR02917 96 LARAYLLQGKFQQVLDELPGKTLLDDEGAAELLALRGLAYLGLGQLELAQKSYEQALAIDP 156 (899)
T ss_pred HHHHHHHCCCHHHHHHhhcccccCCchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCC
Confidence 000112344566666533 333333344444455778899999999999888877
No 468
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=45.15 E-value=1.1e+02 Score=38.42 Aligned_cols=41 Identities=12% Similarity=0.010 Sum_probs=34.3
Q ss_pred CCCCeEEEEEcCCCCEEEEEECCCcEEEEEcCCCcEEEEEe
Q 000425 50 LKKPIVNLACHPRLPVLYVAYADGLIRAYNIHTYAVHYTLQ 90 (1525)
Q Consensus 50 H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWDl~t~k~l~tL~ 90 (1525)
....+.+|..+|++.+.|+...=|.|-++|+.++..++..+
T Consensus 306 ~~R~~~~i~~sP~~~laA~tDslGRV~LiD~~~~~vvrmWK 346 (415)
T PF14655_consen 306 SKREGESICLSPSGRLAAVTDSLGRVLLIDVARGIVVRMWK 346 (415)
T ss_pred CCceEEEEEECCCCCEEEEEcCCCcEEEEECCCChhhhhhc
Confidence 44568889999999988888777999999999887776666
No 469
>PF14870 PSII_BNR: Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=44.52 E-value=6.5e+02 Score=30.48 Aligned_cols=126 Identities=15% Similarity=0.172 Sum_probs=61.1
Q ss_pred EEEeecCCccceEEcc--CCCCCeEEEEEcCCCCEEEEEECCCcEEEEE-cCCCcEEEEEe--cCCceeecCceeEEEeC
Q 000425 33 VVGTVEGGRAPTKIKT--DLKKPIVNLACHPRLPVLYVAYADGLIRAYN-IHTYAVHYTLQ--LDNTIKLLGAGAFAFHP 107 (1525)
Q Consensus 33 vv~tieggk~~tklk~--~H~~~V~sLaf~Pdg~~LaSGs~DgtIrLWD-l~t~k~l~tL~--H~~~V~~~ga~sVaFSP 107 (1525)
++.+.+++..=..+.. ........+....++..+++ ...|. ||- ...++....+. ....++ .+..++
T Consensus 83 ll~T~DgG~tW~~v~l~~~lpgs~~~i~~l~~~~~~l~-~~~G~--iy~T~DgG~tW~~~~~~~~gs~~-----~~~r~~ 154 (302)
T PF14870_consen 83 LLHTTDGGKTWERVPLSSKLPGSPFGITALGDGSAELA-GDRGA--IYRTTDGGKTWQAVVSETSGSIN-----DITRSS 154 (302)
T ss_dssp EEEESSTTSS-EE----TT-SS-EEEEEEEETTEEEEE-ETT----EEEESSTTSSEEEEE-S----EE-----EEEE-T
T ss_pred EEEecCCCCCcEEeecCCCCCCCeeEEEEcCCCcEEEE-cCCCc--EEEeCCCCCCeeEcccCCcceeE-----eEEECC
Confidence 4455566554444432 12233333443333333333 34444 343 33344444433 446677 888899
Q ss_pred CCCEEEEEeCCCcEEEEeCCCCCceeeEeecccCcCeEEEEEcCCCCEEEEEECCCCeeEEE
Q 000425 108 TLEWLFVGDRRGTLLAWDVSIERPSMIGIIQVGSQPITSVAWLPMLRLLVTLCRDGSLQVWK 169 (1525)
Q Consensus 108 dg~~L~SgS~DgtVrVWDl~t~~~s~i~il~gH~~~VtsVafsPdg~~LaSgS~DgtIkVWD 169 (1525)
||.++++++.-....-||--. ......-..-..+|..+.|.|++.+.+.+ ..+.|+.=|
T Consensus 155 dG~~vavs~~G~~~~s~~~G~--~~w~~~~r~~~~riq~~gf~~~~~lw~~~-~Gg~~~~s~ 213 (302)
T PF14870_consen 155 DGRYVAVSSRGNFYSSWDPGQ--TTWQPHNRNSSRRIQSMGFSPDGNLWMLA-RGGQIQFSD 213 (302)
T ss_dssp TS-EEEEETTSSEEEEE-TT---SS-EEEE--SSS-EEEEEE-TTS-EEEEE-TTTEEEEEE
T ss_pred CCcEEEEECcccEEEEecCCC--ccceEEccCccceehhceecCCCCEEEEe-CCcEEEEcc
Confidence 999998888777778888652 11111111235789999999998876654 778777665
No 470
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=44.48 E-value=2.1e+02 Score=38.68 Aligned_cols=102 Identities=11% Similarity=-0.072 Sum_probs=75.1
Q ss_pred hhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhh--------hHHH
Q 000425 768 LRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIADY--------ESIL 839 (1525)
Q Consensus 768 ~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~--------~~Ll 839 (1525)
.+..+.+..|+++.|.+.+++.-... |....+ |-.||...+..|+++.|+++|+++-.. -.+-
T Consensus 364 ~~a~~l~~~g~~~eA~~~l~~al~~~------P~n~~l---~~~lA~l~~~~g~~~~A~~~l~~al~l~Pd~~~l~~~~a 434 (765)
T PRK10049 364 LLSQVAKYSNDLPQAEMRARELAYNA------PGNQGL---RIDYASVLQARGWPRAAENELKKAEVLEPRNINLEVEQA 434 (765)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhC------CCCHHH---HHHHHHHHHhcCCHHHHHHHHHHHHhhCCCChHHHHHHH
Confidence 35567788999999999998764222 122234 557999999999999999999998754 1455
Q ss_pred HHHHHhCCHHHHHHHHHHHHHhcCChhHHHHHHHHHhhc
Q 000425 840 DLFICHLNPSAMRRLAQRLEEEGANPELRRYCERILRVR 878 (1525)
Q Consensus 840 lLy~~~gd~e~L~kLa~~ae~~g~~~~~~~~c~~il~~~ 878 (1525)
.++...|+.+...++.+.+.+.--++-....-.|.+..+
T Consensus 435 ~~al~~~~~~~A~~~~~~ll~~~Pd~~~~~~~~~~~~~~ 473 (765)
T PRK10049 435 WTALDLQEWRQMDVLTDDVVAREPQDPGVQRLARARDVH 473 (765)
T ss_pred HHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHhc
Confidence 578889999999999998887665555544455555433
No 471
>PLN03218 maturation of RBCL 1; Provisional
Probab=43.80 E-value=1.5e+02 Score=41.75 Aligned_cols=88 Identities=16% Similarity=0.150 Sum_probs=51.8
Q ss_pred HHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh---------hhhHHHHHH
Q 000425 772 YAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA---------DYESILDLF 842 (1525)
Q Consensus 772 LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~---------d~~~LllLy 842 (1525)
...+.|+++.|+++.++... ....|+... |..|.+.+.+.|+++.|++.|+.+. -|..|+-.|
T Consensus 658 a~~k~G~~eeA~~l~~eM~k-----~G~~pd~~t---ynsLI~ay~k~G~~eeA~~lf~eM~~~g~~PdvvtyN~LI~gy 729 (1060)
T PLN03218 658 VAGHAGDLDKAFEILQDARK-----QGIKLGTVS---YSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITAL 729 (1060)
T ss_pred HHHhCCCHHHHHHHHHHHHH-----cCCCCCHHH---HHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHH
Confidence 34456666666666655321 112233233 4457777777777777777776653 256666777
Q ss_pred HHhCCHHHHHHHHHHHHHhcCChhH
Q 000425 843 ICHLNPSAMRRLAQRLEEEGANPEL 867 (1525)
Q Consensus 843 ~~~gd~e~L~kLa~~ae~~g~~~~~ 867 (1525)
...|+.+.-.++.+...+.|..+..
T Consensus 730 ~k~G~~eeAlelf~eM~~~Gi~Pd~ 754 (1060)
T PLN03218 730 CEGNQLPKALEVLSEMKRLGLCPNT 754 (1060)
T ss_pred HHCCCHHHHHHHHHHHHHcCCCCCH
Confidence 7777777666666666666655443
No 472
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=43.62 E-value=12 Score=33.18 Aligned_cols=27 Identities=15% Similarity=0.318 Sum_probs=19.8
Q ss_pred eeccceeccCCCccccCCCCceecCCC
Q 000425 1472 CAATLSRLSTIGYDVCDLCGAKFSALS 1498 (1525)
Q Consensus 1472 Ca~tltpIy~~~~v~Cp~cgA~y~~~~ 1498 (1525)
|...++-+...+.+.|||||++-.-+.
T Consensus 12 Cg~~~~~~~~~~~irCp~Cg~rIl~K~ 38 (49)
T COG1996 12 CGREVELDQETRGIRCPYCGSRILVKE 38 (49)
T ss_pred cCCeeehhhccCceeCCCCCcEEEEec
Confidence 666666566688899999998876443
No 473
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=43.50 E-value=2.2e+02 Score=35.27 Aligned_cols=74 Identities=22% Similarity=0.316 Sum_probs=50.1
Q ss_pred hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh-------hhhHHHHHHHHhC
Q 000425 774 IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA-------DYESILDLFICHL 846 (1525)
Q Consensus 774 l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~-------d~~~LllLy~~~g 846 (1525)
+..++.+.|++.+++--.+.++-| .. .-.+|+.|+++++++.|++.|+++- ++-.|..+|...|
T Consensus 305 l~~~~~~~al~~~e~~lk~~P~~~------~l---~l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~~~~La~~~~~~g 375 (398)
T PRK10747 305 LKTNNPEQLEKVLRQQIKQHGDTP------LL---WSTLGQLLMKHGEWQEASLAFRAALKQRPDAYDYAWLADALDRLH 375 (398)
T ss_pred ccCCChHHHHHHHHHHHhhCCCCH------HH---HHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHcC
Confidence 455888999988876542222221 22 3479999999999999999999983 4445556666666
Q ss_pred CHHHHHHHHH
Q 000425 847 NPSAMRRLAQ 856 (1525)
Q Consensus 847 d~e~L~kLa~ 856 (1525)
+.++=..+-+
T Consensus 376 ~~~~A~~~~~ 385 (398)
T PRK10747 376 KPEEAAAMRR 385 (398)
T ss_pred CHHHHHHHHH
Confidence 6555544444
No 474
>PLN03218 maturation of RBCL 1; Provisional
Probab=42.97 E-value=1.2e+02 Score=42.63 Aligned_cols=112 Identities=21% Similarity=0.124 Sum_probs=82.0
Q ss_pred CCcccchhhcccc----CCCCCcc---hhhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcC
Q 000425 748 PVCGDLAVSLSQA----GPQFTQV---LRGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYG 820 (1525)
Q Consensus 748 ~~~~elAl~ls~~----~~~~d~~---~rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G 820 (1525)
.|..+.|+++-.. +-..|.. .-.+.-.++|+++.|.++.++... ....|+... |-.|-+.+.+.|
T Consensus 662 ~G~~eeA~~l~~eM~k~G~~pd~~tynsLI~ay~k~G~~eeA~~lf~eM~~-----~g~~Pdvvt---yN~LI~gy~k~G 733 (1060)
T PLN03218 662 AGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKS-----IKLRPTVST---MNALITALCEGN 733 (1060)
T ss_pred CCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCCCHHHHHHHHHHHHH-----cCCCCCHHH---HHHHHHHHHHCC
Confidence 3467777776652 1011111 112334578999999999887531 122344444 667999999999
Q ss_pred ChHHHHHHHHHHhh---------hhHHHHHHHHhCCHHHHHHHHHHHHHhcCChhH
Q 000425 821 QFDSAKETFEVIAD---------YESILDLFICHLNPSAMRRLAQRLEEEGANPEL 867 (1525)
Q Consensus 821 ~~~lAeec~q~~~d---------~~~LllLy~~~gd~e~L~kLa~~ae~~g~~~~~ 867 (1525)
+++.|.+.|+...+ |..|+..|...|+.+.-.++.+.+.+.|..+..
T Consensus 734 ~~eeAlelf~eM~~~Gi~Pd~~Ty~sLL~a~~k~G~le~A~~l~~~M~k~Gi~pd~ 789 (1060)
T PLN03218 734 QLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNL 789 (1060)
T ss_pred CHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCCH
Confidence 99999999998763 788999999999999999999999999987765
No 475
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=42.52 E-value=35 Score=26.56 Aligned_cols=26 Identities=19% Similarity=0.355 Sum_probs=22.5
Q ss_pred HHHHHHHHhhcCChHHHHHHHHHHHH
Q 000425 1313 CFKTGLAHLEQNQLPDALSCFDEAFL 1338 (1525)
Q Consensus 1313 ~lk~gy~~~t~gkf~eAl~~Fr~iL~ 1338 (1525)
....|.-++..|++++|+.+|+++|.
T Consensus 4 ~~~~g~~~~~~~~~~~A~~~~~~al~ 29 (34)
T PF00515_consen 4 YYNLGNAYFQLGDYEEALEYYQRALE 29 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCchHHHHHHHHHHH
Confidence 46789999999999999999999986
No 476
>PF00301 Rubredoxin: Rubredoxin; InterPro: IPR004039 Rubredoxin is a low molecular weight iron-containing bacterial protein involved in electron transfer [, ], sometimes replacing ferredoxin as an electron carrier []. The 3-D structures of a number of rubredoxins have been solved [, ]. The fold belongs to the alpha+beta class, with 2 alpha-helices and 2-3 beta-strands. Its active site contains an iron ion which is co-ordinated by the sulphurs of four conserved cysteine residues forming an almost regular tetrahedron. The conserved cysteines reside on two loops, which are the most conserved regions of the protein. In addition, a ring of acidic residues in the proximity of the [Fe(Cys)4] centre is also well-conserved []. ; GO: 0009055 electron carrier activity, 0046872 metal ion binding; PDB: 2RDV_C 1RDV_A 1S24_A 1T9O_B 1B2J_A 1SMW_A 2PVE_B 1BFY_A 1T9P_C 1C09_C ....
Probab=42.34 E-value=14 Score=32.55 Aligned_cols=24 Identities=25% Similarity=0.735 Sum_probs=18.5
Q ss_pred ccCCCCceecCCCCC------------------CCcccccee
Q 000425 1486 VCDLCGAKFSALSAP------------------GCIICGMGS 1509 (1525)
Q Consensus 1486 ~Cp~cgA~y~~~~g~------------------~C~iC~l~~ 1509 (1525)
+|+.||-.|.+..|- .|++|+.++
T Consensus 3 ~C~~CgyvYd~~~Gd~~~~i~pGt~F~~Lp~~w~CP~C~a~K 44 (47)
T PF00301_consen 3 QCPVCGYVYDPEKGDPENGIPPGTPFEDLPDDWVCPVCGAPK 44 (47)
T ss_dssp EETTTSBEEETTTBBGGGTB-TT--GGGS-TT-B-TTTSSBG
T ss_pred CCCCCCEEEcCCcCCcccCcCCCCCHHHCCCCCcCcCCCCcc
Confidence 699999999987642 899998764
No 477
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=42.30 E-value=40 Score=30.24 Aligned_cols=57 Identities=21% Similarity=0.345 Sum_probs=39.4
Q ss_pred hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcC-ChHHHHHHHHHHhh
Q 000425 769 RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYG-QFDSAKETFEVIAD 834 (1525)
Q Consensus 769 rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G-~~~lAeec~q~~~d 834 (1525)
+-....+.|+++.|++..++.-... | ....=|-.+|...+..| +++.|.++|+++=.
T Consensus 9 ~g~~~~~~~~~~~A~~~~~~ai~~~------p---~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~ 66 (69)
T PF13414_consen 9 LGQIYFQQGDYEEAIEYFEKAIELD------P---NNAEAYYNLGLAYMKLGKDYEEAIEDFEKALK 66 (69)
T ss_dssp HHHHHHHTTHHHHHHHHHHHHHHHS------T---THHHHHHHHHHHHHHTTTHHHHHHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHcC------C---CCHHHHHHHHHHHHHhCccHHHHHHHHHHHHH
Confidence 3345667777787777776543211 1 12334667999999999 79999999998743
No 478
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=42.23 E-value=14 Score=37.94 Aligned_cols=31 Identities=19% Similarity=0.426 Sum_probs=26.0
Q ss_pred CCceeeeccceeccC--CCccccCCCCceecCC
Q 000425 1467 DPSQFCAATLSRLST--IGYDVCDLCGAKFSAL 1497 (1525)
Q Consensus 1467 ~pf~iCa~tltpIy~--~~~v~Cp~cgA~y~~~ 1497 (1525)
....+|..+=+.-|. +.-+.|||||+.|.+.
T Consensus 7 GtKR~Cp~CG~kFYDLnk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 7 GTKRTCPSCGAKFYDLNKDPIVCPKCGTEFPPE 39 (108)
T ss_pred CCcccCCCCcchhccCCCCCccCCCCCCccCcc
Confidence 556788888888887 6678899999999987
No 479
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=41.93 E-value=14 Score=38.66 Aligned_cols=32 Identities=19% Similarity=0.273 Sum_probs=28.3
Q ss_pred CCceeeeccceeccC--CCccccCCCCceecCCC
Q 000425 1467 DPSQFCAATLSRLST--IGYDVCDLCGAKFSALS 1498 (1525)
Q Consensus 1467 ~pf~iCa~tltpIy~--~~~v~Cp~cgA~y~~~~ 1498 (1525)
....+|..+=+..|. ..-+.|||||+.|.+.+
T Consensus 7 GtKr~Cp~cg~kFYDLnk~p~vcP~cg~~~~~~~ 40 (129)
T TIGR02300 7 GTKRICPNTGSKFYDLNRRPAVSPYTGEQFPPEE 40 (129)
T ss_pred CccccCCCcCccccccCCCCccCCCcCCccCcch
Confidence 556899999999998 88999999999999885
No 480
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=41.51 E-value=18 Score=29.54 Aligned_cols=24 Identities=33% Similarity=0.695 Sum_probs=18.3
Q ss_pred cccCCCCceecCCC-CCCCccccce
Q 000425 1485 DVCDLCGAKFSALS-APGCIICGMG 1508 (1525)
Q Consensus 1485 v~Cp~cgA~y~~~~-g~~C~iC~l~ 1508 (1525)
-+|+.||-.|..+. -..|++|+..
T Consensus 3 ~~C~~CG~i~~g~~~p~~CP~Cg~~ 27 (34)
T cd00729 3 WVCPVCGYIHEGEEAPEKCPICGAP 27 (34)
T ss_pred EECCCCCCEeECCcCCCcCcCCCCc
Confidence 36888998887665 6789988864
No 481
>PF10313 DUF2415: Uncharacterised protein domain (DUF2415); InterPro: IPR019417 This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif.
Probab=41.47 E-value=49 Score=28.76 Aligned_cols=30 Identities=20% Similarity=0.289 Sum_probs=24.8
Q ss_pred CeEEEEEcCCC---CEEEEEECCCCeeEEEeee
Q 000425 143 PITSVAWLPML---RLLVTLCRDGSLQVWKTRV 172 (1525)
Q Consensus 143 ~VtsVafsPdg---~~LaSgS~DgtIkVWDlr~ 172 (1525)
.|+++.|+|.. .+|+-+-.-+.|.|+|+|.
T Consensus 2 AvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~ 34 (43)
T PF10313_consen 2 AVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRS 34 (43)
T ss_pred CeEEEEeCCCCCcccEEEEEccCCeEEEEEccc
Confidence 68999999853 4788877778999999984
No 482
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=41.45 E-value=65 Score=34.04 Aligned_cols=61 Identities=15% Similarity=0.147 Sum_probs=44.8
Q ss_pred CcchhhhHH---hhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhh
Q 000425 765 TQVLRGIYA---IKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIAD 834 (1525)
Q Consensus 765 d~~~rF~LA---l~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d 834 (1525)
+.+..+.+| .++|+++.|.+.....-...++. ... |-.+|...+..|+++.|.++|+++=.
T Consensus 57 ~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~------~~a---~~~lg~~l~~~g~~~eAi~~~~~Al~ 120 (144)
T PRK15359 57 SWRAHIALAGTWMMLKEYTTAINFYGHALMLDASH------PEP---VYQTGVCLKMMGEPGLAREAFQTAIK 120 (144)
T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCC------cHH---HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 555555555 68899999999987765322222 234 44699999999999999999998743
No 483
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=41.23 E-value=1.4e+02 Score=36.32 Aligned_cols=77 Identities=22% Similarity=0.319 Sum_probs=51.8
Q ss_pred hcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh----hh-----hHHHHHHHHh
Q 000425 775 KALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA----DY-----ESILDLFICH 845 (1525)
Q Consensus 775 ~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~----d~-----~~LllLy~~~ 845 (1525)
...+++.|.+.++..- -.-|+|-..+- .||+.+|.+|+++-|.++++.+- || +.|.--|--.
T Consensus 192 ~~~~~d~A~~~l~kAl---qa~~~cvRAsi------~lG~v~~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~l 262 (389)
T COG2956 192 ASSDVDRARELLKKAL---QADKKCVRASI------ILGRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQL 262 (389)
T ss_pred hhhhHHHHHHHHHHHH---hhCccceehhh------hhhHHHHhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHh
Confidence 4467777888777653 23355544433 49999999999999999999876 22 3344456667
Q ss_pred CCHHHHHHHHHHHHH
Q 000425 846 LNPSAMRRLAQRLEE 860 (1525)
Q Consensus 846 gd~e~L~kLa~~ae~ 860 (1525)
|+.+...--...+-+
T Consensus 263 g~~~~~~~fL~~~~~ 277 (389)
T COG2956 263 GKPAEGLNFLRRAME 277 (389)
T ss_pred CCHHHHHHHHHHHHH
Confidence 877766555555544
No 484
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=41.07 E-value=95 Score=29.84 Aligned_cols=52 Identities=13% Similarity=0.097 Sum_probs=40.0
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHH
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKI 1362 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYil 1362 (1525)
...++.|...=++|++++|+.++++.|. ..+..+.+...-.-+-..+.+|+-
T Consensus 7 ~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~~~~~~k~e~~~~~k~~ir~K~~eYl~ 61 (75)
T cd02677 7 AELIRLALEKEEEGDYEAAFEFYRAGVDLLLKGVQGDSSPERREAVKRKIAEYLK 61 (75)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHH
Confidence 5678999999999999999999999999 445555444444446677888863
No 485
>PLN03077 Protein ECB2; Provisional
Probab=40.56 E-value=1.2e+02 Score=41.22 Aligned_cols=81 Identities=15% Similarity=0.136 Sum_probs=48.0
Q ss_pred hHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHHh--------hhhHHHHHH
Q 000425 771 IYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVIA--------DYESILDLF 842 (1525)
Q Consensus 771 ~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~--------d~~~LllLy 842 (1525)
++-.+.|+++.|.++.++-. -. |+... |..|=.+|-.+||.++||.+.+++- -|--|.-+|
T Consensus 633 ~~l~r~G~~~eA~~~~~~m~----~~----pd~~~---~~aLl~ac~~~~~~e~~e~~a~~l~~l~p~~~~~y~ll~n~y 701 (857)
T PLN03077 633 DLLGRAGKLTEAYNFINKMP----IT----PDPAV---WGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLY 701 (857)
T ss_pred HHHHhCCCHHHHHHHHHHCC----CC----CCHHH---HHHHHHHHHHcCChHHHHHHHHHHHhhCCCCcchHHHHHHHH
Confidence 44456677777777766531 11 12222 5566667777777777777666543 344455567
Q ss_pred HHhCCHHHHHHHHHHHHHhc
Q 000425 843 ICHLNPSAMRRLAQRLEEEG 862 (1525)
Q Consensus 843 ~~~gd~e~L~kLa~~ae~~g 862 (1525)
...|+-+...++-+.-+++|
T Consensus 702 a~~g~~~~a~~vr~~M~~~g 721 (857)
T PLN03077 702 ADAGKWDEVARVRKTMRENG 721 (857)
T ss_pred HHCCChHHHHHHHHHHHHcC
Confidence 77777776666666666555
No 486
>KOG1129 consensus TPR repeat-containing protein [General function prediction only]
Probab=40.49 E-value=27 Score=41.99 Aligned_cols=50 Identities=22% Similarity=0.386 Sum_probs=33.9
Q ss_pred ChhHHHHHHHHHHHHHHcCChHHHHHHHHHHhhhhHHHHHHHHhCCHHHHHHHHHHHHHhc
Q 000425 802 TSQLFHRFRQLGYACIKYGQFDSAKETFEVIADYESILDLFICHLNPSAMRRLAQRLEEEG 862 (1525)
Q Consensus 802 ~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~~d~~~LllLy~~~gd~e~L~kLa~~ae~~g 862 (1525)
..+.| =.||-.|..-|++++|+.||+.+= -.--.+-|+|..|+-.+-.+|
T Consensus 394 aaDvW---YNlg~vaV~iGD~nlA~rcfrlaL--------~~d~~h~ealnNLavL~~r~G 443 (478)
T KOG1129|consen 394 AADVW---YNLGFVAVTIGDFNLAKRCFRLAL--------TSDAQHGEALNNLAVLAARSG 443 (478)
T ss_pred hhhhh---hccceeEEeccchHHHHHHHHHHh--------ccCcchHHHHHhHHHHHhhcC
Confidence 34554 469999999999999999998652 122234455666655555555
No 487
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=40.27 E-value=1.8e+02 Score=39.22 Aligned_cols=74 Identities=20% Similarity=0.272 Sum_probs=48.8
Q ss_pred EEEEEEcCCCCEEEEEEeccceEEEEEe-------ecCCccceE----------EccCCCCCeEEEEEcCC---CCEEEE
Q 000425 9 EVHLALTPLQPVVFFGFHRRMSVTVVGT-------VEGGRAPTK----------IKTDLKKPIVNLACHPR---LPVLYV 68 (1525)
Q Consensus 9 e~~LA~hP~qp~Vf~s~~~d~sI~vv~t-------ieggk~~tk----------lk~~H~~~V~sLaf~Pd---g~~LaS 68 (1525)
.-+|.++|++..+.+.......|..... +++++.... +...+...|..+.|||. +.+|++
T Consensus 87 v~~i~~n~~g~~lal~G~~~v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~~~l~v 166 (717)
T PF10168_consen 87 VHQISLNPTGSLLALVGPRGVVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESDSHLVV 166 (717)
T ss_pred EEEEEECCCCCEEEEEcCCcEEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCCCeEEE
Confidence 4578899999988777655444433321 222222111 11224567899999997 478888
Q ss_pred EECCCcEEEEEcCC
Q 000425 69 AYADGLIRAYNIHT 82 (1525)
Q Consensus 69 Gs~DgtIrLWDl~t 82 (1525)
=..|+++|+||+..
T Consensus 167 Ltsdn~lR~y~~~~ 180 (717)
T PF10168_consen 167 LTSDNTLRLYDISD 180 (717)
T ss_pred EecCCEEEEEecCC
Confidence 88999999999864
No 488
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=39.81 E-value=1.1e+02 Score=29.66 Aligned_cols=56 Identities=18% Similarity=0.250 Sum_probs=42.1
Q ss_pred hHHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHHHHc
Q 000425 1310 ASVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKIAVT 1365 (1525)
Q Consensus 1310 ~~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYilal~ 1365 (1525)
....++.|..+=+.|++++|+.+|.+.|. ..+....+.....-+-..+.+|+-=..
T Consensus 6 a~~l~~~Ave~D~~g~y~eAl~~Y~~aie~l~~~lk~e~d~~~k~~~r~ki~eY~~RAE 64 (77)
T cd02683 6 AKEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTKDEAKKKNLRQKISEYMDRAE 64 (77)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHHH
Confidence 36678999999999999999999999999 445444444444556778888875433
No 489
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=39.27 E-value=4.4e+02 Score=29.43 Aligned_cols=23 Identities=17% Similarity=0.225 Sum_probs=19.9
Q ss_pred HHHHhccHHHHHHHHHHHHccCC
Q 000425 1416 RNMEVQNYAYAKQMLELLLSKAP 1438 (1525)
Q Consensus 1416 ~~~K~kNy~tAa~fA~rLL~l~p 1438 (1525)
..++.++|..|....+++++..|
T Consensus 175 ~~~~~g~~~~A~~~~~~al~~~p 197 (235)
T TIGR03302 175 FYLKRGAYVAAINRFETVVENYP 197 (235)
T ss_pred HHHHcCChHHHHHHHHHHHHHCC
Confidence 35788999999999999998876
No 490
>PRK15331 chaperone protein SicA; Provisional
Probab=38.74 E-value=4.6e+02 Score=29.18 Aligned_cols=121 Identities=11% Similarity=0.096 Sum_probs=70.1
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHchHHHHHHhhccCCCCccccchHHHH
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFLALAKDHSRGADVKAQATICAQYKIAVTLLQEILRLQKVQGPSAAISAKDEMA 1390 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~~~v~~~~e~~e~~~li~icreYilal~ll~El~R~~~l~~~~~~~~~~~R~l 1390 (1525)
..-.+.||.++..||+.||...||-+.. -+.- .++|.+||-.+..+...= . + .-..-
T Consensus 38 e~iY~~Ay~~y~~Gk~~eA~~~F~~L~~---~d~~-----------n~~Y~~GLaa~~Q~~k~y------~--~-Ai~~Y 94 (165)
T PRK15331 38 DGLYAHAYEFYNQGRLDEAETFFRFLCI---YDFY-----------NPDYTMGLAAVCQLKKQF------Q--K-ACDLY 94 (165)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHH---hCcC-----------cHHHHHHHHHHHHHHHHH------H--H-HHHHH
Confidence 3356779999999999999999998744 2222 357888887743332211 0 1 01344
Q ss_pred HHHHhhcCCCCchhHHHHHHHHHHHHHHHhccHHHHHHHHHHHHccCC-hhhHHHHHHHHHHHHhcCCCCc
Q 000425 1391 RLSRHLGSLPLQTKHRINCIRTAIKRNMEVQNYAYAKQMLELLLSKAP-ASKQDELRSLIDMCVQRGLSNK 1460 (1525)
Q Consensus 1391 ELaaYfT~c~Lqp~H~~L~Lr~Am~~~~K~kNy~tAa~fA~rLL~l~p-~~~~~qarkil~~ce~~~~d~~ 1460 (1525)
.+|..+...+=-|.++.== -.+..++...|..=-.--++... ....++|+..|.+-..++++..
T Consensus 95 ~~A~~l~~~dp~p~f~agq------C~l~l~~~~~A~~~f~~a~~~~~~~~l~~~A~~~L~~l~~~~~~~~ 159 (165)
T PRK15331 95 AVAFTLLKNDYRPVFFTGQ------CQLLMRKAAKARQCFELVNERTEDESLRAKALVYLEALKTAETEQH 159 (165)
T ss_pred HHHHHcccCCCCccchHHH------HHHHhCCHHHHHHHHHHHHhCcchHHHHHHHHHHHHHHHccccccc
Confidence 4555555555555443211 12334444444433333333333 7788889998887777766654
No 491
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=38.44 E-value=1.2e+02 Score=29.16 Aligned_cols=52 Identities=13% Similarity=0.065 Sum_probs=39.9
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHH
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKI 1362 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYil 1362 (1525)
...+++|...=+.|++++|+.++.+.|. ..+..+.+...-..+-..+.+|+-
T Consensus 7 i~lv~~Av~~D~~g~y~eA~~lY~~ale~~~~~~k~e~~~~~k~~lr~k~~eyl~ 61 (75)
T cd02684 7 IALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYETDAQRKEALRQKVLQYVS 61 (75)
T ss_pred HHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHH
Confidence 5678999999999999999999999999 344444444444457778888864
No 492
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=37.98 E-value=1.2e+02 Score=29.28 Aligned_cols=56 Identities=11% Similarity=0.163 Sum_probs=41.9
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHHHHch
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKIAVTL 1366 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYilal~l 1366 (1525)
...++.|.++=..|+++||+.++...+. -++..-.+.....-+-..+.||+-=..-
T Consensus 7 ~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~yr~ki~eY~~Rae~ 65 (75)
T cd02682 7 RKYAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLIYEQMINEYKRRIEV 65 (75)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHHHH
Confidence 5578899999999999999999999988 4444444444445577888888754443
No 493
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=37.59 E-value=1.3e+02 Score=28.29 Aligned_cols=54 Identities=19% Similarity=0.113 Sum_probs=40.4
Q ss_pred HHHHHHHHHHhhcCChHHHHHHHHHHHH---HHhcCCcchHHHHHHHHHHHHHHHHH
Q 000425 1311 SVCFKTGLAHLEQNQLPDALSCFDEAFL---ALAKDHSRGADVKAQATICAQYKIAV 1364 (1525)
Q Consensus 1311 ~~~lk~gy~~~t~gkf~eAl~~Fr~iL~---~~v~~~~e~~e~~~li~icreYilal 1364 (1525)
...++.|.+.=+.|++++|+.++.+.+. ..+...........+-..|++|+-=+
T Consensus 9 ~~li~~Av~~d~~g~~~eAl~~Y~~a~e~l~~~~~~~~~~~~~~~~~~k~~eyl~ra 65 (77)
T smart00745 9 KELISKALKADEAGDYEEALELYKKAIEYLLEGIKVESDSKRREAVKAKAAEYLDRA 65 (77)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHHH
Confidence 5578888888899999999999999998 33333344444556788889987543
No 494
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=37.48 E-value=1.9e+02 Score=37.84 Aligned_cols=48 Identities=8% Similarity=-0.066 Sum_probs=22.7
Q ss_pred HHHHHHHHcCChHHHHHHHHHHh--------hhhHHHHHHHHhCCHHHHHHHHHHH
Q 000425 811 QLGYACIKYGQFDSAKETFEVIA--------DYESILDLFICHLNPSAMRRLAQRL 858 (1525)
Q Consensus 811 ~Lg~~AL~~G~~~lAeec~q~~~--------d~~~LllLy~~~gd~e~L~kLa~~a 858 (1525)
.+|...+..|+++.|.++|+++- -+-.|-.+|...|+.+.-..+.+.|
T Consensus 404 ~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~a 459 (615)
T TIGR00990 404 HRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRC 459 (615)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 35555555555555555555542 1223334444555544444444433
No 495
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=37.25 E-value=2.6e+02 Score=34.39 Aligned_cols=58 Identities=16% Similarity=0.062 Sum_probs=39.6
Q ss_pred CCEEEEEECCCcEEEEEcCCCcEEEEEe-cCCceeecCceeEEEeCCCCEEEEEeCCCcEEEEeC
Q 000425 63 LPVLYVAYADGLIRAYNIHTYAVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDV 126 (1525)
Q Consensus 63 g~~LaSGs~DgtIrLWDl~t~k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl 126 (1525)
+..+++++.||.+.+.|..+++.+...+ ....+. ....+ .+..|+.++.||.+..++.
T Consensus 335 ~g~l~v~~~~G~l~~ld~~tG~~~~~~~~~~~~~~----s~P~~--~~~~l~v~t~~G~l~~~~~ 393 (394)
T PRK11138 335 NGYLVVGDSEGYLHWINREDGRFVAQQKVDSSGFL----SEPVV--ADDKLLIQARDGTVYAITR 393 (394)
T ss_pred CCEEEEEeCCCEEEEEECCCCCEEEEEEcCCCcce----eCCEE--ECCEEEEEeCCceEEEEeC
Confidence 4567788889999999999998877776 333222 01111 2457888888998887653
No 496
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=37.00 E-value=4.1e+02 Score=28.66 Aligned_cols=104 Identities=20% Similarity=0.235 Sum_probs=62.3
Q ss_pred EEEcCCCCEEEEEECCCcEEEEEcCCC--------cEEEEEecCCceeecCceeEE---EeCC--CCEEEEEeCCCcEEE
Q 000425 57 LACHPRLPVLYVAYADGLIRAYNIHTY--------AVHYTLQLDNTIKLLGAGAFA---FHPT--LEWLFVGDRRGTLLA 123 (1525)
Q Consensus 57 Laf~Pdg~~LaSGs~DgtIrLWDl~t~--------k~l~tL~H~~~V~~~ga~sVa---FSPd--g~~L~SgS~DgtVrV 123 (1525)
-.|....+.|+.+..-++|.|.+.+.. ..+..|.-...|+ +++ |.|+ ...|+.| ...++.+
T Consensus 4 GkfDG~~pcL~~aT~~gKV~IH~ph~~~~~~~~~~~~i~~LNin~~it-----alaaG~l~~~~~~D~LliG-t~t~lla 77 (136)
T PF14781_consen 4 GKFDGVHPCLACATTGGKVFIHNPHERGQRTGRQDSDISFLNINQEIT-----ALAAGRLKPDDGRDCLLIG-TQTSLLA 77 (136)
T ss_pred EEeCCCceeEEEEecCCEEEEECCCccccccccccCceeEEECCCceE-----EEEEEecCCCCCcCEEEEe-ccceEEE
Confidence 456667778888888899999986633 3455566566677 664 4332 3445555 4567899
Q ss_pred EeCCCCCceeeEeecccCcCeEEEEEcC----CCCEEEEEECCCCeeEEEe
Q 000425 124 WDVSIERPSMIGIIQVGSQPITSVAWLP----MLRLLVTLCRDGSLQVWKT 170 (1525)
Q Consensus 124 WDl~t~~~s~i~il~gH~~~VtsVafsP----dg~~LaSgS~DgtIkVWDl 170 (1525)
||+... ..+. +..-...|+++.+-. +.++++.|+ +..|+-+|.
T Consensus 78 YDV~~N-~d~F--yke~~DGvn~i~~g~~~~~~~~l~ivGG-ncsi~Gfd~ 124 (136)
T PF14781_consen 78 YDVENN-SDLF--YKEVPDGVNAIVIGKLGDIPSPLVIVGG-NCSIQGFDY 124 (136)
T ss_pred EEcccC-chhh--hhhCccceeEEEEEecCCCCCcEEEECc-eEEEEEeCC
Confidence 999853 2222 223346677777632 344555543 455555543
No 497
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=36.96 E-value=2.6e+02 Score=37.12 Aligned_cols=93 Identities=14% Similarity=0.086 Sum_probs=59.9
Q ss_pred ccceEEEEEeecCCccceEEccCCCCCeEEEEEc--CCCCEEEEEECCCcEEEEEc---------CCCcEEEEEe---cC
Q 000425 27 RRMSVTVVGTVEGGRAPTKIKTDLKKPIVNLACH--PRLPVLYVAYADGLIRAYNI---------HTYAVHYTLQ---LD 92 (1525)
Q Consensus 27 ~d~sI~vv~tieggk~~tklk~~H~~~V~sLaf~--Pdg~~LaSGs~DgtIrLWDl---------~t~k~l~tL~---H~ 92 (1525)
++..+.+|++..+........ ...+.|..++|. |+++.+++-+..+.|.+|.- .+..+++.+. |.
T Consensus 49 ~~~~LtIWD~~~~~lE~~~~f-~~~~~I~dLDWtst~d~qsiLaVGf~~~v~l~~Q~R~dy~~~~p~w~~i~~i~i~~~T 127 (631)
T PF12234_consen 49 SRSELTIWDTRSGVLEYEESF-SEDDPIRDLDWTSTPDGQSILAVGFPHHVLLYTQLRYDYTNKGPSWAPIRKIDISSHT 127 (631)
T ss_pred CCCEEEEEEcCCcEEEEeeee-cCCCceeeceeeecCCCCEEEEEEcCcEEEEEEccchhhhcCCcccceeEEEEeecCC
Confidence 356667777644432222112 247899999986 56777777778889999852 1234556555 43
Q ss_pred -CceeecCceeEEEeCCCCEEEEEeCCCcEEEEeCC
Q 000425 93 -NTIKLLGAGAFAFHPTLEWLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 93 -~~V~~~ga~sVaFSPdg~~L~SgS~DgtVrVWDl~ 127 (1525)
.+|. +..|.++|..++.+ ++.+.|+|-.
T Consensus 128 ~h~Ig-----ds~Wl~~G~LvV~s--GNqlfv~dk~ 156 (631)
T PF12234_consen 128 PHPIG-----DSIWLKDGTLVVGS--GNQLFVFDKW 156 (631)
T ss_pred CCCcc-----ceeEecCCeEEEEe--CCEEEEECCC
Confidence 6777 99999998765544 3567777644
No 498
>PRK10115 protease 2; Provisional
Probab=36.89 E-value=1.2e+03 Score=31.37 Aligned_cols=108 Identities=11% Similarity=0.071 Sum_probs=60.3
Q ss_pred EEEcCCCCEEEEEEeccce----EEEEEeecCCccceEEccCCCCCeEEEEEcCCCCEEEEEECC------CcEEEEEcC
Q 000425 12 LALTPLQPVVFFGFHRRMS----VTVVGTVEGGRAPTKIKTDLKKPIVNLACHPRLPVLYVAYAD------GLIRAYNIH 81 (1525)
Q Consensus 12 LA~hP~qp~Vf~s~~~d~s----I~vv~tieggk~~tklk~~H~~~V~sLaf~Pdg~~LaSGs~D------gtIrLWDl~ 81 (1525)
+.++|++.+++++.+...+ +.+.+...+......+. ..-..++|.+++..|+....| ..|..+++.
T Consensus 132 ~~~Spdg~~la~~~d~~G~E~~~l~v~d~~tg~~l~~~i~----~~~~~~~w~~D~~~~~y~~~~~~~~~~~~v~~h~lg 207 (686)
T PRK10115 132 MAITPDNTIMALAEDFLSRRQYGIRFRNLETGNWYPELLD----NVEPSFVWANDSWTFYYVRKHPVTLLPYQVWRHTIG 207 (686)
T ss_pred EEECCCCCEEEEEecCCCcEEEEEEEEECCCCCCCCcccc----CcceEEEEeeCCCEEEEEEecCCCCCCCEEEEEECC
Confidence 4678999999888765333 34444332211122221 112569999999866555432 357778888
Q ss_pred CC--cEEEEEe-cCCceeecCceeEEEeCCCCEEEEEe---CCCcEEEEeCC
Q 000425 82 TY--AVHYTLQ-LDNTIKLLGAGAFAFHPTLEWLFVGD---RRGTLLAWDVS 127 (1525)
Q Consensus 82 t~--k~l~tL~-H~~~V~~~ga~sVaFSPdg~~L~SgS---~DgtVrVWDl~ 127 (1525)
+. +-...++ ...... ..+..+.|+++++..+ .++.+.+|+..
T Consensus 208 t~~~~d~lv~~e~~~~~~----~~~~~s~d~~~l~i~~~~~~~~~~~l~~~~ 255 (686)
T PRK10115 208 TPASQDELVYEEKDDTFY----VSLHKTTSKHYVVIHLASATTSEVLLLDAE 255 (686)
T ss_pred CChhHCeEEEeeCCCCEE----EEEEEcCCCCEEEEEEECCccccEEEEECc
Confidence 77 3333444 222222 0223344888776544 45678888854
No 499
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance [].
Probab=36.71 E-value=2e+02 Score=38.51 Aligned_cols=101 Identities=12% Similarity=0.100 Sum_probs=67.3
Q ss_pred EEEEEEeccceEEEEEeec------C-------------CccceEEccCCCCCeEEEEEc--CCCCEEEEEECCCcEEEE
Q 000425 20 VVFFGFHRRMSVTVVGTVE------G-------------GRAPTKIKTDLKKPIVNLACH--PRLPVLYVAYADGLIRAY 78 (1525)
Q Consensus 20 ~Vf~s~~~d~sI~vv~tie------g-------------gk~~tklk~~H~~~V~sLaf~--Pdg~~LaSGs~DgtIrLW 78 (1525)
-|++.+.+++.|.+|.+.+ . .++.+.++ ....+.++++| ...+++|+++....|.||
T Consensus 115 EVLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~--v~~SaWGLdIh~~~~~rlIAVSsNs~~VTVF 192 (717)
T PF08728_consen 115 EVLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLR--VGASAWGLDIHDYKKSRLIAVSSNSQEVTVF 192 (717)
T ss_pred eEEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEee--cCCceeEEEEEecCcceEEEEecCCceEEEE
Confidence 4555556788888776411 0 11233333 35688999999 888899999999999988
Q ss_pred EcCCC--cE-E-EEEecCCceeecCceeEEEeCCC---C---EEEEEeCCCcEEEEeCC
Q 000425 79 NIHTY--AV-H-YTLQLDNTIKLLGAGAFAFHPTL---E---WLFVGDRRGTLLAWDVS 127 (1525)
Q Consensus 79 Dl~t~--k~-l-~tL~H~~~V~~~ga~sVaFSPdg---~---~L~SgS~DgtVrVWDl~ 127 (1525)
-+... +. . ....|...|- +|+|-++. . ++++++-.|.+.+|++.
T Consensus 193 af~l~~~r~~~~~s~~~~hNIP-----~VSFl~~~~d~~G~v~v~a~dI~G~v~~~~I~ 246 (717)
T PF08728_consen 193 AFALVDERFYHVPSHQHSHNIP-----NVSFLDDDLDPNGHVKVVATDISGEVWTFKIK 246 (717)
T ss_pred EEeccccccccccccccccCCC-----eeEeecCCCCCccceEEEEEeccCcEEEEEEE
Confidence 65432 11 1 1222566777 88886643 2 78888999999998874
No 500
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=36.58 E-value=59 Score=30.00 Aligned_cols=53 Identities=17% Similarity=0.231 Sum_probs=0.0
Q ss_pred hhhHHhhcCCHHHHHHHHHHhhcccCCCCCCCCChhHHHHHHHHHHHHHHcCChHHHHHHHHHH
Q 000425 769 RGIYAIKALRFSTALSVLKDEFLRSRDYPKCPPTSQLFHRFRQLGYACIKYGQFDSAKETFEVI 832 (1525)
Q Consensus 769 rF~LAl~lg~l~~A~~ia~~~~~~~~d~~~~~~~~~~~~rwk~Lg~~AL~~G~~~lAeec~q~~ 832 (1525)
+|+-|+. -++.|+++.+... +.....-.-+-.+|......|+++.|++.|+++
T Consensus 20 ~~~~A~~--~~~~al~~~~~~~---------~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~a 72 (78)
T PF13424_consen 20 RYDEALD--YYEKALDIEEQLG---------DDHPDTANTLNNLGECYYRLGDYEEALEYYQKA 72 (78)
T ss_dssp -HHHHHH--HHHHHHHHHHHTT---------THHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred CHHHHHH--HHHHHHHHHHHHC---------CCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Done!