Query 003405
Match_columns 823
No_of_seqs 222 out of 891
Neff 8.4
Searched_HMMs 46136
Date Thu Mar 28 22:50:36 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/003405.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/003405hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2063 Vacuolar assembly/sort 100.0 2.1E-87 4.4E-92 772.3 54.1 701 2-773 1-745 (877)
2 KOG2114 Vacuolar assembly/sort 100.0 3E-42 6.6E-47 383.1 46.4 615 12-777 20-753 (933)
3 KOG2066 Vacuolar assembly/sort 100.0 1.2E-37 2.5E-42 345.0 45.5 604 18-798 41-743 (846)
4 PF00780 CNH: CNH domain; Int 100.0 1.3E-33 2.9E-38 301.8 28.6 240 21-282 2-273 (275)
5 smart00036 CNH Domain found in 99.9 7E-25 1.5E-29 235.6 26.7 244 27-292 14-301 (302)
6 PF10366 Vps39_1: Vacuolar sor 99.9 4.7E-22 1E-26 177.9 9.5 107 507-617 1-108 (108)
7 COG5422 ROM1 RhoGEF, Guanine n 99.8 4.6E-20 9.9E-25 205.8 18.5 261 4-282 842-1143(1175)
8 KOG2034 Vacuolar sorting prote 99.6 1.5E-14 3.1E-19 164.8 19.5 409 237-774 288-734 (911)
9 smart00299 CLH Clathrin heavy 99.5 6.2E-14 1.3E-18 133.8 11.4 121 631-774 6-139 (140)
10 KOG4305 RhoGEF GTPase [Signal 99.5 1.4E-13 2.9E-18 161.6 13.9 207 75-282 752-997 (1029)
11 PF00637 Clathrin: Region in C 99.3 1.4E-13 3.1E-18 131.8 -0.8 122 631-775 6-140 (143)
12 cd00200 WD40 WD40 domain, foun 98.6 7.3E-05 1.6E-09 78.0 30.1 233 16-273 9-258 (289)
13 KOG0985 Vesicle coat protein c 98.6 4.1E-06 8.8E-11 97.0 20.9 334 295-779 1043-1419(1666)
14 KOG0294 WD40 repeat-containing 98.3 2.9E-05 6.2E-10 79.9 18.1 166 16-185 43-238 (362)
15 cd00200 WD40 WD40 domain, foun 98.3 0.00049 1.1E-08 71.7 26.1 186 17-224 52-249 (289)
16 KOG2066 Vacuolar assembly/sort 98.2 0.0012 2.6E-08 76.0 29.7 80 511-608 613-692 (846)
17 KOG0576 Mitogen-activated prot 98.0 4.1E-05 9E-10 86.4 12.2 240 10-272 485-786 (829)
18 KOG2048 WD40 repeat protein [G 97.9 0.0036 7.9E-08 70.8 24.2 238 15-305 24-279 (691)
19 KOG0276 Vesicle coat complex C 97.9 0.0038 8.2E-08 69.9 23.5 267 18-324 185-517 (794)
20 KOG0291 WD40-repeat-containing 97.8 0.011 2.4E-07 67.7 27.0 222 16-264 307-551 (893)
21 KOG3621 WD40 repeat-containing 97.8 0.0015 3.2E-08 74.4 20.1 281 17-320 36-372 (726)
22 PF05131 Pep3_Vps18: Pep3/Vps1 97.8 0.00037 7.9E-09 66.3 12.5 106 238-352 35-147 (147)
23 KOG1539 WD repeat protein [Gen 97.7 0.00082 1.8E-08 77.4 16.3 174 18-213 452-634 (910)
24 KOG0289 mRNA splicing factor [ 97.6 0.0012 2.5E-08 70.9 14.8 136 18-173 349-492 (506)
25 KOG1273 WD40 repeat protein [G 97.6 0.002 4.3E-08 66.6 15.7 234 22-278 31-294 (405)
26 KOG0279 G protein beta subunit 97.6 0.021 4.5E-07 58.5 22.4 245 12-275 11-273 (315)
27 TIGR03866 PQQ_ABC_repeats PQQ- 97.5 0.16 3.4E-06 54.1 30.1 246 28-301 3-279 (300)
28 KOG2114 Vacuolar assembly/sort 97.5 0.00064 1.4E-08 78.7 11.2 102 640-764 406-512 (933)
29 KOG1446 Histone H3 (Lys4) meth 97.5 0.029 6.2E-07 58.4 22.0 213 5-222 3-260 (311)
30 KOG0976 Rho/Rac1-interacting s 97.5 6.8E-05 1.5E-09 85.0 3.3 223 18-263 948-1193(1265)
31 KOG0310 Conserved WD40 repeat- 97.4 0.0036 7.9E-08 68.2 15.7 142 18-181 155-305 (487)
32 KOG0985 Vesicle coat protein c 97.4 0.0014 2.9E-08 77.0 12.9 222 506-772 873-1116(1666)
33 TIGR03866 PQQ_ABC_repeats PQQ- 97.3 0.22 4.7E-06 53.0 27.8 227 26-276 42-291 (300)
34 KOG1274 WD40 repeat protein [G 97.2 0.036 7.7E-07 65.1 21.5 180 15-213 55-249 (933)
35 KOG0291 WD40-repeat-containing 97.2 0.14 3E-06 59.2 25.5 115 74-188 54-179 (893)
36 PF00637 Clathrin: Region in C 97.1 0.00015 3.4E-09 69.1 1.3 98 506-621 43-140 (143)
37 KOG1274 WD40 repeat protein [G 97.1 0.011 2.3E-07 69.4 15.9 158 8-185 88-263 (933)
38 smart00299 CLH Clathrin heavy 97.1 0.0042 9E-08 59.0 10.7 88 506-611 42-130 (140)
39 PTZ00420 coronin; Provisional 97.1 0.25 5.5E-06 57.7 27.1 118 69-190 69-203 (568)
40 PTZ00420 coronin; Provisional 97.1 0.13 2.8E-06 60.1 24.4 156 16-185 74-249 (568)
41 PF10282 Lactonase: Lactonase, 97.1 0.22 4.7E-06 55.1 25.4 238 25-277 47-340 (345)
42 PF14761 HPS3_N: Hermansky-Pud 97.0 0.021 4.6E-07 57.1 15.2 161 5-184 7-214 (215)
43 KOG0279 G protein beta subunit 97.0 0.058 1.3E-06 55.3 18.2 152 17-191 106-269 (315)
44 KOG1407 WD40 repeat protein [F 97.0 0.084 1.8E-06 53.7 19.0 238 18-278 22-275 (313)
45 KOG0299 U3 snoRNP-associated p 97.0 0.09 2E-06 57.4 20.2 253 17-276 143-426 (479)
46 PRK11028 6-phosphogluconolacto 97.0 0.34 7.3E-06 53.0 25.9 224 23-263 43-304 (330)
47 COG2706 3-carboxymuconate cycl 96.9 0.55 1.2E-05 50.2 24.9 221 24-262 49-320 (346)
48 KOG0318 WD40 repeat stress pro 96.9 0.16 3.5E-06 56.3 21.1 172 16-210 363-544 (603)
49 PTZ00421 coronin; Provisional 96.8 0.37 8E-06 55.8 25.6 156 16-186 75-247 (493)
50 KOG2445 Nuclear pore complex c 96.8 0.052 1.1E-06 56.4 15.7 150 26-185 73-257 (361)
51 KOG0275 Conserved WD40 repeat- 96.7 0.027 5.8E-07 58.4 13.2 200 16-229 213-428 (508)
52 KOG1036 Mitotic spindle checkp 96.7 0.06 1.3E-06 55.9 15.5 128 18-167 56-187 (323)
53 KOG0649 WD40 repeat protein [G 96.6 0.77 1.7E-05 46.5 22.0 226 21-262 17-273 (325)
54 PLN00181 protein SPA1-RELATED; 96.5 0.35 7.5E-06 59.9 24.2 149 17-186 533-692 (793)
55 KOG0318 WD40 repeat stress pro 96.5 2.1 4.5E-05 48.0 28.5 263 18-305 283-564 (603)
56 KOG0266 WD40 repeat-containing 96.5 0.33 7.2E-06 55.8 21.8 155 18-191 205-371 (456)
57 KOG0587 Traf2- and Nck-interac 96.4 0.0012 2.5E-08 77.3 1.5 230 13-263 637-901 (953)
58 KOG0278 Serine/threonine kinas 96.4 0.057 1.2E-06 54.5 12.6 164 26-211 112-282 (334)
59 KOG2111 Uncharacterized conser 96.3 0.27 5.8E-06 51.5 17.6 138 26-187 17-170 (346)
60 KOG0646 WD40 repeat protein [G 96.2 0.076 1.6E-06 58.0 13.7 149 15-185 80-248 (476)
61 KOG2110 Uncharacterized conser 96.2 0.18 3.9E-06 53.8 15.8 153 15-186 86-250 (391)
62 PLN00181 protein SPA1-RELATED; 96.2 2.2 4.7E-05 52.9 28.3 230 17-263 484-738 (793)
63 KOG0310 Conserved WD40 repeat- 96.2 0.37 8E-06 53.1 18.5 211 12-246 64-289 (487)
64 KOG0292 Vesicle coat complex C 96.2 1.9 4.1E-05 51.2 24.9 243 18-281 208-496 (1202)
65 KOG1036 Mitotic spindle checkp 96.1 0.16 3.4E-06 53.0 14.7 145 18-185 15-164 (323)
66 KOG0772 Uncharacterized conser 96.1 0.29 6.3E-06 54.3 17.2 261 22-299 175-485 (641)
67 PTZ00421 coronin; Provisional 96.1 0.58 1.3E-05 54.1 21.0 154 16-188 125-294 (493)
68 KOG2055 WD40 repeat protein [G 96.0 0.29 6.3E-06 53.6 16.7 182 17-219 214-410 (514)
69 KOG4378 Nuclear protein COP1 [ 96.0 0.16 3.5E-06 55.8 14.7 149 16-185 121-281 (673)
70 KOG0266 WD40 repeat-containing 96.0 2.1 4.5E-05 49.3 24.9 236 16-275 159-420 (456)
71 KOG0273 Beta-transducin family 96.0 1.8 4E-05 47.7 22.3 256 17-298 236-520 (524)
72 KOG2055 WD40 repeat protein [G 95.9 0.02 4.3E-07 62.3 7.4 113 18-143 391-512 (514)
73 KOG0646 WD40 repeat protein [G 95.9 0.26 5.7E-06 54.0 15.6 163 17-189 124-312 (476)
74 KOG2315 Predicted translation 95.9 0.61 1.3E-05 52.4 18.7 199 97-303 148-394 (566)
75 PF10366 Vps39_1: Vacuolar sor 95.8 0.016 3.5E-07 52.2 5.3 65 671-756 3-67 (108)
76 KOG0285 Pleiotropic regulator 95.8 2.4 5.2E-05 45.3 21.6 170 3-196 138-320 (460)
77 KOG0274 Cdc4 and related F-box 95.7 0.16 3.5E-06 59.1 14.5 149 16-188 331-486 (537)
78 KOG0306 WD40-repeat-containing 95.7 0.32 7E-06 56.3 16.0 155 20-195 418-592 (888)
79 COG2706 3-carboxymuconate cycl 95.6 3.2 6.8E-05 44.6 22.0 178 96-273 17-234 (346)
80 KOG1539 WD repeat protein [Gen 95.6 1.3 2.8E-05 52.1 20.4 189 13-228 73-279 (910)
81 KOG0276 Vesicle coat complex C 95.5 2.3 5E-05 48.5 21.4 220 16-262 13-256 (794)
82 PLN03081 pentatricopeptide (PP 95.3 5.5 0.00012 48.5 26.7 60 506-576 260-320 (697)
83 KOG0647 mRNA export protein (c 95.3 1.7 3.7E-05 45.4 18.1 192 75-274 27-238 (347)
84 KOG0315 G-protein beta subunit 95.2 1.4 3E-05 44.9 16.9 217 22-263 48-288 (311)
85 KOG2048 WD40 repeat protein [G 95.2 1 2.2E-05 51.7 17.9 194 16-228 291-509 (691)
86 KOG0294 WD40 repeat-containing 95.2 1.7 3.7E-05 45.7 18.0 179 75-263 83-281 (362)
87 KOG2106 Uncharacterized conser 95.0 7.4 0.00016 43.6 23.0 151 26-195 212-368 (626)
88 TIGR02658 TTQ_MADH_Hv methylam 95.0 4.4 9.5E-05 44.6 21.9 160 97-263 29-223 (352)
89 PRK11028 6-phosphogluconolacto 95.0 6.7 0.00014 42.8 25.4 178 96-273 13-218 (330)
90 KOG0271 Notchless-like WD40 re 95.0 0.85 1.8E-05 49.0 15.5 171 16-189 247-444 (480)
91 KOG0319 WD40-repeat-containing 95.0 3.8 8.3E-05 47.7 21.6 225 16-263 107-354 (775)
92 KOG0647 mRNA export protein (c 94.9 0.98 2.1E-05 47.1 15.3 150 14-186 70-230 (347)
93 KOG0772 Uncharacterized conser 94.9 0.24 5.3E-06 54.8 11.5 146 21-184 321-488 (641)
94 KOG1523 Actin-related protein 94.8 2.7 5.8E-05 44.4 18.3 147 13-175 7-164 (361)
95 KOG0288 WD40 repeat protein Ti 94.8 0.42 9E-06 51.7 12.7 171 17-212 220-403 (459)
96 PF03178 CPSF_A: CPSF A subuni 94.8 0.54 1.2E-05 51.3 14.4 142 17-175 24-192 (321)
97 KOG0315 G-protein beta subunit 94.8 2.2 4.8E-05 43.4 16.9 148 67-218 33-189 (311)
98 KOG0640 mRNA cleavage stimulat 94.7 0.77 1.7E-05 47.9 14.0 180 17-217 217-418 (430)
99 KOG2106 Uncharacterized conser 94.7 4.8 0.0001 45.0 20.6 179 17-227 247-482 (626)
100 KOG1188 WD40 repeat protein [G 94.7 0.41 8.9E-06 50.6 12.1 139 26-185 40-197 (376)
101 KOG0305 Anaphase promoting com 94.6 1.1 2.5E-05 50.7 16.2 188 16-225 260-462 (484)
102 PF04053 Coatomer_WDAD: Coatom 94.5 1.6 3.4E-05 49.8 17.5 125 87-227 117-256 (443)
103 KOG2110 Uncharacterized conser 94.5 4.6 9.9E-05 43.5 19.4 139 120-263 91-248 (391)
104 KOG0319 WD40-repeat-containing 94.5 1.8 3.8E-05 50.4 17.5 280 17-323 63-372 (775)
105 KOG1524 WD40 repeat-containing 94.4 1.9 4.1E-05 48.3 16.7 253 27-306 76-353 (737)
106 PF14727 PHTB1_N: PTHB1 N-term 94.3 5.4 0.00012 44.9 20.9 132 27-175 38-194 (418)
107 KOG0275 Conserved WD40 repeat- 94.3 0.57 1.2E-05 48.9 12.0 178 17-216 264-457 (508)
108 KOG0296 Angio-associated migra 94.2 4.8 0.0001 43.2 18.7 150 18-189 150-361 (399)
109 KOG0273 Beta-transducin family 94.1 1.4 3E-05 48.6 15.0 146 17-184 360-523 (524)
110 PLN03081 pentatricopeptide (PP 94.0 12 0.00026 45.7 25.2 60 506-575 361-420 (697)
111 PF04053 Coatomer_WDAD: Coatom 94.0 12 0.00027 42.5 23.4 264 76-361 33-317 (443)
112 PLN03077 Protein ECB2; Provisi 94.0 22 0.00048 44.5 28.1 59 506-574 425-483 (857)
113 KOG0284 Polyadenylation factor 93.9 0.32 7E-06 52.5 9.6 161 10-194 132-302 (464)
114 KOG1445 Tumor-specific antigen 93.9 0.63 1.4E-05 52.8 12.2 136 27-181 641-781 (1012)
115 KOG4378 Nuclear protein COP1 [ 93.9 0.66 1.4E-05 51.3 12.0 132 28-179 179-314 (673)
116 KOG2076 RNA polymerase III tra 93.9 1.3 2.8E-05 52.8 15.2 239 535-774 154-463 (895)
117 KOG2096 WD40 repeat protein [G 93.9 2.8 6E-05 44.2 15.9 144 84-227 236-406 (420)
118 KOG0284 Polyadenylation factor 93.8 0.68 1.5E-05 50.1 11.8 171 26-223 108-293 (464)
119 KOG0307 Vesicle coat complex C 93.8 0.25 5.3E-06 59.8 9.4 217 28-264 82-328 (1049)
120 KOG0306 WD40-repeat-containing 93.7 2.4 5.1E-05 49.5 16.5 147 16-183 65-218 (888)
121 KOG0288 WD40 repeat protein Ti 93.7 1 2.3E-05 48.8 12.9 137 22-178 308-454 (459)
122 KOG0293 WD40 repeat-containing 93.4 1.7 3.7E-05 47.2 13.9 171 18-207 314-493 (519)
123 KOG0650 WD40 repeat nucleolar 93.4 4.3 9.2E-05 46.3 17.5 82 58-140 384-469 (733)
124 PF13432 TPR_16: Tetratricopep 93.4 0.25 5.4E-06 39.6 6.2 55 306-365 3-57 (65)
125 TIGR02917 PEP_TPR_lipo putativ 93.2 27 0.00059 43.1 27.2 98 647-775 786-886 (899)
126 KOG0263 Transcription initiati 93.2 0.6 1.3E-05 54.4 10.9 108 17-143 536-649 (707)
127 KOG2139 WD40 repeat protein [G 93.2 12 0.00026 40.3 19.4 166 11-185 135-310 (445)
128 KOG1240 Protein kinase contain 93.1 2.9 6.4E-05 51.4 16.7 163 16-193 1098-1282(1431)
129 KOG0289 mRNA splicing factor [ 92.9 8.4 0.00018 42.4 18.1 136 18-173 305-450 (506)
130 PRK11447 cellulose synthase su 92.8 39 0.00085 43.9 28.2 53 307-364 358-410 (1157)
131 TIGR03300 assembly_YfgL outer 92.7 19 0.00042 39.9 26.9 125 165-299 239-377 (377)
132 KOG0976 Rho/Rac1-interacting s 92.6 0.25 5.4E-06 57.2 6.7 148 111-263 941-1108(1265)
133 KOG0274 Cdc4 and related F-box 92.5 26 0.00056 41.0 25.7 232 16-275 208-452 (537)
134 KOG1273 WD40 repeat protein [G 92.5 1 2.2E-05 47.3 10.3 103 23-143 162-278 (405)
135 KOG0639 Transducin-like enhanc 92.4 1.2 2.5E-05 49.5 11.2 145 24-188 475-626 (705)
136 PF02239 Cytochrom_D1: Cytochr 92.4 9.2 0.0002 42.6 18.9 174 97-274 18-212 (369)
137 KOG0316 Conserved WD40 repeat- 92.4 14 0.0003 37.6 24.3 260 17-301 18-299 (307)
138 KOG0973 Histone transcription 92.3 9.9 0.00022 46.3 19.6 126 16-143 69-201 (942)
139 KOG0293 WD40 repeat-containing 92.3 3.9 8.5E-05 44.5 14.7 143 26-186 236-386 (519)
140 PF03178 CPSF_A: CPSF A subuni 92.2 2.4 5.3E-05 46.2 14.0 87 97-185 64-158 (321)
141 KOG0285 Pleiotropic regulator 92.2 3.3 7.2E-05 44.3 13.9 123 16-158 277-404 (460)
142 PF08596 Lgl_C: Lethal giant l 92.2 0.83 1.8E-05 51.2 10.3 142 28-186 157-337 (395)
143 KOG0295 WD40 repeat-containing 92.1 6 0.00013 42.5 15.6 159 75-239 193-376 (406)
144 KOG0642 Cell-cycle nuclear pro 92.1 3.1 6.7E-05 47.0 14.2 156 27-196 307-482 (577)
145 PLN03218 maturation of RBCL 1; 92.1 44 0.00096 42.6 30.6 50 729-778 720-772 (1060)
146 KOG0282 mRNA splicing factor [ 92.0 1.2 2.5E-05 49.3 10.7 163 18-203 301-479 (503)
147 KOG0278 Serine/threonine kinas 92.0 2.3 5E-05 43.3 11.9 136 16-174 143-284 (334)
148 PLN03218 maturation of RBCL 1; 91.9 46 0.001 42.5 26.4 41 731-771 757-800 (1060)
149 PRK11447 cellulose synthase su 91.8 27 0.00059 45.3 24.8 52 308-364 277-328 (1157)
150 KOG0282 mRNA splicing factor [ 91.7 1.4 3E-05 48.8 10.8 235 18-274 216-472 (503)
151 KOG0649 WD40 repeat protein [G 91.6 3.6 7.8E-05 41.8 12.7 136 18-175 116-264 (325)
152 KOG0286 G-protein beta subunit 91.6 18 0.0004 37.9 18.1 152 19-189 148-308 (343)
153 PF10282 Lactonase: Lactonase, 91.5 25 0.00055 38.6 29.1 246 28-291 1-310 (345)
154 KOG1272 WD40-repeat-containing 91.4 0.57 1.2E-05 51.5 7.5 141 18-179 255-403 (545)
155 KOG2096 WD40 repeat protein [G 91.4 8.2 0.00018 40.8 15.5 151 74-226 85-264 (420)
156 KOG0283 WD40 repeat-containing 91.4 13 0.00028 44.1 18.8 173 27-221 381-572 (712)
157 PF12341 DUF3639: Protein of u 91.3 0.42 9.1E-06 31.2 3.9 26 17-42 2-27 (27)
158 KOG1645 RING-finger-containing 91.2 5.5 0.00012 43.4 14.4 22 27-48 249-270 (463)
159 KOG1446 Histone H3 (Lys4) meth 91.2 9.3 0.0002 40.3 15.7 145 16-175 142-293 (311)
160 KOG0316 Conserved WD40 repeat- 91.2 5.2 0.00011 40.5 13.3 80 4-104 133-214 (307)
161 KOG2321 WD40 repeat protein [G 91.2 11 0.00023 43.0 17.1 181 76-263 52-258 (703)
162 PLN03077 Protein ECB2; Provisi 91.1 50 0.0011 41.3 27.3 61 506-576 324-384 (857)
163 PF02239 Cytochrom_D1: Cytochr 91.1 8 0.00017 43.1 16.6 254 24-302 46-348 (369)
164 PF04762 IKI3: IKI3 family; I 91.1 51 0.0011 41.4 35.1 81 547-644 813-897 (928)
165 KOG0650 WD40 repeat nucleolar 90.6 1.2 2.6E-05 50.6 9.1 110 74-186 520-639 (733)
166 KOG0643 Translation initiation 90.6 22 0.00047 37.0 17.2 157 17-194 53-230 (327)
167 KOG2445 Nuclear pore complex c 90.5 12 0.00026 39.5 15.6 156 6-176 3-193 (361)
168 KOG4328 WD40 protein [Function 90.4 5.7 0.00012 43.9 13.8 169 26-212 291-480 (498)
169 KOG0771 Prolactin regulatory e 90.2 4.8 0.0001 43.9 13.0 156 17-193 147-320 (398)
170 KOG0283 WD40 repeat-containing 90.1 3.9 8.5E-05 48.2 13.2 97 87-186 380-483 (712)
171 KOG0643 Translation initiation 90.1 12 0.00026 38.7 15.0 98 74-174 9-111 (327)
172 PF08662 eIF2A: Eukaryotic tra 90.0 18 0.00039 36.2 16.7 104 75-184 59-179 (194)
173 PF14559 TPR_19: Tetratricopep 89.9 0.28 6E-06 39.6 2.8 50 310-364 1-50 (68)
174 KOG1517 Guanine nucleotide bin 89.8 3.5 7.6E-05 50.0 12.6 147 24-186 1219-1383(1387)
175 KOG2079 Vacuolar assembly/sort 89.7 61 0.0013 40.1 30.2 239 506-753 542-826 (1206)
176 PRK10747 putative protoheme IX 89.7 3.8 8.2E-05 46.2 12.7 177 549-755 190-388 (398)
177 KOG0265 U5 snRNP-specific prot 89.2 31 0.00067 36.4 17.4 155 15-190 46-210 (338)
178 KOG0272 U4/U6 small nuclear ri 89.0 9.3 0.0002 41.9 14.0 185 16-221 217-415 (459)
179 KOG1332 Vesicle coat complex C 89.0 5.3 0.00012 40.7 11.4 129 22-166 19-158 (299)
180 TIGR02917 PEP_TPR_lipo putativ 89.0 67 0.0015 39.6 27.3 55 721-775 762-819 (899)
181 KOG0268 Sof1-like rRNA process 88.6 12 0.00025 40.4 14.2 103 76-181 148-254 (433)
182 KOG2041 WD40 repeat protein [G 88.6 38 0.00082 39.8 18.9 160 16-185 14-187 (1189)
183 KOG0292 Vesicle coat complex C 88.4 67 0.0015 38.9 26.6 194 19-215 138-385 (1202)
184 PF11768 DUF3312: Protein of u 88.4 54 0.0012 37.8 31.4 75 138-215 239-318 (545)
185 KOG2321 WD40 repeat protein [G 88.3 10 0.00022 43.2 14.2 168 28-216 147-334 (703)
186 KOG0299 U3 snoRNP-associated p 88.2 7.1 0.00015 43.2 12.6 147 18-186 204-358 (479)
187 KOG0640 mRNA cleavage stimulat 88.0 4.9 0.00011 42.2 10.7 161 9-188 162-339 (430)
188 KOG1407 WD40 repeat protein [F 88.0 9.1 0.0002 39.5 12.4 142 24-189 116-266 (313)
189 PF12894 Apc4_WD40: Anaphase-p 87.9 1.3 2.9E-05 33.2 5.0 33 13-45 8-42 (47)
190 KOG0305 Anaphase promoting com 87.9 8.8 0.00019 43.8 13.8 137 26-184 313-461 (484)
191 KOG1408 WD40 repeat protein [F 87.7 9.3 0.0002 44.5 13.6 185 18-222 461-669 (1080)
192 KOG0269 WD40 repeat-containing 87.6 5.2 0.00011 47.0 11.8 148 17-184 134-296 (839)
193 KOG0264 Nucleosome remodeling 87.6 47 0.001 36.8 18.4 145 27-185 191-348 (422)
194 KOG1587 Cytoplasmic dynein int 87.5 10 0.00022 44.3 14.4 156 18-185 349-517 (555)
195 KOG2139 WD40 repeat protein [G 87.5 20 0.00043 38.8 15.0 124 97-220 122-262 (445)
196 KOG0296 Angio-associated migra 87.3 31 0.00067 37.3 16.3 118 74-193 63-187 (399)
197 KOG0268 Sof1-like rRNA process 87.0 4.2 9.1E-05 43.6 9.7 213 18-263 68-302 (433)
198 KOG0270 WD40 repeat-containing 86.8 6 0.00013 43.5 11.1 108 18-143 331-449 (463)
199 PF13512 TPR_18: Tetratricopep 86.7 1.7 3.8E-05 40.9 6.2 69 302-379 12-80 (142)
200 PF14783 BBS2_Mid: Ciliary BBS 86.5 21 0.00046 32.1 12.6 54 27-102 16-70 (111)
201 KOG1188 WD40 repeat protein [G 86.4 11 0.00023 40.4 12.3 145 26-187 84-245 (376)
202 KOG1408 WD40 repeat protein [F 86.3 15 0.00032 43.0 14.2 153 17-187 502-674 (1080)
203 KOG0307 Vesicle coat complex C 86.1 6.3 0.00014 48.3 11.8 160 16-195 116-295 (1049)
204 COG4946 Uncharacterized protei 84.9 49 0.0011 37.1 16.7 113 18-151 363-485 (668)
205 PF12895 Apc3: Anaphase-promot 84.8 2.6 5.6E-05 35.7 6.0 52 307-364 32-83 (84)
206 KOG2111 Uncharacterized conser 84.7 41 0.00088 35.9 15.4 151 15-185 93-257 (346)
207 KOG2034 Vacuolar sorting prote 84.6 3.3 7.1E-05 49.5 8.4 196 551-779 363-609 (911)
208 PF13525 YfiO: Outer membrane 83.6 2.8 6E-05 42.4 6.7 66 305-379 10-75 (203)
209 KOG0271 Notchless-like WD40 re 83.6 17 0.00037 39.5 12.4 127 27-175 337-471 (480)
210 KOG0263 Transcription initiati 83.0 54 0.0012 38.9 17.1 165 25-212 462-635 (707)
211 KOG0882 Cyclophilin-related pe 82.6 25 0.00055 39.0 13.4 165 13-194 95-315 (558)
212 PRK11138 outer membrane biogen 82.1 92 0.002 34.9 27.2 127 165-300 254-393 (394)
213 KOG4190 Uncharacterized conser 81.8 11 0.00024 42.4 10.6 159 18-190 737-912 (1034)
214 PF12234 Rav1p_C: RAVE protein 81.7 22 0.00047 42.2 13.6 108 77-184 31-156 (631)
215 KOG1920 IkappaB kinase complex 81.6 65 0.0014 40.4 17.7 183 17-226 69-276 (1265)
216 KOG0272 U4/U6 small nuclear ri 81.4 93 0.002 34.5 19.3 246 10-280 171-434 (459)
217 PF07719 TPR_2: Tetratricopept 81.3 3.6 7.8E-05 27.7 4.5 25 341-365 3-27 (34)
218 PF13414 TPR_11: TPR repeat; P 81.1 1.3 2.8E-05 35.8 2.5 57 304-365 7-64 (69)
219 KOG4283 Transcription-coupled 80.9 31 0.00066 36.3 12.7 140 73-215 41-207 (397)
220 PF13371 TPR_9: Tetratricopept 80.9 2.6 5.5E-05 34.4 4.3 51 309-364 4-54 (73)
221 KOG1897 Damage-specific DNA bi 80.6 75 0.0016 39.0 17.4 161 19-195 309-488 (1096)
222 KOG0265 U5 snRNP-specific prot 80.6 83 0.0018 33.4 15.9 137 26-184 102-246 (338)
223 KOG0313 Microtubule binding pr 80.4 95 0.0021 33.9 21.3 236 18-281 107-394 (423)
224 PRK15174 Vi polysaccharide exp 80.3 1.5E+02 0.0032 36.0 26.3 58 302-364 44-101 (656)
225 PF04841 Vps16_N: Vps16, N-ter 79.4 1.2E+02 0.0025 34.4 28.5 83 237-324 260-343 (410)
226 PF08662 eIF2A: Eukaryotic tra 79.1 73 0.0016 31.9 16.0 124 137-274 39-188 (194)
227 KOG0295 WD40 repeat-containing 79.1 20 0.00043 38.7 11.0 65 27-110 305-371 (406)
228 KOG1275 PAB-dependent poly(A) 79.1 16 0.00035 44.0 11.2 136 19-177 180-334 (1118)
229 KOG1240 Protein kinase contain 78.8 26 0.00057 43.7 13.2 152 21-185 1056-1226(1431)
230 KOG4497 Uncharacterized conser 78.5 28 0.00061 37.1 11.7 108 75-185 318-432 (447)
231 TIGR02795 tol_pal_ybgF tol-pal 78.2 5.8 0.00013 35.3 6.2 67 304-379 6-72 (119)
232 KOG3881 Uncharacterized conser 78.1 7.8 0.00017 42.1 7.8 76 17-110 248-327 (412)
233 PF13360 PQQ_2: PQQ-like domai 77.9 84 0.0018 31.9 19.4 169 127-302 37-231 (238)
234 KOG0553 TPR repeat-containing 77.6 4.7 0.0001 42.5 5.9 61 304-369 119-180 (304)
235 KOG0645 WD40 repeat protein [G 77.6 97 0.0021 32.5 18.3 148 76-224 15-180 (312)
236 KOG0302 Ribosome Assembly prot 77.2 8.3 0.00018 41.7 7.6 72 16-103 302-378 (440)
237 KOG0281 Beta-TrCP (transducin 77.0 34 0.00073 36.7 11.8 156 12-193 233-397 (499)
238 PRK10866 outer membrane biogen 76.9 6.1 0.00013 41.3 6.7 66 305-379 37-102 (243)
239 PF14762 HPS3_Mid: Hermansky-P 76.7 33 0.00072 37.8 12.3 165 211-376 96-322 (374)
240 KOG0303 Actin-binding protein 76.7 26 0.00056 38.3 11.1 115 74-191 80-210 (472)
241 PF12895 Apc3: Anaphase-promot 76.7 2.3 4.9E-05 36.0 2.9 57 312-371 1-57 (84)
242 PF13570 PQQ_3: PQQ-like domai 76.6 3.5 7.5E-05 29.5 3.3 24 20-43 15-38 (40)
243 KOG1034 Transcriptional repres 76.6 2.5 5.3E-05 44.8 3.5 66 21-101 314-379 (385)
244 KOG2695 WD40 repeat protein [G 76.3 17 0.00037 39.0 9.5 114 26-157 264-387 (425)
245 KOG1517 Guanine nucleotide bin 75.9 1.6E+02 0.0035 36.7 18.3 157 17-186 1110-1289(1387)
246 PF09976 TPR_21: Tetratricopep 75.9 7.8 0.00017 36.7 6.6 20 345-364 91-110 (145)
247 KOG0301 Phospholipase A2-activ 75.3 1.8E+02 0.0039 34.4 22.2 254 17-301 15-288 (745)
248 KOG2041 WD40 repeat protein [G 75.3 1.8E+02 0.004 34.5 31.3 156 511-689 828-989 (1189)
249 KOG0277 Peroxisomal targeting 75.2 67 0.0015 33.3 13.0 140 25-185 72-222 (311)
250 KOG1900 Nuclear pore complex, 75.2 6.5 0.00014 49.1 7.0 89 14-104 176-273 (1311)
251 PRK10803 tol-pal system protei 74.7 7.5 0.00016 41.0 6.7 66 305-379 147-213 (263)
252 KOG0270 WD40 repeat-containing 74.6 20 0.00043 39.7 9.7 156 25-186 191-362 (463)
253 KOG0300 WD40 repeat-containing 74.4 56 0.0012 34.6 12.5 111 17-146 315-431 (481)
254 KOG1840 Kinesin light chain [C 73.9 56 0.0012 37.9 13.9 55 311-365 336-393 (508)
255 PRK11788 tetratricopeptide rep 73.7 87 0.0019 34.6 15.5 56 304-364 111-166 (389)
256 KOG1538 Uncharacterized conser 73.4 20 0.00043 41.6 9.7 146 76-227 13-165 (1081)
257 KOG0639 Transducin-like enhanc 73.3 44 0.00095 37.7 12.0 152 17-187 419-584 (705)
258 KOG1538 Uncharacterized conser 73.1 94 0.002 36.4 14.8 74 541-630 640-715 (1081)
259 PF08553 VID27: VID27 cytoplas 73.0 9.4 0.0002 46.3 7.6 67 16-101 577-645 (794)
260 KOG0269 WD40 repeat-containing 72.7 21 0.00046 42.1 10.0 150 26-193 100-259 (839)
261 PF13360 PQQ_2: PQQ-like domai 72.6 1.1E+02 0.0025 30.9 19.3 176 19-223 28-229 (238)
262 KOG0973 Histone transcription 72.4 28 0.0006 42.6 11.2 139 6-166 119-274 (942)
263 PF11768 DUF3312: Protein of u 70.6 13 0.00029 42.6 7.6 46 305-353 413-458 (545)
264 KOG0302 Ribosome Assembly prot 70.3 50 0.0011 36.0 11.3 101 26-143 270-378 (440)
265 KOG1912 WD40 repeat protein [G 70.3 93 0.002 37.2 14.2 169 26-213 79-291 (1062)
266 TIGR02658 TTQ_MADH_Hv methylam 70.1 1.8E+02 0.0039 32.1 28.9 248 26-299 12-328 (352)
267 PF13424 TPR_12: Tetratricopep 70.0 4.9 0.00011 33.2 3.3 57 309-365 14-72 (78)
268 PF13429 TPR_15: Tetratricopep 69.6 25 0.00053 37.3 9.4 73 702-774 182-262 (280)
269 KOG0303 Actin-binding protein 69.3 1.9E+02 0.004 32.0 17.3 107 27-145 95-205 (472)
270 PF13174 TPR_6: Tetratricopept 69.2 7.8 0.00017 25.7 3.6 28 344-378 5-32 (33)
271 KOG4640 Anaphase-promoting com 68.8 18 0.00038 41.9 8.1 72 116-191 22-99 (665)
272 KOG0322 G-protein beta subunit 68.8 38 0.00081 35.2 9.6 140 18-168 152-304 (323)
273 KOG1897 Damage-specific DNA bi 68.4 51 0.0011 40.4 12.0 140 16-174 583-731 (1096)
274 KOG2919 Guanine nucleotide-bin 68.0 1.6E+02 0.0036 31.6 14.2 154 21-187 165-331 (406)
275 PRK04922 tolB translocation pr 67.5 2.2E+02 0.0049 32.3 21.6 142 80-227 252-414 (433)
276 KOG0771 Prolactin regulatory e 67.4 11 0.00024 41.3 5.9 61 17-95 282-344 (398)
277 KOG2063 Vacuolar assembly/sort 66.4 7.5 0.00016 47.5 4.9 69 667-756 463-532 (877)
278 KOG3881 Uncharacterized conser 66.4 2.1E+02 0.0046 31.5 15.4 148 18-186 107-279 (412)
279 COG4105 ComL DNA uptake lipopr 66.3 15 0.00033 38.1 6.4 69 302-379 36-104 (254)
280 PF00400 WD40: WD domain, G-be 66.0 16 0.00034 25.4 4.8 31 12-42 7-39 (39)
281 TIGR00540 hemY_coli hemY prote 65.8 1.6E+02 0.0035 33.2 15.5 177 549-753 190-395 (409)
282 KOG0308 Conserved WD40 repeat- 65.6 72 0.0016 37.2 12.0 148 26-186 37-203 (735)
283 PRK03629 tolB translocation pr 65.3 2.5E+02 0.0054 31.9 21.8 144 78-227 245-409 (429)
284 KOG1126 DNA-binding cell divis 64.9 25 0.00055 41.0 8.5 63 552-624 563-625 (638)
285 KOG1524 WD40 repeat-containing 64.5 2.7E+02 0.0058 32.1 19.8 209 27-263 117-348 (737)
286 KOG3616 Selective LIM binding 63.8 1.2E+02 0.0026 36.0 13.4 42 532-573 777-818 (1636)
287 KOG1963 WD40 repeat protein [G 63.8 62 0.0013 38.9 11.5 71 75-145 292-377 (792)
288 KOG3617 WD40 and TPR repeat-co 63.3 75 0.0016 38.3 11.8 74 306-379 805-898 (1416)
289 PF07569 Hira: TUP1-like enhan 63.2 27 0.00059 35.8 7.7 30 16-45 12-41 (219)
290 PRK01742 tolB translocation pr 62.7 2.7E+02 0.0059 31.5 20.1 128 78-213 250-388 (429)
291 COG4946 Uncharacterized protei 62.2 2.8E+02 0.006 31.5 16.0 112 78-192 364-485 (668)
292 KOG2315 Predicted translation 62.0 3E+02 0.0065 31.8 16.8 124 97-228 253-394 (566)
293 PF00515 TPR_1: Tetratricopept 62.0 6.9 0.00015 26.5 2.2 25 341-365 3-27 (34)
294 PF01535 PPR: PPR repeat; Int 62.0 11 0.00025 24.5 3.3 26 549-574 3-28 (31)
295 KOG0547 Translocase of outer m 62.0 91 0.002 35.4 11.7 61 307-379 401-461 (606)
296 PRK15359 type III secretion sy 61.1 9.3 0.0002 36.2 3.7 58 303-365 27-84 (144)
297 COG3063 PilF Tfp pilus assembl 60.7 7.1 0.00015 39.7 2.8 31 339-369 103-133 (250)
298 PF13176 TPR_7: Tetratricopept 60.6 12 0.00027 25.9 3.3 24 549-572 2-25 (36)
299 cd00189 TPR Tetratricopeptide 60.1 15 0.00033 29.8 4.6 56 305-365 5-60 (100)
300 KOG0645 WD40 repeat protein [G 59.6 2.3E+02 0.0051 29.8 21.9 154 16-184 14-180 (312)
301 TIGR03302 OM_YfiO outer membra 59.2 23 0.00049 36.3 6.6 68 303-379 36-103 (235)
302 smart00564 PQQ beta-propeller 59.1 17 0.00036 24.4 3.7 21 24-44 4-24 (33)
303 KOG1063 RNA polymerase II elon 57.8 1E+02 0.0022 36.3 11.5 141 16-177 54-214 (764)
304 PF14779 BBS1: Ciliary BBSome 57.5 41 0.0009 35.1 7.8 26 17-42 177-211 (257)
305 KOG0313 Microtubule binding pr 56.8 1.1E+02 0.0023 33.5 10.8 111 16-143 300-418 (423)
306 PF00400 WD40: WD domain, G-be 56.8 45 0.00097 22.9 5.8 34 67-101 4-39 (39)
307 PF10395 Utp8: Utp8 family; I 56.8 4.2E+02 0.0091 31.8 22.8 213 88-301 41-305 (670)
308 KOG0264 Nucleosome remodeling 56.6 1.1E+02 0.0023 34.1 11.0 110 16-143 272-404 (422)
309 PF13181 TPR_8: Tetratricopept 56.3 18 0.00039 24.3 3.5 25 341-365 3-27 (34)
310 PF13428 TPR_14: Tetratricopep 56.1 25 0.00054 25.5 4.5 32 341-379 3-34 (44)
311 KOG0301 Phospholipase A2-activ 55.7 1.4E+02 0.0031 35.2 12.2 157 16-184 101-288 (745)
312 PRK11788 tetratricopeptide rep 55.4 2.9E+02 0.0063 30.4 15.2 181 549-772 144-327 (389)
313 PF09976 TPR_21: Tetratricopep 55.2 26 0.00056 33.0 5.7 57 306-364 17-73 (145)
314 KOG2394 WD40 protein DMR-N9 [G 54.9 1.6E+02 0.0035 33.8 12.1 127 17-144 220-363 (636)
315 KOG0281 Beta-TrCP (transducin 54.7 1.3E+02 0.0029 32.4 10.9 210 18-263 199-428 (499)
316 KOG1832 HIV-1 Vpr-binding prot 54.0 20 0.00042 43.1 5.2 107 16-144 1101-1215(1516)
317 PF12854 PPR_1: PPR repeat 53.6 19 0.00042 24.7 3.3 23 549-571 10-32 (34)
318 PF13414 TPR_11: TPR repeat; P 53.5 8.4 0.00018 30.9 1.7 34 340-373 4-38 (69)
319 PF10395 Utp8: Utp8 family; I 52.0 5E+02 0.011 31.2 19.2 201 17-225 72-306 (670)
320 KOG1445 Tumor-specific antigen 51.8 4E+02 0.0087 31.4 14.7 127 97-223 605-749 (1012)
321 KOG0290 Conserved WD40 repeat- 51.5 3.3E+02 0.0071 29.0 14.2 75 16-106 150-230 (364)
322 KOG2395 Protein involved in va 51.4 36 0.00078 38.8 6.5 65 17-100 431-497 (644)
323 PF00780 CNH: CNH domain; Int 51.3 3.1E+02 0.0066 28.6 16.2 136 125-263 6-165 (275)
324 TIGR00756 PPR pentatricopeptid 51.2 26 0.00057 23.1 3.7 27 549-575 3-29 (35)
325 KOG0321 WD40 repeat-containing 50.8 1E+02 0.0022 36.1 9.9 118 21-143 151-301 (720)
326 PF06977 SdiA-regulated: SdiA- 50.4 3.2E+02 0.0069 28.6 14.8 153 18-181 66-247 (248)
327 TIGR02795 tol_pal_ybgF tol-pal 49.7 35 0.00076 30.1 5.4 65 306-379 45-109 (119)
328 PLN03088 SGT1, suppressor of 49.2 27 0.00058 38.7 5.4 21 305-325 7-27 (356)
329 PF14561 TPR_20: Tetratricopep 49.2 80 0.0017 27.3 7.2 62 549-617 25-86 (90)
330 PF12894 Apc4_WD40: Anaphase-p 49.2 48 0.001 24.9 5.0 36 148-183 2-40 (47)
331 KOG0300 WD40 repeat-containing 49.1 1.9E+02 0.004 30.9 10.8 78 9-105 350-430 (481)
332 COG5276 Uncharacterized conser 49.1 3.6E+02 0.0079 28.8 14.8 153 21-193 176-336 (370)
333 KOG1587 Cytoplasmic dynein int 49.0 5.1E+02 0.011 30.5 17.1 77 16-105 242-325 (555)
334 PF03002 Somatostatin: Somatos 48.8 7.7 0.00017 22.4 0.5 13 804-816 3-15 (18)
335 PRK01742 tolB translocation pr 48.5 4.5E+02 0.0098 29.7 23.9 143 75-224 203-361 (429)
336 KOG2076 RNA polymerase III tra 48.4 2.2E+02 0.0048 34.9 12.6 79 646-753 429-508 (895)
337 PF08311 Mad3_BUB1_I: Mad3/BUB 48.3 1.3E+02 0.0027 27.9 8.9 102 604-753 20-124 (126)
338 KOG1272 WD40-repeat-containing 48.2 83 0.0018 35.3 8.5 95 76-174 252-352 (545)
339 KOG1920 IkappaB kinase complex 48.2 1.6E+02 0.0035 37.2 11.7 38 539-576 945-982 (1265)
340 PF14781 BBS2_N: Ciliary BBSom 47.1 2.5E+02 0.0054 26.4 12.3 61 126-186 10-83 (136)
341 KOG4547 WD40 repeat-containing 46.6 2.5E+02 0.0054 32.5 12.2 119 27-166 71-193 (541)
342 PF13174 TPR_6: Tetratricopept 46.4 32 0.0007 22.6 3.5 24 551-574 5-28 (33)
343 TIGR02552 LcrH_SycD type III s 46.1 21 0.00046 32.7 3.5 55 306-365 23-77 (135)
344 PRK10049 pgaA outer membrane p 46.1 6.6E+02 0.014 31.0 23.1 55 305-364 20-74 (765)
345 PF13041 PPR_2: PPR repeat fam 46.0 32 0.00069 25.7 3.8 28 549-576 6-33 (50)
346 KOG4283 Transcription-coupled 46.0 2.9E+02 0.0063 29.4 11.6 142 17-167 44-198 (397)
347 KOG4532 WD40-like repeat conta 45.9 88 0.0019 32.7 7.8 69 24-106 213-285 (344)
348 smart00777 Mad3_BUB1_I Mad3/BU 45.7 83 0.0018 29.1 7.1 99 602-751 18-122 (125)
349 PF13374 TPR_10: Tetratricopep 45.6 33 0.00072 23.9 3.7 25 549-573 5-29 (42)
350 PF13181 TPR_8: Tetratricopept 45.4 40 0.00087 22.5 3.9 26 549-574 4-29 (34)
351 PF12234 Rav1p_C: RAVE protein 45.1 2.6E+02 0.0057 33.4 12.7 107 115-223 30-155 (631)
352 PF13812 PPR_3: Pentatricopept 44.0 43 0.00093 22.2 3.9 27 548-574 3-29 (34)
353 PRK15174 Vi polysaccharide exp 43.9 4.3E+02 0.0094 31.9 15.0 53 309-365 186-238 (656)
354 PLN03088 SGT1, suppressor of 43.7 32 0.0007 38.1 4.9 54 307-365 43-96 (356)
355 PF14781 BBS2_N: Ciliary BBSom 43.6 2.8E+02 0.0061 26.0 11.7 113 24-148 8-130 (136)
356 KOG4532 WD40-like repeat conta 43.6 3.4E+02 0.0074 28.6 11.5 69 117-185 161-234 (344)
357 KOG0322 G-protein beta subunit 43.5 56 0.0012 34.0 6.0 66 76-141 252-321 (323)
358 KOG4328 WD40 protein [Function 43.5 2.2E+02 0.0049 32.0 10.9 113 18-145 324-452 (498)
359 PF04841 Vps16_N: Vps16, N-ter 43.3 5.3E+02 0.012 29.1 23.6 70 157-226 216-289 (410)
360 PF04762 IKI3: IKI3 family; I 42.8 8.1E+02 0.018 31.1 28.1 110 17-139 24-146 (928)
361 smart00320 WD40 WD40 repeats. 42.7 45 0.00098 21.2 3.9 27 16-42 12-40 (40)
362 PF13176 TPR_7: Tetratricopept 42.7 35 0.00075 23.6 3.2 22 343-364 3-24 (36)
363 PF12688 TPR_5: Tetratrico pep 42.4 47 0.001 30.5 4.9 54 309-364 47-100 (120)
364 PF06433 Me-amine-dh_H: Methyl 42.3 1.1E+02 0.0023 33.5 8.3 117 177-297 18-161 (342)
365 PF07719 TPR_2: Tetratricopept 42.2 46 0.00099 22.0 3.8 25 549-573 4-28 (34)
366 PF08309 LVIVD: LVIVD repeat; 42.1 93 0.002 22.7 5.4 31 17-47 2-32 (42)
367 KOG1963 WD40 repeat protein [G 41.0 5.8E+02 0.012 31.1 14.5 109 19-143 208-322 (792)
368 KOG1034 Transcriptional repres 40.8 1.6E+02 0.0035 31.7 9.0 106 19-143 93-211 (385)
369 PF07494 Reg_prop: Two compone 40.8 43 0.00092 21.1 3.1 19 17-35 5-24 (24)
370 TIGR03300 assembly_YfgL outer 40.4 5.4E+02 0.012 28.3 26.5 126 166-300 189-337 (377)
371 KOG3630 Nuclear pore complex, 40.2 1.1E+02 0.0025 38.2 8.7 83 86-172 168-258 (1405)
372 KOG0612 Rho-associated, coiled 39.9 2.1 4.6E-05 52.6 -5.3 181 84-282 1092-1295(1317)
373 KOG3617 WD40 and TPR repeat-co 39.0 60 0.0013 39.0 6.1 72 304-375 1097-1181(1416)
374 KOG0290 Conserved WD40 repeat- 38.7 5.2E+02 0.011 27.6 15.4 113 75-187 96-230 (364)
375 KOG4340 Uncharacterized conser 38.7 40 0.00088 35.6 4.2 70 298-379 8-77 (459)
376 KOG0277 Peroxisomal targeting 38.5 4E+02 0.0086 27.9 11.0 68 17-103 105-178 (311)
377 PF12854 PPR_1: PPR repeat 38.2 41 0.0009 23.0 3.0 26 301-326 8-33 (34)
378 PRK10803 tol-pal system protei 38.0 68 0.0015 33.9 6.0 64 307-379 187-250 (263)
379 KOG4227 WD40 repeat protein [G 37.8 2.5E+02 0.0054 30.9 10.0 110 17-143 106-225 (609)
380 PF13428 TPR_14: Tetratricopep 37.8 52 0.0011 23.8 3.7 26 549-574 4-29 (44)
381 KOG2079 Vacuolar assembly/sort 37.6 6.7E+02 0.014 31.7 14.5 38 537-574 792-829 (1206)
382 KOG3630 Nuclear pore complex, 37.6 3.3E+02 0.0071 34.5 11.9 140 75-216 100-263 (1405)
383 PRK15359 type III secretion sy 37.5 37 0.00079 32.1 3.6 52 308-364 66-117 (144)
384 PF08450 SGL: SMP-30/Gluconola 37.4 4.7E+02 0.01 26.7 20.1 191 27-245 12-233 (246)
385 PRK09782 bacteriophage N4 rece 37.2 4.9E+02 0.011 33.2 14.3 188 550-773 82-296 (987)
386 PF01011 PQQ: PQQ enzyme repea 35.8 40 0.00086 23.7 2.7 19 27-45 1-19 (38)
387 KOG4499 Ca2+-binding protein R 35.3 3.5E+02 0.0075 28.0 10.0 111 136-280 138-257 (310)
388 TIGR02800 propeller_TolB tol-p 35.2 6.7E+02 0.014 27.9 21.5 193 26-244 201-416 (417)
389 PRK02603 photosystem I assembl 35.1 92 0.002 30.2 6.2 54 309-364 44-97 (172)
390 KOG2394 WD40 protein DMR-N9 [G 35.1 6E+02 0.013 29.4 12.7 193 27-254 186-391 (636)
391 PRK05137 tolB translocation pr 35.0 7.1E+02 0.015 28.1 22.9 134 75-212 201-349 (435)
392 PF14156 AbbA_antirepres: Anti 34.4 52 0.0011 25.8 3.1 39 732-770 12-57 (63)
393 PF10607 CLTH: CTLH/CRA C-term 34.1 79 0.0017 29.7 5.3 58 305-364 6-65 (145)
394 PF12569 NARP1: NMDA receptor- 33.9 8.3E+02 0.018 28.6 24.6 62 306-379 10-71 (517)
395 KOG3621 WD40 repeat-containing 33.9 7.5E+02 0.016 29.6 13.7 30 16-45 124-155 (726)
396 PF13424 TPR_12: Tetratricopep 33.5 56 0.0012 26.7 3.7 27 339-365 5-31 (78)
397 KOG0321 WD40 repeat-containing 33.5 96 0.0021 36.2 6.5 63 26-104 64-131 (720)
398 PF08728 CRT10: CRT10; InterP 33.4 9.6E+02 0.021 29.2 17.0 148 15-166 39-219 (717)
399 PF07035 Mic1: Colon cancer-as 33.4 2.2E+02 0.0048 27.8 8.2 66 507-572 31-115 (167)
400 PF15390 DUF4613: Domain of un 33.1 2.9E+02 0.0062 32.4 10.0 57 117-175 158-221 (671)
401 smart00028 TPR Tetratricopepti 32.9 52 0.0011 20.2 2.8 25 341-365 3-27 (34)
402 TIGR02552 LcrH_SycD type III s 32.7 70 0.0015 29.2 4.7 54 306-364 57-110 (135)
403 KOG4547 WD40 repeat-containing 32.5 8.6E+02 0.019 28.3 19.5 166 88-255 70-260 (541)
404 PF13429 TPR_15: Tetratricopep 32.3 1.2E+02 0.0027 31.9 7.1 28 547-574 147-174 (280)
405 KOG2168 Cullins [Cell cycle co 32.0 5.6E+02 0.012 31.5 12.6 208 560-800 482-720 (835)
406 PF10433 MMS1_N: Mono-function 31.2 8.8E+02 0.019 28.1 20.7 216 26-263 222-476 (504)
407 PF09295 ChAPs: ChAPs (Chs5p-A 30.7 92 0.002 35.0 5.8 57 303-364 237-293 (395)
408 KOG3611 Semaphorins [Signal tr 30.6 2.1E+02 0.0046 34.9 9.2 80 10-101 400-490 (737)
409 KOG1070 rRNA processing protei 30.4 1E+02 0.0022 39.5 6.4 72 547-644 1531-1605(1710)
410 COG1729 Uncharacterized protei 30.4 91 0.002 32.7 5.3 60 303-364 144-203 (262)
411 KOG3380 Actin-related protein 30.3 1.1E+02 0.0024 29.0 5.2 66 295-365 30-101 (152)
412 KOG4640 Anaphase-promoting com 30.1 2.1E+02 0.0045 33.6 8.4 89 76-166 21-114 (665)
413 PRK02889 tolB translocation pr 29.7 8.6E+02 0.019 27.5 22.4 144 78-227 242-406 (427)
414 KOG1275 PAB-dependent poly(A) 29.4 3.2E+02 0.007 33.6 10.0 132 25-182 146-294 (1118)
415 COG3292 Predicted periplasmic 29.3 6.4E+02 0.014 29.6 11.9 147 155-304 162-320 (671)
416 KOG4497 Uncharacterized conser 29.3 1.8E+02 0.004 31.3 7.2 134 28-181 63-205 (447)
417 PF13432 TPR_16: Tetratricopep 29.3 48 0.001 26.0 2.5 20 345-364 3-22 (65)
418 KOG1332 Vesicle coat complex C 29.2 6.8E+02 0.015 26.1 11.6 144 17-176 103-277 (299)
419 PF03704 BTAD: Bacterial trans 28.9 90 0.0019 29.2 4.8 56 305-365 67-122 (146)
420 TIGR02521 type_IV_pilW type IV 28.8 1E+02 0.0022 30.4 5.4 54 307-365 38-91 (234)
421 cd00189 TPR Tetratricopeptide 28.8 77 0.0017 25.3 3.9 56 305-365 39-94 (100)
422 KOG2178 Predicted sugar kinase 28.6 3.7E+02 0.0081 29.9 9.7 39 167-205 286-335 (409)
423 KOG3785 Uncharacterized conser 28.5 1.1E+02 0.0024 33.3 5.6 66 292-364 17-82 (557)
424 PF07721 TPR_4: Tetratricopept 28.2 71 0.0015 20.3 2.7 21 550-570 5-25 (26)
425 KOG4649 PQQ (pyrrolo-quinoline 28.1 7.3E+02 0.016 26.1 19.8 29 16-44 52-81 (354)
426 CHL00033 ycf3 photosystem I as 27.8 1.1E+02 0.0024 29.4 5.4 54 310-365 45-98 (168)
427 KOG1896 mRNA cleavage and poly 27.8 1.9E+02 0.0042 36.6 8.0 105 112-221 1094-1200(1366)
428 KOG2280 Vacuolar assembly/sort 27.6 1.1E+03 0.023 28.8 13.6 63 117-182 219-285 (829)
429 KOG1898 Splicing factor 3b, su 27.3 4.3E+02 0.0094 33.1 10.7 90 85-175 943-1038(1205)
430 KOG2280 Vacuolar assembly/sort 27.0 1.2E+03 0.026 28.3 14.9 106 118-228 36-159 (829)
431 KOG3616 Selective LIM binding 27.0 4.4E+02 0.0095 31.7 10.2 60 512-571 622-686 (1636)
432 KOG4714 Nucleoporin [Nuclear s 26.7 47 0.001 34.5 2.4 65 19-101 182-252 (319)
433 PRK10049 pgaA outer membrane p 26.6 1.3E+03 0.028 28.5 28.7 166 552-755 278-454 (765)
434 PF07720 TPR_3: Tetratricopept 26.6 71 0.0015 22.4 2.6 19 345-363 7-25 (36)
435 KOG1840 Kinesin light chain [C 26.2 1.2E+02 0.0026 35.2 5.9 59 307-365 206-267 (508)
436 COG3071 HemY Uncharacterized e 26.1 9.5E+02 0.021 26.8 15.5 193 531-755 168-388 (400)
437 PF01403 Sema: Sema domain; I 26.1 1.6E+02 0.0035 33.5 7.0 71 5-86 350-432 (433)
438 PF14559 TPR_19: Tetratricopep 26.0 76 0.0016 24.9 3.2 28 549-576 28-55 (68)
439 PF12816 Vps8: Golgi CORVET co 25.9 1.5E+02 0.0032 29.9 5.8 119 663-802 18-156 (196)
440 KOG1523 Actin-related protein 25.9 4.5E+02 0.0097 28.4 9.3 101 115-216 11-121 (361)
441 PF14655 RAB3GAP2_N: Rab3 GTPa 25.7 5.6E+02 0.012 29.1 10.9 99 120-222 7-144 (415)
442 TIGR03302 OM_YfiO outer membra 25.6 6.9E+02 0.015 25.1 16.5 57 306-364 76-140 (235)
443 PF10433 MMS1_N: Mono-function 25.4 1.1E+03 0.024 27.3 21.3 155 20-192 259-443 (504)
444 TIGR00990 3a0801s09 mitochondr 25.4 1.4E+02 0.0031 35.6 6.7 61 305-370 370-431 (615)
445 COG5159 RPN6 26S proteasome re 25.3 6.6E+02 0.014 26.8 10.2 74 305-379 8-97 (421)
446 smart00668 CTLH C-terminal to 25.2 1.3E+02 0.0029 22.9 4.3 51 304-354 5-55 (58)
447 PF13512 TPR_18: Tetratricopep 25.1 1.5E+02 0.0032 28.1 5.2 69 307-379 54-132 (142)
448 KOG1129 TPR repeat-containing 25.1 7.5E+02 0.016 27.0 10.8 140 511-682 229-375 (478)
449 PRK11189 lipoprotein NlpI; Pro 25.0 1.2E+02 0.0027 32.4 5.5 28 548-575 238-265 (296)
450 PF08596 Lgl_C: Lethal giant l 24.9 88 0.0019 35.2 4.4 42 4-45 74-116 (395)
451 PF14727 PHTB1_N: PTHB1 N-term 24.8 2.6E+02 0.0056 31.7 8.1 28 15-42 279-316 (418)
452 KOG1896 mRNA cleavage and poly 24.7 1.6E+03 0.035 29.0 16.5 59 204-264 593-655 (1366)
453 KOG2659 LisH motif-containing 24.4 1.5E+02 0.0032 30.4 5.4 62 301-364 65-128 (228)
454 PF08450 SGL: SMP-30/Gluconola 23.6 7.9E+02 0.017 25.0 20.5 168 19-205 43-232 (246)
455 PRK10370 formate-dependent nit 23.5 1.1E+02 0.0023 30.8 4.4 24 341-364 146-169 (198)
456 PF08309 LVIVD: LVIVD repeat; 22.9 2.3E+02 0.005 20.7 4.8 29 158-186 2-31 (42)
457 PF10647 Gmad1: Lipoprotein Lp 22.9 8.7E+02 0.019 25.2 14.1 106 116-222 113-234 (253)
458 KOG1174 Anaphase-promoting com 22.9 1.2E+02 0.0026 33.8 4.7 56 308-369 446-502 (564)
459 CHL00033 ycf3 photosystem I as 22.8 1.6E+02 0.0035 28.3 5.4 71 308-378 80-152 (168)
460 PRK00178 tolB translocation pr 22.7 1.1E+03 0.024 26.4 21.6 144 78-227 245-409 (430)
461 TIGR00990 3a0801s09 mitochondr 22.2 1.2E+02 0.0026 36.4 5.2 56 309-369 340-396 (615)
462 PF10516 SHNi-TPR: SHNi-TPR; 22.1 1.1E+02 0.0025 21.7 3.0 26 340-365 2-27 (38)
463 KOG1310 WD40 repeat protein [G 21.7 3E+02 0.0065 31.8 7.5 72 17-106 51-128 (758)
464 PLN02919 haloacid dehalogenase 21.6 1.8E+03 0.039 28.5 23.9 241 21-281 630-952 (1057)
465 COG2976 Uncharacterized protei 21.3 1.7E+02 0.0037 29.4 5.0 22 342-363 129-150 (207)
466 PRK14574 hmsH outer membrane p 21.3 9.1E+02 0.02 30.1 12.5 56 305-364 297-352 (822)
467 KOG0267 Microtubule severing p 21.1 3.7E+02 0.0081 32.2 8.3 153 19-175 73-256 (825)
468 PF02064 MAS20: MAS20 protein 20.5 1.4E+02 0.003 27.5 4.0 38 343-380 67-107 (121)
469 KOG2005 26S proteasome regulat 20.5 2.9E+02 0.0062 32.8 7.2 74 606-680 178-252 (878)
470 PRK14574 hmsH outer membrane p 20.3 1.2E+03 0.026 29.1 13.2 174 550-753 38-228 (822)
471 KOG1063 RNA polymerase II elon 20.3 8.5E+02 0.018 29.1 10.9 110 73-183 570-698 (764)
No 1
>KOG2063 consensus Vacuolar assembly/sorting proteins VPS39/VAM6/VPS3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=2.1e-87 Score=772.29 Aligned_cols=701 Identities=36% Similarity=0.566 Sum_probs=568.8
Q ss_pred CCCcccccccccCCCCcEE-EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeE
Q 003405 2 VHNAFDSLELISNCSPKID-AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILS 80 (823)
Q Consensus 2 ~~~af~~~~l~~~~~~~I~-ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~q 80 (823)
||.||.++++...+|..|+ |+++||+.||+||.+|.|++|.+.+...+.++. .-..++.+ .+.++|+||++
T Consensus 1 m~~a~~~~~i~~~~~~~vd~~va~~~~~l~vGt~~G~L~lY~i~~~~~~~~~~-----~~~~~~~~---~~~~~kk~i~~ 72 (877)
T KOG2063|consen 1 MHKAFTLVEILERLPLEVDLCVAAYGNHLYVGTRDGDLYLYSIYERGNPESVE-----LVTETVKF---EKEFSKKPINK 72 (877)
T ss_pred CCcccchhhHhhhcCCccchHHHHhCCEEEEEcCCCcEEEEeccccccccchh-----hhcchhHH---hhhhccchhHH
Confidence 8999999999999999999 999999999999999999999988765543210 00001111 23467899999
Q ss_pred EEEecccCceeeEeCc-EEEEeCCCCcccccccCCCCcEEEEeeCCC---c----eEEEEEcCeEEEEEEcCCCceeEee
Q 003405 81 MEVLASRQLLLSLSES-IAFHRLPNLETIAVLTKAKGANVYSWDDRR---G----FLCFARQKRVCIFRHDGGRGFVEVK 152 (823)
Q Consensus 81 I~~~~~~~~Ll~l~d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~---~----~l~V~~kkki~l~~~~~~~~f~~~k 152 (823)
|.+++..+++++++|+ +.+|.++++++.+...+.||++.|+.+..+ + .+|+..++++..|.|.++..+...+
T Consensus 73 l~~~~~~~~ll~l~dsqi~~~~l~~~~~~~~~~~~Kg~~~f~~~~~~~s~~~~~~~i~~~~~k~~~~~~~~~~~~~~~~~ 152 (877)
T KOG2063|consen 73 LLVCASLELLLILSDSQIAVHKLPELEPVPSGTRLKGASLFTIDLRPISTGPSVYEICLSVRKRLIRFFWNGRDGIVLVK 152 (877)
T ss_pred HhhcchhcchheecCCcceeeecCcccccccccccccceeeccccccccCCcceEEEEeeccceEEEEEecCCCceEEEE
Confidence 9999999999999995 999999999998888999999999997654 4 5888889999999999755677888
Q ss_pred eecCCCCceEEEecCCeEEEEEcCceEEEEcCC-CCeeeccCCC---CCCCCEEEEccCC-eEEEEeCCeEEEEcCCCcc
Q 003405 153 DFGVPDTVKSMSWCGENICIAIRKGYMILNATN-GALSEVFPSG---RIGPPLVVSLLSG-ELLLGKENIGVFVDQNGKL 227 (823)
Q Consensus 153 ei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~-~~~~~L~~~~---~~~~p~i~~~~~~-EfLL~~~~~gvfv~~~G~~ 227 (823)
++.+|+.|.+++|.|..+|+|..+.|++++..+ |....+++.+ .+..|+|+++.++ ++++|+|+.|+|||.+|..
T Consensus 153 ~~~~~~~p~~~~~~~~~~c~~~~~~~~ii~~~~~~~~~~~~~s~~~~~s~~P~I~~l~~~~~ll~~kd~~gv~vd~~G~~ 232 (877)
T KOG2063|consen 153 ELGFPDVPKARAWCGHIVCLGLKKSYYIINNTSKGVGPNLFPSSMDNESRKPLIKSLSDQSELLLGKDNIGVVVDLNGII 232 (877)
T ss_pred ecccccchhhhcccceeEEEeecceeEEEecCCCccccceeeeccccccCCCeEEEecCCceEEEccCceEEEEecCCcc
Confidence 999999999999999999999998888888764 4455566665 4568999999998 8899999999999999998
Q ss_pred ccCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEE-eeCCcccccccCCe-EEEeccceEEEeeccC-hh
Q 003405 228 LQADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTI-VLQNVRHLIPSSNA-VVVALENSIFGLFPVP-LG 304 (823)
Q Consensus 228 ~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i-~l~~~~~l~~~~~~-v~v~s~~~I~~l~~~~-~~ 304 (823)
..++++.|+..|.++++..||++|+.++.+|||+.. ++.++|+| +++.++.+++++++ +|+++-+.+|++.|+| +.
T Consensus 233 ~~~~~l~ws~~P~~v~~~~PYlIa~~~~~veI~s~~-~~qlvQSI~~~~~~~~l~s~~~~i~~~~~~s~v~~L~p~~~~~ 311 (877)
T KOG2063|consen 233 AQRGTLVWSEVPLSVVVESPYLIALLDRSVEIRSKL-DGQLVQSITPLSNGRSLLSAHNGIIFVASLSNVWILVPVSNFE 311 (877)
T ss_pred cCCCceEecccchhhcccCceEEEEccccEEEEecc-CHHHhhccccccccceeeecCCcEEEEEeccceEEEEeccchH
Confidence 668999999999999999999999999999999997 68999999 99998888766555 5555669999999999 99
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHH-HHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCCCCCC
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRF-AHYLFDTGSYEEAMEHFLASQVDITYALSLYPSIVLPK 383 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~-a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l~~~~ 383 (823)
.||++|++.++|++|++|++..... .+.+...+..++.++ |+.+|.+++|++||.+|.++.+||+.||+|||++++..
T Consensus 312 ~qi~~lL~~k~fe~ai~L~e~~~~~-~p~~~~~i~~~~~l~~a~~lf~q~~f~ea~~~F~~~~~d~~~vi~lfP~l~p~~ 390 (877)
T KOG2063|consen 312 KQIQDLLQEKSFEEAISLAEILDSP-NPKEKRQISCIKILIDAFELFLQKQFEEAMSLFEKSEIDPRHVISLFPDLLPSE 390 (877)
T ss_pred HHHHHHHHhhhHHHHHHHHhccCCC-ChHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhccChHHHHHhchhhcCCc
Confidence 9999999999999999999987422 222223455667777 89999999999999999999999999999999997443
Q ss_pred CcCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhccCC
Q 003405 384 TTVVPEPERLLDISSDAPSLSRGSSGMSDDMESSPPAQLSELDENATLKSKKMSHNTLMALIKFLQKKRSSIIEKATAEG 463 (823)
Q Consensus 384 ~~~~~~~~~~~~~~~~~~~l~~~~~~v~~~~~~~~p~~l~~~d~~~~le~~~~~~~a~~~L~~yL~~~R~~~~~~~~~~~ 463 (823)
.... .+...+ | .+.+.+...+ .. .|.-+++.||++.|++..++.+...
T Consensus 391 ~~~~----~~~~~v---p---------------~~~~~~~~~~--------~v--~a~l~~~~ylt~~r~~~~~~l~~~~ 438 (877)
T KOG2063|consen 391 NSSI----EFTGVV---P---------------IRAPELRGGD--------LV--PAVLALIVYLTQSRREENKKLNKYK 438 (877)
T ss_pred cccc----ceeeec---c---------------CchhhhccCc--------cc--chhhhhhhHhHHHHHHHHHHHHHhh
Confidence 3110 110000 0 0000111111 11 3555899999988887655433211
Q ss_pred chhHhhhcccCCCcCCCccccccCCCCCCCCCccccHHHHHHHHHHHHHHHHHhcCChhhHHhhhcCCC-cccHHHHHHH
Q 003405 464 TEEVVLDAVGDNFTSHDSTRFKKSSKGRGTIPMYSGAREMAAILDTALLQALLLTGQSSAALELLKGLN-YCDVKICEEI 542 (823)
Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vDT~Ll~~y~~~~~~~~l~~ll~~~n-~c~~~~~~~~ 542 (823)
-. .. ..+.. ...+....+....++++|||+|||||+++++. ...++++.+| +|++++++.+
T Consensus 439 m~-----~~-~~~~~-----------~~~s~~~~~~~~~~~~~IDttLlk~Yl~~n~~-~v~~llrlen~~c~vee~e~~ 500 (877)
T KOG2063|consen 439 ML-----YM-NYFKN-----------TLISELLKSDLNDILELIDTTLLKCYLETNPG-LVGPLLRLENNHCDVEEIETV 500 (877)
T ss_pred hh-----HH-hhhhc-----------cCcchhhccchHHHHHHHHHHHHHHHHhcCch-hhhhhhhccCCCcchHHHHHH
Confidence 00 00 00000 01112223346678999999999999999864 6678888876 9999999999
Q ss_pred HHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhccc-CCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCc
Q 003405 543 LQKKNHYTALLELYKSNARHREALKLLHELVEESK-SNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCP 621 (823)
Q Consensus 543 L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~-~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p 621 (823)
|+++++|.+|+.||+.+|+|++||++|++++++.. .| .....+++.+++||++++.++.+|||+|+.|+++.+|
T Consensus 501 L~k~~~y~~Li~LY~~kg~h~~AL~ll~~l~d~~~~~d-----~~~~~~~e~ii~YL~~l~~~~~~Li~~y~~wvl~~~p 575 (877)
T KOG2063|consen 501 LKKSKKYRELIELYATKGMHEKALQLLRDLVDEDSDTD-----SFQLDGLEKIIEYLKKLGAENLDLILEYADWVLNKNP 575 (877)
T ss_pred HHhcccHHHHHHHHHhccchHHHHHHHHHHhccccccc-----cchhhhHHHHHHHHHHhcccchhHHHHHhhhhhccCc
Confidence 99999999999999999999999999999998763 32 2234567889999999999999999999999999999
Q ss_pred ccccccccc------CCCChHHHHHHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccC
Q 003405 622 TQTIELFLS------GNIPADLVNSYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWD 695 (823)
Q Consensus 622 ~~~~~if~~------~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~ 695 (823)
+.|++||++ +++++++|+.||.+..|.+++.|||+++.++ ...+..+||.|+.+|++.+.+. .. .+++.
T Consensus 576 ~~gi~Ift~~~~~~~~sis~~~Vl~~l~~~~~~l~I~YLE~li~~~-~~~~~~lht~ll~ly~e~v~~~-~~--~~~kg- 650 (877)
T KOG2063|consen 576 EAGIQIFTSEDKQEAESISRDDVLNYLKSKEPKLLIPYLEHLISDN-RLTSTLLHTVLLKLYLEKVLEQ-AS--TDGKG- 650 (877)
T ss_pred hhheeeeeccChhhhccCCHHHHHHHhhhhCcchhHHHHHHHhHhc-cccchHHHHHHHHHHHHHHhhc-cC--chhcc-
Confidence 999999999 3699999999999999999999999999875 4568899999999999998741 11 01111
Q ss_pred cccchH--HHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhCCCch-------------
Q 003405 696 EKAYSP--TRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVFLINQ------------- 760 (823)
Q Consensus 696 ~~~~~~--~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~~~------------- 760 (823)
++..+ .|+||+.||+.|+.|+++.+|..++.+.|++|+++++||||+|++||+||++.|+|++.
T Consensus 651 -~e~~E~~~rekl~~~l~~s~~Y~p~~~L~~~~~~~l~ee~aill~rl~khe~aL~Iyv~~L~d~~~A~~Yc~~~y~~~~ 729 (877)
T KOG2063|consen 651 -EEAPETTVREKLLDFLESSDLYDPQLLLERLNGDELYEERAILLGRLGKHEEALHIYVHELDDIDAAESYCLPQYESDK 729 (877)
T ss_pred -ccchhhhHHHHHHHHhhhhcccCcchhhhhccchhHHHHHHHHHhhhhhHHHHHHHHHHHhcchhHHHHHHHHhccCCC
Confidence 12222 59999999999999999999999999999999999999999999999999999998844
Q ss_pred ---hHHHHHHHHhcCC
Q 003405 761 ---PVFLLIRRMAMDI 773 (823)
Q Consensus 761 ---a~~~~l~~~y~~~ 773 (823)
.+|..||++|++.
T Consensus 730 ~~~~~y~~lL~~~l~~ 745 (877)
T KOG2063|consen 730 TNKEIYLTLLRIYLNP 745 (877)
T ss_pred cccHHHHHHHHHHhcc
Confidence 4777888888885
No 2
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=3e-42 Score=383.10 Aligned_cols=615 Identities=15% Similarity=0.201 Sum_probs=427.8
Q ss_pred ccCCCC-cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCce
Q 003405 12 ISNCSP-KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLL 90 (823)
Q Consensus 12 ~~~~~~-~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~L 90 (823)
..++.+ .|+|+++.++.|++||.+|.|..++ ..+++.+.|+++...-|.+|.++.+.+.|
T Consensus 20 ~~~~~G~~isc~~s~~~~vvigt~~G~V~~Ln-------------------~s~~~~~~fqa~~~siv~~L~~~~~~~~L 80 (933)
T KOG2114|consen 20 LENFVGNAISCCSSSTGSVVIGTADGRVVILN-------------------SSFQLIRGFQAYEQSIVQFLYILNKQNFL 80 (933)
T ss_pred cccCCCCceeEEcCCCceEEEeeccccEEEec-------------------ccceeeehheecchhhhhHhhcccCceEE
Confidence 445545 9999999999999999999999986 23455567777754458889999999999
Q ss_pred eeEeC---c----EEEEeCCCCcccc--------ccc------CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcC--CC
Q 003405 91 LSLSE---S----IAFHRLPNLETIA--------VLT------KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDG--GR 146 (823)
Q Consensus 91 l~l~d---~----l~~~~L~~l~~~~--------~i~------~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~--~~ 146 (823)
+++++ | |++|++...++.. .+. ....+++++|+.+-..++|| .++.|..|+.+- ++
T Consensus 81 ~sv~Ed~~~np~llkiw~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv~Gf~nG~V~~~~GDi~RDr 160 (933)
T KOG2114|consen 81 FSVGEDEQGNPVLLKIWDLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVSEDLKTIVCGFTNGLVICYKGDILRDR 160 (933)
T ss_pred EEEeecCCCCceEEEEecccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEEccccEEEEEecCcEEEEEcCcchhcc
Confidence 99986 2 7899986553111 111 23356788888887789999 555666666542 22
Q ss_pred ceeEeeeecCCCCceEEEecC--Ce-EEEEEcCceEEEEcCCCC--eeeccCCCCCCCCEEEEc---cCCeEEEEeCCeE
Q 003405 147 GFVEVKDFGVPDTVKSMSWCG--EN-ICIAIRKGYMILNATNGA--LSEVFPSGRIGPPLVVSL---LSGELLLGKENIG 218 (823)
Q Consensus 147 ~f~~~kei~~~~~~~~l~~~~--~~-i~v~~~~~y~lidl~~~~--~~~L~~~~~~~~p~i~~~---~~~EfLL~~~~~g 218 (823)
..+...+....++|+++++.. .. ++++|.+...++.+++.. ...+-..|. ++-|.. +.++|++|.++..
T Consensus 161 gsr~~~~~~~~~pITgL~~~~d~~s~lFv~Tt~~V~~y~l~gr~p~~~~ld~~G~---~lnCss~~~~t~qfIca~~e~l 237 (933)
T KOG2114|consen 161 GSRQDYSHRGKEPITGLALRSDGKSVLFVATTEQVMLYSLSGRTPSLKVLDNNGI---SLNCSSFSDGTYQFICAGSEFL 237 (933)
T ss_pred ccceeeeccCCCCceeeEEecCCceeEEEEecceeEEEEecCCCcceeeeccCCc---cceeeecCCCCccEEEecCceE
Confidence 222222445668999999984 34 788888999999998433 122333332 333332 3457999999999
Q ss_pred EEEcCCCccccCCceeec-CCCcEEEEeC-CEEEEEeCC-eEEEEEccCCCceeEEE---eeCC-----------c-ccc
Q 003405 219 VFVDQNGKLLQADRICWS-EAPIAVIIQK-PYAIALLPR-RVEVRSLRVPYALIQTI---VLQN-----------V-RHL 280 (823)
Q Consensus 219 vfv~~~G~~~~~~~i~w~-~~P~~v~~~~-PYll~~~~~-~ieV~~l~~~~~lvQ~i---~l~~-----------~-~~l 280 (823)
.|++.+|+ +.+++|+ +....+.+.. .|+++++++ +.+.-+..+ ....+.+ .+++ . +.+
T Consensus 238 ~fY~sd~~---~~cfaf~~g~kk~~~~~~~g~~L~v~~~~~~~~~s~s~-ss~~~i~~~~d~~n~~v~ys~vl~~l~d~l 313 (933)
T KOG2114|consen 238 YFYDSDGR---GPCFAFEVGEKKEMLVFSFGLLLCVTTDKGTENTSLSN-SSSNRIFKAYDLRNRYVLYSSVLEDLSDNL 313 (933)
T ss_pred EEEcCCCc---ceeeeecCCCeEEEEEEecCEEEEEEccCCCCCcccCc-cchhheeehhhhcCcccchHHhHHHHHHHH
Confidence 99999998 5799998 7766666554 899999873 222211111 1000111 1111 1 112
Q ss_pred cccC-CeEEEeccceEEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHH
Q 003405 281 IPSS-NAVVVALENSIFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAM 359 (823)
Q Consensus 281 ~~~~-~~v~v~s~~~I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~ 359 (823)
...+ +.+++++++.+.+|.++|++.+++.|++++.|+.|+.||++.. .+. ..++.|+++||.+||.+|+|++|+
T Consensus 314 ~~w~~~~~vltsdg~~~~L~ek~le~kL~iL~kK~ly~~Ai~LAk~~~-~d~----d~~~~i~~kYgd~Ly~Kgdf~~A~ 388 (933)
T KOG2114|consen 314 IEWSFDCLVLTSDGVVHELIEKDLETKLDILFKKNLYKVAINLAKSQH-LDE----DTLAEIHRKYGDYLYGKGDFDEAT 388 (933)
T ss_pred HhcCCcEEEEecCCceeeeeeccHHHHHHHHHHhhhHHHHHHHHHhcC-CCH----HHHHHHHHHHHHHHHhcCCHHHHH
Confidence 2223 5567788999999999999999999999999999999998762 222 357789999999999999999999
Q ss_pred HHHHhc--CCCHHHHHHhCCCCCCCCCcCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhhhhhhhhhhh
Q 003405 360 EHFLAS--QVDITYALSLYPSIVLPKTTVVPEPERLLDISSDAPSLSRGSSGMSDDMESSPPAQLSELDENATLKSKKMS 437 (823)
Q Consensus 360 ~~f~~~--~~dP~~vi~Lfp~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~v~~~~~~~~p~~l~~~d~~~~le~~~~~ 437 (823)
++++++ -+||++||..|++-
T Consensus 389 ~qYI~tI~~le~s~Vi~kfLda---------------------------------------------------------- 410 (933)
T KOG2114|consen 389 DQYIETIGFLEPSEVIKKFLDA---------------------------------------------------------- 410 (933)
T ss_pred HHHHHHcccCChHHHHHHhcCH----------------------------------------------------------
Confidence 999994 58999999999554
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCchhHhhhcccCCCcCCCccccccCCCCCCCCCccccHHHHHHHHHHHHHHHHHh
Q 003405 438 HNTLMALIKFLQKKRSSIIEKATAEGTEEVVLDAVGDNFTSHDSTRFKKSSKGRGTIPMYSGAREMAAILDTALLQALLL 517 (823)
Q Consensus 438 ~~a~~~L~~yL~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vDT~Ll~~y~~ 517 (823)
+.+++|+.||+..+++-+ |++ ...|.|+.||.+
T Consensus 411 -q~IknLt~YLe~L~~~gl--a~~--------------------------------------------dhttlLLncYiK 443 (933)
T KOG2114|consen 411 -QRIKNLTSYLEALHKKGL--ANS--------------------------------------------DHTTLLLNCYIK 443 (933)
T ss_pred -HHHHHHHHHHHHHHHccc--ccc--------------------------------------------hhHHHHHHHHHH
Confidence 468899999999877643 211 467899999999
Q ss_pred cCChhhHHhhhcCCC----cccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHH
Q 003405 518 TGQSSAALELLKGLN----YCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPES 593 (823)
Q Consensus 518 ~~~~~~l~~ll~~~n----~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~ 593 (823)
+++.+.+..|++.-. ..|++.+.++|++.+++++.-.|..+.++|+++|.++.+.. ++++.
T Consensus 444 lkd~~kL~efI~~~~~g~~~fd~e~al~Ilr~snyl~~a~~LA~k~~~he~vl~ille~~---------------~ny~e 508 (933)
T KOG2114|consen 444 LKDVEKLTEFISKCDKGEWFFDVETALEILRKSNYLDEAELLATKFKKHEWVLDILLEDL---------------HNYEE 508 (933)
T ss_pred hcchHHHHHHHhcCCCcceeeeHHHHHHHHHHhChHHHHHHHHHHhccCHHHHHHHHHHh---------------cCHHH
Confidence 999999999998633 56899999999999999999999999999999999998853 56789
Q ss_pred HHHHhhcCCCCC-hhhHHHhhhhhhhcCcccccccccc---CCC--ChH-------HHHHHHhh--cCchhHHHHHHHHh
Q 003405 594 IIEYLKPLCGTD-PMLVLEFSMLVLESCPTQTIELFLS---GNI--PAD-------LVNSYLKQ--YSPSMQGRYLELML 658 (823)
Q Consensus 594 ~i~yL~~L~~~~-~~li~~y~~wll~~~p~~~~~if~~---~~l--~~~-------~Vl~~L~~--~~~~~~~~YLE~li 658 (823)
+++|+++|+.++ ...+.+|++||+.++|++++.+|+. +.- +.. .-++++-- .+++....||+.+.
T Consensus 509 Al~yi~slp~~e~l~~l~kyGk~Ll~h~P~~t~~ili~~~t~~~~~~~~~~~s~~~~~~~~i~if~~~~~~~~~Fl~~~~ 588 (933)
T KOG2114|consen 509 ALRYISSLPISELLRTLNKYGKILLEHDPEETMKILIELITELNSQGKGKSLSNIPDSIEFIGIFSQNYQILLNFLESMS 588 (933)
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHhhcCCCCCCchhhcCccchhheeeeccCHHHHHHHHHHHH
Confidence 999999998766 4668899999999999999999886 110 000 11222221 23566777777644
Q ss_pred hccc-CCCChhHHHHHHHHHHHHH-----------HHHhhhhhhhc--ccCcc----------------c---chHHHHH
Q 003405 659 AMNE-NSISGNLQNEMVQIYLSEV-----------LDWYSDLSAQQ--KWDEK----------------A---YSPTRKK 705 (823)
Q Consensus 659 ~~~~-~~~~~~~h~~L~~lYl~~i-----------~~~~~~~~~~~--~~~~~----------------~---~~~~r~k 705 (823)
.... .....++-.++.++++..- ......+...+ ..|++ + ..-...|
T Consensus 589 E~s~~s~e~~~i~~t~~~~~l~~~sf~~~~~~~n~~~~l~h~~~~~~~~sdpq~kt~~~~~l~~~~~~~~~~~~~~~l~k 668 (933)
T KOG2114|consen 589 EISPDSEEVLEIIYTLLELSLMQKSFVTKPFEFNLEAELAHYQQYEGFDSDPQVKTTTLYDLYLELDAEDVPERTIILRK 668 (933)
T ss_pred hcCCCchhhhccccchhhhhhhhccccccchhhccHHHHHHHHhhcccccChhhhhccchhhHHHHHhhhcccccchhhh
Confidence 3211 1111123333444444220 00000000000 00000 0 0000111
Q ss_pred HHHHhhh-cCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHH------------HHhCCCchhHHHHHHHHhcC
Q 003405 706 LLSALES-ISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYV------------HKVFLINQPVFLLIRRMAMD 772 (823)
Q Consensus 706 Ll~fL~~-s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv------------~~L~D~~~a~~~~l~~~y~~ 772 (823)
--.++.. -++||.+.+|-+|+.+++.....++|+|++..++-+.... ..+++.+|.+|..+|+||++
T Consensus 669 sn~l~d~~~~nvd~d~al~l~qm~df~dg~ly~~~k~k~~~dl~~~~~q~~d~E~~it~~~~~g~~~p~l~~~~L~yF~~ 748 (933)
T KOG2114|consen 669 SNKLLDYAASNVDEDAALLLSQMSDFTDGLLYSYEKLKEGQDLMLYFQQISDPETVITLCERLGKEDPSLWLHALKYFVS 748 (933)
T ss_pred hcchhhhhhccccchHHHHHHHHhCCCchHHHHHhhccchHHHHHHHHHhhChHHHHHHHHHhCccChHHHHHHHHHHhh
Confidence 1112222 2459999999999999999999999999998887666654 56777799999999999999
Q ss_pred CCCCc
Q 003405 773 IKPLV 777 (823)
Q Consensus 773 ~~~~~ 777 (823)
.+...
T Consensus 749 ~~~i~ 753 (933)
T KOG2114|consen 749 EESIE 753 (933)
T ss_pred hcchh
Confidence 87433
No 3
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00 E-value=1.2e-37 Score=345.00 Aligned_cols=604 Identities=17% Similarity=0.224 Sum_probs=405.9
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~ 96 (823)
.|+|++.++.++++||.+|.++.+..+++.. ++++.++. ..+++.+.++++ |
T Consensus 41 ~is~~av~~~~~~~GtH~g~v~~~~~~~~~~----------------------~~~~~s~~-----~~~Gey~asCS~DG 93 (846)
T KOG2066|consen 41 AISCCAVHDKFFALGTHRGAVYLTTCQGNPK----------------------TNFDHSSS-----ILEGEYVASCSDDG 93 (846)
T ss_pred HHHHHHhhcceeeeccccceEEEEecCCccc----------------------cccccccc-----ccCCceEEEecCCC
Confidence 7999999999999999999999998776431 11222222 567788999997 6
Q ss_pred -EEEEeCCCCcccccccCCCCcEEEEeeCCC-----ceEEEEEcCeEEEEEEcCCCceeEeeee---cCCCCceEEEecC
Q 003405 97 -IAFHRLPNLETIAVLTKAKGANVYSWDDRR-----GFLCFARQKRVCIFRHDGGRGFVEVKDF---GVPDTVKSMSWCG 167 (823)
Q Consensus 97 -l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~-----~~l~V~~kkki~l~~~~~~~~f~~~kei---~~~~~~~~l~~~~ 167 (823)
|.+..+.+-+..++....+.+.+++++++. +++++|..+.+.+|+.+ .|...+.+ ...++|.++.|+|
T Consensus 94 kv~I~sl~~~~~~~~~df~rpiksial~Pd~~~~~sk~fv~GG~aglvL~er~---wlgnk~~v~l~~~eG~I~~i~W~g 170 (846)
T KOG2066|consen 94 KVVIGSLFTDDEITQYDFKRPIKSIALHPDFSRQQSKQFVSGGMAGLVLSERN---WLGNKDSVVLSEGEGPIHSIKWRG 170 (846)
T ss_pred cEEEeeccCCccceeEecCCcceeEEeccchhhhhhhheeecCcceEEEehhh---hhcCccceeeecCccceEEEEecC
Confidence 889999887777777777889999999873 34555655668888754 33222233 2347999999999
Q ss_pred CeEEEEEcCceEEEEcCCCCeeeccCCCC------CCCCEEEEccCCeEEEE-eCCeEEEEcCCCccccCCceeecCCC-
Q 003405 168 ENICIAIRKGYMILNATNGALSEVFPSGR------IGPPLVVSLLSGELLLG-KENIGVFVDQNGKLLQADRICWSEAP- 239 (823)
Q Consensus 168 ~~i~v~~~~~y~lidl~~~~~~~L~~~~~------~~~p~i~~~~~~EfLL~-~~~~gvfv~~~G~~~~~~~i~w~~~P- 239 (823)
+.|.|++..|..++|+.+++.....+... ..+|..++.+++.+++| .++.-++.=.++..+.-.++..+..|
T Consensus 171 ~lIAWand~Gv~vyd~~~~~~l~~i~~p~~~~R~e~fpphl~W~~~~~LVIGW~d~v~i~~I~~~~s~~a~~~~~~~~~~ 250 (846)
T KOG2066|consen 171 NLIAWANDDGVKVYDTPTRQRLTNIPPPSQSVRPELFPPHLHWQDEDRLVIGWGDSVKICSIKKRSSSEARSFRLPSLKK 250 (846)
T ss_pred cEEEEecCCCcEEEeccccceeeccCCCCCCCCcccCCCceEecCCCeEEEecCCeEEEEEEecccccccccccCCccce
Confidence 99999999999999999987665554432 23577888888888884 55655554333322212334444432
Q ss_pred cEE---EEeCCEEEEEeC--CeEEEEEccC-----------C-------CceeEEEeeCC------------c-------
Q 003405 240 IAV---IIQKPYAIALLP--RRVEVRSLRV-----------P-------YALIQTIVLQN------------V------- 277 (823)
Q Consensus 240 ~~v---~~~~PYll~~~~--~~ieV~~l~~-----------~-------~~lvQ~i~l~~------------~------- 277 (823)
..+ .-...|+-++.+ ..+.+-.... + ...++-+.... .
T Consensus 251 V~~~s~f~~s~~isGla~lg~qLv~L~f~~~~~~~e~~s~~~~~r~~~~~peir~~~~~~~Ei~~Dal~~~~~e~~~~~D 330 (846)
T KOG2066|consen 251 VEIVSHFETSFYISGLAPLGDQLVVLGFDKDISEGEFTSARPSSRAKGNRPEIRIVSLNNDEICSDALIVRGFEELSIND 330 (846)
T ss_pred eeeEEEeeeeeeeeccccccceeEEEeeecccccccccccchhhhccCCCceEEeccccchhhhhhhhhhcchhhcCCcc
Confidence 222 223456666665 2333332210 0 01112111111 0
Q ss_pred cccc---ccCCeEEEeccceEEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCC
Q 003405 278 RHLI---PSSNAVVVALENSIFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGS 354 (823)
Q Consensus 278 ~~l~---~~~~~v~v~s~~~I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~ 354 (823)
.+|. .....+|+.|+++|....+.+.+++|+||+++++|++|+..++..... .......++...|.++|...++
T Consensus 331 Y~L~~~~~~~~~yyIvspkDiV~a~~~~~~Dhi~Wll~~k~yeeAl~~~k~~~~~---~~~~~i~kv~~~yI~HLl~~~~ 407 (846)
T KOG2066|consen 331 YHLGGHPKTEPLYYIVSPKDIVVAKERDQEDHIDWLLEKKKYEEALDAAKASIGN---EERFVIKKVGKTYIDHLLFEGK 407 (846)
T ss_pred ccccCCCCCCceEEEecCCceEEEeecCcchhHHHHHHhhHHHHHHHHHHhccCC---ccccchHHHHHHHHHHHHhcch
Confidence 1121 224568999999999999999999999999999999999999987422 1122577899999999999999
Q ss_pred HHHHHHHHHhc-CCCH---HHHHHhCCCCCCCCCcCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhhhh
Q 003405 355 YEEAMEHFLAS-QVDI---TYALSLYPSIVLPKTTVVPEPERLLDISSDAPSLSRGSSGMSDDMESSPPAQLSELDENAT 430 (823)
Q Consensus 355 f~~A~~~f~~~-~~dP---~~vi~Lfp~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~v~~~~~~~~p~~l~~~d~~~~ 430 (823)
|++|.....+- +-+- ..-+..|-.+ +.+.+.+. -++ ..|+.+++.
T Consensus 408 y~~Aas~~p~m~gn~~~eWe~~V~~f~e~-----------~~l~~Ia~--------------~lP-t~~~rL~p~----- 456 (846)
T KOG2066|consen 408 YDEAASLCPKMLGNNAAEWELWVFKFAEL-----------DQLTDIAP--------------YLP-TGPPRLKPL----- 456 (846)
T ss_pred HHHHHhhhHHHhcchHHHHHHHHHHhccc-----------cccchhhc--------------cCC-CCCcccCch-----
Confidence 99999775431 1000 0001111110 11111111 011 111222110
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHhhccCCchhHhhhcccCCCcCCCccccccCCCCCCCCCccccHHHHHHHHHHH
Q 003405 431 LKSKKMSHNTLMALIKFLQKKRSSIIEKATAEGTEEVVLDAVGDNFTSHDSTRFKKSSKGRGTIPMYSGAREMAAILDTA 510 (823)
Q Consensus 431 le~~~~~~~a~~~L~~yL~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vDT~ 510 (823)
-.-..|..||......+... +..|.+ .+.....+++.
T Consensus 457 --------vYemvLve~L~~~~~~F~e~---------i~~Wp~-------------------------~Lys~l~iisa- 493 (846)
T KOG2066|consen 457 --------VYEMVLVEFLASDVKGFLEL---------IKEWPG-------------------------HLYSVLTIISA- 493 (846)
T ss_pred --------HHHHHHHHHHHHHHHHHHHH---------HHhCCh-------------------------hhhhhhHHHhh-
Confidence 23445667776444444321 111211 11112222221
Q ss_pred HHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhc----CcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccccc
Q 003405 511 LLQALLLTGQSSAALELLKGLNYCDVKICEEILQKK----NHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHT 586 (823)
Q Consensus 511 Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~----~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~ 586 (823)
.++.++++ ...+.||.||...++|.+|++++.++.+.+.
T Consensus 494 ----------------------------~~~q~~q~Se~~~L~e~La~LYl~d~~Y~~Al~~ylklk~~~v--------- 536 (846)
T KOG2066|consen 494 ----------------------------TEPQIKQNSESTALLEVLAHLYLYDNKYEKALPIYLKLQDKDV--------- 536 (846)
T ss_pred ----------------------------cchHHHhhccchhHHHHHHHHHHHccChHHHHHHHHhccChHH---------
Confidence 11122111 1122399999999999999999999875543
Q ss_pred ccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccccccccc--CCCChHHHHHHHhhcCchhHHHHHHHHhhcccCC
Q 003405 587 QKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIELFLS--GNIPADLVNSYLKQYSPSMQGRYLELMLAMNENS 664 (823)
Q Consensus 587 ~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~if~~--~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~ 664 (823)
.+.+.+ ..-.+.+.+-..-+|..+-+.++++|++ +++|+..|++.+. ..|+.+..||..+...+ ..
T Consensus 537 --------f~lI~k--~nL~d~i~~~Iv~Lmll~skka~~lLldn~d~ip~a~Vveql~-~~P~~l~~YL~kl~~rd-~~ 604 (846)
T KOG2066|consen 537 --------FDLIKK--HNLFDQIKDQIVLLMLLDSKKAIDLLLDNRDSISPSEVVEQLE-DNPKLLYCYLHKLFKRD-HF 604 (846)
T ss_pred --------HHHHHH--HhhHHHHHHHHHHHHccchhhHHHHHhhccccCCHHHHHHHHh-cChHHHHHHHHHHhhcC-cc
Confidence 222221 1123556666667888888899999987 5799999999999 45999999999998754 46
Q ss_pred CChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccH
Q 003405 665 ISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQH 744 (823)
Q Consensus 665 ~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h 744 (823)
...++|++++++|+++ .|++|++||+.|++|++++|+++|.+.++++|.+|||||||+|
T Consensus 605 ~~~~y~dk~I~LYAEy---------------------Drk~LLPFLr~s~~Y~lekA~eiC~q~~~~~E~VYlLgrmGn~ 663 (846)
T KOG2066|consen 605 MGSEYHDKQIELYAEY---------------------DRKKLLPFLRKSQNYNLEKALEICSQKNFYEELVYLLGRMGNA 663 (846)
T ss_pred ccchhhhHHHHHHHHH---------------------hHhhhhHHHHhcCCCCHHHHHHHHHhhCcHHHHHHHHHhhcch
Confidence 7889999999999999 5999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhCCC-----------chhHHHHHHHHhcCCCCCcch--------------hhhcc-chhHHHHHHHHHHH
Q 003405 745 ELALSLYVHKVFLI-----------NQPVFLLIRRMAMDIKPLVTE--------------HEIKH-INWRVLQATIIKLF 798 (823)
Q Consensus 745 ~~AL~ilv~~L~D~-----------~~a~~~~l~~~y~~~~~~~~~--------------~~~~~-~~~~~~~~~~~~~~ 798 (823)
++||.++|++|+|+ ++.+|-.|..+.++.|+.++. +-+.+ ++.|-||-+|.|+-
T Consensus 664 k~AL~lII~el~die~AIefvKeq~D~eLWe~LI~~~ldkPe~~~~ll~i~~~~dpl~ii~kip~g~~IPnLrdsl~Kil 743 (846)
T KOG2066|consen 664 KEALKLIINELRDIEKAIEFVKEQDDSELWEDLINYSLDKPEFIKALLNIGEHEDPLLIIRKIPDGLEIPNLRDSLVKIL 743 (846)
T ss_pred HHHHHHHHHHhhCHHHHHHHHHhcCCHHHHHHHHHHhhcCcHHHHHHHHhhhcccHHHHHhcCCCCCCCccHHHHHHHHH
Confidence 99999999999999 456999999999998866552 12233 56777888887764
No 4
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=100.00 E-value=1.3e-33 Score=301.77 Aligned_cols=240 Identities=32% Similarity=0.553 Sum_probs=204.2
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-EEE
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-IAF 99 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-l~~ 99 (823)
|++.|+++|+|||++| |++|++.... ...+. .++.+|+||.++|+.++|++++|+ +.+
T Consensus 2 c~~~~~~~L~vGt~~G-l~~~~~~~~~-----------------~~~~i---~~~~~I~ql~vl~~~~~llvLsd~~l~~ 60 (275)
T PF00780_consen 2 CADSWGDRLLVGTEDG-LYVYDLSDPS-----------------KPTRI---LKLSSITQLSVLPELNLLLVLSDGQLYV 60 (275)
T ss_pred CcccCCCEEEEEECCC-EEEEEecCCc-----------------cceeE---eecceEEEEEEecccCEEEEEcCCccEE
Confidence 8899999999999999 8888872111 01111 224569999999999999999996 999
Q ss_pred EeCCCCcccc---------------cccCCCCcEEEEe-e--CCCceEEEEEcCeEEEEEEcCC-Cce-eEeeeecCCCC
Q 003405 100 HRLPNLETIA---------------VLTKAKGANVYSW-D--DRRGFLCFARQKRVCIFRHDGG-RGF-VEVKDFGVPDT 159 (823)
Q Consensus 100 ~~L~~l~~~~---------------~i~~~kg~~~fa~-~--~~~~~l~V~~kkki~l~~~~~~-~~f-~~~kei~~~~~ 159 (823)
|+|+.+.+.. .++..+||++|+. . ....++||+.||+|.+|+|..+ ..| +..||+.+|++
T Consensus 61 ~~L~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~lp~~ 140 (275)
T PF00780_consen 61 YDLDSLEPVSTSAPLAFPKSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISLPDP 140 (275)
T ss_pred EEchhhccccccccccccccccccccccccCCeeEEeeccccccceEEEEEECCEEEEEEEECCcccccceeEEEEcCCC
Confidence 9998876544 5678899999992 2 2224699999999999999864 468 88999999999
Q ss_pred ceEEEecCCeEEEEEcCceEEEEcCCCCeeeccCCCCC----------CC-CEEEEccCCeEEEEeCCeEEEEcCCCccc
Q 003405 160 VKSMSWCGENICIAIRKGYMILNATNGALSEVFPSGRI----------GP-PLVVSLLSGELLLGKENIGVFVDQNGKLL 228 (823)
Q Consensus 160 ~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~----------~~-p~i~~~~~~EfLL~~~~~gvfv~~~G~~~ 228 (823)
|++|+|.++.||+|++++|.++|+.++...++++.+.. .. +.+..++++|||||+++.|+|+|.+|+++
T Consensus 141 ~~~i~~~~~~i~v~~~~~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~Ll~~~~~g~fv~~~G~~~ 220 (275)
T PF00780_consen 141 PSSIAFLGNKICVGTSKGFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLSDNEFLLCYDNIGVFVNKNGEPS 220 (275)
T ss_pred cEEEEEeCCEEEEEeCCceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEeCCceEEEEecceEEEEcCCCCcC
Confidence 99999999999999999999999999999999865432 23 44567788999999999999999999999
Q ss_pred cCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCCcccccc
Q 003405 229 QADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQNVRHLIP 282 (823)
Q Consensus 229 ~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~~ 282 (823)
|+++|+|++.|.++++.+|||+++++++||||++. ++.++|+|++++.+.+.+
T Consensus 221 r~~~i~W~~~p~~~~~~~pyli~~~~~~iEV~~~~-~~~lvQ~i~~~~~~~l~~ 273 (275)
T PF00780_consen 221 RKSTIQWSSAPQSVAYSSPYLIAFSSNSIEVRSLE-TGELVQTIPLPNIRLLCS 273 (275)
T ss_pred cccEEEcCCchhEEEEECCEEEEECCCEEEEEECc-CCcEEEEEECCCEEEEec
Confidence 88999999999999999999999999999999998 599999999999887765
No 5
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=99.94 E-value=7e-25 Score=235.57 Aligned_cols=244 Identities=21% Similarity=0.324 Sum_probs=186.8
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c--EEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S--IAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~--l~~~~L~ 103 (823)
++|++||++|....+ +.+... ++ .+.+++++|+||.++++.++|++|+| . +.+|+|.
T Consensus 14 ~~lL~GTe~Gly~~~-~~~~~~----------------~~---~kl~~~~~v~q~~v~~~~~lLi~Lsgk~~~L~~~~L~ 73 (302)
T smart00036 14 KWLLVGTEEGLYVLN-ISDQPG----------------TL---EKLIGRRSVTQIWVLEENNVLLMISGKKPQLYSHPLS 73 (302)
T ss_pred cEEEEEeCCceEEEE-cccCCC----------------Ce---EEecCcCceEEEEEEhhhCEEEEEeCCcceEEEEEHH
Confidence 599999999976665 432110 11 23356889999999999999999999 4 9999997
Q ss_pred CCcc----------------cccccCCCCcEEEEeeCCC--ceEEEEEcCeEEEEEEcCC-CceeEeeee----cCCCCc
Q 003405 104 NLET----------------IAVLTKAKGANVYSWDDRR--GFLCFARQKRVCIFRHDGG-RGFVEVKDF----GVPDTV 160 (823)
Q Consensus 104 ~l~~----------------~~~i~~~kg~~~fa~~~~~--~~l~V~~kkki~l~~~~~~-~~f~~~kei----~~~~~~ 160 (823)
.+.. ...+.++|||+.|++.... .++|+|.+++|.+|+|... ..|...|++ ..++.+
T Consensus 74 ~L~~~~~~~~~~~~~~~~~~~~~~~~tkGc~~~~v~~~~~~~~l~~A~~~~i~l~~~~~~~~~f~~~k~~~~~~~~~~~~ 153 (302)
T smart00036 74 ALVEKKEALGSARLVIRKNVLTKIPDTKGCHLCAVVNGKRSLFLCVALQSSVVLLQWYNPLKKFKLFKSKFLFPLISPVP 153 (302)
T ss_pred HhhhhhhccCCccccccccceEeCCcCCceEEEEEEcCCCcEEEEEEcCCeEEEEEccChhhhhhhhcccccccCCCCcc
Confidence 6653 2356888999999987655 3589999999999999742 246666653 233444
Q ss_pred eEEEec-----CCeEEEEEcC-ceEEEEcCC--CCeeec------cCCCCCCCCEEEEccCCeEEEEeCCeEEEEcCCC-
Q 003405 161 KSMSWC-----GENICIAIRK-GYMILNATN--GALSEV------FPSGRIGPPLVVSLLSGELLLGKENIGVFVDQNG- 225 (823)
Q Consensus 161 ~~l~~~-----~~~i~v~~~~-~y~lidl~~--~~~~~L------~~~~~~~~p~i~~~~~~EfLL~~~~~gvfv~~~G- 225 (823)
....|. ++.||||+++ +|.++++.+ ....+. ...+. ..+.+..++++|||||+++.|+|||.+|
T Consensus 154 ~~~~~~~~~~~~~~lcvG~~~~~~~~~~~~~~~~~~~d~sl~~~~~~~~~-~p~~i~~l~~~e~Llc~~~~~v~Vn~~G~ 232 (302)
T smart00036 154 VFVELVSSSFERPGICIGSDKGGGDVVQFHESLVSKEDLSLPFLSEETSL-KPISVVQVPRDEFLLCYDEFGVFVNLYGK 232 (302)
T ss_pred ceEeeecccccceEEEEEEcCCCCeEEEEeeccccccccccccccccccc-CceEEEEECCCeEEEEECcEEEEEeCCCC
Confidence 444453 5689999997 999999865 221111 11111 2344667889999999999999999999
Q ss_pred ccccCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEe---eCCcccccccCCeEEEecc
Q 003405 226 KLLQADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIV---LQNVRHLIPSSNAVVVALE 292 (823)
Q Consensus 226 ~~~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~---l~~~~~l~~~~~~v~v~s~ 292 (823)
+.++...+.|++.|.+++|..|||+++.+++||||++. ++.++|+|+ .++.++|+..++.++++|.
T Consensus 233 ~~~r~~~l~w~~~p~~~~~~~pyll~~~~~~ievr~l~-~~~l~q~i~~~~~~~~r~L~~~~~~I~~~s~ 301 (302)
T smart00036 233 RRSRNPILHWEFMPESFAYHSPYLLAFHDNGIEIRSIK-TGELLQELADRETRKIRLLGSSDRKILLSSS 301 (302)
T ss_pred ccccceEEEcCCcccEEEEECCEEEEEcCCcEEEEECC-CCceEEEEecCCCcceEEEecCCCeEEEEec
Confidence 77777889999999999999999999999999999997 589999998 5677888866777777763
No 6
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling. The precise function of this domain has not been characterised.
Probab=99.87 E-value=4.7e-22 Score=177.89 Aligned_cols=107 Identities=36% Similarity=0.569 Sum_probs=92.1
Q ss_pred HHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccccc
Q 003405 507 LDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHT 586 (823)
Q Consensus 507 vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~ 586 (823)
|||+||+||+.++ +..+++|+|.||+|++++|++.|+++++|.+|+.||..+|+|++||++|+++++++..+..++
T Consensus 1 VDTaLlk~Yl~~~-~~~l~~llr~~N~C~~~~~e~~L~~~~~~~eL~~lY~~kg~h~~AL~ll~~l~~~~~~~~~~~--- 76 (108)
T PF10366_consen 1 VDTALLKCYLETN-PSLLGPLLRLPNYCDLEEVEEVLKEHGKYQELVDLYQGKGLHRKALELLKKLADEEDSDEEDP--- 76 (108)
T ss_pred CcHHHHHHHHHhC-HHHHHHHHccCCcCCHHHHHHHHHHcCCHHHHHHHHHccCccHHHHHHHHHHhcccccccccc---
Confidence 6999999999995 578999999999999999999999999999999999999999999999999998532221111
Q ss_pred ccCChHHH-HHHhhcCCCCChhhHHHhhhhhh
Q 003405 587 QKFNPESI-IEYLKPLCGTDPMLVLEFSMLVL 617 (823)
Q Consensus 587 ~~~~~~~~-i~yL~~L~~~~~~li~~y~~wll 617 (823)
...++..+ |+||++|++++.++|++|++|++
T Consensus 77 ~~~~~~~~iv~yL~~L~~~~~dLI~~~s~WvL 108 (108)
T PF10366_consen 77 FLSGVKETIVQYLQKLGNEDLDLIFEYSDWVL 108 (108)
T ss_pred cccCchhHHHHHHHhCChhhhHHHHHhccccC
Confidence 12344455 99999999999999999999985
No 7
>COG5422 ROM1 RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms]
Probab=99.84 E-value=4.6e-20 Score=205.80 Aligned_cols=261 Identities=21% Similarity=0.323 Sum_probs=189.5
Q ss_pred CcccccccccCC---CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCe
Q 003405 4 NAFDSLELISNC---SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPI 78 (823)
Q Consensus 4 ~af~~~~l~~~~---~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I 78 (823)
.-|++.++...+ .+++.|+-.++ +.+++||+.|. ++.....+. +.+.+.+.......|
T Consensus 842 ~~ft~~~~~~~Ff~~~nkvn~v~~~dsgr~ll~~T~kgl-Yis~~k~~~----------------~~f~kpI~~l~~~nI 904 (1175)
T COG5422 842 LWFTSFPICDQFFSTTNKVNPVPLYDSGRKLLTGTNKGL-YISNRKDNV----------------NRFNKPIDLLQEPNI 904 (1175)
T ss_pred hheeeccchhheeeccceecceeeccCCCeEEEecccee-EEEEeccCc----------------ccccccHHHHhcCCc
Confidence 346667776665 47899988885 79999999996 333322221 111222223346789
Q ss_pred eEEEEecccCceeeEeCc-EEEEeCCCCcccccc------cCCCCcEEEEeeCCCce-EEEEEcCe-----EEEEEE---
Q 003405 79 LSMEVLASRQLLLSLSES-IAFHRLPNLETIAVL------TKAKGANVYSWDDRRGF-LCFARQKR-----VCIFRH--- 142 (823)
Q Consensus 79 ~qI~~~~~~~~Ll~l~d~-l~~~~L~~l~~~~~i------~~~kg~~~fa~~~~~~~-l~V~~kkk-----i~l~~~--- 142 (823)
+||.++++.++++.++|. ++-+.|+-.+..... .....+.+|..+.+.|. +++++|.. +.+++.
T Consensus 905 SQi~vieey~lmlllsdk~LY~~pl~vid~~~~~~~kksr~~~~hvsffk~G~C~gk~lv~~~kS~~~~~~l~v~e~~~~ 984 (1175)
T COG5422 905 SQIIVIEEYKLMLLLSDKKLYSCPLDVIDASTEENVKKSRIVNGHVSFFKQGFCNGKRLVCAVKSSSLSATLAVIEAPLA 984 (1175)
T ss_pred ceeeehhhhhHHHHhhcCeeecCccchhhhhhhhhhhhhhheeceeEEEeecccccceEEEeeeeheeeeeeeeecchhh
Confidence 999999999999999995 676666433221111 12234667777776664 44444432 334441
Q ss_pred ---cC-CC--c-e--eEeeeecCCCCceEEEecCCeEEEEEcCceEEEEcCCCCeeeccCCC----------CCCCCEEE
Q 003405 143 ---DG-GR--G-F--VEVKDFGVPDTVKSMSWCGENICIAIRKGYMILNATNGALSEVFPSG----------RIGPPLVV 203 (823)
Q Consensus 143 ---~~-~~--~-f--~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~----------~~~~p~i~ 203 (823)
.. +. . + ...+|+.+|..|.++.|..+.||||++++|.++++.+-..++|+.+- +..+|+.+
T Consensus 985 ~~~~~s~n~Kk~lt~~~~~el~v~~E~~sv~Flk~KlcIgC~kgFeIvsle~l~~esLL~paD~s~~~~~~ken~kpiai 1064 (1175)
T COG5422 985 LKKNKSGNLKKALTIELSTELYVPSEPLSVHFLKNKLCIGCKKGFEIVSLENLRTESLLNPADTSPLFFEKKENTKPIAI 1064 (1175)
T ss_pred hhcccCcchhhhhhhhheEEEEecCceeeeeeeccceEEeecCCceEeechhhhhHhhcCcccccHHHHhhcccCceEEE
Confidence 11 11 1 1 12568899999999999999999999999999999988877777552 23567766
Q ss_pred EccCCeEEEEeCCeEEEEcCCCcccc-CCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCCcccccc
Q 003405 204 SLLSGELLLGKENIGVFVDQNGKLLQ-ADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQNVRHLIP 282 (823)
Q Consensus 204 ~~~~~EfLL~~~~~gvfv~~~G~~~~-~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~~ 282 (823)
..-.+|||+|+++.|+|||.+|+..| ...+.|++.|..++.++|||+++.++.|||+++. ++.+|+++--.+++++..
T Consensus 1065 ~rv~~eFLLCys~faFfVN~~Gwrkrts~i~~Weg~Pq~FalsypYIlaf~~~fIeIr~ie-TgeLI~~ilg~~IRlLt~ 1143 (1175)
T COG5422 1065 FRVSGEFLLCYSEFAFFVNDQGWRKRTSWIFHWEGEPQEFALSYPYILAFEPNFIEIRHIE-TGELIRCILGHNIRLLTD 1143 (1175)
T ss_pred EeeCCcEEEEecceeEEEcCcCceecccEEEEEcCccceeeeecceEEEecCceEEEEecc-cceeeeeeccCceEEeec
Confidence 55556999999999999999998775 3568899999999999999999999999999997 799999998778787765
No 8
>KOG2034 consensus Vacuolar sorting protein PEP3/VPS18 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.62 E-value=1.5e-14 Score=164.78 Aligned_cols=409 Identities=16% Similarity=0.201 Sum_probs=276.5
Q ss_pred CCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCC-----cccccc--cCCeEEEeccceEEEeeccChhHHHHH
Q 003405 237 EAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQN-----VRHLIP--SSNAVVVALENSIFGLFPVPLGAQIVQ 309 (823)
Q Consensus 237 ~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~-----~~~l~~--~~~~v~v~s~~~I~~l~~~~~~~qI~~ 309 (823)
.+|..++....+++-+..+.|.+.+..+ +..+..-+++. .-.+++ .-+.+++-|.+.|+.+...+-..-|+.
T Consensus 288 ~~p~~ivLT~yH~LLl~~d~V~avs~Ln-~~vI~~~~~n~s~~g~~LGlv~D~va~~~w~YTq~~vf~~~vndE~R~vWk 366 (911)
T KOG2034|consen 288 EPPKAIVLTEFHFLLLYADRVLAVSLLN-GEVIYRDQFNESELGGILGLVSDSVAETFWLYTQTSVFEYGVNDEARDVWK 366 (911)
T ss_pred CCcceehHHHHHHHHHhcCceeeeeccC-ccccchhccCchhcccceeeeeccccceEEEEEeceeeeeeeccchHHHHH
Confidence 4688888888788888888888888775 54443333332 122232 234578889999999998888888877
Q ss_pred H-HhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCCCCCCCcCCC
Q 003405 310 L-TASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSIVLPKTTVVP 388 (823)
Q Consensus 310 L-l~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l~~~~~~~~~ 388 (823)
. +++|+|+.|++.|..-| ..+..+..++|..+|..++|..|++.+.+......+|...|-.+- +.
T Consensus 367 ~yLd~g~y~kAL~~ar~~p--------~~le~Vl~~qAdf~f~~k~y~~AA~~yA~t~~~FEEVaLKFl~~~--~~---- 432 (911)
T KOG2034|consen 367 TYLDKGEFDKALEIARTRP--------DALETVLLKQADFLFQDKEYLRAAEIYAETLSSFEEVALKFLEIN--QE---- 432 (911)
T ss_pred HHHhcchHHHHHHhccCCH--------HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhhHHHHHHHHHhcC--CH----
Confidence 6 79999999999996532 246678999999999999999999999999888889999997772 11
Q ss_pred CCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhccCCchhHh
Q 003405 389 EPERLLDISSDAPSLSRGSSGMSDDMESSPPAQLSELDENATLKSKKMSHNTLMALIKFLQKKRSSIIEKATAEGTEEVV 468 (823)
Q Consensus 389 ~~~~~~~~~~~~~~l~~~~~~v~~~~~~~~p~~l~~~d~~~~le~~~~~~~a~~~L~~yL~~~R~~~~~~~~~~~~~~~~ 468 (823)
+.+... +.+.. + .++.-| +-...+|..||.+.--..++...+.
T Consensus 433 --~~L~~~------L~KKL-------~-----~lt~~d-----------k~q~~~Lv~WLlel~L~~Ln~l~~~------ 475 (911)
T KOG2034|consen 433 --RALRTF------LDKKL-------D-----RLTPED-----------KTQRDALVTWLLELYLEQLNDLDST------ 475 (911)
T ss_pred --HHHHHH------HHHHH-------h-----hCChHH-----------HHHHHHHHHHHHHHHHHHHhccccc------
Confidence 111000 00000 0 000000 1345578888877654333211100
Q ss_pred hhcccCCCcCCCccccccCCCCCCCCCccccHHHHHHHHHHHHHHHHHhcCChhhHHhhhcC-CCcccHHHHHHHHHhcC
Q 003405 469 LDAVGDNFTSHDSTRFKKSSKGRGTIPMYSGAREMAAILDTALLQALLLTGQSSAALELLKG-LNYCDVKICEEILQKKN 547 (823)
Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vDT~Ll~~y~~~~~~~~l~~ll~~-~n~c~~~~~~~~L~~~~ 547 (823)
..+..+..|+.. ++.. ..+..+... ....+-+.|...+.+++
T Consensus 476 -------------------------------de~~~en~~~~~-~~~~-----re~~~~~~~~~~~~nretv~~l~~~~~ 518 (911)
T KOG2034|consen 476 -------------------------------DEEALENWRLEY-DEVQ-----REFSKFLVLHKDELNRETVYQLLASHG 518 (911)
T ss_pred -------------------------------ChhHHHHHHHHH-HHHH-----HHHHHHHHhhHHhhhHHHHHHHHHHcc
Confidence 011111111111 1110 011111111 11235678888888888
Q ss_pred cHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCccccccc
Q 003405 548 HYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIEL 627 (823)
Q Consensus 548 ~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~i 627 (823)
+.++++.+.--.++++..+..|..... .+.+.+.|.+- .+.++.++|++-|+.+.|.+++..
T Consensus 519 ~~e~ll~fA~l~~d~~~vv~~~~q~e~----------------yeeaLevL~~~--~~~el~yk~ap~Li~~~p~~tV~~ 580 (911)
T KOG2034|consen 519 RQEELLQFANLIKDYEFVVSYWIQQEN----------------YEEALEVLLNQ--RNPELFYKYAPELITHSPKETVSA 580 (911)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHhc--cchhhHHHhhhHHHhcCcHHHHHH
Confidence 888888888888888888888876532 24566666642 467899999999999999999999
Q ss_pred ccc--CCCChHHHHHHHhhc-------CchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCccc
Q 003405 628 FLS--GNIPADLVNSYLKQY-------SPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKA 698 (823)
Q Consensus 628 f~~--~~l~~~~Vl~~L~~~-------~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~ 698 (823)
++. +.+++..+.+.|.-. .+...++|||+.+.. .+..++.+||.|..+|+...
T Consensus 581 wm~~~d~~~~~li~~~L~~~~~~~~~~~~~~~i~yl~f~~~~-l~~~~~~ihn~ll~lya~~~----------------- 642 (911)
T KOG2034|consen 581 WMAQKDLDPNRLIPPILSYFSNWHSEYEENQAIRYLEFCIEV-LGMTNPAIHNSLLHLYAKHE----------------- 642 (911)
T ss_pred HHHccccCchhhhHHHHHHHhcCCccccHHHHHHHHHHHHHh-ccCcCHHHHHHHHHHhhcCC-----------------
Confidence 998 346666655555421 245799999999875 46789999999999999762
Q ss_pred chHHHHHHHHHh------hhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHH--------------hCCC
Q 003405 699 YSPTRKKLLSAL------ESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHK--------------VFLI 758 (823)
Q Consensus 699 ~~~~r~kLl~fL------~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~--------------L~D~ 758 (823)
|..|+-.| ++...||+..+++.|-+.+-...+++|+..|+.|++|.++...- -.|-
T Consensus 643 ----~~~ll~~le~~~~~~~~~~YDl~~alRlc~~~~~~ra~V~l~~~l~l~~~aVdlAL~~d~dlak~~A~~~ee~e~l 718 (911)
T KOG2034|consen 643 ----RDDLLLYLEIIKFMKSRVHYDLDYALRLCLKFKKTRACVFLLCMLNLFEDAVDLALQFDIDLAKVIANDPEEDEDL 718 (911)
T ss_pred ----ccchHHHHHHHhhccccceecHHHHHHHHHHhCccceeeeHHHHHHHHHHHHHHHhhcCHHHHhhhhcChhhHHHH
Confidence 33333333 33578999999999998877789999999999999999988731 1112
Q ss_pred chhHHHHHHHHhcCCC
Q 003405 759 NQPVFLLIRRMAMDIK 774 (823)
Q Consensus 759 ~~a~~~~l~~~y~~~~ 774 (823)
+..+|....++++...
T Consensus 719 rKkLWLkIAkh~v~~~ 734 (911)
T KOG2034|consen 719 RKKLWLKIAKHVVKQE 734 (911)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 4579999999888753
No 9
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=99.52 E-value=6.2e-14 Score=133.79 Aligned_cols=121 Identities=20% Similarity=0.203 Sum_probs=105.8
Q ss_pred CCCChHHHHHHHhhc-CchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHH
Q 003405 631 GNIPADLVNSYLKQY-SPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSA 709 (823)
Q Consensus 631 ~~l~~~~Vl~~L~~~-~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~f 709 (823)
++++++.|++.+... .+..++.|||+++..+ ..++.+||+|+++|++. .+.+++.|
T Consensus 6 ~~~~~~~vv~~~~~~~~~~~l~~yLe~~~~~~--~~~~~~~~~li~ly~~~---------------------~~~~ll~~ 62 (140)
T smart00299 6 DPIDVSEVVELFEKRNLLEELIPYLESALKLN--SENPALQTKLIELYAKY---------------------DPQKEIER 62 (140)
T ss_pred CcCCHHHHHHHHHhCCcHHHHHHHHHHHHccC--ccchhHHHHHHHHHHHH---------------------CHHHHHHH
Confidence 468899999999843 5889999999999753 47899999999999976 37899999
Q ss_pred hh-hcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhCCCc-----------hhHHHHHHHHhcCCC
Q 003405 710 LE-SISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVFLIN-----------QPVFLLIRRMAMDIK 774 (823)
Q Consensus 710 L~-~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~~-----------~a~~~~l~~~y~~~~ 774 (823)
|+ +++.||++.+++.|...++++|.++||+|+|+|++|+++++..++|++ +.+|..+++++++.+
T Consensus 63 l~~~~~~yd~~~~~~~c~~~~l~~~~~~l~~k~~~~~~Al~~~l~~~~d~~~a~~~~~~~~~~~lw~~~~~~~l~~~ 139 (140)
T smart00299 63 LDNKSNHYDIEKVGKLCEKAKLYEEAVELYKKDGNFKDAIVTLIEHLGNYEKAIEYFVKQNNPELWAEVLKALLDKP 139 (140)
T ss_pred HHhccccCCHHHHHHHHHHcCcHHHHHHHHHhhcCHHHHHHHHHHcccCHHHHHHHHHhCCCHHHHHHHHHHHHccC
Confidence 99 899999999999999999999999999999999999999999999974 347777777766654
No 10
>KOG4305 consensus RhoGEF GTPase [Signal transduction mechanisms]
Probab=99.50 E-value=1.4e-13 Score=161.57 Aligned_cols=207 Identities=27% Similarity=0.410 Sum_probs=150.6
Q ss_pred CCCeeEEEEecccCceeeEeCc-EEEEeCCCC-------cccccc-c-CCCCcEEEEeeCCCc-eEEEE-----EcCeEE
Q 003405 75 KKPILSMEVLASRQLLLSLSES-IAFHRLPNL-------ETIAVL-T-KAKGANVYSWDDRRG-FLCFA-----RQKRVC 138 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d~-l~~~~L~~l-------~~~~~i-~-~~kg~~~fa~~~~~~-~l~V~-----~kkki~ 138 (823)
+..|+|+.+.++.++++++.|. +..+.+.-+ ...... . -.+.+..|..+.+.| .++++ ..+.+.
T Consensus 752 ~~~i~q~~v~ee~~~l~~l~dk~Ly~~~l~~~~ae~~~~~~~~~~~~vl~~~v~~fk~g~~~gk~~v~~~~~~~l~~~v~ 831 (1029)
T KOG4305|consen 752 KNNISQIEVNEESKLLLLLIDKKLYYCPLSMIDAEGNIASKTSREETVLRRHVDFFKEGDCKGKILVCAVKSSVLGNTVK 831 (1029)
T ss_pred ccchhhhhhhhhccceeeehhhHHhhCCcceeeeccccccccccccchhhhhhhhhhcccccCceEEEEEeeccCCceEE
Confidence 3489999999999999999983 444443211 001000 0 112345566655555 23333 235677
Q ss_pred EEEEc----C---CC---ce--eEeeeecCCCCceEEEecCCeEEEEEcCceEEEEcCCCCeeeccCCC----------C
Q 003405 139 IFRHD----G---GR---GF--VEVKDFGVPDTVKSMSWCGENICIAIRKGYMILNATNGALSEVFPSG----------R 196 (823)
Q Consensus 139 l~~~~----~---~~---~f--~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~----------~ 196 (823)
+|+.- . +. .| +.++|+.++..+.++.|..+.+|||+.+++.++++.....+.+.++. +
T Consensus 832 i~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~s~~flk~k~~v~~~k~f~i~sl~~~~~~~l~~~~~~~~~~~~~~~ 911 (1029)
T KOG4305|consen 832 IFEFLLVISNPASGNELKKFLKVGLTDFFVDSEPVSVSFLKNKLCVGCKKGFEIVSLSNKTAESLLNPADNSPLFFEKRE 911 (1029)
T ss_pred EEechhhhcCCcchhhhhhhhhccchhccccccchhHhHhccceeeeecCCCceeccchhhhhccCCCccchHHHHhhhc
Confidence 77631 1 11 11 23557888999999999999999999999999998876655555442 2
Q ss_pred CCCCEEEEccCCeEEEEeCCeEEEEcCCCcccc-CCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeC
Q 003405 197 IGPPLVVSLLSGELLLGKENIGVFVDQNGKLLQ-ADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQ 275 (823)
Q Consensus 197 ~~~p~i~~~~~~EfLL~~~~~gvfv~~~G~~~~-~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~ 275 (823)
..+|+.+..-.+||++|+++.++|||.+|+.++ +..+.|.+.|..++.++|||+++.++.||||++. ++.++|.+.-+
T Consensus 912 ~~kp~~ifri~~~Fllcy~~~~f~vn~~G~~~~~~~~~~w~g~p~~~a~~~~yiia~~~~fIeI~~~~-t~eli~~i~~~ 990 (1029)
T KOG4305|consen 912 NTKPVAIFRISGEFLLCYDEFAFFVNDQGWRSRTSWIFLWEGEPQEFALSYPYIIAFGDNFIEIRDLE-TGELIQIILGQ 990 (1029)
T ss_pred cCceeEEEEecCeEEEEecceEEEEcCCcceecccEEEEEcCccceeeeecceEEEecCceEEEEecc-cceeeEEeecc
Confidence 356776543344999999999999999999886 4568899999999999999999999999999997 69999999888
Q ss_pred Ccccccc
Q 003405 276 NVRHLIP 282 (823)
Q Consensus 276 ~~~~l~~ 282 (823)
+++.+..
T Consensus 991 ~Ir~~~~ 997 (1029)
T KOG4305|consen 991 NIRLLTS 997 (1029)
T ss_pred ceeEeec
Confidence 8877654
No 11
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar trafficking, and contain one clathrin-type repeat. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=99.33 E-value=1.4e-13 Score=131.78 Aligned_cols=122 Identities=24% Similarity=0.386 Sum_probs=103.7
Q ss_pred CCCChHHHHHHHhh-cCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHH-HHHHH
Q 003405 631 GNIPADLVNSYLKQ-YSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTR-KKLLS 708 (823)
Q Consensus 631 ~~l~~~~Vl~~L~~-~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r-~kLl~ 708 (823)
++.+++.|+..+.+ ..+..+..|||.++..+ +..++.+||.|+.+|++. .+ ++|+.
T Consensus 6 ~~~~~~~vi~~~~~~~~~~~l~~yLe~~~~~~-~~~~~~~~~~L~~ly~~~---------------------~~~~~l~~ 63 (143)
T PF00637_consen 6 DPLEISEVISAFEERNQPEELIEYLEALVKEN-KENNPDLHTLLLELYIKY---------------------DPYEKLLE 63 (143)
T ss_dssp TTSCSCCCHHHCTTTT-GGGCTCCHHHHHHTS-TC-SHHHHHHHHHHHHCT---------------------TTCCHHHH
T ss_pred CccCHHHHHHHHHhCCCHHHHHHHHHHHHhcc-cccCHHHHHHHHHHHHhc---------------------CCchHHHH
Confidence 35667777877766 34889999999999643 456799999999999986 24 89999
Q ss_pred HhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhCCC-----------chhHHHHHHHHhcCCCC
Q 003405 709 ALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVFLI-----------NQPVFLLIRRMAMDIKP 775 (823)
Q Consensus 709 fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~-----------~~a~~~~l~~~y~~~~~ 775 (823)
||++++.||++.++++|...++++|.++||+|+|+|++|+++ +.+++|+ ++.+|..|++++++.++
T Consensus 64 ~L~~~~~yd~~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i-~~~~~~~~~a~e~~~~~~~~~l~~~l~~~~l~~~~ 140 (143)
T PF00637_consen 64 FLKTSNNYDLDKALRLCEKHGLYEEAVYLYSKLGNHDEALEI-LHKLKDYEEAIEYAKKVDDPELWEQLLKYCLDSKP 140 (143)
T ss_dssp TTTSSSSS-CTHHHHHHHTTTSHHHHHHHHHCCTTHTTCSST-SSSTHCSCCCTTTGGGCSSSHHHHHHHHHHCTSTC
T ss_pred HcccccccCHHHHHHHHHhcchHHHHHHHHHHcccHHHHHHH-HHHHccHHHHHHHHHhcCcHHHHHHHHHHHHhcCc
Confidence 999999999999999999999999999999999999999998 8888888 45699999999998754
No 12
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=98.59 E-value=7.3e-05 Score=78.01 Aligned_cols=233 Identities=18% Similarity=0.189 Sum_probs=144.2
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
+..|+|++... +.|++|+.+|.+.+|++.... ....... +..+|..+...+..+.+++.
T Consensus 9 ~~~i~~~~~~~~~~~l~~~~~~g~i~i~~~~~~~------------------~~~~~~~-~~~~i~~~~~~~~~~~l~~~ 69 (289)
T cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE------------------LLRTLKG-HTGPVRDVAASADGTYLASG 69 (289)
T ss_pred CCCEEEEEEcCCCCEEEEeecCcEEEEEEeeCCC------------------cEEEEec-CCcceeEEEECCCCCEEEEE
Confidence 46799988775 789999999999999876422 1122222 35678899999988788887
Q ss_pred eC-c-EEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeec-CCCCceEEEecC-
Q 003405 94 SE-S-IAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCG- 167 (823)
Q Consensus 94 ~d-~-l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~- 167 (823)
+. + +.+|++.+.+....+... ..+.++++.++...++++. .+.+.+|.....+ ....+. .++.+.++.|..
T Consensus 70 ~~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~ 146 (289)
T cd00200 70 SSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGK---CLTTLRGHTDWVNSVAFSPD 146 (289)
T ss_pred cCCCeEEEEEcCcccceEEEeccCCcEEEEEEcCCCCEEEEecCCCeEEEEECCCcE---EEEEeccCCCcEEEEEEcCc
Confidence 74 5 999999765444333322 3677788877654566665 7788888876332 222232 346789999985
Q ss_pred -CeEEEEE-cCceEEEEcCCCCeeeccCCCCCCCCEEEEccCC-eEEEEe-CCeEEEEcCC-CccccCCce-eecCCCcE
Q 003405 168 -ENICIAI-RKGYMILNATNGALSEVFPSGRIGPPLVVSLLSG-ELLLGK-ENIGVFVDQN-GKLLQADRI-CWSEAPIA 241 (823)
Q Consensus 168 -~~i~v~~-~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~-EfLL~~-~~~gvfv~~~-G~~~~~~~i-~w~~~P~~ 241 (823)
..++.+. .....++|+.+++....+.........+...+++ .++++. ++...+++.. |... ..+ .....+..
T Consensus 147 ~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~--~~~~~~~~~i~~ 224 (289)
T cd00200 147 GTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL--GTLRGHENGVNS 224 (289)
T ss_pred CCEEEEEcCCCcEEEEEccccccceeEecCccccceEEECCCcCEEEEecCCCcEEEEECCCCcee--cchhhcCCceEE
Confidence 4566666 5678899998776554444322112234444555 444444 4555566654 3332 223 22334455
Q ss_pred EEEeC--CEEEEEe-CCeEEEEEccCCCceeEEEe
Q 003405 242 VIIQK--PYAIALL-PRRVEVRSLRVPYALIQTIV 273 (823)
Q Consensus 242 v~~~~--PYll~~~-~~~ieV~~l~~~~~lvQ~i~ 273 (823)
+.+.. .++++.. .+.+.++++. ++...+.+.
T Consensus 225 ~~~~~~~~~~~~~~~~~~i~i~~~~-~~~~~~~~~ 258 (289)
T cd00200 225 VAFSPDGYLLASGSEDGTIRVWDLR-TGECVQTLS 258 (289)
T ss_pred EEEcCCCcEEEEEcCCCcEEEEEcC-CceeEEEcc
Confidence 55543 3555555 5789999986 466666665
No 13
>KOG0985 consensus Vesicle coat protein clathrin, heavy chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.58 E-value=4.1e-06 Score=97.04 Aligned_cols=334 Identities=16% Similarity=0.186 Sum_probs=221.0
Q ss_pred EEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchH----------hhhh-------cHHHHHHHHHHHHHccCCHHH
Q 003405 295 IFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDAS----------LRAA-------KEGSIHIRFAHYLFDTGSYEE 357 (823)
Q Consensus 295 I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~----------~~~~-------~~~~i~~~~a~~lf~~~~f~~ 357 (823)
|-+|.-.+..+-.+.-++.+.||||++..+.+.-..+. ++++ ....+..+.|-..+..+...+
T Consensus 1043 I~rLdnyDa~~ia~iai~~~LyEEAF~ifkkf~~n~~A~~VLie~i~~ldRA~efAe~~n~p~vWsqlakAQL~~~~v~d 1122 (1666)
T KOG0985|consen 1043 INRLDNYDAPDIAEIAIENQLYEEAFAIFKKFDMNVSAIQVLIENIGSLDRAYEFAERCNEPAVWSQLAKAQLQGGLVKD 1122 (1666)
T ss_pred HHHhccCCchhHHHHHhhhhHHHHHHHHHHHhcccHHHHHHHHHHhhhHHHHHHHHHhhCChHHHHHHHHHHHhcCchHH
Confidence 55666667666666678899999999999876311111 1100 113456667777888888999
Q ss_pred HHHHHHhcCCCHH---HHHHhCCCCCCCCCcCCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCCCCCcchhhhhhhhh
Q 003405 358 AMEHFLASQVDIT---YALSLYPSIVLPKTTVVPEPERLLDISSDAPSLSRGSSGMSDDMESSPPAQLSELDENATLKSK 434 (823)
Q Consensus 358 A~~~f~~~~~dP~---~vi~Lfp~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~v~~~~~~~~p~~l~~~d~~~~le~~ 434 (823)
|++-|++++ ||+ +||..-.
T Consensus 1123 AieSyikad-Dps~y~eVi~~a~--------------------------------------------------------- 1144 (1666)
T KOG0985|consen 1123 AIESYIKAD-DPSNYLEVIDVAS--------------------------------------------------------- 1144 (1666)
T ss_pred HHHHHHhcC-CcHHHHHHHHHHH---------------------------------------------------------
Confidence 999988865 221 2221100
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhccCCchhHhhhcccCCCcCCCccccccCCCCCCCCCccccHHHHHHHHHHHHHHH
Q 003405 435 KMSHNTLMALIKFLQKKRSSIIEKATAEGTEEVVLDAVGDNFTSHDSTRFKKSSKGRGTIPMYSGAREMAAILDTALLQA 514 (823)
Q Consensus 435 ~~~~~a~~~L~~yL~~~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vDT~Ll~~ 514 (823)
+ -.....|++||.-.|++... .-||+.|+-+
T Consensus 1145 ~--~~~~edLv~yL~MaRkk~~E-----------------------------------------------~~id~eLi~A 1175 (1666)
T KOG0985|consen 1145 R--TGKYEDLVKYLLMARKKVRE-----------------------------------------------PYIDSELIFA 1175 (1666)
T ss_pred h--cCcHHHHHHHHHHHHHhhcC-----------------------------------------------ccchHHHHHH
Confidence 0 02466799999888776421 1589999999
Q ss_pred HHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHH
Q 003405 515 LLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESI 594 (823)
Q Consensus 515 y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~ 594 (823)
|.++++-..+..|+.+||..++..+-+.|-+.++|++.-.+|..-.++.+--..+..+++-. -+
T Consensus 1176 yAkt~rl~elE~fi~gpN~A~i~~vGdrcf~~~~y~aAkl~y~~vSN~a~La~TLV~LgeyQ----------------~A 1239 (1666)
T KOG0985|consen 1176 YAKTNRLTELEEFIAGPNVANIQQVGDRCFEEKMYEAAKLLYSNVSNFAKLASTLVYLGEYQ----------------GA 1239 (1666)
T ss_pred HHHhchHHHHHHHhcCCCchhHHHHhHHHhhhhhhHHHHHHHHHhhhHHHHHHHHHHHHHHH----------------HH
Confidence 99999988899999999999999999999999999999999999999988777777775311 11
Q ss_pred HHHhhcCCCCChhhHHHhhhhh------hhcCcccccccccc-CCCChHHHHHHHhhcC-chhHHHHHHHHhhcccCCCC
Q 003405 595 IEYLKPLCGTDPMLVLEFSMLV------LESCPTQTIELFLS-GNIPADLVNSYLKQYS-PSMQGRYLELMLAMNENSIS 666 (823)
Q Consensus 595 i~yL~~L~~~~~~li~~y~~wl------l~~~p~~~~~if~~-~~l~~~~Vl~~L~~~~-~~~~~~YLE~li~~~~~~~~ 666 (823)
++--++-.. --.|+-.-.- ++..+-.|+.|.+. +. -++++++-+..+ -+.++.-||.-..- ....
T Consensus 1240 VD~aRKAns---~ktWK~VcfaCvd~~EFrlAQiCGL~iivhade--Leeli~~Yq~rGyFeElIsl~Ea~LGL--ERAH 1312 (1666)
T KOG0985|consen 1240 VDAARKANS---TKTWKEVCFACVDKEEFRLAQICGLNIIVHADE--LEELIEYYQDRGYFEELISLLEAGLGL--ERAH 1312 (1666)
T ss_pred HHHhhhccc---hhHHHHHHHHHhchhhhhHHHhcCceEEEehHh--HHHHHHHHHhcCcHHHHHHHHHhhhch--hHHH
Confidence 211111110 0111111000 11112235555443 11 122333333211 12233333332211 1134
Q ss_pred hhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhh-cCCCChHHHhccCCCCchhhHHHHHhhccccHH
Q 003405 667 GNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALES-ISGYNPEVLLKRLPADALYEERAILLGKMNQHE 745 (823)
Q Consensus 667 ~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~-s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~ 745 (823)
-.+.|+|+.+|..+ .+.|++..|+- .+.-+..++++.|...-++.|.++||.+-..++
T Consensus 1313 MgmfTELaiLYsky---------------------kp~km~EHl~LFwsRvNipKviRA~eqahlW~ElvfLY~~y~eyD 1371 (1666)
T KOG0985|consen 1313 MGMFTELAILYSKY---------------------KPEKMMEHLKLFWSRVNIPKVIRAAEQAHLWSELVFLYDKYEEYD 1371 (1666)
T ss_pred HHHHHHHHHHHHhc---------------------CHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHhhhhhh
Confidence 56889999999988 36888888887 788999999999988889999999999999998
Q ss_pred HHHHHHHHH---------hCCC-----chhHHHHHHHHhcCCCCCcch
Q 003405 746 LALSLYVHK---------VFLI-----NQPVFLLIRRMAMDIKPLVTE 779 (823)
Q Consensus 746 ~AL~ilv~~---------L~D~-----~~a~~~~l~~~y~~~~~~~~~ 779 (823)
.|.-..+.. .+|+ +--+||-..+.|++.+|+...
T Consensus 1372 NAa~tmm~h~teaw~~~~FKdii~kVaNvElyYkAi~FYl~~~P~lln 1419 (1666)
T KOG0985|consen 1372 NAALTMMEHPTEAWDHGQFKDIITKVANVELYYKAIQFYLDFHPLLLN 1419 (1666)
T ss_pred HHHHHHHhCChhhhhhhhHHHHHHHHhhHHHHHHHHHHHHHhChHHHH
Confidence 887766643 2222 346999999999999988774
No 14
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=98.35 E-value=2.9e-05 Score=79.87 Aligned_cols=166 Identities=19% Similarity=0.299 Sum_probs=120.3
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCC---------------CCC--------ccc---ccccccceeeee
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRS---------------PPS--------DYQ---SLRKESYELERT 69 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~---------------~~~--------d~~---~l~~~~~~l~~~ 69 (823)
...|+|+++.+.++.=|.+|-+|++|++.....-+. +.+ |.. .-....|++..+
T Consensus 43 ~~sitavAVs~~~~aSGssDetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~s 122 (362)
T KOG0294|consen 43 AGSITALAVSGPYVASGSSDETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKS 122 (362)
T ss_pred ccceeEEEecceeEeccCCCCcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeee
Confidence 357999999999999999999999999987654221 111 111 125667888888
Q ss_pred ecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCc
Q 003405 70 ISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRG 147 (823)
Q Consensus 70 ~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~ 147 (823)
+++. +.+|+.|.+=|...+.+++++ + +..|+|-..+......-.+..+.+.+.++..+++|+.+++|-+|+.+..
T Consensus 123 lK~H-~~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~~at~v~w~~~Gd~F~v~~~~~i~i~q~d~A-- 199 (362)
T KOG0294|consen 123 LKAH-KGQVTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVLNLKNKATLVSWSPQGDHFVVSGRNKIDIYQLDNA-- 199 (362)
T ss_pred eccc-ccccceeEecCCCceEEEEcCCceeeeehhhcCccceeeccCCcceeeEEcCCCCEEEEEeccEEEEEecccH--
Confidence 8765 556999999999999999987 4 9999997655332222233456677777667899999999999999843
Q ss_pred eeEeeeecCCCCceEEEec-CCeEEEEEcCce-EEEEcCC
Q 003405 148 FVEVKDFGVPDTVKSMSWC-GENICIAIRKGY-MILNATN 185 (823)
Q Consensus 148 f~~~kei~~~~~~~~l~~~-~~~i~v~~~~~y-~lidl~~ 185 (823)
..++++..|-.+.++.|. ++.+++|-.++. .+.|.++
T Consensus 200 -~v~~~i~~~~r~l~~~~l~~~~L~vG~d~~~i~~~D~ds 238 (362)
T KOG0294|consen 200 -SVFREIENPKRILCATFLDGSELLVGGDNEWISLKDTDS 238 (362)
T ss_pred -hHhhhhhccccceeeeecCCceEEEecCCceEEEeccCC
Confidence 345677777778888887 566888877643 4445554
No 15
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=98.25 E-value=0.00049 Score=71.67 Aligned_cols=186 Identities=15% Similarity=0.187 Sum_probs=117.3
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..+.++.... +.+++|+.+|.+.+|++.... ....+.. +..+|..+...+..+++++.+
T Consensus 52 ~~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~------------------~~~~~~~-~~~~i~~~~~~~~~~~~~~~~ 112 (289)
T cd00200 52 GPVRDVAASADGTYLASGSSDKTIRLWDLETGE------------------CVRTLTG-HTSYVSSVAFSPDGRILSSSS 112 (289)
T ss_pred cceeEEEECCCCCEEEEEcCCCeEEEEEcCccc------------------ceEEEec-cCCcEEEEEEcCCCCEEEEec
Confidence 3554554443 489999999999999976421 1122222 245899999998866666666
Q ss_pred -Cc-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeecC-CCCceEEEecCC-
Q 003405 95 -ES-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWCGE- 168 (823)
Q Consensus 95 -d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~~~- 168 (823)
|+ +.+|++.+.+....+. ....++.++++++...++++. .+.+.+|....++. .+.+.. .+.+.++.|..+
T Consensus 113 ~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~---~~~~~~~~~~i~~~~~~~~~ 189 (289)
T cd00200 113 RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC---VATLTGHTGEVNSVAFSPDG 189 (289)
T ss_pred CCCeEEEEECCCcEEEEEeccCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEcccccc---ceeEecCccccceEEECCCc
Confidence 55 9999998665554443 233578888888755666666 78888888764322 222222 347899999854
Q ss_pred -eEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEe--CCeEEEEcCC
Q 003405 169 -NICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGK--ENIGVFVDQN 224 (823)
Q Consensus 169 -~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~--~~~gvfv~~~ 224 (823)
.++++.. ....++|+.+++....+.........+...+++.++++. ++...+++..
T Consensus 190 ~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~ 249 (289)
T cd00200 190 EKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLR 249 (289)
T ss_pred CEEEEecCCCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcC
Confidence 6777764 668899998876655552211111123344556677654 4555556654
No 16
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=98.24 E-value=0.0012 Score=76.01 Aligned_cols=80 Identities=25% Similarity=0.292 Sum_probs=66.8
Q ss_pred HHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCC
Q 003405 511 LLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFN 590 (823)
Q Consensus 511 Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~ 590 (823)
++.+|++.+.+ .|.+|++..++|+++.+.++|.+.|+|++++++..+-|++.+||.+...- +.+
T Consensus 613 ~I~LYAEyDrk-~LLPFLr~s~~Y~lekA~eiC~q~~~~~E~VYlLgrmGn~k~AL~lII~e---------------l~d 676 (846)
T KOG2066|consen 613 QIELYAEYDRK-KLLPFLRKSQNYNLEKALEICSQKNFYEELVYLLGRMGNAKEALKLIINE---------------LRD 676 (846)
T ss_pred HHHHHHHHhHh-hhhHHHHhcCCCCHHHHHHHHHhhCcHHHHHHHHHhhcchHHHHHHHHHH---------------hhC
Confidence 44668888764 67799999999999999999999999999999999999999999998752 245
Q ss_pred hHHHHHHhhcCCCCChhh
Q 003405 591 PESIIEYLKPLCGTDPML 608 (823)
Q Consensus 591 ~~~~i~yL~~L~~~~~~l 608 (823)
++.||+|.+. ..|.||
T Consensus 677 ie~AIefvKe--q~D~eL 692 (846)
T KOG2066|consen 677 IEKAIEFVKE--QDDSEL 692 (846)
T ss_pred HHHHHHHHHh--cCCHHH
Confidence 6899999984 335444
No 17
>KOG0576 consensus Mitogen-activated protein kinase kinase kinase kinase (MAP4K), germinal center kinase family [Signal transduction mechanisms]
Probab=98.01 E-value=4.1e-05 Score=86.40 Aligned_cols=240 Identities=18% Similarity=0.275 Sum_probs=152.2
Q ss_pred ccccCCCCcEEEEEEe------CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEE
Q 003405 10 ELISNCSPKIDAVASY------GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 10 ~l~~~~~~~I~ci~~~------~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
.+....|.+|.|+++| ++.+.+|+++| |+..++++.... +. .+ .| ...++=+.+
T Consensus 485 KvfngCpl~i~~aaswIhp~t~dq~ll~gaeeg-Iy~lnlnel~e~-------------~l--e~---l~-~~r~Twly~ 544 (829)
T KOG0576|consen 485 KVFNGCPLRIHCAASWIHPSTRDQALLFGAEEG-IYTLNLNELHEA-------------TL--EK---LF-PRRCTWLYV 544 (829)
T ss_pred HHhccCcccceecccccCcchhhhHhhhhhccc-eeeccccccccc-------------cH--hh---cc-ccCceEEEe
Confidence 3455667899999999 36899999999 455655542211 11 11 11 234444444
Q ss_pred ecccCceeeEeCc---EEEEeCCC--------------------------CcccccccCCCCcEEEEeeCCC----ceEE
Q 003405 84 LASRQLLLSLSES---IAFHRLPN--------------------------LETIAVLTKAKGANVYSWDDRR----GFLC 130 (823)
Q Consensus 84 ~~~~~~Ll~l~d~---l~~~~L~~--------------------------l~~~~~i~~~kg~~~fa~~~~~----~~l~ 130 (823)
++ |.|..+++. ++-|++.. +....+++++|||...||-.+. .++|
T Consensus 545 ~~--n~l~slsgks~~ly~H~l~~l~~~~~~~~~~s~~~h~~per~~prk~a~stkipeTkgc~~c~V~R~~~~g~~~lc 622 (829)
T KOG0576|consen 545 IN--NVLTSLSGKSTQLYSHDLGGLFEAGEGTLFGSIIVHKEPERILPRKFALSTKIPETKGCQQCCVVRNPYTGGKFLC 622 (829)
T ss_pred cC--ceeeeccCCccceeecchHHHHhhhcccccccccccCCCccccchhhceeeecCccccceeeeeecCCCCCCceee
Confidence 43 445555541 44444421 1223457889999999987663 2689
Q ss_pred EEEcCeEEEEEEcC-CCceeEeeeec--CCCCceEEEec------CCeEEEEEcCc---------eEEEEcCCCCeeecc
Q 003405 131 FARQKRVCIFRHDG-GRGFVEVKDFG--VPDTVKSMSWC------GENICIAIRKG---------YMILNATNGALSEVF 192 (823)
Q Consensus 131 V~~kkki~l~~~~~-~~~f~~~kei~--~~~~~~~l~~~------~~~i~v~~~~~---------y~lidl~~~~~~~L~ 192 (823)
-+....+.+.+|.. -..|-.+|.|. +|.+....+.. +..+|+|...+ |...+......-.+.
T Consensus 623 ~alp~sivl~qwy~Pm~kf~l~k~i~~pl~~p~~~f~~l~~~~~e~p~vc~Gv~~~~~~~~~~v~f~~~~~~~~~~w~~~ 702 (829)
T KOG0576|consen 623 GALPTSIVLLQWYEPMNKFMLVKSISFPLPSPLSVFEMLVLPESEYPQVCVGVSAGGGTLNNEVLFHTAFLNSDSSWDIE 702 (829)
T ss_pred cccCceeEEeeecChHHhhhHHHhcccCCCCccchhhhccccCcccceeeeeccCCCCCCCceeEEEeccccccccccee
Confidence 99999999999975 22465566543 33322222221 35799997732 112222222222222
Q ss_pred CCCCC-CCCEEEEccCCeEEEEeCCeEEEEcCCCccc----cCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCc
Q 003405 193 PSGRI-GPPLVVSLLSGELLLGKENIGVFVDQNGKLL----QADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYA 267 (823)
Q Consensus 193 ~~~~~-~~p~i~~~~~~EfLL~~~~~gvfv~~~G~~~----~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~ 267 (823)
..+.. ..|.+..+..+-+++|++++.-.++.+|++. ..+.+.|+..|.++++...-++++.+.+++.+++. +..
T Consensus 703 ~~~~~~~v~~v~qvdrd~I~v~~~n~V~~v~lqG~~~~~~~~~sel~f~f~iesv~~~~gsvlaf~~hgvqgr~l~-S~~ 781 (829)
T KOG0576|consen 703 AAGETLPVPQVTQVDRDTILVLFENMVKIVNLQGNGKVAVKLLSELTFDFDIESVVCLQGSVLAFWKHGVQGRSLT-SNE 781 (829)
T ss_pred ccCcccCCceeEEecccceEeeecCeeEEEeccCCccccccccccccccCCcceEEeeCCceecccCCcceeeecc-chH
Confidence 22221 2456777788899999999999999999643 13678899999999999999999999999999997 355
Q ss_pred eeEEE
Q 003405 268 LIQTI 272 (823)
Q Consensus 268 lvQ~i 272 (823)
+-|.|
T Consensus 782 vtqei 786 (829)
T KOG0576|consen 782 VTQEI 786 (829)
T ss_pred HHHHH
Confidence 55554
No 18
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=97.88 E-value=0.0036 Score=70.76 Aligned_cols=238 Identities=18% Similarity=0.175 Sum_probs=147.7
Q ss_pred CCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 15 CSPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 15 ~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
.|..|+|++--. +.|.+|=++|.|-+|++.. .|-+...+.+-..+.|+.|.-. +.+.|++
T Consensus 24 ~Ps~I~slA~s~kS~~lAvsRt~g~IEiwN~~~-----------------~w~~~~vi~g~~drsIE~L~W~-e~~RLFS 85 (691)
T KOG2048|consen 24 KPSEIVSLAYSHKSNQLAVSRTDGNIEIWNLSN-----------------NWFLEPVIHGPEDRSIESLAWA-EGGRLFS 85 (691)
T ss_pred eccceEEEEEeccCCceeeeccCCcEEEEccCC-----------------CceeeEEEecCCCCceeeEEEc-cCCeEEe
Confidence 378999988664 6899999999999998764 3444455555556789999888 4578888
Q ss_pred EeC-c-EEEEeCCCCcccccccCCCCcE-EEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecC---CCCceEEEec
Q 003405 93 LSE-S-IAFHRLPNLETIAVLTKAKGAN-VYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGV---PDTVKSMSWC 166 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~~~~~i~~~kg~~-~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~---~~~~~~l~~~ 166 (823)
.+- | |.-||+.++++...+...-|+- ++|+++....++|++...+ +|-+..+.. +...+..+ .+.+.+++|.
T Consensus 86 ~g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~~~~l~IgcddGv-l~~~s~~p~-~I~~~r~l~rq~sRvLslsw~ 163 (691)
T KOG2048|consen 86 SGLSGSITEWDLHTLKQKYNIDSNGGAIWSIAINPENTILAIGCDDGV-LYDFSIGPD-KITYKRSLMRQKSRVLSLSWN 163 (691)
T ss_pred ecCCceEEEEecccCceeEEecCCCcceeEEEeCCccceEEeecCCce-EEEEecCCc-eEEEEeecccccceEEEEEec
Confidence 765 5 9999999998877654444433 5788888888999988875 444432211 11122222 2678899998
Q ss_pred CC--eEEEEEcCc-eEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeCCeEEEEcCCCccccCCceeecCCCcEEE
Q 003405 167 GE--NICIAIRKG-YMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKENIGVFVDQNGKLLQADRICWSEAPIAVI 243 (823)
Q Consensus 167 ~~--~i~v~~~~~-y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~~gvfv~~~G~~~~~~~i~w~~~P~~v~ 243 (823)
++ .|+.|+..+ ..+.|..+++..-+... ..+|-..++++|.|+ +.
T Consensus 164 ~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~---------------------------~~d~l~k~~~~iVWS-----v~ 211 (691)
T KOG2048|consen 164 PTGTKIAGGSIDGVIRIWDVKSGQTLHIITM---------------------------QLDRLSKREPTIVWS-----VL 211 (691)
T ss_pred CCccEEEecccCceEEEEEcCCCceEEEeee---------------------------cccccccCCceEEEE-----EE
Confidence 54 478888877 67778776643221110 111211124566663 22
Q ss_pred EeCCEEEEE-eC-CeEEEEEccCCCceeEEEeeCCcccc--c--ccCCeEEEec-cceEEEeeccChhH
Q 003405 244 IQKPYAIAL-LP-RRVEVRSLRVPYALIQTIVLQNVRHL--I--PSSNAVVVAL-ENSIFGLFPVPLGA 305 (823)
Q Consensus 244 ~~~PYll~~-~~-~~ieV~~l~~~~~lvQ~i~l~~~~~l--~--~~~~~v~v~s-~~~I~~l~~~~~~~ 305 (823)
+-.+=.++- .+ +.|.+.+.. .+.++|+..+-+...+ . ..++.++++. +..|+++...+-..
T Consensus 212 ~Lrd~tI~sgDS~G~V~FWd~~-~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~~ii~~~~~~~~~ 279 (691)
T KOG2048|consen 212 FLRDSTIASGDSAGTVTFWDSI-FGTLIQSHSCHDADVLALAVADNEDRVFSAGVDPKIIQYSLTTNKS 279 (691)
T ss_pred EeecCcEEEecCCceEEEEccc-CcchhhhhhhhhcceeEEEEcCCCCeEEEccCCCceEEEEecCCcc
Confidence 223322222 22 567777776 4788888765544332 2 2234555543 66777777665433
No 19
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.85 E-value=0.0038 Score=69.90 Aligned_cols=267 Identities=15% Similarity=0.141 Sum_probs=150.5
Q ss_pred cEEEEEEeC-C---EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 18 KIDAVASYG-L---KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 18 ~I~ci~~~~-~---~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
-|+||+-+. + +|+-|..|-++.+|+-+. -.++++..++ ...|+....-|+..++++.
T Consensus 185 GVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQt------------------k~CV~TLeGH-t~Nvs~v~fhp~lpiiisg 245 (794)
T KOG0276|consen 185 GVNCVDYYTGGDKPYLISGADDLTIKVWDYQT------------------KSCVQTLEGH-TNNVSFVFFHPELPIIISG 245 (794)
T ss_pred CcceEEeccCCCcceEEecCCCceEEEeecch------------------HHHHHHhhcc-cccceEEEecCCCcEEEEe
Confidence 589999885 3 899999999999998542 1233344443 4689999999999999999
Q ss_pred eC-c-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeE
Q 003405 94 SE-S-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENI 170 (823)
Q Consensus 94 ~d-~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i 170 (823)
++ | +++|.-.+++.-..+. ....+=+++..+..+.++||-.....++....+ +|..+|.-.| .|
T Consensus 246 sEDGTvriWhs~Ty~lE~tLn~gleRvW~I~~~k~~~~i~vG~Deg~i~v~lgre------------eP~vsMd~~g-KI 312 (794)
T KOG0276|consen 246 SEDGTVRIWNSKTYKLEKTLNYGLERVWCIAAHKGDGKIAVGFDEGSVTVKLGRE------------EPAVSMDSNG-KI 312 (794)
T ss_pred cCCccEEEecCcceehhhhhhcCCceEEEEeecCCCCeEEEeccCCcEEEEccCC------------CCceeecCCc-cE
Confidence 97 5 9999987776443321 112233444445556778887776666666532 2344444333 67
Q ss_pred EEEEcCceEEEEcCC---------CCeeeccC--CCC-CCC----------CEEEEccCCeEEEEeCCeEEEEcCCCccc
Q 003405 171 CIAIRKGYMILNATN---------GALSEVFP--SGR-IGP----------PLVVSLLSGELLLGKENIGVFVDQNGKLL 228 (823)
Q Consensus 171 ~v~~~~~y~lidl~~---------~~~~~L~~--~~~-~~~----------p~i~~~~~~EfLL~~~~~gvfv~~~G~~~ 228 (823)
+++..++..-.++.+ |+..+|-. .|. ... ..++..+++||++-. .++.--..-|.
T Consensus 313 iwa~~~ei~~~~~ks~~~~~ev~DgErL~LsvKeLgs~eiyPq~L~hsPNGrfV~VcgdGEyiIyT-ala~RnK~fG~-- 389 (794)
T KOG0276|consen 313 IWAVHSEIQAVNLKSVGAQKEVTDGERLPLSVKELGSVEIYPQTLAHSPNGRFVVVCGDGEYIIYT-ALALRNKAFGS-- 389 (794)
T ss_pred EEEcCceeeeeeceeccCcccccCCccccchhhhccccccchHHhccCCCCcEEEEecCccEEEEE-eeehhhccccc--
Confidence 777777766666543 22211110 010 001 123344555555411 00000000111
Q ss_pred cCCceeecCCC----------------------------cEEEEeCCEEEEEeC-CeEEEEEccCCCceeEEEeeCCccc
Q 003405 229 QADRICWSEAP----------------------------IAVIIQKPYAIALLP-RRVEVRSLRVPYALIQTIVLQNVRH 279 (823)
Q Consensus 229 ~~~~i~w~~~P----------------------------~~v~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~~~~~ 279 (823)
.-.+.|...| ..-.+..|+++++.. +++-++++. ++.+|+.|.... +.
T Consensus 390 -~~eFvw~~dsne~avRes~~~vki~knfke~ksi~~~~~~e~i~gg~Llg~~ss~~~~fydW~-~~~lVrrI~v~~-k~ 466 (794)
T KOG0276|consen 390 -GLEFVWAADSNEFAVRESNGNVKIFKNFKEHKSIRPDMSAEGIFGGPLLGVRSSDFLCFYDWE-SGELVRRIEVTS-KH 466 (794)
T ss_pred -ceeEEEcCCCCeEEEEecCCceEEEecceeccccccccceeeecCCceEEEEeCCeEEEEEcc-cceEEEEEeecc-ce
Confidence 0112222221 122345677777765 789999996 599999998775 34
Q ss_pred ccccCC--eEEEeccceEEEeeccChhHHHHHHHhcCC------HHHHHHHhh
Q 003405 280 LIPSSN--AVVVALENSIFGLFPVPLGAQIVQLTASGD------FEEALALCK 324 (823)
Q Consensus 280 l~~~~~--~v~v~s~~~I~~l~~~~~~~qI~~Ll~~~~------~e~Al~L~~ 324 (823)
+.-.++ -+-++++.+.|.|.-- .+.+...++.|. +++|+..+-
T Consensus 467 v~w~d~g~lVai~~d~Sfyil~~n--~d~v~~a~e~g~~v~eeGiedAfevLg 517 (794)
T KOG0276|consen 467 VYWSDNGELVAIAGDDSFYILKFN--ADAVANAVEQGIEVTEEGIEDAFEVLG 517 (794)
T ss_pred eEEecCCCEEEEEecCceeEEEec--HHHHHHHHhcCCCCcchhHHHHHHHHh
Confidence 443333 3445677775555433 345555555443 566666654
No 20
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.82 E-value=0.011 Score=67.72 Aligned_cols=222 Identities=18% Similarity=0.282 Sum_probs=147.7
Q ss_pred CCcEEEEEEe--CCEEEEEeCC-CcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAVASY--GLKILLGCSD-GSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci~~~--~~~L~vGT~~-G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
..+|..+... |+.|.+|+.. |.|++|.-..+ .|.+.++ ++ -..|+.+.+-|+.+++++
T Consensus 307 ~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsE----------------sYVlKQQ--gH-~~~i~~l~YSpDgq~iaT 367 (893)
T KOG0291|consen 307 DQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSE----------------SYVLKQQ--GH-SDRITSLAYSPDGQLIAT 367 (893)
T ss_pred cceeeEEEecccCCEEEEcCCccceEEEEEeecc----------------ceeeecc--cc-ccceeeEEECCCCcEEEe
Confidence 3577777666 8999999876 99999986542 4555322 32 468999999999999998
Q ss_pred EeC-c-EEEEeCCC-CcccccccCCCCcEEEEeeCCCc-eEEEEEcCeEEEEEEcCCCceeEeeeecCCCCce--EEEec
Q 003405 93 LSE-S-IAFHRLPN-LETIAVLTKAKGANVYSWDDRRG-FLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVK--SMSWC 166 (823)
Q Consensus 93 l~d-~-l~~~~L~~-l~~~~~i~~~kg~~~fa~~~~~~-~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~--~l~~~ 166 (823)
=+| + |++|+..+ +=.++--+.+.|++.+++..... .++......+..+.....+.| |.+..|.+++ +++..
T Consensus 368 G~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrNf---RTft~P~p~QfscvavD 444 (893)
T KOG0291|consen 368 GAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRNF---RTFTSPEPIQFSCVAVD 444 (893)
T ss_pred ccCCCcEEEEeccCceEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEeeeeccccee---eeecCCCceeeeEEEEc
Confidence 887 5 99999754 22222223456788888766543 345568889988888754444 5667787776 44444
Q ss_pred --CCeEEEEEcCceE--EEEcCCCCeeeccCCCCCCCCEE--EEccCCeEEEE--eCCeEE---EEcCCCccccCCceee
Q 003405 167 --GENICIAIRKGYM--ILNATNGALSEVFPSGRIGPPLV--VSLLSGELLLG--KENIGV---FVDQNGKLLQADRICW 235 (823)
Q Consensus 167 --~~~i~v~~~~~y~--lidl~~~~~~~L~~~~~~~~p~i--~~~~~~EfLL~--~~~~gv---fv~~~G~~~~~~~i~w 235 (823)
|..+|.|....|. +.+++||+..+++.-.+. |+. +..+.+..|.. .|...= .++..|. ..+++-
T Consensus 445 ~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEg--PVs~l~f~~~~~~LaS~SWDkTVRiW~if~s~~~---vEtl~i 519 (893)
T KOG0291|consen 445 PSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEG--PVSGLSFSPDGSLLASGSWDKTVRIWDIFSSSGT---VETLEI 519 (893)
T ss_pred CCCCEEEeeccceEEEEEEEeecCeeeehhcCCCC--cceeeEEccccCeEEeccccceEEEEEeeccCce---eeeEee
Confidence 7889999998775 468999999888865432 443 23355565552 333221 2344454 467777
Q ss_pred cCCCcEEEEe--CCEEEEEe-CCeEEEEEccC
Q 003405 236 SEAPIAVIIQ--KPYAIALL-PRRVEVRSLRV 264 (823)
Q Consensus 236 ~~~P~~v~~~--~PYll~~~-~~~ieV~~l~~ 264 (823)
......+.+. .-=|.+.+ ++.|.+++..+
T Consensus 520 ~sdvl~vsfrPdG~elaVaTldgqItf~d~~~ 551 (893)
T KOG0291|consen 520 RSDVLAVSFRPDGKELAVATLDGQITFFDIKE 551 (893)
T ss_pred ccceeEEEEcCCCCeEEEEEecceEEEEEhhh
Confidence 7777777776 33444444 56888888763
No 21
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=97.82 E-value=0.0015 Score=74.38 Aligned_cols=281 Identities=12% Similarity=0.150 Sum_probs=167.5
Q ss_pred CcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-
Q 003405 17 PKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE- 95 (823)
Q Consensus 17 ~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d- 95 (823)
.+.+|+++.++++.+|++.|.|+.|+-.... . +..++-.+..|+-...+...+.+++.+.
T Consensus 36 v~lTc~dst~~~l~~GsS~G~lyl~~R~~~~-----------------~--~~~~~~~~~~~~~~~~vs~~e~lvAagt~ 96 (726)
T KOG3621|consen 36 VKLTCVDATEEYLAMGSSAGSVYLYNRHTGE-----------------M--RKLKNEGATGITCVRSVSSVEYLVAAGTA 96 (726)
T ss_pred EEEEEeecCCceEEEecccceEEEEecCchh-----------------h--hcccccCccceEEEEEecchhHhhhhhcC
Confidence 4789999999999999999999999733211 1 1112212344555555666666666654
Q ss_pred -c-EEEEeCCCCcc-----cccccC--CCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCcee-Eeee-ecCCCCceEE
Q 003405 96 -S-IAFHRLPNLET-----IAVLTK--AKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFV-EVKD-FGVPDTVKSM 163 (823)
Q Consensus 96 -~-l~~~~L~~l~~-----~~~i~~--~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~-~~ke-i~~~~~~~~l 163 (823)
| |.++.+.+-.+ .+...+ ..-|++.+++.+.-++++| .+++|.+.+++....|. ...+ ...+..|..+
T Consensus 97 ~g~V~v~ql~~~~p~~~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s~~~~~~~~q~il~~ds~IVQl 176 (726)
T KOG3621|consen 97 SGRVSVFQLNKELPRDLDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDSRQAFLSKSQEILSEDSEIVQL 176 (726)
T ss_pred CceEEeehhhccCCCcceeeccccccCCceEEEEEecccccEEeecCCCceEEEEEechhhhhccccceeeccCcceEEe
Confidence 4 78887754111 111122 2357788888887789999 66777777776421221 2223 4567888899
Q ss_pred EecCCeEEEEEcCceEEEEcCCCCeeeccCCCCCC-CCE-EEEcc-----CCeEEEE-eCCe-EEEEcCCCcccc-----
Q 003405 164 SWCGENICIAIRKGYMILNATNGALSEVFPSGRIG-PPL-VVSLL-----SGELLLG-KENI-GVFVDQNGKLLQ----- 229 (823)
Q Consensus 164 ~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~-~p~-i~~~~-----~~EfLL~-~~~~-gvfv~~~G~~~~----- 229 (823)
......+.|++-..-.+++++.++++.+=.-.+.+ .++ .|.++ ..-++.| +.+. ..-+|.+|...+
T Consensus 177 D~~q~~LLVStl~r~~Lc~tE~eti~QIG~k~R~~~~~~GACF~~g~~~~q~~~IycaRPG~RlWead~~G~V~~Thqfk 256 (726)
T KOG3621|consen 177 DYLQSYLLVSTLTRCILCQTEAETITQIGKKPRKSLIDFGACFFPGQCKAQKPQIYCARPGLRLWEADFAGEVIKTHQFK 256 (726)
T ss_pred ecccceehHhhhhhhheeecchhHHHHhcCCCcCCccccceEEeeccccCCCceEEEecCCCceEEeecceeEEEeeehh
Confidence 88888899998877778888766544432211111 111 22222 2234443 3332 233555664432
Q ss_pred -------CCceeecC--CC---------------cEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeC----Cccccc
Q 003405 230 -------ADRICWSE--AP---------------IAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQ----NVRHLI 281 (823)
Q Consensus 230 -------~~~i~w~~--~P---------------~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~----~~~~l~ 281 (823)
.+.|...+ .| +......-+++++++.+|.|.+..| .|++... ++.-+.
T Consensus 257 ~ala~~p~p~i~~~s~esp~~~~~~~~~q~ls~~k~~~l~~~~vLa~te~Giyv~d~~~----~~v~l~se~~~DI~dVs 332 (726)
T KOG3621|consen 257 DALARPPAPEIPIRSLESPNQRSLPSGTQHLSLSKSSTLHSDRVLAWTEVGIYVFDSNN----SQVYLWSEGGHDILDVS 332 (726)
T ss_pred hhhccCCCCcccCCCcCCccccCCCCCccccccceeEEeecceEEEeecceEEEEEecc----ceEEEeecCCCceeEEe
Confidence 12222222 11 1233344579999999998888764 3555433 122234
Q ss_pred ccCCeEEEec-cceEEEeeccChhHHHHHHHhcCCHHHHH
Q 003405 282 PSSNAVVVAL-ENSIFGLFPVPLGAQIVQLTASGDFEEAL 320 (823)
Q Consensus 282 ~~~~~v~v~s-~~~I~~l~~~~~~~qI~~Ll~~~~~e~Al 320 (823)
++++.+|+-. ++.+..+........+..|+..|..--++
T Consensus 333 ~~~neiFvL~~d~~l~~~sv~s~qr~l~~l~~~G~~m~~~ 372 (726)
T KOG3621|consen 333 HCGNEIFVLNLDRGLKVESVASRQRKLESLCRCGKEMFVL 372 (726)
T ss_pred ecCceEEEEecCCceeEEEeehhHHHHHHHHhhchhhhhh
Confidence 5677777654 55688888888899999999999655433
No 22
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=97.78 E-value=0.00037 Score=66.29 Aligned_cols=106 Identities=25% Similarity=0.335 Sum_probs=85.7
Q ss_pred CCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeC--Cccccc----ccCCeEEEeccceEEEeeccChhHHHHHH-
Q 003405 238 APIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQ--NVRHLI----PSSNAVVVALENSIFGLFPVPLGAQIVQL- 310 (823)
Q Consensus 238 ~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~--~~~~l~----~~~~~v~v~s~~~I~~l~~~~~~~qI~~L- 310 (823)
.|.+++...-|++.+.++.+.|.+..+ +.+|+.-.+. .++.+. ...+.+++.|++.||.+....-+..|+.+
T Consensus 35 ~p~si~lT~~H~llL~~~~l~~vn~L~-~~vV~e~~~~~~~~~~~gl~~D~~~~t~W~ys~~~I~ei~i~~E~r~vWk~y 113 (147)
T PF05131_consen 35 PPLSIALTEFHLLLLYSDRLIAVNRLN-NKVVFEESLLETGGKILGLCRDPSSNTFWLYSSNSIFEIVINNEDRDVWKIY 113 (147)
T ss_pred CcceEEeeceeeeEEeCCEEEEEEecC-CcEEEEEEeccCCcceeeEEEcCCCCeEEEEeCCeeEEEEcCcchHHHHHHH
Confidence 399999999999999999999998875 7777655442 223322 24568999999999999999999999987
Q ss_pred HhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHcc
Q 003405 311 TASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDT 352 (823)
Q Consensus 311 l~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~ 352 (823)
+++|+|++|+++|+..+ .+...|..++|.++|.+
T Consensus 114 l~~~~fd~Al~~~~~~~--------~~~d~V~~~qa~~lf~k 147 (147)
T PF05131_consen 114 LDKGDFDEALQYCKTNP--------AQRDQVLIKQADHLFQK 147 (147)
T ss_pred HhcCcHHHHHHHccCCH--------HHHHHHHHHHHHHHhhC
Confidence 79999999999997641 25668899999999974
No 23
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=97.72 E-value=0.00082 Score=77.42 Aligned_cols=174 Identities=16% Similarity=0.184 Sum_probs=113.3
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~ 96 (823)
+-.|+.+||++.+||++.|.|-.|+++..-.. ..|- . ...++.+|+.+.+......+++.+ ||
T Consensus 452 ~av~vs~CGNF~~IG~S~G~Id~fNmQSGi~r------------~sf~---~-~~ah~~~V~gla~D~~n~~~vsa~~~G 515 (910)
T KOG1539|consen 452 TAVCVSFCGNFVFIGYSKGTIDRFNMQSGIHR------------KSFG---D-SPAHKGEVTGLAVDGTNRLLVSAGADG 515 (910)
T ss_pred EEEEEeccCceEEEeccCCeEEEEEcccCeee------------cccc---c-CccccCceeEEEecCCCceEEEccCcc
Confidence 44578899999999999999999998754321 1110 0 112478999998876655555543 36
Q ss_pred -EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeec-CCCCceEEEec--CCeEE
Q 003405 97 -IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWC--GENIC 171 (823)
Q Consensus 97 -l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~--~~~i~ 171 (823)
+++|+..+...+.++.-.-+++.+......+.++++. +-.|.+|... .-+.+|++. ..+.++.++|. |..|+
T Consensus 516 ilkfw~f~~k~l~~~l~l~~~~~~iv~hr~s~l~a~~~ddf~I~vvD~~---t~kvvR~f~gh~nritd~~FS~DgrWli 592 (910)
T KOG1539|consen 516 ILKFWDFKKKVLKKSLRLGSSITGIVYHRVSDLLAIALDDFSIRVVDVV---TRKVVREFWGHGNRITDMTFSPDGRWLI 592 (910)
T ss_pred eEEEEecCCcceeeeeccCCCcceeeeeehhhhhhhhcCceeEEEEEch---hhhhhHHhhccccceeeeEeCCCCcEEE
Confidence 8999987665554443333444444444445556653 3456666655 123456654 46799999998 55788
Q ss_pred EEEc-CceEEEEcCCCCeeeccCCCCCCCCEE--EEccCCeEEEE
Q 003405 172 IAIR-KGYMILNATNGALSEVFPSGRIGPPLV--VSLLSGELLLG 213 (823)
Q Consensus 172 v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i--~~~~~~EfLL~ 213 (823)
.++. ....+.|+-+|...+-+..+. ||. ...++++||..
T Consensus 593 sasmD~tIr~wDlpt~~lID~~~vd~---~~~sls~SPngD~LAT 634 (910)
T KOG1539|consen 593 SASMDSTIRTWDLPTGTLIDGLLVDS---PCTSLSFSPNGDFLAT 634 (910)
T ss_pred EeecCCcEEEEeccCcceeeeEecCC---cceeeEECCCCCEEEE
Confidence 8777 567899999998877766542 443 23477888864
No 24
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=97.63 E-value=0.0012 Score=70.91 Aligned_cols=136 Identities=16% Similarity=0.252 Sum_probs=97.3
Q ss_pred cEEEEEEeCCE--EEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVASYGLK--ILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~~~~~--L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
.++|...|-+- +-.||.||.|.+|++..... ...|.+ +..||..|..-+++-.|++-+|
T Consensus 349 ~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~------------------~a~Fpg-ht~~vk~i~FsENGY~Lat~ad 409 (506)
T KOG0289|consen 349 EYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTN------------------VAKFPG-HTGPVKAISFSENGYWLATAAD 409 (506)
T ss_pred eeEEeeEcCCceEEeccCCCceEEEEEcCCccc------------------cccCCC-CCCceeEEEeccCceEEEEEec
Confidence 58899888654 34578999999999876442 224444 4679999999998888888888
Q ss_pred -c-EEEEeCCCCcccc--cccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcC-CCceeEeeeecCCC-CceEEEecCCe
Q 003405 96 -S-IAFHRLPNLETIA--VLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDG-GRGFVEVKDFGVPD-TVKSMSWCGEN 169 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~--~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~-~~~f~~~kei~~~~-~~~~l~~~~~~ 169 (823)
+ |++|||.+++... .+...++++.+++|.....++++ ...+.+|.... .+.|+.++++.... ..+++.|....
T Consensus 410 d~~V~lwDLRKl~n~kt~~l~~~~~v~s~~fD~SGt~L~~~-g~~l~Vy~~~k~~k~W~~~~~~~~~sg~st~v~Fg~~a 488 (506)
T KOG0289|consen 410 DGSVKLWDLRKLKNFKTIQLDEKKEVNSLSFDQSGTYLGIA-GSDLQVYICKKKTKSWTEIKELADHSGLSTGVRFGEHA 488 (506)
T ss_pred CCeEEEEEehhhcccceeeccccccceeEEEcCCCCeEEee-cceeEEEEEecccccceeeehhhhcccccceeeecccc
Confidence 5 9999998776433 35666789999999876667666 88888888763 44677787776554 44555444333
Q ss_pred EEEE
Q 003405 170 ICIA 173 (823)
Q Consensus 170 i~v~ 173 (823)
.+++
T Consensus 489 q~l~ 492 (506)
T KOG0289|consen 489 QYLA 492 (506)
T ss_pred eEEe
Confidence 3333
No 25
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=97.62 E-value=0.002 Score=66.60 Aligned_cols=234 Identities=17% Similarity=0.243 Sum_probs=135.5
Q ss_pred EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-cEEE
Q 003405 22 VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E-SIAF 99 (823)
Q Consensus 22 i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~ 99 (823)
+..||..|.+|+.+|.+.+|+.... ...+.+.+ +-.||+.|.--+....|++-+ | .+.+
T Consensus 31 Fs~~G~~lAvGc~nG~vvI~D~~T~------------------~iar~lsa-H~~pi~sl~WS~dgr~LltsS~D~si~l 91 (405)
T KOG1273|consen 31 FSRWGDYLAVGCANGRVVIYDFDTF------------------RIARMLSA-HVRPITSLCWSRDGRKLLTSSRDWSIKL 91 (405)
T ss_pred eccCcceeeeeccCCcEEEEEcccc------------------chhhhhhc-cccceeEEEecCCCCEeeeecCCceeEE
Confidence 4567899999999999999996532 22233333 357999999999988888876 4 4999
Q ss_pred EeCCCCcccccccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEee--eecCCCCceEEEec--CCeEEEE
Q 003405 100 HRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVK--DFGVPDTVKSMSWC--GENICIA 173 (823)
Q Consensus 100 ~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~k--ei~~~~~~~~l~~~--~~~i~v~ 173 (823)
|++.+-.+...+.....+......+.....||+ .+..-.+..+++...-...+ +..+...+.+..+. |..|+.|
T Consensus 92 wDl~~gs~l~rirf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitG 171 (405)
T KOG1273|consen 92 WDLLKGSPLKRIRFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKHSVLPKDDDGDLNSSASHGVFDRRGKYIITG 171 (405)
T ss_pred EeccCCCceeEEEccCccceeeeccccCCeEEEEEecCCcEEEEecCCceeeccCCCccccccccccccccCCCCEEEEe
Confidence 999766666555444555555556655444444 55555566666321111111 23333344433343 7899999
Q ss_pred EcCc-eEEEEcCCCCeeeccCCCC--CCCCEEEEccCCeEEE--EeCCeEEEEc--------CCCcccc-------CCce
Q 003405 174 IRKG-YMILNATNGALSEVFPSGR--IGPPLVVSLLSGELLL--GKENIGVFVD--------QNGKLLQ-------ADRI 233 (823)
Q Consensus 174 ~~~~-y~lidl~~~~~~~L~~~~~--~~~p~i~~~~~~EfLL--~~~~~gvfv~--------~~G~~~~-------~~~i 233 (823)
+.++ ..++|..+-+...-+..-. ..+-++.. ..++|++ +.|...=.|+ .+|++.. -+..
T Consensus 172 tsKGkllv~~a~t~e~vas~rits~~~IK~I~~s-~~g~~liiNtsDRvIR~ye~~di~~~~r~~e~e~~~K~qDvVNk~ 250 (405)
T KOG1273|consen 172 TSKGKLLVYDAETLECVASFRITSVQAIKQIIVS-RKGRFLIINTSDRVIRTYEISDIDDEGRDGEVEPEHKLQDVVNKL 250 (405)
T ss_pred cCcceEEEEecchheeeeeeeechheeeeEEEEe-ccCcEEEEecCCceEEEEehhhhcccCccCCcChhHHHHHHHhhh
Confidence 9977 5677877754333222211 11223333 3566665 3443332332 3344321 1345
Q ss_pred eecCCCcEEEEeCCEEEEEeCC--eEEEEEccCCCceeEEEeeCCcc
Q 003405 234 CWSEAPIAVIIQKPYAIALLPR--RVEVRSLRVPYALIQTIVLQNVR 278 (823)
Q Consensus 234 ~w~~~P~~v~~~~PYll~~~~~--~ieV~~l~~~~~lvQ~i~l~~~~ 278 (823)
+|..- .+.-..-|++|-+.+ .+.|..-. .+.+|..+.-+.+.
T Consensus 251 ~Wk~c--cfs~dgeYv~a~s~~aHaLYIWE~~-~GsLVKILhG~kgE 294 (405)
T KOG1273|consen 251 QWKKC--CFSGDGEYVCAGSARAHALYIWEKS-IGSLVKILHGTKGE 294 (405)
T ss_pred hhhhe--eecCCccEEEeccccceeEEEEecC-CcceeeeecCCchh
Confidence 56322 122246799998875 45565543 47888877666543
No 26
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=97.60 E-value=0.021 Score=58.49 Aligned_cols=245 Identities=13% Similarity=0.152 Sum_probs=153.6
Q ss_pred ccCCCCcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccC
Q 003405 12 ISNCSPKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQ 88 (823)
Q Consensus 12 ~~~~~~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~ 88 (823)
+..-+..|+.+.... +-++=+..|-.+..|.+..+... .+. ..+.++++ ...|+.+.+.++.+
T Consensus 11 l~gh~d~Vt~la~~~~~~~~l~sasrDk~ii~W~L~~dd~~------------~G~-~~r~~~GH-sH~v~dv~~s~dg~ 76 (315)
T KOG0279|consen 11 LEGHTDWVTALAIKIKNSDILVSASRDKTIIVWKLTSDDIK------------YGV-PVRRLTGH-SHFVSDVVLSSDGN 76 (315)
T ss_pred ecCCCceEEEEEeecCCCceEEEcccceEEEEEEeccCccc------------cCc-eeeeeecc-ceEecceEEccCCc
Confidence 344456777766553 46777889999999998876321 111 24566663 57899999999999
Q ss_pred ceeeEe-Cc-EEEEeCCCCccccc-ccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEE
Q 003405 89 LLLSLS-ES-IAFHRLPNLETIAV-LTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMS 164 (823)
Q Consensus 89 ~Ll~l~-d~-l~~~~L~~l~~~~~-i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~ 164 (823)
++++-+ |+ +.+||+.+-++... +...+.+.+++++.+..+||-+ ..|.|.+|...++-.+. +.+..-.+-|.++.
T Consensus 77 ~alS~swD~~lrlWDl~~g~~t~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g~ck~t-~~~~~~~~WVscvr 155 (315)
T KOG0279|consen 77 FALSASWDGTLRLWDLATGESTRRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGVCKYT-IHEDSHREWVSCVR 155 (315)
T ss_pred eEEeccccceEEEEEecCCcEEEEEEecCCceEEEEecCCCceeecCCCcceeeeeeecccEEEE-EecCCCcCcEEEEE
Confidence 999886 65 99999977554433 4567899999999999899998 55678888877543232 22222267788999
Q ss_pred ecCC---eEEEEEc--CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE--EeCCeEEEEcCCCccccCCceeecC
Q 003405 165 WCGE---NICIAIR--KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL--GKENIGVFVDQNGKLLQADRICWSE 237 (823)
Q Consensus 165 ~~~~---~i~v~~~--~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL--~~~~~gvfv~~~G~~~~~~~i~w~~ 237 (823)
|..+ .+++... +...+-|+.+-+...-++-......-+...+|+-... +.|..++..|.+-. ++.-.++.
T Consensus 156 fsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~---k~lysl~a 232 (315)
T KOG0279|consen 156 FSPNESNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPDGSLCASGGKDGEAMLWDLNEG---KNLYSLEA 232 (315)
T ss_pred EcCCCCCcEEEEccCCceEEEEccCCcchhhccccccccEEEEEECCCCCEEecCCCCceEEEEEccCC---ceeEeccC
Confidence 9843 4555544 5678888887665444432111111222334444333 23444555543211 12222222
Q ss_pred --CCcEEEEe--CCEEEEEeCCeEEEEEccCCCceeEEEeeC
Q 003405 238 --APIAVIIQ--KPYAIALLPRRVEVRSLRVPYALIQTIVLQ 275 (823)
Q Consensus 238 --~P~~v~~~--~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~ 275 (823)
...+++|. .+.|++....+|.|.++. ++..+.++.+.
T Consensus 233 ~~~v~sl~fspnrywL~~at~~sIkIwdl~-~~~~v~~l~~d 273 (315)
T KOG0279|consen 233 FDIVNSLCFSPNRYWLCAATATSIKIWDLE-SKAVVEELKLD 273 (315)
T ss_pred CCeEeeEEecCCceeEeeccCCceEEEecc-chhhhhhcccc
Confidence 22334433 567888888899999995 67777776553
No 27
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.51 E-value=0.16 Score=54.05 Aligned_cols=246 Identities=14% Similarity=0.196 Sum_probs=134.3
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEEeCCC
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFHRLPN 104 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~L~~ 104 (823)
-++.+..+|.+.+|++.... ..+.+.. ...+..+...+....+++.+. + +.+|+..+
T Consensus 3 ~~~s~~~d~~v~~~d~~t~~------------------~~~~~~~--~~~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~ 62 (300)
T TIGR03866 3 AYVSNEKDNTISVIDTATLE------------------VTRTFPV--GQRPRGITLSKDGKLLYVCASDSDTIQVIDLAT 62 (300)
T ss_pred EEEEecCCCEEEEEECCCCc------------------eEEEEEC--CCCCCceEECCCCCEEEEEECCCCeEEEEECCC
Confidence 34567889999999865321 1223322 233566888888777655543 3 99999877
Q ss_pred CcccccccCCCCcEEEEeeCCCceEEEEE--cCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEcC--ce
Q 003405 105 LETIAVLTKAKGANVYSWDDRRGFLCFAR--QKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIRK--GY 178 (823)
Q Consensus 105 l~~~~~i~~~kg~~~fa~~~~~~~l~V~~--kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~--~y 178 (823)
.+....+....++..++++++...++++. .+.+.+|..... ..+..+..+..+.+++|. |+.++++... ..
T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~---~~~~~~~~~~~~~~~~~~~dg~~l~~~~~~~~~~ 139 (300)
T TIGR03866 63 GEVIGTLPSGPDPELFALHPNGKILYIANEDDNLVTVIDIETR---KVLAEIPVGVEPEGMAVSPDGKIVVNTSETTNMA 139 (300)
T ss_pred CcEEEeccCCCCccEEEECCCCCEEEEEcCCCCeEEEEECCCC---eEEeEeeCCCCcceEEECCCCCEEEEEecCCCeE
Confidence 65544444334556777877766676663 456767666532 233444445567888887 4556666653 34
Q ss_pred EEEEcCCCCeeeccCCCCCCCCE-EEEccCCeEEE-Ee--CCeEEEEcCC-CccccCCceeec--------CCCcEEEEe
Q 003405 179 MILNATNGALSEVFPSGRIGPPL-VVSLLSGELLL-GK--ENIGVFVDQN-GKLLQADRICWS--------EAPIAVIIQ 245 (823)
Q Consensus 179 ~lidl~~~~~~~L~~~~~~~~p~-i~~~~~~EfLL-~~--~~~gvfv~~~-G~~~~~~~i~w~--------~~P~~v~~~ 245 (823)
..+|..+++.......+. .|. +...+++..++ +. ++...++|.. |... ..+.+. ..|..+.+.
T Consensus 140 ~~~d~~~~~~~~~~~~~~--~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~--~~~~~~~~~~~~~~~~~~~i~~s 215 (300)
T TIGR03866 140 HFIDTKTYEIVDNVLVDQ--RPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVI--KKITFEIPGVHPEAVQPVGIKLT 215 (300)
T ss_pred EEEeCCCCeEEEEEEcCC--CccEEEECCCCCEEEEEcCCCCEEEEEEcCcceee--eeeeecccccccccCCccceEEC
Confidence 567887776543332222 232 33445666654 32 4455556654 3322 223221 124455554
Q ss_pred --CCEEEEEe--CCeEEEEEccCCCceeEEEeeCC-cccc--cccCCeEEEec--cceEEEeecc
Q 003405 246 --KPYAIALL--PRRVEVRSLRVPYALIQTIVLQN-VRHL--IPSSNAVVVAL--ENSIFGLFPV 301 (823)
Q Consensus 246 --~PYll~~~--~~~ieV~~l~~~~~lvQ~i~l~~-~~~l--~~~~~~v~v~s--~~~I~~l~~~ 301 (823)
..++++.. .+.+.|+++. ++.++..+.... +..+ .+.+..+++++ ++.|......
T Consensus 216 ~dg~~~~~~~~~~~~i~v~d~~-~~~~~~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~ 279 (300)
T TIGR03866 216 KDGKTAFVALGPANRVAVVDAK-TYEVLDYLLVGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVA 279 (300)
T ss_pred CCCCEEEEEcCCCCeEEEEECC-CCcEEEEEEeCCCcceEEECCCCCEEEEEcCCCCeEEEEECC
Confidence 34554432 3578888886 566666554322 1111 23444555543 3455554443
No 28
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.48 E-value=0.00064 Score=78.67 Aligned_cols=102 Identities=16% Similarity=0.251 Sum_probs=78.7
Q ss_pred HHHhhcCchhHHHHHHHHhhcccCCCChhHHHH-HHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcC----
Q 003405 640 SYLKQYSPSMQGRYLELMLAMNENSISGNLQNE-MVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESIS---- 714 (823)
Q Consensus 640 ~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~-L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~---- 714 (823)
.||....-..+..|||.|+..+ ....=|++ |+-.|++- . .-.||..|.+.-+
T Consensus 406 kfLdaq~IknLt~YLe~L~~~g---la~~dhttlLLncYiKl-k-------------------d~~kL~efI~~~~~g~~ 462 (933)
T KOG2114|consen 406 KFLDAQRIKNLTSYLEALHKKG---LANSDHTTLLLNCYIKL-K-------------------DVEKLTEFISKCDKGEW 462 (933)
T ss_pred HhcCHHHHHHHHHHHHHHHHcc---cccchhHHHHHHHHHHh-c-------------------chHHHHHHHhcCCCcce
Confidence 3444333446888999999742 34445655 56677753 1 2489999998876
Q ss_pred CCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhCCCchhHHH
Q 003405 715 GYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVFLINQPVFL 764 (823)
Q Consensus 715 ~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~~~a~~~ 764 (823)
.+|.+.++++|...++.+|.-+|--|-++|+-+|+++++.++|+..|+-+
T Consensus 463 ~fd~e~al~Ilr~snyl~~a~~LA~k~~~he~vl~ille~~~ny~eAl~y 512 (933)
T KOG2114|consen 463 FFDVETALEILRKSNYLDEAELLATKFKKHEWVLDILLEDLHNYEEALRY 512 (933)
T ss_pred eeeHHHHHHHHHHhChHHHHHHHHHHhccCHHHHHHHHHHhcCHHHHHHH
Confidence 89999999999999998888888888999999999999999999777543
No 29
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=97.48 E-value=0.029 Score=58.41 Aligned_cols=213 Identities=16% Similarity=0.254 Sum_probs=132.6
Q ss_pred cccccccccCCCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCC--C--C--------Cccc-------------
Q 003405 5 AFDSLELISNCSPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRS--P--P--------SDYQ------------- 57 (823)
Q Consensus 5 af~~~~l~~~~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~--~--~--------~d~~------------- 57 (823)
.|.+....++...+|+|++-.. ..+...++|-.+.+|+........+ + | +.+.
T Consensus 3 s~~~ak~f~~~~~~i~sl~fs~~G~~litss~dDsl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~~~i~sStk~d~tI 82 (311)
T KOG1446|consen 3 SFRPAKVFRETNGKINSLDFSDDGLLLITSSEDDSLRLYDSLSGKQVKTINSKKYGVDLACFTHHSNTVIHSSTKEDDTI 82 (311)
T ss_pred ccccccccccCCCceeEEEecCCCCEEEEecCCCeEEEEEcCCCceeeEeecccccccEEEEecCCceEEEccCCCCCce
Confidence 4677777888889999999875 5788888998999999876543211 0 0 0000
Q ss_pred ---ccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE
Q 003405 58 ---SLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E-SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA 132 (823)
Q Consensus 58 ---~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~ 132 (823)
.|.... .+|-|.++ ++.|+.|.+-|..+..++-+ | .|++|++..-+... +-...+-...|.+++.-.+|++
T Consensus 83 ryLsl~dNk--ylRYF~GH-~~~V~sL~~sP~~d~FlS~S~D~tvrLWDlR~~~cqg-~l~~~~~pi~AfDp~GLifA~~ 158 (311)
T KOG1446|consen 83 RYLSLHDNK--YLRYFPGH-KKRVNSLSVSPKDDTFLSSSLDKTVRLWDLRVKKCQG-LLNLSGRPIAAFDPEGLIFALA 158 (311)
T ss_pred EEEEeecCc--eEEEcCCC-CceEEEEEecCCCCeEEecccCCeEEeeEecCCCCce-EEecCCCcceeECCCCcEEEEe
Confidence 011111 24556654 78999999999999988877 4 49999986433221 1122233345567765566777
Q ss_pred EcC-eEEEEEEcC--CCceeEeeeecCC--CCceEEEec--CCeEEEEEcCc-eEEEEcCCCCeeeccCCC--CCCCCEE
Q 003405 133 RQK-RVCIFRHDG--GRGFVEVKDFGVP--DTVKSMSWC--GENICIAIRKG-YMILNATNGALSEVFPSG--RIGPPLV 202 (823)
Q Consensus 133 ~kk-ki~l~~~~~--~~~f~~~kei~~~--~~~~~l~~~--~~~i~v~~~~~-y~lidl~~~~~~~L~~~~--~~~~p~i 202 (823)
.+. .|.||..+. ...|.... +..+ ...+.|.|. |..|.+++..+ -.++|.-+|....-|..- ...-|+.
T Consensus 159 ~~~~~IkLyD~Rs~dkgPF~tf~-i~~~~~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~ 237 (311)
T KOG1446|consen 159 NGSELIKLYDLRSFDKGPFTTFS-ITDNDEAEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAGNLPLS 237 (311)
T ss_pred cCCCeEEEEEecccCCCCceeEc-cCCCCccceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCCCccee
Confidence 665 889998863 12343221 1112 245677776 56899999966 477898888865554331 1123554
Q ss_pred EE-ccCCeEEEEe-C-CeEEEEc
Q 003405 203 VS-LLSGELLLGK-E-NIGVFVD 222 (823)
Q Consensus 203 ~~-~~~~EfLL~~-~-~~gvfv~ 222 (823)
+. .|+++|+++. + +...+.+
T Consensus 238 a~ftPds~Fvl~gs~dg~i~vw~ 260 (311)
T KOG1446|consen 238 ATFTPDSKFVLSGSDDGTIHVWN 260 (311)
T ss_pred EEECCCCcEEEEecCCCcEEEEE
Confidence 44 4899999854 3 3344444
No 30
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms]
Probab=97.48 E-value=6.8e-05 Score=84.95 Aligned_cols=223 Identities=19% Similarity=0.223 Sum_probs=129.6
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES- 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~- 96 (823)
+|=-.-..++.|.+.|..|..+.. +..+.++ . +.-..+.|+.+.++..++..+.++.+
T Consensus 948 kiFkA~tIEdwilfatqtglffts-isqprNp-----------------s---riagp~svtslE~mseI~cvamI~ns~ 1006 (1265)
T KOG0976|consen 948 KIFKAGTIEDWILFATQTGLFFTS-ISQPRNP-----------------S---RIAGPKSVTSLEPMSEIHCVAMIGNSK 1006 (1265)
T ss_pred eeecccccccceeEeecCCceEEE-eecCCCc-----------------h---hhcCccccccccccceeeEEEEEecCc
Confidence 333333446899999999986653 3322111 0 11135788888888888888888873
Q ss_pred --EEEEeCCCCcc-----cc-----cccCCCCcEEEEeeCCCc--eEEEEE--cCeEEEEEEcCCCceeEeeeecCCCCc
Q 003405 97 --IAFHRLPNLET-----IA-----VLTKAKGANVYSWDDRRG--FLCFAR--QKRVCIFRHDGGRGFVEVKDFGVPDTV 160 (823)
Q Consensus 97 --l~~~~L~~l~~-----~~-----~i~~~kg~~~fa~~~~~~--~l~V~~--kkki~l~~~~~~~~f~~~kei~~~~~~ 160 (823)
+....++++.. .+ .++...+++.+......| ++-+.. --.+..|--..| .|...-.+..|. |
T Consensus 1007 ~qla~ipldsL~lamqst~pSirpeVlpef~hvh~i~yhQqngqrfll~sddt~lh~rkyn~trd-~fs~~akl~vpe-P 1084 (1265)
T KOG0976|consen 1007 FQLADIPLDSLELAMQSTDPSIRPEVLPEFSHVHPISYHQQNGQRFLLESDDTFLHFRKYNDTRD-RFSRTAKLKVPE-P 1084 (1265)
T ss_pred ceeecCchhHHHHHHhcCCCccchhhhhhhcCcceeEEEEecccchhhhhhhhHHHHhhhcccch-hhhhcccccCCC-c
Confidence 33333333321 01 123333444444433322 111110 000111111112 244444566673 4
Q ss_pred eEEEec-CCeEEEEEcCceE-EEEcCCCC---eeeccCCCCCCCCE-EEEccCCeEEEEeCCeEEEEcCCCccccCCcee
Q 003405 161 KSMSWC-GENICIAIRKGYM-ILNATNGA---LSEVFPSGRIGPPL-VVSLLSGELLLGKENIGVFVDQNGKLLQADRIC 234 (823)
Q Consensus 161 ~~l~~~-~~~i~v~~~~~y~-lidl~~~~---~~~L~~~~~~~~p~-i~~~~~~EfLL~~~~~gvfv~~~G~~~~~~~i~ 234 (823)
.+.... ...+++++.+-|+ -+|-.+.. ...+.++.....|. ...++.+|++++|.|-|+|||..|..+|..+|.
T Consensus 1085 lsFies~P~gfifa~dtfyyv~ldhqsss~vsARklm~p~~~~yp~sA~si~anelllaYQnkGifVnl~Geqsrn~sie 1164 (1265)
T KOG0976|consen 1085 LSFIESEPYGFIFAFDTFYYVELDHQSSSGVSARKLMDPPNPRYPGSAISIGANELLLAYQNKGIFVNLSGEQSRNTSIE 1164 (1265)
T ss_pred hhhhhcCcceEEEecceEEEEeecccCCCCCchhhhcCCCCCCCCcchhhccHHHHHHHhhccCeEEecccccCCccccc
Confidence 444444 3446666665433 34544322 23445444333343 234577899999999999999999988788899
Q ss_pred ecCCCcEEEEeCCEEEEEeCCeEEEEEcc
Q 003405 235 WSEAPIAVIIQKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 235 w~~~P~~v~~~~PYll~~~~~~ieV~~l~ 263 (823)
|+..|..+.|..|++..+++++++|+-+.
T Consensus 1165 wekmp~ef~YtspilyiVhddsiei~~is 1193 (1265)
T KOG0976|consen 1165 WEKMPGEFTYTSPILYIVHDDSIEIHPIS 1193 (1265)
T ss_pred cccCCCCccccCceEEEeccCCccccccC
Confidence 99999999999999999999999998774
No 31
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=97.44 E-value=0.0036 Score=68.25 Aligned_cols=142 Identities=15% Similarity=0.229 Sum_probs=101.5
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
-|.|.++.. .-++-|.-||.|..|+....++ . +.++. +..||+.+..+|...++++.+
T Consensus 155 YVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~~-------------~----v~eln--hg~pVe~vl~lpsgs~iasAg 215 (487)
T KOG0310|consen 155 YVRCGDISPANDHIVVTGSYDGKVRLWDTRSLTS-------------R----VVELN--HGCPVESVLALPSGSLIASAG 215 (487)
T ss_pred eeEeeccccCCCeEEEecCCCceEEEEEeccCCc-------------e----eEEec--CCCceeeEEEcCCCCEEEEcC
Confidence 466766654 3688999999999999765431 1 12221 367999999999977777777
Q ss_pred Cc-EEEEeCCCC-ccccc-ccCCCCcEEEEeeCCCce-EEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC--C
Q 003405 95 ES-IAFHRLPNL-ETIAV-LTKAKGANVYSWDDRRGF-LCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG--E 168 (823)
Q Consensus 95 d~-l~~~~L~~l-~~~~~-i~~~kg~~~fa~~~~~~~-l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~--~ 168 (823)
+. +++||+.+- +.+.. ....|.||+.++..+..+ +..+..+.+.+|... .++.+....+|.++.+|+... .
T Consensus 216 Gn~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD~~VKVfd~t---~~Kvv~s~~~~~pvLsiavs~dd~ 292 (487)
T KOG0310|consen 216 GNSVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSLDRHVKVFDTT---NYKVVHSWKYPGPVLSIAVSPDDQ 292 (487)
T ss_pred CCeEEEEEecCCceehhhhhcccceEEEEEeecCCceEeecccccceEEEEcc---ceEEEEeeecccceeeEEecCCCc
Confidence 64 999999632 22222 225688999988776655 456699999999976 466666778899999998873 4
Q ss_pred eEEEEEcCceEEE
Q 003405 169 NICIAIRKGYMIL 181 (823)
Q Consensus 169 ~i~v~~~~~y~li 181 (823)
.+++|..++-..+
T Consensus 293 t~viGmsnGlv~~ 305 (487)
T KOG0310|consen 293 TVVIGMSNGLVSI 305 (487)
T ss_pred eEEEecccceeee
Confidence 6778876654443
No 32
>KOG0985 consensus Vesicle coat protein clathrin, heavy chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.42 E-value=0.0014 Score=76.96 Aligned_cols=222 Identities=18% Similarity=0.256 Sum_probs=141.2
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccc
Q 003405 506 ILDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEH 585 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~ 585 (823)
.+..+|-|.|+..|... ..|++.+++.|...|-..|+++.- .|+.+.+.+|+-+.-|- .-..+..
T Consensus 873 a~hnAlaKIyIDSNNnP--E~fLkeN~yYDs~vVGkYCEKRDP--~lA~vaYerGqcD~elI--~vcNeNS--------- 937 (1666)
T KOG0985|consen 873 ATHNALAKIYIDSNNNP--ERFLKENPYYDSKVVGKYCEKRDP--HLACVAYERGQCDLELI--NVCNENS--------- 937 (1666)
T ss_pred HHHhhhhheeecCCCCh--HHhcccCCcchhhHHhhhhcccCC--ceEEEeecccCCcHHHH--HhcCchh---------
Confidence 46778889999886432 367777777788877777766544 35556666666543221 1000000
Q ss_pred cccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCccc------cccccccCCCChH----HHHHHHhhcCchhHHHHHH
Q 003405 586 TQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQ------TIELFLSGNIPAD----LVNSYLKQYSPSMQGRYLE 655 (823)
Q Consensus 586 ~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~------~~~if~~~~l~~~----~Vl~~L~~~~~~~~~~YLE 655 (823)
++ ..-.+||- +-.|.+|. +..|-+.+|-. .++.=..++-+|+ .|-.|+...-|..++.-||
T Consensus 938 --lf--K~~aRYlv--~R~D~~LW---~~VL~e~n~~rRqLiDqVv~tal~E~~dPe~vS~tVkAfMtadLp~eLIELLE 1008 (1666)
T KOG0985|consen 938 --LF--KSQARYLV--ERSDPDLW---AKVLNEENPYRRQLIDQVVQTALPETQDPEEVSVTVKAFMTADLPNELIELLE 1008 (1666)
T ss_pred --HH--HHHHHHHH--hccChHHH---HHHHhccChHHHHHHHHHHHhcCCccCChHHHHHHHHHHHhcCCcHHHHHHHH
Confidence 00 12234443 12233321 11122222211 1111112344554 3557777667999999999
Q ss_pred HHhhcccC-CCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHH
Q 003405 656 LMLAMNEN-SISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEER 734 (823)
Q Consensus 656 ~li~~~~~-~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~ 734 (823)
.+|..+.. +...++.|.|+.-=+.. .|.+.+.++..-..||+..+-.++-.+.|+||-
T Consensus 1009 KIvL~~S~Fse~~nLQnLLiLtAika---------------------d~trVm~YI~rLdnyDa~~ia~iai~~~LyEEA 1067 (1666)
T KOG0985|consen 1009 KIVLDNSVFSENRNLQNLLILTAIKA---------------------DRTRVMEYINRLDNYDAPDIAEIAIENQLYEEA 1067 (1666)
T ss_pred HHhcCCcccccchhhhhhHHHHHhhc---------------------ChHHHHHHHHHhccCCchhHHHHHhhhhHHHHH
Confidence 99975311 23455566554433322 589999999999999999999999999999999
Q ss_pred HHHhhccccHHHHHHHHHHHhCCC-----------chhHHHHHHHHhcC
Q 003405 735 AILLGKMNQHELALSLYVHKVFLI-----------NQPVFLLIRRMAMD 772 (823)
Q Consensus 735 ~~Ll~klg~h~~AL~ilv~~L~D~-----------~~a~~~~l~~~y~~ 772 (823)
--++.|-..|.+|++.++...++. .+++|..|.+-=+.
T Consensus 1068 F~ifkkf~~n~~A~~VLie~i~~ldRA~efAe~~n~p~vWsqlakAQL~ 1116 (1666)
T KOG0985|consen 1068 FAIFKKFDMNVSAIQVLIENIGSLDRAYEFAERCNEPAVWSQLAKAQLQ 1116 (1666)
T ss_pred HHHHHHhcccHHHHHHHHHHhhhHHHHHHHHHhhCChHHHHHHHHHHHh
Confidence 999999999999999999988877 46799988775554
No 33
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.29 E-value=0.22 Score=52.95 Aligned_cols=227 Identities=10% Similarity=0.139 Sum_probs=125.9
Q ss_pred CCEEEE-EeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEEe
Q 003405 26 GLKILL-GCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFHR 101 (823)
Q Consensus 26 ~~~L~v-GT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~ 101 (823)
++.+|+ |..+|.+.+|+..... ..+.+.. ...+..+.+.+..+.+++.+. + +.+|+
T Consensus 42 g~~l~~~~~~~~~v~~~d~~~~~------------------~~~~~~~--~~~~~~~~~~~~g~~l~~~~~~~~~l~~~d 101 (300)
T TIGR03866 42 GKLLYVCASDSDTIQVIDLATGE------------------VIGTLPS--GPDPELFALHPNGKILYIANEDDNLVTVID 101 (300)
T ss_pred CCEEEEEECCCCeEEEEECCCCc------------------EEEeccC--CCCccEEEECCCCCEEEEEcCCCCeEEEEE
Confidence 456754 6788999999865321 1222221 123456777777776665542 4 99999
Q ss_pred CCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEc--Cc
Q 003405 102 LPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIR--KG 177 (823)
Q Consensus 102 L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~--~~ 177 (823)
+.+.+.+..+........++++++...++++....-.++.++.. .......+..+..+.+++|. |..++++.. ..
T Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~s~dg~~l~~~~~~~~~ 180 (300)
T TIGR03866 102 IETRKVLAEIPVGVEPEGMAVSPDGKIVVNTSETTNMAHFIDTK-TYEIVDNVLVDQRPRFAEFTADGKELWVSSEIGGT 180 (300)
T ss_pred CCCCeEEeEeeCCCCcceEEECCCCCEEEEEecCCCeEEEEeCC-CCeEEEEEEcCCCccEEEECCCCCEEEEEcCCCCE
Confidence 97765444333222346677777766777765543223333322 12233334455677888887 456766653 56
Q ss_pred eEEEEcCCCCeeeccCC---C---CCCCCE-EEEccCCeEE-EE--eCCeEEEEcCC-CccccCCceeecCCCcEEEEe-
Q 003405 178 YMILNATNGALSEVFPS---G---RIGPPL-VVSLLSGELL-LG--KENIGVFVDQN-GKLLQADRICWSEAPIAVIIQ- 245 (823)
Q Consensus 178 y~lidl~~~~~~~L~~~---~---~~~~p~-i~~~~~~EfL-L~--~~~~gvfv~~~-G~~~~~~~i~w~~~P~~v~~~- 245 (823)
..++|+.+++...-+.. + ....|. +...+++.++ ++ .++....+|.. |+.. ..+.-...|..+.+.
T Consensus 181 v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~--~~~~~~~~~~~~~~~~ 258 (300)
T TIGR03866 181 VSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVL--DYLLVGQRVWQLAFTP 258 (300)
T ss_pred EEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCCcEE--EEEEeCCCcceEEECC
Confidence 88999998865332211 1 111233 3334566654 32 23334445543 3322 122223345666663
Q ss_pred -CCEEEEEe--CCeEEEEEccCCCceeEEEeeCC
Q 003405 246 -KPYAIALL--PRRVEVRSLRVPYALIQTIVLQN 276 (823)
Q Consensus 246 -~PYll~~~--~~~ieV~~l~~~~~lvQ~i~l~~ 276 (823)
..+|++.. .+.|.|+++. ++..++++.+.+
T Consensus 259 ~g~~l~~~~~~~~~i~v~d~~-~~~~~~~~~~~~ 291 (300)
T TIGR03866 259 DEKYLLTTNGVSNDVSVIDVA-ALKVIKSIKVGR 291 (300)
T ss_pred CCCEEEEEcCCCCeEEEEECC-CCcEEEEEEccc
Confidence 34776653 4689999996 588888887643
No 34
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=97.23 E-value=0.036 Score=65.13 Aligned_cols=180 Identities=16% Similarity=0.196 Sum_probs=121.8
Q ss_pred CCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 15 CSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 15 ~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
....|.|+..+++.++.||.+++++.|.+.+...+. ++..| .-|++.+.+.-.+++++.=+
T Consensus 55 ~g~~v~~ia~~s~~f~~~s~~~tv~~y~fps~~~~~---------------iL~Rf----tlp~r~~~v~g~g~~iaags 115 (933)
T KOG1274|consen 55 SGELVSSIACYSNHFLTGSEQNTVLRYKFPSGEEDT---------------ILARF----TLPIRDLAVSGSGKMIAAGS 115 (933)
T ss_pred cCceeEEEeecccceEEeeccceEEEeeCCCCCccc---------------eeeee----eccceEEEEecCCcEEEeec
Confidence 455789999999999999999999999987654321 12222 25899999998888888777
Q ss_pred C--cEEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeE------eeeecCCCCceEEE
Q 003405 95 E--SIAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVE------VKDFGVPDTVKSMS 164 (823)
Q Consensus 95 d--~l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~------~kei~~~~~~~~l~ 164 (823)
| .|++.++.+......+...+ .+..+.++++.-+++|+ +.+++.+|.+..+..... -.|+.....+..++
T Consensus 116 dD~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~a 195 (933)
T KOG1274|consen 116 DDTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLA 195 (933)
T ss_pred CceeEEEEeccccchheeecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeee
Confidence 7 38888887665443333333 46677778877788877 888999999874321111 11344456777889
Q ss_pred ec---CCeEEEEEcCceEEEEcCCCCeeeccCCCC-CC-CCEEEEccCCeEEEE
Q 003405 165 WC---GENICIAIRKGYMILNATNGALSEVFPSGR-IG-PPLVVSLLSGELLLG 213 (823)
Q Consensus 165 ~~---~~~i~v~~~~~y~lidl~~~~~~~L~~~~~-~~-~p~i~~~~~~EfLL~ 213 (823)
|. |..++++.++...+|+..+....--+.... ++ --++.+.++|++|-+
T Consensus 196 W~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAA 249 (933)
T KOG1274|consen 196 WHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAA 249 (933)
T ss_pred ecCCCCeEEeeccCCeEEEEccCCceeheeecccccccceEEEEEcCCCcEEee
Confidence 98 456888888999999988765433332211 11 223445567777764
No 35
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.22 E-value=0.14 Score=59.20 Aligned_cols=115 Identities=14% Similarity=0.248 Sum_probs=80.4
Q ss_pred CCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCC-----
Q 003405 74 SKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGR----- 146 (823)
Q Consensus 74 ~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~----- 146 (823)
+++.|..|.+-|..-+|+++-+ | ..+..+..-..++.....++|.+++.+++..+++|+..+-+.||+..+..
T Consensus 54 ~~~NI~~ialSp~g~lllavdE~g~~~lvs~~~r~Vlh~f~fk~~v~~i~fSPng~~fav~~gn~lqiw~~P~~~~~~~~ 133 (893)
T KOG0291|consen 54 TRYNITRIALSPDGTLLLAVDERGRALLVSLLSRSVLHRFNFKRGVGAIKFSPNGKFFAVGCGNLLQIWHAPGEIKNEFN 133 (893)
T ss_pred cCCceEEEEeCCCceEEEEEcCCCcEEEEecccceeeEEEeecCccceEEECCCCcEEEEEecceeEEEecCcchhcccC
Confidence 3678999999999878887776 5 33334433233455566789999999999888999999999999976421
Q ss_pred ceeEeeeecCC-CCceEEEecCCe--EEEEEc-CceEEEEcCCCCe
Q 003405 147 GFVEVKDFGVP-DTVKSMSWCGEN--ICIAIR-KGYMILNATNGAL 188 (823)
Q Consensus 147 ~f~~~kei~~~-~~~~~l~~~~~~--i~v~~~-~~y~lidl~~~~~ 188 (823)
.|...+.+..+ +.++++.|..++ +.+|++ ..-.+++++..+.
T Consensus 134 pFvl~r~~~g~fddi~si~Ws~DSr~l~~gsrD~s~rl~~v~~~k~ 179 (893)
T KOG0291|consen 134 PFVLHRTYLGHFDDITSIDWSDDSRLLVTGSRDLSARLFGVDGNKN 179 (893)
T ss_pred cceEeeeecCCccceeEEEeccCCceEEeccccceEEEEEeccccc
Confidence 24555555444 799999999664 555555 3566777665433
No 36
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. 
The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminus of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar trafficking, and contain one clathrin-type repeat. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=97.15 E-value=0.00015 Score=69.07 Aligned_cols=98 Identities=33% Similarity=0.406 Sum_probs=75.4
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccc
Q 003405 506 ILDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEH 585 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~ 585 (823)
.+.|.|+.+|++.++.+.+..|++..+..|++.+.+.|.+++.+++.+.+|.+.|+|++|+.++.++.+
T Consensus 43 ~~~~~L~~ly~~~~~~~~l~~~L~~~~~yd~~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i~~~~~~----------- 111 (143)
T PF00637_consen 43 DLHTLLLELYIKYDPYEKLLEFLKTSNNYDLDKALRLCEKHGLYEEAVYLYSKLGNHDEALEILHKLKD----------- 111 (143)
T ss_dssp HHHHHHHHHHHCTTTCCHHHHTTTSSSSS-CTHHHHHHHTTTSHHHHHHHHHCCTTHTTCSSTSSSTHC-----------
T ss_pred HHHHHHHHHHHhcCCchHHHHHcccccccCHHHHHHHHHhcchHHHHHHHHHHcccHHHHHHHHHHHcc-----------
Confidence 689999999999887568889999877899999999999999999999999999999999997443322
Q ss_pred cccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCc
Q 003405 586 TQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCP 621 (823)
Q Consensus 586 ~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p 621 (823)
.+.+++|..+.+ +.+++..-...+++..|
T Consensus 112 -----~~~a~e~~~~~~--~~~l~~~l~~~~l~~~~ 140 (143)
T PF00637_consen 112 -----YEEAIEYAKKVD--DPELWEQLLKYCLDSKP 140 (143)
T ss_dssp -----SCCCTTTGGGCS--SSHHHHHHHHHHCTSTC
T ss_pred -----HHHHHHHHHhcC--cHHHHHHHHHHHHhcCc
Confidence 234456776544 44555555555555444
No 37
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=97.13 E-value=0.011 Score=69.37 Aligned_cols=158 Identities=18% Similarity=0.305 Sum_probs=103.9
Q ss_pred ccccccCCCCcEEEEE--EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec
Q 003405 8 SLELISNCSPKIDAVA--SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA 85 (823)
Q Consensus 8 ~~~l~~~~~~~I~ci~--~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~ 85 (823)
..-|+.++...|.|++ ..|+.+..|.+|=.|.+.+....+. .+.+.++ +.||.+|...|
T Consensus 88 ~~~iL~Rftlp~r~~~v~g~g~~iaagsdD~~vK~~~~~D~s~------------------~~~lrgh-~apVl~l~~~p 148 (933)
T KOG1274|consen 88 EDTILARFTLPIRDLAVSGSGKMIAAGSDDTAVKLLNLDDSSQ------------------EKVLRGH-DAPVLQLSYDP 148 (933)
T ss_pred ccceeeeeeccceEEEEecCCcEEEeecCceeEEEEeccccch------------------heeeccc-CCceeeeeEcC
Confidence 3446677755555554 4456999999999988877553221 1233333 78999999999
Q ss_pred ccCceeeEe-Cc-EEEEeCCCCcccccccC---------CCCcEEEEeeCCCceEE-EEEcCeEEEEEEcCCC-ceeEee
Q 003405 86 SRQLLLSLS-ES-IAFHRLPNLETIAVLTK---------AKGANVYSWDDRRGFLC-FARQKRVCIFRHDGGR-GFVEVK 152 (823)
Q Consensus 86 ~~~~Ll~l~-d~-l~~~~L~~l~~~~~i~~---------~kg~~~fa~~~~~~~l~-V~~kkki~l~~~~~~~-~f~~~k 152 (823)
+.++|.+.+ || |.+|++.+.....++.. .+-|+-.++.++.|.++ +++++.|.+|...+.. .|. ++
T Consensus 149 ~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~-Lr 227 (933)
T KOG1274|consen 149 KGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSRKGWELQFK-LR 227 (933)
T ss_pred CCCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEccCCceehee-ec
Confidence 999988875 56 99999987543322222 23455567788877765 4599999999987422 232 22
Q ss_pred eecCCCCceEEEec--CCeEEEEEc-CceEEEEcCC
Q 003405 153 DFGVPDTVKSMSWC--GENICIAIR-KGYMILNATN 185 (823)
Q Consensus 153 ei~~~~~~~~l~~~--~~~i~v~~~-~~y~lidl~~ 185 (823)
.=........++|. |..|..++. ++..+.|.++
T Consensus 228 ~~~~ss~~~~~~wsPnG~YiAAs~~~g~I~vWnv~t 263 (933)
T KOG1274|consen 228 DKLSSSKFSDLQWSPNGKYIAASTLDGQILVWNVDT 263 (933)
T ss_pred ccccccceEEEEEcCCCcEEeeeccCCcEEEEeccc
Confidence 21223347778887 566777666 5677888773
No 38
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=97.10 E-value=0.0042 Score=58.96 Aligned_cols=88 Identities=27% Similarity=0.355 Sum_probs=71.3
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhc-CCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccc
Q 003405 506 ILDTALLQALLLTGQSSAALELLK-GLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDE 584 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~-~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~ 584 (823)
.+.|.|+.+|++.++. .+.++++ ..+..|++.+...|.+++.+++.+.+|.+.|+|++|+++..+..
T Consensus 42 ~~~~~li~ly~~~~~~-~ll~~l~~~~~~yd~~~~~~~c~~~~l~~~~~~l~~k~~~~~~Al~~~l~~~----------- 109 (140)
T smart00299 42 ALQTKLIELYAKYDPQ-KEIERLDNKSNHYDIEKVGKLCEKAKLYEEAVELYKKDGNFKDAIVTLIEHL----------- 109 (140)
T ss_pred hHHHHHHHHHHHHCHH-HHHHHHHhccccCCHHHHHHHHHHcCcHHHHHHHHHhhcCHHHHHHHHHHcc-----------
Confidence 5889999999998764 5668887 77889999999999999999999999999999999999988631
Q ss_pred ccccCChHHHHHHhhcCCCCChhhHHH
Q 003405 585 HTQKFNPESIIEYLKPLCGTDPMLVLE 611 (823)
Q Consensus 585 ~~~~~~~~~~i~yL~~L~~~~~~li~~ 611 (823)
..++.+++|.++- .+.++...
T Consensus 110 ----~d~~~a~~~~~~~--~~~~lw~~ 130 (140)
T smart00299 110 ----GNYEKAIEYFVKQ--NNPELWAE 130 (140)
T ss_pred ----cCHHHHHHHHHhC--CCHHHHHH
Confidence 1357889998852 34444433
No 39
>PTZ00420 coronin; Provisional
Probab=97.09 E-value=0.25 Score=57.74 Aligned_cols=118 Identities=18% Similarity=0.373 Sum_probs=78.7
Q ss_pred eecCCCCCCeeEEEEecc-cCceeeEeC-c-EEEEeCCCCc--------cccccc-CCCCcEEEEeeCCCceE-EEE-Ec
Q 003405 69 TISGFSKKPILSMEVLAS-RQLLLSLSE-S-IAFHRLPNLE--------TIAVLT-KAKGANVYSWDDRRGFL-CFA-RQ 134 (823)
Q Consensus 69 ~~~~~~k~~I~qI~~~~~-~~~Ll~l~d-~-l~~~~L~~l~--------~~~~i~-~~kg~~~fa~~~~~~~l-~V~-~k 134 (823)
.+.++ +.+|..+..-|. .++|++.++ + |++|++++-. +...+. ..+.++.++++++...+ +.+ ..
T Consensus 69 ~L~gH-~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~D 147 (568)
T PTZ00420 69 KLKGH-TSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFD 147 (568)
T ss_pred EEcCC-CCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCC
Confidence 34443 679999999986 567777775 5 9999997421 111121 23467888888865444 444 56
Q ss_pred CeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEc-CceEEEEcCCCCeee
Q 003405 135 KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIR-KGYMILNATNGALSE 190 (823)
Q Consensus 135 kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~~~~ 190 (823)
+.|.||.+..+. ....+..++.+.+++|. |+.++.++. +...++|+.+++...
T Consensus 148 gtIrIWDl~tg~---~~~~i~~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~ 203 (568)
T PTZ00420 148 SFVNIWDIENEK---RAFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIAS 203 (568)
T ss_pred CeEEEEECCCCc---EEEEEecCCcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEE
Confidence 788888887442 22334456789999997 556666664 568999999886543
No 40
>PTZ00420 coronin; Provisional
Probab=97.07 E-value=0.13 Score=60.15 Aligned_cols=156 Identities=15% Similarity=0.198 Sum_probs=94.5
Q ss_pred CCcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCcee-
Q 003405 16 SPKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLL- 91 (823)
Q Consensus 16 ~~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll- 91 (823)
...|.|++.+ ++.|+.|+.||.|.+|++....... .........+.+ +..+|..|..-|....++
T Consensus 74 ~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~----------~~i~~p~~~L~g-H~~~V~sVaf~P~g~~iLa 142 (568)
T PTZ00420 74 TSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESV----------KEIKDPQCILKG-HKKKISIIDWNPMNYYIMC 142 (568)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccc----------cccccceEEeec-CCCcEEEEEECCCCCeEEE
Confidence 4579998876 3588999999999999986432100 000011122333 357899999999776544
Q ss_pred eEe-Cc-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeecCCCC-ceE-EEec
Q 003405 92 SLS-ES-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFGVPDT-VKS-MSWC 166 (823)
Q Consensus 92 ~l~-d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~~~~~-~~~-l~~~ 166 (823)
+.+ |+ |++|++.+-+....+.....+++++++++...++++. .++|.||....+. .+.++..... +.+ ..|.
T Consensus 143 SgS~DgtIrIWDl~tg~~~~~i~~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~---~i~tl~gH~g~~~s~~v~~ 219 (568)
T PTZ00420 143 SSGFDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQE---IASSFHIHDGGKNTKNIWI 219 (568)
T ss_pred EEeCCCeEEEEECCCCcEEEEEecCCcEEEEEECCCCCEEEEEecCCEEEEEECCCCc---EEEEEecccCCceeEEEEe
Confidence 444 45 9999997765444443345678888888766677664 5678888877442 2233333322 211 1222
Q ss_pred ------CCeEEE-EEcC----ceEEEEcCC
Q 003405 167 ------GENICI-AIRK----GYMILNATN 185 (823)
Q Consensus 167 ------~~~i~v-~~~~----~y~lidl~~ 185 (823)
++.|+. |..+ .+.+.|+.+
T Consensus 220 ~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~ 249 (568)
T PTZ00420 220 DGLGGDDNYILSTGFSKNNMREMKLWDLKN 249 (568)
T ss_pred eeEcCCCCEEEEEEcCCCCccEEEEEECCC
Confidence 234443 4442 589999885
No 41
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL; EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL; EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.06 E-value=0.22 Score=55.09 Aligned_cols=238 Identities=14% Similarity=0.231 Sum_probs=141.0
Q ss_pred eCCEEEEEeC----CCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe--Cc-E
Q 003405 25 YGLKILLGCS----DGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS--ES-I 97 (823)
Q Consensus 25 ~~~~L~vGT~----~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~--d~-l 97 (823)
.++.||+.++ .|.|..|.+..+.. ..+.+.+.... ...-.+|.+.+....|++-. ++ +
T Consensus 47 ~~~~LY~~~e~~~~~g~v~~~~i~~~~g--------------~L~~~~~~~~~-g~~p~~i~~~~~g~~l~vany~~g~v 111 (345)
T PF10282_consen 47 DGRRLYVVNEGSGDSGGVSSYRIDPDTG--------------TLTLLNSVPSG-GSSPCHIAVDPDGRFLYVANYGGGSV 111 (345)
T ss_dssp TSSEEEEEETTSSTTTEEEEEEEETTTT--------------EEEEEEEEEES-SSCEEEEEECTTSSEEEEEETTTTEE
T ss_pred CCCEEEEEEccccCCCCEEEEEECCCcc--------------eeEEeeeeccC-CCCcEEEEEecCCCEEEEEEccCCeE
Confidence 5689999987 57999998876421 22333333322 34556788888877777654 23 9
Q ss_pred EEEeCCCC---cccc-c-----------ccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCC-ceeEeeeecC--C
Q 003405 98 AFHRLPNL---ETIA-V-----------LTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGR-GFVEVKDFGV--P 157 (823)
Q Consensus 98 ~~~~L~~l---~~~~-~-----------i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~-~f~~~kei~~--~ 157 (823)
.+|++..- .... . ......++.+.++++...++|+ ...+|.+|.++... .+.....+.+ .
T Consensus 112 ~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G 191 (345)
T PF10282_consen 112 SVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPG 191 (345)
T ss_dssp EEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTT
T ss_pred EEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccC
Confidence 99998652 1111 0 1233467888888887777776 45789999998543 3554444444 4
Q ss_pred CCceEEEec--CCeEEEEEc--CceEEEEcC--CCCee-----eccCCCCCC--CCEEEE-ccCCeEEEEeC----CeEE
Q 003405 158 DTVKSMSWC--GENICIAIR--KGYMILNAT--NGALS-----EVFPSGRIG--PPLVVS-LLSGELLLGKE----NIGV 219 (823)
Q Consensus 158 ~~~~~l~~~--~~~i~v~~~--~~y~lidl~--~~~~~-----~L~~~~~~~--~p~i~~-~~~~EfLL~~~----~~gv 219 (823)
..|+.|.|. |..++|.+. +...++++. +|... ...+.+-.. .|.-+. .+++.||.+.+ ...+
T Consensus 192 ~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~v 271 (345)
T PF10282_consen 192 SGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISV 271 (345)
T ss_dssp SSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEE
T ss_pred CCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEE
Confidence 789999998 457878775 567777877 55332 222222111 244333 46788887542 3445
Q ss_pred E-Ec-CCCccccCCceeecC-CCcEEEE--eCCEEEEEeC--CeEEEEEcc-CCCceeE---EEeeCCc
Q 003405 220 F-VD-QNGKLLQADRICWSE-APIAVII--QKPYAIALLP--RRVEVRSLR-VPYALIQ---TIVLQNV 277 (823)
Q Consensus 220 f-v~-~~G~~~~~~~i~w~~-~P~~v~~--~~PYll~~~~--~~ieV~~l~-~~~~lvQ---~i~l~~~ 277 (823)
| +| .+|.......+.-.+ .|+.+++ ..-||++... +.|.|+.+. +++.+.+ .+.++++
T Consensus 272 f~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~~tG~l~~~~~~~~~~~p 340 (345)
T PF10282_consen 272 FDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDPDTGKLTPVGSSVPIPSP 340 (345)
T ss_dssp EEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEEESSSE
T ss_pred EEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeCCCCcEEEecccccCCCC
Confidence 5 43 345554323333323 4999998 5678888875 578888773 2354433 3445443
No 42
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=97.05 E-value=0.021 Score=57.06 Aligned_cols=161 Identities=14% Similarity=0.148 Sum_probs=106.9
Q ss_pred cccccccccCCCCcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEE
Q 003405 5 AFDSLELISNCSPKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 5 af~~~~l~~~~~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
+|.+..+.+.- ...+++++.| +.|||+|..+.|-+|++..+.. .+..+|... .+|.+|..
T Consensus 7 ~F~sQ~v~~~~-~EP~~~c~~g~d~Lfva~~g~~Vev~~l~~~~~----------------~~~~~F~Tv--~~V~~l~y 67 (215)
T PF14761_consen 7 PFGSQNVVPCE-QEPTAVCCGGPDALFVAASGCKVEVYDLEQEEC----------------PLLCTFSTV--GRVLQLVY 67 (215)
T ss_pred ccCCceeeccc-cCcceeeccCCceEEEEcCCCEEEEEEcccCCC----------------ceeEEEcch--hheeEEEe
Confidence 56666555443 2556666667 9999999999999999883221 223344433 68999999
Q ss_pred ecccCceeeEeC-c-------EEEEe-CCC----Cccc-----------------------ccccCCCCcEEEEeeCCCc
Q 003405 84 LASRQLLLSLSE-S-------IAFHR-LPN----LETI-----------------------AVLTKAKGANVYSWDDRRG 127 (823)
Q Consensus 84 ~~~~~~Ll~l~d-~-------l~~~~-L~~----l~~~-----------------------~~i~~~kg~~~fa~~~~~~ 127 (823)
.+.++.+++|-+ + +++|- +.. -+++ -.++-....++++..+.+|
T Consensus 68 ~~~GDYlvTlE~k~~~~~~~fvR~Y~NWr~~~~~~~~v~vRiaG~~v~~~~~~~~~~qleiiElPl~~~p~ciaCC~~tG 147 (215)
T PF14761_consen 68 SEAGDYLVTLEEKNKRSPVDFVRAYFNWRSQKEENSPVRVRIAGHRVTPSFNESSKDQLEIIELPLSEPPLCIACCPVTG 147 (215)
T ss_pred ccccceEEEEEeecCCccceEEEEEEEhhhhcccCCcEEEEEcccccccCCCCccccceEEEEecCCCCCCEEEecCCCC
Confidence 999999999965 1 45542 211 1111 0122333566778888889
Q ss_pred eEEEEEcCeEEEEEEcCC----Ccee--Eeee----ecCCCCceEEEecCCeEEEEEcCceEEEEcC
Q 003405 128 FLCFARQKRVCIFRHDGG----RGFV--EVKD----FGVPDTVKSMSWCGENICIAIRKGYMILNAT 184 (823)
Q Consensus 128 ~l~V~~kkki~l~~~~~~----~~f~--~~ke----i~~~~~~~~l~~~~~~i~v~~~~~y~lidl~ 184 (823)
.|+||.++++.||+.... ..+. ...+ +...-.|+.+++.++.|.+.+..+..++-+.
T Consensus 148 ~LlVg~~~~l~lf~l~~~~~~~~~~~~lDFe~~l~~~~~~~~p~~v~ic~~yiA~~s~~ev~Vlkl~ 214 (215)
T PF14761_consen 148 NLLVGCGNKLVLFTLKYQTIQSEKFSFLDFERSLIDHIDNFKPTQVAICEGYIAVMSDLEVLVLKLE 214 (215)
T ss_pred CEEEEcCCEEEEEEEEEEEEecccccEEechhhhhheecCceEEEEEEEeeEEEEecCCEEEEEEEe
Confidence 999999999999987521 1111 1111 1223468899999999999999888877653
No 43
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=97.02 E-value=0.058 Score=55.29 Aligned_cols=152 Identities=16% Similarity=0.218 Sum_probs=103.5
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEEEeccc-C-cee
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSMEVLASR-Q-LLL 91 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~~~~~~-~-~Ll 91 (823)
..+.|++.- +.+++-|+.|-+|..|+.-+.. .|++ ...+ +.=|+++...|.. + +++
T Consensus 106 ~dVlsva~s~dn~qivSGSrDkTiklwnt~g~c---------------k~t~----~~~~~~~WVscvrfsP~~~~p~Iv 166 (315)
T KOG0279|consen 106 KDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGVC---------------KYTI----HEDSHREWVSCVRFSPNESNPIIV 166 (315)
T ss_pred CceEEEEecCCCceeecCCCcceeeeeeecccE---------------EEEE----ecCCCcCcEEEEEEcCCCCCcEEE
Confidence 356666554 4588999999999999865421 2222 1222 4569999999985 3 333
Q ss_pred eEeC-c-EEEEeCCCCccccc-ccCCCCcEEEEeeCCCceEEEEEc--CeEEEEEEcCCCceeEeeeecCCCCceEEEec
Q 003405 92 SLSE-S-IAFHRLPNLETIAV-LTKAKGANVYSWDDRRGFLCFARQ--KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 92 ~l~d-~-l~~~~L~~l~~~~~-i~~~kg~~~fa~~~~~~~l~V~~k--kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~ 166 (823)
..+. + |++|+|.+++..+. +...+-++.++++++. .+|.... .+++++.+..++. +..+.-.++|.+++|.
T Consensus 167 s~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpDG-slcasGgkdg~~~LwdL~~~k~---lysl~a~~~v~sl~fs 242 (315)
T KOG0279|consen 167 SASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPDG-SLCASGGKDGEAMLWDLNEGKN---LYSLEAFDIVNSLCFS 242 (315)
T ss_pred EccCCceEEEEccCCcchhhccccccccEEEEEECCCC-CEEecCCCCceEEEEEccCCce---eEeccCCCeEeeEEec
Confidence 3333 3 99999998875544 4556678899999885 4555433 4566666665432 3344455789999998
Q ss_pred CC--eEEEEEcCceEEEEcCCCCeeec
Q 003405 167 GE--NICIAIRKGYMILNATNGALSEV 191 (823)
Q Consensus 167 ~~--~i~v~~~~~y~lidl~~~~~~~L 191 (823)
.+ .||.|+..+..|.|+.++....-
T Consensus 243 pnrywL~~at~~sIkIwdl~~~~~v~~ 269 (315)
T KOG0279|consen 243 PNRYWLCAATATSIKIWDLESKAVVEE 269 (315)
T ss_pred CCceeEeeccCCceEEEeccchhhhhh
Confidence 65 59999999999999998765433
No 44
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=97.01 E-value=0.084 Score=53.69 Aligned_cols=238 Identities=13% Similarity=0.184 Sum_probs=144.6
Q ss_pred cEEEEE--EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEe
Q 003405 18 KIDAVA--SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLS 94 (823)
Q Consensus 18 ~I~ci~--~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~ 94 (823)
++-+++ ++|..|.=|..++++.+|+++...- ..+.. ..+. ...|.|+.--|.. +++++.+
T Consensus 22 ~v~Sv~wn~~g~~lasgs~dktv~v~n~e~~r~--------------~~~~~--~~gh-~~svdql~w~~~~~d~~atas 84 (313)
T KOG1407|consen 22 KVHSVAWNCDGTKLASGSFDKTVSVWNLERDRF--------------RKELV--YRGH-TDSVDQLCWDPKHPDLFATAS 84 (313)
T ss_pred cceEEEEcccCceeeecccCCceEEEEecchhh--------------hhhhc--ccCC-CcchhhheeCCCCCcceEEec
Confidence 455444 5688999999999999999886421 11111 1122 4579999988877 4555555
Q ss_pred C--cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeeeecCCCCceEEEec-CCeE
Q 003405 95 E--SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC-GENI 170 (823)
Q Consensus 95 d--~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~-~~~i 170 (823)
. .+.+|+...-+++..+....+-...+..++.+.+||+.| ..|.++... .++..++..++-.+.-++|. .+.+
T Consensus 85 ~dk~ir~wd~r~~k~~~~i~~~~eni~i~wsp~g~~~~~~~kdD~it~id~r---~~~~~~~~~~~~e~ne~~w~~~nd~ 161 (313)
T KOG1407|consen 85 GDKTIRIWDIRSGKCTARIETKGENINITWSPDGEYIAVGNKDDRITFIDAR---TYKIVNEEQFKFEVNEISWNNSNDL 161 (313)
T ss_pred CCceEEEEEeccCcEEEEeeccCcceEEEEcCCCCEEEEecCcccEEEEEec---ccceeehhcccceeeeeeecCCCCE
Confidence 4 399999988877777665555555677877777888755 455555544 34445555555556667776 3456
Q ss_pred EEEEcC-c-eEEEEcCCCCeeeccCCCCCCCCEEE--EccCCeEE-EEe-CCeEEEEcCCCccc-c-CCceeecCCCcEE
Q 003405 171 CIAIRK-G-YMILNATNGALSEVFPSGRIGPPLVV--SLLSGELL-LGK-ENIGVFVDQNGKLL-Q-ADRICWSEAPIAV 242 (823)
Q Consensus 171 ~v~~~~-~-y~lidl~~~~~~~L~~~~~~~~p~i~--~~~~~EfL-L~~-~~~gvfv~~~G~~~-~-~~~i~w~~~P~~v 242 (823)
++.+.. + ..|+.-- ...++.........|++ .-++|.++ ++. |...=.-|.+--.. | -+.+.|+-.-.++
T Consensus 162 Fflt~GlG~v~ILsyp--sLkpv~si~AH~snCicI~f~p~GryfA~GsADAlvSLWD~~ELiC~R~isRldwpVRTlSF 239 (313)
T KOG1407|consen 162 FFLTNGLGCVEILSYP--SLKPVQSIKAHPSNCICIEFDPDGRYFATGSADALVSLWDVDELICERCISRLDWPVRTLSF 239 (313)
T ss_pred EEEecCCceEEEEecc--ccccccccccCCcceEEEEECCCCceEeeccccceeeccChhHhhhheeeccccCceEEEEe
Confidence 666653 2 3343332 11122111111123433 33566554 443 33333334333211 1 2457787777788
Q ss_pred EEeCCEEEEEeCC-eEEEEEccCCCceeEEEeeCCcc
Q 003405 243 IIQKPYAIALLPR-RVEVRSLRVPYALIQTIVLQNVR 278 (823)
Q Consensus 243 ~~~~PYll~~~~~-~ieV~~l~~~~~lvQ~i~l~~~~ 278 (823)
.+..-||-..+++ .|.|-.+. ++.-+..|++.+..
T Consensus 240 S~dg~~lASaSEDh~IDIA~ve-tGd~~~eI~~~~~t 275 (313)
T KOG1407|consen 240 SHDGRMLASASEDHFIDIAEVE-TGDRVWEIPCEGPT 275 (313)
T ss_pred ccCcceeeccCccceEEeEecc-cCCeEEEeeccCCc
Confidence 8888999888874 78898886 79888889887753
No 45
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.98 E-value=0.09 Score=57.36 Aligned_cols=253 Identities=15% Similarity=0.171 Sum_probs=143.0
Q ss_pred CcEEEEEEeCC--EEEEEeCCCcEEEEcCCCCCCC-CCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 17 PKIDAVASYGL--KILLGCSDGSLKIYSPGSSESD-RSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 17 ~~I~ci~~~~~--~L~vGT~~G~l~~y~~~~~~~~-~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
..|+|++...+ ++|=+..+|+|..|++...... ..-++|- -++ ..+...+.-..-+.+-|..+.+-++ +..++.
T Consensus 143 ~s~~~vals~d~~~~fsask~g~i~kw~v~tgk~~~~i~~~~e-v~k-~~~~~~k~~r~~h~keil~~avS~D-gkylat 219 (479)
T KOG0299|consen 143 LSVTSVALSPDDKRVFSASKDGTILKWDVLTGKKDRYIIERDE-VLK-SHGNPLKESRKGHVKEILTLAVSSD-GKYLAT 219 (479)
T ss_pred CcceEEEeeccccceeecCCCcceeeeehhcCcccccccccch-hhh-hccCCCCcccccccceeEEEEEcCC-CcEEEe
Confidence 36777777764 9999999999999987643321 0000000 000 0111111100012355666766666 445555
Q ss_pred eC-c--EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeee-ecCCCCceEEEec-
Q 003405 94 SE-S--IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKD-FGVPDTVKSMSWC- 166 (823)
Q Consensus 94 ~d-~--l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~ke-i~~~~~~~~l~~~- 166 (823)
++ + |.+|+..+++++......+| |...|.-..+..+..+ ..+++.+|.... +..+.. +--++.|.+|.-.
T Consensus 220 gg~d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~---~s~vetlyGHqd~v~~IdaL~ 296 (479)
T KOG0299|consen 220 GGRDRHVQIWDCDTLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQ---LSYVETLYGHQDGVLGIDALS 296 (479)
T ss_pred cCCCceEEEecCcccchhhcccccccceeeeeeecCccceeeeecCCceEEEehhH---hHHHHHHhCCccceeeechhc
Confidence 65 3 88999999988766444443 4445555555555554 778888888773 222222 2346778887665
Q ss_pred -CCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeCCeE-----------EEE--cCCCccccCC
Q 003405 167 -GENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKENIG-----------VFV--DQNGKLLQAD 231 (823)
Q Consensus 167 -~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~~g-----------vfv--~~~G~~~~~~ 231 (823)
+..++||-+ +...+..+ ...++-+|..++...-|++.+++++|+.+.++-. +|+ ..+|-.....
T Consensus 297 reR~vtVGgrDrT~rlwKi-~eesqlifrg~~~sidcv~~In~~HfvsGSdnG~IaLWs~~KKkplf~~~~AHgv~~~~~ 375 (479)
T KOG0299|consen 297 RERCVTVGGRDRTVRLWKI-PEESQLIFRGGEGSIDCVAFINDEHFVSGSDNGSIALWSLLKKKPLFTSRLAHGVIPELD 375 (479)
T ss_pred ccceEEeccccceeEEEec-cccceeeeeCCCCCeeeEEEecccceeeccCCceEEEeeecccCceeEeeccccccCCcc
Confidence 568999944 67778888 4456666766665567888999999998876522 332 2333221111
Q ss_pred cee---ecCCCcEEEEeCCEEEEEeCCeEEEEEccCC---CceeEEEeeCC
Q 003405 232 RIC---WSEAPIAVIIQKPYAIALLPRRVEVRSLRVP---YALIQTIVLQN 276 (823)
Q Consensus 232 ~i~---w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~---~~lvQ~i~l~~ 276 (823)
+++ |-.....+.+..-+..+-.++++-+.-+.+. -.+++.+++.+
T Consensus 376 ~~~~~~Witsla~i~~sdL~asGS~~G~vrLW~i~~g~r~i~~l~~ls~~G 426 (479)
T KOG0299|consen 376 PVNGNFWITSLAVIPGSDLLASGSWSGCVRLWKIEDGLRAINLLYSLSLVG 426 (479)
T ss_pred ccccccceeeeEecccCceEEecCCCCceEEEEecCCccccceeeeccccc
Confidence 222 5333222222222222333356777766432 25677777766
No 46
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.98 E-value=0.34 Score=53.04 Aligned_cols=224 Identities=13% Similarity=0.150 Sum_probs=129.1
Q ss_pred EEeCCEEEEEeC-CCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EE
Q 003405 23 ASYGLKILLGCS-DGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IA 98 (823)
Q Consensus 23 ~~~~~~L~vGT~-~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~ 98 (823)
...++.||+|+. +|.|..|++++.. ......... . .....+|...|..+.+++.+. + +.
T Consensus 43 spd~~~lyv~~~~~~~i~~~~~~~~g---------------~l~~~~~~~-~-~~~p~~i~~~~~g~~l~v~~~~~~~v~ 105 (330)
T PRK11028 43 SPDKRHLYVGVRPEFRVLSYRIADDG---------------ALTFAAESP-L-PGSPTHISTDHQGRFLFSASYNANCVS 105 (330)
T ss_pred CCCCCEEEEEECCCCcEEEEEECCCC---------------ceEEeeeec-C-CCCceEEEECCCCCEEEEEEcCCCeEE
Confidence 334678999864 7888889876311 122222222 1 234678888998888887764 3 99
Q ss_pred EEeCCCC----cccccccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEe----eeecCCCCceEEEecC-
Q 003405 99 FHRLPNL----ETIAVLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEV----KDFGVPDTVKSMSWCG- 167 (823)
Q Consensus 99 ~~~L~~l----~~~~~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~----kei~~~~~~~~l~~~~- 167 (823)
+|++.+. +....+....+++.++++++...++|+ ..++|.+|.+..+..+... ..+...+.|.++.|..
T Consensus 106 v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pd 185 (330)
T PRK11028 106 VSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPN 185 (330)
T ss_pred EEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCC
Confidence 9988632 122222334567888888877777666 4478999999743223211 1233456788999874
Q ss_pred -CeEEEEEc--CceEEEEcCC--CCeee---c--cCCCCC--CCCE-EEEccCCeEEEEeC---C-eEEE-EcCCCc-cc
Q 003405 168 -ENICIAIR--KGYMILNATN--GALSE---V--FPSGRI--GPPL-VVSLLSGELLLGKE---N-IGVF-VDQNGK-LL 228 (823)
Q Consensus 168 -~~i~v~~~--~~y~lidl~~--~~~~~---L--~~~~~~--~~p~-i~~~~~~EfLL~~~---~-~gvf-v~~~G~-~~ 228 (823)
..+++++. +...+++++. ++... + .|.+.. ..|. +...+++.++.+.+ + ..+| ++.+|. ..
T Consensus 186 g~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~ 265 (330)
T PRK11028 186 QQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSEDGSVLS 265 (330)
T ss_pred CCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEE
Confidence 47888875 5677888873 33221 1 221100 1122 33346777776432 2 3333 344442 11
Q ss_pred cCCceeecCCCcEEEEe--CCEEEEEeC--CeEEEEEcc
Q 003405 229 QADRICWSEAPIAVIIQ--KPYAIALLP--RRVEVRSLR 263 (823)
Q Consensus 229 ~~~~i~w~~~P~~v~~~--~PYll~~~~--~~ieV~~l~ 263 (823)
....+.....|..+.+. ..||++... +.|.|+.+.
T Consensus 266 ~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v~v~~~~ 304 (330)
T PRK11028 266 FEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHISVYEID 304 (330)
T ss_pred EeEEEeccccCCceEECCCCCEEEEEEccCCcEEEEEEc
Confidence 12333334467777764 568887664 578888763
No 47
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=96.91 E-value=0.55 Score=50.18 Aligned_cols=221 Identities=13% Similarity=0.184 Sum_probs=131.4
Q ss_pred EeCCEEEEEeCC---CcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-E
Q 003405 24 SYGLKILLGCSD---GSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-I 97 (823)
Q Consensus 24 ~~~~~L~vGT~~---G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l 97 (823)
..+++||++-++ |.+-.|.++.+.. ...+...+. ...++=..+.+.+....+++-.= + |
T Consensus 49 ~~~~~LY~v~~~~~~ggvaay~iD~~~G--------------~Lt~ln~~~-~~g~~p~yvsvd~~g~~vf~AnY~~g~v 113 (346)
T COG2706 49 PDQRHLYVVNEPGEEGGVAAYRIDPDDG--------------RLTFLNRQT-LPGSPPCYVSVDEDGRFVFVANYHSGSV 113 (346)
T ss_pred CCCCEEEEEEecCCcCcEEEEEEcCCCC--------------eEEEeeccc-cCCCCCeEEEECCCCCEEEEEEccCceE
Confidence 334679999665 8888998886432 223333322 22345589999988766665432 3 9
Q ss_pred EEEeCCCC---ccc-ccc----------cCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEeeeecC--CCC
Q 003405 98 AFHRLPNL---ETI-AVL----------TKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVKDFGV--PDT 159 (823)
Q Consensus 98 ~~~~L~~l---~~~-~~i----------~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~kei~~--~~~ 159 (823)
.++.+.+. .+. ..+ +....|++.-++++...+||. .-.||.+|.+.++ .+....+..+ ...
T Consensus 114 ~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~Dri~~y~~~dg-~L~~~~~~~v~~G~G 192 (346)
T COG2706 114 SVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTDRIFLYDLDDG-KLTPADPAEVKPGAG 192 (346)
T ss_pred EEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCceEEEEEcccC-ccccccccccCCCCC
Confidence 99988542 111 111 112236666667766666665 5578999999854 4555444333 469
Q ss_pred ceEEEecCC--eEEEEEc--CceEEEEcCC--CCeeeccCC--------CCCCCCEEEEccCCeEEEEeC----CeEEE-
Q 003405 160 VKSMSWCGE--NICIAIR--KGYMILNATN--GALSEVFPS--------GRIGPPLVVSLLSGELLLGKE----NIGVF- 220 (823)
Q Consensus 160 ~~~l~~~~~--~i~v~~~--~~y~lidl~~--~~~~~L~~~--------~~~~~p~i~~~~~~EfLL~~~----~~gvf- 220 (823)
|+.|.|..+ ..++.+. +...++..++ |+...+-.. |.....-|...+++.||-++| ..++|
T Consensus 193 PRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~ 272 (346)
T COG2706 193 PRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFS 272 (346)
T ss_pred cceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEE
Confidence 999999844 3444443 5566665554 544444322 222222344457899999765 45666
Q ss_pred EcCCCccccCCceee---cC-CCcEEEEe--CCEEEEEeC--CeEEEEEc
Q 003405 221 VDQNGKLLQADRICW---SE-APIAVIIQ--KPYAIALLP--RRVEVRSL 262 (823)
Q Consensus 221 v~~~G~~~~~~~i~w---~~-~P~~v~~~--~PYll~~~~--~~ieV~~l 262 (823)
|+..|... ..+.| .+ .|+.+.+. .-||++... +.|.|+.+
T Consensus 273 V~~~~g~L--~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~ 320 (346)
T COG2706 273 VDPDGGKL--ELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFER 320 (346)
T ss_pred EcCCCCEE--EEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEE
Confidence 77776432 23333 22 38887765 579999987 45777776
No 48
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=96.85 E-value=0.16 Score=56.32 Aligned_cols=172 Identities=9% Similarity=0.141 Sum_probs=110.8
Q ss_pred CCcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 16 SPKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 16 ~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
.+.|.|+++++ +.++.-.=|-+|.+.++..+.-.. ...++ ++.+|. -+.+.+..+.+++.|
T Consensus 363 ~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~----------------~~~~~-lg~QP~-~lav~~d~~~avv~~ 424 (603)
T KOG0318|consen 363 TNQIKGMAASESGELFTIGWDDTLRVISLKDNGYTK----------------SEVVK-LGSQPK-GLAVLSDGGTAVVAC 424 (603)
T ss_pred cceEEEEeecCCCcEEEEecCCeEEEEecccCcccc----------------cceee-cCCCce-eEEEcCCCCEEEEEe
Confidence 46899999998 788777777778877665432210 01111 334555 678888878888888
Q ss_pred C-cEEEEe-CCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC--e
Q 003405 95 E-SIAFHR-LPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE--N 169 (823)
Q Consensus 95 d-~l~~~~-L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~--~ 169 (823)
+ +|.++. ... +..++-.=..+++|++++...+||+ ...++.||.+.++..-.+.+......+|+.+++..+ .
T Consensus 425 ~~~iv~l~~~~~---~~~~~~~y~~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~y 501 (603)
T KOG0318|consen 425 ISDIVLLQDQTK---VSSIPIGYESSAVAVSPDGSEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAY 501 (603)
T ss_pred cCcEEEEecCCc---ceeeccccccceEEEcCCCCEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcE
Confidence 8 476664 322 2223323356788999988889998 667899999997542233455566789999999844 5
Q ss_pred EEEEEc-CceEEEEcCCCCe---eeccCCCCCCCCEEEEccCCeE
Q 003405 170 ICIAIR-KGYMILNATNGAL---SEVFPSGRIGPPLVVSLLSGEL 210 (823)
Q Consensus 170 i~v~~~-~~y~lidl~~~~~---~~L~~~~~~~~p~i~~~~~~Ef 210 (823)
+..|.. +...++|..++.+ ...|..++ -.++.+.++++.
T Consensus 502 la~~Da~rkvv~yd~~s~~~~~~~w~FHtak--I~~~aWsP~n~~ 544 (603)
T KOG0318|consen 502 LAAGDASRKVVLYDVASREVKTNRWAFHTAK--INCVAWSPNNKL 544 (603)
T ss_pred EEEeccCCcEEEEEcccCceecceeeeeeee--EEEEEeCCCceE
Confidence 666555 6788899988755 22233322 234555555553
No 49
>PTZ00421 coronin; Provisional
Probab=96.84 E-value=0.37 Score=55.76 Aligned_cols=156 Identities=14% Similarity=0.224 Sum_probs=97.9
Q ss_pred CCcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Ccee
Q 003405 16 SPKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLL 91 (823)
Q Consensus 16 ~~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll 91 (823)
...|.|++.. ++.|+.|+.||.|.+|++....... ........+.+ +.++|..|..-|.. ++|+
T Consensus 75 ~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~-----------~~~~~l~~L~g-H~~~V~~l~f~P~~~~iLa 142 (493)
T PTZ00421 75 EGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQ-----------NISDPIVHLQG-HTKKVGIVSFHPSAMNVLA 142 (493)
T ss_pred CCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCcccc-----------ccCcceEEecC-CCCcEEEEEeCcCCCCEEE
Confidence 4578988865 3589999999999999987532100 00111223333 36789999999875 5677
Q ss_pred eEe-Cc-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCC--CceEEEe
Q 003405 92 SLS-ES-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPD--TVKSMSW 165 (823)
Q Consensus 92 ~l~-d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~--~~~~l~~ 165 (823)
+-+ |+ |++|++.+-+....+. ....+..++++++...++.+ ..++|.+|....+. .+.++.... .+..+.|
T Consensus 143 Sgs~DgtVrIWDl~tg~~~~~l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~---~v~tl~~H~~~~~~~~~w 219 (493)
T PTZ00421 143 SAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGT---IVSSVEAHASAKSQRCLW 219 (493)
T ss_pred EEeCCCEEEEEECCCCeEEEEEcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCc---EEEEEecCCCCcceEEEE
Confidence 666 45 9999997655444433 23457888888876667776 45678888876442 233333222 2345667
Q ss_pred cC--CeE-EEEEc----CceEEEEcCCC
Q 003405 166 CG--ENI-CIAIR----KGYMILNATNG 186 (823)
Q Consensus 166 ~~--~~i-~v~~~----~~y~lidl~~~ 186 (823)
.. +.+ .+|.. +.+.+.|+.+.
T Consensus 220 ~~~~~~ivt~G~s~s~Dr~VklWDlr~~ 247 (493)
T PTZ00421 220 AKRKDLIITLGCSKSQQRQIMLWDTRKM 247 (493)
T ss_pred cCCCCeEEEEecCCCCCCeEEEEeCCCC
Confidence 63 333 34432 46888898754
No 50
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=96.77 E-value=0.052 Score=56.35 Aligned_cols=150 Identities=13% Similarity=0.289 Sum_probs=95.3
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEe--Cc-EEEEe
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLS--ES-IAFHR 101 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~--d~-l~~~~ 101 (823)
|+-|...+.|+++.+|.-....... ..+.+....+.. -++..|+.|+..|.. ++.++.+ || |++|.
T Consensus 73 GqvvA~cS~Drtv~iWEE~~~~~~~---------~~~~Wv~~ttl~-DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYE 142 (361)
T KOG2445|consen 73 GQVVATCSYDRTVSIWEEQEKSEEA---------HGRRWVRRTTLV-DSRSSVTDVKFAPKHLGLKLAAASADGILRIYE 142 (361)
T ss_pred cceEEEEecCCceeeeeeccccccc---------ccceeEEEEEee-cCCcceeEEEecchhcceEEEEeccCcEEEEEe
Confidence 5678888999999999743222110 112343332222 347899999999976 5544443 46 89998
Q ss_pred CCCC---c------cccccc--CC-CCcEEEEeeCCC-----ceEEEEEcC------eEEEEEEcC-CCceeEeeee-cC
Q 003405 102 LPNL---E------TIAVLT--KA-KGANVYSWDDRR-----GFLCFARQK------RVCIFRHDG-GRGFVEVKDF-GV 156 (823)
Q Consensus 102 L~~l---~------~~~~i~--~~-kg~~~fa~~~~~-----~~l~V~~kk------ki~l~~~~~-~~~f~~~kei-~~ 156 (823)
.++. . .+..+. .. ..-.+||++-+. ..|+||... ++.||+... ++.+.++-++ ..
T Consensus 143 A~dp~nLs~W~Lq~Ei~~~~~pp~~~~~~~~CvsWn~sr~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~ 222 (361)
T KOG2445|consen 143 APDPMNLSQWTLQHEIQNVIDPPGKNKQPCFCVSWNPSRMHEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELPDH 222 (361)
T ss_pred cCCccccccchhhhhhhhccCCcccccCcceEEeeccccccCceEEEEcccCCccccceEEEEecCCcceeeeehhcCCC
Confidence 7642 1 111111 11 122356664333 358998776 899999975 3356666564 45
Q ss_pred CCCceEEEecC------CeEEEEEcCceEEEEcCC
Q 003405 157 PDTVKSMSWCG------ENICIAIRKGYMILNATN 185 (823)
Q Consensus 157 ~~~~~~l~~~~------~~i~v~~~~~y~lidl~~ 185 (823)
+++|+.++|.. ..|.+|++.+..|+++..
T Consensus 223 ~dpI~di~wAPn~Gr~y~~lAvA~kDgv~I~~v~~ 257 (361)
T KOG2445|consen 223 TDPIRDISWAPNIGRSYHLLAVATKDGVRIFKVKV 257 (361)
T ss_pred CCcceeeeeccccCCceeeEEEeecCcEEEEEEee
Confidence 78999999973 259999999999999875
No 51
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=96.72 E-value=0.027 Score=58.42 Aligned_cols=200 Identities=15% Similarity=0.252 Sum_probs=117.3
Q ss_pred CCcEEEEE--EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVA--SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~--~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
+..++|.- ..+++|+-|+-||.|-+|+-....-. +...|+..-.|. ....+|..|....+.++|.+=
T Consensus 213 KSh~EcA~FSPDgqyLvsgSvDGFiEVWny~~GKlr----------KDLkYQAqd~fM-Mmd~aVlci~FSRDsEMlAsG 281 (508)
T KOG0275|consen 213 KSHVECARFSPDGQYLVSGSVDGFIEVWNYTTGKLR----------KDLKYQAQDNFM-MMDDAVLCISFSRDSEMLASG 281 (508)
T ss_pred ccchhheeeCCCCceEeeccccceeeeehhccchhh----------hhhhhhhhccee-ecccceEEEeecccHHHhhcc
Confidence 34677743 45789999999999999985532211 111222111221 125789999988888888876
Q ss_pred e-Cc-EEEEeCCCCccccc--ccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC-CCCceEEEec-
Q 003405 94 S-ES-IAFHRLPNLETIAV--LTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWC- 166 (823)
Q Consensus 94 ~-d~-l~~~~L~~l~~~~~--i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~- 166 (823)
+ || |++|.+.+-.-+.. -...||+++...+.+.+.+.-+ ....+.+--.+.++. +||+.- ..-+....|.
T Consensus 282 sqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~---LKEfrGHsSyvn~a~ft~ 358 (508)
T KOG0275|consen 282 SQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKC---LKEFRGHSSYVNEATFTD 358 (508)
T ss_pred CcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchh---HHHhcCccccccceEEcC
Confidence 6 35 99999865322211 1346899998888777655443 455555555554432 344322 2234444444
Q ss_pred -CCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCC--EEEEccC--CeEEEE-eCCeEEEEcCCCcccc
Q 003405 167 -GENICIAIR-KGYMILNATNGALSEVFPSGRIGPP--LVVSLLS--GELLLG-KENIGVFVDQNGKLLQ 229 (823)
Q Consensus 167 -~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p--~i~~~~~--~EfLL~-~~~~gvfv~~~G~~~~ 229 (823)
|+.|.-++. ....+.+..++....-|.++....| .+..++. ..|++| +.+..+++|.+|..+|
T Consensus 359 dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsntv~imn~qGQvVr 428 (508)
T KOG0275|consen 359 DGHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVR 428 (508)
T ss_pred CCCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCCeEEEEeccceEEe
Confidence 556655555 5577778877765544433222222 2334443 356677 4567778899998764
No 52
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=96.69 E-value=0.06 Score=55.94 Aligned_cols=128 Identities=17% Similarity=0.195 Sum_probs=86.5
Q ss_pred cEEEEEEe-CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C
Q 003405 18 KIDAVASY-GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E 95 (823)
Q Consensus 18 ~I~ci~~~-~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d 95 (823)
.|.|.+-. +..+++|+-||.|..|+++..... +-+-+..+|..|.-.+..+.+++=+ |
T Consensus 56 plL~c~F~d~~~~~~G~~dg~vr~~Dln~~~~~--------------------~igth~~~i~ci~~~~~~~~vIsgsWD 115 (323)
T KOG1036|consen 56 PLLDCAFADESTIVTGGLDGQVRRYDLNTGNED--------------------QIGTHDEGIRCIEYSYEVGCVISGSWD 115 (323)
T ss_pred ceeeeeccCCceEEEeccCceEEEEEecCCcce--------------------eeccCCCceEEEEeeccCCeEEEcccC
Confidence 45554444 468999999999999998754321 1123467999999998888777654 4
Q ss_pred c-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC
Q 003405 96 S-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG 167 (823)
Q Consensus 96 ~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~ 167 (823)
+ |++|+...-.........| ..||.+....+|+|| ..+++.+|.+..-..+-..+|-.+.-.+++++...
T Consensus 116 ~~ik~wD~R~~~~~~~~d~~k--kVy~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~p 187 (323)
T KOG1036|consen 116 KTIKFWDPRNKVVVGTFDQGK--KVYCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVP 187 (323)
T ss_pred ccEEEEeccccccccccccCc--eEEEEeccCCEEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEec
Confidence 3 9999976421222222223 567777666789996 77889999988533343456767777888887763
No 53
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=96.61 E-value=0.77 Score=46.45 Aligned_cols=226 Identities=13% Similarity=0.072 Sum_probs=132.0
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-EEE
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-IAF 99 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-l~~ 99 (823)
++...++++++|.++|.|.++.+..-.+.+. ..++-..+-.+++ +..||..+..- ...|++.+|| |+-
T Consensus 17 a~sp~~~~l~agn~~G~iav~sl~sl~s~sa--------~~~gk~~iv~eqa-hdgpiy~~~f~--d~~Lls~gdG~V~g 85 (325)
T KOG0649|consen 17 AISPSKQYLFAGNLFGDIAVLSLKSLDSGSA--------EPPGKLKIVPEQA-HDGPIYYLAFH--DDFLLSGGDGLVYG 85 (325)
T ss_pred hhCCcceEEEEecCCCeEEEEEehhhhcccc--------CCCCCcceeeccc-cCCCeeeeeee--hhheeeccCceEEE
Confidence 4566778999999999999999886544321 0011111112233 36788888655 4678888887 888
Q ss_pred EeCCCCcc-------------ccc-ccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcC-CCceeEeeeecC-CCCceEE
Q 003405 100 HRLPNLET-------------IAV-LTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDG-GRGFVEVKDFGV-PDTVKSM 163 (823)
Q Consensus 100 ~~L~~l~~-------------~~~-i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~-~~~f~~~kei~~-~~~~~~l 163 (823)
|...+++. ... ......++++.+++..+.+..|..... +|+|+- +..|+ +++.- .|-+.++
T Consensus 86 w~W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgGD~~-~y~~dlE~G~i~--r~~rGHtDYvH~v 162 (325)
T KOG0649|consen 86 WEWNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGGDGV-IYQVDLEDGRIQ--REYRGHTDYVHSV 162 (325)
T ss_pred eeehhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEecCCeE-EEEEEecCCEEE--EEEcCCcceeeee
Confidence 87755321 111 234557889999987777766665444 888863 22242 33322 3667777
Q ss_pred EecC--CeEEEEEc-CceEEEEcCCCCeeeccCCCCC---CCC-----EEEEccCCeEEEEeC--CeEEEEcCCCccccC
Q 003405 164 SWCG--ENICIAIR-KGYMILNATNGALSEVFPSGRI---GPP-----LVVSLLSGELLLGKE--NIGVFVDQNGKLLQA 230 (823)
Q Consensus 164 ~~~~--~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~---~~p-----~i~~~~~~EfLL~~~--~~gvfv~~~G~~~~~ 230 (823)
.-++ ..|+-|.. ....+.|+.|++....+.+-+. .+| +.+.-.+..-|+|-+ +..++-=..-++ .
T Consensus 163 v~R~~~~qilsG~EDGtvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgGp~lslwhLrsse~--t 240 (325)
T KOG0649|consen 163 VGRNANGQILSGAEDGTVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGGPKLSLWHLRSSES--T 240 (325)
T ss_pred eecccCcceeecCCCccEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecCCCceeEEeccCCCc--e
Confidence 6653 35666666 4578899999876555433211 233 222234556777654 345553333332 2
Q ss_pred CceeecCCCcEEEEeCCEEEEEeC-CeEEEEEc
Q 003405 231 DRICWSEAPIAVIIQKPYAIALLP-RRVEVRSL 262 (823)
Q Consensus 231 ~~i~w~~~P~~v~~~~PYll~~~~-~~ieV~~l 262 (823)
..+.++.....+.|..--+++... +.++-+.+
T Consensus 241 ~vfpipa~v~~v~F~~d~vl~~G~g~~v~~~~l 273 (325)
T KOG0649|consen 241 CVFPIPARVHLVDFVDDCVLIGGEGNHVQSYTL 273 (325)
T ss_pred EEEecccceeEeeeecceEEEeccccceeeeee
Confidence 455566666666666666666653 44554444
No 54
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.54 E-value=0.35 Score=59.90 Aligned_cols=149 Identities=11% Similarity=0.192 Sum_probs=93.2
Q ss_pred CcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec-ccCceee
Q 003405 17 PKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA-SRQLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~-~~~~Ll~ 92 (823)
..|+|++.. ++.|+.|+.||.|.+|++... .....+.+ +..+|..+..-+ ..++|++
T Consensus 533 ~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~------------------~~~~~~~~-H~~~V~~l~~~p~~~~~L~S 593 (793)
T PLN00181 533 SKLSGICWNSYIKSQVASSNFEGVVQVWDVARS------------------QLVTEMKE-HEKRVWSIDYSSADPTLLAS 593 (793)
T ss_pred CceeeEEeccCCCCEEEEEeCCCeEEEEECCCC------------------eEEEEecC-CCCCEEEEEEcCCCCCEEEE
Confidence 356776653 468999999999999997632 12233333 367899999987 4566776
Q ss_pred EeC-c-EEEEeCCCCcccccccCCCCcEEEEeeCCC-ceEEEE-EcCeEEEEEEcCCCceeEeeee-cCCCCceEEEecC
Q 003405 93 LSE-S-IAFHRLPNLETIAVLTKAKGANVYSWDDRR-GFLCFA-RQKRVCIFRHDGGRGFVEVKDF-GVPDTVKSMSWCG 167 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~-~~l~V~-~kkki~l~~~~~~~~f~~~kei-~~~~~~~~l~~~~ 167 (823)
.++ + |++|++.+-..+..+.....+.+++..... ..++++ ..+.|.+|.....+. ....+ .-...+.++.|.+
T Consensus 594 gs~Dg~v~iWd~~~~~~~~~~~~~~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~--~~~~~~~h~~~V~~v~f~~ 671 (793)
T PLN00181 594 GSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKL--PLCTMIGHSKTVSYVRFVD 671 (793)
T ss_pred EcCCCEEEEEECCCCcEEEEEecCCCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCc--cceEecCCCCCEEEEEEeC
Confidence 664 5 999999765544444333455566654433 356666 456677777653321 11222 2235788899874
Q ss_pred -CeEEEEEc-CceEEEEcCCC
Q 003405 168 -ENICIAIR-KGYMILNATNG 186 (823)
Q Consensus 168 -~~i~v~~~-~~y~lidl~~~ 186 (823)
+.++.|.. ....+.|+.++
T Consensus 672 ~~~lvs~s~D~~ikiWd~~~~ 692 (793)
T PLN00181 672 SSTLVSSSTDNTLKLWDLSMS 692 (793)
T ss_pred CCEEEEEECCCEEEEEeCCCC
Confidence 45555544 56788898753
No 55
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=96.47 E-value=2.1 Score=47.95 Aligned_cols=263 Identities=13% Similarity=0.125 Sum_probs=158.6
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~ 96 (823)
.+-|+-. .++|+.=.-+|+|-.++....+ ..+.+.++ .|+|+.+.+.++...+++-+ ||
T Consensus 283 qvG~lWq-kd~lItVSl~G~in~ln~~d~~------------------~~~~i~GH-nK~ITaLtv~~d~~~i~SgsyDG 342 (603)
T KOG0318|consen 283 QVGCLWQ-KDHLITVSLSGTINYLNPSDPS------------------VLKVISGH-NKSITALTVSPDGKTIYSGSYDG 342 (603)
T ss_pred EEEEEEe-CCeEEEEEcCcEEEEecccCCC------------------hhheeccc-ccceeEEEEcCCCCEEEeeccCc
Confidence 4566655 6778888889998888754322 12334444 68999999999998888776 56
Q ss_pred -EEEEeCCCCccccc--ccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC--eEE
Q 003405 97 -IAFHRLPNLETIAV--LTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE--NIC 171 (823)
Q Consensus 97 -l~~~~L~~l~~~~~--i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~--~i~ 171 (823)
|.-|+..+-..-.. -.....++.++..+....+.++-...+.+..+.++ .+..-.-+.++..|++++...+ .++
T Consensus 343 ~I~~W~~~~g~~~~~~g~~h~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~-~~t~~~~~~lg~QP~~lav~~d~~~av 421 (603)
T KOG0318|consen 343 HINSWDSGSGTSDRLAGKGHTNQIKGMAASESGELFTIGWDDTLRVISLKDN-GYTKSEVVKLGSQPKGLAVLSDGGTAV 421 (603)
T ss_pred eEEEEecCCccccccccccccceEEEEeecCCCcEEEEecCCeEEEEecccC-cccccceeecCCCceeEEEcCCCCEEE
Confidence 88898865322111 11233466677766555677889999988888743 3443333678889999998844 888
Q ss_pred EEEcCceEEEEcCCCCeeeccCCCCCCCCEEEE-ccCCeEEEE-eCCeEEEEcCCCcccc--CCceeecCCCcEEEEeC-
Q 003405 172 IAIRKGYMILNATNGALSEVFPSGRIGPPLVVS-LLSGELLLG-KENIGVFVDQNGKLLQ--ADRICWSEAPIAVIIQK- 246 (823)
Q Consensus 172 v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~-~~~~EfLL~-~~~~gvfv~~~G~~~~--~~~i~w~~~P~~v~~~~- 246 (823)
+++.++..++.-.++-. -.+.+=. .+++.. .+..|+.++ .|.....+...|.... .-.+.-.+++..++|..
T Consensus 422 v~~~~~iv~l~~~~~~~--~~~~~y~-~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd 498 (603)
T KOG0318|consen 422 VACISDIVLLQDQTKVS--SIPIGYE-SSAVAVSPDGSEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPD 498 (603)
T ss_pred EEecCcEEEEecCCcce--eeccccc-cceEEEcCCCCEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCC
Confidence 99999988887443321 1122211 233433 345688885 4555666777774321 12344456677777753
Q ss_pred -CEEEEEeC-CeEEEEEccCCCce-eEEEeeCCccccc--ccCCeEEEec---cceEEEeeccChhH
Q 003405 247 -PYAIALLP-RRVEVRSLRVPYAL-IQTIVLQNVRHLI--PSSNAVVVAL---ENSIFGLFPVPLGA 305 (823)
Q Consensus 247 -PYll~~~~-~~ieV~~l~~~~~l-vQ~i~l~~~~~l~--~~~~~v~v~s---~~~I~~l~~~~~~~ 305 (823)
-|+.+-.- +.+.+|++.+ ... --.+.+-..+..+ -..+...+|| ++.|+......+.+
T Consensus 499 ~~yla~~Da~rkvv~yd~~s-~~~~~~~w~FHtakI~~~aWsP~n~~vATGSlDt~Viiysv~kP~~ 564 (603)
T KOG0318|consen 499 GAYLAAGDASRKVVLYDVAS-REVKTNRWAFHTAKINCVAWSPNNKLVATGSLDTNVIIYSVKKPAK 564 (603)
T ss_pred CcEEEEeccCCcEEEEEccc-CceecceeeeeeeeEEEEEeCCCceEEEeccccceEEEEEccChhh
Confidence 46666553 6788999863 322 1123333334333 2334445555 35566655544433
No 56
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.46 E-value=0.33 Score=55.82 Aligned_cols=155 Identities=17% Similarity=0.226 Sum_probs=104.9
Q ss_pred cEEEEEE--eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVAS--YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~--~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
-|.+++. .+..+.=|+.|+++.+|++... ....+.++++ ...|+.+...|..+++++-++
T Consensus 205 ~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~-----------------~~~~~~l~gH-~~~v~~~~f~p~g~~i~Sgs~ 266 (456)
T KOG0266|consen 205 GVSDVAFSPDGSYLLSGSDDKTLRIWDLKDD-----------------GRNLKTLKGH-STYVTSVAFSPDGNLLVSGSD 266 (456)
T ss_pred ceeeeEECCCCcEEEEecCCceEEEeeccCC-----------------CeEEEEecCC-CCceEEEEecCCCCEEEEecC
Confidence 4655443 3568999999999999998322 1234566654 678999999999988888776
Q ss_pred -c-EEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeecCCC---CceEEEec--
Q 003405 96 -S-IAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFGVPD---TVKSMSWC-- 166 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~~~~---~~~~l~~~-- 166 (823)
+ |++|++.+.+....+..- .+++.++.+.+...++.+. ++.|.+|....+ .+...+++.-.+ +++++.|.
T Consensus 267 D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~-~~~~~~~~~~~~~~~~~~~~~fsp~ 345 (456)
T KOG0266|consen 267 DGTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSASYDGTIRVWDLETG-SKLCLKLLSGAENSAPVTSVQFSPN 345 (456)
T ss_pred CCcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEcCCCccEEEEECCCC-ceeeeecccCCCCCCceeEEEECCC
Confidence 4 999999886655444333 3788888888877777774 556667766633 222233333222 45777776
Q ss_pred CCeEEEEEcC-ceEEEEcCCCCeeec
Q 003405 167 GENICIAIRK-GYMILNATNGALSEV 191 (823)
Q Consensus 167 ~~~i~v~~~~-~y~lidl~~~~~~~L 191 (823)
+..++.++.. ...+.|+..+.....
T Consensus 346 ~~~ll~~~~d~~~~~w~l~~~~~~~~ 371 (456)
T KOG0266|consen 346 GKYLLSASLDRTLKLWDLRSGKSVGT 371 (456)
T ss_pred CcEEEEecCCCeEEEEEccCCcceee
Confidence 5567777774 788889887654333
No 57
>KOG0587 consensus Traf2- and Nck-interacting kinase and related germinal center kinase (GCK) family protein kinases [Signal transduction mechanisms]
Probab=96.43 E-value=0.0012 Score=77.33 Aligned_cols=230 Identities=20% Similarity=0.346 Sum_probs=152.6
Q ss_pred cCCCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 13 SNCSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 13 ~~~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
.++.+.|.|++.||.++.+||..|.-+ ++..+.. ..|.+ .+.+.-.|+.+++..+.+.+
T Consensus 637 k~~~se~~~aa~~g~n~~~~t~~gl~l-ld~s~q~--------------k~~~~------i~~rrfqq~~~le~~n~l~t 695 (953)
T KOG0587|consen 637 KRFNSEILCAALWGVNLLVGTESGLML-LDRSGQG--------------KVYPL------INRRRFQQMDVLEGLNVLVT 695 (953)
T ss_pred HhhhhhHHHHHhcCcceeeccccccee-eccccCc--------------ccCCc------ccchhcccccccCCcceeEE
Confidence 455678999999999999999999644 4433221 22322 23567888999999999999
Q ss_pred EeC-c--EEEEeCCCCcc-----cccccCCCCcEEEEeeCC-----------CceEEEEEcCeEEEEEEcCC--CceeEe
Q 003405 93 LSE-S--IAFHRLPNLET-----IAVLTKAKGANVYSWDDR-----------RGFLCFARQKRVCIFRHDGG--RGFVEV 151 (823)
Q Consensus 93 l~d-~--l~~~~L~~l~~-----~~~i~~~kg~~~fa~~~~-----------~~~l~V~~kkki~l~~~~~~--~~f~~~ 151 (823)
.++ . +.+|.+..+.. .+.+++..|-+.+..-.+ -.+++++.+..+.+|-|... ..|...
T Consensus 696 is~~~~~~~~~y~s~~~~k~l~~d~e~ek~~~~~~~~~~~~~~~~~~~k~~~ik~l~is~~~s~evy~~apk~~~k~~~~ 775 (953)
T KOG0587|consen 696 ISGKKDKLRVYYLSWLRNKILHNDPEVEKKQGWTTVGDLEGCIHYKVVKYERIKFLVIALKSSVEVYAWAPKPYHKFMAF 775 (953)
T ss_pred EeccccccceecchHHhhhhhhcCchhhhhccchhhhhhhcchhhhHHHHHHHHHhheeccccceeeecCCchHHHHHhh
Confidence 999 2 77777654321 111222222222221111 02589999999999999742 223334
Q ss_pred eee-cCCCCceEEEec---CCe--EEEEEcCceEEEEcCCCCeeeccCCC---CCCCCEEEE-ccC--C-eEEEEeCCeE
Q 003405 152 KDF-GVPDTVKSMSWC---GEN--ICIAIRKGYMILNATNGALSEVFPSG---RIGPPLVVS-LLS--G-ELLLGKENIG 218 (823)
Q Consensus 152 kei-~~~~~~~~l~~~---~~~--i~v~~~~~y~lidl~~~~~~~L~~~~---~~~~p~i~~-~~~--~-EfLL~~~~~g 218 (823)
+.+ .+++.+..+... ++. +..|+..++..+|...+...++.++. +...|.+.. .++ + +.++|+++.+
T Consensus 776 ~s~~~~~~~~~~~d~~~ee~~~~~v~~gs~~~~~~~~~~~~~~~~v~~~~~~q~~~~~~~~~~~~~~~~~~~l~~~~~e~ 855 (953)
T KOG0587|consen 776 KSFGELVHKPLLVDLTVEEGQRLKVIYGSCAGFHAVDVDSGSVYDIYLPTHIQCSITPHAIIILPNTDGMELLLCYEDEG 855 (953)
T ss_pred hhhhhhcccchhccchhhcCceEEEEecCcccccccccCCCCCCCCcCCcchhhcccceeEecCCCcchHHHhhhhhccc
Confidence 432 455566555443 443 66677789999999998877776553 223344333 332 2 5678999999
Q ss_pred EEEcCCCccccCCceeecCCCcEEEE-eCCEEEEEeCCeEEEEEcc
Q 003405 219 VFVDQNGKLLQADRICWSEAPIAVII-QKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 219 vfv~~~G~~~~~~~i~w~~~P~~v~~-~~PYll~~~~~~ieV~~l~ 263 (823)
+.++.-|+....--.+|-..|.++++ +..-+.+..++.++++++.
T Consensus 856 ~~~~~~~~~~k~v~~~~~~~~Ss~a~~~~~n~~g~~~ka~e~~s~e 901 (953)
T KOG0587|consen 856 VYVNTYGRITKDVVLQWGEMPTSVAYIRSNQIMGWGEKAIEIRSVE 901 (953)
T ss_pred ccccCccchHHHHHHhcCCCCCcceeeecccccccCcccceeeccc
Confidence 99999998654455789999999995 4667778888889999874
No 58
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=96.39 E-value=0.057 Score=54.48 Aligned_cols=164 Identities=17% Similarity=0.232 Sum_probs=117.6
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLP 103 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~ 103 (823)
.++|+-|..+-.|.+|+++....+ + +.+.+. ...|..+.-..+.+.+++-+| + |++|+..
T Consensus 112 s~~lltgg~ekllrvfdln~p~Ap-------------p----~E~~gh-tg~Ir~v~wc~eD~~iLSSadd~tVRLWD~r 173 (334)
T KOG0278|consen 112 SNYLLTGGQEKLLRVFDLNRPKAP-------------P----KEISGH-TGGIRTVLWCHEDKCILSSADDKTVRLWDHR 173 (334)
T ss_pred chhhhccchHHHhhhhhccCCCCC-------------c----hhhcCC-CCcceeEEEeccCceEEeeccCCceEEEEec
Confidence 468999999999999998865421 1 123333 457888887877777777777 3 9999998
Q ss_pred CCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC-eEEEEEcCce--EE
Q 003405 104 NLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE-NICIAIRKGY--MI 180 (823)
Q Consensus 104 ~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~-~i~v~~~~~y--~l 180 (823)
+-..+.++.....+++.-+..+...|.++-...|.++..+ .|..+|++.+|-.|.+-+...+ -++|+-...+ +.
T Consensus 174 Tgt~v~sL~~~s~VtSlEvs~dG~ilTia~gssV~Fwdak---sf~~lKs~k~P~nV~SASL~P~k~~fVaGged~~~~k 250 (334)
T KOG0278|consen 174 TGTEVQSLEFNSPVTSLEVSQDGRILTIAYGSSVKFWDAK---SFGLLKSYKMPCNVESASLHPKKEFFVAGGEDFKVYK 250 (334)
T ss_pred cCcEEEEEecCCCCcceeeccCCCEEEEecCceeEEeccc---cccceeeccCccccccccccCCCceEEecCcceEEEE
Confidence 8777777777778999888887777788888888776666 5888999999988888777743 3555544333 55
Q ss_pred EEcCCCCeeeccCCCCCCCCE--EEEccCCeEE
Q 003405 181 LNATNGALSEVFPSGRIGPPL--VVSLLSGELL 211 (823)
Q Consensus 181 idl~~~~~~~L~~~~~~~~p~--i~~~~~~EfL 211 (823)
+|..||.-...+..|..+ |+ +...+++|.-
T Consensus 251 fDy~TgeEi~~~nkgh~g-pVhcVrFSPdGE~y 282 (334)
T KOG0278|consen 251 FDYNTGEEIGSYNKGHFG-PVHCVRFSPDGELY 282 (334)
T ss_pred EeccCCceeeecccCCCC-ceEEEEECCCCcee
Confidence 788898877776544333 43 3334677753
No 59
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=96.34 E-value=0.27 Score=51.54 Aligned_cols=138 Identities=9% Similarity=0.077 Sum_probs=81.6
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc---------
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES--------- 96 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~--------- 96 (823)
.....+||++| +.+|+++.-... ..+++. ...+.-...+-..|+|..+++|
T Consensus 17 ~ScFava~~~G-friyn~~P~ke~----------------~~r~~~---~~G~~~veMLfR~N~laLVGGg~~pky~pNk 76 (346)
T KOG2111|consen 17 HSCFAVATDTG-FRIYNCDPFKES----------------ASRQFI---DGGFKIVEMLFRSNYLALVGGGSRPKYPPNK 76 (346)
T ss_pred CceEEEEecCc-eEEEecCchhhh----------------hhhccc---cCchhhhhHhhhhceEEEecCCCCCCCCCce
Confidence 35789999999 588987631110 011111 1112222223355788888763
Q ss_pred EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec---CCeEEEE
Q 003405 97 IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC---GENICIA 173 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~---~~~i~v~ 173 (823)
|-+||=-.-..+..+.....+..+++.. .+|+|+.+.+|.+|.+..+ .+.++.+..-..|++++-. -+.-+++
T Consensus 77 viIWDD~k~~~i~el~f~~~I~~V~l~r--~riVvvl~~~I~VytF~~n--~k~l~~~et~~NPkGlC~~~~~~~k~~La 152 (346)
T KOG2111|consen 77 VIIWDDLKERCIIELSFNSEIKAVKLRR--DRIVVVLENKIYVYTFPDN--PKLLHVIETRSNPKGLCSLCPTSNKSLLA 152 (346)
T ss_pred EEEEecccCcEEEEEEeccceeeEEEcC--CeEEEEecCeEEEEEcCCC--hhheeeeecccCCCceEeecCCCCceEEE
Confidence 7788732333444555666777777664 4799999999999999843 4556666665667766554 2333333
Q ss_pred Ec----CceEEEEcCCCC
Q 003405 174 IR----KGYMILNATNGA 187 (823)
Q Consensus 174 ~~----~~y~lidl~~~~ 187 (823)
+. .+..++|+....
T Consensus 153 fPg~k~GqvQi~dL~~~~ 170 (346)
T KOG2111|consen 153 FPGFKTGQVQIVDLASTK 170 (346)
T ss_pred cCCCccceEEEEEhhhcC
Confidence 33 356677776543
No 60
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=96.25 E-value=0.076 Score=58.02 Aligned_cols=149 Identities=14% Similarity=0.188 Sum_probs=100.6
Q ss_pred CCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 15 CSPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 15 ~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
.|..+.|+.+.. .+|+-||..|.||+|.+.... +...+.+ +-++|+.|+...+..++++
T Consensus 80 ~Pg~v~al~s~n~G~~l~ag~i~g~lYlWelssG~------------------LL~v~~a-HYQ~ITcL~fs~dgs~iiT 140 (476)
T KOG0646|consen 80 LPGPVHALASSNLGYFLLAGTISGNLYLWELSSGI------------------LLNVLSA-HYQSITCLKFSDDGSHIIT 140 (476)
T ss_pred cccceeeeecCCCceEEEeecccCcEEEEEecccc------------------HHHHHHh-hccceeEEEEeCCCcEEEe
Confidence 478899998875 688999999999999876421 2222221 2479999999988888877
Q ss_pred EeC-c-EEEEeCCCCc---------cccccc-CCCCcEEEEeeCCC--ceEE-EEEcCeEEEEEEcCCCceeEeeeecCC
Q 003405 93 LSE-S-IAFHRLPNLE---------TIAVLT-KAKGANVYSWDDRR--GFLC-FARQKRVCIFRHDGGRGFVEVKDFGVP 157 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~---------~~~~i~-~~kg~~~fa~~~~~--~~l~-V~~kkki~l~~~~~~~~f~~~kei~~~ 157 (823)
=+. | |.+|.+.++- |.+... ..-.++.+.++... ++++ +...+.+.+|.+..+ ..+..+.+|
T Consensus 141 gskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS~g---~LLlti~fp 217 (476)
T KOG0646|consen 141 GSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLSLG---VLLLTITFP 217 (476)
T ss_pred cCCCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEeccc---eeeEEEecC
Confidence 764 5 9999875432 111111 11245555555432 3454 447788899999865 346678899
Q ss_pred CCceEEEec--CCeEEEEEcCc-eEEEEcCC
Q 003405 158 DTVKSMSWC--GENICIAIRKG-YMILNATN 185 (823)
Q Consensus 158 ~~~~~l~~~--~~~i~v~~~~~-y~lidl~~ 185 (823)
..+.+++.. +..+++|+..+ +.+.++.+
T Consensus 218 ~si~av~lDpae~~~yiGt~~G~I~~~~~~~ 248 (476)
T KOG0646|consen 218 SSIKAVALDPAERVVYIGTEEGKIFQNLLFK 248 (476)
T ss_pred CcceeEEEcccccEEEecCCcceEEeeehhc
Confidence 999999987 56788998854 55556543
No 61
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=96.19 E-value=0.18 Score=53.75 Aligned_cols=153 Identities=16% Similarity=0.199 Sum_probs=99.5
Q ss_pred CCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCC--CCCCeeEEEEecccCceee
Q 003405 15 CSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGF--SKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 15 ~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~--~k~~I~qI~~~~~~~~Ll~ 92 (823)
+|..|-||-...++|+|--++- |++|++... ++..++... ..+.+..+..-.+...|..
T Consensus 86 fpt~IL~VrmNr~RLvV~Lee~-IyIydI~~M------------------klLhTI~t~~~n~~gl~AlS~n~~n~ylAy 146 (391)
T KOG2110|consen 86 FPTSILAVRMNRKRLVVCLEES-IYIYDIKDM------------------KLLHTIETTPPNPKGLCALSPNNANCYLAY 146 (391)
T ss_pred cCCceEEEEEccceEEEEEccc-EEEEecccc------------------eeehhhhccCCCccceEeeccCCCCceEEe
Confidence 4778889999999999988877 999998743 233332222 1233444444333333333
Q ss_pred EeC---c-EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEEcCe--EEEEEEcCCCceeEeeeecCCCCceEEEe
Q 003405 93 LSE---S-IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFARQKR--VCIFRHDGGRGFVEVKDFGVPDTVKSMSW 165 (823)
Q Consensus 93 l~d---~-l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~kkk--i~l~~~~~~~~f~~~kei~~~~~~~~l~~ 165 (823)
=+. | |.+|++.+++++..+.--+| +.+++++.+...|+-|..|. |.+|....+..+.+.|-=..|-.|-+++|
T Consensus 147 p~s~t~GdV~l~d~~nl~~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~F 226 (391)
T KOG2110|consen 147 PGSTTSGDVVLFDTINLQPVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSF 226 (391)
T ss_pred cCCCCCceEEEEEcccceeeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEE
Confidence 222 3 99999999998887776664 45678888877788886554 56788876544444443233678889999
Q ss_pred cC--CeEEEEEcC-ceEEEEcCCC
Q 003405 166 CG--ENICIAIRK-GYMILNATNG 186 (823)
Q Consensus 166 ~~--~~i~v~~~~-~y~lidl~~~ 186 (823)
.. ..|+++..+ ...++-+.+.
T Consensus 227 s~ds~~L~~sS~TeTVHiFKL~~~ 250 (391)
T KOG2110|consen 227 SPDSQFLAASSNTETVHIFKLEKV 250 (391)
T ss_pred CCCCCeEEEecCCCeEEEEEeccc
Confidence 84 456666554 4677777653
No 62
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.18 E-value=2.2 Score=52.91 Aligned_cols=230 Identities=11% Similarity=0.089 Sum_probs=124.9
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeE
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSL 93 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l 93 (823)
..|+|++.. ++.++.|..||.|.+|+......... ...+. ..... +..+|..+...+.. +.+++.
T Consensus 484 ~~V~~i~fs~dg~~latgg~D~~I~iwd~~~~~~~~~---------~~~~~-~~~~~--~~~~v~~l~~~~~~~~~las~ 551 (793)
T PLN00181 484 NLVCAIGFDRDGEFFATAGVNKKIKIFECESIIKDGR---------DIHYP-VVELA--SRSKLSGICWNSYIKSQVASS 551 (793)
T ss_pred CcEEEEEECCCCCEEEEEeCCCEEEEEECCccccccc---------ccccc-eEEec--ccCceeeEEeccCCCCEEEEE
Confidence 457777654 56899999999999998653211100 00010 11111 23567777776643 445554
Q ss_pred e-Cc-EEEEeCCCCccccccc-CCCCcEEEEeeCC-CceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec--
Q 003405 94 S-ES-IAFHRLPNLETIAVLT-KAKGANVYSWDDR-RGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC-- 166 (823)
Q Consensus 94 ~-d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~-~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~-- 166 (823)
+ |+ |.+|++.+-+.+..+. ....|..+++++. ...++.+ ..+.|.+|....+. .+..+.....+.++.|.
T Consensus 552 ~~Dg~v~lWd~~~~~~~~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~---~~~~~~~~~~v~~v~~~~~ 628 (793)
T PLN00181 552 NFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV---SIGTIKTKANICCVQFPSE 628 (793)
T ss_pred eCCCeEEEEECCCCeEEEEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCc---EEEEEecCCCeEEEEEeCC
Confidence 4 35 9999987655443332 2346788888753 3456666 45678888876432 22333345678888884
Q ss_pred -CCeEEEEEcC-ceEEEEcCCCCe--eeccCCCCCCCCEEEEccCCeEEE-E-eCCeEEEEcCCCccc--cCCce-eecC
Q 003405 167 -GENICIAIRK-GYMILNATNGAL--SEVFPSGRIGPPLVVSLLSGELLL-G-KENIGVFVDQNGKLL--QADRI-CWSE 237 (823)
Q Consensus 167 -~~~i~v~~~~-~y~lidl~~~~~--~~L~~~~~~~~p~i~~~~~~EfLL-~-~~~~gvfv~~~G~~~--~~~~i-~w~~ 237 (823)
|..++.|... ...++|+.++.. ..+..... .-..+.. .++..++ + .|+..-+.|..-..+ ...++ .+.+
T Consensus 629 ~g~~latgs~dg~I~iwD~~~~~~~~~~~~~h~~-~V~~v~f-~~~~~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~g 706 (793)
T PLN00181 629 SGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSK-TVSYVRF-VDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMG 706 (793)
T ss_pred CCCEEEEEeCCCeEEEEECCCCCccceEecCCCC-CEEEEEE-eCCCEEEEEECCCEEEEEeCCCCccccCCcceEEEcC
Confidence 5678888774 568889887542 22222111 1112222 3555555 3 345555555421000 00111 2322
Q ss_pred ---CCcEEEE--eCCEEEEEeC-CeEEEEEcc
Q 003405 238 ---APIAVII--QKPYAIALLP-RRVEVRSLR 263 (823)
Q Consensus 238 ---~P~~v~~--~~PYll~~~~-~~ieV~~l~ 263 (823)
.+..+.+ ..+||++... +.+-|++..
T Consensus 707 h~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~ 738 (793)
T PLN00181 707 HTNVKNFVGLSVSDGYIATGSETNEVFVYHKA 738 (793)
T ss_pred CCCCeeEEEEcCCCCEEEEEeCCCEEEEEECC
Confidence 1223333 3568877764 678888764
No 63
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=96.18 E-value=0.37 Score=53.11 Aligned_cols=211 Identities=13% Similarity=0.141 Sum_probs=124.3
Q ss_pred ccCCCCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCc
Q 003405 12 ISNCSPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQL 89 (823)
Q Consensus 12 ~~~~~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~ 89 (823)
+.++...+.+++.. |..+..|...|.+.+|+... -...+++.+ +..||..+..-|..+.
T Consensus 64 ~srFk~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~------------------r~iLR~~~a-h~apv~~~~f~~~d~t 124 (487)
T KOG0310|consen 64 FSRFKDVVYSVDFRSDGRLLAAGDESGHVKVFDMKS------------------RVILRQLYA-HQAPVHVTKFSPQDNT 124 (487)
T ss_pred HHhhccceeEEEeecCCeEEEccCCcCcEEEecccc------------------HHHHHHHhh-ccCceeEEEecccCCe
Confidence 34555667766655 78899999999999998331 012344443 3678888888887777
Q ss_pred eeeEe-Cc--EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEeeeecCCCCceEE
Q 003405 90 LLSLS-ES--IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSM 163 (823)
Q Consensus 90 Ll~l~-d~--l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~kei~~~~~~~~l 163 (823)
+++-+ |+ +++|++.+-..+..+...+ -+.+-++.+..+.++|. =..+|.+|..+... ..+.|+.-..+|-++
T Consensus 125 ~l~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~--~~v~elnhg~pVe~v 202 (487)
T KOG0310|consen 125 MLVSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLT--SRVVELNHGCPVESV 202 (487)
T ss_pred EEEecCCCceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCC--ceeEEecCCCceeeE
Confidence 66655 44 8899987644222222222 23445555555666664 45789898887421 446678778889999
Q ss_pred EecCC--eEEEEEcCceEEEEcCCCCe--eeccCCCCCCCCEEEEccCCeEEEE--eCC-eEEEEcCCCccccCCceeec
Q 003405 164 SWCGE--NICIAIRKGYMILNATNGAL--SEVFPSGRIGPPLVVSLLSGELLLG--KEN-IGVFVDQNGKLLQADRICWS 236 (823)
Q Consensus 164 ~~~~~--~i~v~~~~~y~lidl~~~~~--~~L~~~~~~~~p~i~~~~~~EfLL~--~~~-~gvfv~~~G~~~~~~~i~w~ 236 (823)
.+.++ .|.-|..+++.+.|+.+|.. ...++..+. --|.....++.-|+. -|. .=+|-..+=+. .+.+.++
T Consensus 203 l~lpsgs~iasAgGn~vkVWDl~~G~qll~~~~~H~Kt-VTcL~l~s~~~rLlS~sLD~~VKVfd~t~~Kv--v~s~~~~ 279 (487)
T KOG0310|consen 203 LALPSGSLIASAGGNSVKVWDLTTGGQLLTSMFNHNKT-VTCLRLASDSTRLLSGSLDRHVKVFDTTNYKV--VHSWKYP 279 (487)
T ss_pred EEcCCCCEEEEcCCCeEEEEEecCCceehhhhhcccce-EEEEEeecCCceEeecccccceEEEEccceEE--EEeeecc
Confidence 88844 55556668999999996642 222222221 122222234445552 232 22332111122 3556666
Q ss_pred CCCcEEEEeC
Q 003405 237 EAPIAVIIQK 246 (823)
Q Consensus 237 ~~P~~v~~~~ 246 (823)
++..+++...
T Consensus 280 ~pvLsiavs~ 289 (487)
T KOG0310|consen 280 GPVLSIAVSP 289 (487)
T ss_pred cceeeEEecC
Confidence 6666666543
No 64
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.16 E-value=1.9 Score=51.16 Aligned_cols=243 Identities=13% Similarity=0.171 Sum_probs=127.1
Q ss_pred cEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
-|+-++.|. --++=|..|-.+..|..++. +.|+.. ++.++ -.+|+.+..-|..+++++-++
T Consensus 208 GVNwaAfhpTlpliVSG~DDRqVKlWrmnet---------------KaWEvD-tcrgH-~nnVssvlfhp~q~lIlSnsE 270 (1202)
T KOG0292|consen 208 GVNWAAFHPTLPLIVSGADDRQVKLWRMNET---------------KAWEVD-TCRGH-YNNVSSVLFHPHQDLILSNSE 270 (1202)
T ss_pred ccceEEecCCcceEEecCCcceeeEEEeccc---------------cceeeh-hhhcc-cCCcceEEecCccceeEecCC
Confidence 455566666 45666777777777776643 345443 33333 468999999999999999997
Q ss_pred -c-EEEEeCCCCccccccc----------CCCCcEEEEeeCCCceE---------EEEEcCeEEEEEEcCCC----ceeE
Q 003405 96 -S-IAFHRLPNLETIAVLT----------KAKGANVYSWDDRRGFL---------CFARQKRVCIFRHDGGR----GFVE 150 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~----------~~kg~~~fa~~~~~~~l---------~V~~kkki~l~~~~~~~----~f~~ 150 (823)
+ +++|++.+-..+.+.. .-...+.||..-+.|.+ |+++.....+| .++.. .|..
T Consensus 271 DksirVwDm~kRt~v~tfrrendRFW~laahP~lNLfAAgHDsGm~VFkleRErpa~~v~~n~LfY-vkd~~i~~~d~~t 349 (1202)
T KOG0292|consen 271 DKSIRVWDMTKRTSVQTFRRENDRFWILAAHPELNLFAAGHDSGMIVFKLERERPAYAVNGNGLFY-VKDRFIRSYDLRT 349 (1202)
T ss_pred CccEEEEecccccceeeeeccCCeEEEEEecCCcceeeeecCCceEEEEEcccCceEEEcCCEEEE-EccceEEeeeccc
Confidence 3 9999996543332211 11123344443333321 11111111111 11000 0111
Q ss_pred eeee---------cCCCCceEEEec--CCeEEEEE---cCceEEEEcCCCCeeecc-CCC-CCCCCEEEEccCCeEEE-E
Q 003405 151 VKDF---------GVPDTVKSMSWC--GENICIAI---RKGYMILNATNGALSEVF-PSG-RIGPPLVVSLLSGELLL-G 213 (823)
Q Consensus 151 ~kei---------~~~~~~~~l~~~--~~~i~v~~---~~~y~lidl~~~~~~~L~-~~~-~~~~p~i~~~~~~EfLL-~ 213 (823)
.++. .+-+++.+|++. .+.+.+.+ ...|.++.+........- +.. ++.---.+++..+.|.+ -
T Consensus 350 ~~d~~v~~lr~~g~~~~~~~smsYNpae~~vlics~~~n~~y~L~~ipk~~~~~~~~~~~~k~tG~~a~fvarNrfavl~ 429 (1202)
T KOG0292|consen 350 QKDTAVASLRRPGTLWQPPRSLSYNPAENAVLICSNLDNGEYELVQIPKDSDGVSDGKDVKKGTGEGALFVARNRFAVLD 429 (1202)
T ss_pred cccceeEeccCCCcccCCcceeeeccccCeEEEEeccCCCeEEEEEecCcccccCCchhhhcCCCCceEEEEecceEEEE
Confidence 1221 112688889887 34555553 256888776543211111 100 00001122333444433 2
Q ss_pred e-CCeEEEEcCCCccccCCceeecCCCcEEEEeC-CEEEEEeCCeEEEEEccCCCceeEEEeeCCccccc
Q 003405 214 K-ENIGVFVDQNGKLLQADRICWSEAPIAVIIQK-PYAIALLPRRVEVRSLRVPYALIQTIVLQNVRHLI 281 (823)
Q Consensus 214 ~-~~~gvfv~~~G~~~~~~~i~w~~~P~~v~~~~-PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~ 281 (823)
+ +..+++-|...+.+ ..+.-+.....+.+.. ..++..++++|.+++++ ....+-++.++..+..+
T Consensus 430 k~~~~v~ik~l~N~vt--kkl~~~~~~~~IF~ag~g~lll~~~~~v~lfdvQ-q~~~~~si~~s~vkyvv 496 (1202)
T KOG0292|consen 430 KSNEQVVIKNLKNKVT--KKLLLPESTDDIFYAGTGNLLLRSPDSVTLFDVQ-QKKKVGSIKVSKVKYVV 496 (1202)
T ss_pred ecCcceEEecccchhh--hcccCcccccceeeccCccEEEEcCCeEEEEEee-cceEEEEEecCceeEEE
Confidence 3 33344444444433 2444444556667664 58888889999999997 46667778777765543
No 65
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=96.14 E-value=0.16 Score=52.97 Aligned_cols=145 Identities=17% Similarity=0.233 Sum_probs=98.0
Q ss_pred cEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
.|+++--- .+.|++++=||.+.+|++..+. +...++ ++.||......+....+.--.|
T Consensus 15 ~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~------------------l~~~~~--~~~plL~c~F~d~~~~~~G~~d 74 (323)
T KOG1036|consen 15 GISSVKFSPSSSDLLVSSWDGSLRLYDVPANS------------------LKLKFK--HGAPLLDCAFADESTIVTGGLD 74 (323)
T ss_pred ceeeEEEcCcCCcEEEEeccCcEEEEeccchh------------------hhhhee--cCCceeeeeccCCceEEEeccC
Confidence 46665433 4799999999999999977542 111222 2679999988886666666667
Q ss_pred c-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEE
Q 003405 96 S-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIA 173 (823)
Q Consensus 96 ~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~ 173 (823)
| |+.+|+.+-....-....+++.+++.....++++-| =.++|.++.-... ...-.+..+..|-+|...|+.|+||
T Consensus 75 g~vr~~Dln~~~~~~igth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~~---~~~~~~d~~kkVy~~~v~g~~LvVg 151 (323)
T KOG1036|consen 75 GQVRRYDLNTGNEDQIGTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRNK---VVVGTFDQGKKVYCMDVSGNRLVVG 151 (323)
T ss_pred ceEEEEEecCCcceeeccCCCceEEEEeeccCCeEEEcccCccEEEEecccc---ccccccccCceEEEEeccCCEEEEe
Confidence 6 999999764433223345677887777666655554 5677766655421 1111133456788999999999998
Q ss_pred Ec-CceEEEEcCC
Q 003405 174 IR-KGYMILNATN 185 (823)
Q Consensus 174 ~~-~~y~lidl~~ 185 (823)
+. +...++|+.+
T Consensus 152 ~~~r~v~iyDLRn 164 (323)
T KOG1036|consen 152 TSDRKVLIYDLRN 164 (323)
T ss_pred ecCceEEEEEccc
Confidence 87 7789999876
No 66
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=96.09 E-value=0.29 Score=54.27 Aligned_cols=261 Identities=13% Similarity=0.150 Sum_probs=146.1
Q ss_pred EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc--EEE
Q 003405 22 VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES--IAF 99 (823)
Q Consensus 22 i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~--l~~ 99 (823)
++.-|-+++-|.-|-+++.|+..+.... .+.|..+..+ ...+|+.+..-+..+.+||+++. .++
T Consensus 175 ~Dp~GaR~~sGs~Dy~v~~wDf~gMdas-----------~~~fr~l~P~---E~h~i~sl~ys~Tg~~iLvvsg~aqakl 240 (641)
T KOG0772|consen 175 VDPSGARFVSGSLDYTVKFWDFQGMDAS-----------MRSFRQLQPC---ETHQINSLQYSVTGDQILVVSGSAQAKL 240 (641)
T ss_pred ecCCCceeeeccccceEEEEeccccccc-----------chhhhccCcc---cccccceeeecCCCCeEEEEecCcceeE
Confidence 4445679999999999999999876542 1122211111 13578889999999999999983 788
Q ss_pred EeCCCCcc---------cccccCCCC----cEEEEeeCCCc--eEEEEEcCeEEEEEEcCCC-ceeEeeeecC---CCCc
Q 003405 100 HRLPNLET---------IAVLTKAKG----ANVYSWDDRRG--FLCFARQKRVCIFRHDGGR-GFVEVKDFGV---PDTV 160 (823)
Q Consensus 100 ~~L~~l~~---------~~~i~~~kg----~~~fa~~~~~~--~l~V~~kkki~l~~~~~~~-~f~~~kei~~---~~~~ 160 (823)
++=+.++. +..+..+|| +++-|+++... +|-.+-...+.|+.+...+ +...+|.... --+|
T Consensus 241 ~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~ 320 (641)
T KOG0772|consen 241 LDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPV 320 (641)
T ss_pred EccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCc
Confidence 87665542 222334454 45566666553 4555677788888876421 2222222111 1378
Q ss_pred eEEEec--CCeEEEEEc-CceEEEEcCCCCeeeccCCCC-----CCCCEEEEccCCeEEEEe--CCeEEEEcCCCccccC
Q 003405 161 KSMSWC--GENICIAIR-KGYMILNATNGALSEVFPSGR-----IGPPLVVSLLSGELLLGK--ENIGVFVDQNGKLLQA 230 (823)
Q Consensus 161 ~~l~~~--~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~-----~~~p~i~~~~~~EfLL~~--~~~gvfv~~~G~~~~~ 230 (823)
++.+|. |..|..|+. .+..+.+..+-.+.+.+-..+ ..--+|....++.+|+.+ |+..=.-|..- ...
T Consensus 321 tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq--~kk 398 (641)
T KOG0772|consen 321 TSCAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQ--FKK 398 (641)
T ss_pred eeeecCCCcchhhhcccCCceeeeecCCcccccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccc--ccc
Confidence 888998 456777776 456667764433333332211 122456666788888854 33332223211 112
Q ss_pred CceeecCCCcEE----EEeCC---EEEEEeC-------CeEEEEEccCCCceeEEEeeCCccccccc----CCeEEEec-
Q 003405 231 DRICWSEAPIAV----IIQKP---YAIALLP-------RRVEVRSLRVPYALIQTIVLQNVRHLIPS----SNAVVVAL- 291 (823)
Q Consensus 231 ~~i~w~~~P~~v----~~~~P---Yll~~~~-------~~ieV~~l~~~~~lvQ~i~l~~~~~l~~~----~~~v~v~s- 291 (823)
+-..|.+-|..+ +++.| -|++-++ ..+.+++-. +...||+|.++++....+. -+.+++.+
T Consensus 399 pL~~~tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~-t~d~v~ki~i~~aSvv~~~WhpkLNQi~~gsg 477 (641)
T KOG0772|consen 399 PLNVRTGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRM-TLDTVYKIDISTASVVRCLWHPKLNQIFAGSG 477 (641)
T ss_pred chhhhcCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEecc-ceeeEEEecCCCceEEEEeecchhhheeeecC
Confidence 334455544322 12222 1222211 236677765 5788999999876543322 24566554
Q ss_pred cceEEEee
Q 003405 292 ENSIFGLF 299 (823)
Q Consensus 292 ~~~I~~l~ 299 (823)
++.++++.
T Consensus 478 dG~~~vyY 485 (641)
T KOG0772|consen 478 DGTAHVYY 485 (641)
T ss_pred CCceEEEE
Confidence 45555544
No 67
>PTZ00421 coronin; Provisional
Probab=96.08 E-value=0.58 Score=54.14 Aligned_cols=154 Identities=12% Similarity=0.121 Sum_probs=88.8
Q ss_pred CCcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
...|.|++.+. +.|+.|+.||.|.+|++.... ....+.+ +..+|..|..-+..+++++
T Consensus 125 ~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~------------------~~~~l~~-h~~~V~sla~spdG~lLat 185 (493)
T PTZ00421 125 TKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGK------------------AVEVIKC-HSDQITSLEWNLDGSLLCT 185 (493)
T ss_pred CCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCe------------------EEEEEcC-CCCceEEEEEECCCCEEEE
Confidence 35788888773 478899999999999986421 1223333 3568999999998888887
Q ss_pred EeC-c-EEEEeCCCCcccccccCCCCc--EEEEeeCCCceEE-EEE----cCeEEEEEEcCCCceeEeeeecCCCCceEE
Q 003405 93 LSE-S-IAFHRLPNLETIAVLTKAKGA--NVYSWDDRRGFLC-FAR----QKRVCIFRHDGGRGFVEVKDFGVPDTVKSM 163 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~~~~~i~~~kg~--~~fa~~~~~~~l~-V~~----kkki~l~~~~~~~~f~~~kei~~~~~~~~l 163 (823)
.+. + |++|++.+-+.+..+....+. .......+.+.++ ++. .+.|.+|.......-....++.....+...
T Consensus 186 gs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~~~~~ 265 (493)
T PTZ00421 186 TSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIP 265 (493)
T ss_pred ecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCceEEE
Confidence 775 5 999999776554443222221 1223333334443 332 366888876532111111122111222223
Q ss_pred Eec--CCeEEEEEc--CceEEEEcCCCCe
Q 003405 164 SWC--GENICIAIR--KGYMILNATNGAL 188 (823)
Q Consensus 164 ~~~--~~~i~v~~~--~~y~lidl~~~~~ 188 (823)
.|. ++.+++|.+ ....++|+.++..
T Consensus 266 ~~d~d~~~L~lggkgDg~Iriwdl~~~~~ 294 (493)
T PTZ00421 266 FFDEDTNLLYIGSKGEGNIRCFELMNERL 294 (493)
T ss_pred EEcCCCCEEEEEEeCCCeEEEEEeeCCce
Confidence 343 456777653 4567788876643
No 68
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=96.03 E-value=0.29 Score=53.57 Aligned_cols=182 Identities=15% Similarity=0.219 Sum_probs=114.2
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|+|+.-|. .-|.++.-||.+.+|.+++..+. .+++. .+.+.||......|.+.-.++.+
T Consensus 214 ~~I~sv~FHp~~plllvaG~d~~lrifqvDGk~N~----------------~lqS~-~l~~fPi~~a~f~p~G~~~i~~s 276 (514)
T KOG2055|consen 214 GGITSVQFHPTAPLLLVAGLDGTLRIFQVDGKVNP----------------KLQSI-HLEKFPIQKAEFAPNGHSVIFTS 276 (514)
T ss_pred CCceEEEecCCCceEEEecCCCcEEEEEecCccCh----------------hheee-eeccCccceeeecCCCceEEEec
Confidence 4799999887 47899999999999999876542 12221 24577999999999877455555
Q ss_pred C-c--EEEEeCCC-----CcccccccCCCCcEEEEeeCCCceEEEEEcC-eEEEEEEcCCCceeEeeeecCCCCceEEEe
Q 003405 95 E-S--IAFHRLPN-----LETIAVLTKAKGANVYSWDDRRGFLCFARQK-RVCIFRHDGGRGFVEVKDFGVPDTVKSMSW 165 (823)
Q Consensus 95 d-~--l~~~~L~~-----l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kk-ki~l~~~~~~~~f~~~kei~~~~~~~~l~~ 165 (823)
. . ++.|+|.+ +.+....+ -|.+..|.|+.+..+|+++.+. -|.|..-..+ ++ +..+.+++.++.++|
T Consensus 277 ~rrky~ysyDle~ak~~k~~~~~g~e-~~~~e~FeVShd~~fia~~G~~G~I~lLhakT~-el--i~s~KieG~v~~~~f 352 (514)
T KOG2055|consen 277 GRRKYLYSYDLETAKVTKLKPPYGVE-EKSMERFEVSHDSNFIAIAGNNGHIHLLHAKTK-EL--ITSFKIEGVVSDFTF 352 (514)
T ss_pred ccceEEEEeeccccccccccCCCCcc-cchhheeEecCCCCeEEEcccCceEEeehhhhh-hh--hheeeeccEEeeEEE
Confidence 5 2 88999854 22333222 4577889998887777776444 4545444422 22 233556889999999
Q ss_pred cCC--eEEEEE-cCceEEEEcCCCCeeeccCC-CCCCCCEEEEccCCeEEEEeCCeEE
Q 003405 166 CGE--NICIAI-RKGYMILNATNGALSEVFPS-GRIGPPLVVSLLSGELLLGKENIGV 219 (823)
Q Consensus 166 ~~~--~i~v~~-~~~y~lidl~~~~~~~L~~~-~~~~~p~i~~~~~~EfLL~~~~~gv 219 (823)
..+ .|++.. ..+..+.|+.+..+...|.- |...--.+|...++.+|-+-.+.|+
T Consensus 353 sSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~v~gts~~~S~ng~ylA~GS~~Gi 410 (514)
T KOG2055|consen 353 SSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGSVHGTSLCISLNGSYLATGSDSGI 410 (514)
T ss_pred ecCCcEEEEEcCCceEEEEecCCcceEEEEeecCccceeeeeecCCCceEEeccCcce
Confidence 843 333333 46889999998766555432 2211122333344555443334444
No 69
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=96.00 E-value=0.16 Score=55.83 Aligned_cols=149 Identities=11% Similarity=0.197 Sum_probs=96.2
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
..+|+|++-.. .+|.-|...|.|.+..+..... ...|+.-+...|..+..-+....|++.
T Consensus 121 ~stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~~~------------------tt~f~~~sgqsvRll~ys~skr~lL~~ 182 (673)
T KOG4378|consen 121 QSTVTYVDYNNTDEYIASVSDGGDIIIHGTKTKQK------------------TTTFTIDSGQSVRLLRYSPSKRFLLSI 182 (673)
T ss_pred cceeEEEEecCCcceeEEeccCCcEEEEecccCcc------------------ccceecCCCCeEEEeecccccceeeEe
Confidence 46899998654 5888899999999887553221 012222224456667776665555544
Q ss_pred -eC-c-EEEEeCCCCcccccccC--CCCcEEEEeeCCCce--EEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec
Q 003405 94 -SE-S-IAFHRLPNLETIAVLTK--AKGANVYSWDDRRGF--LCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 94 -~d-~-l~~~~L~~l~~~~~i~~--~kg~~~fa~~~~~~~--l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~ 166 (823)
+| | |.+|+.....|+..-.. .-.|.-+|.++.... +-||..+||.+|..... +..+-+....|-.+++|.
T Consensus 183 asd~G~VtlwDv~g~sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~---~s~~~l~y~~Plstvaf~ 259 (673)
T KOG4378|consen 183 ASDKGAVTLWDVQGMSPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQ---ASTDRLTYSHPLSTVAFS 259 (673)
T ss_pred eccCCeEEEEeccCCCcccchhhhccCCcCcceecCCccceEEEecccceEEEeecccc---cccceeeecCCcceeeec
Confidence 56 5 99999987666543211 123444555554443 45678999999988732 223345566777888887
Q ss_pred --CCeEEEEEcC-ceEEEEcCC
Q 003405 167 --GENICIAIRK-GYMILNATN 185 (823)
Q Consensus 167 --~~~i~v~~~~-~y~lidl~~ 185 (823)
|..+|.|+.+ +.+.||+..
T Consensus 260 ~~G~~L~aG~s~G~~i~YD~R~ 281 (673)
T KOG4378|consen 260 ECGTYLCAGNSKGELIAYDMRS 281 (673)
T ss_pred CCceEEEeecCCceEEEEeccc
Confidence 6689999885 467788875
No 70
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=95.97 E-value=2.1 Score=49.32 Aligned_cols=236 Identities=15% Similarity=0.175 Sum_probs=141.0
Q ss_pred CCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
...|+|+... |+.+.-++.+|.+.++....... .+.+...+ +...|..+...+....++..
T Consensus 159 ~~sv~~~~fs~~g~~l~~~~~~~~i~~~~~~~~~~----------------~~~~~l~~-h~~~v~~~~fs~d~~~l~s~ 221 (456)
T KOG0266|consen 159 CPSVTCVDFSPDGRALAAASSDGLIRIWKLEGIKS----------------NLLRELSG-HTRGVSDVAFSPDGSYLLSG 221 (456)
T ss_pred cCceEEEEEcCCCCeEEEccCCCcEEEeecccccc----------------hhhccccc-cccceeeeEECCCCcEEEEe
Confidence 3567775544 46788888999999998632210 11222212 35689999999999988888
Q ss_pred eCc--EEEEeCCCC-cccccc-cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeec-CCCCceEEEec-
Q 003405 94 SES--IAFHRLPNL-ETIAVL-TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWC- 166 (823)
Q Consensus 94 ~d~--l~~~~L~~l-~~~~~i-~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~- 166 (823)
+++ +++|++..- ..+..+ .-...++++++++....++.| ..+.|.|+....+. ..+-+. -.+.++++++.
T Consensus 222 s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~---~~~~l~~hs~~is~~~f~~ 298 (456)
T KOG0266|consen 222 SDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGE---CVRKLKGHSDGISGLAFSP 298 (456)
T ss_pred cCCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCe---EEEeeeccCCceEEEEECC
Confidence 873 999999322 222222 233467788888776455555 66778888877432 222232 24688888887
Q ss_pred -CCeEEEEEc-CceEEEEcCCCCee--eccCCCCCCCCE--EEEccCCeEEE-EeCC-eEEEEcCCCccccCCceeecCC
Q 003405 167 -GENICIAIR-KGYMILNATNGALS--EVFPSGRIGPPL--VVSLLSGELLL-GKEN-IGVFVDQNGKLLQADRICWSEA 238 (823)
Q Consensus 167 -~~~i~v~~~-~~y~lidl~~~~~~--~L~~~~~~~~p~--i~~~~~~EfLL-~~~~-~gvfv~~~G~~~~~~~i~w~~~ 238 (823)
|+.++.|.. ....+.|+.++... ..+.......|. +..-+++.+++ +..+ ..-+.+.... .....|...
T Consensus 299 d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~---~~~~~~~~~ 375 (456)
T KOG0266|consen 299 DGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRSG---KSVGTYTGH 375 (456)
T ss_pred CCCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccCC---cceeeeccc
Confidence 455666654 55788999999843 333332221133 33347788877 4444 6666665422 122333332
Q ss_pred CcE------E--EEeCCEEEEEeC-CeEEEEEccCCCceeEEEeeC
Q 003405 239 PIA------V--IIQKPYAIALLP-RRVEVRSLRVPYALIQTIVLQ 275 (823)
Q Consensus 239 P~~------v--~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~ 275 (823)
+.. . .-...|++.-.. ..|.++++. ++..+|.+...
T Consensus 376 ~~~~~~~~~~~~~~~~~~i~sg~~d~~v~~~~~~-s~~~~~~l~~h 420 (456)
T KOG0266|consen 376 SNLVRCIFSPTLSTGGKLIYSGSEDGSVYVWDSS-SGGILQRLEGH 420 (456)
T ss_pred CCcceeEecccccCCCCeEEEEeCCceEEEEeCC-ccchhhhhcCC
Confidence 221 1 112456666654 578899886 47777777654
No 71
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=95.96 E-value=1.8 Score=47.72 Aligned_cols=256 Identities=14% Similarity=0.163 Sum_probs=137.1
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|||++.. |+.|..|..+|.+.+|+..+.- ..++ +.+|.||..|+--...+.+++-+
T Consensus 236 kdVT~L~Wn~~G~~LatG~~~G~~riw~~~G~l-------------------~~tl-~~HkgPI~slKWnk~G~yilS~~ 295 (524)
T KOG0273|consen 236 KDVTSLDWNNDGTLLATGSEDGEARIWNKDGNL-------------------ISTL-GQHKGPIFSLKWNKKGTYILSGG 295 (524)
T ss_pred CCcceEEecCCCCeEEEeecCcEEEEEecCchh-------------------hhhh-hccCCceEEEEEcCCCCEEEecc
Confidence 579999988 6899999999999999866532 2222 35689999999999888888765
Q ss_pred -Cc-EEEEeCCCCcccccccCCCCc-EEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeec-CCCCceEEEecCCe-
Q 003405 95 -ES-IAFHRLPNLETIAVLTKAKGA-NVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCGEN- 169 (823)
Q Consensus 95 -d~-l~~~~L~~l~~~~~i~~~kg~-~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~~~- 169 (823)
|+ ..+|+--+-....+.+..... ..+-.-.+..+.+-..+..|.+|+++.++. ++.+. -..+|.+|.|....
T Consensus 296 vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P---~~t~~GH~g~V~alk~n~tg~ 372 (524)
T KOG0273|consen 296 VDGTTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEFATSSTDGCIHVCKVGEDRP---VKTFIGHHGEVNALKWNPTGS 372 (524)
T ss_pred CCccEEEEeccCceEEEeeeeccCCccceEEecCceEeecCCCceEEEEEecCCCc---ceeeecccCceEEEEECCCCc
Confidence 55 788886332221111111110 111111122234445777899999986542 23222 24688899998443
Q ss_pred EEEEEc--CceEEEEcCCCCe-eeccCCCCC--------CCCEEEEccCCeEEEE--eCCeEEEEcC-CCccccCCceee
Q 003405 170 ICIAIR--KGYMILNATNGAL-SEVFPSGRI--------GPPLVVSLLSGELLLG--KENIGVFVDQ-NGKLLQADRICW 235 (823)
Q Consensus 170 i~v~~~--~~y~lidl~~~~~-~~L~~~~~~--------~~p~i~~~~~~EfLL~--~~~~gvfv~~-~G~~~~~~~i~w 235 (823)
+...+. ..-.|.++.++.. -.+...++. .-|..-....+-.+++ +|+..-..|. .|.++ .++.=
T Consensus 373 LLaS~SdD~TlkiWs~~~~~~~~~l~~Hskei~t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~gv~i--~~f~k 450 (524)
T KOG0273|consen 373 LLASCSDDGTLKIWSMGQSNSVHDLQAHSKEIYTIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVESGVPI--HTLMK 450 (524)
T ss_pred eEEEecCCCeeEeeecCCCcchhhhhhhccceeeEeecCCCCccCCCcCCceEEEeecCCeEEEEEccCCcee--Eeecc
Confidence 333332 3456666554321 111111100 0121111223344442 3444444442 34432 33322
Q ss_pred cCCCc-EEEEe--CCEEEEEe-CCeEEEEEccCCCceeEEEeeCCc-cccc--ccCCeEEE-eccceEEEe
Q 003405 236 SEAPI-AVIIQ--KPYAIALL-PRRVEVRSLRVPYALIQTIVLQNV-RHLI--PSSNAVVV-ALENSIFGL 298 (823)
Q Consensus 236 ~~~P~-~v~~~--~PYll~~~-~~~ieV~~l~~~~~lvQ~i~l~~~-~~l~--~~~~~v~v-~s~~~I~~l 298 (823)
...|. ++++. .-|+..-. ++.|.|.+.+ ++.++|+..-.++ ..++ ..++.+-+ ++++.+.++
T Consensus 451 H~~pVysvafS~~g~ylAsGs~dg~V~iws~~-~~~l~~s~~~~~~Ifel~Wn~~G~kl~~~~sd~~vcvl 520 (524)
T KOG0273|consen 451 HQEPVYSVAFSPNGRYLASGSLDGCVHIWSTK-TGKLVKSYQGTGGIFELCWNAAGDKLGACASDGSVCVL 520 (524)
T ss_pred CCCceEEEEecCCCcEEEecCCCCeeEecccc-chheeEeecCCCeEEEEEEcCCCCEEEEEecCCCceEE
Confidence 33343 34444 34555444 3678999987 6889988765554 2222 23333332 456665554
No 72
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=95.90 E-value=0.02 Score=62.26 Aligned_cols=113 Identities=18% Similarity=0.279 Sum_probs=80.0
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES- 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~- 96 (823)
+-.|+..++..+.+|++.|.|-+|+.+..-.+.. .+.++.+.+. ...|+.|+.-+..++|.+++..
T Consensus 391 ts~~~S~ng~ylA~GS~~GiVNIYd~~s~~~s~~------------PkPik~~dNL-tt~Itsl~Fn~d~qiLAiaS~~~ 457 (514)
T KOG2055|consen 391 TSLCISLNGSYLATGSDSGIVNIYDGNSCFASTN------------PKPIKTVDNL-TTAITSLQFNHDAQILAIASRVK 457 (514)
T ss_pred eeeeecCCCceEEeccCcceEEEeccchhhccCC------------CCchhhhhhh-heeeeeeeeCcchhhhhhhhhcc
Confidence 3446668889999999999999998554322211 1122333344 4589999999999999999862
Q ss_pred ---EEEEeCCCCcccc----cccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEc
Q 003405 97 ---IAFHRLPNLETIA----VLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHD 143 (823)
Q Consensus 97 ---l~~~~L~~l~~~~----~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~ 143 (823)
+++..+|+...-. +-...-.+++++.+++.|.+||| -++++.+|++.
T Consensus 458 knalrLVHvPS~TVFsNfP~~n~~vg~vtc~aFSP~sG~lAvGNe~grv~l~kL~ 512 (514)
T KOG2055|consen 458 KNALRLVHVPSCTVFSNFPTSNTKVGHVTCMAFSPNSGYLAVGNEAGRVHLFKLH 512 (514)
T ss_pred ccceEEEeccceeeeccCCCCCCcccceEEEEecCCCceEEeecCCCceeeEeec
Confidence 7777777643211 12334468899999999999999 56788898875
No 73
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=95.87 E-value=0.26 Score=53.96 Aligned_cols=163 Identities=12% Similarity=0.108 Sum_probs=103.5
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc--Cceee
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR--QLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~--~~Ll~ 92 (823)
..|+|+-.-+ ..++-|.+||.|++|.+..-.+... .........+. .+.-+|+.+.+-... .++++
T Consensus 124 Q~ITcL~fs~dgs~iiTgskDg~V~vW~l~~lv~a~~---------~~~~~p~~~f~-~HtlsITDl~ig~Gg~~~rl~T 193 (476)
T KOG0646|consen 124 QSITCLKFSDDGSHIITGSKDGAVLVWLLTDLVSADN---------DHSVKPLHIFS-DHTLSITDLQIGSGGTNARLYT 193 (476)
T ss_pred cceeEEEEeCCCcEEEecCCCccEEEEEEEeeccccc---------CCCccceeeec-cCcceeEEEEecCCCccceEEE
Confidence 3699977665 5899999999999998775433211 11122223333 246799999988763 56788
Q ss_pred EeC--cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEc-CeEEEE---EEcC-CC-----ce--e--EeeeecC
Q 003405 93 LSE--SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQ-KRVCIF---RHDG-GR-----GF--V--EVKDFGV 156 (823)
Q Consensus 93 l~d--~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~k-kki~l~---~~~~-~~-----~f--~--~~kei~~ 156 (823)
.+. .+++|++..-..+.++..-..++++++++...++.||.. .+|.+. .+.+ .+ .+ . ..+-+.-
T Consensus 194 aS~D~t~k~wdlS~g~LLlti~fp~si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~G 273 (476)
T KOG0646|consen 194 ASEDRTIKLWDLSLGVLLLTITFPSSIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVG 273 (476)
T ss_pred ecCCceEEEEEeccceeeEEEecCCcceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeecc
Confidence 776 399999977665656555667889999988777888844 455433 3331 10 00 0 0111111
Q ss_pred -C--CCceEEEec--CCeEEEEEc-CceEEEEcCCCCee
Q 003405 157 -P--DTVKSMSWC--GENICIAIR-KGYMILNATNGALS 189 (823)
Q Consensus 157 -~--~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~~~ 189 (823)
. ..|+|++.. |+.+.-|.. ..|++-|+.+.+..
T Consensus 274 h~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~i 312 (476)
T KOG0646|consen 274 HENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQCI 312 (476)
T ss_pred ccCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHHH
Confidence 1 378888776 666777766 56889998876543
No 74
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=95.87 E-value=0.61 Score=52.42 Aligned_cols=199 Identities=21% Similarity=0.233 Sum_probs=115.0
Q ss_pred EEEEeCCCCcccccccCCCCcEEEEeeCCC--ceEEEE------EcCeEEEEEEcCCCcee--EeeeecCCCCceEEEec
Q 003405 97 IAFHRLPNLETIAVLTKAKGANVYSWDDRR--GFLCFA------RQKRVCIFRHDGGRGFV--EVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~--~~l~V~------~kkki~l~~~~~~~~f~--~~kei~~~~~~~~l~~~ 166 (823)
+++|++.+++.+...-...+++.|++++.. ..+||- ....+.||.+....... ..|.+.-.|.+ .|.|.
T Consensus 148 v~f~~~~~f~~~~~kl~~~~i~~f~lSpgp~~~~vAvyvPe~kGaPa~vri~~~~~~~~~~~~a~ksFFkadkv-qm~WN 226 (566)
T KOG2315|consen 148 VQFYDLGSFKTIQHKLSVSGITMLSLSPGPEPPFVAVYVPEKKGAPASVRIYKYPEEGQHQPVANKSFFKADKV-QMKWN 226 (566)
T ss_pred EEEEecCCccceeeeeeccceeeEEecCCCCCceEEEEccCCCCCCcEEEEeccccccccchhhhcccccccee-EEEec
Confidence 999999988766554456899999998764 457764 22458899887322221 12334444544 56787
Q ss_pred --CCe-EEEEEc---C---ce------EEEEcCCCC-eeeccCCCCCCCCEEEEccC-CeEEEEeC---CeEEEEcCCCc
Q 003405 167 --GEN-ICIAIR---K---GY------MILNATNGA-LSEVFPSGRIGPPLVVSLLS-GELLLGKE---NIGVFVDQNGK 226 (823)
Q Consensus 167 --~~~-i~v~~~---~---~y------~lidl~~~~-~~~L~~~~~~~~p~i~~~~~-~EfLL~~~---~~gvfv~~~G~ 226 (823)
|+. +|+++. + +| +++++++.+ ..+|...| .-..+++.++ .||-+|++ ...-|+|..|+
T Consensus 227 ~~gt~LLvLastdVDktn~SYYGEq~Lyll~t~g~s~~V~L~k~G--PVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~~ 304 (566)
T KOG2315|consen 227 KLGTALLVLASTDVDKTNASYYGEQTLYLLATQGESVSVPLLKEG--PVHDVTWSPSGREFAVVYGFMPAKVTIFNLRGK 304 (566)
T ss_pred cCCceEEEEEEEeecCCCccccccceEEEEEecCceEEEecCCCC--CceEEEECCCCCEEEEEEecccceEEEEcCCCC
Confidence 554 555543 1 22 355665222 22332222 1223566654 59999876 44667788887
Q ss_pred cccCCceeecCCCcEEEEeCC--EEEEEe-----CCeEEEEEccCCCceeEEEeeCCccccc--ccCCeEEEecc-----
Q 003405 227 LLQADRICWSEAPIAVIIQKP--YAIALL-----PRRVEVRSLRVPYALIQTIVLQNVRHLI--PSSNAVVVALE----- 292 (823)
Q Consensus 227 ~~~~~~i~w~~~P~~v~~~~P--Yll~~~-----~~~ieV~~l~~~~~lvQ~i~l~~~~~l~--~~~~~v~v~s~----- 292 (823)
+. ..+...|..-+++.| .++++. ++.+||.++.| ..++-++...+....- ++|.-|+.||.
T Consensus 305 ~v----~df~egpRN~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n-~K~i~~~~a~~tt~~eW~PdGe~flTATTaPRlr 379 (566)
T KOG2315|consen 305 PV----FDFPEGPRNTAFFNPHGNIILLAGFGNLPGDMEVWDVPN-RKLIAKFKAANTTVFEWSPDGEYFLTATTAPRLR 379 (566)
T ss_pred Ee----EeCCCCCccceEECCCCCEEEEeecCCCCCceEEEeccc-hhhccccccCCceEEEEcCCCcEEEEEeccccEE
Confidence 53 556677777777665 233333 36899999985 6666666555433221 23333444432
Q ss_pred ----ceEEEeeccCh
Q 003405 293 ----NSIFGLFPVPL 303 (823)
Q Consensus 293 ----~~I~~l~~~~~ 303 (823)
-.||...-..+
T Consensus 380 vdNg~KiwhytG~~l 394 (566)
T KOG2315|consen 380 VDNGIKIWHYTGSLL 394 (566)
T ss_pred ecCCeEEEEecCcee
Confidence 25776665543
No 75
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling. The precise function of this domain has not been characterised.
Probab=95.81 E-value=0.016 Score=52.17 Aligned_cols=65 Identities=20% Similarity=0.134 Sum_probs=58.4
Q ss_pred HHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHH
Q 003405 671 NEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSL 750 (823)
Q Consensus 671 ~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~i 750 (823)
|.|...|+.. ....+.+||+..+.-|++.+-..+..++.+.|.+.+|..-|+|++||++
T Consensus 3 TaLlk~Yl~~---------------------~~~~l~~llr~~N~C~~~~~e~~L~~~~~~~eL~~lY~~kg~h~~AL~l 61 (108)
T PF10366_consen 3 TALLKCYLET---------------------NPSLLGPLLRLPNYCDLEEVEEVLKEHGKYQELVDLYQGKGLHRKALEL 61 (108)
T ss_pred HHHHHHHHHh---------------------CHHHHHHHHccCCcCCHHHHHHHHHHcCCHHHHHHHHHccCccHHHHHH
Confidence 7789999976 2578999999999999999999999999999999999999999999999
Q ss_pred HHHHhC
Q 003405 751 YVHKVF 756 (823)
Q Consensus 751 lv~~L~ 756 (823)
+..--.
T Consensus 62 l~~l~~ 67 (108)
T PF10366_consen 62 LKKLAD 67 (108)
T ss_pred HHHHhc
Confidence 986444
No 76
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=95.79 E-value=2.4 Score=45.28 Aligned_cols=170 Identities=14% Similarity=0.160 Sum_probs=109.6
Q ss_pred CCcccccccccCCCCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeE
Q 003405 3 HNAFDSLELISNCSPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILS 80 (823)
Q Consensus 3 ~~af~~~~l~~~~~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~q 80 (823)
|-+|....+++.-.+-|.|++.. +.++.-|..|+++.+|++... ++.-+..+. ...|..
T Consensus 138 HapwKl~rVi~gHlgWVr~vavdP~n~wf~tgs~DrtikIwDlatg------------------~LkltltGh-i~~vr~ 198 (460)
T KOG0285|consen 138 HAPWKLYRVISGHLGWVRSVAVDPGNEWFATGSADRTIKIWDLATG------------------QLKLTLTGH-IETVRG 198 (460)
T ss_pred cCcceehhhhhhccceEEEEeeCCCceeEEecCCCceeEEEEcccC------------------eEEEeecch-hheeee
Confidence 55677777777777789997765 568999999999999997642 122233332 567888
Q ss_pred EEEecccCceeeEeCc--EEEEeCCCCccccc-ccCCCCcEEEEeeCCCceEEEEEcC-eEEEEEEcCCCceeEeeeecC
Q 003405 81 MEVLASRQLLLSLSES--IAFHRLPNLETIAV-LTKAKGANVYSWDDRRGFLCFARQK-RVCIFRHDGGRGFVEVKDFGV 156 (823)
Q Consensus 81 I~~~~~~~~Ll~l~d~--l~~~~L~~l~~~~~-i~~~kg~~~fa~~~~~~~l~V~~kk-ki~l~~~~~~~~f~~~kei~~ 156 (823)
+.+-+..-.|++++++ |+.|+|..-+.+.. -....+|.+.++.+....|+-+.+. .+.++.++-.... ..+
T Consensus 199 vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V-----~~l 273 (460)
T KOG0285|consen 199 VAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASV-----HVL 273 (460)
T ss_pred eeecccCceEEEecCCCeeEEEechhhhhHHHhccccceeEEEeccccceeEEecCCcceEEEeeecccceE-----EEe
Confidence 8899999999999984 99999965544421 2344567777777665566665443 3455555421111 112
Q ss_pred C---CCceEEEec--CCeEEEEEc-CceEEEEcCCCCe-eeccCCCC
Q 003405 157 P---DTVKSMSWC--GENICIAIR-KGYMILNATNGAL-SEVFPSGR 196 (823)
Q Consensus 157 ~---~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~~-~~L~~~~~ 196 (823)
+ .++.++.+. +..|+-|+. ....+.|+..|+. ..++...+
T Consensus 274 ~GH~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt~~tlt~hkk 320 (460)
T KOG0285|consen 274 SGHTNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKTMITLTHHKK 320 (460)
T ss_pred cCCCCcceeEEeecCCCceEEecCCceEEEeeeccCceeEeeecccc
Confidence 2 355666555 455655554 6788999988864 44544433
No 77
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=95.73 E-value=0.16 Score=59.05 Aligned_cols=149 Identities=17% Similarity=0.238 Sum_probs=96.0
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
...|.|+...++.++-|+.||.|.+|+.. ..++.+..++ +..+|..+.+-.+ +.++.-+
T Consensus 331 ~~~V~~v~~~~~~lvsgs~d~~v~VW~~~------------------~~~cl~sl~g-H~~~V~sl~~~~~-~~~~Sgs~ 390 (537)
T KOG0274|consen 331 TGPVNCVQLDEPLLVSGSYDGTVKVWDPR------------------TGKCLKSLSG-HTGRVYSLIVDSE-NRLLSGSL 390 (537)
T ss_pred cccEEEEEecCCEEEEEecCceEEEEEhh------------------hceeeeeecC-CcceEEEEEecCc-ceEEeeee
Confidence 45799999999999999999999999865 3455666666 4678999866554 6777665
Q ss_pred Cc-EEEEeCCCC-cccccccCCCCcE-EEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCC--CCceEEEecCC-
Q 003405 95 ES-IAFHRLPNL-ETIAVLTKAKGAN-VYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVP--DTVKSMSWCGE- 168 (823)
Q Consensus 95 d~-l~~~~L~~l-~~~~~i~~~kg~~-~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~--~~~~~l~~~~~- 168 (823)
|+ |++|++.+. +-+..+..-.++. .+.+.. .-.+.-...+.|.++....+. ..+.+..+ ..+.++++...
T Consensus 391 D~~IkvWdl~~~~~c~~tl~~h~~~v~~l~~~~-~~Lvs~~aD~~Ik~WD~~~~~---~~~~~~~~~~~~v~~l~~~~~~ 466 (537)
T KOG0274|consen 391 DTTIKVWDLRTKRKCIHTLQGHTSLVSSLLLRD-NFLVSSSADGTIKLWDAEEGE---CLRTLEGRHVGGVSALALGKEE 466 (537)
T ss_pred ccceEeecCCchhhhhhhhcCCccccccccccc-ceeEeccccccEEEeecccCc---eeeeeccCCcccEEEeecCcce
Confidence 44 999999887 5444433322222 111110 111223366778777655332 22333332 56777777634
Q ss_pred eEEEEEcCceEEEEcCCCCe
Q 003405 169 NICIAIRKGYMILNATNGAL 188 (823)
Q Consensus 169 ~i~v~~~~~y~lidl~~~~~ 188 (823)
.+|.+....+.+.|+.+++.
T Consensus 467 il~s~~~~~~~l~dl~~~~~ 486 (537)
T KOG0274|consen 467 ILCSSDDGSVKLWDLRSGTL 486 (537)
T ss_pred EEEEecCCeeEEEecccCch
Confidence 56666668899999998753
No 78
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.68 E-value=0.32 Score=56.26 Aligned_cols=155 Identities=17% Similarity=0.199 Sum_probs=99.6
Q ss_pred EEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc-E
Q 003405 20 DAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES-I 97 (823)
Q Consensus 20 ~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~-l 97 (823)
+|+..+++++++|+.+|.|.+|++-... +..+.++ +...|=.|.+.|...-.++-+ |. |
T Consensus 418 ~~Fvpgd~~Iv~G~k~Gel~vfdlaS~~------------------l~Eti~A-HdgaIWsi~~~pD~~g~vT~saDktV 478 (888)
T KOG0306|consen 418 SKFVPGDRYIVLGTKNGELQVFDLASAS------------------LVETIRA-HDGAIWSISLSPDNKGFVTGSADKTV 478 (888)
T ss_pred EEecCCCceEEEeccCCceEEEEeehhh------------------hhhhhhc-cccceeeeeecCCCCceEEecCCcEE
Confidence 3466778999999999999999976432 2222333 357899999999887777665 43 9
Q ss_pred EEEeCCCC--cc-----------cccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEE
Q 003405 98 AFHRLPNL--ET-----------IAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSM 163 (823)
Q Consensus 98 ~~~~L~~l--~~-----------~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l 163 (823)
++|+..-. .+ ..++.-...+.++.++++...+||+ ...++.+|-.+.=+.|..+.-..+ |+.||
T Consensus 479 kfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKFflsLYGHkL--PV~sm 556 (888)
T KOG0306|consen 479 KFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKFFLSLYGHKL--PVLSM 556 (888)
T ss_pred EEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEEEecceeeeeeeccccc--ceeEE
Confidence 99985210 11 1123333467778888888889998 889999999983222233333334 57777
Q ss_pred EecC-CeEEEEEc--CceEEEEcCCCC-eeeccCCC
Q 003405 164 SWCG-ENICIAIR--KGYMILNATNGA-LSEVFPSG 195 (823)
Q Consensus 164 ~~~~-~~i~v~~~--~~y~lidl~~~~-~~~L~~~~ 195 (823)
...- ..+|+... +...+.=++=|. ..++|...
T Consensus 557 DIS~DSklivTgSADKnVKiWGLdFGDCHKS~fAHd 592 (888)
T KOG0306|consen 557 DISPDSKLIVTGSADKNVKIWGLDFGDCHKSFFAHD 592 (888)
T ss_pred eccCCcCeEEeccCCCceEEeccccchhhhhhhccc
Confidence 7763 34555443 566666665553 35566553
No 79
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=95.59 E-value=3.2 Score=44.60 Aligned_cols=178 Identities=19% Similarity=0.213 Sum_probs=108.9
Q ss_pred cEEEEeCC----CCcccccccCCCCcEEEEeeCCCceEEEEE----cCeEEEEEEcCC-CceeEeeeecCCC-CceEEEe
Q 003405 96 SIAFHRLP----NLETIAVLTKAKGANVYSWDDRRGFLCFAR----QKRVCIFRHDGG-RGFVEVKDFGVPD-TVKSMSW 165 (823)
Q Consensus 96 ~l~~~~L~----~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~----kkki~l~~~~~~-~~f~~~kei~~~~-~~~~l~~ 165 (823)
+|.+|++. ++.....+...-+.+.++++++..++.++. ..++.-|+|+.+ ..+..+.+..+++ +|+.++.
T Consensus 17 gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsv 96 (346)
T COG2706 17 GIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSV 96 (346)
T ss_pred ceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEE
Confidence 57788776 344444556667888999998887776653 457899999853 3466666666665 4478888
Q ss_pred c--CCeEEEEEc--CceEEEEcCC-CCeeeccC----CCC-----CCCC--E-EEEccCCeEEEEeC----CeEEEEcCC
Q 003405 166 C--GENICIAIR--KGYMILNATN-GALSEVFP----SGR-----IGPP--L-VVSLLSGELLLGKE----NIGVFVDQN 224 (823)
Q Consensus 166 ~--~~~i~v~~~--~~y~lidl~~-~~~~~L~~----~~~-----~~~p--~-i~~~~~~EfLL~~~----~~gvfv~~~ 224 (823)
. |..+++|+= ..+.++-+++ |....... .+. +..| . +..-+++.+|++.| ...+|-=.+
T Consensus 97 d~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~Dri~~y~~~d 176 (346)
T COG2706 97 DEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTDRIFLYDLDD 176 (346)
T ss_pred CCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCceEEEEEccc
Confidence 7 456777754 4566666644 55433321 121 1122 2 23347888888654 444553347
Q ss_pred CccccCCc--eeecCCCcEEEEeC--CEEEEE--eCCeEEEEEccCC-C--ceeEEEe
Q 003405 225 GKLLQADR--ICWSEAPIAVIIQK--PYAIAL--LPRRVEVRSLRVP-Y--ALIQTIV 273 (823)
Q Consensus 225 G~~~~~~~--i~w~~~P~~v~~~~--PYll~~--~~~~ieV~~l~~~-~--~lvQ~i~ 273 (823)
|..+.... ++=..-|++++|+. +|...+ ..+.|.|+...+. + ..+|++.
T Consensus 177 g~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~ 234 (346)
T COG2706 177 GKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTID 234 (346)
T ss_pred CccccccccccCCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeec
Confidence 87654333 33345699999974 444444 4588999987421 2 3467764
No 80
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=95.57 E-value=1.3 Score=52.15 Aligned_cols=189 Identities=15% Similarity=0.159 Sum_probs=109.5
Q ss_pred cCCCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 13 SNCSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 13 ~~~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
..+|.+|+|++++.+.+|+|...- |+.|.-.. ........ +...|..+..+.+ .++.
T Consensus 73 ~~lp~~I~alas~~~~vy~A~g~~-i~~~~rgk-------------------~i~~~~~~-~~a~v~~l~~fGe--~lia 129 (910)
T KOG1539|consen 73 KPLPDKITALASDKDYVYVASGNK-IYAYARGK-------------------HIRHTTLL-HGAKVHLLLPFGE--HLIA 129 (910)
T ss_pred CCCCCceEEEEecCceEEEecCcE-EEEEEccc-------------------eEEEEecc-ccceEEEEeeecc--eEEE
Confidence 467899999999999999998875 66664221 11112222 2467888877754 3444
Q ss_pred EeC--cEEEEeCCCC-cccc-cccCC----CCcEEEEeeCCC--ceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCce
Q 003405 93 LSE--SIAFHRLPNL-ETIA-VLTKA----KGANVYSWDDRR--GFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVK 161 (823)
Q Consensus 93 l~d--~l~~~~L~~l-~~~~-~i~~~----kg~~~fa~~~~~--~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~ 161 (823)
+.. .+.+|...+- ++.+ .++.. +++++++ ++.. ..|+|| .+.++.|+-++.++.....+++ ++.|+
T Consensus 130 ~d~~~~l~vw~~s~~~~e~~l~~~~~~~~~~~Ital~-HP~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f~~~--~s~IT 206 (910)
T KOG1539|consen 130 VDISNILFVWKTSSIQEELYLQSTFLKVEGDFITALL-HPSTYLNKIVVGSSQGRLQLWNVRTGKVVYTFQEF--FSRIT 206 (910)
T ss_pred EEccCcEEEEEeccccccccccceeeeccCCceeeEe-cchhheeeEEEeecCCcEEEEEeccCcEEEEeccc--cccee
Confidence 432 4889886552 1111 01111 2244432 3333 257776 6678889988855433333333 36777
Q ss_pred EEEecC--CeEEEEEcC-ceEEEEcCCCCeeeccCCCCCCCCEEEE-ccCCeEEEEeC---CeEEEEcCCCccc
Q 003405 162 SMSWCG--ENICIAIRK-GYMILNATNGALSEVFPSGRIGPPLVVS-LLSGELLLGKE---NIGVFVDQNGKLL 228 (823)
Q Consensus 162 ~l~~~~--~~i~v~~~~-~y~lidl~~~~~~~L~~~~~~~~p~i~~-~~~~EfLL~~~---~~gvfv~~~G~~~ 228 (823)
++.-.. |-+.+|+.+ ...++|+..+++.--|.... ++-..+. -.||+-+++.. +...|.|.++++.
T Consensus 207 ~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~sFk~d~-g~VtslSFrtDG~p~las~~~~G~m~~wDLe~kkl 279 (910)
T KOG1539|consen 207 AIEQSPALDVVAIGLENGTVIIFNLKFDKILMSFKQDW-GRVTSLSFRTDGNPLLASGRSNGDMAFWDLEKKKL 279 (910)
T ss_pred EeccCCcceEEEEeccCceEEEEEcccCcEEEEEEccc-cceeEEEeccCCCeeEEeccCCceEEEEEcCCCee
Confidence 776543 679999885 57888999887655554421 1111122 24666666432 3345678877653
No 81
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.48 E-value=2.3 Score=48.55 Aligned_cols=220 Identities=12% Similarity=0.164 Sum_probs=130.5
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
..+|-||+.|. -.+..+--+|.+.+|+-+. ..++++|. .+..||..-+.+...|.+++=
T Consensus 13 SdRVKsVd~HPtePw~la~LynG~V~IWnyet------------------qtmVksfe-V~~~PvRa~kfiaRknWiv~G 73 (794)
T KOG0276|consen 13 SDRVKSVDFHPTEPWILAALYNGDVQIWNYET------------------QTMVKSFE-VSEVPVRAAKFIARKNWIVTG 73 (794)
T ss_pred CCceeeeecCCCCceEEEeeecCeeEEEeccc------------------ceeeeeee-ecccchhhheeeeccceEEEe
Confidence 45899999996 4899999999999998542 23456654 457899999999999999999
Q ss_pred eCc--EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeeeecC-CCCceEEEec--
Q 003405 94 SES--IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWC-- 166 (823)
Q Consensus 94 ~d~--l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~-- 166 (823)
+|. |++|+..+++.+...+.-. =+.++++++..+.+.-+.. -.|.++.|.. .+.-.+.+.- .+-+-.+++.
T Consensus 74 sDD~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~vLtsSDDm~iKlW~we~--~wa~~qtfeGH~HyVMqv~fnPk 151 (794)
T KOG0276|consen 74 SDDMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPYVLTSSDDMTIKLWDWEN--EWACEQTFEGHEHYVMQVAFNPK 151 (794)
T ss_pred cCCceEEEEecccceeeEEeeccccceeeeeecCCCCeEEecCCccEEEEeeccC--ceeeeeEEcCcceEEEEEEecCC
Confidence 983 9999999988665432222 3557788887766555544 4567888874 2433333322 2355566666
Q ss_pred CCeEEEE--EcCceEEEEcCCCCeeeccCC--CCCCCCEEEEcc--CCeEEE-EeCCeEEEE-cCCCccccCCcee-ecC
Q 003405 167 GENICIA--IRKGYMILNATNGALSEVFPS--GRIGPPLVVSLL--SGELLL-GKENIGVFV-DQNGKLLQADRIC-WSE 237 (823)
Q Consensus 167 ~~~i~v~--~~~~y~lidl~~~~~~~L~~~--~~~~~p~i~~~~--~~EfLL-~~~~~gvfv-~~~G~~~~~~~i~-w~~ 237 (823)
++.-+++ ..+...+.++.+. .+.|.. ...+-.+|...+ +.-+|+ +.|+..+-| |.+.+ .+++ .++
T Consensus 152 D~ntFaS~sLDrTVKVWslgs~--~~nfTl~gHekGVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQtk----~CV~TLeG 225 (794)
T KOG0276|consen 152 DPNTFASASLDRTVKVWSLGSP--HPNFTLEGHEKGVNCVDYYTGGDKPYLISGADDLTIKVWDYQTK----SCVQTLEG 225 (794)
T ss_pred CccceeeeeccccEEEEEcCCC--CCceeeeccccCcceEEeccCCCcceEEecCCCceEEEeecchH----HHHHHhhc
Confidence 2222222 2244555555432 222222 122345666553 345777 677766644 43332 2222 222
Q ss_pred C--CcEEEEeC---CEEEEEeC-CeEEEEEc
Q 003405 238 A--PIAVIIQK---PYAIALLP-RRVEVRSL 262 (823)
Q Consensus 238 ~--P~~v~~~~---PYll~~~~-~~ieV~~l 262 (823)
. -.+.++.. |-|+.-++ +++-|.+-
T Consensus 226 Ht~Nvs~v~fhp~lpiiisgsEDGTvriWhs 256 (794)
T KOG0276|consen 226 HTNNVSFVFFHPELPIIISGSEDGTVRIWNS 256 (794)
T ss_pred ccccceEEEecCCCcEEEEecCCccEEEecC
Confidence 2 23344443 44555554 46777664
No 82
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=95.32 E-value=5.5 Score=48.54 Aligned_cols=60 Identities=20% Similarity=0.252 Sum_probs=42.5
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcC-CCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcc
Q 003405 506 ILDTALLQALLLTGQSSAALELLKG-LNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEES 576 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~-~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~ 576 (823)
.+.++|+.+|.+.++.+....++.. +.. | .--|..|+..|.+.|++++|++++.+.....
T Consensus 260 ~~~n~Li~~y~k~g~~~~A~~vf~~m~~~-~----------~vt~n~li~~y~~~g~~~eA~~lf~~M~~~g 320 (697)
T PLN03081 260 FVSCALIDMYSKCGDIEDARCVFDGMPEK-T----------TVAWNSMLAGYALHGYSEEALCLYYEMRDSG 320 (697)
T ss_pred eeHHHHHHHHHHCCCHHHHHHHHHhCCCC-C----------hhHHHHHHHHHHhCCCHHHHHHHHHHHHHcC
Confidence 4678889999998775555444432 110 1 1247889999999999999999999886543
No 83
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.27 E-value=1.7 Score=45.42 Aligned_cols=192 Identities=11% Similarity=0.153 Sum_probs=113.2
Q ss_pred CCCeeEEEEecccCceeeEe--Cc-EEEEeCCC---CcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCc
Q 003405 75 KKPILSMEVLASRQLLLSLS--ES-IAFHRLPN---LETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRG 147 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~--d~-l~~~~L~~---l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~ 147 (823)
...|..|..-|....+++.+ |+ |++|++.. +.+.....-...+-.+|.+++...++.+ ..|.+.+|.+..+ +
T Consensus 27 ~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S~-Q 105 (347)
T KOG0647|consen 27 EDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLASG-Q 105 (347)
T ss_pred ccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeeccCCceEEEEccCC-C
Confidence 45688888888655566554 55 99999854 3333333444567788888888788877 8888999998855 3
Q ss_pred eeEeeeecC-CCCceEEEecCCeE--EEEEcC---ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-E-eCCeEE
Q 003405 148 FVEVKDFGV-PDTVKSMSWCGENI--CIAIRK---GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-G-KENIGV 219 (823)
Q Consensus 148 f~~~kei~~-~~~~~~l~~~~~~i--~v~~~~---~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~-~~~~gv 219 (823)
. ..+.. ..+++++.|.+... |+++.+ .....|+.+.. ++....-..+-..+- ...++++ + .+....
T Consensus 106 ~---~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~--pv~t~~LPeRvYa~D-v~~pm~vVata~r~i~ 179 (347)
T KOG0647|consen 106 V---SQVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSN--PVATLQLPERVYAAD-VLYPMAVVATAERHIA 179 (347)
T ss_pred e---eeeeecccceeEEEEecCCCcceeEecccccceeecccCCCC--eeeeeeccceeeehh-ccCceeEEEecCCcEE
Confidence 2 23333 46889999996654 777763 45555655322 111110000011111 1233433 3 445556
Q ss_pred EEcCCCcccc----CCceeecCCCcEEEEe-CCEEEEEeCCeEEEEEccCCCceeEEEee
Q 003405 220 FVDQNGKLLQ----ADRICWSEAPIAVIIQ-KPYAIALLPRRVEVRSLRVPYALIQTIVL 274 (823)
Q Consensus 220 fv~~~G~~~~----~~~i~w~~~P~~v~~~-~PYll~~~~~~ieV~~l~~~~~lvQ~i~l 274 (823)
.+|.++.++. .+++.|-..-.++.-. .-|.++-.++.+.|+.+. +..-.+.+.+
T Consensus 180 vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiEGrv~iq~id-~~~~~~nFtF 238 (347)
T KOG0647|consen 180 VYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIEGRVAIQYID-DPNPKDNFTF 238 (347)
T ss_pred EEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeecceEEEEecC-CCCccCceeE
Confidence 6787665542 3677785443333322 568888888999999985 3322444443
No 84
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=95.25 E-value=1.4 Score=44.92 Aligned_cols=217 Identities=13% Similarity=0.184 Sum_probs=125.6
Q ss_pred EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEE
Q 003405 22 VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAF 99 (823)
Q Consensus 22 i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~ 99 (823)
++..++.|.+|.+. .|.+|+++..... .+.++.+. .+.|+.+..=.....+.+=+| | +++
T Consensus 48 iTpdk~~LAaa~~q-hvRlyD~~S~np~----------------Pv~t~e~h-~kNVtaVgF~~dgrWMyTgseDgt~kI 109 (311)
T KOG0315|consen 48 ITPDKKDLAAAGNQ-HVRLYDLNSNNPN----------------PVATFEGH-TKNVTAVGFQCDGRWMYTGSEDGTVKI 109 (311)
T ss_pred EcCCcchhhhccCC-eeEEEEccCCCCC----------------ceeEEecc-CCceEEEEEeecCeEEEecCCCceEEE
Confidence 44444566666665 4788998765431 12344443 567888876666667777776 5 999
Q ss_pred EeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCC---CCceEEEec--CCeEEEE
Q 003405 100 HRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVP---DTVKSMSWC--GENICIA 173 (823)
Q Consensus 100 ~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~---~~~~~l~~~--~~~i~v~ 173 (823)
|+|..+.-.........|+.++++++.+.++++ ....|.++.+..+ .+. .| .+| ..+++++.. |..+..+
T Consensus 110 WdlR~~~~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~-~c~--~~-liPe~~~~i~sl~v~~dgsml~a~ 185 (311)
T KOG0315|consen 110 WDLRSLSCQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGEN-SCT--HE-LIPEDDTSIQSLTVMPDGSMLAAA 185 (311)
T ss_pred EeccCcccchhccCCCCcceEEecCCcceEEeecCCCcEEEEEccCC-ccc--cc-cCCCCCcceeeEEEcCCCcEEEEe
Confidence 999886433223334578999999999999998 4456888887744 232 12 123 255665554 6778877
Q ss_pred EcCce-EEEEcCCCC-eeeccCCCCC----CCCEEEE-ccCCeEEEE-e-CCeEEEEcCCCcc----c--cCCceeecCC
Q 003405 174 IRKGY-MILNATNGA-LSEVFPSGRI----GPPLVVS-LLSGELLLG-K-ENIGVFVDQNGKL----L--QADRICWSEA 238 (823)
Q Consensus 174 ~~~~y-~lidl~~~~-~~~L~~~~~~----~~p~i~~-~~~~EfLL~-~-~~~gvfv~~~G~~----~--~~~~i~w~~~ 238 (823)
+.++- ++-++-+++ ..++.|..+- ..-+-|. .+++.+|.. . |...-+-+.+|.. + ......|.
T Consensus 186 nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~gh~rWvWd-- 263 (311)
T KOG0315|consen 186 NNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLTGHQRWVWD-- 263 (311)
T ss_pred cCCccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEeecCCceEEe--
Confidence 77654 455766654 3444444321 1112222 367777773 3 3333344555541 0 01235553
Q ss_pred CcEEEEeCCEEEEEeCC-eEEEEEcc
Q 003405 239 PIAVIIQKPYAIALLPR-RVEVRSLR 263 (823)
Q Consensus 239 P~~v~~~~PYll~~~~~-~ieV~~l~ 263 (823)
-.+....-||+...++ ...+.++.
T Consensus 264 -c~FS~dg~YlvTassd~~~rlW~~~ 288 (311)
T KOG0315|consen 264 -CAFSADGEYLVTASSDHTARLWDLS 288 (311)
T ss_pred -eeeccCccEEEecCCCCceeecccc
Confidence 2244456788888774 34455554
No 85
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=95.22 E-value=1 Score=51.74 Aligned_cols=194 Identities=12% Similarity=0.093 Sum_probs=115.4
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
+..+.|++.++..++=|..|+.|.+-...+.... ... +...+.+++ +..+...+.|+++-
T Consensus 291 ~hdvrs~av~~~~l~sgG~d~~l~i~~s~~~~~~-------------~h~---~~~~~p~~~---~v~~a~~~~L~~~w~ 351 (691)
T KOG2048|consen 291 AHDVRSMAVIENALISGGRDFTLAICSSREFKNM-------------DHR---QKNLFPASD---RVSVAPENRLLVLWK 351 (691)
T ss_pred cccceeeeeecceEEecceeeEEEEccccccCch-------------hhh---ccccccccc---eeecCccceEEEEec
Confidence 4579999999999999999999887654432211 000 001121222 22333345666665
Q ss_pred C-cEEEEeCCCCc------cc----ccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeE--eeeecCC-CCc
Q 003405 95 E-SIAFHRLPNLE------TI----AVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVE--VKDFGVP-DTV 160 (823)
Q Consensus 95 d-~l~~~~L~~l~------~~----~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~--~kei~~~-~~~ 160 (823)
+ ++..|.+.+-. .. ..+.+..++++-|++++...||++.=.+..||++..+...+. ++...+. -++
T Consensus 352 ~h~v~lwrlGS~~~~g~~~~~~Llkl~~k~~~nIs~~aiSPdg~~Ia~st~~~~~iy~L~~~~~vk~~~v~~~~~~~~~a 431 (691)
T KOG2048|consen 352 AHGVDLWRLGSVILQGEYNYIHLLKLFTKEKENISCAAISPDGNLIAISTVSRTKIYRLQPDPNVKVINVDDVPLALLDA 431 (691)
T ss_pred cccccceeccCcccccccChhhheeeecCCccceeeeccCCCCCEEEEeeccceEEEEeccCcceeEEEeccchhhhccc
Confidence 4 58888886541 11 123455678888899988889999999999999985431211 1111111 133
Q ss_pred eEEEe--cCCeEEEEEc--CceEEEEcCCCCeeeccCCCCC-CCCEEEE---ccCCeEEEEeCC--eEEEEcCCCccc
Q 003405 161 KSMSW--CGENICIAIR--KGYMILNATNGALSEVFPSGRI-GPPLVVS---LLSGELLLGKEN--IGVFVDQNGKLL 228 (823)
Q Consensus 161 ~~l~~--~~~~i~v~~~--~~y~lidl~~~~~~~L~~~~~~-~~p~i~~---~~~~EfLL~~~~--~gvfv~~~G~~~ 228 (823)
..+.+ .++.++++.. .+...+++.+++-.++.+.... ..|-|+. .++|+++.+.+. ....+|.++...
T Consensus 432 ~~i~ftid~~k~~~~s~~~~~le~~el~~ps~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~t~g~I~v~nl~~~~~ 509 (691)
T KOG2048|consen 432 SAISFTIDKNKLFLVSKNIFSLEEFELETPSFKELKSIQSQAKCPSISRLVVSSDGNYIAAISTRGQIFVYNLETLES 509 (691)
T ss_pred eeeEEEecCceEEEEecccceeEEEEecCcchhhhhccccccCCCcceeEEEcCCCCEEEEEeccceEEEEEccccee
Confidence 34444 3777777774 4566777777776666554322 2344443 478888876543 334468777644
No 86
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=95.21 E-value=1.7 Score=45.69 Aligned_cols=179 Identities=17% Similarity=0.220 Sum_probs=110.1
Q ss_pred CCCeeEEEEecccC--ceeeEeC-c-EEEEeCCCCcccccccC-CCCcEEEEeeCCCce--EEEEEcCeEEEEEEcCCC-
Q 003405 75 KKPILSMEVLASRQ--LLLSLSE-S-IAFHRLPNLETIAVLTK-AKGANVYSWDDRRGF--LCFARQKRVCIFRHDGGR- 146 (823)
Q Consensus 75 k~~I~qI~~~~~~~--~Ll~l~d-~-l~~~~L~~l~~~~~i~~-~kg~~~fa~~~~~~~--l~V~~kkki~l~~~~~~~- 146 (823)
...|+.++.-+... .|++.+| | |.+|+..+++...++.. ...|+.+++++.. . |.|+..+.+.++-+-.++
T Consensus 83 agsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~lsiHPS~-KLALsVg~D~~lr~WNLV~Gr~ 161 (362)
T KOG0294|consen 83 AGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDLSIHPSG-KLALSVGGDQVLRTWNLVRGRV 161 (362)
T ss_pred ccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeecccccccceeEecCCC-ceEEEEcCCceeeeehhhcCcc
Confidence 46788888888764 7888887 5 99999887765544322 2348889998864 4 455666777666554332
Q ss_pred ceeEeeeecCCCCceEEEec--CCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEE-EccCCeEEEEeCCeEE-EEc
Q 003405 147 GFVEVKDFGVPDTVKSMSWC--GENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVV-SLLSGELLLGKENIGV-FVD 222 (823)
Q Consensus 147 ~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~-~~~~~EfLL~~~~~gv-fv~ 222 (823)
.|. ..+...++.+.|. |+..+++.++...++-+.+..+..-.... .+++++ +...++++++.|+..+ +.|
T Consensus 162 a~v----~~L~~~at~v~w~~~Gd~F~v~~~~~i~i~q~d~A~v~~~i~~~--~r~l~~~~l~~~~L~vG~d~~~i~~~D 235 (362)
T KOG0294|consen 162 AFV----LNLKNKATLVSWSPQGDHFVVSGRNKIDIYQLDNASVFREIENP--KRILCATFLDGSELLVGGDNEWISLKD 235 (362)
T ss_pred cee----eccCCcceeeEEcCCCCEEEEEeccEEEEEecccHhHhhhhhcc--ccceeeeecCCceEEEecCCceEEEec
Confidence 232 2345567778887 77788888888888877765432111111 234444 4455667778776544 445
Q ss_pred CCCccccCCceeecCC---CcEEE-EeC---CEEEEEeC-CeEEEEEcc
Q 003405 223 QNGKLLQADRICWSEA---PIAVI-IQK---PYAIALLP-RRVEVRSLR 263 (823)
Q Consensus 223 ~~G~~~~~~~i~w~~~---P~~v~-~~~---PYll~~~~-~~ieV~~l~ 263 (823)
.+.. ...-.+... ...++ |.. -||+.+++ +.|-|.++.
T Consensus 236 ~ds~---~~~~~~~AH~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~ 281 (362)
T KOG0294|consen 236 TDSD---TPLTEFLAHENRVKDIASYTNPEHEYLVTASSDGFIKVWDID 281 (362)
T ss_pred cCCC---ccceeeecchhheeeeEEEecCCceEEEEeccCceEEEEEcc
Confidence 4421 123333333 34555 333 27777776 578888874
No 87
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=95.02 E-value=7.4 Score=43.56 Aligned_cols=151 Identities=15% Similarity=0.227 Sum_probs=92.0
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-EEEEeCCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-IAFHRLPN 104 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-l~~~~L~~ 104 (823)
+.+|.|-+-.|.++.|....+.-. ++ .-.|....|+-|..+...|+.+.+--=++| +.+|+-..
T Consensus 212 d~nliit~Gk~H~~Fw~~~~~~l~-----------k~----~~~fek~ekk~Vl~v~F~engdviTgDS~G~i~Iw~~~~ 276 (626)
T KOG2106|consen 212 DPNLIITCGKGHLYFWTLRGGSLV-----------KR----QGIFEKREKKFVLCVTFLENGDVITGDSGGNILIWSKGT 276 (626)
T ss_pred CCcEEEEeCCceEEEEEccCCceE-----------EE----eeccccccceEEEEEEEcCCCCEEeecCCceEEEEeCCC
Confidence 579999999999999976553311 11 113344567889999988887765443445 99998654
Q ss_pred Ccccccc-cCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCC---CceEEEecCCeEEEEEcCceEE
Q 003405 105 LETIAVL-TKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPD---TVKSMSWCGENICIAIRKGYMI 180 (823)
Q Consensus 105 l~~~~~i-~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~---~~~~l~~~~~~i~v~~~~~y~l 180 (823)
-....++ ...+|+-+.|.-.+ |.|.-+.|.+ .|..|++ ..++.+|+.+|| +|+.++=.+.-|+||+.+.+.+
T Consensus 277 ~~~~k~~~aH~ggv~~L~~lr~-GtllSGgKDR-ki~~Wd~--~y~k~r~~elPe~~G~iRtv~e~~~di~vGTtrN~iL 352 (626)
T KOG2106|consen 277 NRISKQVHAHDGGVFSLCMLRD-GTLLSGGKDR-KIILWDD--NYRKLRETELPEQFGPIRTVAEGKGDILVGTTRNFIL 352 (626)
T ss_pred ceEEeEeeecCCceEEEEEecC-ccEeecCccc-eEEeccc--cccccccccCchhcCCeeEEecCCCcEEEeeccceEE
Confidence 4322221 12234434333332 4444453322 2556663 367888999985 7888877655599999988877
Q ss_pred E-EcCCCCeeeccCCC
Q 003405 181 L-NATNGALSEVFPSG 195 (823)
Q Consensus 181 i-dl~~~~~~~L~~~~ 195 (823)
. +++++-....+..+
T Consensus 353 ~Gt~~~~f~~~v~gh~ 368 (626)
T KOG2106|consen 353 QGTLENGFTLTVQGHG 368 (626)
T ss_pred EeeecCCceEEEEecc
Confidence 6 45555444444443
No 88
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=95.02 E-value=4.4 Score=44.59 Aligned_cols=160 Identities=7% Similarity=-0.006 Sum_probs=98.1
Q ss_pred EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEc-----------CeEEEEEEcCCCceeEeeeecCCCCce----
Q 003405 97 IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQ-----------KRVCIFRHDGGRGFVEVKDFGVPDTVK---- 161 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~k-----------kki~l~~~~~~~~f~~~kei~~~~~~~---- 161 (823)
+++++..+.+.+..++-.+.-+.. ++++...+.|+.. ..|.+|... ..+.++++.+|+.|.
T Consensus 29 v~ViD~~~~~v~g~i~~G~~P~~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~---t~~~~~~i~~p~~p~~~~~ 104 (352)
T TIGR02658 29 VYTIDGEAGRVLGMTDGGFLPNPV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQ---THLPIADIELPEGPRFLVG 104 (352)
T ss_pred EEEEECCCCEEEEEEEccCCCcee-ECCCCCEEEEEeccccccccCCCCCEEEEEECc---cCcEEeEEccCCCchhhcc
Confidence 677777777666555544444443 7777777777644 567777766 345678888887766
Q ss_pred ----EEEec--CCeEEEEE---cCceEEEEcCCCCeeeccCCCCCCCCEEEEccCC-eEEEEeCCeEEE--EcCCCcccc
Q 003405 162 ----SMSWC--GENICIAI---RKGYMILNATNGALSEVFPSGRIGPPLVVSLLSG-ELLLGKENIGVF--VDQNGKLLQ 229 (823)
Q Consensus 162 ----~l~~~--~~~i~v~~---~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~-EfLL~~~~~gvf--v~~~G~~~~ 229 (823)
.+++. |..++|++ .....++|+.++++..-.+.+. -+.+...+++ .+++|.|..... .+.+|+..
T Consensus 105 ~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~--~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~- 181 (352)
T TIGR02658 105 TYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPD--CYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPK- 181 (352)
T ss_pred CccceEEECCCCCEEEEecCCCCCEEEEEECCCCcEEEEEeCCC--CcEEEEecCCccEEEeecCceEEEEecCCCceE-
Confidence 55555 56788887 4679999999998766555432 2444444444 445587765444 45666632
Q ss_pred CCceee--c------CCCcEEEEeCCEEEEEeCCeEEEEEcc
Q 003405 230 ADRICW--S------EAPIAVIIQKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 230 ~~~i~w--~------~~P~~v~~~~PYll~~~~~~ieV~~l~ 263 (823)
..+... . ..|.......-++++-+++.+.+.++.
T Consensus 182 ~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~eG~V~~id~~ 223 (352)
T TIGR02658 182 IKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYTGKIFQIDLS 223 (352)
T ss_pred EeeeeeecCCccccccCCceEcCCCcEEEEecCCeEEEEecC
Confidence 223332 2 334222223456666667778888753
No 89
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=95.01 E-value=6.7 Score=42.76 Aligned_cols=178 Identities=14% Similarity=0.103 Sum_probs=103.3
Q ss_pred cEEEEeCCC---CcccccccCCCCcEEEEeeCCCceEEEEE--cCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CC
Q 003405 96 SIAFHRLPN---LETIAVLTKAKGANVYSWDDRRGFLCFAR--QKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GE 168 (823)
Q Consensus 96 ~l~~~~L~~---l~~~~~i~~~kg~~~fa~~~~~~~l~V~~--kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~ 168 (823)
+|.+|++.+ ++.+..++...+.+.++++++...++|+. .+.|.+|.++.+..+...+.+..++.|..+++. |+
T Consensus 13 ~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~ 92 (330)
T PRK11028 13 QIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGR 92 (330)
T ss_pred CEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCC
Confidence 388888842 23333443345667788888877787773 467888888743345555566677788999988 55
Q ss_pred eEEEEEc--CceEEEEcCC-CCeeeccCC-CCCCCCEEE-EccCCeEEEEe---CCeEEEEcC--CCcccc--CCce--e
Q 003405 169 NICIAIR--KGYMILNATN-GALSEVFPS-GRIGPPLVV-SLLSGELLLGK---ENIGVFVDQ--NGKLLQ--ADRI--C 234 (823)
Q Consensus 169 ~i~v~~~--~~y~lidl~~-~~~~~L~~~-~~~~~p~i~-~~~~~EfLL~~---~~~gvfv~~--~G~~~~--~~~i--~ 234 (823)
.++++.. ....++++++ |........ .....|..+ .-+++.++++. ++....++. .|.... ...+ .
T Consensus 93 ~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~ 172 (330)
T PRK11028 93 FLFSASYNANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTV 172 (330)
T ss_pred EEEEEEcCCCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCcccccCCCceecC
Confidence 6888764 5567888863 322211111 011123333 34567776532 244444443 343321 1111 1
Q ss_pred ecCCCcEEEEeC--CEEEEEeC--CeEEEEEccC-CC--ceeEEEe
Q 003405 235 WSEAPIAVIIQK--PYAIALLP--RRVEVRSLRV-PY--ALIQTIV 273 (823)
Q Consensus 235 w~~~P~~v~~~~--PYll~~~~--~~ieV~~l~~-~~--~lvQ~i~ 273 (823)
=...|..+++.. .|+++... +.|-++++.. ++ .++|++.
T Consensus 173 ~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~ 218 (330)
T PRK11028 173 EGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECVQTLD 218 (330)
T ss_pred CCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEe
Confidence 245688888764 38888875 6888888852 12 3467664
No 90
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=95.01 E-value=0.85 Score=48.95 Aligned_cols=171 Identities=16% Similarity=0.219 Sum_probs=93.0
Q ss_pred CCcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCc--c-ccc-ccccceeee-eecCCCCCCeeEE--------
Q 003405 16 SPKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSD--Y-QSL-RKESYELER-TISGFSKKPILSM-------- 81 (823)
Q Consensus 16 ~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d--~-~~l-~~~~~~l~~-~~~~~~k~~I~qI-------- 81 (823)
...|+|+...| +.||-|+.|++|.+|+.....-...=.+. . +.+ -...|.+.. .|.. ++..+.+.
T Consensus 247 T~~VTCvrwGG~gliySgS~DrtIkvw~a~dG~~~r~lkGHahwvN~lalsTdy~LRtgaf~~-t~~~~~~~se~~~~Al 325 (480)
T KOG0271|consen 247 TASVTCVRWGGEGLIYSGSQDRTIKVWRALDGKLCRELKGHAHWVNHLALSTDYVLRTGAFDH-TGRKPKSFSEEQKKAL 325 (480)
T ss_pred ccceEEEEEcCCceEEecCCCceEEEEEccchhHHHhhcccchheeeeeccchhhhhcccccc-ccccCCChHHHHHHHH
Confidence 45799999885 68999999999999986642110000000 0 000 000111100 0000 01111100
Q ss_pred ----EEec-ccCceeeEeCc--EEEEeCCCCc-cccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEe
Q 003405 82 ----EVLA-SRQLLLSLSES--IAFHRLPNLE-TIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEV 151 (823)
Q Consensus 82 ----~~~~-~~~~Ll~l~d~--l~~~~L~~l~-~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~ 151 (823)
.+.+ ..+.|++-||+ +.+|+-...+ ++.... ..+-|+.+..+++...|+-| ..|.|.++..+.++....+
T Consensus 326 ~rY~~~~~~~~erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~tGk~lasf 405 (480)
T KOG0271|consen 326 ERYEAVLKDSGERLVSGSDDFTLFLWNPFKSKKPITRMTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGRTGKFLASF 405 (480)
T ss_pred HHHHHhhccCcceeEEecCCceEEEecccccccchhhhhchhhheeeEEECCCccEEEEeecccceeeeeCCCcchhhhh
Confidence 0001 22679999995 6777643332 333322 22346778888887777776 7888999888755322222
Q ss_pred eeecCCCCceEEEecCC-eEEEEEc--CceEEEEcCCCCee
Q 003405 152 KDFGVPDTVKSMSWCGE-NICIAIR--KGYMILNATNGALS 189 (823)
Q Consensus 152 kei~~~~~~~~l~~~~~-~i~v~~~--~~y~lidl~~~~~~ 189 (823)
|-- -+.+-.++|..+ .+.|... +...+.++.+.+..
T Consensus 406 RGH--v~~VYqvawsaDsRLlVS~SkDsTLKvw~V~tkKl~ 444 (480)
T KOG0271|consen 406 RGH--VAAVYQVAWSADSRLLVSGSKDSTLKVWDVRTKKLK 444 (480)
T ss_pred hhc--cceeEEEEeccCccEEEEcCCCceEEEEEeeeeeec
Confidence 211 146778999966 4666655 45678888876543
No 91
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.98 E-value=3.8 Score=47.73 Aligned_cols=225 Identities=12% Similarity=0.119 Sum_probs=131.3
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
|.-+..++..+..|..|..||.+.+|++... | +...++++ ...|..+..=|..+..+..++
T Consensus 107 Pvi~ma~~~~g~LlAtggaD~~v~VWdi~~~-----------------~-~th~fkG~-gGvVssl~F~~~~~~~lL~sg 167 (775)
T KOG0319|consen 107 PVITMAFDPTGTLLATGGADGRVKVWDIKNG-----------------Y-CTHSFKGH-GGVVSSLLFHPHWNRWLLASG 167 (775)
T ss_pred CeEEEEEcCCCceEEeccccceEEEEEeeCC-----------------E-EEEEecCC-CceEEEEEeCCccchhheeec
Confidence 4456677777889999999999999998753 2 23466665 567888877777766444443
Q ss_pred ---c-EEEEeCCCCcc-ccc-ccCCCCcEEEEeeCCCce-EEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC
Q 003405 96 ---S-IAFHRLPNLET-IAV-LTKAKGANVYSWDDRRGF-LCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE 168 (823)
Q Consensus 96 ---~-l~~~~L~~l~~-~~~-i~~~kg~~~fa~~~~~~~-l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~ 168 (823)
+ +.+|++.+-.+ .+. ......++..+..++.-. +.++..|-+.+|.+. ..+..+-+++-+.+-++.+..+
T Consensus 168 ~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~~RDkvi~vwd~~---~~~~l~~lp~ye~~E~vv~l~~ 244 (775)
T KOG0319|consen 168 ATDGTVRVWNLNDKRTCLHTMILHKSAVTSLAFSEDSLELLSVGRDKVIIVWDLV---QYKKLKTLPLYESLESVVRLRE 244 (775)
T ss_pred CCCceEEEEEcccCchHHHHHHhhhhheeeeeeccCCceEEEeccCcEEEEeehh---hhhhhheechhhheeeEEEech
Confidence 3 99999975433 221 122335677777666433 444455555666664 2334444555556666655432
Q ss_pred -------e-EEEEEcCceEEEEcCCCCeeeccCC--CCCCCCEEEEccCCeEE-EEeCCeEEEEcC-CCccccCCceeec
Q 003405 169 -------N-ICIAIRKGYMILNATNGALSEVFPS--GRIGPPLVVSLLSGELL-LGKENIGVFVDQ-NGKLLQADRICWS 236 (823)
Q Consensus 169 -------~-i~v~~~~~y~lidl~~~~~~~L~~~--~~~~~p~i~~~~~~EfL-L~~~~~gvfv~~-~G~~~~~~~i~w~ 236 (823)
. +.+|...-+.++|..+++...-... +.....+......+.++ +..+...++|+. +++++ +.-+-+.
T Consensus 245 ~~~~~~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vtaeQnl~l~d~~~l~i~-k~ivG~n 323 (775)
T KOG0319|consen 245 ELGGKGEYIITAGGSGVVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVTAEQNLFLYDEDELTIV-KQIVGYN 323 (775)
T ss_pred hcCCcceEEEEecCCceEEEEecccchhhhhhccCCchhhhcceeccccCceEEEEccceEEEEEccccEEe-hhhcCCc
Confidence 3 4444445577888877653222111 11111112122334444 456667777764 44544 4555677
Q ss_pred CCCcEEEEeC---CEEEEEeC-CeEEEEEcc
Q 003405 237 EAPIAVIIQK---PYAIALLP-RRVEVRSLR 263 (823)
Q Consensus 237 ~~P~~v~~~~---PYll~~~~-~~ieV~~l~ 263 (823)
+....+-+.. .|+.+.+. ..+-++++.
T Consensus 324 dEI~Dm~~lG~e~~~laVATNs~~lr~y~~~ 354 (775)
T KOG0319|consen 324 DEILDMKFLGPEESHLAVATNSPELRLYTLP 354 (775)
T ss_pred hhheeeeecCCccceEEEEeCCCceEEEecC
Confidence 7777777766 57766664 567788764
No 92
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=94.92 E-value=0.98 Score=47.13 Aligned_cols=150 Identities=11% Similarity=0.178 Sum_probs=97.8
Q ss_pred CCCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCcee
Q 003405 14 NCSPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLL 91 (823)
Q Consensus 14 ~~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll 91 (823)
+.+..|-|++.-+ ..+|.|..||.+..|++..+.. .++ +.+..||..+.-++..+.=+
T Consensus 70 ~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q~-------------------~~v-~~Hd~pvkt~~wv~~~~~~c 129 (347)
T KOG0647|consen 70 SHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLASGQV-------------------SQV-AAHDAPVKTCHWVPGMNYQC 129 (347)
T ss_pred ccCCCeEEEEEccCCceEEeeccCCceEEEEccCCCe-------------------eee-eecccceeEEEEecCCCcce
Confidence 3455677777654 5999999999999999875421 232 23478999999999877434
Q ss_pred eEeC----cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCC-CceeEeeeecCCCCceEEEe
Q 003405 92 SLSE----SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGG-RGFVEVKDFGVPDTVKSMSW 165 (823)
Q Consensus 92 ~l~d----~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~-~~f~~~kei~~~~~~~~l~~ 165 (823)
+.++ .|++||+..-.++.++.- . -.++|.+.....++|+ ..|.|.+|.+.++ .+|+.+ +-.+.-.+++++.
T Consensus 130 l~TGSWDKTlKfWD~R~~~pv~t~~L-P-eRvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~-~SpLk~Q~R~va~ 206 (347)
T KOG0647|consen 130 LVTGSWDKTLKFWDTRSSNPVATLQL-P-ERVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRI-ESPLKWQTRCVAC 206 (347)
T ss_pred eEecccccceeecccCCCCeeeeeec-c-ceeeehhccCceeEEEecCCcEEEEEcCCCcchhhhh-cCcccceeeEEEE
Confidence 4444 299999876666554322 1 1456666666667777 6788999999753 234322 3345567888877
Q ss_pred c--CCeEEEEEcCc-eEEEEcCCC
Q 003405 166 C--GENICIAIRKG-YMILNATNG 186 (823)
Q Consensus 166 ~--~~~i~v~~~~~-y~lidl~~~ 186 (823)
. ++.-.+|.-.+ ..+-.++.+
T Consensus 207 f~d~~~~alGsiEGrv~iq~id~~ 230 (347)
T KOG0647|consen 207 FQDKDGFALGSIEGRVAIQYIDDP 230 (347)
T ss_pred EecCCceEeeeecceEEEEecCCC
Confidence 6 33456666543 445555554
No 93
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=94.90 E-value=0.24 Score=54.82 Aligned_cols=146 Identities=15% Similarity=0.211 Sum_probs=88.8
Q ss_pred EEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCC-CCCCeeEEEEecccCceeeEeC-
Q 003405 21 AVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGF-SKKPILSMEVLASRQLLLSLSE- 95 (823)
Q Consensus 21 ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~-~k~~I~qI~~~~~~~~Ll~l~d- 95 (823)
..|+|+ +.+..|+-||.|-+|+...... ++...++. ++ ....|+.|..-...++|++-+.
T Consensus 321 tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~v-------------~p~~~vk~--AH~~g~~Itsi~FS~dg~~LlSRg~D 385 (641)
T KOG0772|consen 321 TSCAWNRDGKLIAAGCLDGSIQIWDKGSRTV-------------RPVMKVKD--AHLPGQDITSISFSYDGNYLLSRGFD 385 (641)
T ss_pred eeeecCCCcchhhhcccCCceeeeecCCccc-------------ccceEeee--ccCCCCceeEEEeccccchhhhccCC
Confidence 344553 5789999999999998643221 11222222 22 2458999999999999999986
Q ss_pred c-EEEEeCCCCccc-c---cccCCCCcEEEEeeCCCceEEEEE--c-----CeEEEEEEcCCCceeEeeeecCCC-CceE
Q 003405 96 S-IAFHRLPNLETI-A---VLTKAKGANVYSWDDRRGFLCFAR--Q-----KRVCIFRHDGGRGFVEVKDFGVPD-TVKS 162 (823)
Q Consensus 96 ~-l~~~~L~~l~~~-~---~i~~~kg~~~fa~~~~~~~l~V~~--k-----kki~l~~~~~~~~f~~~kei~~~~-~~~~ 162 (823)
+ +++|+|..++.. . .++....-+-+|.+++...|+.|. . .++.+|.-. .|..+.+|.++. .+..
T Consensus 386 ~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~---t~d~v~ki~i~~aSvv~ 462 (641)
T KOG0772|consen 386 DTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDRM---TLDTVYKIDISTASVVR 462 (641)
T ss_pred CceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEecc---ceeeEEEecCCCceEEE
Confidence 3 999999887522 1 122223334445565543333331 1 124444333 576677766653 5566
Q ss_pred EEecC--CeEEEEEcCc--eEEEEcC
Q 003405 163 MSWCG--ENICIAIRKG--YMILNAT 184 (823)
Q Consensus 163 l~~~~--~~i~v~~~~~--y~lidl~ 184 (823)
+.|.. |.|++|+..+ +++||-+
T Consensus 463 ~~WhpkLNQi~~gsgdG~~~vyYdp~ 488 (641)
T KOG0772|consen 463 CLWHPKLNQIFAGSGDGTAHVYYDPN 488 (641)
T ss_pred EeecchhhheeeecCCCceEEEECcc
Confidence 77874 6899998753 5566644
No 94
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=94.84 E-value=2.7 Score=44.37 Aligned_cols=147 Identities=16% Similarity=0.224 Sum_probs=97.7
Q ss_pred cCCCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCce
Q 003405 13 SNCSPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLL 90 (823)
Q Consensus 13 ~~~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~L 90 (823)
..+...|+|.+-.. ..+.++.++-.+++|...+.. .++...+.+. +.+.|+-|.-.|..|.+
T Consensus 7 ~~~~~pitchAwn~drt~iAv~~~~~evhiy~~~~~~---------------~w~~~htls~-Hd~~vtgvdWap~snrI 70 (361)
T KOG1523|consen 7 HRLLEPITCHAWNSDRTQIAVSPNNHEVHIYSMLGAD---------------LWEPAHTLSE-HDKIVTGVDWAPKSNRI 70 (361)
T ss_pred eeccCceeeeeecCCCceEEeccCCceEEEEEecCCC---------------Cceeceehhh-hCcceeEEeecCCCCce
Confidence 34567899988765 479999999999999855432 2444444433 36789999999999999
Q ss_pred eeEeC--cEEEEeCCC---Ccccccc-cCCCCcEEEEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeee--ecCCCCce
Q 003405 91 LSLSE--SIAFHRLPN---LETIAVL-TKAKGANVYSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKD--FGVPDTVK 161 (823)
Q Consensus 91 l~l~d--~l~~~~L~~---l~~~~~i-~~~kg~~~fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~ke--i~~~~~~~ 161 (823)
+.++- +-++|..++ .++...+ .-.+.++++...+....++|+.. |.|.++.+.+.+.+-.-|- .++-.+|+
T Consensus 71 vtcs~drnayVw~~~~~~~WkptlvLlRiNrAAt~V~WsP~enkFAVgSgar~isVcy~E~ENdWWVsKhikkPirStv~ 150 (361)
T KOG1523|consen 71 VTCSHDRNAYVWTQPSGGTWKPTLVLLRINRAATCVKWSPKENKFAVGSGARLISVCYYEQENDWWVSKHIKKPIRSTVT 150 (361)
T ss_pred eEccCCCCccccccCCCCeeccceeEEEeccceeeEeecCcCceEEeccCccEEEEEEEecccceehhhhhCCcccccee
Confidence 99984 588888743 2332222 23346777777777778999855 5566666654222322222 34457899
Q ss_pred EEEecCCeEEEEEc
Q 003405 162 SMSWCGENICIAIR 175 (823)
Q Consensus 162 ~l~~~~~~i~v~~~ 175 (823)
++.|..+.+..+..
T Consensus 151 sldWhpnnVLlaaG 164 (361)
T KOG1523|consen 151 SLDWHPNNVLLAAG 164 (361)
T ss_pred eeeccCCcceeccc
Confidence 99999765544443
No 95
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=94.78 E-value=0.42 Score=51.71 Aligned_cols=171 Identities=14% Similarity=0.156 Sum_probs=105.6
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|++++.. +..++-.|+||.+.+|+++.. .+..++.+ +..+|+..........++..+
T Consensus 220 g~it~~d~d~~~~~~iAas~d~~~r~Wnvd~~------------------r~~~TLsG-HtdkVt~ak~~~~~~~vVsgs 280 (459)
T KOG0288|consen 220 GNITSIDFDSDNKHVIAASNDKNLRLWNVDSL------------------RLRHTLSG-HTDKVTAAKFKLSHSRVVSGS 280 (459)
T ss_pred CCcceeeecCCCceEEeecCCCceeeeeccch------------------hhhhhhcc-cccceeeehhhccccceeecc
Confidence 458777655 468888999999999997642 22234444 356788877776666644444
Q ss_pred C--cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEE-E-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CC
Q 003405 95 E--SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCF-A-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GE 168 (823)
Q Consensus 95 d--~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V-~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~ 168 (823)
. .++.|+|.+-.-..++-....|+.++++ +..++ + ..+||.+|..... ..++++++.+.++++... |.
T Consensus 281 ~DRtiK~WDl~k~~C~kt~l~~S~cnDI~~~---~~~~~SgH~DkkvRfwD~Rs~---~~~~sv~~gg~vtSl~ls~~g~ 354 (459)
T KOG0288|consen 281 ADRTIKLWDLQKAYCSKTVLPGSQCNDIVCS---ISDVISGHFDKKVRFWDIRSA---DKTRSVPLGGRVTSLDLSMDGL 354 (459)
T ss_pred ccchhhhhhhhhhheeccccccccccceEec---ceeeeecccccceEEEeccCC---ceeeEeecCcceeeEeeccCCe
Confidence 3 3999998763222223333456666665 12222 2 5678888776633 456788888999988765 44
Q ss_pred eEEEEEc-CceEEEEcCCCCeeeccCCCC----CCCCEEEEccCCeEEE
Q 003405 169 NICIAIR-KGYMILNATNGALSEVFPSGR----IGPPLVVSLLSGELLL 212 (823)
Q Consensus 169 ~i~v~~~-~~y~lidl~~~~~~~L~~~~~----~~~p~i~~~~~~EfLL 212 (823)
.|...++ ....++|+.+..+...+.-.. +.-.-++..|+++++.
T Consensus 355 ~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfSpd~~Yva 403 (459)
T KOG0288|consen 355 ELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVA 403 (459)
T ss_pred EEeeecCCCceeeeecccccEEEEeeccccccccccceeEECCCCceee
Confidence 5555555 567888988877766664321 0012244456677766
No 96
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=94.77 E-value=0.54 Score=51.29 Aligned_cols=142 Identities=15% Similarity=0.182 Sum_probs=79.4
Q ss_pred CcEEEEEEeC---------CEEEEEeC---------C-CcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCC
Q 003405 17 PKIDAVASYG---------LKILLGCS---------D-GSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKP 77 (823)
Q Consensus 17 ~~I~ci~~~~---------~~L~vGT~---------~-G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~ 77 (823)
..++|++.+. ..++|||. . |.|++|.+...... ....+++.+.. + +.+
T Consensus 24 E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~-----------~~~l~~i~~~~-~-~g~ 90 (321)
T PF03178_consen 24 EHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPEN-----------NFKLKLIHSTE-V-KGP 90 (321)
T ss_dssp EEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS----------------EEEEEEEEE-E-SS-
T ss_pred ceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEccccc-----------ceEEEEEEEEe-e-cCc
Confidence 4678877764 47999987 2 99999998864110 12233333322 2 679
Q ss_pred eeEEEEecccCceeeEeC-cEEEEeCCCCc---ccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCC-CceeEe
Q 003405 78 ILSMEVLASRQLLLSLSE-SIAFHRLPNLE---TIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGG-RGFVEV 151 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~l~d-~l~~~~L~~l~---~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~-~~f~~~ 151 (823)
|..|..+. +.+++-.+ .|.+|++..-+ +.......--++...+. ...|+|+ ..+.+.+++|+.. +.+..+
T Consensus 91 V~ai~~~~--~~lv~~~g~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~--~~~I~vgD~~~sv~~~~~~~~~~~l~~v 166 (321)
T PF03178_consen 91 VTAICSFN--GRLVVAVGNKLYVYDLDNSKTLLKKAFYDSPFYITSLSVF--KNYILVGDAMKSVSLLRYDEENNKLILV 166 (321)
T ss_dssp EEEEEEET--TEEEEEETTEEEEEEEETTSSEEEEEEE-BSSSEEEEEEE--TTEEEEEESSSSEEEEEEETTTE-EEEE
T ss_pred ceEhhhhC--CEEEEeecCEEEEEEccCcccchhhheecceEEEEEEecc--ccEEEEEEcccCEEEEEEEccCCEEEEE
Confidence 99999994 34555555 58888875433 22211111133333333 3478888 7789999999852 234433
Q ss_pred eeecCCCCceEEEec--CCeEEEEEc
Q 003405 152 KDFGVPDTVKSMSWC--GENICIAIR 175 (823)
Q Consensus 152 kei~~~~~~~~l~~~--~~~i~v~~~ 175 (823)
..-..|-.++++.+. ++.++++.+
T Consensus 167 a~d~~~~~v~~~~~l~d~~~~i~~D~ 192 (321)
T PF03178_consen 167 ARDYQPRWVTAAEFLVDEDTIIVGDK 192 (321)
T ss_dssp EEESS-BEEEEEEEE-SSSEEEEEET
T ss_pred EecCCCccEEEEEEecCCcEEEEEcC
Confidence 222224456666665 235555554
No 97
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=94.76 E-value=2.2 Score=43.43 Aligned_cols=148 Identities=11% Similarity=0.204 Sum_probs=98.5
Q ss_pred eeeecCCCCCCeeEEEEecccCceeeEeC-cEEEEeCCCCcc--ccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEE
Q 003405 67 ERTISGFSKKPILSMEVLASRQLLLSLSE-SIAFHRLPNLET--IAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFR 141 (823)
Q Consensus 67 ~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~l~~~~L~~l~~--~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~ 141 (823)
.++++ +....|+.+.+-|+...|.+-+. .|++|++.+..+ +.+.. ..|+|+++....+..-+.-+ -...+.|+.
T Consensus 33 ~rTiq-h~dsqVNrLeiTpdk~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseDgt~kIWd 111 (311)
T KOG0315|consen 33 SRTIQ-HPDSQVNRLEITPDKKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSEDGTVKIWD 111 (311)
T ss_pred EEEEe-cCccceeeEEEcCCcchhhhccCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCCceEEEEe
Confidence 44544 34678999999999888887776 499999987654 33333 34788887776554334444 334566777
Q ss_pred EcCCCceeEeeeecCCCCceEEEecCC--eEEEEEcC-ceEEEEcCCC-CeeeccCCCCCCCCEEEEccCCeEEEEeCCe
Q 003405 142 HDGGRGFVEVKDFGVPDTVKSMSWCGE--NICIAIRK-GYMILNATNG-ALSEVFPSGRIGPPLVVSLLSGELLLGKENI 217 (823)
Q Consensus 142 ~~~~~~f~~~kei~~~~~~~~l~~~~~--~i~v~~~~-~y~lidl~~~-~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~~ 217 (823)
.+. ..-.|++..+.++.++....+ -+++|..+ ...+-|+.+. ...++.|-....--.+...+++.+|.+.++-
T Consensus 112 lR~---~~~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnk 188 (311)
T KOG0315|consen 112 LRS---LSCQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNK 188 (311)
T ss_pred ccC---cccchhccCCCCcceEEecCCcceEEeecCCCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCC
Confidence 763 334567777889999988855 48888885 4788899876 3455666433211223345778888876654
Q ss_pred E
Q 003405 218 G 218 (823)
Q Consensus 218 g 218 (823)
|
T Consensus 189 G 189 (311)
T KOG0315|consen 189 G 189 (311)
T ss_pred c
Confidence 4
No 98
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=94.73 E-value=0.77 Score=47.92 Aligned_cols=180 Identities=12% Similarity=0.199 Sum_probs=93.7
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCccccccccccee-eeeec-CCCCCCeeEEEEecccCceee
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYEL-ERTIS-GFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l-~~~~~-~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
..+.|+..| |++|.+||..-++..|+++- +++ ..... .-++..|+|+..-+..++-++
T Consensus 217 ~~vrsiSfHPsGefllvgTdHp~~rlYdv~T------------------~QcfvsanPd~qht~ai~~V~Ys~t~~lYvT 278 (430)
T KOG0640|consen 217 EPVRSISFHPSGEFLLVGTDHPTLRLYDVNT------------------YQCFVSANPDDQHTGAITQVRYSSTGSLYVT 278 (430)
T ss_pred ceeeeEeecCCCceEEEecCCCceeEEeccc------------------eeEeeecCcccccccceeEEEecCCccEEEE
Confidence 356777776 68999999999999999773 221 11000 112568999999999888888
Q ss_pred Ee-Cc-EEEEeCCCCcccccccCCCCcEEEEe--eCCCceEE--EEEcCeEEEEEEcCCCceeEeeeecCCCCc-----e
Q 003405 93 LS-ES-IAFHRLPNLETIAVLTKAKGANVYSW--DDRRGFLC--FARQKRVCIFRHDGGRGFVEVKDFGVPDTV-----K 161 (823)
Q Consensus 93 l~-d~-l~~~~L~~l~~~~~i~~~kg~~~fa~--~~~~~~l~--V~~kkki~l~~~~~~~~f~~~kei~~~~~~-----~ 161 (823)
-+ || |++|+--+-+=+.++....|-..+|- -...++.+ -|...-+.++++..++ .++++.-.+.- +
T Consensus 279 aSkDG~IklwDGVS~rCv~t~~~AH~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R---~l~~YtGAg~tgrq~~r 355 (430)
T KOG0640|consen 279 ASKDGAIKLWDGVSNRCVRTIGNAHGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTGR---MLKEYTGAGTTGRQKHR 355 (430)
T ss_pred eccCCcEEeeccccHHHHHHHHhhcCCceeeeEEEccCCeEEeecCCcceeeeeeecCCc---eEEEEecCCcccchhhh
Confidence 77 46 99998644333333433333333322 11122222 2333445566665443 12222111000 0
Q ss_pred EE-Eec--CCeEEEEEc--CceEEEEcCCCCeeeccCCCCCCCC-EEEEccCC-eEEEEeCCe
Q 003405 162 SM-SWC--GENICIAIR--KGYMILNATNGALSEVFPSGRIGPP-LVVSLLSG-ELLLGKENI 217 (823)
Q Consensus 162 ~l-~~~--~~~i~v~~~--~~y~lidl~~~~~~~L~~~~~~~~p-~i~~~~~~-EfLL~~~~~ 217 (823)
+- .|. .+.+.+-.. .+.+-.|-.++....+.+.|-.+.+ .|+..+.+ -|+-|.++.
T Consensus 356 tqAvFNhtEdyVl~pDEas~slcsWdaRtadr~~l~slgHn~a~R~i~HSP~~p~FmTcsdD~ 418 (430)
T KOG0640|consen 356 TQAVFNHTEDYVLFPDEASNSLCSWDARTADRVALLSLGHNGAVRWIVHSPVEPAFMTCSDDF 418 (430)
T ss_pred hhhhhcCccceEEccccccCceeeccccchhhhhhcccCCCCCceEEEeCCCCCceeeecccc
Confidence 00 111 122222222 2344455566666777777654433 34444444 455576654
No 99
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=94.71 E-value=4.8 Score=44.97 Aligned_cols=179 Identities=17% Similarity=0.234 Sum_probs=102.3
Q ss_pred CcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 17 PKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 17 ~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
..|.|++--+ +.++-|.++|.|++|+-. +....++.+ .+...|-.+..+.+ +.|++ ++
T Consensus 247 k~Vl~v~F~engdviTgDS~G~i~Iw~~~------------------~~~~~k~~~-aH~ggv~~L~~lr~-GtllS-Gg 305 (626)
T KOG2106|consen 247 KFVLCVTFLENGDVITGDSGGNILIWSKG------------------TNRISKQVH-AHDGGVFSLCMLRD-GTLLS-GG 305 (626)
T ss_pred eEEEEEEEcCCCCEEeecCCceEEEEeCC------------------CceEEeEee-ecCCceEEEEEecC-ccEee-cC
Confidence 4688998876 589999999999999742 334455555 45778999998876 56666 55
Q ss_pred c---EEEEeCCCCcccc--cccCCCC-cEEEEeeCCCceEEEE-------------------------------------
Q 003405 96 S---IAFHRLPNLETIA--VLTKAKG-ANVYSWDDRRGFLCFA------------------------------------- 132 (823)
Q Consensus 96 ~---l~~~~L~~l~~~~--~i~~~kg-~~~fa~~~~~~~l~V~------------------------------------- 132 (823)
. |..|+ .+++... .+++.+| +..++- +.+-|.|+
T Consensus 306 KDRki~~Wd-~~y~k~r~~elPe~~G~iRtv~e--~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hps~~q 382 (626)
T KOG2106|consen 306 KDRKIILWD-DNYRKLRETELPEQFGPIRTVAE--GKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHPSKNQ 382 (626)
T ss_pred ccceEEecc-ccccccccccCchhcCCeeEEec--CCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCCChhh
Confidence 2 88887 4443221 1222222 111111 11112222
Q ss_pred -----EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec-CCeEEEEEcC-ceEEEEcCCCCeeeccCCCCCCCCEE--E
Q 003405 133 -----RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC-GENICIAIRK-GYMILNATNGALSEVFPSGRIGPPLV--V 203 (823)
Q Consensus 133 -----~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~-~~~i~v~~~~-~y~lidl~~~~~~~L~~~~~~~~p~i--~ 203 (823)
..|.+.|+. +. +......+.|++.+..|. .+.|.+|+.. ...++|+++.....+-.- ..|+- .
T Consensus 383 ~~T~gqdk~v~lW~--~~---k~~wt~~~~d~~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~~~d---~~~ls~v~ 454 (626)
T KOG2106|consen 383 LLTCGQDKHVRLWN--DH---KLEWTKIIEDPAECADFHPSGVVAVGTATGRWFVLDTETQDLVTIHTD---NEQLSVVR 454 (626)
T ss_pred eeeccCcceEEEcc--CC---ceeEEEEecCceeEeeccCcceEEEeeccceEEEEecccceeEEEEec---CCceEEEE
Confidence 222222222 11 111123456788888887 2388999885 578899988655554433 23543 3
Q ss_pred EccCCeEE-EEe-CCeEE--EEcCCCcc
Q 003405 204 SLLSGELL-LGK-ENIGV--FVDQNGKL 227 (823)
Q Consensus 204 ~~~~~EfL-L~~-~~~gv--fv~~~G~~ 227 (823)
..+++.|| ++. |+..+ -|+.+|+.
T Consensus 455 ysp~G~~lAvgs~d~~iyiy~Vs~~g~~ 482 (626)
T KOG2106|consen 455 YSPDGAFLAVGSHDNHIYIYRVSANGRK 482 (626)
T ss_pred EcCCCCEEEEecCCCeEEEEEECCCCcE
Confidence 34677665 454 44433 35777754
No 100
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=94.69 E-value=0.41 Score=50.60 Aligned_cols=139 Identities=11% Similarity=0.227 Sum_probs=82.7
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecc--cC-ceeeEeCc-EEEEe
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLAS--RQ-LLLSLSES-IAFHR 101 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~--~~-~Ll~l~d~-l~~~~ 101 (823)
+..+.+|.++|.+.+|+.... .....|++. +..++++.++.. -. +..+.+|| |++|+
T Consensus 40 e~~vav~lSngsv~lyd~~tg------------------~~l~~fk~~-~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD 100 (376)
T KOG1188|consen 40 ETAVAVSLSNGSVRLYDKGTG------------------QLLEEFKGP-PATTNGVRFISCDSPHGVISCSSDGTVRLWD 100 (376)
T ss_pred ceeEEEEecCCeEEEEeccch------------------hhhheecCC-CCcccceEEecCCCCCeeEEeccCCeEEEEE
Confidence 357999999999999984421 123455543 567888888873 23 34444576 99999
Q ss_pred CCCCcccccccCC--CCcEEEEeeC--CCceEEEEEcC-----eEEEEEEcCCCceeEeee--ecCCCCceEEEec---C
Q 003405 102 LPNLETIAVLTKA--KGANVYSWDD--RRGFLCFARQK-----RVCIFRHDGGRGFVEVKD--FGVPDTVKSMSWC---G 167 (823)
Q Consensus 102 L~~l~~~~~i~~~--kg~~~fa~~~--~~~~l~V~~kk-----ki~l~~~~~~~~f~~~ke--i~~~~~~~~l~~~---~ 167 (823)
+......+.+.-. .|..+.+++. +.+.+|.+.-. .+.+|.|+...+ .++- =+-.|.|+++.|. .
T Consensus 101 ~Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq--~l~~~~eSH~DDVT~lrFHP~~p 178 (376)
T KOG1188|consen 101 IRSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQ--LLRQLNESHNDDVTQLRFHPSDP 178 (376)
T ss_pred eecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEeccccc--hhhhhhhhccCcceeEEecCCCC
Confidence 9876655443211 2333444443 44567777332 244555553222 1221 1235789999998 3
Q ss_pred CeEEEEEcCce-EEEEcCC
Q 003405 168 ENICIAIRKGY-MILNATN 185 (823)
Q Consensus 168 ~~i~v~~~~~y-~lidl~~ 185 (823)
+.+.=|+..++ .++|++.
T Consensus 179 nlLlSGSvDGLvnlfD~~~ 197 (376)
T KOG1188|consen 179 NLLLSGSVDGLVNLFDTKK 197 (376)
T ss_pred CeEEeecccceEEeeecCC
Confidence 45666666665 6778764
No 101
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=94.56 E-value=1.1 Score=50.72 Aligned_cols=188 Identities=12% Similarity=0.124 Sum_probs=120.6
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
...|.|++..+..+-.|+.+|.|+++++..... ..+. ...++..|-.++.-+..+.+.+=+.
T Consensus 260 ~~rvg~laW~~~~lssGsr~~~I~~~dvR~~~~-----------------~~~~-~~~H~qeVCgLkws~d~~~lASGgn 321 (484)
T KOG0305|consen 260 ASRVGSLAWNSSVLSSGSRDGKILNHDVRISQH-----------------VVST-LQGHRQEVCGLKWSPDGNQLASGGN 321 (484)
T ss_pred CceeEEEeccCceEEEecCCCcEEEEEEecchh-----------------hhhh-hhcccceeeeeEECCCCCeeccCCC
Confidence 357899998889999999999999999775322 1111 2234678999999988887776554
Q ss_pred -c-EEEEeCCCCcccccccCCC-CcEEEEeeCCC-ceEEEEEc---CeEEEEEEcCCCceeEeeeecCCCCceEEEecCC
Q 003405 96 -S-IAFHRLPNLETIAVLTKAK-GANVYSWDDRR-GFLCFARQ---KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE 168 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~-~~l~V~~k---kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~ 168 (823)
+ +.+|+....+++.++..-+ .|.+++.++-. +.||+|.+ +.|.++-...+ ..++.+.....|.+|.|...
T Consensus 322 DN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~g---~~i~~vdtgsQVcsL~Wsk~ 398 (484)
T KOG0305|consen 322 DNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGGGSADRCIKFWNTNTG---ARIDSVDTGSQVCSLIWSKK 398 (484)
T ss_pred ccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcCCCcccEEEEEEcCCC---cEecccccCCceeeEEEcCC
Confidence 3 9999996666665554444 56788888754 67888743 34444444422 35667778889999999965
Q ss_pred e--EEEEEc---CceEEEEcCCCCeeeccCCCCCCCCEE-EEccCCeEEE--EeCCeEEEEcCCC
Q 003405 169 N--ICIAIR---KGYMILNATNGALSEVFPSGRIGPPLV-VSLLSGELLL--GKENIGVFVDQNG 225 (823)
Q Consensus 169 ~--i~v~~~---~~y~lidl~~~~~~~L~~~~~~~~p~i-~~~~~~EfLL--~~~~~gvfv~~~G 225 (823)
. ||.+.. +...+++..+-.....+... ..+-+- ...++++.++ +.|+..=|.+..+
T Consensus 399 ~kEi~sthG~s~n~i~lw~~ps~~~~~~l~gH-~~RVl~la~SPdg~~i~t~a~DETlrfw~~f~ 462 (484)
T KOG0305|consen 399 YKELLSTHGYSENQITLWKYPSMKLVAELLGH-TSRVLYLALSPDGETIVTGAADETLRFWNLFD 462 (484)
T ss_pred CCEEEEecCCCCCcEEEEeccccceeeeecCC-cceeEEEEECCCCCEEEEecccCcEEeccccC
Confidence 4 776654 34567776653222222211 122222 2336677766 3556666766655
No 102
>PF04053 Coatomer_WDAD: Coatomer WD associated region; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits.
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=94.53 E-value=1.6 Score=49.77 Aligned_cols=125 Identities=18% Similarity=0.290 Sum_probs=72.4
Q ss_pred cCceeeEeCc-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCC-----------CceeEeeee
Q 003405 87 RQLLLSLSES-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGG-----------RGFVEVKDF 154 (823)
Q Consensus 87 ~~~Ll~l~d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~-----------~~f~~~kei 154 (823)
+.+|++-+++ |.+|++.+-+.+.++.- .+|..+.++++...++++.+..+.|++++.+ ..|..+.|+
T Consensus 117 G~LL~~~~~~~i~~yDw~~~~~i~~i~v-~~vk~V~Ws~~g~~val~t~~~i~il~~~~~~~~~~~~~g~e~~f~~~~E~ 195 (443)
T PF04053_consen 117 GNLLGVKSSDFICFYDWETGKLIRRIDV-SAVKYVIWSDDGELVALVTKDSIYILKYNLEAVAAIPEEGVEDAFELIHEI 195 (443)
T ss_dssp SSSEEEEETTEEEEE-TTT--EEEEESS--E-EEEEE-TTSSEEEEE-S-SEEEEEE-HHHHHHBTTTB-GGGEEEEEEE
T ss_pred CcEEEEECCCCEEEEEhhHcceeeEEec-CCCcEEEEECCCCEEEEEeCCeEEEEEecchhcccccccCchhceEEEEEe
Confidence 5566666666 99999988877776643 4578888998878899999999999988632 136666665
Q ss_pred cCCCCceEEEecCCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEe-C--CeEEEEcCCCcc
Q 003405 155 GVPDTVKSMSWCGENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGK-E--NIGVFVDQNGKL 227 (823)
Q Consensus 155 ~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~-~--~~gvfv~~~G~~ 227 (823)
.+.|++.+|.++.+++.+.+. .-+ +.+|.+ ..+..++..-+|+++ + +..+++|.++.+
T Consensus 196 --~~~IkSg~W~~d~fiYtT~~~-lkY-l~~Ge~-----------~~i~~ld~~~yllgy~~~~~~ly~~Dr~~~v 256 (443)
T PF04053_consen 196 --SERIKSGCWVEDCFIYTTSNH-LKY-LVNGET-----------GIIAHLDKPLYLLGYLPKENRLYLIDRDGNV 256 (443)
T ss_dssp ---S--SEEEEETTEEEEE-TTE-EEE-EETTEE-----------EEEEE-SS--EEEEEETTTTEEEEE-TT--E
T ss_pred --cceeEEEEEEcCEEEEEcCCe-EEE-EEcCCc-----------ceEEEcCCceEEEEEEccCCEEEEEECCCCE
Confidence 568999999999777777652 222 334433 334445555667753 3 567778877764
No 103
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=94.53 E-value=4.6 Score=43.48 Aligned_cols=139 Identities=15% Similarity=0.148 Sum_probs=71.1
Q ss_pred EEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeec-CCCCceEE---EecCCeEEEEEc-----CceEEEEcCCCCeee
Q 003405 120 YSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSM---SWCGENICIAIR-----KGYMILNATNGALSE 190 (823)
Q Consensus 120 fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l---~~~~~~i~v~~~-----~~y~lidl~~~~~~~ 190 (823)
.+|.-++.+++|.....|.||.++. .+.+..|. .|..++++ +...+.-.+++. .+..++|+.+-+...
T Consensus 91 L~VrmNr~RLvV~Lee~IyIydI~~---MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~ 167 (391)
T KOG2110|consen 91 LAVRMNRKRLVVCLEESIYIYDIKD---MKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVN 167 (391)
T ss_pred EEEEEccceEEEEEcccEEEEeccc---ceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEcccceeee
Confidence 3444455688998888898888883 34444442 34555543 333332233332 356777777655444
Q ss_pred ccCCCCCCCCEEEEccCCeEEEE-eCC---eEEEEcCCCccc---cCCceeecCCCcEEEE--eCCEEEEEeC-CeEEEE
Q 003405 191 VFPSGRIGPPLVVSLLSGELLLG-KEN---IGVFVDQNGKLL---QADRICWSEAPIAVII--QKPYAIALLP-RRVEVR 260 (823)
Q Consensus 191 L~~~~~~~~p~i~~~~~~EfLL~-~~~---~gvfv~~~G~~~---~~~~i~w~~~P~~v~~--~~PYll~~~~-~~ieV~ 260 (823)
.+.-.+..-.++..-++|..|-. .+. .-||--.+|... ||+.. ....-+++| ..+||.+.++ ..|.|+
T Consensus 168 ~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~--~~~IySL~Fs~ds~~L~~sS~TeTVHiF 245 (391)
T KOG2110|consen 168 TINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTY--PVSIYSLSFSPDSQFLAASSNTETVHIF 245 (391)
T ss_pred EEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCce--eeEEEEEEECCCCCeEEEecCCCeEEEE
Confidence 44332222122333355555553 332 235555777654 34433 111122333 3567777665 466666
Q ss_pred Ecc
Q 003405 261 SLR 263 (823)
Q Consensus 261 ~l~ 263 (823)
.+.
T Consensus 246 KL~ 248 (391)
T KOG2110|consen 246 KLE 248 (391)
T ss_pred Eec
Confidence 663
No 104
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.50 E-value=1.8 Score=50.38 Aligned_cols=280 Identities=15% Similarity=0.173 Sum_probs=153.7
Q ss_pred CcEEEEE--EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVA--SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~--~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|+|.+ ..++.|+.....+.+.+|.++.. ++++..++.+.+||--+.+.|.. -+++-+
T Consensus 63 d~ita~~l~~d~~~L~~a~rs~llrv~~L~tg------------------k~irswKa~He~Pvi~ma~~~~g-~LlAtg 123 (775)
T KOG0319|consen 63 DEITALALTPDEEVLVTASRSQLLRVWSLPTG------------------KLIRSWKAIHEAPVITMAFDPTG-TLLATG 123 (775)
T ss_pred hhhheeeecCCccEEEEeeccceEEEEEcccc------------------hHhHhHhhccCCCeEEEEEcCCC-ceEEec
Confidence 3566644 34568899999999999987743 34566677778999999999987 444444
Q ss_pred C--c-EEEEeCCCCcccccccCCCCcE-EEEeeCCCc--eEEEE-EcCeEEEEEEcCCCc-eeEeeeecCCCCceEEEec
Q 003405 95 E--S-IAFHRLPNLETIAVLTKAKGAN-VYSWDDRRG--FLCFA-RQKRVCIFRHDGGRG-FVEVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 95 d--~-l~~~~L~~l~~~~~i~~~kg~~-~fa~~~~~~--~l~V~-~kkki~l~~~~~~~~-f~~~kei~~~~~~~~l~~~ 166 (823)
+ + +.+|++..-.-.+.+...+|+. +.+.+++.- .|+.+ ....+.+|.+...+. ...++. --+.++++++.
T Consensus 124 gaD~~v~VWdi~~~~~th~fkG~gGvVssl~F~~~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~--H~S~vtsL~~~ 201 (775)
T KOG0319|consen 124 GADGRVKVWDIKNGYCTHSFKGHGGVVSSLLFHPHWNRWLLASGATDGTVRVWNLNDKRTCLHTMIL--HKSAVTSLAFS 201 (775)
T ss_pred cccceEEEEEeeCCEEEEEecCCCceEEEEEeCCccchhheeecCCCceEEEEEcccCchHHHHHHh--hhhheeeeeec
Confidence 3 4 9999986543344444445543 455555443 24454 444555665553321 111111 12478888886
Q ss_pred C--CeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccC-----CeEEEEeCCeEEE--EcCCCccc----cCCc
Q 003405 167 G--ENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLS-----GELLLGKENIGVF--VDQNGKLL----QADR 232 (823)
Q Consensus 167 ~--~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~-----~EfLL~~~~~gvf--v~~~G~~~----~~~~ 232 (823)
. +.++-+.+ +-..+.|+.+-+.....|.-.+... ++...+ +++++..++.|++ .+.+|... ++++
T Consensus 202 ~d~~~~ls~~RDkvi~vwd~~~~~~l~~lp~ye~~E~-vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~~~~~~~~~~ 280 (775)
T KOG0319|consen 202 EDSLELLSVGRDKVIIVWDLVQYKKLKTLPLYESLES-VVRLREELGGKGEYIITAGGSGVVQYWDSESGKCVYKQRQSD 280 (775)
T ss_pred cCCceEEEeccCcEEEEeehhhhhhhheechhhheee-EEEechhcCCcceEEEEecCCceEEEEecccchhhhhhccCC
Confidence 3 34444444 4556678765544444443222111 223333 5788877766654 45444211 2233
Q ss_pred eeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEee-----CCcccccccCCeEEEeccceEEEeeccC-hhHH
Q 003405 233 ICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVL-----QNVRHLIPSSNAVVVALENSIFGLFPVP-LGAQ 306 (823)
Q Consensus 233 i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l-----~~~~~l~~~~~~v~v~s~~~I~~l~~~~-~~~q 306 (823)
.+==..-..+......+++..+..+.+++.. +..++..|.- -+.+++.+.++.+.|||++.=.++...| +.-|
T Consensus 281 ~~e~~~~~~~~~~~~~l~vtaeQnl~l~d~~-~l~i~k~ivG~ndEI~Dm~~lG~e~~~laVATNs~~lr~y~~~~~~c~ 359 (775)
T KOG0319|consen 281 SEEIDHLLAIESMSQLLLVTAEQNLFLYDED-ELTIVKQIVGYNDEILDMKFLGPEESHLAVATNSPELRLYTLPTSYCQ 359 (775)
T ss_pred chhhhcceeccccCceEEEEccceEEEEEcc-ccEEehhhcCCchhheeeeecCCccceEEEEeCCCceEEEecCCCceE
Confidence 1110011122233456666667777777764 3444444321 2235667777888899887655555333 3333
Q ss_pred HHHHHhcCCHHHHHHHh
Q 003405 307 IVQLTASGDFEEALALC 323 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~ 323 (823)
+-.|.=|.-++|.
T Consensus 360 ----ii~GH~e~vlSL~ 372 (775)
T KOG0319|consen 360 ----IIPGHTEAVLSLD 372 (775)
T ss_pred ----EEeCchhheeeee
Confidence 3445555555554
No 105
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=94.36 E-value=1.9 Score=48.28 Aligned_cols=253 Identities=16% Similarity=0.173 Sum_probs=129.9
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLPN 104 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~ 104 (823)
+.+.|.++||.+.+.+-.+. ..+.+.++ ..+|..=.--+++--|++.++ | |++|.=..
T Consensus 76 d~~~i~s~DGkf~il~k~~r-------------------VE~sv~AH-~~A~~~gRW~~dGtgLlt~GEDG~iKiWSrsG 135 (737)
T KOG1524|consen 76 DTLLICSNDGRFVILNKSAR-------------------VERSISAH-AAAISSGRWSPDGAGLLTAGEDGVIKIWSRSG 135 (737)
T ss_pred ceEEEEcCCceEEEecccch-------------------hhhhhhhh-hhhhhhcccCCCCceeeeecCCceEEEEeccc
Confidence 57899999999888752221 11222221 223333333345555777776 6 89997433
Q ss_pred CcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEEEcCceEEEEcC
Q 003405 105 LETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIAIRKGYMILNAT 184 (823)
Q Consensus 105 l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~ 184 (823)
.=.-.-......+.+.+.+++...++....+.+.|=.+.......+.| .-.+.+.++.|...+=.+++..+=+-+-+.
T Consensus 136 MLRStl~Q~~~~v~c~~W~p~S~~vl~c~g~h~~IKpL~~n~k~i~Wk--AHDGiiL~~~W~~~s~lI~sgGED~kfKvW 213 (737)
T KOG1524|consen 136 MLRSTVVQNEESIRCARWAPNSNSIVFCQGGHISIKPLAANSKIIRWR--AHDGLVLSLSWSTQSNIIASGGEDFRFKIW 213 (737)
T ss_pred hHHHHHhhcCceeEEEEECCCCCceEEecCCeEEEeecccccceeEEe--ccCcEEEEeecCccccceeecCCceeEEee
Confidence 211111223345667777777655444444444333332211111111 124567788898544334444443334444
Q ss_pred CCCeeeccCCCCCCCCE--EEEccCCeEEEEeCCeEEEEcC-CCccccCCceeecCCCcEEEE--eCCEEEEEe------
Q 003405 185 NGALSEVFPSGRIGPPL--VVSLLSGELLLGKENIGVFVDQ-NGKLLQADRICWSEAPIAVII--QKPYAIALL------ 253 (823)
Q Consensus 185 ~~~~~~L~~~~~~~~p~--i~~~~~~EfLL~~~~~gvfv~~-~G~~~~~~~i~w~~~P~~v~~--~~PYll~~~------ 253 (823)
++.-..||.......|+ +.+-++..|+|+.-+..-|-.. .|. .-.+.|+..-..+++ ....++..+
T Consensus 214 D~~G~~Lf~S~~~ey~ITSva~npd~~~~v~S~nt~R~~~p~~GS---ifnlsWS~DGTQ~a~gt~~G~v~~A~~ieq~l 290 (737)
T KOG1524|consen 214 DAQGANLFTSAAEEYAITSVAFNPEKDYLLWSYNTARFSSPRVGS---IFNLSWSADGTQATCGTSTGQLIVAYAIEQQL 290 (737)
T ss_pred cccCcccccCChhccceeeeeeccccceeeeeeeeeeecCCCccc---eEEEEEcCCCceeeccccCceEEEeeeehhhh
Confidence 55566777765443454 4445677788876555555332 233 245778655443332 222332222
Q ss_pred ----------C-CeEEEEEccCCCceeEEEeeCCccccc-ccCCeEEEeccceEEEeeccChhHH
Q 003405 254 ----------P-RRVEVRSLRVPYALIQTIVLQNVRHLI-PSSNAVVVALENSIFGLFPVPLGAQ 306 (823)
Q Consensus 254 ----------~-~~ieV~~l~~~~~lvQ~i~l~~~~~l~-~~~~~v~v~s~~~I~~l~~~~~~~q 306 (823)
+ ..|+++++.. .....+.+|+...-. -.-..+++++...||.+..+.|..+
T Consensus 291 ~~~n~~~t~~~r~~I~vrdV~~--~v~d~LE~p~rv~k~sL~Y~hLvvaTs~qvyiys~knwntp 353 (737)
T KOG1524|consen 291 VSGNLKATSKSRKSITVRDVAT--GVQDILEFPQRVVKFSLGYGHLVVATSLQVYIYSEKNWNTP 353 (737)
T ss_pred hhccceeEeeccceEEeehhhh--hHHHHhhCccceeeeeeceeEEEEEeccEEEEEecCCccCc
Confidence 1 2466666642 122233344321111 1124577899999999999988876
No 106
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=94.34 E-value=5.4 Score=44.91 Aligned_cols=132 Identities=24% Similarity=0.337 Sum_probs=73.0
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec-----ccCceeeEeC-cEEEE
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA-----SRQLLLSLSE-SIAFH 100 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~-----~~~~Ll~l~d-~l~~~ 100 (823)
++|+||+-+|.|.+|.-...... ....-+..+ + +.||-||..=+ +.+.|.||.- .+.+|
T Consensus 38 d~IivGS~~G~LrIy~P~~~~~~-----------~~~lllE~~---l-~~PILqv~~G~F~s~~~~~~LaVLhP~kl~vY 102 (418)
T PF14727_consen 38 DKIIVGSYSGILRIYDPSGNEFQ-----------PEDLLLETQ---L-KDPILQVECGKFVSGSEDLQLAVLHPRKLSVY 102 (418)
T ss_pred cEEEEeccccEEEEEccCCCCCC-----------CccEEEEEe---c-CCcEEEEEeccccCCCCcceEEEecCCEEEEE
Confidence 59999999999999986432211 011222222 2 57999988764 3356777776 47777
Q ss_pred eCCC------------CcccccccCCCCcEEEEeeC---C--CceEEEE-EcCeEEEEEEcCCCcee-EeeeecCCCCce
Q 003405 101 RLPN------------LETIAVLTKAKGANVYSWDD---R--RGFLCFA-RQKRVCIFRHDGGRGFV-EVKDFGVPDTVK 161 (823)
Q Consensus 101 ~L~~------------l~~~~~i~~~kg~~~fa~~~---~--~~~l~V~-~kkki~l~~~~~~~~f~-~~kei~~~~~~~ 161 (823)
.+.. ++.+....-.+.+-.||+++ . +..|||= ...++.+|+-+.- .|. .+-.+.+|+|+.
T Consensus 103 ~v~~~~g~~~~g~~~~L~~~yeh~l~~~a~nm~~G~Fgg~~~~~~IcVQS~DG~L~~feqe~~-~f~~~lp~~llPgPl~ 181 (418)
T PF14727_consen 103 SVSLVDGTVEHGNQYQLELIYEHSLQRTAYNMCCGPFGGVKGRDFICVQSMDGSLSFFEQESF-AFSRFLPDFLLPGPLC 181 (418)
T ss_pred EEEecCCCcccCcEEEEEEEEEEecccceeEEEEEECCCCCCceEEEEEecCceEEEEeCCcE-EEEEEcCCCCCCcCeE
Confidence 7621 11222212222333344432 1 2468885 8889999988732 232 223466776654
Q ss_pred EEEecCCeEEEEEc
Q 003405 162 SMSWCGENICIAIR 175 (823)
Q Consensus 162 ~l~~~~~~i~v~~~ 175 (823)
-+.- -|.+++++.
T Consensus 182 Y~~~-tDsfvt~ss 194 (418)
T PF14727_consen 182 YCPR-TDSFVTASS 194 (418)
T ss_pred Eeec-CCEEEEecC
Confidence 4332 344444443
No 107
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=94.30 E-value=0.57 Score=48.90 Aligned_cols=178 Identities=17% Similarity=0.254 Sum_probs=105.2
Q ss_pred CcEEEEEEeCC--EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYGL--KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~~--~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..+-|++.-++ -|.-|..||.|.+|.+... .+.+.|...+.+.|+.+....+..-+++-+
T Consensus 264 ~aVlci~FSRDsEMlAsGsqDGkIKvWri~tG------------------~ClRrFdrAHtkGvt~l~FSrD~SqiLS~s 325 (508)
T KOG0275|consen 264 DAVLCISFSRDSEMLASGSQDGKIKVWRIETG------------------QCLRRFDRAHTKGVTCLSFSRDNSQILSAS 325 (508)
T ss_pred cceEEEeecccHHHhhccCcCCcEEEEEEecc------------------hHHHHhhhhhccCeeEEEEccCcchhhccc
Confidence 36888887764 6899999999999987632 233444433467899999888877777666
Q ss_pred C-c-EEEEeCCCCcccccccCCCCcEEEE----eeCCCceE-EEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec-
Q 003405 95 E-S-IAFHRLPNLETIAVLTKAKGANVYS----WDDRRGFL-CFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC- 166 (823)
Q Consensus 95 d-~-l~~~~L~~l~~~~~i~~~kg~~~fa----~~~~~~~l-~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~- 166 (823)
- . +.+|.|.+-+- +...+|-++|. ..++...| .......+.++..+.+......|.....-++.++...
T Consensus 326 fD~tvRiHGlKSGK~---LKEfrGHsSyvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~P 402 (508)
T KOG0275|consen 326 FDQTVRIHGLKSGKC---LKEFRGHSSYVNEATFTDDGHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLP 402 (508)
T ss_pred ccceEEEeccccchh---HHHhcCccccccceEEcCCCCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcC
Confidence 3 3 89998865432 23445544442 22333333 4446777877776543222223322222344555544
Q ss_pred --CCeEEEEEc-CceEEEEcCCCCeeeccCCCCC-CCCEE--EEccCCeEEEEeCC
Q 003405 167 --GENICIAIR-KGYMILNATNGALSEVFPSGRI-GPPLV--VSLLSGELLLGKEN 216 (823)
Q Consensus 167 --~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~-~~p~i--~~~~~~EfLL~~~~ 216 (823)
...++|+++ +..+++|++ |++..-|..|+. +-..| +..+.+|++-|-++
T Consensus 403 Knpeh~iVCNrsntv~imn~q-GQvVrsfsSGkREgGdFi~~~lSpkGewiYcigE 457 (508)
T KOG0275|consen 403 KNPEHFIVCNRSNTVYIMNMQ-GQVVRSFSSGKREGGDFINAILSPKGEWIYCIGE 457 (508)
T ss_pred CCCceEEEEcCCCeEEEEecc-ceEEeeeccCCccCCceEEEEecCCCcEEEEEcc
Confidence 234666666 456777775 677677776642 22222 23366777766543
No 108
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=94.21 E-value=4.8 Score=43.24 Aligned_cols=150 Identities=16% Similarity=0.203 Sum_probs=94.0
Q ss_pred cEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
.|+=+..|. ..|+.|++||.+..|.+.... ..+.+.+ +..+++.=.++|...++++..+
T Consensus 150 dieWl~WHp~a~illAG~~DGsvWmw~ip~~~------------------~~kv~~G-h~~~ct~G~f~pdGKr~~tgy~ 210 (399)
T KOG0296|consen 150 DIEWLKWHPRAHILLAGSTDGSVWMWQIPSQA------------------LCKVMSG-HNSPCTCGEFIPDGKRILTGYD 210 (399)
T ss_pred ceEEEEecccccEEEeecCCCcEEEEECCCcc------------------eeeEecC-CCCCcccccccCCCceEEEEec
Confidence 445455554 589999999999999987532 1233334 3578888888999888888876
Q ss_pred -c-EEEEeCCCCccccccc---------------------CCCCcEE-----------EEeeCCC---------------
Q 003405 96 -S-IAFHRLPNLETIAVLT---------------------KAKGANV-----------YSWDDRR--------------- 126 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~---------------------~~kg~~~-----------fa~~~~~--------------- 126 (823)
+ |.+|++.+-+|...+. ..+++.+ +|.+...
T Consensus 211 dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve 290 (399)
T KOG0296|consen 211 DGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVE 290 (399)
T ss_pred CceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhh
Confidence 5 9999986544433222 1111111 1112100
Q ss_pred --------ceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC-CeEEEEEcCc-eEEEEcCCCCee
Q 003405 127 --------GFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG-ENICIAIRKG-YMILNATNGALS 189 (823)
Q Consensus 127 --------~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~-~~i~v~~~~~-y~lidl~~~~~~ 189 (823)
+..++| ..++|.||..... .++. -...++.|+.+.|.+ ..|+-++.++ ....|..+|+..
T Consensus 291 ~~~~ss~lpL~A~G~vdG~i~iyD~a~~-~~R~--~c~he~~V~~l~w~~t~~l~t~c~~g~v~~wDaRtG~l~ 361 (399)
T KOG0296|consen 291 SIPSSSKLPLAACGSVDGTIAIYDLAAS-TLRH--ICEHEDGVTKLKWLNTDYLLTACANGKVRQWDARTGQLK 361 (399)
T ss_pred hcccccccchhhcccccceEEEEecccc-hhhe--eccCCCceEEEEEcCcchheeeccCceEEeeeccccceE
Confidence 012344 7788888887632 2332 234567899999998 5677777754 678888888653
No 109
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=94.14 E-value=1.4 Score=48.60 Aligned_cols=146 Identities=18% Similarity=0.269 Sum_probs=93.8
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec--------c
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA--------S 86 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~--------~ 86 (823)
++|+|+.-. +..|.-.+.||++.+|+....... ..++++ .+.|.-|.-.| .
T Consensus 360 g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~------------------~~l~~H-skei~t~~wsp~g~v~~n~~ 420 (524)
T KOG0273|consen 360 GEVNALKWNPTGSLLASCSDDGTLKIWSMGQSNSV------------------HDLQAH-SKEIYTIKWSPTGPVTSNPN 420 (524)
T ss_pred CceEEEEECCCCceEEEecCCCeeEeeecCCCcch------------------hhhhhh-ccceeeEeecCCCCccCCCc
Confidence 578898877 567788889999999985543221 111222 23344444433 2
Q ss_pred cCc-ee-eEeCc-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCce
Q 003405 87 RQL-LL-SLSES-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVK 161 (823)
Q Consensus 87 ~~~-Ll-~l~d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~ 161 (823)
.|. ++ +..|+ |++|+.....+++.+. ..-+|.+++..++...++-| ..+.+.|+...-+ +.+|+..-.+.|-
T Consensus 421 ~~~~l~sas~dstV~lwdv~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~---~l~~s~~~~~~If 497 (524)
T KOG0273|consen 421 MNLMLASASFDSTVKLWDVESGVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTG---KLVKSYQGTGGIF 497 (524)
T ss_pred CCceEEEeecCCeEEEEEccCCceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEeccccch---heeEeecCCCeEE
Confidence 333 33 33465 9999998887777652 33467788888887788888 4455666655533 3345555556788
Q ss_pred EEEec--CCeEEEEEc-CceEEEEcC
Q 003405 162 SMSWC--GENICIAIR-KGYMILNAT 184 (823)
Q Consensus 162 ~l~~~--~~~i~v~~~-~~y~lidl~ 184 (823)
.++|. |+.|+++.+ ...+++|+.
T Consensus 498 el~Wn~~G~kl~~~~sd~~vcvldlr 523 (524)
T KOG0273|consen 498 ELCWNAAGDKLGACASDGSVCVLDLR 523 (524)
T ss_pred EEEEcCCCCEEEEEecCCCceEEEec
Confidence 89997 788888877 456777753
No 110
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=94.04 E-value=12 Score=45.66 Aligned_cols=60 Identities=18% Similarity=0.327 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhc
Q 003405 506 ILDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEE 575 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~ 575 (823)
.+.++|+..|.+.++.+....++..=.. ....-|..|+.-|.++|+.++|++++.+....
T Consensus 361 ~~~~~Li~~y~k~G~~~~A~~vf~~m~~----------~d~~t~n~lI~~y~~~G~~~~A~~lf~~M~~~ 420 (697)
T PLN03081 361 VANTALVDLYSKWGRMEDARNVFDRMPR----------KNLISWNALIAGYGNHGRGTKAVEMFERMIAE 420 (697)
T ss_pred eehHHHHHHHHHCCCHHHHHHHHHhCCC----------CCeeeHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 4678899999998775554444432000 01234889999999999999999999987643
No 111
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COPI (coat protein complex I) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and budding from Golgi membranes. Activated small guanosine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated with ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in the coatomer alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=94.03 E-value=12 Score=42.55 Aligned_cols=264 Identities=14% Similarity=0.092 Sum_probs=119.2
Q ss_pred CCeeEEEEecccCceeeEeCc-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeee
Q 003405 76 KPILSMEVLASRQLLLSLSES-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKD 153 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~ke 153 (823)
-....|..-|....++|+.|| ..+|.-..+... ...+ +..|++.. ++..||.. .++|.||+=-.+ ...+.
T Consensus 33 ~~p~~ls~npngr~v~V~g~geY~iyt~~~~r~k---~~G~-g~~~vw~~-~n~yAv~~~~~~I~I~kn~~~---~~~k~ 104 (443)
T PF04053_consen 33 IYPQSLSHNPNGRFVLVCGDGEYEIYTALAWRNK---AFGS-GLSFVWSS-RNRYAVLESSSTIKIYKNFKN---EVVKS 104 (443)
T ss_dssp S--SEEEE-TTSSEEEEEETTEEEEEETTTTEEE---EEEE--SEEEE-T-SSEEEEE-TTS-EEEEETTEE----TT--
T ss_pred cCCeeEEECCCCCEEEEEcCCEEEEEEccCCccc---ccCc-eeEEEEec-CccEEEEECCCeEEEEEcCcc---ccceE
Confidence 356778888988888887776 666763333221 1222 34455555 55677774 566877732111 11234
Q ss_pred ecCCCCceEEEecCCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-EeCCeEEEEcCCCc------
Q 003405 154 FGVPDTVKSMSWCGENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GKENIGVFVDQNGK------ 226 (823)
Q Consensus 154 i~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~~~~gvfv~~~G~------ 226 (823)
+.+|..+..|.- |..+++......+++|..+++...-+.... -..+.+.++++++. ++++..++.+.+-+
T Consensus 105 i~~~~~~~~If~-G~LL~~~~~~~i~~yDw~~~~~i~~i~v~~--vk~V~Ws~~g~~val~t~~~i~il~~~~~~~~~~~ 181 (443)
T PF04053_consen 105 IKLPFSVEKIFG-GNLLGVKSSDFICFYDWETGKLIRRIDVSA--VKYVIWSDDGELVALVTKDSIYILKYNLEAVAAIP 181 (443)
T ss_dssp ---SS-EEEEE--SSSEEEEETTEEEEE-TTT--EEEEESS-E---EEEEE-TTSSEEEEE-S-SEEEEEE-HHHHHHBT
T ss_pred EcCCcccceEEc-CcEEEEECCCCEEEEEhhHcceeeEEecCC--CcEEEEECCCCEEEEEeCCeEEEEEecchhccccc
Confidence 555655666544 777888877779999999987655555421 12456666666543 44544444332222
Q ss_pred ----cccCCceee-cCCCcEEEEeCCEEEEEeC-CeEEEEEccCCCc--eeEEEeeCCccccc---ccCCeEEEe-ccce
Q 003405 227 ----LLQADRICW-SEAPIAVIIQKPYAIALLP-RRVEVRSLRVPYA--LIQTIVLQNVRHLI---PSSNAVVVA-LENS 294 (823)
Q Consensus 227 ----~~~~~~i~w-~~~P~~v~~~~PYll~~~~-~~ieV~~l~~~~~--lvQ~i~l~~~~~l~---~~~~~v~v~-s~~~ 294 (823)
.-.-..+.= +....+.+|..- ++..+. +.+.- +.+ +. .+.++ +..-.|. ...+.+|+. -+..
T Consensus 182 ~~g~e~~f~~~~E~~~~IkSg~W~~d-~fiYtT~~~lkY--l~~-Ge~~~i~~l--d~~~yllgy~~~~~~ly~~Dr~~~ 255 (443)
T PF04053_consen 182 EEGVEDAFELIHEISERIKSGCWVED-CFIYTTSNHLKY--LVN-GETGIIAHL--DKPLYLLGYLPKENRLYLIDRDGN 255 (443)
T ss_dssp TTB-GGGEEEEEEE-S--SEEEEETT-EEEEE-TTEEEE--EET-TEEEEEEE---SS--EEEEEETTTTEEEEE-TT--
T ss_pred ccCchhceEEEEEecceeEEEEEEcC-EEEEEcCCeEEE--EEc-CCcceEEEc--CCceEEEEEEccCCEEEEEECCCC
Confidence 100011111 334555555554 333332 22221 222 21 12222 2221221 123455544 4556
Q ss_pred EEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHH
Q 003405 295 IFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEH 361 (823)
Q Consensus 295 I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~ 361 (823)
|..+..-+-.-+-+..+-++.+++++...+.- ..+. +--...+...+.+|-++|-.+.|++.
T Consensus 256 v~~~~ld~~~~~fk~av~~~d~~~v~~~i~~~----~ll~-~i~~~~~~~i~~fL~~~G~~e~AL~~ 317 (443)
T PF04053_consen 256 VISYELDLSELEFKTAVLRGDFEEVLRMIAAS----NLLP-NIPKDQGQSIARFLEKKGYPELALQF 317 (443)
T ss_dssp EEEEE--HHHHHHHHHHHTT-HHH-----HHH----HTGG-G--HHHHHHHHHHHHHTT-HHHHHHH
T ss_pred EEEEEECHHHHHHHHHHHcCChhhhhhhhhhh----hhcc-cCChhHHHHHHHHHHHCCCHHHHHhh
Confidence 76665555556667778999999977766421 0000 00123466778888899999999877
No 112
>PLN03077 Protein ECB2; Provisional
Probab=94.01 E-value=22 Score=44.50 Aligned_cols=59 Identities=19% Similarity=0.261 Sum_probs=42.3
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 506 ILDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
.+.++|+..|.+.++.+....++..=.. ...--|..++.-|...|+.++|++++.+...
T Consensus 425 ~~~n~Li~~y~k~g~~~~A~~vf~~m~~----------~d~vs~~~mi~~~~~~g~~~eA~~lf~~m~~ 483 (857)
T PLN03077 425 VVANALIEMYSKCKCIDKALEVFHNIPE----------KDVISWTSIIAGLRLNNRCFEALIFFRQMLL 483 (857)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHhCCC----------CCeeeHHHHHHHHHHCCCHHHHHHHHHHHHh
Confidence 5788899999998775555444432000 0112377899999999999999999999864
No 113
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=93.91 E-value=0.32 Score=52.47 Aligned_cols=161 Identities=14% Similarity=0.271 Sum_probs=103.5
Q ss_pred ccccCCCCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc
Q 003405 10 ELISNCSPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR 87 (823)
Q Consensus 10 ~l~~~~~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~ 87 (823)
-|++.=...|+|+..- +.+++=|..+|.|..|..+- ..++.+++.++-+|..+..-|.-
T Consensus 132 tilQaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpnm-------------------nnVk~~~ahh~eaIRdlafSpnD 192 (464)
T KOG0284|consen 132 TILQAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPNM-------------------NNVKIIQAHHAEAIRDLAFSPND 192 (464)
T ss_pred HHhhhhcccceeEEEccCCCEEEEcCCCceEEecccch-------------------hhhHHhhHhhhhhhheeccCCCC
Confidence 3344444567777643 57999999999999997442 22344445555789999999988
Q ss_pred CceeeEeC-c-EEEEeCCCCcccccccC-CCCcEEEEeeCCCceEEEEEcCeEEEEEEcC--CCceeEeeeecCCCCceE
Q 003405 88 QLLLSLSE-S-IAFHRLPNLETIAVLTK-AKGANVYSWDDRRGFLCFARQKRVCIFRHDG--GRGFVEVKDFGVPDTVKS 162 (823)
Q Consensus 88 ~~Ll~l~d-~-l~~~~L~~l~~~~~i~~-~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~--~~~f~~~kei~~~~~~~~ 162 (823)
...++++| + |++|+...-++-..+.. .=.+.++..++..+.|+++.|..+ +=-|+. ++....+ ..-...|.+
T Consensus 193 skF~t~SdDg~ikiWdf~~~kee~vL~GHgwdVksvdWHP~kgLiasgskDnl-VKlWDprSg~cl~tl--h~HKntVl~ 269 (464)
T KOG0284|consen 193 SKFLTCSDDGTIKIWDFRMPKEERVLRGHGWDVKSVDWHPTKGLIASGSKDNL-VKLWDPRSGSCLATL--HGHKNTVLA 269 (464)
T ss_pred ceeEEecCCCeEEEEeccCCchhheeccCCCCcceeccCCccceeEEccCCce-eEeecCCCcchhhhh--hhccceEEE
Confidence 99999998 5 99998644332222211 124778888999999999988772 333442 2211111 112457888
Q ss_pred EEec--CCeEEEEEc-CceEEEEcCCCCeeeccCC
Q 003405 163 MSWC--GENICIAIR-KGYMILNATNGALSEVFPS 194 (823)
Q Consensus 163 l~~~--~~~i~v~~~-~~y~lidl~~~~~~~L~~~ 194 (823)
+.|. ++.+.-+.+ ..-.++|+. ...+|+..
T Consensus 270 ~~f~~n~N~Llt~skD~~~kv~DiR--~mkEl~~~ 302 (464)
T KOG0284|consen 270 VKFNPNGNWLLTGSKDQSCKVFDIR--TMKELFTY 302 (464)
T ss_pred EEEcCCCCeeEEccCCceEEEEehh--HhHHHHHh
Confidence 8898 456666666 456788887 45555543
No 114
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=93.91 E-value=0.63 Score=52.82 Aligned_cols=136 Identities=13% Similarity=0.157 Sum_probs=80.4
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Cceee-EeC-cEEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLS-LSE-SIAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~-l~d-~l~~~~L~ 103 (823)
.+|.|||++|.|.+|.+..+..++. ..+..+..++ +.-+|..|..=|-. ++|++ ..| .|.+|+|.
T Consensus 641 ~rLAVa~ddg~i~lWr~~a~gl~e~-----------~~tPe~~lt~-h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl~ 708 (1012)
T KOG1445|consen 641 ERLAVATDDGQINLWRLTANGLPEN-----------EMTPEKILTI-HGEKITSLRFHPLAADVLAVASYDSTIELWDLA 708 (1012)
T ss_pred HHeeecccCceEEEEEeccCCCCcc-----------cCCcceeeec-ccceEEEEEecchhhhHhhhhhccceeeeeehh
Confidence 6899999999999999886544321 1111222222 24567777766633 44443 346 49999997
Q ss_pred CCcccccc-cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEEEcCceEEE
Q 003405 104 NLETIAVL-TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIAIRKGYMIL 181 (823)
Q Consensus 104 ~l~~~~~i-~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~li 181 (823)
+-+.-.++ ....++-.||++++..+++-+ ...+|.+|+-..+. ..++|-..|- .-+|..|.|++...|.++
T Consensus 709 ~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~~rVy~Prs~e--~pv~Eg~gpv-----gtRgARi~wacdgr~viv 781 (1012)
T KOG1445|consen 709 NAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGTLRVYEPRSRE--QPVYEGKGPV-----GTRGARILWACDGRIVIV 781 (1012)
T ss_pred hhhhhheeccCcCceeEEEECCCCcceeeeecCceEEEeCCCCCC--CccccCCCCc-----cCcceeEEEEecCcEEEE
Confidence 65433332 334567779999988777654 66789999876432 2344432221 123444555555555443
No 115
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=93.88 E-value=0.66 Score=51.28 Aligned_cols=132 Identities=11% Similarity=0.183 Sum_probs=84.9
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEEeCCC
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFHRLPN 104 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~L~~ 104 (823)
.|.+...+|.+..|++++... .+ .+...++.|-.-|...|..+.|++-.+ . |.+|+...
T Consensus 179 lL~~asd~G~VtlwDv~g~sp--------------~~----~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s 240 (673)
T KOG4378|consen 179 LLSIASDKGAVTLWDVQGMSP--------------IF----HASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRS 240 (673)
T ss_pred eeEeeccCCeEEEEeccCCCc--------------cc----chhhhccCCcCcceecCCccceEEEecccceEEEeeccc
Confidence 466778899999999876542 11 111234678888888888877766654 3 99999875
Q ss_pred CcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEEEcCceE
Q 003405 105 LETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIAIRKGYM 179 (823)
Q Consensus 105 l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~ 179 (823)
-+....+.-....+.+++.++...+|+| .+.+|..|..+....-..+.. ...-.+++++|.-.. .|.+++.|.
T Consensus 241 ~~s~~~l~y~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~~k~Pv~v~s-ah~~sVt~vafq~s~-tvltkssln 314 (673)
T KOG4378|consen 241 QASTDRLTYSHPLSTVAFSECGTYLCAGNSKGELIAYDMRSTKAPVAVRS-AHDASVTRVAFQPSP-TVLTKSSLN 314 (673)
T ss_pred ccccceeeecCCcceeeecCCceEEEeecCCceEEEEecccCCCCceEee-ecccceeEEEeeecc-eeeeccccc
Confidence 4444444445567788889888889998 778888888875332222221 122358888886443 444554444
No 116
>KOG2076 consensus RNA polymerase III transcription factor TFIIIC [Transcription]
Probab=93.87 E-value=1.3 Score=52.81 Aligned_cols=239 Identities=21% Similarity=0.189 Sum_probs=127.0
Q ss_pred cHHHHHHHHHh----c----CcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCC--Cc--cccc-ccCChHHHHH-Hhh-
Q 003405 535 DVKICEEILQK----K----NHYTALLELYKSNARHREALKLLHELVEESKSNQ--SQ--DEHT-QKFNPESIIE-YLK- 599 (823)
Q Consensus 535 ~~~~~~~~L~~----~----~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~--~~--~~~~-~~~~~~~~i~-yL~- 599 (823)
+.++++++|++ . --|..|+..|...|++++||..|..-++-.-.|. |. .++. ++...+.+.- |-+
T Consensus 154 ~~eeA~~i~~EvIkqdp~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~e~W~~ladls~~~~~i~qA~~cy~rA 233 (895)
T KOG2076|consen 154 DLEEAEEILMEVIKQDPRNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDYELWKRLADLSEQLGNINQARYCYSRA 233 (895)
T ss_pred CHHHHHHHHHHHHHhCccchhhHHHHHHHHHHcccHHHHHHHHHHHHhcCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Confidence 45666666654 2 3388999999999999999999998776654432 11 0111 1122222221 111
Q ss_pred -cCCCCChhhHHHhhhhhhhcC-cccccccccc--CCCCh---HHHHHHH--------hhcCchhHHHHHHHHhhcccCC
Q 003405 600 -PLCGTDPMLVLEFSMLVLESC-PTQTIELFLS--GNIPA---DLVNSYL--------KQYSPSMQGRYLELMLAMNENS 664 (823)
Q Consensus 600 -~L~~~~~~li~~y~~wll~~~-p~~~~~if~~--~~l~~---~~Vl~~L--------~~~~~~~~~~YLE~li~~~~~~ 664 (823)
++.+++.+++|+.+...-+.. -..|++-|.. ...|| +++.+-+ .......+..+||.-+......
T Consensus 234 I~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d~er~~d~i~~~~~~~~~~~~~e~a~~~le~~~s~~~~~ 313 (895)
T KOG2076|consen 234 IQANPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDPPVDIERIEDLIRRVAHYFITHNERERAAKALEGALSKEKDE 313 (895)
T ss_pred HhcCCcchHHHHHHHHHHHHhChHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhcccc
Confidence 235778999999987766542 2234433333 12222 2222222 2222456788888766532223
Q ss_pred CChhHHHHHHHHHHHHHHHHhh-----hhhh--hcccCcccchH---HHHHHHHHhhhc--CCCChHH------------
Q 003405 665 ISGNLQNEMVQIYLSEVLDWYS-----DLSA--QQKWDEKAYSP---TRKKLLSALESI--SGYNPEV------------ 720 (823)
Q Consensus 665 ~~~~~h~~L~~lYl~~i~~~~~-----~~~~--~~~~~~~~~~~---~r~kLl~fL~~s--~~Yd~~~------------ 720 (823)
.+-+-++.++++||..-..... .... .++ |++++.. .|.....|.+-. -.|++..
T Consensus 314 ~~~ed~ni~ael~l~~~q~d~~~~~i~~~~~r~~e~-d~~e~~~~~~~~~~~~~~~~~~~~~s~~l~v~rl~icL~~L~~ 392 (895)
T KOG2076|consen 314 ASLEDLNILAELFLKNKQSDKALMKIVDDRNRESEK-DDSEWDTDERRREEPNALCEVGKELSYDLRVIRLMICLVHLKE 392 (895)
T ss_pred ccccHHHHHHHHHHHhHHHHHhhHHHHHHhccccCC-ChhhhhhhhhccccccccccCCCCCCccchhHhHhhhhhcccc
Confidence 4566788999999875321110 0000 000 1111100 011111111111 1122222
Q ss_pred ------HhccCCC--------CchhhHHHHHhhccccHHHHHHHHHHHhCCC---chhHHHHHHHHhcCCC
Q 003405 721 ------LLKRLPA--------DALYEERAILLGKMNQHELALSLYVHKVFLI---NQPVFLLIRRMAMDIK 774 (823)
Q Consensus 721 ------~L~~~~~--------~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~---~~a~~~~l~~~y~~~~ 774 (823)
++..+.. -+|+-..+=+|-..|++++|++++.--.+.. +..+|+-+.+.|++.+
T Consensus 393 ~e~~e~ll~~l~~~n~~~~d~~dL~~d~a~al~~~~~~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l~ 463 (895)
T KOG2076|consen 393 RELLEALLHFLVEDNVWVSDDVDLYLDLADALTNIGKYKEALRLLSPITNREGYQNAFVWYKLARCYMELG 463 (895)
T ss_pred cchHHHHHHHHHHhcCChhhhHHHHHHHHHHHHhcccHHHHHHHHHHHhcCccccchhhhHHHHHHHHHHh
Confidence 2222211 1466667778899999999999998655443 3569999999999875
No 117
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=93.86 E-value=2.8 Score=44.19 Aligned_cols=144 Identities=8% Similarity=0.111 Sum_probs=89.3
Q ss_pred ecccCceeeEeC---cEEEEeC-----CCCcccccccCCC----CcEEEEeeCCCceEEEE-EcCeEEEEEEc----CCC
Q 003405 84 LASRQLLLSLSE---SIAFHRL-----PNLETIAVLTKAK----GANVYSWDDRRGFLCFA-RQKRVCIFRHD----GGR 146 (823)
Q Consensus 84 ~~~~~~Ll~l~d---~l~~~~L-----~~l~~~~~i~~~k----g~~~fa~~~~~~~l~V~-~kkki~l~~~~----~~~ 146 (823)
+...+..++.|+ .|++|.. .+++.+..+...| +++.+|.+++..+++-+ ...++.||..+ .+.
T Consensus 236 vSP~GRFia~~gFTpDVkVwE~~f~kdG~fqev~rvf~LkGH~saV~~~aFsn~S~r~vtvSkDG~wriwdtdVrY~~~q 315 (420)
T KOG2096|consen 236 VSPDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVKRVFSLKGHQSAVLAAAFSNSSTRAVTVSKDGKWRIWDTDVRYEAGQ 315 (420)
T ss_pred eCCCCcEEEEecCCCCceEEEEEeccCcchhhhhhhheeccchhheeeeeeCCCcceeEEEecCCcEEEeeccceEecCC
Confidence 334466666666 3888865 2455554444555 57889999888776555 55778787543 122
Q ss_pred ceeEeeeecCC-----CCceEEEec--CCeEEEEEcCceEEEEcCCCCeeeccCCC-CCCCCEEEEccCCeEEEEeCCeE
Q 003405 147 GFVEVKDFGVP-----DTVKSMSWC--GENICIAIRKGYMILNATNGALSEVFPSG-RIGPPLVVSLLSGELLLGKENIG 218 (823)
Q Consensus 147 ~f~~~kei~~~-----~~~~~l~~~--~~~i~v~~~~~y~lidl~~~~~~~L~~~~-~~~~p~i~~~~~~EfLL~~~~~g 218 (823)
.-+.+|+.+.| ..|.-++.. |+.+.++..+...+++..+|+..+-+..- ...-.++...+++.|+..+++-.
T Consensus 316 Dpk~Lk~g~~pl~aag~~p~RL~lsP~g~~lA~s~gs~l~~~~se~g~~~~~~e~~h~~~Is~is~~~~g~~~atcGdr~ 395 (420)
T KOG2096|consen 316 DPKILKEGSAPLHAAGSEPVRLELSPSGDSLAVSFGSDLKVFASEDGKDYPELEDIHSTTISSISYSSDGKYIATCGDRY 395 (420)
T ss_pred CchHhhcCCcchhhcCCCceEEEeCCCCcEEEeecCCceEEEEcccCccchhHHHhhcCceeeEEecCCCcEEeeeccee
Confidence 23445665333 345455554 78899999999999999988765554321 11223455667899999766533
Q ss_pred --EEEcCCCcc
Q 003405 219 --VFVDQNGKL 227 (823)
Q Consensus 219 --vfv~~~G~~ 227 (823)
+|-|.-|..
T Consensus 396 vrv~~ntpg~~ 406 (420)
T KOG2096|consen 396 VRVIRNTPGWH 406 (420)
T ss_pred eeeecCCCchh
Confidence 444566653
No 118
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=93.84 E-value=0.68 Score=50.10 Aligned_cols=171 Identities=15% Similarity=0.205 Sum_probs=96.7
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLP 103 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~ 103 (823)
|++|++||..|.+-.|+... ..|+.+. .-+..+|.-++.......+++=.+ | |++|...
T Consensus 108 GRRLltgs~SGEFtLWNg~~----------------fnFEtil---QaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpn 168 (464)
T KOG0284|consen 108 GRRLLTGSQSGEFTLWNGTS----------------FNFETIL---QAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPN 168 (464)
T ss_pred CceeEeecccccEEEecCce----------------eeHHHHh---hhhcccceeEEEccCCCEEEEcCCCceEEecccc
Confidence 58999999999999997431 2333332 234789999999998777776544 3 9999753
Q ss_pred CCcccccc--cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCC-CCceEEEecCC--eEEEEEc-C
Q 003405 104 NLETIAVL--TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVP-DTVKSMSWCGE--NICIAIR-K 176 (823)
Q Consensus 104 ~l~~~~~i--~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~-~~~~~l~~~~~--~i~v~~~-~ 176 (823)
+..+..+ .....|..++.+++...++-+ -++.|.|+...... .-+-+.-+ -.|+++.|... .|+.|.+ +
T Consensus 169 -mnnVk~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~k---ee~vL~GHgwdVksvdWHP~kgLiasgskDn 244 (464)
T KOG0284|consen 169 -MNNVKIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPK---EERVLRGHGWDVKSVDWHPTKGLIASGSKDN 244 (464)
T ss_pred -hhhhHHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCCc---hhheeccCCCCcceeccCCccceeEEccCCc
Confidence 1111111 111234556666544444444 44556665544221 11112223 37999999954 4666666 4
Q ss_pred ceEEEEcCCCCee-eccCCCCCCCCEEE---EccCCeEEEE--eCCeEEEEcC
Q 003405 177 GYMILNATNGALS-EVFPSGRIGPPLVV---SLLSGELLLG--KENIGVFVDQ 223 (823)
Q Consensus 177 ~y~lidl~~~~~~-~L~~~~~~~~p~i~---~~~~~EfLL~--~~~~gvfv~~ 223 (823)
-..+.|..+|+.. .+... +.+|+ ..+++.+|+. .|..+-.+|.
T Consensus 245 lVKlWDprSg~cl~tlh~H----KntVl~~~f~~n~N~Llt~skD~~~kv~Di 293 (464)
T KOG0284|consen 245 LVKLWDPRSGSCLATLHGH----KNTVLAVKFNPNGNWLLTGSKDQSCKVFDI 293 (464)
T ss_pred eeEeecCCCcchhhhhhhc----cceEEEEEEcCCCCeeEEccCCceEEEEeh
Confidence 4566676666432 22221 22333 3366788873 5555555554
No 119
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=93.79 E-value=0.25 Score=59.78 Aligned_cols=217 Identities=19% Similarity=0.259 Sum_probs=116.6
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccC-ceeeEeC-c-EEEEeCCC
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQ-LLLSLSE-S-IAFHRLPN 104 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~-~Ll~l~d-~-l~~~~L~~ 104 (823)
-|.=|++||.|..|+......+ ....++.++. .+..+|.-|.+=+..+ +|..-++ | |.+|||.+
T Consensus 82 lIaGG~edG~I~ly~p~~~~~~------------~~~~~la~~~-~h~G~V~gLDfN~~q~nlLASGa~~geI~iWDlnn 148 (1049)
T KOG0307|consen 82 LIAGGLEDGNIVLYDPASIIAN------------ASEEVLATKS-KHTGPVLGLDFNPFQGNLLASGADDGEILIWDLNK 148 (1049)
T ss_pred eeeccccCCceEEecchhhccC------------cchHHHhhhc-ccCCceeeeeccccCCceeeccCCCCcEEEeccCC
Confidence 3778899999999986653111 1222333333 2367899988888665 5665555 5 99999988
Q ss_pred Ccccccc---cCCCCcEEEEeeCCCce-EEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC---CeEEEEEcC
Q 003405 105 LETIAVL---TKAKGANVYSWDDRRGF-LCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG---ENICIAIRK 176 (823)
Q Consensus 105 l~~~~~i---~~~kg~~~fa~~~~~~~-l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~---~~i~v~~~~ 176 (823)
++...+. ...-.|++++++..... +|-+ ...+..|+.++..+...++.+..-.-.+..++|.. +.|.+++..
T Consensus 149 ~~tP~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As~d 228 (1049)
T KOG0307|consen 149 PETPFTPGSQAPPSEIKCLSWNRKVSHILASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVASGD 228 (1049)
T ss_pred cCCCCCCCCCCCcccceEeccchhhhHHhhccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCCceeeeeecCC
Confidence 7644332 23346777877765543 4444 33467677766332222222222112355899983 358888774
Q ss_pred c----eEEEEcCCCC-eeeccCCCCCCCCEEEEc-cCCeEEE--EeCCeEEEEcC-CCcccc----C----CceeecCCC
Q 003405 177 G----YMILNATNGA-LSEVFPSGRIGPPLVVSL-LSGELLL--GKENIGVFVDQ-NGKLLQ----A----DRICWSEAP 239 (823)
Q Consensus 177 ~----y~lidl~~~~-~~~L~~~~~~~~p~i~~~-~~~EfLL--~~~~~gvfv~~-~G~~~~----~----~~i~w~~~P 239 (823)
. +.+-|+.... ....+..-..+.-.+-+. .|.++|| ++|+..+.-|. .|+... + ..++|...
T Consensus 229 d~~PviqlWDlR~assP~k~~~~H~~GilslsWc~~D~~lllSsgkD~~ii~wN~~tgEvl~~~p~~~nW~fdv~w~pr- 307 (1049)
T KOG0307|consen 229 DSAPVIQLWDLRFASSPLKILEGHQRGILSLSWCPQDPRLLLSSGKDNRIICWNPNTGEVLGELPAQGNWCFDVQWCPR- 307 (1049)
T ss_pred CCCceeEeecccccCCchhhhcccccceeeeccCCCCchhhhcccCCCCeeEecCCCceEeeecCCCCcceeeeeecCC-
Confidence 2 4445544321 111121111110112233 3447777 35666666654 232221 1 23445221
Q ss_pred cEEEEeCCEEEEEe--CCeEEEEEccC
Q 003405 240 IAVIIQKPYAIALL--PRRVEVRSLRV 264 (823)
Q Consensus 240 ~~v~~~~PYll~~~--~~~ieV~~l~~ 264 (823)
.|-+++.. ++.|+|+++..
T Consensus 308 ------~P~~~A~asfdgkI~I~sl~~ 328 (1049)
T KOG0307|consen 308 ------NPSVMAAASFDGKISIYSLQG 328 (1049)
T ss_pred ------Ccchhhhheeccceeeeeeec
Confidence 35455444 47899999874
No 120
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=93.74 E-value=2.4 Score=49.49 Aligned_cols=147 Identities=14% Similarity=0.116 Sum_probs=86.9
Q ss_pred CCcEEEEEEeCCE--EEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYGLK--ILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~~~--L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
+..++|+.++++. |.||..||.|.+|+...... .-.+. -+|++|+-|.....+.+|++=
T Consensus 65 k~evt~l~~~~d~l~lAVGYaDGsVqif~~~s~~~------------------~~tfn-gHK~AVt~l~fd~~G~rlaSG 125 (888)
T KOG0306|consen 65 KAEVTCLRSSDDILLLAVGYADGSVQIFSLESEEI------------------LITFN-GHKAAVTTLKFDKIGTRLASG 125 (888)
T ss_pred cceEEEeeccCCcceEEEEecCceEEeeccCCCce------------------eeeec-ccccceEEEEEcccCceEeec
Confidence 3479999999874 59999999999998664311 11222 358999999999887777765
Q ss_pred e-Cc-EEEEeCCCCcccccccCCCC-cE-EEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCe
Q 003405 94 S-ES-IAFHRLPNLETIAVLTKAKG-AN-VYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGEN 169 (823)
Q Consensus 94 ~-d~-l~~~~L~~l~~~~~i~~~kg-~~-~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~ 169 (823)
+ |+ |.+||+-.-+...++..-|. |+ ++.+..+.-.+.++...-|.++.+.....|... +.--..+-+|++.++.
T Consensus 126 skDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~~~~~lvS~sKDs~iK~WdL~tqhCf~Th--vd~r~Eiw~l~~~~~~ 203 (888)
T KOG0306|consen 126 SKDTDIIVWDLVGEEGLFRLRGHKDSITQALFLNGDSFLVSVSKDSMIKFWDLETQHCFETH--VDHRGEIWALVLDEKL 203 (888)
T ss_pred CCCccEEEEEeccceeeEEeecchHHHhHHhccCCCeEEEEeccCceEEEEecccceeeeEE--ecccceEEEEEEecce
Confidence 5 34 99999854333333322221 22 223332222233344455666666522122211 1123466677777743
Q ss_pred -EEEEEcCceEEEEc
Q 003405 170 -ICIAIRKGYMILNA 183 (823)
Q Consensus 170 -i~v~~~~~y~lidl 183 (823)
|..|+.++..++++
T Consensus 204 lvt~~~dse~~v~~L 218 (888)
T KOG0306|consen 204 LVTAGTDSELKVWEL 218 (888)
T ss_pred EEEEecCCceEEEEe
Confidence 45555578888887
No 121
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=93.69 E-value=1 Score=48.78 Aligned_cols=137 Identities=14% Similarity=0.123 Sum_probs=85.4
Q ss_pred EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-cEEE
Q 003405 22 VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E-SIAF 99 (823)
Q Consensus 22 i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~ 99 (823)
|++.+..++=|-.|+.|..|++.... ..+++.. ...|+.+.+..+..-+++.+ | .+.+
T Consensus 308 I~~~~~~~~SgH~DkkvRfwD~Rs~~------------------~~~sv~~--gg~vtSl~ls~~g~~lLsssRDdtl~v 367 (459)
T KOG0288|consen 308 IVCSISDVISGHFDKKVRFWDIRSAD------------------KTRSVPL--GGRVTSLDLSMDGLELLSSSRDDTLKV 367 (459)
T ss_pred eEecceeeeecccccceEEEeccCCc------------------eeeEeec--CcceeeEeeccCCeEEeeecCCCceee
Confidence 33346667778889999999866432 1223332 34899999999988888887 3 4999
Q ss_pred EeCCCCccccc--ccCCC---CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC--CeEE
Q 003405 100 HRLPNLETIAV--LTKAK---GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG--ENIC 171 (823)
Q Consensus 100 ~~L~~l~~~~~--i~~~k---g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~--~~i~ 171 (823)
+++.+++-... -...| +.+..+++++...++.| ..+.+.|+....++.-..++.-..+..|++++|.+ ..+.
T Consensus 368 iDlRt~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Ll 447 (459)
T KOG0288|consen 368 IDLRTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNPSGSGLL 447 (459)
T ss_pred eecccccEEEEeeccccccccccceeEECCCCceeeeccCCCcEEEEEccCceEEEEeccCCCCcceEEEEEcCCCchhh
Confidence 99988763322 11222 34556677776667766 77788888877553212233222233689999983 4444
Q ss_pred EEEcCce
Q 003405 172 IAIRKGY 178 (823)
Q Consensus 172 v~~~~~y 178 (823)
-+.+..|
T Consensus 448 sadk~~~ 454 (459)
T KOG0288|consen 448 SADKQKA 454 (459)
T ss_pred cccCCcc
Confidence 4444333
No 122
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=93.44 E-value=1.7 Score=47.17 Aligned_cols=171 Identities=9% Similarity=0.145 Sum_probs=98.4
Q ss_pred cEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 18 KIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 18 ~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
.++|.+.+ +.+++.|..|+.+++|+++++... .+ ++....+|..|.+-++...+++++
T Consensus 314 S~~sc~W~pDg~~~V~Gs~dr~i~~wdlDgn~~~-------------~W------~gvr~~~v~dlait~Dgk~vl~v~~ 374 (519)
T KOG0293|consen 314 SVSSCAWCPDGFRFVTGSPDRTIIMWDLDGNILG-------------NW------EGVRDPKVHDLAITYDGKYVLLVTV 374 (519)
T ss_pred CcceeEEccCCceeEecCCCCcEEEecCCcchhh-------------cc------cccccceeEEEEEcCCCcEEEEEec
Confidence 44444433 468999999999999999876532 22 222235688888888887766655
Q ss_pred Cc-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC-CeEE
Q 003405 95 ES-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG-ENIC 171 (823)
Q Consensus 95 d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~-~~i~ 171 (823)
|. +.+|+..+..-...+..-.+++.|+++.+...+.|- ....+.++.+.+.+..++..-.....-+..-+|-| +.-+
T Consensus 375 d~~i~l~~~e~~~dr~lise~~~its~~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~f 454 (519)
T KOG0293|consen 375 DKKIRLYNREARVDRGLISEEQPITSFSISKDGKLALVNLQDQEIHLWDLEENKLVRKYFGHKQGHFIIRSCFGGGNDKF 454 (519)
T ss_pred ccceeeechhhhhhhccccccCceeEEEEcCCCcEEEEEcccCeeEEeecchhhHHHHhhcccccceEEEeccCCCCcce
Confidence 53 888876543323345566789999998876443343 66778888776332111111111122223334443 3234
Q ss_pred EEEc---CceEEEEcCCCCeeeccCCCCCCCCEEEEccC
Q 003405 172 IAIR---KGYMILNATNGALSEVFPSGRIGPPLVVSLLS 207 (823)
Q Consensus 172 v~~~---~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~ 207 (823)
+++. +..+|.+..+|.....++--...-.|+.+-|.
T Consensus 455 iaSGSED~kvyIWhr~sgkll~~LsGHs~~vNcVswNP~ 493 (519)
T KOG0293|consen 455 IASGSEDSKVYIWHRISGKLLAVLSGHSKTVNCVSWNPA 493 (519)
T ss_pred EEecCCCceEEEEEccCCceeEeecCCcceeeEEecCCC
Confidence 4433 35777888877765555432222344444443
No 123
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=93.41 E-value=4.3 Score=46.29 Aligned_cols=82 Identities=17% Similarity=0.152 Sum_probs=56.5
Q ss_pred ccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCCCcEEEEeeCCC--ceEEEEE
Q 003405 58 SLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAKGANVYSWDDRR--GFLCFAR 133 (823)
Q Consensus 58 ~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~--~~l~V~~ 133 (823)
.|++.+..+...+++ ++..|..|.+-+.++.|+.=+| | |++|.+.+..-+.++...--+.++++++.. +.|+||.
T Consensus 384 dLrPFPt~~~lvyrG-Htg~Vr~iSvdp~G~wlasGsdDGtvriWEi~TgRcvr~~~~d~~I~~vaw~P~~~~~vLAvA~ 462 (733)
T KOG0650|consen 384 DLRPFPTRCALVYRG-HTGLVRSISVDPSGEWLASGSDDGTVRIWEIATGRCVRTVQFDSEIRSVAWNPLSDLCVLAVAV 462 (733)
T ss_pred hcCCCcceeeeeEec-cCCeEEEEEecCCcceeeecCCCCcEEEEEeecceEEEEEeecceeEEEEecCCCCceeEEEEe
Confidence 344444444334443 4678999999999888888877 5 999999877655555444566777777654 5688887
Q ss_pred cCeEEEE
Q 003405 134 QKRVCIF 140 (823)
Q Consensus 134 kkki~l~ 140 (823)
...+.|.
T Consensus 463 ~~~~~iv 469 (733)
T KOG0650|consen 463 GECVLIV 469 (733)
T ss_pred cCceEEe
Confidence 7775443
No 124
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=93.35 E-value=0.25 Score=39.59 Aligned_cols=55 Identities=22% Similarity=0.378 Sum_probs=42.8
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+...+++.|+|++|+..++.....++ .-...+...|..++.+|++++|...|.+.
T Consensus 3 ~a~~~~~~g~~~~A~~~~~~~l~~~P-----~~~~a~~~lg~~~~~~g~~~~A~~~~~~a 57 (65)
T PF13432_consen 3 LARALYQQGDYDEAIAAFEQALKQDP-----DNPEAWYLLGRILYQQGRYDEALAYYERA 57 (65)
T ss_dssp HHHHHHHCTHHHHHHHHHHHHHCCST-----THHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHCC-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 45678999999999999987632221 23457778899999999999999999863
No 125
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The presence of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=93.22 E-value=27 Score=43.06 Aligned_cols=98 Identities=16% Similarity=0.108 Sum_probs=61.7
Q ss_pred chhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCC
Q 003405 647 PSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLP 726 (823)
Q Consensus 647 ~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~ 726 (823)
...+..+++.++... ..++...+.++..|...- + .+-+.+++. ++..-+
T Consensus 786 ~~~A~~~~~~~~~~~--p~~~~~~~~l~~~~~~~~-------------~--------~~A~~~~~~--------~~~~~~ 834 (899)
T TIGR02917 786 YDKAIKHYRTVVKKA--PDNAVVLNNLAWLYLELK-------------D--------PRALEYAEK--------ALKLAP 834 (899)
T ss_pred HHHHHHHHHHHHHhC--CCCHHHHHHHHHHHHhcC-------------c--------HHHHHHHHH--------HHhhCC
Confidence 556777888887642 245677777777776530 0 112222222 122222
Q ss_pred CC-chhhHHHHHhhccccHHHHHHHHHHHh--CCCchhHHHHHHHHhcCCCC
Q 003405 727 AD-ALYEERAILLGKMNQHELALSLYVHKV--FLINQPVFLLIRRMAMDIKP 775 (823)
Q Consensus 727 ~~-~l~~e~~~Ll~klg~h~~AL~ilv~~L--~D~~~a~~~~l~~~y~~~~~ 775 (823)
.+ .+..-.+.++.++|++++|+.++-.-+ +..++.++..+...|...+.
T Consensus 835 ~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~~~~~~~~l~~~~~~~g~ 886 (899)
T TIGR02917 835 NIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLATGR 886 (899)
T ss_pred CCcHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHHHcCC
Confidence 22 355677888999999999999887654 33467888888888887543
No 126
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=93.19 E-value=0.6 Score=54.39 Aligned_cols=108 Identities=18% Similarity=0.246 Sum_probs=79.3
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..+.|+..|- ++++-|++|-++..|++.... .+|-|.+ ++.||+.+.+.|..-.|.+-+
T Consensus 536 sDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~------------------~VRiF~G-H~~~V~al~~Sp~Gr~LaSg~ 596 (707)
T KOG0263|consen 536 SDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGN------------------SVRIFTG-HKGPVTALAFSPCGRYLASGD 596 (707)
T ss_pred cccceEEECCcccccccCCCCceEEEEEcCCCc------------------EEEEecC-CCCceEEEEEcCCCceEeecc
Confidence 5788998886 588999999999999966422 2566766 489999999999776677666
Q ss_pred C-c-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEc
Q 003405 95 E-S-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHD 143 (823)
Q Consensus 95 d-~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~ 143 (823)
+ + |.+|++.+-+.+.... ....+.++..+.+.+.++++ ....|.+|.+.
T Consensus 597 ed~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~dg~vLasgg~DnsV~lWD~~ 649 (707)
T KOG0263|consen 597 EDGLIKIWDLANGSLVKQLKGHTGTIYSLSFSRDGNVLASGGADNSVRLWDLT 649 (707)
T ss_pred cCCcEEEEEcCCCcchhhhhcccCceeEEEEecCCCEEEecCCCCeEEEEEch
Confidence 5 5 9999998755444322 22335566667777788887 67788888654
No 127
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=93.16 E-value=12 Score=40.33 Aligned_cols=166 Identities=16% Similarity=0.219 Sum_probs=101.7
Q ss_pred cccCCCCcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc
Q 003405 11 LISNCSPKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR 87 (823)
Q Consensus 11 l~~~~~~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~ 87 (823)
|.+.....|+|++-- ..-+.||+..| |++|.........- +.+. .+.-....++.-+..+|+.|.--++.
T Consensus 135 Lks~sQrnvtclawRPlsaselavgCr~g-IciW~~s~tln~~r----~~~~--~s~~~~qvl~~pgh~pVtsmqwn~dg 207 (445)
T KOG2139|consen 135 LKSVSQRNVTCLAWRPLSASELAVGCRAG-ICIWSDSRTLNANR----NIRM--MSTHHLQVLQDPGHNPVTSMQWNEDG 207 (445)
T ss_pred ecchhhcceeEEEeccCCcceeeeeecce-eEEEEcCccccccc----cccc--ccccchhheeCCCCceeeEEEEcCCC
Confidence 334445679998754 35899999999 67787654332110 0000 00000011122234689999999988
Q ss_pred CceeeEeC---cEEEEeCCCCcccccc-cCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeecCCCCceE
Q 003405 88 QLLLSLSE---SIAFHRLPNLETIAVL-TKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFGVPDTVKS 162 (823)
Q Consensus 88 ~~Ll~l~d---~l~~~~L~~l~~~~~i-~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~~~~~~~~ 162 (823)
..|+.-+= .+.+|+-++-..++-+ ...-|++....+++...++.+. .....++. ..+.+...+-+-.++.+++
T Consensus 208 t~l~tAS~gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~davfrlw~--e~q~wt~erw~lgsgrvqt 285 (445)
T KOG2139|consen 208 TILVTASFGSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAVFRLWQ--ENQSWTKERWILGSGRVQT 285 (445)
T ss_pred CEEeecccCcceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEecccceeeeeh--hcccceecceeccCCceee
Confidence 77776652 2999997654433222 2334777788888887776664 34334442 2223443444667889999
Q ss_pred EEec--CCeEEEEEcCceEEEEcCC
Q 003405 163 MSWC--GENICIAIRKGYMILNATN 185 (823)
Q Consensus 163 l~~~--~~~i~v~~~~~y~lidl~~ 185 (823)
-+|. |..|.+++..+=.++.+.-
T Consensus 286 acWspcGsfLLf~~sgsp~lysl~f 310 (445)
T KOG2139|consen 286 ACWSPCGSFLLFACSGSPRLYSLTF 310 (445)
T ss_pred eeecCCCCEEEEEEcCCceEEEEee
Confidence 9997 7889999988877777653
No 128
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=93.13 E-value=2.9 Score=51.43 Aligned_cols=163 Identities=12% Similarity=0.154 Sum_probs=88.8
Q ss_pred CCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc--C-ce
Q 003405 16 SPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR--Q-LL 90 (823)
Q Consensus 16 ~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~--~-~L 90 (823)
.++++|+..+ ++.+.+||+||.+.+.+++..... +......+........++..+...-.. . ++
T Consensus 1098 ~sr~~~vt~~~~~~~~Av~t~DG~v~~~~id~~~~~-----------~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~l 1166 (1431)
T KOG1240|consen 1098 GSRVEKVTMCGNGDQFAVSTKDGSVRVLRIDHYNVS-----------KRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVL 1166 (1431)
T ss_pred CCceEEEEeccCCCeEEEEcCCCeEEEEEccccccc-----------cceeeeeecccccCCCceEEeecccccccceeE
Confidence 4577777666 479999999999999998864321 111111111000012345555433322 2 45
Q ss_pred eeEeC--cEEEEeCCCCcccccc---cCCCCcEEEEeeCCCceEEEEEcCe-EEEEEEcCCCcee-EeeeecCCC--Cce
Q 003405 91 LSLSE--SIAFHRLPNLETIAVL---TKAKGANVYSWDDRRGFLCFARQKR-VCIFRHDGGRGFV-EVKDFGVPD--TVK 161 (823)
Q Consensus 91 l~l~d--~l~~~~L~~l~~~~~i---~~~kg~~~fa~~~~~~~l~V~~kkk-i~l~~~~~~~~f~-~~kei~~~~--~~~ 161 (823)
+..++ ++..|+.....-.-++ .+.--++++|+++...-+++|..++ +.+|.++ |+ .+.+...|. +|+
T Consensus 1167 vy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~G~l~lWDLR----F~~~i~sw~~P~~~~i~ 1242 (1431)
T KOG1240|consen 1167 VYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSRGQLVLWDLR----FRVPILSWEHPARAPIR 1242 (1431)
T ss_pred EEEEeccceEEecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCCceEEEEEee----cCceeecccCcccCCcc
Confidence 55666 3888887654322111 1222478899998876688885554 4444443 42 233333443 455
Q ss_pred EEEec---C--Ce-EEEEE--cCceEEEEcCCCCeeeccC
Q 003405 162 SMSWC---G--EN-ICIAI--RKGYMILNATNGALSEVFP 193 (823)
Q Consensus 162 ~l~~~---~--~~-i~v~~--~~~y~lidl~~~~~~~L~~ 193 (823)
.+..+ + .. +..|. .++..+.|+.+|..+..+-
T Consensus 1243 ~v~~~~~~~~~S~~vs~~~~~~nevs~wn~~~g~~~~vl~ 1282 (1431)
T KOG1240|consen 1243 HVWLCPTYPQESVSVSAGSSSNNEVSTWNMETGLRQTVLW 1282 (1431)
T ss_pred eEEeeccCCCCceEEEecccCCCceeeeecccCcceEEEE
Confidence 55443 2 22 33333 1578888998886555443
No 129
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=92.88 E-value=8.4 Score=42.36 Aligned_cols=136 Identities=15% Similarity=0.194 Sum_probs=85.3
Q ss_pred cEEEEEE--eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 18 KIDAVAS--YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 18 ~I~ci~~--~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
.|+.+.. .|++++-..+||+...+++.... .+......-+.-.++...+=|+..++.+=+
T Consensus 305 ~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~-----------------~lt~vs~~~s~v~~ts~~fHpDgLifgtgt~ 367 (506)
T KOG0289|consen 305 PVTGLSLHPTGEYLLSASNDGTWAFSDISSGS-----------------QLTVVSDETSDVEYTSAAFHPDGLIFGTGTP 367 (506)
T ss_pred cceeeeeccCCcEEEEecCCceEEEEEccCCc-----------------EEEEEeeccccceeEEeeEcCCceEEeccCC
Confidence 3444433 36899999999998877765321 222111111223577777777754444433
Q ss_pred Cc-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEEEcCe-EEEEEEcCCCceeEeeeecCCCC--ceEEEec--C
Q 003405 95 ES-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFARQKR-VCIFRHDGGRGFVEVKDFGVPDT--VKSMSWC--G 167 (823)
Q Consensus 95 d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~~kkk-i~l~~~~~~~~f~~~kei~~~~~--~~~l~~~--~ 167 (823)
|+ |++|++.+-......+ ..-.++.++..++...++++.... +.+|.++. .+.+|.+.+++. +.++.|. |
T Consensus 368 d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLRK---l~n~kt~~l~~~~~v~s~~fD~SG 444 (506)
T KOG0289|consen 368 DGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLRK---LKNFKTIQLDEKKEVNSLSFDQSG 444 (506)
T ss_pred CceEEEEEcCCccccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEehh---hcccceeeccccccceeEEEcCCC
Confidence 35 9999997654333322 223577888899888899998877 66666653 334566777764 8888887 5
Q ss_pred CeEEEE
Q 003405 168 ENICIA 173 (823)
Q Consensus 168 ~~i~v~ 173 (823)
..+.++
T Consensus 445 t~L~~~ 450 (506)
T KOG0289|consen 445 TYLGIA 450 (506)
T ss_pred CeEEee
Confidence 566665
No 130
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=92.85 E-value=39 Score=43.86 Aligned_cols=53 Identities=9% Similarity=0.069 Sum_probs=38.9
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+.++..|++++|+..++.....++ .....+...|..++.+|+|++|...|.+
T Consensus 358 g~~~~~~g~~~eA~~~~~~Al~~~P-----~~~~a~~~Lg~~~~~~g~~~eA~~~y~~ 410 (1157)
T PRK11447 358 GDAALKANNLAQAERLYQQARQVDN-----TDSYAVLGLGDVAMARKDYAAAERYYQQ 410 (1157)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCC-----CCHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 4456789999999999976421111 1123455668999999999999999986
No 131
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=92.69 E-value=19 Score=39.95 Aligned_cols=125 Identities=11% Similarity=0.117 Sum_probs=69.7
Q ss_pred ecCCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEe-CCeEEEEcC-CCccccCCceeecCCC--
Q 003405 165 WCGENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGK-ENIGVFVDQ-NGKLLQADRICWSEAP-- 239 (823)
Q Consensus 165 ~~~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~-~~~gvfv~~-~G~~~~~~~i~w~~~P-- 239 (823)
..++.++++.. .....+|..+|+...-.+.+....|.+ .++.+.++. +...+.+|. +|+ +.|+...
T Consensus 239 ~~~~~vy~~~~~g~l~a~d~~tG~~~W~~~~~~~~~p~~---~~~~vyv~~~~G~l~~~d~~tG~------~~W~~~~~~ 309 (377)
T TIGR03300 239 VDGGQVYAVSYQGRVAALDLRSGRVLWKRDASSYQGPAV---DDNRLYVTDADGVVVALDRRSGS------ELWKNDELK 309 (377)
T ss_pred EECCEEEEEEcCCEEEEEECCCCcEEEeeccCCccCceE---eCCEEEEECCCCeEEEEECCCCc------EEEcccccc
Confidence 34778888776 467889999987655444332222332 344555543 344455554 343 4454321
Q ss_pred ----cEEEEeCCEEEEEeC-CeEEEEEccCCCceeEEEeeCCcccc---cccCCeEEEec-cceEEEee
Q 003405 240 ----IAVIIQKPYAIALLP-RRVEVRSLRVPYALIQTIVLQNVRHL---IPSSNAVVVAL-ENSIFGLF 299 (823)
Q Consensus 240 ----~~v~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~~~~~l---~~~~~~v~v~s-~~~I~~l~ 299 (823)
...+....++++... +.+.+.+.. ++.++.++++.+.... .-.++.+|+++ ++.|++++
T Consensus 310 ~~~~ssp~i~g~~l~~~~~~G~l~~~d~~-tG~~~~~~~~~~~~~~~sp~~~~~~l~v~~~dG~l~~~~ 377 (377)
T TIGR03300 310 YRQLTAPAVVGGYLVVGDFEGYLHWLSRE-DGSFVARLKTDGSGIASPPVVVGDGLLVQTRDGDLYAFR 377 (377)
T ss_pred CCccccCEEECCEEEEEeCCCEEEEEECC-CCCEEEEEEcCCCccccCCEEECCEEEEEeCCceEEEeC
Confidence 112334556666665 456667765 5888888877653211 12355677665 55788763
No 132
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms]
Probab=92.64 E-value=0.25 Score=57.23 Aligned_cols=148 Identities=14% Similarity=0.183 Sum_probs=85.8
Q ss_pred ccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCc---eeEeeeecCCCCceEEEecCCeEEEEEcCceEEEEcCC--
Q 003405 111 LTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRG---FVEVKDFGVPDTVKSMSWCGENICIAIRKGYMILNATN-- 185 (823)
Q Consensus 111 i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~---f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~-- 185 (823)
+...+||+.|..+.--.-||++..-.+.+..+...+. +.-.+++...++..+|.+. +.+|+.+. .++|+.-
T Consensus 941 l~apnnlkiFkA~tIEdwilfatqtglfftsisqprNpsriagp~svtslE~mseI~cv---amI~ns~~-qla~iplds 1016 (1265)
T KOG0976|consen 941 LEAPNNLKIFKAGTIEDWILFATQTGLFFTSISQPRNPSRIAGPKSVTSLEPMSEIHCV---AMIGNSKF-QLADIPLDS 1016 (1265)
T ss_pred HhccccceeecccccccceeEeecCCceEEEeecCCCchhhcCccccccccccceeeEE---EEEecCcc-eeecCchhH
Confidence 3455788888665322234555444443333332111 1222344444444444443 55666554 4455432
Q ss_pred ---------CC-eeeccCCCCCCCCEEE-EccCCeEEEEeC----CeEEEEcCCCccccCCceeecCCCcEEEEeCCEEE
Q 003405 186 ---------GA-LSEVFPSGRIGPPLVV-SLLSGELLLGKE----NIGVFVDQNGKLLQADRICWSEAPIAVIIQKPYAI 250 (823)
Q Consensus 186 ---------~~-~~~L~~~~~~~~p~i~-~~~~~EfLL~~~----~~gvfv~~~G~~~~~~~i~w~~~P~~v~~~~PYll 250 (823)
.. ...+||-.....|+-. ..+...|++.++ .++.|++..|+.++...+.|+ .|.++++..||.|
T Consensus 1017 L~lamqst~pSirpeVlpef~hvh~i~yhQqngqrfll~sddt~lh~rkyn~trd~fs~~akl~vp-ePlsFies~P~gf 1095 (1265)
T KOG0976|consen 1017 LELAMQSTDPSIRPEVLPEFSHVHPISYHQQNGQRFLLESDDTFLHFRKYNDTRDRFSRTAKLKVP-EPLSFIESEPYGF 1095 (1265)
T ss_pred HHHHHhcCCCccchhhhhhhcCcceeEEEEecccchhhhhhhhHHHHhhhcccchhhhhcccccCC-CchhhhhcCcceE
Confidence 11 1223333222334432 234446666544 578999999998888899999 9999999999999
Q ss_pred EEeCCeEEEEEcc
Q 003405 251 ALLPRRVEVRSLR 263 (823)
Q Consensus 251 ~~~~~~ieV~~l~ 263 (823)
++..+++++.-+.
T Consensus 1096 ifa~dtfyyv~ld 1108 (1265)
T KOG0976|consen 1096 IFAFDTFYYVELD 1108 (1265)
T ss_pred EEecceEEEEeec
Confidence 9999988887763
No 133
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=92.55 E-value=26 Score=41.05 Aligned_cols=232 Identities=11% Similarity=0.077 Sum_probs=140.3
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
...+.|...+++++.-|+.+++|+.|+... .+.+...+.+. ...|-.+......++|+.-+
T Consensus 208 ~~~~~~~q~~~~~~~~~s~~~tl~~~~~~~-----------------~~~i~~~l~GH-~g~V~~l~~~~~~~~lvsgS~ 269 (537)
T KOG0274|consen 208 DHVVLCLQLHDGFFKSGSDDSTLHLWDLNN-----------------GYLILTRLVGH-FGGVWGLAFPSGGDKLVSGST 269 (537)
T ss_pred cchhhhheeecCeEEecCCCceeEEeeccc-----------------ceEEEeeccCC-CCCceeEEEecCCCEEEEEec
Confidence 457889999999999999999999998542 23333334443 67899999988778888887
Q ss_pred C-cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEE
Q 003405 95 E-SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENIC 171 (823)
Q Consensus 95 d-~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~ 171 (823)
| .+++|+..+-+-...+. ...-+.-+++.... ..++ .+..|.++.+..+......+. -.++|+++...++.++
T Consensus 270 D~t~rvWd~~sg~C~~~l~-gh~stv~~~~~~~~-~~~sgs~D~tVkVW~v~n~~~l~l~~~--h~~~V~~v~~~~~~lv 345 (537)
T KOG0274|consen 270 DKTERVWDCSTGECTHSLQ-GHTSSVRCLTIDPF-LLVSGSRDNTVKVWDVTNGACLNLLRG--HTGPVNCVQLDEPLLV 345 (537)
T ss_pred CCcEEeEecCCCcEEEEec-CCCceEEEEEccCc-eEeeccCCceEEEEeccCcceEEEecc--ccccEEEEEecCCEEE
Confidence 5 49999976665444332 22223334444333 3343 567788888885532333333 4578999999999999
Q ss_pred EEEcCc-eEEEEcCCCCeeeccCCCCCCCCEEE-EccCCeEEE--EeCCeEEEEcCCCccccCCcee-ecCC---CcEEE
Q 003405 172 IAIRKG-YMILNATNGALSEVFPSGRIGPPLVV-SLLSGELLL--GKENIGVFVDQNGKLLQADRIC-WSEA---PIAVI 243 (823)
Q Consensus 172 v~~~~~-y~lidl~~~~~~~L~~~~~~~~p~i~-~~~~~EfLL--~~~~~gvfv~~~G~~~~~~~i~-w~~~---P~~v~ 243 (823)
.|+-.+ ..+.|..+++...-+.... ..... .++..+.++ +.|.....-|..+.. .++. ..+. ...+.
T Consensus 346 sgs~d~~v~VW~~~~~~cl~sl~gH~--~~V~sl~~~~~~~~~Sgs~D~~IkvWdl~~~~---~c~~tl~~h~~~v~~l~ 420 (537)
T KOG0274|consen 346 SGSYDGTVKVWDPRTGKCLKSLSGHT--GRVYSLIVDSENRLLSGSLDTTIKVWDLRTKR---KCIHTLQGHTSLVSSLL 420 (537)
T ss_pred EEecCceEEEEEhhhceeeeeecCCc--ceEEEEEecCcceEEeeeeccceEeecCCchh---hhhhhhcCCcccccccc
Confidence 888866 5677888765433332211 11111 223314443 344434444443331 1222 2221 13344
Q ss_pred EeCCEEEEEe-CCeEEEEEccCCCceeEEEeeC
Q 003405 244 IQKPYAIALL-PRRVEVRSLRVPYALIQTIVLQ 275 (823)
Q Consensus 244 ~~~PYll~~~-~~~ieV~~l~~~~~lvQ~i~l~ 275 (823)
...-+++.-. ++.|.+.+.. .+...+++.-+
T Consensus 421 ~~~~~Lvs~~aD~~Ik~WD~~-~~~~~~~~~~~ 452 (537)
T KOG0274|consen 421 LRDNFLVSSSADGTIKLWDAE-EGECLRTLEGR 452 (537)
T ss_pred cccceeEeccccccEEEeecc-cCceeeeeccC
Confidence 4445555554 4679999986 58888888764
No 134
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=92.46 E-value=1 Score=47.30 Aligned_cols=103 Identities=16% Similarity=0.244 Sum_probs=66.0
Q ss_pred EEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc-EEEE
Q 003405 23 ASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES-IAFH 100 (823)
Q Consensus 23 ~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~-l~~~ 100 (823)
+-.|+.++.||+.|.+++|+.. +.+++..++..+...|.||.+......+++-+ |. |+.|
T Consensus 162 dr~g~yIitGtsKGkllv~~a~------------------t~e~vas~rits~~~IK~I~~s~~g~~liiNtsDRvIR~y 223 (405)
T KOG1273|consen 162 DRRGKYIITGTSKGKLLVYDAE------------------TLECVASFRITSVQAIKQIIVSRKGRFLIINTSDRVIRTY 223 (405)
T ss_pred cCCCCEEEEecCcceEEEEecc------------------hheeeeeeeechheeeeEEEEeccCcEEEEecCCceEEEE
Confidence 3346899999999999999744 34566666654457899999888877777665 54 8999
Q ss_pred eCCCC---------cccccccCCCC---cEEEEeeCCCceEEEEEcCeEEEEEEc
Q 003405 101 RLPNL---------ETIAVLTKAKG---ANVYSWDDRRGFLCFARQKRVCIFRHD 143 (823)
Q Consensus 101 ~L~~l---------~~~~~i~~~kg---~~~fa~~~~~~~l~V~~kkki~l~~~~ 143 (823)
++.++ ++.+++.+.-+ =+.+|.+.+.-.+|.+..+.=-+|-|.
T Consensus 224 e~~di~~~~r~~e~e~~~K~qDvVNk~~Wk~ccfs~dgeYv~a~s~~aHaLYIWE 278 (405)
T KOG1273|consen 224 EISDIDDEGRDGEVEPEHKLQDVVNKLQWKKCCFSGDGEYVCAGSARAHALYIWE 278 (405)
T ss_pred ehhhhcccCccCCcChhHHHHHHHhhhhhhheeecCCccEEEeccccceeEEEEe
Confidence 87543 23333222111 123445555456777766655566664
No 135
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=92.43 E-value=1.2 Score=49.51 Aligned_cols=145 Identities=13% Similarity=0.262 Sum_probs=91.2
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc-EEEEe
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES-IAFHR 101 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~-l~~~~ 101 (823)
..++.|+||.+.-+|-+|++...+.. .+. +.. -+...-..+.+-|+.++.+.++ || |.+|+
T Consensus 475 pdgrtLivGGeastlsiWDLAapTpr--------------ika--elt-ssapaCyALa~spDakvcFsccsdGnI~vwD 537 (705)
T KOG0639|consen 475 PDGRTLIVGGEASTLSIWDLAAPTPR--------------IKA--ELT-SSAPACYALAISPDAKVCFSCCSDGNIAVWD 537 (705)
T ss_pred CCCceEEeccccceeeeeeccCCCcc--------------hhh--hcC-CcchhhhhhhcCCccceeeeeccCCcEEEEE
Confidence 34678999999999999998754321 111 111 1112344466777888877665 66 99999
Q ss_pred CCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEcCc
Q 003405 102 LPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIRKG 177 (823)
Q Consensus 102 L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~~ 177 (823)
|.+-..+.++. -..|++++.+..+...|--+ ..+.+.-+....+ +++.+..+...|-++..+ ++.+.||..++
T Consensus 538 Lhnq~~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlreg---rqlqqhdF~SQIfSLg~cP~~dWlavGMens 614 (705)
T KOG0639|consen 538 LHNQTLVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREG---RQLQQHDFSSQIFSLGYCPTGDWLAVGMENS 614 (705)
T ss_pred cccceeeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhh---hhhhhhhhhhhheecccCCCccceeeecccC
Confidence 97644333322 33588888888777666665 4555554444433 334455566778788776 78999999865
Q ss_pred e-EEEEcCCCCe
Q 003405 178 Y-MILNATNGAL 188 (823)
Q Consensus 178 y-~lidl~~~~~ 188 (823)
+ .++..+..+.
T Consensus 615 ~vevlh~skp~k 626 (705)
T KOG0639|consen 615 NVEVLHTSKPEK 626 (705)
T ss_pred cEEEEecCCccc
Confidence 5 4555544333
No 136
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=92.38 E-value=9.2 Score=42.62 Aligned_cols=174 Identities=14% Similarity=0.117 Sum_probs=100.5
Q ss_pred EEEEeCCCCcccccccCCCCcEE-EEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEE
Q 003405 97 IAFHRLPNLETIAVLTKAKGANV-YSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICI 172 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~kg~~~-fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v 172 (823)
|.+.+..+.+.+..++...+++. .+..++...+.|+.+ ..|.++.... .+.++++.....+.++++. |..+++
T Consensus 18 v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~---~~~v~~i~~G~~~~~i~~s~DG~~~~v 94 (369)
T PF02239_consen 18 VAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLAT---GKVVATIKVGGNPRGIAVSPDGKYVYV 94 (369)
T ss_dssp EEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTS---SSEEEEEE-SSEEEEEEE--TTTEEEE
T ss_pred EEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCc---ccEEEEEecCCCcceEEEcCCCCEEEE
Confidence 99999988887777765555544 344565556777754 4566666552 3467788888899999886 668888
Q ss_pred EEc--CceEEEEcCCCCeeeccCCC------CCCCCE--EEEccCCeEEEEeC--CeEEEEcCCC-ccccCCceeecCCC
Q 003405 173 AIR--KGYMILNATNGALSEVFPSG------RIGPPL--VVSLLSGELLLGKE--NIGVFVDQNG-KLLQADRICWSEAP 239 (823)
Q Consensus 173 ~~~--~~y~lidl~~~~~~~L~~~~------~~~~p~--i~~~~~~EfLL~~~--~~gvfv~~~G-~~~~~~~i~w~~~P 239 (823)
++. ..+.++|..+.+...-++.+ ...++. +..-...+|+++-. +....+|... .+.....+.=...|
T Consensus 95 ~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i~~g~~~ 174 (369)
T PF02239_consen 95 ANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDPKNLKVTTIKVGRFP 174 (369)
T ss_dssp EEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTTSSCEEEEEEE--TTE
T ss_pred EecCCCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEeccccccceeeecccccc
Confidence 864 78999999988765544432 112332 22235668888643 3556677433 22222344445567
Q ss_pred cEEEEeCC--EEEEE-e-CCeEEEEEccCCCceeEEEee
Q 003405 240 IAVIIQKP--YAIAL-L-PRRVEVRSLRVPYALIQTIVL 274 (823)
Q Consensus 240 ~~v~~~~P--Yll~~-~-~~~ieV~~l~~~~~lvQ~i~l 274 (823)
....+... |+++- . ++.+-|.+.. ++.++..+..
T Consensus 175 ~D~~~dpdgry~~va~~~sn~i~viD~~-~~k~v~~i~~ 212 (369)
T PF02239_consen 175 HDGGFDPDGRYFLVAANGSNKIAVIDTK-TGKLVALIDT 212 (369)
T ss_dssp EEEEE-TTSSEEEEEEGGGTEEEEEETT-TTEEEEEEE-
T ss_pred cccccCcccceeeecccccceeEEEeec-cceEEEEeec
Confidence 77777643 55543 3 4678888876 4666655543
No 137
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=92.38 E-value=14 Score=37.57 Aligned_cols=260 Identities=15% Similarity=0.141 Sum_probs=156.9
Q ss_pred CcEEEEEE--eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVAS--YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~--~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|.++-- .|++.+...+|-++..|+.... .++++..+. ...|....+......+..++
T Consensus 18 gaV~avryN~dGnY~ltcGsdrtvrLWNp~rg------------------~liktYsgh-G~EVlD~~~s~Dnskf~s~G 78 (307)
T KOG0316|consen 18 GAVRAVRYNVDGNYCLTCGSDRTVRLWNPLRG------------------ALIKTYSGH-GHEVLDAALSSDNSKFASCG 78 (307)
T ss_pred cceEEEEEccCCCEEEEcCCCceEEeeccccc------------------ceeeeecCC-CceeeeccccccccccccCC
Confidence 44555443 3577888888999999974432 245565554 45677777776666666665
Q ss_pred C-c-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeec-CCCCceEEEecCCe
Q 003405 95 E-S-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCGEN 169 (823)
Q Consensus 95 d-~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~~~ 169 (823)
+ . +.+|+..+-+...+.. ..-.++.+..+++...++-+ ...++.+|.-+.. .|..++-+. .-|.+.++...+.-
T Consensus 79 gDk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~-s~ePiQildea~D~V~Si~v~~he 157 (307)
T KOG0316|consen 79 GDKAVQVWDVNTGKVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSR-SFEPIQILDEAKDGVSSIDVAEHE 157 (307)
T ss_pred CCceEEEEEcccCeeeeecccccceeeEEEecCcceEEEeccccceeEEEEcccC-CCCccchhhhhcCceeEEEecccE
Confidence 4 2 9999987765443321 12247888889888766665 7788888877633 466555432 35789999988888
Q ss_pred EEEEEc-CceEEEEcCCCCee-eccCCCCCCCCEEEEccCCeEEE--EeCCeEEEEcC-CCcccc--CCceeecCCCcE-
Q 003405 170 ICIAIR-KGYMILNATNGALS-EVFPSGRIGPPLVVSLLSGELLL--GKENIGVFVDQ-NGKLLQ--ADRICWSEAPIA- 241 (823)
Q Consensus 170 i~v~~~-~~y~lidl~~~~~~-~L~~~~~~~~p~i~~~~~~EfLL--~~~~~gvfv~~-~G~~~~--~~~i~w~~~P~~- 241 (823)
|+-|+. ..|..+|+..|+.. +.+.. .-.++...+++.+.| +.|...-.+|. .|+... ++-.. .++...
T Consensus 158 IvaGS~DGtvRtydiR~G~l~sDy~g~---pit~vs~s~d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn-~eykldc 233 (307)
T KOG0316|consen 158 IVAGSVDGTVRTYDIRKGTLSSDYFGH---PITSVSFSKDGNCSLASSLDSTLRLLDKETGKLLKSYKGHKN-MEYKLDC 233 (307)
T ss_pred EEeeccCCcEEEEEeecceeehhhcCC---cceeEEecCCCCEEEEeeccceeeecccchhHHHHHhccccc-ceeeeee
Confidence 888877 56899999988643 23322 113455556777766 34555555553 454331 01000 011111
Q ss_pred -EEEeCCEEEEEeC-CeEEEEEccCCCceeEEEeeCCcccc-----cccCCeEEEeccceEEEeecc
Q 003405 242 -VIIQKPYAIALLP-RRVEVRSLRVPYALIQTIVLQNVRHL-----IPSSNAVVVALENSIFGLFPV 301 (823)
Q Consensus 242 -v~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~~~~~l-----~~~~~~v~v~s~~~I~~l~~~ 301 (823)
+.-..-++++-++ +.+-++++.+ ..++-.++..+...+ .+....+++|+++.++....-
T Consensus 234 ~l~qsdthV~sgSEDG~Vy~wdLvd-~~~~sk~~~~~~v~v~dl~~hp~~~~f~~A~~~~~~~~~~~ 299 (307)
T KOG0316|consen 234 CLNQSDTHVFSGSEDGKVYFWDLVD-ETQISKLSVVSTVIVTDLSCHPTMDDFITATGHGDLFWYQE 299 (307)
T ss_pred eecccceeEEeccCCceEEEEEecc-ceeeeeeccCCceeEEeeecccCccceeEecCCceeceeeh
Confidence 1222345666665 4677888874 666666555543211 123456888888877665443
No 138
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=92.31 E-value=9.9 Score=46.29 Aligned_cols=126 Identities=16% Similarity=0.269 Sum_probs=76.1
Q ss_pred CCcEEEE--EEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcc-cccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAV--ASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDY-QSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci--~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~-~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
...|+|+ ...|.+|+.|++|+.+.+|.-.+ .+...-.+.. +.-....|+....+.+ +..-|..+.--|...+|++
T Consensus 69 ~~sv~CVR~S~dG~~lAsGSDD~~v~iW~~~~-~~~~~~fgs~g~~~~vE~wk~~~~l~~-H~~DV~Dv~Wsp~~~~lvS 146 (942)
T KOG0973|consen 69 DGSVNCVRFSPDGSYLASGSDDRLVMIWERAE-IGSGTVFGSTGGAKNVESWKVVSILRG-HDSDVLDVNWSPDDSLLVS 146 (942)
T ss_pred cCceeEEEECCCCCeEeeccCcceEEEeeecc-cCCcccccccccccccceeeEEEEEec-CCCccceeccCCCccEEEE
Confidence 3578887 34467999999999999998663 1110000000 0001123444444444 3567999999997777777
Q ss_pred Ee-Cc-EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEc
Q 003405 93 LS-ES-IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHD 143 (823)
Q Consensus 93 l~-d~-l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~ 143 (823)
++ |+ |.+|+..+|+.+..+.... -|.-+++|+-...+|.- -.|.|.+|+..
T Consensus 147 ~s~DnsViiwn~~tF~~~~vl~~H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt~ 201 (942)
T KOG0973|consen 147 VSLDNSVIIWNAKTFELLKVLRGHQSLVKGVSWDPIGKYFASQSDDRTLKVWRTS 201 (942)
T ss_pred ecccceEEEEccccceeeeeeecccccccceEECCccCeeeeecCCceEEEEEcc
Confidence 76 44 9999999886443321111 12234557766667775 56778888855
No 139
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.27 E-value=3.9 Score=44.51 Aligned_cols=143 Identities=13% Similarity=0.179 Sum_probs=88.8
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--cEEEEeCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--SIAFHRLP 103 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~l~~~~L~ 103 (823)
|++|.-|++|.+..+|.+..+. .+++.++..++ .++|..|.--|+...|++|+- .+.+|+..
T Consensus 236 GkyLAsaSkD~Taiiw~v~~d~---------------~~kl~~tlvgh-~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~ 299 (519)
T KOG0293|consen 236 GKYLASASKDSTAIIWIVVYDV---------------HFKLKKTLVGH-SQPVSYIMWSPDDRYLLACGFDEVLSLWDVD 299 (519)
T ss_pred CeeEeeccCCceEEEEEEecCc---------------ceeeeeeeecc-cCceEEEEECCCCCeEEecCchHheeeccCC
Confidence 5789999999999888765432 25555555554 679999999999999999875 48999987
Q ss_pred CCcccccccCC--CCcEEEEeeCCCceEEEEEcCeEEEEEEcC-CCceeEeeeecCCCCceEEEec--CCe-EEEEEcCc
Q 003405 104 NLETIAVLTKA--KGANVYSWDDRRGFLCFARQKRVCIFRHDG-GRGFVEVKDFGVPDTVKSMSWC--GEN-ICIAIRKG 177 (823)
Q Consensus 104 ~l~~~~~i~~~--kg~~~fa~~~~~~~l~V~~kkki~l~~~~~-~~~f~~~kei~~~~~~~~l~~~--~~~-i~v~~~~~ 177 (823)
+-+....-+.. -.+++.|..++.-++++|..++- ++.|+- ++.....+.+..| .+..|+.. |.. +.++....
T Consensus 300 tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~-i~~wdlDgn~~~~W~gvr~~-~v~dlait~Dgk~vl~v~~d~~ 377 (519)
T KOG0293|consen 300 TGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRT-IIMWDLDGNILGNWEGVRDP-KVHDLAITYDGKYVLLVTVDKK 377 (519)
T ss_pred cchhhhhcccCcCCCcceeEEccCCceeEecCCCCc-EEEecCCcchhhcccccccc-eeEEEEEcCCCcEEEEEecccc
Confidence 64433222222 34567788888767777754432 445542 2222222222222 34444443 555 44455577
Q ss_pred eEEEEcCCC
Q 003405 178 YMILNATNG 186 (823)
Q Consensus 178 y~lidl~~~ 186 (823)
..+++..+.
T Consensus 378 i~l~~~e~~ 386 (519)
T KOG0293|consen 378 IRLYNREAR 386 (519)
T ss_pred eeeechhhh
Confidence 788887654
No 140
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=92.25 E-value=2.4 Score=46.18 Aligned_cols=87 Identities=15% Similarity=0.164 Sum_probs=59.4
Q ss_pred EEEEeCCC-------CcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCe
Q 003405 97 IAFHRLPN-------LETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGEN 169 (823)
Q Consensus 97 l~~~~L~~-------l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~ 169 (823)
+.+|.+.+ ++.++...-.-+|++++.- .+.++++.+++|.+|++..++.+...-....+-.++++...++.
T Consensus 64 i~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~--~~~lv~~~g~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~~~~~ 141 (321)
T PF03178_consen 64 ILVFEISESPENNFKLKLIHSTEVKGPVTAICSF--NGRLVVAVGNKLYVYDLDNSKTLLKKAFYDSPFYITSLSVFKNY 141 (321)
T ss_dssp EEEEEECSS-----EEEEEEEEEESS-EEEEEEE--TTEEEEEETTEEEEEEEETTSSEEEEEEE-BSSSEEEEEEETTE
T ss_pred EEEEEEEcccccceEEEEEEEEeecCcceEhhhh--CCEEEEeecCEEEEEEccCcccchhhheecceEEEEEEeccccE
Confidence 77777655 2233332222234555444 56799999999999999966436555566677799999999999
Q ss_pred EEEEEc-CceEEEEcCC
Q 003405 170 ICIAIR-KGYMILNATN 185 (823)
Q Consensus 170 i~v~~~-~~y~lidl~~ 185 (823)
|++|.. ++..++..+.
T Consensus 142 I~vgD~~~sv~~~~~~~ 158 (321)
T PF03178_consen 142 ILVGDAMKSVSLLRYDE 158 (321)
T ss_dssp EEEEESSSSEEEEEEET
T ss_pred EEEEEcccCEEEEEEEc
Confidence 999987 7777775443
No 141
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=92.24 E-value=3.3 Score=44.25 Aligned_cols=123 Identities=16% Similarity=0.290 Sum_probs=84.7
Q ss_pred CCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
.+.|..+-+. +.+++-|+.||+|..|++....+ +. .. ..+++.|..+..=|..+.+.+-
T Consensus 277 ~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt-----------------~~-tl-t~hkksvral~lhP~e~~fASa 337 (460)
T KOG0285|consen 277 TNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKT-----------------MI-TL-THHKKSVRALCLHPKENLFASA 337 (460)
T ss_pred CCcceeEEeecCCCceEEecCCceEEEeeeccCce-----------------eE-ee-ecccceeeEEecCCchhhhhcc
Confidence 3466666666 67999999999999999775321 11 11 1358899999999888877766
Q ss_pred e-CcEEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEEEcCe-EEEEEEcCCCceeEeeeecCCC
Q 003405 94 S-ESIAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFARQKR-VCIFRHDGGRGFVEVKDFGVPD 158 (823)
Q Consensus 94 ~-d~l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~kkk-i~l~~~~~~~~f~~~kei~~~~ 158 (823)
+ |.++-|+++.-+.+.++..-+ .+++.+++.+ +.+++|..+. +.++.|..+..|+....+.-|+
T Consensus 338 s~dnik~w~~p~g~f~~nlsgh~~iintl~~nsD-~v~~~G~dng~~~fwdwksg~nyQ~~~t~vqpG 404 (460)
T KOG0285|consen 338 SPDNIKQWKLPEGEFLQNLSGHNAIINTLSVNSD-GVLVSGGDNGSIMFWDWKSGHNYQRGQTIVQPG 404 (460)
T ss_pred CCccceeccCCccchhhccccccceeeeeeeccC-ceEEEcCCceEEEEEecCcCcccccccccccCC
Confidence 5 469999999877776654433 3566777765 5677776655 5567787665566555454444
No 142
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=92.19 E-value=0.83 Score=51.23 Aligned_cols=142 Identities=15% Similarity=0.193 Sum_probs=72.6
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCccccccccccee--eeeecCCCCCCeeEEEEec--------------------
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYEL--ERTISGFSKKPILSMEVLA-------------------- 85 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l--~~~~~~~~k~~I~qI~~~~-------------------- 85 (823)
.+++||+.|.+..|.+..... ..|.. .... ...+.+|.+|..+.
T Consensus 157 ~L~vGTn~G~v~~fkIlp~~~-------------g~f~v~~~~~~-~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g 222 (395)
T PF08596_consen 157 CLLVGTNSGNVLTFKILPSSN-------------GRFSVQFAGAT-TNHDSPILSIIPINADTGESALATISAMQGLSKG 222 (395)
T ss_dssp EEEEEETTSEEEEEEEEE-GG-------------G-EEEEEEEEE---SS----EEEEEETTT--B-B-BHHHHHGGGGT
T ss_pred EEEEEeCCCCEEEEEEecCCC-------------CceEEEEeecc-ccCCCceEEEEEEECCCCCcccCchhHhhccccC
Confidence 799999999999998853221 12332 2221 12356777777773
Q ss_pred --ccCceeeEeC-cEEEEeCCCCcccccccCC-CCcEEEEee----CCC--ceEEEEEcCeEEEEEEcCCCceeEeeeec
Q 003405 86 --SRQLLLSLSE-SIAFHRLPNLETIAVLTKA-KGANVYSWD----DRR--GFLCFARQKRVCIFRHDGGRGFVEVKDFG 155 (823)
Q Consensus 86 --~~~~Ll~l~d-~l~~~~L~~l~~~~~i~~~-kg~~~fa~~----~~~--~~l~V~~kkki~l~~~~~~~~f~~~kei~ 155 (823)
-.++++++++ +++++.+++-+..++..+. ..|...++- ... ..+|+...+.+.+|.+- .++.++++.
T Consensus 223 ~~i~g~vVvvSe~~irv~~~~~~k~~~K~~~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~G~i~i~SLP---~Lkei~~~~ 299 (395)
T PF08596_consen 223 ISIPGYVVVVSESDIRVFKPPKSKGAHKSFDDPFLCSSASVVPTISRNGGYCLVCLFNNGSIRIYSLP---SLKEIKSVS 299 (395)
T ss_dssp ----EEEEEE-SSEEEEE-TT---EEEEE-SS-EEEEEEEEEEEE-EEEEEEEEEEETTSEEEEEETT---T--EEEEEE
T ss_pred CCcCcEEEEEcccceEEEeCCCCcccceeeccccccceEEEEeecccCCceEEEEEECCCcEEEEECC---CchHhhccc
Confidence 1136777777 5999999877655543322 122222221 111 23566678889999987 355666655
Q ss_pred CCC-------CceEEEecCCeEEEEEcCceEEEEcCCC
Q 003405 156 VPD-------TVKSMSWCGENICIAIRKGYMILNATNG 186 (823)
Q Consensus 156 ~~~-------~~~~l~~~~~~i~v~~~~~y~lidl~~~ 186 (823)
+|. .-.++...|+.+++....+..++.+...
T Consensus 300 l~~~~d~~~~~~ssis~~Gdi~~~~gpsE~~l~sv~~~ 337 (395)
T PF08596_consen 300 LPPPLDSRRLSSSSISRNGDIFYWTGPSEIQLFSVWGE 337 (395)
T ss_dssp -SS---HHHHTT-EE-TTS-EEEE-SSSEEEEEEEES-
T ss_pred CCCccccccccccEECCCCCEEEEeCcccEEEEEEEcc
Confidence 542 1234455578788887888887776543
No 143
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.11 E-value=6 Score=42.49 Aligned_cols=159 Identities=11% Similarity=0.107 Sum_probs=80.9
Q ss_pred CCCeeEEEEecccCceeeEe-C-cEEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeE
Q 003405 75 KKPILSMEVLASRQLLLSLS-E-SIAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVE 150 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~-d-~l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~ 150 (823)
+--|..+..+|..+.+++++ | .++.|+..+.--+.+.+.- .=+..|.++.+...++-+ ....+.++.......-..
T Consensus 193 ~h~vS~V~f~P~gd~ilS~srD~tik~We~~tg~cv~t~~~h~ewvr~v~v~~DGti~As~s~dqtl~vW~~~t~~~k~~ 272 (406)
T KOG0295|consen 193 EHGVSSVFFLPLGDHILSCSRDNTIKAWECDTGYCVKTFPGHSEWVRMVRVNQDGTIIASCSNDQTLRVWVVATKQCKAE 272 (406)
T ss_pred ccceeeEEEEecCCeeeecccccceeEEecccceeEEeccCchHhEEEEEecCCeeEEEecCCCceEEEEEeccchhhhh
Confidence 45788899999999999888 4 4999998765433332221 135677777664333333 334566665543211122
Q ss_pred eeeecCCCCceEEEec------------C-----CeEEEEEc-CceEEEEcCCCCe-eeccCCCCCCCCEEEEccCCeEE
Q 003405 151 VKDFGVPDTVKSMSWC------------G-----ENICIAIR-KGYMILNATNGAL-SEVFPSGRIGPPLVVSLLSGELL 211 (823)
Q Consensus 151 ~kei~~~~~~~~l~~~------------~-----~~i~v~~~-~~y~lidl~~~~~-~~L~~~~~~~~p~i~~~~~~EfL 211 (823)
+++.. .++.+++|. | ..+..|.+ +...+.|+.+|.. ..|...+.-. --+..-+.|.||
T Consensus 273 lR~hE--h~vEci~wap~~~~~~i~~at~~~~~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghdnwV-r~~af~p~Gkyi 349 (406)
T KOG0295|consen 273 LREHE--HPVECIAWAPESSYPSISEATGSTNGGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHDNWV-RGVAFSPGGKYI 349 (406)
T ss_pred hhccc--cceEEEEecccccCcchhhccCCCCCccEEEeecccceEEEEeccCCeEEEEEeccccee-eeeEEcCCCeEE
Confidence 33332 245555553 1 13333443 4466777777642 2222222111 112223556676
Q ss_pred E-EeCCeEE-EEcCCCccccCCceeecCCC
Q 003405 212 L-GKENIGV-FVDQNGKLLQADRICWSEAP 239 (823)
Q Consensus 212 L-~~~~~gv-fv~~~G~~~~~~~i~w~~~P 239 (823)
+ |.|+-.+ ..+.... +....|+.++
T Consensus 350 ~ScaDDktlrvwdl~~~---~cmk~~~ah~ 376 (406)
T KOG0295|consen 350 LSCADDKTLRVWDLKNL---QCMKTLEAHE 376 (406)
T ss_pred EEEecCCcEEEEEeccc---eeeeccCCCc
Confidence 6 4554333 3343322 3455566444
No 144
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=92.07 E-value=3.1 Score=47.00 Aligned_cols=156 Identities=16% Similarity=0.150 Sum_probs=93.3
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc-EEEEeCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES-IAFHRLPN 104 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~-l~~~~L~~ 104 (823)
..|+.|+.+|+|-.|+++..... ....++.+-+|.++ +.||-++.+.++...++.-+ || |..|.+|.
T Consensus 307 p~lit~sed~~lk~WnLqk~~~s----------~~~~~epi~tfraH-~gPVl~v~v~~n~~~~ysgg~Dg~I~~w~~p~ 375 (577)
T KOG0642|consen 307 PVLITASEDGTLKLWNLQKAKKS----------AEKDVEPILTFRAH-EGPVLCVVVPSNGEHCYSGGIDGTIRCWNLPP 375 (577)
T ss_pred CeEEEeccccchhhhhhcccCCc----------cccceeeeEEEecc-cCceEEEEecCCceEEEeeccCceeeeeccCC
Confidence 58999999999999999542111 11233334466654 78999999999888888776 55 99998862
Q ss_pred -------Cccc---ccccCCCCcE-EEEeeCCCceEEEE-EcCeEEEEEEcCCCc--eeEeeeecCCCCceEEEecCCe-
Q 003405 105 -------LETI---AVLTKAKGAN-VYSWDDRRGFLCFA-RQKRVCIFRHDGGRG--FVEVKDFGVPDTVKSMSWCGEN- 169 (823)
Q Consensus 105 -------l~~~---~~i~~~kg~~-~fa~~~~~~~l~V~-~kkki~l~~~~~~~~--f~~~kei~~~~~~~~l~~~~~~- 169 (823)
.++. ..+....++. .++++..+.+|... ...++.+++..+... |...+| .+.|+++.+.+..
T Consensus 376 n~dp~ds~dp~vl~~~l~Ghtdavw~l~~s~~~~~Llscs~DgTvr~w~~~~~~~~~f~~~~e---~g~Plsvd~~ss~~ 452 (577)
T KOG0642|consen 376 NQDPDDSYDPSVLSGTLLGHTDAVWLLALSSTKDRLLSCSSDGTVRLWEPTEESPCTFGEPKE---HGYPLSVDRTSSRP 452 (577)
T ss_pred CCCcccccCcchhccceeccccceeeeeecccccceeeecCCceEEeeccCCcCccccCCccc---cCCcceEeeccchh
Confidence 1110 0112222333 46666655555443 566666665553322 433333 3467777776532
Q ss_pred --EEEEEc-CceEEEEcCCCCeeeccCCCC
Q 003405 170 --ICIAIR-KGYMILNATNGALSEVFPSGR 196 (823)
Q Consensus 170 --i~v~~~-~~y~lidl~~~~~~~L~~~~~ 196 (823)
.+...+ ..|.++++..++...+++.+.
T Consensus 453 a~~~~s~~~~~~~~~~~ev~s~~~~~~s~~ 482 (577)
T KOG0642|consen 453 AHSLASFRFGYTSIDDMEVVSDLLIFESSA 482 (577)
T ss_pred HhhhhhcccccccchhhhhhhheeeccccC
Confidence 333344 345667788777777776643
No 145
>PLN03218 maturation of RBCL 1; Provisional
Probab=92.06 E-value=44 Score=42.61 Aligned_cols=50 Identities=12% Similarity=-0.050 Sum_probs=37.2
Q ss_pred chhhHHHHHhhccccHHHHHHHHHHHhC---CCchhHHHHHHHHhcCCCCCcc
Q 003405 729 ALYEERAILLGKMNQHELALSLYVHKVF---LINQPVFLLIRRMAMDIKPLVT 778 (823)
Q Consensus 729 ~l~~e~~~Ll~klg~h~~AL~ilv~~L~---D~~~a~~~~l~~~y~~~~~~~~ 778 (823)
..+.-.+-.|++.|+.++|++++-.-.. .++...|.+|+..|...+..-.
T Consensus 720 vtyN~LI~gy~k~G~~eeAlelf~eM~~~Gi~Pd~~Ty~sLL~a~~k~G~le~ 772 (1060)
T PLN03218 720 STMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADV 772 (1060)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHCCCHHH
Confidence 3466777889999999999999875332 2466789999998888655433
No 146
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=92.03 E-value=1.2 Score=49.32 Aligned_cols=163 Identities=12% Similarity=0.129 Sum_probs=104.0
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
.+.|+-.+- +-+++|+.+|.|..|++.... ++++.. .+-.+|..|..++.....++-+
T Consensus 301 ~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~k------------------vvqeYd-~hLg~i~~i~F~~~g~rFissS 361 (503)
T KOG0282|consen 301 VPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGK------------------VVQEYD-RHLGAILDITFVDEGRRFISSS 361 (503)
T ss_pred CceeeecCCCCCcEEEEecCCCcEEEEeccchH------------------HHHHHH-hhhhheeeeEEccCCceEeeec
Confidence 466776663 458999999999999977432 222222 1246899999999999999999
Q ss_pred C-c-EEEEeCCCCcccccccCCC--CcEEEEeeCCCceE-EEEEcCeEEEEEEcCCCceeE-----eeeecCCCCceEEE
Q 003405 95 E-S-IAFHRLPNLETIAVLTKAK--GANVYSWDDRRGFL-CFARQKRVCIFRHDGGRGFVE-----VKDFGVPDTVKSMS 164 (823)
Q Consensus 95 d-~-l~~~~L~~l~~~~~i~~~k--g~~~fa~~~~~~~l-~V~~kkki~l~~~~~~~~f~~-----~kei~~~~~~~~l~ 164 (823)
| + +.+|....-.++..+.... .+-++++.++.+.+ |=...+.|.+|.... .|+. .+-...++-...+.
T Consensus 362 Ddks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~--~~r~nkkK~feGh~vaGys~~v~ 439 (503)
T KOG0282|consen 362 DDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVP--PFRLNKKKRFEGHSVAGYSCQVD 439 (503)
T ss_pred cCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEEEeccc--ccccCHhhhhcceeccCceeeEE
Confidence 8 3 9999875433332222211 23345566655433 334778888888653 1221 12256677777888
Q ss_pred ec--CCeEEEEEcC-ceEEEEcCCCCeeeccCCCCCCCCEEE
Q 003405 165 WC--GENICIAIRK-GYMILNATNGALSEVFPSGRIGPPLVV 203 (823)
Q Consensus 165 ~~--~~~i~v~~~~-~y~lidl~~~~~~~L~~~~~~~~p~i~ 203 (823)
|. |..||-|... ...++|-.+.+....+... ..||+.
T Consensus 440 fSpDG~~l~SGdsdG~v~~wdwkt~kl~~~lkah--~~~ci~ 479 (503)
T KOG0282|consen 440 FSPDGRTLCSGDSDGKVNFWDWKTTKLVSKLKAH--DQPCIG 479 (503)
T ss_pred EcCCCCeEEeecCCccEEEeechhhhhhhccccC--CcceEE
Confidence 76 6689999885 4677888776554444443 235553
No 147
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=91.99 E-value=2.3 Score=43.29 Aligned_cols=136 Identities=13% Similarity=0.180 Sum_probs=82.5
Q ss_pred CCcEEEEE-EeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVA-SYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~-~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
+..|.-+. +++ +.++=.+.+|+|..|++.... .+++. ..+.+|+.+.+.++ +..+++
T Consensus 143 tg~Ir~v~wc~eD~~iLSSadd~tVRLWD~rTgt------------------~v~sL--~~~s~VtSlEvs~d-G~ilTi 201 (334)
T KOG0278|consen 143 TGGIRTVLWCHEDKCILSSADDKTVRLWDHRTGT------------------EVQSL--EFNSPVTSLEVSQD-GRILTI 201 (334)
T ss_pred CCcceeEEEeccCceEEeeccCCceEEEEeccCc------------------EEEEE--ecCCCCcceeeccC-CCEEEE
Confidence 34565433 344 455555999999999976432 12222 22689999999877 566777
Q ss_pred eCc--EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEc--CCCceeEeeeecCCCCceEEEecCCe
Q 003405 94 SES--IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHD--GGRGFVEVKDFGVPDTVKSMSWCGEN 169 (823)
Q Consensus 94 ~d~--l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~--~~~~f~~~kei~~~~~~~~l~~~~~~ 169 (823)
+++ |.+|+-.+|..+....---+|.+-.+.++. .++|+...-..+|+++ .+.+.... .-.-+++|.|+.|..+.
T Consensus 202 a~gssV~Fwdaksf~~lKs~k~P~nV~SASL~P~k-~~fVaGged~~~~kfDy~TgeEi~~~-nkgh~gpVhcVrFSPdG 279 (334)
T KOG0278|consen 202 AYGSSVKFWDAKSFGLLKSYKMPCNVESASLHPKK-EFFVAGGEDFKVYKFDYNTGEEIGSY-NKGHFGPVHCVRFSPDG 279 (334)
T ss_pred ecCceeEEeccccccceeeccCccccccccccCCC-ceEEecCcceEEEEEeccCCceeeec-ccCCCCceEEEEECCCC
Confidence 773 999999888766443222356666677777 4666656655566654 23222111 00125788999988554
Q ss_pred EEEEE
Q 003405 170 ICIAI 174 (823)
Q Consensus 170 i~v~~ 174 (823)
-.++.
T Consensus 280 E~yAs 284 (334)
T KOG0278|consen 280 ELYAS 284 (334)
T ss_pred ceeec
Confidence 33333
No 148
>PLN03218 maturation of RBCL 1; Provisional
Probab=91.90 E-value=46 Score=42.46 Aligned_cols=41 Identities=12% Similarity=0.115 Sum_probs=30.1
Q ss_pred hhHHHHHhhccccHHHHHHHHHHHhC---CCchhHHHHHHHHhc
Q 003405 731 YEERAILLGKMNQHELALSLYVHKVF---LINQPVFLLIRRMAM 771 (823)
Q Consensus 731 ~~e~~~Ll~klg~h~~AL~ilv~~L~---D~~~a~~~~l~~~y~ 771 (823)
+.-.+-.+++.|+.++|.+++-.-++ .++..+|.+|..++.
T Consensus 757 y~sLL~a~~k~G~le~A~~l~~~M~k~Gi~pd~~tynsLIglc~ 800 (1060)
T PLN03218 757 YSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCITGLCL 800 (1060)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHH
Confidence 34455678999999999999887553 346677888877654
No 149
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=91.79 E-value=27 Score=45.33 Aligned_cols=52 Identities=13% Similarity=0.116 Sum_probs=30.8
Q ss_pred HHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 308 VQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 308 ~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
..++..|++++|+..++.....++ .-..++...|..++.+|+|++|..+|.+
T Consensus 277 ~~~~~~g~~~~A~~~l~~aL~~~P-----~~~~a~~~Lg~~~~~~g~~~eA~~~l~~ 328 (1157)
T PRK11447 277 LAAVDSGQGGKAIPELQQAVRANP-----KDSEALGALGQAYSQQGDRARAVAQFEK 328 (1157)
T ss_pred HHHHHCCCHHHHHHHHHHHHHhCC-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 344567777777777765321111 1123455567777777777777777765
No 150
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=91.75 E-value=1.4 Score=48.82 Aligned_cols=235 Identities=12% Similarity=0.130 Sum_probs=135.3
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
-|+|+.... ..|+=|+-||.|++|++-.+ ..+++++.++ +++|..+.--+....+++.+
T Consensus 216 gvsai~~fp~~~hLlLS~gmD~~vklW~vy~~-----------------~~~lrtf~gH-~k~Vrd~~~s~~g~~fLS~s 277 (503)
T KOG0282|consen 216 GVSAIQWFPKKGHLLLSGGMDGLVKLWNVYDD-----------------RRCLRTFKGH-RKPVRDASFNNCGTSFLSAS 277 (503)
T ss_pred ccchhhhccceeeEEEecCCCceEEEEEEecC-----------------cceehhhhcc-hhhhhhhhccccCCeeeeee
Confidence 466655443 45677899999999987642 2356777765 78999998888887788777
Q ss_pred -Cc-EEEEeCCCCcccccccCCCCcEEEEeeCCCce-EEEE-EcCeEEEEEEcCCCceeEeeeec-CCCCceEEEecC-C
Q 003405 95 -ES-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGF-LCFA-RQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCG-E 168 (823)
Q Consensus 95 -d~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~-l~V~-~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~-~ 168 (823)
|. +++||..+-+.......-+-.+++...++... +.|| .+++|..|.++.++ + ++|+. --+.+..+.|.. +
T Consensus 278 fD~~lKlwDtETG~~~~~f~~~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~k-v--vqeYd~hLg~i~~i~F~~~g 354 (503)
T KOG0282|consen 278 FDRFLKLWDTETGQVLSRFHLDKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGK-V--VQEYDRHLGAILDITFVDEG 354 (503)
T ss_pred cceeeeeeccccceEEEEEecCCCceeeecCCCCCcEEEEecCCCcEEEEeccchH-H--HHHHHhhhhheeeeEEccCC
Confidence 64 99999876554443322232333333444433 3344 77888888777542 2 33431 124677788873 3
Q ss_pred eEEEEEc--CceEEEEcCCCCeeeccC-CCCCCCCEEEEccCCeEEEE--eCCe-EEEEcCCC-cccc--CCceee-cCC
Q 003405 169 NICIAIR--KGYMILNATNGALSEVFP-SGRIGPPLVVSLLSGELLLG--KENI-GVFVDQNG-KLLQ--ADRICW-SEA 238 (823)
Q Consensus 169 ~i~v~~~--~~y~lidl~~~~~~~L~~-~~~~~~p~i~~~~~~EfLL~--~~~~-gvfv~~~G-~~~~--~~~i~w-~~~ 238 (823)
.=++.+. +.+.+.+...+....+.. ....+-|++..-+++.++.| .+|. ++|-...- +..+ +-.=.| .++
T Consensus 355 ~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~vaGy 434 (503)
T KOG0282|consen 355 RRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSVAGY 434 (503)
T ss_pred ceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEEEecccccccCHhhhhcceeccCc
Confidence 3333333 567777766554333322 22224588888888888886 3454 44421111 1111 111112 455
Q ss_pred CcEEEEe--CCEEEEEeC-CeEEEEEccCCCceeEEEee
Q 003405 239 PIAVIIQ--KPYAIALLP-RRVEVRSLRVPYALIQTIVL 274 (823)
Q Consensus 239 P~~v~~~--~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l 274 (823)
+..+.+. .-||+.-.. +.+.+++.+ +..++-++..
T Consensus 435 s~~v~fSpDG~~l~SGdsdG~v~~wdwk-t~kl~~~lka 472 (503)
T KOG0282|consen 435 SCQVDFSPDGRTLCSGDSDGKVNFWDWK-TTKLVSKLKA 472 (503)
T ss_pred eeeEEEcCCCCeEEeecCCccEEEeech-hhhhhhcccc
Confidence 6666654 357766664 568888885 5555544443
No 151
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=91.64 E-value=3.6 Score=41.80 Aligned_cols=136 Identities=14% Similarity=0.192 Sum_probs=85.1
Q ss_pred cEEEEEE--eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVAS--YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~--~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
.|+++-. -++.|+.+..||.++.++++... +.+.+++++. =|-.+..=....-+++=++
T Consensus 116 eINam~ldP~enSi~~AgGD~~~y~~dlE~G~------------------i~r~~rGHtD-YvH~vv~R~~~~qilsG~E 176 (325)
T KOG0649|consen 116 EINAMWLDPSENSILFAGGDGVIYQVDLEDGR------------------IQREYRGHTD-YVHSVVGRNANGQILSGAE 176 (325)
T ss_pred ccceeEeccCCCcEEEecCCeEEEEEEecCCE------------------EEEEEcCCcc-eeeeeeecccCcceeecCC
Confidence 4665544 46778777799999999987421 2345555533 2332222123334455454
Q ss_pred -c-EEEEeCCCCcccccccCCCCcEE---------EEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEE
Q 003405 96 -S-IAFHRLPNLETIAVLTKAKGANV---------YSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMS 164 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~~~kg~~~---------fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~ 164 (823)
| +++|++.+-+.+..++..|+-++ .|+..+..-++.+..+++.++.+... ..+.-|++|.+++-+.
T Consensus 177 DGtvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgGp~lslwhLrss---e~t~vfpipa~v~~v~ 253 (325)
T KOG0649|consen 177 DGTVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGGPKLSLWHLRSS---ESTCVFPIPARVHLVD 253 (325)
T ss_pred CccEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecCCCceeEEeccCC---CceEEEecccceeEee
Confidence 5 99999977654444443332221 33433334577788899999998843 2345688999999999
Q ss_pred ecCCeEEEEEc
Q 003405 165 WCGENICIAIR 175 (823)
Q Consensus 165 ~~~~~i~v~~~ 175 (823)
|..+.+..|-.
T Consensus 254 F~~d~vl~~G~ 264 (325)
T KOG0649|consen 254 FVDDCVLIGGE 264 (325)
T ss_pred eecceEEEecc
Confidence 99888777653
No 152
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=91.63 E-value=18 Score=37.90 Aligned_cols=152 Identities=7% Similarity=0.122 Sum_probs=96.5
Q ss_pred EEEEEEe-CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec-ccCceeeEe-C
Q 003405 19 IDAVASY-GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA-SRQLLLSLS-E 95 (823)
Q Consensus 19 I~ci~~~-~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~-~~~~Ll~l~-d 95 (823)
++|..-- +++|+-|+-|.+..+|+++.. +..+.|.++ ..-|-.|.+.| ..|..++-+ |
T Consensus 148 lScC~f~dD~~ilT~SGD~TCalWDie~g------------------~~~~~f~GH-~gDV~slsl~p~~~ntFvSg~cD 208 (343)
T KOG0286|consen 148 LSCCRFLDDNHILTGSGDMTCALWDIETG------------------QQTQVFHGH-TGDVMSLSLSPSDGNTFVSGGCD 208 (343)
T ss_pred eEEEEEcCCCceEecCCCceEEEEEcccc------------------eEEEEecCC-cccEEEEecCCCCCCeEEecccc
Confidence 5554433 579999999999999998742 123455554 46799999999 556666553 5
Q ss_pred c-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeE
Q 003405 96 S-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENI 170 (823)
Q Consensus 96 ~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i 170 (823)
. -++|++.+-.-..+.+ ....+++++..++..-++-+.. ....+|.++.+++......-...-++++++|. |..+
T Consensus 209 ~~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGSDD~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlL 288 (343)
T KOG0286|consen 209 KSAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGSDDATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLL 288 (343)
T ss_pred cceeeeeccCcceeEeecccccccceEEEccCCCeeeecCCCceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEE
Confidence 4 8999986543222221 2235677777776655666644 45678988876544322222233578899887 6667
Q ss_pred EEEEc-CceEEEEcCCCCee
Q 003405 171 CIAIR-KGYMILNATNGALS 189 (823)
Q Consensus 171 ~v~~~-~~y~lidl~~~~~~ 189 (823)
+.|+. ..-.+.|.-.++..
T Consensus 289 fagy~d~~c~vWDtlk~e~v 308 (343)
T KOG0286|consen 289 FAGYDDFTCNVWDTLKGERV 308 (343)
T ss_pred EeeecCCceeEeeccccceE
Confidence 77765 34556675555443
No 153
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=91.48 E-value=25 Score=38.64 Aligned_cols=246 Identities=17% Similarity=0.249 Sum_probs=137.0
Q ss_pred EEEEEeCC----CcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC------cE
Q 003405 28 KILLGCSD----GSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE------SI 97 (823)
Q Consensus 28 ~L~vGT~~----G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d------~l 97 (823)
.+|||+-. |.|+.|.++.... .......... ...-.-|.+.++.++|.+..+ +|
T Consensus 1 ~~~vgsy~~~~~~gI~~~~~d~~~g--------------~l~~~~~~~~--~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v 64 (345)
T PF10282_consen 1 TLYVGSYTNGKGGGIYVFRFDEETG--------------TLTLVQTVAE--GENPSWLAVSPDGRRLYVVNEGSGDSGGV 64 (345)
T ss_dssp EEEEEECCSSSSTEEEEEEEETTTT--------------EEEEEEEEEE--SSSECCEEE-TTSSEEEEEETTSSTTTEE
T ss_pred CEEEEcCCCCCCCcEEEEEEcCCCC--------------CceEeeeecC--CCCCceEEEEeCCCEEEEEEccccCCCCE
Confidence 47899888 7899999854332 2222222221 245566778888899988865 28
Q ss_pred EEEeCCC----Cccccccc-CCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEee-ee-----------cCCC
Q 003405 98 AFHRLPN----LETIAVLT-KAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVK-DF-----------GVPD 158 (823)
Q Consensus 98 ~~~~L~~----l~~~~~i~-~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~k-ei-----------~~~~ 158 (823)
..|.+.. ++.+.++. .-.+...++++++...++|+ ....+.+|.+..+....... .+ ....
T Consensus 65 ~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~ 144 (345)
T PF10282_consen 65 SSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGP 144 (345)
T ss_dssp EEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSST
T ss_pred EEEEECCCcceeEEeeeeccCCCCcEEEEEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccc
Confidence 8887654 34444433 33444567888888888888 46789999998543333221 11 1223
Q ss_pred CceEEEec--CCeEEEEEc--CceEEEEcCCCC--eee----ccCCCCCCCCEEE-EccCCeEEEEe---CC-eEEE-Ec
Q 003405 159 TVKSMSWC--GENICIAIR--KGYMILNATNGA--LSE----VFPSGRIGPPLVV-SLLSGELLLGK---EN-IGVF-VD 222 (823)
Q Consensus 159 ~~~~l~~~--~~~i~v~~~--~~y~lidl~~~~--~~~----L~~~~~~~~p~i~-~~~~~EfLL~~---~~-~gvf-v~ 222 (823)
.|.++.+. |+.++++.. ....+++++.+. ... -++.|. -|.-+ .-+++.++.+. ++ ..+| ++
T Consensus 145 h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~--GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~ 222 (345)
T PF10282_consen 145 HPHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGS--GPRHLAFSPDGKYAYVVNELSNTVSVFDYD 222 (345)
T ss_dssp CEEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTS--SEEEEEE-TTSSEEEEEETTTTEEEEEEEE
T ss_pred cceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCC--CCcEEEEcCCcCEEEEecCCCCcEEEEeec
Confidence 56677777 457777765 457788887654 322 124443 35433 33566655432 23 3333 34
Q ss_pred -CCCccccC---Cce--eecC--CCcEEEEe--CCEEEEEeC--CeEEEEEcc-CCC--ceeEEEeeCC--cccc--ccc
Q 003405 223 -QNGKLLQA---DRI--CWSE--APIAVIIQ--KPYAIALLP--RRVEVRSLR-VPY--ALIQTIVLQN--VRHL--IPS 283 (823)
Q Consensus 223 -~~G~~~~~---~~i--~w~~--~P~~v~~~--~PYll~~~~--~~ieV~~l~-~~~--~lvQ~i~l~~--~~~l--~~~ 283 (823)
..|..... .++ .|.. .|..+++. .-||++-.. +.|-|+++. .++ ..++.++..+ ++.+ .+.
T Consensus 223 ~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~ 302 (345)
T PF10282_consen 223 PSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPD 302 (345)
T ss_dssp TTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TT
T ss_pred ccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCC
Confidence 34532211 111 2433 36677776 668888775 689999983 223 4566676633 2333 345
Q ss_pred CCeEEEec
Q 003405 284 SNAVVVAL 291 (823)
Q Consensus 284 ~~~v~v~s 291 (823)
++.++++.
T Consensus 303 g~~l~Va~ 310 (345)
T PF10282_consen 303 GRYLYVAN 310 (345)
T ss_dssp SSEEEEEE
T ss_pred CCEEEEEe
Confidence 55566654
No 154
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=91.45 E-value=0.57 Score=51.48 Aligned_cols=141 Identities=18% Similarity=0.320 Sum_probs=96.1
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E- 95 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d- 95 (823)
.+.|--.|+.-+-+|-+.|+|..|.-...+ + +++- . -++.+|..|.+-+.+..+++-+ |
T Consensus 255 ~vm~qNP~NaVih~GhsnGtVSlWSP~ske---------------P--LvKi-L-cH~g~V~siAv~~~G~YMaTtG~Dr 315 (545)
T KOG1272|consen 255 DVMKQNPYNAVIHLGHSNGTVSLWSPNSKE---------------P--LVKI-L-CHRGPVSSIAVDRGGRYMATTGLDR 315 (545)
T ss_pred chhhcCCccceEEEcCCCceEEecCCCCcc---------------h--HHHH-H-hcCCCcceEEECCCCcEEeeccccc
Confidence 455566677788999999999999733211 1 1111 1 1378999999999888887765 4
Q ss_pred cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcC--C--CceeEeeeecCCCCceEEEec--CCe
Q 003405 96 SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDG--G--RGFVEVKDFGVPDTVKSMSWC--GEN 169 (823)
Q Consensus 96 ~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~--~--~~f~~~kei~~~~~~~~l~~~--~~~ 169 (823)
.+++|||..+....+...--+++..+++. +|.++++...-+.||.-.. + ..+--+ .-.++.+|..+.|+ .|.
T Consensus 316 ~~kIWDlR~~~ql~t~~tp~~a~~ls~Sq-kglLA~~~G~~v~iw~d~~~~s~~~~~pYm-~H~~~~~V~~l~FcP~EDv 393 (545)
T KOG1272|consen 316 KVKIWDLRNFYQLHTYRTPHPASNLSLSQ-KGLLALSYGDHVQIWKDALKGSGHGETPYM-NHRCGGPVEDLRFCPYEDV 393 (545)
T ss_pred ceeEeeeccccccceeecCCCcccccccc-ccceeeecCCeeeeehhhhcCCCCCCcchh-hhccCcccccceeccHHHe
Confidence 39999999887665544434566666664 5788999999999887432 1 111111 12456788899888 589
Q ss_pred EEEEEcCceE
Q 003405 170 ICIAIRKGYM 179 (823)
Q Consensus 170 i~v~~~~~y~ 179 (823)
|+||...++.
T Consensus 394 LGIGH~~G~t 403 (545)
T KOG1272|consen 394 LGIGHAGGIT 403 (545)
T ss_pred eeccccCCce
Confidence 9999977664
No 155
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=91.43 E-value=8.2 Score=40.80 Aligned_cols=151 Identities=11% Similarity=0.165 Sum_probs=87.9
Q ss_pred CCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccc------cccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEc
Q 003405 74 SKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIA------VLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHD 143 (823)
Q Consensus 74 ~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~------~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~ 143 (823)
+++.|+.+....+...|.++|+ + |.+|++.+|.... +++ ....+.++..++-..++|. ...+|.+|...
T Consensus 85 H~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~R~nve-~dhpT~V~FapDc~s~vv~~~~g~~l~vyk~~ 163 (420)
T KOG2096|consen 85 HKKEVTDVAFSSDGKKLATISGDRSIRLWDVRDFENKEHRCIRQNVE-YDHPTRVVFAPDCKSVVVSVKRGNKLCVYKLV 163 (420)
T ss_pred cCCceeeeEEcCCCceeEEEeCCceEEEEecchhhhhhhhHhhcccc-CCCceEEEECCCcceEEEEEccCCEEEEEEee
Confidence 4779999999999999999997 4 9999998875321 122 1245666666665555555 44678888753
Q ss_pred ----CCCceeEeee--ecCC----CCceEEEecCCeEEEEEc---CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeE
Q 003405 144 ----GGRGFVEVKD--FGVP----DTVKSMSWCGENICIAIR---KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGEL 210 (823)
Q Consensus 144 ----~~~~f~~~ke--i~~~----~~~~~l~~~~~~i~v~~~---~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~Ef 210 (823)
++..+..+++ ..++ -.+..+...|+..++.+. +..++.++. |+...-........-.....+++.|
T Consensus 164 K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~~k~imsas~dt~i~lw~lk-Gq~L~~idtnq~~n~~aavSP~GRF 242 (420)
T KOG2096|consen 164 KKTDGSGSHHFVHIDNLEFERKHQVDIINIGIAGNAKYIMSASLDTKICLWDLK-GQLLQSIDTNQSSNYDAAVSPDGRF 242 (420)
T ss_pred ecccCCCCcccccccccccchhcccceEEEeecCCceEEEEecCCCcEEEEecC-CceeeeeccccccccceeeCCCCcE
Confidence 1112222221 2222 144556666776555544 567888887 5543333332222223334578888
Q ss_pred EEEeC---Ce---EEEEcCCCc
Q 003405 211 LLGKE---NI---GVFVDQNGK 226 (823)
Q Consensus 211 LL~~~---~~---gvfv~~~G~ 226 (823)
+.++. +. -+++..+|.
T Consensus 243 ia~~gFTpDVkVwE~~f~kdG~ 264 (420)
T KOG2096|consen 243 IAVSGFTPDVKVWEPIFTKDGT 264 (420)
T ss_pred EEEecCCCCceEEEEEeccCcc
Confidence 87543 32 234566774
No 156
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=91.42 E-value=13 Score=44.09 Aligned_cols=173 Identities=9% Similarity=0.145 Sum_probs=104.4
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe--Cc-EEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS--ES-IAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~--d~-l~~~~L~ 103 (823)
++|+=..-|-++..|++... ++.+.|.+ ..-|+.|..-|--+.-++=+ |+ +.+|.++
T Consensus 381 ~fLLSSSMDKTVRLWh~~~~------------------~CL~~F~H--ndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~ 440 (712)
T KOG0283|consen 381 NFLLSSSMDKTVRLWHPGRK------------------ECLKVFSH--NDFVTCVAFNPVDDRYFISGSLDGKVRLWSIS 440 (712)
T ss_pred CeeEeccccccEEeecCCCc------------------ceeeEEec--CCeeEEEEecccCCCcEeecccccceEEeecC
Confidence 57777888999999986642 23344432 46799998888665544332 44 9999987
Q ss_pred CCcccccccCC-CCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC-------CCCceEEEec-CC--eEE
Q 003405 104 NLETIAVLTKA-KGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV-------PDTVKSMSWC-GE--NIC 171 (823)
Q Consensus 104 ~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~-------~~~~~~l~~~-~~--~i~ 171 (823)
+-+...- .+. .=|+++|..++..-.+|| .++.+.+|...+. .+..-..|.+ ...|+++.+. ++ .|.
T Consensus 441 d~~Vv~W-~Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~l-k~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vL 518 (712)
T KOG0283|consen 441 DKKVVDW-NDLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGL-KLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVL 518 (712)
T ss_pred cCeeEee-hhhhhhheeEEeccCCceEEEEEeccEEEEEEccCC-eEEEeeeEeeccCccccCceeeeeEecCCCCCeEE
Confidence 6543221 122 347889998885456666 6677778877733 3443333332 2369999998 43 367
Q ss_pred EEEc-CceEEEEcCCCCeeeccCCCCC--CCCEEEEccCCeEEE-EeCCeEEEE
Q 003405 172 IAIR-KGYMILNATNGALSEVFPSGRI--GPPLVVSLLSGELLL-GKENIGVFV 221 (823)
Q Consensus 172 v~~~-~~y~lidl~~~~~~~L~~~~~~--~~p~i~~~~~~EfLL-~~~~~gvfv 221 (823)
|.+. +...|+|..+......|.--.+ .+-..-...+++++| +.++..|++
T Consensus 519 VTSnDSrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYi 572 (712)
T KOG0283|consen 519 VTSNDSRIRIYDGRDKDLVHKFKGFRNTSSQISASFSSDGKHIVSASEDSWVYI 572 (712)
T ss_pred EecCCCceEEEeccchhhhhhhcccccCCcceeeeEccCCCEEEEeecCceEEE
Confidence 7666 7899999876555444431111 111122234666666 355555554
No 157
>PF12341 DUF3639: Protein of unknown function (DUF3639) ; InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=91.30 E-value=0.42 Score=31.16 Aligned_cols=26 Identities=31% Similarity=0.452 Sum_probs=23.7
Q ss_pred CcEEEEEEeCCEEEEEeCCCcEEEEc
Q 003405 17 PKIDAVASYGLKILLGCSDGSLKIYS 42 (823)
Q Consensus 17 ~~I~ci~~~~~~L~vGT~~G~l~~y~ 42 (823)
..|+|++..++++.++|+.|.|.+|.
T Consensus 2 E~i~aia~g~~~vavaTS~~~lRifs 27 (27)
T PF12341_consen 2 EEIEAIAAGDSWVAVATSAGYLRIFS 27 (27)
T ss_pred ceEEEEEccCCEEEEEeCCCeEEecC
Confidence 47999999999999999999998873
No 158
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=91.19 E-value=5.5 Score=43.37 Aligned_cols=22 Identities=23% Similarity=0.368 Sum_probs=19.0
Q ss_pred CEEEEEeCCCcEEEEcCCCCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSES 48 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~ 48 (823)
++||-|..+|.|++|+....++
T Consensus 249 h~IYaGl~nG~VlvyD~R~~~~ 270 (463)
T KOG1645|consen 249 HVIYAGLQNGMVLVYDMRQPEG 270 (463)
T ss_pred ceeEEeccCceEEEEEccCCCc
Confidence 5899999999999999876543
No 159
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=91.18 E-value=9.3 Score=40.32 Aligned_cols=145 Identities=11% Similarity=0.106 Sum_probs=84.5
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
+..|.+.+.-|-...+|...+.|.+|++..-. +.++....- ..-.....++|+.-+.+..+++-++
T Consensus 142 ~~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~d-------------kgPF~tf~i-~~~~~~ew~~l~FS~dGK~iLlsT~ 207 (311)
T KOG1446|consen 142 GRPIAAFDPEGLIFALANGSELIKLYDLRSFD-------------KGPFTTFSI-TDNDEAEWTDLEFSPDGKSILLSTN 207 (311)
T ss_pred CCcceeECCCCcEEEEecCCCeEEEEEecccC-------------CCCceeEcc-CCCCccceeeeEEcCCCCEEEEEeC
Confidence 34566666666677888888888999876532 223322111 1012467999999999887777776
Q ss_pred -c-EEEEeCCCCcccccccCCCC---cE-EEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC
Q 003405 96 -S-IAFHRLPNLETIAVLTKAKG---AN-VYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE 168 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i~~~kg---~~-~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~ 168 (823)
+ +++.+-.+-..+.+....++ .+ ..+..++..++..+ .+++|.+|....+..+..++.- ...++.++.|...
T Consensus 208 ~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~~~-~~~~~~~~~fnP~ 286 (311)
T KOG1446|consen 208 ASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLRGP-NGGPVSCVRFNPR 286 (311)
T ss_pred CCcEEEEEccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEecCC-CCCCccccccCCc
Confidence 3 66666543333333222222 22 23445666666666 4578889888755444444432 3457778888765
Q ss_pred eEEEEEc
Q 003405 169 NICIAIR 175 (823)
Q Consensus 169 ~i~v~~~ 175 (823)
...+++.
T Consensus 287 ~~mf~sa 293 (311)
T KOG1446|consen 287 YAMFVSA 293 (311)
T ss_pred eeeeeec
Confidence 5444444
No 160
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=91.17 E-value=5.2 Score=40.52 Aligned_cols=80 Identities=16% Similarity=0.312 Sum_probs=61.8
Q ss_pred CcccccccccCCCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEE
Q 003405 4 NAFDSLELISNCSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 4 ~af~~~~l~~~~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
..|++.+++....-.|++|+..+.-|+-|+-||++..|++....- . ...+ ..||+.+..
T Consensus 133 ~s~ePiQildea~D~V~Si~v~~heIvaGS~DGtvRtydiR~G~l------------------~--sDy~-g~pit~vs~ 191 (307)
T KOG0316|consen 133 RSFEPIQILDEAKDGVSSIDVAEHEIVAGSVDGTVRTYDIRKGTL------------------S--SDYF-GHPITSVSF 191 (307)
T ss_pred CCCCccchhhhhcCceeEEEecccEEEeeccCCcEEEEEeeccee------------------e--hhhc-CCcceeEEe
Confidence 357788888888889999999999999999999999999875321 1 0122 469999999
Q ss_pred ecccCceeeEe-Cc-EEEEeCCC
Q 003405 84 LASRQLLLSLS-ES-IAFHRLPN 104 (823)
Q Consensus 84 ~~~~~~Ll~l~-d~-l~~~~L~~ 104 (823)
-+..|..++=+ |+ +++.+=.+
T Consensus 192 s~d~nc~La~~l~stlrLlDk~t 214 (307)
T KOG0316|consen 192 SKDGNCSLASSLDSTLRLLDKET 214 (307)
T ss_pred cCCCCEEEEeeccceeeecccch
Confidence 99999877655 43 77766443
No 161
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=91.16 E-value=11 Score=43.01 Aligned_cols=181 Identities=18% Similarity=0.147 Sum_probs=104.3
Q ss_pred CCeeEEEEecccCceeeEeCc---EEEEeCCCCcccc-cccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeE
Q 003405 76 KPILSMEVLASRQLLLSLSES---IAFHRLPNLETIA-VLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVE 150 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d~---l~~~~L~~l~~~~-~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~ 150 (823)
..=+.|.+-|.+..+++.+-. |++|++.++.... .-...-.|.+.-++++-..+|+. ..+.|-+ .-..|..+
T Consensus 52 ~ast~ik~s~DGqY~lAtG~YKP~ikvydlanLSLKFERhlDae~V~feiLsDD~SK~v~L~~DR~Ief-Hak~G~hy-- 128 (703)
T KOG2321|consen 52 TASTRIKVSPDGQYLLATGTYKPQIKVYDLANLSLKFERHLDAEVVDFEILSDDYSKSVFLQNDRTIEF-HAKYGRHY-- 128 (703)
T ss_pred cccceeEecCCCcEEEEecccCCceEEEEcccceeeeeecccccceeEEEeccchhhheEeecCceeee-hhhcCeee--
Confidence 456779999999999988773 9999998765321 11233455666667776666655 4455532 22223211
Q ss_pred eeeecCCCCceEEEec---CCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeC---CeEEEEcCC
Q 003405 151 VKDFGVPDTVKSMSWC---GENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKE---NIGVFVDQN 224 (823)
Q Consensus 151 ~kei~~~~~~~~l~~~---~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~---~~gvfv~~~ 224 (823)
.+-+|-.-+.|++. -+..|+|+..+.+-+|+..|+...-|...... --++.++...=|||++ +..-|.|.-
T Consensus 129 --~~RIP~~GRDm~y~~~scDly~~gsg~evYRlNLEqGrfL~P~~~~~~~-lN~v~in~~hgLla~Gt~~g~VEfwDpR 205 (703)
T KOG2321|consen 129 --RTRIPKFGRDMKYHKPSCDLYLVGSGSEVYRLNLEQGRFLNPFETDSGE-LNVVSINEEHGLLACGTEDGVVEFWDPR 205 (703)
T ss_pred --eeecCcCCccccccCCCccEEEeecCcceEEEEcccccccccccccccc-ceeeeecCccceEEecccCceEEEecch
Confidence 12245555566554 36799999999999999999865555442211 1123333322344433 344566643
Q ss_pred CccccCCceeecCC------------CcEEEEeC-CEEEEE--eCCeEEEEEcc
Q 003405 225 GKLLQADRICWSEA------------PIAVIIQK-PYAIAL--LPRRVEVRSLR 263 (823)
Q Consensus 225 G~~~~~~~i~w~~~------------P~~v~~~~-PYll~~--~~~~ieV~~l~ 263 (823)
-+ ++-+++..... |.++.|.. +-=+|+ ..+.+-||+++
T Consensus 206 ~k-srv~~l~~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLR 258 (703)
T KOG2321|consen 206 DK-SRVGTLDAASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLR 258 (703)
T ss_pred hh-hhheeeecccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcc
Confidence 32 22333433322 55666655 444444 44789999987
No 162
>PLN03077 Protein ECB2; Provisional
Probab=91.10 E-value=50 Score=41.34 Aligned_cols=61 Identities=18% Similarity=0.267 Sum_probs=43.4
Q ss_pred HHHHHHHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcc
Q 003405 506 ILDTALLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEES 576 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~ 576 (823)
.+.++|+..|.+.++.+....+++.=. . ...--|..++.-|.+.|++++|++++.+.....
T Consensus 324 ~~~n~Li~~y~k~g~~~~A~~vf~~m~--~--------~d~~s~n~li~~~~~~g~~~~A~~lf~~M~~~g 384 (857)
T PLN03077 324 SVCNSLIQMYLSLGSWGEAEKVFSRME--T--------KDAVSWTAMISGYEKNGLPDKALETYALMEQDN 384 (857)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHhhCC--C--------CCeeeHHHHHHHHHhCCCHHHHHHHHHHHHHhC
Confidence 577899999999877555444443200 0 011348899999999999999999999876443
No 163
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=91.10 E-value=8 Score=43.11 Aligned_cols=254 Identities=13% Similarity=0.150 Sum_probs=124.8
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC---cEEEE
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE---SIAFH 100 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d---~l~~~ 100 (823)
..++++|+.+.||.|.++++... +++++++. + ..-..+.+.+....+++-+- .+.++
T Consensus 46 ~Dgr~~yv~~rdg~vsviD~~~~------------------~~v~~i~~-G-~~~~~i~~s~DG~~~~v~n~~~~~v~v~ 105 (369)
T PF02239_consen 46 PDGRYLYVANRDGTVSVIDLATG------------------KVVATIKV-G-GNPRGIAVSPDGKYVYVANYEPGTVSVI 105 (369)
T ss_dssp T-SSEEEEEETTSEEEEEETTSS------------------SEEEEEE--S-SEEEEEEE--TTTEEEEEEEETTEEEEE
T ss_pred CCCCEEEEEcCCCeEEEEECCcc------------------cEEEEEec-C-CCcceEEEcCCCCEEEEEecCCCceeEe
Confidence 34679999999999999987642 23445442 2 33455777887777665542 39999
Q ss_pred eCCCCcccccccCC--------CCcEEEEeeCCCceEEEEEc--CeEEEEEEcCCCceeEeeeecCCCCceEEEecC--C
Q 003405 101 RLPNLETIAVLTKA--------KGANVYSWDDRRGFLCFARQ--KRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG--E 168 (823)
Q Consensus 101 ~L~~l~~~~~i~~~--------kg~~~fa~~~~~~~l~V~~k--kki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~--~ 168 (823)
+..+++++..++.. ..+..+...+.....+++.+ .+|.+..+... .....+.+.....+....|.. .
T Consensus 106 D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~-~~~~~~~i~~g~~~~D~~~dpdgr 184 (369)
T PF02239_consen 106 DAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDP-KNLKVTTIKVGRFPHDGGFDPDGR 184 (369)
T ss_dssp ETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTTS-SCEEEEEEE--TTEEEEEE-TTSS
T ss_pred ccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEeccc-cccceeeecccccccccccCcccc
Confidence 99999877654321 12233333444445666666 45655555533 233445566667777888873 4
Q ss_pred eEEEEEc--CceEEEEcCCCCeeeccCCCCCCCC--EEEEc-----------cCCeEE---EEeCCeEEEEc-CCCcccc
Q 003405 169 NICIAIR--KGYMILNATNGALSEVFPSGRIGPP--LVVSL-----------LSGELL---LGKENIGVFVD-QNGKLLQ 229 (823)
Q Consensus 169 ~i~v~~~--~~y~lidl~~~~~~~L~~~~~~~~p--~i~~~-----------~~~EfL---L~~~~~gvfv~-~~G~~~~ 229 (823)
.+++|.. +...++|..++....+.+.|+...| ..... ..+++. ++.+...+ ++ ...+.+
T Consensus 185 y~~va~~~sn~i~viD~~~~k~v~~i~~g~~p~~~~~~~~php~~g~vw~~~~~~~~~~~~ig~~~v~v-~d~~~wkvv- 262 (369)
T PF02239_consen 185 YFLVAANGSNKIAVIDTKTGKLVALIDTGKKPHPGPGANFPHPGFGPVWATSGLGYFAIPLIGTDPVSV-HDDYAWKVV- 262 (369)
T ss_dssp EEEEEEGGGTEEEEEETTTTEEEEEEE-SSSBEETTEEEEEETTTEEEEEEEBSSSSEEEEEE--TTT--STTTBTSEE-
T ss_pred eeeecccccceeEEEeeccceEEEEeeccccccccccccccCCCcceEEeeccccceecccccCCcccc-chhhcCeEE-
Confidence 5666654 4567889988877666666653221 11111 111221 11111111 11 111111
Q ss_pred CCceeecCCCcEEEEe--CCEEEEE---eC--CeEEEEEccCCCceeEEEeeCCc-c--cc--cccCCeEEEec---cce
Q 003405 230 ADRICWSEAPIAVIIQ--KPYAIAL---LP--RRVEVRSLRVPYALIQTIVLQNV-R--HL--IPSSNAVVVAL---ENS 294 (823)
Q Consensus 230 ~~~i~w~~~P~~v~~~--~PYll~~---~~--~~ieV~~l~~~~~lvQ~i~l~~~-~--~l--~~~~~~v~v~s---~~~ 294 (823)
.+|.-.+.|..+..+ .+|+.+= .+ +.|.|.+.. +...+.++..... + ++ ...|..++++. ++.
T Consensus 263 -~~I~~~G~glFi~thP~s~~vwvd~~~~~~~~~v~viD~~-tl~~~~~i~~~~~~~~~h~ef~~dG~~v~vS~~~~~~~ 340 (369)
T PF02239_consen 263 -KTIPTQGGGLFIKTHPDSRYVWVDTFLNPDADTVQVIDKK-TLKVVKTITPGPGKRVVHMEFNPDGKEVWVSVWDGNGA 340 (369)
T ss_dssp -EEEE-SSSS--EE--TT-SEEEEE-TT-SSHT-EEEEECC-GTEEEE-HHHHHT--EEEEEE-TTSSEEEEEEE--TTE
T ss_pred -EEEECCCCcceeecCCCCccEEeeccCCCCCceEEEEECc-CcceeEEEeccCCCcEeccEECCCCCEEEEEEecCCCE
Confidence 234445666444443 4677776 22 478888886 3555555532211 1 21 23456666653 225
Q ss_pred EEEeeccC
Q 003405 295 IFGLFPVP 302 (823)
Q Consensus 295 I~~l~~~~ 302 (823)
|..+....
T Consensus 341 i~v~D~~T 348 (369)
T PF02239_consen 341 IVVYDAKT 348 (369)
T ss_dssp EEEEETTT
T ss_pred EEEEECCC
Confidence 55554443
No 164
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=91.09 E-value=51 Score=41.45 Aligned_cols=81 Identities=21% Similarity=0.305 Sum_probs=50.1
Q ss_pred CcHHHHHHHHHHhc--cHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccc
Q 003405 547 NHYTALLELYKSNA--RHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQT 624 (823)
Q Consensus 547 ~~~~~L~~ly~~~g--~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~ 624 (823)
++...++.-|.+++ +.+.||.++.++.+... ...+.+++||--|- |.+.+++.+.=+-+. -
T Consensus 813 ~~l~~IlTa~vkk~Pp~le~aL~~I~~l~~~~~-----------~~ae~alkyl~fLv--Dvn~Ly~~ALG~YDl----~ 875 (928)
T PF04762_consen 813 KYLQPILTAYVKKSPPDLEEALQLIKELREEDP-----------ESAEEALKYLCFLV--DVNKLYDVALGTYDL----E 875 (928)
T ss_pred hhHHHHHHHHHhcCchhHHHHHHHHHHHHhcCh-----------HHHHHHHhHheeec--cHHHHHHHHhhhcCH----H
Confidence 44556777788888 89999999999975421 11367888887554 667777665333331 1
Q ss_pred ccccccC--CCChHHHHHHHhh
Q 003405 625 IELFLSG--NIPADLVNSYLKQ 644 (823)
Q Consensus 625 ~~if~~~--~l~~~~Vl~~L~~ 644 (823)
+.+++.. +.+|.+-++||++
T Consensus 876 Lal~VAq~SQkDPKEYLPfL~~ 897 (928)
T PF04762_consen 876 LALMVAQQSQKDPKEYLPFLQE 897 (928)
T ss_pred HHHHHHHHhccChHHHHHHHHH
Confidence 2233332 3566666666654
No 165
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=90.60 E-value=1.2 Score=50.55 Aligned_cols=110 Identities=20% Similarity=0.238 Sum_probs=75.9
Q ss_pred CCCCeeEEEEecccCceeeEeC-c----EEEEeCCCCcccccccCCCCc-EEEEeeCCCceEEEEEcCeEEEEEEcCCCc
Q 003405 74 SKKPILSMEVLASRQLLLSLSE-S----IAFHRLPNLETIAVLTKAKGA-NVYSWDDRRGFLCFARQKRVCIFRHDGGRG 147 (823)
Q Consensus 74 ~k~~I~qI~~~~~~~~Ll~l~d-~----l~~~~L~~l~~~~~i~~~kg~-~~fa~~~~~~~l~V~~kkki~l~~~~~~~~ 147 (823)
+.++|.|+.-=..+..|.+++- + |.+|.|..-.......+.||. .+....+...+++|++++.+.||.+... .
T Consensus 520 ~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQLSK~~sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~~vRiYdL~kq-e 598 (733)
T KOG0650|consen 520 HPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQLSKRKSQSPFRKSKGLVQRVKFHPSKPYLFVATQRSVRIYDLSKQ-E 598 (733)
T ss_pred cCCccceeeeecCCceEEEeccCCCcceEEEEecccccccCchhhcCCceeEEEecCCCceEEEEeccceEEEehhHH-H
Confidence 3589999999888898888764 2 999999765544444556664 3455677788999999999999998732 1
Q ss_pred eeEeeeec-CCCCceEEEec--CCeEEEEEc-CceEEEEcCCC
Q 003405 148 FVEVKDFG-VPDTVKSMSWC--GENICIAIR-KGYMILNATNG 186 (823)
Q Consensus 148 f~~~kei~-~~~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~ 186 (823)
.+|++. ..--+.+|+.. |+.|++|+- +..+-+|++-+
T Consensus 599 --lvKkL~tg~kwiS~msihp~GDnli~gs~d~k~~WfDldls 639 (733)
T KOG0650|consen 599 --LVKKLLTGSKWISSMSIHPNGDNLILGSYDKKMCWFDLDLS 639 (733)
T ss_pred --HHHHHhcCCeeeeeeeecCCCCeEEEecCCCeeEEEEcccC
Confidence 223321 12234455554 677777765 77888888744
No 166
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=90.58 E-value=22 Score=36.96 Aligned_cols=157 Identities=7% Similarity=0.179 Sum_probs=99.0
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|=|++... +.+.-|..|-++..|+++... . ..+.+ ...+|..+..-+..++.++.+
T Consensus 53 GavW~~Did~~s~~liTGSAD~t~kLWDv~tGk-----------------~-la~~k--~~~~Vk~~~F~~~gn~~l~~t 112 (327)
T KOG0643|consen 53 GAVWCCDIDWDSKHLITGSADQTAKLWDVETGK-----------------Q-LATWK--TNSPVKRVDFSFGGNLILAST 112 (327)
T ss_pred ceEEEEEecCCcceeeeccccceeEEEEcCCCc-----------------E-EEEee--cCCeeEEEeeccCCcEEEEEe
Confidence 4676766553 689999999999999976422 1 12222 257899999999999999999
Q ss_pred C---c----EEEEeCCCCc-------cccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCC
Q 003405 95 E---S----IAFHRLPNLE-------TIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPD 158 (823)
Q Consensus 95 d---~----l~~~~L~~l~-------~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~ 158 (823)
| | |.++++...+ |...|+ .-..++...+++...+|+.+ .+.+|.+|....+.++..-.+.. ..
T Consensus 113 D~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~ii~Ghe~G~is~~da~~g~~~v~s~~~h-~~ 191 (327)
T KOG0643|consen 113 DKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSALWGPLGETIIAGHEDGSISIYDARTGKELVDSDEEH-SS 191 (327)
T ss_pred hhhcCcceEEEEEEccCChhhhcccCceEEecCCccceeeeeecccCCEEEEecCCCcEEEEEcccCceeeechhhh-cc
Confidence 8 2 8999987432 222221 11334444556554556666 55678888887654454433332 23
Q ss_pred CceEEEecCCe-EE-EEEc-CceEEEEcCCCCeeeccCC
Q 003405 159 TVKSMSWCGEN-IC-IAIR-KGYMILNATNGALSEVFPS 194 (823)
Q Consensus 159 ~~~~l~~~~~~-i~-v~~~-~~y~lidl~~~~~~~L~~~ 194 (823)
.|..|.+..+. .+ -|++ +.-.++|+.+-+++.-+..
T Consensus 192 ~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v~Kty~t 230 (327)
T KOG0643|consen 192 KINDLQFSRDRTYFITGSKDTTAKLVDVRTLEVLKTYTT 230 (327)
T ss_pred ccccccccCCcceEEecccCccceeeeccceeeEEEeee
Confidence 77888887443 33 3333 5667888887655544443
No 167
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=90.53 E-value=12 Score=39.53 Aligned_cols=156 Identities=13% Similarity=0.195 Sum_probs=90.7
Q ss_pred ccccccccCCCCcEEE--EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEE
Q 003405 6 FDSLELISNCSPKIDA--VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 6 f~~~~l~~~~~~~I~c--i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
|...++..+-.--|.| .+.||+++.-+++|+++.+|+.+.+++ ++.+....+.. ...|-+|.=
T Consensus 3 ~s~~pi~s~h~DlihdVs~D~~GRRmAtCSsDq~vkI~d~~~~s~--------------~W~~Ts~Wrah-~~Si~rV~W 67 (361)
T KOG2445|consen 3 FSMAPIDSGHKDLIHDVSFDFYGRRMATCSSDQTVKIWDSTSDSG--------------TWSCTSSWRAH-DGSIWRVVW 67 (361)
T ss_pred ccccccccCCcceeeeeeecccCceeeeccCCCcEEEEeccCCCC--------------ceEEeeeEEec-CCcEEEEEe
Confidence 4555555554444666 467899999999999999999754332 56665555543 567777765
Q ss_pred ec-ccC-ceeeEe-C-cEEEEeCC--CCcc-------cccccCCC-CcEEEEeeCCC-ce-EE-EEEcCeEEEEEEcC--
Q 003405 84 LA-SRQ-LLLSLS-E-SIAFHRLP--NLET-------IAVLTKAK-GANVYSWDDRR-GF-LC-FARQKRVCIFRHDG-- 144 (823)
Q Consensus 84 ~~-~~~-~Ll~l~-d-~l~~~~L~--~l~~-------~~~i~~~k-g~~~fa~~~~~-~~-l~-V~~kkki~l~~~~~-- 144 (823)
.+ +.+ ++.++| | ++.+|.-. +++. ..++...+ .++.++..+.. |. ++ +....-+.||+.-+
T Consensus 68 AhPEfGqvvA~cS~Drtv~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~dp~ 147 (361)
T KOG2445|consen 68 AHPEFGQVVATCSYDRTVSIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASADGILRIYEAPDPM 147 (361)
T ss_pred cCccccceEEEEecCCceeeeeecccccccccceeEEEEEeecCCcceeEEEecchhcceEEEEeccCcEEEEEecCCcc
Confidence 54 444 455555 4 49999753 2221 11222222 34444444432 33 33 34777889999753
Q ss_pred -CCceeEeeeec-CC-------CCceEEEecC-----CeEEEEEcC
Q 003405 145 -GRGFVEVKDFG-VP-------DTVKSMSWCG-----ENICIAIRK 176 (823)
Q Consensus 145 -~~~f~~~kei~-~~-------~~~~~l~~~~-----~~i~v~~~~ 176 (823)
-+.+....|+. ++ .+.-|+.|+. ..|.||+..
T Consensus 148 nLs~W~Lq~Ei~~~~~pp~~~~~~~~CvsWn~sr~~~p~iAvgs~e 193 (361)
T KOG2445|consen 148 NLSQWTLQHEIQNVIDPPGKNKQPCFCVSWNPSRMHEPLIAVGSDE 193 (361)
T ss_pred ccccchhhhhhhhccCCcccccCcceEEeeccccccCceEEEEccc
Confidence 12345555664 22 3445777762 346777654
No 168
>KOG4328 consensus WD40 protein [Function unknown]
Probab=90.39 E-value=5.7 Score=43.89 Aligned_cols=169 Identities=16% Similarity=0.208 Sum_probs=100.4
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe--Cc-EEEEeC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS--ES-IAFHRL 102 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~--d~-l~~~~L 102 (823)
+..+++|++-|.+.++++.-+.+ .|...+ .++++|..|.+=|-...+++-+ |+ .++||+
T Consensus 291 ~~~vl~~~~~G~f~~iD~R~~~s--------------~~~~~~----lh~kKI~sv~~NP~~p~~laT~s~D~T~kIWD~ 352 (498)
T KOG4328|consen 291 SRSVLFGDNVGNFNVIDLRTDGS--------------EYENLR----LHKKKITSVALNPVCPWFLATASLDQTAKIWDL 352 (498)
T ss_pred CccEEEeecccceEEEEeecCCc--------------cchhhh----hhhcccceeecCCCCchheeecccCcceeeeeh
Confidence 36899999999888888765432 232221 3467999999998775555444 34 899999
Q ss_pred CCCcc-----cccccCCCCcEEEEeeCCCceEEE-EEcCeEEEEEEc-CCCceeEeeeecCCC------CceEEEec--C
Q 003405 103 PNLET-----IAVLTKAKGANVYSWDDRRGFLCF-ARQKRVCIFRHD-GGRGFVEVKDFGVPD------TVKSMSWC--G 167 (823)
Q Consensus 103 ~~l~~-----~~~i~~~kg~~~fa~~~~~~~l~V-~~kkki~l~~~~-~~~~f~~~kei~~~~------~~~~l~~~--~ 167 (823)
..+.. ++.++..+.|++.+.++..|.|+. ..+..|.||... -+..+....+|.-+. .+---.|. .
T Consensus 353 R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~~D~~IRv~dss~~sa~~~p~~~I~Hn~~t~RwlT~fKA~W~P~~ 432 (498)
T KOG4328|consen 353 RQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTCQDNEIRVFDSSCISAKDEPLGTIPHNNRTGRWLTPFKAAWDPDY 432 (498)
T ss_pred hhhcCCCCcceecccccceeeeeEEcCCCCceEeeccCCceEEeecccccccCCccceeeccCcccccccchhheeCCCc
Confidence 87643 334566678888888888877544 477889999763 111222222332111 11122344 2
Q ss_pred CeEEEEE-cCceEEEEcCCCC-eeeccCCCCCCCCEEEEc-cCCeEEE
Q 003405 168 ENICIAI-RKGYMILNATNGA-LSEVFPSGRIGPPLVVSL-LSGELLL 212 (823)
Q Consensus 168 ~~i~v~~-~~~y~lidl~~~~-~~~L~~~~~~~~p~i~~~-~~~EfLL 212 (823)
+.|++|. .+...++|-+.++ +.++..+-..+-|++..+ +-+..++
T Consensus 433 ~li~vg~~~r~IDv~~~~~~q~v~el~~P~~~tI~~vn~~HP~~~~~~ 480 (498)
T KOG4328|consen 433 NLIVVGRYPRPIDVFDGNGGQMVCELHDPESSTIPSVNEFHPMRDTLA 480 (498)
T ss_pred cEEEEeccCcceeEEcCCCCEEeeeccCccccccccceeeccccccee
Confidence 4456653 3778888888777 455544322233444333 4444455
No 169
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=90.15 E-value=4.8 Score=43.92 Aligned_cols=156 Identities=16% Similarity=0.168 Sum_probs=94.8
Q ss_pred CcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-
Q 003405 17 PKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE- 95 (823)
Q Consensus 17 ~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d- 95 (823)
.++-|+...|..|.+|..||++.+++...... +.... -+++.|..|..-|+...|+.++.
T Consensus 147 ~k~vaf~~~gs~latgg~dg~lRv~~~Ps~~t------------------~l~e~-~~~~eV~DL~FS~dgk~lasig~d 207 (398)
T KOG0771|consen 147 QKVVAFNGDGSKLATGGTDGTLRVWEWPSMLT------------------ILEEI-AHHAEVKDLDFSPDGKFLASIGAD 207 (398)
T ss_pred ceEEEEcCCCCEeeeccccceEEEEecCcchh------------------hhhhH-hhcCccccceeCCCCcEEEEecCC
Confidence 36667777778999999999999998554321 11111 13678999999999999999986
Q ss_pred cEEEEeCCCCccccccc-CCC-----CcEEEEeeCCCceEEE-EEc---CeEEEEE---EcCCCceeEeee-ecCCCCce
Q 003405 96 SIAFHRLPNLETIAVLT-KAK-----GANVYSWDDRRGFLCF-ARQ---KRVCIFR---HDGGRGFVEVKD-FGVPDTVK 161 (823)
Q Consensus 96 ~l~~~~L~~l~~~~~i~-~~k-----g~~~fa~~~~~~~l~V-~~k---kki~l~~---~~~~~~f~~~ke-i~~~~~~~ 161 (823)
+..+|+..+...+...+ +.| .|.+-..+.+ ..+.+ +.. +++..+. |+.+ .|-+.+. +.-...+.
T Consensus 208 ~~~VW~~~~g~~~a~~t~~~k~~~~~~cRF~~d~~~-~~l~laa~~~~~~~v~~~~~~~w~~~-~~l~~~~~~~~~~siS 285 (398)
T KOG0771|consen 208 SARVWSVNTGAALARKTPFSKDEMFSSCRFSVDNAQ-ETLRLAASQFPGGGVRLCDISLWSGS-NFLRLRKKIKRFKSIS 285 (398)
T ss_pred ceEEEEeccCchhhhcCCcccchhhhhceecccCCC-ceEEEEEecCCCCceeEEEeeeeccc-cccchhhhhhccCcce
Confidence 69999987664444332 222 3333333323 33322 222 3344433 4433 2322222 22334788
Q ss_pred EEEec--CCeEEEEEc-CceEEEEcCCCCeeeccC
Q 003405 162 SMSWC--GENICIAIR-KGYMILNATNGALSEVFP 193 (823)
Q Consensus 162 ~l~~~--~~~i~v~~~-~~y~lidl~~~~~~~L~~ 193 (823)
+|+.. |..+.+|+. ....+++..+=+...+++
T Consensus 286 sl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~vk 320 (398)
T KOG0771|consen 286 SLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYVK 320 (398)
T ss_pred eEEEcCCCcEEEEeccCCcEEEEEeceeeeeEeeh
Confidence 88876 678999988 457777776654444443
No 170
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=90.11 E-value=3.9 Score=48.21 Aligned_cols=97 Identities=13% Similarity=0.218 Sum_probs=61.3
Q ss_pred cCceeeEe-C-cEEEEeCCCCcccccccCCCCcEEEEeeCCCceEEE-E-EcCeEEEEEEcCCCceeEeeeecCCCCceE
Q 003405 87 RQLLLSLS-E-SIAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCF-A-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKS 162 (823)
Q Consensus 87 ~~~Ll~l~-d-~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V-~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~ 162 (823)
.++||+=+ | .|++|.+..-+-+....-..=||+++.++...+.++ | ...|+.|+.+.+. +.... ..+.+-|++
T Consensus 380 n~fLLSSSMDKTVRLWh~~~~~CL~~F~HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~-~Vv~W--~Dl~~lITA 456 (712)
T KOG0283|consen 380 NNFLLSSSMDKTVRLWHPGRKECLKVFSHNDFVTCVAFNPVDDRYFISGSLDGKVRLWSISDK-KVVDW--NDLRDLITA 456 (712)
T ss_pred CCeeEeccccccEEeecCCCcceeeEEecCCeeEEEEecccCCCcEeecccccceEEeecCcC-eeEee--hhhhhhhee
Confidence 36666655 5 499998754322222222334677777665444344 3 8899999998843 33333 235588999
Q ss_pred EEec--CCeEEEEEcCceEE-EEcCCC
Q 003405 163 MSWC--GENICIAIRKGYMI-LNATNG 186 (823)
Q Consensus 163 l~~~--~~~i~v~~~~~y~l-idl~~~ 186 (823)
+++. |...+||+=++|+. |++.+.
T Consensus 457 vcy~PdGk~avIGt~~G~C~fY~t~~l 483 (712)
T KOG0283|consen 457 VCYSPDGKGAVIGTFNGYCRFYDTEGL 483 (712)
T ss_pred EEeccCCceEEEEEeccEEEEEEccCC
Confidence 9998 67899999998865 455443
No 171
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=90.09 E-value=12 Score=38.71 Aligned_cols=98 Identities=17% Similarity=0.241 Sum_probs=61.0
Q ss_pred CCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCCCcE-EEEeeCCCceEEEE-EcCeEEEEEEcCCCcee
Q 003405 74 SKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAKGAN-VYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFV 149 (823)
Q Consensus 74 ~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg~~-~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~ 149 (823)
+.+||+||+.-.+.++|++++- . ..+|.-.+-+.+.+-..-.|+. ++.++.+...++-+ ....+.|+....++..
T Consensus 9 HERplTqiKyN~eGDLlFscaKD~~~~vw~s~nGerlGty~GHtGavW~~Did~~s~~liTGSAD~t~kLWDv~tGk~l- 87 (327)
T KOG0643|consen 9 HERPLTQIKYNREGDLLFSCAKDSTPTVWYSLNGERLGTYDGHTGAVWCCDIDWDSKHLITGSADQTAKLWDVETGKQL- 87 (327)
T ss_pred CccccceEEecCCCcEEEEecCCCCceEEEecCCceeeeecCCCceEEEEEecCCcceeeeccccceeEEEEcCCCcEE-
Confidence 3689999999999999999995 3 7777543433333322333432 23334444445444 6677888888766432
Q ss_pred EeeeecCCCCceEEEec-CCeEEEEE
Q 003405 150 EVKDFGVPDTVKSMSWC-GENICIAI 174 (823)
Q Consensus 150 ~~kei~~~~~~~~l~~~-~~~i~v~~ 174 (823)
-...++.+++.+.|. ++.+|+++
T Consensus 88 --a~~k~~~~Vk~~~F~~~gn~~l~~ 111 (327)
T KOG0643|consen 88 --ATWKTNSPVKRVDFSFGGNLILAS 111 (327)
T ss_pred --EEeecCCeeEEEeeccCCcEEEEE
Confidence 234567788988887 44444443
No 172
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=90.05 E-value=18 Score=36.22 Aligned_cols=104 Identities=14% Similarity=0.258 Sum_probs=67.6
Q ss_pred CCCeeEEEEecccCceeeEeC---c-EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcC----eEEEEEEcCCC
Q 003405 75 KKPILSMEVLASRQLLLSLSE---S-IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQK----RVCIFRHDGGR 146 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d---~-l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kk----ki~l~~~~~~~ 146 (823)
..+|..+.--|..+.++++.+ . +.+|++. .+++..+. ...++.++.+++...++++.-+ .+.+|...
T Consensus 59 ~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~-~~~i~~~~-~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~--- 133 (194)
T PF08662_consen 59 EGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVK-GKKIFSFG-TQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVR--- 133 (194)
T ss_pred CCceEEEEECcCCCEEEEEEccCCcccEEEcCc-ccEeEeec-CCCceEEEECCCCCEEEEEEccCCCcEEEEEECC---
Confidence 347999999999988888764 2 9999985 44444443 3456788889887777777433 35565554
Q ss_pred ceeEeeeecCCCCceEEEec--CCeEEEEEc-------CceEEEEcC
Q 003405 147 GFVEVKDFGVPDTVKSMSWC--GENICIAIR-------KGYMILNAT 184 (823)
Q Consensus 147 ~f~~~kei~~~~~~~~l~~~--~~~i~v~~~-------~~y~lidl~ 184 (823)
....+.+..-+ .++.++|. |..++.++. ++|.|.+..
T Consensus 134 ~~~~i~~~~~~-~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~ 179 (194)
T PF08662_consen 134 KKKKISTFEHS-DATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ 179 (194)
T ss_pred CCEEeeccccC-cEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence 23444444333 46889998 455555542 456666664
No 173
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=89.86 E-value=0.28 Score=39.64 Aligned_cols=50 Identities=24% Similarity=0.406 Sum_probs=38.2
Q ss_pred HHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 310 LTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 310 Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
|++.|+|++|+.+++.....++ .-..+....|..++..|+|++|...+.+
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p-----~~~~~~~~la~~~~~~g~~~~A~~~l~~ 50 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNP-----DNPEARLLLAQCYLKQGQYDEAEELLER 50 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTT-----TSHHHHHHHHHHHHHTT-HHHHHHHHHC
T ss_pred ChhccCHHHHHHHHHHHHHHCC-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 5789999999999987521111 2335667789999999999999999986
No 174
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=89.85 E-value=3.5 Score=50.01 Aligned_cols=147 Identities=17% Similarity=0.216 Sum_probs=88.8
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEEEeccc-CceeeEe-Cc-EEE
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSMEVLASR-QLLLSLS-ES-IAF 99 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~~~~~~-~~Ll~l~-d~-l~~ 99 (823)
.+|+.++.|..||.|..|+..-...+ .++...+.++ ..+|.++.+=+.. +-|++-| || |.+
T Consensus 1219 ~~gn~i~AGfaDGsvRvyD~R~a~~d---------------s~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~ 1283 (1387)
T KOG1517|consen 1219 VHGNIIAAGFADGSVRVYDRRMAPPD---------------SLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQL 1283 (1387)
T ss_pred cCCceEEEeecCCceEEeecccCCcc---------------ccceeecccCCcccceeEEeecCCCcceeeeccCCeEEE
Confidence 34688999999999999986533221 1122222332 3469998887743 4366555 46 999
Q ss_pred EeCCCCcccccccCCC----C--cEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeee---ecC--CCCceEEEecCC
Q 003405 100 HRLPNLETIAVLTKAK----G--ANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKD---FGV--PDTVKSMSWCGE 168 (823)
Q Consensus 100 ~~L~~l~~~~~i~~~k----g--~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~ke---i~~--~~~~~~l~~~~~ 168 (823)
|++..-...+.+.... | .+++.+++....|+.|..+.|.||...+. ....+|. +.- ...+.|++|...
T Consensus 1284 ~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~q~ikIy~~~G~-~l~~~k~n~~F~~q~~gs~scL~FHP~ 1362 (1387)
T KOG1517|consen 1284 LDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSAQLIKIYSLSGE-QLNIIKYNPGFMGQRIGSVSCLAFHPH 1362 (1387)
T ss_pred EecccCcccccceeeeccccCccceeeeeccCCCeeeecCcceEEEEecChh-hhcccccCcccccCcCCCcceeeecch
Confidence 9985421111111222 5 78899999888888887799999998854 3333332 111 145678888755
Q ss_pred e--EEEEEcCc-eEEEEcCCC
Q 003405 169 N--ICIAIRKG-YMILNATNG 186 (823)
Q Consensus 169 ~--i~v~~~~~-y~lidl~~~ 186 (823)
. +.+|+... ..+|....+
T Consensus 1363 ~~llAaG~~Ds~V~iYs~~k~ 1383 (1387)
T KOG1517|consen 1363 RLLLAAGSADSTVSIYSCEKP 1383 (1387)
T ss_pred hHhhhhccCCceEEEeecCCc
Confidence 3 55554433 345555443
No 175
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.73 E-value=61 Score=40.14 Aligned_cols=239 Identities=16% Similarity=0.118 Sum_probs=129.4
Q ss_pred HHHHHHHHHHHhcCChhhHHhhh-cCCCc-ccHHHHHHHHHhcCcHHHHHHHHHH-hccHHHHHHHHHHHhhcccCCCCc
Q 003405 506 ILDTALLQALLLTGQSSAALELL-KGLNY-CDVKICEEILQKKNHYTALLELYKS-NARHREALKLLHELVEESKSNQSQ 582 (823)
Q Consensus 506 ~vDT~Ll~~y~~~~~~~~l~~ll-~~~n~-c~~~~~~~~L~~~~~~~~L~~ly~~-~g~~~~AL~ll~~l~~~~~~d~~~ 582 (823)
.+-..|...|++.+ ...+..++ +.+.+ .|++.+.+.+++++.|+.++++..+ .++|+-+|.=+......+.+.++.
T Consensus 542 vl~~sL~dy~~e~~-l~~ie~lIv~le~~sLDld~vlki~kq~~lfd~liYv~~kafNDY~tplvell~~~~~difs~sE 620 (1206)
T KOG2079|consen 542 VLAPSLADYLLEEE-LKYIENLIVTLEPSSLDLDVVLKICKQYNLFDGLIYVNNKAFNDYDTPLVELLSRISNDIFSPSE 620 (1206)
T ss_pred HHHHHHHHHHHhcC-HHHHHhheeecCcccccHHHHHHHHHHhCCcceEEEEeeehhcccccHHHHHHHHhhccccCCcc
Confidence 45666777677654 23344443 33444 5999999999999999998776654 688888887776655444433222
Q ss_pred ccc------------cc------cCChH-------HHHH-HhhcC---CCCChhhHHHhhhhhhhcCcccccccccc---
Q 003405 583 DEH------------TQ------KFNPE-------SIIE-YLKPL---CGTDPMLVLEFSMLVLESCPTQTIELFLS--- 630 (823)
Q Consensus 583 ~~~------------~~------~~~~~-------~~i~-yL~~L---~~~~~~li~~y~~wll~~~p~~~~~if~~--- 630 (823)
..+ .. .+..+ ...+ .+..+ .+.+-+.-+-|.+.+++.||.+.+.++..
T Consensus 621 q~~gn~~f~yvs~cLTG~~YP~~~~~ie~~~~V~~el~r~cfS~v~~k~~~e~e~~fPYlrllLk~d~~~flnvls~afd 700 (1206)
T KOG2079|consen 621 QRLGNTIFVYVSYCLTGRFYPFGLHPIEEQGSVSHELLRNCFSSVTTKGNPEEEPAFPYLRLLLKSDPSRFLNVLSEAFD 700 (1206)
T ss_pred ccCCceEEEeeehhhcccccccccCchHhhchhhHHHHHHHhhcCCcCCCCccCcccHHHHHHHhhCHHHHHHHHHHHhh
Confidence 111 00 00011 1122 22222 12334556778888888888887655432
Q ss_pred ------CC--CChHHHHHHHhhc-C--chhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccc
Q 003405 631 ------GN--IPADLVNSYLKQY-S--PSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAY 699 (823)
Q Consensus 631 ------~~--l~~~~Vl~~L~~~-~--~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~ 699 (823)
|+ ..+..|++.|... . ....+.||-++... ....+.+=.--.+.|-..+..+..+ . ..+..
T Consensus 701 ~~~Fsldn~lv~rq~iI~~L~~~mk~e~s~~~~~lifiaq~--~s~yrqli~~s~shlq~~vitlcss-~-----~hs~r 772 (1206)
T KOG2079|consen 701 ASLFSLDNELVSRQYIIDLLLDAMKDEGSIRVLVLIFIAQS--ISKYRQLIKVSNSHLQCVVITLCSS-R-----VHSIR 772 (1206)
T ss_pred hhhhccchhhhhHHHHHHHHHHHhcccccchhhhHHHHHHH--hhhhhHHhhhhHHHHHHHHHhhccC-c-----ccchh
Confidence 21 3344555555431 1 12355666665432 0111111111112222222222110 0 00111
Q ss_pred hHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHH
Q 003405 700 SPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVH 753 (823)
Q Consensus 700 ~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~ 753 (823)
.-....|...|---...+.++-+..|++.+++.-.-+||.|.|+++.||+.|+.
T Consensus 773 En~~~alesll~lyh~~~de~~il~a~~~~~y~Vl~hi~~k~~kyed~l~~iLe 826 (1206)
T KOG2079|consen 773 ENSQIALESLLPLYHSRTDENFILEAKEKNFYKVLFHIYKKENKYEDALSLILE 826 (1206)
T ss_pred HHHHHHHHhhccceeccChHHHHHHhhhcccceeHHHHHhhhhhHHHHHHHHHH
Confidence 111224444433334455566788888999999999999999999999999996
No 176
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=89.66 E-value=3.8 Score=46.23 Aligned_cols=177 Identities=13% Similarity=0.149 Sum_probs=106.9
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccc-c------------cccCChHHHHHHhhcCC---CCChhhHHHh
Q 003405 549 YTALLELYKSNARHREALKLLHELVEESKSNQSQDE-H------------TQKFNPESIIEYLKPLC---GTDPMLVLEF 612 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~-~------------~~~~~~~~~i~yL~~L~---~~~~~li~~y 612 (823)
+..++.+|...|++++|++++..+......++.... + ....+.+...++.+++. ..+.++...|
T Consensus 190 l~ll~~~~~~~gdw~~a~~~l~~l~k~~~~~~~~~~~l~~~a~~~l~~~~~~~~~~~~l~~~w~~lp~~~~~~~~~~~~~ 269 (398)
T PRK10747 190 LRLAEQAYIRTGAWSSLLDILPSMAKAHVGDEEHRAMLEQQAWIGLMDQAMADQGSEGLKRWWKNQSRKTRHQVALQVAM 269 (398)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHhCCHHHhCCHHHHHHH
Confidence 456788999999999999888888754332110000 0 00001122333333332 2356677777
Q ss_pred hhhhhhcC-cccccccccc---CCCChH--HHHHHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhh
Q 003405 613 SMLVLESC-PTQTIELFLS---GNIPAD--LVNSYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYS 686 (823)
Q Consensus 613 ~~wll~~~-p~~~~~if~~---~~l~~~--~Vl~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~ 686 (823)
+..++..+ ++.|.+++.+ .+.++. .+...+...++......+|.+... ...++.++-.++.+|+..
T Consensus 270 A~~l~~~g~~~~A~~~L~~~l~~~~~~~l~~l~~~l~~~~~~~al~~~e~~lk~--~P~~~~l~l~lgrl~~~~------ 341 (398)
T PRK10747 270 AEHLIECDDHDTAQQIILDGLKRQYDERLVLLIPRLKTNNPEQLEKVLRQQIKQ--HGDTPLLWSTLGQLLMKH------ 341 (398)
T ss_pred HHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHhhccCCChHHHHHHHHHHHhh--CCCCHHHHHHHHHHHHHC------
Confidence 77777754 4455555544 223333 234444445577788899988764 346889999999999975
Q ss_pred hhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHh
Q 003405 687 DLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKV 755 (823)
Q Consensus 687 ~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L 755 (823)
+.+.+.|..+...++ .=|+..-+.+.+-++.++|++++|.++|=.-|
T Consensus 342 ----------~~~~~A~~~le~al~------------~~P~~~~~~~La~~~~~~g~~~~A~~~~~~~l 388 (398)
T PRK10747 342 ----------GEWQEASLAFRAALK------------QRPDAYDYAWLADALDRLHKPEEAAAMRRDGL 388 (398)
T ss_pred ----------CCHHHHHHHHHHHHh------------cCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 223344555544333 23443334578889999999999999887654
No 177
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=89.23 E-value=31 Score=36.42 Aligned_cols=155 Identities=10% Similarity=0.064 Sum_probs=96.3
Q ss_pred CCCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 15 CSPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 15 ~~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
-+..|-++.-. |..+.-|..|-.|+.|++.++.. .+.. .+ -++.+|..+.-.+..+.+++
T Consensus 46 h~geI~~~~F~P~gs~~aSgG~Dr~I~LWnv~gdce--------------N~~~---lk-gHsgAVM~l~~~~d~s~i~S 107 (338)
T KOG0265|consen 46 HKGEIYTIKFHPDGSCFASGGSDRAIVLWNVYGDCE--------------NFWV---LK-GHSGAVMELHGMRDGSHILS 107 (338)
T ss_pred CcceEEEEEECCCCCeEeecCCcceEEEEecccccc--------------ceee---ec-cccceeEeeeeccCCCEEEE
Confidence 34567665544 56889999999999999776542 1211 12 34789999999999999999
Q ss_pred EeC-c-EEEEeCCCCccccccc-CCCCcEEEEeeCCCce-EEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC
Q 003405 93 LSE-S-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGF-LCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG 167 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~-l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~ 167 (823)
++. . +..||..+-+.+.+.. ..+-+++++....... +|-+ -.+.+.+|..+.. ..+|.+.-+-+.++++|.+
T Consensus 108 ~gtDk~v~~wD~~tG~~~rk~k~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k---~~~~t~~~kyqltAv~f~d 184 (338)
T KOG0265|consen 108 CGTDKTVRGWDAETGKRIRKHKGHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKK---EAIKTFENKYQLTAVGFKD 184 (338)
T ss_pred ecCCceEEEEecccceeeehhccccceeeecCccccCCeEEEecCCCceEEEEeeccc---chhhccccceeEEEEEecc
Confidence 986 3 9999986654333211 1122333332222223 3334 4466777777622 1223333455788999973
Q ss_pred ---CeEEEEEcCceEEEEcCCCCeee
Q 003405 168 ---ENICIAIRKGYMILNATNGALSE 190 (823)
Q Consensus 168 ---~~i~v~~~~~y~lidl~~~~~~~ 190 (823)
..++-|-.+...+-|+..+...-
T Consensus 185 ~s~qv~sggIdn~ikvWd~r~~d~~~ 210 (338)
T KOG0265|consen 185 TSDQVISGGIDNDIKVWDLRKNDGLY 210 (338)
T ss_pred cccceeeccccCceeeeccccCcceE
Confidence 34666677888888886654433
No 178
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=89.05 E-value=9.3 Score=41.86 Aligned_cols=185 Identities=11% Similarity=0.114 Sum_probs=110.2
Q ss_pred CCcEEEEEEeC----CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCcee
Q 003405 16 SPKIDAVASYG----LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLL 91 (823)
Q Consensus 16 ~~~I~ci~~~~----~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll 91 (823)
..+|.|++.+- .++.-|..||++.+|..+.... ...+.++ -..|..+..=|.+.+|.
T Consensus 217 ~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~e~~------------------l~~l~gH-~~RVs~VafHPsG~~L~ 277 (459)
T KOG0272|consen 217 TSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQETP------------------LQDLEGH-LARVSRVAFHPSGKFLG 277 (459)
T ss_pred ccceeeEEEccCCCccceeeeccCCceeeeccCCCcc------------------hhhhhcc-hhhheeeeecCCCceee
Confidence 45788888774 3899999999999998765321 2233333 46899999999999988
Q ss_pred eEe-C-cEEEEeCCCCc-ccccccCCCCcEEEEeeCCCceEEEEEcCe-EEEEEEcCCCceeEeeeecCCCCceEEEecC
Q 003405 92 SLS-E-SIAFHRLPNLE-TIAVLTKAKGANVYSWDDRRGFLCFARQKR-VCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG 167 (823)
Q Consensus 92 ~l~-d-~l~~~~L~~l~-~~~~i~~~kg~~~fa~~~~~~~l~V~~kkk-i~l~~~~~~~~f~~~kei~~~~~~~~l~~~~ 167 (823)
+-| | .-.+||+.+-+ ...+-.-.||+..++...+...++-+.-.. -.|+....++..-.+.. -..+|.+++|..
T Consensus 278 TasfD~tWRlWD~~tk~ElL~QEGHs~~v~~iaf~~DGSL~~tGGlD~~~RvWDlRtgr~im~L~g--H~k~I~~V~fsP 355 (459)
T KOG0272|consen 278 TASFDSTWRLWDLETKSELLLQEGHSKGVFSIAFQPDGSLAATGGLDSLGRVWDLRTGRCIMFLAG--HIKEILSVAFSP 355 (459)
T ss_pred ecccccchhhcccccchhhHhhcccccccceeEecCCCceeeccCccchhheeecccCcEEEEecc--cccceeeEeECC
Confidence 877 5 37889986533 223345678999999988766555543322 24555543332211111 135788999985
Q ss_pred Ce--EEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEcc-CCeEEE--EeCCeEEEE
Q 003405 168 EN--ICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLL-SGELLL--GKENIGVFV 221 (823)
Q Consensus 168 ~~--i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~-~~EfLL--~~~~~gvfv 221 (823)
|. |.-|.. +.-.+.|+...+.....|--.+-.--|+..+ .+.||+ ++|+..=..
T Consensus 356 NGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~nlVS~Vk~~p~~g~fL~TasyD~t~kiW 415 (459)
T KOG0272|consen 356 NGYHLATGSSDNTCKVWDLRMRSELYTIPAHSNLVSQVKYSPQEGYFLVTASYDNTVKIW 415 (459)
T ss_pred CceEEeecCCCCcEEEeeecccccceecccccchhhheEecccCCeEEEEcccCcceeee
Confidence 54 333333 3466777776554333332211111123333 567887 356554333
No 179
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.99 E-value=5.3 Score=40.71 Aligned_cols=129 Identities=21% Similarity=0.338 Sum_probs=78.8
Q ss_pred EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec--ccCceeeEe-Cc-E
Q 003405 22 VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA--SRQLLLSLS-ES-I 97 (823)
Q Consensus 22 i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~--~~~~Ll~l~-d~-l 97 (823)
++-||.+|.-.++||+|.+|.+..+.. ..+..+..+. ..||-|+.-.. -.++|.+++ |+ |
T Consensus 19 lDyygkrlATcsSD~tVkIf~v~~n~~---------------s~ll~~L~Gh-~GPVwqv~wahPk~G~iLAScsYDgkV 82 (299)
T KOG1332|consen 19 LDYYGKRLATCSSDGTVKIFEVRNNGQ---------------SKLLAELTGH-SGPVWKVAWAHPKFGTILASCSYDGKV 82 (299)
T ss_pred hhhhcceeeeecCCccEEEEEEcCCCC---------------ceeeeEecCC-CCCeeEEeecccccCcEeeEeecCceE
Confidence 566899999999999999998775432 2334455554 67999988776 457777776 55 8
Q ss_pred EEEeCCCCc--cccccc-CCCCcEEEEeeCCC-c-eEEEE-EcCeEEEEEEcCCCceeEeeeecC-CCCceEEEec
Q 003405 98 AFHRLPNLE--TIAVLT-KAKGANVYSWDDRR-G-FLCFA-RQKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWC 166 (823)
Q Consensus 98 ~~~~L~~l~--~~~~i~-~~kg~~~fa~~~~~-~-~l~V~-~kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~ 166 (823)
.+|.-.+-. ..+.-. -.-.+++++.-+.. | .|++| ...+|.+++++.+......|.+.. +-.+.+++|.
T Consensus 83 IiWke~~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~w~t~ki~~aH~~GvnsVswa 158 (299)
T KOG1332|consen 83 IIWKEENGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGWTTSKIVFAHEIGVNSVSWA 158 (299)
T ss_pred EEEecCCCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCCCccchhhhhccccccceeeec
Confidence 899764322 111100 11234555544332 3 45555 889999999986533332232221 2356677775
No 180
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The presence of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=88.98 E-value=67 Score=39.60 Aligned_cols=55 Identities=24% Similarity=0.183 Sum_probs=34.1
Q ss_pred HhccCCCC-chhhHHHHHhhccccHHHHHHHHHHHh--CCCchhHHHHHHHHhcCCCC
Q 003405 721 LLKRLPAD-ALYEERAILLGKMNQHELALSLYVHKV--FLINQPVFLLIRRMAMDIKP 775 (823)
Q Consensus 721 ~L~~~~~~-~l~~e~~~Ll~klg~h~~AL~ilv~~L--~D~~~a~~~~l~~~y~~~~~ 775 (823)
++..-+.+ ......+.+|.++|++++|+.++=.-+ .-.++.++..+..+|...+.
T Consensus 762 ~l~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~l~~~~~~~~~ 819 (899)
T TIGR02917 762 WLKTHPNDAVLRTALAELYLAQKDYDKAIKHYRTVVKKAPDNAVVLNNLAWLYLELKD 819 (899)
T ss_pred HHHhCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCc
Confidence 33333433 244566777888888888888876533 32345566777777776554
No 181
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=88.63 E-value=12 Score=40.38 Aligned_cols=103 Identities=9% Similarity=0.163 Sum_probs=60.9
Q ss_pred CCeeEEEEecccCceeeEeCcEEEEeCCCCccccccc-CCCCcEEEEeeCCC-ceEEEE-EcCeEEEEEEcCCCceeEee
Q 003405 76 KPILSMEVLASRQLLLSLSESIAFHRLPNLETIAVLT-KAKGANVYSWDDRR-GFLCFA-RQKRVCIFRHDGGRGFVEVK 152 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d~l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~-~~l~V~-~kkki~l~~~~~~~~f~~~k 152 (823)
..+.-|......+...++++.|.+|+..-..|+.+.. +...++++..++.. ..|+.+ ..+.|.||....+ ..++
T Consensus 148 s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~---~Pl~ 224 (433)
T KOG0268|consen 148 SVYLGIDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCASDRSIVLYDLRQA---SPLK 224 (433)
T ss_pred ccccccccccccccccccCceeeecccccCCccceeecCCCceeEEecCCCcchheeeeccCCceEEEecccC---Cccc
Confidence 3344455555555666666667777765554544322 22234444445433 345554 8899999998854 3456
Q ss_pred eecCCCCceEEEecCC-eEEEEEcCceEEE
Q 003405 153 DFGVPDTVKSMSWCGE-NICIAIRKGYMIL 181 (823)
Q Consensus 153 ei~~~~~~~~l~~~~~-~i~v~~~~~y~li 181 (823)
.+.+.-..-+|+|..+ ..+++-...|.++
T Consensus 225 KVi~~mRTN~IswnPeafnF~~a~ED~nlY 254 (433)
T KOG0268|consen 225 KVILTMRTNTICWNPEAFNFVAANEDHNLY 254 (433)
T ss_pred eeeeeccccceecCccccceeeccccccce
Confidence 6666677889999854 4555555555544
No 182
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=88.57 E-value=38 Score=39.81 Aligned_cols=160 Identities=17% Similarity=0.229 Sum_probs=91.1
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccc-cccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLR-KESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~-~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
+.+..|++..- +++..|..||.+.+..+...+++.... .|. ..+..+.++..++ +..|.-+.--+. +-=++
T Consensus 14 nvkL~c~~WNke~gyIAcgG~dGlLKVlKl~t~t~d~~~~----glaa~snLsmNQtLeGH-~~sV~vvTWNe~-~QKLT 87 (1189)
T KOG2041|consen 14 NVKLHCAEWNKESGYIACGGADGLLKVLKLGTDTTDLNKS----GLAAASNLSMNQTLEGH-NASVMVVTWNEN-NQKLT 87 (1189)
T ss_pred CceEEEEEEcccCCeEEeccccceeEEEEccccCCccccc----ccccccccchhhhhccC-cceEEEEEeccc-ccccc
Confidence 34778987653 699999999999999988766542110 010 0111122222332 344544433333 44455
Q ss_pred EeC--c-EEEEeCCCCcccccc--cCCCC-cEEEEeeCCCceEEEE-EcCeEEEEEEcCCCcee-EeeeecCCCCceEEE
Q 003405 93 LSE--S-IAFHRLPNLETIAVL--TKAKG-ANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFV-EVKDFGVPDTVKSMS 164 (823)
Q Consensus 93 l~d--~-l~~~~L~~l~~~~~i--~~~kg-~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~-~~kei~~~~~~~~l~ 164 (823)
-+| | |-+|.|.+-+..... .+.|. |.+++++.+..+||++ ....+++=..++++.|. .+|-.. ...+.
T Consensus 88 tSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGavIVGsvdGNRIwgKeLkg~~----l~hv~ 163 (1189)
T KOG2041|consen 88 TSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAVIVGSVDGNRIWGKELKGQL----LAHVL 163 (1189)
T ss_pred ccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCEEEEeeccceecchhcchhe----cccee
Confidence 566 5 788998664433221 22332 4567888888889988 66667666777666563 233222 23667
Q ss_pred ecCC--eEEEEEc-CceEEEEcCC
Q 003405 165 WCGE--NICIAIR-KGYMILNATN 185 (823)
Q Consensus 165 ~~~~--~i~v~~~-~~y~lidl~~ 185 (823)
|..+ .+.++.. .+-.++|.+.
T Consensus 164 ws~D~~~~Lf~~ange~hlydnqg 187 (1189)
T KOG2041|consen 164 WSEDLEQALFKKANGETHLYDNQG 187 (1189)
T ss_pred ecccHHHHHhhhcCCcEEEecccc
Confidence 7744 3555555 4556666553
No 183
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.43 E-value=67 Score=38.93 Aligned_cols=194 Identities=10% Similarity=0.125 Sum_probs=98.9
Q ss_pred EEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCC---cc-------cccccccceeeeeecCCCCCCeeEEEEecc
Q 003405 19 IDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPS---DY-------QSLRKESYELERTISGFSKKPILSMEVLAS 86 (823)
Q Consensus 19 I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~---d~-------~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~ 86 (823)
|.|..-|. +.++=|+=|-+|.+|++.+-.-...+|+ |. +.|...+=..++.+-.-+.+.|+-+..=|.
T Consensus 138 VMcAqFhptEDlIVSaSLDQTVRVWDisGLRkk~~~pg~~e~~~~~~~~~~dLfg~~DaVVK~VLEGHDRGVNwaAfhpT 217 (1202)
T KOG0292|consen 138 VMCAQFHPTEDLIVSASLDQTVRVWDISGLRKKNKAPGSLEDQMRGQQGNSDLFGQTDAVVKHVLEGHDRGVNWAAFHPT 217 (1202)
T ss_pred EEeeccCCccceEEEecccceEEEEeecchhccCCCCCCchhhhhccccchhhcCCcCeeeeeeecccccccceEEecCC
Confidence 55555554 5777888899999999986433222222 10 112111111222222123567888888888
Q ss_pred cCceeeEeCc--EEEEeCCCCcccc---cccCCCCcEEEEeeCCCc-------------------------------eEE
Q 003405 87 RQLLLSLSES--IAFHRLPNLETIA---VLTKAKGANVYSWDDRRG-------------------------------FLC 130 (823)
Q Consensus 87 ~~~Ll~l~d~--l~~~~L~~l~~~~---~i~~~kg~~~fa~~~~~~-------------------------------~l~ 130 (823)
..++++=+|. |++|.+..-+.-. --....+|+++-.++... +-|
T Consensus 218 lpliVSG~DDRqVKlWrmnetKaWEvDtcrgH~nnVssvlfhp~q~lIlSnsEDksirVwDm~kRt~v~tfrrendRFW~ 297 (1202)
T KOG0292|consen 218 LPLIVSGADDRQVKLWRMNETKAWEVDTCRGHYNNVSSVLFHPHQDLILSNSEDKSIRVWDMTKRTSVQTFRRENDRFWI 297 (1202)
T ss_pred cceEEecCCcceeeEEEeccccceeehhhhcccCCcceEEecCccceeEecCCCccEEEEecccccceeeeeccCCeEEE
Confidence 7777777773 9999997643211 012234555544443221 123
Q ss_pred EEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEEEcCceEEEEcCCCCeeec---cCCCCCCCC---EEEE
Q 003405 131 FARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIAIRKGYMILNATNGALSEV---FPSGRIGPP---LVVS 204 (823)
Q Consensus 131 V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~~~L---~~~~~~~~p---~i~~ 204 (823)
++..-++-+|-...+..+...| +.-..-+.+..++.++++..+...-+|+.+.+-..+ -..|....| +.-.
T Consensus 298 laahP~lNLfAAgHDsGm~VFk---leRErpa~~v~~n~LfYvkd~~i~~~d~~t~~d~~v~~lr~~g~~~~~~~smsYN 374 (1202)
T KOG0292|consen 298 LAAHPELNLFAAGHDSGMIVFK---LERERPAYAVNGNGLFYVKDRFIRSYDLRTQKDTAVASLRRPGTLWQPPRSLSYN 374 (1202)
T ss_pred EEecCCcceeeeecCCceEEEE---EcccCceEEEcCCEEEEEccceEEeeeccccccceeEeccCCCcccCCcceeeec
Confidence 3333333333322221221111 221233456678999999988888899888543333 333322122 2222
Q ss_pred ccCCeEEEEeC
Q 003405 205 LLSGELLLGKE 215 (823)
Q Consensus 205 ~~~~EfLL~~~ 215 (823)
...+-+|+|.+
T Consensus 375 pae~~vlics~ 385 (1202)
T KOG0292|consen 375 PAENAVLICSN 385 (1202)
T ss_pred cccCeEEEEec
Confidence 23456677753
No 184
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=88.39 E-value=54 Score=37.82 Aligned_cols=75 Identities=13% Similarity=0.286 Sum_probs=47.6
Q ss_pred EEEEEcCCCceeE--eeeecCCCCceEEEec--CCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE
Q 003405 138 CIFRHDGGRGFVE--VKDFGVPDTVKSMSWC--GENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL 212 (823)
Q Consensus 138 ~l~~~~~~~~f~~--~kei~~~~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL 212 (823)
.+|+...+ .++. +..++++..|.+.++. .+.+++|+. +...++|...+.+... ... -....+.+-+++.+++
T Consensus 239 ciYE~~r~-klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~-ka~-~~P~~iaWHp~gai~~ 315 (545)
T PF11768_consen 239 CIYECSRN-KLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDGSIILYDTTRGVTLLA-KAE-FIPTLIAWHPDGAIFV 315 (545)
T ss_pred EEEEeecC-ceeEEEEEEEecCCcceEEecCcccceEEEEecCCeEEEEEcCCCeeeee-eec-ccceEEEEcCCCcEEE
Confidence 57887754 3443 3457888999999997 468999998 5688899877644322 111 1123445556666655
Q ss_pred EeC
Q 003405 213 GKE 215 (823)
Q Consensus 213 ~~~ 215 (823)
..+
T Consensus 316 V~s 318 (545)
T PF11768_consen 316 VGS 318 (545)
T ss_pred EEc
Confidence 433
No 185
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=88.32 E-value=10 Score=43.16 Aligned_cols=168 Identities=14% Similarity=0.170 Sum_probs=92.6
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCCC-
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLPN- 104 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~- 104 (823)
-||+|.+.-.|+.++++... | .+.+. .+..+++.|.+-+.++++.+=++ | |-+|+...
T Consensus 147 Dly~~gsg~evYRlNLEqGr----------------f--L~P~~-~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~k 207 (703)
T KOG2321|consen 147 DLYLVGSGSEVYRLNLEQGR----------------F--LNPFE-TDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDK 207 (703)
T ss_pred cEEEeecCcceEEEEccccc----------------c--ccccc-cccccceeeeecCccceEEecccCceEEEecchhh
Confidence 46666666666777665421 1 12222 12468888888888777766664 5 89998643
Q ss_pred -----Cccccccc------CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC---e
Q 003405 105 -----LETIAVLT------KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE---N 169 (823)
Q Consensus 105 -----l~~~~~i~------~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~---~ 169 (823)
++...++. ....++++....+.-.++|| ....+.||.+...+ --.+|+-...-+|..+.|..+ .
T Consensus 208 srv~~l~~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~-pl~~kdh~~e~pi~~l~~~~~~~q~ 286 (703)
T KOG2321|consen 208 SRVGTLDAASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASK-PLLVKDHGYELPIKKLDWQDTDQQN 286 (703)
T ss_pred hhheeeecccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCC-ceeecccCCccceeeecccccCCCc
Confidence 22111221 12236777777665468888 55778899988543 223444434447889999743 3
Q ss_pred EEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEc-cCCeEEEEeCC
Q 003405 170 ICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSL-LSGELLLGKEN 216 (823)
Q Consensus 170 i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~-~~~EfLL~~~~ 216 (823)
-++.+. +-..|.|-.+|+......+... -.-+|.+ +.|-|+++.++
T Consensus 287 ~v~S~Dk~~~kiWd~~~Gk~~asiEpt~~-lND~C~~p~sGm~f~Ane~ 334 (703)
T KOG2321|consen 287 KVVSMDKRILKIWDECTGKPMASIEPTSD-LNDFCFVPGSGMFFTANES 334 (703)
T ss_pred eEEecchHHhhhcccccCCceeeccccCC-cCceeeecCCceEEEecCC
Confidence 444444 4445566666654332222111 1224444 34556666553
No 186
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=88.23 E-value=7.1 Score=43.19 Aligned_cols=147 Identities=14% Similarity=0.180 Sum_probs=94.0
Q ss_pred cEEEEE--EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-
Q 003405 18 KIDAVA--SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS- 94 (823)
Q Consensus 18 ~I~ci~--~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~- 94 (823)
.|.|++ +.+++|..|.-+-.|++|+.+... -++.+++ ++.+|..+..=..-+-|.+.|
T Consensus 204 eil~~avS~Dgkylatgg~d~~v~Iw~~~t~e------------------hv~~~~g-hr~~V~~L~fr~gt~~lys~s~ 264 (479)
T KOG0299|consen 204 EILTLAVSSDGKYLATGGRDRHVQIWDCDTLE------------------HVKVFKG-HRGAVSSLAFRKGTSELYSASA 264 (479)
T ss_pred eeEEEEEcCCCcEEEecCCCceEEEecCcccc------------------hhhcccc-cccceeeeeeecCccceeeeec
Confidence 455544 456789999999999999866422 1233443 377899988777777777766
Q ss_pred C-cEEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCC-eE
Q 003405 95 E-SIAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE-NI 170 (823)
Q Consensus 95 d-~l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~-~i 170 (823)
| ++++|++..+..+.++..-. ++..+..-....+++|+ ..+.+.+|++.... +.+. ....+.|.|+++.++ -.
T Consensus 265 Drsvkvw~~~~~s~vetlyGHqd~v~~IdaL~reR~vtVGgrDrT~rlwKi~ees--qlif-rg~~~sidcv~~In~~Hf 341 (479)
T KOG0299|consen 265 DRSVKVWSIDQLSYVETLYGHQDGVLGIDALSRERCVTVGGRDRTVRLWKIPEES--QLIF-RGGEGSIDCVAFINDEHF 341 (479)
T ss_pred CCceEEEehhHhHHHHHHhCCccceeeechhcccceEEeccccceeEEEeccccc--eeee-eCCCCCeeeEEEecccce
Confidence 5 59999998877665543333 34444333333467888 77889999996432 1111 122357888988854 46
Q ss_pred EEEEc-CceEEEEcCCC
Q 003405 171 CIAIR-KGYMILNATNG 186 (823)
Q Consensus 171 ~v~~~-~~y~lidl~~~ 186 (823)
+-|+. ....+.++.+.
T Consensus 342 vsGSdnG~IaLWs~~KK 358 (479)
T KOG0299|consen 342 VSGSDNGSIALWSLLKK 358 (479)
T ss_pred eeccCCceEEEeeeccc
Confidence 66666 45677777643
No 187
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=88.04 E-value=4.9 Score=42.23 Aligned_cols=161 Identities=11% Similarity=0.202 Sum_probs=95.5
Q ss_pred cccccCC---CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEE
Q 003405 9 LELISNC---SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSME 82 (823)
Q Consensus 9 ~~l~~~~---~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~ 82 (823)
.|+++.+ -..|+|++-|- .-|.=|.+|++|..|+....+. |+.+ +-|. -.+|..|.
T Consensus 162 hPvIRTlYDH~devn~l~FHPre~ILiS~srD~tvKlFDfsK~sa------------KrA~------K~~qd~~~vrsiS 223 (430)
T KOG0640|consen 162 HPVIRTLYDHVDEVNDLDFHPRETILISGSRDNTVKLFDFSKTSA------------KRAF------KVFQDTEPVRSIS 223 (430)
T ss_pred CceEeehhhccCcccceeecchhheEEeccCCCeEEEEecccHHH------------HHHH------HHhhccceeeeEe
Confidence 3566555 34789999986 4577799999999999765332 2222 2232 36899999
Q ss_pred EecccCceeeEeCc--EEEEeCCCCcc-ccccc---CCCCcEEEEeeCCCceEEE-EEc-CeEEEEEEcCCCceeEeeee
Q 003405 83 VLASRQLLLSLSES--IAFHRLPNLET-IAVLT---KAKGANVYSWDDRRGFLCF-ARQ-KRVCIFRHDGGRGFVEVKDF 154 (823)
Q Consensus 83 ~~~~~~~Ll~l~d~--l~~~~L~~l~~-~~~i~---~~kg~~~fa~~~~~~~l~V-~~k-kki~l~~~~~~~~f~~~kei 154 (823)
.=|.++.|++=+|. +++|+..+++- .+..+ ..-+++.+-.++. +.+.| |.| ..|.||..-.++..+.+.+-
T Consensus 224 fHPsGefllvgTdHp~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t-~~lYvTaSkDG~IklwDGVS~rCv~t~~~A 302 (430)
T KOG0640|consen 224 FHPSGEFLLVGTDHPTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSST-GSLYVTASKDGAIKLWDGVSNRCVRTIGNA 302 (430)
T ss_pred ecCCCceEEEecCCCceeEEeccceeEeeecCcccccccceeEEEecCC-ccEEEEeccCCcEEeeccccHHHHHHHHhh
Confidence 99999999999994 99999987641 11111 2234555555544 45555 444 44666655434333333332
Q ss_pred cCCCCceEEEecCCe-EEEEEc--CceEEEEcCCCCe
Q 003405 155 GVPDTVKSMSWCGEN-ICIAIR--KGYMILNATNGAL 188 (823)
Q Consensus 155 ~~~~~~~~l~~~~~~-i~v~~~--~~y~lidl~~~~~ 188 (823)
.-...|.+..|..|. -++.+. +-..+..+.+|+.
T Consensus 303 H~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~ 339 (430)
T KOG0640|consen 303 HGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTGRM 339 (430)
T ss_pred cCCceeeeEEEccCCeEEeecCCcceeeeeeecCCce
Confidence 334466676666442 222222 3455566666654
No 188
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=87.98 E-value=9.1 Score=39.47 Aligned_cols=142 Identities=13% Similarity=0.207 Sum_probs=79.9
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEE
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFH 100 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~ 100 (823)
..|+++.+|.+|-.|...+.... +.....+ + +.-++.|.-- ..|-++.++. | |.+.
T Consensus 116 p~g~~~~~~~kdD~it~id~r~~------------------~~~~~~~-~-~~e~ne~~w~-~~nd~Fflt~GlG~v~IL 174 (313)
T KOG1407|consen 116 PDGEYIAVGNKDDRITFIDARTY------------------KIVNEEQ-F-KFEVNEISWN-NSNDLFFLTNGLGCVEIL 174 (313)
T ss_pred CCCCEEEEecCcccEEEEEeccc------------------ceeehhc-c-cceeeeeeec-CCCCEEEEecCCceEEEE
Confidence 34678888888888777764432 2221111 1 2345555444 3344555554 4 8888
Q ss_pred eCCCCcccccccCC-CCcEEEEeeCCCceEEEEEcCe-EEEEEEcCCCceeEeeeec-CCCCceEEEecC--CeEEEEEc
Q 003405 101 RLPNLETIAVLTKA-KGANVYSWDDRRGFLCFARQKR-VCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCG--ENICIAIR 175 (823)
Q Consensus 101 ~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~kkk-i~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~--~~i~v~~~ 175 (823)
..|+|+|+.+|..- -+|-++..++....+++|.-.. +.++..+. ..=.|-|+ +.-+|++++|.- ..|.-|+.
T Consensus 175 sypsLkpv~si~AH~snCicI~f~p~GryfA~GsADAlvSLWD~~E---LiC~R~isRldwpVRTlSFS~dg~~lASaSE 251 (313)
T KOG1407|consen 175 SYPSLKPVQSIKAHPSNCICIEFDPDGRYFATGSADALVSLWDVDE---LICERCISRLDWPVRTLSFSHDGRMLASASE 251 (313)
T ss_pred eccccccccccccCCcceEEEEECCCCceEeeccccceeeccChhH---hhhheeeccccCceEEEEeccCcceeeccCc
Confidence 88999998876432 2555555677777799985554 44554441 21122221 345888998874 44444444
Q ss_pred Cce-EEEEcCCCCee
Q 003405 176 KGY-MILNATNGALS 189 (823)
Q Consensus 176 ~~y-~lidl~~~~~~ 189 (823)
..| .|=++.||...
T Consensus 252 Dh~IDIA~vetGd~~ 266 (313)
T KOG1407|consen 252 DHFIDIAEVETGDRV 266 (313)
T ss_pred cceEEeEecccCCeE
Confidence 333 33345666543
No 189
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=87.94 E-value=1.3 Score=33.24 Aligned_cols=33 Identities=21% Similarity=0.315 Sum_probs=27.0
Q ss_pred cCCCCcEEEEEEeC--CEEEEEeCCCcEEEEcCCC
Q 003405 13 SNCSPKIDAVASYG--LKILLGCSDGSLKIYSPGS 45 (823)
Q Consensus 13 ~~~~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~ 45 (823)
..++.+|+|++.+- +-|.+||++|.|.+|+++.
T Consensus 8 k~l~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~~ 42 (47)
T PF12894_consen 8 KNLPSRVSCMSWCPTMDLIALGTEDGEVLVYRLNW 42 (47)
T ss_pred cCCCCcEEEEEECCCCCEEEEEECCCeEEEEECCC
Confidence 34577888887775 6899999999999999753
No 190
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=87.92 E-value=8.8 Score=43.78 Aligned_cols=137 Identities=15% Similarity=0.165 Sum_probs=92.4
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-----EEEE
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-----IAFH 100 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-----l~~~ 100 (823)
++++.-|.+|+.+++|+...... ...+.. ++.+|.-|.-.|-..=||+.++| |++|
T Consensus 313 ~~~lASGgnDN~~~Iwd~~~~~p------------------~~~~~~-H~aAVKA~awcP~q~~lLAsGGGs~D~~i~fw 373 (484)
T KOG0305|consen 313 GNQLASGGNDNVVFIWDGLSPEP------------------KFTFTE-HTAAVKALAWCPWQSGLLATGGGSADRCIKFW 373 (484)
T ss_pred CCeeccCCCccceEeccCCCccc------------------cEEEec-cceeeeEeeeCCCccCceEEcCCCcccEEEEE
Confidence 47899999999999998732111 112222 36789999999977667777663 9999
Q ss_pred eCCCCcccccccCCCCcEEEEeeCCCceEEEE---EcCeEEEEEEcCCCceeEeeeecC-CCCceEEEec--CCeEEEEE
Q 003405 101 RLPNLETIAVLTKAKGANVYSWDDRRGFLCFA---RQKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWC--GENICIAI 174 (823)
Q Consensus 101 ~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~---~kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~--~~~i~v~~ 174 (823)
+..+-+.+..+...-.|..++.......||.+ .+..|.||.+.. ++.+.++.- .+.+..++|. |..|+.|.
T Consensus 374 n~~~g~~i~~vdtgsQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~ps---~~~~~~l~gH~~RVl~la~SPdg~~i~t~a 450 (484)
T KOG0305|consen 374 NTNTGARIDSVDTGSQVCSLIWSKKYKELLSTHGYSENQITLWKYPS---MKLVAELLGHTSRVLYLALSPDGETIVTGA 450 (484)
T ss_pred EcCCCcEecccccCCceeeEEEcCCCCEEEEecCCCCCcEEEEeccc---cceeeeecCCcceeEEEEECCCCCEEEEec
Confidence 98766656555555567777788777777776 456789999872 444444332 3567777776 56788887
Q ss_pred cCc-eEEEEcC
Q 003405 175 RKG-YMILNAT 184 (823)
Q Consensus 175 ~~~-y~lidl~ 184 (823)
..+ ..+.++-
T Consensus 451 ~DETlrfw~~f 461 (484)
T KOG0305|consen 451 ADETLRFWNLF 461 (484)
T ss_pred ccCcEEecccc
Confidence 743 3444443
No 191
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=87.69 E-value=9.3 Score=44.53 Aligned_cols=185 Identities=21% Similarity=0.264 Sum_probs=96.0
Q ss_pred cEEE--EEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc--CceeeE
Q 003405 18 KIDA--VASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR--QLLLSL 93 (823)
Q Consensus 18 ~I~c--i~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~--~~Ll~l 93 (823)
-|.| |...+++|.-|..-|.|.+|++.+..- .+ +...+...|..+..-... +.|+.-
T Consensus 461 G~R~~~vSp~gqhLAsGDr~GnlrVy~Lq~l~~----------------~~---~~eAHesEilcLeyS~p~~~~kLLAS 521 (1080)
T KOG1408|consen 461 GFRALAVSPDGQHLASGDRGGNLRVYDLQELEY----------------TC---FMEAHESEILCLEYSFPVLTNKLLAS 521 (1080)
T ss_pred ceEEEEECCCcceecccCccCceEEEEehhhhh----------------hh---heecccceeEEEeecCchhhhHhhhh
Confidence 3555 455578999999999999999875321 11 001123445554443211 234333
Q ss_pred eC-c--EEEEeCC-CCcccccccC-CCCcEE--EEee-CCCceEEEEEcCeEEEEEEcC----CCceeEeeeecCCCCce
Q 003405 94 SE-S--IAFHRLP-NLETIAVLTK-AKGANV--YSWD-DRRGFLCFARQKRVCIFRHDG----GRGFVEVKDFGVPDTVK 161 (823)
Q Consensus 94 ~d-~--l~~~~L~-~l~~~~~i~~-~kg~~~--fa~~-~~~~~l~V~~kkki~l~~~~~----~~~f~~~kei~~~~~~~ 161 (823)
+. + |++|+.. ++.+..++.. ...+++ |+.+ -+...|-.|..|.| .|+... ++.|.......-..+.-
T Consensus 522 asrdRlIHV~Dv~rny~l~qtld~HSssITsvKFa~~gln~~MiscGADksi-mFr~~qk~~~g~~f~r~t~t~~ktTlY 600 (1080)
T KOG1408|consen 522 ASRDRLIHVYDVKRNYDLVQTLDGHSSSITSVKFACNGLNRKMISCGADKSI-MFRVNQKASSGRLFPRHTQTLSKTTLY 600 (1080)
T ss_pred ccCCceEEEEecccccchhhhhcccccceeEEEEeecCCceEEEeccCchhh-heehhccccCceeccccccccccceEE
Confidence 32 3 8898874 4555554433 233444 3333 23334555666666 555432 33443221111123344
Q ss_pred EEEecCC---eEEEEEcCceEEEEcCCCCeeeccCCCC--CCCCEEEEc-cCCeEEE--EeCCeEEEEc
Q 003405 162 SMSWCGE---NICIAIRKGYMILNATNGALSEVFPSGR--IGPPLVVSL-LSGELLL--GKENIGVFVD 222 (823)
Q Consensus 162 ~l~~~~~---~i~v~~~~~y~lidl~~~~~~~L~~~~~--~~~p~i~~~-~~~EfLL--~~~~~gvfv~ 222 (823)
-|+...+ .+.++..+...++|+++|+.+..|.-+. .+.++-+.+ +.+-++. |.|...-|+|
T Consensus 601 Dm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsdktl~~~D 669 (1080)
T KOG1408|consen 601 DMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSDKTLCFVD 669 (1080)
T ss_pred EeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCCceEEEEECCCccEEEEeecCCceEEEE
Confidence 4555432 3445555899999999999888886533 233332322 3445555 3444444444
No 192
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.65 E-value=5.2 Score=46.96 Aligned_cols=148 Identities=15% Similarity=0.256 Sum_probs=92.7
Q ss_pred CcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Cceee
Q 003405 17 PKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~ 92 (823)
..+.|++.+. +.|+-|+.||+|..|++....+.. ++.+ ..-.|..++..|.. +..++
T Consensus 134 Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~~~S~~------------------t~~~-nSESiRDV~fsp~~~~~F~s 194 (839)
T KOG0269|consen 134 RSANKLDFHSTEPNILISGSQDGTVKCWDLRSKKSKS------------------TFRS-NSESIRDVKFSPGYGNKFAS 194 (839)
T ss_pred cceeeeeeccCCccEEEecCCCceEEEEeeecccccc------------------cccc-cchhhhceeeccCCCceEEE
Confidence 4788988875 578889999999999998765421 1111 12356666666644 66777
Q ss_pred EeC-c-EEEEeCCCCccc-ccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC
Q 003405 93 LSE-S-IAFHRLPNLETI-AVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG 167 (823)
Q Consensus 93 l~d-~-l~~~~L~~l~~~-~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~ 167 (823)
..| | |..|||..-+.. -++..-. .+.+.-+++++..|+-| ..|.+.|+.+.+.+.+ ..-.|..--++..+.|+.
T Consensus 195 ~~dsG~lqlWDlRqp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~-~~~tInTiapv~rVkWRP 273 (839)
T KOG0269|consen 195 IHDSGYLQLWDLRQPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSRAK-PKHTINTIAPVGRVKWRP 273 (839)
T ss_pred ecCCceEEEeeccCchhHHHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCCcc-ceeEEeecceeeeeeecc
Confidence 778 5 899999653211 1111222 24455567777778887 5577999999865422 222355556888999984
Q ss_pred C-e--EE---EEEcCceEEEEcC
Q 003405 168 E-N--IC---IAIRKGYMILNAT 184 (823)
Q Consensus 168 ~-~--i~---v~~~~~y~lidl~ 184 (823)
. . |. .+...+..+.|+.
T Consensus 274 ~~~~hLAtcsmv~dtsV~VWDvr 296 (839)
T KOG0269|consen 274 ARSYHLATCSMVVDTSVHVWDVR 296 (839)
T ss_pred CccchhhhhhccccceEEEEeec
Confidence 2 1 11 1223456677765
No 193
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=87.58 E-value=47 Score=36.82 Aligned_cols=145 Identities=11% Similarity=0.199 Sum_probs=88.1
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEeC-c-EEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLSE-S-IAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~d-~-l~~~~L~ 103 (823)
..|+-|.++|.++.|+++...... +.......+.+ +...|+-+..-+.. +++.+++| + +.+||+.
T Consensus 191 g~Lls~~~d~~i~lwdi~~~~~~~-----------~~~~p~~~~~~-h~~~VeDV~~h~~h~~lF~sv~dd~~L~iwD~R 258 (422)
T KOG0264|consen 191 GTLLSGSDDHTICLWDINAESKED-----------KVVDPKTIFSG-HEDVVEDVAWHPLHEDLFGSVGDDGKLMIWDTR 258 (422)
T ss_pred eeEeeccCCCcEEEEeccccccCC-----------ccccceEEeec-CCcceehhhccccchhhheeecCCCeEEEEEcC
Confidence 478999999999999998755421 01111112232 35678777766654 56777887 4 9999987
Q ss_pred C--CcccccccC-CCCcEEEEeeCCCce-EEEE-EcCeEEEEEEcCCCceeEeeeec-CCCCceEEEecCC--eEEEEEc
Q 003405 104 N--LETIAVLTK-AKGANVYSWDDRRGF-LCFA-RQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWCGE--NICIAIR 175 (823)
Q Consensus 104 ~--l~~~~~i~~-~kg~~~fa~~~~~~~-l~V~-~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~~~--~i~v~~~ 175 (823)
+ .++...+.. ...++++++++-.+. |+-| ..+.|.+|..++-+ ..+..+. -.+.+..+.|..+ .|...+.
T Consensus 259 ~~~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~--~~lh~~e~H~dev~~V~WSPh~etvLASSg 336 (422)
T KOG0264|consen 259 SNTSKPSHSVKAHSAEVNCVAFNPFNEFILATGSADKTVALWDLRNLN--KPLHTFEGHEDEVFQVEWSPHNETVLASSG 336 (422)
T ss_pred CCCCCCcccccccCCceeEEEeCCCCCceEEeccCCCcEEEeechhcc--cCceeccCCCcceEEEEeCCCCCceeEecc
Confidence 4 333333222 346778888876654 4555 58899999887421 1122221 2468899999843 3333332
Q ss_pred --CceEEEEcCC
Q 003405 176 --KGYMILNATN 185 (823)
Q Consensus 176 --~~y~lidl~~ 185 (823)
+...+.|+..
T Consensus 337 ~D~rl~vWDls~ 348 (422)
T KOG0264|consen 337 TDRRLNVWDLSR 348 (422)
T ss_pred cCCcEEEEeccc
Confidence 4566677654
No 194
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=87.52 E-value=10 Score=44.29 Aligned_cols=156 Identities=12% Similarity=0.109 Sum_probs=84.0
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccC-ceeeE
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQ-LLLSL 93 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~-~Ll~l 93 (823)
.++++...- ++.+|||+.|.|+.-.-.+..... ...|+....+ ..+..+|.-|.-.|-.. .++++
T Consensus 349 ~~t~~~F~~~~p~~FiVGTe~G~v~~~~r~g~~~~~----------~~~~~~~~~~-~~h~g~v~~v~~nPF~~k~fls~ 417 (555)
T KOG1587|consen 349 GATSLKFEPTDPNHFIVGTEEGKVYKGCRKGYTPAP----------EVSYKGHSTF-ITHIGPVYAVSRNPFYPKNFLSV 417 (555)
T ss_pred ceeeEeeccCCCceEEEEcCCcEEEEEeccCCcccc----------cccccccccc-cccCcceEeeecCCCccceeeee
Confidence 466666553 689999999999873322221110 0111111111 12356888888777553 45566
Q ss_pred eC-cEEEEeCC-CCcccccccCCC-CcEEEEeeCCCceEEEEEc--CeEEEEEEcCCCceeEeeeecCCCCce-EEEec-
Q 003405 94 SE-SIAFHRLP-NLETIAVLTKAK-GANVYSWDDRRGFLCFARQ--KRVCIFRHDGGRGFVEVKDFGVPDTVK-SMSWC- 166 (823)
Q Consensus 94 ~d-~l~~~~L~-~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~k--kki~l~~~~~~~~f~~~kei~~~~~~~-~l~~~- 166 (823)
+| ++++|... ...|+....... -++..+.++.++.++++++ ..|-++.+..+. -..+.......++. .+.|.
T Consensus 418 gDW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~~-~~Pv~s~~~~~~~l~~~~~s~ 496 (555)
T KOG1587|consen 418 GDWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQDD-EEPVLSQKVCSPALTRVRWSP 496 (555)
T ss_pred ccceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhhccc-cCCcccccccccccceeecCC
Confidence 67 59999875 544554332222 3778888887765555544 444444443221 11121121223333 33443
Q ss_pred -CCeEEEEEcCc-eEEEEcCC
Q 003405 167 -GENICIAIRKG-YMILNATN 185 (823)
Q Consensus 167 -~~~i~v~~~~~-y~lidl~~ 185 (823)
|..|++|...| ..++++..
T Consensus 497 ~g~~lavGd~~G~~~~~~l~~ 517 (555)
T KOG1587|consen 497 NGKLLAVGDANGTTHILKLSE 517 (555)
T ss_pred CCcEEEEecCCCcEEEEEcCc
Confidence 56799998855 56666653
No 195
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=87.49 E-value=20 Score=38.76 Aligned_cols=124 Identities=17% Similarity=0.291 Sum_probs=79.8
Q ss_pred EEEEeCCCCcc-cccccCCCCcEEEEeeCCC-ceEEEEEcCeEEEEEEcCC----CceeEee----e-ecCC--CCceEE
Q 003405 97 IAFHRLPNLET-IAVLTKAKGANVYSWDDRR-GFLCFARQKRVCIFRHDGG----RGFVEVK----D-FGVP--DTVKSM 163 (823)
Q Consensus 97 l~~~~L~~l~~-~~~i~~~kg~~~fa~~~~~-~~l~V~~kkki~l~~~~~~----~~f~~~k----e-i~~~--~~~~~l 163 (823)
|.+|+=.+-.+ +-+-..-+++++++..+.. ..++||.+..|.|+..+.. +..+... + +.-| -+|++|
T Consensus 122 Vriy~ksst~pt~Lks~sQrnvtclawRPlsaselavgCr~gIciW~~s~tln~~r~~~~~s~~~~qvl~~pgh~pVtsm 201 (445)
T KOG2139|consen 122 VRIYDKSSTCPTKLKSVSQRNVTCLAWRPLSASELAVGCRAGICIWSDSRTLNANRNIRMMSTHHLQVLQDPGHNPVTSM 201 (445)
T ss_pred EEEeccCCCCCceecchhhcceeEEEeccCCcceeeeeecceeEEEEcCcccccccccccccccchhheeCCCCceeeEE
Confidence 77886543111 1111244689999998765 4699999999999987631 1101000 1 1122 488999
Q ss_pred EecCCeEEEEEc----CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeCCeEEE
Q 003405 164 SWCGENICIAIR----KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKENIGVF 220 (823)
Q Consensus 164 ~~~~~~i~v~~~----~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~~gvf 220 (823)
.|..+..-+++. +.+.+-|.++|..++|.+.|-.+.-+..+.+++.++.|..--++|
T Consensus 202 qwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL~~~glgg~slLkwSPdgd~lfaAt~davf 262 (445)
T KOG2139|consen 202 QWNEDGTILVTASFGSSSIMIWDPDTGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAVF 262 (445)
T ss_pred EEcCCCCEEeecccCcceEEEEcCCCCCcccccccCCCceeeEEEcCCCCEEEEeccccee
Confidence 999654333332 578999999999999987765455567788999998865433333
No 196
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=87.32 E-value=31 Score=37.34 Aligned_cols=118 Identities=12% Similarity=0.192 Sum_probs=75.1
Q ss_pred CCCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCcee
Q 003405 74 SKKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFV 149 (823)
Q Consensus 74 ~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~ 149 (823)
++.+|..+.+-|..+++++=++ . -.+|++.+-+....++.-| .|++...+-+...|+-| ...+++||+...+..+.
T Consensus 63 H~~svFavsl~P~~~l~aTGGgDD~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~~~ 142 (399)
T KOG0296|consen 63 HTDSVFAVSLHPNNNLVATGGGDDLAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGEQW 142 (399)
T ss_pred cCCceEEEEeCCCCceEEecCCCceEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCceEE
Confidence 4779999999994444433332 2 6899998776665565555 46666666666667776 77899999987543222
Q ss_pred EeeeecCCCCceEEEec--CCeEEEEEcC-ceEEEEcCCCCeeeccC
Q 003405 150 EVKDFGVPDTVKSMSWC--GENICIAIRK-GYMILNATNGALSEVFP 193 (823)
Q Consensus 150 ~~kei~~~~~~~~l~~~--~~~i~v~~~~-~y~lidl~~~~~~~L~~ 193 (823)
.+- .--+.+..|.|. +..++.|+.. ..-+..+.++....+++
T Consensus 143 ~~~--~e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~~~~kv~~ 187 (399)
T KOG0296|consen 143 KLD--QEVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQALCKVMS 187 (399)
T ss_pred Eee--cccCceEEEEecccccEEEeecCCCcEEEEECCCcceeeEec
Confidence 211 112456667776 5678888774 56677887754444544
No 197
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=86.95 E-value=4.2 Score=43.64 Aligned_cols=213 Identities=10% Similarity=0.141 Sum_probs=110.6
Q ss_pred cEEEEEEeCC---EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYGL---KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~~---~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
-|.|++-+-+ .++-|.-||.|.+|++.... +.+.+++. ...|.-|.+-. +-.++++
T Consensus 68 GV~~lakhp~~ls~~aSGs~DG~VkiWnlsqR~------------------~~~~f~AH-~G~V~Gi~v~~--~~~~tvg 126 (433)
T KOG0268|consen 68 GVSCLAKHPNKLSTVASGSCDGEVKIWNLSQRE------------------CIRTFKAH-EGLVRGICVTQ--TSFFTVG 126 (433)
T ss_pred ccchhhcCcchhhhhhccccCceEEEEehhhhh------------------hhheeecc-cCceeeEEecc--cceEEec
Confidence 5888888875 47999999999999976422 22344433 56788887765 7788888
Q ss_pred Cc--EEEEeCCCCcccccccCCCCcEEEEeeCCC---ceEEEEEcCeEEEEEEcCCCceeEeeeecCC-CCceEEEec--
Q 003405 95 ES--IAFHRLPNLETIAVLTKAKGANVYSWDDRR---GFLCFARQKRVCIFRHDGGRGFVEVKDFGVP-DTVKSMSWC-- 166 (823)
Q Consensus 95 d~--l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~---~~l~V~~kkki~l~~~~~~~~f~~~kei~~~-~~~~~l~~~-- 166 (823)
|+ |+.|.+.. .+..++-. + +...+-+. .-.++.++..|-||....+. .++.+... |.+.++.+.
T Consensus 127 dDKtvK~wk~~~-~p~~tilg-~---s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~---Pv~smswG~Dti~svkfNpv 198 (433)
T KOG0268|consen 127 DDKTVKQWKIDG-PPLHTILG-K---SVYLGIDHHRKNSVFATCGEQIDIWDEQRDN---PVSSMSWGADSISSVKFNPV 198 (433)
T ss_pred CCcceeeeeccC-Ccceeeec-c---ccccccccccccccccccCceeeecccccCC---ccceeecCCCceeEEecCCC
Confidence 84 99997643 12222111 0 11111111 22444455566666554221 22223222 566777776
Q ss_pred CCeEEEE--EcCceEEEEcCCCCeeeccCCCCCCC-CEEEEccCC-eEEEEeCCeEEE-EcCCCccccCCceeecC---C
Q 003405 167 GENICIA--IRKGYMILNATNGALSEVFPSGRIGP-PLVVSLLSG-ELLLGKENIGVF-VDQNGKLLQADRICWSE---A 238 (823)
Q Consensus 167 ~~~i~v~--~~~~y~lidl~~~~~~~L~~~~~~~~-p~i~~~~~~-EfLL~~~~~gvf-v~~~G~~~~~~~i~w~~---~ 238 (823)
...|..+ ..++..++|+.+++...-...+ ++ .-|++-++. .|..+.++..++ +|- +-..++--...+ .
T Consensus 199 ETsILas~~sDrsIvLyD~R~~~Pl~KVi~~--mRTN~IswnPeafnF~~a~ED~nlY~~Dm--R~l~~p~~v~~dhvsA 274 (433)
T KOG0268|consen 199 ETSILASCASDRSIVLYDLRQASPLKKVILT--MRTNTICWNPEAFNFVAANEDHNLYTYDM--RNLSRPLNVHKDHVSA 274 (433)
T ss_pred cchheeeeccCCceEEEecccCCccceeeee--ccccceecCccccceeeccccccceehhh--hhhcccchhhccccee
Confidence 2344444 4578999999887532222111 11 235554422 344455554333 331 111111111111 2
Q ss_pred CcEEEEeC---CEEEEEeCCeEEEEEcc
Q 003405 239 PIAVIIQK---PYAIALLPRRVEVRSLR 263 (823)
Q Consensus 239 P~~v~~~~---PYll~~~~~~ieV~~l~ 263 (823)
...+.|.. -|+-|-++++|-|+.+.
T Consensus 275 V~dVdfsptG~EfvsgsyDksIRIf~~~ 302 (433)
T KOG0268|consen 275 VMDVDFSPTGQEFVSGSYDKSIRIFPVN 302 (433)
T ss_pred EEEeccCCCcchhccccccceEEEeecC
Confidence 22333321 25555666788888774
No 198
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=86.78 E-value=6 Score=43.52 Aligned_cols=108 Identities=16% Similarity=0.189 Sum_probs=66.2
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
.|+-++... ...++||+||+|+.|++.... +..| +.+++ ..+|+.|.+-....-+++.+
T Consensus 331 ~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~~-------------~~vw----t~~AH-d~~ISgl~~n~~~p~~l~t~ 392 (463)
T KOG0270|consen 331 EVEKVAWDPHSENSFFVSTDDGTVYYFDIRNPG-------------KPVW----TLKAH-DDEISGLSVNIQTPGLLSTA 392 (463)
T ss_pred ceEEEEecCCCceeEEEecCCceEEeeecCCCC-------------Ccee----EEEec-cCCcceEEecCCCCcceeec
Confidence 555555443 578999999999999977542 2233 33443 56999999988774444443
Q ss_pred C--c-EEEEeCCCCccccccc---CCCCcEEEEeeCCC-ceEEEEEcCe-EEEEEEc
Q 003405 95 E--S-IAFHRLPNLETIAVLT---KAKGANVYSWDDRR-GFLCFARQKR-VCIFRHD 143 (823)
Q Consensus 95 d--~-l~~~~L~~l~~~~~i~---~~kg~~~fa~~~~~-~~l~V~~kkk-i~l~~~~ 143 (823)
. + |++|.++.-.+..... +.--..+|+.+++. +.++++..|. +.++.+.
T Consensus 393 s~d~~Vklw~~~~~~~~~v~~~~~~~~rl~c~~~~~~~a~~la~GG~k~~~~vwd~~ 449 (463)
T KOG0270|consen 393 STDKVVKLWKFDVDSPKSVKEHSFKLGRLHCFALDPDVAFTLAFGGEKAVLRVWDIF 449 (463)
T ss_pred cccceEEEEeecCCCCcccccccccccceeecccCCCcceEEEecCccceEEEeecc
Confidence 2 3 9999987654422110 11114566777665 4577775444 5566554
No 199
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=86.67 E-value=1.7 Score=40.91 Aligned_cols=69 Identities=20% Similarity=0.323 Sum_probs=49.7
Q ss_pred ChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 302 PLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 302 ~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
.+-++.+..+++|+|++|+..++.+... -+. ...........|+.+|.+++|++|...+.+ +|.|.|.-
T Consensus 12 ~ly~~a~~~l~~~~Y~~A~~~le~L~~r-yP~-g~ya~qAqL~l~yayy~~~~y~~A~a~~~r-------FirLhP~h 80 (142)
T PF13512_consen 12 ELYQEAQEALQKGNYEEAIKQLEALDTR-YPF-GEYAEQAQLDLAYAYYKQGDYEEAIAAYDR-------FIRLHPTH 80 (142)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHhc-CCC-CcccHHHHHHHHHHHHHccCHHHHHHHHHH-------HHHhCCCC
Confidence 3456678889999999999999876211 011 112345667779999999999999988775 56676665
No 200
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=86.50 E-value=21 Score=32.14 Aligned_cols=54 Identities=13% Similarity=0.289 Sum_probs=40.9
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc-EEEEeC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-IAFHRL 102 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-l~~~~L 102 (823)
+.|+|||+|..|.+|.-++ +..++.. ..+|+.|..+....+...+.+| |-+|+-
T Consensus 16 ~eLlvGs~D~~IRvf~~~e--------------------~~~Ei~e--~~~v~~L~~~~~~~F~Y~l~NGTVGvY~~ 70 (111)
T PF14783_consen 16 NELLVGSDDFEIRVFKGDE--------------------IVAEITE--TDKVTSLCSLGGGRFAYALANGTVGVYDR 70 (111)
T ss_pred ceEEEecCCcEEEEEeCCc--------------------EEEEEec--ccceEEEEEcCCCEEEEEecCCEEEEEeC
Confidence 5899999999999997332 2333332 3679999999988888888886 888864
No 201
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=86.40 E-value=11 Score=40.40 Aligned_cols=145 Identities=11% Similarity=0.244 Sum_probs=84.4
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE-eC------cEE
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL-SE------SIA 98 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l-~d------~l~ 98 (823)
...|+.+++||+|..|++...... ....+.+.+.+|-..+..=-..+ ++++ ++ .|.
T Consensus 84 ~h~v~s~ssDG~Vr~wD~Rs~~e~----------------a~~~~~~~~~~~f~~ld~nck~~-ii~~GtE~~~s~A~v~ 146 (376)
T KOG1188|consen 84 PHGVISCSSDGTVRLWDIRSQAES----------------ARISWTQQSGTPFICLDLNCKKN-IIACGTELTRSDASVV 146 (376)
T ss_pred CCeeEEeccCCeEEEEEeecchhh----------------hheeccCCCCCcceEeeccCcCC-eEEeccccccCceEEE
Confidence 468999999999999998753211 01122222222333222111112 2222 22 188
Q ss_pred EEeCCCCcc-ccccc--CCCCcEEEEeeCCCceE-EEE-EcCeEEEEEEcCCCce-eEeeeecCCCCceEEEecCC----
Q 003405 99 FHRLPNLET-IAVLT--KAKGANVYSWDDRRGFL-CFA-RQKRVCIFRHDGGRGF-VEVKDFGVPDTVKSMSWCGE---- 168 (823)
Q Consensus 99 ~~~L~~l~~-~~~i~--~~kg~~~fa~~~~~~~l-~V~-~kkki~l~~~~~~~~f-~~~kei~~~~~~~~l~~~~~---- 168 (823)
+|+...-+. +.... ....|+.++..++...+ +-| +..-+.+|....+.+= ....-+.....|..+.|.++
T Consensus 147 lwDvR~~qq~l~~~~eSH~DDVT~lrFHP~~pnlLlSGSvDGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw~~~~ykr 226 (376)
T KOG1188|consen 147 LWDVRSEQQLLRQLNESHNDDVTQLRFHPSDPNLLLSGSVDGLVNLFDTKKDNEEDALLHVINHGSSIHLIGWLSKKYKR 226 (376)
T ss_pred EEEeccccchhhhhhhhccCcceeEEecCCCCCeEEeecccceEEeeecCCCcchhhHHHhhcccceeeeeeeecCCcce
Confidence 898865443 33322 23468888888776544 444 6666788988754211 11222455567889999854
Q ss_pred eEEEEEcCceEEEEcCCCC
Q 003405 169 NICIAIRKGYMILNATNGA 187 (823)
Q Consensus 169 ~i~v~~~~~y~lidl~~~~ 187 (823)
..|+.....|.+++++.+.
T Consensus 227 I~clTH~Etf~~~ele~~~ 245 (376)
T KOG1188|consen 227 IMCLTHMETFAIYELEDGS 245 (376)
T ss_pred EEEEEccCceeEEEccCCC
Confidence 3666666889999999876
No 202
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=86.32 E-value=15 Score=43.02 Aligned_cols=153 Identities=13% Similarity=0.177 Sum_probs=87.1
Q ss_pred CcEEEEEEeC-----CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-C-c
Q 003405 17 PKIDAVASYG-----LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-Q-L 89 (823)
Q Consensus 17 ~~I~ci~~~~-----~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~-~ 89 (823)
.+|-|.+--. +.|.=|..|-.|++|++.. .|.+..+...+ ...|+.++..-.. | .
T Consensus 502 sEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~r-----------------ny~l~qtld~H-SssITsvKFa~~gln~~ 563 (1080)
T KOG1408|consen 502 SEILCLEYSFPVLTNKLLASASRDRLIHVYDVKR-----------------NYDLVQTLDGH-SSSITSVKFACNGLNRK 563 (1080)
T ss_pred ceeEEEeecCchhhhHhhhhccCCceEEEEeccc-----------------ccchhhhhccc-ccceeEEEEeecCCceE
Confidence 4677765432 3567778888889998653 56666555443 5789999988765 2 3
Q ss_pred eeeEe-CcEEEEeCCC-------CcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeee-cCCCC
Q 003405 90 LLSLS-ESIAFHRLPN-------LETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDF-GVPDT 159 (823)
Q Consensus 90 Ll~l~-d~l~~~~L~~-------l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei-~~~~~ 159 (823)
++.++ |...+|+... |...+......-...++|++..+.++.+ ..|+|.||.+..+++-+..|.- .-.+.
T Consensus 564 MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~ 643 (1080)
T KOG1408|consen 564 MISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGD 643 (1080)
T ss_pred EEeccCchhhheehhccccCceeccccccccccceEEEeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCCc
Confidence 55543 3344444321 1001111111112346777776665554 7789999999866543333321 11234
Q ss_pred ceEEEecCCeEEEEEc---CceEEEEcCCCC
Q 003405 160 VKSMSWCGENICIAIR---KGYMILNATNGA 187 (823)
Q Consensus 160 ~~~l~~~~~~i~v~~~---~~y~lidl~~~~ 187 (823)
+.-+......|++++. +...++|..+|+
T Consensus 644 lIKv~lDPSgiY~atScsdktl~~~Df~sgE 674 (1080)
T KOG1408|consen 644 LIKVILDPSGIYLATSCSDKTLCFVDFVSGE 674 (1080)
T ss_pred eEEEEECCCccEEEEeecCCceEEEEeccch
Confidence 4455555555666544 667888887764
No 203
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=86.12 E-value=6.3 Score=48.25 Aligned_cols=160 Identities=19% Similarity=0.271 Sum_probs=98.0
Q ss_pred CCcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc----C
Q 003405 16 SPKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR----Q 88 (823)
Q Consensus 16 ~~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~----~ 88 (823)
.+.|..++.. ++.|.=|.++|.|++|+++....+-+ . + +..+..+|..+.-. .
T Consensus 116 ~G~V~gLDfN~~q~nlLASGa~~geI~iWDlnn~~tP~~--------------~-----~-~~~~~~eI~~lsWNrkvqh 175 (1049)
T KOG0307|consen 116 TGPVLGLDFNPFQGNLLASGADDGEILIWDLNKPETPFT--------------P-----G-SQAPPSEIKCLSWNRKVSH 175 (1049)
T ss_pred CCceeeeeccccCCceeeccCCCCcEEEeccCCcCCCCC--------------C-----C-CCCCcccceEeccchhhhH
Confidence 3467777665 35789999999999999987443211 0 0 12345555555432 1
Q ss_pred ceeeEeC-c-EEEEeCCCCcccccccCCCC---cEEEEeeCCC-ceEEEEEc-CeEEEE-EEcCCCceeEeeeecC-CCC
Q 003405 89 LLLSLSE-S-IAFHRLPNLETIAVLTKAKG---ANVYSWDDRR-GFLCFARQ-KRVCIF-RHDGGRGFVEVKDFGV-PDT 159 (823)
Q Consensus 89 ~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg---~~~fa~~~~~-~~l~V~~k-kki~l~-~~~~~~~f~~~kei~~-~~~ 159 (823)
+|...+. | ..+|||..-+++..+....+ |+.++++++. .+++|+.. ++.-++ .|+-+.--..+|++.. .-.
T Consensus 176 ILAS~s~sg~~~iWDlr~~~pii~ls~~~~~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~assP~k~~~~H~~G 255 (1049)
T KOG0307|consen 176 ILASGSPSGRAVIWDLRKKKPIIKLSDTPGRMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFASSPLKILEGHQRG 255 (1049)
T ss_pred HhhccCCCCCceeccccCCCcccccccCCCccceeeeeeCCCCceeeeeecCCCCCceeEeecccccCCchhhhcccccc
Confidence 2444444 4 89999987677665555554 8889998876 56888744 333333 3442111123455422 347
Q ss_pred ceEEEec--CCeEEEEEc--CceEEEEcCCCCeeeccCCC
Q 003405 160 VKSMSWC--GENICIAIR--KGYMILNATNGALSEVFPSG 195 (823)
Q Consensus 160 ~~~l~~~--~~~i~v~~~--~~y~lidl~~~~~~~L~~~~ 195 (823)
|.+|.|+ ++.+.+.+. +....-|.+||++..=+|.+
T Consensus 256 ilslsWc~~D~~lllSsgkD~~ii~wN~~tgEvl~~~p~~ 295 (1049)
T KOG0307|consen 256 ILSLSWCPQDPRLLLSSGKDNRIICWNPNTGEVLGELPAQ 295 (1049)
T ss_pred eeeeccCCCCchhhhcccCCCCeeEecCCCceEeeecCCC
Confidence 8999999 446666666 45666677888776666664
No 204
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=84.94 E-value=49 Score=37.12 Aligned_cols=113 Identities=15% Similarity=0.305 Sum_probs=71.2
Q ss_pred cEEEEEEeCCEEEEEeCCC-cEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc
Q 003405 18 KIDAVASYGLKILLGCSDG-SLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES 96 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G-~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~ 96 (823)
+-.-+.+.++.+++||+|| .|-+|+.... ++.+-.+.+ ..|..+.+-+....+++--|.
T Consensus 363 rY~r~~~~~e~~vigt~dgD~l~iyd~~~~------------------e~kr~e~~l--g~I~av~vs~dGK~~vvaNdr 422 (668)
T COG4946 363 RYRRIQVDPEGDVIGTNDGDKLGIYDKDGG------------------EVKRIEKDL--GNIEAVKVSPDGKKVVVANDR 422 (668)
T ss_pred EEEEEccCCcceEEeccCCceEEEEecCCc------------------eEEEeeCCc--cceEEEEEcCCCcEEEEEcCc
Confidence 3344555667899999999 7888875532 122222233 578889888876655554453
Q ss_pred --EEEEeCCCCcccccccCCC--CcEEEEeeCCCceEEEE-----EcCeEEEEEEcCCCceeEe
Q 003405 97 --IAFHRLPNLETIAVLTKAK--GANVYSWDDRRGFLCFA-----RQKRVCIFRHDGGRGFVEV 151 (823)
Q Consensus 97 --l~~~~L~~l~~~~~i~~~k--g~~~fa~~~~~~~l~V~-----~kkki~l~~~~~~~~f~~~ 151 (823)
+.++++.+-.+. .+.+.. =++-|+++++...++-+ ..+.|.+|...+++.+..+
T Consensus 423 ~el~vididngnv~-~idkS~~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~vT 485 (668)
T COG4946 423 FELWVIDIDNGNVR-LIDKSEYGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVT 485 (668)
T ss_pred eEEEEEEecCCCee-EecccccceeEEEEEcCCceeEEEecCcceeeeeEEEEecCCCeEEEec
Confidence 777777654321 122222 36779999888777776 3467888888866544433
No 205
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=84.76 E-value=2.6 Score=35.71 Aligned_cols=52 Identities=31% Similarity=0.435 Sum_probs=32.4
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
..-+++.|+|++|+.+++.. ..+ .....++-.+|..++..|+|++|+.+|.+
T Consensus 32 a~~~~~~~~y~~A~~~~~~~-~~~-----~~~~~~~~l~a~~~~~l~~y~eAi~~l~~ 83 (84)
T PF12895_consen 32 AQCYFQQGKYEEAIELLQKL-KLD-----PSNPDIHYLLARCLLKLGKYEEAIKALEK 83 (84)
T ss_dssp HHHHHHTTHHHHHHHHHHCH-THH-----HCHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHCCCHHHHHHHHHHh-CCC-----CCCHHHHHHHHHHHHHhCCHHHHHHHHhc
Confidence 34456778888888888662 111 12234455567888888888888887764
No 206
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=84.70 E-value=41 Score=35.85 Aligned_cols=151 Identities=14% Similarity=0.201 Sum_probs=85.8
Q ss_pred CCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEEEecccCc-eee
Q 003405 15 CSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSMEVLASRQL-LLS 92 (823)
Q Consensus 15 ~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~~~~~~~~-Ll~ 92 (823)
+...|.+|..-+++|+|-+++ .|++|....+. +..+.+.... .+.. ..+.|..+. +++
T Consensus 93 f~~~I~~V~l~r~riVvvl~~-~I~VytF~~n~-----------------k~l~~~et~~NPkGl--C~~~~~~~k~~La 152 (346)
T KOG2111|consen 93 FNSEIKAVKLRRDRIVVVLEN-KIYVYTFPDNP-----------------KLLHVIETRSNPKGL--CSLCPTSNKSLLA 152 (346)
T ss_pred eccceeeEEEcCCeEEEEecC-eEEEEEcCCCh-----------------hheeeeecccCCCce--EeecCCCCceEEE
Confidence 356888999999999888765 57888755322 1122222111 1221 222232232 333
Q ss_pred EeC---c-EEEEeCCCCccc--ccc-cCCCCcEEEEeeCCCceEEEEEcCe--EEEEEEcCCCceeEeeeecCCCCceEE
Q 003405 93 LSE---S-IAFHRLPNLETI--AVL-TKAKGANVYSWDDRRGFLCFARQKR--VCIFRHDGGRGFVEVKDFGVPDTVKSM 163 (823)
Q Consensus 93 l~d---~-l~~~~L~~l~~~--~~i-~~~kg~~~fa~~~~~~~l~V~~kkk--i~l~~~~~~~~f~~~kei~~~~~~~~l 163 (823)
.-+ | |.+.+|...+.- ..| .....+.+++++.+...++-+..|. |.||.-..+....++|.=.-+-.+.+|
T Consensus 153 fPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~i 232 (346)
T KOG2111|consen 153 FPGFKTGQVQIVDLASTKPNAPSIINAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCI 232 (346)
T ss_pred cCCCccceEEEEEhhhcCcCCceEEEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEE
Confidence 322 3 888888665541 112 2234678899998877788775554 345554434333444433335578899
Q ss_pred EecC--CeEEEEEcCc-eEEEEcCC
Q 003405 164 SWCG--ENICIAIRKG-YMILNATN 185 (823)
Q Consensus 164 ~~~~--~~i~v~~~~~-y~lidl~~ 185 (823)
+|.. ..+|++++++ ..++.+..
T Consensus 233 aFSp~~s~LavsSdKgTlHiF~l~~ 257 (346)
T KOG2111|consen 233 AFSPNSSWLAVSSDKGTLHIFSLRD 257 (346)
T ss_pred EeCCCccEEEEEcCCCeEEEEEeec
Confidence 9984 4688888865 46666654
No 207
>KOG2034 consensus Vacuolar sorting protein PEP3/VPS18 [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.57 E-value=3.3 Score=49.52 Aligned_cols=196 Identities=15% Similarity=0.172 Sum_probs=108.7
Q ss_pred HHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhh-HHHhhhhhhhcCc-ccccccc
Q 003405 551 ALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPML-VLEFSMLVLESCP-TQTIELF 628 (823)
Q Consensus 551 ~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~l-i~~y~~wll~~~p-~~~~~if 628 (823)
..-..|..+|+|++||++-..- | +..+. +.+++..+++... ..|-+++
T Consensus 363 ~vWk~yLd~g~y~kAL~~ar~~------------------p------------~~le~Vl~~qAdf~f~~k~y~~AA~~y 412 (911)
T KOG2034|consen 363 DVWKTYLDKGEFDKALEIARTR------------------P------------DALETVLLKQADFLFQDKEYLRAAEIY 412 (911)
T ss_pred HHHHHHHhcchHHHHHHhccCC------------------H------------HHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 4557899999999999986531 1 11122 2344545544320 0122222
Q ss_pred ccCCCCh-HHHHHHHhhcCchhHHHHHHHHhhcccC----CCChhHHHHHHHHHHHHHHHHhh-hhhhhcccCcccchHH
Q 003405 629 LSGNIPA-DLVNSYLKQYSPSMQGRYLELMLAMNEN----SISGNLQNEMVQIYLSEVLDWYS-DLSAQQKWDEKAYSPT 702 (823)
Q Consensus 629 ~~~~l~~-~~Vl~~L~~~~~~~~~~YLE~li~~~~~----~~~~~~h~~L~~lYl~~i~~~~~-~~~~~~~~~~~~~~~~ 702 (823)
..-.-+- +.++.||+..++..+..||..=.. +.+ ..-..+-+=|+.+|++.+.+... +...-+.. .......
T Consensus 413 A~t~~~FEEVaLKFl~~~~~~~L~~~L~KKL~-~lt~~dk~q~~~Lv~WLlel~L~~Ln~l~~~de~~~en~-~~~~~~~ 490 (911)
T KOG2034|consen 413 AETLSSFEEVALKFLEINQERALRTFLDKKLD-RLTPEDKTQRDALVTWLLELYLEQLNDLDSTDEEALENW-RLEYDEV 490 (911)
T ss_pred HHhhhhHHHHHHHHHhcCCHHHHHHHHHHHHh-hCChHHHHHHHHHHHHHHHHHHHHHhcccccChhHHHHH-HHHHHHH
Confidence 2211222 357899988777788888876333 111 11122555588999999877651 11000000 0111223
Q ss_pred HHHHHHHhhh-cCCCChHHHhccCCCCchhhHHHH-------------HhhccccHHHHHHHHHHHhCCC----------
Q 003405 703 RKKLLSALES-ISGYNPEVLLKRLPADALYEERAI-------------LLGKMNQHELALSLYVHKVFLI---------- 758 (823)
Q Consensus 703 r~kLl~fL~~-s~~Yd~~~~L~~~~~~~l~~e~~~-------------Ll~klg~h~~AL~ilv~~L~D~---------- 758 (823)
++++..|+.. +..-|=+++...+..++=.++.++ -+-.-|++++||+++... .+.
T Consensus 491 ~re~~~~~~~~~~~~nretv~~l~~~~~~~e~ll~fA~l~~d~~~vv~~~~q~e~yeeaLevL~~~-~~~el~yk~ap~L 569 (911)
T KOG2034|consen 491 QREFSKFLVLHKDELNRETVYQLLASHGRQEELLQFANLIKDYEFVVSYWIQQENYEEALEVLLNQ-RNPELFYKYAPEL 569 (911)
T ss_pred HHHHHHHHHhhHHhhhHHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-cchhhHHHhhhHH
Confidence 4445555554 345566777777765543333322 456788999999999975 443
Q ss_pred -------------------chhHHHHHHHHhcCCCCCcch
Q 003405 759 -------------------NQPVFLLIRRMAMDIKPLVTE 779 (823)
Q Consensus 759 -------------------~~a~~~~l~~~y~~~~~~~~~ 779 (823)
..++.++++.|+...+....+
T Consensus 570 i~~~p~~tV~~wm~~~d~~~~~li~~~L~~~~~~~~~~~~ 609 (911)
T KOG2034|consen 570 ITHSPKETVSAWMAQKDLDPNRLIPPILSYFSNWHSEYEE 609 (911)
T ss_pred HhcCcHHHHHHHHHccccCchhhhHHHHHHHhcCCccccH
Confidence 346777788888887644443
No 208
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=83.61 E-value=2.8 Score=42.42 Aligned_cols=66 Identities=26% Similarity=0.434 Sum_probs=45.6
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
.+...++++|+|++|+.+++.+. ................|..+|..++|++|...|.+ +|..||.-
T Consensus 10 ~~a~~~~~~g~y~~Ai~~f~~l~--~~~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~-------fi~~yP~~ 75 (203)
T PF13525_consen 10 QKALEALQQGDYEEAIKLFEKLI--DRYPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYER-------FIKLYPNS 75 (203)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHH--HH-TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHH-------HHHH-TT-
T ss_pred HHHHHHHHCCCHHHHHHHHHHHH--HHCCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHH-------HHHHCCCC
Confidence 45667889999999999998762 11111113345666789999999999999999885 67788776
No 209
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=83.59 E-value=17 Score=39.47 Aligned_cols=127 Identities=13% Similarity=0.227 Sum_probs=80.5
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-cEEEEeCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E-SIAFHRLPN 104 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~~~L~~ 104 (823)
.+++-|+.|++++.|+-..... .+....+ +.+-|+++..-|+..++.+-+ | .|++|+..+
T Consensus 337 erlVSgsDd~tlflW~p~~~kk-----------------pi~rmtg-Hq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~t 398 (480)
T KOG0271|consen 337 ERLVSGSDDFTLFLWNPFKSKK-----------------PITRMTG-HQALVNHVSFSPDGRYIASASFDKSVKLWDGRT 398 (480)
T ss_pred ceeEEecCCceEEEeccccccc-----------------chhhhhc-hhhheeeEEECCCccEEEEeecccceeeeeCCC
Confidence 5799999999999997443221 1111122 256799999999877766665 5 499999887
Q ss_pred Cccccccc-CCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCceeEeeeec-CCCCceEEEec--CCeEEEEEc
Q 003405 105 LETIAVLT-KAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSWC--GENICIAIR 175 (823)
Q Consensus 105 l~~~~~i~-~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~~--~~~i~v~~~ 175 (823)
-+.+.++. ....|.-++...+. ++.|. ....|.+|+++.. +...+++ -.|.+-++.|. |..+|=|-+
T Consensus 399 Gk~lasfRGHv~~VYqvawsaDs-RLlVS~SkDsTLKvw~V~tk---Kl~~DLpGh~DEVf~vDwspDG~rV~sggk 471 (480)
T KOG0271|consen 399 GKFLASFRGHVAAVYQVAWSADS-RLLVSGSKDSTLKVWDVRTK---KLKQDLPGHADEVFAVDWSPDGQRVASGGK 471 (480)
T ss_pred cchhhhhhhccceeEEEEeccCc-cEEEEcCCCceEEEEEeeee---eecccCCCCCceEEEEEecCCCceeecCCC
Confidence 76665532 22234556666654 45554 4456888888721 1222444 34799999998 445554443
No 210
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=82.97 E-value=54 Score=38.90 Aligned_cols=165 Identities=12% Similarity=0.145 Sum_probs=101.6
Q ss_pred eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeC
Q 003405 25 YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRL 102 (823)
Q Consensus 25 ~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L 102 (823)
.+++|+=+++|+++..|.+...+ ..=..++ +-.||=.+...|..-..++.++ + -.+|.-
T Consensus 462 d~rfLlScSED~svRLWsl~t~s------------------~~V~y~G-H~~PVwdV~F~P~GyYFatas~D~tArLWs~ 522 (707)
T KOG0263|consen 462 DRRFLLSCSEDSSVRLWSLDTWS------------------CLVIYKG-HLAPVWDVQFAPRGYYFATASHDQTARLWST 522 (707)
T ss_pred cccceeeccCCcceeeeecccce------------------eEEEecC-CCcceeeEEecCCceEEEecCCCceeeeeec
Confidence 34677777888999999876432 1112333 2568888888888777888875 4 678876
Q ss_pred CCCccccc-ccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC-CCCceEEEec--CCeEEEEEc-C
Q 003405 103 PNLETIAV-LTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWC--GENICIAIR-K 176 (823)
Q Consensus 103 ~~l~~~~~-i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~--~~~i~v~~~-~ 176 (823)
+...|... ......+.++.++++...++-| .++.+.++....+... |-+.- .++++++++. |..+.-|.. .
T Consensus 523 d~~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~V---RiF~GH~~~V~al~~Sp~Gr~LaSg~ed~ 599 (707)
T KOG0263|consen 523 DHNKPLRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSV---RIFTGHKGPVTALAFSPCGRYLASGDEDG 599 (707)
T ss_pred ccCCchhhhcccccccceEEECCcccccccCCCCceEEEEEcCCCcEE---EEecCCCCceEEEEEcCCCceEeecccCC
Confidence 65544432 2344567777788887767776 7788888887654322 22221 3588888887 555665555 3
Q ss_pred ceEEEEcCCCCe-eeccCCCCCCCCEEEEccCCeEEE
Q 003405 177 GYMILNATNGAL-SEVFPSGRIGPPLVVSLLSGELLL 212 (823)
Q Consensus 177 ~y~lidl~~~~~-~~L~~~~~~~~p~i~~~~~~EfLL 212 (823)
-..+.|+.+|+. ..+... +...-.+....++.+|+
T Consensus 600 ~I~iWDl~~~~~v~~l~~H-t~ti~SlsFS~dg~vLa 635 (707)
T KOG0263|consen 600 LIKIWDLANGSLVKQLKGH-TGTIYSLSFSRDGNVLA 635 (707)
T ss_pred cEEEEEcCCCcchhhhhcc-cCceeEEEEecCCCEEE
Confidence 467889998754 333333 21111222334556655
No 211
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=82.62 E-value=25 Score=39.04 Aligned_cols=165 Identities=10% Similarity=0.126 Sum_probs=96.9
Q ss_pred cCCCCcEEEEEEeCC---EEEEE-eCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccC
Q 003405 13 SNCSPKIDAVASYGL---KILLG-CSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQ 88 (823)
Q Consensus 13 ~~~~~~I~ci~~~~~---~L~vG-T~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~ 88 (823)
..+|..++|+.+-|+ .|.|+ -++|.+.+|+-..+... .--+++++-.||.++.+.+..+
T Consensus 95 ~~lPg~a~wv~skGd~~s~IAVs~~~sg~i~VvD~~~d~~q-----------------~~~fkklH~sPV~~i~y~qa~D 157 (558)
T KOG0882|consen 95 VDLPGFAEWVTSKGDKISLIAVSLFKSGKIFVVDGFGDFCQ-----------------DGYFKKLHFSPVKKIRYNQAGD 157 (558)
T ss_pred ccCCCceEEecCCCCeeeeEEeecccCCCcEEECCcCCcCc-----------------cceecccccCceEEEEeecccc
Confidence 345788899888874 67777 46788898874432210 1123445568999999999988
Q ss_pred ceeeEeC-c-EEEEeCC-CCccc--------------ccccCCCC-cEEEEeeCCCceEEE-EEcCeEEEEEEcCCCcee
Q 003405 89 LLLSLSE-S-IAFHRLP-NLETI--------------AVLTKAKG-ANVYSWDDRRGFLCF-ARQKRVCIFRHDGGRGFV 149 (823)
Q Consensus 89 ~Ll~l~d-~-l~~~~L~-~l~~~--------------~~i~~~kg-~~~fa~~~~~~~l~V-~~kkki~l~~~~~~~~f~ 149 (823)
..++.-. | |..|... .++.. .-.++.|+ .++|++++..-++.. +-+++|.+|..+.++..+
T Consensus 158 s~vSiD~~gmVEyWs~e~~~qfPr~~l~~~~K~eTdLy~f~K~Kt~pts~Efsp~g~qistl~~DrkVR~F~~KtGklvq 237 (558)
T KOG0882|consen 158 SAVSIDISGMVEYWSAEGPFQFPRTNLNFELKHETDLYGFPKAKTEPTSFEFSPDGAQISTLNPDRKVRGFVFKTGKLVQ 237 (558)
T ss_pred ceeeccccceeEeecCCCcccCccccccccccccchhhcccccccCccceEEccccCcccccCcccEEEEEEeccchhhh
Confidence 8887765 5 8899876 23211 11234443 345777665544432 355666666554332111
Q ss_pred Ee-----------------------------eeecCCC--CceEEEec--CCeEEEEEcCceEEEEcCCCCeeeccCC
Q 003405 150 EV-----------------------------KDFGVPD--TVKSMSWC--GENICIAIRKGYMILNATNGALSEVFPS 194 (823)
Q Consensus 150 ~~-----------------------------kei~~~~--~~~~l~~~--~~~i~v~~~~~y~lidl~~~~~~~L~~~ 194 (823)
.+ ||+.-.+ .-+.+.|. |+.+.+|+--+..++|+.|+++..++..
T Consensus 238 eiDE~~t~~~~q~ks~y~l~~VelgRRmaverelek~~~~~~~~~~fdes~~flly~t~~gikvin~~tn~v~ri~gk 315 (558)
T KOG0882|consen 238 EIDEVLTDAQYQPKSPYGLMHVELGRRMAVERELEKHGSTVGTNAVFDESGNFLLYGTILGIKVINLDTNTVVRILGK 315 (558)
T ss_pred hhhccchhhhhccccccccceeehhhhhhHHhhHhhhcCcccceeEEcCCCCEEEeecceeEEEEEeecCeEEEEecc
Confidence 11 1110001 11223333 6778888888888889888877666543
No 212
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=82.05 E-value=92 Score=34.86 Aligned_cols=127 Identities=9% Similarity=0.140 Sum_probs=69.2
Q ss_pred ecCCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEE-eCCeEEEEcCCCccccCCceeecCCC---
Q 003405 165 WCGENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLG-KENIGVFVDQNGKLLQADRICWSEAP--- 239 (823)
Q Consensus 165 ~~~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~-~~~~gvfv~~~G~~~~~~~i~w~~~P--- 239 (823)
..++.++++.. .....+|..+|+...-.+.+....|. ..++.+.++ .++..+.+|.. .+.+.|....
T Consensus 254 v~~~~vy~~~~~g~l~ald~~tG~~~W~~~~~~~~~~~---~~~~~vy~~~~~g~l~ald~~-----tG~~~W~~~~~~~ 325 (394)
T PRK11138 254 VVGGVVYALAYNGNLVALDLRSGQIVWKREYGSVNDFA---VDGGRIYLVDQNDRVYALDTR-----GGVELWSQSDLLH 325 (394)
T ss_pred EECCEEEEEEcCCeEEEEECCCCCEEEeecCCCccCcE---EECCEEEEEcCCCeEEEEECC-----CCcEEEcccccCC
Confidence 34777777765 56788999999765544443221222 123444443 34444445542 2344553321
Q ss_pred ---cEEEEeCCEEEEEeC-CeEEEEEccCCCceeEEEeeCCcccc---cccCCeEEEec-cceEEEeec
Q 003405 240 ---IAVIIQKPYAIALLP-RRVEVRSLRVPYALIQTIVLQNVRHL---IPSSNAVVVAL-ENSIFGLFP 300 (823)
Q Consensus 240 ---~~v~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~~~~~l---~~~~~~v~v~s-~~~I~~l~~ 300 (823)
.+.++..-+|++... +.+.+.+.. ++.++-+..+.+.... .-.++.+|+.+ ++.|+++.+
T Consensus 326 ~~~~sp~v~~g~l~v~~~~G~l~~ld~~-tG~~~~~~~~~~~~~~s~P~~~~~~l~v~t~~G~l~~~~~ 393 (394)
T PRK11138 326 RLLTAPVLYNGYLVVGDSEGYLHWINRE-DGRFVAQQKVDSSGFLSEPVVADDKLLIQARDGTVYAITR 393 (394)
T ss_pred CcccCCEEECCEEEEEeCCCEEEEEECC-CCCEEEEEEcCCCcceeCCEEECCEEEEEeCCceEEEEeC
Confidence 122345667777665 456667775 5777766665432111 12356778765 557888764
No 213
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=81.83 E-value=11 Score=42.39 Aligned_cols=159 Identities=21% Similarity=0.280 Sum_probs=89.7
Q ss_pred cEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 18 KIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 18 ~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
+|..+.+-+ +..+-+++|-++..|.+.....+ ..+..+.-+.++ +++||..|..+.....++ -||
T Consensus 737 ~iRai~AidNENSFiSASkDKTVKLWSik~EgD~-----------~~tsaCQfTY~a-Hkk~i~~igfL~~lr~i~-ScD 803 (1034)
T KOG4190|consen 737 KIRAIAAIDNENSFISASKDKTVKLWSIKPEGDE-----------IGTSACQFTYQA-HKKPIHDIGFLADLRSIA-SCD 803 (1034)
T ss_pred HhHHHHhcccccceeeccCCceEEEEEeccccCc-----------cccceeeeEhhh-ccCcccceeeeeccceee-ecc
Confidence 455554443 34556688999999988754332 111122222333 489999999998754444 456
Q ss_pred -cEEEEeCCCCcccccccC----CCCcEEEEeeC-CCceEEEE---EcCeEEEEEEcCCCceeEeee--ecCC-CCceEE
Q 003405 96 -SIAFHRLPNLETIAVLTK----AKGANVYSWDD-RRGFLCFA---RQKRVCIFRHDGGRGFVEVKD--FGVP-DTVKSM 163 (823)
Q Consensus 96 -~l~~~~L~~l~~~~~i~~----~kg~~~fa~~~-~~~~l~V~---~kkki~l~~~~~~~~f~~~ke--i~~~-~~~~~l 163 (823)
++++|+-.--.+..++.. ..|-+..|+.. ++. |.++ ....+.+|.-+....-.++|- -+.| ..++++
T Consensus 804 ~giHlWDPFigr~Laq~~dapk~~a~~~ikcl~nv~~~-iliAgcsaeSTVKl~DaRsce~~~E~kVcna~~Pna~~R~i 882 (1034)
T KOG4190|consen 804 GGIHLWDPFIGRLLAQMEDAPKEGAGGNIKCLENVDRH-ILIAGCSAESTVKLFDARSCEWTCELKVCNAPGPNALTRAI 882 (1034)
T ss_pred CcceeecccccchhHhhhcCcccCCCceeEecccCcch-heeeeccchhhheeeecccccceeeEEeccCCCCchheeEE
Confidence 699998533222222211 11333444432 332 3332 445666776654322233443 2344 456777
Q ss_pred Eec--CCeEEEEEcCc-eEEEEcCCCCeee
Q 003405 164 SWC--GENICIAIRKG-YMILNATNGALSE 190 (823)
Q Consensus 164 ~~~--~~~i~v~~~~~-y~lidl~~~~~~~ 190 (823)
+.. ||.+.+|..++ ..++|..+|.+..
T Consensus 883 aVa~~GN~lAa~LSnGci~~LDaR~G~vIN 912 (1034)
T KOG4190|consen 883 AVADKGNKLAAALSNGCIAILDARNGKVIN 912 (1034)
T ss_pred EeccCcchhhHHhcCCcEEEEecCCCceec
Confidence 766 78899998865 5778998887643
No 214
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=81.71 E-value=22 Score=42.18 Aligned_cols=108 Identities=9% Similarity=0.231 Sum_probs=63.2
Q ss_pred CeeEEEEecccCceeeEeCc--EEEEeCCC--CcccccccCCCCcEEEEe---eCCCceEEEEEcCeEEEEEEc------
Q 003405 77 PILSMEVLASRQLLLSLSES--IAFHRLPN--LETIAVLTKAKGANVYSW---DDRRGFLCFARQKRVCIFRHD------ 143 (823)
Q Consensus 77 ~I~qI~~~~~~~~Ll~l~d~--l~~~~L~~--l~~~~~i~~~kg~~~fa~---~~~~~~l~V~~kkki~l~~~~------ 143 (823)
..+.+..-.-....+|-+++ +++|+... ++.-........+..+.. .+....++||..++|.+|.-.
T Consensus 31 ~~~li~gss~~k~a~V~~~~~~LtIWD~~~~~lE~~~~f~~~~~I~dLDWtst~d~qsiLaVGf~~~v~l~~Q~R~dy~~ 110 (631)
T PF12234_consen 31 NPSLISGSSIKKIAVVDSSRSELTIWDTRSGVLEYEESFSEDDPIRDLDWTSTPDGQSILAVGFPHHVLLYTQLRYDYTN 110 (631)
T ss_pred CcceEeecccCcEEEEECCCCEEEEEEcCCcEEEEeeeecCCCceeeceeeecCCCCEEEEEEcCcEEEEEEccchhhhc
Confidence 34444433322333443343 89998854 222222222223333222 333356899999999999632
Q ss_pred CCCceeEeeeec----CCCCceEEEecCC-eEEEEEcCceEEEEcC
Q 003405 144 GGRGFVEVKDFG----VPDTVKSMSWCGE-NICIAIRKGYMILNAT 184 (823)
Q Consensus 144 ~~~~f~~~kei~----~~~~~~~l~~~~~-~i~v~~~~~y~lidl~ 184 (823)
.+..+..++++. .|++|....|.++ .++||+.++..++|-.
T Consensus 111 ~~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~sGNqlfv~dk~ 156 (631)
T PF12234_consen 111 KGPSWAPIRKIDISSHTPHPIGDSIWLKDGTLVVGSGNQLFVFDKW 156 (631)
T ss_pred CCcccceeEEEEeecCCCCCccceeEecCCeEEEEeCCEEEEECCC
Confidence 122455666653 4789999999954 6888999999988743
No 215
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=81.62 E-value=65 Score=40.37 Aligned_cols=183 Identities=21% Similarity=0.229 Sum_probs=99.0
Q ss_pred CcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|.++.-.. +.++|+|..|.+++.+.... ..+.+ +.-...|.-+.--|+.+.+++++
T Consensus 69 ~~i~s~~fl~d~~~i~v~~~~G~iilvd~et~----------------~~eiv----g~vd~GI~aaswS~Dee~l~liT 128 (1265)
T KOG1920|consen 69 DEIVSVQFLADTNSICVITALGDIILVDPETL----------------ELEIV----GNVDNGISAASWSPDEELLALIT 128 (1265)
T ss_pred cceEEEEEecccceEEEEecCCcEEEEccccc----------------ceeee----eeccCceEEEeecCCCcEEEEEe
Confidence 4788877664 68999999999998853321 22222 22346799999999988888888
Q ss_pred C-cEEEEeCCCCcccccccCCCCcEEEEeeCC-CceEEEEEcCeEEEEEEcCCCce--eE-eee-----ecCCCCceEEE
Q 003405 95 E-SIAFHRLPNLETIAVLTKAKGANVYSWDDR-RGFLCFARQKRVCIFRHDGGRGF--VE-VKD-----FGVPDTVKSMS 164 (823)
Q Consensus 95 d-~l~~~~L~~l~~~~~i~~~kg~~~fa~~~~-~~~l~V~~kkki~l~~~~~~~~f--~~-~ke-----i~~~~~~~~l~ 164 (823)
. +..++.-.+|+++.- +.. ++=+.. ...+-||=.|+=.=|+.+.++.- .+ .+| +...+.=++++
T Consensus 129 ~~~tll~mT~~f~~i~E----~~L--~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek~~~~~~~~~~~~~Is 202 (1265)
T KOG1920|consen 129 GRQTLLFMTKDFEPIAE----KPL--DADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEKALEQIEQDDHKTSIS 202 (1265)
T ss_pred CCcEEEEEeccccchhc----ccc--ccccccccccceecccccceeeecchhhhcccccccccccccchhhccCCceEE
Confidence 7 444554556666642 111 111211 23466776666666666544321 11 111 22345556799
Q ss_pred ecCCeEEEE-------Ec-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEE-----eCCeEEEEcCCCc
Q 003405 165 WCGENICIA-------IR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLG-----KENIGVFVDQNGK 226 (823)
Q Consensus 165 ~~~~~i~v~-------~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~-----~~~~gvfv~~~G~ 226 (823)
|+|+.=+|| +. +.+.++|-+ |.....-.+.....++..+.|.|-.+=+ .++..+|+..+|-
T Consensus 203 WRgDg~~fAVs~~~~~~~~RkirV~drE-g~Lns~se~~~~l~~~LsWkPsgs~iA~iq~~~sd~~IvffErNGL 276 (1265)
T KOG1920|consen 203 WRGDGEYFAVSFVESETGTRKIRVYDRE-GALNSTSEPVEGLQHSLSWKPSGSLIAAIQCKTSDSDIVFFERNGL 276 (1265)
T ss_pred EccCCcEEEEEEEeccCCceeEEEeccc-chhhcccCcccccccceeecCCCCeEeeeeecCCCCcEEEEecCCc
Confidence 998752222 22 446666654 3221111111112345555554443321 2344677777773
No 216
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=81.36 E-value=93 Score=34.46 Aligned_cols=246 Identities=11% Similarity=0.075 Sum_probs=134.8
Q ss_pred ccccCCCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecc-cC
Q 003405 10 ELISNCSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLAS-RQ 88 (823)
Q Consensus 10 ~l~~~~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~-~~ 88 (823)
++-..-|.........++.|+-|+=+|.+.+|+.+.- .....+.++ ...|..+..-|. .+
T Consensus 171 Q~gd~rPis~~~fS~ds~~laT~swsG~~kvW~~~~~------------------~~~~~l~gH-~~~v~~~~fhP~~~~ 231 (459)
T KOG0272|consen 171 QVGDTRPISGCSFSRDSKHLATGSWSGLVKVWSVPQC------------------NLLQTLRGH-TSRVGAAVFHPVDSD 231 (459)
T ss_pred hccCCCcceeeEeecCCCeEEEeecCCceeEeecCCc------------------ceeEEEecc-ccceeeEEEccCCCc
Confidence 3344445666677777889999999999999985532 223445544 456777776665 23
Q ss_pred -ceeeEe-Cc-EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEE
Q 003405 89 -LLLSLS-ES-IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSM 163 (823)
Q Consensus 89 -~Ll~l~-d~-l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l 163 (823)
-+++++ || +++|.+.+-.++..+. -...|..++.+++...|.-+ -...-.+|.+....+. ..+|= -+..+-++
T Consensus 232 ~~lat~s~Dgtvklw~~~~e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~El-L~QEG-Hs~~v~~i 309 (459)
T KOG0272|consen 232 LNLATASADGTVKLWKLSQETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSEL-LLQEG-HSKGVFSI 309 (459)
T ss_pred cceeeeccCCceeeeccCCCcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchhh-Hhhcc-ccccccee
Confidence 455655 55 9999987755554432 22356678888877666666 3333344444321110 01110 13477788
Q ss_pred EecCC--eEEEEEcCce-EEEEcCCCCeeeccCCCCCCCCEE-EEccCCeEEEE---eCCeEEEEcCCCccccCCceeec
Q 003405 164 SWCGE--NICIAIRKGY-MILNATNGALSEVFPSGRIGPPLV-VSLLSGELLLG---KENIGVFVDQNGKLLQADRICWS 236 (823)
Q Consensus 164 ~~~~~--~i~v~~~~~y-~lidl~~~~~~~L~~~~~~~~p~i-~~~~~~EfLL~---~~~~gvfv~~~G~~~~~~~i~w~ 236 (823)
+|.-+ .++-|.-..+ .+.|+.+|+..=.+.- ..+|+. +..+.+-+.|+ .|+.+=+-|..++.. -.+|.=.
T Consensus 310 af~~DGSL~~tGGlD~~~RvWDlRtgr~im~L~g--H~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~~-ly~ipAH 386 (459)
T KOG0272|consen 310 AFQPDGSLAATGGLDSLGRVWDLRTGRCIMFLAG--HIKEILSVAFSPNGYHLATGSSDNTCKVWDLRMRSE-LYTIPAH 386 (459)
T ss_pred EecCCCceeeccCccchhheeecccCcEEEEecc--cccceeeEeECCCceEEeecCCCCcEEEeeeccccc-ceecccc
Confidence 88733 3333333444 6789999876544442 123442 23344445553 244555556655422 2223222
Q ss_pred CC-CcEEEEeC--C-EEEEEe-CCeEEEEEccCCCceeEEEeeCCcccc
Q 003405 237 EA-PIAVIIQK--P-YAIALL-PRRVEVRSLRVPYALIQTIVLQNVRHL 280 (823)
Q Consensus 237 ~~-P~~v~~~~--P-Yll~~~-~~~ieV~~l~~~~~lvQ~i~l~~~~~l 280 (823)
.. ...|-|.. . ||+... ++.+.|++-. +...++++.--..+.+
T Consensus 387 ~nlVS~Vk~~p~~g~fL~TasyD~t~kiWs~~-~~~~~ksLaGHe~kV~ 434 (459)
T KOG0272|consen 387 SNLVSQVKYSPQEGYFLVTASYDNTVKIWSTR-TWSPLKSLAGHEGKVI 434 (459)
T ss_pred cchhhheEecccCCeEEEEcccCcceeeecCC-CcccchhhcCCccceE
Confidence 22 22333431 2 444443 4789999875 4667777654444443
No 217
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratricopeptide repeat (TPR) is a structural motif present in a wide range of proteins. It mediates protein-protein interactions and the assembly of multiprotein complexes. The TPR motif consists of 3-16 tandem repeats of 34 amino acid residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=81.29 E-value=3.6 Score=27.72 Aligned_cols=25 Identities=28% Similarity=0.478 Sum_probs=20.8
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+....|..++..|+|++|..+|.++
T Consensus 3 ~~~~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 3 AWYYLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 4566799999999999999999873
No 218
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=81.11 E-value=1.3 Score=35.82 Aligned_cols=57 Identities=23% Similarity=0.254 Sum_probs=42.6
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccC-CHHHHHHHHHhc
Q 003405 304 GAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTG-SYEEAMEHFLAS 365 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~-~f~~A~~~f~~~ 365 (823)
...-..+++.|+|++|+..++....-+ ..-..++...|..++..+ +|++|+..|.++
T Consensus 7 ~~~g~~~~~~~~~~~A~~~~~~ai~~~-----p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~a 64 (69)
T PF13414_consen 7 YNLGQIYFQQGDYEEAIEYFEKAIELD-----PNNAEAYYNLGLAYMKLGKDYEEAIEDFEKA 64 (69)
T ss_dssp HHHHHHHHHTTHHHHHHHHHHHHHHHS-----TTHHHHHHHHHHHHHHTTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHcC-----CCCHHHHHHHHHHHHHhCccHHHHHHHHHHH
Confidence 344566789999999999997642111 123457778899999999 799999999874
No 219
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=80.91 E-value=31 Score=36.33 Aligned_cols=140 Identities=17% Similarity=0.196 Sum_probs=80.3
Q ss_pred CCCCCeeEEEEecccCceeeEeC--c-EEEEeCCCCc---ccccccCCC--------CcEEEEeeC------CCceEEEE
Q 003405 73 FSKKPILSMEVLASRQLLLSLSE--S-IAFHRLPNLE---TIAVLTKAK--------GANVYSWDD------RRGFLCFA 132 (823)
Q Consensus 73 ~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~L~~l~---~~~~i~~~k--------g~~~fa~~~------~~~~l~V~ 132 (823)
.+...|+.+.+.+..+..++-++ | +.+|+|.+-. .-.-+.+.+ +.+-|++.. +.|.+.-+
T Consensus 41 ~HgGsvNsL~id~tegrymlSGgadgsi~v~Dl~n~t~~e~s~li~k~~c~v~~~h~~~Hky~iss~~WyP~DtGmFtss 120 (397)
T KOG4283|consen 41 PHGGSVNSLQIDLTEGRYMLSGGADGSIAVFDLQNATDYEASGLIAKHKCIVAKQHENGHKYAISSAIWYPIDTGMFTSS 120 (397)
T ss_pred cCCCccceeeeccccceEEeecCCCccEEEEEeccccchhhccceeheeeeccccCCccceeeeeeeEEeeecCceeecc
Confidence 45678999999988777665543 4 9999986432 111111111 122222211 11211111
Q ss_pred -EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec----CC-eEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEc
Q 003405 133 -RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC----GE-NICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSL 205 (823)
Q Consensus 133 -~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~----~~-~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~ 205 (823)
..+.+.++... ..+..-.+.+++.|-+-+|. .. .|.+|++ ....+.|+.+|...-.+.--+...-.+.+.
T Consensus 121 SFDhtlKVWDtn---TlQ~a~~F~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~~vlaV~Ws 197 (397)
T KOG4283|consen 121 SFDHTLKVWDTN---TLQEAVDFKMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFSHTLSGHRDGVLAVEWS 197 (397)
T ss_pred cccceEEEeecc---cceeeEEeecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcceeeeccccCceEEEEec
Confidence 34444444333 33444557778877766664 23 3777777 578999999998766665444333345577
Q ss_pred cCCeEEEEeC
Q 003405 206 LSGELLLGKE 215 (823)
Q Consensus 206 ~~~EfLL~~~ 215 (823)
+..|++|+..
T Consensus 198 p~~e~vLatg 207 (397)
T KOG4283|consen 198 PSSEWVLATG 207 (397)
T ss_pred cCceeEEEec
Confidence 8999999643
No 220
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=80.89 E-value=2.6 Score=34.39 Aligned_cols=51 Identities=29% Similarity=0.420 Sum_probs=39.6
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.++++++|++|+..++.....++ .--..+..+|..++..|+|++|+..|.+
T Consensus 4 ~~~~~~~~~~A~~~~~~~l~~~p-----~~~~~~~~~a~~~~~~g~~~~A~~~l~~ 54 (73)
T PF13371_consen 4 IYLQQEDYEEALEVLERALELDP-----DDPELWLQRARCLFQLGRYEEALEDLER 54 (73)
T ss_pred HHHhCCCHHHHHHHHHHHHHhCc-----ccchhhHHHHHHHHHhccHHHHHHHHHH
Confidence 56789999999999987532111 1224577789999999999999999987
No 221
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=80.60 E-value=75 Score=39.03 Aligned_cols=161 Identities=15% Similarity=0.203 Sum_probs=92.5
Q ss_pred EEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc----CceeeE
Q 003405 19 IDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR----QLLLSL 93 (823)
Q Consensus 19 I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~----~~Ll~l 93 (823)
++|++.-+ +.||||+.-|.=..-.+...+..++ ...+..++.+. .||..+.++... .-+++|
T Consensus 309 assi~~L~ng~lFvGS~~gdSqLi~L~~e~d~gs-----------y~~ilet~~NL--gPI~Dm~Vvd~d~q~q~qivtC 375 (1096)
T KOG1897|consen 309 ASSINYLDNGVLFVGSRFGDSQLIKLNTEPDVGS-----------YVVILETFVNL--GPIVDMCVVDLDRQGQGQIVTC 375 (1096)
T ss_pred hhhhhcccCceEEEeccCCceeeEEccccCCCCc-----------hhhhhhhcccc--cceeeEEEEeccccCCceEEEE
Confidence 45666554 7999999999765555554443311 11222345554 599999998743 568888
Q ss_pred eC----c-EEEEeCC-CCccc--ccccCCCCcEEEE--eeCCC-ceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceE
Q 003405 94 SE----S-IAFHRLP-NLETI--AVLTKAKGANVYS--WDDRR-GFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKS 162 (823)
Q Consensus 94 ~d----~-l~~~~L~-~l~~~--~~i~~~kg~~~fa--~~~~~-~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~ 162 (823)
|+ | +.+.+-. ..+.. ..++..||.=.+. ++++. ..++++.-..-.++.+++. +.....-.+.+.-++
T Consensus 376 sGa~kdgSLRiiRngi~I~e~A~i~l~Gikg~w~lk~~v~~~~d~ylvlsf~~eTrvl~i~~e--~ee~~~~gf~~~~~T 453 (1096)
T KOG1897|consen 376 SGAFKDGSLRIIRNGIGIDELASIDLPGIKGMWSLKSMVDENYDNYLVLSFISETRVLNISEE--VEETEDPGFSTDEQT 453 (1096)
T ss_pred eCCCCCCcEEEEecccccceeeEeecCCccceeEeeccccccCCcEEEEEeccceEEEEEccc--eEEeccccccccCce
Confidence 87 2 7776532 12212 2345556544444 23332 2677775555556666632 444443333334444
Q ss_pred EEe---cCCeEEEEEcCceEEEEcCCCCeeeccCCC
Q 003405 163 MSW---CGENICIAIRKGYMILNATNGALSEVFPSG 195 (823)
Q Consensus 163 l~~---~~~~i~v~~~~~y~lidl~~~~~~~L~~~~ 195 (823)
+.. .|++|.=++.++..+++-. |...+.-+++
T Consensus 454 if~S~i~g~~lvQvTs~~iRl~ss~-~~~~~W~~p~ 488 (1096)
T KOG1897|consen 454 IFCSTINGNQLVQVTSNSIRLVSSA-GLRSEWRPPG 488 (1096)
T ss_pred EEEEccCCceEEEEecccEEEEcch-hhhhcccCCC
Confidence 433 3778888888999999866 4444444443
No 222
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=80.57 E-value=83 Score=33.37 Aligned_cols=137 Identities=12% Similarity=0.184 Sum_probs=78.3
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe--C-c-EEEEe
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS--E-S-IAFHR 101 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~--d-~-l~~~~ 101 (823)
++.|+=...|-++..|+.+... ..+.++.. .+-|+.+. ....+.-++++ | + +++||
T Consensus 102 ~s~i~S~gtDk~v~~wD~~tG~------------------~~rk~k~h-~~~vNs~~-p~rrg~~lv~SgsdD~t~kl~D 161 (338)
T KOG0265|consen 102 GSHILSCGTDKTVRGWDAETGK------------------RIRKHKGH-TSFVNSLD-PSRRGPQLVCSGSDDGTLKLWD 161 (338)
T ss_pred CCEEEEecCCceEEEEecccce------------------eeehhccc-cceeeecC-ccccCCeEEEecCCCceEEEEe
Confidence 3466666677777777755321 12333332 44566666 33446555555 3 3 99999
Q ss_pred CCCCcccccccCCCCcEEEEeeCCCceEEE-EEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC--Ce-EEEEEcCc
Q 003405 102 LPNLETIAVLTKAKGANVYSWDDRRGFLCF-ARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG--EN-ICIAIRKG 177 (823)
Q Consensus 102 L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V-~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~--~~-i~v~~~~~ 177 (823)
+.+-+.+.+.+..-..++|+.++....+.. +..+.|.++....+.....++- -.|+|++|.... .. +--+.+..
T Consensus 162 ~R~k~~~~t~~~kyqltAv~f~d~s~qv~sggIdn~ikvWd~r~~d~~~~lsG--h~DtIt~lsls~~gs~llsnsMd~t 239 (338)
T KOG0265|consen 162 IRKKEAIKTFENKYQLTAVGFKDTSDQVISGGIDNDIKVWDLRKNDGLYTLSG--HADTITGLSLSRYGSFLLSNSMDNT 239 (338)
T ss_pred ecccchhhccccceeEEEEEecccccceeeccccCceeeeccccCcceEEeec--ccCceeeEEeccCCCccccccccce
Confidence 986665555444446788999887766554 4788888887753322221211 137888887652 21 22233344
Q ss_pred eEEEEcC
Q 003405 178 YMILNAT 184 (823)
Q Consensus 178 y~lidl~ 184 (823)
..+.|+.
T Consensus 240 vrvwd~r 246 (338)
T KOG0265|consen 240 VRVWDVR 246 (338)
T ss_pred EEEEEec
Confidence 5555554
No 223
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=80.38 E-value=95 Score=33.95 Aligned_cols=236 Identities=17% Similarity=0.247 Sum_probs=123.4
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEE-EEeccc--CceeeEe
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSM-EVLASR--QLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI-~~~~~~--~~Ll~l~ 94 (823)
-|+++...+++|+-|+-||.+.+|+..+.. .++..+. ..+|... .++++. ..+++-+
T Consensus 107 WVSsv~~~~~~IltgsYDg~~riWd~~Gk~-------------------~~~~~Gh-t~~ik~v~~v~~n~~~~~fvsas 166 (423)
T KOG0313|consen 107 WVSSVKGASKWILTGSYDGTSRIWDLKGKS-------------------IKTIVGH-TGPIKSVAWVIKNSSSCLFVSAS 166 (423)
T ss_pred hhhhhcccCceEEEeecCCeeEEEecCCce-------------------EEEEecC-CcceeeeEEEecCCccceEEEec
Confidence 466666668999999999999999876532 1222333 3466643 333322 2244444
Q ss_pred C-c-EEEEeCCCCcccc---cc--cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCC-----Ccee--------Eeee
Q 003405 95 E-S-IAFHRLPNLETIA---VL--TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGG-----RGFV--------EVKD 153 (823)
Q Consensus 95 d-~-l~~~~L~~l~~~~---~i--~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~-----~~f~--------~~ke 153 (823)
. . +.+|....-+..- .. ....+|.++.++.+..++|-| -...|.|+....+ ..+. ..++
T Consensus 167 ~Dqtl~Lw~~~~~~~~~~~~~~~~GHk~~V~sVsv~~sgtr~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~ 246 (423)
T KOG0313|consen 167 MDQTLRLWKWNVGENKVKALKVCRGHKRSVDSVSVDSSGTRFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQKREKE 246 (423)
T ss_pred CCceEEEEEecCchhhhhHHhHhcccccceeEEEecCCCCeEEeecccceeeecccCCCccccccccchhhhhhhhhhhc
Confidence 3 3 7777654322111 11 233467788888887777777 6677878762110 0010 0111
Q ss_pred -------ecC---CCCceEEEecCCeEEEEEc--CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEE--eCCeEE
Q 003405 154 -------FGV---PDTVKSMSWCGENICIAIR--KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLG--KENIGV 219 (823)
Q Consensus 154 -------i~~---~~~~~~l~~~~~~i~v~~~--~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~--~~~~gv 219 (823)
+.+ -+++.++.|.+..+.+... .....-|+.+|....-...++. -.+|...+....|+| .+....
T Consensus 247 ~~~r~P~vtl~GHt~~Vs~V~w~d~~v~yS~SwDHTIk~WDletg~~~~~~~~~ks-l~~i~~~~~~~Ll~~gssdr~ir 325 (423)
T KOG0313|consen 247 GGTRTPLVTLEGHTEPVSSVVWSDATVIYSVSWDHTIKVWDLETGGLKSTLTTNKS-LNCISYSPLSKLLASGSSDRHIR 325 (423)
T ss_pred ccccCceEEecccccceeeEEEcCCCceEeecccceEEEEEeecccceeeeecCcc-eeEeecccccceeeecCCCCcee
Confidence 112 2688899999866555544 6678889998765433332221 122323344455553 233334
Q ss_pred EEcC---CCcccc---------CCceeecCCCcEEEEeCCEEE--EEeCCeEEEEEccCCCceeEEEeeCCccccc
Q 003405 220 FVDQ---NGKLLQ---------ADRICWSEAPIAVIIQKPYAI--ALLPRRVEVRSLRVPYALIQTIVLQNVRHLI 281 (823)
Q Consensus 220 fv~~---~G~~~~---------~~~i~w~~~P~~v~~~~PYll--~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~ 281 (823)
.+|+ +|..++ -..+.|+-. .+|.+ +-+++.+-+.+++-+..-..+|.--+-+.+.
T Consensus 326 l~DPR~~~gs~v~~s~~gH~nwVssvkwsp~-------~~~~~~S~S~D~t~klWDvRS~k~plydI~~h~DKvl~ 394 (423)
T KOG0313|consen 326 LWDPRTGDGSVVSQSLIGHKNWVSSVKWSPT-------NEFQLVSGSYDNTVKLWDVRSTKAPLYDIAGHNDKVLS 394 (423)
T ss_pred ecCCCCCCCceeEEeeecchhhhhheecCCC-------CceEEEEEecCCeEEEEEeccCCCcceeeccCCceEEE
Confidence 4443 222221 123444322 23443 3445788888887433344455433334443
No 224
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=80.25 E-value=1.5e+02 Score=35.99 Aligned_cols=58 Identities=5% Similarity=-0.045 Sum_probs=40.1
Q ss_pred ChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 302 PLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 302 ~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+..-+..++++|++++|+.+++....... .. .......|..++..|++++|...|.+
T Consensus 44 ~~~~~~~~~~~~g~~~~A~~l~~~~l~~~p----~~-~~~l~~l~~~~l~~g~~~~A~~~l~~ 101 (656)
T PRK15174 44 NIILFAIACLRKDETDVGLTLLSDRVLTAK----NG-RDLLRRWVISPLASSQPDAVLQVVNK 101 (656)
T ss_pred CHHHHHHHHHhcCCcchhHHHhHHHHHhCC----Cc-hhHHHHHhhhHhhcCCHHHHHHHHHH
Confidence 345567788999999999999976521111 01 12333445666779999999999987
No 225
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=79.42 E-value=1.2e+02 Score=34.41 Aligned_cols=83 Identities=12% Similarity=0.005 Sum_probs=45.6
Q ss_pred CCCcEEEEeCC-EEEEEeCCeEEEEEccCCCceeEEEeeCCcccccccCCeEEEeccceEEEeeccChhHHHHHHHhcCC
Q 003405 237 EAPIAVIIQKP-YAIALLPRRVEVRSLRVPYALIQTIVLQNVRHLIPSSNAVVVALENSIFGLFPVPLGAQIVQLTASGD 315 (823)
Q Consensus 237 ~~P~~v~~~~P-Yll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~~~~~~v~v~s~~~I~~l~~~~~~~qI~~Ll~~~~ 315 (823)
..|..+.|+.- =++...++.+.+.... +..+ ++..++..++.+..+++-|-|.+.+..|..+|-.. ...+.-|.
T Consensus 260 ~~p~~~~WCG~dav~l~~~~~l~lvg~~--~~~~-~~~~~~~~~l~~E~DG~riit~~~~~~l~~Vp~~~--~~if~igs 334 (410)
T PF04841_consen 260 SPPKQMAWCGNDAVVLSWEDELLLVGPD--GDSI-SFWYDGPVILVSEIDGVRIITSTSHEFLQRVPDST--ENIFRIGS 334 (410)
T ss_pred CCCcEEEEECCCcEEEEeCCEEEEECCC--CCce-EEeccCceEEeccCCceEEEeCCceEEEEECCHHH--HHHhcccC
Confidence 45566665542 2333334454444432 2222 23334444566667778888999999999998653 35555555
Q ss_pred HHHHHHHhh
Q 003405 316 FEEALALCK 324 (823)
Q Consensus 316 ~e~Al~L~~ 324 (823)
-+=|--|++
T Consensus 335 ~~p~a~L~~ 343 (410)
T PF04841_consen 335 TSPGAILLD 343 (410)
T ss_pred CCccHHHHH
Confidence 444444443
No 226
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=79.14 E-value=73 Score=31.85 Aligned_cols=124 Identities=15% Similarity=0.231 Sum_probs=65.7
Q ss_pred EEEEEEcCCCceeEeeeecC--CCCceEEEec--CCeEEEEEc---CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCe
Q 003405 137 VCIFRHDGGRGFVEVKDFGV--PDTVKSMSWC--GENICIAIR---KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGE 209 (823)
Q Consensus 137 i~l~~~~~~~~f~~~kei~~--~~~~~~l~~~--~~~i~v~~~---~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~E 209 (823)
..||.++.... ....+.+ .++|..++|. |+.+++... ....++|++.. .++..+......+.+.|++.
T Consensus 39 ~~l~~~~~~~~--~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~---~i~~~~~~~~n~i~wsP~G~ 113 (194)
T PF08662_consen 39 FELFYLNEKNI--PVESIELKKEGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGK---KIFSFGTQPRNTISWSPDGR 113 (194)
T ss_pred EEEEEEecCCC--ccceeeccCCCceEEEEECcCCCEEEEEEccCCcccEEEcCccc---EeEeecCCCceEEEECCCCC
Confidence 45666643211 1223333 3469999998 566655542 46788888632 22333333344577788898
Q ss_pred EEEE--eCCe---EEEEcCCC-cccc------CCceeecCCCcEEEEeCCEEEEEe-------CCeEEEEEccCCCceeE
Q 003405 210 LLLG--KENI---GVFVDQNG-KLLQ------ADRICWSEAPIAVIIQKPYAIALL-------PRRVEVRSLRVPYALIQ 270 (823)
Q Consensus 210 fLL~--~~~~---gvfv~~~G-~~~~------~~~i~w~~~P~~v~~~~PYll~~~-------~~~ieV~~l~~~~~lvQ 270 (823)
+++. .++. ..|.|.+. .... ...+.|+-. .-|+++.. ++++.|++.. +.++.
T Consensus 114 ~l~~~g~~n~~G~l~~wd~~~~~~i~~~~~~~~t~~~WsPd-------Gr~~~ta~t~~r~~~dng~~Iw~~~--G~~l~ 184 (194)
T PF08662_consen 114 FLVLAGFGNLNGDLEFWDVRKKKKISTFEHSDATDVEWSPD-------GRYLATATTSPRLRVDNGFKIWSFQ--GRLLY 184 (194)
T ss_pred EEEEEEccCCCcEEEEEECCCCEEeeccccCcEEEEEEcCC-------CCEEEEEEeccceeccccEEEEEec--CeEeE
Confidence 8774 3443 45666542 1110 122333222 23554443 2567788873 66666
Q ss_pred EEee
Q 003405 271 TIVL 274 (823)
Q Consensus 271 ~i~l 274 (823)
..++
T Consensus 185 ~~~~ 188 (194)
T PF08662_consen 185 KKPF 188 (194)
T ss_pred ecch
Confidence 6554
No 227
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=79.09 E-value=20 Score=38.71 Aligned_cols=65 Identities=17% Similarity=0.242 Sum_probs=50.1
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc--EEEEeCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES--IAFHRLPN 104 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~--l~~~~L~~ 104 (823)
+.+..|..||+|.+|++.-. .+.-+..++ ..-|..+.+-|.+..+++++|. +.+|++.+
T Consensus 305 ~~l~s~SrDktIk~wdv~tg------------------~cL~tL~gh-dnwVr~~af~p~Gkyi~ScaDDktlrvwdl~~ 365 (406)
T KOG0295|consen 305 QVLGSGSRDKTIKIWDVSTG------------------MCLFTLVGH-DNWVRGVAFSPGGKYILSCADDKTLRVWDLKN 365 (406)
T ss_pred cEEEeecccceEEEEeccCC------------------eEEEEEecc-cceeeeeEEcCCCeEEEEEecCCcEEEEEecc
Confidence 58999999999999997631 233344444 4679999999999999999993 99999987
Q ss_pred Cccccc
Q 003405 105 LETIAV 110 (823)
Q Consensus 105 l~~~~~ 110 (823)
..-+..
T Consensus 366 ~~cmk~ 371 (406)
T KOG0295|consen 366 LQCMKT 371 (406)
T ss_pred ceeeec
Confidence 654433
No 228
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=79.06 E-value=16 Score=43.97 Aligned_cols=136 Identities=13% Similarity=0.202 Sum_probs=83.4
Q ss_pred EEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc--
Q 003405 19 IDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES-- 96 (823)
Q Consensus 19 I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~-- 96 (823)
|+=+...+++++.|...|+|..-+.+ +++.++++.++ ...|..+.+ ..|+|++|+=.
T Consensus 180 v~imR~Nnr~lf~G~t~G~V~LrD~~------------------s~~~iht~~aH-s~siSDfDv--~GNlLitCG~S~R 238 (1118)
T KOG1275|consen 180 VTIMRYNNRNLFCGDTRGTVFLRDPN------------------SFETIHTFDAH-SGSISDFDV--QGNLLITCGYSMR 238 (1118)
T ss_pred eEEEEecCcEEEeecccceEEeecCC------------------cCceeeeeecc-ccceeeeec--cCCeEEEeecccc
Confidence 55566667899999999998876543 34556666665 456777654 46888888631
Q ss_pred ---------EEEEeCCCCcccccccCCCCcEEEEeeCCC-ceEEEEE-cCeEEEEE---EcCCCceeEeeee-cCCCCce
Q 003405 97 ---------IAFHRLPNLETIAVLTKAKGANVYSWDDRR-GFLCFAR-QKRVCIFR---HDGGRGFVEVKDF-GVPDTVK 161 (823)
Q Consensus 97 ---------l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~-~~l~V~~-kkki~l~~---~~~~~~f~~~kei-~~~~~~~ 161 (823)
|+||||..++.+..++...|-.+.-..+.. .++||+. -....+.. |.+- -..++-+ .....+.
T Consensus 239 ~~~l~~D~FvkVYDLRmmral~PI~~~~~P~flrf~Psl~t~~~V~S~sGq~q~vd~~~lsNP--~~~~~~v~p~~s~i~ 316 (1118)
T KOG1275|consen 239 RYNLAMDPFVKVYDLRMMRALSPIQFPYGPQFLRFHPSLTTRLAVTSQSGQFQFVDTATLSNP--PAGVKMVNPNGSGIS 316 (1118)
T ss_pred cccccccchhhhhhhhhhhccCCcccccCchhhhhcccccceEEEEecccceeeccccccCCC--ccceeEEccCCCcce
Confidence 889999888777776666666655555544 3567663 33444444 3321 1112222 2223356
Q ss_pred EEEec--CCeEEEEEcCc
Q 003405 162 SMSWC--GENICIAIRKG 177 (823)
Q Consensus 162 ~l~~~--~~~i~v~~~~~ 177 (823)
++.+. |+.+.+|...+
T Consensus 317 ~fDiSsn~~alafgd~~g 334 (1118)
T KOG1275|consen 317 AFDISSNGDALAFGDHEG 334 (1118)
T ss_pred eEEecCCCceEEEecccC
Confidence 65554 66787777643
No 229
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=78.81 E-value=26 Score=43.65 Aligned_cols=152 Identities=12% Similarity=0.141 Sum_probs=82.3
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EE
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IA 98 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~ 98 (823)
....++.+++-|++||+|.+|+..+...+.. .+.-.-+. .....++.++..++..+-+++=++ | |.
T Consensus 1056 ~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~-----------s~rS~lty-s~~~sr~~~vt~~~~~~~~Av~t~DG~v~ 1123 (1431)
T KOG1240|consen 1056 VSSEHTSLFVSGSDDGTVKVWNLRKLEGEGG-----------SARSELTY-SPEGSRVEKVTMCGNGDQFAVSTKDGSVR 1123 (1431)
T ss_pred ecCCCCceEEEecCCceEEEeeehhhhcCcc-----------eeeeeEEE-eccCCceEEEEeccCCCeEEEEcCCCeEE
Confidence 3334457889999999999999876543311 11111111 113578999999998877666664 6 88
Q ss_pred EEeCCCCccc------ccccCCC--Cc----EEEEeeCCCceEEEEEcC-eEEEEEEcCCCceeEeeeecC-CCCceEEE
Q 003405 99 FHRLPNLETI------AVLTKAK--GA----NVYSWDDRRGFLCFARQK-RVCIFRHDGGRGFVEVKDFGV-PDTVKSMS 164 (823)
Q Consensus 99 ~~~L~~l~~~------~~i~~~k--g~----~~fa~~~~~~~l~V~~kk-ki~l~~~~~~~~f~~~kei~~-~~~~~~l~ 164 (823)
+++++....- ..++..+ |+ +.|........++++.+. +|..+........... +..+ .+.+++++
T Consensus 1124 ~~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~~D~r~~~~~w~l-k~~~~hG~vTSi~ 1202 (1431)
T KOG1240|consen 1124 VLRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVSWDTRMRHDAWRL-KNQLRHGLVTSIV 1202 (1431)
T ss_pred EEEccccccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccceEEecchhhhhHHhh-hcCccccceeEEE
Confidence 9888763211 1112222 22 112222222234444443 3433333211111111 1222 36788887
Q ss_pred ec--CCeEEEEEcCce-EEEEcCC
Q 003405 165 WC--GENICIAIRKGY-MILNATN 185 (823)
Q Consensus 165 ~~--~~~i~v~~~~~y-~lidl~~ 185 (823)
.. ++.+|+|+.+|. .+.|+.=
T Consensus 1203 idp~~~WlviGts~G~l~lWDLRF 1226 (1431)
T KOG1240|consen 1203 IDPWCNWLVIGTSRGQLVLWDLRF 1226 (1431)
T ss_pred ecCCceEEEEecCCceEEEEEeec
Confidence 76 678999999876 4557654
No 230
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=78.47 E-value=28 Score=37.15 Aligned_cols=108 Identities=15% Similarity=0.230 Sum_probs=75.3
Q ss_pred CCCeeEEEEecccCceeeEeCc----EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeE
Q 003405 75 KKPILSMEVLASRQLLLSLSES----IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVE 150 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d~----l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~ 150 (823)
+..|..|..-++...+.+..|. +-+|++.+++....+.....+.+|..++...+++|... +..+|-|....-+..
T Consensus 318 k~g~g~lafs~Ds~y~aTrnd~~PnalW~Wdlq~l~l~avLiQk~piraf~WdP~~prL~vctg-~srLY~W~psg~~~V 396 (447)
T KOG4497|consen 318 KCGAGKLAFSCDSTYAATRNDKYPNALWLWDLQNLKLHAVLIQKHPIRAFEWDPGRPRLVVCTG-KSRLYFWAPSGPRVV 396 (447)
T ss_pred ccccceeeecCCceEEeeecCCCCceEEEEechhhhhhhhhhhccceeEEEeCCCCceEEEEcC-CceEEEEcCCCceEE
Confidence 3457777777788888888883 89999999886666667778999999999888877633 344788864322211
Q ss_pred eeeecCCC-CceEEEec--CCeEEEEEcCceEEEEcCC
Q 003405 151 VKDFGVPD-TVKSMSWC--GENICIAIRKGYMILNATN 185 (823)
Q Consensus 151 ~kei~~~~-~~~~l~~~--~~~i~v~~~~~y~lidl~~ 185 (823)
-++.++ .|+.+.|. |+.|.+..+..|++--+.+
T Consensus 397 --~vP~~GF~i~~l~W~~~g~~i~l~~kDafc~a~ve~ 432 (447)
T KOG4497|consen 397 --GVPKKGFNIQKLQWLQPGEFIVLCGKDAFCVAIVED 432 (447)
T ss_pred --ecCCCCceeeeEEecCCCcEEEEEcCCceEEEEecC
Confidence 112222 57788887 7778887777787654443
No 231
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=78.15 E-value=5.8 Score=35.32 Aligned_cols=67 Identities=18% Similarity=0.350 Sum_probs=47.2
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 304 GAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
-.....+++.|++++|+..++......+. .......+...|..++..++|++|+.+|.+ ++.++|+.
T Consensus 6 ~~~~~~~~~~~~~~~A~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~-------~~~~~p~~ 72 (119)
T TIGR02795 6 YDAALLVLKAGDYADAIQAFQAFLKKYPK--STYAPNAHYWLGEAYYAQGKYADAAKAFLA-------VVKKYPKS 72 (119)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCCC--ccccHHHHHHHHHHHHhhccHHHHHHHHHH-------HHHHCCCC
Confidence 35567788999999999999775211110 011234566789999999999999999986 34556554
No 232
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=78.13 E-value=7.8 Score=42.05 Aligned_cols=76 Identities=17% Similarity=0.240 Sum_probs=55.0
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
+.|+|+.-. +++||+|+..|.+..|++..... +...++++ ...|..|..-|...++.+++
T Consensus 248 ~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl-----------------~g~~~kg~-tGsirsih~hp~~~~las~G 309 (412)
T KOG3881|consen 248 NPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKL-----------------LGCGLKGI-TGSIRSIHCHPTHPVLASCG 309 (412)
T ss_pred CcceeeeecCCCcEEEEecccchhheecccCcee-----------------eccccCCc-cCCcceEEEcCCCceEEeec
Confidence 457776543 68999999999999999775321 11224555 46899999999888888876
Q ss_pred -C-cEEEEeCCCCccccc
Q 003405 95 -E-SIAFHRLPNLETIAV 110 (823)
Q Consensus 95 -d-~l~~~~L~~l~~~~~ 110 (823)
| .|++|+..+-+.++.
T Consensus 310 LDRyvRIhD~ktrkll~k 327 (412)
T KOG3881|consen 310 LDRYVRIHDIKTRKLLHK 327 (412)
T ss_pred cceeEEEeecccchhhhh
Confidence 5 499999877544444
No 233
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=77.87 E-value=84 Score=31.87 Aligned_cols=169 Identities=14% Similarity=0.146 Sum_probs=86.5
Q ss_pred ceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceE-EEecCCeEEEEEc-CceEEEEcCCCCeeecc-CCC--C--CC
Q 003405 127 GFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKS-MSWCGENICIAIR-KGYMILNATNGALSEVF-PSG--R--IG 198 (823)
Q Consensus 127 ~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~-l~~~~~~i~v~~~-~~y~lidl~~~~~~~L~-~~~--~--~~ 198 (823)
+.++++ ....|..+....++ ...++.+++++.. ....++.+++++. .....+|..+|+...-. ... . ..
T Consensus 37 ~~v~~~~~~~~l~~~d~~tG~---~~W~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~ 113 (238)
T PF13360_consen 37 GRVYVASGDGNLYALDAKTGK---VLWRFDLPGPISGAPVVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVR 113 (238)
T ss_dssp TEEEEEETTSEEEEEETTTSE---EEEEEECSSCGGSGEEEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB
T ss_pred CEEEEEcCCCEEEEEECCCCC---EEEEeeccccccceeeecccccccccceeeeEecccCCcceeeeeccccccccccc
Confidence 456665 44445444443232 2333444444332 2456888998886 45888999999865442 111 0 01
Q ss_pred CCEEEEccCCeEEEEe-CCeEEEEc-CCCccccCCceeecCCC------------cEEEEeCCEEEEEeCCe--EEEEEc
Q 003405 199 PPLVVSLLSGELLLGK-ENIGVFVD-QNGKLLQADRICWSEAP------------IAVIIQKPYAIALLPRR--VEVRSL 262 (823)
Q Consensus 199 ~p~i~~~~~~EfLL~~-~~~gvfv~-~~G~~~~~~~i~w~~~P------------~~v~~~~PYll~~~~~~--ieV~~l 262 (823)
.+....+.++.++++. +...+.+| .+|+.. ........+ ....+....+++...+. +.+ ++
T Consensus 114 ~~~~~~~~~~~~~~~~~~g~l~~~d~~tG~~~--w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~~~~~-d~ 190 (238)
T PF13360_consen 114 SSSSPAVDGDRLYVGTSSGKLVALDPKTGKLL--WKYPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDGRVVAV-DL 190 (238)
T ss_dssp --SEEEEETTEEEEEETCSEEEEEETTTTEEE--EEEESSTT-SS--EEEETTEEEEEECCTTEEEEECCTSSEEEE-ET
T ss_pred cccCceEecCEEEEEeccCcEEEEecCCCcEE--EEeecCCCCCCcceeeecccccceEEECCEEEEEcCCCeEEEE-EC
Confidence 1222233355566655 56667778 467642 122222222 23334445777777654 566 77
Q ss_pred cCCCceeEEEeeCCccc-ccccCCeEEEec-cceEEEeeccC
Q 003405 263 RVPYALIQTIVLQNVRH-LIPSSNAVVVAL-ENSIFGLFPVP 302 (823)
Q Consensus 263 ~~~~~lvQ~i~l~~~~~-l~~~~~~v~v~s-~~~I~~l~~~~ 302 (823)
. ++..+-+.+..+... ....++.+|+++ ++.|+++....
T Consensus 191 ~-tg~~~w~~~~~~~~~~~~~~~~~l~~~~~~~~l~~~d~~t 231 (238)
T PF13360_consen 191 A-TGEKLWSKPISGIYSLPSVDGGTLYVTSSDGRLYALDLKT 231 (238)
T ss_dssp T-TTEEEEEECSS-ECECEECCCTEEEEEETTTEEEEEETTT
T ss_pred C-CCCEEEEecCCCccCCceeeCCEEEEEeCCCEEEEEECCC
Confidence 6 355332333333222 234466677665 67888876543
No 234
>KOG0553 consensus TPR repeat-containing protein [General function prediction only]
Probab=77.64 E-value=4.7 Score=42.50 Aligned_cols=61 Identities=21% Similarity=0.333 Sum_probs=43.5
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc-CCCH
Q 003405 304 GAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS-QVDI 369 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~-~~dP 369 (823)
.-+...+.+-|+|+.|+.=|++....|. ...+.+.+.|..++..|+|++|.+.|.++ ++||
T Consensus 119 cNRAAAy~~Lg~~~~AVkDce~Al~iDp-----~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP 180 (304)
T KOG0553|consen 119 CNRAAAYSKLGEYEDAVKDCESALSIDP-----HYSKAYGRLGLAYLALGKYEEAIEAYKKALELDP 180 (304)
T ss_pred HHHHHHHHHhcchHHHHHHHHHHHhcCh-----HHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCC
Confidence 3456666677777777777765432222 34566778899999999999999999884 4555
No 235
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=77.61 E-value=97 Score=32.45 Aligned_cols=148 Identities=17% Similarity=0.245 Sum_probs=86.0
Q ss_pred CCeeEEEEecc-cCceeeEeCc--EEEEeCCC---Ccccccc--cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCC
Q 003405 76 KPILSMEVLAS-RQLLLSLSES--IAFHRLPN---LETIAVL--TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGR 146 (823)
Q Consensus 76 ~~I~qI~~~~~-~~~Ll~l~d~--l~~~~L~~---l~~~~~i--~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~ 146 (823)
.+|-.+..=|. ..+|.++++. |++|.+.+ +.-.+.+ .-.|.+..+|..+....+|.| ....+.||+-.++
T Consensus 15 ~r~W~~awhp~~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~- 93 (312)
T KOG0645|consen 15 DRVWSVAWHPGKGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKEDG- 93 (312)
T ss_pred CcEEEEEeccCCceEEEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCCC-
Confidence 35555555555 3456666653 99999874 2211111 234677788998887778887 5566777776644
Q ss_pred ceeEeeeecCC-CCceEEEec--CCeEEEEEc-CceEEEEcCCCC---eeeccCCCCCCCCEEEEccCCeEEE--EeCCe
Q 003405 147 GFVEVKDFGVP-DTVKSMSWC--GENICIAIR-KGYMILNATNGA---LSEVFPSGRIGPPLVVSLLSGELLL--GKENI 217 (823)
Q Consensus 147 ~f~~~kei~~~-~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~---~~~L~~~~~~~~p~i~~~~~~EfLL--~~~~~ 217 (823)
.|.-+..+.-+ ..+++++|. |+.+.-+++ ++.-+.-+..+. ...++..-.+.--.+++=|..++|. .|||.
T Consensus 94 efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDnT 173 (312)
T KOG0645|consen 94 EFECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDNT 173 (312)
T ss_pred ceeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEEcCCcceeEEeccCCe
Confidence 68766555555 389999998 556666665 455555544321 2222322211122344546667776 36776
Q ss_pred EEEEcCC
Q 003405 218 GVFVDQN 224 (823)
Q Consensus 218 gvfv~~~ 224 (823)
.=|+..+
T Consensus 174 Ik~~~~~ 180 (312)
T KOG0645|consen 174 IKVYRDE 180 (312)
T ss_pred EEEEeec
Confidence 6555433
No 236
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=77.17 E-value=8.3 Score=41.69 Aligned_cols=72 Identities=21% Similarity=0.305 Sum_probs=50.3
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
+..|+.|...+ ..|.-|..+|++.+|++....++ ..+.+|+ +++.||+.|.--|...-.+..
T Consensus 302 ~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~~~~---------------~pVA~fk-~Hk~pItsieW~p~e~s~iaa 365 (440)
T KOG0302|consen 302 NSDVNVISWNRREPLLASGGDDGTLSIWDLRQFKSG---------------QPVATFK-YHKAPITSIEWHPHEDSVIAA 365 (440)
T ss_pred CCceeeEEccCCcceeeecCCCceEEEEEhhhccCC---------------CcceeEE-eccCCeeEEEeccccCceEEe
Confidence 34555554444 36999999999999998865542 1234555 568999999988866555544
Q ss_pred eC---cEEEEeCC
Q 003405 94 SE---SIAFHRLP 103 (823)
Q Consensus 94 ~d---~l~~~~L~ 103 (823)
++ .|.+|+|.
T Consensus 366 sg~D~QitiWDls 378 (440)
T KOG0302|consen 366 SGEDNQITIWDLS 378 (440)
T ss_pred ccCCCcEEEEEee
Confidence 43 39999984
No 237
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=76.97 E-value=34 Score=36.74 Aligned_cols=156 Identities=14% Similarity=0.261 Sum_probs=95.2
Q ss_pred ccCCCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCcee
Q 003405 12 ISNCSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLL 91 (823)
Q Consensus 12 ~~~~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll 91 (823)
+..-...|-|..-.+.-|+-|++|.++.+|+.+.... .++.-+ +-.+|..+.+- .++++
T Consensus 233 L~GHtGSVLCLqyd~rviisGSSDsTvrvWDv~tge~------------------l~tlih-HceaVLhlrf~--ng~mv 291 (499)
T KOG0281|consen 233 LTGHTGSVLCLQYDERVIVSGSSDSTVRVWDVNTGEP------------------LNTLIH-HCEAVLHLRFS--NGYMV 291 (499)
T ss_pred hhcCCCcEEeeeccceEEEecCCCceEEEEeccCCch------------------hhHHhh-hcceeEEEEEe--CCEEE
Confidence 3333567889988888889999999999999774321 122111 23467777654 57899
Q ss_pred eEeC--cEEEEeCCCCccccc----ccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC-CCCceEE
Q 003405 92 SLSE--SIAFHRLPNLETIAV----LTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV-PDTVKSM 163 (823)
Q Consensus 92 ~l~d--~l~~~~L~~l~~~~~----i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~-~~~~~~l 163 (823)
+++- .+.+|++..-..++- +.....++.+-.++. .|+-| ..+.|.++..... +|. +.+.- .-.|.|+
T Consensus 292 tcSkDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~k--yIVsASgDRTikvW~~st~-efv--Rtl~gHkRGIACl 366 (499)
T KOG0281|consen 292 TCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDK--YIVSASGDRTIKVWSTSTC-EFV--RTLNGHKRGIACL 366 (499)
T ss_pred EecCCceeEEEeccCchHHHHHHHHhhhhhheeeeccccc--eEEEecCCceEEEEeccce-eee--hhhhcccccceeh
Confidence 9885 499999965432221 111123444433432 44444 5677777776633 343 22221 1356677
Q ss_pred EecCCeEEEEEc-CceEEEEcCCCCeeeccC
Q 003405 164 SWCGENICIAIR-KGYMILNATNGALSEVFP 193 (823)
Q Consensus 164 ~~~~~~i~v~~~-~~y~lidl~~~~~~~L~~ 193 (823)
.++|..++=|+. ....+.|+..|.....+.
T Consensus 367 QYr~rlvVSGSSDntIRlwdi~~G~cLRvLe 397 (499)
T KOG0281|consen 367 QYRDRLVVSGSSDNTIRLWDIECGACLRVLE 397 (499)
T ss_pred hccCeEEEecCCCceEEEEeccccHHHHHHh
Confidence 777777777766 567888998886554443
No 238
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=76.95 E-value=6.1 Score=41.25 Aligned_cols=66 Identities=12% Similarity=0.165 Sum_probs=48.5
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
.+...++++|+|++|+.+++.+....+ . ...........|..+|..++|++|...|.+ +|.+||+-
T Consensus 37 ~~A~~~~~~g~y~~Ai~~f~~l~~~yP-~-s~~a~~a~l~la~ayy~~~~y~~A~~~~e~-------fi~~~P~~ 102 (243)
T PRK10866 37 ATAQQKLQDGNWKQAITQLEALDNRYP-F-GPYSQQVQLDLIYAYYKNADLPLAQAAIDR-------FIRLNPTH 102 (243)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCC-C-ChHHHHHHHHHHHHHHhcCCHHHHHHHHHH-------HHHhCcCC
Confidence 457778899999999999988632111 0 112234455679999999999999999886 67788776
No 239
>PF14762 HPS3_Mid: Hermansky-Pudlak syndrome 3, middle region
Probab=76.75 E-value=33 Score=37.77 Aligned_cols=165 Identities=13% Similarity=0.118 Sum_probs=100.1
Q ss_pred EEEeCCeEEEEcCCCccccCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCce-------eEEE--eeCCc----
Q 003405 211 LLGKENIGVFVDQNGKLLQADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYAL-------IQTI--VLQNV---- 277 (823)
Q Consensus 211 LL~~~~~gvfv~~~G~~~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~l-------vQ~i--~l~~~---- 277 (823)
+++....|++++..+....-....+++....++...-|+-+++..++|=+.++-.... +.+. .+|..
T Consensus 96 ffS~~~~GyLY~i~~~~~lls~Y~yt~~~~~~~l~~~fLhaiT~~gLet~TlR~s~~~~~~~~~~id~t~~~cP~~s~~v 175 (374)
T PF14762_consen 96 FFSTPHQGYLYNISKPVELLSTYQYTAPVQQVVLTDQFLHAITSEGLETYTLRCSAAAARNEDPYIDTTLKACPPVSMPV 175 (374)
T ss_pred EEecCcceEEEEeeccceEEEEEecCccceEEEeecceeeeeeccccceEEEecchHHhhccCCccccccccCCCCCcch
Confidence 4466788999987665333467778888899999999999999988887665411001 1111 12311
Q ss_pred -----------ccccccCCeEEEec---------------cceEEEeeccChhHHHHHHHhcC-------------CHHH
Q 003405 278 -----------RHLIPSSNAVVVAL---------------ENSIFGLFPVPLGAQIVQLTASG-------------DFEE 318 (823)
Q Consensus 278 -----------~~l~~~~~~v~v~s---------------~~~I~~l~~~~~~~qI~~Ll~~~-------------~~e~ 318 (823)
+.++..++.+++.| .=.+|.+...|+.+-..++++.. ...|
T Consensus 176 c~lgl~~FigL~~v~~~~~hlILLtka~~~~~~~~s~~~~~W~LYiL~~~~~~~Ly~dm~e~A~~yk~~~~~~y~hLL~E 255 (374)
T PF14762_consen 176 CLLGLQPFIGLQAVCHFKNHLILLTKADSEDTEERSSSESSWNLYILNTPSPEQLYKDMVEYANSYKTASPQSYHHLLSE 255 (374)
T ss_pred HHhhhhhhcceeeEeecCCEEEEEEcCCCcccCCcccccCcceEEEEcCCCHHHHHHHHHHHHHHhccCChHHHHHHHHH
Confidence 12233344444322 22588888888877766665422 1455
Q ss_pred HHHHhhhCC----CcchHhhhhcHHHHHHH----HHHHHHc--cCCHHHHHHHHHhcCCCHHHHHHhC
Q 003405 319 ALALCKLLP----PEDASLRAAKEGSIHIR----FAHYLFD--TGSYEEAMEHFLASQVDITYALSLY 376 (823)
Q Consensus 319 Al~L~~~~~----~~~~~~~~~~~~~i~~~----~a~~lf~--~~~f~~A~~~f~~~~~dP~~vi~Lf 376 (823)
|--|++... ..+. .++++++...++ .|+.+-+ +++|..|+-+|..++.++.+||...
T Consensus 256 aHlLLRsaL~~~~~~~~-~~~~eL~~l~reSca~LGD~~~r~~~~d~~lA~pYYkMS~l~i~~Vl~ri 322 (374)
T PF14762_consen 256 AHLLLRSALLDPSQEES-EEKNELRELFRESCALLGDCYSRSDEKDYHLAAPYYKMSGLSISEVLNRI 322 (374)
T ss_pred HHHHHHHHhhhhhhccc-chHHHHHHHHHHHHHHHHhHhhccchHHHHHHHHHHHhcCCCHHHHHHHh
Confidence 655554321 1111 112233333332 3555444 7899999999999999999999984
No 240
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=76.74 E-value=26 Score=38.30 Aligned_cols=115 Identities=18% Similarity=0.360 Sum_probs=73.8
Q ss_pred CCCCeeEEEEeccc-CceeeEeCc--EEEEeCCCCc-------cccc-ccCCCCcEEEEeeCCCce--EEEEEcCeEEEE
Q 003405 74 SKKPILSMEVLASR-QLLLSLSES--IAFHRLPNLE-------TIAV-LTKAKGANVYSWDDRRGF--LCFARQKRVCIF 140 (823)
Q Consensus 74 ~k~~I~qI~~~~~~-~~Ll~l~d~--l~~~~L~~l~-------~~~~-i~~~kg~~~fa~~~~~~~--l~V~~kkki~l~ 140 (823)
++.+|..+.-.|-. +.+.+.+|. +.+|.+|+-- |+-. ....|.|-.++.++.... +..+....+.++
T Consensus 80 Ht~~vLDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~v~iW 159 (472)
T KOG0303|consen 80 HTAPVLDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNTVSIW 159 (472)
T ss_pred ccccccccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCceEEEE
Confidence 36789888888866 456666763 9999997521 1111 122344555666654322 334467777777
Q ss_pred EEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEc-CceEEEEcCCCCeeec
Q 003405 141 RHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIR-KGYMILNATNGALSEV 191 (823)
Q Consensus 141 ~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~-~~y~lidl~~~~~~~L 191 (823)
-..-+.... .+.-||.|.+|+|. |+.+|-+++ +...++|..+|++..-
T Consensus 160 nv~tgeali---~l~hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e 210 (472)
T KOG0303|consen 160 NVGTGEALI---TLDHPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRRGTVVSE 210 (472)
T ss_pred eccCCceee---ecCCCCeEEEEEeccCCceeeeecccceeEEEcCCCCcEeee
Confidence 766443222 23478999999998 667887777 6789999988876444
No 241
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=76.70 E-value=2.3 Score=36.03 Aligned_cols=57 Identities=25% Similarity=0.399 Sum_probs=38.0
Q ss_pred hcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHH
Q 003405 312 ASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITY 371 (823)
Q Consensus 312 ~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~ 371 (823)
++|+|+.|+.+++.....+. .+ ......-..|..+|..|+|++|+..+.+...+|..
T Consensus 1 ~~~~y~~Ai~~~~k~~~~~~--~~-~~~~~~~~la~~~~~~~~y~~A~~~~~~~~~~~~~ 57 (84)
T PF12895_consen 1 DQGNYENAIKYYEKLLELDP--TN-PNSAYLYNLAQCYFQQGKYEEAIELLQKLKLDPSN 57 (84)
T ss_dssp HTT-HHHHHHHHHHHHHHHC--GT-HHHHHHHHHHHHHHHTTHHHHHHHHHHCHTHHHCH
T ss_pred CCccHHHHHHHHHHHHHHCC--CC-hhHHHHHHHHHHHHHCCCHHHHHHHHHHhCCCCCC
Confidence 46899999999987631111 01 12223334699999999999999999775555543
No 242
>PF13570 PQQ_3: PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=76.58 E-value=3.5 Score=29.52 Aligned_cols=24 Identities=25% Similarity=0.410 Sum_probs=18.3
Q ss_pred EEEEEeCCEEEEEeCCCcEEEEcC
Q 003405 20 DAVASYGLKILLGCSDGSLKIYSP 43 (823)
Q Consensus 20 ~ci~~~~~~L~vGT~~G~l~~y~~ 43 (823)
++....++.||+|+.+|.|+.++.
T Consensus 15 ~~~~v~~g~vyv~~~dg~l~ald~ 38 (40)
T PF13570_consen 15 SSPAVAGGRVYVGTGDGNLYALDA 38 (40)
T ss_dssp S--EECTSEEEEE-TTSEEEEEET
T ss_pred cCCEEECCEEEEEcCCCEEEEEeC
Confidence 345777899999999999999874
No 243
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=76.57 E-value=2.5 Score=44.78 Aligned_cols=66 Identities=14% Similarity=0.215 Sum_probs=49.5
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCcEEEE
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSESIAFH 100 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~l~~~ 100 (823)
+++.|++-|.+|...|.++.|++....+.. ..++ .+..+.+.|.|........+|+.++|...+|
T Consensus 314 ~~d~~~~~la~gnq~g~v~vwdL~~~ep~~------------~ttl---~~s~~~~tVRQ~sfS~dgs~lv~vcdd~~Vw 378 (385)
T KOG1034|consen 314 AFDPWQKMLALGNQSGKVYVWDLDNNEPPK------------CTTL---THSKSGSTVRQTSFSRDGSILVLVCDDGTVW 378 (385)
T ss_pred eecHHHHHHhhccCCCcEEEEECCCCCCcc------------CceE---EeccccceeeeeeecccCcEEEEEeCCCcEE
Confidence 577888999999999999999998765421 1112 2234568999999999999999889854444
Q ss_pred e
Q 003405 101 R 101 (823)
Q Consensus 101 ~ 101 (823)
.
T Consensus 379 r 379 (385)
T KOG1034|consen 379 R 379 (385)
T ss_pred E
Confidence 4
No 244
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=76.31 E-value=17 Score=39.02 Aligned_cols=114 Identities=14% Similarity=0.206 Sum_probs=63.7
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEEeC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFHRL 102 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~L 102 (823)
++-|+-|+.+|.|+.+|+.....+. .+... .-++..+|+.++++.-.+--++.+| | |++||+
T Consensus 264 ~nLv~~GcRngeI~~iDLR~rnqG~------------~~~a~---rlyh~Ssvtslq~Lq~s~q~LmaS~M~gkikLyD~ 328 (425)
T KOG2695|consen 264 DNLVFNGCRNGEIFVIDLRCRNQGN------------GWCAQ---RLYHDSSVTSLQILQFSQQKLMASDMTGKIKLYDL 328 (425)
T ss_pred CCeeEecccCCcEEEEEeeecccCC------------CcceE---EEEcCcchhhhhhhccccceEeeccCcCceeEeee
Confidence 5678999999999999987543221 12111 1245789999999984555555566 5 999998
Q ss_pred CCCcccccccCCC-CcE-----EEEeeCCCceEEEEEcC-eEEEEEEcCCCceeEeeeecCC
Q 003405 103 PNLETIAVLTKAK-GAN-----VYSWDDRRGFLCFARQK-RVCIFRHDGGRGFVEVKDFGVP 157 (823)
Q Consensus 103 ~~l~~~~~i~~~k-g~~-----~fa~~~~~~~l~V~~kk-ki~l~~~~~~~~f~~~kei~~~ 157 (823)
.-.+-...+.... .++ -+.+++..|.|+.+... -..|+..+.+. .+.++++|
T Consensus 329 R~~K~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~~gh---Ll~tipf~ 387 (425)
T KOG2695|consen 329 RATKCKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLDSGH---LLCTIPFP 387 (425)
T ss_pred hhhhcccceeeeecccccccccccccccccceEEEccCeeEEEEEecccCc---eeeccCCC
Confidence 5443210111111 111 24455555655443333 23466666442 34455444
No 245
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=75.94 E-value=1.6e+02 Score=36.66 Aligned_cols=157 Identities=13% Similarity=0.130 Sum_probs=87.7
Q ss_pred CcEEEEEEeC----CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCC-C--Ce-eEEEEe--cc
Q 003405 17 PKIDAVASYG----LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSK-K--PI-LSMEVL--AS 86 (823)
Q Consensus 17 ~~I~ci~~~~----~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k-~--~I-~qI~~~--~~ 86 (823)
..|+.++--+ ..+.+|++||.|.+|.--.... ...+++..+.+.+. . +. ..+.+. ..
T Consensus 1110 t~Vs~l~liNe~D~aLlLtas~dGvIRIwk~y~~~~-------------~~~eLVTaw~~Ls~~~~~~r~~~~v~dWqQ~ 1176 (1387)
T KOG1517|consen 1110 TRVSDLELINEQDDALLLTASSDGVIRIWKDYADKW-------------KKPELVTAWSSLSDQLPGARGTGLVVDWQQQ 1176 (1387)
T ss_pred CccceeeeecccchhheeeeccCceEEEeccccccc-------------CCceeEEeeccccccCccCCCCCeeeehhhh
Confidence 4677766554 3689999999999997332211 23344433322210 0 00 111111 13
Q ss_pred cCceeeEeCc--EEEEeCCCCcccccccCC--CCcEEEEeeCCCc-eEEEE-EcCeEEEEEEcC---CCceeEeeeecCC
Q 003405 87 RQLLLSLSES--IAFHRLPNLETIAVLTKA--KGANVYSWDDRRG-FLCFA-RQKRVCIFRHDG---GRGFVEVKDFGVP 157 (823)
Q Consensus 87 ~~~Ll~l~d~--l~~~~L~~l~~~~~i~~~--kg~~~fa~~~~~~-~l~V~-~kkki~l~~~~~---~~~f~~~kei~~~ 157 (823)
.+.|++-+|. |.+||...=.....++.. ..+++...+...| .|++| ..+.+.+|..+- +......|+..-.
T Consensus 1177 ~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~ 1256 (1387)
T KOG1517|consen 1177 SGHLLVTGDVRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDV 1256 (1387)
T ss_pred CCeEEecCCeeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCCccccceeecccCCc
Confidence 3566666663 899998653333333322 2345555454444 45555 778899998753 2123445666555
Q ss_pred CCceEEEec--CC-eEEEEEc-CceEEEEcCCC
Q 003405 158 DTVKSMSWC--GE-NICIAIR-KGYMILNATNG 186 (823)
Q Consensus 158 ~~~~~l~~~--~~-~i~v~~~-~~y~lidl~~~ 186 (823)
++|..+.+. |. -|+-|+. ....++|+...
T Consensus 1257 ~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~ 1289 (1387)
T KOG1517|consen 1257 EPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMS 1289 (1387)
T ss_pred ccceeEEeecCCCcceeeeccCCeEEEEecccC
Confidence 668888776 22 2555544 67889998874
No 246
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=75.85 E-value=7.8 Score=36.66 Aligned_cols=20 Identities=30% Similarity=0.620 Sum_probs=9.5
Q ss_pred HHHHHHccCCHHHHHHHHHh
Q 003405 345 FAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 345 ~a~~lf~~~~f~~A~~~f~~ 364 (823)
.|..++.+|+|++|+..+..
T Consensus 91 LA~~~~~~~~~d~Al~~L~~ 110 (145)
T PF09976_consen 91 LARILLQQGQYDEALATLQQ 110 (145)
T ss_pred HHHHHHHcCCHHHHHHHHHh
Confidence 34444445555555544433
No 247
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=75.35 E-value=1.8e+02 Score=34.41 Aligned_cols=254 Identities=16% Similarity=0.148 Sum_probs=119.4
Q ss_pred CcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeE-EEEecccCceeeEe
Q 003405 17 PKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILS-MEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~q-I~~~~~~~~Ll~l~ 94 (823)
..+..++... ..+.=|+.||++.+|.-.+.. |.-...+.+. +.-|.- +...+..+-.++.+
T Consensus 15 ~DVr~v~~~~~~~i~s~sRd~t~~vw~~~~~~----------------~l~~~~~~~~-~g~i~~~i~y~e~~~~~l~~g 77 (745)
T KOG0301|consen 15 SDVRAVAVTDGVCIISGSRDGTVKVWAKKGKQ----------------YLETHAFEGP-KGFIANSICYAESDKGRLVVG 77 (745)
T ss_pred cchheeEecCCeEEeecCCCCceeeeeccCcc----------------cccceecccC-cceeeccceeccccCcceEee
Confidence 3443333333 357778889999999754321 1111112211 222333 55565333333333
Q ss_pred --Cc-EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCe
Q 003405 95 --ES-IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGEN 169 (823)
Q Consensus 95 --d~-l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~ 169 (823)
|. +.+|.+...+|...+..-+ +|.+.+.+++ +.++-+ -.+.+.+++...-. ..++ .-+-.+-++...+..
T Consensus 78 ~~D~~i~v~~~~~~~P~~~LkgH~snVC~ls~~~~-~~~iSgSWD~TakvW~~~~l~--~~l~--gH~asVWAv~~l~e~ 152 (745)
T KOG0301|consen 78 GMDTTIIVFKLSQAEPLYTLKGHKSNVCSLSIGED-GTLISGSWDSTAKVWRIGELV--YSLQ--GHTASVWAVASLPEN 152 (745)
T ss_pred cccceEEEEecCCCCchhhhhccccceeeeecCCc-CceEecccccceEEecchhhh--cccC--CcchheeeeeecCCC
Confidence 33 8899998888887654444 3333333333 233333 34555555544200 0000 001233344444333
Q ss_pred EEEEEcCceEEEEcCC-CCeeeccCCCCCCCCEEEEccCCeEEEEeCC-eEEEEcCCCccccCCceeecCCCcEEEE---
Q 003405 170 ICIAIRKGYMILNATN-GALSEVFPSGRIGPPLVVSLLSGELLLGKEN-IGVFVDQNGKLLQADRICWSEAPIAVII--- 244 (823)
Q Consensus 170 i~v~~~~~y~lidl~~-~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~-~gvfv~~~G~~~~~~~i~w~~~P~~v~~--- 244 (823)
.++.- +.=..|-+.. ++....|......-.-++.++++.||=|.++ ...+.+.+|...++ ..+. .++.|
T Consensus 153 ~~vTg-saDKtIklWk~~~~l~tf~gHtD~VRgL~vl~~~~flScsNDg~Ir~w~~~ge~l~~----~~gh-tn~vYsis 226 (745)
T KOG0301|consen 153 TYVTG-SADKTIKLWKGGTLLKTFSGHTDCVRGLAVLDDSHFLSCSNDGSIRLWDLDGEVLLE----MHGH-TNFVYSIS 226 (745)
T ss_pred cEEec-cCcceeeeccCCchhhhhccchhheeeeEEecCCCeEeecCCceEEEEeccCceeee----eecc-ceEEEEEE
Confidence 22221 1112222222 3333333322222223445678899987654 55677888875421 1111 11111
Q ss_pred ---eCCEEEEEeC-CeEEEEEccCCCceeEEEeeCCccccc----ccCCeEEEeccceEEEeecc
Q 003405 245 ---QKPYAIALLP-RRVEVRSLRVPYALIQTIVLQNVRHLI----PSSNAVVVALENSIFGLFPV 301 (823)
Q Consensus 245 ---~~PYll~~~~-~~ieV~~l~~~~~lvQ~i~l~~~~~l~----~~~~~v~v~s~~~I~~l~~~ 301 (823)
...-|+...+ +.+.|... ...+|+|.+|....=. ..++.+.-+|++.|+.+...
T Consensus 227 ~~~~~~~Ivs~gEDrtlriW~~---~e~~q~I~lPttsiWsa~~L~NgDIvvg~SDG~VrVfT~~ 288 (745)
T KOG0301|consen 227 MALSDGLIVSTGEDRTLRIWKK---DECVQVITLPTTSIWSAKVLLNGDIVVGGSDGRVRVFTVD 288 (745)
T ss_pred ecCCCCeEEEecCCceEEEeec---CceEEEEecCccceEEEEEeeCCCEEEeccCceEEEEEec
Confidence 1223333333 46777764 3789999999853211 12444444567777766654
No 248
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=75.32 E-value=1.8e+02 Score=34.51 Aligned_cols=156 Identities=12% Similarity=0.031 Sum_probs=91.6
Q ss_pred HHHHHHhcCChhhHHhhhcC--CCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccccccc
Q 003405 511 LLQALLLTGQSSAALELLKG--LNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQK 588 (823)
Q Consensus 511 Ll~~y~~~~~~~~l~~ll~~--~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~ 588 (823)
++.||...++-+.|..+.+. +++=-+....+.+.+.|+-++.+.-|.+.++..+|+....+|.++.
T Consensus 828 ~~ecly~le~f~~LE~la~~Lpe~s~llp~~a~mf~svGMC~qAV~a~Lr~s~pkaAv~tCv~LnQW~------------ 895 (1189)
T KOG2041|consen 828 QIECLYRLELFGELEVLARTLPEDSELLPVMADMFTSVGMCDQAVEAYLRRSLPKAAVHTCVELNQWG------------ 895 (1189)
T ss_pred HHHHHHHHHhhhhHHHHHHhcCcccchHHHHHHHHHhhchHHHHHHHHHhccCcHHHHHHHHHHHHHH------------
Confidence 45666666655555555543 3444567777788888888888888889999999998888887664
Q ss_pred CChHHHHHHhhcCCCC-ChhhHHHhhhhhhhcCc-ccccccccc--CCCChHHHHHHHhhcCchhHHHHHHHHhhcccCC
Q 003405 589 FNPESIIEYLKPLCGT-DPMLVLEFSMLVLESCP-TQTIELFLS--GNIPADLVNSYLKQYSPSMQGRYLELMLAMNENS 664 (823)
Q Consensus 589 ~~~~~~i~yL~~L~~~-~~~li~~y~~wll~~~p-~~~~~if~~--~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~ 664 (823)
.+++.-+...-. --.||-+|+..+++... .++++.--+ ..++..+++..+-....+--..||+-= .
T Consensus 896 ----~avelaq~~~l~qv~tliak~aaqll~~~~~~eaIe~~Rka~~~~daarll~qmae~e~~K~~p~lr~K---k--- 965 (1189)
T KOG2041|consen 896 ----EAVELAQRFQLPQVQTLIAKQAAQLLADANHMEAIEKDRKAGRHLDAARLLSQMAEREQEKYVPYLRLK---K--- 965 (1189)
T ss_pred ----HHHHHHHhccchhHHHHHHHHHHHHHhhcchHHHHHHhhhcccchhHHHHHHHHhHHHhhccCCHHHHH---H---
Confidence 233333332222 23567788887777432 233333222 247777777776643222223344320 0
Q ss_pred CChhHHHHHHHHHHHHHHHHhhhhh
Q 003405 665 ISGNLQNEMVQIYLSEVLDWYSDLS 689 (823)
Q Consensus 665 ~~~~~h~~L~~lYl~~i~~~~~~~~ 689 (823)
-=.+--.|++-|.+.+....+.+.
T Consensus 966 -lYVL~AlLvE~h~~~ik~~~~~~~ 989 (1189)
T KOG2041|consen 966 -LYVLGALLVENHRQTIKELRKIDK 989 (1189)
T ss_pred -HHHHHHHHHHHHHHHHHHhhhhhh
Confidence 001233477777777766655443
No 249
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=75.19 E-value=67 Score=33.27 Aligned_cols=140 Identities=10% Similarity=0.145 Sum_probs=75.7
Q ss_pred eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe--Cc-EEEEe
Q 003405 25 YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS--ES-IAFHR 101 (823)
Q Consensus 25 ~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~--d~-l~~~~ 101 (823)
+++.++....||.|.+|+....+.+ ++.++. ++..|..+.--+..+..++.+ |+ |++|+
T Consensus 72 ~e~~~~~a~GDGSLrl~d~~~~s~P-----------------i~~~kE-H~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~ 133 (311)
T KOG0277|consen 72 HENQVIAASGDGSLRLFDLTMPSKP-----------------IHKFKE-HKREVYSVDWNTVRRRIFLTSSWDGTIKLWD 133 (311)
T ss_pred CcceEEEEecCceEEEeccCCCCcc-----------------hhHHHh-hhhheEEeccccccceeEEeeccCCceEeec
Confidence 3578999999999999995543221 223332 255666665555444444443 55 99998
Q ss_pred CCCCcccccccCCCCcEE-EEeeCCCce-EEEE-EcCeEEEEEEcCCCceeEeeeecCCC-CceEEEecC--CeEEEEEc
Q 003405 102 LPNLETIAVLTKAKGANV-YSWDDRRGF-LCFA-RQKRVCIFRHDGGRGFVEVKDFGVPD-TVKSMSWCG--ENICIAIR 175 (823)
Q Consensus 102 L~~l~~~~~i~~~kg~~~-fa~~~~~~~-l~V~-~kkki~l~~~~~~~~f~~~kei~~~~-~~~~l~~~~--~~i~v~~~ 175 (823)
..--..+.+......|.. .+.++..+- ++-+ ....+.++.++....+.. |..+. ++.++.|.. ..+.+...
T Consensus 134 ~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk~~~---i~ah~~Eil~cdw~ky~~~vl~Tg~ 210 (311)
T KOG0277|consen 134 PNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPGKFMS---IEAHNSEILCCDWSKYNHNVLATGG 210 (311)
T ss_pred CCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCCCceeE---EEeccceeEeecccccCCcEEEecC
Confidence 644333333222222221 222443332 3333 556677888764222322 44454 888999973 34444322
Q ss_pred --CceEEEEcCC
Q 003405 176 --KGYMILNATN 185 (823)
Q Consensus 176 --~~y~lidl~~ 185 (823)
+.....|+.+
T Consensus 211 vd~~vr~wDir~ 222 (311)
T KOG0277|consen 211 VDNLVRGWDIRN 222 (311)
T ss_pred CCceEEEEehhh
Confidence 4566677765
No 250
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=75.17 E-value=6.5 Score=49.06 Aligned_cols=89 Identities=20% Similarity=0.225 Sum_probs=53.2
Q ss_pred CCCCcEEEEE-EeCCEEEEEeCCCcEEE--EcCCCCCCCC----CCCCcccccccccceeeeeecCCCCCCeeEEEEecc
Q 003405 14 NCSPKIDAVA-SYGLKILLGCSDGSLKI--YSPGSSESDR----SPPSDYQSLRKESYELERTISGFSKKPILSMEVLAS 86 (823)
Q Consensus 14 ~~~~~I~ci~-~~~~~L~vGT~~G~l~~--y~~~~~~~~~----~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~ 86 (823)
..+..|+||. ..+++||+|..||.|+- |....+=.+. ...+.. .+......+. ++.+.++.||.||.+...
T Consensus 176 ~dg~~V~~I~~t~nGRIF~~G~dg~lyEl~Yq~~~gWf~~rc~Kiclt~s-~ls~lvPs~~-~~~~~~~dpI~qi~ID~S 253 (1311)
T KOG1900|consen 176 VDGVSVNCITYTENGRIFFAGRDGNLYELVYQAEDGWFGSRCRKICLTKS-VLSSLVPSLL-SVPGSSKDPIRQITIDNS 253 (1311)
T ss_pred cCCceEEEEEeccCCcEEEeecCCCEEEEEEeccCchhhcccccccCchh-HHHHhhhhhh-cCCCCCCCcceeeEeccc
Confidence 3466899999 44579999999997743 3222111000 000000 0000000011 222344779999999999
Q ss_pred cCceeeEeC-c-EEEEeCCC
Q 003405 87 RQLLLSLSE-S-IAFHRLPN 104 (823)
Q Consensus 87 ~~~Ll~l~d-~-l~~~~L~~ 104 (823)
++++.++++ + +.+|++..
T Consensus 254 R~IlY~lsek~~v~~Y~i~~ 273 (1311)
T KOG1900|consen 254 RNILYVLSEKGTVSAYDIGG 273 (1311)
T ss_pred cceeeeeccCceEEEEEccC
Confidence 999999998 5 99999854
No 251
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=74.72 E-value=7.5 Score=41.04 Aligned_cols=66 Identities=11% Similarity=0.210 Sum_probs=46.2
Q ss_pred HHHHHH-HhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 305 AQIVQL-TASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 305 ~qI~~L-l~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
+....| ++.|+|++|+..++.+....+ ........+-..|..+|..|+|++|...|.+ |+..||+-
T Consensus 147 ~~A~~l~~~~~~y~~Ai~af~~fl~~yP--~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~-------vv~~yP~s 213 (263)
T PRK10803 147 NAAIALVQDKSRQDDAIVAFQNFVKKYP--DSTYQPNANYWLGQLNYNKGKKDDAAYYFAS-------VVKNYPKS 213 (263)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHCc--CCcchHHHHHHHHHHHHHcCCHHHHHHHHHH-------HHHHCCCC
Confidence 344455 678999999999877521100 0012234566779999999999999999875 77788765
No 252
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=74.57 E-value=20 Score=39.66 Aligned_cols=156 Identities=12% Similarity=0.185 Sum_probs=90.0
Q ss_pred eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeee--eecC----CCCCCeeEEEEecc-cCceeeEe-C-
Q 003405 25 YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELER--TISG----FSKKPILSMEVLAS-RQLLLSLS-E- 95 (823)
Q Consensus 25 ~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~--~~~~----~~k~~I~qI~~~~~-~~~Ll~l~-d- 95 (823)
-|+++.|||-+-.|.+|++.-...-.. .-.|+....+..+ ..+. .+..+|..|..-.. .++|++=+ |
T Consensus 191 ~gNyvAiGtmdp~IeIWDLDI~d~v~P----~~~LGs~~sk~~~k~~k~~~~~~gHTdavl~Ls~n~~~~nVLaSgsaD~ 266 (463)
T KOG0270|consen 191 AGNYVAIGTMDPEIEIWDLDIVDAVLP----CVTLGSKASKKKKKKGKRSNSASGHTDAVLALSWNRNFRNVLASGSADK 266 (463)
T ss_pred CcceEEEeccCceeEEecccccccccc----ceeechhhhhhhhhhcccccccccchHHHHHHHhccccceeEEecCCCc
Confidence 357999999999999999875332100 0012211110000 0000 11223333333322 24455444 3
Q ss_pred cEEEEeCCCCccccccc-CCCCcEEEEeeCCCceE-EEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC---Ce
Q 003405 96 SIAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFL-CFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG---EN 169 (823)
Q Consensus 96 ~l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l-~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~---~~ 169 (823)
.|.+|++.+-++..+++ ..+.|++..+++..+.+ .-| .++.+.++..+.-. ..-++..+.+.+-.++|.- +.
T Consensus 267 TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~--~s~~~wk~~g~VEkv~w~~~se~~ 344 (463)
T KOG0270|consen 267 TVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPS--NSGKEWKFDGEVEKVAWDPHSENS 344 (463)
T ss_pred eEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeeccCcc--ccCceEEeccceEEEEecCCCcee
Confidence 39999999888777665 67889999998876654 444 56778888776311 1123455667888889973 45
Q ss_pred EEEEEcCc-eEEEEcCCC
Q 003405 170 ICIAIRKG-YMILNATNG 186 (823)
Q Consensus 170 i~v~~~~~-y~lidl~~~ 186 (823)
.++++..+ .+-+|+.+.
T Consensus 345 f~~~tddG~v~~~D~R~~ 362 (463)
T KOG0270|consen 345 FFVSTDDGTVYYFDIRNP 362 (463)
T ss_pred EEEecCCceEEeeecCCC
Confidence 66666655 355677653
No 253
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=74.43 E-value=56 Score=34.63 Aligned_cols=111 Identities=13% Similarity=0.253 Sum_probs=71.5
Q ss_pred CcEEEEEEeC-CE-EEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASYG-LK-ILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~~-~~-L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
...+-++.|. ++ ++-.+.|-++..|+..+.. .-+..|++. ...|+....- ....++.=+
T Consensus 315 ~ELtHcstHptQrLVvTsSrDtTFRLWDFReaI-----------------~sV~VFQGH-tdtVTS~vF~-~dd~vVSgS 375 (481)
T KOG0300|consen 315 SELTHCSTHPTQRLVVTSSRDTTFRLWDFREAI-----------------QSVAVFQGH-TDTVTSVVFN-TDDRVVSGS 375 (481)
T ss_pred hhccccccCCcceEEEEeccCceeEeccchhhc-----------------ceeeeeccc-ccceeEEEEe-cCCceeecC
Confidence 3455444454 34 4555678888888866321 123455654 3467765433 345666667
Q ss_pred Cc--EEEEeCCCCc-ccccccCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCC
Q 003405 95 ES--IAFHRLPNLE-TIAVLTKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGR 146 (823)
Q Consensus 95 d~--l~~~~L~~l~-~~~~i~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~ 146 (823)
|. |++|+|.+.. ++.+|.....++-++++.....|++-. ++.+.+|...+.+
T Consensus 376 DDrTvKvWdLrNMRsplATIRtdS~~NRvavs~g~~iIAiPhDNRqvRlfDlnG~R 431 (481)
T KOG0300|consen 376 DDRTVKVWDLRNMRSPLATIRTDSPANRVAVSKGHPIIAIPHDNRQVRLFDLNGNR 431 (481)
T ss_pred CCceEEEeeeccccCcceeeecCCccceeEeecCCceEEeccCCceEEEEecCCCc
Confidence 73 9999998874 555665556788888887777788874 4668999998654
No 254
>KOG1840 consensus Kinesin light chain [Cytoskeleton]
Probab=73.93 E-value=56 Score=37.89 Aligned_cols=55 Identities=24% Similarity=0.272 Sum_probs=39.5
Q ss_pred HhcCCHHHHHHHhhhCCC--cchHhhhh-cHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 311 TASGDFEEALALCKLLPP--EDASLRAA-KEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 311 l~~~~~e~Al~L~~~~~~--~~~~~~~~-~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
-.+++||+|..+.+.... .+..-..+ .+..++...|..++..|+|++|-+.|.++
T Consensus 336 ~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~a 393 (508)
T KOG1840|consen 336 QSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKA 393 (508)
T ss_pred HHhcchhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHH
Confidence 467899999998875310 00000011 46678899999999999999999999874
No 255
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=73.70 E-value=87 Score=34.62 Aligned_cols=56 Identities=11% Similarity=0.012 Sum_probs=38.0
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 304 GAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
..-...+...|++++|+.+++.....+. .....+...|..+...|+|++|.+.|.+
T Consensus 111 ~~La~~~~~~g~~~~A~~~~~~~l~~~~-----~~~~~~~~la~~~~~~g~~~~A~~~~~~ 166 (389)
T PRK11788 111 QELGQDYLKAGLLDRAEELFLQLVDEGD-----FAEGALQQLLEIYQQEKDWQKAIDVAER 166 (389)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHcCCc-----chHHHHHHHHHHHHHhchHHHHHHHHHH
Confidence 3345666788899999888876532211 1223455667778888889888888876
No 256
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=73.39 E-value=20 Score=41.61 Aligned_cols=146 Identities=10% Similarity=0.091 Sum_probs=81.5
Q ss_pred CCeeEEEEecccCceeeEeCc-EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeee
Q 003405 76 KPILSMEVLASRQLLLSLSES-IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKD 153 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d~-l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~ke 153 (823)
+.|..|..-|++.-|++-+|+ +.+|+..+-....++..-| -|.++|...+..+++-|.-.|. +.-|... ..-+-.
T Consensus 13 hci~d~afkPDGsqL~lAAg~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~-VI~W~~k--lEG~Lk 89 (1081)
T KOG1538|consen 13 HCINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKS-VIIWTSK--LEGILK 89 (1081)
T ss_pred cchheeEECCCCceEEEecCCEEEEEeCCCcccccccccccceEEEEEEccCCceeccCCCcee-EEEeccc--ccceee
Confidence 488999999988777777774 9999987655555444444 3567777776666676644443 2334321 111112
Q ss_pred ecCCCCceEEEecC-CeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEE-EEccCCeEEE-Ee-CCeEEEEcCCCcc
Q 003405 154 FGVPDTVKSMSWCG-ENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLV-VSLLSGELLL-GK-ENIGVFVDQNGKL 227 (823)
Q Consensus 154 i~~~~~~~~l~~~~-~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i-~~~~~~EfLL-~~-~~~gvfv~~~G~~ 227 (823)
++-.|.|+||.|.. +.+...+. ++|-+....+..++.--. +.+-+. .+-.++.++. +. |+..-+-|..|++
T Consensus 90 YSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~ks---s~R~~~CsWtnDGqylalG~~nGTIsiRNk~gEe 165 (1081)
T KOG1538|consen 90 YSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKS---SSRIICCSWTNDGQYLALGMFNGTISIRNKNGEE 165 (1081)
T ss_pred eccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhh---heeEEEeeecCCCcEEEEeccCceEEeecCCCCc
Confidence 44568999999974 22333333 566655544432221111 112222 2446676654 54 3444455777765
No 257
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=73.29 E-value=44 Score=37.69 Aligned_cols=152 Identities=11% Similarity=0.154 Sum_probs=90.2
Q ss_pred CcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEEEecccCceee
Q 003405 17 PKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSMEVLASRQLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~~~~~~~~Ll~ 92 (823)
..+.|..+.. +++|.|..-| |.+|++....+.. + +.+..... ..-|...+++|+..-|++
T Consensus 419 GEvVcAvtIS~~trhVyTgGkgc-VKVWdis~pg~k~------------P---vsqLdcl~rdnyiRSckL~pdgrtLiv 482 (705)
T KOG0639|consen 419 GEVVCAVTISNPTRHVYTGGKGC-VKVWDISQPGNKS------------P---VSQLDCLNRDNYIRSCKLLPDGRTLIV 482 (705)
T ss_pred CcEEEEEEecCCcceeEecCCCe-EEEeeccCCCCCC------------c---cccccccCcccceeeeEecCCCceEEe
Confidence 3677766554 5889887655 7999988653210 0 11111111 235777888888777777
Q ss_pred EeC--cEEEEeCCCCccc--ccc-cCCCCcEEEEeeCCCc-eEEEEEcCeEEEEEEcCCCceeEeeeec-CCCCceEEEe
Q 003405 93 LSE--SIAFHRLPNLETI--AVL-TKAKGANVYSWDDRRG-FLCFARQKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSW 165 (823)
Q Consensus 93 l~d--~l~~~~L~~l~~~--~~i-~~~kg~~~fa~~~~~~-~l~V~~kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~ 165 (823)
=++ .|.+|+|..-.+- ..+ .....|.+.+++++.. ++.......|.||.+.+- ..++.+. -+|.+.||..
T Consensus 483 GGeastlsiWDLAapTprikaeltssapaCyALa~spDakvcFsccsdGnI~vwDLhnq---~~VrqfqGhtDGascIdi 559 (705)
T KOG0639|consen 483 GGEASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQ---TLVRQFQGHTDGASCIDI 559 (705)
T ss_pred ccccceeeeeeccCCCcchhhhcCCcchhhhhhhcCCccceeeeeccCCcEEEEEcccc---eeeecccCCCCCceeEEe
Confidence 666 3999999542211 111 1223566666676653 233446788888888732 2344443 3578888877
Q ss_pred c--CCeEEEEEc-CceEEEEcCCCC
Q 003405 166 C--GENICIAIR-KGYMILNATNGA 187 (823)
Q Consensus 166 ~--~~~i~v~~~-~~y~lidl~~~~ 187 (823)
. |..|+-|-- +.....|+.+|+
T Consensus 560 s~dGtklWTGGlDntvRcWDlregr 584 (705)
T KOG0639|consen 560 SKDGTKLWTGGLDNTVRCWDLREGR 584 (705)
T ss_pred cCCCceeecCCCccceeehhhhhhh
Confidence 6 556666643 556666776653
No 258
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=73.09 E-value=94 Score=36.40 Aligned_cols=74 Identities=16% Similarity=0.256 Sum_probs=53.1
Q ss_pred HHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCC-CCChhhHHHhhhhhhhc
Q 003405 541 EILQKKNHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLC-GTDPMLVLEFSMLVLES 619 (823)
Q Consensus 541 ~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~-~~~~~li~~y~~wll~~ 619 (823)
+.+.=.|++++.|.++.+.|.-.+||+++..+.-- +.+-+|+..=. .+...|+.+-+.|--..
T Consensus 640 ~~~Ay~gKF~EAAklFk~~G~enRAlEmyTDlRMF----------------D~aQE~~~~g~~~eKKmL~RKRA~WAr~~ 703 (1081)
T KOG1538|consen 640 DVFAYQGKFHEAAKLFKRSGHENRALEMYTDLRMF----------------DYAQEFLGSGDPKEKKMLIRKRADWARNI 703 (1081)
T ss_pred HHHHhhhhHHHHHHHHHHcCchhhHHHHHHHHHHH----------------HHHHHHhhcCChHHHHHHHHHHHHHhhhc
Confidence 34444688999999999999999999999877521 34556665322 23367889999998875
Q ss_pred C-cccccccccc
Q 003405 620 C-PTQTIELFLS 630 (823)
Q Consensus 620 ~-p~~~~~if~~ 630 (823)
+ |..|.+++++
T Consensus 704 kePkaAAEmLiS 715 (1081)
T KOG1538|consen 704 KEPKAAAEMLIS 715 (1081)
T ss_pred CCcHHHHHHhhc
Confidence 4 6677777776
No 259
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=72.96 E-value=9.4 Score=46.32 Aligned_cols=67 Identities=18% Similarity=0.405 Sum_probs=49.8
Q ss_pred CCcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 16 SPKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 16 ~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
+...+|+++.+ ++|+||+++|.|..|+-.+ ++. ...+.++ ..||..|.|-..+..+|+-|
T Consensus 577 ~~~Fs~~aTt~~G~iavgs~~G~IRLyd~~g---------------~~A---KT~lp~l-G~pI~~iDvt~DGkwilaTc 637 (794)
T PF08553_consen 577 KNNFSCFATTEDGYIAVGSNKGDIRLYDRLG---------------KRA---KTALPGL-GDPIIGIDVTADGKWILATC 637 (794)
T ss_pred CCCceEEEecCCceEEEEeCCCcEEeecccc---------------hhh---hhcCCCC-CCCeeEEEecCCCcEEEEee
Confidence 45788988876 7999999999999997221 111 1123344 57999999999999999999
Q ss_pred Cc-EEEEe
Q 003405 95 ES-IAFHR 101 (823)
Q Consensus 95 d~-l~~~~ 101 (823)
+. |.+++
T Consensus 638 ~tyLlLi~ 645 (794)
T PF08553_consen 638 KTYLLLID 645 (794)
T ss_pred cceEEEEE
Confidence 94 65554
No 260
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=72.74 E-value=21 Score=42.12 Aligned_cols=150 Identities=11% Similarity=0.127 Sum_probs=85.2
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEe-Cc-EEEEeC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLS-ES-IAFHRL 102 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~-d~-l~~~~L 102 (823)
.+.|+...+.|.|.+|+++..... ++...|.. +.+.|+.+..=+.. ++|++=+ || |+.|||
T Consensus 100 ~NlIAT~s~nG~i~vWdlnk~~rn---------------k~l~~f~E-H~Rs~~~ldfh~tep~iliSGSQDg~vK~~Dl 163 (839)
T KOG0269|consen 100 SNLIATCSTNGVISVWDLNKSIRN---------------KLLTVFNE-HERSANKLDFHSTEPNILISGSQDGTVKCWDL 163 (839)
T ss_pred hhhheeecCCCcEEEEecCccccc---------------hhhhHhhh-hccceeeeeeccCCccEEEecCCCceEEEEee
Confidence 467899999999999998864211 01112222 35678887766544 4555444 35 999999
Q ss_pred CCCcccccc-cCCCCcEEEEeeCCCceEEEEEc--CeEEEEEEcC-CCceeEeeeecCCCCceEEEecCCeEEEEEc---
Q 003405 103 PNLETIAVL-TKAKGANVYSWDDRRGFLCFARQ--KRVCIFRHDG-GRGFVEVKDFGVPDTVKSMSWCGENICIAIR--- 175 (823)
Q Consensus 103 ~~l~~~~~i-~~~kg~~~fa~~~~~~~l~V~~k--kki~l~~~~~-~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~--- 175 (823)
..-+...+. .....+..+...+..+..+++.. +-+++|.++. ++.+ .|-..-.+++.++.|..+..++|+.
T Consensus 164 R~~~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp~r~~--~k~~AH~GpV~c~nwhPnr~~lATGGRD 241 (839)
T KOG0269|consen 164 RSKKSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGYLQLWDLRQPDRCE--KKLTAHNGPVLCLNWHPNREWLATGGRD 241 (839)
T ss_pred ecccccccccccchhhhceeeccCCCceEEEecCCceEEEeeccCchhHH--HHhhcccCceEEEeecCCCceeeecCCC
Confidence 654433221 11112333444554455555433 3344555443 1111 1112335789999999776666655
Q ss_pred CceEEEEcCCCCeeeccC
Q 003405 176 KGYMILNATNGALSEVFP 193 (823)
Q Consensus 176 ~~y~lidl~~~~~~~L~~ 193 (823)
+...+.|+.+++..++..
T Consensus 242 K~vkiWd~t~~~~~~~~t 259 (839)
T KOG0269|consen 242 KMVKIWDMTDSRAKPKHT 259 (839)
T ss_pred ccEEEEeccCCCccceeE
Confidence 568889998876554443
No 261
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=72.57 E-value=1.1e+02 Score=30.87 Aligned_cols=176 Identities=12% Similarity=0.114 Sum_probs=84.6
Q ss_pred EEEEE-EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-Cc
Q 003405 19 IDAVA-SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-ES 96 (823)
Q Consensus 19 I~ci~-~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d~ 96 (823)
..|.. ..++++|+++.+|.|+.++.... +..+ ++. . ..++....++. .+.+++.+ |+
T Consensus 28 ~~~~~~~~~~~v~~~~~~~~l~~~d~~tG--------------~~~W----~~~-~-~~~~~~~~~~~-~~~v~v~~~~~ 86 (238)
T PF13360_consen 28 PVATAVPDGGRVYVASGDGNLYALDAKTG--------------KVLW----RFD-L-PGPISGAPVVD-GGRVYVGTSDG 86 (238)
T ss_dssp EEETEEEETTEEEEEETTSEEEEEETTTS--------------EEEE----EEE-C-SSCGGSGEEEE-TTEEEEEETTS
T ss_pred ccceEEEeCCEEEEEcCCCEEEEEECCCC--------------CEEE----Eee-c-cccccceeeec-cccccccccee
Confidence 34433 48899999999999999986421 1111 111 1 12222222333 34555554 44
Q ss_pred -EEEEeCCCCcccccc-----cC--CCCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeecCCC---------
Q 003405 97 -IAFHRLPNLETIAVL-----TK--AKGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFGVPD--------- 158 (823)
Q Consensus 97 -l~~~~L~~l~~~~~i-----~~--~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~~~~--------- 158 (823)
+..++..+-+..-+. +. .......++. ...++++. ...|..+....++. +.+..++.
T Consensus 87 ~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~g~l~~~d~~tG~~---~w~~~~~~~~~~~~~~~ 161 (238)
T PF13360_consen 87 SLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVD--GDRLYVGTSSGKLVALDPKTGKL---LWKYPVGEPRGSSPISS 161 (238)
T ss_dssp EEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEE--TTEEEEEETCSEEEEEETTTTEE---EEEEESSTT-SS--EEE
T ss_pred eeEecccCCcceeeeeccccccccccccccCceEe--cCEEEEEeccCcEEEEecCCCcE---EEEeecCCCCCCcceee
Confidence 777776554432221 11 1112223333 33455555 56665555543421 12222211
Q ss_pred ---CceEEEecCCeEEEEEcCc-eEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-Ee-CCeEEEEcC
Q 003405 159 ---TVKSMSWCGENICIAIRKG-YMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GK-ENIGVFVDQ 223 (823)
Q Consensus 159 ---~~~~l~~~~~~i~v~~~~~-y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~-~~~gvfv~~ 223 (823)
....+.+.++.++++...+ ..-+|+.+|+...-.+.+. +...+...++.+. +. ++..+.+|.
T Consensus 162 ~~~~~~~~~~~~~~v~~~~~~g~~~~~d~~tg~~~w~~~~~~---~~~~~~~~~~~l~~~~~~~~l~~~d~ 229 (238)
T PF13360_consen 162 FSDINGSPVISDGRVYVSSGDGRVVAVDLATGEKLWSKPISG---IYSLPSVDGGTLYVTSSDGRLYALDL 229 (238)
T ss_dssp ETTEEEEEECCTTEEEEECCTSSEEEEETTTTEEEEEECSS----ECECEECCCTEEEEEETTTEEEEEET
T ss_pred ecccccceEEECCEEEEEcCCCeEEEEECCCCCEEEEecCCC---ccCCceeeCCEEEEEeCCCEEEEEEC
Confidence 1234444456888888766 3334999998543222211 1221333444444 43 456666664
No 262
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=72.36 E-value=28 Score=42.65 Aligned_cols=139 Identities=12% Similarity=0.180 Sum_probs=85.4
Q ss_pred ccccccccCCCCcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEE
Q 003405 6 FDSLELISNCSPKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 6 f~~~~l~~~~~~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
|....++..-+..|.-++.- +..+.-|.-|+.|++|+.. +|+..+.+.++ ...|.-+..
T Consensus 119 wk~~~~l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~------------------tF~~~~vl~~H-~s~VKGvs~ 179 (942)
T KOG0973|consen 119 WKVVSILRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAK------------------TFELLKVLRGH-QSLVKGVSW 179 (942)
T ss_pred eeEEEEEecCCCccceeccCCCccEEEEecccceEEEEccc------------------cceeeeeeecc-cccccceEE
Confidence 34444444445566655433 6788999999999999843 45556655544 578999999
Q ss_pred ecccCceeeEeCc--EEEEeCCCCccccccc----CCCCcEEEE---eeCCCceEEEE--EcCe---EEEEEEcCCCcee
Q 003405 84 LASRQLLLSLSES--IAFHRLPNLETIAVLT----KAKGANVYS---WDDRRGFLCFA--RQKR---VCIFRHDGGRGFV 149 (823)
Q Consensus 84 ~~~~~~Ll~l~d~--l~~~~L~~l~~~~~i~----~~kg~~~fa---~~~~~~~l~V~--~kkk---i~l~~~~~~~~f~ 149 (823)
.|-+..+.+.+|. |++|+..++.-...+. ...+-+.|. .+++...|+++ +++. +.|++.+ .++
T Consensus 180 DP~Gky~ASqsdDrtikvwrt~dw~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n~~~~~~~IieR~---tWk 256 (942)
T KOG0973|consen 180 DPIGKYFASQSDDRTLKVWRTSDWGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVNGGKSTIAIIERG---TWK 256 (942)
T ss_pred CCccCeeeeecCCceEEEEEcccceeeEeeccchhhCCCcceeeecccCCCcCeecchhhccCCcceeEEEecC---Cce
Confidence 9999999999983 9999987754222221 223344443 36665566654 4444 4444443 354
Q ss_pred EeeeecCCC-CceEEEec
Q 003405 150 EVKDFGVPD-TVKSMSWC 166 (823)
Q Consensus 150 ~~kei~~~~-~~~~l~~~ 166 (823)
.-+.+.-++ ++.++.|.
T Consensus 257 ~~~~LvGH~~p~evvrFn 274 (942)
T KOG0973|consen 257 VDKDLVGHSAPVEVVRFN 274 (942)
T ss_pred eeeeeecCCCceEEEEeC
Confidence 444554444 44455553
No 263
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=70.60 E-value=13 Score=42.61 Aligned_cols=46 Identities=15% Similarity=0.031 Sum_probs=28.4
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccC
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTG 353 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~ 353 (823)
.-+.+.+..+++++|+.++.++...... ..-. ......+.+||++-
T Consensus 413 eL~~~yl~~~qi~eAi~lL~smnW~~~g--~~C~-~~L~~I~n~Ll~~p 458 (545)
T PF11768_consen 413 ELISQYLRCDQIEEAINLLLSMNWNTMG--EQCF-HCLSAIVNHLLRQP 458 (545)
T ss_pred HHHHHHHhcCCHHHHHHHHHhCCccccH--HHHH-HHHHHHHHHHhcCC
Confidence 3466899999999999999887532110 0111 22334567777653
No 264
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=70.33 E-value=50 Score=35.99 Aligned_cols=101 Identities=12% Similarity=0.189 Sum_probs=61.7
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEeCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHRLP 103 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~ 103 (823)
++-++=++-||.|.+|++..... ...+ .++++ ..-|+-|.--....+|++=+| | +++|+|.
T Consensus 270 ~~vfaScS~DgsIrIWDiRs~~~--------------~~~~--~~kAh-~sDVNVISWnr~~~lLasG~DdGt~~iwDLR 332 (440)
T KOG0302|consen 270 DGVFASCSCDGSIRIWDIRSGPK--------------KAAV--STKAH-NSDVNVISWNRREPLLASGGDDGTLSIWDLR 332 (440)
T ss_pred CceEEeeecCceEEEEEecCCCc--------------ccee--Eeecc-CCceeeEEccCCcceeeecCCCceEEEEEhh
Confidence 35566778899999999875321 1111 12333 446777776666664444344 5 9999998
Q ss_pred CCcccc---cccCCC-CcEEEEeeCCC-ceEEE-EEcCeEEEEEEc
Q 003405 104 NLETIA---VLTKAK-GANVYSWDDRR-GFLCF-ARQKRVCIFRHD 143 (823)
Q Consensus 104 ~l~~~~---~i~~~k-g~~~fa~~~~~-~~l~V-~~kkki~l~~~~ 143 (823)
.++.-. ....-| .++++..++.- +.+++ +....|.|+.+.
T Consensus 333 ~~~~~~pVA~fk~Hk~pItsieW~p~e~s~iaasg~D~QitiWDls 378 (440)
T KOG0302|consen 333 QFKSGQPVATFKYHKAPITSIEWHPHEDSVIAASGEDNQITIWDLS 378 (440)
T ss_pred hccCCCcceeEEeccCCeeEEEeccccCceEEeccCCCcEEEEEee
Confidence 876432 222222 57777777543 34444 477889888764
No 265
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=70.30 E-value=93 Score=37.22 Aligned_cols=169 Identities=15% Similarity=0.171 Sum_probs=94.9
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc----CceeeEeC-c-EEE
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR----QLLLSLSE-S-IAF 99 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~----~~Ll~l~d-~-l~~ 99 (823)
.-.|.+|.-.|.|.+++..-... +-.+ ..+.++|.++.=++.. .+|+++.. + +.+
T Consensus 79 ~lliAsaD~~GrIil~d~~~~s~------------------~~~l-~~~~~~~qdl~W~~~rd~Srd~LlaIh~ss~lvL 139 (1062)
T KOG1912|consen 79 QLLIASADISGRIILVDFVLASV------------------INWL-SHSNDSVQDLCWVPARDDSRDVLLAIHGSSTLVL 139 (1062)
T ss_pred ceeEEeccccCcEEEEEehhhhh------------------hhhh-cCCCcchhheeeeeccCcchheeEEecCCcEEEE
Confidence 34677888889998887442111 0011 2345788888888744 57888887 3 889
Q ss_pred EeCCCCcccccccCCCC-cEEEEeeCC-CceEEE-EEcCeEEEEEEcC-------CCceeEeee--------------ec
Q 003405 100 HRLPNLETIAVLTKAKG-ANVYSWDDR-RGFLCF-ARQKRVCIFRHDG-------GRGFVEVKD--------------FG 155 (823)
Q Consensus 100 ~~L~~l~~~~~i~~~kg-~~~fa~~~~-~~~l~V-~~kkki~l~~~~~-------~~~f~~~ke--------------i~ 155 (823)
|+-.+-+..=+-.-... ..+|.+|+= ...+|| +.++.+.+....+ +++|+...+ ..
T Consensus 140 wntdtG~k~Wk~~ys~~iLs~f~~DPfd~rh~~~l~s~g~vl~~~~l~~sep~~pgk~~qI~sd~Sdl~~lere~at~ns 219 (1062)
T KOG1912|consen 140 WNTDTGEKFWKYDYSHEILSCFRVDPFDSRHFCVLGSKGFVLSCKDLGLSEPDVPGKEFQITSDHSDLAHLERETATGNS 219 (1062)
T ss_pred EEccCCceeeccccCCcceeeeeeCCCCcceEEEEccCceEEEEeccCCCCCCCCceeEEEecCccchhhhhhhhhcccc
Confidence 97654332111111112 234666652 234555 4666676666532 123332222 11
Q ss_pred CCCCceEEEe---------c---CCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEc--cCCeEEEE
Q 003405 156 VPDTVKSMSW---------C---GENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSL--LSGELLLG 213 (823)
Q Consensus 156 ~~~~~~~l~~---------~---~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~--~~~EfLL~ 213 (823)
....+.+..| . .|.+++.+.++..++|++=.+.....|..+.+.|.+-.+ +..|+|.|
T Consensus 220 ~ts~~~sa~fity~a~faf~p~~rn~lfi~~prellv~dle~~~~l~vvpier~~akfv~vlP~~~rd~Lfc 291 (1062)
T KOG1912|consen 220 TTSTPASAYFITYCAQFAFSPHWRNILFITFPRELLVFDLEYECCLAVVPIERGGAKFVDVLPDPRRDALFC 291 (1062)
T ss_pred ccCCCcchhHHHHHHhhhcChhhhceEEEEeccceEEEcchhhceeEEEEeccCCcceeEeccCCCcceEEE
Confidence 1112222222 2 478999999999999998766666666655556665544 34466655
No 266
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=70.07 E-value=1.8e+02 Score=32.14 Aligned_cols=248 Identities=10% Similarity=0.004 Sum_probs=131.3
Q ss_pred CCEEEEEeCC-----CcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee-Ee-----
Q 003405 26 GLKILLGCSD-----GSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS-LS----- 94 (823)
Q Consensus 26 ~~~L~vGT~~-----G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~-l~----- 94 (823)
..++||-... |.+++++... .++...+. .++.|-- + +-|+...+.+ .+
T Consensus 12 ~~~v~V~d~~~~~~~~~v~ViD~~~------------------~~v~g~i~-~G~~P~~-~-~spDg~~lyva~~~~~R~ 70 (352)
T TIGR02658 12 ARRVYVLDPGHFAATTQVYTIDGEA------------------GRVLGMTD-GGFLPNP-V-VASDGSFFAHASTVYSRI 70 (352)
T ss_pred CCEEEEECCcccccCceEEEEECCC------------------CEEEEEEE-ccCCCce-e-ECCCCCEEEEEecccccc
Confidence 3578887665 8888887543 22233332 2233333 3 6666655444 45
Q ss_pred ----C--cEEEEeCCCCcccccccC--------CCCcEEEEeeCCCceEEEEE---cCeEEEEEEcCCCceeEeeeecCC
Q 003405 95 ----E--SIAFHRLPNLETIAVLTK--------AKGANVYSWDDRRGFLCFAR---QKRVCIFRHDGGRGFVEVKDFGVP 157 (823)
Q Consensus 95 ----d--~l~~~~L~~l~~~~~i~~--------~kg~~~fa~~~~~~~l~V~~---kkki~l~~~~~~~~f~~~kei~~~ 157 (823)
+ .|.+|+..+++.+..++- ...-..|+++++...+.|+. ...+.++....+ +.++|+.+|
T Consensus 71 ~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~---kvv~ei~vp 147 (352)
T TIGR02658 71 ARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGK---AFVRMMDVP 147 (352)
T ss_pred ccCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCC---cEEEEEeCC
Confidence 3 299999999887755432 22334788888877788773 567878877733 456788888
Q ss_pred CCceEEEecCC-eEEEEEcCceEEEEcC-CCCe----eeccCCCC---CCCCEEEEccCCeEEEEeCCeEEEEcCCCccc
Q 003405 158 DTVKSMSWCGE-NICIAIRKGYMILNAT-NGAL----SEVFPSGR---IGPPLVVSLLSGELLLGKENIGVFVDQNGKLL 228 (823)
Q Consensus 158 ~~~~~l~~~~~-~i~v~~~~~y~lidl~-~~~~----~~L~~~~~---~~~p~i~~~~~~EfLL~~~~~gvfv~~~G~~~ 228 (823)
+...-.....+ ....+-......+.+. +|+. ..+|...+ ..+|...+.++..+.+.+++....+|..|..+
T Consensus 148 ~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~eG~V~~id~~~~~~ 227 (352)
T TIGR02658 148 DCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPKIKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYTGKIFQIDLSSGDA 227 (352)
T ss_pred CCcEEEEecCCccEEEeecCceEEEEecCCCceEEeeeeeecCCccccccCCceEcCCCcEEEEecCCeEEEEecCCCcc
Confidence 86655555432 2222222222222222 2221 12222111 02453333344566677776666677555433
Q ss_pred cCCceeecC-------------CCcEEEEe--CCEEEEEe-C----------CeEEEEEccCCCceeEEEeeCCcc---c
Q 003405 229 QADRICWSE-------------APIAVIIQ--KPYAIALL-P----------RRVEVRSLRVPYALIQTIVLQNVR---H 279 (823)
Q Consensus 229 ~~~~i~w~~-------------~P~~v~~~--~PYll~~~-~----------~~ieV~~l~~~~~lvQ~i~l~~~~---~ 279 (823)
. ....|+. -++-++++ .-++++.. . +.|.|.+.. ++..+-+|...... .
T Consensus 228 ~-~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~-t~kvi~~i~vG~~~~~ia 305 (352)
T TIGR02658 228 K-FLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAK-TGKRLRKIELGHEIDSIN 305 (352)
T ss_pred e-ecceeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECC-CCeEEEEEeCCCceeeEE
Confidence 1 1222321 11226665 34666532 1 468888886 57888888765421 2
Q ss_pred ccccCC-eEEEec--cceEEEee
Q 003405 280 LIPSSN-AVVVAL--ENSIFGLF 299 (823)
Q Consensus 280 l~~~~~-~v~v~s--~~~I~~l~ 299 (823)
+.+.++ .+|+++ ++.|..+.
T Consensus 306 vS~Dgkp~lyvtn~~s~~VsViD 328 (352)
T TIGR02658 306 VSQDAKPLLYALSTGDKTLYIFD 328 (352)
T ss_pred ECCCCCeEEEEeCCCCCcEEEEE
Confidence 334455 555554 34455444
No 267
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=70.04 E-value=4.9 Score=33.22 Aligned_cols=57 Identities=19% Similarity=0.310 Sum_probs=37.7
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhh--hhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLR--AAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~--~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.+...|+|++|+..++.....-.... ...........|..+...|+|++|+++|.++
T Consensus 14 ~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~a 72 (78)
T PF13424_consen 14 VYRELGRYDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKA 72 (78)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHcCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 44678999999888876421100011 1123455667899999999999999999874
No 268
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=69.62 E-value=25 Score=37.30 Aligned_cols=73 Identities=19% Similarity=0.167 Sum_probs=43.0
Q ss_pred HHHHHHHHhhhcCCCC-hHHH----hccCCCC-chhhHHHHHhhccccHHHHHHHHHHHhC-CC-chhHHHHHHHHhcCC
Q 003405 702 TRKKLLSALESISGYN-PEVL----LKRLPAD-ALYEERAILLGKMNQHELALSLYVHKVF-LI-NQPVFLLIRRMAMDI 773 (823)
Q Consensus 702 ~r~kLl~fL~~s~~Yd-~~~~----L~~~~~~-~l~~e~~~Ll~klg~h~~AL~ilv~~L~-D~-~~a~~~~l~~~y~~~ 773 (823)
.+..|..+|-....++ +..+ ....+.+ .+....+..|.++|+.++||..+-.-+. ++ ++.+...+...+...
T Consensus 182 ~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~~~a~~l~~~ 261 (280)
T PF13429_consen 182 ARNALAWLLIDMGDYDEAREALKRLLKAAPDDPDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLLAYADALEQA 261 (280)
T ss_dssp HHHHHHHHHCTTCHHHHHHHHHHHHHHH-HTSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHT--
T ss_pred HHHHHHHHHHHCCChHHHHHHHHHHHHHCcCHHHHHHHHHHHhccccccccccccccccccccccccccccccccccccc
Confidence 3455555555444444 1222 3333333 5888999999999999999999987665 33 566666666666554
Q ss_pred C
Q 003405 774 K 774 (823)
Q Consensus 774 ~ 774 (823)
+
T Consensus 262 g 262 (280)
T PF13429_consen 262 G 262 (280)
T ss_dssp -
T ss_pred c
Confidence 3
No 269
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=69.26 E-value=1.9e+02 Score=32.00 Aligned_cols=107 Identities=16% Similarity=0.255 Sum_probs=68.4
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEe-Cc-EEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLS-ES-IAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~-d~-l~~~~L~ 103 (823)
+-|.=|++|.++.+|.+.++.... ..++.+....++ .++|--+.-=|.. |+|++-+ |+ |.+|+..
T Consensus 95 ~vIASgSeD~~v~vW~IPe~~l~~-----------~ltepvv~L~gH-~rrVg~V~wHPtA~NVLlsag~Dn~v~iWnv~ 162 (472)
T KOG0303|consen 95 CVIASGSEDTKVMVWQIPENGLTR-----------DLTEPVVELYGH-QRRVGLVQWHPTAPNVLLSAGSDNTVSIWNVG 162 (472)
T ss_pred ceeecCCCCceEEEEECCCccccc-----------CcccceEEEeec-ceeEEEEeecccchhhHhhccCCceEEEEecc
Confidence 467889999999999988654321 111112222333 4566666655533 7777665 44 9999998
Q ss_pred CCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCC
Q 003405 104 NLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGG 145 (823)
Q Consensus 104 ~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~ 145 (823)
+-+...++..-.-|.+++.+.+.+.+|-+ ..|||.|+.-..+
T Consensus 163 tgeali~l~hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~~ 205 (472)
T KOG0303|consen 163 TGEALITLDHPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRRG 205 (472)
T ss_pred CCceeeecCCCCeEEEEEeccCCceeeeecccceeEEEcCCCC
Confidence 76655444433345667778777788877 5578988876644
No 270
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=69.15 E-value=7.8 Score=25.72 Aligned_cols=28 Identities=36% Similarity=0.729 Sum_probs=23.7
Q ss_pred HHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCC
Q 003405 344 RFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPS 378 (823)
Q Consensus 344 ~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~ 378 (823)
..|..++..|++++|...|.+ |+..||+
T Consensus 5 ~~a~~~~~~g~~~~A~~~~~~-------~~~~~P~ 32 (33)
T PF13174_consen 5 RLARCYYKLGDYDEAIEYFQR-------LIKRYPD 32 (33)
T ss_dssp HHHHHHHHHCHHHHHHHHHHH-------HHHHSTT
T ss_pred HHHHHHHHccCHHHHHHHHHH-------HHHHCcC
Confidence 458888889999999999986 7788885
No 271
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=68.85 E-value=18 Score=41.91 Aligned_cols=72 Identities=13% Similarity=0.353 Sum_probs=52.9
Q ss_pred CcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCC-Cce-EEEec--CCeEEEEEcC-ceEEEEcCCCCee
Q 003405 116 GANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPD-TVK-SMSWC--GENICIAIRK-GYMILNATNGALS 189 (823)
Q Consensus 116 g~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~-~~~-~l~~~--~~~i~v~~~~-~y~lidl~~~~~~ 189 (823)
++..+-+++....||.+ .++++.+++.. |+++..+++|+ +++ +++|+ |..|.||.++ ...+.|..+|...
T Consensus 22 ~i~~~ewnP~~dLiA~~t~~gelli~R~n----~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kdG~I~L~Dve~~~~l 97 (665)
T KOG4640|consen 22 NIKRIEWNPKMDLIATRTEKGELLIHRLN----WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKDGTIRLHDVEKGGRL 97 (665)
T ss_pred ceEEEEEcCccchhheeccCCcEEEEEec----cceeEeccCCCCccceeeeecCCCCEEEEEecCCeEEEEEccCCCce
Confidence 34456667766778877 55668888887 56677888775 455 89998 7789999995 5789999987643
Q ss_pred ec
Q 003405 190 EV 191 (823)
Q Consensus 190 ~L 191 (823)
.=
T Consensus 98 ~~ 99 (665)
T KOG4640|consen 98 VS 99 (665)
T ss_pred ec
Confidence 33
No 272
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=68.76 E-value=38 Score=35.21 Aligned_cols=140 Identities=16% Similarity=0.214 Sum_probs=78.4
Q ss_pred cEEEEE---EeCC--EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCcee-
Q 003405 18 KIDAVA---SYGL--KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLL- 91 (823)
Q Consensus 18 ~I~ci~---~~~~--~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll- 91 (823)
.+.|.. .++. .+..|.++|.+.+|++.....- -++ +..++++.... -++.||..+.+.+..+.=+
T Consensus 152 svmc~~~~~~c~s~~lllaGyEsghvv~wd~S~~~~~-------~~~-~~~~kv~~~~a-sh~qpvlsldyas~~~rGis 222 (323)
T KOG0322|consen 152 SVMCQDKDHACGSTFLLLAGYESGHVVIWDLSTGDKI-------IQL-PQSSKVESPNA-SHKQPVLSLDYASSCDRGIS 222 (323)
T ss_pred ceeeeeccccccceEEEEEeccCCeEEEEEccCCcee-------ecc-ccccccccchh-hccCcceeeeechhhcCCcC
Confidence 466765 1122 5789999999999998864210 000 01112222222 2467899998887544332
Q ss_pred -eEeCcEEEEeCCCC-cc--ccc--ccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCCCCceEEE
Q 003405 92 -SLSESIAFHRLPNL-ET--IAV--LTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMS 164 (823)
Q Consensus 92 -~l~d~l~~~~L~~l-~~--~~~--i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~ 164 (823)
..+|.+.+|.+..- .. +.. --+..|++-+.+-++...++-| =..+|.+|.|+..+-...+|-- .+.+.+++
T Consensus 223 gga~dkl~~~Sl~~s~gslq~~~e~~lknpGv~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyH--sagvn~vA 300 (323)
T KOG0322|consen 223 GGADDKLVMYSLNHSTGSLQIRKEITLKNPGVSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYH--SAGVNAVA 300 (323)
T ss_pred CCccccceeeeeccccCcccccceEEecCCCccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhh--hcceeEEE
Confidence 22234777777532 11 111 1133466767777666555554 6789999999843322222211 36788888
Q ss_pred ecCC
Q 003405 165 WCGE 168 (823)
Q Consensus 165 ~~~~ 168 (823)
|..+
T Consensus 301 fspd 304 (323)
T KOG0322|consen 301 FSPD 304 (323)
T ss_pred eCCC
Confidence 8754
No 273
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=68.44 E-value=51 Score=40.37 Aligned_cols=140 Identities=16% Similarity=0.171 Sum_probs=78.9
Q ss_pred CCcE--EEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKI--DAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I--~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
|..| +|+.....+|++|+.||.++.|.++..+.-.+ ..+...|+.++..+ ++|. ......++++
T Consensus 583 PRSIl~~~~e~d~~yLlvalgdG~l~~fv~d~~tg~ls-d~Kk~~lGt~P~~L-r~f~------------sk~~t~vfa~ 648 (1096)
T KOG1897|consen 583 PRSILLTTFEGDIHYLLVALGDGALLYFVLDINTGQLS-DRKKVTLGTQPISL-RTFS------------SKSRTAVFAL 648 (1096)
T ss_pred chheeeEEeeccceEEEEEcCCceEEEEEEEcccceEc-cccccccCCCCcEE-EEEe------------eCCceEEEEe
Confidence 4444 45666678999999999999998876543211 00112344433333 1211 1233567888
Q ss_pred eCc-EEEEeC-CCC--cccccccCCCCcEEEEe-eC--CCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec
Q 003405 94 SES-IAFHRL-PNL--ETIAVLTKAKGANVYSW-DD--RRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 94 ~d~-l~~~~L-~~l--~~~~~i~~~kg~~~fa~-~~--~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~ 166 (823)
+|. ..+|.- ..| .+. ..|-+..+|- +. -+..++.+.+..+.+..++.=.+. ..+.+++.+.|+.+++.
T Consensus 649 sdrP~viY~~n~kLv~spl----s~kev~~~c~f~s~a~~d~l~~~~~~~l~i~tid~iqkl-~irtvpl~~~prrI~~q 723 (1096)
T KOG1897|consen 649 SDRPTVIYSSNGKLVYSPL----SLKEVNHMCPFNSDAYPDSLASANGGALTIGTIDEIQKL-HIRTVPLGESPRRICYQ 723 (1096)
T ss_pred CCCCEEEEecCCcEEEecc----chHHhhhhcccccccCCceEEEecCCceEEEEecchhhc-ceeeecCCCChhheEec
Confidence 885 333322 222 111 1222222222 22 224688889999999999841111 23448888999999998
Q ss_pred CCeEEEEE
Q 003405 167 GENICIAI 174 (823)
Q Consensus 167 ~~~i~v~~ 174 (823)
..+++++.
T Consensus 724 ~~sl~~~v 731 (1096)
T KOG1897|consen 724 ESSLTFGV 731 (1096)
T ss_pred ccceEEEE
Confidence 76666653
No 274
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=68.01 E-value=1.6e+02 Score=31.59 Aligned_cols=154 Identities=14% Similarity=0.160 Sum_probs=90.0
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCeeEEEEecccCceeeEeC-c--
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPILSMEVLASRQLLLSLSE-S-- 96 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~qI~~~~~~~~Ll~l~d-~-- 96 (823)
|+...|.+||.|.+.. |.+|++.+..-.- ..|..... -.++ +.-|.++..-|....++.+.. +
T Consensus 165 ~Fs~DGeqlfaGykrc-irvFdt~RpGr~c-----------~vy~t~~~-~k~gq~giisc~a~sP~~~~~~a~gsY~q~ 231 (406)
T KOG2919|consen 165 QFSPDGEQLFAGYKRC-IRVFDTSRPGRDC-----------PVYTTVTK-GKFGQKGIISCFAFSPMDSKTLAVGSYGQR 231 (406)
T ss_pred EecCCCCeEeecccce-EEEeeccCCCCCC-----------cchhhhhc-ccccccceeeeeeccCCCCcceeeecccce
Confidence 4555578999999887 7999875432110 01111100 0133 334666777776655666665 2
Q ss_pred EEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeee--ecCCCCceEEEec----CCe
Q 003405 97 IAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKD--FGVPDTVKSMSWC----GEN 169 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~ke--i~~~~~~~~l~~~----~~~ 169 (823)
+-+|.-..-.|...+..- .||+-.+..++..+++++.+|.=.|..|+.+..-..+.+ -.+.++=+-|-|. |+.
T Consensus 232 ~giy~~~~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~~~pv~~L~rhv~~TNQRI~FDld~~~~~ 311 (406)
T KOG2919|consen 232 VGIYNDDGRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYSRDPVYALERHVGDTNQRILFDLDPKGEI 311 (406)
T ss_pred eeeEecCCCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhccchhhhhhhhccCccceEEEecCCCCce
Confidence 667766666666554433 489999999888899999887666666653210011111 1223344555554 556
Q ss_pred EEEE-EcCceEEEEcCC-CC
Q 003405 170 ICIA-IRKGYMILNATN-GA 187 (823)
Q Consensus 170 i~v~-~~~~y~lidl~~-~~ 187 (823)
+.=| ++....+.|+++ |.
T Consensus 312 LasG~tdG~V~vwdlk~~gn 331 (406)
T KOG2919|consen 312 LASGDTDGSVRVWDLKDLGN 331 (406)
T ss_pred eeccCCCccEEEEecCCCCC
Confidence 6666 345567888887 55
No 275
>PRK04922 tolB translocation protein TolB; Provisional
Probab=67.50 E-value=2.2e+02 Score=32.26 Aligned_cols=142 Identities=17% Similarity=0.256 Sum_probs=74.5
Q ss_pred EEEEecccCceee-EeC-c---EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEEcC--eEEEEEEcC-CCceeE
Q 003405 80 SMEVLASRQLLLS-LSE-S---IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFARQK--RVCIFRHDG-GRGFVE 150 (823)
Q Consensus 80 qI~~~~~~~~Ll~-l~d-~---l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~kk--ki~l~~~~~-~~~f~~ 150 (823)
.....|+.+.+++ .+. + |.++++.+-+ ...+....+ ...++++++...++++..+ ...||.++. +...+.
T Consensus 252 ~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~-~~~lt~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~ 330 (433)
T PRK04922 252 APSFSPDGRRLALTLSRDGNPEIYVMDLGSRQ-LTRLTNHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAER 330 (433)
T ss_pred CceECCCCCEEEEEEeCCCCceEEEEECCCCC-eEECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEE
Confidence 4566677665544 332 3 8888876533 222322222 2346677776667666543 345676652 212222
Q ss_pred eeeecC-CCCceEEEec--CCeEEEEEcC----ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-EeC----CeE
Q 003405 151 VKDFGV-PDTVKSMSWC--GENICIAIRK----GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GKE----NIG 218 (823)
Q Consensus 151 ~kei~~-~~~~~~l~~~--~~~i~v~~~~----~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~~----~~g 218 (823)
+.. .....+.+|. |+.|++.... ...++|+.+++.+.+...+....| ...+++.+++ ..+ ...
T Consensus 331 ---lt~~g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~~~~~p--~~spdG~~i~~~s~~~g~~~L 405 (433)
T PRK04922 331 ---LTFQGNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGSLDESP--SFAPNGSMVLYATREGGRGVL 405 (433)
T ss_pred ---eecCCCCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCCCCCCc--eECCCCCEEEEEEecCCceEE
Confidence 222 1233456776 6778877643 256789888887766543222223 3456676655 332 123
Q ss_pred EEEcCCCcc
Q 003405 219 VFVDQNGKL 227 (823)
Q Consensus 219 vfv~~~G~~ 227 (823)
+.++.+|..
T Consensus 406 ~~~~~~g~~ 414 (433)
T PRK04922 406 AAVSTDGRV 414 (433)
T ss_pred EEEECCCCc
Confidence 445667753
No 276
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=67.42 E-value=11 Score=41.26 Aligned_cols=61 Identities=23% Similarity=0.274 Sum_probs=44.4
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|+|+++- |+++.+||.+|.|.+|+... .+...-++..+..-|+.+...|+.+.+..++
T Consensus 282 ~siSsl~VS~dGkf~AlGT~dGsVai~~~~~------------------lq~~~~vk~aH~~~VT~ltF~Pdsr~~~svS 343 (398)
T KOG0771|consen 282 KSISSLAVSDDGKFLALGTMDGSVAIYDAKS------------------LQRLQYVKEAHLGFVTGLTFSPDSRYLASVS 343 (398)
T ss_pred CcceeEEEcCCCcEEEEeccCCcEEEEEece------------------eeeeEeehhhheeeeeeEEEcCCcCcccccc
Confidence 468987765 57999999999999998543 2222223333445899999999988888776
Q ss_pred C
Q 003405 95 E 95 (823)
Q Consensus 95 d 95 (823)
-
T Consensus 344 s 344 (398)
T KOG0771|consen 344 S 344 (398)
T ss_pred c
Confidence 4
No 277
>KOG2063 consensus Vacuolar assembly/sorting proteins VPS39/VAM6/VPS3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=66.40 E-value=7.5 Score=47.47 Aligned_cols=69 Identities=20% Similarity=0.157 Sum_probs=56.7
Q ss_pred hhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcC-CCChHHHhccCCCCchhhHHHHHhhccccHH
Q 003405 667 GNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESIS-GYNPEVLLKRLPADALYEERAILLGKMNQHE 745 (823)
Q Consensus 667 ~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~-~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~ 745 (823)
..+-++|...|+.. .+.-...+|+..+ +-+++.+-..+...+.+.+.++||..-|+|+
T Consensus 463 ~~IDttLlk~Yl~~---------------------n~~~v~~llrlen~~c~vee~e~~L~k~~~y~~Li~LY~~kg~h~ 521 (877)
T KOG2063|consen 463 ELIDTTLLKCYLET---------------------NPGLVGPLLRLENNHCDVEEIETVLKKSKKYRELIELYATKGMHE 521 (877)
T ss_pred HHHHHHHHHHHHhc---------------------CchhhhhhhhccCCCcchHHHHHHHHhcccHHHHHHHHHhccchH
Confidence 35677888888876 2466677888866 7888888888888889999999999999999
Q ss_pred HHHHHHHHHhC
Q 003405 746 LALSLYVHKVF 756 (823)
Q Consensus 746 ~AL~ilv~~L~ 756 (823)
+||+++...-.
T Consensus 522 ~AL~ll~~l~d 532 (877)
T KOG2063|consen 522 KALQLLRDLVD 532 (877)
T ss_pred HHHHHHHHHhc
Confidence 99999987544
No 278
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=66.36 E-value=2.1e+02 Score=31.52 Aligned_cols=148 Identities=14% Similarity=0.179 Sum_probs=81.4
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-- 95 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-- 95 (823)
.|-...-.+++|..++++|.+.++........++ .+.. .. +..++..+.-.+...-+++-++
T Consensus 107 ~I~gl~~~dg~Litc~~sG~l~~~~~k~~d~hss-------------~l~~-la--~g~g~~~~r~~~~~p~Iva~GGke 170 (412)
T KOG3881|consen 107 SIKGLKLADGTLITCVSSGNLQVRHDKSGDLHSS-------------KLIK-LA--TGPGLYDVRQTDTDPYIVATGGKE 170 (412)
T ss_pred cccchhhcCCEEEEEecCCcEEEEeccCCccccc-------------ccee-ee--cCCceeeeccCCCCCceEecCchh
Confidence 5666667788999999999999998664332211 1110 00 1234455555555545555444
Q ss_pred c---EEEEeCCCCcccccccCCCCcE-----E----------EEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC
Q 003405 96 S---IAFHRLPNLETIAVLTKAKGAN-----V----------YSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV 156 (823)
Q Consensus 96 ~---l~~~~L~~l~~~~~i~~~kg~~-----~----------fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~ 156 (823)
. +.+|+|..-+++ -..|++- . |.-+.....++-+ .-..+.+|.-...| +.+..|.+
T Consensus 171 ~~n~lkiwdle~~~qi---w~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~~qR--RPV~~fd~ 245 (412)
T KOG3881|consen 171 NINELKIWDLEQSKQI---WSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYHQVRLYDTRHQR--RPVAQFDF 245 (412)
T ss_pred cccceeeeecccceee---eeccCCCCccccceeeeeeccceecCCCCCceEEEEecceeEEEecCcccC--cceeEecc
Confidence 1 788988654433 2223221 0 1111112234433 33568888776332 44555555
Q ss_pred CC-CceEEEec--CCeEEEEEc-CceEEEEcCCC
Q 003405 157 PD-TVKSMSWC--GENICIAIR-KGYMILNATNG 186 (823)
Q Consensus 157 ~~-~~~~l~~~--~~~i~v~~~-~~y~lidl~~~ 186 (823)
.+ ++.+++.. |++|++|+. .+...+|+.++
T Consensus 246 ~E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~ 279 (412)
T KOG3881|consen 246 LENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGG 279 (412)
T ss_pred ccCcceeeeecCCCcEEEEecccchhheecccCc
Confidence 43 55566655 788999987 45667776654
No 279
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=66.26 E-value=15 Score=38.08 Aligned_cols=69 Identities=19% Similarity=0.312 Sum_probs=52.2
Q ss_pred ChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 302 PLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 302 ~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
-+-.+....++.|.|++|...++.+... .+... ..++.....|+..+..++|++|...+.+ .|.+||.-
T Consensus 36 ~LY~~g~~~L~~gn~~~A~~~fe~l~~~-~p~s~-~~~qa~l~l~yA~Yk~~~y~~A~~~~dr-------Fi~lyP~~ 104 (254)
T COG4105 36 ELYNEGLTELQKGNYEEAIKYFEALDSR-HPFSP-YSEQAQLDLAYAYYKNGEYDLALAYIDR-------FIRLYPTH 104 (254)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHc-CCCCc-ccHHHHHHHHHHHHhcccHHHHHHHHHH-------HHHhCCCC
Confidence 3667888899999999999999886311 11111 3366777789999999999999988664 66788776
No 280
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=65.99 E-value=16 Score=25.37 Aligned_cols=31 Identities=23% Similarity=0.396 Sum_probs=25.0
Q ss_pred ccCCCCcEEEEEEeC--CEEEEEeCCCcEEEEc
Q 003405 12 ISNCSPKIDAVASYG--LKILLGCSDGSLKIYS 42 (823)
Q Consensus 12 ~~~~~~~I~ci~~~~--~~L~vGT~~G~l~~y~ 42 (823)
+..-...|+|++... +.|+.|+.||.|.+|+
T Consensus 7 ~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 7 FRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 344456899988875 6999999999999985
No 281
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=65.79 E-value=1.6e+02 Score=33.20 Aligned_cols=177 Identities=8% Similarity=0.048 Sum_probs=96.1
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcc-c---------c---cccCChHHHHHHhhcCCC---CChhhHHHh
Q 003405 549 YTALLELYKSNARHREALKLLHELVEESKSNQSQD-E---------H---TQKFNPESIIEYLKPLCG---TDPMLVLEF 612 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~-~---------~---~~~~~~~~~i~yL~~L~~---~~~~li~~y 612 (823)
+..++.+|...|++++|++++.++......++... . + ....+.+...++...+.. .+..+...+
T Consensus 190 l~ll~~~~~~~~d~~~a~~~l~~l~k~~~~~~~~~~~l~~~a~~~~l~~~~~~~~~~~L~~~~~~~p~~~~~~~~l~~~~ 269 (409)
T TIGR00540 190 LKLAEEAYIRSGAWQALDDIIDNMAKAGLFDDEEFADLEQKAEIGLLDEAMADEGIDGLLNWWKNQPRHRRHNIALKIAL 269 (409)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHCCHHHhCCHHHHHHH
Confidence 44678999999999999999888875432211000 0 0 000011122233333321 255666677
Q ss_pred hhhhhhcC-cccccccccc--CCCChHH-----HHH---HHhhcCchhHHHHHHHHhhcccCCCCh--hHHHHHHHHHHH
Q 003405 613 SMLVLESC-PTQTIELFLS--GNIPADL-----VNS---YLKQYSPSMQGRYLELMLAMNENSISG--NLQNEMVQIYLS 679 (823)
Q Consensus 613 ~~wll~~~-p~~~~~if~~--~~l~~~~-----Vl~---~L~~~~~~~~~~YLE~li~~~~~~~~~--~~h~~L~~lYl~ 679 (823)
+..+++.+ ++.|.+++.+ +..|.+. .+. .+...++......+|..... ...++ .++..|+.+|..
T Consensus 270 a~~l~~~g~~~~A~~~l~~~l~~~pd~~~~~~~~l~~~~~l~~~~~~~~~~~~e~~lk~--~p~~~~~~ll~sLg~l~~~ 347 (409)
T TIGR00540 270 AEHLIDCDDHDSAQEIIFDGLKKLGDDRAISLPLCLPIPRLKPEDNEKLEKLIEKQAKN--VDDKPKCCINRALGQLLMK 347 (409)
T ss_pred HHHHHHCCChHHHHHHHHHHHhhCCCcccchhHHHHHhhhcCCCChHHHHHHHHHHHHh--CCCChhHHHHHHHHHHHHH
Confidence 77666643 4445555544 1111111 222 22223345567777766553 23567 888889988876
Q ss_pred HHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHH
Q 003405 680 EVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVH 753 (823)
Q Consensus 680 ~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~ 753 (823)
. + ...+-..+|+.- .+++.-|+..-....+-++-++|++++|.+++=.
T Consensus 348 ~----------------~----~~~~A~~~le~a------~a~~~~p~~~~~~~La~ll~~~g~~~~A~~~~~~ 395 (409)
T TIGR00540 348 H----------------G----EFIEAADAFKNV------AACKEQLDANDLAMAADAFDQAGDKAEAAAMRQD 395 (409)
T ss_pred c----------------c----cHHHHHHHHHHh------HHhhcCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 4 1 134444455521 1222233333344778899999999999887764
No 282
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=65.56 E-value=72 Score=37.22 Aligned_cols=148 Identities=13% Similarity=0.222 Sum_probs=82.9
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C-cEEEEeCC
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E-SIAFHRLP 103 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~~~L~ 103 (823)
++.||-|..||.|..|++..+.+..+ ..|. .+++.+ ..=|+.|.+.-..+-|++++ | .|++|+-.
T Consensus 37 ~ryLfTgGRDg~i~~W~~~~d~~~~s----------~~~~--asme~H-sDWVNDiiL~~~~~tlIS~SsDtTVK~W~~~ 103 (735)
T KOG0308|consen 37 GRYLFTGGRDGIIRLWSVTQDSNEPS----------TPYI--ASMEHH-SDWVNDIILCGNGKTLISASSDTTVKVWNAH 103 (735)
T ss_pred CceEEecCCCceEEEeccccccCCcc----------cchh--hhhhhh-HhHHhhHHhhcCCCceEEecCCceEEEeecc
Confidence 34699999999999999887654211 1121 122222 34577787777777888886 4 49999864
Q ss_pred CCc-c-cccccCCC-CcEEEEeeCCCceEEE--EEcCeEEEEEEcCCC-----ceeEeeeecCC----CCceEEEecCC-
Q 003405 104 NLE-T-IAVLTKAK-GANVYSWDDRRGFLCF--ARQKRVCIFRHDGGR-----GFVEVKDFGVP----DTVKSMSWCGE- 168 (823)
Q Consensus 104 ~l~-~-~~~i~~~k-g~~~fa~~~~~~~l~V--~~kkki~l~~~~~~~-----~f~~~kei~~~----~~~~~l~~~~~- 168 (823)
..- . +..+..-+ =|+++++-.....+++ |..++|.+|.+..+. .|..+..-+++ +.+-+++...+
T Consensus 104 ~~~~~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~ 183 (735)
T KOG0308|consen 104 KDNTFCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTG 183 (735)
T ss_pred cCcchhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcc
Confidence 331 1 11222222 2455555222233444 378999999887431 13222221222 56667766643
Q ss_pred eEEEEEc--CceEEEEcCCC
Q 003405 169 NICIAIR--KGYMILNATNG 186 (823)
Q Consensus 169 ~i~v~~~--~~y~lidl~~~ 186 (823)
+++|+-. +...+.|..++
T Consensus 184 t~ivsGgtek~lr~wDprt~ 203 (735)
T KOG0308|consen 184 TIIVSGGTEKDLRLWDPRTC 203 (735)
T ss_pred eEEEecCcccceEEeccccc
Confidence 3454433 45666777665
No 283
>PRK03629 tolB translocation protein TolB; Provisional
Probab=65.28 E-value=2.5e+02 Score=31.94 Aligned_cols=144 Identities=8% Similarity=0.061 Sum_probs=78.0
Q ss_pred eeEEEEecccCceeeEeC--c---EEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEEEcC--eEEEEEEcCC-Cce
Q 003405 78 ILSMEVLASRQLLLSLSE--S---IAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFARQK--RVCIFRHDGG-RGF 148 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~l~d--~---l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~kk--ki~l~~~~~~-~~f 148 (823)
+......|+...|+..++ + |.++++.+-+. ..+... ..+..++++++...|+.+..+ ...||.++.+ ...
T Consensus 245 ~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~-~~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~ 323 (429)
T PRK03629 245 NGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQI-RQVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGAP 323 (429)
T ss_pred cCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCE-EEccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCCe
Confidence 445677787776666543 3 78888765332 122222 244567777776667665443 3567766532 122
Q ss_pred eEeeeecC-CCCceEEEec--CCeEEEEEcC----ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEE-eCC----
Q 003405 149 VEVKDFGV-PDTVKSMSWC--GENICIAIRK----GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLG-KEN---- 216 (823)
Q Consensus 149 ~~~kei~~-~~~~~~l~~~--~~~i~v~~~~----~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~-~~~---- 216 (823)
+ .+.. .....+..|. |+.|+++... ...++|+.++....|........|. ..+++..++. ..+
T Consensus 324 ~---~lt~~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~~~~~~p~--~SpDG~~i~~~s~~~~~~ 398 (429)
T PRK03629 324 Q---RITWEGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTDTFLDETPS--IAPNGTMVIYSSSQGMGS 398 (429)
T ss_pred E---EeecCCCCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCCCCCCCCce--ECCCCCEEEEEEcCCCce
Confidence 2 2221 2233455665 5677776542 2456899888877665432222333 4477877763 321
Q ss_pred eEEEEcCCCcc
Q 003405 217 IGVFVDQNGKL 227 (823)
Q Consensus 217 ~gvfv~~~G~~ 227 (823)
....++.+|+.
T Consensus 399 ~l~~~~~~G~~ 409 (429)
T PRK03629 399 VLNLVSTDGRF 409 (429)
T ss_pred EEEEEECCCCC
Confidence 12345777764
No 284
>KOG1126 consensus DNA-binding cell division cycle control protein [Cell cycle control, cell division, chromosome partitioning]
Probab=64.91 E-value=25 Score=40.97 Aligned_cols=63 Identities=25% Similarity=0.116 Sum_probs=38.6
Q ss_pred HHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccc
Q 003405 552 LLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQT 624 (823)
Q Consensus 552 L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~ 624 (823)
-+.++...++|++||..|.++..-.-++.+ -+....+.-+.++ ..++-+.+..|.++.||.-+
T Consensus 563 ~~~il~~~~~~~eal~~LEeLk~~vP~es~--------v~~llgki~k~~~--~~~~Al~~f~~A~~ldpkg~ 625 (638)
T KOG1126|consen 563 RASILFSLGRYVEALQELEELKELVPQESS--------VFALLGKIYKRLG--NTDLALLHFSWALDLDPKGA 625 (638)
T ss_pred HHHHHHhhcchHHHHHHHHHHHHhCcchHH--------HHHHHHHHHHHHc--cchHHHHhhHHHhcCCCccc
Confidence 356778889999999999988643321100 0111222223344 34566677779999999854
No 285
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=64.54 E-value=2.7e+02 Score=32.08 Aligned_cols=209 Identities=14% Similarity=0.137 Sum_probs=101.0
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-cEEEEeCCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-SIAFHRLPNL 105 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~l~~~~L~~l 105 (823)
.-|+-..+||.|.+|.-.+. +..+...+ ..+|....-.|..+-++-+.+ .+.+-.|.--
T Consensus 117 tgLlt~GEDG~iKiWSrsGM-------------------LRStl~Q~-~~~v~c~~W~p~S~~vl~c~g~h~~IKpL~~n 176 (737)
T KOG1524|consen 117 AGLLTAGEDGVIKIWSRSGM-------------------LRSTVVQN-EESIRCARWAPNSNSIVFCQGGHISIKPLAAN 176 (737)
T ss_pred ceeeeecCCceEEEEeccch-------------------HHHHHhhc-CceeEEEEECCCCCceEEecCCeEEEeecccc
Confidence 35677778888888863321 11111112 357888888888776665555 4666655211
Q ss_pred cccccccCCCCcEEEEeeCCC-ceEEEEEcCeEEEEEEcC--CCceeEeeeecCCCCceEEEecCCeEEEEEcCceEEEE
Q 003405 106 ETIAVLTKAKGANVYSWDDRR-GFLCFARQKRVCIFRHDG--GRGFVEVKDFGVPDTVKSMSWCGENICIAIRKGYMILN 182 (823)
Q Consensus 106 ~~~~~i~~~kg~~~fa~~~~~-~~l~V~~kkki~l~~~~~--~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~~~y~lid 182 (823)
..+-+-..-.|+ ..+++-+. .-|.+........=-|++ ...|.. -.-..+|++++|..+.++..- +|....
T Consensus 177 ~k~i~WkAHDGi-iL~~~W~~~s~lI~sgGED~kfKvWD~~G~~Lf~S---~~~ey~ITSva~npd~~~~v~--S~nt~R 250 (737)
T KOG1524|consen 177 SKIIRWRAHDGL-VLSLSWSTQSNIIASGGEDFRFKIWDAQGANLFTS---AAEEYAITSVAFNPEKDYLLW--SYNTAR 250 (737)
T ss_pred cceeEEeccCcE-EEEeecCccccceeecCCceeEEeecccCcccccC---Chhccceeeeeeccccceeee--eeeeee
Confidence 111110111232 23333322 123333333332223442 222321 111358999999966533322 244555
Q ss_pred cCCCCeeeccCCC--CCCCCEEEEccCCeEEEEe--CC--------------eEEEE-cCCCccccCCceeecCCCcEEE
Q 003405 183 ATNGALSEVFPSG--RIGPPLVVSLLSGELLLGK--EN--------------IGVFV-DQNGKLLQADRICWSEAPIAVI 243 (823)
Q Consensus 183 l~~~~~~~L~~~~--~~~~p~i~~~~~~EfLL~~--~~--------------~gvfv-~~~G~~~~~~~i~w~~~P~~v~ 243 (823)
++...+-++|..+ ..+.-+.+..+.+.+++++ +. ..+-+ |.... + ...+.++.....+.
T Consensus 251 ~~~p~~GSifnlsWS~DGTQ~a~gt~~G~v~~A~~ieq~l~~~n~~~t~~~r~~I~vrdV~~~-v-~d~LE~p~rv~k~s 328 (737)
T KOG1524|consen 251 FSSPRVGSIFNLSWSADGTQATCGTSTGQLIVAYAIEQQLVSGNLKATSKSRKSITVRDVATG-V-QDILEFPQRVVKFS 328 (737)
T ss_pred ecCCCccceEEEEEcCCCceeeccccCceEEEeeeehhhhhhccceeEeeccceEEeehhhhh-H-HHHhhCccceeeee
Confidence 5555555555431 1111122222333333322 10 11111 11000 0 14566777777888
Q ss_pred EeCCEEEEEeCCeEEEEEcc
Q 003405 244 IQKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 244 ~~~PYll~~~~~~ieV~~l~ 263 (823)
..+.|+++.....+.|++-+
T Consensus 329 L~Y~hLvvaTs~qvyiys~k 348 (737)
T KOG1524|consen 329 LGYGHLVVATSLQVYIYSEK 348 (737)
T ss_pred eceeEEEEEeccEEEEEecC
Confidence 88999999999999998865
No 286
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=63.80 E-value=1.2e+02 Score=35.99 Aligned_cols=42 Identities=24% Similarity=0.463 Sum_probs=37.8
Q ss_pred CcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHh
Q 003405 532 NYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHELV 573 (823)
Q Consensus 532 n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~ 573 (823)
|.-|++.+++...+.+++...+.+|.+.|++++|.++-.+..
T Consensus 777 n~~dfe~ae~lf~e~~~~~dai~my~k~~kw~da~kla~e~~ 818 (1636)
T KOG3616|consen 777 NKGDFEIAEELFTEADLFKDAIDMYGKAGKWEDAFKLAEECH 818 (1636)
T ss_pred cchhHHHHHHHHHhcchhHHHHHHHhccccHHHHHHHHHHhc
Confidence 556789999999999999999999999999999999887764
No 287
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=63.76 E-value=62 Score=38.89 Aligned_cols=71 Identities=23% Similarity=0.274 Sum_probs=46.2
Q ss_pred CCCeeEEEEecccCc-eeeEeCc-EEEEeCCCCcccccccCCCCc------------EEEEeeCCCceEEE-EEcCeEEE
Q 003405 75 KKPILSMEVLASRQL-LLSLSES-IAFHRLPNLETIAVLTKAKGA------------NVYSWDDRRGFLCF-ARQKRVCI 139 (823)
Q Consensus 75 k~~I~qI~~~~~~~~-Ll~l~d~-l~~~~L~~l~~~~~i~~~kg~------------~~fa~~~~~~~l~V-~~kkki~l 139 (823)
..+|..|.+.|+.+. .+++.|+ +.+..+++++...+|...+.+ +.+++++..+.++. +....|++
T Consensus 292 gs~I~~i~vS~ds~~~sl~~~DNqI~li~~~dl~~k~tIsgi~~~~~~~k~~~~~l~t~~~idpr~~~~vln~~~g~vQ~ 371 (792)
T KOG1963|consen 292 GSPILHIVVSPDSDLYSLVLEDNQIHLIKASDLEIKSTISGIKPPTPSTKTRPQSLTTGVSIDPRTNSLVLNGHPGHVQF 371 (792)
T ss_pred CCeeEEEEEcCCCCeEEEEecCceEEEEeccchhhhhhccCccCCCccccccccccceeEEEcCCCCceeecCCCceEEE
Confidence 469999999998876 5556674 888888776644444433333 34677875544333 45567888
Q ss_pred EEEcCC
Q 003405 140 FRHDGG 145 (823)
Q Consensus 140 ~~~~~~ 145 (823)
|..-.+
T Consensus 372 ydl~td 377 (792)
T KOG1963|consen 372 YDLYTD 377 (792)
T ss_pred Eecccc
Confidence 876544
No 288
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=63.30 E-value=75 Score=38.28 Aligned_cols=74 Identities=20% Similarity=0.188 Sum_probs=48.4
Q ss_pred HHHHH-HhcCCHHHHHHHhhhCCCcchHh---------h----------hhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 306 QIVQL-TASGDFEEALALCKLLPPEDASL---------R----------AAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 306 qI~~L-l~~~~~e~Al~L~~~~~~~~~~~---------~----------~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.+.-| ++-|+.|+|+.|..++..-|-.. . +-.++.-+-+||-+|=.+++.+.|+++|.++
T Consensus 805 kvAvLAieLgMlEeA~~lYr~ckR~DLlNKlyQs~g~w~eA~eiAE~~DRiHLr~Tyy~yA~~Lear~Di~~AleyyEK~ 884 (1416)
T KOG3617|consen 805 KVAVLAIELGMLEEALILYRQCKRYDLLNKLYQSQGMWSEAFEIAETKDRIHLRNTYYNYAKYLEARRDIEAALEYYEKA 884 (1416)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHHHHhhccceehhhhHHHHHHHHHhhccHHHHHHHHHhc
Confidence 34444 56677777777776653221100 0 0012334556788888999999999999999
Q ss_pred CCCHHHHHHhCCCC
Q 003405 366 QVDITYALSLYPSI 379 (823)
Q Consensus 366 ~~dP~~vi~Lfp~l 379 (823)
++.--+|-++..+.
T Consensus 885 ~~hafev~rmL~e~ 898 (1416)
T KOG3617|consen 885 GVHAFEVFRMLKEY 898 (1416)
T ss_pred CChHHHHHHHHHhC
Confidence 98777777776555
No 289
>PF07569 Hira: TUP1-like enhancer of split; InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=63.24 E-value=27 Score=35.77 Aligned_cols=30 Identities=17% Similarity=0.212 Sum_probs=27.4
Q ss_pred CCcEEEEEEeCCEEEEEeCCCcEEEEcCCC
Q 003405 16 SPKIDAVASYGLKILLGCSDGSLKIYSPGS 45 (823)
Q Consensus 16 ~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~ 45 (823)
+..+..+.+.+++|.+=|++|.+++|++..
T Consensus 12 gs~~~~l~~~~~~Ll~iT~~G~l~vWnl~~ 41 (219)
T PF07569_consen 12 GSPVSFLECNGSYLLAITSSGLLYVWNLKK 41 (219)
T ss_pred CCceEEEEeCCCEEEEEeCCCeEEEEECCC
Confidence 568889999999999999999999999775
No 290
>PRK01742 tolB translocation protein TolB; Provisional
Probab=62.72 E-value=2.7e+02 Score=31.55 Aligned_cols=128 Identities=14% Similarity=0.081 Sum_probs=68.6
Q ss_pred eeEEEEecccCceeeEe--Cc---EEEEeCCCCcccccccCCC-CcEEEEeeCCCceEEEEEc--CeEEEEEEcCCC-ce
Q 003405 78 ILSMEVLASRQLLLSLS--ES---IAFHRLPNLETIAVLTKAK-GANVYSWDDRRGFLCFARQ--KRVCIFRHDGGR-GF 148 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~l~--d~---l~~~~L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~k--kki~l~~~~~~~-~f 148 (823)
...+...|+.+.|++.+ ++ |.++++.+-+ ...+.... .....+++++...|+.+.. ....||.+..+. ..
T Consensus 250 ~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~ 328 (429)
T PRK01742 250 NGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGT-PSQLTSGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGA 328 (429)
T ss_pred cCceeECCCCCEEEEEEecCCcEEEEEEECCCCC-eEeeccCCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCe
Confidence 33566777777666654 34 4455554322 22222222 2345667777666665543 457788875321 11
Q ss_pred eEeeeecCCCCceEEEec--CCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEE
Q 003405 149 VEVKDFGVPDTVKSMSWC--GENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLG 213 (823)
Q Consensus 149 ~~~kei~~~~~~~~l~~~--~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~ 213 (823)
..+ .... .+..|. |+.|+++.......+|+.+|+...+........| ...+++++++.
T Consensus 329 ~~l---~~~~--~~~~~SpDG~~ia~~~~~~i~~~Dl~~g~~~~lt~~~~~~~~--~~sPdG~~i~~ 388 (429)
T PRK01742 329 SLV---GGRG--YSAQISADGKTLVMINGDNVVKQDLTSGSTEVLSSTFLDESP--SISPNGIMIIY 388 (429)
T ss_pred EEe---cCCC--CCccCCCCCCEEEEEcCCCEEEEECCCCCeEEecCCCCCCCc--eECCCCCEEEE
Confidence 111 1111 234555 5677777777777799999877665433221222 24567777773
No 291
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=62.18 E-value=2.8e+02 Score=31.47 Aligned_cols=112 Identities=12% Similarity=0.157 Sum_probs=65.8
Q ss_pred eeEEEEecccCceeeEeCc--EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeee
Q 003405 78 ILSMEVLASRQLLLSLSES--IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDF 154 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~l~d~--l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei 154 (823)
-.++.+.++ ..++-.-|| +-+|+..+-+.....+..-++-++.++++...++|+ -+-.|.++.++.++ .+.+ +-
T Consensus 364 Y~r~~~~~e-~~vigt~dgD~l~iyd~~~~e~kr~e~~lg~I~av~vs~dGK~~vvaNdr~el~vididngn-v~~i-dk 440 (668)
T COG4946 364 YRRIQVDPE-GDVIGTNDGDKLGIYDKDGGEVKRIEKDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGN-VRLI-DK 440 (668)
T ss_pred EEEEccCCc-ceEEeccCCceEEEEecCCceEEEeeCCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCC-eeEe-cc
Confidence 345555555 222222234 899998765533223344556667777765556666 33356677777553 2211 11
Q ss_pred cCCCCceEEEecCC--eEEEEEcCceE-----EEEcCCCCeeecc
Q 003405 155 GVPDTVKSMSWCGE--NICIAIRKGYM-----ILNATNGALSEVF 192 (823)
Q Consensus 155 ~~~~~~~~l~~~~~--~i~v~~~~~y~-----lidl~~~~~~~L~ 192 (823)
+--+-|+.+.|..+ .|.+|...+|+ ++|+.++++-.+-
T Consensus 441 S~~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~~Kiy~vT 485 (668)
T COG4946 441 SEYGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDGGKIYDVT 485 (668)
T ss_pred cccceeEEEEEcCCceeEEEecCcceeeeeEEEEecCCCeEEEec
Confidence 22357888999854 68888888775 5778877765553
No 292
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=62.01 E-value=3e+02 Score=31.80 Aligned_cols=124 Identities=18% Similarity=0.206 Sum_probs=72.2
Q ss_pred EEEEeCCCCcccccccCCCCcEEEEeeCCCceEEEE---EcCeEEEEEEcCCCceeEeeeecCCCCceEEEec---CCeE
Q 003405 97 IAFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFA---RQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC---GENI 170 (823)
Q Consensus 97 l~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~---~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~---~~~i 170 (823)
++++.....+-.-.+.+.-.|++++++.+...+||. .--++.||-.+.+-. +.+|+.|+.-.+. |+.|
T Consensus 253 Lyll~t~g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~~~v------~df~egpRN~~~fnp~g~ii 326 (566)
T KOG2315|consen 253 LYLLATQGESVSVPLLKEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRGKPV------FDFPEGPRNTAFFNPHGNII 326 (566)
T ss_pred EEEEEecCceEEEecCCCCCceEEEECCCCCEEEEEEecccceEEEEcCCCCEe------EeCCCCCccceEECCCCCEE
Confidence 444444422222234455579999999887665554 668899998884321 3467777744443 6777
Q ss_pred EEE-Ec---CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeC--------CeEEEEcCCCccc
Q 003405 171 CIA-IR---KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKE--------NIGVFVDQNGKLL 228 (823)
Q Consensus 171 ~v~-~~---~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~--------~~gvfv~~~G~~~ 228 (823)
|+| +. ....+.|+.+.+...-+. ....-...+.+++|+++... |-.=+.+.+|++.
T Consensus 327 ~lAGFGNL~G~mEvwDv~n~K~i~~~~--a~~tt~~eW~PdGe~flTATTaPRlrvdNg~KiwhytG~~l 394 (566)
T KOG2315|consen 327 LLAGFGNLPGDMEVWDVPNRKLIAKFK--AANTTVFEWSPDGEYFLTATTAPRLRVDNGIKIWHYTGSLL 394 (566)
T ss_pred EEeecCCCCCceEEEeccchhhccccc--cCCceEEEEcCCCcEEEEEeccccEEecCCeEEEEecCcee
Confidence 766 33 467788877633221111 11223556789999988432 2223456677654
No 293
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratricopeptide repeat (TPR) is a structural motif present in a wide range of proteins. It mediates protein-protein interactions and the assembly of multiprotein complexes. The TPR motif consists of 3-16 tandem repeats of 34 amino acid residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR, which generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed mainly by residues on helix A, and the other surface presents residues from both helices A and B.; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=61.96 E-value=6.9 Score=26.46 Aligned_cols=25 Identities=28% Similarity=0.499 Sum_probs=20.5
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+....|..++..++|++|++.|.++
T Consensus 3 ~~~~~g~~~~~~~~~~~A~~~~~~a 27 (34)
T PF00515_consen 3 AYYNLGNAYFQLGDYEEALEYYQRA 27 (34)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhCCchHHHHHHHHH
Confidence 4556799999999999999999884
No 294
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif. Most PPR proteins have roles in mitochondria or plastids. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplasts. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins. Plant genomes have between one hundred and five hundred PPR genes each, whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC). The plant PPR protein family has been divided into two subfamilies on the basis of their motif content and organisation. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation, and crp1, which is involved in RNA processing. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=61.96 E-value=11 Score=24.45 Aligned_cols=26 Identities=31% Similarity=0.521 Sum_probs=23.2
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 549 YTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
|..++..|.+.|+.++|++++.+..+
T Consensus 3 y~~li~~~~~~~~~~~a~~~~~~M~~ 28 (31)
T PF01535_consen 3 YNSLISGYCKMGQFEEALEVFDEMRE 28 (31)
T ss_pred HHHHHHHHHccchHHHHHHHHHHHhH
Confidence 66789999999999999999998764
No 295
>KOG0547 consensus Translocase of outer mitochondrial membrane complex, subunit TOM70/TOM72 [Intracellular trafficking, secretion, and vesicular transport]
Probab=61.95 E-value=91 Score=35.44 Aligned_cols=61 Identities=20% Similarity=0.179 Sum_probs=38.2
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
-+..+-.++|++|++=++....-++ .--.-+.+.|..+|++++|+++|..|.+ ++..||..
T Consensus 401 gQm~flL~q~e~A~aDF~Kai~L~p-----e~~~~~iQl~~a~Yr~~k~~~~m~~Fee-------~kkkFP~~ 461 (606)
T KOG0547|consen 401 GQMRFLLQQYEEAIADFQKAISLDP-----ENAYAYIQLCCALYRQHKIAESMKTFEE-------AKKKFPNC 461 (606)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCh-----hhhHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhCCCC
Confidence 3344444566666666654321111 1112356668888899999999999986 56888887
No 296
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=61.08 E-value=9.3 Score=36.22 Aligned_cols=58 Identities=17% Similarity=0.101 Sum_probs=42.7
Q ss_pred hhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 303 LGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 303 ~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+......+.+.|+|++|+..++.....++ .....+...|..+...|+|++|...|.++
T Consensus 27 ~~~~g~~~~~~g~~~~A~~~~~~al~~~P-----~~~~a~~~lg~~~~~~g~~~~A~~~y~~A 84 (144)
T PRK15359 27 VYASGYASWQEGDYSRAVIDFSWLVMAQP-----WSWRAHIALAGTWMMLKEYTTAINFYGHA 84 (144)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCC-----CcHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 33446677899999999999876421111 22345677789999999999999999874
No 297
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=60.67 E-value=7.1 Score=39.74 Aligned_cols=31 Identities=29% Similarity=0.474 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHhcCCCH
Q 003405 339 GSIHIRFAHYLFDTGSYEEAMEHFLASQVDI 369 (823)
Q Consensus 339 ~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP 369 (823)
..|..-||+.|-.+|+|++|+.+|.++-.||
T Consensus 103 GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P 133 (250)
T COG3063 103 GDVLNNYGAFLCAQGRPEEAMQQFERALADP 133 (250)
T ss_pred cchhhhhhHHHHhCCChHHHHHHHHHHHhCC
Confidence 4678889999999999999999999976554
No 298
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=60.64 E-value=12 Score=25.90 Aligned_cols=24 Identities=13% Similarity=0.297 Sum_probs=20.4
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHH
Q 003405 549 YTALLELYKSNARHREALKLLHEL 572 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l 572 (823)
+..|+.+|...|+|++|++++.+.
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~a 25 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQA 25 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHH
Confidence 457899999999999999999873
No 299
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transcription, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accommodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include Cdc16p, Cdc23p and C
Probab=60.11 E-value=15 Score=29.81 Aligned_cols=56 Identities=23% Similarity=0.299 Sum_probs=41.1
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.+...+...|++++|+..++....... ....++...|..++..+++++|++.|.++
T Consensus 5 ~~a~~~~~~~~~~~A~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 60 (100)
T cd00189 5 NLGNLYYKLGDYDEALEYYEKALELDP-----DNADAYYNLAAAYYKLGKYEEALEDYEKA 60 (100)
T ss_pred HHHHHHHHHhcHHHHHHHHHHHHhcCC-----ccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455667889999999999876421111 11256677889999999999999998863
No 300
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=59.59 E-value=2.3e+02 Score=29.76 Aligned_cols=154 Identities=16% Similarity=0.246 Sum_probs=101.7
Q ss_pred CCcEEEEEEeCC---EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAVASYGL---KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci~~~~~---~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
+.++=.++.+.. -|+-|..|-.|.+|+..+. .++++.......+++.|..+.--|..++|.+
T Consensus 14 ~~r~W~~awhp~~g~ilAscg~Dk~vriw~~~~~---------------~s~~ck~vld~~hkrsVRsvAwsp~g~~La~ 78 (312)
T KOG0645|consen 14 KDRVWSVAWHPGKGVILASCGTDKAVRIWSTSSG---------------DSWTCKTVLDDGHKRSVRSVAWSPHGRYLAS 78 (312)
T ss_pred CCcEEEEEeccCCceEEEeecCCceEEEEecCCC---------------CcEEEEEeccccchheeeeeeecCCCcEEEE
Confidence 346666776654 5888999999999986641 2455554445567899999999888776665
Q ss_pred Ee-Cc-EEEEeCC--CCcccccccCC-CCcEEEEeeCCCceEEEEE-cCeEEEEEEcCCCceeEeeeec-CCCCceEEEe
Q 003405 93 LS-ES-IAFHRLP--NLETIAVLTKA-KGANVYSWDDRRGFLCFAR-QKRVCIFRHDGGRGFVEVKDFG-VPDTVKSMSW 165 (823)
Q Consensus 93 l~-d~-l~~~~L~--~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~~~~f~~~kei~-~~~~~~~l~~ 165 (823)
-+ |. +.+|.=. +|+-+..++.- ..|.+++.+.+...|+... .|.+-|++..++.+|.-.--+. -...++.+.|
T Consensus 79 aSFD~t~~Iw~k~~~efecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddEfec~aVL~~HtqDVK~V~W 158 (312)
T KOG0645|consen 79 ASFDATVVIWKKEDGEFECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDEFECIAVLQEHTQDVKHVIW 158 (312)
T ss_pred eeccceEEEeecCCCceeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCcEEEEeeeccccccccEEEE
Confidence 54 54 7777543 46655555433 3578888998888888874 4678899998766674322111 1246888999
Q ss_pred cCCe-EEEEE--cCceEEEEcC
Q 003405 166 CGEN-ICIAI--RKGYMILNAT 184 (823)
Q Consensus 166 ~~~~-i~v~~--~~~y~lidl~ 184 (823)
.+.. |.+.+ .+...+++-.
T Consensus 159 HPt~dlL~S~SYDnTIk~~~~~ 180 (312)
T KOG0645|consen 159 HPTEDLLFSCSYDNTIKVYRDE 180 (312)
T ss_pred cCCcceeEEeccCCeEEEEeec
Confidence 8743 44443 3566666544
No 301
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=59.22 E-value=23 Score=36.28 Aligned_cols=68 Identities=21% Similarity=0.274 Sum_probs=46.8
Q ss_pred hhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 303 LGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 303 ~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
.-.+...+++.|+|++|+..++......+ ........+...|..++..++|++|+..|.+ ++..+|+-
T Consensus 36 ~~~~g~~~~~~~~~~~A~~~~~~~~~~~p--~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~-------~l~~~p~~ 103 (235)
T TIGR03302 36 LYEEAKEALDSGDYTEAIKYFEALESRYP--FSPYAEQAQLDLAYAYYKSGDYAEAIAAADR-------FIRLHPNH 103 (235)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCC--CchhHHHHHHHHHHHHHhcCCHHHHHHHHHH-------HHHHCcCC
Confidence 34445667889999999999976521111 0012234566679999999999999999986 45666654
No 302
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=59.06 E-value=17 Score=24.39 Aligned_cols=21 Identities=19% Similarity=0.295 Sum_probs=17.9
Q ss_pred EeCCEEEEEeCCCcEEEEcCC
Q 003405 24 SYGLKILLGCSDGSLKIYSPG 44 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~ 44 (823)
..++.+|+|+.+|.|+.++..
T Consensus 4 ~~~~~v~~~~~~g~l~a~d~~ 24 (33)
T smart00564 4 LSDGTVYVGSTDGTLYALDAK 24 (33)
T ss_pred EECCEEEEEcCCCEEEEEEcc
Confidence 456789999999999999854
No 303
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=57.78 E-value=1e+02 Score=36.30 Aligned_cols=141 Identities=16% Similarity=0.221 Sum_probs=77.7
Q ss_pred CCcEEEEEEe--CC---EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCce
Q 003405 16 SPKIDAVASY--GL---KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLL 90 (823)
Q Consensus 16 ~~~I~ci~~~--~~---~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~L 90 (823)
..+|+|+.-- .+ +++.|.+||.+.+|.+..+ .+..+.++++ ..+.+..+.... .+
T Consensus 54 ~a~VnC~~~l~~s~~~a~~vsG~sD~~v~lW~l~~~----------------~~~~i~~~~g-~~~~~~cv~a~~---~~ 113 (764)
T KOG1063|consen 54 VARVNCVHWLPTSEIVAEMVSGDSDGRVILWKLRDE----------------YLIKIYTIQG-HCKECVCVVARS---SV 113 (764)
T ss_pred ccceEEEEEcccccccceEEEccCCCcEEEEEEeeh----------------heEEEEeecC-cceeEEEEEeee---eE
Confidence 3578888755 22 6999999999999987721 1222344454 344554443322 22
Q ss_pred ee--EeCc-EEEEeCCCCc--ccccc-cCCCCcE--EEEeeCCCce--EEEE-EcCeEEEEEEcCCCceeEeeeecCC-C
Q 003405 91 LS--LSES-IAFHRLPNLE--TIAVL-TKAKGAN--VYSWDDRRGF--LCFA-RQKRVCIFRHDGGRGFVEVKDFGVP-D 158 (823)
Q Consensus 91 l~--l~d~-l~~~~L~~l~--~~~~i-~~~kg~~--~fa~~~~~~~--l~V~-~kkki~l~~~~~~~~f~~~kei~~~-~ 158 (823)
.. .+|+ +.+|+-..-+ ....+ ..+|-+- +++.-.+.+. ++++ .++.+.+|.-..+ .|..+.|+.-. |
T Consensus 114 ~~~~~ad~~v~vw~~~~~e~~~~~~~rf~~k~~ipLcL~~~~~~~~~lla~Ggs~~~v~~~s~~~d-~f~~v~el~GH~D 192 (764)
T KOG1063|consen 114 MTCKAADGTVSVWDKQQDEVFLLAVLRFEIKEAIPLCLAALKNNKTFLLACGGSKFVVDLYSSSAD-SFARVAELEGHTD 192 (764)
T ss_pred EEeeccCceEEEeecCCCceeeehheehhhhhHhhHHHhhhccCCcEEEEecCcceEEEEeccCCc-ceeEEEEeeccch
Confidence 22 3565 8899873222 01111 1112111 1222223333 4555 4455556655534 68888777543 7
Q ss_pred CceEEEec---CCeEEEEEcCc
Q 003405 159 TVKSMSWC---GENICIAIRKG 177 (823)
Q Consensus 159 ~~~~l~~~---~~~i~v~~~~~ 177 (823)
=|++++|. ++.+++++.++
T Consensus 193 WIrsl~f~~~~~~~~~laS~SQ 214 (764)
T KOG1063|consen 193 WIRSLAFARLGGDDLLLASSSQ 214 (764)
T ss_pred hhhhhhhhccCCCcEEEEecCC
Confidence 89999987 44677777654
No 304
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=57.48 E-value=41 Score=35.13 Aligned_cols=26 Identities=19% Similarity=0.359 Sum_probs=22.1
Q ss_pred CcEEEEEEeC---------CEEEEEeCCCcEEEEc
Q 003405 17 PKIDAVASYG---------LKILLGCSDGSLKIYS 42 (823)
Q Consensus 17 ~~I~ci~~~~---------~~L~vGT~~G~l~~y~ 42 (823)
..|+|+++-. ..|+|||++|.|++.+
T Consensus 177 t~ITcm~tikk~~~d~~a~scLViGTE~~~i~iLd 211 (257)
T PF14779_consen 177 TVITCMATIKKSSADEDAVSCLVIGTESGEIYILD 211 (257)
T ss_pred ceeEEeeeecccccCCCCcceEEEEecCCeEEEEC
Confidence 3799998763 4899999999999987
No 305
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=56.84 E-value=1.1e+02 Score=33.54 Aligned_cols=111 Identities=14% Similarity=0.262 Sum_probs=74.1
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCce-ee
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLL-LS 92 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~L-l~ 92 (823)
+..+.|++... +.|.-|++|-.+..|+-..... .....++.++ +.=|+.+.--|..... +.
T Consensus 300 ~ksl~~i~~~~~~~Ll~~gssdr~irl~DPR~~~g---------------s~v~~s~~gH-~nwVssvkwsp~~~~~~~S 363 (423)
T KOG0313|consen 300 NKSLNCISYSPLSKLLASGSSDRHIRLWDPRTGDG---------------SVVSQSLIGH-KNWVSSVKWSPTNEFQLVS 363 (423)
T ss_pred CcceeEeecccccceeeecCCCCceeecCCCCCCC---------------ceeEEeeecc-hhhhhheecCCCCceEEEE
Confidence 34688998886 5788999999999998554321 1222344433 5578888888877554 44
Q ss_pred Ee-Cc-EEEEeCCCCc-ccccccCCCCcEEEEeeCCCceEEE-E-EcCeEEEEEEc
Q 003405 93 LS-ES-IAFHRLPNLE-TIAVLTKAKGANVYSWDDRRGFLCF-A-RQKRVCIFRHD 143 (823)
Q Consensus 93 l~-d~-l~~~~L~~l~-~~~~i~~~kg~~~fa~~~~~~~l~V-~-~kkki~l~~~~ 143 (823)
.+ |+ +++|+..+-+ |++.|.. .+-..|+++-..+..+| | ..++|.||+..
T Consensus 364 ~S~D~t~klWDvRS~k~plydI~~-h~DKvl~vdW~~~~~IvSGGaD~~l~i~~~~ 418 (423)
T KOG0313|consen 364 GSYDNTVKLWDVRSTKAPLYDIAG-HNDKVLSVDWNEGGLIVSGGADNKLRIFKGS 418 (423)
T ss_pred EecCCeEEEEEeccCCCcceeecc-CCceEEEEeccCCceEEeccCcceEEEeccc
Confidence 44 55 9999997655 6665544 35567888766654333 3 77889888865
No 306
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=56.83 E-value=45 Score=22.95 Aligned_cols=34 Identities=26% Similarity=0.222 Sum_probs=26.0
Q ss_pred eeeecCCCCCCeeEEEEecccCceeeEeC-c-EEEEe
Q 003405 67 ERTISGFSKKPILSMEVLASRQLLLSLSE-S-IAFHR 101 (823)
Q Consensus 67 ~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~ 101 (823)
.+++++. ..+|..|..-|..+.+++.+. + |++|+
T Consensus 4 ~~~~~~h-~~~i~~i~~~~~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 4 VRTFRGH-SSSINSIAWSPDGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp EEEEESS-SSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred EEEEcCC-CCcEEEEEEecccccceeeCCCCEEEEEC
Confidence 3444443 679999999999888888876 4 88885
No 307
>PF10395 Utp8: Utp8 family; InterPro: IPR018843 Utp8 is an essential component of the nuclear tRNA export machinery in Saccharomyces cerevisiae (Baker's yeast). It is a tRNA binding protein that acts at a step between tRNA maturation/aminoacylation and translocation of the tRNA across the nuclear pore complex.
Probab=56.79 E-value=4.2e+02 Score=31.82 Aligned_cols=213 Identities=16% Similarity=0.199 Sum_probs=116.1
Q ss_pred CceeeEeCc-EEEEeC---CCCcccccccCCCCcEEEEeeCC---CceEEEE--EcCe--EEEEEEcC-----C---Cce
Q 003405 88 QLLLSLSES-IAFHRL---PNLETIAVLTKAKGANVYSWDDR---RGFLCFA--RQKR--VCIFRHDG-----G---RGF 148 (823)
Q Consensus 88 ~~Ll~l~d~-l~~~~L---~~l~~~~~i~~~kg~~~fa~~~~---~~~l~V~--~kkk--i~l~~~~~-----~---~~f 148 (823)
.+-+-++.. |.-|-+ |.+-+-..++.+..|+++.+... ....|+| .+|| +.+.+.+. + ..-
T Consensus 41 ~IdiGIS~S~ISsYIi~PTPKLiwsypi~pt~iV~~~dV~~~~~~~~~~~~glt~rKk~~ll~i~~~~~~~~~~~~~~e~ 120 (670)
T PF10395_consen 41 QIDIGISGSAISSYIIKPTPKLIWSYPISPTTIVECCDVLEKSDGKKLYCVGLTERKKFKLLLIERKVGSTEDGTVNSET 120 (670)
T ss_pred eEEEEeccchhhheecCCCcceeEeeccCcCceEEEEEeEecCCCcEEEEEEEeeCCeeEEEEEEccCccccccccCccc
Confidence 445556663 544443 33333345666677777766432 3355666 3443 44555441 0 112
Q ss_pred eEeeeecCCCCceEEEec--CCeEEEEEcCc-eEEEEcCCCCeeeccC--CCCCC----CCEEEEcc---CCeEEE--Ee
Q 003405 149 VEVKDFGVPDTVKSMSWC--GENICIAIRKG-YMILNATNGALSEVFP--SGRIG----PPLVVSLL---SGELLL--GK 214 (823)
Q Consensus 149 ~~~kei~~~~~~~~l~~~--~~~i~v~~~~~-y~lidl~~~~~~~L~~--~~~~~----~p~i~~~~---~~EfLL--~~ 214 (823)
...-++.+.+.+.++.+. +..|++...+| ..++|.+.+.....-. ..... .-.|.... .++|++ |.
T Consensus 121 ~~~~~~kl~~kvv~Ik~~~~~~~I~vvl~nG~i~~~d~~~~~l~~~~~l~~~~~~~v~ys~fv~~~~~~~~~~~ll~v~~ 200 (670)
T PF10395_consen 121 TNEFELKLDDKVVGIKFSSDGKIIYVVLENGSIQIYDFSENSLEKVPQLKLKSSINVSYSKFVNDFELENGKDLLLTVSQ 200 (670)
T ss_pred cceEEEEcccceEEEEEecCCCEEEEEEcCCcEEEEeccccccccccccccccccceehhhhhcccccccCCceEEEEEE
Confidence 233457788999999998 45688888854 6788884332221111 11110 01111111 234543 55
Q ss_pred -CCeEEE---EcC--CCcccc-CC--ceee-cCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCCc-------
Q 003405 215 -ENIGVF---VDQ--NGKLLQ-AD--RICW-SEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQNV------- 277 (823)
Q Consensus 215 -~~~gvf---v~~--~G~~~~-~~--~i~w-~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~------- 277 (823)
++..+. +.. ++.+.- -. .+.- ...-..++|...-+..+..+.|+++++.+ ..+.++|.++..
T Consensus 201 ~~~~k~~ykL~~l~~~~~~~~El~s~~~e~~~~~~s~f~Y~~G~LY~l~~~~i~~ysip~-f~~~~tI~l~~ii~~~~~~ 279 (670)
T PF10395_consen 201 LSNSKLSYKLISLSNESSSIFELSSTILENFGLEDSKFCYQFGKLYQLSKKTISSYSIPN-FQIQKTISLPSIIDKESDD 279 (670)
T ss_pred cCCCcEEEEEEEeccCCcceEEeehheeccCCcccceEEEeCCEEEEEeCCEEEEEEcCC-ceEEEEEEechhhcccccc
Confidence 332111 112 222110 00 1111 11224577888888888999999999974 788899988832
Q ss_pred --ccccccCCeEEEeccceEEEeecc
Q 003405 278 --RHLIPSSNAVVVALENSIFGLFPV 301 (823)
Q Consensus 278 --~~l~~~~~~v~v~s~~~I~~l~~~ 301 (823)
.+...+.++++++.++.||.+.-+
T Consensus 280 ~vSl~~~s~nRvLLs~~nkIyLld~~ 305 (670)
T PF10395_consen 280 LVSLKPPSPNRVLLSVNNKIYLLDLK 305 (670)
T ss_pred ceEeecCCCCeEEEEcCCEEEEEeeh
Confidence 223456788999999999997654
No 308
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=56.59 E-value=1.1e+02 Score=34.14 Aligned_cols=110 Identities=15% Similarity=0.247 Sum_probs=70.1
Q ss_pred CCcEEEEEEe--CC-EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceee
Q 003405 16 SPKIDAVASY--GL-KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLS 92 (823)
Q Consensus 16 ~~~I~ci~~~--~~-~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~ 92 (823)
...|.|++.. ++ -|.-|+.||+|..|++..-. .+ ..++.+ ++..|.|+.-.|...-+++
T Consensus 272 ~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~-------------~~----lh~~e~-H~dev~~V~WSPh~etvLA 333 (422)
T KOG0264|consen 272 SAEVNCVAFNPFNEFILATGSADKTVALWDLRNLN-------------KP----LHTFEG-HEDEVFQVEWSPHNETVLA 333 (422)
T ss_pred CCceeEEEeCCCCCceEEeccCCCcEEEeechhcc-------------cC----ceeccC-CCcceEEEEeCCCCCceeE
Confidence 4578888754 34 56788889999999977532 11 123333 3779999999998877666
Q ss_pred EeC--c-EEEEeCCCCccccc---------------ccCCCCcEEEEeeCCCce-EEE-EEcCeEEEEEEc
Q 003405 93 LSE--S-IAFHRLPNLETIAV---------------LTKAKGANVYSWDDRRGF-LCF-ARQKRVCIFRHD 143 (823)
Q Consensus 93 l~d--~-l~~~~L~~l~~~~~---------------i~~~kg~~~fa~~~~~~~-l~V-~~kkki~l~~~~ 143 (823)
-+. + +.+|++........ -.-.-.|+.|..+++... ||- +-.+-+.|++..
T Consensus 334 SSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~DfsWnp~ePW~I~SvaeDN~LqIW~~s 404 (422)
T KOG0264|consen 334 SSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSDFSWNPNEPWTIASVAEDNILQIWQMA 404 (422)
T ss_pred ecccCCcEEEEeccccccccChhhhccCCcceeEEecCcccccccccCCCCCCeEEEEecCCceEEEeecc
Confidence 654 4 99999965321110 011123566777776653 443 355667788776
No 309
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=56.29 E-value=18 Score=24.26 Aligned_cols=25 Identities=28% Similarity=0.378 Sum_probs=21.6
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
++...|..+...|++++|+..|.++
T Consensus 3 ~~~~lg~~y~~~~~~~~A~~~~~~a 27 (34)
T PF13181_consen 3 AYYNLGKIYEQLGDYEEALEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHTTSHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4566789999999999999999874
No 310
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=56.07 E-value=25 Score=25.53 Aligned_cols=32 Identities=25% Similarity=0.207 Sum_probs=25.3
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
....+|..+...|++++|...|.+ ++.+.|+.
T Consensus 3 ~~~~la~~~~~~G~~~~A~~~~~~-------~l~~~P~~ 34 (44)
T PF13428_consen 3 AWLALARAYRRLGQPDEAERLLRR-------ALALDPDD 34 (44)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHH-------HHHHCcCC
Confidence 456679999999999999999987 44555554
No 311
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=55.71 E-value=1.4e+02 Score=35.20 Aligned_cols=157 Identities=18% Similarity=0.229 Sum_probs=82.5
Q ss_pred CCcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcc-----ccc---------ccccc------eeeeeecCCC
Q 003405 16 SPKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDY-----QSL---------RKESY------ELERTISGFS 74 (823)
Q Consensus 16 ~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~-----~~l---------~~~~~------~l~~~~~~~~ 74 (823)
...|+|+.... ..++=|+=|-+..+|.+.+-...- ..+.. ..| ..+.. ++.++|.+ +
T Consensus 101 ~snVC~ls~~~~~~~iSgSWD~TakvW~~~~l~~~l-~gH~asVWAv~~l~e~~~vTgsaDKtIklWk~~~~l~tf~g-H 178 (745)
T KOG0301|consen 101 KSNVCSLSIGEDGTLISGSWDSTAKVWRIGELVYSL-QGHTASVWAVASLPENTYVTGSADKTIKLWKGGTLLKTFSG-H 178 (745)
T ss_pred ccceeeeecCCcCceEecccccceEEecchhhhccc-CCcchheeeeeecCCCcEEeccCcceeeeccCCchhhhhcc-c
Confidence 45788887665 356778888888888765432210 00000 011 01111 12344554 3
Q ss_pred CCCeeEEEEecccCceeeEeC-c-EEEEeCCCCcccccccCCCCcEE--EEeeC---CCceEEEEEcCeEEEEEEcCCCc
Q 003405 75 KKPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLTKAKGANV--YSWDD---RRGFLCFARQKRVCIFRHDGGRG 147 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~~~kg~~~--fa~~~---~~~~l~V~~kkki~l~~~~~~~~ 147 (823)
...|..+.+++.. -++++++ | |+.|++.. +.+. ...|-+. ++++. +.-.++.|-.+++.|+..+
T Consensus 179 tD~VRgL~vl~~~-~flScsNDg~Ir~w~~~g-e~l~---~~~ghtn~vYsis~~~~~~~Ivs~gEDrtlriW~~~---- 249 (745)
T KOG0301|consen 179 TDCVRGLAVLDDS-HFLSCSNDGSIRLWDLDG-EVLL---EMHGHTNFVYSISMALSDGLIVSTGEDRTLRIWKKD---- 249 (745)
T ss_pred hhheeeeEEecCC-CeEeecCCceEEEEeccC-ceee---eeeccceEEEEEEecCCCCeEEEecCCceEEEeecC----
Confidence 5689999999874 4566665 6 89999833 1111 1222222 33331 1123455577888777766
Q ss_pred eeEeeeecCCCCce-EEEec-CCeEEEEEcCc-eEEEEcC
Q 003405 148 FVEVKDFGVPDTVK-SMSWC-GENICIAIRKG-YMILNAT 184 (823)
Q Consensus 148 f~~~kei~~~~~~~-~l~~~-~~~i~v~~~~~-y~lidl~ 184 (823)
.-.+.|.+|..-. +.... ++-|++|.+.+ ..++...
T Consensus 250 -e~~q~I~lPttsiWsa~~L~NgDIvvg~SDG~VrVfT~~ 288 (745)
T KOG0301|consen 250 -ECVQVITLPTTSIWSAKVLLNGDIVVGGSDGRVRVFTVD 288 (745)
T ss_pred -ceEEEEecCccceEEEEEeeCCCEEEeccCceEEEEEec
Confidence 2245677776332 33333 45566776655 4555544
No 312
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=55.45 E-value=2.9e+02 Score=30.36 Aligned_cols=181 Identities=18% Similarity=0.114 Sum_probs=91.2
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccccccc
Q 003405 549 YTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIELF 628 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~if 628 (823)
+..|+.+|...|++++|++++.++........ ...... .-...+.-|+ ...+.+--.++....++.+|....
T Consensus 144 ~~~la~~~~~~g~~~~A~~~~~~~~~~~~~~~-~~~~~~-~~~~la~~~~---~~~~~~~A~~~~~~al~~~p~~~~--- 215 (389)
T PRK11788 144 LQQLLEIYQQEKDWQKAIDVAERLEKLGGDSL-RVEIAH-FYCELAQQAL---ARGDLDAARALLKKALAADPQCVR--- 215 (389)
T ss_pred HHHHHHHHHHhchHHHHHHHHHHHHHhcCCcc-hHHHHH-HHHHHHHHHH---hCCCHHHHHHHHHHHHhHCcCCHH---
Confidence 55788999999999999999998764321100 000000 0000111111 122333334444444444443210
Q ss_pred ccCCCChHHHH--HHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHH
Q 003405 629 LSGNIPADLVN--SYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKL 706 (823)
Q Consensus 629 ~~~~l~~~~Vl--~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kL 706 (823)
....+ -+......+.+..+++.++.. ++......++.|+..|... +...+...-+
T Consensus 216 ------~~~~la~~~~~~g~~~~A~~~~~~~~~~-~p~~~~~~~~~l~~~~~~~----------------g~~~~A~~~l 272 (389)
T PRK11788 216 ------ASILLGDLALAQGDYAAAIEALERVEEQ-DPEYLSEVLPKLMECYQAL----------------GDEAEGLEFL 272 (389)
T ss_pred ------HHHHHHHHHHHCCCHHHHHHHHHHHHHH-ChhhHHHHHHHHHHHHHHc----------------CCHHHHHHHH
Confidence 11111 122223356678888887753 2222234556677777643 0111122223
Q ss_pred HHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhC-CCchhHHHHHHHHhcC
Q 003405 707 LSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVF-LINQPVFLLIRRMAMD 772 (823)
Q Consensus 707 l~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~-D~~~a~~~~l~~~y~~ 772 (823)
...++. -++.....-.+-++.+.|+.++|+..+-.-++ +++...+..++..++.
T Consensus 273 ~~~~~~------------~p~~~~~~~la~~~~~~g~~~~A~~~l~~~l~~~P~~~~~~~l~~~~~~ 327 (389)
T PRK11788 273 RRALEE------------YPGADLLLALAQLLEEQEGPEAAQALLREQLRRHPSLRGFHRLLDYHLA 327 (389)
T ss_pred HHHHHh------------CCCchHHHHHHHHHHHhCCHHHHHHHHHHHHHhCcCHHHHHHHHHHhhh
Confidence 332222 22233335667789999999999999976553 3455555556666553
No 313
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=55.21 E-value=26 Score=33.03 Aligned_cols=57 Identities=23% Similarity=0.297 Sum_probs=30.5
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
++...+..+....+-..++.+....+. ..-........|..+|.+|+|++|...|..
T Consensus 17 ~~~~~~~~~~~~~~~~~~~~l~~~~~~--s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~ 73 (145)
T PF09976_consen 17 QALQALQAGDPAKAEAAAEQLAKDYPS--SPYAALAALQLAKAAYEQGDYDEAKAALEK 73 (145)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCC--ChHHHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 444455667766666655444211110 001123344457777777888887777765
No 314
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=54.93 E-value=1.6e+02 Score=33.78 Aligned_cols=127 Identities=12% Similarity=0.213 Sum_probs=77.0
Q ss_pred CcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCC----CCccccc---c---cccceeeeeecCCCCCCeeEEEE
Q 003405 17 PKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSP----PSDYQSL---R---KESYELERTISGFSKKPILSMEV 83 (823)
Q Consensus 17 ~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~----~~d~~~l---~---~~~~~l~~~~~~~~k~~I~qI~~ 83 (823)
.+++|+..- +..+.+.-.+|.++.|+..-......| ..++..+ . +..-..+... .++..+|++...
T Consensus 220 tsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~~k~~~~f~i~t~ksk~~rNPv~~w-~~~~g~in~f~F 298 (636)
T KOG2394|consen 220 SSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCGATAPSYQALKDGDQFAILTSKSKKTRNPVARW-HIGEGSINEFAF 298 (636)
T ss_pred cceEEEEEEeCCCceEEEEEecCceEEeeccccccCCCCcccccCCCCeeEEeeeeccccCCcccee-EeccccccceeE
Confidence 578887654 346777889999999986432221111 1111111 0 0000000011 234568999999
Q ss_pred ecccCceeeEeC-c-EEEEeCCCCcccccc-cCCCCcEEEEeeCCCceEEEEEcC-eEEEEEEcC
Q 003405 84 LASRQLLLSLSE-S-IAFHRLPNLETIAVL-TKAKGANVYSWDDRRGFLCFARQK-RVCIFRHDG 144 (823)
Q Consensus 84 ~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i-~~~kg~~~fa~~~~~~~l~V~~kk-ki~l~~~~~ 144 (823)
.+....|.+++. | +++|+..+.+..... .-.-|..++|++++...|+++... -+.+|.+..
T Consensus 299 S~DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDDLVtVwSf~e 363 (636)
T KOG2394|consen 299 SPDGKYLATVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSFEE 363 (636)
T ss_pred cCCCceEEEEecCceEEEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCcceEEEEEecc
Confidence 999999999986 6 999987665433221 122477889999998778888554 456777663
No 315
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=54.69 E-value=1.3e+02 Score=32.44 Aligned_cols=210 Identities=13% Similarity=0.229 Sum_probs=114.0
Q ss_pred cEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--
Q 003405 18 KIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-- 95 (823)
Q Consensus 18 ~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-- 95 (823)
-|=|+.-.++.++-|..|.+|.+|+.+. +.+.+...++ ...|..+..... +++++
T Consensus 199 gVYClQYDD~kiVSGlrDnTikiWD~n~------------------~~c~~~L~GH-tGSVLCLqyd~r----viisGSS 255 (499)
T KOG0281|consen 199 GVYCLQYDDEKIVSGLRDNTIKIWDKNS------------------LECLKILTGH-TGSVLCLQYDER----VIVSGSS 255 (499)
T ss_pred ceEEEEecchhhhcccccCceEEecccc------------------HHHHHhhhcC-CCcEEeeeccce----EEEecCC
Confidence 3667777778999999999999998542 2233333343 457888877654 34443
Q ss_pred -c-EEEEeCCCCcccccc-cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecCC--CCceEEEecCCe
Q 003405 96 -S-IAFHRLPNLETIAVL-TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGVP--DTVKSMSWCGEN 169 (823)
Q Consensus 96 -~-l~~~~L~~l~~~~~i-~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~~--~~~~~l~~~~~~ 169 (823)
. |.+|+..+-+++.++ ....++.-..+++ |.++-. ..+.+.++......... .+-+-.. -.+..+.|.+..
T Consensus 256 DsTvrvWDv~tge~l~tlihHceaVLhlrf~n--g~mvtcSkDrsiaVWdm~sps~it-~rrVLvGHrAaVNvVdfd~ky 332 (499)
T KOG0281|consen 256 DSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSN--GYMVTCSKDRSIAVWDMASPTDIT-LRRVLVGHRAAVNVVDFDDKY 332 (499)
T ss_pred CceEEEEeccCCchhhHHhhhcceeEEEEEeC--CEEEEecCCceeEEEeccCchHHH-HHHHHhhhhhheeeeccccce
Confidence 2 999999888776542 2333333333332 444333 45667777765321110 1111111 134455555555
Q ss_pred EEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-Ee-CCeE-EEEcCCCcccc--------CCceeecC
Q 003405 170 ICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GK-ENIG-VFVDQNGKLLQ--------ADRICWSE 237 (823)
Q Consensus 170 i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~-~~~g-vfv~~~G~~~~--------~~~i~w~~ 237 (823)
|+-|+. +...+.+++|+.....+.-.+ +.+.|.-=++.|++ +. |+.. +|--..|...| -.+|.|..
T Consensus 333 IVsASgDRTikvW~~st~efvRtl~gHk--RGIAClQYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHEeLvRciRFd~ 410 (499)
T KOG0281|consen 333 IVSASGDRTIKVWSTSTCEFVRTLNGHK--RGIACLQYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHEELVRCIRFDN 410 (499)
T ss_pred EEEecCCceEEEEeccceeeehhhhccc--ccceehhccCeEEEecCCCceEEEEeccccHHHHHHhchHHhhhheeecC
Confidence 554444 557778888876555444322 33444444567776 33 3332 33223443221 14566654
Q ss_pred CCcEEEEeCCEEEEEeCCeEEEEEcc
Q 003405 238 APIAVIIQKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 238 ~P~~v~~~~PYll~~~~~~ieV~~l~ 263 (823)
.. -+-+.+++.|.|.++.
T Consensus 411 kr--------IVSGaYDGkikvWdl~ 428 (499)
T KOG0281|consen 411 KR--------IVSGAYDGKIKVWDLQ 428 (499)
T ss_pred ce--------eeeccccceEEEEecc
Confidence 31 2224556788888874
No 316
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=53.95 E-value=20 Score=43.12 Aligned_cols=107 Identities=12% Similarity=0.184 Sum_probs=69.1
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL 93 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l 93 (823)
..--+|++.-+ ++|.||+-.|.|++|++......+ ++ +-+..+|+.|.--..+.++++.
T Consensus 1101 ~~~fTc~afs~~~~hL~vG~~~Geik~~nv~sG~~e~------------------s~-ncH~SavT~vePs~dgs~~Lts 1161 (1516)
T KOG1832|consen 1101 TALFTCIAFSGGTNHLAVGSHAGEIKIFNVSSGSMEE------------------SV-NCHQSAVTLVEPSVDGSTQLTS 1161 (1516)
T ss_pred ccceeeEEeecCCceEEeeeccceEEEEEccCccccc------------------cc-cccccccccccccCCcceeeee
Confidence 34567776554 799999999999999876543211 11 1236799999888888888888
Q ss_pred eC-c---EEEEeCCC-CcccccccCCCCcEEEEeeCCCceEEEEE-cCeEEEEEEcC
Q 003405 94 SE-S---IAFHRLPN-LETIAVLTKAKGANVYSWDDRRGFLCFAR-QKRVCIFRHDG 144 (823)
Q Consensus 94 ~d-~---l~~~~L~~-l~~~~~i~~~kg~~~fa~~~~~~~l~V~~-kkki~l~~~~~ 144 (823)
+- . ..+|++.+ +.+.+ ...+++++........-+|+. +++..+|...-
T Consensus 1162 ss~S~PlsaLW~~~s~~~~~H---sf~ed~~vkFsn~~q~r~~gt~~d~a~~YDvqT 1215 (1516)
T KOG1832|consen 1162 SSSSSPLSALWDASSTGGPRH---SFDEDKAVKFSNSLQFRALGTEADDALLYDVQT 1215 (1516)
T ss_pred ccccCchHHHhccccccCccc---cccccceeehhhhHHHHHhcccccceEEEeccc
Confidence 76 2 56788754 33333 345556655554443445663 45667887763
No 317
>PF12854 PPR_1: PPR repeat
Probab=53.65 E-value=19 Score=24.68 Aligned_cols=23 Identities=35% Similarity=0.593 Sum_probs=21.2
Q ss_pred HHHHHHHHHHhccHHHHHHHHHH
Q 003405 549 YTALLELYKSNARHREALKLLHE 571 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~ 571 (823)
|..|+.-|.+.|+.++|++++.+
T Consensus 10 y~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 10 YNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHh
Confidence 77899999999999999999875
No 318
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=53.53 E-value=8.4 Score=30.85 Aligned_cols=34 Identities=29% Similarity=0.459 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHhc-CCCHHHHH
Q 003405 340 SIHIRFAHYLFDTGSYEEAMEHFLAS-QVDITYAL 373 (823)
Q Consensus 340 ~i~~~~a~~lf~~~~f~~A~~~f~~~-~~dP~~vi 373 (823)
.+....|..++..++|++|+..|.++ ..||....
T Consensus 4 ~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~ 38 (69)
T PF13414_consen 4 EAWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAE 38 (69)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHH
Confidence 45677899999999999999999984 34554433
No 319
>PF10395 Utp8: Utp8 family; InterPro: IPR018843 Utp8 is an essential component of the nuclear tRNA export machinery in Saccharomyces cerevisiae (Baker's yeast). It is a tRNA binding protein that acts at a step between tRNA maturation/aminoacylation and translocation of the tRNA across the nuclear pore complex.
Probab=52.03 E-value=5e+02 Score=31.24 Aligned_cols=201 Identities=9% Similarity=0.098 Sum_probs=93.1
Q ss_pred CcEEEEEEeC-----CEEEEEeCCCcE-EEEcCCCCC-CCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCc
Q 003405 17 PKIDAVASYG-----LKILLGCSDGSL-KIYSPGSSE-SDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQL 89 (823)
Q Consensus 17 ~~I~ci~~~~-----~~L~vGT~~G~l-~~y~~~~~~-~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~ 89 (823)
.-|+|++++. +.-++|.+++.- +...+.... ..+....++. ....+ ..+ . +.+|..|++....+.
T Consensus 72 ~iV~~~dV~~~~~~~~~~~~glt~rKk~~ll~i~~~~~~~~~~~~~~e--~~~~~----~~k-l-~~kvv~Ik~~~~~~~ 143 (670)
T PF10395_consen 72 TIVECCDVLEKSDGKKLYCVGLTERKKFKLLLIERKVGSTEDGTVNSE--TTNEF----ELK-L-DDKVVGIKFSSDGKI 143 (670)
T ss_pred ceEEEEEeEecCCCcEEEEEEEeeCCeeEEEEEEccCccccccccCcc--ccceE----EEE-c-ccceEEEEEecCCCE
Confidence 3689999984 367889777763 333333321 0000000000 00111 112 2 578999999977777
Q ss_pred eeeEeC-c-EEEEeC--CCCcccccccCCCCcE-E---EEee----CCCceEEEE-E--cCeE--EEEEEcC-CCceeEe
Q 003405 90 LLSLSE-S-IAFHRL--PNLETIAVLTKAKGAN-V---YSWD----DRRGFLCFA-R--QKRV--CIFRHDG-GRGFVEV 151 (823)
Q Consensus 90 Ll~l~d-~-l~~~~L--~~l~~~~~i~~~kg~~-~---fa~~----~~~~~l~V~-~--kkki--~l~~~~~-~~~f~~~ 151 (823)
++++.+ | +.+|+. ..++.+..+...+..+ . |.-+ .....+++. . ++++ .+|.+.. ......+
T Consensus 144 I~vvl~nG~i~~~d~~~~~l~~~~~l~~~~~~~v~ys~fv~~~~~~~~~~~ll~v~~~~~~k~~ykL~~l~~~~~~~~El 223 (670)
T PF10395_consen 144 IYVVLENGSIQIYDFSENSLEKVPQLKLKSSINVSYSKFVNDFELENGKDLLLTVSQLSNSKLSYKLISLSNESSSIFEL 223 (670)
T ss_pred EEEEEcCCcEEEEeccccccccccccccccccceehhhhhcccccccCCceEEEEEEcCCCcEEEEEEEeccCCcceEEe
Confidence 766665 6 899987 2333322222222221 1 1111 112333333 2 3443 4566621 1112222
Q ss_pred ee--ecCCCCc-eEEEecCCeEEEEEcCceEEEEcCCCCe---eecc---CCCCCCCCEEEEccCCeEEEEeCCeEEEEc
Q 003405 152 KD--FGVPDTV-KSMSWCGENICIAIRKGYMILNATNGAL---SEVF---PSGRIGPPLVVSLLSGELLLGKENIGVFVD 222 (823)
Q Consensus 152 ke--i~~~~~~-~~l~~~~~~i~v~~~~~y~lidl~~~~~---~~L~---~~~~~~~p~i~~~~~~EfLL~~~~~gvfv~ 222 (823)
.. +.....- ..+++..+.++-=.++....+++.+-+. .++. .......-.+.++..+.+||+.++..+.+|
T Consensus 224 ~s~~~e~~~~~~s~f~Y~~G~LY~l~~~~i~~ysip~f~~~~tI~l~~ii~~~~~~~vSl~~~s~nRvLLs~~nkIyLld 303 (670)
T PF10395_consen 224 SSTILENFGLEDSKFCYQFGKLYQLSKKTISSYSIPNFQIQKTISLPSIIDKESDDLVSLKPPSPNRVLLSVNNKIYLLD 303 (670)
T ss_pred ehheeccCCcccceEEEeCCEEEEEeCCEEEEEEcCCceEEEEEEechhhccccccceEeecCCCCeEEEEcCCEEEEEe
Confidence 21 1111111 1233333333222556666666655432 2222 111111113455677899999999998888
Q ss_pred CCC
Q 003405 223 QNG 225 (823)
Q Consensus 223 ~~G 225 (823)
..=
T Consensus 304 ~~~ 306 (670)
T PF10395_consen 304 LKF 306 (670)
T ss_pred ehh
Confidence 533
No 320
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=51.77 E-value=4e+02 Score=31.37 Aligned_cols=127 Identities=13% Similarity=0.113 Sum_probs=70.2
Q ss_pred EEEEeCCCCc-----ccccccCCCCcEEEEeeC-CCceEEEEEc-CeEEEEEEcCCCc----eeEeeeecC-CCCceEEE
Q 003405 97 IAFHRLPNLE-----TIAVLTKAKGANVYSWDD-RRGFLCFARQ-KRVCIFRHDGGRG----FVEVKDFGV-PDTVKSMS 164 (823)
Q Consensus 97 l~~~~L~~l~-----~~~~i~~~kg~~~fa~~~-~~~~l~V~~k-kki~l~~~~~~~~----f~~~kei~~-~~~~~~l~ 164 (823)
|.+|.|..-- .+..+.....++.|++++ +..+++||.. ..|.|+++..+.. ...-+++.. .+.|.++.
T Consensus 605 iai~el~~PGrLPDgv~p~l~Ngt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slR 684 (1012)
T KOG1445|consen 605 IAIYELNEPGRLPDGVMPGLFNGTLVTDLHWDPFDDERLAVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLR 684 (1012)
T ss_pred EEEEEcCCCCCCCcccccccccCceeeecccCCCChHHeeecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEE
Confidence 7888885421 122333334566777765 3357999965 5677888863211 122233333 36888998
Q ss_pred ec---CCeEEEEEc-CceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEE--EEeCCeEEEEcC
Q 003405 165 WC---GENICIAIR-KGYMILNATNGALSEVFPSGRIGPPLVVSLLSGELL--LGKENIGVFVDQ 223 (823)
Q Consensus 165 ~~---~~~i~v~~~-~~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfL--L~~~~~gvfv~~ 223 (823)
|. .+.+.++.- ....+.|+.++.-..-+.....+-=-+.+.+++..+ +|.|.....++.
T Consensus 685 fHPLAadvLa~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~~rVy~P 749 (1012)
T KOG1445|consen 685 FHPLAADVLAVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGTLRVYEP 749 (1012)
T ss_pred ecchhhhHhhhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcceeeeecCceEEEeCC
Confidence 87 355555543 567888888764322222111111124455666544 467766666664
No 321
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=51.49 E-value=3.3e+02 Score=29.01 Aligned_cols=75 Identities=13% Similarity=0.164 Sum_probs=47.9
Q ss_pred CCcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Ccee
Q 003405 16 SPKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLL 91 (823)
Q Consensus 16 ~~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll 91 (823)
...+++++..+ +.|-+.+=|-+-.+|+++...++ .+..+. ..+.|.|-.|...... +++.
T Consensus 150 ~aPlTSFDWne~dp~~igtSSiDTTCTiWdie~~~~~---------------~vkTQL-IAHDKEV~DIaf~~~s~~~FA 213 (364)
T KOG0290|consen 150 CAPLTSFDWNEVDPNLIGTSSIDTTCTIWDIETGVSG---------------TVKTQL-IAHDKEVYDIAFLKGSRDVFA 213 (364)
T ss_pred CCcccccccccCCcceeEeecccCeEEEEEEeecccc---------------ceeeEE-EecCcceeEEEeccCccceEE
Confidence 45678887765 34444445677788888754221 111122 2347889999888743 6777
Q ss_pred eEeC-c-EEEEeCCCCc
Q 003405 92 SLSE-S-IAFHRLPNLE 106 (823)
Q Consensus 92 ~l~d-~-l~~~~L~~l~ 106 (823)
+++. | +++|+|..++
T Consensus 214 SvgaDGSvRmFDLR~le 230 (364)
T KOG0290|consen 214 SVGADGSVRMFDLRSLE 230 (364)
T ss_pred EecCCCcEEEEEecccc
Confidence 7764 6 9999998765
No 322
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=51.38 E-value=36 Score=38.77 Aligned_cols=65 Identities=15% Similarity=0.345 Sum_probs=47.1
Q ss_pred CcEEEEEEeC-CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC
Q 003405 17 PKIDAVASYG-LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 17 ~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
+.-+|+++.+ +.|+||+.+|.|..|+-.. .+. ...+.++ ..+|.+|.+-..+..+++-|+
T Consensus 431 ~nFsc~aTT~sG~IvvgS~~GdIRLYdri~---------------~~A---KTAlPgL-G~~I~hVdvtadGKwil~Tc~ 491 (644)
T KOG2395|consen 431 NNFSCFATTESGYIVVGSLKGDIRLYDRIG---------------RRA---KTALPGL-GDAIKHVDVTADGKWILATCK 491 (644)
T ss_pred cccceeeecCCceEEEeecCCcEEeehhhh---------------hhh---hhccccc-CCceeeEEeeccCcEEEEecc
Confidence 4667887776 6999999999999997311 010 1123445 479999999999999999999
Q ss_pred c-EEEE
Q 003405 96 S-IAFH 100 (823)
Q Consensus 96 ~-l~~~ 100 (823)
. +.+.
T Consensus 492 tyLlLi 497 (644)
T KOG2395|consen 492 TYLLLI 497 (644)
T ss_pred cEEEEE
Confidence 5 4443
No 323
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=51.26 E-value=3.1e+02 Score=28.61 Aligned_cols=136 Identities=7% Similarity=0.036 Sum_probs=75.1
Q ss_pred CCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecC--CeEEEEEcCceEEEEcCCCCeeec-----cCCC--
Q 003405 125 RRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCG--ENICIAIRKGYMILNATNGALSEV-----FPSG-- 195 (823)
Q Consensus 125 ~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~--~~i~v~~~~~y~lidl~~~~~~~L-----~~~~-- 195 (823)
....++||+..++.+|.+........+ ... ..|+.|.... +.+++-..+...++++.+-..... .+.+
T Consensus 6 ~~~~L~vGt~~Gl~~~~~~~~~~~~~i--~~~-~~I~ql~vl~~~~~llvLsd~~l~~~~L~~l~~~~~~~~~~~~~~~~ 82 (275)
T PF00780_consen 6 WGDRLLVGTEDGLYVYDLSDPSKPTRI--LKL-SSITQLSVLPELNLLLVLSDGQLYVYDLDSLEPVSTSAPLAFPKSRS 82 (275)
T ss_pred CCCEEEEEECCCEEEEEecCCccceeE--eec-ceEEEEEEecccCEEEEEcCCccEEEEchhhcccccccccccccccc
Confidence 345799999999988888422222222 111 2377777764 344444446677788765322111 0000
Q ss_pred -----CCCCCEE--E--Ec-cCCeEEE-EeCCeEEEEcCCC---cc-ccCCceeecCCCcEEEEeCCEEEEEeCCeEEEE
Q 003405 196 -----RIGPPLV--V--SL-LSGELLL-GKENIGVFVDQNG---KL-LQADRICWSEAPIAVIIQKPYAIALLPRRVEVR 260 (823)
Q Consensus 196 -----~~~~p~i--~--~~-~~~EfLL-~~~~~gvfv~~~G---~~-~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~ 260 (823)
...+.+. + .. ....+|+ +......++...+ .. .....+..++.|..+.+....+++...+..++.
T Consensus 83 ~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~lp~~~~~i~~~~~~i~v~~~~~f~~i 162 (275)
T PF00780_consen 83 LPTKLPETKGVSFFAVNGGHEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISLPDPPSSIAFLGNKICVGTSKGFYLI 162 (275)
T ss_pred ccccccccCCeeEEeeccccccceEEEEEECCEEEEEEEECCcccccceeEEEEcCCCcEEEEEeCCEEEEEeCCceEEE
Confidence 0011221 1 11 2233444 3333333333222 11 123568888999999999999999999999999
Q ss_pred Ecc
Q 003405 261 SLR 263 (823)
Q Consensus 261 ~l~ 263 (823)
++.
T Consensus 163 dl~ 165 (275)
T PF00780_consen 163 DLN 165 (275)
T ss_pred ecC
Confidence 885
No 324
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=51.18 E-value=26 Score=23.15 Aligned_cols=27 Identities=33% Similarity=0.551 Sum_probs=23.7
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhc
Q 003405 549 YTALLELYKSNARHREALKLLHELVEE 575 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~ 575 (823)
|..++.-|.+.|++++|++++.+....
T Consensus 3 ~n~li~~~~~~~~~~~a~~~~~~M~~~ 29 (35)
T TIGR00756 3 YNTLIDGLCKAGRVEEALELFKEMLER 29 (35)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHc
Confidence 667889999999999999999988644
No 325
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=50.83 E-value=1e+02 Score=36.06 Aligned_cols=118 Identities=15% Similarity=0.199 Sum_probs=66.2
Q ss_pred EEEEeCC-EEEEEeCCCcEEEEcCCCCCCCCC-------------CCCcccccccccceeeeeecCCCCCCeeE---EEE
Q 003405 21 AVASYGL-KILLGCSDGSLKIYSPGSSESDRS-------------PPSDYQSLRKESYELERTISGFSKKPILS---MEV 83 (823)
Q Consensus 21 ci~~~~~-~L~vGT~~G~l~~y~~~~~~~~~~-------------~~~d~~~l~~~~~~l~~~~~~~~k~~I~q---I~~ 83 (823)
|+...+. -++.|..||.+.+|++..+..+.- .|.-.+.+.++. +.-++ ....|.+ ...
T Consensus 151 cf~~~n~~vF~tGgRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~----~k~kA-~s~ti~ssvTvv~ 225 (720)
T KOG0321|consen 151 CFMPTNPAVFCTGGRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRI----RKWKA-ASNTIFSSVTVVL 225 (720)
T ss_pred hhccCCCcceeeccCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhccc----ccccc-ccCceeeeeEEEE
Confidence 3334443 457799999999999887653210 010011111111 01111 1345666 666
Q ss_pred ecccCceeeEeC--c-EEEEeCCCCc------ccc------cccCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEc
Q 003405 84 LASRQLLLSLSE--S-IAFHRLPNLE------TIA------VLTKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHD 143 (823)
Q Consensus 84 ~~~~~~Ll~l~d--~-l~~~~L~~l~------~~~------~i~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~ 143 (823)
+.+.+.|++.+. + |+||||.+.. +.. .-.+..|.+.+++|....++++. ..+.|..|-..
T Consensus 226 fkDe~tlaSaga~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~sIy~ynm~ 301 (720)
T KOG0321|consen 226 FKDESTLASAGAADSTIKVWDLRKNYTAYRQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNSIYFYNMR 301 (720)
T ss_pred EeccceeeeccCCCcceEEEeecccccccccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecCCcEEEEecc
Confidence 677777887765 3 9999996532 111 11223478888888766677665 55666666544
No 326
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=50.42 E-value=3.2e+02 Score=28.57 Aligned_cols=153 Identities=11% Similarity=0.148 Sum_probs=73.1
Q ss_pred cEEEEEEeC-CEEEEEeC-CCcEEEEcCCCCCCCCCCCCcccccccccceeee-eecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG-LKILLGCS-DGSLKIYSPGSSESDRSPPSDYQSLRKESYELER-TISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~-~~L~vGT~-~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~-~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..+.|+.-+ +.++|..+ +|.|+.+.+...+... ....+.-.. .+...+.+.++-|...+..+.|++..
T Consensus 66 D~EgI~y~g~~~~vl~~Er~~~L~~~~~~~~~~~~---------~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~k 136 (248)
T PF06977_consen 66 DYEGITYLGNGRYVLSEERDQRLYIFTIDDDTTSL---------DRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAK 136 (248)
T ss_dssp SEEEEEE-STTEEEEEETTTTEEEEEEE----TT-----------EEEEEEEE---S---SS--EEEEEETTTTEEEEEE
T ss_pred CceeEEEECCCEEEEEEcCCCcEEEEEEecccccc---------chhhceEEecccccCCCcceEEEEEcCCCCEEEEEe
Confidence 456666666 45556553 8889888875433210 001110000 11112346799999999999999988
Q ss_pred Cc--EEEEeCCC----Cc-------ccc-cccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCC---
Q 003405 95 ES--IAFHRLPN----LE-------TIA-VLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVP--- 157 (823)
Q Consensus 95 d~--l~~~~L~~----l~-------~~~-~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~--- 157 (823)
+. ..+|.+.. .. ... .....+..+.+++++..+.+.|-....=.|.+++....+.. .+.+.
T Consensus 137 E~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es~~l~~~d~~G~~~~--~~~L~~g~ 214 (248)
T PF06977_consen 137 ERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDESRLLLELDRQGRVVS--SLSLDRGF 214 (248)
T ss_dssp ESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETTTTEEEEE-TT--EEE--EEE-STTG
T ss_pred CCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECCCCeEEEECCCCCEEE--EEEeCCcc
Confidence 83 44444322 11 001 12245567888899988887776554333455543212222 22222
Q ss_pred -------CCceEEEecC-CeEEEEEc-CceEEE
Q 003405 158 -------DTVKSMSWCG-ENICIAIR-KGYMIL 181 (823)
Q Consensus 158 -------~~~~~l~~~~-~~i~v~~~-~~y~li 181 (823)
..|-+|++.. +.|++++. +.|+.+
T Consensus 215 ~gl~~~~~QpEGIa~d~~G~LYIvsEpNlfy~f 247 (248)
T PF06977_consen 215 HGLSKDIPQPEGIAFDPDGNLYIVSEPNLFYRF 247 (248)
T ss_dssp GG-SS---SEEEEEE-TT--EEEEETTTEEEEE
T ss_pred cCcccccCCccEEEECCCCCEEEEcCCceEEEe
Confidence 1577999984 46777766 555543
No 327
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=49.72 E-value=35 Score=30.08 Aligned_cols=65 Identities=15% Similarity=0.181 Sum_probs=45.3
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
....+.+.|++++|+..++......+. ......+....|..++..+++++|..++.+ ++..+|+-
T Consensus 45 l~~~~~~~~~~~~A~~~~~~~~~~~p~--~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~-------~~~~~p~~ 109 (119)
T TIGR02795 45 LGEAYYAQGKYADAAKAFLAVVKKYPK--SPKAPDALLKLGMSLQELGDKEKAKATLQQ-------VIKRYPGS 109 (119)
T ss_pred HHHHHHhhccHHHHHHHHHHHHHHCCC--CCcccHHHHHHHHHHHHhCChHHHHHHHHH-------HHHHCcCC
Confidence 455678899999999999875211110 011235677788999999999999999875 45566554
No 328
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=49.22 E-value=27 Score=38.73 Aligned_cols=21 Identities=19% Similarity=0.200 Sum_probs=11.8
Q ss_pred HHHHHHHhcCCHHHHHHHhhh
Q 003405 305 AQIVQLTASGDFEEALALCKL 325 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~ 325 (823)
.+...++..|+|++|+.+++.
T Consensus 7 ~~a~~a~~~~~~~~Ai~~~~~ 27 (356)
T PLN03088 7 DKAKEAFVDDDFALAVDLYTQ 27 (356)
T ss_pred HHHHHHHHcCCHHHHHHHHHH
Confidence 445555556666666655544
No 329
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=49.21 E-value=80 Score=27.30 Aligned_cols=62 Identities=24% Similarity=0.292 Sum_probs=40.3
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhh
Q 003405 549 YTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVL 617 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll 617 (823)
...|+..|...|+++.|++.+.++...+.... .+. .-...++++.-|+..+ +++.+|=+.|.
T Consensus 25 r~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~--~~~----ar~~ll~~f~~lg~~~-plv~~~RRkL~ 86 (90)
T PF14561_consen 25 RYALADALLAAGDYEEALDQLLELVRRDRDYE--DDA----ARKRLLDIFELLGPGD-PLVSEYRRKLA 86 (90)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHCC-TTCC--CCH----HHHHHHHHHHHH-TT--HHHHHHHHHHH
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHhCcccc--ccH----HHHHHHHHHHHcCCCC-hHHHHHHHHHH
Confidence 34789999999999999999999986553211 110 1246678888777655 47777766553
No 330
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=49.15 E-value=48 Score=24.88 Aligned_cols=36 Identities=25% Similarity=0.625 Sum_probs=28.2
Q ss_pred eeEeeeecCCCCceEEEec--CCeEEEEEcCc-eEEEEc
Q 003405 148 FVEVKDFGVPDTVKSMSWC--GENICIAIRKG-YMILNA 183 (823)
Q Consensus 148 f~~~kei~~~~~~~~l~~~--~~~i~v~~~~~-y~lidl 183 (823)
|+.+.|-.++.++++++|+ .+.|.+|+.++ ..++.+
T Consensus 2 f~~~~~k~l~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl 40 (47)
T PF12894_consen 2 FRQLGEKNLPSRVSCMSWCPTMDLIALGTEDGEVLVYRL 40 (47)
T ss_pred cceecccCCCCcEEEEEECCCCCEEEEEECCCeEEEEEC
Confidence 5556677788999999999 46799999855 666666
No 331
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=49.14 E-value=1.9e+02 Score=30.94 Aligned_cols=78 Identities=13% Similarity=0.255 Sum_probs=52.8
Q ss_pred cccccCCCCcEEEEE-EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc
Q 003405 9 LELISNCSPKIDAVA-SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR 87 (823)
Q Consensus 9 ~~l~~~~~~~I~ci~-~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~ 87 (823)
+.+.+.-...++++. .-+++++-|+.|-++.+|++....++ . .++. ...+++.|.+....
T Consensus 350 V~VFQGHtdtVTS~vF~~dd~vVSgSDDrTvKvWdLrNMRsp--------------l---ATIR--tdS~~NRvavs~g~ 410 (481)
T KOG0300|consen 350 VAVFQGHTDTVTSVVFNTDDRVVSGSDDRTVKVWDLRNMRSP--------------L---ATIR--TDSPANRVAVSKGH 410 (481)
T ss_pred eeeecccccceeEEEEecCCceeecCCCceEEEeeeccccCc--------------c---eeee--cCCccceeEeecCC
Confidence 333333344666544 44689999999999999998865432 1 1211 25789999888776
Q ss_pred CceeeEeCc--EEEEeCCCC
Q 003405 88 QLLLSLSES--IAFHRLPNL 105 (823)
Q Consensus 88 ~~Ll~l~d~--l~~~~L~~l 105 (823)
.++.+=-|+ |.+|+|..-
T Consensus 411 ~iIAiPhDNRqvRlfDlnG~ 430 (481)
T KOG0300|consen 411 PIIAIPHDNRQVRLFDLNGN 430 (481)
T ss_pred ceEEeccCCceEEEEecCCC
Confidence 666666674 999999754
No 332
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=49.10 E-value=3.6e+02 Score=28.84 Aligned_cols=153 Identities=16% Similarity=0.170 Sum_probs=93.7
Q ss_pred EEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCcEEEE
Q 003405 21 AVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSESIAFH 100 (823)
Q Consensus 21 ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~l~~~ 100 (823)
-++.-|++-|+..-+|-|.+.++....++ .++.+. ++ ...+....+.++...+++..+++.+.
T Consensus 176 ~v~ISGn~AYvA~~d~GL~ivDVSnp~sP---------------vli~~~-n~-g~g~~sv~vsdnr~y~vvy~egvliv 238 (370)
T COG5276 176 DVAISGNYAYVAWRDGGLTIVDVSNPHSP---------------VLIGSY-NT-GPGTYSVSVSDNRAYLVVYDEGVLIV 238 (370)
T ss_pred eEEEecCeEEEEEeCCCeEEEEccCCCCC---------------eEEEEE-ec-CCceEEEEecCCeeEEEEcccceEEE
Confidence 35556789999999998999887754331 233221 22 22577777888877788777788887
Q ss_pred eCCCCcccccc-----cCCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcC-CCceeEeeeecCCC-CceEEEecCCeEEE
Q 003405 101 RLPNLETIAVL-----TKAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDG-GRGFVEVKDFGVPD-TVKSMSWCGENICI 172 (823)
Q Consensus 101 ~L~~l~~~~~i-----~~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~-~~~f~~~kei~~~~-~~~~l~~~~~~i~v 172 (823)
+-.+.+.+..+ ...-+++.|.+..+ ...|+ ..+.+.+..+.+ ...|. .-.+.+++ ..+++...|+.+|+
T Consensus 239 d~s~~ssp~~~gsyet~~p~~~s~v~Vs~~--~~Yvadga~gl~~idisnp~spfl-~ss~~t~g~~a~gi~ay~~y~yi 315 (370)
T COG5276 239 DVSGPSSPTVFGSYETSNPVSISTVPVSGE--YAYVADGAKGLPIIDISNPPSPFL-SSSLDTAGYQAAGIRAYGNYNYI 315 (370)
T ss_pred ecCCCCCceEeeccccCCcccccceecccc--eeeeeccccCceeEeccCCCCCch-hccccCCCccccceEEecCeeEe
Confidence 77655432211 11123344555543 23344 234455555553 11231 12345555 78888888999999
Q ss_pred EEcCceEEEEcCCCCeeeccC
Q 003405 173 AIRKGYMILNATNGALSEVFP 193 (823)
Q Consensus 173 ~~~~~y~lidl~~~~~~~L~~ 193 (823)
+.++...+++....+...+.+
T Consensus 316 adkn~g~vV~~s~~s~m~~~~ 336 (370)
T COG5276 316 ADKNTGAVVDASPPSMMDKRP 336 (370)
T ss_pred ccCCceEEEeCCChhhccccc
Confidence 999988999988766555433
No 333
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=48.99 E-value=5.1e+02 Score=30.55 Aligned_cols=77 Identities=12% Similarity=0.180 Sum_probs=48.9
Q ss_pred CCcEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-C--c
Q 003405 16 SPKIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-Q--L 89 (823)
Q Consensus 16 ~~~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~--~ 89 (823)
+..|+|++-+- +.++.|+-+|.|..|++....... ...+. ....-++.++.++.=+... + +
T Consensus 242 ~s~v~~~~f~p~~p~ll~gG~y~GqV~lWD~~~~~~~~------------~s~ls-~~~~sh~~~v~~vvW~~~~~~~~f 308 (555)
T KOG1587|consen 242 PSEVTCLKFCPFDPNLLAGGCYNGQVVLWDLRKGSDTP------------PSGLS-ALEVSHSEPVTAVVWLQNEHNTEF 308 (555)
T ss_pred CCceeEEEeccCCcceEEeeccCceEEEEEccCCCCCC------------Ccccc-cccccCCcCeEEEEEeccCCCCce
Confidence 35677777653 688999999999999988654310 11111 1112236688887666532 3 6
Q ss_pred eeeEeCc-EEEEeCCCC
Q 003405 90 LLSLSES-IAFHRLPNL 105 (823)
Q Consensus 90 Ll~l~d~-l~~~~L~~l 105 (823)
+-+.+|| |..|++..+
T Consensus 309 ~s~ssDG~i~~W~~~~l 325 (555)
T KOG1587|consen 309 FSLSSDGSICSWDTDML 325 (555)
T ss_pred EEEecCCcEeeeecccc
Confidence 6666786 999977654
No 334
>PF03002 Somatostatin: Somatostatin/Cortistatin family; InterPro: IPR018142 Somatostatin inhibits the release of the pituitary growth hormone, somatotropin and inhibits the release of glucagon and insulin from the pancreas of fasted animals. Cortistatin is a cortical neuropeptide with neuronal depressant and sleep-modulating properties.; GO: 0005179 hormone activity, 0005576 extracellular region
Probab=48.78 E-value=7.7 Score=22.43 Aligned_cols=13 Identities=31% Similarity=0.856 Sum_probs=11.3
Q ss_pred hhCCCccceeeee
Q 003405 804 RLMPSRSYIWKGF 816 (823)
Q Consensus 804 ~~~~~~~~~~~~~ 816 (823)
|-+|-+.|.||+|
T Consensus 3 ~k~~CknffWK~~ 15 (18)
T PF03002_consen 3 RKAGCKNFFWKTF 15 (18)
T ss_pred ccccccceeeccc
Confidence 5678899999998
No 335
>PRK01742 tolB translocation protein TolB; Provisional
Probab=48.47 E-value=4.5e+02 Score=29.74 Aligned_cols=143 Identities=17% Similarity=0.166 Sum_probs=76.2
Q ss_pred CCCeeEEEEecccCceeeEeC--c---EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEE--cCeEEEEEEcC-C
Q 003405 75 KKPILSMEVLASRQLLLSLSE--S---IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFAR--QKRVCIFRHDG-G 145 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d--~---l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~--kkki~l~~~~~-~ 145 (823)
+.+|.....-|+.+.++..+. + |.+|++.+-+.. .+...++ ....+++++...|+++. .+...||.++. +
T Consensus 203 ~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~-~l~~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~ 281 (429)
T PRK01742 203 SQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARK-VVASFRGHNGAPAFSPDGSRLAFASSKDGVLNIYVMGANG 281 (429)
T ss_pred CCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceE-EEecCCCccCceeECCCCCEEEEEEecCCcEEEEEEECCC
Confidence 457888889999888887763 2 888888653321 1222232 33567777766777763 44567777753 2
Q ss_pred CceeEeeeecCCCCceEEEec--CCeEEEEEc--CceEEEE--cCCCCeeeccCCCCCCCCEEEEccCCeEEE-EeCCeE
Q 003405 146 RGFVEVKDFGVPDTVKSMSWC--GENICIAIR--KGYMILN--ATNGALSEVFPSGRIGPPLVVSLLSGELLL-GKENIG 218 (823)
Q Consensus 146 ~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~--~~y~lid--l~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~~~~g 218 (823)
.....+. .-+..+.+..|. |..|+++.. ....+++ ..++....+ .... . .....+++..++ +..+..
T Consensus 282 ~~~~~lt--~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~~~~-~--~~~~SpDG~~ia~~~~~~i 355 (429)
T PRK01742 282 GTPSQLT--SGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASGGGASLV-GGRG-Y--SAQISADGKTLVMINGDNV 355 (429)
T ss_pred CCeEeec--cCCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCCCCeEEe-cCCC-C--CccCCCCCCEEEEEcCCCE
Confidence 2222221 123456677887 456776654 2344444 444444333 2211 1 123346666665 333333
Q ss_pred EEEcCC
Q 003405 219 VFVDQN 224 (823)
Q Consensus 219 vfv~~~ 224 (823)
+.+|..
T Consensus 356 ~~~Dl~ 361 (429)
T PRK01742 356 VKQDLT 361 (429)
T ss_pred EEEECC
Confidence 335643
No 336
>KOG2076 consensus RNA polymerase III transcription factor TFIIIC [Transcription]
Probab=48.44 E-value=2.2e+02 Score=34.88 Aligned_cols=79 Identities=16% Similarity=0.100 Sum_probs=48.0
Q ss_pred CchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccC
Q 003405 646 SPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRL 725 (823)
Q Consensus 646 ~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~ 725 (823)
.++.++.||--++.. ++.....+.-.++..|.+-= .+..-+.| -++||..-
T Consensus 429 ~~~~Al~~l~~i~~~-~~~~~~~vw~~~a~c~~~l~--------------------e~e~A~e~--------y~kvl~~~ 479 (895)
T KOG2076|consen 429 KYKEALRLLSPITNR-EGYQNAFVWYKLARCYMELG--------------------EYEEAIEF--------YEKVLILA 479 (895)
T ss_pred cHHHHHHHHHHHhcC-ccccchhhhHHHHHHHHHHh--------------------hHHHHHHH--------HHHHHhcC
Confidence 466788888888753 33344556667777777531 01111111 13455555
Q ss_pred CCC-chhhHHHHHhhccccHHHHHHHHHH
Q 003405 726 PAD-ALYEERAILLGKMNQHELALSLYVH 753 (823)
Q Consensus 726 ~~~-~l~~e~~~Ll~klg~h~~AL~ilv~ 753 (823)
|.+ +-.--.+-|+-++|+|++|++.+-.
T Consensus 480 p~~~D~Ri~Lasl~~~~g~~EkalEtL~~ 508 (895)
T KOG2076|consen 480 PDNLDARITLASLYQQLGNHEKALETLEQ 508 (895)
T ss_pred CCchhhhhhHHHHHHhcCCHHHHHHHHhc
Confidence 554 3334556688899999999998875
No 337
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p.; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=48.32 E-value=1.3e+02 Score=27.87 Aligned_cols=102 Identities=19% Similarity=0.111 Sum_probs=63.4
Q ss_pred CChhhHHHhhhhhhhcCccccccccccCCCChHHHHHHHhhcCchhHHHHHHHHhhcccCCCChhHHHH--HHHHHHHHH
Q 003405 604 TDPMLVLEFSMLVLESCPTQTIELFLSGNIPADLVNSYLKQYSPSMQGRYLELMLAMNENSISGNLQNE--MVQIYLSEV 681 (823)
Q Consensus 604 ~~~~li~~y~~wll~~~p~~~~~if~~~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~--L~~lYl~~i 681 (823)
+-++...+|..|+.+..|..+. ..-+..-||.++.... .++.++|. .+.+.+.+.
T Consensus 20 DPL~~w~~yI~w~~~~~p~~~~---------------------~~~L~~lLer~~~~f~--~~~~Y~nD~RylkiWi~ya 76 (126)
T PF08311_consen 20 DPLDPWLRYIKWIEENYPSGGK---------------------QSGLLELLERCIRKFK--DDERYKNDERYLKIWIKYA 76 (126)
T ss_dssp -CHHHHHHHHHHHHHHCTTCCC---------------------CHHHHHHHHHHHHHHT--TSGGGTT-HHHHHHHHHHH
T ss_pred CChHHHHHHHHHHHHHCCCCCc---------------------hhHHHHHHHHHHHHHh--hhHhhcCCHHHHHHHHHHH
Confidence 3467788999999998887222 1223445555554321 23445542 444555443
Q ss_pred HHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCC-chhhHHHHHhhccccHHHHHHHHHH
Q 003405 682 LDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPAD-ALYEERAILLGKMNQHELALSLYVH 753 (823)
Q Consensus 682 ~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~-~l~~e~~~Ll~klg~h~~AL~ilv~ 753 (823)
.- ......+..||..+. +.... .|+++-|.++.+.|++++|-.|+-.
T Consensus 77 ~~----------------~~~~~~if~~l~~~~---------IG~~~A~fY~~wA~~le~~~~~~~A~~I~~~ 124 (126)
T PF08311_consen 77 DL----------------SSDPREIFKFLYSKG---------IGTKLALFYEEWAEFLEKRGNFKKADEIYQL 124 (126)
T ss_dssp TT----------------BSHHHHHHHHHHHHT---------TSTTBHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HH----------------ccCHHHHHHHHHHcC---------ccHHHHHHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 10 114677888888764 44444 5889999999999999999999864
No 338
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=48.21 E-value=83 Score=35.31 Aligned_cols=95 Identities=9% Similarity=0.190 Sum_probs=59.6
Q ss_pred CCeeEEEEecccCceeeEeC--c-EEEEeCCCCcccccc-cCCCCcEEEEeeCCCceEEE-EEcCeEEEEEEcCCCceeE
Q 003405 76 KPILSMEVLASRQLLLSLSE--S-IAFHRLPNLETIAVL-TKAKGANVYSWDDRRGFLCF-ARQKRVCIFRHDGGRGFVE 150 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d--~-l~~~~L~~l~~~~~i-~~~kg~~~fa~~~~~~~l~V-~~kkki~l~~~~~~~~f~~ 150 (823)
.++.-|..=|. |-++.++. | |++|.-..-+|..++ .-.-+|+++|++++...++- |..+++.||.+. .|..
T Consensus 252 G~~~vm~qNP~-NaVih~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR---~~~q 327 (545)
T KOG1272|consen 252 GRTDVMKQNPY-NAVIHLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLR---NFYQ 327 (545)
T ss_pred CccchhhcCCc-cceEEEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeecccccceeEeeec---cccc
Confidence 34554444443 55566655 4 999976555555442 33447889999987655554 488899999888 4555
Q ss_pred eeeecCCCCceEEEecCC-eEEEEE
Q 003405 151 VKDFGVPDTVKSMSWCGE-NICIAI 174 (823)
Q Consensus 151 ~kei~~~~~~~~l~~~~~-~i~v~~ 174 (823)
+..+..|.+...+++... .+.+|.
T Consensus 328 l~t~~tp~~a~~ls~SqkglLA~~~ 352 (545)
T KOG1272|consen 328 LHTYRTPHPASNLSLSQKGLLALSY 352 (545)
T ss_pred cceeecCCCccccccccccceeeec
Confidence 555555777777776533 344444
No 339
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=48.19 E-value=1.6e+02 Score=37.17 Aligned_cols=38 Identities=16% Similarity=0.135 Sum_probs=32.1
Q ss_pred HHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhhcc
Q 003405 539 CEEILQKKNHYTALLELYKSNARHREALKLLHELVEES 576 (823)
Q Consensus 539 ~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~ 576 (823)
-.+.|++.++|++.+.+|.+.|++++||+-|....++.
T Consensus 945 ya~hL~~~~~~~~Aal~Ye~~GklekAl~a~~~~~dWr 982 (1265)
T KOG1920|consen 945 YADHLREELMSDEAALMYERCGKLEKALKAYKECGDWR 982 (1265)
T ss_pred HHHHHHHhccccHHHHHHHHhccHHHHHHHHHHhccHH
Confidence 34666778899999999999999999999999877654
No 340
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=47.15 E-value=2.5e+02 Score=26.36 Aligned_cols=61 Identities=8% Similarity=0.137 Sum_probs=42.0
Q ss_pred CceEEE-EEcCeEEEEEEcCCC-----ceeEeeeecCCCCceEEEec-------CCeEEEEEcCceEEEEcCCC
Q 003405 126 RGFLCF-ARQKRVCIFRHDGGR-----GFVEVKDFGVPDTVKSMSWC-------GENICIAIRKGYMILNATNG 186 (823)
Q Consensus 126 ~~~l~V-~~kkki~l~~~~~~~-----~f~~~kei~~~~~~~~l~~~-------~~~i~v~~~~~y~lidl~~~ 186 (823)
.++|++ ...+||.|+.-.... .-..++.+.+...|++|+-- .+.|++|+.+....||+.+.
T Consensus 10 ~pcL~~aT~~gKV~IH~ph~~~~~~~~~~~~i~~LNin~~italaaG~l~~~~~~D~LliGt~t~llaYDV~~N 83 (136)
T PF14781_consen 10 HPCLACATTGGKVFIHNPHERGQRTGRQDSDISFLNINQEITALAAGRLKPDDGRDCLLIGTQTSLLAYDVENN 83 (136)
T ss_pred ceeEEEEecCCEEEEECCCccccccccccCceeEEECCCceEEEEEEecCCCCCcCEEEEeccceEEEEEcccC
Confidence 345544 477899888643211 11234557788899988542 46899999999999999864
No 341
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=46.60 E-value=2.5e+02 Score=32.49 Aligned_cols=119 Identities=14% Similarity=0.171 Sum_probs=67.3
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeCc--EEEEeCCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSES--IAFHRLPN 104 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d~--l~~~~L~~ 104 (823)
.-+++||..|.|..|++.+.. .++.+.. +-+..+|+.+.--.+...+-++++. +..|...+
T Consensus 71 ~~lvlgt~~g~v~~ys~~~g~--------------it~~~st---~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~ 133 (541)
T KOG4547|consen 71 SMLVLGTPQGSVLLYSVAGGE--------------ITAKLST---DKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKE 133 (541)
T ss_pred eEEEeecCCccEEEEEecCCe--------------EEEEEec---CCCCCcceeeecccccCceEecCCceeEEEEeccc
Confidence 478999999999999976532 2232221 1234578877666666777777763 55555433
Q ss_pred Cccccc-ccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeee-cCCCCceEEEec
Q 003405 105 LETIAV-LTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDF-GVPDTVKSMSWC 166 (823)
Q Consensus 105 l~~~~~-i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei-~~~~~~~~l~~~ 166 (823)
...+.. -...+.+...|+.++.+.+.+| -+.|.+|-....+ .+.-| -.+++++++.|.
T Consensus 134 ~~~~~~~~~~~~~~~sl~is~D~~~l~~a-s~~ik~~~~~~ke---vv~~ftgh~s~v~t~~f~ 193 (541)
T KOG4547|consen 134 KVIIRIWKEQKPLVSSLCISPDGKILLTA-SRQIKVLDIETKE---VVITFTGHGSPVRTLSFT 193 (541)
T ss_pred ceeeeeeccCCCccceEEEcCCCCEEEec-cceEEEEEccCce---EEEEecCCCcceEEEEEE
Confidence 321111 0223455667777774444444 5667677766321 22222 235677777764
No 342
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=46.36 E-value=32 Score=22.57 Aligned_cols=24 Identities=21% Similarity=0.328 Sum_probs=21.2
Q ss_pred HHHHHHHHhccHHHHHHHHHHHhh
Q 003405 551 ALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 551 ~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
.++..|...|++++|++++.++.+
T Consensus 5 ~~a~~~~~~g~~~~A~~~~~~~~~ 28 (33)
T PF13174_consen 5 RLARCYYKLGDYDEAIEYFQRLIK 28 (33)
T ss_dssp HHHHHHHHHCHHHHHHHHHHHHHH
T ss_pred HHHHHHHHccCHHHHHHHHHHHHH
Confidence 578889999999999999998864
No 343
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=46.13 E-value=21 Score=32.73 Aligned_cols=55 Identities=25% Similarity=0.349 Sum_probs=40.6
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
....+++.|++++|+..++.....++ .........|..++..++|++|...|.++
T Consensus 23 ~a~~~~~~~~~~~A~~~~~~~~~~~p-----~~~~~~~~la~~~~~~~~~~~A~~~~~~~ 77 (135)
T TIGR02552 23 LAYNLYQQGRYDEALKLFQLLAAYDP-----YNSRYWLGLAACCQMLKEYEEAIDAYALA 77 (135)
T ss_pred HHHHHHHcccHHHHHHHHHHHHHhCC-----CcHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566789999999999876521111 22355667799999999999999998874
No 344
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=46.09 E-value=6.6e+02 Score=30.98 Aligned_cols=55 Identities=5% Similarity=-0.053 Sum_probs=40.2
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
+-+.-..-.|++++|+.++..+...+ ......+...|..+...+++++|...|.+
T Consensus 20 d~~~ia~~~g~~~~A~~~~~~~~~~~-----~~~a~~~~~lA~~~~~~g~~~~A~~~~~~ 74 (765)
T PRK10049 20 DWLQIALWAGQDAEVITVYNRYRVHM-----QLPARGYAAVAVAYRNLKQWQNSLTLWQK 74 (765)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhC-----CCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 33444456899999999987753111 12334577788999999999999999987
No 345
>PF13041 PPR_2: PPR repeat family
Probab=46.01 E-value=32 Score=25.68 Aligned_cols=28 Identities=32% Similarity=0.552 Sum_probs=25.0
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcc
Q 003405 549 YTALLELYKSNARHREALKLLHELVEES 576 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~ 576 (823)
|..++.-|.+.|++++|++++.+.....
T Consensus 6 yn~li~~~~~~~~~~~a~~l~~~M~~~g 33 (50)
T PF13041_consen 6 YNTLISGYCKAGKFEEALKLFKEMKKRG 33 (50)
T ss_pred HHHHHHHHHHCcCHHHHHHHHHHHHHcC
Confidence 7789999999999999999999987554
No 346
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=45.97 E-value=2.9e+02 Score=29.38 Aligned_cols=142 Identities=15% Similarity=0.215 Sum_probs=78.0
Q ss_pred CcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCccccc-ccccceeeeeecCCCCCCeeEEEEeccc-Ccee
Q 003405 17 PKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSL-RKESYELERTISGFSKKPILSMEVLASR-QLLL 91 (823)
Q Consensus 17 ~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l-~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll 91 (823)
..|++.... ++.++=|..||.+.+|+++..+.-+.+ -| .+..-...++..+.+|..|+.+.--|-- ++..
T Consensus 44 GsvNsL~id~tegrymlSGgadgsi~v~Dl~n~t~~e~s-----~li~k~~c~v~~~h~~~Hky~iss~~WyP~DtGmFt 118 (397)
T KOG4283|consen 44 GSVNSLQIDLTEGRYMLSGGADGSIAVFDLQNATDYEAS-----GLIAKHKCIVAKQHENGHKYAISSAIWYPIDTGMFT 118 (397)
T ss_pred CccceeeeccccceEEeecCCCccEEEEEeccccchhhc-----cceeheeeeccccCCccceeeeeeeEEeeecCceee
Confidence 345554433 468899999999999999865532111 01 0111111223334467789888877744 3443
Q ss_pred eEe-C-cEEEEeCCCCcccccccCCCC-cEEEEeeC---CCceEEEEEc-CeEEEEEEcCCCceeEeeeecC-CCCceEE
Q 003405 92 SLS-E-SIAFHRLPNLETIAVLTKAKG-ANVYSWDD---RRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGV-PDTVKSM 163 (823)
Q Consensus 92 ~l~-d-~l~~~~L~~l~~~~~i~~~kg-~~~fa~~~---~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~-~~~~~~l 163 (823)
+-+ | .+++|+..+++..... +..+ |-.-+.++ ..+.|++|.+ -+|.+..+..+ .|... ++- -+.|.++
T Consensus 119 ssSFDhtlKVWDtnTlQ~a~~F-~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SG-s~sH~--LsGHr~~vlaV 194 (397)
T KOG4283|consen 119 SSSFDHTLKVWDTNTLQEAVDF-KMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASG-SFSHT--LSGHRDGVLAV 194 (397)
T ss_pred cccccceEEEeecccceeeEEe-ecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCC-cceee--eccccCceEEE
Confidence 333 4 3999999887643321 1111 11112222 1234666644 46888888755 35432 111 2688899
Q ss_pred EecC
Q 003405 164 SWCG 167 (823)
Q Consensus 164 ~~~~ 167 (823)
.|..
T Consensus 195 ~Wsp 198 (397)
T KOG4283|consen 195 EWSP 198 (397)
T ss_pred Eecc
Confidence 9984
No 347
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=45.91 E-value=88 Score=32.69 Aligned_cols=69 Identities=16% Similarity=0.112 Sum_probs=43.7
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCc-eeeEeCc---EEE
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQL-LLSLSES---IAF 99 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~-Ll~l~d~---l~~ 99 (823)
..+....||+.||++.+|++.....+ ...+.+.+-.++.+|.-...-+..-+ |+..+|+ +++
T Consensus 213 ~~~~~FAv~~Qdg~~~I~DVR~~~tp--------------m~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~~hv 278 (344)
T KOG4532|consen 213 ENDLQFAVVFQDGTCAIYDVRNMATP--------------MAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSRVHV 278 (344)
T ss_pred cCcceEEEEecCCcEEEEEecccccc--------------hhhhcccCCCCCCceEEEEecCCCcceEEEEecCcceEEE
Confidence 33457899999999999998865432 11111112234567777766654333 6667884 888
Q ss_pred EeCCCCc
Q 003405 100 HRLPNLE 106 (823)
Q Consensus 100 ~~L~~l~ 106 (823)
.++.++.
T Consensus 279 ~D~R~~~ 285 (344)
T KOG4532|consen 279 VDTRNYV 285 (344)
T ss_pred EEcccCc
Confidence 8887764
No 348
>smart00777 Mad3_BUB1_I Mad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p.
Probab=45.71 E-value=83 Score=29.11 Aligned_cols=99 Identities=22% Similarity=0.118 Sum_probs=0.0
Q ss_pred CCCChhhHHHhhhhhhhcCccccccccccCCCChHHHHHHHhhcCchhHHHHHHHHhhcccCCCChhHHH-----HHHHH
Q 003405 602 CGTDPMLVLEFSMLVLESCPTQTIELFLSGNIPADLVNSYLKQYSPSMQGRYLELMLAMNENSISGNLQN-----EMVQI 676 (823)
Q Consensus 602 ~~~~~~li~~y~~wll~~~p~~~~~if~~~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~-----~L~~l 676 (823)
+.+-++...+|.+|+.+.-|..+. ..-+...||.++.... .++.++| ++=..
T Consensus 18 ~dDPL~~w~~yI~W~~~~~p~g~~---------------------~s~L~~lLerc~~~f~--~~~~YknD~RyLkiWi~ 74 (125)
T smart00777 18 GDDPLDLWLRYIKWTEENYPQGGK---------------------ESGLLTLLERCIRYFE--DDERYKNDPRYLKIWLK 74 (125)
T ss_pred CCCChHHHHHHHHHHHHhCCCCCc---------------------hhhHHHHHHHHHHHhh--hhhhhcCCHHHHHHHHH
Q ss_pred HHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCc-hhhHHHHHhhccccHHHHHHHH
Q 003405 677 YLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADA-LYEERAILLGKMNQHELALSLY 751 (823)
Q Consensus 677 Yl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~-l~~e~~~Ll~klg~h~~AL~il 751 (823)
|++... .+..+..||.... +..... |+++-|.++.+.|+.++|-+++
T Consensus 75 ya~~~~-------------------dp~~if~~L~~~~---------IG~~~AlfYe~~A~~lE~~g~~~~A~~iy 122 (125)
T smart00777 75 YADNCD-------------------EPRELFQFLYSKG---------IGTKLALFYEEWAQLLEAAGRYKKADEVY 122 (125)
T ss_pred HHHhcC-------------------CHHHHHHHHHHCC---------cchhhHHHHHHHHHHHHHcCCHHHHHHHH
No 349
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=45.60 E-value=33 Score=23.90 Aligned_cols=25 Identities=36% Similarity=0.563 Sum_probs=20.9
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHh
Q 003405 549 YTALLELYKSNARHREALKLLHELV 573 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~ 573 (823)
+..|+..|...|++++|++++.+..
T Consensus 5 ~~~la~~~~~~g~~~~A~~~~~~al 29 (42)
T PF13374_consen 5 LNNLANAYRAQGRYEEALELLEEAL 29 (42)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHhhhhcchhhHHHHHHH
Confidence 4578999999999999999998764
No 350
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=45.36 E-value=40 Score=22.47 Aligned_cols=26 Identities=27% Similarity=0.422 Sum_probs=22.5
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 549 YTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
+..++.+|...|++++|++.+.+..+
T Consensus 4 ~~~lg~~y~~~~~~~~A~~~~~~a~~ 29 (34)
T PF13181_consen 4 YYNLGKIYEQLGDYEEALEYFEKALE 29 (34)
T ss_dssp HHHHHHHHHHTTSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 56789999999999999999987653
No 351
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=45.15 E-value=2.6e+02 Score=33.35 Aligned_cols=107 Identities=15% Similarity=0.306 Sum_probs=59.4
Q ss_pred CCcEEEEeeCCCceEEEEE--cCeEEEEEEcCCCceeEeeeecCCCCceEEEec----CC-eEEEEEcCceEEEEcC---
Q 003405 115 KGANVYSWDDRRGFLCFAR--QKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC----GE-NICIAIRKGYMILNAT--- 184 (823)
Q Consensus 115 kg~~~fa~~~~~~~l~V~~--kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~----~~-~i~v~~~~~y~lidl~--- 184 (823)
++++.+.-+. .+.+|++. +..+.|+...++. +..-..+.-.++|+.+.|. |+ .+.||+.+...++.-.
T Consensus 30 ~~~~li~gss-~~k~a~V~~~~~~LtIWD~~~~~-lE~~~~f~~~~~I~dLDWtst~d~qsiLaVGf~~~v~l~~Q~R~d 107 (631)
T PF12234_consen 30 SNPSLISGSS-IKKIAVVDSSRSELTIWDTRSGV-LEYEESFSEDDPIRDLDWTSTPDGQSILAVGFPHHVLLYTQLRYD 107 (631)
T ss_pred CCcceEeecc-cCcEEEEECCCCEEEEEEcCCcE-EEEeeeecCCCceeeceeeecCCCCEEEEEEcCcEEEEEEccchh
Confidence 4444443333 35667663 5667777776442 2222224557899999997 33 4889998887776531
Q ss_pred --CC--C---eeeccCCCCCCCCE--EEEccCCeEEEEeCCeEEEEcC
Q 003405 185 --NG--A---LSEVFPSGRIGPPL--VVSLLSGELLLGKENIGVFVDQ 223 (823)
Q Consensus 185 --~~--~---~~~L~~~~~~~~p~--i~~~~~~EfLL~~~~~gvfv~~ 223 (823)
+. . +..+--.+-...|+ .++++++.++++.++..+..+.
T Consensus 108 y~~~~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~sGNqlfv~dk 155 (631)
T PF12234_consen 108 YTNKGPSWAPIRKIDISSHTPHPIGDSIWLKDGTLVVGSGNQLFVFDK 155 (631)
T ss_pred hhcCCcccceeEEEEeecCCCCCccceeEecCCeEEEEeCCEEEEECC
Confidence 11 1 11110011112343 4567888888888776655553
No 352
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=44.03 E-value=43 Score=22.18 Aligned_cols=27 Identities=33% Similarity=0.469 Sum_probs=23.7
Q ss_pred cHHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 548 HYTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 548 ~~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
-|..++..|.+.|+++.|++++....+
T Consensus 3 ty~~ll~a~~~~g~~~~a~~~~~~M~~ 29 (34)
T PF13812_consen 3 TYNALLRACAKAGDPDAALQLFDEMKE 29 (34)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 377899999999999999999998764
No 353
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=43.92 E-value=4.3e+02 Score=31.94 Aligned_cols=53 Identities=21% Similarity=0.176 Sum_probs=35.8
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.++..|++++|+..++.....+. . .........|..++..|+|++|...|.++
T Consensus 186 ~l~~~g~~~eA~~~~~~~l~~~~-~---~~~~~~~~l~~~l~~~g~~~eA~~~~~~a 238 (656)
T PRK15174 186 SFLNKSRLPEDHDLARALLPFFA-L---ERQESAGLAVDTLCAVGKYQEAIQTGESA 238 (656)
T ss_pred HHHHcCCHHHHHHHHHHHHhcCC-C---cchhHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 46788999999999887532211 0 01122244567788999999999998873
No 354
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=43.70 E-value=32 Score=38.10 Aligned_cols=54 Identities=17% Similarity=0.095 Sum_probs=39.0
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
...++..|+|++|+..++.....+. .....+.+.|..++..|+|++|...|.++
T Consensus 43 a~~~~~~g~~~eAl~~~~~Al~l~P-----~~~~a~~~lg~~~~~lg~~~eA~~~~~~a 96 (356)
T PLN03088 43 AQANIKLGNFTEAVADANKAIELDP-----SLAKAYLRKGTACMKLEEYQTAKAALEKG 96 (356)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHhCc-----CCHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 4456788999999998876421111 22345677789999999999999998773
No 355
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=43.62 E-value=2.8e+02 Score=26.00 Aligned_cols=113 Identities=12% Similarity=0.107 Sum_probs=60.4
Q ss_pred EeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEec-----ccCceeeEeC-cE
Q 003405 24 SYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLA-----SRQLLLSLSE-SI 97 (823)
Q Consensus 24 ~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~-----~~~~Ll~l~d-~l 97 (823)
.....|..+|..|.|++++.......... .... + ++-++ .+.|+.|..-+ ..+.|++=+. .|
T Consensus 8 G~~pcL~~aT~~gKV~IH~ph~~~~~~~~-------~~~~---i-~~LNi-n~~italaaG~l~~~~~~D~LliGt~t~l 75 (136)
T PF14781_consen 8 GVHPCLACATTGGKVFIHNPHERGQRTGR-------QDSD---I-SFLNI-NQEITALAAGRLKPDDGRDCLLIGTQTSL 75 (136)
T ss_pred CCceeEEEEecCCEEEEECCCcccccccc-------ccCc---e-eEEEC-CCceEEEEEEecCCCCCcCEEEEeccceE
Confidence 33457899999999999975543321100 0001 1 12234 45677775444 3456666666 48
Q ss_pred EEEeCCCCcccccccCCCCcEEEEeeC---CC-ceEEEEEcCeEEEEEEcCCCce
Q 003405 98 AFHRLPNLETIAVLTKAKGANVYSWDD---RR-GFLCFARQKRVCIFRHDGGRGF 148 (823)
Q Consensus 98 ~~~~L~~l~~~~~i~~~kg~~~fa~~~---~~-~~l~V~~kkki~l~~~~~~~~f 148 (823)
.+|+..+-.-+....-..|++++.+.. .. ..++|+..-.|.=|.+.+.+.|
T Consensus 76 laYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~Gfd~~G~e~f 130 (136)
T PF14781_consen 76 LAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQGFDYEGNEIF 130 (136)
T ss_pred EEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEEeCCCCcEEE
Confidence 888875432222222335788877742 12 2344555555666666544333
No 356
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=43.60 E-value=3.4e+02 Score=28.57 Aligned_cols=69 Identities=13% Similarity=0.224 Sum_probs=44.5
Q ss_pred cEEEEeeCCCce-EEEEEcCeEEEEEEcCCCce-eEeeeecCCCCceEEEecC--CeEEEEEcCce-EEEEcCC
Q 003405 117 ANVYSWDDRRGF-LCFARQKRVCIFRHDGGRGF-VEVKDFGVPDTVKSMSWCG--ENICIAIRKGY-MILNATN 185 (823)
Q Consensus 117 ~~~fa~~~~~~~-l~V~~kkki~l~~~~~~~~f-~~~kei~~~~~~~~l~~~~--~~i~v~~~~~y-~lidl~~ 185 (823)
.+..+++++... .||+--+++..|.++.+... ..+++-+..|.=-+.+|.. ....||+..+| .++|+..
T Consensus 161 ~ns~~~snd~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a~t~D~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~ 234 (344)
T KOG4532|consen 161 QNSLHYSNDPSWGSSVGDSRRVFRYAIDDESEYIENIYEAPTSDHGFYNSFSENDLQFAVVFQDGTCAIYDVRN 234 (344)
T ss_pred eeeeEEcCCCceEEEecCCCcceEEEeCCccceeeeeEecccCCCceeeeeccCcceEEEEecCCcEEEEEecc
Confidence 455666766654 46667788999999864332 2344555556556667763 45777888666 5778765
No 357
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=43.51 E-value=56 Score=34.02 Aligned_cols=66 Identities=15% Similarity=0.308 Sum_probs=47.4
Q ss_pred CCeeEEEEecccCceeeEe-Cc-EEEEeCCCCcccccccCC-CCcEEEEeeCCCceEEEEEc-CeEEEEE
Q 003405 76 KPILSMEVLASRQLLLSLS-ES-IAFHRLPNLETIAVLTKA-KGANVYSWDDRRGFLCFARQ-KRVCIFR 141 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~-d~-l~~~~L~~l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~k-kki~l~~ 141 (823)
..|..+.+=|+..++.+-+ |+ +++|...++.|...+..- -||+++|..++.+.++.|.+ .+|.++.
T Consensus 252 pGv~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rISLWk 321 (323)
T KOG0322|consen 252 PGVSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCELMAAASKDARISLWK 321 (323)
T ss_pred CCccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCchhhhccCCceEEeee
Confidence 4677777777766666554 45 999999888887654332 48999999988777777744 5676665
No 358
>KOG4328 consensus WD40 protein [Function unknown]
Probab=43.50 E-value=2.2e+02 Score=31.97 Aligned_cols=113 Identities=19% Similarity=0.260 Sum_probs=72.6
Q ss_pred cEEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 18 KIDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 18 ~I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
+|++++.+- ..+.-|..|++..+|++..-.. ++.. ++... -++++|..-..-|..+-|++-|
T Consensus 324 KI~sv~~NP~~p~~laT~s~D~T~kIWD~R~l~~------------K~sp-~lst~--~HrrsV~sAyFSPs~gtl~TT~ 388 (498)
T KOG4328|consen 324 KITSVALNPVCPWFLATASLDQTAKIWDLRQLRG------------KASP-FLSTL--PHRRSVNSAYFSPSGGTLLTTC 388 (498)
T ss_pred ccceeecCCCCchheeecccCcceeeeehhhhcC------------CCCc-ceecc--cccceeeeeEEcCCCCceEeec
Confidence 888888774 5789999999999999775332 1111 22222 2478999999999887765554
Q ss_pred -Cc-EEEEeC----CCCcccccccCC----CCcEEE--EeeCCCceEEEE-EcCeEEEEEEcCC
Q 003405 95 -ES-IAFHRL----PNLETIAVLTKA----KGANVY--SWDDRRGFLCFA-RQKRVCIFRHDGG 145 (823)
Q Consensus 95 -d~-l~~~~L----~~l~~~~~i~~~----kg~~~f--a~~~~~~~l~V~-~kkki~l~~~~~~ 145 (823)
|+ |.+|+- ..+++...|... +-.+-| +++++...|+|| ..++|-+|.-.++
T Consensus 389 ~D~~IRv~dss~~sa~~~p~~~I~Hn~~t~RwlT~fKA~W~P~~~li~vg~~~r~IDv~~~~~~ 452 (498)
T KOG4328|consen 389 QDNEIRVFDSSCISAKDEPLGTIPHNNRTGRWLTPFKAAWDPDYNLIVVGRYPRPIDVFDGNGG 452 (498)
T ss_pred cCCceEEeecccccccCCccceeeccCcccccccchhheeCCCccEEEEeccCcceeEEcCCCC
Confidence 44 999987 344444333211 222333 246665667777 6778888877644
No 359
>PF04841 Vps16_N: Vps16, N-terminal region; InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=43.28 E-value=5.3e+02 Score=29.10 Aligned_cols=70 Identities=14% Similarity=0.223 Sum_probs=43.3
Q ss_pred CCCceEEEec--CCeEEEEEcCceE-EEEcCCCCeeeccCCCCCCCC-EEEEccCCeEEEEeCCeEEEEcCCCc
Q 003405 157 PDTVKSMSWC--GENICIAIRKGYM-ILNATNGALSEVFPSGRIGPP-LVVSLLSGELLLGKENIGVFVDQNGK 226 (823)
Q Consensus 157 ~~~~~~l~~~--~~~i~v~~~~~y~-lidl~~~~~~~L~~~~~~~~p-~i~~~~~~EfLL~~~~~gvfv~~~G~ 226 (823)
.++++.|+.. |..|++-+.+++. +++.+-.+..--+..+....| .+.+.+++-+++..++...+++..|.
T Consensus 216 ~~~i~~iavSpng~~iAl~t~~g~l~v~ssDf~~~~~e~~~~~~~~p~~~~WCG~dav~l~~~~~l~lvg~~~~ 289 (410)
T PF04841_consen 216 DGPIIKIAVSPNGKFIALFTDSGNLWVVSSDFSEKLCEFDTDSKSPPKQMAWCGNDAVVLSWEDELLLVGPDGD 289 (410)
T ss_pred CCCeEEEEECCCCCEEEEEECCCCEEEEECcccceeEEeecCcCCCCcEEEEECCCcEEEEeCCEEEEECCCCC
Confidence 3567766654 6677777776654 444332333333333333344 46788888888877888888887776
No 360
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=42.80 E-value=8.1e+02 Score=31.06 Aligned_cols=110 Identities=16% Similarity=0.154 Sum_probs=61.0
Q ss_pred CcEEEEEEeCCEEEEEeCCCcE-EEEcCCCCCCCCCCCCcccccccccceeeeeec----CCCCCCeeEEEEecccCcee
Q 003405 17 PKIDAVASYGLKILLGCSDGSL-KIYSPGSSESDRSPPSDYQSLRKESYELERTIS----GFSKKPILSMEVLASRQLLL 91 (823)
Q Consensus 17 ~~I~ci~~~~~~L~vGT~~G~l-~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~----~~~k~~I~qI~~~~~~~~Ll 91 (823)
..-+|.+...+.+|+.+..... ....+....... .....-.+. ......|..+..+++.+-++
T Consensus 24 ~~~~~~d~~sd~i~~~~~~~~~~~~i~~~~~~~~~------------~~~~l~s~~~~~~~~~~~~ivs~~yl~d~~~l~ 91 (928)
T PF04762_consen 24 ITATAFDSDSDSIYFVLGPNEIDYVIELDRFSQDG------------SVEVLASWDAPLPDDPNDKIVSFQYLADSESLC 91 (928)
T ss_pred cceEEEecCCCeEEEEECCCCcceEEEEEeeccCC------------ceeEEEeccccCCcCCCCcEEEEEeccCCCcEE
Confidence 3456777778888887776655 344443322211 011111211 11245799999999887655
Q ss_pred eEe-Cc-EEEEeC----CCCcccccccC-CCCcEEEEeeCCCceEEEEEcC-eEEE
Q 003405 92 SLS-ES-IAFHRL----PNLETIAVLTK-AKGANVYSWDDRRGFLCFARQK-RVCI 139 (823)
Q Consensus 92 ~l~-d~-l~~~~L----~~l~~~~~i~~-~kg~~~fa~~~~~~~l~V~~kk-ki~l 139 (823)
+.. +| |.++.. .+- .+..+.. -.|+.+.+++++...++++.+. ++.+
T Consensus 92 ~~~~~Gdi~~~~~~~~~~~~-~~E~VG~vd~GI~a~~WSPD~Ella~vT~~~~l~~ 146 (928)
T PF04762_consen 92 IALASGDIILVREDPDPDED-EIEIVGSVDSGILAASWSPDEELLALVTGEGNLLL 146 (928)
T ss_pred EEECCceEEEEEccCCCCCc-eeEEEEEEcCcEEEEEECCCcCEEEEEeCCCEEEE
Confidence 554 56 666622 211 1111222 3489999999988777777544 4433
No 361
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=42.70 E-value=45 Score=21.19 Aligned_cols=27 Identities=19% Similarity=0.317 Sum_probs=21.2
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEc
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYS 42 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~ 42 (823)
...|.|+.... +.++.|+.+|.+.+|+
T Consensus 12 ~~~i~~~~~~~~~~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 12 TGPVTSVAFSPDGKYLASASDDGTIKLWD 40 (40)
T ss_pred CCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence 34788887764 5899999999988774
No 362
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=42.67 E-value=35 Score=23.61 Aligned_cols=22 Identities=23% Similarity=0.391 Sum_probs=18.2
Q ss_pred HHHHHHHHccCCHHHHHHHHHh
Q 003405 343 IRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 343 ~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
...|..+...|+|++|+++|.+
T Consensus 3 ~~Lg~~~~~~g~~~~Ai~~y~~ 24 (36)
T PF13176_consen 3 NNLGRIYRQQGDYEKAIEYYEQ 24 (36)
T ss_dssp HHHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHH
Confidence 3468889999999999999987
No 363
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=42.39 E-value=47 Score=30.50 Aligned_cols=54 Identities=28% Similarity=0.354 Sum_probs=32.4
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+-.-|++++|+.+++....+. ..+..-..+..-+|..|+..|+.++|+..+..
T Consensus 47 tlr~LG~~deA~~~L~~~~~~~--p~~~~~~~l~~f~Al~L~~~gr~~eAl~~~l~ 100 (120)
T PF12688_consen 47 TLRNLGRYDEALALLEEALEEF--PDDELNAALRVFLALALYNLGRPKEALEWLLE 100 (120)
T ss_pred HHHHcCCHHHHHHHHHHHHHHC--CCccccHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 3445677888887776542110 00112334556677778888888888877765
No 364
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=42.26 E-value=1.1e+02 Score=33.48 Aligned_cols=117 Identities=16% Similarity=0.173 Sum_probs=61.8
Q ss_pred ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEe------C------CeEEEEcCCCccccCCceeecCCCcEEEE
Q 003405 177 GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGK------E------NIGVFVDQNGKLLQADRICWSEAPIAVII 244 (823)
Q Consensus 177 ~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~------~------~~gvfv~~~G~~~~~~~i~w~~~P~~v~~ 244 (823)
..+++|-++++..-..+.|-. |.++..+++.++... . +..-.+|...-. ....|..+..|+..+.
T Consensus 18 rv~viD~d~~k~lGmi~~g~~--~~~~~spdgk~~y~a~T~~sR~~rG~RtDvv~~~D~~TL~-~~~EI~iP~k~R~~~~ 94 (342)
T PF06433_consen 18 RVYVIDADSGKLLGMIDTGFL--GNVALSPDGKTIYVAETFYSRGTRGERTDVVEIWDTQTLS-PTGEIEIPPKPRAQVV 94 (342)
T ss_dssp EEEEEETTTTEEEEEEEEESS--EEEEE-TTSSEEEEEEEEEEETTEEEEEEEEEEEETTTTE-EEEEEEETTS-B--BS
T ss_pred eEEEEECCCCcEEEEeecccC--CceeECCCCCEEEEEEEEEeccccccceeEEEEEecCcCc-ccceEecCCcchheec
Confidence 477888888877666665532 233333455444321 1 223334433321 1345666665666555
Q ss_pred eCCEEEEEeCC-------------eEEEEEccCCCceeEEEeeCCcccccccC-CeEE-EeccceEEE
Q 003405 245 QKPYAIALLPR-------------RVEVRSLRVPYALIQTIVLQNVRHLIPSS-NAVV-VALENSIFG 297 (823)
Q Consensus 245 ~~PYll~~~~~-------------~ieV~~l~~~~~lvQ~i~l~~~~~l~~~~-~~v~-v~s~~~I~~ 297 (823)
.+++.++++.+ +|.|.++. .+..+..|+.|++-++.+.+ +.|+ +|.++++..
T Consensus 95 ~~~~~~~ls~dgk~~~V~N~TPa~SVtVVDl~-~~kvv~ei~~PGC~~iyP~~~~~F~~lC~DGsl~~ 161 (342)
T PF06433_consen 95 PYKNMFALSADGKFLYVQNFTPATSVTVVDLA-AKKVVGEIDTPGCWLIYPSGNRGFSMLCGDGSLLT 161 (342)
T ss_dssp --GGGEEE-TTSSEEEEEEESSSEEEEEEETT-TTEEEEEEEGTSEEEEEEEETTEEEEEETTSCEEE
T ss_pred ccccceEEccCCcEEEEEccCCCCeEEEEECC-CCceeeeecCCCEEEEEecCCCceEEEecCCceEE
Confidence 56666666542 57777876 47889999999986665433 3333 344444443
No 365
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins. It mediates protein-protein interactions and the assembly of multiprotein complexes. The TPR motif consists of 3-16 tandem repeats of 34 amino acid residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=42.20 E-value=46 Score=22.04 Aligned_cols=25 Identities=16% Similarity=0.218 Sum_probs=21.2
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHh
Q 003405 549 YTALLELYKSNARHREALKLLHELV 573 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~ 573 (823)
+..++..|...|++++|++.+.+..
T Consensus 4 ~~~lg~~~~~~~~~~~A~~~~~~al 28 (34)
T PF07719_consen 4 WYYLGQAYYQLGNYEEAIEYFEKAL 28 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 5578999999999999999998765
No 366
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=42.14 E-value=93 Score=22.74 Aligned_cols=31 Identities=16% Similarity=0.095 Sum_probs=25.0
Q ss_pred CcEEEEEEeCCEEEEEeCCCcEEEEcCCCCC
Q 003405 17 PKIDAVASYGLKILLGCSDGSLKIYSPGSSE 47 (823)
Q Consensus 17 ~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~ 47 (823)
.....+...|+.+||+..++-|.++++...+
T Consensus 2 G~a~~v~v~g~yaYva~~~~Gl~IvDISnPs 32 (42)
T PF08309_consen 2 GDARDVAVSGNYAYVADGNNGLVIVDISNPS 32 (42)
T ss_pred ceEEEEEEECCEEEEEeCCCCEEEEECCCCC
Confidence 3466788999999999887778999987543
No 367
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=40.97 E-value=5.8e+02 Score=31.14 Aligned_cols=109 Identities=16% Similarity=0.181 Sum_probs=69.9
Q ss_pred EEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe-C
Q 003405 19 IDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS-E 95 (823)
Q Consensus 19 I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~-d 95 (823)
++|+..+ ++.+..|..||.|++|+-.+.+.. +.+.. . -+.+..+|..+....++..|++=+ +
T Consensus 208 ~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~~~~-----------~~t~t---~-lHWH~~~V~~L~fS~~G~~LlSGG~E 272 (792)
T KOG1963|consen 208 ITCVALSPNERYLAAGDSDGRILVWRDFGSSDD-----------SETCT---L-LHWHHDEVNSLSFSSDGAYLLSGGRE 272 (792)
T ss_pred ceeEEeccccceEEEeccCCcEEEEeccccccc-----------cccce---E-EEecccccceeEEecCCceEeecccc
Confidence 5666554 689999999999999985442211 11221 1 235567899999998887776533 2
Q ss_pred c-EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEE-EEcCeEEEEEEc
Q 003405 96 S-IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCF-ARQKRVCIFRHD 143 (823)
Q Consensus 96 ~-l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V-~~kkki~l~~~~ 143 (823)
+ +..|.+.+-+ ..-+++.-+ +..|.++++....++ ...+.|.+....
T Consensus 273 ~VLv~Wq~~T~~-kqfLPRLgs~I~~i~vS~ds~~~sl~~~DNqI~li~~~ 322 (792)
T KOG1963|consen 273 GVLVLWQLETGK-KQFLPRLGSPILHIVVSPDSDLYSLVLEDNQIHLIKAS 322 (792)
T ss_pred eEEEEEeecCCC-cccccccCCeeEEEEEcCCCCeEEEEecCceEEEEecc
Confidence 4 7889987655 222455443 456777777654433 367888777764
No 368
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=40.83 E-value=1.6e+02 Score=31.74 Aligned_cols=106 Identities=16% Similarity=0.189 Sum_probs=66.0
Q ss_pred EEEEEEeC-----CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Cceee
Q 003405 19 IDAVASYG-----LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLS 92 (823)
Q Consensus 19 I~ci~~~~-----~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~ 92 (823)
-+|.-+++ -++.+|..-|.|.+.+..... +.+...++ ...|+.|+.-|.. +++++
T Consensus 93 ytcsw~yd~~~~~p~la~~G~~GvIrVid~~~~~------------------~~~~~~gh-G~sINeik~~p~~~qlvls 153 (385)
T KOG1034|consen 93 YTCSWSYDSNTGNPFLAAGGYLGVIRVIDVVSGQ------------------CSKNYRGH-GGSINEIKFHPDRPQLVLS 153 (385)
T ss_pred EEEEEEecCCCCCeeEEeecceeEEEEEecchhh------------------hccceecc-CccchhhhcCCCCCcEEEE
Confidence 46666664 278999999999998865422 12222333 5689999988876 67777
Q ss_pred EeC--cEEEEeCCCCcccccccCCC----CcEEEEeeCCCceEE-EEEcCeEEEEEEc
Q 003405 93 LSE--SIAFHRLPNLETIAVLTKAK----GANVYSWDDRRGFLC-FARQKRVCIFRHD 143 (823)
Q Consensus 93 l~d--~l~~~~L~~l~~~~~i~~~k----g~~~fa~~~~~~~l~-V~~kkki~l~~~~ 143 (823)
.+. .|++|++.+-.-+..+.... .|.++-.+.+..+|+ .|....|+++++.
T Consensus 154 ~SkD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~ 211 (385)
T KOG1034|consen 154 ASKDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLN 211 (385)
T ss_pred ecCCceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecC
Confidence 775 39999987654333222222 133333344433443 4577888888876
No 369
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=40.77 E-value=43 Score=21.06 Aligned_cols=19 Identities=11% Similarity=0.153 Sum_probs=14.8
Q ss_pred CcEEEEEEeC-CEEEEEeCC
Q 003405 17 PKIDAVASYG-LKILLGCSD 35 (823)
Q Consensus 17 ~~I~ci~~~~-~~L~vGT~~ 35 (823)
..|.|+.... ++|+|||.+
T Consensus 5 n~I~~i~~D~~G~lWigT~~ 24 (24)
T PF07494_consen 5 NNIYSIYEDSDGNLWIGTYN 24 (24)
T ss_dssp SCEEEEEE-TTSCEEEEETS
T ss_pred CeEEEEEEcCCcCEEEEeCC
Confidence 4789988876 699999974
No 370
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=40.42 E-value=5.4e+02 Score=28.31 Aligned_cols=126 Identities=14% Similarity=0.092 Sum_probs=62.0
Q ss_pred cCCeEEEEEc-CceEEEEcCCCCeeeccCC----CC---------CCCCEEEEccCCeEEEE-eCCeEEEEcCCCccccC
Q 003405 166 CGENICIAIR-KGYMILNATNGALSEVFPS----GR---------IGPPLVVSLLSGELLLG-KENIGVFVDQNGKLLQA 230 (823)
Q Consensus 166 ~~~~i~v~~~-~~y~lidl~~~~~~~L~~~----~~---------~~~p~i~~~~~~EfLL~-~~~~gvfv~~~G~~~~~ 230 (823)
.++.+++++. .....+|..+|+...-.+. +. ...|.+ .++.+.++ .++..+.+|.. .
T Consensus 189 ~~~~v~~~~~~g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~---~~~~vy~~~~~g~l~a~d~~-----t 260 (377)
T TIGR03300 189 ADGGVLVGFAGGKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVV---DGGQVYAVSYQGRVAALDLR-----S 260 (377)
T ss_pred ECCEEEEECCCCEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEE---ECCEEEEEEcCCEEEEEECC-----C
Confidence 4667888877 4567889999875432221 11 112332 23444443 34444445541 2
Q ss_pred CceeecCC---CcEEEEeCCEEEEEeC-CeEEEEEccCCCceeEEE-eeCCcc--cccccCCeEEEe-ccceEEEeec
Q 003405 231 DRICWSEA---PIAVIIQKPYAIALLP-RRVEVRSLRVPYALIQTI-VLQNVR--HLIPSSNAVVVA-LENSIFGLFP 300 (823)
Q Consensus 231 ~~i~w~~~---P~~v~~~~PYll~~~~-~~ieV~~l~~~~~lvQ~i-~l~~~~--~l~~~~~~v~v~-s~~~I~~l~~ 300 (823)
+.+.|... +...+....+|++..+ +.+...+.. ++..+-.. .+.+.. .....++.+|+. .++.|+++..
T Consensus 261 G~~~W~~~~~~~~~p~~~~~~vyv~~~~G~l~~~d~~-tG~~~W~~~~~~~~~~ssp~i~g~~l~~~~~~G~l~~~d~ 337 (377)
T TIGR03300 261 GRVLWKRDASSYQGPAVDDNRLYVTDADGVVVALDRR-SGSELWKNDELKYRQLTAPAVVGGYLVVGDFEGYLHWLSR 337 (377)
T ss_pred CcEEEeeccCCccCceEeCCEEEEECCCCeEEEEECC-CCcEEEccccccCCccccCEEECCEEEEEeCCCEEEEEEC
Confidence 34456332 2334456677777775 457777765 45544322 222211 001134455543 3456666654
No 371
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=40.17 E-value=1.1e+02 Score=38.21 Aligned_cols=83 Identities=12% Similarity=0.176 Sum_probs=50.1
Q ss_pred ccCceeeEeCc-EEEEeCCCCcc-cccccCCCCcEEEEeeCCCceEEEEEcC-eEEEEEEcCCCceeEeeeecC-----C
Q 003405 86 SRQLLLSLSES-IAFHRLPNLET-IAVLTKAKGANVYSWDDRRGFLCFARQK-RVCIFRHDGGRGFVEVKDFGV-----P 157 (823)
Q Consensus 86 ~~~~Ll~l~d~-l~~~~L~~l~~-~~~i~~~kg~~~fa~~~~~~~l~V~~kk-ki~l~~~~~~~~f~~~kei~~-----~ 157 (823)
..+.+++|+|. |.+..+....+ +.+.+..+.++++|+.+..-+++|+... ++.=|+-.. ...+++.. +
T Consensus 168 p~n~av~l~dlsl~V~~~~~~~~~v~s~p~t~~~Tav~WSprGKQl~iG~nnGt~vQy~P~l----eik~~ip~Pp~~e~ 243 (1405)
T KOG3630|consen 168 PLNSAVDLSDLSLRVKSTKQLAQNVTSFPVTNSQTAVLWSPRGKQLFIGRNNGTEVQYEPSL----EIKSEIPEPPVEEN 243 (1405)
T ss_pred cchhhhhccccchhhhhhhhhhhhhcccCcccceeeEEeccccceeeEecCCCeEEEeeccc----ceeecccCCCcCCC
Confidence 34678888884 77776654432 2334556778889998876677777443 344444331 11223322 4
Q ss_pred CCceEEEecCCeEEE
Q 003405 158 DTVKSMSWCGENICI 172 (823)
Q Consensus 158 ~~~~~l~~~~~~i~v 172 (823)
..|.+++|.++..++
T Consensus 244 yrvl~v~Wl~t~efl 258 (1405)
T KOG3630|consen 244 YRVLSVTWLSTQEFL 258 (1405)
T ss_pred cceeEEEEecceeEE
Confidence 678999999876433
No 372
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms]
Probab=39.92 E-value=2.1 Score=52.60 Aligned_cols=181 Identities=13% Similarity=0.063 Sum_probs=109.5
Q ss_pred ecccCceeeEeC----c-EEEEeCCCCcc---cccccCCCCcEEEEeeCCC----ceEEEEEcCeEEEEEEcCCCce-eE
Q 003405 84 LASRQLLLSLSE----S-IAFHRLPNLET---IAVLTKAKGANVYSWDDRR----GFLCFARQKRVCIFRHDGGRGF-VE 150 (823)
Q Consensus 84 ~~~~~~Ll~l~d----~-l~~~~L~~l~~---~~~i~~~kg~~~fa~~~~~----~~l~V~~kkki~l~~~~~~~~f-~~ 150 (823)
++..++|..+++ | +++++..+++. -..+...++|..|+++... +.++++.++-+.+|++..+..+ ..
T Consensus 1092 ~e~e~~l~~l~~~~~eg~lsl~~~~~~~~~~~~~~V~~s~~~~l~~~~~~~k~~~~~~il~i~k~~~v~~vt~~d~~~~~ 1171 (1317)
T KOG0612|consen 1092 DEAEQILPLLQGSRLEGWLSLPPRQNLDRDWKRIYVIVSSKKILFYVSEQDKEQSGPLILDIKKLFHVRQVTQTDVRRAD 1171 (1317)
T ss_pred chhhcchhhhhhhhhhcccccCcccccccchheeEEeecccceEeeeccccccccchhhhhhhhceeEEeecccccccch
Confidence 556677776665 3 77776554433 1235567888889886543 3467788888889999865433 34
Q ss_pred eeeecCCCCceEEEecCCeEEEEEcCceEEEEcCCCCe----eeccC----CCCCCCCEEEEc--cCCeEEEEeCCeEEE
Q 003405 151 VKDFGVPDTVKSMSWCGENICIAIRKGYMILNATNGAL----SEVFP----SGRIGPPLVVSL--LSGELLLGKENIGVF 220 (823)
Q Consensus 151 ~kei~~~~~~~~l~~~~~~i~v~~~~~y~lidl~~~~~----~~L~~----~~~~~~p~i~~~--~~~EfLL~~~~~gvf 220 (823)
.+|+ |..++.+.-..+. ++...+|..+++..+.. -+..+ .+..+.-|+.++ .-+++++++...+..
T Consensus 1172 ~~ei--p~~fq~l~~~~~~--~~~~~~f~~l~l~~~~~v~~~~~~~~~l~~~~~~~~~~~k~l~~~~~~ye~~~~~~~~~ 1247 (1317)
T KOG0612|consen 1172 AKEI--PRIFQILYANEGE--SGHPSEFSYLSLGPNSLVHKGHEFIPFLYHFPTNCEACIKPLWHMFKAYECRRCHIKCH 1247 (1317)
T ss_pred hhhc--chhHHHHHhhccc--ccCccccchhhccchhhcCCCCcchHHHhhcchhHHHHhhhcccchhHHHHHHhhcccc
Confidence 5566 7777776555444 77778888777763211 00000 000011122221 113888888899999
Q ss_pred EcCCCccccCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCCcccccc
Q 003405 221 VDQNGKLLQADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQNVRHLIP 282 (823)
Q Consensus 221 v~~~G~~~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l~~ 282 (823)
+|.-|+ +.|+ +..+.+..++++.+.+...+ ...+||++..+..+.+..
T Consensus 1248 ~d~~~k------~m~p--~ky~~~~a~~l~l~a~~~~d------q~eWV~~l~k~~~k~~~~ 1295 (1317)
T KOG0612|consen 1248 KDHMDK------IMAP--CKYDTSSARHLLLLAESTED------QAKWVQRLVKKIPKPLPA 1295 (1317)
T ss_pred cccccc------ccCc--ccccccCCccceeccCCchH------HHHHHHHHhcccCCCCCc
Confidence 998887 4443 33777788899888875544 246888876655444333
No 373
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=38.97 E-value=60 Score=39.04 Aligned_cols=72 Identities=21% Similarity=0.329 Sum_probs=49.2
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhC-------------CCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHH
Q 003405 304 GAQIVQLTASGDFEEALALCKLL-------------PPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDIT 370 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~-------------~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~ 370 (823)
+.-++-|....+|+.|++||..- +..|..+..+....|..+.|..+..+|.|..|-..|.+++---+
T Consensus 1097 ekAV~lL~~ar~~~~AlqlC~~~nv~vtee~aE~mTp~Kd~~~~e~~R~~vLeqvae~c~qQG~Yh~AtKKfTQAGdKl~ 1176 (1416)
T KOG3617|consen 1097 EKAVNLLCLAREFSGALQLCKNRNVRVTEEFAELMTPTKDDMPNEQERKQVLEQVAELCLQQGAYHAATKKFTQAGDKLS 1176 (1416)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCchhHHHHHhcCcCcCCCccHHHHHHHHHHHHHHHHhccchHHHHHHHhhhhhHHH
Confidence 44455666677788888888642 11222222334567788899999999999999999999775444
Q ss_pred HHHHh
Q 003405 371 YALSL 375 (823)
Q Consensus 371 ~vi~L 375 (823)
..=+|
T Consensus 1177 AMraL 1181 (1416)
T KOG3617|consen 1177 AMRAL 1181 (1416)
T ss_pred HHHHH
Confidence 44444
No 374
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=38.70 E-value=5.2e+02 Score=27.61 Aligned_cols=113 Identities=13% Similarity=0.268 Sum_probs=70.3
Q ss_pred CCCeeEEEEeccc-----CceeeEeCcEEEEeCCC----CcccccccCCC------CcEEEEeeCCCc-eEEEE-EcCeE
Q 003405 75 KKPILSMEVLASR-----QLLLSLSESIAFHRLPN----LETIAVLTKAK------GANVYSWDDRRG-FLCFA-RQKRV 137 (823)
Q Consensus 75 k~~I~qI~~~~~~-----~~Ll~l~d~l~~~~L~~----l~~~~~i~~~k------g~~~fa~~~~~~-~l~V~-~kkki 137 (823)
..|+++|.-+|.. ++|.+.+|.+++|.... ++....+...| ..++|-.++-.. .|.+. ..-..
T Consensus 96 ~YP~tK~~wiPd~~g~~pdlLATs~D~LRlWri~~ee~~~~~~~~L~~~kns~~~aPlTSFDWne~dp~~igtSSiDTTC 175 (364)
T KOG0290|consen 96 PYPVTKLMWIPDSKGVYPDLLATSSDFLRLWRIGDEESRVELQSVLNNNKNSEFCAPLTSFDWNEVDPNLIGTSSIDTTC 175 (364)
T ss_pred CCCccceEecCCccccCcchhhcccCeEEEEeccCcCCceehhhhhccCcccccCCcccccccccCCcceeEeecccCeE
Confidence 5799999999976 46667777899999863 22222222222 456677665433 34443 56677
Q ss_pred EEEEEcCCC-ceeEeeeecCCCCceEEEecC--CeEE--EEEcCceEEEEcCCCC
Q 003405 138 CIFRHDGGR-GFVEVKDFGVPDTVKSMSWCG--ENIC--IAIRKGYMILNATNGA 187 (823)
Q Consensus 138 ~l~~~~~~~-~f~~~kei~~~~~~~~l~~~~--~~i~--v~~~~~y~lidl~~~~ 187 (823)
+|+.+..+- ...+++-|.-..++--++|.+ ..+| ||-..+..++|+..-.
T Consensus 176 TiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaDGSvRmFDLR~le 230 (364)
T KOG0290|consen 176 TIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGADGSVRMFDLRSLE 230 (364)
T ss_pred EEEEEeeccccceeeEEEecCcceeEEEeccCccceEEEecCCCcEEEEEecccc
Confidence 788876431 123444455566788899984 3344 4455789999998643
No 375
>KOG4340 consensus Uncharacterized conserved protein [Function unknown]
Probab=38.69 E-value=40 Score=35.64 Aligned_cols=70 Identities=13% Similarity=0.121 Sum_probs=49.6
Q ss_pred eeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCC
Q 003405 298 LFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYP 377 (823)
Q Consensus 298 l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp 377 (823)
+.+-.+..-|..|+..++|++|+++..... +.++ +-+.-....|+.++.-.+|.+|.+.+.+. -.+||
T Consensus 8 i~EGeftaviy~lI~d~ry~DaI~~l~s~~-Er~p----~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL-------~ql~P 75 (459)
T KOG4340|consen 8 IPEGEFTAVVYRLIRDARYADAIQLLGSEL-ERSP----RSRAGLSLLGYCYYRLQEFALAAECYEQL-------GQLHP 75 (459)
T ss_pred CCCCchHHHHHHHHHHhhHHHHHHHHHHHH-hcCc----cchHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhhCh
Confidence 445556778999999999999999997652 2211 22222344578888889999999999873 35666
Q ss_pred CC
Q 003405 378 SI 379 (823)
Q Consensus 378 ~l 379 (823)
..
T Consensus 76 ~~ 77 (459)
T KOG4340|consen 76 EL 77 (459)
T ss_pred HH
Confidence 55
No 376
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.49 E-value=4e+02 Score=27.90 Aligned_cols=68 Identities=19% Similarity=0.261 Sum_probs=48.8
Q ss_pred CcEEEEEEe---CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-Cceee
Q 003405 17 PKIDAVASY---GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLS 92 (823)
Q Consensus 17 ~~I~ci~~~---~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~ 92 (823)
.+|-+++.. ++.++.++=||+|.+|+.+...+ +++|.+ +...|-|-.--|.. |++..
T Consensus 105 ~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~S------------------v~Tf~g-h~~~Iy~a~~sp~~~nlfas 165 (311)
T KOG0277|consen 105 REVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNS------------------VQTFNG-HNSCIYQAAFSPHIPNLFAS 165 (311)
T ss_pred hheEEeccccccceeEEeeccCCceEeecCCCCcc------------------eEeecC-CccEEEEEecCCCCCCeEEE
Confidence 456666643 46888999999999998665332 345555 36788888888766 77777
Q ss_pred EeC-c-EEEEeCC
Q 003405 93 LSE-S-IAFHRLP 103 (823)
Q Consensus 93 l~d-~-l~~~~L~ 103 (823)
.+. + +++|++.
T Consensus 166 ~Sgd~~l~lwdvr 178 (311)
T KOG0277|consen 166 ASGDGTLRLWDVR 178 (311)
T ss_pred ccCCceEEEEEec
Confidence 765 5 9999964
No 377
>PF12854 PPR_1: PPR repeat
Probab=38.18 E-value=41 Score=23.02 Aligned_cols=26 Identities=23% Similarity=0.309 Sum_probs=21.5
Q ss_pred cChhHHHHHHHhcCCHHHHHHHhhhC
Q 003405 301 VPLGAQIVQLTASGDFEEALALCKLL 326 (823)
Q Consensus 301 ~~~~~qI~~Ll~~~~~e~Al~L~~~~ 326 (823)
..+..-|+.+.+.|++++|+++++..
T Consensus 8 ~ty~~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 8 VTYNTLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred hHHHHHHHHHHHCCCHHHHHHHHHhC
Confidence 34667789999999999999998754
No 378
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=38.01 E-value=68 Score=33.87 Aligned_cols=64 Identities=13% Similarity=0.146 Sum_probs=45.7
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
-+.++..|+|++|+..++......+ ...+........|..++..|++++|...|.+ |+..||+-
T Consensus 187 G~~y~~~g~~~~A~~~f~~vv~~yP--~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~-------vi~~yP~s 250 (263)
T PRK10803 187 GQLNYNKGKKDDAAYYFASVVKNYP--KSPKAADAMFKVGVIMQDKGDTAKAKAVYQQ-------VIKKYPGT 250 (263)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHHCC--CCcchhHHHHHHHHHHHHcCCHHHHHHHHHH-------HHHHCcCC
Confidence 4455789999999999877521100 1123445556678888999999999999985 67888765
No 379
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=37.84 E-value=2.5e+02 Score=30.86 Aligned_cols=110 Identities=10% Similarity=0.127 Sum_probs=66.4
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
..|=|.+.. +.+||-|.++|++...+++...+ -+ +.. .+-.+..|-.+.+-|.-|.+++.+
T Consensus 106 SNIF~L~F~~~N~~~~SG~~~~~VI~HDiEt~qs--------------i~-V~~--~~~~~~~VY~m~~~P~DN~~~~~t 168 (609)
T KOG4227|consen 106 SNIFSLEFDLENRFLYSGERWGTVIKHDIETKQS--------------IY-VAN--ENNNRGDVYHMDQHPTDNTLIVVT 168 (609)
T ss_pred cceEEEEEccCCeeEecCCCcceeEeeeccccee--------------ee-eec--ccCcccceeecccCCCCceEEEEe
Confidence 467787655 46899999999999988774321 11 111 112256899999999999999999
Q ss_pred C-c-EEEEeCCCCc-ccccc-cC--CCCcEEEEeeCCCce-EEEE-EcCeEEEEEEc
Q 003405 95 E-S-IAFHRLPNLE-TIAVL-TK--AKGANVYSWDDRRGF-LCFA-RQKRVCIFRHD 143 (823)
Q Consensus 95 d-~-l~~~~L~~l~-~~~~i-~~--~kg~~~fa~~~~~~~-l~V~-~kkki~l~~~~ 143 (823)
+ + |.+|+..+-. ++..+ .. -++-...-.++..+. |+|+ .+...-+|...
T Consensus 169 ~~~~V~~~D~Rd~~~~~~~~~~AN~~~~F~t~~F~P~~P~Li~~~~~~~G~~~~D~R 225 (609)
T KOG4227|consen 169 RAKLVSFIDNRDRQNPISLVLPANSGKNFYTAEFHPETPALILVNSETGGPNVFDRR 225 (609)
T ss_pred cCceEEEEeccCCCCCCceeeecCCCccceeeeecCCCceeEEeccccCCCCceeec
Confidence 8 5 8899875432 22111 11 122222223444443 5555 45556666554
No 380
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=37.78 E-value=52 Score=23.79 Aligned_cols=26 Identities=27% Similarity=0.393 Sum_probs=23.0
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 549 YTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
+..|+..|...|++++|.+++.+...
T Consensus 4 ~~~la~~~~~~G~~~~A~~~~~~~l~ 29 (44)
T PF13428_consen 4 WLALARAYRRLGQPDEAERLLRRALA 29 (44)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 45789999999999999999998764
No 381
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=37.59 E-value=6.7e+02 Score=31.72 Aligned_cols=38 Identities=26% Similarity=0.392 Sum_probs=31.2
Q ss_pred HHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 537 KICEEILQKKNHYTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 537 ~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
++-...++++++|.-|..+|.+.++|+.||+.+.+..+
T Consensus 792 e~~il~a~~~~~y~Vl~hi~~k~~kyed~l~~iLe~n~ 829 (1206)
T KOG2079|consen 792 ENFILEAKEKNFYKVLFHIYKKENKYEDALSLILETND 829 (1206)
T ss_pred HHHHHHhhhcccceeHHHHHhhhhhHHHHHHHHHHhhh
Confidence 33344456789999999999999999999999998764
No 382
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=37.56 E-value=3.3e+02 Score=34.47 Aligned_cols=140 Identities=14% Similarity=0.102 Sum_probs=71.9
Q ss_pred CCCeeEEEEecccCceeeEe--Cc--EEEEeCCCCccccc----------ccCCCCcEE--EEeeCCCc--eEEEEEcCe
Q 003405 75 KKPILSMEVLASRQLLLSLS--ES--IAFHRLPNLETIAV----------LTKAKGANV--YSWDDRRG--FLCFARQKR 136 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~--d~--l~~~~L~~l~~~~~----------i~~~kg~~~--fa~~~~~~--~l~V~~kkk 136 (823)
+-+|..+.+.++.-..+|+. +| |.+||+.+|..-.. +..-|++-- ..+++... ..+.....+
T Consensus 100 ~~pi~~~v~~~D~t~s~v~~tsng~~v~~fD~~~fs~s~~~~~~pl~~s~ts~ek~vf~~~~~wnP~vp~n~av~l~dls 179 (1405)
T KOG3630|consen 100 EIPIVIFVCFHDATDSVVVSTSNGEAVYSFDLEEFSESRYETTVPLKNSATSFEKPVFQLKNVWNPLVPLNSAVDLSDLS 179 (1405)
T ss_pred cccceEEEeccCCceEEEEEecCCceEEEEehHhhhhhhhhhccccccccchhccccccccccccCCccchhhhhccccc
Confidence 45788888887755544443 34 78888876542211 111122211 22232221 123335667
Q ss_pred EEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEcCceEEEEcCCCCeeeccC---CCCCCC-CEEEEccCCeE
Q 003405 137 VCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIRKGYMILNATNGALSEVFP---SGRIGP-PLVVSLLSGEL 210 (823)
Q Consensus 137 i~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~~y~lidl~~~~~~~L~~---~~~~~~-p~i~~~~~~Ef 210 (823)
|.++...... +. ...+.+--..++++|. |-.+++|..++-.+-=.-++++....+ .-...+ -+++|+...||
T Consensus 180 l~V~~~~~~~-~~-v~s~p~t~~~Tav~WSprGKQl~iG~nnGt~vQy~P~leik~~ip~Pp~~e~yrvl~v~Wl~t~ef 257 (1405)
T KOG3630|consen 180 LRVKSTKQLA-QN-VTSFPVTNSQTAVLWSPRGKQLFIGRNNGTEVQYEPSLEIKSEIPEPPVEENYRVLSVTWLSTQEF 257 (1405)
T ss_pred hhhhhhhhhh-hh-hcccCcccceeeEEeccccceeeEecCCCeEEEeecccceeecccCCCcCCCcceeEEEEecceeE
Confidence 7666665321 11 1123344567899997 667888887654332222233222221 111122 25789999999
Q ss_pred EEEeCC
Q 003405 211 LLGKEN 216 (823)
Q Consensus 211 LL~~~~ 216 (823)
++.+.+
T Consensus 258 lvvy~n 263 (1405)
T KOG3630|consen 258 LVVYGN 263 (1405)
T ss_pred EEEecc
Confidence 998754
No 383
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=37.51 E-value=37 Score=32.10 Aligned_cols=52 Identities=15% Similarity=-0.006 Sum_probs=38.4
Q ss_pred HHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 308 VQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 308 ~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
..+...|++++|+...+....-++ .-...+...|..++..|++++|...|.+
T Consensus 66 ~~~~~~g~~~~A~~~y~~Al~l~p-----~~~~a~~~lg~~l~~~g~~~eAi~~~~~ 117 (144)
T PRK15359 66 GTWMMLKEYTTAINFYGHALMLDA-----SHPEPVYQTGVCLKMMGEPGLAREAFQT 117 (144)
T ss_pred HHHHHHhhHHHHHHHHHHHHhcCC-----CCcHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 345678999999999976421111 1224566779999999999999999986
No 384
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=37.42 E-value=4.7e+02 Score=26.74 Aligned_cols=191 Identities=19% Similarity=0.185 Sum_probs=95.4
Q ss_pred CEE-EEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEe-cccCceeeEeC-cEEEEeCC
Q 003405 27 LKI-LLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVL-ASRQLLLSLSE-SIAFHRLP 103 (823)
Q Consensus 27 ~~L-~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~-~~~~~Ll~l~d-~l~~~~L~ 103 (823)
+.| |+....|.|+.++..... ...+. . ....-+.+. +. +.+++... ++.++++.
T Consensus 12 g~l~~~D~~~~~i~~~~~~~~~-------------------~~~~~-~--~~~~G~~~~~~~-g~l~v~~~~~~~~~d~~ 68 (246)
T PF08450_consen 12 GRLYWVDIPGGRIYRVDPDTGE-------------------VEVID-L--PGPNGMAFDRPD-GRLYVADSGGIAVVDPD 68 (246)
T ss_dssp TEEEEEETTTTEEEEEETTTTE-------------------EEEEE-S--SSEEEEEEECTT-SEEEEEETTCEEEEETT
T ss_pred CEEEEEEcCCCEEEEEECCCCe-------------------EEEEe-c--CCCceEEEEccC-CEEEEEEcCceEEEecC
Confidence 444 555678999998755321 11111 1 124445555 44 55555554 57777765
Q ss_pred CC--ccccccc----CCCCcEEEEeeCCCceEEEEEcCe--------EEEEEEcCCCceeEeeeecCCCCceEEEec--C
Q 003405 104 NL--ETIAVLT----KAKGANVYSWDDRRGFLCFARQKR--------VCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--G 167 (823)
Q Consensus 104 ~l--~~~~~i~----~~kg~~~fa~~~~~~~l~V~~kkk--------i~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~ 167 (823)
+- +.+.... ....++.++++++ |.+.++.... =.+|++..+.....+.+ --..|.+|+|. +
T Consensus 69 ~g~~~~~~~~~~~~~~~~~~ND~~vd~~-G~ly~t~~~~~~~~~~~~g~v~~~~~~~~~~~~~~--~~~~pNGi~~s~dg 145 (246)
T PF08450_consen 69 TGKVTVLADLPDGGVPFNRPNDVAVDPD-GNLYVTDSGGGGASGIDPGSVYRIDPDGKVTVVAD--GLGFPNGIAFSPDG 145 (246)
T ss_dssp TTEEEEEEEEETTCSCTEEEEEEEE-TT-S-EEEEEECCBCTTCGGSEEEEEEETTSEEEEEEE--EESSEEEEEEETTS
T ss_pred CCcEEEEeeccCCCcccCCCceEEEcCC-CCEEEEecCCCccccccccceEEECCCCeEEEEec--CcccccceEECCcc
Confidence 43 2222221 3345677888876 5577763321 34788775422222221 11357789988 4
Q ss_pred CeEEEEEcCc--eEEEEcCC--CCe--eecc-CCCC-CCCCEEEEc-cCCeEEEE--eCCeEEEEcCCCccccCCceeec
Q 003405 168 ENICIAIRKG--YMILNATN--GAL--SEVF-PSGR-IGPPLVVSL-LSGELLLG--KENIGVFVDQNGKLLQADRICWS 236 (823)
Q Consensus 168 ~~i~v~~~~~--y~lidl~~--~~~--~~L~-~~~~-~~~p~i~~~-~~~EfLL~--~~~~gvfv~~~G~~~~~~~i~w~ 236 (823)
+.++|+.... ...+++.. +.. ...+ .... ...|--+.+ .++.+.++ .++....++.+|+.. ..|..+
T Consensus 146 ~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~G~~~--~~i~~p 223 (246)
T PF08450_consen 146 KTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPDGKLL--REIELP 223 (246)
T ss_dssp SEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETTSCEE--EEEE-S
T ss_pred hheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCCccEE--EEEcCC
Confidence 5788887643 44556542 211 1222 2111 122443333 45566664 345566677788754 344444
Q ss_pred -CCCcEEEEe
Q 003405 237 -EAPIAVIIQ 245 (823)
Q Consensus 237 -~~P~~v~~~ 245 (823)
..|..+++-
T Consensus 224 ~~~~t~~~fg 233 (246)
T PF08450_consen 224 VPRPTNCAFG 233 (246)
T ss_dssp SSSEEEEEEE
T ss_pred CCCEEEEEEE
Confidence 455666653
No 385
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=37.20 E-value=4.9e+02 Score=33.17 Aligned_cols=188 Identities=10% Similarity=-0.019 Sum_probs=96.6
Q ss_pred HHHHHHHHHhccHHHHHHHHHHHhhcccCCCCccc-ccccCChHHHHHHhhcC---CCCChhhH--HH----------hh
Q 003405 550 TALLELYKSNARHREALKLLHELVEESKSNQSQDE-HTQKFNPESIIEYLKPL---CGTDPMLV--LE----------FS 613 (823)
Q Consensus 550 ~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~-~~~~~~~~~~i~yL~~L---~~~~~~li--~~----------y~ 613 (823)
..|+.+|...|+.++|+..+.+....+-.+...-. +........++.+..++ .+.+.+.. +- |.
T Consensus 82 ~~LA~~yl~~g~~~~A~~~~~kAv~ldP~n~~~~~~La~i~~~~kA~~~ye~l~~~~P~n~~~~~~la~~~~~~~~l~y~ 161 (987)
T PRK09782 82 LYLAEAYRHFGHDDRARLLLEDQLKRHPGDARLERSLAAIPVEVKSVTTVEELLAQQKACDAVPTLRCRSEVGQNALRLA 161 (987)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHhcCcccHHHHHHHHHhccChhHHHHHHHHHHhCCCChhHHHHHHHHhhccchhhhh
Confidence 57888888888888888888887655432211000 00111222333443333 22222211 11 22
Q ss_pred hhhhhcCccccccccccCCCC-hHHHHHHHh------hcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhh
Q 003405 614 MLVLESCPTQTIELFLSGNIP-ADLVNSYLK------QYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYS 686 (823)
Q Consensus 614 ~wll~~~p~~~~~if~~~~l~-~~~Vl~~L~------~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~ 686 (823)
.+ ..+..+++ .-..... ...|+.+.. ..+.+.++..|+.++.. +..+.....+|..+|+..+.
T Consensus 162 q~---eqAl~AL~-lr~~~~~~~~~vL~L~~~rlY~~l~dw~~Ai~lL~~L~k~--~pl~~~~~~~L~~ay~q~l~---- 231 (987)
T PRK09782 162 QL---PVARAQLN-DATFAASPEGKTLRTDLLQRAIYLKQWSQADTLYNEARQQ--NTLSAAERRQWFDVLLAGQL---- 231 (987)
T ss_pred hH---HHHHHHHH-HhhhCCCCCcHHHHHHHHHHHHHHhCHHHHHHHHHHHHhc--CCCCHHHHHHHHHHHHHhhC----
Confidence 22 22223343 1111222 223333331 12235678888888864 34567888888888886421
Q ss_pred hhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHH----HhCCCchhH
Q 003405 687 DLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVH----KVFLINQPV 762 (823)
Q Consensus 687 ~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~----~L~D~~~a~ 762 (823)
++++..+ |.+ .++ .+..+....+=.|-+.|++++|.+++.. .++++.+.-
T Consensus 232 ----------------~~~a~al------~~~--~lk--~d~~l~~ala~~yi~~G~~~~A~~~L~~~~~~~~~~~~~~~ 285 (987)
T PRK09782 232 ----------------DDRLLAL------QSQ--GIF--TDPQSRITYATALAYRGEKARLQHYLIENKPLFTTDAQEKS 285 (987)
T ss_pred ----------------HHHHHHH------hch--hcc--cCHHHHHHHHHHHHHCCCHHHHHHHHHhCcccccCCCccHH
Confidence 1333333 221 222 2334556667778888888888888874 345555566
Q ss_pred HHHHHHHhcCC
Q 003405 763 FLLIRRMAMDI 773 (823)
Q Consensus 763 ~~~l~~~y~~~ 773 (823)
|.-++.-+...
T Consensus 286 ~~~~l~r~~~~ 296 (987)
T PRK09782 286 WLYLLSKYSAN 296 (987)
T ss_pred HHHHHHhccCc
Confidence 65555544444
No 386
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=35.75 E-value=40 Score=23.73 Aligned_cols=19 Identities=21% Similarity=0.377 Sum_probs=16.3
Q ss_pred CEEEEEeCCCcEEEEcCCC
Q 003405 27 LKILLGCSDGSLKIYSPGS 45 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~ 45 (823)
+.+|+||.+|.|+.++...
T Consensus 1 ~~v~~~~~~g~l~AlD~~T 19 (38)
T PF01011_consen 1 GRVYVGTPDGYLYALDAKT 19 (38)
T ss_dssp TEEEEETTTSEEEEEETTT
T ss_pred CEEEEeCCCCEEEEEECCC
Confidence 4799999999999998653
No 387
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=35.32 E-value=3.5e+02 Score=28.00 Aligned_cols=111 Identities=14% Similarity=0.209 Sum_probs=58.6
Q ss_pred eEEEEEEcCCCceeEeeeecCCCCceEEEecCC-e-EEEEEcCceEE--EE--cCCCCe---eeccCCCCCCCCEEEEcc
Q 003405 136 RVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGE-N-ICIAIRKGYMI--LN--ATNGAL---SEVFPSGRIGPPLVVSLL 206 (823)
Q Consensus 136 ki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~-~-i~v~~~~~y~l--id--l~~~~~---~~L~~~~~~~~p~i~~~~ 206 (823)
+-.+|.|..+.+...++. .-+.+.+++|..+ . .++-.+..|.+ +| +.+|.. ..++...++ +|.-...+
T Consensus 138 ~g~Ly~~~~~h~v~~i~~--~v~IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~-~~~e~~~P 214 (310)
T KOG4499|consen 138 GGELYSWLAGHQVELIWN--CVGISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKS-QPFESLEP 214 (310)
T ss_pred ccEEEEeccCCCceeeeh--hccCCccccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccC-CCcCCCCC
Confidence 345777765444443332 2245668899843 3 44444467777 44 666642 333333221 12111122
Q ss_pred CCeEEEEeCCeEEEEcCCCccccCCceeecCCCcEEEEeCCEEEEEeCCeEEEEEccCCCceeEEEeeCCcccc
Q 003405 207 SGELLLGKENIGVFVDQNGKLLQADRICWSEAPIAVIIQKPYAIALLPRRVEVRSLRVPYALIQTIVLQNVRHL 280 (823)
Q Consensus 207 ~~EfLL~~~~~gvfv~~~G~~~~~~~i~w~~~P~~v~~~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~~~~l 280 (823)
+ |+.+|.+|.. |+-++....+.-.+.. ++.+.++|.+|..+.-
T Consensus 215 D----------Gm~ID~eG~L--------------------~Va~~ng~~V~~~dp~-tGK~L~eiklPt~qit 257 (310)
T KOG4499|consen 215 D----------GMTIDTEGNL--------------------YVATFNGGTVQKVDPT-TGKILLEIKLPTPQIT 257 (310)
T ss_pred C----------cceEccCCcE--------------------EEEEecCcEEEEECCC-CCcEEEEEEcCCCceE
Confidence 2 3333555542 5555666667766765 5888899888865543
No 388
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Studies of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism is unclear.
Probab=35.15 E-value=6.7e+02 Score=27.86 Aligned_cols=193 Identities=13% Similarity=0.140 Sum_probs=0.0
Q ss_pred CCEEEEEeCCC---cEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c---E
Q 003405 26 GLKILLGCSDG---SLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S---I 97 (823)
Q Consensus 26 ~~~L~vGT~~G---~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~---l 97 (823)
+++|+.++..+ .|++|++.... ......+ ...+......|..+.+++..+ + |
T Consensus 201 g~~la~~~~~~~~~~i~v~d~~~g~-------------------~~~~~~~-~~~~~~~~~spDg~~l~~~~~~~~~~~i 260 (417)
T TIGR02800 201 GQKLAYVSFESGKPEIYVQDLATGQ-------------------REKVASF-PGMNGAPAFSPDGSKLAVSLSKDGNPDI 260 (417)
T ss_pred CCEEEEEEcCCCCcEEEEEECCCCC-------------------EEEeecC-CCCccceEECCCCCEEEEEECCCCCccE
Q ss_pred EEEeCCCCcccccccCCCCcEEEEeeCCCceEEEEEcC----eEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEE
Q 003405 98 AFHRLPNLETIAVLTKAKGANVYSWDDRRGFLCFARQK----RVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENIC 171 (823)
Q Consensus 98 ~~~~L~~l~~~~~i~~~kg~~~fa~~~~~~~l~V~~kk----ki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~ 171 (823)
.+|++.+-...............++.++...|+++..+ .|.++...++..-..... ........|. |+.|+
T Consensus 261 ~~~d~~~~~~~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~---~~~~~~~~~spdg~~i~ 337 (417)
T TIGR02800 261 YVMDLDGKQLTRLTNGPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFR---GGYNASPSWSPDGDLIA 337 (417)
T ss_pred EEEECCCCCEEECCCCCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecC---CCCccCeEECCCCCEEE
Q ss_pred EEEcC----ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeC-----CeEEEEcCCCccccCCceeecCCCcEE
Q 003405 172 IAIRK----GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKE-----NIGVFVDQNGKLLQADRICWSEAPIAV 242 (823)
Q Consensus 172 v~~~~----~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~-----~~gvfv~~~G~~~~~~~i~w~~~P~~v 242 (823)
++... ...++|+.++..+.+...+....| ...+++.+|+... ....+.+.+|...+.-+.. .+.....
T Consensus 338 ~~~~~~~~~~i~~~d~~~~~~~~l~~~~~~~~p--~~spdg~~l~~~~~~~~~~~l~~~~~~g~~~~~~~~~-~g~~~~~ 414 (417)
T TIGR02800 338 FVHREGGGFNIAVMDLDGGGERVLTDTGLDESP--SFAPNGRMILYATTRGGRGVLGLVSTDGRFRARLPLG-NGDVREP 414 (417)
T ss_pred EEEccCCceEEEEEeCCCCCeEEccCCCCCCCc--eECCCCCEEEEEEeCCCcEEEEEEECCCceeeECCCC-CCCcCCC
Q ss_pred EE
Q 003405 243 II 244 (823)
Q Consensus 243 ~~ 244 (823)
.|
T Consensus 415 ~w 416 (417)
T TIGR02800 415 AW 416 (417)
T ss_pred CC
No 389
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=35.11 E-value=92 Score=30.18 Aligned_cols=54 Identities=17% Similarity=0.210 Sum_probs=37.7
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+...|++++|+..++....... .......++...|..++..|+|++|...+.+
T Consensus 44 ~~~~~g~~~~A~~~~~~al~~~~--~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~ 97 (172)
T PRK02603 44 SAQADGEYAEALENYEEALKLEE--DPNDRSYILYNMGIIYASNGEHDKALEYYHQ 97 (172)
T ss_pred HHHHcCCHHHHHHHHHHHHHHhh--ccchHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 45678999999998875421100 0011234677789999999999999999876
No 390
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=35.08 E-value=6e+02 Score=29.43 Aligned_cols=193 Identities=14% Similarity=0.255 Sum_probs=93.6
Q ss_pred CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeE--eCc-EEEEeCC
Q 003405 27 LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSL--SES-IAFHRLP 103 (823)
Q Consensus 27 ~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l--~d~-l~~~~L~ 103 (823)
-.|.||-+.|.+.+++....+. ..+...-....|..|+.|+=++...-+++. .+| +.+|+..
T Consensus 186 ~dllIGf~tGqvq~idp~~~~~---------------sklfne~r~i~ktsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~ 250 (636)
T KOG2394|consen 186 LDLLIGFTTGQVQLIDPINFEV---------------SKLFNEERLINKSSVTCIKWVPGSDSLFLVAHASGNLYLYDKE 250 (636)
T ss_pred cceEEeeccCceEEecchhhHH---------------HHhhhhcccccccceEEEEEEeCCCceEEEEEecCceEEeecc
Confidence 3689999999999887443111 011101112346789999999876544444 444 8887652
Q ss_pred CCccc--ccccCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEec--CCeEEEEEcCce-
Q 003405 104 NLETI--AVLTKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWC--GENICIAIRKGY- 178 (823)
Q Consensus 104 ~l~~~--~~i~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~~y- 178 (823)
-.-.. +.-.-.|.-..|++....+ .+.+=-+.+|.-+ .+.|...+|. |..+..+...+|
T Consensus 251 ~~~~~t~p~~~~~k~~~~f~i~t~ks-----k~~rNPv~~w~~~-----------~g~in~f~FS~DG~~LA~VSqDGfL 314 (636)
T KOG2394|consen 251 IVCGATAPSYQALKDGDQFAILTSKS-----KKTRNPVARWHIG-----------EGSINEFAFSPDGKYLATVSQDGFL 314 (636)
T ss_pred ccccCCCCcccccCCCCeeEEeeeec-----cccCCccceeEec-----------cccccceeEcCCCceEEEEecCceE
Confidence 11100 0001112122233221110 0000012222211 1245555665 445666666666
Q ss_pred EEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE--EeCCeEEEEc-CCCccccC--CceeecCCCcEEEEeCCEEEEEe
Q 003405 179 MILNATNGALSEVFPSGRIGPPLVVSLLSGELLL--GKENIGVFVD-QNGKLLQA--DRICWSEAPIAVIIQKPYAIALL 253 (823)
Q Consensus 179 ~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL--~~~~~gvfv~-~~G~~~~~--~~i~w~~~P~~v~~~~PYll~~~ 253 (823)
.++|..+-....++..=-.+--|+++.+|+.|++ +-|++.-... .+++.+-| +-=.| .+.+.+.||.....
T Consensus 315 RvF~fdt~eLlg~mkSYFGGLLCvcWSPDGKyIvtGGEDDLVtVwSf~erRVVARGqGHkSW----Vs~VaFDpytt~~e 390 (636)
T KOG2394|consen 315 RIFDFDTQELLGVMKSYFGGLLCVCWSPDGKYIVTGGEDDLVTVWSFEERRVVARGQGHKSW----VSVVAFDPYTTSTE 390 (636)
T ss_pred EEeeccHHHHHHHHHhhccceEEEEEcCCccEEEecCCcceEEEEEeccceEEEeccccccc----eeeEeecccccccc
Confidence 5777765332222211001346889999999998 3566554433 34443311 12234 34445567887766
Q ss_pred C
Q 003405 254 P 254 (823)
Q Consensus 254 ~ 254 (823)
+
T Consensus 391 e 391 (636)
T KOG2394|consen 391 E 391 (636)
T ss_pred c
Confidence 4
No 391
>PRK05137 tolB translocation protein TolB; Provisional
Probab=35.04 E-value=7.1e+02 Score=28.15 Aligned_cols=134 Identities=15% Similarity=0.144 Sum_probs=77.2
Q ss_pred CCCeeEEEEecccCceeeEeC--c---EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEEc--CeEEEEEEcC-C
Q 003405 75 KKPILSMEVLASRQLLLSLSE--S---IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFARQ--KRVCIFRHDG-G 145 (823)
Q Consensus 75 k~~I~qI~~~~~~~~Ll~l~d--~---l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~k--kki~l~~~~~-~ 145 (823)
..+|......|+.+.|+.+++ + |.++++.+-+. ..+...++ +...+++++...|++... +...||.++. +
T Consensus 201 ~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~-~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~ 279 (435)
T PRK05137 201 SSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQR-ELVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRS 279 (435)
T ss_pred CCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcE-EEeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCC
Confidence 467888888998888877764 2 88999865432 22333443 345677777666765532 3345666652 2
Q ss_pred CceeEeeeecCCCCceEEEec--CCeEEEEEcC----ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE
Q 003405 146 RGFVEVKDFGVPDTVKSMSWC--GENICIAIRK----GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL 212 (823)
Q Consensus 146 ~~f~~~kei~~~~~~~~l~~~--~~~i~v~~~~----~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL 212 (823)
...+.+.. .+....+..|. |..|++++.+ ..+++|+.++..+.+...+.. .....+.++++.++
T Consensus 280 ~~~~~Lt~--~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~~-~~~~~~SpdG~~ia 349 (435)
T PRK05137 280 GTTTRLTD--SPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGGGR-YSTPVWSPRGDLIA 349 (435)
T ss_pred CceEEccC--CCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCCCc-ccCeEECCCCCEEE
Confidence 22222211 22334566776 5678777753 467788888877766543211 11133456676655
No 392
>PF14156 AbbA_antirepres: Antirepressor AbbA
Probab=34.41 E-value=52 Score=25.75 Aligned_cols=39 Identities=23% Similarity=0.299 Sum_probs=30.8
Q ss_pred hHHHHHhhccccHHHHHHHHHHHhCCCc-------hhHHHHHHHHh
Q 003405 732 EERAILLGKMNQHELALSLYVHKVFLIN-------QPVFLLIRRMA 770 (823)
Q Consensus 732 ~e~~~Ll~klg~h~~AL~ilv~~L~D~~-------~a~~~~l~~~y 770 (823)
+|+.+|+.-+=+|+-|++++..+|.|++ ..-|-.|.+.|
T Consensus 12 EE~~LLLdiLf~q~YA~Ells~El~DIE~G~K~vD~~~Yk~l~rLy 57 (63)
T PF14156_consen 12 EEKKLLLDILFQQNYASELLSSELNDIENGTKNVDESQYKQLLRLY 57 (63)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccHHHHHHHHHHH
Confidence 6889999999999999999999999983 34455555544
No 393
>PF10607 CLTH: CTLH/CRA C-terminal to LisH motif domain; InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined [].
Probab=34.05 E-value=79 Score=29.69 Aligned_cols=58 Identities=24% Similarity=0.198 Sum_probs=42.6
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhh--hcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRA--AKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~--~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+|...+.+|+++.|++.++...+. .++. .-.-.++.+....+..+++..+|+.+..+
T Consensus 6 ~~I~~~I~~g~i~~Ai~w~~~~~~~--l~~~~~~L~f~L~~q~fiell~~~~~~~Ai~y~r~ 65 (145)
T PF10607_consen 6 KKIRQAILNGDIDPAIEWLNENFPE--LLKRNSSLEFELRCQQFIELLREGDIMEAIEYARK 65 (145)
T ss_pred HHHHHHHHcCCHHHHHHHHHHcCHH--HHhcCCchhHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 4677888999999999999886311 1111 12346677777888889999999998776
No 394
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=33.93 E-value=8.3e+02 Score=28.60 Aligned_cols=62 Identities=24% Similarity=0.261 Sum_probs=45.5
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCCC
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPSI 379 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~l 379 (823)
++.-+.+.|.+++|+..++..... -+ ....+...-|..++.-|++++|...+.. +|..-|+-
T Consensus 10 ~~~il~e~g~~~~AL~~L~~~~~~--I~---Dk~~~~E~rA~ll~kLg~~~eA~~~y~~-------Li~rNPdn 71 (517)
T PF12569_consen 10 KNSILEEAGDYEEALEHLEKNEKQ--IL---DKLAVLEKRAELLLKLGRKEEAEKIYRE-------LIDRNPDN 71 (517)
T ss_pred HHHHHHHCCCHHHHHHHHHhhhhh--CC---CHHHHHHHHHHHHHHcCCHHHHHHHHHH-------HHHHCCCc
Confidence 566778999999999999775211 11 1234566778999999999999999875 55555554
No 395
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=33.90 E-value=7.5e+02 Score=29.64 Aligned_cols=30 Identities=20% Similarity=0.244 Sum_probs=24.2
Q ss_pred CCcEEEEEEeC--CEEEEEeCCCcEEEEcCCC
Q 003405 16 SPKIDAVASYG--LKILLGCSDGSLKIYSPGS 45 (823)
Q Consensus 16 ~~~I~ci~~~~--~~L~vGT~~G~l~~y~~~~ 45 (823)
+.+|||++.-. .++|.|.+.|.|..-.++.
T Consensus 124 ~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 124 KCRVTALEWSKNGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred CceEEEEEecccccEEeecCCCceEEEEEech
Confidence 56899987553 5999999999998877664
No 396
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=33.55 E-value=56 Score=26.72 Aligned_cols=27 Identities=19% Similarity=0.337 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 339 GSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 339 ~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
..+....|..++..|+|++|+++|.++
T Consensus 5 a~~~~~la~~~~~~~~~~~A~~~~~~a 31 (78)
T PF13424_consen 5 ANAYNNLARVYRELGRYDEALDYYEKA 31 (78)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 456677899999999999999999974
No 397
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=33.45 E-value=96 Score=36.19 Aligned_cols=63 Identities=21% Similarity=0.301 Sum_probs=44.1
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceee-eeecC--CCCCCeeEEEEecccCceeeEeCc--EEEE
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELE-RTISG--FSKKPILSMEVLASRQLLLSLSES--IAFH 100 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~-~~~~~--~~k~~I~qI~~~~~~~~Ll~l~d~--l~~~ 100 (823)
+.-||+|.++|.|..|+..... +.+. ++++. .++.+|.++.-+|....||..+++ +++|
T Consensus 64 eHiLavadE~G~i~l~dt~~~~----------------fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~W 127 (720)
T KOG0321|consen 64 EHILAVADEDGGIILFDTKSIV----------------FRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPW 127 (720)
T ss_pred cceEEEecCCCceeeecchhhh----------------cchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeee
Confidence 4579999999999999855322 2211 12222 247799999999976677777663 9999
Q ss_pred eCCC
Q 003405 101 RLPN 104 (823)
Q Consensus 101 ~L~~ 104 (823)
++..
T Consensus 128 dvk~ 131 (720)
T KOG0321|consen 128 DVKT 131 (720)
T ss_pred eecc
Confidence 9854
No 398
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance [].
Probab=33.43 E-value=9.6e+02 Score=29.20 Aligned_cols=148 Identities=14% Similarity=0.208 Sum_probs=78.2
Q ss_pred CCCcEEEEEEeCCEEEEEeCCCcEEEEcCCCCCCCCCCCCccc-cc-cccccee--eeeecCC-CCCCeeEEEEec--cc
Q 003405 15 CSPKIDAVASYGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQ-SL-RKESYEL--ERTISGF-SKKPILSMEVLA--SR 87 (823)
Q Consensus 15 ~~~~I~ci~~~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~-~l-~~~~~~l--~~~~~~~-~k~~I~qI~~~~--~~ 87 (823)
+++..+|+.. ++.+||+. .+.|++|+......... .+|. .+ .++.+.. .+.+... ....|+.|.+-. ..
T Consensus 39 fKNNLtalsq-~n~LFiA~-~s~I~Vy~~d~l~~~p~--~~p~~~~~t~p~~~~~~D~~~s~~p~PHtIN~i~v~~lg~~ 114 (717)
T PF08728_consen 39 FKNNLTALSQ-RNLLFIAY-QSEIYVYDPDGLTQLPS--RKPCLRFDTKPEFTSTPDRLISTWPFPHTINFIKVGDLGGE 114 (717)
T ss_pred cccceeEEec-CCEEEEEE-CCEEEEEecCCcccccc--cccccccccCccccccccccccCCCCCceeeEEEecccCCe
Confidence 4567788877 88999977 77899998765433110 0100 00 0001100 0000000 134688886654 34
Q ss_pred CceeeEeC-c-EEEEeCCCCc----cc-------ccccCC-------CCcEEE--Eee--CCCceEEEE-EcCeEEEEEE
Q 003405 88 QLLLSLSE-S-IAFHRLPNLE----TI-------AVLTKA-------KGANVY--SWD--DRRGFLCFA-RQKRVCIFRH 142 (823)
Q Consensus 88 ~~Ll~l~d-~-l~~~~L~~l~----~~-------~~i~~~-------kg~~~f--a~~--~~~~~l~V~-~kkki~l~~~ 142 (823)
++|++++| | |.+|...++- .. ...... -+.++. ++. .....|||+ -+..|.||-+
T Consensus 115 EVLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~v~~SaWGLdIh~~~~~rlIAVSsNs~~VTVFaf 194 (717)
T PF08728_consen 115 EVLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLRVGASAWGLDIHDYKKSRLIAVSSNSQEVTVFAF 194 (717)
T ss_pred eEEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEeecCCceeEEEEEecCcceEEEEecCCceEEEEEE
Confidence 67888888 6 8888763321 00 000001 133332 222 223346776 4456888877
Q ss_pred cC-CCceeEeeeecCCCCceEEEec
Q 003405 143 DG-GRGFVEVKDFGVPDTVKSMSWC 166 (823)
Q Consensus 143 ~~-~~~f~~~kei~~~~~~~~l~~~ 166 (823)
.. +..+...+++...+.|=+++|.
T Consensus 195 ~l~~~r~~~~~s~~~~hNIP~VSFl 219 (717)
T PF08728_consen 195 ALVDERFYHVPSHQHSHNIPNVSFL 219 (717)
T ss_pred eccccccccccccccccCCCeeEee
Confidence 64 3335445566677788888887
No 399
>PF07035 Mic1: Colon cancer-associated protein Mic1-like; InterPro: IPR009755 This entry represents the C terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1.
Probab=33.38 E-value=2.2e+02 Score=27.82 Aligned_cols=66 Identities=24% Similarity=0.304 Sum_probs=37.6
Q ss_pred HHHHHHHHHHhcCChhhHHhhhcCC----------------C--cccHHHHHHHHHhcC-cHHHHHHHHHHhccHHHHHH
Q 003405 507 LDTALLQALLLTGQSSAALELLKGL----------------N--YCDVKICEEILQKKN-HYTALLELYKSNARHREALK 567 (823)
Q Consensus 507 vDT~Ll~~y~~~~~~~~l~~ll~~~----------------n--~c~~~~~~~~L~~~~-~~~~L~~ly~~~g~~~~AL~ 567 (823)
+...|+.+....+....+..|++-- + ..-..-+.+.|+.-+ .+++.+.++..+|++-+||.
T Consensus 31 L~~lli~lLi~~~~~~~L~qllq~~Vi~DSk~lA~~LLs~~~~~~~~~Ql~lDMLkRL~~~~~~iievLL~~g~vl~ALr 110 (167)
T PF07035_consen 31 LYELLIDLLIRNGQFSQLHQLLQYHVIPDSKPLACQLLSLGNQYPPAYQLGLDMLKRLGTAYEEIIEVLLSKGQVLEALR 110 (167)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHhhcccCCcHHHHHHHHHhHccChHHHHHHHHHHHHhhhhHHHHHHHHHhCCCHHHHHH
Confidence 4555666666666555555555521 1 112344555555545 66666667777777777777
Q ss_pred HHHHH
Q 003405 568 LLHEL 572 (823)
Q Consensus 568 ll~~l 572 (823)
+.++.
T Consensus 111 ~ar~~ 115 (167)
T PF07035_consen 111 YARQY 115 (167)
T ss_pred HHHHc
Confidence 76653
No 400
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=33.12 E-value=2.9e+02 Score=32.43 Aligned_cols=57 Identities=18% Similarity=0.271 Sum_probs=40.6
Q ss_pred cEEEEeeCCCceEEEEEcCeEEEEEEcCCC------ceeEeeeecCCCCceEEEecCC-eEEEEEc
Q 003405 117 ANVYSWDDRRGFLCFARQKRVCIFRHDGGR------GFVEVKDFGVPDTVKSMSWCGE-NICIAIR 175 (823)
Q Consensus 117 ~~~fa~~~~~~~l~V~~kkki~l~~~~~~~------~f~~~kei~~~~~~~~l~~~~~-~i~v~~~ 175 (823)
+++-|+..+..+++||.+..+--|-|+... .|..+ |.+.+.|.+|.-.++ .+.|++.
T Consensus 158 IhCACWT~DG~RLVVAvGSsLHSyiWd~~qKtL~~CsfcPV--Fdv~~~Icsi~AT~dsqVAvaTE 221 (671)
T PF15390_consen 158 IHCACWTKDGQRLVVAVGSSLHSYIWDSAQKTLHRCSFCPV--FDVGGYICSIEATVDSQVAVATE 221 (671)
T ss_pred EEEEEecCcCCEEEEEeCCeEEEEEecCchhhhhhCCccee--ecCCCceEEEEEeccceEEEEec
Confidence 566677777789999999999999998532 13222 445667888876654 5777765
No 401
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=32.87 E-value=52 Score=20.25 Aligned_cols=25 Identities=20% Similarity=0.406 Sum_probs=20.6
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.+...|..++..++|++|...|.++
T Consensus 3 ~~~~~a~~~~~~~~~~~a~~~~~~~ 27 (34)
T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKA 27 (34)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 3456788999999999999999764
No 402
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=32.73 E-value=70 Score=29.16 Aligned_cols=54 Identities=19% Similarity=0.161 Sum_probs=38.3
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
....+...|++++|+..++..-..+. .....+...|..+...|++++|+..|.+
T Consensus 57 la~~~~~~~~~~~A~~~~~~~~~~~p-----~~~~~~~~la~~~~~~g~~~~A~~~~~~ 110 (135)
T TIGR02552 57 LAACCQMLKEYEEAIDAYALAAALDP-----DDPRPYFHAAECLLALGEPESALKALDL 110 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCC-----CChHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 34455678999999988776421111 1234556678899999999999999976
No 403
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=32.47 E-value=8.6e+02 Score=28.33 Aligned_cols=166 Identities=13% Similarity=0.084 Sum_probs=83.1
Q ss_pred CceeeEeC--c-EEEEeCCCCcccccccC---CCCcEEEEeeCCCceE-EEEEcCeEEEEEEcCCCceeEeeeecCCCCc
Q 003405 88 QLLLSLSE--S-IAFHRLPNLETIAVLTK---AKGANVYSWDDRRGFL-CFARQKRVCIFRHDGGRGFVEVKDFGVPDTV 160 (823)
Q Consensus 88 ~~Ll~l~d--~-l~~~~L~~l~~~~~i~~---~kg~~~fa~~~~~~~l-~V~~kkki~l~~~~~~~~f~~~kei~~~~~~ 160 (823)
..+++++- | |.+|....-+-...+.. --++++...+.+.++| -|+...++..+.....+.++..++. +..+
T Consensus 70 t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~--~~~~ 147 (541)
T KOG4547|consen 70 TSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQ--KPLV 147 (541)
T ss_pred ceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccC--CCcc
Confidence 45666664 3 77777654432222221 1245555445555554 3334445444444433334444432 3467
Q ss_pred eEEEecCC-eEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEE---EEcc--CCeEEEEeCC-----eEEEEcCCCcc-c
Q 003405 161 KSMSWCGE-NICIAIRKGYMILNATNGALSEVFPSGRIGPPLV---VSLL--SGELLLGKEN-----IGVFVDQNGKL-L 228 (823)
Q Consensus 161 ~~l~~~~~-~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i---~~~~--~~EfLL~~~~-----~gvfv~~~G~~-~ 228 (823)
.+++..++ .+.+.-.++..++|++++++..-|+.-.+....+ ..+. .++++|.... .+-+++..++. +
T Consensus 148 ~sl~is~D~~~l~~as~~ik~~~~~~kevv~~ftgh~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~~~~kkks 227 (541)
T KOG4547|consen 148 SSLCISPDGKILLTASRQIKVLDIETKEVVITFTGHGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVEKEDKKKS 227 (541)
T ss_pred ceEEEcCCCCEEEeccceEEEEEccCceEEEEecCCCcceEEEEEEEeccccccceeeeccccccceeEEEEEcccccch
Confidence 77777765 3444444789999999999888887422111111 1111 2578885432 23344443321 1
Q ss_pred cCCceeecCCCcEEE------EeCCEEEEEeCC
Q 003405 229 QADRICWSEAPIAVI------IQKPYAIALLPR 255 (823)
Q Consensus 229 ~~~~i~w~~~P~~v~------~~~PYll~~~~~ 255 (823)
-..++.-++.|..+- ...|++++.+-+
T Consensus 228 ~~~sl~~~dipv~~ds~~~ed~~~~l~lAst~~ 260 (541)
T KOG4547|consen 228 LSCSLTVPDIPVTSDSGLLEDGTIPLVLASTLI 260 (541)
T ss_pred hheeeccCCCCeEeccccccccccceEEeeecc
Confidence 134555566654432 233666666543
No 404
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=32.27 E-value=1.2e+02 Score=31.88 Aligned_cols=28 Identities=25% Similarity=0.381 Sum_probs=23.7
Q ss_pred CcHHHHHHHHHHhccHHHHHHHHHHHhh
Q 003405 547 NHYTALLELYKSNARHREALKLLHELVE 574 (823)
Q Consensus 547 ~~~~~L~~ly~~~g~~~~AL~ll~~l~~ 574 (823)
..+..++.+|...|++++|++.+.+-..
T Consensus 147 ~~~~~~a~~~~~~G~~~~A~~~~~~al~ 174 (280)
T PF13429_consen 147 RFWLALAEIYEQLGDPDKALRDYRKALE 174 (280)
T ss_dssp HHHHHHHHHHHHCCHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3466789999999999999999987653
No 405
>KOG2168 consensus Cullins [Cell cycle control, cell division, chromosome partitioning]
Probab=31.98 E-value=5.6e+02 Score=31.50 Aligned_cols=208 Identities=13% Similarity=0.153 Sum_probs=0.0
Q ss_pred ccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCccccccccccCCCChHH--
Q 003405 560 ARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIELFLSGNIPADL-- 637 (823)
Q Consensus 560 g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~if~~~~l~~~~-- 637 (823)
|+++.|+.++.+. +....+....|| ...+|+ +++..+..+-+.+.-+.-++.+
T Consensus 482 gqfe~AI~fL~~~-----------~~~~~dAVH~AI------------~l~~lg--lL~~~~s~~~~ll~~d~~d~~k~~ 536 (835)
T KOG2168|consen 482 GQFERAIEFLHRE-----------EPNRIDAVHVAI------------ALAELG--LLRTSSSTSQELLSIDPNDPPKSR 536 (835)
T ss_pred HhHHHHHHHHHhh-----------cCCcchhHHHHH------------HHHHhh--hhccCCCCCCcccccCCCCCcccc
Q ss_pred -------HHHHHhhcCchhHHHHHHH--HhhcccCCCChh-HHHHHHHHHHHH---HHHHhhhhhhhcccCcccchHHHH
Q 003405 638 -------VNSYLKQYSPSMQGRYLEL--MLAMNENSISGN-LQNEMVQIYLSE---VLDWYSDLSAQQKWDEKAYSPTRK 704 (823)
Q Consensus 638 -------Vl~~L~~~~~~~~~~YLE~--li~~~~~~~~~~-~h~~L~~lYl~~---i~~~~~~~~~~~~~~~~~~~~~r~ 704 (823)
++.+++.+.......+|+| ++.-+......+ +|+.+..+-++. ....+...+.+|...++.+.+.+.
T Consensus 537 ~lnf~rLi~~Ytk~fe~~d~~~al~y~~~lr~~~d~q~~~l~l~~v~~lVl~t~~~f~~iLG~i~~dG~r~~G~l~~f~~ 616 (835)
T KOG2168|consen 537 RLNFARLIIAYTKSFEYTDTRVALQYYYLLRLNKDPQGSNLFLKCVCELVLETEEEFDLILGKIKPDGSREPGLLDEFLP 616 (835)
T ss_pred cccHHHHHHHHHHHHHhccchhhhheeeeecccCChhHHHHHHHHHHHHHHhccccHHHHhcccCCCCCCCcchHhhhcc
Q ss_pred ---HHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccccHHHHHHHHHHHhCCC-------------chhHHHHHHH
Q 003405 705 ---KLLSALESISGYNPEVLLKRLPADALYEERAILLGKMNQHELALSLYVHKVFLI-------------NQPVFLLIRR 768 (823)
Q Consensus 705 ---kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg~h~~AL~ilv~~L~D~-------------~~a~~~~l~~ 768 (823)
-+..++.. +-+.+...++++.-+.||...|.+..|+.++-.-|.+. -.++-..+..
T Consensus 617 ~~~~~~~i~~~--------vA~~a~~~G~~~~sI~LY~lag~yd~al~link~LS~~l~~~~~~~~n~erl~~La~~~~~ 688 (835)
T KOG2168|consen 617 LIEDLQKIILE--------VASEADEDGLFEDAILLYHLAGDYDKALELINKLLSQVLHSPTLGQSNKERLGDLALSMND 688 (835)
T ss_pred chhhHHHHHHH--------HHHHHHhcCCHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhhcccCCcchhhHHHHHHHHHH
Q ss_pred HhcCCCCCcchhhhccchhHHHHHHHHHHHHH
Q 003405 769 MAMDIKPLVTEHEIKHINWRVLQATIIKLFFS 800 (823)
Q Consensus 769 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 800 (823)
+|-+.+-.....+.+-...=+.=.++-++|..
T Consensus 689 ~y~~~~~~~~~~~~~t~~lLl~~~~~f~~y~~ 720 (835)
T KOG2168|consen 689 IYESNKGDSAKVVVKTLSLLLDLVSFFDLYHN 720 (835)
T ss_pred HHHhccCcchhhHHHHHHHHHHHHHHHHHHhh
No 406
>PF10433 MMS1_N: Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=31.22 E-value=8.8e+02 Score=28.07 Aligned_cols=216 Identities=18% Similarity=0.191 Sum_probs=106.7
Q ss_pred CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEeC-c---EEEE
Q 003405 26 GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLSE-S---IAFH 100 (823)
Q Consensus 26 ~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~d-~---l~~~ 100 (823)
++++++++++|.|+...+..+.. ...+ ..+-.. ..+.+.+..+... .++++-++ | +.-+
T Consensus 222 ~~~~lL~~e~G~l~~l~l~~~~~--------------~i~i-~~~g~~-~~~~s~l~~l~~g~d~lf~gs~~gds~l~~~ 285 (504)
T PF10433_consen 222 GDRILLQDEDGDLYLLTLDNDGG--------------SISI-TYLGTL-CSIASSLTYLKNGGDYLFVGSEFGDSQLLQI 285 (504)
T ss_dssp SSEEEEEETTSEEEEEEEEEEEE--------------EEEE-EEEEE---S-ESEEEEESTT--EEEEEESSS-EEEEEE
T ss_pred CCEEEEEeCCCeEEEEEEEECCC--------------eEEE-EEcCCc-CChhheEEEEcCCCEEEEEEEecCCcEEEEE
Confidence 46899999999999887664220 0111 111111 2345667767653 26776666 3 4555
Q ss_pred eCCCCcccccccCCCCcEEEEeeCC----Cc------eEEEE----EcCeEEEEEEcCCCcee--EeeeecCCCCceEEE
Q 003405 101 RLPNLETIAVLTKAKGANVYSWDDR----RG------FLCFA----RQKRVCIFRHDGGRGFV--EVKDFGVPDTVKSMS 164 (823)
Q Consensus 101 ~L~~l~~~~~i~~~kg~~~fa~~~~----~~------~l~V~----~kkki~l~~~~~~~~f~--~~kei~~~~~~~~l~ 164 (823)
....++.+.++.....+..|++.+. .. .|+++ .+..|.+++..- ... ......+|+ ++.+=
T Consensus 286 ~~~~l~~~~~~~N~~Pi~D~~v~~~~~~~~~~~~~~~~lv~~sG~g~~gsL~~lr~Gi--~~~~~~~~~~~l~~-v~~iW 362 (504)
T PF10433_consen 286 SLSNLEVLDSLPNWGPIVDFCVVDSSNSGQPSNPSSDQLVACSGAGKRGSLRILRNGI--GIEGLELASSELPG-VTGIW 362 (504)
T ss_dssp ESESEEEEEEE----SEEEEEEE-TSSSSS-------EEEEEESSGGG-EEEEEEESB--EEE--EEEEEEEST-EEEEE
T ss_pred eCCCcEEEEeccCcCCccceEEeccccCCCCcccccceEEEEECcCCCCcEEEEeccC--CceeeeeeccCCCC-ceEEE
Confidence 5556666666666677888888432 12 55444 234566666542 133 222344555 66652
Q ss_pred ec------CCeEEEEEcCceEEEEcCC----CCeeecc--CCCCCCCCEEE-EccCCeEEEEeCCeEEEEcCCCccccCC
Q 003405 165 WC------GENICIAIRKGYMILNATN----GALSEVF--PSGRIGPPLVV-SLLSGELLLGKENIGVFVDQNGKLLQAD 231 (823)
Q Consensus 165 ~~------~~~i~v~~~~~y~lidl~~----~~~~~L~--~~~~~~~p~i~-~~~~~EfLL~~~~~gvfv~~~G~~~~~~ 231 (823)
.. +..+++++..+-.++-+.. ....++- ....+.+-+.+ .+.++-++=.+.+....++..+. +.
T Consensus 363 ~l~~~~~~~~~lv~S~~~~T~vl~~~~~d~~e~~~e~~~~~f~~~~~Tl~~~~~~~~~ivQVt~~~i~l~~~~~~---~~ 439 (504)
T PF10433_consen 363 TLKLSSSDHSYLVLSFPNETRVLQISEGDDGEEVEEVEEDGFDTDEPTLAAGNVGDGRIVQVTPKGIRLIDLEDG---KL 439 (504)
T ss_dssp EE-SSSSSBSEEEEEESSEEEEEEES----SSEEEEE---TS-SSS-EEEEEEETTTEEEEEESSEEEEEESSST---SE
T ss_pred EeeecCCCceEEEEEcCCceEEEEEecccCCcchhhhhhccCCCCCCCeEEEEcCCCeEEEEecCeEEEEECCCC---eE
Confidence 22 2468888887777777752 2332220 11111112222 33433333366555555553222 22
Q ss_pred ceee---cCC-CcEEEEeCCEEEEEeC-CeEEEEEcc
Q 003405 232 RICW---SEA-PIAVIIQKPYAIALLP-RRVEVRSLR 263 (823)
Q Consensus 232 ~i~w---~~~-P~~v~~~~PYll~~~~-~~ieV~~l~ 263 (823)
.-.| .+. .....+..|++++... +.+....+.
T Consensus 440 ~~~w~~~~~~~I~~a~~~~~~v~v~~~~~~~~~~~~~ 476 (504)
T PF10433_consen 440 TQEWKPPAGSIIVAASINDPQVLVALSGGELVYFELD 476 (504)
T ss_dssp EEEEE-TTS---SEEEESSSEEEEEE-TTEEEEEEEE
T ss_pred EEEEeCCCCCeEEEEEECCCEEEEEEeCCcEEEEEEE
Confidence 3456 222 3445567789887774 555555554
No 407
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=30.66 E-value=92 Score=35.00 Aligned_cols=57 Identities=19% Similarity=0.207 Sum_probs=39.3
Q ss_pred hhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 303 LGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 303 ~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
+..|++-|+++++++.|+.+++......+ .. -.-....|..+...|+|++|+..+-.
T Consensus 237 L~~Qa~fLl~k~~~~lAL~iAk~av~lsP----~~-f~~W~~La~~Yi~~~d~e~ALlaLNs 293 (395)
T PF09295_consen 237 LNLQAEFLLSKKKYELALEIAKKAVELSP----SE-FETWYQLAECYIQLGDFENALLALNS 293 (395)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHhCc----hh-HHHHHHHHHHHHhcCCHHHHHHHHhc
Confidence 45688889999999999999987421111 01 12233457778888999999977664
No 408
>KOG3611 consensus Semaphorins [Signal transduction mechanisms]
Probab=30.63 E-value=2.1e+02 Score=34.85 Aligned_cols=80 Identities=21% Similarity=0.267 Sum_probs=50.7
Q ss_pred ccccCC-CCcEEEEEEe------C--CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCC-CCCee
Q 003405 10 ELISNC-SPKIDAVASY------G--LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFS-KKPIL 79 (823)
Q Consensus 10 ~l~~~~-~~~I~ci~~~------~--~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~-k~~I~ 79 (823)
|++-.. ....+-|.++ + +-+|+||++|+|+.......... ... .++.+++-|. ..||.
T Consensus 400 P~l~~~~~~~~t~I~Vd~~~~~~~~ydVlflGTd~G~vlKvV~~~~~~~-----------~~~-~llEElqvf~~~~pI~ 467 (737)
T KOG3611|consen 400 PLLVKTGDYRLTQIVVDRVAGLDGNYDVLFLGTDAGTVLKVVSPGKESG-----------KSN-VLLEELQVFPDAEPIR 467 (737)
T ss_pred ceEEecccceEEEEEEEEecCCCCcEEEEEEecCCCeEEEEEecCCccC-----------ccc-eeEEEEeecCCCCcee
Confidence 555444 5666666665 2 35899999999988764433211 111 1334444453 36899
Q ss_pred EEEEecccCceeeEeC-cEEEEe
Q 003405 80 SMEVLASRQLLLSLSE-SIAFHR 101 (823)
Q Consensus 80 qI~~~~~~~~Ll~l~d-~l~~~~ 101 (823)
-|++.+..+.|+|-++ +|.=..
T Consensus 468 ~m~Ls~~~~~LyVgs~~gV~qvp 490 (737)
T KOG3611|consen 468 SMQLSSKRGSLYVGSRSGVVQVP 490 (737)
T ss_pred EEEecccCCeEEEEccCcEEEee
Confidence 9999999998888887 454433
No 409
>KOG1070 consensus rRNA processing protein Rrp5 [RNA processing and modification]
Probab=30.43 E-value=1e+02 Score=39.53 Aligned_cols=72 Identities=15% Similarity=0.284 Sum_probs=46.3
Q ss_pred CcHHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccccc
Q 003405 547 NHYTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIE 626 (823)
Q Consensus 547 ~~~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~ 626 (823)
.-|..|+.+|.+.+++++|-+++.... ++.+ ........|+..||++|.+++.+
T Consensus 1531 ~V~~~L~~iy~k~ek~~~A~ell~~m~-------------------------KKF~-q~~~vW~~y~~fLl~~ne~~aa~ 1584 (1710)
T KOG1070|consen 1531 TVHLKLLGIYEKSEKNDEADELLRLML-------------------------KKFG-QTRKVWIMYADFLLRQNEAEAAR 1584 (1710)
T ss_pred HHHHHHHHHHHHhhcchhHHHHHHHHH-------------------------HHhc-chhhHHHHHHHHHhcccHHHHHH
Confidence 346678999999999999999886543 2223 23345557888888888855543
Q ss_pred -cccc--CCCChHHHHHHHhh
Q 003405 627 -LFLS--GNIPADLVNSYLKQ 644 (823)
Q Consensus 627 -if~~--~~l~~~~Vl~~L~~ 644 (823)
++-+ ..+|..+-++++..
T Consensus 1585 ~lL~rAL~~lPk~eHv~~Isk 1605 (1710)
T KOG1070|consen 1585 ELLKRALKSLPKQEHVEFISK 1605 (1710)
T ss_pred HHHHHHHhhcchhhhHHHHHH
Confidence 3333 34666555555554
No 410
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.38 E-value=91 Score=32.75 Aligned_cols=60 Identities=22% Similarity=0.250 Sum_probs=40.9
Q ss_pred hhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 303 LGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 303 ~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.-++.-.+++.|+|.+|..-+..+.+.- ........-+==.|..+|.+|+|++|...|..
T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~Y--P~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~ 203 (262)
T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKY--PNSTYTPNAYYWLGESLYAQGDYEDAAYIFAR 203 (262)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcC--CCCcccchhHHHHHHHHHhcccchHHHHHHHH
Confidence 4567778889999999988887642100 00011222233358899999999999999986
No 411
>KOG3380 consensus Actin-related protein Arp2/3 complex, subunit ARPC5 [Cytoskeleton]
Probab=30.29 E-value=1.1e+02 Score=28.95 Aligned_cols=66 Identities=20% Similarity=0.180 Sum_probs=45.1
Q ss_pred EEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHH------HHHHHccCCHHHHHHHHHhc
Q 003405 295 IFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRF------AHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 295 I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~------a~~lf~~~~f~~A~~~f~~~ 365 (823)
+-....-|-..||+.|+++|+..+|++.+=..++-. .+.+.++.+| ++.-|++.+.+++.+.+..-
T Consensus 30 ~~~a~~gp~~~ev~sll~qg~~~~AL~~aL~~~P~~-----t~~q~vK~~a~~~v~~vL~~ik~adI~~~v~~Ls~e 101 (152)
T KOG3380|consen 30 VESAALGPDEREVRSLLTQGKSLEALQTALLNPPYG-----TKDQEVKDRALNVVLKVLTSIKQADIEAAVKKLSTE 101 (152)
T ss_pred hhhhccCCChHHHHHHHHcccHHHHHHHHHhCCCCC-----CccHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhHH
Confidence 334455677889999999999999999986654221 1223333332 56678888888888776653
No 412
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=30.10 E-value=2.1e+02 Score=33.64 Aligned_cols=89 Identities=16% Similarity=0.282 Sum_probs=61.2
Q ss_pred CCeeEEEEecccCceeeEeC-c-EEEEeCCCCccccccc-CCCCcE-EEEeeCCCceEEEE-EcCeEEEEEEcCCCceeE
Q 003405 76 KPILSMEVLASRQLLLSLSE-S-IAFHRLPNLETIAVLT-KAKGAN-VYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVE 150 (823)
Q Consensus 76 ~~I~qI~~~~~~~~Ll~l~d-~-l~~~~L~~l~~~~~i~-~~kg~~-~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~ 150 (823)
..|..+.--|..+++...++ | |.+|++. .+.+-+++ ...+++ +.|+.++...|+|| ..++|.|.....+.....
T Consensus 21 ~~i~~~ewnP~~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kdG~I~L~Dve~~~~l~~ 99 (665)
T KOG4640|consen 21 INIKRIEWNPKMDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKDGTIRLHDVEKGGRLVS 99 (665)
T ss_pred cceEEEEEcCccchhheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCCCCEEEEEecCCeEEEEEccCCCceec
Confidence 46778888888888888876 6 9999887 44344444 455666 88999886678999 567888888764321111
Q ss_pred eeeecCCCCceEEEec
Q 003405 151 VKDFGVPDTVKSMSWC 166 (823)
Q Consensus 151 ~kei~~~~~~~~l~~~ 166 (823)
.-++..+.++++.|.
T Consensus 100 -~~~s~e~~is~~~w~ 114 (665)
T KOG4640|consen 100 -FLFSVETDISKGIWD 114 (665)
T ss_pred -cccccccchheeecc
Confidence 124445678888885
No 413
>PRK02889 tolB translocation protein TolB; Provisional
Probab=29.69 E-value=8.6e+02 Score=27.45 Aligned_cols=144 Identities=15% Similarity=0.150 Sum_probs=71.8
Q ss_pred eeEEEEecccCceee-EeC-c---EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEEc--CeEEEEEEcC-CCce
Q 003405 78 ILSMEVLASRQLLLS-LSE-S---IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFARQ--KRVCIFRHDG-GRGF 148 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~-l~d-~---l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~k--kki~l~~~~~-~~~f 148 (823)
+......|+.+.++. .+. + |..+++..-. ...+....+ .+..+++++...|+++.. ....||.+.. +...
T Consensus 242 ~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~-~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~ 320 (427)
T PRK02889 242 NSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSG-LRRLTQSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGAA 320 (427)
T ss_pred ccceEECCCCCEEEEEEccCCCceEEEEECCCCC-cEECCCCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCce
Confidence 445666777766654 443 3 4555543221 222322222 334566776656665543 3566777652 2122
Q ss_pred eEeeeecCC-CCceEEEec--CCeEEEEEcC----ceEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-EeCC----
Q 003405 149 VEVKDFGVP-DTVKSMSWC--GENICIAIRK----GYMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GKEN---- 216 (823)
Q Consensus 149 ~~~kei~~~-~~~~~l~~~--~~~i~v~~~~----~y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~~~---- 216 (823)
+. +... ....+..|. |+.|+++... ...++|+.+++...+...+....| .+.+++..++ +.+.
T Consensus 321 ~~---lt~~g~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~~~~~p--~~spdg~~l~~~~~~~g~~ 395 (427)
T PRK02889 321 QR---VTFTGSYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTTRDESP--SFAPNGRYILYATQQGGRS 395 (427)
T ss_pred EE---EecCCCCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCCCccCc--eECCCCCEEEEEEecCCCE
Confidence 22 1111 122344564 6778777653 256789988887766543222223 3455666555 3321
Q ss_pred eEEEEcCCCcc
Q 003405 217 IGVFVDQNGKL 227 (823)
Q Consensus 217 ~gvfv~~~G~~ 227 (823)
....++.+|+.
T Consensus 396 ~l~~~~~~g~~ 406 (427)
T PRK02889 396 VLAAVSSDGRI 406 (427)
T ss_pred EEEEEECCCCc
Confidence 23345666653
No 414
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=29.36 E-value=3.2e+02 Score=33.61 Aligned_cols=132 Identities=10% Similarity=0.157 Sum_probs=72.7
Q ss_pred eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC--c-EEEEe
Q 003405 25 YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE--S-IAFHR 101 (823)
Q Consensus 25 ~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d--~-l~~~~ 101 (823)
.+..+..|.-+-.+..++++...- .+..+.+... +.++...|..+.+++ | |.+-+
T Consensus 146 ~~~~~i~Gg~Q~~li~~Dl~~~~e-------------------~r~~~v~a~~---v~imR~Nnr~lf~G~t~G~V~LrD 203 (1118)
T KOG1275|consen 146 GPSTLIMGGLQEKLIHIDLNTEKE-------------------TRTTNVSASG---VTIMRYNNRNLFCGDTRGTVFLRD 203 (1118)
T ss_pred CCcceeecchhhheeeeeccccee-------------------eeeeeccCCc---eEEEEecCcEEEeecccceEEeec
Confidence 345677777777777777654211 1111222212 677888889999998 4 88889
Q ss_pred CCCCcccccccCCC-CcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeE---------eeeecCCCCceEEEec---CC
Q 003405 102 LPNLETIAVLTKAK-GANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVE---------VKDFGVPDTVKSMSWC---GE 168 (823)
Q Consensus 102 L~~l~~~~~i~~~k-g~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~---------~kei~~~~~~~~l~~~---~~ 168 (823)
+.++++++++..-. .++.|.+. |.+.++++.....|-+.-| .|.+ +--+.+|-.|+-+.|. -.
T Consensus 204 ~~s~~~iht~~aHs~siSDfDv~---GNlLitCG~S~R~~~l~~D-~FvkVYDLRmmral~PI~~~~~P~flrf~Psl~t 279 (1118)
T KOG1275|consen 204 PNSFETIHTFDAHSGSISDFDVQ---GNLLITCGYSMRRYNLAMD-PFVKVYDLRMMRALSPIQFPYGPQFLRFHPSLTT 279 (1118)
T ss_pred CCcCceeeeeeccccceeeeecc---CCeEEEeeccccccccccc-chhhhhhhhhhhccCCcccccCchhhhhcccccc
Confidence 99998877643322 23445443 2333332222222222112 2332 2234566666666665 35
Q ss_pred eEEEEEcCc-eEEEE
Q 003405 169 NICIAIRKG-YMILN 182 (823)
Q Consensus 169 ~i~v~~~~~-y~lid 182 (823)
.+||++.++ +..+|
T Consensus 280 ~~~V~S~sGq~q~vd 294 (1118)
T KOG1275|consen 280 RLAVTSQSGQFQFVD 294 (1118)
T ss_pred eEEEEecccceeecc
Confidence 688887754 56777
No 415
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=29.34 E-value=6.4e+02 Score=29.59 Aligned_cols=147 Identities=10% Similarity=0.044 Sum_probs=81.7
Q ss_pred cCCCCceEEEec-CCeEEEEEcCceEEEEcCCCCeeeccCCCCCCCCEEEEc--cCCeEEEEeCCeEEEEcCCCccccCC
Q 003405 155 GVPDTVKSMSWC-GENICIAIRKGYMILNATNGALSEVFPSGRIGPPLVVSL--LSGELLLGKENIGVFVDQNGKLLQAD 231 (823)
Q Consensus 155 ~~~~~~~~l~~~-~~~i~v~~~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~--~~~EfLL~~~~~gvfv~~~G~~~~~~ 231 (823)
....++..+.+. .+.+++|+.++-..+|-.+|+...+-+..- .+++.... -.++.-++.++-...++..|.....-
T Consensus 162 l~d~~V~aLv~D~~g~lWvgT~dGL~~fd~~~gkalql~s~~~-dk~I~al~~d~qg~LWVGTdqGv~~~e~~G~~~sn~ 240 (671)
T COG3292 162 LKDTPVVALVFDANGRLWVGTPDGLSYFDAGRGKALQLASPPL-DKAINALIADVQGRLWVGTDQGVYLQEAEGWRASNW 240 (671)
T ss_pred ccCccceeeeeeccCcEEEecCCcceEEccccceEEEcCCCcc-hhhHHHHHHHhcCcEEEEeccceEEEchhhcccccc
Confidence 334466677766 568999999999999999998877755421 12332222 35688888877666677788422111
Q ss_pred ceeecCCCc-EEEE-eCCEEEEEeCCeEEEEEccCCCceeEEEeeCC------ccccc-ccCCeEEEeccceEEEeeccC
Q 003405 232 RICWSEAPI-AVII-QKPYAIALLPRRVEVRSLRVPYALIQTIVLQN------VRHLI-PSSNAVVVALENSIFGLFPVP 302 (823)
Q Consensus 232 ~i~w~~~P~-~v~~-~~PYll~~~~~~ieV~~l~~~~~lvQ~i~l~~------~~~l~-~~~~~v~v~s~~~I~~l~~~~ 302 (823)
....+.... .+.- ..-|+-.-+++++-+..+.+ + -+|....+. +..+. ..++.+.+.+...|+++..-|
T Consensus 241 ~~~lp~~~I~ll~qD~qG~lWiGTenGl~r~~l~r-q-~Lq~~~~~~~l~~S~vnsL~~D~dGsLWv~t~~giv~~~~a~ 318 (671)
T COG3292 241 GPMLPSGNILLLVQDAQGELWIGTENGLWRTRLPR-Q-GLQIPLSKMHLGVSTVNSLWLDTDGSLWVGTYGGIVRYLTAD 318 (671)
T ss_pred CCCCcchheeeeecccCCCEEEeecccceeEecCC-C-CccccccccCCccccccceeeccCCCEeeeccCceEEEecch
Confidence 111111111 1111 13455555566666555543 2 122211111 11222 245678888888888888777
Q ss_pred hh
Q 003405 303 LG 304 (823)
Q Consensus 303 ~~ 304 (823)
|.
T Consensus 319 w~ 320 (671)
T COG3292 319 WK 320 (671)
T ss_pred hh
Confidence 53
No 416
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=29.32 E-value=1.8e+02 Score=31.28 Aligned_cols=134 Identities=13% Similarity=0.240 Sum_probs=77.4
Q ss_pred EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEeC-c--EEEEeCCC
Q 003405 28 KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLSE-S--IAFHRLPN 104 (823)
Q Consensus 28 ~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~d-~--l~~~~L~~ 104 (823)
.|.+...+|.+.++++... .+... +. -+..++..+.--|+...++.-++ . |.+|.|.+
T Consensus 63 ilC~~yk~~~vqvwsl~Qp----------------ew~ck--Id-eg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t 123 (447)
T KOG4497|consen 63 ILCVAYKDPKVQVWSLVQP----------------EWYCK--ID-EGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNT 123 (447)
T ss_pred eeeeeeccceEEEEEeecc----------------eeEEE--ec-cCCCcceeeeECCCcceEeeeecceeEEEEEEecc
Confidence 4566788888888887642 22221 11 13568888999998877777777 3 89999876
Q ss_pred CcccccccCC-CCcEEEEeeCCCceEEEEEcCeEE-EEEEcCCCceeEeeeecCCC-CceEEEec--CCeEEEEEc-Cce
Q 003405 105 LETIAVLTKA-KGANVYSWDDRRGFLCFARQKRVC-IFRHDGGRGFVEVKDFGVPD-TVKSMSWC--GENICIAIR-KGY 178 (823)
Q Consensus 105 l~~~~~i~~~-kg~~~fa~~~~~~~l~V~~kkki~-l~~~~~~~~f~~~kei~~~~-~~~~l~~~--~~~i~v~~~-~~y 178 (823)
-+... ++-. .|+.-++.+++..+.++..++... .+.+..-+.+..+|++.++- .-+++.|. |+.+.|=.. -+|
T Consensus 124 ~~~~~-~~~pK~~~kg~~f~~dg~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPdg~~laVwd~~Ley 202 (447)
T KOG4497|consen 124 QKGYL-LPHPKTNVKGYAFHPDGQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPDGNWLAVWDNVLEY 202 (447)
T ss_pred ceeEE-ecccccCceeEEECCCCceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCCCcEEEEecchhhh
Confidence 54221 2222 355667778776666777666431 11221111234456665552 45678887 444444333 466
Q ss_pred EEE
Q 003405 179 MIL 181 (823)
Q Consensus 179 ~li 181 (823)
.++
T Consensus 203 kv~ 205 (447)
T KOG4497|consen 203 KVY 205 (447)
T ss_pred eee
Confidence 644
No 417
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=29.31 E-value=48 Score=25.97 Aligned_cols=20 Identities=35% Similarity=0.702 Sum_probs=18.2
Q ss_pred HHHHHHccCCHHHHHHHHHh
Q 003405 345 FAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 345 ~a~~lf~~~~f~~A~~~f~~ 364 (823)
.|..++..|+|++|...|.+
T Consensus 3 ~a~~~~~~g~~~~A~~~~~~ 22 (65)
T PF13432_consen 3 LARALYQQGDYDEAIAAFEQ 22 (65)
T ss_dssp HHHHHHHCTHHHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHH
Confidence 57889999999999999987
No 418
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=29.21 E-value=6.8e+02 Score=26.12 Aligned_cols=144 Identities=17% Similarity=0.213 Sum_probs=75.4
Q ss_pred CcEEEEEEe----CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecc------
Q 003405 17 PKIDAVASY----GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLAS------ 86 (823)
Q Consensus 17 ~~I~ci~~~----~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~------ 86 (823)
..++||+.- |=.|+.|++||.|.++..+.+..-.+ .+... -+.-.|+.+.-.|.
T Consensus 103 ~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g~w~t---------------~ki~~-aH~~GvnsVswapa~~~g~~ 166 (299)
T KOG1332|consen 103 ASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGWTT---------------SKIVF-AHEIGVNSVSWAPASAPGSL 166 (299)
T ss_pred ccceeecccccccceEEEEeeCCCcEEEEEEcCCCCccc---------------hhhhh-ccccccceeeecCcCCCccc
Confidence 456776632 34789999999999988776511100 00000 11223333333332
Q ss_pred --------cCceeeE-eCc-EEEEeCCCCccc--ccccCCCC-cEEEEeeCCC----ceEEEE-EcCeEEEEEEcCC-Cc
Q 003405 87 --------RQLLLSL-SES-IAFHRLPNLETI--AVLTKAKG-ANVYSWDDRR----GFLCFA-RQKRVCIFRHDGG-RG 147 (823)
Q Consensus 87 --------~~~Ll~l-~d~-l~~~~L~~l~~~--~~i~~~kg-~~~fa~~~~~----~~l~V~-~kkki~l~~~~~~-~~ 147 (823)
...|++= ||+ |++|...+-+.+ .++..-++ +..+|..+.. ..|+-+ ..+++.||..+.. ..
T Consensus 167 ~~~~~~~~~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~SqDg~viIwt~~~e~e~ 246 (299)
T KOG1332|consen 167 VDQGPAAKVKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQDGTVIIWTKDEEYEP 246 (299)
T ss_pred cccCcccccceeeccCCccceeeeecCCcchhhhhhhhhcchhhhhhhhccccCCCceeeEEecCCCcEEEEEecCccCc
Confidence 1223332 233 999988663221 11211111 2234444433 234444 6678889887642 23
Q ss_pred eeEeeeecCCCCceEEEec--CCeEEEEEcC
Q 003405 148 FVEVKDFGVPDTVKSMSWC--GENICIAIRK 176 (823)
Q Consensus 148 f~~~kei~~~~~~~~l~~~--~~~i~v~~~~ 176 (823)
++..-.-.+|+.+-.++|. |+.+.|+...
T Consensus 247 wk~tll~~f~~~~w~vSWS~sGn~LaVs~Gd 277 (299)
T KOG1332|consen 247 WKKTLLEEFPDVVWRVSWSLSGNILAVSGGD 277 (299)
T ss_pred ccccccccCCcceEEEEEeccccEEEEecCC
Confidence 3221112378888888886 8888888774
No 419
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=28.90 E-value=90 Score=29.18 Aligned_cols=56 Identities=23% Similarity=0.204 Sum_probs=38.2
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
..++.+...|++++|+.+++.....++ --..++...-..+...|++.+|+.+|.+.
T Consensus 67 ~l~~~~~~~~~~~~a~~~~~~~l~~dP-----~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~ 122 (146)
T PF03704_consen 67 RLAEALLEAGDYEEALRLLQRALALDP-----YDEEAYRLLMRALAAQGRRAEALRVYERY 122 (146)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHST-----T-HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHhccCHHHHHHHHHHHHhcCC-----CCHHHHHHHHHHHHHCcCHHHHHHHHHHH
Confidence 345556779999999999987531111 12356777778888999999999998763
No 420
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=28.83 E-value=1e+02 Score=30.44 Aligned_cols=54 Identities=20% Similarity=0.205 Sum_probs=39.4
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
...+...|++++|+..++.....++ .....+...|..++..|+|++|...|.++
T Consensus 38 a~~~~~~~~~~~A~~~~~~~l~~~p-----~~~~~~~~la~~~~~~~~~~~A~~~~~~a 91 (234)
T TIGR02521 38 ALGYLEQGDLEVAKENLDKALEHDP-----DDYLAYLALALYYQQLGELEKAEDSFRRA 91 (234)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCc-----ccHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4556788999999999876421111 12245566789999999999999999874
No 421
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transcription, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accommodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=28.78 E-value=77 Score=25.34 Aligned_cols=56 Identities=21% Similarity=0.272 Sum_probs=39.3
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
.....+...+++++|+..++....... .........|..++..+++++|...|.++
T Consensus 39 ~~~~~~~~~~~~~~a~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 94 (100)
T cd00189 39 NLAAAYYKLGKYEEALEDYEKALELDP-----DNAKAYYNLGLAYYKLGKYEEALEAYEKA 94 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCC-----cchhHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 344555677999999998876421111 11245667788899999999999998774
No 422
>KOG2178 consensus Predicted sugar kinase [Carbohydrate transport and metabolism]
Probab=28.64 E-value=3.7e+02 Score=29.85 Aligned_cols=39 Identities=18% Similarity=0.385 Sum_probs=27.5
Q ss_pred CCeEEEEEcCceEEEEcCCCC-----------eeeccCCCCCCCCEEEEc
Q 003405 167 GENICIAIRKGYMILNATNGA-----------LSEVFPSGRIGPPLVVSL 205 (823)
Q Consensus 167 ~~~i~v~~~~~y~lidl~~~~-----------~~~L~~~~~~~~p~i~~~ 205 (823)
||.+||+|.+|-.-|.+..|- ++++-|.+-+.+|+|++-
T Consensus 286 ~DGliVaTPTGSTAYS~sAGGSlvhP~vpAIlvTPICPhSLSFRPIIlPd 335 (409)
T KOG2178|consen 286 GDGLIVATPTGSTAYSASAGGSLVHPSVPAILVTPICPHSLSFRPIILPD 335 (409)
T ss_pred cceEEEecCCchhhhHhhcCCceecCCCCeEEEeccCCCcccccceEccC
Confidence 788999999887777776542 245555556678888763
No 423
>KOG3785 consensus Uncharacterized conserved protein [Function unknown]
Probab=28.48 E-value=1.1e+02 Score=33.25 Aligned_cols=66 Identities=24% Similarity=0.386 Sum_probs=48.2
Q ss_pred cceEEEeeccChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 292 ENSIFGLFPVPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 292 ~~~I~~l~~~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
+..+-..+..|- ++.++.+..|+-|++|++--.+.+ +.....+..=.|+..|.-|+|++|+..+.-
T Consensus 17 ~~~~kkarK~P~---Ledfls~rDytGAislLefk~~~~----~EEE~~~~lWia~C~fhLgdY~~Al~~Y~~ 82 (557)
T KOG3785|consen 17 GPTIKKARKMPE---LEDFLSNRDYTGAISLLEFKLNLD----REEEDSLQLWIAHCYFHLGDYEEALNVYTF 82 (557)
T ss_pred CCcchhhhcCch---HHHHHhcccchhHHHHHHHhhccc----hhhhHHHHHHHHHHHHhhccHHHHHHHHHH
Confidence 344555566664 889999999999999997532221 123456677789999999999999998764
No 424
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=28.20 E-value=71 Score=20.28 Aligned_cols=21 Identities=29% Similarity=0.257 Sum_probs=18.4
Q ss_pred HHHHHHHHHhccHHHHHHHHH
Q 003405 550 TALLELYKSNARHREALKLLH 570 (823)
Q Consensus 550 ~~L~~ly~~~g~~~~AL~ll~ 570 (823)
..|+..|...|++++|..++.
T Consensus 5 ~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 5 LALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHcCCHHHHHHHHh
Confidence 468899999999999999875
No 425
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=28.13 E-value=7.3e+02 Score=26.15 Aligned_cols=29 Identities=24% Similarity=0.434 Sum_probs=22.7
Q ss_pred CCcEEE-EEEeCCEEEEEeCCCcEEEEcCC
Q 003405 16 SPKIDA-VASYGLKILLGCSDGSLKIYSPG 44 (823)
Q Consensus 16 ~~~I~c-i~~~~~~L~vGT~~G~l~~y~~~ 44 (823)
..+|+| ....|+++++|+..|.++....+
T Consensus 52 g~RiE~sa~vvgdfVV~GCy~g~lYfl~~~ 81 (354)
T KOG4649|consen 52 GVRIECSAIVVGDFVVLGCYSGGLYFLCVK 81 (354)
T ss_pred CceeeeeeEEECCEEEEEEccCcEEEEEec
Confidence 357776 34578999999999999887654
No 426
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=27.84 E-value=1.1e+02 Score=29.37 Aligned_cols=54 Identities=15% Similarity=0.159 Sum_probs=37.5
Q ss_pred HHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 310 LTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 310 Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+...|+|++|+..++....... .......++...|..+...|++++|+..|.++
T Consensus 45 ~~~~g~~~~A~~~~~~al~l~~--~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~A 98 (168)
T CHL00033 45 AQSEGEYAEALQNYYEAMRLEI--DPYDRSYILYNIGLIHTSNGEHTKALEYYFQA 98 (168)
T ss_pred HHHcCCHHHHHHHHHHHHhccc--cchhhHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3467999999999876421110 00112346777899999999999999998763
No 427
>KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification]
Probab=27.81 E-value=1.9e+02 Score=36.56 Aligned_cols=105 Identities=21% Similarity=0.208 Sum_probs=59.4
Q ss_pred cCCCCcEEEEeeCCCceEEEEEcCeEEEEEEcCCCceeEeeeecCCCCceEEEecCCeEEEEEc-CceEEEEcCC-CCee
Q 003405 112 TKAKGANVYSWDDRRGFLCFARQKRVCIFRHDGGRGFVEVKDFGVPDTVKSMSWCGENICIAIR-KGYMILNATN-GALS 189 (823)
Q Consensus 112 ~~~kg~~~fa~~~~~~~l~V~~kkki~l~~~~~~~~f~~~kei~~~~~~~~l~~~~~~i~v~~~-~~y~lidl~~-~~~~ 189 (823)
.+.||.. -++.+-.|.++.+...||.++.|+.+....-+--+.+|--+.+|...++.|++|.- +++..+-.+. +...
T Consensus 1094 eE~KGtV-savceV~G~l~~~~GqKI~v~~l~r~~~ligVaFiD~~~yv~s~~~vknlIl~gDV~ksisfl~fqeep~rl 1172 (1366)
T KOG1896|consen 1094 EEQKGTV-SAVCEVRGHLLSSQGQKIIVRKLDRDSELIGVAFIDLPLYVHSMKVVKNLILAGDVMKSISFLGFQEEPYRL 1172 (1366)
T ss_pred hhcccce-EEEEEeccEEEEccCcEEEEEEeccCCcceeeEEeccceeEEehhhhhhheehhhhhhceEEEEEccCceEE
Confidence 3445533 34455568899999999999999644444433334555556666666777777765 4555554433 2233
Q ss_pred eccCCCCCCCCEEEEccCCeEEEEeCCeEEEE
Q 003405 190 EVFPSGRIGPPLVVSLLSGELLLGKENIGVFV 221 (823)
Q Consensus 190 ~L~~~~~~~~p~i~~~~~~EfLL~~~~~gvfv 221 (823)
.|+..+ ..|+-+. .-|||+--+++++.+
T Consensus 1173 sL~srd--~~~l~v~--s~EFLVdg~~L~flv 1200 (1366)
T KOG1896|consen 1173 SLLSRD--FEPLNVY--STEFLVDGSNLSFLV 1200 (1366)
T ss_pred EEeecC--Cchhhce--eeeeEEcCCeeEEEE
Confidence 343332 3333221 236666555555554
No 428
>KOG2280 consensus Vacuolar assembly/sorting protein VPS16 [Intracellular trafficking, secretion, and vesicular transport]
Probab=27.61 E-value=1.1e+03 Score=28.83 Aligned_cols=63 Identities=21% Similarity=0.426 Sum_probs=37.7
Q ss_pred cEEEEeeCCCceEEEE-EcCeEEEEEEcCCCceeEeeeecC--CCCceEEEecCCe-EEEEEcCceEEEE
Q 003405 117 ANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGFVEVKDFGV--PDTVKSMSWCGEN-ICIAIRKGYMILN 182 (823)
Q Consensus 117 ~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f~~~kei~~--~~~~~~l~~~~~~-i~v~~~~~y~lid 182 (823)
+.-++++++.++||.. ..++|.+...+-. +..-|+.+ .++|..|+|+||. +.+.......++.
T Consensus 219 ~~ki~VS~n~~~laLyt~~G~i~~vs~D~~---~~lce~~~~~~~~p~qm~WcgndaVvl~~e~~l~lvg 285 (829)
T KOG2280|consen 219 VVKISVSPNRRFLALYTETGKIWVVSIDLS---QILCEFNCTDHDPPKQMAWCGNDAVVLSWEVNLMLVG 285 (829)
T ss_pred EEEEEEcCCcceEEEEecCCcEEEEecchh---hhhhccCCCCCCchHhceeecCCceEEEEeeeEEEEc
Confidence 4446778777888776 4455655555422 22335543 4789999999764 5555544444443
No 429
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=27.33 E-value=4.3e+02 Score=33.14 Aligned_cols=90 Identities=13% Similarity=0.040 Sum_probs=51.5
Q ss_pred cccCceeeEeC-cEEEEeCCCCcccccccCCCCcEEEEee--CCCceEEEE-EcCeEEEEEEcC-CCceeEeeeecCCCC
Q 003405 85 ASRQLLLSLSE-SIAFHRLPNLETIAVLTKAKGANVYSWD--DRRGFLCFA-RQKRVCIFRHDG-GRGFVEVKDFGVPDT 159 (823)
Q Consensus 85 ~~~~~Ll~l~d-~l~~~~L~~l~~~~~i~~~kg~~~fa~~--~~~~~l~V~-~kkki~l~~~~~-~~~f~~~kei~~~~~ 159 (823)
|-.+.+++=.+ -+++|++++-+...+ ...|.+..+-.+ ....+|+|+ .+..+.+|.|+. ++++...-+-..|--
T Consensus 943 ~f~~~~LagvG~~l~~YdlG~K~lLRk-~e~k~~p~~Is~iqt~~~RI~VgD~qeSV~~~~y~~~~n~l~~fadD~~pR~ 1021 (1205)
T KOG1898|consen 943 PFQGRVLAGVGRFLRLYDLGKKKLLRK-CELKFIPNRISSIQTYGARIVVGDIQESVHFVRYRREDNQLIVFADDPVPRH 1021 (1205)
T ss_pred ccCCEEEEecccEEEEeeCChHHHHhh-hhhccCceEEEEEeecceEEEEeeccceEEEEEEecCCCeEEEEeCCCccce
Confidence 33444444333 399999976543322 122222221111 123479999 888899999874 445555555555666
Q ss_pred ceEEEecC-CeEEEEEc
Q 003405 160 VKSMSWCG-ENICIAIR 175 (823)
Q Consensus 160 ~~~l~~~~-~~i~v~~~ 175 (823)
++++.+++ +++.+|.+
T Consensus 1022 Vt~~~~lD~~tvagaDr 1038 (1205)
T KOG1898|consen 1022 VTALELLDYDTVAGADR 1038 (1205)
T ss_pred eeEEEEecCCceeeccc
Confidence 66666664 56777776
No 430
>KOG2280 consensus Vacuolar assembly/sorting protein VPS16 [Intracellular trafficking, secretion, and vesicular transport]
Probab=27.00 E-value=1.2e+03 Score=28.34 Aligned_cols=106 Identities=12% Similarity=0.165 Sum_probs=54.5
Q ss_pred EEEEeeCCCceEEEEE-----------cCeEEEEEEcCCCceeEeeeecC-CCCceEEEecCC--eEEEEEcCceEEEEc
Q 003405 118 NVYSWDDRRGFLCFAR-----------QKRVCIFRHDGGRGFVEVKDFGV-PDTVKSMSWCGE--NICIAIRKGYMILNA 183 (823)
Q Consensus 118 ~~fa~~~~~~~l~V~~-----------kkki~l~~~~~~~~f~~~kei~~-~~~~~~l~~~~~--~i~v~~~~~y~lidl 183 (823)
..|++.+-.|.|+|.. ...|.||...| +. +..+.. ...+.+|.|.++ .|||.-.....++++
T Consensus 36 ~~fa~Ap~gGpIAV~r~p~~~~~~~~a~~~I~If~~sG-~l---L~~~~w~~~~lI~mgWs~~eeLI~v~k~g~v~Vy~~ 111 (829)
T KOG2280|consen 36 VYFACAPFGGPIAVTRSPSKLVPLYSARPYIRIFNISG-QL---LGRILWKHGELIGMGWSDDEELICVQKDGTVHVYGL 111 (829)
T ss_pred eEEEecccCCceEEEecccccccccccceeEEEEeccc-cc---hHHHHhcCCCeeeecccCCceEEEEeccceEEEeec
Confidence 3455555556666652 23467777763 21 222222 237889999865 466666677788887
Q ss_pred CCCCeeeccCCCCC--CCCEE-EE-ccCCeEEEEeCCeEEEEcCCCccc
Q 003405 184 TNGALSEVFPSGRI--GPPLV-VS-LLSGELLLGKENIGVFVDQNGKLL 228 (823)
Q Consensus 184 ~~~~~~~L~~~~~~--~~p~i-~~-~~~~EfLL~~~~~gvfv~~~G~~~ 228 (823)
......+ +..|.. ...+. ++ ..+|=+++..++..+.++..+++.
T Consensus 112 ~ge~ie~-~svg~e~~~~~I~ec~~f~~GVavlt~~g~v~~i~~~~~~~ 159 (829)
T KOG2280|consen 112 LGEFIES-NSVGFESQMSDIVECRFFHNGVAVLTVSGQVILINGVEEPK 159 (829)
T ss_pred chhhhcc-cccccccccCceeEEEEecCceEEEecCCcEEEEcCCCcch
Confidence 6543332 132211 11111 12 234444555555555555555543
No 431
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=26.96 E-value=4.4e+02 Score=31.69 Aligned_cols=60 Identities=15% Similarity=0.155 Sum_probs=41.4
Q ss_pred HHHHHhcCChhhHHhhhcC-----CCcccHHHHHHHHHhcCcHHHHHHHHHHhccHHHHHHHHHH
Q 003405 512 LQALLLTGQSSAALELLKG-----LNYCDVKICEEILQKKNHYTALLELYKSNARHREALKLLHE 571 (823)
Q Consensus 512 l~~y~~~~~~~~l~~ll~~-----~n~c~~~~~~~~L~~~~~~~~L~~ly~~~g~~~~AL~ll~~ 571 (823)
+.+|++.+.+......... .+.--++.+..-|.+...|+-..++|.+-.++++||+.+++
T Consensus 622 iqlyika~~p~~a~~~a~n~~~l~~de~il~~ia~alik~elydkagdlfeki~d~dkale~fkk 686 (1636)
T KOG3616|consen 622 IQLYIKAGKPAKAARAALNDEELLADEEILEHIAAALIKGELYDKAGDLFEKIHDFDKALECFKK 686 (1636)
T ss_pred HHHHHHcCCchHHHHhhcCHHHhhccHHHHHHHHHHHHhhHHHHhhhhHHHHhhCHHHHHHHHHc
Confidence 4678887655433322211 12223455566677788999999999999999999999986
No 432
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=26.70 E-value=47 Score=34.47 Aligned_cols=65 Identities=15% Similarity=0.213 Sum_probs=0.0
Q ss_pred EEEEEEeC---CEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEeccc-CceeeEe
Q 003405 19 IDAVASYG---LKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASR-QLLLSLS 94 (823)
Q Consensus 19 I~ci~~~~---~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~-~~Ll~l~ 94 (823)
++|.|.|. +.+..||.+|.+-+|+.. ...+.-+.-..+|++|.++..=|.. +.|++++
T Consensus 182 v~~l~~hp~qq~~v~cgt~dg~~~l~d~r------------------n~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~s 243 (319)
T KOG4714|consen 182 VTALCSHPAQQHLVCCGTDDGIVGLWDAR------------------NVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCS 243 (319)
T ss_pred chhhhCCcccccEEEEecCCCeEEEEEcc------------------cccchHHHHHHhhhhhhheeccCCCchheeEec
Q ss_pred C-c-EEEEe
Q 003405 95 E-S-IAFHR 101 (823)
Q Consensus 95 d-~-l~~~~ 101 (823)
+ | +..|+
T Consensus 244 edGslw~wd 252 (319)
T KOG4714|consen 244 EDGSLWHWD 252 (319)
T ss_pred CCCcEEEEc
No 433
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=26.59 E-value=1.3e+03 Score=28.47 Aligned_cols=166 Identities=13% Similarity=0.041 Sum_probs=92.9
Q ss_pred HHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHh-hcCCCCChhhHHHhhhhhhhcCcccccccccc
Q 003405 552 LLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYL-KPLCGTDPMLVLEFSMLVLESCPTQTIELFLS 630 (823)
Q Consensus 552 L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL-~~L~~~~~~li~~y~~wll~~~p~~~~~if~~ 630 (823)
++.+|...|++++|++++.+.......+. .. ..+....+. --+.....+--.++..-+.+.+|... .++..
T Consensus 278 la~~yl~~g~~e~A~~~l~~~l~~~p~~~---~~----~~~~~~~L~~a~~~~g~~~eA~~~l~~~~~~~P~~~-~~~~~ 349 (765)
T PRK10049 278 VASAYLKLHQPEKAQSILTELFYHPETIA---DL----SDEELADLFYSLLESENYPGALTVTAHTINNSPPFL-RLYGS 349 (765)
T ss_pred HHHHHHhcCCcHHHHHHHHHHhhcCCCCC---CC----ChHHHHHHHHHHHhcccHHHHHHHHHHHhhcCCceE-eecCC
Confidence 68899999999999999998764332110 00 001111110 01123344555555556666666642 23321
Q ss_pred -CCCCh-HH--H---H--HHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchH
Q 003405 631 -GNIPA-DL--V---N--SYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSP 701 (823)
Q Consensus 631 -~~l~~-~~--V---l--~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~ 701 (823)
...|- +. + . -.+.....+.++..|+.++.. .+ .++.++..++.+|...=
T Consensus 350 ~~~~p~~~~~~a~~~~a~~l~~~g~~~eA~~~l~~al~~-~P-~n~~l~~~lA~l~~~~g-------------------- 407 (765)
T PRK10049 350 PTSIPNDDWLQGQSLLSQVAKYSNDLPQAEMRARELAYN-AP-GNQGLRIDYASVLQARG-------------------- 407 (765)
T ss_pred CCCCCCchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CC-CCHHHHHHHHHHHHhcC--------------------
Confidence 11111 11 1 1 111223456788889988764 23 45788888888887540
Q ss_pred HHHHHHHHhhhcCCCChHHHhccCCCC-chhhHHHHHhhccccHHHHHHHHHHHh
Q 003405 702 TRKKLLSALESISGYNPEVLLKRLPAD-ALYEERAILLGKMNQHELALSLYVHKV 755 (823)
Q Consensus 702 ~r~kLl~fL~~s~~Yd~~~~L~~~~~~-~l~~e~~~Ll~klg~h~~AL~ilv~~L 755 (823)
...+....|+ .++..-|++ .+...++..+-++|++++|..++=.-+
T Consensus 408 ~~~~A~~~l~--------~al~l~Pd~~~l~~~~a~~al~~~~~~~A~~~~~~ll 454 (765)
T PRK10049 408 WPRAAENELK--------KAEVLEPRNINLEVEQAWTALDLQEWRQMDVLTDDVV 454 (765)
T ss_pred CHHHHHHHHH--------HHHhhCCCChHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 1122222222 345555654 577789999999999999998886544
No 434
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=26.57 E-value=71 Score=22.45 Aligned_cols=19 Identities=32% Similarity=0.707 Sum_probs=15.2
Q ss_pred HHHHHHccCCHHHHHHHHH
Q 003405 345 FAHYLFDTGSYEEAMEHFL 363 (823)
Q Consensus 345 ~a~~lf~~~~f~~A~~~f~ 363 (823)
.|..++.+|+|++|.+.|.
T Consensus 7 ~a~~~y~~~ky~~A~~~~~ 25 (36)
T PF07720_consen 7 LAYNFYQKGKYDEAIHFFQ 25 (36)
T ss_dssp HHHHHHHTT-HHHHHHHHH
T ss_pred HHHHHHHHhhHHHHHHHHH
Confidence 4778899999999999944
No 435
>KOG1840 consensus Kinesin light chain [Cytoskeleton]
Probab=26.21 E-value=1.2e+02 Score=35.19 Aligned_cols=59 Identities=22% Similarity=0.259 Sum_probs=39.8
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCc--ch-HhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 307 IVQLTASGDFEEALALCKLLPPE--DA-SLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~--~~-~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
..++..+|+|+.|..+++...+. +. ......+......+|..+-..++|++|...|.++
T Consensus 206 a~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~A 267 (508)
T KOG1840|consen 206 AEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEA 267 (508)
T ss_pred HHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 45667899999999999764211 00 0011123344446899999999999999988873
No 436
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=26.08 E-value=9.5e+02 Score=26.81 Aligned_cols=193 Identities=17% Similarity=0.137 Sum_probs=111.4
Q ss_pred CCcccHHHHHHHHHhcCcHHH----HHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccC---------------Ch
Q 003405 531 LNYCDVKICEEILQKKNHYTA----LLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKF---------------NP 591 (823)
Q Consensus 531 ~n~c~~~~~~~~L~~~~~~~~----L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~---------------~~ 591 (823)
++.-..+.+.+.+..+.+..+ ....|...|.+..++.++-++........ .+..++. +.
T Consensus 168 d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L~ka~~l~~--~e~~~le~~a~~glL~q~~~~~~~ 245 (400)
T COG3071 168 DYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSD--EEAARLEQQAWEGLLQQARDDNGS 245 (400)
T ss_pred CchhHHHHHHHHHHhCcCChHHHHHHHHHHHHhccHHHHHHHHHHHHHccCCCh--HHHHHHHHHHHHHHHHHHhccccc
Confidence 445556666677765544433 46779999999999999999886554311 0100000 00
Q ss_pred HHHHHHhhcCC---CCChhhHHHhhhhhhhcCcc-cccccccc---CCCChH--HHHHHHhhcCchhHHHHHHHHhhccc
Q 003405 592 ESIIEYLKPLC---GTDPMLVLEFSMLVLESCPT-QTIELFLS---GNIPAD--LVNSYLKQYSPSMQGRYLELMLAMNE 662 (823)
Q Consensus 592 ~~~i~yL~~L~---~~~~~li~~y~~wll~~~p~-~~~~if~~---~~l~~~--~Vl~~L~~~~~~~~~~YLE~li~~~~ 662 (823)
+-..++.+.++ ..+.+++-.|+..+++.+-. .|.++..+ ...++. ..++.+...+|.-+++=+|+-+..
T Consensus 246 ~gL~~~W~~~pr~lr~~p~l~~~~a~~li~l~~~~~A~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~-- 323 (400)
T COG3071 246 EGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQ-- 323 (400)
T ss_pred hHHHHHHHhccHHhhcChhHHHHHHHHHHHcCChHHHHHHHHHHHHhccChhHHHHHhhcCCCCchHHHHHHHHHHHh--
Confidence 11111222221 12345555555555554333 33443332 134444 345555556677788888887653
Q ss_pred CCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhhcCCCChHHHhccCCCCchhhHHHHHhhccc
Q 003405 663 NSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALESISGYNPEVLLKRLPADALYEERAILLGKMN 742 (823)
Q Consensus 663 ~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~s~~Yd~~~~L~~~~~~~l~~e~~~Ll~klg 742 (823)
...+|.++-+|..+|+..- .-.|-..+|+. +++.=++..=+.+.+..+.++|
T Consensus 324 h~~~p~L~~tLG~L~~k~~--------------------~w~kA~~~lea--------Al~~~~s~~~~~~la~~~~~~g 375 (400)
T COG3071 324 HPEDPLLLSTLGRLALKNK--------------------LWGKASEALEA--------ALKLRPSASDYAELADALDQLG 375 (400)
T ss_pred CCCChhHHHHHHHHHHHhh--------------------HHHHHHHHHHH--------HHhcCCChhhHHHHHHHHHHcC
Confidence 3467899999999999761 23555666663 3333333344678899999999
Q ss_pred cHHHHHHHHHHHh
Q 003405 743 QHELALSLYVHKV 755 (823)
Q Consensus 743 ~h~~AL~ilv~~L 755 (823)
+-++|=+..-.-|
T Consensus 376 ~~~~A~~~r~e~L 388 (400)
T COG3071 376 EPEEAEQVRREAL 388 (400)
T ss_pred ChHHHHHHHHHHH
Confidence 9888766655443
No 437
>PF01403 Sema: Sema domain; InterPro: IPR001627 The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in a hepatocyte growth factor receptor, in SEX protein [] and in viral proteins. CD100 (also called SEMA4D) is associated with PTPase and serine kinase activity. CD100 increases PMA, CD3 and CD2 induced T cell proliferation, increases CD45 induced T cell adhesion, induces B cell homotypic adhesion and down-regulates B cell expression of CD23. The Sema domain is characterised by a conserved set of cysteine residues, which form four disulphide bonds to stabilise the structure. The Sema domain fold is a variation of the beta propeller topology, with seven blades radially arranged around a central axis. Each blade contains a four- stranded (strands A to D) antiparallel beta sheet. The inner strand of each blade (A) lines the channel at the centre of the propeller, with strands B and C of the same repeat radiating outward, and strand D of the next repeat forming the outer edge of the blade. The large size of the Sema domain is not due to a single inserted domain but results from the presence of additional secondary structure elements inserted in most of the blades. The Sema domain uses a 'loop and hook' system to close the circle between the first and the last blades. The blades are constructed sequentially with an N-terminal beta- strand closing the circle by providing the outermost strand (D) of the seventh (C-terminal) blade. The beta-propeller is further stabilised by an extension of the N terminus, providing an additional, fifth beta-strand on the outer edge of blade 6 [, , ]. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). 
; GO: 0005515 protein binding; PDB: 3NVX_A 3NVQ_A 3OL2_A 1OLZ_B 3OKT_A 3AL9_B 3OKY_A 3AL8_B 3NVN_A 3OKW_A ....
Probab=26.05 E-value=1.6e+02 Score=33.50 Aligned_cols=71 Identities=25% Similarity=0.345 Sum_probs=36.1
Q ss_pred cccccccccCCCC--cEEEEEEe----CC----EEEEEeCCCcEE-EEcCCCCCCCCCCCCcccccccccceeeeeecCC
Q 003405 5 AFDSLELISNCSP--KIDAVASY----GL----KILLGCSDGSLK-IYSPGSSESDRSPPSDYQSLRKESYELERTISGF 73 (823)
Q Consensus 5 af~~~~l~~~~~~--~I~ci~~~----~~----~L~vGT~~G~l~-~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~ 73 (823)
+....++...... ..++|++. ++ -+|+||++|.|+ ...+....... ...+ ++...+.+
T Consensus 350 ~~~~~p~~~~~~~~~~~T~i~v~~v~~~~~~~tV~flGT~~G~l~K~v~~~~~~~~~----------~~~~-~iee~~~~ 418 (433)
T PF01403_consen 350 PISGQPLFTRQGVNYRLTSIAVDRVQVENGSYTVAFLGTDDGRLHKKVVLSNSSSGH----------YESY-IIEEIQVF 418 (433)
T ss_dssp -CCGSCSEEEETSSS-EEEEEEEEEEETTTCEEEEEEEETTSEEEEEEEESSSSTCT-----------EEE-EEEEEE-S
T ss_pred CCCCcceeeeccccceeeEEEEEEEecCCCcEEEEEEecCCceEEEEEEecCCCCcc----------cccE-EEEEEeec
Confidence 3344444433322 56766655 22 479999999999 56554433210 0011 22233444
Q ss_pred CC-CCeeEEEEecc
Q 003405 74 SK-KPILSMEVLAS 86 (823)
Q Consensus 74 ~k-~~I~qI~~~~~ 86 (823)
.. .||..+.+.++
T Consensus 419 ~~~~pI~~~~l~~~ 432 (433)
T PF01403_consen 419 PDSEPIQSMKLSPK 432 (433)
T ss_dssp TSC-EEEEEEEETT
T ss_pred CCCCceEEEEeccC
Confidence 43 48888877654
No 438
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=25.98 E-value=76 Score=24.95 Aligned_cols=28 Identities=25% Similarity=0.324 Sum_probs=22.5
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHHhhcc
Q 003405 549 YTALLELYKSNARHREALKLLHELVEES 576 (823)
Q Consensus 549 ~~~L~~ly~~~g~~~~AL~ll~~l~~~~ 576 (823)
...|+.+|...|++++|.++|.++....
T Consensus 28 ~~~la~~~~~~g~~~~A~~~l~~~~~~~ 55 (68)
T PF14559_consen 28 RLLLAQCYLKQGQYDEAEELLERLLKQD 55 (68)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHCCHGGG
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHC
Confidence 3468999999999999999999876443
No 439
>PF12816 Vps8: Golgi CORVET complex core vacuolar protein 8
Probab=25.95 E-value=1.5e+02 Score=29.85 Aligned_cols=119 Identities=13% Similarity=0.131 Sum_probs=72.9
Q ss_pred CCCChhHHHHHHHHHHHHHHHHhhhhhhhcccCcccchHHHHHHHHHhhh--cCCCChHHHhccCCCCchhhHHHHHhhc
Q 003405 663 NSISGNLQNEMVQIYLSEVLDWYSDLSAQQKWDEKAYSPTRKKLLSALES--ISGYNPEVLLKRLPADALYEERAILLGK 740 (823)
Q Consensus 663 ~~~~~~~h~~L~~lYl~~i~~~~~~~~~~~~~~~~~~~~~r~kLl~fL~~--s~~Yd~~~~L~~~~~~~l~~e~~~Ll~k 740 (823)
+...|.+-+.++..|.+.= .-..+-.++-. -+..|.+.+++.|..++|+...+++|-|
T Consensus 18 ~~lpp~v~k~lv~~y~~~~--------------------~~~~lE~lI~~LD~~~LDidq~i~lC~~~~LydalIYv~n~ 77 (196)
T PF12816_consen 18 KSLPPEVFKALVEHYASKG--------------------RLERLEQLILHLDPSSLDIDQVIKLCKKHGLYDALIYVWNR 77 (196)
T ss_pred CCCCHHHHHHHHHHHHHCC--------------------CHHHHHHHHHhCCHHhcCHHHHHHHHHHCCCCCeeeeeeec
Confidence 3467788888888887641 12333333333 3579999999999999999999999977
Q ss_pred -cccHHHHHHHHHHHhCCCc-----------------hhHHHHHHHHhcCCCCCcchhhhccchhHHHHHHHHHHHHHHh
Q 003405 741 -MNQHELALSLYVHKVFLIN-----------------QPVFLLIRRMAMDIKPLVTEHEIKHINWRVLQATIIKLFFSSL 802 (823)
Q Consensus 741 -lg~h~~AL~ilv~~L~D~~-----------------~a~~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 802 (823)
+|.+-.=|.-++..+.... ..++.-+-..+. -..-.+++.+.+=.+.-....+.++.||.-
T Consensus 78 ~l~DYvTPL~~ll~~i~~~~~~~~~~~~~~~~~~~~~~kil~Yls~~L~-Gr~yP~g~~i~~~~~~~ak~~i~~~Lfs~~ 156 (196)
T PF12816_consen 78 ALNDYVTPLEELLELIRSALNKCQIFDSSSEEDSELGYKILVYLSYCLT-GRQYPSGEIIPEEKAPSAKREIYSFLFSGT 156 (196)
T ss_pred cccCCcHHHHHHHHHHHHhhhcccccCcchhhhhhhHHHHHHHHHHHHc-CCCCCCCCCCChhHHHHHHHHHHHHHHcCC
Confidence 5888665555554433321 112222222222 222233333344556677788888888753
No 440
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=25.94 E-value=4.5e+02 Score=28.39 Aligned_cols=101 Identities=13% Similarity=0.252 Sum_probs=60.3
Q ss_pred CCcEEEEeeCCCceEEEEEc-CeEEEEEEcCCCceeEeeeecCCC-CceEEEec--CCeEEEE-EcCceEEEEc-CCCCe
Q 003405 115 KGANVYSWDDRRGFLCFARQ-KRVCIFRHDGGRGFVEVKDFGVPD-TVKSMSWC--GENICIA-IRKGYMILNA-TNGAL 188 (823)
Q Consensus 115 kg~~~fa~~~~~~~l~V~~k-kki~l~~~~~~~~f~~~kei~~~~-~~~~l~~~--~~~i~v~-~~~~y~lidl-~~~~~ 188 (823)
..+++-|.+.++..++|... +.+.||+..+...++...++.-.+ .++++.|. .+.|.=+ .++.-++... .+|+-
T Consensus 11 ~pitchAwn~drt~iAv~~~~~evhiy~~~~~~~w~~~htls~Hd~~vtgvdWap~snrIvtcs~drnayVw~~~~~~~W 90 (361)
T KOG1523|consen 11 EPITCHAWNSDRTQIAVSPNNHEVHIYSMLGADLWEPAHTLSEHDKIVTGVDWAPKSNRIVTCSHDRNAYVWTQPSGGTW 90 (361)
T ss_pred CceeeeeecCCCceEEeccCCceEEEEEecCCCCceeceehhhhCcceeEEeecCCCCceeEccCCCCccccccCCCCee
Confidence 56778888998888999854 589999998654466555554443 67788997 3444433 3344444444 44432
Q ss_pred ---eeccCCCCCCCCEEEEcc-CCeEEEEeCC
Q 003405 189 ---SEVFPSGRIGPPLVVSLL-SGELLLGKEN 216 (823)
Q Consensus 189 ---~~L~~~~~~~~p~i~~~~-~~EfLL~~~~ 216 (823)
..|+...+ ..-+|.+.+ .+.|.++.+.
T Consensus 91 kptlvLlRiNr-AAt~V~WsP~enkFAVgSga 121 (361)
T KOG1523|consen 91 KPTLVLLRINR-AATCVKWSPKENKFAVGSGA 121 (361)
T ss_pred ccceeEEEecc-ceeeEeecCcCceEEeccCc
Confidence 23333332 234555544 4567776653
No 441
>PF14655 RAB3GAP2_N: Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=25.75 E-value=5.6e+02 Score=29.06 Aligned_cols=99 Identities=21% Similarity=0.288 Sum_probs=58.5
Q ss_pred EEeeCCCceEEEEEcCeEEEE--EEcC------CCceeEeeeecCC----CCceEEEec-----C--------CeEEEEE
Q 003405 120 YSWDDRRGFLCFARQKRVCIF--RHDG------GRGFVEVKDFGVP----DTVKSMSWC-----G--------ENICIAI 174 (823)
Q Consensus 120 fa~~~~~~~l~V~~kkki~l~--~~~~------~~~f~~~kei~~~----~~~~~l~~~-----~--------~~i~v~~ 174 (823)
+++.++...|++|...++.+. .|.. ++.+...-...+. +.|+++.|. + ..|+||+
T Consensus 7 isls~~~d~laiA~~~r~vil~~~w~~~~~~~~~~~~~~~~~g~l~~~~~e~ITsi~clpl~s~~~s~~~~dw~~I~VG~ 86 (415)
T PF14655_consen 7 ISLSPDGDLLAIARGQRLVILTSKWDSSRKGENENTYSISWSGPLDDEPGECITSILCLPLSSQKRSTGGPDWTCIAVGT 86 (415)
T ss_pred EEecCCCCEEEEEcCCEEEEEEeeccccccCCCCCeEEEEeeeeccCCCCCEEEEEEEEEeecccccCCCCCcEEEEEEe
Confidence 456666677899988887776 4521 1123222222222 578888875 2 3599999
Q ss_pred cCceEEEEcCCCCe--eeccCCCCCCCCEEE------------EccCCeEEEEeCCeEEEEc
Q 003405 175 RKGYMILNATNGAL--SEVFPSGRIGPPLVV------------SLLSGELLLGKENIGVFVD 222 (823)
Q Consensus 175 ~~~y~lidl~~~~~--~~L~~~~~~~~p~i~------------~~~~~EfLL~~~~~gvfv~ 222 (823)
.++|..+=..+|.. .+++.. .|+.. ....+|+.|.+.+..+++|
T Consensus 87 ssG~vrfyte~G~LL~~Q~~h~----~pV~~ik~~~~~~~~~~~~~~eel~ily~~~v~~Id 144 (415)
T PF14655_consen 87 SSGYVRFYTENGVLLLSQLLHE----EPVLKIKCRSTKIPRHPGDSSEELSILYPSAVVIID 144 (415)
T ss_pred cccEEEEEeccchHHHHHhcCc----cceEEEEecccCCCCCCcccccEEEEEECCEEEEEe
Confidence 99999888777752 222221 12111 0112788888887766665
No 442
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=25.64 E-value=6.9e+02 Score=25.07 Aligned_cols=57 Identities=18% Similarity=0.132 Sum_probs=36.0
Q ss_pred HHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHcc--------CCHHHHHHHHHh
Q 003405 306 QIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDT--------GSYEEAMEHFLA 364 (823)
Q Consensus 306 qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~--------~~f~~A~~~f~~ 364 (823)
....+.+.|++++|+..++......+ .........-..|..++.. +++++|...|.+
T Consensus 76 la~~~~~~~~~~~A~~~~~~~l~~~p--~~~~~~~a~~~~g~~~~~~~~~~~~~~~~~~~A~~~~~~ 140 (235)
T TIGR03302 76 LAYAYYKSGDYAEAIAAADRFIRLHP--NHPDADYAYYLRGLSNYNQIDRVDRDQTAAREAFEAFQE 140 (235)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHCc--CCCchHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHH
Confidence 34566789999999999987521111 0112223344556666654 778888888875
No 443
>PF10433 MMS1_N: Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=25.42 E-value=1.1e+03 Score=27.27 Aligned_cols=155 Identities=14% Similarity=0.262 Sum_probs=84.6
Q ss_pred EEEEEeC-C--EEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecc----cC----
Q 003405 20 DAVASYG-L--KILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLAS----RQ---- 88 (823)
Q Consensus 20 ~ci~~~~-~--~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~----~~---- 88 (823)
+|++.-+ + .||+|++.|.=..|.+.. ..+.+...+.+. .||..+.+.+. ..
T Consensus 259 s~l~~l~~g~d~lf~gs~~gds~l~~~~~----------------~~l~~~~~~~N~--~Pi~D~~v~~~~~~~~~~~~~ 320 (504)
T PF10433_consen 259 SSLTYLKNGGDYLFVGSEFGDSQLLQISL----------------SNLEVLDSLPNW--GPIVDFCVVDSSNSGQPSNPS 320 (504)
T ss_dssp SEEEEESTT--EEEEEESSS-EEEEEEES----------------ESEEEEEEE------SEEEEEEE-TSSSSS-----
T ss_pred heEEEEcCCCEEEEEEEecCCcEEEEEeC----------------CCcEEEEeccCc--CCccceEEeccccCCCCcccc
Confidence 4444444 4 999998877644444331 234555566665 79999999853 22
Q ss_pred --ceeeEeC-c----EEEEeCC-CCc-ccccccCCCCcE-EEEeeCC---CceEEEEEcCeEEEEEEcC---CCceeEee
Q 003405 89 --LLLSLSE-S----IAFHRLP-NLE-TIAVLTKAKGAN-VYSWDDR---RGFLCFARQKRVCIFRHDG---GRGFVEVK 152 (823)
Q Consensus 89 --~Ll~l~d-~----l~~~~L~-~l~-~~~~i~~~kg~~-~fa~~~~---~~~l~V~~kkki~l~~~~~---~~~f~~~k 152 (823)
.++++|+ | +++..-. ..+ .........|++ .+++... ...+++.....-.++++.. ...+..+.
T Consensus 321 ~~~lv~~sG~g~~gsL~~lr~Gi~~~~~~~~~~~l~~v~~iW~l~~~~~~~~~lv~S~~~~T~vl~~~~~d~~e~~~e~~ 400 (504)
T PF10433_consen 321 SDQLVACSGAGKRGSLRILRNGIGIEGLELASSELPGVTGIWTLKLSSSDHSYLVLSFPNETRVLQISEGDDGEEVEEVE 400 (504)
T ss_dssp --EEEEEESSGGG-EEEEEEESBEEE--EEEEEEESTEEEEEEE-SSSSSBSEEEEEESSEEEEEEES----SSEEEEE-
T ss_pred cceEEEEECcCCCCcEEEEeccCCceeeeeeccCCCCceEEEEeeecCCCceEEEEEcCCceEEEEEecccCCcchhhhh
Confidence 6788887 2 6655321 111 011123344554 3555433 3467777777777888852 22333332
Q ss_pred --eecCCCCceEEEec-CCeEEEEEcCceEEEEcCCCCeeecc
Q 003405 153 --DFGVPDTVKSMSWC-GENICIAIRKGYMILNATNGALSEVF 192 (823)
Q Consensus 153 --ei~~~~~~~~l~~~-~~~i~v~~~~~y~lidl~~~~~~~L~ 192 (823)
.+....+-..++.. ++.++=.+.++..+++..+++.....
T Consensus 401 ~~~f~~~~~Tl~~~~~~~~~ivQVt~~~i~l~~~~~~~~~~~w 443 (504)
T PF10433_consen 401 EDGFDTDEPTLAAGNVGDGRIVQVTPKGIRLIDLEDGKLTQEW 443 (504)
T ss_dssp --TS-SSS-EEEEEEETTTEEEEEESSEEEEEESSSTSEEEEE
T ss_pred hccCCCCCCCeEEEEcCCCeEEEEecCeEEEEECCCCeEEEEE
Confidence 13333333344444 67888899999999998877665443
No 444
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=25.36 E-value=1.4e+02 Score=35.61 Aligned_cols=61 Identities=16% Similarity=0.130 Sum_probs=36.7
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc-CCCHH
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS-QVDIT 370 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~-~~dP~ 370 (823)
.....+...|++++|+..++.....++ ....++...|..++..|+|++|...|.++ .++|.
T Consensus 370 ~la~~~~~~g~~~eA~~~~~~al~~~p-----~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~ 431 (615)
T TIGR00990 370 KRASMNLELGDPDKAEEDFDKALKLNS-----EDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPD 431 (615)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCC-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCcc
Confidence 344556678888888887765321110 11234555677777777777777777664 34553
No 445
>COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=25.33 E-value=6.6e+02 Score=26.85 Aligned_cols=74 Identities=22% Similarity=0.279 Sum_probs=41.4
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHh----hhhcHHHHHHHHHHHHHccCCHH----------HHHHHHHhcCC--C
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASL----RAAKEGSIHIRFAHYLFDTGSYE----------EAMEHFLASQV--D 368 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~----~~~~~~~i~~~~a~~lf~~~~f~----------~A~~~f~~~~~--d 368 (823)
+-...+++.+++++|+.....+.+.+... .+.+... -...+..+.++|+|. +||..|.+..+ =
T Consensus 8 e~a~~~v~~~~~~~ai~~yk~iL~kg~s~dek~~nEqE~t-vlel~~lyv~~g~~~~l~~~i~~sre~m~~ftk~k~~Ki 86 (421)
T COG5159 8 ELANNAVKSNDIEKAIGEYKRILGKGVSKDEKTLNEQEAT-VLELFKLYVSKGDYCSLGDTITSSREAMEDFTKPKITKI 86 (421)
T ss_pred HHHHHhhhhhhHHHHHHHHHHHhcCCCChhhhhhhHHHHH-HHHHHHHHHhcCCcchHHHHHHhhHHHHHHhcchhHHHH
Confidence 34677899999999999988764332111 1111111 222345566778765 45555555332 2
Q ss_pred HHHHHHhCCCC
Q 003405 369 ITYALSLYPSI 379 (823)
Q Consensus 369 P~~vi~Lfp~l 379 (823)
.|.+|..||..
T Consensus 87 irtLiekf~~~ 97 (421)
T COG5159 87 IRTLIEKFPYS 97 (421)
T ss_pred HHHHHHhcCCC
Confidence 35566666554
No 446
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=25.18 E-value=1.3e+02 Score=22.91 Aligned_cols=51 Identities=24% Similarity=0.220 Sum_probs=30.4
Q ss_pred hHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCC
Q 003405 304 GAQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGS 354 (823)
Q Consensus 304 ~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~ 354 (823)
..+|...+..|+++.|++.++............-.-.++++.-..+...++
T Consensus 5 ~~~i~~~i~~g~~~~a~~~~~~~~~~l~~~~~~l~f~L~~q~~lell~~~~ 55 (58)
T smart00668 5 RKRIRELILKGDWDEALEWLSSLKPPLLERNSKLEFELRKQKFLELVRQGK 55 (58)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHcCHHHhccCCCchhHHHHHHHHHHHHcCC
Confidence 357888999999999999998863211000011123455555555555443
No 447
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=25.15 E-value=1.5e+02 Score=28.14 Aligned_cols=69 Identities=16% Similarity=0.181 Sum_probs=36.7
Q ss_pred HHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCH----------HHHHHhC
Q 003405 307 IVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDI----------TYALSLY 376 (823)
Q Consensus 307 I~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP----------~~vi~Lf 376 (823)
+....++++|++|++-++.+-.-. ....+...+.=..|+..|.+.+ .++.-|...+.|| ..||..|
T Consensus 54 ~yayy~~~~y~~A~a~~~rFirLh--P~hp~vdYa~Y~~gL~~~~~~~--~~~~~~~~~drD~~~~~~A~~~f~~lv~~y 129 (142)
T PF13512_consen 54 AYAYYKQGDYEEAIAAYDRFIRLH--PTHPNVDYAYYMRGLSYYEQDE--GSLQSFFRSDRDPTPARQAFRDFEQLVRRY 129 (142)
T ss_pred HHHHHHccCHHHHHHHHHHHHHhC--CCCCCccHHHHHHHHHHHHHhh--hHHhhhcccccCcHHHHHHHHHHHHHHHHC
Confidence 455679999999999988752000 0011334444445666665443 2333322334444 3466667
Q ss_pred CCC
Q 003405 377 PSI 379 (823)
Q Consensus 377 p~l 379 (823)
|+-
T Consensus 130 P~S 132 (142)
T PF13512_consen 130 PNS 132 (142)
T ss_pred cCC
Confidence 654
No 448
>KOG1129 consensus TPR repeat-containing protein [General function prediction only]
Probab=25.13 E-value=7.5e+02 Score=27.01 Aligned_cols=140 Identities=21% Similarity=0.266 Sum_probs=77.5
Q ss_pred HHHHHHhcCChhhHHhhhcCCCcccHHHHHHHHHhcCc---HHHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccc
Q 003405 511 LLQALLLTGQSSAALELLKGLNYCDVKICEEILQKKNH---YTALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQ 587 (823)
Q Consensus 511 Ll~~y~~~~~~~~l~~ll~~~n~c~~~~~~~~L~~~~~---~~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~ 587 (823)
+=|||+.++-+....+++.. -|.+... |--|+..|.+-.+.+.||.++.+-.+.--. ++..
T Consensus 229 ~gkCylrLgm~r~Aekqlqs-----------sL~q~~~~dTfllLskvY~ridQP~~AL~~~~~gld~fP~-----~VT~ 292 (478)
T KOG1129|consen 229 MGKCYLRLGMPRRAEKQLQS-----------SLTQFPHPDTFLLLSKVYQRIDQPERALLVIGEGLDSFPF-----DVTY 292 (478)
T ss_pred HHHHHHHhcChhhhHHHHHH-----------HhhcCCchhHHHHHHHHHHHhccHHHHHHHHhhhhhcCCc-----hhhh
Confidence 55899988754444444331 2222222 446789999999999999998875543211 1222
Q ss_pred cCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCcccc--ccccccCCCChHHHHHHHhhcCchhHHHHHHHHhhcccCCC
Q 003405 588 KFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQT--IELFLSGNIPADLVNSYLKQYSPSMQGRYLELMLAMNENSI 665 (823)
Q Consensus 588 ~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~--~~if~~~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~~ 665 (823)
+-+..++-+-+. +.+--.++-+.+++.+|... +.... +.+.=...|+++.+|-..+..++ ..
T Consensus 293 l~g~ARi~eam~-----~~~~a~~lYk~vlk~~~~nvEaiAcia---------~~yfY~~~PE~AlryYRRiLqmG--~~ 356 (478)
T KOG1129|consen 293 LLGQARIHEAME-----QQEDALQLYKLVLKLHPINVEAIACIA---------VGYFYDNNPEMALRYYRRILQMG--AQ 356 (478)
T ss_pred hhhhHHHHHHHH-----hHHHHHHHHHHHHhcCCccceeeeeee---------eccccCCChHHHHHHHHHHHHhc--CC
Confidence 222223333222 11222333445555554421 11111 11121234899999999998864 46
Q ss_pred ChhHHHH--HHHHHHHHHH
Q 003405 666 SGNLQNE--MVQIYLSEVL 682 (823)
Q Consensus 666 ~~~~h~~--L~~lYl~~i~ 682 (823)
++++.+. |..+|...+.
T Consensus 357 speLf~NigLCC~yaqQ~D 375 (478)
T KOG1129|consen 357 SPELFCNIGLCCLYAQQID 375 (478)
T ss_pred ChHHHhhHHHHHHhhcchh
Confidence 7887775 6789987653
No 449
>PRK11189 lipoprotein NlpI; Provisional
Probab=25.05 E-value=1.2e+02 Score=32.38 Aligned_cols=28 Identities=25% Similarity=0.236 Sum_probs=23.9
Q ss_pred cHHHHHHHHHHhccHHHHHHHHHHHhhc
Q 003405 548 HYTALLELYKSNARHREALKLLHELVEE 575 (823)
Q Consensus 548 ~~~~L~~ly~~~g~~~~AL~ll~~l~~~ 575 (823)
-|.-|+..|...|++++|+..+.+-...
T Consensus 238 a~~~Lg~~~~~~g~~~~A~~~~~~Al~~ 265 (296)
T PRK11189 238 TYFYLAKYYLSLGDLDEAAALFKLALAN 265 (296)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 3668999999999999999999887644
No 450
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein. ; PDB: 2OAJ_A.
Probab=24.91 E-value=88 Score=35.20 Aligned_cols=42 Identities=19% Similarity=0.254 Sum_probs=30.6
Q ss_pred CcccccccccCCCCcEEEEEEeC-CEEEEEeCCCcEEEEcCCC
Q 003405 4 NAFDSLELISNCSPKIDAVASYG-LKILLGCSDGSLKIYSPGS 45 (823)
Q Consensus 4 ~af~~~~l~~~~~~~I~ci~~~~-~~L~vGT~~G~l~~y~~~~ 45 (823)
+-|.+..++.--...|+|++.-+ +++.||.++|.|.+.|+.+
T Consensus 74 ~gf~P~~l~~~~~g~vtal~~S~iGFvaigy~~G~l~viD~RG 116 (395)
T PF08596_consen 74 EGFLPLTLLDAKQGPVTALKNSDIGFVAIGYESGSLVVIDLRG 116 (395)
T ss_dssp EEEEEEEEE---S-SEEEEEE-BTSEEEEEETTSEEEEEETTT
T ss_pred cccCchhheeccCCcEeEEecCCCcEEEEEecCCcEEEEECCC
Confidence 34566666666678999998866 8999999999999999864
No 451
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=24.80 E-value=2.6e+02 Score=31.72 Aligned_cols=28 Identities=21% Similarity=0.490 Sum_probs=19.6
Q ss_pred CCCcEEEEEEeCC----------EEEEEeCCCcEEEEc
Q 003405 15 CSPKIDAVASYGL----------KILLGCSDGSLKIYS 42 (823)
Q Consensus 15 ~~~~I~ci~~~~~----------~L~vGT~~G~l~~y~ 42 (823)
+...-.|++.|.- .+.|||++|.|++|.
T Consensus 279 Ld~~p~~~~~Y~~~~~~~~~~~~~llV~t~t~~LlVy~ 316 (418)
T PF14727_consen 279 LDYNPSCFCPYRVPWYNEPSTRLNLLVGTHTGTLLVYE 316 (418)
T ss_pred cCCceeeEEEEEeecccCCCCceEEEEEecCCeEEEEe
Confidence 3345566665531 489999999999995
No 452
>KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification]
Probab=24.69 E-value=1.6e+03 Score=28.99 Aligned_cols=59 Identities=15% Similarity=0.215 Sum_probs=38.9
Q ss_pred EccCCeEEE-EeCCeEEEEcCCCccccCCceeecC--CCcEEEEeCCEEEEEeC-CeEEEEEccC
Q 003405 204 SLLSGELLL-GKENIGVFVDQNGKLLQADRICWSE--APIAVIIQKPYAIALLP-RRVEVRSLRV 264 (823)
Q Consensus 204 ~~~~~EfLL-~~~~~gvfv~~~G~~~~~~~i~w~~--~P~~v~~~~PYll~~~~-~~ieV~~l~~ 264 (823)
.++++..++ .+......+|.+-+.. ..+.|.. .........|||++... +.+.++.+.+
T Consensus 593 nlg~~rriVQVtp~~~rllDg~~r~l--q~i~fd~~~~vv~~sv~dpyv~v~~~~g~i~~~~l~~ 655 (1366)
T KOG1896|consen 593 NLGNERRIVQVTPSGLRLLDGDLRML--QRIPFDSGAIVVQTSVADPYVAVRSSEGRITLYDLEE 655 (1366)
T ss_pred ecCCceEEEEEccceeEEecCcchhe--eEeccccCCcEEEEeccCceEEEEEcCCceEEEEecc
Confidence 456666666 5555555666444543 4566644 55778899999999986 5677777653
No 453
>KOG2659 consensus LisH motif-containing protein [Cytoskeleton]
Probab=24.44 E-value=1.5e+02 Score=30.43 Aligned_cols=62 Identities=26% Similarity=0.239 Sum_probs=44.9
Q ss_pred cChhHHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhc--HHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 301 VPLGAQIVQLTASGDFEEALALCKLLPPEDASLRAAK--EGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 301 ~~~~~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~--~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+-.-||..+++.|++++|+.++.++.++ .++.+. .-.++++...-+.++|.-++|++.+..
T Consensus 65 ~~eR~~Ir~~I~~G~Ie~Aie~in~l~Pe--iLd~n~~l~F~Lq~q~lIEliR~~~~eeal~F~q~ 128 (228)
T KOG2659|consen 65 MDERLQIRRAIEEGQIEEAIEKVNQLNPE--ILDTNRELFFHLQQLHLIELIREGKTEEALEFAQT 128 (228)
T ss_pred HhHHHHHHHHHHhccHHHHHHHHHHhChH--HHccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 34456899999999999999999887422 132222 234566667778899999999988765
No 454
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=23.61 E-value=7.9e+02 Score=25.00 Aligned_cols=168 Identities=14% Similarity=0.098 Sum_probs=88.9
Q ss_pred EEEEEE-eCCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceee-eeecC-CCCCCeeEEEEecccCceeeEeC
Q 003405 19 IDAVAS-YGLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELE-RTISG-FSKKPILSMEVLASRQLLLSLSE 95 (823)
Q Consensus 19 I~ci~~-~~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~-~~~~~-~~k~~I~qI~~~~~~~~Ll~l~d 95 (823)
..++.. -++.+|++...|... ++.... ..+.. ..... ......+.+.+.++.++.++-..
T Consensus 43 ~G~~~~~~~g~l~v~~~~~~~~-~d~~~g----------------~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~ 105 (246)
T PF08450_consen 43 NGMAFDRPDGRLYVADSGGIAV-VDPDTG----------------KVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSG 105 (246)
T ss_dssp EEEEEECTTSEEEEEETTCEEE-EETTTT----------------EEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEEC
T ss_pred ceEEEEccCCEEEEEEcCceEE-EecCCC----------------cEEEEeeccCCCcccCCCceEEEcCCCCEEEEecC
Confidence 333333 368899998877644 453321 11111 11111 12357888999998886666554
Q ss_pred c-----E---EEEeCCC-CcccccccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEcCCCc-eeEeee-ecCCC---C
Q 003405 96 S-----I---AFHRLPN-LETIAVLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHDGGRG-FVEVKD-FGVPD---T 159 (823)
Q Consensus 96 ~-----l---~~~~L~~-l~~~~~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~~~~~-f~~~ke-i~~~~---~ 159 (823)
. . .+|.+.. -+...........+-++++++...+.|+ .+++|.-|.+..... +...+. +.+++ .
T Consensus 106 ~~~~~~~~~g~v~~~~~~~~~~~~~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 185 (246)
T PF08450_consen 106 GGGASGIDPGSVYRIDPDGKVTVVADGLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGY 185 (246)
T ss_dssp CBCTTCGGSEEEEEEETTSEEEEEEEEESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCE
T ss_pred CCccccccccceEEECCCCeEEEEecCcccccceEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCCCCcC
Confidence 1 1 2443321 1211112334456678888887777776 567777777764322 322222 23332 4
Q ss_pred ceEEEecC-CeEEEEE--cCceEEEEcCCCCeeeccCCCCCCCCEEEEc
Q 003405 160 VKSMSWCG-ENICIAI--RKGYMILNATNGALSEVFPSGRIGPPLVVSL 205 (823)
Q Consensus 160 ~~~l~~~~-~~i~v~~--~~~y~lidl~~~~~~~L~~~~~~~~p~i~~~ 205 (823)
|-+|++.. ..|+++. ......+|.+ |+.....+.+. .+|..+.+
T Consensus 186 pDG~~vD~~G~l~va~~~~~~I~~~~p~-G~~~~~i~~p~-~~~t~~~f 232 (246)
T PF08450_consen 186 PDGLAVDSDGNLWVADWGGGRIVVFDPD-GKLLREIELPV-PRPTNCAF 232 (246)
T ss_dssp EEEEEEBTTS-EEEEEETTTEEEEEETT-SCEEEEEE-SS-SSEEEEEE
T ss_pred CCcceEcCCCCEEEEEcCCCEEEEECCC-ccEEEEEcCCC-CCEEEEEE
Confidence 88999983 4677774 4678888877 76655544432 14544444
No 455
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=23.49 E-value=1.1e+02 Score=30.78 Aligned_cols=24 Identities=21% Similarity=0.286 Sum_probs=14.9
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHh
Q 003405 341 IHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 341 i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
.+...|..+|..|+|++|..+|.+
T Consensus 146 al~~LA~~~~~~g~~~~Ai~~~~~ 169 (198)
T PRK10370 146 ALMLLASDAFMQADYAQAIELWQK 169 (198)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHH
Confidence 344456666666666666666664
No 456
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=22.92 E-value=2.3e+02 Score=20.68 Aligned_cols=29 Identities=17% Similarity=0.335 Sum_probs=23.7
Q ss_pred CCceEEEecCCeEEEEEc-CceEEEEcCCC
Q 003405 158 DTVKSMSWCGENICIAIR-KGYMILNATNG 186 (823)
Q Consensus 158 ~~~~~l~~~~~~i~v~~~-~~y~lidl~~~ 186 (823)
+....+...|+..+++.. .+..++|+++.
T Consensus 2 G~a~~v~v~g~yaYva~~~~Gl~IvDISnP 31 (42)
T PF08309_consen 2 GDARDVAVSGNYAYVADGNNGLVIVDISNP 31 (42)
T ss_pred ceEEEEEEECCEEEEEeCCCCEEEEECCCC
Confidence 345677888999999966 88999999874
No 457
>PF10647 Gmad1: Lipoprotein LpqB beta-propeller domain; InterPro: IPR018910 The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues.
Probab=22.89 E-value=8.7e+02 Score=25.22 Aligned_cols=106 Identities=14% Similarity=0.225 Sum_probs=63.6
Q ss_pred CcEEEEeeCCCceEEEEE----cCeEEEEEEc--CCC---ceeEeeeec--CCCCceEEEecC-CeEEEEEcC---ceE-
Q 003405 116 GANVYSWDDRRGFLCFAR----QKRVCIFRHD--GGR---GFVEVKDFG--VPDTVKSMSWCG-ENICIAIRK---GYM- 179 (823)
Q Consensus 116 g~~~fa~~~~~~~l~V~~----kkki~l~~~~--~~~---~f~~~kei~--~~~~~~~l~~~~-~~i~v~~~~---~y~- 179 (823)
.++.|.++++..++++.. +.+|.+--+. .+. .+....++. ....++.+.|.+ +.|+|+... ...
T Consensus 113 ~I~~l~vSpDG~RvA~v~~~~~~~~v~va~V~r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~~~~~L~V~~~~~~~~~~~ 192 (253)
T PF10647_consen 113 RITALRVSPDGTRVAVVVEDGGGGRVYVAGVVRDGDGVPRRLTGPRRVAPPLLSDVTDVAWSDDSTLVVLGRSAGGPVVR 192 (253)
T ss_pred ceEEEEECCCCcEEEEEEecCCCCeEEEEEEEeCCCCCcceeccceEecccccCcceeeeecCCCEEEEEeCCCCCceeE
Confidence 688999999988988887 6777666553 221 111122322 236889999995 467777663 122
Q ss_pred EEEcCCCCeeeccCCCCCCCCEEEEccCCeEEEEeCCeEEEEc
Q 003405 180 ILNATNGALSEVFPSGRIGPPLVVSLLSGELLLGKENIGVFVD 222 (823)
Q Consensus 180 lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL~~~~~gvfv~ 222 (823)
.+....+....+ +......|.+..-.+.+.++..++-+++..
T Consensus 193 ~v~~dG~~~~~l-~~~~~~~~v~a~~~~~~~~~~t~~~~~~~~ 234 (253)
T PF10647_consen 193 LVSVDGGPSTPL-PSVNLGVPVVAVAASPSTVYVTDDGGVLQS 234 (253)
T ss_pred EEEccCCccccc-CCCCCCcceEEeeCCCcEEEEECCCcEEEC
Confidence 355555555555 332333455555455555667777777653
No 458
>KOG1174 consensus Anaphase-promoting complex (APC), subunit 7 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=22.89 E-value=1.2e+02 Score=33.80 Aligned_cols=56 Identities=13% Similarity=0.124 Sum_probs=39.1
Q ss_pred HHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc-CCCH
Q 003405 308 VQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS-QVDI 369 (823)
Q Consensus 308 ~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~-~~dP 369 (823)
+-+...|+++++++|++.... +..--.+|...|..+-....+.+||++|..+ .+||
T Consensus 446 EL~~~Eg~~~D~i~LLe~~L~------~~~D~~LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP 502 (564)
T KOG1174|consen 446 ELCQVEGPTKDIIKLLEKHLI------IFPDVNLHNHLGDIMRAQNEPQKAMEYYYKALRQDP 502 (564)
T ss_pred HHHHhhCccchHHHHHHHHHh------hccccHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCc
Confidence 333456788888888876421 1111247888899999999999999999874 3444
No 459
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=22.75 E-value=1.6e+02 Score=28.28 Aligned_cols=71 Identities=17% Similarity=0.092 Sum_probs=41.9
Q ss_pred HHHHhcCCHHHHHHHhhhCCCcchH--hhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhcCCCHHHHHHhCCC
Q 003405 308 VQLTASGDFEEALALCKLLPPEDAS--LRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLASQVDITYALSLYPS 378 (823)
Q Consensus 308 ~~Ll~~~~~e~Al~L~~~~~~~~~~--~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~~~dP~~vi~Lfp~ 378 (823)
..+...|++++|+..++.....+.. .....+..++...|..+...|+|++|...|.++.---+..+.+.|.
T Consensus 80 ~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~i~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~a~~~~p~ 152 (168)
T CHL00033 80 LIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPG 152 (168)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHhhHHHHHcccHHHHHHHHHHHHHHHHHHHHhCcc
Confidence 3446789999999998764211110 0011234555666666678999887777776654333344455543
No 460
>PRK00178 tolB translocation protein TolB; Provisional
Probab=22.70 E-value=1.1e+03 Score=26.38 Aligned_cols=144 Identities=13% Similarity=0.164 Sum_probs=72.3
Q ss_pred eeEEEEecccCceeeEe-C-c---EEEEeCCCCcccccccCCCC-cEEEEeeCCCceEEEEEcC--eEEEEEEcC-CCce
Q 003405 78 ILSMEVLASRQLLLSLS-E-S---IAFHRLPNLETIAVLTKAKG-ANVYSWDDRRGFLCFARQK--RVCIFRHDG-GRGF 148 (823)
Q Consensus 78 I~qI~~~~~~~~Ll~l~-d-~---l~~~~L~~l~~~~~i~~~kg-~~~fa~~~~~~~l~V~~kk--ki~l~~~~~-~~~f 148 (823)
+......|+.+.++... . + |.++++.+-+ ...+....+ ....+++++...|++...+ .-.||.++- +...
T Consensus 245 ~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~-~~~lt~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~ 323 (430)
T PRK00178 245 NGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQ-LSRVTNHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRA 323 (430)
T ss_pred cCCeEECCCCCEEEEEEccCCCceEEEEECCCCC-eEEcccCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCE
Confidence 33456667776665443 2 3 7778876533 122322222 2335566666666655433 345666542 2122
Q ss_pred eEeeeecCC-CCceEEEec--CCeEEEEEcC--c--eEEEEcCCCCeeeccCCCCCCCCEEEEccCCeEEE-EeCC----
Q 003405 149 VEVKDFGVP-DTVKSMSWC--GENICIAIRK--G--YMILNATNGALSEVFPSGRIGPPLVVSLLSGELLL-GKEN---- 216 (823)
Q Consensus 149 ~~~kei~~~-~~~~~l~~~--~~~i~v~~~~--~--y~lidl~~~~~~~L~~~~~~~~p~i~~~~~~EfLL-~~~~---- 216 (823)
+. +... .......|. |+.|++.... . ..++|+.++....+...+....| ...+++..++ +.+.
T Consensus 324 ~~---lt~~~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~~~~~p--~~spdg~~i~~~~~~~g~~ 398 (430)
T PRK00178 324 ER---VTFVGNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTSLDESP--SVAPNGTMLIYATRQQGRG 398 (430)
T ss_pred EE---eecCCCCccceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCCCCCCc--eECCCCCEEEEEEecCCce
Confidence 22 2111 122234555 6677777642 2 45678888877666544322234 3345666655 3321
Q ss_pred eEEEEcCCCcc
Q 003405 217 IGVFVDQNGKL 227 (823)
Q Consensus 217 ~gvfv~~~G~~ 227 (823)
....++.+|..
T Consensus 399 ~l~~~~~~g~~ 409 (430)
T PRK00178 399 VLMLVSINGRV 409 (430)
T ss_pred EEEEEECCCCc
Confidence 23445667754
No 461
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=22.16 E-value=1.2e+02 Score=36.35 Aligned_cols=56 Identities=21% Similarity=0.129 Sum_probs=39.4
Q ss_pred HHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHhc-CCCH
Q 003405 309 QLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLAS-QVDI 369 (823)
Q Consensus 309 ~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~~-~~dP 369 (823)
-++..|++++|+..++.....++ .....+...|..++..|+|++|...|.++ ..+|
T Consensus 340 ~~~~~g~~~eA~~~~~kal~l~P-----~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~p 396 (615)
T TIGR00990 340 FKCLKGKHLEALADLSKSIELDP-----RVTQSYIKRASMNLELGDPDKAEEDFDKALKLNS 396 (615)
T ss_pred HHHHcCCHHHHHHHHHHHHHcCC-----CcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCC
Confidence 34578999999999876421111 22345666788889999999999998874 4444
No 462
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins. It mediates protein-protein interactions and the assembly of multiprotein complexes. The TPR motif consists of 3-16 tandem repeats of 34 amino acid residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed mainly by residues on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat.
Probab=22.13 E-value=1.1e+02 Score=21.75 Aligned_cols=26 Identities=12% Similarity=0.298 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHhc
Q 003405 340 SIHIRFAHYLFDTGSYEEAMEHFLAS 365 (823)
Q Consensus 340 ~i~~~~a~~lf~~~~f~~A~~~f~~~ 365 (823)
+++.+.|..-+...+|++|..-|.++
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~a 27 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKA 27 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 36777899999999999999998874
No 463
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=21.72 E-value=3e+02 Score=31.79 Aligned_cols=72 Identities=21% Similarity=0.259 Sum_probs=49.2
Q ss_pred CcEEEEEEe--CCEEEEEeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCeeEEEEecccCceeeEe
Q 003405 17 PKIDAVASY--GLKILLGCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPILSMEVLASRQLLLSLS 94 (823)
Q Consensus 17 ~~I~ci~~~--~~~L~vGT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I~qI~~~~~~~~Ll~l~ 94 (823)
.-|.|.+.. |..|.=|+.|-.+.+|+..+ +++...+..-+...|...+.+|..+==++++
T Consensus 51 GCVN~LeWn~dG~lL~SGSDD~r~ivWd~~~------------------~KllhsI~TgHtaNIFsvKFvP~tnnriv~s 112 (758)
T KOG1310|consen 51 GCVNCLEWNADGELLASGSDDTRLIVWDPFE------------------YKLLHSISTGHTANIFSVKFVPYTNNRIVLS 112 (758)
T ss_pred ceecceeecCCCCEEeecCCcceEEeecchh------------------cceeeeeecccccceeEEeeeccCCCeEEEe
Confidence 457777755 46899999999999998442 3333333323357899999999775444444
Q ss_pred C---c-EEEEeCCCCc
Q 003405 95 E---S-IAFHRLPNLE 106 (823)
Q Consensus 95 d---~-l~~~~L~~l~ 106 (823)
. . |++|++...+
T Consensus 113 gAgDk~i~lfdl~~~~ 128 (758)
T KOG1310|consen 113 GAGDKLIKLFDLDSSK 128 (758)
T ss_pred ccCcceEEEEeccccc
Confidence 3 2 9999998654
No 464
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=21.62 E-value=1.8e+03 Score=28.47 Aligned_cols=241 Identities=12% Similarity=0.056 Sum_probs=0.0
Q ss_pred EEEEeCCEEEE-EeCCCcEEEEcCCCCCCCCCCCCcccccccccceeeeeecCCCCCCe----------------eEEEE
Q 003405 21 AVASYGLKILL-GCSDGSLKIYSPGSSESDRSPPSDYQSLRKESYELERTISGFSKKPI----------------LSMEV 83 (823)
Q Consensus 21 ci~~~~~~L~v-GT~~G~l~~y~~~~~~~~~~~~~d~~~l~~~~~~l~~~~~~~~k~~I----------------~qI~~ 83 (823)
|++..++.||| -+.++.|..++..... ++.+.+.+.... ..|.+
T Consensus 630 avd~~gn~LYVaDt~n~~Ir~id~~~~~-------------------V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~ 690 (1057)
T PLN02919 630 AYNAKKNLLYVADTENHALREIDFVNET-------------------VRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCF 690 (1057)
T ss_pred EEeCCCCEEEEEeCCCceEEEEecCCCE-------------------EEEEeccCcccCCCCCChhhhHhhcCCCeEEEE
Q ss_pred ec-ccCceeeEeCc--EEEEeCCC---------------CcccccccCCCCcEEEEeeCCCceEEEE--EcCeEEEEEEc
Q 003405 84 LA-SRQLLLSLSES--IAFHRLPN---------------LETIAVLTKAKGANVYSWDDRRGFLCFA--RQKRVCIFRHD 143 (823)
Q Consensus 84 ~~-~~~~Ll~l~d~--l~~~~L~~---------------l~~~~~i~~~kg~~~fa~~~~~~~l~V~--~kkki~l~~~~ 143 (823)
.+ ...+.++-+++ |.+|+..+ ...........+.+.++++++...++|+ ..++|.+|...
T Consensus 691 dp~~g~LyVad~~~~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~ 770 (1057)
T PLN02919 691 EPVNEKVYIAMAGQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLK 770 (1057)
T ss_pred ecCCCeEEEEECCCCeEEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECC
Q ss_pred CCCceeEee-------------------eecCCCCceEEEec-CCeEEEEEc--CceEEEEcCCCCeeeccCCCCCC---
Q 003405 144 GGRGFVEVK-------------------DFGVPDTVKSMSWC-GENICIAIR--KGYMILNATNGALSEVFPSGRIG--- 198 (823)
Q Consensus 144 ~~~~f~~~k-------------------ei~~~~~~~~l~~~-~~~i~v~~~--~~y~lidl~~~~~~~L~~~~~~~--- 198 (823)
.+......- .-..-..|.++++. ++.++|+.. +...++|..++....+...|..+
T Consensus 771 tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~d 850 (1057)
T PLN02919 771 TGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKD 850 (1057)
T ss_pred CCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCC
Q ss_pred ---------CCEEEEc-cCCeEEE--EeCCeEEEEcCCCccc-cCCceeecC--CCcEEEEeCCEEEEEeCCeEEEEEcc
Q 003405 199 ---------PPLVVSL-LSGELLL--GKENIGVFVDQNGKLL-QADRICWSE--APIAVIIQKPYAIALLPRRVEVRSLR 263 (823)
Q Consensus 199 ---------~p~i~~~-~~~EfLL--~~~~~gvfv~~~G~~~-~~~~i~w~~--~P~~v~~~~PYll~~~~~~ieV~~l~ 263 (823)
.|.-+.+ +++.+++ ..++..-.+|.+.... +..++...+ +|....-...++-...+..+.|.++.
T Consensus 851 G~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 930 (1057)
T PLN02919 851 GKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNKGEAAEILTLELKGVQPPRPKSKSLKRLRRRSSADTQVIKVD 930 (1057)
T ss_pred CcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCCCccceeEeeccccccCCCCcccchhhhhhcccccCceeecC
Q ss_pred CC-----CceeEEEeeCCccccc
Q 003405 264 VP-----YALIQTIVLQNVRHLI 281 (823)
Q Consensus 264 ~~-----~~lvQ~i~l~~~~~l~ 281 (823)
+ +.+--.|.++...++.
T Consensus 931 -~~~~~~~~~~~~~~~~~~~~~~ 952 (1057)
T PLN02919 931 -GVTSLEGDLQLKISLPPGYHFS 952 (1057)
T ss_pred -CcccccceEEEEEECCCCCccC
No 465
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.29 E-value=1.7e+02 Score=29.35 Aligned_cols=22 Identities=14% Similarity=0.237 Sum_probs=10.7
Q ss_pred HHHHHHHHHccCCHHHHHHHHH
Q 003405 342 HIRFAHYLFDTGSYEEAMEHFL 363 (823)
Q Consensus 342 ~~~~a~~lf~~~~f~~A~~~f~ 363 (823)
..+.|..++.+++||+|+..+.
T Consensus 129 ~lRLArvq~q~~k~D~AL~~L~ 150 (207)
T COG2976 129 ALRLARVQLQQKKADAALKTLD 150 (207)
T ss_pred HHHHHHHHHHhhhHHHHHHHHh
Confidence 3344555555555555554443
No 466
>PRK14574 hmsH outer membrane protein; Provisional
Probab=21.26 E-value=9.1e+02 Score=30.13 Aligned_cols=56 Identities=11% Similarity=0.022 Sum_probs=42.5
Q ss_pred HHHHHHHhcCCHHHHHHHhhhCCCcchHhhhhcHHHHHHHHHHHHHccCCHHHHHHHHHh
Q 003405 305 AQIVQLTASGDFEEALALCKLLPPEDASLRAAKEGSIHIRFAHYLFDTGSYEEAMEHFLA 364 (823)
Q Consensus 305 ~qI~~Ll~~~~~e~Al~L~~~~~~~~~~~~~~~~~~i~~~~a~~lf~~~~f~~A~~~f~~ 364 (823)
+.+-.|...|++.+++...+.+....... -..+...+|.+++..++=++|...|.+
T Consensus 297 Drl~aL~~r~r~~~vi~~y~~l~~~~~~~----P~y~~~a~adayl~~~~P~kA~~l~~~ 352 (822)
T PRK14574 297 DRLGALLVRHQTADLIKEYEAMEAEGYKM----PDYARRWAASAYIDRRLPEKAAPILSS 352 (822)
T ss_pred HHHHHHHHhhhHHHHHHHHHHhhhcCCCC----CHHHHHHHHHHHHhcCCcHHHHHHHHH
Confidence 34445678999999999999885332111 236788889999999999999999886
No 467
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=21.06 E-value=3.7e+02 Score=32.15 Aligned_cols=153 Identities=10% Similarity=0.136 Sum_probs=0.0
Q ss_pred EEEEEEeC--CEEEEEeCCCcEEEEcCCCCC------------------------CCCCCCCcccccccccceeeeeecC
Q 003405 19 IDAVASYG--LKILLGCSDGSLKIYSPGSSE------------------------SDRSPPSDYQSLRKESYELERTISG 72 (823)
Q Consensus 19 I~ci~~~~--~~L~vGT~~G~l~~y~~~~~~------------------------~~~~~~~d~~~l~~~~~~l~~~~~~ 72 (823)
|.|+.-.- ..|..|+.+|+|.+|++++.. -..+...|..+...+..-++..+++
T Consensus 73 IeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStdtd~~iwD~Rk~Gc~~~~~s 152 (825)
T KOG0267|consen 73 IESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYGEFFASGSTDTDLKIWDIRKKGCSHTYKS 152 (825)
T ss_pred ceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccCcceeeeccceEEeccccccccceehhhhccCceeeecC
Q ss_pred CCCCCeeEEEEecccCceeeEeCc--EEEEeCCCCccccccc-CCCCcEEEEeeCCCceEEEE-EcCeEEEEEEcCCCce
Q 003405 73 FSKKPILSMEVLASRQLLLSLSES--IAFHRLPNLETIAVLT-KAKGANVYSWDDRRGFLCFA-RQKRVCIFRHDGGRGF 148 (823)
Q Consensus 73 ~~k~~I~qI~~~~~~~~Ll~l~d~--l~~~~L~~l~~~~~i~-~~kg~~~fa~~~~~~~l~V~-~kkki~l~~~~~~~~f 148 (823)
+..-|+-+..-|....+..=+|. +++|++..-+...... ..-.++..-.++....++-| .++.+.++... .|
T Consensus 153 -~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~ef~~~e~~v~sle~hp~e~Lla~Gs~d~tv~f~dle---tf 228 (825)
T KOG0267|consen 153 -HTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGKLSKEFKSHEGKVQSLEFHPLEVLLAPGSSDRTVRFWDLE---TF 228 (825)
T ss_pred -CcceeEEEeecCCCceeeccCCcceeeeecccccccccccccccccccccccCchhhhhccCCCCceeeeeccc---ee
Q ss_pred eEee-eecCCCCceEEEecCCeEEEEEc
Q 003405 149 VEVK-DFGVPDTVKSMSWCGENICIAIR 175 (823)
Q Consensus 149 ~~~k-ei~~~~~~~~l~~~~~~i~v~~~ 175 (823)
..+- .=...+.|++..|..+.-++.+.
T Consensus 229 e~I~s~~~~~~~v~~~~fn~~~~~~~~G 256 (825)
T KOG0267|consen 229 EVISSGKPETDGVRSLAFNPDGKIVLSG 256 (825)
T ss_pred EEeeccCCccCCceeeeecCCceeeecC
No 468
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their transmembrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attached protein domains and shares sequence similarity with the TM region of MAS20. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=20.55 E-value=1.4e+02 Score=27.52 Aligned_cols=38 Identities=29% Similarity=0.522 Sum_probs=30.3
Q ss_pred HHHHHHHHccCCHHHHHHHHHhc---CCCHHHHHHhCCCCC
Q 003405 343 IRFAHYLFDTGSYEEAMEHFLAS---QVDITYALSLYPSIV 380 (823)
Q Consensus 343 ~~~a~~lf~~~~f~~A~~~f~~~---~~dP~~vi~Lfp~l~ 380 (823)
.+.|..|..+|++++|+.||..+ --.|.++|..|-.-+
T Consensus 67 V~lGE~L~~~G~~~~aa~hf~nAl~V~~qP~~LL~i~q~tl 107 (121)
T PF02064_consen 67 VQLGEQLLAQGDYEEAAEHFYNALKVCPQPAELLQIYQKTL 107 (121)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHTSSSHHHHHHHHHHHS
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhCCCHHHHHHHHHhhC
Confidence 34699999999999999999885 468899988876554
No 469
>KOG2005 consensus 26S proteasome regulatory complex, subunit RPN1/PSMD2 [Posttranslational modification, protein turnover, chaperones]
Probab=20.49 E-value=2.9e+02 Score=32.83 Aligned_cols=74 Identities=19% Similarity=0.200 Sum_probs=49.2
Q ss_pred hhhHHHhhhhhhhcCcc-ccccccccCCCChHHHHHHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHH
Q 003405 606 PMLVLEFSMLVLESCPT-QTIELFLSGNIPADLVNSYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSE 680 (823)
Q Consensus 606 ~~li~~y~~wll~~~p~-~~~~if~~~~l~~~~Vl~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~ 680 (823)
.+|+.+-.++.|+||.+ +|++++++ -=..+.+++|...+.-+-.+.||+.++.--....+..+-..-...|+..
T Consensus 178 ~~l~~~iV~f~mkHNAE~eAiDlL~E-ve~id~l~~~Vd~~n~~RvclYl~sc~~~lP~Pdd~~ll~~a~~IYlKf 252 (878)
T KOG2005|consen 178 LDLVQEIVPFHMKHNAEFEAIDLLME-VEGIDLLLDYVDEHNYQRVCLYLTSCVPLLPGPDDVALLRTALKIYLKF 252 (878)
T ss_pred HHHHHHHHHHHHhccchhHHHHHHHH-hhhHhHHHHHhhhhhHHHHHHHHHHHhhcCCCchhhHHHHHHHHHHHHH
Confidence 46778888999999876 57777776 1124567888887766678889999874211112233666666677764
No 470
>PRK14574 hmsH outer membrane protein; Provisional
Probab=20.34 E-value=1.2e+03 Score=29.15 Aligned_cols=174 Identities=12% Similarity=0.067 Sum_probs=84.2
Q ss_pred HHHHHHHHHhccHHHHHHHHHHHhhcccCCCCcccccccCChHHHHHHhhcCCCCChhhHHHhhhhhhhcCccccccccc
Q 003405 550 TALLELYKSNARHREALKLLHELVEESKSNQSQDEHTQKFNPESIIEYLKPLCGTDPMLVLEFSMLVLESCPTQTIELFL 629 (823)
Q Consensus 550 ~~L~~ly~~~g~~~~AL~ll~~l~~~~~~d~~~~~~~~~~~~~~~i~yL~~L~~~~~~li~~y~~wll~~~p~~~~~if~ 629 (823)
.+-+.+..+.|+++.|++.+.+.....-... +. ....+.++-.++ +.+.|+..+.
T Consensus 38 y~~aii~~r~Gd~~~Al~~L~qaL~~~P~~~--~a------v~dll~l~~~~G-----------------~~~~A~~~~e 92 (822)
T PRK14574 38 YDSLIIRARAGDTAPVLDYLQEESKAGPLQS--GQ------VDDWLQIAGWAG-----------------RDQEVIDVYE 92 (822)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHhhCccch--hh------HHHHHHHHHHcC-----------------CcHHHHHHHH
Confidence 3567788899999999999998764321100 00 001122221111 2222333222
Q ss_pred c----CCCChHHHH----HHHhhcCchhHHHHHHHHhhcccCCCChhHHHHHHHHHHHHHHH--HhhhhhhhcccCcccc
Q 003405 630 S----GNIPADLVN----SYLKQYSPSMQGRYLELMLAMNENSISGNLQNEMVQIYLSEVLD--WYSDLSAQQKWDEKAY 699 (823)
Q Consensus 630 ~----~~l~~~~Vl----~~L~~~~~~~~~~YLE~li~~~~~~~~~~~h~~L~~lYl~~i~~--~~~~~~~~~~~~~~~~ 699 (823)
+ ++.+....+ -+......+.++..++.++.. ++ .++++..-|+.+|++.-.. .+.....-...++ .
T Consensus 93 ka~~p~n~~~~~llalA~ly~~~gdyd~Aiely~kaL~~-dP-~n~~~l~gLa~~y~~~~q~~eAl~~l~~l~~~dp-~- 168 (822)
T PRK14574 93 RYQSSMNISSRGLASAARAYRNEKRWDQALALWQSSLKK-DP-TNPDLISGMIMTQADAGRGGVVLKQATELAERDP-T- 168 (822)
T ss_pred HhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh-CC-CCHHHHHHHHHHHhhcCCHHHHHHHHHHhcccCc-c-
Confidence 2 123332222 233333456777888888764 33 3466666777777764100 0000000000111 1
Q ss_pred hHHHHHHHHHhhh-cCCC-Ch----HHHhccCCCC-chhhHHHHHhhccccHHHHHHHHHH
Q 003405 700 SPTRKKLLSALES-ISGY-NP----EVLLKRLPAD-ALYEERAILLGKMNQHELALSLYVH 753 (823)
Q Consensus 700 ~~~r~kLl~fL~~-s~~Y-d~----~~~L~~~~~~-~l~~e~~~Ll~klg~h~~AL~ilv~ 753 (823)
.... +++.+|.. ...+ +. +++++.-|.+ +...+.+..+.+.|-+..|++++-.
T Consensus 169 ~~~~-l~layL~~~~~~~~~AL~~~ekll~~~P~n~e~~~~~~~~l~~~~~~~~a~~l~~~ 228 (822)
T PRK14574 169 VQNY-MTLSYLNRATDRNYDALQASSEAVRLAPTSEEVLKNHLEILQRNRIVEPALRLAKE 228 (822)
T ss_pred hHHH-HHHHHHHHhcchHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCcHHHHHHHHh
Confidence 1112 33333332 2222 22 3344444443 5777888889999999999988774
No 471
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=20.29 E-value=8.5e+02 Score=29.13 Aligned_cols=110 Identities=13% Similarity=0.209 Sum_probs=63.8
Q ss_pred CCCCCeeEEEEecccCceeeEe-C-cEEEEeCCCCccc-ccccCCCCcEEEE----eeCCCceEEEE-EcCeEEEEEEcC
Q 003405 73 FSKKPILSMEVLASRQLLLSLS-E-SIAFHRLPNLETI-AVLTKAKGANVYS----WDDRRGFLCFA-RQKRVCIFRHDG 144 (823)
Q Consensus 73 ~~k~~I~qI~~~~~~~~Ll~l~-d-~l~~~~L~~l~~~-~~i~~~kg~~~fa----~~~~~~~l~V~-~kkki~l~~~~~ 144 (823)
++.-.|+||..-|+..+|+++| | .+++|.-.+-... ......|.=+-+- ..++.-.++-+ ..|++.+|+...
T Consensus 570 ~HsLTVT~l~FSpdg~~LLsvsRDRt~sl~~~~~~~~~e~~fa~~k~HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~~ 649 (764)
T KOG1063|consen 570 GHSLTVTRLAFSPDGRYLLSVSRDRTVSLYEVQEDIKDEFRFACLKAHTRIIWDCSWSPDEKYFATASRDKKVKVWEEPD 649 (764)
T ss_pred ccceEEEEEEECCCCcEEEEeecCceEEeeeeecccchhhhhccccccceEEEEcccCcccceeEEecCCceEEEEeccC
Confidence 3467899999999999999998 4 3888875321100 0001112111111 12322225555 567899999875
Q ss_pred CCceeEeee---ecCCCCceEEEecC-------CeEEEEEcCc-eEEEEc
Q 003405 145 GRGFVEVKD---FGVPDTVKSMSWCG-------ENICIAIRKG-YMILNA 183 (823)
Q Consensus 145 ~~~f~~~ke---i~~~~~~~~l~~~~-------~~i~v~~~~~-y~lidl 183 (823)
++ -..+.+ +.+.+.+++++|.+ +.+.||..++ .+++..
T Consensus 650 ~~-d~~i~~~a~~~~~~aVTAv~~~~~~~~e~~~~vavGle~GeI~l~~~ 698 (764)
T KOG1063|consen 650 LR-DKYISRFACLKFSLAVTAVAYLPVDHNEKGDVVAVGLEKGEIVLWRR 698 (764)
T ss_pred ch-hhhhhhhchhccCCceeeEEeeccccccccceEEEEecccEEEEEec
Confidence 42 122223 45568899998872 3578888754 455553
Done!