Query 000944
Match_columns 1213
No_of_seqs 168 out of 717
Neff 8.9
Searched_HMMs 46136
Date Thu Mar 28 11:21:16 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/000944.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/000944hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1897 Damage-specific DNA bi 100.0 6E-171 1E-175 1470.8 113.0 1072 1-1211 1-1095(1096)
2 KOG1898 Splicing factor 3b, su 100.0 2E-166 4E-171 1437.9 91.5 1197 1-1212 1-1204(1205)
3 KOG1896 mRNA cleavage and poly 100.0 2E-143 5E-148 1260.0 97.5 1135 1-1210 1-1361(1366)
4 COG5161 SFT1 Pre-mRNA cleavage 100.0 3.5E-96 8E-101 828.6 72.3 1125 1-1209 2-1313(1319)
5 PF10433 MMS1_N: Mono-function 100.0 2.6E-59 5.6E-64 568.4 48.7 458 75-591 1-503 (504)
6 PF03178 CPSF_A: CPSF A subuni 100.0 3.9E-50 8.4E-55 460.6 36.4 300 860-1181 2-321 (321)
7 KOG2048 WD40 repeat protein [G 98.3 0.012 2.7E-07 69.1 39.1 163 871-1056 372-548 (691)
8 KOG0318 WD40 repeat stress pro 98.1 0.031 6.7E-07 64.0 39.9 141 897-1055 454-601 (603)
9 KOG0647 mRNA export protein (c 97.6 0.01 2.3E-07 63.3 20.7 191 581-798 27-219 (347)
10 KOG1036 Mitotic spindle checkp 97.5 0.014 2.9E-07 62.7 21.0 182 582-799 14-199 (323)
11 PRK11028 6-phosphogluconolacto 97.5 0.36 7.7E-06 55.8 36.1 100 557-676 10-111 (330)
12 KOG1446 Histone H3 (Lys4) meth 97.5 0.24 5.2E-06 53.7 29.4 149 746-974 150-307 (311)
13 KOG0291 WD40-repeat-containing 97.5 0.48 1E-05 57.0 45.4 365 582-1057 146-551 (893)
14 PF10282 Lactonase: Lactonase, 97.3 0.3 6.5E-06 56.8 31.2 226 548-796 47-310 (345)
15 KOG2106 Uncharacterized conser 97.3 0.61 1.3E-05 53.6 37.5 383 542-1054 203-624 (626)
16 KOG0318 WD40 repeat stress pro 97.2 0.65 1.4E-05 53.7 35.6 303 551-969 293-601 (603)
17 PF10282 Lactonase: Lactonase, 97.2 0.73 1.6E-05 53.6 33.0 287 602-978 2-332 (345)
18 KOG1539 WD repeat protein [Gen 97.1 1.2 2.5E-05 54.5 46.2 239 393-677 27-277 (910)
19 KOG0291 WD40-repeat-containing 97.1 1.2 2.5E-05 53.9 49.8 261 502-798 266-541 (893)
20 PRK11028 6-phosphogluconolacto 97.0 1 2.3E-05 51.9 36.3 116 919-1035 196-327 (330)
21 cd00200 WD40 WD40 domain, foun 96.9 0.87 1.9E-05 49.9 31.2 171 861-1057 74-250 (289)
22 PF03178 CPSF_A: CPSF A subuni 96.8 0.54 1.2E-05 54.1 26.5 171 860-1054 62-263 (321)
23 cd00200 WD40 WD40 domain, foun 96.7 1.1 2.3E-05 49.2 28.2 125 861-1009 158-288 (289)
24 KOG1274 WD40 repeat protein [G 96.7 0.4 8.7E-06 58.9 24.5 222 599-929 25-263 (933)
25 KOG0319 WD40-repeat-containing 96.6 2.8 6E-05 50.7 30.6 172 860-1057 260-443 (775)
26 KOG0319 WD40-repeat-containing 96.5 3.2 6.8E-05 50.3 40.5 226 541-798 64-302 (775)
27 KOG0282 mRNA splicing factor [ 96.3 0.032 6.9E-07 63.3 11.7 175 860-1057 280-463 (503)
28 KOG0306 WD40-repeat-containing 96.1 4.9 0.00011 48.8 39.5 169 861-1056 395-580 (888)
29 KOG0647 mRNA export protein (c 96.1 1.4 2.9E-05 47.8 21.8 131 919-1058 93-230 (347)
30 KOG1539 WD repeat protein [Gen 96.1 5.4 0.00012 49.1 37.3 117 540-679 203-322 (910)
31 KOG0285 Pleiotropic regulator 96.1 1 2.2E-05 49.6 21.0 71 584-676 154-224 (460)
32 PLN00181 protein SPA1-RELATED; 96.0 5.4 0.00012 52.2 32.1 175 861-1055 599-792 (793)
33 KOG0276 Vesicle coat complex C 95.8 4.5 9.9E-05 48.0 25.7 61 948-1013 429-491 (794)
34 PF08596 Lgl_C: Lethal giant l 95.7 1.6 3.5E-05 51.2 22.8 217 541-774 3-300 (395)
35 KOG0296 Angio-associated migra 95.7 4.8 0.0001 44.9 26.5 83 919-1005 307-392 (399)
36 KOG0315 G-protein beta subunit 95.6 2.1 4.5E-05 45.2 19.9 174 860-1056 105-288 (311)
37 PF00780 CNH: CNH domain; Int 95.4 5.9 0.00013 44.2 25.8 42 938-979 223-264 (275)
38 KOG2321 WD40 repeat protein [G 95.2 1.3 2.8E-05 51.8 19.0 157 539-719 136-300 (703)
39 KOG2110 Uncharacterized conser 95.2 6.8 0.00015 43.8 23.6 169 861-1059 69-251 (391)
40 KOG1273 WD40 repeat protein [G 95.1 0.91 2E-05 49.2 16.2 202 898-1149 35-251 (405)
41 KOG0299 U3 snoRNP-associated p 95.1 1.5 3.2E-05 50.1 18.5 48 919-966 401-450 (479)
42 KOG2111 Uncharacterized conser 94.9 7.2 0.00016 42.8 27.2 290 747-1127 18-320 (346)
43 KOG0310 Conserved WD40 repeat- 94.9 6.9 0.00015 45.3 23.3 273 581-968 26-306 (487)
44 COG2706 3-carboxymuconate cycl 94.9 8.4 0.00018 43.1 33.1 131 546-696 199-342 (346)
45 KOG2111 Uncharacterized conser 94.8 7.8 0.00017 42.5 26.3 75 538-618 180-257 (346)
46 KOG0283 WD40 repeat-containing 94.8 0.77 1.7E-05 56.1 16.5 110 541-675 371-481 (712)
47 PLN00181 protein SPA1-RELATED; 94.5 12 0.00027 48.9 28.5 131 919-1057 554-691 (793)
48 KOG0263 Transcription initiati 94.5 4.3 9.4E-05 49.5 21.6 90 861-971 558-650 (707)
49 KOG0282 mRNA splicing factor [ 94.4 0.77 1.7E-05 52.6 14.2 248 717-1053 244-502 (503)
50 KOG2055 WD40 repeat protein [G 94.1 9.3 0.0002 44.0 21.6 138 655-796 223-363 (514)
51 KOG0290 Conserved WD40 repeat- 93.6 1.2 2.7E-05 47.7 13.2 151 896-1057 57-228 (364)
52 KOG1273 WD40 repeat protein [G 93.4 14 0.00031 40.4 21.2 138 897-1057 164-323 (405)
53 KOG0645 WD40 repeat protein [G 93.3 14 0.0003 39.8 26.5 263 748-1124 27-306 (312)
54 KOG2106 Uncharacterized conser 93.0 22 0.00048 41.5 40.4 137 899-1057 419-582 (626)
55 KOG1036 Mitotic spindle checkp 93.0 17 0.00036 39.9 27.3 282 541-929 15-317 (323)
56 KOG2055 WD40 repeat protein [G 92.7 20 0.00043 41.4 21.4 123 529-675 290-417 (514)
57 KOG0279 G protein beta subunit 92.3 19 0.00042 38.9 22.9 187 455-675 106-313 (315)
58 KOG0299 U3 snoRNP-associated p 92.3 1.3 2.8E-05 50.5 11.7 125 922-1056 226-356 (479)
59 COG2706 3-carboxymuconate cycl 92.1 24 0.00052 39.6 29.4 217 558-795 63-308 (346)
60 PTZ00421 coronin; Provisional 91.9 37 0.00081 41.3 29.6 115 545-675 36-156 (493)
61 KOG0646 WD40 repeat protein [G 91.7 5.1 0.00011 46.0 15.4 102 954-1057 189-308 (476)
62 KOG1538 Uncharacterized conser 91.7 35 0.00076 41.1 22.4 253 16-310 13-291 (1081)
63 PF14727 PHTB1_N: PTHB1 N-term 91.2 37 0.00081 40.1 30.8 208 440-674 117-359 (418)
64 KOG0310 Conserved WD40 repeat- 90.8 37 0.00081 39.6 21.2 111 538-672 195-306 (487)
65 KOG4378 Nuclear protein COP1 [ 90.3 24 0.00051 41.1 18.9 94 919-1015 186-283 (673)
66 KOG1407 WD40 repeat protein [F 90.2 29 0.00064 37.2 19.9 178 547-799 116-294 (313)
67 KOG0273 Beta-transducin family 90.2 42 0.00092 39.0 26.4 88 919-1011 431-522 (524)
68 KOG0294 WD40 repeat-containing 89.9 35 0.00077 37.6 25.5 72 582-675 44-115 (362)
69 PF14727 PHTB1_N: PTHB1 N-term 89.6 50 0.0011 39.0 29.6 224 551-795 90-357 (418)
70 PF02239 Cytochrom_D1: Cytochr 89.4 49 0.0011 38.7 27.4 89 520-617 16-108 (369)
71 KOG1517 Guanine nucleotide bin 88.8 48 0.001 42.5 21.3 206 861-1104 1087-1313(1387)
72 KOG0315 G-protein beta subunit 88.7 9.3 0.0002 40.5 13.2 104 949-1057 48-155 (311)
73 KOG0276 Vesicle coat complex C 88.4 27 0.00058 41.9 18.1 175 857-1057 32-216 (794)
74 KOG3881 Uncharacterized conser 88.2 6.9 0.00015 44.1 12.7 111 942-1056 104-234 (412)
75 KOG0278 Serine/threonine kinas 88.1 6.9 0.00015 41.4 11.9 91 859-970 204-297 (334)
76 KOG2110 Uncharacterized conser 88.0 52 0.0011 37.1 22.5 161 935-1126 79-245 (391)
77 KOG0283 WD40 repeat-containing 87.9 18 0.0004 44.6 17.1 175 861-1059 391-579 (712)
78 KOG0266 WD40 repeat-containing 87.5 34 0.00074 41.4 19.8 135 919-1057 224-365 (456)
79 KOG2445 Nuclear pore complex c 87.2 2.3 5E-05 46.2 8.1 83 880-970 167-256 (361)
80 KOG0274 Cdc4 and related F-box 86.5 57 0.0012 40.1 20.7 168 861-1056 272-441 (537)
81 KOG2114 Vacuolar assembly/sort 86.3 42 0.00092 42.0 18.8 200 541-764 27-244 (933)
82 KOG2321 WD40 repeat protein [G 86.2 14 0.00031 43.7 14.2 103 954-1056 147-258 (703)
83 KOG0316 Conserved WD40 repeat- 86.1 50 0.0011 35.0 21.9 88 860-968 205-297 (307)
84 KOG1897 Damage-specific DNA bi 85.9 1.2E+02 0.0025 39.0 43.6 171 922-1127 749-940 (1096)
85 KOG0646 WD40 repeat protein [G 85.5 21 0.00046 41.2 14.9 108 544-675 45-153 (476)
86 KOG1538 Uncharacterized conser 85.1 86 0.0019 38.1 19.8 146 886-1055 136-292 (1081)
87 TIGR03866 PQQ_ABC_repeats PQQ- 84.8 69 0.0015 35.5 31.7 98 861-976 180-285 (300)
88 KOG0274 Cdc4 and related F-box 84.6 28 0.00061 42.7 16.9 170 861-1058 312-484 (537)
89 KOG2919 Guanine nucleotide-bin 84.0 45 0.00097 36.9 15.8 176 861-1057 134-328 (406)
90 KOG2048 WD40 repeat protein [G 83.8 1.2E+02 0.0025 37.2 47.1 103 548-674 214-318 (691)
91 KOG1274 WD40 repeat protein [G 83.7 1.4E+02 0.0029 38.0 24.6 113 538-676 55-169 (933)
92 KOG0305 Anaphase promoting com 83.4 82 0.0018 37.8 19.3 207 550-795 189-405 (484)
93 KOG0289 mRNA splicing factor [ 83.0 98 0.0021 35.8 22.2 39 861-904 412-450 (506)
94 KOG0650 WD40 repeat nucleolar 82.9 17 0.00036 43.3 13.0 102 897-1012 532-637 (733)
95 KOG0296 Angio-associated migra 82.8 91 0.002 35.3 25.9 112 538-674 105-219 (399)
96 KOG0277 Peroxisomal targeting 82.6 27 0.00058 37.4 13.1 67 898-970 21-91 (311)
97 KOG2096 WD40 repeat protein [G 82.5 12 0.00026 41.0 10.9 114 16-142 134-256 (420)
98 KOG0266 WD40 repeat-containing 82.3 95 0.0021 37.6 20.3 112 542-676 206-319 (456)
99 KOG0278 Serine/threonine kinas 82.0 23 0.0005 37.6 12.3 131 920-1060 165-301 (334)
100 KOG0772 Uncharacterized conser 81.5 19 0.00042 42.1 12.6 176 919-1128 290-486 (641)
101 PTZ00420 coronin; Provisional 81.3 1.5E+02 0.0032 36.8 29.0 132 920-1057 148-294 (568)
102 PF08596 Lgl_C: Lethal giant l 80.4 52 0.0011 38.8 16.5 107 861-979 108-252 (395)
103 KOG0640 mRNA cleavage stimulat 80.3 24 0.00051 38.6 12.0 207 557-798 192-417 (430)
104 KOG0263 Transcription initiati 80.1 38 0.00083 41.8 15.1 189 398-617 447-649 (707)
105 KOG0649 WD40 repeat protein [G 80.1 88 0.0019 33.4 17.1 187 340-567 22-235 (325)
106 PF04053 Coatomer_WDAD: Coatom 80.0 1.4E+02 0.0031 35.8 21.9 217 692-1012 34-262 (443)
107 KOG0289 mRNA splicing factor [ 79.9 93 0.002 36.0 16.9 111 543-675 309-419 (506)
108 KOG3881 Uncharacterized conser 79.8 6.7 0.00015 44.2 8.1 102 860-981 226-331 (412)
109 KOG1034 Transcriptional repres 79.3 29 0.00064 38.4 12.5 117 881-1012 88-211 (385)
110 KOG0277 Peroxisomal targeting 78.9 99 0.0021 33.3 18.5 55 557-616 36-90 (311)
111 KOG0316 Conserved WD40 repeat- 78.5 97 0.0021 33.0 18.0 171 861-1057 40-214 (307)
112 KOG0641 WD40 repeat protein [G 77.6 96 0.0021 32.4 15.8 55 118-174 93-147 (350)
113 KOG0275 Conserved WD40 repeat- 77.0 17 0.00036 39.7 9.8 239 861-1162 236-492 (508)
114 PF13360 PQQ_2: PQQ-like domai 76.9 1.1E+02 0.0024 32.8 17.7 170 861-1057 47-231 (238)
115 PF14783 BBS2_Mid: Ciliary BBS 76.9 66 0.0014 30.2 12.4 92 583-702 1-92 (111)
116 KOG3621 WD40 repeat-containing 76.6 8.3 0.00018 46.8 8.2 145 860-1024 55-207 (726)
117 PHA02713 hypothetical protein; 75.7 57 0.0012 40.5 15.9 151 898-1058 352-533 (557)
118 PF13360 PQQ_2: PQQ-like domai 75.7 1.2E+02 0.0026 32.5 23.6 170 860-1053 3-188 (238)
119 KOG2096 WD40 repeat protein [G 75.5 1.4E+02 0.003 33.2 19.8 230 539-795 86-349 (420)
120 KOG1645 RING-finger-containing 75.3 1.5E+02 0.0032 34.2 16.7 42 861-907 412-453 (463)
121 PF12894 Apc4_WD40: Anaphase-p 75.2 6.4 0.00014 30.7 4.6 38 574-617 4-41 (47)
122 KOG0288 WD40 repeat protein Ti 75.1 24 0.00053 40.2 10.8 91 861-967 364-458 (459)
123 PHA03098 kelch-like protein; P 74.9 64 0.0014 39.9 16.2 134 898-1041 343-491 (534)
124 KOG0286 G-protein beta subunit 74.5 1.4E+02 0.0031 32.8 28.6 280 582-967 56-342 (343)
125 PF00780 CNH: CNH domain; Int 74.2 1.5E+02 0.0032 32.9 23.9 138 467-619 103-257 (275)
126 PF08553 VID27: VID27 cytoplas 73.9 1.4E+02 0.0031 38.3 18.3 91 518-615 502-604 (794)
127 PTZ00421 coronin; Provisional 73.5 2.2E+02 0.0049 34.7 27.9 114 541-675 77-198 (493)
128 KOG4441 Proteins containing BT 73.4 41 0.00088 41.9 13.6 141 898-1048 381-533 (571)
129 KOG0273 Beta-transducin family 73.1 2E+02 0.0042 33.8 18.0 88 861-970 299-389 (524)
130 PF14783 BBS2_Mid: Ciliary BBS 73.1 83 0.0018 29.6 13.7 91 515-613 20-110 (111)
131 KOG1240 Protein kinase contain 73.0 97 0.0021 40.6 16.3 128 531-676 1091-1226(1431)
132 PF15390 DUF4613: Domain of un 72.5 20 0.00042 43.0 9.7 91 896-1000 122-219 (671)
133 PF14779 BBS1: Ciliary BBSome 72.2 26 0.00055 38.2 9.9 78 581-671 176-254 (257)
134 PHA02713 hypothetical protein; 72.0 1.2E+02 0.0026 37.8 17.4 153 898-1059 304-489 (557)
135 KOG0268 Sof1-like rRNA process 70.7 34 0.00074 38.4 10.4 113 936-1056 140-259 (433)
136 PF08662 eIF2A: Eukaryotic tra 70.5 1.3E+02 0.0029 31.4 15.0 127 899-1036 18-152 (194)
137 KOG0305 Anaphase promoting com 70.2 79 0.0017 37.9 14.2 91 860-970 197-288 (484)
138 KOG0649 WD40 repeat protein [G 70.2 1.5E+02 0.0031 31.8 14.2 146 896-1057 20-187 (325)
139 KOG4378 Nuclear protein COP1 [ 70.0 1.2E+02 0.0026 35.7 14.8 51 746-797 132-184 (673)
140 TIGR03300 assembly_YfgL outer 69.4 2.3E+02 0.005 33.1 21.4 168 861-1055 156-337 (377)
141 KOG1517 Guanine nucleotide bin 68.3 3.1E+02 0.0066 35.8 18.8 136 520-675 1231-1381(1387)
142 KOG4328 WD40 protein [Function 68.1 2.5E+02 0.0053 33.0 21.7 32 581-617 234-265 (498)
143 PF08662 eIF2A: Eukaryotic tra 67.4 1.7E+02 0.0036 30.7 15.6 136 757-976 39-185 (194)
144 KOG0640 mRNA cleavage stimulat 67.1 20 0.00043 39.2 7.6 154 861-1032 239-404 (430)
145 KOG0293 WD40 repeat-containing 67.1 2.4E+02 0.0053 32.5 17.1 142 471-623 326-476 (519)
146 KOG2394 WD40 protein DMR-N9 [G 65.6 86 0.0019 37.2 12.7 78 953-1035 301-382 (636)
147 KOG0650 WD40 repeat nucleolar 64.9 3.2E+02 0.007 33.2 18.1 49 557-613 420-468 (733)
148 KOG1332 Vesicle coat complex C 64.5 78 0.0017 33.8 11.1 112 549-677 23-136 (299)
149 TIGR03300 assembly_YfgL outer 63.8 2.9E+02 0.0063 32.2 26.7 121 522-672 253-376 (377)
150 KOG0288 WD40 repeat protein Ti 63.0 95 0.0021 35.7 12.1 94 520-623 322-422 (459)
151 KOG1188 WD40 repeat protein [G 62.6 1.7E+02 0.0037 32.9 13.7 176 599-794 40-230 (376)
152 KOG1407 WD40 repeat protein [F 62.3 1.4E+02 0.0031 32.3 12.6 126 861-1011 88-218 (313)
153 KOG4441 Proteins containing BT 62.0 2E+02 0.0044 35.8 16.5 185 859-1058 300-499 (571)
154 TIGR02658 TTQ_MADH_Hv methylam 61.8 3.1E+02 0.0066 31.8 29.2 29 767-795 288-317 (352)
155 KOG1332 Vesicle coat complex C 61.5 66 0.0014 34.4 9.9 96 919-1015 32-137 (299)
156 PF00400 WD40: WD domain, G-be 58.9 17 0.00037 26.4 4.0 29 581-615 11-39 (39)
157 KOG0284 Polyadenylation factor 58.4 2.3E+02 0.0049 32.8 14.0 246 658-1008 109-376 (464)
158 PF14761 HPS3_N: Hermansky-Pud 58.3 37 0.00081 35.8 7.7 64 941-1004 132-207 (215)
159 PF14781 BBS2_N: Ciliary BBSom 57.3 36 0.00078 33.0 6.7 76 881-973 47-128 (136)
160 PF12341 DUF3639: Protein of u 57.0 17 0.00036 24.7 3.1 26 582-615 2-27 (27)
161 PHA03098 kelch-like protein; P 56.7 1.2E+02 0.0026 37.5 13.5 136 898-1042 295-442 (534)
162 KOG2079 Vacuolar assembly/sort 56.6 3.1E+02 0.0068 35.8 16.2 107 547-675 97-205 (1206)
163 KOG0294 WD40 repeat-containing 56.5 3.3E+02 0.0071 30.5 23.4 86 921-1012 190-281 (362)
164 KOG0645 WD40 repeat protein [G 55.1 3.2E+02 0.0069 29.9 16.1 106 550-675 74-180 (312)
165 KOG1240 Protein kinase contain 54.7 6.7E+02 0.015 33.5 19.4 68 599-677 1061-1130(1431)
166 KOG0286 G-protein beta subunit 54.2 3.4E+02 0.0074 30.0 16.4 110 943-1055 55-173 (343)
167 PF00325 Crp: Bacterial regula 53.9 21 0.00046 25.3 3.4 26 1186-1211 4-29 (32)
168 KOG1188 WD40 repeat protein [G 52.1 3.2E+02 0.0069 30.9 13.6 185 898-1129 40-241 (376)
169 KOG0279 G protein beta subunit 49.6 3.9E+02 0.0086 29.4 25.1 89 919-1011 213-312 (315)
170 PF08553 VID27: VID27 cytoplas 49.0 49 0.0011 42.2 8.0 86 294-412 542-628 (794)
171 KOG0301 Phospholipase A2-activ 48.6 1.9E+02 0.0042 35.6 12.2 108 942-1058 178-290 (745)
172 KOG3617 WD40 and TPR repeat-co 47.3 92 0.002 39.0 9.4 118 875-1012 8-131 (1416)
173 KOG1587 Cytoplasmic dynein int 47.0 1.3E+02 0.0029 37.0 11.0 114 941-1058 396-518 (555)
174 TIGR03548 mutarot_permut cycli 47.0 4.2E+02 0.0092 30.1 15.1 119 898-1023 73-203 (323)
175 KOG0643 Translation initiation 46.7 4.3E+02 0.0093 29.0 15.6 116 861-994 75-200 (327)
176 KOG0772 Uncharacterized conser 45.6 6.1E+02 0.013 30.4 23.3 62 896-967 419-484 (641)
177 KOG0268 Sof1-like rRNA process 45.0 2.3E+02 0.0051 32.2 11.3 167 861-1056 168-345 (433)
178 KOG0303 Actin-binding protein 44.8 2.7E+02 0.0059 32.0 11.9 100 942-1054 80-201 (472)
179 PF12234 Rav1p_C: RAVE protein 44.7 83 0.0018 39.1 8.9 95 860-969 51-155 (631)
180 PHA02790 Kelch-like protein; P 44.6 3.4E+02 0.0073 33.1 14.4 137 898-1044 272-414 (480)
181 KOG3914 WD repeat protein WDR4 44.0 3.3E+02 0.0071 31.4 12.5 145 895-1058 71-225 (390)
182 KOG0302 Ribosome Assembly prot 42.6 3E+02 0.0064 31.5 11.7 93 861-970 281-378 (440)
183 PF08450 SGL: SMP-30/Gluconola 42.0 4.8E+02 0.01 28.2 22.5 114 941-1057 38-165 (246)
184 KOG0272 U4/U6 small nuclear ri 41.6 6.3E+02 0.014 29.5 19.5 73 540-619 176-251 (459)
185 KOG0271 Notchless-like WD40 re 41.1 1.4E+02 0.003 34.1 8.9 121 860-998 179-302 (480)
186 KOG0269 WD40 repeat-containing 40.6 1.1E+02 0.0023 38.1 8.7 134 778-991 134-270 (839)
187 KOG0306 WD40-repeat-containing 40.5 8.6E+02 0.019 30.7 41.8 173 860-1053 434-619 (888)
188 KOG3621 WD40 repeat-containing 40.4 2.3E+02 0.0051 35.0 11.3 118 538-676 34-155 (726)
189 cd00216 PQQ_DH Dehydrogenases 39.8 7.8E+02 0.017 30.0 19.2 62 857-930 253-321 (488)
190 PF11768 DUF3312: Protein of u 39.5 7.9E+02 0.017 30.0 31.2 101 767-949 249-350 (545)
191 PF14779 BBS1: Ciliary BBSome 39.4 1.2E+02 0.0026 33.1 8.2 70 882-967 177-255 (257)
192 KOG0293 WD40 repeat-containing 38.4 7E+02 0.015 29.1 20.0 273 455-764 225-514 (519)
193 KOG4328 WD40 protein [Function 36.8 6.3E+02 0.014 29.8 13.4 99 860-978 257-359 (498)
194 KOG0771 Prolactin regulatory e 36.5 2E+02 0.0044 33.1 9.6 31 581-617 281-311 (398)
195 KOG4547 WD40 repeat-containing 35.9 8.8E+02 0.019 29.5 17.7 134 860-1012 15-172 (541)
196 KOG0271 Notchless-like WD40 re 35.6 7.4E+02 0.016 28.6 26.8 72 537-616 113-186 (480)
197 KOG0295 WD40 repeat-containing 35.3 2.8E+02 0.0061 31.6 10.1 63 598-676 303-365 (406)
198 KOG0265 U5 snRNP-specific prot 35.3 6.7E+02 0.015 28.0 15.9 143 896-1060 101-250 (338)
199 KOG1900 Nuclear pore complex, 34.9 2.1E+02 0.0046 38.1 10.5 102 953-1057 90-208 (1311)
200 KOG1445 Tumor-specific antigen 34.4 3.9E+02 0.0085 32.6 11.6 71 546-622 588-662 (1012)
201 KOG0285 Pleiotropic regulator 34.1 7.6E+02 0.016 28.2 15.7 137 919-1061 298-444 (460)
202 KOG2445 Nuclear pore complex c 33.6 1.4E+02 0.0031 33.0 7.5 65 27-92 184-256 (361)
203 KOG0321 WD40 repeat-containing 33.5 4.4E+02 0.0095 32.4 11.9 139 919-1057 73-249 (720)
204 COG1654 BirA Biotin operon rep 33.1 51 0.0011 28.9 3.4 26 1185-1210 20-45 (79)
205 KOG1408 WD40 repeat protein [F 32.4 1.1E+03 0.024 29.6 34.7 96 958-1057 614-714 (1080)
206 COG5276 Uncharacterized conser 32.2 7.6E+02 0.016 27.6 14.7 132 924-1057 65-200 (370)
207 KOG0280 Uncharacterized conser 31.6 3.9E+02 0.0084 29.7 10.2 102 553-675 89-196 (339)
208 KOG0270 WD40 repeat-containing 31.6 2.8E+02 0.0061 32.3 9.7 81 529-617 321-404 (463)
209 KOG0275 Conserved WD40 repeat- 31.5 4E+02 0.0087 29.6 10.3 127 529-675 339-467 (508)
210 COG5170 CDC55 Serine/threonine 30.4 70 0.0015 35.2 4.5 61 919-979 47-126 (460)
211 KOG0303 Actin-binding protein 30.0 9.3E+02 0.02 28.0 15.2 74 543-622 38-116 (472)
212 PLN02153 epithiospecifier prot 29.7 9E+02 0.019 27.7 15.8 136 898-1042 86-256 (341)
213 PF08728 CRT10: CRT10; InterP 28.9 1.5E+02 0.0033 37.3 7.7 63 994-1056 177-246 (717)
214 KOG1445 Tumor-specific antigen 28.8 2.5E+02 0.0055 34.1 8.9 63 549-617 640-708 (1012)
215 KOG0643 Translation initiation 28.8 8.2E+02 0.018 26.9 21.6 93 898-1005 159-254 (327)
216 TIGR03547 muta_rot_YjhT mutatr 28.4 5.6E+02 0.012 29.4 12.3 127 860-1000 168-328 (346)
217 PTZ00420 coronin; Provisional 28.3 1.2E+03 0.027 28.9 30.9 68 545-617 34-105 (568)
218 KOG0973 Histone transcription 27.9 3.6E+02 0.0078 35.0 10.7 118 16-143 70-200 (942)
219 TIGR03548 mutarot_permut cycli 27.5 5.4E+02 0.012 29.2 11.8 134 898-1040 124-306 (323)
220 KOG1408 WD40 repeat protein [F 27.4 1.1E+03 0.024 29.6 13.9 118 540-676 548-672 (1080)
221 KOG1587 Cytoplasmic dynein int 26.7 1.3E+03 0.028 28.6 18.6 75 541-617 244-323 (555)
222 KOG4283 Transcription-coupled 26.5 9.3E+02 0.02 26.8 16.3 91 861-970 125-219 (397)
223 PHA02790 Kelch-like protein; P 26.3 1.2E+03 0.027 28.2 16.2 91 947-1043 356-452 (480)
224 KOG3679 Predicted coiled-coil 25.9 3.6E+02 0.0077 30.3 9.0 62 703-764 211-274 (802)
225 TIGR03866 PQQ_ABC_repeats PQQ- 25.3 9E+02 0.02 26.3 34.7 44 749-795 86-132 (300)
226 PF13412 HTH_24: Winged helix- 25.2 70 0.0015 24.7 2.7 38 1174-1211 7-44 (48)
227 KOG1354 Serine/threonine prote 25.2 2.2E+02 0.0048 32.1 7.2 66 898-975 37-121 (433)
228 KOG1524 WD40 repeat-containing 24.4 1.3E+03 0.029 27.9 24.3 149 420-591 77-237 (737)
229 PLN02153 epithiospecifier prot 24.4 7E+02 0.015 28.6 12.1 123 898-1023 138-293 (341)
230 KOG0264 Nucleosome remodeling 24.3 8.7E+02 0.019 28.5 12.0 78 581-677 272-349 (422)
231 PF12894 Apc4_WD40: Anaphase-p 24.2 1.2E+02 0.0026 23.6 3.7 41 49-93 2-42 (47)
232 PF15390 DUF4613: Domain of un 24.2 4.3E+02 0.0092 32.4 9.7 87 755-907 316-406 (671)
233 PF04053 Coatomer_WDAD: Coatom 24.1 4.4E+02 0.0095 31.7 10.3 57 953-1014 116-175 (443)
234 PF02239 Cytochrom_D1: Cytochr 24.1 1.2E+03 0.026 27.2 18.4 29 768-796 162-190 (369)
235 TIGR02276 beta_rpt_yvtn 40-res 23.7 2.7E+02 0.0058 20.2 5.7 36 748-786 4-42 (42)
236 KOG0771 Prolactin regulatory e 23.3 2.9E+02 0.0062 32.0 7.9 135 24-173 195-334 (398)
237 KOG1272 WD40-repeat-containing 22.7 2.3E+02 0.005 33.3 7.0 88 859-968 272-360 (545)
238 COG5276 Uncharacterized conser 22.7 1.1E+03 0.024 26.4 19.4 141 541-719 88-231 (370)
239 KOG0313 Microtubule binding pr 22.6 1.2E+03 0.027 26.9 21.6 67 598-679 114-181 (423)
240 KOG0302 Ribosome Assembly prot 22.5 1.2E+03 0.027 26.9 13.0 137 918-1059 232-381 (440)
241 KOG0281 Beta-TrCP (transducin 22.3 2.5E+02 0.0054 31.6 6.9 105 942-1057 236-349 (499)
242 PF13545 HTH_Crp_2: Crp-like h 22.3 1.1E+02 0.0023 26.3 3.6 26 1186-1211 30-55 (76)
243 KOG0308 Conserved WD40 repeat- 21.7 1.6E+03 0.035 27.9 15.1 114 941-1057 115-244 (735)
244 PF08309 LVIVD: LVIVD repeat; 21.6 3E+02 0.0066 20.9 5.3 27 986-1012 3-29 (42)
245 PF06977 SdiA-regulated: SdiA- 21.6 1.1E+03 0.024 25.8 18.2 191 580-795 20-239 (248)
246 KOG1063 RNA polymerase II elon 21.0 7.2E+02 0.016 30.9 10.8 66 551-617 632-699 (764)
247 PF08220 HTH_DeoR: DeoR-like h 20.9 1.2E+02 0.0025 24.7 3.3 25 1187-1211 17-41 (57)
248 KOG2395 Protein involved in va 20.8 9.5E+02 0.021 29.1 11.5 81 549-641 394-478 (644)
249 PF13404 HTH_AsnC-type: AsnC-t 20.5 71 0.0015 24.2 1.7 35 1175-1209 8-42 (42)
250 KOG1063 RNA polymerase II elon 20.2 1.3E+03 0.029 28.8 12.8 68 599-676 630-700 (764)
251 KOG0308 Conserved WD40 repeat- 20.1 7.4E+02 0.016 30.6 10.6 110 943-1056 171-285 (735)
252 PF08279 HTH_11: HTH domain; 20.0 1.3E+02 0.0027 24.0 3.3 26 1185-1210 16-41 (55)
No 1
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=100.00 E-value=5.6e-171 Score=1470.78 Aligned_cols=1072 Identities=34% Similarity=0.521 Sum_probs=956.2
Q ss_pred CeEEEEEeeCCCceeEEEEEEecCCCCceEEEEeCCEEEEEeecCCCCeEEEEEEEeeeeeeEeeEEeeCCCCeeEEEEE
Q 000944 1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVG 80 (1213)
Q Consensus 1 m~~y~~t~~~pt~v~~~v~~~f~~~~~~~LVv~k~~~Levy~i~~~g~L~~v~~~~l~g~I~~i~~~r~~~~~~d~L~v~ 80 (1213)
|+.|+.|+++||+|.+|+.|||+++...||+|||+|.||+|.++++| |+.+.+.|+||+|..|+.+||++.++|+|+|.
T Consensus 1 ~~~Y~vtaqkpT~V~~av~gnFts~e~~nlivAk~~~lei~~~~~~G-Lq~i~sv~ifg~I~~i~~fRp~g~~kD~LfV~ 79 (1096)
T KOG1897|consen 1 SMNYVVTAQKPTAVVTAVVGNFTSPENLNLIVAKGNRLEILLVEPNG-LQPITSVPIFGTIATIALFRPPGSDKDYLFVA 79 (1096)
T ss_pred CeeEEEEecCCceEeEEEeecccCccceeeeeeccceEEEEeecccc-ceeeEeeccceeEEEEEeecCCCCCcceEEEE
Confidence 78999999999999999999999999999999999999999999998 99999999999999999999999999999999
Q ss_pred eccceEEEEEEeCCCCcEeEEee-eeccccCcccccCCceEEECCCCCEEEEEecccceEEEEEecCCCCc-eeeecccc
Q 000944 81 SDSGRIVILEYNPSKNVFDKIHQ-ETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAAR-LTISSPLE 158 (1213)
Q Consensus 81 ~~~~~l~il~~d~~~~~~~tis~-~~~~~~g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~~~~~~~~-~~~~~p~e 158 (1213)
|+++++++|+||....+.++..+ ...+|.| |+..+|++++|||.+|.|++++|+|.+.|+|+.+++.-. -.....+.
T Consensus 80 t~~~~~~iL~~d~~~~~vv~~a~~~v~dr~g-r~s~~g~~~~VDp~~R~Igl~~yqgl~~vIp~d~~~sht~~s~l~~fn 158 (1096)
T KOG1897|consen 80 TDSYRYFILEWDEESIQVVTRAHGDVSDRSG-RPSDNGQILLVDPKGRVIGLHLYQGLFKVIPIDSDESHTGGSLLKAFN 158 (1096)
T ss_pred ECcceEEEEEEccccceEEEEeccccccccc-ccCCCceEEEECCCCcEEEEEeecCeEEEEEecccccccCcccccccc
Confidence 99999999999985445555544 3456666 888999999999999999999999999999997652100 00000000
Q ss_pred ccccccEEEEeeeeccCCCCcEEEEEEeeccccccCcchhccccccceEEEEEEEcCCceeee-eeeeccCCCcceEEec
Q 000944 159 AHKSHTIVYSICGIDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSR-KWSEPVDNGANMLVTV 237 (1213)
Q Consensus 159 ~~~~~~~i~~~~fl~~~~~~p~~a~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~lp~~~~~lipl 237 (1213)
.....-+++||+||+ +..+|++|+||.+ . ..+|+++|++|+....+.+ .|+.++..++..+||+
T Consensus 159 ~rfdel~v~Di~fly-~~s~pt~~vly~D---s-----------~~~Hv~~yelnl~~ke~~~~~w~~~v~~~a~~li~V 223 (1096)
T KOG1897|consen 159 VRFDELNVYDIKFLY-GCSDPTLAVLYKD---S-----------DGRHVKTYELNLRDKEFVKGPWSNNVDNGASMLIPV 223 (1096)
T ss_pred cccCcceEEEEEEEc-CCCCCceEEEEEc---C-----------CCcEEEEEEeccchhhccccccccccccCCceeeec
Confidence 001123799999998 7789999999543 2 1689999999998665544 4998899999999999
Q ss_pred CCCCCCCCeEEEEeeceEEEEeCCCCceeeecCCCCCCCCCcceEEEEEEEEEecCceEEEEEeCCCCEEEEEEEeCCce
Q 000944 238 PGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEH 317 (1213)
Q Consensus 238 p~~~~~~~GvLv~~~~~i~y~~~~~~~~~~~~p~~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~G~l~~l~l~~~~~~ 317 (1213)
|.+. |||||+|++.|.|.+++....- .|.. ..+..++||..+.. +..+||++|++|+||++.+...+.+
T Consensus 224 P~~~---gGvlV~ge~~I~Y~~~~~~~ai--~p~~-----~~~~t~~~~~~v~~-~~~~yLl~d~~G~Lf~l~l~~~~e~ 292 (1096)
T KOG1897|consen 224 PSPI---GGVLVIGEEFIVYMSGDNFVAI--APLT-----AEQSTIVCYGRVDL-QGSRYLLGDEDGMLFKLLLSHTGET 292 (1096)
T ss_pred CCCC---ceEEEEeeeEEEEeeCCceeEe--cccc-----cCCceEEEcccccC-CccEEEEecCCCcEEEEEeeccccc
Confidence 9999 9999999999999998643211 1221 12567899998764 5578999999999999999988887
Q ss_pred eee--eEEEEeCCCCcceeEEEEcCCeEEEEeeeCCeEEEEEeecCCCCCcccccCCccccccCCCceeeccCCcccEEE
Q 000944 318 VSE--LKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVR 395 (1213)
Q Consensus 318 v~~--l~i~~l~~~~~~s~l~~l~~~~lFvgS~~gds~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 395 (1213)
+.+ ++++++|++++++||++|++|+||+||++|||+|+++....+. ..+..+
T Consensus 293 ~s~~~lkve~lge~siassi~~L~ng~lFvGS~~gdSqLi~L~~e~d~--------------------------gsy~~i 346 (1096)
T KOG1897|consen 293 VSGLDLKVEYLGETSIASSINYLDNGVLFVGSRFGDSQLIKLNTEPDV--------------------------GSYVVI 346 (1096)
T ss_pred ccceEEEEEecCCcchhhhhhcccCceEEEeccCCceeeEEccccCCC--------------------------Cchhhh
Confidence 777 8999999999999999999999999999999999999764311 124678
Q ss_pred EEEeccCCcccceEEeccCCCCCCcEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEeeeCCCCCCceEEEEE
Q 000944 396 IEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVS 475 (1213)
Q Consensus 396 ~d~~~n~gPI~D~~~~~~~~~~~~~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS 475 (1213)
+++++|+|||.||++.+...|+++++++|||++|+|+||++|+||++++++++++||+ +++|+++....++++.||++|
T Consensus 347 let~~NLgPI~Dm~Vvd~d~q~q~qivtCsGa~kdgSLRiiRngi~I~e~A~i~l~Gi-kg~w~lk~~v~~~~d~ylvls 425 (1096)
T KOG1897|consen 347 LETFVNLGPIVDMCVVDLDRQGQGQIVTCSGAFKDGSLRIIRNGIGIDELASIDLPGI-KGMWSLKSMVDENYDNYLVLS 425 (1096)
T ss_pred hhhcccccceeeEEEEeccccCCceEEEEeCCCCCCcEEEEecccccceeeEeecCCc-cceeEeeccccccCCcEEEEE
Confidence 9999999999999999988788999999999999999999999999999999999995 999999987778899999999
Q ss_pred ecCceeEEEeccceeeecCCCccCCCCeEEEEeecCCeEEEEeCCcEEEEeCCCceeeeeCCCCccEEEEEecCCEEEEE
Q 000944 476 FNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIA 555 (1213)
Q Consensus 476 ~~~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~~ivQVT~~~i~l~~~~~~~~~~~~~~~~~I~~as~~~~~v~v~ 555 (1213)
|.++|++|.++++++|..+.||.++++||+|+.++++.++|||+++||+++...+..+|.+|.+..|..|+.+..+|+|+
T Consensus 426 f~~eTrvl~i~~e~ee~~~~gf~~~~~Tif~S~i~g~~lvQvTs~~iRl~ss~~~~~~W~~p~~~ti~~~~~n~sqVvvA 505 (1096)
T KOG1897|consen 426 FISETRVLNISEEVEETEDPGFSTDEQTIFCSTINGNQLVQVTSNSIRLVSSAGLRSEWRPPGKITIGVVSANASQVVVA 505 (1096)
T ss_pred eccceEEEEEccceEEeccccccccCceEEEEccCCceEEEEecccEEEEcchhhhhcccCCCceEEEEEeecceEEEEe
Confidence 99999999998889999999999999999999998888999999999999988899999999999999999999999999
Q ss_pred EeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCc
Q 000944 556 LSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPE 635 (1213)
Q Consensus 556 ~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~ 635 (1213)
..++.+++++++..+ |.+..+++++.||+|+++.+.+++...+.+++||+|+..+.++...||..+...+....+..|+
T Consensus 506 ~~~~~l~y~~i~~~~-l~e~~~~~~e~evaCLDisp~~d~~~~s~~~aVG~Ws~~~~~l~~~pd~~~~~~~~l~~~~iPR 584 (1096)
T KOG1897|consen 506 GGGLALFYLEIEDGG-LREVSHKEFEYEVACLDISPLGDAPNKSRLLAVGLWSDISMILTFLPDLILITHEQLSGEIIPR 584 (1096)
T ss_pred cCccEEEEEEeeccc-eeeeeeheecceeEEEecccCCCCCCcceEEEEEeecceEEEEEECCCcceeeeeccCCCccch
Confidence 877899999888655 8999999999999999999998777788999999997766666666996666666666778899
Q ss_pred eeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCccE
Q 000944 636 SLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPW 715 (1213)
Q Consensus 636 Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~ 715 (1213)
|+.+..++. ...||+|+++||.|++|.++..+|++++.|+.++|++|+.|+++...+.+++|||++|||
T Consensus 585 SIl~~~~e~-----------d~~yLlvalgdG~l~~fv~d~~tg~lsd~Kk~~lGt~P~~Lr~f~sk~~t~vfa~sdrP~ 653 (1096)
T KOG1897|consen 585 SILLTTFEG-----------DIHYLLVALGDGALLYFVLDINTGQLSDRKKVTLGTQPISLRTFSSKSRTAVFALSDRPT 653 (1096)
T ss_pred heeeEEeec-----------cceEEEEEcCCceEEEEEEEcccceEccccccccCCCCcEEEEEeeCCceEEEEeCCCCE
Confidence 999988763 378999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEeCCeEEEEecCccccceeeccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEE
Q 000944 716 LGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 716 ~i~~~~~~~~~~~~~~~~v~~~~~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~ 795 (1213)
++|+.++++.|+|++.+.+..+|||++.++++++++++++.|+|++++++ +++++|++|++++||||+||+.+.+|.|.
T Consensus 654 viY~~n~kLv~spls~kev~~~c~f~s~a~~d~l~~~~~~~l~i~tid~i-qkl~irtvpl~~~prrI~~q~~sl~~~v~ 732 (1096)
T KOG1897|consen 654 VIYSSNGKLVYSPLSLKEVNHMCPFNSDAYPDSLASANGGALTIGTIDEI-QKLHIRTVPLGESPRRICYQESSLTFGVL 732 (1096)
T ss_pred EEEecCCcEEEeccchHHhhhhcccccccCCceEEEecCCceEEEEecch-hhcceeeecCCCChhheEecccceEEEEE
Confidence 99999999999999999999999999999999999999999999999999 89999999999999999999988889998
Q ss_pred EccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEE
Q 000944 796 ETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCL 875 (1213)
Q Consensus 796 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~ 875 (1213)
+.+.+...+ + .+++.|.++++++|++|+++++.
T Consensus 733 s~r~e~~~~---------------------------------------------~--~~ee~~~s~l~vlD~nTf~vl~~ 765 (1096)
T KOG1897|consen 733 SNRIESSAE---------------------------------------------Y--YGEEYEVSFLRVLDQNTFEVLSS 765 (1096)
T ss_pred ecccccchh---------------------------------------------h--cCCcceEEEEEEecCCceeEEee
Confidence 854322110 0 01246789999999999999999
Q ss_pred EEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCe
Q 000944 876 LELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGR 955 (1213)
Q Consensus 876 ~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ 955 (1213)
++|+++|.+.|+++++|.+ +...|++|||++.++ +|.+|..|||++|++.+ +.+|++++++.++|+|++++.|||+
T Consensus 766 hef~~~E~~~Si~s~~~~~-d~~t~~vVGT~~v~P--de~ep~~GRIivfe~~e-~~~L~~v~e~~v~Gav~aL~~fngk 841 (1096)
T KOG1897|consen 766 HEFERNETALSIISCKFTD-DPNTYYVVGTGLVYP--DENEPVNGRIIVFEFEE-LNSLELVAETVVKGAVYALVEFNGK 841 (1096)
T ss_pred ccccccceeeeeeeeeecC-CCceEEEEEEEeecc--CCCCcccceEEEEEEec-CCceeeeeeeeeccceeehhhhCCe
Confidence 9999999999999999987 447999999999999 79999999999999999 4599999999999999999999999
Q ss_pred EEEEeCCeEEEEecCCce-eeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEE
Q 000944 956 LLAGIGPVLRLYDLGKKR-LLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAH 1034 (1213)
Q Consensus 956 ll~~~g~~l~i~~~~~~~-l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~ 1034 (1213)
|+||+|++|.+|+|..++ |...|... .|+++.+|++.+|+|+|||+|+|+++++|+.+++.|+++|||++|+|+++++
T Consensus 842 llA~In~~vrLye~t~~~eLr~e~~~~-~~~~aL~l~v~gdeI~VgDlm~Sitll~y~~~eg~f~evArD~~p~Wmtave 920 (1096)
T KOG1897|consen 842 LLAGINQSVRLYEWTTERELRIECNIS-NPIIALDLQVKGDEIAVGDLMRSITLLQYKGDEGNFEEVARDYNPNWMTAVE 920 (1096)
T ss_pred EEEecCcEEEEEEccccceehhhhccc-CCeEEEEEEecCcEEEEeeccceEEEEEEeccCCceEEeehhhCccceeeEE
Confidence 999999999999999774 55556665 6889999999999999999999999999999999999999999999999999
Q ss_pred eecCCeeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCC-----
Q 000944 1035 HIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPG----- 1109 (1213)
Q Consensus 1035 ~ld~~~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~----- 1109 (1213)
++|+++++++|++||+++++++.+.+++ + ++++|...+.||+|+.|++|+++++.+.
T Consensus 921 il~~d~ylgae~~gNlf~v~~d~~~~td--~----------------eR~~l~~~~~~hlGelvn~f~hg~lv~~~~~s~ 982 (1096)
T KOG1897|consen 921 ILDDDTYLGAENSGNLFTVRKDSDATTD--E----------------ERQILEEVGKFHLGELVNKFRHGSLVMQLGDSM 982 (1096)
T ss_pred EecCceEEeecccccEEEEEecCCCCch--h----------------hhhcccceeeEEeccceeeeeecceEeeccccc
Confidence 9999999999999999999999877654 2 2689999999999999999999987653
Q ss_pred --CccEEEEEecccceEEEEecCChhHHHHHHHHHHHHHhcCCCCcCCCccccccccC-----CCCceeechhhhhcccC
Q 000944 1110 --GGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYF-----PVKDVIDGDLCEQFPTL 1182 (1213)
Q Consensus 1110 --~~~~i~~~t~~Gsig~l~~l~~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~-----p~~~~iDGdll~~fl~l 1182 (1213)
..+.++|||++|+||.+..+ ..+.+.+|..||++|++..+++||++|.+||+++. |++|||||||+|+|++|
T Consensus 983 ~~~~~~vlfgTv~GsIG~i~sl-~~d~~~fL~~Lq~~irk~i~s~gglsH~~yrsf~~e~~~~P~~gfIDGDLiEsfl~l 1061 (1096)
T KOG1897|consen 983 IPLEPKVLFGTVNGSIGIIVSL-PQDWYDFLEELQRRIRKVIKSVGGLSHMDYRSFEFEKRTSPVKGFIDGDLIESFLDL 1061 (1096)
T ss_pred cCCCCcEEEEEccceEEEEEec-CcchhHHHHHHHHHHHHhhcccCCcchhhHhhhhcccccCCCcCcccchHHHhhhcc
Confidence 24679999999999999999 57789999999999999999999999999999875 99999999999999999
Q ss_pred CHHHHHHHHHHcCCC-----HHHHHHHHHHHHhc
Q 000944 1183 SLDLQRKIADELDRT-----PGEILKKLEEIRNK 1211 (1213)
Q Consensus 1183 ~~~~q~~i~~~~~~~-----~~~i~~~l~~l~~~ 1211 (1213)
+.+.+.+|++++..+ +++|++.+|+|++.
T Consensus 1062 ~~~~~~~i~~~~~~~~~~~s~~el~k~vEel~rl 1095 (1096)
T KOG1897|consen 1062 SRSKMREIVRGLEHTESLASVQELLKIVEELTRL 1095 (1096)
T ss_pred CHHHHHHHHhhcccccccCCHHHHHHHHHHHHhc
Confidence 999999999999766 99999999999874
No 2
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=100.00 E-value=1.7e-166 Score=1437.87 Aligned_cols=1197 Identities=60% Similarity=0.988 Sum_probs=1096.8
Q ss_pred CeEEEEEeeCCCceeEEEEEEecCCCCceEEEEeCCEEEEEeecCC-CCeEEEEEEEeeeeeeEeeEEeeCCCCeeEEEE
Q 000944 1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENS-GRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVV 79 (1213)
Q Consensus 1 m~~y~~t~~~pt~v~~~v~~~f~~~~~~~LVv~k~~~Levy~i~~~-g~L~~v~~~~l~g~I~~i~~~r~~~~~~d~L~v 79 (1213)
||+|+.|++.+|+|.+++.|+|.+++..++++++++.|++|+++++ |+++.++++.+||+|++++++|.++..+|+|+|
T Consensus 1 m~lysltlq~~t~i~~~~~g~fs~~k~qeIv~~~~s~l~L~~~d~~~G~l~~i~~~~vFg~Irsla~~~lt~~~kD~LaV 80 (1205)
T KOG1898|consen 1 MFLYSLTLQNQTGIVQAIYGNFSGPKAQEIVLGRGSILELYRIDENDGRLKTICRQEVFGTIRSLAAFRLTGGTKDYLAV 80 (1205)
T ss_pred CchhhhhhhcccceeeeehhhccCCchheEEEEeeeEEEEEEecCCCceEEEEEEEeehhhhhhhhccccCCCCccEEEE
Confidence 8999999999999999999999999999999999999999999965 999999999999999999999999999999999
Q ss_pred EeccceEEEEEEeCCCCcEeEEeeeeccccCcccccCCceEEECCCCCEEEEEecccceEEEEEecCCCCceeeeccccc
Q 000944 80 GSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEA 159 (1213)
Q Consensus 80 ~~~~~~l~il~~d~~~~~~~tis~~~~~~~g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~~~~~~~~~~~~~p~e~ 159 (1213)
++|+|+++|++|+.+...|+++++++++++|.++..||.|+++||.|||+++++.+++++||.++|+..+++++++|+|+
T Consensus 81 ~SDSGri~il~y~~ek~~~~~~~qetfGks~~rrivpG~y~~idp~Gra~misave~~kLvyvlnrD~~a~ltisSplea 160 (1205)
T KOG1898|consen 81 GSDSGRISILEYNNEKNHFEKLHQETFGKSGCRRIVPGQYLAIDPKGRAVMISAVEKQKLVYVLNRDGAARLTISSPLEA 160 (1205)
T ss_pred EcCCceEEEEEechhhhccccccccccCcccceEeccccEEEEcCCccceeeehhhcCcEEEEEccchhhhceecCchhh
Confidence 99999999999999988899999999999999999999999999999999999999999999999999889999999999
Q ss_pred cccccEEEEeeeeccCCCCcEEEEEEeeccccccCcchhccccccceEEEEEEEcCCceeeeeeeeccCCCcceEEecCC
Q 000944 160 HKSHTIVYSICGIDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239 (1213)
Q Consensus 160 ~~~~~~i~~~~fl~~~~~~p~~a~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~~~~~liplp~ 239 (1213)
|+.++.+++|+.+|.||.||++|.|+.+|.++.++++|.+.....+++++|++++++|++.|+|+.+++...+.+++||.
T Consensus 161 hk~~sic~~l~~Vd~gf~np~fa~LE~dy~~a~~d~tgeaa~~~~~~l~fYeldlglnhvvrk~s~p~~~~~n~l~~VP~ 240 (1205)
T KOG1898|consen 161 HKAHSICLDLVGVDVGFENPIFAALERDYSEADNDPTGEAATMTQKVLTFYELDLGLNHVVRKASEPVNHFGNFLLTVPG 240 (1205)
T ss_pred ccCCcEEEEEEEEeccCCCceEEEEeechhhcccCchhhhhhccccceeEEEEecccceeEEEcccccCCCceEEEEecC
Confidence 99999999999999999999999999999999999999999888999999999999999999999999999999999999
Q ss_pred CCCCCCeEEEEeeceEEEEeCC-CCceeeecCCCCCCCCC--cceEEEEEEEEEecCceEEEEEeCCCCEEEEEEEeCCc
Q 000944 240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRADLPAE--RGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE 316 (1213)
Q Consensus 240 ~~~~~~GvLv~~~~~i~y~~~~-~~~~~~~~p~~~~~~~~--~~~~i~~~~~~~~~~~~~~ll~~~~G~l~~l~l~~~~~ 316 (1213)
..+++.|++|+.+|.+.|.+.. .|.+.+++|+|...+.+ +...+.+.+.+..+++.+++++.++|++|++++..|++
T Consensus 241 G~D~ps~v~vc~~n~~~y~~~~d~p~~ri~~~rr~~~L~~~~~~vliv~s~~hk~k~~ff~llqt~~GD~fk~tl~~d~d 320 (1205)
T KOG1898|consen 241 GSDGPSGVLVCAENYLLYRNLGDHPDVRIPIERRINELSDAEDGVLIVSSAEHKTKSMFFFLLQTEYGDLFKLTLEHDGD 320 (1205)
T ss_pred CCCCCcceEEecCceeeccccccCCCEEeccccccccCCccccccEEEEeecccccCCeEEEEEecCCceEEEEEecCCC
Confidence 8888899999999999999987 78889999987654432 45566555555667889999999999999999999999
Q ss_pred eeeeeEEEEeCCCCcceeEEEEcCCeEEEEeeeCCeEEEEEeecCCCCCcccccCCccccccCCCceeeccCCcccEEEE
Q 000944 317 HVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRI 396 (1213)
Q Consensus 317 ~v~~l~i~~l~~~~~~s~l~~l~~~~lFvgS~~gds~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 396 (1213)
.+..+++.|++++|.+..|+++++|+||++|++||+.||++..++.++|+ -++.|+.+++ +...|.|+.+.++..+
T Consensus 321 ~v~el~lkYfDtvp~a~~L~I~k~GfLf~~sE~~n~~lyq~~~LG~~~~~---~s~~~~~~~~-~~~~f~p~~l~nL~~~ 396 (1205)
T KOG1898|consen 321 NVVELRLKYFDTVPCALQLCILKTGFLFVASEFGNHRLYQFEKLGEEDDD---FSNAMTSEEG-KSVFFEPRILKNLSPV 396 (1205)
T ss_pred cceeeeeehhcCCccceEEEEeccceEEEhhhccCcceeehhhcCCCccc---hhhhcccccC-cceeccccccccccch
Confidence 88999999999999999999999999999999999999999999855433 3444555566 7889999999999999
Q ss_pred EEeccCCcccceEEeccCCCCCCcEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEeeeCCCCCCceEEEEEe
Q 000944 397 EQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476 (1213)
Q Consensus 397 d~~~n~gPI~D~~~~~~~~~~~~~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS~ 476 (1213)
+.++|+.|++|+.+++.++++.+||++|||+|++++|++||+|+...+++..++|+.++++|+++....+.+|.||++||
T Consensus 397 ~~i~sl~p~~d~~I~~~~ne~~~qi~~~cg~~~~sslr~lR~gle~sel~~t~lp~~~ta~WTvk~~~td~ydsyivvsF 476 (1205)
T KOG1898|consen 397 SSVESLSPLLDISIGDDSNEDTPQIYSACGRGPRSSLRILRNGLEVSELLVTELPGNPTATWTVKKNITDVYDSYIVVSF 476 (1205)
T ss_pred hhhhccCccceeEeeccCcccchhhhhhhCcCccccchhhccccchHHHhhhccCCCCceEEEEcCccccccceEEEEEe
Confidence 99999999999999999888899999999999999999999999999999999999889999999988889999999999
Q ss_pred cCceeEEEeccceeeecCCCccCCCCeEEEEeecCCeEEEEeCCcEEEEeCCCceeeeeCCCCccEEEEEecCCEEEEEE
Q 000944 477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIAL 556 (1213)
Q Consensus 477 ~~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~~ivQVT~~~i~l~~~~~~~~~~~~~~~~~I~~as~~~~~v~v~~ 556 (1213)
.+.|+|+++|+.++|++++||..+.+||+|+.++++.+|||++++||.+...+++.+|.+|++..|+.++++..+|++++
T Consensus 477 ~n~TlVLsIgesveEvtdsgFls~~~Tl~~~l~Gd~slVQi~~d~iRhi~~~~r~~ew~~P~~~~Iv~~avnr~qiVval 556 (1205)
T KOG1898|consen 477 VNGTLVLSIGESVEEVTDSGFLSTTPTLACSLMGDDSLVQIHPDGIRHIRPTKRINEWKTPERVRIVKCAVNRRQIVVAL 556 (1205)
T ss_pred eccEEEEEcchhHHHhhhcccccCCceEEEEEecCCcEEEEchhhhhhcccccccccccCCCceEEEEEeecceEEEEEc
Confidence 99999999999999999999999999999999999999999999999999989999999999999999999999999999
Q ss_pred eCCEEEEEEEccCCCeEEe-eeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCc
Q 000944 557 SGGELIYFEVDMTGQLLEV-EKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPE 635 (1213)
Q Consensus 557 s~~~l~~l~~~~~~~l~~~-~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~ 635 (1213)
++|+++||+++.+|++.|+ ++.+++.+++|+++.+.+.|.+.++++++|+.++++++++|+|+.+++.++.|.++..|.
T Consensus 557 Sngelvyfe~d~sgql~E~~er~tl~~~vac~ai~~~~~g~krsrfla~a~~d~~vriisL~p~d~l~~ls~q~l~~~~~ 636 (1205)
T KOG1898|consen 557 SNGELVYFEGDVSGQLNEFTERVTLSTDVACLAIGQDPEGEKRSRFLALASVDNMVRIISLDPSDCLQPLSVQGLSSPPE 636 (1205)
T ss_pred cCCeEEEEEeccCccceeeeeeeeeceeehhhccCCCCcchhhcceeeeeccccceeEEEecCcceEEEccccccCCCcc
Confidence 9999999999988999998 889999999999999998888899999999999999999999999999999999999999
Q ss_pred eeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCccE
Q 000944 636 SLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPW 715 (1213)
Q Consensus 636 Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~ 715 (1213)
|++++++....-+ .....||++|++||.|+++.+|...|++.+.++|++|.+||+|.++...+.+.+++.++|||
T Consensus 637 s~~iv~~~~~~~~-----~~~~L~l~~GL~NGvllR~~id~v~G~l~d~rtR~lG~~pvkLf~~~~~~~s~vL~lSsr~w 711 (1205)
T KOG1898|consen 637 SLCIVEMEATGGT-----DVAQLYLLIGLRNGVLLRFVIDTVTGQLLDIRTRFLGLRPVKLFPISMRGQSDVLALSSRPW 711 (1205)
T ss_pred ceEEEEecccCCc-----cceeEEEEecccccEEEEEEecccccceeeeheeeeccccceEEEEeecCcceeEEecCChh
Confidence 9999998753110 12479999999999999999999999999999999999999999998889999999999999
Q ss_pred EEEEeCCeEEEEecCccccceeeccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEE
Q 000944 716 LGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 716 ~i~~~~~~~~~~~~~~~~v~~~~~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~ 795 (1213)
+.|+.++.+.++|++.+.+..++||.+..||.|++++..+.|+|..+++.+..++.+..|++.|||++++||++++++++
T Consensus 712 l~y~~~~~~h~t~Isy~~l~~as~~~S~qcpeGiv~i~~n~l~i~~~~~~g~~~n~~~~~l~~tprkvv~h~es~lLii~ 791 (1205)
T KOG1898|consen 712 LLYTYQQEFHLTPISYSTLEHASPFCSEQCPEGIVAISKNTLRIIALDKLGKVLNVDGFPLAYTPRKVVIHPESGLLIIG 791 (1205)
T ss_pred hhhhhcceeeeecccccchhccccccccCCCcchhhhhhhhhheeeehhhcccccccccccccCcceEEEecCCCeEEEE
Confidence 99999999999999999999999999999999999999999999999998778999999999999999999999999999
Q ss_pred EccCCCCCHHHH--HHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceE
Q 000944 796 ETDQGALTAEER--EAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTT 873 (1213)
Q Consensus 796 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~ 873 (1213)
+++++.+....- .+....... ..++++|++-..+.+. .......++..++.|.++.+ |.|+++++|+.+++.+
T Consensus 792 ~td~~~~~~~~a~~~~~~~g~v~-~s~~~~e~e~g~em~~---~~~~~~~~~~v~~~p~a~~~-w~s~I~~~d~~s~~~~ 866 (1205)
T KOG1898|consen 792 RTDHNATLTKDARKNQMEAGGVL-ESGEEKEDEMGGEMEI---IGREEVLPENVYGSPRAGNG-WVSSIRVFDPKSGKII 866 (1205)
T ss_pred EecccchhhHHHhhhhhhccccc-ccccccchhhccchhh---hccccccccccccCcccccC-ccceEEEEcCCCCceE
Confidence 998877653220 000000000 0112221111001100 00001223345677765433 9999999999999999
Q ss_pred EEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc
Q 000944 874 CLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ 953 (1213)
Q Consensus 874 ~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~ 953 (1213)
+.+++..||...|++.+.|++.+...+++||++.+... +...-+.|++|.|++.++|.+|+++|+++++|+|.|||+|+
T Consensus 867 ~~~~l~~ne~a~~v~~~~fs~~~~~~~~~v~~~~~~~l-~~~~~~~g~~ytyk~~~~g~~lellh~T~~~~~v~Ai~~f~ 945 (1205)
T KOG1898|consen 867 CLVELGQNEAAFSVCAVDFSSSEYQPFVAVGVATTEQL-DSKSISSGFVYTYKFVRNGDKLELLHKTEIPGPVGAICPFQ 945 (1205)
T ss_pred EEEeecCCcchhheeeeeeccCCCceEEEEEeeccccc-cccccCCCceEEEEEEecCceeeeeeccCCCccceEEeccC
Confidence 99999999999999999998866668999999987653 22233789999999999888999999999999999999999
Q ss_pred CeEEEEeCCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEE
Q 000944 954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAA 1033 (1213)
Q Consensus 954 g~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~ 1033 (1213)
|++++|+|+.+++|++.+++|++++....+|.+|+++.+...+|+|||.++|+.+++|++++++|..+|.|+.|||+|++
T Consensus 946 ~~~LagvG~~l~~YdlG~K~lLRk~e~k~~p~~Is~iqt~~~RI~VgD~qeSV~~~~y~~~~n~l~~fadD~~pR~Vt~~ 1025 (1205)
T KOG1898|consen 946 GRVLAGVGRFLRLYDLGKKKLLRKCELKFIPNRISSIQTYGARIVVGDIQESVHFVRYRREDNQLIVFADDPVPRHVTAL 1025 (1205)
T ss_pred CEEEEecccEEEEeeCChHHHHhhhhhccCceEEEEEeecceEEEEeeccceEEEEEEecCCCeEEEEeCCCccceeeEE
Confidence 99999999999999999999999999987799999999999999999999999999999999999999999999999999
Q ss_pred EeecCCeeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccE
Q 000944 1034 HHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGES 1113 (1213)
Q Consensus 1034 ~~ld~~~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~ 1113 (1213)
.++|+++++++|++||+++++.+++..+.++|||++..++|+++.++++.+|.+...+||+||.+++++..++.+++++.
T Consensus 1026 ~~lD~~tvagaDrfGNi~~vR~P~d~~e~~~edpt~~k~~~~~g~lN~~~~K~~~i~~f~v~Dvits~q~~~~i~~a~e~ 1105 (1205)
T KOG1898|consen 1026 ELLDYDTVAGADRFGNIAVVRIPPDVSEEASEDPTELKIAWEQGFLNDAPQKVQLISQFFVGDVITSLQKVSSIPGARES 1105 (1205)
T ss_pred EEecCCceeeccccCcEEEEECCCcchhhhccCCccccceecccccccccHhhhhhhhccccCeeeeceeeeeccCCcce
Confidence 99999999999999999999999999988889999999999999999999999999999999999999999888888999
Q ss_pred EEEEecccceEEEEecCChhHHHHHHHHHHHHHhcCCCCcCCCccccccccCCCCceeechhhhhcccCCHHHHHHHHHH
Q 000944 1114 VIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADE 1193 (1213)
Q Consensus 1114 i~~~t~~Gsig~l~~l~~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~~p~~~~iDGdll~~fl~l~~~~q~~i~~~ 1193 (1213)
++|+|..|+||+++|+.+++++++++.+|..|++..|+++|.+|.+||++++|.|.+||||++|+|+.|+..+|++||+.
T Consensus 1106 ~iy~tl~GtiG~f~p~~s~~d~~Ff~~~e~~~r~e~ppl~GrDH~~yRsyy~Pvk~VIDGDlceqF~~L~~~~Qe~va~e 1185 (1205)
T KOG1898|consen 1106 LIYTTLLGTIGVFAPFLSREDVDFFQHLEMHMRKEYPPLLGRDHLEYRSYYAPVKKVIDGDLCEQFLRLEENQQEEVAEE 1185 (1205)
T ss_pred eeeeeccccceEEeecccccchHHHHHHHHhccccCCcccCcchhhhhhhccchhhcccHHHHHHHhhCCHHHHHHHHhc
Confidence 99999999999999999888999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCHHHHHHHHHHHHhcc
Q 000944 1194 LDRTPGEILKKLEEIRNKI 1212 (1213)
Q Consensus 1194 ~~~~~~~i~~~l~~l~~~~ 1212 (1213)
+++++++|.+.||++|.+.
T Consensus 1186 l~~ti~eI~kkledir~~~ 1204 (1205)
T KOG1898|consen 1186 LDRTIEEISKKLEDIRTRY 1204 (1205)
T ss_pred ccCCHHHHHHHHHHHHhhc
Confidence 9999999999999999875
No 3
>KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification]
Probab=100.00 E-value=2.2e-143 Score=1260.01 Aligned_cols=1135 Identities=21% Similarity=0.305 Sum_probs=899.8
Q ss_pred CeEEEEEeeCCCceeEEEEEEecCCCCceEEEEeCCEEEEEeecC-----------CC------CeEEEEEEEeeeeeeE
Q 000944 1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPEN-----------SG------RIETLVSTEIFGAIRS 63 (1213)
Q Consensus 1 m~~y~~t~~~pt~v~~~v~~~f~~~~~~~LVv~k~~~Levy~i~~-----------~g------~L~~v~~~~l~g~I~~ 63 (1213)
|+...++.++||+|++|+.|+|+....+||||++.|.|+||++.. ++ +|++++++.+||+|.+
T Consensus 1 m~~vykq~h~~T~ve~s~ag~Ft~~~~~nlvV~~~N~L~vyri~~~~e~~t~~~~~~~~~~~~~~LeLv~~~~l~GnV~s 80 (1366)
T KOG1896|consen 1 MFAVYKQEHDPTVVENSSAGLFTNNRTENLVVAGTNILRVYRISRDAEALTKNDPGDMGKAHRKKLELVAEFKLFGNVTS 80 (1366)
T ss_pred CcchhhhccCchhhccceeeeEecCCCcceEEecccEEEEEEeccchhhccccCccccccccceEEEEEEEEEeecceee
Confidence 666677799999999999999999999999999999999999974 11 3999999999999999
Q ss_pred eeEEeeCCCCeeEEEEEeccceEEEEEEeCCCCcEeEEeeeeccccC----cccccCCceEEECCCCCEEEEEecccceE
Q 000944 64 LAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSG----CRRIVPGQYLAVDPKGRAVMIGACEKQKL 139 (1213)
Q Consensus 64 i~~~r~~~~~~d~L~v~~~~~~l~il~~d~~~~~~~tis~~~~~~~g----~~~~~~~~~l~VDP~~r~ia~~~~~~~~~ 139 (1213)
|++++..++++|.|+++|++||+++++||+.++.|.|+|+|+|+.+- ++....+|.++|||++||+++..|...++
T Consensus 81 i~~~~~~gs~rD~LlL~f~~AKiSvlefD~~t~sl~TlSLHyfE~~~~~~~~~~~~~~p~vrvDPdsrCa~llvyg~~m~ 160 (1366)
T KOG1896|consen 81 IAKLPLKGSNRDALLLLFKDAKISVLEFDPQTNSLRTLSLHYFEGPEFRKGLVGRAKIPTVRVDPDSRCALLLVYGLRMA 160 (1366)
T ss_pred EEEeecCCCCcceEEEEeccceEEEEEecCCccceeeeeeEEeccccccccccccccCceEEECCCCCeEEEEEecceEE
Confidence 99999999999999999999999999999999999999999998653 44455688999999999999999999999
Q ss_pred EEEEecCCC---Cce-----eeeccc---------cccccccEEEEeeeeccCCCCcEEEEEEeeccccccCcchhcccc
Q 000944 140 VYVLNRDTA---ARL-----TISSPL---------EAHKSHTIVYSICGIDCGFDNPIFAAIELDYSEADQDSTGQAASE 202 (1213)
Q Consensus 140 v~~~~~~~~---~~~-----~~~~p~---------e~~~~~~~i~~~~fl~~~~~~p~~a~L~~~~~~~~~~~~~~~~~~ 202 (1213)
++|+++++. +++ +.+++. +......+|+|++||+ ||.+||+|+| |+ ++++|.|+..
T Consensus 161 iLpf~~~e~~~~~~~~~~~~~~ss~~~pSyvi~~reLdeki~niiD~qFLh-gY~ePTl~IL---ye-p~~tw~grv~-- 233 (1366)
T KOG1896|consen 161 ILPFRVNEHLDDEELFPSGFSKSSFTAPSYVIALRELDEKIKNIIDFQFLH-GYYEPTLAIL---YE-PEQTWAGRVI-- 233 (1366)
T ss_pred EeeccccccccccccccccccccccccceeEEEhhhhhhhhccceeEEeec-CcccceEEEE---ec-ccccccceEE--
Confidence 999976431 001 011111 1112345799999998 9999999999 54 6678888644
Q ss_pred ccceEEEEEEEcCCceee----eeeee-ccCCCcceEEecCCCCCCCCeEEEEeeceEEEEeCCCCceeeecCCCCC---
Q 000944 203 AQKNLTFYELDLGLNHVS----RKWSE-PVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRAD--- 274 (1213)
Q Consensus 203 ~~~~~~~~~~~~~~~~~~----~~~~~-~lp~~~~~liplp~~~~~~~GvLv~~~~~i~y~~~~~~~~~~~~p~~~~--- 274 (1213)
.++.++.-..+++|.-+ -.|+. .||+||++..++|.|+ ||+||++.|.++|.+++.++.+++++.-..
T Consensus 234 -~r~dt~~~vaisLni~q~~hpVI~sv~sLP~D~~~~~~vp~pi---GgvLv~~~n~~iy~nqsv~~~gv~LNs~a~~~t 309 (1366)
T KOG1896|consen 234 -LRKDTCVLVAISLNITQKVHPVIWSVLSLPFDCYQATAVPTPI---GGVLVFTVNNLIYLNQSVSPYGVALNSYASKYT 309 (1366)
T ss_pred -EecCcEEEEEEEcCccccccceEeeeccCChhhhhceeecccC---ccEEEEeeeeEEEEccCCCceeEEecchhhccc
Confidence 45555554444444333 24764 5999999999999999 999999999999999999988888875322
Q ss_pred -CC--CCcceEE--EEEEEEEecCceEEEEEeCCCCEEEEEEEeC-CceeeeeEEEEeCCCCcceeEEEEcCCeEEEEee
Q 000944 275 -LP--AERGVLI--VSAATHRQKTLFFFLLQTEYGDIFKVTLEHD-NEHVSELKIKYFDTIPVTASMCVLKSGYLFAASE 348 (1213)
Q Consensus 275 -~~--~~~~~~i--~~~~~~~~~~~~~~ll~~~~G~l~~l~l~~~-~~~v~~l~i~~l~~~~~~s~l~~l~~~~lFvgS~ 348 (1213)
.+ ..+...+ .|...... ..+.++++..+|++|+|++..| ++.|..+++..+.....++|++...|+++|+||+
T Consensus 310 ~fpl~~qs~v~i~ld~a~~t~i-~~dk~vis~~~Gd~y~Ltl~~D~~r~V~~~~f~k~~asvl~t~~v~~~n~llFlGSr 388 (1366)
T KOG1896|consen 310 AFPLIPQSGVRIELDCANATWI-SNDKCVISLKNGDLYLLTLILDIGRSVQLLHFDKFKASVLATSIVGHGNNLLFLGSR 388 (1366)
T ss_pred CCccccccceEEEEeeccceee-cCCeEEEecCCCcEEEEEEEeccccchhhhhhhhhhcccceeeeeccCCccEEEEec
Confidence 11 1122222 22211112 3468999999999999999999 5678877777777667889999999999999999
Q ss_pred eCCeEEEEEeecCCC------CCcccccCC-----cc-------cccc--------CC-------CceeeccCCcccEEE
Q 000944 349 FGNHALYQFQAIGAD------PDVEASSST-----LM-------ETEE--------GF-------QPVFFQPRGLKNLVR 395 (1213)
Q Consensus 349 ~gds~l~~~~~~~~~------~~~~~~~~~-----~~-------~~~~--------~~-------~~~~~~~~~~~~l~~ 395 (1213)
.|||.|++|..+.+. .|+.+.++. .+ ++|+ +. .....+ ...+++
T Consensus 389 lgnSlll~~s~~~~~~~e~~~re~~d~~~~~~~~~~~d~~~d~~~~d~~~~~~~~~g~~~~~g~~a~~t~~---~f~fev 465 (1366)
T KOG1896|consen 389 LGNSLLLRFSELLQRASEGVRREEGDTESDGYSKKRVDDTQDVRRDDEKSAELFEAGSEENYGSGAQETVQ---PFSFEV 465 (1366)
T ss_pred CCCEEEEEehhccccCCccccccccCCcCCcchhhcccchhhhhhhhhhccchhhccccccCCcccceeee---eeEEee
Confidence 999999999987541 111111110 01 1110 00 001111 146899
Q ss_pred EEEeccCCcccceEEeccCC----------CCC-CcEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEeeeCC
Q 000944 396 IEQVESLMPIMDMRIANLFE----------EEA-PQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNV 464 (1213)
Q Consensus 396 ~d~~~n~gPI~D~~~~~~~~----------~~~-~~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~ 464 (1213)
+|+++|+|||.||+++.... +.. .++|+|+|+|++|+|+++|+.|+|++.+++++||+ .++|++..+.
T Consensus 466 cDsL~NIGPi~~~avG~~~~~~~~~~gl~~~~~~~elV~~sGhgkngaL~V~r~sI~P~i~t~fel~Gc-~~iWtV~~~~ 544 (1366)
T KOG1896|consen 466 CDSLPNIGPITDFAVGKRSSASEAVEGLSPHNKCLELVATSGHGKNGALSVIRRSIRPEIATEFELPGC-VDIWTVFIKG 544 (1366)
T ss_pred hhccccccccccceeccccchhhhccCCCCCCCeEEEEEeccCCCCcceEEEeecccceeeEEEEecCe-eeEEEEEEec
Confidence 99999999999999986541 112 47999999999999999999999999999999996 9999999853
Q ss_pred -----CCCCceEEEEEecCceeEEEeccceeeecCCCccCCCCeEEEEeecCCe-EEEEeCCcEEEEeCC-CceeeeeCC
Q 000944 465 -----NDEFDAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIRHIRED-GRINEWRTP 537 (1213)
Q Consensus 465 -----~~~~~~~lvlS~~~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~~-ivQVT~~~i~l~~~~-~~~~~~~~~ 537 (1213)
.+..|.|+++|..++|+||+.++++.|++.++|..+.+||++|++++++ +|||||+++|+++.+ ..++.+...
T Consensus 545 ~~~~~~~~~h~~lilS~e~~t~il~tge~~~Ev~~s~f~~~~~Tl~~gnlg~~rriVQVtp~~~rllDg~~r~lq~i~fd 624 (1366)
T KOG1896|consen 545 RKREEDNTQHLYLILSTESRTMILETGEELLEVSGSGFTRDGPTLFAGNLGNERRIVQVTPSGLRLLDGDLRMLQRIPFD 624 (1366)
T ss_pred cccccccCcceEEEeecccchhhhhccchhhhcccceeEeccceEEEEecCCceEEEEEccceeEEecCcchheeEeccc
Confidence 2345899999999999999999999999999999999999999998875 999999999999985 788999888
Q ss_pred CCccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEee---------------------------
Q 000944 538 GKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIA--------------------------- 590 (1213)
Q Consensus 538 ~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~--------------------------- 590 (1213)
.+..+++++++|+||++..++|.+.+|+++++.. .-..+..+...+.++++.
T Consensus 625 ~~~~vv~~sv~dpyv~v~~~~g~i~~~~l~~~s~-rl~~~~~~s~~~~sv~~~~dlsg~f~~~s~l~~k~~~~~gr~~~~ 703 (1366)
T KOG1896|consen 625 SGAIVVQTSVADPYVAVRSSEGRITLYDLEEKSH-RLALHDPMSFKVVSVSLPADLSGMFTTLSDLSLKGNEANGRSSEA 703 (1366)
T ss_pred cCCcEEEEeccCceEEEEEcCCceEEEEeccccc-hhhccCcccceeEEEechhhhccceEEEeeecccCcccccccccc
Confidence 8889999999999999999999999999876410 000000011111111110
Q ss_pred ------e--cC--C-Cc--eeeeEEEEEEeCCcEEEEEeCCCCce-eEeEEee-----c-------C---CCCceeEEEE
Q 000944 591 ------S--VP--E-GR--KRSRFLAVGSYDNTIRILSLDPDDCM-QILSVQS-----V-------S---SPPESLLFLE 641 (1213)
Q Consensus 591 ------~--~~--~-~~--~~~~~l~v~~~~~~i~i~sl~p~~~l-~~~~~~~-----l-------~---~~p~Sl~~~~ 641 (1213)
+ +. + |. +...|+++++.+|.+.||++ |+..+ -.+..-. | + ...++.++.+
T Consensus 704 ~~~~~~~~kv~~~egg~~~~~~~~~~~~~e~g~leiy~~-pd~~lVf~v~~f~~~~~~L~~~~~~~~~~~~~s~~~~l~q 782 (1366)
T KOG1896|consen 704 EGLQSLPCKVDDEEGGSPEQEPYWCVFVTESGTLEIYAL-PDFDLVFEVDMFDTGNRVLMDSRLRGPTTNKESEDLELKQ 782 (1366)
T ss_pred cccccCCccccCCCCCCcccCceEEEEEcCCCceEEEcc-CCcceEEEeeccCCCcceEEeecccCccccccccchHHHH
Confidence 0 00 0 00 01178999999999999999 76433 2222110 0 0 0112233334
Q ss_pred eecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC--CCCcc--cc-----------cc-----eeee----------cC
Q 000944 642 VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM--VTGQL--SD-----------SR-----SRFL----------GL 691 (1213)
Q Consensus 642 ~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~--~~~~l--~~-----------~~-----~~~l----------G~ 691 (1213)
.....+|.+- ...+++|++-+.+|.++.|+.-+ ..+.. .. .+ .+.. +.
T Consensus 783 ~~~~~L~~e~--~~~e~~L~lv~~~~eil~Ykaf~~~~~~~~~~~f~kvp~~~~~~~~~p~~~~~~~~~~~~e~~~~~~~ 860 (1366)
T KOG1896|consen 783 LFVNPLGSEI--VFKEPHLFLVVSDNEILIYKAFPQLSQGNLKVFFKKVPHNLNIRTDKPHFLCKKREGGGAEEGASVSV 860 (1366)
T ss_pred hhccccchhh--hccCCceEEEEeCceEEEEeeccccCccchhhhhhhCCHhhcccccCCcccchhhccccccccccccc
Confidence 3334444432 13578899999999999997654 22210 00 00 0000 11
Q ss_pred CCCeEEEE-EECCeeEEEEecCccEE-EEEeCCeEEEEecCcc-ccceeeccccCCCCceEEEEe-CCeEEEEEEcc---
Q 000944 692 RPPKLFSV-VVGGRAAMLCLSSRPWL-GYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA-GNALRVFTIER--- 764 (1213)
Q Consensus 692 ~pv~l~~~-~~~~~~~v~~~g~~p~~-i~~~~~~~~~~~~~~~-~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~--- 764 (1213)
.-.+++.+ .++|++++|+||.+|+| +.+-++.++++|+..+ ++.+++||++.+||+||+|++ ++.|+||.++.
T Consensus 861 ~~~~m~~f~~i~ghsgvfv~Gs~P~~il~t~rg~lr~h~~~gngpv~sfapfhnvn~p~gfiyvd~~~~l~i~~lp~~~~ 940 (1366)
T KOG1896|consen 861 IVQRMTYFEDIGGHSGVFVTGSKPYLILLTFRGVLRFHPVFGNGPVGSFAPFHNVNCPRGFIYVDRQGELVICVLPEALS 940 (1366)
T ss_pred eeeeEEeeccccCeeEEEEecCCceEEEEEcccccceeeeecCCcceeeeeeeccCCCcceEEECCCceEEEEEcchhcc
Confidence 11234444 36799999999999955 5678999999999665 799999999999999999998 57999999998
Q ss_pred CCCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCC
Q 000944 765 LGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLS 844 (1213)
Q Consensus 765 ~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 844 (1213)
+|++||+|+|||+.|||+++||++.+.|+|+++.+..|.. .++|+++ ++...
T Consensus 941 Ydn~wPvkkIpl~~T~~~vvYh~e~~vy~v~t~~~~~~~~--------------~~~d~~e--------------~~~~~ 992 (1366)
T KOG1896|consen 941 YDNKWPVKKIPLRKTPHQVVYHYEKKVYAVITSTPVPYER--------------LGEDGEE--------------EVISR 992 (1366)
T ss_pred cCCCCcccccccccchhheeeeccceEEEEEEeccceeee--------------ccccccc--------------ccccc
Confidence 6899999999999999999999999999999986644321 1222221 11223
Q ss_pred ccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEecc----CCCceEEEEEeeecCccCCCCCCccc
Q 000944 845 DEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHD----KEHGTLLAVGTAKGLQFWPKRNIVAG 920 (1213)
Q Consensus 845 ~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~----~~~~~~i~VGT~~~~~~~~e~~~~~G 920 (1213)
+++..+|. ...+.|+|++|.+|++++.++|+++|++++|+.+.|.. +++++|++|||++.++ ||.++||
T Consensus 993 de~~~~p~----~~~f~i~LisP~sw~vi~~iefq~~E~v~~~k~v~L~~~~t~~~~k~ylavGT~~~~g---EDv~~RG 1065 (1366)
T KOG1896|consen 993 DENVIHPE----GEQFSIQLISPESWEVIDKIEFQENEHVLHMKYVILDDEETTKGKKPYLAVGTAFIQG---EDVPARG 1065 (1366)
T ss_pred cccccccc----cccceeEEecCCccccccccccCccceeeEEEEEEEEecccccCCcceEEEEEeeccc---ccccCcc
Confidence 45556664 34679999999999999999999999999999999987 4668999999999998 9999999
Q ss_pred EEEEEEEEeC---------CceEEEEEEEeecCcceEeccccCeEEEEeCCeEEEEec-CCceeeceeeecCccceEEEE
Q 000944 921 YIHIYRFVEE---------GKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDL-GKKRLLRKCENKLFPNTIVSI 990 (1213)
Q Consensus 921 ri~v~~i~~~---------~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~~-~~~~l~~~~~~~~~~~~i~~l 990 (1213)
|+++|+|.+. ..|+|.++.++++|+|.++|+++|+|+.++|+||++|+| ++..|.++||+| .|.|++++
T Consensus 1066 r~hi~diIeVVPepgkP~t~~KlKel~~eE~KGtVsavceV~G~l~~~~GqKI~v~~l~r~~~ligVaFiD-~~~yv~s~ 1144 (1366)
T KOG1896|consen 1066 RIHIFDIIEVVPEPGKPFTKNKLKELYIEEQKGTVSAVCEVRGHLLSSQGQKIIVRKLDRDSELIGVAFID-LPLYVHSM 1144 (1366)
T ss_pred cEEEEEEEEecCCCCCCcccceeeeeehhhcccceEEEEEeccEEEEccCcEEEEEEeccCCcceeeEEec-cceeEEeh
Confidence 9999999982 259999999999999999999999999999999999999 577899999998 69999999
Q ss_pred EEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecC-C--eeeeecCCCcEEEEecCCCCCcccccCC
Q 000944 991 NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF-D--TMAGADKFGNIYFVRLPQDVSDEIEEDP 1067 (1213)
Q Consensus 991 ~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~-~--~~l~~D~~gnl~il~~~~~~~~~~~~~~ 1067 (1213)
++.+|+|++||+|+|++|++|++++.+|.+++||..++.+++++|+.+ + .++++|++|||++|.|.|++.+ ++
T Consensus 1145 ~~vknlIl~gDV~ksisfl~fqeep~rlsL~srd~~~l~v~s~EFLVdg~~L~flvsDa~rNi~vy~Y~Pe~~e----S~ 1220 (1366)
T KOG1896|consen 1145 KVVKNLILAGDVMKSISFLGFQEEPYRLSLLSRDFEPLNVYSTEFLVDGSNLSFLVSDADRNIHVYMYAPENIE----SL 1220 (1366)
T ss_pred hhhhhheehhhhhhceEEEEEccCceEEEEeecCCchhhceeeeeEEcCCeeEEEEEcCCCcEEEEEeCCCCcc----cc
Confidence 999999999999999999999999999999999999999999998654 4 4799999999999999998774 44
Q ss_pred CCCccccccCccCCcccceeeeeeeecCceeceEEEeeecC-----CCccEEE--EEecccceEEEEecCChhHHHHHHH
Q 000944 1068 TGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVP-----GGGESVI--YGTVMGSLGAMLAFSSRDDVDFFSH 1140 (1213)
Q Consensus 1068 ~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~-----~~~~~i~--~~t~~Gsig~l~~l~~~~~~~~L~~ 1140 (1213)
. |+||.++++||+|..|++|.+..... .+.+... |||+||++|+++|+ +++.||+|..
T Consensus 1221 ~--------------G~RLv~radfhvg~~vs~m~~lp~~~~~e~~~~~~~~~~v~gtlDG~l~~~~Pl-~e~~YRRL~~ 1285 (1366)
T KOG1896|consen 1221 S--------------GQRLVRRADFHVGAHVSTMFRLPCHQNAEFGSNSPMFYEVFGTLDGGLGHLVPL-DEKTYRRLLM 1285 (1366)
T ss_pred C--------------cceeeeeeeeEeccceeeeEeccccccchhccCCchhhhhhcccCCceeEEecC-CHHHHHHHHH
Confidence 3 79999999999999999999854211 1233344 89999999999999 8899999999
Q ss_pred HHHHHHhcCCCCcCCCcccccccc------CCCCceeechhhhhcccCCHHHHHHHHHHcCCCHHHHHHHHHHHHh
Q 000944 1141 LEMHMRQEHPPLCGRDHMAYRSAY------FPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIRN 1210 (1213)
Q Consensus 1141 lq~~l~~~~~~~~gl~~~~~R~~~------~p~~~~iDGdll~~fl~l~~~~q~~i~~~~~~~~~~i~~~l~~l~~ 1210 (1213)
||++|..+++|+|||||++||... +|.+++|||++|.+|..|+.++|.++|+++|+++.+|+++|.+|..
T Consensus 1286 lQn~L~~~~~hv~GLNPr~yR~~~s~~~~~n~~r~ilDg~ll~~f~yl~~~er~elA~kiGt~~~eIl~DLvel~~ 1361 (1366)
T KOG1896|consen 1286 LQNALMDRLPHVGGLNPRAYRLLDSSLQLSNSLRSILDGELLNRFSYLSMSEREELAHKIGTTRKEILDDLVELDR 1361 (1366)
T ss_pred HHHHHHHhhhhhcCCCHHHhhhccchhhhcCCCcccchHhHHHHhhccchhhHHHHHHhcCCCHHHHHHHHHHHHH
Confidence 999999999999999999999764 5789999999999999999999999999999999999999999864
No 4
>COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]
Probab=100.00 E-value=3.5e-96 Score=828.57 Aligned_cols=1125 Identities=14% Similarity=0.157 Sum_probs=832.8
Q ss_pred CeEEEEEeeCCCceeEEEEEEecCCCCceEEEEeCCEEEEEeecCCCCeEEEEEEEeeeeeeEeeEEeeCCCCeeEEEEE
Q 000944 1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVG 80 (1213)
Q Consensus 1 m~~y~~t~~~pt~v~~~v~~~f~~~~~~~LVv~k~~~Levy~i~~~g~L~~v~~~~l~g~I~~i~~~r~~~~~~d~L~v~ 80 (1213)
|++|.. +..+|+++||+.|+|++....+|+|.++|.++||+..-+++|.++.++.+++.+++|..+.-..+++|+|+++
T Consensus 2 ~~~y~d-~~d~tv~~~~~ag~Ft~s~~~~llv~~~Nil~v~~~~~d~~l~l~de~~~~e~~t~I~~~pq~~se~~~lll~ 80 (1319)
T COG5161 2 NYLYSD-ESDWTVTEGCSAGLFTPSRTCSLLVYNGNILAVRLWKYDSGLVLVDEHMLLEKVTQIEKYPQISSEQDGLLLL 80 (1319)
T ss_pred cchhhh-hhHHHHhhccccceeeccccceEEEEeccEEEEEEeeccCCeeEchHHhhhhhhhhhhhcccccCccceEEEE
Confidence 455655 8899999999999999989999999999999999999888899999999999999999998788899999999
Q ss_pred eccceEEEEEEeCCCCcEeEEeeeeccccCcc----cccCCceEEECCCCCEEEEEecccceEEEEEecC--CCCcee--
Q 000944 81 SDSGRIVILEYNPSKNVFDKIHQETFGKSGCR----RIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRD--TAARLT-- 152 (1213)
Q Consensus 81 ~~~~~l~il~~d~~~~~~~tis~~~~~~~g~~----~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~~~~--~~~~~~-- 152 (1213)
|..+|.++++||...+.|-|++.|+|+-.+.- ....-.-+..||++.| |++.+++.....|+.-+ ..++.+
T Consensus 81 t~~akis~lrf~sq~n~f~TislhyyeGKfkgksLvelak~stle~D~~ssc-aLlfneDi~~flpfhvnkndddev~~d 159 (1319)
T COG5161 81 THRAKISLLRFDSQANEFRTISLHYYEGKFKGKSLVELAKFSTLEFDIRSSC-ALLFNEDIGNFLPFHVNKNDDDEVRID 159 (1319)
T ss_pred eccceEEEEEehhhcccceeEEEeeeccccCCchhhhhhhhhheeeccCccc-hhhhhhhhhhcccccccCCcccccccc
Confidence 99999999999999999999999999654422 2223456789999977 56788877776776422 211110
Q ss_pred -----------------------------------eeccc------cccccccEEEEeeeeccCCCCcEEEEEEeecccc
Q 000944 153 -----------------------------------ISSPL------EAHKSHTIVYSICGIDCGFDNPIFAAIELDYSEA 191 (1213)
Q Consensus 153 -----------------------------------~~~p~------e~~~~~~~i~~~~fl~~~~~~p~~a~L~~~~~~~ 191 (1213)
++.|. |..-...+|+|++||. +|..||+|+| |. +
T Consensus 160 ~D~~~~~~~~~h~~i~psqgtntfnkrkrt~~~~kfsaPs~Vl~~seld~~ikniiD~~FL~-ny~~PTvall---Y~-P 234 (1319)
T COG5161 160 VDLGMFQMSKRHFSIFPSQGTNTFNKRKRTLFPGKFSAPSKVLKFSELDGKIKNIIDFVFLE-NYSIPTVALL---YD-P 234 (1319)
T ss_pred ccccHHHHHHHHhhcCCCCCccccchhhhhhcCCcccCceeEEEehhhhccccccEEEEeec-cCCCceEEEE---ec-c
Confidence 11120 1112445789999998 9999999999 43 4
Q ss_pred ccCcchhccccc-cceEEEEEEEcCCceeeee-eeeccCCCcceEEecCCCCCCCCeEEEEeeceEEEEeCCCCceeeec
Q 000944 192 DQDSTGQAASEA-QKNLTFYELDLGLNHVSRK-WSEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVI 269 (1213)
Q Consensus 192 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~lp~~~~~liplp~~~~~~~GvLv~~~~~i~y~~~~~~~~~~~~ 269 (1213)
+..|++.....+ ......+.+|++.....-. .-..+|+|-+..+|+| .|.|++|.|..+|++..+...++++
T Consensus 235 kl~~~~~~ti~k~p~~~~v~Tldl~~~~saVI~~~~~lP~d~~~~v~~p------~Gall~g~neli~idstg~~~~I~l 308 (1319)
T COG5161 235 KLSLPRKYTILKNPYNAIVFTLDLGAGRSAVIDEFLVLPRDFRVTVAGP------VGALLFGSNELILIDSTGSSYTIPL 308 (1319)
T ss_pred cccccceeEeecCceeEEEEEEecCcchhhhhHhHhcCCceEEEEEecc------cceEEEecccEEEEecCCcEEEeec
Confidence 445665322212 2445666777764332211 2234899999999988 6999999999999999887777776
Q ss_pred CCCCC----C--CCCcc-eEEEEE------EEEEe-c-CceEEEEEeCCCCEEEEEEEeCCceeeeeEEEEe-------C
Q 000944 270 PRRAD----L--PAERG-VLIVSA------ATHRQ-K-TLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYF-------D 327 (1213)
Q Consensus 270 p~~~~----~--~~~~~-~~i~~~------~~~~~-~-~~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~l-------~ 327 (1213)
+.-.. . .++.. ..+.|. -|... + ....+++++-+|+.|+|.+..||+++..+.+..+ .
T Consensus 309 Ns~~~k~~~~~~v~d~s~~d~n~~~~gttsIwipsSK~~~etl~l~dl~g~~yyl~~~~dgk~iigfdi~~L~~e~dllk 388 (1319)
T COG5161 309 NSMSEKYGGNKIVEDISLSDVNCFSRGTTSIWIPSSKCLIETLFLGDLNGDRYYLRISMDGKRIIGFDIASLEFEGDLLK 388 (1319)
T ss_pred hhhHHHhcCCceEeecccceeeEeecCceeeeccCcccccceEEEEecCCCEEEEEEEeccceeeccceeeeeeeccccc
Confidence 64211 0 11110 111111 11111 1 1257899999999999999999998877555443 2
Q ss_pred CCCcceeEEEEcCCeEEEEeeeCCeEEEEEeecCCCCC----cc--cc---cCCccccc----c------CCCceeeccC
Q 000944 328 TIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPD----VE--AS---SSTLMETE----E------GFQPVFFQPR 388 (1213)
Q Consensus 328 ~~~~~s~l~~l~~~~lFvgS~~gds~l~~~~~~~~~~~----~~--~~---~~~~~~~~----~------~~~~~~~~~~ 388 (1213)
..|.++|+..+++..+|.|+..+||.+++|.++....| +. .. .+..|++. | .++....+ .
T Consensus 389 ~~s~~~Cv~~~n~~l~f~g~g~~ns~vlr~~~l~~tiEtR~~eG~~~l~g~nDeEmdD~y~apEn~l~~n~~~~v~~~-~ 467 (1319)
T COG5161 389 KGSAVSCVGHVNNLLFFGGVGDSNSRVLRIKSLLPTIETRASEGVGPLEGGNDEEMDDEYSAPENKLFGNKEQEVRRQ-D 467 (1319)
T ss_pred cCCCCeeEEEcCceEEEEEecCCceEEEEecccCCchhhhhhcCCCcccCCChhhhhhhhcccccccccCcccceeec-c
Confidence 45789999999999999999999999999998753110 00 00 11112221 1 11111111 1
Q ss_pred CcccEEEEEEeccCCcccceEEeccCC--------CCCCcEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEe
Q 000944 389 GLKNLVRIEQVESLMPIMDMRIANLFE--------EEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTV 460 (1213)
Q Consensus 389 ~~~~l~~~d~~~n~gPI~D~~~~~~~~--------~~~~~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l 460 (1213)
+.+.+++++.+.|+|||.||++++.+. .+...+|+.+|.+..|+|.+++..+.|.+.+.+.+-++ +.+|++
T Consensus 468 ~p~d~el~~~l~n~gpitdfavgkv~v~kglP~pN~g~l~lV~t~G~ds~~~l~V~~ts~~P~I~~~~~fi~~-e~vw~~ 546 (1319)
T COG5161 468 EPYDAELFNALSNAGPITDFAVGKVDVEKGLPIPNIGLLNLVVTKGSDSEAALAVEGTSLEPCICTVSSFIPL-EIVWSQ 546 (1319)
T ss_pred CcchhHHhhhhccCCcccceeeeeccceecCCCCCccceeeEEeccCCCcceEEEEeccccceeeehccccch-hheeeh
Confidence 234578999999999999999987652 23457999999999999999999999999999998885 999999
Q ss_pred eeCCC---CCCceEEEEEecCceeEEEeccceeeecCCCccCCCCeEEEEeecCC-eEEEEeCCcEEEEeCC-Cceeeee
Q 000944 461 KKNVN---DEFDAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDD-SLMQVHPSGIRHIRED-GRINEWR 535 (1213)
Q Consensus 461 ~~~~~---~~~~~~lvlS~~~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~-~ivQVT~~~i~l~~~~-~~~~~~~ 535 (1213)
+.... ...-.|+++|...+|+||+.++++.+.....|..+..|++.+.++.+ ++|||||+.+++++.+ ++++.+.
T Consensus 547 kI~g~lr~~~~~~~~~ls~~s~S~If~~~e~f~l~~~g~~~rd~~Tl~~~~fgee~rvVQvtp~~l~~yD~~lR~l~~~~ 626 (1319)
T COG5161 547 KIRGYLRCSRALDFYILSRVSDSRIFRWSEEFLLEVSGEYTRDVNTLLFVEFGEENRVVQVTPSYLLRYDQDLRMLGRVE 626 (1319)
T ss_pred hccceehhcceeeEEEeecccccceeeccccceeeecceeeccccEEEeeeccCcceEEEecchHhhhhcccceeeeeEe
Confidence 87532 12246899999999999999999988888889999999999999865 6999999999999977 5666666
Q ss_pred CCCCccEEEEEecCCEEEEEEeCCEEEEEEEccCC-CeEEeeeec-cCc-ceEEEEeeec--------CCC-ceeeeEEE
Q 000944 536 TPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTG-QLLEVEKHE-MSG-DVACLDIASV--------PEG-RKRSRFLA 603 (1213)
Q Consensus 536 ~~~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~-~l~~~~~~~-l~~-~is~i~i~~~--------~~~-~~~~~~l~ 603 (1213)
+.. ..|.+.+++||++++...+|.+..|++++.. .|..+.-.+ +.. ..+++-+... +.. ++....++
T Consensus 627 F~~-~~V~~~Sv~Dp~ilvv~~~g~i~~f~~~ekn~rL~k~dl~~~l~d~k~~s~v~~dsN~~g~f~ig~~~Sq~e~~l~ 705 (1319)
T COG5161 627 FAS-RAVEARSVRDPLILVVRDSGKILTFYDREKNMRLFKIDLVTCLADAKNKSFVLSDSNSLGIFDIGKRISQLEPCLV 705 (1319)
T ss_pred ece-eeeEEEeccCCEEEEEEecCceEEEEehhhhchhccCChHHHHHhhhhheEeccCcccccceecccchhhhchhhh
Confidence 532 2488999999999999999999888877532 122210000 111 1111111100 000 01112233
Q ss_pred EEEeCCcEEE-EEeCCC----------CceeEeE---Eeec-C---CCCceeEEEEeecccCCCCCCCCCCceEEEEEee
Q 000944 604 VGSYDNTIRI-LSLDPD----------DCMQILS---VQSV-S---SPPESLLFLEVQASVGGEDGADHPASLFLNAGLQ 665 (1213)
Q Consensus 604 v~~~~~~i~i-~sl~p~----------~~l~~~~---~~~l-~---~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~ 665 (1213)
.+... ..++ +.-.|. +++...+ ...+ . ..| | +.+.....+|++. ...||+.-..
T Consensus 706 ~~~~~-~~q~~~~~s~~~D~~~e~dg~dQlte~~~~~tynl~d~~f~lp-s--i~~~mVa~lg~D~----keeyLf~~s~ 777 (1319)
T COG5161 706 KGLPY-AIQFSPEASPAMDLAGEEDGDDQLTEISMSLTYNLIDMLFRLP-S--IGNYMVAYLGLDL----KEEYLFDNSL 777 (1319)
T ss_pred hcCcc-cceeccccCcchhhccccccchhhhhHHHHHHHhhhhhhccCh-h--hhhhhhHhhcccc----cchheehhhc
Confidence 33221 1111 111110 0110000 0001 0 112 1 1222223344433 5789999999
Q ss_pred CCeEEEEEEeCCCCc---ccccce-eeecCCC-------------CeEEEEEECCeeEEEEecCccEEEE-EeCCeEEEE
Q 000944 666 NGVLFRTVVDMVTGQ---LSDSRS-RFLGLRP-------------PKLFSVVVGGRAAMLCLSSRPWLGY-IHRGRFLLT 727 (1213)
Q Consensus 666 ~G~l~~~~~~~~~~~---l~~~~~-~~lG~~p-------------v~l~~~~~~~~~~v~~~g~~p~~i~-~~~~~~~~~ 727 (1213)
.|.++.|+--+.... ..-++. ..+-..| +++.-.+..|++.+|++|..|+++. ..+....+.
T Consensus 778 ~~EI~~yk~~l~r~~~f~~nvTRndlAitGaPdna~~Ka~sSV~ri~m~f~~~vghs~~fvTg~~pfl~~s~~~s~~k~f 857 (1319)
T COG5161 778 SSEIVFYKTHLPRHVSFNLNVTRNDLAITGAPDNADIKAFSSVGRIDMVFIKAVGHSFMFVTGKGPFLCRSRYTSSSKAF 857 (1319)
T ss_pred CceEEEEeecccccchhhhhcchhhhhccCCCcchhhhhcccccceeEEEeeccCeEEEEEcCCccEEEEEeccCCccee
Confidence 999999875332110 000000 0000111 1233334568999999999999875 456667777
Q ss_pred ecCccccceeeccccCCCCceEEEEeCC-eEEEEEEcc----CCCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCC
Q 000944 728 PLSYETLEYAASFSSDQCVEGVVSVAGN-ALRVFTIER----LGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGAL 802 (1213)
Q Consensus 728 ~~~~~~v~~~~~f~~~~~~~~~i~~~~~-~L~i~~l~~----~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~ 802 (1213)
|...-++.+++||+. .|++|+++. .+++|+... .++||+++++|++.|..+++||+..+.|+|..++..++
T Consensus 858 ~~gNIPlvsv~p~s~----rgy~~Vd~~~~vr~~~~~~dn~y~gnK~p~k~~~~~Ktlqklvyh~~~~~~~Vgsc~~~~f 933 (1319)
T COG5161 858 HRGNIPLVSVIPLSK----RGYLMVDNVLGVRASQYVFDNGYVGNKNPVKRTPKHKTLQKLVYHCAGRYMVVGSCEEAGF 933 (1319)
T ss_pred ecCCCceeeeeeccc----ccEEEEecccceeEEEEEeccceecccCceeeccccccccceeeeccceEEEEEeeeecCc
Confidence 777668899999986 689999864 788888775 35899999999999999999999999999998876665
Q ss_pred CHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCc
Q 000944 803 TAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNE 882 (1213)
Q Consensus 803 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E 882 (1213)
.+ .+||+|... .-++ ..|++ +-+++.+.|++|.+|+++++|+|+++|
T Consensus 934 ~~--------------~gEdgE~~i---------------~~D~--Nvpha--eg~~~~vdL~spksw~vID~yef~~ne 980 (1319)
T COG5161 934 SP--------------KGEDGESGI---------------PVDT--NVPHA--EGYRFYVDLYSPKSWEVIDTYEFDENE 980 (1319)
T ss_pred cc--------------cCCCCCccC---------------ccCC--CCccc--ccceeeEEEecCcceeEeeeeecccce
Confidence 43 234432211 1111 22332 356789999999999999999999999
Q ss_pred eEEEEEEEEecc----CCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeC---------CceEEEEEEEeecCcceEe
Q 000944 883 AAFSICTVNFHD----KEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEE---------GKSLELLHKTQVEGIPLAL 949 (1213)
Q Consensus 883 ~v~s~~~~~l~~----~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~---------~~kl~~~~~~~~~g~V~ai 949 (1213)
.+.+|+.+.++. +++++||+|||++..+ ||.|.+||+++|+|.+. ..|||++..++++|.|..+
T Consensus 981 ~v~~i~~~~l~~~~~tk~k~pyi~vgtt~~~g---ED~p~rG~~hv~eII~VVP~pg~P~t~~KLK~~~~Ee~kGTV~~v 1057 (1319)
T COG5161 981 YVFHIKYLILDDMQGTKGKSPYILVGTTFIEG---EDRPARGRLHVLEIISVVPSPGSPFTDCKLKVLGIEETKGTVVRV 1057 (1319)
T ss_pred eeeeeeeeeeeccccccCCCceEEEEeeeccc---CccCCcCceEEEEEEEecCCCCCCcccceeeEEehhhcccEEEEE
Confidence 999999998876 5677999999999997 99999999999999872 3699999999999999999
Q ss_pred ccccCeEEEEeCCeEEEEecCC-ceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCc
Q 000944 950 CQFQGRLLAGIGPVLRLYDLGK-KRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPR 1028 (1213)
Q Consensus 950 ~~~~g~ll~~~g~~l~i~~~~~-~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~ 1028 (1213)
|++.|+++.++||||.++++++ ..+++++|+| ++++++++++.+|++++||+|++++|+-|++++++|..+++....+
T Consensus 1058 cEV~G~~~~~qgqKV~Vr~i~~~~~iipV~F~D-l~~ft~s~k~~~Nlll~gD~~qg~~F~gF~~ePyRm~l~s~s~~~~ 1136 (1319)
T COG5161 1058 CEVRGKIALCQGQKVMVRKIDRSSGIIPVGFYD-LHIFTSSIKVVKNLLLAGDIYQGLSFFGFQSEPYRMHLISSSEPLR 1136 (1319)
T ss_pred EEEccEEEeccCcEEEEEEecccCCcceeEEEe-eeeeeehhhhhhheeehhhhhcCcEEEEecCCcEEEEEecCCchhh
Confidence 9999999999999999999985 4689999999 7999999999999999999999999999999999999999999999
Q ss_pred ceEEEEeecCC---eeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEee
Q 000944 1029 WLTAAHHIDFD---TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKAS 1105 (1213)
Q Consensus 1029 ~~~~~~~ld~~---~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~ 1105 (1213)
.+++.+||..+ .++++|+.||++++.|+|+.++ +..|+||..++.||+|...++|.-..
T Consensus 1137 n~~s~efLv~G~~lyf~~~Da~gnih~l~Y~P~np~------------------S~sG~RLV~rssFtlhs~~~~m~llP 1198 (1319)
T COG5161 1137 NATSTEFLVTGNELYFLCCDAKGNIHGLTYSPNNPI------------------SMSGARLVKRSSFTLHSAEIKMNLLP 1198 (1319)
T ss_pred cchhhHhhccCCeeEEEEEcCCCCEEEEecCCCCcc------------------ccCcceeEeeccccccchhhhhhhcc
Confidence 99999988664 4799999999999999996653 13489999999999999999887542
Q ss_pred ecC------CCccEEEEEecccceEEEEecCChhHHHHHHHHHHHHHhcCCCCcCCCcccccccc------CCCCceeec
Q 000944 1106 LVP------GGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAY------FPVKDVIDG 1173 (1213)
Q Consensus 1106 ~~~------~~~~~i~~~t~~Gsig~l~~l~~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~~------~p~~~~iDG 1173 (1213)
... .+..+.+|+-++|++..++|| +++.|++|..+|+++..+..+++||||+.||-.. +|.+..+|+
T Consensus 1199 rn~efG~~~~~~f~~v~~~sdG~l~~vvpi-sd~~YrrL~~IQ~~i~~r~~~vgGLNpr~yRL~~d~~~~~~s~r~~ld~ 1277 (1319)
T COG5161 1199 RNSEFGAGFKKNFIMVYSRSDGMLIHVVPI-SDAHYRRLLGIQTAIMARLKSVGGLNPRDYRLNSDIHLHSLSLRSPLDL 1277 (1319)
T ss_pred chhhhCCCCCCceeEEEEccCCcEEEEecc-CHHHHHHHHHHHHHHHHHHHhhcCCChhhhhhccCHHHhcCCcccchhh
Confidence 111 235678999999999999999 7889999999999999999999999999999653 478999999
Q ss_pred hhhhhcccCCHHHHHHHHHHcCCCHHHHHHHHHHHH
Q 000944 1174 DLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIR 1209 (1213)
Q Consensus 1174 dll~~fl~l~~~~q~~i~~~~~~~~~~i~~~l~~l~ 1209 (1213)
.+|..|-.|+.+.|+++|.++|+-...++.++.++.
T Consensus 1278 ~ii~~F~y~~~~~r~sva~kaGr~~~~e~~D~i~~~ 1313 (1319)
T COG5161 1278 HIINLFSYFDMSTRESVASKAGRIDRKEISDMIASL 1313 (1319)
T ss_pred hhhhhhhhcchhhhhHHHhhcCCchHHHHHHHHHHH
Confidence 999999999999999999999987654443544443
No 5
>PF10433 MMS1_N: Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=100.00 E-value=2.6e-59 Score=568.40 Aligned_cols=458 Identities=40% Similarity=0.650 Sum_probs=329.1
Q ss_pred eEEEEEeccceEEEEEEeCCCCcEeE--Eee-eeccccCcccccCCceEEECCCCCEEEEEecccceEEEEEec----CC
Q 000944 75 DYIVVGSDSGRIVILEYNPSKNVFDK--IHQ-ETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNR----DT 147 (1213)
Q Consensus 75 d~L~v~~~~~~l~il~~d~~~~~~~t--is~-~~~~~~g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~~~----~~ 147 (1213)
|+|+|+++++++++++|+++++++.+ +++ ..+.+++.++..+|++++|||.|||+|+++|++.+.|+++++ +.
T Consensus 1 D~L~v~tdsg~l~~l~~~~~~~~~~~~~v~~~~~~~~~~~r~~~~G~~l~vDP~~R~i~v~a~e~~~~v~~l~~~~~~~~ 80 (504)
T PF10433_consen 1 DSLVVTTDSGKLSILEYDPSTHGFFKEFVHQWEPLSKSGSRLSQPGQYLAVDPSGRCIAVSAYEGNFLVYPLNRSLDSDI 80 (504)
T ss_dssp -EEEEEETTTEEEEEEEEEETTEE-E-EEEEEEE---SSSEB-TT--EEEE-TTSSEEEEEEBTTEEEEEE-SS----T-
T ss_pred CEEEEEECCCCEEEEEEECCCCccceeeEEEeEecCCCCCChhcCCcEEEECCcCCEEEEEecCCeEEEEEecccccccc
Confidence 79999999999999999998888643 333 678899999999999999999999999999999999999987 11
Q ss_pred CCceeeeccccccccccEEEEeeeec--cCCCCcEEEEEEeeccccccCcchhccccccceEEEEEEEcC--Cceeeeee
Q 000944 148 AARLTISSPLEAHKSHTIVYSICGID--CGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLG--LNHVSRKW 223 (1213)
Q Consensus 148 ~~~~~~~~p~e~~~~~~~i~~~~fl~--~~~~~p~~a~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 223 (1213)
.....+++|+ ++.++|++|+||+ .++++|++|+|+++.. ..+++.+|+|+.. +++..+++
T Consensus 81 ~~~~~~~~pi---~s~~~i~~~~FL~~~~~~~~p~la~L~~~~~-------------~~~~~~~y~w~~~~~l~~~~~~~ 144 (504)
T PF10433_consen 81 AFSPHINSPI---KSEGNILDMCFLHPSVGYDNPTLAILYVDSQ-------------RRTHLVTYEWSLDDGLNHVISKS 144 (504)
T ss_dssp TT---EEEE-----S-SEEEEEEEES---S-SS-EEEEEEEETT--------------EEEEEEEE--------EETTTT
T ss_pred cccccccccc---cCCceEEEEEEEecccCCCCceEEEEEEEec-------------ccceeEEEeeecccccceeeeec
Confidence 2223344565 3477999999998 6899999999987532 2578999998765 33333332
Q ss_pred eec--c---CCCcceEEecCCCCCCCCeEEEEeeceEEEEeCCCC---ceeeecCCCCCCCCCcceEEEEEEEEE-----
Q 000944 224 SEP--V---DNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHP---DVRAVIPRRADLPAERGVLIVSAATHR----- 290 (1213)
Q Consensus 224 ~~~--l---p~~~~~liplp~~~~~~~GvLv~~~~~i~y~~~~~~---~~~~~~p~~~~~~~~~~~~i~~~~~~~----- 290 (1213)
... + ...+++|||||.+. ||+||++++.++|.++... ....+++... ......|++|++..
T Consensus 145 ~~~~~l~~~~~~p~~LIPlp~~~---ggllV~~~~~i~y~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~p~~~~~~ 218 (504)
T PF10433_consen 145 TLPIRLPNEDELPSFLIPLPNPP---GGLLVGGENIIIYKNHLIGSGDYSFLSIPSPP---SSSSSLWTSWARPERNISY 218 (504)
T ss_dssp EEEE--EEEE-TTEEEEEE-TTT----SEEEEESSEEEEEE------TTEEEEE--H----HHHTS-EEEEEE------S
T ss_pred cccccccccCCCccEEEEcCCCC---cEEEEECCEEEEEecccccccccccccccCCc---cCCCceEEEEEecccccee
Confidence 221 2 11249999999988 9999999999999976432 1122222110 01246788888743
Q ss_pred ecCceEEEEEeCCCCEEEEEEEeCCceeeeeEEEEeCC-CCcceeEEEEcCC--eEEEEeeeCCeEEEEEeecCCCCCcc
Q 000944 291 QKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDT-IPVTASMCVLKSG--YLFAASEFGNHALYQFQAIGADPDVE 367 (1213)
Q Consensus 291 ~~~~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~l~~-~~~~s~l~~l~~~--~lFvgS~~gds~l~~~~~~~~~~~~~ 367 (1213)
.++.+++||++++|+||+|.+..++. ++++.++|+ .++|+++++++++ +||+||+.|||+|+++..
T Consensus 219 ~~~~~~~lL~~e~G~l~~l~l~~~~~---~i~i~~~g~~~~~~s~l~~l~~g~d~lf~gs~~gds~l~~~~~-------- 287 (504)
T PF10433_consen 219 DKDGDRILLQDEDGDLYLLTLDNDGG---SISITYLGTLCSIASSLTYLKNGGDYLFVGSEFGDSQLLQISL-------- 287 (504)
T ss_dssp STTSSEEEEEETTSEEEEEEEEEEEE---EEEEEEEEE--S-ESEEEEESTT--EEEEEESSS-EEEEEEES--------
T ss_pred cCCCCEEEEEeCCCeEEEEEEEECCC---eEEEEEcCCcCChhheEEEEcCCCEEEEEEEecCCcEEEEEeC--------
Confidence 24678999999999999999999875 789999999 8999999999999 999999999999999973
Q ss_pred cccCCccccccCCCceeeccCCcccEEEEEEeccCCcccceEEeccCCCCCC------cEEEEEecCCCCeEEEEccCCc
Q 000944 368 ASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAP------QIFTLCGRGPRSSLRILRPGLA 441 (1213)
Q Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~n~gPI~D~~~~~~~~~~~~------~lv~~sG~g~~GsL~~lr~gi~ 441 (1213)
.+++++++++|||||+||++++..+++++ +|++|||.|++|+|+++|+|++
T Consensus 288 -----------------------~~l~~~~~~~N~~Pi~D~~v~~~~~~~~~~~~~~~~lv~~sG~g~~gsL~~lr~Gi~ 344 (504)
T PF10433_consen 288 -----------------------SNLEVLDSLPNWGPIVDFCVVDSSNSGQPSNPSSDQLVACSGAGKRGSLRILRNGIG 344 (504)
T ss_dssp -----------------------ESEEEEEEE----SEEEEEEE-TSSSSS-------EEEEEESSGGG-EEEEEEESBE
T ss_pred -----------------------CCcEEEEeccCcCCccceEEeccccCCCCcccccceEEEEECcCCCCcEEEEeccCC
Confidence 25899999999999999999987655555 9999999999999999999999
Q ss_pred ce--EEEEecCCCCcceEEEeeeCCCCCCceEEEEEecCceeEEEec-----cceeeecCCCccCCCCeEEEEeecCCeE
Q 000944 442 VS--EMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIG-----ETVEEVSDSGFLDTTPSLAVSLIGDDSL 514 (1213)
Q Consensus 442 ~~--~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS~~~~T~vl~~~-----~~~~e~~~~gf~~~~~Tl~a~~~~~~~i 514 (1213)
++ ..+..++|+ ++++|+++....+ |.||++|++++|+||+++ ++++|++..+|.++++||+||+++++.+
T Consensus 345 ~~~~~~~~~~l~~-v~~iW~l~~~~~~--~~~lv~S~~~~T~vl~~~~~d~~e~~~e~~~~~f~~~~~Tl~~~~~~~~~i 421 (504)
T PF10433_consen 345 IEGLELASSELPG-VTGIWTLKLSSSD--HSYLVLSFPNETRVLQISEGDDGEEVEEVEEDGFDTDEPTLAAGNVGDGRI 421 (504)
T ss_dssp EE--EEEEEEEST-EEEEEEE-SSSSS--BSEEEEEESSEEEEEEES----SSEEEEE---TS-SSS-EEEEEEETTTEE
T ss_pred ceeeeeeccCCCC-ceEEEEeeecCCC--ceEEEEEcCCceEEEEEecccCCcchhhhhhccCCCCCCCeEEEEcCCCeE
Confidence 99 888999999 5999999976433 899999999999999994 4566664349999999999999999999
Q ss_pred EEEeCCcEEEEeC--CCceeeeeCCCCccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEee---eeccCcceEEEEe
Q 000944 515 MQVHPSGIRHIRE--DGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVE---KHEMSGDVACLDI 589 (1213)
Q Consensus 515 vQVT~~~i~l~~~--~~~~~~~~~~~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~---~~~l~~~is~i~i 589 (1213)
||||+++||+++. .+...+|.+|.+..|++|++++++++++++++++++|+++......+.. ..+++.+|+|+++
T Consensus 422 vQVt~~~i~l~~~~~~~~~~~w~~~~~~~I~~a~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eis~l~i 501 (504)
T PF10433_consen 422 VQVTPKGIRLIDLEDGKLTQEWKPPAGSIIVAASINDPQVLVALSGGELVYFELDDNKISVSDNDETILELDNEISCLSI 501 (504)
T ss_dssp EEEESSEEEEEESSSTSEEEEEE-TTS---SEEEESSSEEEEEE-TTEEEEEEEETTEEEEEEE----EE-SS-EEEEE-
T ss_pred EEEecCeEEEEECCCCeEEEEEeCCCCCeEEEEEECCCEEEEEEeCCcEEEEEEECCceeeeeeccccccCCCceEEEEe
Confidence 9999999999974 4778899999999999999999999999999999999998543211211 2347899999998
Q ss_pred ee
Q 000944 590 AS 591 (1213)
Q Consensus 590 ~~ 591 (1213)
.|
T Consensus 502 ~p 503 (504)
T PF10433_consen 502 EP 503 (504)
T ss_dssp --
T ss_pred CC
Confidence 65
No 6
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=100.00 E-value=3.9e-50 Score=460.63 Aligned_cols=300 Identities=35% Similarity=0.632 Sum_probs=254.1
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCC--CceEEEEEeeecCccCCCCCCcc-cEEEEEEEEeC---Cce
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKE--HGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVEE---GKS 933 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~--~~~~i~VGT~~~~~~~~e~~~~~-Gri~v~~i~~~---~~k 933 (1213)
|+++|+|+.+|+++++++|+++|+++|++.|+|.+.. .++|++|||++..+ |+.+++ |||++|++.+. .++
T Consensus 2 s~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~---~~~~~~~Gri~v~~i~~~~~~~~~ 78 (321)
T PF03178_consen 2 SSIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYG---EDPEPSSGRILVFEISESPENNFK 78 (321)
T ss_dssp -EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE--T---TSSS-S-EEEEEEEECSS-----E
T ss_pred cEEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEeccccc---ccccccCcEEEEEEEEcccccceE
Confidence 6999999999999999999999999999999998632 57999999999997 566555 99999999984 359
Q ss_pred EEEEEEEeecCcceEeccccCeEEEEeCCeEEEEecCCce-eeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEe
Q 000944 934 LELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKR-LLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 934 l~~~~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~~~~~~-l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
|+++++++++|||+||+.++|+|++|+|++|++|+|+.++ |.++|+++ .+.++++|.+.+|+|++||+++|+++++|+
T Consensus 79 l~~i~~~~~~g~V~ai~~~~~~lv~~~g~~l~v~~l~~~~~l~~~~~~~-~~~~i~sl~~~~~~I~vgD~~~sv~~~~~~ 157 (321)
T PF03178_consen 79 LKLIHSTEVKGPVTAICSFNGRLVVAVGNKLYVYDLDNSKTLLKKAFYD-SPFYITSLSVFKNYILVGDAMKSVSLLRYD 157 (321)
T ss_dssp EEEEEEEEESS-EEEEEEETTEEEEEETTEEEEEEEETTSSEEEEEEE--BSSSEEEEEEETTEEEEEESSSSEEEEEEE
T ss_pred EEEEEEEeecCcceEhhhhCCEEEEeecCEEEEEEccCcccchhhheec-ceEEEEEEeccccEEEEEEcccCEEEEEEE
Confidence 9999999999999999999999999999999999999888 99999998 488999999999999999999999999999
Q ss_pred ccCCeEEEeeccCCCcceEEEEee-cCCeeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCccc-ceeeee
Q 000944 1013 RDENQLYIFADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPN-KMEEIV 1090 (1213)
Q Consensus 1013 ~~~~~l~~~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~L~~~~ 1090 (1213)
+++.+|.++|||+.++|++++.++ |++.++++|++||+++|++++...++ . .+. +|...+
T Consensus 158 ~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~gnl~~l~~~~~~~~~----~--------------~~~~~L~~~~ 219 (321)
T PF03178_consen 158 EENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDGNLFVLRYNPEIPNS----R--------------DGDPKLERIS 219 (321)
T ss_dssp TTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTSEEEEEEE-SS-SST----T--------------TTTTBEEEEE
T ss_pred ccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCCeEEEEEECCCCccc----c--------------cccccceeEE
Confidence 988889999999999999999999 77799999999999999999865532 1 255 999999
Q ss_pred eeecCceeceEEEeeecC--CCc-----cEEEEEecccceEEEEecCChhHHHHHHHHHHHHHhcCCCCcCCCccccccc
Q 000944 1091 QFHVGDVVTSLQKASLVP--GGG-----ESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSA 1163 (1213)
Q Consensus 1091 ~~~lg~~v~~~~~~~~~~--~~~-----~~i~~~t~~Gsig~l~~l~~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~~ 1163 (1213)
+||+|+.|++++++++.+ ... +.++|+|.+|+||.++|+.++++|++|+.||+.|.+.+++++|++|++||++
T Consensus 220 ~f~lg~~v~~~~~~~l~~~~~~~~~~~~~~i~~~T~~G~Ig~l~p~l~~~~~~~L~~lQ~~l~~~~~~~~gl~~~~~R~~ 299 (321)
T PF03178_consen 220 SFHLGDIVNSFRRGSLIPRSGSSESPNRPQILYGTVDGSIGVLIPFLSEEEYRFLQALQNNLRKHIPSLGGLNPRSFRSY 299 (321)
T ss_dssp EEE-SS-EEEEEE--SS--SSSS-TTEEEEEEEEETTS-EEEEEE-E-HHHHHHHHHHHHHHHHHS--TTS--HHHHTSE
T ss_pred EEECCCccceEEEEEeeecCCCCcccccceEEEEecCCEEEEEEecCCHHHHHHHHHHHHHHHhhCCCCccCChHHhccc
Confidence 999999999999998766 222 5699999999999999943889999999999999999999999999999999
Q ss_pred cCC----CCceeechhhhhccc
Q 000944 1164 YFP----VKDVIDGDLCEQFPT 1181 (1213)
Q Consensus 1164 ~~p----~~~~iDGdll~~fl~ 1181 (1213)
++| +++|||||||++|++
T Consensus 300 ~~~~~~~~~~~iDgdll~~fl~ 321 (321)
T PF03178_consen 300 KNPRMKRSKGFIDGDLLEQFLE 321 (321)
T ss_dssp EESEEE--BSEEEHHHHHGGGG
T ss_pred cCccccCCCccCcHHHHHHhhC
Confidence 988 899999999999986
No 7
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=98.27 E-value=0.012 Score=69.12 Aligned_cols=163 Identities=14% Similarity=0.099 Sum_probs=99.6
Q ss_pred ceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeec---Ccce
Q 000944 871 NTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVE---GIPL 947 (1213)
Q Consensus 871 ~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~---g~V~ 947 (1213)
.-+..+..++.|.+.|.+.- .-.+.|++||.. |+.+|++..++ .+++..-.+.+ -++.
T Consensus 372 ~~Llkl~~k~~~nIs~~aiS-----Pdg~~Ia~st~~-------------~~~iy~L~~~~-~vk~~~v~~~~~~~~~a~ 432 (691)
T KOG2048|consen 372 IHLLKLFTKEKENISCAAIS-----PDGNLIAISTVS-------------RTKIYRLQPDP-NVKVINVDDVPLALLDAS 432 (691)
T ss_pred hhheeeecCCccceeeeccC-----CCCCEEEEeecc-------------ceEEEEeccCc-ceeEEEeccchhhhccce
Confidence 34456667777888776542 234899999974 44578877654 33333322222 1222
Q ss_pred Ee--ccccCeEEEEeCC--eEEEEecCC---ceeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEeccCCeE
Q 000944 948 AL--CQFQGRLLAGIGP--VLRLYDLGK---KRLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRDENQL 1018 (1213)
Q Consensus 948 ai--~~~~g~ll~~~g~--~l~i~~~~~---~~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~~~~l 1018 (1213)
++ ..-+++++++..+ .+..++++. +++.....-..++ +|..|. ..+|+|.+.+....|+++... ..+.
T Consensus 433 ~i~ftid~~k~~~~s~~~~~le~~el~~ps~kel~~~~~~~~~~-~I~~l~~SsdG~yiaa~~t~g~I~v~nl~--~~~~ 509 (691)
T KOG2048|consen 433 AISFTIDKNKLFLVSKNIFSLEEFELETPSFKELKSIQSQAKCP-SISRLVVSSDGNYIAAISTRGQIFVYNLE--TLES 509 (691)
T ss_pred eeEEEecCceEEEEecccceeEEEEecCcchhhhhccccccCCC-cceeEEEcCCCCEEEEEeccceEEEEEcc--ccee
Confidence 22 2224555554433 455555553 3344333222233 455554 579999999988888886554 5566
Q ss_pred EEeeccCCCcceEEEEee--cCCeeeeecCCCcEEEEecC
Q 000944 1019 YIFADDSVPRWLTAAHHI--DFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1019 ~~~a~D~~~~~~~~~~~l--d~~~~l~~D~~gnl~il~~~ 1056 (1213)
..+.-+.. ..+|++.+- +.+++++++.++.++-|+.+
T Consensus 510 ~~l~~rln-~~vTa~~~~~~~~~~lvvats~nQv~efdi~ 548 (691)
T KOG2048|consen 510 HLLKVRLN-IDVTAAAFSPFVRNRLVVATSNNQVFEFDIE 548 (691)
T ss_pred ecchhccC-cceeeeeccccccCcEEEEecCCeEEEEecc
Confidence 66665555 788888754 55689999999999998884
No 8
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=98.15 E-value=0.031 Score=63.98 Aligned_cols=141 Identities=16% Similarity=0.197 Sum_probs=99.7
Q ss_pred CceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCeEEEEe--CCeEEEEecCCce
Q 000944 897 HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGRLLAGI--GPVLRLYDLGKKR 973 (1213)
Q Consensus 897 ~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~ll~~~--g~~l~i~~~~~~~ 973 (1213)
.+.+++||-- -|.+++|.+..+. ..+..-..+.+|++++|.-- ++.++++. ..++.+|+...++
T Consensus 454 ~~~~vaVGG~------------Dgkvhvysl~g~~-l~ee~~~~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~ 520 (603)
T KOG0318|consen 454 DGSEVAVGGQ------------DGKVHVYSLSGDE-LKEEAKLLEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASRE 520 (603)
T ss_pred CCCEEEEecc------------cceEEEEEecCCc-ccceeeeecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCc
Confidence 4689999963 3679999998752 23344556789999998765 34555544 3589999998665
Q ss_pred e--eceeeecCccceEEEEEEe--CCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCc
Q 000944 974 L--LRKCENKLFPNTIVSINTY--RDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGN 1049 (1213)
Q Consensus 974 l--~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gn 1049 (1213)
. .+.+|.. ..|.++... +..+.-|-+-..|.++..+. +.+- ..+++.++..++.+..+|+.+++.+-.+-|
T Consensus 521 ~~~~~w~FHt---akI~~~aWsP~n~~vATGSlDt~Viiysv~k-P~~~-i~iknAH~~gVn~v~wlde~tvvSsG~Da~ 595 (603)
T KOG0318|consen 521 VKTNRWAFHT---AKINCVAWSPNNKLVATGSLDTNVIIYSVKK-PAKH-IIIKNAHLGGVNSVAWLDESTVVSSGQDAN 595 (603)
T ss_pred eecceeeeee---eeEEEEEeCCCceEEEeccccceEEEEEccC-hhhh-eEeccccccCceeEEEecCceEEeccCcce
Confidence 4 2334432 356666643 44556677777888877754 3233 778999999999999999999998888888
Q ss_pred EEEEec
Q 000944 1050 IYFVRL 1055 (1213)
Q Consensus 1050 l~il~~ 1055 (1213)
|.+...
T Consensus 596 iK~W~v 601 (603)
T KOG0318|consen 596 IKVWNV 601 (603)
T ss_pred eEEecc
Confidence 887654
No 9
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=97.57 E-value=0.01 Score=63.27 Aligned_cols=191 Identities=19% Similarity=0.171 Sum_probs=99.5
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEE
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL 660 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~L 660 (1213)
+..|++|++.+. ...++++|.|||+|++|.+..+..+..-..+.++..+=+ +.+.+ +...+
T Consensus 27 ~DsIS~l~FSP~-----~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~---v~Wsd-----------dgskV 87 (347)
T KOG0647|consen 27 EDSISALAFSPQ-----ADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLD---VCWSD-----------DGSKV 87 (347)
T ss_pred ccchheeEeccc-----cCceEEecccCCceEEEEEecCCcccchhhhccCCCeEE---EEEcc-----------CCceE
Confidence 356899999763 356788999999999999954334443222333222222 23322 34668
Q ss_pred EEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEec--CccEEEEEeCCeEEEEecCccccceee
Q 000944 661 NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLS--SRPWLGYIHRGRFLLTPLSYETLEYAA 738 (1213)
Q Consensus 661 ligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g--~~p~~i~~~~~~~~~~~~~~~~v~~~~ 738 (1213)
+.|.-||.+-.|.+. ++++. ..-.=..||+-.++--+...-.+++| |+++-++.-|..--++.+...+-....
T Consensus 88 f~g~~Dk~~k~wDL~--S~Q~~---~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~pv~t~~LPeRvYa~ 162 (347)
T KOG0647|consen 88 FSGGCDKQAKLWDLA--SGQVS---QVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNPVATLQLPERVYAA 162 (347)
T ss_pred EeeccCCceEEEEcc--CCCee---eeeecccceeEEEEecCCCcceeEecccccceeecccCCCCeeeeeeccceeeeh
Confidence 899999999887775 34332 12222467776553211112234444 333333332222122222221111111
Q ss_pred ccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEcc
Q 000944 739 SFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETD 798 (1213)
Q Consensus 739 ~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~ 798 (1213)
.. -.|-..+...+..|.+..|..--..+.--.=||+...|.|+-+++...+++...+
T Consensus 163 Dv---~~pm~vVata~r~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiE 219 (347)
T KOG0647|consen 163 DV---LYPMAVVATAERHIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIE 219 (347)
T ss_pred hc---cCceeEEEecCCcEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeec
Confidence 10 0122233344778999998651011222223677777888888888888777543
No 10
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=97.54 E-value=0.014 Score=62.71 Aligned_cols=182 Identities=14% Similarity=0.222 Sum_probs=101.5
Q ss_pred cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEE
Q 000944 582 GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLN 661 (1213)
Q Consensus 582 ~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Ll 661 (1213)
..||.+.+.+ .+..|+|++|||++++|++ +...+...-.. -.-++=+.+. +....+
T Consensus 14 d~IS~v~f~~------~~~~LLvssWDgslrlYdv-~~~~l~~~~~~-----~~plL~c~F~------------d~~~~~ 69 (323)
T KOG1036|consen 14 DGISSVKFSP------SSSDLLVSSWDGSLRLYDV-PANSLKLKFKH-----GAPLLDCAFA------------DESTIV 69 (323)
T ss_pred hceeeEEEcC------cCCcEEEEeccCcEEEEec-cchhhhhheec-----CCceeeeecc------------CCceEE
Confidence 3478887753 3567888899999999999 43222211110 0111122332 345688
Q ss_pred EEeeCCeEEEEEEeCCCCcccccceeeecC--CCCeEEEEEECCeeEEEEec-CccEEEEEeCCeEEEEecCccccceee
Q 000944 662 AGLQNGVLFRTVVDMVTGQLSDSRSRFLGL--RPPKLFSVVVGGRAAMLCLS-SRPWLGYIHRGRFLLTPLSYETLEYAA 738 (1213)
Q Consensus 662 igl~~G~l~~~~~~~~~~~l~~~~~~~lG~--~pv~l~~~~~~~~~~v~~~g-~~p~~i~~~~~~~~~~~~~~~~v~~~~ 738 (1213)
.|.-||.+..|.++..+ ..++|+ .|++..... -...+++..| ++..-++..+++...-.. +.-..+
T Consensus 70 ~G~~dg~vr~~Dln~~~-------~~~igth~~~i~ci~~~-~~~~~vIsgsWD~~ik~wD~R~~~~~~~~--d~~kkV- 138 (323)
T KOG1036|consen 70 TGGLDGQVRRYDLNTGN-------EDQIGTHDEGIRCIEYS-YEVGCVISGSWDKTIKFWDPRNKVVVGTF--DQGKKV- 138 (323)
T ss_pred EeccCceEEEEEecCCc-------ceeeccCCCceEEEEee-ccCCeEEEcccCccEEEEecccccccccc--ccCceE-
Confidence 99999999999986542 224443 455544311 1222344433 444444444331111111 101111
Q ss_pred ccccCCCCceEEE-EeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEccC
Q 000944 739 SFSSDQCVEGVVS-VAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQ 799 (1213)
Q Consensus 739 ~f~~~~~~~~~i~-~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~ 799 (1213)
|......+-++. ..+..+.+..+..++..+..|.=+|++..|.|+..|...-|++.+.+-
T Consensus 139 -y~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~~pn~eGy~~sSieG 199 (323)
T KOG1036|consen 139 -YCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKYQTRCVALVPNGEGYVVSSIEG 199 (323)
T ss_pred -EEEeccCCEEEEeecCceEEEEEcccccchhhhccccceeEEEEEEEecCCCceEEEeecc
Confidence 111112344554 345677788888776666677778888999999999777788887654
No 11
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.48 E-value=0.36 Score=55.77 Aligned_cols=100 Identities=15% Similarity=0.254 Sum_probs=62.6
Q ss_pred eCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEe-CCcEEEEEeCCCCceeEeEEeecCCCCc
Q 000944 557 SGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSPPE 635 (1213)
Q Consensus 557 s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~-~~~i~i~sl~p~~~l~~~~~~~l~~~p~ 635 (1213)
.++.|..+.++.+|++..+...+....+..+++.+ ..++++++.+ ++.+.+|.++.+..+..+........|.
T Consensus 10 ~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~sp------d~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~ 83 (330)
T PRK11028 10 ESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISP------DKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPT 83 (330)
T ss_pred CCCCEEEEEECCCCceeeeeEEecCCCCccEEECC------CCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCce
Confidence 46788888887557777666665556677888864 3567888877 7889999996444455444333333454
Q ss_pred eeEEEEeecccCCCCCCCCCCceEEEE-EeeCCeEEEEEEeC
Q 000944 636 SLLFLEVQASVGGEDGADHPASLFLNA-GLQNGVLFRTVVDM 676 (1213)
Q Consensus 636 Sl~~~~~~~~~~~~~~~~~~~~~~Lli-gl~~G~l~~~~~~~ 676 (1213)
.+.+. ..++ +|++ ...+|.+..|.++.
T Consensus 84 ~i~~~-------------~~g~-~l~v~~~~~~~v~v~~~~~ 111 (330)
T PRK11028 84 HISTD-------------HQGR-FLFSASYNANCVSVSPLDK 111 (330)
T ss_pred EEEEC-------------CCCC-EEEEEEcCCCeEEEEEECC
Confidence 43321 1133 3444 44578888888753
No 12
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=97.47 E-value=0.24 Score=53.73 Aligned_cols=149 Identities=14% Similarity=0.242 Sum_probs=94.3
Q ss_pred CceEEEEe--C-CeEEEEEEccCCCeeEEEEEeCC----CccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhc
Q 000944 746 VEGVVSVA--G-NALRVFTIERLGETFNETALPLR----YTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAA 818 (1213)
Q Consensus 746 ~~~~i~~~--~-~~L~i~~l~~~~~~~~~r~i~l~----~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 818 (1213)
|.|++++. + +.+++..+-.++ +=|.+++.+. ..-..|-|+|.-|.+++.+.
T Consensus 150 p~GLifA~~~~~~~IkLyD~Rs~d-kgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~--------------------- 207 (311)
T KOG1446|consen 150 PEGLIFALANGSELIKLYDLRSFD-KGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTN--------------------- 207 (311)
T ss_pred CCCcEEEEecCCCeEEEEEecccC-CCCceeEccCCCCccceeeeEEcCCCCEEEEEeC---------------------
Confidence 57888764 3 478888888874 4566777665 24556777887777666541
Q ss_pred CCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCc
Q 000944 819 GMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHG 898 (1213)
Q Consensus 819 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~ 898 (1213)
.+.+.++|.-+++.+.+++..+++--..+ ...|. ...
T Consensus 208 ----------------------------------------~s~~~~lDAf~G~~~~tfs~~~~~~~~~~-~a~ft--Pds 244 (311)
T KOG1446|consen 208 ----------------------------------------ASFIYLLDAFDGTVKSTFSGYPNAGNLPL-SATFT--PDS 244 (311)
T ss_pred ----------------------------------------CCcEEEEEccCCcEeeeEeeccCCCCcce-eEEEC--CCC
Confidence 13578888888888899988877653222 22232 223
Q ss_pred eEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeE--EEEeCCeEEEEecCCcee
Q 000944 899 TLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRL--LAGIGPVLRLYDLGKKRL 974 (1213)
Q Consensus 899 ~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~l--l~~~g~~l~i~~~~~~~l 974 (1213)
+|++.|.. .|||++|++..+ +-..+....-.|++.++. ||.+. .++...++.+|-.+..++
T Consensus 245 ~Fvl~gs~------------dg~i~vw~~~tg--~~v~~~~~~~~~~~~~~~-fnP~~~mf~sa~s~l~fw~p~~~~~ 307 (311)
T KOG1446|consen 245 KFVLSGSD------------DGTIHVWNLETG--KKVAVLRGPNGGPVSCVR-FNPRYAMFVSASSNLVFWLPDEDAL 307 (311)
T ss_pred cEEEEecC------------CCcEEEEEcCCC--cEeeEecCCCCCCccccc-cCCceeeeeecCceEEEEecccccc
Confidence 67766653 599999999654 222222222356666665 87643 556667888887765443
No 13
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.47 E-value=0.48 Score=56.98 Aligned_cols=365 Identities=15% Similarity=0.181 Sum_probs=189.1
Q ss_pred cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEE
Q 000944 582 GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLN 661 (1213)
Q Consensus 582 ~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Ll 661 (1213)
.+|++++... .+.++++|+.|-+.+++.+++.+.+.+ ..+...-..+.-..++ .+...|+
T Consensus 146 ddi~si~Ws~------DSr~l~~gsrD~s~rl~~v~~~k~~~~---~~l~gHkd~VvacfF~-----------~~~~~l~ 205 (893)
T KOG0291|consen 146 DDITSIDWSD------DSRLLVTGSRDLSARLFGVDGNKNLFT---YALNGHKDYVVACFFG-----------ANSLDLY 205 (893)
T ss_pred cceeEEEecc------CCceEEeccccceEEEEEeccccccce---EeccCCCcceEEEEec-----------cCcceEE
Confidence 5788998853 588999999999999999976544432 2232222223222332 2456688
Q ss_pred EEeeCCeEEEEEEeCCCCcccc----------------------------cceeeecCCCCeEEEEEECCeeEEEEec--
Q 000944 662 AGLQNGVLFRTVVDMVTGQLSD----------------------------SRSRFLGLRPPKLFSVVVGGRAAMLCLS-- 711 (1213)
Q Consensus 662 igl~~G~l~~~~~~~~~~~l~~----------------------------~~~~~lG~~pv~l~~~~~~~~~~v~~~g-- 711 (1213)
.-.+||.+++|..+........ .++.++-..++++......-...+++.|
T Consensus 206 tvskdG~l~~W~~~~~P~~~~~~~kd~eg~~d~~~~~~~Eek~~~~~~~k~~k~~ln~~~~kvtaa~fH~~t~~lvvgFs 285 (893)
T KOG0291|consen 206 TVSKDGALFVWTCDLRPPELDKAEKDEEGSDDEEMDEDGEEKTHKIFWYKTKKHYLNQNSSKVTAAAFHKGTNLLVVGFS 285 (893)
T ss_pred EEecCceEEEEEecCCCcccccccccccccccccccccchhhhcceEEEEEEeeeecccccceeeeeccCCceEEEEEec
Confidence 9999999999998722111000 0111222333444433222222333332
Q ss_pred CccEEEEEeCCeEEEEecCc--cccceeeccccCCCCceEEEEeC--CeEEEEEEccCCCeeEEEEEeCCCccceeeecC
Q 000944 712 SRPWLGYIHRGRFLLTPLSY--ETLEYAASFSSDQCVEGVVSVAG--NALRVFTIERLGETFNETALPLRYTPRRFVLQP 787 (1213)
Q Consensus 712 ~~p~~i~~~~~~~~~~~~~~--~~v~~~~~f~~~~~~~~~i~~~~--~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~ 787 (1213)
+.-+.+|.-.+.--+|.++. ++|.++ .|+.. .+-+.+-+. ++|.+-.... +.+..+.-.=-.....++|+|
T Consensus 286 sG~f~LyelP~f~lih~LSis~~~I~t~-~~N~t--GDWiA~g~~klgQLlVweWqs--EsYVlKQQgH~~~i~~l~YSp 360 (893)
T KOG0291|consen 286 SGEFGLYELPDFNLIHSLSISDQKILTV-SFNST--GDWIAFGCSKLGQLLVWEWQS--ESYVLKQQGHSDRITSLAYSP 360 (893)
T ss_pred CCeeEEEecCCceEEEEeecccceeeEE-Eeccc--CCEEEEcCCccceEEEEEeec--cceeeeccccccceeeEEECC
Confidence 33333444333334444432 233332 22221 233333333 2444443332 111111110011234677777
Q ss_pred CCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeC
Q 000944 788 KKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDP 867 (1213)
Q Consensus 788 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~ 867 (1213)
+....+-.+.+ +.++++|.
T Consensus 361 Dgq~iaTG~eD-------------------------------------------------------------gKVKvWn~ 379 (893)
T KOG0291|consen 361 DGQLIATGAED-------------------------------------------------------------GKVKVWNT 379 (893)
T ss_pred CCcEEEeccCC-------------------------------------------------------------CcEEEEec
Confidence 77665544311 14677776
Q ss_pred CCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcce
Q 000944 868 RSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPL 947 (1213)
Q Consensus 868 ~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ 947 (1213)
.++-++.+ |.++- ..+..++|.. ....++..+. -|++-.|+++.- +-.+ ....+.|+.
T Consensus 380 ~SgfC~vT--FteHt--s~Vt~v~f~~---~g~~llssSL-----------DGtVRAwDlkRY-rNfR---Tft~P~p~Q 437 (893)
T KOG0291|consen 380 QSGFCFVT--FTEHT--SGVTAVQFTA---RGNVLLSSSL-----------DGTVRAWDLKRY-RNFR---TFTSPEPIQ 437 (893)
T ss_pred cCceEEEE--eccCC--CceEEEEEEe---cCCEEEEeec-----------CCeEEeeeeccc-ceee---eecCCCcee
Confidence 55444433 33222 2223333432 2344455543 478889998753 1122 223345553
Q ss_pred Eecc-c--cCeEEEEeCC---eEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEe
Q 000944 948 ALCQ-F--QGRLLAGIGP---VLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIF 1021 (1213)
Q Consensus 948 ai~~-~--~g~ll~~~g~---~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~ 1021 (1213)
.-|. . .|-|++|-++ .|++|.++..+|+..-.-+--|..-.+++..++.++-|---+-|-++..-....+.+.+
T Consensus 438 fscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkTVRiW~if~s~~~vEtl 517 (893)
T KOG0291|consen 438 FSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKTVRIWDIFSSSGTVETL 517 (893)
T ss_pred eeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccceEEEEEeeccCceeeeE
Confidence 3222 2 2555555444 57889999888877644432243334567788888888777777776543333344443
Q ss_pred eccCCCcceEEEEe-ecCCeeeeecCCCcEEEEecCC
Q 000944 1022 ADDSVPRWLTAAHH-IDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1022 a~D~~~~~~~~~~~-ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
. ....++++.| -|...+.++.-+|+|.+|+...
T Consensus 518 ~---i~sdvl~vsfrPdG~elaVaTldgqItf~d~~~ 551 (893)
T KOG0291|consen 518 E---IRSDVLAVSFRPDGKELAVATLDGQITFFDIKE 551 (893)
T ss_pred e---eccceeEEEEcCCCCeEEEEEecceEEEEEhhh
Confidence 3 2344666666 4666899999999999998654
No 14
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL, EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.31 E-value=0.3 Score=56.78 Aligned_cols=226 Identities=16% Similarity=0.238 Sum_probs=128.4
Q ss_pred cCCEEEEEEe----CCEEEEEEEccC-CCeEEeeeec-cCcceEEEEeeecCCCceeeeEEEEEEe-CCcEEEEEeCCCC
Q 000944 548 NRLQVVIALS----GGELIYFEVDMT-GQLLEVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDD 620 (1213)
Q Consensus 548 ~~~~v~v~~s----~~~l~~l~~~~~-~~l~~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~-~~~i~i~sl~p~~ 620 (1213)
.+.++.++.+ .+.+..|.++++ |+|.++.+.. .......+++.+ ...+++++.+ +|++.++.++++.
T Consensus 47 ~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~------~g~~l~vany~~g~v~v~~l~~~g 120 (345)
T PF10282_consen 47 DGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDP------DGRFLYVANYGGGSVSVFPLDDDG 120 (345)
T ss_dssp TSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECT------TSSEEEEEETTTTEEEEEEECTTS
T ss_pred CCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEec------CCCEEEEEEccCCeEEEEEccCCc
Confidence 4555555544 458999999987 8888876665 455667787754 3667888876 8999999997664
Q ss_pred ceeEeE-Eeec-----------CCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccce--
Q 000944 621 CMQILS-VQSV-----------SSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS-- 686 (1213)
Q Consensus 621 ~l~~~~-~~~l-----------~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~-- 686 (1213)
.+.... .... ...|+++.+. ..++..+.+-++...+..|.++...+.+.....
T Consensus 121 ~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~-------------pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~ 187 (345)
T PF10282_consen 121 SLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFS-------------PDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIK 187 (345)
T ss_dssp EEEEEEEEEESEEEESSTTTTSSTCEEEEEE--------------TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEE
T ss_pred ccceeeeecccCCCCCcccccccccceeEEEC-------------CCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccc
Confidence 453321 1110 0123443221 124555566888889999999876655543221
Q ss_pred eeecCCCCeEEEEEECCeeEEEEecCcc--EEEEE-e--CCeEEEEe---cCccccc---eeeccccCCCC-ceEEEEe-
Q 000944 687 RFLGLRPPKLFSVVVGGRAAMLCLSSRP--WLGYI-H--RGRFLLTP---LSYETLE---YAASFSSDQCV-EGVVSVA- 753 (1213)
Q Consensus 687 ~~lG~~pv~l~~~~~~~~~~v~~~g~~p--~~i~~-~--~~~~~~~~---~~~~~v~---~~~~f~~~~~~-~~~i~~~- 753 (1213)
...|..|-.+. +.- +...+++.++.. ..++. . .+.+.... ..-.... ..+.+.. .| ..++|++
T Consensus 188 ~~~G~GPRh~~-f~p-dg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~i--spdg~~lyvsn 263 (345)
T PF10282_consen 188 VPPGSGPRHLA-FSP-DGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAI--SPDGRFLYVSN 263 (345)
T ss_dssp CSTTSSEEEEE-E-T-TSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE---TTSSEEEEEE
T ss_pred cccCCCCcEEE-EcC-CcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEE--ecCCCEEEEEe
Confidence 23455565544 211 334455554333 22332 2 34433221 1100010 1111111 12 3488886
Q ss_pred --CCeEEEEEEccC-CCeeEEEEEeC-CCccceeeecCCCceEEEEE
Q 000944 754 --GNALRVFTIERL-GETFNETALPL-RYTPRRFVLQPKKKLMVIIE 796 (1213)
Q Consensus 754 --~~~L~i~~l~~~-~~~~~~r~i~l-~~tp~~i~y~~~~~~~~v~~ 796 (1213)
.+.+.+..++.. +.--.+..++. +..||.++.+|..+.++|+.
T Consensus 264 r~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~ 310 (345)
T PF10282_consen 264 RGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVAN 310 (345)
T ss_dssp CTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEE
T ss_pred ccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEe
Confidence 579999999642 23345677888 56799999999999888875
No 15
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=97.25 E-value=0.61 Score=53.55 Aligned_cols=383 Identities=17% Similarity=0.182 Sum_probs=201.6
Q ss_pred EEEEEec--CCEEEEEEeCCEEEEEEEccCCCeEEe----eeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 542 IVKVGSN--RLQVVIALSGGELIYFEVDMTGQLLEV----EKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 542 I~~as~~--~~~v~v~~s~~~l~~l~~~~~~~l~~~----~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
+..|+.. ++.++|....+.+.++.... |.|.+- ++.+- +-|.|+++.+-+ =++.|..+|.|.||+
T Consensus 203 v~~a~FHPtd~nliit~Gk~H~~Fw~~~~-~~l~k~~~~fek~ek-k~Vl~v~F~eng-------dviTgDS~G~i~Iw~ 273 (626)
T KOG2106|consen 203 VFLATFHPTDPNLIITCGKGHLYFWTLRG-GSLVKRQGIFEKREK-KFVLCVTFLENG-------DVITGDSGGNILIWS 273 (626)
T ss_pred EEEEEeccCCCcEEEEeCCceEEEEEccC-CceEEEeeccccccc-eEEEEEEEcCCC-------CEEeecCCceEEEEe
Confidence 5556654 67888887777888885542 344432 22333 668999987522 256777789999998
Q ss_pred eCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEE-------EEeCCCCcccccc-ee
Q 000944 616 LDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRT-------VVDMVTGQLSDSR-SR 687 (1213)
Q Consensus 616 l~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~-------~~~~~~~~l~~~~-~~ 687 (1213)
-.. -. ++.+. . . ..+..+-+|.+++|.|+.= .+|..=..+.+.+ ..
T Consensus 274 ~~~---~~-~~k~~----------~---a---------H~ggv~~L~~lr~GtllSGgKDRki~~Wd~~y~k~r~~elPe 327 (626)
T KOG2106|consen 274 KGT---NR-ISKQV----------H---A---------HDGGVFSLCMLRDGTLLSGGKDRKIILWDDNYRKLRETELPE 327 (626)
T ss_pred CCC---ce-EEeEe----------e---e---------cCCceEEEEEecCccEeecCccceEEeccccccccccccCch
Confidence 631 11 22111 1 1 2356777888888888641 1110000000000 00
Q ss_pred eecCCCCeEEEEEECCeeEEEEecCccEEEEE-eCCeEEEEecC-ccccceeeccccCCCCceEEEEe-CCeEEEEEEcc
Q 000944 688 FLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYI-HRGRFLLTPLS-YETLEYAASFSSDQCVEGVVSVA-GNALRVFTIER 764 (1213)
Q Consensus 688 ~lG~~pv~l~~~~~~~~~~v~~~g~~p~~i~~-~~~~~~~~~~~-~~~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~ 764 (1213)
..| |++-. ..+...+++-..|-.++.. -++.+...-.. .+.+-.++. ....+-++-.. ++.+.+=.
T Consensus 328 ~~G--~iRtv---~e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~---hps~~q~~T~gqdk~v~lW~--- 396 (626)
T KOG2106|consen 328 QFG--PIRTV---AEGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLAT---HPSKNQLLTCGQDKHVRLWN--- 396 (626)
T ss_pred hcC--CeeEE---ecCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEc---CCChhheeeccCcceEEEcc---
Confidence 111 23322 1234446665666666653 22332222111 111111111 00112233222 33444333
Q ss_pred CCCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCC
Q 000944 765 LGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLS 844 (1213)
Q Consensus 765 ~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 844 (1213)
+.+ +.-+..+.+..+.+.+||.. .+++.+..
T Consensus 397 -~~k-~~wt~~~~d~~~~~~fhpsg-~va~Gt~~---------------------------------------------- 427 (626)
T KOG2106|consen 397 -DHK-LEWTKIIEDPAECADFHPSG-VVAVGTAT---------------------------------------------- 427 (626)
T ss_pred -CCc-eeEEEEecCceeEeeccCcc-eEEEeecc----------------------------------------------
Confidence 222 33444567788889999976 65555421
Q ss_pred ccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEE
Q 000944 845 DEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHI 924 (1213)
Q Consensus 845 ~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v 924 (1213)
+...++|.++...+.. .-+ +|.+.+|.. . ....|++||+- -+.||+
T Consensus 428 ---------------G~w~V~d~e~~~lv~~-~~d-~~~ls~v~y---s--p~G~~lAvgs~------------d~~iyi 473 (626)
T KOG2106|consen 428 ---------------GRWFVLDTETQDLVTI-HTD-NEQLSVVRY---S--PDGAFLAVGSH------------DNHIYI 473 (626)
T ss_pred ---------------ceEEEEecccceeEEE-Eec-CCceEEEEE---c--CCCCEEEEecC------------CCeEEE
Confidence 1335566665333332 223 788777654 2 33589999996 357999
Q ss_pred EEEEeCCceEEEEEEEeecCcceEeccc-cCeEEEEeC--CeEEEEecCCc---------ee-eceeeec------Cccc
Q 000944 925 YRFVEEGKSLELLHKTQVEGIPLALCQF-QGRLLAGIG--PVLRLYDLGKK---------RL-LRKCENK------LFPN 985 (1213)
Q Consensus 925 ~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~ll~~~g--~~l~i~~~~~~---------~l-~~~~~~~------~~~~ 985 (1213)
|+++.+|+|++.+.+..- .|++.+.-- ++..+.+.- -.|..|....- ++ .-.|.+- +.++
T Consensus 474 y~Vs~~g~~y~r~~k~~g-s~ithLDwS~Ds~~~~~~S~d~eiLyW~~~~~~~~ts~kDvkW~t~~c~lGF~v~g~s~~t 552 (626)
T KOG2106|consen 474 YRVSANGRKYSRVGKCSG-SPITHLDWSSDSQFLVSNSGDYEILYWKPSECKQITSVKDVKWATYTCTLGFEVFGGSDGT 552 (626)
T ss_pred EEECCCCcEEEEeeeecC-ceeEEeeecCCCceEEeccCceEEEEEccccCcccceecceeeeeeEEEEEEEEecccCCc
Confidence 999999999988887666 777877543 455555443 35555633211 11 1111110 0011
Q ss_pred --eEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCC-eeeeecCCCcEEEEe
Q 000944 986 --TIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD-TMAGADKFGNIYFVR 1054 (1213)
Q Consensus 986 --~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~-~~l~~D~~gnl~il~ 1054 (1213)
.+++-+-.++.+..||..-=+.+++|--...+-.-.-.--+.+.++++.|+-++ .++.+-++-.|+..+
T Consensus 553 ~i~a~~rs~~~~~lA~gdd~g~v~lf~yPc~s~rA~~he~~ghs~~vt~V~Fl~~d~~li~tg~D~Si~qW~ 624 (626)
T KOG2106|consen 553 DINAVARSHCEKLLASGDDFGKVHLFSYPCSSPRAPSHEYGGHSSHVTNVAFLCKDSHLISTGKDTSIMQWR 624 (626)
T ss_pred hHHHhhhhhhhhhhhccccCceEEEEccccCCCcccceeeccccceeEEEEEeeCCceEEecCCCceEEEEE
Confidence 122223357888999999999999984322221111122356778899998666 445544666665543
No 16
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=97.24 E-value=0.65 Score=53.66 Aligned_cols=303 Identities=13% Similarity=0.158 Sum_probs=164.4
Q ss_pred EEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeec
Q 000944 551 QVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSV 630 (1213)
Q Consensus 551 ~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l 630 (1213)
.|-|. .+|.|-||..+...-+..+ .-=.+.|+|+++.+ ...+++.|..||.|.-|.+.....- .+
T Consensus 293 lItVS-l~G~in~ln~~d~~~~~~i--~GHnK~ITaLtv~~------d~~~i~SgsyDG~I~~W~~~~g~~~------~~ 357 (603)
T KOG0318|consen 293 LITVS-LSGTINYLNPSDPSVLKVI--SGHNKSITALTVSP------DGKTIYSGSYDGHINSWDSGSGTSD------RL 357 (603)
T ss_pred EEEEE-cCcEEEEecccCCChhhee--cccccceeEEEEcC------CCCEEEeeccCceEEEEecCCcccc------cc
Confidence 33334 4778888875532211111 01135689999864 3478999999999888887422111 11
Q ss_pred CCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEe
Q 000944 631 SSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCL 710 (1213)
Q Consensus 631 ~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~ 710 (1213)
...++.-.+..|... +...+.-+|..| .|-+..+. .+.........+|.+|.-+...+ +...+++.
T Consensus 358 ~g~~h~nqI~~~~~~---------~~~~~~t~g~Dd-~l~~~~~~--~~~~t~~~~~~lg~QP~~lav~~--d~~~avv~ 423 (603)
T KOG0318|consen 358 AGKGHTNQIKGMAAS---------ESGELFTIGWDD-TLRVISLK--DNGYTKSEVVKLGSQPKGLAVLS--DGGTAVVA 423 (603)
T ss_pred ccccccceEEEEeec---------CCCcEEEEecCC-eEEEEecc--cCcccccceeecCCCceeEEEcC--CCCEEEEE
Confidence 111222233343321 112333444443 33333321 11223333457999998766422 33455566
Q ss_pred cCccEEEEEeCCeEEEEecCccc-cceeeccccCCCCceEEEE--eCCeEEEEEEccCCCeeEEEEEeCCCccceeeecC
Q 000944 711 SSRPWLGYIHRGRFLLTPLSYET-LEYAASFSSDQCVEGVVSV--AGNALRVFTIERLGETFNETALPLRYTPRRFVLQP 787 (1213)
Q Consensus 711 g~~p~~i~~~~~~~~~~~~~~~~-v~~~~~f~~~~~~~~~i~~--~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~ 787 (1213)
+..-.++..+.+++...|+.+++ ..+++|- ...+.+ .++.+.+.++......=-.+.++.++.+..|+|+|
T Consensus 424 ~~~~iv~l~~~~~~~~~~~~y~~s~vAv~~~------~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySp 497 (603)
T KOG0318|consen 424 CISDIVLLQDQTKVSSIPIGYESSAVAVSPD------GSEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSP 497 (603)
T ss_pred ecCcEEEEecCCcceeeccccccceEEEcCC------CCEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECC
Confidence 66666677766777766765542 2222221 233444 35689999998732122346778889999999999
Q ss_pred CCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeC
Q 000944 788 KKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDP 867 (1213)
Q Consensus 788 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~ 867 (1213)
+...+++.=- .+ .+.++|-
T Consensus 498 d~~yla~~Da-~r------------------------------------------------------------kvv~yd~ 516 (603)
T KOG0318|consen 498 DGAYLAAGDA-SR------------------------------------------------------------KVVLYDV 516 (603)
T ss_pred CCcEEEEecc-CC------------------------------------------------------------cEEEEEc
Confidence 8876555410 00 1233333
Q ss_pred CCCceE-EEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcc
Q 000944 868 RSANTT-CLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIP 946 (1213)
Q Consensus 868 ~~~~~~-~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V 946 (1213)
.+.++. ...-|- --+|.|| .|+- +..+++-|. ... .+++|++++.... +..+....+.|
T Consensus 517 ~s~~~~~~~w~FH-takI~~~-aWsP----~n~~vATGS--lDt----------~Viiysv~kP~~~--i~iknAH~~gV 576 (603)
T KOG0318|consen 517 ASREVKTNRWAFH-TAKINCV-AWSP----NNKLVATGS--LDT----------NVIIYSVKKPAKH--IIIKNAHLGGV 576 (603)
T ss_pred ccCceecceeeee-eeeEEEE-EeCC----CceEEEecc--ccc----------eEEEEEccChhhh--eEeccccccCc
Confidence 332221 011111 1244454 3442 334544443 222 4889999987534 33334456669
Q ss_pred eEeccccCeEEEEeCC--eEEEEec
Q 000944 947 LALCQFQGRLLAGIGP--VLRLYDL 969 (1213)
Q Consensus 947 ~ai~~~~g~ll~~~g~--~l~i~~~ 969 (1213)
+++.-++..-+++.|+ .|++|.+
T Consensus 577 n~v~wlde~tvvSsG~Da~iK~W~v 601 (603)
T KOG0318|consen 577 NSVAWLDESTVVSSGQDANIKVWNV 601 (603)
T ss_pred eeEEEecCceEEeccCcceeEEecc
Confidence 9999999999999987 6777765
No 17
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE (InterPro IPR022528) and the Pseudomonas aeruginosa DevB (InterPro IPR005900) types. This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL, EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=97.21 E-value=0.73 Score=53.57 Aligned_cols=287 Identities=16% Similarity=0.183 Sum_probs=152.1
Q ss_pred EEEEEeC----CcEEEEEeCCC-CceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEee----CCeEEEE
Q 000944 602 LAVGSYD----NTIRILSLDPD-DCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQ----NGVLFRT 672 (1213)
Q Consensus 602 l~v~~~~----~~i~i~sl~p~-~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~----~G~l~~~ 672 (1213)
++||+++ +.|+.|.++.+ ..|..+........|.-+.+. ....+|++... +|.+..|
T Consensus 2 ~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~--------------~~~~~LY~~~e~~~~~g~v~~~ 67 (345)
T PF10282_consen 2 LYVGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVS--------------PDGRRLYVVNEGSGDSGGVSSY 67 (345)
T ss_dssp EEEEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE---------------TTSSEEEEEETTSSTTTEEEEE
T ss_pred EEEEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEE--------------eCCCEEEEEEccccCCCCEEEE
Confidence 6788885 69999999554 355554432223345555432 13456667766 5799999
Q ss_pred EEeCCCCcccccceee-ecCCCCeEEEEEECCeeEEEEe--cCccEEEEE--eCCeEEEEecCc--c-----cccee-ec
Q 000944 673 VVDMVTGQLSDSRSRF-LGLRPPKLFSVVVGGRAAMLCL--SSRPWLGYI--HRGRFLLTPLSY--E-----TLEYA-AS 739 (1213)
Q Consensus 673 ~~~~~~~~l~~~~~~~-lG~~pv~l~~~~~~~~~~v~~~--g~~p~~i~~--~~~~~~~~~~~~--~-----~v~~~-~~ 739 (1213)
.++..++++....... .|..|+.+.- . .....++++ ++....++. ..|.+.-..-.. . +-... ..
T Consensus 68 ~i~~~~g~L~~~~~~~~~g~~p~~i~~-~-~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h 145 (345)
T PF10282_consen 68 RIDPDTGTLTLLNSVPSGGSSPCHIAV-D-PDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPH 145 (345)
T ss_dssp EEETTTTEEEEEEEEEESSSCEEEEEE-C-TTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTC
T ss_pred EECCCcceeEEeeeeccCCCCcEEEEE-e-cCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCccccccccc
Confidence 9998767765544333 6667766552 1 123344444 333333443 224433221000 0 00000 00
Q ss_pred cccCC-CC-ceEEEEe---CCeEEEEEEccCCCee---EEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHH
Q 000944 740 FSSDQ-CV-EGVVSVA---GNALRVFTIERLGETF---NETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAK 811 (1213)
Q Consensus 740 f~~~~-~~-~~~i~~~---~~~L~i~~l~~~~~~~---~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~ 811 (1213)
.|... -| ..++++. .+.+.+..++.-...+ ..-++|.+.-||+++++|+.+.++|++...+
T Consensus 146 ~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~----------- 214 (345)
T PF10282_consen 146 PHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSN----------- 214 (345)
T ss_dssp EEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTT-----------
T ss_pred ceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCC-----------
Confidence 11111 13 3467765 5799999998731112 3346788999999999999888777752100
Q ss_pred HHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCC--CceEEEEEcC----CCc-eE
Q 000944 812 KECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQ----DNE-AA 884 (1213)
Q Consensus 812 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~--~~~~~~~~~~----~~E-~v 884 (1213)
. ...+.+ ++.+ ++.+.....- .++ ..
T Consensus 215 ---------------------------------------------~-v~v~~~-~~~~g~~~~~~~~~~~~~~~~~~~~~ 247 (345)
T PF10282_consen 215 ---------------------------------------------T-VSVFDY-DPSDGSLTEIQTISTLPEGFTGENAP 247 (345)
T ss_dssp ---------------------------------------------E-EEEEEE-ETTTTEEEEEEEEESCETTSCSSSSE
T ss_pred ---------------------------------------------c-EEEEee-cccCCceeEEEEeeeccccccccCCc
Confidence 0 011111 1111 2222222211 111 22
Q ss_pred EEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCc-ceEec--cccCeEEEEeC
Q 000944 885 FSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGI-PLALC--QFQGRLLAGIG 961 (1213)
Q Consensus 885 ~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~-V~ai~--~~~g~ll~~~g 961 (1213)
.. +.+. ...+|+.|...- ...|.+|+++....+|+.+......|. +..+. +-+.+|+++..
T Consensus 248 ~~---i~is--pdg~~lyvsnr~-----------~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~ 311 (345)
T PF10282_consen 248 AE---IAIS--PDGRFLYVSNRG-----------SNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQ 311 (345)
T ss_dssp EE---EEE---TTSSEEEEEECT-----------TTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEET
T ss_pred ee---EEEe--cCCCEEEEEecc-----------CCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEec
Confidence 22 2233 223566666542 567899999765447888877777666 65554 35667777654
Q ss_pred --CeEEEEecC--Cceeecee
Q 000944 962 --PVLRLYDLG--KKRLLRKC 978 (1213)
Q Consensus 962 --~~l~i~~~~--~~~l~~~~ 978 (1213)
+.|.+|+++ ..+|....
T Consensus 312 ~s~~v~vf~~d~~tG~l~~~~ 332 (345)
T PF10282_consen 312 DSNTVSVFDIDPDTGKLTPVG 332 (345)
T ss_dssp TTTEEEEEEEETTTTEEEEEE
T ss_pred CCCeEEEEEEeCCCCcEEEec
Confidence 389999886 34565543
No 18
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=97.14 E-value=1.2 Score=54.54 Aligned_cols=239 Identities=16% Similarity=0.198 Sum_probs=127.2
Q ss_pred EEEEEEeccCCcccceEEeccCCCCCC-cEEEEEecCCCCeEEEEccCCcceEEEEe-cCCCCcceEEEeeeCCCCCCce
Q 000944 393 LVRIEQVESLMPIMDMRIANLFEEEAP-QIFTLCGRGPRSSLRILRPGLAVSEMAVS-QLPGVPSAVWTVKKNVNDEFDA 470 (1213)
Q Consensus 393 l~~~d~~~n~gPI~D~~~~~~~~~~~~-~lv~~sG~g~~GsL~~lr~gi~~~~~~~~-~l~~~~~~iw~l~~~~~~~~~~ 470 (1213)
++.+..+.|==| +.+... +.. .+.+|.|.. -.+..+-.+..+.-. ++| ..|-++... .+
T Consensus 27 fR~lG~vsn~VP---~~~~~~---~~~~~vtt~vgks-----fqvYd~~kl~ll~vs~~lp---~~I~alas~-----~~ 87 (910)
T KOG1539|consen 27 FRALGYVSNGVP---FRVVAL---GSTFYVTTCVGKS-----FQVYDVNKLNLLFVSKPLP---DKITALASD-----KD 87 (910)
T ss_pred hhhhceecCCCc---eeeeec---CceEEEEEecCce-----EEEEeccceEEEEecCCCC---CceEEEEec-----Cc
Confidence 567777888777 444432 122 344454422 233344444444332 455 466677542 45
Q ss_pred EEEEEecCceeEEEeccceeeecCCCccCCCCeEEEEeecCCeEEEEeCCcEEEEe-CCCc----e--eeeeCCCCccEE
Q 000944 471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIR-EDGR----I--NEWRTPGKRTIV 543 (1213)
Q Consensus 471 ~lvlS~~~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~~ivQVT~~~i~l~~-~~~~----~--~~~~~~~~~~I~ 543 (1213)
|....+.+.-.+++-++.+...... .+.++..-...+..++-++.+++..+- .... . .......+..|+
T Consensus 88 ~vy~A~g~~i~~~~rgk~i~~~~~~----~~a~v~~l~~fGe~lia~d~~~~l~vw~~s~~~~e~~l~~~~~~~~~~~It 163 (910)
T KOG1539|consen 88 YVYVASGNKIYAYARGKHIRHTTLL----HGAKVHLLLPFGEHLIAVDISNILFVWKTSSIQEELYLQSTFLKVEGDFIT 163 (910)
T ss_pred eEEEecCcEEEEEEccceEEEEecc----ccceEEEEeeecceEEEEEccCcEEEEEeccccccccccceeeeccCCcee
Confidence 7777776666666666555544321 111121111122234444433333221 1110 0 111112233243
Q ss_pred E---EEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCC
Q 000944 544 K---VGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDD 620 (1213)
Q Consensus 544 ~---as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~ 620 (1213)
+ -+-.=+-|++..+.|.+.++.+.....+.+. ..+...|||+.=.| .-.+++||+.+|+|.|+.++-++
T Consensus 164 al~HP~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f--~~~~s~IT~ieqsP------aLDVVaiG~~~G~ViifNlK~dk 235 (910)
T KOG1539|consen 164 ALLHPSTYLNKIVVGSSQGRLQLWNVRTGKVVYTF--QEFFSRITAIEQSP------ALDVVAIGLENGTVIIFNLKFDK 235 (910)
T ss_pred eEecchhheeeEEEeecCCcEEEEEeccCcEEEEe--cccccceeEeccCC------cceEEEEeccCceEEEEEcccCc
Confidence 3 3334466888788999999988743333333 33446788774433 35789999999999999997665
Q ss_pred ceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCC
Q 000944 621 CMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMV 677 (1213)
Q Consensus 621 ~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~ 677 (1213)
.+.... ++ -+...++. |. ..+.+.|..|..+|.+..|.++..
T Consensus 236 il~sFk-~d-~g~VtslS---Fr----------tDG~p~las~~~~G~m~~wDLe~k 277 (910)
T KOG1539|consen 236 ILMSFK-QD-WGRVTSLS---FR----------TDGNPLLASGRSNGDMAFWDLEKK 277 (910)
T ss_pred EEEEEE-cc-ccceeEEE---ec----------cCCCeeEEeccCCceEEEEEcCCC
Confidence 443222 11 11222332 22 236788999999999999988643
No 19
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=97.10 E-value=1.2 Score=53.88 Aligned_cols=261 Identities=16% Similarity=0.226 Sum_probs=149.6
Q ss_pred CeEEEEeecCCe--EEEEeCCcEE-EEeCC--CceeeeeCCCCccEEEEEec--CCEEEEEEeC-CEEEEEEEccCCCeE
Q 000944 502 PSLAVSLIGDDS--LMQVHPSGIR-HIRED--GRINEWRTPGKRTIVKVGSN--RLQVVIALSG-GELIYFEVDMTGQLL 573 (1213)
Q Consensus 502 ~Tl~a~~~~~~~--ivQVT~~~i~-l~~~~--~~~~~~~~~~~~~I~~as~~--~~~v~v~~s~-~~l~~l~~~~~~~l~ 573 (1213)
.-+-|+.+-.+. ++--.++++. |.... ..+..... ...+|..++++ ++.+++..+. |+|.+++-..+.-+.
T Consensus 266 ~kvtaa~fH~~t~~lvvgFssG~f~LyelP~f~lih~LSi-s~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsYVl 344 (893)
T KOG0291|consen 266 SKVTAAAFHKGTNLLVVGFSSGEFGLYELPDFNLIHSLSI-SDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESYVL 344 (893)
T ss_pred cceeeeeccCCceEEEEEecCCeeEEEecCCceEEEEeec-ccceeeEEEecccCCEEEEcCCccceEEEEEeeccceee
Confidence 445555554332 3333445555 43322 45555555 33679999998 8899987544 688888765332111
Q ss_pred Eeeeec-cCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCC
Q 000944 574 EVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGA 652 (1213)
Q Consensus 574 ~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~ 652 (1213)
+.+ =...++|++..+ .+.+++.|..||.|.||......|+.+.+. + -..+..+.+..
T Consensus 345 ---KQQgH~~~i~~l~YSp------Dgq~iaTG~eDgKVKvWn~~SgfC~vTFte-H----ts~Vt~v~f~~-------- 402 (893)
T KOG0291|consen 345 ---KQQGHSDRITSLAYSP------DGQLIATGAEDGKVKVWNTQSGFCFVTFTE-H----TSGVTAVQFTA-------- 402 (893)
T ss_pred ---eccccccceeeEEECC------CCcEEEeccCCCcEEEEeccCceEEEEecc-C----CCceEEEEEEe--------
Confidence 111 124689999865 467999999999999999743333333221 1 11222233332
Q ss_pred CCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCcc-E--EEEE-eCCeEE-EE
Q 000944 653 DHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRP-W--LGYI-HRGRFL-LT 727 (1213)
Q Consensus 653 ~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p-~--~i~~-~~~~~~-~~ 727 (1213)
....|+...-||.+-.+.+.... +- ..+-+..|+.+..+..+-..-+++.|+.- + .+++ ..|++. +-
T Consensus 403 ---~g~~llssSLDGtVRAwDlkRYr----Nf-RTft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiL 474 (893)
T KOG0291|consen 403 ---RGNVLLSSSLDGTVRAWDLKRYR----NF-RTFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDIL 474 (893)
T ss_pred ---cCCEEEEeecCCeEEeeeecccc----ee-eeecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehh
Confidence 35678899999999777664221 11 23567788888876654222344455443 2 2333 334432 11
Q ss_pred ecCccccceeeccccCCCCceEEEEe-CCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEcc
Q 000944 728 PLSYETLEYAASFSSDQCVEGVVSVA-GNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETD 798 (1213)
Q Consensus 728 ~~~~~~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~ 798 (1213)
.=...+|.+. .|+.. .+.++..+ ++++++=.+-. ++-.+..+++..-...+.++|+.+-++|++.+
T Consensus 475 sGHEgPVs~l-~f~~~--~~~LaS~SWDkTVRiW~if~--s~~~vEtl~i~sdvl~vsfrPdG~elaVaTld 541 (893)
T KOG0291|consen 475 SGHEGPVSGL-SFSPD--GSLLASGSWDKTVRIWDIFS--SSGTVETLEIRSDVLAVSFRPDGKELAVATLD 541 (893)
T ss_pred cCCCCcceee-EEccc--cCeEEeccccceEEEEEeec--cCceeeeEeeccceeEEEEcCCCCeEEEEEec
Confidence 1112233331 12211 12222223 67887766554 34467888888999999999999999999864
No 20
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=97.01 E-value=1 Score=51.89 Aligned_cols=116 Identities=7% Similarity=0.060 Sum_probs=66.0
Q ss_pred ccEEEEEEEEeCCceEEEEEEEee-----cCcce----EeccccCeEEEEeC--CeEEEEecCCc--eeeceeeec--Cc
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQV-----EGIPL----ALCQFQGRLLAGIG--PVLRLYDLGKK--RLLRKCENK--LF 983 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~-----~g~V~----ai~~~~g~ll~~~g--~~l~i~~~~~~--~l~~~~~~~--~~ 983 (1213)
.+.|.+|+++..+.+++.+.+... .++-+ ++.+=+.+++++.. +.|.+|+++.. .+....... ..
T Consensus 196 ~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~~~~ 275 (330)
T PRK11028 196 NSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQPTETQ 275 (330)
T ss_pred CCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEecccc
Confidence 578999999853235655544321 22222 23343456666642 57899998632 232222222 11
Q ss_pred cceEEEEEEeCCEEEEeec-CCcEEEEEEeccCCeEEEeeccCCCcceEEEEe
Q 000944 984 PNTIVSINTYRDRIYVGDI-QESFHFCKYRRDENQLYIFADDSVPRWLTAAHH 1035 (1213)
Q Consensus 984 ~~~i~~l~~~~~~I~vgD~-~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ 1035 (1213)
| ....++..+.+++++.. ...+.+++.+.+.+.|..+++=....+..++.+
T Consensus 276 p-~~~~~~~dg~~l~va~~~~~~v~v~~~~~~~g~l~~~~~~~~g~~P~~~~~ 327 (330)
T PRK11028 276 P-RGFNIDHSGKYLIAAGQKSHHISVYEIDGETGLLTELGRYAVGQGPMWVSV 327 (330)
T ss_pred C-CceEECCCCCEEEEEEccCCcEEEEEEcCCCCcEEEccccccCCCceEEEE
Confidence 2 13355667899999886 457999888766667777654333444444444
No 21
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.95 E-value=0.87 Score=49.94 Aligned_cols=171 Identities=11% Similarity=0.112 Sum_probs=96.9
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.+++++..+.+.+..+... ...+.++.. . ....+++.|+. .|.+.+|++... +......
T Consensus 74 ~i~i~~~~~~~~~~~~~~~-~~~i~~~~~---~--~~~~~~~~~~~------------~~~i~~~~~~~~--~~~~~~~- 132 (289)
T cd00200 74 TIRLWDLETGECVRTLTGH-TSYVSSVAF---S--PDGRILSSSSR------------DKTIKVWDVETG--KCLTTLR- 132 (289)
T ss_pred eEEEEEcCcccceEEEecc-CCcEEEEEE---c--CCCCEEEEecC------------CCeEEEEECCCc--EEEEEec-
Confidence 5677776665544444322 234555443 2 12356665541 478999998743 2222221
Q ss_pred eecCcceEecccc-CeEEE-Ee-CCeEEEEecCCceeeceeeecCccceEEEEEEeC--CEEEEeecCCcEEEEEEeccC
Q 000944 941 QVEGIPLALCQFQ-GRLLA-GI-GPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYR--DRIYVGDIQESFHFCKYRRDE 1015 (1213)
Q Consensus 941 ~~~g~V~ai~~~~-g~ll~-~~-g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~--~~I~vgD~~~Sv~~l~~~~~~ 1015 (1213)
...+++++++... +.+++ +. +..+++|++...+....-.. ....+.++.... +.++++.....+.++..+. .
T Consensus 133 ~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~--~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~-~ 209 (289)
T cd00200 133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTG--HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLST-G 209 (289)
T ss_pred cCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEec--CccccceEEECCCcCEEEEecCCCcEEEEECCC-C
Confidence 3567788877664 44444 44 67899999976554333221 233566666544 4888888877777766543 1
Q ss_pred CeEEEeeccCCCcceEEEEeecCC-eeeeecCCCcEEEEecCC
Q 000944 1016 NQLYIFADDSVPRWLTAAHHIDFD-TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1016 ~~l~~~a~D~~~~~~~~~~~ld~~-~~l~~D~~gnl~il~~~~ 1057 (1213)
..+..+ + .....+.++.+..++ .+++++.+|.+.+++...
T Consensus 210 ~~~~~~-~-~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~ 250 (289)
T cd00200 210 KCLGTL-R-GHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250 (289)
T ss_pred ceecch-h-hcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCC
Confidence 222222 1 123356666665444 456666799999998753
No 22
>PF03178 CPSF_A: CPSF A subunit region; InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=96.77 E-value=0.54 Score=54.06 Aligned_cols=171 Identities=15% Similarity=0.173 Sum_probs=115.7
Q ss_pred eEEEEEeCCC-------CceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCc
Q 000944 860 SCIRVLDPRS-------ANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGK 932 (1213)
Q Consensus 860 s~i~l~d~~~-------~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~ 932 (1213)
+.+.+++-.+ .+.++..+++ ..|++++.+ +.++++|.+ ++|++|++..+.
T Consensus 62 Gri~v~~i~~~~~~~~~l~~i~~~~~~--g~V~ai~~~-------~~~lv~~~g-------------~~l~v~~l~~~~- 118 (321)
T PF03178_consen 62 GRILVFEISESPENNFKLKLIHSTEVK--GPVTAICSF-------NGRLVVAVG-------------NKLYVYDLDNSK- 118 (321)
T ss_dssp EEEEEEEECSS-----EEEEEEEEEES--S-EEEEEEE-------TTEEEEEET-------------TEEEEEEEETTS-
T ss_pred cEEEEEEEEcccccceEEEEEEEEeec--CcceEhhhh-------CCEEEEeec-------------CEEEEEEccCcc-
Confidence 5677776555 2334555553 567787764 235777764 578999999872
Q ss_pred eEEEEEEEeecCcceEeccccCeEEEEeC-CeEEEEecC--CceeeceeeecCccceEEEEEEe--CCEEEEeecCCcEE
Q 000944 933 SLELLHKTQVEGIPLALCQFQGRLLAGIG-PVLRLYDLG--KKRLLRKCENKLFPNTIVSINTY--RDRIYVGDIQESFH 1007 (1213)
Q Consensus 933 kl~~~~~~~~~g~V~ai~~~~g~ll~~~g-~~l~i~~~~--~~~l~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~Sv~ 1007 (1213)
+|......+.+-.++++..++++++++-- +.+.++.|+ ..+|...|.-. .+..++++... ++.++++|....+.
T Consensus 119 ~l~~~~~~~~~~~i~sl~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~-~~~~v~~~~~l~d~~~~i~~D~~gnl~ 197 (321)
T PF03178_consen 119 TLLKKAFYDSPFYITSLSVFKNYILVGDAMKSVSLLRYDEENNKLILVARDY-QPRWVTAAEFLVDEDTIIVGDKDGNLF 197 (321)
T ss_dssp SEEEEEEE-BSSSEEEEEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEES-S-BEEEEEEEE-SSSEEEEEETTSEEE
T ss_pred cchhhheecceEEEEEEeccccEEEEEEcccCEEEEEEEccCCEEEEEEecC-CCccEEEEEEecCCcEEEEEcCCCeEE
Confidence 48888888888899999999999988755 578888776 45577776543 36678888765 45999999999999
Q ss_pred EEEEecc------CC-eEEEeeccCCCcceEEE---Eeec---C-C-----eeeeecCCCcEEEEe
Q 000944 1008 FCKYRRD------EN-QLYIFADDSVPRWLTAA---HHID---F-D-----TMAGADKFGNIYFVR 1054 (1213)
Q Consensus 1008 ~l~~~~~------~~-~l~~~a~D~~~~~~~~~---~~ld---~-~-----~~l~~D~~gnl~il~ 1054 (1213)
+++++++ .. +|...+.=+..-.+++. .+.. . + .++.+-.+|.|.++-
T Consensus 198 ~l~~~~~~~~~~~~~~~L~~~~~f~lg~~v~~~~~~~l~~~~~~~~~~~~~~i~~~T~~G~Ig~l~ 263 (321)
T PF03178_consen 198 VLRYNPEIPNSRDGDPKLERISSFHLGDIVNSFRRGSLIPRSGSSESPNRPQILYGTVDGSIGVLI 263 (321)
T ss_dssp EEEE-SS-SSTTTTTTBEEEEEEEE-SS-EEEEEE--SS--SSSS-TTEEEEEEEEETTS-EEEEE
T ss_pred EEEECCCCcccccccccceeEEEEECCCccceEEEEEeeecCCCCcccccceEEEEecCCEEEEEE
Confidence 9999853 23 78877765555556655 2222 1 1 278888999998543
No 23
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.74 E-value=1.1 Score=49.23 Aligned_cols=125 Identities=18% Similarity=0.211 Sum_probs=71.9
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.+.++|..+++.+..+... ...+.++.. . ....++++|+. .|.+.+|++... + .+...
T Consensus 158 ~i~i~d~~~~~~~~~~~~~-~~~i~~~~~---~--~~~~~l~~~~~------------~~~i~i~d~~~~--~--~~~~~ 215 (289)
T cd00200 158 TIKLWDLRTGKCVATLTGH-TGEVNSVAF---S--PDGEKLLSSSS------------DGTIKLWDLSTG--K--CLGTL 215 (289)
T ss_pred cEEEEEccccccceeEecC-ccccceEEE---C--CCcCEEEEecC------------CCcEEEEECCCC--c--eecch
Confidence 4667776666655555422 224444432 2 22346776653 467889988653 2 12222
Q ss_pred -eecCcceEecccc-CeEEEEe--CCeEEEEecCCceeeceeeecCccceEEEEEEeC--CEEEEeecCCcEEEE
Q 000944 941 -QVEGIPLALCQFQ-GRLLAGI--GPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYR--DRIYVGDIQESFHFC 1009 (1213)
Q Consensus 941 -~~~g~V~ai~~~~-g~ll~~~--g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~--~~I~vgD~~~Sv~~l 1009 (1213)
...+++.+++-.. +.++++. ...+++|++...+....-. ..+..+.++.... +++++|..-..+.++
T Consensus 216 ~~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~~~~~~~~--~~~~~i~~~~~~~~~~~l~~~~~d~~i~iw 288 (289)
T cd00200 216 RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS--GHTNSVTSLAWSPDGKRLASGSADGTIRIW 288 (289)
T ss_pred hhcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCCceeEEEcc--ccCCcEEEEEECCCCCEEEEecCCCeEEec
Confidence 3456888877654 5565554 5689999998655433322 2344566666554 677777766666553
No 24
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=96.67 E-value=0.4 Score=58.93 Aligned_cols=222 Identities=15% Similarity=0.184 Sum_probs=123.3
Q ss_pred eeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeE-----EEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEE
Q 000944 599 SRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLL-----FLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTV 673 (1213)
Q Consensus 599 ~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~-----~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~ 673 (1213)
..++.++..||.|++|.-..++ ..|.++. +..+. ....+++.|..++.+.+|.
T Consensus 25 gefi~tcgsdg~ir~~~~~sd~-----------e~P~ti~~~g~~v~~ia-----------~~s~~f~~~s~~~tv~~y~ 82 (933)
T KOG1274|consen 25 GEFICTCGSDGDIRKWKTNSDE-----------EEPETIDISGELVSSIA-----------CYSNHFLTGSEQNTVLRYK 82 (933)
T ss_pred CCEEEEecCCCceEEeecCCcc-----------cCCchhhccCceeEEEe-----------ecccceEEeeccceEEEee
Confidence 4488888889999999753221 1122222 11111 2345899999999999999
Q ss_pred EeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCcc-EEEE-EeCC--eEEEEecCccccceeeccccCCCCceE
Q 000944 674 VDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRP-WLGY-IHRG--RFLLTPLSYETLEYAASFSSDQCVEGV 749 (1213)
Q Consensus 674 ~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p-~~i~-~~~~--~~~~~~~~~~~v~~~~~f~~~~~~~~~ 749 (1213)
++.... ...-.++ +.|++-..+..+|. .+.+.|+-. ..+. ...+ ...+.+... ++.++ .|+. .+-|
T Consensus 83 fps~~~--~~iL~Rf--tlp~r~~~v~g~g~-~iaagsdD~~vK~~~~~D~s~~~~lrgh~a-pVl~l-~~~p---~~~f 152 (933)
T KOG1274|consen 83 FPSGEE--DTILARF--TLPIRDLAVSGSGK-MIAAGSDDTAVKLLNLDDSSQEKVLRGHDA-PVLQL-SYDP---KGNF 152 (933)
T ss_pred CCCCCc--cceeeee--eccceEEEEecCCc-EEEeecCceeEEEEeccccchheeecccCC-ceeee-eEcC---CCCE
Confidence 975321 1111111 23444444433332 333333333 2222 1111 112222111 12222 2222 1234
Q ss_pred EEE-e-CCeEEEEEEccC------CCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCC
Q 000944 750 VSV-A-GNALRVFTIERL------GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMG 821 (1213)
Q Consensus 750 i~~-~-~~~L~i~~l~~~------~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 821 (1213)
+.+ + ++.+.|-.+.+. ..-.+.-..-+.....++++||....+++.+.+
T Consensus 153 LAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d----------------------- 209 (933)
T KOG1274|consen 153 LAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVD----------------------- 209 (933)
T ss_pred EEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccC-----------------------
Confidence 433 2 678888888861 000001111124556788899988888877531
Q ss_pred CCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEE
Q 000944 822 ENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLL 901 (1213)
Q Consensus 822 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i 901 (1213)
.++++++..+|+....+.-+..+...|.+.|. ....||
T Consensus 210 --------------------------------------~~Vkvy~r~~we~~f~Lr~~~~ss~~~~~~ws----PnG~Yi 247 (933)
T KOG1274|consen 210 --------------------------------------NTVKVYSRKGWELQFKLRDKLSSSKFSDLQWS----PNGKYI 247 (933)
T ss_pred --------------------------------------CeEEEEccCCceeheeecccccccceEEEEEc----CCCcEE
Confidence 27899999999988887777777776766665 345899
Q ss_pred EEEeeecCccCCCCCCcccEEEEEEEEe
Q 000944 902 AVGTAKGLQFWPKRNIVAGYIHIYRFVE 929 (1213)
Q Consensus 902 ~VGT~~~~~~~~e~~~~~Gri~v~~i~~ 929 (1213)
+-||- .|-|.||+++.
T Consensus 248 AAs~~------------~g~I~vWnv~t 263 (933)
T KOG1274|consen 248 AASTL------------DGQILVWNVDT 263 (933)
T ss_pred eeecc------------CCcEEEEeccc
Confidence 99986 57899999984
No 25
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.59 E-value=2.8 Score=50.74 Aligned_cols=172 Identities=14% Similarity=0.196 Sum_probs=107.2
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK 939 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~ 939 (1213)
+.++.+|+++++++....-++.|.+..+.... ....+++|-+.. .|++|+.+ .|+++..
T Consensus 260 g~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~----~~~~~l~vtaeQ-------------nl~l~d~~----~l~i~k~ 318 (775)
T KOG0319|consen 260 GVVQYWDSESGKCVYKQRQSDSEEIDHLLAIE----SMSQLLLVTAEQ-------------NLFLYDED----ELTIVKQ 318 (775)
T ss_pred ceEEEEecccchhhhhhccCCchhhhcceecc----ccCceEEEEccc-------------eEEEEEcc----ccEEehh
Confidence 57899999999887666655566644332211 234566665532 24566443 4555433
Q ss_pred -EeecCcceEecccc---CeEEEEeCC-eEEEEecCCc--eeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEE
Q 000944 940 -TQVEGIPLALCQFQ---GRLLAGIGP-VLRLYDLGKK--RLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCK 1010 (1213)
Q Consensus 940 -~~~~g~V~ai~~~~---g~ll~~~g~-~l~i~~~~~~--~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~ 1010 (1213)
.-.++-|+.+|-++ .+|++|.|. .+++|+...- ++++ ++.-.+.|+. ..+.+|+-|---+|+-+.|
T Consensus 319 ivG~ndEI~Dm~~lG~e~~~laVATNs~~lr~y~~~~~~c~ii~-----GH~e~vlSL~~~~~g~llat~sKD~svilWr 393 (775)
T KOG0319|consen 319 IVGYNDEILDMKFLGPEESHLAVATNSPELRLYTLPTSYCQIIP-----GHTEAVLSLDVWSSGDLLATGSKDKSVILWR 393 (775)
T ss_pred hcCCchhheeeeecCCccceEEEEeCCCceEEEecCCCceEEEe-----CchhheeeeeecccCcEEEEecCCceEEEEE
Confidence 24578888888887 799999986 7899976532 2222 2333567777 3456888888888999999
Q ss_pred EeccCCeEEEeecc-CCCcceEEEEe--ecCCeeeeecCCCcEEEEecCC
Q 000944 1011 YRRDENQLYIFADD-SVPRWLTAAHH--IDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1011 ~~~~~~~l~~~a~D-~~~~~~~~~~~--ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
++....+...+|.- .+...++++.+ ...+.++..-.++.|-+..++.
T Consensus 394 ~~~~~~~~~~~a~~~gH~~svgava~~~~~asffvsvS~D~tlK~W~l~~ 443 (775)
T KOG0319|consen 394 LNNNCSKSLCVAQANGHTNSVGAVAGSKLGASFFVSVSQDCTLKLWDLPK 443 (775)
T ss_pred ecCCcchhhhhhhhcccccccceeeecccCccEEEEecCCceEEEecCCC
Confidence 96555555555433 34455555554 2222355566677777776654
No 26
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.49 E-value=3.2 Score=50.27 Aligned_cols=226 Identities=13% Similarity=0.148 Sum_probs=109.0
Q ss_pred cEEEEEec-CCEEEEEEeCC-EEEEEEEccCCCeEEeeeecc--CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 541 TIVKVGSN-RLQVVIALSGG-ELIYFEVDMTGQLLEVEKHEM--SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 541 ~I~~as~~-~~~v~v~~s~~-~l~~l~~~~~~~l~~~~~~~l--~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
.|++-++. |+.+++....+ -+-.+.+.. |.+.. .... +.++.-|++.+ .+..++.|..|+.+.+|++
T Consensus 64 ~ita~~l~~d~~~L~~a~rs~llrv~~L~t-gk~ir--swKa~He~Pvi~ma~~~------~g~LlAtggaD~~v~VWdi 134 (775)
T KOG0319|consen 64 EITALALTPDEEVLVTASRSQLLRVWSLPT-GKLIR--SWKAIHEAPVITMAFDP------TGTLLATGGADGRVKVWDI 134 (775)
T ss_pred hhheeeecCCccEEEEeeccceEEEEEccc-chHhH--hHhhccCCCeEEEEEcC------CCceEEeccccceEEEEEe
Confidence 46666655 34444444444 455555553 33221 1222 35566677754 3478999999999999999
Q ss_pred CCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcc-----cccceeeecC
Q 000944 617 DPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQL-----SDSRSRFLGL 691 (1213)
Q Consensus 617 ~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l-----~~~~~~~lG~ 691 (1213)
....+ +.+....+++..+++ |.. ...+..|..|-.||.+..|.+....-.+ .......++.
T Consensus 135 ~~~~~--th~fkG~gGvVssl~---F~~---------~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~~ 200 (775)
T KOG0319|consen 135 KNGYC--THSFKGHGGVVSSLL---FHP---------HWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAVTSLAF 200 (775)
T ss_pred eCCEE--EEEecCCCceEEEEE---eCC---------ccchhheeecCCCceEEEEEcccCchHHHHHHhhhhheeeeee
Confidence 53322 222222333334443 332 2245667888899999999886432111 1111222221
Q ss_pred CCCeEEEEEECCeeEEEEecCccEEEEEe--CCeEEEEecCccccceeeccccCCC-CceEEEEeCCeEEEEEEccCC-C
Q 000944 692 RPPKLFSVVVGGRAAMLCLSSRPWLGYIH--RGRFLLTPLSYETLEYAASFSSDQC-VEGVVSVAGNALRVFTIERLG-E 767 (1213)
Q Consensus 692 ~pv~l~~~~~~~~~~v~~~g~~p~~i~~~--~~~~~~~~~~~~~v~~~~~f~~~~~-~~~~i~~~~~~L~i~~l~~~~-~ 767 (1213)
.+ ++...+-++=++.++++.. -..++.-|+.. .+.++.-.+..-- ...+++..+++=.++..+.-+ .
T Consensus 201 ~~--------d~~~~ls~~RDkvi~vwd~~~~~~l~~lp~ye-~~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~ 271 (775)
T KOG0319|consen 201 SE--------DSLELLSVGRDKVIIVWDLVQYKKLKTLPLYE-SLESVVRLREELGGKGEYIITAGGSGVVQYWDSESGK 271 (775)
T ss_pred cc--------CCceEEEeccCcEEEEeehhhhhhhheechhh-heeeEEEechhcCCcceEEEEecCCceEEEEecccch
Confidence 11 1222333333555556643 23455566532 3444444433111 123444444444444444311 1
Q ss_pred eeEEEEEeCCCccceeeecCCCceEEEEEcc
Q 000944 768 TFNETALPLRYTPRRFVLQPKKKLMVIIETD 798 (1213)
Q Consensus 768 ~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~ 798 (1213)
-...++.|-++...++++.+..+.++.++.+
T Consensus 272 ~~~~~~~~~~~e~~~~~~~~~~~~~l~vtae 302 (775)
T KOG0319|consen 272 CVYKQRQSDSEEIDHLLAIESMSQLLLVTAE 302 (775)
T ss_pred hhhhhccCCchhhhcceeccccCceEEEEcc
Confidence 1122333323335566666666665655543
No 27
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=96.31 E-value=0.032 Score=63.30 Aligned_cols=175 Identities=10% Similarity=0.163 Sum_probs=101.9
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK 939 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~ 939 (1213)
.+++|+|.+|++++.++. .++.++|++. .- ++.+.+++|+. .|+|.-|++..+ + ++.+
T Consensus 280 ~~lKlwDtETG~~~~~f~--~~~~~~cvkf---~p-d~~n~fl~G~s------------d~ki~~wDiRs~--k--vvqe 337 (503)
T KOG0282|consen 280 RFLKLWDTETGQVLSRFH--LDKVPTCVKF---HP-DNQNIFLVGGS------------DKKIRQWDIRSG--K--VVQE 337 (503)
T ss_pred eeeeeeccccceEEEEEe--cCCCceeeec---CC-CCCcEEEEecC------------CCcEEEEeccch--H--HHHH
Confidence 489999999999887655 4567777643 32 23588899986 689999998865 2 2222
Q ss_pred E-eecCcceEeccc--cCeEEEEeC-CeEEEEecCCceeec-eeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEecc
Q 000944 940 T-QVEGIPLALCQF--QGRLLAGIG-PVLRLYDLGKKRLLR-KCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRD 1014 (1213)
Q Consensus 940 ~-~~~g~V~ai~~~--~g~ll~~~g-~~l~i~~~~~~~l~~-~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~ 1014 (1213)
. ..-|+|..|.-+ +-+.+.+.- ..+.+|+|+..-..+ .+....+.+..+.+...++++++=-+-.-+.++.-.+
T Consensus 338 Yd~hLg~i~~i~F~~~g~rFissSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~- 416 (503)
T KOG0282|consen 338 YDRHLGAILDITFVDEGRRFISSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVP- 416 (503)
T ss_pred HHhhhhheeeeEEccCCceEeeeccCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEEEeccc-
Confidence 2 224777777665 345666665 479999998542211 1222122233455555455444433333333333211
Q ss_pred CCeE---EEeeccCCCcceEEEEe-ecCCeeeeecCCCcEEEEecCC
Q 000944 1015 ENQL---YIFADDSVPRWLTAAHH-IDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1015 ~~~l---~~~a~D~~~~~~~~~~~-ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
..++ .....-..+-+.+.+.| .|.++++.+|.+|+++++..++
T Consensus 417 ~~r~nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsdG~v~~wdwkt 463 (503)
T KOG0282|consen 417 PFRLNKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSDGKVNFWDWKT 463 (503)
T ss_pred ccccCHhhhhcceeccCceeeEEEcCCCCeEEeecCCccEEEeechh
Confidence 1111 11112222333344555 5666788899999999998765
No 28
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.14 E-value=4.9 Score=48.83 Aligned_cols=169 Identities=9% Similarity=0.142 Sum_probs=104.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
++++++-.+.+.+-+++-. .+.|.+.+ .+..|+++||. .|.+.+|++.... -++.+-
T Consensus 395 SikiWn~~t~kciRTi~~~---y~l~~~Fv-----pgd~~Iv~G~k------------~Gel~vfdlaS~~-l~Eti~-- 451 (888)
T KOG0306|consen 395 SIKIWNRDTLKCIRTITCG---YILASKFV-----PGDRYIVLGTK------------NGELQVFDLASAS-LVETIR-- 451 (888)
T ss_pred cEEEEEccCcceeEEeccc---cEEEEEec-----CCCceEEEecc------------CCceEEEEeehhh-hhhhhh--
Confidence 7899999888887766544 66665554 34689999995 6889999998631 122222
Q ss_pred eecCcceEeccc--c-CeEEEEeCCeEEEEecC--------Cce---eeceeeecCccceEEEEEEe--CCEEEEeecCC
Q 000944 941 QVEGIPLALCQF--Q-GRLLAGIGPVLRLYDLG--------KKR---LLRKCENKLFPNTIVSINTY--RDRIYVGDIQE 1004 (1213)
Q Consensus 941 ~~~g~V~ai~~~--~-g~ll~~~g~~l~i~~~~--------~~~---l~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~ 1004 (1213)
...|++-+|... + |.+.++..+.|.+|++. +++ |.....+. ++-.+.++.+. +.|+.||=+-.
T Consensus 452 AHdgaIWsi~~~pD~~g~vT~saDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLe-l~ddvL~v~~Spdgk~LaVsLLdn 530 (888)
T KOG0306|consen 452 AHDGAIWSISLSPDNKGFVTGSADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLE-LEDDVLCVSVSPDGKLLAVSLLDN 530 (888)
T ss_pred ccccceeeeeecCCCCceEEecCCcEEEEEeEEEEeccCcccceeeeeccceEEe-ccccEEEEEEcCCCcEEEEEeccC
Confidence 356887666554 3 34444555678899884 122 22223333 34456666554 78888988888
Q ss_pred cEEEEEEeccCCeEEEeeccCCCcceEEEEee-cCCeeeeecCCCcEEEEecC
Q 000944 1005 SFHFCKYRRDENQLYIFADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1005 Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~~ 1056 (1213)
-+.++-.++-.-.|.+.+ +..-|.+.+.- |...++.+-++.|+-++-++
T Consensus 531 TVkVyflDtlKFflsLYG---HkLPV~smDIS~DSklivTgSADKnVKiWGLd 580 (888)
T KOG0306|consen 531 TVKVYFLDTLKFFLSLYG---HKLPVLSMDISPDSKLIVTGSADKNVKIWGLD 580 (888)
T ss_pred eEEEEEecceeeeeeecc---cccceeEEeccCCcCeEEeccCCCceEEeccc
Confidence 888877664333344433 23335555532 32356777777787776654
No 29
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.12 E-value=1.4 Score=47.78 Aligned_cols=131 Identities=9% Similarity=0.115 Sum_probs=95.3
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeE--EEEeCC---eEEEEecCCceeeceeeecCccceEEEEEEe
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRL--LAGIGP---VLRLYDLGKKRLLRKCENKLFPNTIVSINTY 993 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~l--l~~~g~---~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~ 993 (1213)
-|.+.+|++..+ ....+.. ..+||.++.-+++++ +.+.|+ .|+.|+...... ++..+ +|-.+.++++.
T Consensus 93 Dk~~k~wDL~S~--Q~~~v~~--Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~p--v~t~~-LPeRvYa~Dv~ 165 (347)
T KOG0647|consen 93 DKQAKLWDLASG--QVSQVAA--HDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNP--VATLQ-LPERVYAADVL 165 (347)
T ss_pred CCceEEEEccCC--Ceeeeee--cccceeEEEEecCCCcceeEecccccceeecccCCCCe--eeeee-ccceeeehhcc
Confidence 355678888876 6666654 569999999998877 667775 788888875443 34455 68889999999
Q ss_pred CCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceE-EE-EeecCCeeeeecCCCcEEEEecCCC
Q 000944 994 RDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLT-AA-HHIDFDTMAGADKFGNIYFVRLPQD 1058 (1213)
Q Consensus 994 ~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~-~~-~~ld~~~~l~~D~~gnl~il~~~~~ 1058 (1213)
....+|+.+-++|.++..+..+..+..+.. + ..|.+ ++ .|-|.+.++.+-.+|-+.+...++.
T Consensus 166 ~pm~vVata~r~i~vynL~n~~te~k~~~S-p-Lk~Q~R~va~f~d~~~~alGsiEGrv~iq~id~~ 230 (347)
T KOG0647|consen 166 YPMAVVATAERHIAVYNLENPPTEFKRIES-P-LKWQTRCVACFQDKDGFALGSIEGRVAIQYIDDP 230 (347)
T ss_pred CceeEEEecCCcEEEEEcCCCcchhhhhcC-c-ccceeeEEEEEecCCceEeeeecceEEEEecCCC
Confidence 999999999999999988654433332221 1 33444 33 3677788888999999999888763
No 30
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.11 E-value=5.4 Score=49.08 Aligned_cols=117 Identities=18% Similarity=0.230 Sum_probs=74.8
Q ss_pred ccEEEEEecC--CEEEEEEeCCEEEEEEEccCCCeEEeeeecc-CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 540 RTIVKVGSNR--LQVVIALSGGELIYFEVDMTGQLLEVEKHEM-SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 540 ~~I~~as~~~--~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l-~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
..|+++.-.. +-|+|.+.+|++++|.+..+..|.. .+. ...|+++++-. .+...+++|+.+|.+.+|.|
T Consensus 203 s~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~s---Fk~d~g~VtslSFrt-----DG~p~las~~~~G~m~~wDL 274 (910)
T KOG1539|consen 203 SRITAIEQSPALDVVAIGLENGTVIIFNLKFDKILMS---FKQDWGRVTSLSFRT-----DGNPLLASGRSNGDMAFWDL 274 (910)
T ss_pred cceeEeccCCcceEEEEeccCceEEEEEcccCcEEEE---EEccccceeEEEecc-----CCCeeEEeccCCceEEEEEc
Confidence 4577766543 5678888999999998875443433 333 48899999954 24778999999999999999
Q ss_pred CCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCC
Q 000944 617 DPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTG 679 (1213)
Q Consensus 617 ~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~ 679 (1213)
+ ...+..+-...=...+... .+ ..+.+.|+-.-.|-.|-.|-+|..+|
T Consensus 275 e-~kkl~~v~~nah~~sv~~~---~f-----------l~~epVl~ta~~DnSlk~~vfD~~dg 322 (910)
T KOG1539|consen 275 E-KKKLINVTRNAHYGSVTGA---TF-----------LPGEPVLVTAGADNSLKVWVFDSGDG 322 (910)
T ss_pred C-CCeeeeeeeccccCCcccc---ee-----------cCCCceEeeccCCCceeEEEeeCCCC
Confidence 5 4344322110000111111 11 12567777777788888888886655
No 31
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=96.08 E-value=1 Score=49.59 Aligned_cols=71 Identities=14% Similarity=0.198 Sum_probs=47.8
Q ss_pred eEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEE
Q 000944 584 VACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAG 663 (1213)
Q Consensus 584 is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Llig 663 (1213)
|-|+++.| ...|++.|..|+++.||.+. ...|.. +....-...+.+++. ...+|||++
T Consensus 154 Vr~vavdP------~n~wf~tgs~DrtikIwDla-tg~Lkl-tltGhi~~vr~vavS--------------~rHpYlFs~ 211 (460)
T KOG0285|consen 154 VRSVAVDP------GNEWFATGSADRTIKIWDLA-TGQLKL-TLTGHIETVRGVAVS--------------KRHPYLFSA 211 (460)
T ss_pred EEEEeeCC------CceeEEecCCCceeEEEEcc-cCeEEE-eecchhheeeeeeec--------------ccCceEEEe
Confidence 56788865 46899999999999999994 333421 100000123343332 357999999
Q ss_pred eeCCeEEEEEEeC
Q 000944 664 LQNGVLFRTVVDM 676 (1213)
Q Consensus 664 l~~G~l~~~~~~~ 676 (1213)
..|+.+-.|.+..
T Consensus 212 gedk~VKCwDLe~ 224 (460)
T KOG0285|consen 212 GEDKQVKCWDLEY 224 (460)
T ss_pred cCCCeeEEEechh
Confidence 9999999998854
No 32
>PLN00181 protein SPA1-RELATED; Provisional
Probab=95.99 E-value=5.4 Score=52.16 Aligned_cols=175 Identities=14% Similarity=0.141 Sum_probs=100.8
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.++++|..+++.+..+... ..+.|+. +.. ....++++|+. .|.|++|++......+..+ .
T Consensus 599 ~v~iWd~~~~~~~~~~~~~--~~v~~v~---~~~-~~g~~latgs~------------dg~I~iwD~~~~~~~~~~~--~ 658 (793)
T PLN00181 599 SVKLWSINQGVSIGTIKTK--ANICCVQ---FPS-ESGRSLAFGSA------------DHKVYYYDLRNPKLPLCTM--I 658 (793)
T ss_pred EEEEEECCCCcEEEEEecC--CCeEEEE---EeC-CCCCEEEEEeC------------CCeEEEEECCCCCccceEe--c
Confidence 6899998888777666532 3444443 322 23468888864 4789999987542122222 2
Q ss_pred eecCcceEeccccCeEEEEe--CCeEEEEecCCce----eeceeeecCccc--eEEEEEEeCCEEEEeecCCcEEEEEEe
Q 000944 941 QVEGIPLALCQFQGRLLAGI--GPVLRLYDLGKKR----LLRKCENKLFPN--TIVSINTYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 941 ~~~g~V~ai~~~~g~ll~~~--g~~l~i~~~~~~~----l~~~~~~~~~~~--~i~~l~~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
...++|+++.-.++..+++. ...|.+|++.... ......+..+.. ..+++...+++|++|..-..+.++...
T Consensus 659 ~h~~~V~~v~f~~~~~lvs~s~D~~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~ 738 (793)
T PLN00181 659 GHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKA 738 (793)
T ss_pred CCCCCEEEEEEeCCCEEEEEECCCEEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeCCCEEEEEECC
Confidence 45678888876666554433 3579999986321 111112221222 334555668899999988888886543
Q ss_pred ccCCeEEE----------eeccCCCcceEEEEee-cCCeeeeecCCCcEEEEec
Q 000944 1013 RDENQLYI----------FADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRL 1055 (1213)
Q Consensus 1013 ~~~~~l~~----------~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~ 1055 (1213)
.....+.. +.-+....++.++.+- +...++++..+|+|.++++
T Consensus 739 ~~~~~~s~~~~~~~~~~~~~~~~~~~~V~~v~ws~~~~~lva~~~dG~I~i~~~ 792 (793)
T PLN00181 739 FPMPVLSYKFKTIDPVSGLEVDDASQFISSVCWRGQSSTLVAANSTGNIKILEM 792 (793)
T ss_pred CCCceEEEecccCCcccccccCCCCcEEEEEEEcCCCCeEEEecCCCcEEEEec
Confidence 21111110 0012223345565553 3346788999999999875
No 33
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.77 E-value=4.5 Score=48.00 Aligned_cols=61 Identities=23% Similarity=0.238 Sum_probs=38.0
Q ss_pred EeccccCeEEEEeC-CeEEEEecCCceeeceeeecCccceEEEEEEeCC-EEEEeecCCcEEEEEEec
Q 000944 948 ALCQFQGRLLAGIG-PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRD-RIYVGDIQESFHFCKYRR 1013 (1213)
Q Consensus 948 ai~~~~g~ll~~~g-~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~-~I~vgD~~~Sv~~l~~~~ 1013 (1213)
+-.-++|.|+...+ ..+.+|+|+...|++..... .-++-..+| .+++--.-.|..+++|+.
T Consensus 429 ~e~i~gg~Llg~~ss~~~~fydW~~~~lVrrI~v~-----~k~v~w~d~g~lVai~~d~Sfyil~~n~ 491 (794)
T KOG0276|consen 429 AEGIFGGPLLGVRSSDFLCFYDWESGELVRRIEVT-----SKHVYWSDNGELVAIAGDDSFYILKFNA 491 (794)
T ss_pred eeeecCCceEEEEeCCeEEEEEcccceEEEEEeec-----cceeEEecCCCEEEEEecCceeEEEecH
Confidence 34446788876444 57899999999998764332 223333333 333323346789999875
No 34
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein.; PDB: 2OAJ_A.
Probab=95.71 E-value=1.6 Score=51.20 Aligned_cols=217 Identities=16% Similarity=0.249 Sum_probs=103.7
Q ss_pred cEEEEEecC--CEEEEEEeCCEEEEEEEccC---C----------------------CeEEeee---------------e
Q 000944 541 TIVKVGSNR--LQVVIALSGGELIYFEVDMT---G----------------------QLLEVEK---------------H 578 (1213)
Q Consensus 541 ~I~~as~~~--~~v~v~~s~~~l~~l~~~~~---~----------------------~l~~~~~---------------~ 578 (1213)
.|++.+.+. .-++|++..|++++|+...+ + .|..+.+ +
T Consensus 3 ~v~~vs~a~~t~Elav~~~~GeVv~~k~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~l~di~~r~~~~~~~gf~P~~l~ 82 (395)
T PF08596_consen 3 SVTHVSFAPETLELAVGLESGEVVLFKFGKNQNYGNREQPPDLDYNFRRFSLNNSPGKLTDISDRAPPSLKEGFLPLTLL 82 (395)
T ss_dssp -EEEEEEETTTTEEEEEETTS-EEEEEEEE------------------S--GGGSS-SEEE-GGG--TT-SEEEEEEEEE
T ss_pred eEEEEEecCCCceEEEEccCCcEEEEEcccCCCCCccCCCcccCcccccccccCCCcceEEehhhCCcccccccCchhhe
Confidence 466666663 46788888999999886532 1 1111100 0
Q ss_pred cc-CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCC---------CCceeEEEEeecccCC
Q 000944 579 EM-SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSS---------PPESLLFLEVQASVGG 648 (1213)
Q Consensus 579 ~l-~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~---------~p~Sl~~~~~~~~~~~ 648 (1213)
++ ..+|||++... --|++||..+|++.|++++.- ..+..+.+.. .+.++.+.-+. ++
T Consensus 83 ~~~~g~vtal~~S~-------iGFvaigy~~G~l~viD~RGP---avI~~~~i~~~~~~~~~~~~vt~ieF~vm~---~~ 149 (395)
T PF08596_consen 83 DAKQGPVTALKNSD-------IGFVAIGYESGSLVVIDLRGP---AVIYNENIRESFLSKSSSSYVTSIEFSVMT---LG 149 (395)
T ss_dssp ---S-SEEEEEE-B-------TSEEEEEETTSEEEEEETTTT---EEEEEEEGGG--T-SS----EEEEEEEEEE----T
T ss_pred eccCCcEeEEecCC-------CcEEEEEecCCcEEEEECCCC---eEEeeccccccccccccccCeeEEEEEEEe---cC
Confidence 11 25577777653 359999999999999999522 2333333321 24444433332 22
Q ss_pred CCCCCCCCceEEEEEeeCCeEEEEEEeCCC-Cccc--cccee-eecCCCCeEEEEEEC-----------------C---e
Q 000944 649 EDGADHPASLFLNAGLQNGVLFRTVVDMVT-GQLS--DSRSR-FLGLRPPKLFSVVVG-----------------G---R 704 (1213)
Q Consensus 649 ~~~~~~~~~~~Lligl~~G~l~~~~~~~~~-~~l~--~~~~~-~lG~~pv~l~~~~~~-----------------~---~ 704 (1213)
++. -..+.|+||+..|.++.|.+.+.. +... ..... ......+.+.++... + .
T Consensus 150 ~D~---ySSi~L~vGTn~G~v~~fkIlp~~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g~~i~ 226 (395)
T PF08596_consen 150 GDG---YSSICLLVGTNSGNVLTFKILPSSNGRFSVQFAGATTNHDSPILSIIPINADTGESALATISAMQGLSKGISIP 226 (395)
T ss_dssp TSS---SEEEEEEEEETTSEEEEEEEEE-GGG-EEEEEEEEE--SS----EEEEEETTT--B-B-BHHHHHGGGGT----
T ss_pred CCc---ccceEEEEEeCCCCEEEEEEecCCCCceEEEEeeccccCCCceEEEEEEECCCCCcccCchhHhhccccCCCcC
Confidence 211 257999999999999999998632 2211 00111 122333445544311 1 2
Q ss_pred eEEEEecCccEEEEEe-CCeEEEEecCccccce---eeccccCCCCceEEEEe-CCeEEEEEEccCCCeeEEEEE
Q 000944 705 AAMLCLSSRPWLGYIH-RGRFLLTPLSYETLEY---AASFSSDQCVEGVVSVA-GNALRVFTIERLGETFNETAL 774 (1213)
Q Consensus 705 ~~v~~~g~~p~~i~~~-~~~~~~~~~~~~~v~~---~~~f~~~~~~~~~i~~~-~~~L~i~~l~~~~~~~~~r~i 774 (1213)
..+++++++-.-++.. +++............. +.++.......+++++. .+.+++.++|.+ ..+.--++
T Consensus 227 g~vVvvSe~~irv~~~~~~k~~~K~~~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~G~i~i~SLP~L-kei~~~~l 300 (395)
T PF08596_consen 227 GYVVVVSESDIRVFKPPKSKGAHKSFDDPFLCSSASVVPTISRNGGYCLVCLFNNGSIRIYSLPSL-KEIKSVSL 300 (395)
T ss_dssp EEEEEE-SSEEEEE-TT---EEEEE-SS-EEEEEEEEEEEE-EEEEEEEEEEETTSEEEEEETTT---EEEEEE-
T ss_pred cEEEEEcccceEEEeCCCCcccceeeccccccceEEEEeecccCCceEEEEEECCCcEEEEECCCc-hHhhcccC
Confidence 2466666666655542 2222222221111222 22222211123455554 679999999998 33443344
No 35
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=95.65 E-value=4.8 Score=44.87 Aligned_cols=83 Identities=18% Similarity=0.149 Sum_probs=55.7
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEecccc-CeEEE-EeCCeEEEEecCCceeeceeeecCccceEEEEEEeCCE
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ-GRLLA-GIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDR 996 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~-g~ll~-~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~ 996 (1213)
-|+|.+|+.. +.++.+..+.+.+|+.+.-.+ .+|++ +.+.+|+.|+-...+|...+.-+..+.+--.++..+++
T Consensus 307 dG~i~iyD~a----~~~~R~~c~he~~V~~l~w~~t~~l~t~c~~g~v~~wDaRtG~l~~~y~GH~~~Il~f~ls~~~~~ 382 (399)
T KOG0296|consen 307 DGTIAIYDLA----ASTLRHICEHEDGVTKLKWLNTDYLLTACANGKVRQWDARTGQLKFTYTGHQMGILDFALSPQKRL 382 (399)
T ss_pred cceEEEEecc----cchhheeccCCCceEEEEEcCcchheeeccCceEEeeeccccceEEEEecCchheeEEEEcCCCcE
Confidence 5899999987 566667777888899999888 67766 45668888888777776655443223333344555666
Q ss_pred EE-EeecCCc
Q 000944 997 IY-VGDIQES 1005 (1213)
Q Consensus 997 I~-vgD~~~S 1005 (1213)
++ ++|=..+
T Consensus 383 vvT~s~D~~a 392 (399)
T KOG0296|consen 383 VVTVSDDNTA 392 (399)
T ss_pred EEEecCCCeE
Confidence 66 3343333
No 36
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=95.58 E-value=2.1 Score=45.18 Aligned_cols=174 Identities=16% Similarity=0.170 Sum_probs=108.7
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK 939 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~ 939 (1213)
++++++|-..-. ..-+|+.+-.|.++.. . .++.-+++|+- +|+|.+|++..+...-+++-+
T Consensus 105 gt~kIWdlR~~~--~qR~~~~~spVn~vvl---h--pnQteLis~dq------------sg~irvWDl~~~~c~~~liPe 165 (311)
T KOG0315|consen 105 GTVKIWDLRSLS--CQRNYQHNSPVNTVVL---H--PNQTELISGDQ------------SGNIRVWDLGENSCTHELIPE 165 (311)
T ss_pred ceEEEEeccCcc--cchhccCCCCcceEEe---c--CCcceEEeecC------------CCcEEEEEccCCccccccCCC
Confidence 367888866522 2223444455555543 2 45667778874 799999999986333333333
Q ss_pred EeecCcceEeccc-cCeEEEEeCC--eEEEEecCC----ceeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEE
Q 000944 940 TQVEGIPLALCQF-QGRLLAGIGP--VLRLYDLGK----KRLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCK 1010 (1213)
Q Consensus 940 ~~~~g~V~ai~~~-~g~ll~~~g~--~l~i~~~~~----~~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~ 1010 (1213)
..-+|+++... .|..++|+++ ..|+|++-. ..|.++-.+++...++++.. ....+|...-+-+-+.+
T Consensus 166 --~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~i-- 241 (311)
T KOG0315|consen 166 --DDTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKI-- 241 (311)
T ss_pred --CCcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEE--
Confidence 23566666554 5777777775 588998864 24666655665666777654 34567777777777777
Q ss_pred EeccCCeEEEeeccCCCcceEEEEeecCCee-eeecCCCcEEEEecC
Q 000944 1011 YRRDENQLYIFADDSVPRWLTAAHHIDFDTM-AGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1011 ~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~-l~~D~~gnl~il~~~ 1056 (1213)
|+.+..-..+..-+...||+..|.|-.++.+ +.+..++-..+....
T Consensus 242 wn~~~~~kle~~l~gh~rWvWdc~FS~dg~YlvTassd~~~rlW~~~ 288 (311)
T KOG0315|consen 242 WNTDDFFKLELVLTGHQRWVWDCAFSADGEYLVTASSDHTARLWDLS 288 (311)
T ss_pred EecCCceeeEEEeecCCceEEeeeeccCccEEEecCCCCceeecccc
Confidence 4444543344445567799999998766655 445556666665543
No 37
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=95.38 E-value=5.9 Score=44.19 Aligned_cols=42 Identities=14% Similarity=0.111 Sum_probs=33.2
Q ss_pred EEEeecCcceEeccccCeEEEEeCCeEEEEecCCceeeceee
Q 000944 938 HKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCE 979 (1213)
Q Consensus 938 ~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~~~~~~l~~~~~ 979 (1213)
....+.+.+.++.-...||++...+.|.||++.+.+|++.-.
T Consensus 223 ~~i~W~~~p~~~~~~~pyli~~~~~~iEV~~~~~~~lvQ~i~ 264 (275)
T PF00780_consen 223 STIQWSSAPQSVAYSSPYLIAFSSNSIEVRSLETGELVQTIP 264 (275)
T ss_pred cEEEcCCchhEEEEECCEEEEECCCEEEEEECcCCcEEEEEE
Confidence 334567788888888899999888889999999888866543
No 38
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=95.23 E-value=1.3 Score=51.80 Aligned_cols=157 Identities=17% Similarity=0.209 Sum_probs=82.7
Q ss_pred CccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccC-cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 539 KRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMS-GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 539 ~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~-~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
|..+..-..+.+..+++ ++.+++.|.++ .|.+. ..+..+ .++-|+++.+ -..++++|+.+|.|+.|.-.
T Consensus 136 GRDm~y~~~scDly~~g-sg~evYRlNLE-qGrfL--~P~~~~~~~lN~v~in~------~hgLla~Gt~~g~VEfwDpR 205 (703)
T KOG2321|consen 136 GRDMKYHKPSCDLYLVG-SGSEVYRLNLE-QGRFL--NPFETDSGELNVVSINE------EHGLLACGTEDGVVEFWDPR 205 (703)
T ss_pred CccccccCCCccEEEee-cCcceEEEEcc-ccccc--cccccccccceeeeecC------ccceEEecccCceEEEecch
Confidence 44444444444444444 57789888877 35432 223333 4566777754 34689999999999999752
Q ss_pred CCCceeEeEE-eecCCCC-----ceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecC
Q 000944 618 PDDCMQILSV-QSVSSPP-----ESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGL 691 (1213)
Q Consensus 618 p~~~l~~~~~-~~l~~~p-----~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~ 691 (1213)
.-+....+.. ..+++.| .++.-+.|. .....+-||+.+|.++.|.+....--+ ...+.+.
T Consensus 206 ~ksrv~~l~~~~~v~s~pg~~~~~svTal~F~-----------d~gL~~aVGts~G~v~iyDLRa~~pl~---~kdh~~e 271 (703)
T KOG2321|consen 206 DKSRVGTLDAASSVNSHPGGDAAPSVTALKFR-----------DDGLHVAVGTSTGSVLIYDLRASKPLL---VKDHGYE 271 (703)
T ss_pred hhhhheeeecccccCCCccccccCcceEEEec-----------CCceeEEeeccCCcEEEEEcccCCcee---ecccCCc
Confidence 1111211111 1122222 122223333 246789999999999999986532111 1234555
Q ss_pred CCCeEEEEEECC-eeEEEEecCccEEEEE
Q 000944 692 RPPKLFSVVVGG-RAAMLCLSSRPWLGYI 719 (1213)
Q Consensus 692 ~pv~l~~~~~~~-~~~v~~~g~~p~~i~~ 719 (1213)
-|++...+.-.+ .+.|+.+-.+..-||.
T Consensus 272 ~pi~~l~~~~~~~q~~v~S~Dk~~~kiWd 300 (703)
T KOG2321|consen 272 LPIKKLDWQDTDQQNKVVSMDKRILKIWD 300 (703)
T ss_pred cceeeecccccCCCceEEecchHHhhhcc
Confidence 666655432222 3334333333333543
No 39
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=95.19 E-value=6.8 Score=43.80 Aligned_cols=169 Identities=17% Similarity=0.224 Sum_probs=101.8
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.+++++-+...+++.+.|+. ++..|++ .++.+ +|.- .+-||+|+++ -++++|..
T Consensus 69 ~Lkv~~~Kk~~~ICe~~fpt-----~IL~Vrm---Nr~RL-vV~L-------------ee~IyIydI~----~MklLhTI 122 (391)
T KOG2110|consen 69 KLKVVHFKKKTTICEIFFPT-----SILAVRM---NRKRL-VVCL-------------EESIYIYDIK----DMKLLHTI 122 (391)
T ss_pred eEEEEEcccCceEEEEecCC-----ceEEEEE---ccceE-EEEE-------------cccEEEEecc----cceeehhh
Confidence 68999988888899888874 4455666 33444 4443 2348999997 58888875
Q ss_pred eec-CcceEeccc---cC--eEEEEe---CCeEEEEecCCceeeceeeecCc--cceEEEEEEeCCEEEEeecCCcEEEE
Q 000944 941 QVE-GIPLALCQF---QG--RLLAGI---GPVLRLYDLGKKRLLRKCENKLF--PNTIVSINTYRDRIYVGDIQESFHFC 1009 (1213)
Q Consensus 941 ~~~-g~V~ai~~~---~g--~ll~~~---g~~l~i~~~~~~~l~~~~~~~~~--~~~i~~l~~~~~~I~vgD~~~Sv~~l 1009 (1213)
+.. --+..+|.+ ++ +|..-. ...|++|+..+ |.++..+..+ +..+...+..|.+|.-+- -||- +.
T Consensus 123 ~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~n--l~~v~~I~aH~~~lAalafs~~G~llATAS-eKGT-VI 198 (391)
T KOG2110|consen 123 ETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTIN--LQPVNTINAHKGPLAALAFSPDGTLLATAS-EKGT-VI 198 (391)
T ss_pred hccCCCccceEeeccCCCCceEEecCCCCCceEEEEEccc--ceeeeEEEecCCceeEEEECCCCCEEEEec-cCce-EE
Confidence 543 333333333 32 555532 23788888764 3233333222 333444444555554332 2232 22
Q ss_pred E-E-eccCCeEEEeeccCCCcceEEEEeecCCe-eeeecCCCcEEEEecCCCC
Q 000944 1010 K-Y-RRDENQLYIFADDSVPRWLTAAHHIDFDT-MAGADKFGNIYFVRLPQDV 1059 (1213)
Q Consensus 1010 ~-~-~~~~~~l~~~a~D~~~~~~~~~~~ld~~~-~l~~D~~gnl~il~~~~~~ 1059 (1213)
| | -++..++.++=|-..+...++..|=-.+. +.++-..+.+++|++....
T Consensus 199 RVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~~ 251 (391)
T KOG2110|consen 199 RVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKVS 251 (391)
T ss_pred EEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEecccc
Confidence 2 2 24567888888888887778877744444 4567778999999987644
No 40
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=95.13 E-value=0.91 Score=49.24 Aligned_cols=202 Identities=17% Similarity=0.215 Sum_probs=116.0
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCeEEEEe--CCeEEEEecCCcee
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGRLLAGI--GPVLRLYDLGKKRL 974 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~ll~~~--g~~l~i~~~~~~~l 974 (1213)
..|++||++ .||+++|++..- ++-.+. ...--||+++|-- .|+.+.+. ..++.+|++-+...
T Consensus 35 G~~lAvGc~------------nG~vvI~D~~T~--~iar~l-saH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~ 99 (405)
T KOG1273|consen 35 GDYLAVGCA------------NGRVVIYDFDTF--RIARML-SAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSP 99 (405)
T ss_pred cceeeeecc------------CCcEEEEEcccc--chhhhh-hccccceeEEEecCCCCEeeeecCCceeEEEeccCCCc
Confidence 479999997 699999998863 111111 1123577888775 45554433 35899999987776
Q ss_pred eceeeecCccceEEEEE--Ee-CCEEEEeecCCcEEEEEEeccCCeEEEeeccCCC--cceEEEEeecCC--eeeeecCC
Q 000944 975 LRKCENKLFPNTIVSIN--TY-RDRIYVGDIQESFHFCKYRRDENQLYIFADDSVP--RWLTAAHHIDFD--TMAGADKF 1047 (1213)
Q Consensus 975 ~~~~~~~~~~~~i~~l~--~~-~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~--~~~~~~~~ld~~--~~l~~D~~ 1047 (1213)
+..-. |+++|.... .. .|..++.=+.+|-.+..|.. .+-..+++|... ..+-++.+.|.. .|+.+...
T Consensus 100 l~rir---f~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~--~~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGtsK 174 (405)
T KOG1273|consen 100 LKRIR---FDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSD--PKHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTSK 174 (405)
T ss_pred eeEEE---ccCccceeeeccccCCeEEEEEecCCcEEEEecC--CceeeccCCCccccccccccccccCCCCEEEEecCc
Confidence 55443 344555444 33 47777777777888888864 555556655432 222233344442 56778888
Q ss_pred CcEEEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceEEE-
Q 000944 1048 GNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAM- 1126 (1213)
Q Consensus 1048 gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l- 1126 (1213)
|-|.+|+.. .++.++.|.+-. ++.++..-+.. .+..++.-|.|--|-..
T Consensus 175 Gkllv~~a~----------------------------t~e~vas~rits-~~~IK~I~~s~-~g~~liiNtsDRvIR~ye 224 (405)
T KOG1273|consen 175 GKLLVYDAE----------------------------TLECVASFRITS-VQAIKQIIVSR-KGRFLIINTSDRVIRTYE 224 (405)
T ss_pred ceEEEEecc----------------------------hheeeeeeeech-heeeeEEEEec-cCcEEEEecCCceEEEEe
Confidence 888887642 345666666654 34444433322 33455666666655422
Q ss_pred -EecCCh---hHHHHHHHHHHHHHhcC
Q 000944 1127 -LAFSSR---DDVDFFSHLEMHMRQEH 1149 (1213)
Q Consensus 1127 -~~l~~~---~~~~~L~~lq~~l~~~~ 1149 (1213)
-.|..+ .+.+.-.++|..+.+..
T Consensus 225 ~~di~~~~r~~e~e~~~K~qDvVNk~~ 251 (405)
T KOG1273|consen 225 ISDIDDEGRDGEVEPEHKLQDVVNKLQ 251 (405)
T ss_pred hhhhcccCccCCcChhHHHHHHHhhhh
Confidence 111111 13444567777666543
No 41
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=95.10 E-value=1.5 Score=50.07 Aligned_cols=48 Identities=25% Similarity=0.489 Sum_probs=40.7
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCe-EEEEeCCeEEE
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGR-LLAGIGPVLRL 966 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~-ll~~~g~~l~i 966 (1213)
.|.+-+|.+.++-++++++++..+.|-|.+|+-. +|+ |++|+|+.=.+
T Consensus 401 ~G~vrLW~i~~g~r~i~~l~~ls~~GfVNsl~f~~sgk~ivagiGkEhRl 450 (479)
T KOG0299|consen 401 SGCVRLWKIEDGLRAINLLYSLSLVGFVNSLAFSNSGKRIVAGIGKEHRL 450 (479)
T ss_pred CCceEEEEecCCccccceeeecccccEEEEEEEccCCCEEEEeccccccc
Confidence 6889999999887799999999999999999944 555 99999985544
No 42
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=94.94 E-value=7.2 Score=42.78 Aligned_cols=290 Identities=13% Similarity=0.135 Sum_probs=158.0
Q ss_pred ceEEEEeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCC
Q 000944 747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNG 826 (1213)
Q Consensus 747 ~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 826 (1213)
.+|..+++.+.+|.+++++..+..-+..+-|...-.++| ..|.++++.-..
T Consensus 18 ScFava~~~Gfriyn~~P~ke~~~r~~~~~G~~~veMLf--R~N~laLVGGg~--------------------------- 68 (346)
T KOG2111|consen 18 SCFAVATDTGFRIYNCDPFKESASRQFIDGGFKIVEMLF--RSNYLALVGGGS--------------------------- 68 (346)
T ss_pred ceEEEEecCceEEEecCchhhhhhhccccCchhhhhHhh--hhceEEEecCCC---------------------------
Confidence 578888999999999999744444455554434444443 234444443100
Q ss_pred CcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEee
Q 000944 827 NMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTA 906 (1213)
Q Consensus 827 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~ 906 (1213)
+=-+| ...+-++|......+..++|.. . +..+.|. ...|||-+
T Consensus 69 --------------------~pky~-------pNkviIWDD~k~~~i~el~f~~--~---I~~V~l~----r~riVvvl- 111 (346)
T KOG2111|consen 69 --------------------RPKYP-------PNKVIIWDDLKERCIIELSFNS--E---IKAVKLR----RDRIVVVL- 111 (346)
T ss_pred --------------------CCCCC-------CceEEEEecccCcEEEEEEecc--c---eeeEEEc----CCeEEEEe-
Confidence 00112 1367888876666666666653 2 3445553 34555544
Q ss_pred ecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc----cCeEEEEeCC---eEEEEecCCceeeceee
Q 000944 907 KGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF----QGRLLAGIGP---VLRLYDLGKKRLLRKCE 979 (1213)
Q Consensus 907 ~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~----~g~ll~~~g~---~l~i~~~~~~~l~~~~~ 979 (1213)
..+|+||.... .+++++..++..-+-.+|.+ +..++|.-|. .|.+.++...+.-.-.+
T Consensus 112 ------------~~~I~VytF~~---n~k~l~~~et~~NPkGlC~~~~~~~k~~LafPg~k~GqvQi~dL~~~~~~~p~~ 176 (346)
T KOG2111|consen 112 ------------ENKIYVYTFPD---NPKLLHVIETRSNPKGLCSLCPTSNKSLLAFPGFKTGQVQIVDLASTKPNAPSI 176 (346)
T ss_pred ------------cCeEEEEEcCC---ChhheeeeecccCCCceEeecCCCCceEEEcCCCccceEEEEEhhhcCcCCceE
Confidence 46899999986 46666666664433334433 4567777775 56777765443311111
Q ss_pred ecCccceEEEEE--EeCCEEEEeecCCcEEEEEEecc-CCeEEEeeccCCCcceEEEEeecCCe-eeeecCCCcEEEEec
Q 000944 980 NKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRD-ENQLYIFADDSVPRWLTAAHHIDFDT-MAGADKFGNIYFVRL 1055 (1213)
Q Consensus 980 ~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~-~~~l~~~a~D~~~~~~~~~~~ld~~~-~l~~D~~gnl~il~~ 1055 (1213)
..++.+.|.++. ..|. +++.-..+|--+=-|++. +..+.++-|-..+-.++++.|-.+.. +.++-..|.+++|.+
T Consensus 177 I~AH~s~Iacv~Ln~~Gt-~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~~s~LavsSdKgTlHiF~l 255 (346)
T KOG2111|consen 177 INAHDSDIACVALNLQGT-LVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPNSSWLAVSSDKGTLHIFSL 255 (346)
T ss_pred EEcccCceeEEEEcCCcc-EEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCCccEEEEEcCCCeEEEEEe
Confidence 122334444444 3333 333334455444446665 46688888888888888888865554 567888999999998
Q ss_pred CCCCCcccccCCCCCccc--cccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceEEEE
Q 000944 1056 PQDVSDEIEEDPTGGKIK--WEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAML 1127 (1213)
Q Consensus 1056 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l~ 1127 (1213)
-+......+++ .-... +-.+|+.+ --..++|++.+. +.+.-+- -...+.++....||+.+.+.
T Consensus 256 ~~~~~~~~~~S--Sl~~~~~~lpky~~S----~wS~~~f~l~~~-~~~~~~f--g~~~nsvi~i~~Dgsy~k~~ 320 (346)
T KOG2111|consen 256 RDTENTEDESS--SLSFKRLVLPKYFSS----EWSFAKFQLPQG-TQCIIAF--GSETNTVIAICADGSYYKFK 320 (346)
T ss_pred ecCCCCccccc--cccccccccchhccc----ceeEEEEEccCC-CcEEEEe--cCCCCeEEEEEeCCcEEEEE
Confidence 65332210011 00000 00111111 113456666533 2222110 01236788888999998775
No 43
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.93 E-value=6.9 Score=45.26 Aligned_cols=273 Identities=13% Similarity=0.127 Sum_probs=146.5
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEE
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL 660 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~L 660 (1213)
...|+++++.+.. +-.+.+.+ +-.+.||+...-+....++. ......|+.+- ...-.|
T Consensus 26 ~~~vssl~fsp~~----P~d~aVt~--S~rvqly~~~~~~~~k~~sr--Fk~~v~s~~fR--------------~DG~Ll 83 (487)
T KOG0310|consen 26 HNSVSSLCFSPKH----PYDFAVTS--SVRVQLYSSVTRSVRKTFSR--FKDVVYSVDFR--------------SDGRLL 83 (487)
T ss_pred cCcceeEecCCCC----CCceEEec--ccEEEEEecchhhhhhhHHh--hccceeEEEee--------------cCCeEE
Confidence 3568888887643 23333333 45678887631111111110 01122333332 234567
Q ss_pred EEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCccEEEE-EeCCeEEEEecCc--ccccee
Q 000944 661 NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGY-IHRGRFLLTPLSY--ETLEYA 737 (1213)
Q Consensus 661 ligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~~i~-~~~~~~~~~~~~~--~~v~~~ 737 (1213)
.+|-..|.+-.|...+. ..+ +...--+.|++..+|...+...+..++|....-| .-.+.....-+.. +-+.+.
T Consensus 84 aaGD~sG~V~vfD~k~r-~iL---R~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g 159 (487)
T KOG0310|consen 84 AAGDESGHVKVFDMKSR-VIL---RQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCG 159 (487)
T ss_pred EccCCcCcEEEeccccH-HHH---HHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEEEecCCcceeEee
Confidence 88999999977764321 011 1122235677777765444434444445544333 2223222111211 113222
Q ss_pred eccccCCCCceEEEEe---CCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHh
Q 000944 738 ASFSSDQCVEGVVSVA---GNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKEC 814 (1213)
Q Consensus 738 ~~f~~~~~~~~~i~~~---~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 814 (1213)
.++.. +.-+.++ ++.+++-.+-..+ -.+..+.=|.-...++|.|.-..++-+.
T Consensus 160 -~~~~~---~~hivvtGsYDg~vrl~DtR~~~--~~v~elnhg~pVe~vl~lpsgs~iasAg------------------ 215 (487)
T KOG0310|consen 160 -DISPA---NDHIVVTGSYDGKVRLWDTRSLT--SRVVELNHGCPVESVLALPSGSLIASAG------------------ 215 (487)
T ss_pred -ccccC---CCeEEEecCCCceEEEEEeccCC--ceeEEecCCCceeeEEEcCCCCEEEEcC------------------
Confidence 11111 2334555 3455544443321 2345555566777777777544433221
Q ss_pred hHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEecc
Q 000944 815 FEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHD 894 (1213)
Q Consensus 815 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~ 894 (1213)
.+.++++|-.++..+-.-.+..+.+|+|++...
T Consensus 216 --------------------------------------------Gn~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s--- 248 (487)
T KOG0310|consen 216 --------------------------------------------GNSVKVWDLTTGGQLLTSMFNHNKTVTCLRLAS--- 248 (487)
T ss_pred --------------------------------------------CCeEEEEEecCCceehhhhhcccceEEEEEeec---
Confidence 126889998766544444455788999998754
Q ss_pred CCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc--CeEEEEeCCeEEEEe
Q 000944 895 KEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ--GRLLAGIGPVLRLYD 968 (1213)
Q Consensus 895 ~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~--g~ll~~~g~~l~i~~ 968 (1213)
++..++-.|- -|++.+|++. .+|++|...++|||.+|.... ..+++|.++-+..+.
T Consensus 249 -~~~rLlS~sL-------------D~~VKVfd~t----~~Kvv~s~~~~~pvLsiavs~dd~t~viGmsnGlv~~r 306 (487)
T KOG0310|consen 249 -DSTRLLSGSL-------------DRHVKVFDTT----NYKVVHSWKYPGPVLSIAVSPDDQTVVIGMSNGLVSIR 306 (487)
T ss_pred -CCceEeeccc-------------ccceEEEEcc----ceEEEEeeecccceeeEEecCCCceEEEecccceeeee
Confidence 2234444333 3778899965 799999999999999988775 578888888665544
No 44
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=94.87 E-value=8.4 Score=43.14 Aligned_cols=131 Identities=19% Similarity=0.263 Sum_probs=84.6
Q ss_pred EecCCEEEEEEe-CCEEEEEEEccC-CCeEEeeeec-----cCc--ceEEEEeeecCCCceeeeEEEEEEe-CCcEEEEE
Q 000944 546 GSNRLQVVIALS-GGELIYFEVDMT-GQLLEVEKHE-----MSG--DVACLDIASVPEGRKRSRFLAVGSY-DNTIRILS 615 (1213)
Q Consensus 546 s~~~~~v~v~~s-~~~l~~l~~~~~-~~l~~~~~~~-----l~~--~is~i~i~~~~~~~~~~~~l~v~~~-~~~i~i~s 615 (1213)
.-++.++.+..+ ++++.++++++. |++.+++.+. ++. ..++|.+.+ ..+||.++-. ..+|.+|.
T Consensus 199 Hpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~------dGrFLYasNRg~dsI~~f~ 272 (346)
T COG2706 199 HPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISP------DGRFLYASNRGHDSIAVFS 272 (346)
T ss_pred cCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECC------CCCEEEEecCCCCeEEEEE
Confidence 334455554433 678888888865 6777764432 233 346666654 4688988876 56999999
Q ss_pred eCCCCc-eeEeEEeecCCC-CceeEEEEeecccCCCCCCCCCCceEEEEEeeC-CeEEEEEEeCCCCcccccceeeecCC
Q 000944 616 LDPDDC-MQILSVQSVSSP-PESLLFLEVQASVGGEDGADHPASLFLNAGLQN-GVLFRTVVDMVTGQLSDSRSRFLGLR 692 (1213)
Q Consensus 616 l~p~~~-l~~~~~~~l~~~-p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~-G~l~~~~~~~~~~~l~~~~~~~lG~~ 692 (1213)
++|+.. |+.+...+.... ||+.-+.. +..+|++...+ ..+..|+.++.+|.|.......-+..
T Consensus 273 V~~~~g~L~~~~~~~teg~~PR~F~i~~--------------~g~~Liaa~q~sd~i~vf~~d~~TG~L~~~~~~~~~p~ 338 (346)
T COG2706 273 VDPDGGKLELVGITPTEGQFPRDFNINP--------------SGRFLIAANQKSDNITVFERDKETGRLTLLGRYAVVPE 338 (346)
T ss_pred EcCCCCEEEEEEEeccCCcCCccceeCC--------------CCCEEEEEccCCCcEEEEEEcCCCceEEecccccCCCC
Confidence 998754 666655555454 88864422 34456666554 67889999999998766555555556
Q ss_pred CCeE
Q 000944 693 PPKL 696 (1213)
Q Consensus 693 pv~l 696 (1213)
|+-+
T Consensus 339 Pvcv 342 (346)
T COG2706 339 PVCV 342 (346)
T ss_pred cEEE
Confidence 6543
No 45
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=94.81 E-value=7.8 Score=42.53 Aligned_cols=75 Identities=28% Similarity=0.351 Sum_probs=51.0
Q ss_pred CCccEEEEEecCCEEEEE--EeCCEEEEEEEccCCC-eEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEE
Q 000944 538 GKRTIVKVGSNRLQVVIA--LSGGELIYFEVDMTGQ-LLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRIL 614 (1213)
Q Consensus 538 ~~~~I~~as~~~~~v~v~--~s~~~l~~l~~~~~~~-l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~ 614 (1213)
....|.+.+.+-+--+|| ...|+|+.+-=..+|+ +.|..+=.-..+|-||++.+ .+.|++|+...|+++||
T Consensus 180 H~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp------~~s~LavsSdKgTlHiF 253 (346)
T KOG2111|consen 180 HDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSP------NSSWLAVSSDKGTLHIF 253 (346)
T ss_pred ccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCC------CccEEEEEcCCCeEEEE
Confidence 445688888875443444 3457776543233454 55654333456799999975 58899999999999999
Q ss_pred EeCC
Q 000944 615 SLDP 618 (1213)
Q Consensus 615 sl~p 618 (1213)
+|.+
T Consensus 254 ~l~~ 257 (346)
T KOG2111|consen 254 SLRD 257 (346)
T ss_pred Eeec
Confidence 9954
No 46
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=94.81 E-value=0.77 Score=56.06 Aligned_cols=110 Identities=15% Similarity=0.201 Sum_probs=77.8
Q ss_pred cEEEEEecCCEEEEEEe-CCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCC
Q 000944 541 TIVKVGSNRLQVVIALS-GGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPD 619 (1213)
Q Consensus 541 ~I~~as~~~~~v~v~~s-~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~ 619 (1213)
.|...+...++.+|..+ |.++.+.++..+..|....+.+ =|||+++.|++ ..+++-|+-||.++||++ ++
T Consensus 371 DILDlSWSKn~fLLSSSMDKTVRLWh~~~~~CL~~F~Hnd---fVTcVaFnPvD-----DryFiSGSLD~KvRiWsI-~d 441 (712)
T KOG0283|consen 371 DILDLSWSKNNFLLSSSMDKTVRLWHPGRKECLKVFSHND---FVTCVAFNPVD-----DRYFISGSLDGKVRLWSI-SD 441 (712)
T ss_pred hheecccccCCeeEeccccccEEeecCCCcceeeEEecCC---eeEEEEecccC-----CCcEeecccccceEEeec-Cc
Confidence 58888888877777644 7788888877554554444443 39999999875 568889999999999999 76
Q ss_pred CceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 620 DCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 620 ~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
... +.-.++..+.+++++.. ..-+.+||+-+|....|...
T Consensus 442 ~~V--v~W~Dl~~lITAvcy~P--------------dGk~avIGt~~G~C~fY~t~ 481 (712)
T KOG0283|consen 442 KKV--VDWNDLRDLITAVCYSP--------------DGKGAVIGTFNGYCRFYDTE 481 (712)
T ss_pred Cee--EeehhhhhhheeEEecc--------------CCceEEEEEeccEEEEEEcc
Confidence 333 32223444455555432 34578899999999988753
No 47
>PLN00181 protein SPA1-RELATED; Provisional
Probab=94.53 E-value=12 Score=48.85 Aligned_cols=131 Identities=11% Similarity=0.156 Sum_probs=82.8
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEeccc--cCeEEEEeC--CeEEEEecCCceeeceeeecCccceEEEEEE--
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF--QGRLLAGIG--PVLRLYDLGKKRLLRKCENKLFPNTIVSINT-- 992 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~--~g~ll~~~g--~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~-- 992 (1213)
.|.|.+|++... ..+..+ ....++|++++-- ++.++++.+ ..|++|++...+.... +.. ...+.++..
T Consensus 554 Dg~v~lWd~~~~-~~~~~~--~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~--~~~-~~~v~~v~~~~ 627 (793)
T PLN00181 554 EGVVQVWDVARS-QLVTEM--KEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGT--IKT-KANICCVQFPS 627 (793)
T ss_pred CCeEEEEECCCC-eEEEEe--cCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEE--Eec-CCCeEEEEEeC
Confidence 478999998753 122222 3467899998764 455555444 4799999976543322 221 223445543
Q ss_pred -eCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecCC
Q 000944 993 -YRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 993 -~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
.++++++|..-..+.++..+.....+..+. .+...++++.+.+.+.++.+..+|.+.+++...
T Consensus 628 ~~g~~latgs~dg~I~iwD~~~~~~~~~~~~--~h~~~V~~v~f~~~~~lvs~s~D~~ikiWd~~~ 691 (793)
T PLN00181 628 ESGRSLAFGSADHKVYYYDLRNPKLPLCTMI--GHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSM 691 (793)
T ss_pred CCCCEEEEEeCCCeEEEEECCCCCccceEec--CCCCCEEEEEEeCCCEEEEEECCCEEEEEeCCC
Confidence 367899998888888765543221222222 234567777787777788889999999998764
No 48
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=94.52 E-value=4.3 Score=49.52 Aligned_cols=90 Identities=24% Similarity=0.409 Sum_probs=55.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
++|++|..++..+-.|. ..--.|++++ +. ....|++-|.. -|+|.+|++..+ ..++....
T Consensus 558 tVRlWDv~~G~~VRiF~-GH~~~V~al~---~S--p~Gr~LaSg~e------------d~~I~iWDl~~~-~~v~~l~~- 617 (707)
T KOG0263|consen 558 TVRLWDVSTGNSVRIFT-GHKGPVTALA---FS--PCGRYLASGDE------------DGLIKIWDLANG-SLVKQLKG- 617 (707)
T ss_pred eEEEEEcCCCcEEEEec-CCCCceEEEE---Ec--CCCceEeeccc------------CCcEEEEEcCCC-cchhhhhc-
Confidence 68888888887765541 1112333443 33 23467776653 589999999875 12332222
Q ss_pred eecCcceEecc-ccCeEEEE--eCCeEEEEecCC
Q 000944 941 QVEGIPLALCQ-FQGRLLAG--IGPVLRLYDLGK 971 (1213)
Q Consensus 941 ~~~g~V~ai~~-~~g~ll~~--~g~~l~i~~~~~ 971 (1213)
..|.|++|.- ..|.++|+ .++.|.+|++..
T Consensus 618 -Ht~ti~SlsFS~dg~vLasgg~DnsV~lWD~~~ 650 (707)
T KOG0263|consen 618 -HTGTIYSLSFSRDGNVLASGGADNSVRLWDLTK 650 (707)
T ss_pred -ccCceeEEEEecCCCEEEecCCCCeEEEEEchh
Confidence 2899988864 35555553 346899999864
No 49
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=94.39 E-value=0.77 Score=52.57 Aligned_cols=248 Identities=14% Similarity=0.126 Sum_probs=142.2
Q ss_pred EEEeCCeEEEEecCccccceeeccccCCCCceEEEEe-CCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCC-ceEEE
Q 000944 717 GYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVA-GNALRVFTIERLGETFNETALPLRYTPRRFVLQPKK-KLMVI 794 (1213)
Q Consensus 717 i~~~~~~~~~~~~~~~~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~-~~~~v 794 (1213)
+|.+++.++...-..++|..++. ..|+..|+.++ +..|++-..+ .+-...+..++..|.-+-+||+. +.|++
T Consensus 244 vy~~~~~lrtf~gH~k~Vrd~~~---s~~g~~fLS~sfD~~lKlwDtE---TG~~~~~f~~~~~~~cvkf~pd~~n~fl~ 317 (503)
T KOG0282|consen 244 VYDDRRCLRTFKGHRKPVRDASF---NNCGTSFLSASFDRFLKLWDTE---TGQVLSRFHLDKVPTCVKFHPDNQNIFLV 317 (503)
T ss_pred EecCcceehhhhcchhhhhhhhc---cccCCeeeeeecceeeeeeccc---cceEEEEEecCCCceeeecCCCCCcEEEE
Confidence 44545555544333334444332 22455677665 5566554444 44678899999999999999988 77777
Q ss_pred EEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEE
Q 000944 795 IETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTC 874 (1213)
Q Consensus 795 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~ 874 (1213)
..++ +.|+.+|..+++++.
T Consensus 318 G~sd-------------------------------------------------------------~ki~~wDiRs~kvvq 336 (503)
T KOG0282|consen 318 GGSD-------------------------------------------------------------KKIRQWDIRSGKVVQ 336 (503)
T ss_pred ecCC-------------------------------------------------------------CcEEEEeccchHHHH
Confidence 6542 246677777777665
Q ss_pred EEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE-eecCcceEecccc
Q 000944 875 LLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT-QVEGIPLALCQFQ 953 (1213)
Q Consensus 875 ~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~-~~~g~V~ai~~~~ 953 (1213)
.|. ++.-|+..++|-. +++.| + .|+ ++ +.+.+|+...+ -.++++... ...-|..++.+..
T Consensus 337 eYd----~hLg~i~~i~F~~-~g~rF-i-ssS-------Dd----ks~riWe~~~~-v~ik~i~~~~~hsmP~~~~~P~~ 397 (503)
T KOG0282|consen 337 EYD----RHLGAILDITFVD-EGRRF-I-SSS-------DD----KSVRIWENRIP-VPIKNIADPEMHTMPCLTLHPNG 397 (503)
T ss_pred HHH----hhhhheeeeEEcc-CCceE-e-eec-------cC----ccEEEEEcCCC-ccchhhcchhhccCcceecCCCC
Confidence 442 3444555555543 22333 2 222 12 26778887653 245554433 3345666677777
Q ss_pred CeEEE-EeCCeEEEEecCCc-eeeceeeec--CccceEEEEEE--eCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCC
Q 000944 954 GRLLA-GIGPVLRLYDLGKK-RLLRKCENK--LFPNTIVSINT--YRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVP 1027 (1213)
Q Consensus 954 g~ll~-~~g~~l~i~~~~~~-~l~~~~~~~--~~~~~i~~l~~--~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~ 1027 (1213)
+.+++ ++++.+.+|....+ ++.++-.+. ..+-|.+.+.. .+.+++-||.--.+.++.|++. ++.-.-+-. .
T Consensus 398 ~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~vaGys~~v~fSpDG~~l~SGdsdG~v~~wdwkt~--kl~~~lkah-~ 474 (503)
T KOG0282|consen 398 KWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSVAGYSCQVDFSPDGRTLCSGDSDGKVNFWDWKTT--KLVSKLKAH-D 474 (503)
T ss_pred CeehhhccCceEEEEecccccccCHhhhhcceeccCceeeEEEcCCCCeEEeecCCccEEEeechhh--hhhhccccC-C
Confidence 77766 77889999876532 232221111 12335555554 5899999999999999988753 221111111 1
Q ss_pred cceEEEEe--ecCCeeeeecCCCcEEEE
Q 000944 1028 RWLTAAHH--IDFDTMAGADKFGNIYFV 1053 (1213)
Q Consensus 1028 ~~~~~~~~--ld~~~~l~~D~~gnl~il 1053 (1213)
.-|+.+.. ...+.++.++-+|-|.++
T Consensus 475 ~~ci~v~wHP~e~Skvat~~w~G~Ikiw 502 (503)
T KOG0282|consen 475 QPCIGVDWHPVEPSKVATCGWDGLIKIW 502 (503)
T ss_pred cceEEEEecCCCcceeEecccCceeEec
Confidence 11333332 223457778888887764
No 50
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=94.07 E-value=9.3 Score=43.95 Aligned_cols=138 Identities=16% Similarity=0.114 Sum_probs=80.5
Q ss_pred CCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCccEEE-EE--eCCeEEEEecCc
Q 000944 655 PASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLG-YI--HRGRFLLTPLSY 731 (1213)
Q Consensus 655 ~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~~i-~~--~~~~~~~~~~~~ 731 (1213)
...+.|+++--||.+-.|.+|-..... ....++-.-|+.-..+.-.|..-+|..|.++++. |. ....-.++|+..
T Consensus 223 p~~plllvaG~d~~lrifqvDGk~N~~--lqS~~l~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g 300 (514)
T KOG2055|consen 223 PTAPLLLVAGLDGTLRIFQVDGKVNPK--LQSIHLEKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYG 300 (514)
T ss_pred CCCceEEEecCCCcEEEEEecCccChh--heeeeeccCccceeeecCCCceEEEecccceEEEEeeccccccccccCCCC
Confidence 367888899999999999997432211 1223444555554444445677888888888764 43 223334555543
Q ss_pred cccceeeccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEE
Q 000944 732 ETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIE 796 (1213)
Q Consensus 732 ~~v~~~~~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~ 796 (1213)
.+=..+--|-...+ ..||.+.++.=.|--+... .+-.+-++.+.+..+-++++.+.+.+++++
T Consensus 301 ~e~~~~e~FeVShd-~~fia~~G~~G~I~lLhak-T~eli~s~KieG~v~~~~fsSdsk~l~~~~ 363 (514)
T KOG2055|consen 301 VEEKSMERFEVSHD-SNFIAIAGNNGHIHLLHAK-TKELITSFKIEGVVSDFTFSSDSKELLASG 363 (514)
T ss_pred cccchhheeEecCC-CCeEEEcccCceEEeehhh-hhhhhheeeeccEEeeEEEecCCcEEEEEc
Confidence 33222333322222 2367666543333333332 234567788889999999988887766654
No 51
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=93.63 E-value=1.2 Score=47.74 Aligned_cols=151 Identities=17% Similarity=0.129 Sum_probs=107.4
Q ss_pred CCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc------CeEEEEeCCeEEEEec
Q 000944 896 EHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ------GRLLAGIGPVLRLYDL 969 (1213)
Q Consensus 896 ~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~------g~ll~~~g~~l~i~~~ 969 (1213)
..+..++||+-... .+-+|.+..+++++..+.....++++-|++.+--+- ..|+++.|..|++|+.
T Consensus 57 ~~~~rla~gS~~Ee--------~~Nkvqiv~ld~~s~e~~~~a~fd~~YP~tK~~wiPd~~g~~pdlLATs~D~LRlWri 128 (364)
T KOG0290|consen 57 DKKFRLAVGSFIEE--------YNNKVQIVQLDEDSGELVEDANFDHPYPVTKLMWIPDSKGVYPDLLATSSDFLRLWRI 128 (364)
T ss_pred CcceeEEEeeeccc--------cCCeeEEEEEccCCCceeccCCCCCCCCccceEecCCccccCcchhhcccCeEEEEec
Confidence 56799999996532 357888888888766777777789999999886653 3589999999999998
Q ss_pred C--Cceeeceeeec-----CccceEEEEEE---eCCEEEEeecCCcEEEEEEeccCC---eEEEeeccCCCcceEEEEee
Q 000944 970 G--KKRLLRKCENK-----LFPNTIVSINT---YRDRIYVGDIQESFHFCKYRRDEN---QLYIFADDSVPRWLTAAHHI 1036 (1213)
Q Consensus 970 ~--~~~l~~~~~~~-----~~~~~i~~l~~---~~~~I~vgD~~~Sv~~l~~~~~~~---~l~~~a~D~~~~~~~~~~~l 1036 (1213)
. +.++...+.+. .++..+++.+- .-++|.+.-+-.-.+++....+.. +-.++|.| +.|..+.|+
T Consensus 129 ~~ee~~~~~~~~L~~~kns~~~aPlTSFDWne~dp~~igtSSiDTTCTiWdie~~~~~~vkTQLIAHD---KEV~DIaf~ 205 (364)
T KOG0290|consen 129 GDEESRVELQSVLNNNKNSEFCAPLTSFDWNEVDPNLIGTSSIDTTCTIWDIETGVSGTVKTQLIAHD---KEVYDIAFL 205 (364)
T ss_pred cCcCCceehhhhhccCcccccCCcccccccccCCcceeEeecccCeEEEEEEeeccccceeeEEEecC---cceeEEEec
Confidence 7 44433322221 23445677653 458888888888888877765422 45677776 567888888
Q ss_pred cCC--eeeeecCCCcEEEEecCC
Q 000944 1037 DFD--TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1037 d~~--~~l~~D~~gnl~il~~~~ 1057 (1213)
-.+ .|...-.+|.+++|++-.
T Consensus 206 ~~s~~~FASvgaDGSvRmFDLR~ 228 (364)
T KOG0290|consen 206 KGSRDVFASVGADGSVRMFDLRS 228 (364)
T ss_pred cCccceEEEecCCCcEEEEEecc
Confidence 644 466677899999998754
No 52
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=93.42 E-value=14 Score=40.42 Aligned_cols=138 Identities=15% Similarity=0.256 Sum_probs=77.8
Q ss_pred CceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc----cCe-EEE-EeCCeEEEEecC
Q 000944 897 HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF----QGR-LLA-GIGPVLRLYDLG 970 (1213)
Q Consensus 897 ~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~----~g~-ll~-~~g~~l~i~~~~ 970 (1213)
+.+||++||+ +|.|++|+.. .++.+++..+.. +++|.++ +|+ |+. +....|+.|+..
T Consensus 164 ~g~yIitGts------------KGkllv~~a~----t~e~vas~rits-~~~IK~I~~s~~g~~liiNtsDRvIR~ye~~ 226 (405)
T KOG1273|consen 164 RGKYIITGTS------------KGKLLVYDAE----TLECVASFRITS-VQAIKQIIVSRKGRFLIINTSDRVIRTYEIS 226 (405)
T ss_pred CCCEEEEecC------------cceEEEEecc----hheeeeeeeech-heeeeEEEEeccCcEEEEecCCceEEEEehh
Confidence 4489999997 7999999976 567777766544 5666554 454 443 455567888875
Q ss_pred Cc-------eeeceeeecC----ccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCC
Q 000944 971 KK-------RLLRKCENKL----FPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039 (1213)
Q Consensus 971 ~~-------~l~~~~~~~~----~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~ 1039 (1213)
+- .+.+.-.++. .+-.-.+.+..+.||+.|. .+.-.+|-|...-+.|+.+=.-...- +++|-+
T Consensus 227 di~~~~r~~e~e~~~K~qDvVNk~~Wk~ccfs~dgeYv~a~s-~~aHaLYIWE~~~GsLVKILhG~kgE-----~l~DV~ 300 (405)
T KOG1273|consen 227 DIDDEGRDGEVEPEHKLQDVVNKLQWKKCCFSGDGEYVCAGS-ARAHALYIWEKSIGSLVKILHGTKGE-----ELLDVN 300 (405)
T ss_pred hhcccCccCCcChhHHHHHHHhhhhhhheeecCCccEEEecc-ccceeEEEEecCCcceeeeecCCchh-----heeecc
Confidence 21 1111000000 0111122233578888888 44455666776666766543211100 122221
Q ss_pred -----eeeeecCCCcEEEEecCC
Q 000944 1040 -----TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1040 -----~~l~~D~~gnl~il~~~~ 1057 (1213)
.++++=..|++++.....
T Consensus 301 whp~rp~i~si~sg~v~iw~~~~ 323 (405)
T KOG1273|consen 301 WHPVRPIIASIASGVVYIWAVVQ 323 (405)
T ss_pred cccceeeeeeccCCceEEEEeec
Confidence 356667889999877543
No 53
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=93.31 E-value=14 Score=39.84 Aligned_cols=263 Identities=15% Similarity=0.164 Sum_probs=148.1
Q ss_pred eEEEEe---CCeEEEEEEccCCCeeEEEEEe---CCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCC
Q 000944 748 GVVSVA---GNALRVFTIERLGETFNETALP---LRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMG 821 (1213)
Q Consensus 748 ~~i~~~---~~~L~i~~l~~~~~~~~~r~i~---l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 821 (1213)
|.++++ ++.++|-..... ..|..|++- =.++.|++++.|..++++.+..+-
T Consensus 27 g~ilAscg~Dk~vriw~~~~~-~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~---------------------- 83 (312)
T KOG0645|consen 27 GVILASCGTDKAVRIWSTSSG-DSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDA---------------------- 83 (312)
T ss_pred ceEEEeecCCceEEEEecCCC-CcEEEEEeccccchheeeeeeecCCCcEEEEeeccc----------------------
Confidence 444444 468888877763 457777664 247999999999999777665321
Q ss_pred CCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEe--CCCCceEEEEEcCCCceEEEEEEEEeccCCCce
Q 000944 822 ENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLD--PRSANTTCLLELQDNEAAFSICTVNFHDKEHGT 899 (1213)
Q Consensus 822 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d--~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~ 899 (1213)
+.-++. ..+|++++.+|=.++| ++.+.+.. ...
T Consensus 84 ---------------------------------------t~~Iw~k~~~efecv~~lEGHEnE----VK~Vaws~--sG~ 118 (312)
T KOG0645|consen 84 ---------------------------------------TVVIWKKEDGEFECVATLEGHENE----VKCVAWSA--SGN 118 (312)
T ss_pred ---------------------------------------eEEEeecCCCceeEEeeeeccccc----eeEEEEcC--CCC
Confidence 111111 2346677776666666 34455543 336
Q ss_pred EEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEE-----EEEEeecCcceEeccccCeEEE-EeCCeEEEEecC-Cc
Q 000944 900 LLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLEL-----LHKTQVEGIPLALCQFQGRLLA-GIGPVLRLYDLG-KK 972 (1213)
Q Consensus 900 ~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~-----~~~~~~~g~V~ai~~~~g~ll~-~~g~~l~i~~~~-~~ 972 (1213)
|++-.+-- =.+.++++++++ .++. -|..++++.+- .+-.+-|+. +..++|++|+.. +.
T Consensus 119 ~LATCSRD------------KSVWiWe~dedd-Efec~aVL~~HtqDVK~V~W--HPt~dlL~S~SYDnTIk~~~~~~dd 183 (312)
T KOG0645|consen 119 YLATCSRD------------KSVWIWEIDEDD-EFECIAVLQEHTQDVKHVIW--HPTEDLLFSCSYDNTIKVYRDEDDD 183 (312)
T ss_pred EEEEeeCC------------CeEEEEEecCCC-cEEEEeeeccccccccEEEE--cCCcceeEEeccCCeEEEEeecCCC
Confidence 77765521 126788988642 2222 13344443321 222334443 234789999877 66
Q ss_pred eeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcE
Q 000944 973 RLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNI 1050 (1213)
Q Consensus 973 ~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl 1050 (1213)
.+.-.+.++.....+-++. ..+++++.++=-.-+.+.++-.+ + ...+.+.++.+. .+++.|..+=.++-|
T Consensus 184 dW~c~~tl~g~~~TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~~~------~-~~~~sr~~Y~v~-W~~~~IaS~ggD~~i 255 (312)
T KOG0645|consen 184 DWECVQTLDGHENTVWSLAFDNIGSRLVSCSDDGTVSIWRLYTD------L-SGMHSRALYDVP-WDNGVIASGGGDDAI 255 (312)
T ss_pred CeeEEEEecCccceEEEEEecCCCceEEEecCCcceEeeeeccC------c-chhcccceEeee-ecccceEeccCCCEE
Confidence 6766777775444555554 44678887777667777664311 1 122334444433 224566666677888
Q ss_pred EEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceE
Q 000944 1051 YFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLG 1124 (1213)
Q Consensus 1051 ~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig 1124 (1213)
.+|+-...+. +| ..+|...-+--.+.-||+++= .|..++.++.++-+|.+-
T Consensus 256 ~lf~~s~~~d-----~p---------------~~~l~~~~~~aHe~dVNsV~w---~p~~~~~L~s~~DDG~v~ 306 (312)
T KOG0645|consen 256 RLFKESDSPD-----EP---------------SWNLLAKKEGAHEVDVNSVQW---NPKVSNRLASGGDDGIVN 306 (312)
T ss_pred EEEEecCCCC-----Cc---------------hHHHHHhhhcccccccceEEE---cCCCCCceeecCCCceEE
Confidence 8888763221 12 122222111122346777663 344567888888888775
No 54
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=93.05 E-value=22 Score=41.48 Aligned_cols=137 Identities=16% Similarity=0.239 Sum_probs=80.5
Q ss_pred eEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEe--ccccCeEEEEe-CCeEEEEecCCc--e
Q 000944 899 TLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLAL--CQFQGRLLAGI-GPVLRLYDLGKK--R 973 (1213)
Q Consensus 899 ~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai--~~~~g~ll~~~-g~~l~i~~~~~~--~ 973 (1213)
..+++||+ .||.++++.... .+-.++.. +.+..++ .+-+.+|+++. .+.||+|..++. +
T Consensus 419 g~va~Gt~------------~G~w~V~d~e~~--~lv~~~~d--~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~g~~ 482 (626)
T KOG2106|consen 419 GVVAVGTA------------TGRWFVLDTETQ--DLVTIHTD--NEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSANGRK 482 (626)
T ss_pred ceEEEeec------------cceEEEEecccc--eeEEEEec--CCceEEEEEcCCCCEEEEecCCCeEEEEEECCCCcE
Confidence 39999997 699999998763 56666654 4455444 33456776654 468999999853 4
Q ss_pred eeceeeecCccceEEEEEEe--CCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEE---Eee----cCC-----
Q 000944 974 LLRKCENKLFPNTIVSINTY--RDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAA---HHI----DFD----- 1039 (1213)
Q Consensus 974 l~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~---~~l----d~~----- 1039 (1213)
+.+..... .++|+.|+.. .+|+. +-.-+ ..+|-|.+..-+..--.||. .|.+.- .|. ..+
T Consensus 483 y~r~~k~~--gs~ithLDwS~Ds~~~~-~~S~d-~eiLyW~~~~~~~~ts~kDv--kW~t~~c~lGF~v~g~s~~t~i~a 556 (626)
T KOG2106|consen 483 YSRVGKCS--GSPITHLDWSSDSQFLV-SNSGD-YEILYWKPSECKQITSVKDV--KWATYTCTLGFEVFGGSDGTDINA 556 (626)
T ss_pred EEEeeeec--CceeEEeeecCCCceEE-eccCc-eEEEEEccccCcccceecce--eeeeeEEEEEEEEecccCCchHHH
Confidence 44443333 3789999753 34443 32222 34555654332222224554 344422 121 111
Q ss_pred --------eeeeecCCCcEEEEecCC
Q 000944 1040 --------TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1040 --------~~l~~D~~gnl~il~~~~ 1057 (1213)
.+..+|.+|-+++|+|+=
T Consensus 557 ~~rs~~~~~lA~gdd~g~v~lf~yPc 582 (626)
T KOG2106|consen 557 VARSHCEKLLASGDDFGKVHLFSYPC 582 (626)
T ss_pred hhhhhhhhhhhccccCceEEEEcccc
Confidence 246799999999999874
No 55
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=93.00 E-value=17 Score=39.93 Aligned_cols=282 Identities=16% Similarity=0.148 Sum_probs=149.5
Q ss_pred cEEEEEec--CCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCC
Q 000944 541 TIVKVGSN--RLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDP 618 (1213)
Q Consensus 541 ~I~~as~~--~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p 618 (1213)
.|+..... .+.++++..+|++.++++..+. + ..+......+.+.++.+ ...+++|.-||.|+.+.++.
T Consensus 15 ~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~-l--~~~~~~~~plL~c~F~d-------~~~~~~G~~dg~vr~~Dln~ 84 (323)
T KOG1036|consen 15 GISSVKFSPSSSDLLVSSWDGSLRLYDVPANS-L--KLKFKHGAPLLDCAFAD-------ESTIVTGGLDGQVRRYDLNT 84 (323)
T ss_pred ceeeEEEcCcCCcEEEEeccCcEEEEeccchh-h--hhheecCCceeeeeccC-------CceEEEeccCceEEEEEecC
Confidence 47766665 5678888899999999887431 1 13344556777777743 44688999999999999964
Q ss_pred CCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeec--CCCCeE
Q 000944 619 DDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLG--LRPPKL 696 (1213)
Q Consensus 619 ~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG--~~pv~l 696 (1213)
...... ... ..|. .++.... ...-++.|.=|+.+-.+.... ....| .++=+.
T Consensus 85 ~~~~~i-gth---~~~i-~ci~~~~------------~~~~vIsgsWD~~ik~wD~R~---------~~~~~~~d~~kkV 138 (323)
T KOG1036|consen 85 GNEDQI-GTH---DEGI-RCIEYSY------------EVGCVISGSWDKTIKFWDPRN---------KVVVGTFDQGKKV 138 (323)
T ss_pred Ccceee-ccC---CCce-EEEEeec------------cCCeEEEcccCccEEEEeccc---------cccccccccCceE
Confidence 322211 100 0111 1122111 122344444455543332211 00111 112244
Q ss_pred EEEEECCeeEEEEecCccEEEEEeCCeE-----EEEecCccccceeeccccCCCCceEEEEe-CCeEEEEEEccC----C
Q 000944 697 FSVVVGGRAAMLCLSSRPWLGYIHRGRF-----LLTPLSYETLEYAASFSSDQCVEGVVSVA-GNALRVFTIERL----G 766 (1213)
Q Consensus 697 ~~~~~~~~~~v~~~g~~p~~i~~~~~~~-----~~~~~~~~~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~~----~ 766 (1213)
..+...+...|+.+.++-.++|.-++.= +-+++.. .+.+++-|.+ ..|+++-+ ++.+.+--+++- .
T Consensus 139 y~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~lky-qtR~v~~~pn---~eGy~~sSieGRVavE~~d~s~~~~s 214 (323)
T KOG1036|consen 139 YCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSLKY-QTRCVALVPN---GEGYVVSSIEGRVAVEYFDDSEEAQS 214 (323)
T ss_pred EEEeccCCEEEEeecCceEEEEEcccccchhhhcccccee-EEEEEEEecC---CCceEEEeecceEEEEccCCchHHhh
Confidence 4455666667777888888888644311 1122221 2455555543 35776654 677766666541 1
Q ss_pred CeeEEE-------EEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCC
Q 000944 767 ETFNET-------ALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENK 839 (1213)
Q Consensus 767 ~~~~~r-------~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 839 (1213)
.++..| -..+.+-++.|++||-.++|+-+.++
T Consensus 215 kkyaFkCHr~~~~~~~~~yPVNai~Fhp~~~tfaTgGsD----------------------------------------- 253 (323)
T KOG1036|consen 215 KKYAFKCHRLSEKDTEIIYPVNAIAFHPIHGTFATGGSD----------------------------------------- 253 (323)
T ss_pred hceeEEeeecccCCceEEEEeceeEeccccceEEecCCC-----------------------------------------
Confidence 111111 12233456777777777776655322
Q ss_pred CCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcc
Q 000944 840 YDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVA 919 (1213)
Q Consensus 840 ~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~ 919 (1213)
+.+..+|+.+.+.+. +|...|. |+.+..|.. ....++||+++.+...+.+....
T Consensus 254 --------------------G~V~~Wd~~~rKrl~--q~~~~~~--SI~slsfs~--dG~~LAia~sy~ye~~~~~~~~~ 307 (323)
T KOG1036|consen 254 --------------------GIVNIWDLFNRKRLK--QLAKYET--SISSLSFSM--DGSLLAIASSYQYERADTPTHER 307 (323)
T ss_pred --------------------ceEEEccCcchhhhh--hccCCCC--ceEEEEecc--CCCeEEEEechhhhcCCCCCCCC
Confidence 134455554443332 3444443 555556653 34799999999875322224456
Q ss_pred cEEEEEEEEe
Q 000944 920 GYIHIYRFVE 929 (1213)
Q Consensus 920 Gri~v~~i~~ 929 (1213)
-+|++..+.+
T Consensus 308 ~~i~I~~l~d 317 (323)
T KOG1036|consen 308 NAIFIRDLTD 317 (323)
T ss_pred CceEEEeccc
Confidence 6677766543
No 56
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=92.66 E-value=20 Score=41.41 Aligned_cols=123 Identities=15% Similarity=0.246 Sum_probs=74.0
Q ss_pred CceeeeeCCCCcc---EEE--EEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEE
Q 000944 529 GRINEWRTPGKRT---IVK--VGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLA 603 (1213)
Q Consensus 529 ~~~~~~~~~~~~~---I~~--as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~ 603 (1213)
.+++...+|-|.+ +-. ++-++++|+++=.+|.|.+|.... +.+ +....++..|+.+++.+ .+..++
T Consensus 290 ak~~k~~~~~g~e~~~~e~FeVShd~~fia~~G~~G~I~lLhakT-~el--i~s~KieG~v~~~~fsS------dsk~l~ 360 (514)
T KOG2055|consen 290 AKVTKLKPPYGVEEKSMERFEVSHDSNFIAIAGNNGHIHLLHAKT-KEL--ITSFKIEGVVSDFTFSS------DSKELL 360 (514)
T ss_pred cccccccCCCCcccchhheeEecCCCCeEEEcccCceEEeehhhh-hhh--hheeeeccEEeeEEEec------CCcEEE
Confidence 3455555554432 333 334567888775667888876542 222 24566778888888863 346666
Q ss_pred EEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 604 VGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 604 v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
++..+|.|.+|.++...++...-.+. .....|+|+.. ...||-+|...|.+-.|..+
T Consensus 361 ~~~~~GeV~v~nl~~~~~~~rf~D~G-~v~gts~~~S~--------------ng~ylA~GS~~GiVNIYd~~ 417 (514)
T KOG2055|consen 361 ASGGTGEVYVWNLRQNSCLHRFVDDG-SVHGTSLCISL--------------NGSYLATGSDSGIVNIYDGN 417 (514)
T ss_pred EEcCCceEEEEecCCcceEEEEeecC-ccceeeeeecC--------------CCceEEeccCcceEEEeccc
Confidence 66677899999996543333221111 01234554322 23489999999999888754
No 57
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=92.27 E-value=19 Score=38.91 Aligned_cols=187 Identities=16% Similarity=0.151 Sum_probs=102.0
Q ss_pred ceEEEeeeCCCCCCceEEEEEecCceeEEEe--ccceeeec-C--------CCccCC--CCeEEEEeecCCeEEEEeCCc
Q 000944 455 SAVWTVKKNVNDEFDAYIVVSFNNATLVLSI--GETVEEVS-D--------SGFLDT--TPSLAVSLIGDDSLMQVHPSG 521 (1213)
Q Consensus 455 ~~iw~l~~~~~~~~~~~lvlS~~~~T~vl~~--~~~~~e~~-~--------~gf~~~--~~Tl~a~~~~~~~ivQVT~~~ 521 (1213)
+++-++..+.+ +.-++-...+.|+-+.- ++-.-++. + -.|..+ ++||..+.. .+.
T Consensus 106 ~dVlsva~s~d---n~qivSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~---------Dkt 173 (315)
T KOG0279|consen 106 KDVLSVAFSTD---NRQIVSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASW---------DKT 173 (315)
T ss_pred CceEEEEecCC---CceeecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccC---------Cce
Confidence 56666666432 24556556677765442 32211111 1 123333 455555443 245
Q ss_pred EEEEeCCC-ceeeeeCCCCccEEEEEecC-CEEEEE-EeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCcee
Q 000944 522 IRHIREDG-RINEWRTPGKRTIVKVGSNR-LQVVIA-LSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKR 598 (1213)
Q Consensus 522 i~l~~~~~-~~~~~~~~~~~~I~~as~~~-~~v~v~-~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~ 598 (1213)
+++-+... .+..-.+.....+..+++.. ..++.. =.+|++.+..+++...+ ..++-...|-++++.| .
T Consensus 174 vKvWnl~~~~l~~~~~gh~~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~k~l---ysl~a~~~v~sl~fsp------n 244 (315)
T KOG0279|consen 174 VKVWNLRNCQLRTTFIGHSGYVNTVTVSPDGSLCASGGKDGEAMLWDLNEGKNL---YSLEAFDIVNSLCFSP------N 244 (315)
T ss_pred EEEEccCCcchhhccccccccEEEEEECCCCCEEecCCCCceEEEEEccCCcee---EeccCCCeEeeEEecC------C
Confidence 66666442 12121222333466666663 333332 12467888877753322 2334345678888865 5
Q ss_pred eeEEEEEEeCCcEEEEEeCCCCceeEeEEeecC-----CCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEE
Q 000944 599 SRFLAVGSYDNTIRILSLDPDDCMQILSVQSVS-----SPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTV 673 (1213)
Q Consensus 599 ~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~-----~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~ 673 (1213)
+.||+.++. .+|.||++++...++.+..+... ..|+++.+.-. .....||+|..||.+..++
T Consensus 245 rywL~~at~-~sIkIwdl~~~~~v~~l~~d~~g~s~~~~~~~clslaws------------~dG~tLf~g~td~~irv~q 311 (315)
T KOG0279|consen 245 RYWLCAATA-TSIKIWDLESKAVVEELKLDGIGPSSKAGDPICLSLAWS------------ADGQTLFAGYTDNVIRVWQ 311 (315)
T ss_pred ceeEeeccC-CceEEEeccchhhhhhccccccccccccCCcEEEEEEEc------------CCCcEEEeeecCCcEEEEE
Confidence 788888886 45999999766555444332221 24666654432 2456899999999998887
Q ss_pred Ee
Q 000944 674 VD 675 (1213)
Q Consensus 674 ~~ 675 (1213)
+.
T Consensus 312 v~ 313 (315)
T KOG0279|consen 312 VA 313 (315)
T ss_pred ee
Confidence 63
No 58
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=92.27 E-value=1.3 Score=50.52 Aligned_cols=125 Identities=14% Similarity=0.217 Sum_probs=93.1
Q ss_pred EEEEEEEeCCceEEEEEE-EeecCcceEeccccC---eEEEEeCCeEEEEecCCceeeceeeecCccceEEEEEEe--CC
Q 000944 922 IHIYRFVEEGKSLELLHK-TQVEGIPLALCQFQG---RLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTY--RD 995 (1213)
Q Consensus 922 i~v~~i~~~~~kl~~~~~-~~~~g~V~ai~~~~g---~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~--~~ 995 (1213)
|++|+.. .++.+.. +..+|+|.+++-..| .+.++...++.+|..++...+...+- ++.-|.+|+.. ..
T Consensus 226 v~Iw~~~----t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~~s~vetlyG--Hqd~v~~IdaL~reR 299 (479)
T KOG0299|consen 226 VQIWDCD----TLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQLSYVETLYG--HQDGVLGIDALSRER 299 (479)
T ss_pred EEEecCc----ccchhhcccccccceeeeeeecCccceeeeecCCceEEEehhHhHHHHHHhC--Cccceeeechhcccc
Confidence 4588876 4555555 567899999998865 34556677999999987766555433 57788888864 45
Q ss_pred EEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecC
Q 000944 996 RIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 996 ~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~ 1056 (1213)
-+-||-.-+++-+++. +++.+|+..+- .-+.-++.||+++.|+.+-.+|+|.+....
T Consensus 300 ~vtVGgrDrT~rlwKi-~eesqlifrg~---~~sidcv~~In~~HfvsGSdnG~IaLWs~~ 356 (479)
T KOG0299|consen 300 CVTVGGRDRTVRLWKI-PEESQLIFRGG---EGSIDCVAFINDEHFVSGSDNGSIALWSLL 356 (479)
T ss_pred eEEeccccceeEEEec-cccceeeeeCC---CCCeeeEEEecccceeeccCCceEEEeeec
Confidence 5668888899999888 55677777654 234557789999999999999999987753
No 59
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=92.10 E-value=24 Score=39.64 Aligned_cols=217 Identities=16% Similarity=0.288 Sum_probs=122.1
Q ss_pred CCEEEEEEEccC-CCeEEeeeeccCcce-EEEEeeecCCCceeeeEEEEEEe-CCcEEEEEeCCCCceeEeEEe--ecCC
Q 000944 558 GGELIYFEVDMT-GQLLEVEKHEMSGDV-ACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ--SVSS 632 (1213)
Q Consensus 558 ~~~l~~l~~~~~-~~l~~~~~~~l~~~i-s~i~i~~~~~~~~~~~~l~v~~~-~~~i~i~sl~p~~~l~~~~~~--~l~~ 632 (1213)
.|.+..|++|++ |.|..+.+..++... +.+++.+ ...+++++.+ .|+|.++-++.+..+..+... ....
T Consensus 63 ~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~------~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~ 136 (346)
T COG2706 63 EGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDE------DGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGS 136 (346)
T ss_pred cCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECC------CCCEEEEEEccCceEEEEEcccCCccccceeeeecCCC
Confidence 357888999865 888888777666544 6677754 4678888888 789999999655444432211 1111
Q ss_pred CCc----e--eEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccc--eeeecCCCCeEEEEEECCe
Q 000944 633 PPE----S--LLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSR--SRFLGLRPPKLFSVVVGGR 704 (1213)
Q Consensus 633 ~p~----S--l~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~--~~~lG~~pv~l~~~~~~~~ 704 (1213)
-|+ + .-...+. -.++..+.+-|+...+..|.++ +|.+.... ...-|.-|..+. |.- +.
T Consensus 137 ~p~~rQ~~~h~H~a~~t----------P~~~~l~v~DLG~Dri~~y~~~--dg~L~~~~~~~v~~G~GPRHi~-FHp-n~ 202 (346)
T COG2706 137 GPHERQESPHVHSANFT----------PDGRYLVVPDLGTDRIFLYDLD--DGKLTPADPAEVKPGAGPRHIV-FHP-NG 202 (346)
T ss_pred CCCccccCCccceeeeC----------CCCCEEEEeecCCceEEEEEcc--cCccccccccccCCCCCcceEE-EcC-CC
Confidence 121 0 0011221 1246677788888899999887 45554322 123344443332 111 22
Q ss_pred eEEEEec--CccEEEEEeC---CeEEEE------ecCccccceeeccccCCCCceEEEEe---CCeEEEEEEccCCCeeE
Q 000944 705 AAMLCLS--SRPWLGYIHR---GRFLLT------PLSYETLEYAASFSSDQCVEGVVSVA---GNALRVFTIERLGETFN 770 (1213)
Q Consensus 705 ~~v~~~g--~~p~~i~~~~---~~~~~~------~~~~~~v~~~~~f~~~~~~~~~i~~~---~~~L~i~~l~~~~~~~~ 770 (1213)
..+++.+ +....++..+ +++.-. |-++..-...+.++... ..-|+|++ .+.|.+.++++.+.++.
T Consensus 203 k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~-dGrFLYasNRg~dsI~~f~V~~~~g~L~ 281 (346)
T COG2706 203 KYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISP-DGRFLYASNRGHDSIAVFSVDPDGGKLE 281 (346)
T ss_pred cEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECC-CCCEEEEecCCCCeEEEEEEcCCCCEEE
Confidence 3333333 2223333322 333221 11222223333333322 24599997 36999999998644444
Q ss_pred -EEEEeCCC-ccceeeecCCCceEEEE
Q 000944 771 -ETALPLRY-TPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 771 -~r~i~l~~-tp~~i~y~~~~~~~~v~ 795 (1213)
+...+..+ +||.....+..+.++++
T Consensus 282 ~~~~~~teg~~PR~F~i~~~g~~Liaa 308 (346)
T COG2706 282 LVGITPTEGQFPRDFNINPSGRFLIAA 308 (346)
T ss_pred EEEEeccCCcCCccceeCCCCCEEEEE
Confidence 44456666 59999999888777776
No 60
>PTZ00421 coronin; Provisional
Probab=91.86 E-value=37 Score=41.34 Aligned_cols=115 Identities=13% Similarity=0.103 Sum_probs=62.2
Q ss_pred EEecCCEEEEEEe-CCEEEEEEEccCCCeEEee-e-eccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCc
Q 000944 545 VGSNRLQVVIALS-GGELIYFEVDMTGQLLEVE-K-HEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC 621 (1213)
Q Consensus 545 as~~~~~v~v~~s-~~~l~~l~~~~~~~l~~~~-~-~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~ 621 (1213)
+++++.++++... .|...++.+...|++.... . .--...|.++++.+. ...+++.|.+|++|.+|.+. +..
T Consensus 36 ~~~n~~~~a~~w~~~gg~~v~~~~~~G~~~~~~~~l~GH~~~V~~v~fsP~-----d~~~LaSgS~DgtIkIWdi~-~~~ 109 (493)
T PTZ00421 36 IACNDRFIAVPWQQLGSTAVLKHTDYGKLASNPPILLGQEGPIIDVAFNPF-----DPQKLFTASEDGTIMGWGIP-EEG 109 (493)
T ss_pred EeECCceEEEEEecCCceEEeeccccccCCCCCceEeCCCCCEEEEEEcCC-----CCCEEEEEeCCCEEEEEecC-CCc
Confidence 4556667666543 2334445544444332100 0 011357899998653 24589999999999999994 322
Q ss_pred eeE-eE--EeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 622 MQI-LS--VQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 622 l~~-~~--~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
+.. .. ...+......+..+.+.. ....+|+.|..||.+..|.+.
T Consensus 110 ~~~~~~~~l~~L~gH~~~V~~l~f~P----------~~~~iLaSgs~DgtVrIWDl~ 156 (493)
T PTZ00421 110 LTQNISDPIVHLQGHTKKVGIVSFHP----------SAMNVLASAGADMVVNVWDVE 156 (493)
T ss_pred cccccCcceEEecCCCCcEEEEEeCc----------CCCCEEEEEeCCCEEEEEECC
Confidence 210 00 011211122222233321 123578888999999888775
No 61
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=91.66 E-value=5.1 Score=46.02 Aligned_cols=102 Identities=15% Similarity=0.154 Sum_probs=64.7
Q ss_pred CeEEEE-eCCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccC---------------Ce
Q 000944 954 GRLLAG-IGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDE---------------NQ 1017 (1213)
Q Consensus 954 g~ll~~-~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~---------------~~ 1017 (1213)
.+|+.+ ..+.+++|++....|+....++ .+..++.++..+..+++|..--.+++..+..-+ .+
T Consensus 189 ~rl~TaS~D~t~k~wdlS~g~LLlti~fp-~si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~ 267 (476)
T KOG0646|consen 189 ARLYTASEDRTIKLWDLSLGVLLLTITFP-SSIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQ 267 (476)
T ss_pred ceEEEecCCceEEEEEeccceeeEEEecC-CcceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccce
Confidence 456543 3468899999988877665554 244555666678899999988888776653211 11
Q ss_pred E-EEeeccCCCcceEEEEe-ecCCeeeeecCCCcEEEEecCC
Q 000944 1018 L-YIFADDSVPRWLTAAHH-IDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1018 l-~~~a~D~~~~~~~~~~~-ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
. ..++. .....+|+... .|...++.+|.+|++.+.+...
T Consensus 268 ~~~~~Gh-~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S 308 (476)
T KOG0646|consen 268 INVLVGH-ENESAITCLAISTDGTLLLSGDEDGKVCVWDIYS 308 (476)
T ss_pred eeeeccc-cCCcceeEEEEecCccEEEeeCCCCCEEEEecch
Confidence 1 11111 11134555553 5666788899999999987543
No 62
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=91.66 E-value=35 Score=41.13 Aligned_cols=253 Identities=15% Similarity=0.172 Sum_probs=122.7
Q ss_pred EEEEEEecCCCCceEEEEeCCEEEEEeecCCCCeEEEEEEEeeeeeeEeeEE----eeCCCCeeEEEEEeccceEEEEEE
Q 000944 16 AAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQF----RLTGSQKDYIVVGSDSGRIVILEY 91 (1213)
Q Consensus 16 ~~v~~~f~~~~~~~LVv~k~~~Levy~i~~~g~L~~v~~~~l~g~I~~i~~~----r~~~~~~d~L~v~~~~~~l~il~~ 91 (1213)
||+.--=+.|+..+||+|-++.|-||..++.+.|+.+.-+.. +|..++-- |+....-|-++|.-....=-+|+|
T Consensus 13 hci~d~afkPDGsqL~lAAg~rlliyD~ndG~llqtLKgHKD--tVycVAys~dGkrFASG~aDK~VI~W~~klEG~LkY 90 (1081)
T KOG1538|consen 13 HCINDIAFKPDGTQLILAAGSRLLVYDTSDGTLLQPLKGHKD--TVYCVAYAKDGKRFASGSADKSVIIWTSKLEGILKY 90 (1081)
T ss_pred cchheeEECCCCceEEEecCCEEEEEeCCCcccccccccccc--eEEEEEEccCCceeccCCCceeEEEecccccceeee
Confidence 455443345889999999999999999987665655443321 34444322 222234444554444333444555
Q ss_pred eCCC----CcEeEEeeee----ccccCcccccC-----------CceEEECCCCCEEEEEecccceEEEEEecCCCCcee
Q 000944 92 NPSK----NVFDKIHQET----FGKSGCRRIVP-----------GQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLT 152 (1213)
Q Consensus 92 d~~~----~~~~tis~~~----~~~~g~~~~~~-----------~~~l~VDP~~r~ia~~~~~~~~~v~~~~~~~~~~~~ 152 (1213)
+... ..|.+++++- +.+=|+..... ---++=.-+|.+.|+....|.+.+= ++.++.+..
T Consensus 91 SH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~CsWtnDGqylalG~~nGTIsiR--Nk~gEek~~ 168 (1081)
T KOG1538|consen 91 SHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICCSWTNDGQYLALGMFNGTISIR--NKNGEEKVK 168 (1081)
T ss_pred ccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEeeecCCCcEEEEeccCceEEee--cCCCCcceE
Confidence 3321 1122222211 11111111100 0111233455666666666665442 233333344
Q ss_pred eeccccccccccEEEEeeeeccC--CCCcEEEEEEeeccccccCcchhccccccceEEEEEEEcCCceeeeeeeeccCCC
Q 000944 153 ISSPLEAHKSHTIVYSICGIDCG--FDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNG 230 (1213)
Q Consensus 153 ~~~p~e~~~~~~~i~~~~fl~~~--~~~p~~a~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~~ 230 (1213)
|..|= -+++.|+.++|-... ..+.++|++ +| .+.+.+|.++- ++. ..+.+|.++
T Consensus 169 I~Rpg---g~Nspiwsi~~~p~sg~G~~di~aV~---------DW--------~qTLSFy~LsG---~~I-gk~r~L~Fd 224 (1081)
T KOG1538|consen 169 IERPG---GSNSPIWSICWNPSSGEGRNDILAVA---------DW--------GQTLSFYQLSG---KQI-GKDRALNFD 224 (1081)
T ss_pred EeCCC---CCCCCceEEEecCCCCCCccceEEEE---------ec--------cceeEEEEecc---eee-cccccCCCC
Confidence 43331 145678888876422 124467776 23 35688888862 111 124468889
Q ss_pred cceEEecCCCCCCCCeEEEEeeceEE-EEeCCCCceeeecCCCCCCCCCcceEEEEEEEEEecCceEEEEEeCCCCEEEE
Q 000944 231 ANMLVTVPGGGDGPSGVLVCAENFVI-YKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKV 309 (1213)
Q Consensus 231 ~~~liplp~~~~~~~GvLv~~~~~i~-y~~~~~~~~~~~~p~~~~~~~~~~~~i~~~~~~~~~~~~~~ll~~~~G~l~~l 309 (1213)
++.|--+|+. ..+|+-|...++ ++...+...+. +- +.+.-+|+.. ..++...+.++-.||.+--.
T Consensus 225 P~CisYf~NG----Ey~LiGGsdk~L~~fTR~GvrLGT-vg------~~D~WIWtV~---~~PNsQ~v~~GCqDGTiACy 290 (1081)
T KOG1538|consen 225 PCCISYFTNG----EYILLGGSDKQLSLFTRDGVRLGT-VG------EQDSWIWTVQ---AKPNSQYVVVGCQDGTIACY 290 (1081)
T ss_pred chhheeccCC----cEEEEccCCCceEEEeecCeEEee-cc------ccceeEEEEE---EccCCceEEEEEccCeeehh
Confidence 8888877753 234444444433 44443321110 00 0011222222 22355677777777776433
Q ss_pred E
Q 000944 310 T 310 (1213)
Q Consensus 310 ~ 310 (1213)
.
T Consensus 291 N 291 (1081)
T KOG1538|consen 291 N 291 (1081)
T ss_pred h
Confidence 3
No 63
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=91.20 E-value=37 Score=40.08 Aligned_cols=208 Identities=14% Similarity=0.122 Sum_probs=112.9
Q ss_pred CcceEEEEecCCCCcceEEEeeeC--CCCCCceEEEEEecCceeEEEecccee-eecCCCccCCCCeEEEEeecCCeEEE
Q 000944 440 LAVSEMAVSQLPGVPSAVWTVKKN--VNDEFDAYIVVSFNNATLVLSIGETVE-EVSDSGFLDTTPSLAVSLIGDDSLMQ 516 (1213)
Q Consensus 440 i~~~~~~~~~l~~~~~~iw~l~~~--~~~~~~~~lvlS~~~~T~vl~~~~~~~-e~~~~gf~~~~~Tl~a~~~~~~~ivQ 516 (1213)
..++.+.+-.|+. ..|.+-.. .+...++++.+-..+.+.-+--++.+. ....++|..-.|-.|+... | .+|=
T Consensus 117 ~~L~~~yeh~l~~---~a~nm~~G~Fgg~~~~~~IcVQS~DG~L~~feqe~~~f~~~lp~~llPgPl~Y~~~t-D-sfvt 191 (418)
T PF14727_consen 117 YQLELIYEHSLQR---TAYNMCCGPFGGVKGRDFICVQSMDGSLSFFEQESFAFSRFLPDFLLPGPLCYCPRT-D-SFVT 191 (418)
T ss_pred EEEEEEEEEeccc---ceeEEEEEECCCCCCceEEEEEecCceEEEEeCCcEEEEEEcCCCCCCcCeEEeecC-C-EEEE
Confidence 3344444444543 34444332 122225666664444444332222221 2233567788888888775 3 3433
Q ss_pred EeCC-cEEEEe-----------------------CCCceeeeeCCCCccEEEEEec-----CCEEEEEEeCCEEEEEEEc
Q 000944 517 VHPS-GIRHIR-----------------------EDGRINEWRTPGKRTIVKVGSN-----RLQVVIALSGGELIYFEVD 567 (1213)
Q Consensus 517 VT~~-~i~l~~-----------------------~~~~~~~~~~~~~~~I~~as~~-----~~~v~v~~s~~~l~~l~~~ 567 (1213)
++.. .+..+. .++...+|...-|..|..-.+. .+.|+|. ....|..+ +
T Consensus 192 ~sss~~l~~Yky~~La~~s~~~~~~~~~~~~~~~~k~l~~dWs~nlGE~~l~i~v~~~~~~~~~IvvL-ger~Lf~l--~ 268 (418)
T PF14727_consen 192 ASSSWTLECYKYQDLASASEASSRQSGTEQDISSGKKLNPDWSFNLGEQALDIQVVRFSSSESDIVVL-GERSLFCL--K 268 (418)
T ss_pred ecCceeEEEecHHHhhhccccccccccccccccccccccceeEEECCceeEEEEEEEcCCCCceEEEE-ecceEEEE--c
Confidence 3332 333322 1245778988777766665543 3455554 45567665 4
Q ss_pred cCCCeEEeeeeccCcceEEEEeeecCCC--ceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecc
Q 000944 568 MTGQLLEVEKHEMSGDVACLDIASVPEG--RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQAS 645 (1213)
Q Consensus 568 ~~~~l~~~~~~~l~~~is~i~i~~~~~~--~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~ 645 (1213)
.+|.+.-. +.++...+|++....+.. ......++|++.++++.||. |..|..-. .++..|.++.+.++..
T Consensus 269 ~~G~l~~~--krLd~~p~~~~~Y~~~~~~~~~~~~~llV~t~t~~LlVy~---d~~L~WsA--~l~~~PVal~v~~~~~- 340 (418)
T PF14727_consen 269 DNGSLRFQ--KRLDYNPSCFCPYRVPWYNEPSTRLNLLVGTHTGTLLVYE---DTTLVWSA--QLPHVPVALSVANFNG- 340 (418)
T ss_pred CCCeEEEE--EecCCceeeEEEEEeecccCCCCceEEEEEecCCeEEEEe---CCeEEEec--CCCCCCEEEEecccCC-
Confidence 56866543 455778888887665211 11234588999999999996 32454433 3456788877776652
Q ss_pred cCCCCCCCCCCceEEEEEee-CCeEEEEEE
Q 000944 646 VGGEDGADHPASLFLNAGLQ-NGVLFRTVV 674 (1213)
Q Consensus 646 ~~~~~~~~~~~~~~Lligl~-~G~l~~~~~ 674 (1213)
..-+++.|. +|.|-...+
T Consensus 341 -----------~~G~IV~Ls~~G~L~v~YL 359 (418)
T PF14727_consen 341 -----------LKGLIVSLSDEGQLSVSYL 359 (418)
T ss_pred -----------CCceEEEEcCCCcEEEEEe
Confidence 223455555 788854443
No 64
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=90.82 E-value=37 Score=39.55 Aligned_cols=111 Identities=20% Similarity=0.313 Sum_probs=67.0
Q ss_pred CCccEEEEEecCC-EEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 538 GKRTIVKVGSNRL-QVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 538 ~~~~I~~as~~~~-~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
.|.+|.....-.+ ..++...++++.+..+...|++.- ....-.+.|||+.+.. .+.-|..|.-|+.+.+|++
T Consensus 195 hg~pVe~vl~lpsgs~iasAgGn~vkVWDl~~G~qll~-~~~~H~KtVTcL~l~s------~~~rLlS~sLD~~VKVfd~ 267 (487)
T KOG0310|consen 195 HGCPVESVLALPSGSLIASAGGNSVKVWDLTTGGQLLT-SMFNHNKTVTCLRLAS------DSTRLLSGSLDRHVKVFDT 267 (487)
T ss_pred CCCceeeEEEcCCCCEEEEcCCCeEEEEEecCCceehh-hhhcccceEEEEEeec------CCceEeecccccceEEEEc
Confidence 5556665555433 444444677888877764443221 1222457899999975 2345666666899999997
Q ss_pred CCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEE
Q 000944 617 DPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRT 672 (1213)
Q Consensus 617 ~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~ 672 (1213)
.+ +..++.-..++..-|+.+. .+...+.+|+.||.+..-
T Consensus 268 -t~--~Kvv~s~~~~~pvLsiavs--------------~dd~t~viGmsnGlv~~r 306 (487)
T KOG0310|consen 268 -TN--YKVVHSWKYPGPVLSIAVS--------------PDDQTVVIGMSNGLVSIR 306 (487)
T ss_pred -cc--eEEEEeeecccceeeEEec--------------CCCceEEEecccceeeee
Confidence 33 6666544333332333332 145678999999998654
No 65
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=90.26 E-value=24 Score=41.08 Aligned_cols=94 Identities=12% Similarity=0.197 Sum_probs=68.0
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcc--eEeccccCeEEEEeC--CeEEEEecCCceeeceeeecCccceEEEEEEeC
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIP--LALCQFQGRLLAGIG--PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYR 994 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V--~ai~~~~g~ll~~~g--~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~ 994 (1213)
+|.+.+|++.-- +-+.-+.+....|. .|.++.|..|++++| .+|++|+...+++.....++ .|...+...-++
T Consensus 186 ~G~VtlwDv~g~--sp~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l~y~-~Plstvaf~~~G 262 (673)
T KOG4378|consen 186 KGAVTLWDVQGM--SPIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRLTYS-HPLSTVAFSECG 262 (673)
T ss_pred CCeEEEEeccCC--CcccchhhhccCCcCcceecCCccceEEEecccceEEEeecccccccceeeec-CCcceeeecCCc
Confidence 689999998742 22222222333444 345677999999998 58999999988887766677 487788888889
Q ss_pred CEEEEeecCCcEEEEEEeccC
Q 000944 995 DRIYVGDIQESFHFCKYRRDE 1015 (1213)
Q Consensus 995 ~~I~vgD~~~Sv~~l~~~~~~ 1015 (1213)
-++.+|...--+.+|..+..+
T Consensus 263 ~~L~aG~s~G~~i~YD~R~~k 283 (673)
T KOG4378|consen 263 TYLCAGNSKGELIAYDMRSTK 283 (673)
T ss_pred eEEEeecCCceEEEEecccCC
Confidence 999999998888887665443
No 66
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=90.22 E-value=29 Score=37.23 Aligned_cols=178 Identities=15% Similarity=0.229 Sum_probs=92.9
Q ss_pred ecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeE
Q 000944 547 SNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILS 626 (1213)
Q Consensus 547 ~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~ 626 (1213)
-.+.|+++.-.+..|.++.... ...+...++..++--++... ..++.+..+..|+|+|++. |. |+++.
T Consensus 116 p~g~~~~~~~kdD~it~id~r~---~~~~~~~~~~~e~ne~~w~~------~nd~Fflt~GlG~v~ILsy-ps--Lkpv~ 183 (313)
T KOG1407|consen 116 PDGEYIAVGNKDDRITFIDART---YKIVNEEQFKFEVNEISWNN------SNDLFFLTNGLGCVEILSY-PS--LKPVQ 183 (313)
T ss_pred CCCCEEEEecCcccEEEEEecc---cceeehhcccceeeeeeecC------CCCEEEEecCCceEEEEec-cc--ccccc
Confidence 3456666665566676654332 22223344445554444431 2445555555799999999 75 66542
Q ss_pred EeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeE
Q 000944 627 VQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAA 706 (1213)
Q Consensus 627 ~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~ 706 (1213)
.+.+.|..-..+++. ...-|+-+|..|-.+--+.++. +-..+...-=.-||+-..|
T Consensus 184 --si~AH~snCicI~f~-----------p~GryfA~GsADAlvSLWD~~E----LiC~R~isRldwpVRTlSF------- 239 (313)
T KOG1407|consen 184 --SIKAHPSNCICIEFD-----------PDGRYFATGSADALVSLWDVDE----LICERCISRLDWPVRTLSF------- 239 (313)
T ss_pred --ccccCCcceEEEEEC-----------CCCceEeeccccceeeccChhH----hhhheeeccccCceEEEEe-------
Confidence 333444322223443 2345777888877664444432 2221111100112222111
Q ss_pred EEEecCccEEEEEeCCeEEEEecCccccceeeccccCCCCceEEEEeCC-eEEEEEEccCCCeeEEEEEeCCCccceeee
Q 000944 707 MLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGN-ALRVFTIERLGETFNETALPLRYTPRRFVL 785 (1213)
Q Consensus 707 v~~~g~~p~~i~~~~~~~~~~~~~~~~v~~~~~f~~~~~~~~~i~~~~~-~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y 785 (1213)
+.+|++ +...+++ -+-|+.++.-+ .+..||-.+.--.+++
T Consensus 240 ------------S~dg~~------------------------lASaSEDh~IDIA~vetGd---~~~eI~~~~~t~tVAW 280 (313)
T KOG1407|consen 240 ------------SHDGRM------------------------LASASEDHFIDIAEVETGD---RVWEIPCEGPTFTVAW 280 (313)
T ss_pred ------------ccCcce------------------------eeccCccceEEeEecccCC---eEEEeeccCCceeEEe
Confidence 122221 1122222 44566666532 3457888888889999
Q ss_pred cCCCceEEEEEccC
Q 000944 786 QPKKKLMVIIETDQ 799 (1213)
Q Consensus 786 ~~~~~~~~v~~~~~ 799 (1213)
||...+++-+|.+.
T Consensus 281 HPk~~LLAyA~ddk 294 (313)
T KOG1407|consen 281 HPKRPLLAYACDDK 294 (313)
T ss_pred cCCCceeeEEecCC
Confidence 99999999998764
No 67
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=90.17 E-value=42 Score=38.97 Aligned_cols=88 Identities=10% Similarity=0.100 Sum_probs=51.1
Q ss_pred ccEEEEEEEEeCCceEEEEEEE-eecCcceEeccc-cCeEEE--EeCCeEEEEecCCceeeceeeecCccceEEEEEEeC
Q 000944 919 AGYIHIYRFVEEGKSLELLHKT-QVEGIPLALCQF-QGRLLA--GIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYR 994 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~-~~~g~V~ai~~~-~g~ll~--~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~ 994 (1213)
.+.+.+|+++. ...+|.. ...-|||++.-- +|++++ ..+..|++|.....++.+.-. ++-..+-++-+..+
T Consensus 431 dstV~lwdv~~----gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l~~s~~-~~~~Ifel~Wn~~G 505 (524)
T KOG0273|consen 431 DSTVKLWDVES----GVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKLVKSYQ-GTGGIFELCWNAAG 505 (524)
T ss_pred CCeEEEEEccC----CceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEeccccchheeEeec-CCCeEEEEEEcCCC
Confidence 45678888874 4445554 678999999766 455544 234578899988777754322 11122223333445
Q ss_pred CEEEEeecCCcEEEEEE
Q 000944 995 DRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 995 ~~I~vgD~~~Sv~~l~~ 1011 (1213)
|+|.+.=.-.++.++.+
T Consensus 506 ~kl~~~~sd~~vcvldl 522 (524)
T KOG0273|consen 506 DKLGACASDGSVCVLDL 522 (524)
T ss_pred CEEEEEecCCCceEEEe
Confidence 55555555555555443
No 68
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=89.87 E-value=35 Score=37.63 Aligned_cols=72 Identities=25% Similarity=0.337 Sum_probs=47.3
Q ss_pred cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEE
Q 000944 582 GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLN 661 (1213)
Q Consensus 582 ~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Ll 661 (1213)
..|+|+|. ...|++.|..|-+|+||.+.....+..+. ....++-.+.+.. .-...+|+
T Consensus 44 ~sitavAV--------s~~~~aSGssDetI~IYDm~k~~qlg~ll-----~HagsitaL~F~~---------~~S~shLl 101 (362)
T KOG0294|consen 44 GSITALAV--------SGPYVASGSSDETIHIYDMRKRKQLGILL-----SHAGSITALKFYP---------PLSKSHLL 101 (362)
T ss_pred cceeEEEe--------cceeEeccCCCCcEEEEeccchhhhccee-----ccccceEEEEecC---------Ccchhhee
Confidence 45788887 47899999999999999994333333221 1122333333332 11344899
Q ss_pred EEeeCCeEEEEEEe
Q 000944 662 AGLQNGVLFRTVVD 675 (1213)
Q Consensus 662 igl~~G~l~~~~~~ 675 (1213)
-|..||.+..++..
T Consensus 102 S~sdDG~i~iw~~~ 115 (362)
T KOG0294|consen 102 SGSDDGHIIIWRVG 115 (362)
T ss_pred eecCCCcEEEEEcC
Confidence 99999999998864
No 69
>PF14727 PHTB1_N: PTHB1 N-terminus
Probab=89.64 E-value=50 Score=39.04 Aligned_cols=224 Identities=13% Similarity=0.104 Sum_probs=128.4
Q ss_pred EEEEEEeCCEEEEEEEccC------C---CeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCc
Q 000944 551 QVVIALSGGELIYFEVDMT------G---QLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC 621 (1213)
Q Consensus 551 ~v~v~~s~~~l~~l~~~~~------~---~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~ 621 (1213)
.++| +.-.+|.++.+... | +|..+-.+.+....-.|+..+++. .+...++.|=+-||.+.+|.=+ .
T Consensus 90 ~LaV-LhP~kl~vY~v~~~~g~~~~g~~~~L~~~yeh~l~~~a~nm~~G~Fgg-~~~~~~IcVQS~DG~L~~feqe---~ 164 (418)
T PF14727_consen 90 QLAV-LHPRKLSVYSVSLVDGTVEHGNQYQLELIYEHSLQRTAYNMCCGPFGG-VKGRDFICVQSMDGSLSFFEQE---S 164 (418)
T ss_pred eEEE-ecCCEEEEEEEEecCCCcccCcEEEEEEEEEEecccceeEEEEEECCC-CCCceEEEEEecCceEEEEeCC---c
Confidence 4444 35667777666311 2 233344455666666777777652 2236788888889999998642 1
Q ss_pred eeEeEEeecC--CCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCC---C----------------Cc
Q 000944 622 MQILSVQSVS--SPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMV---T----------------GQ 680 (1213)
Q Consensus 622 l~~~~~~~l~--~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~---~----------------~~ 680 (1213)
+. ...-++ ..|..++.+. ....+++...+..|..|+.... + ..
T Consensus 165 ~~--f~~~lp~~llPgPl~Y~~--------------~tDsfvt~sss~~l~~Yky~~La~~s~~~~~~~~~~~~~~~~k~ 228 (418)
T PF14727_consen 165 FA--FSRFLPDFLLPGPLCYCP--------------RTDSFVTASSSWTLECYKYQDLASASEASSRQSGTEQDISSGKK 228 (418)
T ss_pred EE--EEEEcCCCCCCcCeEEee--------------cCCEEEEecCceeEEEecHHHhhhcccccccccccccccccccc
Confidence 21 112233 2466666554 2344566666778888865211 0 01
Q ss_pred ccccceeeecCCCCeEEEEEEC-CeeEEEEecCccEEEEEeCCeEEEEecCccccceeeccccC--CCCc---eEEEEe-
Q 000944 681 LSDSRSRFLGLRPPKLFSVVVG-GRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSD--QCVE---GVVSVA- 753 (1213)
Q Consensus 681 l~~~~~~~lG~~pv~l~~~~~~-~~~~v~~~g~~p~~i~~~~~~~~~~~~~~~~v~~~~~f~~~--~~~~---~~i~~~- 753 (1213)
+...+...+|-..+.+..++.. ....|++.|+|.+++..++|.+++..--.-...++++|... ..++ .++..+
T Consensus 229 l~~dWs~nlGE~~l~i~v~~~~~~~~~IvvLger~Lf~l~~~G~l~~~krLd~~p~~~~~Y~~~~~~~~~~~~~llV~t~ 308 (418)
T PF14727_consen 229 LNPDWSFNLGEQALDIQVVRFSSSESDIVVLGERSLFCLKDNGSLRFQKRLDYNPSCFCPYRVPWYNEPSTRLNLLVGTH 308 (418)
T ss_pred ccceeEEECCceeEEEEEEEcCCCCceEEEEecceEEEEcCCCeEEEEEecCCceeeEEEEEeecccCCCCceEEEEEec
Confidence 2233456688888888776654 46689999999999999899888865433345677887662 2222 255544
Q ss_pred CCeEEEEEEccC-------CCeeEEEEEeCCCccceeeecCCCceEEEE
Q 000944 754 GNALRVFTIERL-------GETFNETALPLRYTPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 754 ~~~L~i~~l~~~-------~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~ 795 (1213)
.+.|.|.+=..+ .....++.-.+...+--|+-..+...+-|.
T Consensus 309 t~~LlVy~d~~L~WsA~l~~~PVal~v~~~~~~~G~IV~Ls~~G~L~v~ 357 (418)
T PF14727_consen 309 TGTLLVYEDTTLVWSAQLPHVPVALSVANFNGLKGLIVSLSDEGQLSVS 357 (418)
T ss_pred CCeEEEEeCCeEEEecCCCCCCEEEEecccCCCCceEEEEcCCCcEEEE
Confidence 567766553332 122334444444455555554444443333
No 70
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=89.44 E-value=49 Score=38.69 Aligned_cols=89 Identities=12% Similarity=0.227 Sum_probs=51.9
Q ss_pred CcEEEEeCC--CceeeeeCCCCccEEEEE-ecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCc
Q 000944 520 SGIRHIRED--GRINEWRTPGKRTIVKVG-SNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGR 596 (1213)
Q Consensus 520 ~~i~l~~~~--~~~~~~~~~~~~~I~~as-~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~ 596 (1213)
..|.+++.. +.+..+....+..+..+. -.+.++.++-.+|.+..+.+.. ++ .+.+.......-++++.+
T Consensus 16 ~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~-~~--~v~~i~~G~~~~~i~~s~----- 87 (369)
T PF02239_consen 16 GSVAVIDGATNKVVARIPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLAT-GK--VVATIKVGGNPRGIAVSP----- 87 (369)
T ss_dssp TEEEEEETTT-SEEEEEE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTS-SS--EEEEEE-SSEEEEEEE-------
T ss_pred CEEEEEECCCCeEEEEEcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCc-cc--EEEEEecCCCcceEEEcC-----
Confidence 567778765 345555432221222222 2256888886678887765542 22 345566667777888864
Q ss_pred eeeeEEEEEEe-CCcEEEEEeC
Q 000944 597 KRSRFLAVGSY-DNTIRILSLD 617 (1213)
Q Consensus 597 ~~~~~l~v~~~-~~~i~i~sl~ 617 (1213)
..++++++.+ .+.+.+++..
T Consensus 88 -DG~~~~v~n~~~~~v~v~D~~ 108 (369)
T PF02239_consen 88 -DGKYVYVANYEPGTVSVIDAE 108 (369)
T ss_dssp -TTTEEEEEEEETTEEEEEETT
T ss_pred -CCCEEEEEecCCCceeEeccc
Confidence 3567787776 8999999863
No 71
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=88.79 E-value=48 Score=42.53 Aligned_cols=206 Identities=16% Similarity=0.176 Sum_probs=127.1
Q ss_pred EEEEEeCCCCceEEEEEc--CCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCC-ceEEEE
Q 000944 861 CIRVLDPRSANTTCLLEL--QDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEG-KSLELL 937 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~--~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~-~kl~~~ 937 (1213)
.++++|-+..+.+..|.- .+.-.|+.|+.+. .....++++|++ .|-|-+|+=-.++ .|.++|
T Consensus 1087 ~i~vwd~e~~~~l~~F~n~~~~~t~Vs~l~liN---e~D~aLlLtas~------------dGvIRIwk~y~~~~~~~eLV 1151 (1387)
T KOG1517|consen 1087 RIRVWDWEKGRLLNGFDNGAFPDTRVSDLELIN---EQDDALLLTASS------------DGVIRIWKDYADKWKKPELV 1151 (1387)
T ss_pred eEEEEecccCceeccccCCCCCCCccceeeeec---ccchhheeeecc------------CceEEEecccccccCCceeE
Confidence 566677666666655543 2445666666543 233577887775 4667677633332 245555
Q ss_pred EEE----------eecCcceEeccccCeEEEEeCC-eEEEEecCCceeeceeeecCccceEEEEEE---eCCEEEEeecC
Q 000944 938 HKT----------QVEGIPLALCQFQGRLLAGIGP-VLRLYDLGKKRLLRKCENKLFPNTIVSINT---YRDRIYVGDIQ 1003 (1213)
Q Consensus 938 ~~~----------~~~g~V~ai~~~~g~ll~~~g~-~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~---~~~~I~vgD~~ 1003 (1213)
... .-.|.|+.=.+.+|+|+++-+- .|.||+.+.++....-.+. ..+-+++|+. .+|.|++|=.-
T Consensus 1152 Taw~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~-s~t~vTaLS~~~~~gn~i~AGfaD 1230 (1387)
T KOG1517|consen 1152 TAWSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDVRSIRIWDAHKEQVVADIPYG-SSTLVTALSADLVHGNIIAAGFAD 1230 (1387)
T ss_pred EeeccccccCccCCCCCeeeehhhhCCeEEecCCeeEEEEEecccceeEeecccC-CCccceeecccccCCceEEEeecC
Confidence 331 1125666667778999887664 5779999877765544443 2556788874 47999999999
Q ss_pred CcEEEEEEecc-CCeEEEeeccCCCcc-eEEEEeecC--CeeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCcc
Q 000944 1004 ESFHFCKYRRD-ENQLYIFADDSVPRW-LTAAHHIDF--DTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKL 1079 (1213)
Q Consensus 1004 ~Sv~~l~~~~~-~~~l~~~a~D~~~~~-~~~~~~ld~--~~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~ 1079 (1213)
-|+-+|.-+-. .+.++-+.|...... +..+.+=-. +.++.+-.+|.|.+++.--...
T Consensus 1231 GsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~~~------------------- 1291 (1387)
T KOG1517|consen 1231 GSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMSSK------------------- 1291 (1387)
T ss_pred CceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCCeEEEEecccCcc-------------------
Confidence 99988765433 355776666554432 444443221 2578889999999987543211
Q ss_pred CCcccceeeeeeeecCceeceEEEe
Q 000944 1080 NGAPNKMEEIVQFHVGDVVTSLQKA 1104 (1213)
Q Consensus 1080 ~~~~~~L~~~~~~~lg~~v~~~~~~ 1104 (1213)
..-+.....+..|.-.|++...
T Consensus 1292 ---e~~~~iv~~~~yGs~lTal~VH 1313 (1387)
T KOG1517|consen 1292 ---ETFLTIVAHWEYGSALTALTVH 1313 (1387)
T ss_pred ---cccceeeeccccCccceeeeec
Confidence 1234556666678777777654
No 72
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=88.69 E-value=9.3 Score=40.53 Aligned_cols=104 Identities=14% Similarity=0.128 Sum_probs=63.2
Q ss_pred eccccCeEEEEeCCeEEEEecCCceeeceeeecCccceEE--EEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCC
Q 000944 949 LCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIV--SINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSV 1026 (1213)
Q Consensus 949 i~~~~g~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~--~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~ 1026 (1213)
|++-+++|++|.+++|++|++...+=.+++.++...-.++ .....+.-++-|----.+-+...+. -..-|.+.
T Consensus 48 iTpdk~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseDgt~kIWdlR~-----~~~qR~~~ 122 (311)
T KOG0315|consen 48 ITPDKKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSEDGTVKIWDLRS-----LSCQRNYQ 122 (311)
T ss_pred EcCCcchhhhccCCeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCCceEEEEeccC-----cccchhcc
Confidence 4455677888888999999998766557887774322343 3444455555554434444433322 12223443
Q ss_pred CcceEEEEeecC--CeeeeecCCCcEEEEecCC
Q 000944 1027 PRWLTAAHHIDF--DTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1027 ~~~~~~~~~ld~--~~~l~~D~~gnl~il~~~~ 1057 (1213)
........++.. ..++.+|..|||++.++-.
T Consensus 123 ~~spVn~vvlhpnQteLis~dqsg~irvWDl~~ 155 (311)
T KOG0315|consen 123 HNSPVNTVVLHPNQTELISGDQSGNIRVWDLGE 155 (311)
T ss_pred CCCCcceEEecCCcceEEeecCCCcEEEEEccC
Confidence 333344444544 3689999999999998765
No 73
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.44 E-value=27 Score=41.92 Aligned_cols=175 Identities=5% Similarity=0.052 Sum_probs=114.4
Q ss_pred ceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEE
Q 000944 857 KWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLEL 936 (1213)
Q Consensus 857 ~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~ 936 (1213)
-|.+.+++++.+|...+.+++..+ +-+-+. +|- .+++-+++|.. -++|-+|+.. +++.
T Consensus 32 LynG~V~IWnyetqtmVksfeV~~-~PvRa~---kfi--aRknWiv~GsD------------D~~IrVfnyn----t~ek 89 (794)
T KOG0276|consen 32 LYNGDVQIWNYETQTMVKSFEVSE-VPVRAA---KFI--ARKNWIVTGSD------------DMQIRVFNYN----TGEK 89 (794)
T ss_pred eecCeeEEEecccceeeeeeeecc-cchhhh---eee--eccceEEEecC------------CceEEEEecc----ccee
Confidence 455789999999888888776543 222222 222 56789999975 3678888887 4555
Q ss_pred EEEEe-ecCcceEecc--ccCeEEEEeCC-eEEEEecCCceeeceeeecCccceEEEEEEe---CCEEEEeecCCcEEEE
Q 000944 937 LHKTQ-VEGIPLALCQ--FQGRLLAGIGP-VLRLYDLGKKRLLRKCENKLFPNTIVSINTY---RDRIYVGDIQESFHFC 1009 (1213)
Q Consensus 937 ~~~~~-~~g~V~ai~~--~~g~ll~~~g~-~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~---~~~I~vgD~~~Sv~~l 1009 (1213)
++..+ .+.-+.+|+. -..+++.+... .|++|+|+.+......|-- +..|+.++... .|...-+-+-+-|-+.
T Consensus 90 V~~FeAH~DyIR~iavHPt~P~vLtsSDDm~iKlW~we~~wa~~qtfeG-H~HyVMqv~fnPkD~ntFaS~sLDrTVKVW 168 (794)
T KOG0276|consen 90 VKTFEAHSDYIRSIAVHPTLPYVLTSSDDMTIKLWDWENEWACEQTFEG-HEHYVMQVAFNPKDPNTFASASLDRTVKVW 168 (794)
T ss_pred eEEeeccccceeeeeecCCCCeEEecCCccEEEEeeccCceeeeeEEcC-cceEEEEEEecCCCccceeeeeccccEEEE
Confidence 55543 3566666544 46788888776 6889999977655444443 56688877653 4677777777888887
Q ss_pred EEeccCCeEEEeeccCCCcceEEEEeecCC---eeeeecCCCcEEEEecCC
Q 000944 1010 KYRRDENQLYIFADDSVPRWLTAAHHIDFD---TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1010 ~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~---~~l~~D~~gnl~il~~~~ 1057 (1213)
++-.....+.+-| +.+.+.++.+...+ .++.+-.+.-+-+.+|..
T Consensus 169 slgs~~~nfTl~g---HekGVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQt 216 (794)
T KOG0276|consen 169 SLGSPHPNFTLEG---HEKGVNCVDYYTGGDKPYLISGADDLTIKVWDYQT 216 (794)
T ss_pred EcCCCCCceeeec---cccCcceEEeccCCCcceEEecCCCceEEEeecch
Confidence 7754444455533 45667777776544 455555566777777653
No 74
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=88.23 E-value=6.9 Score=44.08 Aligned_cols=111 Identities=12% Similarity=0.158 Sum_probs=71.8
Q ss_pred ecCcceEeccccCeEEEEeCC-eEEEEecC-----CceeeceeeecCccceEEEEEEeC-CEEEEeecC--CcEEEEEEe
Q 000944 942 VEGIPLALCQFQGRLLAGIGP-VLRLYDLG-----KKRLLRKCENKLFPNTIVSINTYR-DRIYVGDIQ--ESFHFCKYR 1012 (1213)
Q Consensus 942 ~~g~V~ai~~~~g~ll~~~g~-~l~i~~~~-----~~~l~~~~~~~~~~~~i~~l~~~~-~~I~vgD~~--~Sv~~l~~~ 1012 (1213)
-.+++-.+...+|+|+.|+++ .+.+|... ..+|...+... +.+.+.-.... +++..|-.. +-+-+ |+
T Consensus 104 ~~~~I~gl~~~dg~Litc~~sG~l~~~~~k~~d~hss~l~~la~g~--g~~~~r~~~~~p~Iva~GGke~~n~lki--wd 179 (412)
T KOG3881|consen 104 GTKSIKGLKLADGTLITCVSSGNLQVRHDKSGDLHSSKLIKLATGP--GLYDVRQTDTDPYIVATGGKENINELKI--WD 179 (412)
T ss_pred ccccccchhhcCCEEEEEecCCcEEEEeccCCccccccceeeecCC--ceeeeccCCCCCceEecCchhcccceee--ee
Confidence 357888889999999999985 78899887 45566665543 34444443333 344446555 22333 33
Q ss_pred ccCCeEEEeeccC--------CCcceEEEEeecCC---eeeeecCCCcEEEEecC
Q 000944 1013 RDENQLYIFADDS--------VPRWLTAAHHIDFD---TMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1013 ~~~~~l~~~a~D~--------~~~~~~~~~~ld~~---~~l~~D~~gnl~il~~~ 1056 (1213)
.+..+-+.-||.. .|.|.+++.|++.+ .|+.+.+.+.+++|+-.
T Consensus 180 le~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~ 234 (412)
T KOG3881|consen 180 LEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYHQVRLYDTR 234 (412)
T ss_pred cccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecceeEEEecCc
Confidence 3333333334332 68899999999772 58889999999998654
No 75
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=88.12 E-value=6.9 Score=41.36 Aligned_cols=91 Identities=12% Similarity=0.112 Sum_probs=64.0
Q ss_pred eeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 859 VSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 859 ~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
.+++.++|+.++..+.+|+++-+ +.+.+|. .+++++|-|-. -+.++.|+...+. .+.. +
T Consensus 204 gssV~Fwdaksf~~lKs~k~P~n-----V~SASL~--P~k~~fVaGge------------d~~~~kfDy~Tge-Ei~~-~ 262 (334)
T KOG0278|consen 204 GSSVKFWDAKSFGLLKSYKMPCN-----VESASLH--PKKEFFVAGGE------------DFKVYKFDYNTGE-EIGS-Y 262 (334)
T ss_pred CceeEEeccccccceeeccCccc-----ccccccc--CCCceEEecCc------------ceEEEEEeccCCc-eeee-c
Confidence 46899999999999999888754 3334453 44577776642 2456777766541 2222 4
Q ss_pred EEeecCcceEeccccCeEEEEeCC---eEEEEecC
Q 000944 939 KTQVEGIPLALCQFQGRLLAGIGP---VLRLYDLG 970 (1213)
Q Consensus 939 ~~~~~g~V~ai~~~~g~ll~~~g~---~l~i~~~~ 970 (1213)
.+...|||.|+.---+..+.|+|+ .|++|+..
T Consensus 263 nkgh~gpVhcVrFSPdGE~yAsGSEDGTirlWQt~ 297 (334)
T KOG0278|consen 263 NKGHFGPVHCVRFSPDGELYASGSEDGTIRLWQTT 297 (334)
T ss_pred ccCCCCceEEEEECCCCceeeccCCCceEEEEEec
Confidence 567789999998777777888886 79999875
No 76
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=87.97 E-value=52 Score=37.14 Aligned_cols=161 Identities=15% Similarity=0.106 Sum_probs=91.4
Q ss_pred EEEEEEeecCcceEeccccCeEEEEeCCeEEEEecCCceeeceeeec--C-ccceEEEEEEeCCEEEEeecCCcEEEEEE
Q 000944 935 ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENK--L-FPNTIVSINTYRDRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 935 ~~~~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~~~~~~l~~~~~~~--~-~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~ 1011 (1213)
..++...++.+|.++..-..+|+|+.-.+||||++.+=+++.--... . .+..+.+.+..+.|+..=+...+=.++-|
T Consensus 79 ~~ICe~~fpt~IL~VrmNr~RLvV~Lee~IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~ 158 (391)
T KOG2110|consen 79 TTICEIFFPTSILAVRMNRKRLVVCLEESIYIYDIKDMKLLHTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLF 158 (391)
T ss_pred ceEEEEecCCceEEEEEccceEEEEEcccEEEEecccceeehhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEE
Confidence 34555677899999999999999999999999999887665332111 0 12334455556678888777776555556
Q ss_pred eccC-CeEEEeeccCCCcceEEEEeecCCeeee--ecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCcccceee
Q 000944 1012 RRDE-NQLYIFADDSVPRWLTAAHHIDFDTMAG--ADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEE 1088 (1213)
Q Consensus 1012 ~~~~-~~l~~~a~D~~~~~~~~~~~ld~~~~l~--~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 1088 (1213)
+... .....+. .+.-.+.+..|=.++++++ +|+---|++|..+. |++
T Consensus 159 d~~nl~~v~~I~--aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~-------------------------G~k--- 208 (391)
T KOG2110|consen 159 DTINLQPVNTIN--AHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPE-------------------------GQK--- 208 (391)
T ss_pred EcccceeeeEEE--ecCCceeEEEECCCCCEEEEeccCceEEEEEEcCC-------------------------ccE---
Confidence 5432 1112221 1111233334433444432 34444567776643 334
Q ss_pred eeeeecCceeceEEEeeecCCCccEEEEEecccceEEE
Q 000944 1089 IVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAM 1126 (1213)
Q Consensus 1089 ~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l 1126 (1213)
.-+|.-|....++...++.+.. ..+...+..|+|..+
T Consensus 209 l~eFRRG~~~~~IySL~Fs~ds-~~L~~sS~TeTVHiF 245 (391)
T KOG2110|consen 209 LYEFRRGTYPVSIYSLSFSPDS-QFLAASSNTETVHIF 245 (391)
T ss_pred eeeeeCCceeeEEEEEEECCCC-CeEEEecCCCeEEEE
Confidence 3456666666666655554432 233344456666644
No 77
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.87 E-value=18 Score=44.65 Aligned_cols=175 Identities=11% Similarity=0.150 Sum_probs=114.8
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
+++|+++...+++..|. .++.|+|++.-+.+ .+|++=|.- -|++-+|.|.+ -++++-.
T Consensus 391 TVRLWh~~~~~CL~~F~--HndfVTcVaFnPvD----DryFiSGSL------------D~KvRiWsI~d----~~Vv~W~ 448 (712)
T KOG0283|consen 391 TVRLWHPGRKECLKVFS--HNDFVTCVAFNPVD----DRYFISGSL------------DGKVRLWSISD----KKVVDWN 448 (712)
T ss_pred cEEeecCCCcceeeEEe--cCCeeEEEEecccC----CCcEeeccc------------ccceEEeecCc----CeeEeeh
Confidence 79999999888887654 55789998764442 478887764 36788899874 4677778
Q ss_pred eecCcceEeccc--cCeEEE-EeCCeEEEEecCCceeeceeeecC------ccceEEEEEEe---CCEEEEeecCCcEEE
Q 000944 941 QVEGIPLALCQF--QGRLLA-GIGPVLRLYDLGKKRLLRKCENKL------FPNTIVSINTY---RDRIYVGDIQESFHF 1008 (1213)
Q Consensus 941 ~~~g~V~ai~~~--~g~ll~-~~g~~l~i~~~~~~~l~~~~~~~~------~~~~i~~l~~~---~~~I~vgD~~~Sv~~ 1008 (1213)
+.+..|+|+|-. +.+.|+ +.....++|+-.+.+++..-.... -+-.||.+... .+.|+|...---|-+
T Consensus 449 Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSnDSrIRI 528 (712)
T KOG0283|consen 449 DLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVLVTSNDSRIRI 528 (712)
T ss_pred hhhhhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEEEecCCCceEE
Confidence 899999999876 234455 556788899888766653221110 01257777764 356888766666666
Q ss_pred EEEeccCCeEEEeeccCCCc-ceEEEEee-cCCeeeeecCCCcEEEEecCCCC
Q 000944 1009 CKYRRDENQLYIFADDSVPR-WLTAAHHI-DFDTMAGADKFGNIYFVRLPQDV 1059 (1213)
Q Consensus 1009 l~~~~~~~~l~~~a~D~~~~-~~~~~~~l-d~~~~l~~D~~gnl~il~~~~~~ 1059 (1213)
+.- .+..|+..-|.+... .-..+.|. |...|+.+-.+.++++.++++..
T Consensus 529 ~d~--~~~~lv~KfKG~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYiW~~~~~~ 579 (712)
T KOG0283|consen 529 YDG--RDKDLVHKFKGFRNTSSQISASFSSDGKHIVSASEDSWVYIWKNDSFN 579 (712)
T ss_pred Eec--cchhhhhhhcccccCCcceeeeEccCCCEEEEeecCceEEEEeCCCCc
Confidence 433 234444444443322 22344554 55577777789999999987643
No 78
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=87.55 E-value=34 Score=41.36 Aligned_cols=135 Identities=11% Similarity=0.146 Sum_probs=81.4
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCeEEEEeC--CeEEEEecCCceeeceeeecCccceEEEEEEeCC
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGRLLAGIG--PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRD 995 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~ll~~~g--~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~ 995 (1213)
-+.|.+|++.+.++-++.+. .....|++++-. .|.++++.+ ..+++|+....+..++-....-+...++....++
T Consensus 224 D~tiriwd~~~~~~~~~~l~--gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~ 301 (456)
T KOG0266|consen 224 DKTLRIWDLKDDGRNLKTLK--GHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGN 301 (456)
T ss_pred CceEEEeeccCCCeEEEEec--CCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCCceEEEEECCCCC
Confidence 35688999954433444443 677888887654 455655443 4899999997666555333211333334445688
Q ss_pred EEEEeecCCcEEEEEEeccCCe---EEEeeccCCCcceEEEEee-cCCeeeeecCCCcEEEEecCC
Q 000944 996 RIYVGDIQESFHFCKYRRDENQ---LYIFADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 996 ~I~vgD~~~Sv~~l~~~~~~~~---l~~~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~~~ 1057 (1213)
+|+.|+...=+.+ |+..... +..+..+..+..++.+.|= +...++.+=.++.+.++++..
T Consensus 302 ~l~s~s~d~~i~v--wd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~ 365 (456)
T KOG0266|consen 302 LLVSASYDGTIRV--WDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRS 365 (456)
T ss_pred EEEEcCCCccEEE--EECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccC
Confidence 9999976555555 5554444 3455555555455666663 334556666666777777654
No 79
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=87.22 E-value=2.3 Score=46.18 Aligned_cols=83 Identities=17% Similarity=0.218 Sum_probs=60.1
Q ss_pred CCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE-EeecCcceEeccc--cC--
Q 000944 880 DNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK-TQVEGIPLALCQF--QG-- 954 (1213)
Q Consensus 880 ~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~-~~~~g~V~ai~~~--~g-- 954 (1213)
.++...+..+|... .-..++|+||.- |+.+-.++.++|+.+++++|+..+.+ -+...||++|+-- -|
T Consensus 167 ~~~~~~~CvsWn~s-r~~~p~iAvgs~-------e~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~ 238 (361)
T KOG2445|consen 167 KNKQPCFCVSWNPS-RMHEPLIAVGSD-------EDAPHLNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRS 238 (361)
T ss_pred cccCcceEEeeccc-cccCceEEEEcc-------cCCccccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCc
Confidence 45555556677733 245699999984 78899999999999999888776654 3678999998653 23
Q ss_pred e--EEEEeCCeEEEEecC
Q 000944 955 R--LLAGIGPVLRLYDLG 970 (1213)
Q Consensus 955 ~--ll~~~g~~l~i~~~~ 970 (1213)
+ |.+|.+.-|+||++.
T Consensus 239 y~~lAvA~kDgv~I~~v~ 256 (361)
T KOG2445|consen 239 YHLLAVATKDGVRIFKVK 256 (361)
T ss_pred eeeEEEeecCcEEEEEEe
Confidence 2 344555559999886
No 80
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=86.49 E-value=57 Score=40.15 Aligned_cols=168 Identities=13% Similarity=0.141 Sum_probs=105.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
+++++|-.++++...+. -..+.+.||... ++..++-+... .+++|++..+ ..+.++..
T Consensus 272 t~rvWd~~sg~C~~~l~-gh~stv~~~~~~--------~~~~~sgs~D~-----------tVkVW~v~n~-~~l~l~~~- 329 (537)
T KOG0274|consen 272 TERVWDCSTGECTHSLQ-GHTSSVRCLTID--------PFLLVSGSRDN-----------TVKVWDVTNG-ACLNLLRG- 329 (537)
T ss_pred cEEeEecCCCcEEEEec-CCCceEEEEEcc--------CceEeeccCCc-----------eEEEEeccCc-ceEEEecc-
Confidence 78999999999888777 344777777652 34444432222 3678888743 35555544
Q ss_pred eecCcceEeccccCeEEEEeCC-eEEEEecCCceeeceeeecCccceEEEEEEeC-CEEEEeecCCcEEEEEEeccCCeE
Q 000944 941 QVEGIPLALCQFQGRLLAGIGP-VLRLYDLGKKRLLRKCENKLFPNTIVSINTYR-DRIYVGDIQESFHFCKYRRDENQL 1018 (1213)
Q Consensus 941 ~~~g~V~ai~~~~g~ll~~~g~-~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~-~~I~vgD~~~Sv~~l~~~~~~~~l 1018 (1213)
..++|.++..-++.++++.-. .|.+|+....+.+..-. .+.-.|.++.+.. ++++=|=+.+++.+..+.... +.
T Consensus 330 -h~~~V~~v~~~~~~lvsgs~d~~v~VW~~~~~~cl~sl~--gH~~~V~sl~~~~~~~~~Sgs~D~~IkvWdl~~~~-~c 405 (537)
T KOG0274|consen 330 -HTGPVNCVQLDEPLLVSGSYDGTVKVWDPRTGKCLKSLS--GHTGRVYSLIVDSENRLLSGSLDTTIKVWDLRTKR-KC 405 (537)
T ss_pred -ccccEEEEEecCCEEEEEecCceEEEEEhhhceeeeeec--CCcceEEEEEecCcceEEeeeeccceEeecCCchh-hh
Confidence 889999998888888886654 68999998666544432 2345788887777 888888777888885554321 11
Q ss_pred EEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecC
Q 000944 1019 YIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1019 ~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~ 1056 (1213)
...- ..+..++..-.+....++.+-.+|-|.+.+..
T Consensus 406 ~~tl--~~h~~~v~~l~~~~~~Lvs~~aD~~Ik~WD~~ 441 (537)
T KOG0274|consen 406 IHTL--QGHTSLVSSLLLRDNFLVSSSADGTIKLWDAE 441 (537)
T ss_pred hhhh--cCCcccccccccccceeEeccccccEEEeecc
Confidence 1110 11222222223343456667778888887543
No 81
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=86.29 E-value=42 Score=41.98 Aligned_cols=200 Identities=18% Similarity=0.250 Sum_probs=99.5
Q ss_pred cEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeec-cCc-ceEEEEeeecCCCceeeeEEEEEEeCC----cEEEE
Q 000944 541 TIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHE-MSG-DVACLDIASVPEGRKRSRFLAVGSYDN----TIRIL 614 (1213)
Q Consensus 541 ~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~-l~~-~is~i~i~~~~~~~~~~~~l~v~~~~~----~i~i~ 614 (1213)
.|++++....-|++...+|.++.|. .++......+ .+. .++.+.+.. +...++.+|-..+ .+.||
T Consensus 27 ~isc~~s~~~~vvigt~~G~V~~Ln----~s~~~~~~fqa~~~siv~~L~~~~-----~~~~L~sv~Ed~~~np~llkiw 97 (933)
T KOG2114|consen 27 AISCCSSSTGSVVIGTADGRVVILN----SSFQLIRGFQAYEQSIVQFLYILN-----KQNFLFSVGEDEQGNPVLLKIW 97 (933)
T ss_pred ceeEEcCCCceEEEeeccccEEEec----ccceeeehheecchhhhhHhhccc-----CceEEEEEeecCCCCceEEEEe
Confidence 5788887888888888888888763 1222211111 111 133333321 1234455555533 67889
Q ss_pred EeCCC---C---ceeEeEEe--ecC--CCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCccccc
Q 000944 615 SLDPD---D---CMQILSVQ--SVS--SPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDS 684 (1213)
Q Consensus 615 sl~p~---~---~l~~~~~~--~l~--~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~ 684 (1213)
++++- + ++...... .-| ..|.|...+.- .-..+.||..||.++.|.=|.....-...
T Consensus 98 ~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs~-------------~l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~ 164 (933)
T KOG2114|consen 98 DLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVSE-------------DLKTIVCGFTNGLVICYKGDILRDRGSRQ 164 (933)
T ss_pred cccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEEc-------------cccEEEEEecCcEEEEEcCcchhccccce
Confidence 98653 1 11111111 122 23554433331 34568899999999999755432211112
Q ss_pred ceeeecCCCCeEEEEEECCeeEEEEecCccEEEEEeCCeE-EEEecCccccceeec-cccCCCCceEEEEeCCeEEEEEE
Q 000944 685 RSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRF-LLTPLSYETLEYAAS-FSSDQCVEGVVSVAGNALRVFTI 762 (1213)
Q Consensus 685 ~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~~i~~~~~~~-~~~~~~~~~v~~~~~-f~~~~~~~~~i~~~~~~L~i~~l 762 (1213)
+...-|..|+.=..+...+...+|+....-..+|.-.|+. ...-++...+.-.|. |+.. ...|+++.++.|.+...
T Consensus 165 ~~~~~~~~pITgL~~~~d~~s~lFv~Tt~~V~~y~l~gr~p~~~~ld~~G~~lnCss~~~~--t~qfIca~~e~l~fY~s 242 (933)
T KOG2114|consen 165 DYSHRGKEPITGLALRSDGKSVLFVATTEQVMLYSLSGRTPSLKVLDNNGISLNCSSFSDG--TYQFICAGSEFLYFYDS 242 (933)
T ss_pred eeeccCCCCceeeEEecCCceeEEEEecceeEEEEecCCCcceeeeccCCccceeeecCCC--CccEEEecCceEEEEcC
Confidence 2234566676644445555555677666555555432332 222122222211111 1111 12467777777777766
Q ss_pred cc
Q 000944 763 ER 764 (1213)
Q Consensus 763 ~~ 764 (1213)
+.
T Consensus 243 d~ 244 (933)
T KOG2114|consen 243 DG 244 (933)
T ss_pred CC
Confidence 64
No 82
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=86.21 E-value=14 Score=43.67 Aligned_cols=103 Identities=21% Similarity=0.199 Sum_probs=72.6
Q ss_pred CeEEEEeCCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccC--CeEEEeec-cC-----
Q 000944 954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDE--NQLYIFAD-DS----- 1025 (1213)
Q Consensus 954 g~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~--~~l~~~a~-D~----- 1025 (1213)
+.+++|.|..||=+++++.+++.--..+.-+..+++++....+|.+|+--..|-++.-+... .+|..... +.
T Consensus 147 Dly~~gsg~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~~ 226 (703)
T KOG2321|consen 147 DLYLVGSGSEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGGD 226 (703)
T ss_pred cEEEeecCcceEEEEccccccccccccccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccccCCCcccc
Confidence 56899999999999999887665433332356788889999999999988888885322211 22322221 11
Q ss_pred CCcceEEEEeecCC-eeeeecCCCcEEEEecC
Q 000944 1026 VPRWLTAAHHIDFD-TMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1026 ~~~~~~~~~~ld~~-~~l~~D~~gnl~il~~~ 1056 (1213)
....+|++.|=|++ ++.++-..|.+++|++-
T Consensus 227 ~~~svTal~F~d~gL~~aVGts~G~v~iyDLR 258 (703)
T KOG2321|consen 227 AAPSVTALKFRDDGLHVAVGTSTGSVLIYDLR 258 (703)
T ss_pred ccCcceEEEecCCceeEEeeccCCcEEEEEcc
Confidence 23458888888765 67889999999999864
No 83
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=86.13 E-value=50 Score=35.00 Aligned_cols=88 Identities=17% Similarity=0.258 Sum_probs=51.9
Q ss_pred eEEEEEeCCCCceEEEEEc-CCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLEL-QDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~-~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
|+++|+|..|++.+.+|.= ..-|.=+-.+ |. +....++=|. | -|.+|+|++.+. +++.
T Consensus 205 stlrLlDk~tGklL~sYkGhkn~eykldc~---l~--qsdthV~sgS--------E----DG~Vy~wdLvd~----~~~s 263 (307)
T KOG0316|consen 205 STLRLLDKETGKLLKSYKGHKNMEYKLDCC---LN--QSDTHVFSGS--------E----DGKVYFWDLVDE----TQIS 263 (307)
T ss_pred ceeeecccchhHHHHHhcccccceeeeeee---ec--ccceeEEecc--------C----CceEEEEEeccc----eeee
Confidence 5899999999998877643 2223322211 11 2334444443 2 489999999865 2333
Q ss_pred EEeecCcc--eEeccc--cCeEEEEeCCeEEEEe
Q 000944 939 KTQVEGIP--LALCQF--QGRLLAGIGPVLRLYD 968 (1213)
Q Consensus 939 ~~~~~g~V--~ai~~~--~g~ll~~~g~~l~i~~ 968 (1213)
+..+.+.| +.+.-. ...++.|.|+..+.|.
T Consensus 264 k~~~~~~v~v~dl~~hp~~~~f~~A~~~~~~~~~ 297 (307)
T KOG0316|consen 264 KLSVVSTVIVTDLSCHPTMDDFITATGHGDLFWY 297 (307)
T ss_pred eeccCCceeEEeeecccCccceeEecCCceecee
Confidence 44443333 444333 5678888888776653
No 84
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=85.90 E-value=1.2e+02 Score=39.04 Aligned_cols=171 Identities=15% Similarity=0.271 Sum_probs=109.8
Q ss_pred EEEEEEEeCCceEEEEEEEee--cCcceEeccc------cCeEEEEeC-----------CeEEEEecCC-ceeeceeeec
Q 000944 922 IHIYRFVEEGKSLELLHKTQV--EGIPLALCQF------QGRLLAGIG-----------PVLRLYDLGK-KRLLRKCENK 981 (1213)
Q Consensus 922 i~v~~i~~~~~kl~~~~~~~~--~g~V~ai~~~------~g~ll~~~g-----------~~l~i~~~~~-~~l~~~~~~~ 981 (1213)
+..+++-+. ..++.++..+. .-.+.+|+.. +-++++|.| ..|.+|++.+ ++|..+++..
T Consensus 749 ~s~l~vlD~-nTf~vl~~hef~~~E~~~Si~s~~~~~d~~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~ 827 (1096)
T KOG1897|consen 749 VSFLRVLDQ-NTFEVLSSHEFERNETALSIISCKFTDDPNTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETV 827 (1096)
T ss_pred EEEEEEecC-CceeEEeeccccccceeeeeeeeeecCCCceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeee
Confidence 334444332 26777766555 4556666543 336777764 3678899987 7787777766
Q ss_pred CccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEee-cCCeeeeecCCCcEEEEecCCCCC
Q 000944 982 LFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRLPQDVS 1060 (1213)
Q Consensus 982 ~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~~~~~~ 1060 (1213)
+.-.+.+|..++.++++| +-.++.+++|..+ +.|..-++-..+ .++..+. ..+.|+++|--+.+.++.|..+.
T Consensus 828 -v~Gav~aL~~fngkllA~-In~~vrLye~t~~-~eLr~e~~~~~~--~~aL~l~v~gdeI~VgDlm~Sitll~y~~~e- 901 (1096)
T KOG1897|consen 828 -VKGAVYALVEFNGKLLAG-INQSVRLYEWTTE-RELRIECNISNP--IIALDLQVKGDEIAVGDLMRSITLLQYKGDE- 901 (1096)
T ss_pred -eccceeehhhhCCeEEEe-cCcEEEEEEcccc-ceehhhhcccCC--eEEEEEEecCcEEEEeeccceEEEEEEeccC-
Confidence 344567888888888776 7889999999764 344443333323 4455543 44679999999999999997621
Q ss_pred cccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceEEEE
Q 000944 1061 DEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAML 1127 (1213)
Q Consensus 1061 ~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l~ 1127 (1213)
..+..+|....+.-.++.... +....+.+..+|-+..+.
T Consensus 902 -----------------------g~f~evArD~~p~Wmtaveil-----~~d~ylgae~~gNlf~v~ 940 (1096)
T KOG1897|consen 902 -----------------------GNFEEVARDYNPNWMTAVEIL-----DDDTYLGAENSGNLFTVR 940 (1096)
T ss_pred -----------------------CceEEeehhhCccceeeEEEe-----cCceEEeecccccEEEEE
Confidence 245666666666655544331 234566777888888554
No 85
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=85.50 E-value=21 Score=41.21 Aligned_cols=108 Identities=18% Similarity=0.215 Sum_probs=69.0
Q ss_pred EEEecCCEEEEEEe-CCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCce
Q 000944 544 KVGSNRLQVVIALS-GGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCM 622 (1213)
Q Consensus 544 ~as~~~~~v~v~~s-~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l 622 (1213)
..+++..|++.+.. ...|.+.++....++ ..+..++..+.|++..+ ...|++.|+-.+.+++|.+.....|
T Consensus 45 l~~l~~~yllsaq~~rp~l~vw~i~k~~~~--~q~~v~Pg~v~al~s~n------~G~~l~ag~i~g~lYlWelssG~LL 116 (476)
T KOG0646|consen 45 LTALNNEYLLSAQLKRPLLHVWEILKKDQV--VQYIVLPGPVHALASSN------LGYFLLAGTISGNLYLWELSSGILL 116 (476)
T ss_pred hhhhchhheeeecccCccccccccCchhhh--hhhcccccceeeeecCC------CceEEEeecccCcEEEEEeccccHH
Confidence 34455677777643 335556555432222 13455678899998865 4689999989999999999544333
Q ss_pred eEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 623 QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 623 ~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
..++ +.-.++-...+.. +..+++-|..||.++.|.+.
T Consensus 117 ~v~~-----aHYQ~ITcL~fs~-----------dgs~iiTgskDg~V~vW~l~ 153 (476)
T KOG0646|consen 117 NVLS-----AHYQSITCLKFSD-----------DGSHIITGSKDGAVLVWLLT 153 (476)
T ss_pred HHHH-----hhccceeEEEEeC-----------CCcEEEecCCCccEEEEEEE
Confidence 2221 2223444445432 46689999999999998764
No 86
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=85.10 E-value=86 Score=38.06 Aligned_cols=146 Identities=18% Similarity=0.253 Sum_probs=84.7
Q ss_pred EEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc-------CeE-E
Q 000944 886 SICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ-------GRL-L 957 (1213)
Q Consensus 886 s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~-------g~l-l 957 (1213)
+.|+|+= ..+|+++|-+. |.|-+-.-. +.-|.++--.---+.||.+|+-.. +-+ +
T Consensus 136 ~~CsWtn----DGqylalG~~n------------GTIsiRNk~-gEek~~I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV 198 (1081)
T KOG1538|consen 136 ICCSWTN----DGQYLALGMFN------------GTISIRNKN-GEEKVKIERPGGSNSPIWSICWNPSSGEGRNDILAV 198 (1081)
T ss_pred EEeeecC----CCcEEEEeccC------------ceEEeecCC-CCcceEEeCCCCCCCCceEEEecCCCCCCccceEEE
Confidence 3456663 34899999763 444222110 111333222222356777765431 223 3
Q ss_pred EEeCCeEEEEecCCceeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEe
Q 000944 958 AGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHH 1035 (1213)
Q Consensus 958 ~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ 1035 (1213)
+-=|+++..|.++.+.+-+... ++....+|+ ..+.|+++|-.-+.+.+ |..++-+|--++. ...|..++..
T Consensus 199 ~DW~qTLSFy~LsG~~Igk~r~---L~FdP~CisYf~NGEy~LiGGsdk~L~~--fTR~GvrLGTvg~--~D~WIWtV~~ 271 (1081)
T KOG1538|consen 199 ADWGQTLSFYQLSGKQIGKDRA---LNFDPCCISYFTNGEYILLGGSDKQLSL--FTRDGVRLGTVGE--QDSWIWTVQA 271 (1081)
T ss_pred EeccceeEEEEecceeeccccc---CCCCchhheeccCCcEEEEccCCCceEE--EeecCeEEeeccc--cceeEEEEEE
Confidence 3447788888887665532222 222233444 56899999999999998 5566777777776 5566666654
Q ss_pred ecC-CeeeeecCCCcEEEEec
Q 000944 1036 IDF-DTMAGADKFGNIYFVRL 1055 (1213)
Q Consensus 1036 ld~-~~~l~~D~~gnl~il~~ 1055 (1213)
=-+ .++..+=.+|.|..|..
T Consensus 272 ~PNsQ~v~~GCqDGTiACyNl 292 (1081)
T KOG1538|consen 272 KPNSQYVVVGCQDGTIACYNL 292 (1081)
T ss_pred ccCCceEEEEEccCeeehhhh
Confidence 333 35666777888877764
No 87
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=84.82 E-value=69 Score=35.49 Aligned_cols=98 Identities=11% Similarity=0.066 Sum_probs=51.6
Q ss_pred EEEEEeCCCCceEEEEEcCCCc----eEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNE----AAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLEL 936 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E----~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~ 936 (1213)
.+.++|..+++.+..+.+.... ... ...+.+. ....+++++.+ ..+++.+|++... ++.
T Consensus 180 ~v~i~d~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~s--~dg~~~~~~~~-----------~~~~i~v~d~~~~--~~~- 242 (300)
T TIGR03866 180 TVSVIDVATRKVIKKITFEIPGVHPEAVQ-PVGIKLT--KDGKTAFVALG-----------PANRVAVVDAKTY--EVL- 242 (300)
T ss_pred EEEEEEcCcceeeeeeeecccccccccCC-ccceEEC--CCCCEEEEEcC-----------CCCeEEEEECCCC--cEE-
Confidence 5778888887777666553211 110 0112232 22234444432 1357888887532 322
Q ss_pred EEEEeecCcceEecc--ccCeEEEEe--CCeEEEEecCCceeec
Q 000944 937 LHKTQVEGIPLALCQ--FQGRLLAGI--GPVLRLYDLGKKRLLR 976 (1213)
Q Consensus 937 ~~~~~~~g~V~ai~~--~~g~ll~~~--g~~l~i~~~~~~~l~~ 976 (1213)
......+.+.+++- -+.+|+++. .++|.+|++...+.+.
T Consensus 243 -~~~~~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~~~~~ 285 (300)
T TIGR03866 243 -DYLLVGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAALKVIK 285 (300)
T ss_pred -EEEEeCCCcceEEECCCCCEEEEEcCCCCeEEEEECCCCcEEE
Confidence 11223456766653 334566654 3689999998766533
No 88
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=84.60 E-value=28 Score=42.70 Aligned_cols=170 Identities=14% Similarity=0.128 Sum_probs=103.1
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
++++++-.+...+....= ..+.|.| +.+. ..+++.|+. .|.|.+|++... ++-...+
T Consensus 312 tVkVW~v~n~~~l~l~~~-h~~~V~~---v~~~----~~~lvsgs~------------d~~v~VW~~~~~--~cl~sl~- 368 (537)
T KOG0274|consen 312 TVKVWDVTNGACLNLLRG-HTGPVNC---VQLD----EPLLVSGSY------------DGTVKVWDPRTG--KCLKSLS- 368 (537)
T ss_pred eEEEEeccCcceEEEecc-ccccEEE---EEec----CCEEEEEec------------CceEEEEEhhhc--eeeeeec-
Confidence 788888777766654432 3444444 4442 578998986 578999998843 3222222
Q ss_pred eecCcceEecccc-CeEEEEe-CCeEEEEecCCc-eeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCe
Q 000944 941 QVEGIPLALCQFQ-GRLLAGI-GPVLRLYDLGKK-RLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQ 1017 (1213)
Q Consensus 941 ~~~g~V~ai~~~~-g~ll~~~-g~~l~i~~~~~~-~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~ 1017 (1213)
-..+.|+++..-. .+++-+. -..|++|++... +.+. .+......+.++...+++++-+-+-..|.+.... .+.
T Consensus 369 gH~~~V~sl~~~~~~~~~Sgs~D~~IkvWdl~~~~~c~~--tl~~h~~~v~~l~~~~~~Lvs~~aD~~Ik~WD~~--~~~ 444 (537)
T KOG0274|consen 369 GHTGRVYSLIVDSENRLLSGSLDTTIKVWDLRTKRKCIH--TLQGHTSLVSSLLLRDNFLVSSSADGTIKLWDAE--EGE 444 (537)
T ss_pred CCcceEEEEEecCcceEEeeeeccceEeecCCchhhhhh--hhcCCcccccccccccceeEeccccccEEEeecc--cCc
Confidence 2679999984434 5555544 456999999877 4322 2333455566677777777766666667775433 333
Q ss_pred EEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecCCC
Q 000944 1018 LYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQD 1058 (1213)
Q Consensus 1018 l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~~ 1058 (1213)
......-.+.-.+++..+. ...++++=.+|.+.++.+...
T Consensus 445 ~~~~~~~~~~~~v~~l~~~-~~~il~s~~~~~~~l~dl~~~ 484 (537)
T KOG0274|consen 445 CLRTLEGRHVGGVSALALG-KEEILCSSDDGSVKLWDLRSG 484 (537)
T ss_pred eeeeeccCCcccEEEeecC-cceEEEEecCCeeEEEecccC
Confidence 3222222222334444433 467888899999999987664
No 89
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=84.02 E-value=45 Score=36.94 Aligned_cols=176 Identities=14% Similarity=0.238 Sum_probs=97.7
Q ss_pred EEEEEeCCCCceEEEEEc-CCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE-
Q 000944 861 CIRVLDPRSANTTCLLEL-QDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH- 938 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~-~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~- 938 (1213)
-|.++|.-+++.-++|.. +.-+++++..++.|..+ .++|..|= +--|.+|++...|++.....
T Consensus 134 PIh~wdaftG~lraSy~~ydh~de~taAhsL~Fs~D--GeqlfaGy-------------krcirvFdt~RpGr~c~vy~t 198 (406)
T KOG2919|consen 134 PIHLWDAFTGKLRASYRAYDHQDEYTAAHSLQFSPD--GEQLFAGY-------------KRCIRVFDTSRPGRDCPVYTT 198 (406)
T ss_pred ceeeeeccccccccchhhhhhHHhhhhheeEEecCC--CCeEeecc-------------cceEEEeeccCCCCCCcchhh
Confidence 478888888888777754 44467777777788642 35565553 33578999988776544322
Q ss_pred ----EEeecCcceEe--ccccC-eEEE-EeCCeEEEEecCCceeeceeeecCccceEEEEEEe--CCEEEEeecCCcEEE
Q 000944 939 ----KTQVEGIPLAL--CQFQG-RLLA-GIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTY--RDRIYVGDIQESFHF 1008 (1213)
Q Consensus 939 ----~~~~~g~V~ai--~~~~g-~ll~-~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~Sv~~ 1008 (1213)
+.-+.|.+.++ .+.+. .+.+ +.|+.+-+|++++...+..-.- ..--|+.|... +|.++.| ++++-.+
T Consensus 199 ~~~~k~gq~giisc~a~sP~~~~~~a~gsY~q~~giy~~~~~~pl~llgg--h~gGvThL~~~edGn~lfsG-aRk~dkI 275 (406)
T KOG2919|consen 199 VTKGKFGQKGIISCFAFSPMDSKTLAVGSYGQRVGIYNDDGRRPLQLLGG--HGGGVTHLQWCEDGNKLFSG-ARKDDKI 275 (406)
T ss_pred hhcccccccceeeeeeccCCCCcceeeecccceeeeEecCCCCceeeecc--cCCCeeeEEeccCcCeeccc-ccCCCeE
Confidence 33446666544 33444 3444 5678999999998765444332 23357777642 4444433 2333333
Q ss_pred EEEecc--CCeEEEeecc---CCCcceEEEEeecC-Cee-eeecCCCcEEEEecCC
Q 000944 1009 CKYRRD--ENQLYIFADD---SVPRWLTAAHHIDF-DTM-AGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1009 l~~~~~--~~~l~~~a~D---~~~~~~~~~~~ld~-~~~-l~~D~~gnl~il~~~~ 1057 (1213)
+-|+-- ..-+..+.|. .+.|-... +|. +.+ ..+|.+|.+.+++...
T Consensus 276 l~WDiR~~~~pv~~L~rhv~~TNQRI~FD---ld~~~~~LasG~tdG~V~vwdlk~ 328 (406)
T KOG2919|consen 276 LCWDIRYSRDPVYALERHVGDTNQRILFD---LDPKGEILASGDTDGSVRVWDLKD 328 (406)
T ss_pred EEEeehhccchhhhhhhhccCccceEEEe---cCCCCceeeccCCCccEEEEecCC
Confidence 333311 1112222221 22221111 332 234 5578999999988654
No 90
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=83.75 E-value=1.2e+02 Score=37.21 Aligned_cols=103 Identities=18% Similarity=0.177 Sum_probs=61.1
Q ss_pred cCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCc-eeEeE
Q 000944 548 NRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC-MQILS 626 (1213)
Q Consensus 548 ~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~-l~~~~ 626 (1213)
.++.++-.-|.|.+.+..-. .|+|.+ .....+.+|-||+..+ ....++++.-|+.+.-|++.+... ....+
T Consensus 214 rd~tI~sgDS~G~V~FWd~~-~gTLiq-S~~~h~adVl~Lav~~------~~d~vfsaGvd~~ii~~~~~~~~~~wv~~~ 285 (691)
T KOG2048|consen 214 RDSTIASGDSAGTVTFWDSI-FGTLIQ-SHSCHDADVLALAVAD------NEDRVFSAGVDPKIIQYSLTTNKSEWVINS 285 (691)
T ss_pred ecCcEEEecCCceEEEEccc-Ccchhh-hhhhhhcceeEEEEcC------CCCeEEEccCCCceEEEEecCCccceeeec
Confidence 34555545456777776433 344443 2344567899998854 456788888899998888865422 22222
Q ss_pred EeecCC-CCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEE
Q 000944 627 VQSVSS-PPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVV 674 (1213)
Q Consensus 627 ~~~l~~-~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~ 674 (1213)
...+.. ..+++++.+ ..|+-|-+|+.|+....
T Consensus 286 ~r~~h~hdvrs~av~~----------------~~l~sgG~d~~l~i~~s 318 (691)
T KOG2048|consen 286 RRDLHAHDVRSMAVIE----------------NALISGGRDFTLAICSS 318 (691)
T ss_pred cccCCcccceeeeeec----------------ceEEecceeeEEEEccc
Confidence 222221 244554432 36778888888876544
No 91
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=83.74 E-value=1.4e+02 Score=38.01 Aligned_cols=113 Identities=13% Similarity=0.230 Sum_probs=72.8
Q ss_pred CCccEEEEEecCCEEEEEEeCCEEEEEEEccC--CCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 538 GKRTIVKVGSNRLQVVIALSGGELIYFEVDMT--GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 538 ~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~--~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
.|..|...+....+++.+..++.+..+.+++- +.+ +.+.+ .++.++++.. ...++++|..|-.|.+++
T Consensus 55 ~g~~v~~ia~~s~~f~~~s~~~tv~~y~fps~~~~~i--L~Rft--lp~r~~~v~g------~g~~iaagsdD~~vK~~~ 124 (933)
T KOG1274|consen 55 SGELVSSIACYSNHFLTGSEQNTVLRYKFPSGEEDTI--LARFT--LPIRDLAVSG------SGKMIAAGSDDTAVKLLN 124 (933)
T ss_pred cCceeEEEeecccceEEeeccceEEEeeCCCCCccce--eeeee--ccceEEEEec------CCcEEEeecCceeEEEEe
Confidence 45678888888888888888899999887742 212 23333 3455555532 467999999999999999
Q ss_pred eCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 616 LDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 616 l~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
++ |..-+.+. -..-..+.-+.+ .....+|-+...||.+..|.++.
T Consensus 125 ~~-D~s~~~~l----rgh~apVl~l~~-----------~p~~~fLAvss~dG~v~iw~~~~ 169 (933)
T KOG1274|consen 125 LD-DSSQEKVL----RGHDAPVLQLSY-----------DPKGNFLAVSSCDGKVQIWDLQD 169 (933)
T ss_pred cc-ccchheee----cccCCceeeeeE-----------cCCCCEEEEEecCceEEEEEccc
Confidence 84 32222111 011111222233 23567899999999999998864
No 92
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=83.43 E-value=82 Score=37.80 Aligned_cols=207 Identities=16% Similarity=0.205 Sum_probs=102.7
Q ss_pred CEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEee
Q 000944 550 LQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQS 629 (1213)
Q Consensus 550 ~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~ 629 (1213)
+.++|++ +..+++.. ..+|.+.++.... ...|+++...+ ...+|+||+.+|.++||+.........+..
T Consensus 189 n~laVal-g~~vylW~-~~s~~v~~l~~~~-~~~vtSv~ws~------~G~~LavG~~~g~v~iwD~~~~k~~~~~~~-- 257 (484)
T KOG0305|consen 189 NVLAVAL-GQSVYLWS-ASSGSVTELCSFG-EELVTSVKWSP------DGSHLAVGTSDGTVQIWDVKEQKKTRTLRG-- 257 (484)
T ss_pred CeEEEEe-cceEEEEe-cCCCceEEeEecC-CCceEEEEECC------CCCEEEEeecCCeEEEEehhhccccccccC--
Confidence 4455553 33444433 2245555554443 67899998854 468999999999999999843322221110
Q ss_pred cCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCccc---ccceeeecCCCCe-EEEEEECC-e
Q 000944 630 VSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLS---DSRSRFLGLRPPK-LFSVVVGG-R 704 (1213)
Q Consensus 630 l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~---~~~~~~lG~~pv~-l~~~~~~~-~ 704 (1213)
...++ +..+.. ....+.+|.++|.++.+.+........ ..+....|.+=-. ...+..+| .
T Consensus 258 -~h~~r-vg~laW-------------~~~~lssGsr~~~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnD 322 (484)
T KOG0305|consen 258 -SHASR-VGSLAW-------------NSSVLSSGSRDGKILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGND 322 (484)
T ss_pred -CcCce-eEEEec-------------cCceEEEecCCCcEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCCc
Confidence 00111 111222 356688999999999888754321111 0111111100000 00011222 2
Q ss_pred eEEEEecCccEEEEEeCCeEEEEecCcc-ccceeeccccCCCCceEEEEe----CCeEEEEEEccCCCeeEEEEEeCCCc
Q 000944 705 AAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA----GNALRVFTIERLGETFNETALPLRYT 779 (1213)
Q Consensus 705 ~~v~~~g~~p~~i~~~~~~~~~~~~~~~-~v~~~~~f~~~~~~~~~i~~~----~~~L~i~~l~~~~~~~~~r~i~l~~t 779 (1213)
+.+++.-.. .....+++...... ...+.||+.. +++..- +..++|-... ..-.++.+..+..
T Consensus 323 N~~~Iwd~~-----~~~p~~~~~~H~aAVKA~awcP~q~-----~lLAsGGGs~D~~i~fwn~~---~g~~i~~vdtgsQ 389 (484)
T KOG0305|consen 323 NVVFIWDGL-----SPEPKFTFTEHTAAVKALAWCPWQS-----GLLATGGGSADRCIKFWNTN---TGARIDSVDTGSQ 389 (484)
T ss_pred cceEeccCC-----CccccEEEeccceeeeEeeeCCCcc-----CceEEcCCCcccEEEEEEcC---CCcEecccccCCc
Confidence 233333110 00111222111110 1234456543 344332 2344443333 4567889999999
Q ss_pred cceeeecCCCceEEEE
Q 000944 780 PRRFVLQPKKKLMVII 795 (1213)
Q Consensus 780 p~~i~y~~~~~~~~v~ 795 (1213)
+..|+|.+..+-++..
T Consensus 390 VcsL~Wsk~~kEi~st 405 (484)
T KOG0305|consen 390 VCSLIWSKKYKELLST 405 (484)
T ss_pred eeeEEEcCCCCEEEEe
Confidence 9999999988765544
No 93
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=82.95 E-value=98 Score=35.79 Aligned_cols=39 Identities=21% Similarity=0.429 Sum_probs=26.1
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVG 904 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VG 904 (1213)
+++++|-...+-..+|.++++..+.+ +.|+ ....|+++|
T Consensus 412 ~V~lwDLRKl~n~kt~~l~~~~~v~s---~~fD--~SGt~L~~~ 450 (506)
T KOG0289|consen 412 SVKLWDLRKLKNFKTIQLDEKKEVNS---LSFD--QSGTYLGIA 450 (506)
T ss_pred eEEEEEehhhcccceeecccccccee---EEEc--CCCCeEEee
Confidence 58888877666667777777655544 3454 344788887
No 94
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=82.94 E-value=17 Score=43.33 Aligned_cols=102 Identities=17% Similarity=0.230 Sum_probs=64.7
Q ss_pred CceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEe--ccccCeEEEEeCCeEEEEecCCcee
Q 000944 897 HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLAL--CQFQGRLLAGIGPVLRLYDLGKKRL 974 (1213)
Q Consensus 897 ~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai--~~~~g~ll~~~g~~l~i~~~~~~~l 974 (1213)
+..|+++-.+-. ..-++++.++.+. ..+..-..-+|-|.++ .+...+|++|.-..|.+|++....|
T Consensus 532 kGDYlatV~~~~---------~~~~VliHQLSK~---~sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~~vRiYdL~kqel 599 (733)
T KOG0650|consen 532 KGDYLATVMPDS---------GNKSVLIHQLSKR---KSQSPFRKSKGLVQRVKFHPSKPYLFVATQRSVRIYDLSKQEL 599 (733)
T ss_pred CCceEEEeccCC---------CcceEEEEecccc---cccCchhhcCCceeEEEecCCCceEEEEeccceEEEehhHHHH
Confidence 347877654311 2345778888763 2222222346777665 4567899999999999999998777
Q ss_pred eceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEe
Q 000944 975 LRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 975 ~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
+++...- .-.+.+|. ..|+-+++|..-+-+..+..+
T Consensus 600 vKkL~tg--~kwiS~msihp~GDnli~gs~d~k~~WfDld 637 (733)
T KOG0650|consen 600 VKKLLTG--SKWISSMSIHPNGDNLILGSYDKKMCWFDLD 637 (733)
T ss_pred HHHHhcC--CeeeeeeeecCCCCeEEEecCCCeeEEEEcc
Confidence 6654221 12344444 457888999888877665544
No 95
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=82.76 E-value=91 Score=35.26 Aligned_cols=112 Identities=18% Similarity=0.327 Sum_probs=62.9
Q ss_pred CCccEEEEEecCCEEEEEEe--CCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 538 GKRTIVKVGSNRLQVVIALS--GGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 538 ~~~~I~~as~~~~~v~v~~s--~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
.+.+|++++.+.+-.++++. +|.+.++++...+ ....+..++.-|.-..- ...+.++++|+.||++-+|+
T Consensus 105 HKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~-----~~~~~~~e~~dieWl~W---Hp~a~illAG~~DGsvWmw~ 176 (399)
T KOG0296|consen 105 HKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGG-----EQWKLDQEVEDIEWLKW---HPRAHILLAGSTDGSVWMWQ 176 (399)
T ss_pred CCCceEEEEEccCceEEEecCCCccEEEEEcccCc-----eEEEeecccCceEEEEe---cccccEEEeecCCCcEEEEE
Confidence 34568888887665566654 3678777766322 12233333333322111 12578999999999999999
Q ss_pred eCCCCce-eEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEE
Q 000944 616 LDPDDCM-QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVV 674 (1213)
Q Consensus 616 l~p~~~l-~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~ 674 (1213)
+ |.... +..+-.. .|... =++. +..-.++.|..||.+..+..
T Consensus 177 i-p~~~~~kv~~Gh~---~~ct~--G~f~-----------pdGKr~~tgy~dgti~~Wn~ 219 (399)
T KOG0296|consen 177 I-PSQALCKVMSGHN---SPCTC--GEFI-----------PDGKRILTGYDDGTIIVWNP 219 (399)
T ss_pred C-CCcceeeEecCCC---CCccc--cccc-----------CCCceEEEEecCceEEEEec
Confidence 9 65322 2222111 11111 1111 12234789999999987754
No 96
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.62 E-value=27 Score=37.36 Aligned_cols=67 Identities=24% Similarity=0.289 Sum_probs=55.0
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc---cCeEEEEeCC-eEEEEecC
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF---QGRLLAGIGP-VLRLYDLG 970 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~---~g~ll~~~g~-~l~i~~~~ 970 (1213)
.++++|.|+..+| ..-.|||+++++... ..+..+.+.++...++.++-- .+-+++|+|. .|++|+..
T Consensus 21 ~nrLavAt~q~yG-----l~G~G~L~ile~~~~-~gi~e~~s~d~~D~LfdV~Wse~~e~~~~~a~GDGSLrl~d~~ 91 (311)
T KOG0277|consen 21 ENRLAVATAQHYG-----LAGNGRLFILEVTDP-KGIQECQSYDTEDGLFDVAWSENHENQVIAASGDGSLRLFDLT 91 (311)
T ss_pred cchhheeehhhcc-----cccCceEEEEecCCC-CCeEEEEeeecccceeEeeecCCCcceEEEEecCceEEEeccC
Confidence 4789999999886 567999999999733 479999999999999988765 3467889984 89999965
No 97
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=82.48 E-value=12 Score=40.98 Aligned_cols=114 Identities=17% Similarity=0.178 Sum_probs=63.7
Q ss_pred EEEEEEecCCCCceEEEE--eCCEEEEEeec--CCCCeEEEEEEEe---eeeeeEeeEE--eeCCCCeeEEEEEeccceE
Q 000944 16 AAINGNFSGTKTPEIVVA--RGKVLELLRPE--NSGRIETLVSTEI---FGAIRSLAQF--RLTGSQKDYIVVGSDSGRI 86 (1213)
Q Consensus 16 ~~v~~~f~~~~~~~LVv~--k~~~Levy~i~--~~g~L~~v~~~~l---~g~I~~i~~~--r~~~~~~d~L~v~~~~~~l 86 (1213)
|.....|. |+...+||+ ++|.|.+|... ++|++-+-...-. |-+..++-.+ ...+. ..+|.+.+-.-
T Consensus 134 hpT~V~Fa-pDc~s~vv~~~~g~~l~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~---~k~imsas~dt 209 (420)
T KOG2096|consen 134 HPTRVVFA-PDCKSVVVSVKRGNKLCVYKLVKKTDGSGSHHFVHIDNLEFERKHQVDIINIGIAGN---AKYIMSASLDT 209 (420)
T ss_pred CceEEEEC-CCcceEEEEEccCCEEEEEEeeecccCCCCcccccccccccchhcccceEEEeecCC---ceEEEEecCCC
Confidence 44444664 566666654 69999999886 4554433222222 2222332222 23322 23343433333
Q ss_pred EEEEEeCCCCcEeEEeeeeccccCcccccCCceEEECCCCCEEEEEecccceEEEE
Q 000944 87 VILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYV 142 (1213)
Q Consensus 87 ~il~~d~~~~~~~tis~~~~~~~g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~ 142 (1213)
-|+-|+-+.+-|-+|..... .....+|.|+||++|++.+.--..|+.
T Consensus 210 ~i~lw~lkGq~L~~idtnq~---------~n~~aavSP~GRFia~~gFTpDVkVwE 256 (420)
T KOG2096|consen 210 KICLWDLKGQLLQSIDTNQS---------SNYDAAVSPDGRFIAVSGFTPDVKVWE 256 (420)
T ss_pred cEEEEecCCceeeeeccccc---------cccceeeCCCCcEEEEecCCCCceEEE
Confidence 44557766444544432211 134679999999999999987777774
No 98
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=82.30 E-value=95 Score=37.56 Aligned_cols=112 Identities=19% Similarity=0.225 Sum_probs=65.3
Q ss_pred EEEEEec-CC-EEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCC
Q 000944 542 IVKVGSN-RL-QVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPD 619 (1213)
Q Consensus 542 I~~as~~-~~-~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~ 619 (1213)
|...+.. ++ +++=+..|+++..+.+...+.....- ..-...|+|+++.+ .+..++.|.+|++++||+++..
T Consensus 206 v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l-~gH~~~v~~~~f~p------~g~~i~Sgs~D~tvriWd~~~~ 278 (456)
T KOG0266|consen 206 VSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTL-KGHSTYVTSVAFSP------DGNLLVSGSDDGTVRIWDVRTG 278 (456)
T ss_pred eeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEe-cCCCCceEEEEecC------CCCEEEEecCCCcEEEEeccCC
Confidence 5555544 22 44434456788888773333322211 12345689999975 2478999999999999999533
Q ss_pred CceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 620 DCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 620 ~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
.+...+... ..+.+ .+.+. ....+|..+..||.+..|....
T Consensus 279 ~~~~~l~~h---s~~is--~~~f~-----------~d~~~l~s~s~d~~i~vwd~~~ 319 (456)
T KOG0266|consen 279 ECVRKLKGH---SDGIS--GLAFS-----------PDGNLLVSASYDGTIRVWDLET 319 (456)
T ss_pred eEEEeeecc---CCceE--EEEEC-----------CCCCEEEEcCCCccEEEEECCC
Confidence 333322211 11222 22332 2456788888899998887653
No 99
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=81.97 E-value=23 Score=37.62 Aligned_cols=131 Identities=15% Similarity=0.158 Sum_probs=83.2
Q ss_pred cEEEEEEEEeCCceEEEEEEEeecCcceEecccc-C-eEEEEeCCeEEEEecCCceeeceeeecCccceEE--EEEEeCC
Q 000944 920 GYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ-G-RLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIV--SINTYRD 995 (1213)
Q Consensus 920 Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~-g-~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~--~l~~~~~ 995 (1213)
|.+-+|+... -+.+.+-+++.+|+++.... | .|..|-|..|..|+...=.+++ +++ +|..|. ||...++
T Consensus 165 ~tVRLWD~rT----gt~v~sL~~~s~VtSlEvs~dG~ilTia~gssV~Fwdaksf~~lK--s~k-~P~nV~SASL~P~k~ 237 (334)
T KOG0278|consen 165 KTVRLWDHRT----GTEVQSLEFNSPVTSLEVSQDGRILTIAYGSSVKFWDAKSFGLLK--SYK-MPCNVESASLHPKKE 237 (334)
T ss_pred CceEEEEecc----CcEEEEEecCCCCcceeeccCCCEEEEecCceeEEecccccccee--ecc-CccccccccccCCCc
Confidence 3455777653 45677778999999987653 4 4566999999999886544443 344 455554 5667788
Q ss_pred EEEEeecCCcEEEEEEeccCCeEEE-eeccCCCcceEEEEee-cCCeeeeecCCCcEEEEecCCCCC
Q 000944 996 RIYVGDIQESFHFCKYRRDENQLYI-FADDSVPRWLTAAHHI-DFDTMAGADKFGNIYFVRLPQDVS 1060 (1213)
Q Consensus 996 ~I~vgD~~~Sv~~l~~~~~~~~l~~-~a~D~~~~~~~~~~~l-d~~~~l~~D~~gnl~il~~~~~~~ 1060 (1213)
+.++|--+-=+.-+.|++++ .+.. ....+.| |-++.|- |.+.+..+..+|.|.+.+..+...
T Consensus 238 ~fVaGged~~~~kfDy~Tge-Ei~~~nkgh~gp--VhcVrFSPdGE~yAsGSEDGTirlWQt~~~~~ 301 (334)
T KOG0278|consen 238 FFVAGGEDFKVYKFDYNTGE-EIGSYNKGHFGP--VHCVRFSPDGELYASGSEDGTIRLWQTTPGKT 301 (334)
T ss_pred eEEecCcceEEEEEeccCCc-eeeecccCCCCc--eEEEEECCCCceeeccCCCceEEEEEecCCCc
Confidence 88888554333344444332 2222 1222333 4455564 545778888999999999988654
No 100
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=81.45 E-value=19 Score=42.05 Aligned_cols=176 Identities=14% Similarity=0.225 Sum_probs=92.7
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeec---CcceEecccc--CeEEE-EeC-CeEEEEecCCc-----eeeceeeecCccce
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVE---GIPLALCQFQ--GRLLA-GIG-PVLRLYDLGKK-----RLLRKCENKLFPNT 986 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~---g~V~ai~~~~--g~ll~-~~g-~~l~i~~~~~~-----~l~~~~~~~~~~~~ 986 (1213)
-|.+-+|++.+....++++..+... -+|++ |.++ |.++| |++ ..|.+|+.... -.++.|... +.-
T Consensus 290 DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~ts-C~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~--g~~ 366 (641)
T KOG0772|consen 290 DGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTS-CAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLP--GQD 366 (641)
T ss_pred CCcEEEEecCCchhheeEEeeccCCCcccCcee-eecCCCcchhhhcccCCceeeeecCCcccccceEeeeccCC--CCc
Confidence 3667789988764345555444433 34444 5553 55544 554 68999997532 234555443 345
Q ss_pred EEEEE--EeCCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCe-eee------ecCCCcEEEEecCC
Q 000944 987 IVSIN--TYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDT-MAG------ADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 987 i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~-~l~------~D~~gnl~il~~~~ 1057 (1213)
|++|. ..+|+++---.-.++-++..+.-..-|..-..=..+.-.+.|.|--++. |+. ++..|+|++|+.
T Consensus 367 Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t~~~~tdc~FSPd~kli~TGtS~~~~~~~g~L~f~d~-- 444 (641)
T KOG0772|consen 367 ITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPTPFPGTDCCFSPDDKLILTGTSAPNGMTAGTLFFFDR-- 444 (641)
T ss_pred eeEEEeccccchhhhccCCCceeeeeccccccchhhhcCCCccCCCCccccCCCceEEEecccccCCCCCceEEEEec--
Confidence 66665 4567776555555666765543223332222222333455555543333 332 345677777653
Q ss_pred CCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceEEEEe
Q 000944 1058 DVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLA 1128 (1213)
Q Consensus 1058 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l~~ 1128 (1213)
..|+.+....+. -.+..+. ++-..-++|+.|+.+|.+.++..
T Consensus 445 --------------------------~t~d~v~ki~i~--~aSvv~~-~WhpkLNQi~~gsgdG~~~vyYd 486 (641)
T KOG0772|consen 445 --------------------------MTLDTVYKIDIS--TASVVRC-LWHPKLNQIFAGSGDGTAHVYYD 486 (641)
T ss_pred --------------------------cceeeEEEecCC--CceEEEE-eecchhhheeeecCCCceEEEEC
Confidence 233333333332 1122222 11233468899999999987753
No 101
>PTZ00420 coronin; Provisional
Probab=81.27 E-value=1.5e+02 Score=36.78 Aligned_cols=132 Identities=14% Similarity=0.151 Sum_probs=71.9
Q ss_pred cEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCeEEEEe--CCeEEEEecCCceeeceeeec-C-ccceEEEE---E
Q 000944 920 GYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGRLLAGI--GPVLRLYDLGKKRLLRKCENK-L-FPNTIVSI---N 991 (1213)
Q Consensus 920 Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~ll~~~--g~~l~i~~~~~~~l~~~~~~~-~-~~~~i~~l---~ 991 (1213)
|.|.+|++... +. +......+.|++++-- +|.++++. +.+|++|++...+.+..-..+ . ....++.+ .
T Consensus 148 gtIrIWDl~tg--~~--~~~i~~~~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs 223 (568)
T PTZ00420 148 SFVNIWDIENE--KR--AFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLG 223 (568)
T ss_pred CeEEEEECCCC--cE--EEEEecCCcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEc
Confidence 67899998764 21 2222345778887643 56666644 468999999876554321111 0 01111121 2
Q ss_pred EeCCEEEEeecC----CcEEEEEEeccCCeEEEeeccCCCcceEEEEeecC--C-eeeeecCCCcEEEEecCC
Q 000944 992 TYRDRIYVGDIQ----ESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF--D-TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 992 ~~~~~I~vgD~~----~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~--~-~~l~~D~~gnl~il~~~~ 1057 (1213)
..+++|+.+-.- +.+.++..+....-+..+.-|..+-.++. +.|. + .++++-.++++++|++..
T Consensus 224 ~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld~~~~~L~p--~~D~~tg~l~lsGkGD~tIr~~e~~~ 294 (568)
T PTZ00420 224 GDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSIDNASAPLIP--HYDESTGLIYLIGKGDGNCRYYQHSL 294 (568)
T ss_pred CCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEecCCccceEE--eeeCCCCCEEEEEECCCeEEEEEccC
Confidence 345677764333 35777665532333444333333222221 2232 2 467788999999999865
No 102
>PF08596 Lgl_C: Lethal giant larvae(Lgl) like, C-terminal; InterPro: IPR013905 The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein.; PDB: 2OAJ_A.
Probab=80.43 E-value=52 Score=38.76 Aligned_cols=107 Identities=16% Similarity=0.167 Sum_probs=62.9
Q ss_pred EEEEEeCCCCceEEEEEcC-------CCceEEEEEEEEeccCC---CceEEEEEeeecCccCCCCCCcccEEEEEEEEe-
Q 000944 861 CIRVLDPRSANTTCLLELQ-------DNEAAFSICTVNFHDKE---HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVE- 929 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~-------~~E~v~s~~~~~l~~~~---~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~- 929 (1213)
++-++|-..-.++..-.+. ..+.+++++...+.-++ ....++|||. .|.+++|.+..
T Consensus 108 ~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn------------~G~v~~fkIlp~ 175 (395)
T PF08596_consen 108 SLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTN------------SGNVLTFKILPS 175 (395)
T ss_dssp EEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEET------------TSEEEEEEEEE-
T ss_pred cEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeC------------CCCEEEEEEecC
Confidence 7888888766777664443 35677777665443222 2389999996 69999999985
Q ss_pred C--CceEEEEEEE-eecCcceEecccc------------------------CeEEEEeCCeEEEEecCCceeeceee
Q 000944 930 E--GKSLELLHKT-QVEGIPLALCQFQ------------------------GRLLAGIGPVLRLYDLGKKRLLRKCE 979 (1213)
Q Consensus 930 ~--~~kl~~~~~~-~~~g~V~ai~~~~------------------------g~ll~~~g~~l~i~~~~~~~l~~~~~ 979 (1213)
. ++..+..... ..+++|.+|+.++ |+++++.-..++++.+...+...+.+
T Consensus 176 ~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g~~i~g~vVvvSe~~irv~~~~~~k~~~K~~ 252 (395)
T PF08596_consen 176 SNGRFSVQFAGATTNHDSPILSIIPINADTGESALATISAMQGLSKGISIPGYVVVVSESDIRVFKPPKSKGAHKSF 252 (395)
T ss_dssp GGG-EEEEEEEEE--SS----EEEEEETTT--B-B-BHHHHHGGGGT----EEEEEE-SSEEEEE-TT---EEEEE-
T ss_pred CCCceEEEEeeccccCCCceEEEEEEECCCCCcccCchhHhhccccCCCcCcEEEEEcccceEEEeCCCCcccceee
Confidence 2 3455555555 5679999988873 25666777889999998766555544
No 103
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=80.32 E-value=24 Score=38.64 Aligned_cols=207 Identities=16% Similarity=0.159 Sum_probs=104.9
Q ss_pred eCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCc-
Q 000944 557 SGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPE- 635 (1213)
Q Consensus 557 s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~- 635 (1213)
.++++.+|.+.....-....-++-...+-||++.| ...|+++|+....+++|.++.-.++ + ++.|.
T Consensus 192 rD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHP------sGefllvgTdHp~~rlYdv~T~Qcf--v-----sanPd~ 258 (430)
T KOG0640|consen 192 RDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHP------SGEFLLVGTDHPTLRLYDVNTYQCF--V-----SANPDD 258 (430)
T ss_pred CCCeEEEEecccHHHHHHHHHhhccceeeeEeecC------CCceEEEecCCCceeEEeccceeEe--e-----ecCccc
Confidence 46778887665211000001122235678898865 4789999999999999998532222 1 12232
Q ss_pred --eeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeec----CCCC---eEEEEEECCeeE
Q 000944 636 --SLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLG----LRPP---KLFSVVVGGRAA 706 (1213)
Q Consensus 636 --Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG----~~pv---~l~~~~~~~~~~ 706 (1213)
+-.+.+.... ..++.|+ -|..||.+-.|. -. +++-.+.+| ...| .|. .+...
T Consensus 259 qht~ai~~V~Ys--------~t~~lYv-TaSkDG~IklwD--GV----S~rCv~t~~~AH~gsevcSa~Ft----kn~ky 319 (430)
T KOG0640|consen 259 QHTGAITQVRYS--------STGSLYV-TASKDGAIKLWD--GV----SNRCVRTIGNAHGGSEVCSAVFT----KNGKY 319 (430)
T ss_pred ccccceeEEEec--------CCccEEE-EeccCCcEEeec--cc----cHHHHHHHHhhcCCceeeeEEEc----cCCeE
Confidence 1223333221 2345554 788899885543 11 111111111 1111 121 12334
Q ss_pred EEEecCccEE-EE---EeCCeEEEEec---CccccceeeccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCC--
Q 000944 707 MLCLSSRPWL-GY---IHRGRFLLTPL---SYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLR-- 777 (1213)
Q Consensus 707 v~~~g~~p~~-i~---~~~~~~~~~~~---~~~~v~~~~~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~-- 777 (1213)
++..|.-..+ ++ +.+--..|.-. .-+...+-+.|+.. .+-+++.++.+..+|.-+.- +.--.+..++|
T Consensus 320 iLsSG~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNht--EdyVl~pDEas~slcsWdaR-tadr~~l~slgHn 396 (430)
T KOG0640|consen 320 ILSSGKDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHT--EDYVLFPDEASNSLCSWDAR-TADRVALLSLGHN 396 (430)
T ss_pred EeecCCcceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCc--cceEEccccccCceeecccc-chhhhhhcccCCC
Confidence 4444433322 22 22222223222 11233444556553 34556677777778877762 22334556676
Q ss_pred CccceeeecCCCceEEEEEcc
Q 000944 778 YTPRRFVLQPKKKLMVIIETD 798 (1213)
Q Consensus 778 ~tp~~i~y~~~~~~~~v~~~~ 798 (1213)
..||-|.++|....|+-+..+
T Consensus 397 ~a~R~i~HSP~~p~FmTcsdD 417 (430)
T KOG0640|consen 397 GAVRWIVHSPVEPAFMTCSDD 417 (430)
T ss_pred CCceEEEeCCCCCceeeeccc
Confidence 589999999988777665544
No 104
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=80.08 E-value=38 Score=41.76 Aligned_cols=189 Identities=14% Similarity=0.218 Sum_probs=103.7
Q ss_pred EeccCCcccceEEeccCCCCCCcEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEeeeCCCCCCceEEEEEec
Q 000944 398 QVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFN 477 (1213)
Q Consensus 398 ~~~n~gPI~D~~~~~~~~~~~~~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS~~ 477 (1213)
-+-.-|||....+.. +..-|+.||+ ++++|...-.-..... ... |..-=+|-+... .+.-|.+-..-
T Consensus 447 L~GH~GPVyg~sFsP----d~rfLlScSE---D~svRLWsl~t~s~~V-~y~--GH~~PVwdV~F~---P~GyYFatas~ 513 (707)
T KOG0263|consen 447 LYGHSGPVYGCSFSP----DRRFLLSCSE---DSSVRLWSLDTWSCLV-IYK--GHLAPVWDVQFA---PRGYYFATASH 513 (707)
T ss_pred eecCCCceeeeeecc----cccceeeccC---CcceeeeecccceeEE-Eec--CCCcceeeEEec---CCceEEEecCC
Confidence 456678998877654 2335777775 4456554433222221 111 222347888764 33334444344
Q ss_pred CceeEEEeccceeeecCCCccCCCCeEEEEeecCCeEEEEeCCcEEEEeCC--CceeeeeCC----------CCccEEEE
Q 000944 478 NATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIRED--GRINEWRTP----------GKRTIVKV 545 (1213)
Q Consensus 478 ~~T~vl~~~~~~~e~~~~gf~~~~~Tl~a~~~~~~~ivQVT~~~i~l~~~~--~~~~~~~~~----------~~~~I~~a 545 (1213)
++|.-+...+. ..---|+||.+.|-.-++++|++-.+.++. +-+.-|..- ...+|++.
T Consensus 514 D~tArLWs~d~----------~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF~GH~~~V~al 583 (707)
T KOG0263|consen 514 DQTARLWSTDH----------NKPLRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSVRIFTGHKGPVTAL 583 (707)
T ss_pred Cceeeeeeccc----------CCchhhhcccccccceEEECCcccccccCCCCceEEEEEcCCCcEEEEecCCCCceEEE
Confidence 56665554321 111234555555444555666555555432 334444332 22345554
Q ss_pred Ee--cCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 546 GS--NRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 546 s~--~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
+. ++-+++-+-.+|.|.++.+.....+.++..+ ...|.++++.. .+.++++|..|++|++|++.
T Consensus 584 ~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~H--t~ti~SlsFS~------dg~vLasgg~DnsV~lWD~~ 649 (707)
T KOG0263|consen 584 AFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGH--TGTIYSLSFSR------DGNVLASGGADNSVRLWDLT 649 (707)
T ss_pred EEcCCCceEeecccCCcEEEEEcCCCcchhhhhcc--cCceeEEEEec------CCCEEEecCCCCeEEEEEch
Confidence 44 4566666666778888877632222222112 56678888853 46899999999999999884
No 105
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=80.06 E-value=88 Score=33.37 Aligned_cols=187 Identities=14% Similarity=0.250 Sum_probs=94.2
Q ss_pred CCeEEEEeeeCCeEEEEEeecCCCCCcccccCCccccccCCCceeeccCCcccEEEEEEeccCCcccceEEeccCCCCCC
Q 000944 340 SGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAP 419 (1213)
Q Consensus 340 ~~~lFvgS~~gds~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~d~~~n~gPI~D~~~~~~~~~~~~ 419 (1213)
..+||+|+-+||=.++.+.++.....+++ .+..+.-.-..=|||.+|...+
T Consensus 22 ~~~l~agn~~G~iav~sl~sl~s~sa~~~----------------------gk~~iv~eqahdgpiy~~~f~d------- 72 (325)
T KOG0649|consen 22 KQYLFAGNLFGDIAVLSLKSLDSGSAEPP----------------------GKLKIVPEQAHDGPIYYLAFHD------- 72 (325)
T ss_pred ceEEEEecCCCeEEEEEehhhhccccCCC----------------------CCcceeeccccCCCeeeeeeeh-------
Confidence 45799999999999999877542211111 1233444466779999999874
Q ss_pred cEEEEEecCC------------CCeEEEEccCCcceEEEEecCCCCcceEEEeeeCCCCCCceEEEEEecCceeEEEec-
Q 000944 420 QIFTLCGRGP------------RSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIG- 486 (1213)
Q Consensus 420 ~lv~~sG~g~------------~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS~~~~T~vl~~~- 486 (1213)
+++..+|-|. .+.=+..+.-++.+. ...++|.+ ..||..+.. .+.++.. ....++.++
T Consensus 73 ~~Lls~gdG~V~gw~W~E~~es~~~K~lwe~~~P~~~-~~~evPeI-Nam~ldP~e-----nSi~~Ag--GD~~~y~~dl 143 (325)
T KOG0649|consen 73 DFLLSGGDGLVYGWEWNEEEESLATKRLWEVKIPMQV-DAVEVPEI-NAMWLDPSE-----NSILFAG--GDGVIYQVDL 143 (325)
T ss_pred hheeeccCceEEEeeehhhhhhccchhhhhhcCcccc-CcccCCcc-ceeEeccCC-----CcEEEec--CCeEEEEEEe
Confidence 3444444431 222223333333333 36788885 899988652 2333333 333344432
Q ss_pred --cceeeec--CCCcc------CCCCeEEEEeecCCeEEEE----eCCcEEEEeCCCceeeeeCCCCccEEEEEecCCEE
Q 000944 487 --ETVEEVS--DSGFL------DTTPSLAVSLIGDDSLMQV----HPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQV 552 (1213)
Q Consensus 487 --~~~~e~~--~~gf~------~~~~Tl~a~~~~~~~ivQV----T~~~i~l~~~~~~~~~~~~~~~~~I~~as~~~~~v 552 (1213)
+.|...- .+.+. ...+-|+.|.= || -+.| |.+.+..+..-+.-...+|.-|+=|-+.+.+++.+
T Consensus 144 E~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~E-DG-tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWl 221 (325)
T KOG0649|consen 144 EDGRIQREYRGHTDYVHSVVGRNANGQILSGAE-DG-TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWL 221 (325)
T ss_pred cCCEEEEEEcCCcceeeeeeecccCcceeecCC-Cc-cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceE
Confidence 2232220 11111 01122222221 11 1111 33444444432233333444555688888888887
Q ss_pred EEEEeCCEEEEEEEc
Q 000944 553 VIALSGGELIYFEVD 567 (1213)
Q Consensus 553 ~v~~s~~~l~~l~~~ 567 (1213)
+.. .+..+.++.+.
T Consensus 222 vCG-gGp~lslwhLr 235 (325)
T KOG0649|consen 222 VCG-GGPKLSLWHLR 235 (325)
T ss_pred Eec-CCCceeEEecc
Confidence 654 44467776665
No 106
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=80.04 E-value=1.4e+02 Score=35.76 Aligned_cols=217 Identities=14% Similarity=0.145 Sum_probs=104.2
Q ss_pred CCCeEEEEEECCeeEEEEecCccEEEEEeCC-eEEEEecCccccceeeccccCCCCceEEEEe-CCeEEEEEEccCCCee
Q 000944 692 RPPKLFSVVVGGRAAMLCLSSRPWLGYIHRG-RFLLTPLSYETLEYAASFSSDQCVEGVVSVA-GNALRVFTIERLGETF 769 (1213)
Q Consensus 692 ~pv~l~~~~~~~~~~v~~~g~~p~~i~~~~~-~~~~~~~~~~~v~~~~~f~~~~~~~~~i~~~-~~~L~i~~l~~~~~~~ 769 (1213)
.|..+..-+ ..+.|.+||+..+.+|+..+ +-... .+ -.. ..|.. .+.|+..+ .+.+.|. ..+ +.-
T Consensus 34 ~p~~ls~np--ngr~v~V~g~geY~iyt~~~~r~k~~---G~-g~~-~vw~~---~n~yAv~~~~~~I~I~--kn~-~~~ 100 (443)
T PF04053_consen 34 YPQSLSHNP--NGRFVLVCGDGEYEIYTALAWRNKAF---GS-GLS-FVWSS---RNRYAVLESSSTIKIY--KNF-KNE 100 (443)
T ss_dssp --SEEEE-T--TSSEEEEEETTEEEEEETTTTEEEEE---EE--SE-EEE-T---SSEEEEE-TTS-EEEE--ETT-EE-
T ss_pred CCeeEEECC--CCCEEEEEcCCEEEEEEccCCccccc---Cc-eeE-EEEec---CccEEEEECCCeEEEE--EcC-ccc
Confidence 455555422 45688889999999998422 11110 00 001 11222 35566666 4567664 222 122
Q ss_pred EEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccC
Q 000944 770 NETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYG 849 (1213)
Q Consensus 770 ~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 849 (1213)
.+++|++..++.+|.+ -+++.+...
T Consensus 101 ~~k~i~~~~~~~~If~---G~LL~~~~~---------------------------------------------------- 125 (443)
T PF04053_consen 101 VVKSIKLPFSVEKIFG---GNLLGVKSS---------------------------------------------------- 125 (443)
T ss_dssp TT-----SS-EEEEE----SSSEEEEET----------------------------------------------------
T ss_pred cceEEcCCcccceEEc---CcEEEEECC----------------------------------------------------
Confidence 3567888778888876 334443320
Q ss_pred CCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEe
Q 000944 850 YPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVE 929 (1213)
Q Consensus 850 ~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~ 929 (1213)
..+.++|-.+++.+.+++..+ +..|... +..+++++-|.. -+++++.+.
T Consensus 126 ----------~~i~~yDw~~~~~i~~i~v~~------vk~V~Ws--~~g~~val~t~~-------------~i~il~~~~ 174 (443)
T PF04053_consen 126 ----------DFICFYDWETGKLIRRIDVSA------VKYVIWS--DDGELVALVTKD-------------SIYILKYNL 174 (443)
T ss_dssp ----------TEEEEE-TTT--EEEEESS-E-------EEEEE---TTSSEEEEE-S--------------SEEEEEE-H
T ss_pred ----------CCEEEEEhhHcceeeEEecCC------CcEEEEE--CCCCEEEEEeCC-------------eEEEEEecc
Confidence 168999988888999888763 2222222 345788877643 256666554
Q ss_pred C--------C--ceEEEEEEEeecCcceEeccccCeEEEEeCCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEE
Q 000944 930 E--------G--KSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYV 999 (1213)
Q Consensus 930 ~--------~--~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~v 999 (1213)
+ | ..+..+++ +...|.+.+-.++-++.+..+.|.. +-..+--..+.++. +.|++......|.+++
T Consensus 175 ~~~~~~~~~g~e~~f~~~~E--~~~~IkSg~W~~d~fiYtT~~~lkY--l~~Ge~~~i~~ld~-~~yllgy~~~~~~ly~ 249 (443)
T PF04053_consen 175 EAVAAIPEEGVEDAFELIHE--ISERIKSGCWVEDCFIYTTSNHLKY--LVNGETGIIAHLDK-PLYLLGYLPKENRLYL 249 (443)
T ss_dssp HHHHHBTTTB-GGGEEEEEE--E-S--SEEEEETTEEEEE-TTEEEE--EETTEEEEEEE-SS---EEEEEETTTTEEEE
T ss_pred hhcccccccCchhceEEEEE--ecceeEEEEEEcCEEEEEcCCeEEE--EEcCCcceEEEcCC-ceEEEEEEccCCEEEE
Confidence 2 2 15666654 4677777777788777777776654 44344334455553 6666666665677777
Q ss_pred eecCCcEEEEEEe
Q 000944 1000 GDIQESFHFCKYR 1012 (1213)
Q Consensus 1000 gD~~~Sv~~l~~~ 1012 (1213)
-|-...+.-+..+
T Consensus 250 ~Dr~~~v~~~~ld 262 (443)
T PF04053_consen 250 IDRDGNVISYELD 262 (443)
T ss_dssp E-TT--EEEEE--
T ss_pred EECCCCEEEEEEC
Confidence 7766666555544
No 107
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=79.94 E-value=93 Score=35.98 Aligned_cols=111 Identities=10% Similarity=0.138 Sum_probs=64.0
Q ss_pred EEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCce
Q 000944 543 VKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCM 622 (1213)
Q Consensus 543 ~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l 622 (1213)
.++...+.|++-+..++...+-.+.....+..+....-+-+++|.++.+ .+.+++.|+.||.+.||.+.....
T Consensus 309 ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~~fHp------DgLifgtgt~d~~vkiwdlks~~~- 381 (506)
T KOG0289|consen 309 LSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSAAFHP------DGLIFGTGTPDGVVKIWDLKSQTN- 381 (506)
T ss_pred eeeccCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEeeEcC------CceEEeccCCCceEEEEEcCCccc-
Confidence 3344456777766445555444443222344432222334578888854 467888888899999999953211
Q ss_pred eEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 623 QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 623 ~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
++ ..+ +++-.+..+.. ..+..||.++..||.+..|.+.
T Consensus 382 --~a--~Fp--ght~~vk~i~F---------sENGY~Lat~add~~V~lwDLR 419 (506)
T KOG0289|consen 382 --VA--KFP--GHTGPVKAISF---------SENGYWLATAADDGSVKLWDLR 419 (506)
T ss_pred --cc--cCC--CCCCceeEEEe---------ccCceEEEEEecCCeEEEEEeh
Confidence 11 111 22222222221 2367999999999998777774
No 108
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=79.84 E-value=6.7 Score=44.16 Aligned_cols=102 Identities=19% Similarity=0.279 Sum_probs=71.1
Q ss_pred eEEEEEeCCCC-ceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 860 SCIRVLDPRSA-NTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 860 s~i~l~d~~~~-~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
+.++++|+... .++.++.|. |.+.+. +.+. ...++|.+|++ +|-+-.|+.... |+--..
T Consensus 226 hqvR~YDt~~qRRPV~~fd~~--E~~is~--~~l~--p~gn~Iy~gn~------------~g~l~~FD~r~~--kl~g~~ 285 (412)
T KOG3881|consen 226 HQVRLYDTRHQRRPVAQFDFL--ENPISS--TGLT--PSGNFIYTGNT------------KGQLAKFDLRGG--KLLGCG 285 (412)
T ss_pred eeEEEecCcccCcceeEeccc--cCccee--eeec--CCCcEEEEecc------------cchhheecccCc--eeeccc
Confidence 68999999876 467777776 554432 2232 34578888886 566677777653 555454
Q ss_pred EEeecCcceEeccccC-eEEEEeC--CeEEEEecCCceeeceeeec
Q 000944 939 KTQVEGIPLALCQFQG-RLLAGIG--PVLRLYDLGKKRLLRKCENK 981 (1213)
Q Consensus 939 ~~~~~g~V~ai~~~~g-~ll~~~g--~~l~i~~~~~~~l~~~~~~~ 981 (1213)
-..+.|.+.+|.--.+ .+++.+| .-|+||+.+.++|+-++...
T Consensus 286 ~kg~tGsirsih~hp~~~~las~GLDRyvRIhD~ktrkll~kvYvK 331 (412)
T KOG3881|consen 286 LKGITGSIRSIHCHPTHPVLASCGLDRYVRIHDIKTRKLLHKVYVK 331 (412)
T ss_pred cCCccCCcceEEEcCCCceEEeeccceeEEEeecccchhhhhhhhh
Confidence 6778999999976654 7887777 46889999988887666544
No 109
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=79.30 E-value=29 Score=38.42 Aligned_cols=117 Identities=15% Similarity=0.220 Sum_probs=73.7
Q ss_pred CceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccC--eEEE
Q 000944 881 NEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQG--RLLA 958 (1213)
Q Consensus 881 ~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g--~ll~ 958 (1213)
.|+-.-=++|.++...+.+|+|+|.. +|-|+++++.+. ++.-- -.-..+++..|..... .|++
T Consensus 88 ~~Esfytcsw~yd~~~~~p~la~~G~------------~GvIrVid~~~~--~~~~~-~~ghG~sINeik~~p~~~qlvl 152 (385)
T KOG1034|consen 88 HDESFYTCSWSYDSNTGNPFLAAGGY------------LGVIRVIDVVSG--QCSKN-YRGHGGSINEIKFHPDRPQLVL 152 (385)
T ss_pred CCcceEEEEEEecCCCCCeeEEeecc------------eeEEEEEecchh--hhccc-eeccCccchhhhcCCCCCcEEE
Confidence 34434446777776556799998863 789999998875 21111 1123455655554433 3655
Q ss_pred EeC--CeEEEEecCCceeece-eeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEe
Q 000944 959 GIG--PVLRLYDLGKKRLLRK-CENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 959 ~~g--~~l~i~~~~~~~l~~~-~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
+.. +.|++|+++....+-+ +-+..+.--|.|++ ..+++|+-+-+-+|+.+.+.+
T Consensus 153 s~SkD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~ 211 (385)
T KOG1034|consen 153 SASKDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLN 211 (385)
T ss_pred EecCCceEEEEeccCCeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecC
Confidence 443 5899999987764322 11112233466665 468899999999999999886
No 110
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=78.94 E-value=99 Score=33.30 Aligned_cols=55 Identities=16% Similarity=0.261 Sum_probs=39.5
Q ss_pred eCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 557 SGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 557 s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
.+|+|.+++++..+.+.+..+.+.+.-.--++-.+ .....++++.-||+++||.+
T Consensus 36 G~G~L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse-----~~e~~~~~a~GDGSLrl~d~ 90 (311)
T KOG0277|consen 36 GNGRLFILEVTDPKGIQECQSYDTEDGLFDVAWSE-----NHENQVIAASGDGSLRLFDL 90 (311)
T ss_pred cCceEEEEecCCCCCeEEEEeeecccceeEeeecC-----CCcceEEEEecCceEEEecc
Confidence 47899999986444588877776665555555543 23567778888999999997
No 111
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=78.51 E-value=97 Score=32.95 Aligned_cols=171 Identities=9% Similarity=0.050 Sum_probs=103.3
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
+++|++|..+..+.+|.=--+|. +.+... . ++ .-++-|.+ | -.+++|++..+ +.++.. .
T Consensus 40 tvrLWNp~rg~liktYsghG~EV-lD~~~s-~---Dn-skf~s~Gg--------D----k~v~vwDV~TG-kv~Rr~--r 98 (307)
T KOG0316|consen 40 TVRLWNPLRGALIKTYSGHGHEV-LDAALS-S---DN-SKFASCGG--------D----KAVQVWDVNTG-KVDRRF--R 98 (307)
T ss_pred eEEeecccccceeeeecCCCcee-eecccc-c---cc-cccccCCC--------C----ceEEEEEcccC-eeeeec--c
Confidence 79999999888888776544443 222221 1 22 22222221 1 13678888765 111111 1
Q ss_pred eecCcceEeccccCeEEEEeC---CeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCe
Q 000944 941 QVEGIPLALCQFQGRLLAGIG---PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQ 1017 (1213)
Q Consensus 941 ~~~g~V~ai~~~~g~ll~~~g---~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~ 1017 (1213)
-..+.|.++.--...-+++.| .++.+|+-..+...++-.++..---+.+|.+.+-.|+.|-.---+-.+..+
T Consensus 99 gH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~s~ePiQildea~D~V~Si~v~~heIvaGS~DGtvRtydiR----- 173 (307)
T KOG0316|consen 99 GHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSRSFEPIQILDEAKDGVSSIDVAEHEIVAGSVDGTVRTYDIR----- 173 (307)
T ss_pred cccceeeEEEecCcceEEEeccccceeEEEEcccCCCCccchhhhhcCceeEEEecccEEEeeccCCcEEEEEee-----
Confidence 123445554432333344444 489999988777666655553334588999999999999877766555443
Q ss_pred EEEeeccCCCcceEEEEeecCCe-eeeecCCCcEEEEecCC
Q 000944 1018 LYIFADDSVPRWLTAAHHIDFDT-MAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1018 l~~~a~D~~~~~~~~~~~ld~~~-~l~~D~~gnl~il~~~~ 1057 (1213)
.-.+..|+...-++++.|-.++. .+++-.++.+++++...
T Consensus 174 ~G~l~sDy~g~pit~vs~s~d~nc~La~~l~stlrLlDk~t 214 (307)
T KOG0316|consen 174 KGTLSSDYFGHPITSVSFSKDGNCSLASSLDSTLRLLDKET 214 (307)
T ss_pred cceeehhhcCCcceeEEecCCCCEEEEeeccceeeecccch
Confidence 22345688888888888765544 57788888888876543
No 112
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=77.64 E-value=96 Score=32.43 Aligned_cols=55 Identities=24% Similarity=0.242 Sum_probs=37.6
Q ss_pred ceEEECCCCCEEEEEecccceEEEEEecCCCCceeeeccccccccccEEEEeeeecc
Q 000944 118 QYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDC 174 (1213)
Q Consensus 118 ~~l~VDP~~r~ia~~~~~~~~~v~~~~~~~~~~~~~~~p~e~~~~~~~i~~~~fl~~ 174 (1213)
...+-.|+|..+|....++.++++||+.+... .. ..-+|...-.+.|.||+||+.
T Consensus 93 yc~~ws~~geliatgsndk~ik~l~fn~dt~~-~~-g~dle~nmhdgtirdl~fld~ 147 (350)
T KOG0641|consen 93 YCTAWSPCGELIATGSNDKTIKVLPFNADTCN-AT-GHDLEFNMHDGTIRDLAFLDD 147 (350)
T ss_pred EEEEecCccCeEEecCCCceEEEEeccccccc-cc-CcceeeeecCCceeeeEEecC
Confidence 34467899999999999999999999865421 11 001222223457899999983
No 113
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=77.05 E-value=17 Score=39.71 Aligned_cols=239 Identities=15% Similarity=0.196 Sum_probs=135.7
Q ss_pred EEEEEeCCCCceEEEEEcCC-------CceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCC--
Q 000944 861 CIRVLDPRSANTTCLLELQD-------NEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEG-- 931 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~-------~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~-- 931 (1213)
.|++++..+++.-..+..+. +..+.|+. |. ...++++-|. .-|.|.+|+|..+-
T Consensus 236 FiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~---FS--RDsEMlAsGs------------qDGkIKvWri~tG~Cl 298 (508)
T KOG0275|consen 236 FIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCIS---FS--RDSEMLASGS------------QDGKIKVWRIETGQCL 298 (508)
T ss_pred eeeeehhccchhhhhhhhhhhcceeecccceEEEe---ec--ccHHHhhccC------------cCCcEEEEEEecchHH
Confidence 56677766665432222211 23445543 32 2235666554 25899999998751
Q ss_pred ceEEEEEEEeecCcceEeccc--cCeEEEEe-CCeEEEEecCCceeeceeeecCccceEEEEE--EeCCEEEEeecCCcE
Q 000944 932 KSLELLHKTQVEGIPLALCQF--QGRLLAGI-GPVLRLYDLGKKRLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESF 1006 (1213)
Q Consensus 932 ~kl~~~~~~~~~g~V~ai~~~--~g~ll~~~-g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv 1006 (1213)
+++..- ....|+|+.-- +..++.+. .+.+.++.+...++++. +..+.+|+.... ..++.|+-+-.--++
T Consensus 299 RrFdrA----HtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKE--frGHsSyvn~a~ft~dG~~iisaSsDgtv 372 (508)
T KOG0275|consen 299 RRFDRA----HTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKE--FRGHSSYVNEATFTDDGHHIISASSDGTV 372 (508)
T ss_pred HHhhhh----hccCeeEEEEccCcchhhcccccceEEEeccccchhHHH--hcCccccccceEEcCCCCeEEEecCCccE
Confidence 233333 33567776544 34566644 46889999998877654 233466777654 357888888777788
Q ss_pred EEEEEeccC--CeEEEeeccCCCcceEEEEee--cCCeeeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCc
Q 000944 1007 HFCKYRRDE--NQLYIFADDSVPRWLTAAHHI--DFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGA 1082 (1213)
Q Consensus 1007 ~~l~~~~~~--~~l~~~a~D~~~~~~~~~~~l--d~~~~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1082 (1213)
-++.-++.+ .++...+.|+. +.++..+ +.+.++++.+...++++.+..+.-.. .
T Consensus 373 kvW~~KtteC~~Tfk~~~~d~~---vnsv~~~PKnpeh~iVCNrsntv~imn~qGQvVrs----f--------------- 430 (508)
T KOG0275|consen 373 KVWHGKTTECLSTFKPLGTDYP---VNSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVRS----F--------------- 430 (508)
T ss_pred EEecCcchhhhhhccCCCCccc---ceeEEEcCCCCceEEEEcCCCeEEEEeccceEEee----e---------------
Confidence 776544322 45555565543 3344444 33478999999999998876532210 0
Q ss_pred ccceeeeeeeecCceeceEEEeeecCCCccEEEEEecccceEEEEecCChhHHHHHHHHHHHHHhcCCCCcCCCcccccc
Q 000944 1083 PNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRS 1162 (1213)
Q Consensus 1083 ~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~Gsig~l~~l~~~~~~~~L~~lq~~l~~~~~~~~gl~~~~~R~ 1162 (1213)
..+.=.=|+.|++... | ...-+.+.+.||-+|++.-+.. .||+-|..+-+-+-|+.|-..+.
T Consensus 431 -----sSGkREgGdFi~~~lS----p-kGewiYcigED~vlYCF~~~sG--------~LE~tl~VhEkdvIGl~HHPHqN 492 (508)
T KOG0275|consen 431 -----SSGKREGGDFINAILS----P-KGEWIYCIGEDGVLYCFSVLSG--------KLERTLPVHEKDVIGLTHHPHQN 492 (508)
T ss_pred -----ccCCccCCceEEEEec----C-CCcEEEEEccCcEEEEEEeecC--------ceeeeeecccccccccccCcccc
Confidence 0011123566666543 2 2345566677777777766532 23344444445556776655543
No 114
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=76.87 E-value=1.1e+02 Score=32.75 Aligned_cols=170 Identities=18% Similarity=0.120 Sum_probs=92.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.+..+|..+++.+.+++++ ++..... . . . ...++|++. .|+++.++...+ ..+-.....
T Consensus 47 ~l~~~d~~tG~~~W~~~~~-~~~~~~~-~--~---~-~~~v~v~~~------------~~~l~~~d~~tG-~~~W~~~~~ 105 (238)
T PF13360_consen 47 NLYALDAKTGKVLWRFDLP-GPISGAP-V--V---D-GGRVYVGTS------------DGSLYALDAKTG-KVLWSIYLT 105 (238)
T ss_dssp EEEEEETTTSEEEEEEECS-SCGGSGE-E--E---E-TTEEEEEET------------TSEEEEEETTTS-CEEEEEEE-
T ss_pred EEEEEECCCCCEEEEeecc-cccccee-e--e---c-ccccccccc------------eeeeEecccCCc-ceeeeeccc
Confidence 7889999999999999883 3211111 1 1 1 234455552 347777774443 222221221
Q ss_pred e--ecC--cceEeccccCeEEEEe-CCeEEEEecCCceeeceeeecCccc----------eEEEEEEeCCEEEEeecCCc
Q 000944 941 Q--VEG--IPLALCQFQGRLLAGI-GPVLRLYDLGKKRLLRKCENKLFPN----------TIVSINTYRDRIYVGDIQES 1005 (1213)
Q Consensus 941 ~--~~g--~V~ai~~~~g~ll~~~-g~~l~i~~~~~~~l~~~~~~~~~~~----------~i~~l~~~~~~I~vgD~~~S 1005 (1213)
. ..+ ...+....++.++++. +..|+.++..+.+++-..... .+. ....+...++.|++++....
T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~ 184 (238)
T PF13360_consen 106 SSPPAGVRSSSSPAVDGDRLYVGTSSGKLVALDPKTGKLLWKYPVG-EPRGSSPISSFSDINGSPVISDGRVYVSSGDGR 184 (238)
T ss_dssp SSCTCSTB--SEEEEETTEEEEEETCSEEEEEETTTTEEEEEEESS-TT-SS--EEEETTEEEEEECCTTEEEEECCTSS
T ss_pred cccccccccccCceEecCEEEEEeccCcEEEEecCCCcEEEEeecC-CCCCCcceeeecccccceEEECCEEEEEcCCCe
Confidence 1 111 1122222377888877 789999999877654332222 111 11333444678888887776
Q ss_pred EEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecCC
Q 000944 1006 FHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1006 v~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
++.++...++... .++. .........+++.+++++++|++++++...
T Consensus 185 --~~~~d~~tg~~~w-~~~~--~~~~~~~~~~~~~l~~~~~~~~l~~~d~~t 231 (238)
T PF13360_consen 185 --VVAVDLATGEKLW-SKPI--SGIYSLPSVDGGTLYVTSSDGRLYALDLKT 231 (238)
T ss_dssp --EEEEETTTTEEEE-EECS--S-ECECEECCCTEEEEEETTTEEEEEETTT
T ss_pred --EEEEECCCCCEEE-EecC--CCccCCceeeCCEEEEEeCCCEEEEEECCC
Confidence 3444555555333 3332 223332345667777777999999998654
No 115
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=76.87 E-value=66 Score=30.20 Aligned_cols=92 Identities=17% Similarity=0.083 Sum_probs=53.2
Q ss_pred ceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEE
Q 000944 583 DVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNA 662 (1213)
Q Consensus 583 ~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lli 662 (1213)
+|++|++..+.. ....-|+||+.|..|++|.= ++.+..+.. ...+.+++ .+. .....-
T Consensus 1 ~V~al~~~d~d~--dg~~eLlvGs~D~~IRvf~~--~e~~~Ei~e---~~~v~~L~--~~~-------------~~~F~Y 58 (111)
T PF14783_consen 1 NVTALCLFDFDG--DGENELLVGSDDFEIRVFKG--DEIVAEITE---TDKVTSLC--SLG-------------GGRFAY 58 (111)
T ss_pred CeeEEEEEecCC--CCcceEEEecCCcEEEEEeC--CcEEEEEec---ccceEEEE--EcC-------------CCEEEE
Confidence 478888887642 24577999999999999974 322332222 12344433 222 233567
Q ss_pred EeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEEC
Q 000944 663 GLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVG 702 (1213)
Q Consensus 663 gl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~ 702 (1213)
|+.||.+-.|+-.. .+ ....-..+|+.+..+.++
T Consensus 59 ~l~NGTVGvY~~~~---Rl---WRiKSK~~~~~~~~~D~~ 92 (111)
T PF14783_consen 59 ALANGTVGVYDRSQ---RL---WRIKSKNQVTSMAFYDIN 92 (111)
T ss_pred EecCCEEEEEeCcc---ee---eeeccCCCeEEEEEEcCC
Confidence 89999998886321 11 112223456667665554
No 116
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=76.60 E-value=8.3 Score=46.78 Aligned_cols=145 Identities=14% Similarity=0.196 Sum_probs=92.1
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCC-ceEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEG-KSLELLH 938 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~-~kl~~~~ 938 (1213)
+.+-|++-.+++. -.+..+-.+.++|.+++. ....++|.||+ +||+-+|.+.+.. ..+.++.
T Consensus 55 G~lyl~~R~~~~~-~~~~~~~~~~~~~~~~vs----~~e~lvAagt~------------~g~V~v~ql~~~~p~~~~~~t 117 (726)
T KOG3621|consen 55 GSVYLYNRHTGEM-RKLKNEGATGITCVRSVS----SVEYLVAAGTA------------SGRVSVFQLNKELPRDLDYVT 117 (726)
T ss_pred ceEEEEecCchhh-hcccccCccceEEEEEec----chhHhhhhhcC------------CceEEeehhhccCCCcceeec
Confidence 3556666544332 222333356666666654 33467788876 7999999999853 2577776
Q ss_pred EEee--cCcceEeccccC--eEEEEeC-CeEEEEecCCcee--eceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEE
Q 000944 939 KTQV--EGIPLALCQFQG--RLLAGIG-PVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 939 ~~~~--~g~V~ai~~~~g--~ll~~~g-~~l~i~~~~~~~l--~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~ 1011 (1213)
..+. +--|+|++.-.+ ++..|=- .+|..-.++...+ .+.-..-.+.+.|++|+....+++|+++.+++-+
T Consensus 118 ~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s~~~~~~~~q~il~~ds~IVQlD~~q~~LLVStl~r~~Lc--- 194 (726)
T KOG3621|consen 118 PCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDSRQAFLSKSQEILSEDSEIVQLDYLQSYLLVSTLTRCILC--- 194 (726)
T ss_pred cccccCCceEEEEEecccccEEeecCCCceEEEEEechhhhhccccceeeccCcceEEeecccceehHhhhhhhhee---
Confidence 6666 788999988754 3444332 2666666664211 1111111246789999999999999999999754
Q ss_pred eccCCeEEEeecc
Q 000944 1012 RRDENQLYIFADD 1024 (1213)
Q Consensus 1012 ~~~~~~l~~~a~D 1024 (1213)
+.+..++..+++-
T Consensus 195 ~tE~eti~QIG~k 207 (726)
T KOG3621|consen 195 QTEAETITQIGKK 207 (726)
T ss_pred ecchhHHHHhcCC
Confidence 4555667777753
No 117
>PHA02713 hypothetical protein; Provisional
Probab=75.72 E-value=57 Score=40.54 Aligned_cols=151 Identities=14% Similarity=0.076 Sum_probs=90.7
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeC----------------
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIG---------------- 961 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g---------------- 961 (1213)
..++++|... + . .....+..|+...+ +++.+.....+-.-.+++.++|+|.+.-|
T Consensus 352 g~IYviGG~~--~---~--~~~~sve~Ydp~~~--~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~ 422 (557)
T PHA02713 352 DTIYAIGGQN--G---T--NVERTIECYTMGDD--KWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSI 422 (557)
T ss_pred CEEEEECCcC--C---C--CCCceEEEEECCCC--eEEECCCCCcccccccEEEECCEEEEEeCCCcccccccccccccc
Confidence 4688888642 1 1 12234667776655 44544433322222355667888866444
Q ss_pred ---------CeEEEEecCCceeeceeeecCccceEEEEEEeCCEEE-EeecC---CcE-EEEEEeccC-CeEEEeeccCC
Q 000944 962 ---------PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIY-VGDIQ---ESF-HFCKYRRDE-NQLYIFADDSV 1026 (1213)
Q Consensus 962 ---------~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~-vgD~~---~Sv-~~l~~~~~~-~~l~~~a~D~~ 1026 (1213)
+++..|+....++...+... .+-.-.++.+.++.|+ +|... ... .+.+|+++. ++-..++.=+.
T Consensus 423 ~~~~~~~~~~~ve~YDP~td~W~~v~~m~-~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~ 501 (557)
T PHA02713 423 DMEEDTHSSNKVIRYDTVNNIWETLPNFW-TGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTES 501 (557)
T ss_pred cccccccccceEEEECCCCCeEeecCCCC-cccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCc
Confidence 24677888878887666543 1223345667788876 45432 112 367899887 78888887667
Q ss_pred CcceEEEEeecCCeeeeecCCCcEEEEecCCC
Q 000944 1027 PRWLTAAHHIDFDTMAGADKFGNIYFVRLPQD 1058 (1213)
Q Consensus 1027 ~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~~ 1058 (1213)
+|....+..+++.-++++-.+|.-.+-.|++.
T Consensus 502 ~r~~~~~~~~~~~iyv~Gg~~~~~~~e~yd~~ 533 (557)
T PHA02713 502 RLSALHTILHDNTIMMLHCYESYMLQDTFNVY 533 (557)
T ss_pred ccccceeEEECCEEEEEeeecceeehhhcCcc
Confidence 77777777787666666555665455566664
No 118
>PF13360 PQQ_2: PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=75.66 E-value=1.2e+02 Score=32.51 Aligned_cols=170 Identities=16% Similarity=0.150 Sum_probs=91.9
Q ss_pred eEEEEEeCCCCceEEEEEcCCC-ceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDN-EAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~-E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
+.|.-+|+.+++.+.++.+.+. ....+. .+. ...++++++. .|.|+.+++..+ +++.
T Consensus 3 g~l~~~d~~tG~~~W~~~~~~~~~~~~~~---~~~---~~~~v~~~~~------------~~~l~~~d~~tG----~~~W 60 (238)
T PF13360_consen 3 GTLSALDPRTGKELWSYDLGPGIGGPVAT---AVP---DGGRVYVASG------------DGNLYALDAKTG----KVLW 60 (238)
T ss_dssp SEEEEEETTTTEEEEEEECSSSCSSEEET---EEE---ETTEEEEEET------------TSEEEEEETTTS----EEEE
T ss_pred CEEEEEECCCCCEEEEEECCCCCCCccce---EEE---eCCEEEEEcC------------CCEEEEEECCCC----CEEE
Confidence 3788999999999999988542 222211 111 1245666642 577777776543 4455
Q ss_pred EEeecCcceEe-ccccCeEEEEe-CCeEEEEecCCceeecee-ee--cCcc-ceEEEEEEeCCEEEEeecCCcEEEEEEe
Q 000944 939 KTQVEGIPLAL-CQFQGRLLAGI-GPVLRLYDLGKKRLLRKC-EN--KLFP-NTIVSINTYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 939 ~~~~~g~V~ai-~~~~g~ll~~~-g~~l~i~~~~~~~l~~~~-~~--~~~~-~~i~~l~~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
+.+.++++... ...++.++++. ++.|+.++..+.++.-.. .. ...+ .......+.++++++++....+ +.++
T Consensus 61 ~~~~~~~~~~~~~~~~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l--~~~d 138 (238)
T PF13360_consen 61 RFDLPGPISGAPVVDGGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSGKL--VALD 138 (238)
T ss_dssp EEECSSCGGSGEEEETTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCSEE--EEEE
T ss_pred EeeccccccceeeecccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccCcE--EEEe
Confidence 55556665433 45577887766 457777876655443221 11 1011 1123344558899988885554 4456
Q ss_pred ccCCeEEEeeccCCCcce---------EEEEeecCCeeeeecCCCcEEEE
Q 000944 1013 RDENQLYIFADDSVPRWL---------TAAHHIDFDTMAGADKFGNIYFV 1053 (1213)
Q Consensus 1013 ~~~~~l~~~a~D~~~~~~---------~~~~~ld~~~~l~~D~~gnl~il 1053 (1213)
.+.+++..-..-..+... .+...++++.+++++.+|.+..+
T Consensus 139 ~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~~~~~ 188 (238)
T PF13360_consen 139 PKTGKLLWKYPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDGRVVAV 188 (238)
T ss_dssp TTTTEEEEEEESSTT-SS--EEEETTEEEEEECCTTEEEEECCTSSEEEE
T ss_pred cCCCcEEEEeecCCCCCCcceeeecccccceEEECCEEEEEcCCCeEEEE
Confidence 555554222111222211 12234455577778888875444
No 119
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=75.51 E-value=1.4e+02 Score=33.22 Aligned_cols=230 Identities=13% Similarity=0.180 Sum_probs=104.5
Q ss_pred CccEEEEEe--cCCEEEEEEeCCEEEEEEEccCCCeEEe------eeeccCcceEEEEeeecCCCceeeeEEEEEEeCCc
Q 000944 539 KRTIVKVGS--NRLQVVIALSGGELIYFEVDMTGQLLEV------EKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNT 610 (1213)
Q Consensus 539 ~~~I~~as~--~~~~v~v~~s~~~l~~l~~~~~~~l~~~------~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~ 610 (1213)
+..|+..+. ++.+++-+..|+.|.++.++. +... ...+.+ .++.+.+.+. ...+++..-...+
T Consensus 86 ~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~D---F~~~eHr~~R~nve~d-hpT~V~FapD-----c~s~vv~~~~g~~ 156 (420)
T KOG2096|consen 86 KKEVTDVAFSSDGKKLATISGDRSIRLWDVRD---FENKEHRCIRQNVEYD-HPTRVVFAPD-----CKSVVVSVKRGNK 156 (420)
T ss_pred CCceeeeEEcCCCceeEEEeCCceEEEEecch---hhhhhhhHhhccccCC-CceEEEECCC-----cceEEEEEccCCE
Confidence 345665554 456777777788888887762 1111 233444 4667776542 1223333233568
Q ss_pred EEEEEeCC--CCce--eEeEEeecCC-CCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcc-ccc
Q 000944 611 IRILSLDP--DDCM--QILSVQSVSS-PPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQL-SDS 684 (1213)
Q Consensus 611 i~i~sl~p--~~~l--~~~~~~~l~~-~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l-~~~ 684 (1213)
+.+|.+.. |..+ ..+..+.+.. --+++.+..++. .++.-|+.-...|-.++.|.+. |++ ...
T Consensus 157 l~vyk~~K~~dG~~~~~~v~~D~~~f~~kh~v~~i~iGi---------A~~~k~imsas~dt~i~lw~lk---Gq~L~~i 224 (420)
T KOG2096|consen 157 LCVYKLVKKTDGSGSHHFVHIDNLEFERKHQVDIINIGI---------AGNAKYIMSASLDTKICLWDLK---GQLLQSI 224 (420)
T ss_pred EEEEEeeecccCCCCcccccccccccchhcccceEEEee---------cCCceEEEEecCCCcEEEEecC---Cceeeee
Confidence 88998743 1111 1111111111 012333333322 1123344333444444444432 221 111
Q ss_pred ceeeecCCCCeEEEEEECCeeEEEEecCcc----E-EEEEeCCeEE----EEecCc-cccceeeccccCCCCceEEEEe-
Q 000944 685 RSRFLGLRPPKLFSVVVGGRAAMLCLSSRP----W-LGYIHRGRFL----LTPLSY-ETLEYAASFSSDQCVEGVVSVA- 753 (1213)
Q Consensus 685 ~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p----~-~i~~~~~~~~----~~~~~~-~~v~~~~~f~~~~~~~~~i~~~- 753 (1213)
...++...-..+. .+...+.++|=.| | ++|..+|.++ ...+.. +.-.....|++. ..-.+.++
T Consensus 225 dtnq~~n~~aavS----P~GRFia~~gFTpDVkVwE~~f~kdG~fqev~rvf~LkGH~saV~~~aFsn~--S~r~vtvSk 298 (420)
T KOG2096|consen 225 DTNQSSNYDAAVS----PDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVKRVFSLKGHQSAVLAAAFSNS--STRAVTVSK 298 (420)
T ss_pred ccccccccceeeC----CCCcEEEEecCCCCceEEEEEeccCcchhhhhhhheeccchhheeeeeeCCC--cceeEEEec
Confidence 1111111111221 1345677777666 3 2566555432 222221 112222344442 23455555
Q ss_pred CCeEEEEEEcc-C----CCe-eEEEEEeC---CCccceeeecCCCceEEEE
Q 000944 754 GNALRVFTIER-L----GET-FNETALPL---RYTPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 754 ~~~L~i~~l~~-~----~~~-~~~r~i~l---~~tp~~i~y~~~~~~~~v~ 795 (1213)
++..+|-.++- | |.+ +..-.+|+ |..|.|+..+|+.+.+++.
T Consensus 299 DG~wriwdtdVrY~~~qDpk~Lk~g~~pl~aag~~p~RL~lsP~g~~lA~s 349 (420)
T KOG2096|consen 299 DGKWRIWDTDVRYEAGQDPKILKEGSAPLHAAGSEPVRLELSPSGDSLAVS 349 (420)
T ss_pred CCcEEEeeccceEecCCCchHhhcCCcchhhcCCCceEEEeCCCCcEEEee
Confidence 56777766663 1 111 12222454 5678888888877777765
No 120
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=75.30 E-value=1.5e+02 Score=34.19 Aligned_cols=42 Identities=26% Similarity=0.347 Sum_probs=33.2
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeee
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAK 907 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~ 907 (1213)
++.|.|+.+++++.++.+. |.+..++.... ++..|+++=|.+
T Consensus 412 ~lil~D~~s~evvQ~l~~~--epv~Dicp~~~---n~~syLa~LTd~ 453 (463)
T KOG1645|consen 412 ELILQDPHSFEVVQTLALS--EPVLDICPNDT---NGSSYLALLTDD 453 (463)
T ss_pred eeEEeccchhheeeecccC--cceeecceeec---CCcchhhheecc
Confidence 7899999999999888777 88888877655 345788877754
No 121
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=75.24 E-value=6.4 Score=30.66 Aligned_cols=38 Identities=16% Similarity=0.480 Sum_probs=31.2
Q ss_pred EeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 574 EVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 574 ~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
.+..+.+..+|++++..| .-.++|+|+.+|.|.+|.++
T Consensus 4 ~~~~k~l~~~v~~~~w~P------~mdLiA~~t~~g~v~v~Rl~ 41 (47)
T PF12894_consen 4 QLGEKNLPSRVSCMSWCP------TMDLIALGTEDGEVLVYRLN 41 (47)
T ss_pred eecccCCCCcEEEEEECC------CCCEEEEEECCCeEEEEECC
Confidence 345567778899999976 46799999999999999984
No 122
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=75.14 E-value=24 Score=40.17 Aligned_cols=91 Identities=20% Similarity=0.260 Sum_probs=55.3
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.+.++|-.+.++...|.-+.+-......-+.|. ....|++.|.+ .|++|+|++..+ |++.+.+.
T Consensus 364 tl~viDlRt~eI~~~~sA~g~k~asDwtrvvfS--pd~~YvaAGS~------------dgsv~iW~v~tg--KlE~~l~~ 427 (459)
T KOG0288|consen 364 TLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFS--PDGSYVAAGSA------------DGSVYIWSVFTG--KLEKVLSL 427 (459)
T ss_pred ceeeeecccccEEEEeeccccccccccceeEEC--CCCceeeeccC------------CCcEEEEEccCc--eEEEEecc
Confidence 466677777776665543333222222223343 45689999986 689999999987 88888776
Q ss_pred eecC-cceEeccc--cCeEEEEeCC-eEEEE
Q 000944 941 QVEG-IPLALCQF--QGRLLAGIGP-VLRLY 967 (1213)
Q Consensus 941 ~~~g-~V~ai~~~--~g~ll~~~g~-~l~i~ 967 (1213)
.... +++++.-- +.+|++|-.+ .+.+|
T Consensus 428 s~s~~aI~s~~W~~sG~~Llsadk~~~v~lW 458 (459)
T KOG0288|consen 428 STSNAAITSLSWNPSGSGLLSADKQKAVTLW 458 (459)
T ss_pred CCCCcceEEEEEcCCCchhhcccCCcceEec
Confidence 6655 67766432 3445544433 34444
No 123
>PHA03098 kelch-like protein; Provisional
Probab=74.89 E-value=64 Score=39.95 Aligned_cols=134 Identities=10% Similarity=0.054 Sum_probs=75.9
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeC--------CeEEEEec
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIG--------PVLRLYDL 969 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g--------~~l~i~~~ 969 (1213)
..++++|... .......+..|+...+ +++.+.....+..-.+.+.++|+|.+.-| +.+..|++
T Consensus 343 ~~lyv~GG~~-------~~~~~~~v~~yd~~~~--~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~ 413 (534)
T PHA03098 343 NRIYVIGGIY-------NSISLNTVESWKPGES--KWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSL 413 (534)
T ss_pred CEEEEEeCCC-------CCEecceEEEEcCCCC--ceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeC
Confidence 4688888642 1122345666766554 44444332222223455667888877655 35778888
Q ss_pred CCceeeceeeecCccceEEEEEEeCCEEEE-eecCC------cEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCee
Q 000944 970 GKKRLLRKCENKLFPNTIVSINTYRDRIYV-GDIQE------SFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTM 1041 (1213)
Q Consensus 970 ~~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~~------Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~ 1041 (1213)
...++...+... .+....+..+.++.|++ |-... --.+..|+.+.++-..++.-..+++-.++..+++.-+
T Consensus 414 ~t~~W~~~~~~p-~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~~iy 491 (534)
T PHA03098 414 NTNKWSKGSPLP-ISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIY 491 (534)
T ss_pred CCCeeeecCCCC-ccccCceEEEECCEEEEECCccCCCCCcccceEEEecCCCCceeeCCCCCcccccceEEEECCEEE
Confidence 877776654333 12233455566777654 42111 1127778888888777776566676555555554433
No 124
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=74.46 E-value=1.4e+02 Score=32.83 Aligned_cols=280 Identities=13% Similarity=0.055 Sum_probs=145.6
Q ss_pred cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEE
Q 000944 582 GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLN 661 (1213)
Q Consensus 582 ~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Ll 661 (1213)
..|.|+.... .++.++.+..||.+.||..-..... + .++ +|.+-.+.-.- .+...++-
T Consensus 56 ~Ki~~~~ws~------Dsr~ivSaSqDGklIvWDs~TtnK~---h--aip-l~s~WVMtCA~----------sPSg~~VA 113 (343)
T KOG0286|consen 56 NKIYAMDWST------DSRRIVSASQDGKLIVWDSFTTNKV---H--AIP-LPSSWVMTCAY----------SPSGNFVA 113 (343)
T ss_pred cceeeeEecC------CcCeEEeeccCCeEEEEEcccccce---e--EEe-cCceeEEEEEE----------CCCCCeEE
Confidence 4578888854 4678888888999999976222111 1 111 24443332211 22456777
Q ss_pred EEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEE-EecCccEEEEEeCCeEEEEecCc--cccceee
Q 000944 662 AGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAML-CLSSRPWLGYIHRGRFLLTPLSY--ETLEYAA 738 (1213)
Q Consensus 662 igl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~-~~g~~p~~i~~~~~~~~~~~~~~--~~v~~~~ 738 (1213)
||--|.....|.+...+.+-.-...+.+...---+..+.+-....++ ..|+.++.++.-....+...... .+|.++.
T Consensus 114 cGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD~~ilT~SGD~TCalWDie~g~~~~~f~GH~gDV~sls 193 (343)
T KOG0286|consen 114 CGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDDNHILTGSGDMTCALWDIETGQQTQVFHGHTGDVMSLS 193 (343)
T ss_pred ecCcCceeEEEecccccccccceeeeeecCccceeEEEEEcCCCceEecCCCceEEEEEcccceEEEEecCCcccEEEEe
Confidence 88888888888886331111111122232222233333332333333 34777777775333233332221 1233321
Q ss_pred ccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCC-ccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHh
Q 000944 739 SFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRY-TPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEA 817 (1213)
Q Consensus 739 ~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~-tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 817 (1213)
--.+ -++.|+ +++.=.-+.+.+......+++.+-.+ -...|.|+|....|+-.. +
T Consensus 194 l~p~--~~ntFv--Sg~cD~~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGS-D------------------- 249 (343)
T KOG0286|consen 194 LSPS--DGNTFV--SGGCDKSAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGS-D------------------- 249 (343)
T ss_pred cCCC--CCCeEE--ecccccceeeeeccCcceeEeecccccccceEEEccCCCeeeecC-C-------------------
Confidence 1100 123344 33222333445443446677776543 677888888664444321 0
Q ss_pred cCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCC
Q 000944 818 AGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEH 897 (1213)
Q Consensus 818 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~ 897 (1213)
-.+.+|+|-...+.+..|+-+ -.+..+.++-|..++
T Consensus 250 -----------------------------------------D~tcRlyDlRaD~~~a~ys~~--~~~~gitSv~FS~SG- 285 (343)
T KOG0286|consen 250 -----------------------------------------DATCRLYDLRADQELAVYSHD--SIICGITSVAFSKSG- 285 (343)
T ss_pred -----------------------------------------CceeEEEeecCCcEEeeeccC--cccCCceeEEEcccc-
Confidence 025778887776666666522 233345555565333
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeCC---eEEEE
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGP---VLRLY 967 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~---~l~i~ 967 (1213)
.++..|-. -....+|+.-+.. +.-.+. -.++-|+++..-.+...+|.|+ .++||
T Consensus 286 -RlLfagy~------------d~~c~vWDtlk~e-~vg~L~--GHeNRvScl~~s~DG~av~TgSWDs~lriW 342 (343)
T KOG0286|consen 286 -RLLFAGYD------------DFTCNVWDTLKGE-RVGVLA--GHENRVSCLGVSPDGMAVATGSWDSTLRIW 342 (343)
T ss_pred -cEEEeeec------------CCceeEeeccccc-eEEEee--ccCCeeEEEEECCCCcEEEecchhHheeec
Confidence 46666621 1345677765532 222332 4568899988888888888886 56666
No 125
>PF00780 CNH: CNH domain; InterPro: IPR001180 Based on sequence similarities, a domain of homology has been identified in the following proteins: Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1. This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions; GO: 0005083 small GTPase regulator activity
Probab=74.17 E-value=1.5e+02 Score=32.86 Aligned_cols=138 Identities=14% Similarity=0.213 Sum_probs=74.8
Q ss_pred CCceEEEEEecCceeEEEecc---ce-eeecCCCccCCCCeEEEEeecCCeEEEEeCCcEEEEeCC-CceeeeeCCCC--
Q 000944 467 EFDAYIVVSFNNATLVLSIGE---TV-EEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIRED-GRINEWRTPGK-- 539 (1213)
Q Consensus 467 ~~~~~lvlS~~~~T~vl~~~~---~~-~e~~~~gf~~~~~Tl~a~~~~~~~ivQVT~~~i~l~~~~-~~~~~~~~~~~-- 539 (1213)
....+|++.....=.+++... .+ .+.. .|.....-..+..+ ++.++=.+.++..+++.. ....++..+..
T Consensus 103 ~~~~~L~va~kk~i~i~~~~~~~~~f~~~~k--e~~lp~~~~~i~~~-~~~i~v~~~~~f~~idl~~~~~~~l~~~~~~~ 179 (275)
T PF00780_consen 103 EGSRRLCVAVKKKILIYEWNDPRNSFSKLLK--EISLPDPPSSIAFL-GNKICVGTSKGFYLIDLNTGSPSELLDPSDSS 179 (275)
T ss_pred ccceEEEEEECCEEEEEEEECCcccccceeE--EEEcCCCcEEEEEe-CCEEEEEeCCceEEEecCCCCceEEeCccCCc
Confidence 345677777777556677732 23 2222 24444333333333 566777778888888866 33444432222
Q ss_pred ----------ccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCC
Q 000944 540 ----------RTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDN 609 (1213)
Q Consensus 540 ----------~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~ 609 (1213)
.++....+.++-++++. ++ +-+| ++.+|.......++++..+.+++.. ..++++.. ++
T Consensus 180 ~~~~~~~~~~~~~~~~~~~~~e~Ll~~-~~-~g~f-v~~~G~~~r~~~i~W~~~p~~~~~~--------~pyli~~~-~~ 247 (275)
T PF00780_consen 180 SSFKSRNSSSKPLGIFQLSDNEFLLCY-DN-IGVF-VNKNGEPSRKSTIQWSSAPQSVAYS--------SPYLIAFS-SN 247 (275)
T ss_pred chhhhcccCCCceEEEEeCCceEEEEe-cc-eEEE-EcCCCCcCcccEEEcCCchhEEEEE--------CCEEEEEC-CC
Confidence 12444444445555553 32 2222 3445654333456777778777774 34554433 46
Q ss_pred cEEEEEeCCC
Q 000944 610 TIRILSLDPD 619 (1213)
Q Consensus 610 ~i~i~sl~p~ 619 (1213)
.|+|+++...
T Consensus 248 ~iEV~~~~~~ 257 (275)
T PF00780_consen 248 SIEVRSLETG 257 (275)
T ss_pred EEEEEECcCC
Confidence 6999999544
No 126
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase (FBPase) into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of FBPase.
Probab=73.91 E-value=1.4e+02 Score=38.27 Aligned_cols=91 Identities=19% Similarity=0.482 Sum_probs=58.1
Q ss_pred eCCcEEEEeCC--CceeeeeCCCCccEEEEEe-------cCCEEEEEEeCCEEEEEEEccCC-CeEEeeeec--cCcceE
Q 000944 518 HPSGIRHIRED--GRINEWRTPGKRTIVKVGS-------NRLQVVIALSGGELIYFEVDMTG-QLLEVEKHE--MSGDVA 585 (1213)
Q Consensus 518 T~~~i~l~~~~--~~~~~~~~~~~~~I~~as~-------~~~~v~v~~s~~~l~~l~~~~~~-~l~~~~~~~--l~~~is 585 (1213)
.++.+.-++.. +.+.+|+.....+|..-+- .+.+.++.++++.|..+...-+| ++..-..++ -....+
T Consensus 502 ~~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p~~K~aqlt~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs 581 (794)
T PF08553_consen 502 NPNKLYKMDLERGKVVEEWKVHDDIPVVDIAPDSKFAQLTNEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFS 581 (794)
T ss_pred CCCceEEEecCCCcEEEEeecCCCcceeEecccccccccCCCceEEEECCCceEEeccCCCCCceeeccccccccCCCce
Confidence 35666667654 7789999866554544332 35678888888888665433233 222111222 234567
Q ss_pred EEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 586 CLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 586 ~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
|++-. ...++|||..+|.|++|+
T Consensus 582 ~~aTt-------~~G~iavgs~~G~IRLyd 604 (794)
T PF08553_consen 582 CFATT-------EDGYIAVGSNKGDIRLYD 604 (794)
T ss_pred EEEec-------CCceEEEEeCCCcEEeec
Confidence 88764 356899999999999997
No 127
>PTZ00421 coronin; Provisional
Probab=73.52 E-value=2.2e+02 Score=34.70 Aligned_cols=114 Identities=12% Similarity=0.160 Sum_probs=65.2
Q ss_pred cEEEEEec---CCEEEEEEeCCEEEEEEEccCCCe----EEeeeec-cCcceEEEEeeecCCCceeeeEEEEEEeCCcEE
Q 000944 541 TIVKVGSN---RLQVVIALSGGELIYFEVDMTGQL----LEVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIR 612 (1213)
Q Consensus 541 ~I~~as~~---~~~v~v~~s~~~l~~l~~~~~~~l----~~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~ 612 (1213)
.|...+.+ +.+++.+..|+.|.++.+...+.. ..+..+. -...|.++++.+. ...+++.|.+|++|.
T Consensus 77 ~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~-----~~~iLaSgs~DgtVr 151 (493)
T PTZ00421 77 PIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPS-----AMNVLASAGADMVVN 151 (493)
T ss_pred CEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcC-----CCCEEEEEeCCCEEE
Confidence 46666654 235555556788888887643210 0111111 1356888888652 246899999999999
Q ss_pred EEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 613 ILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 613 i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
||+++....+.. +......+.-+.+.. ...+|+.|..||.+-.|.+.
T Consensus 152 IWDl~tg~~~~~-----l~~h~~~V~sla~sp-----------dG~lLatgs~Dg~IrIwD~r 198 (493)
T PTZ00421 152 VWDVERGKAVEV-----IKCHSDQITSLEWNL-----------DGSLLCTTSKDKKLNIIDPR 198 (493)
T ss_pred EEECCCCeEEEE-----EcCCCCceEEEEEEC-----------CCCEEEEecCCCEEEEEECC
Confidence 999954322221 111122222223321 24568889999999877654
No 128
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=73.43 E-value=41 Score=41.88 Aligned_cols=141 Identities=18% Similarity=0.173 Sum_probs=89.3
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeC--------CeEEEEec
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIG--------PVLRLYDL 969 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g--------~~l~i~~~ 969 (1213)
...+|||-..... .-.-+-.|+...+ ++..+........=.+++.++|+|.+.-| .++..|+.
T Consensus 381 g~iYavGG~dg~~-------~l~svE~YDp~~~--~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP 451 (571)
T KOG4441|consen 381 GKLYAVGGFDGEK-------SLNSVECYDPVTN--KWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDP 451 (571)
T ss_pred CEEEEEecccccc-------ccccEEEecCCCC--cccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcC
Confidence 4788888653111 1112333444433 77777766555555778899999988777 46778998
Q ss_pred CCceeeceeeecCccceEEEEEEeCCEEE-EeecCCc---EEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeec
Q 000944 970 GKKRLLRKCENKLFPNTIVSINTYRDRIY-VGDIQES---FHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGAD 1045 (1213)
Q Consensus 970 ~~~~l~~~~~~~~~~~~i~~l~~~~~~I~-vgD~~~S---v~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D 1045 (1213)
..+++...+.... +-.-..+.+.+++|+ +|..... -++-+|++..++-..++--..++....+..++...++++-
T Consensus 452 ~t~~W~~~~~M~~-~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~~rs~~g~~~~~~~ly~vGG 530 (571)
T KOG4441|consen 452 ETNTWTLIAPMNT-RRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTSPRSAVGVVVLGGKLYAVGG 530 (571)
T ss_pred CCCceeecCCccc-ccccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCccccccccEEEECCEEEEEec
Confidence 8888877665542 222334677888876 6653321 1256788888888888777777777666667666554433
Q ss_pred CCC
Q 000944 1046 KFG 1048 (1213)
Q Consensus 1046 ~~g 1048 (1213)
-+|
T Consensus 531 ~~~ 533 (571)
T KOG4441|consen 531 FDG 533 (571)
T ss_pred ccC
Confidence 333
No 129
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=73.14 E-value=2e+02 Score=33.84 Aligned_cols=88 Identities=18% Similarity=0.273 Sum_probs=46.9
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
++-++|..++++...++|...+. +.+ .| . +..+|.. . .+.|+|+||.+..++..-+..+
T Consensus 299 ttilwd~~~g~~~q~f~~~s~~~-lDV-dW-~---~~~~F~t---s----------~td~~i~V~kv~~~~P~~t~~G-- 357 (524)
T KOG0273|consen 299 TTILWDAHTGTVKQQFEFHSAPA-LDV-DW-Q---SNDEFAT---S----------STDGCIHVCKVGEDRPVKTFIG-- 357 (524)
T ss_pred cEEEEeccCceEEEeeeeccCCc-cce-EE-e---cCceEee---c----------CCCceEEEEEecCCCcceeeec--
Confidence 67889998888888777765441 111 12 1 1223322 1 2367888888877532233332
Q ss_pred eecCcceEeccc-cCeEEEEeC--CeEEEEecC
Q 000944 941 QVEGIPLALCQF-QGRLLAGIG--PVLRLYDLG 970 (1213)
Q Consensus 941 ~~~g~V~ai~~~-~g~ll~~~g--~~l~i~~~~ 970 (1213)
..|+|.+|.-- .|.|+++.. .++++|...
T Consensus 358 -H~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~ 389 (524)
T KOG0273|consen 358 -HHGEVNALKWNPTGSLLASCSDDGTLKIWSMG 389 (524)
T ss_pred -ccCceEEEEECCCCceEEEecCCCeeEeeecC
Confidence 44666655432 245554333 255566543
No 130
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=73.12 E-value=83 Score=29.56 Aligned_cols=91 Identities=12% Similarity=0.141 Sum_probs=59.3
Q ss_pred EEEeCCcEEEEeCCCceeeeeCCCCccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCC
Q 000944 515 MQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPE 594 (1213)
Q Consensus 515 vQVT~~~i~l~~~~~~~~~~~~~~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~ 594 (1213)
+==....||++..+..+.+.... +..+..+.+..++++-++.+|++-+++.. ..+=+..-.++++||+...+.
T Consensus 20 vGs~D~~IRvf~~~e~~~Ei~e~-~~v~~L~~~~~~~F~Y~l~NGTVGvY~~~-----~RlWRiKSK~~~~~~~~~D~~- 92 (111)
T PF14783_consen 20 VGSDDFEIRVFKGDEIVAEITET-DKVTSLCSLGGGRFAYALANGTVGVYDRS-----QRLWRIKSKNQVTSMAFYDIN- 92 (111)
T ss_pred EecCCcEEEEEeCCcEEEEEecc-cceEEEEEcCCCEEEEEecCCEEEEEeCc-----ceeeeeccCCCeEEEEEEcCC-
Confidence 33345678899888888887753 44566777778888889999999887532 111122233568888887654
Q ss_pred CceeeeEEEEEEeCCcEEE
Q 000944 595 GRKRSRFLAVGSYDNTIRI 613 (1213)
Q Consensus 595 ~~~~~~~l~v~~~~~~i~i 613 (1213)
.....-+++|..+|.+.+
T Consensus 93 -gdG~~eLI~GwsnGkve~ 110 (111)
T PF14783_consen 93 -GDGVPELIVGWSNGKVEV 110 (111)
T ss_pred -CCCceEEEEEecCCeEEe
Confidence 123445666655888764
No 131
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=72.96 E-value=97 Score=40.63 Aligned_cols=128 Identities=15% Similarity=0.054 Sum_probs=70.8
Q ss_pred eeeeeCCCCccEEEEEec--CCEEEEEEeCCEEEEEEEccC-CC---eEEeeeec--cCcceEEEEeeecCCCceeeeEE
Q 000944 531 INEWRTPGKRTIVKVGSN--RLQVVIALSGGELIYFEVDMT-GQ---LLEVEKHE--MSGDVACLDIASVPEGRKRSRFL 602 (1213)
Q Consensus 531 ~~~~~~~~~~~I~~as~~--~~~v~v~~s~~~l~~l~~~~~-~~---l~~~~~~~--l~~~is~i~i~~~~~~~~~~~~l 602 (1213)
...|.+ .+.++...+.+ +++++++.+||.+.++.++.. +. .....+.. .+..+..|-.... ...+..+
T Consensus 1091 ~ltys~-~~sr~~~vt~~~~~~~~Av~t~DG~v~~~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~---~~~S~~l 1166 (1431)
T KOG1240|consen 1091 ELTYSP-EGSRVEKVTMCGNGDQFAVSTKDGSVRVLRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTA---IVQSHVL 1166 (1431)
T ss_pred eEEEec-cCCceEEEEeccCCCeEEEEcCCCeEEEEEccccccccceeeeeecccccCCCceEEeecccc---cccceeE
Confidence 344543 45556555544 678999999999999999852 21 11112222 2333433332221 1235677
Q ss_pred EEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 603 AVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 603 ~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
+.++..+.+..|.......+-.+.-+.-.+...|+++- ..-.|+++|+..|.++.|.+..
T Consensus 1167 vy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~id--------------p~~~WlviGts~G~l~lWDLRF 1226 (1431)
T KOG1240|consen 1167 VYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVID--------------PWCNWLVIGTSRGQLVLWDLRF 1226 (1431)
T ss_pred EEEEeccceEEecchhhhhHHhhhcCccccceeEEEec--------------CCceEEEEecCCceEEEEEeec
Confidence 77777777777766322111111101111334555442 2456999999999999987754
No 132
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=72.52 E-value=20 Score=43.05 Aligned_cols=91 Identities=20% Similarity=0.230 Sum_probs=60.0
Q ss_pred CCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc--CeEEEEeCCeEEEEecCC--
Q 000944 896 EHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ--GRLLAGIGPVLRLYDLGK-- 971 (1213)
Q Consensus 896 ~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~--g~ll~~~g~~l~i~~~~~-- 971 (1213)
.++..++|=|+..-. +++.+-.++.+.|. .....|-|+|-|.-+ .+|++|+|+.++-|-|++
T Consensus 122 Pk~~iL~VLT~~dvS------------V~~sV~~d~srVka--Di~~~G~IhCACWT~DG~RLVVAvGSsLHSyiWd~~q 187 (671)
T PF15390_consen 122 PKKAILTVLTARDVS------------VLPSVHCDSSRVKA--DIKTSGLIHCACWTKDGQRLVVAVGSSLHSYIWDSAQ 187 (671)
T ss_pred CCCceEEEEecCcee------------EeeeeeeCCceEEE--eccCCceEEEEEecCcCCEEEEEeCCeEEEEEecCch
Confidence 356899999886443 45666666444443 346789999988764 599999999999888874
Q ss_pred ceeeceeeec--CccceEEEEEEe-CCEEEEe
Q 000944 972 KRLLRKCENK--LFPNTIVSINTY-RDRIYVG 1000 (1213)
Q Consensus 972 ~~l~~~~~~~--~~~~~i~~l~~~-~~~I~vg 1000 (1213)
+.|.+-.|.. .+..+|-+|... +..|.|+
T Consensus 188 KtL~~CsfcPVFdv~~~Icsi~AT~dsqVAva 219 (671)
T PF15390_consen 188 KTLHRCSFCPVFDVGGYICSIEATVDSQVAVA 219 (671)
T ss_pred hhhhhCCcceeecCCCceEEEEEeccceEEEE
Confidence 4465443322 345677777653 3344444
No 133
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=72.16 E-value=26 Score=38.22 Aligned_cols=78 Identities=15% Similarity=0.270 Sum_probs=52.7
Q ss_pred CcceEEEEeeecCC-CceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceE
Q 000944 581 SGDVACLDIASVPE-GRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLF 659 (1213)
Q Consensus 581 ~~~is~i~i~~~~~-~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~ 659 (1213)
...|+||+-..... .......+++||.+|.|+|++ |. .+..+.+-.+++.|..+...-.-+ .....
T Consensus 176 ~t~ITcm~tikk~~~d~~a~scLViGTE~~~i~iLd--~~-af~il~~~~lpsvPv~i~~~G~~d----------evdyR 242 (257)
T PF14779_consen 176 QTVITCMATIKKSSADEDAVSCLVIGTESGEIYILD--PQ-AFTILKQVQLPSVPVFISVSGQYD----------EVDYR 242 (257)
T ss_pred CceeEEeeeecccccCCCCcceEEEEecCCeEEEEC--ch-hheeEEEEecCCCceEEEEEeeee----------ccceE
Confidence 45688987643221 122456899999999888775 43 466777778888898765442211 13567
Q ss_pred EEEEeeCCeEEE
Q 000944 660 LNAGLQNGVLFR 671 (1213)
Q Consensus 660 Lligl~~G~l~~ 671 (1213)
+++.+|||.++.
T Consensus 243 I~Va~Rdg~iy~ 254 (257)
T PF14779_consen 243 IVVACRDGKIYT 254 (257)
T ss_pred EEEEeCCCEEEE
Confidence 999999999854
No 134
>PHA02713 hypothetical protein; Provisional
Probab=72.02 E-value=1.2e+02 Score=37.77 Aligned_cols=153 Identities=12% Similarity=0.076 Sum_probs=86.3
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeCC-------eEEEEecC
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGP-------VLRLYDLG 970 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~-------~l~i~~~~ 970 (1213)
...+++|.... .......+..|+...+ +...+.....+-.=.+++.++|+|.+.-|. .+..|++.
T Consensus 304 ~~IYviGG~~~------~~~~~~~v~~Yd~~~n--~W~~~~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~ 375 (557)
T PHA02713 304 NEIIIAGGYNF------NNPSLNKVYKINIENK--IHVELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMG 375 (557)
T ss_pred CEEEEEcCCCC------CCCccceEEEEECCCC--eEeeCCCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECC
Confidence 36788885311 1112345667776654 444444333322234566777877654332 47789988
Q ss_pred CceeeceeeecCccceEEEEEEeCCEEEE-eecCC---------------------cEEEEEEeccCCeEEEeeccCCCc
Q 000944 971 KKRLLRKCENKLFPNTIVSINTYRDRIYV-GDIQE---------------------SFHFCKYRRDENQLYIFADDSVPR 1028 (1213)
Q Consensus 971 ~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~~---------------------Sv~~l~~~~~~~~l~~~a~D~~~~ 1028 (1213)
..++...+... .+..-.+..+.++.|+| |-... .-.+.+|+++.++-..++.=..+|
T Consensus 376 ~~~W~~~~~mp-~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m~~~r 454 (557)
T PHA02713 376 DDKWKMLPDMP-IALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGT 454 (557)
T ss_pred CCeEEECCCCC-cccccccEEEECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCCeEeecCCCCccc
Confidence 87777665543 12233345567787765 43211 234778998888888887655666
Q ss_pred ceEEEEeecCCeeeeecCCC--cE--EEEecCCCC
Q 000944 1029 WLTAAHHIDFDTMAGADKFG--NI--YFVRLPQDV 1059 (1213)
Q Consensus 1029 ~~~~~~~ld~~~~l~~D~~g--nl--~il~~~~~~ 1059 (1213)
.-.++..+++.-++++...+ .. .+..|+|..
T Consensus 455 ~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~ 489 (557)
T PHA02713 455 IRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNT 489 (557)
T ss_pred ccCcEEEECCEEEEEeCCCCCCccceeEEEecCCC
Confidence 55555667655554443221 12 356787753
No 135
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=70.70 E-value=34 Score=38.40 Aligned_cols=113 Identities=13% Similarity=0.182 Sum_probs=66.7
Q ss_pred EEEEEeecCcceEeccc-cCeEEEEeCCeEEEEecCCceeeceeeec-CccceEEEEE--EeCCEEE-EeecCCcEEEEE
Q 000944 936 LLHKTQVEGIPLALCQF-QGRLLAGIGPVLRLYDLGKKRLLRKCENK-LFPNTIVSIN--TYRDRIY-VGDIQESFHFCK 1010 (1213)
Q Consensus 936 ~~~~~~~~g~V~ai~~~-~g~ll~~~g~~l~i~~~~~~~l~~~~~~~-~~~~~i~~l~--~~~~~I~-vgD~~~Sv~~l~ 1010 (1213)
.++...-++.++.|... .+...++.|++|.||+..... ++..+. ..+ .+.+++ .....|+ .+-..+|+.++.
T Consensus 140 p~~tilg~s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~--Pv~smswG~D-ti~svkfNpvETsILas~~sDrsIvLyD 216 (433)
T KOG0268|consen 140 PLHTILGKSVYLGIDHHRKNSVFATCGEQIDIWDEQRDN--PVSSMSWGAD-SISSVKFNPVETSILASCASDRSIVLYD 216 (433)
T ss_pred cceeeeccccccccccccccccccccCceeeecccccCC--ccceeecCCC-ceeEEecCCCcchheeeeccCCceEEEe
Confidence 45566667777777665 357889999999999986432 222222 222 244444 3444455 445777999977
Q ss_pred EeccC--CeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecC
Q 000944 1011 YRRDE--NQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1011 ~~~~~--~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~ 1056 (1213)
.+... .+++.--|-..-.|- . ....|++++.+.||+.|++-
T Consensus 217 ~R~~~Pl~KVi~~mRTN~Iswn----P-eafnF~~a~ED~nlY~~DmR 259 (433)
T KOG0268|consen 217 LRQASPLKKVILTMRTNTICWN----P-EAFNFVAANEDHNLYTYDMR 259 (433)
T ss_pred cccCCccceeeeeccccceecC----c-cccceeeccccccceehhhh
Confidence 65432 333333333222221 1 11257889999999998853
No 136
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=70.45 E-value=1.3e+02 Score=31.43 Aligned_cols=127 Identities=13% Similarity=0.113 Sum_probs=67.1
Q ss_pred eEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecc--ccCeEEEEeCC---eEEEEecCCce
Q 000944 899 TLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQ--FQGRLLAGIGP---VLRLYDLGKKR 973 (1213)
Q Consensus 899 ~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~--~~g~ll~~~g~---~l~i~~~~~~~ 973 (1213)
++++|=+..... ......-|..-+|.++..+...+.+. .+-.|+|.++.- -+.++++..|. ++.+|+...+.
T Consensus 18 ~~l~~~~~~~~~--~~~ks~~~~~~l~~~~~~~~~~~~i~-l~~~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~ 94 (194)
T PF08662_consen 18 DYLLVKVQTRVD--KSGKSYYGEFELFYLNEKNIPVESIE-LKKEGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKK 94 (194)
T ss_pred CEEEEEEEEeec--cCcceEEeeEEEEEEecCCCccceee-ccCCCceEEEEECcCCCEEEEEEccCCcccEEEcCcccE
Confidence 455555552211 12333446667777765433333332 223578888854 35677666654 79999996333
Q ss_pred eeceeeecCccceEEEEEEeCCEEEEeecCCc---EEEEEEeccCCeEEEeeccCCCcceEEEEee
Q 000944 974 LLRKCENKLFPNTIVSINTYRDRIYVGDIQES---FHFCKYRRDENQLYIFADDSVPRWLTAAHHI 1036 (1213)
Q Consensus 974 l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~S---v~~l~~~~~~~~l~~~a~D~~~~~~~~~~~l 1036 (1213)
+ ..+..-+...+.-+..++++++|..-.. +.| |+.. +...++...++ .++.++.-
T Consensus 95 i---~~~~~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~--wd~~--~~~~i~~~~~~-~~t~~~Ws 152 (194)
T PF08662_consen 95 I---FSFGTQPRNTISWSPDGRFLVLAGFGNLNGDLEF--WDVR--KKKKISTFEHS-DATDVEWS 152 (194)
T ss_pred e---EeecCCCceEEEECCCCCEEEEEEccCCCcEEEE--EECC--CCEEeeccccC-cEEEEEEc
Confidence 2 2233222233445567999999876543 666 4433 33444443333 24555543
No 137
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=70.19 E-value=79 Score=37.91 Aligned_cols=91 Identities=21% Similarity=0.187 Sum_probs=56.5
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK 939 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~ 939 (1213)
.++-|.+..++++..-.+|. .|.++|+.- . ....+|+|||. .|.+.+|++.+.. +++.+..
T Consensus 197 ~~vylW~~~s~~v~~l~~~~-~~~vtSv~w---s--~~G~~LavG~~------------~g~v~iwD~~~~k-~~~~~~~ 257 (484)
T KOG0305|consen 197 QSVYLWSASSGSVTELCSFG-EELVTSVKW---S--PDGSHLAVGTS------------DGTVQIWDVKEQK-KTRTLRG 257 (484)
T ss_pred ceEEEEecCCCceEEeEecC-CCceEEEEE---C--CCCCEEEEeec------------CCeEEEEehhhcc-ccccccC
Confidence 37889999888877777777 677777643 2 34589999997 6899999987641 1221111
Q ss_pred EeecCcceEeccccCeEEEEeCC-eEEEEecC
Q 000944 940 TQVEGIPLALCQFQGRLLAGIGP-VLRLYDLG 970 (1213)
Q Consensus 940 ~~~~g~V~ai~~~~g~ll~~~g~-~l~i~~~~ 970 (1213)
-..+-|.+++--..-+..+.+. +|..+++.
T Consensus 258 -~h~~rvg~laW~~~~lssGsr~~~I~~~dvR 288 (484)
T KOG0305|consen 258 -SHASRVGSLAWNSSVLSSGSRDGKILNHDVR 288 (484)
T ss_pred -CcCceeEEEeccCceEEEecCCCcEEEEEEe
Confidence 0345555555443334444443 56666665
No 138
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=70.16 E-value=1.5e+02 Score=31.84 Aligned_cols=146 Identities=12% Similarity=0.125 Sum_probs=89.5
Q ss_pred CCceEEEEEeeecCccCCCCCCcccEEEEEEEEeC-------CceEEEEEEEeecCcceEeccccCeEEEEeCCeEEEEe
Q 000944 896 EHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEE-------GKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD 968 (1213)
Q Consensus 896 ~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~-------~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~~l~i~~ 968 (1213)
+.++|+++|..+ |-|-++.++.- .-|++.+.....+||+|.+...+.+|+.|--..|+=|.
T Consensus 20 p~~~~l~agn~~------------G~iav~sl~sl~s~sa~~~gk~~iv~eqahdgpiy~~~f~d~~Lls~gdG~V~gw~ 87 (325)
T KOG0649|consen 20 PSKQYLFAGNLF------------GDIAVLSLKSLDSGSAEPPGKLKIVPEQAHDGPIYYLAFHDDFLLSGGDGLVYGWE 87 (325)
T ss_pred CcceEEEEecCC------------CeEEEEEehhhhccccCCCCCcceeeccccCCCeeeeeeehhheeeccCceEEEee
Confidence 456889988763 67888888761 12788888888899999999999999988878899898
Q ss_pred cCC-------ceee-ceeeec--Cccc-eEEEE--EEeC-CEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEE
Q 000944 969 LGK-------KRLL-RKCENK--LFPN-TIVSI--NTYR-DRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAH 1034 (1213)
Q Consensus 969 ~~~-------~~l~-~~~~~~--~~~~-~i~~l--~~~~-~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~ 1034 (1213)
|.+ ++|- .+..++ ..+. -|.+| .... ..+++| -++ .++.++-+.+++...-|-.. -++-++.
T Consensus 88 W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~Ag--GD~-~~y~~dlE~G~i~r~~rGHt-DYvH~vv 163 (325)
T KOG0649|consen 88 WNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAG--GDG-VIYQVDLEDGRIQREYRGHT-DYVHSVV 163 (325)
T ss_pred ehhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEec--CCe-EEEEEEecCCEEEEEEcCCc-ceeeeee
Confidence 863 1111 111111 1111 23344 3333 445554 333 36777777777766654321 1122211
Q ss_pred e-ecCCeeeeecCCCcEEEEecCC
Q 000944 1035 H-IDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1035 ~-ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
. =..+.++.+-.+|.+++.+.-.
T Consensus 164 ~R~~~~qilsG~EDGtvRvWd~kt 187 (325)
T KOG0649|consen 164 GRNANGQILSGAEDGTVRVWDTKT 187 (325)
T ss_pred ecccCcceeecCCCccEEEEeccc
Confidence 1 1123688888899998887543
No 139
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=69.98 E-value=1.2e+02 Score=35.68 Aligned_cols=51 Identities=14% Similarity=0.269 Sum_probs=37.8
Q ss_pred CceEE-EEe-CCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEEEc
Q 000944 746 VEGVV-SVA-GNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIET 797 (1213)
Q Consensus 746 ~~~~i-~~~-~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~ 797 (1213)
.+.+| .++ ++.|.|-.+.. +++-..-+++-|...|.+-|++..+.+.+..+
T Consensus 132 ~DeyiAsvs~gGdiiih~~~t-~~~tt~f~~~sgqsvRll~ys~skr~lL~~as 184 (673)
T KOG4378|consen 132 TDEYIASVSDGGDIIIHGTKT-KQKTTTFTIDSGQSVRLLRYSPSKRFLLSIAS 184 (673)
T ss_pred CcceeEEeccCCcEEEEeccc-CccccceecCCCCeEEEeecccccceeeEeec
Confidence 34444 444 67888877776 46777888888899999999998887776653
No 140
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=69.43 E-value=2.3e+02 Score=33.09 Aligned_cols=168 Identities=14% Similarity=0.054 Sum_probs=83.2
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEE-EEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAF-SICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK 939 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~-s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~ 939 (1213)
.+..+|+++++.+..++........ +-..-.+. + ..+++|+. .|+++.++...+ ++ +.+
T Consensus 156 ~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~~~--~--~~v~~~~~------------~g~v~ald~~tG--~~--~W~ 215 (377)
T TIGR03300 156 RLTALDAATGERLWTYSRVTPALTLRGSASPVIA--D--GGVLVGFA------------GGKLVALDLQTG--QP--LWE 215 (377)
T ss_pred eEEEEEcCCCceeeEEccCCCceeecCCCCCEEE--C--CEEEEECC------------CCEEEEEEccCC--CE--eee
Confidence 6788899999988877765432110 00000011 1 24555553 467777776544 11 111
Q ss_pred EeecCc-----------ce-EeccccCeEEEE-eCCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcE
Q 000944 940 TQVEGI-----------PL-ALCQFQGRLLAG-IGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESF 1006 (1213)
Q Consensus 940 ~~~~g~-----------V~-ai~~~~g~ll~~-~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv 1006 (1213)
.....+ +. +....+++++++ .++.++.++....+..-.... +. .......++.|++++....+
T Consensus 216 ~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~~g~l~a~d~~tG~~~W~~~~---~~-~~~p~~~~~~vyv~~~~G~l 291 (377)
T TIGR03300 216 QRVALPKGRTELERLVDVDGDPVVDGGQVYAVSYQGRVAALDLRSGRVLWKRDA---SS-YQGPAVDDNRLYVTDADGVV 291 (377)
T ss_pred eccccCCCCCchhhhhccCCccEEECCEEEEEEcCCEEEEEECCCCcEEEeecc---CC-ccCceEeCCEEEEECCCCeE
Confidence 111111 00 011125666654 456888888875543221111 11 12334568999998866565
Q ss_pred EEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEec
Q 000944 1007 HFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRL 1055 (1213)
Q Consensus 1007 ~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~ 1055 (1213)
..+.. ...++.--..+.... ..+.-.+..+.+++.+.+|.+++++.
T Consensus 292 ~~~d~--~tG~~~W~~~~~~~~-~~ssp~i~g~~l~~~~~~G~l~~~d~ 337 (377)
T TIGR03300 292 VALDR--RSGSELWKNDELKYR-QLTAPAVVGGYLVVGDFEGYLHWLSR 337 (377)
T ss_pred EEEEC--CCCcEEEccccccCC-ccccCEEECCEEEEEeCCCEEEEEEC
Confidence 55443 333322111111111 11112345567888899999999864
No 141
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=68.31 E-value=3.1e+02 Score=35.83 Aligned_cols=136 Identities=21% Similarity=0.298 Sum_probs=78.0
Q ss_pred CcEEEEeC-----CCceeeeeCCCC-ccEEEEEecC-CE--EEEEEeCCEEEEEEEccC---CCeEEeeeeccCcceEEE
Q 000944 520 SGIRHIRE-----DGRINEWRTPGK-RTIVKVGSNR-LQ--VVIALSGGELIYFEVDMT---GQLLEVEKHEMSGDVACL 587 (1213)
Q Consensus 520 ~~i~l~~~-----~~~~~~~~~~~~-~~I~~as~~~-~~--v~v~~s~~~l~~l~~~~~---~~l~~~~~~~l~~~is~i 587 (1213)
-++|+++- +..+..|+.-.. .+|++++... ++ ++-+..+|.|.++.+..+ ..+....+.+..+..|||
T Consensus 1231 GsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal 1310 (1387)
T KOG1517|consen 1231 GSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEYGSALTAL 1310 (1387)
T ss_pred CceEEeecccCCccccceeecccCCcccceeEEeecCCCcceeeeccCCeEEEEecccCcccccceeeeccccCccceee
Confidence 35677663 235666764211 2399999874 33 333446889998877642 113333445555668899
Q ss_pred EeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeec--C-CCCceeEEEEeecccCCCCCCCCCCceEEEEEe
Q 000944 588 DIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSV--S-SPPESLLFLEVQASVGGEDGADHPASLFLNAGL 664 (1213)
Q Consensus 588 ~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l--~-~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl 664 (1213)
.+.+ .+..++.|.. +.+.||++..+ .+..+-..+. . ..+...+ ..| ...+..|.+|.
T Consensus 1311 ~VH~------hapiiAsGs~-q~ikIy~~~G~-~l~~~k~n~~F~~q~~gs~sc-L~F-----------HP~~~llAaG~ 1370 (1387)
T KOG1517|consen 1311 TVHE------HAPIIASGSA-QLIKIYSLSGE-QLNIIKYNPGFMGQRIGSVSC-LAF-----------HPHRLLLAAGS 1370 (1387)
T ss_pred eecc------CCCeeeecCc-ceEEEEecChh-hhcccccCcccccCcCCCcce-eee-----------cchhHhhhhcc
Confidence 8853 5778888887 89999999433 3322211111 0 0111111 122 22467788887
Q ss_pred eCCeEEEEEEe
Q 000944 665 QNGVLFRTVVD 675 (1213)
Q Consensus 665 ~~G~l~~~~~~ 675 (1213)
.|-.+-.|.-.
T Consensus 1371 ~Ds~V~iYs~~ 1381 (1387)
T KOG1517|consen 1371 ADSTVSIYSCE 1381 (1387)
T ss_pred CCceEEEeecC
Confidence 77777777654
No 142
>KOG4328 consensus WD40 protein [Function unknown]
Probab=68.14 E-value=2.5e+02 Score=32.96 Aligned_cols=32 Identities=31% Similarity=0.503 Sum_probs=23.0
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
..+|+||.+.+.. ...+.....||+++.-.++
T Consensus 234 s~~Vs~l~F~P~n-----~s~i~ssSyDGtiR~~D~~ 265 (498)
T KOG4328|consen 234 SGPVSGLKFSPAN-----TSQIYSSSYDGTIRLQDFE 265 (498)
T ss_pred CccccceEecCCC-----hhheeeeccCceeeeeeec
Confidence 3568999997643 3345566779999988884
No 143
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=67.45 E-value=1.7e+02 Score=30.73 Aligned_cols=136 Identities=10% Similarity=0.196 Sum_probs=76.8
Q ss_pred EEEEEEccCCCeeEEEEEeCC--CccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCC
Q 000944 757 LRVFTIERLGETFNETALPLR--YTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENG 834 (1213)
Q Consensus 757 L~i~~l~~~~~~~~~r~i~l~--~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 834 (1213)
..+..++.- ..++..+++. +.++.++++|..+.|+|+....
T Consensus 39 ~~l~~~~~~--~~~~~~i~l~~~~~I~~~~WsP~g~~favi~g~~----------------------------------- 81 (194)
T PF08662_consen 39 FELFYLNEK--NIPVESIELKKEGPIHDVAWSPNGNEFAVIYGSM----------------------------------- 81 (194)
T ss_pred EEEEEEecC--CCccceeeccCCCceEEEEECcCCCEEEEEEccC-----------------------------------
Confidence 444455432 2456777774 3589999999999998885210
Q ss_pred CCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCC
Q 000944 835 DDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPK 914 (1213)
Q Consensus 835 ~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e 914 (1213)
| ..+.|+|.+ ++.+.. |+. ..+.++ .|. ...+++++|..-+
T Consensus 82 ----------------~--------~~v~lyd~~-~~~i~~--~~~-~~~n~i-~ws----P~G~~l~~~g~~n------ 122 (194)
T PF08662_consen 82 ----------------P--------AKVTLYDVK-GKKIFS--FGT-QPRNTI-SWS----PDGRFLVLAGFGN------ 122 (194)
T ss_pred ----------------C--------cccEEEcCc-ccEeEe--ecC-CCceEE-EEC----CCCCEEEEEEccC------
Confidence 1 146777765 444433 332 222332 232 3346888876421
Q ss_pred CCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc-cCeE-EEEe-------CCeEEEEecCCceeec
Q 000944 915 RNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF-QGRL-LAGI-------GPVLRLYDLGKKRLLR 976 (1213)
Q Consensus 915 ~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~-~g~l-l~~~-------g~~l~i~~~~~~~l~~ 976 (1213)
..|.+.+|++. +.+.+.+.+..++ +.++-- +|+. +.+. .+.+.||.+..+.|.+
T Consensus 123 ---~~G~l~~wd~~----~~~~i~~~~~~~~-t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~G~~l~~ 185 (194)
T PF08662_consen 123 ---LNGDLEFWDVR----KKKKISTFEHSDA-TDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQGRLLYK 185 (194)
T ss_pred ---CCcEEEEEECC----CCEEeeccccCcE-EEEEEcCCCCEEEEEEeccceeccccEEEEEecCeEeEe
Confidence 24889999987 4555655555443 333322 4544 4442 4567888887665533
No 144
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=67.13 E-value=20 Score=39.21 Aligned_cols=154 Identities=16% Similarity=0.297 Sum_probs=79.5
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.++|+|-+|.++.-+-. ++..+--++..|.... ...+++-|. .-|-|.+|+=..+ |...-...
T Consensus 239 ~~rlYdv~T~Qcfvsan-Pd~qht~ai~~V~Ys~--t~~lYvTaS------------kDG~IklwDGVS~--rCv~t~~~ 301 (430)
T KOG0640|consen 239 TLRLYDVNTYQCFVSAN-PDDQHTGAITQVRYSS--TGSLYVTAS------------KDGAIKLWDGVSN--RCVRTIGN 301 (430)
T ss_pred ceeEEeccceeEeeecC-cccccccceeEEEecC--CccEEEEec------------cCCcEEeeccccH--HHHHHHHh
Confidence 56777777766654333 3333333555566543 335555444 2477888875543 21111112
Q ss_pred eecCcceEeccc--cCeEEEEeCC--eEEEEecCCceeeceeeecC---ccceEEE--EEEeCCEEEEeecCCcEEEEEE
Q 000944 941 QVEGIPLALCQF--QGRLLAGIGP--VLRLYDLGKKRLLRKCENKL---FPNTIVS--INTYRDRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 941 ~~~g~V~ai~~~--~g~ll~~~g~--~l~i~~~~~~~l~~~~~~~~---~~~~i~~--l~~~~~~I~vgD~~~Sv~~l~~ 1011 (1213)
...|+-.|=..| ||+++.+.|. .+++|++...+.+..-.--+ -+-.-+. .+-..+|++.-|-. |..+..|
T Consensus 302 AH~gsevcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNhtEdyVl~pDEa-s~slcsW 380 (430)
T KOG0640|consen 302 AHGGSEVCSAVFTKNGKYILSSGKDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHTEDYVLFPDEA-SNSLCSW 380 (430)
T ss_pred hcCCceeeeEEEccCCeEEeecCCcceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCccceEEccccc-cCceeec
Confidence 223332222223 7888888885 68899998766543211100 0000011 12235788877753 4456667
Q ss_pred ecc---CCeEEEeeccCCCcceEE
Q 000944 1012 RRD---ENQLYIFADDSVPRWLTA 1032 (1213)
Q Consensus 1012 ~~~---~~~l~~~a~D~~~~~~~~ 1032 (1213)
+.- ...|..++....+||.+.
T Consensus 381 daRtadr~~l~slgHn~a~R~i~H 404 (430)
T KOG0640|consen 381 DARTADRVALLSLGHNGAVRWIVH 404 (430)
T ss_pred cccchhhhhhcccCCCCCceEEEe
Confidence 542 234566666777777664
No 145
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=67.10 E-value=2.4e+02 Score=32.53 Aligned_cols=142 Identities=13% Similarity=0.229 Sum_probs=76.4
Q ss_pred EEEEEecCceeEEE-eccce-eeecCCCccCCCCeEEE-EeecCCe-EEEEe-CCcEEEEeCCCceeeeeCCCCccEEEE
Q 000944 471 YIVVSFNNATLVLS-IGETV-EEVSDSGFLDTTPSLAV-SLIGDDS-LMQVH-PSGIRHIREDGRINEWRTPGKRTIVKV 545 (1213)
Q Consensus 471 ~lvlS~~~~T~vl~-~~~~~-~e~~~~gf~~~~~Tl~a-~~~~~~~-ivQVT-~~~i~l~~~~~~~~~~~~~~~~~I~~a 545 (1213)
-+|.+.++.|.+.- +++++ ...+ | ..+ |-++. +...||. ++=|| .+.|++++-..+...=...+..+|+.-
T Consensus 326 ~~V~Gs~dr~i~~wdlDgn~~~~W~--g-vr~-~~v~dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~lise~~~its~ 401 (519)
T KOG0293|consen 326 RFVTGSPDRTIIMWDLDGNILGNWE--G-VRD-PKVHDLAITYDGKYVLLVTVDKKIRLYNREARVDRGLISEEQPITSF 401 (519)
T ss_pred eeEecCCCCcEEEecCCcchhhccc--c-ccc-ceeEEEEEcCCCcEEEEEecccceeeechhhhhhhccccccCceeEE
Confidence 36777777665533 23322 1111 1 111 33333 2233443 44444 467777764421111011234569888
Q ss_pred Eec--CCEEEEEEeCCEEEEEEEccCCCeEEe-eeeccCcce-EEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCc
Q 000944 546 GSN--RLQVVIALSGGELIYFEVDMTGQLLEV-EKHEMSGDV-ACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC 621 (1213)
Q Consensus 546 s~~--~~~v~v~~s~~~l~~l~~~~~~~l~~~-~~~~l~~~i-s~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~ 621 (1213)
++. +.++++-+.+.++.+..+.+.....++ .+++-..-| ||+.- ....+++.|..|+.|+||.......
T Consensus 402 ~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg-------~~~~fiaSGSED~kvyIWhr~sgkl 474 (519)
T KOG0293|consen 402 SISKDGKLALVNLQDQEIHLWDLEENKLVRKYFGHKQGHFIIRSCFGG-------GNDKFIASGSEDSKVYIWHRISGKL 474 (519)
T ss_pred EEcCCCcEEEEEcccCeeEEeecchhhHHHHhhcccccceEEEeccCC-------CCcceEEecCCCceEEEEEccCCce
Confidence 886 457888889999999887743222223 222222223 55543 2468999999999999999744433
Q ss_pred ee
Q 000944 622 MQ 623 (1213)
Q Consensus 622 l~ 623 (1213)
+.
T Consensus 475 l~ 476 (519)
T KOG0293|consen 475 LA 476 (519)
T ss_pred eE
Confidence 33
No 146
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=65.55 E-value=86 Score=37.22 Aligned_cols=78 Identities=17% Similarity=0.276 Sum_probs=55.6
Q ss_pred cCeEEEEeCC--eEEEEecCCceeece-eeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeec-cCCCc
Q 000944 953 QGRLLAGIGP--VLRLYDLGKKRLLRK-CENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFAD-DSVPR 1028 (1213)
Q Consensus 953 ~g~ll~~~g~--~l~i~~~~~~~l~~~-~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~-D~~~~ 1028 (1213)
+|+.+|++++ -|+||+++..+|+.. .+|-. ....++-+..+.||++|---+=|+++.|.+ .+ ++|| .-+.-
T Consensus 301 DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFG-GLLCvcWSPDGKyIvtGGEDDLVtVwSf~e--rR--VVARGqGHkS 375 (636)
T KOG2394|consen 301 DGKYLATVSQDGFLRIFDFDTQELLGVMKSYFG-GLLCVCWSPDGKYIVTGGEDDLVTVWSFEE--RR--VVARGQGHKS 375 (636)
T ss_pred CCceEEEEecCceEEEeeccHHHHHHHHHhhcc-ceEEEEEcCCccEEEecCCcceEEEEEecc--ce--EEEecccccc
Confidence 5778888876 689999998888754 22322 345666677899999998888888988753 33 4555 34677
Q ss_pred ceEEEEe
Q 000944 1029 WLTAAHH 1035 (1213)
Q Consensus 1029 ~~~~~~~ 1035 (1213)
||+.+.|
T Consensus 376 WVs~VaF 382 (636)
T KOG2394|consen 376 WVSVVAF 382 (636)
T ss_pred ceeeEee
Confidence 8887766
No 147
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=64.86 E-value=3.2e+02 Score=33.16 Aligned_cols=49 Identities=14% Similarity=0.176 Sum_probs=33.8
Q ss_pred eCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEE
Q 000944 557 SGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRI 613 (1213)
Q Consensus 557 s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i 613 (1213)
.+|++.++++.. ...+.+.+++.+|.|++..+.+ ...+|+|+.... +.|
T Consensus 420 dDGtvriWEi~T---gRcvr~~~~d~~I~~vaw~P~~----~~~vLAvA~~~~-~~i 468 (733)
T KOG0650|consen 420 DDGTVRIWEIAT---GRCVRTVQFDSEIRSVAWNPLS----DLCVLAVAVGEC-VLI 468 (733)
T ss_pred CCCcEEEEEeec---ceEEEEEeecceeEEEEecCCC----CceeEEEEecCc-eEE
Confidence 567888888763 2445678899999999998754 345666666543 443
No 148
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=64.52 E-value=78 Score=33.84 Aligned_cols=112 Identities=22% Similarity=0.181 Sum_probs=69.1
Q ss_pred CCEEEEEEeCCEEEEEEEccCCCeEEeeeec-cCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEE
Q 000944 549 RLQVVIALSGGELIYFEVDMTGQLLEVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSV 627 (1213)
Q Consensus 549 ~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~ 627 (1213)
+..++-+.+|+.+.+|++.++++...+..+. -..+|.-++..+ .+...+|+.+..||.+.||.-. .........
T Consensus 23 gkrlATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wah----Pk~G~iLAScsYDgkVIiWke~-~g~w~k~~e 97 (299)
T KOG1332|consen 23 GKRLATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAH----PKFGTILASCSYDGKVIIWKEE-NGRWTKAYE 97 (299)
T ss_pred cceeeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecc----cccCcEeeEeecCceEEEEecC-CCchhhhhh
Confidence 4556667789999999998777533332222 134566666544 2357899999999999999863 222322111
Q ss_pred -eecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCC
Q 000944 628 -QSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMV 677 (1213)
Q Consensus 628 -~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~ 677 (1213)
...++...|+. +.. .+....|.|+..||.+-.++++..
T Consensus 98 ~~~h~~SVNsV~---wap---------heygl~LacasSDG~vsvl~~~~~ 136 (299)
T KOG1332|consen 98 HAAHSASVNSVA---WAP---------HEYGLLLACASSDGKVSVLTYDSS 136 (299)
T ss_pred hhhhcccceeec---ccc---------cccceEEEEeeCCCcEEEEEEcCC
Confidence 11122233332 222 335788999999999988888754
No 149
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=63.77 E-value=2.9e+02 Score=32.22 Aligned_cols=121 Identities=11% Similarity=0.192 Sum_probs=62.4
Q ss_pred EEEEeCCCceeeeeCCCCccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCc-ceEEEEeeecCCCceeee
Q 000944 522 IRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSG-DVACLDIASVPEGRKRSR 600 (1213)
Q Consensus 522 i~l~~~~~~~~~~~~~~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~-~is~i~i~~~~~~~~~~~ 600 (1213)
++.++...-...|..+.+. ....++.++.+++...+|.++.++.. +|++.- ....+.. ..++..+ ...
T Consensus 253 l~a~d~~tG~~~W~~~~~~-~~~p~~~~~~vyv~~~~G~l~~~d~~-tG~~~W-~~~~~~~~~~ssp~i--------~g~ 321 (377)
T TIGR03300 253 VAALDLRSGRVLWKRDASS-YQGPAVDDNRLYVTDADGVVVALDRR-SGSELW-KNDELKYRQLTAPAV--------VGG 321 (377)
T ss_pred EEEEECCCCcEEEeeccCC-ccCceEeCCEEEEECCCCeEEEEECC-CCcEEE-ccccccCCccccCEE--------ECC
Confidence 4444443333446543222 23334567888888778899888764 343211 1112222 1222222 234
Q ss_pred EEEEEEeCCcEEEEEeCCCCceeEeEEeecCCC--CceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEE
Q 000944 601 FLAVGSYDNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRT 672 (1213)
Q Consensus 601 ~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~--p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~ 672 (1213)
.++++..+|.+.+++.+.. +.+...++... ..+..+. ...|+++..||.|+.|
T Consensus 322 ~l~~~~~~G~l~~~d~~tG---~~~~~~~~~~~~~~~sp~~~----------------~~~l~v~~~dG~l~~~ 376 (377)
T TIGR03300 322 YLVVGDFEGYLHWLSREDG---SFVARLKTDGSGIASPPVVV----------------GDGLLVQTRDGDLYAF 376 (377)
T ss_pred EEEEEeCCCEEEEEECCCC---CEEEEEEcCCCccccCCEEE----------------CCEEEEEeCCceEEEe
Confidence 6778888898888876322 22222222211 1122211 1248899999999776
No 150
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=63.00 E-value=95 Score=35.66 Aligned_cols=94 Identities=13% Similarity=0.189 Sum_probs=52.6
Q ss_pred CcEEEEeCC--CceeeeeCCCCccEEEEEecC--CEEEEEEeCCEEEEEEEccCCCeEEe---eeeccCcceEEEEeeec
Q 000944 520 SGIRHIRED--GRINEWRTPGKRTIVKVGSNR--LQVVIALSGGELIYFEVDMTGQLLEV---EKHEMSGDVACLDIASV 592 (1213)
Q Consensus 520 ~~i~l~~~~--~~~~~~~~~~~~~I~~as~~~--~~v~v~~s~~~l~~l~~~~~~~l~~~---~~~~l~~~is~i~i~~~ 592 (1213)
+.||..+.. .+..+. |.|..|+..+..- ..++.+..++.+-++.+...+ +... .-....++.+...+.|
T Consensus 322 kkvRfwD~Rs~~~~~sv--~~gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~e-I~~~~sA~g~k~asDwtrvvfSp- 397 (459)
T KOG0288|consen 322 KKVRFWDIRSADKTRSV--PLGGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKE-IRQTFSAEGFKCASDWTRVVFSP- 397 (459)
T ss_pred cceEEEeccCCceeeEe--ecCcceeeEeeccCCeEEeeecCCCceeeeeccccc-EEEEeeccccccccccceeEECC-
Confidence 346776633 333333 4566788888762 344444445566555433211 1111 1122234455555544
Q ss_pred CCCceeeeEEEEEEeCCcEEEEEeCCCCcee
Q 000944 593 PEGRKRSRFLAVGSYDNTIRILSLDPDDCMQ 623 (1213)
Q Consensus 593 ~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~ 623 (1213)
...+++.|..||+|+||++... .++
T Consensus 398 -----d~~YvaAGS~dgsv~iW~v~tg-KlE 422 (459)
T KOG0288|consen 398 -----DGSYVAAGSADGSVYIWSVFTG-KLE 422 (459)
T ss_pred -----CCceeeeccCCCcEEEEEccCc-eEE
Confidence 4789999999999999999543 443
No 151
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=62.58 E-value=1.7e+02 Score=32.92 Aligned_cols=176 Identities=9% Similarity=0.064 Sum_probs=90.3
Q ss_pred eeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCC
Q 000944 599 SRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVT 678 (1213)
Q Consensus 599 ~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~ 678 (1213)
...++++..+|++++|+......++.. ...|.-+--+.+.. ...-..++.+..||.+-.|.+....
T Consensus 40 e~~vav~lSngsv~lyd~~tg~~l~~f-----k~~~~~~N~vrf~~---------~ds~h~v~s~ssDG~Vr~wD~Rs~~ 105 (376)
T KOG1188|consen 40 ETAVAVSLSNGSVRLYDKGTGQLLEEF-----KGPPATTNGVRFIS---------CDSPHGVISCSSDGTVRLWDIRSQA 105 (376)
T ss_pred ceeEEEEecCCeEEEEeccchhhhhee-----cCCCCcccceEEec---------CCCCCeeEEeccCCeEEEEEeecch
Confidence 467899999999999987432223322 22222221223321 1134457788899999888875431
Q ss_pred CcccccceeeecCCC-CeEEEEEECCeeEEEEecCcc------EEEEEeCCeEE-EEec---CccccceeeccccCCCCc
Q 000944 679 GQLSDSRSRFLGLRP-PKLFSVVVGGRAAMLCLSSRP------WLGYIHRGRFL-LTPL---SYETLEYAASFSSDQCVE 747 (1213)
Q Consensus 679 ~~l~~~~~~~lG~~p-v~l~~~~~~~~~~v~~~g~~p------~~i~~~~~~~~-~~~~---~~~~v~~~~~f~~~~~~~ 747 (1213)
+. ..+.-+..| -.|..+..+....+++||..- .++|.-|..=+ +..+ ..++|+++. |+.. -|+
T Consensus 106 -e~---a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDVT~lr-FHP~-~pn 179 (376)
T KOG1188|consen 106 -ES---ARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHNDDVTQLR-FHPS-DPN 179 (376)
T ss_pred -hh---hheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEeccccchhhhhhhhccCcceeEE-ecCC-CCC
Confidence 11 122233333 445555655567899998432 23444332211 1111 122455543 5543 256
Q ss_pred eEEEEe-CCeEEEEEEccC-CCeeEEEEEeCCCccceeeecCCC--ceEEE
Q 000944 748 GVVSVA-GNALRVFTIERL-GETFNETALPLRYTPRRFVLQPKK--KLMVI 794 (1213)
Q Consensus 748 ~~i~~~-~~~L~i~~l~~~-~~~~~~r~i~l~~tp~~i~y~~~~--~~~~v 794 (1213)
-++..+ ++-+.+..+..- +.---...|..+.+.+++.++-.. +.+++
T Consensus 180 lLlSGSvDGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw~~~~ykrI~cl 230 (376)
T KOG1188|consen 180 LLLSGSVDGLVNLFDTKKDNEEDALLHVINHGSSIHLIGWLSKKYKRIMCL 230 (376)
T ss_pred eEEeecccceEEeeecCCCcchhhHHHhhcccceeeeeeeecCCcceEEEE
Confidence 666555 333444444431 011123456677778888887544 54444
No 152
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=62.28 E-value=1.4e+02 Score=32.27 Aligned_cols=126 Identities=16% Similarity=0.163 Sum_probs=84.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
.++++|..+.+.....+-.-++.. ..|+- ..+|+++|-.. -+|-.+++. +.+.+..+
T Consensus 88 ~ir~wd~r~~k~~~~i~~~~eni~---i~wsp----~g~~~~~~~kd------------D~it~id~r----~~~~~~~~ 144 (313)
T KOG1407|consen 88 TIRIWDIRSGKCTARIETKGENIN---ITWSP----DGEYIAVGNKD------------DRITFIDAR----TYKIVNEE 144 (313)
T ss_pred eEEEEEeccCcEEEEeeccCcceE---EEEcC----CCCEEEEecCc------------ccEEEEEec----ccceeehh
Confidence 789999999988888876654432 23432 34788888642 245555554 56666777
Q ss_pred eecCcceEeccc--cCeEEEEeC-CeEEEEecCCceeeceeeecCccceEEEE--EEeCCEEEEeecCCcEEEEEE
Q 000944 941 QVEGIPLALCQF--QGRLLAGIG-PVLRLYDLGKKRLLRKCENKLFPNTIVSI--NTYRDRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 941 ~~~g~V~ai~~~--~g~ll~~~g-~~l~i~~~~~~~l~~~~~~~~~~~~i~~l--~~~~~~I~vgD~~~Sv~~l~~ 1011 (1213)
+++--+.-++-- |+.++...| .+|.|..+. .|.++-.+.++|+...+| ...+.|..+|-+--.+++...
T Consensus 145 ~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsyp--sLkpv~si~AH~snCicI~f~p~GryfA~GsADAlvSLWD~ 218 (313)
T KOG1407|consen 145 QFKFEVNEISWNNSNDLFFLTNGLGCVEILSYP--SLKPVQSIKAHPSNCICIEFDPDGRYFATGSADALVSLWDV 218 (313)
T ss_pred cccceeeeeeecCCCCEEEEecCCceEEEEecc--ccccccccccCCcceEEEEECCCCceEeeccccceeeccCh
Confidence 777667666554 456677788 699999887 565665555556554444 457899999987777777443
No 153
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=61.99 E-value=2e+02 Score=35.79 Aligned_cols=185 Identities=12% Similarity=0.132 Sum_probs=111.3
Q ss_pred eeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 859 VSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 859 ~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
...++.+||.+.+-...-.++..-.-.++.+ + +...+++|--.. .....-.+..|+...+ ++..+.
T Consensus 300 ~~~ve~yd~~~~~w~~~a~m~~~r~~~~~~~--~----~~~lYv~GG~~~------~~~~l~~ve~YD~~~~--~W~~~a 365 (571)
T KOG4441|consen 300 LRSVECYDPKTNEWSSLAPMPSPRCRVGVAV--L----NGKLYVVGGYDS------GSDRLSSVERYDPRTN--QWTPVA 365 (571)
T ss_pred cceeEEecCCcCcEeecCCCCcccccccEEE--E----CCEEEEEccccC------CCcccceEEEecCCCC--ceeccC
Confidence 3478888887653332223332222223333 2 236777775421 1223344556666555 455566
Q ss_pred EEeecCcceEeccccCeEEEEeCC-------eEEEEecCCceeeceeeecCccceEEEEEEeCCEEEE-ee--cCC-c-E
Q 000944 939 KTQVEGIPLALCQFQGRLLAGIGP-------VLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYV-GD--IQE-S-F 1006 (1213)
Q Consensus 939 ~~~~~g~V~ai~~~~g~ll~~~g~-------~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD--~~~-S-v 1006 (1213)
....+..=.+++.++|.|.|.-|. ++..|+....++..++.... +-......+.+++|++ |= ... . =
T Consensus 366 ~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~-~r~~~gv~~~~g~iYi~GG~~~~~~~l~ 444 (571)
T KOG4441|consen 366 PMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLT-RRSGHGVAVLGGKLYIIGGGDGSSNCLN 444 (571)
T ss_pred CccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcccccCCCCc-ceeeeEEEEECCEEEEEcCcCCCccccc
Confidence 655555566677777776654443 47788888888888876653 4566677788887763 32 221 1 2
Q ss_pred EEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCc--E-EEEecCCC
Q 000944 1007 HFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGN--I-YFVRLPQD 1058 (1213)
Q Consensus 1007 ~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gn--l-~il~~~~~ 1058 (1213)
++..|++..++=..++.=..+|....+..+++.-++++..+|. + .+-.|+|.
T Consensus 445 sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~ 499 (571)
T KOG4441|consen 445 SVECYDPETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPE 499 (571)
T ss_pred eEEEEcCCCCceeecCCcccccccceEEEECCEEEEECCccCCCccceEEEEcCC
Confidence 3556898889989888888888888888888776666665552 1 13446663
No 154
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=61.82 E-value=3.1e+02 Score=31.83 Aligned_cols=29 Identities=10% Similarity=0.047 Sum_probs=23.9
Q ss_pred CeeEEEEEeCCCccceeeecCCCc-eEEEE
Q 000944 767 ETFNETALPLRYTPRRFVLQPKKK-LMVII 795 (1213)
Q Consensus 767 ~~~~~r~i~l~~tp~~i~y~~~~~-~~~v~ 795 (1213)
..-.+++|+++.-|..|+++|+.+ .+++.
T Consensus 288 t~kvi~~i~vG~~~~~iavS~Dgkp~lyvt 317 (352)
T TIGR02658 288 TGKRLRKIELGHEIDSINVSQDAKPLLYAL 317 (352)
T ss_pred CCeEEEEEeCCCceeeEEECCCCCeEEEEe
Confidence 456789999999999999999988 55544
No 155
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=61.54 E-value=66 Score=34.39 Aligned_cols=96 Identities=16% Similarity=0.214 Sum_probs=65.1
Q ss_pred ccEEEEEEEEeCCceEEEEEE-EeecCcceEeccc---cCeEEEEe--CCeEEEEecCCceeeceeeecCccceEEEEEE
Q 000944 919 AGYIHIYRFVEEGKSLELLHK-TQVEGIPLALCQF---QGRLLAGI--GPVLRLYDLGKKRLLRKCENKLFPNTIVSINT 992 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~-~~~~g~V~ai~~~---~g~ll~~~--g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~ 992 (1213)
-|.|.+|++..++. .+++.+ .-..|||..+.-- -|.|++++ ..||.||+-++.++.+...+..+..-+.++.-
T Consensus 32 D~tVkIf~v~~n~~-s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiWke~~g~w~k~~e~~~h~~SVNsV~w 110 (299)
T KOG1332|consen 32 DGTVKIFEVRNNGQ-SKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGKVIIWKEENGRWTKAYEHAAHSASVNSVAW 110 (299)
T ss_pred CccEEEEEEcCCCC-ceeeeEecCCCCCeeEEeecccccCcEeeEeecCceEEEEecCCCchhhhhhhhhhcccceeecc
Confidence 46788999988754 333332 4568999887654 35666644 46999999998888776555544445566653
Q ss_pred ----eCCEEEEeecCCcEEEEEEeccC
Q 000944 993 ----YRDRIYVGDIQESFHFCKYRRDE 1015 (1213)
Q Consensus 993 ----~~~~I~vgD~~~Sv~~l~~~~~~ 1015 (1213)
.+=.+++|-.--.|.+|.|+.++
T Consensus 111 apheygl~LacasSDG~vsvl~~~~~g 137 (299)
T KOG1332|consen 111 APHEYGLLLACASSDGKVSVLTYDSSG 137 (299)
T ss_pred cccccceEEEEeeCCCcEEEEEEcCCC
Confidence 34456677777789999998773
No 156
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=58.87 E-value=17 Score=26.36 Aligned_cols=29 Identities=34% Similarity=0.542 Sum_probs=24.1
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
...|.|+++.+ ...+++.|..|++|.+|+
T Consensus 11 ~~~i~~i~~~~------~~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 11 SSSINSIAWSP------DGNFLASGSSDGTIRVWD 39 (39)
T ss_dssp SSSEEEEEEET------TSSEEEEEETTSEEEEEE
T ss_pred CCcEEEEEEec------ccccceeeCCCCEEEEEC
Confidence 36789999976 367899999999999985
No 157
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=58.40 E-value=2.3e+02 Score=32.77 Aligned_cols=246 Identities=15% Similarity=0.180 Sum_probs=132.0
Q ss_pred eEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEEEEECCeeEEEEecCccEEEEEeC-CeEEEEecCccccce
Q 000944 658 LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHR-GRFLLTPLSYETLEY 736 (1213)
Q Consensus 658 ~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~~~~~~~~~v~~~g~~p~~i~~~~-~~~~~~~~~~~~v~~ 736 (1213)
..|+.|...|.+.-++....+.+ -..+.=..||+ ++-...+..|+|..++ |.++|...+.+.+..
T Consensus 109 RRLltgs~SGEFtLWNg~~fnFE----tilQaHDs~Vr----------~m~ws~~g~wmiSgD~gG~iKyWqpnmnnVk~ 174 (464)
T KOG0284|consen 109 RRLLTGSQSGEFTLWNGTSFNFE----TILQAHDSPVR----------TMKWSHNGTWMISGDKGGMIKYWQPNMNNVKI 174 (464)
T ss_pred ceeEeecccccEEEecCceeeHH----HHhhhhcccce----------eEEEccCCCEEEEcCCCceEEecccchhhhHH
Confidence 45889999998876653211110 01111123333 3333445567777654 456676555443322
Q ss_pred ee----------ccccCCCCceEEEEe-CCeEEEEEEccCCCeeEEEEEe-CCCccceeeecCCCceEEEEEccCCCCCH
Q 000944 737 AA----------SFSSDQCVEGVVSVA-GNALRVFTIERLGETFNETALP-LRYTPRRFVLQPKKKLMVIIETDQGALTA 804 (1213)
Q Consensus 737 ~~----------~f~~~~~~~~~i~~~-~~~L~i~~l~~~~~~~~~r~i~-l~~tp~~i~y~~~~~~~~v~~~~~~~~~~ 804 (1213)
+. .|.. -..-|+.++ ++.++|=..-.. -.-|.+. =|.-|+.+.+||...+++...-
T Consensus 175 ~~ahh~eaIRdlafSp--nDskF~t~SdDg~ikiWdf~~~---kee~vL~GHgwdVksvdWHP~kgLiasgsk------- 242 (464)
T KOG0284|consen 175 IQAHHAEAIRDLAFSP--NDSKFLTCSDDGTIKIWDFRMP---KEERVLRGHGWDVKSVDWHPTKGLIASGSK------- 242 (464)
T ss_pred hhHhhhhhhheeccCC--CCceeEEecCCCeEEEEeccCC---chhheeccCCCCcceeccCCccceeEEccC-------
Confidence 21 1211 123477554 567766433321 1122221 1457889999998877665420
Q ss_pred HHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCceeeEEEEEeCCCCceEEEE-EcCCCce
Q 000944 805 EEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLL-ELQDNEA 883 (1213)
Q Consensus 805 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d~~~~~~~~~~-~~~~~E~ 883 (1213)
++ .++|+||++++++++. .++
T Consensus 243 ---------------------------------------------------Dn---lVKlWDprSg~cl~tlh~HK---- 264 (464)
T KOG0284|consen 243 ---------------------------------------------------DN---LVKLWDPRSGSCLATLHGHK---- 264 (464)
T ss_pred ---------------------------------------------------Cc---eeEeecCCCcchhhhhhhcc----
Confidence 01 6899999999887653 222
Q ss_pred EEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE-EeecCcceEe--ccccCeEEEEe
Q 000944 884 AFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK-TQVEGIPLAL--CQFQGRLLAGI 960 (1213)
Q Consensus 884 v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~-~~~~g~V~ai--~~~~g~ll~~~ 960 (1213)
.++..++|. .+.+|++-|. +|. -+.+|++. .++.+.. .-.+.-|+++ .+++..|++..
T Consensus 265 -ntVl~~~f~--~n~N~Llt~s--------kD~----~~kv~DiR----~mkEl~~~r~Hkkdv~~~~WhP~~~~lftsg 325 (464)
T KOG0284|consen 265 -NTVLAVKFN--PNGNWLLTGS--------KDQ----SCKVFDIR----TMKELFTYRGHKKDVTSLTWHPLNESLFTSG 325 (464)
T ss_pred -ceEEEEEEc--CCCCeeEEcc--------CCc----eEEEEehh----HhHHHHHhhcchhhheeeccccccccceeec
Confidence 244445564 3347877664 233 34577765 2332222 1244556777 67788887766
Q ss_pred CC--eEEEEecC-CceeeceeeecCccceEEEEEE--eCCEEEEeecCCcEEE
Q 000944 961 GP--VLRLYDLG-KKRLLRKCENKLFPNTIVSINT--YRDRIYVGDIQESFHF 1008 (1213)
Q Consensus 961 g~--~l~i~~~~-~~~l~~~~~~~~~~~~i~~l~~--~~~~I~vgD~~~Sv~~ 1008 (1213)
|. .|+.|.+. .+.+...- . +.-..|.+|.. .+-.+.-|+--+.+-|
T Consensus 326 g~Dgsvvh~~v~~~~p~~~i~-~-AHd~~iwsl~~hPlGhil~tgsnd~t~rf 376 (464)
T KOG0284|consen 326 GSDGSVVHWVVGLEEPLGEIP-P-AHDGEIWSLAYHPLGHILATGSNDRTVRF 376 (464)
T ss_pred cCCCceEEEeccccccccCCC-c-ccccceeeeeccccceeEeecCCCcceee
Confidence 64 78888887 33332221 1 12335677765 3555556666666655
No 158
>PF14761 HPS3_N: Hermansky-Pudlak syndrome 3
Probab=58.35 E-value=37 Score=35.78 Aligned_cols=64 Identities=22% Similarity=0.235 Sum_probs=41.6
Q ss_pred eecCcc--eEeccccCeEEEEeCCeEEEEecCCcee--eceeeec--------CccceEEEEEEeCCEEEEeecCC
Q 000944 941 QVEGIP--LALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENK--------LFPNTIVSINTYRDRIYVGDIQE 1004 (1213)
Q Consensus 941 ~~~g~V--~ai~~~~g~ll~~~g~~l~i~~~~~~~l--~~~~~~~--------~~~~~i~~l~~~~~~I~vgD~~~ 1004 (1213)
..+-++ .|.|++.|.|++|+++++.+|.+..... .+..+.| ......+.+...++||.+-+-.+
T Consensus 132 Pl~~~p~ciaCC~~tG~LlVg~~~~l~lf~l~~~~~~~~~~~~lDFe~~l~~~~~~~~p~~v~ic~~yiA~~s~~e 207 (215)
T PF14761_consen 132 PLSEPPLCIACCPVTGNLLVGCGNKLVLFTLKYQTIQSEKFSFLDFERSLIDHIDNFKPTQVAICEGYIAVMSDLE 207 (215)
T ss_pred cCCCCCCEEEecCCCCCEEEEcCCEEEEEEEEEEEEecccccEEechhhhhheecCceEEEEEEEeeEEEEecCCE
Confidence 345555 4568899999999999999999863322 1111111 12334677888899988765444
No 159
>PF14781 BBS2_N: Ciliary BBSome complex subunit 2, N-terminal
Probab=57.28 E-value=36 Score=33.01 Aligned_cols=76 Identities=14% Similarity=0.153 Sum_probs=49.9
Q ss_pred CceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecc--cc---Ce
Q 000944 881 NEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQ--FQ---GR 955 (1213)
Q Consensus 881 ~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~--~~---g~ 955 (1213)
|+.++|++..+|.....+..++|||.. .|+.|++.++ .-++.++++..|.+|.- ++ +-
T Consensus 47 n~~italaaG~l~~~~~~D~LliGt~t-------------~llaYDV~~N----~d~Fyke~~DGvn~i~~g~~~~~~~~ 109 (136)
T PF14781_consen 47 NQEITALAAGRLKPDDGRDCLLIGTQT-------------SLLAYDVENN----SDLFYKEVPDGVNAIVIGKLGDIPSP 109 (136)
T ss_pred CCceEEEEEEecCCCCCcCEEEEeccc-------------eEEEEEcccC----chhhhhhCccceeEEEEEecCCCCCc
Confidence 478899999999755667999999974 4789999875 23455667777777632 32 33
Q ss_pred EEEEeCC-eEEEEecCCce
Q 000944 956 LLAGIGP-VLRLYDLGKKR 973 (1213)
Q Consensus 956 ll~~~g~-~l~i~~~~~~~ 973 (1213)
|++.=|+ .|.=|+.+.++
T Consensus 110 l~ivGGncsi~Gfd~~G~e 128 (136)
T PF14781_consen 110 LVIVGGNCSIQGFDYEGNE 128 (136)
T ss_pred EEEECceEEEEEeCCCCcE
Confidence 4443333 45555555443
No 160
>PF12341 DUF3639: Protein of unknown function (DUF3639); InterPro: IPR022100 This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important.
Probab=57.00 E-value=17 Score=24.66 Aligned_cols=26 Identities=19% Similarity=0.427 Sum_probs=21.0
Q ss_pred cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 582 GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 582 ~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
.+|.|+++. ..|+++++..+-++||+
T Consensus 2 E~i~aia~g--------~~~vavaTS~~~lRifs 27 (27)
T PF12341_consen 2 EEIEAIAAG--------DSWVAVATSAGYLRIFS 27 (27)
T ss_pred ceEEEEEcc--------CCEEEEEeCCCeEEecC
Confidence 467888883 57999999988888874
No 161
>PHA03098 kelch-like protein; Provisional
Probab=56.74 E-value=1.2e+02 Score=37.54 Aligned_cols=136 Identities=14% Similarity=0.117 Sum_probs=74.4
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeCC-------eEEEEecC
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGP-------VLRLYDLG 970 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~-------~l~i~~~~ 970 (1213)
..++++|-.. ........++.|+.... +++.+.....+-.-.+++.++|+|++.-|. .+..|+..
T Consensus 295 ~~lyv~GG~~------~~~~~~~~v~~yd~~~~--~W~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~ 366 (534)
T PHA03098 295 NVIYFIGGMN------KNNLSVNSVVSYDTKTK--SWNKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPG 366 (534)
T ss_pred CEEEEECCCc------CCCCeeccEEEEeCCCC--eeeECCCCCcccccceEEEECCEEEEEeCCCCCEecceEEEEcCC
Confidence 4677777432 12223345777777655 444443322222234566678888765552 46678887
Q ss_pred CceeeceeeecCccceEEEEEEeCCEEEE-eecCC----cEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeee
Q 000944 971 KKRLLRKCENKLFPNTIVSINTYRDRIYV-GDIQE----SFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMA 1042 (1213)
Q Consensus 971 ~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~~----Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l 1042 (1213)
.+++...+... .+-.-.+..+.++.|++ |-..+ .=.+.+|+...++-..++.-+.++...++...++.-++
T Consensus 367 ~~~W~~~~~lp-~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~~iyv 442 (534)
T PHA03098 367 ESKWREEPPLI-FPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYV 442 (534)
T ss_pred CCceeeCCCcC-cCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEEECCEEEE
Confidence 77776655443 23333344556777664 43211 12456788777777777665566655554455544333
No 162
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=56.60 E-value=3.1e+02 Score=35.82 Aligned_cols=107 Identities=15% Similarity=0.322 Sum_probs=62.6
Q ss_pred ecCCEEEEEEeCCEEEEEEEccCCCeEEe-eeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEe
Q 000944 547 SNRLQVVIALSGGELIYFEVDMTGQLLEV-EKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQIL 625 (1213)
Q Consensus 547 ~~~~~v~v~~s~~~l~~l~~~~~~~l~~~-~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~ 625 (1213)
+....+++.++.|.+..+.+. |.|... ........|+|+++.. .+.+++.|..+|-|.+|+.+-...+..+
T Consensus 97 ~~~~~ivi~Ts~ghvl~~d~~--~nL~~~~~ne~v~~~Vtsvafn~------dg~~l~~G~~~G~V~v~D~~~~k~l~~i 168 (1206)
T KOG2079|consen 97 IVVVPIVIGTSHGHVLLSDMT--GNLGPLHQNERVQGPVTSVAFNQ------DGSLLLAGLGDGHVTVWDMHRAKILKVI 168 (1206)
T ss_pred eeeeeEEEEcCchhhhhhhhh--cccchhhcCCccCCcceeeEecC------CCceeccccCCCcEEEEEccCCcceeee
Confidence 334456667677777665443 444432 2234457799999954 5778999999999999998543334433
Q ss_pred EEeecCCCCc-eeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 626 SVQSVSSPPE-SLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 626 ~~~~l~~~p~-Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
... ..|. ++..+... ++...++.+-+-|.++...++
T Consensus 169 ~e~---~ap~t~vi~v~~t-----------~~nS~llt~D~~Gsf~~lv~n 205 (1206)
T KOG2079|consen 169 TEH---GAPVTGVIFVGRT-----------SQNSKLLTSDTGGSFWKLVFN 205 (1206)
T ss_pred eec---CCccceEEEEEEe-----------CCCcEEEEccCCCceEEEEec
Confidence 211 2333 33322211 123357777777877555544
No 163
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=56.46 E-value=3.3e+02 Score=30.49 Aligned_cols=86 Identities=16% Similarity=0.091 Sum_probs=45.7
Q ss_pred EEEEEEEEeCCceEEEEEEEeecCcceEe-ccccCeEEEEeCC-eEEEEecCCceeeceeeecCccceEEEEEEeCC---
Q 000944 921 YIHIYRFVEEGKSLELLHKTQVEGIPLAL-CQFQGRLLAGIGP-VLRLYDLGKKRLLRKCENKLFPNTIVSINTYRD--- 995 (1213)
Q Consensus 921 ri~v~~i~~~~~kl~~~~~~~~~g~V~ai-~~~~g~ll~~~g~-~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~--- 995 (1213)
+|-+|+++.. ++-. ..+.+--+.++ +..+++|++|..+ .+.+++-++ ....+++.+++..|-++.+..|
T Consensus 190 ~i~i~q~d~A--~v~~--~i~~~~r~l~~~~l~~~~L~vG~d~~~i~~~D~ds--~~~~~~~~AH~~RVK~i~~~~~~~~ 263 (362)
T KOG0294|consen 190 KIDIYQLDNA--SVFR--EIENPKRILCATFLDGSELLVGGDNEWISLKDTDS--DTPLTEFLAHENRVKDIASYTNPEH 263 (362)
T ss_pred EEEEEecccH--hHhh--hhhccccceeeeecCCceEEEecCCceEEEeccCC--CccceeeecchhheeeeEEEecCCc
Confidence 5668887753 1100 00111223333 3346778887765 455555544 4455666666777888887766
Q ss_pred -EEEEeecCCcEEEEEEe
Q 000944 996 -RIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 996 -~I~vgD~~~Sv~~l~~~ 1012 (1213)
+|+-.-.--.|.++..+
T Consensus 264 ~~lvTaSSDG~I~vWd~~ 281 (362)
T KOG0294|consen 264 EYLVTASSDGFIKVWDID 281 (362)
T ss_pred eEEEEeccCceEEEEEcc
Confidence 44444444445554443
No 164
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=55.07 E-value=3.2e+02 Score=29.94 Aligned_cols=106 Identities=15% Similarity=0.206 Sum_probs=61.8
Q ss_pred CEEEEEEeCCEEEEEEEccCCCeEEeeeecc-CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEe
Q 000944 550 LQVVIALSGGELIYFEVDMTGQLLEVEKHEM-SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQ 628 (1213)
Q Consensus 550 ~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l-~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~ 628 (1213)
.+++.+.-+.+.++++ +.++.+..+..++= +.||-|++... ...+||.++.|.++=|+.++.++.++.++.
T Consensus 74 ~~La~aSFD~t~~Iw~-k~~~efecv~~lEGHEnEVK~Vaws~------sG~~LATCSRDKSVWiWe~deddEfec~aV- 145 (312)
T KOG0645|consen 74 RYLASASFDATVVIWK-KEDGEFECVATLEGHENEVKCVAWSA------SGNYLATCSRDKSVWIWEIDEDDEFECIAV- 145 (312)
T ss_pred cEEEEeeccceEEEee-cCCCceeEEeeeeccccceeEEEEcC------CCCEEEEeeCCCeEEEEEecCCCcEEEEee-
Confidence 3444333344555443 22455555544442 47899999964 578999999999999999976666765542
Q ss_pred ecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 629 SVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 629 ~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
|...-..++.+.... ....|+-+.-|..+-.|..+
T Consensus 146 -L~~HtqDVK~V~WHP-----------t~dlL~S~SYDnTIk~~~~~ 180 (312)
T KOG0645|consen 146 -LQEHTQDVKHVIWHP-----------TEDLLFSCSYDNTIKVYRDE 180 (312)
T ss_pred -eccccccccEEEEcC-----------CcceeEEeccCCeEEEEeec
Confidence 111123344444321 23445555556666666654
No 165
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=54.68 E-value=6.7e+02 Score=33.54 Aligned_cols=68 Identities=13% Similarity=0.096 Sum_probs=38.4
Q ss_pred eeEEEEEEeCCcEEEEEeCCCCce--eEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 599 SRFLAVGSYDNTIRILSLDPDDCM--QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 599 ~~~l~v~~~~~~i~i~sl~p~~~l--~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
+.+++.|..||+|++|.+..-..- ..-+..-.+-.......+.+- .....+.||..||.+....++-
T Consensus 1061 ~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~vt~~-----------~~~~~~Av~t~DG~v~~~~id~ 1129 (1431)
T KOG1240|consen 1061 TSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEKVTMC-----------GNGDQFAVSTKDGSVRVLRIDH 1129 (1431)
T ss_pred CceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEEEEec-----------cCCCeEEEEcCCCeEEEEEccc
Confidence 468888888999999998532100 001111111011122222221 2445677899999999998875
Q ss_pred C
Q 000944 677 V 677 (1213)
Q Consensus 677 ~ 677 (1213)
.
T Consensus 1130 ~ 1130 (1431)
T KOG1240|consen 1130 Y 1130 (1431)
T ss_pred c
Confidence 3
No 166
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=54.18 E-value=3.4e+02 Score=30.03 Aligned_cols=110 Identities=6% Similarity=0.010 Sum_probs=66.8
Q ss_pred cCcceEecccc--CeEEE-EeCCeEEEEecC-Ccee--eceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEecc-C
Q 000944 943 EGIPLALCQFQ--GRLLA-GIGPVLRLYDLG-KKRL--LRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRD-E 1015 (1213)
Q Consensus 943 ~g~V~ai~~~~--g~ll~-~~g~~l~i~~~~-~~~l--~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~-~ 1015 (1213)
.+.|+++.--. .+|+. ++..++.||+-- .+|. ++....+ .+.......+++|.+|-+-.=.++|..... .
T Consensus 55 ~~Ki~~~~ws~Dsr~ivSaSqDGklIvWDs~TtnK~haipl~s~W---VMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~ 131 (343)
T KOG0286|consen 55 LNKIYAMDWSTDSRRIVSASQDGKLIVWDSFTTNKVHAIPLPSSW---VMTCAYSPSGNFVACGGLDNKCSIYPLSTRDA 131 (343)
T ss_pred ccceeeeEecCCcCeEEeeccCCeEEEEEcccccceeEEecCcee---EEEEEECCCCCeEEecCcCceeEEEecccccc
Confidence 34455554432 24444 344688898753 2322 1211111 134456677999999999998888877632 3
Q ss_pred CeEEEeeccC--CCcceEEEEeecCCeeeeecCCCcEEEEec
Q 000944 1016 NQLYIFADDS--VPRWLTAAHHIDFDTMAGADKFGNIYFVRL 1055 (1213)
Q Consensus 1016 ~~l~~~a~D~--~~~~~~~~~~ld~~~~l~~D~~gnl~il~~ 1055 (1213)
.-...++|.. +.-++++|.|+|+..++.+--+....+.+.
T Consensus 132 ~g~~~v~r~l~gHtgylScC~f~dD~~ilT~SGD~TCalWDi 173 (343)
T KOG0286|consen 132 EGNVRVSRELAGHTGYLSCCRFLDDNHILTGSGDMTCALWDI 173 (343)
T ss_pred cccceeeeeecCccceeEEEEEcCCCceEecCCCceEEEEEc
Confidence 3344444443 456778999999888988777766666554
No 167
>PF00325 Crp: Bacterial regulatory proteins, crp family; InterPro: IPR001808 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. These proteins are very diverse, but for convenience may be grouped into subfamilies on the basis of sequence similarity. This family groups together a range of proteins, including anr, crp, clp, cysR, fixK, flp, fnr, fnrN, hlyX and ntcA. Within this family, the HTH motif is situated towards the C terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 2OZ6_A 1CGP_B 2GZW_C 1O3T_B 3ROU_A 2CGP_A 3RDI_A 1I5Z_A 3IYD_H 3FWE_B ....
Probab=53.92 E-value=21 Score=25.26 Aligned_cols=26 Identities=23% Similarity=0.337 Sum_probs=21.9
Q ss_pred HHHHHHHHcCCCHHHHHHHHHHHHhc
Q 000944 1186 LQRKIADELDRTPGEILKKLEEIRNK 1211 (1213)
Q Consensus 1186 ~q~~i~~~~~~~~~~i~~~l~~l~~~ 1211 (1213)
.+++||+.+|.+++.+.+.|.+|+.+
T Consensus 4 tr~diA~~lG~t~ETVSR~l~~l~~~ 29 (32)
T PF00325_consen 4 TRQDIADYLGLTRETVSRILKKLERQ 29 (32)
T ss_dssp -HHHHHHHHTS-HHHHHHHHHHHHHT
T ss_pred CHHHHHHHhCCcHHHHHHHHHHHHHc
Confidence 46899999999999999999998875
No 168
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=52.12 E-value=3.2e+02 Score=30.89 Aligned_cols=185 Identities=17% Similarity=0.180 Sum_probs=100.8
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc------cCeEE--EEeCCeEEEEec
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF------QGRLL--AGIGPVLRLYDL 969 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~------~g~ll--~~~g~~l~i~~~ 969 (1213)
+..++||.+ .|.+.+|+.... .+|+. ++|+...+..+ ..+.+ ++.-..|++|+.
T Consensus 40 e~~vav~lS------------ngsv~lyd~~tg-~~l~~-----fk~~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD~ 101 (376)
T KOG1188|consen 40 ETAVAVSLS------------NGSVRLYDKGTG-QLLEE-----FKGPPATTNGVRFISCDSPHGVISCSSDGTVRLWDI 101 (376)
T ss_pred ceeEEEEec------------CCeEEEEeccch-hhhhe-----ecCCCCcccceEEecCCCCCeeEEeccCCeEEEEEe
Confidence 466777765 567788886652 23332 22332222111 23333 344458999988
Q ss_pred CCce-eeceeeecCc--cceEEEEEEeCCEEEEeecC----CcEEEEEEeccCCeEEEeeccCCCcceEEEEee--cCCe
Q 000944 970 GKKR-LLRKCENKLF--PNTIVSINTYRDRIYVGDIQ----ESFHFCKYRRDENQLYIFADDSVPRWLTAAHHI--DFDT 1040 (1213)
Q Consensus 970 ~~~~-l~~~~~~~~~--~~~i~~l~~~~~~I~vgD~~----~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~l--d~~~ 1040 (1213)
.... --+..+-..- +.....+.+.++.|.+|+-. -++.|+.++.. .++.-.--|.+.-.+|++.|- +.+.
T Consensus 102 Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~-qq~l~~~~eSH~DDVT~lrFHP~~pnl 180 (376)
T KOG1188|consen 102 RSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSE-QQLLRQLNESHNDDVTQLRFHPSDPNL 180 (376)
T ss_pred ecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEeccc-cchhhhhhhhccCcceeEEecCCCCCe
Confidence 6321 1111111111 33334455688999998644 35666555543 333222334455568888764 4456
Q ss_pred eeeecCCCcEEEEecCCCCCcccccCCCCCccccccCccCCcccceeeeeeeecCceeceEEEeeecCCCccEEEEEecc
Q 000944 1041 MAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVM 1120 (1213)
Q Consensus 1041 ~l~~D~~gnl~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~~~~lg~~v~~~~~~~~~~~~~~~i~~~t~~ 1120 (1213)
++.+-.+|-+-+|....+.. || -| ...++.|..|-++.- ...+...|++-|-.
T Consensus 181 LlSGSvDGLvnlfD~~~d~E----eD------------------aL--~~viN~~sSI~~igw---~~~~ykrI~clTH~ 233 (376)
T KOG1188|consen 181 LLSGSVDGLVNLFDTKKDNE----ED------------------AL--LHVINHGSSIHLIGW---LSKKYKRIMCLTHM 233 (376)
T ss_pred EEeecccceEEeeecCCCcc----hh------------------hH--HHhhcccceeeeeee---ecCCcceEEEEEcc
Confidence 77888899999998876533 22 12 223666666655432 12334568888888
Q ss_pred cceEEEEec
Q 000944 1121 GSLGAMLAF 1129 (1213)
Q Consensus 1121 Gsig~l~~l 1129 (1213)
+++. +-.+
T Consensus 234 Etf~-~~el 241 (376)
T KOG1188|consen 234 ETFA-IYEL 241 (376)
T ss_pred Ccee-EEEc
Confidence 8877 3345
No 169
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=49.57 E-value=3.9e+02 Score=29.39 Aligned_cols=89 Identities=16% Similarity=0.210 Sum_probs=61.0
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCe--EEEEeCCeEEEEecCCceeeceeeec-------CccceEEE
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGR--LLAGIGPVLRLYDLGKKRLLRKCENK-------LFPNTIVS 989 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~--ll~~~g~~l~i~~~~~~~l~~~~~~~-------~~~~~i~~ 989 (1213)
.|.+++|++.+. | -++..+...+|.|+|--..+ |.+|++..|+||+++.+..+..-..+ .-.-+.++
T Consensus 213 dg~~~LwdL~~~--k--~lysl~a~~~v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~~l~~d~~g~s~~~~~~~cls 288 (315)
T KOG0279|consen 213 DGEAMLWDLNEG--K--NLYSLEAFDIVNSLCFSPNRYWLCAATATSIKIWDLESKAVVEELKLDGIGPSSKAGDPICLS 288 (315)
T ss_pred CceEEEEEccCC--c--eeEeccCCCeEeeEEecCCceeEeeccCCceEEEeccchhhhhhccccccccccccCCcEEEE
Confidence 588999999875 2 27777888999999877554 46688999999999866433211111 01224555
Q ss_pred EEEe--CCEEEEeecCCcEEEEEE
Q 000944 990 INTY--RDRIYVGDIQESFHFCKY 1011 (1213)
Q Consensus 990 l~~~--~~~I~vgD~~~Sv~~l~~ 1011 (1213)
+... +..++.|+--..+-+++.
T Consensus 289 laws~dG~tLf~g~td~~irv~qv 312 (315)
T KOG0279|consen 289 LAWSADGQTLFAGYTDNVIRVWQV 312 (315)
T ss_pred EEEcCCCcEEEeeecCCcEEEEEe
Confidence 5544 778888888887777654
No 170
>PF08553 VID27: VID27 cytoplasmic protein; InterPro: IPR013863 This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function that possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles; it is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=49.03 E-value=49 Score=42.24 Aligned_cols=86 Identities=15% Similarity=0.266 Sum_probs=60.7
Q ss_pred ceEEEEEeCCCCEEEEEEEeCCceeeeeEEEEeCCCCcceeEEEEcCCeEEEEeeeCCeEEEEEeecCCCCCcccccCCc
Q 000944 294 LFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTL 373 (1213)
Q Consensus 294 ~~~~ll~~~~G~l~~l~l~~~~~~v~~l~i~~l~~~~~~s~l~~l~~~~lFvgS~~gds~l~~~~~~~~~~~~~~~~~~~ 373 (1213)
...-+++..+-.||++.-...+..+..-......+.+--+|++.-.+|++.|||.-||=.||- ..+
T Consensus 542 ~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~aTt~~G~iavgs~~G~IRLyd--~~g------------ 607 (794)
T PF08553_consen 542 NEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCFATTEDGYIAVGSNKGDIRLYD--RLG------------ 607 (794)
T ss_pred CCceEEEECCCceEEeccCCCCCceeeccccccccCCCceEEEecCCceEEEEeCCCcEEeec--ccc------------
Confidence 346788888999999988777643322122233444567899999999999999999988873 111
Q ss_pred cccccCCCceeeccCCcccEEEEEEeccCC-cccceEEec
Q 000944 374 METEEGFQPVFFQPRGLKNLVRIEQVESLM-PIMDMRIAN 412 (1213)
Q Consensus 374 ~~~~~~~~~~~~~~~~~~~l~~~d~~~n~g-PI~D~~~~~ 412 (1213)
.+--..||++| ||+.+++..
T Consensus 608 -------------------~~AKT~lp~lG~pI~~iDvt~ 628 (794)
T PF08553_consen 608 -------------------KRAKTALPGLGDPIIGIDVTA 628 (794)
T ss_pred -------------------hhhhhcCCCCCCCeeEEEecC
Confidence 12345678887 999998864
No 171
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=48.64 E-value=1.9e+02 Score=35.55 Aligned_cols=108 Identities=17% Similarity=0.218 Sum_probs=73.2
Q ss_pred ecCcceEeccccC-eEEEEeC-CeEEEEecCCceeeceeeecCccceEEEEE-EeCCEEEEeecCCc-EEEEEEeccCCe
Q 000944 942 VEGIPLALCQFQG-RLLAGIG-PVLRLYDLGKKRLLRKCENKLFPNTIVSIN-TYRDRIYVGDIQES-FHFCKYRRDENQ 1017 (1213)
Q Consensus 942 ~~g~V~ai~~~~g-~ll~~~g-~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~-~~~~~I~vgD~~~S-v~~l~~~~~~~~ 1017 (1213)
....|.+++.+.+ .++.|.+ ..|..|+++..-|.+. .++.+++-+|+ ..++-++|..--++ +-+ |+.+
T Consensus 178 HtD~VRgL~vl~~~~flScsNDg~Ir~w~~~ge~l~~~---~ghtn~vYsis~~~~~~~Ivs~gEDrtlri--W~~~--- 249 (745)
T KOG0301|consen 178 HTDCVRGLAVLDDSHFLSCSNDGSIRLWDLDGEVLLEM---HGHTNFVYSISMALSDGLIVSTGEDRTLRI--WKKD--- 249 (745)
T ss_pred chhheeeeEEecCCCeEeecCCceEEEEeccCceeeee---eccceEEEEEEecCCCCeEEEecCCceEEE--eecC---
Confidence 6789999999977 6666655 4788999977766443 22456788888 55666666666554 444 3322
Q ss_pred EEEeeccCCCc-ceEEEEeecCCeeeeecCCCcEEEEecCCC
Q 000944 1018 LYIFADDSVPR-WLTAAHHIDFDTMAGADKFGNIYFVRLPQD 1058 (1213)
Q Consensus 1018 l~~~a~D~~~~-~~~~~~~ld~~~~l~~D~~gnl~il~~~~~ 1058 (1213)
+.++-=..|. .+.++.++.++.|+++-.+|-+++|..++.
T Consensus 250 -e~~q~I~lPttsiWsa~~L~NgDIvvg~SDG~VrVfT~~k~ 290 (745)
T KOG0301|consen 250 -ECVQVITLPTTSIWSAKVLLNGDIVVGGSDGRVRVFTVDKD 290 (745)
T ss_pred -ceEEEEecCccceEEEEEeeCCCEEEeccCceEEEEEeccc
Confidence 1111112233 566888899999999999999999998754
No 172
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=47.27 E-value=92 Score=38.97 Aligned_cols=118 Identities=16% Similarity=0.149 Sum_probs=74.9
Q ss_pred EEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcc--eEeccc
Q 000944 875 LLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIP--LALCQF 952 (1213)
Q Consensus 875 ~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V--~ai~~~ 952 (1213)
.++|.+...+.++.+|- ...++++|..-- ....|.+-+|- +.+ . -..++.-|| +++|.-
T Consensus 8 kIE~~Dsdavsti~SWH----PsePlfAVA~fS--------~er~GSVtIfa-dtG--E----Pqr~Vt~P~hatSLCWH 68 (1416)
T KOG3617|consen 8 KIEFLDSDAVSTISSWH----PSEPLFAVASFS--------PERGGSVTIFA-DTG--E----PQRDVTYPVHATSLCWH 68 (1416)
T ss_pred eeecccccccccccccC----CCCceeEEEEec--------CCCCceEEEEe-cCC--C----CCcccccceehhhhccC
Confidence 35566656666677775 346899887642 22356665553 211 1 223444555 567877
Q ss_pred cCeEEEEeCC---eEEEEecCCceeece-eeecCccceEEEEEEeCCEEEEeecCCcEEEEEEe
Q 000944 953 QGRLLAGIGP---VLRLYDLGKKRLLRK-CENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 953 ~g~ll~~~g~---~l~i~~~~~~~l~~~-~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
-.+++.++|- .+.+|.-..++...+ ..+. .+.....-+-.++.++-+|.+-++.+++|+
T Consensus 69 pe~~vLa~gwe~g~~~v~~~~~~e~htv~~th~-a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d 131 (1416)
T KOG3617|consen 69 PEEFVLAQGWEMGVSDVQKTNTTETHTVVETHP-APIQGLDWSHDGTVLMTLDNPGSVHLWRYD 131 (1416)
T ss_pred hHHHHHhhccccceeEEEecCCceeeeeccCCC-CCceeEEecCCCCeEEEcCCCceeEEEEee
Confidence 7788888874 456776666654443 3332 354555556689999999999999999997
No 173
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=46.97 E-value=1.3e+02 Score=37.00 Aligned_cols=114 Identities=11% Similarity=0.053 Sum_probs=66.6
Q ss_pred eecCcceEec--cccCeEEEEeCC-eEEEEecC-CceeeceeeecCccceEEEEEE--e-CCEEEEeecCCcEEEEEEec
Q 000944 941 QVEGIPLALC--QFQGRLLAGIGP-VLRLYDLG-KKRLLRKCENKLFPNTIVSINT--Y-RDRIYVGDIQESFHFCKYRR 1013 (1213)
Q Consensus 941 ~~~g~V~ai~--~~~g~ll~~~g~-~l~i~~~~-~~~l~~~~~~~~~~~~i~~l~~--~-~~~I~vgD~~~Sv~~l~~~~ 1013 (1213)
-..|+|+++. +|..+++.++|. .+++|.-+ ... +.-+++.-+.+++++.- . --..+++|.+-=+.++.+..
T Consensus 396 ~h~g~v~~v~~nPF~~k~fls~gDW~vriWs~~~~~~--Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~ 473 (555)
T KOG1587|consen 396 THIGPVYAVSRNPFYPKNFLSVGDWTVRIWSEDVIAS--PLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQ 473 (555)
T ss_pred ccCcceEeeecCCCccceeeeeccceeEeccccCCCC--cchhhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhhc
Confidence 4579999984 466677666664 78888765 322 11122211334555553 2 34566788888777765542
Q ss_pred cCCeEEEeeccCCCcceEEEEeecC--CeeeeecCCCcEEEEecCCC
Q 000944 1014 DENQLYIFADDSVPRWLTAAHHIDF--DTMAGADKFGNIYFVRLPQD 1058 (1213)
Q Consensus 1014 ~~~~l~~~a~D~~~~~~~~~~~ld~--~~~l~~D~~gnl~il~~~~~ 1058 (1213)
+....+++......+....+... ..+.++|..|+++++++++.
T Consensus 474 --~~~~Pv~s~~~~~~~l~~~~~s~~g~~lavGd~~G~~~~~~l~~~ 518 (555)
T KOG1587|consen 474 --DDEEPVLSQKVCSPALTRVRWSPNGKLLAVGDANGTTHILKLSES 518 (555)
T ss_pred --cccCCcccccccccccceeecCCCCcEEEEecCCCcEEEEEcCch
Confidence 33334444333323333344333 24788999999999999763
No 174
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid and, through this ability, to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=46.97 E-value=4.2e+02 Score=30.12 Aligned_cols=119 Identities=17% Similarity=0.116 Sum_probs=62.6
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcc--eEeccccCeEEEEeC-------CeEEEEe
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIP--LALCQFQGRLLAGIG-------PVLRLYD 968 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V--~ai~~~~g~ll~~~g-------~~l~i~~ 968 (1213)
..++++|-.. + ......++.|++....+..+.....+++.+. .+.+.++++|.+.-| +.++.|+
T Consensus 73 ~~lyviGG~~------~-~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd 145 (323)
T TIGR03548 73 NGIYYIGGSN------S-SERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFN 145 (323)
T ss_pred CEEEEEcCCC------C-CCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEc
Confidence 4678888531 1 1123456777776653322111112333333 456667888877555 2677888
Q ss_pred cCCceeeceeeecCccceEEEEEEeCCEEEE-eecC--CcEEEEEEeccCCeEEEeec
Q 000944 969 LGKKRLLRKCENKLFPNTIVSINTYRDRIYV-GDIQ--ESFHFCKYRRDENQLYIFAD 1023 (1213)
Q Consensus 969 ~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~--~Sv~~l~~~~~~~~l~~~a~ 1023 (1213)
+...++...+.....+-.-..+.+.++.|+| |=.. ....++.|+.+.++-..++.
T Consensus 146 ~~~~~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~~W~~~~~ 203 (323)
T TIGR03548 146 LETQEWFELPDFPGEPRVQPVCVKLQNELYVFGGGSNIAYTDGYKYSPKKNQWQKVAD 203 (323)
T ss_pred CCCCCeeECCCCCCCCCCcceEEEECCEEEEEcCCCCccccceEEEecCCCeeEECCC
Confidence 8777776554322101112233456666654 4221 12335678887777777764
No 175
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=46.69 E-value=4.3e+02 Score=28.96 Aligned_cols=116 Identities=16% Similarity=0.280 Sum_probs=73.4
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCC-----ce--
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEG-----KS-- 933 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~-----~k-- 933 (1213)
+++|+|-.+++++...+++. .++.+.|.- ..+++++.|. +..-..|.+.+|++.... ..
T Consensus 75 t~kLWDv~tGk~la~~k~~~-----~Vk~~~F~~--~gn~~l~~tD-------~~mg~~~~v~~fdi~~~~~~~~s~ep~ 140 (327)
T KOG0643|consen 75 TAKLWDVETGKQLATWKTNS-----PVKRVDFSF--GGNLILASTD-------KQMGYTCFVSVFDIRDDSSDIDSEEPY 140 (327)
T ss_pred eeEEEEcCCCcEEEEeecCC-----eeEEEeecc--CCcEEEEEeh-------hhcCcceEEEEEEccCChhhhcccCce
Confidence 78999999999999887764 456666653 3468888885 455578999999998641 12
Q ss_pred EEEEEEEeecCcceEecc-ccCeEEEEeC-CeEEEEecCCc-eeeceeeecCccceEEEEEEeC
Q 000944 934 LELLHKTQVEGIPLALCQ-FQGRLLAGIG-PVLRLYDLGKK-RLLRKCENKLFPNTIVSINTYR 994 (1213)
Q Consensus 934 l~~~~~~~~~g~V~ai~~-~~g~ll~~~g-~~l~i~~~~~~-~l~~~~~~~~~~~~i~~l~~~~ 994 (1213)
+++... + ..+..++-. .+.+|+++=- .+|..|+.... +++..... +...|.+|.-..
T Consensus 141 ~kI~t~-~-skit~a~Wg~l~~~ii~Ghe~G~is~~da~~g~~~v~s~~~--h~~~Ind~q~s~ 200 (327)
T KOG0643|consen 141 LKIPTP-D-SKITSALWGPLGETIIAGHEDGSISIYDARTGKELVDSDEE--HSSKINDLQFSR 200 (327)
T ss_pred EEecCC-c-cceeeeeecccCCEEEEecCCCcEEEEEcccCceeeechhh--hccccccccccC
Confidence 333222 2 445555543 4555655443 48899998864 45544443 244566665443
No 176
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=45.61 E-value=6.1e+02 Score=30.41 Aligned_cols=62 Identities=19% Similarity=0.464 Sum_probs=43.2
Q ss_pred CCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc---CeEEEEeCC-eEEEE
Q 000944 896 EHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ---GRLLAGIGP-VLRLY 967 (1213)
Q Consensus 896 ~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~---g~ll~~~g~-~l~i~ 967 (1213)
....+|+-||+.. .....|.|++|+-. ++..+++.++.++--.-|... +.|+++.|. .+++|
T Consensus 419 Pd~kli~TGtS~~------~~~~~g~L~f~d~~----t~d~v~ki~i~~aSvv~~~WhpkLNQi~~gsgdG~~~vy 484 (641)
T KOG0772|consen 419 PDDKLILTGTSAP------NGMTAGTLFFFDRM----TLDTVYKIDISTASVVRCLWHPKLNQIFAGSGDGTAHVY 484 (641)
T ss_pred CCceEEEeccccc------CCCCCceEEEEecc----ceeeEEEecCCCceEEEEeecchhhheeeecCCCceEEE
Confidence 3468999999864 45677888888854 788999888876544445554 456777775 55554
No 177
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=44.97 E-value=2.3e+02 Score=32.16 Aligned_cols=167 Identities=14% Similarity=0.193 Sum_probs=90.3
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEE-EE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELL-HK 939 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~-~~ 939 (1213)
.+.++|+.--.++.+++..-. . +.++++. ...+.|+.+++- .+-|++|+..... -|+.+ -.
T Consensus 168 ~i~IWD~~R~~Pv~smswG~D-t---i~svkfN--pvETsILas~~s-----------DrsIvLyD~R~~~-Pl~KVi~~ 229 (433)
T KOG0268|consen 168 QIDIWDEQRDNPVSSMSWGAD-S---ISSVKFN--PVETSILASCAS-----------DRSIVLYDLRQAS-PLKKVILT 229 (433)
T ss_pred eeeecccccCCccceeecCCC-c---eeEEecC--CCcchheeeecc-----------CCceEEEecccCC-ccceeeee
Confidence 356666654445555444322 2 2333342 334555555543 3458899987652 12222 22
Q ss_pred EeecCcceEeccccCeEEEEeCC--eEEEEecCCceeec-eeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEecc
Q 000944 940 TQVEGIPLALCQFQGRLLAGIGP--VLRLYDLGKKRLLR-KCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRD 1014 (1213)
Q Consensus 940 ~~~~g~V~ai~~~~g~ll~~~g~--~l~i~~~~~~~l~~-~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~ 1014 (1213)
...++. |-.+ +++..++.+. .+|.|++. .|.. ...+..+.+.+.++. ..|.-++-|-.-+||-++..++.
T Consensus 230 mRTN~I--swnP-eafnF~~a~ED~nlY~~DmR--~l~~p~~v~~dhvsAV~dVdfsptG~EfvsgsyDksIRIf~~~~~ 304 (433)
T KOG0268|consen 230 MRTNTI--CWNP-EAFNFVAANEDHNLYTYDMR--NLSRPLNVHKDHVSAVMDVDFSPTGQEFVSGSYDKSIRIFPVNHG 304 (433)
T ss_pred ccccce--ecCc-cccceeeccccccceehhhh--hhcccchhhcccceeEEEeccCCCcchhccccccceEEEeecCCC
Confidence 222222 2233 4555555554 56655553 3321 122223456666665 45889999999999999888643
Q ss_pred CCeEEEeeccC----CCcceEEEEe-ecCCeeeeecCCCcEEEEecC
Q 000944 1015 ENQLYIFADDS----VPRWLTAAHH-IDFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1015 ~~~l~~~a~D~----~~~~~~~~~~-ld~~~~l~~D~~gnl~il~~~ 1056 (1213)
-+||. .-..|.++.. .|...++.+..+||+.+++-+
T Consensus 305 ------~SRdiYhtkRMq~V~~Vk~S~Dskyi~SGSdd~nvRlWka~ 345 (433)
T KOG0268|consen 305 ------HSRDIYHTKRMQHVFCVKYSMDSKYIISGSDDGNVRLWKAK 345 (433)
T ss_pred ------cchhhhhHhhhheeeEEEEeccccEEEecCCCcceeeeecc
Confidence 22332 1123556665 444456778889999997643
No 178
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=44.82 E-value=2.7e+02 Score=32.03 Aligned_cols=100 Identities=10% Similarity=0.091 Sum_probs=57.5
Q ss_pred ecCcceE--eccccCeEEEEeCC--eEEEEecCCceeec-----eeeecCccceEEEE------------EEeCCEEEEe
Q 000944 942 VEGIPLA--LCQFQGRLLAGIGP--VLRLYDLGKKRLLR-----KCENKLFPNTIVSI------------NTYRDRIYVG 1000 (1213)
Q Consensus 942 ~~g~V~a--i~~~~g~ll~~~g~--~l~i~~~~~~~l~~-----~~~~~~~~~~i~~l------------~~~~~~I~vg 1000 (1213)
..|||.- -|++|+..|++.+. +++||++.+.-|.+ +.++..+.-.+--+ ...+|.|.+.
T Consensus 80 Ht~~vLDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~v~iW 159 (472)
T KOG0303|consen 80 HTAPVLDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNTVSIW 159 (472)
T ss_pred ccccccccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCceEEEE
Confidence 5678854 48899999887663 89999997654432 22222111001001 1145666665
Q ss_pred ecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEe-ecCCeeeeecCCCcEEEEe
Q 000944 1001 DIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHH-IDFDTMAGADKFGNIYFVR 1054 (1213)
Q Consensus 1001 D~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~-ld~~~~l~~D~~gnl~il~ 1054 (1213)
++-.+-.++.. . +|--++++.| -|.+.++.+-++.-+.+++
T Consensus 160 nv~tgeali~l----------~---hpd~i~S~sfn~dGs~l~TtckDKkvRv~d 201 (472)
T KOG0303|consen 160 NVGTGEALITL----------D---HPDMVYSMSFNRDGSLLCTTCKDKKVRVID 201 (472)
T ss_pred eccCCceeeec----------C---CCCeEEEEEeccCCceeeeecccceeEEEc
Confidence 55555444432 2 5555667666 3545677778888888864
No 179
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=44.73 E-value=83 Score=39.14 Aligned_cols=95 Identities=17% Similarity=0.213 Sum_probs=61.8
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEE-----EeCC--c
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF-----VEEG--K 932 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i-----~~~~--~ 932 (1213)
+++.++|...........|+.++.|..+. |... .+++..++||-.. ++++|.- ...+ +
T Consensus 51 ~~LtIWD~~~~~lE~~~~f~~~~~I~dLD-Wtst-~d~qsiLaVGf~~-------------~v~l~~Q~R~dy~~~~p~w 115 (631)
T PF12234_consen 51 SELTIWDTRSGVLEYEESFSEDDPIRDLD-WTST-PDGQSILAVGFPH-------------HVLLYTQLRYDYTNKGPSW 115 (631)
T ss_pred CEEEEEEcCCcEEEEeeeecCCCceeece-eeec-CCCCEEEEEEcCc-------------EEEEEEccchhhhcCCccc
Confidence 47889998877766777787888887764 4432 2457899998642 4444432 1112 2
Q ss_pred -eEEEEEEEe-ecCcceEeccc-cCeEEEEeCCeEEEEec
Q 000944 933 -SLELLHKTQ-VEGIPLALCQF-QGRLLAGIGPVLRLYDL 969 (1213)
Q Consensus 933 -kl~~~~~~~-~~g~V~ai~~~-~g~ll~~~g~~l~i~~~ 969 (1213)
.++.+.-.+ ++.|+...+-. +|.|++|.|++++||+-
T Consensus 116 ~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~sGNqlfv~dk 155 (631)
T PF12234_consen 116 APIRKIDISSHTPHPIGDSIWLKDGTLVVGSGNQLFVFDK 155 (631)
T ss_pred ceeEEEEeecCCCCCccceeEecCCeEEEEeCCEEEEECC
Confidence 344443233 35788777776 57899999999999853
No 180
>PHA02790 Kelch-like protein; Provisional
Probab=44.64 E-value=3.4e+02 Score=33.08 Aligned_cols=137 Identities=10% Similarity=0.048 Sum_probs=75.3
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCeEEEEeC----CeEEEEecCCce
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIG----PVLRLYDLGKKR 973 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g----~~l~i~~~~~~~ 973 (1213)
..++++|... .....+ .+..|+...+ +...+.....+-.-.+.+.++|+|.+.-| +.+..|+...++
T Consensus 272 ~~lyviGG~~------~~~~~~-~v~~Ydp~~~--~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~~~sve~ydp~~n~ 342 (480)
T PHA02790 272 EVVYLIGGWM------NNEIHN-NAIAVNYISN--NWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTSVERWFHGDAA 342 (480)
T ss_pred CEEEEEcCCC------CCCcCC-eEEEEECCCC--EEEECCCCCchhhcceEEEECCEEEEECCcCCCCceEEEECCCCe
Confidence 4677888542 111111 2344554433 44444433322223455667888766544 246677777677
Q ss_pred eeceeeecCccceEEEEEEeCCEEEE-eecCCc-EEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeee
Q 000944 974 LLRKCENKLFPNTIVSINTYRDRIYV-GDIQES-FHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGA 1044 (1213)
Q Consensus 974 l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~~S-v~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~ 1044 (1213)
+...+... .+-.-.+..+.++.|+| |-...+ -.+..|++..++-..++.=..++...++..+++.-++++
T Consensus 343 W~~~~~l~-~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~~IYv~G 414 (480)
T PHA02790 343 WVNMPSLL-KPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVG 414 (480)
T ss_pred EEECCCCC-CCCcccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeCCCCCCccccceEEEECCEEEEEC
Confidence 76665543 12233355667888764 433221 235568888888888777667776665556665544444
No 181
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=43.97 E-value=3.3e+02 Score=31.43 Aligned_cols=145 Identities=14% Similarity=0.173 Sum_probs=88.2
Q ss_pred CCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccc--cCeEEEEe--CC--eEEEEe
Q 000944 895 KEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF--QGRLLAGI--GP--VLRLYD 968 (1213)
Q Consensus 895 ~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~--~g~ll~~~--g~--~l~i~~ 968 (1213)
.+...+++|+|.- -+.++|++..+....+++....++-..++|..+ .-+.+++- |. .+.+++
T Consensus 71 s~~~~llAv~~~~------------K~~~~f~~~~~~~~~kl~~~~~v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s 138 (390)
T KOG3914|consen 71 SDSGRLVAVATSS------------KQRAVFDYRENPKGAKLLDVSCVPKRPTAISFIREDTSVLVADKAGDVYSFDILS 138 (390)
T ss_pred CCCceEEEEEeCC------------CceEEEEEecCCCcceeeeEeecccCcceeeeeeccceEEEEeecCCceeeeeec
Confidence 3566899999873 234677776653246777667777777777655 34555544 33 233344
Q ss_pred cCCceeeceeeecCccceEEEEE--EeCCEEEEeecCCcEEEEEEecc--CCeEEEeeccCCCcceEEEEeecCCeeeee
Q 000944 969 LGKKRLLRKCENKLFPNTIVSIN--TYRDRIYVGDIQESFHFCKYRRD--ENQLYIFADDSVPRWLTAAHHIDFDTMAGA 1044 (1213)
Q Consensus 969 ~~~~~l~~~~~~~~~~~~i~~l~--~~~~~I~vgD~~~Sv~~l~~~~~--~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~ 1044 (1213)
.+....... .- +-++++.+. ..+.+|+.+|--.=|.+.+|..- -..+-+= +.-+|....+.++..++-+
T Consensus 139 ~~~~~~~~~--lG-hvSml~dVavS~D~~~IitaDRDEkIRvs~ypa~f~IesfclG----H~eFVS~isl~~~~~LlS~ 211 (390)
T KOG3914|consen 139 ADSGRCEPI--LG-HVSMLLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESFCLG----HKEFVSTISLTDNYLLLSG 211 (390)
T ss_pred ccccCcchh--hh-hhhhhheeeecCCCCEEEEecCCceEEEEecCcccchhhhccc----cHhheeeeeeccCceeeec
Confidence 332111111 11 234555554 45689999999999999998421 1222222 2334666677777778888
Q ss_pred cCCCcEEEEecCCC
Q 000944 1045 DKFGNIYFVRLPQD 1058 (1213)
Q Consensus 1045 D~~gnl~il~~~~~ 1058 (1213)
--+++|+++++-..
T Consensus 212 sGD~tlr~Wd~~sg 225 (390)
T KOG3914|consen 212 SGDKTLRLWDITSG 225 (390)
T ss_pred CCCCcEEEEecccC
Confidence 88999999988653
No 182
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=42.58 E-value=3e+02 Score=31.55 Aligned_cols=93 Identities=17% Similarity=0.278 Sum_probs=54.6
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEE-
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHK- 939 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~- 939 (1213)
+|++.|-.+.......-.+..+.=.-+.+|. .+.++++-|-. .|.+.++++..-... +.+..
T Consensus 281 sIrIWDiRs~~~~~~~~~kAh~sDVNVISWn----r~~~lLasG~D------------dGt~~iwDLR~~~~~-~pVA~f 343 (440)
T KOG0302|consen 281 SIRIWDIRSGPKKAAVSTKAHNSDVNVISWN----RREPLLASGGD------------DGTLSIWDLRQFKSG-QPVATF 343 (440)
T ss_pred eEEEEEecCCCccceeEeeccCCceeeEEcc----CCcceeeecCC------------CceEEEEEhhhccCC-CcceeE
Confidence 6777777665322222223333322334443 34567776643 588899988753111 34444
Q ss_pred EeecCcceEeccc--cCeEEEEeC--CeEEEEecC
Q 000944 940 TQVEGIPLALCQF--QGRLLAGIG--PVLRLYDLG 970 (1213)
Q Consensus 940 ~~~~g~V~ai~~~--~g~ll~~~g--~~l~i~~~~ 970 (1213)
+-.++||++|.-- ....+++.| ++|.+|++.
T Consensus 344 k~Hk~pItsieW~p~e~s~iaasg~D~QitiWDls 378 (440)
T KOG0302|consen 344 KYHKAPITSIEWHPHEDSVIAASGEDNQITIWDLS 378 (440)
T ss_pred EeccCCeeEEEeccccCceEEeccCCCcEEEEEee
Confidence 3458999999654 466666666 489999885
No 183
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE.; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=42.03 E-value=4.8e+02 Score=28.18 Aligned_cols=114 Identities=17% Similarity=0.087 Sum_probs=69.7
Q ss_pred eecCcceEecc-ccCeEEEEeCCeEEEEecCCceeeceeeec--C-ccceEEEEEEe-CCEEEEeecCCcE-------EE
Q 000944 941 QVEGIPLALCQ-FQGRLLAGIGPVLRLYDLGKKRLLRKCENK--L-FPNTIVSINTY-RDRIYVGDIQESF-------HF 1008 (1213)
Q Consensus 941 ~~~g~V~ai~~-~~g~ll~~~g~~l~i~~~~~~~l~~~~~~~--~-~~~~i~~l~~~-~~~I~vgD~~~Sv-------~~ 1008 (1213)
...+|+..... -+|.|++|....+.++++...++...+... . -......+.+. +..++++|..... .+
T Consensus 38 ~~~~~~G~~~~~~~g~l~v~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v 117 (246)
T PF08450_consen 38 DLPGPNGMAFDRPDGRLYVADSGGIAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSV 117 (246)
T ss_dssp ESSSEEEEEEECTTSEEEEEETTCEEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEE
T ss_pred ecCCCceEEEEccCCEEEEEEcCceEEEecCCCcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccce
Confidence 34556655555 479999999999999998877665554441 1 12345566654 3559999987753 58
Q ss_pred EEEeccCCeEEEeeccCCCcceEEEEee-cCCeeeeecCCC-cEEEEecCC
Q 000944 1009 CKYRRDENQLYIFADDSVPRWLTAAHHI-DFDTMAGADKFG-NIYFVRLPQ 1057 (1213)
Q Consensus 1009 l~~~~~~~~l~~~a~D~~~~~~~~~~~l-d~~~~l~~D~~g-nl~il~~~~ 1057 (1213)
++++.+ .+...+..+... ...+.+- |++.+.++|... .|+.|.++.
T Consensus 118 ~~~~~~-~~~~~~~~~~~~--pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~ 165 (246)
T PF08450_consen 118 YRIDPD-GKVTVVADGLGF--PNGIAFSPDGKTLYVADSFNGRIWRFDLDA 165 (246)
T ss_dssp EEEETT-SEEEEEEEEESS--EEEEEEETTSSEEEEEETTTTEEEEEEEET
T ss_pred EEECCC-CeEEEEecCccc--ccceEECCcchheeecccccceeEEEeccc
Confidence 888877 666666655322 2222232 334566677744 566666653
No 184
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=41.56 E-value=6.3e+02 Score=29.47 Aligned_cols=73 Identities=16% Similarity=0.146 Sum_probs=43.5
Q ss_pred ccEEEEEec--CCEEEEEEeCCEEEEEEEccCCCeEEeeeec-cCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 540 RTIVKVGSN--RLQVVIALSGGELIYFEVDMTGQLLEVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 540 ~~I~~as~~--~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
.+|..|+.. +..++-+.-+|...+..... . ..+.++. =..++.++.+.|.. ...-++.|..||++.+|++
T Consensus 176 rPis~~~fS~ds~~laT~swsG~~kvW~~~~-~--~~~~~l~gH~~~v~~~~fhP~~----~~~~lat~s~Dgtvklw~~ 248 (459)
T KOG0272|consen 176 RPISGCSFSRDSKHLATGSWSGLVKVWSVPQ-C--NLLQTLRGHTSRVGAAVFHPVD----SDLNLATASADGTVKLWKL 248 (459)
T ss_pred CcceeeEeecCCCeEEEeecCCceeEeecCC-c--ceeEEEeccccceeeEEEccCC----CccceeeeccCCceeeecc
Confidence 456665554 34444444467777776552 2 1111111 13568888887753 2345777888999999999
Q ss_pred CCC
Q 000944 617 DPD 619 (1213)
Q Consensus 617 ~p~ 619 (1213)
+.+
T Consensus 249 ~~e 251 (459)
T KOG0272|consen 249 SQE 251 (459)
T ss_pred CCC
Confidence 654
No 185
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=41.12 E-value=1.4e+02 Score=34.10 Aligned_cols=121 Identities=12% Similarity=0.074 Sum_probs=68.3
Q ss_pred eEEEEEeCCCCceEEEEEcC-CCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 860 SCIRVLDPRSANTTCLLELQ-DNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~-~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
++|+++||++++.+-+ .|. .-..|++++-=.+.-...-.+++- ...-|-+.+|++.-. -.+..
T Consensus 179 g~I~lwdpktg~~~g~-~l~gH~K~It~Lawep~hl~p~~r~las------------~skDg~vrIWd~~~~---~~~~~ 242 (480)
T KOG0271|consen 179 GSIRLWDPKTGQQIGR-ALRGHKKWITALAWEPLHLVPPCRRLAS------------SSKDGSVRIWDTKLG---TCVRT 242 (480)
T ss_pred CeEEEecCCCCCcccc-cccCcccceeEEeecccccCCCccceec------------ccCCCCEEEEEccCc---eEEEE
Confidence 5899999999876532 122 123444443212211111122221 122466788887652 23333
Q ss_pred EEeecCcceEeccccCeEEEEeC--CeEEEEecCCceeeceeeecCccceEEEEEEeCCEEE
Q 000944 939 KTQVEGIPLALCQFQGRLLAGIG--PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIY 998 (1213)
Q Consensus 939 ~~~~~g~V~ai~~~~g~ll~~~g--~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~ 998 (1213)
..-...+|+|++.=+..|+++-. ..|++|+-.+.++.+.-.- +..-|..|....+|++
T Consensus 243 lsgHT~~VTCvrwGG~gliySgS~DrtIkvw~a~dG~~~r~lkG--HahwvN~lalsTdy~L 302 (480)
T KOG0271|consen 243 LSGHTASVTCVRWGGEGLIYSGSQDRTIKVWRALDGKLCRELKG--HAHWVNHLALSTDYVL 302 (480)
T ss_pred eccCccceEEEEEcCCceEEecCCCceEEEEEccchhHHHhhcc--cchheeeeeccchhhh
Confidence 34567899999987555555333 4799999988777554322 3456777776666655
No 186
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=40.62 E-value=1.1e+02 Score=38.11 Aligned_cols=134 Identities=16% Similarity=0.220 Sum_probs=0.0
Q ss_pred CccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCccccCCCCCCCCc
Q 000944 778 YTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDEQYGYPKAESDK 857 (1213)
Q Consensus 778 ~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 857 (1213)
++.+++.+|+...-+++..+..
T Consensus 134 Rs~~~ldfh~tep~iliSGSQD---------------------------------------------------------- 155 (839)
T KOG0269|consen 134 RSANKLDFHSTEPNILISGSQD---------------------------------------------------------- 155 (839)
T ss_pred cceeeeeeccCCccEEEecCCC----------------------------------------------------------
Q ss_pred eeeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEE
Q 000944 858 WVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELL 937 (1213)
Q Consensus 858 ~~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~ 937 (1213)
++++++|-....-..++.= ...|+.-|.|.- +...+++-+-. .|.+..|++.+. +.-.+
T Consensus 156 --g~vK~~DlR~~~S~~t~~~----nSESiRDV~fsp-~~~~~F~s~~d------------sG~lqlWDlRqp--~r~~~ 214 (839)
T KOG0269|consen 156 --GTVKCWDLRSKKSKSTFRS----NSESIRDVKFSP-GYGNKFASIHD------------SGYLQLWDLRQP--DRCEK 214 (839)
T ss_pred --ceEEEEeeecccccccccc----cchhhhceeecc-CCCceEEEecC------------CceEEEeeccCc--hhHHH
Q ss_pred EEEeecCcceEeccccCeEEEEeCC---eEEEEecCCceeeceeeecCccceEEEEE
Q 000944 938 HKTQVEGIPLALCQFQGRLLAGIGP---VLRLYDLGKKRLLRKCENKLFPNTIVSIN 991 (1213)
Q Consensus 938 ~~~~~~g~V~ai~~~~g~ll~~~g~---~l~i~~~~~~~l~~~~~~~~~~~~i~~l~ 991 (1213)
-.....|||+++.---++-..|.|. +++||++.+.+.-++-...+ ...+..++
T Consensus 215 k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~~~~tInT-iapv~rVk 270 (839)
T KOG0269|consen 215 KLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSRAKPKHTINT-IAPVGRVK 270 (839)
T ss_pred HhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCCccceeEEee-cceeeeee
No 187
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=40.53 E-value=8.6e+02 Score=30.70 Aligned_cols=173 Identities=15% Similarity=0.175 Sum_probs=97.7
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeC--C---ceE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEE--G---KSL 934 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~--~---~kl 934 (1213)
+.++++|-.+...+.+.+ .+|- +.|.+.....+..++-|.+-.. =+++-|++... | +.|
T Consensus 434 Gel~vfdlaS~~l~Eti~--AHdg----aIWsi~~~pD~~g~vT~saDkt----------VkfWdf~l~~~~~gt~~k~l 497 (888)
T KOG0306|consen 434 GELQVFDLASASLVETIR--AHDG----AIWSISLSPDNKGFVTGSADKT----------VKFWDFKLVVSVPGTQKKVL 497 (888)
T ss_pred CceEEEEeehhhhhhhhh--cccc----ceeeeeecCCCCceEEecCCcE----------EEEEeEEEEeccCcccceee
Confidence 356677665544443333 2221 2355544333445555554221 13333443332 2 126
Q ss_pred EEEEEE--eecCcceEeccc-cCeE-EEE-eCCeEEEEecCCceeeceeeecCccceEEEE--EEeCCEEEEeecCCcEE
Q 000944 935 ELLHKT--QVEGIPLALCQF-QGRL-LAG-IGPVLRLYDLGKKRLLRKCENKLFPNTIVSI--NTYRDRIYVGDIQESFH 1007 (1213)
Q Consensus 935 ~~~~~~--~~~g~V~ai~~~-~g~l-l~~-~g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l--~~~~~~I~vgD~~~Sv~ 1007 (1213)
++.++. +...-|.|+..- +|++ +++ .++++.+|-++.=++ ...-|- +-.++.+| +...+.|+.|-+-+.|-
T Consensus 498 sl~~~rtLel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKF-flsLYG-HkLPV~smDIS~DSklivTgSADKnVK 575 (888)
T KOG0306|consen 498 SLKHTRTLELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKF-FLSLYG-HKLPVLSMDISPDSKLIVTGSADKNVK 575 (888)
T ss_pred eeccceEEeccccEEEEEEcCCCcEEEEEeccCeEEEEEecceee-eeeecc-cccceeEEeccCCcCeEEeccCCCceE
Confidence 665554 557778777654 4554 444 478999999987665 222222 33445554 45678999999999999
Q ss_pred EEEEeccCCeEEEeeccCCCcceEEEEeecCCe-eeeecCCCcEEEE
Q 000944 1008 FCKYRRDENQLYIFADDSVPRWLTAAHHIDFDT-MAGADKFGNIYFV 1053 (1213)
Q Consensus 1008 ~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~-~l~~D~~gnl~il 1053 (1213)
++-.+-.+=.=..+|.| -.++++.|+..+. |+.+-++|-+.-+
T Consensus 576 iWGLdFGDCHKS~fAHd---DSvm~V~F~P~~~~FFt~gKD~kvKqW 619 (888)
T KOG0306|consen 576 IWGLDFGDCHKSFFAHD---DSVMSVQFLPKTHLFFTCGKDGKVKQW 619 (888)
T ss_pred Eeccccchhhhhhhccc---CceeEEEEcccceeEEEecCcceEEee
Confidence 98776544333444433 2467888887664 5677777776544
No 188
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=40.36 E-value=2.3e+02 Score=35.03 Aligned_cols=118 Identities=16% Similarity=0.248 Sum_probs=71.3
Q ss_pred CCccEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcce-EEEEeeecCCCceeeeEEEEEEeCCcEEEEEe
Q 000944 538 GKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDV-ACLDIASVPEGRKRSRFLAVGSYDNTIRILSL 616 (1213)
Q Consensus 538 ~~~~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~i-s~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl 616 (1213)
.....++.+..+.|+++..+.|.+++|.=. .|....+ +-.-..-+ +.+++.+ ...++|+|+.+|.|.++.+
T Consensus 34 ~~v~lTc~dst~~~l~~GsS~G~lyl~~R~-~~~~~~~-~~~~~~~~~~~~~vs~------~e~lvAagt~~g~V~v~ql 105 (726)
T KOG3621|consen 34 ARVKLTCVDATEEYLAMGSSAGSVYLYNRH-TGEMRKL-KNEGATGITCVRSVSS------VEYLVAAGTASGRVSVFQL 105 (726)
T ss_pred ceEEEEEeecCCceEEEecccceEEEEecC-chhhhcc-cccCccceEEEEEecc------hhHhhhhhcCCceEEeehh
Confidence 334578888889999999988988877422 1221111 11112223 3344432 4678999999999999988
Q ss_pred CCCCceeE--eEEeecCC-CCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 617 DPDDCMQI--LSVQSVSS-PPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 617 ~p~~~l~~--~~~~~l~~-~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
+.. +.+ +-..+.+. .+..+....+.. ....|+.|-.-|.+..-+++.
T Consensus 106 ~~~--~p~~~~~~t~~d~~~~~rVTal~Ws~-----------~~~k~ysGD~~Gkv~~~~L~s 155 (726)
T KOG3621|consen 106 NKE--LPRDLDYVTPCDKSHKCRVTALEWSK-----------NGMKLYSGDSQGKVVLTELDS 155 (726)
T ss_pred hcc--CCCcceeeccccccCCceEEEEEecc-----------cccEEeecCCCceEEEEEech
Confidence 431 211 11111111 245554455542 456789999999999999876
No 189
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=39.82 E-value=7.8e+02 Score=30.00 Aligned_cols=62 Identities=8% Similarity=0.145 Sum_probs=37.7
Q ss_pred ceeeEEEEEeCCCCceEEEEEcCCCce-----EEEEEEEEec--cCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEe
Q 000944 857 KWVSCIRVLDPRSANTTCLLELQDNEA-----AFSICTVNFH--DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVE 929 (1213)
Q Consensus 857 ~~~s~i~l~d~~~~~~~~~~~~~~~E~-----v~s~~~~~l~--~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~ 929 (1213)
.+.+.+.=+|.++++.+.+++..+... ..+....... +.....++++|+. .|+++.|+...
T Consensus 253 ~~~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~~------------~G~l~ald~~t 320 (488)
T cd00216 253 LYTDSIVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAPK------------NGFFYVLDRTT 320 (488)
T ss_pred CceeeEEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEECC------------CceEEEEECCC
Confidence 345688889999999999988754321 1111111111 1122356777763 58898888876
Q ss_pred C
Q 000944 930 E 930 (1213)
Q Consensus 930 ~ 930 (1213)
+
T Consensus 321 G 321 (488)
T cd00216 321 G 321 (488)
T ss_pred C
Confidence 5
No 190
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=39.48 E-value=7.9e+02 Score=30.01 Aligned_cols=101 Identities=18% Similarity=0.261 Sum_probs=70.5
Q ss_pred CeeEEEEEeCCCccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCcccccCCCCCCCCCCCCcc
Q 000944 767 ETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKYDPLSDE 846 (1213)
Q Consensus 767 ~~~~~r~i~l~~tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 846 (1213)
++..+.+||++..+.-.+.+|....++++|.+
T Consensus 249 qrvsvtsipL~s~v~~ca~sp~E~kLvlGC~D------------------------------------------------ 280 (545)
T PF11768_consen 249 QRVSVTSIPLPSQVICCARSPSEDKLVLGCED------------------------------------------------ 280 (545)
T ss_pred eEEEEEEEecCCcceEEecCcccceEEEEecC------------------------------------------------
Confidence 56789999999999999999988888888842
Q ss_pred ccCCCCCCCCceeeEEEEEeCCCCceE-EEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEE
Q 000944 847 QYGYPKAESDKWVSCIRVLDPRSANTT-CLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIY 925 (1213)
Q Consensus 847 ~~~~p~~~~~~~~s~i~l~d~~~~~~~-~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~ 925 (1213)
+++.++|.....+. ...+|-| ....|- ....+++||.. +|.|.+|
T Consensus 281 -------------gSiiLyD~~~~~t~~~ka~~~P-----~~iaWH----p~gai~~V~s~------------qGelQ~F 326 (545)
T PF11768_consen 281 -------------GSIILYDTTRGVTLLAKAEFIP-----TLIAWH----PDGAIFVVGSE------------QGELQCF 326 (545)
T ss_pred -------------CeEEEEEcCCCeeeeeeecccc-----eEEEEc----CCCcEEEEEcC------------CceEEEE
Confidence 26788887665332 2222322 334443 23579999985 8999999
Q ss_pred EEEeCCceEEEEEEEeecCcceEe
Q 000944 926 RFVEEGKSLELLHKTQVEGIPLAL 949 (1213)
Q Consensus 926 ~i~~~~~kl~~~~~~~~~g~V~ai 949 (1213)
++.-+.-+++++.+...+.++..+
T Consensus 327 D~ALspi~~qLlsEd~~P~~~L~L 350 (545)
T PF11768_consen 327 DMALSPIKMQLLSEDATPKSTLQL 350 (545)
T ss_pred EeecCccceeeccccCCCccEEee
Confidence 988765577777776556555443
No 191
>PF14779 BBS1: Ciliary BBSome complex subunit 1
Probab=39.41 E-value=1.2e+02 Score=33.13 Aligned_cols=70 Identities=16% Similarity=0.232 Sum_probs=47.9
Q ss_pred ceEEEEEEEEecc--CCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc------
Q 000944 882 EAAFSICTVNFHD--KEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ------ 953 (1213)
Q Consensus 882 E~v~s~~~~~l~~--~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~------ 953 (1213)
-.|+||.+++-.. .+....++|||- .|.||+++-. -.+++++..+++.+..|+..+
T Consensus 177 t~ITcm~tikk~~~d~~a~scLViGTE------------~~~i~iLd~~----af~il~~~~lpsvPv~i~~~G~~devd 240 (257)
T PF14779_consen 177 TVITCMATIKKSSADEDAVSCLVIGTE------------SGEIYILDPQ----AFTILKQVQLPSVPVFISVSGQYDEVD 240 (257)
T ss_pred ceeEEeeeecccccCCCCcceEEEEec------------CCeEEEECch----hheeEEEEecCCCceEEEEEeeeeccc
Confidence 3567888766443 234599999996 5788887743 577888888888887776543
Q ss_pred CeEEEEeC-CeEEEE
Q 000944 954 GRLLAGIG-PVLRLY 967 (1213)
Q Consensus 954 g~ll~~~g-~~l~i~ 967 (1213)
.+|++++- ++||+.
T Consensus 241 yRI~Va~Rdg~iy~i 255 (257)
T PF14779_consen 241 YRIVVACRDGKIYTI 255 (257)
T ss_pred eEEEEEeCCCEEEEE
Confidence 36666554 466643
No 192
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=38.43 E-value=7e+02 Score=29.06 Aligned_cols=273 Identities=14% Similarity=0.162 Sum_probs=121.9
Q ss_pred ceEEEeeeCCCCCCceEEEEEecCceeEE-Eec--cceeeecCCCccCCCCeEEEEeecCCe-EEEEeC-CcEEEEeCC-
Q 000944 455 SAVWTVKKNVNDEFDAYIVVSFNNATLVL-SIG--ETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHP-SGIRHIRED- 528 (1213)
Q Consensus 455 ~~iw~l~~~~~~~~~~~lvlS~~~~T~vl-~~~--~~~~e~~~~gf~~~~~Tl~a~~~~~~~-ivQVT~-~~i~l~~~~- 528 (1213)
..+|-+..+ .+.+||.-+..+.|.+. .+. ..++-. ..-.-...+.++..--.|.+ ++-.-. +.+++-+-.
T Consensus 225 dEVWfl~FS---~nGkyLAsaSkD~Taiiw~v~~d~~~kl~-~tlvgh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~t 300 (519)
T KOG0293|consen 225 DEVWFLQFS---HNGKYLASASKDSTAIIWIVVYDVHFKLK-KTLVGHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVDT 300 (519)
T ss_pred CcEEEEEEc---CCCeeEeeccCCceEEEEEEecCcceeee-eeeecccCceEEEEECCCCCeEEecCchHheeeccCCc
Confidence 568999884 45689998888888864 342 222111 00000111222222222222 111111 112332211
Q ss_pred -CceeeeeCCCCccEEEEEec-CCE-EEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEE
Q 000944 529 -GRINEWRTPGKRTIVKVGSN-RLQ-VVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVG 605 (1213)
Q Consensus 529 -~~~~~~~~~~~~~I~~as~~-~~~-v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~ 605 (1213)
.......-..|.+..+|+-+ |.+ ++....+++++...+| |.+..-=.-.-...|.++++.. ...++++.
T Consensus 301 gd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~i~~wdlD--gn~~~~W~gvr~~~v~dlait~------Dgk~vl~v 372 (519)
T KOG0293|consen 301 GDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRTIIMWDLD--GNILGNWEGVRDPKVHDLAITY------DGKYVLLV 372 (519)
T ss_pred chhhhhcccCcCCCcceeEEccCCceeEecCCCCcEEEecCC--cchhhcccccccceeEEEEEcC------CCcEEEEE
Confidence 11111111112345555544 443 3444456778777665 3221110001124578899865 35566666
Q ss_pred EeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccc
Q 000944 606 SYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSR 685 (1213)
Q Consensus 606 ~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~ 685 (1213)
+.|..+.+|+...-.....++.+ ..-.|..+.+ ..-+.++.+.+-.+..+.+.. ..+.
T Consensus 373 ~~d~~i~l~~~e~~~dr~lise~---~~its~~iS~--------------d~k~~LvnL~~qei~LWDl~e--~~lv--- 430 (519)
T KOG0293|consen 373 TVDKKIRLYNREARVDRGLISEE---QPITSFSISK--------------DGKLALVNLQDQEIHLWDLEE--NKLV--- 430 (519)
T ss_pred ecccceeeechhhhhhhcccccc---CceeEEEEcC--------------CCcEEEEEcccCeeEEeecch--hhHH---
Confidence 67889999987321000012211 1123433322 344666777777776665542 1111
Q ss_pred eeeecCCCCeEE-EEEECCee-EEEEecCcc--EEEE-EeCCeEEEEecCc--cccceeeccccCCCCceEEEEe-CCeE
Q 000944 686 SRFLGLRPPKLF-SVVVGGRA-AMLCLSSRP--WLGY-IHRGRFLLTPLSY--ETLEYAASFSSDQCVEGVVSVA-GNAL 757 (1213)
Q Consensus 686 ~~~lG~~pv~l~-~~~~~~~~-~v~~~g~~p--~~i~-~~~~~~~~~~~~~--~~v~~~~~f~~~~~~~~~i~~~-~~~L 757 (1213)
.++.|.+.-++. +.=++|.+ .+++.|+.- ..|+ ..+|++... ++. ..+++++- ++..|.-|+.++ ++++
T Consensus 431 ~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr~sgkll~~-LsGHs~~vNcVsw--NP~~p~m~ASasDDgtI 507 (519)
T KOG0293|consen 431 RKYFGHKQGHFIIRSCFGGGNDKFIASGSEDSKVYIWHRISGKLLAV-LSGHSKTVNCVSW--NPADPEMFASASDDGTI 507 (519)
T ss_pred HHhhcccccceEEEeccCCCCcceEEecCCCceEEEEEccCCceeEe-ecCCcceeeEEec--CCCCHHHhhccCCCCeE
Confidence 234554433332 11134444 566666443 3343 334544321 221 22333321 233355666554 5688
Q ss_pred EEEEEcc
Q 000944 758 RVFTIER 764 (1213)
Q Consensus 758 ~i~~l~~ 764 (1213)
+|=...+
T Consensus 508 RIWg~~~ 514 (519)
T KOG0293|consen 508 RIWGPSD 514 (519)
T ss_pred EEecCCc
Confidence 7765554
No 193
>KOG4328 consensus WD40 protein [Function unknown]
Probab=36.80 E-value=6.3e+02 Score=29.82 Aligned_cols=99 Identities=23% Similarity=0.317 Sum_probs=50.0
Q ss_pred eEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEE--EE
Q 000944 860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLE--LL 937 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~--~~ 937 (1213)
++|++.|-++..--.-+.++..+...|.. .+. ..+..++.|+.+ |.+.+++...++.... .+
T Consensus 257 GtiR~~D~~~~i~e~v~s~~~d~~~fs~~--d~~--~e~~~vl~~~~~------------G~f~~iD~R~~~s~~~~~~l 320 (498)
T KOG4328|consen 257 GTIRLQDFEGNISEEVLSLDTDNIWFSSL--DFS--AESRSVLFGDNV------------GNFNVIDLRTDGSEYENLRL 320 (498)
T ss_pred ceeeeeeecchhhHHHhhcCccceeeeec--ccc--CCCccEEEeecc------------cceEEEEeecCCccchhhhh
Confidence 57777776543211112233333333322 232 223455556543 4455666655443222 33
Q ss_pred EEEeecCcceEeccccCeEEEEeC--CeEEEEecCCceeecee
Q 000944 938 HKTQVEGIPLALCQFQGRLLAGIG--PVLRLYDLGKKRLLRKC 978 (1213)
Q Consensus 938 ~~~~~~g~V~ai~~~~g~ll~~~g--~~l~i~~~~~~~l~~~~ 978 (1213)
|++ +-.-.++.+++.+++++.| +...||++. +|..++
T Consensus 321 h~k--KI~sv~~NP~~p~~laT~s~D~T~kIWD~R--~l~~K~ 359 (498)
T KOG4328|consen 321 HKK--KITSVALNPVCPWFLATASLDQTAKIWDLR--QLRGKA 359 (498)
T ss_pred hhc--ccceeecCCCCchheeecccCcceeeeehh--hhcCCC
Confidence 443 2233456778888888666 467788874 344444
No 194
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=36.54 E-value=2e+02 Score=33.13 Aligned_cols=31 Identities=23% Similarity=0.518 Sum_probs=26.4
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
...||||+... ..+++++|+-+|+|.||...
T Consensus 281 ~~siSsl~VS~------dGkf~AlGT~dGsVai~~~~ 311 (398)
T KOG0771|consen 281 FKSISSLAVSD------DGKFLALGTMDGSVAIYDAK 311 (398)
T ss_pred cCcceeEEEcC------CCcEEEEeccCCcEEEEEec
Confidence 35799999964 57899999999999999883
No 195
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=35.88 E-value=8.8e+02 Score=29.49 Aligned_cols=134 Identities=10% Similarity=0.083 Sum_probs=82.9
Q ss_pred eEEEEEeCCCCceEEEEEcCCC----ceEEEEEEEEecc----------------CCCceEEEEEeeecCccCCCCCCcc
Q 000944 860 SCIRVLDPRSANTTCLLELQDN----EAAFSICTVNFHD----------------KEHGTLLAVGTAKGLQFWPKRNIVA 919 (1213)
Q Consensus 860 s~i~l~d~~~~~~~~~~~~~~~----E~v~s~~~~~l~~----------------~~~~~~i~VGT~~~~~~~~e~~~~~ 919 (1213)
+.+++++..+.+.-. +|.|. +.+++.. |-|.- +....++|.||. +
T Consensus 15 g~l~iw~t~~~~~~~--e~~p~~~~s~t~~~~~-w~L~~~~s~~k~~~~~~~~~~s~~t~~lvlgt~------------~ 79 (541)
T KOG4547|consen 15 GRLRIWDTAKNQLQQ--EFAPIASLSGTCTYTK-WGLSADYSPMKWLSLEKAKKASLDTSMLVLGTP------------Q 79 (541)
T ss_pred CeEEEEEccCceeee--eeccchhccCcceeEE-EEEEeccchHHHHhHHHHhhccCCceEEEeecC------------C
Confidence 368888876654433 44443 3433332 22321 234589999996 6
Q ss_pred cEEEEEEEEeCCceEEEEEE-EeecCcceEeccc-cCeEEEEeCCe--EEEEecCCceeeceeeecCccceEEEEEEeCC
Q 000944 920 GYIHIYRFVEEGKSLELLHK-TQVEGIPLALCQF-QGRLLAGIGPV--LRLYDLGKKRLLRKCENKLFPNTIVSINTYRD 995 (1213)
Q Consensus 920 Gri~v~~i~~~~~kl~~~~~-~~~~g~V~ai~~~-~g~ll~~~g~~--l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~ 995 (1213)
|-|++|.+..+ +++-..+ ....|+|.++..- +-..+.++|.. +..|.+...++.++...+ +..+.++....+
T Consensus 80 g~v~~ys~~~g--~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~--~~~~~sl~is~D 155 (541)
T KOG4547|consen 80 GSVLLYSVAGG--EITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQ--KPLVSSLCISPD 155 (541)
T ss_pred ccEEEEEecCC--eEEEEEecCCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccC--CCccceEEEcCC
Confidence 88999999876 5655544 4568999988744 33556677764 445666666665554433 335667776666
Q ss_pred EEEEeecCCcEEEEEEe
Q 000944 996 RIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 996 ~I~vgD~~~Sv~~l~~~ 1012 (1213)
-=+.+.+.+.+.++...
T Consensus 156 ~~~l~~as~~ik~~~~~ 172 (541)
T KOG4547|consen 156 GKILLTASRQIKVLDIE 172 (541)
T ss_pred CCEEEeccceEEEEEcc
Confidence 44556677888886554
No 196
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=35.62 E-value=7.4e+02 Score=28.56 Aligned_cols=72 Identities=17% Similarity=0.258 Sum_probs=42.6
Q ss_pred CCCccEEEEEecCC--EEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEE
Q 000944 537 PGKRTIVKVGSNRL--QVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRIL 614 (1213)
Q Consensus 537 ~~~~~I~~as~~~~--~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~ 614 (1213)
..+..|+++..... +++-...|.++.+..++.+-.+.... -=.+=|.|++..| ...+++.|+-||+|.+|
T Consensus 113 GH~e~Vl~~~fsp~g~~l~tGsGD~TvR~WD~~TeTp~~t~K--gH~~WVlcvawsP------Dgk~iASG~~dg~I~lw 184 (480)
T KOG0271|consen 113 GHGEAVLSVQFSPTGSRLVTGSGDTTVRLWDLDTETPLFTCK--GHKNWVLCVAWSP------DGKKIASGSKDGSIRLW 184 (480)
T ss_pred CCCCcEEEEEecCCCceEEecCCCceEEeeccCCCCcceeec--CCccEEEEEEECC------CcchhhccccCCeEEEe
Confidence 45566777777642 33322223356665554321111110 0123489999976 46899999999999999
Q ss_pred Ee
Q 000944 615 SL 616 (1213)
Q Consensus 615 sl 616 (1213)
+-
T Consensus 185 dp 186 (480)
T KOG0271|consen 185 DP 186 (480)
T ss_pred cC
Confidence 84
No 197
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=35.34 E-value=2.8e+02 Score=31.57 Aligned_cols=63 Identities=19% Similarity=0.239 Sum_probs=42.4
Q ss_pred eeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 598 RSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 598 ~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
...+++.|.||++|.+|.+...++|-.+.. ...=.+++++ ..+.-||+-...|+.|-.|.+..
T Consensus 303 ~~~~l~s~SrDktIk~wdv~tg~cL~tL~g--hdnwVr~~af--------------~p~Gkyi~ScaDDktlrvwdl~~ 365 (406)
T KOG0295|consen 303 GGQVLGSGSRDKTIKIWDVSTGMCLFTLVG--HDNWVRGVAF--------------SPGGKYILSCADDKTLRVWDLKN 365 (406)
T ss_pred CccEEEeecccceEEEEeccCCeEEEEEec--ccceeeeeEE--------------cCCCeEEEEEecCCcEEEEEecc
Confidence 457999999999999999954445433321 1111334333 22566898999999999998854
No 198
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=35.30 E-value=6.7e+02 Score=27.97 Aligned_cols=143 Identities=20% Similarity=0.281 Sum_probs=80.8
Q ss_pred CCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEecccc-CeEEEEeCC---eEEEEecCC
Q 000944 896 EHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ-GRLLAGIGP---VLRLYDLGK 971 (1213)
Q Consensus 896 ~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~-g~ll~~~g~---~l~i~~~~~ 971 (1213)
++...+-.||.+ ++..|++..+ +-...|+ ...+.|.++++.+ |--+++.|+ ++++|+...
T Consensus 101 d~s~i~S~gtDk-------------~v~~wD~~tG--~~~rk~k-~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~ 164 (338)
T KOG0265|consen 101 DGSHILSCGTDK-------------TVRGWDAETG--KRIRKHK-GHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRK 164 (338)
T ss_pred CCCEEEEecCCc-------------eEEEEecccc--eeeehhc-cccceeeecCccccCCeEEEecCCCceEEEEeecc
Confidence 445677777754 5778998875 2221221 2345667776653 445666665 799999986
Q ss_pred ceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEee--ccCCCcceEEEEeecCCeeeee-cCCC
Q 000944 972 KRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFA--DDSVPRWLTAAHHIDFDTMAGA-DKFG 1048 (1213)
Q Consensus 972 ~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a--~D~~~~~~~~~~~ld~~~~l~~-D~~g 1048 (1213)
+.-.+.-... .+..++..+-..+.++.|=+-.-+.+...+.. +-+..+. +|. ++.+...-++.++.+ -++.
T Consensus 165 k~~~~t~~~k-yqltAv~f~d~s~qv~sggIdn~ikvWd~r~~-d~~~~lsGh~Dt----It~lsls~~gs~llsnsMd~ 238 (338)
T KOG0265|consen 165 KEAIKTFENK-YQLTAVGFKDTSDQVISGGIDNDIKVWDLRKN-DGLYTLSGHADT----ITGLSLSRYGSFLLSNSMDN 238 (338)
T ss_pred cchhhccccc-eeEEEEEecccccceeeccccCceeeeccccC-cceEEeecccCc----eeeEEeccCCCccccccccc
Confidence 5443322112 34455566666778888877776666444322 2233332 333 455555555555444 4466
Q ss_pred cEEEEecCCCCC
Q 000944 1049 NIYFVRLPQDVS 1060 (1213)
Q Consensus 1049 nl~il~~~~~~~ 1060 (1213)
.+.++..-|-++
T Consensus 239 tvrvwd~rp~~p 250 (338)
T KOG0265|consen 239 TVRVWDVRPFAP 250 (338)
T ss_pred eEEEEEecccCC
Confidence 777777766543
No 199
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=34.89 E-value=2.1e+02 Score=38.11 Aligned_cols=102 Identities=18% Similarity=0.170 Sum_probs=62.1
Q ss_pred cCeEEEEeCCeEEEEecCC-ceeeceeeecCccceEEEEEE-----------eCCEEEEeecCCcEEEE-EEeccCCeEE
Q 000944 953 QGRLLAGIGPVLRLYDLGK-KRLLRKCENKLFPNTIVSINT-----------YRDRIYVGDIQESFHFC-KYRRDENQLY 1019 (1213)
Q Consensus 953 ~g~ll~~~g~~l~i~~~~~-~~l~~~~~~~~~~~~i~~l~~-----------~~~~I~vgD~~~Sv~~l-~~~~~~~~l~ 1019 (1213)
=++..++++++|++|++++ .++ +.++++...|..+.. .+..+++++.++=+.+- .+++..+.+.
T Consensus 90 I~RaWiTiDn~L~lWny~~~~e~---~~~d~~shtIl~V~LvkPkpgvFv~~IqhlLvvaT~~ei~ilgV~~~~~~~~~~ 166 (1311)
T KOG1900|consen 90 IGRAWITIDNNLFLWNYESDNEL---AEYDGLSHTILKVGLVKPKPGVFVPEIQHLLVVATPVEIVILGVSFDEFTGELS 166 (1311)
T ss_pred hcceEEEeCCeEEEEEcCCCCcc---ccccchhhhheeeeeecCCCCcchhhhheeEEecccceEEEEEEEeccccCccc
Confidence 3788999999999999986 334 445544445555543 24456777766632221 2233233332
Q ss_pred Eeecc----CCCcceEEEEeecCCeeeeecCCCcEEEEecCC
Q 000944 1020 IFADD----SVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1020 ~~a~D----~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
....- .....|.++..-+++.|+.+-++|||+=+.|..
T Consensus 167 ~f~~~~~i~~dg~~V~~I~~t~nGRIF~~G~dg~lyEl~Yq~ 208 (1311)
T KOG1900|consen 167 IFNTSFKISVDGVSVNCITYTENGRIFFAGRDGNLYELVYQA 208 (1311)
T ss_pred ccccceeeecCCceEEEEEeccCCcEEEeecCCCEEEEEEec
Confidence 22222 224445565556777888888888999888865
No 200
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=34.37 E-value=3.9e+02 Score=32.61 Aligned_cols=71 Identities=23% Similarity=0.309 Sum_probs=44.3
Q ss_pred EecCCEEEEEEe--CCEEEEEEEccCCCeEE--eeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCc
Q 000944 546 GSNRLQVVIALS--GGELIYFEVDMTGQLLE--VEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC 621 (1213)
Q Consensus 546 s~~~~~v~v~~s--~~~l~~l~~~~~~~l~~--~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~ 621 (1213)
+.+...++|-+. +|.|-+|++.+-|.|-. +..+.-..-|+-+...+++ ..-++|++.+|.|.+|.+ +.+.
T Consensus 588 can~~rvAVPL~g~gG~iai~el~~PGrLPDgv~p~l~Ngt~vtDl~WdPFD-----~~rLAVa~ddg~i~lWr~-~a~g 661 (1012)
T KOG1445|consen 588 CANNKRVAVPLAGSGGVIAIYELNEPGRLPDGVMPGLFNGTLVTDLHWDPFD-----DERLAVATDDGQINLWRL-TANG 661 (1012)
T ss_pred eeccceEEEEecCCCceEEEEEcCCCCCCCcccccccccCceeeecccCCCC-----hHHeeecccCceEEEEEe-ccCC
Confidence 345566666553 57888999876554321 1112222345666665543 456999999999999999 5444
Q ss_pred e
Q 000944 622 M 622 (1213)
Q Consensus 622 l 622 (1213)
+
T Consensus 662 l 662 (1012)
T KOG1445|consen 662 L 662 (1012)
T ss_pred C
Confidence 4
No 201
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=34.14 E-value=7.6e+02 Score=28.22 Aligned_cols=137 Identities=12% Similarity=0.142 Sum_probs=84.2
Q ss_pred ccEEEEEEEEeCCceEEEEEEEeecCcceEecccc-CeEEEEeC-CeEEEEecCCceeeceeeecCccceEEEEEEe-CC
Q 000944 919 AGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ-GRLLAGIG-PVLRLYDLGKKRLLRKCENKLFPNTIVSINTY-RD 995 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~-g~ll~~~g-~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~-~~ 995 (1213)
-+.|-+|++..+ +.+..+. ..+-.|.|+|--- .+++++.+ ..+.-|++..+.++.--+. ....+.+|.+. ++
T Consensus 298 D~tvrlWDl~ag-kt~~tlt--~hkksvral~lhP~e~~fASas~dnik~w~~p~g~f~~nlsg--h~~iintl~~nsD~ 372 (460)
T KOG0285|consen 298 DSTVRLWDLRAG-KTMITLT--HHKKSVRALCLHPKENLFASASPDNIKQWKLPEGEFLQNLSG--HNAIINTLSVNSDG 372 (460)
T ss_pred CceEEEeeeccC-ceeEeee--cccceeeEEecCCchhhhhccCCccceeccCCccchhhcccc--ccceeeeeeeccCc
Confidence 356778887765 2333332 2456677777653 34555444 4677888887766554222 35578888886 56
Q ss_pred EEEEeecCCcEEEEEEeccCCeEEEeeccCCCc------ceEEEEee-cCCeeeeecCCCcEEEEecCCCCCc
Q 000944 996 RIYVGDIQESFHFCKYRRDENQLYIFADDSVPR------WLTAAHHI-DFDTMAGADKFGNIYFVRLPQDVSD 1061 (1213)
Q Consensus 996 ~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~------~~~~~~~l-d~~~~l~~D~~gnl~il~~~~~~~~ 1061 (1213)
++++|--.-++.|.+|... ..+.....-.+|- .+++..|= ....++.+..+.+|-+|+.+..+++
T Consensus 373 v~~~G~dng~~~fwdwksg-~nyQ~~~t~vqpGSl~sEagI~as~fDktg~rlit~eadKtIk~~keDe~aT~ 444 (460)
T KOG0285|consen 373 VLVSGGDNGSIMFWDWKSG-HNYQRGQTIVQPGSLESEAGIFASCFDKTGSRLITGEADKTIKMYKEDEHATE 444 (460)
T ss_pred eEEEcCCceEEEEEecCcC-cccccccccccCCccccccceeEEeecccCceEEeccCCcceEEEecccccCc
Confidence 7777777778999888753 2222221112222 23443331 2236899999999999999887765
No 202
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=33.56 E-value=1.4e+02 Score=33.00 Aligned_cols=65 Identities=15% Similarity=0.262 Sum_probs=50.8
Q ss_pred CceEEEEeCC------EEEEEeecCCC-CeEEEEEEEeee-eeeEeeEEeeCCCCeeEEEEEeccceEEEEEEe
Q 000944 27 TPEIVVARGK------VLELLRPENSG-RIETLVSTEIFG-AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYN 92 (1213)
Q Consensus 27 ~~~LVv~k~~------~Levy~i~~~g-~L~~v~~~~l~g-~I~~i~~~r~~~~~~d~L~v~~~~~~l~il~~d 92 (1213)
++.|++.-.+ ...||+.+++| +..-+++.+..+ .|++|.--...|...+.|.+.+++| +-|+..-
T Consensus 184 ~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~lAvA~kDg-v~I~~v~ 256 (361)
T KOG2445|consen 184 EPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLLAVATKDG-VRIFKVK 256 (361)
T ss_pred CceEEEEcccCCccccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeEEEeecCc-EEEEEEe
Confidence 3456666566 89999999888 788888888765 6888886666778889999999998 7776554
No 203
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=33.52 E-value=4.4e+02 Score=32.37 Aligned_cols=139 Identities=17% Similarity=0.108 Sum_probs=73.9
Q ss_pred ccEEEEEEEEeCCceEE---EEEEEeecCcceEeccccC--eEEEEeC-CeEEEEecCCceeeceeeecCccceEEEE--
Q 000944 919 AGYIHIYRFVEEGKSLE---LLHKTQVEGIPLALCQFQG--RLLAGIG-PVLRLYDLGKKRLLRKCENKLFPNTIVSI-- 990 (1213)
Q Consensus 919 ~Gri~v~~i~~~~~kl~---~~~~~~~~g~V~ai~~~~g--~ll~~~g-~~l~i~~~~~~~l~~~~~~~~~~~~i~~l-- 990 (1213)
.|.|.+|+..+-..+++ +..-.-..++|..+....| .||.+.| +.+++|+....++.+...+-.+.--+-++
T Consensus 73 ~G~i~l~dt~~~~fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf 152 (720)
T KOG0321|consen 73 DGGIILFDTKSIVFRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECF 152 (720)
T ss_pred CCceeeecchhhhcchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeeeeeccceeecceeecccccccchhhh
Confidence 36677777665322222 2222334688888888765 4566777 58999999988877662221111011111
Q ss_pred ----------EEeCCEEEEeecCCcE--EEEEE--------eccCCeEEEeec---------cCCCcceEEEEeecCCee
Q 000944 991 ----------NTYRDRIYVGDIQESF--HFCKY--------RRDENQLYIFAD---------DSVPRWLTAAHHIDFDTM 1041 (1213)
Q Consensus 991 ----------~~~~~~I~vgD~~~Sv--~~l~~--------~~~~~~l~~~a~---------D~~~~~~~~~~~ld~~~~ 1041 (1213)
--.++-|++-|++-.. .+.+| ++.+.-....++ -...-+||.+.|.|++++
T Consensus 153 ~~~n~~vF~tGgRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~fkDe~tl 232 (720)
T KOG0321|consen 153 MPTNPAVFCTGGRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLFKDESTL 232 (720)
T ss_pred ccCCCcceeeccCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhccccccccccCceeeeeEEEEEecccee
Confidence 1235567777776654 11111 111001111111 111123666678898888
Q ss_pred eeecC-CCcEEEEecCC
Q 000944 1042 AGADK-FGNIYFVRLPQ 1057 (1213)
Q Consensus 1042 l~~D~-~gnl~il~~~~ 1057 (1213)
+.+-. ++-|.++++-.
T Consensus 233 aSaga~D~~iKVWDLRk 249 (720)
T KOG0321|consen 233 ASAGAADSTIKVWDLRK 249 (720)
T ss_pred eeccCCCcceEEEeecc
Confidence 65554 88898887643
No 204
>COG1654 BirA Biotin operon repressor [Transcription]
Probab=33.12 E-value=51 Score=28.90 Aligned_cols=26 Identities=23% Similarity=0.400 Sum_probs=23.6
Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHh
Q 000944 1185 DLQRKIADELDRTPGEILKKLEEIRN 1210 (1213)
Q Consensus 1185 ~~q~~i~~~~~~~~~~i~~~l~~l~~ 1210 (1213)
....++++++|+++..|++.++.|+.
T Consensus 20 ~SGe~La~~LgiSRtaVwK~Iq~Lr~ 45 (79)
T COG1654 20 VSGEKLAEELGISRTAVWKHIQQLRE 45 (79)
T ss_pred ccHHHHHHHHCccHHHHHHHHHHHHH
Confidence 45689999999999999999999985
No 205
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=32.44 E-value=1.1e+03 Score=29.59 Aligned_cols=96 Identities=10% Similarity=0.144 Sum_probs=64.4
Q ss_pred EEeCCeEEEEecCCceeece--eeecCccc-eEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEEEeeccC-CCcceEEE
Q 000944 958 AGIGPVLRLYDLGKKRLLRK--CENKLFPN-TIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDS-VPRWLTAA 1033 (1213)
Q Consensus 958 ~~~g~~l~i~~~~~~~l~~~--~~~~~~~~-~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~-~~~~~~~~ 1033 (1213)
+++..+|.||+.+..|..+. ...+.=+. .=+.++..+.||...=.-|-+.|+.|-.. +.+|+-+ +.--||-+
T Consensus 614 ~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lDPSgiY~atScsdktl~~~Df~sg----EcvA~m~GHsE~VTG~ 689 (1080)
T KOG1408|consen 614 VCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILDPSGIYLATSCSDKTLCFVDFVSG----ECVAQMTGHSEAVTGV 689 (1080)
T ss_pred EecccceEEEeccccceeeeecccccCCCceEEEEECCCccEEEEeecCCceEEEEeccc----hhhhhhcCcchheeee
Confidence 35556899999987665431 22221122 23456677889988888889999888643 3344444 33446777
Q ss_pred EeecC-CeeeeecCCCcEEEEecCC
Q 000944 1034 HHIDF-DTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1034 ~~ld~-~~~l~~D~~gnl~il~~~~ 1057 (1213)
.|+.+ ..+|..-.+|.||+++++.
T Consensus 690 kF~nDCkHlISvsgDgCIFvW~lp~ 714 (1080)
T KOG1408|consen 690 KFLNDCKHLISVSGDGCIFVWKLPL 714 (1080)
T ss_pred eecccchhheeecCCceEEEEECch
Confidence 78765 4788888999999999764
No 206
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=32.18 E-value=7.6e+02 Score=27.62 Aligned_cols=132 Identities=13% Similarity=0.040 Sum_probs=87.5
Q ss_pred EEEEEeCC--ceEEEEEEEeecCcceEeccccCeEEEEeCC-eEEEEecCCce-eeceeeecCccceEEEEEEeCCEEEE
Q 000944 924 IYRFVEEG--KSLELLHKTQVEGIPLALCQFQGRLLAGIGP-VLRLYDLGKKR-LLRKCENKLFPNTIVSINTYRDRIYV 999 (1213)
Q Consensus 924 v~~i~~~~--~kl~~~~~~~~~g~V~ai~~~~g~ll~~~g~-~l~i~~~~~~~-l~~~~~~~~~~~~i~~l~~~~~~I~v 999 (1213)
++++-... ....++.....++-+.-+..-..+..+|=++ -|+++++.... =..+.++.+ .-+.-...+.+|+.+|
T Consensus 65 i~ditn~~~~t~~~l~~~i~~~~l~~Dv~vse~yvyvad~ssGL~IvDIS~P~sP~~~~~lnt-~gyaygv~vsGn~aYV 143 (370)
T COG5276 65 ILDITNVSLQTHDVLLSVINARDLFADVRVSEEYVYVADWSSGLRIVDISTPDSPTLIGFLNT-DGYAYGVYVSGNYAYV 143 (370)
T ss_pred eccccCcccccCcceEEEEehhhhhheeEecccEEEEEcCCCceEEEeccCCCCcceeccccC-CceEEEEEecCCEEEE
Confidence 55554431 1233444444555555555556677777665 58888886432 222345552 3377778889999999
Q ss_pred eecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCCeeeeecCCCcEEEEecCC
Q 000944 1000 GDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1000 gD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~~ 1057 (1213)
+|+-.++-++... ++.+=.+.+|=..+-|-+.-..++.+.-.+++.++.|.+++.+.
T Consensus 144 adlddgfLivdvs-dpssP~lagrya~~~~d~~~v~ISGn~AYvA~~d~GL~ivDVSn 200 (370)
T COG5276 144 ADLDDGFLIVDVS-DPSSPQLAGRYALPGGDTHDVAISGNYAYVAWRDGGLTIVDVSN 200 (370)
T ss_pred eeccCcEEEEECC-CCCCceeeeeeccCCCCceeEEEecCeEEEEEeCCCeEEEEccC
Confidence 9999998776653 35555666776667676655568888888899999999998764
No 207
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=31.56 E-value=3.9e+02 Score=29.66 Aligned_cols=102 Identities=13% Similarity=0.122 Sum_probs=56.8
Q ss_pred EEEEeCCEEEEEEEccC---CCeEEeeeeccCc-ceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEe
Q 000944 553 VIALSGGELIYFEVDMT---GQLLEVEKHEMSG-DVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQ 628 (1213)
Q Consensus 553 ~v~~s~~~l~~l~~~~~---~~l~~~~~~~l~~-~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~ 628 (1213)
+.+-..|+|.+++.+.. ..|..+..+.+.. +..|+++.+. +.-++++..+|++.+.+. .+..++.+.
T Consensus 89 ~~a~a~G~i~~~r~~~~~ss~~L~~ls~~ki~~~~~lslD~~~~------~~~i~vs~s~G~~~~v~~-t~~~le~vq-- 159 (339)
T KOG0280|consen 89 LDAHARGQIQLYRNDEDESSVHLRGLSSKKISVVEALSLDISTS------GTKIFVSDSRGSISGVYE-TEMVLEKVQ-- 159 (339)
T ss_pred eeccccceEEEEeeccceeeeeecccchhhhhheeeeEEEeecc------CceEEEEcCCCcEEEEec-ceeeeeecc--
Confidence 33446788888887743 2344443333332 2456666542 333888888898886665 332333221
Q ss_pred ecCCCCceeEE--EEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 629 SVSSPPESLLF--LEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 629 ~l~~~p~Sl~~--~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
.+ -++.... .++. ......++-|-.||.|..+.+.
T Consensus 160 ~w--k~He~E~Wta~f~----------~~~pnlvytGgDD~~l~~~D~R 196 (339)
T KOG0280|consen 160 TW--KVHEFEAWTAKFS----------DKEPNLVYTGGDDGSLSCWDIR 196 (339)
T ss_pred cc--cccceeeeeeecc----------cCCCceEEecCCCceEEEEEec
Confidence 11 1222221 2222 2234678899999999998875
No 208
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=31.55 E-value=2.8e+02 Score=32.33 Aligned_cols=81 Identities=17% Similarity=0.386 Sum_probs=54.3
Q ss_pred CceeeeeCCCCccEEEEEec---CCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEE
Q 000944 529 GRINEWRTPGKRTIVKVGSN---RLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVG 605 (1213)
Q Consensus 529 ~~~~~~~~~~~~~I~~as~~---~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~ 605 (1213)
..-..|++..+ |-..+.+ ....++.+.+|.+.+|.+...|+..- ....=+.+|++|++... ...+++.+
T Consensus 321 ~s~~~wk~~g~--VEkv~w~~~se~~f~~~tddG~v~~~D~R~~~~~vw-t~~AHd~~ISgl~~n~~-----~p~~l~t~ 392 (463)
T KOG0270|consen 321 NSGKEWKFDGE--VEKVAWDPHSENSFFVSTDDGTVYYFDIRNPGKPVW-TLKAHDDEISGLSVNIQ-----TPGLLSTA 392 (463)
T ss_pred ccCceEEeccc--eEEEEecCCCceeEEEecCCceEEeeecCCCCCcee-EEEeccCCcceEEecCC-----CCcceeec
Confidence 56778988644 5555543 34566667789999998876653211 11222468999999643 34567777
Q ss_pred EeCCcEEEEEeC
Q 000944 606 SYDNTIRILSLD 617 (1213)
Q Consensus 606 ~~~~~i~i~sl~ 617 (1213)
..++.+.+|.++
T Consensus 393 s~d~~Vklw~~~ 404 (463)
T KOG0270|consen 393 STDKVVKLWKFD 404 (463)
T ss_pred cccceEEEEeec
Confidence 789999999994
No 209
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=31.49 E-value=4e+02 Score=29.59 Aligned_cols=127 Identities=15% Similarity=0.159 Sum_probs=66.0
Q ss_pred CceeeeeCCCCccEEEEEec--CCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEE
Q 000944 529 GRINEWRTPGKRTIVKVGSN--RLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGS 606 (1213)
Q Consensus 529 ~~~~~~~~~~~~~I~~as~~--~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~ 606 (1213)
+.+.+++. ..+-|..|... +.+++-+.++|++.+........+..+....-+..|-.+-+.| +....++||-
T Consensus 339 K~LKEfrG-HsSyvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~P-----Knpeh~iVCN 412 (508)
T KOG0275|consen 339 KCLKEFRG-HSSYVNEATFTDDGHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLP-----KNPEHFIVCN 412 (508)
T ss_pred hhHHHhcC-ccccccceEEcCCCCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcC-----CCCceEEEEc
Confidence 44444442 22346666654 3466666678888887655332233222222233344444433 2355677888
Q ss_pred eCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 607 YDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 607 ~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
.+++++|..+... -....+...-. .-.. +... . .....+++|.-.||.|+.|...
T Consensus 413 rsntv~imn~qGQ-vVrsfsSGkRE--gGdF-i~~~-l---------SpkGewiYcigED~vlYCF~~~ 467 (508)
T KOG0275|consen 413 RSNTVYIMNMQGQ-VVRSFSSGKRE--GGDF-INAI-L---------SPKGEWIYCIGEDGVLYCFSVL 467 (508)
T ss_pred CCCeEEEEeccce-EEeeeccCCcc--CCce-EEEE-e---------cCCCcEEEEEccCcEEEEEEee
Confidence 8899999988321 11111111100 0111 1111 1 1245688899999999888764
No 210
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=30.45 E-value=70 Score=35.17 Aligned_cols=61 Identities=18% Similarity=0.262 Sum_probs=40.4
Q ss_pred ccEEEEEEEEeC------------Cc--eEEEEEEEeecCcceEecccc-----CeEEEEeCCeEEEEecCCceeeceee
Q 000944 919 AGYIHIYRFVEE------------GK--SLELLHKTQVEGIPLALCQFQ-----GRLLAGIGPVLRLYDLGKKRLLRKCE 979 (1213)
Q Consensus 919 ~Gri~v~~i~~~------------~~--kl~~~~~~~~~g~V~ai~~~~-----g~ll~~~g~~l~i~~~~~~~l~~~~~ 979 (1213)
.||+.+|+-.+. ++ .+..+.+.++...|.+|..++ .+|+++....|++|.+-++.|..++.
T Consensus 47 gGRVvlfer~~s~~ceykf~teFQshe~EFDYLkSleieEKin~I~w~~~t~r~hFLlstNdktiKlWKiyeknlk~va~ 126 (460)
T COG5170 47 GGRVVLFEREKSYGCEYKFFTEFQSHELEFDYLKSLEIEEKINAIEWFDDTGRNHFLLSTNDKTIKLWKIYEKNLKVVAE 126 (460)
T ss_pred CceEEEeecccccccchhhhhhhcccccchhhhhhccHHHHhhheeeecCCCcceEEEecCCceeeeeeeecccchhhhc
Confidence 589888886552 11 233445566677788887774 35666667789999998776544443
No 211
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=30.04 E-value=9.3e+02 Score=27.97 Aligned_cols=74 Identities=18% Similarity=0.257 Sum_probs=42.0
Q ss_pred EEEEecCCEEEEEEe---CCEEEEEEEccCCCeEEeeeeccCcc--eEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 543 VKVGSNRLQVVIALS---GGELIYFEVDMTGQLLEVEKHEMSGD--VACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 543 ~~as~~~~~v~v~~s---~~~l~~l~~~~~~~l~~~~~~~l~~~--is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
..|+++..||+|... +|.+.++-+...|++..--.+--.+. +.-++..+. ....++.|..|.++.||.+
T Consensus 38 ~fcavNPkfiAvi~easgGgaf~ViPl~k~Gr~d~~~P~v~GHt~~vLDi~w~Pf-----nD~vIASgSeD~~v~vW~I- 111 (472)
T KOG0303|consen 38 SFCAVNPKFVAVIIEASGGGAFLVIPLVKTGRMDASYPLVCGHTAPVLDIDWCPF-----NDCVIASGSEDTKVMVWQI- 111 (472)
T ss_pred cccccCCceEEEEEecCCCcceeecccccccccCCCCCCccCccccccccccCcc-----CCceeecCCCCceEEEEEC-
Confidence 467778888887653 23555544443343321111111222 233344443 3567888888999999999
Q ss_pred CCCce
Q 000944 618 PDDCM 622 (1213)
Q Consensus 618 p~~~l 622 (1213)
|+..|
T Consensus 112 Pe~~l 116 (472)
T KOG0303|consen 112 PENGL 116 (472)
T ss_pred CCccc
Confidence 87655
No 212
>PLN02153 epithiospecifier protein
Probab=29.66 E-value=9e+02 Score=27.68 Aligned_cols=136 Identities=11% Similarity=0.064 Sum_probs=69.3
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEee-cCc----ceEeccccCeEEEEeC-----------
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQV-EGI----PLALCQFQGRLLAGIG----------- 961 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~-~g~----V~ai~~~~g~ll~~~g----------- 961 (1213)
..++++|-.. +. .....++.|+.... +.+.+..... .+| -.+++.++++|++.-|
T Consensus 86 ~~iyv~GG~~------~~-~~~~~v~~yd~~t~--~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~ 156 (341)
T PLN02153 86 TKLYIFGGRD------EK-REFSDFYSYDTVKN--EWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPE 156 (341)
T ss_pred CEEEEECCCC------CC-CccCcEEEEECCCC--EEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCc
Confidence 4678887532 11 12235777877655 4554443211 111 2345567788766433
Q ss_pred --CeEEEEecCCceeeceeeecC--ccceEEEEEEeCCEEEE-eecC-----------CcEEEEEEeccCCeEEEeec--
Q 000944 962 --PVLRLYDLGKKRLLRKCENKL--FPNTIVSINTYRDRIYV-GDIQ-----------ESFHFCKYRRDENQLYIFAD-- 1023 (1213)
Q Consensus 962 --~~l~i~~~~~~~l~~~~~~~~--~~~~i~~l~~~~~~I~v-gD~~-----------~Sv~~l~~~~~~~~l~~~a~-- 1023 (1213)
+.+.+|++...++........ .+-...++.+.++.|+| |-.. ..-.+..|+.+.++-..++.
T Consensus 157 ~~~~v~~yd~~~~~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g 236 (341)
T PLN02153 157 RFRTIEAYNIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTG 236 (341)
T ss_pred ccceEEEEECCCCeEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccC
Confidence 146778887766654332110 01112344556766654 3211 11235567777777777653
Q ss_pred -cCCCcceEEEEeecCCeee
Q 000944 1024 -DSVPRWLTAAHHIDFDTMA 1042 (1213)
Q Consensus 1024 -D~~~~~~~~~~~ld~~~~l 1042 (1213)
-+.+|...++..+++.-++
T Consensus 237 ~~P~~r~~~~~~~~~~~iyv 256 (341)
T PLN02153 237 AKPSARSVFAHAVVGKYIII 256 (341)
T ss_pred CCCCCcceeeeEEECCEEEE
Confidence 2456665555566644343
No 213
>PF08728 CRT10: CRT10; InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes. RNR catalyses the rate-limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance.
Probab=28.94 E-value=1.5e+02 Score=37.33 Aligned_cols=63 Identities=11% Similarity=0.217 Sum_probs=48.0
Q ss_pred CCEEEEeecCCcEEEEEEeccCCeEEEeeccCCCcceEEEEeecCC-------eeeeecCCCcEEEEecC
Q 000944 994 RDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD-------TMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 994 ~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~-------~~l~~D~~gnl~il~~~ 1056 (1213)
-++|+||+=..+|+++.|...+.+............+-++.|++.+ .++++|..|++++++..
T Consensus 177 ~rlIAVSsNs~~VTVFaf~l~~~r~~~~~s~~~~hNIP~VSFl~~~~d~~G~v~v~a~dI~G~v~~~~I~ 246 (717)
T PF08728_consen 177 SRLIAVSSNSQEVTVFAFALVDERFYHVPSHQHSHNIPNVSFLDDDLDPNGHVKVVATDISGEVWTFKIK 246 (717)
T ss_pred ceEEEEecCCceEEEEEEeccccccccccccccccCCCeeEeecCCCCCccceEEEEEeccCcEEEEEEE
Confidence 5889999999999999998755444443333455666788888665 46789999999998873
No 214
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=28.81 E-value=2.5e+02 Score=34.12 Aligned_cols=63 Identities=24% Similarity=0.412 Sum_probs=44.7
Q ss_pred CCEEEEEEeCCEEEEEEEccCCCeEEe-----eeecc-CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 549 RLQVVIALSGGELIYFEVDMTGQLLEV-----EKHEM-SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 549 ~~~v~v~~s~~~l~~l~~~~~~~l~~~-----~~~~l-~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
+..++|+..+|.|-+.++..+| +.+. ..+.. ...|+++-+.++ .+.+++++.+|.+|++|.|.
T Consensus 640 ~~rLAVa~ddg~i~lWr~~a~g-l~e~~~tPe~~lt~h~eKI~slRfHPL-----AadvLa~asyd~Ti~lWDl~ 708 (1012)
T KOG1445|consen 640 DERLAVATDDGQINLWRLTANG-LPENEMTPEKILTIHGEKITSLRFHPL-----AADVLAVASYDSTIELWDLA 708 (1012)
T ss_pred hHHeeecccCceEEEEEeccCC-CCcccCCcceeeecccceEEEEEecch-----hhhHhhhhhccceeeeeehh
Confidence 4678889899999999887654 3322 11222 245777776654 47889999999999999994
No 215
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=28.78 E-value=8.2e+02 Score=26.92 Aligned_cols=93 Identities=19% Similarity=0.245 Sum_probs=57.2
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEeecCcceEeccccCe--EEEEeC-CeEEEEecCCcee
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGR--LLAGIG-PVLRLYDLGKKRL 974 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~~g~--ll~~~g-~~l~i~~~~~~~l 974 (1213)
.++|+-|-. .|.|-.|++..+ +.-+-...+....+..|+...++ ++.+.- ..-++++...-..
T Consensus 159 ~~~ii~Ghe------------~G~is~~da~~g--~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~tl~v 224 (327)
T KOG0643|consen 159 GETIIAGHE------------DGSISIYDARTG--KELVDSDEEHSSKINDLQFSRDRTYFITGSKDTTAKLVDVRTLEV 224 (327)
T ss_pred CCEEEEecC------------CCcEEEEEcccC--ceeeechhhhccccccccccCCcceEEecccCccceeeeccceee
Confidence 477777743 588888998764 22222334555677777777653 344332 3556777765555
Q ss_pred eceeeecCccceEEEEEEeCCEEEEeecCCc
Q 000944 975 LRKCENKLFPNTIVSINTYRDRIYVGDIQES 1005 (1213)
Q Consensus 975 ~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~S 1005 (1213)
++.-..+ .|....+|....+.+++|-=++.
T Consensus 225 ~Kty~te-~PvN~aaisP~~d~VilgGGqeA 254 (327)
T KOG0643|consen 225 LKTYTTE-RPVNTAAISPLLDHVILGGGQEA 254 (327)
T ss_pred EEEeeec-ccccceecccccceEEecCCcee
Confidence 4443333 46677788888898888755444
No 216
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=28.36 E-value=5.6e+02 Score=29.40 Aligned_cols=127 Identities=12% Similarity=0.051 Sum_probs=61.0
Q ss_pred eEEEEEeCCC--CceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEE
Q 000944 860 SCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELL 937 (1213)
Q Consensus 860 s~i~l~d~~~--~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~ 937 (1213)
..+..+||.+ |+.+......+.. ....+.+ +..++++|-... .....-.++.|+++.+..+.+.+
T Consensus 168 ~~v~~YDp~t~~W~~~~~~p~~~r~---~~~~~~~----~~~iyv~GG~~~------~~~~~~~~~~y~~~~~~~~W~~~ 234 (346)
T TIGR03547 168 KNVLSYDPSTNQWRNLGENPFLGTA---GSAIVHK----GNKLLLINGEIK------PGLRTAEVKQYLFTGGKLEWNKL 234 (346)
T ss_pred ceEEEEECCCCceeECccCCCCcCC---CceEEEE----CCEEEEEeeeeC------CCccchheEEEEecCCCceeeec
Confidence 4788889865 6655433221111 1112223 246888886431 11122335566665442345444
Q ss_pred EEEeecC-----cc--eEeccccCeEEEEeCC------------------------eEEEEecCCceeeceeeecCccce
Q 000944 938 HKTQVEG-----IP--LALCQFQGRLLAGIGP------------------------VLRLYDLGKKRLLRKCENKLFPNT 986 (1213)
Q Consensus 938 ~~~~~~g-----~V--~ai~~~~g~ll~~~g~------------------------~l~i~~~~~~~l~~~~~~~~~~~~ 986 (1213)
.....+. .. .+.+.++|+|.+.-|. .+.+|+.+..++...+.+. .+..
T Consensus 235 ~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~lp-~~~~ 313 (346)
T TIGR03547 235 PPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKLP-QGLA 313 (346)
T ss_pred CCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCCC-CCce
Confidence 4332211 11 1245578888765542 3567888877776655443 1222
Q ss_pred EEEEEEeCCEEE-Ee
Q 000944 987 IVSINTYRDRIY-VG 1000 (1213)
Q Consensus 987 i~~l~~~~~~I~-vg 1000 (1213)
-.+..+.++.|+ +|
T Consensus 314 ~~~~~~~~~~iyv~G 328 (346)
T TIGR03547 314 YGVSVSWNNGVLLIG 328 (346)
T ss_pred eeEEEEcCCEEEEEe
Confidence 223334455544 44
No 217
>PTZ00420 coronin; Provisional
Probab=28.26 E-value=1.2e+03 Score=28.90 Aligned_cols=68 Identities=22% Similarity=0.225 Sum_probs=38.8
Q ss_pred EEecCCEEEEEE--e-CCEEEEEEEccCCCeEEeeeec-cCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 545 VGSNRLQVVIAL--S-GGELIYFEVDMTGQLLEVEKHE-MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 545 as~~~~~v~v~~--s-~~~l~~l~~~~~~~l~~~~~~~-l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
.++++.++++.. + +|.+-.+.+...++...+..+. -...|.+++..+. ...+++.|..|++|.||.+.
T Consensus 34 ia~n~~~~A~~w~~~gGG~~gvI~L~~~~r~~~v~~L~gH~~~V~~lafsP~-----~~~lLASgS~DgtIrIWDi~ 105 (568)
T PTZ00420 34 IACSSGFVAVPWEVEGGGLIGAIRLENQMRKPPVIKLKGHTSSILDLQFNPC-----FSEILASGSEDLTIRVWEIP 105 (568)
T ss_pred EeeCCCeEEEEEEcCCCCceeEEEeeecCCCceEEEEcCCCCCEEEEEEcCC-----CCCEEEEEeCCCeEEEEECC
Confidence 444566666644 2 3334334443222222221111 1356889988652 24689999999999999993
No 218
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=27.91 E-value=3.6e+02 Score=35.03 Aligned_cols=118 Identities=12% Similarity=0.077 Sum_probs=64.9
Q ss_pred EEEEEEecCCCCceEEEEeCC-EEEEEeecCC---------CCeEEEEEEEeeeeeeE--eeEEeeCCCCeeEEEEEecc
Q 000944 16 AAINGNFSGTKTPEIVVARGK-VLELLRPENS---------GRIETLVSTEIFGAIRS--LAQFRLTGSQKDYIVVGSDS 83 (1213)
Q Consensus 16 ~~v~~~f~~~~~~~LVv~k~~-~Levy~i~~~---------g~L~~v~~~~l~g~I~~--i~~~r~~~~~~d~L~v~~~~ 83 (1213)
.++.|.+.+++..+|....+. .+.||+..+. |....+....-+.+.++ -.+.....+.-|.++ .+-+
T Consensus 70 ~sv~CVR~S~dG~~lAsGSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~~~~l-vS~s 148 (942)
T KOG0973|consen 70 GSVNCVRFSPDGSYLASGSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPDDSLL-VSVS 148 (942)
T ss_pred CceeEEEECCCCCeEeeccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCCccEE-EEec
Confidence 467776666788889988865 5688876631 11222222221111110 000001122334444 4445
Q ss_pred ceEEEEEEeCCCCcEeEEe-eeeccccCcccccCCceEEECCCCCEEEEEecccceEEEEE
Q 000944 84 GRIVILEYNPSKNVFDKIH-QETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVL 143 (1213)
Q Consensus 84 ~~l~il~~d~~~~~~~tis-~~~~~~~g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~ 143 (1213)
..-+++-||..+.+..++- -|.-. ..=+..||-|+++|....++.++|+-.
T Consensus 149 ~DnsViiwn~~tF~~~~vl~~H~s~---------VKGvs~DP~Gky~ASqsdDrtikvwrt 200 (942)
T KOG0973|consen 149 LDNSVIIWNAKTFELLKVLRGHQSL---------VKGVSWDPIGKYFASQSDDRTLKVWRT 200 (942)
T ss_pred ccceEEEEccccceeeeeeeccccc---------ccceEECCccCeeeeecCCceEEEEEc
Confidence 6666777888765443331 12111 122689999999999999999888753
No 219
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid and, through this ability, to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=27.52 E-value=5.4e+02 Score=29.23 Aligned_cols=134 Identities=13% Similarity=0.080 Sum_probs=69.4
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEee-cCcceEeccccCeEEEEeCC------eEEEEecC
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQV-EGIPLALCQFQGRLLAGIGP------VLRLYDLG 970 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~~-~g~V~ai~~~~g~ll~~~g~------~l~i~~~~ 970 (1213)
..++++|-... + ...-.++.|+...+ +.+.+..... .-.-.+++.++++|.+.-|. .+..|++.
T Consensus 124 ~~iYv~GG~~~-~------~~~~~v~~yd~~~~--~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~~yd~~ 194 (323)
T TIGR03548 124 GTLYVGGGNRN-G------KPSNKSYLFNLETQ--EWFELPDFPGEPRVQPVCVKLQNELYVFGGGSNIAYTDGYKYSPK 194 (323)
T ss_pred CEEEEEeCcCC-C------ccCceEEEEcCCCC--CeeECCCCCCCCCCcceEEEECCEEEEEcCCCCccccceEEEecC
Confidence 36777776421 1 11345778887765 4444433211 11123445678888765442 35678888
Q ss_pred Cceeeceeeec--Cccce---EEEEEEeCCEEE-EeecCC-----------------------------------cEEEE
Q 000944 971 KKRLLRKCENK--LFPNT---IVSINTYRDRIY-VGDIQE-----------------------------------SFHFC 1009 (1213)
Q Consensus 971 ~~~l~~~~~~~--~~~~~---i~~l~~~~~~I~-vgD~~~-----------------------------------Sv~~l 1009 (1213)
..++...+... ..|.. ..++.+.++.|+ +|-..+ +=.++
T Consensus 195 ~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 274 (323)
T TIGR03548 195 KNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDLATMKDESLKGYKKEYFLKPPEWYNWNRKIL 274 (323)
T ss_pred CCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhhhhccchhhhhhHHHHhCCCccccCcCceEE
Confidence 77775554321 11221 223444456554 554321 12467
Q ss_pred EEeccCCeEEEeeccC-CCcceEEEEeecCCe
Q 000944 1010 KYRRDENQLYIFADDS-VPRWLTAAHHIDFDT 1040 (1213)
Q Consensus 1010 ~~~~~~~~l~~~a~D~-~~~~~~~~~~ld~~~ 1040 (1213)
.|+...++-..++.-+ .+|...++..+++.-
T Consensus 275 ~yd~~~~~W~~~~~~p~~~r~~~~~~~~~~~i 306 (323)
T TIGR03548 275 IYNVRTGKWKSIGNSPFFARCGAALLLTGNNI 306 (323)
T ss_pred EEECCCCeeeEcccccccccCchheEEECCEE
Confidence 7888777777776432 345444444555443
No 220
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=27.40 E-value=1.1e+03 Score=29.55 Aligned_cols=118 Identities=16% Similarity=0.236 Sum_probs=63.5
Q ss_pred ccEEEEEec---CCEEEEEEeCCEEEEEEEcc---CCCeEEeeeeccCc-ceEEEEeeecCCCceeeeEEEEEEeCCcEE
Q 000944 540 RTIVKVGSN---RLQVVIALSGGELIYFEVDM---TGQLLEVEKHEMSG-DVACLDIASVPEGRKRSRFLAVGSYDNTIR 612 (1213)
Q Consensus 540 ~~I~~as~~---~~~v~v~~s~~~l~~l~~~~---~~~l~~~~~~~l~~-~is~i~i~~~~~~~~~~~~l~v~~~~~~i~ 612 (1213)
..|++.... .+.=+|...-.+.++|+... +|.+..-.+..+.+ .+--|+..+ ...++++++.|..|+
T Consensus 548 ssITsvKFa~~gln~~MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp------~~k~v~t~cQDrnir 621 (1080)
T KOG1408|consen 548 SSITSVKFACNGLNRKMISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDP------TSKLVVTVCQDRNIR 621 (1080)
T ss_pred cceeEEEEeecCCceEEEeccCchhhheehhccccCceeccccccccccceEEEeeeCC------CcceEEEEecccceE
Confidence 456555443 34556654444666666543 23333322223322 234455543 478899999999999
Q ss_pred EEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 613 ILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 613 i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
||.+....... ..++.-+ .--++.=+.+ ....+||.....|-.|..|.+..
T Consensus 622 if~i~sgKq~k-~FKgs~~-~eG~lIKv~l-----------DPSgiY~atScsdktl~~~Df~s 672 (1080)
T KOG1408|consen 622 IFDIESGKQVK-SFKGSRD-HEGDLIKVIL-----------DPSGIYLATSCSDKTLCFVDFVS 672 (1080)
T ss_pred EEeccccceee-eeccccc-CCCceEEEEE-----------CCCccEEEEeecCCceEEEEecc
Confidence 99994332211 1111111 1112211122 23678999999999998887753
No 221
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=26.73 E-value=1.3e+03 Score=28.61 Aligned_cols=75 Identities=9% Similarity=0.051 Sum_probs=37.1
Q ss_pred cEEEEEec--CCE-EEEEEeCCEEEEEEEccCCC--eEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEE
Q 000944 541 TIVKVGSN--RLQ-VVIALSGGELIYFEVDMTGQ--LLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILS 615 (1213)
Q Consensus 541 ~I~~as~~--~~~-v~v~~s~~~l~~l~~~~~~~--l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~s 615 (1213)
.|+++..+ +++ ++..+.+|++++..+...+. ...+....-.+...|.++.=+.+ ... .=++.++.||.|..|.
T Consensus 244 ~v~~~~f~p~~p~ll~gG~y~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~-~~~-~~f~s~ssDG~i~~W~ 321 (555)
T KOG1587|consen 244 EVTCLKFCPFDPNLLAGGCYNGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQN-EHN-TEFFSLSSDGSICSWD 321 (555)
T ss_pred ceeEEEeccCCcceEEeeccCceEEEEEccCCCCCCCcccccccccCCcCeEEEEEecc-CCC-CceEEEecCCcEeeee
Confidence 47777766 444 44456788998886653221 11111112223333433322211 112 2244555599999887
Q ss_pred eC
Q 000944 616 LD 617 (1213)
Q Consensus 616 l~ 617 (1213)
++
T Consensus 322 ~~ 323 (555)
T KOG1587|consen 322 TD 323 (555)
T ss_pred cc
Confidence 63
No 222
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=26.45 E-value=9.3e+02 Score=26.81 Aligned_cols=91 Identities=16% Similarity=0.177 Sum_probs=58.1
Q ss_pred EEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEE
Q 000944 861 CIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKT 940 (1213)
Q Consensus 861 ~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~ 940 (1213)
+++++|.+|.++...|.|+ |.|++-..-++. -..-+|++||.-. .+-++++..+ ......+
T Consensus 125 tlKVWDtnTlQ~a~~F~me--~~VYshamSp~a--~sHcLiA~gtr~~------------~VrLCDi~SG--s~sH~Ls- 185 (397)
T KOG4283|consen 125 TLKVWDTNTLQEAVDFKME--GKVYSHAMSPMA--MSHCLIAAGTRDV------------QVRLCDIASG--SFSHTLS- 185 (397)
T ss_pred eEEEeecccceeeEEeecC--ceeehhhcChhh--hcceEEEEecCCC------------cEEEEeccCC--cceeeec-
Confidence 8999999999888888887 466553332232 2347888888532 2557888776 2222211
Q ss_pred eecCcceEecc--ccCeEEEEeC--CeEEEEecC
Q 000944 941 QVEGIPLALCQ--FQGRLLAGIG--PVLRLYDLG 970 (1213)
Q Consensus 941 ~~~g~V~ai~~--~~g~ll~~~g--~~l~i~~~~ 970 (1213)
-.++.|.|+.- -.+++++.-+ ..+++|++.
T Consensus 186 GHr~~vlaV~Wsp~~e~vLatgsaDg~irlWDiR 219 (397)
T KOG4283|consen 186 GHRDGVLAVEWSPSSEWVLATGSADGAIRLWDIR 219 (397)
T ss_pred cccCceEEEEeccCceeEEEecCCCceEEEEEee
Confidence 24677777754 3567766443 478899885
No 223
>PHA02790 Kelch-like protein; Provisional
Probab=26.32 E-value=1.2e+03 Score=28.19 Aligned_cols=91 Identities=12% Similarity=0.083 Sum_probs=58.4
Q ss_pred eEeccccCeEEEEeC-----CeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEE-eecCCcEEEEEEeccCCeEEE
Q 000944 947 LALCQFQGRLLAGIG-----PVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYV-GDIQESFHFCKYRRDENQLYI 1020 (1213)
Q Consensus 947 ~ai~~~~g~ll~~~g-----~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~v-gD~~~Sv~~l~~~~~~~~l~~ 1020 (1213)
.+.+.++|+|.+.-| +.+..|+....++...+... .+-.-.++.+.++.|+| |- . .-.|+++.++=..
T Consensus 356 ~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~~~m~-~~r~~~~~~~~~~~IYv~GG---~--~e~ydp~~~~W~~ 429 (480)
T PHA02790 356 PAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFGPSTY-YPHYKSCALVFGRRLFLVGR---N--AEFYCESSNTWTL 429 (480)
T ss_pred cEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeCCCCC-CccccceEEEECCEEEEECC---c--eEEecCCCCcEeE
Confidence 355667888766444 34667888877777665544 23334456677888865 42 2 3457777788888
Q ss_pred eeccCCCcceEEEEeecCCeeee
Q 000944 1021 FADDSVPRWLTAAHHIDFDTMAG 1043 (1213)
Q Consensus 1021 ~a~D~~~~~~~~~~~ld~~~~l~ 1043 (1213)
++.=..+|.-..+..+++.-+++
T Consensus 430 ~~~m~~~r~~~~~~v~~~~IYvi 452 (480)
T PHA02790 430 IDDPIYPRDNPELIIVDNKLLLI 452 (480)
T ss_pred cCCCCCCccccEEEEECCEEEEE
Confidence 87666677777767777665443
No 224
>KOG3679 consensus Predicted coiled-coil protein [General function prediction only]
Probab=25.93 E-value=3.6e+02 Score=30.28 Aligned_cols=62 Identities=10% Similarity=0.105 Sum_probs=40.8
Q ss_pred CeeEEEEecCccEEEEEeCCeEEEEecCccccceeeccccCCCC--ceEEEEeCCeEEEEEEcc
Q 000944 703 GRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV--EGVVSVAGNALRVFTIER 764 (1213)
Q Consensus 703 ~~~~v~~~g~~p~~i~~~~~~~~~~~~~~~~v~~~~~f~~~~~~--~~~i~~~~~~L~i~~l~~ 764 (1213)
..+.||+.|+|.++...++|.+++..--.=...++-|+++..-. +.++.-.+|.|.|.+=-.
T Consensus 211 sassvfvlgernffclkdngqirfmkrldwspscflpycsvsegtintlignhnnmlhiyqdvt 274 (802)
T KOG3679|consen 211 SASSVFVLGERNFFCLKDNGQIRFMKRLDWSPSCFLPYCSVSEGTINTLIGNHNNMLHIYQDVT 274 (802)
T ss_pred ccceEEEEeccceEEEccCCEEEEeeccCCCcccccccccccccchhhhhcCCCceEEeeeehh
Confidence 35689999999999999999998875433345677777664311 233333456666665443
No 225
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=25.26 E-value=9e+02 Score=26.25 Aligned_cols=44 Identities=23% Similarity=0.395 Sum_probs=29.4
Q ss_pred EEEEe---CCeEEEEEEccCCCeeEEEEEeCCCccceeeecCCCceEEEE
Q 000944 749 VVSVA---GNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVII 795 (1213)
Q Consensus 749 ~i~~~---~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~~~~~~~~v~ 795 (1213)
.++++ ++.+.+..+.. .-.++.++.+..|+.++++|..+.+++.
T Consensus 86 ~l~~~~~~~~~l~~~d~~~---~~~~~~~~~~~~~~~~~~~~dg~~l~~~ 132 (300)
T TIGR03866 86 ILYIANEDDNLVTVIDIET---RKVLAEIPVGVEPEGMAVSPDGKIVVNT 132 (300)
T ss_pred EEEEEcCCCCeEEEEECCC---CeEEeEeeCCCCcceEEECCCCCEEEEE
Confidence 45543 35666666554 2346677777788999999988776655
No 226
>PF13412 HTH_24: Winged helix-turn-helix DNA-binding; PDB: 1I1G_B 2IA0_B 3I4P_A 2GQQ_A 2L4A_A 2CFX_B 2DBB_B 2EFO_A 2EFQ_A 2PN6_A ....
Probab=25.20 E-value=70 Score=24.69 Aligned_cols=38 Identities=13% Similarity=0.219 Sum_probs=28.4
Q ss_pred hhhhhcccCCHHHHHHHHHHcCCCHHHHHHHHHHHHhc
Q 000944 1174 DLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIRNK 1211 (1213)
Q Consensus 1174 dll~~fl~l~~~~q~~i~~~~~~~~~~i~~~l~~l~~~ 1211 (1213)
.+|....+-+.-.+.++|+.+|.+...+.+.|.+|.++
T Consensus 7 ~Il~~l~~~~~~t~~ela~~~~is~~tv~~~l~~L~~~ 44 (48)
T PF13412_consen 7 KILNYLRENPRITQKELAEKLGISRSTVNRYLKKLEEK 44 (48)
T ss_dssp HHHHHHHHCTTS-HHHHHHHHTS-HHHHHHHHHHHHHT
T ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHC
Confidence 34444455566789999999999999999999998764
No 227
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=25.18 E-value=2.2e+02 Score=32.10 Aligned_cols=66 Identities=17% Similarity=0.249 Sum_probs=40.5
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCc--------------eEEEEEEEeecCcceEeccccC-----eEEE
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGK--------------SLELLHKTQVEGIPLALCQFQG-----RLLA 958 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~--------------kl~~~~~~~~~g~V~ai~~~~g-----~ll~ 958 (1213)
.+++++|.. .||+.+|+=.+... .+..+-+.++.-.+..|..++. +|+.
T Consensus 37 Ge~LatGdk------------gGRVv~f~r~~~~~~ey~~~t~fqshepEFDYLkSleieEKinkIrw~~~~n~a~FLls 104 (433)
T KOG1354|consen 37 GERLATGDK------------GGRVVLFEREKLYKGEYNFQTEFQSHEPEFDYLKSLEIEEKINKIRWLDDGNLAEFLLS 104 (433)
T ss_pred cceEeecCC------------CCeEEEeecccccccceeeeeeeeccCcccchhhhhhhhhhhhhceecCCCCccEEEEe
Confidence 478888864 58888887554310 1222333445566666666632 5566
Q ss_pred EeCCeEEEEecCCceee
Q 000944 959 GIGPVLRLYDLGKKRLL 975 (1213)
Q Consensus 959 ~~g~~l~i~~~~~~~l~ 975 (1213)
+....+++|...++...
T Consensus 105 tNdktiKlWKi~er~~k 121 (433)
T KOG1354|consen 105 TNDKTIKLWKIRERGSK 121 (433)
T ss_pred cCCcceeeeeeeccccc
Confidence 66678999999765443
No 228
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=24.43 E-value=1.3e+03 Score=27.89 Aligned_cols=149 Identities=17% Similarity=0.281 Sum_probs=79.8
Q ss_pred cEEEEEecCCCCeEEEEccCCcceEEEEecCCCCcceEEEeeeCCCCCCceEEEEEecCceeEEEeccc--eeeecCCC-
Q 000944 420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIGET--VEEVSDSG- 496 (1213)
Q Consensus 420 ~lv~~sG~g~~GsL~~lr~gi~~~~~~~~~l~~~~~~iw~l~~~~~~~~~~~lvlS~~~~T~vl~~~~~--~~e~~~~g- 496 (1213)
.++.|| .+|.+.+|.+.-+.+...+..-..+..+-|.- +.|-++..||+ +.-.+.+|
T Consensus 77 ~~~i~s---~DGkf~il~k~~rVE~sv~AH~~A~~~gRW~~-----------------dGtgLlt~GEDG~iKiWSrsGM 136 (737)
T KOG1524|consen 77 TLLICS---NDGRFVILNKSARVERSISAHAAAISSGRWSP-----------------DGAGLLTAGEDGVIKIWSRSGM 136 (737)
T ss_pred eEEEEc---CCceEEEecccchhhhhhhhhhhhhhhcccCC-----------------CCceeeeecCCceEEEEeccch
Confidence 456665 46778888888777765544433222345632 22333333321 21122222
Q ss_pred ----ccCCCCeEEEEeecCC--eEEEEeCCcEEEEe--CCCceeeeeCCCCccEEEEEecCCEEEEEEeCCEEEEEEE-c
Q 000944 497 ----FLDTTPSLAVSLIGDD--SLMQVHPSGIRHIR--EDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEV-D 567 (1213)
Q Consensus 497 ----f~~~~~Tl~a~~~~~~--~ivQVT~~~i~l~~--~~~~~~~~~~~~~~~I~~as~~~~~v~v~~s~~~l~~l~~-~ 567 (1213)
+.-++..++|...+.+ .++-.-...+.+=. ...++..|+.-+| -|.+++-+...=+|+ ++|+-..|++ |
T Consensus 137 LRStl~Q~~~~v~c~~W~p~S~~vl~c~g~h~~IKpL~~n~k~i~WkAHDG-iiL~~~W~~~s~lI~-sgGED~kfKvWD 214 (737)
T KOG1524|consen 137 LRSTVVQNEESIRCARWAPNSNSIVFCQGGHISIKPLAANSKIIRWRAHDG-LVLSLSWSTQSNIIA-SGGEDFRFKIWD 214 (737)
T ss_pred HHHHHhhcCceeEEEEECCCCCceEEecCCeEEEeecccccceeEEeccCc-EEEEeecCcccccee-ecCCceeEEeec
Confidence 2335677888887543 25444444444422 2367888998555 367777665444444 5777666665 3
Q ss_pred cCCCeEEeeeeccCcceEEEEeee
Q 000944 568 MTGQLLEVEKHEMSGDVACLDIAS 591 (1213)
Q Consensus 568 ~~~~l~~~~~~~l~~~is~i~i~~ 591 (1213)
..|.. .+.+-.-++.|++++..+
T Consensus 215 ~~G~~-Lf~S~~~ey~ITSva~np 237 (737)
T KOG1524|consen 215 AQGAN-LFTSAAEEYAITSVAFNP 237 (737)
T ss_pred ccCcc-cccCChhccceeeeeecc
Confidence 23431 112333357789998864
No 229
>PLN02153 epithiospecifier protein
Probab=24.41 E-value=7e+02 Score=28.58 Aligned_cols=123 Identities=11% Similarity=0.056 Sum_probs=61.4
Q ss_pred ceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEEEEe---ecCcceEeccccCeEEEEeC-------------
Q 000944 898 GTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQ---VEGIPLALCQFQGRLLAGIG------------- 961 (1213)
Q Consensus 898 ~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~~~~---~~g~V~ai~~~~g~ll~~~g------------- 961 (1213)
..++++|.....+. ......-..+.+|+...+ +.+.+.... ..-.-.+++.++++|.+.-|
T Consensus 138 ~~iyv~GG~~~~~~-~~~~~~~~~v~~yd~~~~--~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~ 214 (341)
T PLN02153 138 NHVYVFGGVSKGGL-MKTPERFRTIEAYNIADG--KWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDY 214 (341)
T ss_pred CEEEEECCccCCCc-cCCCcccceEEEEECCCC--eEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccce
Confidence 46788887532110 001112235677887765 444433211 11111245567888765322
Q ss_pred --CeEEEEecCCceeeceeeecCcc--ceEEEEEEeCCEEE-EeecC------------CcEEEEEEeccCCeEEEeec
Q 000944 962 --PVLRLYDLGKKRLLRKCENKLFP--NTIVSINTYRDRIY-VGDIQ------------ESFHFCKYRRDENQLYIFAD 1023 (1213)
Q Consensus 962 --~~l~i~~~~~~~l~~~~~~~~~~--~~i~~l~~~~~~I~-vgD~~------------~Sv~~l~~~~~~~~l~~~a~ 1023 (1213)
+.+++|++...++.........| -...+..+.+++|+ +|-.. .+-.++.|+.+.++-..+..
T Consensus 215 ~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~~W~~~~~ 293 (341)
T PLN02153 215 ESNAVQFFDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETLVWEKLGE 293 (341)
T ss_pred ecCceEEEEcCCCcEEeccccCCCCCCcceeeeEEECCEEEEECcccCCccccccccccccccEEEEEcCccEEEeccC
Confidence 35788888877776554321122 23345556666654 55431 01246667766666655543
No 230
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=24.31 E-value=8.7e+02 Score=28.49 Aligned_cols=78 Identities=17% Similarity=0.208 Sum_probs=48.1
Q ss_pred CcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEE
Q 000944 581 SGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL 660 (1213)
Q Consensus 581 ~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~L 660 (1213)
+.++-|+++.+. ...+||.|..|++|.+|.++ +......+ +++.-..+.-+++.. .....|
T Consensus 272 ~~~vn~~~fnp~-----~~~ilAT~S~D~tV~LwDlR-nL~~~lh~---~e~H~dev~~V~WSP----------h~etvL 332 (422)
T KOG0264|consen 272 SAEVNCVAFNPF-----NEFILATGSADKTVALWDLR-NLNKPLHT---FEGHEDEVFQVEWSP----------HNETVL 332 (422)
T ss_pred CCceeEEEeCCC-----CCceEEeccCCCcEEEeech-hcccCcee---ccCCCcceEEEEeCC----------CCCcee
Confidence 467899999764 36788888889999999995 31121111 111111222233332 245566
Q ss_pred EEEeeCCeEEEEEEeCC
Q 000944 661 NAGLQNGVLFRTVVDMV 677 (1213)
Q Consensus 661 ligl~~G~l~~~~~~~~ 677 (1213)
-.+..||.|..+.+...
T Consensus 333 ASSg~D~rl~vWDls~i 349 (422)
T KOG0264|consen 333 ASSGTDRRLNVWDLSRI 349 (422)
T ss_pred EecccCCcEEEEecccc
Confidence 66778999999988643
No 231
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=24.23 E-value=1.2e+02 Score=23.63 Aligned_cols=41 Identities=15% Similarity=0.349 Sum_probs=29.8
Q ss_pred eEEEEEEEeeeeeeEeeEEeeCCCCeeEEEEEeccceEEEEEEeC
Q 000944 49 IETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNP 93 (1213)
Q Consensus 49 L~~v~~~~l~g~I~~i~~~r~~~~~~d~L~v~~~~~~l~il~~d~ 93 (1213)
++++.++.+-..|..+. ..| ..|.|.+.++++.+.+-++|-
T Consensus 2 f~~~~~k~l~~~v~~~~-w~P---~mdLiA~~t~~g~v~v~Rl~~ 42 (47)
T PF12894_consen 2 FRQLGEKNLPSRVSCMS-WCP---TMDLIALGTEDGEVLVYRLNW 42 (47)
T ss_pred cceecccCCCCcEEEEE-ECC---CCCEEEEEECCCeEEEEECCC
Confidence 45667777777766433 333 468999999999999988853
No 232
>PF15390 DUF4613: Domain of unknown function (DUF4613)
Probab=24.19 E-value=4.3e+02 Score=32.39 Aligned_cols=87 Identities=23% Similarity=0.377 Sum_probs=60.0
Q ss_pred CeEEEEEEccCCCeeEEEEEeCCC--ccceeeecCCCceEEEEEccCCCCCHHHHHHHHHHhhHhcCCCCCCCCCccccc
Q 000944 755 NALRVFTIERLGETFNETALPLRY--TPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQME 832 (1213)
Q Consensus 755 ~~L~i~~l~~~~~~~~~r~i~l~~--tp~~i~y~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 832 (1213)
..|.+.+++. .-...||+.+.+ +|.-|+|++..+..+|+...
T Consensus 316 ShLvLVtfe~--~VT~trKV~IPGILvPDliAfn~kaq~VAVASNT---------------------------------- 359 (671)
T PF15390_consen 316 SHLVLVTFER--KVTTTRKVSIPGILVPDLIAFNPKAQVVAVASNT---------------------------------- 359 (671)
T ss_pred eeEEEEEeec--ceEEeeeeccccccccceeeeCCcCCEEEEEecC----------------------------------
Confidence 3566666665 245678888876 99999999999999998521
Q ss_pred CCCCCCCCCCCCccccCCCCCCCCceeeEEEEEe--CCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeee
Q 000944 833 NGDDENKYDPLSDEQYGYPKAESDKWVSCIRVLD--PRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAK 907 (1213)
Q Consensus 833 ~~~~~~~~~~~~~~~~~~p~~~~~~~~s~i~l~d--~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~ 907 (1213)
...|.+++ +.++--+..++++.+|++-.+|..+ ++-=+|.||--+
T Consensus 360 --------------------------cn~ilVYSv~~s~mPniQqIqLe~~ERPKGiCFlt----dklLLilVGkqK 406 (671)
T PF15390_consen 360 --------------------------CNIILVYSVTPSSMPNIQQIQLESNERPKGICFLT----DKLLLILVGKQK 406 (671)
T ss_pred --------------------------CcEEEEEEeccccCCCeeEEEcccCCCCceeeEcc----CCeEEEEecccc
Confidence 01233332 3456667889999999999988743 344566777554
No 233
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated with ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in the coatomer alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=24.14 E-value=4.4e+02 Score=31.69 Aligned_cols=57 Identities=18% Similarity=0.305 Sum_probs=31.9
Q ss_pred cCeEEEEeCC-eEEEEecCCceeeceeeecCccceEEEEEEe--CCEEEEeecCCcEEEEEEecc
Q 000944 953 QGRLLAGIGP-VLRLYDLGKKRLLRKCENKLFPNTIVSINTY--RDRIYVGDIQESFHFCKYRRD 1014 (1213)
Q Consensus 953 ~g~ll~~~g~-~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~--~~~I~vgD~~~Sv~~l~~~~~ 1014 (1213)
.|+|+...++ .|.+|+|...++++... ++ .+..+... ++++.+.. .+++.+++|+.+
T Consensus 116 ~G~LL~~~~~~~i~~yDw~~~~~i~~i~---v~-~vk~V~Ws~~g~~val~t-~~~i~il~~~~~ 175 (443)
T PF04053_consen 116 GGNLLGVKSSDFICFYDWETGKLIRRID---VS-AVKYVIWSDDGELVALVT-KDSIYILKYNLE 175 (443)
T ss_dssp -SSSEEEEETTEEEEE-TTT--EEEEES---S--E-EEEEE-TTSSEEEEE--S-SEEEEEE-HH
T ss_pred cCcEEEEECCCCEEEEEhhHcceeeEEe---cC-CCcEEEEECCCCEEEEEe-CCeEEEEEecch
Confidence 3777665555 49999999888866542 22 24455544 55666553 678889888765
No 234
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=24.08 E-value=1.2e+03 Score=27.23 Aligned_cols=29 Identities=10% Similarity=0.245 Sum_probs=20.4
Q ss_pred eeEEEEEeCCCccceeeecCCCceEEEEE
Q 000944 768 TFNETALPLRYTPRRFVLQPKKKLMVIIE 796 (1213)
Q Consensus 768 ~~~~r~i~l~~tp~~i~y~~~~~~~~v~~ 796 (1213)
.+.++.++.+..|+-..+.|..+.|+++.
T Consensus 162 ~~~~~~i~~g~~~~D~~~dpdgry~~va~ 190 (369)
T PF02239_consen 162 NLKVTTIKVGRFPHDGGFDPDGRYFLVAA 190 (369)
T ss_dssp CEEEEEEE--TTEEEEEE-TTSSEEEEEE
T ss_pred ccceeeecccccccccccCcccceeeecc
Confidence 46678888888889888888887777764
No 235
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. The archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=23.69 E-value=2.7e+02 Score=20.24 Aligned_cols=36 Identities=17% Similarity=0.207 Sum_probs=24.7
Q ss_pred eEEEEe---CCeEEEEEEccCCCeeEEEEEeCCCccceeeec
Q 000944 748 GVVSVA---GNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786 (1213)
Q Consensus 748 ~~i~~~---~~~L~i~~l~~~~~~~~~r~i~l~~tp~~i~y~ 786 (1213)
..+|++ .+++.+ ++. ......++++++..|+.++++
T Consensus 4 ~~lyv~~~~~~~v~~--id~-~~~~~~~~i~vg~~P~~i~~~ 42 (42)
T TIGR02276 4 TKLYVTNSGSNTVSV--IDT-ATNKVIATIPVGGYPFGVAVS 42 (42)
T ss_pred CEEEEEeCCCCEEEE--EEC-CCCeEEEEEECCCCCceEEeC
Confidence 456664 356665 443 245678899999999999864
No 236
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=23.31 E-value=2.9e+02 Score=31.98 Aligned_cols=135 Identities=16% Similarity=0.178 Sum_probs=72.7
Q ss_pred CCCCceEEEEeCCEEEEEeecCCCCeEEEEEEEeeeeeeEeeEEeeC-CCCeeEEEEEec--cceEEEEEEeCCCCcEeE
Q 000944 24 GTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLT-GSQKDYIVVGSD--SGRIVILEYNPSKNVFDK 100 (1213)
Q Consensus 24 ~~~~~~LVv~k~~~Levy~i~~~g~L~~v~~~~l~g~I~~i~~~r~~-~~~~d~L~v~~~--~~~l~il~~d~~~~~~~t 100 (1213)
+++..-|+-...+..+||..+ +| ..+++..-+++=..++.+|+. ...++.|.+.+. .++-+.+ |+-...
T Consensus 195 S~dgk~lasig~d~~~VW~~~-~g--~~~a~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~-~~~~~w---- 266 (398)
T KOG0771|consen 195 SPDGKFLASIGADSARVWSVN-TG--AALARKTPFSKDEMFSSCRFSVDNAQETLRLAASQFPGGGVRL-CDISLW---- 266 (398)
T ss_pred CCCCcEEEEecCCceEEEEec-cC--chhhhcCCcccchhhhhceecccCCCceEEEEEecCCCCceeE-EEeeee----
Confidence 356667888888899999887 32 444444446655556666653 222255555433 2222222 111000
Q ss_pred Eeeeecccc--CcccccCCceEEECCCCCEEEEEecccceEEEEEecCCCCceeeeccccccccccEEEEeeeec
Q 000944 101 IHQETFGKS--GCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGID 173 (1213)
Q Consensus 101 is~~~~~~~--g~~~~~~~~~l~VDP~~r~ia~~~~~~~~~v~~~~~~~~~~~~~~~p~e~~~~~~~i~~~~fl~ 173 (1213)
+-..+.+. -..+.....-++|+++|+++|+...+|..+|+...+=. .+.+ .| ++|- .+|..+.|+.
T Consensus 267 -~~~~~l~~~~~~~~~~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq--~~~~-vk-~aH~--~~VT~ltF~P 334 (398)
T KOG0771|consen 267 -SGSNFLRLRKKIKRFKSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQ--RLQY-VK-EAHL--GFVTGLTFSP 334 (398)
T ss_pred -ccccccchhhhhhccCcceeEEEcCCCcEEEEeccCCcEEEEEeceee--eeEe-eh-hhhe--eeeeeEEEcC
Confidence 00011110 01122335678999999999999999999888653211 0111 12 3333 3677888875
No 237
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=22.74 E-value=2.3e+02 Score=33.26 Aligned_cols=88 Identities=14% Similarity=0.172 Sum_probs=53.0
Q ss_pred eeEEEEEeCCCCceEEEEEcCCCceEEEEEEEEeccCCCceEEEEEeeecCccCCCCCCcccEEEEEEEEeCCceEEEEE
Q 000944 859 VSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLH 938 (1213)
Q Consensus 859 ~s~i~l~d~~~~~~~~~~~~~~~E~v~s~~~~~l~~~~~~~~i~VGT~~~~~~~~e~~~~~Gri~v~~i~~~~~kl~~~~ 938 (1213)
.+++-|++|.+.+.+-.+-.-. -.+.+ +.+. ..-.+.+-|+... ++-||++. .++.++
T Consensus 272 nGtVSlWSP~skePLvKiLcH~-g~V~s---iAv~---~~G~YMaTtG~Dr-----------~~kIWDlR----~~~ql~ 329 (545)
T KOG1272|consen 272 NGTVSLWSPNSKEPLVKILCHR-GPVSS---IAVD---RGGRYMATTGLDR-----------KVKIWDLR----NFYQLH 329 (545)
T ss_pred CceEEecCCCCcchHHHHHhcC-CCcce---EEEC---CCCcEEeeccccc-----------ceeEeeec----cccccc
Confidence 3578889988776543221111 12222 3333 2223444555432 46789886 455777
Q ss_pred EEeecCcceEeccc-cCeEEEEeCCeEEEEe
Q 000944 939 KTQVEGIPLALCQF-QGRLLAGIGPVLRLYD 968 (1213)
Q Consensus 939 ~~~~~g~V~ai~~~-~g~ll~~~g~~l~i~~ 968 (1213)
+...+-+...+.-- .|.|+++.|.-+.+|.
T Consensus 330 t~~tp~~a~~ls~SqkglLA~~~G~~v~iw~ 360 (545)
T KOG1272|consen 330 TYRTPHPASNLSLSQKGLLALSYGDHVQIWK 360 (545)
T ss_pred eeecCCCccccccccccceeeecCCeeeeeh
Confidence 76667777776554 5778889999999985
No 238
>COG5276 Uncharacterized conserved protein [Function unknown]
Probab=22.68 E-value=1.1e+03 Score=26.40 Aligned_cols=141 Identities=13% Similarity=0.124 Sum_probs=78.4
Q ss_pred cEEEEEecCCEEEEEEeCCEEEEEEEccCCCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC-CC
Q 000944 541 TIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD-PD 619 (1213)
Q Consensus 541 ~I~~as~~~~~v~v~~s~~~l~~l~~~~~~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~-p~ 619 (1213)
-+..+.+.++|+.++-...-|+.+.+..-.+-.+...+..+.-.-|.++ ..+|+.|+.|+.-+.++.+. |.
T Consensus 88 l~~Dv~vse~yvyvad~ssGL~IvDIS~P~sP~~~~~lnt~gyaygv~v--------sGn~aYVadlddgfLivdvsdps 159 (370)
T COG5276 88 LFADVRVSEEYVYVADWSSGLRIVDISTPDSPTLIGFLNTDGYAYGVYV--------SGNYAYVADLDDGFLIVDVSDPS 159 (370)
T ss_pred hhheeEecccEEEEEcCCCceEEEeccCCCCcceeccccCCceEEEEEe--------cCCEEEEeeccCcEEEEECCCCC
Confidence 3667788899999986666788887753233333444444443445555 47899999997667777772 22
Q ss_pred CceeEeEEeecCCC-CceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeCCCCcccccceeeecCCCCeEEE
Q 000944 620 DCMQILSVQSVSSP-PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFS 698 (1213)
Q Consensus 620 ~~l~~~~~~~l~~~-p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~~~~~l~~~~~~~lG~~pv~l~~ 698 (1213)
+. ......+++.- .+.+.+ ...+-++.-+||-|....+... ..|+-+.+
T Consensus 160 sP-~lagrya~~~~d~~~v~I----------------SGn~AYvA~~d~GL~ivDVSnp-------------~sPvli~~ 209 (370)
T COG5276 160 SP-QLAGRYALPGGDTHDVAI----------------SGNYAYVAWRDGGLTIVDVSNP-------------HSPVLIGS 209 (370)
T ss_pred Cc-eeeeeeccCCCCceeEEE----------------ecCeEEEEEeCCCeEEEEccCC-------------CCCeEEEE
Confidence 11 11111122111 111111 2356677777777755544211 12444444
Q ss_pred EEEC-CeeEEEEecCccEEEEE
Q 000944 699 VVVG-GRAAMLCLSSRPWLGYI 719 (1213)
Q Consensus 699 ~~~~-~~~~v~~~g~~p~~i~~ 719 (1213)
.+.+ +...+++..+|-++.--
T Consensus 210 ~n~g~g~~sv~vsdnr~y~vvy 231 (370)
T COG5276 210 YNTGPGTYSVSVSDNRAYLVVY 231 (370)
T ss_pred EecCCceEEEEecCCeeEEEEc
Confidence 4554 45677777777776543
No 239
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=22.65 E-value=1.2e+03 Score=26.88 Aligned_cols=67 Identities=19% Similarity=0.280 Sum_probs=38.3
Q ss_pred eeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCCCC-ceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEeC
Q 000944 598 RSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPP-ESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDM 676 (1213)
Q Consensus 598 ~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~~p-~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~~ 676 (1213)
.+.|++.|++||.++||++... ....+.-. +-| .+..-+.- ......++.+..|-.+.-|.++.
T Consensus 114 ~~~~IltgsYDg~~riWd~~Gk-~~~~~~Gh---t~~ik~v~~v~~-----------n~~~~~fvsas~Dqtl~Lw~~~~ 178 (423)
T KOG0313|consen 114 ASKWILTGSYDGTSRIWDLKGK-SIKTIVGH---TGPIKSVAWVIK-----------NSSSCLFVSASMDQTLRLWKWNV 178 (423)
T ss_pred cCceEEEeecCCeeEEEecCCc-eEEEEecC---CcceeeeEEEec-----------CCccceEEEecCCceEEEEEecC
Confidence 3789999999999999999433 22222100 111 12211111 11223456677778888888876
Q ss_pred CCC
Q 000944 677 VTG 679 (1213)
Q Consensus 677 ~~~ 679 (1213)
...
T Consensus 179 ~~~ 181 (423)
T KOG0313|consen 179 GEN 181 (423)
T ss_pred chh
Confidence 543
No 240
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=22.48 E-value=1.2e+03 Score=26.85 Aligned_cols=137 Identities=12% Similarity=0.042 Sum_probs=76.5
Q ss_pred cccEEEEEEEEeCCceEEEEEEEeecCcceEeccc--cCeEEE--EeCCeEEEEecCCce----eeceeeecCccceEEE
Q 000944 918 VAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF--QGRLLA--GIGPVLRLYDLGKKR----LLRKCENKLFPNTIVS 989 (1213)
Q Consensus 918 ~~Gri~v~~i~~~~~kl~~~~~~~~~g~V~ai~~~--~g~ll~--~~g~~l~i~~~~~~~----l~~~~~~~~~~~~i~~ 989 (1213)
|++-|++++...++|+.....-.-....|--++-- ...+++ ++-..|.||+..... +..+| +++.|.-
T Consensus 232 c~~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~DgsIrIWDiRs~~~~~~~~~kA----h~sDVNV 307 (440)
T KOG0302|consen 232 CVKGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGSIRIWDIRSGPKKAAVSTKA----HNSDVNV 307 (440)
T ss_pred cccceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecCceEEEEEecCCCccceeEeec----cCCceee
Confidence 56778888888876665544333334444444442 233444 344689999987542 33333 5555555
Q ss_pred EEEe--CCEEEEeecCCcEEEEEEeccCCeEEEee-ccCCCcceEEEEe--ecCCeeeeecCCCcEEEEecCCCC
Q 000944 990 INTY--RDRIYVGDIQESFHFCKYRRDENQLYIFA-DDSVPRWLTAAHH--IDFDTMAGADKFGNIYFVRLPQDV 1059 (1213)
Q Consensus 990 l~~~--~~~I~vgD~~~Sv~~l~~~~~~~~l~~~a-~D~~~~~~~~~~~--ld~~~~l~~D~~gnl~il~~~~~~ 1059 (1213)
|+.. -++|+-||=---+.+...+.-... ..+| -+++..-+|+++. .+.+.++++-.+..+.+..+.-+.
T Consensus 308 ISWnr~~~lLasG~DdGt~~iwDLR~~~~~-~pVA~fk~Hk~pItsieW~p~e~s~iaasg~D~QitiWDlsvE~ 381 (440)
T KOG0302|consen 308 ISWNRREPLLASGGDDGTLSIWDLRQFKSG-QPVATFKYHKAPITSIEWHPHEDSVIAASGEDNQITIWDLSVEA 381 (440)
T ss_pred EEccCCcceeeecCCCceEEEEEhhhccCC-CcceeEEeccCCeeEEEeccccCceEEeccCCCcEEEEEeeccC
Confidence 5543 457778876666666544321111 1222 2233444667764 455567777777778887765443
No 241
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=22.31 E-value=2.5e+02 Score=31.59 Aligned_cols=105 Identities=16% Similarity=0.220 Sum_probs=67.0
Q ss_pred ecCcceEeccccCeEEE-Ee-CCeEEEEecCCceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeEE
Q 000944 942 VEGIPLALCQFQGRLLA-GI-GPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLY 1019 (1213)
Q Consensus 942 ~~g~V~ai~~~~g~ll~-~~-g~~l~i~~~~~~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l~ 1019 (1213)
..|.|.|+. |..++++ +. ..++.+|+|...+.++.-+.. --.+..+...+++++-.---+|+.+.+.....
T Consensus 236 HtGSVLCLq-yd~rviisGSSDsTvrvWDv~tge~l~tlihH--ceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps---- 308 (499)
T KOG0281|consen 236 HTGSVLCLQ-YDERVIVSGSSDSTVRVWDVNTGEPLNTLIHH--CEAVLHLRFSNGYMVTCSKDRSIAVWDMASPT---- 308 (499)
T ss_pred CCCcEEeee-ccceEEEecCCCceEEEEeccCCchhhHHhhh--cceeEEEEEeCCEEEEecCCceeEEEeccCch----
Confidence 568888765 5556555 33 358999999988776665443 24688899999999999999999997764311
Q ss_pred EeeccCCCcce-----EEEEeecCC--eeeeecCCCcEEEEecCC
Q 000944 1020 IFADDSVPRWL-----TAAHHIDFD--TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1020 ~~a~D~~~~~~-----~~~~~ld~~--~~l~~D~~gnl~il~~~~ 1057 (1213)
|-..+.| .++..+|.+ .|+.+..+..|.+.....
T Consensus 309 ----~it~rrVLvGHrAaVNvVdfd~kyIVsASgDRTikvW~~st 349 (499)
T KOG0281|consen 309 ----DITLRRVLVGHRAAVNVVDFDDKYIVSASGDRTIKVWSTST 349 (499)
T ss_pred ----HHHHHHHHhhhhhheeeeccccceEEEecCCceEEEEeccc
Confidence 1111111 122334333 456666677777766543
No 242
>PF13545 HTH_Crp_2: Crp-like helix-turn-helix domain; PDB: 3LA2_A 3LA3_B 3LA7_A 3B02_A 3E97_A 2H6C_B 1OMI_A 2BGC_H 2BEO_A 2GAU_A ....
Probab=22.29 E-value=1.1e+02 Score=26.28 Aligned_cols=26 Identities=23% Similarity=0.386 Sum_probs=23.6
Q ss_pred HHHHHHHHcCCCHHHHHHHHHHHHhc
Q 000944 1186 LQRKIADELDRTPGEILKKLEEIRNK 1211 (1213)
Q Consensus 1186 ~q~~i~~~~~~~~~~i~~~l~~l~~~ 1211 (1213)
.|+++|+.+|++++.+.+.|.+|+++
T Consensus 30 t~~~iA~~~g~sr~tv~r~l~~l~~~ 55 (76)
T PF13545_consen 30 TQEEIADMLGVSRETVSRILKRLKDE 55 (76)
T ss_dssp SHHHHHHHHTSCHHHHHHHHHHHHHT
T ss_pred CHHHHHHHHCCCHHHHHHHHHHHHHC
Confidence 47899999999999999999999875
No 243
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=21.71 E-value=1.6e+03 Score=27.91 Aligned_cols=114 Identities=15% Similarity=0.085 Sum_probs=64.8
Q ss_pred eecCcceEecc-ccCeEEEEeC---CeEEEEecCCc--eee------ceeeec-CccceEEEEEEe--CCEEEEeecCCc
Q 000944 941 QVEGIPLALCQ-FQGRLLAGIG---PVLRLYDLGKK--RLL------RKCENK-LFPNTIVSINTY--RDRIYVGDIQES 1005 (1213)
Q Consensus 941 ~~~g~V~ai~~-~~g~ll~~~g---~~l~i~~~~~~--~l~------~~~~~~-~~~~~i~~l~~~--~~~I~vgD~~~S 1005 (1213)
+.+.-|.|++. .++.-++|.| .+|++|++... +++ ..+... +.-..+.++... +..|+-|+..+=
T Consensus 115 ~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgtek~ 194 (735)
T KOG0308|consen 115 THKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTEKD 194 (735)
T ss_pred cccchheeeeecccCceeEEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCcccc
Confidence 45677888877 4444444443 58999999733 222 112111 111123344443 447888899988
Q ss_pred EEEEEEeccCCeEEEeeccCCCcceEEEEeecCC-eeeeecCCCcEEEEecCC
Q 000944 1006 FHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD-TMAGADKFGNIYFVRLPQ 1057 (1213)
Q Consensus 1006 v~~l~~~~~~~~l~~~a~D~~~~~~~~~~~ld~~-~~l~~D~~gnl~il~~~~ 1057 (1213)
+-++.-+.-...+.+.| +.-.|-++-..|++ +++.+-.+|-|.+..+.+
T Consensus 195 lr~wDprt~~kimkLrG---HTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgq 244 (735)
T KOG0308|consen 195 LRLWDPRTCKKIMKLRG---HTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQ 244 (735)
T ss_pred eEEeccccccceeeeec---cccceEEEEEcCCCCeEeecCCCceEEeeeccc
Confidence 88854443334444433 22234444444544 688899999999987754
No 244
>PF08309 LVIVD: LVIVD repeat; InterPro: IPR013211 This repeat is found in bacterial and archaeal cell surface proteins, many of which are hypothetical. The secondary structure corresponding to this repeat is predicted to comprise 4 beta-strands, which may associate to form a beta-propeller. The repeat copy number varies from 2-14. This repeat is sometimes found with the PKD domain IPR000601 from INTERPRO.
Probab=21.57 E-value=3e+02 Score=20.89 Aligned_cols=27 Identities=11% Similarity=0.146 Sum_probs=23.5
Q ss_pred eEEEEEEeCCEEEEeecCCcEEEEEEe
Q 000944 986 TIVSINTYRDRIYVGDIQESFHFCKYR 1012 (1213)
Q Consensus 986 ~i~~l~~~~~~I~vgD~~~Sv~~l~~~ 1012 (1213)
.+..+.+.+|+.+++|-..++.++...
T Consensus 3 ~a~~v~v~g~yaYva~~~~Gl~IvDIS 29 (42)
T PF08309_consen 3 DARDVAVSGNYAYVADGNNGLVIVDIS 29 (42)
T ss_pred eEEEEEEECCEEEEEeCCCCEEEEECC
Confidence 467889999999999999999997763
No 245
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 repeat from INTERPRO.; PDB: 3QQZ_A.
Probab=21.55 E-value=1.1e+03 Score=25.85 Aligned_cols=191 Identities=15% Similarity=0.184 Sum_probs=78.9
Q ss_pred cCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeEeEEeecCC--CCceeEEEEeecccCCCCCCCCCCc
Q 000944 580 MSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSS--PPESLLFLEVQASVGGEDGADHPAS 657 (1213)
Q Consensus 580 l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~~~~~~l~~--~p~Sl~~~~~~~~~~~~~~~~~~~~ 657 (1213)
+..++|.|+..+. ...+.+|.-..+.|..++++ ...++. -++.. -++.+.++. +.
T Consensus 20 ~~~e~SGLTy~pd-----~~tLfaV~d~~~~i~els~~-G~vlr~---i~l~g~~D~EgI~y~g--------------~~ 76 (248)
T PF06977_consen 20 ILDELSGLTYNPD-----TGTLFAVQDEPGEIYELSLD-GKVLRR---IPLDGFGDYEGITYLG--------------NG 76 (248)
T ss_dssp --S-EEEEEEETT-----TTEEEEEETTTTEEEEEETT---EEEE---EE-SS-SSEEEEEE-S--------------TT
T ss_pred ccCCccccEEcCC-----CCeEEEEECCCCEEEEEcCC-CCEEEE---EeCCCCCCceeEEEEC--------------CC
Confidence 3467999998752 23445554446666656652 222222 23322 355554421 34
Q ss_pred eEEEEEeeCCeEEEEEEeCCCCccccc--ceeeecCCCCe---EE--EEEECCeeEEEEecCccEEEEEeCC---eEEEE
Q 000944 658 LFLNAGLQNGVLFRTVVDMVTGQLSDS--RSRFLGLRPPK---LF--SVVVGGRAAMLCLSSRPWLGYIHRG---RFLLT 727 (1213)
Q Consensus 658 ~~Lligl~~G~l~~~~~~~~~~~l~~~--~~~~lG~~pv~---l~--~~~~~~~~~v~~~g~~p~~i~~~~~---~~~~~ 727 (1213)
.|++..=++|.|+.+.++.....+... ....+|..... +- .....++.-+++.-..|..+|.-++ ...+.
T Consensus 77 ~~vl~~Er~~~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~ 156 (248)
T PF06977_consen 77 RYVLSEERDQRLYIFTIDDDTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLF 156 (248)
T ss_dssp EEEEEETTTTEEEEEEE----TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--E
T ss_pred EEEEEEcCCCcEEEEEEeccccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEccccCcccee
Confidence 566666668999988886543332211 12223322000 11 1111233444455566766665433 11111
Q ss_pred ecCcc-------ccceeeccccCCCCceEEEEeCCeEEEEEEccCCCeeEEEEEeCCC----------ccceeeecCCCc
Q 000944 728 PLSYE-------TLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRY----------TPRRFVLQPKKK 790 (1213)
Q Consensus 728 ~~~~~-------~v~~~~~f~~~~~~~~~i~~~~~~L~i~~l~~~~~~~~~r~i~l~~----------tp~~i~y~~~~~ 790 (1213)
-.... .+..++.++-..-.+.++..++.+=++..++. +.-+.+.++|.. .|.-|++.++-+
T Consensus 157 ~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es~~l~~~d~--~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~ 234 (248)
T PF06977_consen 157 VSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDESRLLLELDR--QGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGN 234 (248)
T ss_dssp EEE-HHHH-HT--SS---EEEEETTTTEEEEEETTTTEEEEE-T--T--EEEEEE-STTGGG-SS---SEEEEEE-TT--
T ss_pred eccccccccccceeccccceEEcCCCCeEEEEECCCCeEEEECC--CCCEEEEEEeCCcccCcccccCCccEEEECCCCC
Confidence 11000 11111111111112345555543333455554 234566677765 799999999887
Q ss_pred eEEEE
Q 000944 791 LMVII 795 (1213)
Q Consensus 791 ~~~v~ 795 (1213)
+|+|.
T Consensus 235 LYIvs 239 (248)
T PF06977_consen 235 LYIVS 239 (248)
T ss_dssp EEEEE
T ss_pred EEEEc
Confidence 77775
No 246
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=20.99 E-value=7.2e+02 Score=30.95 Aligned_cols=66 Identities=14% Similarity=0.081 Sum_probs=46.6
Q ss_pred EEEEEEeCCEEEEEEEccC--CCeEEeeeeccCcceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeC
Q 000944 551 QVVIALSGGELIYFEVDMT--GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617 (1213)
Q Consensus 551 ~v~v~~s~~~l~~l~~~~~--~~l~~~~~~~l~~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~ 617 (1213)
|.+-+..|.++.+.+..++ ..+.+....+++..||++++.++.. ......+++|+..|.|.+|..+
T Consensus 632 ~FaTaSRDK~VkVW~~~~~~d~~i~~~a~~~~~~aVTAv~~~~~~~-~e~~~~vavGle~GeI~l~~~~ 699 (764)
T KOG1063|consen 632 YFATASRDKKVKVWEEPDLRDKYISRFACLKFSLAVTAVAYLPVDH-NEKGDVVAVGLEKGEIVLWRRK 699 (764)
T ss_pred eeEEecCCceEEEEeccCchhhhhhhhchhccCCceeeEEeecccc-ccccceEEEEecccEEEEEecc
Confidence 4555555677777776543 3344445567788899999887642 2345689999999999999974
No 247
>PF08220 HTH_DeoR: DeoR-like helix-turn-helix domain; InterPro: IPR001034 The deoR-type HTH domain is a DNA-binding, helix-turn-helix (HTH) domain of about 50-60 amino acids present in transcription regulators of the deoR family, involved in sugar catabolism. This family of prokaryotic regulators is named after the Escherichia coli protein DeoR, a repressor of the deo operon, which encodes nucleotide and deoxyribonucleotide catabolic enzymes. DeoR also negatively regulates the expression of nupG and tsx, a nucleoside-specific transport protein and a channel-forming protein, respectively. DeoR-like transcription repressors occur in diverse bacteria as regulators of sugar and nucleoside metabolic systems. The effector molecules for deoR-like regulators are generally phosphorylated intermediates of the relevant metabolic pathway. The DNA-binding deoR-type HTH domain occurs usually in the N-terminal part. The C-terminal part can contain an effector-binding domain and/or an oligomerisation domain. DeoR occurs as an octamer, whilst glpR and agaR are tetramers. Several operators may be bound simultaneously, which could facilitate DNA looping [, ].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular
Probab=20.93 E-value=1.2e+02 Score=24.68 Aligned_cols=25 Identities=16% Similarity=0.375 Sum_probs=22.5
Q ss_pred HHHHHHHcCCCHHHHHHHHHHHHhc
Q 000944 1187 QRKIADELDRTPGEILKKLEEIRNK 1211 (1213)
Q Consensus 1187 q~~i~~~~~~~~~~i~~~l~~l~~~ 1211 (1213)
-.++|+.+|+|...|.++|.+|.++
T Consensus 17 ~~ela~~~~VS~~TiRRDl~~L~~~ 41 (57)
T PF08220_consen 17 VKELAEEFGVSEMTIRRDLNKLEKQ 41 (57)
T ss_pred HHHHHHHHCcCHHHHHHHHHHHHHC
Confidence 3788999999999999999999875
No 248
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=20.83 E-value=9.5e+02 Score=29.11 Aligned_cols=81 Identities=17% Similarity=0.368 Sum_probs=45.3
Q ss_pred CCEEEEEEeCCEEEEEEEccCCC--eEEeeeeccC--cceEEEEeeecCCCceeeeEEEEEEeCCcEEEEEeCCCCceeE
Q 000944 549 RLQVVIALSGGELIYFEVDMTGQ--LLEVEKHEMS--GDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQI 624 (1213)
Q Consensus 549 ~~~v~v~~s~~~l~~l~~~~~~~--l~~~~~~~l~--~~is~i~i~~~~~~~~~~~~l~v~~~~~~i~i~sl~p~~~l~~ 624 (1213)
+.+-+|.++++.|..+...-.|. |.-.+.+++. ..-+|++-. .+.+++||.-+|.|++|+- +-.
T Consensus 394 ~e~TlvGLs~n~vfriDpRv~~~~kl~~~q~kqy~~k~nFsc~aTT-------~sG~IvvgS~~GdIRLYdr-----i~~ 461 (644)
T KOG2395|consen 394 SEQTLVGLSDNSVFRIDPRVQGKNKLAVVQSKQYSTKNNFSCFATT-------ESGYIVVGSLKGDIRLYDR-----IGR 461 (644)
T ss_pred ccccEEeecCCceEEecccccCcceeeeeeccccccccccceeeec-------CCceEEEeecCCcEEeehh-----hhh
Confidence 34556677787775543322232 2222334432 345677664 3668999999999999974 222
Q ss_pred eEEeecCCCCceeEEEE
Q 000944 625 LSVQSVSSPPESLLFLE 641 (1213)
Q Consensus 625 ~~~~~l~~~p~Sl~~~~ 641 (1213)
-....+|+....+.-+.
T Consensus 462 ~AKTAlPgLG~~I~hVd 478 (644)
T KOG2395|consen 462 RAKTALPGLGDAIKHVD 478 (644)
T ss_pred hhhhcccccCCceeeEE
Confidence 23345666555544444
No 249
>PF13404 HTH_AsnC-type: AsnC-type helix-turn-helix domain; PDB: 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2VC1_B 2QZ8_A 2W29_C 2IVM_B 2VBX_B ....
Probab=20.49 E-value=71 Score=24.21 Aligned_cols=35 Identities=9% Similarity=0.195 Sum_probs=22.8
Q ss_pred hhhhcccCCHHHHHHHHHHcCCCHHHHHHHHHHHH
Q 000944 1175 LCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIR 1209 (1213)
Q Consensus 1175 ll~~fl~l~~~~q~~i~~~~~~~~~~i~~~l~~l~ 1209 (1213)
+|..+..=+.---.+||+.+|+|...+..-+..|+
T Consensus 8 Il~~Lq~d~r~s~~~la~~lglS~~~v~~Ri~rL~ 42 (42)
T PF13404_consen 8 ILRLLQEDGRRSYAELAEELGLSESTVRRRIRRLE 42 (42)
T ss_dssp HHHHHHH-TTS-HHHHHHHHTS-HHHHHHHHHHHH
T ss_pred HHHHHHHcCCccHHHHHHHHCcCHHHHHHHHHHhC
Confidence 33333333334457899999999999999888774
No 250
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=20.19 E-value=1.3e+03 Score=28.83 Aligned_cols=68 Identities=16% Similarity=0.270 Sum_probs=38.7
Q ss_pred eeEEEEEEeCCcEEEEEeCCCC---ceeEeEEeecCCCCceeEEEEeecccCCCCCCCCCCceEEEEEeeCCeEEEEEEe
Q 000944 599 SRFLAVGSYDNTIRILSLDPDD---CMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVD 675 (1213)
Q Consensus 599 ~~~l~v~~~~~~i~i~sl~p~~---~l~~~~~~~l~~~p~Sl~~~~~~~~~~~~~~~~~~~~~~Lligl~~G~l~~~~~~ 675 (1213)
+++++.+..|.++.+|.. ++. .+..+...+.+...+.+.++.... ......+-+|++.|.++.|+..
T Consensus 630 e~~FaTaSRDK~VkVW~~-~~~~d~~i~~~a~~~~~~aVTAv~~~~~~~---------~e~~~~vavGle~GeI~l~~~~ 699 (764)
T KOG1063|consen 630 EKYFATASRDKKVKVWEE-PDLRDKYISRFACLKFSLAVTAVAYLPVDH---------NEKGDVVAVGLEKGEIVLWRRK 699 (764)
T ss_pred cceeEEecCCceEEEEec-cCchhhhhhhhchhccCCceeeEEeecccc---------ccccceEEEEecccEEEEEecc
Confidence 556888888999999998 543 222222122221111111111111 1234477899999999999875
Q ss_pred C
Q 000944 676 M 676 (1213)
Q Consensus 676 ~ 676 (1213)
.
T Consensus 700 ~ 700 (764)
T KOG1063|consen 700 R 700 (764)
T ss_pred c
Confidence 3
No 251
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=20.13 E-value=7.4e+02 Score=30.64 Aligned_cols=110 Identities=16% Similarity=0.251 Sum_probs=63.1
Q ss_pred cCcceEecccc-CeEEEEeC--CeEEEEecCC-ceeeceeeecCccceEEEEEEeCCEEEEeecCCcEEEEEEeccCCeE
Q 000944 943 EGIPLALCQFQ-GRLLAGIG--PVLRLYDLGK-KRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQL 1018 (1213)
Q Consensus 943 ~g~V~ai~~~~-g~ll~~~g--~~l~i~~~~~-~~l~~~~~~~~~~~~i~~l~~~~~~I~vgD~~~Sv~~l~~~~~~~~l 1018 (1213)
+.+||+++.-+ |.++++-| +.|.+|+... +++++..-.. -+..+.-+...+++++-|-.---|-+.....+.-..
T Consensus 171 k~siYSLA~N~t~t~ivsGgtek~lr~wDprt~~kimkLrGHT-dNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~ 249 (735)
T KOG0308|consen 171 KDSIYSLAMNQTGTIIVSGGTEKDLRLWDPRTCKKIMKLRGHT-DNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLA 249 (735)
T ss_pred ccceeeeecCCcceEEEecCcccceEEeccccccceeeeeccc-cceEEEEEcCCCCeEeecCCCceEEeeeccccceee
Confidence 45677777653 56777555 3688998873 3444432121 244455555667788877665556665443222222
Q ss_pred EE-eeccCCCcceEEEEeecCCeeeeecCCCcEEEEecC
Q 000944 1019 YI-FADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLP 1056 (1213)
Q Consensus 1019 ~~-~a~D~~~~~~~~~~~ld~~~~l~~D~~gnl~il~~~ 1056 (1213)
.. +.+|. .|...+. =+...+..+|++|||+.-.+.
T Consensus 250 T~~vH~e~--VWaL~~~-~sf~~vYsG~rd~~i~~Tdl~ 285 (735)
T KOG0308|consen 250 TYIVHKEG--VWALQSS-PSFTHVYSGGRDGNIYRTDLR 285 (735)
T ss_pred eEEeccCc--eEEEeeC-CCcceEEecCCCCcEEecccC
Confidence 22 23333 6655443 122467889999999987654
No 252
>PF08279 HTH_11: HTH domain; InterPro: IPR013196 Winged helix DNA-binding proteins share a related winged helix-turn-helix DNA-binding motif, where the "wings", or loops, are small beta-sheets. The winged helix motif consists of two wings (W1, W2), three alpha helices (H1, H2, H3) and three beta-sheets (S1, S2, S3) arranged in the order H1-S1-H2-H3-S2-W1-S3-W2. The DNA-recognition helix makes sequence-specific DNA contacts with the major groove of DNA, while the wings make different DNA contacts, often with the minor groove or the backbone of DNA. Several winged-helix proteins display an exposed patch of hydrophobic residues thought to mediate protein-protein interactions. This entry represents a subset of the winged helix domain superfamily which is predominantly found in bacterial proteins, though there are also some archaeal and eukaryotic examples. This domain is commonly found in the biotin (vitamin H) repressor protein BirA, which regulates transcription of the biotin operon. It is also found in other proteins including regulators of amino acid biosynthesis such as LysM, and regulators of carbohydrate metabolism such as LicR and FrvR.; PDB: 1HXD_B 2EWN_B 1BIA_A 1BIB_A 1J5Y_A 3V7S_A 3V7C_A 3RKW_A 3RIR_A 3RKX_A ....
Probab=20.03 E-value=1.3e+02 Score=23.99 Aligned_cols=26 Identities=23% Similarity=0.420 Sum_probs=21.8
Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHh
Q 000944 1185 DLQRKIADELDRTPGEILKKLEEIRN 1210 (1213)
Q Consensus 1185 ~~q~~i~~~~~~~~~~i~~~l~~l~~ 1210 (1213)
-...++|+++++|...|.++|.+|+.
T Consensus 16 it~~eLa~~l~vS~rTi~~~i~~L~~ 41 (55)
T PF08279_consen 16 ITAKELAEELGVSRRTIRRDIKELRE 41 (55)
T ss_dssp BEHHHHHHHCTS-HHHHHHHHHHHHH
T ss_pred cCHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 35688999999999999999999865
Done!